BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 018781
(350 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|224131910|ref|XP_002328138.1| predicted protein [Populus trichocarpa]
gi|222837653|gb|EEE76018.1| predicted protein [Populus trichocarpa]
Length = 349
Score = 613 bits (1582), Expect = e-173, Method: Compositional matrix adjust.
Identities = 287/343 (83%), Positives = 315/343 (91%), Gaps = 1/343 (0%)
Query: 8 KLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLH 67
K L+ SLF CS LAHDFSIVGYSPEHLTS+DKL+ELFESW+S HGK Y +EEKLH
Sbjct: 7 KTSFLTFFASLFVCSVLAHDFSIVGYSPEHLTSVDKLVELFESWISGHGKAYNSLEEKLH 66
Query: 68 RFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSY 127
RFE+FKENLKHIDQRNKEVTSYWLGLNEFAD+SHEEFK+K+LGL P+FP R++ S +FSY
Sbjct: 67 RFEVFKENLKHIDQRNKEVTSYWLGLNEFADLSHEEFKSKFLGLYPEFP-RKKSSEDFSY 125
Query: 128 RDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELID 187
RDV LPKS+DWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIV+GNLTSLSEQ+LID
Sbjct: 126 RDVVDLPKSIDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVAGNLTSLSEQQLID 185
Query: 188 CDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDV 247
CDTSFNNGCNGGLMDYAF++IV +GGLHKEEDYPYLMEEGTC++K+EEMEVVTISGY DV
Sbjct: 186 CDTSFNNGCNGGLMDYAFEFIVNNGGLHKEEDYPYLMEEGTCDEKREEMEVVTISGYHDV 245
Query: 248 PENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSD 307
P NDEQSLLKALAHQP+SVAI+ASG DFQFYSGGVF+GPCG +LDHGVAAVGYG S G D
Sbjct: 246 PRNDEQSLLKALAHQPLSVAIDASGRDFQFYSGGVFSGPCGTDLDHGVAAVGYGSSSGID 305
Query: 308 YIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLKK 350
YIIVKNSWGPKWGERGY+RMKRNTGKPEGLCGINKMAS P K+
Sbjct: 306 YIIVKNSWGPKWGERGYLRMKRNTGKPEGLCGINKMASYPTKQ 348
>gi|297802418|ref|XP_002869093.1| hypothetical protein ARALYDRAFT_491113 [Arabidopsis lyrata subsp.
lyrata]
gi|297314929|gb|EFH45352.1| hypothetical protein ARALYDRAFT_491113 [Arabidopsis lyrata subsp.
lyrata]
Length = 355
Score = 586 bits (1511), Expect = e-165, Method: Compositional matrix adjust.
Identities = 265/341 (77%), Positives = 307/341 (90%), Gaps = 1/341 (0%)
Query: 10 LLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRF 69
LL+++S S CS+LA DFSIVGY+PE LTS +KL+ELFESWMS+H K YK +EEK+HRF
Sbjct: 13 LLVAISASALLCSALARDFSIVGYTPEQLTSTEKLLELFESWMSEHSKVYKSVEEKVHRF 72
Query: 70 EIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGL-KPQFPTRRQPSAEFSYR 128
E+F+ENL HIDQRN E+ SYWLGLNEFAD++HEEFK +YLGL KPQF +RQPSA F YR
Sbjct: 73 EVFRENLMHIDQRNNEINSYWLGLNEFADLTHEEFKGRYLGLAKPQFSRKRQPSANFRYR 132
Query: 129 DVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDC 188
D+ LPKSVDWRKKGAV PVK+QG CGSCWAFSTVAAVEGINQI +GNL+SLSEQELIDC
Sbjct: 133 DITDLPKSVDWRKKGAVAPVKDQGQCGSCWAFSTVAAVEGINQITTGNLSSLSEQELIDC 192
Query: 189 DTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVP 248
DT+FN+GCNGGLMDYAF+YI+++GGLHKE+DYPYLMEEG C+++KE++E VTISGY+DVP
Sbjct: 193 DTTFNSGCNGGLMDYAFQYIISTGGLHKEDDYPYLMEEGICQEQKEDVERVTISGYEDVP 252
Query: 249 ENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDY 308
END++SL+KALAHQPVSVAIEASG DFQFY GGVF G CG +LDHGVAAVGYG SKGSDY
Sbjct: 253 ENDDESLVKALAHQPVSVAIEASGRDFQFYKGGVFNGQCGTDLDHGVAAVGYGSSKGSDY 312
Query: 309 IIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
+IVKNSWGP+WGE+G+IRMKRNTGKPEGLCGINKMAS P K
Sbjct: 313 VIVKNSWGPRWGEKGFIRMKRNTGKPEGLCGINKMASYPTK 353
>gi|18418684|ref|NP_567983.1| Xylem cysteine proteinase 1 [Arabidopsis thaliana]
gi|71153408|sp|O65493.1|XCP1_ARATH RecName: Full=Xylem cysteine proteinase 1; Short=AtXCP1; Flags:
Precursor
gi|6708181|gb|AAF25831.1|AF191027_1 papain-type cysteine endopeptidase XCP1 [Arabidopsis thaliana]
gi|3080415|emb|CAA18734.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|7270487|emb|CAB80252.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|26449881|dbj|BAC42063.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|28827736|gb|AAO50712.1| unknown protein [Arabidopsis thaliana]
gi|332661101|gb|AEE86501.1| Xylem cysteine proteinase 1 [Arabidopsis thaliana]
Length = 355
Score = 585 bits (1509), Expect = e-165, Method: Compositional matrix adjust.
Identities = 264/341 (77%), Positives = 306/341 (89%), Gaps = 1/341 (0%)
Query: 10 LLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRF 69
LL+++S S C + A DFSIVGY+PEHLT+ DKL+ELFESWMS+H K YK +EEK+HRF
Sbjct: 13 LLVAISASALLCCAFARDFSIVGYTPEHLTNTDKLLELFESWMSEHSKAYKSVEEKVHRF 72
Query: 70 EIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGL-KPQFPTRRQPSAEFSYR 128
E+F+ENL HIDQRN E+ SYWLGLNEFAD++HEEFK +YLGL KPQF +RQPSA F YR
Sbjct: 73 EVFRENLMHIDQRNNEINSYWLGLNEFADLTHEEFKGRYLGLAKPQFSRKRQPSANFRYR 132
Query: 129 DVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDC 188
D+ LPKSVDWRKKGAV PVK+QG CGSCWAFSTVAAVEGINQI +GNL+SLSEQELIDC
Sbjct: 133 DITDLPKSVDWRKKGAVAPVKDQGQCGSCWAFSTVAAVEGINQITTGNLSSLSEQELIDC 192
Query: 189 DTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVP 248
DT+FN+GCNGGLMDYAF+YI+++GGLHKE+DYPYLMEEG C+++KE++E VTISGY+DVP
Sbjct: 193 DTTFNSGCNGGLMDYAFQYIISTGGLHKEDDYPYLMEEGICQEQKEDVERVTISGYEDVP 252
Query: 249 ENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDY 308
END++SL+KALAHQPVSVAIEASG DFQFY GGVF G CG +LDHGVAAVGYG SKGSDY
Sbjct: 253 ENDDESLVKALAHQPVSVAIEASGRDFQFYKGGVFNGKCGTDLDHGVAAVGYGSSKGSDY 312
Query: 309 IIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
+IVKNSWGP+WGE+G+IRMKRNTGKPEGLCGINKMAS P K
Sbjct: 313 VIVKNSWGPRWGEKGFIRMKRNTGKPEGLCGINKMASYPTK 353
>gi|225428328|ref|XP_002279940.1| PREDICTED: cysteine proteinase-like [Vitis vinifera]
Length = 707
Score = 580 bits (1494), Expect = e-163, Method: Compositional matrix adjust.
Identities = 264/324 (81%), Positives = 295/324 (91%)
Query: 26 HDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKE 85
DFSIVGYSPE LT +DKLI FESW+SKHGK YK +EEKLHRFE+F+ENL HID+RNKE
Sbjct: 382 RDFSIVGYSPEDLTCIDKLIARFESWVSKHGKVYKSMEEKLHRFEVFRENLNHIDERNKE 441
Query: 86 VTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAV 145
V+SYWLGLNEFAD+SHEEFK+KYLGL+ +FP R S EF YRDV LP+SVDWRKKGAV
Sbjct: 442 VSSYWLGLNEFADLSHEEFKSKYLGLRAEFPRSRDYSGEFRYRDVADLPESVDWRKKGAV 501
Query: 146 TPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAF 205
T VKNQG+CGSCWAFSTVAAVEGINQIV+GNLT+LSEQELIDCDT+FN+GCNGGLMDYAF
Sbjct: 502 THVKNQGACGSCWAFSTVAAVEGINQIVTGNLTTLSEQELIDCDTTFNSGCNGGLMDYAF 561
Query: 206 KYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVS 265
+I ++GGLHKE+DYPYLMEEGTCE++KE++++VTISGY+DVPE DE+SLLKALAHQP+S
Sbjct: 562 AFIASNGGLHKEDDYPYLMEEGTCEEQKEDVDIVTISGYEDVPEKDEESLLKALAHQPLS 621
Query: 266 VAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYI 325
VAIEASG DFQFYSGGVF GPCG ELDHGVAAVGYG SKG DYIIVKNSWGPKWGE+GYI
Sbjct: 622 VAIEASGRDFQFYSGGVFNGPCGTELDHGVAAVGYGSSKGLDYIIVKNSWGPKWGEKGYI 681
Query: 326 RMKRNTGKPEGLCGINKMASIPLK 349
RMKRNTGK EGLCGINKMAS P K
Sbjct: 682 RMKRNTGKTEGLCGINKMASYPTK 705
>gi|449500145|ref|XP_004161017.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
Length = 349
Score = 573 bits (1478), Expect = e-161, Method: Compositional matrix adjust.
Identities = 272/350 (77%), Positives = 307/350 (87%), Gaps = 2/350 (0%)
Query: 1 MAFFSHSKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYK 60
MA + SK L+ LS +LF ++AHDFSIVGYSPEHL SMDK IELFESWMSKH KTY+
Sbjct: 1 MALSTFSKATLI-LSATLFITYAIAHDFSIVGYSPEHLASMDKTIELFESWMSKHSKTYR 59
Query: 61 CIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQ 120
IEEKLHRFEIF +NLKHID+ NK+V+SYWLGLNEFAD+SHEEFK+KYLGL+ +FP R++
Sbjct: 60 SIEEKLHRFEIFLDNLKHIDETNKKVSSYWLGLNEFADLSHEEFKSKYLGLRVEFP-RKR 118
Query: 121 PSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSL 180
S FSY DV+ LP+SVDWR KGAVTPVKNQGSCGSCWAFSTVAAVEGINQIV+GNLTSL
Sbjct: 119 SSRGFSYGDVEDLPESVDWRTKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVTGNLTSL 178
Query: 181 SEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVT 240
SEQELIDCD SFNNGC GGLMDYAF+YI+++ GL KEEDYPYLMEEG C +KE+ EVVT
Sbjct: 179 SEQELIDCDRSFNNGCYGGLMDYAFQYIMSNSGLRKEEDYPYLMEEGRCIREKEQFEVVT 238
Query: 241 ISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGY 300
ISGY+DVP NDEQSLLKAL+HQPVSVAIEAS +FQFY GG+FTG CG ++DHGV AVGY
Sbjct: 239 ISGYEDVPANDEQSLLKALSHQPVSVAIEASSRNFQFYKGGIFTGRCGTQMDHGVTAVGY 298
Query: 301 GKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLKK 350
G S+G+DYIIVKNSWGPKWGE GYIRMKRNTGKPEGLCGIN+MAS P K+
Sbjct: 299 GSSEGTDYIIVKNSWGPKWGENGYIRMKRNTGKPEGLCGINQMASYPTKE 348
>gi|356508490|ref|XP_003522989.1| PREDICTED: xylem cysteine proteinase 2-like [Glycine max]
Length = 349
Score = 573 bits (1476), Expect = e-161, Method: Compositional matrix adjust.
Identities = 273/347 (78%), Positives = 304/347 (87%), Gaps = 1/347 (0%)
Query: 4 FSHSKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIE 63
FS SK L+L+ S LFA + DFSIVGYS E L SMDKLIELFESWMSKHGK Y+ IE
Sbjct: 3 FSFSKALVLACSFCLFASLAFGRDFSIVGYSSEDLKSMDKLIELFESWMSKHGKIYQSIE 62
Query: 64 EKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSA 123
EKL RFEIFK+NLKHID+RNK V++YWLGLNEFAD+SH+EFKNKYLGLK + RR+
Sbjct: 63 EKLLRFEIFKDNLKHIDERNKVVSNYWLGLNEFADLSHQEFKNKYLGLKVDYSRRRESPE 122
Query: 124 EFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQ 183
EF+Y+DV+ LPKSVDWRKKGAV PVKNQGSCGSCWAFSTVAAVEGINQIV+GNLTSLSEQ
Sbjct: 123 EFTYKDVE-LPKSVDWRKKGAVAPVKNQGSCGSCWAFSTVAAVEGINQIVTGNLTSLSEQ 181
Query: 184 ELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISG 243
ELIDCD ++NNGCNGGLMDYAF +IV +GGLHKEEDYPY+MEEGTCE KEE EVVTISG
Sbjct: 182 ELIDCDRTYNNGCNGGLMDYAFSFIVENGGLHKEEDYPYIMEEGTCEMTKEETEVVTISG 241
Query: 244 YQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKS 303
Y DVP+N+EQSLLKALA+QP+SVAIEASG DFQFYSGGVF G CG++LDHGVAAVGYG +
Sbjct: 242 YHDVPQNNEQSLLKALANQPLSVAIEASGRDFQFYSGGVFDGHCGSDLDHGVAAVGYGTA 301
Query: 304 KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLKK 350
KG DYIIVKNSWG KWGE+GYIRM+RN GKPEG+CGI KMAS P KK
Sbjct: 302 KGVDYIIVKNSWGSKWGEKGYIRMRRNIGKPEGICGIYKMASYPTKK 348
>gi|449454309|ref|XP_004144898.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
gi|449471311|ref|XP_004153272.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
Length = 349
Score = 571 bits (1471), Expect = e-160, Method: Compositional matrix adjust.
Identities = 271/350 (77%), Positives = 305/350 (87%), Gaps = 2/350 (0%)
Query: 1 MAFFSHSKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYK 60
MA + SK L+ LS +LF + AHDFSIVGYSPEHL SMDK IELFESWMSKH K Y+
Sbjct: 1 MALSTFSKATLI-LSATLFITYATAHDFSIVGYSPEHLASMDKTIELFESWMSKHSKAYR 59
Query: 61 CIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQ 120
IEEKLHRFEIF +NLKHID+ NK+V+SYWLGLNEFAD+SHEEFK+KYLGL+ +FP R++
Sbjct: 60 SIEEKLHRFEIFLDNLKHIDETNKKVSSYWLGLNEFADLSHEEFKSKYLGLRVEFP-RKR 118
Query: 121 PSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSL 180
S FSY DV+ LP+SVDWR KGAVTPVKNQGSCGSCWAFSTVAAVEGINQIV+GNLTSL
Sbjct: 119 SSRGFSYGDVEDLPESVDWRTKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVTGNLTSL 178
Query: 181 SEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVT 240
SEQELIDCD SFNNGC GGLMDYAF+YI+++ GL KEEDYPYLMEEG C +KE+ EVVT
Sbjct: 179 SEQELIDCDRSFNNGCYGGLMDYAFQYIMSNSGLRKEEDYPYLMEEGRCIREKEQFEVVT 238
Query: 241 ISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGY 300
ISGY+DVP NDEQSLLKAL+HQPVSVAIEAS +FQFY GG+FTG CG ++DHGV AVGY
Sbjct: 239 ISGYEDVPANDEQSLLKALSHQPVSVAIEASSRNFQFYKGGIFTGRCGTQMDHGVTAVGY 298
Query: 301 GKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLKK 350
G S+G+DYIIVKNSWGPKWGE GYIRMKRNTGKPEGLCGIN+MAS P K+
Sbjct: 299 GSSEGTDYIIVKNSWGPKWGENGYIRMKRNTGKPEGLCGINQMASYPTKE 348
>gi|356508487|ref|XP_003522988.1| PREDICTED: xylem cysteine proteinase 2-like [Glycine max]
Length = 349
Score = 570 bits (1468), Expect = e-160, Method: Compositional matrix adjust.
Identities = 271/347 (78%), Positives = 304/347 (87%), Gaps = 1/347 (0%)
Query: 4 FSHSKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIE 63
FS SK L L+ S LFA ++A DFSIVGYS E L SMDKLIELFESWMS+HGK Y+ IE
Sbjct: 3 FSSSKALFLACSFCLFASLAVAGDFSIVGYSSEDLKSMDKLIELFESWMSRHGKIYQSIE 62
Query: 64 EKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSA 123
EKLHRF+IFK+NLKHID+RNK V++YWLGLNEFAD+SH+EFKNKYLGLK + RR+
Sbjct: 63 EKLHRFDIFKDNLKHIDERNKVVSNYWLGLNEFADLSHQEFKNKYLGLKVDYSRRRESPE 122
Query: 124 EFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQ 183
EF+Y+D + LPKSVDWRKKGAVT VKNQGSCGSCWAFSTVAAVEGINQIV+GNLTSLSEQ
Sbjct: 123 EFTYKDFE-LPKSVDWRKKGAVTQVKNQGSCGSCWAFSTVAAVEGINQIVTGNLTSLSEQ 181
Query: 184 ELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISG 243
ELIDCD ++NNGCNGGLMDYAF +IV +GGLHKEEDYPY+MEEGTCE KEE EVVTISG
Sbjct: 182 ELIDCDRTYNNGCNGGLMDYAFSFIVENGGLHKEEDYPYIMEEGTCEMTKEETEVVTISG 241
Query: 244 YQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKS 303
Y DVP+N+EQSLLKAL +QP+SVAIEASG DFQFYSGGVF G CG++LDHGVAAVGYG S
Sbjct: 242 YHDVPQNNEQSLLKALVNQPLSVAIEASGRDFQFYSGGVFDGHCGSDLDHGVAAVGYGTS 301
Query: 304 KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLKK 350
KG +YIIVKNSWG KWGE+GYIRM+RN GKPEG+CGI KMAS P KK
Sbjct: 302 KGVNYIIVKNSWGSKWGEKGYIRMRRNIGKPEGICGIYKMASYPTKK 348
>gi|356517184|ref|XP_003527269.1| PREDICTED: xylem cysteine proteinase 2-like [Glycine max]
Length = 350
Score = 567 bits (1462), Expect = e-159, Method: Compositional matrix adjust.
Identities = 269/350 (76%), Positives = 303/350 (86%), Gaps = 1/350 (0%)
Query: 1 MAFFSHSKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYK 60
MAF + L +L+ S LFA + DFSIVGYS E L SMDKLIELFESW+S+HGK Y+
Sbjct: 1 MAFSTSKALRVLACSFCLFASFTFGRDFSIVGYSSEDLKSMDKLIELFESWISRHGKIYQ 60
Query: 61 CIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQ 120
IEEKLHRFEIFK+NLKHID+RNK V++YWLGLNEFAD+SH+EFKNKYLGLK + RR+
Sbjct: 61 SIEEKLHRFEIFKDNLKHIDERNKVVSNYWLGLNEFADLSHQEFKNKYLGLKVDYSRRRE 120
Query: 121 PSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSL 180
EF+Y+DV+ LPKSVDWRKKGAVT VKNQGSCGSCWAFSTVAAVEGINQIV+GNLTSL
Sbjct: 121 SPEEFTYKDVE-LPKSVDWRKKGAVTQVKNQGSCGSCWAFSTVAAVEGINQIVTGNLTSL 179
Query: 181 SEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVT 240
SEQELIDCD ++NNGCNGGLMDYAF +IV + GLHKEEDYPY+MEEGTCE KEE EVVT
Sbjct: 180 SEQELIDCDRTYNNGCNGGLMDYAFSFIVENDGLHKEEDYPYIMEEGTCEMAKEETEVVT 239
Query: 241 ISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGY 300
ISGY DVP+N+EQSLLKALA+QP+SVAIEASG DFQFYSGGVF G CG++LDHGVAAVGY
Sbjct: 240 ISGYHDVPQNNEQSLLKALANQPLSVAIEASGRDFQFYSGGVFDGHCGSDLDHGVAAVGY 299
Query: 301 GKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLKK 350
G +KG DYI VKNSWG KWGE+GYIRM+RN GKPEG+CGI KMAS P KK
Sbjct: 300 GTAKGVDYITVKNSWGSKWGEKGYIRMRRNIGKPEGICGIYKMASYPTKK 349
>gi|356517188|ref|XP_003527271.1| PREDICTED: xylem cysteine proteinase 2-like [Glycine max]
Length = 350
Score = 567 bits (1460), Expect = e-159, Method: Compositional matrix adjust.
Identities = 269/350 (76%), Positives = 302/350 (86%), Gaps = 1/350 (0%)
Query: 1 MAFFSHSKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYK 60
MAF S L+L++ S LFA + DFSIVGYS E L SMDKLIELFESWMS+HGK Y+
Sbjct: 1 MAFSSSKALVLIACSFCLFASLAFGRDFSIVGYSSEDLKSMDKLIELFESWMSRHGKIYE 60
Query: 61 CIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQ 120
IEEKL RFEIFK+NLKHID+RNK V++YWLGLNEFAD+SH EF NKYLGLK + RR+
Sbjct: 61 NIEEKLLRFEIFKDNLKHIDERNKVVSNYWLGLNEFADLSHREFNNKYLGLKVDYSRRRE 120
Query: 121 PSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSL 180
EF+Y+DV+ LPKSVDWRKKGAV PVKNQGSCGSCWAFSTVAAVEGINQIV+GNLTSL
Sbjct: 121 SPEEFTYKDVE-LPKSVDWRKKGAVAPVKNQGSCGSCWAFSTVAAVEGINQIVTGNLTSL 179
Query: 181 SEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVT 240
SEQELIDCD ++NNGCNGGLMDYAF +IV +GGLHKEEDYPY+MEEGTCE KEE +VVT
Sbjct: 180 SEQELIDCDRTYNNGCNGGLMDYAFSFIVENGGLHKEEDYPYIMEEGTCEMTKEETQVVT 239
Query: 241 ISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGY 300
ISGY DVP+N+EQSLLKALA+QP+SVAIEASG DFQFYSGGVF G CG++LDHGVAAVGY
Sbjct: 240 ISGYHDVPQNNEQSLLKALANQPLSVAIEASGRDFQFYSGGVFDGHCGSDLDHGVAAVGY 299
Query: 301 GKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLKK 350
G +KG DYI VKNSWG KWGE+GYIRM+RN GKPEG+CGI KMAS P KK
Sbjct: 300 GTAKGVDYITVKNSWGSKWGEKGYIRMRRNIGKPEGICGIYKMASYPTKK 349
>gi|359491865|ref|XP_002273243.2| PREDICTED: xylem cysteine proteinase 1-like [Vitis vinifera]
Length = 351
Score = 566 bits (1459), Expect = e-159, Method: Compositional matrix adjust.
Identities = 265/349 (75%), Positives = 300/349 (85%)
Query: 1 MAFFSHSKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYK 60
MA S LL +S+++FA S+ A DFSIVGYSP+ LTSMDKL +LFESWMSKHGK+Y+
Sbjct: 1 MALSPFSNFFLLFISMAVFAYSAFARDFSIVGYSPDDLTSMDKLTDLFESWMSKHGKSYR 60
Query: 61 CIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQ 120
EEKLHRFE+F++NLKHID+ NK+V+SYWLGLNEFAD+SHEEFK KYLGLK + P RR
Sbjct: 61 SFEEKLHRFEVFQDNLKHIDETNKKVSSYWLGLNEFADLSHEEFKRKYLGLKIELPKRRD 120
Query: 121 PSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSL 180
EFSY+DV LPKSVDWRKKGAV VKNQG+CGSCWAFSTVAAVEGINQIV+GNLT+L
Sbjct: 121 SPEEFSYKDVADLPKSVDWRKKGAVAHVKNQGACGSCWAFSTVAAVEGINQIVTGNLTAL 180
Query: 181 SEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVT 240
SEQELIDCD FNNGCNGGLMDYAF +I+++GGL KEEDYPY+MEEGTC +KKEE+EVVT
Sbjct: 181 SEQELIDCDKPFNNGCNGGLMDYAFAFIISNGGLRKEEDYPYVMEEGTCGEKKEELEVVT 240
Query: 241 ISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGY 300
ISGY DVPE++EQS LKALA+QP+SVAIEAS FQFYSGG+F G CG ELDHGVAAVGY
Sbjct: 241 ISGYHDVPEDNEQSFLKALANQPLSVAIEASSRGFQFYSGGIFNGHCGTELDHGVAAVGY 300
Query: 301 GKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
G SKG DYI VKNSWG KWGE+GYIRMKRN GKPEG+CGI KMAS P K
Sbjct: 301 GTSKGVDYITVKNSWGSKWGEKGYIRMKRNVGKPEGICGIYKMASYPTK 349
>gi|37780041|gb|AAP32193.1| cysteine protease 14 [Trifolium repens]
Length = 351
Score = 563 bits (1451), Expect = e-158, Method: Compositional matrix adjust.
Identities = 271/352 (76%), Positives = 305/352 (86%), Gaps = 4/352 (1%)
Query: 1 MAFFSHSKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYK 60
MAFFS SK L+L+ SL LF + DFSIVGYS E L SMDKLIELFESWMS+HGK Y+
Sbjct: 1 MAFFS-SKTLVLTCSLCLFLSLAFGRDFSIVGYSSEDLKSMDKLIELFESWMSRHGKIYE 59
Query: 61 CIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQ 120
IEEKL RFE+FK+NLKHID+RNK V++YWLGLNEFAD+SH+EFKNKYLGLK RR+
Sbjct: 60 TIEEKLLRFEVFKDNLKHIDERNKIVSNYWLGLNEFADLSHQEFKNKYLGLKVNLSQRRE 119
Query: 121 PS--AEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLT 178
S EF+YRDV LPKSVDWRKKGAVTPVKNQG CGSCWAFSTVAAVEGINQIV+GNLT
Sbjct: 120 SSNEEEFTYRDVD-LPKSVDWRKKGAVTPVKNQGQCGSCWAFSTVAAVEGINQIVTGNLT 178
Query: 179 SLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEV 238
SLSEQELIDCDT++NNGCNGGLMDYAF +IV +GGLHKE+DYPY+MEE TCE KKEE +V
Sbjct: 179 SLSEQELIDCDTTYNNGCNGGLMDYAFSFIVQNGGLHKEDDYPYIMEESTCEMKKEETQV 238
Query: 239 VTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAV 298
VTI+GY DVP+N+EQSLLKALA+QP+SVAIEAS DFQFYSGGVF G CG++LDHGV+AV
Sbjct: 239 VTINGYHDVPQNNEQSLLKALANQPLSVAIEASSRDFQFYSGGVFDGHCGSDLDHGVSAV 298
Query: 299 GYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLKK 350
GYG SK DYIIVKNSWG KWGE+G+IRMKRN GKPEG+CG+ KMAS P KK
Sbjct: 299 GYGTSKNLDYIIVKNSWGAKWGEKGFIRMKRNIGKPEGICGLYKMASYPTKK 350
>gi|255646767|gb|ACU23856.1| unknown [Glycine max]
Length = 350
Score = 562 bits (1449), Expect = e-158, Method: Compositional matrix adjust.
Identities = 267/350 (76%), Positives = 301/350 (86%), Gaps = 1/350 (0%)
Query: 1 MAFFSHSKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYK 60
MAF S L+L++ S LFA + DFSIVGYS E L SMDKLIELFESWMS+HGK Y+
Sbjct: 1 MAFSSSKALVLIACSFCLFASLAFGRDFSIVGYSSEDLKSMDKLIELFESWMSRHGKIYE 60
Query: 61 CIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQ 120
IEEKL RFEIFK+NLKHID+RNK V++YWLGL+EFAD+SH EF NKYLGLK + RR+
Sbjct: 61 NIEEKLLRFEIFKDNLKHIDERNKVVSNYWLGLSEFADLSHREFNNKYLGLKVDYSRRRE 120
Query: 121 PSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSL 180
EF+Y+DV+ LPKSVDWRKKGAV PVKNQGSCGSCWAFSTVAAVEGINQIV+GNLTSL
Sbjct: 121 SPEEFTYKDVE-LPKSVDWRKKGAVAPVKNQGSCGSCWAFSTVAAVEGINQIVTGNLTSL 179
Query: 181 SEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVT 240
SEQELIDCD ++NNGCNGGLMDYAF +IV +GGLHKEEDYPY+MEEG CE KEE +VVT
Sbjct: 180 SEQELIDCDRTYNNGCNGGLMDYAFSFIVENGGLHKEEDYPYIMEEGACEMTKEETQVVT 239
Query: 241 ISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGY 300
ISGY DVP+N+EQSLLKALA+QP+SVAIEASG DFQFYSGGVF G CG++LDHGVAAVGY
Sbjct: 240 ISGYHDVPQNNEQSLLKALANQPLSVAIEASGRDFQFYSGGVFDGHCGSDLDHGVAAVGY 299
Query: 301 GKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLKK 350
G +KG DYI VKNSWG KWGE+GYIRM+RN GKPEG+CGI KMAS P KK
Sbjct: 300 GTAKGVDYITVKNSWGSKWGEKGYIRMRRNIGKPEGICGIYKMASYPTKK 349
>gi|357467173|ref|XP_003603871.1| Cysteine proteinase [Medicago truncatula]
gi|355492919|gb|AES74122.1| Cysteine proteinase [Medicago truncatula]
gi|388499154|gb|AFK37643.1| unknown [Medicago truncatula]
Length = 350
Score = 562 bits (1448), Expect = e-158, Method: Compositional matrix adjust.
Identities = 273/351 (77%), Positives = 303/351 (86%), Gaps = 3/351 (0%)
Query: 1 MAFFSHSKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYK 60
MAFFS K L+L+ SL LF + DFSIVGYS E L SMDKLIELFESWMS+HGK Y+
Sbjct: 1 MAFFS-PKTLVLTCSLCLFLSLAFGRDFSIVGYSSEDLKSMDKLIELFESWMSRHGKIYE 59
Query: 61 CIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQ 120
IEEKL RFE+FK+NLKHID RNK V++YWLGLNEFAD+SH+EFKNKYLGLK RR+
Sbjct: 60 TIEEKLLRFEVFKDNLKHIDDRNKVVSNYWLGLNEFADLSHQEFKNKYLGLKVDLSQRRE 119
Query: 121 PSAE-FSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTS 179
S E F+YRDV LPKSVDWRKKGAVTPVKNQG CGSCWAFSTVAAVEGINQIV+GNLTS
Sbjct: 120 SSEEEFTYRDVD-LPKSVDWRKKGAVTPVKNQGQCGSCWAFSTVAAVEGINQIVTGNLTS 178
Query: 180 LSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVV 239
LSEQELIDCDT++NNGCNGGLMDYAF +IV +GGLHKEEDYPY+MEE TCE KKE EVV
Sbjct: 179 LSEQELIDCDTTYNNGCNGGLMDYAFSFIVKNGGLHKEEDYPYIMEESTCEMKKEVSEVV 238
Query: 240 TISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVG 299
TI+GY DVP+N+EQSLLKALA+QP+SVAIEASG DFQFYSGGVF G CG+ELDHGV+AVG
Sbjct: 239 TINGYHDVPQNNEQSLLKALANQPLSVAIEASGRDFQFYSGGVFDGHCGSELDHGVSAVG 298
Query: 300 YGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLKK 350
YG SKG DYIIVKNSWG KWGE+G+IRMKRN GK EG+CG+ KMAS P KK
Sbjct: 299 YGTSKGLDYIIVKNSWGAKWGEKGFIRMKRNIGKSEGICGLYKMASYPTKK 349
>gi|449522968|ref|XP_004168497.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
Length = 348
Score = 561 bits (1447), Expect = e-157, Method: Compositional matrix adjust.
Identities = 265/343 (77%), Positives = 297/343 (86%)
Query: 7 SKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKL 66
SKLL L++ +S F +S DFSIVGY PE LTSMD+LIELFE W+S HGK Y+ IEEK
Sbjct: 4 SKLLPLAMCMSFFVVTSFGKDFSIVGYWPEDLTSMDRLIELFEEWISNHGKIYETIEEKW 63
Query: 67 HRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFS 126
HRFE+FK+NLKHID+ NK+VTSYWLG+NEFAD++H+EFKN YLGLK + RQ EF+
Sbjct: 64 HRFEVFKDNLKHIDETNKKVTSYWLGVNEFADLTHQEFKNMYLGLKVESSRTRQSPEEFT 123
Query: 127 YRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELI 186
Y+DV LPKSVDWRKKGAVT VKNQGSCGSCWAFSTVAAVEGIN+IV GNLTSLSEQELI
Sbjct: 124 YKDVVDLPKSVDWRKKGAVTRVKNQGSCGSCWAFSTVAAVEGINKIVGGNLTSLSEQELI 183
Query: 187 DCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQD 246
DCD +NNGC+GGLMDYAF +IV+SGGLHKEEDYPYL E TC++KK E+EVVTISGY+D
Sbjct: 184 DCDRPYNNGCHGGLMDYAFSFIVSSGGLHKEEDYPYLEVESTCDNKKGELEVVTISGYKD 243
Query: 247 VPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGS 306
VPEN+E SL+KALAHQP+SVAIEASG DFQFYSGGVF GPCG +LDHGV AVGYG SKG
Sbjct: 244 VPENNEASLIKALAHQPLSVAIEASGRDFQFYSGGVFDGPCGTQLDHGVTAVGYGSSKGV 303
Query: 307 DYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
DYIIVKNSWGPKWGE+GYIRMKRNTGKP GLCGINKMAS P K
Sbjct: 304 DYIIVKNSWGPKWGEKGYIRMKRNTGKPAGLCGINKMASYPTK 346
>gi|224083362|ref|XP_002306996.1| predicted protein [Populus trichocarpa]
gi|222856445|gb|EEE93992.1| predicted protein [Populus trichocarpa]
Length = 336
Score = 560 bits (1443), Expect = e-157, Method: Compositional matrix adjust.
Identities = 260/335 (77%), Positives = 296/335 (88%)
Query: 16 LSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKEN 75
+S FA S LA DFSIVGY+PE LTS D++I+LFESW+SKH K Y+ IEEK HRFEIFK+N
Sbjct: 1 MSFFASSCLARDFSIVGYAPEDLTSRDRIIDLFESWISKHQKIYESIEEKWHRFEIFKDN 60
Query: 76 LKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPK 135
L HID+ NK+V +YWLGLNEFAD+SHEEFKNKYLGL RR+ S EF+Y+DV ++PK
Sbjct: 61 LFHIDETNKKVVNYWLGLNEFADLSHEEFKNKYLGLNVDLSNRRECSEEFTYKDVSSIPK 120
Query: 136 SVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNG 195
SVDWRKKGAVT VKNQGSCGSCWAFSTVAAVEGINQIV+GNLTSLSEQEL+DCDT++NNG
Sbjct: 121 SVDWRKKGAVTDVKNQGSCGSCWAFSTVAAVEGINQIVTGNLTSLSEQELVDCDTTYNNG 180
Query: 196 CNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSL 255
CNGGLMDYAF YI+++GGLHKEEDYPY+MEEGTCE +K E EVVTISGY DVP+N E+SL
Sbjct: 181 CNGGLMDYAFAYIISNGGLHKEEDYPYIMEEGTCEMRKAESEVVTISGYHDVPQNSEESL 240
Query: 256 LKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSW 315
LKALA+QP+SVAI+ASG DFQFYSGGVF G CG ELDHGVAAVGYG +KG D+I+VKNSW
Sbjct: 241 LKALANQPLSVAIDASGRDFQFYSGGVFDGHCGTELDHGVAAVGYGSAKGLDFIVVKNSW 300
Query: 316 GPKWGERGYIRMKRNTGKPEGLCGINKMASIPLKK 350
G KWGE+G+IRMKRNTGKP GLCGINKMAS P KK
Sbjct: 301 GSKWGEKGFIRMKRNTGKPAGLCGINKMASYPTKK 335
>gi|449455625|ref|XP_004145553.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
Length = 351
Score = 558 bits (1439), Expect = e-156, Method: Compositional matrix adjust.
Identities = 264/349 (75%), Positives = 297/349 (85%)
Query: 1 MAFFSHSKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYK 60
MA +S L++ +S F +S DFSIVGY PE LTSMD+LIELFE W+S HGK Y+
Sbjct: 1 MAPSPYSFYFFLAMCMSFFVVTSFGKDFSIVGYWPEDLTSMDRLIELFEEWISNHGKIYE 60
Query: 61 CIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQ 120
IEEK HRFE+FK+NLKHID+ NK+VTSYWLG+NEFAD++H+EFKN YLGLK + RQ
Sbjct: 61 TIEEKWHRFEVFKDNLKHIDETNKKVTSYWLGVNEFADLTHQEFKNMYLGLKVESSRTRQ 120
Query: 121 PSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSL 180
EF+Y+DV LPKSVDWRKKGAVT VKNQGSCGSCWAFSTVAAVEGIN+IV GNLTSL
Sbjct: 121 SPEEFTYKDVVDLPKSVDWRKKGAVTRVKNQGSCGSCWAFSTVAAVEGINKIVGGNLTSL 180
Query: 181 SEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVT 240
SEQELIDCD +NNGC+GGLMDYAF +IV+SGGLHKEEDYPYL E TC++KK E+EVVT
Sbjct: 181 SEQELIDCDRPYNNGCHGGLMDYAFSFIVSSGGLHKEEDYPYLEVESTCDNKKGELEVVT 240
Query: 241 ISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGY 300
ISGY+DVPEN+E SL+KALAHQP+SVAIEASG DFQFYSGGVF GPCG +LDHGV AVGY
Sbjct: 241 ISGYKDVPENNEASLIKALAHQPLSVAIEASGRDFQFYSGGVFDGPCGTQLDHGVTAVGY 300
Query: 301 GKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
G SKG DYIIVKNSWGPKWGE+GYIRMKRNTGKP GLCGINKMAS P K
Sbjct: 301 GSSKGVDYIIVKNSWGPKWGEKGYIRMKRNTGKPAGLCGINKMASYPTK 349
>gi|37780039|gb|AAP32192.1| cysteine protease 14 [Trifolium repens]
Length = 351
Score = 558 bits (1439), Expect = e-156, Method: Compositional matrix adjust.
Identities = 270/352 (76%), Positives = 303/352 (86%), Gaps = 4/352 (1%)
Query: 1 MAFFSHSKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYK 60
MAFFS SK L+L+ SL LF + DFSIVGYS E L SMDKLIELFESWMS+HGK Y+
Sbjct: 1 MAFFS-SKTLVLTCSLCLFLSLAFGRDFSIVGYSSEDLKSMDKLIELFESWMSRHGKIYE 59
Query: 61 CIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQ 120
IEEKL RFE+FK+NLKHID RNK V++YWLGLNEFAD+SH+EFKNKYLGLK RR+
Sbjct: 60 TIEEKLLRFEVFKDNLKHIDDRNKIVSNYWLGLNEFADLSHQEFKNKYLGLKVDLSQRRE 119
Query: 121 PS--AEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLT 178
S EF+YRDV LPKSVDWRKKGAVTPVKNQG CGSCWAFSTVAAVEGINQIV+GNLT
Sbjct: 120 SSNEEEFTYRDVD-LPKSVDWRKKGAVTPVKNQGQCGSCWAFSTVAAVEGINQIVTGNLT 178
Query: 179 SLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEV 238
SLSEQELIDCDT++NNGCNGGLMDYAF +I +GGLHKEEDYPY+MEE TCE KKEE +V
Sbjct: 179 SLSEQELIDCDTTYNNGCNGGLMDYAFSFIGQNGGLHKEEDYPYIMEESTCEMKKEETQV 238
Query: 239 VTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAV 298
VTI+GY DVP+N+EQSLLKALA+QP+SVAIEAS DFQFYSGGVF G CG++LDHGV+AV
Sbjct: 239 VTINGYHDVPQNNEQSLLKALANQPLSVAIEASSRDFQFYSGGVFDGHCGSDLDHGVSAV 298
Query: 299 GYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLKK 350
GYG SK DYIIVKNSWG KWGE+G+IRMKR+ GKPEG+CG+ KMAS P KK
Sbjct: 299 GYGTSKNLDYIIVKNSWGAKWGEKGFIRMKRDIGKPEGICGLYKMASYPTKK 350
>gi|255539310|ref|XP_002510720.1| cysteine protease, putative [Ricinus communis]
gi|223551421|gb|EEF52907.1| cysteine protease, putative [Ricinus communis]
Length = 349
Score = 557 bits (1436), Expect = e-156, Method: Compositional matrix adjust.
Identities = 269/350 (76%), Positives = 301/350 (86%), Gaps = 2/350 (0%)
Query: 1 MAFFSHSKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYK 60
M+ S+S L L++SLS A S A D SIVGY+PE LTS DKLI+LFESW+S+ G+ Y+
Sbjct: 1 MSPSSYSFLFFLAVSLSFLAYSGFARD-SIVGYAPEDLTSNDKLIDLFESWISRFGRVYE 59
Query: 61 CIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQ 120
EEKL RFEIFK+NL HID NK+V +YWLGLNEFAD+SHEEFKNKYLGLKP R Q
Sbjct: 60 SAEEKLERFEIFKDNLFHIDDTNKKVRNYWLGLNEFADLSHEEFKNKYLGLKPDLSKRAQ 119
Query: 121 PSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSL 180
EF+Y+DV A+PKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIV+GNLTSL
Sbjct: 120 CPEEFTYKDV-AIPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVTGNLTSL 178
Query: 181 SEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVT 240
SEQELIDCDT++NNGCNGGLMDYAF YIVA+GGLHKEEDYPY+MEEGTC+ +KEE + VT
Sbjct: 179 SEQELIDCDTTYNNGCNGGLMDYAFAYIVANGGLHKEEDYPYIMEEGTCDMRKEESDAVT 238
Query: 241 ISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGY 300
ISGY DVP+N E+SLLKALA+QP+S+AIEASG DFQFYSGGVF G CG ELDHGVAAVGY
Sbjct: 239 ISGYHDVPQNSEESLLKALANQPLSIAIEASGRDFQFYSGGVFDGHCGTELDHGVAAVGY 298
Query: 301 GKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLKK 350
G SKG DYIIVKNSWGPKWGE+GYIRMKR T KPEG+CGI KMAS P KK
Sbjct: 299 GTSKGLDYIIVKNSWGPKWGEKGYIRMKRKTSKPEGICGIYKMASYPTKK 348
>gi|224065647|ref|XP_002301901.1| predicted protein [Populus trichocarpa]
gi|222843627|gb|EEE81174.1| predicted protein [Populus trichocarpa]
Length = 336
Score = 554 bits (1428), Expect = e-155, Method: Compositional matrix adjust.
Identities = 262/335 (78%), Positives = 294/335 (87%)
Query: 16 LSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKEN 75
+S FA S LA DFSIVGY+PE LTS DK+I+LFESW+SKHGK Y+ IEEK RFEIFK+N
Sbjct: 1 MSFFANSGLARDFSIVGYTPEDLTSGDKIIDLFESWISKHGKIYESIEEKWLRFEIFKDN 60
Query: 76 LKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPK 135
L HID+ NK+V +YWLGLNEF+D+SHEEFKNKYLGLK RR+ S EF+Y+DV ++PK
Sbjct: 61 LFHIDETNKKVVNYWLGLNEFSDLSHEEFKNKYLGLKVDMSERRECSQEFNYKDVMSIPK 120
Query: 136 SVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNG 195
SVDWRKKGAVT VKNQGSCGSCWAFSTVAAVEGINQIV+GNLTSLSEQEL+DCDT+ N G
Sbjct: 121 SVDWRKKGAVTDVKNQGSCGSCWAFSTVAAVEGINQIVTGNLTSLSEQELVDCDTTNNYG 180
Query: 196 CNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSL 255
CNGGLMDYAF YI+++GGLHKE DYPY+MEEGTCE +KEE EVVTISGY DVP+N E+SL
Sbjct: 181 CNGGLMDYAFSYIISNGGLHKEVDYPYIMEEGTCEMRKEESEVVTISGYHDVPQNSEESL 240
Query: 256 LKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSW 315
LKALA+QP+SVAIEASG DFQFYSGGVF G CG +LDHGVAAVGYG + G DYIIVKNSW
Sbjct: 241 LKALANQPLSVAIEASGRDFQFYSGGVFDGHCGTQLDHGVAAVGYGSTNGLDYIIVKNSW 300
Query: 316 GPKWGERGYIRMKRNTGKPEGLCGINKMASIPLKK 350
G KWGE+GYIRMKRNTGKP GLCGINKMAS P KK
Sbjct: 301 GSKWGEKGYIRMKRNTGKPAGLCGINKMASYPTKK 335
>gi|297845064|ref|XP_002890413.1| hypothetical protein ARALYDRAFT_472321 [Arabidopsis lyrata subsp.
lyrata]
gi|297336255|gb|EFH66672.1| hypothetical protein ARALYDRAFT_472321 [Arabidopsis lyrata subsp.
lyrata]
Length = 357
Score = 542 bits (1397), Expect = e-152, Method: Compositional matrix adjust.
Identities = 263/355 (74%), Positives = 299/355 (84%), Gaps = 6/355 (1%)
Query: 1 MAFFSHSKLLLLSLSLSLFACS---SLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGK 57
MA S S++L L+LS S + +HD+SIVGYSPE L S DKLIELFE+W+S K
Sbjct: 1 MALSSPSRILCFPLALSAATLSLSVAASHDYSIVGYSPEDLESHDKLIELFENWISNFEK 60
Query: 58 TYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPT 117
Y+ +EEKL RFE+FK+NLKHID+ NK+V SYWLGLNEFAD+SHEEFK YLGLK
Sbjct: 61 AYETVEEKLLRFEVFKDNLKHIDETNKKVKSYWLGLNEFADLSHEEFKKMYLGLKTDIVR 120
Query: 118 RRQPS--AEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSG 175
R + AEF+YRDV+A+PKSVDWRKKGAV VKNQGSCGSCWAFSTVAAVEGIN+IV+G
Sbjct: 121 RDEERSYAEFAYRDVEAVPKSVDWRKKGAVAEVKNQGSCGSCWAFSTVAAVEGINKIVTG 180
Query: 176 NLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEE 235
NLT+LSEQELIDCDT++NNGCNGGLMDYAF+YIV +GGL KEEDYPY MEEGTCE +K+E
Sbjct: 181 NLTTLSEQELIDCDTTYNNGCNGGLMDYAFEYIVKNGGLRKEEDYPYSMEEGTCEMQKDE 240
Query: 236 MEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSG-GVFTGPCGAELDHG 294
E VTI G+QDVP NDE+SLLKALAHQP+SVAI+ASG +FQFYSG VF G CG +LDHG
Sbjct: 241 SETVTIDGHQDVPTNDEKSLLKALAHQPLSVAIDASGREFQFYSGVSVFDGRCGVDLDHG 300
Query: 295 VAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
VAAVGYG SKGSDYIIVKNSWGPKWGE+GYIR+KRNTGKPEGLCGINKMAS P K
Sbjct: 301 VAAVGYGSSKGSDYIIVKNSWGPKWGEKGYIRLKRNTGKPEGLCGINKMASFPTK 355
>gi|30141025|dbj|BAC75926.1| cysteine protease-4 [Helianthus annuus]
Length = 352
Score = 540 bits (1391), Expect = e-151, Method: Compositional matrix adjust.
Identities = 255/352 (72%), Positives = 298/352 (84%), Gaps = 3/352 (0%)
Query: 1 MAF-FSHSKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTY 59
MAF FS K L + +S+ ACS+LA++FSI+GY+PE LTS+ K+I LFESW++KH K Y
Sbjct: 1 MAFIFSSKKTSLFLVFVSVLACSALANEFSILGYAPEDLTSIHKVIHLFESWLAKHSKIY 60
Query: 60 KCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRR 119
+ ++EKLHRFEIF +NLKHID NK+V++YWLGLNEFAD++HEEFKNK+LGLK + P R+
Sbjct: 61 ESLDEKLHRFEIFMDNLKHIDDTNKKVSNYWLGLNEFADLTHEEFKNKFLGLKGELPERK 120
Query: 120 QPSAE-FSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLT 178
S E FSYRD LPKSVDWRKKGAV PVKNQG CGSCWAFSTVAAVEGINQIV+GNLT
Sbjct: 121 DESIEEFSYRDFVDLPKSVDWRKKGAVAPVKNQGQCGSCWAFSTVAAVEGINQIVTGNLT 180
Query: 179 SLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEV 238
LSEQELIDCDT+FNNGCNGGLMDYAF Y++ SG LHKEE+YPY+M EGTC++KK+ E
Sbjct: 181 MLSEQELIDCDTTFNNGCNGGLMDYAFAYVMRSG-LHKEEEYPYIMSEGTCDEKKDVSET 239
Query: 239 VTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAV 298
VTISGY DVP N+E S LKALA+QP+SVAIEASG DFQFYSGGVF G CG ELDHGVAAV
Sbjct: 240 VTISGYHDVPRNNEDSFLKALANQPISVAIEASGRDFQFYSGGVFDGHCGTELDHGVAAV 299
Query: 299 GYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLKK 350
GYG +KG DY+IV+NSWGPKWGE+GYIRMKR TGKP G+CG+ MAS P K+
Sbjct: 300 GYGTTKGLDYVIVRNSWGPKWGEKGYIRMKRKTGKPHGMCGLYMMASYPTKQ 351
>gi|255546708|ref|XP_002514413.1| cysteine protease, putative [Ricinus communis]
gi|223546510|gb|EEF48009.1| cysteine protease, putative [Ricinus communis]
Length = 324
Score = 539 bits (1389), Expect = e-151, Method: Compositional matrix adjust.
Identities = 261/346 (75%), Positives = 288/346 (83%), Gaps = 26/346 (7%)
Query: 5 SHSKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEE 64
S S + L ++ SL CS +AHDFSIVGYSPEHLTSM KL ELFESWMSKHGKTY+ IEE
Sbjct: 4 SVSSIFLFTIFTSLVICSVVAHDFSIVGYSPEHLTSMHKLTELFESWMSKHGKTYESIEE 63
Query: 65 KLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAE 124
KLHR E+FK+NL HID+RN++VT+YWL LNEFAD+SHEEFK+K + RR
Sbjct: 64 KLHRLEVFKDNLMHIDRRNRDVTTYWLALNEFADLSHEEFKSKLAQI------RR----- 112
Query: 125 FSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQE 184
+KGAV PVKNQGSCGSCWAFSTVAAVEGINQIV+GNLTSLSEQE
Sbjct: 113 ---------------LEKGAVAPVKNQGSCGSCWAFSTVAAVEGINQIVTGNLTSLSEQE 157
Query: 185 LIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGY 244
LIDCDTSFN+GCNGGLMDYAF YIV +GGLHKEEDYPYLMEEGTC++K+EEMEVVTISGY
Sbjct: 158 LIDCDTSFNSGCNGGLMDYAFDYIVNNGGLHKEEDYPYLMEEGTCDEKREEMEVVTISGY 217
Query: 245 QDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSK 304
DVPEN+E+SLLKALAHQP+S+AIEASG DFQFY GVF GPCG +LDHGVAAVGYG SK
Sbjct: 218 HDVPENNEESLLKALAHQPLSIAIEASGRDFQFYGRGVFNGPCGTDLDHGVAAVGYGSSK 277
Query: 305 GSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLKK 350
G DYIIVKNSWGPKWGE+GYIRMKRNTGKPEGLCGINKMAS P KK
Sbjct: 278 GLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPTKK 323
>gi|255635584|gb|ACU18142.1| unknown [Glycine max]
Length = 345
Score = 539 bits (1388), Expect = e-151, Method: Compositional matrix adjust.
Identities = 262/345 (75%), Positives = 294/345 (85%), Gaps = 2/345 (0%)
Query: 4 FSHSKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIE 63
FS SK L+L+ S LFA + DFSIVGYS E L SMDKLIELFESWMSKHGK Y+ IE
Sbjct: 3 FSFSKALVLACSFCLFASLAFGRDFSIVGYSSEDLKSMDKLIELFESWMSKHGKIYQSIE 62
Query: 64 EKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSA 123
EKL RFEIFK+NLKHID+RNK V++YWLGLNEFAD+SH+EFKNKYLGLK + RR+
Sbjct: 63 EKLLRFEIFKDNLKHIDERNKVVSNYWLGLNEFADLSHQEFKNKYLGLKVDYSRRRESPE 122
Query: 124 EFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQ 183
EF+Y+DV+ LPKSVDWRKKGAV PVKNQGSCGSCWAFSTVAAVEGINQIV+GNLTSLSEQ
Sbjct: 123 EFTYKDVE-LPKSVDWRKKGAVAPVKNQGSCGSCWAFSTVAAVEGINQIVTGNLTSLSEQ 181
Query: 184 ELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISG 243
ELIDCD +++NGCNGGLMDYAF +IV +GGLHKEEDYPY+MEEGTCE KEE EVVTISG
Sbjct: 182 ELIDCDRTYSNGCNGGLMDYAFSFIVENGGLHKEEDYPYIMEEGTCEMTKEETEVVTISG 241
Query: 244 YQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKS 303
Y DVP+N+EQSLLKALA+Q +SVAIEASG DFQFYSGGVF G CG++LDHGVAAVGYG +
Sbjct: 242 YHDVPQNNEQSLLKALANQSLSVAIEASGRDFQFYSGGVFDGHCGSDLDHGVAAVGYGTA 301
Query: 304 KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPL 348
KG DYIIVKNSWG KWGE+GYIRM R T + G +MAS PL
Sbjct: 302 KGVDYIIVKNSWGSKWGEKGYIRM-RGTLETRGNLRYLQMASYPL 345
>gi|22759715|dbj|BAC10906.1| cysteine proteinase [Zinnia elegans]
Length = 352
Score = 536 bits (1382), Expect = e-150, Method: Compositional matrix adjust.
Identities = 254/352 (72%), Positives = 297/352 (84%), Gaps = 3/352 (0%)
Query: 1 MAF-FSHSKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTY 59
MAF FS K LL L +S+ ACS+LAH+FSI+GY+PE LTS+ K+I LFESW+ KH K Y
Sbjct: 1 MAFIFSSKKTSLLFLFVSILACSALAHEFSILGYAPEDLTSIHKVIHLFESWLVKHSKFY 60
Query: 60 KCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRR 119
+ ++EKLHRFEIF +NLKHID+ NK+V++YWLGLNEFAD++HEEFK+K+LG K + R+
Sbjct: 61 ESLDEKLHRFEIFMDNLKHIDETNKKVSNYWLGLNEFADLTHEEFKHKFLGFKGELAERK 120
Query: 120 -QPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLT 178
+ S EF YRD LPKSVDWRKKGAV PVKNQG CGSCWAFSTVAAVEGINQIV+GNLT
Sbjct: 121 DESSKEFGYRDFVDLPKSVDWRKKGAVAPVKNQGQCGSCWAFSTVAAVEGINQIVTGNLT 180
Query: 179 SLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEV 238
LSEQELIDCDT+FNNGCNGGLMDYAF Y++ SG LHKEE+YPY+M EGTC++KK+ E
Sbjct: 181 MLSEQELIDCDTTFNNGCNGGLMDYAFAYVMRSG-LHKEEEYPYIMSEGTCDEKKDVSEK 239
Query: 239 VTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAV 298
VTISGY DVP NDE S LKALA+QP+SVAIEASG DFQFYSGGVF G CG ELDHGVAAV
Sbjct: 240 VTISGYHDVPRNDEASFLKALANQPISVAIEASGRDFQFYSGGVFDGHCGTELDHGVAAV 299
Query: 299 GYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLKK 350
GYG +KG DY+IV+NSWGPKWGE+GYIRMKR +GKP G+CG+ MAS P K+
Sbjct: 300 GYGTTKGLDYVIVRNSWGPKWGEKGYIRMKRGSGKPHGMCGLYMMASYPTKQ 351
>gi|18394919|ref|NP_564126.1| Xylem cysteine proteinase 2 [Arabidopsis thaliana]
gi|71153409|sp|Q9LM66.2|XCP2_ARATH RecName: Full=Xylem cysteine proteinase 2; Short=AtXCP2; Flags:
Precursor
gi|4836904|gb|AAD30607.1|AC007369_17 Putative cysteine proteinase [Arabidopsis thaliana]
gi|6708183|gb|AAF25832.1|AF191028_1 papain-type cysteine endopeptidase XCP2 [Arabidopsis thaliana]
gi|28466959|gb|AAO44088.1| At1g20850 [Arabidopsis thaliana]
gi|110743795|dbj|BAE99733.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|332191910|gb|AEE30031.1| Xylem cysteine proteinase 2 [Arabidopsis thaliana]
Length = 356
Score = 535 bits (1378), Expect = e-149, Method: Compositional matrix adjust.
Identities = 253/326 (77%), Positives = 285/326 (87%), Gaps = 2/326 (0%)
Query: 26 HDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKE 85
HD+SIVGYSPE L S DKLIELFE+W+S K Y+ +EEK RFE+FK+NLKHID+ NK+
Sbjct: 29 HDYSIVGYSPEDLESHDKLIELFENWISNFEKAYETVEEKFLRFEVFKDNLKHIDETNKK 88
Query: 86 VTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPS--AEFSYRDVKALPKSVDWRKKG 143
SYWLGLNEFAD+SHEEFK YLGLK R + AEF+YRDV+A+PKSVDWRKKG
Sbjct: 89 GKSYWLGLNEFADLSHEEFKKMYLGLKTDIVRRDEERSYAEFAYRDVEAVPKSVDWRKKG 148
Query: 144 AVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDY 203
AV VKNQGSCGSCWAFSTVAAVEGIN+IV+GNLT+LSEQELIDCDT++NNGCNGGLMDY
Sbjct: 149 AVAEVKNQGSCGSCWAFSTVAAVEGINKIVTGNLTTLSEQELIDCDTTYNNGCNGGLMDY 208
Query: 204 AFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQP 263
AF+YIV +GGL KEEDYPY MEEGTCE +K+E E VTI+G+QDVP NDE+SLLKALAHQP
Sbjct: 209 AFEYIVKNGGLRKEEDYPYSMEEGTCEMQKDESETVTINGHQDVPTNDEKSLLKALAHQP 268
Query: 264 VSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERG 323
+SVAI+ASG +FQFYSGGVF G CG +LDHGVAAVGYG SKGSDYIIVKNSWGPKWGE+G
Sbjct: 269 LSVAIDASGREFQFYSGGVFDGRCGVDLDHGVAAVGYGSSKGSDYIIVKNSWGPKWGEKG 328
Query: 324 YIRMKRNTGKPEGLCGINKMASIPLK 349
YIR+KRNTGKPEGLCGINKMAS P K
Sbjct: 329 YIRLKRNTGKPEGLCGINKMASFPTK 354
>gi|121308860|dbj|BAF43527.1| cysteine proteinase [Zinnia elegans]
Length = 352
Score = 533 bits (1374), Expect = e-149, Method: Compositional matrix adjust.
Identities = 253/352 (71%), Positives = 296/352 (84%), Gaps = 3/352 (0%)
Query: 1 MAF-FSHSKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTY 59
MAF FS K LL L +S+ ACS LAH+FSI+GY+PE LTS+ K+I LFESW+ KH K Y
Sbjct: 1 MAFIFSSKKTSLLFLFVSILACSPLAHEFSILGYAPEDLTSIHKVIHLFESWLVKHSKFY 60
Query: 60 KCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRR 119
+ ++EKLHRFEIF +NLKHID+ NK+V++YWLGLNEFAD++HEEFK+K+LG K + R+
Sbjct: 61 ESLDEKLHRFEIFMDNLKHIDETNKKVSNYWLGLNEFADLTHEEFKHKFLGFKGELAERK 120
Query: 120 -QPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLT 178
+ S EF YRD LPKSVDWRKKGAV PVKNQG CG+CWAFSTVAAVEGINQIV+GNLT
Sbjct: 121 DESSKEFGYRDFVDLPKSVDWRKKGAVAPVKNQGQCGNCWAFSTVAAVEGINQIVTGNLT 180
Query: 179 SLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEV 238
LSEQELIDCDT+FNNGCNGGLMDYAF Y++ SG LHKEE+YPY+M EGTC++KK+ E
Sbjct: 181 MLSEQELIDCDTTFNNGCNGGLMDYAFAYVMRSG-LHKEEEYPYIMSEGTCDEKKDVSEK 239
Query: 239 VTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAV 298
VTISGY DVP NDE S LKALA+QP+SVAIEASG DFQFYSGGVF G CG ELDHGVAAV
Sbjct: 240 VTISGYHDVPRNDEASFLKALANQPISVAIEASGRDFQFYSGGVFDGHCGTELDHGVAAV 299
Query: 299 GYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLKK 350
GYG +KG DY+IV+NSWGPKWGE+GYIRMKR +GKP G+CG+ MAS P K+
Sbjct: 300 GYGTTKGLDYVIVRNSWGPKWGEKGYIRMKRGSGKPHGMCGLYMMASYPTKQ 351
>gi|297744465|emb|CBI37727.3| unnamed protein product [Vitis vinifera]
Length = 331
Score = 533 bits (1372), Expect = e-149, Method: Compositional matrix adjust.
Identities = 252/324 (77%), Positives = 281/324 (86%), Gaps = 21/324 (6%)
Query: 26 HDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKE 85
DFSIVGYSPE LT +DKLI FESW+SKHGK YK +EEKLHRFE+F+ENL HID+RNKE
Sbjct: 27 RDFSIVGYSPEDLTCIDKLIARFESWVSKHGKVYKSMEEKLHRFEVFRENLNHIDERNKE 86
Query: 86 VTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAV 145
V+SYWLGLNEFAD+SHEEFK+K DV LP+SVDWRKKGAV
Sbjct: 87 VSSYWLGLNEFADLSHEEFKSK---------------------DVADLPESVDWRKKGAV 125
Query: 146 TPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAF 205
T VKNQG+CGSCWAFSTVAAVEGINQIV+GNLT+LSEQELIDCDT+FN+GCNGGLMDYAF
Sbjct: 126 THVKNQGACGSCWAFSTVAAVEGINQIVTGNLTTLSEQELIDCDTTFNSGCNGGLMDYAF 185
Query: 206 KYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVS 265
+I ++GGLHKE+DYPYLMEEGTCE++KE++++VTISGY+DVPE DE+SLLKALAHQP+S
Sbjct: 186 AFIASNGGLHKEDDYPYLMEEGTCEEQKEDVDIVTISGYEDVPEKDEESLLKALAHQPLS 245
Query: 266 VAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYI 325
VAIEASG DFQFYSGGVF GPCG ELDHGVAAVGYG SKG DYIIVKNSWGPKWGE+GYI
Sbjct: 246 VAIEASGRDFQFYSGGVFNGPCGTELDHGVAAVGYGSSKGLDYIIVKNSWGPKWGEKGYI 305
Query: 326 RMKRNTGKPEGLCGINKMASIPLK 349
RMKRNTGK EGLCGINKMAS P K
Sbjct: 306 RMKRNTGKTEGLCGINKMASYPTK 329
>gi|357130141|ref|XP_003566711.1| PREDICTED: xylem cysteine proteinase 1-like [Brachypodium
distachyon]
Length = 457
Score = 513 bits (1322), Expect = e-143, Method: Compositional matrix adjust.
Identities = 249/335 (74%), Positives = 274/335 (81%), Gaps = 6/335 (1%)
Query: 20 ACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHI 79
AC + DFSIVGYS E L+S D++IELFE W++KH K Y EEKLHRFE+FK+NLKHI
Sbjct: 122 ACVARNSDFSIVGYSEEDLSSNDRIIELFEKWLAKHQKAYASFEEKLHRFEVFKDNLKHI 181
Query: 80 DQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKA--LPKSV 137
D+ N+EVTSYWLGLNEFAD++HEEFK YLGL P P R + F Y DV A LPKSV
Sbjct: 182 DKVNREVTSYWLGLNEFADLTHEEFKATYLGLAPPAPAR-ESRGSFKYEDVSADDLPKSV 240
Query: 138 DWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCN 197
DWR KGAVT VKNQG CGSCWAFSTVAAVEGIN IV+GNLT+LSEQELIDC NNGCN
Sbjct: 241 DWRTKGAVTEVKNQGQCGSCWAFSTVAAVEGINAIVTGNLTALSEQELIDCSVDGNNGCN 300
Query: 198 GGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCED-KKEEMEVVTISGYQDVPENDEQSLL 256
GGLMDYAF YI +SGGLH EE YPYLMEEG+C D KK E E VTISGY+DVP ++EQ+L+
Sbjct: 301 GGLMDYAFSYIASSGGLHTEEAYPYLMEEGSCGDGKKSESEAVTISGYEDVPAHNEQALI 360
Query: 257 KALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYG--KSKGSDYIIVKNS 314
KALAHQPVSVAIEASG FQFYSGGVF GPCG +LDHGVAAVGYG K KG DYIIV+NS
Sbjct: 361 KALAHQPVSVAIEASGRHFQFYSGGVFDGPCGTQLDHGVAAVGYGSDKGKGHDYIIVRNS 420
Query: 315 WGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
WG KWGE+GYIRMKR TGK EGLCGINKMAS P K
Sbjct: 421 WGAKWGEKGYIRMKRGTGKGEGLCGINKMASYPTK 455
>gi|194352750|emb|CAQ00103.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
gi|326514262|dbj|BAJ92281.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326519402|dbj|BAJ96700.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326524351|dbj|BAK00559.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326531998|dbj|BAK01375.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 356
Score = 512 bits (1318), Expect = e-142, Method: Compositional matrix adjust.
Identities = 250/341 (73%), Positives = 277/341 (81%), Gaps = 6/341 (1%)
Query: 14 LSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFK 73
L L + AC + DFSIVGYS E L+S ++L+ELFE W++KH K Y EEKLHRFE+FK
Sbjct: 15 LLLCVGACVARNSDFSIVGYSEEDLSSNERLVELFEKWLAKHQKAYASFEEKLHRFEVFK 74
Query: 74 ENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKA- 132
+NLKHID+ N+EVTSYWLGLNEFAD++H+EFK YLGL P RR S F Y DV A
Sbjct: 75 DNLKHIDKINREVTSYWLGLNEFADLTHDEFKAAYLGLDAA-PARRGSSRSFRYEDVSAS 133
Query: 133 -LPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTS 191
LPKSVDWRKKGAVT VKNQG CGSCWAFSTVAAVEGIN IV+GNLT+LSEQELIDC
Sbjct: 134 DLPKSVDWRKKGAVTEVKNQGQCGSCWAFSTVAAVEGINAIVTGNLTALSEQELIDCSVD 193
Query: 192 FNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCED-KKEEMEVVTISGYQDVPEN 250
N+GCNGGLMDYAF YI +SGGLH EE YPYLMEEG+C D KK E E VTISGY+DVP N
Sbjct: 194 GNSGCNGGLMDYAFSYIASSGGLHTEEAYPYLMEEGSCGDGKKAESEAVTISGYEDVPAN 253
Query: 251 DEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYG--KSKGSDY 308
DEQ+L+KALAHQPVSVAIEASG FQFYSGGVF GPCGA+LDHGVAAVGYG K KG DY
Sbjct: 254 DEQALIKALAHQPVSVAIEASGRHFQFYSGGVFDGPCGAQLDHGVAAVGYGSDKGKGHDY 313
Query: 309 IIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
IIV+NSWG +WGE+GYIRMKR T EGLCGINKMAS P K
Sbjct: 314 IIVRNSWGAQWGEKGYIRMKRGTSNGEGLCGINKMASYPTK 354
>gi|297745594|emb|CBI40759.3| unnamed protein product [Vitis vinifera]
Length = 300
Score = 502 bits (1293), Expect = e-139, Method: Compositional matrix adjust.
Identities = 234/298 (78%), Positives = 262/298 (87%)
Query: 52 MSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGL 111
MSKHGK+Y+ EEKLHRFE+F++NLKHID+ NK+V+SYWLGLNEFAD+SHEEFK KYLGL
Sbjct: 1 MSKHGKSYRSFEEKLHRFEVFQDNLKHIDETNKKVSSYWLGLNEFADLSHEEFKRKYLGL 60
Query: 112 KPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQ 171
K + P RR EFSY+DV LPKSVDWRKKGAV VKNQG+CGSCWAFSTVAAVEGINQ
Sbjct: 61 KIELPKRRDSPEEFSYKDVADLPKSVDWRKKGAVAHVKNQGACGSCWAFSTVAAVEGINQ 120
Query: 172 IVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCED 231
IV+GNLT+LSEQELIDCD FNNGCNGGLMDYAF +I+++GGL KEEDYPY+MEEGTC +
Sbjct: 121 IVTGNLTALSEQELIDCDKPFNNGCNGGLMDYAFAFIISNGGLRKEEDYPYVMEEGTCGE 180
Query: 232 KKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAEL 291
KKEE+EVVTISGY DVPE++EQS LKALA+QP+SVAIEAS FQFYSGG+F G CG EL
Sbjct: 181 KKEELEVVTISGYHDVPEDNEQSFLKALANQPLSVAIEASSRGFQFYSGGIFNGHCGTEL 240
Query: 292 DHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
DHGVAAVGYG SKG DYI VKNSWG KWGE+GYIRMKRN GKPEG+CGI KMAS P K
Sbjct: 241 DHGVAAVGYGTSKGVDYITVKNSWGSKWGEKGYIRMKRNVGKPEGICGIYKMASYPTK 298
>gi|194352752|emb|CAQ00104.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
Length = 351
Score = 501 bits (1291), Expect = e-139, Method: Compositional matrix adjust.
Identities = 253/348 (72%), Positives = 282/348 (81%), Gaps = 6/348 (1%)
Query: 7 SKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKL 66
SKL + L L + AC + DFSIVGYS E L+S D+L+ELFE W++KH K Y EEKL
Sbjct: 3 SKLSVAVLLLCVGACVARNSDFSIVGYSEEDLSSHDRLVELFEKWLAKHQKAYASFEEKL 62
Query: 67 HRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFS 126
HRFE+FK+NLK ID+ N+EVTSYWLGLNEFAD++H+EFK YLGL P P RR S F
Sbjct: 63 HRFEVFKDNLKLIDEINREVTSYWLGLNEFADLTHDEFKTTYLGLSPP-PARRSSSRSFR 121
Query: 127 YRDVKA--LPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQE 184
Y +V A LPK+VDWRKKGAVT VKNQG CGSCWAFSTVAAVEGIN IV+GNLT+LSEQE
Sbjct: 122 YENVAAHDLPKAVDWRKKGAVTDVKNQGQCGSCWAFSTVAAVEGINAIVTGNLTALSEQE 181
Query: 185 LIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCED-KKEEMEVVTISG 243
LIDC N+GCNGG+MDYAF YI +SGGLH EE YPYLMEEG+C D KK E E V+ISG
Sbjct: 182 LIDCSVDGNSGCNGGMMDYAFSYIASSGGLHTEEAYPYLMEEGSCGDGKKSESEAVSISG 241
Query: 244 YQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYG-- 301
Y+DVP DEQ+L+KALAHQPVSVAIEASG FQFYSGGVF GPCGA+LDHGVAAVGYG
Sbjct: 242 YEDVPTKDEQALIKALAHQPVSVAIEASGRHFQFYSGGVFDGPCGAQLDHGVAAVGYGSD 301
Query: 302 KSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
K KG DYIIVKNSWG KWGE+GYIRMKR TGK EGLCGINKMAS P K
Sbjct: 302 KGKGHDYIIVKNSWGGKWGEKGYIRMKRGTGKSEGLCGINKMASYPTK 349
>gi|641905|gb|AAC49406.1| cysteine proteinase [Zinnia violacea]
Length = 342
Score = 499 bits (1286), Expect = e-139, Method: Compositional matrix adjust.
Identities = 240/338 (71%), Positives = 276/338 (81%), Gaps = 3/338 (0%)
Query: 1 MAFFSHSKLLLLSLSLSL-FACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTY 59
MAF SK L + + F +H+FSI+GY+PE LTS+ K+I LFES + KH K Y
Sbjct: 1 MAFIFSSKKTSAFLCICIGFGMFGFSHEFSILGYAPEDLTSIHKVIHLFESSLVKHSKIY 60
Query: 60 KCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRR 119
+ +EKLHRFEIF +NLKHID+ NK+V++YWLGLNEFAD++HEEFKNK+LG K + R+
Sbjct: 61 ESFDEKLHRFEIFMDNLKHIDETNKKVSNYWLGLNEFADLTHEEFKNKFLGFKGELAERK 120
Query: 120 QPSAE-FSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLT 178
S E F YRD LPKSVDWRKKGAV+PVKNQG CGSCWAFSTVAAVEGINQIV+GNLT
Sbjct: 121 DESIEQFRYRDFVDLPKSVDWRKKGAVSPVKNQGQCGSCWAFSTVAAVEGINQIVTGNLT 180
Query: 179 SLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEV 238
LSEQELIDCDT+FNNGCNGGLMDYAF Y V GLHKEE+YPY+M EGTC++K++ E
Sbjct: 181 VLSEQELIDCDTTFNNGCNGGLMDYAFAY-VTRNGLHKEEEYPYIMSEGTCDEKRDASEK 239
Query: 239 VTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAV 298
VTISGY DVP N+E S LKALA+QP+SVAIEASG DFQFYSGGVF G CG ELDHGVAAV
Sbjct: 240 VTISGYHDVPRNNEDSFLKALANQPISVAIEASGRDFQFYSGGVFDGHCGTELDHGVAAV 299
Query: 299 GYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEG 336
GYG SKG DY+IV+NSWGPKWGE+GYIRMKRNTGKP G
Sbjct: 300 GYGTSKGLDYVIVRNSWGPKWGEKGYIRMKRNTGKPMG 337
>gi|242086591|ref|XP_002439128.1| hypothetical protein SORBIDRAFT_09g000960 [Sorghum bicolor]
gi|241944413|gb|EES17558.1| hypothetical protein SORBIDRAFT_09g000960 [Sorghum bicolor]
Length = 371
Score = 496 bits (1277), Expect = e-138, Method: Compositional matrix adjust.
Identities = 238/325 (73%), Positives = 268/325 (82%), Gaps = 4/325 (1%)
Query: 28 FSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT 87
FSIVGYSPE L D+LI+LFE W++K+ K Y EEKLHRFE+FK+NL HID+ NK+VT
Sbjct: 46 FSIVGYSPEDLVHHDRLIKLFEEWVAKYRKAYASFEEKLHRFEVFKDNLHHIDEANKKVT 105
Query: 88 SYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDV--KALPKSVDWRKKGAV 145
+YWLGLN FAD++H+EFK YLGL+ Q T++ + F Y V +P SVDWRKKGAV
Sbjct: 106 TYWLGLNAFADLTHDEFKATYLGLR-QPETKKTTDSRFRYGGVADDDVPASVDWRKKGAV 164
Query: 146 TPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAF 205
T VKNQG CGSCWAFSTVAAVEGINQIV+GNLTSLSEQEL+DC T NNGCNGG+MD AF
Sbjct: 165 TDVKNQGQCGSCWAFSTVAAVEGINQIVTGNLTSLSEQELVDCSTDGNNGCNGGVMDNAF 224
Query: 206 KYIVASGGLHKEEDYPYLMEEGTCEDKKEEME-VVTISGYQDVPENDEQSLLKALAHQPV 264
YI +SGGL EE YPYLMEEG C+DK + E VVTISGY+DVP NDEQ+L+KALAHQP+
Sbjct: 225 SYIASSGGLRTEEAYPYLMEEGDCDDKARDGEQVVTISGYEDVPANDEQALVKALAHQPL 284
Query: 265 SVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGY 324
SVAIEASG FQFYSGGVF GPCG+ELDHGVAAVGYG SKG DYIIVKNSWG WGE+GY
Sbjct: 285 SVAIEASGRHFQFYSGGVFNGPCGSELDHGVAAVGYGSSKGQDYIIVKNSWGSHWGEKGY 344
Query: 325 IRMKRNTGKPEGLCGINKMASIPLK 349
IRMKR TGKPEGLCGINKMAS P K
Sbjct: 345 IRMKRGTGKPEGLCGINKMASYPTK 369
>gi|297598407|ref|NP_001045533.2| Os01g0971400 [Oryza sativa Japonica Group]
gi|15289977|dbj|BAB63672.1| putative cysteine protease CP1 [Oryza sativa Japonica Group]
gi|125529282|gb|EAY77396.1| hypothetical protein OsI_05384 [Oryza sativa Indica Group]
gi|125573472|gb|EAZ14987.1| hypothetical protein OsJ_04922 [Oryza sativa Japonica Group]
gi|215740756|dbj|BAG97412.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215741010|dbj|BAG97505.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215765325|dbj|BAG87022.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215767338|dbj|BAG99566.1| unnamed protein product [Oryza sativa Japonica Group]
gi|255674119|dbj|BAF07447.2| Os01g0971400 [Oryza sativa Japonica Group]
Length = 365
Score = 483 bits (1243), Expect = e-134, Method: Compositional matrix adjust.
Identities = 238/343 (69%), Positives = 271/343 (79%), Gaps = 14/343 (4%)
Query: 20 ACSSLA--HDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLK 77
AC ++A + SIVGYS E L S ++L+ELFE +M+K+ K Y +EEKL RFE+FK+NL
Sbjct: 22 ACVAVAMPSELSIVGYSEEDLASHERLMELFEKFMAKYRKAYSSLEEKLRRFEVFKDNLN 81
Query: 78 HIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAE-FSYRDVKA--LP 134
HID+ NK++T YWLGLNEFAD++H+EFK YLGL P RR + + F Y +V+A LP
Sbjct: 82 HIDEENKKITGYWLGLNEFADLTHDEFKAAYLGLTLT-PARRNSNDQLFRYEEVEAASLP 140
Query: 135 KSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNN 194
K VDWRKKGAVT VKNQG CGSCWAFSTVAAVEGIN IV+GNLT LSEQELIDCDT NN
Sbjct: 141 KEVDWRKKGAVTEVKNQGQCGSCWAFSTVAAVEGINAIVTGNLTRLSEQELIDCDTDGNN 200
Query: 195 GCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTC-------EDKKEEMEVVTISGYQDV 247
GC+GGLMDYAF YI A+GGLH EE YPYLMEEGTC +D E VTISGY+DV
Sbjct: 201 GCSGGLMDYAFSYIAANGGLHTEESYPYLMEEGTCRRGSTEGDDDGEAAAAVTISGYEDV 260
Query: 248 PENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGK-SKGS 306
P N+EQ+LLKALAHQPVSVAIEASG +FQFYSGGVF GPCG LDHGV AVGYG SKG
Sbjct: 261 PRNNEQALLKALAHQPVSVAIEASGRNFQFYSGGVFDGPCGTRLDHGVTAVGYGTASKGH 320
Query: 307 DYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
DYIIVKNSWG WGE+GYIRM+R TGK +GLCGINKMAS P K
Sbjct: 321 DYIIVKNSWGSHWGEKGYIRMRRGTGKHDGLCGINKMASYPTK 363
>gi|115461667|ref|NP_001054433.1| Os05g0108600 [Oryza sativa Japonica Group]
gi|14719319|gb|AAK73137.1|AC079022_10 putative cysteine proteinase [Oryza sativa]
gi|33151125|gb|AAP97431.1| cysteine protease CP1 [Oryza sativa]
gi|52353572|gb|AAU44138.1| cysteine proteinase CP1 [Oryza sativa Japonica Group]
gi|113577984|dbj|BAF16347.1| Os05g0108600 [Oryza sativa Japonica Group]
gi|125550541|gb|EAY96250.1| hypothetical protein OsI_18148 [Oryza sativa Indica Group]
Length = 358
Score = 481 bits (1239), Expect = e-133, Method: Compositional matrix adjust.
Identities = 237/329 (72%), Positives = 262/329 (79%), Gaps = 8/329 (2%)
Query: 27 DFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEV 86
+FSIVGYS E L S D+LIELFE W++K+ K Y EEK+ RFE+FK+NL HID NK+V
Sbjct: 30 EFSIVGYSEEDLASHDRLIELFEKWVAKYRKAYASFEEKVRRFEVFKDNLNHIDDINKKV 89
Query: 87 TSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQP----SAEFSYRDVK--ALPKSVDWR 140
TSYWLGLNEFAD++H+EFK YLGL P PTR S EF Y + +PK +DWR
Sbjct: 90 TSYWLGLNEFADLTHDEFKATYLGLTPP-PTRSNSKHYSSEEFRYGKMSNGEVPKEMDWR 148
Query: 141 KKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGL 200
KK AVT VKNQG CGSCWAFSTVAAVEGIN IV+GNLTSLSEQELIDC T NNGCNGGL
Sbjct: 149 KKNAVTEVKNQGQCGSCWAFSTVAAVEGINAIVTGNLTSLSEQELIDCSTDGNNGCNGGL 208
Query: 201 MDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALA 260
MDYAF YI ++GGL EE YPY MEEG C++ K VVTISGY+DVP NDEQ+L+KALA
Sbjct: 209 MDYAFSYIASTGGLRTEEAYPYAMEEGDCDEGKGAA-VVTISGYEDVPANDEQALVKALA 267
Query: 261 HQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWG 320
HQPVSVAIEASG FQFYSGGVF GPCG +LDHGV AVGYG SKG DYIIVKNSWGP WG
Sbjct: 268 HQPVSVAIEASGRHFQFYSGGVFDGPCGEQLDHGVTAVGYGTSKGQDYIIVKNSWGPHWG 327
Query: 321 ERGYIRMKRNTGKPEGLCGINKMASIPLK 349
E+GYIRMKR TGK EGLCGINKMAS P K
Sbjct: 328 EKGYIRMKRGTGKGEGLCGINKMASYPTK 356
>gi|226533314|ref|NP_001150119.1| xylem cysteine proteinase 2 [Zea mays]
gi|195636886|gb|ACG37911.1| xylem cysteine proteinase 2 precursor [Zea mays]
gi|223946183|gb|ACN27175.1| unknown [Zea mays]
gi|413951209|gb|AFW83858.1| Xylem cysteine proteinase 2 [Zea mays]
Length = 385
Score = 481 bits (1239), Expect = e-133, Method: Compositional matrix adjust.
Identities = 239/366 (65%), Positives = 275/366 (75%), Gaps = 30/366 (8%)
Query: 14 LSLSLFA---CSSLAH---DFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLH 67
LS+SL A C +LA DFSIVGYS E L+S + L ELFE W+S+H + Y +EEKL
Sbjct: 19 LSVSLLAGSSCLALARPSGDFSIVGYSEEDLSSHESLAELFERWLSRHRRAYASLEEKLR 78
Query: 68 RFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSY 127
RF++FK+NL HID+ N++V+SYWLGLNEFAD++H+EFK YLGL+ +
Sbjct: 79 RFQVFKDNLHHIDETNRKVSSYWLGLNEFADLTHDEFKATYLGLRSSVGDGGSGIDDDDE 138
Query: 128 RDV---------KALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLT 178
+ +LPKSVDWR KGAVT VKNQG CGSCWAFSTVAAVEGINQIV+GNLT
Sbjct: 139 PEEEEGYEGVDGASLPKSVDWRSKGAVTGVKNQGQCGSCWAFSTVAAVEGINQIVTGNLT 198
Query: 179 SLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTC--------- 229
+LSEQELIDCDT NNGCNGGLMDYAF YI +GGLH EE YPYLMEEGTC
Sbjct: 199 ALSEQELIDCDTDGNNGCNGGLMDYAFSYIAHNGGLHTEEAYPYLMEEGTCQRSSSSEKK 258
Query: 230 -----EDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFT 284
ED ++ VVTISGY+DVP N+EQ+LLKALA QPVSVAIEASG +FQFYSGGVF
Sbjct: 259 WPGSSEDANDDAAVVTISGYEDVPRNNEQALLKALAQQPVSVAIEASGRNFQFYSGGVFD 318
Query: 285 GPCGAELDHGVAAVGYGK-SKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKM 343
GPCG +LDHGVAAVGYG +KG DYIIVKNSWGP WGE+GYIRM+R TGK +GLCGINKM
Sbjct: 319 GPCGTQLDHGVAAVGYGTAAKGHDYIIVKNSWGPSWGEKGYIRMRRGTGKRQGLCGINKM 378
Query: 344 ASIPLK 349
AS P K
Sbjct: 379 ASYPTK 384
>gi|226503129|ref|NP_001149806.1| LOC100283433 precursor [Zea mays]
gi|195634783|gb|ACG36860.1| xylem cysteine proteinase 2 precursor [Zea mays]
gi|219884977|gb|ACL52863.1| unknown [Zea mays]
Length = 377
Score = 478 bits (1230), Expect = e-132, Method: Compositional matrix adjust.
Identities = 234/328 (71%), Positives = 260/328 (79%), Gaps = 10/328 (3%)
Query: 28 FSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRN-KEV 86
FSIVGYSPE LT D+L+ LFE W++K+ K Y EEKL RFE+FK+NL HID+ N KEV
Sbjct: 52 FSIVGYSPEDLTQHDRLVRLFEEWVAKYRKAYGSFEEKLRRFEVFKDNLHHIDEANRKEV 111
Query: 87 TSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKAL----PKSVDWRKK 142
TSYWLGLN FAD++H+EFK YLGL P +R F Y V P SVDWRKK
Sbjct: 112 TSYWLGLNAFADLTHDEFKATYLGLLP----KRTSGGRFRYGGVGDGGDEVPASVDWRKK 167
Query: 143 GAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMD 202
GAVT VKNQG CGSCWAFSTVAAVEGINQIV+GNLTSLSEQ+L+DC T NNGC+GG+MD
Sbjct: 168 GAVTEVKNQGQCGSCWAFSTVAAVEGINQIVTGNLTSLSEQQLVDCSTDGNNGCSGGVMD 227
Query: 203 YAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEV-VTISGYQDVPENDEQSLLKALAH 261
AF +I GL EE YPYLMEEG C+D+ + EV VTISGY+DVP NDEQ+L+KALAH
Sbjct: 228 NAFSFIATGAGLRSEEAYPYLMEEGDCDDRARDGEVLVTISGYEDVPANDEQALVKALAH 287
Query: 262 QPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGE 321
QPVSVAIEASG FQFYSGGVF GPCG+ELDHGVAAVGYG SKG DYIIVKNSWG WGE
Sbjct: 288 QPVSVAIEASGRHFQFYSGGVFDGPCGSELDHGVAAVGYGSSKGQDYIIVKNSWGTHWGE 347
Query: 322 RGYIRMKRNTGKPEGLCGINKMASIPLK 349
+GYIRMKR TGKPEGLCGINKMAS P K
Sbjct: 348 KGYIRMKRGTGKPEGLCGINKMASYPTK 375
>gi|413942348|gb|AFW74997.1| Xylem cysteine proteinase 2 [Zea mays]
Length = 391
Score = 477 bits (1228), Expect = e-132, Method: Compositional matrix adjust.
Identities = 234/328 (71%), Positives = 260/328 (79%), Gaps = 10/328 (3%)
Query: 28 FSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRN-KEV 86
FSIVGYSPE LT D+L+ LFE W++K+ K Y EEKL RFE+FK+NL HID+ N KEV
Sbjct: 66 FSIVGYSPEDLTQHDRLVRLFEEWVAKYRKAYGSFEEKLRRFEVFKDNLHHIDEANRKEV 125
Query: 87 TSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKAL----PKSVDWRKK 142
TSYWLGLN FAD++H+EFK YLGL P +R F Y V P SVDWRKK
Sbjct: 126 TSYWLGLNAFADLTHDEFKATYLGLLP----KRTSGGRFRYGGVGDGGDEVPASVDWRKK 181
Query: 143 GAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMD 202
GAVT VKNQG CGSCWAFSTVAAVEGINQIV+GNLTSLSEQ+L+DC T NNGC+GG+MD
Sbjct: 182 GAVTEVKNQGQCGSCWAFSTVAAVEGINQIVTGNLTSLSEQQLVDCSTDGNNGCSGGVMD 241
Query: 203 YAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEV-VTISGYQDVPENDEQSLLKALAH 261
AF +I GL EE YPYLMEEG C+D+ + EV VTISGY+DVP NDEQ+L+KALAH
Sbjct: 242 NAFSFIATGAGLRSEEAYPYLMEEGDCDDRARDGEVLVTISGYEDVPANDEQALVKALAH 301
Query: 262 QPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGE 321
QPVSVAIEASG FQFYSGGVF GPCG+ELDHGVAAVGYG SKG DYIIVKNSWG WGE
Sbjct: 302 QPVSVAIEASGRHFQFYSGGVFDGPCGSELDHGVAAVGYGSSKGQDYIIVKNSWGTHWGE 361
Query: 322 RGYIRMKRNTGKPEGLCGINKMASIPLK 349
+GYIRMKR TGKPEGLCGINKMAS P K
Sbjct: 362 KGYIRMKRGTGKPEGLCGINKMASYPTK 389
>gi|242055753|ref|XP_002457022.1| hypothetical protein SORBIDRAFT_03g047290 [Sorghum bicolor]
gi|241928997|gb|EES02142.1| hypothetical protein SORBIDRAFT_03g047290 [Sorghum bicolor]
Length = 378
Score = 457 bits (1176), Expect = e-126, Method: Compositional matrix adjust.
Identities = 239/374 (63%), Positives = 274/374 (73%), Gaps = 34/374 (9%)
Query: 9 LLLLSLSLSLFACSSLA---HDFSIVGYSPEHLTSMDKLIELFESWMSKHGK-TYKCIEE 64
+++L + L L +C L DFSIVGYS E L+S + L ELFE W+S+H K Y +EE
Sbjct: 7 VVVLCIGL-LSSCVGLGLARGDFSIVGYSEEDLSSHESLAELFERWLSRHRKGAYASLEE 65
Query: 65 KLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTR------ 118
KL RFE+FK+NL HID+ N++V+SYWLGLNEFAD++H+EFK YLGL P
Sbjct: 66 KLRRFEVFKDNLHHIDETNRKVSSYWLGLNEFADLTHDEFKATYLGLSPSGGGGDVVHMH 125
Query: 119 ------------RQPSAEFSYR----DVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFST 162
S+ F +R D LPKSVDWR KGAVT VKNQG CGSCWAFST
Sbjct: 126 HDDDDEEPEEEGSSSSSSFRFRYEGVDAARLPKSVDWRSKGAVTGVKNQGQCGSCWAFST 185
Query: 163 VAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPY 222
VAAVEGINQIV+GNLT+LSEQEL+DCDT NNGCNGGLMDYAF YI +GGLH EE YPY
Sbjct: 186 VAAVEGINQIVTGNLTALSEQELVDCDTDGNNGCNGGLMDYAFSYIAHNGGLHTEEAYPY 245
Query: 223 LMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGV 282
LMEEGTC + VVTISGY+DVP N+EQ+LLKALAHQPVSVAIEASG + QFYSGGV
Sbjct: 246 LMEEGTCS-RGSSAAVVTISGYEDVPRNNEQALLKALAHQPVSVAIEASGRNLQFYSGGV 304
Query: 283 FTGPCGAELDHGVAAVGY---GKSKG---SDYIIVKNSWGPKWGERGYIRMKRNTGKPEG 336
F GPCG +LDHGVAAVGY GK G +DYIIVKNSWGP WGE+GYIRM+R TGK +G
Sbjct: 305 FDGPCGTQLDHGVAAVGYGTAGKDNGHVVADYIIVKNSWGPSWGEKGYIRMRRGTGKRQG 364
Query: 337 LCGINKMASIPLKK 350
LCGINKM S P K
Sbjct: 365 LCGINKMPSYPTKN 378
>gi|42573181|ref|NP_974687.1| Xylem cysteine proteinase 1 [Arabidopsis thaliana]
gi|332661102|gb|AEE86502.1| Xylem cysteine proteinase 1 [Arabidopsis thaliana]
Length = 288
Score = 457 bits (1176), Expect = e-126, Method: Compositional matrix adjust.
Identities = 206/272 (75%), Positives = 243/272 (89%), Gaps = 1/272 (0%)
Query: 10 LLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRF 69
LL+++S S C + A DFSIVGY+PEHLT+ DKL+ELFESWMS+H K YK +EEK+HRF
Sbjct: 13 LLVAISASALLCCAFARDFSIVGYTPEHLTNTDKLLELFESWMSEHSKAYKSVEEKVHRF 72
Query: 70 EIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGL-KPQFPTRRQPSAEFSYR 128
E+F+ENL HIDQRN E+ SYWLGLNEFAD++HEEFK +YLGL KPQF +RQPSA F YR
Sbjct: 73 EVFRENLMHIDQRNNEINSYWLGLNEFADLTHEEFKGRYLGLAKPQFSRKRQPSANFRYR 132
Query: 129 DVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDC 188
D+ LPKSVDWRKKGAV PVK+QG CGSCWAFSTVAAVEGINQI +GNL+SLSEQELIDC
Sbjct: 133 DITDLPKSVDWRKKGAVAPVKDQGQCGSCWAFSTVAAVEGINQITTGNLSSLSEQELIDC 192
Query: 189 DTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVP 248
DT+FN+GCNGGLMDYAF+YI+++GGLHKE+DYPYLMEEG C+++KE++E VTISGY+DVP
Sbjct: 193 DTTFNSGCNGGLMDYAFQYIISTGGLHKEDDYPYLMEEGICQEQKEDVERVTISGYEDVP 252
Query: 249 ENDEQSLLKALAHQPVSVAIEASGTDFQFYSG 280
END++SL+KALAHQPVSVAIEASG DFQFY G
Sbjct: 253 ENDDESLVKALAHQPVSVAIEASGRDFQFYKG 284
>gi|8886940|gb|AAF80626.1|AC069251_19 F2D10.37 [Arabidopsis thaliana]
Length = 315
Score = 457 bits (1176), Expect = e-126, Method: Compositional matrix adjust.
Identities = 218/287 (75%), Positives = 248/287 (86%), Gaps = 2/287 (0%)
Query: 26 HDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKE 85
HD+SIVGYSPE L S DKLIELFE+W+S K Y+ +EEK RFE+FK+NLKHID+ NK+
Sbjct: 29 HDYSIVGYSPEDLESHDKLIELFENWISNFEKAYETVEEKFLRFEVFKDNLKHIDETNKK 88
Query: 86 VTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPS--AEFSYRDVKALPKSVDWRKKG 143
SYWLGLNEFAD+SHEEFK YLGLK R + AEF+YRDV+A+PKSVDWRKKG
Sbjct: 89 GKSYWLGLNEFADLSHEEFKKMYLGLKTDIVRRDEERSYAEFAYRDVEAVPKSVDWRKKG 148
Query: 144 AVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDY 203
AV VKNQGSCGSCWAFSTVAAVEGIN+IV+GNLT+LSEQELIDCDT++NNGCNGGLMDY
Sbjct: 149 AVAEVKNQGSCGSCWAFSTVAAVEGINKIVTGNLTTLSEQELIDCDTTYNNGCNGGLMDY 208
Query: 204 AFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQP 263
AF+YIV +GGL KEEDYPY MEEGTCE +K+E E VTI+G+QDVP NDE+SLLKALAHQP
Sbjct: 209 AFEYIVKNGGLRKEEDYPYSMEEGTCEMQKDESETVTINGHQDVPTNDEKSLLKALAHQP 268
Query: 264 VSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYII 310
+SVAI+ASG +FQFYSGGVF G CG +LDHGVAAVGYG SKGSDYII
Sbjct: 269 LSVAIDASGREFQFYSGGVFDGRCGVDLDHGVAAVGYGSSKGSDYII 315
>gi|125540888|gb|EAY87283.1| hypothetical protein OsI_08685 [Oryza sativa Indica Group]
Length = 357
Score = 451 bits (1159), Expect = e-124, Method: Compositional matrix adjust.
Identities = 219/348 (62%), Positives = 258/348 (74%), Gaps = 5/348 (1%)
Query: 7 SKLLLLSLSLSLFACSSLA--HDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEE 64
SKL +L L L ACS+ A HD S+VGYS E L +KL+ LF SW KH K Y +E
Sbjct: 3 SKLSMLFLLLGFVACSATASHHDPSVVGYSQEDLALPNKLVGLFTSWSVKHSKIYASPKE 62
Query: 65 KLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTR-RQP-- 121
K+ R+EIFK NL+HI + N+ SYWLGLN FAD++HEEFK YLGLKP R QP
Sbjct: 63 KVKRYEIFKRNLRHIVETNRRNGSYWLGLNHFADIAHEEFKASYLGLKPGLARRDAQPHG 122
Query: 122 SAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLS 181
S F Y + LP +VDWRKKGAVTPVKNQG CGSCWAFSTVAAVEGINQIV+G L SLS
Sbjct: 123 STTFRYANAVNLPWAVDWRKKGAVTPVKNQGECGSCWAFSTVAAVEGINQIVTGKLVSLS 182
Query: 182 EQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTI 241
EQEL+DCD +FN+GC GGLMD+AF YI+ + G++ EEDYPYLMEEG C +K+ +V+TI
Sbjct: 183 EQELMDCDNTFNHGCRGGLMDFAFAYIMGNQGIYTEEDYPYLMEEGYCREKQPHSKVITI 242
Query: 242 SGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYG 301
+GY+DVPEN E SLLKALAHQPVSV I A DFQFY GG+F G CG + DH + AVGYG
Sbjct: 243 TGYEDVPENSETSLLKALAHQPVSVGIAAGSRDFQFYKGGIFDGECGIQPDHALTAVGYG 302
Query: 302 KSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
G DYII+KNSWG WGE+GY R++R TGKPEG+C I K+AS P K
Sbjct: 303 SYYGQDYIIMKNSWGKNWGEQGYFRIRRGTGKPEGVCDIYKIASYPTK 350
>gi|357143305|ref|XP_003572875.1| PREDICTED: xylem cysteine proteinase 1-like [Brachypodium
distachyon]
Length = 473
Score = 449 bits (1156), Expect = e-124, Method: Compositional matrix adjust.
Identities = 215/349 (61%), Positives = 257/349 (73%)
Query: 1 MAFFSHSKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYK 60
MA S L LSL ++ S+ +D S+VGYS E L KL++LF SW KH K Y
Sbjct: 1 MAMGSKLSLFFLSLGFVAYSSSASHNDPSVVGYSQEDLALPYKLVDLFSSWSVKHSKIYV 60
Query: 61 CIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQ 120
EEK+ R+E+FK+NLKHI + N+ SYWLGLN+FAD++HEEFK+ YLGLK +
Sbjct: 61 SPEEKVKRYEVFKQNLKHIVETNRRNGSYWLGLNQFADVAHEEFKSTYLGLKTGMDGPAR 120
Query: 121 PSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSL 180
F Y + LP SVDWRKKGAVTPVKNQG CGSCWAFSTVAAVEGINQI +G L SL
Sbjct: 121 APTAFRYENSVNLPWSVDWRKKGAVTPVKNQGECGSCWAFSTVAAVEGINQIATGKLESL 180
Query: 181 SEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVT 240
SEQEL+DCDT+F++GC GG MD+AF YI+ + G+H ++DYPYLMEEG C++K+ + +VVT
Sbjct: 181 SEQELMDCDTTFDHGCGGGFMDFAFAYIMGNLGIHTDDDYPYLMEEGYCKEKQPQSKVVT 240
Query: 241 ISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGY 300
ISGY+DVPEN E SLLKALAHQP+SV I A DFQFY GVF G CG ELDH + AVGY
Sbjct: 241 ISGYEDVPENSEVSLLKALAHQPISVGIAAGSKDFQFYKRGVFEGSCGTELDHALTAVGY 300
Query: 301 GKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
G S G DYII+KNSWG WGE+GY R+KR TGKPEG+C I MAS P K
Sbjct: 301 GSSDGQDYIIMKNSWGKSWGEQGYFRIKRGTGKPEGVCSIYSMASYPTK 349
>gi|115448287|ref|NP_001047923.1| Os02g0715000 [Oryza sativa Japonica Group]
gi|42408029|dbj|BAD09165.1| putative cysteine proteinase [Oryza sativa Japonica Group]
gi|113537454|dbj|BAF09837.1| Os02g0715000 [Oryza sativa Japonica Group]
gi|215737450|dbj|BAG96580.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215765786|dbj|BAG87483.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222623551|gb|EEE57683.1| hypothetical protein OsJ_08138 [Oryza sativa Japonica Group]
Length = 366
Score = 449 bits (1156), Expect = e-124, Method: Compositional matrix adjust.
Identities = 218/348 (62%), Positives = 257/348 (73%), Gaps = 5/348 (1%)
Query: 7 SKLLLLSLSLSLFACSSLA--HDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEE 64
SKL +L L L ACS+ A HD S+VGYS E L +KL+ LF SW KH K Y +E
Sbjct: 12 SKLSMLFLLLGFVACSATASHHDPSVVGYSQEDLALPNKLVGLFTSWSVKHSKIYASPKE 71
Query: 65 KLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTR-RQP-- 121
K+ R+EIFK NL+HI + N+ SYWLGLN FAD++HEEFK YLGLKP R QP
Sbjct: 72 KVKRYEIFKRNLRHIVETNRRNGSYWLGLNHFADIAHEEFKASYLGLKPGLARRDAQPHG 131
Query: 122 SAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLS 181
S F Y + LP +VDWRKKGAVTPVKNQG CGSCWAFSTVAAVEGINQIV+G L SLS
Sbjct: 132 STTFRYANAVNLPWAVDWRKKGAVTPVKNQGECGSCWAFSTVAAVEGINQIVTGKLVSLS 191
Query: 182 EQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTI 241
EQEL+DCD +FN+GC GGLMD+AF YI+ + G++ EEDYPYLMEEG C +K+ +V+TI
Sbjct: 192 EQELMDCDNTFNHGCRGGLMDFAFAYIMGNQGIYTEEDYPYLMEEGYCREKQPHSKVITI 251
Query: 242 SGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYG 301
+GY+DVP N E SLLKALAHQPVSV I A DFQFY GG+F G CG + DH + AVGYG
Sbjct: 252 TGYEDVPANSETSLLKALAHQPVSVGIAAGSRDFQFYKGGIFDGECGIQPDHALTAVGYG 311
Query: 302 KSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
G DYII+KNSWG WGE+GY R++R TGKPEG+C I K+AS P K
Sbjct: 312 SYYGQDYIIMKNSWGKNWGEQGYFRIRRGTGKPEGVCDIYKIASYPTK 359
>gi|222629922|gb|EEE62054.1| hypothetical protein OsJ_16838 [Oryza sativa Japonica Group]
Length = 336
Score = 443 bits (1140), Expect = e-122, Method: Compositional matrix adjust.
Identities = 218/299 (72%), Positives = 238/299 (79%), Gaps = 8/299 (2%)
Query: 57 KTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFP 116
K Y EEK+ RFE+FK+NL HID NK+VTSYWLGLNEFAD++H+EFK YLGL P P
Sbjct: 38 KAYASFEEKVRRFEVFKDNLNHIDDINKKVTSYWLGLNEFADLTHDEFKATYLGLTPP-P 96
Query: 117 TRRQP----SAEFSYRDVK--ALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGIN 170
TR S EF Y + +PK +DWRKK AVT VKNQG CGSCWAFSTVAAVEGIN
Sbjct: 97 TRSNSKHYSSEEFRYGKMSNGEVPKEMDWRKKNAVTEVKNQGQCGSCWAFSTVAAVEGIN 156
Query: 171 QIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCE 230
IV+GNLTSLSEQELIDC T NNGCNGGLMDYAF YI ++GGL EE YPY MEEG C+
Sbjct: 157 AIVTGNLTSLSEQELIDCSTDGNNGCNGGLMDYAFSYIASTGGLRTEEAYPYAMEEGDCD 216
Query: 231 DKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAE 290
+ K VVTISGY+DVP NDEQ+L+KALAHQPVSVAIEASG FQFYSGGVF GPCG +
Sbjct: 217 EGKGAA-VVTISGYEDVPANDEQALVKALAHQPVSVAIEASGRHFQFYSGGVFDGPCGEQ 275
Query: 291 LDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
LDHGV AVGYG SKG DYIIVKNSWGP WGE+GYIRMKR TGK EGLCGINKMAS P K
Sbjct: 276 LDHGVTAVGYGTSKGQDYIIVKNSWGPHWGEKGYIRMKRGTGKGEGLCGINKMASYPTK 334
>gi|242066206|ref|XP_002454392.1| hypothetical protein SORBIDRAFT_04g029960 [Sorghum bicolor]
gi|241934223|gb|EES07368.1| hypothetical protein SORBIDRAFT_04g029960 [Sorghum bicolor]
Length = 356
Score = 434 bits (1117), Expect = e-119, Method: Compositional matrix adjust.
Identities = 212/346 (61%), Positives = 255/346 (73%), Gaps = 4/346 (1%)
Query: 8 KLLLLSLSLSLFACSSLAH-DFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKL 66
KL +L L L+ ACS+ H D S+VGYS E L ++L+ LF+SW KH K Y +EKL
Sbjct: 4 KLPVLVLFLAFAACSASHHRDPSVVGYSQEDLALPNRLVNLFKSWSVKHRKIYVSPKEKL 63
Query: 67 HRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLK---PQFPTRRQPSA 123
R+ IFK+NL HI + N++ SYWLGLN+FAD++HEEFK +LGLK + + +
Sbjct: 64 KRYGIFKQNLMHIAETNRKNGSYWLGLNQFADITHEEFKANHLGLKQGLSRMGAQTRTPT 123
Query: 124 EFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQ 183
F Y LP SVDWR KGAVTPVKNQG CGSCWAFS+VAAVEGINQIV+G L SLSEQ
Sbjct: 124 TFRYAAAANLPWSVDWRYKGAVTPVKNQGKCGSCWAFSSVAAVEGINQIVTGKLVSLSEQ 183
Query: 184 ELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISG 243
EL+DCDT ++GC GGLMD+AF YI+ S G+H E+DYPYLMEEG C++K+ VVTI+G
Sbjct: 184 ELMDCDTMLDHGCEGGLMDFAFAYIMGSQGIHAEDDYPYLMEEGYCKEKQPYANVVTITG 243
Query: 244 YQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKS 303
Y+DVPEN E SLLKALAHQPVSV I A DFQFY GGVF G C ELDH + AVGYG S
Sbjct: 244 YEDVPENSEISLLKALAHQPVSVGIAAGSRDFQFYKGGVFDGSCSDELDHALTAVGYGSS 303
Query: 304 KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
G +YI +KNSWG WGE+GY+R+K TGKPEG+CGI MAS P+K
Sbjct: 304 YGQNYITMKNSWGKNWGEQGYVRIKMGTGKPEGVCGIYTMASYPVK 349
>gi|244539471|dbj|BAH82657.1| cysteine protease [Lotus japonicus]
Length = 286
Score = 422 bits (1086), Expect = e-116, Method: Compositional matrix adjust.
Identities = 200/251 (79%), Positives = 225/251 (89%), Gaps = 1/251 (0%)
Query: 41 MDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMS 100
MDKLIELFESWMS+HGK Y+ IEEKL RFEIFK+NLKHID+ NK V++YWLGLNEFAD+S
Sbjct: 1 MDKLIELFESWMSRHGKIYESIEEKLLRFEIFKDNLKHIDETNKVVSNYWLGLNEFADLS 60
Query: 101 HEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAF 160
H EFK +YLGLK F TRR+ S EF+YRDV LPKSVDWRKKGAVT +KNQGSCGSCWAF
Sbjct: 61 HHEFKKQYLGLKVDFSTRRESSEEFTYRDVD-LPKSVDWRKKGAVTNIKNQGSCGSCWAF 119
Query: 161 STVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDY 220
STVAAVEGINQIV+GNLTSLSEQELIDCD ++N+GCNGGLMDYAF +IV +GGLHKE+DY
Sbjct: 120 STVAAVEGINQIVTGNLTSLSEQELIDCDRTYNSGCNGGLMDYAFSFIVENGGLHKEDDY 179
Query: 221 PYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSG 280
PY+MEEGTCE KEE +VVTISGY DVP+N+EQSLLKALA+QP+SVAIEASG DFQFYSG
Sbjct: 180 PYIMEEGTCEMSKEESQVVTISGYHDVPQNNEQSLLKALANQPLSVAIEASGRDFQFYSG 239
Query: 281 GVFTGPCGAEL 291
GVF G CG +L
Sbjct: 240 GVFDGHCGTQL 250
>gi|116786779|gb|ABK24233.1| unknown [Picea sitchensis]
Length = 463
Score = 421 bits (1083), Expect = e-115, Method: Compositional matrix adjust.
Identities = 206/346 (59%), Positives = 254/346 (73%), Gaps = 4/346 (1%)
Query: 9 LLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHR 68
LL L+LS A S+ DFSI+GY + L D ++EL+E W+++H K Y + EK +R
Sbjct: 5 LLFAVLALSAMAGSASRADFSIIGYDSKDLREDDAIMELYELWLAQHKKAYNGLGEKQNR 64
Query: 69 FEIFKENLKHIDQRNKEVT-SYWLGLNEFADMSHEEFKNKYLGLKPQFPTR--RQPSAEF 125
F +FK+N +I Q N + SY LGLN+FAD+SHEEFK YLG K R PS +
Sbjct: 65 FSVFKDNFLYIHQHNNQGNPSYKLGLNQFADLSHEEFKATYLGAKLDTKKRLSNSPSPRY 124
Query: 126 SYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQEL 185
Y D + LP+S+DWR+KGAVT VK+QGSCGSCWAFSTVAAVEGINQIV+GNLTSLSEQEL
Sbjct: 125 QYSDGEDLPESIDWREKGAVTAVKDQGSCGSCWAFSTVAAVEGINQIVTGNLTSLSEQEL 184
Query: 186 IDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQ 245
+DCDTS+N GCNGGLMDYAF++I+ +GGL E+DYPY +G+C+ ++ VVTI Y+
Sbjct: 185 VDCDTSYNQGCNGGLMDYAFQFIINNGGLDSEDDYPYKANDGSCDAYRKNAHVVTIDDYE 244
Query: 246 DVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKG 305
DVPENDE+SL KA A+QP+SVAIEASG FQFY GVFT CG +LDHGV VGYG G
Sbjct: 245 DVPENDEKSLKKAAANQPISVAIEASGRAFQFYESGVFTSTCGTQLDHGVTLVGYGSESG 304
Query: 306 SDYIIVKNSWGPKWGERGYIRMKRNT-GKPEGLCGINKMASIPLKK 350
+DY IVKNSWG WGE+G+IR++RN G G+CGI AS PLKK
Sbjct: 305 TDYWIVKNSWGKSWGEKGFIRLQRNIEGVSTGMCGIAMEASYPLKK 350
>gi|413938554|gb|AFW73105.1| hypothetical protein ZEAMMB73_931917 [Zea mays]
Length = 361
Score = 414 bits (1065), Expect = e-113, Method: Compositional matrix adjust.
Identities = 206/351 (58%), Positives = 250/351 (71%), Gaps = 13/351 (3%)
Query: 9 LLLLSLSLSLFACSSLAH-DFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLH 67
+ + L L+ ACS+ H D S+VGYS E L L F SW KHGK Y EKL
Sbjct: 7 VAVFVLFLAFAACSANHHRDPSVVGYSQEDLALPSSL---FRSWSVKHGKLYASPTEKLE 63
Query: 68 RFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFP------TRRQP 121
R+EIFK+NL HI + N++ SYWLGLN+FAD++HEEFK YLGLK P TR
Sbjct: 64 RYEIFKQNLMHIAETNRKNGSYWLGLNQFADVAHEEFKASYLGLKRALPRAGAPQTRTPT 123
Query: 122 SAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLS 181
+ ++ +LP SVDWR KGAVTPVKNQG CGSCWAFS+VAAVEGINQIV+G L SLS
Sbjct: 124 AFRYAAAAAGSLPWSVDWRYKGAVTPVKNQGKCGSCWAFSSVAAVEGINQIVTGKLVSLS 183
Query: 182 EQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVT- 240
EQEL+DCDT+ ++GC GG MD AF Y++ S G+H E+DYPYLMEEG C++K+ + +T
Sbjct: 184 EQELVDCDTTLDHGCEGGTMDLAFAYMMGSQGIHAEDDYPYLMEEGYCKEKQPCVLGITE 243
Query: 241 --ISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAV 298
++G++DVPEN E SLLKALAHQPVSV I A DFQFY GGVF G C ELDH + AV
Sbjct: 244 QDLTGFEDVPENSEISLLKALAHQPVSVGIAAGSRDFQFYRGGVFDGACSVELDHALTAV 303
Query: 299 GYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
GYG S G +YI +KNSWG WGE+GY+R+K TGKPEG+CGI MAS P+K
Sbjct: 304 GYGSSYGQNYITMKNSWGKNWGEQGYVRIKMGTGKPEGVCGIYTMASYPVK 354
>gi|1208549|gb|AAC49455.1| Pseudotzain [Pseudotsuga menziesii]
Length = 454
Score = 414 bits (1063), Expect = e-113, Method: Compositional matrix adjust.
Identities = 202/346 (58%), Positives = 252/346 (72%), Gaps = 4/346 (1%)
Query: 9 LLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHR 68
LL L+LS A S+ DFSI+ Y + L D ++EL+E W+++H K Y ++EK +
Sbjct: 5 LLFAVLALSAMAGSASRADFSIISYDSQDLIGDDAIMELYELWLAQHKKAYNGLDEKQKK 64
Query: 69 FEIFKENLKHIDQRNKEVT-SYWLGLNEFADMSHEEFKNKYLGLKPQFPTR--RQPSAEF 125
F +FK+N +I Q N + SY LGLN+FAD+SHEEFK YLG K R R PS +
Sbjct: 65 FSVFKDNFLYIHQHNNQGNPSYKLGLNQFADLSHEEFKAAYLGTKLDAKKRLSRSPSPRY 124
Query: 126 SYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQEL 185
Y + LP+S+DWR+KGAVT VKNQGSCGSCWAFSTVAAVEGINQIV+GNLTSLSEQEL
Sbjct: 125 QYSVGEDLPESIDWREKGAVTAVKNQGSCGSCWAFSTVAAVEGINQIVTGNLTSLSEQEL 184
Query: 186 IDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQ 245
+DCDTS+N GCNGGLMDYAF++I+++GGL E+DYPY G+C+ ++ VVTI Y+
Sbjct: 185 VDCDTSYNQGCNGGLMDYAFQFIISNGGLDSEDDYPYKANNGSCDAYRKNAHVVTIDDYE 244
Query: 246 DVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKG 305
DVPENDE+SL KA A+QP+SVAIEASG FQFY GVFT CG +LDHGV VGYG G
Sbjct: 245 DVPENDEKSLKKAAANQPISVAIEASGRAFQFYESGVFTSNCGTQLDHGVTLVGYGSESG 304
Query: 306 SDYIIVKNSWGPKWGERGYIRMKRNT-GKPEGLCGINKMASIPLKK 350
DY +VKNSWG WGE+G+I+++RN G G+CGI AS P+KK
Sbjct: 305 IDYWLVKNSWGNSWGEKGFIKLQRNLEGASTGMCGIAMEASYPVKK 350
>gi|62526575|gb|AAX84673.1| cysteine protease CP1 [Manihot esculenta]
Length = 467
Score = 402 bits (1034), Expect = e-110, Method: Compositional matrix adjust.
Identities = 194/357 (54%), Positives = 255/357 (71%), Gaps = 9/357 (2%)
Query: 1 MAFFSHSKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLT-----SMDKLIELFESWMSKH 55
M F S + + L LS F SS A D SI+ Y H T + D+++ ++E W+ K
Sbjct: 2 MGLFGSSAAMFVLLFLS-FTLSS-ASDMSIISYDQTHATKSSWRTDDEVMAIYEEWLVKQ 59
Query: 56 GKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQF 115
GK Y + E+ RF++FK+NL+ ID+ N E +Y LGLN FAD+++EE+++ YLG +
Sbjct: 60 GKVYNALGEREKRFQVFKDNLRFIDEHNSENRTYKLGLNGFADLTNEEYRSTYLGARGGM 119
Query: 116 PTRR--QPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIV 173
R + S ++ R ++LP SVDWRK+GAV VK+QGSCGSCWAFST+AAVEGIN+IV
Sbjct: 120 KRNRLRKTSDRYAPRVGESLPDSVDWRKEGAVAEVKDQGSCGSCWAFSTIAAVEGINKIV 179
Query: 174 SGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKK 233
+G+L SLSEQEL+DCDTS+N GCNGGLMDYAF++I+ +GG+ EEDYPYL +G C+ +
Sbjct: 180 TGDLISLSEQELVDCDTSYNEGCNGGLMDYAFEFIINNGGIDTEEDYPYLARDGRCDTYR 239
Query: 234 EEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDH 293
+ +VVTI Y+DVP N E +L KA+A+QPVSVAIEA G DFQFY+ G+F+G CG +LDH
Sbjct: 240 KNAKVVTIDDYEDVPVNSETALQKAVANQPVSVAIEAGGRDFQFYASGIFSGRCGTQLDH 299
Query: 294 GVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLKK 350
GVAAVGYG G DY IV+NSWG WGE GY+RM R+ P G+CGI AS P+KK
Sbjct: 300 GVAAVGYGTENGKDYWIVRNSWGKSWGENGYLRMARSINSPTGICGIAMEASYPIKK 356
>gi|147790682|emb|CAN61026.1| hypothetical protein VITISV_001146 [Vitis vinifera]
Length = 469
Score = 402 bits (1033), Expect = e-109, Method: Compositional matrix adjust.
Identities = 195/358 (54%), Positives = 253/358 (70%), Gaps = 8/358 (2%)
Query: 1 MAFFSHSKLLLLSLSLSLFACSSLAHDFSIVGYSPEH-----LTSMDKLIELFESWMSKH 55
M S + + L L L S+ A D SI+GY H + + ++ ++E+W++KH
Sbjct: 1 MGLCRSSSSMAVFLFLLLGLASASAXDMSIIGYDETHGDKSSWRTDEDVMAVYEAWLAKH 60
Query: 56 GKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQF 115
GK+Y + EK RF+IFK+NL+ ID+ N E +Y +GLN FAD+++EE+++ YLG +
Sbjct: 61 GKSYNALGEKERRFQIFKDNLRFIDEHNAENRTYKVGLNRFADLTNEEYRSMYLGTRTAA 120
Query: 116 PTR--RQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIV 173
R + S +++R +LP+SVDWRKKGAV VK+QGSCGSCWAFST+AAVEGIN+IV
Sbjct: 121 KRRSSNKISDRYAFRVGDSLPESVDWRKKGAVVEVKDQGSCGSCWAFSTIAAVEGINKIV 180
Query: 174 SGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKK 233
+G L SLSEQEL+DCDTS+N GCNGGLMDYAF++I+ +GG+ EEDYPY +G C+ +
Sbjct: 181 TGGLISLSEQELVDCDTSYNEGCNGGLMDYAFEFIINNGGIDSEEDYPYKASDGRCDQYR 240
Query: 234 EEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDH 293
+ VVTI GY+DVPENDE+SL KA+A+QPVSVAIEA G +FQ Y G+FTG CG LDH
Sbjct: 241 KNAXVVTIDGYEDVPENDEKSLEKAVANQPVSVAIEAGGREFQLYQSGIFTGRCGTALDH 300
Query: 294 GVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTG-KPEGLCGINKMASIPLKK 350
GV AVGYG G DY IVKNSWG WGE GYIRM+R+ G CGI AS P+KK
Sbjct: 301 GVTAVGYGTENGVDYWIVKNSWGASWGEEGYIRMERDLATSATGKCGIAMEASYPIKK 358
>gi|116787404|gb|ABK24495.1| unknown [Picea sitchensis]
gi|224286306|gb|ACN40861.1| unknown [Picea sitchensis]
Length = 452
Score = 400 bits (1029), Expect = e-109, Method: Compositional matrix adjust.
Identities = 196/337 (58%), Positives = 246/337 (72%), Gaps = 5/337 (1%)
Query: 17 SLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENL 76
S A S+ DFSI+ S + L D ++EL+E W+++H + Y ++EK RF +FK+N
Sbjct: 13 SAMAGSASRADFSII--SSKDLREDDAIMELYELWLAEHKRAYNGLDEKQKRFSVFKDNF 70
Query: 77 KHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTR--RQPSAEFSYRDVKALP 134
+I + N+ SY LGLN+FAD+SHEEFK YLG K R R PS + Y D + LP
Sbjct: 71 LYIHEHNQGNRSYKLGLNQFADLSHEEFKATYLGAKLDTKKRLSRPPSRRYQYSDGEDLP 130
Query: 135 KSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNN 194
+S+DWR+KGAVT VK+QGSCGSCWAFSTVAAVEGINQIV+G+L SLSEQEL+DCDTS+N
Sbjct: 131 ESIDWREKGAVTSVKDQGSCGSCWAFSTVAAVEGINQIVTGDLISLSEQELVDCDTSYNQ 190
Query: 195 GCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQS 254
GCNGGLMDYAF++I+ +GGL EEDYPY +G+C+ ++ VVTI Y+DVPENDE+S
Sbjct: 191 GCNGGLMDYAFEFIINNGGLDSEEDYPYTAYDGSCDSYRKNAHVVTIDDYEDVPENDEKS 250
Query: 255 LLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNS 314
L KA A+QP+SVAIEASG +FQFY GVFT CG +LDHGV VGYG G+DY VKNS
Sbjct: 251 LKKAAANQPISVAIEASGREFQFYDSGVFTSTCGTQLDHGVTLVGYGSESGTDYWTVKNS 310
Query: 315 WGPKWGERGYIRMKRNTG-KPEGLCGINKMASIPLKK 350
WG WGE G+IR++RN G+CGI AS P+KK
Sbjct: 311 WGKSWGEEGFIRLQRNIEVASTGMCGIAMEASYPVKK 347
>gi|224136808|ref|XP_002326950.1| predicted protein [Populus trichocarpa]
gi|222835265|gb|EEE73700.1| predicted protein [Populus trichocarpa]
Length = 456
Score = 400 bits (1027), Expect = e-109, Method: Compositional matrix adjust.
Identities = 195/346 (56%), Positives = 249/346 (71%), Gaps = 7/346 (2%)
Query: 11 LLSLSLSLFACSSLAHDFSIVGYSPEHLT-----SMDKLIELFESWMSKHGKTYKCIEEK 65
+L L +FA SS A D SI+ Y H T + D+++ ++E W+ KHGK Y + EK
Sbjct: 1 MLMLLFLVFALSS-AFDMSIISYHQTHATKSSWRTDDEVMAMYEEWLVKHGKNYNALGEK 59
Query: 66 LHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTR-RQPSAE 124
RFEIFK+NL IDQ N E +Y +GLN FAD+++EEF++ YLG + R + S
Sbjct: 60 EKRFEIFKDNLMFIDQHNSENRTYTVGLNRFADLTNEEFRSMYLGTRTGHKKRLPKTSDR 119
Query: 125 FSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQE 184
++ R +LP SVDWRK+GAV VK+QG CGSCWAFST+AAVEGIN+IV+G+L +LSEQE
Sbjct: 120 YAPRVGDSLPDSVDWRKEGAVAEVKDQGGCGSCWAFSTIAAVEGINKIVTGDLIALSEQE 179
Query: 185 LIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGY 244
L+DCDTS+N GCNGGLMDYAF++I+ +GG+ E+DYPYL +G C+ ++ +VV+I Y
Sbjct: 180 LVDCDTSYNEGCNGGLMDYAFEFIINNGGIDTEDDYPYLGRDGRCDTYRKNAKVVSIDSY 239
Query: 245 QDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSK 304
+DVPENDE +L KA+A+QPVSVAIE G +FQ Y+ GVFTG CG LDHGVAAVGYG K
Sbjct: 240 EDVPENDETALKKAVANQPVSVAIEGGGRNFQLYNSGVFTGECGTSLDHGVAAVGYGTEK 299
Query: 305 GSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLKK 350
G DY IV+NSWG WGE GYIRM+RN P G CGI S P+KK
Sbjct: 300 GKDYWIVRNSWGKSWGESGYIRMERNIASPTGKCGIAIEPSYPIKK 345
>gi|225458701|ref|XP_002284973.1| PREDICTED: cysteine proteinase RD21a-like [Vitis vinifera]
Length = 467
Score = 399 bits (1025), Expect = e-108, Method: Compositional matrix adjust.
Identities = 195/354 (55%), Positives = 253/354 (71%), Gaps = 12/354 (3%)
Query: 5 SHSKLLLLSLSLSLFACSSLAHDFSIVGYSPEH-----LTSMDKLIELFESWMSKHGKTY 59
S S + L L L L + A D SI+GY H + + ++ ++E+W++KHGK+Y
Sbjct: 7 SSSMAVFLFLLLGLAS----ALDMSIIGYDETHGDKSSWRTDEDVMAVYEAWLAKHGKSY 62
Query: 60 KCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTR- 118
+ EK RF+IFK+NL+ ID+ N E +Y +GLN FAD+++EE+++ YLG + R
Sbjct: 63 NALGEKERRFQIFKDNLRFIDEHNAENRTYKVGLNRFADLTNEEYRSMYLGTRTAAKRRS 122
Query: 119 -RQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNL 177
+ S +++R +LP+SVDWRKKGAV VK+QGSCGSCWAFST+AAVEGIN+IV+G L
Sbjct: 123 SNKISDRYAFRVGDSLPESVDWRKKGAVVEVKDQGSCGSCWAFSTIAAVEGINKIVTGGL 182
Query: 178 TSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEME 237
SLSEQEL+DCDTS+N GCNGGLMDYAF++I+ +GG+ EEDYPY +G C+ ++ +
Sbjct: 183 ISLSEQELVDCDTSYNEGCNGGLMDYAFEFIINNGGIDSEEDYPYKASDGRCDQYRKNAK 242
Query: 238 VVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAA 297
VVTI GY+DVPENDE+SL KA+A+QPVSVAIEA G +FQ Y G+FTG CG LDHGV A
Sbjct: 243 VVTIDGYEDVPENDEKSLEKAVANQPVSVAIEAGGREFQLYQSGIFTGRCGTALDHGVTA 302
Query: 298 VGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTG-KPEGLCGINKMASIPLKK 350
VGYG G DY IVKNSWG WGE GYIRM+R+ G CGI AS P+KK
Sbjct: 303 VGYGTENGVDYWIVKNSWGASWGEEGYIRMERDLATSATGKCGIAMEASYPIKK 356
>gi|118486542|gb|ABK95110.1| unknown [Populus trichocarpa]
Length = 465
Score = 396 bits (1018), Expect = e-108, Method: Compositional matrix adjust.
Identities = 190/334 (56%), Positives = 242/334 (72%), Gaps = 6/334 (1%)
Query: 23 SLAHDFSIVGYSPEHLT-----SMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLK 77
S A D SI+ Y H T + D+++ ++E W+ KHGK Y + EK RFEIFK+NL
Sbjct: 21 SSAFDMSIISYHQTHATKSSWRTDDEVMAMYEEWLVKHGKNYNALGEKEKRFEIFKDNLM 80
Query: 78 HIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTR-RQPSAEFSYRDVKALPKS 136
IDQ N E +Y +GLN FAD+++EEF++ YLG + R + S ++ R +LP S
Sbjct: 81 FIDQHNSENRTYTVGLNRFADLTNEEFRSMYLGTRTGHKKRLPKTSDRYAPRVGDSLPDS 140
Query: 137 VDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGC 196
VDWRK+GAV VK+QG CGSCWAFST+AAVEGIN+IV+G+L +LSEQEL+DCDTS+N GC
Sbjct: 141 VDWRKEGAVAEVKDQGGCGSCWAFSTIAAVEGINKIVTGDLIALSEQELVDCDTSYNEGC 200
Query: 197 NGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLL 256
NGGLMDYAF++I+ +GG+ E+DYPYL +G C+ ++ +VV+I Y+DVPENDE +L
Sbjct: 201 NGGLMDYAFEFIINNGGIDTEDDYPYLGRDGRCDTYRKNAKVVSIDSYEDVPENDETALK 260
Query: 257 KALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWG 316
KA+A+QPVSVAIE G +FQ Y+ GVFTG CG LDHGVAAVGYG KG DY IV+NSWG
Sbjct: 261 KAVANQPVSVAIEGGGRNFQLYNSGVFTGECGTSLDHGVAAVGYGTEKGKDYWIVRNSWG 320
Query: 317 PKWGERGYIRMKRNTGKPEGLCGINKMASIPLKK 350
WGE GYIRM+RN P G CGI S P+KK
Sbjct: 321 KSWGESGYIRMERNIASPTGKCGIAIEPSYPIKK 354
>gi|109390302|gb|ABG33750.1| cysteine protease [Hevea brasiliensis]
Length = 457
Score = 394 bits (1012), Expect = e-107, Method: Compositional matrix adjust.
Identities = 190/342 (55%), Positives = 243/342 (71%), Gaps = 9/342 (2%)
Query: 18 LFACSSL--AHDFSIVGYSPEHLT-----SMDKLIELFESWMSKHGKTYKCIEEKLHRFE 70
LF S+L A D SI+ Y H T + D+++ ++E W+ KHGK Y + EK RFE
Sbjct: 5 LFFASTLSSASDLSIISYDQSHGTKSSWRTDDEVMAIYEDWLVKHGKAYNSLGEKERRFE 64
Query: 71 IFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTR--RQPSAEFSYR 128
+FK+NL+ ID+ N E +Y +GLN FAD+++EE+++ YLG R+ S ++ R
Sbjct: 65 VFKDNLRFIDEHNSENRTYRVGLNRFADLTNEEYRSMYLGALSGIRRNKLRKISDRYTPR 124
Query: 129 DVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDC 188
+LP SVDWRK+GAV VK+QGSCGSCWAFS VAAVEGIN+IV+G+L SLSEQEL+DC
Sbjct: 125 VGDSLPDSVDWRKEGAVVGVKDQGSCGSCWAFSAVAAVEGINKIVTGDLISLSEQELVDC 184
Query: 189 DTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVP 248
D S+N GCNGGLMDY F++I+ +GG+ EEDYPYL +G C+ ++ VV+I Y+DVP
Sbjct: 185 DNSYNEGCNGGLMDYGFEFIINNGGIDSEEDYPYLARDGRCDTYRKNARVVSIDSYEDVP 244
Query: 249 ENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDY 308
N+E +L KA+A+QPVSVAIEA G DFQ YS GVF+G CG LDHGV AVGYG G DY
Sbjct: 245 VNNEAALQKAVANQPVSVAIEAGGRDFQLYSSGVFSGRCGTALDHGVVAVGYGTENGQDY 304
Query: 309 IIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLKK 350
IV+NSWG WGE GY+RM RN KP G+CGI AS P+KK
Sbjct: 305 WIVRNSWGKSWGESGYLRMARNIRKPTGICGIAMEASYPIKK 346
>gi|374713651|gb|AEZ65083.1| cysteine protease [Carica papaya]
Length = 467
Score = 392 bits (1007), Expect = e-106, Method: Compositional matrix adjust.
Identities = 196/348 (56%), Positives = 245/348 (70%), Gaps = 10/348 (2%)
Query: 12 LSLSLSLFACSSLAHDFSIVGYSPEHL-----TSMDKLIELFESWMSKHGKTYKCIEEKL 66
LSL L + +S A D SIV Y H + D+++ ++E+W+ KHGK Y + EK
Sbjct: 8 LSLFLLMIFTASSAVDMSIVSYDQRHADKSSWRTDDEVMAMYEAWLVKHGKAYNALGEKE 67
Query: 67 HRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFP--TRR--QPS 122
RF IFK+NL+ ID+ N + +Y LGLN FAD+++EE+++ YLG+KP TR+ + S
Sbjct: 68 KRFGIFKDNLRFIDEHNSQNLTYRLGLNRFADLTNEEYRSMYLGVKPGATRVTRKVSRKS 127
Query: 123 AEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSE 182
F+ R ALP +DWRK+GAV VK+QGSCGSCWAFST+AAVEGINQIV+G+L SLSE
Sbjct: 128 DRFAARVGDALPDFIDWRKEGAVVGVKDQGSCGSCWAFSTIAAVEGINQIVTGDLISLSE 187
Query: 183 QELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTIS 242
QEL+DCDTS+N GCNGGLMDYAF++I+ +GG+ EEDYPY + C+ ++ VV+I
Sbjct: 188 QELVDCDTSYNEGCNGGLMDYAFEFIINNGGIDSEEDYPYRAADQKCDQYRKNANVVSID 247
Query: 243 GYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGK 302
GY+DVPENDE +L KA+A QPVSVAIEA G FQ Y GVFTG CG LDHGVAAVGYG
Sbjct: 248 GYEDVPENDEAALKKAVAKQPVSVAIEAGGRAFQLYQSGVFTGKCGTSLDHGVAAVGYGT 307
Query: 303 SKGSDYIIVKNSWGPKWGERGYIRMKRN-TGKPEGLCGINKMASIPLK 349
G DY IV NSWG WGE GYIRM+RN G G CGI S P+K
Sbjct: 308 ENGQDYWIVGNSWGKNWGEDGYIRMERNLAGSSSGKCGIAIGPSYPIK 355
>gi|224103643|ref|XP_002313136.1| predicted protein [Populus trichocarpa]
gi|222849544|gb|EEE87091.1| predicted protein [Populus trichocarpa]
Length = 477
Score = 391 bits (1005), Expect = e-106, Method: Compositional matrix adjust.
Identities = 196/346 (56%), Positives = 242/346 (69%), Gaps = 9/346 (2%)
Query: 14 LSLSLFACSSLAHDFSIVGYSPEHLTSMDKL----IELFESWMSKHGKTYKCIEEKLHRF 69
L+ F LA D SI+ Y+ +H ++ + L+E W+ K+GK Y + EK RF
Sbjct: 11 LATFYFLSVCLAIDMSIIDYNLKHGQVPERTEAETLRLYEMWLVKYGKAYNALGEKERRF 70
Query: 70 EIFKENLKHIDQRNKEVT-SYWLGLNEFADMSHEEFKNKYLGLKPQFPTRR---QPSAEF 125
EIFK+NLK +DQ N SY LGLN+FAD+S+EE++ YLG + R SA +
Sbjct: 71 EIFKDNLKFVDQHNSVGNPSYKLGLNKFADLSNEEYRAAYLGTRMDGKRRLLGGPKSARY 130
Query: 126 SYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQEL 185
++D LP+SVDWR+KGAV PVK+QG CGSCWAFSTV AVEGINQIV+GNLTSLSEQEL
Sbjct: 131 LFKDGDDLPESVDWREKGAVAPVKDQGQCGSCWAFSTVGAVEGINQIVTGNLTSLSEQEL 190
Query: 186 IDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQ 245
+DCD +N GCNGGLMDYAF++I+ +GG+ EEDYPY + C+ ++ VVTI GY+
Sbjct: 191 VDCDKVYNQGCNGGLMDYAFEFIMKNGGIDTEEDYPYKAVDSMCDPNRKNARVVTIDGYE 250
Query: 246 DVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKG 305
DVP+NDE+SL KA+A+QPVSVAIEA G FQ Y GVFTG CG +LDHGV AVGYG G
Sbjct: 251 DVPQNDEKSLRKAVANQPVSVAIEAGGRAFQLYQSGVFTGSCGTQLDHGVVAVGYGTENG 310
Query: 306 SDYIIVKNSWGPKWGERGYIRMKRNTGKPE-GLCGINKMASIPLKK 350
DY +V+NSWGP WGE GYIRM+RN E G CGI AS P KK
Sbjct: 311 VDYWVVRNSWGPAWGENGYIRMERNVASTETGKCGIAMEASYPTKK 356
>gi|255538210|ref|XP_002510170.1| cysteine protease, putative [Ricinus communis]
gi|223550871|gb|EEF52357.1| cysteine protease, putative [Ricinus communis]
Length = 469
Score = 391 bits (1005), Expect = e-106, Method: Compositional matrix adjust.
Identities = 194/360 (53%), Positives = 252/360 (70%), Gaps = 12/360 (3%)
Query: 1 MAFFSHSKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLT-----SMDKLIELFESWMSKH 55
M FS S + + L + S+L D SIV Y HLT + D+++ ++E W+ K+
Sbjct: 1 MGLFSSSSAMFVFLFFTFTLSSAL--DMSIVSYDQTHLTKSSWRTDDEVMAIYEEWLVKN 58
Query: 56 GKTY---KCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLK 112
GK + + EK RF++FK+NL+ ID+ N E SY +GLN FAD+++EE+++ YLG +
Sbjct: 59 GKAHSNNNALGEKERRFQVFKDNLRFIDEHNSENRSYKVGLNRFADLTNEEYRSMYLGAR 118
Query: 113 PQFPTRR--QPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGIN 170
R + S + R +LP SVDWRK+GAV VK+QGSCGSCWAFST+AAVEGIN
Sbjct: 119 SGAKRNRLSRSSNRYLPRVGDSLPDSVDWRKEGAVAEVKDQGSCGSCWAFSTIAAVEGIN 178
Query: 171 QIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCE 230
+IV+G+L SLSEQEL+DCD S+N GCNGGLMDYAF++I+ +GG+ EEDYPYL +GTC+
Sbjct: 179 KIVTGDLISLSEQELVDCDRSYNEGCNGGLMDYAFQFIINNGGIDSEEDYPYLARDGTCD 238
Query: 231 DKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAE 290
++ +VVTI Y+DVP NDE++L KA+A+QPVSVAIEA G +FQFY G+FTG CG
Sbjct: 239 TYRKNAKVVTIDNYEDVPVNDEKALQKAVANQPVSVAIEAGGREFQFYQSGIFTGRCGTA 298
Query: 291 LDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLKK 350
LDHGVAAVGYG G DY IV+NSWG WGE GYIRM+RN G CGI S P+KK
Sbjct: 299 LDHGVAAVGYGTENGKDYWIVRNSWGKSWGESGYIRMERNIATATGKCGIAIEPSYPIKK 358
>gi|146216000|gb|ABQ10202.1| cysteine protease Cp4 [Actinidia deliciosa]
Length = 463
Score = 390 bits (1003), Expect = e-106, Method: Compositional matrix adjust.
Identities = 187/343 (54%), Positives = 241/343 (70%), Gaps = 6/343 (1%)
Query: 13 SLSLSLFACSSL--AHDFSIVGYSPEH--LTSMDKLIELFESWMSKHGKTYKCIEEKLHR 68
S++ LF C + A D SI+ Y H + + + ++E W++ HGK Y I EK R
Sbjct: 8 SVACLLFLCFAFSSALDMSIISYDQTHPPQRTDAEAMAIYEKWLTTHGKAYNAIGEKERR 67
Query: 69 FEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQP--SAEFS 126
FEIFK+NL+ +D+ N SY +GLN FAD+++EE+++ +LG + R S ++
Sbjct: 68 FEIFKDNLRFVDEHNAVAGSYRVGLNRFADLTNEEYRSMFLGGNMEMKERSASTKSDRYA 127
Query: 127 YRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELI 186
+R LP SVDWR+KGAV+PVK+QG CGSCWAFST++AVEGINQIV+G L SLSEQEL+
Sbjct: 128 FRAGDKLPGSVDWREKGAVSPVKDQGQCGSCWAFSTISAVEGINQIVTGELISLSEQELV 187
Query: 187 DCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQD 246
DCD S+N GCNGGLMDY F++I+ +GG+ EEDYPY +GTC+ ++ VV+I+GY+D
Sbjct: 188 DCDKSYNMGCNGGLMDYGFQFIINNGGIDTEEDYPYRAVDGTCDQFRKNARVVSINGYED 247
Query: 247 VPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGS 306
VPE+DE SL KA+A+QPVSVAIEA G FQ Y GVFTG CG LDHGV AVGYG G
Sbjct: 248 VPEDDENSLKKAVANQPVSVAIEAGGRAFQLYESGVFTGHCGTNLDHGVVAVGYGTENGV 307
Query: 307 DYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
DY V+NSWGPKWGE GYI+++RN G CGI MAS P K
Sbjct: 308 DYWTVRNSWGPKWGENGYIKLERNINATSGKCGIASMASYPTK 350
>gi|168057475|ref|XP_001780740.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162667829|gb|EDQ54449.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 463
Score = 389 bits (998), Expect = e-105, Method: Compositional matrix adjust.
Identities = 190/344 (55%), Positives = 244/344 (70%), Gaps = 7/344 (2%)
Query: 9 LLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHR 68
L+LL +++ A + A+ +IV Y L S D ++++F W+ H + Y+ + EK HR
Sbjct: 12 LVLLVIAIGQQADAGRAN--AIVDYEGNQLHSDDAILDVFHQWLETHSRVYRSLSEKHHR 69
Query: 69 FEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYR 128
F+IFKEN +I NK+ SYWLGLN+F+D++H+EF+ +YLG KP R++ A F Y
Sbjct: 70 FQIFKENFLYIHAHNKQQKSYWLGLNKFSDLTHQEFRAQYLGTKP--VNRQRKEANFMYE 127
Query: 129 DVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDC 188
DV+A PK VDWR KGAVT VK+QG+CGSCWAFS V +VEG+N I +G L SLSEQEL+DC
Sbjct: 128 DVEAEPK-VDWRLKGAVTDVKDQGACGSCWAFSAVGSVEGVNAIKTGELVSLSEQELVDC 186
Query: 189 DTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVP 248
D N GCNGGLMDYAF++I+ +GG+ E+DYPY +G C++ + +VV I YQDVP
Sbjct: 187 DRKQNQGCNGGLMDYAFEFIIKNGGIDTEKDYPYKARDGRCDEGRRNSKVVVIDDYQDVP 246
Query: 249 ENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYG-KSKGSD 307
E +L+KAL PVSVAIEA G DFQ Y GGVFTGPCG+ELDHGV AVGYG G +
Sbjct: 247 TQSESALMKALTKNPVSVAIEAGGRDFQHYQGGVFTGPCGSELDHGVLAVGYGTDDDGVN 306
Query: 308 YIIVKNSWGPKWGERGYIRMKR-NTGKPEGLCGINKMASIPLKK 350
Y IVKNSWGP WGE+GYIRM+R + +G CGIN AS P+KK
Sbjct: 307 YWIVKNSWGPGWGEKGYIRMERFGSDSTDGKCGINIEASFPIKK 350
>gi|30141019|dbj|BAC75923.1| cysteine protease-1 [Helianthus annuus]
Length = 461
Score = 388 bits (997), Expect = e-105, Method: Compositional matrix adjust.
Identities = 187/354 (52%), Positives = 250/354 (70%), Gaps = 14/354 (3%)
Query: 10 LLLSLSLSLFACSSL--AHDFSIVGYSPEH---------LTSMDKLIELFESWMSKHGKT 58
L+ +LS FA S+ A D SI+ Y H L + D++ L+ESW+ KHGKT
Sbjct: 3 LIPMATLSFFALISIISAMDMSIINYDATHMSSSSSSAPLRTDDEVNALYESWLVKHGKT 62
Query: 59 YKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKP---QF 115
Y + EK RF+IFK+NL+ ID+ N +Y LGLN+FAD+++EE++ Y G+K +
Sbjct: 63 YNALGEKDRRFQIFKDNLRFIDEHNSGDHTYKLGLNKFADLTNEEYRMTYTGIKTIDDKK 122
Query: 116 PTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSG 175
+ S ++YR +LP+ VDWR++GAVT VK+QGSCGSCWAFST +VEG+N+IV+G
Sbjct: 123 KLSKMKSDRYAYRSGDSLPEYVDWREQGAVTDVKDQGSCGSCWAFSTTGSVEGVNKIVTG 182
Query: 176 NLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEE 235
+L S+SEQEL++CDTS+N GCNGGLMDYAF++I+ +GG+ EEDYPY ++G C+ K+
Sbjct: 183 DLISVSEQELVNCDTSYNQGCNGGLMDYAFEFIIKNGGIDTEEDYPYTGKDGKCDKNKKN 242
Query: 236 MEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGV 295
+VVTI Y+DVP NDE SL KA+++QPV+VAIEA G DFQFY+ G+FTG CG LDHGV
Sbjct: 243 AKVVTIDSYEDVPVNDESSLKKAVSNQPVAVAIEAGGRDFQFYTSGIFTGSCGTALDHGV 302
Query: 296 AAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
A GYG G DY +VKNSWG +WGE GY++M+RN G CGI AS P+K
Sbjct: 303 LAAGYGTEDGKDYWLVKNSWGAEWGEGGYLKMERNIADKSGKCGIAMEASYPIK 356
>gi|46401612|dbj|BAD16614.1| cysteine proteinase [Dianthus caryophyllus]
Length = 459
Score = 388 bits (997), Expect = e-105, Method: Compositional matrix adjust.
Identities = 188/350 (53%), Positives = 246/350 (70%), Gaps = 6/350 (1%)
Query: 4 FSHSKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIE 63
S SK + L +F SS A D SI+ + D++ L+E+W+ KHGK Y +
Sbjct: 1 MSTSKSTIFLLFSIIFIVSSSALDLSIIDRAFNRPD--DEIASLYETWLVKHGKNYNGLG 58
Query: 64 EKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQ----FPTRR 119
EK RF IFK+NL+ +D+RN E S+ LGLN FAD+++EE+++ YLG +P+ + R
Sbjct: 59 EKQLRFNIFKDNLRFVDERNSENLSFKLGLNRFADLTNEEYRSVYLGTRPRSVAVARSGR 118
Query: 120 QPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTS 179
S +++R LP+SVDWRKKGAV +K+QGSCGSCWAFS +AAVEG+NQIV+G+L S
Sbjct: 119 SKSDRYAFRAGDTLPESVDWRKKGAVAGIKDQGSCGSCWAFSAIAAVEGVNQIVTGDLIS 178
Query: 180 LSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVV 239
LSEQEL++CDTS+N+GC+GGLMDYAF++I+ + G+ +EDYPY +G C+ ++ +VV
Sbjct: 179 LSEQELVECDTSYNDGCDGGLMDYAFEFIIKNEGIDSDEDYPYTGRDGRCDTNRKNAKVV 238
Query: 240 TISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVG 299
TI Y+D P DE+SL KA+A+QPVSVAIE G DFQ Y GVFTG CG LDHGVA VG
Sbjct: 239 TIDDYEDSPVYDEKSLQKAVANQPVSVAIEGGGRDFQLYDSGVFTGKCGTALDHGVAVVG 298
Query: 300 YGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
YG G DY IV+NSWG WGE GYIRM+RNT P G+CGI S P+K
Sbjct: 299 YGTEDGLDYWIVRNSWGDTWGEGGYIRMQRNTKLPSGICGIAIEPSYPIK 348
>gi|146216004|gb|ABQ10204.1| cysteine protease Cp6 [Actinidia deliciosa]
Length = 461
Score = 388 bits (997), Expect = e-105, Method: Compositional matrix adjust.
Identities = 191/327 (58%), Positives = 241/327 (73%), Gaps = 7/327 (2%)
Query: 27 DFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEV 86
D SI+G T D+++ ++ESW+ KHGK+Y I EK RF+IFK+NL+ ID+ N E
Sbjct: 26 DMSIIGELSSSRTD-DEVMAMYESWLVKHGKSYNAIGEKEKRFQIFKDNLRFIDEHNAES 84
Query: 87 TSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDV----KALPKSVDWRKK 142
+Y +GLN FAD++++E+++ YLG + RR + + S R V ++LP SVDWR+K
Sbjct: 85 RTYKVGLNRFADLTNDEYRSMYLGARTG-SRRRLSTQKRSDRYVPVAGESLPDSVDWREK 143
Query: 143 GAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMD 202
GAV VK+QGSCGSCWAFST+AAVEGINQIV+G+L SLSEQEL+DCDTS+N GCNGGLMD
Sbjct: 144 GAVVGVKDQGSCGSCWAFSTIAAVEGINQIVTGDLISLSEQELVDCDTSYNEGCNGGLMD 203
Query: 203 YAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQ 262
YAF++I+ +GG+ EEDYPY +G C+ ++ +VVTI Y+DVP N+EQ+L KA+A+Q
Sbjct: 204 YAFEFIIKNGGIDTEEDYPYNARDGRCDQYRKNAKVVTIDDYEDVPVNNEQALQKAVANQ 263
Query: 263 PVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGER 322
PVSVAIEASG FQFY GVFTG CG LDHGV AVGYG DY IVKNSWG WGE
Sbjct: 264 PVSVAIEASGMAFQFYESGVFTGNCGTALDHGVTAVGYGTENSVDYWIVKNSWGSSWGES 323
Query: 323 GYIRMKRNTGKPEGLCGINKMASIPLK 349
GYIRM+RNTG G CGI S P+K
Sbjct: 324 GYIRMERNTGAT-GKCGIAVEPSYPIK 349
>gi|18422289|ref|NP_568620.1| Granulin repeat cysteine protease family protein [Arabidopsis
thaliana]
gi|9757832|dbj|BAB08269.1| cysteine protease component of protease-inhibitor complex
[Arabidopsis thaliana]
gi|17065064|gb|AAL32686.1| cysteine protease component of protease-inhibitor complex
[Arabidopsis thaliana]
gi|21387153|gb|AAM47980.1| cysteine protease component of protease-inhibitor complex
[Arabidopsis thaliana]
gi|332007522|gb|AED94905.1| Granulin repeat cysteine protease family protein [Arabidopsis
thaliana]
Length = 463
Score = 388 bits (996), Expect = e-105, Method: Compositional matrix adjust.
Identities = 198/362 (54%), Positives = 246/362 (67%), Gaps = 19/362 (5%)
Query: 1 MAFFSHSKLLLLSLSLSLFACSSLAHDFSIVGYSPEH-----LTSMDKLIE-LFESWMSK 54
M F S ++LL + + S A D SI+ Y H + D +E ++E+WM +
Sbjct: 1 MGFLKLSPMILLLAMIGV----SYAMDMSIISYDENHHITTETSRSDSEVERIYEAWMVE 56
Query: 55 HGKTYKCIE----EKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLG 110
HGK EK RFEIFK+NL+ ID+ N + SY LGL FAD+++EE+++ YLG
Sbjct: 57 HGKKKMNQNGLGAEKDQRFEIFKDNLRFIDEHNTKNLSYKLGLTRFADLTNEEYRSMYLG 116
Query: 111 LKPQFPTRR--QPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEG 168
KP T+R + S + R ALP SVDWRK+GAV VK+QGSCGSCWAFST+ AVEG
Sbjct: 117 AKP---TKRVLKTSDRYQARVGDALPDSVDWRKEGAVADVKDQGSCGSCWAFSTIGAVEG 173
Query: 169 INQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGT 228
IN+IV+G+L SLSEQEL+DCDTS+N GCNGGLMDYAF++I+ +GG+ E DYPY +G
Sbjct: 174 INKIVTGDLISLSEQELVDCDTSYNQGCNGGLMDYAFEFIIKNGGIDTEADYPYKAADGR 233
Query: 229 CEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCG 288
C+ ++ +VVTI Y+DVPEN E SL KALAHQP+SVAIEA G FQ YS GVF G CG
Sbjct: 234 CDQNRKNAKVVTIDSYEDVPENSEASLKKALAHQPISVAIEAGGRAFQLYSSGVFDGLCG 293
Query: 289 AELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPL 348
ELDHGV AVGYG G DY IV+NSWG +WGE GYI+M RN P G CGI AS P+
Sbjct: 294 TELDHGVVAVGYGTENGKDYWIVRNSWGNRWGESGYIKMARNIEAPTGKCGIAMEASYPI 353
Query: 349 KK 350
KK
Sbjct: 354 KK 355
>gi|297791625|ref|XP_002863697.1| hypothetical protein ARALYDRAFT_917391 [Arabidopsis lyrata subsp.
lyrata]
gi|297309532|gb|EFH39956.1| hypothetical protein ARALYDRAFT_917391 [Arabidopsis lyrata subsp.
lyrata]
Length = 463
Score = 387 bits (994), Expect = e-105, Method: Compositional matrix adjust.
Identities = 196/360 (54%), Positives = 245/360 (68%), Gaps = 15/360 (4%)
Query: 1 MAFFSHSKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSM-----DKLIE-LFESWMSK 54
M F S ++LL + + S A D SI+ Y H S D +E ++E+WM +
Sbjct: 1 MGFLKLSPMILLLAMIGV----SYAIDMSIISYDENHHISTVSSRSDAEVERIYEAWMVE 56
Query: 55 HGKTYKCIE----EKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLG 110
HGK EK RFEIFK+NL++ID+ N + SY LGL FAD++++E+++ YLG
Sbjct: 57 HGKKKMNQNGLGAEKDQRFEIFKDNLRYIDEHNTKNLSYKLGLTRFADLTNDEYRSMYLG 116
Query: 111 LKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGIN 170
KP + S + R ALP SVDWRK+GAV VK+QGSCGSCWAFST+ AVEGIN
Sbjct: 117 AKP-VKRVLKTSDRYEARVGDALPDSVDWRKEGAVADVKDQGSCGSCWAFSTIGAVEGIN 175
Query: 171 QIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCE 230
+IV+G+L SLSEQEL+DCDTS+N GCNGGLMDYAF++I+ +GG+ E DYPY +G C+
Sbjct: 176 KIVTGDLISLSEQELVDCDTSYNQGCNGGLMDYAFEFIIKNGGIDTEADYPYKAADGRCD 235
Query: 231 DKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAE 290
++ +VVTI Y+DVPEN E SL KALAHQP+SVAIEA G FQ YS GVF G CG E
Sbjct: 236 QNRKNAKVVTIDSYEDVPENSEASLKKALAHQPISVAIEAGGRAFQLYSSGVFDGICGTE 295
Query: 291 LDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLKK 350
LDHGV AVGYG G DY IV+NSWG +WGE GYI+M RN +P G CGI AS P+KK
Sbjct: 296 LDHGVVAVGYGTENGKDYWIVRNSWGNRWGESGYIKMARNIAEPTGKCGIAMEASYPIKK 355
>gi|50355611|dbj|BAD29954.1| cysteine protease [Daucus carota]
Length = 474
Score = 387 bits (993), Expect = e-105, Method: Compositional matrix adjust.
Identities = 194/354 (54%), Positives = 242/354 (68%), Gaps = 12/354 (3%)
Query: 8 KLLLLSLSLSLFACSSLAHDFSIVGYSPEH------LTSMDKLIELFESWMSKHGKTYKC 61
+ L+L SL+ F S A D SI+ Y H L + D+L+ L+ESW+ KH K Y
Sbjct: 14 QCLVLFFSLASFLMLSSASDMSIITYDETHGLNSPPLRTHDQLLSLYESWLVKHHKNYNA 73
Query: 62 IEEKLHRFEIFKENLKHIDQRNK-EVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQ 120
+ EK RF IFK+N+ +D+ N SY LGLN+FAD++++E+++ YL K R+
Sbjct: 74 LGEKETRFGIFKDNVGFVDRHNSMRNQSYKLGLNKFADLTNDEYRSLYLSGKMMKRERKN 133
Query: 121 P----SAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGN 176
S F + D LP+SVDWR +GAV PVK+QG CGSCWAFSTV AVEGIN+IV+G
Sbjct: 134 EDGFRSDRFVFEDGDHLPESVDWRDRGAVAPVKDQGQCGSCWAFSTVGAVEGINKIVTGE 193
Query: 177 LTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEM 236
L SLSEQEL+DCD +N GCNGGLMDYAF++IV +GG+ E+DYPY +G C+ ++
Sbjct: 194 LISLSEQELVDCDNGYNQGCNGGLMDYAFEFIVKNGGIDTEDDYPYKGVDGLCDQNRKNA 253
Query: 237 EVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVA 296
+VVTI+GY+DVP NDE+SL KA+AHQPVSVAIEA G FQ Y GVFTG CG ELDHGV
Sbjct: 254 KVVTINGYEDVPHNDEKSLKKAVAHQPVSVAIEAGGRAFQLYESGVFTGQCGTELDHGVV 313
Query: 297 AVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPE-GLCGINKMASIPLK 349
AVGYG G DY IV+NSWGP WGE GYIR++RN G CGI AS P K
Sbjct: 314 AVGYGSENGKDYWIVRNSWGPDWGESGYIRLERNVASTSTGKCGIAMQASYPTK 367
>gi|124484387|dbj|BAF46304.1| cysteine proteinase precursor [Ipomoea nil]
Length = 474
Score = 386 bits (991), Expect = e-105, Method: Compositional matrix adjust.
Identities = 188/334 (56%), Positives = 241/334 (72%), Gaps = 9/334 (2%)
Query: 26 HDFSIVGYSPEH-----LTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHID 80
D SI+ Y +H + S D++ E+FESW+ KHGK+Y ++EK RF+IF++NLK+ID
Sbjct: 23 EDMSIITYDQQHPAKGLVRSEDEVKEMFESWLVKHGKSYNAVDEKDKRFKIFRDNLKYID 82
Query: 81 QRNK-EVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDV--KALPKSV 137
++N E SY LGLN FAD+++EE++ YLG K S Y V +LP S+
Sbjct: 83 EKNSLENRSYKLGLNRFADITNEEYRTGYLGAKRDASRNMVKSKSDRYAPVAGDSLPDSI 142
Query: 138 DWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCN 197
DWR+KGAVT VK+QGSCGSCWAFST+AAVEG+NQ+ +GNL SLSEQEL+DCD N GCN
Sbjct: 143 DWREKGAVTGVKDQGSCGSCWAFSTIAAVEGVNQLATGNLISLSEQELVDCDRKINQGCN 202
Query: 198 GGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCED-KKEEMEVVTISGYQDVPENDEQSLL 256
GG M YAF++I+ +GG+ EEDYPY ++G C+ ++ +V +I GY++VP N+E+SL
Sbjct: 203 GGDMGYAFQFIIKNGGIDSEEDYPYTGKDGKCDSYRQNNAKVASIDGYEEVPVNNEKSLQ 262
Query: 257 KALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWG 316
KA+A+QPVSVAIEA G DFQ YS G+FTG CG +LDHGVAAVGYG G DY IVKNSWG
Sbjct: 263 KAVANQPVSVAIEAGGYDFQLYSSGIFTGSCGTDLDHGVAAVGYGTENGVDYWIVKNSWG 322
Query: 317 PKWGERGYIRMKRNTGKPEGLCGINKMASIPLKK 350
WGE+GY+RM+RN GLCGI AS P KK
Sbjct: 323 DYWGEKGYVRMQRNVKAKTGLCGIAMEASYPTKK 356
>gi|18141283|gb|AAL60579.1|AF454957_1 senescence-associated cysteine protease [Brassica oleracea]
Length = 460
Score = 385 bits (990), Expect = e-104, Method: Compositional matrix adjust.
Identities = 190/352 (53%), Positives = 246/352 (69%), Gaps = 15/352 (4%)
Query: 9 LLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMD------KLIELFESWMSKHGKTYKCI 62
+LLL++ + + S A D SI+ Y +H + + ++ ++E+WM KHGK +
Sbjct: 8 ILLLAMMIGV----SYAADMSIISYDEKHHITAENERSDAEVARIYEAWMEKHGKKAQSN 63
Query: 63 ----EEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTR 118
EEK RFEIFK+NL+ ID+ N + SY LGL FAD+++EE+++ YLG K +
Sbjct: 64 GLVGEEKDQRFEIFKDNLRFIDEHNNKNLSYKLGLTRFADLTNEEYRSIYLGAKSKKRVL 123
Query: 119 RQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLT 178
+ S + R A+P SVDWRK+GAV VK+QGSCGSCWAFST+ AVEGIN+IV+G+L
Sbjct: 124 KT-SDRYQPRVGDAIPDSVDWRKEGAVAAVKDQGSCGSCWAFSTIGAVEGINKIVTGDLI 182
Query: 179 SLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEV 238
SLSEQEL+DCDTS+N GCNGGLMDYAF++I+ +GG+ EEDYPY +G C+ ++ +V
Sbjct: 183 SLSEQELVDCDTSYNQGCNGGLMDYAFEFIIKNGGIDTEEDYPYKAADGRCDQTRKNAKV 242
Query: 239 VTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAV 298
VTI Y+DVPEN+E +L K LA+QP+SVAIEA G FQ YS GVF G CG ELDHGV AV
Sbjct: 243 VTIDAYEDVPENNEAALKKTLANQPISVAIEAGGRAFQLYSSGVFDGICGTELDHGVVAV 302
Query: 299 GYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLKK 350
GYG G DY IV+NSWG WGE GYI+M RN +P G CGI AS P+KK
Sbjct: 303 GYGTENGKDYWIVRNSWGGSWGESGYIKMARNIAEPTGKCGIAMEASYPIKK 354
>gi|326493368|dbj|BAJ85145.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 436
Score = 385 bits (989), Expect = e-104, Method: Compositional matrix adjust.
Identities = 182/331 (54%), Positives = 232/331 (70%), Gaps = 7/331 (2%)
Query: 23 SLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQR 82
+ A D SIV Y S +++ ++ WM++HG TY I E+ RFE F++NL++IDQ
Sbjct: 21 AAAADMSIVSYGER---SEEEVRRMYAEWMAEHGSTYNAIGEEERRFEAFRDNLRYIDQH 77
Query: 83 NKE----VTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVD 138
N V S+ LGLN FAD+++EE+++ YLG + + R+ SA + D LP+SVD
Sbjct: 78 NAAADAGVHSFRLGLNRFADLTNEEYRSTYLGARTKPDRERKLSARYQAADNDELPESVD 137
Query: 139 WRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNG 198
WRKKGAV VK+QG CGSCWAFS +AAVEGINQIV+G++ LSEQEL+DCDTS+N GCNG
Sbjct: 138 WRKKGAVGAVKDQGGCGSCWAFSAIAAVEGINQIVTGDMIPLSEQELVDCDTSYNQGCNG 197
Query: 199 GLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKA 258
GLMDYAF++I+ +GG+ EEDYPY + C+ K+ +VVTI GY+DVP N E+SL KA
Sbjct: 198 GLMDYAFEFIINNGGIDSEEDYPYKERDNRCDANKKNAKVVTIDGYEDVPVNSEKSLQKA 257
Query: 259 LAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPK 318
+A+QP+SVAIEA G FQ Y G+FTG CG LDHGVAAVGYG G DY +V+NSWG
Sbjct: 258 VANQPISVAIEAGGRAFQLYKSGIFTGTCGTALDHGVAAVGYGTENGKDYWLVRNSWGSV 317
Query: 319 WGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
WGE GYIRM+RN G CGI S P K
Sbjct: 318 WGEDGYIRMERNIKASSGKCGIAVEPSYPTK 348
>gi|302816222|ref|XP_002989790.1| hypothetical protein SELMODRAFT_184826 [Selaginella moellendorffii]
gi|300142356|gb|EFJ09057.1| hypothetical protein SELMODRAFT_184826 [Selaginella moellendorffii]
Length = 358
Score = 385 bits (989), Expect = e-104, Method: Compositional matrix adjust.
Identities = 182/310 (58%), Positives = 221/310 (71%), Gaps = 8/310 (2%)
Query: 47 LFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT-SYWLGLNEFADMSHEEFK 105
L+E WM HG+ Y I EK RF+IF++N ++I++ N++V +YWLGLN FADM+H+EFK
Sbjct: 33 LYEKWMVDHGRVYNGIGEKERRFQIFRDNAEYIEEHNRQVNQTYWLGLNNFADMTHDEFK 92
Query: 106 NKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAA 165
Y G K P + F Y+D LP DWR KGAV VKNQG+CGSCWAFSTVAA
Sbjct: 93 ALYFGTK--VPLSNTIKSGFRYKDATNLPLDTDWRSKGAVATVKNQGACGSCWAFSTVAA 150
Query: 166 VEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLME 225
VEG+NQIV+G L SLSEQEL+DCD N GCNGGLMD AF++I+ +GGL E DYPY
Sbjct: 151 VEGVNQIVTGELVSLSEQELVDCDKQKNQGCNGGLMDSAFEFIIQNGGLDSEADYPYKAV 210
Query: 226 EGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTG 285
G+C++ + VVTI G++DVP E LLKA+A+QPVSVAIEASG +FQ YSGGV+TG
Sbjct: 211 SGSCDESRRNSHVVTIDGFEDVPAESEADLLKAVANQPVSVAIEASGRNFQLYSGGVYTG 270
Query: 286 PCGAELDHGVAAVGYGKSK-----GSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGI 340
CG ELDHGV AVGYG SK +DY IV+NSWG WGE GYIR++RN P G CGI
Sbjct: 271 HCGYELDHGVVAVGYGTSKTPDGVATDYWIVRNSWGDAWGESGYIRLQRNVASPRGKCGI 330
Query: 341 NKMASIPLKK 350
MAS P+K
Sbjct: 331 AMMASYPVKN 340
>gi|113120269|gb|ABI30274.1| VS-B, partial [Vasconcellea stipulata]
Length = 341
Score = 385 bits (989), Expect = e-104, Method: Compositional matrix adjust.
Identities = 191/336 (56%), Positives = 243/336 (72%), Gaps = 9/336 (2%)
Query: 2 AFFSHSKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKC 61
FS SKL+ + LSL S A DFSIVGYS + LTS++ I LFESWM KH K YK
Sbjct: 3 TIFSISKLIFVVTCLSLHLGLSSA-DFSIVGYSQDDLTSIESSIRLFESWMLKHDKVYKT 61
Query: 62 IEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQ--FPTRR 119
I+EK++RFE FK+NL +ID+ NK+ SYWLGLNEFAD++H+EFK KY+G P+ +
Sbjct: 62 IDEKIYRFETFKDNLMYIDETNKKNNSYWLGLNEFADLTHDEFKEKYVGSIPEDSMIIEQ 121
Query: 120 QPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTS 179
EF + V P+S+DWR+KGAVTPVKNQ CGSCWAFSTVA VEGIN+IV+GNL S
Sbjct: 122 SDDVEFPNKHVVDYPESIDWRQKGAVTPVKNQNPCGSCWAFSTVATVEGINKIVTGNLIS 181
Query: 180 LSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVV 239
LSEQEL+DCD ++GC GG + KY+V G+H E++YPY ++G C K ++ V
Sbjct: 182 LSEQELLDCDRR-SHGCKGGYQTTSLKYVV-DNGVHTEKEYPYEKKQGNCRAKNKKGLKV 239
Query: 240 TISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVG 299
I+GY+ VP NDE SL+K ++ QPVSV +E+ G FQFY GGVF GPCG +LDH V AVG
Sbjct: 240 YINGYKRVPSNDEISLIKTISIQPVSVLVESKGRPFQFYKGGVFGGPCGTKLDHAVTAVG 299
Query: 300 YGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPE 335
YGK DYI++KNSWGPKWG++GYI++KR +G+ E
Sbjct: 300 YGK----DYILIKNSWGPKWGDKGYIKIKRASGQSE 331
>gi|226495425|ref|NP_001148706.1| cysteine protease 1 precursor [Zea mays]
gi|195621544|gb|ACG32602.1| cysteine protease 1 precursor [Zea mays]
Length = 463
Score = 384 bits (986), Expect = e-104, Method: Compositional matrix adjust.
Identities = 181/328 (55%), Positives = 235/328 (71%), Gaps = 7/328 (2%)
Query: 27 DFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKE- 85
D SIV Y S ++ ++ WM+ HG+TY + E+ R+++F++NL++ID N
Sbjct: 23 DMSIVSYGER---SXEEARRMYAEWMAAHGRTYNAVGEEERRYQVFRDNLRYIDAHNAAA 79
Query: 86 ---VTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKK 142
V S+ LGLN FAD++++E++ YLG + + R+ A + D + LP+SVDWR K
Sbjct: 80 DAGVHSFRLGLNRFADLTNDEYRATYLGARTRPQRERKLGARYHAADNEDLPESVDWRAK 139
Query: 143 GAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMD 202
GAV VK+QGSCGSCWAFST+AAVEGINQIV+G+L SLSEQEL+DCDTS+N GCNGGLMD
Sbjct: 140 GAVAEVKDQGSCGSCWAFSTIAAVEGINQIVTGDLISLSEQELVDCDTSYNQGCNGGLMD 199
Query: 203 YAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQ 262
YAF++I+ +GG+ E+DYPY +G C+ ++ +VVTI Y+DVP NDE+SL KA+A+Q
Sbjct: 200 YAFEFIINNGGIDTEKDYPYKGTDGRCDVNRKNAKVVTIDSYEDVPANDEKSLQKAVANQ 259
Query: 263 PVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGER 322
PVSVAIEA+GT FQ YS G+FTG CG LDHGV AVGYG G DY IVKNSWG WGE
Sbjct: 260 PVSVAIEAAGTAFQLYSSGIFTGSCGTALDHGVTAVGYGTENGKDYWIVKNSWGSSWGES 319
Query: 323 GYIRMKRNTGKPEGLCGINKMASIPLKK 350
GY+RM+RN G CGI S PLK+
Sbjct: 320 GYVRMERNIKASSGKCGIAVEPSYPLKE 347
>gi|386648112|gb|AFJ15103.1| mexicain-like cystein protease, partial [Jacaratia mexicana]
Length = 348
Score = 384 bits (986), Expect = e-104, Method: Compositional matrix adjust.
Identities = 191/347 (55%), Positives = 250/347 (72%), Gaps = 10/347 (2%)
Query: 5 SHSKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEE 64
S SKL+ ++ L + S A DFSIVGYS + LTS ++LI LFESWM KH + Y IEE
Sbjct: 6 SISKLIFVATCLIVHVGLSSA-DFSIVGYSQDDLTSTERLIRLFESWMLKHDRVYNNIEE 64
Query: 65 KLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLG-LKPQFPTRRQPS- 122
K+HRFEIFK+NL +ID+ NK+ SYWLGLNEF D++H+EFK KY+G + F T Q +
Sbjct: 65 KIHRFEIFKDNLMYIDETNKKNNSYWLGLNEFVDLTHDEFKEKYVGSIGEDFVTIEQSND 124
Query: 123 AEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSE 182
EF Y+ V P+S+DWR KGAVTPVK CGSCWAFSTVA VEGIN+IV+G L SLSE
Sbjct: 125 EEFPYKHVVDYPESIDWRDKGAVTPVK-PNPCGSCWAFSTVATVEGINKIVTGKLISLSE 183
Query: 183 QELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTIS 242
QEL+DCD ++GC GG + +Y+V G+H E++YPY ++G C K+++ V I+
Sbjct: 184 QELLDCDRR-SHGCKGGYQTTSLQYVV-DNGVHTEKEYPYEKKQGKCRAKEKKGTKVQIT 241
Query: 243 GYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGK 302
GY+ VP NDE SL++A+A+QPVSV +E+ G FQ Y GG+F GPCG +LDH V A+GYGK
Sbjct: 242 GYKRVPANDEISLIQAIANQPVSVLLESKGRAFQLYKGGIFNGPCGTKLDHAVTAIGYGK 301
Query: 303 SKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
+ YI++KNSWGP WGE+GY+++KR +GK EG CG+ K + P K
Sbjct: 302 T----YILIKNSWGPNWGEKGYLKIKRASGKSEGTCGVYKSSYFPTK 344
>gi|225428879|ref|XP_002285299.1| PREDICTED: cysteine proteinase RD21a-like [Vitis vinifera]
Length = 469
Score = 383 bits (984), Expect = e-104, Method: Compositional matrix adjust.
Identities = 182/331 (54%), Positives = 242/331 (73%), Gaps = 9/331 (2%)
Query: 27 DFSIVGYSPEHLTSMD-KLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKE 85
D SI+ Y D +++ ++E+W+ KHGK+Y + E+ RFEIFK+NL+ I++ N
Sbjct: 32 DMSIISYGDRLEKRTDAEVMAVYEAWLVKHGKSYNALGERERRFEIFKDNLRFIEEHNAV 91
Query: 86 VTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRR-----QPSAEFSYRDVKALPKSVDWR 140
+Y +GLN FAD+++EE++++YLG + + TRR + S +S+R + LP+SVDWR
Sbjct: 92 NRTYKVGLNRFADLTNEEYRSRYLGRRDE--TRRGLRASRVSDRYSFRAGEDLPESVDWR 149
Query: 141 KKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGL 200
+KGAV PVK+QG+CGSCWAFST+AAVEGINQI +G+L SLSEQEL+DCD S+N GCNGGL
Sbjct: 150 EKGAVVPVKDQGNCGSCWAFSTIAAVEGINQIATGDLISLSEQELVDCDKSYNQGCNGGL 209
Query: 201 MDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALA 260
MDYAF++I+ +GG+ EEDYPY + TC+ ++ VV+I GY+DVP+NDE+SL KA+A
Sbjct: 210 MDYAFEFIINNGGIDSEEDYPYRAADTTCDPNRKNARVVSIDGYEDVPQNDERSLKKAVA 269
Query: 261 HQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWG 320
+QPVSVAIEA G FQ Y GVFTG CG +LDHGV AVGYG DY IV+NSWGP WG
Sbjct: 270 NQPVSVAIEAGGRAFQLYQSGVFTGQCGTQLDHGVVAVGYGTENSVDYWIVRNSWGPNWG 329
Query: 321 ERGYIRMKRN-TGKPEGLCGINKMASIPLKK 350
E GYI+++RN G G CGI S P+K
Sbjct: 330 ESGYIKLERNLAGTETGKCGIAIEPSYPIKN 360
>gi|414585111|tpg|DAA35682.1| TPA: cysteine proteinase Mir3 [Zea mays]
Length = 468
Score = 383 bits (984), Expect = e-104, Method: Compositional matrix adjust.
Identities = 181/328 (55%), Positives = 235/328 (71%), Gaps = 7/328 (2%)
Query: 27 DFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKE- 85
D SIV Y S ++ ++ WM+ HG+TY + E+ R+++F++NL++ID N
Sbjct: 28 DMSIVSYGER---SDEEARRMYAEWMAAHGRTYNAVGEEERRYQVFRDNLRYIDAHNAAA 84
Query: 86 ---VTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKK 142
V S+ LGLN FAD++++E++ YLG + + R+ A + D + LP+SVDWR K
Sbjct: 85 DAGVHSFRLGLNRFADLTNDEYRATYLGARTRPQRERKLGARYHAADNEDLPESVDWRAK 144
Query: 143 GAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMD 202
GAV VK+QGSCGSCWAFST+AAVEGINQIV+G+L SLSEQEL+DCDTS+N GCNGGLMD
Sbjct: 145 GAVAEVKDQGSCGSCWAFSTIAAVEGINQIVTGDLISLSEQELVDCDTSYNQGCNGGLMD 204
Query: 203 YAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQ 262
YAF++I+ +GG+ E+DYPY +G C+ ++ +VVTI Y+DVP NDE+SL KA+A+Q
Sbjct: 205 YAFEFIINNGGIDTEKDYPYKGTDGRCDVNRKNAKVVTIDSYEDVPANDEKSLQKAVANQ 264
Query: 263 PVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGER 322
PVSVAIEA+GT FQ YS G+FTG CG LDHGV AVGYG G DY IVKNSWG WGE
Sbjct: 265 PVSVAIEAAGTAFQLYSSGIFTGSCGTALDHGVTAVGYGTENGKDYWIVKNSWGSSWGES 324
Query: 323 GYIRMKRNTGKPEGLCGINKMASIPLKK 350
GY+RM+RN G CGI S PLK+
Sbjct: 325 GYVRMERNIKASSGKCGIAVEPSYPLKE 352
>gi|386648114|gb|AFJ15104.1| mexicain-like cystein protease, partial [Jacaratia mexicana]
Length = 323
Score = 383 bits (984), Expect = e-104, Method: Compositional matrix adjust.
Identities = 182/325 (56%), Positives = 243/325 (74%), Gaps = 8/325 (2%)
Query: 27 DFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEV 86
DF+IVGYS + LTS+++L+ LFESW ++ K YK I+EK++RFEIFK+NL +ID+ NK+
Sbjct: 1 DFAIVGYSQDDLTSIERLVRLFESWTLENDKIYKNIDEKIYRFEIFKDNLMYIDETNKKN 60
Query: 87 TSYWLGLNEFADMSHEEFKNKYLGLKPQFPT--RRQPSAEFSYRDVKALPKSVDWRKKGA 144
+SYWLGLNEFAD++H+EFK KY+G + T + EF Y+ V P+S+DWR+KGA
Sbjct: 61 SSYWLGLNEFADLTHDEFKAKYVGSLGEDSTIIEQSDDEEFPYKHVVDYPESIDWRQKGA 120
Query: 145 VTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYA 204
VTPVKNQ CGSCWAFSTVA VEGIN+IV+G L SLSEQEL+DCD ++GC GG +
Sbjct: 121 VTPVKNQNPCGSCWAFSTVATVEGINKIVTGKLISLSEQELLDCDRR-SHGCKGGYQTTS 179
Query: 205 FKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPV 264
+Y VA G+H E++YPY ++G C K ++ V I+GY+ VP N+E SL++A+A+QPV
Sbjct: 180 LQY-VADNGVHTEKEYPYEKKQGKCRAKDKKGSKVKITGYKRVPANNEVSLIQAIANQPV 238
Query: 265 SVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGY 324
SV +E+ G FQFY GG+F GPCG ++DH V AVGYGK +YI++KNSWGPKWGE+GY
Sbjct: 239 SVVVESKGRAFQFYKGGIFEGPCGTKVDHAVTAVGYGK----NYILIKNSWGPKWGEKGY 294
Query: 325 IRMKRNTGKPEGLCGINKMASIPLK 349
IR+KR +GK +G CG+ + P K
Sbjct: 295 IRIKRASGKSKGTCGVYSSSYFPTK 319
>gi|255555337|ref|XP_002518705.1| cysteine protease, putative [Ricinus communis]
gi|223542086|gb|EEF43630.1| cysteine protease, putative [Ricinus communis]
Length = 471
Score = 383 bits (984), Expect = e-104, Method: Compositional matrix adjust.
Identities = 188/332 (56%), Positives = 234/332 (70%), Gaps = 8/332 (2%)
Query: 27 DFSIVGYSPEH-----LTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQ 81
D SIV Y+ +H L + ++ ++E W+ +HGK Y + EK RFEIFK+NL+ ID+
Sbjct: 25 DMSIVDYNIKHGTKYPLRTDSQVRRMYEMWLVEHGKAYNALGEKEKRFEIFKDNLRFIDE 84
Query: 82 RNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTR--RQPSAEFSYRDVKALPKSVDW 139
N SY +GLN FAD+++EE+K +LG K + R S + ++D LP++VDW
Sbjct: 85 HNSVDRSYKVGLNRFADLTNEEYKAMFLGTKMERKNRFLGTRSQRYLFKDGDDLPENVDW 144
Query: 140 RKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGG 199
R+KGAV PVK+QG CGSCWAFSTV AVEGINQIV+G L SLSEQEL+DCD S+N GCNGG
Sbjct: 145 REKGAVVPVKDQGQCGSCWAFSTVGAVEGINQIVTGELISLSEQELVDCDKSYNQGCNGG 204
Query: 200 LMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKAL 259
LMDYAF++I+ +GG+ EEDYPY + C+ ++ +VVTI GY+DVPENDE SL KA+
Sbjct: 205 LMDYAFEFIINNGGIDTEEDYPYKASDNICDPNRKNAKVVTIDGYEDVPENDENSLKKAV 264
Query: 260 AHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKW 319
AHQPVSVAIEA G FQ Y GVFTG CG ELDHGV AVGYG G +Y IV+NSWG W
Sbjct: 265 AHQPVSVAIEAGGRAFQLYKSGVFTGRCGTELDHGVVAVGYGTENGVNYWIVRNSWGSAW 324
Query: 320 GERGYIRMKRNTGKPE-GLCGINKMASIPLKK 350
GE GYIRM+RN + G CGI S P KK
Sbjct: 325 GESGYIRMERNVANTKTGKCGIAIQPSYPTKK 356
>gi|2414570|emb|CAB16317.1| cysteine proteinase precursor [Nicotiana tabacum]
Length = 374
Score = 382 bits (981), Expect = e-103, Method: Compositional matrix adjust.
Identities = 186/356 (52%), Positives = 247/356 (69%), Gaps = 14/356 (3%)
Query: 7 SKLLLLSLSLSLFACSSLAHDFSIVGYSPEHL-------TSMDKLIELFESWMSKHGKTY 59
+K ++ +L +LF+ S A D SI+ Y H + D++ +E W+++HG+ Y
Sbjct: 2 AKTIITTLLFALFSSLSYAIDMSIIDYKNNHYARKWTLQSDEDQVKNRYEMWLAEHGRAY 61
Query: 60 KCIEEKLHRFEIFKENLKHIDQRNKEVT-SYWLGLNEFADMSHEEFKNKYLGLKP----Q 114
+ EK RFEIFK+NL+ I+ N +Y +GLN+FAD+++EE++ YLG K +
Sbjct: 62 NALGEKEKRFEIFKDNLRFIEGHNNSGNRTYKVGLNQFADLTNEEYRTMYLGTKSDARRR 121
Query: 115 FPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVS 174
F + PS ++ R + +P SVDWRK+GAV P+KNQGSCGSCWAFSTVAAVEGINQIV+
Sbjct: 122 FVKSKNPSQRYASRPNELMPHSVDWRKRGAVAPIKNQGSCGSCWAFSTVAAVEGINQIVT 181
Query: 175 GNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKE 234
G + +LSEQEL+DCD N+GCNGGLMDYAF++I+++GG+ E+ YPY EG C+ ++
Sbjct: 182 GEMITLSEQELVDCDRVQNSGCNGGLMDYAFEFIISNGGMDTEKHYPYRGVEGRCDPVRK 241
Query: 235 EMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHG 294
+VV+I GY+DVP N E++L KA+AHQPV VAIEASG FQ YS GVFTG CG E+DHG
Sbjct: 242 NYKVVSIDGYEDVPRN-ERALQKAVAHQPVCVAIEASGRAFQLYSSGVFTGECGEEVDHG 300
Query: 295 VAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPE-GLCGINKMASIPLK 349
V VGYG G DY IV+NSWG KWGE GY++M+RN K G CGI AS P K
Sbjct: 301 VVVVGYGSEDGVDYWIVRNSWGTKWGENGYVKMERNVKKSHLGKCGIMTEASYPTK 356
>gi|356533293|ref|XP_003535200.1| PREDICTED: LOW QUALITY PROTEIN: cysteine proteinase RD21a-like
[Glycine max]
Length = 466
Score = 382 bits (981), Expect = e-103, Method: Compositional matrix adjust.
Identities = 184/332 (55%), Positives = 238/332 (71%), Gaps = 13/332 (3%)
Query: 25 AHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNK 84
A D SI+ Y H ++E+W+ KHGK Y + EK RF+IFK+NL+ I++ N
Sbjct: 31 AMDMSIIDYDESHTR------HVYEAWLVKHGKAYNALGEKERRFKIFKDNLRFIEEHNG 84
Query: 85 E-VTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRR-----QPSAEFSYRDVKALPKSVD 138
SY LGLN+FAD+++EE++ +LG + + P + + + ++YR + LP VD
Sbjct: 85 AGDKSYKLGLNKFADLTNEEYRAMFLGTRTRGPKNKAAVVAKKTDRYAYRAGEELPAMVD 144
Query: 139 WRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNG 198
WR+KGAVTP+K+QG CGSCWAFSTV AVEGINQIV+GNLTSLSEQEL+DCD +N GCNG
Sbjct: 145 WREKGAVTPIKDQGQCGSCWAFSTVGAVEGINQIVTGNLTSLSEQELVDCDRGYNMGCNG 204
Query: 199 GLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKA 258
GLMDYAF++IV +GG+ EEDYPY ++ TC+ ++ VVTI GY+DVP NDE+SL+KA
Sbjct: 205 GLMDYAFEFIVQNGGIDTEEDYPYHAKDNTCDPNRKNARVVTIDGYEDVPTNDEKSLMKA 264
Query: 259 LAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPK 318
+A+QPVSVAIEA G +FQ Y GVFTG CG LDHGV AVGYG G+DY +V+NSWG
Sbjct: 265 VANQPVSVAIEAGGMEFQLYQSGVFTGRCGTNLDHGVVAVGYGTENGTDYWLVRNSWGSA 324
Query: 319 WGERGYIRMKRNTGKPE-GLCGINKMASIPLK 349
WGE GYI+++RN E G CGI AS P+K
Sbjct: 325 WGENGYIKLERNVQNTETGKCGIAIEASYPIK 356
>gi|449448298|ref|XP_004141903.1| PREDICTED: germination-specific cysteine protease 1-like [Cucumis
sativus]
gi|449531757|ref|XP_004172852.1| PREDICTED: germination-specific cysteine protease 1-like [Cucumis
sativus]
Length = 365
Score = 381 bits (979), Expect = e-103, Method: Compositional matrix adjust.
Identities = 179/310 (57%), Positives = 227/310 (73%), Gaps = 7/310 (2%)
Query: 46 ELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFK 105
E+++ W++KHGK Y I+E+ RF+IFKENLK ID N E +Y +GLN FAD+++EE++
Sbjct: 33 EIYDLWLAKHGKAYNGIDEREKRFQIFKENLKFIDDHNSENRTYKVGLNMFADLTNEEYR 92
Query: 106 NKYLGLKPQFPTRR-----QPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAF 160
YLG + P RR S ++ ++ LP+S+DWR +GAV PVKNQGSCGSCWAF
Sbjct: 93 ALYLGTRSP-PARRVMKAKTASRRYAVNNLDRLPESMDWRTRGAVAPVKNQGSCGSCWAF 151
Query: 161 STVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDY 220
ST+AAVEGINQIV+G L SLSEQEL+ CD +N+GCNGGLMDYAF++I+ +GGL EEDY
Sbjct: 152 STIAAVEGINQIVTGELISLSEQELVSCDKKYNSGCNGGLMDYAFQFIIDNGGLDTEEDY 211
Query: 221 PYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSG 280
PY +G C+ ++ +VV+I Y+DVP NDE+SL KA+AHQPVSVAIEASG Q Y
Sbjct: 212 PYEAFDGQCDPTRKNAKVVSIDAYEDVPANDEESLKKAVAHQPVSVAIEASGLALQLYQS 271
Query: 281 GVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGK-PEGLCG 339
GVFTG CG+ LDHGV AVGYGK G DY +V+NSWG WGE GY +++RN EG CG
Sbjct: 272 GVFTGKCGSALDHGVVAVGYGKENGVDYWLVRNSWGTSWGEDGYFKLERNVKHITEGKCG 331
Query: 340 INKMASIPLK 349
I AS P+K
Sbjct: 332 IAMQASYPVK 341
>gi|302816909|ref|XP_002990132.1| hypothetical protein SELMODRAFT_428615 [Selaginella moellendorffii]
gi|300142145|gb|EFJ08849.1| hypothetical protein SELMODRAFT_428615 [Selaginella moellendorffii]
Length = 358
Score = 381 bits (979), Expect = e-103, Method: Compositional matrix adjust.
Identities = 181/310 (58%), Positives = 219/310 (70%), Gaps = 8/310 (2%)
Query: 47 LFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT-SYWLGLNEFADMSHEEFK 105
L+E WM HG+ Y I EK RF+IF++N ++I++ N++V +YWLGLN FADM+H+EFK
Sbjct: 33 LYEKWMVDHGRVYNGIGEKERRFQIFRDNAEYIEEHNRQVNQTYWLGLNNFADMTHDEFK 92
Query: 106 NKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAA 165
Y G K P + F Y D LP DWR KGAV VKNQG+CGSCWAFSTVAA
Sbjct: 93 ALYFGTK--VPLSNTIKSGFRYEDATNLPLDTDWRSKGAVATVKNQGACGSCWAFSTVAA 150
Query: 166 VEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLME 225
VEG+NQIV+G L SLSEQEL+DCD N GCNGGLMD AF++I+ +GGL E DYPY
Sbjct: 151 VEGVNQIVTGELVSLSEQELVDCDKQKNQGCNGGLMDSAFEFIIQNGGLDSEADYPYKAV 210
Query: 226 EGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTG 285
G+C++ + VVTI G++DVP E LLKA+A+QPVSVAIEASG +FQ YSGGV+TG
Sbjct: 211 SGSCDESRRNSHVVTIDGFEDVPAESEADLLKAVANQPVSVAIEASGRNFQLYSGGVYTG 270
Query: 286 PCGAELDHGVAAVGYGKSK-----GSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGI 340
CG ELDHGV AVGYG SK +DY IV+NSWG WGE GYIR++RN G CGI
Sbjct: 271 HCGYELDHGVVAVGYGTSKTPDGVATDYWIVRNSWGDAWGESGYIRLQRNVASSRGKCGI 330
Query: 341 NKMASIPLKK 350
MAS P+K
Sbjct: 331 AMMASYPVKN 340
>gi|194352754|emb|CAQ00105.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
gi|326513690|dbj|BAJ87864.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326514532|dbj|BAJ96253.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 463
Score = 381 bits (978), Expect = e-103, Method: Compositional matrix adjust.
Identities = 182/331 (54%), Positives = 232/331 (70%), Gaps = 7/331 (2%)
Query: 23 SLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQR 82
+ A D SIV Y S +++ ++ WM++HG TY I E+ RFE F++NL++IDQ
Sbjct: 21 AAAADMSIVSYGER---SEEEVRRMYAEWMAEHGSTYNAIGEEERRFEAFRDNLRYIDQH 77
Query: 83 NKE----VTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVD 138
N V S+ LGLN FAD+++EE+++ YLG + + R+ SA + D LP+SVD
Sbjct: 78 NAAADAGVHSFRLGLNRFADLTNEEYRSTYLGARTKPDRERKLSARYQAADNDELPESVD 137
Query: 139 WRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNG 198
WRKKGAV VK+QG CGSCWAFS +AAVEGINQIV+G++ LSEQEL+DCDTS+N GCNG
Sbjct: 138 WRKKGAVGAVKDQGGCGSCWAFSAIAAVEGINQIVTGDMIPLSEQELVDCDTSYNQGCNG 197
Query: 199 GLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKA 258
GLMDYAF++I+ +GG+ EEDYPY + C+ K+ +VVTI GY+DVP N E+SL KA
Sbjct: 198 GLMDYAFEFIINNGGIDSEEDYPYKERDNRCDANKKNAKVVTIDGYEDVPVNSEKSLQKA 257
Query: 259 LAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPK 318
+A+QP+SVAIEA G FQ Y G+FTG CG LDHGVAAVGYG G DY +V+NSWG
Sbjct: 258 VANQPISVAIEAGGRAFQLYKSGIFTGTCGTALDHGVAAVGYGTENGKDYWLVRNSWGSV 317
Query: 319 WGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
WGE GYIRM+RN G CGI S P K
Sbjct: 318 WGEDGYIRMERNIKASSGKCGIAVEPSYPTK 348
>gi|89274062|dbj|BAE80740.1| cysteine proteinase [Platycodon grandiflorus]
Length = 462
Score = 381 bits (978), Expect = e-103, Method: Compositional matrix adjust.
Identities = 186/350 (53%), Positives = 245/350 (70%), Gaps = 8/350 (2%)
Query: 7 SKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLT-----SMDKLIELFESWMSKHGKTYKC 61
S + ++L +LF SS A D SI+ Y H + + D+++ ++ESW+ KHGK+Y
Sbjct: 5 SPSMAIALLFALFVASS-ALDMSIINYDATHASKSSWRTDDEVMAMYESWLVKHGKSYNA 63
Query: 62 IEEKLHRFEIFKENLKHIDQRNKEVT-SYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQ 120
+ EK RF+IFK+NL+ ID+ N E SY +GLN FAD+++EE+++ YLG K + +
Sbjct: 64 LGEKEKRFQIFKDNLRFIDEHNAEENLSYKVGLNRFADLTNEEYRSTYLGAKSKPKLSKV 123
Query: 121 PSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSL 180
S ++ R +LP+SVDWR KGAV P+K+QGSCGSCWAFSTV AVEGINQIV+G L +L
Sbjct: 124 KSDRYAPRVGDSLPESVDWRAKGAVAPIKDQGSCGSCWAFSTVNAVEGINQIVTGELITL 183
Query: 181 SEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVT 240
SEQEL+DCD S+N GC+GGLMDY F++I+ +GG+ ++DYPYL + C+ ++ +VVT
Sbjct: 184 SEQELVDCDKSYNEGCDGGLMDYGFEFIINNGGIDTDKDYPYLGRDARCDQYRKNAKVVT 243
Query: 241 ISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGY 300
I Y+DVP N+E++L KA+A QPVSV IE G FQFY G+FTG CG LDHGV VGY
Sbjct: 244 IDSYEDVPVNNEEALKKAVASQPVSVGIEGGGRAFQFYDSGIFTGKCGTALDHGVNVVGY 303
Query: 301 GKSKGSDYIIVKNSWGPKWGERGYIRMKRN-TGKPEGLCGINKMASIPLK 349
G KG DY IV+NSWG WGE GYIRM+RN G G CGI S PLK
Sbjct: 304 GTEKGKDYWIVRNSWGSSWGEAGYIRMERNLAGTSVGKCGIAMEPSYPLK 353
>gi|50355617|dbj|BAD29957.1| cysteine protease [Daucus carota]
Length = 437
Score = 380 bits (976), Expect = e-103, Method: Compositional matrix adjust.
Identities = 180/336 (53%), Positives = 235/336 (69%), Gaps = 8/336 (2%)
Query: 23 SLAHDFSIVGYSPEHLTSM----DKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKH 78
+LA D SI+ Y H S+ D+++ ++ SW+ KHGK+Y + EK RF+IFK+NL++
Sbjct: 20 ALASDMSIINYDQTHTNSLIRTDDEVMTMYNSWLVKHGKSYNALGEKETRFQIFKDNLRY 79
Query: 79 IDQRNKEVT-SYWLGLNEFADMSHEEFKNKYLGLKPQFPT---RRQPSAEFSYRDVKALP 134
ID N + SY LGLN FAD+++EE++ KYLG K + + PS ++ + + LP
Sbjct: 80 IDNHNADPDRSYELGLNRFADLTNEEYRAKYLGTKSRESRPKLSKGPSDRYAPVEGEELP 139
Query: 135 KSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNN 194
S+DWR+KGAV VK+QGSCGSCWAFS + AVEGINQI +G L +LSEQEL+DCD S+N
Sbjct: 140 DSIDWREKGAVAAVKDQGSCGSCWAFSAIGAVEGINQITTGELITLSEQELVDCDRSYNE 199
Query: 195 GCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQS 254
GC GGLMDYAF +I+ +GG+ + DYPY +GTC KE +VVTI Y+DVP DE++
Sbjct: 200 GCEGGLMDYAFNFIIKNGGIDSDLDYPYTGRDGTCNQNKENAKVVTIDSYEDVPVYDEKA 259
Query: 255 LLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNS 314
L KA A+QP+SVAIEA G DFQ Y G+FTG CG +DHGV VGYG +G DY IV+NS
Sbjct: 260 LQKAAANQPISVAIEAGGMDFQLYVSGIFTGKCGTAVDHGVVVVGYGSEEGMDYWIVRNS 319
Query: 315 WGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLKK 350
WG WGE GY++M+RN GK GLCGI S P+K
Sbjct: 320 WGAAWGEAGYLKMQRNVGKSSGLCGITIEPSYPVKN 355
>gi|2511693|emb|CAB17076.1| cysteine proteinase precursor [Phaseolus vulgaris]
Length = 455
Score = 380 bits (975), Expect = e-103, Method: Compositional matrix adjust.
Identities = 188/344 (54%), Positives = 242/344 (70%), Gaps = 9/344 (2%)
Query: 14 LSLSLFACSSLAHDFSIVGYSPEH-----LTSMDKLIELFESWMSKHGKTYKCIEEKLHR 68
L +LFA SS A D SI+ Y H + +++ L+E W+ KHGK Y + EK R
Sbjct: 2 LLFALFALSS-ALDMSIISYDNAHQDKATWRTDEEVNSLYEEWLVKHGKLYNALGEKDKR 60
Query: 69 FEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLK--PQFPTRRQPSAEFS 126
F+IFK+NL+ IDQ+N E +Y LGLN FAD+++EE++ +YLG K P R PS ++
Sbjct: 61 FQIFKDNLRFIDQQNAENRTYKLGLNRFADLTNEEYRARYLGTKIDPNRRLGRTPSNRYA 120
Query: 127 YRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELI 186
R + LP SVDWRK+GAV PVK+Q SCGSCWAFS + AVEGIN+IV+G+L SLSEQEL+
Sbjct: 121 PRVGETLPDSVDWRKEGAVVPVKDQASCGSCWAFSAIGAVEGINKIVTGDLISLSEQELV 180
Query: 187 DCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQD 246
DCDT +N GCNGGLMDYAF++I+ +GG+ EEDYPY +G C++ ++ +VV+I GY+D
Sbjct: 181 DCDTGYNMGCNGGLMDYAFEFIIKNGGIDSEEDYPYKGVDGRCDEYRKNAKVVSIDGYED 240
Query: 247 VPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGS 306
V DE +L KA+A+QPVSVA+E G +FQ YS GVFTG CG LDHGV AVGYG G
Sbjct: 241 VNTYDELALKKAVANQPVSVAVEGGGREFQLYSSGVFTGRCGTALDHGVVAVGYGTDNGH 300
Query: 307 DYIIVKNSWGPKWGERGYIRMKRNTGKPE-GLCGINKMASIPLK 349
D+ IV+NSWG WGE GYIR++RN G G CGI S P+K
Sbjct: 301 DFWIVRNSWGADWGEEGYIRLERNLGNSRSGKCGIAIEPSYPIK 344
>gi|333069454|gb|AEF13978.1| chymopapain [Carica papaya]
Length = 352
Score = 380 bits (975), Expect = e-103, Method: Compositional matrix adjust.
Identities = 185/347 (53%), Positives = 245/347 (70%), Gaps = 5/347 (1%)
Query: 5 SHSKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEE 64
S SK++ L+ L + S A DF VGYS + LTS+++LI+LF+SWM KH K Y+ I+E
Sbjct: 6 SISKIIFLATCLIIHMSLSSA-DFYTVGYSQDDLTSIERLIQLFDSWMLKHNKIYESIDE 64
Query: 65 KLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQ--PS 122
K++RFEIF++NL +ID+ NK+ SYWLGLN FAD+S++EFK KY+G + T + +
Sbjct: 65 KIYRFEIFRDNLMYIDETNKKNNSYWLGLNGFADLSNDEFKKKYVGSVAEDFTGLEHFDN 124
Query: 123 AEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSE 182
+F+Y+ V P+S+DWR KGAVTPVKNQGSCGSCWAFST+A VEG+N+IV+GNL LSE
Sbjct: 125 EDFTYKHVTNYPQSIDWRAKGAVTPVKNQGSCGSCWAFSTIATVEGVNKIVTGNLLELSE 184
Query: 183 QELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTIS 242
QEL+DCD + ++GC GG + +Y VA G+H + YPY + C + V I+
Sbjct: 185 QELVDCDKN-SHGCKGGYQTTSLQY-VADNGVHTSKVYPYQAKAMQCRATDKPGPKVKIT 242
Query: 243 GYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGK 302
GY+ VP N E S L ALA+QP+SV +EA G FQ Y GVF GPCG +LDH V AVGYG
Sbjct: 243 GYKRVPSNCETSFLGALANQPLSVLVEAGGKPFQLYKSGVFDGPCGTKLDHAVTAVGYGT 302
Query: 303 SKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
S G +YII+KNSWGP WGE+GY+R+KR +G +G CG+ K + P K
Sbjct: 303 SDGKNYIIIKNSWGPNWGEKGYMRLKRQSGNSQGTCGVYKSSYYPFK 349
>gi|2507252|sp|P14080.2|PAPA2_CARPA RecName: Full=Chymopapain; AltName: Full=Papaya proteinase II;
Short=PPII; Flags: Precursor
gi|1332461|emb|CAA66378.1| chymopapain [Carica papaya]
Length = 352
Score = 380 bits (975), Expect = e-103, Method: Compositional matrix adjust.
Identities = 185/347 (53%), Positives = 245/347 (70%), Gaps = 5/347 (1%)
Query: 5 SHSKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEE 64
S SK++ L+ L + S A DF VGYS + LTS+++LI+LF+SWM KH K Y+ I+E
Sbjct: 6 SISKIIFLATCLIIHMGLSSA-DFYTVGYSQDDLTSIERLIQLFDSWMLKHNKIYESIDE 64
Query: 65 KLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQ--PS 122
K++RFEIF++NL +ID+ NK+ SYWLGLN FAD+S++EFK KY+G + T + +
Sbjct: 65 KIYRFEIFRDNLMYIDETNKKNNSYWLGLNGFADLSNDEFKKKYVGFVAEDFTGLEHFDN 124
Query: 123 AEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSE 182
+F+Y+ V P+S+DWR KGAVTPVKNQG+CGSCWAFST+A VEGIN+IV+GNL LSE
Sbjct: 125 EDFTYKHVTNYPQSIDWRAKGAVTPVKNQGACGSCWAFSTIATVEGINKIVTGNLLELSE 184
Query: 183 QELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTIS 242
QEL+DCD + GC GG + +Y VA+ G+H + YPY ++ C + V I+
Sbjct: 185 QELVDCD-KHSYGCKGGYQTTSLQY-VANNGVHTSKVYPYQAKQYKCRATDKPGPKVKIT 242
Query: 243 GYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGK 302
GY+ VP N E S L ALA+QP+SV +EA G FQ Y GVF GPCG +LDH V AVGYG
Sbjct: 243 GYKRVPSNCETSFLGALANQPLSVLVEAGGKPFQLYKSGVFDGPCGTKLDHAVTAVGYGT 302
Query: 303 SKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
S G +YII+KNSWGP WGE+GY+R+KR +G +G CG+ K + P K
Sbjct: 303 SDGKNYIIIKNSWGPNWGEKGYMRLKRQSGNSQGTCGVYKSSYYPFK 349
>gi|297792329|ref|XP_002864049.1| hypothetical protein ARALYDRAFT_495086 [Arabidopsis lyrata subsp.
lyrata]
gi|297309884|gb|EFH40308.1| hypothetical protein ARALYDRAFT_495086 [Arabidopsis lyrata subsp.
lyrata]
Length = 361
Score = 379 bits (973), Expect = e-102, Method: Compositional matrix adjust.
Identities = 192/347 (55%), Positives = 237/347 (68%), Gaps = 11/347 (3%)
Query: 8 KLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLH 67
+ ++L+L + + ++ + DF + + S D L EL+E W S H + +EEK
Sbjct: 3 RFIVLALCMLMVLETTKSLDFH-----EKDVESEDSLWELYERWKSHH-TIARSLEEKAK 56
Query: 68 RFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQ----FPTRRQPSA 123
RF +FK N+KHI + NK+ SY L LN+F DM+ EEF+ Y G + F RQ +
Sbjct: 57 RFNVFKHNVKHIHETNKKENSYKLKLNKFGDMTSEEFRRTYAGSNIKHHRMFQGERQTTK 116
Query: 124 EFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQ 183
F Y +V LP SVDWRK GAVTPVKNQG CGSCWAFSTV AVEGINQI + LTSLSEQ
Sbjct: 117 SFMYANVDTLPTSVDWRKNGAVTPVKNQGQCGSCWAFSTVVAVEGINQIRTKKLTSLSEQ 176
Query: 184 ELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISG 243
EL+DCDT+ N GCNGGLMD AF++I GGL E YPY + TC+ KE VV+I G
Sbjct: 177 ELVDCDTNKNQGCNGGLMDLAFEFIKEKGGLTSELVYPYKASDETCDTNKENAPVVSIDG 236
Query: 244 YQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKS 303
++DVP+N E L+KA+AHQPVSVAI+A G+DFQFYS GVFTG CG EL+HGVA VGYG +
Sbjct: 237 HEDVPKNSEVDLMKAVAHQPVSVAIDAGGSDFQFYSEGVFTGRCGTELNHGVAVVGYGTT 296
Query: 304 -KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
G+ Y IVKNSWG +WGE+GYIRM+R EGLCGI AS PLK
Sbjct: 297 IDGTKYWIVKNSWGEEWGEKGYIRMQRGIRHKEGLCGIAMEASYPLK 343
>gi|297852302|ref|XP_002894032.1| F2G19.31/F2G19.31 [Arabidopsis lyrata subsp. lyrata]
gi|297339874|gb|EFH70291.1| F2G19.31/F2G19.31 [Arabidopsis lyrata subsp. lyrata]
Length = 455
Score = 379 bits (972), Expect = e-102, Method: Compositional matrix adjust.
Identities = 184/347 (53%), Positives = 242/347 (69%), Gaps = 9/347 (2%)
Query: 11 LLSLSLSLFACSSLAHDFSIVGYSPEHLTSMD------KLIELFESWMSKHGKTYK--CI 62
++ L L++ A +S A D SI+ Y +H S +++ ++E+W+ KHGK +
Sbjct: 1 MVILFLAMVAVAS-AVDMSIISYDEKHGVSTTGGRSDAEVMSIYEAWLVKHGKAQNQNSL 59
Query: 63 EEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPS 122
EK RFEIFK+NL+ ID NK+ SY LGL FAD++++E+++KYLG K + R+ S
Sbjct: 60 VEKDRRFEIFKDNLRFIDDHNKKNLSYRLGLTRFADLTNDEYRSKYLGAKMEKKGERRTS 119
Query: 123 AEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSE 182
+ R LP+S+DWRKKGAV VK+QGSCGSCWAFST+ AVEGINQIV+G+L +LSE
Sbjct: 120 QRYEARVGDELPESIDWRKKGAVAEVKDQGSCGSCWAFSTIGAVEGINQIVTGDLITLSE 179
Query: 183 QELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTIS 242
QEL+DCDTS+N GCNGGLMDYAF++I+ +GG+ ++DYPY +GTC+ ++ +VVTI
Sbjct: 180 QELVDCDTSYNEGCNGGLMDYAFEFIIKNGGIDTDKDYPYKGVDGTCDQIRKNAKVVTID 239
Query: 243 GYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGK 302
Y+DVP E+SL KA+AHQPVSVAIEA G FQ Y G+F G CG +LDHGV AVGYG
Sbjct: 240 SYEDVPTYSEESLKKAVAHQPVSVAIEAGGRAFQLYDSGIFDGTCGTQLDHGVVAVGYGT 299
Query: 303 SKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
G DY IV+NSWG WGE GY++M RN G CGI S P+K
Sbjct: 300 ENGKDYWIVRNSWGKSWGESGYLKMARNIASSSGKCGIAIEPSYPIK 346
>gi|162459393|ref|NP_001105993.1| cysteine protease component of protease-inhibitor complex precursor
[Zea mays]
gi|6682829|dbj|BAA88898.1| cysteine protease component of protease-inhibitor complex [Zea
mays]
Length = 465
Score = 379 bits (972), Expect = e-102, Method: Compositional matrix adjust.
Identities = 180/328 (54%), Positives = 234/328 (71%), Gaps = 7/328 (2%)
Query: 27 DFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKE- 85
D SIV Y S ++ ++ WM+ HG+TY + E+ R+++F++NL++ID N
Sbjct: 26 DMSIVSYGER---SDEEARRMYAEWMAAHGRTYNAVGEEERRYQVFRDNLRYIDAHNAAA 82
Query: 86 ---VTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKK 142
V S+ LGLN FAD++++E++ YLG + + R+ A + D + LP+SVDWR K
Sbjct: 83 DAGVHSFRLGLNRFADLTNDEYRATYLGARTRPQRERKLGARYHAADNEDLPESVDWRAK 142
Query: 143 GAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMD 202
GAV VK+QGS GSCWAFST+AAVEGINQIV+G+L SLSEQEL+DCDTS+N GCNGGLMD
Sbjct: 143 GAVAEVKDQGSYGSCWAFSTIAAVEGINQIVTGDLISLSEQELVDCDTSYNQGCNGGLMD 202
Query: 203 YAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQ 262
YAF++I+ +GG+ E+DYPY +G C+ ++ +VVTI Y+DVP NDE+SL KA+A+Q
Sbjct: 203 YAFEFIINNGGIDTEKDYPYKGTDGRCDVNRKNAKVVTIDSYEDVPANDEKSLQKAVANQ 262
Query: 263 PVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGER 322
PVSVAIEA+GT FQ YS G+FTG CG LDHGV AVGYG G DY IVKNSWG WGE
Sbjct: 263 PVSVAIEAAGTQFQLYSSGIFTGSCGTALDHGVTAVGYGTENGKDYWIVKNSWGSSWGES 322
Query: 323 GYIRMKRNTGKPEGLCGINKMASIPLKK 350
GY+RM+RN G CGI S PLK+
Sbjct: 323 GYVRMERNIKASSGKCGIAVEPSYPLKE 350
>gi|34223513|gb|AAQ62999.1| oil palm polygalacturonase allergen PEST472 [Elaeis guineensis]
Length = 525
Score = 379 bits (972), Expect = e-102, Method: Compositional matrix adjust.
Identities = 184/335 (54%), Positives = 233/335 (69%), Gaps = 11/335 (3%)
Query: 27 DFSIVGYSPEHLT-----SMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQ 81
D SI+ Y H S +++ L+E W++KHG+ + EK RFEIFK+N++ ID
Sbjct: 24 DMSIISYDEAHGVQGLERSEEEMRLLYEGWLAKHGRADNALGEKERRFEIFKDNVRFIDA 83
Query: 82 RNKEV----TSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQP--SAEFSYRDVKALPK 135
N S+ LGLN FADM++EE++ YLG +P RR S + Y + LP+
Sbjct: 84 HNAAADSGHRSFRLGLNRFADMTNEEYRTVYLGTRPASHRRRARLGSDRYRYNAGEELPE 143
Query: 136 SVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNG 195
SVDWR KGAVT VK+QGSCGSCWAFST+AAVEGIN+IV+G+L SLSEQEL+DCD N G
Sbjct: 144 SVDWRDKGAVTTVKDQGSCGSCWAFSTIAAVEGINKIVTGDLISLSEQELVDCDNGQNQG 203
Query: 196 CNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSL 255
CNGGLMDYAF++I+ +GG+ EEDYPY +G C+ ++ +VV+I GY+DVP NDE++L
Sbjct: 204 CNGGLMDYAFEFIINNGGIDTEEDYPYKARDGKCDQYRKNAKVVSIDGYEDVPVNDEKAL 263
Query: 256 LKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSW 315
KA+A+QPVSVAIEA G +FQ Y G+FTG CG +LDHGV AVGYG G DY IV+NSW
Sbjct: 264 QKAVANQPVSVAIEAGGREFQLYHSGIFTGRCGTDLDHGVVAVGYGTENGKDYWIVRNSW 323
Query: 316 GPKWGERGYIRMKRNTGKPEGLCGINKMASIPLKK 350
G WGE GYIRM+RN G CGI +S P KK
Sbjct: 324 GGDWGESGYIRMERNVNASTGKCGIAMESSYPTKK 358
>gi|111073715|dbj|BAF02546.1| triticain alpha [Triticum aestivum]
gi|388890585|gb|AFK80346.1| cysteine endopeptidase EP alpha [Secale cereale x Triticum durum]
Length = 461
Score = 378 bits (971), Expect = e-102, Method: Compositional matrix adjust.
Identities = 178/327 (54%), Positives = 231/327 (70%), Gaps = 7/327 (2%)
Query: 27 DFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEV 86
D SIV Y S +++ ++ WMS+H +TY I E+ RFE+F++NL++IDQ N
Sbjct: 23 DMSIVSYGER---SEEEVRRMYAEWMSEHRRTYNAIGEEERRFEVFRDNLRYIDQHNAAA 79
Query: 87 T----SYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKK 142
S+ LGLN FAD+++EE+++ YLG + + R+ SA + D + LP++VDWRKK
Sbjct: 80 DAGLHSFRLGLNRFADLTNEEYRSTYLGARTKPDRERKLSARYQADDNEELPETVDWRKK 139
Query: 143 GAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMD 202
GAV +K+QG CGSCWAFS +AAVEGINQIV+G++ LSEQEL+DCDTS+N GCNGGLMD
Sbjct: 140 GAVAAIKDQGGCGSCWAFSAIAAVEGINQIVTGDMIPLSEQELVDCDTSYNEGCNGGLMD 199
Query: 203 YAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQ 262
YAF++I+ +GG+ EEDYPY + C+ K+ +VVTI GY+DVP N E+SL KA+A+Q
Sbjct: 200 YAFEFIINNGGIDSEEDYPYKERDNRCDANKKNAKVVTIDGYEDVPVNSEKSLQKAVANQ 259
Query: 263 PVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGER 322
P+SVAIEA G FQ Y G+FTG CG LDHGVAAVGYG G DY +V+NSWG WGE
Sbjct: 260 PISVAIEAGGRAFQLYKSGIFTGTCGTALDHGVAAVGYGTENGKDYWLVRNSWGTVWGED 319
Query: 323 GYIRMKRNTGKPEGLCGINKMASIPLK 349
GYIRM+RN G CGI S P K
Sbjct: 320 GYIRMERNIKASSGKCGIAVEPSYPTK 346
>gi|62320725|dbj|BAD95392.1| cysteine proteinase RD21A [Arabidopsis thaliana]
Length = 433
Score = 378 bits (970), Expect = e-102, Method: Compositional matrix adjust.
Identities = 183/357 (51%), Positives = 245/357 (68%), Gaps = 12/357 (3%)
Query: 1 MAFFSHSKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMD------KLIELFESWMSK 54
M F + +L L++ A SS A D SI+ Y +H S +++ ++E+W+ K
Sbjct: 1 MGFLKPTMAILF---LAMVAVSS-AVDMSIISYDEKHGVSTTGGRSEAEVMSIYEAWLVK 56
Query: 55 HGK--TYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLK 112
HGK + + EK RFEIFK+NL+ +D+ N++ SY LGL FAD++++E+++KYLG K
Sbjct: 57 HGKAQSQNSLVEKDRRFEIFKDNLRFVDEHNEKNLSYRLGLTRFADLTNDEYRSKYLGAK 116
Query: 113 PQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQI 172
+ R+ S + R LP+S+DWRKKGAV VK+QG CGSCWAFST+ AVEGINQI
Sbjct: 117 MEKKGERRTSLRYEARVGDELPESIDWRKKGAVAEVKDQGGCGSCWAFSTIGAVEGINQI 176
Query: 173 VSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDK 232
V+G+L +LSEQEL+DCDTS+N GCNGGLMDYAF++I+ +GG+ ++DYPY +GTC+
Sbjct: 177 VTGDLITLSEQELVDCDTSYNEGCNGGLMDYAFEFIIKNGGIDTDKDYPYKGVDGTCDQI 236
Query: 233 KEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELD 292
++ +VVTI Y+DVP E+SL KA+AHQP+S+AIEA G FQ Y G+F G CG +LD
Sbjct: 237 RKNAKVVTIDSYEDVPTYSEESLKKAVAHQPISIAIEAGGRAFQLYDSGIFDGSCGTQLD 296
Query: 293 HGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
HGV AVGYG G DY IV+NSWG WGE GY+RM RN G CGI S P+K
Sbjct: 297 HGVVAVGYGTENGKDYWIVRNSWGKSWGESGYLRMARNIASSSGKCGIAIEPSYPIK 353
>gi|4469153|emb|CAB38314.1| chymopapain isoform II [Carica papaya]
Length = 352
Score = 377 bits (969), Expect = e-102, Method: Compositional matrix adjust.
Identities = 184/347 (53%), Positives = 244/347 (70%), Gaps = 5/347 (1%)
Query: 5 SHSKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEE 64
S SK++ L+ L + S A DF VGYS + LTS+++LI+LF+SWM KH K Y+ I+E
Sbjct: 6 SISKIIFLATCLIIHMGLSSA-DFYTVGYSQDDLTSIERLIQLFDSWMLKHNKIYESIDE 64
Query: 65 KLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQ--PS 122
K++RFEIF++NL +ID+ NK+ SYWLGLN FAD+S++EFK KY+G + T + +
Sbjct: 65 KIYRFEIFRDNLMYIDETNKKNNSYWLGLNGFADLSNDEFKKKYVGFVAEDFTGLEHFDN 124
Query: 123 AEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSE 182
+F+Y+ V P+S+DWR KGAVTPVKNQG+CGSCWAFST+A VEGIN+IV+GNL LSE
Sbjct: 125 EDFTYKHVTNYPQSIDWRAKGAVTPVKNQGACGSCWAFSTIATVEGINKIVTGNLLELSE 184
Query: 183 QELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTIS 242
QEL+DCD + GC GG + +Y VA+ G+H + YPY ++ C + V I+
Sbjct: 185 QELVDCD-KHSYGCKGGYQTTSLQY-VANNGVHTSKVYPYQAKQYKCRATDKPGPKVKIT 242
Query: 243 GYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGK 302
GY+ VP N E S L ALA+QP+S +EA G FQ Y GVF GPCG +LDH V AVGYG
Sbjct: 243 GYKRVPSNCETSFLGALANQPLSFLVEAGGKPFQLYKSGVFDGPCGTKLDHAVTAVGYGT 302
Query: 303 SKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
S G +YII+KNSWGP WGE+GY+R+KR +G +G CG+ K + P K
Sbjct: 303 SDGKNYIIIKNSWGPNWGEKGYMRLKRQSGNSQGTCGVYKSSYYPFK 349
>gi|171702843|dbj|BAG16377.1| cysteine protease [Brassica rapa var. perviridis]
Length = 431
Score = 377 bits (969), Expect = e-102, Method: Compositional matrix adjust.
Identities = 184/340 (54%), Positives = 236/340 (69%), Gaps = 6/340 (1%)
Query: 14 LSLSLFACSSLAHDFSIVGYSPEHLTSMDK----LIELFESWMSKHGKTYKCIEEKLHRF 69
L L++ SS A D SI+ Y H T + + L+E W+ KHGK + EK RF
Sbjct: 5 LFLAMIVVSS-AMDMSIISYDKNHHTVSSRSDVEVSRLYEEWVVKHGKAQNSLTEKDRRF 63
Query: 70 EIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRD 129
EIFK+NL+ ID+ N + SY LGL +FAD++++E+++ YLG + + + S + R
Sbjct: 64 EIFKDNLRFIDEHNGKNLSYRLGLTKFADLTNDEYRSMYLGSRLKRKATKT-SLRYEARV 122
Query: 130 VKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCD 189
A+P+SVDWRK+GAV VK+QGSCGSCWAFST+ AVEGIN+IV+G+L SLSEQEL+DCD
Sbjct: 123 GDAIPESVDWRKEGAVAEVKDQGSCGSCWAFSTIGAVEGINKIVTGDLISLSEQELVDCD 182
Query: 190 TSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPE 249
TS+N GCNGGLMDYAF++I+ +GG+ EEDYPY +G C+ ++ +VVTI Y+DVP
Sbjct: 183 TSYNEGCNGGLMDYAFEFIIKNGGIDTEEDYPYKGVDGRCDQTRKNAKVVTIDSYEDVPA 242
Query: 250 NDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYI 309
N E+SL KAL+HQP+SVAIE G FQ Y G+F G CG +LDHGV AVGYG G DY
Sbjct: 243 NSEESLKKALSHQPISVAIEGGGRAFQLYDSGIFDGICGTDLDHGVVAVGYGTENGKDYW 302
Query: 310 IVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
IVKNSWG WGE GYIRM+RN G CGI S P+K
Sbjct: 303 IVKNSWGTSWGESGYIRMERNIASSAGKCGIAVEPSYPIK 342
>gi|18401614|ref|NP_564497.1| cysteine proteinase RD21a [Arabidopsis thaliana]
gi|1172873|sp|P43297.1|RD21A_ARATH RecName: Full=Cysteine proteinase RD21a; Short=RD21; Flags:
Precursor
gi|12321010|gb|AAG50628.1|AC083835_13 cysteine protease, putative [Arabidopsis thaliana]
gi|435619|dbj|BAA02374.1| thiol protease [Arabidopsis thaliana]
gi|18175926|gb|AAL59952.1| putative cysteine proteinase RD21A [Arabidopsis thaliana]
gi|22136972|gb|AAM91715.1| putative cysteine proteinase RD21A [Arabidopsis thaliana]
gi|332194014|gb|AEE32135.1| cysteine proteinase RD21a [Arabidopsis thaliana]
Length = 462
Score = 377 bits (969), Expect = e-102, Method: Compositional matrix adjust.
Identities = 183/357 (51%), Positives = 245/357 (68%), Gaps = 12/357 (3%)
Query: 1 MAFFSHSKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMD------KLIELFESWMSK 54
M F + +L L++ A SS A D SI+ Y +H S +++ ++E+W+ K
Sbjct: 1 MGFLKPTMAILF---LAMVAVSS-AVDMSIISYDEKHGVSTTGGRSEAEVMSIYEAWLVK 56
Query: 55 HGK--TYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLK 112
HGK + + EK RFEIFK+NL+ +D+ N++ SY LGL FAD++++E+++KYLG K
Sbjct: 57 HGKAQSQNSLVEKDRRFEIFKDNLRFVDEHNEKNLSYRLGLTRFADLTNDEYRSKYLGAK 116
Query: 113 PQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQI 172
+ R+ S + R LP+S+DWRKKGAV VK+QG CGSCWAFST+ AVEGINQI
Sbjct: 117 MEKKGERRTSLRYEARVGDELPESIDWRKKGAVAEVKDQGGCGSCWAFSTIGAVEGINQI 176
Query: 173 VSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDK 232
V+G+L +LSEQEL+DCDTS+N GCNGGLMDYAF++I+ +GG+ ++DYPY +GTC+
Sbjct: 177 VTGDLITLSEQELVDCDTSYNEGCNGGLMDYAFEFIIKNGGIDTDKDYPYKGVDGTCDQI 236
Query: 233 KEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELD 292
++ +VVTI Y+DVP E+SL KA+AHQP+S+AIEA G FQ Y G+F G CG +LD
Sbjct: 237 RKNAKVVTIDSYEDVPTYSEESLKKAVAHQPISIAIEAGGRAFQLYDSGIFDGSCGTQLD 296
Query: 293 HGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
HGV AVGYG G DY IV+NSWG WGE GY+RM RN G CGI S P+K
Sbjct: 297 HGVVAVGYGTENGKDYWIVRNSWGKSWGESGYLRMARNIASSSGKCGIAIEPSYPIK 353
>gi|148927382|gb|ABR19827.1| cysteine proteinase [Elaeis guineensis]
Length = 470
Score = 377 bits (969), Expect = e-102, Method: Compositional matrix adjust.
Identities = 184/335 (54%), Positives = 231/335 (68%), Gaps = 11/335 (3%)
Query: 27 DFSIVGYSPEHLT-----SMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQ 81
D SI+ Y H S +++ L+E W++KHG+ Y + EK RFEIFK+N+ ID
Sbjct: 24 DMSIISYDEAHGVRGLERSEEEMRILYEGWLAKHGRAYNALGEKERRFEIFKDNVLFIDA 83
Query: 82 RNKEV----TSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQP--SAEFSYRDVKALPK 135
N S+ LGLN FADM++EE++ YLG +P RR S + Y + LP+
Sbjct: 84 HNAAADAGHRSFRLGLNRFADMTNEEYRAVYLGTRPAGHRRRARVGSDRYRYNAGEDLPE 143
Query: 136 SVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNG 195
SVDWR KGAV VK+QGSCGSCWAFSTVAAVEGIN+IV+G+L SLSEQEL+DCD +N G
Sbjct: 144 SVDWRAKGAVAAVKDQGSCGSCWAFSTVAAVEGINKIVTGDLISLSEQELVDCDNGYNQG 203
Query: 196 CNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSL 255
CNGGLMDY F++I+ +GG+ EEDYPY +G C+ ++ +VV+I GY+DVP NDE++L
Sbjct: 204 CNGGLMDYGFEFIINNGGIDTEEDYPYTARDGKCDQYRKNAKVVSIDGYEDVPVNDEKAL 263
Query: 256 LKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSW 315
KA+A+QPVSVAIEA G +FQ Y G+FTG CG +LDHGV AVGYG G DY IV+NSW
Sbjct: 264 QKAVANQPVSVAIEAGGREFQLYHSGIFTGRCGTDLDHGVVAVGYGTENGKDYWIVRNSW 323
Query: 316 GPKWGERGYIRMKRNTGKPEGLCGINKMASIPLKK 350
G WGE GYIRM+RN G CGI S P KK
Sbjct: 324 GGDWGESGYIRMERNVNTSTGKCGIAIEPSYPTKK 358
>gi|356553978|ref|XP_003545327.1| PREDICTED: cysteine proteinase RD21a-like [Glycine max]
Length = 496
Score = 377 bits (968), Expect = e-102, Method: Compositional matrix adjust.
Identities = 190/348 (54%), Positives = 242/348 (69%), Gaps = 10/348 (2%)
Query: 9 LLLLSLSLSLFACSSLAHDFSIVGYSPEHLT---SMDKLIELFESWMSKHGKTYKCIEEK 65
+ + L ++FA SS A D SI+ Y H S ++L+ ++E W+ KHGK Y + EK
Sbjct: 38 MATILLLFTVFAVSS-ALDMSIISYDNAHAATSRSDEELMSMYEQWLVKHGKVYNALGEK 96
Query: 66 LHRFEIFKENLKHIDQRN-KEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRR---QP 121
RF+IFK+NL+ ID N +E +Y LGLN FAD+++EE++ KYLG K P RR P
Sbjct: 97 EKRFQIFKDNLRFIDDHNSQEDRTYKLGLNRFADLTNEEYRAKYLGTKID-PNRRLGKTP 155
Query: 122 SAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLS 181
S ++ R LP+SVDWRK+GAV PVK+QG CGSCWAFS + AVEGIN+IV+G L SLS
Sbjct: 156 SNRYAPRVGDKLPESVDWRKEGAVPPVKDQGGCGSCWAFSAIGAVEGINKIVTGELISLS 215
Query: 182 EQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTI 241
EQEL+DCDT +N GCNGGLMDYAF++I+ +GG+ EEDYPY +G C+ ++ +VV+I
Sbjct: 216 EQELVDCDTGYNEGCNGGLMDYAFEFIINNGGIDSEEDYPYRGVDGRCDTYRKNAKVVSI 275
Query: 242 SGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYG 301
Y+DVP DE +L KA+A+QPVSVAIE G +FQ Y GVFTG CG LDHGV AVGYG
Sbjct: 276 DDYEDVPAYDELALKKAVANQPVSVAIEGGGREFQLYVSGVFTGRCGTALDHGVVAVGYG 335
Query: 302 KSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPE-GLCGINKMASIPL 348
+ G DY IV+NSWGP WGE GYIR++RN G CGI S PL
Sbjct: 336 TANGHDYWIVRNSWGPSWGEDGYIRLERNLANSRSGKCGIAIEPSYPL 383
>gi|171702831|dbj|BAG16371.1| cysteine protease [Brassica oleracea var. italica]
Length = 441
Score = 377 bits (967), Expect = e-102, Method: Compositional matrix adjust.
Identities = 184/341 (53%), Positives = 237/341 (69%), Gaps = 6/341 (1%)
Query: 14 LSLSLFACSSLAHDFSIVGYSPEHLT----SMDKLIELFESWMSKHGKTYKCIEEKLHRF 69
L L++ SS A D SI+ Y H T S ++ L+E W+ KHGK + EK RF
Sbjct: 5 LFLTMIVVSS-AMDMSIISYDKNHHTVSSRSDAEVSRLYEEWLVKHGKAQNSLTEKDRRF 63
Query: 70 EIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRD 129
EIFK+NL+ ID+ N + SY LGL +FAD++++E+++ YLG + + + S + R
Sbjct: 64 EIFKDNLRFIDEHNGKNLSYRLGLTKFADLTNDEYRSMYLGSRLKRKATKS-SLRYEVRV 122
Query: 130 VKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCD 189
A+P+SVDWRK+GAV VK+QGSCGSCWAFST+ AVEGIN+IV+G+L +LSEQEL+DCD
Sbjct: 123 GDAIPESVDWRKEGAVAEVKDQGSCGSCWAFSTIGAVEGINKIVTGDLITLSEQELVDCD 182
Query: 190 TSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPE 249
TS+N GCNGGLMDYAF++I+ +GG+ EEDYPY +G C+ ++ +VVTI Y+DVP
Sbjct: 183 TSYNEGCNGGLMDYAFEFIINNGGIDTEEDYPYKGVDGRCDQTRKNAKVVTIDLYEDVPA 242
Query: 250 NDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYI 309
N E+SL KAL+HQP+SVAIE G FQ Y G+F G CG +LDHGV AVGYG G DY
Sbjct: 243 NSEESLKKALSHQPISVAIEGGGRAFQLYDSGIFDGICGTDLDHGVVAVGYGTENGKDYW 302
Query: 310 IVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLKK 350
IVKNSWG WGE GYIRM+RN G CGI S P+K
Sbjct: 303 IVKNSWGTSWGESGYIRMERNIASSAGKCGIAVEPSYPIKN 343
>gi|162463464|ref|NP_001104879.1| cysteine proteinase Mir3 precursor [Zea mays]
gi|2425066|gb|AAB88263.1| cysteine proteinase Mir3 [Zea mays]
Length = 480
Score = 376 bits (966), Expect = e-102, Method: Compositional matrix adjust.
Identities = 177/327 (54%), Positives = 232/327 (70%), Gaps = 7/327 (2%)
Query: 28 FSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKE-- 85
SIV Y + ++ ++ WM+ HG+TY + + R+++F++NL++ID N
Sbjct: 27 MSIVSYGER---TDEEARRMYAEWMAAHGRTYNAVGAEERRYQVFRDNLRYIDAHNAAAD 83
Query: 86 --VTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKG 143
V S+ LGLN FAD++++E+ YLG + + R+ A + D + LP+SVDWR KG
Sbjct: 84 AGVHSFRLGLNRFADLTNDEYPATYLGARTRPQRDRKLGARYHAADNEDLPESVDWRAKG 143
Query: 144 AVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDY 203
AV VK+QGSCG+CWAFST+AAVEGINQIV+G+L SLSEQEL+DCDTS+N GCNGGLMDY
Sbjct: 144 AVAEVKDQGSCGTCWAFSTIAAVEGINQIVTGDLISLSEQELVDCDTSYNQGCNGGLMDY 203
Query: 204 AFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQP 263
AF++I+ +GG+ E+DYPY +G C+ ++ +VVTI Y+DVP NDE+SL KA+A+QP
Sbjct: 204 AFEFIINNGGIDTEKDYPYKGTDGRCDVNRKNAKVVTIDSYEDVPANDEKSLQKAVANQP 263
Query: 264 VSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERG 323
VSVAIEA+GT FQ YS G+FTG CG LDHGV AVGYG G DY IVKNSWG WGE G
Sbjct: 264 VSVAIEAAGTAFQLYSSGIFTGSCGTRLDHGVTAVGYGTENGKDYWIVKNSWGSSWGESG 323
Query: 324 YIRMKRNTGKPEGLCGINKMASIPLKK 350
Y+RM+RN G CGI S PLK+
Sbjct: 324 YVRMERNIKASSGKCGIAVEPSYPLKE 350
>gi|14517542|gb|AAK62661.1| F2G19.31/F2G19.31 [Arabidopsis thaliana]
gi|19548039|gb|AAL87383.1| F2G19.31/F2G19.31 [Arabidopsis thaliana]
Length = 462
Score = 376 bits (965), Expect = e-102, Method: Compositional matrix adjust.
Identities = 182/357 (50%), Positives = 244/357 (68%), Gaps = 12/357 (3%)
Query: 1 MAFFSHSKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMD------KLIELFESWMSK 54
M F + +L L++ SS A D SI+ Y +H S +++ ++E+W+ K
Sbjct: 1 MGFLKPTMAILF---LAMVTVSS-AVDMSIISYDEKHGVSTTGGRSEAEVMSIYEAWLVK 56
Query: 55 HGK--TYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLK 112
HGK + + EK RFEIFK+NL+ +D+ N++ SY LGL FAD++++E+++KYLG K
Sbjct: 57 HGKAQSQNSLVEKDRRFEIFKDNLRFVDEHNEKNLSYRLGLTRFADLTNDEYRSKYLGAK 116
Query: 113 PQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQI 172
+ R+ S + R LP+S+DWRKKGAV VK+QG CGSCWAFST+ AVEGINQI
Sbjct: 117 MEKKGERRTSLRYEARVGDELPESIDWRKKGAVAEVKDQGGCGSCWAFSTIGAVEGINQI 176
Query: 173 VSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDK 232
V+G+L +LSEQEL+DCDTS+N GCNGGLMDYAF++I+ +GG+ ++DYPY +GTC+
Sbjct: 177 VTGDLITLSEQELVDCDTSYNEGCNGGLMDYAFEFIIKNGGIDTDKDYPYKGVDGTCDQI 236
Query: 233 KEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELD 292
++ +VVTI Y+DVP E+SL KA+AHQP+S+AIEA G FQ Y G+F G CG +LD
Sbjct: 237 RKNAKVVTIDSYEDVPTYSEESLKKAVAHQPISIAIEAGGRAFQLYDSGIFDGSCGTQLD 296
Query: 293 HGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
HGV AVGYG G DY IV+NSWG WGE GY+RM RN G CGI S P+K
Sbjct: 297 HGVVAVGYGTENGKDYWIVRNSWGKSWGESGYLRMARNIASSSGKCGIAIEPSYPIK 353
>gi|18141285|gb|AAL60580.1|AF454958_1 senescence-associated cysteine protease [Brassica oleracea]
Length = 485
Score = 376 bits (965), Expect = e-101, Method: Compositional matrix adjust.
Identities = 185/353 (52%), Positives = 242/353 (68%), Gaps = 9/353 (2%)
Query: 1 MAFFSHSKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLT----SMDKLIELFESWMSKHG 56
M + + ++L L++ SS A D SI+ Y H T S ++ L+E W+ KHG
Sbjct: 1 MKLLNSATVILF---LTMIVVSS-AMDMSIISYDKNHHTVSSRSDAEVSRLYEEWLVKHG 56
Query: 57 KTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFP 116
K + EK RFEIFK+NL+ ID+ N + SY LGL +FAD++++E+++ YLG + +
Sbjct: 57 KAQNSLTEKDRRFEIFKDNLRFIDEHNGKNLSYRLGLTKFADLTNDEYRSMYLGSRLKRK 116
Query: 117 TRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGN 176
+ S + R A+P+SVDWRK+GAV VK+QGSCGSCWAFST+ AVEGIN+IV+G+
Sbjct: 117 ATKS-SLRYEVRVGDAIPESVDWRKEGAVAEVKDQGSCGSCWAFSTIGAVEGINKIVTGD 175
Query: 177 LTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEM 236
L +LSEQEL+DCDTS+N GCNGGLMDYAF++I+ +GG+ EEDYPY +G C+ ++
Sbjct: 176 LITLSEQELVDCDTSYNEGCNGGLMDYAFEFIINNGGIDTEEDYPYKGVDGRCDQTRKNA 235
Query: 237 EVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVA 296
+VVTI Y+DVP N E+SL KAL+HQP+SVAIE G FQ Y G+F G CG +LDHGV
Sbjct: 236 KVVTIDLYEDVPANSEESLKKALSHQPISVAIEGGGRAFQLYDSGIFDGICGTDLDHGVV 295
Query: 297 AVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
AVGYG G DY IVKNSWG WGE GYIRM+RN G CGI S P+K
Sbjct: 296 AVGYGTENGKDYWIVKNSWGTSWGESGYIRMERNIASSAGKCGIAVEPSYPIK 348
>gi|413919736|gb|AFW59668.1| cysteine protease 1 [Zea mays]
Length = 469
Score = 376 bits (965), Expect = e-101, Method: Compositional matrix adjust.
Identities = 177/327 (54%), Positives = 231/327 (70%), Gaps = 7/327 (2%)
Query: 28 FSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKE-- 85
SIV Y S ++ ++ WM+ HG+TY + E+ RFE+F++NL+++D N
Sbjct: 29 MSIVSYGER---SEEEARRMYAEWMAAHGRTYNAVGEEERRFEVFRDNLRYVDAHNAAAD 85
Query: 86 --VTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKG 143
V S+ LGLN FAD++++E++ YLG++ + R+ + D + LP+SVDWR KG
Sbjct: 86 AGVHSFRLGLNRFADLTNDEYRATYLGVRSRPQRERRLGDRYLAGDNEDLPESVDWRAKG 145
Query: 144 AVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDY 203
AV VK+QGSCGSCWAFST+AAVEGINQIV+G++ SLSEQEL+DCDTS+N GCNGGLMDY
Sbjct: 146 AVAEVKDQGSCGSCWAFSTIAAVEGINQIVTGDMISLSEQELVDCDTSYNQGCNGGLMDY 205
Query: 204 AFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQP 263
AF++I+ +GG+ EEDYPY +G C+ ++ +VVTI Y+DVP N E+SL KA+A+QP
Sbjct: 206 AFEFIINNGGIDTEEDYPYKGTDGRCDVNRKNAKVVTIDSYEDVPANSEKSLQKAVANQP 265
Query: 264 VSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERG 323
+SVAIEA G FQ Y+ G+FTG CG LDHGV AVGYG G DY IVKNSWG WGE G
Sbjct: 266 ISVAIEAGGRAFQLYNSGIFTGTCGTALDHGVTAVGYGTENGKDYWIVKNSWGSSWGESG 325
Query: 324 YIRMKRNTGKPEGLCGINKMASIPLKK 350
Y+RM+RN G CGI S PLKK
Sbjct: 326 YVRMERNIKASSGKCGIAVEPSYPLKK 352
>gi|226496089|ref|NP_001149658.1| cysteine protease 1 precursor [Zea mays]
gi|195629242|gb|ACG36262.1| cysteine protease 1 precursor [Zea mays]
Length = 469
Score = 376 bits (965), Expect = e-101, Method: Compositional matrix adjust.
Identities = 176/327 (53%), Positives = 231/327 (70%), Gaps = 7/327 (2%)
Query: 28 FSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKE-- 85
SIV Y S ++ ++ WM+ HG+TY + E+ RFE+F++NL+++D N
Sbjct: 29 MSIVSYGER---SEEEARRMYAEWMAAHGRTYNAVGEEERRFEVFRDNLRYVDAHNAAAD 85
Query: 86 --VTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKG 143
V S+ LGLN FAD++++E++ YLG++ + R+ + D + LP+SVDWR KG
Sbjct: 86 AGVHSFRLGLNRFADLTNDEYRATYLGVRSRPQRERRLGDRYLAGDNEDLPESVDWRAKG 145
Query: 144 AVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDY 203
AV +K+QGSCGSCWAFST+AAVEGINQIV+G++ SLSEQEL+DCDTS+N GCNGGLMDY
Sbjct: 146 AVAEIKDQGSCGSCWAFSTIAAVEGINQIVTGDMISLSEQELVDCDTSYNQGCNGGLMDY 205
Query: 204 AFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQP 263
AF++I+ +GG+ EEDYPY +G C+ ++ +VVTI Y+DVP N E+SL KA+A+QP
Sbjct: 206 AFEFIINNGGIDTEEDYPYKGTDGRCDVNRKNAKVVTIDSYEDVPANSEKSLQKAVANQP 265
Query: 264 VSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERG 323
+SVAIEA G FQ Y+ G+FTG CG LDHGV AVGYG G DY IVKNSWG WGE G
Sbjct: 266 ISVAIEAGGRAFQLYNSGIFTGTCGTALDHGVTAVGYGTENGKDYWIVKNSWGSSWGESG 325
Query: 324 YIRMKRNTGKPEGLCGINKMASIPLKK 350
Y+RM+RN G CGI S PLKK
Sbjct: 326 YVRMERNIKASSGKCGIAVEPSYPLKK 352
>gi|357166359|ref|XP_003580684.1| PREDICTED: oryzain alpha chain-like [Brachypodium distachyon]
Length = 456
Score = 375 bits (964), Expect = e-101, Method: Compositional matrix adjust.
Identities = 177/328 (53%), Positives = 233/328 (71%), Gaps = 7/328 (2%)
Query: 27 DFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEV 86
D SIV Y S +++ ++ WM+++G+TY I E+ RFE+F++NL+++DQ N
Sbjct: 24 DMSIVSYGER---SEEEVRRMYVEWMAENGRTYNAIGEEERRFEVFRDNLRYVDQHNAAA 80
Query: 87 T----SYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKK 142
S+ LGLN FAD+++EE+++ YLG++ + R+ S + D + LP+SVDWR+K
Sbjct: 81 DAGLHSFRLGLNRFADLTNEEYRDTYLGVRTKPVRERRLSGRYQAADNEELPESVDWREK 140
Query: 143 GAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMD 202
GAV VK+QG CGSCWAFS +AAVEGINQIV+G++ +LSEQEL+DCDTS+N GCNGGLMD
Sbjct: 141 GAVAKVKDQGGCGSCWAFSAIAAVEGINQIVTGDMIALSEQELVDCDTSYNQGCNGGLMD 200
Query: 203 YAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQ 262
YAF++I+ +GG+ EEDYPY + C+ K+ +VVTI GY+DVP N E SL KA+A+Q
Sbjct: 201 YAFEFIINNGGIDSEEDYPYKERDNRCDANKKNAKVVTIDGYEDVPVNSELSLKKAVANQ 260
Query: 263 PVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGER 322
P+SVAIEA G FQ Y G+FTG CG LDHGV AVGYG G DY IVKNSWG WGE
Sbjct: 261 PISVAIEAGGRAFQLYKSGIFTGRCGTALDHGVTAVGYGSENGKDYWIVKNSWGTVWGED 320
Query: 323 GYIRMKRNTGKPEGLCGINKMASIPLKK 350
GY+R++RN G CGI S PLKK
Sbjct: 321 GYVRLERNIKATSGKCGIAIEPSYPLKK 348
>gi|356564154|ref|XP_003550321.1| PREDICTED: cysteine proteinase RD21a-like [Glycine max]
Length = 476
Score = 375 bits (964), Expect = e-101, Method: Compositional matrix adjust.
Identities = 189/351 (53%), Positives = 241/351 (68%), Gaps = 13/351 (3%)
Query: 9 LLLLSLSLSLFACSSLAHDFSIVGYSPEH------LTSMDKLIELFESWMSKHGKTYKCI 62
+ + L ++FA SS A D SI+ Y H L + ++L+ ++E W+ KHGK Y +
Sbjct: 15 MAAIVLLFTVFAVSS-ALDMSIISYDSAHADKAATLRTEEELMSMYEQWLVKHGKVYNAL 73
Query: 63 EEKLHRFEIFKENLKHIDQRNK-EVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRR-- 119
EK RF+IFK+NL+ ID N E +Y LGLN FAD+++EE++ KYLG K P RR
Sbjct: 74 GEKEKRFQIFKDNLRFIDDHNSAEDRTYKLGLNRFADLTNEEYRAKYLGTKID-PNRRLG 132
Query: 120 -QPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLT 178
PS ++ R LP SVDWRK+GAV PVK+QG CGSCWAFS + AVEGIN+IV+G L
Sbjct: 133 KTPSNRYAPRVGDKLPDSVDWRKEGAVPPVKDQGGCGSCWAFSAIGAVEGINKIVTGELI 192
Query: 179 SLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEV 238
SLSEQEL+DCDT +N GCNGGLMDYAF++I+ +GG+ +EDYPY +G C+ ++ +V
Sbjct: 193 SLSEQELVDCDTGYNQGCNGGLMDYAFEFIINNGGIDSDEDYPYRGVDGRCDTYRKNAKV 252
Query: 239 VTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAV 298
V+I Y+DVP DE +L KA+A+QPVSVAIE G +FQ Y GVFTG CG LDHGV AV
Sbjct: 253 VSIDDYEDVPAYDELALKKAVANQPVSVAIEGGGREFQLYVSGVFTGRCGTALDHGVVAV 312
Query: 299 GYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPE-GLCGINKMASIPL 348
GYG +KG DY IV+NSWG WGE GYIR++RN G CGI S PL
Sbjct: 313 GYGTAKGHDYWIVRNSWGSSWGEDGYIRLERNLANSRSGKCGIAIEPSYPL 363
>gi|374713649|gb|AEZ65082.1| cysteine protease [Carica papaya]
Length = 471
Score = 375 bits (962), Expect = e-101, Method: Compositional matrix adjust.
Identities = 187/348 (53%), Positives = 236/348 (67%), Gaps = 7/348 (2%)
Query: 9 LLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHR 68
L + SLSL + S + +D T +++++E W+ KHGK Y I EK R
Sbjct: 14 FLFMVFSLSLASMSIIDYDLPADPLQSTERTEA-HMMKMYEHWLVKHGKNYNAIGEKERR 72
Query: 69 FEIFKENLKHIDQRNK-EVTSYWLGLNEFADMSHEEFKNKYLGLK----PQFPTRRQPSA 123
FEIFK+NL+ +D++N +Y LGL +FAD+++EE++ YLG K + T R
Sbjct: 73 FEIFKDNLRFVDEQNSVPGRTYKLGLTKFADLTNEEYRAMYLGAKMEKKEKLRTERSQRY 132
Query: 124 EFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQ 183
+ LP VDWR+KGAVT VK+QG CGSCWAFSTV +VEGINQIV+G+L SLSEQ
Sbjct: 133 LHKAGNDDDLPSHVDWREKGAVTEVKDQGQCGSCWAFSTVGSVEGINQIVTGDLISLSEQ 192
Query: 184 ELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISG 243
EL+DCD ++N GCNGGLMDYAF++I+ +GG+ E DYPY + C+ ++ VVTI G
Sbjct: 193 ELVDCDKAYNQGCNGGLMDYAFEFIIKNGGIDSEADYPYRASDNMCDSNRKNAHVVTIDG 252
Query: 244 YQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKS 303
Y+DVPENDE+SL KA+A+QPVSVAIEA G +FQ Y GVFTG CG LDHGV AVGYG
Sbjct: 253 YEDVPENDEESLKKAVANQPVSVAIEAGGREFQLYQSGVFTGRCGTNLDHGVVAVGYGTE 312
Query: 304 KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPE-GLCGINKMASIPLKK 350
G DY IV+NSWGPKWGE GYIRM+RN + G CGI AS P KK
Sbjct: 313 NGIDYWIVRNSWGPKWGESGYIRMERNVASTDTGKCGIAMEASYPTKK 360
>gi|146215996|gb|ABQ10200.1| cysteine protease Cp2 [Actinidia deliciosa]
Length = 376
Score = 374 bits (961), Expect = e-101, Method: Compositional matrix adjust.
Identities = 179/336 (53%), Positives = 239/336 (71%), Gaps = 10/336 (2%)
Query: 25 AHDFSIVGYS--PEHLTSM---DKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHI 79
A SI+ Y+ P H +S ++++ ++ W++KHGK Y I E+ RFEIFK+NLK +
Sbjct: 19 AAHMSIIDYNTNPNHKSSSRTDEEVMGIYAEWLAKHGKAYNGIGERERRFEIFKDNLKFV 78
Query: 80 DQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKP----QFPTRRQPSAEFSYRDVKALPK 135
D+ N E SY +GLN FAD+++EE+++ +LG K +F + S ++ +D LP+
Sbjct: 79 DEHNSENRSYKVGLNRFADLTNEEYRSMFLGTKTDSKRRFMKSKSASRRYAVQDSDMLPE 138
Query: 136 SVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNG 195
SVDWR+ GAV P+K+QGSCGSCWAFSTVAAVEG+NQI +G + LSEQEL+DCD +++ G
Sbjct: 139 SVDWRESGAVAPIKDQGSCGSCWAFSTVAAVEGVNQIATGEMIQLSEQELVDCDRTYDAG 198
Query: 196 CNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSL 255
CNGGLMDYAF++I+ +GG+ EEDYPY +GTC+ +++ +VV+I+ Y+DVP DE +L
Sbjct: 199 CNGGLMDYAFEFIINNGGIDTEEDYPYRGVDGTCDPERKNTKVVSINDYEDVPPYDEMAL 258
Query: 256 LKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSW 315
KA+AHQPVSVAIEASG FQ Y GVFTG CG LDHGV VGYG G+D+ IV+NSW
Sbjct: 259 KKAVAHQPVSVAIEASGRAFQLYLSGVFTGECGRALDHGVVVVGYGTDNGADHWIVRNSW 318
Query: 316 GPKWGERGYIRMKRN-TGKPEGLCGINKMASIPLKK 350
G WGE GYIRM+RN G CGI AS P+K
Sbjct: 319 GTSWGENGYIRMERNVVDNFGGKCGIAMQASYPIKN 354
>gi|449438381|ref|XP_004136967.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
Length = 479
Score = 374 bits (960), Expect = e-101, Method: Compositional matrix adjust.
Identities = 183/324 (56%), Positives = 228/324 (70%), Gaps = 7/324 (2%)
Query: 31 VGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYW 90
+ +S H +++ L+ESW+ HGK Y I EK RFEIFK+NL+ ID+ N+E +Y
Sbjct: 45 IPHSDAHQRPDEEVAALYESWLVHHGKAYNAIGEKERRFEIFKDNLRFIDEHNRESRTYK 104
Query: 91 LGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKAL----PKSVDWRKKGAVT 146
+GL FAD+++EE++ ++LG + F + + SA S R AL P VDWRKKGAV
Sbjct: 105 VGLTRFADLTNEEYRARFLGGR--FSRKPRLSAAKSGRYAAALGDDLPDDVDWRKKGAVA 162
Query: 147 PVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFK 206
VK+QG CGSCWAFS+VAAVEGINQIV+G L LSEQEL+DCD SFN GCNGGLMDYAF+
Sbjct: 163 TVKDQGQCGSCWAFSSVAAVEGINQIVTGELIPLSEQELVDCDKSFNMGCNGGLMDYAFQ 222
Query: 207 YIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSV 266
+I+ +GG+ EEDYPY + C+ ++ +VVTI GY+DVPENDE SL KA+A+QPVSV
Sbjct: 223 FIIGNGGIDTEEDYPYKGRDAACDPNRKNAKVVTIDGYEDVPENDESSLKKAVANQPVSV 282
Query: 267 AIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIR 326
AIEA G FQ Y GVFTG CG +LDHGV AVGYG G+DY IV+NSWG WGE GYIR
Sbjct: 283 AIEAGGRAFQLYQSGVFTGRCGTDLDHGVVAVGYGTDNGTDYWIVRNSWGKDWGESGYIR 342
Query: 327 MKRNTGK-PEGLCGINKMASIPLK 349
++RN G CGI S P K
Sbjct: 343 LERNVANITTGKCGIAVQPSYPTK 366
>gi|5777889|emb|CAB53515.1| cysteine protease [Solanum tuberosum]
Length = 466
Score = 374 bits (960), Expect = e-101, Method: Compositional matrix adjust.
Identities = 186/353 (52%), Positives = 246/353 (69%), Gaps = 9/353 (2%)
Query: 5 SHSKLLLLSLSLSL-FACSSLAHDFSIVGYSPEHL--TSMDKLIELFESWMSKHGKTYKC 61
+HS L +SL L L F+ S A D SI+ Y H+ S D++ L+ESW+ +HGK+Y
Sbjct: 3 AHSSTLTISLLLMLIFSTLSSASDMSIISYDETHIHHRSDDEVSALYESWLIEHGKSYNA 62
Query: 62 IEEKLHRFEIFKENLKHIDQRNKEVT-SYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQ 120
+ EK RF+IFK+NLK+ID++N SY LGL +FAD+++EE+++ YLG K RR+
Sbjct: 63 LGEKDKRFQIFKDNLKYIDEQNSVPNQSYKLGLTKFADLTNEEYRSIYLGTKSS-GDRRK 121
Query: 121 PSAEFSYRDV----KALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGN 176
S S R + +LP+SVDWR KG + VK+QGSCGSCWAFS VAA+E IN IV+GN
Sbjct: 122 LSKNKSDRYLPKVGDSLPESVDWRDKGVLVGVKDQGSCGSCWAFSAVAAMESINAIVTGN 181
Query: 177 LTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEM 236
L SLSEQEL+DCD S+N GC+GGLMDYAF++++ +GG+ EEDYPY C+ ++
Sbjct: 182 LISLSEQELVDCDKSYNEGCDGGLMDYAFEFVINNGGIDTEEDYPYKERNDVCDQYRKNA 241
Query: 237 EVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVA 296
+VV I Y+DVP N+E++L KA+AHQPVS+AIEA G D Q Y G+FTG CG +DHGV
Sbjct: 242 KVVKIDSYEDVPVNNEKALQKAVAHQPVSIAIEAGGRDLQHYKSGIFTGKCGTAVDHGVV 301
Query: 297 AVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
A GYG G DY IV+NSWG KWGE+GY+R++RN GLCG+ S P+K
Sbjct: 302 AAGYGSENGMDYWIVRNSWGAKWGEKGYLRVQRNVASSSGLCGLATEPSYPVK 354
>gi|220983358|dbj|BAH11164.1| cysteine protease [Hordeum vulgare]
Length = 462
Score = 374 bits (960), Expect = e-101, Method: Compositional matrix adjust.
Identities = 181/331 (54%), Positives = 230/331 (69%), Gaps = 7/331 (2%)
Query: 23 SLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQR 82
+ A D SIV Y S +++ ++ WM++H TY I E+ RFE F+ NL++IDQ
Sbjct: 20 AAAADMSIVFYGER---SEEEVRRMYAEWMAEHHSTYNPIGEEERRFEAFRNNLRYIDQH 76
Query: 83 NKE----VTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVD 138
N V S+ LGLN FAD+++EE+++ YLG + + R+ SA + D LP+SVD
Sbjct: 77 NAAADAGVHSFRLGLNRFADLTNEEYRSTYLGARTKPDRERKLSARYQAADNDELPESVD 136
Query: 139 WRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNG 198
WRKKGAV VK+QG CGSCWAFS +AAVEGINQIV+G++ LSEQEL+DCDTS+N GCNG
Sbjct: 137 WRKKGAVGAVKDQGGCGSCWAFSAIAAVEGINQIVTGDMIPLSEQELVDCDTSYNQGCNG 196
Query: 199 GLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKA 258
GLMDYAF++I+ +GG+ EEDYPY + C+ K+ +VVTI GY+DVP N E+SL KA
Sbjct: 197 GLMDYAFEFIINNGGIDSEEDYPYKERDNRCDANKKNAKVVTIDGYEDVPVNSEKSLQKA 256
Query: 259 LAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPK 318
+A+QP+SVAIEA G FQ Y G+FTG CG LDHGVAAVGYG G DY +V+NSWG
Sbjct: 257 VANQPISVAIEAGGRAFQLYKSGIFTGTCGTALDHGVAAVGYGTENGKDYWLVRNSWGSV 316
Query: 319 WGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
WGE GYIRM+RN G CGI S P K
Sbjct: 317 WGENGYIRMERNIKASSGKCGIAVEPSYPTK 347
>gi|4469155|emb|CAB38315.1| chymopapain isoform III [Carica papaya]
Length = 361
Score = 374 bits (959), Expect = e-101, Method: Compositional matrix adjust.
Identities = 183/347 (52%), Positives = 243/347 (70%), Gaps = 5/347 (1%)
Query: 5 SHSKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEE 64
S SK++ L+ L + S A DF VGYS + LTS+++LI+LF+SWM KH K Y+ I+E
Sbjct: 6 SISKIIFLATCLIIHMGLSSA-DFYTVGYSQDDLTSIERLIQLFDSWMLKHNKIYESIDE 64
Query: 65 KLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQ--PS 122
K++RFEIF++NL +ID+ NK+ SYWLGLN FAD+S++EFK KY+G + T + +
Sbjct: 65 KIYRFEIFRDNLMYIDETNKKNNSYWLGLNGFADLSNDEFKKKYVGFVAEDFTGLEHFDN 124
Query: 123 AEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSE 182
+F+Y+ V P+S+DWR KGAVTPVKNQG+CGSCWAFST+A VEGIN+IV+GNL LSE
Sbjct: 125 EDFTYKHVTNYPQSIDWRAKGAVTPVKNQGACGSCWAFSTIATVEGINKIVTGNLLELSE 184
Query: 183 QELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTIS 242
QEL+DCD + GC GG + +Y VA+ G+H + YP ++ C + V I+
Sbjct: 185 QELVDCD-KHSYGCKGGYQTTSLQY-VANNGVHTSKVYPCQAKQYKCRATDKPGPKVKIT 242
Query: 243 GYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGK 302
GY+ VP N E S L ALA+QP+S +EA G FQ Y GVF GPCG +LDH V AVGYG
Sbjct: 243 GYKRVPSNCETSFLGALANQPLSFLVEAGGKPFQLYKSGVFDGPCGTKLDHAVTAVGYGT 302
Query: 303 SKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
S G +YII+KNSWGP WGE+GY+R+KR +G +G CG+ K + P K
Sbjct: 303 SDGKNYIIIKNSWGPNWGEKGYMRLKRQSGNSQGTCGVYKSSYYPFK 349
>gi|224056176|ref|XP_002298740.1| predicted protein [Populus trichocarpa]
gi|222845998|gb|EEE83545.1| predicted protein [Populus trichocarpa]
Length = 455
Score = 374 bits (959), Expect = e-101, Method: Compositional matrix adjust.
Identities = 187/330 (56%), Positives = 234/330 (70%), Gaps = 8/330 (2%)
Query: 27 DFSIV-GYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKE 85
D++I G PE + + I +E W+ KHG+ Y + EK RFEIFK+NLK ID+ N
Sbjct: 5 DYNIKHGQVPERTEAETRRI--YEMWLVKHGRAYNALGEKERRFEIFKDNLKFIDEHNSV 62
Query: 86 VT-SYWLGLNEFADMSHEEFKNKYLGLKPQFPTRR---QPSAEFSYRDVKALPKSVDWRK 141
SY LGLN+FAD+S++E+++ YLG + R S + +++ LP++VDWR+
Sbjct: 63 GNPSYKLGLNKFADLSNDEYRSVYLGTRMDGKGRLLGGPKSERYLFKEGDDLPETVDWRE 122
Query: 142 KGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLM 201
KGAV PVK+QG CGSCWAFSTV AVEGINQIV+GNLTSLSEQEL+DCD ++N GCNGGLM
Sbjct: 123 KGAVAPVKDQGQCGSCWAFSTVGAVEGINQIVTGNLTSLSEQELVDCDKTYNLGCNGGLM 182
Query: 202 DYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH 261
DYAF +I+ +GG+ EEDYPY + C+ ++ VVTI GY+DVP+NDE+SL KA+A+
Sbjct: 183 DYAFDFIIENGGIDTEEDYPYKAIDSMCDPNRKNARVVTIDGYEDVPQNDEKSLKKAVAN 242
Query: 262 QPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGE 321
QPVSVAIEA G FQ Y GVFTG CG +LDHGV VGYG G DY IV+NSWGP WGE
Sbjct: 243 QPVSVAIEAGGRGFQLYQSGVFTGSCGTQLDHGVVTVGYGTEHGVDYWIVRNSWGPAWGE 302
Query: 322 RGYIRMKRNTGKPE-GLCGINKMASIPLKK 350
GYIRM+R+ E G CGI AS P KK
Sbjct: 303 NGYIRMERDVASTETGKCGIAMEASYPTKK 332
>gi|224096714|ref|XP_002310708.1| predicted protein [Populus trichocarpa]
gi|222853611|gb|EEE91158.1| predicted protein [Populus trichocarpa]
Length = 356
Score = 372 bits (956), Expect = e-100, Method: Compositional matrix adjust.
Identities = 179/334 (53%), Positives = 235/334 (70%), Gaps = 7/334 (2%)
Query: 22 SSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQ 81
S HD + + + S D+++ +++ W+ KHGK Y + EK RFEIFK NL+ ID+
Sbjct: 2 SIFNHDDNHLSHDQSSWRSDDEVMSIYKWWLQKHGKAYNRLGEKAKRFEIFKNNLRFIDE 61
Query: 82 RNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRR-----QPSAEFSYRDVKALPKS 136
N + +Y +GL +FAD++++E++ +LG + P RR PS ++Y+ LP+S
Sbjct: 62 HNSQNRTYKVGLTKFADLTNQEYRAMFLGTRSD-PKRRLMKSKNPSERYAYKAGDKLPES 120
Query: 137 VDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGC 196
VDWR KGAV P+K+QGSCGSCWAFSTVAAVEGINQIV+G L SLSEQEL+DCD +N GC
Sbjct: 121 VDWRGKGAVNPIKDQGSCGSCWAFSTVAAVEGINQIVTGELISLSEQELVDCDRFYNAGC 180
Query: 197 NGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLL 256
NGGLMDYAF++I+ +GGL E+DYPYL + TC+ K + + V+I G++DV DE++L
Sbjct: 181 NGGLMDYAFQFIINNGGLDTEKDYPYLGNDDTCDRDKMKTKAVSIDGFEDVLPFDEKALQ 240
Query: 257 KALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWG 316
KA+AHQPVSVAIEASG QFY GVFTG CG LDHGV VGYG KG DY +V+NSWG
Sbjct: 241 KAVAHQPVSVAIEASGMALQFYQSGVFTGECGTALDHGVVVVGYGTEKGLDYWLVRNSWG 300
Query: 317 PKWGERGYIRMKRNTGKP-EGLCGINKMASIPLK 349
+WGE GYI+M+RN G CGI +S P+K
Sbjct: 301 TEWGEHGYIKMQRNVRDTYTGRCGIAMESSYPVK 334
>gi|168017893|ref|XP_001761481.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162687165|gb|EDQ73549.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 471
Score = 372 bits (954), Expect = e-100, Method: Compositional matrix adjust.
Identities = 181/326 (55%), Positives = 230/326 (70%), Gaps = 5/326 (1%)
Query: 29 SIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTS 88
+I+ Y L S D ++++F W+ +H + Y + EK RF+IFK+NL +I NK+ S
Sbjct: 33 AIMDYEAHELHSDDGMLDVFHQWLERHSRVYHSLSEKQRRFQIFKDNLHYIHNHNKQEKS 92
Query: 89 YWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAE-FSYRDVKALPKSVDWRKKGAVTP 147
YWLGLN+F+D++H+EF+ YLG++P + + F Y DV A + VDWRKKGAV+
Sbjct: 93 YWLGLNKFSDLTHDEFRALYLGIRPAGRAHGLRNGDRFIYEDVVA-EEMVDWRKKGAVSD 151
Query: 148 VKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKY 207
VK+QGSCGSCWAFS + +VEG+N IV+G L SLSEQEL+DCD N GCNGGLMDYAF +
Sbjct: 152 VKDQGSCGSCWAFSAIGSVEGVNAIVTGELISLSEQELVDCDRGQNQGCNGGLMDYAFDF 211
Query: 208 IVASGGLHKEEDYPYLMEEGTCED-KKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSV 266
I+ +GG+ EEDYPY +G C++ +KE +VV I YQDVP E SLLKA++ PVSV
Sbjct: 212 IIKNGGIDTEEDYPYKATDGQCDEARKETSKVVVIDDYQDVPTKSESSLLKAVSKNPVSV 271
Query: 267 AIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYG-KSKGSDYIIVKNSWGPKWGERGYI 325
AIEA G DFQ Y GGVFTGPCG +LDHGV AVGYG G +Y IVKNSWGP WGE+GYI
Sbjct: 272 AIEAGGRDFQHYQGGVFTGPCGTDLDHGVLAVGYGTDDDGVNYWIVKNSWGPSWGEKGYI 331
Query: 326 RMKR-NTGKPEGLCGINKMASIPLKK 350
RM+R + G CGIN S P+KK
Sbjct: 332 RMERMGSNSTSGKCGINIEPSFPIKK 357
>gi|302759380|ref|XP_002963113.1| hypothetical protein SELMODRAFT_270344 [Selaginella moellendorffii]
gi|300169974|gb|EFJ36576.1| hypothetical protein SELMODRAFT_270344 [Selaginella moellendorffii]
Length = 479
Score = 372 bits (954), Expect = e-100, Method: Compositional matrix adjust.
Identities = 184/335 (54%), Positives = 230/335 (68%), Gaps = 12/335 (3%)
Query: 27 DFSIV--GYSPEHLTSMDKLIELFESWMSKHGKTY--------KCIEEKLHRFEIFKENL 76
DFSI+ GY P+ L+S ++L LF+SWM +HGK+Y EK R+ IFK+NL
Sbjct: 34 DFSILDLGYDPQDLSSEERLQALFDSWMLQHGKSYAENALSGDSQAGEKATRYGIFKDNL 93
Query: 77 KHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDV--KALP 134
+ I N++ Y+LGLN FAD+++EEF+ + G + R EF Y V K LP
Sbjct: 94 RFIHGENEKNQGYFLGLNAFADLTNEEFRAQRHGGRFDRSRERTSYEEFRYGSVQLKDLP 153
Query: 135 KSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNN 194
S+DWR+KGAV VK+QGSCGSCWAFS VAA+EG+N++ +G L SLSEQEL+DCD +
Sbjct: 154 DSIDWREKGAVVGVKDQGSCGSCWAFSAVAAIEGVNKLATGELVSLSEQELVDCDKGEDE 213
Query: 195 GCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQS 254
GCNGGLMDYAF +++ +GGL E DYPY C+ K +VVTI GY+DVP NDE +
Sbjct: 214 GCNGGLMDYAFGFVIKNGGLDTEADYPYKGYGTRCDRSKMNAKVVTIDGYEDVPVNDETA 273
Query: 255 LLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNS 314
LLKA+AHQPVSVAI+A G+ QFY G+FTG CG +LDHGV VGYGK G Y I+KNS
Sbjct: 274 LLKAVAHQPVSVAIDAGGSSMQFYRSGIFTGRCGTDLDHGVTNVGYGKEDGKAYWIIKNS 333
Query: 315 WGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
WG WGE+GYI+M RNTG GLCGIN AS P K
Sbjct: 334 WGSNWGEKGYIKMARNTGLAAGLCGINMEASYPTK 368
>gi|18423124|ref|NP_568722.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|75309064|sp|Q9FGR9.1|CEP1_ARATH RecName: Full=KDEL-tailed cysteine endopeptidase CEP1; AltName:
Full=Cysteine proteinase CP56; Short=AtCP56; Flags:
Precursor
gi|9759028|dbj|BAB09397.1| cysteine endopeptidase [Arabidopsis thaliana]
gi|20258850|gb|AAM13907.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|308097832|gb|ADO14465.1| papain [Arabidopsis thaliana]
gi|332008536|gb|AED95919.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 361
Score = 371 bits (953), Expect = e-100, Method: Compositional matrix adjust.
Identities = 188/347 (54%), Positives = 236/347 (68%), Gaps = 11/347 (3%)
Query: 8 KLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLH 67
+ ++L+L + + ++ DF + + S + L EL+E W S H + +EEK
Sbjct: 3 RFIVLALCMLMVLETTKGLDFH-----NKDVESENSLWELYERWRSHH-TVARSLEEKAK 56
Query: 68 RFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQ----FPTRRQPSA 123
RF +FK N+KHI + NK+ SY L LN+F DM+ EEF+ Y G + F ++ +
Sbjct: 57 RFNVFKHNVKHIHETNKKDKSYKLKLNKFGDMTSEEFRRTYAGSNIKHHRMFQGEKKATK 116
Query: 124 EFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQ 183
F Y +V LP SVDWRK GAVTPVKNQG CGSCWAFSTV AVEGINQI + LTSLSEQ
Sbjct: 117 SFMYANVNTLPTSVDWRKNGAVTPVKNQGQCGSCWAFSTVVAVEGINQIRTKKLTSLSEQ 176
Query: 184 ELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISG 243
EL+DCDT+ N GCNGGLMD AF++I GGL E YPY + TC+ KE VV+I G
Sbjct: 177 ELVDCDTNQNQGCNGGLMDLAFEFIKEKGGLTSELVYPYKASDETCDTNKENAPVVSIDG 236
Query: 244 YQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKS 303
++DVP+N E L+KA+A+QPVSVAI+A G+DFQFYS GVFTG CG EL+HGVA VGYG +
Sbjct: 237 HEDVPKNSEDDLMKAVANQPVSVAIDAGGSDFQFYSEGVFTGRCGTELNHGVAVVGYGTT 296
Query: 304 -KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
G+ Y IVKNSWG +WGE+GYIRM+R EGLCGI AS PLK
Sbjct: 297 IDGTKYWIVKNSWGEEWGEKGYIRMQRGIRHKEGLCGIAMEASYPLK 343
>gi|148927394|gb|ABR19828.1| cysteine proteinase [Elaeis guineensis]
Length = 469
Score = 371 bits (953), Expect = e-100, Method: Compositional matrix adjust.
Identities = 173/319 (54%), Positives = 226/319 (70%), Gaps = 8/319 (2%)
Query: 40 SMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT----SYWLGLNE 95
S D++ L+++W ++H ++Y ++E R EIF++NL+ IDQ N S+ LGL
Sbjct: 39 SDDEVHRLYQAWKAQHARSYNALDEDEQRLEIFRDNLRFIDQHNAAANAGKYSFRLGLTR 98
Query: 96 FADMSHEEFKNKYLGLKPQFPTRRQPSA----EFSYRDVKALPKSVDWRKKGAVTPVKNQ 151
FAD+++EE+++ YLG++ RR+ S + +R LP S+DWR KGAV VK+Q
Sbjct: 99 FADLTNEEYRSTYLGVRTAGSRRRRNSTVGSNRYRFRSSDDLPDSIDWRDKGAVVDVKDQ 158
Query: 152 GSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVAS 211
GSCGSCWAFST+AAVEGIN IV+G+L SLSEQEL+DCDT +N GCNGGLMDYAF++I+++
Sbjct: 159 GSCGSCWAFSTIAAVEGINHIVTGDLISLSEQELVDCDTYYNQGCNGGLMDYAFEFIISN 218
Query: 212 GGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEAS 271
GG+ +EDYPY +G+C+ ++ VVTI Y+DVP NDE+SL KA+A+QPVSVAIEA
Sbjct: 219 GGIDTDEDYPYTGRDGSCDQYRKNAHVVTIDSYEDVPINDEKSLQKAVANQPVSVAIEAG 278
Query: 272 GTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNT 331
G FQ Y G+FTG CG ELDHGV A+GYG G Y IVKNSWG WGE GYIRM+RN
Sbjct: 279 GRAFQLYESGIFTGYCGTELDHGVTAIGYGSENGKYYWIVKNSWGSDWGESGYIRMERNI 338
Query: 332 GKPEGLCGINKMASIPLKK 350
G CGI AS P+K
Sbjct: 339 NSATGKCGIAMEASYPIKN 357
>gi|28192373|gb|AAK07730.1| CPR1-like cysteine proteinase [Nicotiana tabacum]
Length = 374
Score = 371 bits (953), Expect = e-100, Method: Compositional matrix adjust.
Identities = 184/356 (51%), Positives = 246/356 (69%), Gaps = 14/356 (3%)
Query: 7 SKLLLLSLSLSLFACSSLAHDFSIVGYSPEHL-------TSMDKLIELFESWMSKHGKTY 59
+K ++ +L +L + S A D SI+ Y H + D++ +E W+++HG+ Y
Sbjct: 2 AKTIITTLLFALSSSLSYAIDMSIIDYKNNHYARKWTLQSDEDQVKNRYEMWLAEHGRAY 61
Query: 60 KCIEEKLHRFEIFKENLKHIDQRNKEVT-SYWLGLNEFADMSHEEFKNKYLGLKP----Q 114
+ EK RFEIFK+NL+ I++ N +Y +GLN+FAD+++EE++ YLG K +
Sbjct: 62 NALGEKEKRFEIFKDNLRFIEEHNNSGNRTYKVGLNQFADLTNEEYRTMYLGTKSDARRR 121
Query: 115 FPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVS 174
F + PS ++ R + +P SVDWRK+GAV P+KNQGSCGSCWAFSTVAAV GINQIV+
Sbjct: 122 FVKSKNPSQRYASRPNELMPHSVDWRKRGAVAPIKNQGSCGSCWAFSTVAAVGGINQIVT 181
Query: 175 GNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKE 234
G + +LSEQEL+DCD N+GCNGGLMDYAF++I+++GG+ E+ YPY EG C+ ++
Sbjct: 182 GEMITLSEQELVDCDRVQNSGCNGGLMDYAFEFIISNGGMDTEKHYPYRGVEGRCDPVRK 241
Query: 235 EMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHG 294
+VV+I GY+DVP N E++L KA+AHQPV VAIEASG FQ YS GVFTG CG E+DHG
Sbjct: 242 NYKVVSIDGYEDVPRN-ERALQKAVAHQPVCVAIEASGRAFQLYSSGVFTGECGEEVDHG 300
Query: 295 VAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPE-GLCGINKMASIPLK 349
V VGYG G DY IV+NSWG KWGE GY++M+RN K G CGI AS P K
Sbjct: 301 VVVVGYGSEDGVDYWIVRNSWGTKWGENGYVKMERNVKKSHLGKCGIMTEASYPTK 356
>gi|357437719|ref|XP_003589135.1| Cysteine proteinase [Medicago truncatula]
gi|355478183|gb|AES59386.1| Cysteine proteinase [Medicago truncatula]
Length = 457
Score = 370 bits (951), Expect = e-100, Method: Compositional matrix adjust.
Identities = 188/359 (52%), Positives = 249/359 (69%), Gaps = 15/359 (4%)
Query: 4 FSHSKLLLLSLSLSLFACSSLAHDFSIVGYS---PEHLTSM---DKLIELFESWMSKHGK 57
S + L++ L +S F SLA D SI+ Y P+ TS +++ ++E W+ KHGK
Sbjct: 6 LSPAMKLMIVLIISSFT-VSLALDMSIISYDKTHPDKSTSKRTNKEVLTMYEEWLVKHGK 64
Query: 58 TYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPT 117
+Y + EK RFEIFK+NLK ID+ N ++Y LGL FAD+++EE+++K+LG K P
Sbjct: 65 SYNGLGEKDKRFEIFKDNLKFIDEHNGLNSTYRLGLTRFADLTNEEYRSKFLGTKID-PN 123
Query: 118 RR------QPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQ 171
RR S ++ R LP+SVDWRK+GAV VK+Q SCGSCWAFS +AAVEGIN+
Sbjct: 124 RRMKKLGGSKSNRYAPRVGDKLPESVDWRKEGAVVGVKDQASCGSCWAFSAIAAVEGINK 183
Query: 172 IVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCED 231
IV+G+L SLSEQEL+DCDTS+N GCNGGLMDYAF++I+++GG+ E+DYPY +G C+
Sbjct: 184 IVTGDLISLSEQELVDCDTSYNEGCNGGLMDYAFEFIISNGGIDSEDDYPYKAVDGRCDQ 243
Query: 232 KKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAEL 291
++ +VVTI Y+DVP DE +L KA+A+QP++VA+E G +FQ Y GVFTG CG L
Sbjct: 244 NRKNAKVVTIDDYEDVPAYDELALQKAVANQPIAVAVEGGGREFQLYEYGVFTGRCGTAL 303
Query: 292 DHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPE-GLCGINKMASIPLK 349
DHGVAAVGYG G DY IV+NSWG WGE+GYIR++RN G CGI S P+K
Sbjct: 304 DHGVAAVGYGTENGKDYWIVRNSWGGSWGEQGYIRLERNLASSRAGKCGIAIEPSYPIK 362
>gi|50355623|dbj|BAD29960.1| cysteine protease [Daucus carota]
Length = 460
Score = 370 bits (951), Expect = e-100, Method: Compositional matrix adjust.
Identities = 172/330 (52%), Positives = 235/330 (71%), Gaps = 5/330 (1%)
Query: 25 AHDFSIVGYSPEHL--TSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQR 82
A D SI+ Y H ++ D ++ +ESW+ KHGK+Y + EK RF+IFK+N +ID++
Sbjct: 19 AADMSIITYDQTHAVGSTDDVIMAAYESWLVKHGKSYNALGEKEQRFQIFKDNFLYIDEQ 78
Query: 83 NK-EVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDV--KALPKSVDW 139
N + S+ LGLN FAD+++EE+++KY G++ + ++ Y + ++LP+SVDW
Sbjct: 79 NAAKDRSFKLGLNRFADLTNEEYRSKYTGIRTKDSRKKVSGKSQRYASLAGESLPESVDW 138
Query: 140 RKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGG 199
R+ GAV VK+QG CGSCWAFST++AVEGINQI +G L +LSEQEL+DCD S+N GCNGG
Sbjct: 139 REHGAVASVKDQGQCGSCWAFSTISAVEGINQIATGKLITLSEQELVDCDRSYNEGCNGG 198
Query: 200 LMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKAL 259
LMD AF++I+ +GG+ + DYPY +G C+ ++ +VVTI Y+DVPE DE++L KA
Sbjct: 199 LMDDAFQFIINNGGIDSDADYPYTGRDGQCDQYRKNAKVVTIDSYEDVPEYDEKALQKAA 258
Query: 260 AHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKW 319
A+QP+SVAIEASG DFQFY G+FTG CG +LDHGV VGYG G DY IV+NSWG W
Sbjct: 259 ANQPISVAIEASGRDFQFYDSGIFTGKCGTDLDHGVVVVGYGTENGKDYWIVRNSWGADW 318
Query: 320 GERGYIRMKRNTGKPEGLCGINKMASIPLK 349
GE+GY+RM+R G+CGI S P+K
Sbjct: 319 GEKGYLRMERGISSKAGICGITSEPSYPVK 348
>gi|357437715|ref|XP_003589133.1| Cysteine proteinase [Medicago truncatula]
gi|87240770|gb|ABD32628.1| Granulin; Peptidase C1A, papain [Medicago truncatula]
gi|355478181|gb|AES59384.1| Cysteine proteinase [Medicago truncatula]
Length = 474
Score = 370 bits (951), Expect = e-100, Method: Compositional matrix adjust.
Identities = 188/359 (52%), Positives = 249/359 (69%), Gaps = 15/359 (4%)
Query: 4 FSHSKLLLLSLSLSLFACSSLAHDFSIVGYS---PEHLTSM---DKLIELFESWMSKHGK 57
S + L++ L +S F SLA D SI+ Y P+ TS +++ ++E W+ KHGK
Sbjct: 6 LSPAMKLMIVLIISSFT-VSLALDMSIISYDKTHPDKSTSKRTNKEVLTMYEEWLVKHGK 64
Query: 58 TYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPT 117
+Y + EK RFEIFK+NLK ID+ N ++Y LGL FAD+++EE+++K+LG K P
Sbjct: 65 SYNGLGEKDKRFEIFKDNLKFIDEHNGLNSTYRLGLTRFADLTNEEYRSKFLGTKID-PN 123
Query: 118 RR------QPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQ 171
RR S ++ R LP+SVDWRK+GAV VK+Q SCGSCWAFS +AAVEGIN+
Sbjct: 124 RRMKKLGGSKSNRYAPRVGDKLPESVDWRKEGAVVGVKDQASCGSCWAFSAIAAVEGINK 183
Query: 172 IVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCED 231
IV+G+L SLSEQEL+DCDTS+N GCNGGLMDYAF++I+++GG+ E+DYPY +G C+
Sbjct: 184 IVTGDLISLSEQELVDCDTSYNEGCNGGLMDYAFEFIISNGGIDSEDDYPYKAVDGRCDQ 243
Query: 232 KKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAEL 291
++ +VVTI Y+DVP DE +L KA+A+QP++VA+E G +FQ Y GVFTG CG L
Sbjct: 244 NRKNAKVVTIDDYEDVPAYDELALQKAVANQPIAVAVEGGGREFQLYEYGVFTGRCGTAL 303
Query: 292 DHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPE-GLCGINKMASIPLK 349
DHGVAAVGYG G DY IV+NSWG WGE+GYIR++RN G CGI S P+K
Sbjct: 304 DHGVAAVGYGTENGKDYWIVRNSWGGSWGEQGYIRLERNLASSRAGKCGIAIEPSYPIK 362
>gi|13897890|gb|AAK48495.1|AF259983_1 putative cysteine protease [Ipomoea batatas]
Length = 462
Score = 370 bits (951), Expect = e-100, Method: Compositional matrix adjust.
Identities = 191/354 (53%), Positives = 250/354 (70%), Gaps = 11/354 (3%)
Query: 7 SKLLLLSL-SLSLFACSSLAHDFSIVGYSPEHLT-----SMDKLIELFESWMSKHGKTYK 60
+KLL+LSL L+ + +S + D SI+ Y EH S ++++ L+ESW+ +HGK+Y
Sbjct: 2 AKLLILSLFVLAAVSSASASADMSIITYDEEHPAKGLSRSDEEVMALYESWLVEHGKSYN 61
Query: 61 CIE-EKLHRFEIFKENLKHIDQRN-KEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTR 118
+ EK RFEIFK+NL++ID++N + SY LGLN FAD+++EE+++ YLG K R
Sbjct: 62 GLGGEKDKRFEIFKDNLRYIDEQNSRGDRSYKLGLNRFADLTNEEYRSTYLGAKTDARRR 121
Query: 119 ---RQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSG 175
+ ++ + +LP S+DWR+KGAV VK+QGSCGSCWAFST+AAVEGINQIV+G
Sbjct: 122 IAKTKSDRRYAPKAGGSLPDSIDWREKGAVAEVKDQGSCGSCWAFSTIAAVEGINQIVTG 181
Query: 176 NLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEE 235
L SLSEQEL+DCDTS+N GCNGGLMDYAF++I+ +GG+ E DYPY G C+ ++
Sbjct: 182 ELISLSEQELVDCDTSYNEGCNGGLMDYAFEFIIKNGGIDTEADYPYTGRYGRCDQTRKN 241
Query: 236 MEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGV 295
+VV+I GY+DV DE +L +A+A QPVSVAIEA G DFQ YS G+FTG CG +LDHGV
Sbjct: 242 AKVVSIDGYEDVTPYDEAALKEAVAGQPVSVAIEAGGRDFQLYSSGIFTGSCGTDLDHGV 301
Query: 296 AAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
AVGYG G DY IVKNSW WGE+GY+RM+RN GLCGI S P K
Sbjct: 302 TAVGYGTENGVDYWIVKNSWAASWGEKGYLRMQRNVKDKNGLCGIAIEPSYPTK 355
>gi|302796898|ref|XP_002980210.1| hypothetical protein SELMODRAFT_153766 [Selaginella moellendorffii]
gi|300151826|gb|EFJ18470.1| hypothetical protein SELMODRAFT_153766 [Selaginella moellendorffii]
Length = 479
Score = 370 bits (950), Expect = e-100, Method: Compositional matrix adjust.
Identities = 182/335 (54%), Positives = 230/335 (68%), Gaps = 12/335 (3%)
Query: 27 DFSIV--GYSPEHLTSMDKLIELFESWMSKHGKTY--------KCIEEKLHRFEIFKENL 76
D+SI+ GY P+ L+S ++L LF+SWM +HGK+Y EK R+ IFK+NL
Sbjct: 34 DYSILDLGYDPQDLSSEERLQALFDSWMLQHGKSYADNALSGDSQAGEKATRYGIFKDNL 93
Query: 77 KHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDV--KALP 134
+ I N++ Y+LGLN FAD+++EEF+ + G + R EF Y V K LP
Sbjct: 94 RFIHGENEKNQGYFLGLNAFADLTNEEFRAQRHGGRFDRSRERTSHEEFRYGSVQLKDLP 153
Query: 135 KSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNN 194
S+DWR+KGAV VK+QGSCGSCWAFS VAA+EG+N++ +G L SLSEQEL+DCD +
Sbjct: 154 DSIDWREKGAVVGVKDQGSCGSCWAFSAVAAIEGVNKLATGELVSLSEQELVDCDKGEDE 213
Query: 195 GCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQS 254
GCNGGLMDYAF +++ +GGL E DYPY C+ K +VVTI GY+DVP NDE +
Sbjct: 214 GCNGGLMDYAFGFVIKNGGLDTEADYPYKGYGTRCDRSKMNAKVVTIDGYEDVPVNDETA 273
Query: 255 LLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNS 314
LLKA+AHQPVSVAI+A G+ QFY G+FTG CG +LDHGV VGYGK G Y I+KNS
Sbjct: 274 LLKAVAHQPVSVAIDAGGSSMQFYRSGIFTGRCGTDLDHGVTNVGYGKEDGKAYWIIKNS 333
Query: 315 WGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
WG WGE+GY++M RNTG GLCGIN AS P K
Sbjct: 334 WGSNWGEKGYVKMARNTGLAAGLCGINMEASYPTK 368
>gi|109939734|sp|P25776.2|ORYA_ORYSJ RecName: Full=Oryzain alpha chain; Flags: Precursor
gi|78192122|gb|ABB30151.1| oryzain alpha [Oryza sativa Japonica Group]
Length = 458
Score = 370 bits (949), Expect = e-100, Method: Compositional matrix adjust.
Identities = 178/328 (54%), Positives = 230/328 (70%), Gaps = 7/328 (2%)
Query: 27 DFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKE- 85
D SIV Y S ++ L+ W ++HGK+Y + E+ R+ F++NL++ID+ N
Sbjct: 22 DMSIVSYGER---SEEEARRLYAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDEHNAAA 78
Query: 86 ---VTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKK 142
V S+ LGLN FAD+++EE+++ YLGL+ + R+ S + D +ALP+SVDWR K
Sbjct: 79 DAGVHSFRLGLNRFADLTNEEYRDTYLGLRNKPRRERKVSDRYLAADNEALPESVDWRTK 138
Query: 143 GAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMD 202
GAV +K+QG CGSCWAFS +AAVEGINQIV+G+L SLSEQEL+DCDTS+N GCNGGLMD
Sbjct: 139 GAVAEIKDQGGCGSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDTSYNEGCNGGLMD 198
Query: 203 YAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQ 262
YAF +I+ +GG+ E+DYPY ++ C+ ++ +VVTI Y+DV N E SL KA+A+Q
Sbjct: 199 YAFDFIINNGGIDTEDDYPYKGKDERCDVNRKNAKVVTIDSYEDVTPNSETSLQKAVANQ 258
Query: 263 PVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGER 322
PVSVAIEA G FQ YS G+FTG CG LDHGVAAVGYG G DY IV+NSWG WGE
Sbjct: 259 PVSVAIEAGGRAFQLYSSGIFTGKCGTALDHGVAAVGYGTENGKDYWIVRNSWGKSWGES 318
Query: 323 GYIRMKRNTGKPEGLCGINKMASIPLKK 350
GY+RM+RN G CGI S PLKK
Sbjct: 319 GYVRMERNIKASSGKCGIAVEPSYPLKK 346
>gi|1709576|sp|P05994.3|PAPA4_CARPA RecName: Full=Papaya proteinase 4; AltName: Full=Glycyl
endopeptidase; AltName: Full=Papaya peptidase B;
AltName: Full=Papaya proteinase IV; Short=PPIV; Flags:
Precursor
gi|953176|emb|CAA54974.1| proteinase IV [Carica papaya]
Length = 348
Score = 369 bits (948), Expect = e-100, Method: Compositional matrix adjust.
Identities = 190/346 (54%), Positives = 246/346 (71%), Gaps = 5/346 (1%)
Query: 5 SHSKLLLLSLSLSLFACSSLAH-DFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIE 63
S SKLL +++ L F SL++ DFSIVGYS + LTS ++LI+LF SWM KH K YK ++
Sbjct: 6 SFSKLLFVAICL--FGHMSLSYCDFSIVGYSQDDLTSTERLIQLFNSWMLKHNKNYKNVD 63
Query: 64 EKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSA 123
EKL+RFEIFK+NLK+ID+RNK + YWLGLNEF+D+S++EFK KY+G P+ T +
Sbjct: 64 EKLYRFEIFKDNLKYIDERNKMINGYWLGLNEFSDLSNDEFKEKYVGSLPEDYTNQPYDE 123
Query: 124 EFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQ 183
EF D+ LP+SVDWR KGAVTPVK+QG C SCWAFSTVA VEGIN+I +GNL LSEQ
Sbjct: 124 EFVNEDIVDLPESVDWRAKGAVTPVKHQGYCESCWAFSTVATVEGINKIKTGNLVELSEQ 183
Query: 184 ELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISG 243
EL+DCD + GCN G + +Y VA G+H YPY+ ++ TC + V +G
Sbjct: 184 ELVDCDKQ-SYGCNRGYQSTSLQY-VAQNGIHLRAKYPYIAKQQTCRANQVGGPKVKTNG 241
Query: 244 YQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKS 303
V N+E SLL A+AHQPVSV +E++G DFQ Y GG+F G CG ++DH V AVGYGKS
Sbjct: 242 VGRVQSNNEGSLLNAIAHQPVSVVVESAGRDFQNYKGGIFEGSCGTKVDHAVTAVGYGKS 301
Query: 304 KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
G YI++KNSWGP WGE GYIR++R +G G+CG+ + + P+K
Sbjct: 302 GGKGYILIKNSWGPGWGENGYIRIRRASGNSPGVCGVYRSSYYPIK 347
>gi|222629675|gb|EEE61807.1| hypothetical protein OsJ_16426 [Oryza sativa Japonica Group]
Length = 459
Score = 369 bits (948), Expect = e-100, Method: Compositional matrix adjust.
Identities = 178/328 (54%), Positives = 230/328 (70%), Gaps = 7/328 (2%)
Query: 27 DFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKE- 85
D SIV Y S ++ L+ W ++HGK+Y + E+ R+ F++NL++ID+ N
Sbjct: 23 DMSIVSYGER---SEEEARRLYAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDEHNAAA 79
Query: 86 ---VTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKK 142
V S+ LGLN FAD+++EE+++ YLGL+ + R+ S + D +ALP+SVDWR K
Sbjct: 80 DAGVHSFRLGLNRFADLTNEEYRDTYLGLRNKPRRERKVSDRYLAADNEALPESVDWRTK 139
Query: 143 GAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMD 202
GAV +K+QG CGSCWAFS +AAVEGINQIV+G+L SLSEQEL+DCDTS+N GCNGGLMD
Sbjct: 140 GAVAEIKDQGGCGSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDTSYNEGCNGGLMD 199
Query: 203 YAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQ 262
YAF +I+ +GG+ E+DYPY ++ C+ ++ +VVTI Y+DV N E SL KA+A+Q
Sbjct: 200 YAFDFIINNGGIDTEDDYPYKGKDERCDVNRKNAKVVTIDSYEDVTPNSETSLQKAVANQ 259
Query: 263 PVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGER 322
PVSVAIEA G FQ YS G+FTG CG LDHGVAAVGYG G DY IV+NSWG WGE
Sbjct: 260 PVSVAIEAGGRAFQLYSSGIFTGKCGTALDHGVAAVGYGTENGKDYWIVRNSWGKSWGES 319
Query: 323 GYIRMKRNTGKPEGLCGINKMASIPLKK 350
GY+RM+RN G CGI S PLKK
Sbjct: 320 GYVRMERNIKASSGKCGIAVEPSYPLKK 347
>gi|218195711|gb|EEC78138.1| hypothetical protein OsI_17694 [Oryza sativa Indica Group]
Length = 458
Score = 369 bits (948), Expect = e-100, Method: Compositional matrix adjust.
Identities = 178/328 (54%), Positives = 229/328 (69%), Gaps = 7/328 (2%)
Query: 27 DFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKE- 85
D SIV Y S ++ L+ W ++HGK Y + E+ R+ F++NL++ID+ N
Sbjct: 22 DMSIVSYGER---SEEEARRLYAEWKAEHGKNYNAVGEEERRYAAFRDNLRYIDEHNAAA 78
Query: 86 ---VTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKK 142
V S+ LGLN FAD+++EE+++ YLGL+ + R+ S + D +ALP+SVDWR K
Sbjct: 79 DAGVHSFRLGLNRFADLTNEEYRDTYLGLRNKPRRERKVSDRYLAADNEALPESVDWRTK 138
Query: 143 GAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMD 202
GAV +K+QG CGSCWAFS +AAVEGINQIV+G+L SLSEQEL+DCDTS+N GCNGGLMD
Sbjct: 139 GAVAEIKDQGGCGSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDTSYNEGCNGGLMD 198
Query: 203 YAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQ 262
YAF +I+ +GG+ E+DYPY ++ C+ ++ +VVTI Y+DV N E SL KA+A+Q
Sbjct: 199 YAFDFIINNGGIDTEDDYPYKGKDERCDVNRKNAKVVTIDSYEDVTPNSETSLQKAVANQ 258
Query: 263 PVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGER 322
PVSVAIEA G FQ YS G+FTG CG LDHGVAAVGYG G DY IV+NSWG WGE
Sbjct: 259 PVSVAIEAGGRAFQLYSSGIFTGKCGTALDHGVAAVGYGTENGKDYWIVRNSWGKSWGES 318
Query: 323 GYIRMKRNTGKPEGLCGINKMASIPLKK 350
GY+RM+RN G CGI S PLKK
Sbjct: 319 GYVRMERNIKASSGKCGIAVEPSYPLKK 346
>gi|116781957|gb|ABK22314.1| unknown [Picea sitchensis]
Length = 369
Score = 369 bits (948), Expect = e-100, Method: Compositional matrix adjust.
Identities = 189/337 (56%), Positives = 236/337 (70%), Gaps = 7/337 (2%)
Query: 19 FACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKC-IEEKLHRFEIFKENLK 77
+ S+ A DF+ G++ E L S L L+++W +H + EE RFEIFKEN+K
Sbjct: 18 WVLSASASDFT-PGFTDEDLESEKSLRSLYDNWALQHRSSRSLDSEEHAERFEIFKENVK 76
Query: 78 HIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQ-PSAEFSYRDVKALPKS 136
+ID NK+ + Y LGLN+FAD+S+EEFK Y+G K R+ S F Y++ + LP S
Sbjct: 77 YIDSVNKKDSPYKLGLNKFADLSNEEFKAIYMGTKMDLRGDREVQSGSFMYQNSEPLPAS 136
Query: 137 VDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGC 196
+DWR+KGAV VKNQG CGSCWAFSTVA+VEGIN I +GNL SLSEQ+L+DC T N+GC
Sbjct: 137 IDWRQKGAVAAVKNQGHCGSCWAFSTVASVEGINYITTGNLVSLSEQQLVDCSTE-NSGC 195
Query: 197 NGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKK--EEMEVVTISGYQDVPENDEQS 254
NGGLMD AF+YI+ +GG+ E++YPY E C K + V I G++DVP N+EQ+
Sbjct: 196 NGGLMDTAFQYIINNGGIVTEDNYPYTAEATECSSTKINSQTTRVVIDGFEDVPANNEQA 255
Query: 255 LLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKS-KGSDYIIVKN 313
L +A+AHQPVSVAIEASG DFQFYS GVFTG CG LDHGV AVGYG S +G +Y IV+N
Sbjct: 256 LKEAVAHQPVSVAIEASGQDFQFYSTGVFTGKCGTALDHGVVAVGYGTSPEGINYWIVRN 315
Query: 314 SWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLKK 350
SWGPKWGE GYIRM++ EG CGI AS P KK
Sbjct: 316 SWGPKWGEEGYIRMQQGIEAAEGKCGIAMQASYPTKK 352
>gi|30141027|dbj|BAC75927.1| cysteine protease-5 [Helianthus annuus]
Length = 365
Score = 369 bits (948), Expect = 1e-99, Method: Compositional matrix adjust.
Identities = 174/326 (53%), Positives = 233/326 (71%), Gaps = 10/326 (3%)
Query: 33 YSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT-SYWL 91
Y + + +++ +E W+++HGKTY + EK RF IF +NLK ID+ N SY +
Sbjct: 21 YVTSNTRTDEEVRNTYELWLARHGKTYNALGEKESRFRIFADNLKFIDEHNLSGNRSYKV 80
Query: 92 GLNEFADMSHEEFKNKYLGLKPQFPTRR-------QPSAEFSYRDVKALPKSVDWRKKGA 144
GLN+FAD+++EE+++ YLG K P RR + S ++ ++ + P VDWR++GA
Sbjct: 81 GLNQFADLTNEEYRSMYLGTKVD-PYRRIAKMQRGEISRRYAVQENEMFPAKVDWRERGA 139
Query: 145 VTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYA 204
V+PVKNQG CGSCWAFSTVA+VEGIN+IV+G+L SLSEQEL+DCD +N+GCNGG MDYA
Sbjct: 140 VSPVKNQGGCGSCWAFSTVASVEGINKIVTGDLISLSEQELVDCDNKYNSGCNGGSMDYA 199
Query: 205 FKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPV 264
F++IV++GG+ E DYPY C+ + + ++V+I GY+DVP +E++L+KA+AHQPV
Sbjct: 200 FQFIVSNGGIDSESDYPYKGVGAVCDPVRNKAKIVSIDGYEDVPPMNEKALMKAVAHQPV 259
Query: 265 SVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGY 324
SV IEASG FQ Y+ GV TG CG LDHGV VGYG G DY IV+NSWGP+WGE GY
Sbjct: 260 SVGIEASGRAFQLYTSGVLTGSCGTNLDHGVVVVGYGSENGKDYWIVRNSWGPEWGEDGY 319
Query: 325 IRMKRN-TGKPEGLCGINKMASIPLK 349
IRM+RN P G+CGI MAS P+K
Sbjct: 320 IRMERNMVDTPVGMCGITLMASYPIK 345
>gi|50355619|dbj|BAD29958.1| cysteine protease [Daucus carota]
Length = 496
Score = 369 bits (947), Expect = 1e-99, Method: Compositional matrix adjust.
Identities = 177/334 (52%), Positives = 236/334 (70%), Gaps = 8/334 (2%)
Query: 23 SLAHDFSIVGYSPEHLTSM---DKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHI 79
+ A D SI+ Y H D+ LFESW+ HGK+Y + E+ RF+IFK NL++I
Sbjct: 17 AAATDMSIITYDETHAVGFKTDDEATTLFESWLVTHGKSYNALGEEEKRFQIFKNNLRYI 76
Query: 80 DQRN-KEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAE---FSYRDVKALPK 135
D++N E + LGLN+FAD+++EE+++KY G+K + R++ SA+ ++ ++LP+
Sbjct: 77 DEQNLVEDRGFKLGLNKFADLTNEEYRSKYTGIKSK-DLRKKVSAKSGRYATLSGESLPE 135
Query: 136 SVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNG 195
SVDWR+ GAV VK+QGSCGSCWAFST++AVEGINQI +G L +LSEQEL+DCD S+N G
Sbjct: 136 SVDWRESGAVATVKDQGSCGSCWAFSTISAVEGINQIATGKLITLSEQELVDCDRSYNEG 195
Query: 196 CNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSL 255
CNGGLMDYAF++I+ +GG+ + DYPY +G C+ ++ +VVTI Y+DVP DE +L
Sbjct: 196 CNGGLMDYAFEFIINNGGIDTDVDYPYTGRDGKCDQYRKNAKVVTIDSYEDVPAYDELAL 255
Query: 256 LKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSW 315
KA A+QP+SVAIEASG DFQFY G+FTG CG LDHGV VGYG G DY IV+NSW
Sbjct: 256 KKAAANQPISVAIEASGRDFQFYDSGIFTGKCGIALDHGVVVVGYGTENGKDYWIVRNSW 315
Query: 316 GPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
G WGE GY+RM+R G+CGI S P+K
Sbjct: 316 GADWGENGYLRMERGISSKTGICGIAIEPSYPVK 349
>gi|182375363|gb|ACB87490.1| mucunain [Mucuna pruriens]
Length = 422
Score = 369 bits (947), Expect = 1e-99, Method: Compositional matrix adjust.
Identities = 176/311 (56%), Positives = 225/311 (72%), Gaps = 7/311 (2%)
Query: 45 IELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEF 104
+ L+E W+ KHGK Y + EK RF+IFK+NL+ ID N + +Y LGLN FAD+++EE+
Sbjct: 1 MSLYEQWLVKHGKAYNALGEKDKRFDIFKDNLRFIDDHNADNRTYKLGLNRFADLTNEEY 60
Query: 105 KNKYLGLKPQFPTRR-----QPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWA 159
+ +YLG + P RR S ++ R LP+SVDWR + AV PVK+QG+CGSCWA
Sbjct: 61 RARYLGTRID-PNRRFVKTKTQSNRYAPRVGDNLPESVDWRNESAVLPVKDQGNCGSCWA 119
Query: 160 FSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEED 219
FST+ AVEGIN+IV+G+L SLSEQEL+DCDTS+N GCNGGLMDYA+++I+ +GG+ EED
Sbjct: 120 FSTIGAVEGINKIVTGDLISLSEQELVDCDTSYNQGCNGGLMDYAYEFIINNGGIDSEED 179
Query: 220 YPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYS 279
YPY +GTC+ ++ +VVTI Y+DVP NDE +L KA+A+QPVSVAIE G +FQ Y
Sbjct: 180 YPYRAVDGTCDQYRKNAKVVTIDSYEDVPANDELALKKAVANQPVSVAIEGGGREFQLYV 239
Query: 280 GGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPE-GLC 338
GVFTG CG LDHGV AVGYG KG DY IV+NSWG WGE GY+R++RN K G C
Sbjct: 240 SGVFTGRCGTALDHGVVAVGYGSVKGHDYWIVRNSWGASWGEEGYVRLERNLAKSRSGKC 299
Query: 339 GINKMASIPLK 349
GI S P+K
Sbjct: 300 GIAIEPSYPIK 310
>gi|222425026|dbj|BAH20463.1| cysteine protease [Spinacia oleracea]
Length = 473
Score = 369 bits (946), Expect = 1e-99, Method: Compositional matrix adjust.
Identities = 180/341 (52%), Positives = 240/341 (70%), Gaps = 16/341 (4%)
Query: 25 AHDFSIVGYS------PEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKH 78
A D SI+ Y P S D+++ ++ESW+ +H K Y + EK RF IFK+NL+
Sbjct: 24 AVDMSIISYDHNHNLLPSSSRSDDEVMRIYESWLVQHRKNYNALGEKEKRFAIFKDNLEF 83
Query: 79 IDQRNKEVT-SYWLGLNEFADMSHEEFKNKYLG--------LKPQFPTRRQPSAEFSYRD 129
IDQ N + + ++ +GLN+FAD+++EEF++ YLG + S + +++
Sbjct: 84 IDQHNSDDSQTFKVGLNKFADLTNEEFRSVYLGRKKSSSSSPLLSSAKSKVKSDRYLFKE 143
Query: 130 VKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCD 189
LP++VDWRK GAV VK+QG CGSCWAFST+AAVEGINQIV+G L SLSEQEL+DCD
Sbjct: 144 GDELPEAVDWRKNGAVAKVKDQGQCGSCWAFSTIAAVEGINQIVTGELLSLSEQELVDCD 203
Query: 190 TSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPE 249
TS+N+GC+GGLMDYA+++I+ +GG+ + DYPY ++G C+ ++ +VVTI ++DVPE
Sbjct: 204 TSYNSGCDGGLMDYAYEFIINNGGIDTDADYPYTAKDGKCDQYRKNAKVVTIDDFEDVPE 263
Query: 250 NDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYI 309
NDE++L KA+AHQPVSVAIEA G+ FQFY GVFTG CGA+LDHGV AVGYG G DY
Sbjct: 264 NDEKALQKAVAHQPVSVAIEAGGSTFQFYQSGVFTGKCGADLDHGVVAVGYGSDDGKDYW 323
Query: 310 IVKNSWGPKWGERGYIRMKRNTGKPE-GLCGINKMASIPLK 349
IV+NSWG WGE GYIRM+RN + G CGI S P+K
Sbjct: 324 IVRNSWGADWGESGYIRMERNLETVKTGKCGIAIEPSYPIK 364
>gi|350538043|ref|NP_001234324.1| cysteine protease TDI-65 precursor [Solanum lycopersicum]
gi|5726641|gb|AAD48496.1|AF172856_1 cysteine protease TDI-65 [Solanum lycopersicum]
gi|2828252|emb|CAA05894.1| CYP1 [Solanum lycopersicum]
Length = 466
Score = 369 bits (946), Expect = 2e-99, Method: Compositional matrix adjust.
Identities = 180/353 (50%), Positives = 246/353 (69%), Gaps = 9/353 (2%)
Query: 5 SHSKLLLLSLSLSL-FACSSLAHDFSIVGYSPEHL--TSMDKLIELFESWMSKHGKTYKC 61
+HS L +S+ L L F+ S A D SI+ Y H+ + D++ L+ESW+ +HGK+Y
Sbjct: 3 AHSSTLTISILLMLIFSTLSSASDMSIISYDETHIHRRTDDEVSALYESWLIEHGKSYNA 62
Query: 62 IEEKLHRFEIFKENLKHIDQRNKEVT-SYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQ 120
+ EK RF+IFK+NL++ID++N SY LGL +FAD+++EE+++ YLG K R++
Sbjct: 63 LGEKDKRFQIFKDNLRYIDEQNSVPNQSYKLGLTKFADLTNEEYRSIYLGTKSS-GDRKK 121
Query: 121 PSAEFSYRDV----KALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGN 176
S S R + +LP+S+DWR+KG + VK+QGSCGSCWAFS VAA+E IN IV+GN
Sbjct: 122 LSKNKSDRYLPKVGDSLPESIDWREKGVLVGVKDQGSCGSCWAFSAVAAMESINAIVTGN 181
Query: 177 LTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEM 236
L SLSEQEL+DCD S+N GC+GGLMDYAF++++ +GG+ EEDYPY G C+ ++
Sbjct: 182 LISLSEQELVDCDRSYNEGCDGGLMDYAFEFVIKNGGIDTEEDYPYKERNGVCDQYRKNA 241
Query: 237 EVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVA 296
+VV I Y+DVP N+E++L KA+AHQPVS+A+EA G DFQ Y G+FTG CG +DHGV
Sbjct: 242 KVVKIDSYEDVPVNNEKALQKAVAHQPVSIALEAGGRDFQHYKSGIFTGKCGTAVDHGVV 301
Query: 297 AVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
GYG G DY IV+NSWG WGE GY+R++RN GLCG+ S P+K
Sbjct: 302 IAGYGTENGMDYWIVRNSWGANWGENGYLRVQRNVASSSGLCGLAIEPSYPVK 354
>gi|58531896|gb|AAW78660.1| cysteine protease [Nicotiana tabacum]
Length = 361
Score = 369 bits (946), Expect = 2e-99, Method: Compositional matrix adjust.
Identities = 192/347 (55%), Positives = 242/347 (69%), Gaps = 11/347 (3%)
Query: 8 KLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLH 67
KL L+ SL+L + DF + L + +KL EL+E W S H + ++EK
Sbjct: 3 KLFLVLFSLALVLRLGESFDFHE-----KELETEEKLWELYERWRSHH-TVSRSLDEKDK 56
Query: 68 RFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQ----FPTRRQPSA 123
RF +FK N+ ++ NK+ Y L LN+FADM++ EF++ Y G K + F + +
Sbjct: 57 RFNVFKANVHYVHNFNKKDKPYKLKLNKFADMTNHEFRHHYAGSKIKHHRSFLGASRANG 116
Query: 124 EFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQ 183
F Y +V+ +P SVDWRKKGAVTPVK+QG CGSCWAFSTV AVEGINQI + L SLSEQ
Sbjct: 117 TFMYANVEDVPPSVDWRKKGAVTPVKDQGKCGSCWAFSTVVAVEGINQIKTNELVSLSEQ 176
Query: 184 ELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISG 243
EL+DCDTS N GCNGGLMD AF++I GG++ EE+YPY+ E G C+ +K VV+I G
Sbjct: 177 ELVDCDTSQNQGCNGGLMDMAFEFIKKKGGINTEENYPYMAEGGECDIQKRNSPVVSIDG 236
Query: 244 YQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKS 303
Y+DVP NDE SLLKA+A+QPVSVAI+ASG+DFQFYS GVFTG CG ELDHGVA VGYG +
Sbjct: 237 YEDVPPNDEDSLLKAVANQPVSVAIQASGSDFQFYSEGVFTGDCGTELDHGVAIVGYGTT 296
Query: 304 -KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
G+ Y IV+NSWGP+WGE+GYIRM+R EGLCGI S P+K
Sbjct: 297 LDGTKYWIVRNSWGPEWGEKGYIRMQREIDAEEGLCGIAMQPSYPIK 343
>gi|355344587|gb|AER60490.1| cysteine proteases [Gossypium hirsutum]
Length = 371
Score = 369 bits (946), Expect = 2e-99, Method: Compositional matrix adjust.
Identities = 180/318 (56%), Positives = 230/318 (72%), Gaps = 9/318 (2%)
Query: 40 SMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRN-KEVTSYWLGLNEFAD 98
S D+++ L++SW+ +HGK Y I E+ RFEIFK+NL+ ID+ N T+Y LGLN+FAD
Sbjct: 37 SDDEVMGLYKSWVIQHGKAYNGIGEEEKRFEIFKDNLRFIDEHNSNNNTTYKLGLNKFAD 96
Query: 99 MSHEEFKNKYLGLKPQFPTRRQ-----PSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGS 153
++++E++ K+LG + P RR PS+ +++R LP SVDWR GAV+PVK+QGS
Sbjct: 97 LTNQEYRAKFLGTRTD-PRRRLMKSKIPSSRYAHRAGDNLPDSVDWRDHGAVSPVKDQGS 155
Query: 154 CGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGG 213
CGSCWAFST+A VEGIN+IVSG L SLSEQEL+DCD S++ GCNGGLMDYAF++I+ +GG
Sbjct: 156 CGSCWAFSTIATVEGINKIVSGELVSLSEQELVDCDRSYDAGCNGGLMDYAFQFIMDNGG 215
Query: 214 LHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGT 273
+ E+DYPYL C+ K+ +VV+I GY+DVP N+E +L KA+AHQPVS+AIEA G
Sbjct: 216 IDTEKDYPYLGFNNQCDPTKKNAKVVSIDGYEDVP-NNENALKKAVAHQPVSIAIEAGGR 274
Query: 274 DFQFYSGGVFTGPCGAELDHGVAAVGYG-KSKGSDYIIVKNSWGPKWGERGYIRMKRNTG 332
FQ Y GVF G CG LDHGV AVGYG G DY IV+NSWG WGE GYIRM+RN
Sbjct: 275 AFQLYESGVFNGECGLALDHGVVAVGYGTDDNGQDYWIVRNSWGSNWGENGYIRMERNIN 334
Query: 333 KPEGLCGINKMASIPLKK 350
G CGI AS P+K
Sbjct: 335 ANTGKCGIAMEASYPVKN 352
>gi|225438807|ref|XP_002283263.1| PREDICTED: germination-specific cysteine protease 1-like isoform 1
[Vitis vinifera]
Length = 374
Score = 366 bits (940), Expect = 7e-99, Method: Compositional matrix adjust.
Identities = 175/315 (55%), Positives = 225/315 (71%), Gaps = 5/315 (1%)
Query: 40 SMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADM 99
S ++++ +++ WM+KHGK Y + EK RFEIFK+NLK ID+ N + +Y +GLN FAD+
Sbjct: 38 SEEEVMGMYQWWMAKHGKAYNGLGEKEKRFEIFKDNLKFIDEHNAQNRTYKVGLNRFADL 97
Query: 100 SHEEFKNKYLGL----KPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCG 155
++EE++ YLG K +F + S ++ + LP+SVDWR+ GAV PVK+Q SCG
Sbjct: 98 TNEEYRAIYLGTRSDPKRRFAKLKNASPRYAVMPGEVLPESVDWRETGAVNPVKDQRSCG 157
Query: 156 SCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLH 215
SCWAFSTVAAVEGINQIV+G L SLSEQEL+DCDT ++ GCNGGLMDYAF +I+ +GGL
Sbjct: 158 SCWAFSTVAAVEGINQIVTGELISLSEQELVDCDTEYDMGCNGGLMDYAFDFIIKNGGLD 217
Query: 216 KEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDF 275
E+DYPY +G C + +VV+I GY+DVP DE++L KA+AHQPVSVA+EA G
Sbjct: 218 TEKDYPYTGFDGECNLSGKSSKVVSIDGYEDVPPFDEKALQKAVAHQPVSVAVEAGGRAL 277
Query: 276 QFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKP- 334
Q Y G+FTG CG LDHG+ AVGYG G+DY IV+NSWG WGE GYIRM+RN
Sbjct: 278 QLYVSGIFTGECGTALDHGIVAVGYGTENGTDYWIVRNSWGSSWGENGYIRMERNMADAF 337
Query: 335 EGLCGINKMASIPLK 349
G CGI AS P+K
Sbjct: 338 SGKCGIAMEASYPIK 352
>gi|168006315|ref|XP_001755855.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162693174|gb|EDQ79528.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 454
Score = 366 bits (939), Expect = 1e-98, Method: Compositional matrix adjust.
Identities = 176/309 (56%), Positives = 221/309 (71%), Gaps = 4/309 (1%)
Query: 44 LIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEE 103
L E F +W KHGK Y +EE HR+ ++K+NL++I + +++ SYWLGL +FAD++++E
Sbjct: 42 LSEQFGAWAHKHGKVYSSLEEHAHRYMVWKDNLEYIQRHSEKNRSYWLGLTKFADITNDE 101
Query: 104 FKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTV 163
F+ +Y G + R + F Y D +A P+SVDWRKKGAVT VK+QGSCGSCWAFS +
Sbjct: 102 FRRQYTGTRIDRSKRSKRKTGFRYADSEA-PESVDWRKKGAVTTVKDQGSCGSCWAFSAI 160
Query: 164 AAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYL 223
+VEGIN I +G SLSEQEL+DCD +N GCNGGLMDYAF +I+ +GG+ E DYPY
Sbjct: 161 GSVEGINAIRTGEAVSLSEQELVDCDLEYNQGCNGGLMDYAFDFILENGGIDTENDYPYK 220
Query: 224 MEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVF 283
+G C++ K+ VVTI GY+DVPENDE++L KA+A QPVSVAIEA G DFQ YSGGVF
Sbjct: 221 GLDGRCDNNKKNAHVVTIDGYEDVPENDEEALKKAVAGQPVSVAIEAGGRDFQLYSGGVF 280
Query: 284 TGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPE---GLCGI 340
TG CG +LDHGV AVGYG DY IVKNSWG WGE GY+RM+RN GLCGI
Sbjct: 281 TGECGTDLDHGVLAVGYGSEGSLDYWIVKNSWGEYWGESGYLRMQRNIKDSNHQFGLCGI 340
Query: 341 NKMASIPLK 349
N S +K
Sbjct: 341 NIEPSYAVK 349
>gi|358348957|ref|XP_003638507.1| Cysteine proteinase [Medicago truncatula]
gi|355504442|gb|AES85645.1| Cysteine proteinase [Medicago truncatula]
Length = 362
Score = 365 bits (938), Expect = 1e-98, Method: Compositional matrix adjust.
Identities = 187/347 (53%), Positives = 232/347 (66%), Gaps = 11/347 (3%)
Query: 8 KLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLH 67
KLLL+ LS++L S + DF + ++S + L +L+E W S H + + EK
Sbjct: 5 KLLLIVLSIALVLVVSESFDFH-----DKDVSSDESLWDLYERWRSHH-TVSRNLNEKQK 58
Query: 68 RFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQ----FPTRRQPSA 123
RF +FK N+ H+ NK Y L LN+FADM++ EFK Y G K F + S
Sbjct: 59 RFNVFKSNVMHVHNTNKMDKPYKLKLNKFADMTNHEFKTTYAGSKVNHHRMFRGTPRVSG 118
Query: 124 EFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQ 183
F Y + P SVDWRKKGAVT VK+QG CGSCWAFSTV AVEGINQI + L LSEQ
Sbjct: 119 TFMYENFTKAPASVDWRKKGAVTDVKDQGQCGSCWAFSTVVAVEGINQIKTNRLVPLSEQ 178
Query: 184 ELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISG 243
ELIDCD N GCNGGLM+YAF+YI GG+ E YPY +G+C+ KE + V+I G
Sbjct: 179 ELIDCDNQENQGCNGGLMEYAFEYIKQKGGITTESYYPYTANDGSCDATKENVPAVSIDG 238
Query: 244 YQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKS 303
++ VP NDE +LLKA+A+QPVSVAI+A G+DFQFYS GVFTG CG EL+HGVA VGYG +
Sbjct: 239 HETVPANDEDALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGDCGKELNHGVAIVGYGTT 298
Query: 304 -KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
G++Y IV+NSWG +WGE+GYIRMKRN EGLCGI AS P+K
Sbjct: 299 VDGTNYWIVRNSWGAEWGEQGYIRMKRNVSNKEGLCGIAMEASYPVK 345
>gi|218181|dbj|BAA14402.1| oryzain alpha precursor [Oryza sativa Japonica Group]
Length = 458
Score = 365 bits (937), Expect = 2e-98, Method: Compositional matrix adjust.
Identities = 176/328 (53%), Positives = 228/328 (69%), Gaps = 7/328 (2%)
Query: 27 DFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKE- 85
D SIV Y S ++ L+ W ++HGK+Y + E+ R+ F++NL++ID+ N
Sbjct: 22 DMSIVSYGER---SEEEARRLYAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDEHNAAA 78
Query: 86 ---VTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKK 142
V S+ LGLN FAD+++EE+++ YLGL+ + R+ S + D +ALP+SVDWR K
Sbjct: 79 DAGVHSFRLGLNRFADLTNEEYRDTYLGLRNKPRRERKVSDRYLAADNEALPESVDWRTK 138
Query: 143 GAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMD 202
GAV +K+QG CGSCWAFS +AAVE INQIV+G+L SLSEQEL+DCDTS+N GCNGGLMD
Sbjct: 139 GAVAEIKDQGGCGSCWAFSAIAAVEDINQIVTGDLISLSEQELVDCDTSYNEGCNGGLMD 198
Query: 203 YAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQ 262
YAF +I+ +GG+ E+DYPY ++ C+ ++ +VVTI Y+DV N E SL KA+ +Q
Sbjct: 199 YAFDFIINNGGIDTEDDYPYKGKDERCDVNRKNAKVVTIDSYEDVTPNSETSLQKAVRNQ 258
Query: 263 PVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGER 322
PVSVAIEA G FQ YS G+FTG CG LDHGVAAVGYG G DY IV+NSWG WGE
Sbjct: 259 PVSVAIEAGGRAFQLYSSGIFTGKCGTALDHGVAAVGYGTENGKDYWIVRNSWGKSWGES 318
Query: 323 GYIRMKRNTGKPEGLCGINKMASIPLKK 350
GY+RM+RN G CGI S PLKK
Sbjct: 319 GYVRMERNIKASSGKCGIAVEPSYPLKK 346
>gi|2511689|emb|CAB17074.1| cysteine proteinase precursor [Phaseolus vulgaris]
Length = 364
Score = 363 bits (933), Expect = 6e-98, Method: Compositional matrix adjust.
Identities = 177/346 (51%), Positives = 231/346 (66%), Gaps = 17/346 (4%)
Query: 9 LLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHR 68
LLLLS + S A SI+ YS +++++++E W+ KH K Y ++EK R
Sbjct: 9 LLLLSFTFSH------ATAMSIINYSE------NEVMDMYEEWLVKHRKVYNGLDEKEKR 56
Query: 69 FEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTR----RQPSAE 124
F++FK+NL I N + +Y LGLN+FAD+++EE++ YLG + R +
Sbjct: 57 FQVFKDNLGFIQDHNAQNNTYTLGLNKFADITNEEYRAMYLGTRTDAKRRVMKTQNTGHR 116
Query: 125 FSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQE 184
++Y LP VDWR KGAV P+K+QG+CGSCWAFSTVAAVEGIN IV+G SLSEQE
Sbjct: 117 YAYNSGDQLPVHVDWRLKGAVGPIKDQGNCGSCWAFSTVAAVEGINNIVTGEFVSLSEQE 176
Query: 185 LIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGY 244
L+DCD ++ GCNGGLMDYAF++I+ +GG+ EEDYPY +GTC+ K++ +VV I GY
Sbjct: 177 LVDCDREYDEGCNGGLMDYAFQFIIQNGGIDTEEDYPYQGIDGTCDQTKKKTKVVQIDGY 236
Query: 245 QDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSK 304
+DVP N+E +L KA++HQPVSVAIEASG Q Y GVFTG CG LDHGV VGYG
Sbjct: 237 EDVPSNNENALKKAVSHQPVSVAIEASGRALQLYQSGVFTGKCGTALDHGVVVVGYGTEN 296
Query: 305 GSDYIIVKNSWGPKWGERGYIRMKRNT-GKPEGLCGINKMASIPLK 349
G DY +V+NSWG WGE GY +M+RN EG CGI S P+K
Sbjct: 297 GVDYWLVRNSWGTGWGEDGYFKMERNVRSTSEGKCGIAMDCSYPVK 342
>gi|363814535|ref|NP_001242660.1| uncharacterized protein LOC100807362 precursor [Glycine max]
gi|255636658|gb|ACU18666.1| unknown [Glycine max]
Length = 367
Score = 363 bits (933), Expect = 6e-98, Method: Compositional matrix adjust.
Identities = 179/347 (51%), Positives = 237/347 (68%), Gaps = 10/347 (2%)
Query: 11 LLSLSLSLFACSSLAHDFSIVGYSPEHLT-----SMDKLIELFESWMSKHGKTYKCIEEK 65
+L + ++ A SS A D SI+ Y H S ++++ ++E W+ KHGK Y +EEK
Sbjct: 11 ILIVLFTVLAVSS-ALDMSIISYDRSHADKSGWKSDEEVMSIYEEWLVKHGKVYNAVEEK 69
Query: 66 LHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRR--QPSA 123
RF+IFK+NL I++ N +Y +GLN F+D+S+EE+++KYLG K P+R +PS
Sbjct: 70 EKRFQIFKDNLNFIEEHNAVNRTYKVGLNRFSDLSNEEYRSKYLGTKID-PSRMMARPSR 128
Query: 124 EFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQ 183
+S R LP+SVDWRK+GAV VKNQ C CWAFS +AAVEGIN+IV+GNLT+LSEQ
Sbjct: 129 RYSPRVADNLPESVDWRKEGAVVRVKNQSECEGCWAFSAIAAVEGINKIVTGNLTALSEQ 188
Query: 184 ELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISG 243
EL+DCD + N GC+GGL+DYAF++I+ +GG+ EEDYP+ +G C+ K VTI G
Sbjct: 189 ELLDCDRTVNAGCSGGLVDYAFEFIINNGGIDTEEDYPFQGADGICDQYKINARAVTIDG 248
Query: 244 YQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKS 303
Y+ VP DE +L KA+A+QPVSVAIEA G +FQ Y G+FTG CG +DHGV AVGYG
Sbjct: 249 YERVPAYDELALKKAVANQPVSVAIEAYGKEFQLYESGIFTGTCGTSIDHGVTAVGYGTE 308
Query: 304 KGSDYIIVKNSWGPKWGERGYIRMKRNTGK-PEGLCGINKMASIPLK 349
G DY IVKNSWG WGE GY+ M+RN + G CGI + P+K
Sbjct: 309 NGIDYWIVKNSWGENWGEAGYVGMERNIAEDTAGKCGIAILTLYPIK 355
>gi|148907299|gb|ABR16787.1| unknown [Picea sitchensis]
Length = 372
Score = 363 bits (933), Expect = 6e-98, Method: Compositional matrix adjust.
Identities = 186/328 (56%), Positives = 231/328 (70%), Gaps = 10/328 (3%)
Query: 32 GYSPEHLTSMDKLIELFESWMSKHGKTYKC-IEEKLHRFEIFKENLKHIDQRNKEVTSYW 90
G++ E L S + L L++ W +H T +E RFEIFKEN+KHID NK+ Y
Sbjct: 29 GFTDEELESDESLRGLYDKWALQHRSTRSLDSDEHARRFEIFKENVKHIDSVNKKDGPYK 88
Query: 91 LGLNEFADMSHEEFKNKYLGLKPQ-----FPTRRQPSAEFSYRDVKALPKSVDWRKKGAV 145
LGLN+FAD+S+EEFK ++ K + R S F Y++ K LP S+DWRKKGAV
Sbjct: 89 LGLNKFADLSNEEFKAMHMTTKMEKHKSLRGDRGVESGSFMYQNSKRLPASIDWRKKGAV 148
Query: 146 TPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAF 205
TPVKNQG CGSCWAFST+A+VEGIN I +G L SLSEQ+L+DC + N GCNGGLMD AF
Sbjct: 149 TPVKNQGQCGSCWAFSTIASVEGINYIKTGKLVSLSEQQLVDC-SKENAGCNGGLMDNAF 207
Query: 206 KYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVT--ISGYQDVPENDEQSLLKALAHQP 263
+YI+ +GG+ E++YPY E G C K E + + I G++DVP N+E +L KA+AHQP
Sbjct: 208 QYIIDNGGIVTEDEYPYTAEAGECSTTKIESKSIATIIDGFEDVPANNEGALKKAVAHQP 267
Query: 264 VSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKS-KGSDYIIVKNSWGPKWGER 322
VS+AIEASG DFQFYS GVFTG CG ELDHGV VGYGKS +G +Y IV+NSWGP+WGE+
Sbjct: 268 VSIAIEASGHDFQFYSTGVFTGKCGTELDHGVVVVGYGKSPEGINYWIVRNSWGPEWGEQ 327
Query: 323 GYIRMKRNTGKPEGLCGINKMASIPLKK 350
GYIRM+R EG CGI+ AS P KK
Sbjct: 328 GYIRMQRGIEATEGKCGISMQASYPTKK 355
>gi|32396020|gb|AAP41847.1| senescence-associated cysteine protease [Anthurium andraeanum]
Length = 460
Score = 363 bits (932), Expect = 7e-98, Method: Compositional matrix adjust.
Identities = 177/312 (56%), Positives = 222/312 (71%), Gaps = 8/312 (2%)
Query: 47 LFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT--SYWLGLNEFADMSHEEF 104
L+E W+ +GK Y + EK RFEIF +NL++ID N+ SY LGL FAD+++EE+
Sbjct: 37 LYEGWLVGNGKAYNLLGEKERRFEIFWDNLRYIDDHNRAENNHSYTLGLTRFADLTNEEY 96
Query: 105 KNKYLGLKP-QFPTRRQPSAEFSYRDVKA----LPKSVDWRKKGAVTPVKNQGSCGSCWA 159
++ YLG+KP Q RR A RD+ A LP+ VDWR+KGAV P+K+QG CGSCWA
Sbjct: 97 RSTYLGVKPGQVRPRRANRAPGRGRDLSANGDDLPQKVDWREKGAVAPIKDQGGCGSCWA 156
Query: 160 FSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEED 219
FSTVAAVEGINQIV+G+L LSEQEL+DCDT++N GCNGGLMDYAF++I+++GG+ EED
Sbjct: 157 FSTVAAVEGINQIVTGDLIVLSEQELVDCDTAYNEGCNGGLMDYAFQFIISNGGIDTEED 216
Query: 220 YPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYS 279
YPY +G C+ ++ +VV+I Y+DV ENDE +L A+AHQPVSVAIE G FQ Y
Sbjct: 217 YPYKERDGLCDPNRKNAKVVSIDSYEDVLENDEHALKTAVAHQPVSVAIEGGGRSFQLYK 276
Query: 280 GGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRN-TGKPEGLC 338
G+F G CG +LDHGV AVGYG G DY IV+NSWG WGE GYIRM+RN G C
Sbjct: 277 SGIFDGRCGIDLDHGVVAVGYGTESGKDYWIVRNSWGKSWGEAGYIRMERNLPSSSSGKC 336
Query: 339 GINKMASIPLKK 350
GI S P+KK
Sbjct: 337 GIAIEPSYPIKK 348
>gi|168058022|ref|XP_001781010.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162667567|gb|EDQ54194.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 457
Score = 363 bits (931), Expect = 7e-98, Method: Compositional matrix adjust.
Identities = 177/321 (55%), Positives = 223/321 (69%), Gaps = 7/321 (2%)
Query: 35 PEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLN 94
P + L F +W KHGK Y EE+ HRF ++K+NL++I + +++ SYWLGL
Sbjct: 32 PTDVGKDQLLAGQFAAWAHKHGKVYSAAEERAHRFLVWKDNLEYIQRHSEKNLSYWLGLT 91
Query: 95 EFADMSHEEFKNKYLGLKPQFPTR----RQPSAEFSYRDVKALPKSVDWRKKGAVTPVKN 150
+FAD+++EEF+ +Y G + R R + F Y + +A PKS+DWR+KGAVT VK+
Sbjct: 92 KFADLTNEEFRRQYTGTRIDRSRRLKKGRNATGSFRYANSEA-PKSIDWREKGAVTSVKD 150
Query: 151 QGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVA 210
QGSCGSCWAFS V +VEGIN I +G+ SLS QEL+DCD +N GCNGGLMDYAF +++
Sbjct: 151 QGSCGSCWAFSAVGSVEGINAIRTGDAISLSVQELVDCDKKYNQGCNGGLMDYAFDFVIQ 210
Query: 211 SGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEA 270
+GG+ E+DYPY +G C+ K VVTI Y+DVPENDE++L KA+A QPVSVAIEA
Sbjct: 211 NGGIDTEKDYPYQGYDGRCDVNKMNARVVTIDSYEDVPENDEEALKKAVAGQPVSVAIEA 270
Query: 271 SGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRN 330
G DFQ YSGGVFTG CG +LDHGV AVGYG KG DY IVKNSWG WGE GY+RM+RN
Sbjct: 271 GGRDFQLYSGGVFTGRCGTDLDHGVLAVGYGSEKGLDYWIVKNSWGEYWGESGYLRMQRN 330
Query: 331 TGKPE--GLCGINKMASIPLK 349
GLCGIN S +K
Sbjct: 331 LKDDNGYGLCGINIEPSYAVK 351
>gi|57282619|emb|CAE54307.1| cysteine proteinase [Gossypium hirsutum]
Length = 372
Score = 363 bits (931), Expect = 9e-98, Method: Compositional matrix adjust.
Identities = 178/318 (55%), Positives = 229/318 (72%), Gaps = 9/318 (2%)
Query: 40 SMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRN-KEVTSYWLGLNEFAD 98
S D+++ L++SW+ +HGK Y I E+ RFEIFK+NL+ ID+ N T+Y LGLN+FAD
Sbjct: 38 SDDEVMGLYKSWVIQHGKAYNGIGEEEKRFEIFKDNLRFIDEHNSNNNTTYKLGLNKFAD 97
Query: 99 MSHEEFKNKYLGLKPQFPTRRQ-----PSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGS 153
++++E++ K+LG + P RR PS+ +++R LP SV+WR GAV+ VK+QGS
Sbjct: 98 LTNQEYRAKFLGTRTD-PRRRLMKSKIPSSRYAHRAGDNLPDSVNWRDHGAVSRVKDQGS 156
Query: 154 CGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGG 213
CGSCWAFS +AAVEGIN+IVSG L SLSEQEL+DCD S++ GCNGGLMDYAF++I+ +GG
Sbjct: 157 CGSCWAFSAIAAVEGINKIVSGELISLSEQELVDCDRSYDAGCNGGLMDYAFQFIIDNGG 216
Query: 214 LHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGT 273
+ E+DYPYL C+ K+ +VV+I GY+DVP N+E +L KA+AHQPVS+AIEA G
Sbjct: 217 IDTEKDYPYLGFNNQCDPTKKNAKVVSIDGYEDVP-NNENALKKAVAHQPVSIAIEAGGR 275
Query: 274 DFQFYSGGVFTGPCGAELDHGVAAVGYGK-SKGSDYIIVKNSWGPKWGERGYIRMKRNTG 332
FQ Y GVF G CG LDHGV AVGYG G DY IV+NSWG WGE GYIRM+RN
Sbjct: 276 AFQLYESGVFNGECGLALDHGVVAVGYGSDDNGQDYWIVRNSWGGNWGENGYIRMERNIN 335
Query: 333 KPEGLCGINKMASIPLKK 350
G CGI AS P+K
Sbjct: 336 ANTGKCGIAMEASYPVKN 353
>gi|38345906|emb|CAE04498.2| OSJNBb0059K02.8 [Oryza sativa Japonica Group]
Length = 458
Score = 363 bits (931), Expect = 9e-98, Method: Compositional matrix adjust.
Identities = 176/328 (53%), Positives = 228/328 (69%), Gaps = 7/328 (2%)
Query: 27 DFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKE- 85
D SIV Y S ++ L+ W ++HGK+Y + E+ R+ F++NL++ID+ N
Sbjct: 22 DMSIVSYGER---SEEEARRLYAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDEHNAAA 78
Query: 86 ---VTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKK 142
V S+ LGLN FAD+++EE+++ YLGL+ + R+ S + D +ALP+SVDWR K
Sbjct: 79 DAGVHSFRLGLNRFADLTNEEYRDTYLGLRNKPRRERKVSDRYLAADNEALPESVDWRTK 138
Query: 143 GAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMD 202
GAV +K+Q GSCWAFS +AAVEGINQIV+G+L SLSEQEL+DCDTS+N GCNGGLMD
Sbjct: 139 GAVAEIKDQEVAGSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDTSYNEGCNGGLMD 198
Query: 203 YAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQ 262
YAF +I+ +GG+ E+DYPY ++ C+ ++ +VVTI Y+DV N E SL KA+A+Q
Sbjct: 199 YAFDFIINNGGIDTEDDYPYKGKDERCDVNRKNAKVVTIDSYEDVTPNSETSLQKAVANQ 258
Query: 263 PVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGER 322
PVSVAIEA G FQ YS G+FTG CG LDHGVAAVGYG G DY IV+NSWG WGE
Sbjct: 259 PVSVAIEAGGRAFQLYSSGIFTGKCGTALDHGVAAVGYGTENGKDYWIVRNSWGKSWGES 318
Query: 323 GYIRMKRNTGKPEGLCGINKMASIPLKK 350
GY+RM+RN G CGI S PLKK
Sbjct: 319 GYVRMERNIKASSGKCGIAVEPSYPLKK 346
>gi|1256830|gb|AAB68374.1| cysteine endopeptidase 1 [Phaseolus vulgaris]
gi|2959418|emb|CAA12118.1| cysteine protease [Phaseolus vulgaris]
Length = 364
Score = 362 bits (930), Expect = 1e-97, Method: Compositional matrix adjust.
Identities = 176/346 (50%), Positives = 232/346 (67%), Gaps = 17/346 (4%)
Query: 9 LLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHR 68
LLLLS + S A SI+ YS +++++++E W+ KH K Y ++EK R
Sbjct: 9 LLLLSFTFSH------ATAMSIINYSE------NEVMDMYEEWLVKHRKVYNGLDEKEKR 56
Query: 69 FEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTR----RQPSAE 124
F++FK+NL I N + +Y LGLN+FAD++++E++ YLG + R +
Sbjct: 57 FQVFKDNLGFIQDHNAQNNTYTLGLNKFADITNKEYRAMYLGTRTDAKRRVMKTQNTGHR 116
Query: 125 FSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQE 184
++Y LP VDWR KGAV P+K+QG+CGSCWAFSTVAAVEGIN IV+G SLSEQE
Sbjct: 117 YAYNSGDQLPVHVDWRLKGAVGPIKDQGNCGSCWAFSTVAAVEGINNIVTGEFVSLSEQE 176
Query: 185 LIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGY 244
L+DCD ++ GCNGGLMDYAF++I+ +GG+ EEDYPY +GTC++ K++ +VV I GY
Sbjct: 177 LVDCDREYDEGCNGGLMDYAFQFIIQNGGIDTEEDYPYQGIDGTCDETKKKTKVVQIDGY 236
Query: 245 QDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSK 304
+DVP N+E +L KA++HQPVSVAIEASG Q Y GVFTG CG LDHGV VGYG
Sbjct: 237 EDVPSNNENALKKAVSHQPVSVAIEASGRALQLYQSGVFTGKCGTALDHGVVVVGYGTEN 296
Query: 305 GSDYIIVKNSWGPKWGERGYIRMKRNT-GKPEGLCGINKMASIPLK 349
G DY +V+NSWG WGE GY +M+RN EG CGI S P+K
Sbjct: 297 GVDYWLVRNSWGTGWGEDGYFKMERNVRSTSEGKCGIAMDCSYPVK 342
>gi|388517427|gb|AFK46775.1| unknown [Medicago truncatula]
Length = 362
Score = 362 bits (929), Expect = 1e-97, Method: Compositional matrix adjust.
Identities = 186/347 (53%), Positives = 231/347 (66%), Gaps = 11/347 (3%)
Query: 8 KLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLH 67
KLLL+ LS++L S + DF + ++S + L +L+E W S H + + EK
Sbjct: 5 KLLLIVLSIALVLVVSESFDFH-----DKDVSSDESLWDLYERWRSHH-TVSRNLNEKQK 58
Query: 68 RFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQ----FPTRRQPSA 123
RF +FK N+ H+ NK Y L LN+FADM++ EFK Y G K F + S
Sbjct: 59 RFNVFKSNVMHVHNTNKMDKPYKLKLNKFADMTNHEFKTTYAGTKVNHHRMFRGTPRVSG 118
Query: 124 EFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQ 183
F Y + P SVDWRKKGAVT VK+QG CGSCWAFSTV AVEGINQI + L LSEQ
Sbjct: 119 TFMYENFTKAPASVDWRKKGAVTDVKDQGQCGSCWAFSTVVAVEGINQIKTNRLVPLSEQ 178
Query: 184 ELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISG 243
ELIDCD N GCNGGLM+YAF+YI GG+ E YPY +G+C+ KE + V+I G
Sbjct: 179 ELIDCDNQENQGCNGGLMEYAFEYIKQKGGVTTESYYPYTANDGSCDATKENVPTVSIDG 238
Query: 244 YQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKS 303
++ VP NDE +LLKA+A+QPVSVAI+A G+DFQFYS GVFTG CG EL+HGVA VGYG +
Sbjct: 239 HETVPANDEDALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGDCGKELNHGVAIVGYGTT 298
Query: 304 -KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
G++Y IV+NSWG +WGE+G IRMKRN EGLCGI AS P+K
Sbjct: 299 VDGTNYWIVRNSWGAEWGEQGCIRMKRNVSNKEGLCGIAMEASYPVK 345
>gi|356559055|ref|XP_003547817.1| PREDICTED: cysteine proteinase RD21a [Glycine max]
Length = 366
Score = 362 bits (929), Expect = 1e-97, Method: Compositional matrix adjust.
Identities = 176/348 (50%), Positives = 227/348 (65%), Gaps = 16/348 (4%)
Query: 7 SKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKL 66
S LL LS +LS +S +I Y+ + +M +E W+ KH K Y + EK
Sbjct: 10 STLLFLSFTLSCAIDTS-----TITNYTDNEVMTM------YEEWLVKHQKVYNGLREKD 58
Query: 67 HRFEIFKENLKHI-DQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTR----RQP 121
RF++FK+NL I + N + +Y LGLN+FADM++EE++ Y G K R +
Sbjct: 59 KRFQVFKDNLGFIQEHNNNQNNTYKLGLNQFADMTNEEYRVMYFGTKSDAKRRLMKTKST 118
Query: 122 SAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLS 181
++Y LP VDWR KGAV P+K+QGSCGSCWAFSTVA VE IN+IV+G SLS
Sbjct: 119 GHRYAYSAGDRLPVHVDWRVKGAVAPIKDQGSCGSCWAFSTVATVEAINKIVTGKFVSLS 178
Query: 182 EQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTI 241
EQEL+DCD ++N GCNGGLMDYAF++I+ +GG+ ++DYPY +G C+ K+ +VV I
Sbjct: 179 EQELVDCDRAYNEGCNGGLMDYAFEFIIQNGGIDTDKDYPYRGFDGICDPTKKNAKVVNI 238
Query: 242 SGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYG 301
G++DVP DE +L KA+AHQPVS+AIEASG D Q Y GVFTG CG LDHGV VGYG
Sbjct: 239 DGFEDVPPYDENALKKAVAHQPVSIAIEASGRDLQLYQSGVFTGKCGTSLDHGVVVVGYG 298
Query: 302 KSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
G DY +V+NSWG WGE GY +M+RN P G CGI AS P+K
Sbjct: 299 SENGVDYWLVRNSWGTGWGEDGYFKMQRNVRTPTGKCGITMEASYPVK 346
>gi|3980198|emb|CAA46863.1| thiolprotease [Pisum sativum]
Length = 464
Score = 362 bits (929), Expect = 1e-97, Method: Compositional matrix adjust.
Identities = 179/354 (50%), Positives = 240/354 (67%), Gaps = 14/354 (3%)
Query: 7 SKLLLLSLSLSLFACSSLAHDFSIVGYSPEHL-----TSMDKLIELFESWMSKHGKTYKC 61
SKL +L ++L+ SLA D I+ Y H + D+++ ++E W+ KHGK Y
Sbjct: 3 SKLTILFITLTFTL--SLALDMCIISYDKTHPDKSTPRTNDQVLTMYEEWLVKHGKNYNA 60
Query: 62 IEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQ- 120
+ EK RFEIFK+NL ID+ N + S+ LGLN FAD+++EE++ ++LG + P RR
Sbjct: 61 LGEKEKRFEIFKDNLGFIDEHNSKNLSFRLGLNRFADLTNEEYRTRFLGTRIN-PNRRNR 119
Query: 121 ----PSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGN 176
+ ++ R LP+SVDWRK+GAV VK+QGSCGSCWAFS +AAVEG+N++ +G+
Sbjct: 120 KVNSQTNRYATRVGDKLPESVDWRKEGAVVGVKDQGSCGSCWAFSAIAAVEGVNKLATGD 179
Query: 177 LTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEM 236
L SLSEQEL+DCDTS+N GCNGGLMDYAF++I+ L EEDYPY +G C+ ++
Sbjct: 180 LISLSEQELVDCDTSYNEGCNGGLMDYAFEFIINMVALTPEEDYPYRAIDGRCDQNRKNA 239
Query: 237 EVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVA 296
+VV+I Y+DVP DE +L KA+A+Q ++VA+E G +FQ Y GVFTG CG LDHGVA
Sbjct: 240 KVVSIDQYEDVPAYDEGALKKAVANQVIAVAVEGGGREFQLYDSGVFTGRCGTALDHGVA 299
Query: 297 AVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPE-GLCGINKMASIPLK 349
AVGYG G DY IV+NSWG WGE GYIR++RN + G CGI S P+K
Sbjct: 300 AVGYGTENGKDYWIVRNSWGGSWGEAGYIRLERNLATSKSGKCGIAIEPSYPIK 353
>gi|238006338|gb|ACR34204.1| unknown [Zea mays]
Length = 465
Score = 362 bits (929), Expect = 2e-97, Method: Compositional matrix adjust.
Identities = 174/334 (52%), Positives = 227/334 (67%), Gaps = 11/334 (3%)
Query: 28 FSIVGYSPEHLTSMDKLIE-----LFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQR 82
SI+ Y+ EH + E L+E W+++HG+ Y + E+ RF +F +NL+ +D
Sbjct: 27 MSIISYNEEHAARGLERTEPEARTLYELWLAEHGRAYNALGERDRRFRVFWDNLRFVDAH 86
Query: 83 NKEVT--SYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYR---DVKALPKSV 137
N+ + LG+N+FAD++++EF+ YLG + RR + YR + LP+SV
Sbjct: 87 NERAAEHGFRLGMNQFADLTNDEFRAAYLGARIPASRRRGTAVGERYRHGGGAEELPESV 146
Query: 138 DWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSF-NNGC 196
DWR+KGAV PVKNQG CGSCWAFS V++VE +NQIV+G + +LSEQEL++C T N+GC
Sbjct: 147 DWREKGAVAPVKNQGQCGSCWAFSAVSSVESVNQIVTGEMVTLSEQELVECSTDGGNSGC 206
Query: 197 NGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLL 256
NGGLMD AF +I+ +GG+ E DYPY +G C+ +E +VV+I G++DVPENDE+SL
Sbjct: 207 NGGLMDAAFDFIIKNGGIDTEGDYPYKAVDGKCDINRENAKVVSIDGFEDVPENDEKSLQ 266
Query: 257 KALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWG 316
KA+AHQPVSVAIEA G +FQ Y GVFTG C LDHGV AVGYG G DY IV+NSWG
Sbjct: 267 KAVAHQPVSVAIEAGGREFQLYKAGVFTGTCTTNLDHGVVAVGYGTENGKDYWIVRNSWG 326
Query: 317 PKWGERGYIRMKRNTGKPEGLCGINKMASIPLKK 350
KWGE GYIRM+RN G CGI MAS P KK
Sbjct: 327 AKWGEDGYIRMERNVNATTGKCGIAMMASYPTKK 360
>gi|414584879|tpg|DAA35450.1| TPA: cysteine protease 1 [Zea mays]
Length = 522
Score = 362 bits (929), Expect = 2e-97, Method: Compositional matrix adjust.
Identities = 174/334 (52%), Positives = 227/334 (67%), Gaps = 11/334 (3%)
Query: 28 FSIVGYSPEHLTSMDKLIE-----LFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQR 82
SI+ Y+ EH + E L+E W+++HG+ Y + E+ RF +F +NL+ +D
Sbjct: 84 MSIISYNEEHAARGLERTEPEARTLYELWLAEHGRAYNALGERDRRFRVFWDNLRFVDAH 143
Query: 83 NKEVT--SYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYR---DVKALPKSV 137
N+ + LG+N+FAD++++EF+ YLG + RR + YR + LP+SV
Sbjct: 144 NERAAEHGFRLGMNQFADLTNDEFRAAYLGARIPASRRRGTAVGERYRHGGGAEELPESV 203
Query: 138 DWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSF-NNGC 196
DWR+KGAV PVKNQG CGSCWAFS V++VE +NQIV+G + +LSEQEL++C T N+GC
Sbjct: 204 DWREKGAVAPVKNQGQCGSCWAFSAVSSVESVNQIVTGEMVTLSEQELVECSTDGGNSGC 263
Query: 197 NGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLL 256
NGGLMD AF +I+ +GG+ E DYPY +G C+ +E +VV+I G++DVPENDE+SL
Sbjct: 264 NGGLMDAAFDFIIKNGGIDTEGDYPYKAVDGKCDINRENAKVVSIDGFEDVPENDEKSLQ 323
Query: 257 KALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWG 316
KA+AHQPVSVAIEA G +FQ Y GVFTG C LDHGV AVGYG G DY IV+NSWG
Sbjct: 324 KAVAHQPVSVAIEAGGREFQLYKAGVFTGTCTTNLDHGVVAVGYGTENGKDYWIVRNSWG 383
Query: 317 PKWGERGYIRMKRNTGKPEGLCGINKMASIPLKK 350
KWGE GYIRM+RN G CGI MAS P KK
Sbjct: 384 AKWGEDGYIRMERNVNATTGKCGIAMMASYPTKK 417
>gi|30141021|dbj|BAC75924.1| cysteine protease-2 [Helianthus annuus]
Length = 362
Score = 362 bits (928), Expect = 2e-97, Method: Compositional matrix adjust.
Identities = 178/323 (55%), Positives = 226/323 (69%), Gaps = 9/323 (2%)
Query: 33 YSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLG 92
+ + L + D L +++E W K + EKL RF +FK N+ H+ + NK Y L
Sbjct: 25 FHEKELETEDNLWDMYERWRHKVATNHG---EKLRRFNVFKSNVLHVHETNKMDKPYKLK 81
Query: 93 LNEFADMSHEEFKNKYLGLKPQFPTR-----RQPSAEFSYRDVKALPKSVDWRKKGAVTP 147
LN+FADM++ EF++ Y G K R R S F Y +V+++P SVDWRKKGAV P
Sbjct: 82 LNKFADMTNHEFRSVYAGSKIHHHDRSLQGDRSGSKTFMYANVESVPTSVDWRKKGAVAP 141
Query: 148 VKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKY 207
VK+QG CGSCWAFSTVAAVEGIN+I + L SLSEQEL+DCDT N GCNGGLMD AF +
Sbjct: 142 VKDQGQCGSCWAFSTVAAVEGINKIKTNELVSLSEQELVDCDTLENQGCNGGLMDLAFDF 201
Query: 208 IVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVA 267
I +GGL +E+ YPY E+G C+ K VV+I G++DVP+NDEQSL+KA+A+QPV+VA
Sbjct: 202 IKKTGGLTREDAYPYAAEDGKCDSNKMNSPVVSIDGHEDVPKNDEQSLMKAVANQPVAVA 261
Query: 268 IEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKS-KGSDYIIVKNSWGPKWGERGYIR 326
I+A +DFQFYS GVFTG CG +LDHGVAAVGYG + G+ Y IV+NSWG +WGE+GYIR
Sbjct: 262 IDAGSSDFQFYSEGVFTGKCGTQLDHGVAAVGYGTTLDGTKYWIVRNSWGSEWGEKGYIR 321
Query: 327 MKRNTGKPEGLCGINKMASIPLK 349
M+R GLCGI AS P+K
Sbjct: 322 MERGISDKRGLCGIAMEASYPIK 344
>gi|50355615|dbj|BAD29956.1| cysteine protease [Daucus carota]
Length = 423
Score = 362 bits (928), Expect = 2e-97, Method: Compositional matrix adjust.
Identities = 176/300 (58%), Positives = 217/300 (72%), Gaps = 2/300 (0%)
Query: 52 MSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT-SYWLGLNEFADMSHEEFKNKYLG 110
+ KH K Y + K RFEIFK+NL+ ID+ NK V S+ LGLN+FAD+S+EE+K+ +LG
Sbjct: 11 LVKHHKNYNALGAKEKRFEIFKDNLRFIDEHNKGVNQSFKLGLNKFADLSNEEYKSMFLG 70
Query: 111 LKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGIN 170
+ + S F Y LP+SVDWR+KGAV PVK+QG CGSCWAFSTVAAVEGIN
Sbjct: 71 GRMVRDRKGFESDRFKYGVGDELPQSVDWREKGAVAPVKDQGQCGSCWAFSTVAAVEGIN 130
Query: 171 QIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCE 230
QI +G+L SLSEQEL+DCD FN GCNGG MDYAF++IV +GG+ E+DYPY +G C+
Sbjct: 131 QIATGDLISLSEQELVDCDKGFNQGCNGGFMDYAFEFIVKNGGIDTEDDYPYKGVDGQCD 190
Query: 231 DKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAE 290
++ +VVTI+G++DVP+NDE+SL KA+AHQPVSVAIEA G FQ Y G+F G CG +
Sbjct: 191 QNRKNAKVVTINGFEDVPQNDEKSLKKAVAHQPVSVAIEAGGRAFQLYESGIFNGLCGTD 250
Query: 291 LDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPE-GLCGINKMASIPLK 349
LDHGV AVGYG G DY IV+NSWGP WGE GYIR++RN G CGI S P K
Sbjct: 251 LDHGVVAVGYGTEDGKDYWIVRNSWGPNWGENGYIRLERNVASTNTGKCGIAMQPSYPTK 310
>gi|217073894|gb|ACJ85307.1| unknown [Medicago truncatula]
gi|388507498|gb|AFK41815.1| unknown [Medicago truncatula]
Length = 362
Score = 362 bits (928), Expect = 2e-97, Method: Compositional matrix adjust.
Identities = 186/347 (53%), Positives = 231/347 (66%), Gaps = 11/347 (3%)
Query: 8 KLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLH 67
KLLL+ LS++L S + DF + ++S + L +L+E W S H + + EK
Sbjct: 5 KLLLIVLSIALVLVVSESFDFH-----DKDVSSDESLWDLYERWRSHH-TVSRNLNEKQK 58
Query: 68 RFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQ----FPTRRQPSA 123
RF +FK N+ H+ NK Y L LN+FADM++ EFK Y G K F + S
Sbjct: 59 RFNVFKSNVMHVHNTNKMDKPYKLKLNKFADMTNHEFKTTYAGSKVNHHRMFRGTPRVSG 118
Query: 124 EFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQ 183
F Y + P SVDWRKKGAVT VK+QG CGSCWAFSTV AVEGINQI + L LSEQ
Sbjct: 119 TFMYENFTKAPASVDWRKKGAVTDVKDQGQCGSCWAFSTVVAVEGINQIKTNRLVPLSEQ 178
Query: 184 ELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISG 243
ELIDCD N GCNGGLM+YAF+YI GG+ E YPY +G+C+ KE + V+I G
Sbjct: 179 ELIDCDNQENQGCNGGLMEYAFEYIKQKGGVTTESYYPYTANDGSCDATKENVPTVSIDG 238
Query: 244 YQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKS 303
++ VP NDE +LLKA+A+QPVSVAI+A G+DFQFYS GVFTG CG EL+HGVA VGYG +
Sbjct: 239 HETVPANDEDALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGDCGKELNHGVAIVGYGTT 298
Query: 304 -KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
G++Y IV+NSWG +WGE+G IRMKRN EGLCGI AS P+K
Sbjct: 299 VDGTNYWIVRNSWGAEWGEQGCIRMKRNVSNKEGLCGIAMEASYPVK 345
>gi|224133760|ref|XP_002321654.1| predicted protein [Populus trichocarpa]
gi|222868650|gb|EEF05781.1| predicted protein [Populus trichocarpa]
Length = 362
Score = 362 bits (928), Expect = 2e-97, Method: Compositional matrix adjust.
Identities = 185/347 (53%), Positives = 234/347 (67%), Gaps = 11/347 (3%)
Query: 8 KLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLH 67
K L ++LSL+L + + DF + L S + L +L+E W S H ++EK
Sbjct: 5 KFLFVALSLALVLGITESLDFH-----EKDLESEESLWDLYERWRSHH-TVSTSLDEKHK 58
Query: 68 RFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQ----FPTRRQPSA 123
RF +FKEN+ H+ + NK Y L LN+FADM++ EF++ Y G K + F + +
Sbjct: 59 RFNVFKENVMHVHKTNKMGKPYKLKLNKFADMTNHEFRSVYAGSKVKHHRMFRGTTRGNG 118
Query: 124 EFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQ 183
F Y V+ +P SVDWRKKGAVT VK+QG CGSCWAFST+ AVEGIN I + L SLSEQ
Sbjct: 119 SFMYGKVEKVPTSVDWRKKGAVTAVKDQGQCGSCWAFSTIVAVEGINYIKTNELVSLSEQ 178
Query: 184 ELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISG 243
EL+DCDT+ N GCNGGLM+YAF++I G+ E YPY E+G C+ KE V+I G
Sbjct: 179 ELVDCDTTENQGCNGGLMEYAFEFIKKKRGITTESTYPYKAEDGHCDAAKENNPAVSIDG 238
Query: 244 YQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKS 303
Y+ VPENDE +LLKA A+QPVSVAI+A G+DFQFYS GVF G CG ELDHGVA VGYG +
Sbjct: 239 YEKVPENDEDALLKAAANQPVSVAIDAGGSDFQFYSEGVFIGECGTELDHGVAVVGYGTT 298
Query: 304 -KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
G+ Y IV+NSWGP+WGE+GYIRM+R EGLCGI AS P+K
Sbjct: 299 LDGTKYWIVRNSWGPEWGEKGYIRMQRGISDKEGLCGIAMEASYPIK 345
>gi|226501480|ref|NP_001150266.1| cysteine protease 1 precursor [Zea mays]
gi|195637948|gb|ACG38442.1| cysteine protease 1 precursor [Zea mays]
Length = 462
Score = 361 bits (926), Expect = 3e-97, Method: Compositional matrix adjust.
Identities = 173/334 (51%), Positives = 227/334 (67%), Gaps = 11/334 (3%)
Query: 28 FSIVGYSPEHLTSMDKLIE-----LFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQR 82
SI+ Y+ EH + E L+E W+++HG+ Y + E+ RF +F +NL+ +D
Sbjct: 24 MSIISYNEEHAARGLERTEPEARTLYELWLAEHGRAYNALGERDRRFRVFWDNLRFVDAH 83
Query: 83 NKEVT--SYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYR---DVKALPKSV 137
N+ + LG+N+FAD++++EF+ YLG + RR + YR + LP+SV
Sbjct: 84 NERAAEHGFRLGMNQFADLTNDEFRAAYLGARIPAARRRGTAVGERYRHGGGAEELPESV 143
Query: 138 DWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSF-NNGC 196
DWR+KGAV PVKNQG CGSCWAFS V++VE +NQIV+G + +LSEQEL++C T N+GC
Sbjct: 144 DWREKGAVAPVKNQGQCGSCWAFSAVSSVESVNQIVTGEMVTLSEQELVECSTDGGNSGC 203
Query: 197 NGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLL 256
NGGLMD AF +I+ +GG+ E DYPY +G C+ +E +VV+I G++DVPENDE+SL
Sbjct: 204 NGGLMDAAFDFIIKNGGIDTEGDYPYKAVDGKCDINRENAKVVSIDGFEDVPENDEKSLQ 263
Query: 257 KALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWG 316
KA+AHQPVSVAIEA G +FQ Y GVF+G C LDHGV AVGYG G DY IV+NSWG
Sbjct: 264 KAVAHQPVSVAIEAGGREFQLYKAGVFSGTCTTNLDHGVVAVGYGTENGKDYWIVRNSWG 323
Query: 317 PKWGERGYIRMKRNTGKPEGLCGINKMASIPLKK 350
KWGE GYIRM+RN G CGI MAS P KK
Sbjct: 324 AKWGEDGYIRMERNVNATTGKCGIAMMASYPTKK 357
>gi|255567869|ref|XP_002524912.1| cysteine protease, putative [Ricinus communis]
gi|223535747|gb|EEF37409.1| cysteine protease, putative [Ricinus communis]
Length = 366
Score = 361 bits (926), Expect = 3e-97, Method: Compositional matrix adjust.
Identities = 179/350 (51%), Positives = 239/350 (68%), Gaps = 13/350 (3%)
Query: 12 LSLSLSLFACSSLAHDFSIVGYSPEH-----LTSMDKLIELFESWMSKHGKTYKCIEEKL 66
LS L LF S A D SI+ ++ H S +++I ++ W++KH KTY + E+
Sbjct: 7 LSTLLFLFFTLSSAWDMSILSHNHGHHHQSSWRSDNEVISMYNWWLAKHSKTYNKLGERE 66
Query: 67 HRFEIFKENLKHIDQRNKEVT-SYWLGLNEFADMSHEEFKNKYLGLKPQFPTRR-----Q 120
RFEIFK NL+ ID+ N +Y +GL FAD+++EE++ K+LG K P RR
Sbjct: 67 KRFEIFKNNLRFIDEHNNSKNRTYKVGLTRFADLTNEEYRAKFLGTKSD-PKRRLMKSKN 125
Query: 121 PSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSL 180
PS ++++ LP+S+DWR+ GAV+ +K+QGSCGSCWAFST+AAVEG+N+IV+G L SL
Sbjct: 126 PSQRYAFKAGDVLPESIDWRQSGAVSAIKDQGSCGSCWAFSTIAAVEGVNKIVTGELISL 185
Query: 181 SEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVT 240
SEQEL+DCD S+N GCNGGLMD AF++I+ +GG+ ++DYPY +G C+ K + + VT
Sbjct: 186 SEQELVDCDRSYNAGCNGGLMDNAFQFIINNGGIDTDKDYPYQAVDGKCDTTKVKNKAVT 245
Query: 241 ISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGY 300
I G++DV DE +L KA+AHQPVSVAIEASG QFY GVFTG CG+ LDHGV VGY
Sbjct: 246 IDGFEDVMAFDEMALQKAVAHQPVSVAIEASGMALQFYQSGVFTGECGSALDHGVVIVGY 305
Query: 301 GKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKP-EGLCGINKMASIPLK 349
G G DY +V+NSWG WGE GYI+M+RN G CGI +S P+K
Sbjct: 306 GTEDGIDYWLVRNSWGRDWGENGYIKMQRNVVDTFTGKCGIAMESSYPIK 355
>gi|226507950|ref|NP_001151278.1| LOC100284911 precursor [Zea mays]
gi|195645488|gb|ACG42212.1| vignain precursor [Zea mays]
Length = 376
Score = 360 bits (924), Expect = 6e-97, Method: Compositional matrix adjust.
Identities = 180/325 (55%), Positives = 221/325 (68%), Gaps = 11/325 (3%)
Query: 33 YSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLG 92
+ E L S + L L+E W +H + + +K RF +FK N++ I + N+ Y L
Sbjct: 34 FGAEDLASEEALWALYERWRGRHALA-RDLGDKARRFNVFKANVRLIHEFNRRDEPYKLR 92
Query: 93 LNEFADMSHEEFKNKYLGLK----PQFPTRRQ---PSAEFSYRDVKALPKSVDWRKKGAV 145
LN F DM+ +EF+ Y G + F RQ SA F Y D + +P SVDWR+KGAV
Sbjct: 93 LNRFGDMTADEFRRHYAGSRVAHHRMFRGDRQGSSASASFMYADARDVPASVDWRQKGAV 152
Query: 146 TPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAF 205
T VK+QG CGSCWAFST+AAVEGIN I + NLTSLSEQ+L+DCDT N GCNGGLMDYAF
Sbjct: 153 TDVKDQGQCGSCWAFSTIAAVEGINAIKTKNLTSLSEQQLVDCDTKANAGCNGGLMDYAF 212
Query: 206 KYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVS 265
+YI GG+ E+ YPY + +C KK VVTI GY+DVP NDE +L KA+AHQPVS
Sbjct: 213 QYIAKHGGVAAEDAYPYRARQASC--KKSPAPVVTIDGYEDVPANDESALKKAVAHQPVS 270
Query: 266 VAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYG-KSKGSDYIIVKNSWGPKWGERGY 324
VAIEASG+ FQFYS GVF+G CG ELDHGV AVGYG + G+ Y +VKNSWGP+WGE+GY
Sbjct: 271 VAIEASGSHFQFYSEGVFSGRCGTELDHGVTAVGYGVTADGTKYWLVKNSWGPEWGEKGY 330
Query: 325 IRMKRNTGKPEGLCGINKMASIPLK 349
IRM R+ EG CGI AS P+K
Sbjct: 331 IRMARDVAAKEGHCGIAMEASYPVK 355
>gi|414870137|tpg|DAA48694.1| TPA: vignain [Zea mays]
Length = 484
Score = 360 bits (924), Expect = 6e-97, Method: Compositional matrix adjust.
Identities = 180/326 (55%), Positives = 222/326 (68%), Gaps = 12/326 (3%)
Query: 33 YSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLG 92
+ E L S + L L+E W +H + + +K RF +FK N++ I + N+ Y L
Sbjct: 141 FGAEDLASEEALWALYERWRGRHALA-RDLGDKARRFNVFKANVRLIHEFNRRDEPYKLR 199
Query: 93 LNEFADMSHEEFKNKYLGLK----PQFPTRRQPSA----EFSYRDVKALPKSVDWRKKGA 144
LN F DM+ +EF+ Y G + F RQ S+ F Y D + +P SVDWR+KGA
Sbjct: 200 LNRFGDMTADEFRRHYAGSRVAHHRMFRGDRQGSSASASSFMYADARDVPASVDWRQKGA 259
Query: 145 VTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYA 204
VT VK+QG CGSCWAFST+AAVEGIN I + NLTSLSEQ+L+DCDT N GCNGGLMDYA
Sbjct: 260 VTDVKDQGQCGSCWAFSTIAAVEGINAIKTKNLTSLSEQQLVDCDTKANAGCNGGLMDYA 319
Query: 205 FKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPV 264
F+YI GG+ E+ YPY + +C KK VVTI GY+DVP NDE +L KA+AHQPV
Sbjct: 320 FQYIAKHGGVAAEDAYPYRARQASC--KKSPAPVVTIDGYEDVPANDESALKKAVAHQPV 377
Query: 265 SVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYG-KSKGSDYIIVKNSWGPKWGERG 323
SVAIEASG+ FQFYS GVF+G CG ELDHGVAAVGYG + G+ Y +VKNSWGP+WGE+G
Sbjct: 378 SVAIEASGSHFQFYSEGVFSGRCGTELDHGVAAVGYGVTADGTKYWLVKNSWGPEWGEKG 437
Query: 324 YIRMKRNTGKPEGLCGINKMASIPLK 349
YIRM R+ EG CGI AS P+K
Sbjct: 438 YIRMARDVAAKEGHCGIAMEASYPVK 463
>gi|90399361|emb|CAJ86180.1| H0212B02.7 [Oryza sativa Indica Group]
Length = 470
Score = 360 bits (924), Expect = 6e-97, Method: Compositional matrix adjust.
Identities = 178/340 (52%), Positives = 229/340 (67%), Gaps = 19/340 (5%)
Query: 27 DFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKE- 85
D SIV Y S ++ L+ W ++HGK Y + E+ R+ F++NL++ID+ N
Sbjct: 22 DMSIVSYGER---SEEEARRLYAEWKAEHGKNYNAVGEEERRYAAFRDNLRYIDEHNAAA 78
Query: 86 ---VTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKK 142
V S+ LGLN FAD+++EE+++ YLGL+ + R+ S + D +ALP+SVDWR K
Sbjct: 79 DAGVHSFRLGLNRFADLTNEEYRDTYLGLRNKPRRERKVSDRYLAADNEALPESVDWRTK 138
Query: 143 GAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMD 202
GAV +K+QG CGSCWAFS +AAVEGINQIV+G+L SLSEQEL+DCDTS+N GCNGGLMD
Sbjct: 139 GAVAEIKDQGGCGSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDTSYNEGCNGGLMD 198
Query: 203 YAFKYIVASGGLHKEEDYPYLMEEGTCEDKK------------EEMEVVTISGYQDVPEN 250
YAF +I+ +GG+ E+DYPY ++ C+ + + +VVTI Y+DV N
Sbjct: 199 YAFDFIINNGGIDTEDDYPYKGKDERCDVNRVSFVFFAPLVFQKNAKVVTIDSYEDVTPN 258
Query: 251 DEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYII 310
E SL KA+A+QPVSVAIEA G FQ YS G+FTG CG LDHGVAAVGYG G DY I
Sbjct: 259 SETSLQKAVANQPVSVAIEAGGRAFQLYSSGIFTGKCGTALDHGVAAVGYGTENGKDYWI 318
Query: 311 VKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLKK 350
V+NSWG WGE GY+RM+RN G CGI S PLKK
Sbjct: 319 VRNSWGKSWGESGYVRMERNIKASSGKCGIAVEPSYPLKK 358
>gi|255540425|ref|XP_002511277.1| cysteine protease, putative [Ricinus communis]
gi|46395620|sp|O65039.1|CYSEP_RICCO RecName: Full=Vignain; AltName: Full=Cysteine endopeptidase; Flags:
Precursor
gi|2944446|gb|AAC62396.1| cysteine endopeptidase precursor [Ricinus communis]
gi|223550392|gb|EEF51879.1| cysteine protease, putative [Ricinus communis]
Length = 360
Score = 360 bits (923), Expect = 8e-97, Method: Compositional matrix adjust.
Identities = 178/309 (57%), Positives = 216/309 (69%), Gaps = 6/309 (1%)
Query: 47 LFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKN 106
L+E W S H + + EK RF +FK N H+ NK Y L LN+FADM++ EF+N
Sbjct: 37 LYERWRSHH-TVSRSLHEKQKRFNVFKHNAMHVHNANKMDKPYKLKLNKFADMTNHEFRN 95
Query: 107 KYLGLKPQ----FPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFST 162
Y G K + F + + F Y V +P SVDWRKKGAVT VK+QG CGSCWAFST
Sbjct: 96 TYSGSKVKHHRMFRGGPRGNGTFMYEKVDTVPASVDWRKKGAVTSVKDQGQCGSCWAFST 155
Query: 163 VAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPY 222
+ AVEGINQI + L SLSEQEL+DCDT N GCNGGLMDYAF++I GG+ E +YPY
Sbjct: 156 IVAVEGINQIKTNKLVSLSEQELVDCDTDQNQGCNGGLMDYAFEFIKQRGGITTEANYPY 215
Query: 223 LMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGV 282
+GTC+ KE V+I G+++VPENDE +LLKA+A+QPVSVAI+A G+DFQFYS GV
Sbjct: 216 EAYDGTCDVSKENAPAVSIDGHENVPENDENALLKAVANQPVSVAIDAGGSDFQFYSEGV 275
Query: 283 FTGPCGAELDHGVAAVGYGKS-KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGIN 341
FTG CG ELDHGVA VGYG + G+ Y VKNSWGP+WGE+GYIRM+R EGLCGI
Sbjct: 276 FTGSCGTELDHGVAIVGYGTTIDGTKYWTVKNSWGPEWGEKGYIRMERGISDKEGLCGIA 335
Query: 342 KMASIPLKK 350
AS P+KK
Sbjct: 336 MEASYPIKK 344
>gi|535473|emb|CAA53377.1| cysteine protease [Vicia sativa]
Length = 368
Score = 360 bits (923), Expect = 8e-97, Method: Compositional matrix adjust.
Identities = 166/315 (52%), Positives = 220/315 (69%), Gaps = 5/315 (1%)
Query: 40 SMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADM 99
S D+++ ++E W+ KH K Y + EK RF+IFK+NL ID+ N + +Y +GLN+FADM
Sbjct: 31 SNDEVMTMYEEWLVKHQKVYNGLREKDQRFQIFKDNLNFIDEHNAQNYTYIVGLNKFADM 90
Query: 100 SHEEFKNKYLGLKPQFPTR----RQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCG 155
++EE+++ YLG + R + ++Y LP VDWR KGA+T +K+QGSCG
Sbjct: 91 TNEEYRDMYLGTRSDIKRRIMKNKITGHRYAYNSGDRLPVHVDWRLKGAITHIKDQGSCG 150
Query: 156 SCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLH 215
SCWAFST+A VE IN+IV+G L SLSEQEL+DCD +FN GCNGGLMDYAF++I+ +GG+
Sbjct: 151 SCWAFSTIATVEAINKIVTGKLVSLSEQELVDCDRAFNEGCNGGLMDYAFEFIIGNGGID 210
Query: 216 KEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDF 275
++ YPY EG C+ +++ ++V+I GY+DVP N+E +L KA+AHQPVSVAIEASG
Sbjct: 211 TDQHYPYKGFEGRCDPTRKKAKIVSIDGYEDVPSNNENALKKAVAHQPVSVAIEASGRAL 270
Query: 276 QFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNT-GKP 334
Q Y GVFTG CG LDH V VGYG G DY +V+NSWG WGE GY +M+RN G
Sbjct: 271 QLYQSGVFTGKCGTSLDHAVVIVGYGSENGLDYWLVRNSWGTNWGEDGYFKMERNVKGTH 330
Query: 335 EGLCGINKMASIPLK 349
G CGI AS P+K
Sbjct: 331 TGKCGIAVEASYPVK 345
>gi|544129|sp|P25803.2|CYSEP_PHAVU RecName: Full=Vignain; AltName: Full=Bean endopeptidase; AltName:
Full=Cysteine proteinase EP-C1; Flags: Precursor
gi|20994|emb|CAA44816.1| endopeptidase [Phaseolus vulgaris]
Length = 362
Score = 360 bits (923), Expect = 8e-97, Method: Compositional matrix adjust.
Identities = 185/347 (53%), Positives = 234/347 (67%), Gaps = 11/347 (3%)
Query: 8 KLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLH 67
KLL + LS SL + + DF + L S + L +L+E W S H + + EK
Sbjct: 5 KLLWVVLSFSLVLGVANSFDFH-----DKDLASEESLWDLYERWRSHH-TVSRSLGEKHK 58
Query: 68 RFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPT--RRQP--SA 123
RF +FK NL H+ NK Y L LN+FADM++ EF++ Y G K P R P +
Sbjct: 59 RFNVFKANLMHVHNTNKMDKPYKLKLNKFADMTNHEFRSTYAGSKVNHPRMFRGTPHENG 118
Query: 124 EFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQ 183
F Y V ++P SVDWRKKGAVT VK+QG CGSCWAFSTV AVEGINQI + L +LSEQ
Sbjct: 119 AFMYEKVVSVPPSVDWRKKGAVTDVKDQGQCGSCWAFSTVVAVEGINQIKTNKLVALSEQ 178
Query: 184 ELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISG 243
EL+DCD N GCNGGLM+ AF++I GG+ E +YPY +EGTC+ K V+I G
Sbjct: 179 ELVDCDKEENQGCNGGLMESAFEFIKQKGGITTESNYPYKAQEGTCDASKVNDLAVSIDG 238
Query: 244 YQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKS 303
+++VP NDE +LLKA+A+QPVSVAI+A G+DFQFYS GVFTG C +L+HGVA VGYG +
Sbjct: 239 HENVPANDEDALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGDCSTDLNHGVAIVGYGTT 298
Query: 304 -KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
G++Y IV+NSWGP+WGE GYIRM+RN K EGLCGI + S P+K
Sbjct: 299 VDGTNYWIVRNSWGPEWGEHGYIRMQRNISKKEGLCGIAMLPSYPIK 345
>gi|118158|sp|P12412.1|CYSEP_VIGMU RecName: Full=Vignain; AltName: Full=Bean endopeptidase; AltName:
Full=Cysteine proteinase; AltName:
Full=Sulfhydryl-endopeptidase; Short=SH-EP; Contains:
RecName: Full=Vignain-1; Contains: RecName:
Full=Vignain-2; Flags: Precursor
gi|22062|emb|CAA33753.1| sulfhydryl-pre-endopeptidase (AA -20 to 342) [Vigna mungo]
gi|22066|emb|CAA36181.1| sulfhydryl-endopeptidase [Vigna mungo]
Length = 362
Score = 359 bits (921), Expect = 1e-96, Method: Compositional matrix adjust.
Identities = 176/322 (54%), Positives = 225/322 (69%), Gaps = 6/322 (1%)
Query: 33 YSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLG 92
+ + L S + L +L+E W S H + + EK RF +FK N+ H+ NK Y L
Sbjct: 25 FHEKDLESEESLWDLYERWRSHH-TVSRSLGEKHKRFNVFKANVMHVHNTNKMDKPYKLK 83
Query: 93 LNEFADMSHEEFKNKYLGLK----PQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPV 148
LN+FADM++ EF++ Y G K F + S F Y V ++P SVDWRKKGAVT V
Sbjct: 84 LNKFADMTNHEFRSTYAGSKVNHHKMFRGSQHGSGTFMYEKVGSVPASVDWRKKGAVTDV 143
Query: 149 KNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYI 208
K+QG CGSCWAFST+ AVEGINQI + L SLSEQEL+DCD N GCNGGLM+ AF++I
Sbjct: 144 KDQGQCGSCWAFSTIVAVEGINQIKTNKLVSLSEQELVDCDKEENQGCNGGLMESAFEFI 203
Query: 209 VASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAI 268
GG+ E +YPY +EGTC++ K V+I G+++VP NDE +LLKA+A+QPVSVAI
Sbjct: 204 KQKGGITTESNYPYTAQEGTCDESKVNDLAVSIDGHENVPVNDENALLKAVANQPVSVAI 263
Query: 269 EASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKS-KGSDYIIVKNSWGPKWGERGYIRM 327
+A G+DFQFYS GVFTG C +L+HGVA VGYG + G++Y IV+NSWGP+WGE+GYIRM
Sbjct: 264 DAGGSDFQFYSEGVFTGDCNTDLNHGVAIVGYGTTVDGTNYWIVRNSWGPEWGEQGYIRM 323
Query: 328 KRNTGKPEGLCGINKMASIPLK 349
+RN K EGLCGI MAS P+K
Sbjct: 324 QRNISKKEGLCGIAMMASYPIK 345
>gi|172052260|gb|ACB70409.1| cysteine protease [Nicotiana tabacum]
Length = 361
Score = 359 bits (921), Expect = 1e-96, Method: Compositional matrix adjust.
Identities = 178/309 (57%), Positives = 222/309 (71%), Gaps = 6/309 (1%)
Query: 46 ELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFK 105
EL+E W S H + ++EK RF +FK N+ ++ NK+ Y L LN+FADM++ EF+
Sbjct: 36 ELYERWRSHH-TVSRSLDEKDKRFNVFKANVHYVHNFNKKDKPYKLKLNKFADMTNHEFR 94
Query: 106 NKYLGLKPQ----FPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFS 161
+ Y G K + F + + F Y ++P +VDWRKKGAVTPVK+QG CGSCWAFS
Sbjct: 95 HHYAGSKIKHHRTFLGASRANGTFMYAHEDSVPPTVDWRKKGAVTPVKDQGKCGSCWAFS 154
Query: 162 TVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYP 221
TV AVEGINQI + L SLSEQEL+DCDTS N GCNGGLMD AF++I GG++ EE+YP
Sbjct: 155 TVVAVEGINQIKTNELVSLSEQELVDCDTSQNQGCNGGLMDMAFEFIKKKGGINTEENYP 214
Query: 222 YLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGG 281
Y+ E G C+ +K VV+I G++DVP NDE SLLKA+A+QPVSVAI+ASG+DFQFYS G
Sbjct: 215 YMAEGGECDIQKRNSPVVSIDGHEDVPPNDEGSLLKAVANQPVSVAIQASGSDFQFYSEG 274
Query: 282 VFTGPCGAELDHGVAAVGYGKS-KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGI 340
VFTG CG ELDHGVA VGYG + + Y IVKNSWGP+WGE+GYIRM+R EGLCGI
Sbjct: 275 VFTGDCGTELDHGVAIVGYGTTLDRTKYWIVKNSWGPEWGEKGYIRMQREIDAEEGLCGI 334
Query: 341 NKMASIPLK 349
S P+K
Sbjct: 335 AMQPSYPIK 343
>gi|357465603|ref|XP_003603086.1| Cysteine proteinase [Medicago truncatula]
gi|355492134|gb|AES73337.1| Cysteine proteinase [Medicago truncatula]
Length = 474
Score = 358 bits (920), Expect = 2e-96, Method: Compositional matrix adjust.
Identities = 181/359 (50%), Positives = 239/359 (66%), Gaps = 14/359 (3%)
Query: 5 SHSKLLLLSLSLSLFACSSLAHDFSIVGYSPEH-----LTSMDKLIELFESWMSKHGKTY 59
+ S +L++ + +LF ++ A D SI+ Y H S ++ ++E W KHGK
Sbjct: 6 NRSPMLVILIVFTLFT-ATFALDMSIISYDKTHSDKSSRRSDKEVKNIYEEWRVKHGKLN 64
Query: 60 KCIE--EKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQ--- 114
I+ EK RFEIFK+NLK ID+ N E +Y +GLN FAD+S+EE++++YLG K
Sbjct: 65 NNIDGSEKDKRFEIFKDNLKFIDEHNAENRTYKVGLNRFADLSNEEYRSRYLGTKIDPIG 124
Query: 115 --FPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQI 172
+ S ++ LPKSVDWR +GAV VK+QGSCGSCWAFST+AAVEGIN+I
Sbjct: 125 MMMARTKTRSNRYAPSVGDKLPKSVDWRSQGAVVQVKDQGSCGSCWAFSTIAAVEGINKI 184
Query: 173 VSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDK 232
V+G L SLSEQEL+DCD + N GC+GGLM+YAF++I+ +GG+ +EDYPY +G C+
Sbjct: 185 VTGELVSLSEQELVDCDRTVNAGCDGGLMEYAFEFIINNGGIDSDEDYPYRGVDGKCDQY 244
Query: 233 KEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELD 292
K+ VV+I Y+ VP DE +L KA+A+QP+SVAIEA G +FQ Y G+FTG CG LD
Sbjct: 245 KKNARVVSIDDYEQVPAYDELALKKAVANQPISVAIEAGGREFQLYVSGIFTGKCGTALD 304
Query: 293 HGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRN-TGKPEGLCGINKMASIPLKK 350
HGV AVGYG G DY IV+NSWG WGE GY+RM+RN G CGI +S P+KK
Sbjct: 305 HGVTAVGYGTENGVDYWIVRNSWGKSWGESGYVRMERNLAASVAGKCGIVMQSSYPIKK 363
>gi|445927|prf||1910332A Cys endopeptidase
Length = 362
Score = 358 bits (919), Expect = 2e-96, Method: Compositional matrix adjust.
Identities = 176/322 (54%), Positives = 225/322 (69%), Gaps = 6/322 (1%)
Query: 33 YSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLG 92
+ + L S + L +L+E W S H + + EK RF +FK N+ H+ NK Y L
Sbjct: 25 FHEKDLESEESLWDLYERWRSHH-TVSRSLGEKHKRFNVFKANVMHVHNTNKMDKPYKLK 83
Query: 93 LNEFADMSHEEFKNKYLGLK----PQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPV 148
LN+FADM++ EF++ Y G K F + S F Y V ++P SVDWRKKGAVT V
Sbjct: 84 LNKFADMTNHEFRSTYAGSKVNHHKMFRGSQHGSGTFMYEKVGSVPASVDWRKKGAVTDV 143
Query: 149 KNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYI 208
K+QG CGSCWAFST+ AVEGINQI + L SLSEQEL+DCD N GCNGGLM+ AF++I
Sbjct: 144 KDQGQCGSCWAFSTIVAVEGINQIKTNKLVSLSEQELVDCDKEENQGCNGGLMESAFEFI 203
Query: 209 VASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAI 268
GG+ E +YPY +EGTC++ K V+I G+++VP NDE +LLKA+A+QPVSVAI
Sbjct: 204 KQKGGITTESNYPYKAQEGTCDESKVNDLAVSIDGHENVPVNDENALLKAVANQPVSVAI 263
Query: 269 EASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKS-KGSDYIIVKNSWGPKWGERGYIRM 327
+A G+DFQFYS GVFTG C +L+HGVA VGYG + G++Y IV+NSWGP+WGE+GYIRM
Sbjct: 264 DAGGSDFQFYSEGVFTGDCNTDLNHGVAIVGYGTTVDGTNYWIVRNSWGPEWGEQGYIRM 323
Query: 328 KRNTGKPEGLCGINKMASIPLK 349
+RN K EGLCGI MAS P+K
Sbjct: 324 QRNISKKEGLCGIAMMASYPIK 345
>gi|1223922|gb|AAA92063.1| cysteinyl endopeptidase [Vigna radiata]
Length = 362
Score = 358 bits (919), Expect = 2e-96, Method: Compositional matrix adjust.
Identities = 176/322 (54%), Positives = 224/322 (69%), Gaps = 6/322 (1%)
Query: 33 YSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLG 92
+ + L S + L +L+E W S H + + EK RF +FKEN+ H+ NK Y L
Sbjct: 25 FHEKDLASEESLWDLYERWRSHH-TVSRSLTEKHKRFNVFKENVMHVHNTNKMDKPYKLK 83
Query: 93 LNEFADMSHEEFKNKYLGLK----PQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPV 148
LN+FADM++ EF++ Y G K F + + F Y V ++P SVDWRKKGAVT V
Sbjct: 84 LNKFADMTNHEFRSTYAGSKVNHHKMFRGTQHGNGTFMYEKVGSVPASVDWRKKGAVTDV 143
Query: 149 KNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYI 208
K+QG CGSCWAFSTV AVEGINQI + L SLSEQEL+DCD N GCNGGLM+ AF++I
Sbjct: 144 KDQGQCGSCWAFSTVVAVEGINQIKTDKLVSLSEQELVDCDKEENQGCNGGLMESAFEFI 203
Query: 209 VASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAI 268
GG+ E +YPY +EGTC+ K V+I G+++VP NDE +LLKA+A+QPVSVAI
Sbjct: 204 KQKGGITTESNYPYTAQEGTCDASKVNDLAVSIDGHENVPVNDENALLKAVANQPVSVAI 263
Query: 269 EASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKS-KGSDYIIVKNSWGPKWGERGYIRM 327
+A G+DFQFYS GV TG C +L+HGVA VGYG + G++Y IV+NSWGP+WGE+GYIRM
Sbjct: 264 DAGGSDFQFYSEGVLTGDCNTDLNHGVAIVGYGTTVDGTNYWIVRNSWGPEWGEQGYIRM 323
Query: 328 KRNTGKPEGLCGINKMASIPLK 349
+RN K EGLCGI MAS P+K
Sbjct: 324 QRNISKKEGLCGIAMMASYPIK 345
>gi|351721126|ref|NP_001237199.1| cysteine proteinase precursor [Glycine max]
gi|31559530|dbj|BAC77523.1| cysteine proteinase [Glycine max]
gi|31559532|dbj|BAC77524.1| cysteine proteinase [Glycine max]
Length = 362
Score = 358 bits (918), Expect = 3e-96, Method: Compositional matrix adjust.
Identities = 184/347 (53%), Positives = 233/347 (67%), Gaps = 11/347 (3%)
Query: 8 KLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLH 67
K L + LSLSL + + DF + L S + L +L+E W S H + + +K
Sbjct: 5 KFLWVVLSLSLVLGVANSFDFH-----DKDLESEESLWDLYERWRSHH-TVSRSLGDKHK 58
Query: 68 RFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQ----FPTRRQPSA 123
RF +FK N+ H+ NK Y L LN+FADM++ EF++ Y G K F + +
Sbjct: 59 RFNVFKANMMHVHNTNKMDKPYKLKLNKFADMTNHEFRSTYAGSKVNHHRMFRDMPRGNG 118
Query: 124 EFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQ 183
F Y V ++P SVDWRKKGAVT VK+QG CGSCWAFSTV AVEGINQI + L SLSEQ
Sbjct: 119 TFMYEKVGSVPASVDWRKKGAVTDVKDQGHCGSCWAFSTVVAVEGINQIKTNKLVSLSEQ 178
Query: 184 ELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISG 243
EL+DCDT N GCNGGLM+ AF++I GG+ E YPY ++GTC+ K V+I G
Sbjct: 179 ELVDCDTEENAGCNGGLMESAFQFIKQKGGITTESYYPYTAQDGTCDASKANDLAVSIDG 238
Query: 244 YQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKS 303
+++VP NDE +LLKA+A+QPVSVAI+A G+DFQFYS GVFTG C EL+HGVA VGYG +
Sbjct: 239 HENVPGNDENALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGDCSTELNHGVAIVGYGAT 298
Query: 304 -KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
G+ Y IV+NSWGP+WGE GYIRM+RN K EGLCGI +AS P+K
Sbjct: 299 VDGTSYWIVRNSWGPEWGELGYIRMQRNISKKEGLCGIAMLASYPIK 345
>gi|242081867|ref|XP_002445702.1| hypothetical protein SORBIDRAFT_07g024430 [Sorghum bicolor]
gi|241942052|gb|EES15197.1| hypothetical protein SORBIDRAFT_07g024430 [Sorghum bicolor]
Length = 372
Score = 358 bits (918), Expect = 3e-96, Method: Compositional matrix adjust.
Identities = 181/327 (55%), Positives = 220/327 (67%), Gaps = 9/327 (2%)
Query: 29 SIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTS 88
S V + E L S + L L+E W +H + + +K RF +FKEN++ I N+
Sbjct: 28 SAVEFGAEDLASEEALWALYERWRGRHA-VARDLGDKARRFNVFKENVRLIHDFNQRDEP 86
Query: 89 YWLGLNEFADMSHEEFKNKYLGLK----PQFPTRRQPSAE-FSYRDVKALPKSVDWRKKG 143
Y L LN F DM+ +EF+ Y G + F RQ SA F Y + LP SVDWR+KG
Sbjct: 87 YKLRLNRFGDMTADEFRRHYAGSRVAHHRMFRGDRQGSASSFMYAGARDLPTSVDWRQKG 146
Query: 144 AVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDY 203
AVT VK+QG CGSCWAFST+AAVEGIN I + NLTSLSEQ+L+DCDT N GC+GGLMDY
Sbjct: 147 AVTDVKDQGQCGSCWAFSTIAAVEGINAIKTKNLTSLSEQQLVDCDTKGNAGCDGGLMDY 206
Query: 204 AFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQP 263
AF+YI GG+ E+ YPY + +C KK VTI GY+DVP NDE +L KA+AHQP
Sbjct: 207 AFQYIAKHGGVAAEDAYPYKARQASC--KKSPAPAVTIDGYEDVPANDESALKKAVAHQP 264
Query: 264 VSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYG-KSKGSDYIIVKNSWGPKWGER 322
VSVAIEASG+ FQFYS GVF G CG ELDHGV AVGYG + G+ Y +VKNSWGP+WGE+
Sbjct: 265 VSVAIEASGSHFQFYSEGVFAGRCGTELDHGVTAVGYGVAADGTKYWVVKNSWGPEWGEK 324
Query: 323 GYIRMKRNTGKPEGLCGINKMASIPLK 349
GYIRM R+ EG CGI AS P+K
Sbjct: 325 GYIRMARDVAAKEGHCGIAMEASYPVK 351
>gi|449525012|ref|XP_004169515.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
Length = 459
Score = 357 bits (916), Expect = 5e-96, Method: Compositional matrix adjust.
Identities = 179/341 (52%), Positives = 235/341 (68%), Gaps = 7/341 (2%)
Query: 11 LLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCI-EEKLHRF 69
+++L LF S A SI+ P+ + D+++ L++ W +KHGK + + E +RF
Sbjct: 9 IMALLFFLFIALSAASPSSII---PQR--TDDEVMALYDQWRAKHGKLHNNLGAEPENRF 63
Query: 70 EIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRR-QPSAEFSYR 128
IFK+NLK ID+ N + Y LGLN FAD+++EE++++YLG K +RR + S + R
Sbjct: 64 HIFKDNLKFIDEINAQNLPYRLGLNVFADLTNEEYRSRYLGGKFASGSRRNRTSNRYLPR 123
Query: 129 DVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDC 188
LP S+DWR KGAV PVK+QGSCGSCWAFSTVA+VE INQIV+G+L +LSEQEL+DC
Sbjct: 124 LGDDLPDSIDWRAKGAVAPVKDQGSCGSCWAFSTVASVEAINQIVTGDLIALSEQELVDC 183
Query: 189 DTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVP 248
D S+N GCNGGLMDYAF++I+ +GGL EEDYPY + +C K+ +VV I Y+DVP
Sbjct: 184 DRSYNEGCNGGLMDYAFEFIIENGGLDTEEDYPYYGFDSSCIQYKKNAKVVAIDSYEDVP 243
Query: 249 ENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDY 308
N+E++L KA++ Q VSVAIE G FQ Y G+FTG CG +LDHGV VGYG G DY
Sbjct: 244 VNNEKALQKAVSKQVVSVAIEGGGRSFQLYQSGIFTGRCGTDLDHGVNVVGYGSEGGVDY 303
Query: 309 IIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
IV+NSWG WGE GY++M+RN P GLCGI S P K
Sbjct: 304 WIVRNSWGGSWGESGYVKMQRNIASPTGLCGIAMEPSYPTK 344
>gi|1345573|emb|CAA40073.1| endopeptidase (EP-C1) [Phaseolus vulgaris]
Length = 361
Score = 357 bits (916), Expect = 5e-96, Method: Compositional matrix adjust.
Identities = 183/347 (52%), Positives = 232/347 (66%), Gaps = 11/347 (3%)
Query: 8 KLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLH 67
KLL + LS SL + + DF + L S + L +L+E W S H + + EK
Sbjct: 4 KLLWVVLSFSLVLGVANSFDFH-----DKDLASEESLWDLYERWRSHH-TVSRSLGEKHK 57
Query: 68 RFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQ----FPTRRQPSA 123
RF +FK NL H+ NK Y L LN+FADM++ EF++ Y G K F +
Sbjct: 58 RFNVFKANLMHVHNTNKMDKPYKLKLNKFADMTNHEFRSTYAGSKVNHHRMFRGTPHENG 117
Query: 124 EFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQ 183
F Y V ++P SVDWRKKGAVT VK+QG CGSCWAFSTV AVEGINQI + L +LSEQ
Sbjct: 118 AFMYEKVVSVPPSVDWRKKGAVTDVKDQGQCGSCWAFSTVVAVEGINQIKTNKLVALSEQ 177
Query: 184 ELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISG 243
EL+DCD N GCNGGLM+ AF++I GG+ E +YPY +EGTC+ K V+I G
Sbjct: 178 ELVDCDKEENQGCNGGLMESAFEFIKQKGGITTESNYPYKAQEGTCDASKVNDLAVSIDG 237
Query: 244 YQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKS 303
+++VP NDE +LLKA+A+QPVSVAI+A G+DFQFYS GVFTG C +L+HGVA VGYG +
Sbjct: 238 HENVPANDEDALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGDCSTDLNHGVAIVGYGTT 297
Query: 304 -KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
G++Y IV+NSWGP+WGE GYIRM+RN K EGLCGI + S P+K
Sbjct: 298 VDGTNYWIVRNSWGPEWGEHGYIRMQRNISKKEGLCGIAMLPSYPIK 344
>gi|168063167|ref|XP_001783545.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664932|gb|EDQ51634.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 461
Score = 357 bits (915), Expect = 6e-96, Method: Compositional matrix adjust.
Identities = 177/310 (57%), Positives = 219/310 (70%), Gaps = 7/310 (2%)
Query: 44 LIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT-SYWLGLNEFADMSHE 102
L+E F +W KHGK Y E+ LHRF ++K+NL +I R+ E +Y LGL +FAD+++E
Sbjct: 50 LLEQFAAWAHKHGKAYHDAEQCLHRFAVWKDNLAYI--RHSETNRTYSLGLTKFADLTNE 107
Query: 103 EFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFST 162
EF+ Y G + R + F Y D +A P+SVDWRK GAVT VK+QGSCGSCWAFS
Sbjct: 108 EFRRMYTGTRIDRSRRAKRRTGFRYADSEA-PESVDWRKNGAVTSVKDQGSCGSCWAFSA 166
Query: 163 VAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPY 222
V +VEGIN I +G SLSEQEL+DCD +N GCNGGLMDYAF +I+ +GG+ E+DYPY
Sbjct: 167 VGSVEGINAIRNGEAVSLSEQELVDCDLEYNQGCNGGLMDYAFDFIIQNGGIDTEKDYPY 226
Query: 223 LMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGV 282
+G C++ K+ VVTI GY+DVPENDE++L KA+A QPVSVAIEA G DFQ Y+ GV
Sbjct: 227 KGFDGRCDNSKKNAHVVTIDGYEDVPENDEEALKKAVAGQPVSVAIEAGGRDFQLYAQGV 286
Query: 283 FTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRN---TGKPEGLCG 339
F+G CG +LDHGV AVGYG G DY IVKNSWG WGE GY+RMKRN + GLCG
Sbjct: 287 FSGECGTDLDHGVLAVGYGTEDGVDYWIVKNSWGEYWGESGYLRMKRNMKDSNDGPGLCG 346
Query: 340 INKMASIPLK 349
IN S +K
Sbjct: 347 INIEPSYAVK 356
>gi|1709574|sp|P10056.2|PAPA3_CARPA RecName: Full=Caricain; AltName: Full=Papaya peptidase A; AltName:
Full=Papaya proteinase III; Short=PPIII; AltName:
Full=Papaya proteinase omega; Flags: Precursor
gi|18098|emb|CAA46862.1| proteinase omega [Carica papaya]
Length = 348
Score = 356 bits (914), Expect = 8e-96, Method: Compositional matrix adjust.
Identities = 185/345 (53%), Positives = 237/345 (68%), Gaps = 3/345 (0%)
Query: 5 SHSKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEE 64
S SKLL +++ L + S DFSIVGYS + LTS ++LI+LF SWM H K Y+ ++E
Sbjct: 6 SISKLLFVAICLFVHMSVSFG-DFSIVGYSQDDLTSTERLIQLFNSWMLNHNKFYENVDE 64
Query: 65 KLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAE 124
KL+RFEIFK+NL +ID+ NK+ SYWLGLNEFAD+S++EF KY+G + E
Sbjct: 65 KLYRFEIFKDNLNYIDETNKKNNSYWLGLNEFADLSNDEFNEKYVGSLIDATIEQSYDEE 124
Query: 125 FSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQE 184
F D LP++VDWRKKGAVTPV++QGSCGSCWAFS VA VEGIN+I +G L LSEQE
Sbjct: 125 FINEDTVNLPENVDWRKKGAVTPVRHQGSCGSCWAFSAVATVEGINKIRTGKLVELSEQE 184
Query: 185 LIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGY 244
L+DC+ ++GC GG YA +Y VA G+H YPY ++GTC K+ +V SG
Sbjct: 185 LVDCERR-SHGCKGGYPPYALEY-VAKNGIHLRSKYPYKAKQGTCRAKQVGGPIVKTSGV 242
Query: 245 QDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSK 304
V N+E +LL A+A QPVSV +E+ G FQ Y GG+F GPCG ++DH V AVGYGKS
Sbjct: 243 GRVQPNNEGNLLNAIAKQPVSVVVESKGRPFQLYKGGIFEGPCGTKVDHAVTAVGYGKSG 302
Query: 305 GSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
G YI++KNSWG WGE+GYIR+KR G G+CG+ K + P K
Sbjct: 303 GKGYILIKNSWGTAWGEKGYIRIKRAPGNSPGVCGLYKSSYYPTK 347
>gi|359359066|gb|AEV40973.1| putative oryzain beta chain precursor [Oryza punctata]
Length = 461
Score = 356 bits (914), Expect = 9e-96, Method: Compositional matrix adjust.
Identities = 170/332 (51%), Positives = 229/332 (68%), Gaps = 10/332 (3%)
Query: 27 DFSIVGYSPEHLTSMDKLIEL-----FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQ 81
D SI+ Y+ EH + E ++ W++++G++Y + E+ RF +F +NLK +D
Sbjct: 23 DMSIISYNAEHGARGLERTEAEARAAYDLWLAENGRSYNALGERERRFRVFWDNLKFVDA 82
Query: 82 RN---KEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVD 138
N E + LG+N FAD++++EF++ +LG K R + + V+ LP+SVD
Sbjct: 83 HNARADEHGGFRLGMNRFADLTNDEFRSTFLGAK-VVERSRAAGERYRHDGVEELPESVD 141
Query: 139 WRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSF-NNGCN 197
WR+KGAV PVKNQG CGSCWAFS V+ VE INQ+V+G + +LSEQEL++C T+ N+GCN
Sbjct: 142 WREKGAVAPVKNQGQCGSCWAFSAVSTVESINQLVTGEMITLSEQELVECSTNGQNSGCN 201
Query: 198 GGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLK 257
GGLMD AF +I+ +GG+ E+DYPY +G C+ +E +VV+I G++DVP+NDE+SL K
Sbjct: 202 GGLMDDAFDFIIKNGGIDTEDDYPYKAVDGKCDINRENAKVVSIDGFEDVPQNDEKSLQK 261
Query: 258 ALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGP 317
A+AHQPVSVAIEA G +FQ Y GVF+G CG LDHGV AVGYG G DY IV+NSWGP
Sbjct: 262 AVAHQPVSVAIEAGGREFQLYHSGVFSGRCGTSLDHGVVAVGYGTDNGKDYWIVRNSWGP 321
Query: 318 KWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
KWGE GY+RM+RN G CGI MAS P K
Sbjct: 322 KWGESGYVRMERNINATTGKCGIAMMASYPTK 353
>gi|242071345|ref|XP_002450949.1| hypothetical protein SORBIDRAFT_05g021550 [Sorghum bicolor]
gi|241936792|gb|EES09937.1| hypothetical protein SORBIDRAFT_05g021550 [Sorghum bicolor]
Length = 371
Score = 355 bits (911), Expect = 2e-95, Method: Compositional matrix adjust.
Identities = 181/353 (51%), Positives = 236/353 (66%), Gaps = 14/353 (3%)
Query: 9 LLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKC----IEE 64
L+L ++SL+L + A + + ++ + L S + L L+E W S + + ++
Sbjct: 5 LVLAAVSLALLVLAPPAR--AGIPFTEKDLASEESLRALYEQWRSHYMVSRPAGLQEQDD 62
Query: 65 KLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYL-GLKPQF------PT 117
K F +FKEN+++I + NK+ S+ L LN+FADM+ +EF+ Y G + +
Sbjct: 63 KARWFNVFKENVRYIHEANKKGRSFRLALNKFADMTTDEFRRAYAAGSRTRHHRALSSGI 122
Query: 118 RRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNL 177
RR F Y LP +VDWR++GAVT +K+QG CGSCWAFST+AAVEGIN+I +G L
Sbjct: 123 RRHGDGSFMYAQAGNLPLAVDWRQRGAVTGIKDQGQCGSCWAFSTIAAVEGINKIRTGKL 182
Query: 178 TSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEME 237
SLSEQEL+DCD N GCNGGLMDYAF+YI +GG+ E +YPYL E+ +C KE
Sbjct: 183 VSLSEQELVDCDDVDNQGCNGGLMDYAFQYIKRNGGITTESNYPYLAEQRSCNKAKERSH 242
Query: 238 VVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAA 297
VTI GY+DVP N+E +L KA+A+QPVS+AIEASG DFQFYS GVFTG CG ELDHGVAA
Sbjct: 243 DVTIDGYEDVPANNEDALQKAVANQPVSIAIEASGQDFQFYSEGVFTGSCGTELDHGVAA 302
Query: 298 VGYGKSK-GSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
VGYG ++ G+ Y IVKNSWG WGERGYIRM+R +GLCGI S P K
Sbjct: 303 VGYGITRDGTKYWIVKNSWGEDWGERGYIRMQRGISDSQGLCGIAMEPSYPTK 355
>gi|363807062|ref|NP_001242584.1| uncharacterized protein LOC100804015 precursor [Glycine max]
gi|255640677|gb|ACU20623.1| unknown [Glycine max]
Length = 366
Score = 355 bits (911), Expect = 2e-95, Method: Compositional matrix adjust.
Identities = 177/348 (50%), Positives = 227/348 (65%), Gaps = 20/348 (5%)
Query: 9 LLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHR 68
LL LS +LS +S +I+ Y+ + +M +E W+ +H K Y + +K R
Sbjct: 10 LLFLSFTLSYAIKTS-----TIINYTDNEVMAM------YEEWLVRHQKGYNELGKKDKR 58
Query: 69 FEIFKENLKHIDQRNKEVT-SYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAE--- 124
F++FK+NL I + N + +Y LGLN+FADM++EE++ YLG K R +
Sbjct: 59 FQVFKDNLGFIQEHNNNLNNTYKLGLNKFADMTNEEYRAMYLGTKSNAKRRLMKTKSTGH 118
Query: 125 ---FSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLS 181
FS RD LP VDWR KGAV P+K+QGSCGSCWAFSTVA VE IN+IV+G SLS
Sbjct: 119 RYAFSARD--RLPVHVDWRMKGAVAPIKDQGSCGSCWAFSTVATVEAINKIVTGKFVSLS 176
Query: 182 EQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTI 241
EQEL+DCD ++N GCNGGLMDYAF++I+ +GG+ ++DYPY +G C+ K+ +VV I
Sbjct: 177 EQELVDCDRAYNEGCNGGLMDYAFEFIIQNGGIDTDKDYPYRGFDGICDPTKKNAKVVNI 236
Query: 242 SGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYG 301
GY+DVP DE +L KA+AHQPVSVAIEASG Q Y GVFTG CG LDHGV VGYG
Sbjct: 237 DGYEDVPPYDENALKKAVAHQPVSVAIEASGRALQLYQSGVFTGKCGTSLDHGVVVVGYG 296
Query: 302 KSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
G DY +V+NSWG WGE GY +M+RN G CGI AS P+K
Sbjct: 297 SENGVDYWLVRNSWGTGWGEDGYFKMQRNVRTSTGKCGITMEASYPVK 344
>gi|255538788|ref|XP_002510459.1| cysteine protease, putative [Ricinus communis]
gi|223551160|gb|EEF52646.1| cysteine protease, putative [Ricinus communis]
Length = 422
Score = 355 bits (911), Expect = 2e-95, Method: Compositional matrix adjust.
Identities = 169/306 (55%), Positives = 216/306 (70%), Gaps = 2/306 (0%)
Query: 46 ELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKE-VTSYWLGLNEFADMSHEEF 104
+LFESW +HGKTY E+KL+RF+IF+EN + + + N + +SY L LN FAD++H EF
Sbjct: 30 KLFESWTKEHGKTYTSKEDKLYRFKIFEENYEFVKKHNSQGNSSYTLSLNAFADLTHHEF 89
Query: 105 KNKYLGLKPQFPTRRQPSAEFSYRD-VKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTV 163
K LGL + + F D V +P S+DWRKKGAV+ VK+QG+CG+CW+FS
Sbjct: 90 KASRLGLSAFSTSGKLSRRNFPLHDFVGDVPISIDWRKKGAVSQVKDQGNCGACWSFSAT 149
Query: 164 AAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYL 223
A+EGIN+IV+G+L SLSEQEL+DCD S+NNGC GGLMDYA+++++ + G+ EEDYPY
Sbjct: 150 GAIEGINKIVTGSLVSLSEQELVDCDRSYNNGCEGGLMDYAYQFVIENNGIDTEEDYPYQ 209
Query: 224 MEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVF 283
E TC +K + VVTI GY DVP+N+E+ LLKA+A QPVSV I S FQ YS G+F
Sbjct: 210 AREKTCNKEKLKRHVVTIDGYTDVPQNNEKELLKAVAAQPVSVGICGSERAFQLYSKGIF 269
Query: 284 TGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKM 343
TGPC LDH V VGYG G DY IVKNSWG WG GY+ M RN+G +GLCGIN +
Sbjct: 270 TGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGTHWGINGYMYMLRNSGNSQGLCGINML 329
Query: 344 ASIPLK 349
AS P+K
Sbjct: 330 ASFPVK 335
>gi|224081756|ref|XP_002306486.1| predicted protein [Populus trichocarpa]
gi|222855935|gb|EEE93482.1| predicted protein [Populus trichocarpa]
Length = 352
Score = 355 bits (911), Expect = 2e-95, Method: Compositional matrix adjust.
Identities = 168/310 (54%), Positives = 225/310 (72%), Gaps = 5/310 (1%)
Query: 45 IELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEF 104
+ +++ W++KHGK Y + E+ RFEIFK NL+ ID+ N + +Y +GL +FAD+++EE+
Sbjct: 1 MSMYKWWLAKHGKAYNGLGEEAERFEIFKNNLRFIDEHNSQNHTYKVGLTKFADLTNEEY 60
Query: 105 KNKYLGLKPQFPTR----RQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAF 160
+ +LG + R + PS ++++ LP+SVDWR KGAV P+K+QGSCGSCWAF
Sbjct: 61 RAMFLGTRSDAKRRLMKSKSPSERYAFKAGDKLPESVDWRAKGAVNPIKDQGSCGSCWAF 120
Query: 161 STVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDY 220
STVAAVEGINQIV+G L SLSEQEL+DCD ++N GCNGGLMDYAF++I+ +GGL E+DY
Sbjct: 121 STVAAVEGINQIVTGELISLSEQELVDCDRTYNAGCNGGLMDYAFQFIINNGGLDTEKDY 180
Query: 221 PYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSG 280
PY+ ++ C+ K + + V+I G++DV DE++L KA+AHQPVSVAIEASG QFY
Sbjct: 181 PYVGDDDKCDKDKMKTKAVSIDGFEDVLPYDEKALQKAVAHQPVSVAIEASGMALQFYQS 240
Query: 281 GVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKP-EGLCG 339
GVFTG CG LDHGV VGY G DY +V+NSWG +WGE GYI+M+RN G G CG
Sbjct: 241 GVFTGECGTALDHGVVVVGYASENGLDYWLVRNSWGTEWGEHGYIKMQRNVGDTYTGRCG 300
Query: 340 INKMASIPLK 349
I +S P+K
Sbjct: 301 IAMESSYPVK 310
>gi|18402225|ref|NP_566633.1| Granulin repeat cysteine protease family protein [Arabidopsis
thaliana]
gi|11994461|dbj|BAB02463.1| cysteine proteinase [Arabidopsis thaliana]
gi|17065298|gb|AAL32803.1| cysteine proteinase [Arabidopsis thaliana]
gi|20260004|gb|AAM13349.1| cysteine proteinase [Arabidopsis thaliana]
gi|332642713|gb|AEE76234.1| Granulin repeat cysteine protease family protein [Arabidopsis
thaliana]
Length = 452
Score = 355 bits (910), Expect = 2e-95, Method: Compositional matrix adjust.
Identities = 173/340 (50%), Positives = 230/340 (67%), Gaps = 3/340 (0%)
Query: 13 SLSLSLFACSSLAHDFSIVGYSPEHLTSMD-KLIELFESWMSKHGKTYKCIEEKLHRFEI 71
S++L+L S L S+ + T + + ++E W+ ++ K Y + EK RFEI
Sbjct: 7 SITLALLIFSVLLISLSLGSVTATETTRNEAEARRMYERWLVENRKNYNGLGEKERRFEI 66
Query: 72 FKENLKHIDQRNK-EVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDV 130
FK+NLK +++ + +Y +GL FAD++++EF+ YL K + ++ Y+
Sbjct: 67 FKDNLKFVEEHSSIPNRTYEVGLTRFADLTNDEFRAIYLRSKMERTRVPVKGEKYLYKVG 126
Query: 131 KALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDT 190
+LP ++DWR KGAV PVK+QGSCGSCWAFS + AVEGINQI +G L SLSEQEL+DCDT
Sbjct: 127 DSLPDAIDWRAKGAVNPVKDQGSCGSCWAFSAIGAVEGINQIKTGELISLSEQELVDCDT 186
Query: 191 SFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEE-GTCEDKKEEMEVVTISGYQDVPE 249
S+N+GC GGLMDYAFK+I+ +GG+ EEDYPY+ + C K+ VVTI GY+DVP+
Sbjct: 187 SYNDGCGGGLMDYAFKFIIENGGIDTEEDYPYIATDVNVCNSDKKNTRVVTIDGYEDVPQ 246
Query: 250 NDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYI 309
NDE+SL KALA+QP+SVAIEA G FQ Y+ GVFTG CG LDHGV AVGYG G DY
Sbjct: 247 NDEKSLKKALANQPISVAIEAGGRAFQLYTSGVFTGTCGTSLDHGVVAVGYGSEGGQDYW 306
Query: 310 IVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
IV+NSWG WGE GY +++RN + G CG+ MAS P K
Sbjct: 307 IVRNSWGSNWGESGYFKLERNIKESSGKCGVAMMASYPTK 346
>gi|356563584|ref|XP_003550041.1| PREDICTED: cysteine proteinase RD21a-like [Glycine max]
Length = 366
Score = 355 bits (910), Expect = 2e-95, Method: Compositional matrix adjust.
Identities = 174/348 (50%), Positives = 224/348 (64%), Gaps = 16/348 (4%)
Query: 7 SKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKL 66
S LL LS +LS +S +I Y+ + +M +E W+ KH K Y + EK
Sbjct: 10 STLLFLSFTLSCAIDTS-----TITNYTDNEVMTM------YEEWLVKHQKVYNGLGEKD 58
Query: 67 HRFEIFKENLKHI-DQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTR----RQP 121
RF++FK+NL I + N + +Y LGLN+FADM++EE++ Y G K R +
Sbjct: 59 KRFQVFKDNLGFIQEHNNNQNNTYKLGLNKFADMTNEEYRVMYFGTKSDAKRRLMKTKST 118
Query: 122 SAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLS 181
++Y LP VDWR KGAV P+K+QGSCGSCWAFSTVA VE IN+IV+G SLS
Sbjct: 119 GHRYAYSAGDQLPVHVDWRVKGAVAPIKDQGSCGSCWAFSTVATVEAINKIVTGKFVSLS 178
Query: 182 EQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTI 241
EQEL+DCD ++N GCNGGLMDYAF++I+ +GG+ ++DYPY +G C+ K+ + V I
Sbjct: 179 EQELVDCDRAYNQGCNGGLMDYAFEFIIQNGGIDTDKDYPYRGFDGICDPTKKNAKAVNI 238
Query: 242 SGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYG 301
GY+DVP DE +L KA+A QPVS+AIEASG Q Y GVFTG CG LDHGV VGYG
Sbjct: 239 DGYEDVPPYDENALKKAVARQPVSIAIEASGRALQLYQSGVFTGECGTSLDHGVVVVGYG 298
Query: 302 KSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
G DY +V+NSWG WGE GY +M+RN P G CGI AS P+K
Sbjct: 299 SENGVDYWLVRNSWGTGWGEDGYFKMQRNVRTPTGKCGITMEASYPVK 346
>gi|22661|emb|CAA49504.1| papaya proteinase omega [Carica papaya]
Length = 367
Score = 354 bits (908), Expect = 4e-95, Method: Compositional matrix adjust.
Identities = 184/345 (53%), Positives = 238/345 (68%), Gaps = 3/345 (0%)
Query: 5 SHSKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEE 64
S SKLL +++ L + S DFSIVGYS + LTS ++LI+LF SWM H K Y+ ++E
Sbjct: 6 SISKLLFVAICLFVHMSVSFG-DFSIVGYSQDDLTSTERLIQLFNSWMLNHNKFYENVDE 64
Query: 65 KLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAE 124
KL+RFEIFK+NL +ID+ NK+ SY LGLNEFAD+S++EF KY+G + E
Sbjct: 65 KLYRFEIFKDNLNYIDETNKKNNSYRLGLNEFADLSNDEFNEKYVGSLIDATIEQSYDEE 124
Query: 125 FSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQE 184
F D+ LP++VDWRKKGAVTPV++QGSCGSCWAFS VA VEGIN+I +G L LSEQE
Sbjct: 125 FINEDIVNLPENVDWRKKGAVTPVRHQGSCGSCWAFSAVATVEGINKIRTGKLVELSEQE 184
Query: 185 LIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGY 244
L+DC+ ++GC GG YA +Y VA G+H YPY ++GTC K+ +V SG
Sbjct: 185 LVDCERR-SHGCKGGYPPYALEY-VAKNGIHLRSKYPYKAKQGTCRAKQVGGPIVKTSGV 242
Query: 245 QDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSK 304
V N+E +LL A+A QPVSV +E+ G FQ Y GG+F GPCG ++DH V AVGYGKS
Sbjct: 243 GRVQPNNEGNLLNAIAKQPVSVVVESKGRPFQLYKGGIFEGPCGTKVDHAVTAVGYGKSG 302
Query: 305 GSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
G YI++KNSWG WGE+GYIR+KR G G+CG+ K + P+K
Sbjct: 303 GKGYILIKNSWGTAWGEKGYIRIKRAPGNSPGVCGLYKSSYYPIK 347
>gi|224133764|ref|XP_002321655.1| predicted protein [Populus trichocarpa]
gi|222868651|gb|EEF05782.1| predicted protein [Populus trichocarpa]
Length = 360
Score = 353 bits (907), Expect = 5e-95, Method: Compositional matrix adjust.
Identities = 180/348 (51%), Positives = 233/348 (66%), Gaps = 11/348 (3%)
Query: 8 KLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLH 67
KLL ++L L+L + + DF + L S + L +L+E W S H ++EK
Sbjct: 3 KLLFVALYLALVLGFTESFDFH-----EKDLESEESLWDLYEKWRSHH-TVSTSLDEKRK 56
Query: 68 RFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPT--RRQP--SA 123
RF +F+ N+ H+ NK Y L LN+FADM++ EF+ Y K + T R P +
Sbjct: 57 RFNVFRANVLHVHNTNKMDKPYKLKLNKFADMTNHEFRTAYASSKVKHHTMFRGAPLGNG 116
Query: 124 EFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQ 183
F Y ++ +P S+DWRKKGAVTPVK+QG CGSCWAFST+ AVEGIN I + L SLSEQ
Sbjct: 117 SFMYGNIDKVPASIDWRKKGAVTPVKDQGKCGSCWAFSTIVAVEGINFIKTNKLISLSEQ 176
Query: 184 ELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISG 243
EL+DC+T N+GCNGGLMDYAF++I G+ E +YPY ++G C+ K V+I G
Sbjct: 177 ELVDCNTGENHGCNGGLMDYAFEFITKQKGITTEANYPYRAQDGHCDANKANQPAVSIDG 236
Query: 244 YQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKS 303
++DV N+E +LLKA+A+QPVSVAI+A G+DFQFYS GVFTG CG ELDHGVA VGYG +
Sbjct: 237 HEDVLHNNENALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGECGKELDHGVAIVGYGTT 296
Query: 304 -KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLKK 350
G+ Y IV+NSWGP+WGERGYIRM+R GLCGI AS P+KK
Sbjct: 297 VDGTKYWIVRNSWGPEWGERGYIRMQRGISDRRGLCGIAMEASYPIKK 344
>gi|37780047|gb|AAP32196.1| cysteine protease 8 [Trifolium repens]
Length = 343
Score = 353 bits (906), Expect = 6e-95, Method: Compositional matrix adjust.
Identities = 183/340 (53%), Positives = 231/340 (67%), Gaps = 14/340 (4%)
Query: 14 LSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFK 73
+SL+L C L F+I S D + E WMS++GK YK +E+ RF+IF
Sbjct: 10 ISLALVFCLGL---FAIQVTS--RTLQDDSMYERHGQWMSQYGKIYKDHQERETRFKIFT 64
Query: 74 ENLKHIDQRNKEVT-SYWLGLNEFADMSHEEF---KNKYLGLKPQFPTRRQPSAEFSYRD 129
EN+ +++ N + T SY LG+N+FAD+++EEF +NK+ G TR + F Y +
Sbjct: 65 ENVNYVEASNADDTKSYKLGINQFADLTNEEFVASRNKFKGHMCSSITR---TTTFKYEN 121
Query: 130 VKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCD 189
V A+P +VDWRKKGAVTPVKNQG CG CWAFS VAA EGI+++ +G L SLSEQEL+DCD
Sbjct: 122 VSAIPSTVDWRKKGAVTPVKNQGQCGCCWAFSAVAATEGIHKLSTGKLISLSEQELVDCD 181
Query: 190 T-SFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVP 248
T + GC GGLMD AFK+I+ + GL E YPY +GTC K ++ VTI+GY+DVP
Sbjct: 182 TKGVDQGCEGGLMDDAFKFIIQNHGLSTEAQYPYEGVDGTCNANKASVQAVTITGYEDVP 241
Query: 249 ENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSK-GSD 307
N EQ+L KA+A+QP+SVAI+ASG+DFQFY GVFTG CG ELDHGV AVGYG S G+
Sbjct: 242 ANSEQALQKAVANQPISVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVSNDGTK 301
Query: 308 YIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
Y +VKNSWG WGE GYI M+R EGLCGI AS P
Sbjct: 302 YWLVKNSWGTDWGEEGYIMMQRGVEAAEGLCGIAMQASYP 341
>gi|157093728|gb|ABV22590.1| KDEL-tailed cysteine endopeptidase [Solanum lycopersicum]
Length = 360
Score = 353 bits (906), Expect = 6e-95, Method: Compositional matrix adjust.
Identities = 181/347 (52%), Positives = 230/347 (66%), Gaps = 11/347 (3%)
Query: 8 KLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLH 67
KL L+ +L+L + DF + L + +K EL+E W S H + ++EK
Sbjct: 3 KLFLVLFTLALVLRLGESFDFH-----EKELETEEKFWELYERWRSHH-TVSRSLDEKHK 56
Query: 68 RFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTR----RQPSA 123
RF +FK N+ ++ NK+ Y L LN+FADM++ EF+ Y G K + + +
Sbjct: 57 RFNVFKANVHYVHNFNKKDKPYKLKLNKFADMTNHEFRQHYAGSKIKHHRTLLGASRANG 116
Query: 124 EFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQ 183
F Y + +P S+DWRKKGAVTPVK+QG CGSCWAFSTV AVEGINQI + L SLSEQ
Sbjct: 117 TFMYANEDNVPPSIDWRKKGAVTPVKDQGQCGSCWAFSTVVAVEGINQIKTKKLVSLSEQ 176
Query: 184 ELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISG 243
EL+DCDT+ N GCNGGLMD AF +I GG+ EE YPY E+ C+ +K VV+I G
Sbjct: 177 ELVDCDTTENQGCNGGLMDPAFDFIKKRGGITTEERYPYKAEDDKCDIQKRNTPVVSIDG 236
Query: 244 YQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKS 303
++DVP NDE +LLKA+A+QP+SVAI+ASG+ FQFYS GVFTG CG ELDHGVA VGYG +
Sbjct: 237 HEDVPPNDEDALLKAVANQPISVAIDASGSQFQFYSEGVFTGECGTELDHGVAIVGYGTT 296
Query: 304 -KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
G+ Y IVKNSWG WGE+GYIRM+R EGLCGI S P+K
Sbjct: 297 VDGTKYWIVKNSWGAGWGEKGYIRMQRKVDAEEGLCGIAMQPSYPIK 343
>gi|47524507|gb|AAT34987.1| putative cysteine protease [Gossypium hirsutum]
Length = 344
Score = 353 bits (906), Expect = 6e-95, Method: Compositional matrix adjust.
Identities = 190/343 (55%), Positives = 234/343 (68%), Gaps = 13/343 (3%)
Query: 14 LSLSLFACSSLAHDFSI--VGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEE--KLHRF 69
L + LF L+ FSI G S L D+ E WMS+HG+ Y +E K RF
Sbjct: 4 LQIFLFVALVLSFCFSIQLAGLSRPLL---DEDSMRHEEWMSQHGRVYADEQEDHKNKRF 60
Query: 70 EIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPS--AEFSY 127
+FKEN++ I++ N T + L +N+FAD+++EEF+ Y G K Q + F Y
Sbjct: 61 NVFKENVERIEEFNDGKT-FKLAINQFADLTNEEFRASYNGFKGPMVLSSQITKPTPFRY 119
Query: 128 RDVK-ALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELI 186
+V ALP SVDWRKKGAVTPVKNQG CG CWAFS VAA+EGI QI +G L SLSEQEL+
Sbjct: 120 ENVSSALPVSVDWRKKGAVTPVKNQGQCGCCWAFSAVAAIEGITQISTGKLISLSEQELV 179
Query: 187 DCDT-SFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQ 245
DCDT ++GC GGLMD AF++I+ +GGL E +YPY E+GTC K V+I+GY+
Sbjct: 180 DCDTKGIDHGCEGGLMDTAFEFIINNGGLTTESNYPYKGEDGTCNFNKTNPIAVSITGYE 239
Query: 246 DVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSK- 304
DVP NDEQ+L+KA+AHQPVSVAIEA G+DFQFYS GVFTG CG ELDH V AVGYG+S+
Sbjct: 240 DVPANDEQALMKAVAHQPVSVAIEAGGSDFQFYSSGVFTGECGTELDHAVTAVGYGESED 299
Query: 305 GSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
GS Y IVKNSWG KWGE GYI M+++ +GLCGI AS P
Sbjct: 300 GSKYWIVKNSWGTKWGESGYIEMQKDIKVKQGLCGIAMQASYP 342
>gi|334185815|ref|NP_680113.3| putative cysteine proteinase [Arabidopsis thaliana]
gi|75313879|sp|Q9STL4.1|CEP2_ARATH RecName: Full=KDEL-tailed cysteine endopeptidase CEP2; Flags:
Precursor
gi|4678354|emb|CAB41164.1| cysteine endopeptidase-like protein [Arabidopsis thaliana]
gi|332644882|gb|AEE78403.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 361
Score = 353 bits (905), Expect = 8e-95, Method: Compositional matrix adjust.
Identities = 177/347 (51%), Positives = 229/347 (65%), Gaps = 15/347 (4%)
Query: 9 LLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHR 68
+ L SL + AC Y + + S + L L++ W S H + + E+ R
Sbjct: 7 IFLFSLVILQTACG--------FDYDDKEIESEEGLSTLYDRWRSHHS-VPRSLNEREKR 57
Query: 69 FEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKP------QFPTRRQPS 122
F +F+ N+ H+ NK+ SY L LN+FAD++ EFKN Y G Q P R
Sbjct: 58 FNVFRHNVMHVHNTNKKNRSYKLKLNKFADLTINEFKNAYTGSNIKHHRMLQGPKRGSKQ 117
Query: 123 AEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSE 182
+ + ++ LP SVDWRKKGAVT +KNQG CGSCWAFSTVAAVEGIN+I + L SLSE
Sbjct: 118 FMYDHENLSKLPSSVDWRKKGAVTEIKNQGKCGSCWAFSTVAAVEGINKIKTNKLVSLSE 177
Query: 183 QELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTIS 242
QEL+DCDT N GCNGGLM+ AF++I +GG+ E+ YPY +G C+ K+ +VTI
Sbjct: 178 QELVDCDTKQNEGCNGGLMEIAFEFIKKNGGITTEDSYPYEGIDGKCDASKDNGVLVTID 237
Query: 243 GYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGK 302
G++DVPENDE +LLKA+A+QPVSVAI+A +DFQFYS GVFTG CG EL+HGVAAVGYG
Sbjct: 238 GHEDVPENDENALLKAVANQPVSVAIDAGSSDFQFYSEGVFTGSCGTELNHGVAAVGYGS 297
Query: 303 SKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
+G Y IV+NSWG +WGE GYI+++R +PEG CGI AS P+K
Sbjct: 298 ERGKKYWIVRNSWGAEWGEGGYIKIEREIDEPEGRCGIAMEASYPIK 344
>gi|18141281|gb|AAL60578.1|AF454956_1 senescence-associated cysteine protease [Brassica oleracea]
Length = 445
Score = 353 bits (905), Expect = 9e-95, Method: Compositional matrix adjust.
Identities = 172/307 (56%), Positives = 213/307 (69%), Gaps = 3/307 (0%)
Query: 45 IELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT-SYWLGLNEFADMSHEE 103
+++FE W+ ++ K Y + EK RFEIF +NLK + + N SY LGL FAD+++EE
Sbjct: 34 VKMFERWLVENHKNYNGLGEKDKRFEIFMDNLKFVQEHNSVPNQSYELGLTRFADLTNEE 93
Query: 104 FKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTV 163
F+ YL K + S + + LP VDWR KGAV PVK+QGSCGSCWAFS +
Sbjct: 94 FRAIYLRSKMERTRDSVKSERYLHNVGDKLPDEVDWRAKGAVVPVKDQGSCGSCWAFSAI 153
Query: 164 AAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYL 223
AVEGINQI +G L SLSEQEL+DCDTS+NNGC GGLMDYAF++I+++GG+ EEDYPY
Sbjct: 154 GAVEGINQIKTGELVSLSEQELVDCDTSYNNGCGGGLMDYAFQFIISNGGIDTEEDYPYT 213
Query: 224 -MEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGV 282
++ C K+ VVTI GY+DVPEN E SL KALA+QP+SVAIEA G FQ Y GV
Sbjct: 214 ATDDNICNTDKKNTRVVTIDGYEDVPEN-ENSLKKALANQPISVAIEAGGRGFQLYKSGV 272
Query: 283 FTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINK 342
FTG CG LDHGV AVGYG S+G DY I++NSWG WGE GYI+++RN G CG+
Sbjct: 273 FTGTCGTALDHGVVAVGYGTSEGQDYWIIRNSWGSNWGESGYIKLQRNIKDSSGKCGVAM 332
Query: 343 MASIPLK 349
MAS P K
Sbjct: 333 MASYPTK 339
>gi|357471211|ref|XP_003605890.1| Cysteine proteinase [Medicago truncatula]
gi|355506945|gb|AES88087.1| Cysteine proteinase [Medicago truncatula]
Length = 343
Score = 352 bits (904), Expect = 1e-94, Method: Compositional matrix adjust.
Identities = 175/316 (55%), Positives = 223/316 (70%), Gaps = 10/316 (3%)
Query: 39 TSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTS--YWLGLNEF 96
T D + E WMS++GK YK +E+ RF+IF EN+ +I+ NK + Y LG+N+F
Sbjct: 29 TLQDDMYERHRQWMSQYGKVYKDSQEREKRFKIFTENVNYIEAFNKGDNNKLYTLGVNQF 88
Query: 97 ADMSHEEF---KNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGS 153
AD++++EF +NK+ G TR ++ F Y + A+P SVDWRKKGAVTPVKNQG
Sbjct: 89 ADLTNDEFTSSRNKFKGHMCSSITR---TSTFKYENASAIPSSVDWRKKGAVTPVKNQGQ 145
Query: 154 CGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDT-SFNNGCNGGLMDYAFKYIVASG 212
CG CWAFS VAA EGI+++ +G L SLSEQEL+DCDT + GC GGLMD AFK+I+ +
Sbjct: 146 CGCCWAFSAVAATEGIHKLSTGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFIIQNH 205
Query: 213 GLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASG 272
GL+ E +YPY +GTC K + VTI+GY+DVP N+EQ+L KA+A+QP+SVAI+ASG
Sbjct: 206 GLNTEANYPYQGVDGTCNANKGSINAVTITGYEDVPTNNEQALQKAVANQPISVAIDASG 265
Query: 273 TDFQFYSGGVFTGPCGAELDHGVAAVGYGKSK-GSDYIIVKNSWGPKWGERGYIRMKRNT 331
+DFQFY GVFTG CG ELDHGV AVGYG S G+ Y +VKNSWG +WGE GYI M+R
Sbjct: 266 SDFQFYKSGVFTGSCGTELDHGVTAVGYGVSNDGTKYWLVKNSWGTEWGEEGYIMMQRGV 325
Query: 332 GKPEGLCGINKMASIP 347
EGLCGI AS P
Sbjct: 326 DAAEGLCGIAMQASYP 341
>gi|388519351|gb|AFK47737.1| unknown [Medicago truncatula]
Length = 359
Score = 352 bits (904), Expect = 1e-94, Method: Compositional matrix adjust.
Identities = 178/355 (50%), Positives = 232/355 (65%), Gaps = 19/355 (5%)
Query: 1 MAFFSHSKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYK 60
MA + + LL SL ++L SLA D S + S ++++ ++E W+ KH K Y
Sbjct: 1 MASITITSLLFFSL-ITL----SLAMDTS--------MRSNEEVMTMYEEWLVKHHKVYN 47
Query: 61 CIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQ-----F 115
+ EK RFEIFK+NL ID+ N + +Y +GLN+FAD ++EE++N YLG K
Sbjct: 48 GLGEKDQRFEIFKDNLGFIDEHNAQNYTYKVGLNKFADTTNEEYRNMYLGTKNDAKRNVM 107
Query: 116 PTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSG 175
+ +++ LP VDWR KGAV +K+QGSCGSCWAFST+A VE IN+IV+G
Sbjct: 108 KIKITTGHRYAFNSGDRLPVHVDWRSKGAVAHIKDQGSCGSCWAFSTIATVEAINKIVTG 167
Query: 176 NLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEE 235
L SLSEQEL+DCD +FN GCNGGLMDYAF++IV +GG+ E+DYPY EG C+ ++
Sbjct: 168 KLVSLSEQELVDCDRAFNEGCNGGLMDYAFEFIVENGGIDTEQDYPYKGFEGRCDPTRKN 227
Query: 236 MEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGV 295
+VV+I GY+DVP +E +L KA+ HQPVSVAIEA G Q Y GVFTG CG LDHGV
Sbjct: 228 AKVVSIDGYEDVPAYNENALKKAVFHQPVSVAIEAGGRALQLYQSGVFTGRCGTNLDHGV 287
Query: 296 AAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPE-GLCGINKMASIPLK 349
VGYG G DY +V+NSWG WGE GY +++RN K G CGI AS P+K
Sbjct: 288 VVVGYGFENGVDYWLVRNSWGTNWGEDGYFKLERNVKKINTGKCGIAMQASYPVK 342
>gi|359359213|gb|AEV41117.1| putative oryzain beta chain precursor [Oryza officinalis]
Length = 465
Score = 352 bits (904), Expect = 1e-94, Method: Compositional matrix adjust.
Identities = 169/331 (51%), Positives = 225/331 (67%), Gaps = 9/331 (2%)
Query: 27 DFSIVGYSPEHLTSMDKLIEL-----FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQ 81
D SI+ Y+ EH + E ++ W++++G++Y + E RF +F +NL+ D
Sbjct: 28 DMSIISYNAEHGARGLERTEAEARAAYDLWLAENGRSYNALGEHERRFRVFWDNLRFADA 87
Query: 82 RNKEVTS--YWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDW 139
N + LG+N FAD+++EEF+ +LG K R + + V+ LP+SVDW
Sbjct: 88 HNARADDHGFRLGMNRFADLTNEEFRATFLGAK-VVERSRAAGERYRHDGVEELPESVDW 146
Query: 140 RKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSF-NNGCNG 198
R+KGAV PVKNQG CGSCWAFS V+ VE INQ+V+G + +LSEQEL++C T+ N+GCNG
Sbjct: 147 REKGAVAPVKNQGQCGSCWAFSAVSTVESINQLVTGEMITLSEQELVECSTNGQNSGCNG 206
Query: 199 GLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKA 258
GLMD AF +I+ +GG+ E+DYPY +G C+ +E +VV+I G++DVP+NDE+SL KA
Sbjct: 207 GLMDDAFDFIIKNGGIDTEDDYPYKAVDGKCDINRENAKVVSIDGFEDVPQNDEKSLQKA 266
Query: 259 LAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPK 318
+AHQPVSVAIEA G +FQ Y GVF+G CG LDHGV AVGYG G DY IV+NSWGPK
Sbjct: 267 VAHQPVSVAIEAGGREFQLYHSGVFSGRCGTSLDHGVVAVGYGTDNGKDYWIVRNSWGPK 326
Query: 319 WGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
WGE GY+RM+RN G CGI MAS P K
Sbjct: 327 WGESGYVRMERNINVTTGKCGIAMMASYPTK 357
>gi|225456820|ref|XP_002278323.1| PREDICTED: vignain [Vitis vinifera]
Length = 360
Score = 352 bits (903), Expect = 1e-94, Method: Compositional matrix adjust.
Identities = 175/322 (54%), Positives = 220/322 (68%), Gaps = 6/322 (1%)
Query: 33 YSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLG 92
+ + L + + L L+E W S H + ++EK RF +FKEN+ + + NK+ Y L
Sbjct: 23 FHQKELETEESLWNLYERWRSHH-TVSRSLDEKHKRFNVFKENVNFVHEFNKKDEPYKLK 81
Query: 93 LNEFADMSHEEFKNKYLGLKPQ----FPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPV 148
LN+FADM++ EF++ Y G K F + + F Y VK++P SVDWRKKGAVTP+
Sbjct: 82 LNKFADMTNHEFRSTYAGSKVNHHRMFRGSQHAAGSFMYEKVKSVPPSVDWRKKGAVTPI 141
Query: 149 KNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYI 208
K+QG CGSCWAFSTV AVEGIN I + L SLSEQEL+DCDTS N GCNGGLM YAF++I
Sbjct: 142 KDQGQCGSCWAFSTVVAVEGINHIKTNKLVSLSEQELVDCDTSENQGCNGGLMGYAFEFI 201
Query: 209 VASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAI 268
GG+ E+ YPY E+GTC+ K VV+I G++ VP N+E +LLKA A+QP+SVAI
Sbjct: 202 KEKGGITTEQSYPYTAEDGTCDVSKVNSPVVSIDGHETVPPNNEDALLKAAANQPISVAI 261
Query: 269 EASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKS-KGSDYIIVKNSWGPKWGERGYIRM 327
+A G+ FQFYS GVF G CG +LDHGVA VGYG + G+ Y IVKNSWG WGE GYIRM
Sbjct: 262 DAGGSAFQFYSEGVFAGRCGTDLDHGVAIVGYGTTLDGTKYWIVKNSWGTDWGENGYIRM 321
Query: 328 KRNTGKPEGLCGINKMASIPLK 349
KR EGLCGI AS P+K
Sbjct: 322 KRGISAKEGLCGIAVEASYPIK 343
>gi|357162587|ref|XP_003579458.1| PREDICTED: oryzain beta chain-like [Brachypodium distachyon]
Length = 470
Score = 352 bits (903), Expect = 1e-94, Method: Compositional matrix adjust.
Identities = 174/349 (49%), Positives = 231/349 (66%), Gaps = 14/349 (4%)
Query: 16 LSLFACSSLAHDFSIVGYSPEHLTSMDKLIE-----LFESWMSKHGK-TYKCIEEKLHRF 69
+S F + D SI+ Y+ EH + E ++ W ++HG + E+ RF
Sbjct: 15 VSGFGACAAGPDMSIISYNAEHGARGLERTEAEARAIYGLWRAEHGSGNSNSLGEEERRF 74
Query: 70 EIFKENLKHIDQRNKEVTS----YWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSA-- 123
F +NL+ +D N + + LG+N FAD++++EF+ YLG+K R +
Sbjct: 75 RAFWDNLRFVDAHNARAAAGEEGFRLGMNRFADLTNDEFRAAYLGVKGAGQRRSARAGVG 134
Query: 124 -EFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSE 182
+ + V+ LP++VDWR+KGAV PVKNQG CGSCWAFS V+AVE INQ+V+G L +LSE
Sbjct: 135 ERYRHDGVEELPEAVDWREKGAVAPVKNQGQCGSCWAFSAVSAVESINQLVTGELVTLSE 194
Query: 183 QELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTI 241
QEL++CD + +NGCNGGLMD AF +I+ +GG+ E+DYPY +G C+ + +VV+I
Sbjct: 195 QELVECDINGQSNGCNGGLMDDAFDFIINNGGIDTEDDYPYKALDGKCDINRRNAKVVSI 254
Query: 242 SGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYG 301
G++DVPENDE+SL KA+AHQPVSVAIEA G +FQ Y GVFTG CG ELDHGV AVGYG
Sbjct: 255 DGFEDVPENDEKSLQKAVAHQPVSVAIEAGGREFQLYHSGVFTGRCGTELDHGVVAVGYG 314
Query: 302 KSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLKK 350
G DY IV+NSWGPKWGE GY+RM+RN G CGI M+S P KK
Sbjct: 315 TENGKDYWIVRNSWGPKWGEAGYLRMERNINATTGKCGIAMMSSYPTKK 363
>gi|409190991|gb|AFV30165.1| cysteine proteinase [Lotus japonicus]
Length = 342
Score = 352 bits (903), Expect = 1e-94, Method: Compositional matrix adjust.
Identities = 182/352 (51%), Positives = 236/352 (67%), Gaps = 25/352 (7%)
Query: 3 FFSHSKLLLLSLSLSLFACSSLA-HDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKC 61
F+ S L+L L L F SS D S + E E WM+++G+ YK
Sbjct: 7 FYQVSFALVLCLGLWAFQVSSRTLQDAS--------------MQERHEQWMARYGRVYKD 52
Query: 62 IEEKLHRFEIFKENLKHIDQRNKEVTS-YWLGLNEFADMSHEEF---KNKYLGLKPQFPT 117
++EK RF IFKEN+ +I+ N Y LG+N+FAD+++EEF +NK+ G T
Sbjct: 53 LQEKEKRFSIFKENVNYIEASNNAGDKPYKLGVNQFADLTNEEFIATRNKFKGHMSSSIT 112
Query: 118 RRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNL 177
R + F Y +V A P +VDWR++GAVTPVKNQG+CG CWAFS VAA EGI+++ +GNL
Sbjct: 113 R---TTTFKYENVTA-PSTVDWRQEGAVTPVKNQGTCGCCWAFSAVAATEGIHKLSTGNL 168
Query: 178 TSLSEQELIDCDTS-FNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEM 236
SLSEQEL+DCDTS + GC GGLMD AFK+I+ +GGL+ E YPY +GTC +E
Sbjct: 169 VSLSEQELVDCDTSGADQGCQGGLMDDAFKFIIQNGGLNTEAQYPYQGVDGTCNTNEEAT 228
Query: 237 EVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVA 296
V TI+GY+DVP N+EQ+L +A+A+QP+S+AI+ASG+DFQ Y GVFTG CG +LDHGVA
Sbjct: 229 HVATITGYEDVPSNNEQALQQAVANQPISIAIDASGSDFQNYQSGVFTGSCGTQLDHGVA 288
Query: 297 AVGYGKS-KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
VGYG S G+ Y +VKNSWG WGE GYIRM+R+ PEGLCG+ S P
Sbjct: 289 VVGYGVSDDGTKYWLVKNSWGADWGEEGYIRMQRDVDAPEGLCGLAMQPSYP 340
>gi|595986|gb|AAA79915.1| cysteine proteinase, partial [Dianthus caryophyllus]
Length = 427
Score = 352 bits (903), Expect = 2e-94, Method: Compositional matrix adjust.
Identities = 167/308 (54%), Positives = 220/308 (71%), Gaps = 6/308 (1%)
Query: 48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTS-----YWLGLNEFADMSHE 102
+SW+ KH K Y + EK RF IF++NL+ IDQ N + LGLN+FAD++++
Sbjct: 5 LQSWLVKHRKNYNALGEKEKRFAIFRDNLEFIDQHNNNNNGGGGGEFELGLNKFADLTND 64
Query: 103 EFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFST 162
EF+ Y G+K S ++ ++ LP+SVDWRKKGAV+ VK+QG CGSCWAFS
Sbjct: 65 EFRRIYFGVKRPEKAESVKSDRYAVKEGDELPESVDWRKKGAVSHVKDQGQCGSCWAFSA 124
Query: 163 VAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPY 222
+ AVEGIN+IV+G+L +LSEQEL+DCDTS+N+GC+GGLMDYAF++I+ +GG+ ++DYPY
Sbjct: 125 IGAVEGINKIVTGDLITLSEQELVDCDTSYNSGCDGGLMDYAFRFIINNGGIDTDKDYPY 184
Query: 223 LMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGV 282
+G+C+ ++ +VVTI G +DVP N+E++L KA+AHQPV +AIEA G DFQ Y GV
Sbjct: 185 KATDGSCDSNRKNAKVVTIDGLEDVPANNEKALQKAVAHQPVRLAIEAGGRDFQLYKSGV 244
Query: 283 FTGPCGAELDHGVAAVGYGKS-KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGIN 341
FTG CG LDHGV AVGYG + G DY IV+NSWG WGE GYIRM+RNT G CGI
Sbjct: 245 FTGSCGTSLDHGVVAVGYGTTDDGKDYWIVRNSWGDDWGEDGYIRMERNTESKSGKCGIA 304
Query: 342 KMASIPLK 349
S P+K
Sbjct: 305 IEPSYPVK 312
>gi|297816028|ref|XP_002875897.1| hypothetical protein ARALYDRAFT_347926 [Arabidopsis lyrata subsp.
lyrata]
gi|297321735|gb|EFH52156.1| hypothetical protein ARALYDRAFT_347926 [Arabidopsis lyrata subsp.
lyrata]
Length = 361
Score = 351 bits (901), Expect = 3e-94, Method: Compositional matrix adjust.
Identities = 177/347 (51%), Positives = 230/347 (66%), Gaps = 15/347 (4%)
Query: 9 LLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHR 68
+ L SL + AC Y + + S + L +L++ W S H + + E+ R
Sbjct: 7 IFLFSLVILETACG--------FDYEDKEIESEEGLSKLYDRWRSHH-SVPRSLHEREKR 57
Query: 69 FEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKP------QFPTRRQPS 122
F +F+ N+ H+ NK+ SY L LN+FAD++ EFKN Y G K Q P R
Sbjct: 58 FNVFRHNVMHVHNSNKKNRSYKLKLNKFADLTIHEFKNAYTGSKIKHHRMLQGPKRGSKQ 117
Query: 123 AEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSE 182
+ + +V LP SVDWRKKGAVT +KNQG CGSCWAFSTVAAVEGIN+I + L SLSE
Sbjct: 118 FMYDHENVSKLPSSVDWRKKGAVTEIKNQGKCGSCWAFSTVAAVEGINKIKTNKLVSLSE 177
Query: 183 QELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTIS 242
QEL+DCDT+ N GCNGGLM+ AF++I +GG+ E+ YPY +G C+ K+ +VTI
Sbjct: 178 QELVDCDTNQNEGCNGGLMEIAFEFIKKNGGITTEDSYPYEGIDGKCDASKDNGVLVTID 237
Query: 243 GYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGK 302
G+++VPENDE +LLKA+A+QPVSVAI+A +DFQFYS GVFTG CG EL+HGVA VGYG
Sbjct: 238 GHENVPENDENALLKAVANQPVSVAIDAGSSDFQFYSEGVFTGDCGTELNHGVATVGYGS 297
Query: 303 SKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
G Y IV+NSWG +WGE GYI+++R +PEG CGI AS P+K
Sbjct: 298 QGGKKYWIVRNSWGTEWGEGGYIKIERGIDEPEGRCGIAMEASYPIK 344
>gi|1169186|sp|P43156.1|CYSP_HEMSP RecName: Full=Thiol protease SEN102; Flags: Precursor
gi|396568|emb|CAA52425.1| thiol-protease [Hemerocallis hybrid cultivar]
Length = 360
Score = 351 bits (900), Expect = 4e-94, Method: Compositional matrix adjust.
Identities = 182/343 (53%), Positives = 238/343 (69%), Gaps = 10/343 (2%)
Query: 14 LSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFK 73
++L+L A S L+ SI ++ + L S D L L+E W + H + ++EK RF +FK
Sbjct: 7 IALALVALSFLSIAQSIP-FTEKDLASEDSLWNLYEKWRTHH-TVARDLDEKNRRFNVFK 64
Query: 74 ENLKHIDQRN-KEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQ----PSAEFSYR 128
EN+K I + N K+ Y L LN+F DM+++EF++KY G K Q ++ + F Y
Sbjct: 65 ENVKFIHEFNQKKDAPYKLALNKFGDMTNQEFRSKYAGSKIQHHRSQRGIQKNTGSFMYE 124
Query: 129 DVKALPK-SVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELID 187
+V +LP S+DWR KGAVT VK+QG CGSCWAFST+A+VEGINQI +G L SLSEQEL+D
Sbjct: 125 NVGSLPAASIDWRAKGAVTGVKDQGQCGSCWAFSTIASVEGINQIKTGELVSLSEQELVD 184
Query: 188 CDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDV 247
CDTS+N GCNGGLMDYAF++I G+ E+ YPY ++GTC VV+I G+QDV
Sbjct: 185 CDTSYNEGCNGGLMDYAFEFI-QKNGITTEDSYPYAEQDGTCASNLLNSPVVSIDGHQDV 243
Query: 248 PENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSK-GS 306
P N+E +L++A+A+QP+SV+IEASG FQFYS GVFTG CG ELDHGVA VGYG ++ G+
Sbjct: 244 PANNENALMQAVANQPISVSIEASGYGFQFYSEGVFTGRCGTELDHGVAIVGYGATRDGT 303
Query: 307 DYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
Y IVKNSWG +WGE GYIRM+R G CGI AS P+K
Sbjct: 304 KYWIVKNSWGEEWGESGYIRMQRGISDKRGKCGIAMEASYPIK 346
>gi|225446581|ref|XP_002280246.1| PREDICTED: vignain [Vitis vinifera]
Length = 341
Score = 350 bits (899), Expect = 4e-94, Method: Compositional matrix adjust.
Identities = 183/350 (52%), Positives = 234/350 (66%), Gaps = 14/350 (4%)
Query: 1 MAFFSHSKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYK 60
MA + + + L+L L A +S A S+ H SM E E WM ++G+ YK
Sbjct: 1 MASVNQYQYICLALLFVLAAWASQATARSL------HEASM---YERHEDWMVQYGREYK 51
Query: 61 CIEEKLHRFEIFKENLKHIDQRNKEV-TSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRR 119
+EK R++IFK+N+ I+ NK + SY L +NEFAD+++EEF+ K +
Sbjct: 52 DADEKSKRYKIFKDNVARIESFNKAMDKSYKLSINEFADLTNEEFRASRNRFKAHICSTE 111
Query: 120 QPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTS 179
S F Y +V A+P +VDWRKKGAVTP+K+QG CGSCWAFS VAA+EGI Q+ +G L S
Sbjct: 112 ATS--FKYENVTAVPSTVDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLIS 169
Query: 180 LSEQELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEV 238
LSEQEL+DCDTS + GC+GGLMD AFK+I + GL E +YPY +GTC KK
Sbjct: 170 LSEQELVDCDTSGEDQGCSGGLMDDAFKFIEQNHGLTTEANYPYAGTDGTCNRKKAAHPA 229
Query: 239 VTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAV 298
I+GY+DVP N+E++L KA+AHQP++VAI+ASG++FQFYS GVFTG CG ELDHGVAAV
Sbjct: 230 AKINGYEDVPANNEKALQKAVAHQPIAVAIDASGSEFQFYSSGVFTGQCGTELDHGVAAV 289
Query: 299 GYGKS-KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
GYG S G Y +VKNSW WGE GYIRM+R+ EGLCGI AS P
Sbjct: 290 GYGTSDDGMKYWLVKNSWSTGWGEEGYIRMQRDVTAKEGLCGIAMQASYP 339
>gi|118627554|emb|CAL64936.1| putative cysteine protease 8 [Trifolium pratense]
Length = 344
Score = 350 bits (899), Expect = 5e-94, Method: Compositional matrix adjust.
Identities = 179/341 (52%), Positives = 226/341 (66%), Gaps = 9/341 (2%)
Query: 11 LLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFE 70
L +SL+L C L F+I S D + E WMS++GK YK +E+ RF+
Sbjct: 7 LYHISLALLFCLGL---FAIQVTS--RTLQDDSMYERHGQWMSQYGKIYKDHQERETRFK 61
Query: 71 IFKENLKHIDQRNK--EVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYR 128
IFKEN+ +I+ N + SY LG+N+FAD+++EEF K + + F Y
Sbjct: 62 IFKENVNYIETFNNADDTKSYKLGINQFADLTNEEFIASRNKFKGHMCSSIMRTTSFKYE 121
Query: 129 DVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDC 188
+V +P +VDWRKKGAVTPVKNQG CG CWAFS VAA EGI+++ +G L SLSEQEL+DC
Sbjct: 122 NVSGIPSTVDWRKKGAVTPVKNQGQCGCCWAFSAVAATEGIHKLSTGKLISLSEQELVDC 181
Query: 189 DT-SFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDV 247
DT + GC GGLMD AFK+I+ + GL E YPY +GTC K ++ VTI+GY+DV
Sbjct: 182 DTKGVDQGCEGGLMDDAFKFIIQNHGLSTEAQYPYEGVDGTCNANKASVQAVTITGYEDV 241
Query: 248 PENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSK-GS 306
P N EQ+L KA+A+QP+SVAI+ASG+DFQFY GVFTG CG ELDHGV AVGYG S G+
Sbjct: 242 PANSEQALQKAVANQPISVAIDASGSDFQFYKSGVFTGACGTELDHGVTAVGYGVSNDGT 301
Query: 307 DYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
Y +VKNSWG WGE GYI M+R EG+CGI AS P
Sbjct: 302 KYWLVKNSWGTDWGEEGYIMMQRGIEAAEGICGIAMQASYP 342
>gi|144905104|dbj|BAF56427.1| cysteine proteinase [Lotus japonicus]
Length = 342
Score = 350 bits (898), Expect = 5e-94, Method: Compositional matrix adjust.
Identities = 183/352 (51%), Positives = 236/352 (67%), Gaps = 25/352 (7%)
Query: 3 FFSHSKLLLLSLSLSLFACSSLA-HDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKC 61
F+ S L+L L L F SS D S + E E WM+++GK YK
Sbjct: 7 FYQISFALVLCLGLWAFQVSSRTLQDAS--------------MHERHEQWMARYGKVYKD 52
Query: 62 IEEKLHRFEIFKENLKHIDQRNKEVTS-YWLGLNEFADMSHEEF---KNKYLGLKPQFPT 117
++EK RF IF+EN+K+I+ N Y LG+N+F D++++EF +NK+ G T
Sbjct: 53 LQEKEKRFNIFQENVKYIEASNNAGNKPYKLGVNQFTDLTNKEFIATRNKFKGHMSSSIT 112
Query: 118 RRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNL 177
R + F Y +V A P +VDWR++GAVTPVKNQG+CG CWAFS VAA EGI+++ +GNL
Sbjct: 113 R---TTTFKYENVTA-PSTVDWRQEGAVTPVKNQGTCGCCWAFSAVAATEGIHKLSTGNL 168
Query: 178 TSLSEQELIDCDTS-FNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEM 236
SLSEQEL+DCDTS + GC GGLMD AFK+I+ +GGL+ E YPY +GTC +E
Sbjct: 169 VSLSEQELVDCDTSGADQGCQGGLMDDAFKFIIQNGGLNTEAQYPYQGVDGTCNTNEEVT 228
Query: 237 EVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVA 296
V TI+GY+DVP N+EQ+L +A+A+QP+SVAI+ASG+DFQ Y GVFTG CG +LDHGVA
Sbjct: 229 HVATITGYEDVPSNNEQALQQAVANQPISVAIDASGSDFQNYQSGVFTGSCGTQLDHGVA 288
Query: 297 AVGYGKS-KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
VGYG S G+ Y +VKNSWG WGE GYIRM+R+ PEGLCGI S P
Sbjct: 289 VVGYGVSDDGTKYWLVKNSWGEDWGEEGYIRMQRDVEAPEGLCGIAMQPSYP 340
>gi|351726339|ref|NP_001237379.1| cysteine proteinase precursor [Glycine max]
gi|31559526|dbj|BAC77521.1| cysteine proteinase [Glycine max]
gi|31559528|dbj|BAC77522.1| cysteine proteinase [Glycine max]
Length = 362
Score = 350 bits (898), Expect = 6e-94, Method: Compositional matrix adjust.
Identities = 172/317 (54%), Positives = 220/317 (69%), Gaps = 6/317 (1%)
Query: 38 LTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFA 97
L S + +L+E W S H + + +K RF +FK N+ H+ NK Y L LN+FA
Sbjct: 30 LASEESFWDLYERWRSHH-TVSRSLGDKHKRFNVFKANVMHVHNTNKMDKPYKLKLNKFA 88
Query: 98 DMSHEEFKNKYLGLKPQ----FPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGS 153
DM++ EF++ Y G K F + + F Y V ++P SVDWRK GAVT VK+QG
Sbjct: 89 DMTNHEFRSTYAGSKVNHHRMFQGTPRGNGTFMYEKVGSVPPSVDWRKNGAVTGVKDQGQ 148
Query: 154 CGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGG 213
CGSCWAFSTV AVEGINQI + L SLSEQEL+DCDT N GCNGGLM+ AF++I GG
Sbjct: 149 CGSCWAFSTVVAVEGINQIKTNKLVSLSEQELVDCDTKKNAGCNGGLMESAFEFIKQKGG 208
Query: 214 LHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGT 273
+ E +YPY ++GTC+ K V+I G+++VP NDE +LLKA+A+QPVSVAI+A G+
Sbjct: 209 ITTESNYPYTAQDGTCDASKANDLAVSIDGHENVPANDENALLKAVANQPVSVAIDAGGS 268
Query: 274 DFQFYSGGVFTGPCGAELDHGVAAVGYGKS-KGSDYIIVKNSWGPKWGERGYIRMKRNTG 332
DFQFYS GVFTG C EL+HGVA VGYG + G++Y V+NSWGP+WGE+GYIRM+R+
Sbjct: 269 DFQFYSEGVFTGDCSTELNHGVAIVGYGTTVDGTNYWTVRNSWGPEWGEQGYIRMQRSIS 328
Query: 333 KPEGLCGINKMASIPLK 349
K EGLCGI MAS P+K
Sbjct: 329 KKEGLCGIAMMASYPIK 345
>gi|40806500|gb|AAR92155.1| putative cysteine protease 2 [Iris x hollandica]
Length = 359
Score = 350 bits (897), Expect = 7e-94, Method: Compositional matrix adjust.
Identities = 182/348 (52%), Positives = 237/348 (68%), Gaps = 11/348 (3%)
Query: 7 SKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKL 66
+K +LL+L ++L A +A + ++ + L S + L L+E W S H + + EK
Sbjct: 3 TKSMLLALVVAL-AFVGVAR---TIPFNEKDLASEESLWGLYERWRSHH-TVSRDLSEKN 57
Query: 67 HRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQF--PTRRQPSA- 123
RF +FKEN K I + NK+ Y LGLN+FADM+++EF++ Y G K R P A
Sbjct: 58 KRFNVFKENAKFIHEFNKKDAPYKLGLNKFADMTNQEFRSTYAGSKIHHHRTQRGTPRAT 117
Query: 124 -EFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSE 182
F Y +V ++P SVDWR +GAV PVK+QG CGSCWAFST+A+VEGIN+I + L LS
Sbjct: 118 GSFMYENVHSIPASVDWRTQGAVAPVKDQGQCGSCWAFSTIASVEGINKIKTNQLVPLSG 177
Query: 183 QELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTIS 242
Q+L+DCDT N GCNGGLMDYAF++I ++GG+ E YPY E+G+C + VVTI
Sbjct: 178 QQLVDCDTDQNEGCNGGLMDYAFEFIKSNGGITSESAYPYTAEQGSCAS-ESSAPVVTID 236
Query: 243 GYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGK 302
GY+DVP N+E +L+KA+A+Q VSVAIEASG FQFYS GVFTG CG ELDHGVA VGYG
Sbjct: 237 GYEDVPANNEAALMKAVANQVVSVAIEASGMAFQFYSEGVFTGSCGNELDHGVAVVGYGA 296
Query: 303 SK-GSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
++ G+ Y IV+NSWG +WGE+GYIRM+R GLCGI S PLK
Sbjct: 297 TRDGTKYWIVRNSWGAEWGEKGYIRMQRGIRARHGLCGIAMEPSYPLK 344
>gi|129614|sp|P00784.1|PAPA1_CARPA RecName: Full=Papain; AltName: Full=Papaya proteinase I; Short=PPI;
AltName: Allergen=Car p 1; Flags: Precursor
gi|167391|gb|AAB02650.1| papain precursor [Carica papaya]
gi|387885|gb|AAA72774.1| papain [synthetic construct]
gi|225437|prf||1303270A papain
Length = 345
Score = 350 bits (897), Expect = 8e-94, Method: Compositional matrix adjust.
Identities = 182/351 (51%), Positives = 236/351 (67%), Gaps = 18/351 (5%)
Query: 5 SHSKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEE 64
S SKLL +++ L ++ S DFSIVGYS LTS ++LI+LFESWM KH K YK I+E
Sbjct: 6 SISKLLFVAICLFVYMGLSFG-DFSIVGYSQNDLTSTERLIQLFESWMLKHNKIYKNIDE 64
Query: 65 KLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLG-LKPQFPTRRQPSA 123
K++RFEIFK+NLK+ID+ NK+ SYWLGLN FADMS++EFK KY G + + T
Sbjct: 65 KIYRFEIFKDNLKYIDETNKKNNSYWLGLNVFADMSNDEFKEKYTGSIAGNYTT-----T 119
Query: 124 EFSYRDV-----KALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLT 178
E SY +V +P+ VDWR+KGAVTPVKNQGSCGSCWAFS V +EGI +I +GNL
Sbjct: 120 ELSYEEVLNDGDVNIPEYVDWRQKGAVTPVKNQGSCGSCWAFSAVVTIEGIIKIRTGNLN 179
Query: 179 SLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEV 238
SEQEL+DCD + GCNGG A + +VA G+H YPY + C +++
Sbjct: 180 EYSEQELLDCDRR-SYGCNGGYPWSALQ-LVAQYGIHYRNTYPYEGVQRYCRSREKGPYA 237
Query: 239 VTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAV 298
G + V +E +LL ++A+QPVSV +EA+G DFQ Y GG+F GPCG ++DH VAAV
Sbjct: 238 AKTDGVRQVQPYNEGALLYSIANQPVSVVLEAAGKDFQLYRGGIFVGPCGNKVDHAVAAV 297
Query: 299 GYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
GY G +YI++KNSWG WGE GYIR+KR TG G+CG+ + P+K
Sbjct: 298 GY----GPNYILIKNSWGTGWGENGYIRIKRGTGNSYGVCGLYTSSFYPVK 344
>gi|297830592|ref|XP_002883178.1| hypothetical protein ARALYDRAFT_479457 [Arabidopsis lyrata subsp.
lyrata]
gi|297329018|gb|EFH59437.1| hypothetical protein ARALYDRAFT_479457 [Arabidopsis lyrata subsp.
lyrata]
Length = 452
Score = 350 bits (897), Expect = 8e-94, Method: Compositional matrix adjust.
Identities = 172/340 (50%), Positives = 225/340 (66%), Gaps = 3/340 (0%)
Query: 13 SLSLSLFACSSLAHDFSIVGYSPEHLTSMD-KLIELFESWMSKHGKTYKCIEEKLHRFEI 71
S++L+L S L S+ + T + + ++E W+ ++ K Y + EK RFEI
Sbjct: 7 SITLALLIFSMLLISLSLGSVTAADTTRNEAEARRMYEQWLVENRKNYNGLGEKETRFEI 66
Query: 72 FKENLKHIDQRNKEVT-SYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDV 130
F +NLK+I++ N ++ +GL FAD++++EF+ YL K + + Y+
Sbjct: 67 FTDNLKYIEEHNSVPNQTFEVGLTRFADLTNDEFRAIYLRSKMERTRVPVKGERYLYKVG 126
Query: 131 KALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDT 190
LP +DWR KGAV PVK+QG+CGSCWAFS + AVEGINQI +G L SLSEQEL+DCDT
Sbjct: 127 DTLPDQIDWRAKGAVNPVKDQGNCGSCWAFSAIGAVEGINQIKTGELISLSEQELVDCDT 186
Query: 191 SFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYL-MEEGTCEDKKEEMEVVTISGYQDVPE 249
S+N GC GGLMDYAFK+I+ +GG+ EEDYPY ++ C K+ VVTI GY+DVP+
Sbjct: 187 SYNGGCGGGLMDYAFKFIIENGGIDTEEDYPYTATDDNICNSDKKNSRVVTIDGYEDVPQ 246
Query: 250 NDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYI 309
NDE+SL KALA+QP+SVAIEA G FQ Y GVFTG CG LDHGV AVGYG G DY
Sbjct: 247 NDEKSLKKALANQPISVAIEAGGRAFQLYKSGVFTGTCGTSLDHGVVAVGYGSEGGQDYW 306
Query: 310 IVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
IV+NSWG WGE GY +++RN + G CG+ MAS P K
Sbjct: 307 IVRNSWGSNWGESGYFKLERNIKESSGKCGVAMMASYPTK 346
>gi|302142276|emb|CBI19479.3| unnamed protein product [Vitis vinifera]
Length = 388
Score = 350 bits (897), Expect = 8e-94, Method: Compositional matrix adjust.
Identities = 170/307 (55%), Positives = 214/307 (69%), Gaps = 31/307 (10%)
Query: 45 IELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEF 104
+ ++E+W++KHGK+Y + EK RF+IFK+NL+ ID+ N E +Y
Sbjct: 1 MAVYEAWLAKHGKSYNALGEKERRFQIFKDNLRFIDEHNAENRTY--------------- 45
Query: 105 KNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVA 164
+ S +++R +LP+SVDWRKKGAV VK+QGSCGSCWAFST+A
Sbjct: 46 ---------------KISDRYAFRVGDSLPESVDWRKKGAVVEVKDQGSCGSCWAFSTIA 90
Query: 165 AVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLM 224
AVEGIN+IV+G L SLSEQEL+DCDTS+N GCNGGLMDYAF++I+ +GG+ EEDYPY
Sbjct: 91 AVEGINKIVTGGLISLSEQELVDCDTSYNEGCNGGLMDYAFEFIINNGGIDSEEDYPYKA 150
Query: 225 EEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFT 284
+G C+ ++ +VVTI GY+DVPENDE+SL KA+A+QPVSVAIEA G +FQ Y G+FT
Sbjct: 151 SDGRCDQYRKNAKVVTIDGYEDVPENDEKSLEKAVANQPVSVAIEAGGREFQLYQSGIFT 210
Query: 285 GPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTG-KPEGLCGINKM 343
G CG LDHGV AVGYG G DY IVKNSWG WGE GYIRM+R+ G CGI
Sbjct: 211 GRCGTALDHGVTAVGYGTENGVDYWIVKNSWGASWGEEGYIRMERDLATSATGKCGIAME 270
Query: 344 ASIPLKK 350
AS P+KK
Sbjct: 271 ASYPIKK 277
>gi|224081320|ref|XP_002306369.1| predicted protein [Populus trichocarpa]
gi|222855818|gb|EEE93365.1| predicted protein [Populus trichocarpa]
Length = 340
Score = 349 bits (896), Expect = 1e-93, Method: Compositional matrix adjust.
Identities = 171/311 (54%), Positives = 218/311 (70%), Gaps = 14/311 (4%)
Query: 44 LIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEV-TSYWLGLNEFADMSHE 102
+ E E WM+++G+ YK EK R+ IFKEN+ ID N + SY LG+N+FAD+S+E
Sbjct: 35 MYERHEQWMAQYGRVYKDDAEKETRYNIFKENVARIDAFNSQTGKSYKLGVNQFADLSNE 94
Query: 103 EFK---NKYLG--LKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSC 157
EFK N++ G PQ + F Y +V A+P ++DWRKKGAVTPVK+QG CG C
Sbjct: 95 EFKASRNRFKGHMCSPQ-------AGPFRYENVSAVPATMDWRKKGAVTPVKDQGQCGCC 147
Query: 158 WAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHK 216
WAFS VAA+EGINQ+ +G L SLSEQE++DCDT + GCNGGLMD AFK+I + GL
Sbjct: 148 WAFSAVAAMEGINQLTTGKLISLSEQEVVDCDTKGEDQGCNGGLMDDAFKFIEQNKGLTT 207
Query: 217 EEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQ 276
E +YPY +GTC +KE I+G++DVP N E +L+KA+A QPVSVAI+A G +FQ
Sbjct: 208 EANYPYTGTDGTCNTQKEATHAAKITGFEDVPANSEAALMKAVAKQPVSVAIDAGGFEFQ 267
Query: 277 FYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEG 336
FYS G+FTG CG +LDHGV AVGYG S G+ Y +VKNSWG +WGE GYIRM+++ EG
Sbjct: 268 FYSSGIFTGSCGTQLDHGVTAVGYGISDGTKYWLVKNSWGAQWGEEGYIRMQKDISAKEG 327
Query: 337 LCGINKMASIP 347
LCGI AS P
Sbjct: 328 LCGIAMQASYP 338
>gi|225446585|ref|XP_002280215.1| PREDICTED: vignain [Vitis vinifera]
Length = 341
Score = 349 bits (895), Expect = 1e-93, Method: Compositional matrix adjust.
Identities = 181/350 (51%), Positives = 233/350 (66%), Gaps = 14/350 (4%)
Query: 1 MAFFSHSKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYK 60
MA + + + L+L L A +S A +++ S + E E WM+++G+ YK
Sbjct: 1 MASVNQYQYICLALLFFLAAWASQATARNLLEAS---------MYERHEDWMAQYGRVYK 51
Query: 61 CIEEKLHRFEIFKENLKHIDQRNKEV-TSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRR 119
+EK R++IFK+N+ I+ NK + SY L +NEFAD+++EEF+ K +
Sbjct: 52 DADEKSKRYKIFKDNVARIESFNKAMDKSYKLSINEFADLTNEEFRASRNRFKAHICSTE 111
Query: 120 QPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTS 179
S F Y V A+P +VDWRKKGAVTP+K+QG CGSCWAFS VAA+EGI Q+ +G L S
Sbjct: 112 ATS--FKYEHVAAVPSTVDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLIS 169
Query: 180 LSEQELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEV 238
LSEQEL+DCDTS + GCNGGLMD AFK+I + GL E +YPY +GTC KK
Sbjct: 170 LSEQELVDCDTSGEDQGCNGGLMDDAFKFIEQNHGLATEANYPYAGTDGTCNRKKAAHPA 229
Query: 239 VTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAV 298
I+GY+DVP N+E++L KA+AHQP++VAI+A G +FQFYS GVFTG CG ELDHGVAAV
Sbjct: 230 AKINGYEDVPANNEKALQKAVAHQPIAVAIDAGGFEFQFYSSGVFTGQCGTELDHGVAAV 289
Query: 299 GYGKS-KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
GYG S G Y +VKNSWG WGE GYIRM+R+ EGLCGI AS P
Sbjct: 290 GYGTSDDGMKYWLVKNSWGTGWGEVGYIRMQRDVTAKEGLCGIAMQASYP 339
>gi|225446583|ref|XP_002280204.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1 [Vitis vinifera]
Length = 341
Score = 348 bits (894), Expect = 2e-93, Method: Compositional matrix adjust.
Identities = 181/350 (51%), Positives = 234/350 (66%), Gaps = 14/350 (4%)
Query: 1 MAFFSHSKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYK 60
MA + + + L+L L A +S A ++ H SM E E WM ++G+ YK
Sbjct: 1 MASVNQYQYICLALLFVLAAWASQATARNL------HEASM---YERHEDWMVQYGREYK 51
Query: 61 CIEEKLHRFEIFKENLKHIDQRNKEV-TSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRR 119
+EK R++IFK+N+ I+ NK + SY L +NEFAD+++EEF+ K +
Sbjct: 52 DADEKSKRYKIFKDNVARIESFNKAMDKSYKLSINEFADLTNEEFRASRNRFKAHICSTE 111
Query: 120 QPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTS 179
S F Y +V A+P +VDWRKKGAVTP+K+QG CGSCWAFS VAA+EGI Q+ +G L S
Sbjct: 112 ATS--FKYENVTAVPSTVDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLIS 169
Query: 180 LSEQELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEV 238
LSEQEL+DCDTS + GC+GGLMD AFK+I + GL E +YPY +GTC KK
Sbjct: 170 LSEQELVDCDTSGEDQGCSGGLMDDAFKFIEQNHGLTTEANYPYAGTDGTCNRKKAAHPA 229
Query: 239 VTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAV 298
I+GY+DVP N+E++L KA+AHQP++VAI+A G++FQFYS GVFTG CG ELDHGV+AV
Sbjct: 230 AKINGYEDVPANNEKALQKAVAHQPIAVAIDAGGSEFQFYSSGVFTGQCGTELDHGVSAV 289
Query: 299 GYGKS-KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
GYG S G Y +VKNSWG WGE GYIRM+R+ EGLCGI AS P
Sbjct: 290 GYGTSDDGMKYWLVKNSWGTGWGEEGYIRMQRDVTAKEGLCGIAMQASYP 339
>gi|262360187|gb|ACY38051.2| cysteine proteinase C1A [Dactylis glomerata]
Length = 365
Score = 348 bits (894), Expect = 2e-93, Method: Compositional matrix adjust.
Identities = 174/327 (53%), Positives = 228/327 (69%), Gaps = 8/327 (2%)
Query: 31 VGYSPEHLTSMDKLIELFESWMSKHGKTYKCI--EEKLHRFEIFKENLKHIDQRNKEVTS 88
V ++ + L S + L L+E+W S H + + + E + RF +FKEN+++I + NK+
Sbjct: 23 VPFTEKDLASEESLRGLYETWRSHHTVSRRGLGAEAEARRFNVFKENVRYIHEANKKDRP 82
Query: 89 YWLGLNEFADMSHEEFKNKYLGLKPQF-----PTRRQPSAEFSYRDVKALPKSVDWRKKG 143
+ L LN+FADM+ +EF+ Y G + + RRQ F Y D + LP +VDWR+KG
Sbjct: 83 FRLALNKFADMTTDEFRRTYAGSRVRHHRSLSGGRRQGGGSFMYADAENLPAAVDWRQKG 142
Query: 144 AVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDY 203
AVTP+K+QG CGSCWAFST+ AVEGIN+I +G L SLSEQEL+DC+ N+GCNGGLMD
Sbjct: 143 AVTPIKDQGQCGSCWAFSTIVAVEGINKIRTGRLVSLSEQELMDCNIGENDGCNGGLMDV 202
Query: 204 AFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQP 263
AF++I +GG+ E YPY E+ +C+ KE V+I GY+DVP NDE +L KA+A+QP
Sbjct: 203 AFQFIQQNGGITTEASYPYQGEQNSCDQSKENSHDVSIDGYEDVPANDESALQKAVANQP 262
Query: 264 VSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSK-GSDYIIVKNSWGPKWGER 322
VSVAI+ASG DFQFYS GVFT G +LDHGVAAVGYG ++ G+ Y IVKNSWG WGE+
Sbjct: 263 VSVAIDASGNDFQFYSEGVFTTDGGTDLDHGVAAVGYGTTRDGTKYWIVKNSWGEDWGEK 322
Query: 323 GYIRMKRNTGKPEGLCGINKMASIPLK 349
GYIRM+R + EGLCGI AS P K
Sbjct: 323 GYIRMQRGVKQAEGLCGIAMEASYPTK 349
>gi|226529105|ref|NP_001150196.1| cysteine protease 1 precursor [Zea mays]
gi|194701798|gb|ACF84983.1| unknown [Zea mays]
gi|194704800|gb|ACF86484.1| unknown [Zea mays]
gi|195637480|gb|ACG38208.1| cysteine protease 1 precursor [Zea mays]
gi|413919895|gb|AFW59827.1| cysteine protease 1 [Zea mays]
Length = 470
Score = 348 bits (894), Expect = 2e-93, Method: Compositional matrix adjust.
Identities = 171/337 (50%), Positives = 226/337 (67%), Gaps = 14/337 (4%)
Query: 28 FSIVGYSPEHLTSMDKLIE-----LFESWMSKHGKTYKCIEE----KLHRFEIFKENLKH 78
SI+ Y+ EH + E +++ W+++HG+ Y + E + RF +F +NL+
Sbjct: 32 MSIITYNEEHGARGLERTEPEVRAMYDLWLAEHGRAYNALGEGEGERDRRFLVFWDNLRF 91
Query: 79 IDQRNKEVTS--YWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKA--LP 134
+D N+ + + LG+N+FAD++++EF+ YLG R E D A LP
Sbjct: 92 VDAHNERAGARGFRLGMNQFADLTNDEFRAAYLGAMVPAARRGAVVGERYRHDGAAEELP 151
Query: 135 KSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSF-N 193
+SVDWR+KGAV PVKNQG CGSCWAFS V++VE +NQIV+G + +LSEQEL++C T N
Sbjct: 152 ESVDWREKGAVAPVKNQGQCGSCWAFSAVSSVESVNQIVTGEMVTLSEQELVECSTDGGN 211
Query: 194 NGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQ 253
+GCNGGLMD AF +I+ +GG+ E+DYPY +G C+ ++ VV+I G++DVPENDE+
Sbjct: 212 SGCNGGLMDAAFDFIIKNGGIDTEDDYPYRAVDGKCDMNRKNARVVSIDGFEDVPENDEK 271
Query: 254 SLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKN 313
SL KA+AHQPVSVAIEA G +FQ Y GVF+G C LDHGV AVGYG G DY IV+N
Sbjct: 272 SLQKAVAHQPVSVAIEAGGREFQLYKSGVFSGSCTTNLDHGVVAVGYGAENGKDYWIVRN 331
Query: 314 SWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLKK 350
SWGPKWGE GYIRM+RN G CGI MAS P KK
Sbjct: 332 SWGPKWGEAGYIRMERNVNASTGKCGIAMMASYPTKK 368
>gi|359485281|ref|XP_002280230.2| PREDICTED: LOW QUALITY PROTEIN: KDEL-tailed cysteine endopeptidase
CEP1 [Vitis vinifera]
Length = 341
Score = 348 bits (894), Expect = 2e-93, Method: Compositional matrix adjust.
Identities = 181/350 (51%), Positives = 232/350 (66%), Gaps = 14/350 (4%)
Query: 1 MAFFSHSKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYK 60
MA + + + L+L L A +S A ++ H SM E E WM+++G+ YK
Sbjct: 1 MASVNQYQYICLALLFVLAAWASQATARNL------HEASM---YERHEDWMAQYGRVYK 51
Query: 61 CIEEKLHRFEIFKENLKHIDQRNKEV-TSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRR 119
+EK R++IFK+N+ I+ NK + SY L +NEFAD+++EEF K +
Sbjct: 52 DADEKSKRYKIFKDNVARIESFNKAMDKSYKLSINEFADLTNEEFGTSRNRFKAHICSTE 111
Query: 120 QPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTS 179
S F Y +V A+P ++DWRKKGAVTP+K+QG CGSCWAFS VAA+EGI Q+ +G L S
Sbjct: 112 ATS--FKYENVTAVPSTIDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLIS 169
Query: 180 LSEQELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEV 238
LSEQEL+DCDTS + GCNGGLMD AFK+I + GL E +YPY +GTC KK
Sbjct: 170 LSEQELVDCDTSGEDQGCNGGLMDDAFKFIKQNHGLTTEANYPYAGTDGTCNRKKAAHPA 229
Query: 239 VTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAV 298
I+GY+DVP N+E++L KA+ HQP++VAI+A G +FQFYS GVFTG CG ELDHGVAAV
Sbjct: 230 AKINGYEDVPANNEKALQKAVVHQPIAVAIDAGGFEFQFYSSGVFTGQCGTELDHGVAAV 289
Query: 299 GYGKS-KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
GYG S G Y +VKNSWG WGE GYIRM+R+ EGLCGI AS P
Sbjct: 290 GYGTSDDGMKYWLVKNSWGTGWGEEGYIRMQRDVTAKEGLCGIAMQASYP 339
>gi|147839728|emb|CAN70559.1| hypothetical protein VITISV_032465 [Vitis vinifera]
Length = 341
Score = 348 bits (893), Expect = 2e-93, Method: Compositional matrix adjust.
Identities = 182/350 (52%), Positives = 233/350 (66%), Gaps = 14/350 (4%)
Query: 1 MAFFSHSKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYK 60
MA + + + L+L L A +S A + H SM E E WM ++G+ YK
Sbjct: 1 MASVNQYQYICLALLFVLAAWASQATARXL------HEASM---YERHEDWMVQYGREYK 51
Query: 61 CIEEKLHRFEIFKENLKHIDQRNKEV-TSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRR 119
+EK R++IFK+N+ I+ NK + SY L +NEFAD+++EEF+ K +
Sbjct: 52 DADEKSKRYKIFKDNVARIESFNKAMDKSYKLSINEFADLTNEEFRASRNRFKAHICSTE 111
Query: 120 QPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTS 179
S F Y +V A+P +VDWRKKGAVTP+K+QG CGSCWAFS VAA+EGI Q+ +G L S
Sbjct: 112 ATS--FKYENVTAVPSTVDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLIS 169
Query: 180 LSEQELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEV 238
LSEQEL+DCDTS + GC+GGLMD AFK+I + GL E +YPY +GTC KK
Sbjct: 170 LSEQELVDCDTSGEDQGCSGGLMDDAFKFIEQNHGLTTEANYPYAGTDGTCNRKKAAHPA 229
Query: 239 VTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAV 298
I+GY+DVP N+E++L KA+AHQP++VAI+ASG++FQFYS GVFTG CG ELDHGVAAV
Sbjct: 230 AKINGYEDVPANNEKALQKAVAHQPIAVAIDASGSEFQFYSSGVFTGQCGTELDHGVAAV 289
Query: 299 GYGKS-KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
GYG S G Y +VKNSW WGE GYIRM+R+ EGLCGI AS P
Sbjct: 290 GYGTSDDGMKYWLVKNSWSTGWGEEGYIRMQRDVTVKEGLCGIAMQASYP 339
>gi|357156854|ref|XP_003577598.1| PREDICTED: thiol protease SEN102-like [Brachypodium distachyon]
Length = 368
Score = 348 bits (893), Expect = 2e-93, Method: Compositional matrix adjust.
Identities = 173/334 (51%), Positives = 228/334 (68%), Gaps = 15/334 (4%)
Query: 31 VGYSPEHLTSMDKLIELFESWMSKH---------GKTYKCIE-EKLHRFEIFKENLKHID 80
V ++ + L S + L L+E W S++ G K + + RF +FKEN+K+I
Sbjct: 21 VPFTEKDLASEESLRGLYERWRSRYTVSPSTPGSGLRGKLADHDPARRFNVFKENVKYIH 80
Query: 81 QRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQ----FPTRRQPSAEFSYRDVKALPKS 136
+ NK+ + L LN+FADM+ +E ++ Y G + + R+ F+Y D + LP +
Sbjct: 81 EANKKDRPFRLALNKFADMTTDELRHSYAGSRVRHHRALSGGRRAQGNFTYSDAENLPPA 140
Query: 137 VDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGC 196
VDWR+KGAVT +K+QG CGSCWAFST+AAVE IN+I +G L SLSEQEL+DCD + GC
Sbjct: 141 VDWREKGAVTGIKDQGQCGSCWAFSTIAAVESINKIRTGKLVSLSEQELMDCDNVNDQGC 200
Query: 197 NGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLL 256
+GGLMDYAF++I +GG+ E +YPY ++ TC+ KE V I GY+DVP NDE +L
Sbjct: 201 DGGLMDYAFQFIQKNGGVTSEANYPYQGQQNTCDQAKENTHDVAIDGYEDVPANDESALQ 260
Query: 257 KALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSK-GSDYIIVKNSW 315
KA+A+QPVSVAIEASG DFQFYS GVFTG C +LDHGVAAVGYG ++ G+ Y IVKNSW
Sbjct: 261 KAVAYQPVSVAIEASGQDFQFYSEGVFTGQCTTDLDHGVAAVGYGTARDGTKYWIVKNSW 320
Query: 316 GPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
G WGE+GYIRM+R + EGLCGI AS P+K
Sbjct: 321 GLDWGEKGYIRMQRGVSQAEGLCGIAMQASYPIK 354
>gi|255568297|ref|XP_002525123.1| cysteine protease, putative [Ricinus communis]
gi|223535582|gb|EEF37250.1| cysteine protease, putative [Ricinus communis]
Length = 349
Score = 348 bits (893), Expect = 2e-93, Method: Compositional matrix adjust.
Identities = 182/355 (51%), Positives = 231/355 (65%), Gaps = 23/355 (6%)
Query: 1 MAFFSHSKLLLLSLSLSLFACSSLA-----HDFSIVGYSPEHLTSMDKLIELFESWMSKH 55
MAF K+L ++L L C+ A H+ + G H E WM+KH
Sbjct: 1 MAFLCKGKILPIALFFVLAMCADQAASRELHELEMTG---RH-----------EKWMAKH 46
Query: 56 GKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT-SYWLGLNEFADMSHEEFKNKYLGLKPQ 114
GK YK +EKL RF+IFK N+ I+ N SY LG+N+FAD+++EEF+ + G K
Sbjct: 47 GKVYKDDKEKLRRFQIFKSNVVFIESFNTAGNKSYMLGINKFADLTNEEFRAFWNGYKRP 106
Query: 115 FPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVS 174
R+ + F Y +V ALP S+DWR KGAVTP+K+QG CGSCWAFS VAA EGI+++ +
Sbjct: 107 LGASRKITP-FKYENVTALPSSIDWRSKGAVTPIKDQGVCGSCWAFSAVAATEGIHKLRT 165
Query: 175 GNLTSLSEQELIDCDTSFNN-GCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKK 233
G L SLSEQEL+DCD + GC GGLM AFK+I GG+ E +YPY +G C+ KK
Sbjct: 166 GKLVSLSEQELVDCDVKGQDKGCQGGLMVDAFKFIKRHGGMTSEANYPYQGRDGKCDTKK 225
Query: 234 EEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDH 293
E V I+GYQ VP+N E +LLKA+A+QPVSVAI+A FQFY G+FTG CG +++H
Sbjct: 226 EASRAVKITGYQAVPKNSEAALLKAVANQPVSVAIDAGSLSFQFYRSGIFTGICGKDINH 285
Query: 294 GVAAVGYGKSK-GSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
GVAAVGYG+S GS Y IVKNSWG +WGE+GYIRMKR+ EGLCGI S P
Sbjct: 286 GVAAVGYGRSNSGSKYWIVKNSWGTEWGEKGYIRMKRDVRSKEGLCGIAMECSYP 340
>gi|255564910|ref|XP_002523448.1| cysteine protease, putative [Ricinus communis]
gi|223537276|gb|EEF38907.1| cysteine protease, putative [Ricinus communis]
Length = 341
Score = 347 bits (891), Expect = 4e-93, Method: Compositional matrix adjust.
Identities = 180/350 (51%), Positives = 229/350 (65%), Gaps = 14/350 (4%)
Query: 1 MAFFSHSKLLLLSL-SLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTY 59
MA S KL+ ++L + L+ + + H +M+ E E WM K+G+ Y
Sbjct: 1 MATISERKLMFVALLVVGLWVSQAWSRSL--------HDAAMN---ERHEMWMVKYGRVY 49
Query: 60 KCIEEKLHRFEIFKENLKHIDQRNKEVTS-YWLGLNEFADMSHEEFKNKYLGLKPQFPTR 118
K EK RFEIF+ N++ I+ NK Y L +NEFAD+++EEFK G K
Sbjct: 50 KDNSEKERRFEIFRNNVEFIESFNKPGNRPYKLDINEFADLTNEEFKASRNGYKRSSNVG 109
Query: 119 RQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLT 178
+ F Y +V A+P S+DWR+KGAVTP+K+QG CG CWAFS VAA+EGI ++ +G L
Sbjct: 110 LSEKSSFRYGNVTAVPTSMDWRQKGAVTPIKDQGQCGCCWAFSAVAAMEGITKLSTGKLI 169
Query: 179 SLSEQELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEME 237
SLSEQEL+DCDTS + GC GGLMD AF++I +GGL E +YPY +GTC K +
Sbjct: 170 SLSEQELVDCDTSGEDQGCEGGLMDDAFEFIKQNGGLTTEANYPYQGTDGTCNTNKAGND 229
Query: 238 VVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAA 297
I+GY+DVP N E +LLKA+A QPVSVAI+ASG+ FQFYSGGVFTG CG ELDHGV A
Sbjct: 230 AAKITGYEDVPANSEDALLKAVASQPVSVAIDASGSAFQFYSGGVFTGDCGTELDHGVTA 289
Query: 298 VGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
VGYG S G+ Y +VKNSWG WGE GYIRM+R+ EGLCGI +S P
Sbjct: 290 VGYGTSDGTKYWLVKNSWGTSWGEDGYIRMERDIEAKEGLCGIAMQSSYP 339
>gi|357483847|ref|XP_003612210.1| Cysteine proteinase [Medicago truncatula]
gi|355513545|gb|AES95168.1| Cysteine proteinase [Medicago truncatula]
Length = 344
Score = 347 bits (890), Expect = 5e-93, Method: Compositional matrix adjust.
Identities = 173/309 (55%), Positives = 217/309 (70%), Gaps = 10/309 (3%)
Query: 46 ELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNK--EVTSYWLGLNEFADMSHEE 103
E E WM+ +GK YK +E+ RF+IF EN+K+I+ N SY LG+N+FAD+++EE
Sbjct: 37 ERHERWMNHYGKVYKDHQEREKRFKIFTENMKYIEAFNNGDNNESYKLGINQFADLTNEE 96
Query: 104 F---KNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAF 160
F +NK+ G R + F Y +V A+P +VDWRKKGAVTPVKNQG CG CWAF
Sbjct: 97 FVASRNKFKGHMCSSIIR---TTTFKYENVSAIPSTVDWRKKGAVTPVKNQGQCGCCWAF 153
Query: 161 STVAAVEGINQIVSGNLTSLSEQELIDCDT-SFNNGCNGGLMDYAFKYIVASGGLHKEED 219
S VAA EGI+++ +G L SLSEQEL+DCDT + GC GGLMD AFK+I+ + GL+ E
Sbjct: 154 SAVAATEGIHKLSTGKLVSLSEQELVDCDTKGVDQGCEGGLMDDAFKFIIQNHGLNTEAQ 213
Query: 220 YPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYS 279
YPY +GTC K ++ TI+GY+DVP N+EQ+L KA+A+QP+SVAI+ASG+DFQFY
Sbjct: 214 YPYQGVDGTCNANKASIQATTITGYEDVPANNEQALQKAVANQPISVAIDASGSDFQFYK 273
Query: 280 GGVFTGPCGAELDHGVAAVGYGKSK-GSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLC 338
GVFTG CG ELDHGV AVGYG S G+ Y +VKNSWG WGE GYI M+R EGLC
Sbjct: 274 SGVFTGSCGTELDHGVTAVGYGVSNDGTKYWLVKNSWGTDWGEEGYIMMQRGVEAAEGLC 333
Query: 339 GINKMASIP 347
GI AS P
Sbjct: 334 GIAMQASYP 342
>gi|357474573|ref|XP_003607571.1| Cysteine proteinase EP-B [Medicago truncatula]
gi|34329348|gb|AAQ63885.1| putative cysteine proteinase [Medicago truncatula]
gi|355508626|gb|AES89768.1| Cysteine proteinase EP-B [Medicago truncatula]
Length = 345
Score = 347 bits (890), Expect = 5e-93, Method: Compositional matrix adjust.
Identities = 177/342 (51%), Positives = 230/342 (67%), Gaps = 9/342 (2%)
Query: 10 LLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRF 69
L S+SL+LF C L F+I + L + E E WM +GK YK ++E+ +R
Sbjct: 7 LYHSISLALFFCLGL---FAIQ-VTSRTLQDDSIIYEKHEQWMVHYGKVYKDLQERENRL 62
Query: 70 EIFKENLKHIDQRNKEVTS--YWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSY 127
+IFKEN+ +I+ N + Y LG+N+FAD+++EEF K + ++ F Y
Sbjct: 63 KIFKENVNYIEASNNAGNNKLYKLGINQFADLTNEEFIASRNKFKGHMCSSITKTSTFKY 122
Query: 128 RDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELID 187
+ ++P +VDWRKKGAVTPVKNQG CG CWAFS VAA EGI+++ +G L SLSEQEL+D
Sbjct: 123 ENA-SVPSTVDWRKKGAVTPVKNQGQCGCCWAFSAVAATEGIHKLSTGKLVSLSEQELVD 181
Query: 188 CDT-SFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQD 246
CDT + GC GGLMD AFK+I+ + GL+ E YPY +GTC K + VTI+GY+D
Sbjct: 182 CDTKGVDQGCEGGLMDDAFKFIIQNHGLNTEAQYPYQGVDGTCSANKASIHAVTITGYED 241
Query: 247 VPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYG-KSKG 305
VP N+EQ+L KA+A+QP+SVAI+ASG+DFQFY GVFTG CG ELDHGV AVGYG + G
Sbjct: 242 VPANNEQALQKAVANQPISVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVGNDG 301
Query: 306 SDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
+ Y +VKNSWG WGE GYI+M+R EGLCGI AS P
Sbjct: 302 TKYWLVKNSWGTDWGEEGYIKMQRGVDAAEGLCGIAMEASYP 343
>gi|2098464|pdb|1PCI|A Chain A, Procaricain
gi|2098465|pdb|1PCI|B Chain B, Procaricain
gi|2098466|pdb|1PCI|C Chain C, Procaricain
Length = 322
Score = 347 bits (890), Expect = 5e-93, Method: Compositional matrix adjust.
Identities = 177/323 (54%), Positives = 226/323 (69%), Gaps = 2/323 (0%)
Query: 27 DFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEV 86
DFSIVGYS + LTS ++LI+LF SWM H K Y+ ++EKL+RFEIFK+NL +ID+ NK+
Sbjct: 1 DFSIVGYSQDDLTSTERLIQLFNSWMLNHNKFYENVDEKLYRFEIFKDNLNYIDETNKKN 60
Query: 87 TSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVT 146
SYWLGLNEFAD+S++EF KY+G + EF D+ LP++VDWRKKGAVT
Sbjct: 61 NSYWLGLNEFADLSNDEFNEKYVGSLIDATIEQSYDEEFINEDIVNLPENVDWRKKGAVT 120
Query: 147 PVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFK 206
PV++QGSCGSCWAFS VA VEGIN+I +G L LSEQEL+DC+ ++GC GG YA +
Sbjct: 121 PVRHQGSCGSCWAFSAVATVEGINKIRTGKLVELSEQELVDCERR-SHGCKGGYPPYALE 179
Query: 207 YIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSV 266
Y VA G+H YPY ++GTC K+ +V SG V N+E +LL A+A QPVSV
Sbjct: 180 Y-VAKNGIHLRSKYPYKAKQGTCRAKQVGGPIVKTSGVGRVQPNNEGNLLNAIAKQPVSV 238
Query: 267 AIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIR 326
+E+ G FQ Y GG+F GPCG ++D V AVGYGKS G YI++KNSWG WGE+GYIR
Sbjct: 239 VVESKGRPFQLYKGGIFEGPCGTKVDGAVTAVGYGKSGGKGYILIKNSWGTAWGEKGYIR 298
Query: 327 MKRNTGKPEGLCGINKMASIPLK 349
+KR G G+CG+ K + P K
Sbjct: 299 IKRAPGNSPGVCGLYKSSYYPTK 321
>gi|224162986|ref|XP_002338508.1| predicted protein [Populus trichocarpa]
gi|222872535|gb|EEF09666.1| predicted protein [Populus trichocarpa]
Length = 306
Score = 347 bits (889), Expect = 6e-93, Method: Compositional matrix adjust.
Identities = 169/311 (54%), Positives = 218/311 (70%), Gaps = 14/311 (4%)
Query: 44 LIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEV-TSYWLGLNEFADMSHE 102
+ E E WM+++G+ YK E+ R+ IFKEN+ ID N + SY LG+N+FAD+++E
Sbjct: 1 MYERHEQWMTQYGRVYKDDNERATRYSIFKENVARIDAFNSQTGKSYKLGVNQFADLTNE 60
Query: 103 EFK---NKYLG--LKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSC 157
EFK N++ G PQ + F Y +V A+P +VDWRK+GAVTPVK+QG CG C
Sbjct: 61 EFKASRNRFKGHMCSPQ-------AGPFRYENVSAVPSTVDWRKEGAVTPVKDQGQCGCC 113
Query: 158 WAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHK 216
WAFS VAA+EGIN++ +G L SLSEQE++DCDT + GCNGGLMD AFK+I + GL
Sbjct: 114 WAFSAVAAMEGINKLTTGKLISLSEQEVVDCDTKGEDQGCNGGLMDDAFKFIEQNKGLTT 173
Query: 217 EEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQ 276
E +YPY +GTC KK + I+G++DVP N E +L+KA+A QPVSVAI+A G+DFQ
Sbjct: 174 EANYPYKGTDGTCNTKKSAIHAAKITGFEDVPANSEAALMKAVAKQPVSVAIDAGGSDFQ 233
Query: 277 FYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEG 336
FYS G+FTG C +LDHGV AVGYG S GS Y +VKNSWG +WGE GYIRM+++ EG
Sbjct: 234 FYSSGIFTGSCDTQLDHGVTAVGYGVSDGSKYWLVKNSWGAQWGEEGYIRMQKDISAKEG 293
Query: 337 LCGINKMASIP 347
LCGI AS P
Sbjct: 294 LCGIAMQASYP 304
>gi|50355613|dbj|BAD29955.1| cysteine protease [Daucus carota]
Length = 365
Score = 347 bits (889), Expect = 7e-93, Method: Compositional matrix adjust.
Identities = 177/333 (53%), Positives = 224/333 (67%), Gaps = 8/333 (2%)
Query: 21 CSSLAHDFSI-VGYSPEHLTSMDK--LIELFESWMSKHGKTYKCIEEKLHRFEIFKENLK 77
C+ LA F+I V S S+++ + E + WM+++G+ YK EK R IF+ENLK
Sbjct: 9 CTPLALLFTIGVLASLAAARSLNEASMTETHDQWMARYGRVYKTANEKNRRSTIFQENLK 68
Query: 78 HIDQRNKEVTS-YWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKS 136
+I NK Y LG+NEFAD+++EEF K + F Y +V A+P +
Sbjct: 69 YIQTFNKANNKPYKLGVNEFADLTNEEFTTSRNKFKSHVCAT--VTNVFRYENVTAVPAT 126
Query: 137 VDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSF-NNG 195
+DWRKKGAVTP+KNQG CG CWAFS VAA+EGI Q+ +G L SLSEQEL+DCDT+ + G
Sbjct: 127 MDWRKKGAVTPIKNQGQCGCCWAFSAVAAMEGITQLKTGKLISLSEQELVDCDTNGEDQG 186
Query: 196 CNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSL 255
C GGLMDYAF +I + GL E +YPY +GTC KE TI+G++DVP N E +L
Sbjct: 187 CEGGLMDYAFDFIQQNHGLSTETNYPYSGTDGTCNANKEANHAATITGHEDVPANSESAL 246
Query: 256 LKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSK-GSDYIIVKNS 314
LKA+A+QP+SVAI+ASG+DFQFYS GVFTG CG ELDHGV AVGYG + G+ Y +VKNS
Sbjct: 247 LKAVANQPISVAIDASGSDFQFYSSGVFTGECGTELDHGVTAVGYGTAADGTKYWLVKNS 306
Query: 315 WGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
WG WGE GYI+M+R EGLCGI AS P
Sbjct: 307 WGTSWGEEGYIQMQRGVAAAEGLCGIAMQASYP 339
>gi|224093956|ref|XP_002310053.1| predicted protein [Populus trichocarpa]
gi|224147016|ref|XP_002336386.1| predicted protein [Populus trichocarpa]
gi|222834869|gb|EEE73318.1| predicted protein [Populus trichocarpa]
gi|222852956|gb|EEE90503.1| predicted protein [Populus trichocarpa]
Length = 340
Score = 346 bits (888), Expect = 8e-93, Method: Compositional matrix adjust.
Identities = 176/340 (51%), Positives = 227/340 (66%), Gaps = 21/340 (6%)
Query: 19 FACSSLAHDFSIVGYSPEHLTSMDKL----IELFESWMSKHGKTYKCIEEKLHRFEIFKE 74
F C +L I+G P T+ L E E WM+++G+ YK E+ R+ IFKE
Sbjct: 9 FVCLAL---LFILGAWPSKSTARTLLDAPMYERHEQWMTQYGRVYKDDNERATRYSIFKE 65
Query: 75 NLKHIDQRNKEV-TSYWLGLNEFADMSHEEFK---NKYLG--LKPQFPTRRQPSAEFSYR 128
N+ ID N + SY LG+N+FAD+++EEFK N++ G PQ + F Y
Sbjct: 66 NVARIDAFNSQTGKSYKLGVNQFADLTNEEFKASRNRFKGHMCSPQ-------AGPFRYE 118
Query: 129 DVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDC 188
+V A+P +VDWRK+GAVTPVK+QG CG CWAFS VAA+EGIN++ +G L SLSEQE++DC
Sbjct: 119 NVSAVPSTVDWRKEGAVTPVKDQGQCGCCWAFSAVAAMEGINKLTTGKLISLSEQEVVDC 178
Query: 189 DTSF-NNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDV 247
DT + GCNGGLMD AFK+I + GL E +YPY +GTC K + I+G++DV
Sbjct: 179 DTKGEDQGCNGGLMDDAFKFIEQNKGLTTEANYPYKGTDGTCNTNKAAIHAAKITGFEDV 238
Query: 248 PENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSD 307
P N E +L+KA+A QPVSVAI+A G+DFQFYS G+FTG C +LDHGV AVGYG S GS
Sbjct: 239 PANSEAALMKAVAKQPVSVAIDAGGSDFQFYSSGIFTGSCDTQLDHGVTAVGYGVSDGSK 298
Query: 308 YIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
Y +VKNSWG +WGE GYIRM+++ EGLCGI AS P
Sbjct: 299 YWLVKNSWGAQWGEEGYIRMQKDISAKEGLCGIAMQASYP 338
>gi|147788834|emb|CAN64655.1| hypothetical protein VITISV_005140 [Vitis vinifera]
Length = 341
Score = 346 bits (888), Expect = 8e-93, Method: Compositional matrix adjust.
Identities = 181/350 (51%), Positives = 233/350 (66%), Gaps = 14/350 (4%)
Query: 1 MAFFSHSKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYK 60
MA + + + L+L L A +S A ++ H SM E E WM+++G+ YK
Sbjct: 1 MASVNQYRYICLALLFVLAAWASHAKARNL------HEASM---YERHEDWMAQYGRVYK 51
Query: 61 CIEEKLHRFEIFKENLKHIDQRNKEVT-SYWLGLNEFADMSHEEFKNKYLGLKPQFPTRR 119
EK R++IFK+N+ I+ NK + SY L +NEFAD+++EEF+ K +
Sbjct: 52 DAGEKSKRYKIFKDNVARIESFNKAMNKSYKLSINEFADLTNEEFRASRNRFKAHICSTE 111
Query: 120 QPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTS 179
S F Y V A+P +VDWRKKGAVTP+K+QG CGSCWAFS VAA+EGI Q+ +G L S
Sbjct: 112 ATS--FKYEHVXAVPSTVDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLIS 169
Query: 180 LSEQELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEV 238
LSEQEL+DCDTS + GC+GGLMD AFK+I + GL E +YPY +GTC KK
Sbjct: 170 LSEQELVDCDTSGEDQGCSGGLMDDAFKFIEQNHGLTTEANYPYAGTDGTCNRKKAAHPA 229
Query: 239 VTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAV 298
I+GY+DVP N+E++L KA+AHQP++VAI+A G +FQFYS GVFTG CG ELDHGV+AV
Sbjct: 230 AKINGYEDVPANNEKALQKAVAHQPIAVAIDAGGFEFQFYSSGVFTGQCGTELDHGVSAV 289
Query: 299 GYGKS-KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
GYG S G Y +VKNSWG WGE GYIRM+R+ + EGLCGI AS P
Sbjct: 290 GYGTSDDGMKYWLVKNSWGTGWGEEGYIRMQRDVTEKEGLCGIAMQASYP 339
>gi|255547982|ref|XP_002515048.1| cysteine protease, putative [Ricinus communis]
gi|223546099|gb|EEF47602.1| cysteine protease, putative [Ricinus communis]
Length = 359
Score = 345 bits (885), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 175/324 (54%), Positives = 226/324 (69%), Gaps = 10/324 (3%)
Query: 33 YSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLG 92
Y E L S + L L+E W S H + + EK RF +FKENLKHI + N++ Y L
Sbjct: 25 YKEEDLASEESLWNLYERWRSHH-TVSRSLTEKNQRFNVFKENLKHIHKVNQKDRPYKLR 83
Query: 93 LNEFADMSHEEFKNKYLGLKPQF-----PTRRQPSAEFSYRDVKALPKSVDWRKKGAVTP 147
LN+FADM++ EF Y G K +RRQ F++ + LP S+DWRK+GAVT
Sbjct: 84 LNKFADMTNHEFLQHYGGSKVSHYRMFHGSRRQTG--FAHENTSNLPSSIDWRKQGAVTG 141
Query: 148 VKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKY 207
VK+QG CGSCWAFS+VAAVEGIN+I +G L SLSEQEL+DC+ S N+GC+GGLM+ AF +
Sbjct: 142 VKDQGKCGSCWAFSSVAAVEGINKIKTGELISLSEQELVDCN-SVNHGCDGGLMEQAFSF 200
Query: 208 IVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVA 267
I +GGL E +YPY ++G C+ K +VTI GY+ VPENDE +L++A+A+QPVS+A
Sbjct: 201 IEKTGGLTTENNYPYRAKDGYCDSAKMNTPMVTIDGYEMVPENDEHALMQAVANQPVSIA 260
Query: 268 IEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSK-GSDYIIVKNSWGPKWGERGYIR 326
I+A G DFQFYS GV+TG CG EL+HGVA VGYG ++ G+ Y IVKNSWG +WGE G+IR
Sbjct: 261 IDAGGQDFQFYSEGVYTGDCGTELNHGVALVGYGATQDGTKYWIVKNSWGSEWGENGFIR 320
Query: 327 MKRNTGKPEGLCGINKMASIPLKK 350
M+R EGLCGI AS P+K+
Sbjct: 321 MQRENDVEEGLCGITLEASYPIKQ 344
>gi|111073717|dbj|BAF02547.1| triticain beta [Triticum aestivum]
Length = 472
Score = 345 bits (885), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 169/339 (49%), Positives = 228/339 (67%), Gaps = 15/339 (4%)
Query: 27 DFSIVGYSPEHLTSMDKLIE-----LFESWMSKHG----KTYKCIEEKLHRFEIFKENLK 77
D SI+ Y+ EH + E +++ W++++G I E+ RF F +NL
Sbjct: 27 DMSIIAYNAEHGARGLERTEAEARAVYDLWLAENGGGSSPNANSIPERERRFRAFWDNLN 86
Query: 78 HIDQRNKEVTS----YWLGLNEFADMSHEEFKNKYLGLKPQFPTR-RQPSAEFSYRDVKA 132
+D N + Y LG+N FAD++++EF+ YLG+K Q R + + +
Sbjct: 87 FVDAHNARAAAGEEGYRLGMNRFADLTNDEFRAAYLGVKAQRARPGRMVGERYRHDGAEE 146
Query: 133 LPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSF 192
LP++VDWR+KGAV PVKNQG CGSCWAFS V+ VE INQIV+G + +LSEQEL++CDT+
Sbjct: 147 LPEAVDWREKGAVAPVKNQGQCGSCWAFSAVSTVESINQIVTGEMVTLSEQELVECDTNG 206
Query: 193 -NNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPEND 251
++GCNGGLMD AF++I+ +GG+ E+DYPY +G C+ ++ +VV+I G++DVPEND
Sbjct: 207 QSSGCNGGLMDDAFEFIIKNGGIDTEDDYPYKAIDGRCDVLRKNAKVVSIDGFEDVPEND 266
Query: 252 EQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIV 311
E+SL KA+AHQPVSVAIEA G +FQ Y GVF+G CG +LDHGV AVGYG G DY IV
Sbjct: 267 EKSLQKAVAHQPVSVAIEAGGREFQLYHSGVFSGRCGTQLDHGVVAVGYGTENGKDYWIV 326
Query: 312 KNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLKK 350
+NSWGP WGE GY+RM+RN G CGI M+S P KK
Sbjct: 327 RNSWGPNWGESGYLRMERNINVTSGKCGIAMMSSYPTKK 365
>gi|374530932|gb|AEP83812.2| cysteine endopeptidase EP8 [Secale cereale x Triticum durum]
Length = 364
Score = 345 bits (885), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 170/326 (52%), Positives = 224/326 (68%), Gaps = 8/326 (2%)
Query: 31 VGYSPEHLTSMDKLIELFESWMSKHGKTYKCI--EEKLHRFEIFKENLKHIDQRNKEVTS 88
+ ++ + L S + L L+E W S + + + + + + RF +FKEN ++I + NK+
Sbjct: 23 IPFTEKDLASEENLRGLYERWRSHYTVSRRGLGADAEERRFNVFKENARYIHEGNKKDRP 82
Query: 89 YWLGLNEFADMSHEEFKNKYLGLKPQ----FPTRRQPSAEFSYRDVKALPKSVDWRKKGA 144
+ L LN+FADM+ +EF+ Y G + + R+ F Y D LP +VDWR+KGA
Sbjct: 83 FRLALNKFADMTTDEFRRTYAGSRVRHHLSLSGGRRGDGSFRYGDADNLPPAVDWRQKGA 142
Query: 145 VTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYA 204
VT +K+QG CGSCWAFST+ AVEGIN+I +G L SLSEQEL+DCD N GC+GGLMDYA
Sbjct: 143 VTAIKDQGQCGSCWAFSTIVAVEGINKIRTGKLVSLSEQELMDCDNVNNQGCDGGLMDYA 202
Query: 205 FKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPV 264
F++I + G+ E +YPY E+G+C+ KE+ VTI GY+DVP NDE +L KA+A QPV
Sbjct: 203 FQFIHKN-GITTESNYPYQGEQGSCDLAKEKAHAVTIDGYEDVPANDESALQKAVAGQPV 261
Query: 265 SVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSK-GSDYIIVKNSWGPKWGERG 323
SVAI+ASG DFQFYS GVFTG C +LDHGVAAVGYG ++ G+ Y IVKNSWG WGE+G
Sbjct: 262 SVAIDASGNDFQFYSEGVFTGECSTDLDHGVAAVGYGTTRDGTKYWIVKNSWGEDWGEKG 321
Query: 324 YIRMKRNTGKPEGLCGINKMASIPLK 349
YIRM+R + EG CGI AS P K
Sbjct: 322 YIRMQRGVSQAEGQCGIAMQASYPTK 347
>gi|255568299|ref|XP_002525124.1| cysteine protease, putative [Ricinus communis]
gi|223535583|gb|EEF37251.1| cysteine protease, putative [Ricinus communis]
Length = 342
Score = 345 bits (885), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 177/350 (50%), Positives = 230/350 (65%), Gaps = 13/350 (3%)
Query: 1 MAFFSHSKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYK 60
MA + LL++L L + A + H ++M +E E WM+KHGK YK
Sbjct: 1 MALLCKGQFLLIALFFVLAMWADQASTREL------HESTM---VERHEKWMAKHGKVYK 51
Query: 61 CIEEKLHRFEIFKENLKHIDQRNKE-VTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRR 119
EEKL RF+IFK N++ I+ N SY LG+N FAD+++EEF+ + G K R
Sbjct: 52 DDEEKLRRFQIFKNNVEFIESSNAAGNNSYMLGINRFADLTNEEFRASWNGYKRPLDASR 111
Query: 120 QPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTS 179
+ F Y +V ALP S+DWR+KGAVT +K+Q CGSCWAFS VAA EG++++ +G L S
Sbjct: 112 IVTP-FKYENVTALPYSMDWRRKGAVTSIKDQRECGSCWAFSAVAATEGVHKLRTGKLVS 170
Query: 180 LSEQELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEV 238
LSEQEL+DCD + GC GGLM+ AFK+I +GG+ E +Y Y +G C+ KKE V
Sbjct: 171 LSEQELVDCDVKGEDKGCQGGLMEDAFKFIKRNGGITTEANYAYRGRDGKCDTKKEASHV 230
Query: 239 VTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAV 298
I+GYQ VPEN E +LLKA+AHQPVSV+I+A FQFY G++ G CG++L+HGVAAV
Sbjct: 231 AKITGYQVVPENSEAALLKAVAHQPVSVSIDAGSMSFQFYQSGIYAGSCGSDLNHGVAAV 290
Query: 299 GYG-KSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
GYG S GS Y IVKNSWGP+WGERGY+RMKR+ +GLCGI S P
Sbjct: 291 GYGTSSSGSKYWIVKNSWGPEWGERGYVRMKRDITSRKGLCGIAMDCSYP 340
>gi|242077600|ref|XP_002448736.1| hypothetical protein SORBIDRAFT_06g032320 [Sorghum bicolor]
gi|241939919|gb|EES13064.1| hypothetical protein SORBIDRAFT_06g032320 [Sorghum bicolor]
Length = 467
Score = 345 bits (885), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 170/335 (50%), Positives = 224/335 (66%), Gaps = 14/335 (4%)
Query: 28 FSIVGYSPEHLTSMDKLIE-----LFESWMSKHGK-TYKCIEEKLHRFEIFKENLKHIDQ 81
SI+ Y+ EH + E ++E W+ +HG+ + E RF +F +NL+ +D
Sbjct: 31 MSIISYNEEHGARGLERTEAEVRAMYELWLVEHGRRVSNVLGEHDSRFRVFWDNLRFVDA 90
Query: 82 RNKEVT--SYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSA---EFSYRDVKALPKS 136
N+ + LG+N+FAD++++EF+ YLG + P R +A + + + LP+S
Sbjct: 91 HNERAGEHGFRLGMNQFADLTNDEFRAAYLGAR--IPAARSGNAVGEMYRHDGAEELPES 148
Query: 137 VDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSF-NNG 195
VDWR+KGAV PVKNQG CGSCWAFS V++VE INQIV+G + +LSEQEL++C T N+G
Sbjct: 149 VDWREKGAVAPVKNQGQCGSCWAFSAVSSVESINQIVTGEMVTLSEQELVECSTDGGNSG 208
Query: 196 CNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSL 255
CNGGLMD AF +I+ +GG+ E+DYPY +G C+ + +VV+I ++DVPENDE+SL
Sbjct: 209 CNGGLMDAAFNFIIKNGGIDTEDDYPYKAVDGKCDINRRNAKVVSIDAFEDVPENDEKSL 268
Query: 256 LKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSW 315
KA+AHQPVSVAIEA G FQ Y GVF+G C LDHGV AVGYG G DY IV+NSW
Sbjct: 269 QKAVAHQPVSVAIEAGGRQFQLYKSGVFSGSCTTNLDHGVVAVGYGTENGKDYWIVRNSW 328
Query: 316 GPKWGERGYIRMKRNTGKPEGLCGINKMASIPLKK 350
GPKWGE GYIRM+RN G CGI MAS P KK
Sbjct: 329 GPKWGEAGYIRMERNINATTGKCGIAMMASYPTKK 363
>gi|195637152|gb|ACG38044.1| vignain precursor [Zea mays]
Length = 377
Score = 345 bits (884), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 171/289 (59%), Positives = 205/289 (70%), Gaps = 10/289 (3%)
Query: 69 FEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLK----PQFPTRRQ---P 121
F +FK N++ I + N+ Y L LN F DM+ +EF+ Y G + F RQ
Sbjct: 70 FNVFKANVRLIHEFNRRDEPYKLRLNRFGDMTADEFRRHYAGSRVAHHRMFRGDRQGSSA 129
Query: 122 SAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLS 181
SA F Y D + +P SVDWR+KGAVT VK+QG CGSCWAFST+AAVEGIN I + NLTSLS
Sbjct: 130 SASFMYADARDVPASVDWRQKGAVTDVKDQGQCGSCWAFSTIAAVEGINAIKTKNLTSLS 189
Query: 182 EQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTI 241
EQ+L+DCDT N GCNGGLMDYAF+YI GG+ E+ YPY + +C KK VVTI
Sbjct: 190 EQQLVDCDTKANAGCNGGLMDYAFQYIAKHGGVAAEDAYPYRARQASC--KKSPAPVVTI 247
Query: 242 SGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYG 301
GY+DVP NDE +L KA+AHQPVSVAIEASG+ FQFYS GVF+G CG ELDHGVAAVGYG
Sbjct: 248 DGYEDVPANDESALKKAVAHQPVSVAIEASGSHFQFYSEGVFSGRCGTELDHGVAAVGYG 307
Query: 302 -KSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
+ G+ Y +VKNSWGP+WGE+GYIRM R+ EG CGI AS P+K
Sbjct: 308 VTADGTKYWLVKNSWGPEWGEKGYIRMARDVAAKEGHCGIAMEASYPVK 356
>gi|356577813|ref|XP_003557017.1| PREDICTED: uncharacterized protein LOC100801364 [Glycine max]
Length = 890
Score = 345 bits (884), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 173/340 (50%), Positives = 231/340 (67%), Gaps = 14/340 (4%)
Query: 14 LSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFK 73
+SL++ C + F + S + + + E E WM+++GK YK +E+ RF IFK
Sbjct: 557 ISLAMLLCMAFLA-FQVTCRSLQDAS----MYERHEQWMTRYGKVYKDPQEREKRFRIFK 611
Query: 74 ENLKHIDQRNKEVTS-YWLGLNEFADMSHEEF---KNKYLGLKPQFPTRRQPSAEFSYRD 129
EN+ +I+ N Y L +N+FAD+++EEF +N++ G R + F Y +
Sbjct: 612 ENVNYIEAFNNAANKRYKLAINQFADLTNEEFIAPRNRFKGHMCSSIIR---TTTFKYEN 668
Query: 130 VKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCD 189
V A+P +VDWR+KGAVTP+K+QG CG CWAFS VAA EGI+ + SG L SLSEQEL+DCD
Sbjct: 669 VTAVPSTVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALTSGKLISLSEQELVDCD 728
Query: 190 TS-FNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVP 248
T + GC GGLMD AFK+++ + GL+ E +YPY +G C + +VVTI+GY+DVP
Sbjct: 729 TKGVDQGCEGGLMDDAFKFVIQNHGLNTEANYPYKGVDGKCNANEAANDVVTITGYEDVP 788
Query: 249 ENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSK-GSD 307
N+E++L KA+A+QPVSVAI+ASG+DFQFY GVFTG CG ELDHGV AVGYG S G++
Sbjct: 789 ANNEKALQKAVANQPVSVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVSNDGTE 848
Query: 308 YIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
Y +VKNSWG +WGE GYIRM+R EGLCGI AS P
Sbjct: 849 YWLVKNSWGTEWGEEGYIRMQRGVDSEEGLCGIAMQASYP 888
>gi|356545063|ref|XP_003540965.1| PREDICTED: thiol protease SEN102-like [Glycine max]
Length = 361
Score = 344 bits (883), Expect = 3e-92, Method: Compositional matrix adjust.
Identities = 173/340 (50%), Positives = 231/340 (67%), Gaps = 14/340 (4%)
Query: 14 LSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFK 73
+SL++ C + F + S + + + E E WM+++GK YK +E+ RF IFK
Sbjct: 28 ISLAMLLCMAFLA-FQVTCRSLQDAS----MYERHEQWMTRYGKVYKDPQEREKRFRIFK 82
Query: 74 ENLKHIDQRNKEVTS-YWLGLNEFADMSHEEF---KNKYLGLKPQFPTRRQPSAEFSYRD 129
EN+ +I+ N Y L +N+FAD+++EEF +N++ G R + F Y +
Sbjct: 83 ENVNYIEAFNNAANKRYKLAINQFADLTNEEFIAPRNRFKGHMCSSIIR---TTTFKYEN 139
Query: 130 VKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCD 189
V A+P +VDWR+KGAVTP+K+QG CG CWAFS VAA EGI+ + SG L SLSEQEL+DCD
Sbjct: 140 VTAVPSTVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALTSGKLISLSEQELVDCD 199
Query: 190 T-SFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVP 248
T + GC GGLMD AFK+++ + GL+ E +YPY +G C + +VVTI+GY+DVP
Sbjct: 200 TKGVDQGCEGGLMDDAFKFVIQNHGLNTEANYPYKGVDGKCNANEAANDVVTITGYEDVP 259
Query: 249 ENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSK-GSD 307
N+E++L KA+A+QPVSVAI+ASG+DFQFY GVFTG CG ELDHGV AVGYG S G++
Sbjct: 260 ANNEKALQKAVANQPVSVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVSNDGTE 319
Query: 308 YIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
Y +VKNSWG +WGE GYIRM+R EGLCGI AS P
Sbjct: 320 YWLVKNSWGTEWGEEGYIRMQRGVDSEEGLCGIAMQASYP 359
>gi|357477459|ref|XP_003609015.1| Cysteine proteinase [Medicago truncatula]
gi|355510070|gb|AES91212.1| Cysteine proteinase [Medicago truncatula]
Length = 345
Score = 344 bits (882), Expect = 4e-92, Method: Compositional matrix adjust.
Identities = 173/346 (50%), Positives = 224/346 (64%), Gaps = 17/346 (4%)
Query: 10 LLLSLSLSLFACSSLAHDFSIVGYSPEHLTS----MDKLIELFESWMSKHGKTYKCIEEK 65
L S++L+ C +G +TS +D + E E WMS++ K YK +E+
Sbjct: 7 LYYSIALTFIFC---------LGLCAIQVTSRSLQVDSMYERHEQWMSQYSKVYKDPQER 57
Query: 66 LHRFEIFKENLKHIDQRNKEVTS--YWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSA 123
R +IF N+ +I+ N + + Y LG+N+FAD+++EEF K + +
Sbjct: 58 EERHKIFTANVNYIEVFNNDANNKLYKLGINQFADLTNEEFIASRNKFKGHMCSSIAKTT 117
Query: 124 EFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQ 183
F Y +V A+P +VDWRKKGAVTPVKNQG CG CWAFS VAA EGI ++ +G L SLSEQ
Sbjct: 118 TFKYENVSAIPSTVDWRKKGAVTPVKNQGQCGCCWAFSAVAATEGITKLSTGKLVSLSEQ 177
Query: 184 ELIDCDT-SFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTIS 242
EL+DCDT + GC GGLMD AFK+I+ + GL E YPY +GTC K + TI+
Sbjct: 178 ELVDCDTKGVDQGCEGGLMDDAFKFIIQNHGLSTEAAYPYQGVDGTCNANKASIHAATIT 237
Query: 243 GYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYG- 301
GY+DVP N+EQ+L KA+A+QP+SVAI+ASG+DFQFY GVF+G CG ELDHGV AVGYG
Sbjct: 238 GYEDVPANNEQALQKAVANQPISVAIDASGSDFQFYKSGVFSGSCGTELDHGVTAVGYGV 297
Query: 302 KSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
+ G+ Y +VKNSWG WGE GYIRM+R EGLCGI AS P
Sbjct: 298 GNDGTKYWLVKNSWGTDWGEEGYIRMQRGVDAAEGLCGIAMQASYP 343
>gi|224135841|ref|XP_002327317.1| predicted protein [Populus trichocarpa]
gi|222835687|gb|EEE74122.1| predicted protein [Populus trichocarpa]
Length = 342
Score = 344 bits (882), Expect = 4e-92, Method: Compositional matrix adjust.
Identities = 166/302 (54%), Positives = 210/302 (69%), Gaps = 3/302 (0%)
Query: 49 ESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTS-YWLGLNEFADMSHEEFKNK 107
E WM GK Y EK RFEIFK+N+++I+ N Y L +N+FAD+++EE K
Sbjct: 39 EQWMETFGKVYADAAEKERRFEIFKDNVEYIESFNTAGNKPYKLSVNKFADLTNEELKVA 98
Query: 108 YLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVE 167
G + TR F Y +V A+P ++DWRKKGAVTP+K+QG CGSCWAFSTVAA E
Sbjct: 99 RNGYRRPLQTRPMKVTSFKYENVTAVPATMDWRKKGAVTPIKDQGQCGSCWAFSTVAATE 158
Query: 168 GINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEE 226
GINQ+ +G L SLSEQEL+DCDT + GC GGLM+ F++I+ + G+ E +YPY +
Sbjct: 159 GINQLTTGKLVSLSEQELVDCDTQGEDQGCEGGLMEDGFEFIIKNHGITTEANYPYQAAD 218
Query: 227 GTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGP 286
GTC KKE + I+GY+ VP N E +LLKA+A QP+SV+I+A G+DFQFYS GVFTG
Sbjct: 219 GTCNSKKEASRIAKITGYESVPANSEAALLKAVASQPISVSIDAGGSDFQFYSSGVFTGQ 278
Query: 287 CGAELDHGVAAVGYGK-SKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMAS 345
CG ELDHGV AVGYG+ S G+ Y +VKNSWG WGE GYIRM+R+T EGLCGI +S
Sbjct: 279 CGTELDHGVTAVGYGETSDGTKYWLVKNSWGTSWGEEGYIRMQRDTEAEEGLCGIAMDSS 338
Query: 346 IP 347
P
Sbjct: 339 YP 340
>gi|449469929|ref|XP_004152671.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
gi|449529596|ref|XP_004171784.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
Length = 431
Score = 343 bits (881), Expect = 5e-92, Method: Compositional matrix adjust.
Identities = 164/310 (52%), Positives = 211/310 (68%), Gaps = 6/310 (1%)
Query: 39 TSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNK-EVTSYWLGLNEFA 97
++ + ELFE W ++HGK+Y EEKL+R +F +N + + N + +SY L LN +A
Sbjct: 20 SATSNVSELFEIWCTEHGKSYSSAEEKLYRLGVFADNYEFVTHHNNLDNSSYTLSLNSYA 79
Query: 98 DMSHEEFKNKYLGLKPQFPTRRQ--PSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCG 155
D++H EFK LG P R P RDV P S+DWRKKGAVT VK+QGSCG
Sbjct: 80 DLTHHEFKVSRLGFSPALRNFRPVLPQEPSLPRDV---PDSLDWRKKGAVTAVKDQGSCG 136
Query: 156 SCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLH 215
+CW+FS A+EGINQI++G+L SLSEQELIDCD S+N+GC GGLMDYA+++++++ G+
Sbjct: 137 ACWSFSATGAMEGINQIMTGSLISLSEQELIDCDRSYNSGCGGGLMDYAYQFVISNHGID 196
Query: 216 KEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDF 275
E DYPY +G+C K + VVTI GY D+P NDE LL+A+A QPVSV I S F
Sbjct: 197 TENDYPYQARDGSCRKDKLQRNVVTIDGYADIPSNDEGKLLQAVAAQPVSVGICGSERAF 256
Query: 276 QFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPE 335
Q YS G+F+GPC LDH V VGYG G DY IVKNSWG WG GY+ M+RN+G E
Sbjct: 257 QLYSKGIFSGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKSWGMDGYMHMQRNSGNSE 316
Query: 336 GLCGINKMAS 345
G+CGINK+AS
Sbjct: 317 GVCGINKLAS 326
>gi|160858205|dbj|BAF93840.1| triticain beta 2 [Triticum aestivum]
Length = 469
Score = 343 bits (881), Expect = 5e-92, Method: Compositional matrix adjust.
Identities = 168/342 (49%), Positives = 230/342 (67%), Gaps = 21/342 (6%)
Query: 27 DFSIVGYSPEHLTSMDKLIE-----LFESWMSKHG----KTYKCIEEKLHRFEIFKENLK 77
D SI+ Y+ EH + E +++ W+++HG I E+ RF F +NL+
Sbjct: 24 DMSIIAYNAEHGARGLERTEAEARAVYDLWLAEHGGGSYPNANSIPERERRFRAFWDNLR 83
Query: 78 HIDQRNKEVTS----YWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSA----EFSYRD 129
+D N + + L +N FAD++++EF+ YLG+K Q R +P + +
Sbjct: 84 FVDAHNARAAAGEEGFRLAMNRFADLTNDEFRAAYLGVKGQ---RARPGRVVGERYRHDG 140
Query: 130 VKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCD 189
+ LP++VDWR+KGAV PVKNQG CGSCWAFS ++ VE INQIV+G + +LSEQEL++CD
Sbjct: 141 AEELPEAVDWREKGAVAPVKNQGQCGSCWAFSAISTVESINQIVTGEMVTLSEQELVECD 200
Query: 190 TSF-NNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVP 248
T+ ++GCNGGLMD AF++I+ +GG+ E+DYPY +G C+ ++ +VV+I G++DVP
Sbjct: 201 TNGQSSGCNGGLMDDAFEFIIKNGGIDTEDDYPYKAIDGRCDVLRKNAKVVSIDGFEDVP 260
Query: 249 ENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDY 308
ENDE+SL KA+AHQPVSVAIEA G +FQ Y GVF+G CG +LDHGV AVGYG G DY
Sbjct: 261 ENDEKSLQKAVAHQPVSVAIEAGGREFQLYHSGVFSGRCGTQLDHGVVAVGYGTENGKDY 320
Query: 309 IIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLKK 350
IV+NSWGP WGE GY+RM+RN G CGI M+S P KK
Sbjct: 321 WIVRNSWGPNWGEAGYLRMERNINVTSGKCGIAMMSSYPTKK 362
>gi|356517348|ref|XP_003527349.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 343 bits (881), Expect = 5e-92, Method: Compositional matrix adjust.
Identities = 176/348 (50%), Positives = 231/348 (66%), Gaps = 16/348 (4%)
Query: 7 SKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMD-KLIELFESWMSKHGKTYKCIEEK 65
+K +SL+L CS + + T D + E E WM ++ K YK +E+
Sbjct: 3 AKNQFYQISLALLFCSGF------LAFQVTCRTLQDASMYERHEEWMGRYAKVYKDPQER 56
Query: 66 LHRFEIFKENLKHIDQRNKEVTS-YWLGLNEFADMSHEEF---KNKYLGLKPQFPTRRQP 121
RF+IFKEN+ +I+ N Y LG+N+FAD+++EEF +N++ G TR
Sbjct: 57 ERRFKIFKENVNYIEAFNNAANKPYTLGINQFADLTNEEFIAPRNRFKGHMCSSITR--- 113
Query: 122 SAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLS 181
+ F Y +V A+P +VDWR+KGAVTP+K+QG CG CWAFS VAA EGI+ + +G L SLS
Sbjct: 114 TTTFKYENVTAIPSTVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALSAGKLISLS 173
Query: 182 EQELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVT 240
EQE++DCDT + GC GG MD AFK+I+ + GL+ E +YPY +G C K V T
Sbjct: 174 EQEVVDCDTKGEDQGCAGGFMDGAFKFIIQNHGLNNEPNYPYKAVDGKCNAKAAANHVAT 233
Query: 241 ISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGY 300
I+GY+DVP N+E++L KA+A+QPVSVAI+ASG+DFQFY GVFTG CG ELDHGV AVGY
Sbjct: 234 ITGYEDVPVNNEKALQKAVANQPVSVAIDASGSDFQFYQSGVFTGSCGTELDHGVTAVGY 293
Query: 301 GKSK-GSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
G S G++Y +VKNSWG +WGE GYIRM+R EGLCGI MAS P
Sbjct: 294 GVSADGTEYWLVKNSWGTEWGEEGYIRMQRGVKAEEGLCGIAMMASYP 341
>gi|356577763|ref|XP_003556992.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 343 bits (881), Expect = 5e-92, Method: Compositional matrix adjust.
Identities = 175/347 (50%), Positives = 232/347 (66%), Gaps = 14/347 (4%)
Query: 7 SKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKL 66
+K +SL+L CS F + + + + + E E WM ++ K YK +E+
Sbjct: 3 AKNQFYQISLALLFCSGFL-TFQVTCRTLQDAS----MYERHEEWMGRYAKVYKDPQERE 57
Query: 67 HRFEIFKENLKHIDQRNKEVTS-YWLGLNEFADMSHEEF---KNKYLGLKPQFPTRRQPS 122
RF+IFKEN+ +I+ N Y LG+N+FAD+++EEF +N++ G TR +
Sbjct: 58 RRFKIFKENVNYIEAFNNAANKPYTLGINQFADLTNEEFIAPRNRFKGHMCSSITR---T 114
Query: 123 AEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSE 182
F Y +V A+P +VDWR+KGAVTP+K+QG CG CWAFS VAA EGI+ + +G L SLSE
Sbjct: 115 TTFKYENVTAIPSTVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALSAGKLISLSE 174
Query: 183 QELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTI 241
QE++DCDT + GC GG MD AFK+I+ + GL+ E +YPY +G C K V TI
Sbjct: 175 QEVVDCDTKGEDQGCAGGFMDGAFKFIIQNHGLNNEPNYPYKAVDGKCNAKAAANHVATI 234
Query: 242 SGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYG 301
+GY+DVP N+E++L KA+A+QPVSVAI+ASG+DFQFY GVFTG CG ELDHGV AVGYG
Sbjct: 235 TGYEDVPVNNEKALQKAVANQPVSVAIDASGSDFQFYQSGVFTGSCGTELDHGVTAVGYG 294
Query: 302 KSK-GSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
S G++Y +VKNSWG +WGE GYIRM+R EGLCGI MAS P
Sbjct: 295 VSADGTEYWLVKNSWGTEWGEEGYIRMQRGVKAEEGLCGIAMMASYP 341
>gi|18391078|ref|NP_563855.1| xylem bark cysteine peptidase 3 [Arabidopsis thaliana]
gi|110741821|dbj|BAE98853.1| papain-like cysteine peptidase XBCP3 [Arabidopsis thaliana]
gi|111074448|gb|ABH04597.1| At1g09850 [Arabidopsis thaliana]
gi|332190386|gb|AEE28507.1| xylem bark cysteine peptidase 3 [Arabidopsis thaliana]
Length = 437
Score = 343 bits (881), Expect = 6e-92, Method: Compositional matrix adjust.
Identities = 166/309 (53%), Positives = 209/309 (67%), Gaps = 1/309 (0%)
Query: 42 DKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEV-TSYWLGLNEFADMS 100
D + ELF+ W KHGKTY EE+ R +IFK+N + Q N +Y L LN FAD++
Sbjct: 26 DDISELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADLT 85
Query: 101 HEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAF 160
H EFK LGL P+ S S +P SVDWRKKGAVT VK+QGSCG+CW+F
Sbjct: 86 HHEFKASRLGLSVSAPSVIMASKGQSLGGSVKVPDSVDWRKKGAVTNVKDQGSCGACWSF 145
Query: 161 STVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDY 220
S A+EGINQIV+G+L SLSEQELIDCD S+N GCNGGLMDYAF++++ + G+ E+DY
Sbjct: 146 SATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNHGIDTEKDY 205
Query: 221 PYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSG 280
PY +GTC+ K + +VVTI Y V NDE++L++A+A QPVSV I S FQ YS
Sbjct: 206 PYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALMEAVAAQPVSVGICGSERAFQLYSS 265
Query: 281 GVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGI 340
G+F+GPC LDH V VGYG G DY IVKNSWG WG G++ M+RNT +G+CGI
Sbjct: 266 GIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKSWGMDGFMHMQRNTENSDGVCGI 325
Query: 341 NKMASIPLK 349
N +AS P+K
Sbjct: 326 NMLASYPIK 334
>gi|2224808|emb|CAB09697.1| cysteine endopeptidase EP-A [Hordeum vulgare subsp. vulgare]
gi|326502180|dbj|BAK06781.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 365
Score = 343 bits (880), Expect = 7e-92, Method: Compositional matrix adjust.
Identities = 171/326 (52%), Positives = 222/326 (68%), Gaps = 8/326 (2%)
Query: 31 VGYSPEHLTSMDKLIELFESWMSKHGKTYKCI--EEKLHRFEIFKENLKHIDQRNKEVTS 88
V ++ + L S + L L+E W S + + + + + + RF +FKEN +++ + NK
Sbjct: 24 VPFTEKDLASEESLRGLYERWRSHYTVSRRGLGADAEERRFNVFKENARYVHEGNKRDRP 83
Query: 89 YWLGLNEFADMSHEEFKNKYLGLKPQ----FPTRRQPSAEFSYRDVKALPKSVDWRKKGA 144
+ L LN+FADM+ +EF+ Y G + + R+ F Y D LP +VDWR+KGA
Sbjct: 84 FRLALNKFADMTTDEFRRTYAGSRVRHHLSLSGGRRGDGGFRYADADNLPPAVDWRQKGA 143
Query: 145 VTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYA 204
VT +K+QG CGSCWAFST+ AVEGIN+I +G L SLSEQEL+DCD N GC GGLMDYA
Sbjct: 144 VTAIKDQGQCGSCWAFSTIVAVEGINKIRTGKLVSLSEQELMDCDNVNNQGCEGGLMDYA 203
Query: 205 FKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPV 264
F++I G+ E +YPY E+G+C+ KE + VTI GY+DVP NDE +L KA+A QPV
Sbjct: 204 FQFI-QKNGITTESNYPYQGEQGSCDQAKENAQAVTIDGYEDVPANDESALQKAVAGQPV 262
Query: 265 SVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSK-GSDYIIVKNSWGPKWGERG 323
SVAI+ASG DFQFYS GVFTG C +LDHGVAAVGYG ++ G+ Y IVKNSWG WGE+G
Sbjct: 263 SVAIDASGQDFQFYSEGVFTGECSTDLDHGVAAVGYGATRDGTKYWIVKNSWGEDWGEKG 322
Query: 324 YIRMKRNTGKPEGLCGINKMASIPLK 349
YIRM+R + EGLCGI AS P K
Sbjct: 323 YIRMQRGVSQTEGLCGIAMQASYPTK 348
>gi|224085750|ref|XP_002307688.1| predicted protein [Populus trichocarpa]
gi|222857137|gb|EEE94684.1| predicted protein [Populus trichocarpa]
Length = 436
Score = 343 bits (880), Expect = 8e-92, Method: Compositional matrix adjust.
Identities = 162/305 (53%), Positives = 211/305 (69%), Gaps = 2/305 (0%)
Query: 46 ELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRN-KEVTSYWLGLNEFADMSHEEF 104
+LFE+W +HGK+Y EE+ HR ++F++N + + N K +SY L LN FAD++H EF
Sbjct: 27 QLFETWCKEHGKSYTSQEERSHRLKVFEDNYDFVTKHNSKGNSSYSLALNAFADLTHHEF 86
Query: 105 KNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVA 164
K LGL + E + V +P S+DWR KG VT VK+QGSCG+CW+FS
Sbjct: 87 KTSRLGLSAAPLNLAHRNLEIT-GVVGDIPASIDWRNKGVVTNVKDQGSCGACWSFSATG 145
Query: 165 AVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLM 224
A+EGIN+IV+G+L SLSEQELI+CD S+N+GC GGLMDYAF++++ + G+ EEDYPY
Sbjct: 146 AIEGINKIVTGSLVSLSEQELIECDKSYNDGCGGGLMDYAFQFVINNHGIDTEEDYPYRA 205
Query: 225 EEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFT 284
+GTC + + VVTI Y DVPEN+E+ LL+A+A QPVSV I S FQ YS G+FT
Sbjct: 206 RDGTCNKDRMKRRVVTIDKYVDVPENNEKQLLQAVAAQPVSVGICGSERAFQMYSKGIFT 265
Query: 285 GPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMA 344
GPC LDH V VGYG G DY IVKNSWG WG RGY+ M+RN+G +G+CGIN +A
Sbjct: 266 GPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGTGWGMRGYMHMQRNSGNSQGVCGINMLA 325
Query: 345 SIPLK 349
S P+K
Sbjct: 326 SYPVK 330
>gi|40806498|gb|AAR92154.1| putative cysteine protease 1 [Iris x hollandica]
Length = 340
Score = 343 bits (879), Expect = 9e-92, Method: Compositional matrix adjust.
Identities = 174/349 (49%), Positives = 230/349 (65%), Gaps = 13/349 (3%)
Query: 1 MAFFSHSKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYK 60
MA F KLL +L+L + A ++ G + L ++E E WM++HG+ YK
Sbjct: 1 MAAFKTVKLLP-ALALLIVAI------WASQGEAGRSLGENKSMLERHEQWMAQHGRVYK 53
Query: 61 CIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQ 120
EK HRFEIF+ N++ I+ N E + LG+N+FAD+++EEFK + LKP ++
Sbjct: 54 NAAEKAHRFEIFRANVERIESFNAENHKFKLGVNQFADLTNEEFKTRNT-LKP---SKMA 109
Query: 121 PSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSL 180
+ F Y +V A+P ++DWR KGAVTP+K+QG CGSCWAFS VAA EGI ++ +G L SL
Sbjct: 110 STKSFKYENVTAVPATMDWRTKGAVTPIKDQGQCGSCWAFSAVAATEGITKLSTGKLISL 169
Query: 181 SEQELIDCD-TSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVV 239
SEQE++DCD TS + GCNGG MD AF+YI+ + G+ E +YPY +GTC KK
Sbjct: 170 SEQEVVDCDVTSDDQGCNGGEMDDAFEYIIKNKGITTEANYPYKAADGTCNTKKAASHAA 229
Query: 240 TISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVG 299
+I+GY+DV N E +LLKA A+QP++VAI+A FQ YS GVFTG CG +LDHGV VG
Sbjct: 230 SITGYEDVTVNSEAALLKAAANQPIAVAIDAGDFAFQMYSSGVFTGDCGTDLDHGVTLVG 289
Query: 300 YG-KSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
YG S G+ Y +VKNSWG WGE GYIRM+R+ EGLCGI AS P
Sbjct: 290 YGATSDGTKYWLVKNSWGTSWGEDGYIRMERDVDAKEGLCGIAMDASYP 338
>gi|359359168|gb|AEV41073.1| putative cysteine protease [Oryza minuta]
Length = 499
Score = 343 bits (879), Expect = 9e-92, Method: Compositional matrix adjust.
Identities = 178/342 (52%), Positives = 225/342 (65%), Gaps = 20/342 (5%)
Query: 28 FSIVGYSPEHLTSMDKLIE--------LFESWMSKH----GKTYKCIEEKLHRFEIFKEN 75
SI+ Y+ EH +++E +++ W+++H G + E RF +F +N
Sbjct: 37 MSIIRYNAEHGVRGLEVVERTEAEARAVYDLWVARHRHGGGSHNGLVGEYERRFRVFWDN 96
Query: 76 LKHIDQRNK---EVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKA 132
LK +D N E + LG+N FAD++++EF+ YLG P R A + + V+A
Sbjct: 97 LKFVDAHNARADEHGGFRLGMNRFADLTNDEFRAAYLGTTPAGRGRHVGEA-YRHDGVEA 155
Query: 133 LPKSVDWRKKGAVT-PVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDC-DT 190
LP SVDWR KGAV PVKNQG CGSCWAFS VAAVEGIN+IV+G L SLSEQEL++C
Sbjct: 156 LPDSVDWRDKGAVVAPVKNQGQCGSCWAFSAVAAVEGINKIVTGELVSLSEQELVECARN 215
Query: 191 SFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPEN 250
N+GCNGG+MD AF +I +GGL EEDYPY +G C K+ +VV+I G++DVPEN
Sbjct: 216 GANSGCNGGMMDDAFAFIARNGGLDTEEDYPYTAMDGKCNLAKKSRKVVSIDGFEDVPEN 275
Query: 251 DEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYG--KSKGSDY 308
DE SL KA+AHQPVSVAI+A G +FQ Y GVFTG CG LDHGV AVGYG + G+DY
Sbjct: 276 DELSLQKAVAHQPVSVAIDAGGREFQLYDSGVFTGRCGTSLDHGVVAVGYGTDAATGTDY 335
Query: 309 IIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLKK 350
V+NSWGP WGE GYIRM+RN G CGI MAS P+KK
Sbjct: 336 WTVRNSWGPDWGENGYIRMERNVTARTGKCGIAMMASYPIKK 377
>gi|14600257|gb|AAK71314.1|AF388175_1 papain-like cysteine peptidase XBCP3 [Arabidopsis thaliana]
Length = 437
Score = 343 bits (879), Expect = 9e-92, Method: Compositional matrix adjust.
Identities = 166/309 (53%), Positives = 209/309 (67%), Gaps = 1/309 (0%)
Query: 42 DKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT-SYWLGLNEFADMS 100
D + ELF+ W KHGKTY EE+ R +IFK+N + Q N +Y L LN FAD++
Sbjct: 26 DDISELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADLT 85
Query: 101 HEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAF 160
H EFK LGL P+ S S +P SVDWRKKGAVT VK+QGSCG+CW+F
Sbjct: 86 HHEFKASRLGLSVSAPSVIMASKGQSLGGSVKVPDSVDWRKKGAVTNVKDQGSCGACWSF 145
Query: 161 STVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDY 220
S A+EGINQIV+G+L SLSEQELIDCD S+N GCNGGLMDYAF++++ + G+ E+DY
Sbjct: 146 SATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNHGIDTEKDY 205
Query: 221 PYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSG 280
PY +GTC+ K + +VVTI Y V NDE++L++A+A QPVSV I S FQ YS
Sbjct: 206 PYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALMEAVAAQPVSVGICGSERAFQLYSR 265
Query: 281 GVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGI 340
G+F+GPC LDH V VGYG G DY IVKNSWG WG G++ M+RNT +G+CGI
Sbjct: 266 GIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKSWGMDGFMHMQRNTENSDGVCGI 325
Query: 341 NKMASIPLK 349
N +AS P+K
Sbjct: 326 NMLASYPIK 334
>gi|537437|gb|AAC35211.1| cysteine proteinase [Hemerocallis hybrid cultivar]
Length = 359
Score = 343 bits (879), Expect = 9e-92, Method: Compositional matrix adjust.
Identities = 189/354 (53%), Positives = 238/354 (67%), Gaps = 15/354 (4%)
Query: 1 MAFFSHSKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYK 60
MA S++ LLS+ L L + + LA + + + L S + L L+E W + H + +
Sbjct: 1 MAKLSYA---LLSVVLVLGSVA-LAQS---IPFDEKDLASEESLWSLYEKWRAHHAVS-R 52
Query: 61 CIEEKLHRFEIFKENLKHIDQRN-KEVTSYWLGLNEFADMSHEEFKNKYLGLK-PQFPTR 118
+++ RF +FKEN+K I + N K+ +Y L LN+F DM+++EF++ Y G K T
Sbjct: 53 DLDDTDKRFNVFKENVKFIHEFNQKKDATYKLALNKFGDMTNQEFRSTYAGSKIDHHMTL 112
Query: 119 R--QPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGN 176
R + + EFSY LP SVDWR+KGAVT VK+QG CGSCWAFSTV AVEGINQI +
Sbjct: 113 RGVKDAGEFSYEKFHDLPTSVDWREKGAVTGVKDQGQCGSCWAFSTVVAVEGINQIKTNE 172
Query: 177 LTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEM 236
L SLSEQ+L+DCDT N+GCNGGLMDYAF +I +GGL E+ YPYL E+ +C +
Sbjct: 173 LVSLSEQQLVDCDTK-NSGCNGGLMDYAFDFIKNNGGLSSEDSYPYLAEQKSCGSEANSA 231
Query: 237 EVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVA 296
VVTI GYQDVP N+E +L+KA+A+QPVSVAIEASG FQFYS GVF+G CG ELDHGVA
Sbjct: 232 -VVTIDGYQDVPRNNEAALMKAVANQPVSVAIEASGYAFQFYSQGVFSGHCGTELDHGVA 290
Query: 297 AVGYG-KSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
AVGYG G Y IVKNSWG WGE GYIRM+R G CGI AS P+K
Sbjct: 291 AVGYGVDDDGKKYWIVKNSWGEGWGESGYIRMERGIKDKRGKCGIAMEASYPIK 344
>gi|359473128|ref|XP_002285397.2| PREDICTED: vignain-like [Vitis vinifera]
Length = 357
Score = 343 bits (879), Expect = 1e-91, Method: Compositional matrix adjust.
Identities = 179/347 (51%), Positives = 232/347 (66%), Gaps = 11/347 (3%)
Query: 8 KLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLH 67
K++L++LSL L + + DF + L S + L +L+E W S H + +EEK
Sbjct: 3 KVILVALSLVLVFGLAESFDFD-----EKDLASEESLWDLYERWRSYH-TVSRDLEEKNK 56
Query: 68 RFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQ----FPTRRQPSA 123
RF +FKEN KH+ + N+ Y L LN+FADM++ EF++ Y G K + R+ +
Sbjct: 57 RFNVFKENTKHVHKVNQMDKPYKLKLNKFADMTNHEFRSSYGGSKVKHYRMLRGDRRGTG 116
Query: 124 EFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQ 183
F + LP SVDWRKKGAVT +K+QG CGSCWAFSTV VEGINQI + L SLSEQ
Sbjct: 117 GFMHEKTTYLPPSVDWRKKGAVTGIKDQGKCGSCWAFSTVVGVEGINQIKTKELLSLSEQ 176
Query: 184 ELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISG 243
+LIDCD S ++GCNGGLM+ AF++I +GG+ E +YPY ++ C+ K VVTI G
Sbjct: 177 QLIDCDRSDDHGCNGGLMESAFEFIKKNGGITTENNYPYKAKDERCDMLKMNAPVVTIDG 236
Query: 244 YQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKS 303
++ VP NDE++L+KA+AHQPVSVAI+A G+D QFYS GVF G CG ELDHGVA VGYG +
Sbjct: 237 HESVPVNDERALMKAVAHQPVSVAIDAGGSDLQFYSEGVFDGECGTELDHGVAIVGYGTT 296
Query: 304 -KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
G+ Y IVKNSWG +WGE+GYIRM R EG CGI AS P+K
Sbjct: 297 LDGTKYWIVKNSWGAEWGEKGYIRMARGIQAAEGQCGIAMEASYPVK 343
>gi|296081395|emb|CBI16828.3| unnamed protein product [Vitis vinifera]
Length = 359
Score = 342 bits (878), Expect = 1e-91, Method: Compositional matrix adjust.
Identities = 179/347 (51%), Positives = 232/347 (66%), Gaps = 11/347 (3%)
Query: 8 KLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLH 67
K++L++LSL L + + DF + L S + L +L+E W S H + +EEK
Sbjct: 5 KVILVALSLVLVFGLAESFDFD-----EKDLASEESLWDLYERWRSYH-TVSRDLEEKNK 58
Query: 68 RFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQ----FPTRRQPSA 123
RF +FKEN KH+ + N+ Y L LN+FADM++ EF++ Y G K + R+ +
Sbjct: 59 RFNVFKENTKHVHKVNQMDKPYKLKLNKFADMTNHEFRSSYGGSKVKHYRMLRGDRRGTG 118
Query: 124 EFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQ 183
F + LP SVDWRKKGAVT +K+QG CGSCWAFSTV VEGINQI + L SLSEQ
Sbjct: 119 GFMHEKTTYLPPSVDWRKKGAVTGIKDQGKCGSCWAFSTVVGVEGINQIKTKELLSLSEQ 178
Query: 184 ELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISG 243
+LIDCD S ++GCNGGLM+ AF++I +GG+ E +YPY ++ C+ K VVTI G
Sbjct: 179 QLIDCDRSDDHGCNGGLMESAFEFIKKNGGITTENNYPYKAKDERCDMLKMNAPVVTIDG 238
Query: 244 YQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKS 303
++ VP NDE++L+KA+AHQPVSVAI+A G+D QFYS GVF G CG ELDHGVA VGYG +
Sbjct: 239 HESVPVNDERALMKAVAHQPVSVAIDAGGSDLQFYSEGVFDGECGTELDHGVAIVGYGTT 298
Query: 304 -KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
G+ Y IVKNSWG +WGE+GYIRM R EG CGI AS P+K
Sbjct: 299 LDGTKYWIVKNSWGAEWGEKGYIRMARGIQAAEGQCGIAMEASYPVK 345
>gi|356517350|ref|XP_003527350.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
gi|356577765|ref|XP_003556993.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 343
Score = 342 bits (878), Expect = 1e-91, Method: Compositional matrix adjust.
Identities = 173/348 (49%), Positives = 233/348 (66%), Gaps = 16/348 (4%)
Query: 7 SKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMD-KLIELFESWMSKHGKTYKCIEEK 65
+K+ +SL+LF C + + T D + E E WM+++GK YK EEK
Sbjct: 3 TKIQFHHISLALFFC------LGFLAFQVASRTLQDASMYERHEQWMARYGKVYKDPEEK 56
Query: 66 LHRFEIFKENLKHIDQRNKEVTS-YWLGLNEFADMSHEEF---KNKYLGLKPQFPTRRQP 121
RF +FKEN+ +I+ N Y LG+N+FAD++ EEF +N++ G +
Sbjct: 57 EKRFRVFKENVNYIEAFNNAANKPYKLGINQFADLTSEEFIVPRNRFNG---HTRSSNTR 113
Query: 122 SAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLS 181
+ F Y +V LP S+DWR+KGAVTP+KNQGSCG CWAFS +AA EGI++I +G L SLS
Sbjct: 114 TTTFKYENVTVLPDSIDWRQKGAVTPIKNQGSCGCCWAFSAIAATEGIHKISTGKLVSLS 173
Query: 182 EQELIDCDT-SFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVT 240
EQE++DCDT ++GC GG MD AFK+I+ + G++ E YPY +G C K+E + T
Sbjct: 174 EQEVVDCDTKGTDHGCEGGYMDGAFKFIIQNHGINTEASYPYKGVDGKCNIKEEAVHAAT 233
Query: 241 ISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGY 300
I+GY+DVP N+E++L KA+A+QPVSVAI+ASG DFQFY G+FTG CG ELDHGV AVGY
Sbjct: 234 ITGYEDVPINNEKALQKAVANQPVSVAIDASGADFQFYKSGIFTGSCGTELDHGVTAVGY 293
Query: 301 GK-SKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
G+ ++G+ Y +VKNSWG +WGE GYI M+R EG+CGI MAS P
Sbjct: 294 GENNEGTKYWLVKNSWGTEWGEEGYIMMQRGVKAVEGICGIAMMASYP 341
>gi|37780045|gb|AAP32195.1| cysteine protease 5 [Trifolium repens]
Length = 343
Score = 342 bits (878), Expect = 1e-91, Method: Compositional matrix adjust.
Identities = 177/344 (51%), Positives = 228/344 (66%), Gaps = 16/344 (4%)
Query: 11 LLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFE 70
L +SL+LF C L + + L D + E E WM+ +GK YK +E+ R
Sbjct: 7 LYHVSLALFFCLGLL----AIQVTSRTLQD-DSIFERHEQWMTHYGKVYKNPQEREKRLR 61
Query: 71 IFKENLKHIDQRNKEVTS--YWLGLNEFADMSHEEF---KNKYLGLKPQFPTRRQPSAEF 125
IF ENLK+I+ N + Y LG+N+FAD+++EEF +NK+ G R + F
Sbjct: 62 IFTENLKYIEASNNAGNNKPYKLGINQFADLTNEEFIASRNKFKGHMCSSIIR---TTTF 118
Query: 126 SYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQEL 185
Y + ++P +VDWRKKGAVTPVKNQG CG CWAFS +AA EGI++I +G L SLSEQEL
Sbjct: 119 KYENT-SVPSTVDWRKKGAVTPVKNQGQCGCCWAFSAIAATEGIHKISTGKLVSLSEQEL 177
Query: 186 IDCDTS-FNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGY 244
+DCDT+ + GC GGLMD AFK+I+ + G+ E YPY +GTC+ + TI+GY
Sbjct: 178 VDCDTNGVDQGCEGGLMDDAFKFIIQNNGISTEAGYPYQGVDGTCKANEASTSAATITGY 237
Query: 245 QDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSK 304
+DVP N+E +L KA+A+QP+SVAI+ASG+DFQFY GVFTG CG ELDHGV AVGYG S
Sbjct: 238 EDVPANNENALQKAVANQPISVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGISN 297
Query: 305 -GSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
G+ Y +VKNSWG WGE GYIRM+R+ EGLCGI AS P
Sbjct: 298 DGTKYWLVKNSWGTDWGEEGYIRMQRSIDAAEGLCGIAMQASYP 341
>gi|255564908|ref|XP_002523447.1| cysteine protease, putative [Ricinus communis]
gi|223537275|gb|EEF38906.1| cysteine protease, putative [Ricinus communis]
Length = 342
Score = 342 bits (877), Expect = 1e-91, Method: Compositional matrix adjust.
Identities = 180/351 (51%), Positives = 230/351 (65%), Gaps = 15/351 (4%)
Query: 1 MAFFSHSKLLLLSL-SLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTY 59
MA S +KL+ ++L + L+A + + H +M+ E E WM+K+G+ Y
Sbjct: 1 MATVSENKLMFVALLVVGLWASQAWSRSL--------HDAAMN---ERHEMWMAKYGRVY 49
Query: 60 KCIEEKLHRFEIFKENLKHIDQRNKEVTS-YWLGLNEFADMSHEEFKNKYLGLKPQFPTR 118
K EK RFEIF+ N++ I+ NK Y L +NEFAD+++EEFK G K
Sbjct: 50 KDNSEKERRFEIFRNNVEFIESFNKLGNRPYKLDINEFADLTNEEFKVSKNGYKRSSGVG 109
Query: 119 RQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLT 178
+ F Y +V A+P S+DWR+ GAVTP+K+QG CG CWAFS VAA+EGI ++ +G L
Sbjct: 110 LTEKSSFRYANVTAVPTSMDWRQNGAVTPIKDQGQCGCCWAFSAVAAMEGITKLSTGKLI 169
Query: 179 SLSEQELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEME 237
SLSEQEL+DCDTS + GC GGLMD AF++I +GGL E +YPY +GTC K +
Sbjct: 170 SLSEQELVDCDTSGEDQGCEGGLMDDAFEFIKQNGGLTTEANYPYQGTDGTCNTNKAGND 229
Query: 238 VVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAA 297
I+GY+DVP N E +LLKA+A QPVSVAI+ASG+ FQFYSGGVFTG CG ELDHGV A
Sbjct: 230 AAKITGYEDVPANSEDALLKAVASQPVSVAIDASGSAFQFYSGGVFTGDCGTELDHGVTA 289
Query: 298 VGYGKS-KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
VGYG S G+ Y +VKNSWG WGE GYIRM+R+ EGLCGI S P
Sbjct: 290 VGYGTSDDGTKYWLVKNSWGTSWGEDGYIRMERDIEAKEGLCGIAMQPSYP 340
>gi|356563155|ref|XP_003549830.1| PREDICTED: vignain-like [Glycine max]
Length = 361
Score = 342 bits (877), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 178/348 (51%), Positives = 233/348 (66%), Gaps = 12/348 (3%)
Query: 8 KLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLH 67
K+ ++LS +L +A F ++ + L S + L +L+E W S H + ++EK +
Sbjct: 5 KVFFVALSFALVL--RVAESFE---FNEKDLESEEGLWDLYERWRSHH-TVSRSLDEKHN 58
Query: 68 RFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQ----FPTRRQPSA 123
RF +FK N+ H+ NK Y L LN FADM++ EF++ Y G K F + +
Sbjct: 59 RFNVFKGNVMHVHSSNKMDKPYKLKLNRFADMTNHEFRSIYAGSKVNHHRMFRGTPRGNG 118
Query: 124 EFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQ 183
F Y++V +P SVDWRKKGAVT VK+QG CGSCWAFST+ AVEGINQI + L LSEQ
Sbjct: 119 TFMYQNVDRVPSSVDWRKKGAVTDVKDQGQCGSCWAFSTIVAVEGINQIKTHKLVPLSEQ 178
Query: 184 ELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISG 243
EL+DCDT+ N GCNGGLM+ AF++I G+ +YPY ++GTC+ K V+I G
Sbjct: 179 ELVDCDTTQNQGCNGGLMESAFEFI-KQYGITTASNYPYEAKDGTCDASKVNEPAVSIDG 237
Query: 244 YQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKS 303
+++VP N+E +LLKA+AHQPVSVAIEA G DFQFYS GVFTG CG LDHGVA VGYG +
Sbjct: 238 HENVPVNNEAALLKAVAHQPVSVAIEAGGIDFQFYSEGVFTGNCGTALDHGVAIVGYGTT 297
Query: 304 K-GSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLKK 350
+ G+ Y VKNSWG +WGE+GYIRMKR+ +GLCGI AS P+KK
Sbjct: 298 QDGTKYWTVKNSWGSEWGEKGYIRMKRSISVKKGLCGIAMEASYPIKK 345
>gi|224102377|ref|XP_002312656.1| predicted protein [Populus trichocarpa]
gi|222852476|gb|EEE90023.1| predicted protein [Populus trichocarpa]
Length = 358
Score = 342 bits (877), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 181/347 (52%), Positives = 227/347 (65%), Gaps = 13/347 (3%)
Query: 8 KLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLH 67
K++L S+ L LA F Y+ E L S ++L +L+E W S H + + EK
Sbjct: 5 KVILAVFSVVLVF--RLADSFD---YTEEDLASEERLRDLYERWRSHH-TVSRSLAEKQE 58
Query: 68 RFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQ----FPTRRQPSA 123
RF +FKENLKHI + N + Y L LN FADM++ EF Y G K +RQ +
Sbjct: 59 RFNVFKENLKHIHKVNHKDRPYKLKLNSFADMTNHEFLQHYGGSKVSHYRVLRGQRQGTG 118
Query: 124 EFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQ 183
+ D LP SVDWRK GAVT +K+QG CGSCWAFSTVAAVEGIN+I +G L SLSEQ
Sbjct: 119 SM-HEDTSKLPSSVDWRKNGAVTGIKDQGKCGSCWAFSTVAAVEGINKIKTGELISLSEQ 177
Query: 184 ELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISG 243
EL+DCD+ N+GCNGGLM+ AF +I GGL E YPY +E C+ K VV I G
Sbjct: 178 ELVDCDSD-NHGCNGGLMEDAFNFIKQIGGLTSENTYPYRAKEEPCDSNKMNSPVVNIDG 236
Query: 244 YQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKS 303
Y+ VPENDE +L+KA+A+QPV++A++A G D QFYS +FTG CG EL+HGVA VGYG +
Sbjct: 237 YEMVPENDENALMKAVANQPVAIAMDAGGKDLQFYSEAIFTGDCGTELNHGVALVGYGTT 296
Query: 304 K-GSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
+ G+ Y IVKNSWG WGE+GYIRM+R EGLCGI AS P+K
Sbjct: 297 QDGTKYWIVKNSWGTDWGEKGYIRMQRGIDAEEGLCGITMEASYPVK 343
>gi|317106666|dbj|BAJ53169.1| JHL18I08.3 [Jatropha curcas]
Length = 441
Score = 342 bits (877), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 165/312 (52%), Positives = 213/312 (68%), Gaps = 7/312 (2%)
Query: 43 KLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKE-VTSYWLGLNEFADMSH 101
++ LFE+W +HGKTY EEKL R ++F++N + + N + +SY L LN FAD++H
Sbjct: 25 EIAHLFETWCQQHGKTYASQEEKLFRLKVFQDNYDFVTEHNSQGNSSYTLSLNAFADLTH 84
Query: 102 EEFKNKYLGLKPQFPTRRQPSAEFSYRD----VKALPKSVDWRKKGAVTPVKNQGSCGSC 157
EFK LGL + + S R V +P SVDWRK GAVT VK+QG+CG+C
Sbjct: 85 HEFKASRLGLSS--AASASLNVDRSNRQIPDFVADVPASVDWRKNGAVTQVKDQGNCGAC 142
Query: 158 WAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKE 217
W+FS A+EGIN+IV+G+L SLSEQEL+DCD S+NNGC GG+MDYAF++++ + G+ E
Sbjct: 143 WSFSATGAIEGINKIVTGSLVSLSEQELVDCDKSYNNGCEGGIMDYAFQFVIDNHGIDTE 202
Query: 218 EDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQF 277
EDYPY + +C +K + VVTI GY DVP+N+E+ LLKA+A+QPVSV I S FQ
Sbjct: 203 EDYPYQGRDRSCNKEKLKRHVVTIDGYVDVPQNNEKELLKAVANQPVSVGICGSERAFQL 262
Query: 278 YSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGL 337
YS G+FTGPC LDH V VGYG G DY IVKNSWG WG GY+ M+RN+G GL
Sbjct: 263 YSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGSYWGMDGYMHMQRNSGSSRGL 322
Query: 338 CGINKMASIPLK 349
CGIN +AS P K
Sbjct: 323 CGINMLASYPKK 334
>gi|52546920|gb|AAU81593.1| cysteine proteinase [Petunia x hybrida]
Length = 210
Score = 342 bits (877), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 158/208 (75%), Positives = 183/208 (87%), Gaps = 3/208 (1%)
Query: 54 KHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKP 113
+HGK Y+ IEEKLHRFEIFKENLKHID+RNK V++YWLGLNEF+D+SH+EFK YLGLK
Sbjct: 3 QHGKIYESIEEKLHRFEIFKENLKHIDERNKIVSNYWLGLNEFSDLSHDEFKKMYLGLKV 62
Query: 114 Q---FPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGIN 170
++Q +F YRD LPKSVDWRKKGAVTPVKNQG CGSCWAFSTVAAVEGIN
Sbjct: 63 DHDLLNNKKQSQQDFEYRDFVDLPKSVDWRKKGAVTPVKNQGQCGSCWAFSTVAAVEGIN 122
Query: 171 QIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCE 230
QI +GNLTSLSEQELIDCDT++NNGCNGGLMDYAF++I+++GGLHKE+DYPYLMEEGTC+
Sbjct: 123 QIKTGNLTSLSEQELIDCDTTYNNGCNGGLMDYAFQFIISNGGLHKEDDYPYLMEEGTCD 182
Query: 231 DKKEEMEVVTISGYQDVPENDEQSLLKA 258
+K++E EVVTI GY+DVP NDEQSLLKA
Sbjct: 183 EKRDESEVVTIDGYRDVPANDEQSLLKA 210
>gi|37780051|gb|AAP32198.1| cysteine protease 12 [Trifolium repens]
Length = 343
Score = 342 bits (877), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 177/344 (51%), Positives = 227/344 (65%), Gaps = 16/344 (4%)
Query: 11 LLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFE 70
L +SL+LF C L + + L D + E E WM+ +GK YK +E+ R
Sbjct: 7 LYHVSLALFFCLGLL----AIQVTSRTLQD-DSIFERHEQWMTHYGKVYKNPQEREKRLR 61
Query: 71 IFKENLKHIDQRNKEVTS--YWLGLNEFADMSHEEF---KNKYLGLKPQFPTRRQPSAEF 125
IF ENLK+I+ N Y LG+N+FAD+++EEF +NK+ G R + F
Sbjct: 62 IFTENLKYIEASNNAGNKKPYKLGINQFADLTNEEFIASRNKFKGHMCSSIIR---TTTF 118
Query: 126 SYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQEL 185
Y + ++P +VDWRKKGAVTPVKNQG CG CWAFS +AA EGI++I +G L SLSEQEL
Sbjct: 119 KYENT-SVPSTVDWRKKGAVTPVKNQGQCGCCWAFSAIAATEGIHKISTGKLVSLSEQEL 177
Query: 186 IDCDTS-FNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGY 244
+DCDT+ + GC GGLMD AFK+I+ + G+ E YPY +GTC+ + TI+GY
Sbjct: 178 VDCDTNGVDQGCEGGLMDDAFKFIIQNNGISTEAGYPYQGVDGTCKANEASTSAATITGY 237
Query: 245 QDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSK 304
+DVP N+E +L KA+A+QP+SVAI+ASG+DFQFY GVFTG CG ELDHGV AVGYG S
Sbjct: 238 EDVPANNENALQKAVANQPISVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGISN 297
Query: 305 -GSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
G+ Y +VKNSWG WGE GYIRM+R+ EGLCGI AS P
Sbjct: 298 DGTKYWLVKNSWGTDWGEEGYIRMQRSIDAAEGLCGIAMQASYP 341
>gi|146215980|gb|ABQ10192.1| actinidin Act2a [Actinidia deliciosa]
Length = 378
Score = 342 bits (876), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 174/342 (50%), Positives = 227/342 (66%), Gaps = 13/342 (3%)
Query: 14 LSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFK 73
+S SL S+L S + + D+++ ++ESW+ +HGK+Y ++EK RFEIFK
Sbjct: 8 ISKSLLFFSTLLILSSAIDIENSVQRTNDQVMAMYESWLVEHGKSYNSLDEKEMRFEIFK 67
Query: 74 ENLKHIDQRNKEVT-SYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDV-- 130
ENL+ ID N + SY LGLN FAD++ EE+++ YLGLK R P + S + +
Sbjct: 68 ENLRIIDDHNADANRSYSLGLNRFADLTDEEYRSTYLGLK------RGPKTDVSNQYMPK 121
Query: 131 --KALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDC 188
ALP VDWR GAV VKNQG C SCWAFS VAAVEGIN+IV+GNL SLSEQEL+DC
Sbjct: 122 VGDALPDYVDWRTVGAVVGVKNQGLCSSCWAFSAVAAVEGINKIVTGNLISLSEQELVDC 181
Query: 189 D-TSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDV 247
T GCN GLM AFK+I+ +GG++ E +YPY ++G C + + VTI Y++V
Sbjct: 182 GRTQITKGCNRGLMTDAFKFIINNGGINTENNYPYTAKDGQCNLSLKNQKYVTIDSYKNV 241
Query: 248 PENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSD 307
P N+E +L KA+A+QPVSV +E+ G F+ Y+ G+FTG CG +DHGV VGYG +G D
Sbjct: 242 PSNNEMALKKAVAYQPVSVGVESEGGKFKLYTSGIFTGSCGTAVDHGVTIVGYGTERGMD 301
Query: 308 YIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
Y IVKNSWG WGE GYIR++RN G G CGI KM S P+K
Sbjct: 302 YWIVKNSWGTNWGESGYIRIQRNIGG-AGKCGIAKMPSYPVK 342
>gi|255580657|ref|XP_002531151.1| cysteine protease, putative [Ricinus communis]
gi|223529264|gb|EEF31236.1| cysteine protease, putative [Ricinus communis]
Length = 340
Score = 341 bits (874), Expect = 3e-91, Method: Compositional matrix adjust.
Identities = 177/349 (50%), Positives = 229/349 (65%), Gaps = 13/349 (3%)
Query: 1 MAFFSHSKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYK 60
MAF + + + L+L L A S A ++ S + E E WMS+ G+ Y
Sbjct: 1 MAFTTRNGCISLALIFLLGALVSQAMARTLQDAS---------MHEKHEEWMSRFGRVYN 51
Query: 61 CIEEKLHRFEIFKENLKHIDQRNKEV-TSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRR 119
EK R++IFKEN++ I+ NK SY LG+N+FAD+++EEFK K + +
Sbjct: 52 DGNEKEIRYKIFKENVQRIESFNKASGKSYKLGINQFADLTNEEFKTSRNRFKGHMCSSQ 111
Query: 120 QPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTS 179
+ F Y ++ A P S+DWRKKGAVT +K+QG CGSCWAFS VAAVEGI Q+ + L S
Sbjct: 112 --AGPFRYENLTAAPSSMDWRKKGAVTAIKDQGQCGSCWAFSAVAAVEGITQLATSKLIS 169
Query: 180 LSEQELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEV 238
LSEQEL+DCDT + GC GGLMD AFK+I + GL E +YPY +GTC K+E
Sbjct: 170 LSEQELVDCDTKGEDQGCQGGLMDDAFKFIEQNQGLTTEANYPYEGSDGTCNTKQEANHA 229
Query: 239 VTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAV 298
I+G++DVP N+E +L+KA+A QPVSVAI+A G FQFYS G+FTG CG ELDHGVAAV
Sbjct: 230 AKINGFEDVPANNEGALMKAVAKQPVSVAIDAGGFGFQFYSSGIFTGDCGTELDHGVAAV 289
Query: 299 GYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
GYG+S G +Y +VKNSWG +WGE GYIRM+++ EGLCGI AS P
Sbjct: 290 GYGESNGMNYWLVKNSWGTQWGEEGYIRMQKDIDAKEGLCGIAMQASYP 338
>gi|356577811|ref|XP_003557016.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 341 bits (874), Expect = 3e-91, Method: Compositional matrix adjust.
Identities = 171/340 (50%), Positives = 229/340 (67%), Gaps = 14/340 (4%)
Query: 14 LSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFK 73
+SL++ C + F + S + + + E E WM+++GK YK +E+ RF IFK
Sbjct: 10 ISLAMLLCMAFLA-FQVTCRSLQDAS----MYERHEQWMTRYGKVYKDPQEREKRFRIFK 64
Query: 74 ENLKHIDQRNKEVTS-YWLGLNEFADMSHEEF---KNKYLGLKPQFPTRRQPSAEFSYRD 129
EN+ +I+ N Y L +N+FAD+++EEF +N++ G R + F Y +
Sbjct: 65 ENVNYIEAFNNAANKRYKLAINQFADLTNEEFIAPRNRFKGHMCSSIIR---TTTFKYEN 121
Query: 130 VKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCD 189
V A+P +VDWR+KGAVTP+K+QG CG CWAFS VAA EGI+ + SG L SLSEQEL+DCD
Sbjct: 122 VTAVPSTVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALTSGKLISLSEQELVDCD 181
Query: 190 T-SFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVP 248
T + GC GGLMD AFK+++ + GL+ E +YPY +G C + + TI+GY+DVP
Sbjct: 182 TKGVDQGCEGGLMDDAFKFVIQNHGLNTEANYPYKGVDGKCNVNEAANDAATITGYEDVP 241
Query: 249 ENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSK-GSD 307
N+E++L KA+A+QPVSVAI+ASG+DFQFY GVFTG CG ELDHGV AVGYG S G++
Sbjct: 242 ANNEKALQKAVANQPVSVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVSNDGTE 301
Query: 308 YIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
Y +VKNSWG +WGE GYIRM+R EGLCGI AS P
Sbjct: 302 YWLVKNSWGTEWGEEGYIRMQRGVNSEEGLCGIAMQASYP 341
>gi|225443827|ref|XP_002274223.1| PREDICTED: vignain-like [Vitis vinifera]
Length = 340
Score = 341 bits (874), Expect = 4e-91, Method: Compositional matrix adjust.
Identities = 178/346 (51%), Positives = 224/346 (64%), Gaps = 15/346 (4%)
Query: 6 HSKLLLLSL-SLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEE 64
SK++ ++L + ++A +L+ V S H E WM +G+TYK I E
Sbjct: 4 ESKIICITLLIMGVWASQALSRTLHEVSMSERH-----------EDWMGLYGRTYKDIAE 52
Query: 65 KLHRFEIFKENLKHIDQRNKEVTS-YWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSA 123
K RF+IFKEN+++I+ N Y L +NEFAD ++EEFK G R
Sbjct: 53 KERRFKIFKENVEYIESVNSAGNRRYKLSINEFADQTNEEFKASRNGYNMSSRPRSSEIT 112
Query: 124 EFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQ 183
F Y +V A+P S+DWRKKGAVTP+K+QG CG CWAFS VAA+EG+ Q+ +G L SLSEQ
Sbjct: 113 SFRYENVAAVPSSMDWRKKGAVTPIKDQGQCGCCWAFSAVAAMEGVTQLKTGELISLSEQ 172
Query: 184 ELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTIS 242
EL+DCDTS + GC GGLMD AF++I+ +GGL E +YPY + TC KK I
Sbjct: 173 ELVDCDTSGEDQGCGGGLMDSAFEFIIGNGGLTTEANYPYKGVDATCNKKKAASSAAKIK 232
Query: 243 GYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGK 302
Y+DVP N E +LLKA+A PVSVAI+A G+DFQFYS GVFTG CG ELDHGV AVGYGK
Sbjct: 233 NYEDVPANSEAALLKAVAQHPVSVAIDAGGSDFQFYSSGVFTGQCGTELDHGVTAVGYGK 292
Query: 303 S-KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
+ G+ Y +VKNSWG WGE GYI M+R+ G EGLCGI AS P
Sbjct: 293 TDDGTKYWLVKNSWGTGWGEDGYIWMERDIGADEGLCGIAMEASYP 338
>gi|359359120|gb|AEV41026.1| putative cysteine protease [Oryza minuta]
Length = 464
Score = 340 bits (873), Expect = 4e-91, Method: Compositional matrix adjust.
Identities = 177/342 (51%), Positives = 227/342 (66%), Gaps = 20/342 (5%)
Query: 28 FSIVGYSPEHLTSMDKLIE--------LFESWMSKH----GKTYKCIEEKLHRFEIFKEN 75
SI+ Y+ EH +++E +++ W+++H G + E RF +F +N
Sbjct: 38 MSIIRYNAEHGVRGLEVVERTEAEARAVYDLWVARHRHGGGSHNGFVGEYERRFRVFWDN 97
Query: 76 LKHIDQRN---KEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKA 132
LK +D N E + LG+N FAD++++EF+ YLG P R + + V+A
Sbjct: 98 LKFVDAHNAHADEHGGFRLGMNRFADLTNDEFRAAYLGTTPA-GRGRHVGEMYRHDGVEA 156
Query: 133 LPKSVDWRKKGAV-TPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTS 191
LP SVDWR KGAV +PVKNQG CGSCWAFS VAAVEGIN+IV+G L SLSEQEL++C +
Sbjct: 157 LPDSVDWRDKGAVVSPVKNQGQCGSCWAFSAVAAVEGINKIVTGELVSLSEQELVECARN 216
Query: 192 F-NNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPEN 250
N+GCNGG+MD AF +I +GGL EEDYPY +G C+ K+ +VV+I G++DVPEN
Sbjct: 217 RGNSGCNGGIMDDAFAFITRNGGLDTEEDYPYTAMDGKCDLAKKSRKVVSIDGFEDVPEN 276
Query: 251 DEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGK--SKGSDY 308
DE SL KA+AHQPVSVAI+A G +FQ Y GVFTG CG LDHGV AVGYG + G+DY
Sbjct: 277 DELSLQKAVAHQPVSVAIDAGGREFQLYDSGVFTGRCGTSLDHGVVAVGYGTDAATGTDY 336
Query: 309 IIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLKK 350
V+NSWGP WGE GYIRM+RN G CGI MAS P+KK
Sbjct: 337 WTVRNSWGPDWGENGYIRMERNVTARTGKCGIAMMASYPIKK 378
>gi|359359118|gb|AEV41024.1| putative oryzain beta chain precursor [Oryza minuta]
Length = 493
Score = 340 bits (873), Expect = 5e-91, Method: Compositional matrix adjust.
Identities = 172/365 (47%), Positives = 232/365 (63%), Gaps = 44/365 (12%)
Query: 27 DFSIVGYSPEHLTSMDKLIEL-----FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQ 81
D SI+ Y+ EH + E ++ W++++G++Y + E+ RF +F +NLK +D
Sbjct: 23 DMSIISYNAEHGARGLERTEAEARAAYDLWLAENGRSYNALGERERRFRVFWDNLKFVDA 82
Query: 82 RN---KEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAE-FSYRDVKALPKSV 137
N E + LG+N FAD++++EF+ +LG K F R + + E + + V+ LP+SV
Sbjct: 83 HNARADEHGGFRLGMNRFADLTNDEFRATFLGAK--FVERSRAAGERYRHDGVEELPESV 140
Query: 138 DWRKKGAVTPVKNQGSC--------------------------------GSCWAFSTVAA 165
DWR+KGAV PVKNQG C GSCWAFS V+
Sbjct: 141 DWREKGAVAPVKNQGQCVDRIIVWNSMVRIYVVDAGCMLENPLMGLTVQGSCWAFSAVST 200
Query: 166 VEGINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKEEDYPYLM 224
VE INQ+V+G + +LSEQEL++C T+ N+GCNGGLMD AF +I+ +GG+ E+DYPY
Sbjct: 201 VESINQLVTGEMITLSEQELVECSTNGQNSGCNGGLMDDAFDFIIKNGGIDTEDDYPYKA 260
Query: 225 EEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFT 284
+G C+ +E +VV+I G++DVP+NDE+SL KA+AHQPVSVAIEA G +FQ Y GVF+
Sbjct: 261 VDGKCDINRENAKVVSIDGFEDVPQNDEKSLQKAVAHQPVSVAIEAGGREFQLYHSGVFS 320
Query: 285 GPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMA 344
G CG LDHGV AVGYG G DY IV+NSWGPKWGE GY+RM+RN G CGI MA
Sbjct: 321 GRCGTSLDHGVVAVGYGTDNGKDYWIVRNSWGPKWGESGYVRMERNINATTGKCGIAMMA 380
Query: 345 SIPLK 349
S P K
Sbjct: 381 SYPTK 385
>gi|357474579|ref|XP_003607574.1| Cysteine protease [Medicago truncatula]
gi|355508629|gb|AES89771.1| Cysteine protease [Medicago truncatula]
Length = 345
Score = 340 bits (873), Expect = 5e-91, Method: Compositional matrix adjust.
Identities = 168/311 (54%), Positives = 219/311 (70%), Gaps = 11/311 (3%)
Query: 44 LIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTS--YWLGLNEFADMSH 101
+ E E WM +GK YK ++E+ +R +IFKEN+ +I+ N + Y LG+N+FAD+++
Sbjct: 37 IYEKHEQWMVHYGKVYKDLQERENRLKIFKENVNYIEASNNAGNNKLYKLGINQFADITN 96
Query: 102 EEF---KNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCW 158
EEF +NK+ G T+ ++ F Y + ++P +VDWRKKGAVTPVKNQG CG CW
Sbjct: 97 EEFIASRNKFKGHMCSSITK---TSTFKYENA-SVPSTVDWRKKGAVTPVKNQGQCGCCW 152
Query: 159 AFSTVAAVEGINQIVSGNLTSLSEQELIDCDT-SFNNGCNGGLMDYAFKYIVASGGLHKE 217
AFS VAA EGI+++ +G L SLSEQEL+DCDT + GC GGLMD AFK+I+ + GLH E
Sbjct: 153 AFSAVAATEGIHKLSTGKLVSLSEQELVDCDTKGVDQGCEGGLMDDAFKFIIQNHGLHTE 212
Query: 218 EDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQF 277
YPY +GTC + TI+GY+DVP N+E +L KA+A+QP+SVAI+ASG+DFQF
Sbjct: 213 AQYPYQGVDGTCSANETSTPAATIAGYEDVPANNENALQKAVANQPISVAIDASGSDFQF 272
Query: 278 YSGGVFTGPCGAELDHGVAAVGYGKSK-GSDYIIVKNSWGPKWGERGYIRMKRNTGKPEG 336
Y GVFTG CG +LDHGV AVGYG S G+ Y +VKNSWG WGE GYIRM+R+ +G
Sbjct: 273 YKSGVFTGSCGTQLDHGVTAVGYGISNDGTKYWLVKNSWGNDWGEEGYIRMQRSVDAAQG 332
Query: 337 LCGINKMASIP 347
LCGI MAS P
Sbjct: 333 LCGIAMMASYP 343
>gi|357474527|ref|XP_003607548.1| Cysteine protease [Medicago truncatula]
gi|358347211|ref|XP_003637653.1| Cysteine protease [Medicago truncatula]
gi|355503588|gb|AES84791.1| Cysteine protease [Medicago truncatula]
gi|355508603|gb|AES89745.1| Cysteine protease [Medicago truncatula]
Length = 345
Score = 340 bits (873), Expect = 5e-91, Method: Compositional matrix adjust.
Identities = 169/311 (54%), Positives = 220/311 (70%), Gaps = 11/311 (3%)
Query: 44 LIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTS--YWLGLNEFADMSH 101
+ E E WM +GK YK ++E+ +R +IFKEN+ +I+ N + Y LG+N+FAD+++
Sbjct: 37 IYEKHEQWMVHYGKVYKDLQERENRLKIFKENVNYIEASNNAGNNKLYKLGINQFADLTN 96
Query: 102 EEF---KNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCW 158
EEF +NK+ G T+ ++ F Y + ++P +VDWRKKGAVTPVKNQG CG CW
Sbjct: 97 EEFIASRNKFKGHMCSSITK---TSTFKYENA-SVPSTVDWRKKGAVTPVKNQGQCGCCW 152
Query: 159 AFSTVAAVEGINQIVSGNLTSLSEQELIDCDT-SFNNGCNGGLMDYAFKYIVASGGLHKE 217
AFS VAA EGI+++ +G L SLSEQEL+DCDT + GC GGLMD AFK+I+ + GL+ E
Sbjct: 153 AFSAVAATEGIHKLSTGKLVSLSEQELVDCDTKGVDQGCEGGLMDDAFKFIIQNHGLNTE 212
Query: 218 EDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQF 277
YPY +GTC K + VTI+GY+DVP N+EQ+L KA+A+QP+SVAI+ASG+DFQF
Sbjct: 213 AQYPYQGVDGTCSANKASIHAVTITGYEDVPANNEQALQKAVANQPISVAIDASGSDFQF 272
Query: 278 YSGGVFTGPCGAELDHGVAAVGYG-KSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEG 336
Y GVFTG CG ELDHGV AVGYG + G+ Y +VKNSWG WGE GYI+M+R EG
Sbjct: 273 YKSGVFTGSCGTELDHGVTAVGYGVGNDGTKYWLVKNSWGTDWGEEGYIKMQRGVDAAEG 332
Query: 337 LCGINKMASIP 347
LCGI AS P
Sbjct: 333 LCGIAMEASYP 343
>gi|297603535|ref|NP_001054211.2| Os04g0670200 [Oryza sativa Japonica Group]
gi|109939735|sp|P25777.2|ORYB_ORYSJ RecName: Full=Oryzain beta chain; Flags: Precursor
gi|32488398|emb|CAE02823.1| OSJNBa0043A12.28 [Oryza sativa Japonica Group]
gi|90399163|emb|CAJ86092.1| H0818H01.14 [Oryza sativa Indica Group]
gi|125550169|gb|EAY95991.1| hypothetical protein OsI_17862 [Oryza sativa Indica Group]
gi|215766596|dbj|BAG98700.1| unnamed protein product [Oryza sativa Japonica Group]
gi|255675868|dbj|BAF16125.2| Os04g0670200 [Oryza sativa Japonica Group]
Length = 466
Score = 340 bits (873), Expect = 5e-91, Method: Compositional matrix adjust.
Identities = 170/345 (49%), Positives = 227/345 (65%), Gaps = 14/345 (4%)
Query: 18 LFACSSLAHDFSIVGYSPEHLT-------SMDKLIELFESWMSKHG--KTYKCIEEKLHR 68
+ ++ A D SI+ Y+ EH + + ++ W++++G E R
Sbjct: 15 IVGAATAAPDMSIISYNAEHGARGLEEGPTEAEARAAYDLWLAENGGGSPNALGGEHERR 74
Query: 69 FEIFKENLKHIDQRN---KEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEF 125
F +F +NLK +D N E + LG+N FAD+++EEF+ +LG K R +
Sbjct: 75 FLVFWDNLKFVDAHNARADERGGFRLGMNRFADLTNEEFRATFLGAKVA-ERSRAAGERY 133
Query: 126 SYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQEL 185
+ V+ LP+SVDWR+KGAV PVKNQG CGSCWAFS V+ VE INQ+V+G + +LSEQEL
Sbjct: 134 RHDGVEELPESVDWREKGAVAPVKNQGQCGSCWAFSAVSTVESINQLVTGEMITLSEQEL 193
Query: 186 IDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGY 244
++C T+ N+GCNGGLMD AF +I+ +GG+ E+DYPY +G C+ +E +VV+I G+
Sbjct: 194 VECSTNGQNSGCNGGLMDDAFDFIIKNGGIDTEDDYPYKAVDGKCDINRENAKVVSIDGF 253
Query: 245 QDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSK 304
+DVP+NDE+SL KA+AHQPVSVAIEA G +FQ Y GVF+G CG LDHGV AVGYG
Sbjct: 254 EDVPQNDEKSLQKAVAHQPVSVAIEAGGREFQLYHSGVFSGRCGTSLDHGVVAVGYGTDN 313
Query: 305 GSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
G DY IV+NSWGPKWGE GY+RM+RN G CGI MAS P K
Sbjct: 314 GKDYWIVRNSWGPKWGESGYVRMERNINVTTGKCGIAMMASYPTK 358
>gi|359359215|gb|AEV41119.1| putative cysteine protease [Oryza officinalis]
Length = 499
Score = 340 bits (872), Expect = 5e-91, Method: Compositional matrix adjust.
Identities = 177/342 (51%), Positives = 226/342 (66%), Gaps = 20/342 (5%)
Query: 28 FSIVGYSPEHLTSMDKLIE--------LFESWMSKH---GKTYK-CIEEKLHRFEIFKEN 75
SI+ Y+ EH +++E +++ W+++H G ++ + E RF +F +N
Sbjct: 37 MSIIRYNAEHGVRGLEVVERTEAEARAVYDLWVARHRHGGDSHNGLVGEYERRFRVFWDN 96
Query: 76 LKHIDQRN---KEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKA 132
LK +D N E + LG+N FAD++++EF+ YLG P R A + + V+
Sbjct: 97 LKFVDAHNARADEHGGFRLGMNRFADLTNDEFRAAYLGTTPAGRGRHVGEA-YRHDGVEV 155
Query: 133 LPKSVDWRKKGAVT-PVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDC-DT 190
LP SVDWR KGAV PVKNQG CGSCWAFS VAAVEGIN+IV+G L SLSEQEL++C
Sbjct: 156 LPDSVDWRDKGAVVAPVKNQGQCGSCWAFSAVAAVEGINKIVTGELVSLSEQELVECARN 215
Query: 191 SFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPEN 250
N+GCNGG+MD AF +I +GGL EEDYPY +G C K+ +VV+I G++DVPEN
Sbjct: 216 GANSGCNGGMMDDAFAFIARNGGLDTEEDYPYTAMDGKCNLAKKSRKVVSIDGFEDVPEN 275
Query: 251 DEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGK--SKGSDY 308
DE SL KA+AHQPVSVAI+A G +FQ Y GVFTG CG LDHGV AVGYG + G+DY
Sbjct: 276 DELSLQKAVAHQPVSVAIDAGGREFQLYDSGVFTGRCGTSLDHGVVAVGYGTDAATGTDY 335
Query: 309 IIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLKK 350
V+NSWGP WGE GYIRM+RN G CGI MAS P+KK
Sbjct: 336 WTVRNSWGPDWGENGYIRMERNVTARTGKCGIAMMASYPIKK 377
>gi|255646088|gb|ACU23531.1| unknown [Glycine max]
Length = 362
Score = 340 bits (872), Expect = 6e-91, Method: Compositional matrix adjust.
Identities = 169/322 (52%), Positives = 219/322 (68%), Gaps = 6/322 (1%)
Query: 33 YSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLG 92
+ + L S + +L+E W S + + + +K RF +FK N+ H+ NK Y L
Sbjct: 25 FHDKDLASEESFWDLYERWRS-YRTVSRSLGDKHKRFNVFKANVMHVHNTNKMDKPYKLK 83
Query: 93 LNEFADMSHEEFKNKYLGLKPQ----FPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPV 148
LN+FADM++ EF++ Y G K F + + F Y V ++P S DWRK GAVT V
Sbjct: 84 LNKFADMTNHEFRSTYAGSKVNHHRMFQGTPRGNGTFMYEKVGSVPPSADWRKNGAVTGV 143
Query: 149 KNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYI 208
K+QG CGSCWAFSTV AVEGINQI + L SLSEQEL+DCDT N GCNGGLM+ AF++I
Sbjct: 144 KDQGQCGSCWAFSTVVAVEGINQIKTNKLVSLSEQELVDCDTKKNAGCNGGLMESAFEFI 203
Query: 209 VASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAI 268
GG+ E +YPY ++GTC+ K V+I G+++VP NDE +LLKA+A+QPVSVAI
Sbjct: 204 KQKGGITTESNYPYTAQDGTCDASKANDLAVSIDGHENVPANDENALLKAVANQPVSVAI 263
Query: 269 EASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKS-KGSDYIIVKNSWGPKWGERGYIRM 327
+A G DFQFY GVFTG C EL+HGVA VGYG + G++Y V+NSWGP+WGE+GYIRM
Sbjct: 264 DAGGFDFQFYFEGVFTGDCSTELNHGVAIVGYGTTVDGTNYWTVRNSWGPEWGEQGYIRM 323
Query: 328 KRNTGKPEGLCGINKMASIPLK 349
+R+ K EGLCGI MAS P+K
Sbjct: 324 QRSIFKKEGLCGIAMMASYPIK 345
>gi|2224812|emb|CAB09699.1| cysteine endopeptidase EP-A [Hordeum vulgare subsp. vulgare]
Length = 365
Score = 340 bits (872), Expect = 6e-91, Method: Compositional matrix adjust.
Identities = 170/326 (52%), Positives = 223/326 (68%), Gaps = 8/326 (2%)
Query: 31 VGYSPEHLTSMDKLIELFESWMSKHGKTYKCI--EEKLHRFEIFKENLKHIDQRNKEVTS 88
V ++ + L S + L L+E W S + + + + + + RF +FK+N +++ + NK
Sbjct: 24 VPFTEKDLASEESLRGLYERWRSHYTVSRRGLGADAEERRFNVFKQNARYVHEGNKRDMP 83
Query: 89 YWLGLNEFADMSHEEFKNKYLGLKPQ----FPTRRQPSAEFSYRDVKALPKSVDWRKKGA 144
+ L LN+FADM+ +EF+ Y G + + R+ F Y D LP +VDWR+KGA
Sbjct: 84 FRLALNKFADMTTDEFRRTYAGSRVRHHLSLSGGRRGDGGFRYGDADNLPPAVDWRQKGA 143
Query: 145 VTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYA 204
VT +K+QG CGSCWAFST+ AVEGIN+I +G L SLSEQEL+DCD N GC+GGLMDYA
Sbjct: 144 VTAIKDQGQCGSCWAFSTIVAVEGINKIRTGKLVSLSEQELMDCDNVNNQGCDGGLMDYA 203
Query: 205 FKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPV 264
F++I G+ E +YPY E+G+C+ KE + VTI GY+DVP NDE +L KA+A QPV
Sbjct: 204 FQFI-QKNGITTESNYPYQGEQGSCDQAKENAQAVTIDGYEDVPANDESALQKAVAGQPV 262
Query: 265 SVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSK-GSDYIIVKNSWGPKWGERG 323
SVAI+ASG DFQFYS GVFTG C +LDHGVAAVGYG ++ G+ Y IVKNSWG WGE+G
Sbjct: 263 SVAIDASGQDFQFYSEGVFTGECSTDLDHGVAAVGYGATRDGTKYWIVKNSWGEDWGEKG 322
Query: 324 YIRMKRNTGKPEGLCGINKMASIPLK 349
YIRM+R + EGLCGI AS P K
Sbjct: 323 YIRMQRGVSQTEGLCGIAMQASYPTK 348
>gi|124484401|dbj|BAF46311.1| cysteine proteinase precursor [Ipomoea nil]
Length = 339
Score = 340 bits (871), Expect = 8e-91, Method: Compositional matrix adjust.
Identities = 177/344 (51%), Positives = 231/344 (67%), Gaps = 14/344 (4%)
Query: 11 LLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIEL-FESWMSKHGKTYKCIEEKLHRF 69
+ S SL L +LA F+ Y T +D L+ + E WM+++G+ YK EK R+
Sbjct: 1 MASNSLKLLI--ALALVFATSAYLATSRTLLDSLMAVRHEQWMAQYGRVYKNEVEKTKRY 58
Query: 70 EIFKENLKHIDQRNKEVTS-YWLGLNEFADMSHEEF---KNKYLGLKPQFPTRRQPSAEF 125
IFKEN+++I+ NK T Y LG+N FAD++++EF +N Y+ P + F
Sbjct: 59 NIFKENVEYIESFNKAGTKPYKLGINAFADLTNKEFIASRNGYI-----LPHECSSNTPF 113
Query: 126 SYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQEL 185
Y +V A+P +VDWRKKGAVTPVK+QG CG CWAFS VAA+EGI ++ +GNL SLSEQEL
Sbjct: 114 RYENVSAVPTTVDWRKKGAVTPVKDQGQCGCCWAFSAVAAMEGITKLSTGNLISLSEQEL 173
Query: 186 IDCDT-SFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGY 244
+DCD + GC GGLMD AF +I+ + GL E +YPY +G+C+ K ISGY
Sbjct: 174 VDCDVKGIDQGCEGGLMDDAFTFIINNKGLTTESNYPYQGTDGSCKKSKSSNSAAKISGY 233
Query: 245 QDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSK 304
+DVP N E +L KA+A+QPVSVAI+A G+DFQFYS GVFTG CG ELDHGV AVGYG ++
Sbjct: 234 EDVPANSESALEKAVANQPVSVAIDAGGSDFQFYSSGVFTGECGTELDHGVTAVGYGIAE 293
Query: 305 -GSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
GS Y +VKNSWG WGE+GYIRM+++ EGLCGI +S P
Sbjct: 294 DGSKYWLVKNSWGTSWGEKGYIRMQKDIEAKEGLCGIAMQSSYP 337
>gi|356517358|ref|XP_003527354.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
gi|356577767|ref|XP_003556994.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 343
Score = 340 bits (871), Expect = 9e-91, Method: Compositional matrix adjust.
Identities = 175/348 (50%), Positives = 230/348 (66%), Gaps = 16/348 (4%)
Query: 7 SKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMD-KLIELFESWMSKHGKTYKCIEEK 65
+K +SL+L CS + + T D + E E WM ++ K YK +E+
Sbjct: 3 AKNQFYQISLALLFCSGF------LAFQVTCRTLQDASMYERHEEWMGRYAKVYKDPQER 56
Query: 66 LHRFEIFKENLKHIDQRNKEVTS-YWLGLNEFADMSHEEF---KNKYLGLKPQFPTRRQP 121
RF+IFKEN+ +I+ N Y LG+N+FAD+++EEF +N++ G TR
Sbjct: 57 ERRFKIFKENVNYIEAFNNAANKPYTLGINQFADLTNEEFIAPRNRFKGHMCSSITR--- 113
Query: 122 SAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLS 181
+ F Y +V A+P +VDWR+KGAVTP+K+QG CG CWAFS VAA EGI+ + +G L SLS
Sbjct: 114 TTTFKYENVTAIPSTVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALSAGKLISLS 173
Query: 182 EQELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVT 240
EQE++DCDT + GC GG MD AFK+I+ + GL+ E +YPY +G C K V T
Sbjct: 174 EQEVVDCDTKGEDQGCAGGFMDGAFKFIIQNHGLNNEPNYPYKAVDGKCNAKAAANHVAT 233
Query: 241 ISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGY 300
I+GY+DVP N+E++L KA+A+QPVSVAI+ASG+DFQFY GVFTG CG ELDHGV AVGY
Sbjct: 234 ITGYEDVPVNNEKALQKAVANQPVSVAIDASGSDFQFYQSGVFTGSCGTELDHGVTAVGY 293
Query: 301 GKSK-GSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
G S G++Y +VKNSWG +WGE GYIRM+R EGL GI MAS P
Sbjct: 294 GVSADGTEYWLVKNSWGTEWGEEGYIRMQRGVKAEEGLXGIAMMASYP 341
>gi|449460678|ref|XP_004148072.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Cucumis
sativus]
Length = 317
Score = 339 bits (870), Expect = 9e-91, Method: Compositional matrix adjust.
Identities = 168/307 (54%), Positives = 217/307 (70%), Gaps = 6/307 (1%)
Query: 44 LIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEE 103
+ + ++ WM K+G+ YK EE RF I++ N+++ID N S+ L N FAD+++EE
Sbjct: 15 IQDRYQKWMDKYGRQYKSREEWERRFTIYQANVQYIDNFNSMNHSHTLAENNFADLTNEE 74
Query: 104 FKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTV 163
FK YLG K T P F Y ++ LP +VDWR++GAVTP+KNQG CGSCWAFS V
Sbjct: 75 FKATYLGYK----TVSIPDTCFRYGNMVNLPTNVDWRQEGAVTPIKNQGQCGSCWAFSAV 130
Query: 164 AAVEGINQIVSGNLTSLSEQELIDCD-TSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPY 222
AAVEGIN+I +G L SLSEQEL+DCD TS N GCNGG M AF++I + GL E +YPY
Sbjct: 131 AAVEGINKIKAGKLISLSEQELVDCDVTSGNQGCNGGYMYKAFEFIKRT-GLTTEIEYPY 189
Query: 223 LMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGV 282
E C ++KE+ + V+ISGY+ VP NDE+SL A+A+QPVSVAI+A G +FQFYSGG+
Sbjct: 190 QGAESACNEQKEKYQFVSISGYEKVPVNDEKSLKAAVANQPVSVAIDAEGNNFQFYSGGI 249
Query: 283 FTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINK 342
F+G CG +L+HGVA VGYG++ Y +VKNSWG WGE GYIRMKR++ +G CGI
Sbjct: 250 FSGNCGNQLNHGVAIVGYGETSNQAYWLVKNSWGTDWGESGYIRMKRDSTDRQGTCGIAM 309
Query: 343 MASIPLK 349
MAS P K
Sbjct: 310 MASYPTK 316
>gi|356542633|ref|XP_003539771.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 341
Score = 339 bits (870), Expect = 9e-91, Method: Compositional matrix adjust.
Identities = 176/337 (52%), Positives = 225/337 (66%), Gaps = 10/337 (2%)
Query: 18 LFACS-SLAHDFSIVGYSPEHLTSMDK-LIELFESWMSKHGKTYKCIEEKLHRFEIFKEN 75
LF C+ +L F+ + T D + E E WM+ HGK YK EK +++IF EN
Sbjct: 6 LFHCTLALFLIFAFCAFEANARTLEDAPMRERHEQWMATHGKVYKHSYEKEQKYQIFMEN 65
Query: 76 LKHIDQ-RNKEVTSYWLGLNEFADMSHEEFK--NKYLGLKPQFPTRRQPSAEFSYRDVKA 132
++ I+ N Y LG+N FAD+++EEFK N++ G ++R + F Y +V A
Sbjct: 66 VQRIEAFNNAGXKPYKLGINHFADLTNEEFKAINRFKG---HVCSKRTRTTTFRYENVTA 122
Query: 133 LPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDT-S 191
+P S+DWR+KGAVTP+K+QG CG CWAFS VAA EGI ++ +G L SLSEQEL+DCDT
Sbjct: 123 VPASLDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGITKLRTGKLISLSEQELVDCDTKG 182
Query: 192 FNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPEND 251
+ GC GGLMD AFK+I+ + GL E YPY +GTC K + +I GY+DVP N
Sbjct: 183 VDQGCEGGLMDDAFKFILQNKGLATEAIYPYEGFDGTCNAKADGNHAGSIKGYEDVPANS 242
Query: 252 EQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYG-KSKGSDYII 310
E +LLKA+A+QPVSVAIEASG FQFYSGGVFTG CG LDHGV +VGYG G+ Y +
Sbjct: 243 ESALLKAVANQPVSVAIEASGFKFQFYSGGVFTGSCGTNLDHGVTSVGYGVGDDGTKYWL 302
Query: 311 VKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
VKNSWG KWGE+GYIRM+R+ EGLCGI +AS P
Sbjct: 303 VKNSWGVKWGEKGYIRMQRDVAAKEGLCGIAMLASYP 339
>gi|144905116|dbj|BAF56430.1| cysteine proteinase [Lotus japonicus]
Length = 341
Score = 339 bits (870), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 175/346 (50%), Positives = 228/346 (65%), Gaps = 14/346 (4%)
Query: 7 SKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMD-KLIELFESWMSKHGKTYKCIEEK 65
SK +L SL+L F + + T D + E E WM+++GK YK EK
Sbjct: 3 SKTVLNITSLTLLLV------FGFLSFEANARTLEDASMHERHEQWMAQYGKVYKDSYEK 56
Query: 66 LHRFEIFKENLKHIDQ-RNKEVTSYWLGLNEFADMSHEEFK--NKYLGLKPQFPTRRQPS 122
R +IFKEN++ I+ N SY LG+N+FAD+++EEFK N++ G TR +
Sbjct: 57 ELRSKIFKENVQRIEAFNNAGNKSYKLGINQFADLTNEEFKARNRFKGHMCSNSTR---T 113
Query: 123 AEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSE 182
F Y V ++P S+DWR+KGAVTP+K+QG CG CWAFS VAA EGI ++ +G L SLSE
Sbjct: 114 PTFKYEHVTSVPASLDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGITKLSTGKLISLSE 173
Query: 183 QELIDCDT-SFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTI 241
QEL+DCDT + GC GGLMD AFK+I+ + GL+ E YPY + TC E + +I
Sbjct: 174 QELVDCDTKGVDQGCEGGLMDDAFKFIMQNKGLNTEAKYPYQGVDATCNANAEAKDAASI 233
Query: 242 SGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYG 301
G++DVP N E +LLKA+A+QP+SVAI+ASG++FQFYS GVFTG CG ELDHGV AVGYG
Sbjct: 234 KGFEDVPANSESALLKAVANQPISVAIDASGSEFQFYSSGVFTGSCGTELDHGVTAVGYG 293
Query: 302 KSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
G+ Y +VKNSWG +WGE+GYIRM+R+ EGLCG AS P
Sbjct: 294 SDGGTKYWLVKNSWGEQWGEQGYIRMQRDVAAEEGLCGFAMQASYP 339
>gi|297809385|ref|XP_002872576.1| hypothetical protein ARALYDRAFT_489965 [Arabidopsis lyrata subsp.
lyrata]
gi|297318413|gb|EFH48835.1| hypothetical protein ARALYDRAFT_489965 [Arabidopsis lyrata subsp.
lyrata]
Length = 371
Score = 339 bits (869), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 170/360 (47%), Positives = 227/360 (63%), Gaps = 16/360 (4%)
Query: 5 SHSKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMD----------KLIELFESWMSK 54
+ S +L+L L++ + +C++ A D SIV + H + + +FESWM K
Sbjct: 4 AKSAMLVLLLAMVISSCAT-AMDMSIVSSNDNHHVTNGPGRRQGVFDAEATLMFESWMVK 62
Query: 55 HGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQ 114
HGK Y+ + EK R IF++NL+ I RN E SY LGLN FAD+S E+ G P+
Sbjct: 63 HGKVYESVAEKERRLTIFEDNLRFITNRNAENLSYRLGLNRFADLSLHEYAQICHGADPR 122
Query: 115 FPTRR---QPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQ 171
P S + D LPKSVDWR +GAVT VK+QG C SCWAFSTV AVEG+N+
Sbjct: 123 PPRNHVFMTSSNRYKTSDGDVLPKSVDWRNEGAVTEVKDQGQCRSCWAFSTVGAVEGLNK 182
Query: 172 IVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCED 231
IV+G L +LSEQ+LI+C+ NNGC GG ++ A+++I+ +GGL + DYPY G C D
Sbjct: 183 IVTGELVTLSEQDLINCNKE-NNGCGGGKVETAYEFIMNNGGLGTDNDYPYKALNGVCND 241
Query: 232 K-KEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAE 290
+ KE + V I GY+++P NDE +L+KA+AHQPV+ +++S +FQ Y+ GVF G CG
Sbjct: 242 RLKENNKNVMIDGYENLPANDESALMKAVAHQPVTAVVDSSSREFQLYASGVFDGTCGTN 301
Query: 291 LDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLKK 350
L+HGV VGYG G DY IV+NS G WGE GY++M RN P GLCGI AS PLK
Sbjct: 302 LNHGVVVVGYGTENGRDYWIVRNSRGNTWGEAGYMKMARNIANPRGLCGIAMRASYPLKN 361
>gi|255580659|ref|XP_002531152.1| cysteine protease, putative [Ricinus communis]
gi|223529265|gb|EEF31237.1| cysteine protease, putative [Ricinus communis]
Length = 340
Score = 339 bits (869), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 172/332 (51%), Positives = 226/332 (68%), Gaps = 7/332 (2%)
Query: 21 CSSLAHDFSIVGYSPEHL--TSMDKLI-ELFESWMSKHGKTYKCIEEKLHRFEIFKENLK 77
C SLA F + + + + T D I E E WM++ + Y +EK R++IFKEN++
Sbjct: 9 CISLALIFFLGALASQAIARTLQDASIHEKHEEWMTRFKRVYSDAKEKEIRYKIFKENVQ 68
Query: 78 HIDQRNKEV-TSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKS 136
I+ NK SY LG+N+FAD+++EEFK K + + + F Y ++ A+P S
Sbjct: 69 RIESFNKASEKSYKLGINQFADLTNEEFKTSRNRFKGHMCSSQ--AGPFRYENITAVPSS 126
Query: 137 VDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSF-NNG 195
+DWRK+GAVT +K+QG CGSCWAFS VAAVEGI Q+ + L SLSEQEL+DCDT + G
Sbjct: 127 MDWRKEGAVTAIKDQGQCGSCWAFSAVAAVEGITQLATSKLISLSEQELVDCDTKGEDQG 186
Query: 196 CNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSL 255
C GGLMD AFK+I + GL E +YPY +GTC K+E I+G++DVP N+E +L
Sbjct: 187 CQGGLMDDAFKFIEQNQGLTTEANYPYEGSDGTCNTKQEANHAAKINGFEDVPANNEGAL 246
Query: 256 LKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSW 315
+KA+A QPVSVAI+A G +FQFYS G+FTG CG ELDHGVAAVGYG+S G +Y +VKNSW
Sbjct: 247 MKAVAKQPVSVAIDAGGFEFQFYSSGIFTGDCGTELDHGVAAVGYGESNGMNYWLVKNSW 306
Query: 316 GPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
G +WGE GYIRM+++ EGLCGI AS P
Sbjct: 307 GTQWGEEGYIRMQKDIDAKEGLCGIAMQASYP 338
>gi|1173630|gb|AAB37233.1| cysteine proteinase [Phalaenopsis sp. SM9108]
Length = 359
Score = 339 bits (869), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 182/352 (51%), Positives = 234/352 (66%), Gaps = 17/352 (4%)
Query: 8 KLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLH 67
KL L L S A + + + + + L + D L L+E W S H + + ++EK
Sbjct: 2 KLFSLILVASFLASVAA----TAIDIADKDLETEDSLWNLYERWRSHHTVS-RDLDEKQK 56
Query: 68 RFEIFKENLKHIDQRNKEV-TSYWLGLNEFADMSHEEFKNKYLGLKPQF------PTRRQ 120
RF +FKEN ++I NK Y L LN+FAD+++ EF++ Y G + R
Sbjct: 57 RFNVFKENPRYIHDFNKRKDIPYKLRLNKFADLTNHEFRSTYAGSRINHHRSLRGSRRGG 116
Query: 121 PSAEFSYR--DVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLT 178
+ F Y+ D ++LP S+DWR+KGAVT VK+QG CGSCWAFSTVAAVEGINQI + L
Sbjct: 117 ATNSFMYQSLDSRSLPASIDWRQKGAVTAVKDQGQCGSCWAFSTVAAVEGINQIKTKKLL 176
Query: 179 SLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEV 238
SLSEQELIDCDT NNGCNGGLMDYAF +I +GG+ E +YPY E+ C +K+ V
Sbjct: 177 SLSEQELIDCDTDENNGCNGGLMDYAFDFIKKNGGISSEAEYPYAAEDSYCATEKKS-HV 235
Query: 239 VTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAV 298
V+I G++DVP NDE SLLKA+A+QPVS+AIEASG DFQFYS GVFTG G ELDHGVA V
Sbjct: 236 VSIDGHEDVPANDEDSLLKAVANQPVSIAIEASGYDFQFYSEGVFTGRSGTELDHGVAIV 295
Query: 299 GYGKS-KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
GYGK+ +G+ Y IV+NSWG +WGE+GYIR+ + LCG+ AS P+K
Sbjct: 296 GYGKTQQGTKYWIVRNSWGAEWGEKGYIRISAASDSKR-LCGLAMEASYPIK 346
>gi|4731374|gb|AAD28477.1|AF133839_1 papain-like cysteine protease [Sandersonia aurantiaca]
Length = 357
Score = 338 bits (868), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 181/353 (51%), Positives = 234/353 (66%), Gaps = 21/353 (5%)
Query: 7 SKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKL 66
+ L + L L+L S+L+ + + L S D L L+E W S H + + +++K
Sbjct: 2 ASLFPVLLVLALAFGSTLS-----IPIKEKDLESEDSLWSLYERWRSHHAVS-RDLDQKQ 55
Query: 67 HRFEIFKENLKHIDQ--RNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPS-- 122
RF +FKEN+K I + +NK+VT + L LN+F DM+++EF+ KY G K + S
Sbjct: 56 KRFNVFKENVKFIHEFNKNKDVT-FKLALNKFGDMTNQEFRAKYAGSKVHHHRTMKGSRH 114
Query: 123 -----AEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNL 177
A+F Y + A P S+DWR++GAV VKNQG CGSCWAFS +AAVEGINQIV+ L
Sbjct: 115 GSGSGAKFMYENAVA-PPSIDWRERGAVAAVKNQGQCGSCWAFSAIAAVEGINQIVTKEL 173
Query: 178 TSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEME 237
LSEQELIDCDT N GC+GGLMDYAF++I +GG+ E+ YPY E+ TC K+
Sbjct: 174 VPLSEQELIDCDTDQNQGCSGGLMDYAFEFIKNNGGITTEDVYPYQAEDATC---KKNSP 230
Query: 238 VVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAA 297
V I GY+DVP NDE +L+KA+A+QPV+VAIEASG FQFYS GVFTG CG ELDHGVA
Sbjct: 231 AVVIDGYEDVPTNDEDALMKAVANQPVAVAIEASGYVFQFYSEGVFTGRCGTELDHGVAV 290
Query: 298 VGYGKSK-GSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
VGYG ++ G+ Y V+NSWG WGE GY+RM+R GLCGI AS P+K
Sbjct: 291 VGYGTTQDGTKYWTVRNSWGADWGESGYVRMQRGIKATHGLCGIAMQASYPIK 343
>gi|326507362|dbj|BAK03074.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 473
Score = 338 bits (868), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 164/341 (48%), Positives = 225/341 (65%), Gaps = 17/341 (4%)
Query: 27 DFSIVGYSPEHLTSMDKLIE-----LFESWMSKHG----KTYKCIEEKLHRFEIFKENLK 77
D SI+ Y+ EH + E +++ W+++HG I ++ RF F +NL+
Sbjct: 26 DMSIIAYNAEHGARGLERTEAEARAVYDLWLAEHGGGSSPNANSIADRERRFSAFWDNLR 85
Query: 78 HIDQRNKEVTS----YWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSA---EFSYRDV 130
+D N + + L +N FAD++++EF+ YLG+K R + +
Sbjct: 86 FVDAHNARAAAGEEGFRLAMNRFADLTNDEFRAAYLGVKGAAERNRAGRVVGERYRHDGA 145
Query: 131 KALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDT 190
+ LP++VDWR+KGAV PVKNQG CGSCWAFS V+ VE INQIV+G + +LSEQEL++CD
Sbjct: 146 EELPEAVDWREKGAVAPVKNQGQCGSCWAFSAVSTVESINQIVTGEMVTLSEQELVECDI 205
Query: 191 SF-NNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPE 249
+ ++GCNGGLMD AF++I+ +GG+ E+DYPY +G C+ ++ +VV+I G++DVPE
Sbjct: 206 NGQSSGCNGGLMDDAFEFIIKNGGIDTEDDYPYKAVDGRCDVLRKNAKVVSIDGFEDVPE 265
Query: 250 NDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYI 309
NDE+SL KA+AH PVSVAIEA G +FQ Y GVF+G CG +LDHGV AVGYG G DY
Sbjct: 266 NDEKSLQKAVAHHPVSVAIEAGGREFQLYHSGVFSGRCGTQLDHGVVAVGYGTENGKDYW 325
Query: 310 IVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLKK 350
IV+NSWGP WGE GY+RM+RN G CGI M+S P KK
Sbjct: 326 IVRNSWGPNWGEAGYLRMERNINVTSGKCGIAMMSSYPTKK 366
>gi|204307508|gb|ACI00280.1| triticain beta 2 [Hordeum vulgare]
Length = 473
Score = 338 bits (868), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 164/341 (48%), Positives = 225/341 (65%), Gaps = 17/341 (4%)
Query: 27 DFSIVGYSPEHLTSMDKLIE-----LFESWMSKHG----KTYKCIEEKLHRFEIFKENLK 77
D SI+ Y+ EH + E +++ W+++HG I ++ RF F +NL+
Sbjct: 26 DMSIIAYNAEHGARGLERTEAEARAVYDLWLAEHGGGSSPNANSIADRERRFSAFWDNLR 85
Query: 78 HIDQRNKEVTS----YWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSA---EFSYRDV 130
+D N + + L +N FAD++++EF+ YLG+K R + +
Sbjct: 86 FVDAHNARAAAGEEGFRLAMNRFADLTNDEFRAAYLGVKGAAERNRAGRVVGERYRHDGA 145
Query: 131 KALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDT 190
+ LP++VDWR+KGAV PVKNQG CGSCWAFS V+ VE INQIV+G + +LSEQEL++CD
Sbjct: 146 EELPEAVDWREKGAVAPVKNQGQCGSCWAFSAVSTVESINQIVTGEMVTLSEQELVECDI 205
Query: 191 SF-NNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPE 249
+ ++GCNGGLMD AF++I+ +GG+ E+DYPY +G C+ ++ +VV+I G++DVPE
Sbjct: 206 NGQSSGCNGGLMDDAFEFIIKNGGIDTEDDYPYKAVDGRCDVLRKNAKVVSIDGFEDVPE 265
Query: 250 NDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYI 309
NDE+SL KA+AH PVSVAIEA G +FQ Y GVF+G CG +LDHGV AVGYG G DY
Sbjct: 266 NDEKSLQKAVAHHPVSVAIEAGGREFQLYHSGVFSGRCGTQLDHGVVAVGYGTENGKDYW 325
Query: 310 IVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLKK 350
IV+NSWGP WGE GY+RM+RN G CGI M+S P KK
Sbjct: 326 IVRNSWGPNWGEAGYLRMERNINVTSGKCGIAMMSSYPTKK 366
>gi|296090463|emb|CBI40282.3| unnamed protein product [Vitis vinifera]
Length = 386
Score = 338 bits (868), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 163/307 (53%), Positives = 210/307 (68%), Gaps = 31/307 (10%)
Query: 45 IELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEF 104
+ ++E+W+ KHGK+Y + E+ RFEIFK+NL+ I++ N +Y +G
Sbjct: 1 MAVYEAWLVKHGKSYNALGERERRFEIFKDNLRFIEEHNAVNRTYKVG------------ 48
Query: 105 KNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVA 164
+S+R + LP+SVDWR+KGAV PVK+QG+CGSCWAFST+A
Sbjct: 49 ------------------DRYSFRAGEDLPESVDWREKGAVVPVKDQGNCGSCWAFSTIA 90
Query: 165 AVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLM 224
AVEGINQI +G+L SLSEQEL+DCD S+N GCNGGLMDYAF++I+ +GG+ EEDYPY
Sbjct: 91 AVEGINQIATGDLISLSEQELVDCDKSYNQGCNGGLMDYAFEFIINNGGIDSEEDYPYRA 150
Query: 225 EEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFT 284
+ TC+ ++ VV+I GY+DVP+NDE+SL KA+A+QPVSVAIEA G FQ Y GVFT
Sbjct: 151 ADTTCDPNRKNARVVSIDGYEDVPQNDERSLKKAVANQPVSVAIEAGGRAFQLYQSGVFT 210
Query: 285 GPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRN-TGKPEGLCGINKM 343
G CG +LDHGV AVGYG DY IV+NSWGP WGE GYI+++RN G G CGI
Sbjct: 211 GQCGTQLDHGVVAVGYGTENSVDYWIVRNSWGPNWGESGYIKLERNLAGTETGKCGIAIE 270
Query: 344 ASIPLKK 350
S P+K
Sbjct: 271 PSYPIKN 277
>gi|356515086|ref|XP_003526232.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 338 bits (868), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 169/338 (50%), Positives = 224/338 (66%), Gaps = 10/338 (2%)
Query: 14 LSLSLFACSSLAHDFSIVGYSPEHLTSMD-KLIELFESWMSKHGKTYKCIEEKLHRFEIF 72
+SL++ C + + + T D + E E WM+++GK YK +E+ RF +F
Sbjct: 10 ISLAMLLC------MTFLAFQVTCRTLQDASMYERHEQWMTRYGKVYKDPQEREKRFRVF 63
Query: 73 KENLKHIDQRNKEVT-SYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVK 131
KEN+ +I+ N SY LG+N+FAD++++EF G K + + F + +V
Sbjct: 64 KENVNYIEAFNNAANKSYKLGINQFADLTNKEFIAPRNGFKGHMCSSIIRTTTFKFENVT 123
Query: 132 ALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDT- 190
A P +VDWR+KGAVTP+K+QG CG CWAFS VAA EGI+ + +G L SLSEQEL+DCDT
Sbjct: 124 ATPSTVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALSAGKLISLSEQELVDCDTK 183
Query: 191 SFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPEN 250
+ GC GGLMD AFK+I+ + GL+ E +YPY +G C + TI+GY+DVP N
Sbjct: 184 GVDQGCEGGLMDDAFKFIIQNHGLNTEANYPYKGVDGKCNANEAAKNAATITGYEDVPAN 243
Query: 251 DEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKS-KGSDYI 309
+E +L KA+A+QPVSVAI+ASG+DFQFY GVFTG CG ELDHGV AVGYG S G++Y
Sbjct: 244 NEMALQKAVANQPVSVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVSDDGTEYW 303
Query: 310 IVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
+VKNSWG +WGE GYIRM+R EGLCGI AS P
Sbjct: 304 LVKNSWGTEWGEEGYIRMQRGVDSEEGLCGIAMQASYP 341
>gi|146215982|gb|ABQ10193.1| actinidin Act2b [Actinidia eriantha]
Length = 378
Score = 338 bits (867), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 171/351 (48%), Positives = 229/351 (65%), Gaps = 20/351 (5%)
Query: 1 MAFFSHSKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYK 60
M+ S LL+LSL+L + +D +++ ++ESW+ + GK+Y
Sbjct: 10 MSLLFFSTLLILSLALDIENSVQRTND---------------QVMAMYESWLVEQGKSYN 54
Query: 61 CIEEKLHRFEIFKENLKHIDQRNKEVT-SYWLGLNEFADMSHEEFKNKYLGLKPQFPTRR 119
++EK RFEIFKENL+ ID N + SY LGLN FAD++ EE+++ YLGLK +
Sbjct: 55 SLDEKEMRFEIFKENLRIIDDHNADANRSYSLGLNRFADLTDEEYRSTYLGLK--MGPKT 112
Query: 120 QPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTS 179
S E+ + +ALP VDWR GAV VKNQG C SCWAFS V AVEGIN+IV+GNL S
Sbjct: 113 DVSNEYMPKVGEALPDYVDWRTVGAVVGVKNQGLCSSCWAFSAVTAVEGINKIVTGNLIS 172
Query: 180 LSEQELIDC-DTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEV 238
LSEQEL+DC T GCN GLM AF++I+ +GG++ E++YPY ++G C + +
Sbjct: 173 LSEQELVDCGRTQRTKGCNRGLMTDAFQFIINNGGINTEDNYPYTAKDGQCNLSLKNQKY 232
Query: 239 VTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAV 298
VTI Y++VP N+E +L KA+A+QPVSV +E+ G F+ Y+ G+FTG CG +DHGV V
Sbjct: 233 VTIDNYKNVPSNNEMALKKAVAYQPVSVGVESEGGKFKLYTSGIFTGFCGTAVDHGVTIV 292
Query: 299 GYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
GYG +G DY IVKNSWG WGE GYIR++RN G G CGI +M S P+K
Sbjct: 293 GYGTERGMDYWIVKNSWGTNWGENGYIRIQRNIGGA-GKCGIARMPSYPVK 342
>gi|535454|gb|AAA50755.1| cysteine proteinase [Alnus glutinosa]
Length = 340
Score = 338 bits (867), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 175/350 (50%), Positives = 227/350 (64%), Gaps = 15/350 (4%)
Query: 1 MAFFSHSKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYK 60
M F S L++ ++L A S LA S+ S + E E WM+ +G+ YK
Sbjct: 1 MGFVSQCFCLVVMVTLGALA-SQLAAARSLQDAS---------MRERHEEWMASYGRVYK 50
Query: 61 CIEEKLHRFEIFKENLKHIDQRNKEVTS-YWLGLNEFADMSHEEFKNKYLGLKPQFPTRR 119
I EK R++IF+EN+ I+ NK+ Y L +N+FAD+++EEFK K + +
Sbjct: 51 DINEKQKRYKIFEENVALIESSNKDANKPYKLSVNQFADLTNEEFKASRNRFKGHICSTK 110
Query: 120 QPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTS 179
S F Y +V A+P ++DWR KGAVTPVK+QG CG CWAFS VAA EGI ++ +G L S
Sbjct: 111 STS--FKYGNVSAVPSAMDWRMKGAVTPVKDQGQCGCCWAFSAVAATEGITKLTTGELIS 168
Query: 180 LSEQELIDCDTS-FNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEV 238
LSEQEL+DCDTS + GC GGLMD AF +I + GL E +YPY +GTC K+ +
Sbjct: 169 LSEQELVDCDTSGVDQGCEGGLMDNAFTFIQHNHGLASEANYPYKGVDGTCNTNKQAIHA 228
Query: 239 VTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAV 298
I+G++DVP N E++LL A+AHQPVSVAI+A G+ FQFYS GVF G CG +LDHGV AV
Sbjct: 229 AEINGFEDVPANSEEALLNAVAHQPVSVAIDAGGSGFQFYSKGVFIGACGTQLDHGVTAV 288
Query: 299 GYGKS-KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
GYG S G+ Y +VKNSWG +WGE GYIRM+R+ EGLCGI AS P
Sbjct: 289 GYGTSDDGTKYWLVKNSWGTQWGEEGYIRMQRDVDAKEGLCGIAMKASYP 338
>gi|297794671|ref|XP_002865220.1| senescence-associated gene 12 [Arabidopsis lyrata subsp. lyrata]
gi|297311055|gb|EFH41479.1| senescence-associated gene 12 [Arabidopsis lyrata subsp. lyrata]
Length = 346
Score = 338 bits (867), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 175/354 (49%), Positives = 230/354 (64%), Gaps = 22/354 (6%)
Query: 4 FSHSKLLLLSLSLSLFACSSLAHDFSIVGYSP--EHLTSMDKLIELFESWMSKHGKTYKC 61
F H ++ L S F FSI P L + IE WM+KHG+ Y
Sbjct: 3 FKHMQIFLFVAIFSSFY-------FSISLSRPLDNELIMQKRHIE----WMTKHGRVYAD 51
Query: 62 IEEKLHRFEIFKENLKHIDQRNK--EVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRR 119
++EK +R+ +FK N++ I+ N ++ L +N+FAD++++EF++ Y G K
Sbjct: 52 VKEKSNRYVVFKSNVERIEHLNNIPAGRTFKLAVNQFADLTNDEFRSMYTGFKGVSSLSS 111
Query: 120 QP---SAEFSYRDVK--ALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVS 174
Q + F Y++V ALP SVDWR KGAVTP+KNQGSCG CWAFS VAA+EG QI
Sbjct: 112 QSQTKTTSFRYQNVSSGALPISVDWRTKGAVTPIKNQGSCGCCWAFSAVAAIEGATQIKK 171
Query: 175 GNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKE 234
G L SLSEQ+L+DCDT+ + GC GGLMD AF++I+A+GGL E +YPY E+ TC KK
Sbjct: 172 GKLISLSEQQLVDCDTN-DFGCEGGLMDTAFEHIMATGGLTTESNYPYKGEDATCNSKKT 230
Query: 235 EMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHG 294
+ +I+GY+DVP NDEQ+L+KA+AHQPVSV IE G DFQFYS GVFTG C LDH
Sbjct: 231 NPKATSITGYEDVPVNDEQALMKAVAHQPVSVGIEGGGFDFQFYSSGVFTGECTTYLDHA 290
Query: 295 VAAVGYGKS-KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
V A+GYG+S GS Y I+KNSWG KWGE GY+R++++ +GLCG+ AS P
Sbjct: 291 VTAIGYGQSTNGSKYWIIKNSWGTKWGESGYMRIQKDIKDKQGLCGLAMKASYP 344
>gi|194352756|emb|CAQ00106.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
Length = 476
Score = 338 bits (866), Expect = 3e-90, Method: Compositional matrix adjust.
Identities = 164/341 (48%), Positives = 225/341 (65%), Gaps = 17/341 (4%)
Query: 27 DFSIVGYSPEHLTSMDKLIE-----LFESWMSKHG----KTYKCIEEKLHRFEIFKENLK 77
D SI+ Y+ EH + E +++ W+++HG I ++ RF F +NL+
Sbjct: 26 DMSIIAYNAEHGARGLERTEAEARAVYDLWLAEHGGGSSPNANSIADRERRFSAFWDNLR 85
Query: 78 HIDQRNKEVTS----YWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSA---EFSYRDV 130
+D N + + L +N FAD++++EF+ YLG+K R + +
Sbjct: 86 FVDAHNARAAAGEEGFRLAMNRFADLTNDEFRAAYLGVKGAAERNRAGRVVGDRYRHDGA 145
Query: 131 KALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDT 190
+ LP++VDWR+KGAV PVKNQG CGSCWAFS V+ VE INQIV+G + +LSEQEL++CD
Sbjct: 146 EELPEAVDWREKGAVAPVKNQGQCGSCWAFSAVSTVESINQIVTGEMVTLSEQELVECDI 205
Query: 191 SF-NNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPE 249
+ ++GCNGGLMD AF++I+ +GG+ E+DYPY +G C+ ++ +VV+I G++DVPE
Sbjct: 206 NGQSSGCNGGLMDDAFEFIIKNGGIDTEDDYPYKAVDGRCDVLRKNAKVVSIDGFEDVPE 265
Query: 250 NDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYI 309
NDE+SL KA+AH PVSVAIEA G +FQ Y GVF+G CG +LDHGV AVGYG G DY
Sbjct: 266 NDEKSLQKAVAHHPVSVAIEAGGREFQLYHSGVFSGRCGTQLDHGVVAVGYGTENGKDYW 325
Query: 310 IVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLKK 350
IV+NSWGP WGE GY+RM+RN G CGI M+S P KK
Sbjct: 326 IVRNSWGPNWGEAGYLRMERNINVTSGKCGIAMMSSYPTKK 366
>gi|449524070|ref|XP_004169046.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like, partial
[Cucumis sativus]
Length = 314
Score = 338 bits (866), Expect = 3e-90, Method: Compositional matrix adjust.
Identities = 167/305 (54%), Positives = 216/305 (70%), Gaps = 6/305 (1%)
Query: 44 LIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEE 103
+ + ++ WM K+G+ YK EE RF I++ N+++ID N S+ L N FAD+++EE
Sbjct: 15 IQDRYQKWMDKYGRQYKSREEWERRFTIYQANVQYIDNFNSMNHSHTLAENNFADLTNEE 74
Query: 104 FKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTV 163
FK YLG K T P F Y ++ LP +VDWR++GAVTP+KNQG CGSCWAFS V
Sbjct: 75 FKATYLGYK----TVSIPDTCFRYGNMVNLPTNVDWRQEGAVTPIKNQGQCGSCWAFSAV 130
Query: 164 AAVEGINQIVSGNLTSLSEQELIDCD-TSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPY 222
AAVEGIN+I +G L SLSEQEL+DCD TS N GCNGG M AF++I + GL E +YPY
Sbjct: 131 AAVEGINKIKAGKLISLSEQELVDCDVTSGNQGCNGGYMYKAFEFIKRT-GLTTEIEYPY 189
Query: 223 LMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGV 282
E C ++KE+ + V+ISGY+ VP NDE+SL A+A+QPVSVAI+A G +FQFYSGG+
Sbjct: 190 QGAESACNEQKEKYQFVSISGYEKVPVNDEKSLKAAVANQPVSVAIDAEGNNFQFYSGGI 249
Query: 283 FTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINK 342
F+G CG +L+HGVA VGYG++ Y +VKNSWG WGE GYIRMKR++ +G CGI
Sbjct: 250 FSGNCGNQLNHGVAIVGYGETSNQAYWLVKNSWGTDWGESGYIRMKRDSTDKQGTCGIAM 309
Query: 343 MASIP 347
MAS P
Sbjct: 310 MASYP 314
>gi|4100157|gb|AAD10337.1| cysteine proteinase precursor [Hordeum vulgare]
Length = 365
Score = 338 bits (866), Expect = 3e-90, Method: Compositional matrix adjust.
Identities = 170/326 (52%), Positives = 221/326 (67%), Gaps = 8/326 (2%)
Query: 31 VGYSPEHLTSMDKLIELFESWMSKHGKTYKCI--EEKLHRFEIFKENLKHIDQRNKEVTS 88
V + + L S + L L+E W S + + + + + RF +FK+N +++ + NK
Sbjct: 24 VPLTEKDLASEESLRGLYERWRSHYTVSRRGLGADAGERRFNVFKQNARYVHEGNKRDMP 83
Query: 89 YWLGLNEFADMSHEEFKNKYLGLKPQ----FPTRRQPSAEFSYRDVKALPKSVDWRKKGA 144
+ L LN+FADM+ +EF+ Y G + + R+ F Y D LP +VDWR+KGA
Sbjct: 84 FRLALNKFADMTTDEFRRTYAGSRVRHHLSLSGGRRGDGGFRYGDADNLPPAVDWRQKGA 143
Query: 145 VTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYA 204
VT +K+QG CGSCWAFST+ AVEGIN+I +G L SLSEQEL+DCD N GC+GGLMDYA
Sbjct: 144 VTAIKDQGQCGSCWAFSTIVAVEGINKIRTGKLVSLSEQELMDCDNVNNQGCDGGLMDYA 203
Query: 205 FKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPV 264
F++I G+ E +YPY E+G+C+ KE + VTI GY+DVP NDE +L KA+A QPV
Sbjct: 204 FQFI-QKNGITTESNYPYQGEQGSCDQAKENAQAVTIDGYEDVPANDESALQKAVAGQPV 262
Query: 265 SVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSK-GSDYIIVKNSWGPKWGERG 323
SVAI+ASG DFQFYS GVFTG C +LDHGVAAVGYG ++ G+ Y IVKNSWG WGE+G
Sbjct: 263 SVAIDASGQDFQFYSEGVFTGECSTDLDHGVAAVGYGATRDGTKYWIVKNSWGEDWGEKG 322
Query: 324 YIRMKRNTGKPEGLCGINKMASIPLK 349
YIRM+R + EGLCGI AS P K
Sbjct: 323 YIRMQRGVSQTEGLCGIAMQASYPTK 348
>gi|242074728|ref|XP_002447300.1| hypothetical protein SORBIDRAFT_06g032360 [Sorghum bicolor]
gi|241938483|gb|EES11628.1| hypothetical protein SORBIDRAFT_06g032360 [Sorghum bicolor]
Length = 471
Score = 337 bits (865), Expect = 4e-90, Method: Compositional matrix adjust.
Identities = 164/314 (52%), Positives = 211/314 (67%), Gaps = 7/314 (2%)
Query: 43 KLIELFESWMSKHGKTY-KCIEEKLHRFEIFKENLKHIDQRNKEVTS--YWLGLNEFADM 99
++ ++E WM++HGK + E RF F +NL+ +D N + Y LG+N FAD+
Sbjct: 47 QVRAMYEQWMARHGKAASNALGEHDRRFRAFWDNLRFVDAHNARAGARGYRLGINRFADL 106
Query: 100 SHEEFKNKYLGLKPQFPTRRQPSAE-FSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCW 158
++ EF+ YL + T + E + + V+ALP+ VDWR+KGAV PVKNQG CGSCW
Sbjct: 107 TNAEFRAAYLSAGARNGTATAATGERYRHDGVEALPEFVDWRQKGAVAPVKNQGQCGSCW 166
Query: 159 AFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNG-CNGGLMDYAFKYIVASGGLHKE 217
AFS V AVEGINQIV+G L +LSEQEL+DC + NG C+GG+MD AF +IV +GG+ +
Sbjct: 167 AFSAVGAVEGINQIVTGELVTLSEQELVDCSKNGQNGGCDGGMMDDAFAFIVGNGGIDTD 226
Query: 218 EDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQF 277
+DYPY +G C+ K VV+I G++ VP NDE+SL KA+AHQPV+VAIEA G +FQ
Sbjct: 227 KDYPYTARDGKCDVAKRSRHVVSIDGFEGVPRNDEKSLQKAVAHQPVAVAIEAGGREFQL 286
Query: 278 YSGGVFTGPCGAELDHGVAAVGYGKSK--GSDYIIVKNSWGPKWGERGYIRMKRNTGKPE 335
Y GVFTG CG LDHGV AVGYG G DY +V+NSWG WGE GYIRM+RN G
Sbjct: 287 YQSGVFTGRCGTSLDHGVVAVGYGTEADGGRDYWLVRNSWGADWGEGGYIRMERNVGARA 346
Query: 336 GLCGINKMASIPLK 349
G CGI AS P+K
Sbjct: 347 GKCGIAMEASYPVK 360
>gi|13491750|gb|AAK27968.1|AF242372_1 cysteine protease [Ipomoea batatas]
Length = 339
Score = 337 bits (865), Expect = 4e-90, Method: Compositional matrix adjust.
Identities = 171/324 (52%), Positives = 220/324 (67%), Gaps = 6/324 (1%)
Query: 28 FSIVGYSPEHLTSMDKLIEL-FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEV 86
F+ Y T D L+ + E WM+++G+ YK EK RF IFKEN+++I+ NK
Sbjct: 16 FATSAYLATSRTLSDSLMVVRHEQWMAQYGRVYKTEAEKTKRFNIFKENVEYIESFNKAG 75
Query: 87 TS-YWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAV 145
T Y LG+N FAD++++EFK G K P + F Y +V ++P +VDWR KGAV
Sbjct: 76 TKPYKLGINAFADLTNQEFKASRNGYK--LPHDCSSNTPFRYENVSSVPTTVDWRTKGAV 133
Query: 146 TPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDT-SFNNGCNGGLMDYA 204
TPVK+QG CG CWAFS VAA+EGI ++ +GNL SLSEQEL+DCD + GC GGLMD A
Sbjct: 134 TPVKDQGQCGCCWAFSAVAAMEGITKLSTGNLISLSEQELVDCDVKGTDQGCEGGLMDDA 193
Query: 205 FKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPV 264
F +I+ + GL E +YPY +G+C+ K ISGY+DVP N E +L KA+A+QPV
Sbjct: 194 FSFIINNKGLTTESNYPYQGTDGSCKKSKSSNSAAKISGYEDVPANSESALEKAVANQPV 253
Query: 265 SVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSK-GSDYIIVKNSWGPKWGERG 323
SVAI+A G+DFQFYS GVFTG CG ELDHGV AVGYG ++ GS Y +VKNSWG WGE+G
Sbjct: 254 SVAIDAGGSDFQFYSSGVFTGECGTELDHGVTAVGYGIAEDGSKYWLVKNSWGTSWGEKG 313
Query: 324 YIRMKRNTGKPEGLCGINKMASIP 347
YIRM+++ EGLCGI +S P
Sbjct: 314 YIRMQKDIEAKEGLCGIAMQSSYP 337
>gi|2160175|gb|AAB60738.1| Strong similarity to Dianthus cysteine proteinase (gb|U17135)
[Arabidopsis thaliana]
Length = 416
Score = 337 bits (865), Expect = 4e-90, Method: Compositional matrix adjust.
Identities = 166/316 (52%), Positives = 209/316 (66%), Gaps = 8/316 (2%)
Query: 42 DKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEV-TSYWLGLNEFADMS 100
D + ELF+ W KHGKTY EE+ R +IFK+N + Q N +Y L LN FAD++
Sbjct: 24 DDISELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADLT 83
Query: 101 HEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAF 160
H EFK LGL P+ S S +P SVDWRKKGAVT VK+QGSCG+CW+F
Sbjct: 84 HHEFKASRLGLSVSAPSVIMASKGQSLGGSVKVPDSVDWRKKGAVTNVKDQGSCGACWSF 143
Query: 161 STVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDY 220
S A+EGINQIV+G+L SLSEQELIDCD S+N GCNGGLMDYAF++++ + G+ E+DY
Sbjct: 144 SATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNHGIDTEKDY 203
Query: 221 PYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSG 280
PY +GTC+ K + +VVTI Y V NDE++L++A+A QPVSV I S FQ YS
Sbjct: 204 PYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALMEAVAAQPVSVGICGSERAFQLYSS 263
Query: 281 -------GVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGK 333
G+F+GPC LDH V VGYG G DY IVKNSWG WG G++ M+RNT
Sbjct: 264 KFYLLMQGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKSWGMDGFMHMQRNTEN 323
Query: 334 PEGLCGINKMASIPLK 349
+G+CGIN +AS P+K
Sbjct: 324 SDGVCGINMLASYPIK 339
>gi|18422605|ref|NP_568651.1| senescence-associated protein 12 [Arabidopsis thaliana]
gi|13877737|gb|AAK43946.1|AF370131_1 putative senescence-specific cysteine protease SAG12 [Arabidopsis
thaliana]
gi|9758936|dbj|BAB09317.1| senescence-specific cysteine protease [Arabidopsis thaliana]
gi|14532898|gb|AAK64131.1| putative senescence-specific cysteine protease SAG12 [Arabidopsis
thaliana]
gi|332007929|gb|AED95312.1| senescence-associated protein 12 [Arabidopsis thaliana]
Length = 346
Score = 337 bits (865), Expect = 4e-90, Method: Compositional matrix adjust.
Identities = 175/352 (49%), Positives = 232/352 (65%), Gaps = 21/352 (5%)
Query: 9 LLLLSLSLSLFACSSLAHDFSIVGYSP--EHLTSMDKLIELFESWMSKHGKTYKCIEEKL 66
+ L + + LF + FSI P L + IE WM+KHG+ Y ++E+
Sbjct: 1 MALKHMQIFLFVAIFSSFCFSITLSRPLDNELIMQKRHIE----WMTKHGRVYADVKEEN 56
Query: 67 HRFEIFKENLKHIDQRNK--EVTSYWLGLNEFADMSHEEFKNKYLGLK------PQFPTR 118
+R+ +FK N++ I+ N ++ L +N+FAD++++EF++ Y G K Q T+
Sbjct: 57 NRYVVFKNNVERIEHLNSIPAGRTFKLAVNQFADLTNDEFRSMYTGFKGVSALSSQSQTK 116
Query: 119 RQPSAEFSYRDVK--ALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGN 176
P F Y++V ALP SVDWRKKGAVTP+KNQGSCG CWAFS VAA+EG QI G
Sbjct: 117 MSP---FRYQNVSSGALPVSVDWRKKGAVTPIKNQGSCGCCWAFSAVAAIEGATQIKKGK 173
Query: 177 LTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEM 236
L SLSEQ+L+DCDT+ + GC GGLMD AF++I A+GGL E +YPY E+ TC KK
Sbjct: 174 LISLSEQQLVDCDTN-DFGCEGGLMDTAFEHIKATGGLTTESNYPYKGEDATCNSKKTNP 232
Query: 237 EVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVA 296
+ +I+GY+DVP NDEQ+L+KA+AHQPVSV IE G DFQFYS GVFTG C LDH V
Sbjct: 233 KATSITGYEDVPVNDEQALMKAVAHQPVSVGIEGGGFDFQFYSSGVFTGECTTYLDHAVT 292
Query: 297 AVGYGKS-KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
A+GYG+S GS Y I+KNSWG KWGE GY+R++++ +GLCG+ AS P
Sbjct: 293 AIGYGESTNGSKYWIIKNSWGTKWGESGYMRIQKDVKDKQGLCGLAMKASYP 344
>gi|224076968|ref|XP_002305072.1| predicted protein [Populus trichocarpa]
gi|222848036|gb|EEE85583.1| predicted protein [Populus trichocarpa]
Length = 305
Score = 337 bits (865), Expect = 4e-90, Method: Compositional matrix adjust.
Identities = 160/306 (52%), Positives = 217/306 (70%), Gaps = 5/306 (1%)
Query: 44 LIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQ-RNKEVTSYWLGLNEFADMSHE 102
+++ E WM++HG+ Y ++EK R+ IFKEN++ I+ N Y LG+N+FAD+++E
Sbjct: 1 MLKRHEEWMAQHGRVYGDMKEKEKRYLIFKENIERIEAFNNGSDRGYKLGVNKFADLTNE 60
Query: 103 EFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFST 162
EF+ Y G K Q + + S+ F Y ++ +P S+DWR GAVTPVK+QG+CG CWAFST
Sbjct: 61 EFRAMYHGYKRQ--SSKLMSSSFRYENLSDIPTSMDWRNDGAVTPVKDQGTCGCCWAFST 118
Query: 163 VAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPY 222
VAA+EGI ++ +GNL SLSEQ+L+DC T+ N GC GGLMD AF+YI+ +GGL E++YPY
Sbjct: 119 VAAIEGIIKLQTGNLISLSEQQLVDC-TAGNKGCQGGLMDTAFQYIIRNGGLTSEDNYPY 177
Query: 223 LMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGV 282
+GTC +K I+GY+DVP+N+E +LL+A+A QPVSVA++ G DF+FY GV
Sbjct: 178 QGVDGTCSSEKAASTEAQITGYEDVPQNNENALLQAVAKQPVSVAVDGGGNDFRFYKSGV 237
Query: 283 FTGPCGAELDHGVAAVGYG-KSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGIN 341
F G CG L+HGV A+GYG S G+DY +VKNSWG WGE GY RM+R G EGLCG+
Sbjct: 238 FEGDCGTNLNHGVTAIGYGTDSDGTDYWLVKNSWGTSWGESGYTRMQRGIGASEGLCGVA 297
Query: 342 KMASIP 347
AS P
Sbjct: 298 MDASYP 303
>gi|1046373|gb|AAC49135.1| SAG12 protein [Arabidopsis thaliana]
Length = 346
Score = 337 bits (863), Expect = 6e-90, Method: Compositional matrix adjust.
Identities = 176/352 (50%), Positives = 231/352 (65%), Gaps = 21/352 (5%)
Query: 9 LLLLSLSLSLFACSSLAHDFSIVGYSP--EHLTSMDKLIELFESWMSKHGKTYKCIEEKL 66
+ L + + LF + FSI P L + IE WM+KHG+ Y ++E+
Sbjct: 1 MALKHMQIFLFVAIFSSFCFSITLSRPLDNELIMQKRHIE----WMTKHGRVYADVKEEN 56
Query: 67 HRFEIFKENLKHIDQRNK--EVTSYWLGLNEFADMSHEEFKNKYLGLK------PQFPTR 118
+R+ +FK N++ I+ N ++ L +N+FAD++++EF + Y G K Q T+
Sbjct: 57 NRYVVFKNNVERIEHLNSIPAGRTFKLAVNQFADLTNDEFCSMYTGFKGVSALSSQSQTK 116
Query: 119 RQPSAEFSYRDVK--ALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGN 176
P F Y++V ALP SVDWRKKGAVTP+KNQGSCG CWAFS VAA+EG QI G
Sbjct: 117 MSP---FRYQNVSSGALPVSVDWRKKGAVTPIKNQGSCGCCWAFSAVAAIEGATQIKKGK 173
Query: 177 LTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEM 236
L SLSEQ+L+DCDT+ + GC GGLMD AF++I A+GGL E DYPY E+ TC KK
Sbjct: 174 LISLSEQQLVDCDTN-DFGCEGGLMDTAFEHIKATGGLTTESDYPYKGEDATCNSKKTNP 232
Query: 237 EVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVA 296
+ +I+GY+DVP NDEQ+L+KA+AHQPVSV IE G DFQFYS GVFTG C LDH V
Sbjct: 233 KATSITGYEDVPVNDEQALMKAVAHQPVSVGIEGGGFDFQFYSSGVFTGECTTYLDHAVT 292
Query: 297 AVGYGKS-KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
A+GYG+S GS Y I+KNSWG KWGE GY+R++++ +GLCG+ AS P
Sbjct: 293 AIGYGESTNGSKYWIIKNSWGTKWGESGYMRIQKDVKDKQGLCGLAMKASYP 344
>gi|186516984|ref|NP_195406.2| cysteine proteinase1 [Arabidopsis thaliana]
gi|15290508|gb|AAK92229.1| cysteine proteinase [Arabidopsis thaliana]
gi|332661313|gb|AEE86713.1| cysteine proteinase1 [Arabidopsis thaliana]
Length = 376
Score = 337 bits (863), Expect = 6e-90, Method: Compositional matrix adjust.
Identities = 166/321 (51%), Positives = 220/321 (68%), Gaps = 14/321 (4%)
Query: 42 DKLIELFESWMSKHGKTYKCIE----EKLHRFEIFKENLKHID--QRNKEVTSYWLGLNE 95
+++ ++ W ++HGKT ++ RF IFK+NL+ ID N + +Y LGL +
Sbjct: 43 EEVRSIYLQWSAEHGKTNNNNNGIINDQDKRFNIFKDNLRFIDLHNENNKNATYKLGLTK 102
Query: 96 FADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYR------DVKALPKSVDWRKKGAVTPVK 149
F D++++E++ YLG + + P RR A+ + + K +P++VDWR+KGAV P+K
Sbjct: 103 FTDLTNDEYRKLYLGARTE-PARRIAKAKNVNQKYSAAVNGKEVPETVDWRQKGAVNPIK 161
Query: 150 NQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIV 209
+QG+CGSCWAFST AAVEGIN+IV+G L SLSEQEL+DCD S+N GCNGGLMDYAF++I+
Sbjct: 162 DQGTCGSCWAFSTTAAVEGINKIVTGELISLSEQELVDCDKSYNQGCNGGLMDYAFQFIM 221
Query: 210 ASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIE 269
+GGL+ E+DYPY G C + VV+I GY+DVP DE +L KA+++QPVSVAIE
Sbjct: 222 KNGGLNTEKDYPYRGFGGKCNSFLKNSRVVSIDGYEDVPTKDETALKKAISYQPVSVAIE 281
Query: 270 ASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKR 329
A G FQ Y G+FTG CG LDH V AVGYG G DY IV+NSWGP+WGE GYIRM+R
Sbjct: 282 AGGRIFQHYQSGIFTGSCGTNLDHAVVAVGYGSENGVDYWIVRNSWGPRWGEEGYIRMER 341
Query: 330 N-TGKPEGLCGINKMASIPLK 349
N G CGI AS P+K
Sbjct: 342 NLAASKSGKCGIAVEASYPVK 362
>gi|225446589|ref|XP_002280263.1| PREDICTED: vignain [Vitis vinifera]
Length = 339
Score = 337 bits (863), Expect = 7e-90, Method: Compositional matrix adjust.
Identities = 180/351 (51%), Positives = 230/351 (65%), Gaps = 18/351 (5%)
Query: 1 MAFFSHSKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYK 60
MA + + + ++L L A +S A S+ H SM E E WM+++G+ YK
Sbjct: 1 MASTNQYQYVSMALLFILAAWASQATSRSL------HEASM---YERHEDWMARYGRMYK 51
Query: 61 CIEEKLHRFEIFKENLKHIDQRNKEV-TSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRR 119
EK RF+IFK+N+ I+ NK + +Y L +NEFAD+++EEF++ L+ +F
Sbjct: 52 DANEKEKRFKIFKDNVARIESFNKAMDKTYKLSINEFADLTNEEFRS----LRNRFKAHI 107
Query: 120 QPSAE-FSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLT 178
A F Y +V A+P ++DWRKKGAVTP+K+Q CG CWAFS VAA EGI QI +G L
Sbjct: 108 CSEATTFKYENVTAVPSTIDWRKKGAVTPIKDQQQCGCCWAFSAVAATEGITQITTGKLI 167
Query: 179 SLSEQELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEME 237
SLSEQEL+DCDT N GC+GGLMD AF++I G L E YPY ++GTC KKE
Sbjct: 168 SLSEQELVDCDTGGENQGCSGGLMDDAFRFIKIHG-LASEATYPYEGDDGTCNSKKEAHP 226
Query: 238 VVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAA 297
I GY+DVP N+E++L KA+AHQPV+VAI+A G +FQFY+ GVFTG CG ELDHGVAA
Sbjct: 227 AAKIKGYEDVPANNEKALQKAVAHQPVAVAIDAGGFEFQFYTSGVFTGQCGTELDHGVAA 286
Query: 298 VGYG-KSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
VGYG G Y +VKNSWG WGE GYIRM+R+ EGLCGI AS P
Sbjct: 287 VGYGIGDDGMMYWLVKNSWGTGWGEEGYIRMQRDVTAKEGLCGIAMQASYP 337
>gi|18413507|ref|NP_567377.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|30315953|sp|Q9SUS9.1|CPR4_ARATH RecName: Full=Probable cysteine proteinase At4g11320; Flags:
Precursor
gi|5596478|emb|CAB51416.1| drought-inducible cysteine proteinase RD21A precursor-like protein
[Arabidopsis thaliana]
gi|7267831|emb|CAB81233.1| drought-inducible cysteine proteinase RD21A precursor-like protein
[Arabidopsis thaliana]
gi|14334764|gb|AAK59560.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|15293257|gb|AAK93739.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|332657596|gb|AEE82996.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 371
Score = 337 bits (863), Expect = 7e-90, Method: Compositional matrix adjust.
Identities = 170/361 (47%), Positives = 226/361 (62%), Gaps = 16/361 (4%)
Query: 4 FSHSKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMD----------KLIELFESWMS 53
++ S +L+ L+L + +C++ A D S+V + H + + +FESWM
Sbjct: 3 YAKSAMLIFLLALVIASCAT-AMDMSVVSSNDNHHVTAGPGRRQGIFDAEATLMFESWMV 61
Query: 54 KHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKP 113
KHGK Y + EK R IF++NL+ I RN E SY LGLN FAD+S E+ G P
Sbjct: 62 KHGKVYDSVAEKERRLTIFEDNLRFITNRNAENLSYRLGLNRFADLSLHEYGEICHGADP 121
Query: 114 QFPTRR---QPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGIN 170
+ P S + D LPKSVDWR +GAVT VK+QG C SCWAFSTV AVEG+N
Sbjct: 122 RPPRNHVFMTSSNRYKTSDGDVLPKSVDWRNEGAVTEVKDQGLCRSCWAFSTVGAVEGLN 181
Query: 171 QIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCE 230
+IV+G L +LSEQ+LI+C+ NNGC GG ++ A+++I+ +GGL + DYPY G CE
Sbjct: 182 KIVTGELVTLSEQDLINCNKE-NNGCGGGKVETAYEFIMNNGGLGTDNDYPYKALNGVCE 240
Query: 231 DK-KEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGA 289
+ KE+ + V I GY+++P NDE +L+KA+AHQPV+ +++S +FQ Y GVF G CG
Sbjct: 241 GRLKEDNKNVMIDGYENLPANDEAALMKAVAHQPVTAVVDSSSREFQLYESGVFDGTCGT 300
Query: 290 ELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
L+HGV VGYG G DY IVKNS G WGE GY++M RN P GLCGI AS PLK
Sbjct: 301 NLNHGVVVVGYGTENGRDYWIVKNSRGDTWGEAGYMKMARNIANPRGLCGIAMRASYPLK 360
Query: 350 K 350
Sbjct: 361 N 361
>gi|50355621|dbj|BAD29959.1| cysteine protease [Daucus carota]
Length = 361
Score = 337 bits (863), Expect = 7e-90, Method: Compositional matrix adjust.
Identities = 175/347 (50%), Positives = 223/347 (64%), Gaps = 15/347 (4%)
Query: 4 FSHSKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIE 63
F H + L L L +AC + + PE + E E WM ++G+ YK
Sbjct: 25 FKHFMIAALIL-LGAWACQATSRTL------PEA-----SMFERHEQWMIQYGRVYKDEA 72
Query: 64 EKLHRFEIFKENLKHIDQRNKE-VTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPS 122
EK RF+IF +N+K I++ NK+ SY L +NEFAD ++EEF+ G K +R +
Sbjct: 73 EKSVRFQIFMDNVKFIEEFNKDGRQSYKLAVNEFADQTNEEFQASRNGYKMAVSSRPSQT 132
Query: 123 AEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSE 182
F Y +V A+P S+DWRKKGAVTPVK+QG CGSCWAFST+AA EGI ++ +G L SLSE
Sbjct: 133 TLFRYENVTAVPSSMDWRKKGAVTPVKDQGQCGSCWAFSTIAATEGITKLKTGKLISLSE 192
Query: 183 QELIDCD-TSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTI 241
QEL+DCD T + GC GG M+ F++IV + G+ E YPY +GTC K+E I
Sbjct: 193 QELVDCDKTGEDQGCEGGYMEDGFEFIVKNKGIALEASYPYTAADGTCNSKEEASRAAKI 252
Query: 242 SGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYG 301
SGY+ VP N E +LLKA+A+QPVSV+I+ASG FQFYS GVFTG CG +LDHGV AVGYG
Sbjct: 253 SGYEKVPANSETALLKAVANQPVSVSIDASGVAFQFYSSGVFTGECGTDLDHGVTAVGYG 312
Query: 302 K-SKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
K S G+ Y +VKNSWG WG+ GYI M+R GLCGI AS P
Sbjct: 313 KTSDGTKYWLVKNSWGASWGDSGYIMMQRGVAAKGGLCGIAMDASYP 359
>gi|218183|dbj|BAA14403.1| oryzain beta precursor [Oryza sativa Japonica Group]
Length = 471
Score = 336 bits (862), Expect = 8e-90, Method: Compositional matrix adjust.
Identities = 169/338 (50%), Positives = 223/338 (65%), Gaps = 14/338 (4%)
Query: 25 AHDFSIVGYSPEHLT-------SMDKLIELFESWMSKHG--KTYKCIEEKLHRFEIFKEN 75
A D SI+ Y+ EH + + ++ W++++G E RF +F +N
Sbjct: 21 ASDMSIISYNAEHGARGLEEGPTEAEARAAYDLWLAENGGGSPNALGGEHERRFLVFWDN 80
Query: 76 LKHIDQRN---KEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKA 132
LK +D N E + LG+N FAD+++EEF+ +LG K R + + V+
Sbjct: 81 LKFVDAHNARADEGGGFRLGMNRFADLTNEEFRATFLGAKVA-ERSRAAGERYRHDGVEE 139
Query: 133 LPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSF 192
LP+SVDWR+KGAV PVKNQG CGSCWAFS V+ VE INQ+V+G + +LSEQEL++C T+
Sbjct: 140 LPESVDWREKGAVAPVKNQGQCGSCWAFSAVSTVESINQLVTGEMITLSEQELVECSTNG 199
Query: 193 -NNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPEND 251
N+GCNGGLM AF +I+ +GG+ E+DYPY +G C+ +E +VV+I G++DVP+ND
Sbjct: 200 QNSGCNGGLMADAFDFIIKNGGIDTEDDYPYKAVDGKCDINRENAKVVSIDGFEDVPQND 259
Query: 252 EQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIV 311
E+SL KA+AHQPVSVAIEA G +FQ Y GVF+G CG LDHGV AVGYG G DY IV
Sbjct: 260 EKSLQKAVAHQPVSVAIEAGGREFQLYHSGVFSGRCGTSLDHGVVAVGYGTDNGKDYWIV 319
Query: 312 KNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
+NSWGPKWGE GY+RM+RN G CGI MAS P K
Sbjct: 320 RNSWGPKWGESGYVRMERNINVTTGKCGIAMMASYPTK 357
>gi|46395939|sp|Q94B08.2|GCP1_ARATH RecName: Full=Germination-specific cysteine protease 1; Flags:
Precursor
gi|4006883|emb|CAB16767.1| cysteine proteinase [Arabidopsis thaliana]
gi|7270637|emb|CAB80354.1| cysteine proteinase [Arabidopsis thaliana]
Length = 376
Score = 336 bits (862), Expect = 8e-90, Method: Compositional matrix adjust.
Identities = 166/321 (51%), Positives = 221/321 (68%), Gaps = 14/321 (4%)
Query: 42 DKLIELFESWMSKHGKTYKCIE----EKLHRFEIFKENLKHIDQRNKEV--TSYWLGLNE 95
+++ ++ W ++HGKT ++ RF IFK+NL+ ID N++ +Y LGL +
Sbjct: 43 EEVRSIYLQWSAEHGKTNNNNNGIINDQDKRFNIFKDNLRFIDLHNEDNKNATYKLGLTK 102
Query: 96 FADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYR------DVKALPKSVDWRKKGAVTPVK 149
F D++++E++ YLG + + P RR A+ + + K +P++VDWR+KGAV P+K
Sbjct: 103 FTDLTNDEYRKLYLGARTE-PARRIAKAKNVNQKYSAAVNGKEVPETVDWRQKGAVNPIK 161
Query: 150 NQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIV 209
+QG+CGSCWAFST AAVEGIN+IV+G L SLSEQEL+DCD S+N GCNGGLMDYAF++I+
Sbjct: 162 DQGTCGSCWAFSTTAAVEGINKIVTGELISLSEQELVDCDKSYNQGCNGGLMDYAFQFIM 221
Query: 210 ASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIE 269
+GGL+ E+DYPY G C + VV+I GY+DVP DE +L KA+++QPVSVAIE
Sbjct: 222 KNGGLNTEKDYPYRGFGGKCNSFLKNSRVVSIDGYEDVPTKDETALKKAISYQPVSVAIE 281
Query: 270 ASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKR 329
A G FQ Y G+FTG CG LDH V AVGYG G DY IV+NSWGP+WGE GYIRM+R
Sbjct: 282 AGGRIFQHYQSGIFTGSCGTNLDHAVVAVGYGSENGVDYWIVRNSWGPRWGEEGYIRMER 341
Query: 330 N-TGKPEGLCGINKMASIPLK 349
N G CGI AS P+K
Sbjct: 342 NLAASKSGKCGIAVEASYPVK 362
>gi|404312774|pdb|3TNX|A Chain A, Structure Of The Precursor Of A Thermostable Variant Of
Papain At 2.6 Angstroem Resolution
gi|404312775|pdb|3TNX|C Chain C, Structure Of The Precursor Of A Thermostable Variant Of
Papain At 2.6 Angstroem Resolution
gi|428698029|pdb|3USV|A Chain A, Structure Of The Precursor Of A Thermostable Variant Of
Papain At 3.8 A Resolution From A Crystal Soaked At Ph 4
gi|428698030|pdb|3USV|C Chain C, Structure Of The Precursor Of A Thermostable Variant Of
Papain At 3.8 A Resolution From A Crystal Soaked At Ph 4
Length = 363
Score = 336 bits (862), Expect = 8e-90, Method: Compositional matrix adjust.
Identities = 172/329 (52%), Positives = 223/329 (67%), Gaps = 17/329 (5%)
Query: 27 DFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEV 86
DFSIVGYS LTS ++LI+LFESWM KH K YK I+EK++RFEIFK+NLK+ID+ NK+
Sbjct: 45 DFSIVGYSQNDLTSTERLIQLFESWMLKHNKIYKNIDEKIYRFEIFKDNLKYIDETNKKN 104
Query: 87 TSYWLGLNEFADMSHEEFKNKYLG-LKPQFPTRRQPSAEFSYRDV-----KALPKSVDWR 140
SYWLGLN FADMS++EFK KY G + + T E SY +V +P+ VDWR
Sbjct: 105 NSYWLGLNVFADMSNDEFKEKYTGSIAGNYTT-----TELSYEEVLNDGDVNIPEYVDWR 159
Query: 141 KKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGL 200
+KGAVTPVKNQGSCGS WAFS V+ +E I +I +GNL SEQEL+DCD + GCNGG
Sbjct: 160 QKGAVTPVKNQGSCGSAWAFSAVSTIESIIKIRTGNLNEYSEQELLDCDRR-SYGCNGGY 218
Query: 201 MDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALA 260
A + +VA G+H YPY + C +++ G + V +E +LL ++A
Sbjct: 219 PWSALQ-LVAQYGIHYRNTYPYEGVQRYCRSREKGPYAAKTDGVRQVQPYNEGALLYSIA 277
Query: 261 HQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWG 320
+QPVSV +EA+G DFQ Y GG+F GPCG ++DH VAAVGY G +YI+++NSWG WG
Sbjct: 278 NQPVSVVLEAAGKDFQLYRGGIFVGPCGNKVDHAVAAVGY----GPNYILIRNSWGTGWG 333
Query: 321 ERGYIRMKRNTGKPEGLCGINKMASIPLK 349
E GYIR+KR TG G+CG+ + P+K
Sbjct: 334 ENGYIRIKRGTGNSYGVCGLYTSSFYPVK 362
>gi|118127|sp|P25251.1|CYSP4_BRANA RecName: Full=Cysteine proteinase COT44; Flags: Precursor
Length = 328
Score = 336 bits (861), Expect = 1e-89, Method: Compositional matrix adjust.
Identities = 162/317 (51%), Positives = 216/317 (68%), Gaps = 13/317 (4%)
Query: 45 IELFESWMSKHGKTYK----CIEEKLHRFEIFKENLKHID--QRNKEVTSYWLGLNEFAD 98
+ ++ W +HGK+ I ++ RF IFK+NL+ ID N + +Y LGL FA+
Sbjct: 1 MSIYLRWSLEHGKSNSNSNGIINQQDERFNIFKDNLRFIDLHNENNKNATYKLGLTIFAN 60
Query: 99 MSHEEFKNKYLGLKPQFPTRRQPSAE------FSYRDVKALPKSVDWRKKGAVTPVKNQG 152
++++E+++ YLG + + P RR A+ + +V +P +VDWR+KGAV +K+QG
Sbjct: 61 LTNDEYRSLYLGARTE-PVRRITKAKNVNMKYSAAVNVDEVPVTVDWRQKGAVNAIKDQG 119
Query: 153 SCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASG 212
+CGSCWAFST AAVEGIN+IV+G L SLSEQEL+DCD S+N GCNGGLMDYAF++I+ +G
Sbjct: 120 TCGSCWAFSTAAAVEGINKIVTGELVSLSEQELVDCDKSYNQGCNGGLMDYAFQFIMKNG 179
Query: 213 GLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASG 272
GL+ E+DYPY G C + VVTI GY+DVP DE +L +A+++QPVSVAI+A G
Sbjct: 180 GLNTEKDYPYHGTNGKCNSLLKNSRVVTIDGYEDVPSKDETALKRAVSYQPVSVAIDAGG 239
Query: 273 TDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTG 332
FQ Y G+FTG CG +DH V AVGYG G DY IV+NSWG +WGE GYIRM+RN
Sbjct: 240 RAFQHYQSGIFTGKCGTNMDHAVVAVGYGSENGVDYWIVRNSWGTRWGEDGYIRMERNVA 299
Query: 333 KPEGLCGINKMASIPLK 349
G CGI AS P+K
Sbjct: 300 SKSGKCGIAIEASYPVK 316
>gi|297843784|ref|XP_002889773.1| hypothetical protein ARALYDRAFT_471096 [Arabidopsis lyrata subsp.
lyrata]
gi|297335615|gb|EFH66032.1| hypothetical protein ARALYDRAFT_471096 [Arabidopsis lyrata subsp.
lyrata]
Length = 439
Score = 336 bits (861), Expect = 1e-89, Method: Compositional matrix adjust.
Identities = 165/307 (53%), Positives = 206/307 (67%), Gaps = 3/307 (0%)
Query: 46 ELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT-SYWLGLNEFADMSHEEF 104
ELF+ W +HGKTY EE+ R +IFK+N + Q N +Y L LN FAD++H EF
Sbjct: 30 ELFDDWCQRHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADLTHHEF 89
Query: 105 KNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVA 164
K LGL + S S +P SVDWRKKGAVT VK+QGSCG+CW+FS
Sbjct: 90 KASRLGLSVSASSLIMASKGQSLGGNAKVPDSVDWRKKGAVTNVKDQGSCGACWSFSATG 149
Query: 165 AVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLM 224
A+EGINQIV+G+L SLSEQELIDCD S+N GCNGGLMDYAF++++ + G+ E+DYPY
Sbjct: 150 AMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNHGIDTEKDYPYQE 209
Query: 225 EEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYS--GGV 282
+GTC+ K + +VVTI Y V NDE++L +A+A QPVSV I S FQ YS G+
Sbjct: 210 RDGTCKKDKLKQKVVTIDSYAGVKSNDEKALREAVAAQPVSVGICGSERAFQLYSRVSGI 269
Query: 283 FTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINK 342
F+GPC LDH V VGYG G DY IVKNSWG WG G++ M+RNTG EG+CGIN
Sbjct: 270 FSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKSWGMDGFMHMQRNTGNSEGICGINM 329
Query: 343 MASIPLK 349
+AS P+K
Sbjct: 330 LASYPIK 336
>gi|144905108|dbj|BAF56428.1| cysteine proteinase [Lotus japonicus]
Length = 342
Score = 336 bits (861), Expect = 1e-89, Method: Compositional matrix adjust.
Identities = 176/347 (50%), Positives = 227/347 (65%), Gaps = 15/347 (4%)
Query: 7 SKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMD-KLIELFESWMSKHGKTYKCIEEK 65
SK +L SL+L F + + T D L E E WM+++GK Y EK
Sbjct: 3 SKTVLNISSLALLLV------FGFLAFEANARTLEDVSLKERHEQWMTQYGKVYTDSYEK 56
Query: 66 LHRFEIFKENLKHIDQRNKEVTS-YWLGLNEFADMSHEEFK--NKYLGLKPQFPTRRQPS 122
R IFKEN++ I+ N Y LG+N+FAD+++EEFK N++ G TR +
Sbjct: 57 ELRSNIFKENVQRIEAFNNAGNKPYKLGINQFADLTNEEFKARNRFKGHMCSNSTR---T 113
Query: 123 AEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSE 182
F Y DV ++P S+DWR+KGAVTP+K+QG CG CWAFS VAA EGI ++ +G L SLSE
Sbjct: 114 PTFKYEDVSSVPASLDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGITKLSTGKLISLSE 173
Query: 183 QELIDCDT-SFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTI 241
QEL+DCDT + GC GGLMD AFK+I+ + GL+ E YPY + TC E + +I
Sbjct: 174 QELVDCDTKGVDQGCEGGLMDDAFKFIMQNKGLNTEAKYPYQGVDATCNANAEAKDAASI 233
Query: 242 SGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYG 301
G++DVP N E +LLKA+A+QP+SVAI+ASG++FQFYS G+FTG CG ELDHGV AVGYG
Sbjct: 234 KGFEDVPANSESALLKAVANQPISVAIDASGSEFQFYSSGLFTGSCGTELDHGVTAVGYG 293
Query: 302 KS-KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
S G+ Y +VKNSWG +WGE GYIRM+R+ EGLCGI AS P
Sbjct: 294 VSDDGTKYWLVKNSWGEQWGEEGYIRMQRDVAAEEGLCGIAMQASYP 340
>gi|224099295|ref|XP_002334495.1| predicted protein [Populus trichocarpa]
gi|222872550|gb|EEF09681.1| predicted protein [Populus trichocarpa]
Length = 342
Score = 335 bits (860), Expect = 1e-89, Method: Compositional matrix adjust.
Identities = 161/302 (53%), Positives = 208/302 (68%), Gaps = 3/302 (0%)
Query: 49 ESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTS-YWLGLNEFADMSHEEFKNK 107
E WM+ +GK Y EK RF+IFK N+++I+ N Y L +N+FAD ++E+FK
Sbjct: 39 EQWMATYGKVYVDAAEKERRFKIFKNNVEYIESFNTAGNKPYKLSVNKFADQTNEKFKGA 98
Query: 108 YLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVE 167
G + F TR F Y +V A+P ++DWRKKGAVTP+K+QG CGSCWAFSTVAA E
Sbjct: 99 RNGYRRPFQTRPMKVTSFKYENVTAVPATMDWRKKGAVTPIKDQGQCGSCWAFSTVAATE 158
Query: 168 GINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEE 226
GINQ+ +G L SLSEQEL+DCD + GC GGLM+ F++I+ + G+ E +YPY +
Sbjct: 159 GINQLTTGKLVSLSEQELVDCDNQGEDQGCEGGLMEDGFEFIIKNHGITTEANYPYQAAD 218
Query: 227 GTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGP 286
GTC KK+ + I+GY+ VP N E LLK +A+QP+SV+I+A G+DFQFYS GVFTG
Sbjct: 219 GTCNSKKQASHIAKITGYESVPANSEAELLKVVANQPISVSIDAGGSDFQFYSSGVFTGK 278
Query: 287 CGAELDHGVAAVGYGK-SKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMAS 345
CG ELDHGV AVGYG+ S G+ Y +VKNSW WGE GYIRM+R+ EGLCGI +S
Sbjct: 279 CGTELDHGVTAVGYGETSDGTKYWLVKNSWXTSWGEEGYIRMQRDIDAEEGLCGIAMDSS 338
Query: 346 IP 347
P
Sbjct: 339 YP 340
>gi|297802228|ref|XP_002868998.1| cysteine proteinase [Arabidopsis lyrata subsp. lyrata]
gi|297314834|gb|EFH45257.1| cysteine proteinase [Arabidopsis lyrata subsp. lyrata]
Length = 375
Score = 335 bits (859), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 169/322 (52%), Positives = 221/322 (68%), Gaps = 16/322 (4%)
Query: 42 DKLIELFESWMSKHGKTYKCIE----EKLHRFEIFKENLKHID---QRNKEVTSYWLGLN 94
+++ ++ W + HGKT ++ RF IFK+NL+ ID ++NK T Y LGL
Sbjct: 43 EEVRSIYLQWSADHGKTNNNNNGIINDQDKRFNIFKDNLRFIDLHNEKNKNAT-YKLGLT 101
Query: 95 EFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYR------DVKALPKSVDWRKKGAVTPV 148
+F D+++EE+++ YLG + + P RR A+ + D K +P++VDWR KGAV P+
Sbjct: 102 KFTDLTNEEYRSLYLGARTE-PVRRIAKAKNVNQKYSAAVDGKEVPETVDWRLKGAVNPI 160
Query: 149 KNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYI 208
K+QG+CGSCWAFST AAVEGIN+IV+G L SLSEQEL+DCD S+N GCNGGLMDYAF++I
Sbjct: 161 KDQGTCGSCWAFSTAAAVEGINKIVTGELISLSEQELVDCDNSYNQGCNGGLMDYAFQFI 220
Query: 209 VASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAI 268
+ +GGL E+DYPY G C + +VV+I GY+DVP DE +L +A++ QPVSVAI
Sbjct: 221 MKNGGLKTEKDYPYRGFGGKCNSFLKNAKVVSIDGYEDVPTKDETALKRAISLQPVSVAI 280
Query: 269 EASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMK 328
EA G FQ Y G+FTG CG LDH V AVGYG G DY IV+NSWGP+WGE GYIRM+
Sbjct: 281 EAGGRIFQHYQTGIFTGNCGTNLDHAVVAVGYGSENGVDYWIVRNSWGPRWGEEGYIRME 340
Query: 329 RNTGKPE-GLCGINKMASIPLK 349
RN + G CGI AS P+K
Sbjct: 341 RNLASSKSGKCGIAVEASYPVK 362
>gi|356543076|ref|XP_003539989.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 335 bits (859), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 176/351 (50%), Positives = 230/351 (65%), Gaps = 22/351 (6%)
Query: 7 SKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMD----KLIELFESWMSKHGKTYKCI 62
+K +SL+L C +G+ +TS + E E WM+++ K YK
Sbjct: 3 AKNQFYHISLALLFC---------LGFWAFQVTSRTLQDASMYERHEEWMARYAKVYKDP 53
Query: 63 EEKLHRFEIFKENLKHIDQRNKEVTS-YWLGLNEFADMSHEEF---KNKYLGLKPQFPTR 118
EE+ RF+IFKEN+ +I+ N Y LG+N+FAD+++EEF +NK+ G TR
Sbjct: 54 EEREKRFKIFKENVNYIEAFNNAADKPYKLGINQFADLTNEEFIAPRNKFKGHMCSSITR 113
Query: 119 RQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLT 178
+ F Y +V ALP +VDWR+KGAVTP+K+QG CG CWAFS VAA EGI+ + SG L
Sbjct: 114 ---TTTFKYENVTALPSTVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALNSGKLI 170
Query: 179 SLSEQELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEME 237
SLSEQE++DCDT + GC GG MD AFK+I+ + GL+ E +YPY +G C +
Sbjct: 171 SLSEQEVVDCDTKGEDQGCAGGFMDGAFKFIIQNHGLNTEANYPYKAVDGKCNANEAANH 230
Query: 238 VVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAA 297
TI+GY+DVP N+E++L KA+A+QPVSVAI+ASG+DFQFY GVFTG CG +LDHGV A
Sbjct: 231 AATITGYEDVPVNNEKALQKAVANQPVSVAIDASGSDFQFYKTGVFTGSCGTQLDHGVTA 290
Query: 298 VGYGKSK-GSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
VGYG S G+ Y +VKNSWG +WGE GYI M+R EGLCGI MAS P
Sbjct: 291 VGYGVSADGTQYWLVKNSWGTEWGEEGYIMMQRGVKAQEGLCGIAMMASYP 341
>gi|110737959|dbj|BAF00916.1| cysteine proteinase [Arabidopsis thaliana]
Length = 376
Score = 335 bits (859), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 165/321 (51%), Positives = 219/321 (68%), Gaps = 14/321 (4%)
Query: 42 DKLIELFESWMSKHGKTYKCIE----EKLHRFEIFKENLKHID--QRNKEVTSYWLGLNE 95
+++ ++ W ++HGKT ++ RF IFK+NL+ ID N + +Y LGL +
Sbjct: 43 EEVRSIYLQWSAEHGKTNNNNNGIINDQDKRFNIFKDNLRFIDLHNENNKNATYKLGLTK 102
Query: 96 FADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYR------DVKALPKSVDWRKKGAVTPVK 149
F D++++E++ YLG + + P RR A+ + + K +P++VDWR+KGAV P+K
Sbjct: 103 FTDLTNDEYRKLYLGARTE-PARRIAKAKNVNQKYSAAVNGKEVPETVDWRQKGAVNPIK 161
Query: 150 NQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIV 209
+QG+CGSCWAFST AAVEGIN+IV+G L SLSEQEL+DCD S+N GCNGGLMDYAF++I+
Sbjct: 162 DQGTCGSCWAFSTTAAVEGINKIVTGELISLSEQELVDCDKSYNQGCNGGLMDYAFQFIM 221
Query: 210 ASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIE 269
+GGL+ E+DYPY G C + VV+I GY+DVP DE +L KA+++QPV VAIE
Sbjct: 222 KNGGLNTEKDYPYRGFGGKCNSFLKNSRVVSIDGYEDVPTKDETALKKAISYQPVRVAIE 281
Query: 270 ASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKR 329
A G FQ Y G+FTG CG LDH V AVGYG G DY IV+NSWGP+WGE GYIRM+R
Sbjct: 282 AGGRIFQHYQSGIFTGSCGTNLDHAVVAVGYGSENGVDYWIVRNSWGPRWGEEGYIRMER 341
Query: 330 N-TGKPEGLCGINKMASIPLK 349
N G CGI AS P+K
Sbjct: 342 NLAASKSGKCGIAVEASYPVK 362
>gi|359359166|gb|AEV41071.1| putative oryzain beta chain precursor [Oryza minuta]
Length = 464
Score = 335 bits (859), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 166/331 (50%), Positives = 221/331 (66%), Gaps = 9/331 (2%)
Query: 27 DFSIVGYSPEHLTSMDKLIEL-----FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQ 81
D SI+ Y+ EH + E ++ W++++G++Y + E RF +F +NL+ D
Sbjct: 27 DMSIISYNAEHGARGLERTEAEARAAYDLWLAENGRSYNALGEHERRFRVFWDNLRFADA 86
Query: 82 RNKEVTS--YWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDW 139
N + LG+N FAD+++EEF+ +LG K R + + V+ LP+SVDW
Sbjct: 87 HNARADDHGFRLGMNRFADLTNEEFRATFLGAK-VVERSRAAGERYRHDGVEELPESVDW 145
Query: 140 RKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGG 199
R+KGAV PVKNQG CGSCWAFS V+ VE INQ+V+G + +LSEQEL++C T+ NG G
Sbjct: 146 REKGAVAPVKNQGQCGSCWAFSAVSTVESINQLVTGEMITLSEQELVECSTNGQNGGCNG 205
Query: 200 -LMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKA 258
LMD AF +I+ +GG+ E+DYPY +G C+ +E +VV+I G++DVP+NDE+SL KA
Sbjct: 206 GLMDDAFDFIIKNGGIDTEDDYPYKAVDGKCDINRENAKVVSIDGFEDVPQNDEKSLQKA 265
Query: 259 LAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPK 318
+AHQPVSVAIEA G +FQ Y GVF+G CG LDHGV AVGYG G DY IV+NSWGPK
Sbjct: 266 VAHQPVSVAIEAGGREFQLYHSGVFSGRCGTSLDHGVVAVGYGTDNGKDYWIVRNSWGPK 325
Query: 319 WGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
WGE GY+RM+RN G CGI MAS P K
Sbjct: 326 WGESGYVRMERNINVTTGKCGIAMMASYPTK 356
>gi|255032|gb|AAB23155.1| COT44=cysteine proteinase homolog [Brassica napus, seedling, rapid
cycling base population CrGC5, Peptide, 328 aa]
Length = 328
Score = 335 bits (859), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 163/318 (51%), Positives = 217/318 (68%), Gaps = 15/318 (4%)
Query: 45 IELFESWMSKHGKTYK----CIEEKLHRFEIFKENLKHID--QRNKEVTSYWLGLNEFAD 98
+ ++ W +HGK+ I ++ RF IFK+NL+ ID N + +Y LGL FA+
Sbjct: 1 MSIYLRWSLEHGKSNSNSNGIINQQDERFNIFKDNLRFIDLHNENNKNATYKLGLTIFAN 60
Query: 99 MSHEEFKNKYLGLKPQFPTRRQPSAE-------FSYRDVKALPKSVDWRKKGAVTPVKNQ 151
++++E+++ YLG + + P RR A+ + DV+ +P +VDWR+KGAV +K+Q
Sbjct: 61 LTNDEYRSLYLGARTE-PVRRITKAKNVNMKYSAAVNDVE-VPVTVDWRQKGAVNAIKDQ 118
Query: 152 GSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVAS 211
G+CGSCWAFST AAVEGIN+IV+G L SLSEQEL+DCD S+N GCNGGLMDYAF++I+ +
Sbjct: 119 GTCGSCWAFSTAAAVEGINKIVTGELVSLSEQELVDCDKSYNQGCNGGLMDYAFQFIMKN 178
Query: 212 GGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEAS 271
GGL+ E+DYPY G C + VVTI GY+DVP DE +L +A+++QPVSVAI+A
Sbjct: 179 GGLNTEKDYPYHGTNGKCNSLLKNSRVVTIDGYEDVPSKDETALKRAVSYQPVSVAIDAG 238
Query: 272 GTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNT 331
G FQ Y G+FTG CG +DH V AVGYG G DY IV+NSWG +WGE GYIRM+RN
Sbjct: 239 GRAFQHYQSGIFTGKCGTNMDHAVVAVGYGSENGVDYWIVRNSWGTRWGEDGYIRMERNV 298
Query: 332 GKPEGLCGINKMASIPLK 349
G CGI AS P+K
Sbjct: 299 ASKSGKCGIAIEASYPVK 316
>gi|312282059|dbj|BAJ33895.1| unnamed protein product [Thellungiella halophila]
Length = 379
Score = 335 bits (859), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 170/366 (46%), Positives = 225/366 (61%), Gaps = 24/366 (6%)
Query: 7 SKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMD------------------KLIELF 48
S LL+L L++ + +C++ A D S+V Y H + + +F
Sbjct: 6 SALLILLLAMVIASCAT-AMDMSVVTYDDNHHVTAGPGHHVTAGPGRRNGVFDVEASLIF 64
Query: 49 ESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKY 108
ESW+ KHGK Y + EK R IFK+NL+ I RN E Y LGLN FAD+S E+K
Sbjct: 65 ESWIVKHGKVYDSVAEKERRLTIFKDNLRFITNRNSENLGYRLGLNRFADLSLHEYKEIC 124
Query: 109 LGLKPQFPTRR---QPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAA 165
G P+ P S + LPKSVDWR +GAVT VK+QG C SCWAFSTV A
Sbjct: 125 HGADPKPPRNHVFMSSSDRYKTSAGDVLPKSVDWRNEGAVTEVKDQGHCRSCWAFSTVGA 184
Query: 166 VEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLME 225
VEG+N+IV+G L +LSEQ+LI+C+ NNGC GG ++ A+++IV++GGL + DYPY
Sbjct: 185 VEGLNKIVTGELVTLSEQDLINCNKE-NNGCGGGKVETAYEFIVSNGGLGTDNDYPYKAV 243
Query: 226 EGTCEDK-KEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFT 284
G C+ + KE ++ V I GY+++P NDE +L+KA+AHQPV+ I++S +FQ Y GVF
Sbjct: 244 NGACDGRLKENIKNVMIDGYENLPANDELALMKAVAHQPVTAVIDSSSREFQLYESGVFD 303
Query: 285 GPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMA 344
G CG L+HGV VGYG G +Y IV+NSWG WGE GY++M RN P GLCGI
Sbjct: 304 GRCGTNLNHGVVVVGYGTENGRNYWIVRNSWGNTWGEAGYMKMARNIANPRGLCGIAMRV 363
Query: 345 SIPLKK 350
S PLK
Sbjct: 364 SYPLKN 369
>gi|24285904|gb|AAL14199.1| cysteine proteinase precursor [Ipomoea batatas]
gi|56961686|gb|AAK15148.2| cysteine proteinase-like protein [Ipomoea batatas]
Length = 341
Score = 335 bits (859), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 170/324 (52%), Positives = 220/324 (67%), Gaps = 6/324 (1%)
Query: 28 FSIVGYSPEHLTSMDKLIEL-FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEV 86
F+ Y T D L+ + E WM+++G+ Y+ EK RF IFKEN+++I+ NK
Sbjct: 18 FATSAYLATSRTLSDSLMVVRHEQWMAQYGRVYENEVEKTKRFNIFKENVEYIESFNKAG 77
Query: 87 TS-YWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAV 145
T Y LG+N FAD++++EFK G K P + F Y +V ++P +VDWR KGAV
Sbjct: 78 TKPYKLGINAFADLTNQEFKASRNGYK--LPHDCSSNTPFRYENVSSVPTTVDWRTKGAV 135
Query: 146 TPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDT-SFNNGCNGGLMDYA 204
TPVK+QG CG CWAFS VAA+EGI ++ +GNL SLSEQEL+DCD + GC GGLMD A
Sbjct: 136 TPVKDQGQCGCCWAFSAVAAMEGITKLSTGNLISLSEQELVDCDVKGIDQGCEGGLMDDA 195
Query: 205 FKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPV 264
F +I+ + GL E +YPY +G+C+ K ISGY+DVP N E +L KA+A+QPV
Sbjct: 196 FSFIINNKGLTTESNYPYQGTDGSCKKSKSSNSAAKISGYEDVPANSESALEKAVANQPV 255
Query: 265 SVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSK-GSDYIIVKNSWGPKWGERG 323
SVAI+A G+DFQFYS GVFTG CG ELDHGV AVGYG ++ GS Y +VKNSWG WGE+G
Sbjct: 256 SVAIDAGGSDFQFYSSGVFTGECGTELDHGVTAVGYGIAEDGSKYWLVKNSWGTSWGEKG 315
Query: 324 YIRMKRNTGKPEGLCGINKMASIP 347
YIRM+++ EGLCGI +S P
Sbjct: 316 YIRMQKDIEAKEGLCGIAMQSSYP 339
>gi|356543038|ref|XP_003539970.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 335 bits (859), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 175/351 (49%), Positives = 230/351 (65%), Gaps = 22/351 (6%)
Query: 7 SKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMD----KLIELFESWMSKHGKTYKCI 62
+K +SL+L C +G+ +TS + E E WM+++ K YK
Sbjct: 3 AKNQFYHISLALLFC---------LGFWAFQVTSRTLQDASMYERHEEWMARYAKVYKDP 53
Query: 63 EEKLHRFEIFKENLKHIDQRNKEVTS-YWLGLNEFADMSHEEF---KNKYLGLKPQFPTR 118
EE+ RF+IFKEN+ +I+ N Y LG+N+FAD+++EEF +N++ G TR
Sbjct: 54 EEREKRFKIFKENVNYIEAFNNAANKPYKLGINQFADLTNEEFIAPRNRFKGHMCSSITR 113
Query: 119 RQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLT 178
+ F Y +V ALP +VDWR+KGAVTP+K+QG CG CWAFS VAA EGI+ + SG L
Sbjct: 114 ---TTTFKYENVTALPSTVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALNSGKLI 170
Query: 179 SLSEQELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEME 237
SLSEQE++DCDT + GC GG MD AFK+I+ + GL+ E +YPY +G C +
Sbjct: 171 SLSEQEVVDCDTKGEDQGCAGGFMDGAFKFIIQNHGLNTEANYPYKAVDGKCNANEAANH 230
Query: 238 VVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAA 297
TI+GY+DVP N+E++L KA+A+QPVSVAI+ASG+DFQFY GVFTG CG +LDHGV A
Sbjct: 231 AATITGYEDVPVNNEKALQKAVANQPVSVAIDASGSDFQFYKTGVFTGSCGTQLDHGVTA 290
Query: 298 VGYGKSK-GSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
VGYG S G+ Y +VKNSWG +WGE GYI M+R EGLCGI MAS P
Sbjct: 291 VGYGVSADGTQYWLVKNSWGTEWGEEGYIMMQRGVKAQEGLCGIAMMASYP 341
>gi|10336513|dbj|BAB13759.1| cysteine proteinase [Astragalus sinicus]
Length = 343
Score = 335 bits (858), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 172/344 (50%), Positives = 226/344 (65%), Gaps = 14/344 (4%)
Query: 11 LLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFE 70
L +SL+L C L V + L + E + WM ++ K Y +E RF+
Sbjct: 7 LYYISLALLMCLGLW----AVQVTSRTLQDA-SMYERHQQWMGQYAKIYNDHQEWEKRFQ 61
Query: 71 IFKENLKHIDQRNKEVTSYW-LGLNEFADMSHEEF---KNKYLGLKPQFPTRRQPSAEFS 126
IFKEN+ +I+ NKE ++ LG+N+F D+++EEF +N++ G R + +
Sbjct: 62 IFKENVNYIETSNKEGGRFYKLGVNQFVDLTNEEFIAPRNRFKGHMCSSIIR---TNTYK 118
Query: 127 YRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELI 186
Y +V +P +VDWR+KGAVTPVK+QG CG CWAFS VAA EGI+Q+ +G L SLSEQEL+
Sbjct: 119 YENVTTVPSNVDWRQKGAVTPVKDQGQCGCCWAFSAVAATEGIHQLSTGKLISLSEQELV 178
Query: 187 DCDT-SFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQ 245
DCDT + GC GGLMD AFK+I+ + GL E YPY +GTC + + TI+ Y+
Sbjct: 179 DCDTKGVDQGCEGGLMDDAFKFIIQNHGLDTEAKYPYQGVDGTCNANEASINAATITSYE 238
Query: 246 DVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKS-K 304
DVP N+EQ+L KA+A+QP+SVAI+ASG+DFQFY+ GVFTG CG ELDHGV AVGYG S
Sbjct: 239 DVPTNNEQALQKAVANQPISVAIDASGSDFQFYTSGVFTGSCGTELDHGVTAVGYGVSDD 298
Query: 305 GSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPL 348
G+ Y +VKNSWG WGE GYIRM+R EGLCGI AS P+
Sbjct: 299 GTKYWLVKNSWGTSWGEEGYIRMQRGVDAVEGLCGIAMQASYPI 342
>gi|449447027|ref|XP_004141271.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
Length = 458
Score = 335 bits (858), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 177/344 (51%), Positives = 230/344 (66%), Gaps = 14/344 (4%)
Query: 11 LLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCI-EEKLHRF 69
+++L LF S A SI+ P+ + D+++ L++ W +KHGK + + E +RF
Sbjct: 9 IMALLFFLFIALSAASPSSII---PQR--TDDEVMALYDQWRAKHGKLHNNLGAEPENRF 63
Query: 70 EIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRR-QPSAEFSYR 128
IFK+NLK ID+ N + Y LGLN FAD+++EE++++YLG K +RR + S + R
Sbjct: 64 HIFKDNLKFIDEINAQNLPYRLGLNVFADLTNEEYRSRYLGGKFASGSRRNRTSNRYLPR 123
Query: 129 DVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDC 188
LP S+DWR KGAV PVK+QGSCGSCWAFSTVA+VE INQIV+G+L +LSEQEL+DC
Sbjct: 124 LGDDLPDSIDWRAKGAVAPVKDQGSCGSCWAFSTVASVEAINQIVTGDLIALSEQELVDC 183
Query: 189 DTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVP 248
D S+N GCNGGLMDYAF++I+ +GGL EEDYPY + +C K+ I GY+DVP
Sbjct: 184 DRSYNEGCNGGLMDYAFEFIIENGGLDTEEDYPYYGFDSSCIQYKKN----AIDGYEDVP 239
Query: 249 ENDEQSLLKA---LAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKG 305
N+E++L KA VSVAIE G FQ Y G+FTG CG +LDHGV VGYG G
Sbjct: 240 VNNEKALQKAVSKQVVSVVSVAIEGGGRSFQLYQSGIFTGRCGTDLDHGVNVVGYGSEGG 299
Query: 306 SDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
DY IV+NSWG WGE GY++M+RN P GLCGI S P K
Sbjct: 300 VDYWIVRNSWGGSWGESGYVKMQRNIASPTGLCGIAMEPSYPTK 343
>gi|224121800|ref|XP_002330656.1| predicted protein [Populus trichocarpa]
gi|222872260|gb|EEF09391.1| predicted protein [Populus trichocarpa]
Length = 342
Score = 334 bits (857), Expect = 3e-89, Method: Compositional matrix adjust.
Identities = 161/302 (53%), Positives = 208/302 (68%), Gaps = 3/302 (0%)
Query: 49 ESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTS-YWLGLNEFADMSHEEFKNK 107
E WM+ +GK Y EK RF+IFK N+++I+ N Y L +N+FAD ++E+FK
Sbjct: 39 EQWMATYGKVYVDAAEKERRFKIFKNNVEYIESFNTAGNKPYKLSVNKFADQTNEKFKGA 98
Query: 108 YLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVE 167
G + F TR F Y +V A+P ++DWRKKGAVT +K+QG CGSCWAFSTVAA E
Sbjct: 99 RNGYRRPFQTRPMKVTSFKYENVTAVPATMDWRKKGAVTLIKDQGQCGSCWAFSTVAATE 158
Query: 168 GINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEE 226
GINQ+ +G L SLSEQEL+DCD + GC GGLM+ F++I+ + G+ E +YPY +
Sbjct: 159 GINQLTTGKLVSLSEQELVDCDIQGEDQGCEGGLMEDGFEFIIKNHGITTEANYPYQAAD 218
Query: 227 GTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGP 286
GTC KK+ + I+GY+ VP N E LLK +A+QP+SV+I+A G+DFQFYS GVFTG
Sbjct: 219 GTCNSKKQASHIAKITGYESVPANSEAELLKVVANQPISVSIDAGGSDFQFYSSGVFTGK 278
Query: 287 CGAELDHGVAAVGYGK-SKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMAS 345
CG ELDHGV AVGYG+ S G+ Y +VKNSWG WGE GYIRM+R+ EGLCGI +S
Sbjct: 279 CGTELDHGVTAVGYGETSDGTKYWLVKNSWGTSWGEEGYIRMQRDIDTEEGLCGIAMDSS 338
Query: 346 IP 347
P
Sbjct: 339 YP 340
>gi|357167190|ref|XP_003581045.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like
[Brachypodium distachyon]
Length = 415
Score = 334 bits (857), Expect = 4e-89, Method: Compositional matrix adjust.
Identities = 160/333 (48%), Positives = 218/333 (65%), Gaps = 5/333 (1%)
Query: 19 FACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKH 78
F + LA ++ + LT ++ E WM+K+G+ Y + EK R E+FK N+
Sbjct: 82 FLIAILACTCAVSALAARDLTDDLSMVARHEQWMAKYGRVYNDVAEKAQRLEVFKANVAF 141
Query: 79 IDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVK--ALPKS 136
I+ N + L N+FADM+ +EF+ + G KP P + + +F Y +V ALP S
Sbjct: 142 IELVNAGNDKFSLEANQFADMTVDEFRAAHTGYKP-VPANKGRTTQFKYANVSLDALPAS 200
Query: 137 VDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTS-FNNG 195
+DWR KGAVTP+K+QG CG CWAFSTVA+VEGI ++ +G L SLSEQEL+DCD + G
Sbjct: 201 MDWRAKGAVTPIKDQGQCGCCWAFSTVASVEGIVKLSTGKLISLSEQELVDCDVDGMDQG 260
Query: 196 CNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSL 255
C GGLMD AF++I+ +GGL E +YPY + +C KE +V +I GY+DVP NDE SL
Sbjct: 261 CEGGLMDNAFEFIIDNGGLTTEGNYPYTGTDDSCNSNKESNDVASIKGYEDVPSNDETSL 320
Query: 256 LKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYG-KSKGSDYIIVKNS 314
LKA+A QPVS+A++ F+FY GGV +G CG ELDHG+AAVGYG S G+ + ++KNS
Sbjct: 321 LKAVAAQPVSIAVDGGDNLFRFYKGGVLSGACGTELDHGIAAVGYGITSDGTKFWLMKNS 380
Query: 315 WGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
WG WGE+G+IRM+R+ EGLCG+ S P
Sbjct: 381 WGTSWGEKGFIRMERDIADEEGLCGLAMQPSYP 413
>gi|21666724|gb|AAM73806.1|AF448505_1 cysteine proteinase [Brassica napus]
gi|21666726|gb|AAM73807.1|AF448506_1 cysteine proteinase [Brassica napus]
Length = 343
Score = 334 bits (857), Expect = 4e-89, Method: Compositional matrix adjust.
Identities = 164/306 (53%), Positives = 218/306 (71%), Gaps = 11/306 (3%)
Query: 50 SWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT---SYWLGLNEFADMSHEEFKN 106
+WM++HG+ Y EK +R+ +FK N++ I++ N EV ++ L +N+FAD+++EEF++
Sbjct: 39 AWMTEHGRVYADANEKNNRYVVFKRNVESIERLN-EVQYGLTFKLAVNQFADLTNEEFRS 97
Query: 107 KYLGLKPQ--FPTRRQPSAEFSYRDVK--ALPKSVDWRKKGAVTPVKNQGSCGSCWAFST 162
Y G K +R +P++ F Y+ V ALP SVDWRKKGAVTP+K+QGSCGSCWAFS
Sbjct: 98 MYTGYKGNSVLSSRTKPTS-FRYQHVSSDALPISVDWRKKGAVTPIKDQGSCGSCWAFSA 156
Query: 163 VAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPY 222
VAA+EG+ QI G L SLSEQEL+DCDT+ ++GC GG M+ AF Y + +GGL E +YPY
Sbjct: 157 VAAIEGVAQIKKGKLISLSEQELVDCDTN-DDGCMGGYMNSAFNYTMTTGGLTSESNYPY 215
Query: 223 LMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGV 282
+GTC K + +I G++DVP NDE++L+KA+AH PVS+ I GT FQFYS GV
Sbjct: 216 KSTDGTCNINKTKQIATSIKGFEDVPANDEKALMKAVAHHPVSIGIAGGGTGFQFYSSGV 275
Query: 283 FTGPCGAELDHGVAAVGYGK-SKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGIN 341
F+G C LDHGVA VGYGK S GS Y I+KNSWGPKWGERGY+R+K++T G CG+
Sbjct: 276 FSGECSTHLDHGVAVVGYGKSSNGSKYWILKNSWGPKWGERGYMRIKKDTKAKHGQCGLA 335
Query: 342 KMASIP 347
AS P
Sbjct: 336 MNASYP 341
>gi|224076970|ref|XP_002305073.1| predicted protein [Populus trichocarpa]
gi|222848037|gb|EEE85584.1| predicted protein [Populus trichocarpa]
Length = 340
Score = 334 bits (856), Expect = 4e-89, Method: Compositional matrix adjust.
Identities = 158/312 (50%), Positives = 217/312 (69%), Gaps = 5/312 (1%)
Query: 38 LTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQ-RNKEVTSYWLGLNEF 96
L + +++ E WM++HG+ Y ++EK R+ IFKEN++ I+ N Y LG+N+F
Sbjct: 30 LDEQEYMLKRHEEWMAQHGRVYGDMKEKEKRYLIFKENIERIEAFNNGSDRGYKLGVNKF 89
Query: 97 ADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGS 156
AD+++EEF+ Y G K Q + + S+ F Y ++ +P S+DWR GAVTPVK+QG+CG
Sbjct: 90 ADLTNEEFRAMYHGYKRQ--SSKLMSSSFRYENLSDIPTSMDWRNDGAVTPVKDQGTCGC 147
Query: 157 CWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHK 216
CWAFSTVAA+EGI ++ +GNL SLSEQ+L+DC T+ N GC GGLMD AF+YI+ +GGL
Sbjct: 148 CWAFSTVAAIEGIIKLQTGNLISLSEQQLVDC-TAGNKGCQGGLMDTAFQYIIRNGGLTS 206
Query: 217 EEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQ 276
E++YPY +GTC +K I+GY+DVP+N+E +LL+A+A QPVSV ++ G DFQ
Sbjct: 207 EDNYPYQGVDGTCSSEKAASTEAQITGYEDVPQNNENALLQAVAKQPVSVGVDGGGNDFQ 266
Query: 277 FYSGGVFTGPCGAELDHGVAAVGYGKS-KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPE 335
FY GVF G CG + +H V A+GYG G+DY +VKNSWG WGE GY+RM+R G E
Sbjct: 267 FYKSGVFNGDCGTQQNHAVTAIGYGTDIDGTDYWLVKNSWGTSWGENGYMRMRRGIGSSE 326
Query: 336 GLCGINKMASIP 347
GLCG+ AS P
Sbjct: 327 GLCGVAMDASYP 338
>gi|146215990|gb|ABQ10197.1| actinidin Act4a [Actinidia eriantha]
Length = 385
Score = 333 bits (855), Expect = 5e-89, Method: Compositional matrix adjust.
Identities = 159/311 (51%), Positives = 218/311 (70%), Gaps = 6/311 (1%)
Query: 42 DKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT-SYWLGLNEFADMS 100
D++I +FESW+ ++GK+Y + EK RFEIFK+NL+ +D+ N +V SY +GLN+F+D++
Sbjct: 42 DEVIAMFESWLVEYGKSYNALGEKERRFEIFKDNLRFVDEHNADVNRSYKVGLNQFSDLT 101
Query: 101 HEEFKNKYLGLKPQFPTR-RQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWA 159
E+ + YLG K F R S + R LP SVDWRKKGAV VKNQG+CGSCW
Sbjct: 102 DAEYSSIYLGTK--FNIRMTNVSDRYEPRVGDQLPDSVDWRKKGAVLGVKNQGNCGSCWT 159
Query: 160 FSTVAAVEGINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKEE 218
F+++AAVEGIN+IV+GNL SLSEQE++DC + NNGCNGG + A+++I+ +GG++ E
Sbjct: 160 FASIAAVEGINKIVTGNLISLSEQEIVDCQRKYPNNGCNGGTLSGAYQFIINNGGINTEA 219
Query: 219 DYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFY 278
+YPY +G C+ K+ + VTI Y++VP N+E++L KA+A QPVSV I ++ T F+ Y
Sbjct: 220 NYPYTGRDGVCDQNKKNKKYVTIDRYENVPSNNEKALQKAVAFQPVSVVIASNSTAFKSY 279
Query: 279 SGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLC 338
G+F GPCG +DHGV VGYG G DY IV+NSWGP WGE GY+RM+RN G G C
Sbjct: 280 KSGIFNGPCGPRIDHGVTIVGYGTEGGKDYWIVRNSWGPNWGESGYVRMQRNVGG-SGKC 338
Query: 339 GINKMASIPLK 349
I + P+K
Sbjct: 339 FIARAPVYPVK 349
>gi|115484973|ref|NP_001067630.1| Os11g0255300 [Oryza sativa Japonica Group]
gi|530335|emb|CAA56844.1| cysteine protease [Oryza sativa Japonica Group]
gi|5761322|dbj|BAA83472.1| cysteine endopeptidase [Oryza sativa Japonica Group]
gi|62732672|gb|AAX94791.1| cysteine proteinase (EC 3.4.22.-) - rice [Oryza sativa Japonica
Group]
gi|62732673|gb|AAX94792.1| cysteine proteinase (EC 3.4.22.-) - rice [Oryza sativa Japonica
Group]
gi|62732674|gb|AAX94793.1| cysteine proteinase (EC 3.4.22.-) - rice [Oryza sativa Japonica
Group]
gi|77549615|gb|ABA92412.1| Thiol protease SEN102 precursor, putative, expressed [Oryza sativa
Japonica Group]
gi|77549616|gb|ABA92413.1| Thiol protease SEN102 precursor, putative, expressed [Oryza sativa
Japonica Group]
gi|77549617|gb|ABA92414.1| Thiol protease SEN102 precursor, putative, expressed [Oryza sativa
Japonica Group]
gi|113644852|dbj|BAF27993.1| Os11g0255300 [Oryza sativa Japonica Group]
gi|125576789|gb|EAZ18011.1| hypothetical protein OsJ_33558 [Oryza sativa Japonica Group]
gi|215701098|dbj|BAG92522.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 378
Score = 333 bits (855), Expect = 5e-89, Method: Compositional matrix adjust.
Identities = 169/337 (50%), Positives = 223/337 (66%), Gaps = 18/337 (5%)
Query: 31 VGYSPEHLTSMDKLIELFESWMSKHGKTYKCIE--------EKLHRFEIFKENLKHIDQR 82
+ ++ L+S + L L+E W S++ + E RF +F EN ++I +
Sbjct: 25 IPFTESDLSSEESLRALYERWRSRYTVSRPAASGGVGNDDGEARRRFNVFVENARYIHEA 84
Query: 83 NKEV-TSYWLGLNEFADMSHEEFKNKYLGLKPQFP---TRRQPSAEFSYR----DVKALP 134
N+ + L LN+FADM+ +EF+ Y G + + + + S+R D LP
Sbjct: 85 NRRGGRPFRLALNKFADMTTDEFRRTYAGSRARHHRSLSGGRGGEGGSFRYGGDDEDNLP 144
Query: 135 KSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNN 194
+VDWR++GAVT +K+QG CGSCWAFSTVAAVEG+N+I +G L +LSEQEL+DCDT N
Sbjct: 145 PAVDWRERGAVTGIKDQGQCGSCWAFSTVAAVEGVNKIKTGRLVTLSEQELVDCDTGDNQ 204
Query: 195 GCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQS 254
GC+GGLMDYAF++I +GG+ E +YPY E+G C K VTI GY+DVP NDE +
Sbjct: 205 GCDGGLMDYAFQFIKRNGGITTESNYPYRAEQGRCNKAKASSHDVTIDGYEDVPANDESA 264
Query: 255 LLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSK-GSDYIIVKN 313
L KA+A+QPV+VA+EASG DFQFYS GVFTG CG +LDHGVAAVGYG ++ G+ Y IVKN
Sbjct: 265 LQKAVANQPVAVAVEASGQDFQFYSEGVFTGECGTDLDHGVAAVGYGITRDGTKYWIVKN 324
Query: 314 SWGPKWGERGYIRMKRN-TGKPEGLCGINKMASIPLK 349
SWG WGERGYIRM+R + GLCGI AS P+K
Sbjct: 325 SWGEDWGERGYIRMQRGVSSDSNGLCGIAMEASYPVK 361
>gi|3451077|emb|CAA20473.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|7269200|emb|CAB79307.1| cysteine proteinase-like protein [Arabidopsis thaliana]
Length = 355
Score = 333 bits (855), Expect = 6e-89, Method: Compositional matrix adjust.
Identities = 170/356 (47%), Positives = 233/356 (65%), Gaps = 12/356 (3%)
Query: 1 MAFFSHSKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTY- 59
M F + +L L L +F S+ + + S H S +++ +F+ WMSKHGKTY
Sbjct: 1 MGFVRPVCMTILFL-LIVFVLSAPSSAMDLPATSGGHNRSNEEVEFIFQMWMSKHGKTYT 59
Query: 60 KCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRR 119
+ EK RF+ FK+NL+ IDQ N + SY LGL FAD++ +E+++ L P P +
Sbjct: 60 NALGEKERRFQNFKDNLRFIDQHNAKNLSYQLGLTRFADLTVQEYRD----LFPGSPKPK 115
Query: 120 QPSAEFSYRDV----KALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSG 175
Q + + S R V LP+SVDWR++GAV+ +K+QG+C SCWAFSTVAAVEG+N+IV+G
Sbjct: 116 QRNLKTSRRYVPLAGDQLPESVDWRQEGAVSEIKDQGTCNSCWAFSTVAAVEGLNKIVTG 175
Query: 176 NLTSLSEQELIDCDTSFNNGCNG-GLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKE 234
L SLSEQEL+DC+ NNGC G GLMD AF++++ + GL E+DYPY +G+C K+
Sbjct: 176 ELISLSEQELVDCNL-VNNGCYGSGLMDTAFQFLINNNGLDSEKDYPYQGTQGSCNRKQV 234
Query: 235 EMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHG 294
+ V+TI Y+DVP NDE SL KA+AHQPVSV ++ +F Y ++ GPCG LDH
Sbjct: 235 HLLVITIDSYEDVPANDEISLQKAVAHQPVSVGVDKKSQEFMLYRSCIYNGPCGTNLDHA 294
Query: 295 VAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLKK 350
+ VGYG G DY IV+NSWG WG+ GYI++ RN P+GLCGI +AS P+K
Sbjct: 295 LVIVGYGSENGQDYWIVRNSWGTTWGDAGYIKIARNFEDPKGLCGIAMLASYPIKN 350
>gi|5823018|gb|AAD53011.1|AF089848_1 senescence-specific cysteine protease [Brassica napus]
Length = 346
Score = 333 bits (853), Expect = 9e-89, Method: Compositional matrix adjust.
Identities = 164/307 (53%), Positives = 216/307 (70%), Gaps = 9/307 (2%)
Query: 49 ESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT--SYWLGLNEFADMSHEEFKN 106
+ WM++HG+TY + EK +R+ +FK N++ I++ N ++ L +N+FAD++++EF+
Sbjct: 39 DEWMAEHGRTYADMNEKNNRYVVFKRNVERIERLNNVPAGRTFKLAVNQFADLTNDEFRF 98
Query: 107 KYLGLKPQFPTRRQP---SAEFSYRDV--KALPKSVDWRKKGAVTPVKNQGSCGSCWAFS 161
Y G K F Q S F Y++V ALP +VDWRKKGAVTP+KNQGSCG CWAFS
Sbjct: 99 MYTGYKGDFVLFSQSQTKSTSFRYQNVFFGALPIAVDWRKKGAVTPIKNQGSCGCCWAFS 158
Query: 162 TVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYP 221
VAA+EG QI G L SLSEQ+L+DCDT+ + GC+GGLMD AF++I+A+GGL E +YP
Sbjct: 159 AVAAIEGATQIKKGKLISLSEQQLVDCDTN-DFGCSGGLMDTAFEHIMATGGLTTESNYP 217
Query: 222 YLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGG 281
Y E+ C+ K + +I+GY+DVP NDE +L+KA+AHQPVSV IE G DFQFYS G
Sbjct: 218 YKGEDANCKIKSTKPSAASITGYEDVPVNDENALMKAVAHQPVSVGIEGGGFDFQFYSSG 277
Query: 282 VFTGPCGAELDHGVAAVGYGKSK-GSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGI 340
VFTG C LDH V AVGY +S GS Y I+KNSWG KWGE GY+R+K++ EGLCG+
Sbjct: 278 VFTGECTTYLDHAVTAVGYSQSSAGSKYWIIKNSWGTKWGEGGYMRIKKDIKDKEGLCGL 337
Query: 341 NKMASIP 347
AS P
Sbjct: 338 AMKASYP 344
>gi|414591545|tpg|DAA42116.1| TPA: hypothetical protein ZEAMMB73_388689 [Zea mays]
Length = 384
Score = 333 bits (853), Expect = 9e-89, Method: Compositional matrix adjust.
Identities = 170/335 (50%), Positives = 219/335 (65%), Gaps = 16/335 (4%)
Query: 31 VGYSPEHLTSMDKLIELFESWMSKHGKTY----KCIEEKLHRFEIFKENLKHIDQRN-KE 85
+ +S L S + L L+E W S + + +++ RF +FKEN +++ + N K+
Sbjct: 24 IPFSERDLASEESLRALYERWRSHYHRVSPRDGDDKQQQARRFNVFKENARYVHEANRKD 83
Query: 86 VTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVK---------ALPKS 136
+ L LN+FADM+ +EF+ Y G + + R Q S+ + LP +
Sbjct: 84 GRPFRLALNKFADMTTDEFRRTYAGSRTRH-HRAQLGEARSFAHAQHGRGGSGTTNLPPA 142
Query: 137 VDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGC 196
VDWR +GAVT VK+QG CGSCWAFS +AAVEG+N+I++G L SLSEQEL+DCD N GC
Sbjct: 143 VDWRLRGAVTGVKDQGQCGSCWAFSAIAAVEGVNKIMTGKLVSLSEQELVDCDDVDNQGC 202
Query: 197 NGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLL 256
+GGLMDYAF+YI +GG+ E +YPYL E+ +C KE VTI GY+DVP N+E +L
Sbjct: 203 DGGLMDYAFQYIQRNGGVTTESNYPYLAEQRSCNKAKERSHDVTIDGYEDVPANNEDALQ 262
Query: 257 KALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKS-KGSDYIIVKNSW 315
KA+A QPV+VAIEASG DFQFYS GVFTG CG +LDHGVAAVGYG + G+ Y VKNSW
Sbjct: 263 KAVASQPVAVAIEASGQDFQFYSEGVFTGSCGTDLDHGVAAVGYGTTGDGTKYWTVKNSW 322
Query: 316 GPKWGERGYIRMKRNTGKPEGLCGINKMASIPLKK 350
G WGERGYIRM+R GLCGI S P KK
Sbjct: 323 GEDWGERGYIRMQRGVPDSRGLCGIAMEPSYPTKK 357
>gi|356539398|ref|XP_003538185.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 333 bits (853), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 168/309 (54%), Positives = 214/309 (69%), Gaps = 8/309 (2%)
Query: 44 LIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTS-YWLGLNEFADMSHE 102
+ E E WM+ HGK Y EK +++ FKEN++ I+ N Y LG+N FAD+++E
Sbjct: 36 MRERHEQWMAIHGKVYTHSYEKEQKYQTFKENVQRIEAFNHAGNKPYKLGINHFADLTNE 95
Query: 103 EFK--NKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAF 160
EFK N++ G TR + F Y ++ A+P ++DWR++GAVTP+K+QG CG CWAF
Sbjct: 96 EFKAINRFKGHVCSKITR---TPTFRYENMTAVPATLDWRQEGAVTPIKDQGQCGCCWAF 152
Query: 161 STVAAVEGINQIVSGNLTSLSEQELIDCDT-SFNNGCNGGLMDYAFKYIVASGGLHKEED 219
S VAA EGI ++ +G L SLSEQEL+DCDT + GC GGLMD AFK+I+ + GL E
Sbjct: 153 SAVAATEGITKLSTGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFILQNKGLAAEAI 212
Query: 220 YPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYS 279
YPY +GTC K E +I GY+DVP N E +LLKA+A+QPVSVAIEASG +FQFYS
Sbjct: 213 YPYEGVDGTCNAKAEGNHATSIKGYEDVPANSESALLKAVANQPVSVAIEASGFEFQFYS 272
Query: 280 GGVFTGPCGAELDHGVAAVGYGKS-KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLC 338
GGVFTG CG LDHGV AVGYG S G+ Y +VKNSWG KWG++GYIRM+R+ EGLC
Sbjct: 273 GGVFTGSCGTNLDHGVTAVGYGVSDDGTKYWLVKNSWGVKWGDKGYIRMQRDVAAKEGLC 332
Query: 339 GINKMASIP 347
GI +AS P
Sbjct: 333 GIAMLASYP 341
>gi|302812789|ref|XP_002988081.1| hypothetical protein SELMODRAFT_183539 [Selaginella moellendorffii]
gi|300144187|gb|EFJ10873.1| hypothetical protein SELMODRAFT_183539 [Selaginella moellendorffii]
Length = 425
Score = 332 bits (852), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 167/312 (53%), Positives = 214/312 (68%), Gaps = 13/312 (4%)
Query: 48 FESWMSKHGKTYKCIEEKL---HRFEIFKENLKHIDQRNKEVT-SYWLGLNEFADMSHEE 103
+ SW +K GK +C HRFE FKEN ++I++ N+ SY LGLN+F+D++ EE
Sbjct: 13 YASWCAKFGK--ECASSNSLGDHRFETFKENFRYIEEHNRAGKHSYRLGLNQFSDLTSEE 70
Query: 104 FKNKYLGLKPQF---PTRRQP---SAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSC 157
F+ ++LGL+P P + P E +++V LP SVDWR+ GAVT K+QGSCG C
Sbjct: 71 FRQRFLGLRPDLIDSPVLKMPRDSDIEEGFQNVD-LPASVDWRQHGAVTAPKDQGSCGGC 129
Query: 158 WAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKE 217
WAF+T A+EGINQIV+G L SLSEQELIDCD + GC+GGLM+ A+++IV +GGL E
Sbjct: 130 WAFATTGAIEGINQIVTGQLVSLSEQELIDCDKKADKGCDGGLMENAYQFIVENGGLDTE 189
Query: 218 EDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQF 277
DYPY E C KK VV I GY+ +PE DEQ+LL A+A QPVSVAIE + DFQ
Sbjct: 190 TDYPYHASESHCNMKKLNSRVVAIDGYKAIPEGDEQALLLAVAKQPVSVAIEGASKDFQH 249
Query: 278 YSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGL 337
Y+ GVFTG CG E++HGV VGYG G DY IVKNSW WG+ G+++M+RNTGK GL
Sbjct: 250 YASGVFTGHCGEEINHGVLIVGYGTEDGLDYWIVKNSWAATWGDGGFVKMQRNTGKRGGL 309
Query: 338 CGINKMASIPLK 349
C IN +AS P+K
Sbjct: 310 CSINTLASYPVK 321
>gi|113120271|gb|ABI30275.1| VS-A [Vasconcellea stipulata]
Length = 318
Score = 332 bits (852), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 167/314 (53%), Positives = 221/314 (70%), Gaps = 9/314 (2%)
Query: 5 SHSKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEE 64
S SKLL +++ LS+ S FSIVGYSP+ LTS +KLI LF+SWM ++ K YK I+E
Sbjct: 6 SFSKLLFVAICLSVHMGLSYGA-FSIVGYSPDDLTSTEKLINLFDSWMVEYDKVYKDIDE 64
Query: 65 KLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQ-FPTRRQPS- 122
K++RFEIFK+NLK+ID+ NK+ +YWLGL F D++++EFK KY+G P+ + T +P+
Sbjct: 65 KIYRFEIFKDNLKYIDETNKKNNTYWLGLTSFTDLTNDEFKEKYVGSIPENWSTTEEPND 124
Query: 123 AEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSE 182
EF Y DV +P S+DWR+KGAVTPV+NQGSCGSCW FS+VAAVEGIN+IV+G L SLSE
Sbjct: 125 KEFIYDDVVNIPASIDWRQKGAVTPVRNQGSCGSCWTFSSVAAVEGINKIVTGQLVSLSE 184
Query: 183 QELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTIS 242
QEL+DC+ + GC GG YA +Y VA+ G+H + YPY + C + + V
Sbjct: 185 QELLDCERR-SYGCRGGFPPYALQY-VANSGIHLRQYYPYEGVQRQCRAAQAKGPKVKTD 242
Query: 243 GYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGK 302
G V N+EQ+L++ +A QPVS+ +EA G FQ Y GG+F GPCG +DH VAAVGYG
Sbjct: 243 GVGRVQRNNEQALIQRIAIQPVSIVVEAKGRAFQNYRGGIFAGPCGTSIDHAVAAVGYGN 302
Query: 303 SKGSDYIIVKNSWG 316
YI++KNSWG
Sbjct: 303 G----YILIKNSWG 312
>gi|225458143|ref|XP_002280937.1| PREDICTED: cysteine proteinase RD21a [Vitis vinifera]
gi|302142569|emb|CBI19772.3| unnamed protein product [Vitis vinifera]
Length = 436
Score = 332 bits (851), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 162/307 (52%), Positives = 205/307 (66%), Gaps = 7/307 (2%)
Query: 46 ELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEV-TSYWLGLNEFADMSHEEF 104
+LFE+W ++GKTY EEK R ++F+EN + Q N SY L LN FAD++H EF
Sbjct: 27 DLFEAWCEQYGKTYSSEEEKASRLKVFEENHAFVTQHNSMANASYTLALNAFADLTHHEF 86
Query: 105 KNKYLGLKPQFPTRRQPSAEFSYRDVKAL--PKSVDWRKKGAVTPVKNQGSCGSCWAFST 162
K LG P R S V+ L P +VDWRK GAVT VK+QG+CG CW+FST
Sbjct: 87 KASRLGFSPG----RAQSIRSVGTPVQELHVPPAVDWRKSGAVTGVKDQGNCGGCWSFST 142
Query: 163 VAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPY 222
A+EGIN+IV+G+L SLSEQEL+DCD S+N+GC GGLMDYA+++++ + G+ E DYPY
Sbjct: 143 TGAIEGINKIVTGSLVSLSEQELVDCDRSYNSGCEGGLMDYAYQFVIKNQGIDSEADYPY 202
Query: 223 LMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGV 282
+ + C +K + +VTI GY D+P NDE+ LL+ +A QPVSV I S FQ YS GV
Sbjct: 203 VGMDKPCNKEKLKKHIVTIDGYTDIPPNDEKQLLQVVAKQPVSVGICGSEKTFQLYSKGV 262
Query: 283 FTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINK 342
+TGPC + LDH V VGYG G D+ IVKNSWG WG RGYI M RN G EG+CGIN
Sbjct: 263 YTGPCSSTLDHAVLIVGYGTEDGVDFWIVKNSWGEHWGMRGYIHMLRNNGTAEGICGINM 322
Query: 343 MASIPLK 349
+AS P K
Sbjct: 323 LASYPAK 329
>gi|356517426|ref|XP_003527388.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 343
Score = 332 bits (850), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 172/344 (50%), Positives = 227/344 (65%), Gaps = 16/344 (4%)
Query: 11 LLSLSLSLFACSSLAHDFSIVGYSPEHLTSMD-KLIELFESWMSKHGKTYKCIEEKLHRF 69
L +SL+L C + + T D + E WM+++ K YK +E+ RF
Sbjct: 7 LYHISLALLFC------MGFLAFQVTCRTLQDASMYERHAQWMARYAKVYKDPQEREKRF 60
Query: 70 EIFKENLKHIDQRNK-EVTSYWLGLNEFADMSHEEF---KNKYLGLKPQFPTRRQPSAEF 125
IFKEN+ +I+ N + SY L +N+FAD+++EEF +N++ G TR + F
Sbjct: 61 RIFKENVNYIETFNSADNKSYKLDINQFADLTNEEFIAPRNRFKGHMCSSITR---TTTF 117
Query: 126 SYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQEL 185
Y +V +P +VDWR+KGAVTP+K+QG CG CWAFS VAA EGI+ + +G L SLSEQE+
Sbjct: 118 KYENVTVIPSTVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALNAGKLISLSEQEV 177
Query: 186 IDCDT-SFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGY 244
+DCDT + GC GG MD AFK+I+ + GL+ E +YPY +G C K TI+GY
Sbjct: 178 VDCDTKGQDQGCAGGFMDGAFKFIIQNHGLNTEPNYPYKAADGKCNAKAAANHAATITGY 237
Query: 245 QDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSK 304
+DVP N+E++L KA+A+QPVSVAI+ASG+DFQFY GVFTG CG ELDHGV AVGYG S
Sbjct: 238 EDVPVNNEKALQKAVANQPVSVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVSA 297
Query: 305 -GSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
G++Y +VKNSWG +WGE GYIRM+R EGLCGI MAS P
Sbjct: 298 DGTEYWLVKNSWGTEWGEEGYIRMQRGVKAEEGLCGIAMMASYP 341
>gi|302781881|ref|XP_002972714.1| hypothetical protein SELMODRAFT_98707 [Selaginella moellendorffii]
gi|300159315|gb|EFJ25935.1| hypothetical protein SELMODRAFT_98707 [Selaginella moellendorffii]
Length = 446
Score = 332 bits (850), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 166/312 (53%), Positives = 214/312 (68%), Gaps = 13/312 (4%)
Query: 48 FESWMSKHGKTYKCIEEKL---HRFEIFKENLKHIDQRNKEVT-SYWLGLNEFADMSHEE 103
+ SW +K GK +C RFE FKEN ++I++ N+ SY LGLN+F+D++ EE
Sbjct: 13 YASWCAKFGK--ECASSNSLGDRRFETFKENFRYIEEHNRAGKHSYRLGLNQFSDLTSEE 70
Query: 104 FKNKYLGLKPQF---PTRRQP---SAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSC 157
F+ ++LGL+P P + P E +++V LP SVDWRK GAVT K+QGSCG C
Sbjct: 71 FRQRFLGLRPDLIDSPVLKMPRDSDIEEGFQNVD-LPASVDWRKHGAVTAPKDQGSCGGC 129
Query: 158 WAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKE 217
WAF+T A+EGINQIV+G L SLSEQELIDCD + GC+GGLM+ A+++IV +GGL E
Sbjct: 130 WAFATTGAIEGINQIVTGQLMSLSEQELIDCDKKADKGCDGGLMENAYQFIVENGGLDTE 189
Query: 218 EDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQF 277
DYPY E C KK VV I GY+ +P+ DEQ+LL+A+A QPVSVAIE + DFQ
Sbjct: 190 TDYPYHASESHCNMKKLNSRVVAIDGYEAIPDGDEQALLRAVAKQPVSVAIEGASKDFQH 249
Query: 278 YSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGL 337
Y+ GVFTG CG E++HGV VGYG G DY IVKNSW WG+ G+++M+RNTGK GL
Sbjct: 250 YASGVFTGHCGEEINHGVLIVGYGTEDGLDYWIVKNSWAATWGDGGFVKMQRNTGKRGGL 309
Query: 338 CGINKMASIPLK 349
C IN +AS P+K
Sbjct: 310 CSINTLASYPVK 321
>gi|18413505|ref|NP_567376.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|30315954|sp|Q9SUT0.1|CPR3_ARATH RecName: Full=Probable cysteine proteinase At4g11310; Flags:
Precursor
gi|5596477|emb|CAB51415.1| drought-inducible cysteine proteinase RD21A precursor-like protein
[Arabidopsis thaliana]
gi|7267830|emb|CAB81232.1| drought-inducible cysteine proteinase RD21A precursor-like protein
[Arabidopsis thaliana]
gi|332657595|gb|AEE82995.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 364
Score = 332 bits (850), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 168/353 (47%), Positives = 224/353 (63%), Gaps = 9/353 (2%)
Query: 5 SHSKLLLLSLSLSLFACSSLAHDFSIVGYSPE---HLTSMDKLIELFESWMSKHGKTYKC 61
+ S +L+L +++ + +C++ A D S+V Y H + +FESWM KHGK Y
Sbjct: 4 AKSAMLILLVAMVIASCAT-AIDMSVVSYDDNNRLHSVFDAEASLIFESWMVKHGKVYGS 62
Query: 62 IEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRR-- 119
+ EK R IF++NL+ I+ RN E SY LGL FAD+S E+K G P+ P
Sbjct: 63 VAEKERRLTIFEDNLRFINNRNAENLSYRLGLTGFADLSLHEYKEVCHGADPRPPRNHVF 122
Query: 120 -QPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLT 178
S + LPKSVDWR +GAVT VK+QG C SCWAFSTV AVEG+N+IV+G L
Sbjct: 123 MTSSDRYKTSADDVLPKSVDWRNEGAVTEVKDQGHCRSCWAFSTVGAVEGLNKIVTGELV 182
Query: 179 SLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDK-KEEME 237
+LSEQ+LI+C+ NNGC GG ++ A+++I+ +GGL + DYPY G C+ + KE +
Sbjct: 183 TLSEQDLINCNKE-NNGCGGGKLETAYEFIMKNGGLGTDNDYPYKAVNGVCDGRLKENNK 241
Query: 238 VVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAA 297
V I GY+++P NDE +L+KA+AHQPV+ I++S +FQ Y GVF G CG L+HGV
Sbjct: 242 NVMIDGYENLPANDESALMKAVAHQPVTAVIDSSSREFQLYESGVFDGSCGTNLNHGVVV 301
Query: 298 VGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLKK 350
VGYG G DY +VKNS G WGE GY++M RN P GLCGI AS PLK
Sbjct: 302 VGYGTENGRDYWLVKNSRGITWGEAGYMKMARNIANPRGLCGIAMRASYPLKN 354
>gi|125533982|gb|EAY80530.1| hypothetical protein OsI_35710 [Oryza sativa Indica Group]
Length = 378
Score = 332 bits (850), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 167/335 (49%), Positives = 218/335 (65%), Gaps = 18/335 (5%)
Query: 33 YSPEHLTSMDKLIELFESWMSKHGKTYKCIE--------EKLHRFEIFKENLKHIDQRNK 84
++ L+S + L L+E W S++ + E RF +F EN ++I + N+
Sbjct: 27 FTESDLSSEESLRALYERWRSRYTVSRPAASGGVGNDDGEARRRFNVFVENARYIHEANR 86
Query: 85 EV-TSYWLGLNEFADMSHEEFKNKYLGLKPQF-------PTRRQPSAEFSYRDVKALPKS 136
+ L LN+FADM+ +EF+ Y G + + S + D LP +
Sbjct: 87 RGGRPFRLALNKFADMTTDEFRRTYAGSRARHHRSLRGGRGGEGGSFRYGGDDEDNLPPA 146
Query: 137 VDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGC 196
VDWR++GAVT +K+QG CGSCWAFS VAAVEG+N+I +G L +LSEQEL+DCDT N GC
Sbjct: 147 VDWRERGAVTGIKDQGQCGSCWAFSAVAAVEGVNKIKTGRLVTLSEQELVDCDTGDNQGC 206
Query: 197 NGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLL 256
+GGLMDYAF++I +GG+ E +YPY E+G C K VTI GY+DVP NDE +L
Sbjct: 207 DGGLMDYAFQFIKRNGGITTESNYPYRAEQGRCNKAKASSHDVTIDGYEDVPANDESALQ 266
Query: 257 KALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSK-GSDYIIVKNSW 315
KA+A+QPV+VA+EASG DFQFYS GVFTG CG +LDHGVAAVGYG ++ G+ Y IVKNSW
Sbjct: 267 KAVANQPVAVAVEASGQDFQFYSEGVFTGECGTDLDHGVAAVGYGITRDGTKYWIVKNSW 326
Query: 316 GPKWGERGYIRMKRN-TGKPEGLCGINKMASIPLK 349
G WGERGYIRM+R + GLCGI AS P+K
Sbjct: 327 GEDWGERGYIRMQRGVSSDSNGLCGIAMEASYPVK 361
>gi|30685308|ref|NP_566634.2| putative cysteine proteinase [Arabidopsis thaliana]
gi|30315949|sp|Q9LT77.1|CPR1_ARATH RecName: Full=Probable cysteine proteinase At3g19400; Flags:
Precursor
gi|11994462|dbj|BAB02464.1| cysteine proteinase [Arabidopsis thaliana]
gi|332642715|gb|AEE76236.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 362
Score = 332 bits (850), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 159/307 (51%), Positives = 208/307 (67%), Gaps = 4/307 (1%)
Query: 47 LFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNK-EVTSYWLGLNEFADMSHEEFK 105
++E W+ ++ K Y + EK RF+IFK+NLK +D+ N ++ +GL FAD+++EEF+
Sbjct: 43 MYEQWLVENRKNYNGLGEKERRFKIFKDNLKFVDEHNSVPDRTFEVGLTRFADLTNEEFR 102
Query: 106 NKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAA 165
YL K + + + Y++ LP VDWR GAV VK+QG+CGSCWAFS V A
Sbjct: 103 AIYLRKKMERTKDSVKTERYLYKEGDVLPDEVDWRANGAVVSVKDQGNCGSCWAFSAVGA 162
Query: 166 VEGINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKEEDYPYLM 224
VEGINQI +G L SLSEQEL+DCD F N GC+GG+M+YAF++I+ +GG+ ++DYPY
Sbjct: 163 VEGINQITTGELISLSEQELVDCDRGFVNAGCDGGIMNYAFEFIMKNGGIETDQDYPYNA 222
Query: 225 EE-GTCE-DKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGV 282
+ G C DK VVTI GY+DVP +DE+SL KA+AHQPVSVAIEAS FQ Y GV
Sbjct: 223 NDLGLCNADKNNNTRVVTIDGYEDVPRDDEKSLKKAVAHQPVSVAIEASSQAFQLYKSGV 282
Query: 283 FTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINK 342
TG CG LDHGV VGYG + G DY I++NSWG WG+ GY++++RN P G CGI
Sbjct: 283 MTGTCGISLDHGVVVVGYGSTSGEDYWIIRNSWGLNWGDSGYVKLQRNIDDPFGKCGIAM 342
Query: 343 MASIPLK 349
M S P K
Sbjct: 343 MPSYPTK 349
>gi|297816030|ref|XP_002875898.1| hypothetical protein ARALYDRAFT_485194 [Arabidopsis lyrata subsp.
lyrata]
gi|297321736|gb|EFH52157.1| hypothetical protein ARALYDRAFT_485194 [Arabidopsis lyrata subsp.
lyrata]
Length = 363
Score = 331 bits (849), Expect = 3e-88, Method: Compositional matrix adjust.
Identities = 173/348 (49%), Positives = 227/348 (65%), Gaps = 12/348 (3%)
Query: 8 KLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLH 67
KL + LS +S DF + L + + + +L+E W H T + E L
Sbjct: 2 KLFFIVLSFLCLLQASKGFDFD-----EKELETEENVWKLYERWRDHHSVT-RASHEALK 55
Query: 68 RFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLG--LKPQFPTR--RQPSA 123
RF +F+ N+ H+ + NK+ Y L +N FAD++H EF++ Y G +K R ++ S
Sbjct: 56 RFNVFRHNVLHVHRTNKKNKPYKLKVNRFADITHHEFRSSYAGSNVKHHRMLRGPKRGSG 115
Query: 124 EFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQ 183
F Y +V +P SVDWR+KGAVT VKNQ CGSCWAFSTVAAVEGIN+I + L SLSEQ
Sbjct: 116 GFMYENVTRVPSSVDWREKGAVTEVKNQQDCGSCWAFSTVAAVEGINKIRTNKLVSLSEQ 175
Query: 184 ELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGT-CEDKKEEMEVVTIS 242
EL+DCDT N GC GGLM+ AF++I +GG+ EE YPY + C K + E VTI
Sbjct: 176 ELVDCDTEENQGCAGGLMEPAFEFIKNNGGIKTEETYPYDSNDVQFCRAKSIDGETVTID 235
Query: 243 GYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGK 302
G++ VPENDE++LLKA+AHQPVSVAI+A +DFQ YS GVF G CG +L+HGV VGYG+
Sbjct: 236 GHEHVPENDEEALLKAVAHQPVSVAIDAGSSDFQLYSEGVFIGECGTQLNHGVVIVGYGE 295
Query: 303 SK-GSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
+K G+ Y IV+NSWGP+WGE GY+R++R + EG CGI AS P K
Sbjct: 296 TKNGTKYWIVRNSWGPEWGEGGYVRIERGISENEGRCGIAMEASYPTK 343
>gi|146215978|gb|ABQ10191.1| actinidin Act1c [Actinidia eriantha]
Length = 368
Score = 331 bits (849), Expect = 3e-88, Method: Compositional matrix adjust.
Identities = 167/351 (47%), Positives = 227/351 (64%), Gaps = 23/351 (6%)
Query: 1 MAFFSHSKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYK 60
M+ S LL+LSL+L + + D++ ++ESW+ KHGK+Y
Sbjct: 10 MSLLFFSTLLILSLAL-------------------DAKRTNDEVKAMYESWLIKHGKSYN 50
Query: 61 CIEEKLHRFEIFKENLKHIDQRNKEVT-SYWLGLNEFADMSHEEFKNKYLGLKPQFPTRR 119
+ E+ RFEIFKE L+ ID+ N + + SY +GLN+FAD+++EEF++ YLG + +
Sbjct: 51 SLGERERRFEIFKETLRFIDEHNADTSRSYKVGLNQFADLTNEEFRSTYLGFT-RGSNKT 109
Query: 120 QPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTS 179
+ S + R + LP VDWR +GAV +KNQG CGSCWAFS +AAVEGIN+IV+GNL S
Sbjct: 110 KVSNRYEPRVGQVLPDYVDWRSEGAVVDIKNQGQCGSCWAFSAIAAVEGINKIVTGNLIS 169
Query: 180 LSEQELIDCD-TSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEV 238
LSEQEL+DC T GC+GG M F++I+ +GG++ EE+YPY +EG C+ + +
Sbjct: 170 LSEQELVDCGRTQSTKGCDGGYMTDGFEFIINNGGINTEENYPYTAQEGQCDLNLQNEKY 229
Query: 239 VTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAV 298
VTI Y++VP +E +L A+A+QPVSVA+E++G FQ YS G+FTGPCG DH V V
Sbjct: 230 VTIDNYENVPYYNEWALQTAVAYQPVSVALESAGDAFQHYSSGIFTGPCGTATDHAVTIV 289
Query: 299 GYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
GYG G DY IVKNSW WGE GY+R+ RN G G CGI M S P+K
Sbjct: 290 GYGTEGGIDYWIVKNSWDTTWGEEGYMRILRNVGGA-GTCGIATMPSYPVK 339
>gi|357458911|ref|XP_003599736.1| Cysteine proteinase [Medicago truncatula]
gi|357474719|ref|XP_003607644.1| Cysteine proteinase [Medicago truncatula]
gi|355488784|gb|AES69987.1| Cysteine proteinase [Medicago truncatula]
gi|355508699|gb|AES89841.1| Cysteine proteinase [Medicago truncatula]
Length = 340
Score = 331 bits (848), Expect = 3e-88, Method: Compositional matrix adjust.
Identities = 164/309 (53%), Positives = 212/309 (68%), Gaps = 11/309 (3%)
Query: 44 LIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNK-EVTSYWLGLNEFADMSHE 102
L E E WM++HGK Y+ EK RF IFK+N++ I+ N + Y L +N AD++ +
Sbjct: 36 LQERHEQWMTEHGKVYEDAIEKEKRFMIFKDNVEFIESFNAADNQPYKLSVNHLADLTLD 95
Query: 103 EFK---NKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWA 159
EFK N Y + +F T F Y +V A+P +VDWR KGAVTP+K+QG CGSCWA
Sbjct: 96 EFKASRNGYKKIDREFTT-----TSFKYENVTAIPAAVDWRVKGAVTPIKDQGQCGSCWA 150
Query: 160 FSTVAAVEGINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKEE 218
FSTVAA EGINQI +G L SLSEQEL+DCDT + GC GGLM+ F++I+ +GG+ E
Sbjct: 151 FSTVAATEGINQITTGKLVSLSEQELVDCDTKGEDQGCEGGLMEDGFEFIIKNGGITSET 210
Query: 219 DYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFY 278
+YPY +G+C + V I+GY+ VP N E+SLLKA+A+QP+SV+I+AS + F FY
Sbjct: 211 NYPYKAADGSC-NTATTTPVAKITGYEKVPVNSEKSLLKAVANQPISVSIDASDSSFMFY 269
Query: 279 SGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLC 338
S G++TG CG ELDHGV AVGYG + G+DY IVKNSWG WGE+GYIRM+R EGLC
Sbjct: 270 SSGIYTGECGTELDHGVTAVGYGSANGTDYWIVKNSWGTVWGEKGYIRMQRGIAAKEGLC 329
Query: 339 GINKMASIP 347
GI +S P
Sbjct: 330 GIAMDSSYP 338
>gi|26452046|dbj|BAC43113.1| putative cysteine proteinase RD21A precursor [Arabidopsis thaliana]
Length = 362
Score = 331 bits (848), Expect = 3e-88, Method: Compositional matrix adjust.
Identities = 159/307 (51%), Positives = 208/307 (67%), Gaps = 4/307 (1%)
Query: 47 LFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNK-EVTSYWLGLNEFADMSHEEFK 105
++E W+ ++ K Y + EK RF+IFK+NLK +D+ N ++ +GL FAD+++EEF+
Sbjct: 43 MYEQWLVENRKNYNGLGEKERRFKIFKDNLKFVDEHNSVPDRTFEVGLTRFADLTNEEFR 102
Query: 106 NKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAA 165
YL K + + + Y++ LP VDWR GAV VK+QG+CGSCWAFS V A
Sbjct: 103 AIYLRKKMERNKDSVKTERYLYKEGDVLPDEVDWRANGAVVSVKDQGNCGSCWAFSAVGA 162
Query: 166 VEGINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKEEDYPYLM 224
VEGINQI +G L SLSEQEL+DCD F N GC+GG+M+YAF++I+ +GG+ ++DYPY
Sbjct: 163 VEGINQITTGELISLSEQELVDCDRGFVNAGCDGGIMNYAFEFIMKNGGIETDQDYPYNA 222
Query: 225 EE-GTCE-DKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGV 282
+ G C DK VVTI GY+DVP +DE+SL KA+AHQPVSVAIEAS FQ Y GV
Sbjct: 223 NDLGLCNADKNNNTRVVTIDGYEDVPRDDEKSLKKAVAHQPVSVAIEASSQAFQLYKSGV 282
Query: 283 FTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINK 342
TG CG LDHGV VGYG + G DY I++NSWG WG+ GY++++RN P G CGI
Sbjct: 283 MTGTCGISLDHGVVVVGYGSTSGEDYWIIRNSWGLNWGDSGYVKLQRNIDDPFGKCGIAM 342
Query: 343 MASIPLK 349
M S P K
Sbjct: 343 MPSYPTK 349
>gi|356509992|ref|XP_003523725.1| PREDICTED: oryzain alpha chain-like [Glycine max]
Length = 439
Score = 331 bits (848), Expect = 3e-88, Method: Compositional matrix adjust.
Identities = 161/310 (51%), Positives = 211/310 (68%), Gaps = 8/310 (2%)
Query: 46 ELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEV------TSYWLGLNEFADM 99
ELFE W +H KTY EEKL+R ++F++N + Q N+ +SY L LN FAD+
Sbjct: 31 ELFEKWCKEHSKTYSSEEEKLYRLKVFEDNYAFVAQHNQNANNNNNNSSYTLSLNAFADL 90
Query: 100 SHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWA 159
+H EFK LGL ++P + S RD+ +P +DWR+ GAVTPVK+Q SCG+CWA
Sbjct: 91 THHEFKTTRLGLPLTLLRFKRPQNQQS-RDLLHIPSQIDWRQSGAVTPVKDQASCGACWA 149
Query: 160 FSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEED 219
FS A+EGIN+IV+G+L SLSEQELIDCDTS+N+GC GGLMD+A+++++ + G+ E+D
Sbjct: 150 FSATGAIEGINKIVTGSLVSLSEQELIDCDTSYNSGCGGGLMDFAYQFVIDNKGIDTEDD 209
Query: 220 YPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYS 279
YPY + +C K + VTI Y DVP ++E+ +LKA+A QPVSV I S +FQ YS
Sbjct: 210 YPYQARQRSCSKDKLKRRAVTIEDYVDVPPSEEE-ILKAVASQPVSVGICGSEREFQLYS 268
Query: 280 GGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCG 339
G+FTGPC LDH V VGYG G DY IVKNSWG WG GYI M RN+G +G+CG
Sbjct: 269 KGIFTGPCSTFLDHAVLIVGYGSENGVDYWIVKNSWGKYWGMNGYIHMIRNSGNSKGICG 328
Query: 340 INKMASIPLK 349
IN +AS P+K
Sbjct: 329 INTLASYPVK 338
>gi|102140014|gb|ABF70145.1| cysteine protease, putative [Musa acuminata]
Length = 373
Score = 331 bits (848), Expect = 4e-88, Method: Compositional matrix adjust.
Identities = 167/342 (48%), Positives = 215/342 (62%), Gaps = 14/342 (4%)
Query: 9 LLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHR 68
L + L+L L ACS A + + H+ WM++HG+TYK EK R
Sbjct: 7 LWMALLALGLGACSPAAAELGDASMAERHV-----------EWMARHGRTYKDAAEKEQR 55
Query: 69 FEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYR 128
IFK N+++I+ N Y L N+FAD++HEEFK + G KP ++ F +
Sbjct: 56 LGIFKSNVEYIESFNAGKRKYQLAANQFADLTHEEFKAMHTGFKPSGTGAKKAGNGFRHG 115
Query: 129 DVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDC 188
+ ++P SVDWR KGAVTPVK+QG CGSCWAF+ VAAVEGI +IV+G L SLSEQ+L+DC
Sbjct: 116 SLSSVPDSVDWRSKGAVTPVKDQGLCGSCWAFTVVAAVEGITKIVTGKLISLSEQQLVDC 175
Query: 189 DT-SFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDV 247
D + GC GG MD AF++IV +GG+ E +YPY + C V TI ++DV
Sbjct: 176 DVHGKDQGCQGGDMDAAFEFIVNNGGITSEANYPYEEVQRLCNAHNASFVVATIESHEDV 235
Query: 248 PENDEQSLLKALAHQPVSVAIEA-SGTDFQFYSGGVFTGPCGAELDHGVAAVGYG-KSKG 305
P NDE++L KA+A+QPVSV I+A S DFQ YSGGVF+G CG +LDH V VGYG S G
Sbjct: 236 PTNDEKALRKAVANQPVSVGIDAGSSLDFQLYSGGVFSGECGTDLDHAVTVVGYGTTSDG 295
Query: 306 SDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
+ Y + KNSWG WGE GYIRM+R+ EGLCGI AS P
Sbjct: 296 TKYWLAKNSWGETWGENGYIRMERDVAAKEGLCGIAMQASYP 337
>gi|3688528|emb|CAA06243.1| pre-pro-TPE4A protein [Pisum sativum]
Length = 360
Score = 331 bits (848), Expect = 4e-88, Method: Compositional matrix adjust.
Identities = 173/323 (53%), Positives = 222/323 (68%), Gaps = 7/323 (2%)
Query: 33 YSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLG 92
++ L S L +L+E W S H T + ++EK +RF +FK N+ H+ NK Y L
Sbjct: 25 FNEHDLDSEKSLWDLYERWRSHHTVT-RSLDEKHNRFNVFKANVMHVHNTNKLDKPYKLK 83
Query: 93 LNEFADMSHEEFKNKYLGLK----PQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPV 148
LN+FADM++ EF+ Y K F + F Y +VK +P S+DWRKKGAVT V
Sbjct: 84 LNKFADMTNYEFRRIYADSKVSHHRMFRGMSNENGTFMYENVKNVPSSIDWRKKGAVTDV 143
Query: 149 KNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYI 208
K+QG CGSCWAFST+ AVEGINQI + L SLSEQEL+DCDT N GCNGGLM+YAF++I
Sbjct: 144 KDQGQCGSCWAFSTIVAVEGINQIKTQKLVSLSEQELVDCDTGGNEGCNGGLMEYAFEFI 203
Query: 209 VASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAI 268
G+ E +YPY ++GTC+ KKE+ V+I GY++VP N+E +LLKA A QPVSVAI
Sbjct: 204 -KQNGITTESNYPYAAKDGTCDLKKEDKAEVSIDGYENVPINNEAALLKAAAKQPVSVAI 262
Query: 269 EASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKG-SDYIIVKNSWGPKWGERGYIRM 327
+A G +FQFYS GVF+G CG +L+HGVA VGYG ++ + Y IVKNSWG +WGE+GYIRM
Sbjct: 263 DAGGYNFQFYSEGVFSGHCGTDLNHGVAVVGYGVTQDRTKYWIVKNSWGSEWGEQGYIRM 322
Query: 328 KRNTGKPEGLCGINKMASIPLKK 350
+R EGLCGI AS P+KK
Sbjct: 323 QRGISHKEGLCGIAMEASYPIKK 345
>gi|113120265|gb|ABI30272.1| VXH-A, partial [Vasconcellea x heilbornii]
Length = 318
Score = 331 bits (848), Expect = 4e-88, Method: Compositional matrix adjust.
Identities = 166/314 (52%), Positives = 217/314 (69%), Gaps = 9/314 (2%)
Query: 5 SHSKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEE 64
S SKLL +++ LS+ S FSIVGYSP+ LTS +KLI LF+SWM ++ K YK I+E
Sbjct: 6 SFSKLLFVAICLSVHMGLSYGA-FSIVGYSPDDLTSTEKLINLFDSWMVEYDKVYKDIDE 64
Query: 65 KLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQ--FPTRRQPS 122
K++RFEIFK+NLK+ID+ NK+ +YWLGL F D++++EFK KY+G P+ T
Sbjct: 65 KIYRFEIFKDNLKYIDETNKKNNTYWLGLTSFTDLTNDEFKEKYVGSIPENWSTTEESND 124
Query: 123 AEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSE 182
EF Y DV +P S+DWR+KGAVTPV+NQGSCGSCW FS+VAAVEGIN+IV+G L SLSE
Sbjct: 125 KEFIYDDVVNIPASIDWRQKGAVTPVRNQGSCGSCWTFSSVAAVEGINKIVTGQLVSLSE 184
Query: 183 QELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTIS 242
QEL+DC+ + GC GG YA +Y VA+ G+H + YPY + C + + V
Sbjct: 185 QELLDCERR-SYGCRGGFPPYALQY-VANSGIHLRQYYPYEGVQRQCRAAQAKGPKVKTD 242
Query: 243 GYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGK 302
G V N+EQ+L++ +A QPVS+ +EA G FQ Y GG+F GPCG +DH VAAVGYG
Sbjct: 243 GVGRVQRNNEQALIQRIAIQPVSIVVEAKGRAFQNYRGGIFAGPCGTSIDHAVAAVGYGN 302
Query: 303 SKGSDYIIVKNSWG 316
YI++KNSWG
Sbjct: 303 G----YILIKNSWG 312
>gi|357474523|ref|XP_003607546.1| Cysteine proteinase [Medicago truncatula]
gi|358347207|ref|XP_003637651.1| Cysteine proteinase [Medicago truncatula]
gi|355503586|gb|AES84789.1| Cysteine proteinase [Medicago truncatula]
gi|355508601|gb|AES89743.1| Cysteine proteinase [Medicago truncatula]
Length = 345
Score = 330 bits (847), Expect = 4e-88, Method: Compositional matrix adjust.
Identities = 166/345 (48%), Positives = 230/345 (66%), Gaps = 6/345 (1%)
Query: 7 SKLLLLSLSLSLFACSS--LAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEE 64
S +L ++ + L C++ +A + + + ++ + + F+ W+ +HG+ YK +E
Sbjct: 3 STILTTTIFILLMLCNTCVIASESECPPTHKQKSSDVEAMKKRFDGWVKRHGRKYKHNDE 62
Query: 65 KLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAE 124
+ RF I++ N+++I +N + SY L N+FAD+++EEF++ Y+GL + R +
Sbjct: 63 REVRFGIYQANVQYIQCKNAQKNSYNLTDNKFADLTNEEFQSTYMGLSTRL---RSHNTG 119
Query: 125 FSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQE 184
F Y + LP+S DWRK+GAVT + +QG CG CWAF+ VAAVEGIN+I SG L SLSEQE
Sbjct: 120 FRYDEHGDLPESKDWRKEGAVTEIMDQGQCGGCWAFAAVAAVEGINKIKSGKLISLSEQE 179
Query: 185 LIDCDT-SFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISG 243
LIDCD S N GC GGLM+ A+ +I+ +GGL E+DYPY +GTC+ +K +ISG
Sbjct: 180 LIDCDVKSGNQGCQGGLMETAYTFIIENGGLTTEQDYPYEGVDGTCKMEKAAHYAASISG 239
Query: 244 YQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKS 303
Y++VP ++E L A AHQPVSVAI+A G FQFYS GVF+G CG +L+HGV VGYGK
Sbjct: 240 YEEVPADNEAKLKAAAAHQPVSVAIDAGGYSFQFYSEGVFSGICGKQLNHGVTVVGYGKE 299
Query: 304 KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPL 348
+ Y IVKNSWG WGE GYIRMKR+T EG+CGI AS PL
Sbjct: 300 TINKYWIVKNSWGADWGESGYIRMKRDTLSKEGMCGIAMQASYPL 344
>gi|20260334|gb|AAM13065.1| drought-inducible cysteine proteinase RD21A precursor-like protein
[Arabidopsis thaliana]
gi|23197782|gb|AAN15418.1| drought-inducible cysteine proteinase RD21A precursor-like protein
[Arabidopsis thaliana]
Length = 357
Score = 330 bits (847), Expect = 5e-88, Method: Compositional matrix adjust.
Identities = 167/349 (47%), Positives = 222/349 (63%), Gaps = 9/349 (2%)
Query: 9 LLLLSLSLSLFACSSLAHDFSIVGYSPE---HLTSMDKLIELFESWMSKHGKTYKCIEEK 65
+L+L +++ + +C++ A D S+V Y H + +FESWM KHGK Y + EK
Sbjct: 1 MLILLVAMVIASCAT-AIDMSVVSYDDNNRLHSVFDAEASLIFESWMVKHGKVYGSVAEK 59
Query: 66 LHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRR---QPS 122
R IF++NL+ I+ RN E SY LGL FAD+S E+K G P+ P S
Sbjct: 60 ERRLTIFEDNLRFINNRNAENLSYRLGLTGFADLSLHEYKEVCHGADPRPPRNHVFMTSS 119
Query: 123 AEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSE 182
+ LPKSVDWR +GAVT VK+QG C SCWAFSTV AVEG+N+IV+G L +LSE
Sbjct: 120 DRYKTSADDVLPKSVDWRNEGAVTEVKDQGHCRSCWAFSTVGAVEGLNKIVTGELVTLSE 179
Query: 183 QELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDK-KEEMEVVTI 241
Q+LI+C+ NNGC GG ++ A+++I+ +GGL + DYPY G C+ + KE + V I
Sbjct: 180 QDLINCNKE-NNGCGGGKLETAYEFIMKNGGLGTDNDYPYKAVNGVCDGRLKENNKNVMI 238
Query: 242 SGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYG 301
GY+++P NDE +L+KA+AHQPV+ I++S +FQ Y GVF G CG L+HGV VGYG
Sbjct: 239 DGYENLPANDESALMKAVAHQPVTAVIDSSSREFQLYESGVFDGSCGTNLNHGVVVVGYG 298
Query: 302 KSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLKK 350
G DY +VKNS G WGE GY++M RN P GLCGI AS PLK
Sbjct: 299 TENGRDYWLVKNSRGITWGEAGYMKMARNIANPRGLCGIAMRASYPLKN 347
>gi|115461226|ref|NP_001054213.1| Os04g0670500 [Oryza sativa Japonica Group]
gi|62510688|sp|Q7XR52.2|CYSP1_ORYSJ RecName: Full=Cysteine protease 1; AltName: Full=OsCP1; Flags:
Precursor
gi|38345300|emb|CAE02828.2| OSJNBa0043A12.33 [Oryza sativa Japonica Group]
gi|113565784|dbj|BAF16127.1| Os04g0670500 [Oryza sativa Japonica Group]
gi|215741575|dbj|BAG98070.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 490
Score = 330 bits (847), Expect = 5e-88, Method: Compositional matrix adjust.
Identities = 168/294 (57%), Positives = 202/294 (68%), Gaps = 8/294 (2%)
Query: 62 IEEKLHRFEIFKENLKHIDQRNK---EVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTR 118
I E RF +F +NLK +D N E + LG+N FAD+++ EF+ YLG P R
Sbjct: 82 IGEHERRFRVFWDNLKFVDAHNARADERGGFRLGMNRFADLTNGEFRATYLGTTPAGRGR 141
Query: 119 RQPSAEFSYRDVKALPKSVDWRKKGAVT-PVKNQGSCGSCWAFSTVAAVEGINQIVSGNL 177
R A + + V+ALP SVDWR KGAV PVKNQG CGSCWAFS VAAVEGIN+IV+G L
Sbjct: 142 RVGEA-YRHDGVEALPDSVDWRDKGAVVAPVKNQGQCGSCWAFSAVAAVEGINKIVTGEL 200
Query: 178 TSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEM 236
SLSEQEL++C + N+GCNGG+MD AF +I +GGL EEDYPY +G C K
Sbjct: 201 VSLSEQELVECARNGQNSGCNGGIMDDAFAFIARNGGLDTEEDYPYTAMDGKCNLAKRSR 260
Query: 237 EVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVA 296
+VV+I G++DVPENDE SL KA+AHQPVSVAI+A G +FQ Y GVFTG CG LDHGV
Sbjct: 261 KVVSIDGFEDVPENDELSLQKAVAHQPVSVAIDAGGREFQLYDSGVFTGRCGTNLDHGVV 320
Query: 297 AVGYGK--SKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPL 348
AVGYG + G+ Y V+NSWGP WGE GYIRM+RN G CGI MAS P+
Sbjct: 321 AVGYGTDAATGAAYWTVRNSWGPDWGENGYIRMERNVTARTGKCGIAMMASYPI 374
>gi|171702829|dbj|BAG16370.1| cysteine protease [Brassica oleracea var. italica]
Length = 332
Score = 330 bits (847), Expect = 5e-88, Method: Compositional matrix adjust.
Identities = 161/299 (53%), Positives = 215/299 (71%), Gaps = 11/299 (3%)
Query: 50 SWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT---SYWLGLNEFADMSHEEFKN 106
+WM++HG+ Y EK +R+ +FK N++ I++ N EV ++ L +N+FAD+++EEF++
Sbjct: 33 AWMTEHGRVYADANEKNNRYVVFKRNVESIERLN-EVQYGLTFKLAVNQFADLTNEEFRS 91
Query: 107 KYLGLKPQ--FPTRRQPSAEFSYRDVK--ALPKSVDWRKKGAVTPVKNQGSCGSCWAFST 162
Y G K +R +P++ F Y+ V ALP SVDWRKKGAVTP+K+QGSCGSCWAFS
Sbjct: 92 MYTGYKGNSVLSSRTKPTS-FRYQHVSSDALPISVDWRKKGAVTPIKDQGSCGSCWAFSA 150
Query: 163 VAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPY 222
VAA+EG+ QI G L SLSEQEL+DCDT+ ++GC GG M+ AF Y + +GGL E +YPY
Sbjct: 151 VAAIEGVAQIKKGKLISLSEQELVDCDTN-DDGCMGGYMNSAFNYTMTTGGLTSESNYPY 209
Query: 223 LMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGV 282
+GTC K + +I G++DVP NDE++L+KA+AH PVS+ I GT FQFYS GV
Sbjct: 210 KSTDGTCNINKTKQIATSIKGFEDVPANDEKALMKAVAHHPVSIGIAGGGTGFQFYSSGV 269
Query: 283 FTGPCGAELDHGVAAVGYGK-SKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGI 340
F+G C LDHGVA VGYGK S GS Y I+KNSWGPKWGERGY+R+K++T G CG+
Sbjct: 270 FSGECSTHLDHGVAVVGYGKSSNGSKYWILKNSWGPKWGERGYMRIKKDTKAKHGQCGL 328
>gi|90265242|emb|CAH67695.1| H0624F09.3 [Oryza sativa Indica Group]
Length = 494
Score = 330 bits (846), Expect = 6e-88, Method: Compositional matrix adjust.
Identities = 168/294 (57%), Positives = 202/294 (68%), Gaps = 8/294 (2%)
Query: 62 IEEKLHRFEIFKENLKHIDQRNK---EVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTR 118
I E RF +F +NLK +D N E + LG+N FAD+++ EF+ YLG P R
Sbjct: 82 IGEHERRFRVFWDNLKFVDAHNARADERGGFRLGMNRFADLTNGEFRATYLGTTPAGRGR 141
Query: 119 RQPSAEFSYRDVKALPKSVDWRKKGAVT-PVKNQGSCGSCWAFSTVAAVEGINQIVSGNL 177
R A + + V+ALP SVDWR KGAV PVKNQG CGSCWAFS VAAVEGIN+IV+G L
Sbjct: 142 RVGEA-YRHDGVEALPDSVDWRDKGAVVAPVKNQGQCGSCWAFSAVAAVEGINKIVTGEL 200
Query: 178 TSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEM 236
SLSEQEL++C + N+GCNGG+MD AF +I +GGL EEDYPY +G C K
Sbjct: 201 VSLSEQELVECARNGQNSGCNGGIMDDAFAFIARNGGLDTEEDYPYTAMDGKCNLAKRSR 260
Query: 237 EVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVA 296
+VV+I G++DVPENDE SL KA+AHQPVSVAI+A G +FQ Y GVFTG CG LDHGV
Sbjct: 261 KVVSIDGFEDVPENDELSLQKAVAHQPVSVAIDAGGREFQLYDSGVFTGRCGTNLDHGVV 320
Query: 297 AVGYG--KSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPL 348
AVGYG + G+ Y V+NSWGP WGE GYIRM+RN G CGI MAS P+
Sbjct: 321 AVGYGTDAATGAAYWTVRNSWGPDWGENGYIRMERNVTARTGKCGIAMMASYPI 374
>gi|224114698|ref|XP_002316833.1| predicted protein [Populus trichocarpa]
gi|222859898|gb|EEE97445.1| predicted protein [Populus trichocarpa]
Length = 305
Score = 330 bits (846), Expect = 6e-88, Method: Compositional matrix adjust.
Identities = 166/306 (54%), Positives = 208/306 (67%), Gaps = 6/306 (1%)
Query: 45 IELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNK-EVTSYWLGLNEFADMSHEE 103
+E E+WM+++G+ YK EK R IFK N++ I+ NK Y L +NEFAD+++EE
Sbjct: 1 MERHETWMAQYGRAYKGHVEKERRLNIFKNNVEFIESFNKVGKKPYKLSVNEFADLTNEE 60
Query: 104 FKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTV 163
F+ G K + F Y +V A+P ++DWRKKGAVTP+K+QG CG CWAFS V
Sbjct: 61 FQASRNGYKMSAHLSSSSTKPFRYENVSAVPSTMDWRKKGAVTPIKDQGQCGCCWAFSAV 120
Query: 164 AAVEGINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKEEDYPY 222
AA EGI Q+ +G L SLSEQEL+DCDTS + GCNGGLMD AF +I+ + GL E +YPY
Sbjct: 121 AATEGITQLSTGKLISLSEQELVDCDTSGEDQGCNGGLMDDAFDFIIQNKGLTTEANYPY 180
Query: 223 LMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGV 282
+G C K I+GY+DVP N E +LLKA+A+QPVSVAI+A G+ FQFYS GV
Sbjct: 181 QGADGACNSGK---AAAKITGYEDVPANSEAALLKAVANQPVSVAIDAGGSAFQFYSSGV 237
Query: 283 FTGPCGAELDHGVAAVGYGKS-KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGIN 341
FTG CG +LDHGV AVGYG S G+ Y +VKNSWG WGE GYIRM+R+ EGLCGI
Sbjct: 238 FTGDCGTDLDHGVTAVGYGMSDDGTKYWLVKNSWGTSWGENGYIRMERDIDAQEGLCGIA 297
Query: 342 KMASIP 347
AS P
Sbjct: 298 MEASYP 303
>gi|115477767|ref|NP_001062479.1| Os08g0556900 [Oryza sativa Japonica Group]
gi|42407937|dbj|BAD09076.1| putative cysteine proteinase [Oryza sativa Japonica Group]
gi|113624448|dbj|BAF24393.1| Os08g0556900 [Oryza sativa Japonica Group]
gi|125562525|gb|EAZ07973.1| hypothetical protein OsI_30231 [Oryza sativa Indica Group]
gi|215701458|dbj|BAG92882.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 385
Score = 330 bits (846), Expect = 6e-88, Method: Compositional matrix adjust.
Identities = 166/323 (51%), Positives = 213/323 (65%), Gaps = 7/323 (2%)
Query: 33 YSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLG 92
+ + + S + L EL+E W +H + + + EK RF +FK+N++ I + N+ Y L
Sbjct: 33 FGDKDVASEEALWELYERWRGQH-RVARDLGEKARRFNVFKDNVRLIHEFNRRDEPYKLR 91
Query: 93 LNEFADMSHEEFKNKYLGLK----PQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPV 148
LN F DM+ +EF+ Y + F R + + F Y + LP +VDWR+KGAV V
Sbjct: 92 LNRFGDMTADEFRRAYASSRVSHHRMFRGRGERRSGFMYAGARDLPAAVDWREKGAVGAV 151
Query: 149 KNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNN-GCNGGLMDYAFKY 207
K+QG CGSCWAFST+AAVEGIN I + NLT+LSEQ+L+DCDT N GC+GGLMD AF+Y
Sbjct: 152 KDQGQCGSCWAFSTIAAVEGINAIRTSNLTALSEQQLVDCDTKTGNAGCDGGLMDNAFQY 211
Query: 208 IVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVA 267
I GG+ YPY + +C+ VTI GY+DVP N E +L KA+A+QPVSVA
Sbjct: 212 IAKHGGVAASSAYPYRARQSSCKSSAASSPAVTIDGYEDVPANSESALKKAVANQPVSVA 271
Query: 268 IEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKS-KGSDYIIVKNSWGPKWGERGYIR 326
IEA G+ FQFYS GVF G CG ELDHGVAAVGYG + G+ Y IV+NSWG WGE+GYIR
Sbjct: 272 IEAGGSHFQFYSEGVFAGKCGTELDHGVAAVGYGTTVDGTKYWIVRNSWGADWGEKGYIR 331
Query: 327 MKRNTGKPEGLCGINKMASIPLK 349
MKR+ EGLCGI AS P+K
Sbjct: 332 MKRDVSAKEGLCGIAMEASYPIK 354
>gi|312281697|dbj|BAJ33714.1| unnamed protein product [Thellungiella halophila]
Length = 347
Score = 330 bits (845), Expect = 7e-88, Method: Compositional matrix adjust.
Identities = 165/346 (47%), Positives = 233/346 (67%), Gaps = 10/346 (2%)
Query: 11 LLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIE-LFESWMSKHGKTYKCIEEKLHRF 69
+ S + +F SL F + L + +++ + WM+KHG+ Y ++EK +R+
Sbjct: 1 MASKQIQIFLIVSLISSFCLSITLSRPLDDNELIMQKRHDEWMAKHGRVYADMKEKNNRY 60
Query: 70 EIFKENLKHIDQRNKEVT--SYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQP---SAE 124
+FK N++ I++ N ++ L +N+FAD++++EF++ Y G K Q ++
Sbjct: 61 VVFKRNVERIERLNNVPAGRTFKLAVNQFADLTNDEFRSMYTGYKGGSVLSSQSGTKTSS 120
Query: 125 FSYRDVK--ALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSE 182
F Y++V ALP SVDWRKKGAVTP+KNQG+CG CWAFS VAA+EG +I G L SLSE
Sbjct: 121 FRYQNVSSGALPVSVDWRKKGAVTPIKNQGTCGCCWAFSAVAAIEGATKIKKGKLISLSE 180
Query: 183 QELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTIS 242
Q+L+DCDT+ + GC+GGLMD AF++I+A+GGL E +YPY ++ TC+ K + +I+
Sbjct: 181 QQLVDCDTN-DFGCSGGLMDTAFEHIMATGGLTTESNYPYKGKDATCKIKNTKPTATSIT 239
Query: 243 GYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGK 302
GY+DVP NDE++L+KA+AHQPVS+ IE G DFQFY GVFTG C LDH V AVGYG+
Sbjct: 240 GYEDVPVNDEKALMKAVAHQPVSIGIEGGGFDFQFYGSGVFTGECTTYLDHAVTAVGYGQ 299
Query: 303 -SKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
S GS Y I+KNSWG KWGE GY+R+K++ +GLCG+ AS P
Sbjct: 300 SSNGSKYWIIKNSWGTKWGESGYMRIKKDVKDKKGLCGLAMKASYP 345
>gi|1174171|gb|AAB41816.1| NTH1 [Pisum sativum]
Length = 367
Score = 330 bits (845), Expect = 9e-88, Method: Compositional matrix adjust.
Identities = 170/355 (47%), Positives = 229/355 (64%), Gaps = 23/355 (6%)
Query: 1 MAFFSHSKLL--LLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKT 58
MA +S +L L++LSLSL S S +++ ++E W+ KH K
Sbjct: 1 MASILYSLILFGLITLSLSLDMSSG---------------RSNKEVMTMYEKWLVKHQKV 45
Query: 59 YKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTR 118
Y + EK RF+IFK+NL ID+ N SY +GLNEF+D++++E+++ YL +
Sbjct: 46 YYGLGEKNQRFQIFKDNLIFIDEHNAPNHSYRVGLNEFSDITNKEYRDTYLSRWSNNNIK 105
Query: 119 RQ-PSAEFSYR--DVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSG 175
+ S ++Y+ LP SVDWR GA+TP+KNQGSCG+CWAFS VAAVE IN+IV+G
Sbjct: 106 NKITSVRYAYKAGHNNKLPVSVDWR--GALTPIKNQGSCGACWAFSAVAAVEAINKIVTG 163
Query: 176 NLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEE 235
+L SLSEQEL+DCD + N GCNGG A+++IV +GGL + DYPYL + TC K+
Sbjct: 164 SLVSLSEQELVDCDRTKNKGCNGGNQVNAYRFIVENGGLDSQIDYPYLGRQSTCNQAKKN 223
Query: 236 MEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGV 295
+VV+I+GY++V N E +L++A+A+QPVSV IEA G DFQ Y GVFTG CG LDH V
Sbjct: 224 TKVVSINGYKNVQRNSESALMEAVANQPVSVGIEAYGKDFQLYQSGVFTGSCGTSLDHAV 283
Query: 296 AAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPE-GLCGINKMASIPLK 349
VGYG G DY +VKNSWG WGERGY++++RN G CGI A+ P K
Sbjct: 284 VVVGYGSENGKDYWLVKNSWGTNWGERGYLKIERNLKNTNTGKCGIAMDATYPTK 338
>gi|146215984|gb|ABQ10194.1| actinidin Act2c [Actinidia arguta]
Length = 378
Score = 329 bits (844), Expect = 9e-88, Method: Compositional matrix adjust.
Identities = 168/355 (47%), Positives = 226/355 (63%), Gaps = 28/355 (7%)
Query: 1 MAFFSHSKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYK 60
M+ S LL+LS +L + + +D ++ +++ESW+ + GK+Y
Sbjct: 10 MSLLFFSTLLILSSALDIVNSAQRTND---------------QVRDMYESWLVEQGKSYN 54
Query: 61 CIEEKLHRFEIFKENLKHIDQRNKEVT-SYWLGLNEFADMSHEEFKNKYLGLKPQFPTRR 119
++EK RFEIFK+NL+ ID N + S+ LGLN FAD++ EE+++ YLG K
Sbjct: 55 SLDEKEMRFEIFKDNLRIIDDHNADANRSFSLGLNRFADLTDEEYRSTYLGFKSG----- 109
Query: 120 QPSAEFSYRDV----KALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSG 175
P A+ S R V LP VDWR GAV VKNQG C SCWAFS VAAVEGIN+I++G
Sbjct: 110 -PKAKVSNRYVPKVGDVLPNYVDWRTVGAVVGVKNQGLCSSCWAFSAVAAVEGINKIMTG 168
Query: 176 NLTSLSEQELIDC-DTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKE 234
NL SLSEQEL+DC T GCN G M AF++I+ +GG++ E++YPY ++G C +
Sbjct: 169 NLLSLSEQELVDCGRTQSTRGCNRGYMTDAFQFIINNGGINTEDNYPYTAQDGQCNRYLQ 228
Query: 235 EMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHG 294
+ VTI Y++VP N+E +L A+AHQPVSV +E+ G F+ Y+ G+FT CG +DHG
Sbjct: 229 NQKYVTIDDYENVPSNNEWALQNAVAHQPVSVGLESEGGKFKLYTSGIFTQYCGTAIDHG 288
Query: 295 VAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
V VGYG +G DY IVKNSWG WGE GYIR++RN G G CGI +MAS P+K
Sbjct: 289 VTIVGYGTERGLDYWIVKNSWGTNWGENGYIRIQRNIGGA-GKCGIARMASYPVK 342
>gi|357474725|ref|XP_003607647.1| Cysteine proteinase [Medicago truncatula]
gi|355508702|gb|AES89844.1| Cysteine proteinase [Medicago truncatula]
Length = 340
Score = 329 bits (844), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 165/309 (53%), Positives = 213/309 (68%), Gaps = 11/309 (3%)
Query: 44 LIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNK-EVTSYWLGLNEFADMSHE 102
L E E WMS++GK YK EK RF IFK+N++ I+ N + Y L +N AD++ +
Sbjct: 36 LQERHEQWMSEYGKLYKDAIEKEKRFMIFKDNVEFIESFNAADNKPYKLSVNHLADLTLD 95
Query: 103 EFK---NKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWA 159
EFK N Y + +F T F Y +V A+P++VDWR KGAVTP+K+QG CGSCWA
Sbjct: 96 EFKASRNGYKKIDREFAT-----TSFKYENVTAIPEAVDWRVKGAVTPIKDQGQCGSCWA 150
Query: 160 FSTVAAVEGINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKEE 218
FSTVAA+EGINQI +G L SLSEQEL+DCDT + GC GGLM+ F++I+ +GG+ E
Sbjct: 151 FSTVAAIEGINQITTGKLISLSEQELVDCDTKGEDQGCEGGLMEDGFEFIIKNGGITSET 210
Query: 219 DYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFY 278
+YPY +G+C + V I+GY+ VP N E SLLKA+A+QP+SV+I+AS + F FY
Sbjct: 211 NYPYKAADGSC-NTATTAPVAKITGYEKVPVNSEISLLKAVANQPISVSIDASDSSFMFY 269
Query: 279 SGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLC 338
S G++TG CG ELDHGV AVGYG + G+DY IVKNSWG WGE+GYIRM+R EGLC
Sbjct: 270 SSGIYTGECGTELDHGVTAVGYGSANGTDYWIVKNSWGTVWGEKGYIRMQRGIADKEGLC 329
Query: 339 GINKMASIP 347
GI +S P
Sbjct: 330 GIAMDSSYP 338
>gi|42567068|ref|NP_567686.2| putative cysteine proteinase [Arabidopsis thaliana]
gi|332659371|gb|AEE84771.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 356
Score = 329 bits (844), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 170/357 (47%), Positives = 233/357 (65%), Gaps = 13/357 (3%)
Query: 1 MAFFSHSKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTY- 59
M F + +L L L +F S+ + + S H S +++ +F+ WMSKHGKTY
Sbjct: 1 MGFVRPVCMTILFL-LIVFVLSAPSSAMDLPATSGGHNRSNEEVEFIFQMWMSKHGKTYT 59
Query: 60 KCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRR 119
+ EK RF+ FK+NL+ IDQ N + SY LGL FAD++ +E+++ L P P +
Sbjct: 60 NALGEKERRFQNFKDNLRFIDQHNAKNLSYQLGLTRFADLTVQEYRD----LFPGSPKPK 115
Query: 120 QPSAEFSYRDV----KALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSG 175
Q + + S R V LP+SVDWR++GAV+ +K+QG+C SCWAFSTVAAVEG+N+IV+G
Sbjct: 116 QRNLKTSRRYVPLAGDQLPESVDWRQEGAVSEIKDQGTCNSCWAFSTVAAVEGLNKIVTG 175
Query: 176 NLTSLSEQELIDCDTSFNNGCNG-GLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKE 234
L SLSEQEL+DC+ NNGC G GLMD AF++++ + GL E+DYPY +G+C K+
Sbjct: 176 ELISLSEQELVDCNL-VNNGCYGSGLMDTAFQFLINNNGLDSEKDYPYQGTQGSCNRKQS 234
Query: 235 -EMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDH 293
+V+TI Y+DVP NDE SL KA+AHQPVSV ++ +F Y ++ GPCG LDH
Sbjct: 235 TSNKVITIDSYEDVPANDEISLQKAVAHQPVSVGVDKKSQEFMLYRSCIYNGPCGTNLDH 294
Query: 294 GVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLKK 350
+ VGYG G DY IV+NSWG WG+ GYI++ RN P+GLCGI +AS P+K
Sbjct: 295 ALVIVGYGSENGQDYWIVRNSWGTTWGDAGYIKIARNFEDPKGLCGIAMLASYPIKN 351
>gi|302845628|ref|XP_002954352.1| hypothetical protein VOLCADRAFT_76255 [Volvox carteri f.
nagariensis]
gi|300260282|gb|EFJ44502.1| hypothetical protein VOLCADRAFT_76255 [Volvox carteri f.
nagariensis]
Length = 489
Score = 329 bits (843), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 165/309 (53%), Positives = 210/309 (67%), Gaps = 8/309 (2%)
Query: 48 FESWMSKHGKTYKC-IEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKN 106
F+ WM ++ K Y I+E RF ++ ENL +I N TS+WL LN FAD++ +EF+N
Sbjct: 45 FQQWMMQYTKAYANDIKELETRFSVWLENLNYILAYNARTTSHWLHLNAFADLTTDEFRN 104
Query: 107 KYLG--LKPQFPTRRQPSAEFSYRDVKA--LPKSVDWRKKGAVTPVKNQGSCGSCWAFST 162
+ LG K + + R S+ F Y +V A LP +DWRKKGAVT VKNQG CGSCWAF+T
Sbjct: 105 R-LGYDFKARQASNRLQSSPFIYDNVDANQLPTEIDWRKKGAVTEVKNQGQCGSCWAFAT 163
Query: 163 VAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPY 222
+VEGIN IV+G L SLSEQEL+DCDT + GC+GGLMDYA+++I+ +GGL E+DYPY
Sbjct: 164 TGSVEGINAIVTGELASLSEQELVDCDTDEDRGCSGGLMDYAYQWIIKNGGLDTEDDYPY 223
Query: 223 LMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGV 282
E+G C K+ VVTI GY D+PENDE +L KA AHQP++VAIEA FQ Y GGV
Sbjct: 224 TAEDGVCVAAKKNRRVVTIDGYVDIPENDEVALKKAAAHQPIAVAIEADAKSFQLYGGGV 283
Query: 283 FTGP-CGAELDHGVAAVGYGKSKG-SDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGI 340
+ P CG L+HGV VGYGK +Y IVKNSWGP+WG+ GYIR++ +G+CGI
Sbjct: 284 YDDPTCGTSLNHGVLVVGYGKDPHFGNYWIVKNSWGPEWGDNGYIRLRMGAEDVQGMCGI 343
Query: 341 NKMASIPLK 349
S P K
Sbjct: 344 AMAPSFPTK 352
>gi|414879123|tpg|DAA56254.1| TPA: hypothetical protein ZEAMMB73_708930 [Zea mays]
Length = 368
Score = 329 bits (843), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 177/325 (54%), Positives = 211/325 (64%), Gaps = 12/325 (3%)
Query: 33 YSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTS-YWL 91
+ L S + L +L+E W + H + ++ EK RF FKEN + I NK Y L
Sbjct: 27 FDERDLASDEALWDLYERWQTHH-RVHRHHGEKGRRFGTFKENARFIHAHNKRGDRPYRL 85
Query: 92 GLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAE-----FSYRDVKALPKSVDWRKKGAVT 146
LN F DM EEF++ + + RR+P+A F Y D LP+SVDWR+KGAVT
Sbjct: 86 RLNRFGDMGREEFRSGFADSRIN-DLRREPTAAPAVPGFMYDDATDLPRSVDWRQKGAVT 144
Query: 147 PVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFK 206
VKNQG CGSCWAFSTV AVEGIN I +G+L SLSEQELIDCDT NGC GGLM+ AF+
Sbjct: 145 AVKNQGRCGSCWAFSTVVAVEGINAIRTGSLVSLSEQELIDCDTD-ENGCQGGLMENAFE 203
Query: 207 YIVASGGLHKEEDYPYLMEEGTCEDKKEEM-EVVTISGYQDVPENDEQSLLKALAHQPVS 265
+I + GG+ E YPY GTC+ + VV I G+Q VP E +L KA+AHQPVS
Sbjct: 204 FIKSHGGITTESAYPYHASNGTCDGARARRGRVVAIDGHQAVPAGSEDALAKAVAHQPVS 263
Query: 266 VAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKS-KGSDYIIVKNSWGPKWGERGY 324
VAI+A G QFYS GVFTG CG +LDHGVAAVGYG S G+ Y IVKNSWGP WGE GY
Sbjct: 264 VAIDAGGQALQFYSEGVFTGDCGTDLDHGVAAVGYGVSDDGTPYWIVKNSWGPSWGEGGY 323
Query: 325 IRMKRNTGKPEGLCGINKMASIPLK 349
IRM+R TG GLCGI AS P+K
Sbjct: 324 IRMQRGTGN-GGLCGIAMEASFPIK 347
>gi|388512155|gb|AFK44139.1| unknown [Medicago truncatula]
Length = 340
Score = 329 bits (843), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 165/309 (53%), Positives = 212/309 (68%), Gaps = 11/309 (3%)
Query: 44 LIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNK-EVTSYWLGLNEFADMSHE 102
L E E WMS++GK YK EK RF IFK+N++ I+ N + Y L +N AD++ +
Sbjct: 36 LQERHEQWMSEYGKLYKDAIEKEKRFMIFKDNVEFIESFNAADNKPYKLSVNHLADLTLD 95
Query: 103 EFK---NKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWA 159
EFK N Y + +F T F Y +V A+P++VDWR KGAVTP+K+QG CGSCWA
Sbjct: 96 EFKASRNGYKKIDREFAT-----TSFKYENVTAIPEAVDWRVKGAVTPIKDQGQCGSCWA 150
Query: 160 FSTVAAVEGINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKEE 218
FSTVAA+EGINQI +G L SLSEQEL+DCDT + GC GGLM+ F++I+ +GG+ E
Sbjct: 151 FSTVAAIEGINQITTGKLISLSEQELVDCDTKGEDQGCEGGLMEDGFEFIIKNGGITSET 210
Query: 219 DYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFY 278
+YPY +G+C V I+GY+ VP N E SLLKA+A+QP+SV+I+AS + F FY
Sbjct: 211 NYPYKAADGSCS-AATTAPVAKITGYEKVPVNSEISLLKAVANQPISVSIDASDSSFMFY 269
Query: 279 SGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLC 338
S G++TG CG ELDHGV AVGYG + G+DY IVKNSWG WGE+GYIRM+R EGLC
Sbjct: 270 SSGIYTGECGTELDHGVTAVGYGSANGTDYWIVKNSWGTVWGEKGYIRMQRGIADKEGLC 329
Query: 339 GINKMASIP 347
GI +S P
Sbjct: 330 GIAMDSSYP 338
>gi|224083868|ref|XP_002307151.1| predicted protein [Populus trichocarpa]
gi|222856600|gb|EEE94147.1| predicted protein [Populus trichocarpa]
Length = 298
Score = 329 bits (843), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 167/311 (53%), Positives = 212/311 (68%), Gaps = 22/311 (7%)
Query: 44 LIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEV-TSYWLGLNEFADMSHE 102
+ E E WM+++G+ YK EK R+ IFKEN+ ID N + SY LG+N+FAD+S+E
Sbjct: 1 MYERHEQWMAQYGRVYKDDAEKETRYNIFKENVARIDAFNSQTGKSYNLGVNQFADLSNE 60
Query: 103 EFK---NKYLG--LKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSC 157
EFK N++ G PQ + F Y +V A+P ++DWRKKGAVTPVK+QG C
Sbjct: 61 EFKASRNRFKGHMCSPQ-------AGPFRYENVSAVPATMDWRKKGAVTPVKDQGQC--- 110
Query: 158 WAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHK 216
VAA+EGINQ+ +G L SLSEQE++DCDT + GCNGGLMD AFK+I + GL
Sbjct: 111 -----VAAMEGINQLTTGKLISLSEQEVVDCDTKGEDQGCNGGLMDDAFKFIEQNKGLTT 165
Query: 217 EEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQ 276
E +YPY +GTC +KE I+G+QDVP N E +L+KA+A QPVSVAI+A G +FQ
Sbjct: 166 EANYPYTGTDGTCNTQKEVSHAAKITGFQDVPANSEAALMKAVAKQPVSVAIDAGGFEFQ 225
Query: 277 FYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEG 336
FYS G+FTG CG ELDHGV AVGYG S G+ Y +VKNSWG +WGE GYIRM+++ EG
Sbjct: 226 FYSSGIFTGSCGTELDHGVTAVGYGGSDGTKYWLVKNSWGAQWGEEGYIRMQKDISAKEG 285
Query: 337 LCGINKMASIP 347
LCGI AS P
Sbjct: 286 LCGIAMQASYP 296
>gi|144905112|dbj|BAF56429.1| cysteine proteinase [Lotus japonicus]
Length = 341
Score = 328 bits (842), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 176/349 (50%), Positives = 224/349 (64%), Gaps = 12/349 (3%)
Query: 1 MAFFSHSKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYK 60
MA + K +L+L L L A S V H T LIE E WM+K+ K YK
Sbjct: 1 MASSTRQKQYILALFLLL------AVGISRVISRELHETETS-LIERHEQWMAKYDKVYK 53
Query: 61 CIEEKLHRFEIFKENLKHIDQRNKEVTS-YWLGLNEFADMSHEEFKNKYLGLKPQFPTRR 119
EK RF IFK+N++ I+ N Y LG+N AD++ EEFK GLK +
Sbjct: 54 DAAEKEKRFLIFKDNVEFIESFNAAGNKPYKLGVNHLADLTIEEFKASRNGLKRSYDYEV 113
Query: 120 QPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTS 179
++ F Y +V A+P SVDWRKKGAVTP+K+QG CGSCWAFSTVAA EGI++I +G L S
Sbjct: 114 GTTS-FKYENVTAIPASVDWRKKGAVTPIKDQGQCGSCWAFSTVAATEGIHKISTGKLVS 172
Query: 180 LSEQELIDCDT-SFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEV 238
LSEQEL+DCD + GC GG M+ F++I+ +GG+ E +YPY +G+C K
Sbjct: 173 LSEQELVDCDRKGTDQGCEGGYMEDGFEFIIKNGGITTEANYPYKAVDGSC--KNATAPA 230
Query: 239 VTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAV 298
I GY+ VP N E++LLKA+A+QPVSV+I+A+ F FYS G+FTG CG ELDHGV AV
Sbjct: 231 AQIKGYEKVPVNSEKALLKAVANQPVSVSIDAADGSFMFYSSGIFTGECGTELDHGVTAV 290
Query: 299 GYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
GYG++ G+DY IVKNSWG WGE+GYIRM+R EGLCGI +S P
Sbjct: 291 GYGRANGTDYWIVKNSWGTVWGEQGYIRMQRGIAAKEGLCGIAMDSSYP 339
>gi|30141023|dbj|BAC75925.1| cysteine protease-3 [Helianthus annuus]
Length = 348
Score = 328 bits (842), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 167/348 (47%), Positives = 228/348 (65%), Gaps = 14/348 (4%)
Query: 7 SKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKL 66
+K+ +LS+SL+LF DF+ + L + L +L+E W S+H + +EK
Sbjct: 4 NKVFVLSISLALFIGVVNCIDFT-----EKDLATDKSLWDLYERWGSQH-MVSRAPDEKK 57
Query: 67 HRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFK----NKYLGLKPQFPTRRQPS 122
RF +FK N+ HI++ N+ Y L LNEFADM++ EFK +K L + RRQ
Sbjct: 58 KRFNVFKYNVNHINRVNQLGKPYKLKLNEFADMTNHEFKAGFDSKILHFRMLKGKRRQ-- 115
Query: 123 AEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSE 182
F++ P S+DWR GAV P+KNQG CGSCWAFST+ VEGIN+I + L SLSE
Sbjct: 116 TPFTHAKTTDPPPSIDWRTNGAVNPIKNQGRCGSCWAFSTIVGVEGINKIKTNQLVSLSE 175
Query: 183 QELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTIS 242
QEL+DC+T GCNGGLM+ +++I +GG+ E+ YPY G C+ K VV I
Sbjct: 176 QELVDCETDCE-GCNGGLMENGYEFIKETGGVTTEQIYPYFARNGRCDISKRNSPVVKID 234
Query: 243 GYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGK 302
G+++VP NDE ++L+A+A+QPVS+AI+A G +FQFYS GVF G CG EL+HGVA VGYG
Sbjct: 235 GFENVPANDESAMLRAVANQPVSIAIDAGGLNFQFYSQGVFNGACGTELNHGVAIVGYGT 294
Query: 303 SK-GSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
++ G++Y IV+NSWG WGE+GY+RM+R PEGLCG+ AS P+K
Sbjct: 295 TQDGTNYWIVRNSWGTGWGEQGYVRMQRGVNVPEGLCGLAMDASYPIK 342
>gi|194352762|emb|CAQ00109.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
gi|326517250|dbj|BAJ99991.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 367
Score = 328 bits (842), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 177/324 (54%), Positives = 220/324 (67%), Gaps = 9/324 (2%)
Query: 33 YSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLG 92
+ L S D L L+E W +H + + EK RF +F+EN++ I + N+ Y L
Sbjct: 32 FGDHDLASEDSLWALYERWREQH-TVARDLGEKARRFNVFRENVRLIHEFNRGDAPYKLR 90
Query: 93 LNEFADMSHEEFKNKYLGLK---PQFPTRRQPSAEF---SYRDVKALPKSVDWRKKGAVT 146
LN F DM+ +EF+ Y + + + ++ F S V+ +P SVDWR+KGAVT
Sbjct: 91 LNRFGDMTADEFRRAYASSRVSHHRMFSLKEGGGGFMHGSAASVRDVPPSVDWRQKGAVT 150
Query: 147 PVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFK 206
VK+QG CGSCWAFST+AAVEGIN I S NLTSLSEQ+L+DCDT N GCNGGLMDYAF+
Sbjct: 151 AVKDQGQCGSCWAFSTIAAVEGINAIRSKNLTSLSEQQLVDCDTKSNAGCNGGLMDYAFQ 210
Query: 207 YIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSV 266
YI GG+ E+ YPY + + +KK VVTI GY+DVP NDE +L KA+A QPV+V
Sbjct: 211 YIAKHGGVAAEDAYPYKARQASSCNKKPSA-VVTIDGYEDVPANDETALKKAVAAQPVAV 269
Query: 267 AIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKS-KGSDYIIVKNSWGPKWGERGYI 325
AIEASG+ FQFYS GVF G CG ELDHGVAAVGYG + G+ Y IVKNSWGP+WGE+GYI
Sbjct: 270 AIEASGSHFQFYSEGVFAGKCGTELDHGVAAVGYGTTVDGTKYWIVKNSWGPEWGEKGYI 329
Query: 326 RMKRNTGKPEGLCGINKMASIPLK 349
RMKR+ EGLCGI AS P+K
Sbjct: 330 RMKRDVKDKEGLCGIAMEASYPVK 353
>gi|37780043|gb|AAP32194.1| cysteine protease 1 [Trifolium repens]
Length = 292
Score = 328 bits (841), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 163/292 (55%), Positives = 204/292 (69%), Gaps = 10/292 (3%)
Query: 63 EEKLHRFEIFKENLKHIDQRNKEVTS--YWLGLNEFADMSHEEF---KNKYLGLKPQFPT 117
+E+ R IF +N+ +I+ N V + Y L +N+FAD+++EEF +NK+ G
Sbjct: 2 QEREKRLRIFNKNVNYIEASNSAVNNKLYKLSINKFADLTNEEFIASRNKFKGHMCSSII 61
Query: 118 RRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNL 177
R + F Y + A+P +VDWRKKGAVTPVKNQG CGSCWAFS VAA EGI+Q+ +G L
Sbjct: 62 R---TTTFKYENASAIPSTVDWRKKGAVTPVKNQGQCGSCWAFSAVAATEGIHQLSTGKL 118
Query: 178 TSLSEQELIDCDT-SFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEM 236
SLSEQELIDCDT + GC GGLMD AFK+I+ + GL E YPY +GTC K +
Sbjct: 119 VSLSEQELIDCDTKGVDQGCEGGLMDDAFKFIIQNHGLSTEVQYPYEGVDGTCNANKASI 178
Query: 237 EVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVA 296
VTI+GY+DVP N+E +L KA+A+QP+SVAI+ASG+DFQFY+ GVFTG CG ELDHGV
Sbjct: 179 HAVTITGYEDVPANNELALQKAVANQPISVAIDASGSDFQFYNSGVFTGSCGTELDHGVT 238
Query: 297 AVGYG-KSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
AVGYG + G+ Y +VKNSWG WGE GYIRM+R EGLCGI AS P
Sbjct: 239 AVGYGVGNDGTKYWLVKNSWGADWGEEGYIRMQRGIAAAEGLCGIAMQASYP 290
>gi|5823020|gb|AAD53012.1|AF089849_1 senescence-specific cysteine protease [Brassica napus]
Length = 344
Score = 327 bits (839), Expect = 4e-87, Method: Compositional matrix adjust.
Identities = 159/304 (52%), Positives = 211/304 (69%), Gaps = 9/304 (2%)
Query: 51 WMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRN--KEVTSYWLGLNEFADMSHEEFKNKY 108
WM++HG+ Y EK +R+ +FK N++ I++ N + ++ L +N+FAD+++EEF++ Y
Sbjct: 41 WMTEHGRVYADANEKNNRYAVFKRNVERIERLNDVQSGLTFKLAVNQFADLTNEEFRSMY 100
Query: 109 LGLKPQ--FPTRRQPSAEFSYRDVK--ALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVA 164
G K +R +P++ F Y++V ALP SVDWRKKGAVTP+K+QG CGSCWAFS VA
Sbjct: 101 TGFKGNSVLSSRTKPTS-FRYQNVSSDALPVSVDWRKKGAVTPIKDQGLCGSCWAFSAVA 159
Query: 165 AVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLM 224
A+EG+ QI G L SLSEQEL+DCDT+ + GC GGLMD AF Y + GGL E +YPY
Sbjct: 160 AIEGVAQIKKGKLISLSEQELVDCDTN-DGGCMGGLMDTAFNYTITIGGLTSESNYPYKS 218
Query: 225 EEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFT 284
GTC K + +I G++DVP NDE++L+KA+AH PVS+ I FQFYS GVF+
Sbjct: 219 TNGTCNFNKTKQIATSIKGFEDVPANDEKALMKAVAHHPVSIGIAGGDIGFQFYSSGVFS 278
Query: 285 GPCGAELDHGVAAVGYGKSK-GSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKM 343
G C LDHGV AVGYG+SK G Y I+KNSWGPKWGERGY+R+K++ G CG+
Sbjct: 279 GECTTHLDHGVTAVGYGRSKNGLKYWILKNSWGPKWGERGYMRIKKDIKPKHGQCGLAMN 338
Query: 344 ASIP 347
AS P
Sbjct: 339 ASYP 342
>gi|302143416|emb|CBI21977.3| unnamed protein product [Vitis vinifera]
Length = 297
Score = 327 bits (838), Expect = 5e-87, Method: Compositional matrix adjust.
Identities = 166/300 (55%), Positives = 209/300 (69%), Gaps = 9/300 (3%)
Query: 52 MSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEV-TSYWLGLNEFADMSHEEFKNKYLG 110
M+++G+ YK EK RF+IFK+N+ I+ NK + +Y L +NEFAD+++EEF++
Sbjct: 1 MARYGRMYKDANEKEKRFKIFKDNVARIESFNKAMDKTYKLSINEFADLTNEEFRS---- 56
Query: 111 LKPQFPTRRQPSAE-FSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGI 169
L+ +F A F Y +V A+P ++DWRKKGAVTP+K+Q CG CWAFS VAA EGI
Sbjct: 57 LRNRFKAHICSEATTFKYENVTAVPSTIDWRKKGAVTPIKDQQQCGCCWAFSAVAATEGI 116
Query: 170 NQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGT 228
QI +G L SLSEQEL+DCDT N GC+GGLMD AF++I G L E YPY ++GT
Sbjct: 117 TQITTGKLISLSEQELVDCDTGGENQGCSGGLMDDAFRFIKIHG-LASEATYPYEGDDGT 175
Query: 229 CEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCG 288
C KKE I GY+DVP N+E++L KA+AHQPV+VAI+A G +FQFY+ GVFTG CG
Sbjct: 176 CNSKKEAHPAAKIKGYEDVPANNEKALQKAVAHQPVAVAIDAGGFEFQFYTSGVFTGQCG 235
Query: 289 AELDHGVAAVGYG-KSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
ELDHGVAAVGYG G Y +VKNSWG WGE GYIRM+R+ EGLCGI AS P
Sbjct: 236 TELDHGVAAVGYGIGDDGMMYWLVKNSWGTGWGEEGYIRMQRDVTAKEGLCGIAMQASYP 295
>gi|357166364|ref|XP_003580686.1| PREDICTED: oryzain alpha chain-like [Brachypodium distachyon]
Length = 360
Score = 327 bits (838), Expect = 5e-87, Method: Compositional matrix adjust.
Identities = 162/327 (49%), Positives = 220/327 (67%), Gaps = 11/327 (3%)
Query: 31 VGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKE----V 86
+ S + S ++ ++ W ++HG EE R+E F++NL++ID+ N +
Sbjct: 26 IASSSGQIRSEEETRRMYAEWTAQHGSPITNEEEG--RYEAFRDNLRYIDEHNAAADAGI 83
Query: 87 TSYWLGLNEFADMSHEEFKNKYLGLKPQ---FPTRRQPSAEFSYRDVKALPKSVDWRKKG 143
S+ LGLN FA +++EE++ YLGL+ + R+PSA + D +ALP+SVDWR+KG
Sbjct: 84 HSFRLGLNRFAGLTNEEYRAAYLGLRLRSGAVGDLRKPSARYEAADGEALPESVDWREKG 143
Query: 144 AVTPVKNQG-SCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMD 202
AV VK+QG SCGS WAFS +AAVE INQIV+G L SLSEQEL+DCDTS+N GC+GGLMD
Sbjct: 144 AVGKVKDQGRSCGSAWAFSAIAAVESINQIVTGELISLSEQELMDCDTSYNAGCDGGLMD 203
Query: 203 YAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQ 262
AF++I+++GG+ +EDYPY +C+ K + VTI Y+D+ N E+SL KA+++Q
Sbjct: 204 DAFEFIISNGGIDTDEDYPYKARNDSCDANKRNRKAVTIDDYEDLRMN-EKSLQKAVSNQ 262
Query: 263 PVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGER 322
PVSVAIEA G DFQ Y G+FTG CG +LDH VGYG G+DY IVK S+G WGE
Sbjct: 263 PVSVAIEAGGRDFQLYKSGIFTGTCGTDLDHATTIVGYGSENGTDYWIVKESYGTSWGES 322
Query: 323 GYIRMKRNTGKPEGLCGINKMASIPLK 349
GY RM+RN + G CGI + S P+K
Sbjct: 323 GYARMERNIKETSGKCGIAMLPSYPVK 349
>gi|357133074|ref|XP_003568153.1| PREDICTED: cysteine proteinase RD21a-like [Brachypodium distachyon]
Length = 565
Score = 327 bits (838), Expect = 6e-87, Method: Compositional matrix adjust.
Identities = 159/315 (50%), Positives = 202/315 (64%), Gaps = 13/315 (4%)
Query: 47 LFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT---------SYWLGLNEFA 97
LFE+W ++HGK Y E+ R F +N + N SY L LN FA
Sbjct: 41 LFEAWCAEHGKAYASPGERAARLAAFADNAAFVAAHNAGGGGAGGSNAAPSYTLALNAFA 100
Query: 98 DMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRD---VKALPKSVDWRKKGAVTPVKNQGSC 154
D++H EF+ LG + R P +E + V A+P+++DWR+ GAVT VK+QGSC
Sbjct: 101 DLTHAEFRAARLG-RLAVGGARAPPSEGGFAGSVGVGAVPEALDWRQSGAVTKVKDQGSC 159
Query: 155 GSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGL 214
G+CW+FS A+EGIN+I +G+L SLSEQELIDCD S+N GC GGLMDYA+++++ +GG+
Sbjct: 160 GACWSFSATGAIEGINKIKTGSLISLSEQELIDCDRSYNAGCGGGLMDYAYRFVIKNGGI 219
Query: 215 HKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTD 274
E+DYPY +GTC K + VVTI GY DVP N E SLL+A+A QP+SV I S
Sbjct: 220 DTEDDYPYREADGTCNKNKLKRHVVTIDGYSDVPANKEDSLLQAVAQQPISVGICGSARA 279
Query: 275 FQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKP 334
FQ YS G+F GPC LDH V VGYG G DY IVKNSWG +WG +GY+ M RNTG
Sbjct: 280 FQLYSQGIFDGPCPTSLDHAVLIVGYGSEGGKDYWIVKNSWGERWGMKGYMHMHRNTGSS 339
Query: 335 EGLCGINKMASIPLK 349
G+CGIN MAS P K
Sbjct: 340 SGICGINMMASFPTK 354
>gi|302764466|ref|XP_002965654.1| hypothetical protein SELMODRAFT_230713 [Selaginella moellendorffii]
gi|300166468|gb|EFJ33074.1| hypothetical protein SELMODRAFT_230713 [Selaginella moellendorffii]
Length = 345
Score = 327 bits (837), Expect = 6e-87, Method: Compositional matrix adjust.
Identities = 162/323 (50%), Positives = 216/323 (66%), Gaps = 12/323 (3%)
Query: 35 PEHLTS------MDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRN-KEVT 87
P HL + +D L ++++ W+ +HGK Y E RF+IFKEN+ +I+ N +
Sbjct: 19 PIHLLTRISWHFIDPLWQVYQKWIQEHGKAYNSAHEYKKRFQIFKENVNYINSHNARRNN 78
Query: 88 SYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTP 147
S+ LGLN+FAD+++ EF+ Y+G + Q P + + V SVDWRKKG VT
Sbjct: 79 SHSLGLNKFADLTNSEFRGLYVG-RLQRPAPFHEVGDIAL--VADTATSVDWRKKGGVTE 135
Query: 148 VKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKY 207
+K+QG CGSCWAFS VAAVEG+ + +G L SLSEQEL+DCDT+ N GC+GG+MDYAF+Y
Sbjct: 136 IKDQGDCGSCWAFSAVAAVEGLTFLSTGTLVSLSEQELVDCDTTVNQGCDGGIMDYAFQY 195
Query: 208 IVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVA 267
++ +GG+ + +YPY G C+ K + TI+G+Q +P E+ LL+A+A+QPVSVA
Sbjct: 196 MIRNGGITSQSNYPYRALRGACDKDKVKYHAATINGFQAIPPQSEELLLRAVANQPVSVA 255
Query: 268 IEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGK-SKGSDYIIVKNSWGPKWGERGYIR 326
IEA G DFQ YS GVFTG CG+ LDHGVA VGYG + G Y +VKNSWG WGE GY+R
Sbjct: 256 IEAGGQDFQLYSSGVFTGECGSNLDHGVAIVGYGTDAGGRQYWLVKNSWGSGWGESGYVR 315
Query: 327 MKRNTGKPEGLCGINKMASIPLK 349
M+R G G+CGIN AS P K
Sbjct: 316 MERQ-GPGAGVCGINLDASYPTK 337
>gi|319826926|gb|ADV74756.1| cysteine protease [Lactuca sativa]
Length = 363
Score = 327 bits (837), Expect = 6e-87, Method: Compositional matrix adjust.
Identities = 162/309 (52%), Positives = 212/309 (68%), Gaps = 4/309 (1%)
Query: 44 LIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRN-KEVTSYWLGLNEFADMSHE 102
+I E WM+ HG+ Y EK RF+IFK N+ +ID N + SY L +N+FAD++++
Sbjct: 51 MIARHEQWMAHHGRIYTDENEKQLRFQIFKNNVAYIDAHNARSDQSYTLEVNKFADLTND 110
Query: 103 EFKNKYLGLKPQFPTRRQP-SAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFS 161
EF+ G K Q + S F Y +V A+P VDWRK+GAVTPVK+QG CG CWAFS
Sbjct: 111 EFRASRNGYKKQPDSDSHVVSGLFRYANVSAVPDEVDWRKEGAVTPVKDQGDCGCCWAFS 170
Query: 162 TVAAVEGINQIVSGNLTSLSEQELIDCDTS-FNNGCNGGLMDYAFKYIVASGGLHKEEDY 220
VAA+EGIN++ +G L SLSEQEL+DCD + GC GGLM+ AF++I GL E Y
Sbjct: 171 AVAAMEGINKLENGKLVSLSEQELVDCDIDGIDQGCEGGLMENAFQFIEKRKGLAAESVY 230
Query: 221 PYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSG 280
PY E+G C KK + ISG++ VP N+E++LL+A+A+QPVS+AI+ASG +FQFYSG
Sbjct: 231 PYTGEDGICNTKKAAIPAAKISGHEKVPANNEKALLQAVANQPVSIAIDASGYEFQFYSG 290
Query: 281 GVFTGPCGAELDHGVAAVGYGKS-KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCG 339
GVFTG CG ELDH + AVGYG + G+ Y ++KNSWG WGE GYIR+KR++ EGLCG
Sbjct: 291 GVFTGSCGTELDHAITAVGYGATMDGTKYWLMKNSWGASWGENGYIRIKRDSLAKEGLCG 350
Query: 340 INKMASIPL 348
I S P+
Sbjct: 351 IAMDPSYPV 359
>gi|357167196|ref|XP_003581047.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like
[Brachypodium distachyon]
Length = 338
Score = 326 bits (836), Expect = 8e-87, Method: Compositional matrix adjust.
Identities = 162/341 (47%), Positives = 211/341 (61%), Gaps = 10/341 (2%)
Query: 12 LSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIEL-FESWMSKHGKTYKCIEEKLHRFE 70
+ +L C+ F++ L D LI E WM+++G+ Y + EK R E
Sbjct: 1 MGFLFALVVCT-----FALGALGARDLADDDWLIAARHEQWMARYGRVYSDVAEKARRLE 55
Query: 71 IFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDV 130
+FK N+ I+ N +WL N+FAD++ +EF+ + G K Q + + F Y +V
Sbjct: 56 VFKANVGFIESVNAGNHKFWLEANQFADITKDEFRAMHKGYKMQVIGSKARATGFRYANV 115
Query: 131 KA--LPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDC 188
LP SVDWR GAVTPVK+QG CG CWAFSTVA++EGI ++ +G L SLSEQEL+DC
Sbjct: 116 SIDDLPASVDWRANGAVTPVKDQGQCGCCWAFSTVASMEGIVKVSTGKLISLSEQELVDC 175
Query: 189 DTSFNN-GCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDV 247
D N GC GGLMD AF++IV +GGL E DYPY +GTC KE +I GY+DV
Sbjct: 176 DVGMQNKGCGGGLMDNAFEFIVNNGGLDTEADYPYTGADGTCNSNKESNIAASIKGYEDV 235
Query: 248 PENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYG-KSKGS 306
P NDE SL KA+A QPVS+A++ F+FY GGV TG CG ELDHGVAAVGYG G+
Sbjct: 236 PANDEASLQKAVAAQPVSIAVDGGDDLFRFYKGGVLTGACGTELDHGVAAVGYGVAGDGT 295
Query: 307 DYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
Y +VKNSWG WGE G+IR++R+ G+CG+ S P
Sbjct: 296 KYWLVKNSWGTSWGEDGFIRLERDVADEAGMCGLAMKPSYP 336
>gi|224076972|ref|XP_002305074.1| predicted protein [Populus trichocarpa]
gi|224106329|ref|XP_002333698.1| predicted protein [Populus trichocarpa]
gi|222837984|gb|EEE76349.1| predicted protein [Populus trichocarpa]
gi|222848038|gb|EEE85585.1| predicted protein [Populus trichocarpa]
Length = 307
Score = 326 bits (836), Expect = 9e-87, Method: Compositional matrix adjust.
Identities = 158/307 (51%), Positives = 214/307 (69%), Gaps = 5/307 (1%)
Query: 44 LIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQ-RNKEVTSYWLGLNEFADMSHE 102
+++ E WM++HG+ Y ++EK R+ IFKEN++ I+ N Y LG+N+FAD+++E
Sbjct: 1 MLKRHEEWMAQHGRVYGDMKEKEKRYLIFKENIERIEAFNNGSDRGYKLGVNKFADLTNE 60
Query: 103 EFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFST 162
EF+ + G K Q + + S+ F + ++ A+P S+DWRK GAVTPVK+QG+CG CWAFS
Sbjct: 61 EFRAMHHGYKRQ--SSKLMSSSFRHENLSAIPTSMDWRKAGAVTPVKDQGTCGCCWAFSA 118
Query: 163 VAAVEGINQIVSGNLTSLSEQELIDCDT-SFNNGCNGGLMDYAFKYIVASGGLHKEEDYP 221
VAA+EGI ++ +G L SLSEQ+L+DCD + GC GGLMD AF++I+ +GGL E YP
Sbjct: 119 VAAIEGIIKLKTGKLISLSEQQLVDCDVKGVDQGCGGGLMDNAFQFILRNGGLTSEATYP 178
Query: 222 YLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGG 281
Y +GTC+ KK I+GY+DVP N+E +LL+A+A QPVSVA+E G DFQFY G
Sbjct: 179 YQGVDGTCKSKKTASIEAKITGYEDVPVNNENALLQAVAKQPVSVAVEGGGYDFQFYKSG 238
Query: 282 VFTGPCGAELDHGVAAVGYG-KSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGI 340
VF G CG LDH V A+GYG S G++Y +VKNSWG WGE GY+RM+R G EGLCG+
Sbjct: 239 VFKGDCGTYLDHAVTAIGYGTNSDGTNYWLVKNSWGTSWGESGYMRMQRGIGAREGLCGV 298
Query: 341 NKMASIP 347
AS P
Sbjct: 299 AMDASYP 305
>gi|242088413|ref|XP_002440039.1| hypothetical protein SORBIDRAFT_09g024940 [Sorghum bicolor]
gi|241945324|gb|EES18469.1| hypothetical protein SORBIDRAFT_09g024940 [Sorghum bicolor]
Length = 463
Score = 326 bits (836), Expect = 1e-86, Method: Compositional matrix adjust.
Identities = 162/315 (51%), Positives = 204/315 (64%), Gaps = 16/315 (5%)
Query: 47 LFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT---------SYWLGLNEFA 97
LF++W ++HGK Y EE+ R +F +N + N V SY L LN FA
Sbjct: 40 LFDAWCAEHGKAYATPEERAARLAVFADNAAFVAAHNARVNAAGGGGAPPSYTLALNAFA 99
Query: 98 DMSHEEFKNKYLG-LKPQFPTRRQPSAEFSYRDVK----ALPKSVDWRKKGAVTPVKNQG 152
D++HEEF+ LG + R P+A YR + A+P ++DWR+ GAVT VK+QG
Sbjct: 100 DLTHEEFRAARLGRIAAGAAALRSPAAPV-YRGLDGGLGAVPDALDWRENGAVTKVKDQG 158
Query: 153 SCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASG 212
SCG+CW+FS A+EGIN+I +G+L SLSEQELIDCD S+N+GC GGLMDYA+K++V +G
Sbjct: 159 SCGACWSFSATGAMEGINKIKTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYKFVVKNG 218
Query: 213 GLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASG 272
G+ EEDYPY +GTC K + +VTI GY DVP N E LL+A+A QPVSV I S
Sbjct: 219 GIDTEEDYPYREADGTCNKNKLKKRIVTIDGYSDVPSNKEDLLLQAVAQQPVSVGICGSA 278
Query: 273 TDFQFYS-GGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNT 331
FQ YS G+F GPC LDH V VGYG G DY IVKNSWG WG +GY+ M RNT
Sbjct: 279 RAFQLYSQQGIFDGPCPTSLDHAVLIVGYGSEGGKDYWIVKNSWGESWGMKGYMHMHRNT 338
Query: 332 GKPEGLCGINKMASI 346
G +G+CGIN MAS
Sbjct: 339 GDSKGVCGINMMASF 353
>gi|242072398|ref|XP_002446135.1| hypothetical protein SORBIDRAFT_06g002170 [Sorghum bicolor]
gi|241937318|gb|EES10463.1| hypothetical protein SORBIDRAFT_06g002170 [Sorghum bicolor]
Length = 338
Score = 326 bits (835), Expect = 1e-86, Method: Compositional matrix adjust.
Identities = 164/338 (48%), Positives = 225/338 (66%), Gaps = 16/338 (4%)
Query: 16 LSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKEN 75
L++ C+SL + L+ ++E E+WM ++G+ YK EK RFE+FK+N
Sbjct: 9 LAILGCASLCSSV----LAARELSDA-AMVERHENWMVEYGRVYKDAAEKARRFEVFKDN 63
Query: 76 LKHIDQRNKEVTS-YWLGLNEFADMSHEEFK-NKYLGLKPQFPTRRQPSAEFSYRD--VK 131
+ ++ N + +WLG+N+FAD++ EEFK NK G KP + P+ F Y + V
Sbjct: 64 VAFVESFNTNKNNKFWLGINQFADLTIEEFKANK--GFKP-ISAEKVPTTGFKYENLSVS 120
Query: 132 ALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDT- 190
ALP +VDWR KGAVTP+KNQG CG CWAFS VAA+EGI ++ +GNL SLSEQEL+DCDT
Sbjct: 121 ALPTAVDWRTKGAVTPIKNQGQCGCCWAFSAVAAMEGIVKLSTGNLISLSEQELVDCDTH 180
Query: 191 SFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPEN 250
S + GC GG MD AF++++ +GGL YPY +G C K TI G++DVP N
Sbjct: 181 SMDEGCEGGWMDSAFEFVIKNGGLATVSSYPYKAVDGKC--KGGSKSAATIKGHEDVPVN 238
Query: 251 DEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYG-KSKGSDYI 309
DE +L+KA+A+QPVSVA++AS F YSGGV TG CG ELDHG+AA+GYG +S G+ Y
Sbjct: 239 DEAALMKAVANQPVSVAVDASDRTFMLYSGGVMTGSCGTELDHGIAAIGYGVESDGTKYW 298
Query: 310 IVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
I+KNSWG WGE+G++RM+++ +G+CG+ S P
Sbjct: 299 ILKNSWGTTWGEKGFLRMEKDISDKQGMCGLAMKPSYP 336
>gi|159479072|ref|XP_001697622.1| cysteine endopeptidase [Chlamydomonas reinhardtii]
gi|158274232|gb|EDP00016.1| cysteine endopeptidase [Chlamydomonas reinhardtii]
Length = 469
Score = 326 bits (835), Expect = 1e-86, Method: Compositional matrix adjust.
Identities = 163/310 (52%), Positives = 209/310 (67%), Gaps = 8/310 (2%)
Query: 48 FESWMSKHGKTY-KCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKN 106
F+ W H ++Y + E +RF+++ ENL+++ N TS+WL LN AD+S E+K+
Sbjct: 13 FKEWAQTHSRSYVNDVAEFENRFKVWLENLEYVLAYNARTTSHWLTLNHLADLSTPEYKS 72
Query: 107 KYLGLKPQFPT-RRQPSAEFSYRDV--KALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTV 163
K LG Q R + F Y DV +ALP ++DWRKK AV VKNQG CGSCWAF+T
Sbjct: 73 KLLGFDNQARVARNKLKTGFRYEDVDAEALPPAIDWRKKNAVAEVKNQGQCGSCWAFATT 132
Query: 164 AAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYL 223
+VEGIN IV+G+L SLSEQEL+DCDT + GC+GGLMDYA+ +I+ + G++ EEDYPY
Sbjct: 133 GSVEGINAIVTGSLVSLSEQELVDCDTEQDKGCSGGLMDYAYAWIIKNKGINTEEDYPYT 192
Query: 224 MEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVF 283
+G C+ K + VVTI Y+DVPENDE +L KA AHQPV+VAIEA FQ Y GGV+
Sbjct: 193 AMDGQCDVAKMKRRVVTIDSYEDVPENDEVALKKAAAHQPVAVAIEADAKSFQLYGGGVY 252
Query: 284 TGP-CGAELDHGVAAVGYGKS---KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCG 339
P CG L+HGV VGYGK GS+Y IVKNSWG +WG+ GYIR+K + EGLCG
Sbjct: 253 DDPTCGTSLNHGVLVVGYGKDVTGSGSNYWIVKNSWGAEWGDAGYIRLKMGSTDAEGLCG 312
Query: 340 INKMASIPLK 349
I S P+K
Sbjct: 313 IAMAPSYPVK 322
>gi|356543116|ref|XP_003540009.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 337
Score = 325 bits (834), Expect = 1e-86, Method: Compositional matrix adjust.
Identities = 172/342 (50%), Positives = 217/342 (63%), Gaps = 15/342 (4%)
Query: 8 KLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLH 67
K +L+L L L C+S + H SM E E WM K+GK YK EK
Sbjct: 7 KQHILALVLLLSICTSQVMSRYL------HEASMS---ERHEQWMKKYGKVYKDAAEKQK 57
Query: 68 RFEIFKENLKHIDQRNKEVTS-YWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFS 126
R IFK+N++ I+ N Y LG+N AD ++EEF + G K + + P F
Sbjct: 58 RLLIFKDNVEFIESFNAAGNKPYKLGINHLADQTNEEFVASHNGYKHKASHSQTP---FK 114
Query: 127 YRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELI 186
Y +V +P +VDWR+ GAVT VK+QG CGSCWAFSTVAA EGI QI + L SLSEQEL+
Sbjct: 115 YENVTGVPNAVDWRENGAVTAVKDQGQCGSCWAFSTVAATEGIYQITTSMLMSLSEQELV 174
Query: 187 DCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQD 246
DCD S ++GC+GG M+ F++I+ +GG+ E +YPY +GTC+ KE I GY+
Sbjct: 175 DCD-SVDHGCDGGYMEGGFEFIIKNGGISSEANYPYTAVDGTCDANKEASPAAQIKGYET 233
Query: 247 VPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKS-KG 305
VP N E +L KA+A+QPVSV I+A G+ FQFYS GVFTG CG +LDHGV AVGYG + G
Sbjct: 234 VPANSEDALQKAVANQPVSVTIDAGGSAFQFYSSGVFTGQCGTQLDHGVTAVGYGSTDDG 293
Query: 306 SDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
+ Y IVKNSWG +WGE GYIRM+R T EGLCGI AS P
Sbjct: 294 TQYWIVKNSWGTQWGEEGYIRMQRGTDAQEGLCGIAMDASYP 335
>gi|242072394|ref|XP_002446133.1| hypothetical protein SORBIDRAFT_06g002160 [Sorghum bicolor]
gi|241937316|gb|EES10461.1| hypothetical protein SORBIDRAFT_06g002160 [Sorghum bicolor]
Length = 338
Score = 325 bits (834), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 165/338 (48%), Positives = 223/338 (65%), Gaps = 16/338 (4%)
Query: 16 LSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKEN 75
L++ C+SL + L+ ++E E+WM ++G+ YK EK RFE FK N
Sbjct: 9 LAILGCASLCSSV----LAARELSDA-AMVERHENWMVEYGRVYKDAAEKARRFEAFKHN 63
Query: 76 LKHIDQRN-KEVTSYWLGLNEFADMSHEEFK-NKYLGLKPQFPTRRQPSAEFSYRD--VK 131
+ ++ N + +WLG+N+FAD++ EEFK NK G KP P+ F Y + V
Sbjct: 64 VAFVESFNTNKKNKFWLGVNQFADLTTEEFKANK--GFKP-ISAEMVPTTGFKYENLSVS 120
Query: 132 ALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDT- 190
ALP +VDWR KGAVTP+KNQG CG CWAFS VAA+EGI ++ +GNL SLSEQEL+DCDT
Sbjct: 121 ALPTAVDWRTKGAVTPIKNQGQCGCCWAFSAVAAMEGIVKLSTGNLISLSEQELVDCDTH 180
Query: 191 SFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPEN 250
S + GC GG MD AF++++ +GGL E YPY +G C K TI G++DVP N
Sbjct: 181 SMDEGCEGGWMDSAFEFVIKNGGLATESSYPYKAVDGKC--KGGSKSAATIKGHEDVPVN 238
Query: 251 DEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYG-KSKGSDYI 309
DE +L+KA+A+QPVSVA++AS F YSGGV TG CG ELDHG+AA+GYG +S G+ Y
Sbjct: 239 DEAALMKAVANQPVSVAVDASDRTFMLYSGGVMTGSCGTELDHGIAAIGYGVESDGTKYW 298
Query: 310 IVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
I+KNSWG WGE+G++RM+++ +G+CG+ S P
Sbjct: 299 ILKNSWGTTWGEKGFLRMEKDISDKQGMCGLAMKPSYP 336
>gi|297799636|ref|XP_002867702.1| hypothetical protein ARALYDRAFT_329301 [Arabidopsis lyrata subsp.
lyrata]
gi|297313538|gb|EFH43961.1| hypothetical protein ARALYDRAFT_329301 [Arabidopsis lyrata subsp.
lyrata]
Length = 357
Score = 325 bits (833), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 161/321 (50%), Positives = 215/321 (66%), Gaps = 12/321 (3%)
Query: 37 HLTSMDKLIELFESWMSKHGKTY-KCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNE 95
H S +++ +F+ WMSKHGKTY + EK RF+ FK+NL+ IDQ N + SY LGL
Sbjct: 37 HNRSNEEVGFIFQMWMSKHGKTYTNALGEKERRFQNFKDNLRFIDQHNAKNLSYQLGLTR 96
Query: 96 FADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYR----DVKALPKSVDWRKKGAVTPVKNQ 151
FAD++ +E+++ L P P +Q + S R D LP+SVDWR +GAV+ +K+Q
Sbjct: 97 FADLTVQEYRD----LFPGSPKPKQRNLRISRRYVPLDGDQLPESVDWRNEGAVSAIKDQ 152
Query: 152 GSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNG-GLMDYAFKYIVA 210
G+C SCWAFSTVAAVEGIN+IV+G L SLSEQEL+DC+ NNGC G G MD AF++++
Sbjct: 153 GTCNSCWAFSTVAAVEGINKIVTGELVSLSEQELVDCNL-VNNGCYGSGTMDAAFQFLIN 211
Query: 211 SGGLHKEEDYPYLMEEGTCEDKKE-EMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIE 269
+GGL + DYPY +G C K+ +++TI Y+DVP NDE SL KA+AHQPVSV ++
Sbjct: 212 NGGLDSDTDYPYQGSQGYCNRKESTSNKIITIDSYEDVPANDEISLQKAVAHQPVSVGVD 271
Query: 270 ASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKR 329
+F Y G++ GPCG +LDH + VGYG G DY IV+NSWG WG+ GY +M R
Sbjct: 272 KKSQEFMLYRSGIYNGPCGTDLDHALVIVGYGSENGQDYWIVRNSWGTTWGDAGYAKMAR 331
Query: 330 NTGKPEGLCGINKMASIPLKK 350
N P G+CGI +AS P+K
Sbjct: 332 NFEYPSGVCGIAMLASYPVKN 352
>gi|356554921|ref|XP_003545789.1| PREDICTED: LOW QUALITY PROTEIN: thiol protease SEN102-like [Glycine
max]
Length = 439
Score = 325 bits (832), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 164/341 (48%), Positives = 220/341 (64%), Gaps = 16/341 (4%)
Query: 14 LSLSLFACSSLAHDFSIVGYSPEHLTSMD-KLIELFESWMSKHGKTYKCIEEKLHRFEIF 72
+SL++ C++ + + T D + E E WM++HGK YK E+ RF IF
Sbjct: 106 ISLAMLLCTAF------LAFQVTCCTLQDASMYERHEQWMTRHGKVYKDPREREKRFRIF 159
Query: 73 KENLKHIDQRNKEVTS-YWLGLNEFADMSHEEF---KNKYLGLKPQFPTRRQPSAEFSYR 128
EN+ +++ N Y LG+N+F D++++EF +N++ G R + F Y
Sbjct: 160 NENVNYVEAFNNAANKPYKLGINQFXDLTNQEFIAPRNRFKGHMCSSIIR---TTTFKYE 216
Query: 129 DVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDC 188
+V +P +VDWR+ GAVTPVK+QG CG CWAFS VAA EGI+ + G L SLSEQEL+DC
Sbjct: 217 NVTTVPSTVDWRQNGAVTPVKDQGQCGCCWAFSAVAATEGIHALSGGKLISLSEQELVDC 276
Query: 189 DTS-FNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDV 247
DT + GC GGLMD A+K+I+ + GL+ E +YPY +G C + TI+GY+DV
Sbjct: 277 DTKGVDQGCEGGLMDDAYKFIIQNHGLNTEANYPYKGVDGKCNANEAANHAATITGYEDV 336
Query: 248 PENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSK-GS 306
P N+E++L KA+A+QPVSVAI+AS +DFQFY G FTG CG ELDHGV AVGYG S G+
Sbjct: 337 PANNEKALQKAVANQPVSVAIDASSSDFQFYKSGAFTGSCGTELDHGVTAVGYGVSDHGT 396
Query: 307 DYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
Y +VKNSWG +WGE GYIRM+R EG+CGI AS P
Sbjct: 397 KYWLVKNSWGTEWGEEGYIRMQRGVDSEEGVCGIAMQASYP 437
>gi|225446523|ref|XP_002275891.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP2 [Vitis vinifera]
Length = 358
Score = 325 bits (832), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 163/321 (50%), Positives = 213/321 (66%), Gaps = 5/321 (1%)
Query: 33 YSPEHL---TSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSY 89
+S EH + M + + +E W+ +HG+ YK +E F I++ N++ I+ N + S+
Sbjct: 27 FSEEHEPMESEMSDMEKRYERWLVQHGRRYKNRDEWQRHFGIYQSNVRFINYINAQNFSF 86
Query: 90 WLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVK 149
L N+FADM++EE+K Y+GL +R+ S+ F K LP SVDWRK GAVTPV+
Sbjct: 87 TLTDNQFADMTNEEYKALYMGLGTSETSRKNQSS-FKRERSKVLPISVDWRKMGAVTPVR 145
Query: 150 NQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDT-SFNNGCNGGLMDYAFKYI 208
NQG CGSCWAFSTVAAVEGIN+I +G L SLSEQEL+DCD S N GCNGG M AFK+I
Sbjct: 146 NQGECGSCWAFSTVAAVEGINKIRTGKLVSLSEQELLDCDIDSGNEGCNGGYMVNAFKFI 205
Query: 209 VASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAI 268
+GG+ +YPY+ E+G C K VV ISGY+ VP N+E+ L A+A QPVSVAI
Sbjct: 206 KQNGGITTARNYPYIGEQGICNKDKAANHVVKISGYETVPPNNEKILQAAVAKQPVSVAI 265
Query: 269 EASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMK 328
+A G +FQ YS G+F G CG +L+H V +GYG+ G Y +VKNSWG WGE GY RM
Sbjct: 266 DAGGYEFQLYSKGIFNGFCGKQLNHAVTVIGYGEDNGKKYWLVKNSWGTGWGEAGYARMI 325
Query: 329 RNTGKPEGLCGINKMASIPLK 349
R++ EG+CGI AS P+K
Sbjct: 326 RDSRDDEGICGIAMEASYPIK 346
>gi|600111|emb|CAA84378.1| cysteine proteinase [Vicia sativa]
Length = 359
Score = 324 bits (831), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 178/349 (51%), Positives = 232/349 (66%), Gaps = 15/349 (4%)
Query: 8 KLLLLSLSLSLFACSSLAHDFSIVGYSPEH-LTSMDKLIELFESWMSKHGKTYKCIEEKL 66
KLL +SLSL+L + DF+ EH L S L L+E W S H T + ++EK
Sbjct: 5 KLLFISLSLALIFTVANTFDFN------EHDLESEKSLWNLYERWRSHHTVT-RNLDEKH 57
Query: 67 HRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLK----PQFPTRRQPS 122
+RF +FK N+ H+ NK Y L LN+F DM++ EF+ Y K F +
Sbjct: 58 NRFNVFKANVMHVHNTNKLDKPYKLKLNKFGDMTNYEFRRIYADSKISHHRMFRGMSHEN 117
Query: 123 AEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSE 182
F Y + +P S+DWR KGAVT VK+QG CGSCWAFST+AAVEGINQI + L SLSE
Sbjct: 118 GTFMYENAVDVPSSIDWRNKGAVTGVKDQGQCGSCWAFSTIAAVEGINQIKTQKLVSLSE 177
Query: 183 QELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTIS 242
Q+L+DCDT N GCNGGLM+YAF++I G+ E +YPY ++GTC+ +KE+ + V+I
Sbjct: 178 QQLVDCDTEENEGCNGGLMEYAFEFI-KQNGITTESNYPYAAKDGTCDVEKED-KAVSID 235
Query: 243 GYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGK 302
G+++VP N+E +LLKA A QPVSVAI+A G +FQFYS GVFTG C +L+HGVA VGYG
Sbjct: 236 GHENVPINNEAALLKAAAKQPVSVAIDAGGYNFQFYSEGVFTGHCDTDLNHGVAIVGYGV 295
Query: 303 SKG-SDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLKK 350
++ + Y I+KNSWG +WGE+GYIRM+R EGLCGI AS P+KK
Sbjct: 296 TQDRTKYWIMKNSWGSEWGEQGYIRMQRGISSREGLCGIAMEASYPIKK 344
>gi|302143380|emb|CBI21941.3| unnamed protein product [Vitis vinifera]
Length = 354
Score = 324 bits (831), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 163/321 (50%), Positives = 213/321 (66%), Gaps = 5/321 (1%)
Query: 33 YSPEHL---TSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSY 89
+S EH + M + + +E W+ +HG+ YK +E F I++ N++ I+ N + S+
Sbjct: 23 FSEEHEPMESEMSDMEKRYERWLVQHGRRYKNRDEWQRHFGIYQSNVRFINYINAQNFSF 82
Query: 90 WLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVK 149
L N+FADM++EE+K Y+GL +R+ S+ F K LP SVDWRK GAVTPV+
Sbjct: 83 TLTDNQFADMTNEEYKALYMGLGTSETSRKNQSS-FKRERSKVLPISVDWRKMGAVTPVR 141
Query: 150 NQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDT-SFNNGCNGGLMDYAFKYI 208
NQG CGSCWAFSTVAAVEGIN+I +G L SLSEQEL+DCD S N GCNGG M AFK+I
Sbjct: 142 NQGECGSCWAFSTVAAVEGINKIRTGKLVSLSEQELLDCDIDSGNEGCNGGYMVNAFKFI 201
Query: 209 VASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAI 268
+GG+ +YPY+ E+G C K VV ISGY+ VP N+E+ L A+A QPVSVAI
Sbjct: 202 KQNGGITTARNYPYIGEQGICNKDKAANHVVKISGYETVPPNNEKILQAAVAKQPVSVAI 261
Query: 269 EASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMK 328
+A G +FQ YS G+F G CG +L+H V +GYG+ G Y +VKNSWG WGE GY RM
Sbjct: 262 DAGGYEFQLYSKGIFNGFCGKQLNHAVTVIGYGEDNGKKYWLVKNSWGTGWGEAGYARMI 321
Query: 329 RNTGKPEGLCGINKMASIPLK 349
R++ EG+CGI AS P+K
Sbjct: 322 RDSRDDEGICGIAMEASYPIK 342
>gi|558563|emb|CAA57538.1| cysteine proteinase [Cicer arietinum]
Length = 325
Score = 324 bits (831), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 160/310 (51%), Positives = 207/310 (66%), Gaps = 7/310 (2%)
Query: 45 IELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEF 104
+ ++E W+ KH K Y + EK RF+IFK+NL+ ID+ N + SY +GLN+FAD+++EE+
Sbjct: 1 MTMYEKWLVKHQKMYNGLGEKDTRFQIFKDNLRFIDEHNAQNYSYKVGLNKFADINNEEY 60
Query: 105 KNKYLGLKPQFPTRRQPSA----EFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAF 160
++ YLG K R + +Y V K VDWR KGAVT +K+QGSCGSCWAF
Sbjct: 61 RDMYLGTKSDAKRRVMKTKITGHRITYNSVIVTVK-VDWRLKGAVTHIKDQGSCGSCWAF 119
Query: 161 STVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDY 220
ST+A VE IN+IV+G SLSEQEL+DCD +FN GCNGGLMDYAF++I+ +GG+ ++DY
Sbjct: 120 STIATVEAINKIVTGKFVSLSEQELVDCDRAFNEGCNGGLMDYAFEFIIRNGGIDTDQDY 179
Query: 221 PYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSG 280
PY E C+ K+ +VV+I GY+DVP +L KA+AHQPVSVAI G Q Y
Sbjct: 180 PYNGFERKCDPTKKNAKVVSIDGYEDVPSY-MNALKKAVAHQPVSVAIAGLGRALQLYQS 238
Query: 281 GVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRM-KRNTGKPEGLCG 339
GVFTG CG +LDHGV VGYG G DY +V+NSWG WGE GY ++ RN CG
Sbjct: 239 GVFTGKCGTDLDHGVVVVGYGSENGVDYWLVRNSWGTNWGEDGYFKIASRNVKSLYRKCG 298
Query: 340 INKMASIPLK 349
I AS P+K
Sbjct: 299 IAMEASYPVK 308
>gi|357113934|ref|XP_003558756.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like
[Brachypodium distachyon]
Length = 346
Score = 324 bits (831), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 162/347 (46%), Positives = 220/347 (63%), Gaps = 14/347 (4%)
Query: 7 SKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKL 66
+ LLL+++ L CS+ +G + + + E WM++ G+ YK EK
Sbjct: 6 ANLLLVAIVGCLCLCSTAVLAARELGDADNAMAAR------HEQWMAQFGRVYKDPAEKA 59
Query: 67 HRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYL--GLKPQFPTRRQPSAE 124
HR E+FK N+ I+ N E +WLG N+FAD++++EF+ G+K Q R P+
Sbjct: 60 HRLEVFKANVAFIESFNAENHEFWLGANQFADLTNDEFRASKTNKGIK-QGGVRDAPTG- 117
Query: 125 FSYRDVK--ALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSE 182
F Y DV ALP SVDWR KGAVTP+KNQG CGSCWAFS VAA EG+ ++ +G L SLSE
Sbjct: 118 FKYSDVSIDALPASVDWRTKGAVTPIKNQGQCGSCWAFSAVAATEGVVKLSTGKLVSLSE 177
Query: 183 QELIDCDT-SFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTI 241
QEL+DCD + GC GG MD AFK+I+ +GGL E +YPY E+ C+ + TI
Sbjct: 178 QELVDCDVHGVDQGCMGGWMDDAFKFIIKNGGLTTEANYPYTGEDDKCKSNETVNVAATI 237
Query: 242 SGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYG 301
GY+DVP NDE +L+KA+AHQPVSV ++ FQ Y+GGV TG CG E+DHG+AA+GYG
Sbjct: 238 KGYEDVPANDESALMKAVAHQPVSVVVDGGDMTFQLYAGGVMTGSCGVEMDHGIAAIGYG 297
Query: 302 -KSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
S G+ Y ++KNSWG WGE+G++RM ++ G+CG+ S P
Sbjct: 298 ATSNGTKYWLMKNSWGTTWGEKGFLRMAKDIPDKRGMCGLAMKPSYP 344
>gi|297809383|ref|XP_002872575.1| hypothetical protein ARALYDRAFT_911472 [Arabidopsis lyrata subsp.
lyrata]
gi|297318412|gb|EFH48834.1| hypothetical protein ARALYDRAFT_911472 [Arabidopsis lyrata subsp.
lyrata]
Length = 371
Score = 324 bits (831), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 167/358 (46%), Positives = 224/358 (62%), Gaps = 16/358 (4%)
Query: 7 SKLLLLSLSLSLFACSSLAHDFSIVGYSPEH--LTSMDKLIE--------LFESWMSKHG 56
S L+L +++ + +C++ A D S+V + H TS +L +F+SWM KHG
Sbjct: 6 SATLILLVAMVITSCAT-AMDMSVVSSNNNHHLTTSPGRLHSGFDAEASLIFDSWMVKHG 64
Query: 57 KTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFP 116
K Y + EK R IF++NL+ I RN E SY LGL +FAD+S E+ G P+ P
Sbjct: 65 KVYGSVAEKERRLTIFEDNLRFISNRNAENLSYRLGLTQFADLSLHEYGEVCHGADPRPP 124
Query: 117 TRR---QPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIV 173
S + LPKSVDWR +GAVT VK+QG C SCWAFSTV AVEG+N+IV
Sbjct: 125 RNHVFMTSSDRYKTSAGDVLPKSVDWRNEGAVTEVKDQGHCRSCWAFSTVGAVEGLNKIV 184
Query: 174 SGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDK- 232
+G L +LSEQ+LI+C+ NNGC GG ++ A+++I+ +GGL + DYPY G C+ +
Sbjct: 185 TGELVTLSEQDLINCNKE-NNGCGGGKVETAYEFIMKNGGLGTDNDYPYKAVNGVCDGRL 243
Query: 233 KEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELD 292
KE + V I G++++P NDE +L+KA+AHQPV+ I++S +FQ Y GVF G CG L+
Sbjct: 244 KENNKNVMIDGFENLPANDEFALMKAVAHQPVTAVIDSSSREFQLYESGVFDGSCGTNLN 303
Query: 293 HGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLKK 350
HGV VGYG G DY +VKNS G WGE GY++M RN P GLCGI AS PLK
Sbjct: 304 HGVVVVGYGTENGRDYWLVKNSRGNTWGEAGYMKMARNIANPRGLCGIAMRASYPLKN 361
>gi|1514953|dbj|BAA11170.1| cysteine proteinase [Oryza sativa (japonica cultivar-group)]
Length = 368
Score = 324 bits (830), Expect = 4e-86, Method: Compositional matrix adjust.
Identities = 167/326 (51%), Positives = 211/326 (64%), Gaps = 7/326 (2%)
Query: 29 SIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTS 88
+ + + L S + L +L+E W H + EK RF FK+N+++I + NK
Sbjct: 27 AAIPFDERDLESDEALWDLYERWQEHH-HVPRHHGEKHRRFGAFKDNVRYIHEHNKRAPG 85
Query: 89 YWLGLNEFADMSHEEFKNKYLGLKPQFPTR----RQPSAEFSYRDVKALPKSVDWRKKGA 144
Y LN F DM EEF+ + G R P F Y V+ LP++VDWR+KGA
Sbjct: 86 Y-APLNRFGDMGREEFRATFAGSHANDLRRDGLAAPPLPGFMYEGVRDLPRAVDWRRKGA 144
Query: 145 VTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYA 204
VT VK+QG CGSCWAFSTV +VEGIN I +G L SLSEQELIDCDT+ N+GC GGLM+ A
Sbjct: 145 VTGVKDQGKCGSCWAFSTVVSVEGINAIRTGRLVSLSEQELIDCDTADNSGCQGGLMENA 204
Query: 205 FKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPV 264
F+YI SGG+ E YPY GTC+ + +V I G+Q+VP N E +L KA+A+QPV
Sbjct: 205 FEYIKHSGGITTESAYPYRAANGTCDAVRARGGLVVIDGHQNVPANSEAALAKAVANQPV 264
Query: 265 SVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSK-GSDYIIVKNSWGPKWGERG 323
SVAI+A FQFYS GVF G CG +LDHGVA VGYG++ G++Y IVKNSWG WGE G
Sbjct: 265 SVAIDAGDQSFQFYSDGVFAGDCGTDLDHGVAVVGYGETNDGTEYWIVKNSWGTAWGEGG 324
Query: 324 YIRMKRNTGKPEGLCGINKMASIPLK 349
YIRM+R++G GLCGI AS P+K
Sbjct: 325 YIRMQRDSGYDGGLCGIAMEASYPVK 350
>gi|356515050|ref|XP_003526214.1| PREDICTED: vignain-like [Glycine max]
Length = 344
Score = 324 bits (830), Expect = 4e-86, Method: Compositional matrix adjust.
Identities = 177/353 (50%), Positives = 230/353 (65%), Gaps = 17/353 (4%)
Query: 1 MAFFSHSKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYK 60
MAF + +L +L LF LA S V H T+ L E E+WM+++GK YK
Sbjct: 1 MAFTGQKQHML---ALFLF----LAVGISQVMPRKLHQTA---LRERHENWMAEYGKIYK 50
Query: 61 CIEEKLHRFEIFKENLKHIDQRNKEVTS-YWLGLNEFADMSHEEFKNKYLGLKP--QFPT 117
EK RF+IFK+N++ I+ N Y LG+N AD++ EEFK+ GLK +F T
Sbjct: 51 DAAEKEKRFQIFKDNVEFIESFNAAGNKPYKLGVNHLADLTLEEFKDSRNGLKRTYEFST 110
Query: 118 RRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQG-SCGSCWAFSTVAAVEGINQIVSGN 176
F Y +V +P+++DWR KGAVTP+K+QG CGSCWAFSTVAA EGI QI +G
Sbjct: 111 TTFKLNGFKYENVTDIPEAIDWRVKGAVTPIKDQGDQCGSCWAFSTVAATEGIYQISTGM 170
Query: 177 LTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEM 236
L SLSEQEL+DCD S ++GC+GGLM+ F++I+ +GG+ E +YPY +GTC+ KE
Sbjct: 171 LMSLSEQELVDCD-SVDHGCDGGLMEDGFEFIIKNGGISSEANYPYTAVDGTCDASKEAS 229
Query: 237 EVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVA 296
I GY+ VP N E++L +A+A+QPVSV+I+A G+ FQFYS GVFTG CG +LDHGV
Sbjct: 230 PAAQIKGYETVPANSEEALQQAVANQPVSVSIDAGGSGFQFYSSGVFTGQCGTQLDHGVT 289
Query: 297 AVGYGKSKGS--DYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
VGYG + +Y IVKNSWG +WGE GYIRM+R EGLCGI AS P
Sbjct: 290 VVGYGTTDDGTHEYWIVKNSWGTQWGEEGYIRMQRGIDALEGLCGIAMDASYP 342
>gi|242072392|ref|XP_002446132.1| hypothetical protein SORBIDRAFT_06g002150 [Sorghum bicolor]
gi|241937315|gb|EES10460.1| hypothetical protein SORBIDRAFT_06g002150 [Sorghum bicolor]
Length = 337
Score = 324 bits (830), Expect = 5e-86, Method: Compositional matrix adjust.
Identities = 164/338 (48%), Positives = 224/338 (66%), Gaps = 17/338 (5%)
Query: 16 LSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKEN 75
L++ C+SL + L+ ++E E+WM ++G+ YK EK RFE FK N
Sbjct: 9 LAILGCASLCSSV----LAARELSDA-AMVERHENWMVEYGRVYKDAAEKARRFEAFKHN 63
Query: 76 LKHIDQRN-KEVTSYWLGLNEFADMSHEEFK-NKYLGLKPQFPTRRQPSAEFSYRD--VK 131
+ ++ N + +WLG+N+FAD++ EEFK NK G KP + P+ F Y + V
Sbjct: 64 VAFVESFNTNKKNKFWLGVNQFADLTTEEFKANK--GFKPT--AEKVPTTGFKYENLSVS 119
Query: 132 ALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDT- 190
ALP +VDWR KGAVTP+KNQG CG CWAFS VAA+EGI ++ +GNL SLSEQEL+DCDT
Sbjct: 120 ALPTAVDWRTKGAVTPIKNQGQCGCCWAFSAVAAMEGIVKLSTGNLISLSEQELVDCDTH 179
Query: 191 SFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPEN 250
S + GC GG MD AF++++ +GGL E +YPY +G C K TI G++DVP N
Sbjct: 180 SMDEGCEGGWMDSAFEFVIKNGGLATESNYPYKAVDGKC--KGGSKSAATIKGHEDVPVN 237
Query: 251 DEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYG-KSKGSDYI 309
+E +L+KA+A+QPVSVA++AS F YSGGV TG CG ELDHG+AA+GYG +S G+ Y
Sbjct: 238 NEAALMKAVANQPVSVAVDASDRTFMLYSGGVMTGSCGTELDHGIAAIGYGMESDGTKYW 297
Query: 310 IVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
I+KNSWG WGE+G++RM+++ G+CG+ S P
Sbjct: 298 ILKNSWGTTWGEKGFLRMEKDITDKRGMCGLAMKPSYP 335
>gi|356543124|ref|XP_003540013.1| PREDICTED: vignain-like [Glycine max]
gi|356543126|ref|XP_003540014.1| PREDICTED: vignain-like [Glycine max]
Length = 337
Score = 324 bits (830), Expect = 5e-86, Method: Compositional matrix adjust.
Identities = 171/342 (50%), Positives = 217/342 (63%), Gaps = 15/342 (4%)
Query: 8 KLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLH 67
K +L+L L L C+S ++ H SM E E WM K+GK YK EK
Sbjct: 7 KQHILALVLLLSICTSQVMSRNL------HEASMS---ERHEQWMKKYGKVYKDAAEKQK 57
Query: 68 RFEIFKENLKHIDQRNKEVTS-YWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFS 126
R IFK+N++ I+ N Y L +N AD ++EEF + G K + + P F
Sbjct: 58 RLLIFKDNVEFIESFNAAGNRPYKLSINHLADQTNEEFVASHNGYKHKGSHSQTP---FK 114
Query: 127 YRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELI 186
Y +V +P +VDWR+ GAVT VK+QG CGSCWAFSTVAA EGI QI + L SLSEQEL+
Sbjct: 115 YENVTGVPNAVDWRENGAVTAVKDQGQCGSCWAFSTVAATEGIYQITTSMLMSLSEQELV 174
Query: 187 DCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQD 246
DCD S ++GC+GG M+ F++I+ +GG+ E +YPY +GTC+ KE I GY+
Sbjct: 175 DCD-SVDHGCDGGYMEGGFEFIIKNGGISSEANYPYTAVDGTCDANKEASPAAQIKGYET 233
Query: 247 VPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKS-KG 305
VP N E +L KA+A+QPVSV I+A G+ FQFYS GVFTG CG +LDHGV AVGYG + G
Sbjct: 234 VPANSEDALQKAVANQPVSVTIDAGGSAFQFYSSGVFTGQCGTQLDHGVTAVGYGSTDDG 293
Query: 306 SDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
+ Y IVKNSWG +WGE GYIRM+R T EGLCGI AS P
Sbjct: 294 TQYWIVKNSWGTQWGEEGYIRMQRGTDAQEGLCGIAMDASYP 335
>gi|171702841|dbj|BAG16376.1| cysteine protease [Brassica rapa var. perviridis]
Length = 333
Score = 323 bits (828), Expect = 7e-86, Method: Compositional matrix adjust.
Identities = 156/297 (52%), Positives = 208/297 (70%), Gaps = 9/297 (3%)
Query: 51 WMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRN--KEVTSYWLGLNEFADMSHEEFKNKY 108
WM++HG+ Y EK +R+ +FK N++ I++ N + ++ L +N+FAD+++EEF++ Y
Sbjct: 35 WMTEHGRVYADANEKNNRYAVFKRNVERIERLNDVQSGLTFKLAVNQFADLTNEEFRSMY 94
Query: 109 LGLKPQ--FPTRRQPSAEFSYRDVK--ALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVA 164
G K +R +P++ F Y++V ALP SVDWRKKGAVTP+K+QG CGSCWAFS VA
Sbjct: 95 TGFKGNSVLSSRTKPTS-FRYQNVSSDALPVSVDWRKKGAVTPIKDQGLCGSCWAFSAVA 153
Query: 165 AVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLM 224
A+EG+ QI G L SLSEQEL+DCDT+ + GC GGLMD AF Y + GGL E +YPY
Sbjct: 154 AIEGVAQIKKGKLISLSEQELVDCDTN-DGGCMGGLMDTAFNYTITIGGLTSESNYPYKS 212
Query: 225 EEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFT 284
GTC K + +I G++DVP NDE++L+KA+AH PVS+ I FQFYS GVF+
Sbjct: 213 TNGTCNFNKTKQIATSIKGFEDVPANDEKALMKAVAHHPVSIGIAGGDIGFQFYSSGVFS 272
Query: 285 GPCGAELDHGVAAVGYGKSK-GSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGI 340
G C LDHGV AVGYG+SK G Y I+KNSWGPKWGERGY+R+K++ G CG+
Sbjct: 273 GECTTHLDHGVTAVGYGRSKNGLKYWILKNSWGPKWGERGYMRIKKDIKPKHGQCGL 329
>gi|4426617|gb|AAD20453.1| cysteine endopeptidase precursor [Oryza sativa]
Length = 368
Score = 323 bits (828), Expect = 7e-86, Method: Compositional matrix adjust.
Identities = 167/326 (51%), Positives = 211/326 (64%), Gaps = 7/326 (2%)
Query: 29 SIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTS 88
+ + + L S + L +L+E W H + EK RF FK+N+++I + NK
Sbjct: 27 AAIPFDERDLESDEALWDLYERWQEHH-HVPRHHGEKHRRFGAFKDNVRYIHEHNKRAPG 85
Query: 89 YWLGLNEFADMSHEEFKNKYLGLKPQFPTR----RQPSAEFSYRDVKALPKSVDWRKKGA 144
Y LN F DM EEF+ + G R P F Y V+ LP++VDWR+KGA
Sbjct: 86 Y-PPLNRFGDMGREEFRATFAGSHANDLRRDGLAAPPLPGFMYEGVRDLPRAVDWRRKGA 144
Query: 145 VTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYA 204
VT VK+QG CGSCWAFSTV +VEGIN I +G L SLSEQELIDCDT+ N+GC GGLM+ A
Sbjct: 145 VTGVKDQGKCGSCWAFSTVVSVEGINAIRTGRLVSLSEQELIDCDTADNSGCQGGLMENA 204
Query: 205 FKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPV 264
F+YI SGG+ E YPY GTC+ + +V I G+Q+VP N E +L KA+A+QPV
Sbjct: 205 FEYIKHSGGITTESAYPYRAANGTCDAVRARGGLVVIDGHQNVPANSEAALAKAVANQPV 264
Query: 265 SVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSK-GSDYIIVKNSWGPKWGERG 323
SVAI+A FQFYS GVF G CG +LDHGVA VGYG++ G++Y IVKNSWG WGE G
Sbjct: 265 SVAIDAGDQSFQFYSDGVFAGDCGTDLDHGVAVVGYGETNDGTEYWIVKNSWGTAWGEGG 324
Query: 324 YIRMKRNTGKPEGLCGINKMASIPLK 349
YIRM+R++G GLCGI AS P+K
Sbjct: 325 YIRMQRDSGYDGGLCGIAMEASYPVK 350
>gi|351629615|gb|AEQ54771.1| KDDL-tailed cysteine proteinase CP4 [Coffea canephora]
Length = 359
Score = 323 bits (828), Expect = 8e-86, Method: Compositional matrix adjust.
Identities = 171/347 (49%), Positives = 225/347 (64%), Gaps = 12/347 (3%)
Query: 8 KLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLH 67
K L ++ L++ ++++ + + L S + L +L+E W S H + + EK
Sbjct: 5 KAFLFAVVLAVILVAAMSMEIT-----ERDLASEESLWDLYERWRSHH-TVSRDLSEKRK 58
Query: 68 RFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAE--F 125
RF +FK N+ HI + N++ Y L LN FADM++ EF+ Y + A F
Sbjct: 59 RFNVFKANVHHIHKVNQKDKPYKLKLNSFADMTNHEFREFYSSKVKHYRMLHGSRANTGF 118
Query: 126 SYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQEL 185
+ ++LP SVDWRK+GAVT VKNQG CGSCWAFSTV VEGIN+I +G L SLSEQEL
Sbjct: 119 MHGKTESLPASVDWRKQGAVTGVKNQGKCGSCWAFSTVVGVEGINKIKTGQLVSLSEQEL 178
Query: 186 IDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQ 245
+DC+T N GCNGGLM+ A+++I SGG+ E YPY +G+C+ K VTI G++
Sbjct: 179 VDCETD-NEGCNGGLMENAYEFIKKSGGITTERLYPYKARDGSCDSSKMNAPAVTIDGHE 237
Query: 246 DVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTG-PCGAELDHGVAAVGYGKS- 303
VP NDE +L+KA+A+QPVSVAI+ASG+D QFYS GV+ G CG ELDHGVA VGYG +
Sbjct: 238 MVPANDENALMKAVANQPVSVAIDASGSDMQFYSEGVYAGDSCGNELDHGVAVVGYGTAL 297
Query: 304 KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPE-GLCGINKMASIPLK 349
G+ Y IVKNSWG WGE+GYIRM+R E G+CGI AS PLK
Sbjct: 298 DGTKYWIVKNSWGTGWGEQGYIRMQRGVDAAEGGVCGIAMEASYPLK 344
>gi|146215986|gb|ABQ10195.1| actinidin Act2d [Actinidia eriantha]
Length = 381
Score = 323 bits (828), Expect = 8e-86, Method: Compositional matrix adjust.
Identities = 163/342 (47%), Positives = 223/342 (65%), Gaps = 13/342 (3%)
Query: 14 LSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFK 73
+S+SL S+L S + + D+++ ++ESW+ + GK+Y ++EK RFEIFK
Sbjct: 10 ISMSLLFFSTLLILSSALDIKNSVQRTNDQVMAMYESWLVEQGKSYNSLDEKEMRFEIFK 69
Query: 74 ENLKHIDQRNKEVT-SYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVK- 131
ENL+ ID N + SY LGLN FAD++ EE+++ YLG K P A+ S R V
Sbjct: 70 ENLRIIDDHNADANRSYSLGLNRFADLTDEEYRSTYLGFKSG------PKAKVSNRYVPK 123
Query: 132 ---ALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDC 188
LP VDWR GAV VK+QG C SCWAFS VAAVEGIN+IV+GNL SLSEQEL+DC
Sbjct: 124 VGVVLPNYVDWRTVGAVVGVKDQGLCSSCWAFSAVAAVEGINKIVTGNLISLSEQELVDC 183
Query: 189 -DTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDV 247
T GCN G M+ AF++I+ +GG++ E++YPY ++G C+ ++ VTI Y+ +
Sbjct: 184 GRTQRTRGCNRGYMNDAFQFIIDNGGINTEDNYPYTAQDGQCDWYRKNQRYVTIDNYEQL 243
Query: 248 PENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSD 307
P N+E L A+A+QP++V +E+ G F+ Y+ G++TG CG +DHGV VGYG +G D
Sbjct: 244 PANNEWVLQNAVAYQPITVGLESEGGKFKLYTSGIYTGYCGTAIDHGVTIVGYGTERGLD 303
Query: 308 YIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
Y IVKNSWG WGE GYIR++RN G G CGI + S P+K
Sbjct: 304 YWIVKNSWGTNWGENGYIRIQRNIGGA-GKCGIAMVPSYPVK 344
>gi|357452075|ref|XP_003596314.1| Cysteine proteinase [Medicago truncatula]
gi|355485362|gb|AES66565.1| Cysteine proteinase [Medicago truncatula]
Length = 341
Score = 323 bits (827), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 172/313 (54%), Positives = 217/313 (69%), Gaps = 13/313 (4%)
Query: 42 DKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT-SYWLGLNEFADMS 100
D + E+ E WM +HGK YK EK RF IFKEN+ +I+ N SY LGLN FAD++
Sbjct: 33 DPMYEMHEQWMVQHGKVYKAAHEKQKRFGIFKENVNYIEAFNNVGNKSYKLGLNHFADLT 92
Query: 101 HEEF---KNKYLG-LKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGS 156
+ EF +NK+ G L T F Y++V +P +VDWR++GAVTPVKNQG CG
Sbjct: 93 NHEFIAARNKFNGYLHGSIITT------FKYKNVSDVPSAVDWRQEGAVTPVKNQGQCGC 146
Query: 157 CWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLH 215
CWAFS VA+ EGI+++ +GNL SLSEQEL+DCDT+ + GC GGLMD AF++I+ + GL
Sbjct: 147 CWAFSAVASTEGIHKLTTGNLVSLSEQELVDCDTNGEDQGCEGGLMDDAFEFIIQNNGLS 206
Query: 216 KEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDF 275
E +YPY +GTC + TISGY++VP NDEQ+L KA+A+QPVSVAI+ASG+DF
Sbjct: 207 TEAEYPYQGVDGTCNKTEVGSSAATISGYENVPVNDEQALQKAVANQPVSVAIDASGSDF 266
Query: 276 QFYSGGVFTGPCGAELDHGVAAVGYGKSKG-SDYIIVKNSWGPKWGERGYIRMKRNTGKP 334
QFY GVFTG CG ELDHGVA VGYG + ++Y +VKNSWG +WGE GYIRM+R
Sbjct: 267 QFYKSGVFTGSCGTELDHGVAVVGYGVGEDETEYWLVKNSWGTQWGEEGYIRMQRGVDAS 326
Query: 335 EGLCGINKMASIP 347
EGLCGI S P
Sbjct: 327 EGLCGIAMQPSYP 339
>gi|18408616|ref|NP_566901.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|75313880|sp|Q9STL5.1|CEP3_ARATH RecName: Full=KDEL-tailed cysteine endopeptidase CEP3; Flags:
Precursor
gi|4678353|emb|CAB41163.1| cysteine endopeptidase precursor-like protein [Arabidopsis
thaliana]
gi|26453052|dbj|BAC43602.1| putative cysteine endopeptidase precursor [Arabidopsis thaliana]
gi|332644885|gb|AEE78406.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 364
Score = 323 bits (827), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 171/347 (49%), Positives = 226/347 (65%), Gaps = 14/347 (4%)
Query: 9 LLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHR 68
++L+S LSL S DF + L + + + +L+E W H + + E + R
Sbjct: 6 IVLISF-LSLLQASK-GFDFD-----EKELETEENVWKLYERWRGHHSVS-RASHEAIKR 57
Query: 69 FEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLG--LKPQFPTR--RQPSAE 124
F +F+ N+ H+ + NK+ Y L +N FAD++H EF++ Y G +K R ++ S
Sbjct: 58 FNVFRHNVLHVHRTNKKNKPYKLKINRFADITHHEFRSSYAGSNVKHHRMLRGPKRGSGG 117
Query: 125 FSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQE 184
F Y +V +P SVDWR+KGAVT VKNQ CGSCWAFSTVAAVEGIN+I + L SLSEQE
Sbjct: 118 FMYENVTRVPSSVDWREKGAVTEVKNQQDCGSCWAFSTVAAVEGINKIRTNKLVSLSEQE 177
Query: 185 LIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGT-CEDKKEEMEVVTISG 243
L+DCDT N GC GGLM+ AF++I +GG+ EE YPY + C E VTI G
Sbjct: 178 LVDCDTEENQGCAGGLMEPAFEFIKNNGGIKTEETYPYDSSDVQFCRANSIGGETVTIDG 237
Query: 244 YQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKS 303
++ VPENDE+ LLKA+AHQPVSVAI+A +DFQ YS GVF G CG +L+HGV VGYG++
Sbjct: 238 HEHVPENDEEELLKAVAHQPVSVAIDAGSSDFQLYSEGVFIGECGTQLNHGVVIVGYGET 297
Query: 304 K-GSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
K G+ Y IV+NSWGP+WGE GY+R++R + EG CGI AS P K
Sbjct: 298 KNGTKYWIVRNSWGPEWGEGGYVRIERGISENEGRCGIAMEASYPTK 344
>gi|310656789|gb|ADP02218.1| Peptidase_C1 domain-containing protein [Triticum aestivum]
Length = 341
Score = 322 bits (826), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 157/343 (45%), Positives = 215/343 (62%), Gaps = 11/343 (3%)
Query: 8 KLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLH 67
K LLL++ + CSS +G + ++E E WM+K + YK EK
Sbjct: 5 KALLLAIVGCICLCSSAVLSARELGDTA--------MVERHEQWMAKFNRVYKDGTEKAQ 56
Query: 68 RFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSA-EFS 126
RFE+FK N+ I+ N E +WLG+N+F D++++EF+ + R P+ ++S
Sbjct: 57 RFEVFKANVAFIESFNAENRKFWLGVNQFTDLTNDEFRATKTNKGLKMSGGRAPTGFKYS 116
Query: 127 YRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELI 186
+ ALP +VDWR KG VTP+K+QG CG CWAFS V A EGI ++ +G L SLSEQEL+
Sbjct: 117 NVSIDALPTAVDWRTKGVVTPIKDQGQCGCCWAFSAVVATEGIVKLSTGKLISLSEQELV 176
Query: 187 DCDT-SFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQ 245
DCD + GC GG MD AFK+I+ +GGL E +YPY ++G C+ V TI GY+
Sbjct: 177 DCDVHGVDQGCEGGEMDDAFKFIIKNGGLTTEANYPYTAQDGQCKTSIASNSVATIKGYE 236
Query: 246 DVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYG-KSK 304
DVP NDE SL+KA+A+QPVSVA++ FQ YSGGV TG CG +LDHG+AA+GYG S
Sbjct: 237 DVPANDESSLMKAVANQPVSVAVDGGDVIFQHYSGGVMTGSCGTDLDHGIAAIGYGMTSD 296
Query: 305 GSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
G+ Y ++KNSWG WGE GY+RM+++ G+CG+ S P
Sbjct: 297 GTKYWLLKNSWGTTWGESGYLRMEKDISDKSGMCGLAMQPSYP 339
>gi|357160591|ref|XP_003578813.1| PREDICTED: vignain-like [Brachypodium distachyon]
Length = 339
Score = 322 bits (826), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 159/324 (49%), Positives = 213/324 (65%), Gaps = 7/324 (2%)
Query: 28 FSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT 87
F G + L ++ ESWMS++G++YK EK +FE+FK N ID N +
Sbjct: 17 FFASGLAARELNDDLSMVARHESWMSQYGRSYKDAAEKDRKFEVFKANAAFIDSFNAKNH 76
Query: 88 SYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVK--ALPKSVDWRKKGAV 145
+WLG+N+FAD+++EEFK K + + S FSY +V ALP ++DWR KGAV
Sbjct: 77 KFWLGINQFADITNEEFKVTKTN-KGFISNKVRASTGFSYENVSIDALPATIDWRTKGAV 135
Query: 146 TPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDT-SFNNGCNGGLMDYA 204
TPVK+QG CG CWAFS VAA EGI ++ +G L SLSEQEL+DCD + GC GGLMD A
Sbjct: 136 TPVKDQGQCGCCWAFSAVAATEGIVKLSTGKLVSLSEQELVDCDVHGEDQGCEGGLMDDA 195
Query: 205 FKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPV 264
FK+I+ +GGL +E YPY E+G C K TI Y+DVP N+E +L+KA+A+QPV
Sbjct: 196 FKFIITNGGLTQESSYPYDAEDGKC--KSGSKSAGTIKSYEDVPANNEGALMKAVANQPV 253
Query: 265 SVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYG-KSKGSDYIIVKNSWGPKWGERG 323
SVA++ FQFYSGGV TG CG +LDHG+AA+GYG S G+ Y ++KNSWG WGE G
Sbjct: 254 SVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYGVTSDGTKYWLMKNSWGTSWGENG 313
Query: 324 YIRMKRNTGKPEGLCGINKMASIP 347
++RM+++ +G+CG+ S P
Sbjct: 314 FLRMEKDIADKKGMCGLAMEPSYP 337
>gi|414588010|tpg|DAA38581.1| TPA: hypothetical protein ZEAMMB73_156486 [Zea mays]
Length = 347
Score = 322 bits (826), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 166/351 (47%), Positives = 225/351 (64%), Gaps = 24/351 (6%)
Query: 9 LLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHR 68
LLL L + CS+ +G E ++ E WM +HG+ YK +K HR
Sbjct: 7 LLLAILGCGVCLCSAAVLAARELGGDDEL-----AMVARHEQWMVQHGRVYKDETDKAHR 61
Query: 69 FEIFKENLKHIDQRNKEVTS----YWLGLNEFADMSHEEFK----NKYLGLKPQFPTRRQ 120
F +FK N+K I+ N + +WLG+N+FAD++++EF+ NK G P +
Sbjct: 62 FLVFKANVKFIESFNAAAAAGNRKFWLGVNQFADLTNDEFRATKTNK--GFNPN--VVKV 117
Query: 121 PSAEFSYRD--VKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLT 178
P+ F Y++ + ALP++VDWR KGAVTP+K+QG CG CWAFS VAA EGI +I +G LT
Sbjct: 118 PTG-FRYQNLSIDALPQTVDWRTKGAVTPIKDQGQCGCCWAFSAVAATEGIVKISTGKLT 176
Query: 179 SLSEQELIDCDT-SFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEME 237
SLSEQEL+DCD + GCNGG MD AFK+I+ +GGL E +YPY ++G C K
Sbjct: 177 SLSEQELVDCDVHGEDQGCNGGEMDDAFKFIIKNGGLTTESNYPYTAQDGQC--KSGSNG 234
Query: 238 VVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAA 297
TI GY+DVP NDE +L+KA+A QPVSVA++ FQFYSGGV TG CG +LDHG+AA
Sbjct: 235 AATIKGYEDVPANDEAALMKAVASQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAA 294
Query: 298 VGYGK-SKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
+GYGK S G+ Y ++KNSWG WGE G++RM+++ +G+CG+ S P
Sbjct: 295 IGYGKTSDGTKYWLMKNSWGTTWGENGFLRMEKDIADKKGMCGLAMQPSYP 345
>gi|125552927|gb|EAY98636.1| hypothetical protein OsI_20560 [Oryza sativa Indica Group]
Length = 449
Score = 322 bits (826), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 156/301 (51%), Positives = 198/301 (65%), Gaps = 2/301 (0%)
Query: 48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNK 107
FE+W ++HG++Y E+ R F +N + N SY L LN FAD++H+EF+
Sbjct: 38 FEAWCAEHGRSYATPGERAARLAAFADNAAFVAAHNGAPASYALALNAFADLTHDEFRAA 97
Query: 108 YLGLKPQFPTRRQPSAEFSYRD--VKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAA 165
LG R A + D V A+P +VDWR+ GAVT VK+QGSCG+CW+FS A
Sbjct: 98 RLGRLAAAGPGRDGGAPYLGVDGGVGAVPDAVDWRQSGAVTKVKDQGSCGACWSFSATGA 157
Query: 166 VEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLME 225
+EGIN+I +G+L SLSEQELIDCD S+N+GC GGLMDYA+K++V +GG+ E DYPY
Sbjct: 158 MEGINKIKTGSLISLSEQELIDCDRSYNSGCGGGLMDYAYKFVVKNGGIDTEADYPYRET 217
Query: 226 EGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTG 285
+GTC K + VVTI GY+DVP N+E LL+A+A QPVSV I S FQ YS G+F G
Sbjct: 218 DGTCNKNKLKRRVVTIDGYKDVPANNEDMLLQAVAQQPVSVGICGSARAFQLYSKGIFDG 277
Query: 286 PCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMAS 345
PC LDH + VGYG G DY IVKNSWG WG +GY+ M RNTG G+CGIN+M S
Sbjct: 278 PCPTSLDHAILIVGYGSEGGKDYWIVKNSWGESWGMKGYMYMHRNTGNSNGVCGINQMPS 337
Query: 346 I 346
Sbjct: 338 F 338
>gi|359359068|gb|AEV40975.1| putative cysteine protease [Oryza punctata]
Length = 464
Score = 322 bits (825), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 176/342 (51%), Positives = 225/342 (65%), Gaps = 20/342 (5%)
Query: 28 FSIVGYSPEHLTSMDKLIE--------LFESWMSKH----GKTYKCIEEKLHRFEIFKEN 75
SI+ Y+ EH +++E +++ W+++H G + E RF +F +N
Sbjct: 38 MSIIRYNAEHGVRGLEVVERTEAEARAVYDLWVARHRHGGGSHNGFVGEYERRFRVFWDN 97
Query: 76 LKHIDQRNKEVTS---YWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKA 132
LK +D N + LG+N FAD++++EF+ YLG P R + + V+A
Sbjct: 98 LKFVDAHNAHADGHGGFRLGMNRFADLTNDEFRAAYLGTTPA-GRGRHVGEMYRHDGVEA 156
Query: 133 LPKSVDWRKKGAV-TPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDC-DT 190
LP SVDWR KGAV +PVKNQG CGSCWAFS VAAVEGIN+IV+G L SLSEQEL++C
Sbjct: 157 LPDSVDWRDKGAVVSPVKNQGQCGSCWAFSAVAAVEGINKIVTGELVSLSEQELVECARN 216
Query: 191 SFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPEN 250
N+GCNGG+MD AF +I +GGL EEDYPY +G C+ K+ +VV+I G++DVPEN
Sbjct: 217 GGNSGCNGGIMDDAFAFITRNGGLDTEEDYPYTAMDGKCDLAKKSRKVVSIDGFEDVPEN 276
Query: 251 DEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGK--SKGSDY 308
DE SL KA+AHQPVSVAI+A G +FQ Y GVFTG CG LDHGV AVGYG + G+DY
Sbjct: 277 DELSLQKAVAHQPVSVAIDAGGREFQLYDSGVFTGRCGTSLDHGVVAVGYGTDAATGTDY 336
Query: 309 IIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLKK 350
V+NSWGP WGE GYIRM+RN G CGI MAS P+KK
Sbjct: 337 WTVRNSWGPDWGENGYIRMERNVTARTGKCGIAMMASYPIKK 378
>gi|414587996|tpg|DAA38567.1| TPA: hypothetical protein ZEAMMB73_390779 [Zea mays]
Length = 343
Score = 322 bits (825), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 168/349 (48%), Positives = 228/349 (65%), Gaps = 17/349 (4%)
Query: 5 SHSKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEE 64
S + LLLL++ L+ ACS F + L+ + E E WM+ +G+ YK E
Sbjct: 4 SRAFLLLLAI-LTGCACS-----FPSPVLAARELSDDAAMAERHERWMAVYGRVYKDAAE 57
Query: 65 KLHRFEIFKENLKHIDQRNKEVTS-YWLGLNEFADMSHEEFK-NKYLGLKPQFPTRRQPS 122
K RFE+FK+NL ++ N + + +WLG+N+FAD++ EEFK NK G KP P+
Sbjct: 58 KARRFEVFKDNLAFVESFNADKKNKFWLGVNQFADLTTEEFKANK--GFKP-ISAEEVPT 114
Query: 123 AEFSYRD--VKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSL 180
F Y + V ALP +VDWR KGAVTP+KNQG CG CWAFS VAA+EGI ++ + NL SL
Sbjct: 115 TGFKYENLSVSALPTAVDWRTKGAVTPIKNQGQCGCCWAFSAVAAMEGIVKLSTDNLVSL 174
Query: 181 SEQELIDCDT-SFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVV 239
SEQEL+DCDT S + GC GG MD AF++++ +GGL E YPY +G C K
Sbjct: 175 SEQELVDCDTHSMDEGCEGGWMDSAFEFVIKNGGLATESSYPYKAVDGKC--KGGSKSAA 232
Query: 240 TISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVG 299
TI G++DVP N+E +L+KA+A QPVSVA++AS F YSGGV TG CG +LDHG+AA+G
Sbjct: 233 TIKGHEDVPPNNEAALMKAVASQPVSVAVDASDRTFMLYSGGVMTGSCGTQLDHGIAAIG 292
Query: 300 YG-KSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
YG +S G+ Y I+KNSWG WGE+ ++RM+++ +G+CG+ S P
Sbjct: 293 YGVESDGTKYWILKNSWGTTWGEKRFLRMEKDISDKQGMCGLAMKPSYP 341
>gi|115441717|ref|NP_001045138.1| Os01g0907600 [Oryza sativa Japonica Group]
gi|5761329|dbj|BAA83473.1| cysteine endopeptidase [Oryza sativa]
gi|20804884|dbj|BAB92565.1| cysteine endopeptidase [Oryza sativa Japonica Group]
gi|56785107|dbj|BAD82745.1| putative cysteine proteinase [Oryza sativa Japonica Group]
gi|113534669|dbj|BAF07052.1| Os01g0907600 [Oryza sativa Japonica Group]
gi|119395242|gb|ABL74582.1| cysteine endopeptidase [Oryza sativa Japonica Group]
gi|125528777|gb|EAY76891.1| hypothetical protein OsI_04850 [Oryza sativa Indica Group]
gi|125573036|gb|EAZ14551.1| hypothetical protein OsJ_04473 [Oryza sativa Japonica Group]
Length = 371
Score = 322 bits (824), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 168/328 (51%), Positives = 212/328 (64%), Gaps = 8/328 (2%)
Query: 29 SIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEV-T 87
+ + + L S + L +L+E W H + EK RF FK+N+++I + NK
Sbjct: 27 AAIPFDERDLESDEALWDLYERWQEHH-HVPRHHGEKHRRFGAFKDNVRYIHEHNKRGGR 85
Query: 88 SYWLGLNEFADMSHEEFKNKYLGLKPQFPTR----RQPSAEFSYRDVKALPKSVDWRKKG 143
Y L LN F DM EEF+ + G R P F Y V+ LP++VDWR+KG
Sbjct: 86 GYRLRLNRFGDMGREEFRATFAGSHANDLRRDGLAAPPLPGFMYEGVRDLPRAVDWRRKG 145
Query: 144 AVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDY 203
AVT VK+QG CGSCWAFSTV +VEGIN I +G L SLSEQELIDCDT+ N+GC GGLM+
Sbjct: 146 AVTGVKDQGKCGSCWAFSTVVSVEGINAIRTGRLVSLSEQELIDCDTADNSGCQGGLMEN 205
Query: 204 AFKYIVASGGLHKEEDYPYLMEEGTCED-KKEEMEVVTISGYQDVPENDEQSLLKALAHQ 262
AF+YI SGG+ E YPY GTC+ + +V I G+Q+VP N E +L KA+A+Q
Sbjct: 206 AFEYIKHSGGITTESAYPYRAANGTCDAVRARRAPLVVIDGHQNVPANSEAALAKAVANQ 265
Query: 263 PVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSK-GSDYIIVKNSWGPKWGE 321
PVSVAI+A FQFYS GVF G CG +LDHGVA VGYG++ G++Y IVKNSWG WGE
Sbjct: 266 PVSVAIDAGDQSFQFYSDGVFAGDCGTDLDHGVAVVGYGETNDGTEYWIVKNSWGTAWGE 325
Query: 322 RGYIRMKRNTGKPEGLCGINKMASIPLK 349
GYIRM+R++G GLCGI AS P+K
Sbjct: 326 GGYIRMQRDSGYDGGLCGIAMEASYPVK 353
>gi|226505708|ref|NP_001141813.1| uncharacterized protein LOC100273952 precursor [Zea mays]
gi|194706024|gb|ACF87096.1| unknown [Zea mays]
gi|413945958|gb|AFW78607.1| hypothetical protein ZEAMMB73_489507 [Zea mays]
Length = 460
Score = 322 bits (824), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 160/318 (50%), Positives = 200/318 (62%), Gaps = 16/318 (5%)
Query: 48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT--------------SYWLGL 93
F++W ++HGK Y EE+ R +F +N + N SY L L
Sbjct: 36 FDAWCAEHGKAYATPEERAARLAVFADNAAFVAAHNARAGANAAGGGGGGAAPPSYTLAL 95
Query: 94 NEFADMSHEEFKNKYLG-LKPQFPTR-RQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQ 151
N FAD++HEEF+ LG + P R R + A+P ++DWRK GAVT VK+Q
Sbjct: 96 NAFADLTHEEFRAARLGRIAPGAALRSRAAPVYWGLGGGAAVPDALDWRKSGAVTKVKDQ 155
Query: 152 GSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVAS 211
GSCG+CW+FS A+EGIN+I +G+L SLSEQELIDCD S+N+GC GGLMDYA+K+++ +
Sbjct: 156 GSCGACWSFSATGAMEGINKIKTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYKFVIKN 215
Query: 212 GGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEAS 271
GG+ EEDYPY +GTC K + VVTI GY DVP N E LL+A+A QPVSV I S
Sbjct: 216 GGIDTEEDYPYREADGTCNKNKLKKRVVTIDGYTDVPSNKEDLLLQAVAQQPVSVGICGS 275
Query: 272 GTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNT 331
FQ Y G+F GPC LDH V VGYG G DY IVKNSWG WG +GY+ M RNT
Sbjct: 276 ARAFQLYYQGIFDGPCPTSLDHAVLIVGYGSEGGKDYWIVKNSWGESWGMKGYMHMHRNT 335
Query: 332 GKPEGLCGINKMASIPLK 349
G +G+CGIN MAS P K
Sbjct: 336 GDSKGVCGINMMASFPTK 353
>gi|118124|sp|P25250.1|CYSP2_HORVU RecName: Full=Cysteine proteinase EP-B 2; Flags: Precursor
gi|1146118|gb|AAA85036.1| cysteine proteinase EPB2 precursor [Hordeum vulgare subsp. vulgare]
Length = 373
Score = 322 bits (824), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 172/320 (53%), Positives = 209/320 (65%), Gaps = 9/320 (2%)
Query: 38 LTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT-SYWLGLNEF 96
L S + L +L+E W S H + + EK RF FK N I NK Y L LN F
Sbjct: 36 LESEEALWDLYERWQSAH-RVRRHHAEKHRRFGTFKSNAHFIHSHNKRGDHPYRLHLNRF 94
Query: 97 ADMSHEEFKNKYLG-LKPQFPTRRQPSAEFSYR--DVKALPKSVDWRKKGAVTPVKNQGS 153
DM EF+ ++G L+ P++ F Y +V LP SVDWR+KGAVT VK+QG
Sbjct: 95 GDMDQAEFRATFVGDLRRDTPSKPPSVPGFMYAALNVSDLPPSVDWRQKGAVTGVKDQGK 154
Query: 154 CGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGG 213
CGSCWAFSTV +VEGIN I +G+L SLSEQELIDCDT+ N+GC GGLMD AF+YI +GG
Sbjct: 155 CGSCWAFSTVVSVEGINAIRTGSLVSLSEQELIDCDTADNDGCQGGLMDNAFEYIKNNGG 214
Query: 214 LHKEEDYPYLMEEGTCEDKKEEME---VVTISGYQDVPENDEQSLLKALAHQPVSVAIEA 270
L E YPY GTC + VV I G+QDVP N E+ L +A+A+QPVSVA+EA
Sbjct: 215 LITEAAYPYRAARGTCNVARAAQNSPVVVHIDGHQDVPANSEEDLARAVANQPVSVAVEA 274
Query: 271 SGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSK-GSDYIIVKNSWGPKWGERGYIRMKR 329
SG F FYS GVFTG CG ELDHGVA VGYG ++ G Y VKNSWGP WGE+GYIR+++
Sbjct: 275 SGKAFMFYSEGVFTGECGTELDHGVAVVGYGVAEDGKAYWTVKNSWGPSWGEQGYIRVEK 334
Query: 330 NTGKPEGLCGINKMASIPLK 349
++G GLCGI AS P+K
Sbjct: 335 DSGASGGLCGIAMEASYPVK 354
>gi|356515036|ref|XP_003526207.1| PREDICTED: thiol protease SEN102-like [Glycine max]
Length = 336
Score = 321 bits (823), Expect = 3e-85, Method: Compositional matrix adjust.
Identities = 162/315 (51%), Positives = 209/315 (66%), Gaps = 14/315 (4%)
Query: 37 HLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTS-YWLGLNE 95
H TSM E E WM+++GK YK EK RF+IFK+N++ I+ N + Y LG+N
Sbjct: 30 HETSMR---ERHEQWMTEYGKVYKDAAEKDKRFQIFKDNVEFIESFNADGNKPYKLGVNH 86
Query: 96 FADMSHEEFKNKYLGLKP--QFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGS 153
AD++ EEFK G K +F T F Y +V A+P ++DWR KGAVTP+K+QG
Sbjct: 87 LADLTVEEFKASRNGFKRPHEFST-----TTFKYENVTAIPAAIDWRTKGAVTPIKDQGQ 141
Query: 154 CGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDT-SFNNGCNGGLMDYAFKYIVASG 212
CGSCWAFST+AA EGI+QI +G L SLSEQEL+DCDT + GC GG M+ F++I+ +G
Sbjct: 142 CGSCWAFSTIAATEGIHQITTGKLVSLSEQELVDCDTKGVDQGCEGGYMEDGFEFIIKNG 201
Query: 213 GLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASG 272
G+ E +YPY +G C K V I GY+ VP N E +L KA+A+QPVSV+I+A G
Sbjct: 202 GITSETNYPYKAVDGKC--NKATSPVAQIKGYEKVPPNSETALQKAVANQPVSVSIDADG 259
Query: 273 TDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTG 332
F FYS G++ G CG ELDHGV AVGYG + G+DY IVKNSWG +WGE+GY+RM+R
Sbjct: 260 AGFMFYSSGIYNGECGTELDHGVTAVGYGTANGTDYWIVKNSWGTQWGEKGYVRMQRGIA 319
Query: 333 KPEGLCGINKMASIP 347
GLCGI +S P
Sbjct: 320 AKHGLCGIALDSSYP 334
>gi|326490904|dbj|BAJ90119.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 457
Score = 321 bits (822), Expect = 4e-85, Method: Compositional matrix adjust.
Identities = 156/312 (50%), Positives = 199/312 (63%), Gaps = 10/312 (3%)
Query: 48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTS-------YWLGLNEFADMS 100
FE+W ++HGK Y E+ R F EN + N V S Y L LN FAD++
Sbjct: 39 FEAWCAEHGKAYATPGERAARLAAFAENAAFVAAHNDAVASSGPGGPSYTLALNAFADLT 98
Query: 101 HEEFKNKYLG---LKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSC 157
H+EF+ LG + P PS V A+P ++DWR+ GAVT VK+QGSCG+C
Sbjct: 99 HDEFRAARLGRLAVGPGPLGAPSPSDGGFEGRVGAVPDALDWRQSGAVTKVKDQGSCGAC 158
Query: 158 WAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKE 217
W+FS A+EGIN+I +G+L SLSEQELIDCD S+N GC GGLM YA+K+++ +GG+ E
Sbjct: 159 WSFSATGAMEGINKITTGSLLSLSEQELIDCDRSYNTGCGGGLMTYAYKFVIKNGGIDTE 218
Query: 218 EDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQF 277
+DYP+ +GTC K + VVTI GY++VP + E LL+A+A QP+SV I S FQ
Sbjct: 219 DDYPFREADGTCNKNKLKKHVVTIDGYKEVPSSKEDLLLQAVAQQPISVGICGSARAFQL 278
Query: 278 YSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGL 337
YS G+F GPC LDH V VGYG G DY IVKNSWG +WG +GY+ M RNTG G+
Sbjct: 279 YSQGIFDGPCPTSLDHAVLIVGYGSEGGKDYWIVKNSWGERWGMKGYMHMHRNTGSSSGI 338
Query: 338 CGINKMASIPLK 349
CGIN MAS P K
Sbjct: 339 CGINMMASFPTK 350
>gi|255563110|ref|XP_002522559.1| cysteine protease, putative [Ricinus communis]
gi|223538250|gb|EEF39859.1| cysteine protease, putative [Ricinus communis]
Length = 344
Score = 321 bits (822), Expect = 4e-85, Method: Compositional matrix adjust.
Identities = 167/353 (47%), Positives = 228/353 (64%), Gaps = 17/353 (4%)
Query: 1 MAFFSHSKLLLLSLSL-SLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTY 59
MA H+KL+L+++ L +L+A S + H SM+ ++WM+++G+ Y
Sbjct: 1 MALLLHNKLVLMAMLLVTLWASQSWSRSL--------HEASMELR---HKTWMTQYGRVY 49
Query: 60 KCIEEKLHRFEIFKENLKHIDQRNKEVTS-YWLGLNEFADMSHEEFKNKYLGLKPQFPTR 118
K EK RF+IFKEN++ I+ N Y LG+N F D+++EEF+ + G +
Sbjct: 50 KGNVEKEKRFKIFKENVEFIESFNNNGNKPYKLGINAFTDLTNEEFRASHNGYTMSMSSH 109
Query: 119 RQP--SAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGN 176
+ + F Y +V A+P S+DWR KGAVT +K+QG CG CWAFS VAA+EGI ++ +G
Sbjct: 110 QSSYRTKSFRYENVTAVPPSLDWRTKGAVTHIKDQGQCGCCWAFSAVAAMEGITKLSTGT 169
Query: 177 LTSLSEQELIDCDTS-FNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEE 235
L SLSEQEL+DCDTS + GC GGLMD AF++I+ + GL E +YPY +G+C +K
Sbjct: 170 LISLSEQELVDCDTSGMDQGCEGGLMDDAFEFIIENNGLTTEANYPYEGVDGSCNTRKAA 229
Query: 236 MEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGV 295
I+GY++VP DE++L KA+A+QPVSVAI+A + FQ YS G+FTG CG ELDHGV
Sbjct: 230 NHAAKITGYENVPAYDEEALRKAVANQPVSVAIDAGESAFQHYSSGIFTGDCGTELDHGV 289
Query: 296 AAVGYGKS-KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
VGYG S G+ Y +VKNSWG WGE GYIRM+R+ EGLCGI S P
Sbjct: 290 TVVGYGTSDDGTKYWLVKNSWGTSWGEDGYIRMERDIDAKEGLCGIAMEPSYP 342
>gi|413944253|gb|AFW76902.1| hypothetical protein ZEAMMB73_056195 [Zea mays]
Length = 340
Score = 321 bits (822), Expect = 4e-85, Method: Compositional matrix adjust.
Identities = 160/344 (46%), Positives = 217/344 (63%), Gaps = 13/344 (3%)
Query: 11 LLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFE 70
+ +L S+ A S A F + L ++ E WM+++ + YK EK RFE
Sbjct: 1 MATLQASILAVLSFAF-FCGAALAARDLNEDSAMVARHEQWMAQYSRVYKDAAEKARRFE 59
Query: 71 IFKENLKHIDQRNKEVT-SYWLGLNEFADMSHEEFKNKYL--GLKPQFPTRRQPSAEFSY 127
+FK N+K I+ N +WLG+N+FAD++++EF+ G KP + S F Y
Sbjct: 60 VFKANVKFIESFNTGGNRKFWLGINQFADLTNDEFRTTKTNKGFKPSLD---KVSTGFRY 116
Query: 128 RDVK--ALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQEL 185
+V A+P ++DWR GAVTP+K+QG CG CWAFS VAA EGI +I +G L SLSEQEL
Sbjct: 117 ENVSVDAIPATIDWRTNGAVTPIKDQGQCGCCWAFSAVAATEGIVKISTGKLISLSEQEL 176
Query: 186 IDCDT-SFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGY 244
+DCD + GC GGLMD AFK+I+ +GGL E +YPY +G C K I GY
Sbjct: 177 VDCDVHGEDQGCEGGLMDDAFKFIIKNGGLTTESNYPYTAADGKC--KSGSNSAANIKGY 234
Query: 245 QDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGK-S 303
+DVP NDE +L+KA+A+QPVSVA++ FQFYSGGV TG CG +LDHG+AA+GYGK S
Sbjct: 235 EDVPTNDEAALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYGKTS 294
Query: 304 KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
G+ Y ++KNSWG WGE GY+RM+++ +G+CG+ S P
Sbjct: 295 DGTKYWLMKNSWGTTWGENGYLRMEKDISDKKGMCGLAMEPSYP 338
>gi|357160599|ref|XP_003578815.1| PREDICTED: vignain-like [Brachypodium distachyon]
Length = 339
Score = 321 bits (822), Expect = 4e-85, Method: Compositional matrix adjust.
Identities = 154/303 (50%), Positives = 207/303 (68%), Gaps = 7/303 (2%)
Query: 49 ESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKY 108
E+WM+++G+ YK EK +FE+FK N + ID N E +WLG+N+FAD+++EEFK
Sbjct: 38 ETWMAQYGRVYKDAAEKAQKFEVFKANARFIDSFNAENHKFWLGINQFADLTNEEFKATK 97
Query: 109 LGLKPQFPTRRQPSAEFSYRDVK--ALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAV 166
K + + S F Y ++K ALP S+DWR KGAVTPVK+QG CG CWAFS VAA
Sbjct: 98 TN-KGFISNKARVSTGFKYENLKIEALPTSIDWRTKGAVTPVKDQGQCGCCWAFSAVAAT 156
Query: 167 EGINQIVSGNLTSLSEQELIDCDT-SFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLME 225
EGI ++ +G L SLSEQEL+DCD + GC GGLMD AFK+I+ +GGL +E YPY E
Sbjct: 157 EGIVKLSTGKLVSLSEQELVDCDVHGEDQGCEGGLMDDAFKFIITNGGLTQESSYPYDAE 216
Query: 226 EGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTG 285
+G C K TI Y+DVP N+E +L+KA+A+QPVSVA++ FQFYSGGV TG
Sbjct: 217 DGKC--KSGSKSAGTIKSYEDVPANNEGALMKAVANQPVSVAVDGGDMTFQFYSGGVMTG 274
Query: 286 PCGAELDHGVAAVGYG-KSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMA 344
CG +LDHG+AA+GYG S G+ + ++KNSWG WGE G++RM+++ +G+CG+
Sbjct: 275 SCGTDLDHGIAAIGYGVTSDGTKFWLMKNSWGTTWGENGFLRMEKDIADKKGMCGLAMEP 334
Query: 345 SIP 347
S P
Sbjct: 335 SYP 337
>gi|118120|sp|P25249.1|CYSP1_HORVU RecName: Full=Cysteine proteinase EP-B 1; Flags: Precursor
gi|1146116|gb|AAA85035.1| cysteine proteinase EPB1 precursor [Hordeum vulgare subsp. vulgare]
Length = 371
Score = 321 bits (822), Expect = 4e-85, Method: Compositional matrix adjust.
Identities = 172/320 (53%), Positives = 208/320 (65%), Gaps = 9/320 (2%)
Query: 38 LTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT-SYWLGLNEF 96
L S + L +L+E W S H + + EK RF FK N I NK Y L LN F
Sbjct: 36 LESEEALWDLYERWQSAH-RVRRHHAEKHRRFGTFKSNAHFIHSHNKRGDHPYRLHLNRF 94
Query: 97 ADMSHEEFKNKYLG-LKPQFPTRRQPSAEFSYR--DVKALPKSVDWRKKGAVTPVKNQGS 153
DM EF+ ++G L+ P + F Y +V LP SVDWR+KGAVT VK+QG
Sbjct: 95 GDMDQAEFRATFVGDLRRDTPAKPPSVPGFMYAALNVSDLPPSVDWRQKGAVTGVKDQGK 154
Query: 154 CGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGG 213
CGSCWAFSTV +VEGIN I +G+L SLSEQELIDCDT+ N+GC GGLMD AF+YI +GG
Sbjct: 155 CGSCWAFSTVVSVEGINAIRTGSLVSLSEQELIDCDTADNDGCQGGLMDNAFEYIKNNGG 214
Query: 214 LHKEEDYPYLMEEGTCEDKKEEME---VVTISGYQDVPENDEQSLLKALAHQPVSVAIEA 270
L E YPY GTC + VV I G+QDVP N E+ L +A+A+QPVSVA+EA
Sbjct: 215 LITEAAYPYRAARGTCNVARAAQNSPVVVHIDGHQDVPANSEEDLARAVANQPVSVAVEA 274
Query: 271 SGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSK-GSDYIIVKNSWGPKWGERGYIRMKR 329
SG F FYS GVFTG CG ELDHGVA VGYG ++ G Y VKNSWGP WGE+GYIR+++
Sbjct: 275 SGKAFMFYSEGVFTGDCGTELDHGVAVVGYGVAEDGKAYWTVKNSWGPSWGEQGYIRVEK 334
Query: 330 NTGKPEGLCGINKMASIPLK 349
++G GLCGI AS P+K
Sbjct: 335 DSGASGGLCGIAMEASYPVK 354
>gi|358343350|ref|XP_003635767.1| Cysteine proteinase [Medicago truncatula]
gi|355501702|gb|AES82905.1| Cysteine proteinase [Medicago truncatula]
Length = 338
Score = 320 bits (821), Expect = 5e-85, Method: Compositional matrix adjust.
Identities = 163/342 (47%), Positives = 221/342 (64%), Gaps = 14/342 (4%)
Query: 9 LLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHR 68
+++L+L + AC + ++ T+ + + +E+W+ ++G+ Y+ EE R
Sbjct: 9 IVILNLWIIASACPEI---------HTKNSTNPAVMKKRYETWLKRYGRHYRDREEWEVR 59
Query: 69 FEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYR 128
F+I++ N+++I+ N + SY L N FAD+++EEFK+ YLG P+F + EF Y
Sbjct: 60 FDIYQSNVQYIEFYNSQNYSYKLIDNRFADITNEEFKSTYLGYLPRFRVQ----TEFRYH 115
Query: 129 DVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDC 188
LPKS+DWRKKGAVT VK+QG CGSCWAFS VAAVEGIN+I + NL SLSEQ+LIDC
Sbjct: 116 KHGELPKSIDWRKKGAVTHVKDQGRCGSCWAFSAVAAVEGINKIKTENLVSLSEQQLIDC 175
Query: 189 DT-SFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDV 247
D S N GC GG M AF YI GG+ ++YPY +G C K + VTISGY+ V
Sbjct: 176 DIKSGNEGCEGGDMYIAFNYIKKHGGIATAKEYPYKGRDGNCNKSKAKNNAVTISGYESV 235
Query: 248 PENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSD 307
P +E+ L A+AHQPVS+A +A G FQFYS G+F+G CG L+HG+ VGYG+ G
Sbjct: 236 PARNEKMLKAAVAHQPVSIATDAGGYAFQFYSKGIFSGSCGKNLNHGMTIVGYGEENGDK 295
Query: 308 YIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
Y IVKNSW WGE GY+RMKR+T +G CGI A+ P+K
Sbjct: 296 YWIVKNSWANDWGESGYVRMKRDTKDKDGTCGIAMDATYPVK 337
>gi|146215992|gb|ABQ10198.1| actinidin Act4b [Actinidia eriantha]
Length = 379
Score = 320 bits (821), Expect = 5e-85, Method: Compositional matrix adjust.
Identities = 158/329 (48%), Positives = 228/329 (69%), Gaps = 8/329 (2%)
Query: 25 AHDFSIVGYSPE-HLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRN 83
A D SI+ Y+ + + D+++ +FESW+ ++GK+Y + EK RFEIFK+NL+ +D+ N
Sbjct: 24 AFDASIITYAKKWEQRTNDEVMAMFESWLVEYGKSYNALGEKERRFEIFKDNLRFVDEHN 83
Query: 84 KEVT-SYWLGLNEFADMSHEEFKNKYLGLKPQFPTR-RQPSAEFSYRDVKALPKSVDWRK 141
+V SY +GLN+F+D++ EE+ + YLG K F R S + R LP S+DWRK
Sbjct: 84 ADVNRSYKVGLNQFSDLTLEEYSSIYLGTK--FDMRMTNVSDRYEPRVGDQLPNSIDWRK 141
Query: 142 KGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDT-SFNNGCNGGL 200
KGAV VKNQG+CGSCW F+ +AAVE INQIV+GNL SLSEQ+++DC S NNGC GG
Sbjct: 142 KGAVLGVKNQGNCGSCWTFAPIAAVEAINQIVTGNLISLSEQQIVDCQRKSPNNGCKGGS 201
Query: 201 MDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALA 260
A+++I+ +GG++ E +YPY ++G C+++K + + VTI Y++VP +E++L KA++
Sbjct: 202 RAGAYQFIIDNGGINTEANYPYKAQDGECDEQKNQ-KYVTIDRYENVPRKNEKALQKAVS 260
Query: 261 HQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWG 320
+Q VSV I ++ ++F+ Y G+FTGPCGA++DH V VGYG G DY IV+NSWG WG
Sbjct: 261 NQLVSVGIASNSSEFKAYKSGIFTGPCGAKIDHAVTIVGYGTEGGMDYWIVRNSWGSNWG 320
Query: 321 ERGYIRMKRNTGKPEGLCGINKMASIPLK 349
E GY+RM+RN G G C I + P+K
Sbjct: 321 ENGYVRMQRNVGNA-GTCFIATSPNYPVK 348
>gi|384253406|gb|EIE26881.1| hypothetical protein COCSUDRAFT_21961 [Coccomyxa subellipsoidea
C-169]
Length = 481
Score = 320 bits (820), Expect = 6e-85, Method: Compositional matrix adjust.
Identities = 160/323 (49%), Positives = 209/323 (64%), Gaps = 10/323 (3%)
Query: 35 PEHLTSMDKLIE-----LFESWMSKHGKTYK-CIEEKLHRFEIFKENLKHIDQRNKEVTS 88
PEH + KL + F W+ K YK +EE +F ++ +NL+ + N++ ++
Sbjct: 30 PEHHVAAVKLAKGNPRAAFSDWVEHLQKAYKDNVEEYERKFSVWLDNLEFVHSHNEKDST 89
Query: 89 YWLGLNEFADMSHEEFKNKYLGLKPQFPTR---RQPSAEFSYRDVKALPKSVDWRKKGAV 145
+ LGL FAD++H+E++ LG +P+ S F Y D +A P S+DWRKKGAV
Sbjct: 90 FKLGLTNFADLTHDEYRQHALGYRPELKGTGLGTGKSTGFQYADYEA-PPSIDWRKKGAV 148
Query: 146 TPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAF 205
T VKNQ CGSCWAFST +VEG N I SG L SLSEQEL+DCD + ++GC+GGLMD+AF
Sbjct: 149 TDVKNQQQCGSCWAFSTTGSVEGANAIYSGELVSLSEQELVDCDVTQDHGCHGGLMDFAF 208
Query: 206 KYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVS 265
+I+ +GG+ E+DY Y ++G C KE+ VVTI Y+DVP NDE +L KA A+QP+S
Sbjct: 209 SFIIRNGGIDTEKDYKYKAQDGVCNIAKEKRHVVTIDSYEDVPPNDESALKKAAANQPIS 268
Query: 266 VAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYI 325
VAIEA +FQ Y+GGVF PCG LDHGV VGYG G+DY IVKNSWG WG+ GYI
Sbjct: 269 VAIEADQREFQLYAGGVFDAPCGTALDHGVLVVGYGSDNGTDYWIVKNSWGDFWGDSGYI 328
Query: 326 RMKRNTGKPEGLCGINKMASIPL 348
R+ R G CGI AS P+
Sbjct: 329 RLARGISNSAGQCGIAMQASYPI 351
>gi|356515048|ref|XP_003526213.1| PREDICTED: vignain-like [Glycine max]
Length = 350
Score = 320 bits (820), Expect = 6e-85, Method: Compositional matrix adjust.
Identities = 170/346 (49%), Positives = 220/346 (63%), Gaps = 16/346 (4%)
Query: 8 KLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLH 67
K +L+L L L C+S ++ H SM E E WM K+GK YK EK
Sbjct: 7 KQHILALVLLLSICTSQVMSRNL------HEASMS---ERHEQWMKKYGKVYKDAAEKQK 57
Query: 68 RFEIFKENLKHIDQRNKEVTS-YWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFS 126
R IFK+N++ I+ N Y L +N AD ++EEF + G K + + P F
Sbjct: 58 RLLIFKDNVEFIESFNAAGNKPYKLSINHLADQTNEEFVASHNGYKYKGSHSQTP---FK 114
Query: 127 YRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELI 186
Y +V +P +VDWR+ GAVT VK+QG CGSCWAFSTVAA EGI QI +G L SLSEQEL+
Sbjct: 115 YGNVTDIPTAVDWRQNGAVTAVKDQGQCGSCWAFSTVAATEGIYQISTGMLMSLSEQELV 174
Query: 187 DCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQD 246
DCD S ++GC+GGLM+ F++I+ +GG+ E +YPY +GTC+ KE I GY+
Sbjct: 175 DCD-SVDHGCDGGLMEDGFEFIIKNGGISSEANYPYTAVDGTCDASKEASPAAQIKGYET 233
Query: 247 VPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGS 306
VP N E++L +A+A+QPVSV+I+A G+ FQFYS GVFTG CG +LDHGV VGYG +
Sbjct: 234 VPANSEEALQQAVANQPVSVSIDAGGSGFQFYSSGVFTGQCGTQLDHGVTVVGYGTTDDG 293
Query: 307 --DYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLKK 350
+Y IVKNSWG +WGE GYIRM+R EGLCGI AS P+ K
Sbjct: 294 THEYWIVKNSWGTQWGEEGYIRMQRGIDAQEGLCGIAMDASYPMGK 339
>gi|146215988|gb|ABQ10196.1| actinidin Act3a [Actinidia eriantha]
Length = 380
Score = 320 bits (820), Expect = 6e-85, Method: Compositional matrix adjust.
Identities = 163/344 (47%), Positives = 230/344 (66%), Gaps = 10/344 (2%)
Query: 11 LLSLSLSLFACSSLAHDFSI-VGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRF 69
+S+SL LF + L F+I SP L + D+++ L+ESW+ K+GK+Y + E+ R
Sbjct: 7 FISMSL-LFFSTFLIFSFAIDAKISP--LRTNDEVMALYESWLVKYGKSYNSLGEREMRI 63
Query: 70 EIFKENLKHIDQRNKEVT-SYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYR 128
EIFKENL+ ID+ N + SY +GLN+FAD++ EE+++ YLG K + + S + +
Sbjct: 64 EIFKENLRFIDEHNADPNRSYTVGLNQFADLTDEEYRSTYLGFKSSL--KSKVSNRYMPQ 121
Query: 129 DVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDC 188
+ LP VDWR GAV VKNQG C SCWAF+T+A VE INQI++G+L SLSEQEL+DC
Sbjct: 122 VGEVLPDYVDWRTTGAVVDVKNQGLCSSCWAFATIATVESINQIITGDLISLSEQELVDC 181
Query: 189 D-TSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDV 247
+ T N GC GG MD A+++I+ +GG++ EE+YPY+ ++ C++ K+ VTI Y+ V
Sbjct: 182 NRTPINEGCKGGFMDDAYEFIINNGGINTEENYPYIGQDDQCDEPKKNQNYVTIDSYEQV 241
Query: 248 PENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFT-GPCGAELDHGVAAVGYGKSKGS 306
P NDE ++ +A+A+QPVSVAI+A F+FY G+FT G CG L+H V +GYG G
Sbjct: 242 PPNDELAMKRAVAYQPVSVAIDAYCLGFRFYQSGIFTGGSCGTTLNHAVTIIGYGTENGI 301
Query: 307 DYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLKK 350
DY IVKNS+G +WGE GY +++RN G EG CGI P+K
Sbjct: 302 DYWIVKNSYGTQWGESGYGKVQRNVGG-EGRCGIASYPFYPVKN 344
>gi|357129125|ref|XP_003566217.1| PREDICTED: thiol protease SEN102-like [Brachypodium distachyon]
Length = 380
Score = 320 bits (819), Expect = 8e-85, Method: Compositional matrix adjust.
Identities = 173/334 (51%), Positives = 217/334 (64%), Gaps = 14/334 (4%)
Query: 29 SIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRN-KEVT 87
S + + L S + L L+E W ++H + + EK RF +F+EN + + + N +
Sbjct: 30 SAMDFGESDLASEESLWALYERWRARH-TVSRDLAEKSRRFNVFRENARLVHEFNLRRDA 88
Query: 88 SYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEF----------SYRDVKALPKSV 137
Y L LN FAD++ +EF+ Y + +P A S+ ALP SV
Sbjct: 89 PYKLRLNRFADLTSDEFRRSYASSRVSHHRMFKPRAANNNDDDDDKGSSFTHGGALPTSV 148
Query: 138 DWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCN 197
DWR+KGAVT VK+QG CGSCWAFST+AAVEGIN I + NLTSLSEQ+L+DCDT N GC+
Sbjct: 149 DWREKGAVTGVKDQGQCGSCWAFSTIAAVEGINAIRTNNLTSLSEQQLVDCDTKTNAGCD 208
Query: 198 GGLMDYAFKYIVASGGLHKEEDYPYLMEE-GTCEDKKEEMEVVTISGYQDVPENDEQSLL 256
GGLMD AF YI GG+ E+ YPY + +C KK VV+I GY+DVP NDE +L
Sbjct: 209 GGLMDDAFSYIAKHGGVAAEKSYPYRARQSSSCNSKKAAAAVVSIDGYEDVPRNDETALK 268
Query: 257 KALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKS-KGSDYIIVKNSW 315
KA+A QPV+VAIEA G+ FQFYS GVF G CG ELDHGVAAVGYG + G+ Y IVKNSW
Sbjct: 269 KAVAAQPVAVAIEAGGSHFQFYSEGVFAGKCGTELDHGVAAVGYGVTVDGTKYWIVKNSW 328
Query: 316 GPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
G +WGE+GYIRMKR+ EGLCGI AS P+K
Sbjct: 329 GEEWGEKGYIRMKRDVADKEGLCGIAMEASYPVK 362
>gi|109119897|dbj|BAE96008.1| cysteine proteinase [Triticum aestivum]
Length = 377
Score = 319 bits (818), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 171/324 (52%), Positives = 209/324 (64%), Gaps = 13/324 (4%)
Query: 38 LTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKE-VTSYWLGLNEF 96
L S + L +L+E W + H + + EK RF FK N+ I NK Y L LN F
Sbjct: 36 LESEEALWDLYERWQTAH-RVPRHHAEKHRRFGTFKSNVHFIHSHNKRGDRPYRLRLNRF 94
Query: 97 ADMSHEEFKNKYLGLKPQFPTRRQPSAE-------FSYRDVKALPKSVDWRKKGAVTPVK 149
DMS EF+ + G + R P+ ++ +V LP+SVDWR+KGAVT VK
Sbjct: 95 GDMSQAEFRATFAGSRVSDRRRDGPATPPSVPGFMYAAVNVSDLPRSVDWRQKGAVTGVK 154
Query: 150 NQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIV 209
NQG CGSCWAFSTV +VEGIN I +G L SLSEQELIDCDT+ N+GC GGLMD AF+YI
Sbjct: 155 NQGKCGSCWAFSTVVSVEGINAIRTGKLVSLSEQELIDCDTADNDGCEGGLMDNAFEYIK 214
Query: 210 ASGGLHKEEDYPYLMEEGTCEDKKEEME---VVTISGYQDVPENDEQSLLKALAHQPVSV 266
+GGL E YPY GTC+ K VV I G+QDVP N E++L KA+A+QPVSV
Sbjct: 215 KNGGLTTEAAYPYRAANGTCKAAKVAKSSPMVVHIDGHQDVPANSEEALAKAVANQPVSV 274
Query: 267 AIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSK-GSDYIIVKNSWGPKWGERGYI 325
I+ASG F FYS GVFTG CG ELDHGVA VGYG ++ G Y VKNSWGP WGE+GYI
Sbjct: 275 GIDASGKAFMFYSEGVFTGECGTELDHGVAVVGYGVAEDGKAYWTVKNSWGPSWGEKGYI 334
Query: 326 RMKRNTGKPEGLCGINKMASIPLK 349
R+++++G GLCGI AS +K
Sbjct: 335 RVEKDSGAEGGLCGIAMEASYAVK 358
>gi|57118007|gb|AAW34135.1| cysteine protease gp2b [Zingiber officinale]
Length = 379
Score = 319 bits (818), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 155/322 (48%), Positives = 216/322 (67%), Gaps = 14/322 (4%)
Query: 38 LTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT----SYWLGL 93
+ S +++ L+ W +K+ K ++ +R E+FKENL+ +D+ N ++ LG+
Sbjct: 41 VRSDEEVRMLYLEWRAKNHPAEKYLDLNEYRLEVFKENLQFVDKHNAAADRGEHTFRLGM 100
Query: 94 NEFADMSHEEFKNKYLGLKPQFPTRRQP-----SAEFSYRDVKALPKSVDWRKKGAVTPV 148
N FAD+++EE++ ++L F R+ S+ + R+ LP S+DWR+KGAV PV
Sbjct: 101 NRFADLTNEEYRTRFL---RDFSRLRRSASGKISSRYRLREGDDLPDSIDWREKGAVVPV 157
Query: 149 KNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYI 208
KNQG CGSCWAFSTVAAVEGINQIV+G+L SLSEQ+L+DC T+ N+GC GG M+ AF++I
Sbjct: 158 KNQGGCGSCWAFSTVAAVEGINQIVTGDLISLSEQQLVDCTTA-NHGCRGGWMNPAFQFI 216
Query: 209 VASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAI 268
V +GG++ EE YPY + G C + VV+I Y++VP ++EQSL KA+A+QPVSV +
Sbjct: 217 VNNGGINSEETYPYRGQNGIC-NSTVNAPVVSIDSYENVPSHNEQSLQKAVANQPVSVTM 275
Query: 269 EASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMK 328
+A+G DFQ Y G+FTG C +H + VGYG DY VKNSWG WGE GYIR++
Sbjct: 276 DAAGRDFQLYRSGIFTGSCNISANHALTVVGYGTENDKDYRTVKNSWGKNWGESGYIRVE 335
Query: 329 RNTGKPEGLCGINKMASIPLKK 350
RN G P G CGI + AS P+KK
Sbjct: 336 RNIGNPNGKCGITRFASYPVKK 357
>gi|223946391|gb|ACN27279.1| unknown [Zea mays]
Length = 279
Score = 319 bits (818), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 158/260 (60%), Positives = 188/260 (72%), Gaps = 11/260 (4%)
Query: 99 MSHEEFKNKYLGLK----PQFPTRRQPSA----EFSYRDVKALPKSVDWRKKGAVTPVKN 150
M+ +EF+ Y G + F RQ S+ F Y D + +P SVDWR+KGAVT VK+
Sbjct: 1 MTADEFRRHYAGSRVAHHRMFRGDRQGSSASASSFMYADARDVPASVDWRQKGAVTDVKD 60
Query: 151 QGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVA 210
QG CGSCWAFST+AAVEGIN I + NLTSLSEQ+L+DCDT N GCNGGLMDYAF+YI
Sbjct: 61 QGQCGSCWAFSTIAAVEGINAIKTKNLTSLSEQQLVDCDTKANAGCNGGLMDYAFQYIAK 120
Query: 211 SGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEA 270
GG+ E+ YPY + +C KK VVTI GY+DVP NDE +L KA+AHQPVSVAIEA
Sbjct: 121 HGGVAAEDAYPYRARQASC--KKSPAPVVTIDGYEDVPANDESALKKAVAHQPVSVAIEA 178
Query: 271 SGTDFQFYSGGVFTGPCGAELDHGVAAVGYG-KSKGSDYIIVKNSWGPKWGERGYIRMKR 329
SG+ FQFYS GVF+G CG ELDHGVAAVGYG + G+ Y +VKNSWGP+WGE+GYIRM R
Sbjct: 179 SGSHFQFYSEGVFSGRCGTELDHGVAAVGYGVTADGTKYWLVKNSWGPEWGEKGYIRMAR 238
Query: 330 NTGKPEGLCGINKMASIPLK 349
+ EG CGI AS P+K
Sbjct: 239 DVAAKEGHCGIAMEASYPVK 258
>gi|297740489|emb|CBI30671.3| unnamed protein product [Vitis vinifera]
Length = 320
Score = 319 bits (817), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 170/345 (49%), Positives = 213/345 (61%), Gaps = 33/345 (9%)
Query: 6 HSKLLLLSL-SLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEE 64
SK++ ++L + ++A +L+ V S H E WM +G+TYK I E
Sbjct: 4 ESKIICITLLIMGVWASQALSRTLHEVSMSERH-----------EDWMGLYGRTYKDIAE 52
Query: 65 KLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAE 124
K RF+IFKEN+++I+ NK FK G R
Sbjct: 53 KERRFKIFKENVEYIESVNK-------------------FKASRNGYNMSSRPRSSEITS 93
Query: 125 FSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQE 184
F Y +V A+P S+DWRKKGAVTP+K+QG CG CWAFS VAA+EG+ Q+ +G L SLSEQE
Sbjct: 94 FRYENVAAVPSSMDWRKKGAVTPIKDQGQCGCCWAFSAVAAMEGVTQLKTGELISLSEQE 153
Query: 185 LIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISG 243
L+DCDTS + GC GGLMD AF++I+ +GGL E +YPY + TC KK I
Sbjct: 154 LVDCDTSGEDQGCGGGLMDSAFEFIIGNGGLTTEANYPYKGVDATCNKKKAASSAAKIKN 213
Query: 244 YQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKS 303
Y+DVP N E +LLKA+A PVSVAI+A G+DFQFYS GVFTG CG ELDHGV AVGYGK+
Sbjct: 214 YEDVPANSEAALLKAVAQHPVSVAIDAGGSDFQFYSSGVFTGQCGTELDHGVTAVGYGKT 273
Query: 304 -KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
G+ Y +VKNSWG WGE GYI M+R+ G EGLCGI AS P
Sbjct: 274 DDGTKYWLVKNSWGTGWGEDGYIWMERDIGADEGLCGIAMEASYP 318
>gi|255568345|ref|XP_002525147.1| cysteine protease, putative [Ricinus communis]
gi|223535606|gb|EEF37274.1| cysteine protease, putative [Ricinus communis]
Length = 347
Score = 319 bits (817), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 155/303 (51%), Positives = 209/303 (68%), Gaps = 3/303 (0%)
Query: 48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNK 107
++ W+ ++G+ Y +E L RF I+ N++ I+ N + S+ L N+FAD++++EF +
Sbjct: 46 YDKWLEQYGRKYDTKDEYLLRFGIYHSNIQFIEYINSQNLSFKLTDNKFADLTNDEFNSI 105
Query: 108 YLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVE 167
YLG + + RR S + + LP +VDWR+ GAVTP+K+QG CGSCWAFS VAAVE
Sbjct: 106 YLGYQIRSYKRRNLS--HMHENSTDLPDAVDWRENGAVTPIKDQGQCGSCWAFSAVAAVE 163
Query: 168 GINQIVSGNLTSLSEQELIDCDTSFNN-GCNGGLMDYAFKYIVASGGLHKEEDYPYLMEE 226
GIN+I +GNL SLSEQEL+DCD + +N GCNGG M+ AF +I + GGL E DYPY +
Sbjct: 164 GINKIKTGNLVSLSEQELVDCDVNGDNKGCNGGFMEKAFTFIKSIGGLTTENDYPYKGTD 223
Query: 227 GTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGP 286
G+CE K + V I GY+ VP N+E SL A++ QPVSVAI+ASG +FQ YS GVF+G
Sbjct: 224 GSCEKAKTDNHAVIIGGYETVPANNENSLKVAVSKQPVSVAIDASGYEFQLYSEGVFSGY 283
Query: 287 CGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASI 346
CG +L+HGV VGYG + G Y +VKNSWG WGE GYIRMKR++ +G+CGI S
Sbjct: 284 CGIQLNHGVTIVGYGDNNGQKYWLVKNSWGKGWGESGYIRMKRDSSDTKGMCGIAMEPSY 343
Query: 347 PLK 349
P+K
Sbjct: 344 PIK 346
>gi|307110445|gb|EFN58681.1| hypothetical protein CHLNCDRAFT_56822 [Chlorella variabilis]
Length = 466
Score = 319 bits (817), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 161/346 (46%), Positives = 215/346 (62%), Gaps = 5/346 (1%)
Query: 9 LLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHR 68
+ L + L +C ++A F + +++ E F+ W+ + Y EE R
Sbjct: 1 MRLSCVLLVACSCLAVAAGFPFENHRLFIQQAVESPREAFDFWVQTLKRAYASAEEYERR 60
Query: 69 FEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYR 128
F+++ +NL+ + + N TS+WL + +AD+S +E+++K LG R A
Sbjct: 61 FDVWLDNLRFVHEYNAGHTSHWLSMGVYADLSQDEYRSKALGYNADLHEERPLRAAPFLY 120
Query: 129 DVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDC 188
+ PK VDW KGAVTPVKNQ CGSCWAFST AVEG + I +G L SLSEQ L+DC
Sbjct: 121 EGTVPPKEVDWVAKGAVTPVKNQLLCGSCWAFSTTGAVEGASAIATGKLASLSEQMLVDC 180
Query: 189 DTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVP 248
D +NGC+GGLMD+AF++I+ +GG+ E+DYPY EEG C+D K VVTI YQDVP
Sbjct: 181 DRERDNGCHGGLMDFAFEFIMKNGGIDTEDDYPYTAEEGMCQDNKMRRHVVTIDDYQDVP 240
Query: 249 ENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSD- 307
NDE +L+KA+A+QPVSVAIEA FQ Y GGVF CG LDHGV VGYG +
Sbjct: 241 PNDEHALMKAVANQPVSVAIEADQRAFQLYGGGVFDAECGTALDHGVLVVGYGTASNGTH 300
Query: 308 ---YIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLKK 350
Y +VKNSWG +WG++GYIR+ RN G+ EG CG+ AS P+KK
Sbjct: 301 HLPYWLVKNSWGAEWGDKGYIRLLRNLGE-EGQCGVAMQASFPIKK 345
>gi|357126406|ref|XP_003564878.1| PREDICTED: cysteine proteinase EP-B 1-like [Brachypodium
distachyon]
Length = 377
Score = 319 bits (817), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 169/334 (50%), Positives = 212/334 (63%), Gaps = 13/334 (3%)
Query: 29 SIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT- 87
S + + + L S + L EL+ W S H + EK RF FK N+ I N +
Sbjct: 23 SAIPFDAKDLESEEALWELYTRWQSAHRLPPQHHAEKHRRFGTFKSNVLFIHAHNTRLND 82
Query: 88 --------SYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDW 139
SY L LN F DM EF++ + G + Q F Y VK +P++VDW
Sbjct: 83 TSTNNNGPSYRLRLNRFGDMDQAEFRSTFAGPLHRHTRPAQSIPGFIYDTVKDIPQAVDW 142
Query: 140 RKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSF-NNGCNG 198
R+KGAVT VK+QG CGSCWAFS VA+VEG+N I +G+L SLSEQELIDCDT +NGC G
Sbjct: 143 RQKGAVTGVKDQGKCGSCWAFSAVASVEGLNAIRTGSLVSLSEQELIDCDTGGDDNGCQG 202
Query: 199 GLMDYAFKYIV-ASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLK 257
GLM+ AF++I ++GGL E YPY GTC + V I G+Q VP +E++L K
Sbjct: 203 GLMESAFEFIAHSAGGLATEAAYPYHASNGTCNANRGSSVSVRIDGHQSVPAGNEEALAK 262
Query: 258 ALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSK--GSDYIIVKNSW 315
A+AHQPVSVAI+A G FQFYS GVFTG CG+ELDHGVA VGYG ++ G +Y IVKNSW
Sbjct: 263 AVAHQPVSVAIDAGGQAFQFYSEGVFTGDCGSELDHGVAVVGYGVAEEDGKEYWIVKNSW 322
Query: 316 GPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
GP WGE GY+RM+R++G GLCGI AS P+K
Sbjct: 323 GPGWGEHGYVRMQRDSGVDGGLCGIAMEASYPVK 356
>gi|413953668|gb|AFW86317.1| hypothetical protein ZEAMMB73_339067 [Zea mays]
Length = 433
Score = 318 bits (816), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 157/342 (45%), Positives = 218/342 (63%), Gaps = 9/342 (2%)
Query: 11 LLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFE 70
+ +L S+ A A F + L+ ++ E WM+++ + YK EK RFE
Sbjct: 94 MATLKASISAIIGFAF-FCGAAMAARDLSDDSVMVARHEQWMAQYSRVYKDASEKARRFE 152
Query: 71 IFKENLKHIDQRNKEVTS-YWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRD 129
+FK N++ I+ N + +WLG+N+FAD++++EF++ + + P+ F Y +
Sbjct: 153 VFKANVQFIESFNAGGNNKFWLGVNQFADLTNDEFRSTKTNKGLKSSNMKIPTG-FRYEN 211
Query: 130 VKA--LPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELID 187
V A LP ++DWR KGAVTP+K+QG CG CWAFS VAA EGI +I +G L SL+EQEL+D
Sbjct: 212 VSADALPTTIDWRTKGAVTPIKDQGQCGCCWAFSAVAATEGIVKISTGKLVSLAEQELVD 271
Query: 188 CDT-SFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQD 246
CD + GC GGLMD AFK+I+ +GGL E YPY +G C K TI GY+D
Sbjct: 272 CDVHGEDQGCEGGLMDDAFKFIIKNGGLTTESSYPYTAADGKC--KSGSNSAATIKGYED 329
Query: 247 VPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGK-SKG 305
VP NDE +L+KA+A+QPVSVA++ FQFYSGGV TG CG +LDHG+AA+GYGK S G
Sbjct: 330 VPANDEAALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYGKTSDG 389
Query: 306 SDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
+ Y ++KNSWG WGE GY+RM+++ G+CG+ S P
Sbjct: 390 TKYWLMKNSWGTTWGENGYLRMEKDISDKRGMCGLAMEPSYP 431
>gi|146215976|gb|ABQ10190.1| actinidin Act1b [Actinidia arguta]
Length = 380
Score = 318 bits (816), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 162/340 (47%), Positives = 224/340 (65%), Gaps = 8/340 (2%)
Query: 14 LSLSLFACSSLAHDFSIVGYSPEHLT--SMDKLIELFESWMSKHGKTYKCIEEKLHRFEI 71
LS+SL S+L + ++ ++LT + D+L ++ESW++K+GK+Y + E RFEI
Sbjct: 8 LSMSLLFFSTLL--VLSLAFNAKNLTKRTNDELKAMYESWLTKYGKSYNSLGEWERRFEI 65
Query: 72 FKENLKHIDQRNKEVT-SYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDV 130
FKE L+ ID+ N + SY +GLN+FAD ++EEF++ YLG + + S + R
Sbjct: 66 FKETLRFIDEHNADTNRSYRVGLNQFADQTNEEFQSTYLGFTSG-SNKMKVSNRYEPRVG 124
Query: 131 KALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDT 190
+ LP VDWR GAV +K+QG CGSCWAFS +A VEGIN+IV+G+L SLSEQEL+DC
Sbjct: 125 QVLPDYVDWRSAGAVVDIKSQGQCGSCWAFSAIATVEGINKIVTGDLISLSEQELVDCGR 184
Query: 191 SFNN-GCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPE 249
+ N GC+GG + F++I+ +GG++ E +YPY E+G C + + +I Y++VP
Sbjct: 185 TQNTRGCDGGSITDGFQFIINNGGINTEANYPYTAEDGQCNLDLQNEKYASIDTYENVPY 244
Query: 250 NDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYI 309
N+E +L A+A+QPVSVA+EA+G FQ YS G+FTGPCG +DH V VGYG G DY
Sbjct: 245 NNEWALQTAVAYQPVSVALEAAGDAFQHYSSGIFTGPCGTAVDHAVTIVGYGTEGGIDYW 304
Query: 310 IVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
IVKNSW WGE GYIR+ RN G G CGI S P+K
Sbjct: 305 IVKNSWDTTWGEEGYIRILRNVGGA-GTCGIATKPSYPVK 343
>gi|302143415|emb|CBI21976.3| unnamed protein product [Vitis vinifera]
Length = 322
Score = 318 bits (815), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 170/350 (48%), Positives = 219/350 (62%), Gaps = 33/350 (9%)
Query: 1 MAFFSHSKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYK 60
MA + + + L+L L A +S A ++ H SM E E WM+++G+ YK
Sbjct: 1 MASVNQYQYICLALLFVLAAWASQATARNL------HEASM---YERHEDWMAQYGRVYK 51
Query: 61 CIEEKLHRFEIFKENLKHIDQRNKEV-TSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRR 119
+EK R++IFK+N+ I+ NK + SY L +NEFAD+++EEF K +
Sbjct: 52 DADEKSKRYKIFKDNVARIESFNKAMDKSYKLSINEFADLTNEEFGTSRNRFKAHICSTE 111
Query: 120 QPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTS 179
S F Y +V A+P ++DWRKKGAVTP+K+QG CGSCWAFS VAA+EGI Q+ +G L S
Sbjct: 112 ATS--FKYENVTAVPSTIDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLIS 169
Query: 180 LSEQELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEV 238
LSEQEL+DCDTS + GCNG +YPY +GTC KK
Sbjct: 170 LSEQELVDCDTSGEDQGCNGA-------------------NYPYAGTDGTCNRKKAAHPA 210
Query: 239 VTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAV 298
I+GY+DVP N+E++L KA+ HQP++VAI+A G +FQFYS GVFTG CG ELDHGVAAV
Sbjct: 211 AKINGYEDVPANNEKALQKAVVHQPIAVAIDAGGFEFQFYSSGVFTGQCGTELDHGVAAV 270
Query: 299 GYGKS-KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
GYG S G Y +VKNSWG WGE GYIRM+R+ EGLCGI AS P
Sbjct: 271 GYGTSDDGMKYWLVKNSWGTGWGEEGYIRMQRDVTAKEGLCGIAMQASYP 320
>gi|356545118|ref|XP_003540992.1| PREDICTED: thiol protease SEN102-like [Glycine max]
Length = 337
Score = 318 bits (814), Expect = 3e-84, Method: Compositional matrix adjust.
Identities = 170/355 (47%), Positives = 219/355 (61%), Gaps = 28/355 (7%)
Query: 1 MAFFSHSK-----LLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKH 55
MAF S + LLL+L + L H TSM E E WM+++
Sbjct: 1 MAFTSQKQYTIALFLLLALGIPQMMSRKL------------HETSMR---ERHEQWMAEY 45
Query: 56 GKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTS-YWLGLNEFADMSHEEFKNKYLGLKPQ 114
GK YK EK RF IFK N++ I+ N Y LG+N AD++ EEFK GLK
Sbjct: 46 GKVYKDAAEKEKRFLIFKHNVEFIESFNAAANKPYKLGVNHLADLTVEEFKASRNGLKRP 105
Query: 115 FPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSC-GSCWAFSTVAAVEGINQIV 173
+ P F Y +V A+P ++DWR KGAVT +K+QG C GSCWAFSTVAA EGI+QI
Sbjct: 106 YELSTTP---FKYENVTAIPAAIDWRTKGAVTSIKDQGQCAGSCWAFSTVAATEGIHQIT 162
Query: 174 SGNLTSLSEQELIDCDT-SFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDK 232
+G L SLSEQEL+DCDT + GC GG M+ F++I+ +GG+ E +YPY +G C
Sbjct: 163 TGKLVSLSEQELVDCDTKGVDQGCEGGYMEDGFEFIIKNGGITSEANYPYKAVDGKC--N 220
Query: 233 KEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELD 292
K V I GY+ VP N E++L KA+A+QPVSV+I+A+G F FYS G++ G CG ELD
Sbjct: 221 KATSPVAQIKGYEKVPPNSEKTLQKAVANQPVSVSIDANGEGFMFYSSGIYNGECGTELD 280
Query: 293 HGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
HGV AVGYG + G+DY +VKNSWG +WGE+GY+RM+R GLCGI +S P
Sbjct: 281 HGVTAVGYGIANGTDYWLVKNSWGTQWGEKGYVRMQRGVAAKHGLCGIALDSSYP 335
>gi|302143411|emb|CBI21972.3| unnamed protein product [Vitis vinifera]
Length = 320
Score = 318 bits (814), Expect = 3e-84, Method: Compositional matrix adjust.
Identities = 171/350 (48%), Positives = 219/350 (62%), Gaps = 35/350 (10%)
Query: 1 MAFFSHSKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYK 60
MA + + + L+L L A +S A S+ H SM E E WM ++G+ YK
Sbjct: 1 MASVNQYQYICLALLFVLAAWASQATARSL------HEASM---YERHEDWMVQYGREYK 51
Query: 61 CIEEKLHRFEIFKENLKHIDQRNKEV-TSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRR 119
+EK R++IFK+N+ I+ NK + SY L +NEFAD+++EEF+ K +
Sbjct: 52 DADEKSKRYKIFKDNVARIESFNKAMDKSYKLSINEFADLTNEEFRASRNRFKAHICSTE 111
Query: 120 QPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTS 179
S F Y +V A+P +VDWRKKGAVTP+K+QG CGSCWAFS VAA+EGI Q+ +G L S
Sbjct: 112 ATS--FKYENVTAVPSTVDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLIS 169
Query: 180 LSEQELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEV 238
LSEQEL+DCDTS + GC +YPY +GTC KK
Sbjct: 170 LSEQELVDCDTSGEDQGCT---------------------NYPYAGTDGTCNRKKAAHPA 208
Query: 239 VTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAV 298
I+GY+DVP N+E++L KA+AHQP++VAI+ASG++FQFYS GVFTG CG ELDHGVAAV
Sbjct: 209 AKINGYEDVPANNEKALQKAVAHQPIAVAIDASGSEFQFYSSGVFTGQCGTELDHGVAAV 268
Query: 299 GYGKS-KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
GYG S G Y +VKNSW WGE GYIRM+R+ EGLCGI AS P
Sbjct: 269 GYGTSDDGMKYWLVKNSWSTGWGEEGYIRMQRDVTAKEGLCGIAMQASYP 318
>gi|115464789|ref|NP_001055994.1| Os05g0508300 [Oryza sativa Japonica Group]
gi|48475189|gb|AAT44258.1| hypothetical protein [Oryza sativa Japonica Group]
gi|113579545|dbj|BAF17908.1| Os05g0508300 [Oryza sativa Japonica Group]
Length = 450
Score = 318 bits (814), Expect = 3e-84, Method: Compositional matrix adjust.
Identities = 156/302 (51%), Positives = 198/302 (65%), Gaps = 3/302 (0%)
Query: 48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNK 107
FE+W ++HG++Y E+ R F +N + N SY L LN FAD++H+EF+
Sbjct: 38 FEAWCAEHGRSYATPGERAARLAAFADNAAFVAAHNGAPASYALALNAFADLTHDEFRAA 97
Query: 108 YLGLKPQFPTR-RQPSAEFSYRD--VKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVA 164
LG R A + D V A+P +VDWR+ GAVT VK+QGSCG+CW+FS
Sbjct: 98 RLGRLAAAGGPGRDGGAPYLGVDGGVGAVPDAVDWRQSGAVTKVKDQGSCGACWSFSATG 157
Query: 165 AVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLM 224
A+EGIN+I +G+L SLSEQELIDCD S+N+GC GGLMDYA+K++V +GG+ E DYPY
Sbjct: 158 AMEGINKIKTGSLISLSEQELIDCDRSYNSGCGGGLMDYAYKFVVKNGGIDTEADYPYRE 217
Query: 225 EEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFT 284
+GTC K + VVTI GY+DVP N+E LL+A+A QPVSV I S FQ YS G+F
Sbjct: 218 TDGTCNKNKLKRRVVTIDGYKDVPANNEDMLLQAVAQQPVSVGICGSARAFQLYSKGIFD 277
Query: 285 GPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMA 344
GPC LDH + VGYG G DY IVKNSWG WG +GY+ M RNTG G+CGIN+M
Sbjct: 278 GPCPTSLDHAILIVGYGSEGGKDYWIVKNSWGESWGMKGYMYMHRNTGNSNGVCGINQMP 337
Query: 345 SI 346
S
Sbjct: 338 SF 339
>gi|242094000|ref|XP_002437490.1| hypothetical protein SORBIDRAFT_10g028000 [Sorghum bicolor]
gi|241915713|gb|EER88857.1| hypothetical protein SORBIDRAFT_10g028000 [Sorghum bicolor]
Length = 372
Score = 318 bits (814), Expect = 3e-84, Method: Compositional matrix adjust.
Identities = 158/331 (47%), Positives = 217/331 (65%), Gaps = 12/331 (3%)
Query: 26 HDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKE 85
H ++IV E + D++ ++E+W S+HG + +++L R E+F++NL++ID N E
Sbjct: 32 HSYAIVPAPVER--ADDEVRRMYEAWKSEHGHGHGS-DDRL-RLEVFRDNLRYIDAHNAE 87
Query: 86 VT----SYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKA---LPKSVD 138
++ LGL FAD++ EE++ + LG + + + + SYR LP ++D
Sbjct: 88 ADAGLHTFRLGLTPFADLTLEEYRGRALGFRARRGGASRVGSGSSYRPRPRGGDLPDAID 147
Query: 139 WRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNG 198
WR+ GAVT VKNQ CG CWAFS VAA+EGIN+IV+GNL SLSEQE+IDCDT + GCNG
Sbjct: 148 WRELGAVTGVKNQEQCGGCWAFSAVAAIEGINEIVTGNLVSLSEQEIIDCDTQ-DGGCNG 206
Query: 199 GLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKA 258
G M AF++++ +GG+ E DYPYL + C+ + VVTI G+ V +E +L +A
Sbjct: 207 GEMQNAFQFVINNGGIDTEADYPYLGTDAACDANRVNERVVTIDGFVSVATENETALQEA 266
Query: 259 LAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPK 318
+A+QPVSVAI+ASG FQ Y+ G+F GPCG +LDHGV AVGYG G DY IVKNSW
Sbjct: 267 VANQPVSVAIDASGRKFQHYTSGIFNGPCGTQLDHGVTAVGYGSENGKDYWIVKNSWSSS 326
Query: 319 WGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
WGE GYIR++RN G CGI AS P+K
Sbjct: 327 WGEAGYIRIRRNVAAATGKCGIAMDASYPVK 357
>gi|224079085|ref|XP_002305743.1| predicted protein [Populus trichocarpa]
gi|222848707|gb|EEE86254.1| predicted protein [Populus trichocarpa]
Length = 494
Score = 317 bits (813), Expect = 4e-84, Method: Compositional matrix adjust.
Identities = 169/337 (50%), Positives = 222/337 (65%), Gaps = 10/337 (2%)
Query: 22 SSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQ 81
SSL ++SIVG L + +IE+F+ W +H K YK EE RF FK NLK+I +
Sbjct: 17 SSLPSEYSIVGNDFSELPPDESIIEIFQQWRDRHQKAYKHAEEAEKRFGNFKRNLKYIIE 76
Query: 82 RN-KEVT-SYWLGLNEFADMSHEEFKNKYLG-LKPQFPTRRQPSAEFSYRDVKAL--PKS 136
+ KE T + +GLN+FAD+S+EEFK YL +K R + + S R++++ P S
Sbjct: 77 KTGKETTLRHRVGLNKFADLSNEEFKQLYLSKVKKPINKTRIDAEDRSRRNLQSCDAPSS 136
Query: 137 VDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGC 196
+DWRKKG VT VK+QG CGSCW+FST A+EGIN IV+ +L SLSEQEL+DCDT+ N GC
Sbjct: 137 LDWRKKGVVTAVKDQGDCGSCWSFSTTGAIEGINAIVTSDLISLSEQELVDCDTT-NYGC 195
Query: 197 NGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLL 256
GG MDYAF++++ +GG+ E +YPY +GTC KEE++VV+I GY+DV E D +LL
Sbjct: 196 EGGYMDYAFEWVINNGGIDTEANYPYTGVDGTCNTAKEEIKVVSIDGYKDVDETD-SALL 254
Query: 257 KALAHQPVSVAIEASGTDFQFYSGGVF---TGPCGAELDHGVAAVGYGKSKGSDYIIVKN 313
A A QP+SV I+ S DFQ Y+GG++ ++DH V VGYG G DY IVKN
Sbjct: 255 CAAAQQPISVGIDGSAIDFQLYTGGIYDGDCSDDPDDIDHAVLIVGYGSENGEDYWIVKN 314
Query: 314 SWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLKK 350
SWG WG GY +KRNT P G+C IN MAS P K+
Sbjct: 315 SWGTSWGIEGYFYIKRNTDLPYGVCAINAMASYPTKE 351
>gi|359483753|ref|XP_002266308.2| PREDICTED: oryzain alpha chain-like [Vitis vinifera]
Length = 501
Score = 317 bits (813), Expect = 5e-84, Method: Compositional matrix adjust.
Identities = 171/354 (48%), Positives = 225/354 (63%), Gaps = 20/354 (5%)
Query: 12 LSLSLSLF-----AC--SSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEE 64
+ L+L LF AC SSL +F I G E S +++ ELF W +H + YK EE
Sbjct: 6 IQLALVLFIWASLACLSSSLPTEFYITG---EEFASEERVRELFHLWKERHKRVYKHAEE 62
Query: 65 KLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAE 124
RFEIFKENLK++ +RN + + LG+N+FADMS+EEFK KYL + ++
Sbjct: 63 TAKRFEIFKENLKYVIERNSKGHRHTLGMNKFADMSNEEFKEKYLSKIKKPINKKNNYLR 122
Query: 125 FSYRDVKAL-----PKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTS 179
S + K P S+DWRKKG VT +K+QG CGSCWAFS+ A+EGIN IV+G+L S
Sbjct: 123 RSMQQKKGTASCEAPSSLDWRKKGVVTGIKDQGDCGSCWAFSSTGAMEGINAIVTGDLIS 182
Query: 180 LSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVV 239
LSEQEL+DCDT+ N GC GG MDYAF++++++GG+ E DYPY +GTC KE+ +VV
Sbjct: 183 LSEQELVDCDTT-NYGCEGGYMDYAFEWVISNGGIDSESDYPYTGTDGTCNTTKEDTKVV 241
Query: 240 TISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTG---PCGAELDHGVA 296
+I GY+DV E+D +LL A +QP+SV ++ S DFQ Y+ G++ G ++DH V
Sbjct: 242 SIDGYKDVDESD-SALLCAAVNQPISVGMDGSALDFQLYTSGIYAGDCSDDPDDIDHAVL 300
Query: 297 AVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLKK 350
VGYG DY I KNSWG WG GY +KRNT P G C IN MAS P K+
Sbjct: 301 IVGYGSEDSEDYWICKNSWGTSWGMEGYFYIKRNTDLPYGECAINAMASYPTKE 354
>gi|413953667|gb|AFW86316.1| hypothetical protein ZEAMMB73_635707 [Zea mays]
Length = 340
Score = 317 bits (813), Expect = 5e-84, Method: Compositional matrix adjust.
Identities = 157/342 (45%), Positives = 217/342 (63%), Gaps = 9/342 (2%)
Query: 11 LLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFE 70
+ +L S+ A A F + L+ ++ E WM+++ + YK EK RFE
Sbjct: 1 MATLKASILAILGFAF-FCGAALAARDLSDDSAMVARHEQWMAQYSRVYKDASEKARRFE 59
Query: 71 IFKENLKHIDQRNKEVTS-YWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRD 129
+FK N+K I+ N + +WLG+N+FAD++++EF++ + + P+ F Y +
Sbjct: 60 VFKANVKFIESFNAGGNNKFWLGVNQFADLTNDEFRSIKTNKGFKSSNMKIPTG-FRYEN 118
Query: 130 VK--ALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELID 187
V ALP ++DWR KGAVTP+K+QG CG CWAFS VAA EGI +I +G L SL+EQEL+D
Sbjct: 119 VSVDALPTTIDWRTKGAVTPIKDQGQCGCCWAFSAVAATEGIVKISTGKLVSLAEQELVD 178
Query: 188 CDT-SFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQD 246
CD + GC GGLMD AFK+I+ +GGL E YPY +G C K TI GY+D
Sbjct: 179 CDVHGEDQGCEGGLMDDAFKFIINNGGLTTESSYPYTAADGKC--KSGSNSAATIKGYED 236
Query: 247 VPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGK-SKG 305
VP NDE +L+KA+A+QPVSVA++ FQFYS GV TG CG +LDHG+AA+GYGK S G
Sbjct: 237 VPANDEAALMKAVANQPVSVAVDGGDMTFQFYSSGVMTGSCGTDLDHGIAAIGYGKTSDG 296
Query: 306 SDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
+ Y ++KNSWG WGE GY+RM+++ G+CG+ S P
Sbjct: 297 TKYWLMKNSWGTTWGENGYLRMEKDISDKRGMCGLAMEPSYP 338
>gi|156142226|gb|ABU51882.1| ervatamin-C precursor [Tabernaemontana divaricata]
Length = 365
Score = 317 bits (812), Expect = 5e-84, Method: Compositional matrix adjust.
Identities = 169/350 (48%), Positives = 226/350 (64%), Gaps = 18/350 (5%)
Query: 7 SKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSM---DKLIELFESWMSKHGKTYKCIE 63
S L ++S+ L L A S A D S + Y + ++ +++ E++E W++KH K Y +
Sbjct: 2 STLFIISILLFL-ASFSYAMDISTIEYKYDKSSAWRTDEEVKEIYELWLAKHDKVYSGLV 60
Query: 64 EKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQP-- 121
E RFEIFK+NLK ID+ N E +Y +GL + D+++EEF+ YLG + R +
Sbjct: 61 EYEKRFEIFKDNLKFIDEHNSENHTYKMGLTPYTDLTNEEFQAIYLGTRSDTIHRLKRTI 120
Query: 122 --SAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTS 179
S ++Y LP+ +DWRKKGAVTPVKNQG CGSCWAFSTV+ VE INQI +GNL S
Sbjct: 121 NISERYAYEAGDNLPEQIDWRKKGAVTPVKNQGKCGSCWAFSTVSTVESINQIRTGNLIS 180
Query: 180 LSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVV 239
LSEQ+L+DC+ N+GC GG YA++YI+ +GG+ E +YPY +G C K +VV
Sbjct: 181 LSEQQLVDCNKK-NHGCKGGAFVYAYQYIIDNGGIDTEANYPYKAVQGPCRAAK---KVV 236
Query: 240 TISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVG 299
I GY+ VP +E +L KA+A QP VAI+AS FQ Y G+F+GPCG +L+HGV VG
Sbjct: 237 RIDGYKGVPHCNENALKKAVASQPSVVAIDASSKQFQHYKSGIFSGPCGTKLNHGVVIVG 296
Query: 300 YGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
Y K DY IV+NSWG WGE+GYIRMKR G GLCGI ++ P K
Sbjct: 297 YWK----DYWIVRNSWGRYWGEQGYIRMKRVGGC--GLCGIARLPYYPTK 340
>gi|2144501|pir||TAGB actinidain (EC 3.4.22.14) precursor - kiwi fruit
gi|166317|gb|AAA32629.1| actinidin [Actinidia deliciosa]
Length = 380
Score = 317 bits (812), Expect = 5e-84, Method: Compositional matrix adjust.
Identities = 162/353 (45%), Positives = 225/353 (63%), Gaps = 23/353 (6%)
Query: 1 MAFFSHSKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLT--SMDKLIELFESWMSKHGKT 58
M+ S LL+LSL+ ++ ++LT + D++ ++ESW+ K+GK+
Sbjct: 10 MSLLFFSTLLILSLA-----------------FNAKNLTQRTNDEVKAMYESWLIKYGKS 52
Query: 59 YKCIEEKLHRFEIFKENLKHIDQRNKEVT-SYWLGLNEFADMSHEEFKNKYLGLKPQFPT 117
Y + E RFEIFKE L+ ID+ N + SY +GLN+FAD++ EEF++ YLG
Sbjct: 53 YNSLGEWERRFEIFKETLRFIDEHNADTNRSYKVGLNQFADLTDEEFRSTYLGFTSG-SN 111
Query: 118 RRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNL 177
+ + S + R + LP VDWR GAV +K+QG CG CWAFS +A VEGIN+IV+G L
Sbjct: 112 KTKVSNRYEPRVGQVLPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVL 171
Query: 178 TSLSEQELIDCDTSFNN-GCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEM 236
SLSEQELIDC + N GCNGG + F++I+ +GG++ EE+YPY ++G C + +
Sbjct: 172 ISLSEQELIDCGRTQNTRGCNGGYITDGFQFIINNGGINTEENYPYTAQDGECNVELQNE 231
Query: 237 EVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVA 296
+ VTI Y++VP N+E +L A+ +QPVSVA++A+G F+ YS G+FTGPCG +DH V
Sbjct: 232 KYVTIDTYENVPYNNEWALQTAVTYQPVSVALDAAGDAFKQYSSGIFTGPCGTAIDHAVT 291
Query: 297 AVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
VGYG G DY IVKNSW WGE GY+R+ RN G G CGI M S P+K
Sbjct: 292 IVGYGTEGGIDYWIVKNSWDTTWGEEGYMRILRNVGGA-GTCGIATMPSYPVK 343
>gi|357160569|ref|XP_003578807.1| PREDICTED: vignain-like [Brachypodium distachyon]
Length = 339
Score = 317 bits (811), Expect = 6e-84, Method: Compositional matrix adjust.
Identities = 166/349 (47%), Positives = 223/349 (63%), Gaps = 25/349 (7%)
Query: 8 KLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLH 67
K LL++ L CSS+ + L ++ ESWM ++G+ YK EK
Sbjct: 5 KASLLAILGCLCFCSSV--------LAARELNDDLSMVARHESWMLQYGRVYKDAAEKAS 56
Query: 68 RFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFK----NK-YLGLKPQFPTRRQPS 122
+FE+FK N ID N +WLG+N+FAD++++EFK NK ++ K + PT
Sbjct: 57 KFEVFKANAGFIDSFNAGNHKFWLGINQFADITNKEFKATKTNKGFISNKVRAPTG---- 112
Query: 123 AEFSYRDVK--ALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSL 180
FSY +V ALP S+DWR KGAVTPVK+QG CG CWAFS VAA EGI ++ +G L SL
Sbjct: 113 --FSYENVSFDALPASIDWRTKGAVTPVKDQGQCGCCWAFSAVAATEGIVKLSTGKLVSL 170
Query: 181 SEQELIDCDT-SFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVV 239
SEQEL+DCD + GC GGLMD AFK+I+++GGL +E YPY E+G C K
Sbjct: 171 SEQELVDCDVHGEDQGCEGGLMDDAFKFIISNGGLTQESSYPYDAEDGKC--KSGSKSAG 228
Query: 240 TISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVG 299
TI Y+DVP N+E +L+KA+A+QPVSVA++ FQFYSGGV TG CG +LDHG+AA+G
Sbjct: 229 TIKSYEDVPANNEGALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIG 288
Query: 300 YG-KSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
YG S G+ Y ++KNSWG WGE G++RM+++ +G+CG+ S P
Sbjct: 289 YGVTSDGTKYWLMKNSWGTSWGENGFLRMEKDIADKKGMCGLAMEPSYP 337
>gi|193806686|sp|A5HII1.1|ACTN_ACTDE RecName: Full=Actinidain; Short=Actinidin; AltName: Full=Allergen
Act d 1; AltName: Allergen=Act d 1; Flags: Precursor
gi|146215974|gb|ABQ10189.1| actinidin Act1a [Actinidia deliciosa]
Length = 380
Score = 317 bits (811), Expect = 7e-84, Method: Compositional matrix adjust.
Identities = 162/353 (45%), Positives = 224/353 (63%), Gaps = 23/353 (6%)
Query: 1 MAFFSHSKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLT--SMDKLIELFESWMSKHGKT 58
M+ S LL+LSL+ ++ ++LT + D++ ++ESW+ K+GK+
Sbjct: 10 MSLLFFSTLLILSLA-----------------FNAKNLTQRTNDEVKAMYESWLIKYGKS 52
Query: 59 YKCIEEKLHRFEIFKENLKHIDQRNKEVT-SYWLGLNEFADMSHEEFKNKYLGLKPQFPT 117
Y + E RFEIFKE L+ ID+ N + SY +GLN+FAD++ EEF++ YLG
Sbjct: 53 YNSLGEWERRFEIFKETLRFIDEHNADTNRSYKVGLNQFADLTDEEFRSTYLGFTSG-SN 111
Query: 118 RRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNL 177
+ + S + R + LP VDWR GAV +K+QG CG CWAFS +A VEGIN+IV+G L
Sbjct: 112 KTKVSNRYEPRVGQVLPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVL 171
Query: 178 TSLSEQELIDCDTSFNN-GCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEM 236
SLSEQELIDC + N GCNGG + F++I+ +GG++ EE+YPY ++G C +
Sbjct: 172 ISLSEQELIDCGRTQNTRGCNGGYITDGFQFIINNGGINTEENYPYTAQDGECNLDLQNE 231
Query: 237 EVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVA 296
+ VTI Y++VP N+E +L A+ +QPVSVA++A+G F+ YS G+FTGPCG +DH V
Sbjct: 232 KYVTIDTYENVPYNNEWALQTAVTYQPVSVALDAAGDAFKHYSSGIFTGPCGTAIDHAVT 291
Query: 297 AVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
VGYG G DY IVKNSW WGE GY+R+ RN G G CGI M S P+K
Sbjct: 292 IVGYGTEGGIDYWIVKNSWDTTWGEEGYMRILRNVGGA-GTCGIATMPSYPVK 343
>gi|194352758|emb|CAQ00107.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
Length = 457
Score = 317 bits (811), Expect = 7e-84, Method: Compositional matrix adjust.
Identities = 154/309 (49%), Positives = 197/309 (63%), Gaps = 10/309 (3%)
Query: 48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTS-------YWLGLNEFADMS 100
FE+W ++HGK Y E+ R F EN + N V S Y L LN FAD++
Sbjct: 39 FEAWCAEHGKAYATPGERAARLAAFAENAAFVAAHNDAVASSGPGGPSYTLALNAFADLT 98
Query: 101 HEEFKNKYLG---LKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSC 157
H+EF+ LG + P PS V A+P ++DWR+ GAVT VK+QGSCG+C
Sbjct: 99 HDEFRAARLGRLAVGPGPLGAPSPSDGGFEGRVGAVPDALDWRQSGAVTKVKDQGSCGAC 158
Query: 158 WAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKE 217
W+FS A+EGIN+I +G+L SLSEQELIDCD S+N GC GGLM YA+K+++ +GG+ E
Sbjct: 159 WSFSATGAMEGINKITTGSLLSLSEQELIDCDRSYNTGCGGGLMTYAYKFVIKNGGIDTE 218
Query: 218 EDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQF 277
+DYP+ +GTC K + VVTI GY++VP + E LL+A+A QP+SV I S FQ
Sbjct: 219 DDYPFREADGTCNKNKLKKHVVTIDGYKEVPSSKEDLLLQAVAQQPISVGICGSARAFQL 278
Query: 278 YSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGL 337
YS G+F GPC LDH V VGYG G DY IVKNSWG +WG +GY+ M RNTG G+
Sbjct: 279 YSQGIFDGPCPTSLDHAVLIVGYGSEGGKDYWIVKNSWGERWGMKGYMHMHRNTGSSSGI 338
Query: 338 CGINKMASI 346
CGIN MAS
Sbjct: 339 CGINMMASF 347
>gi|358345461|ref|XP_003636796.1| Cysteine proteinase [Medicago truncatula]
gi|355502731|gb|AES83934.1| Cysteine proteinase [Medicago truncatula]
Length = 475
Score = 317 bits (811), Expect = 7e-84, Method: Compositional matrix adjust.
Identities = 161/342 (47%), Positives = 216/342 (63%), Gaps = 23/342 (6%)
Query: 15 SLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKE 74
SL+ +C + ++SI+ + S ++++ELF+ W +H K Y EE R E FK
Sbjct: 18 SLTFLSCYGIPSEYSILAFDLNKFPSEEQVVELFQQWKKEHQKFYIHPEEAALRLENFKR 77
Query: 75 NLKHIDQRNKEVTS---YWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVK 131
NLK+I +RN S + LGLN FADMS+EEFKNK++
Sbjct: 78 NLKYIVERNAMRNSPVGHHLGLNRFADMSNEEFKNKFIS---------------KVESCD 122
Query: 132 ALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTS 191
P S+DWRKKG VT VK+QG+CGSCW+FS+ A+EG+N IV+G+L SLSEQEL+DCDT+
Sbjct: 123 DAPYSLDWRKKGVVTGVKDQGNCGSCWSFSSTGAIEGVNAIVTGDLISLSEQELVDCDTT 182
Query: 192 FNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPEND 251
N+GC GG MDYAF++++ +GG+ E DYPY+ GTC KEE +VVTI GY DV ++D
Sbjct: 183 -NDGCEGGYMDYAFEWVINNGGIDTEADYPYIGVGGTCNVTKEETKVVTIDGYTDVTQSD 241
Query: 252 EQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGA---ELDHGVAAVGYGKSKGSDY 308
+L A QP+SV I+ S DFQ Y+GG++ G C + ++DH V VGYG DY
Sbjct: 242 -SALFCATVKQPISVGIDGSTLDFQLYTGGIYDGDCSSNPDDIDHAVLIVGYGSDGNQDY 300
Query: 309 IIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLKK 350
IVKNSWG WG G+I ++RNT G+C IN MAS P K+
Sbjct: 301 WIVKNSWGTSWGIEGFIYIRRNTNLKYGVCAINYMASFPTKE 342
>gi|15984|emb|CAA34486.1| unnamed protein product [Actinidia deliciosa]
Length = 380
Score = 317 bits (811), Expect = 7e-84, Method: Compositional matrix adjust.
Identities = 162/353 (45%), Positives = 224/353 (63%), Gaps = 23/353 (6%)
Query: 1 MAFFSHSKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLT--SMDKLIELFESWMSKHGKT 58
M+ S LL+LSL+ ++ ++LT + D++ ++ESW+ K+GK+
Sbjct: 10 MSLLFFSTLLILSLA-----------------FNAKNLTQRTNDEVKAMYESWLIKYGKS 52
Query: 59 YKCIEEKLHRFEIFKENLKHIDQRNKEVT-SYWLGLNEFADMSHEEFKNKYLGLKPQFPT 117
Y + E RFEIFKE L+ ID+ N + SY +GLN+FAD++ EEF++ YLG
Sbjct: 53 YNSLGEWERRFEIFKETLRFIDEHNADTNRSYKVGLNQFADLTDEEFRSTYLGFTSG-SN 111
Query: 118 RRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNL 177
+ + S + R + LP VDWR GAV +K+QG CG CWAFS +A VEGIN+IV+G L
Sbjct: 112 KTKVSNRYEPRFGQVLPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVL 171
Query: 178 TSLSEQELIDCDTSFNN-GCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEM 236
SLSEQELIDC + N GCNGG + F++I+ +GG++ EE+YPY ++G C +
Sbjct: 172 ISLSEQELIDCGRTQNTRGCNGGYITDGFQFIINNGGINTEENYPYTAQDGECNLDLQNE 231
Query: 237 EVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVA 296
+ VTI Y++VP N+E +L A+ +QPVSVA++A+G F+ YS G+FTGPCG +DH V
Sbjct: 232 KYVTIDTYENVPYNNEWALQTAVTYQPVSVALDAAGDAFKHYSSGIFTGPCGTAIDHAVT 291
Query: 297 AVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
VGYG G DY IVKNSW WGE GY+R+ RN G G CGI M S P+K
Sbjct: 292 IVGYGTEGGIDYWIVKNSWDTTWGEEGYMRILRNVGGA-GTCGIATMPSYPVK 343
>gi|37780049|gb|AAP32197.1| cysteine protease 10 [Trifolium repens]
Length = 272
Score = 317 bits (811), Expect = 8e-84, Method: Compositional matrix adjust.
Identities = 153/261 (58%), Positives = 187/261 (71%), Gaps = 2/261 (0%)
Query: 89 YWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPV 148
Y LG+N+FAD+++EEFK K + + F Y + A+P +VDWRKKGAVTPV
Sbjct: 10 YKLGINKFADLTNEEFKASRNKFKGHMCSSIIRTTTFKYENASAIPSTVDWRKKGAVTPV 69
Query: 149 KNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDT-SFNNGCNGGLMDYAFKY 207
KNQG CGSCWAFS VAA EGI+Q+ +G L SLSEQELIDCDT + GC GGLMD AFK+
Sbjct: 70 KNQGQCGSCWAFSAVAATEGIHQLSTGKLVSLSEQELIDCDTKGVDQGCEGGLMDDAFKF 129
Query: 208 IVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVA 267
I+ + GL E YPY +GTC + + VTI+GY+DVP N+E +L KA+A+QP+SVA
Sbjct: 130 IIQNHGLSTEVQYPYEGVDGTCNTNEASIHAVTITGYEDVPANNELALQKAVANQPISVA 189
Query: 268 IEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYG-KSKGSDYIIVKNSWGPKWGERGYIR 326
I+ASG+DFQFY+ GVFTG CG ELDHGV AVGYG + G+ Y +VKNSWG WGE GYIR
Sbjct: 190 IDASGSDFQFYNSGVFTGSCGTELDHGVTAVGYGVGNDGTKYWLVKNSWGADWGEEGYIR 249
Query: 327 MKRNTGKPEGLCGINKMASIP 347
M+R EGLCGI AS P
Sbjct: 250 MQRGIDAAEGLCGIAMQASYP 270
>gi|312451836|gb|ADQ85985.1| actinidin [Actinidia chinensis]
Length = 380
Score = 317 bits (811), Expect = 8e-84, Method: Compositional matrix adjust.
Identities = 162/353 (45%), Positives = 224/353 (63%), Gaps = 23/353 (6%)
Query: 1 MAFFSHSKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLT--SMDKLIELFESWMSKHGKT 58
M+ S LL+LSL+ ++ ++LT + D++ ++ESW+ K+GK+
Sbjct: 10 MSLLFFSTLLILSLA-----------------FNTKNLTQRTNDEVKAMYESWLIKYGKS 52
Query: 59 YKCIEEKLHRFEIFKENLKHIDQRNKEVT-SYWLGLNEFADMSHEEFKNKYLGLKPQFPT 117
Y + E RFEIFKE L+ ID+ N + SY +GLN+FAD++ EEF++ YLG
Sbjct: 53 YNSLGEWERRFEIFKETLRFIDEHNADTNRSYKVGLNQFADLTDEEFRSTYLGFTSG-SN 111
Query: 118 RRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNL 177
+ + S + R + LP VDWR GAV +K+QG CG CWAFS +A VEGIN+IV+G L
Sbjct: 112 KTKVSNRYEPRVGQVLPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVL 171
Query: 178 TSLSEQELIDCDTSFNN-GCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEM 236
SLSEQELIDC + N GCNGG + F++I+ +GG++ EE+YPY ++G C +
Sbjct: 172 ISLSEQELIDCGRTQNTRGCNGGYITDGFQFIINNGGINTEENYPYTAQDGECNVDLQNE 231
Query: 237 EVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVA 296
+ VTI Y++VP N+E +L A+ +QPVSVA++A+G F+ YS G+FTGPCG +DH V
Sbjct: 232 KYVTIDTYENVPYNNEWALQTAVTYQPVSVALDAAGDAFKQYSSGIFTGPCGTAIDHAVT 291
Query: 297 AVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
VGYG G DY IVKNSW WGE GY+R+ RN G G CGI M S P+K
Sbjct: 292 IVGYGTEGGIDYWIVKNSWDTTWGEEGYMRILRNVGGA-GTCGIATMPSYPVK 343
>gi|356515040|ref|XP_003526209.1| PREDICTED: thiol protease SEN102-like [Glycine max]
Length = 342
Score = 317 bits (811), Expect = 8e-84, Method: Compositional matrix adjust.
Identities = 170/351 (48%), Positives = 224/351 (63%), Gaps = 15/351 (4%)
Query: 1 MAFFSHSKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYK 60
MAF + +L +L LF LA S V H T+ L E E+WM+++GK YK
Sbjct: 1 MAFTGQKQHML---ALFLF----LAVGISQVMPRKLHQTA---LRERHENWMAEYGKMYK 50
Query: 61 CIEEKLHRFEIFKENLKHIDQRNKEVTS-YWLGLNEFADMSHEEFKNKYLGLKP--QFPT 117
EK RF+IFK+N++ I+ N Y LG+N AD++ EEFK+ GLK +F T
Sbjct: 51 DAAEKEKRFQIFKDNVEFIESFNAAGNKPYKLGVNHLADLTLEEFKDSRNGLKRTYEFST 110
Query: 118 RRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQG-SCGSCWAFSTVAAVEGINQIVSGN 176
F Y +V +P+++DWR KGAVTP+K+QG CGSCWAFST+AA EGI+QI +GN
Sbjct: 111 TTFKLNGFKYENVTDIPEAIDWRVKGAVTPIKDQGDQCGSCWAFSTIAATEGIHQISTGN 170
Query: 177 LTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEM 236
L SLSEQEL+DCD S ++GC GG M+ F++I+ +GG+ E +YPY +GTC
Sbjct: 171 LVSLSEQELVDCD-SVDDGCEGGFMEDGFEFIIKNGGITSETNYPYKGVDGTCNTTIAAS 229
Query: 237 EVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVA 296
V I GY+ VP E++L KA+A+QPVSV+I A+ F FYS G++ G CG +LDHGV
Sbjct: 230 PVAQIKGYEIVPSYSEEALQKAVANQPVSVSIHATNATFMFYSSGIYNGECGTDLDHGVT 289
Query: 297 AVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
AVGYG G+DY IVKNSWG +WGE+GYIRM R G+CGI +S P
Sbjct: 290 AVGYGTENGTDYWIVKNSWGTQWGEKGYIRMHRGIAAKHGICGIALDSSYP 340
>gi|318136892|gb|ADV41672.1| cysteine protease [Nicotiana tabacum]
Length = 349
Score = 316 bits (810), Expect = 9e-84, Method: Compositional matrix adjust.
Identities = 152/305 (49%), Positives = 211/305 (69%), Gaps = 6/305 (1%)
Query: 49 ESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNK-EVTSYWLGLNEFADMSHEEFKNK 107
+ W++ H K YK + EK RF+IFKEN++ I+ N E Y LG+N+F+D+++E+F+
Sbjct: 43 DQWIAHHDKVYKDLNEKEMRFKIFKENVERIEAFNAGEDKGYKLGVNKFSDLTNEKFRVL 102
Query: 108 YLGLK---PQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVA 164
+ G K P+ + +P F Y +V +P ++DWRKKGAVTP+K+Q CG CWAFS VA
Sbjct: 103 HTGYKRSHPKVMSSSKPKTHFRYANVTDIPPTMDWRKKGAVTPIKDQKECGCCWAFSAVA 162
Query: 165 AVEGINQIVSGNLTSLSEQELIDCDTSFNN-GCNGGLMDYAFKYIVASGGLHKEEDYPYL 223
A EG++Q+ +G L LSEQEL+DCD + GC+GGL+D AF +I+ + GL E +YPY
Sbjct: 163 ATEGLHQLKTGKLIPLSEQELVDCDVEGEDEGCSGGLLDTAFDFILKNKGLTTEANYPYK 222
Query: 224 MEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVF 283
E+G C KK + I+GY+DVP N E++LL+A+A+QPVSVAI+ S DFQFYS GVF
Sbjct: 223 GEDGVCNKKKSALSAAKIAGYEDVPANSEKALLQAVANQPVSVAIDGSSFDFQFYSSGVF 282
Query: 284 TGPCGAELDHGVAAVGYG-KSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINK 342
+G C L+H V AVGYG + G+ Y I+KNSWG KWG+ GY+R+KR+ + EGLCG+
Sbjct: 283 SGSCSTWLNHAVTAVGYGATTDGTKYWIIKNSWGSKWGDSGYMRIKRDVHEKEGLCGLAM 342
Query: 343 MASIP 347
AS P
Sbjct: 343 DASYP 347
>gi|27728675|gb|AAO18731.1| cysteine protease [Gossypium hirsutum]
Length = 389
Score = 316 bits (810), Expect = 9e-84, Method: Compositional matrix adjust.
Identities = 162/330 (49%), Positives = 217/330 (65%), Gaps = 11/330 (3%)
Query: 30 IVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRN-KEVTS 88
IV + + S ++++E+F+ W KH K Y+ EE RFE FK NLK+I +RN K +
Sbjct: 31 IVEHEIDAFLSEERVLEIFQQWKEKHRKVYRHAEEAEKRFENFKGNLKYILERNAKRKAN 90
Query: 89 YW---LGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKAL--PKSVDWRKKG 143
W +GLN+FADMS+EEF+ YL + + + R V++ P S+DWR G
Sbjct: 91 KWEHHVGLNKFADMSNEEFRKAYLSKVKKPINKGITLSRNMRRKVQSCDAPSSLDWRNYG 150
Query: 144 AVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDY 203
VT VK+QGSCGSCWAFS+ A+EGIN +V+G+L SLSEQEL++CDTS N GC GG MDY
Sbjct: 151 VVTAVKDQGSCGSCWAFSSTGAMEGINALVTGDLISLSEQELVECDTS-NYGCEGGYMDY 209
Query: 204 AFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQP 263
AF++++ +GG+ E DYPY +GTC KEE +VV+I GYQDV ++D +LL A+A QP
Sbjct: 210 AFEWVINNGGIDSESDYPYTGVDGTCNTTKEETKVVSIDGYQDVEQSD-SALLCAVAQQP 268
Query: 264 VSVAIEASGTDFQFYSGGVFTGPCG---AELDHGVAAVGYGKSKGSDYIIVKNSWGPKWG 320
VSV I+ S DFQ Y+GG++ G C ++DH V VGYG +Y IVKNSWG WG
Sbjct: 269 VSVGIDGSAIDFQLYTGGIYDGSCSDDPDDIDHAVLIVGYGSEDSEEYWIVKNSWGTSWG 328
Query: 321 ERGYIRMKRNTGKPEGLCGINKMASIPLKK 350
GY +KR+T P G+C +N MAS P K+
Sbjct: 329 IDGYFYLKRDTDLPYGVCAVNAMASYPTKQ 358
>gi|356542631|ref|XP_003539770.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 315 bits (808), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 168/343 (48%), Positives = 225/343 (65%), Gaps = 12/343 (3%)
Query: 15 SLSLFACSSLAHD--FSIVGYSPEHLTSMD-KLIELFESWMSKHGKTYKCIEEKLHRFEI 71
S +LF C+SLA F +S T D + E E WM++HGK YK EK R++I
Sbjct: 3 SENLFHCTSLALLLLFGFWAFSANTRTLEDASMHERHEQWMAQHGKVYKDHHEKELRYKI 62
Query: 72 FKENLKHIDQRNKEVT-SYWLGLNEFADMSHEEFK--NKYLGLKPQFPTRRQPSAEFSYR 128
F++N+K I+ N S+ LG+N+FAD++ EEFK NK LK ++ ++ F Y
Sbjct: 63 FQQNVKGIEGFNNAGNKSHKLGVNQFADLTEEEFKAINK---LKGYMWSKISRTSTFKYE 119
Query: 129 DVKALPKSVDWRKKGAVTPVKNQG-SCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELID 187
V +P ++DWR+KGAVTP+K+QG CGSCWAF+ VAA EGI ++ +G L SLSEQELID
Sbjct: 120 HVTKVPATLDWRQKGAVTPIKSQGLKCGSCWAFAAVAATEGITKLTTGELISLSEQELID 179
Query: 188 CDTSFNNG-CNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQD 246
CDT+ +NG C G++ AFK+IV + GL E YPY +GTC K E V +I GY+D
Sbjct: 180 CDTNGDNGGCKWGIIQEAFKFIVQNKGLATEASYPYQAVDGTCNAKVESKHVASIKGYED 239
Query: 247 VPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKS-KG 305
VP N+E +LL A+A+QPVSV +++S DF+FYS GV +G CG DH V VGYG S G
Sbjct: 240 VPANNETALLNAVANQPVSVLVDSSDYDFRFYSSGVLSGSCGTTFDHAVTVVGYGVSDDG 299
Query: 306 SDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPL 348
+ Y ++KNSWG WGE+GYIR+KR+ EG+CGI AS P+
Sbjct: 300 TKYWLIKNSWGVYWGEQGYIRIKRDVAAKEGMCGIAMQASYPI 342
>gi|242072572|ref|XP_002446222.1| hypothetical protein SORBIDRAFT_06g005410 [Sorghum bicolor]
gi|241937405|gb|EES10550.1| hypothetical protein SORBIDRAFT_06g005410 [Sorghum bicolor]
Length = 340
Score = 315 bits (807), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 160/345 (46%), Positives = 215/345 (62%), Gaps = 21/345 (6%)
Query: 10 LLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRF 69
+L L L+LF ++LA L ++ E WM+++ + YK EK RF
Sbjct: 8 ILAILGLALFCGAALA---------ARDLNDDSAMVARHEQWMAQYNRVYKDATEKAQRF 58
Query: 70 EIFKENLKHIDQRNKEVT-SYWLGLNEFADMSHEEFK----NKYLGLKPQFPTRRQPSAE 124
E+FK N+K I+ N +WLG+N+FAD++++EF+ NK G KP P +
Sbjct: 59 EVFKANVKFIESFNAGGNRKFWLGVNQFADLTNDEFRATKTNK--GFKPS-PVKVPTGFR 115
Query: 125 FSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQE 184
+ V ALP S+DWR KGAVTP+K+QG CG CWAFS VAA EGI +I + L SLSEQE
Sbjct: 116 YENVSVDALPASIDWRTKGAVTPIKDQGQCGCCWAFSAVAATEGIVKISTDKLISLSEQE 175
Query: 185 LIDCDT-SFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISG 243
L+DCD + GC GGLMD AFK+I+ +GGL E YPY +G C K I G
Sbjct: 176 LVDCDVHGEDQGCEGGLMDDAFKFIIKNGGLTTESSYPYTATDGKC--KSGTNSAANIKG 233
Query: 244 YQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGK- 302
++DVP NDE +L+KA+A+QPVSVA++ FQ YSGGV TG CG +LDHG+AA+GYG+
Sbjct: 234 FEDVPANDEAALMKAVANQPVSVAVDGGDMTFQLYSGGVMTGSCGTDLDHGIAAIGYGQT 293
Query: 303 SKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
S G+ Y ++KNSWG WGE GY+RM+++ G+CG+ S P
Sbjct: 294 SDGTKYWLLKNSWGTTWGENGYLRMEKDISDKRGMCGLAMEPSYP 338
>gi|84181681|gb|AAW78661.2| senescence-specific cysteine protease [Nicotiana tabacum]
Length = 349
Score = 315 bits (807), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 164/353 (46%), Positives = 226/353 (64%), Gaps = 12/353 (3%)
Query: 1 MAFFSHSKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYK 60
MAF + S+ L L+L F C L + + +M + W+ H K YK
Sbjct: 1 MAFANLSQYLCLAL---FFICLGLWSSQVALSRPINYEATMRAR---HDQWIVHHEKVYK 54
Query: 61 CIEEKLHRFEIFKENLKHIDQRNK-EVTSYWLGLNEFADMSHEEFKNKYLGLK---PQFP 116
+ EK RF+IFKEN++ I+ N E Y LG N+F+D+++EEF+ + G K P+
Sbjct: 55 DLNEKEVRFQIFKENVERIEAFNAGEDKGYKLGFNKFSDLTNEEFRVLHTGYKRSHPKVM 114
Query: 117 TRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGN 176
T + F Y +V +P ++DWRKKGAVTP+K+Q CG CWAFS VAA+EG++Q+ +G
Sbjct: 115 TSSKGKTHFRYTNVTDIPPTMDWRKKGAVTPIKDQKECGCCWAFSAVAAMEGLHQLKTGE 174
Query: 177 LTSLSEQELIDCDTSFNN-GCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEE 235
L LSEQEL+DCD + GC+GGL+D AF +I+ + GL E +YPY E+G C KK
Sbjct: 175 LIPLSEQELVDCDVEGEDEGCSGGLLDTAFDFILKNKGLTTEVNYPYKGEDGVCNKKKSA 234
Query: 236 MEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGV 295
+ I+GY+DVP N E++LL+A+A+QPVSVAI+ S DFQFYS GVF+G C L+H V
Sbjct: 235 LSAAKITGYEDVPANSEKALLQAVANQPVSVAIDGSSFDFQFYSSGVFSGSCSTWLNHAV 294
Query: 296 AAVGYG-KSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
AVGYG + G+ Y I+KNSWG KWG+ GY+R+KR+ + EGLCG+ AS P
Sbjct: 295 TAVGYGATTDGTKYWIIKNSWGSKWGDSGYMRIKRDVHEKEGLCGLAMDASYP 347
>gi|302143412|emb|CBI21973.3| unnamed protein product [Vitis vinifera]
Length = 320
Score = 315 bits (807), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 169/350 (48%), Positives = 219/350 (62%), Gaps = 35/350 (10%)
Query: 1 MAFFSHSKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYK 60
MA + + + L+L L A +S A ++ H SM E E WM ++G+ YK
Sbjct: 1 MASVNQYQYICLALLFVLAAWASQATARNL------HEASM---YERHEDWMVQYGREYK 51
Query: 61 CIEEKLHRFEIFKENLKHIDQRNKEV-TSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRR 119
+EK R++IFK+N+ I+ NK + SY L +NEFAD+++EEF+ K +
Sbjct: 52 DADEKSKRYKIFKDNVARIESFNKAMDKSYKLSINEFADLTNEEFRASRNRFKAHICSTE 111
Query: 120 QPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTS 179
S F Y +V A+P +VDWRKKGAVTP+K+QG CGSCWAFS VAA+EGI Q+ +G L S
Sbjct: 112 ATS--FKYENVTAVPSTVDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLIS 169
Query: 180 LSEQELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEV 238
LSEQEL+DCDTS + GC +YPY +GTC KK
Sbjct: 170 LSEQELVDCDTSGEDQGCT---------------------NYPYAGTDGTCNRKKAAHPA 208
Query: 239 VTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAV 298
I+GY+DVP N+E++L KA+AHQP++VAI+A G++FQFYS GVFTG CG ELDHGV+AV
Sbjct: 209 AKINGYEDVPANNEKALQKAVAHQPIAVAIDAGGSEFQFYSSGVFTGQCGTELDHGVSAV 268
Query: 299 GYGKS-KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
GYG S G Y +VKNSWG WGE GYIRM+R+ EGLCGI AS P
Sbjct: 269 GYGTSDDGMKYWLVKNSWGTGWGEEGYIRMQRDVTAKEGLCGIAMQASYP 318
>gi|125547236|gb|EAY93058.1| hypothetical protein OsI_14861 [Oryza sativa Indica Group]
Length = 339
Score = 315 bits (806), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 160/352 (45%), Positives = 224/352 (63%), Gaps = 23/352 (6%)
Query: 4 FSHSKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIE 63
+ +K LL ++ L CS++ + L+ + E WM+++G+ Y+
Sbjct: 1 MAMAKALLFAILGCLCLCSAV--------LAARELSDDAAMAARHERWMAQYGRVYRDDA 52
Query: 64 EKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFK----NKYLGLKPQFPTRR 119
EK RFE+FK N+ I+ N ++WLG+N+FAD++++EF+ NK G P T R
Sbjct: 53 EKARRFEVFKANVAFIESFNAGNHNFWLGVNQFADLTNDEFRWTKTNK--GFIPS--TTR 108
Query: 120 QPSAEFSYRDVK--ALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNL 177
P+ F Y +V ALP +VDWR KGAVTP+K+QG CG CWAFS VAA+EGI ++ +G L
Sbjct: 109 VPTG-FRYENVNIDALPATVDWRTKGAVTPIKDQGQCGCCWAFSAVAAMEGIVKLSTGKL 167
Query: 178 TSLSEQELIDCDT-SFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEM 236
SLSEQEL+DCD + GC GGLMD AFK+I+ +GGL E +YPY + C K
Sbjct: 168 ISLSEQELVDCDVHGEDQGCEGGLMDDAFKFIIKNGGLTTESNYPYAAADDKC--KSVSN 225
Query: 237 EVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVA 296
V +I GY+DVP N+E +L+KA+A+QPVSVA++ FQFY GGV TG CG +LDHG+
Sbjct: 226 SVASIKGYEDVPANNEAALMKAVANQPVSVAVDGGDMTFQFYKGGVMTGSCGTDLDHGIV 285
Query: 297 AVGYGK-SKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
A+GYGK S G+ Y ++KNSWG WGE G++RM+++ G+CG+ S P
Sbjct: 286 AIGYGKASDGTKYWLLKNSWGTTWGENGFLRMEKDISDKRGMCGLAMEPSYP 337
>gi|38345008|emb|CAD40026.2| OSJNBa0052O21.11 [Oryza sativa Japonica Group]
gi|125589414|gb|EAZ29764.1| hypothetical protein OsJ_13822 [Oryza sativa Japonica Group]
Length = 339
Score = 315 bits (806), Expect = 3e-83, Method: Compositional matrix adjust.
Identities = 160/352 (45%), Positives = 224/352 (63%), Gaps = 23/352 (6%)
Query: 4 FSHSKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIE 63
+ +K LL ++ L CS++ + L+ + E WM+++G+ Y+
Sbjct: 1 MAMAKALLFAILGCLCLCSAV--------LAARELSDDAAMAARHERWMAQYGRVYRDDA 52
Query: 64 EKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFK----NKYLGLKPQFPTRR 119
EK RFE+FK N+ I+ N ++WLG+N+FAD++++EF+ NK G P T R
Sbjct: 53 EKARRFEVFKANVAFIESFNAGNHNFWLGVNQFADLTNDEFRWMKTNK--GFIPS--TTR 108
Query: 120 QPSAEFSYRDVK--ALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNL 177
P+ F Y +V ALP +VDWR KGAVTP+K+QG CG CWAFS VAA+EGI ++ +G L
Sbjct: 109 VPTG-FRYENVNIDALPATVDWRTKGAVTPIKDQGQCGCCWAFSAVAAMEGIVKLSTGKL 167
Query: 178 TSLSEQELIDCDT-SFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEM 236
SLSEQEL+DCD + GC GGLMD AFK+I+ +GGL E +YPY + C K
Sbjct: 168 ISLSEQELVDCDVHGEDQGCEGGLMDDAFKFIIKNGGLTTESNYPYAAADDKC--KSVSN 225
Query: 237 EVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVA 296
V +I GY+DVP N+E +L+KA+A+QPVSVA++ FQFY GGV TG CG +LDHG+
Sbjct: 226 SVASIKGYEDVPANNEAALMKAVANQPVSVAVDGGDMTFQFYKGGVMTGSCGTDLDHGIV 285
Query: 297 AVGYGK-SKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
A+GYGK S G+ Y ++KNSWG WGE G++RM+++ G+CG+ S P
Sbjct: 286 AIGYGKASDGTKYWLLKNSWGTTWGENGFLRMEKDISDKRGMCGLAMEPSYP 337
>gi|242032709|ref|XP_002463749.1| hypothetical protein SORBIDRAFT_01g005350 [Sorghum bicolor]
gi|241917603|gb|EER90747.1| hypothetical protein SORBIDRAFT_01g005350 [Sorghum bicolor]
Length = 381
Score = 314 bits (804), Expect = 4e-83, Method: Compositional matrix adjust.
Identities = 170/330 (51%), Positives = 209/330 (63%), Gaps = 16/330 (4%)
Query: 33 YSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNK--EVTSYW 90
+ L S + L +L+E W + H + ++ EK RF FKEN++ I NK + SY
Sbjct: 31 FDERDLASDEALWDLYERWQTHH-RVHRHHGEKGRRFGTFKENVRFIHAHNKRGDRPSYR 89
Query: 91 LGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAE-------FSYRDVKALPKSVDWRKKG 143
L LN F DM EEF++ + + R + S+ F Y D +P+SVDWR+ G
Sbjct: 90 LRLNRFGDMGPEEFRSTFADSRINDLRRYRESSPAATAVPGFMYDDATDVPRSVDWRQHG 149
Query: 144 AVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDY 203
AVT VKNQG CGSCWAFSTV AVEGIN I +G+L SLSEQEL+DCDT+ NGC GGLM+
Sbjct: 150 AVTAVKNQGRCGSCWAFSTVVAVEGINAIRTGSLVSLSEQELVDCDTA-ENGCQGGLMEN 208
Query: 204 AFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEV--VTISGYQDVPENDEQSLLKALAH 261
AF +I + GG+ E YPY GTC+ + V+I G+Q VP E +L KA+A
Sbjct: 209 AFDFIKSYGGITTESAYPYRASNGTCDGMRARRGRVHVSIDGHQMVPTGSEDALAKAVAR 268
Query: 262 QPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKS--KGSDYIIVKNSWGPKW 319
QPVSVAI+A G FQFYS GVFTG CG +LDHGVA VGYG S G+ Y IVKNSWGP W
Sbjct: 269 QPVSVAIDAGGQAFQFYSEGVFTGDCGTDLDHGVAVVGYGVSDVDGTPYWIVKNSWGPSW 328
Query: 320 GERGYIRMKRNTGKPEGLCGINKMASIPLK 349
GE GYIRM+R G GLCGI AS P+K
Sbjct: 329 GEGGYIRMQRGAGNG-GLCGIAMEASFPIK 357
>gi|224116884|ref|XP_002317418.1| predicted protein [Populus trichocarpa]
gi|222860483|gb|EEE98030.1| predicted protein [Populus trichocarpa]
Length = 503
Score = 314 bits (804), Expect = 5e-83, Method: Compositional matrix adjust.
Identities = 160/336 (47%), Positives = 219/336 (65%), Gaps = 18/336 (5%)
Query: 24 LAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRN 83
+ +DFS L S + +IE+F+ W +H K Y+ E R+ FK NLK+I ++
Sbjct: 33 VVNDFS-------ELVSEESIIEIFQQWRDRHQKVYEHAAESEKRYRNFKRNLKYIIEKA 85
Query: 84 KEVTS---YWLGLNEFADMSHEEFKNKYLG-LKPQFPTRRQPSAEFSYRDVKAL--PKSV 137
+ T+ + +GLN+FAD+S+EEFK YL +K +R + ++ R+++ P S+
Sbjct: 86 GKKTAALGHSVGLNKFADLSNEEFKELYLSKVKKPINIKRSTARDWRQRNLQTCDAPSSL 145
Query: 138 DWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCN 197
DWRKKG VT VK+QG CGSCW+FST A+EGIN IV+G+L SLSEQEL+DCDT+ N GC
Sbjct: 146 DWRKKGVVTAVKDQGDCGSCWSFSTTGAIEGINAIVTGDLISLSEQELVDCDTT-NYGCE 204
Query: 198 GGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLK 257
GG MDYAF++++ +GG+ E +YPY +GTC KEE++VV+I GY DV E D +LL
Sbjct: 205 GGYMDYAFEWVINNGGIDTEANYPYTGVDGTCNTTKEEIKVVSIDGYTDVDETD-SALLC 263
Query: 258 ALAHQPVSVAIEASGTDFQFYSGGVFTGPCG---AELDHGVAAVGYGKSKGSDYIIVKNS 314
A QP+SV ++ S DFQ Y+GG++ G C ++DH V VGYG G DY IVKNS
Sbjct: 264 ATVQQPISVGMDGSALDFQLYTGGIYDGDCSDDPNDIDHAVLIVGYGSENGEDYWIVKNS 323
Query: 315 WGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLKK 350
WG +WG GY +KRNT P G+C IN AS P K+
Sbjct: 324 WGTEWGMEGYFYIKRNTDLPYGVCAINAEASYPTKE 359
>gi|4731372|gb|AAD28476.1|AF133838_1 papain-like cysteine protease [Sandersonia aurantiaca]
Length = 370
Score = 314 bits (804), Expect = 5e-83, Method: Compositional matrix adjust.
Identities = 144/254 (56%), Positives = 185/254 (72%), Gaps = 5/254 (1%)
Query: 100 SHEEFKNKYLGLKPQFPTRRQP---SAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGS 156
S + Y G++ RR P S + YR ALP SVDWR+KGAV P+K+QG CGS
Sbjct: 7 SRPRRRTTYFGVRGA--GRRTPGLASDRYRYRAGDALPDSVDWREKGAVVPIKDQGGCGS 64
Query: 157 CWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHK 216
CWAFST+A+VEGIN+IV+G+L SLSEQEL+DCD ++N+GCNGGLMDYAF++I+ +GG+
Sbjct: 65 CWAFSTIASVEGINKIVTGDLISLSEQELVDCDKTYNDGCNGGLMDYAFQFIIDNGGIDT 124
Query: 217 EEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQ 276
E+DYPY ++G C+ ++ +VV+I+ Y+DVP NDEQ+L KA A QP++VAI+ G FQ
Sbjct: 125 EKDYPYTEQDGRCDSYRKNAKVVSINSYEDVPVNDEQALKKAAASQPIAVAIDGGGRSFQ 184
Query: 277 FYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEG 336
Y+ G+FTG CG LDHGV VGYG G DY IV+NSWG WGE+GYIRM RN P G
Sbjct: 185 LYNSGIFTGKCGTSLDHGVTVVGYGSESGKDYWIVRNSWGESWGEKGYIRMARNIDSPSG 244
Query: 337 LCGINKMASIPLKK 350
+CGI AS P+KK
Sbjct: 245 ICGIAMEASYPIKK 258
>gi|242055323|ref|XP_002456807.1| hypothetical protein SORBIDRAFT_03g043220 [Sorghum bicolor]
gi|241928782|gb|EES01927.1| hypothetical protein SORBIDRAFT_03g043220 [Sorghum bicolor]
Length = 369
Score = 314 bits (804), Expect = 5e-83, Method: Compositional matrix adjust.
Identities = 178/358 (49%), Positives = 223/358 (62%), Gaps = 19/358 (5%)
Query: 1 MAFFSHSKLLLLSLSLSLFA-CSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTY 59
MA + + LL+ +++S C ++ D L S + L +L+E W + H +
Sbjct: 1 MAQLAKTLLLVALVAMSAVELCRAIEFD-------ERDLASDEALWDLYERWQTHH-HVH 52
Query: 60 KCIEEKLHRFEIFKENLKHIDQRNKEVTS-YWLGLNEFADMSHEEFKNKYLGLKPQFPTR 118
+ EK RF FKEN++ I NK Y L LN F DM EEF++ + + R
Sbjct: 53 RHHGEKGRRFGTFKENVRFIHAHNKRGDRPYRLSLNRFGDMGREEFRSTFADSRINDLRR 112
Query: 119 RQPSAE-----FSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIV 173
+ A F Y V LP SVDWRK+GAVT VK+QG CGSCWAFSTV +VEGIN I
Sbjct: 113 AESPAAPAVPGFMYDGVTDLPPSVDWRKEGAVTAVKDQGHCGSCWAFSTVVSVEGINAIR 172
Query: 174 SGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCED-K 232
+G+L SLSEQELIDCDT NGC GGLM+ AF++I + GG+ E YPY GTC+ +
Sbjct: 173 TGSLVSLSEQELIDCDTD-ENGCQGGLMENAFEFIKSYGGVTTESAYPYRASNGTCDSVR 231
Query: 233 KEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELD 292
++V+I G+Q VP E +L KA+A+QPVSVAI+A G FQFYS GVFTG CG +LD
Sbjct: 232 SRRGQIVSIDGHQMVPTGSEDALAKAVANQPVSVAIDAGGQAFQFYSEGVFTGDCGTDLD 291
Query: 293 HGVAAVGYGKS-KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
HGVAAVGYG S G+ Y IVKNSWGP WGE GYIRM+R G GLCGI AS P+K
Sbjct: 292 HGVAAVGYGVSDDGTAYWIVKNSWGPSWGEGGYIRMQRGAGN-GGLCGIAMEASFPIK 348
>gi|357160300|ref|XP_003578721.1| PREDICTED: oryzain beta chain-like [Brachypodium distachyon]
Length = 349
Score = 313 bits (803), Expect = 5e-83, Method: Compositional matrix adjust.
Identities = 151/315 (47%), Positives = 206/315 (65%), Gaps = 11/315 (3%)
Query: 44 LIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNK--EVTSYWLGLNEFADMSH 101
++E E WM++HG+ YK EK RFE F+ N+ I+ N +WLG+N+F D+++
Sbjct: 33 MVERHEQWMAQHGRVYKDGAEKARRFEAFRNNVVFIESFNAAGNRRKFWLGVNQFTDLTN 92
Query: 102 EEFK----NK-YLGLKPQFPTRRQPSAEFSYRDVKA--LPKSVDWRKKGAVTPVKNQGSC 154
+EF+ NK ++ + P+ F Y +V A LP +VDWR KGAVTP+KNQG C
Sbjct: 93 DEFRATKTNKGFIKRNAAAVNKASPTGTFRYSNVSADALPAAVDWRAKGAVTPIKNQGQC 152
Query: 155 GSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTS-FNNGCNGGLMDYAFKYIVASGG 213
G CWAFS VAA EGI Q+ +G L LSEQEL+DCD + ++GC GG MD AF++I+ +GG
Sbjct: 153 GCCWAFSAVAATEGIVQLSTGKLVPLSEQELVDCDANGADHGCEGGEMDDAFEFIIKNGG 212
Query: 214 LHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGT 273
L E +YPY ++G C+ K V TI GY+DVP NDE SL+KA+A QPVSVA++
Sbjct: 213 LTSETNYPYTAQDGQCKAKNTINSVATIKGYEDVPANDEASLMKAVAAQPVSVAVDGGDM 272
Query: 274 DFQFYSGGVFTGPCGAELDHGVAAVGYGKS-KGSDYIIVKNSWGPKWGERGYIRMKRNTG 332
FQ Y+GGV +G CG LDHG+ AVGYG + G+ + ++KNSWG WGE GYIRM+++
Sbjct: 273 VFQHYAGGVLSGSCGTSLDHGIVAVGYGAADDGTKFWLMKNSWGTTWGEDGYIRMEKDVA 332
Query: 333 KPEGLCGINKMASIP 347
G+CG+ S P
Sbjct: 333 DAGGMCGLAMQPSYP 347
>gi|312451845|gb|ADQ85986.1| actinidin [Actinidia chinensis]
Length = 380
Score = 313 bits (803), Expect = 6e-83, Method: Compositional matrix adjust.
Identities = 161/353 (45%), Positives = 222/353 (62%), Gaps = 23/353 (6%)
Query: 1 MAFFSHSKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLT--SMDKLIELFESWMSKHGKT 58
M+ S LL+LSL+ ++ ++LT + D++ ++ESW+ K+GK+
Sbjct: 10 MSLLFFSTLLILSLA-----------------FNAKNLTQRTNDEVKAMYESWLIKYGKS 52
Query: 59 YKCIEEKLHRFEIFKENLKHIDQRNKEVT-SYWLGLNEFADMSHEEFKNKYLGLKPQFPT 117
Y + E RFEIFKE L+ ID+ N + SY +GLN+FAD++ EEF++ YLG
Sbjct: 53 YNSLGEWERRFEIFKETLRFIDEHNADTNRSYKVGLNQFADLTDEEFRSTYLGFTSG-SN 111
Query: 118 RRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNL 177
+ + S + R + LP VDWR GAV +K+QG CG CWAFS +A VEGIN+IV+G L
Sbjct: 112 KTKVSNRYEPRVGQVLPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVL 171
Query: 178 TSLSEQELIDCDTSFNN-GCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEM 236
SLSEQELIDC + N GCNG + F +I+ +GG++ EE+YPY ++G C +
Sbjct: 172 ISLSEQELIDCGRTQNTRGCNGSYITDGFPFIINNGGINTEENYPYTAQDGECNVDLQNE 231
Query: 237 EVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVA 296
+ VTI Y++VP N+E +L A+ +QPVSVA++A+G F+ YS G+FTGPCG +DH V
Sbjct: 232 KYVTIDTYENVPYNNEWALQTAVTYQPVSVALDAAGDAFKQYSSGIFTGPCGTAIDHAVT 291
Query: 297 AVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
VGYG G DY IVKNSW WGE GY+R+ RN G G CGI M S P+K
Sbjct: 292 IVGYGTEGGIDYWIVKNSWDTTWGEEGYMRILRNVGGA-GTCGIATMPSYPVK 343
>gi|57118005|gb|AAW34134.1| cysteine protease gp2a [Zingiber officinale]
Length = 381
Score = 313 bits (802), Expect = 8e-83, Method: Compositional matrix adjust.
Identities = 153/322 (47%), Positives = 213/322 (66%), Gaps = 14/322 (4%)
Query: 38 LTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT----SYWLGL 93
+ S +++ L+ W K+ K ++ +R E+FKENL+ +D+ N ++ LG+
Sbjct: 43 VRSDEEVRMLYLEWRVKNHPAEKYLDLNEYRLEVFKENLQFVDEHNAAADRGEHTFLLGM 102
Query: 94 NEFADMSHEEFKNKYLGLKPQFPTRRQP-----SAEFSYRDVKALPKSVDWRKKGAVTPV 148
N FAD+++EE++ ++L F R+ S+ + R+ LP S+DWR+ GAV PV
Sbjct: 103 NRFADLTNEEYRTRFL---RDFSRLRRSASGKISSRYRLREGDDLPDSIDWRENGAVVPV 159
Query: 149 KNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYI 208
KNQG CGSCWAFSTVAAVEGINQIV+G+L SLSEQ+L+DC T+ N+GC GG M+ AF++I
Sbjct: 160 KNQGGCGSCWAFSTVAAVEGINQIVTGDLISLSEQQLVDCTTA-NHGCRGGWMNPAFQFI 218
Query: 209 VASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAI 268
V +GG++ EE YPY + G C + VV+I Y++VP ++EQSL KA+A+QPVSV +
Sbjct: 219 VNNGGINSEETYPYRGQNGIC-NSTVNAPVVSIDSYENVPSHNEQSLQKAVANQPVSVTM 277
Query: 269 EASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMK 328
+A+G DFQ Y G+FTG C +H + VGYG D+ IVKNSWG WGE GYIR +
Sbjct: 278 DAAGRDFQLYRSGIFTGSCNISANHALTVVGYGTENDKDFWIVKNSWGKNWGESGYIRAE 337
Query: 329 RNTGKPEGLCGINKMASIPLKK 350
RN P G CGI + AS P+KK
Sbjct: 338 RNIENPNGKCGITRFASYPVKK 359
>gi|307111936|gb|EFN60170.1| hypothetical protein CHLNCDRAFT_59551 [Chlorella variabilis]
Length = 364
Score = 313 bits (802), Expect = 8e-83, Method: Compositional matrix adjust.
Identities = 169/356 (47%), Positives = 215/356 (60%), Gaps = 21/356 (5%)
Query: 12 LSLSLSLFACSSLAHDFSIVGYSPE-HLTSMDKLIE----LFESWM----SKHGKTYKCI 62
+ LS+ L ACS LA G+ E H + + IE F+ W+ + Y
Sbjct: 8 MRLSVLLVACSCLA---VAAGFRFENHRLFIQQAIESPREAFDFWVHTVKPPSNRAYASS 64
Query: 63 EEKL-HRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQ- 120
E RF I+ +NL+ + N TS+WL + +AD+S +E+++K LG +R
Sbjct: 65 AEVYERRFNIWLDNLRFAHEYNARHTSHWLSMGVYADLSQDEYRSKALGYNAHLHKKRPL 124
Query: 121 PSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSL 180
+A F Y+ P+ VDW GAVTPVK+Q CGSCWAFST AVEG N I +G L SL
Sbjct: 125 RAAPFLYKGT-VPPEEVDWVAGGAVTPVKDQLLCGSCWAFSTTGAVEGANAIATGKLVSL 183
Query: 181 SEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVT 240
SEQ L+DCD ++ GC GG MD AF +IV +GG+ E+DYPY E+G C+D + VVT
Sbjct: 184 SEQMLVDCDREYDTGCRGGFMDSAFDFIVNNGGIDTEDDYPYRAEDGICQDNRTRRHVVT 243
Query: 241 ISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGY 300
I GYQDVP NDE +L+KA+AHQPVSVAIEA FQ Y GGVF CG LDH V VGY
Sbjct: 244 IDGYQDVPPNDENALMKAVAHQPVSVAIEADQLAFQLYGGGVFDAECGTALDHAVLVVGY 303
Query: 301 GKSKGSD----YIIVKNSWGPKWGERGYIRMKRNTGK--PEGLCGINKMASIPLKK 350
G + Y +VKNSWG +WGE+GYIR+ RN GK PEG CG+ AS P+KK
Sbjct: 304 GTASNGTHNLPYWLVKNSWGAEWGEKGYIRLLRNLGKDAPEGQCGLAMYASFPIKK 359
>gi|190358935|sp|P00785.4|ACTN_ACTCH RecName: Full=Actinidain; Short=Actinidin; AltName: Allergen=Act c
1; Flags: Precursor
gi|12744965|gb|AAK06862.1|AF343446_1 actinidin protease [Actinidia chinensis]
Length = 380
Score = 313 bits (801), Expect = 9e-83, Method: Compositional matrix adjust.
Identities = 161/353 (45%), Positives = 223/353 (63%), Gaps = 23/353 (6%)
Query: 1 MAFFSHSKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLT--SMDKLIELFESWMSKHGKT 58
M+ S LL+LSL+ ++ ++LT + D++ ++ESW+ K+GK+
Sbjct: 10 MSLLFFSTLLILSLA-----------------FNAKNLTQRTNDEVKAMYESWLIKYGKS 52
Query: 59 YKCIEEKLHRFEIFKENLKHIDQRNKEVT-SYWLGLNEFADMSHEEFKNKYLGLKPQFPT 117
Y + E RFEIFKE L+ ID+ N + SY +GLN+FAD++ EEF++ YL
Sbjct: 53 YNSLGEWERRFEIFKETLRFIDEHNADTNRSYKVGLNQFADLTDEEFRSTYLRFTSG-SN 111
Query: 118 RRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNL 177
+ + S + R + LP VDWR GAV +K+QG CG CWAFS +A VEGIN+IV+G L
Sbjct: 112 KTKVSNRYEPRVGQVLPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVL 171
Query: 178 TSLSEQELIDCDTSFNN-GCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEM 236
SLSEQELIDC + N GCNGG + F++I+ +GG++ EE+YPY ++G C +
Sbjct: 172 ISLSEQELIDCGRTQNTRGCNGGYITDGFQFIINNGGINTEENYPYTAQDGECNVDLQNE 231
Query: 237 EVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVA 296
+ VTI Y++VP N+E +L A+ +QPVSVA++A+G F+ YS G+FTGPCG +DH V
Sbjct: 232 KYVTIDTYENVPYNNEWALQTAVTYQPVSVALDAAGDAFKQYSSGIFTGPCGTAVDHAVT 291
Query: 297 AVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
VGYG G DY IVKNSW WGE GY+R+ RN G G CGI M S P+K
Sbjct: 292 IVGYGTEGGIDYWIVKNSWDTTWGEEGYMRILRNVGGA-GTCGIATMPSYPVK 343
>gi|944916|gb|AAA74430.1| cysteine proteinase [Mesembryanthemum crystallinum]
Length = 367
Score = 313 bits (801), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 164/324 (50%), Positives = 214/324 (66%), Gaps = 10/324 (3%)
Query: 31 VGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYW 90
+ ++ + L S + L +L+E W S + + + EK +RF +FKEN+K+I++ NK Y
Sbjct: 27 IDFTDKDLESDETLWDLYERWRSVY-TSARSFGEKQNRFHVFKENVKYINEVNKMDKPYK 85
Query: 91 LGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKN 150
L LN+F D++ EF Y K TR + S F Y +V+ +P+S+DWR KGAVTPVKN
Sbjct: 86 LRLNQFGDLTPSEFARTYANSKIIEGTRNE-SGGFMYENVE-VPRSIDWRVKGAVTPVKN 143
Query: 151 QGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVA 210
QG CG CWAFS AAVEGINQI +G L SLSEQ+LIDCDT N+GC GG M AF+YI
Sbjct: 144 QGRCGGCWAFSAAAAVEGINQITTGQLISLSEQQLIDCDTQ-NSGCRGGTMGRAFEYIKQ 202
Query: 211 SGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEA 270
GG+ E +YPY + G C++ + V+I GY ++ E ++LK LAHQPVSVA++A
Sbjct: 203 RGGITSEANYPYKAQAGMCKNNLIQRPTVSIDGYYNI-RRSEDAVLKILAHQPVSVAVDA 261
Query: 271 ---SGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSK-GSDYIIVKNSWGPKWGERGYIR 326
S D+ FY GVFTGPCG +L+HGV AVGYG + G DY I+KNSWG WGERGY+R
Sbjct: 262 TTWSSLDWMFYFQGVFTGPCGTKLNHGVTAVGYGTTNDGYDYWIIKNSWGETWGERGYMR 321
Query: 327 MKRNTGKPEGLCGINKMASIPLKK 350
M R P GLCGI AS P+K+
Sbjct: 322 MLRGV-SPYGLCGIAMQASFPIKR 344
>gi|116309178|emb|CAH66275.1| OSIGBa0147O06.5 [Oryza sativa Indica Group]
Length = 339
Score = 313 bits (801), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 157/350 (44%), Positives = 221/350 (63%), Gaps = 19/350 (5%)
Query: 4 FSHSKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIE 63
+ +K LL ++ L CS++ + L+ + E WM+++G+ YK
Sbjct: 1 MAMAKALLFAILGCLCLCSAV--------LAARELSDDAAMAARHERWMAQYGRMYKDDA 52
Query: 64 EKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYL--GLKPQFPTRRQP 121
EK RFE+FK N+ I+ N +WLG+N+FAD++++EF++ G P T R P
Sbjct: 53 EKARRFEVFKANVAFIESFNAGNHKFWLGVNQFADLTNDEFRSTKTNKGFIPS--TTRVP 110
Query: 122 SAEFSYRDVK--ALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTS 179
+ F Y +V ALP ++DWR KG VTP+K+QG CG CWAFS VAA+EGI ++ +G L S
Sbjct: 111 TG-FRYENVNIDALPATMDWRTKGVVTPIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLIS 169
Query: 180 LSEQELIDCDT-SFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEV 238
LSEQEL+DCD + GC GGLMD AFK+I+ +GGL E +YPY + C K V
Sbjct: 170 LSEQELVDCDVHGEDQGCEGGLMDDAFKFIIKNGGLTTESNYPYAAADDKC--KSVSNSV 227
Query: 239 VTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAV 298
+I GY+DVP N+E +L+KA+A+QPVSVA++ FQFY GGV TG CG +LDHG+ A+
Sbjct: 228 ASIKGYEDVPANNEAALMKAVANQPVSVAVDGGDMTFQFYKGGVMTGSCGTDLDHGIVAI 287
Query: 299 GYGK-SKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
GYGK S G+ Y ++KNSWG WGE G++RM+++ G+CG+ S P
Sbjct: 288 GYGKASDGTKYWLLKNSWGTTWGENGFLRMEKDISDKRGMCGLAMEPSYP 337
>gi|413951606|gb|AFW84255.1| hypothetical protein ZEAMMB73_933931 [Zea mays]
Length = 379
Score = 312 bits (799), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 177/354 (50%), Positives = 220/354 (62%), Gaps = 20/354 (5%)
Query: 9 LLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHR 68
LLL++L +F S+ + + L S + L +L+E W + H + ++ EK R
Sbjct: 8 LLLVAL---VFVSSAAVELCRAIDFDERDLASDEALWDLYERWQTHH-RVHRHHGEKGRR 63
Query: 69 FEIFKENLKHIDQRNKEVTS-YWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQ--PSAE- 124
F FKEN++ I NK Y L LN F DM EEF++ + + RRQ P+A
Sbjct: 64 FGTFKENVRFIHAHNKRGDRPYRLRLNRFGDMGREEFRSTFADSRIN-DLRRQDSPAARA 122
Query: 125 -----FSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTS 179
F Y P+SVDWR++GAVT VK+QG CGSCWAFSTV AVEGIN I +G+L S
Sbjct: 123 GAVPGFMYDSAADPPRSVDWRQEGAVTGVKDQGHCGSCWAFSTVVAVEGINAIRTGSLAS 182
Query: 180 LSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCED---KKEEM 236
LSEQELIDCDT NGC GGLM+ AF++I + GG+ E YPY GTC+ ++
Sbjct: 183 LSEQELIDCDTD-ENGCQGGLMENAFEFIKSFGGITTEAAYPYRASNGTCDGDRARRGGG 241
Query: 237 EVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVA 296
VV I G+Q VP E +L KA+AHQPVSVA++A G FQFYS GVFTG CG +LDHGVA
Sbjct: 242 VVVVIDGHQMVPAGSEDALAKAVAHQPVSVAVDAGGQAFQFYSEGVFTGDCGTDLDHGVA 301
Query: 297 AVGYG-KSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
AVGYG G+ Y IVKNSWG WGE GYIRM+R G GLCGI AS P+K
Sbjct: 302 AVGYGVGDDGTPYWIVKNSWGTSWGEGGYIRMQRGAGN-GGLCGIAMEASFPIK 354
>gi|413951605|gb|AFW84254.1| hypothetical protein ZEAMMB73_933931 [Zea mays]
Length = 423
Score = 312 bits (799), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 177/354 (50%), Positives = 220/354 (62%), Gaps = 20/354 (5%)
Query: 9 LLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHR 68
LLL++L +F S+ + + L S + L +L+E W + H + ++ EK R
Sbjct: 52 LLLVAL---VFVSSAAVELCRAIDFDERDLASDEALWDLYERWQTHH-RVHRHHGEKGRR 107
Query: 69 FEIFKENLKHIDQRNKEVTS-YWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQ--PSAE- 124
F FKEN++ I NK Y L LN F DM EEF++ + + RRQ P+A
Sbjct: 108 FGTFKENVRFIHAHNKRGDRPYRLRLNRFGDMGREEFRSTFADSRIN-DLRRQDSPAARA 166
Query: 125 -----FSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTS 179
F Y P+SVDWR++GAVT VK+QG CGSCWAFSTV AVEGIN I +G+L S
Sbjct: 167 GAVPGFMYDSAADPPRSVDWRQEGAVTGVKDQGHCGSCWAFSTVVAVEGINAIRTGSLAS 226
Query: 180 LSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCED---KKEEM 236
LSEQELIDCDT NGC GGLM+ AF++I + GG+ E YPY GTC+ ++
Sbjct: 227 LSEQELIDCDTD-ENGCQGGLMENAFEFIKSFGGITTEAAYPYRASNGTCDGDRARRGGG 285
Query: 237 EVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVA 296
VV I G+Q VP E +L KA+AHQPVSVA++A G FQFYS GVFTG CG +LDHGVA
Sbjct: 286 VVVVIDGHQMVPAGSEDALAKAVAHQPVSVAVDAGGQAFQFYSEGVFTGDCGTDLDHGVA 345
Query: 297 AVGYG-KSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
AVGYG G+ Y IVKNSWG WGE GYIRM+R G GLCGI AS P+K
Sbjct: 346 AVGYGVGDDGTPYWIVKNSWGTSWGEGGYIRMQRGAGN-GGLCGIAMEASFPIK 398
>gi|357154164|ref|XP_003576692.1| PREDICTED: vignain-like [Brachypodium distachyon]
Length = 427
Score = 312 bits (799), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 161/309 (52%), Positives = 201/309 (65%), Gaps = 9/309 (2%)
Query: 48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNK 107
FE WM KHG+ Y EK RFE++KENL I++ N Y L N+FAD+++EEF+ K
Sbjct: 119 FEQWMGKHGRAYANGGEKQRRFEVYKENLALIEEFNSGGHGYTLTDNKFADLTNEEFRAK 178
Query: 108 YLGLKPQFP------TRRQPSAEFSYRDVKA-LPKSVDWRKKGAVTPVKNQGSCGSCWAF 160
LG P + E D LPK VDWRKKGAV VKNQGSCGSCWAF
Sbjct: 179 MLGGLGADPDRRRRARHASNALELPGNDNSTDLPKDVDWRKKGAVVEVKNQGSCGSCWAF 238
Query: 161 STVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDY 220
S VAA+EG+NQI +G L SLSEQEL+DCD GC GG M +AF++++A+ GL E Y
Sbjct: 239 SAVAAMEGLNQIKNGKLVSLSEQELVDCDAE-AVGCAGGFMSWAFEFVMANHGLTTEASY 297
Query: 221 PYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSG 280
PY G C+ K V+I+GY +V N E LLK A QPVSVA++A G FQ Y+G
Sbjct: 298 PYKGINGACQTAKLNESSVSITGYVNVTVNSEAELLKVAAVQPVSVAVDAGGFLFQLYAG 357
Query: 281 GVFTGPCGAELDHGVAAVGYGKS-KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCG 339
GVF+GPC A+++HGV VGYG++ K Y IVKNSWGP+WGE GY+ M+R+ G P GLCG
Sbjct: 358 GVFSGPCTAQINHGVTVVGYGETDKAEKYWIVKNSWGPEWGEAGYMLMQRDAGVPTGLCG 417
Query: 340 INKMASIPL 348
I +AS P+
Sbjct: 418 IAMLASYPV 426
>gi|356515052|ref|XP_003526215.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 339
Score = 311 bits (798), Expect = 3e-82, Method: Compositional matrix adjust.
Identities = 161/350 (46%), Positives = 218/350 (62%), Gaps = 16/350 (4%)
Query: 1 MAFFSHSKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYK 60
M FS + L+L L L+++ ++ S V S H E WM+++GK Y
Sbjct: 1 MRSFSQNHYLILFLILTVWTFHVMSRRLSEVCTSERH-----------EKWMAQYGKLYT 49
Query: 61 CIEEKLHRFEIFKENLKHIDQRNKEVTS-YWLGLNEFADMSHEEFKNKYLGL-KPQFPTR 118
EK RF+IFK N++ I+ N + L +N+FAD+ +EEFK + + K +
Sbjct: 50 DAAEKEKRFQIFKNNVQFIESFNAAGDKPFNLSINQFADLHNEEFKASLINVQKKESGVE 109
Query: 119 RQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLT 178
F Y + +P ++DWRK+GAVTP+K+QG+CGSCWAFSTVAA+EGI+QI +G L
Sbjct: 110 TATETSFRYESITKIPVTMDWRKRGAVTPIKDQGNCGSCWAFSTVAAIEGIHQITTGKLV 169
Query: 179 SLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEV 238
SLSEQEL+DC + GCN G + AF+++ +GGL E YPY TC KKE V
Sbjct: 170 SLSEQELVDCVKGKSEGCNFGYKEEAFEFVAKNGGLASEISYPYKANNKTCMVKKETQGV 229
Query: 239 VTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAV 298
I GY++VP N E++LLKA+A+QPVSV I+A QFYS G+FTG CG +H V +
Sbjct: 230 AQIKGYENVPSNSEKALLKAVANQPVSVYIDAGA--LQFYSSGIFTGKCGTAPNHAVTVI 287
Query: 299 GYGKSK-GSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
GYGK++ G+ Y +VKNSWG KWGE+GYI+MKR+ EGLCGI AS P
Sbjct: 288 GYGKARGGAKYWLVKNSWGTKWGEKGYIKMKRDIRAKEGLCGIATNASYP 337
>gi|413917937|gb|AFW57869.1| hypothetical protein ZEAMMB73_830006 [Zea mays]
Length = 443
Score = 311 bits (797), Expect = 3e-82, Method: Compositional matrix adjust.
Identities = 157/347 (45%), Positives = 212/347 (61%), Gaps = 23/347 (6%)
Query: 5 SHSKLLLLSLSLSLFACS---SLAHDFSIVGYSPEHLTSMDK-LIELFESWMSKHGKTYK 60
+H + LS+ +AC+ SLA L D+ ++ E WM+K+ + Y
Sbjct: 3 THYSSAFVLLSVVAWACALSGSLA---------ARDLADQDQAMVARHEEWMAKYDRVYS 53
Query: 61 CIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPT--- 117
EK RFE+FK N+ I+ N +WL N FAD++ +EF+ + G +P+
Sbjct: 54 DAAEKARRFEVFKANMALIESVNAGNHKFWLEANRFADLTDDEFRATWTGYRPKTAAASS 113
Query: 118 ---RRQPSAEFSYRDVKA--LPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQI 172
R + F Y +V +P SVDWR KGAVTP+KNQG CG CWAFS VA++EG+ ++
Sbjct: 114 KGRSRTATTGFKYANVSLDDVPASVDWRTKGAVTPIKNQGECGCCWAFSAVASMEGVVKL 173
Query: 173 VSGNLTSLSEQELIDCDTS-FNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCED 231
+G L SLSEQEL+DCD + + GC GG MD AF +IV +GGL E YPY +GTC
Sbjct: 174 STGKLVSLSEQELVDCDVNGMDQGCEGGEMDDAFDFIVGNGGLTTESRYPYTASDGTCNS 233
Query: 232 KKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAEL 291
+ + +I GY+DVP NDE SL KA+A+QPVSVA++ + F+FY GGV +G CG EL
Sbjct: 234 NEASGDAASIKGYEDVPANDEASLRKAVANQPVSVAVDGGDSHFRFYKGGVLSGACGTEL 293
Query: 292 DHGVAAVGYG-KSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGL 337
DHG+AAVGYG S G+ Y ++KNSWG WGE GYIRM+R+ E L
Sbjct: 294 DHGIAAVGYGVASDGTKYWVMKNSWGTSWGEAGYIRMERDIADEEVL 340
>gi|38346003|emb|CAD40112.2| OSJNBa0035O13.5 [Oryza sativa Japonica Group]
gi|125589427|gb|EAZ29777.1| hypothetical protein OsJ_13835 [Oryza sativa Japonica Group]
Length = 339
Score = 311 bits (797), Expect = 3e-82, Method: Compositional matrix adjust.
Identities = 155/344 (45%), Positives = 212/344 (61%), Gaps = 15/344 (4%)
Query: 8 KLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLH 67
K LL ++ L CS++ + + ++ E WM ++G+ YK EK
Sbjct: 5 KALLFAILSCLCLCSAV--------LAAREQSDHAAMVARHERWMEQYGRVYKDATEKAR 56
Query: 68 RFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSY 127
RFEIFK N+ I+ N +WLG+N+FAD+++ EF+ K P+ + F Y
Sbjct: 57 RFEIFKANVAFIESFNAGNHKFWLGVNQFADLTNYEFRATKTN-KGFIPSTVRVPTTFRY 115
Query: 128 RDVK--ALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQEL 185
+V LP +VDWR KGAVTP+K+QG CG CWAFS VAA+EGI ++ +G L SLSEQEL
Sbjct: 116 ENVSIDTLPATVDWRTKGAVTPIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLISLSEQEL 175
Query: 186 IDCDT-SFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGY 244
+DCD + GC GGLMD AFK+I+ +GGL E YPY +G C TI GY
Sbjct: 176 VDCDVHGEDQGCEGGLMDDAFKFIIKNGGLTTESKYPYTAADGKCNGGSNS--AATIKGY 233
Query: 245 QDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKS- 303
+DVP N+E +L+KA+A+QPVSVA++ FQFYSGGV TG CG +LDHG+ A+GYGK
Sbjct: 234 EDVPANNEAALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIVAIGYGKDG 293
Query: 304 KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
G+ Y ++KNSWG WGE G++RM+++ G+CG+ S P
Sbjct: 294 DGTQYWLLKNSWGTTWGENGFLRMEKDISDKRGMCGLAMEPSYP 337
>gi|125551397|gb|EAY97106.1| hypothetical protein OsI_19029 [Oryza sativa Indica Group]
Length = 350
Score = 311 bits (796), Expect = 4e-82, Method: Compositional matrix adjust.
Identities = 163/348 (46%), Positives = 213/348 (61%), Gaps = 11/348 (3%)
Query: 7 SKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKL 66
SK LLL++ L C + +IV + E L + E WM++HG+ YK EK
Sbjct: 5 SKPLLLAI-LCCIVCLYSSSGGAIVAAARE-LGGDAAMAARHERWMAQHGRVYKDAAEKA 62
Query: 67 HRFEIFKENLKHIDQRNKE-VTSYWLGLNEFADMSHEEFKNKYLGLK----PQFPTRRQP 121
R E+FK N+ I+ N YWLG+N+FAD++ EEFK K P R
Sbjct: 63 RRLEVFKANVAFIESFNAGGKNRYWLGVNQFADLTSEEFKATMTNSKGFSTPNNGVRVST 122
Query: 122 SAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLS 181
++ ALP SVDWR KGAVT +K+QG CG CWAFS VAA+EGI ++ +G L SLS
Sbjct: 123 GFKYENVSADALPASVDWRTKGAVTRIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLISLS 182
Query: 182 EQELIDCDTSFNN-GCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVT 240
EQEL+DCD N+ GC GG +D AF++I+++GGL E +YPY E+G C+ +
Sbjct: 183 EQELVDCDVDGNDQGCEGGEIDGAFQFILSNGGLTAEANYPYTAEDGRCKTTAAADVAAS 242
Query: 241 ISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGY 300
I GY+DVP NDE SL+KA+A QPVSVA++AS FQFY GGV G CG LDHGV +GY
Sbjct: 243 IRGYEDVPANDEPSLMKAVAGQPVSVAVDAS--KFQFYGGGVMAGECGTSLDHGVTVIGY 300
Query: 301 G-KSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
G S G+ Y +VKNSWG WGE GY+RM+++ G+CG+ S P
Sbjct: 301 GAASDGTKYWLVKNSWGTTWGEAGYLRMEKDIDDKRGMCGLAMQPSYP 348
>gi|357160572|ref|XP_003578808.1| PREDICTED: vignain-like [Brachypodium distachyon]
Length = 339
Score = 311 bits (796), Expect = 4e-82, Method: Compositional matrix adjust.
Identities = 162/348 (46%), Positives = 221/348 (63%), Gaps = 31/348 (8%)
Query: 9 LLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHR 68
L L L S+ A L D S+V H E+WM ++G+ YK EK +
Sbjct: 12 LGCLCLCGSVLAARELNDDLSMV---ARH-----------ENWMLQYGRVYKDAAEKAQK 57
Query: 69 FEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFK----NK-YLGLKPQFPTRRQPSA 123
FE+FK N + I+ N +WLG+N+FAD+++EEFK NK ++ K + PT
Sbjct: 58 FEVFKANAEFINSFNAGNHKFWLGINQFADITNEEFKATKTNKGFISNKVRVPTG----- 112
Query: 124 EFSYRDVK--ALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLS 181
F Y ++ ALP ++DWR KGAVTP+K+QG CG CWAFS VAA+EGI ++ +G L SLS
Sbjct: 113 -FMYENMSFDALPATIDWRTKGAVTPIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLVSLS 171
Query: 182 EQELIDCDT-SFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVT 240
EQEL+DCD + GC GGLMD AFK+I+ +GGL +E +YPY +G C K T
Sbjct: 172 EQELVDCDVHGEDQGCEGGLMDDAFKFIIKNGGLTQESNYPYDAADGKC--KSGSSSAAT 229
Query: 241 ISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGY 300
I Y+DVP N+E +L+KA+A+QPVSVA++ FQFYSGGV TG CG +LDHG+AA+GY
Sbjct: 230 IKSYEDVPANNEGALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGY 289
Query: 301 G-KSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
G S G+ + I+KNSWG WGE G++RM+++ +G+CG+ S P
Sbjct: 290 GTTSDGTKFWIMKNSWGTSWGENGFLRMEKDIADKKGMCGLAMEPSYP 337
>gi|297830594|ref|XP_002883179.1| hypothetical protein ARALYDRAFT_318695 [Arabidopsis lyrata subsp.
lyrata]
gi|297329019|gb|EFH59438.1| hypothetical protein ARALYDRAFT_318695 [Arabidopsis lyrata subsp.
lyrata]
Length = 308
Score = 310 bits (795), Expect = 5e-82, Method: Compositional matrix adjust.
Identities = 155/307 (50%), Positives = 201/307 (65%), Gaps = 16/307 (5%)
Query: 47 LFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNK-EVTSYWLGLNEFADMSHEEFK 105
++E W+ ++ K Y + EK R +IFKENLK ID+ N ++ +GL FAD++++E
Sbjct: 1 MYERWLVENRKNYNGLGEKERRCKIFKENLKFIDEHNSLPNQTFEVGLTRFADLTNDE-- 58
Query: 106 NKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAA 165
P + + Y++ LP +DWR KGAV PVK+QG+CGSCWAFS V A
Sbjct: 59 ----------PKDFMKADRYLYKEGDILPDEIDWRAKGAVVPVKDQGNCGSCWAFSAVGA 108
Query: 166 VEGINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKEEDYPYLM 224
VEGINQI +G L SLS+QELIDCD F N GC GG+M+YAF++I+ +GG+ ++DYPY
Sbjct: 109 VEGINQIKTGELISLSDQELIDCDRGFVNAGCEGGVMNYAFEFIINNGGIESDQDYPYTA 168
Query: 225 EE-GTCE-DKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGV 282
+ G C DKK VV I GY+ V +NDE+SL KA+AHQPV VAIEAS F+ Y GV
Sbjct: 169 TDLGVCNADKKNNTRVVKIDGYEYVAQNDEKSLKKAVAHQPVGVAIEASSQAFKLYKSGV 228
Query: 283 FTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINK 342
FTG CG LDHGV VGYG S G DY I++NSWG WGE GY++++RN G CG+
Sbjct: 229 FTGTCGIYLDHGVVVVGYGTSSGEDYWIIRNSWGLNWGENGYVKLQRNIDDSFGKCGVAM 288
Query: 343 MASIPLK 349
M S P K
Sbjct: 289 MPSYPTK 295
>gi|226506492|ref|NP_001140873.1| uncharacterized protein LOC100272949 precursor [Zea mays]
gi|194701540|gb|ACF84854.1| unknown [Zea mays]
Length = 379
Score = 310 bits (794), Expect = 6e-82, Method: Compositional matrix adjust.
Identities = 177/354 (50%), Positives = 219/354 (61%), Gaps = 20/354 (5%)
Query: 9 LLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHR 68
LLL++L +F S+ + + L S + L +L+E W + H + ++ EK R
Sbjct: 8 LLLVAL---VFVSSAAVELCRAIDFDERDLASDEALWDLYERWQTHH-RVHRHHGEKGRR 63
Query: 69 FEIFKENLKHIDQRNKEVTS-YWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQ--PSAE- 124
F FKEN++ I NK Y L LN F DM EEF++ + + RRQ P+A
Sbjct: 64 FGTFKENVRFIHAHNKRGDRPYRLRLNRFGDMGREEFRSTFADSRIN-DLRRQDSPAARA 122
Query: 125 -----FSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTS 179
F Y P+SVDWR++GAVT VK QG CGSCWAFSTV AVEGIN I +G+L S
Sbjct: 123 GAVPGFMYDSAADPPRSVDWRQEGAVTGVKVQGHCGSCWAFSTVVAVEGINAIRTGSLAS 182
Query: 180 LSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCED---KKEEM 236
LSEQELIDCDT NGC GGLM+ AF++I + GG+ E YPY GTC+ ++
Sbjct: 183 LSEQELIDCDTD-ENGCQGGLMENAFEFIKSFGGITTEAAYPYRASNGTCDGDRARRGGG 241
Query: 237 EVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVA 296
VV I G+Q VP E +L KA+AHQPVSVA++A G FQFYS GVFTG CG +LDHGVA
Sbjct: 242 VVVVIDGHQMVPAGSEDALAKAVAHQPVSVAVDAGGQAFQFYSEGVFTGDCGTDLDHGVA 301
Query: 297 AVGYG-KSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
AVGYG G+ Y IVKNSWG WGE GYIRM+R G GLCGI AS P+K
Sbjct: 302 AVGYGVGDDGTPYWIVKNSWGTSWGEGGYIRMQRGAGN-GGLCGIAMEASFPIK 354
>gi|356515046|ref|XP_003526212.1| PREDICTED: thiol protease SEN102-like [Glycine max]
Length = 342
Score = 310 bits (794), Expect = 7e-82, Method: Compositional matrix adjust.
Identities = 168/351 (47%), Positives = 222/351 (63%), Gaps = 15/351 (4%)
Query: 1 MAFFSHSKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYK 60
MAF + +L +L LF LA S V H T+ L E E+WM+++GK YK
Sbjct: 1 MAFTGQKQHML---ALFLF----LAVGISQVMPRKLHQTA---LRERHENWMAEYGKMYK 50
Query: 61 CIEEKLHRFEIFKENLKHIDQRNKEVTS-YWLGLNEFADMSHEEFKNKYLGLKP--QFPT 117
EK RF+IFK+N++ I+ N Y LG+N AD++ EEFK+ GLK +F T
Sbjct: 51 DAAEKEKRFQIFKDNVEFIESFNAAGNKPYKLGVNHLADLTLEEFKDSRNGLKRTYEFST 110
Query: 118 RRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQG-SCGSCWAFSTVAAVEGINQIVSGN 176
F Y +V +P+++DWR KGAVTP+K+QG CG WAFST+AA EGI+QI +GN
Sbjct: 111 TTFKLNGFKYENVTDIPEAIDWRVKGAVTPIKDQGDQCGRFWAFSTIAATEGIHQISTGN 170
Query: 177 LTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEM 236
L SLSEQEL+DCD S ++GC GG M+ F++I+ +GG+ E +YPY +GTC
Sbjct: 171 LVSLSEQELVDCD-SVDDGCEGGFMEDGFEFIIKNGGITSETNYPYKGVDGTCNTTIAAS 229
Query: 237 EVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVA 296
V I GY+ VP E++L KA+A+QPVSV+I A+ F FYS G++ G CG +LDHGV
Sbjct: 230 PVAQIKGYEIVPSYSEEALKKAVANQPVSVSIHATNATFMFYSSGIYNGECGTDLDHGVT 289
Query: 297 AVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
AVGYG G+DY IVKNSWG +WGE+GYIRM R G+CGI +S P
Sbjct: 290 AVGYGTENGTDYWIVKNSWGTQWGEKGYIRMHRGIAAKHGICGIALDSSYP 340
>gi|219687002|dbj|BAH08632.1| daikon cysteine protease RD21 [Raphanus sativus]
Length = 289
Score = 310 bits (793), Expect = 8e-82, Method: Compositional matrix adjust.
Identities = 143/219 (65%), Positives = 172/219 (78%)
Query: 132 ALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTS 191
A+P+SVDWRK+GAV VK+QGSCGSCWAFST+ AVEGIN+IV+G+L SLSEQEL+DCDTS
Sbjct: 2 AIPESVDWRKEGAVAAVKDQGSCGSCWAFSTIGAVEGINKIVTGDLISLSEQELVDCDTS 61
Query: 192 FNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPEND 251
+N GCNGGLMDYAF++I+ +GG+ EEDYPY +G C+ ++ +VVTI Y+DVPEN+
Sbjct: 62 YNQGCNGGLMDYAFEFIIKNGGIDTEEDYPYKAADGRCDQNRKNAKVVTIDAYEDVPENN 121
Query: 252 EQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIV 311
E +L KALA+QP+SVAIEA G FQ YS GVF G CG ELDHGV AVGYG G DY IV
Sbjct: 122 EAALKKALANQPISVAIEAGGRAFQLYSSGVFDGTCGTELDHGVVAVGYGTENGKDYWIV 181
Query: 312 KNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLKK 350
+NSWG WGE GYI+M RN + G CGI AS P+KK
Sbjct: 182 RNSWGGSWGESGYIKMARNIAEATGKCGIAMEASYPIKK 220
>gi|125547256|gb|EAY93078.1| hypothetical protein OsI_14879 [Oryza sativa Indica Group]
Length = 339
Score = 309 bits (792), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 154/344 (44%), Positives = 212/344 (61%), Gaps = 15/344 (4%)
Query: 8 KLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLH 67
K LL ++ L CS++ + + ++ E WM ++G+ YK EK
Sbjct: 5 KALLFAILSCLCLCSAV--------LAAREQSDHAAMVARHERWMEQYGRVYKDATEKAR 56
Query: 68 RFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSY 127
RFEIFK N+ I+ N +WLG+N+FAD+++ EF+ K P+ + F Y
Sbjct: 57 RFEIFKANVAFIESFNAGNHKFWLGVNQFADLTNYEFRATKTN-KGFIPSTVRVPTTFRY 115
Query: 128 RDVK--ALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQEL 185
+V LP +VDWR KGAVTP+K+QG CG CWAFS VAA+EGI ++ +G L SLSEQEL
Sbjct: 116 ENVSIDTLPATVDWRTKGAVTPIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLISLSEQEL 175
Query: 186 IDCDT-SFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGY 244
+DCD + GC GGLMD AFK+I+ +GGL E YPY +G C TI GY
Sbjct: 176 VDCDVHGEDQGCEGGLMDDAFKFIIKNGGLTTESKYPYTAADGKCNGGSN--SAATIKGY 233
Query: 245 QDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKS- 303
++VP N+E +L+KA+A+QPVSVA++ FQFYSGGV TG CG +LDHG+ A+GYGK
Sbjct: 234 EEVPANNEAALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIVAIGYGKDG 293
Query: 304 KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
G+ Y ++KNSWG WGE G++RM+++ G+CG+ S P
Sbjct: 294 DGTQYWLLKNSWGTTWGENGFLRMEKDISDKRGMCGLAMEPSYP 337
>gi|356515044|ref|XP_003526211.1| PREDICTED: LOW QUALITY PROTEIN: thiol protease SEN102-like [Glycine
max]
Length = 337
Score = 309 bits (792), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 164/349 (46%), Positives = 222/349 (63%), Gaps = 16/349 (4%)
Query: 1 MAFFSHSKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYK 60
MAF S + +L+LF S+ + S V H TS L E E+W++++G+ YK
Sbjct: 1 MAFTSK-----IQQNLALFLLLSI--EISQVMSRKLHETS---LREEHENWIARYGQVYK 50
Query: 61 CIEEKLHRFEIFKENLKHIDQRNKEVTS-YWLGLNEFADMSHEEFKNKYLGLKPQFPTRR 119
EK F+IFKEN++ I+ N Y LG+N FAD++ EEFK+ GLK
Sbjct: 51 VAAEK-ETFQIFKENVEFIESFNAAANKPYKLGVNLFADLTLEEFKDFRFGLKKTHEFSI 109
Query: 120 QPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTS 179
P F Y +V +P+++DWR+KGAVTP+K+QG CGSCWAFSTVAA EGI+QI +GNL S
Sbjct: 110 TP---FKYENVTDIPEALDWREKGAVTPIKDQGQCGSCWAFSTVAATEGIHQITTGNLVS 166
Query: 180 LSEQELIDCDT-SFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEV 238
L EQEL+ CDT + GC GG M+ F++I+ +GG+ + +YPY GTC V
Sbjct: 167 LXEQELVSCDTKGVDQGCEGGYMEDGFEFIIKNGGITTKANYPYKGVNGTCNTTIAASTV 226
Query: 239 VTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAV 298
I GY+ VP E++L KA+A+QPVSV+I+A+ F FY+GG++TG CG +LDHGV AV
Sbjct: 227 AQIKGYETVPSYSEEALQKAVANQPVSVSIDANNGHFMFYAGGIYTGECGTDLDHGVTAV 286
Query: 299 GYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
GYG + +DY IVKNSWG W E+G+IRM+R GLCG+ +S P
Sbjct: 287 GYGTTNETDYWIVKNSWGTGWDEKGFIRMQRGITVKHGLCGVALDSSYP 335
>gi|356515038|ref|XP_003526208.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 339
Score = 309 bits (792), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 160/350 (45%), Positives = 216/350 (61%), Gaps = 16/350 (4%)
Query: 1 MAFFSHSKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYK 60
M FS + L+L L L+++ ++ S V S H E WM+++GK Y
Sbjct: 1 MRSFSQNHYLILFLILTVWTFHVMSRRLSEVCTSERH-----------EKWMAQYGKLYT 49
Query: 61 CIEEKLHRFEIFKENLKHIDQRNKEVTS-YWLGLNEFADMSHEEFKNKYLGL-KPQFPTR 118
EK RF+IFK N++ I+ N + L +N+FAD+ +EEFK + + K +
Sbjct: 50 DAAEKEKRFQIFKNNVQFIESFNAAGDKPFNLSINQFADLHNEEFKASLINVQKKESGVE 109
Query: 119 RQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLT 178
F Y + +P ++DWRK+GAVTP+K+QG+CGSCWAFS VAA+EGI+QI +G L
Sbjct: 110 TATETSFRYESITKIPVTMDWRKRGAVTPIKDQGNCGSCWAFSIVAAIEGIHQITTGKLV 169
Query: 179 SLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEV 238
SLSEQEL+DC + GCN G + AF+++ +GGL E YPY TC KKE V
Sbjct: 170 SLSEQELVDCVKGKSEGCNFGYKEEAFEFVAKNGGLASEISYPYKANNKTCMVKKETQGV 229
Query: 239 VTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAV 298
I GY++VP N E++LLKA+A+QPVSV I+A QFYS G+FTG CG +H +
Sbjct: 230 AQIKGYENVPSNSEKALLKAVANQPVSVYIDAGA--LQFYSSGIFTGKCGTAPNHAATVI 287
Query: 299 GYGKSK-GSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
GYGK++ G+ Y +VKNSWG KWGE+GYIRMKR+ EGLCGI AS P
Sbjct: 288 GYGKARGGAKYWLVKNSWGTKWGEKGYIRMKRDIRAKEGLCGIATNASYP 337
>gi|297733654|emb|CBI14901.3| unnamed protein product [Vitis vinifera]
Length = 273
Score = 309 bits (791), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 151/256 (58%), Positives = 182/256 (71%), Gaps = 5/256 (1%)
Query: 99 MSHEEFKNKYLGLKPQ----FPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSC 154
M++ EF++ Y G K F + + F Y VK++P SVDWRKKGAVTP+K+QG C
Sbjct: 1 MTNHEFRSTYAGSKVNHHRMFRGSQHAAGSFMYEKVKSVPPSVDWRKKGAVTPIKDQGQC 60
Query: 155 GSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGL 214
GSCWAFSTV AVEGIN I + L SLSEQEL+DCDTS N GCNGGLM YAF++I GG+
Sbjct: 61 GSCWAFSTVVAVEGINHIKTNKLVSLSEQELVDCDTSENQGCNGGLMGYAFEFIKEKGGI 120
Query: 215 HKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTD 274
E+ YPY E+GTC+ K VV+I G++ VP N+E +LLKA A+QP+SVAI+A G+
Sbjct: 121 TTEQSYPYTAEDGTCDVSKVNSPVVSIDGHETVPPNNEDALLKAAANQPISVAIDAGGSA 180
Query: 275 FQFYSGGVFTGPCGAELDHGVAAVGYGKS-KGSDYIIVKNSWGPKWGERGYIRMKRNTGK 333
FQFYS GVF G CG +LDHGVA VGYG + G+ Y IVKNSWG WGE GYIRMKR
Sbjct: 181 FQFYSEGVFAGRCGTDLDHGVAIVGYGTTLDGTKYWIVKNSWGTDWGENGYIRMKRGISA 240
Query: 334 PEGLCGINKMASIPLK 349
EGLCGI AS P+K
Sbjct: 241 KEGLCGIAVEASYPIK 256
>gi|116309130|emb|CAH66233.1| H0825G02.10 [Oryza sativa Indica Group]
Length = 339
Score = 309 bits (791), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 154/344 (44%), Positives = 211/344 (61%), Gaps = 15/344 (4%)
Query: 8 KLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLH 67
K LL ++ L CS++ + + ++ E WM ++G+ YK EK
Sbjct: 5 KALLFAILSCLCLCSAV--------LAAREQSDHAAMVARHERWMEQYGRVYKDATEKAR 56
Query: 68 RFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSY 127
RFEIFK N+ I+ N +WL +N+FAD+++ EF+ K P+ + F Y
Sbjct: 57 RFEIFKANVAFIESFNAGNHKFWLSVNQFADLTNYEFRATKTN-KGFIPSTVRVPTTFRY 115
Query: 128 RDVK--ALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQEL 185
+V LP +VDWR KGAVTP+K+QG CG CWAFS VAA+EGI ++ +G L SLSEQEL
Sbjct: 116 ENVSIDTLPATVDWRTKGAVTPIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLISLSEQEL 175
Query: 186 IDCDT-SFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGY 244
+DCD + GC GGLMD AFK+I+ +GGL E YPY +G C TI GY
Sbjct: 176 VDCDVHGEDQGCEGGLMDDAFKFIIKNGGLTTESKYPYTAADGKCNGGSNS--AATIKGY 233
Query: 245 QDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKS- 303
+DVP N+E +L+KA+A+QPVSVA++ FQFYSGGV TG CG +LDHG+ A+GYGK
Sbjct: 234 EDVPANNEAALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIVAIGYGKDG 293
Query: 304 KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
G+ Y ++KNSWG WGE G++RM+++ G+CG+ S P
Sbjct: 294 DGTQYWLLKNSWGTTWGENGFLRMEKDISDKRGMCGLAMEPSYP 337
>gi|18390634|ref|NP_563764.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|8844131|gb|AAF80223.1|AC025290_12 Contains similarity to a cysteine endopeptidase 1 from Phaseolus
vulgaris gb|U52970 and is a member of the papain
cysteine protease family PF|00112 [Arabidopsis thaliana]
gi|332189848|gb|AEE27969.1| cysteine proteinase-like protein [Arabidopsis thaliana]
Length = 343
Score = 309 bits (791), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 170/353 (48%), Positives = 216/353 (61%), Gaps = 15/353 (4%)
Query: 1 MAFFSHSKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYK 60
+ +S L L L + S L S V Y P H T L + FE W+ H K Y
Sbjct: 2 LNVLRNSNLTLAVLICFVLIASKLCSVDSSV-YDP-HKT----LKQRFEKWLKTHSKLYG 55
Query: 61 CIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKP---QFPT 117
+E + RF I++ N++ ID N + L N FADM++ EFK +LGL +
Sbjct: 56 GRDEWMLRFGIYQSNVQLIDYINSLHLPFKLTDNRFADMTNSEFKAHFLGLNTSSLRLHK 115
Query: 118 RRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNL 177
+++P + +P +VDWR +GAVTP++NQG CG CWAFS VAA+EGIN+I +GNL
Sbjct: 116 KQRPVCD----PAGNVPDAVDWRTQGAVTPIRNQGKCGGCWAFSAVAAIEGINKIKTGNL 171
Query: 178 TSLSEQELIDCDT-SFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEM 236
SLSEQ+LIDCD ++N GC+GGLM+ AF++I +GGL E DYPY EGTC+ +K +
Sbjct: 172 VSLSEQQLIDCDVGTYNKGCSGGLMETAFEFIKTNGGLATETDYPYTGIEGTCDQEKSKN 231
Query: 237 EVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVA 296
+VVTI GYQ V +N E SL A A QPVSV I+A G FQ YS GVFT CG L+HGV
Sbjct: 232 KVVTIQGYQKVAQN-EASLQIAAAQQPVSVGIDAGGFIFQLYSSGVFTNYCGTNLNHGVT 290
Query: 297 AVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
VGYG Y IVKNSWG WGE GYIRM+R + G CGI MAS PL+
Sbjct: 291 VVGYGVEGDQKYWIVKNSWGTGWGEEGYIRMERGVSEDTGKCGIAMMASYPLQ 343
>gi|60100207|gb|AAX13273.1| putative cysteine protease [Oryza sativa Japonica Group]
Length = 349
Score = 309 bits (791), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 151/318 (47%), Positives = 196/318 (61%), Gaps = 8/318 (2%)
Query: 38 LTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTS--YWLGLNE 95
L + + E WM+KHG+ Y EK R E+F++N+ I+ N + +WL N+
Sbjct: 30 LVDAAAMAQRHERWMAKHGRAYADDAEKARRLEVFRDNVAFIESVNAAASQHKFWLEENQ 89
Query: 96 FADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKA--LPKSVDWRKKGAVTPVKNQGS 153
FAD+++ EF+ GL+P + F Y +V LP SVDWR KGAV PVK+QG
Sbjct: 90 FADLTNAEFRATRTGLRPSSSRGNRAPTSFRYANVSTGDLPASVDWRGKGAVNPVKDQGD 149
Query: 154 CGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIVASG 212
CG CWAFS VAA+EG ++ +G L SLSEQ+L+ CD + GC GGLMD AF +I+ +G
Sbjct: 150 CGCCWAFSAVAAMEGAVKLATGKLVSLSEQQLVSCDVKGEDQGCEGGLMDDAFDFIIKNG 209
Query: 213 GLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASG 272
GL E DYPY + C TI GY+DVP NDE +LLKA+A+QPVSVAI+
Sbjct: 210 GLAAESDYPYTASDDKCATAGAGAAAATIKGYEDVPANDEAALLKAVANQPVSVAIDGGD 269
Query: 273 TDFQFYSGGVFTGP--CGAELDHGVAAVGYG-KSKGSDYIIVKNSWGPKWGERGYIRMKR 329
FQFY GGV +G C ELDH + AVGYG S G+ Y ++KNSWG WGE GY+RM+R
Sbjct: 270 RHFQFYKGGVLSGAAGCATELDHAITAVGYGVASDGTKYWLMKNSWGTSWGEDGYVRMER 329
Query: 330 NTGKPEGLCGINKMASIP 347
EG+CG+ MAS P
Sbjct: 330 GVADKEGVCGLAMMASYP 347
>gi|297843430|ref|XP_002889596.1| hypothetical protein ARALYDRAFT_887827 [Arabidopsis lyrata subsp.
lyrata]
gi|297335438|gb|EFH65855.1| hypothetical protein ARALYDRAFT_887827 [Arabidopsis lyrata subsp.
lyrata]
Length = 343
Score = 309 bits (791), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 169/353 (47%), Positives = 218/353 (61%), Gaps = 15/353 (4%)
Query: 1 MAFFSHSKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYK 60
+ +S L L+ L + S L S V Y P H T L + FE W+ H K Y
Sbjct: 2 LNVLRNSNLTLVVLICFVLIASKLCSVNSSV-YDP-HKT----LKQRFEKWLKTHSKLYG 55
Query: 61 CIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKP---QFPT 117
+E + RF I++ N++ ID N + L N FADM++ EFK +LGL +
Sbjct: 56 GRDEWMLRFGIYQSNVQLIDYINSLHLPFKLTDNRFADMTNSEFKAHFLGLNTSSLRLHK 115
Query: 118 RRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNL 177
+++P + +P +VDWR +GAVTP++NQG CG CWAFS VAA+EGIN+I +GNL
Sbjct: 116 KQRPVCD----PAGNVPDAVDWRTQGAVTPIRNQGKCGGCWAFSAVAAIEGINKIKTGNL 171
Query: 178 TSLSEQELIDCDT-SFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEM 236
SLSEQ+LIDCD ++N GC+GGLM+ AF++I ++GGL E DYPY EGTC+ +K +
Sbjct: 172 VSLSEQQLIDCDVGTYNKGCSGGLMETAFEFIKSNGGLTTETDYPYTGIEGTCDQEKAKN 231
Query: 237 EVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVA 296
+VVTI GYQ V +N E SL A A QPVSV I+A G FQ YS GVFT CG L+HGV
Sbjct: 232 KVVTIQGYQKVAQN-EASLQIAAAQQPVSVGIDAGGFIFQLYSSGVFTSYCGTNLNHGVT 290
Query: 297 AVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
VGYG Y IVKNSWG WGE GYIRM+R + G CGI +AS PL+
Sbjct: 291 VVGYGVEGDQKYWIVKNSWGTGWGEEGYIRMERGISEDTGKCGIAMLASYPLQ 343
>gi|125547258|gb|EAY93080.1| hypothetical protein OsI_14881 [Oryza sativa Indica Group]
Length = 314
Score = 308 bits (790), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 150/312 (48%), Positives = 196/312 (62%), Gaps = 8/312 (2%)
Query: 44 LIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTS--YWLGLNEFADMSH 101
+ + E WM+KHG+ Y EK+ R E+F++N+ I+ N + +WL N+FAD+++
Sbjct: 1 MAQRHERWMAKHGRAYADDAEKVRRLEVFRDNVAFIESVNAAASQHKFWLEENQFADLTN 60
Query: 102 EEFKNKYLGLKPQFPTRRQPSAEFSYRDVKA--LPKSVDWRKKGAVTPVKNQGSCGSCWA 159
EF+ GL+P + F Y +V LP SVDWR KGAV PVK+QG CG CWA
Sbjct: 61 AEFRATRTGLRPSSSRGNRAPTSFRYANVSTGDLPASVDWRGKGAVNPVKDQGDCGCCWA 120
Query: 160 FSTVAAVEGINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKEE 218
FS VAA+EG ++ +G L SLSEQ+L+ CD + GC GGLMD AF +I+ +GGL E
Sbjct: 121 FSAVAAMEGAVKLATGKLVSLSEQQLVSCDVKGEDQGCEGGLMDDAFDFIIKNGGLAAES 180
Query: 219 DYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFY 278
DYPY + C TI GY+DVP NDE +LLKA+A+QPVSVAI+ FQFY
Sbjct: 181 DYPYTASDDKCATAGAGAAAATIKGYEDVPANDEAALLKAVANQPVSVAIDGGDRHFQFY 240
Query: 279 SGGVFTGP--CGAELDHGVAAVGYG-KSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPE 335
GGV +G C ELDH + AVGYG S G+ Y ++KNSWG WGE GY+RM+R E
Sbjct: 241 KGGVLSGAAGCATELDHAITAVGYGVASDGTKYWLMKNSWGTSWGEDGYVRMERGVADKE 300
Query: 336 GLCGINKMASIP 347
G+CG+ MAS P
Sbjct: 301 GVCGLAMMASYP 312
>gi|77554625|gb|ABA97421.1| Vignain precursor, putative [Oryza sativa Japonica Group]
gi|222630746|gb|EEE62878.1| hypothetical protein OsJ_17681 [Oryza sativa Japonica Group]
Length = 350
Score = 308 bits (789), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 162/348 (46%), Positives = 212/348 (60%), Gaps = 11/348 (3%)
Query: 7 SKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKL 66
SK LLL++ L C + +IV + E L + E WM++HG+ YK EK
Sbjct: 5 SKPLLLAI-LCCIVCLYSSSGGAIVAAARE-LGGDAAMAARHERWMAQHGRVYKDAAEKA 62
Query: 67 HRFEIFKENLKHIDQRNKE-VTSYWLGLNEFADMSHEEFKNKYLGLK----PQFPTRRQP 121
R E+FK N+ I+ N YWLG+N+FAD++ EEFK K P R
Sbjct: 63 RRLEVFKANVAFIESFNAGGKNRYWLGVNQFADLTSEEFKATMTNSKGFSTPNNGVRVST 122
Query: 122 SAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLS 181
++ ALP SVDWR KGAVT +K+QG CG CWAFS VAA+EG ++ +G L SLS
Sbjct: 123 GFKYENVSADALPASVDWRTKGAVTRIKDQGQCGCCWAFSAVAAMEGFVKLSTGKLISLS 182
Query: 182 EQELIDCDTSFNN-GCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVT 240
EQEL+DCD N+ GC GG +D AF++I+++GGL E +YPY E+G C+ +
Sbjct: 183 EQELVDCDVDGNDQGCEGGEIDGAFQFILSNGGLTAEANYPYTAEDGRCKTTAAADVAAS 242
Query: 241 ISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGY 300
I GY+DVP NDE SL+KA+A QPVSVA++AS FQFY GGV G CG LDHGV +GY
Sbjct: 243 IRGYEDVPANDEPSLMKAVAGQPVSVAVDAS--KFQFYGGGVMAGECGTSLDHGVTVIGY 300
Query: 301 G-KSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
G S G+ Y +VKNSWG WGE GY+RM+++ G+CG+ S P
Sbjct: 301 GAASDGTKYWLVKNSWGTTWGEAGYLRMEKDIDDKRGMCGLAMQPSYP 348
>gi|38346007|emb|CAD40110.2| OSJNBa0035O13.9 [Oryza sativa Japonica Group]
gi|125589429|gb|EAZ29779.1| hypothetical protein OsJ_13837 [Oryza sativa Japonica Group]
Length = 314
Score = 308 bits (789), Expect = 3e-81, Method: Compositional matrix adjust.
Identities = 150/312 (48%), Positives = 195/312 (62%), Gaps = 8/312 (2%)
Query: 44 LIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTS--YWLGLNEFADMSH 101
+ + E WM+KHG+ Y EK R E+F++N+ I+ N + +WL N+FAD+++
Sbjct: 1 MAQRHERWMAKHGRAYADDAEKARRLEVFRDNVAFIESVNAAASQHKFWLEENQFADLTN 60
Query: 102 EEFKNKYLGLKPQFPTRRQPSAEFSYRDVKA--LPKSVDWRKKGAVTPVKNQGSCGSCWA 159
EF+ GL+P + F Y +V LP SVDWR KGAV PVK+QG CG CWA
Sbjct: 61 AEFRATRTGLRPSSSRGNRAPTSFRYANVSTGDLPASVDWRGKGAVNPVKDQGDCGCCWA 120
Query: 160 FSTVAAVEGINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKEE 218
FS VAA+EG ++ +G L SLSEQ+L+ CD + GC GGLMD AF +I+ +GGL E
Sbjct: 121 FSAVAAMEGAVKLATGKLVSLSEQQLVSCDVKGEDQGCEGGLMDDAFDFIIKNGGLAAES 180
Query: 219 DYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFY 278
DYPY + C TI GY+DVP NDE +LLKA+A+QPVSVAI+ FQFY
Sbjct: 181 DYPYTASDDKCATAGAGAAAATIKGYEDVPANDEAALLKAVANQPVSVAIDGGDRHFQFY 240
Query: 279 SGGVFTGP--CGAELDHGVAAVGYG-KSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPE 335
GGV +G C ELDH + AVGYG S G+ Y ++KNSWG WGE GY+RM+R E
Sbjct: 241 KGGVLSGAAGCATELDHAITAVGYGVASDGTKYWLMKNSWGTSWGEDGYVRMERGVADKE 300
Query: 336 GLCGINKMASIP 347
G+CG+ MAS P
Sbjct: 301 GVCGLAMMASYP 312
>gi|356543112|ref|XP_003540007.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 345
Score = 308 bits (788), Expect = 3e-81, Method: Compositional matrix adjust.
Identities = 154/308 (50%), Positives = 199/308 (64%), Gaps = 6/308 (1%)
Query: 46 ELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTS-YWLGLNEFADMSHEEF 104
E E+WM+++GK YK EK RF+IFK N+ I+ N + L +N+FAD+ EEF
Sbjct: 36 ERHENWMAQYGKVYKDAAEKKKRFQIFKNNVHFIESFNTAGDKPFNLSINQFADLHDEEF 95
Query: 105 K----NKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAF 160
K N ++ T + F Y V L ++DWRK+GAVTP+K+Q CGSCWAF
Sbjct: 96 KALLTNGNKKVRSVVGTATETETSFKYNRVTKLLATMDWRKRGAVTPIKDQRRCGSCWAF 155
Query: 161 STVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDY 220
S VAA+EGI+QI + L SLSEQEL+DC + GCNGG M+ AF+++ GG+ E Y
Sbjct: 156 SAVAAIEGIHQITTSKLVSLSEQELVDCVKGESEGCNGGYMEDAFEFVAKKGGIASESYY 215
Query: 221 PYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSG 280
PY ++ +C+ KKE V I GY+ VP N E++L KA+AHQPVSV +EA G FQFYS
Sbjct: 216 PYKGKDKSCKVKKETHGVSQIKGYEKVPSNSEKALQKAVAHQPVSVYVEAGGNAFQFYSS 275
Query: 281 GVFTGPCGAELDHGVAAVGYGKSK-GSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCG 339
G+FTG CG DH + VGYGKS+ G+ Y +VKNSWG WGE+GYIRMKR+ EGLCG
Sbjct: 276 GIFTGKCGTNTDHAITVVGYGKSRGGTKYWLVKNSWGAGWGEKGYIRMKRDIRAKEGLCG 335
Query: 340 INKMASIP 347
I A P
Sbjct: 336 IAMNAFYP 343
>gi|326430490|gb|EGD76060.1| cysteine proteinase [Salpingoeca sp. ATCC 50818]
Length = 448
Score = 308 bits (788), Expect = 3e-81, Method: Compositional matrix adjust.
Identities = 157/314 (50%), Positives = 207/314 (65%), Gaps = 25/314 (7%)
Query: 46 ELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKE----VTSYWLGLNEFADMSH 101
LF+++ +K K Y+ EE+ RF +F +N+ I++ N E V ++ + +N+FAD+++
Sbjct: 28 RLFDAFKTKFNKVYESAEEEARRFSVFSQNIDFINRHNAEAARGVHTHTVDVNQFADLTN 87
Query: 102 EEFKNKYLGLKPQFPTRRQPSAEFSYRDVKAL------PKSVDWRKKGAVTPVKNQGSCG 155
EE++ YL +P +PT E R+ + + SVDWR+KGAVTP+KNQG CG
Sbjct: 88 EEYRQLYL--RP-YPT------ELLGRERQEVWLDGPNAGSVDWRQKGAVTPIKNQGQCG 138
Query: 156 SCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIVASGGL 214
SCW+FST +VEG + I +GNL SLSEQ+L+DC SF N GCNGGLMD AFKYI+++GGL
Sbjct: 139 SCWSFSTTGSVEGAHAIATGNLVSLSEQQLVDCSGSFGNQGCNGGLMDNAFKYIISNGGL 198
Query: 215 HKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTD 274
E+DYPY +G C+ KE V+ISGY+DVP+N+E L A+ PVSVAIEA
Sbjct: 199 DTEQDYPYTARDGVCDKSKESKHAVSISGYKDVPQNNEDQLAAAVEKGPVSVAIEADQQS 258
Query: 275 FQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKP 334
FQ YS GVF+GPCG LDHGV VGY SDY IVKNSWG WG++GYI MKR
Sbjct: 259 FQMYSSGVFSGPCGTNLDHGVLVVGY----TSDYWIVKNSWGASWGDQGYIMMKRGVSS- 313
Query: 335 EGLCGINKMASIPL 348
G+CGI S P+
Sbjct: 314 AGICGIAMQPSYPI 327
>gi|357458909|ref|XP_003599735.1| Cysteine proteinase [Medicago truncatula]
gi|357474677|ref|XP_003607623.1| Cysteine proteinase [Medicago truncatula]
gi|355488783|gb|AES69986.1| Cysteine proteinase [Medicago truncatula]
gi|355508678|gb|AES89820.1| Cysteine proteinase [Medicago truncatula]
Length = 342
Score = 308 bits (788), Expect = 4e-81, Method: Compositional matrix adjust.
Identities = 155/305 (50%), Positives = 207/305 (67%), Gaps = 6/305 (1%)
Query: 49 ESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTS-YWLGLNEFADMSHEEFKNK 107
E WM++ GK+YK EK RF+IFK N++ I+ N + L +N FAD+++EEFK
Sbjct: 38 EKWMTQFGKSYKDAAEKEKRFQIFKNNVEFIELFNAVGNKPFNLSINHFADLTNEEFKAS 97
Query: 108 YLG---LKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVA 164
G L +F + ++ F Y +V ++P S+DWRK+GAVTP+KNQGSCGSCWAFSTVA
Sbjct: 98 LNGNKKLHDKFDILNETTS-FRYHNVTSVPASMDWRKRGAVTPIKNQGSCGSCWAFSTVA 156
Query: 165 AVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLM 224
++EGI+QI +G L SLSEQELIDC ++GC+GG ++ AFK+I GG+ E +YPY
Sbjct: 157 SIEGIHQITTGELVSLSEQELIDCVRGNSSGCSGGYLEDAFKFIAKKGGMASETNYPYKE 216
Query: 225 EEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFT 284
+ C+ KKE V I GY+ VP N E LLKA+A+QPVSV ++A FQFYSGG+FT
Sbjct: 217 TDEKCKFKKESKHVAEIKGYEKVPSNSENDLLKAVANQPVSVYVDAGDYVFQFYSGGIFT 276
Query: 285 GPCGAELDHGVAAVGYGKSKG-SDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKM 343
G CG + DH V VGYG S ++Y +VKNSWG WGE+GY+++KRN +GLCGI
Sbjct: 277 GKCGTDTDHVVTIVGYGVSLDYTEYWLVKNSWGTGWGEKGYMKLKRNVDSKKGLCGIATN 336
Query: 344 ASIPL 348
S P+
Sbjct: 337 PSYPV 341
>gi|162459488|ref|NP_001105571.1| maize insect resistance1 precursor [Zea mays]
gi|5731354|gb|AAB70820.2| cysteine protease Mir1 [Zea mays]
Length = 398
Score = 308 bits (788), Expect = 4e-81, Method: Compositional matrix adjust.
Identities = 155/328 (47%), Positives = 213/328 (64%), Gaps = 22/328 (6%)
Query: 42 DKLIELFESWMSKHGK----TYKCI----------EEKLHRFEIFKENLKHIDQRNKEVT 87
+++ ++E+W SKHG+ C E++ R E+F++NL++ID N E
Sbjct: 48 EEVRRMYEAWKSKHGRGGSSNDDCDMAPGDDEQEEEDRRLRLEVFRDNLRYIDAHNAEAD 107
Query: 88 ----SYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQP-SAEFSYRDVKALPKSVDWRKK 142
++ LGL FAD++ EE++ + LG + + + +S R LP ++DWR+
Sbjct: 108 AGLHTFRLGLTPFADLTLEEYRGRVLGFRARGRRSGARYGSGYSVRG-GDLPDAIDWRQL 166
Query: 143 GAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMD 202
GAVT VK+Q CG CWAFS VAA+EG+N I +GNL SLSEQE+IDCD ++GC+GG M+
Sbjct: 167 GAVTEVKDQQQCGGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ-DSGCDGGQME 225
Query: 203 YAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEME-VVTISGYQDVPENDEQSLLKALAH 261
AF++++ +GG+ E DYP++ +GTC+ KE+ E V TI G +V N+E +L +A+A
Sbjct: 226 NAFRFVIGNGGIDTEADYPFIGTDGTCDASKEKNEKVATIDGLVEVASNNETALQEAVAI 285
Query: 262 QPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGE 321
QPVSVAI+ASG FQ YS G+F GPCG LDHGV AVGYG G DY IVKNSW WGE
Sbjct: 286 QPVSVAIDASGRAFQHYSSGIFNGPCGTSLDHGVTAVGYGSESGKDYWIVKNSWSASWGE 345
Query: 322 RGYIRMKRNTGKPEGLCGINKMASIPLK 349
GYIRM+RN +P G CGI AS P+K
Sbjct: 346 AGYIRMRRNVPRPTGKCGIAMDASYPVK 373
>gi|310656790|gb|ADP02219.1| Peptidase_C1 domain-containing protein [Triticum aestivum]
Length = 419
Score = 307 bits (787), Expect = 4e-81, Method: Compositional matrix adjust.
Identities = 153/333 (45%), Positives = 208/333 (62%), Gaps = 13/333 (3%)
Query: 8 KLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLH 67
K LLL++ S+ CSS +G + ++E E WM+K + YK EK
Sbjct: 5 KALLLAIIGSICLCSSTVLSARELGDAA--------MVEKHEQWMAKFNRVYKDSTEKAQ 56
Query: 68 RFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSY 127
RF+ FK N+ I+ N +WLG+N+F D++++EF+ + R P+ F Y
Sbjct: 57 RFKAFKANVAFIESFNTGNHKFWLGVNQFTDLTNDEFRATKTNKGLKRNGARAPT-RFKY 115
Query: 128 RDVK--ALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQEL 185
+V ALP +VDWR KG VTP+K+QG CG CWAFS VAA EGI ++ +G L SLSEQEL
Sbjct: 116 NNVSTDALPAAVDWRTKGVVTPIKDQGQCGCCWAFSAVAATEGIVKLSTGKLVSLSEQEL 175
Query: 186 IDCDT-SFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGY 244
+DCD + GC GG MD AFK+I+ +GGL E +YPY ++G C+ V TI GY
Sbjct: 176 VDCDVHGVDQGCEGGEMDNAFKFIIKNGGLTTEANYPYTAQDGQCKTSTTSNSVATIKGY 235
Query: 245 QDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYG-KS 303
+DVP NDE SL+KA+A+QPVSVA++ FQ YSGGV TG CG +LDHG+ A+GYG S
Sbjct: 236 EDVPANDESSLMKAVANQPVSVAVDGGDVIFQHYSGGVMTGSCGTDLDHGIVAIGYGMTS 295
Query: 304 KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEG 336
G+ + ++KNSWG WGE GY+RM+++ G
Sbjct: 296 DGTKFWLLKNSWGTTWGESGYLRMEKDISDKSG 328
>gi|356545116|ref|XP_003540991.1| PREDICTED: vignain-like [Glycine max]
Length = 342
Score = 307 bits (787), Expect = 4e-81, Method: Compositional matrix adjust.
Identities = 164/353 (46%), Positives = 220/353 (62%), Gaps = 19/353 (5%)
Query: 1 MAFFSHSK-LLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTY 59
M FS K +L++ L L+++ ++ S S +H E WM+++GK Y
Sbjct: 1 MCSFSQKKNILVVFLVLTVWTSQVMSRRLSEAYSSVKH-----------EKWMAQYGKVY 49
Query: 60 KCIEEKLHRFEIFKENLKHIDQRNKEVTS-YWLGLNEFADMSHEEFKNKYL-GLKPQFPT 117
K EK RF+IFK N+ I+ + + L +N+FAD+ +FK + G K +
Sbjct: 50 KDAAEKEKRFQIFKNNVHFIESFHAAGDKPFNLSINQFADL--HKFKALLINGQKKEHNV 107
Query: 118 RRQPSAE--FSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSG 175
R + E F Y V +P S+DWRK+GAVTP+K+QG+C SCWAFSTVA +EG++QI G
Sbjct: 108 RTATATEASFKYDSVTRIPSSLDWRKRGAVTPIKDQGTCRSCWAFSTVATIEGLHQITKG 167
Query: 176 NLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEE 235
L SLSEQEL+DC + GC GG ++ AF++I GG+ E YPY TC+ KKE
Sbjct: 168 ELVSLSEQELVDCVKGDSEGCYGGYVEDAFEFIAKKGGVASETHYPYKGVNKTCKVKKET 227
Query: 236 MEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGV 295
VV I GY+ VP N E++LLKA+AHQPVS +EA G FQFYS G+FTG CG ++DH V
Sbjct: 228 HGVVQIKGYEQVPSNSEKALLKAVAHQPVSAYVEAGGYAFQFYSSGIFTGKCGTDIDHSV 287
Query: 296 AAVGYGKSKGSD-YIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
VGYGK++G + Y +VKNSWG +WGE+GYIRMKR+ EGLCGI A P
Sbjct: 288 TVVGYGKARGGNKYWLVKNSWGTEWGEKGYIRMKRDIRAKEGLCGIATGALYP 340
>gi|356543122|ref|XP_003540012.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 342
Score = 307 bits (786), Expect = 5e-81, Method: Compositional matrix adjust.
Identities = 158/308 (51%), Positives = 198/308 (64%), Gaps = 5/308 (1%)
Query: 44 LIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTS-YWLGLNEFADMSHE 102
+ E E WM K+GK YK E RF IF+ N++ I+ N Y L +N AD ++E
Sbjct: 34 MYERHEQWMEKYGKVYKDSAEXEKRFLIFENNVEFIESFNAAGNKPYKLSINHLADQTNE 93
Query: 103 EFKNKYLGLKPQF--PTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAF 160
EF + G K R F Y +V +P +VDWR+KG T +K+QG CG CWAF
Sbjct: 94 EFMASHKGYKGSHWQGLRITTQTPFKYENVTDIPWAVDWRQKGDATSIKDQGQCGICWAF 153
Query: 161 STVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDY 220
S VAA EGI QI +GNL SLSEQEL+DCD S ++GC+GGLM++ F++I+ +GG+ E +Y
Sbjct: 154 SAVAATEGIYQITTGNLVSLSEQELVDCD-SVDHGCDGGLMEHGFEFIIKNGGISSEANY 212
Query: 221 PYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSG 280
PY GTC+ KE I GY+ VP N E+ L KA+A+QPVSV+I+A G+ FQFYS
Sbjct: 213 PYTAVNGTCDTNKEASPGAQIKGYETVPVNCEEELQKAVANQPVSVSIDAGGSAFQFYSS 272
Query: 281 GVFTGPCGAELDHGVAAVGYGKS-KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCG 339
GVFTG CG +LDHGV AVGYG + G Y IVKNSWG +WGE GYIRM R EGLCG
Sbjct: 273 GVFTGQCGTQLDHGVTAVGYGSTDDGIQYWIVKNSWGTQWGEEGYIRMLRGIDAQEGLCG 332
Query: 340 INKMASIP 347
I AS P
Sbjct: 333 IAMDASYP 340
>gi|414588007|tpg|DAA38578.1| TPA: hypothetical protein ZEAMMB73_159244 [Zea mays]
Length = 307
Score = 307 bits (786), Expect = 6e-81, Method: Compositional matrix adjust.
Identities = 151/310 (48%), Positives = 208/310 (67%), Gaps = 11/310 (3%)
Query: 44 LIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTS-YWLGLNEFADMSHE 102
+ E E WM+++ + YK EK RFE+FK+N ++ N + + +WLG+N+FAD++ E
Sbjct: 1 MAERHERWMAEYDRVYKDAAEKARRFEVFKDNFAFVESFNADKKNKFWLGVNQFADLTTE 60
Query: 103 EFK-NKYLGLKPQFPTRRQPSAEFSYRD--VKALPKSVDWRKKGAVTPVKNQGSCGSCWA 159
EFK NK G KP P+ F Y + V ALP +VDWR KGAVTP+KNQG CG CWA
Sbjct: 61 EFKANK--GFKP-ISAEEVPTTGFKYENLSVSALPTAVDWRTKGAVTPIKNQGQCGCCWA 117
Query: 160 FSTVAAVEGINQIVSGNLTSLSEQELIDCDT-SFNNGCNGGLMDYAFKYIVASGGLHKEE 218
FS +AA+EGI ++ +GNL SLSEQE +DCDT + + GC GG MD AF++++ +GGL E
Sbjct: 118 FSAIAAMEGIVKLSTGNLVSLSEQEPVDCDTHNMDEGCEGGWMDNAFEFVIKNGGLATES 177
Query: 219 DYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFY 278
YPY + +G C K TI G++DVP N+E +L+K +A QPVSVA++AS F Y
Sbjct: 178 SYPYKVVDGKC--KGGSKSAATIKGHEDVPPNNEAALMKVVASQPVSVAVDASDRTFMLY 235
Query: 279 SGGVFTGPCGAELDHGVAAVGYG-KSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGL 337
SGGV TG CG +LDHG+AA+GYG +S + Y I+KNSWG WGE+G++RM+++ G+
Sbjct: 236 SGGVMTGSCGTQLDHGIAAIGYGVESDDTKYWILKNSWGTTWGEKGFLRMEKDISDKRGM 295
Query: 338 CGINKMASIP 347
C + S P
Sbjct: 296 CDLAMKPSYP 305
>gi|242094002|ref|XP_002437491.1| hypothetical protein SORBIDRAFT_10g028010 [Sorghum bicolor]
gi|241915714|gb|EER88858.1| hypothetical protein SORBIDRAFT_10g028010 [Sorghum bicolor]
Length = 397
Score = 306 bits (785), Expect = 7e-81, Method: Compositional matrix adjust.
Identities = 156/335 (46%), Positives = 213/335 (63%), Gaps = 30/335 (8%)
Query: 42 DKLIELFESWMSKHGKTY-KCI---EEKLHRFEIFKENLKHIDQRNKEVT----SYWLGL 93
+++ ++E+W SKHG+ C +E R E+F++NL++ID N E ++ LGL
Sbjct: 48 EEVRRMYEAWKSKHGRPRGNCDMAGDEDRLRLEVFRDNLRYIDAHNAEADAGLHTFRLGL 107
Query: 94 NEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKA------------------LPK 135
FAD++ EE++ + LG + + R PSA + V + LP
Sbjct: 108 TPFADLTLEEYRGRALGFRARH--RGGPSARAAASRVGSGGTRSHHRRPRPRPRCGDLPD 165
Query: 136 SVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNG 195
++DWR+ GAVT VKNQ CG CWAFS VAA+EGIN IV+GNL SLSEQE+IDCDT ++G
Sbjct: 166 AIDWRQLGAVTDVKNQEQCGGCWAFSAVAAIEGINAIVTGNLVSLSEQEIIDCDTQ-DSG 224
Query: 196 CNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCE-DKKEEMEVVTISGYQDVPENDEQS 254
CNGG M+ AF++++ +GG+ E DYP++ +GTC+ +K + +V I G+ +V N+E +
Sbjct: 225 CNGGQMENAFQFVIDNGGIDSEADYPFIATDGTCDANKANDEKVAAIDGFVEVASNNETA 284
Query: 255 LLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNS 314
L +A+A QPVSVAI+A G FQ YS G+F GPCG LDHGV VGYG G Y IVKNS
Sbjct: 285 LQEAVAIQPVSVAIDAGGRAFQHYSSGIFNGPCGTNLDHGVTVVGYGSENGKAYWIVKNS 344
Query: 315 WGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
W WGE GYIR++RN P G CGI AS P+K
Sbjct: 345 WSDSWGEAGYIRIRRNVFLPVGKCGIAMDASYPVK 379
>gi|218202087|gb|EEC84514.1| hypothetical protein OsI_31214 [Oryza sativa Indica Group]
Length = 348
Score = 306 bits (785), Expect = 8e-81, Method: Compositional matrix adjust.
Identities = 156/345 (45%), Positives = 218/345 (63%), Gaps = 23/345 (6%)
Query: 4 FSHSKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIE 63
+ +K LL ++ L CS++ + L+ + E WM+++G+ YK
Sbjct: 1 MAMAKALLFAILGCLCLCSAV--------LAARELSDDAAMAARHERWMAQYGRMYKDDA 52
Query: 64 EKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFK----NKYLGLKPQFPTRR 119
EK RFE+FK N I+ N +WLG+N+FAD++++EF+ NK G P T R
Sbjct: 53 EKARRFEVFKANAAFIESFNAGNHKFWLGVNQFADLTNDEFRLTKTNK--GFIPS--TTR 108
Query: 120 QPSAEFSYRDVK--ALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNL 177
P+ F Y +V ALP ++DWR KG VTP+K+QG CG CWAFS VAA+EGI ++ +G L
Sbjct: 109 VPTG-FRYENVNIDALPATMDWRTKGVVTPIKDQGQCGCCWAFSAVAAMEGIVKLSTGKL 167
Query: 178 TSLSEQELIDCDT-SFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEM 236
SLSEQEL+DCD + GC GGLMD AFK+I+ +GGL E +YPY + C K
Sbjct: 168 ISLSEQELVDCDVHGEDQGCEGGLMDDAFKFIIKNGGLTTESNYPYAAADDKC--KSVSN 225
Query: 237 EVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVA 296
V +I GY+DVP N+E +L+KA+A+QPVSVA++ FQFY GGV G CG +LDHG+
Sbjct: 226 SVASIKGYEDVPANNEAALMKAVANQPVSVAVDGDDMTFQFYKGGVMIGSCGTDLDHGIV 285
Query: 297 AVGYGK-SKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGI 340
A+GYGK S G+ Y ++KNSWG WGE G++RM+++ G+CG+
Sbjct: 286 AIGYGKASDGTKYWLLKNSWGMTWGENGFLRMEKDISDKRGMCGL 330
>gi|2463586|dbj|BAA22545.1| FB22 precursor [Ananas comosus]
Length = 340
Score = 306 bits (784), Expect = 9e-81, Method: Compositional matrix adjust.
Identities = 145/301 (48%), Positives = 202/301 (67%), Gaps = 6/301 (1%)
Query: 42 DKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQ-RNKEVTSYWLGLNEFADMS 100
D +++ FE WM+++G+ YK +EK+ RF+IFK N+ HI+ N+ SY LG+N+F DM+
Sbjct: 31 DPMMKRFEEWMAEYGRVYKDNDEKMRRFQIFKNNVNHIETFNNRNGNSYTLGINKFTDMT 90
Query: 101 HEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAF 160
+ EF +Y G+ +R+P F ++ A+ +S+DWR GAVT VK+Q CGSCWAF
Sbjct: 91 NNEFVTQYTGVSLPLNFKREPVVSFDDVNISAVGQSIDWRDYGAVTEVKDQNPCGSCWAF 150
Query: 161 STVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDY 220
S +A VEGI +IV+G L SLSEQE++DC S NGC+GG +D A+ +I+++ G+ E DY
Sbjct: 151 SAIATVEGIYKIVTGYLVSLSEQEVLDCAVS--NGCDGGFVDNAYDFIISNNGVASEADY 208
Query: 221 PYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSG 280
PY EG C I+GY V NDE S+ A+ +QP++ AI+ASG +FQ+Y+G
Sbjct: 209 PYQAYEGDCTANSWPNSAY-ITGYSYVRSNDESSMKYAVWNQPIAAAIDASGDNFQYYNG 267
Query: 281 GVFTGPCGAELDHGVAAVGYGK-SKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCG 339
GVF+GPCG L+H + +GYG+ S G+ Y IVKNSWG WGERGY+RM R GLCG
Sbjct: 268 GVFSGPCGTSLNHAITIIGYGQDSSGTQYWIVKNSWGSSWGERGYVRMARGVSS-SGLCG 326
Query: 340 I 340
I
Sbjct: 327 I 327
>gi|413943290|gb|AFW75939.1| maize insect resistance1 [Zea mays]
Length = 435
Score = 306 bits (784), Expect = 9e-81, Method: Compositional matrix adjust.
Identities = 156/337 (46%), Positives = 213/337 (63%), Gaps = 36/337 (10%)
Query: 42 DKLIELFESWMSKHGK----TYKCI---------EEKLHRFEIFKENLKHIDQRNKEVT- 87
+++ ++E+W SKHG+ C E++ R E+F++NL++ID+ N E
Sbjct: 78 EEVRRMYEAWKSKHGRGGSSNDDCDMAPGDDEQEEDRRLRLEVFRDNLRYIDKHNAEADA 137
Query: 88 ---SYWLGLNEFADMSHEEFKNKYLGLKPQFPT-----------RRQPSAEFSYRDVKAL 133
++ LGL FAD++ +E++ + LG + + R +P R L
Sbjct: 138 GLHTFRLGLTPFADLTLDEYRGRVLGFRARARRSGARYGHGHGYRARP------RGGDLL 191
Query: 134 PKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFN 193
P ++DWR+ GAVT VK+Q CG CWAFS VAA+EGIN I +GNL SLSEQE+IDCD +
Sbjct: 192 PDAIDWRQLGAVTEVKDQQQCGGCWAFSAVAAIEGINAIATGNLVSLSEQEIIDCDAQ-D 250
Query: 194 NGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEME-VVTISGYQDVPENDE 252
+GC+GG M+ AF++++ +GG+ E DYP++ +GTC+ KE E V TI G +V N+E
Sbjct: 251 SGCDGGQMENAFRFVIGNGGIDTEADYPFIGTDGTCDASKENNEKVATIDGLVEVASNNE 310
Query: 253 QSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVK 312
+L +A+A QPVSVAI+ASG FQ YS G+F GPCG LDHGV AVGYG G DY IVK
Sbjct: 311 TALQEAVAIQPVSVAIDASGRAFQHYSSGIFNGPCGTSLDHGVTAVGYGSESGKDYWIVK 370
Query: 313 NSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
NSW WGE GYIRM+RN +P G CGI AS P+K
Sbjct: 371 NSWSASWGEAGYIRMRRNVPRPTGKCGIAMDASYPVK 407
>gi|356549192|ref|XP_003542981.1| PREDICTED: cysteine proteinase RD21a-like [Glycine max]
Length = 517
Score = 306 bits (783), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 165/350 (47%), Positives = 223/350 (63%), Gaps = 13/350 (3%)
Query: 9 LLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHR 68
LL L F C L ++SI+ + S + +IELF+ W ++ K Y+ +++ R
Sbjct: 11 LLFLVWGSWTFLCYGLPSEYSILALEIDKFPSEEGVIELFQRWKEENKKIYRSPDQEKLR 70
Query: 69 FEIFKENLKHIDQRN-KEVTSYW--LGLNEFADMSHEEFKNKYLG-LKPQFPTRRQPSA- 123
FE FK NLK+I ++N K ++ Y LGLN FADMS+EEFK+K+ +K F R S
Sbjct: 71 FENFKRNLKYIAEKNSKRISPYGQSLGLNRFADMSNEEFKSKFTSKVKKPFSKRNGLSGK 130
Query: 124 EFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQ 183
+ S D P S+DWRKKG VT VK+QG CG CWAFS+ A+EGIN IVSG+L SLSE
Sbjct: 131 DHSCEDA---PYSLDWRKKGVVTAVKDQGYCGCCWAFSSTGAIEGINAIVSGDLISLSEP 187
Query: 184 ELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISG 243
EL+DCD + N+GC+GG MDYAF++++ +GG+ E +YPY +GTC KEE +V+ I G
Sbjct: 188 ELVDCDRT-NDGCDGGHMDYAFEWVMHNGGIDTETNYPYSGADGTCNVAKEETKVIGIDG 246
Query: 244 YQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGA---ELDHGVAAVGY 300
Y +V ++D +SLL A QP+S I+ S DFQ Y GG++ G C + ++DH + VGY
Sbjct: 247 YYNVEQSD-RSLLCATVKQPISAGIDGSSWDFQLYIGGIYDGDCSSDPDDIDHAILVVGY 305
Query: 301 GKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLKK 350
G DY IVKNSWG WG GYI ++RNT G+C IN MAS P K+
Sbjct: 306 GSEGDEDYWIVKNSWGTSWGMEGYIYIRRNTNLKYGVCAINYMASYPTKE 355
>gi|47169030|pdb|1S4V|A Chain A, The 2.0 A Crystal Structure Of The Kdel-Tailed Cysteine
Endopeptidase Functioning In Programmed Cell Death Of
Ricinus Communis Endosperm
gi|47169031|pdb|1S4V|B Chain B, The 2.0 A Crystal Structure Of The Kdel-Tailed Cysteine
Endopeptidase Functioning In Programmed Cell Death Of
Ricinus Communis Endosperm
Length = 229
Score = 306 bits (783), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 145/219 (66%), Positives = 171/219 (78%), Gaps = 1/219 (0%)
Query: 133 LPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSF 192
+P SVDWRKKGAVT VK+QG CGSCWAFST+ AVEGINQI + L SLSEQEL+DCDT
Sbjct: 2 VPASVDWRKKGAVTSVKDQGQCGSCWAFSTIVAVEGINQIKTNKLVSLSEQELVDCDTDQ 61
Query: 193 NNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDE 252
N GCNGGLMDYAF++I GG+ E +YPY +GTC+ KE V+I G+++VPENDE
Sbjct: 62 NQGCNGGLMDYAFEFIKQRGGITTEANYPYEAYDGTCDVSKENAPAVSIDGHENVPENDE 121
Query: 253 QSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKS-KGSDYIIV 311
+LLKA+A+QPVSVAI+A G+DFQFYS GVFTG CG ELDHGVA VGYG + G+ Y V
Sbjct: 122 NALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGSCGTELDHGVAIVGYGTTIDGTKYWTV 181
Query: 312 KNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLKK 350
KNSWGP+WGE+GYIRM+R EGLCGI AS P+KK
Sbjct: 182 KNSWGPEWGEKGYIRMERGISDKEGLCGIAMEASYPIKK 220
>gi|356515080|ref|XP_003526229.1| PREDICTED: vignain-like [Glycine max]
Length = 284
Score = 305 bits (782), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 150/281 (53%), Positives = 200/281 (71%), Gaps = 9/281 (3%)
Query: 73 KENLKHIDQRNKEVTS-YWLGLNEFADMSHEEF---KNKYLGLKPQFPTRRQPSAEFSYR 128
KEN+ +I+ N Y LG+N+FAD++ EEF +N++ G +F R + F Y
Sbjct: 5 KENVNYIEAFNNAANKPYKLGINQFADLTSEEFIVPRNRFNG-HMRFSNTR--TTTFKYE 61
Query: 129 DVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDC 188
+V LP S+DWR+KGAVTP+KNQGSCG CWAFS +AA EGI++I +G L SLSEQE++DC
Sbjct: 62 NVTVLPDSIDWRQKGAVTPIKNQGSCGCCWAFSAIAATEGIHKISTGKLVSLSEQEVVDC 121
Query: 189 DT-SFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDV 247
DT ++GC GG MD AFK+I+ + G++ E YPY +G C K+E + TI+GY+DV
Sbjct: 122 DTKGTDHGCEGGYMDGAFKFIIQNHGINTEASYPYKGVDGKCNIKEEAVHATTITGYEDV 181
Query: 248 PENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGK-SKGS 306
P N+E++L KA+A+QPVSVAI+A G DFQFY G+FTG CG ELDHGV AVGYG+ ++G+
Sbjct: 182 PINNEKALQKAVANQPVSVAIDARGADFQFYKSGIFTGSCGTELDHGVTAVGYGENNEGT 241
Query: 307 DYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
Y +VKNSWG +WGE GY M+R EG+CGI +AS P
Sbjct: 242 KYWLVKNSWGTEWGEEGYTMMQRGVKAVEGICGIAMLASYP 282
>gi|357439999|ref|XP_003590277.1| Cysteine protease [Medicago truncatula]
gi|355479325|gb|AES60528.1| Cysteine protease [Medicago truncatula]
Length = 514
Score = 305 bits (781), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 162/386 (41%), Positives = 221/386 (57%), Gaps = 52/386 (13%)
Query: 15 SLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKE 74
SL+ +C + ++SI+ + S ++++ELF+ W +H K Y EE R E FK
Sbjct: 19 SLTFLSCYGIPSEYSILAFDLNKFPSEEQVVELFQQWKKEHQKFYIHPEEAALRLENFKR 78
Query: 75 NLKHIDQRNKEVTS---YWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVK 131
NLK+I +RN S + LGLN FADMS+EEFKNK++ + ++R +
Sbjct: 79 NLKYIVERNAMRNSPVGHHLGLNRFADMSNEEFKNKFISKVKKPISKRASNLHVKVESCD 138
Query: 132 ALPKSVDWRKKGAVTPVKNQGSCG------------------------------------ 155
P S+DWRKKG VT VK+QG+CG
Sbjct: 139 DAPYSLDWRKKGVVTGVKDQGNCGKLLYFMHFKSFLVIYILELTTNFPLYSFESQFCILE 198
Query: 156 --------SCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKY 207
SCW+FS+ A+EG+N IV+G+L SLSEQEL+DCDT+ N+GC GG MDYAF++
Sbjct: 199 KKKLDFVGSCWSFSSTGAIEGVNAIVTGDLISLSEQELVDCDTT-NDGCEGGYMDYAFEW 257
Query: 208 IVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVA 267
++ +GG+ E DYPY+ GTC KEE +VVTI GY DV ++D +L A QP+SV
Sbjct: 258 VINNGGIDTEADYPYIGVGGTCNVTKEETKVVTIDGYTDVTQSD-SALFCATVKQPISVG 316
Query: 268 IEASGTDFQFYSGGVFTGPCGA---ELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGY 324
I+ S DFQ Y+GG++ G C + ++DH V VGYG DY IVKNSWG WG G+
Sbjct: 317 IDGSTLDFQLYTGGIYDGDCSSNPDDIDHAVLIVGYGSDGNQDYWIVKNSWGTSWGIEGF 376
Query: 325 IRMKRNTGKPEGLCGINKMASIPLKK 350
I ++RNT G+C IN MAS P K+
Sbjct: 377 IYIRRNTNLKYGVCAINYMASFPTKE 402
>gi|113120273|gb|ABI30276.1| VXH-C [Vasconcellea x heilbornii]
Length = 282
Score = 303 bits (777), Expect = 6e-80, Method: Compositional matrix adjust.
Identities = 155/281 (55%), Positives = 203/281 (72%), Gaps = 5/281 (1%)
Query: 4 FSHSKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIE 63
FS SKL+ ++ L + A S A DFSIVGYS + LTS++K I LFESWM KH K YK +E
Sbjct: 5 FSISKLIFVATCLIVRAGLSFA-DFSIVGYSQDDLTSIEKSIRLFESWMLKHDKVYKSME 63
Query: 64 EKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPT--RRQP 121
EK++RFEIFK+NL +ID+ NK+ SYWLGLNEFAD++H+EFK KY+G P+ T +
Sbjct: 64 EKINRFEIFKDNLMYIDETNKKNNSYWLGLNEFADLTHDEFKKKYVGSIPEDYTIIEQSD 123
Query: 122 SAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLS 181
EF Y+ V P+SVDWR+KGAVTPVK+Q CGSCWAFSTVA VEGIN+IV+G L SLS
Sbjct: 124 DGEFPYKHVVDYPESVDWRQKGAVTPVKDQNPCGSCWAFSTVATVEGINKIVTGKLISLS 183
Query: 182 EQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTI 241
EQEL+DCD ++GC+GG + +Y+V G+H E +Y Y ++G C K ++ V I
Sbjct: 184 EQELLDCDRR-SHGCDGGYQRTSLQYVV-DNGVHTEYEYQYEKKQGNCRAKNKKGLKVYI 241
Query: 242 SGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGV 282
+GY+ VP NDE SL+K +A+QPVSV +++S F FY GG+
Sbjct: 242 NGYKGVPPNDEISLIKVIANQPVSVLVDSSERAFHFYRGGI 282
>gi|297819568|ref|XP_002877667.1| hypothetical protein ARALYDRAFT_348033 [Arabidopsis lyrata subsp.
lyrata]
gi|297323505|gb|EFH53926.1| hypothetical protein ARALYDRAFT_348033 [Arabidopsis lyrata subsp.
lyrata]
Length = 341
Score = 303 bits (776), Expect = 8e-80, Method: Compositional matrix adjust.
Identities = 154/312 (49%), Positives = 206/312 (66%), Gaps = 11/312 (3%)
Query: 45 IELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT-SYWLGLNEFADMSHEE 103
IE E WMS+ + Y EK RFEIFK+NLK ++ N +Y L +NEF+D++ EE
Sbjct: 32 IEKHEQWMSRFHRVYSDDSEKTSRFEIFKKNLKFVESFNMNTNKTYTLDVNEFSDLTDEE 91
Query: 104 FKNKYLGLK-PQFPTR-----RQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSC 157
FK +Y GL P+ TR + F Y +V +S+DWR++GAVT VK+Q CG C
Sbjct: 92 FKARYTGLVVPEGMTRMSTTDSHETVSFRYENVGETGESMDWREEGAVTSVKHQQQCGCC 151
Query: 158 WAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKE 217
WAFS VAAVEG+ +I G L SLSEQ+L+DC T N+GC+GG+M AF YIV + G+ E
Sbjct: 152 WAFSAVAAVEGMTKIAKGELVSLSEQQLLDCSTE-NDGCDGGIMWKAFDYIVENQGITAE 210
Query: 218 EDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQF 277
++YPY + TCE + TISGY+ VP+NDE++LLKA++ QPVSVAIE SG +F
Sbjct: 211 DNYPYQGAQQTCESNH--VAAATISGYETVPQNDEEALLKAVSQQPVSVAIEGSGYEFIH 268
Query: 278 YSGGVFTGPCGAELDHGVAAVGYGKS-KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEG 336
YSGG+F G CG L+H V VGYG S +G Y ++KNSWG WGE GY+R+ R+ P+G
Sbjct: 269 YSGGIFNGECGTHLNHAVTIVGYGVSEEGIKYWLLKNSWGESWGEDGYMRIMRDVDAPQG 328
Query: 337 LCGINKMASIPL 348
+CG+ +A P+
Sbjct: 329 MCGLASLAYYPV 340
>gi|413919735|gb|AFW59667.1| hypothetical protein ZEAMMB73_680472 [Zea mays]
Length = 344
Score = 303 bits (775), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 139/265 (52%), Positives = 191/265 (72%), Gaps = 7/265 (2%)
Query: 28 FSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKE-- 85
SIV Y S ++ ++ WM+ HG+TY + E+ RFE+F++NL+++D N
Sbjct: 29 MSIVSYGER---SEEEARRMYAEWMAAHGRTYNAVGEEERRFEVFRDNLRYVDAHNAAAD 85
Query: 86 --VTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKG 143
V S+ LGLN FAD++++E++ YLG++ + R+ + D + LP+SVDWR KG
Sbjct: 86 AGVHSFRLGLNRFADLTNDEYRATYLGVRSRPQRERRLGDRYLAGDNEDLPESVDWRAKG 145
Query: 144 AVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDY 203
AV VK+QGSCGSCWAFST+AAVEGINQIV+G++ SLSEQEL+DCDTS+N GCNGGLMDY
Sbjct: 146 AVAEVKDQGSCGSCWAFSTIAAVEGINQIVTGDMISLSEQELVDCDTSYNQGCNGGLMDY 205
Query: 204 AFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQP 263
AF++I+ +GG+ EEDYPY +G C+ ++ +VVTI Y+DVP N E+SL KA+A+QP
Sbjct: 206 AFEFIINNGGIDTEEDYPYKGTDGRCDVNRKNAKVVTIDSYEDVPANSEKSLQKAVANQP 265
Query: 264 VSVAIEASGTDFQFYSGGVFTGPCG 288
+SVAIEA G FQ Y+ G+FTG CG
Sbjct: 266 ISVAIEAGGRAFQLYNSGIFTGTCG 290
>gi|116794072|gb|ABK26996.1| unknown [Picea sitchensis]
Length = 367
Score = 303 bits (775), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 161/360 (44%), Positives = 217/360 (60%), Gaps = 26/360 (7%)
Query: 9 LLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHR 68
LLL+S ++ ++ A S Y + S + L+ LF+ W+ +HGK Y EEK R
Sbjct: 7 LLLISATIICLVSAAKAVQHS---YEVGDINSGNGLVRLFDRWLGRHGKLYGSHEEKARR 63
Query: 69 FEIFKENLKHIDQRNKEV-TSYWLGLNEFADMSHEEFKNKYLGLKP-QFPTRRQPSAEFS 126
+IF+ NL++I NK +S+ LGLN+FAD+++EEFK +Y G Q+ RR+ E +
Sbjct: 64 LQIFRTNLQYIHAHNKNSNSSFRLGLNKFADLTNEEFKTRYFGKNSKQWRDRRRTELEGA 123
Query: 127 YRDVKALPKS--------------VDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQI 172
+++ + K +DWRKKGAVT VK+Q CGSCWAFST A+EG+N I
Sbjct: 124 --ELRPVLKQTVGSQSSSCSIASSLDWRKKGAVTGVKDQAQCGSCWAFSTTGAIEGVNFI 181
Query: 173 VSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDK 232
+G L SLSEQEL+ CD + N GC GG MDYAF +++ +GG+ E+DY Y + TC
Sbjct: 182 STGKLVSLSEQELVACDAT-NYGCEGGDMDYAFTWVIQNGGIDTEKDYSYTGVDSTCNTN 240
Query: 233 KEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGA--- 289
KE ++V+I GY DV D+ +LL A QPVSV I+ S DFQ Y+GG++ G C
Sbjct: 241 KEAKKIVSIDGYTDVSP-DDSALLCAAGSQPVSVGIDGSAIDFQLYTGGIYDGDCSGNPD 299
Query: 290 ELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
++DH V VGY G DY IVKNSWG WG GY + RNT P G+C IN MAS P K
Sbjct: 300 DIDHAVLVVGYSAKNGKDYWIVKNSWGTDWGLEGYFYILRNTELPYGVCAINAMASYPTK 359
>gi|45738078|gb|AAS75836.1| fastuosain precursor [Bromelia fastuosa]
Length = 324
Score = 302 bits (774), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 145/308 (47%), Positives = 201/308 (65%), Gaps = 6/308 (1%)
Query: 42 DKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQ-RNKEVTSYWLGLNEFADMS 100
D ++E FE WM+++G+ Y EK+ RF+IFK N+ HI+ N+ SY LG+N+F DM+
Sbjct: 4 DPMMERFEEWMAEYGRVYNDNAEKMRRFQIFKNNVNHIETFNNRSGNSYTLGVNQFTDMT 63
Query: 101 HEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAF 160
+ EF +Y G R P F D+ A+P+S+DWR GAVT VKNQGSCGSCWAF
Sbjct: 64 NNEFLARYTGASLPLNIERDPVVSFDDVDISAVPQSIDWRDYGAVTSVKNQGSCGSCWAF 123
Query: 161 STVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDY 220
S +A VEGI +I +GNL SLSEQE++DC S+ GC+GG ++ A+ +I+++ G+ +
Sbjct: 124 SAIATVEGIYKIKAGNLISLSEQEVLDCALSY--GCDGGWVNKAYDFIISNNGVTSFANL 181
Query: 221 PYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSG 280
PY +G C + + I+GY V N+E+S++ A+A+QP++ I+A G DFQ+Y
Sbjct: 182 PYKGYKGPC-NHNDLPNKAYITGYTYVQSNNERSMMIAVANQPIAALIDAGG-DFQYYKS 239
Query: 281 GVFTGPCGAELDHGVAAVGYGK-SKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCG 339
GVFTG CG L+H + +GYG+ S G+ Y IVKNSWG WGERGYIRM R+ P GLCG
Sbjct: 240 GVFTGSCGTSLNHAITVIGYGQTSSGTKYWIVKNSWGTSWGERGYIRMARDVSSPYGLCG 299
Query: 340 INKMASIP 347
I P
Sbjct: 300 IAMAPLFP 307
>gi|242072390|ref|XP_002446131.1| hypothetical protein SORBIDRAFT_06g002140 [Sorghum bicolor]
gi|241937314|gb|EES10459.1| hypothetical protein SORBIDRAFT_06g002140 [Sorghum bicolor]
Length = 328
Score = 302 bits (773), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 156/338 (46%), Positives = 219/338 (64%), Gaps = 26/338 (7%)
Query: 16 LSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKEN 75
L++ C+SL + L+ ++E E+WM ++G+ YK EK RF++FK+N
Sbjct: 9 LAILGCASLCSSV----LAARELSDA-AMVERHENWMVEYGRVYKDAAEKARRFQVFKDN 63
Query: 76 LKHIDQRNKEVTS-YWLGLNEFADMSHEEFK-NKYLGLKPQFPTRRQPSAEFSYRD--VK 131
+ ++ N + +WLG+N+FAD++ EEFK NK G KP + P+ F Y + V
Sbjct: 64 VAFVESFNTNKNNKFWLGVNQFADLTTEEFKANK--GFKPT--AEKVPTTGFKYENLSVS 119
Query: 132 ALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDT- 190
ALP +VDWR KGAVTP+KNQG C AA+EGI ++ +GNL SLSEQEL+DCDT
Sbjct: 120 ALPTAVDWRTKGAVTPIKNQGQC---------AAMEGIVKLSTGNLISLSEQELVDCDTH 170
Query: 191 SFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPEN 250
S + GC GG MD AF++++ +GGL E +YPY +G C K TI G++DVP N
Sbjct: 171 SMDEGCEGGWMDSAFEFVIKNGGLATESNYPYKAVDGKC--KGGSKSAATIKGHEDVPVN 228
Query: 251 DEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYG-KSKGSDYI 309
+E +L+KA+A+QPVSVA++AS F YSGGV TG CG ELDHG+AA+GYG +S G+ Y
Sbjct: 229 NEAALMKAVANQPVSVAVDASDRTFMLYSGGVMTGSCGTELDHGIAAIGYGMESDGTKYW 288
Query: 310 IVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
I+KNSWG WGE+G++RM+++ G+CG+ S P
Sbjct: 289 ILKNSWGTTWGEKGFLRMEKDITDKRGMCGLAMKPSYP 326
>gi|224116880|ref|XP_002317417.1| predicted protein [Populus trichocarpa]
gi|118488173|gb|ABK95906.1| unknown [Populus trichocarpa]
gi|222860482|gb|EEE98029.1| predicted protein [Populus trichocarpa]
Length = 498
Score = 301 bits (771), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 155/334 (46%), Positives = 208/334 (62%), Gaps = 8/334 (2%)
Query: 22 SSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQ 81
S L ++S V + + + E+F+ W KH K YK EE R FK NLK+I +
Sbjct: 24 SGLPGEYSAVSNDLHEGLTEEGITEVFKLWKEKHQKVYKHAEEAERRIGNFKRNLKYIIE 83
Query: 82 RNKEVTS---YWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVD 138
+N + S + +GLN+FAD+S+EEF+ YL K + P + + + P S+D
Sbjct: 84 KNGKRKSGLEHKVGLNKFADLSNEEFREMYLS-KVKKPITIEEKRKHRHLQTCDAPSSLD 142
Query: 139 WRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNG 198
WR KG VT VK+QG CGSCW+FST A+E IN IV+G+L SLSEQEL+DCDT+ N GC G
Sbjct: 143 WRNKGVVTAVKDQGDCGSCWSFSTTGAIEAINAIVTGDLISLSEQELVDCDTTNNYGCEG 202
Query: 199 GLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKA 258
G MD AF++++ +GG+ E DYPY +GTC KEE +VV+I GY DV +D +LL A
Sbjct: 203 GDMDSAFQWVIGNGGIDTEADYPYTGVDGTCNTAKEEKKVVSIEGYVDVDPSD-SALLCA 261
Query: 259 LAHQPVSVAIEASGTDFQFYSGGVFTGPCGA---ELDHGVAAVGYGKSKGSDYIIVKNSW 315
QP+SV ++ S DFQ Y+GG++ G C ++DH + VGYG DY IVKNSW
Sbjct: 262 TVQQPISVGMDGSALDFQLYTGGIYDGDCSGDPNDIDHAILIVGYGSENDEDYWIVKNSW 321
Query: 316 GPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
G +WG GY ++RNT KP G+C IN AS P K
Sbjct: 322 GTEWGMEGYFYIRRNTSKPYGVCAINADASYPTK 355
>gi|57118009|gb|AAW34136.1| cysteine protease gp3a [Zingiber officinale]
Length = 475
Score = 301 bits (771), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 152/309 (49%), Positives = 201/309 (65%), Gaps = 7/309 (2%)
Query: 47 LFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT----SYWLGLNEFADMSHE 102
+++ W KH +R E+FKENL+ +D+ N +Y LG+N FAD+++E
Sbjct: 51 IYQEWRVKHRPAENDQYVGDYRLEVFKENLRFVDEHNAAADRGEHAYRLGMNRFADLTNE 110
Query: 103 EFKNKYLGLKPQF--PTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAF 160
E++ ++L + T + S ++ R+ LP S+DWR+KGAV VKNQG CGSCWAF
Sbjct: 111 EYRARFLRDLSRLGRSTSGEISNQYRLREGDVLPDSIDWREKGAVVAVKNQGRCGSCWAF 170
Query: 161 STVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDY 220
+ +AAVEGINQIV+G+L SLSEQ+L+DC T N GC GG AF+YI+ +GG++ EE Y
Sbjct: 171 AAIAAVEGINQIVTGDLISLSEQQLVDCSTR-NYGCEGGWPYRAFQYIINNGGVNSEEHY 229
Query: 221 PYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSG 280
PY GTC KE VV+I Y++VP NDE+SL KA A+QP+SV I+ASG +FQ Y
Sbjct: 230 PYTGTNGTCNTTKENAHVVSIDSYRNVPSNDEKSLQKAAANQPISVGIDASGRNFQLYHS 289
Query: 281 GVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGI 340
G+FTG C L+HGV VGYG G+DY IVKNSWG WG GYI M+RN + G CGI
Sbjct: 290 GIFTGSCNTSLNHGVTVVGYGTENGNDYWIVKNSWGENWGNSGYILMERNIAESSGKCGI 349
Query: 341 NKMASIPLK 349
S P+K
Sbjct: 350 AISPSYPIK 358
>gi|356543114|ref|XP_003540008.1| PREDICTED: LOW QUALITY PROTEIN: KDEL-tailed cysteine endopeptidase
CEP1-like [Glycine max]
Length = 343
Score = 300 bits (769), Expect = 5e-79, Method: Compositional matrix adjust.
Identities = 155/309 (50%), Positives = 200/309 (64%), Gaps = 6/309 (1%)
Query: 44 LIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTS-YWLGLNEFADMSHE 102
+ E E WM K+GK YK E RF IF+ N++ I+ N Y L +N AD ++E
Sbjct: 34 MYERHEQWMEKYGKVYKDSAEMQKRFLIFENNVEFIESFNAAGNKPYKLSINHLADQTNE 93
Query: 103 EFKNKYLGLKPQF--PTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAF 160
EF + G K R F Y +V +P +VDWR+KG VT +K+Q CG+CWAF
Sbjct: 94 EFMASHKGYKGSHWQGLRITTQTPFKYENVTDIPWAVDWRQKGDVTSIKDQAQCGNCWAF 153
Query: 161 STVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDY 220
S VAA EGI QI +GNL SLSE+EL+DCD S ++GC+GGLM++ F++I+ +GG+ E +Y
Sbjct: 154 SAVAATEGIYQITTGNLVSLSEKELVDCD-SVDHGCDGGLMEHGFEFIIKNGGISSEANY 212
Query: 221 PYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQ-PVSVAIEASGTDFQFYS 279
PY GTC+ KE V I+GY+ VP N E+ L KA+A+Q +SV+I+A G+ FQFY
Sbjct: 213 PYTAVNGTCDTNKEASPVAQITGYETVPVNCEEELQKAVANQLTMSVSIDAGGSAFQFYP 272
Query: 280 GGVFTGPCGAELDHGVAAVGYGKSK-GSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLC 338
GVFTG CG +LDHGV AVGYG + G+ Y IVKNSWG +WGE GYIRM R EGLC
Sbjct: 273 SGVFTGQCGTQLDHGVTAVGYGSTDYGTQYWIVKNSWGTQWGEEGYIRMLRGIDAQEGLC 332
Query: 339 GINKMASIP 347
GI AS P
Sbjct: 333 GIAMDASYP 341
>gi|357452869|ref|XP_003596711.1| Cysteine proteinase [Medicago truncatula]
gi|355485759|gb|AES66962.1| Cysteine proteinase [Medicago truncatula]
Length = 344
Score = 300 bits (769), Expect = 6e-79, Method: Compositional matrix adjust.
Identities = 152/320 (47%), Positives = 208/320 (65%), Gaps = 10/320 (3%)
Query: 38 LTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKE-VTSYWLGLNEF 96
L +L+E E WM +HGK YK EK RF+IFKENL+ I+ N + L +N+F
Sbjct: 25 LVISSRLLEKHEQWMEEHGKFYKDAAEKEQRFQIFKENLEFIESFNAAGDNGFNLSINQF 84
Query: 97 ADMSHEEFKNKYLGLKPQFP------TRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKN 150
D +++EFK YL K + P + + F Y +V +P ++DWR++GAVTP+K+
Sbjct: 85 GDQTNDEFKANYLNGKKK-PLIGVGIAAIEEESVFRYENVTEVPATMDWRERGAVTPIKH 143
Query: 151 QGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDC-DTSFNNGCNGGLMDYAFKYIV 209
Q CGSCWAF+TVAA+EGI+QI +G L SLSEQEL+DC T+ +GCNGG ++ A +IV
Sbjct: 144 QHLCGSCWAFATVAAIEGIHQITTGRLVSLSEQELVDCVKTNTTDGCNGGYVEDACDFIV 203
Query: 210 ASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIE 269
GG+ E +YPY +G C +K V I GY+ VP N+E++LLKA+A+QP++V I
Sbjct: 204 KKGGITSETNYPYTRVDGKCNVRKGTYNVAKIKGYEHVPANNEKALLKAVANQPIAVYIA 263
Query: 270 ASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKS-KGSDYIIVKNSWGPKWGERGYIRMK 328
A+ FQFYS G+ G CG +LDH V VGYG S G Y +VKNSWG KWGE+GYI++K
Sbjct: 264 ATKRAFQFYSSGILKGKCGIDLDHTVTIVGYGTSDDGVKYWLVKNSWGTKWGEKGYIKIK 323
Query: 329 RNTGKPEGLCGINKMASIPL 348
R+ EG CGI + + P+
Sbjct: 324 RDVHAKEGSCGIAMVPTYPI 343
>gi|226507844|ref|NP_001148894.1| LOC100282514 precursor [Zea mays]
gi|194703250|gb|ACF85709.1| unknown [Zea mays]
gi|195622994|gb|ACG33327.1| vignain precursor [Zea mays]
Length = 356
Score = 300 bits (769), Expect = 6e-79, Method: Compositional matrix adjust.
Identities = 155/334 (46%), Positives = 206/334 (61%), Gaps = 24/334 (7%)
Query: 38 LTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFA 97
+ D ++E FE WM +HG+ Y EK R E+++ N++ ++ N Y L N+FA
Sbjct: 23 VARADPMLERFEQWMGRHGRLYADAGEKQRRLEVYRRNVELVETFNSMGNGYRLADNKFA 82
Query: 98 DMSHEEFKNKYLGL-KPQ---------FPTRRQ--PSAEFSYRDVKALPKSVDWRKKGAV 145
D+++EEF+ K LG +P+ P+ S + LPKSVDWR+KGAV
Sbjct: 83 DLTNEEFRAKMLGFGRPRSGGGAGHSTAPSTVACIGSGLMGRQGYSDLPKSVDWREKGAV 142
Query: 146 TPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAF 205
PVK+QG CGSCWAFS VAA+EGINQI +G L SLSEQEL+DCDT GC GG M +AF
Sbjct: 143 APVKSQGDCGSCWAFSAVAAIEGINQIKNGKLVSLSEQELVDCDTK-AIGCAGGYMSWAF 201
Query: 206 KYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVS 265
++++ + GL E +YPY G C+ K + V+ISGY +V + E LL+A A QPVS
Sbjct: 202 EFVMKNRGLTTERNYPYQGLNGACQTPKLKESAVSISGYMNVTPSSEPDLLRAAAAQPVS 261
Query: 266 VAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSD-----------YIIVKNS 314
VA++A +Q Y GGVFTGPC AEL+HGV VGYG+++G Y IVKNS
Sbjct: 262 VAVDAGSFVWQLYGGGVFTGPCTAELNHGVTVVGYGETQGDTDGDGSGVPGKKYWIVKNS 321
Query: 315 WGPKWGERGYIRMKRNTGKPEGLCGINKMASIPL 348
WGP+WG+ GYI M+R GLCGI + S P+
Sbjct: 322 WGPEWGDAGYILMQREASVASGLCGIAMLPSYPV 355
>gi|146216002|gb|ABQ10203.1| cysteine protease Cp5 [Actinidia deliciosa]
Length = 509
Score = 300 bits (768), Expect = 8e-79, Method: Compositional matrix adjust.
Identities = 156/339 (46%), Positives = 223/339 (65%), Gaps = 19/339 (5%)
Query: 27 DFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEV 86
+FSIVG P + ++++ELF+ W KHGK YK +E +F+ F++NL+++ ++N E
Sbjct: 31 EFSIVG-RPGESIAEERVVELFKKWTEKHGKVYKHGQEVEKKFQNFRDNLRYVMEKNGER 89
Query: 87 TS---YWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKAL---------P 134
+ + +GLN+FADMS+EEF+ Y+ K + PT ++ + E + A P
Sbjct: 90 GASGGHLVGLNKFADMSNEEFREVYVS-KVKKPTSKRMAIERRRQGKAAAAKAVAACDGP 148
Query: 135 KSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNN 194
S+DWRK G VT VK+QG CGSCWAFS+ A+EGIN + +G+L SLSEQEL+DCD++ N+
Sbjct: 149 TSLDWRKYGIVTGVKDQGDCGSCWAFSSTGAIEGINALANGDLISLSEQELVDCDST-ND 207
Query: 195 GCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQS 254
GC GG MDYAF++++++GG+ E DYPY E+GTC KEE + V+I GY+DV E +E +
Sbjct: 208 GCEGGYMDYAFEWVMSNGGIDTETDYPYTGEDGTCNTTKEETKAVSIDGYEDVAE-EESA 266
Query: 255 LLKALAHQPVSVAIEASGTDFQFYSGGVF---TGPCGAELDHGVAAVGYGKSKGSDYIIV 311
L A+ QP+SV I+ DFQ Y+GG++ ++DH V VGYG G +Y I+
Sbjct: 267 LFCAVLKQPISVGIDGGAIDFQLYTGGIYDGDCSDDPDDIDHAVLVVGYGAESGEEYWII 326
Query: 312 KNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLKK 350
KNSWG WG +GY +KRNT K G+C IN MAS P K+
Sbjct: 327 KNSWGTDWGMKGYAYIKRNTSKDYGVCAINAMASYPTKE 365
>gi|414589857|tpg|DAA40428.1| TPA: Vignain [Zea mays]
Length = 377
Score = 300 bits (768), Expect = 8e-79, Method: Compositional matrix adjust.
Identities = 155/334 (46%), Positives = 206/334 (61%), Gaps = 24/334 (7%)
Query: 38 LTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFA 97
+ D ++E FE WM +HG+ Y EK R E+++ N++ ++ N Y L N+FA
Sbjct: 44 VARADPMLERFEQWMGRHGRLYADAGEKQRRLEVYRRNVELVETFNSMGNGYRLADNKFA 103
Query: 98 DMSHEEFKNKYLGL-KPQ---------FPTRRQ--PSAEFSYRDVKALPKSVDWRKKGAV 145
D+++EEF+ K LG +P+ P+ S + LPKSVDWR+KGAV
Sbjct: 104 DLTNEEFRAKMLGFGRPRSGGGAGHSTAPSTVACIGSGLMGRQGYSDLPKSVDWREKGAV 163
Query: 146 TPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAF 205
PVK+QG CGSCWAFS VAA+EGINQI +G L SLSEQEL+DCDT GC GG M +AF
Sbjct: 164 APVKSQGDCGSCWAFSAVAAIEGINQIKNGKLVSLSEQELVDCDTK-AIGCAGGYMSWAF 222
Query: 206 KYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVS 265
++++ + GL E +YPY G C+ K + V+ISGY +V + E LL+A A QPVS
Sbjct: 223 EFVMKNRGLTTERNYPYQGLNGACQTPKLKESAVSISGYMNVTPSSEPDLLRAAAAQPVS 282
Query: 266 VAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSD-----------YIIVKNS 314
VA++A +Q Y GGVFTGPC AEL+HGV VGYG+++G Y IVKNS
Sbjct: 283 VAVDAGSFVWQLYGGGVFTGPCTAELNHGVTVVGYGETQGDTDGDGSGVPGKKYWIVKNS 342
Query: 315 WGPKWGERGYIRMKRNTGKPEGLCGINKMASIPL 348
WGP+WG+ GYI M+R GLCGI + S P+
Sbjct: 343 WGPEWGDAGYILMQREASVASGLCGIAMLPSYPV 376
>gi|125604306|gb|EAZ43631.1| hypothetical protein OsJ_28254 [Oryza sativa Japonica Group]
Length = 369
Score = 300 bits (768), Expect = 8e-79, Method: Compositional matrix adjust.
Identities = 160/319 (50%), Positives = 201/319 (63%), Gaps = 15/319 (4%)
Query: 33 YSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLG 92
+ + + S + L EL+E W +H + + + EK RF +FK+N++ I + N+ Y L
Sbjct: 33 FGDKDVASEEALWELYERWRGQH-RVARDLGEKARRFNVFKDNVRLIHEFNRRDEPYKLR 91
Query: 93 LNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQG 152
LN F DM+ +E Y +R F R KA R GAV VK+QG
Sbjct: 92 LNRFGDMTADESAGAYA------SSRVSHHRMFRGRGEKAQ------RLHGAVGAVKDQG 139
Query: 153 SCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNN-GCNGGLMDYAFKYIVAS 211
CGSCWAFST+AAVEGIN I + NLT+LSEQ+L+DCDT N GC+GGLMD AF+YI
Sbjct: 140 QCGSCWAFSTIAAVEGINAIRTSNLTALSEQQLVDCDTKTGNAGCDGGLMDNAFQYIAKH 199
Query: 212 GGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEAS 271
GG+ YPY + +C+ VTI GY+DVP N E +L KA+A+QPVSVAIEA
Sbjct: 200 GGVAASSAYPYRARQSSCKSSAASSPAVTIDGYEDVPANSESALKKAVANQPVSVAIEAG 259
Query: 272 GTDFQFYSGGVFTGPCGAELDHGVAAVGYGKS-KGSDYIIVKNSWGPKWGERGYIRMKRN 330
G+ FQFYS GVF G CG ELDHGVAAVGYG + G+ Y IV+NSWG WGE+GYIRMKR+
Sbjct: 260 GSHFQFYSEGVFAGKCGTELDHGVAAVGYGTTVDGTKYWIVRNSWGADWGEKGYIRMKRD 319
Query: 331 TGKPEGLCGINKMASIPLK 349
EGLCGI AS P+K
Sbjct: 320 VSAKEGLCGIAMEASYPIK 338
>gi|2342494|dbj|BAA21848.1| bromelain [Ananas comosus]
gi|2463582|dbj|BAA22543.1| FB31 precursor [Ananas comosus]
Length = 352
Score = 300 bits (768), Expect = 8e-79, Method: Compositional matrix adjust.
Identities = 144/302 (47%), Positives = 201/302 (66%), Gaps = 7/302 (2%)
Query: 42 DKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQ-RNKEVTSYWLGLNEFADMS 100
D +++ FE WM+++G+ YK +EK+ RF+IFK N+ HI+ N+ SY LG+N+F DM+
Sbjct: 31 DPMMKRFEEWMAEYGRVYKDNDEKMRRFQIFKNNVNHIETFNNRNGNSYTLGINKFTDMT 90
Query: 101 HEEFKNKYLG-LKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWA 159
+ EF +Y G + ++P F ++ A+ +S+DWR GAVT VK+Q CGSCWA
Sbjct: 91 NNEFVAQYTGGISRPLNIEKEPVVSFDDVNISAVGQSIDWRDYGAVTEVKDQNPCGSCWA 150
Query: 160 FSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEED 219
FS +A VEGI +IV+G L SLSEQE++DC S NGC+GG +D A+ +I+++ G+ E D
Sbjct: 151 FSAIATVEGIYKIVTGYLVSLSEQEVLDCAVS--NGCDGGFVDNAYDFIISNNGVASEAD 208
Query: 220 YPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYS 279
YPY +G C I+GY V NDE S+ A+ +QP++ AI+ASG +FQ+Y+
Sbjct: 209 YPYQAYQGDCAANSWPNSAY-ITGYSYVRSNDESSMKYAVWNQPIAAAIDASGDNFQYYN 267
Query: 280 GGVFTGPCGAELDHGVAAVGYGK-SKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLC 338
GGVF+GPCG L+H + +GYG+ S G+ Y IVKNSWG WGERGYIRM R GLC
Sbjct: 268 GGVFSGPCGTSLNHAITIIGYGQDSSGTQYWIVKNSWGSSWGERGYIRMARGVSS-SGLC 326
Query: 339 GI 340
GI
Sbjct: 327 GI 328
>gi|147772785|emb|CAN62838.1| hypothetical protein VITISV_003391 [Vitis vinifera]
Length = 298
Score = 300 bits (767), Expect = 8e-79, Method: Compositional matrix adjust.
Identities = 164/349 (46%), Positives = 204/349 (58%), Gaps = 55/349 (15%)
Query: 1 MAFFSHSKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYK 60
MA + + + ++L L A +S A S+ H SM E E WM+++G+ YK
Sbjct: 1 MASTNQYQYVSMALLFILAAWASQATSRSL------HEASM---YERHEDWMARYGRMYK 51
Query: 61 CIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQ 120
EK RF+IFK+N+
Sbjct: 52 DANEKEKRFKIFKDNVAQ------------------------------------------ 69
Query: 121 PSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSL 180
+ F Y +V A+P ++DWRKKGAVTP+K+Q CGSCWAFS VAA EGI QI +G L SL
Sbjct: 70 -ATTFKYENVTAVPSTIDWRKKGAVTPIKDQQQCGSCWAFSAVAATEGITQITTGKLISL 128
Query: 181 SEQELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVV 239
SEQEL+DCDT N GC+GGL D AF++I G L E YPY ++GTC KKE
Sbjct: 129 SEQELVDCDTGGENQGCSGGLXDDAFRFIXIHG-LASEATYPYEGDDGTCNSKKEAHPAA 187
Query: 240 TISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVG 299
I GY+DVP N+E++L KA+AHQPV+VAI+A G +FQFY+ GVFTG CG ELDHGVAAVG
Sbjct: 188 KIKGYEDVPANNEKALQKAVAHQPVAVAIDAGGFEFQFYTSGVFTGQCGTELDHGVAAVG 247
Query: 300 YG-KSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
YG G Y +VKNSWG WGE GYIRM+R+ EGLCGI AS P
Sbjct: 248 YGIGDDGMXYWLVKNSWGTGWGEEGYIRMQRDVTAKEGLCGIAMQASYP 296
>gi|356514419|ref|XP_003525903.1| PREDICTED: LOW QUALITY PROTEIN: cysteine proteinase RD21a-like
[Glycine max]
Length = 343
Score = 300 bits (767), Expect = 8e-79, Method: Compositional matrix adjust.
Identities = 164/356 (46%), Positives = 216/356 (60%), Gaps = 32/356 (8%)
Query: 1 MAFFSHSKLLLLSLSLSLFACSSLAHDFSIVGYSPEHL-----TSMDKLIELFESWMSKH 55
M SK + L ++ A SS A D SI+ Y H S ++++ ++E ++KH
Sbjct: 1 MGTNRSSKATIFILFFTVLAVSS-ALDLSIISYDRSHADKSGWRSDEEVMSIYEEXLAKH 59
Query: 56 GKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQF 115
GK Y I+E RF+I KENLK ++Q N +Y +GLN FAD S +
Sbjct: 60 GKVYNAIDEMEERFQISKENLKFVEQHNAGNRTYKVGLNRFADRS-------------RM 106
Query: 116 PTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSG 175
TR PS+ ++ R L +SVDWRK+GAV VK Q C SC F+ +AAVEGIN+IV+G
Sbjct: 107 MTR--PSSRYAPRVSDNLSESVDWRKEGAVVRVKTQSECESCRTFTVIAAVEGINKIVTG 164
Query: 176 NLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEE 235
NLT+LS DCD + N GC+GGL DYA ++I+ +GG+ EEDYP+ G C+ K
Sbjct: 165 NLTALS-----DCDRTVNAGCSGGLADYALEFIINNGGIDTEEDYPFQGAVGICDQYK-- 217
Query: 236 MEVVTISGYQDVPENDEQSLLKALAHQPVSVA-IEASGTDFQFYSGGVFTGPCGAELDHG 294
+ + GY+ VP DE +L KA+A+QPVSVA IEA G +FQ Y G+FTG CG +DHG
Sbjct: 218 --INAVDGYERVPAYDELALKKAVANQPVSVAYIEAYGKEFQLYESGIFTGKCGTSIDHG 275
Query: 295 VAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGK-PEGLCGINKMASIPLK 349
V AVGYG G DY IVKNSWG WGE GY+RM+RNT + G CGI + P+K
Sbjct: 276 VTAVGYGTENGIDYWIVKNSWGENWGEAGYVRMERNTAEDTAGKCGIAILTLYPIK 331
>gi|75277440|sp|O23791.1|BROM1_ANACO RecName: Full=Fruit bromelain; AltName: Allergen=Ana c 2; Flags:
Precursor
gi|2342496|dbj|BAA21849.1| bromelain [Ananas comosus]
Length = 351
Score = 300 bits (767), Expect = 8e-79, Method: Compositional matrix adjust.
Identities = 145/334 (43%), Positives = 212/334 (63%), Gaps = 11/334 (3%)
Query: 16 LSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKEN 75
L LF C+ A + P D +++ FE WM+++G+ YK +EK+ RF+IFK N
Sbjct: 10 LFLFLCAMWASPSAASRDEPN-----DPMMKRFEEWMAEYGRVYKDDDEKMRRFQIFKNN 64
Query: 76 LKHIDQRN-KEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALP 134
+KHI+ N + SY LG+N+F DM+ EF +Y G+ R+P F ++ A+P
Sbjct: 65 VKHIETFNSRNENSYTLGINQFTDMTKSEFVAQYTGVSLPLNIEREPVVSFDDVNISAVP 124
Query: 135 KSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNN 194
+S+DWR GAV VKNQ CGSCW+F+ +A VEGI +I +G L SLSEQE++DC S+
Sbjct: 125 QSIDWRDYGAVNEVKNQNPCGSCWSFAAIATVEGIYKIKTGYLVSLSEQEVLDCAVSY-- 182
Query: 195 GCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQS 254
GC GG ++ A+ +I+++ G+ EE+YPYL +GTC + I+GY V NDE+S
Sbjct: 183 GCKGGWVNKAYDFIISNNGVTTEENYPYLAYQGTC-NANSFPNSAYITGYSYVRRNDERS 241
Query: 255 LLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGK-SKGSDYIIVKN 313
++ A+++QP++ I+AS +FQ+Y+GGVF+GPCG L+H + +GYG+ S G+ Y IV+N
Sbjct: 242 MMYAVSNQPIAALIDAS-ENFQYYNGGVFSGPCGTSLNHAITIIGYGQDSSGTKYWIVRN 300
Query: 314 SWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
SWG WGE GY+RM R G+CGI P
Sbjct: 301 SWGSSWGEGGYVRMARGVSSSSGVCGIAMAPLFP 334
>gi|356515056|ref|XP_003526217.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 342
Score = 300 bits (767), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 161/353 (45%), Positives = 216/353 (61%), Gaps = 19/353 (5%)
Query: 1 MAFFSHSKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYK 60
M FS + L+L L LS++ ++ S S H E WM+++G+ YK
Sbjct: 1 MNSFSQNHYLILFLVLSVWTSHVMSRRLSEACTSERH-----------EKWMAQYGRVYK 49
Query: 61 CIEEKLHRFEIFKENLKHIDQRNKEVTS-YWLGLNEFADMSHEEFKNKYLGLKPQ---FP 116
EK RF++FK N+ I+ N + L +N+FAD++ EEFK + ++ +
Sbjct: 50 DAAEKEKRFQVFKNNVHFIESFNAAGDKPFNLSINQFADLNDEEFKALLINVQKKASWVE 109
Query: 117 TRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGN 176
T Q S F Y V +P ++DWRK+GAVTP+K+QG CGSCWAFS VAA EGI+QI +G
Sbjct: 110 TSTQTS--FRYESVTKIPATIDWRKRGAVTPIKDQGRCGSCWAFSAVAATEGIHQITTGK 167
Query: 177 LTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEM 236
L LSEQEL+DC + GC GG +D AF++I GG+ E YPY TC+ KKE
Sbjct: 168 LVPLSEQELVDCVKGESEGCIGGYVDDAFEFIAKKGGIASETHYPYKGVNKTCKVKKETH 227
Query: 237 EVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVF-TGPCGAELDHGV 295
V I GY+ VP N+E++LLKA+A+QPVSV I+A F++YS G+F CG + +H V
Sbjct: 228 GVAEIKGYEKVPSNNEKALLKAVANQPVSVYIDAGTHAFKYYSSGIFNVRNCGTDPNHAV 287
Query: 296 AAVGYGKS-KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
A VGYGK+ GS Y +VKNSWG +WGERGYIR+KR+ EGLCGI K P
Sbjct: 288 AVVGYGKALDGSKYWLVKNSWGTEWGERGYIRIKRDIRAKEGLCGIAKYPYYP 340
>gi|356517308|ref|XP_003527330.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 342
Score = 299 bits (766), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 157/351 (44%), Positives = 213/351 (60%), Gaps = 15/351 (4%)
Query: 1 MAFFSHSKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYK 60
M FS + L+L L L+++ ++ S S H E WM+++G+ YK
Sbjct: 1 MNSFSQNHYLILFLVLAVWTSHVMSRRLSEACTSERH-----------EKWMAQYGRVYK 49
Query: 61 CIEEKLHRFEIFKENLKHIDQRNKEVTS-YWLGLNEFADMSHEEFKNKYLGLKPQFP-TR 118
EK RF++FK N+ I+ N + L +N+FAD++ EEFK + ++ +
Sbjct: 50 DAAEKEKRFQVFKNNVHFIESFNAAGDKPFNLSINQFADLNDEEFKALLINVQKKASWVE 109
Query: 119 RQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLT 178
F Y V +P ++DWRK+GAVTP+K+QG CGSCWAFS VAA EGI+QI +G L
Sbjct: 110 TSTETSFRYESVTKIPATIDWRKRGAVTPIKDQGRCGSCWAFSAVAATEGIHQITTGKLV 169
Query: 179 SLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEV 238
LSEQEL+DC + GC GG +D AF++I GG+ E YPY TC+ KKE V
Sbjct: 170 PLSEQELVDCVKGESEGCIGGYVDDAFEFIAKKGGIASETHYPYKGVNKTCKVKKETHGV 229
Query: 239 VTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGP-CGAELDHGVAA 297
I GY+ VP N+E++LLKA+A+QPVSV I+A F++YS G+F CG + +H VA
Sbjct: 230 AEIKGYEKVPSNNEKALLKAVANQPVSVYIDAGTHAFKYYSSGIFNARNCGTDPNHAVAV 289
Query: 298 VGYGKS-KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
VGYGK+ GS Y +VKNSWG +WGERGYIR+KR+ EGLCGI K P
Sbjct: 290 VGYGKALDGSKYWLVKNSWGTEWGERGYIRIKRDIRAKEGLCGIAKYPYYP 340
>gi|449450419|ref|XP_004142960.1| PREDICTED: vignain-like [Cucumis sativus]
Length = 345
Score = 299 bits (766), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 166/333 (49%), Positives = 216/333 (64%), Gaps = 11/333 (3%)
Query: 22 SSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQ 81
S LA F + + L + + L +L+E W KH + ++EK RF +FKEN+ H+
Sbjct: 18 SGLAESFE---FDEKELATEESLWQLYERW-GKHHTISRNLKEKHKRFSVFKENVNHVFT 73
Query: 82 RNKEVTSYWLGLNEFADMSHEEFKNKY----LGLKPQFPTRRQPSAEFSYRDVKALPKSV 137
N+ Y L LN+FADMS+ EF N Y + + RR+ + F Y LP SV
Sbjct: 74 VNQMDKPYKLKLNKFADMSNYEFVNFYARSNISHYRKLHERRRGAGGFMYEQDTDLPSSV 133
Query: 138 DWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCN 197
DWR++GAV VK QG CGSCWAFS+VAAVEGIN+I + L SLSEQEL+DC+ N GCN
Sbjct: 134 DWRERGAVNAVKEQGRCGSCWAFSSVAAVEGINKIKTNQLLSLSEQELLDCNYR-NKGCN 192
Query: 198 GGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLK 257
GG M+ AF +I +GG+ E YPY G C + +V I GY+ VPEN E +L++
Sbjct: 193 GGFMEIAFDFIKRNGGIATENSYPYHGSRGLCRSSRISSPIVKIDGYESVPEN-EDALMQ 251
Query: 258 ALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSK-GSDYIIVKNSWG 316
A+A+QPVSVAI+A+G DFQFYS GVF G CG EL+HGV A+GYG ++ G+DY +V+NSWG
Sbjct: 252 AVANQPVSVAIDAAGRDFQFYSQGVFDGYCGTELNHGVVAIGYGTTEDGTDYWLVRNSWG 311
Query: 317 PKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
WGE GY+RMKR + EGLCGI AS P+K
Sbjct: 312 VGWGEDGYVRMKRGVEQAEGLCGIAMEASYPIK 344
>gi|32396018|gb|AAP41846.1| cysteine protease [Anthurium andraeanum]
Length = 502
Score = 299 bits (765), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 161/345 (46%), Positives = 215/345 (62%), Gaps = 18/345 (5%)
Query: 22 SSLAHDFSIVGYSPEHLTS--MDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHI 79
S A ++ G PE + + ++ ELFE WM KH K Y EK R+ F NL +
Sbjct: 23 SCSAGEWPSSGQGPEDVGAGGVEGGQELFERWMEKHRKVYAHPGEKARRYANFLSNLAFV 82
Query: 80 DQRNKE-----VTSYWLGLNEFADMSHEEFKNKYLG--LKPQFPTRRQPSAEFSYRDVKA 132
+RN E + +G+N FAD+S+EEF+ Y L+ + R V A
Sbjct: 83 RKRNAEGRRAPSSGQGVGMNVFADLSNEEFREVYSSRVLRKKAAEGRGARRRAGEGRVVA 142
Query: 133 ---LPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCD 189
P S+DWRK+GAVT VKNQG CGSCWAFS+ A+EGIN I +G L SLSEQEL+DCD
Sbjct: 143 GCDAPASLDWRKRGAVTAVKNQGDCGSCWAFSSTGAMEGINAITTGELISLSEQELVDCD 202
Query: 190 TSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLME-EGTCEDKKEEMEVVTISGYQDVP 248
T+ N GC+GG MDYAF++++ +GG+ E +YPY + + C KEE++VV+I GY+DV
Sbjct: 203 TT-NEGCDGGYMDYAFEWVINNGGIDSEANYPYTGQADSVCNTTKEEIKVVSIDGYEDVA 261
Query: 249 ENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGA---ELDHGVAAVGYGKSKG 305
+ E +LL A QPVSV I+ S DFQ Y+GG++ G C ++DH V VGYG+ G
Sbjct: 262 TS-ESALLCAAVQQPVSVGIDGSSLDFQLYAGGIYDGDCSGNPDDIDHAVLVVGYGQQGG 320
Query: 306 SDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLKK 350
+DY IVKNSWG WG +GYI ++RNTG P G+C I+ MAS P K+
Sbjct: 321 TDYWIVKNSWGTDWGMQGYIYIRRNTGLPYGVCAIDAMASYPTKQ 365
>gi|242068363|ref|XP_002449458.1| hypothetical protein SORBIDRAFT_05g013840 [Sorghum bicolor]
gi|241935301|gb|EES08446.1| hypothetical protein SORBIDRAFT_05g013840 [Sorghum bicolor]
Length = 350
Score = 299 bits (765), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 163/357 (45%), Positives = 225/357 (63%), Gaps = 23/357 (6%)
Query: 1 MAFFSHSKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYK 60
M F+ + L++L++ + L+ S GY E + + WM++HG+TYK
Sbjct: 10 MITFTAAALMILAVMTMVVEARDLST--STGGYGEEAMKVR------HQQWMAEHGRTYK 61
Query: 61 CIEEKLHRFEIFKENLKHIDQRNKEV-TSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRR 119
EK RF++FK N +D+ N SY L +NEFADM+++EF Y GLKP P
Sbjct: 62 DEAEKARRFQVFKANADFVDRSNAAGGKSYELAINEFADMTNDEFVAMYTGLKP-VPAGP 120
Query: 120 QPSAEFSYRDVK---ALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGN 176
+ A F Y ++ ++VDWR+KGAVT +KNQG CG CWAF+ VAAVE I+QI +GN
Sbjct: 121 KKMAGFKYENLTLSDVDQQAVDWRQKGAVTGIKNQGQCGCCWAFAAVAAVESIHQITTGN 180
Query: 177 LTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEM 236
L SLSEQ+++DCDT NNGCNGG +D AF+YI+++GGL E+ YPY +GTC+ +
Sbjct: 181 LVSLSEQQVLDCDTDGNNGCNGGYIDNAFQYIISNGGLATEDAYPYAAAQGTCQSSVQ-- 238
Query: 237 EVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTG-PCGA-ELDHG 294
VTIS YQDVP DE +L A+A+QPV+VAI+A +FQFYS GV T CG L+H
Sbjct: 239 PAVTISSYQDVPSGDEAALAAAVANQPVAVAIDAH-NNFQFYSSGVLTADTCGTPSLNHA 297
Query: 295 VAAVGYGKSK-GSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLKK 350
V AVGY ++ G+ Y ++KN WG WGE GY+R++R T CG+ + AS P+ +
Sbjct: 298 VTAVGYSTAEDGTPYWLLKNQWGQNWGEGGYLRVERGTNA----CGVAQQASYPVAR 350
>gi|18408828|ref|NP_566920.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|12324451|gb|AAG52191.1|AC012329_18 putative cysteine proteinase; 15366-14136 [Arabidopsis thaliana]
gi|6723404|emb|CAB66413.1| cysteine protease-like protein [Arabidopsis thaliana]
gi|332645009|gb|AEE78530.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 341
Score = 298 bits (764), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 151/312 (48%), Positives = 202/312 (64%), Gaps = 11/312 (3%)
Query: 45 IELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT-SYWLGLNEFADMSHEE 103
+E E WMS+ + Y EK RFEIF NLK ++ N +Y L +NEF+D++ EE
Sbjct: 32 VEKHEQWMSRFNRVYSDDSEKTSRFEIFTNNLKFVESINMNTNKTYTLDVNEFSDLTDEE 91
Query: 104 FKNKYLGLK-PQFPTR-----RQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSC 157
FK +Y GL P+ TR + F Y +V +S+DW ++GAVT VK+Q CG C
Sbjct: 92 FKARYTGLVVPEGMTRISTTDSHETVSFRYENVGETGESMDWIQEGAVTSVKHQQQCGCC 151
Query: 158 WAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKE 217
WAFS VAAVEG+ +I +G L SLSEQ+L+DC T NNGC GG+M AF YI + G+ E
Sbjct: 152 WAFSAVAAVEGMTKIANGELVSLSEQQLLDCSTE-NNGCGGGIMWKAFDYIKENQGITTE 210
Query: 218 EDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQF 277
++YPY + TCE + TISGY+ VP+NDE++LLKA++ QPVSVAIE SG +F
Sbjct: 211 DNYPYQGAQQTCESNH--LAAATISGYETVPQNDEEALLKAVSQQPVSVAIEGSGYEFIH 268
Query: 278 YSGGVFTGPCGAELDHGVAAVGYGKS-KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEG 336
YSGG+F G CG +L H V VGYG S +G Y ++KNSWG WGE GY+R+ R+ P+G
Sbjct: 269 YSGGIFNGECGTQLTHAVTIVGYGVSEEGIKYWLLKNSWGESWGENGYMRIMRDVDSPQG 328
Query: 337 LCGINKMASIPL 348
+CG+ +A P+
Sbjct: 329 MCGLASLAYYPV 340
>gi|356543118|ref|XP_003540010.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 339
Score = 298 bits (762), Expect = 4e-78, Method: Compositional matrix adjust.
Identities = 151/306 (49%), Positives = 195/306 (63%), Gaps = 6/306 (1%)
Query: 44 LIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTS-YWLGLNEFADMSHE 102
+ E E W K+GK YK EK R IFK+N++ I+ N Y L +N D ++E
Sbjct: 36 MSERHEQWTKKYGKVYKDAAEKQKRLLIFKDNVEFIESFNAAGNKPYKLSINHLTDQTNE 95
Query: 103 EFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFST 162
EF + G K + + P F Y ++ +P +VDWR+ GAV +K+QG CG+CWAFST
Sbjct: 96 EFVASHNGYKHKGSHSQTP---FKYENITGVPNAVDWRENGAVXAMKDQGQCGNCWAFST 152
Query: 163 VAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPY 222
VA EGI QI + L SLSEQEL+DCD S ++GC+GG M+ F++I +GG+ E +YPY
Sbjct: 153 VATTEGIYQITTSMLMSLSEQELVDCD-SVDHGCDGGYMEGGFEFIXKNGGISSEANYPY 211
Query: 223 LMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGV 282
+GT + KE I GY+ VP N E +L KA+A+QPVSV I+ G+ FQF S GV
Sbjct: 212 TAVDGTYDANKEASPAAQIKGYETVPANSEDALQKAVANQPVSVTIDVGGSAFQFNSSGV 271
Query: 283 FTGPCGAELDHGVAAVGYGKS-KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGIN 341
FTG CG +LDHGV AVGYG + G+ Y IVKNSWG +WGE GYIRM+R T EGLCGI
Sbjct: 272 FTGQCGTQLDHGVTAVGYGSTDDGTQYWIVKNSWGTQWGEEGYIRMQRGTDAQEGLCGIA 331
Query: 342 KMASIP 347
AS P
Sbjct: 332 MDASYP 337
>gi|57118011|gb|AAW34137.1| cysteine protease gp3b [Zingiber officinale]
Length = 466
Score = 297 bits (760), Expect = 5e-78, Method: Compositional matrix adjust.
Identities = 149/310 (48%), Positives = 203/310 (65%), Gaps = 7/310 (2%)
Query: 47 LFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT----SYWLGLNEFADMSHE 102
+++ W +KH +R E+FKENL+ +D+ N +Y LG+N FAD+++E
Sbjct: 42 IYQEWRAKHRPAENDQYVGDYRLEVFKENLRFVDEHNAAADRGEHAYRLGMNRFADLTNE 101
Query: 103 EFKNKYLGLKPQF--PTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAF 160
E++ ++L + T + S ++ R+ LP S+DWR+KGAV VK+QG CGSCWAF
Sbjct: 102 EYRARFLRDLSRLGRSTSGEISNQYRLREGDVLPDSIDWREKGAVVAVKSQGRCGSCWAF 161
Query: 161 STVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDY 220
+ +A VEGINQIV+G+L SLSEQ+L+DC T N+GC GG AF+YI+ +GG++ EE Y
Sbjct: 162 AAIATVEGINQIVTGDLISLSEQQLVDCSTR-NHGCEGGWPYRAFQYIINNGGVNSEEHY 220
Query: 221 PYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSG 280
PY GTC K VV+I Y++VP NDE+SL KA+A+QP+SV I ASG +FQ Y
Sbjct: 221 PYTGTNGTCNTTKGNAHVVSIDSYRNVPSNDEKSLQKAVANQPISVGINASGRNFQLYHS 280
Query: 281 GVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGI 340
G+FTG C L+HGV VGYG G+DY IVKNSWG WG+ GYI M+RN + G CGI
Sbjct: 281 GIFTGSCNTSLNHGVTVVGYGTVNGNDYWIVKNSWGESWGDSGYILMERNIAESSGKCGI 340
Query: 341 NKMASIPLKK 350
S P+K+
Sbjct: 341 AISPSYPIKE 350
>gi|326520659|dbj|BAJ92693.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 289
Score = 297 bits (760), Expect = 6e-78, Method: Compositional matrix adjust.
Identities = 140/260 (53%), Positives = 186/260 (71%), Gaps = 7/260 (2%)
Query: 23 SLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQR 82
+ A D SIV Y S +++ ++ WM++HG TY I E+ RFE F++NL++IDQ
Sbjct: 21 AAAADMSIVSYGER---SEEEVRRMYAEWMAEHGSTYNAIGEEERRFEAFRDNLRYIDQH 77
Query: 83 NKE----VTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVD 138
N V S+ LGLN FAD+++EE+++ YLG + + R+ SA + D LP+SVD
Sbjct: 78 NAAADAGVHSFRLGLNRFADLTNEEYRSTYLGARTKPDRERKLSARYQAADNDELPESVD 137
Query: 139 WRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNG 198
WRKKGAV VK+QG CGSCWAFS +AAVEGINQIV+G++ LSEQEL+DCDTS+N GCNG
Sbjct: 138 WRKKGAVGAVKDQGGCGSCWAFSAIAAVEGINQIVTGDMIPLSEQELVDCDTSYNQGCNG 197
Query: 199 GLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKA 258
GLMDYAF++I+ +GG+ EEDYPY + C+ K+ +VVTI GY+DVP N E+SL KA
Sbjct: 198 GLMDYAFEFIINNGGIDSEEDYPYKERDNRCDANKKNAKVVTIDGYEDVPVNSEKSLQKA 257
Query: 259 LAHQPVSVAIEASGTDFQFY 278
+A+QP+SVAIEA G FQ Y
Sbjct: 258 VANQPISVAIEAGGRAFQLY 277
>gi|2463588|dbj|BAA22546.1| FB1035 precursor [Ananas comosus]
Length = 324
Score = 296 bits (758), Expect = 9e-78, Method: Compositional matrix adjust.
Identities = 138/301 (45%), Positives = 201/301 (66%), Gaps = 6/301 (1%)
Query: 42 DKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRN-KEVTSYWLGLNEFADMS 100
D +++ FE WM+++G+ YK +EK+ RF+IFK N+KHI+ N + SY LG+N+F DM+
Sbjct: 4 DPMMKRFEEWMAEYGRIYKDNDEKMRRFQIFKNNVKHIETFNSRNGNSYTLGINQFTDMT 63
Query: 101 HEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAF 160
EF +Y G+ R+P F ++ A+P+S+DWR GAV VKNQ CGSCWAF
Sbjct: 64 KSEFVAQYTGVSLPLNIEREPVVSFDDVNISAVPQSIDWRDYGAVNEVKNQNPCGSCWAF 123
Query: 161 STVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDY 220
+ +A VEGI +I +G L SLSEQE++DC S+ GC GG ++ A+ +I+++ G+ EE+Y
Sbjct: 124 AAIATVEGIYKIKTGYLVSLSEQEVLDCAVSY--GCKGGWVNKAYDFIISNNGVTTEENY 181
Query: 221 PYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSG 280
PY +GTC + I+GY V NDE+S++ A+++QP++ I+AS +FQ+Y+G
Sbjct: 182 PYQAYQGTC-NANSFPNSAYITGYSYVRRNDERSMMYAVSNQPIAALIDAS-ENFQYYNG 239
Query: 281 GVFTGPCGAELDHGVAAVGYGK-SKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCG 339
GVF+GPCG L+H + +GYG+ S G+ Y IV+NSWG WGE GY+RM R G CG
Sbjct: 240 GVFSGPCGTSLNHAITIIGYGQDSSGTKYWIVRNSWGSSWGEGGYVRMARGVSSSSGACG 299
Query: 340 I 340
I
Sbjct: 300 I 300
>gi|400180443|gb|AFP73358.1| cysteine protease, partial [Solanum habrochaites]
Length = 345
Score = 296 bits (757), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 161/347 (46%), Positives = 215/347 (61%), Gaps = 16/347 (4%)
Query: 8 KLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLH 67
K+ L+S+ ++LF S+ + + P+ S E E WMS+HG+ YK EK
Sbjct: 4 KVDLMSILITLFFVISMFNSQTRARSQPKLSVS-----ERHELWMSRHGRVYKDEVEKGE 58
Query: 68 RFEIFKENLKHIDQRNKEVT-SYWLGLNEFADMSHEEFKNKYLGLK---PQFPTRRQPSA 123
RF IFKEN+K I+ NK SY LG+NEFAD++ EEF K+ GL PS
Sbjct: 59 RFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSEEFLAKFTGLNIPNSYLSPSPMPST 118
Query: 124 EFSYRDVKA--LPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLS 181
EF D+ +P ++DWR+ GAVT VKNQG CG CWAFS V ++EG +I +GNL S
Sbjct: 119 EFKINDLSDDDMPSNLDWRESGAVTQVKNQGQCGCCWAFSAVGSLEGAYKIATGNLMEFS 178
Query: 182 EQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTI 241
EQEL+DC T+ N GCNGG M AF +I+ +GG+ +E DY YL ++ TC + + V I
Sbjct: 179 EQELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQYTCRSQG-KTAAVQI 236
Query: 242 SGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYG 301
S YQ VPE E SLL+A+ QPVS+ I AS D QFY+GG + G C ++H V A+GYG
Sbjct: 237 SNYQVVPEG-ETSLLQAVTKQPVSIGIAAS-HDLQFYAGGTYDGSCANRINHAVTAIGYG 294
Query: 302 KS-KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
KG Y ++KNSWG WGE G++++ R++G P GLC I KM+S P
Sbjct: 295 TDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPAGLCDIAKMSSYP 341
>gi|297826061|ref|XP_002880913.1| hypothetical protein ARALYDRAFT_481640 [Arabidopsis lyrata subsp.
lyrata]
gi|297326752|gb|EFH57172.1| hypothetical protein ARALYDRAFT_481640 [Arabidopsis lyrata subsp.
lyrata]
Length = 347
Score = 295 bits (756), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 148/316 (46%), Positives = 205/316 (64%), Gaps = 13/316 (4%)
Query: 45 IELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHID--QRNKEVTSYWLGLNEFADMSHE 102
IE E WM++ + Y EK +RF IFK+NL+ + NK +T Y L +NEF+D++ E
Sbjct: 32 IEKHEQWMARFNRVYSDESEKRNRFNIFKKNLEFVQSFNMNKNIT-YKLDVNEFSDLTDE 90
Query: 103 EFKNKYLGLK-PQFPT-----RRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGS 156
EF+ + GL P+ T + F Y +V +S+DWR++GAVTPVK QG CG
Sbjct: 91 EFRATHTGLVVPEEITGISTLSSDKTVPFRYGNVSDTGESMDWRQEGAVTPVKYQGRCGG 150
Query: 157 CWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHK 216
CWAFS VAAVEGI +I G L SLSEQ+L+DCDT +N GC+GG+M AF+YI+ + G+
Sbjct: 151 CWAFSAVAAVEGITKITKGELVSLSEQQLLDCDTDYNQGCHGGIMSKAFEYIIKNQGITT 210
Query: 217 EEDYPYLMEE---GTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGT 273
E++YPY + + TISGY+ VP N+E++LL+A++ QPVSV IE +G
Sbjct: 211 EDNYPYQESQQTCSSSTTLSSSFRAATISGYETVPMNNEEALLQAVSQQPVSVGIEGTGA 270
Query: 274 DFQFYSGGVFTGPCGAELDHGVAAVGYGKS-KGSDYIIVKNSWGPKWGERGYIRMKRNTG 332
F+ YSGG+F G CG +L H V VGYG S +G+ Y +VKNSWG WGE G++R+KR+
Sbjct: 271 GFRHYSGGIFNGECGTDLHHAVTIVGYGMSEEGTKYWVVKNSWGETWGEDGFMRIKRDVD 330
Query: 333 KPEGLCGINKMASIPL 348
P+G+CG+ +A PL
Sbjct: 331 APQGMCGLAMLAFYPL 346
>gi|356557743|ref|XP_003547170.1| PREDICTED: LOW QUALITY PROTEIN: xylem cysteine proteinase 1-like
[Glycine max]
Length = 400
Score = 295 bits (756), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 157/353 (44%), Positives = 223/353 (63%), Gaps = 10/353 (2%)
Query: 5 SHSKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEE 64
+H LL + F C L ++SI+ + S + ++ELF+ W ++ K Y+ EE
Sbjct: 7 THLFLLFIVWGSWSFLCYDLPSEYSILALEIDKFPSEEGVVELFQRWKEENKKIYRNPEE 66
Query: 65 KLHRFEIFKENLKHIDQRN-KEVTSYW--LGLNEFADMSHEEFKNKYLGLKPQFPTRRQP 121
+ RFE FK NLK+I ++N K ++ Y LGLN+FADMS+EEFK+K++ K + P ++
Sbjct: 67 EKLRFENFKRNLKYIVEKNSKRISPYGQSLGLNQFADMSNEEFKSKFMS-KVKKPFSKRN 125
Query: 122 SAEFSYRDVKALPKSVDWRKKGAVT-PVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSL 180
+ P S+DWRKKG VT VK+QG CGS WAFS+ A+EGIN IV+ +L SL
Sbjct: 126 GVSSKDHSCEDEPYSLDWRKKGVVTLAVKDQGYCGSYWAFSSTDAIEGINAIVTADLISL 185
Query: 181 SEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVT 240
SEQEL+DCD++ N+GC+GG MDYAF++++ +GG+ E +YPY+ +GTC KE+ +V+
Sbjct: 186 SEQELVDCDST-NDGCDGGXMDYAFEWVMYNGGIDTETNYPYIGADGTCNVTKEKTKVIG 244
Query: 241 ISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGA---ELDHGVAA 297
I GY DV ++D SLL A QP+S I+ + DFQ Y GG++ G C + ++DH +
Sbjct: 245 IDGYYDVGQSD-SSLLCATVKQPISAGIDGTSWDFQLYIGGIYDGDCSSDPDDIDHAILV 303
Query: 298 VGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLKK 350
VGYG DY IVKNSW WG G I +++NT G C IN MAS P K+
Sbjct: 304 VGYGSEGDDDYWIVKNSWRTSWGMEGCIYLRKNTNLKYGXCAINYMASYPTKE 356
>gi|356543010|ref|XP_003539956.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 306
Score = 295 bits (756), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 159/305 (52%), Positives = 196/305 (64%), Gaps = 7/305 (2%)
Query: 48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNK 107
FE W+ ++ + YK EE RF I++ NL++I+ +N + SY L N+FAD+++EEF +
Sbjct: 5 FERWLKQNDRXYKDKEEWEVRFGIYQANLEYIECKNSQEXSYNLTDNKFADLTNEEFVSP 64
Query: 108 YLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVE 167
YLG F TR P F Y + + LP+S DWRK+GAV+ +K+QG+CGSCWAFS VAAVE
Sbjct: 65 YLG----FGTRFLPHTGFMYHEHEDLPESKDWRKEGAVSDIKDQGNCGSCWAFSAVAAVE 120
Query: 168 GINQIVSGNLTSLSEQELIDCDT-SFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEE 226
GIN+I SG L SLSEQE DCD N GC GGLMD AF +I +GGL +DYPY +
Sbjct: 121 GINKIKSGKLVSLSEQEFRDCDVEDGNQGCEGGLMDTAFAFIKKNGGLTTSKDYPYEGVD 180
Query: 227 GTCEDKKEEMEVVTISGYQDVPENDEQSLLKALA--HQPVSVAIEASGTDFQFYSGGVFT 284
GTC +K ISG+ VP NDE L A +Q SVAI+A G FQ Y GVF+
Sbjct: 181 GTCNKEKALHHAANISGHVKVPANDEAMLKAKAAAANQXESVAIDAGGHAFQLYLKGVFS 240
Query: 285 GPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMA 344
G CG +L+HGV VGYGK Y IVKNSWG WGE GYIRMKR+ G CGI A
Sbjct: 241 GICGKQLNHGVTIVGYGKGTSDKYWIVKNSWGADWGESGYIRMKRDAFDKAGTCGIAMQA 300
Query: 345 SIPLK 349
S PLK
Sbjct: 301 SYPLK 305
>gi|218202389|gb|EEC84816.1| hypothetical protein OsI_31898 [Oryza sativa Indica Group]
Length = 350
Score = 295 bits (755), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 155/331 (46%), Positives = 204/331 (61%), Gaps = 23/331 (6%)
Query: 38 LTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFA 97
LT D +++ FE WM +HG+ Y EK RFE+++ N++ ++ N Y L N+FA
Sbjct: 22 LTRADLMLDRFEQWMIRHGRAYTDSGEKQRRFEVYRRNVELVETFNSMSNGYKLADNKFA 81
Query: 98 DMSHEEFKNKYLGLKPQFPTRR---QPSAEF-----SYRDVKALPKSVDWRKKGAVTPVK 149
D+++EEF+ K LG +P + SA+ S D+ LPKSVDWRKKGAV VK
Sbjct: 82 DLTNEEFRAKMLGFRPHVTIPQISNTCSADIAMPGESSDDI--LPKSVDWRKKGAVVEVK 139
Query: 150 NQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIV 209
NQG CGSCWAFS VAA+EGINQI +G L SLSEQEL+DCD GC GG M +AF+++V
Sbjct: 140 NQGDCGSCWAFSAVAAIEGINQIKNGELVSLSEQELVDCDDE-AVGCGGGYMSWAFEFVV 198
Query: 210 ASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIE 269
+ GL E YPY G C+ K V I+GY++V + E L +A A QPVSVA++
Sbjct: 199 GNHGLTTEASYPYHAANGACQAAKLNQSAVAIAGYRNVTPSSEPDLARAAAAQPVSVAVD 258
Query: 270 ASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSK-----------GSDYIIVKNSWGPK 318
FQ Y GV+TGPC A+++HGV VGYG+S+ G Y IVKNSWG +
Sbjct: 259 GGSFMFQLYGSGVYTGPCTADVNHGVTVVGYGESEPKTDGGGAAKGGEKYWIVKNSWGAE 318
Query: 319 WGERGYIRMKRNT-GKPEGLCGINKMASIPL 348
WG+ GYI M+R+ G GLCGI + S P+
Sbjct: 319 WGDAGYILMQRDVAGLASGLCGIALLPSYPV 349
>gi|18401420|ref|NP_565649.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|4314384|gb|AAD15594.1| cysteine proteinase [Arabidopsis thaliana]
gi|17381154|gb|AAL36389.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|20465849|gb|AAM20029.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|330252901|gb|AEC07995.1| cysteine proteinase-like protein [Arabidopsis thaliana]
Length = 348
Score = 295 bits (755), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 147/316 (46%), Positives = 203/316 (64%), Gaps = 12/316 (3%)
Query: 45 IELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRN-KEVTSYWLGLNEFADMSHEE 103
IE E WM++ + Y EK +RF IFK+NL+ + N +Y + +NEF+D++ EE
Sbjct: 32 IEKHEQWMARFNRVYSDETEKRNRFNIFKKNLEFVQNFNMNNKITYKVDINEFSDLTDEE 91
Query: 104 FKNKYLGLK-PQFPTR------RQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGS 156
F+ + GL P+ TR + + F Y +V +S+DWR++GAVTPVK QG CG
Sbjct: 92 FRATHTGLVVPEAITRISTLSSGKNTVPFRYGNVSDNGESMDWRQEGAVTPVKYQGRCGG 151
Query: 157 CWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHK 216
CWAFS VAAVEGI +I G L SLSEQ+L+DCD +N GC GG+M AF+YI+ + G+
Sbjct: 152 CWAFSAVAAVEGITKITKGELVSLSEQQLLDCDRDYNQGCRGGIMSKAFEYIIKNQGITT 211
Query: 217 EEDYPYLMEE---GTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGT 273
E++YPY + + TISGY+ VP N+E++LL+A++ QPVSV IE +G
Sbjct: 212 EDNYPYQESQQTCSSSTTLSSSFRAATISGYETVPMNNEEALLQAVSQQPVSVGIEGTGA 271
Query: 274 DFQFYSGGVFTGPCGAELDHGVAAVGYGKS-KGSDYIIVKNSWGPKWGERGYIRMKRNTG 332
F+ YSGGVF G CG +L H V VGYG S +G+ Y +VKNSWG WGE GY+R+KR+
Sbjct: 272 AFRHYSGGVFNGECGTDLHHAVTIVGYGMSEEGTKYWVVKNSWGETWGENGYMRIKRDVD 331
Query: 333 KPEGLCGINKMASIPL 348
P+G+CG+ +A PL
Sbjct: 332 APQGMCGLAILAFYPL 347
>gi|238007404|gb|ACR34737.1| unknown [Zea mays]
gi|413943289|gb|AFW75938.1| cysteine proteinase Mir2 [Zea mays]
Length = 484
Score = 295 bits (754), Expect = 3e-77, Method: Compositional matrix adjust.
Identities = 155/339 (45%), Positives = 212/339 (62%), Gaps = 17/339 (5%)
Query: 29 SIVGYSPEHLTSMDKLIELFESWMSKH------GKTYKCI----EEKLHRFEIFKENLKH 78
+ V +P + +++ L+E W S+H G T + ++ R E+F+ NL++
Sbjct: 34 AAVTVTPPPERTDEEVRRLYEEWRSEHDAGPRRGATGGSLGPGEDDDARRLEVFRYNLRY 93
Query: 79 IDQRNKEVTS----YWLGLNEFADMSHEEFKNKYL-GLKPQFPTRRQPSAEFSYRDV--K 131
ID N E + + LGL FAD++ EE++ + L G + + T Y + +
Sbjct: 94 IDAHNAEADAGLHGFRLGLTRFADLTLEEYRARLLLGSRGRNGTAVGVVGSRRYLPLAGE 153
Query: 132 ALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTS 191
LP +VDWR++GAV VK+QG CG+CWAFS VAAVEGIN+IV+G+L SLSEQELIDCD
Sbjct: 154 QLPDAVDWRERGAVAEVKDQGQCGACWAFSAVAAVEGINKIVTGSLISLSEQELIDCDKF 213
Query: 192 FNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPEND 251
+ GC+GGLMD AF +++ +GG+ E DYP+ +GTC+ K + VV+I ++ VP N
Sbjct: 214 QDQGCDGGLMDNAFVFMIKNGGIDTEADYPFTGHDGTCDLKLKNTRVVSIDSFERVPINY 273
Query: 252 EQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIV 311
E++L KA+AHQPVS +IEAS FQ YS G+F G CG LDHGV VGYG G DY IV
Sbjct: 274 ERALQKAVAHQPVSASIEASRRAFQLYSSGIFDGRCGTYLDHGVTVVGYGSEGGKDYWIV 333
Query: 312 KNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLKK 350
KNSWG +WGE GY+RM RN G CGI P+K+
Sbjct: 334 KNSWGTQWGEAGYVRMARNVRVRAGKCGIAMEPLYPVKE 372
>gi|115479933|ref|NP_001063560.1| Os09g0497500 [Oryza sativa Japonica Group]
gi|113631793|dbj|BAF25474.1| Os09g0497500 [Oryza sativa Japonica Group]
gi|215704298|dbj|BAG93138.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 349
Score = 294 bits (753), Expect = 4e-77, Method: Compositional matrix adjust.
Identities = 154/331 (46%), Positives = 203/331 (61%), Gaps = 23/331 (6%)
Query: 38 LTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFA 97
L D +++ FE WM +HG+ Y EK RFE+++ N++ ++ N Y L N+FA
Sbjct: 21 LARADLMLDRFEQWMIRHGRAYTDAGEKQRRFEVYRRNVELVETFNSMSNGYKLADNKFA 80
Query: 98 DMSHEEFKNKYLGLKPQFPTRR---QPSAEF-----SYRDVKALPKSVDWRKKGAVTPVK 149
D+++EEF+ K LG +P + SA+ S D+ LPKSVDWRKKGAV VK
Sbjct: 81 DLTNEEFRAKMLGFRPHVTIPQISNTCSADIAMPGESSDDI--LPKSVDWRKKGAVVEVK 138
Query: 150 NQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIV 209
NQG CGSCWAFS VAA+EGINQI +G L SLSEQEL+DCD GC GG M +AF+++V
Sbjct: 139 NQGDCGSCWAFSAVAAIEGINQIKNGELVSLSEQELVDCDDE-AVGCGGGYMSWAFEFVV 197
Query: 210 ASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIE 269
+ GL E YPY G C+ K V I+GY++V + E L +A A QPVSVA++
Sbjct: 198 GNHGLTTEASYPYHAANGACQAAKLNQSAVAIAGYRNVTPSSEPDLARAAAAQPVSVAVD 257
Query: 270 ASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSK-----------GSDYIIVKNSWGPK 318
FQ Y GV+TGPC A+++HGV VGYG+S+ G Y IVKNSWG +
Sbjct: 258 GGSFMFQLYGSGVYTGPCTADVNHGVTVVGYGESEPKTDGGGAAKGGEKYWIVKNSWGAE 317
Query: 319 WGERGYIRMKRNT-GKPEGLCGINKMASIPL 348
WG+ GYI M+R+ G GLCGI + S P+
Sbjct: 318 WGDAGYILMQRDVAGLASGLCGIALLPSYPV 348
>gi|212275830|ref|NP_001130503.1| cysteine protease 1 [Zea mays]
gi|194689328|gb|ACF78748.1| unknown [Zea mays]
gi|219886279|gb|ACL53514.1| unknown [Zea mays]
gi|238010470|gb|ACR36270.1| unknown [Zea mays]
gi|413920875|gb|AFW60807.1| cysteine protease 1 [Zea mays]
Length = 354
Score = 294 bits (753), Expect = 4e-77, Method: Compositional matrix adjust.
Identities = 162/358 (45%), Positives = 220/358 (61%), Gaps = 34/358 (9%)
Query: 9 LLLLSLSLSLFACSSL---AHDFSIV---GYSPEHLTSMDKLIELFESWMSKHGKTYKCI 62
+ +++L++ A +++ A D S GY E + + WM++HG+TY+
Sbjct: 12 ITFTAVALTILAVTTMMAEARDLSSTSTGGYGEEAMKVR------HQQWMAEHGRTYRDE 65
Query: 63 EEKLHRFEIFKENLKHIDQRNK---EVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRR 119
EK HRF++FK N +D N + SY L LNEFADM+++EF Y GL+P P
Sbjct: 66 AEKAHRFQVFKANADFVDASNAAGDDKKSYRLELNEFADMTNDEFMAMYTGLRP-VPAGA 124
Query: 120 QPSAEFSY-----RDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVS 174
+ A F Y D ++VDWR+KGAVT +KNQG CG CWAF+ VAAVEGI+QI +
Sbjct: 125 KKMAGFKYGNVTLSDADDDQQTVDWRQKGAVTGIKNQGQCGCCWAFAAVAAVEGIHQITT 184
Query: 175 GNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKE 234
GNL SLSEQ+++DCDT NNGCNGG +D AF+YIV +GGL E+ YPY + C+ +
Sbjct: 185 GNLVSLSEQQVLDCDTDGNNGCNGGYIDNAFQYIVGNGGLGTEDAYPYTAAQAMCQSVQ- 243
Query: 235 EMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGP-CGA--EL 291
V ISGYQDVP DE +L A+A+QPVSVAI+A +FQ Y GGV T C L
Sbjct: 244 --PVAAISGYQDVPSGDEAALAAAVANQPVSVAIDAH--NFQLYGGGVMTAASCSTPPNL 299
Query: 292 DHGVAAVGYGKSK-GSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPL 348
+H V AVGYG ++ G+ Y ++KN WG WGE GY+R++R CG+ + AS P+
Sbjct: 300 NHAVTAVGYGTAEDGTPYWLLKNQWGQNWGEGGYLRLERGANA----CGVAQQASYPV 353
>gi|449500383|ref|XP_004161083.1| PREDICTED: vignain-like [Cucumis sativus]
Length = 345
Score = 294 bits (752), Expect = 4e-77, Method: Compositional matrix adjust.
Identities = 165/333 (49%), Positives = 215/333 (64%), Gaps = 11/333 (3%)
Query: 22 SSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQ 81
S LA F + + L + + L +L+E W KH + ++EK RF +FKEN+ H+
Sbjct: 18 SGLAESFE---FDEKELATEESLWQLYERW-GKHHTISRNLKEKHKRFSVFKENVNHVFT 73
Query: 82 RNKEVTSYWLGLNEFADMSHEEFKNKY----LGLKPQFPTRRQPSAEFSYRDVKALPKSV 137
N+ Y L LN+FADMS+ EF N Y + + RR+ + F Y LP SV
Sbjct: 74 VNQMDKPYKLKLNKFADMSNYEFVNFYARSNISHYRKLHERRRGAGGFMYEQDTDLPSSV 133
Query: 138 DWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCN 197
D R++GAV VK QG CGSCWAFS+VAAVEGIN+I + L SLSEQEL+DC+ N GCN
Sbjct: 134 DGRERGAVNAVKEQGRCGSCWAFSSVAAVEGINKIKTNQLLSLSEQELLDCNYR-NKGCN 192
Query: 198 GGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLK 257
GG M+ AF +I +GG+ E YPY G C + +V I GY+ VPEN E +L++
Sbjct: 193 GGFMEIAFDFIKRNGGIATENSYPYHGSRGLCRSSRISSPIVKIDGYESVPEN-EDALMQ 251
Query: 258 ALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSK-GSDYIIVKNSWG 316
A+A+QPVSVAI+A+G DFQFYS GVF G CG EL+HGV A+GYG ++ G+DY +V+NSWG
Sbjct: 252 AVANQPVSVAIDAAGRDFQFYSQGVFDGYCGTELNHGVVAIGYGTTEDGTDYWLVRNSWG 311
Query: 317 PKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
WGE GY+RMKR + EGLCGI AS P+K
Sbjct: 312 VGWGEDGYVRMKRGVEQAEGLCGIAMEASYPIK 344
>gi|400180355|gb|AFP73316.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 294 bits (752), Expect = 5e-77, Method: Compositional matrix adjust.
Identities = 163/347 (46%), Positives = 218/347 (62%), Gaps = 16/347 (4%)
Query: 8 KLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLH 67
K+ L+S+ ++LF S+ + G S L+ + E E WMS+HG+ YK EK
Sbjct: 4 KVDLMSILITLFFVISM-FNTQTRGRSQPKLS----VSERHELWMSRHGRVYKDEVEKGE 58
Query: 68 RFEIFKENLKHIDQRNKEVT-SYWLGLNEFADMSHEEFKNKYLGLK-PQFPTRRQP--SA 123
RF IFKEN+K I+ NK SY LG+NEFAD++ +EF K+ GL P P S
Sbjct: 59 RFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSST 118
Query: 124 EFSYRDVKA--LPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLS 181
EF D+ +P ++DWR+ GAVT VK+QG CG CWAFS V ++EG +I +GNL S
Sbjct: 119 EFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFS 178
Query: 182 EQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTI 241
EQEL+DC T+ N GCNGG M AF +I+ +GG+ +E DY YL E+ TC +E+ V I
Sbjct: 179 EQELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCR-SQEKTAAVQI 236
Query: 242 SGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYG 301
S YQ VPE E SLL+A+ QPVS+ I AS D QFY+GG + G C ++H V A+GYG
Sbjct: 237 SSYQVVPEG-ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYG 294
Query: 302 KS-KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
KG Y ++KNSWG WGE G++++ R++G P GLC I KM+S P
Sbjct: 295 TDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|400180428|gb|AFP73352.1| cysteine protease [Solanum corneliomuelleri]
Length = 344
Score = 293 bits (751), Expect = 7e-77, Method: Compositional matrix adjust.
Identities = 161/347 (46%), Positives = 217/347 (62%), Gaps = 16/347 (4%)
Query: 8 KLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLH 67
K+ L+++ ++LF S+ + + P+ S E E WMS+HG+ YK EK
Sbjct: 4 KVDLMNILITLFFVISMFNTQTRARSQPKLSVS-----ERHELWMSRHGRVYKDEVEKGE 58
Query: 68 RFEIFKENLKHIDQRNKEVT-SYWLGLNEFADMSHEEFKNKYLGLK-PQFPTRRQP--SA 123
RF IFKEN+K I+ NK SY LG+NEFAD++ +EF K+ GL P P S
Sbjct: 59 RFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSST 118
Query: 124 EFSYRDVKA--LPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLS 181
EF D+ +P ++DWR+ GAVT VK+QG CG CWAFS V ++EG +I +GNL S
Sbjct: 119 EFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFS 178
Query: 182 EQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTI 241
EQEL+DC T+ N GCNGG M AF +I+ +GG+ +E DY YL E+ TC +E+ V I
Sbjct: 179 EQELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCR-SQEKTAAVQI 236
Query: 242 SGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYG 301
S YQ VPE E SLL+A+ QPVS+ I AS D QFY+GG + G C ++H V A+GYG
Sbjct: 237 SSYQVVPEG-ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYG 294
Query: 302 KS-KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
KG Y ++KNSWG WGE G++++ R++G P GLC I KM+S P
Sbjct: 295 TDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|400180453|gb|AFP73363.1| cysteine protease [Solanum chilense]
Length = 344
Score = 293 bits (751), Expect = 7e-77, Method: Compositional matrix adjust.
Identities = 161/347 (46%), Positives = 217/347 (62%), Gaps = 16/347 (4%)
Query: 8 KLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLH 67
K+ L+++ ++LF S+ + + P+ S E E WMS+HG+ YK EK
Sbjct: 4 KVDLMNILITLFFVISMFNTQTRARSQPKLSVS-----ERHELWMSRHGRVYKDEVEKGE 58
Query: 68 RFEIFKENLKHIDQRNKEVT-SYWLGLNEFADMSHEEFKNKYLGLK-PQFPTRRQP--SA 123
RF IFKEN+K I+ NK SY LG+NEFAD++ +EF K+ GL P P S
Sbjct: 59 RFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPVSST 118
Query: 124 EFSYRDVKA--LPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLS 181
EF D+ +P ++DWR+ GAVT VK+QG CG CWAFS V ++EG +I +GNL S
Sbjct: 119 EFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFS 178
Query: 182 EQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTI 241
EQEL+DC T+ N GCNGG M AF +I+ +GG+ +E DY YL E+ TC +E+ V I
Sbjct: 179 EQELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCR-SQEKTAAVQI 236
Query: 242 SGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYG 301
S YQ VPE E SLL+A+ QPVS+ I AS D QFY+GG + G C ++H V A+GYG
Sbjct: 237 SSYQVVPEG-ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYG 294
Query: 302 KS-KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
KG Y ++KNSWG WGE G++++ R++G P GLC I KM+S P
Sbjct: 295 TDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|297826875|ref|XP_002881320.1| hypothetical protein ARALYDRAFT_321132 [Arabidopsis lyrata subsp.
lyrata]
gi|297327159|gb|EFH57579.1| hypothetical protein ARALYDRAFT_321132 [Arabidopsis lyrata subsp.
lyrata]
Length = 341
Score = 293 bits (749), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 153/349 (43%), Positives = 218/349 (62%), Gaps = 18/349 (5%)
Query: 7 SKLLLLSLSLSLFACSSLAHDFS--IVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEE 64
S ++L+++ LF S++ S + + P L E E WM++ + Y+ E
Sbjct: 3 SIMVLVTIFTILFTTFSISQATSRTVTFHEPSSL-------EKHEQWMARFSRVYRDELE 55
Query: 65 KLHRFEIFKENLKHIDQRNKEVT-SYWLGLNEFADMSHEEFKNKYLGLK---PQFPTRRQ 120
K R ++FK+NLK I+ NK+ SY LG+NEFAD ++EEF + GLK +
Sbjct: 56 KQMRRDVFKKNLKFIENFNKKGNKSYKLGVNEFADWTNEEFLAIHTGLKGLSSKVVDETI 115
Query: 121 PSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSL 180
S ++ D+ + K DWR +GAVTPVK QG CG CWAFS VAAVEG+ +I GNL SL
Sbjct: 116 SSRSWNISDMVGVSK--DWRAEGAVTPVKYQGQCGCCWAFSAVAAVEGVTKIAGGNLVSL 173
Query: 181 SEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVT 240
SEQ+L+DCD ++ GC+GG+M AF YI+ + G+ E DY Y +G C +
Sbjct: 174 SEQQLLDCDREYDRGCDGGIMSDAFNYIIQNRGIASENDYSYQGSDGRC--RSSARPAAR 231
Query: 241 ISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGY 300
ISG+Q VP N+EQ+LL+A++ QPVSV+++A+G F YSGGV+ GPCG +H V VGY
Sbjct: 232 ISGFQTVPSNNEQALLEAVSRQPVSVSMDANGDGFMHYSGGVYDGPCGTSSNHAVTFVGY 291
Query: 301 GKSK-GSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPL 348
G S+ G+ Y + KNSWG WGE+GYIR++R+ P+G+CG+ + A P+
Sbjct: 292 GTSQDGTKYWLAKNSWGETWGEKGYIRIRRDVAWPQGMCGVAQYAFYPV 340
>gi|400180441|gb|AFP73357.1| cysteine protease [Solanum habrochaites]
Length = 344
Score = 293 bits (749), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 161/347 (46%), Positives = 216/347 (62%), Gaps = 16/347 (4%)
Query: 8 KLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLH 67
K+ L+S+ ++LF S+ + + P+ S E E WMS+HG+ YK EK
Sbjct: 4 KVDLMSILITLFFVISMFNSQTRARSQPKLSVS-----ERHELWMSRHGRVYKDEVEKGE 58
Query: 68 RFEIFKENLKHIDQRNKEVT-SYWLGLNEFADMSHEEFKNKYLGLK-PQFPTRRQP--SA 123
RF IFKEN+K I+ NK SY LG+NEFAD++ EEF K+ GL P P S
Sbjct: 59 RFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSEEFLAKFTGLNIPNSYLSPSPMSST 118
Query: 124 EFSYRDVKA--LPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLS 181
EF D+ +P ++DWR+ GAVT VKNQG CG CWAFS V ++EG +I +GNL S
Sbjct: 119 EFKINDISDDDMPSNLDWRESGAVTQVKNQGQCGCCWAFSAVGSLEGAYKIATGNLMEFS 178
Query: 182 EQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTI 241
EQEL+DC T+ N GCNGG M AF +I +GG+ +E DY YL ++ TC +E+ V I
Sbjct: 179 EQELLDCTTN-NYGCNGGFMTNAFDFIRENGGISRESDYEYLGQQYTCR-SQEKTAAVQI 236
Query: 242 SGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYG 301
S YQ VPE E SLL+A+ QPVS+ I AS D QFY+GG + G C ++H V A+GYG
Sbjct: 237 SSYQVVPEG-ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCANRINHAVTAIGYG 294
Query: 302 KSK-GSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
+ G Y ++KNSWG WGE+G++++ R+ G P GLC I K++S P
Sbjct: 295 TDENGQKYWLLKNSWGTSWGEKGFMKIIRDYGNPSGLCDIAKLSSYP 341
>gi|296082368|emb|CBI21373.3| unnamed protein product [Vitis vinifera]
Length = 245
Score = 292 bits (748), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 138/221 (62%), Positives = 165/221 (74%), Gaps = 1/221 (0%)
Query: 131 KALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDT 190
+ LP+SVDWR+ GAV PVK+Q SCGSCWAFSTVAAVEGINQIV+G L SLSEQEL+DCDT
Sbjct: 4 EVLPESVDWRETGAVNPVKDQRSCGSCWAFSTVAAVEGINQIVTGELISLSEQELVDCDT 63
Query: 191 SFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPEN 250
++ GCNGGLMDYAF +I+ +GGL E+DYPY +G C + +VV+I GY+DVP
Sbjct: 64 EYDMGCNGGLMDYAFDFIIKNGGLDTEKDYPYTGFDGECNLSGKSSKVVSIDGYEDVPPF 123
Query: 251 DEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYII 310
DE++L KA+AHQPVSVA+EA G Q Y G+FTG CG LDHG+ AVGYG G+DY I
Sbjct: 124 DEKALQKAVAHQPVSVAVEAGGRALQLYVSGIFTGECGTALDHGIVAVGYGTENGTDYWI 183
Query: 311 VKNSWGPKWGERGYIRMKRNTGKP-EGLCGINKMASIPLKK 350
V+NSWG WGE GYIRM+RN G CGI AS P+K
Sbjct: 184 VRNSWGSSWGENGYIRMERNMADAFSGKCGIAMEASYPIKN 224
>gi|356517310|ref|XP_003527331.1| PREDICTED: vignain-like [Glycine max]
Length = 342
Score = 292 bits (748), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 155/352 (44%), Positives = 212/352 (60%), Gaps = 15/352 (4%)
Query: 1 MAFFSHSKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYK 60
M FS + L+L L L+++ ++ S S H E WM+++G+ YK
Sbjct: 1 MNSFSQNHYLILFLVLAVWTSHVMSRRLSEACTSERH-----------EKWMAQYGRVYK 49
Query: 61 CIEEKLHRFEIFKENLKHIDQRNKEVTS-YWLGLNEFADMSHEEFKNKYLGLKPQFP-TR 118
EK RF++FK N+ I+ N + L +N+FAD++ EEFK + ++ +
Sbjct: 50 DAAEKEKRFQVFKNNVHFIESFNAAGDKPFNLSINQFADLNDEEFKALLINVQKKASWVE 109
Query: 119 RQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLT 178
F Y V +P ++D RK+GAVTP+K+QG CGSCWAFS VAA EGI+QI +G L
Sbjct: 110 TSTETSFRYESVTKIPATIDRRKRGAVTPIKDQGRCGSCWAFSAVAATEGIHQITTGKLV 169
Query: 179 SLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEV 238
LSEQEL+DC + GC GG +D AF++I GG+ E YPY TC+ KKE V
Sbjct: 170 PLSEQELVDCVKGESEGCIGGYVDDAFEFIAKKGGIASETHYPYKGVNKTCKVKKETHGV 229
Query: 239 VTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGP-CGAELDHGVAA 297
I GY+ VP N+E++LLKA+A+QPVSV I+A F++YS G+F CG + +H VA
Sbjct: 230 AEIKGYEKVPSNNEKALLKAVANQPVSVYIDAGTHAFKYYSSGIFNARNCGTDPNHAVAV 289
Query: 298 VGYGKS-KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPL 348
VGYGK+ S Y +VKNSWG +WGERGYIR+KR+ EGLCGI K P+
Sbjct: 290 VGYGKALDDSKYWLVKNSWGTEWGERGYIRIKRDIRAKEGLCGIAKYPYYPI 341
>gi|400180407|gb|AFP73342.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 292 bits (748), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 162/347 (46%), Positives = 218/347 (62%), Gaps = 16/347 (4%)
Query: 8 KLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLH 67
K+ L+++ ++LF S+ + G S L+ + E E WMS+HG+ YK EK+
Sbjct: 4 KVDLMNILITLFFVISM-FNTQTRGRSQPKLS----VSERHELWMSRHGRVYKDEVEKVE 58
Query: 68 RFEIFKENLKHIDQRNKEVT-SYWLGLNEFADMSHEEFKNKYLGLK-PQFPTRRQP--SA 123
RF IFKEN+K I+ NK SY LG+NEFAD++ +EF K+ GL P P S
Sbjct: 59 RFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSST 118
Query: 124 EFSYRDVKA--LPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLS 181
EF D+ +P ++DWR+ GAVT VK+QG CG CWAFS V ++EG +I +GNL S
Sbjct: 119 EFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFS 178
Query: 182 EQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTI 241
EQEL+DC T+ N GCNGG M AF +I +GG+ +E DY YL E+ TC +E+ V I
Sbjct: 179 EQELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCR-SQEKTAAVQI 236
Query: 242 SGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYG 301
S YQ VPE E SLL+A+ QPVS+ I AS D QFY+GG + G C ++H V A+GYG
Sbjct: 237 SSYQVVPEG-ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYG 294
Query: 302 KS-KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
KG Y ++KNSWG WGE G++++ R++G P GLC I KM+S P
Sbjct: 295 TDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|422001787|dbj|BAM66994.1| germination-specific cysteine protease 1, partial [Raphanus
sativus]
Length = 235
Score = 292 bits (747), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 135/220 (61%), Positives = 169/220 (76%), Gaps = 1/220 (0%)
Query: 131 KALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDT 190
+ALP++VDWR+KGAV +KNQG+CGSCWAFST A VEGIN+IV+G L SLSEQEL+DCD
Sbjct: 2 EALPETVDWRQKGAVNAIKNQGTCGSCWAFSTAAVVEGINKIVTGELISLSEQELVDCDK 61
Query: 191 SFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPEN 250
S+N GCNGGLMDYAF++I+ +GGL+ E+DYPY +G C + +VVTI GY+DVP N
Sbjct: 62 SYNQGCNGGLMDYAFQFIMKNGGLNTEQDYPYRGSDGKCNSLLKNSKVVTIDGYEDVPTN 121
Query: 251 DEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYII 310
DE +L +A+++QPVSVAI+A G FQ Y G+FTG CG ++DH V AVGYG G DY I
Sbjct: 122 DETALKRAVSYQPVSVAIDAGGRVFQHYQSGIFTGECGTKMDHAVVAVGYGSENGVDYWI 181
Query: 311 VKNSWGPKWGERGYIRMKRNTGKPE-GLCGINKMASIPLK 349
V+NSWG KWGE GYIR++RN + G CGI AS P+K
Sbjct: 182 VRNSWGQKWGEDGYIRIERNLASSKSGKCGIAIEASYPVK 221
>gi|400180435|gb|AFP73355.1| cysteine protease [Solanum pennellii]
Length = 344
Score = 292 bits (747), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 161/347 (46%), Positives = 216/347 (62%), Gaps = 16/347 (4%)
Query: 8 KLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLH 67
K+ L+S+ ++LF S+ + + P+ S E E WMS+HG+ YK EK
Sbjct: 4 KVDLMSILITLFFVISMFNTQTRARSQPKLSVS-----ERHELWMSRHGRVYKDEVEKGE 58
Query: 68 RFEIFKENLKHIDQRNKEVT-SYWLGLNEFADMSHEEFKNKYLGLK-PQFPTRRQP--SA 123
RF IFKEN+K I+ NK SY LG+NEFAD++ +EF K+ GL P P S
Sbjct: 59 RFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYVSPSPMSST 118
Query: 124 EFSYRDVKA--LPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLS 181
EF D+ +P ++DWR+ GAVT VKNQG CG CWAFS V ++EG +I +GNL S
Sbjct: 119 EFKINDLSDDDMPSNLDWRESGAVTQVKNQGQCGCCWAFSAVGSLEGAYKIATGNLMEFS 178
Query: 182 EQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTI 241
EQEL+DC T+ N GCNGG M AF +I +GG+ +E DY YL ++ TC +E+ V I
Sbjct: 179 EQELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGQQYTCR-SQEKTAAVQI 236
Query: 242 SGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYG 301
S YQ VPE E SLL+A+ QPVS+ I AS D QFY+GG + G C ++H V A+GYG
Sbjct: 237 SSYQVVPEG-ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCANRINHAVTAIGYG 294
Query: 302 KS-KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
KG Y ++KNSWG WGE G++++ R++G P GLC I K++S P
Sbjct: 295 TDEKGQKYWLLKNSWGTSWGEDGFMKIIRDSGNPAGLCDIAKVSSYP 341
>gi|400180353|gb|AFP73315.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 291 bits (746), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 162/347 (46%), Positives = 218/347 (62%), Gaps = 16/347 (4%)
Query: 8 KLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLH 67
K+ L+++ ++LF S+ + G S L+ + E E WMS+HG+ YK EK
Sbjct: 4 KVDLMNILITLFFVISM-FNTQTRGRSQPKLS----VSERHELWMSRHGRVYKDEVEKGE 58
Query: 68 RFEIFKENLKHIDQRNKEVT-SYWLGLNEFADMSHEEFKNKYLGLK-PQFPTRRQP--SA 123
RF IFKEN+K I+ NK SY LG+NEFAD++ +EF K+ GL P P S
Sbjct: 59 RFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSST 118
Query: 124 EFSYRDVKA--LPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLS 181
EF D+ +P ++DWR+ GAVT VK+QG CG CWAFS V ++EG +I +GNL S
Sbjct: 119 EFIINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFS 178
Query: 182 EQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTI 241
EQEL+DC T+ N GCNGG M AF +I+ +GG+ +E DY YL E+ TC +E+ V I
Sbjct: 179 EQELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCR-SQEKTAAVQI 236
Query: 242 SGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYG 301
S YQ VPE E SLL+A+ QPVS+ I AS D QFY+GG + G C ++H V A+GYG
Sbjct: 237 SSYQVVPEG-ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYG 294
Query: 302 KS-KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
KG Y ++KNSWG WGE G++++ R++G P GLC I KM+S P
Sbjct: 295 TDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPAGLCDIAKMSSYP 341
>gi|357437717|ref|XP_003589134.1| Cysteine proteinase [Medicago truncatula]
gi|355478182|gb|AES59385.1| Cysteine proteinase [Medicago truncatula]
Length = 299
Score = 291 bits (746), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 144/281 (51%), Positives = 199/281 (70%), Gaps = 14/281 (4%)
Query: 10 LLLSLSLSLFACSSLAHDFSIVGYS---PEHLTSM---DKLIELFESWMSKHGKTYKCIE 63
L++ L +S F S LA D SI+ Y P+ TS +++ ++E W+ KHGK+Y +
Sbjct: 12 LMIVLIISSFTVS-LALDMSIISYDKTHPDKSTSKRTNKEVLTMYEEWLVKHGKSYNGLG 70
Query: 64 EKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQP-- 121
EK RFEIFK+NLK ID+ N ++Y LGL FAD+++EE+++K+LG K P RR
Sbjct: 71 EKDKRFEIFKDNLKFIDEHNGLNSTYRLGLTRFADLTNEEYRSKFLGTKID-PNRRMKKL 129
Query: 122 ----SAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNL 177
S ++ R LP+SVDWRK+GAV VK+Q SCGSCWAFS +AAVEGIN+IV+G+L
Sbjct: 130 GGSKSNRYAPRVGDKLPESVDWRKEGAVVGVKDQASCGSCWAFSAIAAVEGINKIVTGDL 189
Query: 178 TSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEME 237
SLSEQEL+DCDTS+N GCNGGLMDYAF++I+++GG+ E+DYPY +G C+ ++ +
Sbjct: 190 ISLSEQELVDCDTSYNEGCNGGLMDYAFEFIISNGGIDSEDDYPYKAVDGRCDQNRKNAK 249
Query: 238 VVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFY 278
VVTI Y+DVP DE +L KA+A+QP++VA+E G +FQ Y
Sbjct: 250 VVTIDDYEDVPAYDELALQKAVANQPIAVAVEGGGREFQLY 290
>gi|226502454|ref|NP_001140922.1| hypothetical protein [Zea mays]
gi|223948637|gb|ACN28402.1| unknown [Zea mays]
gi|413920877|gb|AFW60809.1| hypothetical protein ZEAMMB73_830238 [Zea mays]
Length = 354
Score = 291 bits (746), Expect = 3e-76, Method: Compositional matrix adjust.
Identities = 160/358 (44%), Positives = 218/358 (60%), Gaps = 34/358 (9%)
Query: 9 LLLLSLSLSLFACSSL---AHDFSIV---GYSPEHLTSMDKLIELFESWMSKHGKTYKCI 62
+ +++L++ A ++ A D S GY E + + WM++HG+TY+
Sbjct: 12 IAFTAVALTILAVKTMMAEARDLSSTSTGGYGEEAMKVR------HQQWMAEHGRTYRDE 65
Query: 63 EEKLHRFEIFKENLKHIDQRNK---EVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRR 119
EK HRF++FK N +D N + SY + LNEFADM+++EF Y GL+P P
Sbjct: 66 AEKAHRFQVFKANADFVDASNAAGDDKKSYRMELNEFADMTNDEFMAMYTGLRP-VPAGA 124
Query: 120 QPSAEFSY-----RDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVS 174
+ A F Y D ++VDWR+KGAVT +KNQG CG CWAF+ VAAVEGI+QI +
Sbjct: 125 KKMAGFKYGNVTLSDADDNQQTVDWRQKGAVTGIKNQGQCGCCWAFAAVAAVEGIHQITT 184
Query: 175 GNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKE 234
GNL SLSEQ+++DCDT NNGCNGG +D AF+YI +GGL E+ YPY + C+ +
Sbjct: 185 GNLVSLSEQQVLDCDTEGNNGCNGGYIDNAFQYIAGNGGLATEDAYPYTAAQAMCQSVQ- 243
Query: 235 EMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGP-CGA--EL 291
V ISGYQDVP DE +L A+A+QPVSVAI+A +FQ Y GGV T C L
Sbjct: 244 --PVAAISGYQDVPSGDEAALAAAVANQPVSVAIDAH--NFQLYGGGVMTAASCSTPPNL 299
Query: 292 DHGVAAVGYGKSK-GSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPL 348
+H V AVGYG ++ G+ Y ++KN WG WGE GY+R++R CG+ + AS P+
Sbjct: 300 NHAVTAVGYGTAEDGTPYWLLKNQWGQNWGEGGYLRLERGANA----CGVAQQASYPV 353
>gi|242092702|ref|XP_002436841.1| hypothetical protein SORBIDRAFT_10g009840 [Sorghum bicolor]
gi|241915064|gb|EER88208.1| hypothetical protein SORBIDRAFT_10g009840 [Sorghum bicolor]
Length = 328
Score = 291 bits (745), Expect = 3e-76, Method: Compositional matrix adjust.
Identities = 150/344 (43%), Positives = 207/344 (60%), Gaps = 25/344 (7%)
Query: 11 LLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFE 70
+ ++ S+ A LA F + L ++ E WM ++ + YK EK RFE
Sbjct: 1 MATIKASILAILGLAF-FCGAALAARDLNDDSAMVARHEQWMVQYSRVYKDTTEKARRFE 59
Query: 71 IFKENLKHIDQRNKEVT-SYWLGLNEFADMSHEEFK----NKYLGLKPQFPTRRQPSAEF 125
+FK N+K I+ N +WLG+N+FAD++++EF+ NK G KP P + +
Sbjct: 60 VFKANVKFIESFNAGGNRKFWLGVNQFADLTNDEFRATKTNK--GFKPS-PVKVSTGFRY 116
Query: 126 SYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQEL 185
V ALP ++DWR KGAVTP+K+QG C EGI +I +G L SLSEQEL
Sbjct: 117 ENVSVDALPATIDWRTKGAVTPIKDQGQC------------EGIVKISTGKLISLSEQEL 164
Query: 186 IDCDT-SFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGY 244
+DCD + GC GGLMD AFK+I+ +GGL E YPY +G C K T+ G+
Sbjct: 165 VDCDVHGEDQGCEGGLMDDAFKFIIKNGGLTTESSYPYTAADGKC--KSGSNSAATVKGF 222
Query: 245 QDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGK-S 303
+DVP NDE +L+KA+A+QPVSVA++ FQFYSGGV TG CG +LDHG+AA+GYG+ S
Sbjct: 223 EDVPANDEAALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYGQTS 282
Query: 304 KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
G+ Y ++KNSWG WGE GY+RM+++ G+CG+ S P
Sbjct: 283 DGTKYWLLKNSWGTTWGENGYLRMEKDISDKRGMCGLAMEPSYP 326
>gi|400180359|gb|AFP73318.1| cysteine protease [Solanum peruvianum]
gi|400180477|gb|AFP73375.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 291 bits (745), Expect = 3e-76, Method: Compositional matrix adjust.
Identities = 162/347 (46%), Positives = 215/347 (61%), Gaps = 16/347 (4%)
Query: 8 KLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLH 67
K+ L+S+ ++LF S+ + + P+ S E E WMS+HG+ YK EK
Sbjct: 4 KVDLMSILITLFFVISMFNSQTRARSQPKLSVS-----ERHELWMSRHGRVYKDEVEKGE 58
Query: 68 RFEIFKENLKHIDQRNKEVT-SYWLGLNEFADMSHEEFKNKYLGLK-PQFPTRRQP--SA 123
RF IFKEN+K I+ NK SY LG+NEFAD++ +EF K+ GL P P S
Sbjct: 59 RFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSST 118
Query: 124 EFSYRDVKA--LPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLS 181
EF D+ +P ++DWR+ GAVT VK+QG CG CWAFS V ++EG +I +GNL S
Sbjct: 119 EFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFS 178
Query: 182 EQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTI 241
EQEL+DC T+ N GCNGG M AF +I +GG+ +E DY YL E+ TC +E+ V I
Sbjct: 179 EQELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCR-SQEKTAAVQI 236
Query: 242 SGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYG 301
S YQ VPE E SLL+A+ QPVS+ I AS D QFY+GG + G C ++H V A+GYG
Sbjct: 237 SSYQVVPEG-ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYG 294
Query: 302 KS-KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
KG Y ++KNSWG WGE G++++ R+ G P GLC I KM+S P
Sbjct: 295 TDEKGQKYWLLKNSWGTSWGENGFMKIIRDYGNPAGLCDIAKMSSYP 341
>gi|400180377|gb|AFP73327.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 291 bits (745), Expect = 3e-76, Method: Compositional matrix adjust.
Identities = 161/347 (46%), Positives = 216/347 (62%), Gaps = 16/347 (4%)
Query: 8 KLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLH 67
K+ L+++ ++LF S+ + + PE S E E WMS+HG+ YK EK
Sbjct: 4 KVDLMNILITLFFVISMFNTQTRGRSQPELSVS-----ERHELWMSRHGRVYKDEVEKGE 58
Query: 68 RFEIFKENLKHIDQRNKEVT-SYWLGLNEFADMSHEEFKNKYLGLK-PQFPTRRQP--SA 123
RF IFKEN+K I+ NK SY LG+NEFAD++ +EF K+ GL P P S
Sbjct: 59 RFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSST 118
Query: 124 EFSYRDVKA--LPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLS 181
EF D+ +P ++DWR+ GAVT VK+QG CG CWAFS V ++EG +I +GNL S
Sbjct: 119 EFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFS 178
Query: 182 EQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTI 241
EQEL+DC T+ N GCNGG M AF +I+ +GG+ +E DY Y E+ TC +E+ V I
Sbjct: 179 EQELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYQGEQYTCR-SQEKTAAVQI 236
Query: 242 SGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYG 301
S YQ VPE E SLL+A+ QPVS+ I AS D QFY+GG + G C ++H V A+GYG
Sbjct: 237 SSYQVVPEG-ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYG 294
Query: 302 KS-KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
KG Y ++KNSWG WGE G++++ R++G P GLC I KM+S P
Sbjct: 295 TDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|400180367|gb|AFP73322.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 291 bits (745), Expect = 3e-76, Method: Compositional matrix adjust.
Identities = 161/347 (46%), Positives = 218/347 (62%), Gaps = 16/347 (4%)
Query: 8 KLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLH 67
K+ L+++ ++LF S+ + G S L+ + E E WMS+HG+ YK EK+
Sbjct: 4 KVDLMNILITLFFVISM-FNTQTRGRSQPKLS----VSERHELWMSRHGRVYKDEVEKVE 58
Query: 68 RFEIFKENLKHIDQRNKEVT-SYWLGLNEFADMSHEEFKNKYLGLK-PQFPTRRQP--SA 123
RF IFKEN+K I+ NK SY LG+NEFAD++ +EF K+ GL P P S
Sbjct: 59 RFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSST 118
Query: 124 EFSYRDVKA--LPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLS 181
EF D+ +P ++DWR+ GAVT VK+QG CG CWAFS V ++EG +I +GNL S
Sbjct: 119 EFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFS 178
Query: 182 EQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTI 241
EQEL+DC T+ N GCNGG M AF +I+ +GG+ +E DY YL E+ TC +E+ V I
Sbjct: 179 EQELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCR-SQEKTAAVQI 236
Query: 242 SGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYG 301
S Y+ VPE E SLL+A+ QPVS+ I AS D QFY+GG + G C ++H V A+GYG
Sbjct: 237 SSYKVVPEG-ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYG 294
Query: 302 KS-KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
KG Y ++KNSWG WGE G++++ R+ G P GLC I KM+S P
Sbjct: 295 TDEKGQKYWLLKNSWGTSWGENGFMKIIRDYGNPAGLCDIAKMSSYP 341
>gi|20334373|gb|AAM19207.1|AF493232_1 cysteine protease [Solanum pimpinellifolium]
gi|400180424|gb|AFP73350.1| cysteine protease [Solanum pimpinellifolium]
gi|400180433|gb|AFP73354.1| cysteine protease [Solanum lycopersicum]
Length = 344
Score = 291 bits (745), Expect = 4e-76, Method: Compositional matrix adjust.
Identities = 161/347 (46%), Positives = 220/347 (63%), Gaps = 16/347 (4%)
Query: 8 KLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLH 67
K+ L+++ ++LF S+ + G S L+ + E E WMS+HG+ YK EK
Sbjct: 4 KVDLMNILITLFFVISM-FNTQTRGRSQPKLS----VSERHELWMSRHGRVYKDEVEKGE 58
Query: 68 RFEIFKENLKHIDQRNKEVT-SYWLGLNEFADMSHEEFKNKYLGLK-PQFPTRRQP--SA 123
RF IFKEN+K I+ NK SY LG+NEFAD++ +EF K+ GL P P S
Sbjct: 59 RFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSST 118
Query: 124 EFSYRDVKA--LPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLS 181
EF D+ +P ++DWR+ GAVT VK+QG CG CWAFS V ++EG +I +GNL S
Sbjct: 119 EFKINDLSDDYMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFS 178
Query: 182 EQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTI 241
EQEL+DC T+ N GCNGGLM AF +I+ +GG+ +E DY YL E+ TC +E+ V I
Sbjct: 179 EQELLDCTTN-NYGCNGGLMTNAFDFIIENGGISRESDYEYLGEQYTCR-SREKTAAVQI 236
Query: 242 SGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYG 301
S Y+ VPE E SLL+A+ QPVS+ I AS D QFY+GG + G C +++H V A+GYG
Sbjct: 237 SSYKVVPEG-ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGNCADQINHAVTAIGYG 294
Query: 302 KS-KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
+G Y ++KNSWG WGE G++++ R++G P GLC I KM+S P
Sbjct: 295 TDEEGQKYWLLKNSWGTSWGENGFMKIIRDSGDPSGLCDIAKMSSYP 341
>gi|297851334|ref|XP_002893548.1| peptidase C1A papain family protein [Arabidopsis lyrata subsp.
lyrata]
gi|297339390|gb|EFH69807.1| peptidase C1A papain family protein [Arabidopsis lyrata subsp.
lyrata]
Length = 346
Score = 291 bits (744), Expect = 4e-76, Method: Compositional matrix adjust.
Identities = 157/353 (44%), Positives = 222/353 (62%), Gaps = 20/353 (5%)
Query: 7 SKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLI-ELFESWMSKHGKTYKCIEEK 65
+ +L + +SL++ + S V + +T + ++ E + WM++ + Y EK
Sbjct: 2 TSILFMFVSLTILSMSLK------VSQATSRVTFHEPIVAEHHQQWMTRFSRVYSDELEK 55
Query: 66 LHRFEIFKENLKHIDQRNKE-VTSYWLGLNEFADMSHEEFKNKYLGLK-------PQFPT 117
RF++FK+NLK I++ NK+ +Y LG+NEFAD + EEF + GLK +F
Sbjct: 56 QMRFDVFKKNLKFIEKFNKKGDRTYKLGVNEFADWTKEEFIATHTGLKGFNGIPSSEFVD 115
Query: 118 RRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNL 177
PS ++ DV A P+ DWR +GAVTPVK QG CG CWAFS+VAAVEG+ +IV GNL
Sbjct: 116 EMIPSWNWNVSDV-AGPEIKDWRYEGAVTPVKYQGQCGCCWAFSSVAAVEGLTKIVGGNL 174
Query: 178 TSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEME 237
SLSEQ+L+DCD +NGCNGG+M AF YI+ + G+ E YPY EGTC +
Sbjct: 175 VSLSEQQLLDCDRERDNGCNGGIMSDAFSYIIKNRGIASEASYPYQETEGTC--RYNAKP 232
Query: 238 VVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGP-CGAELDHGVA 296
I G+Q VP N+E++LL+A++ QPVSV+I+A G F YSGGV+ P CG +++H V
Sbjct: 233 SAWIRGFQTVPSNNERALLEAVSRQPVSVSIDADGPGFMHYSGGVYDEPYCGTDVNHAVT 292
Query: 297 AVGYGKS-KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPL 348
VGYG S +G Y + KNSWG WGE GYIR++R+ P+G+CG+ + A P+
Sbjct: 293 FVGYGTSPEGIKYWLAKNSWGETWGENGYIRIRRDVAWPQGMCGVAQYAFYPV 345
>gi|400180463|gb|AFP73368.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 291 bits (744), Expect = 4e-76, Method: Compositional matrix adjust.
Identities = 161/347 (46%), Positives = 216/347 (62%), Gaps = 16/347 (4%)
Query: 8 KLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLH 67
K+ L+++ ++LF S+ + + P+ S E E WMS+HG+ YK EK
Sbjct: 4 KVDLMNILITLFFVISMFNTQTRARSQPKLSVS-----ERHELWMSRHGRVYKDEVEKGE 58
Query: 68 RFEIFKENLKHIDQRNKEVT-SYWLGLNEFADMSHEEFKNKYLGLK-PQFPTRRQP--SA 123
RF IFKEN+K I+ NK SY LG+NEFAD++ +EF K+ GL P P S
Sbjct: 59 RFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSST 118
Query: 124 EFSYRDVKA--LPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLS 181
EF D+ +P ++DWR+ GAVT VK+QG CG CWAFS V ++EG +I +GNL S
Sbjct: 119 EFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFS 178
Query: 182 EQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTI 241
EQEL+DC T+ N GCNGG M AF +I +GG+ +E DY YL E+ TC +E+ V I
Sbjct: 179 EQELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCR-SQEKTAAVQI 236
Query: 242 SGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYG 301
S YQ VPE E SLL+A+ QPVS+ I AS D QFY+GG + G C ++H V A+GYG
Sbjct: 237 SSYQVVPEG-ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYG 294
Query: 302 KS-KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
KG Y ++KNSWG WGE G++++ R++G P GLC I KM+S P
Sbjct: 295 TDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGDPSGLCDIAKMSSYP 341
>gi|356521444|ref|XP_003529366.1| PREDICTED: thiol protease SEN102-like [Glycine max]
Length = 340
Score = 291 bits (744), Expect = 5e-76, Method: Compositional matrix adjust.
Identities = 152/306 (49%), Positives = 193/306 (63%), Gaps = 11/306 (3%)
Query: 49 ESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKE-VTSYWLGLNEFADMSHEEFKNK 107
E WM+ H + Y EK R +IFKENL+ I++ N E Y L LN FAD+++EEF
Sbjct: 39 EEWMAMHDRVYADSAEKDRRQQIFKENLEFIEKHNNEGKKRYNLSLNSFADLTNEEFVAS 98
Query: 108 YLGLKPQFPT-----RRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFST 162
+ G + PT + S F V + S+DWRK+GAV +KNQG CGSCWAFS
Sbjct: 99 HTGALYKPPTQLGSFKINHSLGFHKMSVGDIEASLDWRKRGAVNDIKNQGRCGSCWAFSA 158
Query: 163 VAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPY 222
VAAVEGINQI +G L SLSEQ L+DC + N+GC+G ++ AF YI GL EE+YPY
Sbjct: 159 VAAVEGINQIKNGQLVSLSEQNLVDCAS--NDGCHGQYVEKAFDYI-RDYGLANEEEYPY 215
Query: 223 LMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGV 282
+ GTC + I GYQ V +E+ LL A+A QPVSV +EA G FQFYSGGV
Sbjct: 216 VETVGTCSGNSNP--AIQIRGYQSVTPQNEEQLLTAVASQPVSVLLEAKGQGFQFYSGGV 273
Query: 283 FTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINK 342
F+G CG EL+H V VGYG+ Y +++NSWG WGE GY+++ R+TG P+GLCGIN
Sbjct: 274 FSGECGTELNHAVTIVGYGEEAEGKYWLIRNSWGKSWGEGGYMKLMRDTGNPQGLCGINM 333
Query: 343 MASIPL 348
AS P
Sbjct: 334 QASYPF 339
>gi|400180389|gb|AFP73333.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 291 bits (744), Expect = 5e-76, Method: Compositional matrix adjust.
Identities = 161/347 (46%), Positives = 218/347 (62%), Gaps = 16/347 (4%)
Query: 8 KLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLH 67
K+ L+++ ++LF S+ + G S L+ + E E WMS+HG+ YK EK
Sbjct: 4 KVDLMNILITLFFVISM-FNTQTRGRSQPKLS----VSERHELWMSRHGRVYKDEVEKGE 58
Query: 68 RFEIFKENLKHIDQRNKEVT-SYWLGLNEFADMSHEEFKNKYLGLK-PQFPTRRQP--SA 123
RF IFKEN+K I+ NK SY LG+NEFAD++ +EF K+ GL P P S
Sbjct: 59 RFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSST 118
Query: 124 EFSYRDVKA--LPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLS 181
EF D+ +P ++DWR+ GAVT VK+QG CG CWAFS V ++EG +I +GNL S
Sbjct: 119 EFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFS 178
Query: 182 EQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTI 241
EQEL+DC T+ N GCNGG M AF +I+ +GG+ +E DY YL E+ TC +E+ V I
Sbjct: 179 EQELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCR-SQEKTAAVQI 236
Query: 242 SGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYG 301
S Y+ VPE E SLL+A+ QPVS+ I AS D QFY+GG + G C ++H V A+GYG
Sbjct: 237 SSYKVVPEG-ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYG 294
Query: 302 KS-KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
KG Y ++KNSWG WGE G++++ R++G P GLC I KM+S P
Sbjct: 295 TDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|400180357|gb|AFP73317.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 290 bits (743), Expect = 5e-76, Method: Compositional matrix adjust.
Identities = 160/347 (46%), Positives = 216/347 (62%), Gaps = 16/347 (4%)
Query: 8 KLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLH 67
K+ L+S+ ++LF S+ + + P+ S E E WMS+HG+ YK EK
Sbjct: 4 KVDLMSILITLFFVISMFNSQTRARSQPKLSVS-----ERHELWMSRHGRVYKDEVEKGE 58
Query: 68 RFEIFKENLKHIDQRNKEVT-SYWLGLNEFADMSHEEFKNKYLGLK-PQFPTRRQP--SA 123
RF IFKEN+K I+ NK SY LG+NEFAD++ +EF K+ GL P P S
Sbjct: 59 RFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSST 118
Query: 124 EFSYRDVKA--LPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLS 181
EF D+ +P ++DWR+ GAVT VK+QG CG CWAFS V ++EG +I +GNL S
Sbjct: 119 EFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFS 178
Query: 182 EQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTI 241
EQEL+DC T+ N GCNGG M AF +I+ +GG+ +E DY YL ++ TC +E+ V I
Sbjct: 179 EQELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQYTCR-SQEKTAAVQI 236
Query: 242 SGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYG 301
S Y+ VPE E SLL+A+ QPVS+ I AS D QFY+GG + G C ++H V A+GYG
Sbjct: 237 SSYKVVPEG-ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYG 294
Query: 302 KS-KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
KG Y ++KNSWG WGE G++++ R+ G P GLC I KM+S P
Sbjct: 295 TDEKGQKYWLLKNSWGTSWGENGFMKIIRDYGNPSGLCDIAKMSSYP 341
>gi|400180385|gb|AFP73331.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 290 bits (743), Expect = 5e-76, Method: Compositional matrix adjust.
Identities = 162/347 (46%), Positives = 217/347 (62%), Gaps = 16/347 (4%)
Query: 8 KLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLH 67
K+ L+++ ++LF S+ + G S L+ + E E WMS+HG+ YK EK
Sbjct: 4 KVDLMNILITLFFVISM-FNTQTRGRSQPKLS----VSERHELWMSRHGRVYKDEVEKGE 58
Query: 68 RFEIFKENLKHIDQRNKEVT-SYWLGLNEFADMSHEEFKNKYLGLK-PQFPTRRQP--SA 123
RF IFKEN+K I+ NK SY LG+NEFAD++ +EF K+ GL P P S
Sbjct: 59 RFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSST 118
Query: 124 EFSYRDVKA--LPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLS 181
EF D+ +P ++DWR+ GAVT VK+QG CG CWAFS V ++EG +I +GNL S
Sbjct: 119 EFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFS 178
Query: 182 EQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTI 241
EQEL+DC T+ N GCNGG M AF +I +GG+ +E DY YL E+ TC +E+ V I
Sbjct: 179 EQELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCR-SQEKTAAVQI 236
Query: 242 SGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYG 301
S YQ VPE E SLL+A+ QPVS+ I AS D QFY+GG + G C ++H V A+GYG
Sbjct: 237 SSYQVVPEG-ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYG 294
Query: 302 KS-KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
KG Y ++KNSWG WGE G++++ R++G P GLC I KM+S P
Sbjct: 295 TDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGDPSGLCDITKMSSYP 341
>gi|194352760|emb|CAQ00108.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
gi|326510977|dbj|BAJ91836.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326523875|dbj|BAJ96948.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326528631|dbj|BAJ97337.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 368
Score = 290 bits (743), Expect = 6e-76, Method: Compositional matrix adjust.
Identities = 145/333 (43%), Positives = 198/333 (59%), Gaps = 27/333 (8%)
Query: 42 DKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEV---TSYWLGLNEFAD 98
D + + F W ++H +TY EE+ HR ++ N+++I+ N + +Y LG + D
Sbjct: 36 DPMAQRFRRWKAEHSRTYATPEEERHRLRVYARNMRYIEATNGDAGAGLTYELGETAYTD 95
Query: 99 MSHEEFKNKYLGLKPQFPTRRQ--PSAEFSYR------------------DVKALPKSVD 138
++ +EF Y P P + R + P SVD
Sbjct: 96 LTSDEFTAMYTSRAPPLSDDDDDLPMTMITTRAGPVAAAGGGGWLQVYVNESAGAPASVD 155
Query: 139 WRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNG 198
WR++GAVT VKNQG CGSCWAFSTVA +EGI+QI +G L SLSEQEL+DCD ++GCNG
Sbjct: 156 WRERGAVTAVKNQGQCGSCWAFSTVAVIEGIHQIKTGKLASLSEQELVDCD-KLDHGCNG 214
Query: 199 GLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKA 258
G+ A ++I ++GG+ ++DYPY ++ TC+ KK +ISG+Q V E SL A
Sbjct: 215 GVSYRALQWITSNGGITSQDDYPYTAKDDTCDTKKLSHHAASISGFQRVATRSELSLTNA 274
Query: 259 LAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSK--GSDYIIVKNSWG 316
+A QPV+V+IEA G +FQ Y GV+ GPCG L+HGV VGYG+ + G Y IVKNSWG
Sbjct: 275 VAMQPVAVSIEAGGANFQHYRNGVYNGPCGTRLNHGVTVVGYGEDEVTGESYWIVKNSWG 334
Query: 317 PKWGERGYIRMKRN-TGKPEGLCGINKMASIPL 348
KWG+ GY+RMK+ KPEG+CGI S PL
Sbjct: 335 EKWGDNGYLRMKKGIIDKPEGICGIAIRPSFPL 367
>gi|341850671|gb|AEK97329.1| chromoplast senescence-associated protein 12 [Brassica rapa var.
parachinensis]
Length = 260
Score = 290 bits (743), Expect = 6e-76, Method: Compositional matrix adjust.
Identities = 145/259 (55%), Positives = 185/259 (71%), Gaps = 7/259 (2%)
Query: 95 EFADMSHEEFKNKYLGLKPQ---FPTRRQPSAEFSYRDVK--ALPKSVDWRKKGAVTPVK 149
+FA+++++EF++ Y G K + S F Y++V ALP +VDWRKKGAVTP+K
Sbjct: 1 QFAEITNDEFRSMYTGYKGDSVLSSQSQTKSTSFRYQNVSSGALPIAVDWRKKGAVTPIK 60
Query: 150 NQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIV 209
NQGSCG CWAFS VAA+EG QI G L SLSEQ+L+DCDT+ + GC+GGL+D AF++I+
Sbjct: 61 NQGSCGCCWAFSAVAAIEGATQIKKGKLISLSEQQLVDCDTN-DFGCSGGLIDTAFEHIM 119
Query: 210 ASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIE 269
A+GGL E +YPY E+ TC+ K +I+GY+DVP NDE +L+KA+AHQPVSV IE
Sbjct: 120 ATGGLTTESNYPYKGEDATCKIKSTXPSAASITGYEDVPVNDENALMKAVAHQPVSVGIE 179
Query: 270 ASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSK-GSDYIIVKNSWGPKWGERGYIRMK 328
G DFQFYS GVFTG C LDH V AVGY +S GS Y I+KNSWG KWGE GY+R+K
Sbjct: 180 GGGFDFQFYSSGVFTGECTTYLDHAVTAVGYSQSSAGSKYWIIKNSWGTKWGEGGYMRIK 239
Query: 329 RNTGKPEGLCGINKMASIP 347
++ EGLCG+ AS P
Sbjct: 240 KDIKDKEGLCGLAMKASYP 258
>gi|400180393|gb|AFP73335.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 290 bits (743), Expect = 6e-76, Method: Compositional matrix adjust.
Identities = 160/347 (46%), Positives = 218/347 (62%), Gaps = 16/347 (4%)
Query: 8 KLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLH 67
K+ L+++ ++LF S+ + G S L+ + E E WMS+HG+ YK EK+
Sbjct: 4 KVDLMNILITLFFVISM-FNTQTRGRSQPKLS----VSERHELWMSRHGRVYKDEVEKVE 58
Query: 68 RFEIFKENLKHIDQRNKEVT-SYWLGLNEFADMSHEEFKNKYLGLK-PQFPTRRQP--SA 123
RF IFKEN+K I+ NK SY LG+NEFAD++ +EF K+ GL P P S
Sbjct: 59 RFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSST 118
Query: 124 EFSYRDVKA--LPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLS 181
E D+ +P ++DWR+ GAVT VK+QG CG CWAFS V ++EG +I +GNL S
Sbjct: 119 ELKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFS 178
Query: 182 EQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTI 241
EQEL+DC T+ N GCNGG M AF +I+ +GG+ +E DY YL E+ TC +E+ V I
Sbjct: 179 EQELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCR-SQEKTAAVQI 236
Query: 242 SGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYG 301
S Y+ VPE E SLL+A+ QPVS+ I AS D QFY+GG + G C ++H V A+GYG
Sbjct: 237 SSYKVVPEG-ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYG 294
Query: 302 KS-KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
KG Y ++KNSWG WGE G++++ R++G P GLC I KM+S P
Sbjct: 295 TDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|400180422|gb|AFP73349.1| cysteine protease [Solanum chmielewskii]
Length = 344
Score = 290 bits (743), Expect = 6e-76, Method: Compositional matrix adjust.
Identities = 161/347 (46%), Positives = 218/347 (62%), Gaps = 16/347 (4%)
Query: 8 KLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLH 67
K+ L+++ ++LF S+ + G S L+ + E E WMS+HG+ YK EK
Sbjct: 4 KVDLMNILITLFFVISI-FNTQTRGRSQPKLS----VSERHELWMSRHGRVYKDEVEKGE 58
Query: 68 RFEIFKENLKHIDQRNKEVT-SYWLGLNEFADMSHEEFKNKYLGLK-PQFPTRRQP--SA 123
RF IFKEN+K I+ NK SY LG+NEFAD++ +EF K+ GL P P S
Sbjct: 59 RFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSST 118
Query: 124 EFSYRDVKA--LPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLS 181
EF D+ +P ++DWR+ GAVT VK+QG CG CWAFS V ++EG +I +GNL S
Sbjct: 119 EFKTNDLSDDDMPSNLDWRESGAVTQVKHQGQCGCCWAFSAVGSLEGAYKIATGNLMEFS 178
Query: 182 EQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTI 241
EQEL+DC T+ N GCNGG M AF +I+ +GG+ +E DY YL ++ TC +E+ V I
Sbjct: 179 EQELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQYTCR-SQEKTAAVQI 236
Query: 242 SGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYG 301
S YQ VPE E SLL+A+ QPVS+ I AS D QFYSGG + G C ++H V A+GYG
Sbjct: 237 SSYQVVPEG-ETSLLQAVTKQPVSIGIAAS-QDLQFYSGGTYDGSCADRINHAVTAIGYG 294
Query: 302 KS-KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
+G Y ++KNSWG WGE G++++ R++G P GLC I KM+S P
Sbjct: 295 TDEEGQKYWLLKNSWGTSWGENGFMKIIRDSGDPSGLCDIAKMSSYP 341
>gi|400180403|gb|AFP73340.1| cysteine protease [Solanum peruvianum]
gi|400180413|gb|AFP73345.1| cysteine protease [Solanum peruvianum]
gi|400180415|gb|AFP73346.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 290 bits (742), Expect = 6e-76, Method: Compositional matrix adjust.
Identities = 162/347 (46%), Positives = 217/347 (62%), Gaps = 16/347 (4%)
Query: 8 KLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLH 67
K+ L+++ ++LF S+ + G S L+ + E E WMS+HG+ YK EK
Sbjct: 4 KVDLMNILITLFFVISM-FNTQTRGRSQPKLS----VSERHELWMSRHGRVYKDEVEKGE 58
Query: 68 RFEIFKENLKHIDQRNKEVT-SYWLGLNEFADMSHEEFKNKYLGLK-PQFPTRRQP--SA 123
RF IFKEN+K I+ NK SY LG+NEFAD++ +EF K+ GL P P S
Sbjct: 59 RFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSST 118
Query: 124 EFSYRDVKA--LPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLS 181
EF D+ +P ++DWR+ GAVT VK+QG CG CWAFS V ++EG +I +GNL S
Sbjct: 119 EFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFS 178
Query: 182 EQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTI 241
EQEL+DC T+ N GCNGG M AF +I +GG+ +E DY YL E+ TC +E+ V I
Sbjct: 179 EQELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCR-SQEKTAAVQI 236
Query: 242 SGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYG 301
S YQ VPE E SLL+A+ QPVS+ I AS D QFY+GG + G C ++H V A+GYG
Sbjct: 237 SSYQVVPEG-ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYG 294
Query: 302 KS-KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
KG Y ++KNSWG WGE G++++ R++G P GLC I KM+S P
Sbjct: 295 TDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPAGLCDIAKMSSYP 341
>gi|400180419|gb|AFP73348.1| cysteine protease [Solanum lycopersicoides]
Length = 343
Score = 290 bits (742), Expect = 7e-76, Method: Compositional matrix adjust.
Identities = 160/348 (45%), Positives = 214/348 (61%), Gaps = 19/348 (5%)
Query: 8 KLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLH 67
K+ L+S+ ++LF S+ + + P+ S E E WMS+HG+ YK EK
Sbjct: 4 KIDLMSILITLFFVISMFNSQTTARSQPKLSVS-----ERHELWMSRHGRVYKDEVEKGE 58
Query: 68 RFEIFKENLKHIDQRNKEVT-SYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPS---- 122
RF IFKEN+K I+ NK SY LG+NEFAD++ EEF K+ G+ P+ PS
Sbjct: 59 RFMIFKENMKFIESVNKAGNLSYKLGINEFADITSEEFLTKFTGIN--IPSYLSPSPMSS 116
Query: 123 AEFSYRDVKA--LPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSL 180
EF D+ +P ++DWR+ GAVT VKNQG CG CWAFS V ++EG +I +GNL
Sbjct: 117 TEFKINDLSDDDMPSNLDWRESGAVTQVKNQGQCGCCWAFSAVGSLEGAYKIATGNLMEF 176
Query: 181 SEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVT 240
SEQEL+DC T+ N GCNGG M AF +I +GG+ E DY Y ++ TC +E+ V
Sbjct: 177 SEQELLDCTTN-NYGCNGGFMTNAFDFIKENGGISSESDYEYQGQQYTCR-SQEKTAAVQ 234
Query: 241 ISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGY 300
IS YQ VPE E SLL+A+ QPVS+ I AS D QFY+GG + G C ++H V A+GY
Sbjct: 235 ISSYQVVPEG-ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGY 292
Query: 301 GKS-KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
G KG Y ++KNSWG WGE G++++ R++G P G C I KM+S P
Sbjct: 293 GTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPGGHCDIAKMSSYP 340
>gi|400180457|gb|AFP73365.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 290 bits (742), Expect = 7e-76, Method: Compositional matrix adjust.
Identities = 161/347 (46%), Positives = 218/347 (62%), Gaps = 16/347 (4%)
Query: 8 KLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLH 67
K+ L+++ ++LF S+ + G S L+ + E E WMS+HG+ YK EK
Sbjct: 4 KVDLMNILITLFFVISM-FNTQTRGRSQPKLS----VSERHELWMSRHGRVYKDEVEKGE 58
Query: 68 RFEIFKENLKHIDQRNKEVT-SYWLGLNEFADMSHEEFKNKYLGLK-PQFPTRRQP--SA 123
RF IFKEN+K I+ NK SY LG+NEFAD++ +EF K+ GL P P S
Sbjct: 59 RFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSST 118
Query: 124 EFSYRDVKA--LPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLS 181
EF D+ +P ++DWR+ GAVT VK+QG CG CWAFS V ++EG +I +GNL S
Sbjct: 119 EFIINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFS 178
Query: 182 EQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTI 241
EQEL+DC T+ N GCNGG M AF +I+ +GG+ +E DY YL E+ TC +E+ V I
Sbjct: 179 EQELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCR-SQEKTAAVQI 236
Query: 242 SGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYG 301
S Y+ VPE E SLL+A+ QPVS+ I AS D QFY+GG + G C ++H V A+GYG
Sbjct: 237 SSYKVVPEG-ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYG 294
Query: 302 KS-KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
KG Y ++KNSWG WGE G++++ R++G P GLC I KM+S P
Sbjct: 295 TDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|242092700|ref|XP_002436840.1| hypothetical protein SORBIDRAFT_10g009830 [Sorghum bicolor]
gi|241915063|gb|EER88207.1| hypothetical protein SORBIDRAFT_10g009830 [Sorghum bicolor]
Length = 328
Score = 290 bits (742), Expect = 7e-76, Method: Compositional matrix adjust.
Identities = 149/346 (43%), Positives = 209/346 (60%), Gaps = 25/346 (7%)
Query: 11 LLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFE 70
+ ++ S+ A LA F + L ++ E WM ++ + YK EK RFE
Sbjct: 1 MATIKASILAILGLAF-FCGAALAARDLNDDSAMVARHEQWMVQYSRVYKDTTEKARRFE 59
Query: 71 IFKENLKHIDQRNKEVT-SYWLGLNEFADMSHEEFK----NKYLGLKPQFPTRRQPSAEF 125
+FK N+K I+ N +WLG+N+FAD++++EF+ NK G KP P + +
Sbjct: 60 VFKANVKFIESFNAGGNRKFWLGVNQFADLTNDEFRATKTNK--GFKPS-PVKVPTGFRY 116
Query: 126 SYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQEL 185
V ALP ++DWR KGAVTP+K+QG C EGI +I +G L SLSEQEL
Sbjct: 117 ENVSVDALPATIDWRTKGAVTPIKDQGQC------------EGIVKISTGKLISLSEQEL 164
Query: 186 IDCDT-SFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGY 244
+DCD + GC GGLMD AF++I+ +GGL E YPY +G C K T+ G+
Sbjct: 165 VDCDVHGEDQGCEGGLMDDAFQFIIKNGGLTTESSYPYTAADGKC--KSGSNSAATVKGF 222
Query: 245 QDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGK-S 303
+DVP NDE +L+KA+A+QPVSVA++ FQFYSGGV TG CG +LDHG+AA+GYG+ S
Sbjct: 223 EDVPANDEAALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYGQTS 282
Query: 304 KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
G+ Y ++KNSWG WGE GY+RM+++ G+CG+ S P++
Sbjct: 283 DGTKYWLLKNSWGTTWGENGYLRMEKDISDKRGMCGLAMEPSYPIE 328
>gi|400180455|gb|AFP73364.1| cysteine protease [Solanum peruvianum]
gi|400180459|gb|AFP73366.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 290 bits (742), Expect = 8e-76, Method: Compositional matrix adjust.
Identities = 162/347 (46%), Positives = 217/347 (62%), Gaps = 16/347 (4%)
Query: 8 KLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLH 67
K+ L+++ ++LF S+ + G S L+ + E E WMS+HG+ YK EK
Sbjct: 4 KVDLMNILITLFFVISM-FNTQTRGRSQPKLS----VSERHELWMSRHGRVYKDEVEKGE 58
Query: 68 RFEIFKENLKHIDQRNKEVT-SYWLGLNEFADMSHEEFKNKYLGLK-PQFPTRRQP--SA 123
RF IFKEN+K I+ NK SY LG+NEFAD++ +EF K+ GL P P S
Sbjct: 59 RFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSST 118
Query: 124 EFSYRDVKA--LPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLS 181
EF D+ +P ++DWR+ GAVT VK+QG CG CWAFS V ++EG +I +GNL S
Sbjct: 119 EFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFS 178
Query: 182 EQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTI 241
EQEL+DC T+ N GCNGG M AF +I +GG+ +E DY YL E+ TC +E+ V I
Sbjct: 179 EQELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCR-SQEKTAAVQI 236
Query: 242 SGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYG 301
S YQ VPE E SLL+A+ QPVS+ I AS D QFY+GG + G C ++H V A+GYG
Sbjct: 237 SSYQVVPEG-ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYG 294
Query: 302 KS-KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
KG Y ++KNSWG WGE G++++ R++G P GLC I KM+S P
Sbjct: 295 TDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGDPSGLCDIAKMSSYP 341
>gi|334904467|gb|AEH26024.1| cysteine peptidase [Ananas comosus]
Length = 352
Score = 290 bits (742), Expect = 8e-76, Method: Compositional matrix adjust.
Identities = 137/302 (45%), Positives = 199/302 (65%), Gaps = 7/302 (2%)
Query: 42 DKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEV-TSYWLGLNEFADMS 100
D +++ FE WM+++G+ YK +EK+ RF+IFK N+ HI+ N SY LG+N+F DM+
Sbjct: 31 DPMMKRFEEWMAEYGRVYKDNDEKMRRFQIFKNNVNHIETFNSHNGNSYTLGINQFTDMT 90
Query: 101 HEEFKNKYLG-LKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWA 159
EF +Y G + R+P F ++ A+P+S+DWR GAV VKNQ CGSCWA
Sbjct: 91 KSEFVAQYTGGISRPLNIEREPVVSFDDVNISAVPQSIDWRDYGAVNEVKNQNPCGSCWA 150
Query: 160 FSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEED 219
F+ +A VEGI +I +G L SLSEQE++DC S+ GC GG ++ A+ +I+++ G+ EE+
Sbjct: 151 FAAIATVEGIYKIKTGYLVSLSEQEVLDCAVSY--GCKGGWVNKAYDFIISNNGVTTEEN 208
Query: 220 YPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYS 279
YPY +GTC + I+GY V NDE+S++ A+++QP++ I+AS +FQ+Y+
Sbjct: 209 YPYQAYQGTC-NANSFPNSAYITGYSYVRRNDERSMMYAVSNQPIAALIDAS-ENFQYYN 266
Query: 280 GGVFTGPCGAELDHGVAAVGYGK-SKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLC 338
GGVF+GPCG L+H + +GYG+ S G+ Y IV+NSWG WGE GY+RM R G C
Sbjct: 267 GGVFSGPCGTSLNHAITIIGYGQDSSGTKYWIVRNSWGSSWGEGGYVRMARGVSSSSGAC 326
Query: 339 GI 340
GI
Sbjct: 327 GI 328
>gi|400180361|gb|AFP73319.1| cysteine protease [Solanum peruvianum]
gi|400180397|gb|AFP73337.1| cysteine protease [Solanum peruvianum]
gi|400180401|gb|AFP73339.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 290 bits (742), Expect = 8e-76, Method: Compositional matrix adjust.
Identities = 162/347 (46%), Positives = 217/347 (62%), Gaps = 16/347 (4%)
Query: 8 KLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLH 67
K+ L+++ ++LF S+ + G S L+ + E E WMS+HG+ YK EK
Sbjct: 4 KVDLMNILITLFFVISM-FNTQTRGRSQPKLS----VSERHELWMSRHGRVYKDEVEKGE 58
Query: 68 RFEIFKENLKHIDQRNKEVT-SYWLGLNEFADMSHEEFKNKYLGLK-PQFPTRRQP--SA 123
RF IFKEN+K I+ NK SY LG+NEFAD++ +EF K+ GL P P S
Sbjct: 59 RFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSST 118
Query: 124 EFSYRDVKA--LPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLS 181
EF D+ +P ++DWR+ GAVT VK+QG CG CWAFS V ++EG +I +GNL S
Sbjct: 119 EFIINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFS 178
Query: 182 EQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTI 241
EQEL+DC T+ N GCNGG M AF +I +GG+ +E DY YL E+ TC +E+ V I
Sbjct: 179 EQELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCR-SQEKTAAVQI 236
Query: 242 SGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYG 301
S YQ VPE E SLL+A+ QPVS+ I AS D QFY+GG + G C ++H V A+GYG
Sbjct: 237 SSYQVVPEG-ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYG 294
Query: 302 KS-KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
KG Y ++KNSWG WGE G++++ R++G P GLC I KM+S P
Sbjct: 295 TDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGDPSGLCDITKMSSYP 341
>gi|400180381|gb|AFP73329.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 290 bits (741), Expect = 8e-76, Method: Compositional matrix adjust.
Identities = 160/347 (46%), Positives = 218/347 (62%), Gaps = 16/347 (4%)
Query: 8 KLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLH 67
K+ L+++ ++LF S+ + G S L+ + E E WMS+HG+ YK EK
Sbjct: 4 KVDLMNILITLFFVISM-FNTQTRGRSQPKLS----VSERHELWMSRHGRVYKDEVEKGE 58
Query: 68 RFEIFKENLKHIDQRNKEVT-SYWLGLNEFADMSHEEFKNKYLGLK-PQFPTRRQP--SA 123
RF IFKEN+K I+ NK SY LG+NEFAD++ +EF K+ GL P P S
Sbjct: 59 RFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSST 118
Query: 124 EFSYRDVKA--LPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLS 181
EF D+ +P ++DWR+ GAVT VK+QG CG CWAFS V ++EG +I +GNL S
Sbjct: 119 EFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFS 178
Query: 182 EQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTI 241
EQEL+DC T+ N GCNGG M AF +I+ +GG+ +E DY YL ++ TC +E+ V I
Sbjct: 179 EQELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQYTCR-SQEKTAAVQI 236
Query: 242 SGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYG 301
S Y+ VPE E SLL+A+ QPVS+ I AS D QFY+GG + G C ++H V A+GYG
Sbjct: 237 SSYKVVPEG-ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYG 294
Query: 302 KS-KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
KG Y ++KNSWG WGE G++++ R++G P GLC I KM+S P
Sbjct: 295 TDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGDPSGLCDITKMSSYP 341
>gi|400180467|gb|AFP73370.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 290 bits (741), Expect = 9e-76, Method: Compositional matrix adjust.
Identities = 161/347 (46%), Positives = 215/347 (61%), Gaps = 16/347 (4%)
Query: 8 KLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLH 67
K+ L+++ ++LF S+ + + P+ S E E WMS+HG+ YK EK
Sbjct: 4 KVDLMNILITLFFVISMFNTQTRARSQPKLSVS-----ERHELWMSRHGRVYKDEVEKGE 58
Query: 68 RFEIFKENLKHIDQRNKEVT-SYWLGLNEFADMSHEEFKNKYLGLK-PQFPTRRQP--SA 123
RF IFKEN+K I+ NK SY LG+NEFAD++ +EF K+ GL P P S
Sbjct: 59 RFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSST 118
Query: 124 EFSYRDVKA--LPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLS 181
EF D+ +P ++DWR+ GAVT VK+QG CG CWAFS V ++EG +I +GNL S
Sbjct: 119 EFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFS 178
Query: 182 EQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTI 241
EQEL+DC T+ N GCNGG M AF +I +GG+ +E DY YL E+ TC +E+ V I
Sbjct: 179 EQELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCR-SQEKTAAVQI 236
Query: 242 SGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYG 301
S YQ VPE E SLL+A+ QPVS+ I AS D QFY+GG + G C ++H V A+GYG
Sbjct: 237 SSYQVVPEG-ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYG 294
Query: 302 KS-KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
KG Y ++KNSWG WGE G++++ R+ G P GLC I KM+S P
Sbjct: 295 TDEKGQKYWLLKNSWGTSWGENGFMKIIRDYGNPAGLCDIAKMSSYP 341
>gi|400180426|gb|AFP73351.1| cysteine protease [Solanum corneliomuelleri]
Length = 344
Score = 290 bits (741), Expect = 9e-76, Method: Compositional matrix adjust.
Identities = 160/347 (46%), Positives = 218/347 (62%), Gaps = 16/347 (4%)
Query: 8 KLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLH 67
K+ L+++ ++LF S+ + G S L+ + E E WMS+HG+ YK EK
Sbjct: 4 KVDLMNILITLFFVISM-FNTQTRGRSQPKLS----VSERHELWMSRHGRVYKDEVEKGE 58
Query: 68 RFEIFKENLKHIDQRNKEVT-SYWLGLNEFADMSHEEFKNKYLGLK-PQFPTRRQP--SA 123
RF IFKEN+K I+ NK SY LG+NEFAD++ +EF K+ GL P P S
Sbjct: 59 RFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSST 118
Query: 124 EFSYRDVKA--LPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLS 181
EF D+ +P ++DWR+ GAVT VK+QG CG CWAFS V ++EG +I +GNL S
Sbjct: 119 EFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFS 178
Query: 182 EQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTI 241
EQEL+DC T+ N GCNGG M AF +I+ +GG+ +E DY YL ++ TC +E+ V I
Sbjct: 179 EQELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQYTCR-SQEKTAAVQI 236
Query: 242 SGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYG 301
S YQ VPE E SLL+A+ QPVS+ I AS D QFY+GG + G C ++H V A+GYG
Sbjct: 237 SSYQVVPEG-ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYG 294
Query: 302 KSK-GSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
+ G Y ++KNSWG WGE G++++ R++G P GLC I KM+S P
Sbjct: 295 TDENGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|358248896|ref|NP_001239703.1| uncharacterized protein LOC100799247 precursor [Glycine max]
gi|255636729|gb|ACU18700.1| unknown [Glycine max]
Length = 341
Score = 290 bits (741), Expect = 9e-76, Method: Compositional matrix adjust.
Identities = 147/309 (47%), Positives = 205/309 (66%), Gaps = 7/309 (2%)
Query: 46 ELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTS-YWLGLNEFADMSHEEF 104
E E WM+++GK YK EK RF++FK N++ I+ N + L +N+FAD+ EEF
Sbjct: 33 ERHEKWMAQYGKVYKDAAEKEKRFQVFKNNVQFIESFNAAGDKPFNLSINQFADLHDEEF 92
Query: 105 KNKYLGLKPQFPTRRQPSAE--FSYRDVKALPKSVDWRKKGAVTPVKNQG-SCGSCWAFS 161
K ++ + +R + + E F Y +V +P ++DWRK+GAVTP+K+QG +CGSCWAF+
Sbjct: 93 KALLNNVQKK-ASRVETATETSFRYENVTKIPSTMDWRKRGAVTPIKDQGYTCGSCWAFA 151
Query: 162 TVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYP 221
TVA VE ++QI +G L SLSEQEL+DC + GC GG ++ AF++I GG+ E YP
Sbjct: 152 TVATVESLHQITTGELVSLSEQELVDCVRGDSEGCRGGYVENAFEFIANKGGITSEAYYP 211
Query: 222 YLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGG 281
Y ++ +C+ KKE V I GY+ VP N E++LLKA+A+QPVSV I+A F+FYS G
Sbjct: 212 YKGKDRSCKVKKETHGVARIIGYESVPSNSEKALLKAVANQPVSVYIDAGAIAFKFYSSG 271
Query: 282 VFTGP-CGAELDHGVAAVGYGKSK-GSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCG 339
+F CG LDH VA VGYGK + G+ Y +VKNSW WGE+GY+R+KR+ +GLCG
Sbjct: 272 IFEARNCGTHLDHAVAVVGYGKLRDGTKYWLVKNSWSTAWGEKGYMRIKRDIRAKKGLCG 331
Query: 340 INKMASIPL 348
I AS P+
Sbjct: 332 IASNASYPI 340
>gi|400180371|gb|AFP73324.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 290 bits (741), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 162/347 (46%), Positives = 217/347 (62%), Gaps = 16/347 (4%)
Query: 8 KLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLH 67
K+ L+++ ++LF S+ + G S L+ + E E WMS+HG+ YK EK
Sbjct: 4 KVDLMNILITLFFVISM-FNTQTRGRSQPKLS----VSERHELWMSRHGRVYKDEVEKGE 58
Query: 68 RFEIFKENLKHIDQRNKEVT-SYWLGLNEFADMSHEEFKNKYLGLK-PQFPTRRQP--SA 123
RF IFKEN+K I+ NK SY LG+NEFAD++ +EF K+ GL P P S
Sbjct: 59 RFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSST 118
Query: 124 EFSYRDVKA--LPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLS 181
EF D+ +P ++DWR+ GAVT VK+QG CG CWAFS V ++EG +I +GNL S
Sbjct: 119 EFIINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFS 178
Query: 182 EQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTI 241
EQEL+DC T+ N GCNGG M AF +I +GG+ +E DY YL E+ TC +E+ V I
Sbjct: 179 EQELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCR-SQEKTAAVQI 236
Query: 242 SGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYG 301
S YQ VPE E SLL+A+ QPVS+ I AS D QFY+GG + G C ++H V A+GYG
Sbjct: 237 SSYQVVPEG-ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYG 294
Query: 302 KS-KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
KG Y ++KNSWG WGE G++++ R++G P GLC I KM+S P
Sbjct: 295 TDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|242093994|ref|XP_002437487.1| hypothetical protein SORBIDRAFT_10g027980 [Sorghum bicolor]
gi|241915710|gb|EER88854.1| hypothetical protein SORBIDRAFT_10g027980 [Sorghum bicolor]
Length = 341
Score = 290 bits (741), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 150/315 (47%), Positives = 201/315 (63%), Gaps = 26/315 (8%)
Query: 42 DKLIELFESWMSKHGKTYKCIE-EKLHRFEIFKENLKHIDQRNKEVT----SYWLGLNEF 96
+++ +L+++W S+HG+ I R ++F++NL++ID N E ++ LGL F
Sbjct: 45 EEVRQLYKTWKSEHGRPRDGISVADGLRLKVFRDNLRYIDAHNAEADAGLHTFRLGLTPF 104
Query: 97 ADMSHEEFKNKYLG-LKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCG 155
D++ EEF+ LG L P R S + R LP +VDWR++GAVT VKNQ CG
Sbjct: 105 TDLTLEEFRAHALGFLNSTLP--RVASDRYLPRAGDDLPDAVDWRQQGAVTGVKNQLDCG 162
Query: 156 SCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLH 215
CWAFS VAA+EGIN+IV+ NL SLSEQELIDCDT + GC GG M AF++++ +GG+
Sbjct: 163 GCWAFSAVAAMEGINKIVTNNLISLSEQELIDCDTE-DYGCQGGEMQKAFQFVIDNGGID 221
Query: 216 KEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDF 275
E DYP++ GTC+ +E+ +VV+I Y++VP NDE++L KA+A+QP
Sbjct: 222 TEADYPFIGTNGTCDAIREKRKVVSIDSYENVPTNDEEALQKAVANQP------------ 269
Query: 276 QFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPE 335
G+F GPCG LDHGV AVGYG G D+ IVKNSWG +WGE GYIRMKRN P
Sbjct: 270 -----GIFNGPCGFILDHGVTAVGYGSDNGEDFWIVKNSWGAEWGESGYIRMKRNVLLPM 324
Query: 336 GLCGINKMASIPLKK 350
G CGI AS P+K
Sbjct: 325 GKCGIAMYASYPVKN 339
>gi|195628596|gb|ACG36128.1| vignain precursor [Zea mays]
Length = 362
Score = 290 bits (741), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 151/359 (42%), Positives = 223/359 (62%), Gaps = 24/359 (6%)
Query: 4 FSHSKLLLLSLSLSLFAC------SSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGK 57
+ + L+LL +++ C ++ + VG + +M ++ ++ WM+++ +
Sbjct: 11 ITMTTLMLLLCVIAIADCICQAAVAARVEPSTTVGRTTGGDEAM--MMARYKKWMAQYRR 68
Query: 58 TYKCIEEKLHRFEIFKENLKHIDQRNKE-VTSYWLGLNEFADMSHEEFKNKYLGLK--PQ 114
YK EK HRF++FK N + ID+ N Y LG N+FAD++ +EF Y GL+
Sbjct: 69 KYKDDAEKAHRFQVFKANAEFIDRSNAGGKKKYVLGTNQFADLTSKEFAAMYTGLRKPAA 128
Query: 115 FPT-RRQPSAEFSYRDVKALPKSV--DWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQ 171
P+ +Q A F Y++ L V DWR++GAVTPVKNQG CG CWAFS V A+EG+
Sbjct: 129 VPSGAKQIPAGFKYQNFTRLDDDVQVDWRQQGAVTPVKNQGQCGCCWAFSAVGAMEGLIM 188
Query: 172 IVSGNLTSLSEQELIDCDTS-FNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCE 230
I +GNL SLSEQ+++DCD S N GCNGG MD AF+Y+V +GG+ E+ YPY +GTC+
Sbjct: 189 ITTGNLVSLSEQQILDCDESDGNQGCNGGYMDNAFQYVVNNGGVTTEDAYPYSAVQGTCQ 248
Query: 231 DKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGP-CGA 289
+ + TISG+QD+P DE +L A+A+QPVSV ++ + FQFY GG++ G CG
Sbjct: 249 NVQ---PAATISGFQDLPSGDENALANAVANQPVSVGVDGGSSPFQFYQGGIYDGDGCGT 305
Query: 290 ELDHGVAAVGYGK-SKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
+++H V A+GYG +G+ Y I+KNSWG WGE G+++++ G CGI+ MAS P
Sbjct: 306 DMNHAVTAIGYGADDQGTQYWILKNSWGTGWGENGFMQLQMGVGA----CGISTMASYP 360
>gi|20334375|gb|AAM19208.1|AF493233_1 cysteine protease [Solanum pennellii]
Length = 337
Score = 290 bits (741), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 160/343 (46%), Positives = 215/343 (62%), Gaps = 15/343 (4%)
Query: 8 KLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLH 67
K+ L+S+ ++LF S+ + G S L+ + E E WMS+HG+ YK EK
Sbjct: 4 KIDLMSILITLFFVISM-FNTQTRGRSQPKLS----VSERHELWMSRHGRVYKDEVEKGE 58
Query: 68 RFEIFKENLKHIDQRNKEVT-SYWLGLNEFADMSHEEFKNKYLGLK-PQFPTRRQPSAEF 125
RF IFKEN+K I+ NK SY LG+NEFAD++ +EF K+ GL P P +
Sbjct: 59 RFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPINDL 118
Query: 126 SYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQEL 185
S D +P ++DWR+ GAVT VKNQG CG CWAFS V ++EG +I +GNL SEQEL
Sbjct: 119 SDDD---MPSNLDWRESGAVTQVKNQGQCGCCWAFSAVGSLEGAYKIATGNLMEFSEQEL 175
Query: 186 IDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQ 245
+DC T+ N GCNGG M AF +I +GG+ +E DY YL ++ TC +E+ V IS YQ
Sbjct: 176 LDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGQQYTCR-SQEKTAAVQISSYQ 233
Query: 246 DVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKS-K 304
VPE E SLL+A+ QPVS+ I AS D QFY+GG + G C ++H V A+GYG K
Sbjct: 234 VVPEG-ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCANRINHAVTAIGYGTDEK 291
Query: 305 GSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
G Y ++KNSWG WGE G++++ R++G P GLC I K++S P
Sbjct: 292 GQKYWLLKNSWGTSWGEDGFMKIIRDSGNPAGLCDIAKVSSYP 334
>gi|400180365|gb|AFP73321.1| cysteine protease [Solanum peruvianum]
gi|400180395|gb|AFP73336.1| cysteine protease [Solanum peruvianum]
gi|400180405|gb|AFP73341.1| cysteine protease [Solanum peruvianum]
gi|400180409|gb|AFP73343.1| cysteine protease [Solanum peruvianum]
gi|400180411|gb|AFP73344.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 289 bits (740), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 160/347 (46%), Positives = 218/347 (62%), Gaps = 16/347 (4%)
Query: 8 KLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLH 67
K+ L+++ ++LF S+ + G S L+ + E E WMS+HG+ YK EK
Sbjct: 4 KVDLMNILITLFFVISM-FNTQTRGRSQPKLS----VSERHELWMSRHGRVYKDEVEKGE 58
Query: 68 RFEIFKENLKHIDQRNKEVT-SYWLGLNEFADMSHEEFKNKYLGLK-PQFPTRRQP--SA 123
RF IFKEN+K I+ NK SY LG+NEFAD++ +EF K+ GL P P S
Sbjct: 59 RFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSST 118
Query: 124 EFSYRDVKA--LPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLS 181
EF D+ +P ++DWR+ GAVT VK+QG CG CWAFS V ++EG +I +GNL S
Sbjct: 119 EFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFS 178
Query: 182 EQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTI 241
EQEL+DC T+ N GCNGG M AF +I+ +GG+ +E DY YL ++ TC +E+ V I
Sbjct: 179 EQELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQYTCR-SQEKTAAVQI 236
Query: 242 SGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYG 301
S Y+ VPE E SLL+A+ QPVS+ I AS D QFY+GG + G C ++H V A+GYG
Sbjct: 237 SSYKVVPEG-ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYG 294
Query: 302 KS-KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
KG Y ++KNSWG WGE G++++ R++G P GLC I KM+S P
Sbjct: 295 TDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|400180347|gb|AFP73312.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 289 bits (740), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 160/347 (46%), Positives = 218/347 (62%), Gaps = 16/347 (4%)
Query: 8 KLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLH 67
K+ L+++ ++LF S+ + G S L+ + E E WMS+HG+ YK EK
Sbjct: 4 KVDLMNILITLFFVISM-FNTQTRGRSQPKLS----VSERHELWMSRHGRVYKDEVEKGE 58
Query: 68 RFEIFKENLKHIDQRNKEVT-SYWLGLNEFADMSHEEFKNKYLGLK-PQFPTRRQP--SA 123
RF IFKEN+K I+ NK SY LG+NEFAD++ +EF K+ GL P P S
Sbjct: 59 RFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSST 118
Query: 124 EFSYRDVKA--LPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLS 181
EF D+ +P ++DWR+ GAVT VK+QG CG CWAFS V ++EG +I +GNL S
Sbjct: 119 EFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFS 178
Query: 182 EQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTI 241
EQEL+DC T+ N GC+GG M AF +I+ +GG+ +E DY YL ++ TC +E+ V I
Sbjct: 179 EQELLDCTTN-NYGCDGGFMTNAFDFIIENGGISRESDYEYLGQQYTCR-SQEKTAAVQI 236
Query: 242 SGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYG 301
S YQ VPE E SLL+A+ QPVS+ I AS D QFY+GG + G C ++H V A+GYG
Sbjct: 237 SSYQVVPEG-ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYG 294
Query: 302 KS-KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
KG Y ++KNSWG WGE G++++ R++G P GLC I KM+S P
Sbjct: 295 TDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|400180375|gb|AFP73326.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 289 bits (740), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 161/347 (46%), Positives = 215/347 (61%), Gaps = 16/347 (4%)
Query: 8 KLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLH 67
K+ L+S+ ++LF S+ + + P+ S E E WMS+HG+ YK EK
Sbjct: 4 KVDLMSILITLFFVISMFNSQTRARSQPKLSVS-----ERHELWMSRHGRVYKDEVEKGE 58
Query: 68 RFEIFKENLKHIDQRNKEVT-SYWLGLNEFADMSHEEFKNKYLGLK-PQFPTRRQP--SA 123
RF IFKEN+K I+ NK SY LG+NEFAD++ +EF K+ GL P P S
Sbjct: 59 RFMIFKENIKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSST 118
Query: 124 EFSYRDVKA--LPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLS 181
EF D+ +P ++DWR+ GAVT VK+QG CG CWAFS V ++EG +I +GNL S
Sbjct: 119 EFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFS 178
Query: 182 EQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTI 241
EQEL+DC T+ N GC+GG M AF +I +GG+ E DY YL E+ TC +E+ V I
Sbjct: 179 EQELLDCTTN-NYGCDGGFMTNAFDFIKENGGISSESDYEYLGEQYTCR-SQEKTAAVQI 236
Query: 242 SGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYG 301
S YQ VPE E SLL+A+ QPVS+ I AS D QFY+GG + G C ++H V A+GYG
Sbjct: 237 SSYQVVPEG-ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYG 294
Query: 302 KS-KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
KG Y ++KNSWG WGE G++++ R++G P GLC I KM+S P
Sbjct: 295 TDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPAGLCDIAKMSSYP 341
>gi|400180399|gb|AFP73338.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 289 bits (740), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 162/347 (46%), Positives = 217/347 (62%), Gaps = 16/347 (4%)
Query: 8 KLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLH 67
K+ L+++ ++LF S+ + G S L+ + E E WMS+HG+ YK EK
Sbjct: 4 KVDLMNILITLFFVISM-FNTQTRGRSQPKLS----VSERHELWMSRHGRVYKDEVEKGE 58
Query: 68 RFEIFKENLKHIDQRNKEVT-SYWLGLNEFADMSHEEFKNKYLGLK-PQFPTRRQP--SA 123
RF IFKEN+K I+ NK SY LG+NEFAD++ +EF K+ GL P P S
Sbjct: 59 RFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSST 118
Query: 124 EFSYRDVKA--LPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLS 181
EF D+ +P ++DWR+ GAVT VK+QG CG CWAFS V ++EG +I +GNL S
Sbjct: 119 EFIINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFS 178
Query: 182 EQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTI 241
EQEL+DC T+ N GCNGG M AF +I +GG+ +E DY YL E+ TC +E+ V I
Sbjct: 179 EQELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCR-SQEKTAAVQI 236
Query: 242 SGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYG 301
S YQ VPE E SLL+A+ QPVS+ I AS D QFY+GG + G C ++H V A+GYG
Sbjct: 237 SSYQVVPEG-ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYG 294
Query: 302 KS-KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
KG Y ++KNSWG WGE G++++ R++G P GLC I KM+S P
Sbjct: 295 TDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGDPSGLCDIAKMSSYP 341
>gi|400180437|gb|AFP73356.1| cysteine protease [Solanum pennellii]
Length = 337
Score = 289 bits (740), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 160/343 (46%), Positives = 215/343 (62%), Gaps = 15/343 (4%)
Query: 8 KLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLH 67
K+ L+S+ ++LF S+ + G S L+ + E E WMS+HG+ YK EK
Sbjct: 4 KVDLMSILITLFFVISM-FNTQTRGRSQPKLS----VSERHELWMSRHGRVYKDEVEKGE 58
Query: 68 RFEIFKENLKHIDQRNKEVT-SYWLGLNEFADMSHEEFKNKYLGLK-PQFPTRRQPSAEF 125
RF IFKEN+K I+ NK SY LG+NEFAD++ +EF K+ GL P P +
Sbjct: 59 RFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPINDL 118
Query: 126 SYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQEL 185
S D +P ++DWR+ GAVT VKNQG CG CWAFS V ++EG +I +GNL SEQEL
Sbjct: 119 SDDD---MPSNLDWRESGAVTQVKNQGQCGCCWAFSAVGSLEGAYKIATGNLMEFSEQEL 175
Query: 186 IDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQ 245
+DC T+ N GCNGG M AF +I +GG+ +E DY YL ++ TC +E+ V IS YQ
Sbjct: 176 LDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGQQYTCR-SQEKTAAVQISSYQ 233
Query: 246 DVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKS-K 304
VPE E SLL+A+ QPVS+ I AS D QFY+GG + G C ++H V A+GYG K
Sbjct: 234 VVPEG-ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCANRINHAVTAIGYGTDEK 291
Query: 305 GSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
G Y ++KNSWG WGE G++++ R++G P GLC I K++S P
Sbjct: 292 GQKYWLLKNSWGTSWGEDGFMKIIRDSGNPAGLCDIAKVSSYP 334
>gi|313118768|gb|ADR32296.1| C14 cysteine protease [Solanum demissum]
gi|313118770|gb|ADR32297.1| C14 cysteine protease [Solanum demissum]
Length = 217
Score = 289 bits (740), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 129/216 (59%), Positives = 162/216 (75%)
Query: 134 PKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFN 193
P SVDWR KG + VK+QGSCGSCWAFS VAA+E IN IV+GNL SLSEQEL+DCD S+N
Sbjct: 2 PVSVDWRDKGVLVGVKDQGSCGSCWAFSAVAAMESINAIVTGNLISLSEQELVDCDKSYN 61
Query: 194 NGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQ 253
GC+GGLMDYAF++++ +GG+ EEDYPY G C+ ++ +VVTI Y+DVP N+E+
Sbjct: 62 EGCDGGLMDYAFEFVINNGGIDTEEDYPYKERNGVCDQYRKNAKVVTIDSYEDVPVNNEK 121
Query: 254 SLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKN 313
+L KA+AHQPVS+A+EA G DFQ Y G+FTG CG +DHGV GYG G DY IV+N
Sbjct: 122 ALQKAVAHQPVSIALEAGGRDFQHYKSGIFTGKCGTAVDHGVVVAGYGTENGMDYWIVRN 181
Query: 314 SWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
SWG KWGE+GY+R++RN GLCG+ S P+K
Sbjct: 182 SWGAKWGEKGYLRVQRNVASSSGLCGLAIEPSYPVK 217
>gi|307103885|gb|EFN52142.1| hypothetical protein CHLNCDRAFT_139276 [Chlorella variabilis]
Length = 388
Score = 289 bits (740), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 144/309 (46%), Positives = 192/309 (62%), Gaps = 14/309 (4%)
Query: 46 ELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFK 105
+ F W HG++YK E R +F EN KH+ ++N + L LN+FAD++ EEF
Sbjct: 44 QAFSQWQMTHGRSYKSASEARKRQAVFVENAKHVAEQNARNSGLVLALNQFADLTLEEFA 103
Query: 106 NKYLGLKPQF-PTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVA 164
+LG P + + F Y D LP +VDWRKK AVTPVKNQ CGSCWAFS
Sbjct: 104 ATHLGYNPSLREGKEHTTTSFQYADANDLPSTVDWRKKNAVTPVKNQAMCGSCWAFSATG 163
Query: 165 AVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLM 224
AVEGIN I +G L SLSEQ+L+DCD+ + GC GGLMD+AF YI +GG+ E+DY Y
Sbjct: 164 AVEGINAIRTGKLVSLSEQQLVDCDSEKDLGCGGGLMDFAFDYITKNGGIDSEDDYSYWG 223
Query: 225 EEGTCEDKKE-EMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVF 283
C+ +KE + VVTI G++DVP+ND ++L KA+AHQPVS+ ++SG V
Sbjct: 224 YGLICQRRKEADRHVVTIDGFEDVPKNDGEALKKAIAHQPVSL----------YHSGVVG 273
Query: 284 TGPCGAELDHGVAAVGY--GKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGIN 341
C +L+HGV AVGY G G+ + ++KNSWG WGE+G+ R+ + + G CG+
Sbjct: 274 DDACCQDLNHGVLAVGYDDGSKGGTPHYVIKNSWGEGWGEQGFFRLAAKSSEASGACGVY 333
Query: 342 KMASIPLKK 350
K AS PLKK
Sbjct: 334 KAASYPLKK 342
>gi|260516654|gb|ACX43954.1| cysteine protease 1 [Brachiaria hybrid cultivar]
gi|260516656|gb|ACX43955.1| cysteine protease 1 [Brachiaria hybrid cultivar]
gi|260516658|gb|ACX43956.1| cysteine protease 1 [Brachiaria hybrid cultivar]
gi|260516660|gb|ACX43957.1| cysteine protease 1 [Brachiaria hybrid cultivar]
gi|260516662|gb|ACX43958.1| cysteine protease 2 [Brachiaria hybrid cultivar]
gi|260516664|gb|ACX43959.1| cysteine protease 2 [Brachiaria hybrid cultivar]
gi|260516666|gb|ACX43960.1| cysteine protease 2 [Brachiaria hybrid cultivar]
gi|260516668|gb|ACX43961.1| cysteine protease 2 [Brachiaria hybrid cultivar]
gi|260516670|gb|ACX43962.1| cysteine protease 2 [Brachiaria hybrid cultivar]
Length = 338
Score = 289 bits (739), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 162/317 (51%), Positives = 199/317 (62%), Gaps = 15/317 (4%)
Query: 36 EHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEV-TSYWLGLN 94
E + S L ++F ++M ++ K Y E RF FK N++ I N SY +GLN
Sbjct: 30 EEVPSEVMLQDMFTAFMKQYSKAYSHAEFS-SRFNQFKANVETIRLHNTLANASYTMGLN 88
Query: 95 EFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSC 154
EFAD+S EEFK KY G K R + +++V+A P S+DWR AVTP+K+QG C
Sbjct: 89 EFADLSFEEFKGKYFGYKH--VEREFARSNNLHQEVEAAPTSIDWRTSNAVTPIKDQGQC 146
Query: 155 GSCWAFSTVAAVEGINQIVSG--NLTSLSEQELIDCDTSFNN-GCNGGLMDYAFKYIVAS 211
GSCWAFS ++EG ++ G LTSLSEQ+L+DC TS+ N GCNGGLMDYAF+YI+A+
Sbjct: 147 GSCWAFSATGSIEGA-WVLQGKHTLTSLSEQQLVDCSTSYGNAGCNGGLMDYAFEYIIAN 205
Query: 212 GGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAIEA 270
G+ E YPY G C+ K +VVTISGY+DV DE SLL A+ PVSVAIEA
Sbjct: 206 KGICAESAYPYKGVGGLCQ--KSCTKVVTISGYKDVASGDEASLLNAVGTVGPVSVAIEA 263
Query: 271 SGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRN 330
FQFYS GVF+G CG LDHGV AVGYG + DY IVKNSWG WGE GYIRM RN
Sbjct: 264 DQAGFQFYSSGVFSGTCGHNLDHGVLAVGYGTTGSQDYWIVKNSWGTSWGESGYIRMIRN 323
Query: 331 TGKPEGLCGINKMASIP 347
+ CGI S P
Sbjct: 324 KNQ----CGIAIQPSYP 336
>gi|400180351|gb|AFP73314.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 289 bits (739), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 162/347 (46%), Positives = 216/347 (62%), Gaps = 16/347 (4%)
Query: 8 KLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLH 67
K+ L+++ ++LF S+ + G S L+ + E E WMS+HG+ YK EK
Sbjct: 4 KVDLMNILITLFFVISM-FNTQTRGRSQPKLS----VSERHELWMSRHGRVYKDEVEKGE 58
Query: 68 RFEIFKENLKHIDQRNKEVT-SYWLGLNEFADMSHEEFKNKYLGLK-PQFPTRRQP--SA 123
RF IFKEN+K I+ NK SY LG+NEFAD++ +EF K+ GL P P S
Sbjct: 59 RFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSST 118
Query: 124 EFSYRDVKA--LPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLS 181
EF D+ +P ++DWR+ GAVT VK+QG CG CWAFS V ++EG +I +GNL S
Sbjct: 119 EFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFS 178
Query: 182 EQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTI 241
EQEL+DC T+ N GCNGG M AF +I +GG+ +E DY YL E+ TC +E+ V I
Sbjct: 179 EQELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCR-SQEKTAAVQI 236
Query: 242 SGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYG 301
S YQ VPE E SLL+A+ QPVS+ I AS D QFY+GG + G C ++H V A+GYG
Sbjct: 237 SSYQVVPEG-ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYG 294
Query: 302 KS-KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
KG Y ++KNSWG WGE G++++ R+ G P GLC I KM+S P
Sbjct: 295 TDEKGQKYWLLKNSWGTSWGENGFMKIIRDYGNPAGLCDIAKMSSYP 341
>gi|400180383|gb|AFP73330.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 289 bits (739), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 160/347 (46%), Positives = 218/347 (62%), Gaps = 16/347 (4%)
Query: 8 KLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLH 67
K+ L+++ ++LF S+ + G S L+ + E E WMS+HG+ YK EK
Sbjct: 4 KVDLMNILITLFFVISM-FNTQTRGRSQPKLS----VSERHELWMSRHGRVYKDEVEKGE 58
Query: 68 RFEIFKENLKHIDQRNKEVT-SYWLGLNEFADMSHEEFKNKYLGLK-PQFPTRRQP--SA 123
RF IFKEN+K I+ NK SY LG+NEFAD++ +EF K+ GL P P S
Sbjct: 59 RFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSST 118
Query: 124 EFSYRDVKA--LPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLS 181
EF D+ +P ++DWR+ GAVT VK+QG CG CWAFS V ++EG +I +GNL S
Sbjct: 119 EFIINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFS 178
Query: 182 EQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTI 241
EQEL+DC T+ N GCNGG M AF +I+ +GG+ +E DY YL ++ TC +E+ V I
Sbjct: 179 EQELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQYTCR-SQEKTAAVQI 236
Query: 242 SGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYG 301
S Y+ VPE E SLL+A+ QPVS+ I AS D QFY+GG + G C ++H V A+GYG
Sbjct: 237 SSYKVVPEG-ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYG 294
Query: 302 KS-KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
KG Y ++KNSWG WGE G++++ R++G P GLC I KM+S P
Sbjct: 295 TDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|20334377|gb|AAM19209.1|AF493234_1 cysteine protease [Solanum lycopersicum]
gi|400180431|gb|AFP73353.1| cysteine protease [Solanum lycopersicum]
Length = 345
Score = 289 bits (739), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 158/348 (45%), Positives = 217/348 (62%), Gaps = 17/348 (4%)
Query: 8 KLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLH 67
K+ L+++ ++LF S+ + G S L+ + E E WMS+HG+ YK EK
Sbjct: 4 KVDLMNILITLFFVISM-FNTQTRGRSQPKLS----VSERHELWMSRHGRVYKDEVEKGE 58
Query: 68 RFEIFKENLKHIDQRNKEVT-SYWLGLNEFADMSHEEFKNKYLGLK-PQFPTRRQPSAEF 125
RF IFKEN+K I+ NK SY LG+NEFAD++ +EF K+ GL P P +
Sbjct: 59 RFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSST 118
Query: 126 SYRDVKAL-----PKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSL 180
++ + L P ++DWR+ GAVT VK+QG CG CWAFS V ++EG +I +GNL
Sbjct: 119 EFKKINDLSDDYMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEF 178
Query: 181 SEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVT 240
SEQEL+DC T+ N GCNGG M AF +I+ +GG+ +E DY YL ++ TC +E+ V
Sbjct: 179 SEQELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQYTCR-SQEKTAAVQ 236
Query: 241 ISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGY 300
IS YQ VPE E SLL+A+ QPVS+ I AS D QFY+GG + G C ++H V A+GY
Sbjct: 237 ISSYQVVPEG-ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGNCADRINHAVTAIGY 294
Query: 301 GKS-KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
G +G Y ++KNSWG WGE GY+++ R++G P GLC I KM+S P
Sbjct: 295 GTDEEGQKYWLLKNSWGTSWGENGYMKIIRDSGDPSGLCDIAKMSSYP 342
>gi|400180349|gb|AFP73313.1| cysteine protease [Solanum peruvianum]
gi|400180469|gb|AFP73371.1| cysteine protease [Solanum peruvianum]
gi|400180471|gb|AFP73372.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 288 bits (738), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 161/347 (46%), Positives = 216/347 (62%), Gaps = 16/347 (4%)
Query: 8 KLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLH 67
K+ L+++ ++LF S+ + G S L+ + E E WMS+HG+ YK EK+
Sbjct: 4 KVDLMNILITLFFVISM-FNTQTRGRSQPKLS----VSERHELWMSRHGRVYKDEVEKVE 58
Query: 68 RFEIFKENLKHIDQRNKEVT-SYWLGLNEFADMSHEEFKNKYLGLK-PQFPTRRQP--SA 123
RF IFKEN+K I+ NK SY LG+NEFAD++ +EF K+ GL P P S
Sbjct: 59 RFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSST 118
Query: 124 EFSYRDVKA--LPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLS 181
E D+ +P ++DWR+ GAVT VK+QG CG CWAFS V ++EG +I +GNL S
Sbjct: 119 ELKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFS 178
Query: 182 EQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTI 241
EQEL+DC T+ N GCNGG M AF +I +GG+ +E DY YL E+ TC +E+ V I
Sbjct: 179 EQELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCR-SQEKTAAVQI 236
Query: 242 SGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYG 301
S YQ VPE E SLL+A+ QPVS+ I AS D QFY+GG + G C ++H V A+GYG
Sbjct: 237 SSYQVVPEG-ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYG 294
Query: 302 KS-KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
KG Y ++KNSWG WGE G++++ R+ G P GLC I KM+S P
Sbjct: 295 TDEKGQKYWLLKNSWGTSWGENGFMKIIRDYGNPAGLCDIAKMSSYP 341
>gi|13432122|sp|P80884.2|ANAN_ANACO RecName: Full=Ananain; Flags: Precursor
gi|2623956|emb|CAA05487.1| Ananain precursor [Ananas comosus]
Length = 345
Score = 288 bits (738), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 135/301 (44%), Positives = 200/301 (66%), Gaps = 6/301 (1%)
Query: 42 DKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQ-RNKEVTSYWLGLNEFADMS 100
D +++ FE WM+++G+ YK +EK+ RF+IFK N+ HI+ N+ SY LG+N+F DM+
Sbjct: 31 DPMMKQFEEWMAEYGRVYKDNDEKMLRFQIFKNNVNHIETFNNRNGNSYTLGINQFTDMT 90
Query: 101 HEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAF 160
+ EF +Y GL +R+P F D+ ++P+S+DWR GAVT VKNQG CGSCWAF
Sbjct: 91 NNEFVAQYTGLSLPLNIKREPVVSFDDVDISSVPQSIDWRDSGAVTSVKNQGRCGSCWAF 150
Query: 161 STVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDY 220
+++A VE I +I GNL SLSEQ+++DC S+ GC GG ++ A+ +I+++ G+ Y
Sbjct: 151 ASIATVESIYKIKRGNLVSLSEQQVLDCAVSY--GCKGGWINKAYSFIISNKGVASAAIY 208
Query: 221 PYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSG 280
PY +GTC+ I+ Y V N+E++++ A+++QP++ A++ASG +FQ Y
Sbjct: 209 PYKAAKGTCKTNGVPNSAY-ITRYTYVQRNNERNMMYAVSNQPIAAALDASG-NFQHYKR 266
Query: 281 GVFTGPCGAELDHGVAAVGYGK-SKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCG 339
GVFTGPCG L+H + +GYG+ S G + IV+NSWG WGE GYIR+ R+ GLCG
Sbjct: 267 GVFTGPCGTRLNHAIVIIGYGQDSSGKKFWIVRNSWGAGWGEGGYIRLARDVSSSFGLCG 326
Query: 340 I 340
I
Sbjct: 327 I 327
>gi|2351107|dbj|BAA21929.1| bromelain [Ananas comosus]
Length = 312
Score = 288 bits (738), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 140/292 (47%), Positives = 194/292 (66%), Gaps = 7/292 (2%)
Query: 52 MSKHGKTYKCIEEKLHRFEIFKENLKHIDQ-RNKEVTSYWLGLNEFADMSHEEFKNKYLG 110
M+++G+ YK +EK+ RF+IFK N+ HI+ N+ SY LG+N+F DM++ EF +Y G
Sbjct: 1 MAEYGRVYKDNDEKMRRFQIFKNNVNHIETFNNRNGNSYTLGINKFTDMTNNEFVAQYTG 60
Query: 111 -LKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGI 169
+ ++P F ++ A+ +S+DWR GAVT VK+Q CGSCWAFS +A VEGI
Sbjct: 61 GISRPLNIEKEPVVSFDDVNISAVGQSIDWRDYGAVTEVKDQNPCGSCWAFSAIATVEGI 120
Query: 170 NQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTC 229
+IV+G L SLSEQE++DC S NGC+GG +D A+ +I+++ G+ E DYPY +G C
Sbjct: 121 YKIVTGYLVSLSEQEVLDCAVS--NGCDGGFVDNAYDFIISNNGVASEADYPYQAYQGDC 178
Query: 230 EDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGA 289
I+GY V NDE S+ A+ +QP++ AI+ASG +FQ+Y+GGVF+GPCG
Sbjct: 179 AANSWPNSAY-ITGYSYVRSNDESSMKYAVWNQPIAAAIDASGDNFQYYNGGVFSGPCGT 237
Query: 290 ELDHGVAAVGYGK-SKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGI 340
L+H + +GYG+ S G+ Y IVKNSWG WGERGYIRM R GLCGI
Sbjct: 238 SLNHAITIIGYGQDSSGTQYWIVKNSWGSSWGERGYIRMARGV-SSSGLCGI 288
>gi|400180461|gb|AFP73367.1| cysteine protease [Solanum peruvianum]
gi|400180473|gb|AFP73373.1| cysteine protease [Solanum peruvianum]
gi|400180475|gb|AFP73374.1| cysteine protease [Solanum peruvianum]
gi|400180479|gb|AFP73376.1| cysteine protease [Solanum peruvianum]
gi|400180481|gb|AFP73377.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 288 bits (737), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 162/347 (46%), Positives = 216/347 (62%), Gaps = 16/347 (4%)
Query: 8 KLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLH 67
K+ L+++ ++LF S+ + G S L+ + E E WMS+HG+ YK EK
Sbjct: 4 KVDLMNILITLFFVISM-FNTQTRGRSQPKLS----VSERHELWMSRHGRVYKDEVEKGE 58
Query: 68 RFEIFKENLKHIDQRNKEVT-SYWLGLNEFADMSHEEFKNKYLGLK-PQFPTRRQP--SA 123
RF IFKEN+K I+ NK SY LG+NEFAD++ +EF K+ GL P P S
Sbjct: 59 RFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSST 118
Query: 124 EFSYRDVKA--LPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLS 181
EF D+ +P ++DWR+ GAVT VK+QG CG CWAFS V ++EG +I +GNL S
Sbjct: 119 EFIINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFS 178
Query: 182 EQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTI 241
EQEL+DC T+ N GCNGG M AF +I +GG+ +E DY YL E+ TC +E+ V I
Sbjct: 179 EQELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCR-SQEKTAAVQI 236
Query: 242 SGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYG 301
S YQ VPE E SLL+A+ QPVS+ I AS D QFY+GG + G C ++H V A+GYG
Sbjct: 237 SSYQVVPEG-ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYG 294
Query: 302 KS-KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
KG Y ++KNSWG WGE G++++ R+ G P GLC I KM+S P
Sbjct: 295 TDEKGQKYWLLKNSWGTSWGENGFMKIIRDYGNPAGLCDIAKMSSYP 341
>gi|400180447|gb|AFP73360.1| cysteine protease [Solanum chilense]
Length = 345
Score = 288 bits (737), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 158/348 (45%), Positives = 216/348 (62%), Gaps = 17/348 (4%)
Query: 8 KLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLH 67
K+ L+++ ++LF S+ + G S L+ + E E WMS+HG+ YK EK
Sbjct: 4 KVDLMNILITLFFVISM-FNTQTRGRSQPKLS----VSERHELWMSRHGRVYKDEVEKGE 58
Query: 68 RFEIFKENLKHIDQRNKEVT-SYWLGLNEFADMSHEEFKNKYLGLK-PQFPTRRQPSAEF 125
RF IFKEN+K I+ NK SY LG+NEFAD++ +EF K+ GL P P +
Sbjct: 59 RFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSST 118
Query: 126 SYRDVKAL-----PKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSL 180
++ + L P ++DWR+ GAVT VK+QG CG CWAFS V ++EG +I +G L
Sbjct: 119 EFKKINDLSDDDMPSNLDWRESGAVTQVKHQGQCGCCWAFSAVGSLEGAYKIATGKLMEF 178
Query: 181 SEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVT 240
SEQEL+DC T+ N GCNGG M AF +I+ +GG+ +E DY YL E+ TC +E+ V
Sbjct: 179 SEQELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCR-SQEKTAAVQ 236
Query: 241 ISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGY 300
IS YQ VPE E SLL+A+ QPVS+ I AS D QFY+GG + G C ++H V A+GY
Sbjct: 237 ISSYQVVPEG-ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGY 294
Query: 301 GKS-KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
G KG Y ++KNSWG WGE G++++ R++G P GLC I KM+S P
Sbjct: 295 GTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 342
>gi|18403438|ref|NP_565780.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|2342728|gb|AAB67626.1| cysteine proteinase [Arabidopsis thaliana]
gi|330253821|gb|AEC08915.1| cysteine proteinase-like protein [Arabidopsis thaliana]
Length = 345
Score = 288 bits (737), Expect = 3e-75, Method: Compositional matrix adjust.
Identities = 143/313 (45%), Positives = 205/313 (65%), Gaps = 11/313 (3%)
Query: 44 LIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT-SYWLGLNEFADMSHE 102
+++ E WM++ + Y+ EK R ++FK+NLK I+ NK+ SY LG+NEFAD ++E
Sbjct: 35 MVDKHEQWMARFSREYRDELEKNMRRDVFKKNLKFIENFNKKGNKSYKLGVNEFADWTNE 94
Query: 103 EFKNKYLGLK------PQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGS 156
EF + GLK P + S++ ++ + +S DWR +GAVTPVK QG CG
Sbjct: 95 EFLAIHTGLKGLTEVSPSKVVAKTISSQ-TWNVSDMVVESKDWRAEGAVTPVKYQGQCGC 153
Query: 157 CWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHK 216
CWAFS VAAVEG+ +I GNL SLSEQ+L+DCD ++ GC+GG+M AF Y+V + G+
Sbjct: 154 CWAFSAVAAVEGVAKIAGGNLVSLSEQQLLDCDREYDRGCDGGIMSDAFNYVVQNRGIAS 213
Query: 217 EEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQ 276
E DY Y +G C + ISG+Q VP N+E++LL+A++ QPVSV+++A+G F
Sbjct: 214 ENDYSYQGSDGGC--RSNARPAARISGFQTVPSNNERALLEAVSRQPVSVSMDATGDGFM 271
Query: 277 FYSGGVFTGPCGAELDHGVAAVGYGKSK-GSDYIIVKNSWGPKWGERGYIRMKRNTGKPE 335
YSGGV+ GPCG +H V VGYG S+ G+ Y + KNSWG WGE+GYIR++R+ P+
Sbjct: 272 HYSGGVYDGPCGTSSNHAVTFVGYGTSQDGTKYWLAKNSWGETWGEKGYIRIRRDVAWPQ 331
Query: 336 GLCGINKMASIPL 348
G+CG+ + A P+
Sbjct: 332 GMCGVAQYAFYPV 344
>gi|255078398|ref|XP_002502779.1| cysteine endopeptidase [Micromonas sp. RCC299]
gi|226518045|gb|ACO64037.1| cysteine endopeptidase [Micromonas sp. RCC299]
Length = 414
Score = 288 bits (737), Expect = 3e-75, Method: Compositional matrix adjust.
Identities = 151/317 (47%), Positives = 196/317 (61%), Gaps = 13/317 (4%)
Query: 44 LIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT----SYWLGLNEFADM 99
L +LF W KHGKTY EEK R +IF +N + + + N E ++++GLN AD+
Sbjct: 64 LSDLFHEWTQKHGKTYDSEEEKELRLKIFADNHEFVQKHNAEYENGEHTHFVGLNHLADL 123
Query: 100 SHEEFKNKYLGLKPQFPTRRQP--SAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSC 157
+ +EFK K LG R P ++ + Y DV P+ +DW GAVTPVKNQ CGSC
Sbjct: 124 TKDEFK-KMLGYNAALRASRAPVDASTWEYADVTP-PEEIDWVASGAVTPVKNQKQCGSC 181
Query: 158 WAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKE 217
WAFST AVEG+N I +G L SLSE+ELI C T+ N GCNGGLMD F++IV + G+ E
Sbjct: 182 WAFSTTGAVEGVNAIKTGKLISLSEEELISCSTNGNMGCNGGLMDNGFEWIVNNRGIDTE 241
Query: 218 EDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQF 277
+ + Y+ +E C + V I G++DVP NDE SL+KA++ QPVSVAIEA FQ
Sbjct: 242 DGWEYVAKEEKCGFFRRHHRAVAIDGFKDVPSNDEDSLMKAVSQQPVSVAIEADHQSFQL 301
Query: 278 YSGGVFTGP-CGAELDHGVAAVGYG----KSKGSDYIIVKNSWGPKWGERGYIRMKRNTG 332
Y+GGV++ CG ELDHGV VGYG +K + +KNSWGP WGE GYIR+ +
Sbjct: 302 YAGGVYSAKDCGTELDHGVLLVGYGVDPKSTKHKHFWKIKNSWGPAWGEDGYIRIAKGGS 361
Query: 333 KPEGLCGINKMASIPLK 349
EG CG+ S P K
Sbjct: 362 GVEGQCGVAMQPSYPTK 378
>gi|313118762|gb|ADR32293.1| C14 cysteine protease [Solanum stoloniferum]
Length = 217
Score = 288 bits (736), Expect = 4e-75, Method: Compositional matrix adjust.
Identities = 128/216 (59%), Positives = 162/216 (75%)
Query: 134 PKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFN 193
P SVDWR KG + VK+QGSCGSCWAFS VAA+E IN IV+GNL SLSEQEL+DCD S+N
Sbjct: 2 PVSVDWRDKGVLVGVKDQGSCGSCWAFSAVAAMESINAIVTGNLISLSEQELVDCDKSYN 61
Query: 194 NGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQ 253
GC+GGLMDYAF++++ +GG+ EEDYPY C+ ++ +VV I Y+DVP N+E+
Sbjct: 62 EGCDGGLMDYAFEFVINNGGIDSEEDYPYKERNDVCDQYRKNAKVVKIDSYEDVPVNNEK 121
Query: 254 SLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKN 313
+L KA+AHQPVS+A+EA G DFQ Y G+FTG CG +DHGV A GYG G DY IV+N
Sbjct: 122 ALQKAVAHQPVSIALEAGGRDFQHYKSGIFTGKCGTAVDHGVVAAGYGTENGMDYWIVRN 181
Query: 314 SWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
SWG KWGE+GY+R++RN + GLCG+ S P+K
Sbjct: 182 SWGAKWGEKGYLRVQRNIARSSGLCGLATEPSYPVK 217
>gi|400180379|gb|AFP73328.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 288 bits (736), Expect = 4e-75, Method: Compositional matrix adjust.
Identities = 160/347 (46%), Positives = 217/347 (62%), Gaps = 16/347 (4%)
Query: 8 KLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLH 67
K+ L+++ ++LF ++ + G S L+ + E E WMS+HG+ YK EK
Sbjct: 4 KVDLMNILITLFFVITM-FNTQTRGRSQPKLS----VSERHELWMSRHGRVYKDEVEKGE 58
Query: 68 RFEIFKENLKHIDQRNKEVT-SYWLGLNEFADMSHEEFKNKYLGLK-PQFPTRRQP--SA 123
RF IFKEN+K I+ NK SY LG+NEFAD++ +EF K+ GL P P S
Sbjct: 59 RFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSST 118
Query: 124 EFSYRDVKA--LPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLS 181
EF D+ +P ++DWR+ GAVT VK+QG CG CWAFS V ++EG +I +GNL S
Sbjct: 119 EFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFS 178
Query: 182 EQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTI 241
EQEL+DC T+ N GC+GG M AF +I +GG+ +E DY YL E+ TC +E+ V I
Sbjct: 179 EQELLDCTTN-NYGCDGGFMTNAFDFIKENGGISRESDYEYLGEQYTCR-SQEKTAAVQI 236
Query: 242 SGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYG 301
S YQ VPE E SLL+A+ QPVS+ I AS D QFY+GG + G C ++H V A+GYG
Sbjct: 237 SSYQVVPEG-ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYG 294
Query: 302 KS-KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
KG Y ++KNSWG WGE G++++ R++G P GLC I KM+S P
Sbjct: 295 TDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|400180391|gb|AFP73334.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 287 bits (735), Expect = 4e-75, Method: Compositional matrix adjust.
Identities = 161/347 (46%), Positives = 214/347 (61%), Gaps = 16/347 (4%)
Query: 8 KLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLH 67
K+ L+S+ ++LF S+ + + P+ S E E WMS+HG+ YK EK
Sbjct: 4 KVDLMSILITLFFVISMFNSQTRARSQPKLSVS-----ERHELWMSRHGRVYKDEVEKGE 58
Query: 68 RFEIFKENLKHIDQRNKEVT-SYWLGLNEFADMSHEEFKNKYLGLK-PQFPTRRQP--SA 123
RF IFKEN+K I+ NK SY LG+NEFAD++ +EF K+ GL P P S
Sbjct: 59 RFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSST 118
Query: 124 EFSYRDVKA--LPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLS 181
EF D+ +P ++DWR+ GAVT VK+QG CG CWAFS V ++EG +I +GNL S
Sbjct: 119 EFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFS 178
Query: 182 EQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTI 241
EQEL+DC T+ N GCNGG M AF +I +GG+ +E DY YL E+ TC +E+ V I
Sbjct: 179 EQELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCR-SQEKTAAVQI 236
Query: 242 SGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYG 301
S YQ VPE E SLL+A+ QPVS+ I AS D QF +GG + G C ++H V A+GYG
Sbjct: 237 SSYQVVPEG-ETSLLQAVTKQPVSIGIAAS-QDLQFCAGGTYDGSCADRINHAVTAIGYG 294
Query: 302 KS-KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
KG Y ++KNSWG WGE G++++ R+ G P GLC I KM+S P
Sbjct: 295 TDEKGQKYWLLKNSWGTSWGENGFMKIIRDYGNPAGLCDIAKMSSYP 341
>gi|242092704|ref|XP_002436842.1| hypothetical protein SORBIDRAFT_10g009850 [Sorghum bicolor]
gi|241915065|gb|EER88209.1| hypothetical protein SORBIDRAFT_10g009850 [Sorghum bicolor]
Length = 296
Score = 287 bits (735), Expect = 4e-75, Method: Compositional matrix adjust.
Identities = 146/311 (46%), Positives = 196/311 (63%), Gaps = 24/311 (7%)
Query: 44 LIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT-SYWLGLNEFADMSHE 102
++ E WM ++ + YK EK RFE+FK N+K I+ N +WLG+N+FAD++++
Sbjct: 1 MVARHEQWMVQYSRVYKDATEKAQRFEVFKSNVKFIESFNAGGNRKFWLGVNQFADLTND 60
Query: 103 EFK----NKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCW 158
EF+ NK G KP P + + V ALP ++DWR KGAVTP+K+QG C
Sbjct: 61 EFRATKTNK--GFKPS-PVKVPTGFRYENISVDALPATIDWRTKGAVTPIKDQGQC---- 113
Query: 159 AFSTVAAVEGINQIVSGNLTSLSEQELIDCDT-SFNNGCNGGLMDYAFKYIVASGGLHKE 217
EGI +I +G L SLSEQEL+DCD + GC GGLMD AFK+I+ GGL E
Sbjct: 114 --------EGIVKISTGKLISLSEQELVDCDVHGEDQGCEGGLMDDAFKFIIKKGGLTTE 165
Query: 218 EDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQF 277
YPY +G C K V T+ G++DVP NDE SL+KA+A+QPVSVA++ FQF
Sbjct: 166 SSYPYTAADGKC--KSGSNSVATVKGFEDVPANDEASLMKAVANQPVSVAVDGGDMTFQF 223
Query: 278 YSGGVFTGPCGAELDHGVAAVGYGK-SKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEG 336
YSGGV TG CG +LDHG+AA+GYG+ S G+ Y ++KNSWG WGE GY+RM+++ G
Sbjct: 224 YSGGVMTGSCGTDLDHGIAAIGYGQTSDGTKYWLLKNSWGTTWGENGYLRMEKDISDKRG 283
Query: 337 LCGINKMASIP 347
+CG+ S P
Sbjct: 284 MCGLAMEPSYP 294
>gi|350535639|ref|NP_001233949.1| phytophthora-inhibited protease 1 [Solanum lycopersicum]
gi|108937128|gb|ABG23376.1| phytophthora-inhibited protease 1 [Solanum lycopersicum]
Length = 345
Score = 287 bits (735), Expect = 5e-75, Method: Compositional matrix adjust.
Identities = 151/332 (45%), Positives = 211/332 (63%), Gaps = 17/332 (5%)
Query: 28 FSIVGYSPEHLTSMD----KLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRN 83
FSI+ P +TS + ++E E+WM HG+ YK EK HRF+ FKEN++ I+ N
Sbjct: 17 FSILSLYPFIVTSRNLKELSMLERHENWMVHHGRVYKDDIEKEHRFKTFKENVEFIESFN 76
Query: 84 KEVTS-YWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSA---EFSYRDVKALPKSVDW 139
K T Y L +N++AD++ EEF ++GL ++++ +A F Y V +P S+DW
Sbjct: 77 KNGTQRYKLAVNKYADLTTEEFTTSFMGLDTSLLSQQESTATTTSFKYDSVTEVPNSMDW 136
Query: 140 RKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGG 199
RK+G+VT VK+QG CG CWAFS AA+EG QI + L SLSEQ+L+DC T N GC GG
Sbjct: 137 RKRGSVTGVKDQGVCGCCWAFSAAAAIEGAYQIANNELISLSEQQLLDCSTQ-NKGCEGG 195
Query: 200 LMDYAFKYIVAS--GGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLK 257
LM A+ +++ + GG+ E +YPY + C K E+ VTI+GY+ VP +DE SLLK
Sbjct: 196 LMTVAYDFLLQNNGGGITTETNYPYEEAQNVC--KTEQPAAVTINGYEVVP-SDESSLLK 252
Query: 258 ALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSK--GSDYIIVKNSW 315
A+ +QP+SV I A+ +F Y G++ G C + L+H V +GYG S+ G+ Y IVKNSW
Sbjct: 253 AVVNQPISVGI-AANDEFHMYGSGIYDGSCNSRLNHAVTVIGYGTSEEDGTKYWIVKNSW 311
Query: 316 GPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
G WGE GY+R+ R+ G G CGI K+AS P
Sbjct: 312 GSDWGEEGYMRIARDVGVDGGHCGIAKVASFP 343
>gi|400180369|gb|AFP73323.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 287 bits (735), Expect = 5e-75, Method: Compositional matrix adjust.
Identities = 160/347 (46%), Positives = 217/347 (62%), Gaps = 16/347 (4%)
Query: 8 KLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLH 67
K+ L+++ ++LF S+ + G S L+ + E E WMS+HG+ YK EK
Sbjct: 4 KVDLMNILITLFFVISM-FNTQTRGRSQPKLS----VSERHELWMSRHGRVYKDEVEKGE 58
Query: 68 RFEIFKENLKHIDQRNKEVT-SYWLGLNEFADMSHEEFKNKYLGLK-PQFPTRRQP--SA 123
RF IFKEN+K I+ NK SY LG+NEFAD++ +EF K+ GL P P S
Sbjct: 59 RFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSST 118
Query: 124 EFSYRDVKA--LPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLS 181
EF D+ +P ++DWR+ GAVT VK+QG CG CWAFS V ++EG +I +GNL S
Sbjct: 119 EFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFS 178
Query: 182 EQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTI 241
EQEL+DC T+ N GCNGG M AF +I +GG+ +E DY YL ++ TC +E+ V I
Sbjct: 179 EQELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGQQYTCR-SQEKTAAVQI 236
Query: 242 SGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYG 301
S Y+ VPE E SLL+A+ QPVS+ I AS D QFY+GG + G C ++H V A+GYG
Sbjct: 237 SSYKVVPEG-ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYG 294
Query: 302 KS-KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
KG Y ++KNSWG WGE G++++ R++G P GLC I KM+S P
Sbjct: 295 TDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|400180451|gb|AFP73362.1| cysteine protease [Solanum chilense]
Length = 344
Score = 287 bits (734), Expect = 5e-75, Method: Compositional matrix adjust.
Identities = 160/347 (46%), Positives = 215/347 (61%), Gaps = 16/347 (4%)
Query: 8 KLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLH 67
K+ L+++ ++LF S+ + + PE S E E WMS+HG+ YK EK
Sbjct: 4 KVDLMNILITLFFVISMFNTQTRGRSQPELSVS-----ERHELWMSRHGRVYKDEVEKGE 58
Query: 68 RFEIFKENLKHIDQRNKEVT-SYWLGLNEFADMSHEEFKNKYLGLK-PQFPTRRQP--SA 123
RF IFKEN+K I+ NK SY LG+NEFAD++ +EF K+ GL P P S
Sbjct: 59 RFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSST 118
Query: 124 EFSYRDVKA--LPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLS 181
EF D+ +P ++DWR+ GAVT VK+QG CG CWAFS V ++EG +I +GNL S
Sbjct: 119 EFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFS 178
Query: 182 EQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTI 241
EQEL+DC T+ N GC+GG M AF +I +GG+ E DY YL ++ TC +E+ V I
Sbjct: 179 EQELLDCTTN-NYGCDGGFMTNAFDFIKENGGISSESDYEYLGQQYTCR-SQEKTAAVQI 236
Query: 242 SGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYG 301
S YQ VPE E SLL+A+ QPVS+ I AS D QFY+GG + G C ++H V A+GYG
Sbjct: 237 SSYQVVPEG-ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYG 294
Query: 302 KS-KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
KG Y ++KNSWG WGE G++++ R++G P GLC I KM+S P
Sbjct: 295 TDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGDPSGLCDIAKMSSYP 341
>gi|400180445|gb|AFP73359.1| cysteine protease, partial [Solanum chilense]
Length = 345
Score = 287 bits (734), Expect = 6e-75, Method: Compositional matrix adjust.
Identities = 160/347 (46%), Positives = 215/347 (61%), Gaps = 16/347 (4%)
Query: 8 KLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLH 67
K+ L+++ ++LF S+ + + PE S E E WMS+HG+ YK EK
Sbjct: 4 KVDLMNILITLFFVISMFNTQTRGRSQPELSVS-----ERHELWMSRHGRVYKDEVEKGE 58
Query: 68 RFEIFKENLKHIDQRNKEVT-SYWLGLNEFADMSHEEFKNKYLGLK-PQFPTRRQP--SA 123
RF IFKEN+K I+ NK SY LG+NEFAD++ +EF K+ GL P P S
Sbjct: 59 RFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSST 118
Query: 124 EFSYRDVKA--LPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLS 181
EF D+ +P ++DWR+ GAVT VK+QG CG CWAFS V ++EG +I +GNL S
Sbjct: 119 EFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFS 178
Query: 182 EQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTI 241
EQEL+DC T+ N GC+GG M AF +I +GG+ E DY YL ++ TC +E+ V I
Sbjct: 179 EQELLDCTTN-NYGCDGGFMTNAFDFIKENGGISSESDYEYLGQQYTCR-SQEKTAAVQI 236
Query: 242 SGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYG 301
S YQ VPE E SLL+A+ QPVS+ I AS D QFY+GG + G C ++H V A+GYG
Sbjct: 237 SSYQVVPEG-ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYG 294
Query: 302 KS-KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
KG Y ++KNSWG WGE G++++ R++G P GLC I KM+S P
Sbjct: 295 TDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGDPSGLCDIAKMSSYP 341
>gi|260516678|gb|ACX43965.1| cysteine protease 1 [Brachiaria hybrid cultivar]
Length = 338
Score = 287 bits (734), Expect = 6e-75, Method: Compositional matrix adjust.
Identities = 161/317 (50%), Positives = 199/317 (62%), Gaps = 15/317 (4%)
Query: 36 EHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEV-TSYWLGLN 94
E + S L ++F ++M ++ K Y E RF FK N++ I N SY +GLN
Sbjct: 30 EEVPSEVMLQDMFTAFMKQYSKAYSHAEFS-SRFNQFKANVETIRLHNTLANASYTMGLN 88
Query: 95 EFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSC 154
EFAD+S EEFK KY G K R + +++V+A P S+DWR AVTP+K+QG C
Sbjct: 89 EFADLSFEEFKGKYFGYKH--VEREFARSNNLHQEVEAAPTSIDWRTSNAVTPIKDQGQC 146
Query: 155 GSCWAFSTVAAVEGINQIVSG--NLTSLSEQELIDCDTSFNN-GCNGGLMDYAFKYIVAS 211
GSCWAFS ++EG ++ G LTSLSEQ+L+DC TS+ + GCNGGLMDYAF+YI+A+
Sbjct: 147 GSCWAFSATGSIEGA-WVLQGKHTLTSLSEQQLVDCSTSYGDAGCNGGLMDYAFEYIIAN 205
Query: 212 GGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAIEA 270
G+ E YPY G C+ K +VVTISGY+DV DE SLL A+ PVSVAIEA
Sbjct: 206 KGICAESAYPYKGVGGLCQ--KSCTKVVTISGYKDVASGDEASLLNAVGTVGPVSVAIEA 263
Query: 271 SGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRN 330
FQFYS GVF+G CG LDHGV AVGYG + DY IVKNSWG WGE GYIRM RN
Sbjct: 264 DQAGFQFYSSGVFSGTCGHNLDHGVLAVGYGTTGSQDYWIVKNSWGTSWGESGYIRMIRN 323
Query: 331 TGKPEGLCGINKMASIP 347
+ CGI S P
Sbjct: 324 KNQ----CGIAIQPSYP 336
>gi|313118764|gb|ADR32294.1| C14 cysteine protease [Solanum stoloniferum]
Length = 217
Score = 287 bits (734), Expect = 6e-75, Method: Compositional matrix adjust.
Identities = 128/216 (59%), Positives = 161/216 (74%)
Query: 134 PKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFN 193
P SVDWR KG + VK+QGSCGSCWAFS VAA+E IN IV+GNL SLSEQEL+DCD S+N
Sbjct: 2 PVSVDWRDKGVLVGVKDQGSCGSCWAFSAVAAMESINAIVTGNLISLSEQELVDCDKSYN 61
Query: 194 NGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQ 253
GC+GGLMDYAF++++ +GG+ EEDYPY G C+ ++ +VV I Y+DVP N+E+
Sbjct: 62 QGCDGGLMDYAFEFVINNGGIDSEEDYPYKERNGVCDQYRKNAKVVVIDSYEDVPVNNEK 121
Query: 254 SLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKN 313
+L KA+AHQPVS+A+EA G DFQ Y G+FTG CG +DHGV A GYG G DY IV+N
Sbjct: 122 ALQKAVAHQPVSIALEAGGRDFQHYKSGIFTGKCGTAVDHGVVAAGYGTENGLDYWIVRN 181
Query: 314 SWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
SWG WGE+GY+R++RN GLCG+ S P+K
Sbjct: 182 SWGADWGEKGYLRVQRNVASSSGLCGLAIEPSYPVK 217
>gi|226508570|ref|NP_001141984.1| uncharacterized protein LOC100274134 precursor [Zea mays]
gi|194706676|gb|ACF87422.1| unknown [Zea mays]
gi|413920745|gb|AFW60677.1| vignain [Zea mays]
Length = 363
Score = 287 bits (734), Expect = 6e-75, Method: Compositional matrix adjust.
Identities = 142/314 (45%), Positives = 206/314 (65%), Gaps = 17/314 (5%)
Query: 44 LIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKE-VTSYWLGLNEFADMSHE 102
++ ++ WM+++ + YK EK HRF++FK N + ID+ N Y LG N+FAD++ +
Sbjct: 55 MMARYKKWMAQYRRKYKDDAEKAHRFQVFKANAEFIDRSNAGGKKKYVLGTNQFADLTSK 114
Query: 103 EFKNKYLGLK--PQFPT--RRQPSAEFSYRDVKALPKSV--DWRKKGAVTPVKNQGSCGS 156
EF Y GL+ P+ ++ P+A Y++ L V DWR++GAVTPVKNQG CG
Sbjct: 115 EFAAMYTGLRKPAAVPSGAKQIPAAGSKYQNFTRLDDDVQVDWRQQGAVTPVKNQGQCGC 174
Query: 157 CWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTS-FNNGCNGGLMDYAFKYIVASGGLH 215
CWAFS V A+EG+ I +GNL SLSEQ+++DCD S N GCNGG MD AF+Y++ +GG+
Sbjct: 175 CWAFSAVGAMEGLIMITTGNLVSLSEQQILDCDESDGNQGCNGGYMDNAFQYVINNGGVT 234
Query: 216 KEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDF 275
E+ YPY +GTC++ + TISG+QD+P DE +L A+A+QPVSV ++ + F
Sbjct: 235 TEDAYPYSAVQGTCQNVQ---PAATISGFQDLPSGDENALANAVANQPVSVGVDGGSSPF 291
Query: 276 QFYSGGVFTGP-CGAELDHGVAAVGYG-KSKGSDYIIVKNSWGPKWGERGYIRMKRNTGK 333
QFY GG++ G CG +++H V A+GYG +G+ Y I+KNSWG WGE G+++++ G
Sbjct: 292 QFYQGGIYDGDGCGTDMNHAVTAIGYGADDQGTQYWILKNSWGTGWGENGFMQLQMGVGA 351
Query: 334 PEGLCGINKMASIP 347
CGI+ MAS P
Sbjct: 352 ----CGISTMASYP 361
>gi|400180465|gb|AFP73369.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 287 bits (734), Expect = 6e-75, Method: Compositional matrix adjust.
Identities = 161/347 (46%), Positives = 216/347 (62%), Gaps = 16/347 (4%)
Query: 8 KLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLH 67
K+ L+++ ++LF S+ + G S L+ + E E WMS+HG+ YK EK
Sbjct: 4 KVDLMNILITLFFVISM-FNTQTRGRSQPKLS----VSERHELWMSRHGRVYKDEVEKGE 58
Query: 68 RFEIFKENLKHIDQRNKEVT-SYWLGLNEFADMSHEEFKNKYLGLK-PQFPTRRQP--SA 123
RF IFKEN+K I+ NK SY LG+NEFAD++ +EF K+ GL P P S
Sbjct: 59 RFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSST 118
Query: 124 EFSYRDVKA--LPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLS 181
EF D+ +P ++DWR+ GAVT VK+QG CG CWAFS V ++E +I +GNL S
Sbjct: 119 EFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEVAYKIATGNLMEFS 178
Query: 182 EQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTI 241
EQEL+DC T+ N GCNGG M AF +I +GG+ +E DY YL E+ TC +E+ V I
Sbjct: 179 EQELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCR-SQEKTAAVQI 236
Query: 242 SGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYG 301
S YQ VPE E SLL+A+ QPVS+ I AS D QFY+GG + G C ++H V A+GYG
Sbjct: 237 SSYQVVPEG-ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYG 294
Query: 302 KS-KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
KG Y ++KNSWG WGE G++++ R++G P GLC I KM+S P
Sbjct: 295 TDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPAGLCDIAKMSSYP 341
>gi|21070926|gb|AAM34401.1|AF377947_7 putative cysteine proteinase [Oryza sativa Japonica Group]
gi|31712050|gb|AAP68356.1| putative cysteine protease [Oryza sativa Japonica Group]
gi|40538988|gb|AAR87245.1| putative cysteine protease [Oryza sativa Japonica Group]
gi|108711126|gb|ABF98921.1| Papain family cysteine protease containing protein, expressed
[Oryza sativa Japonica Group]
gi|125545747|gb|EAY91886.1| hypothetical protein OsI_13535 [Oryza sativa Indica Group]
Length = 350
Score = 287 bits (734), Expect = 7e-75, Method: Compositional matrix adjust.
Identities = 152/310 (49%), Positives = 190/310 (61%), Gaps = 12/310 (3%)
Query: 49 ESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT-----SYWLGLNEFADMSHEE 103
E WM+KHGKTYK EEK R E+F+ N K ID N + L N FAD++ +E
Sbjct: 43 EKWMAKHGKTYKDEEEKARRLEVFRANAKLIDSFNAAAEKDGGGGHRLATNRFADLTDDE 102
Query: 104 FKNKYLGLKPQFPTRRQPSAEFSYRD--VKALPKSVDWRKKGAVTPVKNQGSCGSCWAFS 161
F+ G + F Y + + A P+S+DWR GAVT VK+QGSCG CWAFS
Sbjct: 103 FRAARTGYQRPPAAVAGAGGGFLYENFSLAAAPQSMDWRAMGAVTGVKDQGSCGCCWAFS 162
Query: 162 TVAAVEGINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKEEDY 220
VAAVEG+ +I +G L SLSEQEL+DCD + GC GGLMD AF+YI GGL E Y
Sbjct: 163 AVAAVEGLAKIRTGQLVSLSEQELVDCDVRGEDQGCEGGLMDTAFQYIARRGGLAAESSY 222
Query: 221 PYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSG 280
PY + +I G+QDVP NDE +L+ A+A QPVSVAI +G F+FY
Sbjct: 223 PYRGVD-GACRAAAGRAAASIRGFQDVPSNDEGALMAAVARQPVSVAINGAGYVFRFYDR 281
Query: 281 GVFTGP-CGAELDHGVAAVGYG-KSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLC 338
GV G CG EL+H V AVGYG S G+ Y ++KNSWG WGE GY+R++R G+ EG C
Sbjct: 282 GVLGGAGCGTELNHAVTAVGYGTASDGTGYWLMKNSWGASWGEGGYVRIRRGVGR-EGAC 340
Query: 339 GINKMASIPL 348
GI +MAS P+
Sbjct: 341 GIAQMASYPV 350
>gi|217072410|gb|ACJ84565.1| unknown [Medicago truncatula]
Length = 328
Score = 286 bits (733), Expect = 7e-75, Method: Compositional matrix adjust.
Identities = 135/230 (58%), Positives = 171/230 (74%), Gaps = 1/230 (0%)
Query: 122 SAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLS 181
S ++ R LP+SVDWRK+GAV VK+Q SCGSCWAFS +AAVEGIN+IV+G+L SLS
Sbjct: 13 SNRYAPRVGDKLPESVDWRKEGAVVGVKDQASCGSCWAFSAIAAVEGINKIVTGDLISLS 72
Query: 182 EQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTI 241
EQEL+DCDTS+N GCNGGLMDYAF++I+++GG+ E+DYPY +G C+ ++ +VVTI
Sbjct: 73 EQELVDCDTSYNEGCNGGLMDYAFEFIISNGGIDSEDDYPYKAVDGRCDQNRKNAKVVTI 132
Query: 242 SGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYG 301
Y+DVP DE +L KA+A+QP++VA+E G +FQ Y GV TG CG LDHGVAAVGYG
Sbjct: 133 DDYEDVPAYDELALQKAVANQPIAVAVEGGGREFQLYEYGVLTGRCGTALDHGVAAVGYG 192
Query: 302 KSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPE-GLCGINKMASIPLKK 350
G DY IV+NSWG WGE+GYIR++RN G CGI S P+K
Sbjct: 193 TENGKDYWIVRNSWGGSWGEQGYIRLERNLASSRAGKCGIAIEPSYPIKN 242
>gi|400180345|gb|AFP73311.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 286 bits (733), Expect = 8e-75, Method: Compositional matrix adjust.
Identities = 159/347 (45%), Positives = 217/347 (62%), Gaps = 16/347 (4%)
Query: 8 KLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLH 67
K+ L+++ ++LF S+ + G S L+ + E E WMS+HG+ YK EK
Sbjct: 4 KVDLMNILITLFFVISM-FNTQTRGRSQPKLS----VSERHELWMSRHGRVYKDEVEKGE 58
Query: 68 RFEIFKENLKHIDQRNKEVT-SYWLGLNEFADMSHEEFKNKYLGLK-PQFPTRRQP--SA 123
RF IFKEN+K I+ NK SY LG+NEFAD++ +EF K+ GL P P S
Sbjct: 59 RFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSST 118
Query: 124 EFSYRDVKA--LPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLS 181
EF D+ +P ++DWR+ GAVT VK+QG CG CWAFS V ++EG +I +GNL S
Sbjct: 119 EFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFS 178
Query: 182 EQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTI 241
EQEL+DC T+ N GC+GG M AF +I+ +GG+ +E DY YL ++ TC +E+ V I
Sbjct: 179 EQELLDCTTN-NYGCDGGFMTNAFDFIIENGGISRESDYEYLGQQYTCR-SQEKTAAVQI 236
Query: 242 SGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYG 301
S YQ VPE E SLL+A+ QPVS+ I AS D QFY+GG + G C ++H V A+GYG
Sbjct: 237 SSYQVVPEG-ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYG 294
Query: 302 KSK-GSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
+ G Y ++KNSWG WGE G++++ R+ G P GLC I KM+S P
Sbjct: 295 TDENGQKYWLLKNSWGTSWGENGFMKIIRDYGNPAGLCDIAKMSSYP 341
>gi|400180363|gb|AFP73320.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 286 bits (733), Expect = 8e-75, Method: Compositional matrix adjust.
Identities = 161/347 (46%), Positives = 215/347 (61%), Gaps = 16/347 (4%)
Query: 8 KLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLH 67
K+ L+++ ++LF S+ + G S L+ + E E WMS+HG+ YK EK
Sbjct: 4 KVDLMNILITLFFVISM-FNTQTRGRSQPKLS----VSERHELWMSRHGRVYKDEVEKGE 58
Query: 68 RFEIFKENLKHIDQRNKEVT-SYWLGLNEFADMSHEEFKNKYLGLK-PQFPTRRQP--SA 123
RF IFKEN+K I+ NK SY LG+NEFAD++ +EF K+ GL P P S
Sbjct: 59 RFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSST 118
Query: 124 EFSYRDVKA--LPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLS 181
E D+ +P ++DWR+ GAVT VK+QG CG CWAFS V ++EG +I +GNL S
Sbjct: 119 ELKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFS 178
Query: 182 EQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTI 241
EQEL+DC T+ N GCNGG M AF +I +GG+ +E DY YL E+ TC +E+ V I
Sbjct: 179 EQELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCR-SQEKTAAVQI 236
Query: 242 SGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYG 301
S YQ VPE E SLL+A+ QPVS+ I AS D QFY+GG + G C ++H V A+GYG
Sbjct: 237 SSYQVVPEG-ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYG 294
Query: 302 KS-KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
KG Y ++KNSWG WGE G++++ R+ G P GLC I KM+S P
Sbjct: 295 TDEKGQKYWLLKNSWGTSWGENGFMKIIRDYGNPAGLCDIAKMSSYP 341
>gi|186701255|gb|ACC91281.1| putative cysteine proteinase [Capsella rubella]
Length = 324
Score = 286 bits (733), Expect = 9e-75, Method: Compositional matrix adjust.
Identities = 153/356 (42%), Positives = 215/356 (60%), Gaps = 41/356 (11%)
Query: 1 MAFFSHSKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTY- 59
M F ++ LSL + S A D S+ + L S +++ +F++WMSKHGKTY
Sbjct: 1 MGFVRLVCMITLSLLIIFLLPPSSAMDLSV---TSGGLRSNEEVGFIFQTWMSKHGKTYT 57
Query: 60 KCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRR 119
+ +K RF+ FK+NL+ IDQ N + SY LGL +FAD++ +E+++ + G P ++
Sbjct: 58 NALGDKEQRFQNFKDNLRFIDQHNAKNLSYRLGLTQFADLTVQEYQDLFSGR----PIQK 113
Query: 120 QPSAEFSYRDV----KALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSG 175
Q + ++R V LP+SVDWR+KGAV+ +K+QG C VE IN+IV+G
Sbjct: 114 QKALRVTHRYVPLAEDQLPQSVDWRQKGAVSEIKDQGRC----------TVESINKIVTG 163
Query: 176 NLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCE-DKKE 234
L SLSEQEL+DC N+GCNGGLMD AF++++ + GL + DYPY +G C ++
Sbjct: 164 ELISLSEQELVDCSID-NHGCNGGLMDSAFQFLINNNGLEYQSDYPYQAVQGYCNHNQNT 222
Query: 235 EMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHG 294
+V+ I GY+DVP N+E SL KA+AHQP G++TGPCG +LDH
Sbjct: 223 SKKVIKIDGYEDVPANNENSLQKAVAHQP-----------------GIYTGPCGTDLDHA 265
Query: 295 VAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLKK 350
V VGYG G DY IV+NSWG WGE GY ++ RN P G+CGI +AS P+K
Sbjct: 266 VVIVGYGTENGQDYWIVRNSWGTVWGEAGYAKIARNFENPTGVCGIAMVASYPIKN 321
>gi|313118766|gb|ADR32295.1| C14 cysteine protease [Solanum demissum]
gi|313118774|gb|ADR32299.1| C14 cysteine protease [Solanum verrucosum]
gi|313118776|gb|ADR32300.1| C14 cysteine protease [Solanum verrucosum]
gi|313118778|gb|ADR32301.1| C14 cysteine protease [Solanum verrucosum]
Length = 217
Score = 286 bits (733), Expect = 9e-75, Method: Compositional matrix adjust.
Identities = 128/216 (59%), Positives = 161/216 (74%)
Query: 134 PKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFN 193
P SVDWR KG + VK+QGSCGSCWAFS VAA+E IN IV+GNL SLSEQEL+DCD S+N
Sbjct: 2 PVSVDWRDKGVLVGVKDQGSCGSCWAFSAVAAMESINAIVTGNLISLSEQELVDCDKSYN 61
Query: 194 NGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQ 253
GC+GGLMDYAF++++ +GG+ EEDYPY C+ ++ +VV I Y+DVP N+E+
Sbjct: 62 EGCDGGLMDYAFEFVINNGGIDSEEDYPYKERNDVCDQYRKNAKVVKIDSYEDVPVNNEK 121
Query: 254 SLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKN 313
+L KA+AHQPVS+A+EA G DFQ Y G+FTG CG +DHGV A GYG G DY IV+N
Sbjct: 122 ALQKAVAHQPVSIALEAGGRDFQHYKSGIFTGKCGTAVDHGVVAAGYGTENGMDYWIVRN 181
Query: 314 SWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
SWG KWGE+GY+R++RN GLCG+ S P+K
Sbjct: 182 SWGAKWGEKGYLRVQRNIASSSGLCGLATEPSYPVK 217
>gi|222632170|gb|EEE64302.1| hypothetical protein OsJ_19139 [Oryza sativa Japonica Group]
Length = 1105
Score = 286 bits (732), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 147/315 (46%), Positives = 189/315 (60%), Gaps = 20/315 (6%)
Query: 48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNK 107
FE+W ++HG++Y E + R + + ++ + +
Sbjct: 38 FEAWCAEHGRSYATPGELVGR------GSRRFAGTTRRSWRRTTARPRRTPLALQRLRGP 91
Query: 108 YLGLKPQFPTR--RQPSAEFSYRD-----------VKALPKSVDWRKKGAVTPVKNQGSC 154
Y P P R R +A RD V A+P +VDWR+ GAVT VK+QGSC
Sbjct: 92 YARRVPA-PRRSGRLAAAGGPGRDGGAPYLGVDGGVGAVPDAVDWRQSGAVTKVKDQGSC 150
Query: 155 GSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGL 214
G+CW+FS A+EGIN+I +G+L SLSEQELIDCD S+N+GC GGLMDYA+K++V +GG+
Sbjct: 151 GACWSFSATGAMEGINKIKTGSLISLSEQELIDCDRSYNSGCGGGLMDYAYKFVVKNGGI 210
Query: 215 HKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTD 274
E DYPY +GTC K + VVTI GY+DVP N+E LL+A+A QPVSV I S
Sbjct: 211 DTEADYPYRETDGTCNKNKLKRRVVTIDGYKDVPANNEDMLLQAVAQQPVSVGICGSARA 270
Query: 275 FQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKP 334
FQ YS G+F GPC LDH + VGYG G DY IVKNSWG WG +GY+ M RNTG
Sbjct: 271 FQLYSKGIFDGPCPTSLDHAILIVGYGSEGGKDYWIVKNSWGESWGMKGYMYMHRNTGNS 330
Query: 335 EGLCGINKMASIPLK 349
G+CGIN+M S P K
Sbjct: 331 NGVCGINQMPSFPTK 345
>gi|3377950|emb|CAA08861.1| cysteine proteinase precursor, AN11 [Ananas comosus]
Length = 357
Score = 286 bits (732), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 137/303 (45%), Positives = 197/303 (65%), Gaps = 8/303 (2%)
Query: 42 DKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRN-KEVTSYWLGLNEFADMS 100
D +++ FE WM+++G+ YK +EK+ RF+IFK N+ HI+ N + SY LG+N+F DM+
Sbjct: 31 DPMMKRFEEWMAEYGRVYKDNDEKMRRFQIFKNNVNHIETFNSRNGNSYTLGINQFTDMT 90
Query: 101 HEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAF 160
+ EF +Y G+ R+P F D+ A+P+S+DWR GAVT VKN CGSCWAF
Sbjct: 91 NNEFVAQYTGVSLPLNIEREPVVSFDDVDISAVPQSIDWRNYGAVTSVKNHIPCGSCWAF 150
Query: 161 STVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDY 220
+ +A VE I +I G L SLSEQ+++DC S+ GC+GG ++ A+ +I+++ G+ Y
Sbjct: 151 AAIATVESIYKIKRGYLISLSEQQVLDCAVSY--GCDGGWVNKAYDFIISNKGVASAAIY 208
Query: 221 PYLME--EGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFY 278
PY +GTC I+GY V N+E+S++ A+++QP++ +IEASG DFQ Y
Sbjct: 209 PYKASQGQGTCRINGVPNSAY-ITGYTRVQSNNERSMMYAVSNQPIAASIEASG-DFQHY 266
Query: 279 SGGVFTGPCGAELDHGVAAVGYGK-SKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGL 337
GVF+GPCG L+H + +GYG+ S G + IV+NSWG WGERGYIRM R+ GL
Sbjct: 267 KRGVFSGPCGTSLNHAITIIGYGQDSSGKKFWIVRNSWGASWGERGYIRMARDVSSSSGL 326
Query: 338 CGI 340
CGI
Sbjct: 327 CGI 329
>gi|157093563|gb|ABV22436.1| cysteine proteinase [Oxyrrhis marina]
Length = 329
Score = 286 bits (732), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 146/306 (47%), Positives = 199/306 (65%), Gaps = 10/306 (3%)
Query: 48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNK 107
+E + +K G++Y EE+ R +F +N++ I++ N + +Y LG+N+FAD++ EEF
Sbjct: 19 WEEFKAKFGESYNGEEEEAERKGVFAQNVQLINEENSKGHTYTLGVNQFADLTVEEFSKT 78
Query: 108 YLGLKPQFPTRRQPSAEFSYRDV---KALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVA 164
Y+G K P ++ A + R V +ALP SVDW +GAVTPVKNQG CGSCW+FST
Sbjct: 79 YMGFKK--PAQKYGDAAYLGRHVYNGEALPTSVDWSSQGAVTPVKNQGQCGSCWSFSTTG 136
Query: 165 AVEGINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKEEDYPYL 223
++EG N+I +G L SLSEQ+ +DC ++ N GCNGGLMD AFKY A+ L E+ YPY
Sbjct: 137 SLEGANEISTGKLVSLSEQQFVDCAGTYGNQGCNGGLMDSAFKYAEANA-LCTEQSYPYK 195
Query: 224 MEEGTCEDKKEEMEVV--TISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGG 281
+G+C+ + ++SGY+DV + EQ ++ A+A QPVS+AIEA + FQ YSGG
Sbjct: 196 GTDGSCQASSCSTGLAKGSVSGYKDVSSDSEQDMMSAVAQQPVSIAIEADKSVFQLYSGG 255
Query: 282 VFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGIN 341
V TG CGA LDHGV AVGYG G+DY VKNSWG WG GY+ ++R G G CG+
Sbjct: 256 VLTGACGASLDHGVLAVGYGTLSGTDYWKVKNSWGSTWGMSGYVLLQRGKGGS-GECGLL 314
Query: 342 KMASIP 347
S P
Sbjct: 315 SEPSYP 320
>gi|330805273|ref|XP_003290609.1| hypothetical protein DICPUDRAFT_92519 [Dictyostelium purpureum]
gi|325079248|gb|EGC32857.1| hypothetical protein DICPUDRAFT_92519 [Dictyostelium purpureum]
Length = 333
Score = 286 bits (731), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 151/305 (49%), Positives = 197/305 (64%), Gaps = 10/305 (3%)
Query: 48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNK 107
F WM KH + Y EE R++ FKEN+ I + N + + LGL +FAD+++EE+K
Sbjct: 33 FIGWMRKHDRAYSH-EEFTDRYQAFKENMDFIHKWNSQESDTVLGLTKFADLTNEEYKKH 91
Query: 108 YLGLKPQFPTRRQPSAEFSYRDVKAL-PKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAV 166
YLG+K + +A+ + K P S+DWR+KGAV+ VK+QG CGSCW+FST AV
Sbjct: 92 YLGIKVNVK-KNLNAAQKGLKFFKFTGPDSIDWREKGAVSQVKDQGQCGSCWSFSTTGAV 150
Query: 167 EGINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKEEDYPYLME 225
EG +QI SGN+ SLSEQ L+DC + N GC GGLM AF+YI+ +GG+ E YPY
Sbjct: 151 EGAHQIKSGNMVSLSEQNLVDCSGQYGNQGCEGGLMVNAFEYIIDNGGIATESSYPYTAA 210
Query: 226 EGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTG 285
+G C+ K M I GY+++P+ +E SL ALA QPVSVAI+AS FQ YS GV+
Sbjct: 211 QGRCKFTK-SMNGANIIGYKEIPQGEEDSLTAALAKQPVSVAIDASHMSFQLYSSGVYDE 269
Query: 286 P-CGAE-LDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKM 343
P C +E LDHGV AVGYG +G DY I+KNSWGP WG+ GYI M RN + CG+ M
Sbjct: 270 PACSSEALDHGVLAVGYGTLEGKDYYIIKNSWGPTWGQDGYIFMSRNA---QNQCGVATM 326
Query: 344 ASIPL 348
AS P+
Sbjct: 327 ASYPI 331
>gi|400180373|gb|AFP73325.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 286 bits (731), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 161/347 (46%), Positives = 215/347 (61%), Gaps = 16/347 (4%)
Query: 8 KLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLH 67
K+ L+++ ++LF S+ + G S L+ + E E WMS+HG YK EK
Sbjct: 4 KVDLMNILITLFFVISM-FNTQTRGRSQPKLS----VSERHELWMSRHGHVYKDEVEKGE 58
Query: 68 RFEIFKENLKHIDQRNKEVT-SYWLGLNEFADMSHEEFKNKYLGLK-PQFPTRRQP--SA 123
RF IFKEN+K I+ NK SY LG+NEFAD++ +EF K+ GL P P S
Sbjct: 59 RFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSST 118
Query: 124 EFSYRDVKA--LPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLS 181
EF D+ +P ++DWR+ GAVT VK+QG CG CWAFS V ++EG +I +GNL S
Sbjct: 119 EFKINDLSDDDMPSNLDWRESGAVTQVKHQGQCGCCWAFSAVGSLEGAYKIATGNLMEFS 178
Query: 182 EQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTI 241
EQEL+DC T+ N GC+GG M AF +I +GG+ E DY YL E+ TC +E+ V I
Sbjct: 179 EQELLDCTTN-NYGCDGGFMTNAFDFIKENGGISSESDYEYLGEQYTCR-SQEKTAAVQI 236
Query: 242 SGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYG 301
S YQ VPE E SLL+A+ QPVS+ I AS D QFY+GG + G C ++H V A+GYG
Sbjct: 237 SSYQVVPEG-ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYG 294
Query: 302 KS-KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
KG Y ++KNSWG WGE G++++ R++G P GLC I KM+S P
Sbjct: 295 TDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPAGLCDIAKMSSYP 341
>gi|440793751|gb|ELR14926.1| Cysteine proteinase 5, putative [Acanthamoeba castellanii str.
Neff]
Length = 326
Score = 286 bits (731), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 155/340 (45%), Positives = 202/340 (59%), Gaps = 24/340 (7%)
Query: 11 LLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFE 70
LL+L ++LF S+ A S D L +F WM +H K+Y EE ++R+
Sbjct: 6 LLALCVALFVASTFA-------------VSHDPLTGVFADWMQEHQKSY-ANEEFVYRWN 51
Query: 71 IFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDV 130
+++EN +I+ N + S+ L +N+F D+++ EF + GL T Q E
Sbjct: 52 VWRENYLYIEAHNHQNKSFHLAMNKFGDLTNAEFNKLFKGLSI---TADQAKQESDIAPA 108
Query: 131 KALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDT 190
LP DWR+KGAVT VKNQG CGSCW+FST + EG N + G LTSLSEQ L+DC T
Sbjct: 109 PGLPADFDWRQKGAVTHVKNQGQCGSCWSFSTTGSTEGANFLKHGRLTSLSEQNLVDCST 168
Query: 191 SF-NNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPE 249
S+ N+GCNGGLMDYAF+YI+ + G+ EE YPY +GTC K+ +S Y +VP
Sbjct: 169 SYGNHGCNGGLMDYAFEYIIRNKGIDTEESYPYHASQGTCRYNKQHSGGELVS-YTNVPS 227
Query: 250 NDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPC--GAELDHGVAAVGYGKSKGSD 307
+E +LL A+A QP SVAI+AS + FQFY GGV+ P + LDHGV AVG+G G D
Sbjct: 228 GNEGALLNAVATQPTSVAIDASHSSFQFYKGGVYDEPACSSSRLDHGVLAVGWGVRDGKD 287
Query: 308 YIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
Y +VKNSWG WG GYI M RN CGI AS P
Sbjct: 288 YWLVKNSWGADWGLSGYIEMSRN---KHNQCGIATAASHP 324
>gi|255635645|gb|ACU18172.1| unknown [Glycine max]
Length = 355
Score = 285 bits (730), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 149/329 (45%), Positives = 206/329 (62%), Gaps = 12/329 (3%)
Query: 10 LLLSLSLSLFACSSLAHDFSIVGYSPEHLTSM-----DKLIELFESWMSKHGKTYKCIEE 64
+ + L +FA SS A D SI+ + H D+++ +FE W+ KH K Y + E
Sbjct: 3 MAIVLLFMVFAVSS-ALDMSIISHDNAHADRATRRTDDEVMSMFEEWLVKHDKVYNALGE 61
Query: 65 KLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGL---KPQFPTRRQP 121
K RF+IFK NL+ ID+RN +Y LGLN FAD+++ E++ YL P+ P
Sbjct: 62 KEKRFQIFKNNLRFIDERNSLNRTYKLGLNVFADLTNAEYRAMYLRTWDDGPRLDLDTPP 121
Query: 122 SAEFSYRDVKALPKSVDWRKKGAVTPVKNQG-SCGSCWAFSTVAAVEGINQIVSGNLTSL 180
+ R +PKSVDWRK+GAVTPVKNQG +C SCWAF+ V AVE + +I +G+L SL
Sbjct: 122 RNRYVPRVGDTIPKSVDWRKEGAVTPVKNQGATCNSCWAFTAVGAVESLVKIKTGDLISL 181
Query: 181 SEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVT 240
SEQE++DC TS + GC GG + + + YI G+ E+DYPY +EG C+ K+ +VT
Sbjct: 182 SEQEVVDCTTSSSRGCGGGDIQHGYIYI-RKNGISLEKDYPYRGDEGKCDSNKKNA-IVT 239
Query: 241 ISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGY 300
I G+ VP E++L + +A+QPV+V I A +FQ+Y+ GVF G CG EL+H + VGY
Sbjct: 240 IDGHGWVPTQLEEALKQGIANQPVAVPIPADDYEFQYYTSGVFKGKCGTELNHALLLVGY 299
Query: 301 GKSKGSDYIIVKNSWGPKWGERGYIRMKR 329
G K DY I KNS+ KWGE GYIR++R
Sbjct: 300 GAEKDGDYWIAKNSYSDKWGENGYIRIQR 328
>gi|400180417|gb|AFP73347.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 285 bits (730), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 157/347 (45%), Positives = 216/347 (62%), Gaps = 16/347 (4%)
Query: 8 KLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLH 67
K+ L+++ +++F S+ + G S L+ + E E WMS+HG+ YK EK
Sbjct: 4 KVDLMNILITVFFVISM-FNTQTRGRSQPKLS----VSERHELWMSRHGRVYKDEVEKGE 58
Query: 68 RFEIFKENLKHIDQRNKEVT-SYWLGLNEFADMSHEEFKNKYLGLK---PQFPTRRQPSA 123
RF IFKEN+K I+ NK SY LG+NEFAD++ +EF K+ GL S
Sbjct: 59 RFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPLSST 118
Query: 124 EFSYRDVKA--LPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLS 181
EF D+ +P ++DWR+ GAVT VK+QG CG CWAFS V ++EG +I +GNL S
Sbjct: 119 EFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFS 178
Query: 182 EQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTI 241
EQEL+DC T+ N GCNGG M AF +I+ +GG+ +E DY YL ++ TC +E+ V I
Sbjct: 179 EQELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQYTCR-SQEKTAAVQI 236
Query: 242 SGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYG 301
S Y+ VPE E SLL+A+ QPVS+ I AS D QFY+GG + G C ++H V A+GYG
Sbjct: 237 SSYKVVPEG-ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYG 294
Query: 302 KS-KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
KG Y ++KNSWG WGE G++++ R++G P GLC I KM+S P
Sbjct: 295 TDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|400180387|gb|AFP73332.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 285 bits (729), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 160/347 (46%), Positives = 215/347 (61%), Gaps = 16/347 (4%)
Query: 8 KLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLH 67
K+ L+++ ++LF S+ + G S L+ + E E WMS+HG+ YK EK+
Sbjct: 4 KVDLMNILITLFFVISM-FNTQTRGRSQPKLS----VSERHELWMSRHGRVYKDEVEKVE 58
Query: 68 RFEIFKENLKHIDQRNKEVT-SYWLGLNEFADMSHEEFKNKYLGLK-PQFPTRRQP--SA 123
RF IFKEN+K I+ NK SY LG+NEFAD++ +EF K+ GL P P S
Sbjct: 59 RFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSST 118
Query: 124 EFSYRDVKA--LPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLS 181
E D+ +P ++DW + GAVT VK+QG CG CWAFS V ++EG +I +GNL S
Sbjct: 119 ELKINDLSDDDMPSNLDWIESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFS 178
Query: 182 EQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTI 241
EQEL+DC T+ N GCNGG M AF +I +GG+ +E DY YL E+ TC +E+ V I
Sbjct: 179 EQELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCR-SQEKTAAVQI 236
Query: 242 SGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYG 301
S YQ VPE E SLL+A+ QPVS+ I AS D QFY+GG + G C ++H V A+GYG
Sbjct: 237 SSYQVVPEG-ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYG 294
Query: 302 KS-KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
KG Y ++KNSWG WGE G++++ R+ G P GLC I KM+S P
Sbjct: 295 TDEKGQKYWLLKNSWGTSWGENGFMKIIRDYGNPAGLCDIAKMSSYP 341
>gi|326514800|dbj|BAJ99761.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 291
Score = 285 bits (729), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 147/221 (66%), Positives = 172/221 (77%), Gaps = 2/221 (0%)
Query: 130 VKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCD 189
V+ +P SVDWR+KGAVT VK+QG CGSCWAFST+AAVEGIN I + NLTSLSEQ+L+DCD
Sbjct: 58 VRDVPSSVDWRQKGAVTAVKDQGQCGSCWAFSTIAAVEGINAIRTKNLTSLSEQQLVDCD 117
Query: 190 TSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPE 249
T N GCNGGLMDYAF+YI GG+ E+ YPY + + +KK VVTI GY+DVP
Sbjct: 118 TKSNAGCNGGLMDYAFQYIAKHGGVAAEDAYPYKARQASSCNKKPSA-VVTIDGYEDVPA 176
Query: 250 NDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKS-KGSDY 308
NDE +L KA+A QPV+VAIEASG+ FQFYS GVF G CG ELDHGVAAVGYG + G+ Y
Sbjct: 177 NDETALKKAVAAQPVAVAIEASGSHFQFYSEGVFAGKCGTELDHGVAAVGYGTTVDGTKY 236
Query: 309 IIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
IVKNSWGP+WGE+GYIRMKR+ EGLCGI AS P+K
Sbjct: 237 WIVKNSWGPEWGEKGYIRMKRDVEDKEGLCGIAMEASYPVK 277
>gi|242049716|ref|XP_002462602.1| hypothetical protein SORBIDRAFT_02g028840 [Sorghum bicolor]
gi|241925979|gb|EER99123.1| hypothetical protein SORBIDRAFT_02g028840 [Sorghum bicolor]
Length = 384
Score = 285 bits (729), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 155/359 (43%), Positives = 201/359 (55%), Gaps = 53/359 (14%)
Query: 42 DKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTS-YWLGLNEFADMS 100
D ++E FE WM +HG+ Y EK R E+++ N+ ++ N Y L N+FAD++
Sbjct: 26 DPMLERFEQWMGRHGRLYADAGEKQRRLEVYRRNVALVETFNSMSNGGYRLADNKFADLT 85
Query: 101 HEEFKNKYLGLKPQFPTRRQPS------------AEFSYRDVKALPKSVDWRKKGAVTPV 148
+EEF+ K LG P R + R LPKSVDWR+KGAV PV
Sbjct: 86 NEEFRAKMLGFGRPPPHGRATGHTTTPGTVACIGSGLGRRYSDELPKSVDWREKGAVAPV 145
Query: 149 KNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYI 208
KNQG CGSCWAFS VAA+EGINQI +G L SLSEQEL+DCDT GC GG M +AF+++
Sbjct: 146 KNQGECGSCWAFSAVAAIEGINQIKNGKLVSLSEQELVDCDTK-AIGCAGGYMSWAFEFV 204
Query: 209 VASGGLHKEEDYPYLME----------------------------EGTCEDKKEEMEVVT 240
+ + GL E +YPY G C+ K + V+
Sbjct: 205 MNNSGLTTERNYPYQGTYAHGNRKTHALPFDCTKGSSTCDSRAGMNGACQTPKLKESAVS 264
Query: 241 ISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGY 300
ISGY +V + E LL+A A QPVSVA++A +Q Y GGVFTGPC A+L+HGV VGY
Sbjct: 265 ISGYVNVTASSEPDLLRAAAAQPVSVAVDAGSFVWQLYGGGVFTGPCTADLNHGVTVVGY 324
Query: 301 GKSK-----------GSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPL 348
G+++ G Y IVKNSWGP+WG+ GYI M+R GLCGI + S P+
Sbjct: 325 GETQRDTDGDGTGVPGQKYWIVKNSWGPEWGDAGYILMQREASVASGLCGIALLPSYPV 383
>gi|400180449|gb|AFP73361.1| cysteine protease [Solanum chilense]
Length = 344
Score = 285 bits (729), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 159/347 (45%), Positives = 216/347 (62%), Gaps = 16/347 (4%)
Query: 8 KLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLH 67
K+ L+++ ++LF S+ + G S L+ + E E WMS+HG+ YK EK
Sbjct: 4 KVDLMNILITLFFVISM-FNTQTRGRSQPKLS----VSERHELWMSRHGRVYKDEVEKGE 58
Query: 68 RFEIFKENLKHIDQRNKEVT-SYWLGLNEFADMSHEEFKNKYLGLK-PQFPTRRQP--SA 123
RF IFK+N+K I+ NK SY LG+NEFAD++ +EF K+ GL P P S
Sbjct: 59 RFMIFKKNMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSST 118
Query: 124 EFSYRDVKA--LPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLS 181
EF D+ +P ++DWR+ GAVT VK+QG CG CWAFS V ++EG +I +G L S
Sbjct: 119 EFKINDLSDDDMPSNLDWRESGAVTQVKHQGQCGCCWAFSAVGSLEGAYKIATGKLMEFS 178
Query: 182 EQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTI 241
EQEL+DC T+ N GCNGG M AF +I+ +GG+ +E DY YL E+ TC +E+ V I
Sbjct: 179 EQELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCR-SQEKTAAVQI 236
Query: 242 SGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYG 301
S YQ VPE E SLL+A+ QPVS+ I AS D QFY+ G + G C ++H V A+GYG
Sbjct: 237 SSYQVVPEG-ETSLLQAVTKQPVSIGIAAS-QDLQFYAEGTYDGSCADRINHAVTAIGYG 294
Query: 302 KS-KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
KG Y ++KNSWG WGE G++++ R++G P GLC I KM+S P
Sbjct: 295 TDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|167521499|ref|XP_001745088.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163776702|gb|EDQ90321.1| predicted protein [Monosiga brevicollis MX1]
Length = 294
Score = 285 bits (729), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 154/305 (50%), Positives = 188/305 (61%), Gaps = 23/305 (7%)
Query: 53 SKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKE----VTSYWLGLNEFADMSHEEFKNKY 108
S + K+Y+ + R F+ NL+ I++ N E + SY +G+NEFAD++ +EF Y
Sbjct: 3 SDYSKSYESEAVEAKRLAAFEANLEFINKHNAEHAQGLHSYTVGVNEFADLTIDEFMALY 62
Query: 109 LGLKPQFPTRRQPSAEFSYRDVKALP----KSVDWRKKGAVTPVKNQGSCGSCWAFSTVA 164
+ P R P Y V LP SVDWR KGAVTP+KNQG CGSCW+FST
Sbjct: 63 V---PSKFNRTMP-----YNTVY-LPATSEDSVDWRTKGAVTPIKNQGQCGSCWSFSTTG 113
Query: 165 AVEGINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKEEDYPYL 223
+ EG + I +GNL SLSEQ+L+DC SF N GCNGGLMD AFKYI+++ GL EEDYPY
Sbjct: 114 STEGAHAIATGNLVSLSEQQLVDCSGSFGNQGCNGGLMDDAFKYIISNKGLDTEEDYPYT 173
Query: 224 MEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVF 283
++GTC +KE TIS Y DVP+N+E L A+A PVSVAIEA + FQ Y GVF
Sbjct: 174 AQDGTCNKEKEAKHAATISSYSDVPKNNEDQLAAAVAKGPVSVAIEADQSGFQLYKSGVF 233
Query: 284 TGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKM 343
G CG LDHGV VGY DY IVKNSWG WG GYI MKR G+CGI
Sbjct: 234 DGNCGTNLDHGVLVVGYTD----DYWIVKNSWGTTWGVEGYINMKRGV-SASGICGIAMQ 288
Query: 344 ASIPL 348
S P+
Sbjct: 289 PSYPI 293
>gi|313118772|gb|ADR32298.1| C14 cysteine protease [Solanum demissum]
Length = 217
Score = 285 bits (728), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 127/216 (58%), Positives = 161/216 (74%)
Query: 134 PKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFN 193
P SVDWR KG + VK+QGSCGSCWAFS VAA+E IN IV+G+L SLSEQEL+DCD S+N
Sbjct: 2 PVSVDWRDKGVLVGVKDQGSCGSCWAFSAVAAMESINAIVTGDLISLSEQELVDCDKSYN 61
Query: 194 NGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQ 253
GC+GGLMDYAF++++ +GG+ EEDYPY C+ ++ +VV I Y+DVP N+E+
Sbjct: 62 QGCDGGLMDYAFEFVINNGGIDTEEDYPYKERNDVCDQYRKNAKVVKIDSYEDVPVNNEK 121
Query: 254 SLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKN 313
+L KA+AHQPVS+A+EA G DFQ Y G+FTG CG +DHGV A GYG G DY IV+N
Sbjct: 122 ALQKAVAHQPVSIALEAGGRDFQHYKSGIFTGKCGTAVDHGVVAAGYGTENGMDYWIVRN 181
Query: 314 SWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
SWG KWGE+GY+R++RN GLCG+ S P+K
Sbjct: 182 SWGAKWGEKGYLRVQRNIASSSGLCGLATEPSYPVK 217
>gi|313118760|gb|ADR32292.1| C14 cysteine protease [Solanum stoloniferum]
Length = 217
Score = 285 bits (728), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 127/216 (58%), Positives = 160/216 (74%)
Query: 134 PKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFN 193
P SVDWR KG + VK+QGSCGSCWAFS VAA+E IN IV+GNL SLSEQEL+DCD S+N
Sbjct: 2 PVSVDWRDKGVLVGVKDQGSCGSCWAFSAVAAMESINAIVTGNLISLSEQELVDCDKSYN 61
Query: 194 NGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQ 253
GC+GGLMDYAF++++ +GG+ EEDYPY C+ ++ +VV I Y+DVP N+E+
Sbjct: 62 EGCDGGLMDYAFEFVINNGGIDSEEDYPYKERNDVCDQYRKNAKVVKIDSYEDVPVNNEK 121
Query: 254 SLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKN 313
+L KA+AHQPVS+A+EA G DFQ Y G+FTG CG +DHGV A GYG G DY IV+N
Sbjct: 122 ALQKAVAHQPVSIALEAGGRDFQHYKSGIFTGKCGTAVDHGVVAAGYGTENGMDYWIVRN 181
Query: 314 SWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
SWG WGE+GY+R++RN GLCG+ S P+K
Sbjct: 182 SWGANWGEKGYLRVQRNIASSSGLCGLATEPSYPVK 217
>gi|30690594|ref|NP_564321.2| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|28393492|gb|AAO42167.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|332192920|gb|AEE31041.1| cysteine proteinase-like protein [Arabidopsis thaliana]
Length = 355
Score = 285 bits (728), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 149/315 (47%), Positives = 203/315 (64%), Gaps = 13/315 (4%)
Query: 44 LIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKE-VTSYWLGLNEFADMSHE 102
+ E + WM++ + Y EK RF++FK+NLK I++ NK+ +Y LG+NEFAD + E
Sbjct: 43 VAEHHQQWMTRFSRVYSDELEKQMRFDVFKKNLKFIEKFNKKGDRTYKLGVNEFADWTRE 102
Query: 103 EFKNKYLGLK-------PQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCG 155
EF + GLK +F PS ++ DV A ++ DWR +GAVTPVK QG CG
Sbjct: 103 EFIATHTGLKGVNGIPSSEFVDEMIPSWNWNVSDV-AGRETKDWRYEGAVTPVKYQGQCG 161
Query: 156 SCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLH 215
CWAFS+VAAVEG+ +IV NL SLSEQ+L+DCD +NGCNGG+M AF YI+ + G+
Sbjct: 162 CCWAFSSVAAVEGLTKIVGNNLVSLSEQQLLDCDRERDNGCNGGIMSDAFSYIIKNRGIA 221
Query: 216 KEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDF 275
E YPY EGTC + I G+Q VP N+E++LL+A++ QPVSV+I+A G F
Sbjct: 222 SEASYPYQAAEGTC--RYNGKPSAWIRGFQTVPSNNERALLEAVSKQPVSVSIDADGPGF 279
Query: 276 QFYSGGVFTGP-CGAELDHGVAAVGYGKS-KGSDYIIVKNSWGPKWGERGYIRMKRNTGK 333
YSGGV+ P CG ++H V VGYG S +G Y + KNSWG WGE GYIR++R+
Sbjct: 280 MHYSGGVYDEPYCGTNVNHAVTFVGYGTSPEGIKYWLAKNSWGETWGENGYIRIRRDVAW 339
Query: 334 PEGLCGINKMASIPL 348
P+G+CG+ + A P+
Sbjct: 340 PQGMCGVAQYAFYPV 354
>gi|297602242|ref|NP_001052232.2| Os04g0203500 [Oryza sativa Japonica Group]
gi|255675217|dbj|BAF14146.2| Os04g0203500 [Oryza sativa Japonica Group]
Length = 336
Score = 285 bits (728), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 149/354 (42%), Positives = 218/354 (61%), Gaps = 30/354 (8%)
Query: 4 FSHSKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIE 63
+ +K LL ++ L CS++ + L+ + E WM+++G+ YK
Sbjct: 1 MAMAKALLFAILGCLCLCSAV--------LAARELSDDAAMAARHERWMAQYGRMYKDDA 52
Query: 64 EKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYL--GLKPQFPTRRQP 121
EK RFE+FK N+ I+ N +WLG+N+FAD++++EF++ G P T R P
Sbjct: 53 EKARRFEVFKANVAFIESFNAGNHKFWLGVNQFADLTNDEFRSTKTNKGFIPS--TTRVP 110
Query: 122 SAEFSYRD----VKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNL 177
+ +R+ + ALP ++DWR KG VTP+K+QG CG CWAFS VAA+EGI ++ +G L
Sbjct: 111 TG---FRNENVNIDALPATMDWRTKGVVTPIKDQGQCGCCWAFSAVAAMEGIVKLSTGKL 167
Query: 178 TSLS-EQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEM 236
S S + L+ T + GC GGLMD AFK+I+ +GGL E +YPY +DK + +
Sbjct: 168 ISHSLNKSLL---TVMSMGCEGGLMDDAFKFIIKNGGLTTESNYPY----AAVDDKFKSV 220
Query: 237 E--VVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHG 294
V +I GY+DVP N+E +L+KA+A+QPVSVA++ FQFY GGV TG CG +LDHG
Sbjct: 221 SNSVASIKGYEDVPANNEAALMKAVANQPVSVAVDGGDMTFQFYKGGVMTGSCGTDLDHG 280
Query: 295 VAAVGYGK-SKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
+ A+GYGK S G+ Y ++KNSWG WGE G++RM+++ G+CG+ S P
Sbjct: 281 IVAIGYGKASDGTKYWLLKNSWGMTWGENGFLRMEKDISDKRGMCGLAMEPSYP 334
>gi|413933049|gb|AFW67600.1| cysteine protease 1 [Zea mays]
Length = 341
Score = 284 bits (727), Expect = 4e-74, Method: Compositional matrix adjust.
Identities = 144/305 (47%), Positives = 195/305 (63%), Gaps = 7/305 (2%)
Query: 49 ESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT-SYWLGLNEFADMSHEEFKNK 107
E WM++HG+ YK EK R E+F+ N + ID N T S+ L N FAD++ EEF+
Sbjct: 39 EKWMAEHGRAYKDEAEKARRLEVFRANAELIDSFNAAGTHSHRLATNRFADLTVEEFRAA 98
Query: 108 YLGLKPQFPTRRQPSAEFSYRD--VKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAA 165
GL+P+ P + F Y + + +SVDWR GAVT VK+QG+CG CWAFS VAA
Sbjct: 99 RTGLRPR-PAPSAGAGRFRYENFSLADAAQSVDWRAMGAVTGVKDQGACGCCWAFSAVAA 157
Query: 166 VEGINQIVSGNLTSLSEQELIDCDTS-FNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLM 224
VEG+N+I +G L SLSEQEL+DCD S + GC+GGLMD AF+++ GGL E YPY
Sbjct: 158 VEGLNKIRTGRLVSLSEQELVDCDVSGVDQGCDGGLMDNAFQFVARRGGLASESGYPYQG 217
Query: 225 EEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFT 284
+G C +I G++DVP N+E +L A+A+QPVSVAI F+FY GV
Sbjct: 218 RDGPCRSSAAAARAASIRGHEDVPRNNEAALAAAVANQPVSVAINGEDMAFRFYDSGVLG 277
Query: 285 GPCGAELDHGVAAVGYGKSK-GSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKM 343
G CG +L+H + AVGYG + G+ Y ++KNSWG WGE GY+R++R + EG+CG+ K+
Sbjct: 278 GACGTDLNHAITAVGYGTANDGTRYWLMKNSWGASWGEGGYVRIRRGV-RGEGVCGLAKL 336
Query: 344 ASIPL 348
S P+
Sbjct: 337 PSYPV 341
>gi|162463334|ref|NP_001104878.1| maize insect resistance2 precursor [Zea mays]
gi|2425064|gb|AAB88262.1| cysteine proteinase Mir2 [Zea mays]
Length = 493
Score = 284 bits (726), Expect = 5e-74, Method: Compositional matrix adjust.
Identities = 144/291 (49%), Positives = 190/291 (65%), Gaps = 7/291 (2%)
Query: 67 HRFEIFKENLKHIDQRNKEVTS----YWLGLNEFADMSHEEFKNKYL-GLKPQFPTRRQP 121
R E+F++NL++ID N E + + LGL FAD++ EE++ + L G + + T
Sbjct: 91 RRLEVFRDNLRYIDAHNAEADAGLHGFRLGLTRFADLTLEEYRARLLLGSRGRNGTAVGV 150
Query: 122 SAEFSYRDV--KALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTS 179
Y + + LP +VDWR++GAV VK+QG CG CWAFS VAAVEGIN+IV+G+L S
Sbjct: 151 VGRRRYLPLAGEQLPDAVDWRERGAVAEVKDQGQCGGCWAFSAVAAVEGINKIVTGSLIS 210
Query: 180 LSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVV 239
LSEQELIDCD + GC+GGLMD AF +++ +GG+ E DYP+ +GTC+ K + VV
Sbjct: 211 LSEQELIDCDKFQDQGCDGGLMDNAFVFMIKNGGIDTEADYPFTGHDGTCDLKLKNTRVV 270
Query: 240 TISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVG 299
+I ++ VP N E++L KA+AHQPVS +IEAS FQ YS G+F G CG LDHGV VG
Sbjct: 271 SIDSFERVPINYERALQKAVAHQPVSASIEASRRAFQLYSSGIFDGRCGTYLDHGVTVVG 330
Query: 300 YGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLKK 350
YG G DY IVKNSWG +WGE GY+RM RN GI P+K+
Sbjct: 331 YGSEGGKDYWIVKNSWGTQWGEAGYVRMARNVRVRPPSAGIAMEPLYPVKE 381
>gi|9502421|gb|AAF88120.1|AC021043_13 Putative cysteine proteinase [Arabidopsis thaliana]
Length = 331
Score = 284 bits (726), Expect = 5e-74, Method: Compositional matrix adjust.
Identities = 149/315 (47%), Positives = 203/315 (64%), Gaps = 13/315 (4%)
Query: 44 LIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKE-VTSYWLGLNEFADMSHE 102
+ E + WM++ + Y EK RF++FK+NLK I++ NK+ +Y LG+NEFAD + E
Sbjct: 19 VAEHHQQWMTRFSRVYSDELEKQMRFDVFKKNLKFIEKFNKKGDRTYKLGVNEFADWTRE 78
Query: 103 EFKNKYLGLK-------PQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCG 155
EF + GLK +F PS ++ DV A ++ DWR +GAVTPVK QG CG
Sbjct: 79 EFIATHTGLKGVNGIPSSEFVDEMIPSWNWNVSDV-AGRETKDWRYEGAVTPVKYQGQCG 137
Query: 156 SCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLH 215
CWAFS+VAAVEG+ +IV NL SLSEQ+L+DCD +NGCNGG+M AF YI+ + G+
Sbjct: 138 CCWAFSSVAAVEGLTKIVGNNLVSLSEQQLLDCDRERDNGCNGGIMSDAFSYIIKNRGIA 197
Query: 216 KEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDF 275
E YPY EGTC + I G+Q VP N+E++LL+A++ QPVSV+I+A G F
Sbjct: 198 SEASYPYQAAEGTC--RYNGKPSAWIRGFQTVPSNNERALLEAVSKQPVSVSIDADGPGF 255
Query: 276 QFYSGGVFTGP-CGAELDHGVAAVGYGKS-KGSDYIIVKNSWGPKWGERGYIRMKRNTGK 333
YSGGV+ P CG ++H V VGYG S +G Y + KNSWG WGE GYIR++R+
Sbjct: 256 MHYSGGVYDEPYCGTNVNHAVTFVGYGTSPEGIKYWLAKNSWGETWGENGYIRIRRDVAW 315
Query: 334 PEGLCGINKMASIPL 348
P+G+CG+ + A P+
Sbjct: 316 PQGMCGVAQYAFYPV 330
>gi|2463584|dbj|BAA22544.1| FBSB precursor [Ananas comosus]
Length = 356
Score = 283 bits (725), Expect = 7e-74, Method: Compositional matrix adjust.
Identities = 142/328 (43%), Positives = 201/328 (61%), Gaps = 12/328 (3%)
Query: 16 LSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKEN 75
L LF C A + P D +++ FE WM ++G+ YK +EK+ RF+IFK N
Sbjct: 10 LFLFLCVMWASPSAASADEPS-----DPMMKRFEEWMVEYGRVYKDNDEKMRRFQIFKNN 64
Query: 76 LKHIDQRN-KEVTSYWLGLNEFADMSHEEFKNKYLG-LKPQFPTRRQPSAEFSYRDVKAL 133
+ HI+ N + SY LG+N+F DM++ EF +Y G + R+P F D+ A+
Sbjct: 65 VNHIETFNSRNENSYTLGINQFTDMTNNEFIAQYTGGISRPLNIEREPVVSFDDVDISAV 124
Query: 134 PKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFN 193
P+S+DWR GAVT VKNQ CG+CWAF+ +A VE I +I G L LSEQ+++DC +
Sbjct: 125 PQSIDWRDYGAVTSVKNQNPCGACWAFAAIATVESIYKIKKGILEPLSEQQVLDCAKGY- 183
Query: 194 NGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQ 253
GC GG AF++I+++ G+ YPY +GTC+ I+GY VP N+E
Sbjct: 184 -GCKGGWEFRAFEFIISNKGVASGAIYPYKAAKGTCKTNGVPNSAY-ITGYARVPRNNES 241
Query: 254 SLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGK-SKGSDYIIVK 312
S++ A++ QP++VA++A+ +FQ+Y GVF GPCG L+H V A+GYG+ S G Y IVK
Sbjct: 242 SMMYAVSKQPITVAVDANA-NFQYYKSGVFNGPCGTSLNHAVTAIGYGQDSNGKKYWIVK 300
Query: 313 NSWGPKWGERGYIRMKRNTGKPEGLCGI 340
NSWG +WGE GYIRM R+ G+CGI
Sbjct: 301 NSWGARWGEAGYIRMARDVSSSSGICGI 328
>gi|125525815|gb|EAY73929.1| hypothetical protein OsI_01813 [Oryza sativa Indica Group]
Length = 336
Score = 283 bits (724), Expect = 8e-74, Method: Compositional matrix adjust.
Identities = 144/302 (47%), Positives = 196/302 (64%), Gaps = 14/302 (4%)
Query: 45 IELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT-SYWLGLNEFADMSHEE 103
+++FE WM+K GKTYKC EK HRF IF++N+ I +VT +G+N+FAD++++E
Sbjct: 34 MQMFEEWMAKFGKTYKCHGEKEHRFGIFRDNVHFIRGYKPQVTYDSAVGINQFADLTNDE 93
Query: 104 FKNKYLGLKPQFPTRRQPSAEFSYRDVKAL--PKSVDWRKKGAVTPVKNQGSCGSCWAFS 161
F Y G KP P + P R V + P +DWR +GAVT VK+QG+CGSCWAF+
Sbjct: 94 FVATYTGAKPPHP-KEAP------RPVDPIWTPCCIDWRFRGAVTGVKDQGACGSCWAFA 146
Query: 162 TVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYP 221
VAA+EG+ +I +G LT LSEQEL+DCDT+ +NGC GG D AF+ + + GG+ E DY
Sbjct: 147 AVAAIEGLTKIRTGQLTPLSEQELVDCDTN-SNGCGGGHTDRAFELVASKGGITAESDYR 205
Query: 222 YLMEEGTCE-DKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSG 280
Y +G C D +I GY+ VP NDE+ L A+A QPV+V I+ASG FQFY
Sbjct: 206 YEGFQGKCRVDDMLFNHAASIGGYRAVPPNDERQLATAVARQPVTVYIDASGPAFQFYKS 265
Query: 281 GVFTGPCGAELDHGVAAVGYGK--SKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLC 338
GVF GPCGA +H V VGY + + G Y + KNSWG WG++GYI ++++ +P G C
Sbjct: 266 GVFPGPCGASSNHAVTLVGYCQDGASGKKYWVAKNSWGKTWGQQGYILLEKDVLQPHGTC 325
Query: 339 GI 340
G+
Sbjct: 326 GL 327
>gi|194701748|gb|ACF84958.1| unknown [Zea mays]
gi|414589103|tpg|DAA39674.1| TPA: thiol protease SEN102 [Zea mays]
Length = 374
Score = 283 bits (723), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 164/375 (43%), Positives = 215/375 (57%), Gaps = 29/375 (7%)
Query: 1 MAFFSHSKLLLLSLSLSLF--ACSSLAHDFSIVGYSPEHLTSMDK-LIELFESWMSKHGK 57
MA S L + L L++F CSS A G +++ D +IE F+ W + + K
Sbjct: 1 MASSSKGSLPCVLLLLAVFHHGCSS-ARAHRRAGDMERSMSTDDSSMIERFQRWKAAYNK 59
Query: 58 TYKCIEEKLHRFEIFKENLKHIDQRNKEVT----SYWLGLNEFADMSHEEFKNKYLGLKP 113
+Y + E+ RF + N+ +I+ N E +Y LG + D++++EF Y P
Sbjct: 60 SYATVAEERRRFRVCARNMAYIEATNAEAEAAGLTYELGETAYTDLTNQEFMAMYTAPAP 119
Query: 114 -QFP-------TRRQPSAEFS--------YRDVK-ALPKSVDWRKKGAVTPVKNQGSCGS 156
Q P TR P Y ++ + P SVDWR GAVTPVKNQG CGS
Sbjct: 120 AQLPADESVITTRAGPVDAVGGAPGQLPVYVNLSTSAPASVDWRASGAVTPVKNQGRCGS 179
Query: 157 CWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHK 216
CWAFSTVA VEGI QI +G L SLSEQEL+DCDT ++GC+GG+ A ++I ++GG+
Sbjct: 180 CWAFSTVAVVEGIYQIRTGKLVSLSEQELVDCDT-LDDGCDGGISYRALRWIASNGGITT 238
Query: 217 EEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQ 276
E DYPY C K V+I+G + V E SL A+A QPV+V+IEA G +FQ
Sbjct: 239 ETDYPYTGTTDACNRAKLSHNAVSIAGLRRVATRSEASLANAVAGQPVAVSIEAGGDNFQ 298
Query: 277 FYSGGVFTGPCGAELDHGVAAVGYGK--SKGSDYIIVKNSWGPKWGERGYIRMKRNT-GK 333
Y GV+ GPCG L+HGV VGYG+ + G Y IVKNSWG WG+ GYIRMK++ GK
Sbjct: 299 HYKKGVYNGPCGTNLNHGVTVVGYGQEAAGGDRYWIVKNSWGQGWGDDGYIRMKKDVAGK 358
Query: 334 PEGLCGINKMASIPL 348
PEGLCGI S PL
Sbjct: 359 PEGLCGIAIRPSYPL 373
>gi|326430491|gb|EGD76061.1| cathepsin [Salpingoeca sp. ATCC 50818]
Length = 381
Score = 283 bits (723), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 148/301 (49%), Positives = 192/301 (63%), Gaps = 24/301 (7%)
Query: 48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT----SYWLGLNEFADMSHEE 103
F+ + + K Y+ EE+ RF IF +NL I + N E ++ +G+N+FAD+++EE
Sbjct: 20 FDDFKTTFEKQYESPEEEARRFAIFADNLAFIARHNAEAARGLHTHTVGVNQFADLTNEE 79
Query: 104 FKNKYLGLKPQFPTRRQPSAEFSYRDVKAL------PKSVDWRKKGAVTPVKNQGSCGSC 157
++ YL +P +PT E R+ + + SVDWR+KGAVTP+KNQG CGSC
Sbjct: 80 YRQLYL--RP-YPT------ELLGRERQEVWLDGPNAGSVDWRQKGAVTPIKNQGQCGSC 130
Query: 158 WAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHK 216
W+FST +VEG + I +GNL SLSEQ+L+DC SF N GCNGGLMD AFKYI+++GGL
Sbjct: 131 WSFSTTGSVEGAHAIATGNLVSLSEQQLVDCSGSFGNQGCNGGLMDNAFKYIISNGGLDT 190
Query: 217 EEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQ 276
E+DYPY +G C+ KE V+ISGY+DVP+N+E L A+ PVSVAIEA FQ
Sbjct: 191 EQDYPYTARDGVCDKSKESKHAVSISGYKDVPQNNEDQLAAAVEKGPVSVAIEADQQSFQ 250
Query: 277 FYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEG 336
YS GVF+GPCG LDHGV VGY SDY IVKNSWG W RG + EG
Sbjct: 251 MYSSGVFSGPCGTNLDHGVLVVGY----TSDYWIVKNSWGASWVTRGGCHSGEQAVRIEG 306
Query: 337 L 337
+
Sbjct: 307 I 307
>gi|357477225|ref|XP_003608898.1| Cysteine proteinase, partial [Medicago truncatula]
gi|355509953|gb|AES91095.1| Cysteine proteinase, partial [Medicago truncatula]
Length = 260
Score = 283 bits (723), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 140/262 (53%), Positives = 179/262 (68%), Gaps = 22/262 (8%)
Query: 93 LNEFADMSHEEFKNKYLGLKPQ----FPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPV 148
LN+FADM++ EF++ Y K F + F Y +V+ +P S+DWRK GAVT V
Sbjct: 2 LNKFADMTNYEFRSIYADSKVNHHRMFRGMSHDNGPFMYENVEGVPSSIDWRKIGAVTGV 61
Query: 149 KNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYI 208
K+QG CGSCWAFST+ AVEGINQI + L SLSEQEL+DCDT N GCNGGLM+YAF++I
Sbjct: 62 KDQGQCGSCWAFSTIVAVEGINQIKTQKLVSLSEQELVDCDTEVNQGCNGGLMEYAFEFI 121
Query: 209 VASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAI 268
G+ E +YPY ++GTC +KE V+I G+++VP N+E++LLKA A+QP+SVAI
Sbjct: 122 -KQNGITTETNYPYAAKDGTCNIQKENKPAVSIDGHENVPANNEKALLKAAANQPISVAI 180
Query: 269 EASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMK 328
+A G+DFQFYS GVFTG CG EL+HGV NSWG +WGE+GYIRM+
Sbjct: 181 DAGGSDFQFYSEGVFTGHCGTELNHGV-----------------NSWGSEWGEQGYIRMQ 223
Query: 329 RNTGKPEGLCGINKMASIPLKK 350
R +GLCGI AS P+KK
Sbjct: 224 RAISHKQGLCGIAMEASYPIKK 245
>gi|357507505|ref|XP_003624041.1| Cysteine proteinase [Medicago truncatula]
gi|355499056|gb|AES80259.1| Cysteine proteinase [Medicago truncatula]
Length = 342
Score = 283 bits (723), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 152/309 (49%), Positives = 199/309 (64%), Gaps = 13/309 (4%)
Query: 44 LIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTS-YWLGLNEFADMSHE 102
L E FE W +K+G YK + E+ F+IFK N+ +ID N Y L +N F D E
Sbjct: 38 LSERFEYWKTKYGVVYKDVAEQKKHFQIFKHNVAYIDYFNAAGNKPYKLAINRFVDKPIE 97
Query: 103 EFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFST 162
+ + + + T P+ F Y +V +P +VDWRK+GAVTP+KNQG CGSCWAFS
Sbjct: 98 DSDDGF-----ERTTTTTPTTTFKYENVTDIPATVDWRKRGAVTPIKNQGKCGSCWAFSA 152
Query: 163 VAAVEGINQIVSGNLTSLSEQELIDCDTS-FNNGCNGGLMDYAFKYIVASGGLHKEEDYP 221
VAA+EGI +I SGNL SLSEQ+L+DCD S GC+ G M AFK+I+ +GG+ E +YP
Sbjct: 153 VAAIEGIQKITSGNLVSLSEQQLVDCDRSGRTKGCDNGNMINAFKFILENGGIATEANYP 212
Query: 222 Y-LMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSG 280
Y + +GTC K+ V I Y++VP N E SLLKA+A+QPVSV I+ G F+FYS
Sbjct: 213 YKRVVKGTC---KKVSHKVQIKSYEEVPSNSEDSLLKAVANQPVSVGIDMRGM-FKFYSS 268
Query: 281 GVFTGPCGAELDHGVAAVGYGKSK-GSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCG 339
G+FTG CG + +H + VGYG SK G Y +VKNSW +WGE+GYIR+KR+ EGLCG
Sbjct: 269 GIFTGECGTKPNHALTIVGYGTSKDGIKYWLVKNSWSKRWGEKGYIRIKRDIDAKEGLCG 328
Query: 340 INKMASIPL 348
I S P+
Sbjct: 329 IAMKPSYPI 337
>gi|53791858|dbj|BAD53944.1| putative cysteine protease [Oryza sativa Japonica Group]
Length = 335
Score = 282 bits (722), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 144/302 (47%), Positives = 196/302 (64%), Gaps = 14/302 (4%)
Query: 45 IELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT-SYWLGLNEFADMSHEE 103
+++FE WM+K GKTYKC EK HRF IF++N+ I +VT +G+N+FAD++++E
Sbjct: 33 MQMFEEWMAKFGKTYKCHGEKEHRFGIFRDNVHFIRGYKPQVTYDSAVGINQFADLTNDE 92
Query: 104 FKNKYLGLKPQFPTRRQPSAEFSYRDVKAL--PKSVDWRKKGAVTPVKNQGSCGSCWAFS 161
F Y G KP P + P R V + P +DWR +GAVT VK+QG+CGSCWAF+
Sbjct: 93 FVATYTGAKPPHP-KEAP------RPVDPIWTPCCIDWRFRGAVTGVKDQGACGSCWAFA 145
Query: 162 TVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYP 221
VAA+EG+ +I +G LT LSEQEL+DCDT+ +NGC GG D AF+ + + GG+ E DY
Sbjct: 146 AVAAIEGLTKIRTGQLTPLSEQELVDCDTN-SNGCGGGHTDRAFELVASKGGITAESDYR 204
Query: 222 YLMEEGTCE-DKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSG 280
Y +G C D +I GY+ VP NDE+ L A+A QPV+V I+ASG FQFY
Sbjct: 205 YEGFQGKCRVDDMLFNHAASIGGYRAVPPNDERQLATAVARQPVTVYIDASGPAFQFYKS 264
Query: 281 GVFTGPCGAELDHGVAAVGYGK--SKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLC 338
GVF GPCGA +H V VGY + + G Y + KNSWG WG++GYI ++++ +P G C
Sbjct: 265 GVFPGPCGASSNHAVTLVGYCQDGASGKKYWLAKNSWGKTWGQQGYILLEKDIVQPHGTC 324
Query: 339 GI 340
G+
Sbjct: 325 GL 326
>gi|15290195|dbj|BAB63884.1| putative cysteine protease [Oryza sativa Japonica Group]
gi|125525813|gb|EAY73927.1| hypothetical protein OsI_01811 [Oryza sativa Indica Group]
Length = 342
Score = 282 bits (722), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 150/336 (44%), Positives = 208/336 (61%), Gaps = 19/336 (5%)
Query: 11 LLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFE 70
+L + +L A ++ D S + +T +++FE WM+K GKTYKC EK HRF
Sbjct: 11 VLLVVCTLMALQAMGADAYYNNGSDDGVT-----MQMFEEWMAKFGKTYKCHGEKEHRFG 65
Query: 71 IFKENLKHIDQRNKEVT-SYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRD 129
IF++N+ I +VT +G+N+FAD++++EF Y G KP P + P R
Sbjct: 66 IFRDNVHFIRGYKPQVTYDSAVGINQFADLTNDEFVATYTGAKPPHP-KEAP------RP 118
Query: 130 VKAL--PKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELID 187
V + P +DWR +GAVT VK+QG+CGSCWAF+ VAA+EG+ +I +G LT LSEQEL+D
Sbjct: 119 VDPIWTPCCIDWRFRGAVTGVKDQGACGSCWAFAAVAAIEGLTKIRTGQLTPLSEQELVD 178
Query: 188 CDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCE-DKKEEMEVVTISGYQD 246
CDT+ +NGC GG D AF+ + + GG+ E DY Y +G C D I GY+
Sbjct: 179 CDTN-SNGCGGGHTDRAFELVASKGGITAESDYRYEGFQGKCRVDDMLFNHAARIGGYRA 237
Query: 247 VPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGK--SK 304
VP NDE+ L A+A QPV+V I+ASG FQFY GVF GPCGA +H V VGY + +
Sbjct: 238 VPPNDERQLATAVARQPVTVYIDASGPAFQFYKSGVFPGPCGASSNHAVTLVGYCQDGAS 297
Query: 305 GSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGI 340
G Y + KNSWG WG++GYI ++++ +P G CG+
Sbjct: 298 GKKYWVAKNSWGKTWGQQGYILLEKDVLQPHGTCGL 333
>gi|110737404|dbj|BAF00646.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 345
Score = 282 bits (722), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 141/313 (45%), Positives = 203/313 (64%), Gaps = 11/313 (3%)
Query: 44 LIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT-SYWLGLNEFADMSHE 102
+++ E WM++ + Y+ EK R ++FK+NLK I+ NK+ SY LG+NEFAD ++E
Sbjct: 35 MVDKHEQWMARFSREYRDELEKNMRRDVFKKNLKFIENFNKKGNKSYKLGVNEFADWTNE 94
Query: 103 EFKNKYLGLK------PQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGS 156
EF + GLK P + S++ ++ + +S DWR +GAVTPVK QG CG
Sbjct: 95 EFLAIHTGLKGLTEVSPSKVVAKTISSQ-TWNVSDMVVESKDWRAEGAVTPVKYQGQCGC 153
Query: 157 CWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHK 216
CWAFS VAAVEG+ +I GNL SLSEQ+L+DCD ++ C+GG+M AF Y+V + G+
Sbjct: 154 CWAFSAVAAVEGVAKIAGGNLVSLSEQQLLDCDREYDRDCDGGIMSDAFNYVVQNRGIAS 213
Query: 217 EEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQ 276
E DY Y +G C + ISG+Q VP N+E++LL+A++ QPVSV+++A+G F
Sbjct: 214 ENDYSYQGSDGGC--RSNARPAARISGFQTVPSNNERALLEAVSRQPVSVSMDATGDGFM 271
Query: 277 FYSGGVFTGPCGAELDHGVAAVGYGKSK-GSDYIIVKNSWGPKWGERGYIRMKRNTGKPE 335
YSGGV+ GPCG +H V VGYG S+ G+ Y + KNSWG W E+GYIR++R+ P+
Sbjct: 272 HYSGGVYDGPCGTSSNHAVTFVGYGTSQDGTKYWLAKNSWGETWEEKGYIRIRRDVAWPQ 331
Query: 336 GLCGINKMASIPL 348
G+CG+ + A P+
Sbjct: 332 GMCGVAQYAFYPV 344
>gi|125525812|gb|EAY73926.1| hypothetical protein OsI_01810 [Oryza sativa Indica Group]
Length = 319
Score = 282 bits (721), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 144/302 (47%), Positives = 196/302 (64%), Gaps = 14/302 (4%)
Query: 45 IELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT-SYWLGLNEFADMSHEE 103
+++FE WM+K GKTYKC EK HRF IF++N+ I +VT +G+N+FAD++++E
Sbjct: 17 MQMFEEWMAKFGKTYKCHGEKEHRFGIFRDNVHFIRGYKPQVTYDSAVGINQFADLTNDE 76
Query: 104 FKNKYLGLKPQFPTRRQPSAEFSYRDVKAL--PKSVDWRKKGAVTPVKNQGSCGSCWAFS 161
F Y G KP P + P R V + P +DWR +GAVT VK+QG+CGSCWAF+
Sbjct: 77 FVATYTGAKPPHP-KEAP------RPVDPIWTPCCIDWRFRGAVTGVKDQGACGSCWAFA 129
Query: 162 TVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYP 221
VAA+EG+ +I +G LT LSEQEL+DCDT+ +NGC GG D AF+ + + GG+ E DY
Sbjct: 130 AVAAIEGLTKIRTGQLTPLSEQELVDCDTN-SNGCGGGHTDRAFELVASKGGITAESDYR 188
Query: 222 YLMEEGTCE-DKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSG 280
Y +G C D +I GY+ VP NDE+ L A+A QPV+V I+ASG FQFY
Sbjct: 189 YEGFQGKCRVDDMLFNHAASIGGYRAVPPNDERQLATAVARQPVTVYIDASGPAFQFYKS 248
Query: 281 GVFTGPCGAELDHGVAAVGYGK--SKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLC 338
GVF GPCGA +H V VGY + + G Y + KNSWG WG++GYI ++++ +P G C
Sbjct: 249 GVFPGPCGASSNHAVTLVGYCQDGASGKKYWVAKNSWGKTWGQQGYILLEKDVLQPHGTC 308
Query: 339 GI 340
G+
Sbjct: 309 GL 310
>gi|3377948|emb|CAA08860.1| cysteine proteinase precursor, AN8 [Ananas comosus]
Length = 356
Score = 281 bits (720), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 143/330 (43%), Positives = 202/330 (61%), Gaps = 16/330 (4%)
Query: 16 LSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKEN 75
L LF C A + P D +++ FE WM ++G+ YK +EK+ RF+IFK N
Sbjct: 10 LFLFLCVMWASPSAASADEPS-----DPMMKRFEEWMVEYGRVYKDNDEKMRRFQIFKNN 64
Query: 76 LKHI---DQRNKEVTSYWLGLNEFADMSHEEFKNKYLG-LKPQFPTRRQPSAEFSYRDVK 131
+ HI + RNK+ SY LG+N+F DM++ EF +Y G + R+P F D+
Sbjct: 65 VNHIETFNSRNKD--SYTLGINQFTDMTNNEFVAQYTGGISRPLNIEREPVVSFDDVDIS 122
Query: 132 ALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTS 191
A+P+S+DWR GAVT VKNQ CG+CWAF+ +A VE I +I G L LSEQ+++DC
Sbjct: 123 AVPQSIDWRDYGAVTSVKNQNPCGACWAFAAIATVESIYKIKKGILEPLSEQQVLDCAKG 182
Query: 192 FNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPEND 251
+ GC GG AF++I+++ G+ YPY +GTC+ I+GY VP N+
Sbjct: 183 Y--GCKGGWEFRAFEFIISNKGVASVAIYPYKAAKGTCKTNGVPNSAY-ITGYARVPRNN 239
Query: 252 EQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGK-SKGSDYII 310
E S++ A++ QP++VA++A+ Q+Y+ GVF GPCG L+H V A+GYG+ S G Y I
Sbjct: 240 ESSMMYAVSKQPITVAVDANANS-QYYNSGVFNGPCGTSLNHAVTAIGYGQDSNGKKYWI 298
Query: 311 VKNSWGPKWGERGYIRMKRNTGKPEGLCGI 340
VKNSWG +WGE GYIRM R+ G+CGI
Sbjct: 299 VKNSWGARWGEAGYIRMARDVSSSSGICGI 328
>gi|66735056|gb|AAY53767.1| cysteine protease [Saprolegnia parasitica]
Length = 523
Score = 281 bits (720), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 146/305 (47%), Positives = 195/305 (63%), Gaps = 8/305 (2%)
Query: 48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTS-YWLGLNEFADMSHEEFKN 106
F SWM K +E +HRFE+F N + I+ NK+ +S + +G NE++ ++ +EFK
Sbjct: 28 FLSWMKKFAVKLNPLEW-VHRFEVFILNDQRIEAHNKDASSSFTMGHNEYSHLTFDEFKK 86
Query: 107 KYLGLKPQFPTRRQPSAEFSYR----DVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFST 162
GL+ P+ Q A+++ ++ +P +DW ++G VTPVKNQG CGSCWAFST
Sbjct: 87 LRTGLRVS-PSYIQSRAKYALMAPAVNMTDVPNEMDWVEQGGVTPVKNQGMCGSCWAFST 145
Query: 163 VAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPY 222
A+EG + S L S+SEQEL+DCD + + GCNGGLMD AFK++ GL KEEDYPY
Sbjct: 146 TGAIEGAAFVSSKQLVSVSEQELVDCDHNGDMGCNGGLMDNAFKWVKTHKGLCKEEDYPY 205
Query: 223 LMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGV 282
+EGTC KK + V ++ + DVP NDEQ+L A+A QPVSVAIEA +FQFY GV
Sbjct: 206 HAKEGTCALKKCK-PVTKVTAFHDVPANDEQALKAAVAKQPVSVAIEADQPEFQFYKSGV 264
Query: 283 FTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINK 342
F CG +LDHGV VGYG+ G Y VKNSWG WG++GYI++ R G G CG+
Sbjct: 265 FDKSCGTKLDHGVLVVGYGEEGGKKYWKVKNSWGADWGDKGYIKLAREFGPETGQCGVAM 324
Query: 343 MASIP 347
+ S P
Sbjct: 325 VPSYP 329
>gi|125570286|gb|EAZ11801.1| hypothetical protein OsJ_01675 [Oryza sativa Japonica Group]
Length = 319
Score = 281 bits (720), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 144/302 (47%), Positives = 196/302 (64%), Gaps = 14/302 (4%)
Query: 45 IELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT-SYWLGLNEFADMSHEE 103
+++FE WM+K GKTYKC EK HRF IF++N+ I +VT +G+N+FAD++++E
Sbjct: 17 MQMFEEWMAKFGKTYKCHGEKEHRFGIFRDNVHFIRGYKPQVTYDSAVGINQFADLTNDE 76
Query: 104 FKNKYLGLKPQFPTRRQPSAEFSYRDVKAL--PKSVDWRKKGAVTPVKNQGSCGSCWAFS 161
F Y G KP P + P R V + P +DWR +GAVT VK+QG+CGSCWAF+
Sbjct: 77 FVATYTGAKPPHP-KEAP------RPVDPIWTPCCIDWRFRGAVTGVKDQGACGSCWAFA 129
Query: 162 TVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYP 221
VAA+EG+ +I +G LT LSEQEL+DCDT+ +NGC GG D AF+ + + GG+ E DY
Sbjct: 130 AVAAIEGLTKIRTGQLTPLSEQELVDCDTN-SNGCGGGHTDRAFELVASKGGITAESDYR 188
Query: 222 YLMEEGTCE-DKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSG 280
Y +G C D +I GY+ VP NDE+ L A+A QPV+V I+ASG FQFY
Sbjct: 189 YEGFQGKCRVDDMLFNHAASIGGYRAVPPNDERQLATAVARQPVTVYIDASGPAFQFYKS 248
Query: 281 GVFTGPCGAELDHGVAAVGYGK--SKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLC 338
GVF GPCGA +H V VGY + + G Y + KNSWG WG++GYI ++++ +P G C
Sbjct: 249 GVFPGPCGASSNHAVTLVGYCQDGASGKKYWLAKNSWGKTWGQQGYILLEKDIVQPHGTC 308
Query: 339 GI 340
G+
Sbjct: 309 GL 310
>gi|112490572|pdb|2FO5|A Chain A, Crystal Structure Of Recombinant Barley Cysteine
Endoprotease B Isoform 2 (Ep-B2) In Complex With
Leupeptin
gi|112490573|pdb|2FO5|B Chain B, Crystal Structure Of Recombinant Barley Cysteine
Endoprotease B Isoform 2 (Ep-B2) In Complex With
Leupeptin
gi|112490574|pdb|2FO5|C Chain C, Crystal Structure Of Recombinant Barley Cysteine
Endoprotease B Isoform 2 (Ep-B2) In Complex With
Leupeptin
gi|112490575|pdb|2FO5|D Chain D, Crystal Structure Of Recombinant Barley Cysteine
Endoprotease B Isoform 2 (Ep-B2) In Complex With
Leupeptin
Length = 262
Score = 281 bits (719), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 140/224 (62%), Positives = 165/224 (73%), Gaps = 4/224 (1%)
Query: 130 VKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCD 189
V LP SVDWR+KGAVT VK+QG CGSCWAFSTV +VEGIN I +G+L SLSEQELIDCD
Sbjct: 1 VSDLPPSVDWRQKGAVTGVKDQGKCGSCWAFSTVVSVEGINAIRTGSLVSLSEQELIDCD 60
Query: 190 TSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEME---VVTISGYQD 246
T+ N+GC GGLMD AF+YI +GGL E YPY GTC + VV I G+QD
Sbjct: 61 TADNDGCQGGLMDNAFEYIKNNGGLITEAAYPYRAARGTCNVARAAQNSPVVVHIDGHQD 120
Query: 247 VPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSK-G 305
VP N E+ L +A+A+QPVSVA+EASG F FYS GVFTG CG ELDHGVA VGYG ++ G
Sbjct: 121 VPANSEEDLARAVANQPVSVAVEASGKAFMFYSEGVFTGECGTELDHGVAVVGYGVAEDG 180
Query: 306 SDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
Y VKNSWGP WGE+GYIR+++++G GLCGI AS P+K
Sbjct: 181 KAYWTVKNSWGPSWGEQGYIRVEKDSGASGGLCGIAMEASYPVK 224
>gi|320169658|gb|EFW46557.1| cathepsin L2 [Capsaspora owczarzaki ATCC 30864]
Length = 324
Score = 281 bits (719), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 144/306 (47%), Positives = 193/306 (63%), Gaps = 9/306 (2%)
Query: 48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNK 107
F+SW + HG +Y + E+ R I++ NL I++ N E SY L +N+FAD+++ EF K
Sbjct: 22 FDSWKATHGVSYATVGEETARRGIYRANLDFIEKHNSEGHSYKLAVNKFADLTYPEFAAK 81
Query: 108 YLGLKPQFPTRRQPSAEFSY-RDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAV 166
YLGL+ + A +Y + +LP SVDWR G VTP+K+QG CGSCW+FST +V
Sbjct: 82 YLGLRFDATNATKSFAASTYLPRMVSLPDSVDWRTAGIVTPIKDQGQCGSCWSFSTTGSV 141
Query: 167 EGINQIVSGNLTSLSEQELIDCDTSFNN-GCNGGLMDYAFKYIVASGGLHKEEDYPYLME 225
EG + +G L SLSEQ L+DC ++ N GCNGGLMD AF+YI+++ G+ E YPY +
Sbjct: 142 EGQHARKTGQLVSLSEQNLVDCSSAQGNAGCNGGLMDQAFQYIISNNGIDTESSYPYTAQ 201
Query: 226 EGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAIEASGTDFQFYSGGVFT 284
+GTC+ + T++ YQD+ E L A+A P+SVAI+AS FQFYS GV+
Sbjct: 202 DGTCQFNSANVG-ATVASYQDIASGSESDLQNAVATVGPISVAIDASQPSFQFYSSGVYN 260
Query: 285 GPC--GAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINK 342
P ++LDHGV AVGYG S SDY +VKNSWG WG+ GYI M RN+ CGI
Sbjct: 261 EPACSSSQLDHGVLAVGYGTSGSSDYWLVKNSWGTSWGQSGYIWMTRNSNNQ---CGIAT 317
Query: 343 MASIPL 348
AS PL
Sbjct: 318 AASYPL 323
>gi|226499884|ref|NP_001148278.1| thiol protease SEN102 precursor [Zea mays]
gi|195617112|gb|ACG30386.1| thiol protease SEN102 precursor [Zea mays]
Length = 374
Score = 281 bits (718), Expect = 5e-73, Method: Compositional matrix adjust.
Identities = 151/329 (45%), Positives = 196/329 (59%), Gaps = 25/329 (7%)
Query: 44 LIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT----SYWLGLNEFADM 99
+IE F+ W + + K+Y + E+ RF ++ N+ +I+ N E +Y LG + D+
Sbjct: 46 MIERFQRWKAAYNKSYATVAEERRRFRVYARNMAYIEATNAEAEAAGLTYELGETAYTDL 105
Query: 100 SHEEFKNKYLGLK-PQFP-------TRRQPSAEFS--------YRDVKA-LPKSVDWRKK 142
+++EF Y Q P TR P Y ++ A P SVDWR
Sbjct: 106 TNQEFMAMYTAPALAQLPADESVITTRAGPVDAVGGAPGQLPVYVNLSASAPASVDWRAS 165
Query: 143 GAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMD 202
GAVTPVKNQG CGSCWAFSTVA VEGI QI +G L SLSEQEL+DCDT ++GC+GG+
Sbjct: 166 GAVTPVKNQGRCGSCWAFSTVAVVEGIYQIRTGKLVSLSEQELVDCDT-LDDGCDGGISY 224
Query: 203 YAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQ 262
A ++I ++GG+ E DYPY C K V+I+G + V E SL A+A Q
Sbjct: 225 RALRWIASNGGITTEADYPYTGTTDACNRAKLSHNAVSIAGLRRVATRSEASLANAVAGQ 284
Query: 263 PVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGK--SKGSDYIIVKNSWGPKWG 320
PV+V+IEA G +FQ Y GV+ GPCG L+HGV VGYG+ + G Y IVKNSWG WG
Sbjct: 285 PVAVSIEAGGDNFQHYKKGVYNGPCGTNLNHGVTVVGYGQEAAAGDRYWIVKNSWGQGWG 344
Query: 321 ERGYIRMKRNT-GKPEGLCGINKMASIPL 348
+ GYIRMK++ GKPEGLCGI S PL
Sbjct: 345 DDGYIRMKKDVAGKPEGLCGIAIRPSYPL 373
>gi|194719810|emb|CAR31335.1| pro-asclepain f [Gomphocarpus fruticosus subsp. fruticosus]
Length = 340
Score = 280 bits (716), Expect = 6e-73, Method: Compositional matrix adjust.
Identities = 155/353 (43%), Positives = 217/353 (61%), Gaps = 26/353 (7%)
Query: 7 SKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKL 66
S +L+LS L + A + ++ ++ S D++I L+E W+ KH K Y + EK+
Sbjct: 3 SFVLILSFLLFVSAITCISTNWR----------SDDEVIALYEEWLVKHQKLYSSLGEKI 52
Query: 67 HRFEIFKENLKHIDQRNK----EVTSYWLGLNEFADMSHEEFKNKYLGLKPQF------- 115
RFEIFK+NL++IDQ+N ++ LGLN+FAD++ +EF + YLG +
Sbjct: 53 KRFEIFKDNLRYIDQQNHYNKVNHMNFTLGLNQFADLTLDEFSSIYLGTSVDYEQIISSN 112
Query: 116 PTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSG 175
P + DV LP SVDWR+KG V P++NQG CGSCW FS VA++E +N I G
Sbjct: 113 PNHDDVEEDILKEDVVELPDSVDWREKGVVFPIRNQGKCGSCWTFSAVASIETLNGIKKG 172
Query: 176 NLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEE 235
++ +LSEQEL+DC+T + GC GG + AF Y VA G+ EE YPY+ +G C K++
Sbjct: 173 HMIALSEQELLDCET-ISQGCKGGHYNNAFAY-VAKNGITSEEKYPYIFRQGQCYQKEK- 229
Query: 236 MEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGV 295
VV ISGY+ VP N+ L A+A Q VSVA++ DFQFY G+F+G CG LDH V
Sbjct: 230 --VVKISGYKRVPRNNGGQLQSAVAQQVVSVAVKCESKDFQFYDRGIFSGACGPILDHAV 287
Query: 296 AAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPL 348
VGYG G++Y I++NSWG WGE GY+R+++N+ EG CGI S P+
Sbjct: 288 NIVGYGSKGGANYWIMRNSWGTNWGENGYMRIQKNSKHYEGHCGIAMQPSYPV 340
>gi|449530091|ref|XP_004172030.1| PREDICTED: vignain-like [Cucumis sativus]
Length = 351
Score = 280 bits (716), Expect = 7e-73, Method: Compositional matrix adjust.
Identities = 143/352 (40%), Positives = 210/352 (59%), Gaps = 16/352 (4%)
Query: 8 KLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLH 67
K L++ L L F C+ + F + + S L++L++ W S H + + E +
Sbjct: 5 KFLIVPLVLIAFLCN-ICESFEL---ERKDFESEKSLMQLYKRWSSHH-RISRNANEMHN 59
Query: 68 RFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAE--- 124
RF++FK N KH+ + N S L LN+FADMS +EF+N Y + E
Sbjct: 60 RFKVFKNNAKHVFKVNLMGKSLKLKLNQFADMSDDEFRNMYSSNITYYKDLHAKKIEATG 119
Query: 125 -----FSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTS 179
F Y +P S+DWRKKGAV +KNQG CGSCWAF+ VAAVE I+QI + L S
Sbjct: 120 GRIGGFMYEHANNIPSSIDWRKKGAVNAIKNQGRCGSCWAFAAVAAVESIHQIKTNELVS 179
Query: 180 LSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVV 239
LSE+E++DCD + GC GG + AF++++ + G+ E++YPY G C + + V
Sbjct: 180 LSEEEVLDCDYR-DGGCRGGFYNSAFEFMMDNDGVTIEDNYPYYEGNGYCRRRGGRNKRV 238
Query: 240 TISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGP--CGAELDHGVAA 297
I GY++VP N+E +L+KA+AHQPV+VAI + G+DF+FY GG+FT CG +DH V
Sbjct: 239 RIDGYENVPRNNEYALMKAVAHQPVAVAIASGGSDFKFYGGGMFTENDFCGFNIDHTVVV 298
Query: 298 VGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
VGYG + DY I++N +G +WG GY++M+R P+G+CG+ + P+K
Sbjct: 299 VGYGTDEDGDYWIIRNQYGHRWGMNGYMKMQRGAHSPQGVCGMAMQPAYPVK 350
>gi|303283194|ref|XP_003060888.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226457239|gb|EEH54538.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 422
Score = 279 bits (714), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 154/316 (48%), Positives = 199/316 (62%), Gaps = 15/316 (4%)
Query: 48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT----SYWLGLNEFADMSHEE 103
F+ W++ HGK Y C +E+ R IF +N + + N+ S+WL LN AD++ EE
Sbjct: 70 FDRWLATHGKAYACPKERAKRLAIFADNAEFVRVHNEAHAAGKKSHWLRLNHLADLTREE 129
Query: 104 FKNK--YLGLKPQFPTRRQP--SAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWA 159
FK+ Y K + + P +A + Y DV P+++DW +GAVTPVKNQG CGSCWA
Sbjct: 130 FKHMLGYDASKKRVESSSPPVDAANWEYADVTP-PETMDWVSRGAVTPVKNQGQCGSCWA 188
Query: 160 FSTVAAVEGINQIVSGNLTSLSEQELIDC-DTSFNNGCNGGLMDYAFKYIVASGGLHKEE 218
FSTV AVEG+ + +G+L SLSEQEL+ C NNGC GGLMD F++IV + G+ EE
Sbjct: 189 FSTVGAVEGVVAVKTGDLISLSEQELVSCAKIGGNNGCKGGLMDNGFEWIVENRGVDDEE 248
Query: 219 DYPYLMEEGTCE-DKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQF 277
D+ YL ++ C KK + +I G++DVP NDE +L KA++ QPV+VAIEA +FQ
Sbjct: 249 DWGYLAKDRRCNWFKKRRAKAASIDGFKDVPRNDEDALKKAVSQQPVAVAIEADHREFQL 308
Query: 278 YSGGVFTGPCGAELDHGVAAVGY---GKSKG-SDYIIVKNSWGPKWGERGYIRMKRNTGK 333
YSGGVF G CG LDHGV VGY G+S G Y VKNSWG KWGE GYIR+ R
Sbjct: 309 YSGGVFDGECGTNLDHGVLVVGYGYDGESAGHKHYWTVKNSWGAKWGEEGYIRIARGGMG 368
Query: 334 PEGLCGINKMASIPLK 349
P G CG+ AS P K
Sbjct: 369 PAGQCGVAMQASYPTK 384
>gi|330803818|ref|XP_003289899.1| hypothetical protein DICPUDRAFT_154350 [Dictyostelium purpureum]
gi|325080010|gb|EGC33584.1| hypothetical protein DICPUDRAFT_154350 [Dictyostelium purpureum]
Length = 326
Score = 279 bits (713), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 155/342 (45%), Positives = 205/342 (59%), Gaps = 24/342 (7%)
Query: 9 LLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHR 68
+L L + C S A FS Y F++WM KH K+Y +E R
Sbjct: 4 VLALIFCFLIINCCSAARIFSQKQYQTA-----------FQNWMVKHQKSYT-NDEFGSR 51
Query: 69 FEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYR 128
+ +F++N+ + + N++ ++ LGLN AD+++EEFK YLG K +++ +
Sbjct: 52 YSVFQDNMDIVAKWNQKGSNTILGLNVMADLTNEEFKKLYLGTKANVTYKKK-----TLV 106
Query: 129 DVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDC 188
V LP SVDWR GAVT VKNQG CG C+AFST +VEGI++I S L LSEQ+++DC
Sbjct: 107 GVSGLPASVDWRANGAVTAVKNQGQCGGCYAFSTTGSVEGIHEITSQQLVPLSEQQILDC 166
Query: 189 DTS-FNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDV 247
S NNGC+GGLM +F+YI+A GGL E YPY E G C+ K+ + TI+GY++V
Sbjct: 167 SGSEGNNGCDGGLMTNSFEYIIAVGGLDTEASYPYTGEVGKCKFNKKNIG-ATITGYKNV 225
Query: 248 PENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGP-CGA-ELDHGVAAVGYGKSKG 305
E L A+A QPVSVAI+AS + FQ Y+ GV+ P C + +LDHGV AVGYG G
Sbjct: 226 ESGSESDLQTAVAAQPVSVAIDASQSSFQLYASGVYYEPECSSTQLDHGVLAVGYGSQSG 285
Query: 306 SDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
DY IVKNSWG WGE G+I M RN + CGI MAS P
Sbjct: 286 QDYWIVKNSWGADWGENGFILMARN---KDNNCGIATMASFP 324
>gi|226504984|ref|NP_001151293.1| cysteine protease 1 precursor [Zea mays]
gi|195645596|gb|ACG42266.1| cysteine protease 1 precursor [Zea mays]
Length = 340
Score = 278 bits (712), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 143/305 (46%), Positives = 194/305 (63%), Gaps = 8/305 (2%)
Query: 49 ESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT-SYWLGLNEFADMSHEEFKNK 107
E WM++HG+ YK EK R E+F+ N + ID N T S+ L N FAD++ +EF+
Sbjct: 39 EKWMAEHGRAYKDEAEKARRLEVFRANAELIDSFNAAGTHSHRLATNRFADLTVQEFRAA 98
Query: 108 YLGLKPQFPTRRQPSAEFSYRD--VKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAA 165
GL+P+ P + F Y + + +SVDWR GAVT VK+QG+ G CWAFS VAA
Sbjct: 99 RTGLRPR-PAPSAGAGRFRYENFSLADAAQSVDWRAMGAVTGVKDQGASGCCWAFSAVAA 157
Query: 166 VEGINQIVSGNLTSLSEQELIDCDTS-FNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLM 224
VEG+N+I +G L SLSEQEL+DCD S + GC+GGLMD AF+++ GGL E YPY
Sbjct: 158 VEGLNKIRTGRLVSLSEQELVDCDVSGVDQGCDGGLMDNAFQFVARRGGLASESGYPYQC 217
Query: 225 EEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFT 284
+G C +I G++DVP N+E +L A+AHQPVSVAI F+FY GV
Sbjct: 218 RDGPCR-SSAAAAAASIRGHEDVPRNNEAALAAAVAHQPVSVAINGEDMAFRFYDSGVLG 276
Query: 285 GPCGAELDHGVAAVGYGKSK-GSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKM 343
G CG +L+H + AVGYG + G+ Y ++KNSWG WGE GY+R++R + EG+CG+ K+
Sbjct: 277 GACGTDLNHAITAVGYGTAADGTRYWLMKNSWGASWGEGGYVRIRRGV-RGEGVCGLAKL 335
Query: 344 ASIPL 348
S P+
Sbjct: 336 PSYPV 340
>gi|302790828|ref|XP_002977181.1| hypothetical protein SELMODRAFT_106402 [Selaginella moellendorffii]
gi|300155157|gb|EFJ21790.1| hypothetical protein SELMODRAFT_106402 [Selaginella moellendorffii]
Length = 337
Score = 278 bits (711), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 134/304 (44%), Positives = 202/304 (66%), Gaps = 8/304 (2%)
Query: 47 LFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEV-TSYWLGLNEFADMSHEEFK 105
+FE W +KHGK+Y EK R IF + L +I++ N + T++ LGLN+F+D+++ EF+
Sbjct: 36 MFEDWAAKHGKSYSSDWEKARRLMIFSDTLAYIEKHNAQPNTTFTLGLNKFSDLTNAEFR 95
Query: 106 NKYLGL--KPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTV 163
++G +P++ R AE DV +LP S+DWR+KGAVTP+K+QG CGSCWAFS +
Sbjct: 96 AMHVGKFKRPRYQDRLP--AEDEDVDVSSLPTSLDWRQKGAVTPIKDQGDCGSCWAFSAI 153
Query: 164 AAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYL 223
A++E + + + L SLSEQ+L+DCDT + GC+GGLM+ AFK++V +GG+ E YPY
Sbjct: 154 ASIESAHFLATKELVSLSEQQLMDCDT-VDAGCDGGLMETAFKFVVKNGGVTTEAAYPYT 212
Query: 224 MEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVF 283
G+C K + +V I+G++ V E+ +L+KA++ PV+V+I S +FQ Y G+
Sbjct: 213 GSVGSCNANKAKNKVAEITGFKVVTEDSADALMKAVSKTPVTVSICGSDENFQNYKSGIL 272
Query: 284 TGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKM 343
+G C LDHGV +GYG G Y I+KNSWG WGE G+++++R G +G+CG+N
Sbjct: 273 SGKCDDSLDHGVLLIGYGTEGGMPYWIIKNSWGTSWGEDGFMKIERKDG--DGMCGMNGD 330
Query: 344 ASIP 347
+S P
Sbjct: 331 SSYP 334
>gi|330803820|ref|XP_003289900.1| hypothetical protein DICPUDRAFT_80649 [Dictyostelium purpureum]
gi|325080011|gb|EGC33585.1| hypothetical protein DICPUDRAFT_80649 [Dictyostelium purpureum]
Length = 328
Score = 278 bits (711), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 154/342 (45%), Positives = 201/342 (58%), Gaps = 22/342 (6%)
Query: 9 LLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHR 68
+L L + C S A FS Y F++WM KH K+Y +E R
Sbjct: 4 ILALVFCFLIVNCISAARVFSQKQYQTA-----------FQNWMVKHQKSYTN-DEFGSR 51
Query: 69 FEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYR 128
+ IF++N+ + + N++ + LGLN AD++++E++ YLG K T ++P+
Sbjct: 52 YTIFQDNMDFVTKWNQKGSDTILGLNSMADLTNQEYQRIYLGTKT---TVKKPNLIIGVT 108
Query: 129 DVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDC 188
DV P SVDWR GAVT VKNQG CG C++FST +VEGI++I S L SLSEQ+++DC
Sbjct: 109 DVSKAPASVDWRANGAVTAVKNQGQCGGCYSFSTTGSVEGIHEITSKQLVSLSEQQILDC 168
Query: 189 DTS-FNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDV 247
S NNGC+GGLM +F+YI+A GGL E YPY G C+ K + TI+GY++V
Sbjct: 169 SGSEGNNGCDGGLMTNSFEYIIAVGGLDTEASYPYEGVVGKCKFNKANIG-ATITGYKNV 227
Query: 248 PENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPC--GAELDHGVAAVGYGKSKG 305
E L A+A QPVSVAI+AS FQ YS GV+ P +LDHGV AVGYG G
Sbjct: 228 KSGSESDLQTAVAAQPVSVAIDASQNSFQLYSSGVYYEPACSSTQLDHGVLAVGYGSQSG 287
Query: 306 SDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
DY IVKNSWG WGE+G+I M RN CGI MAS P
Sbjct: 288 QDYWIVKNSWGADWGEKGFILMARN---KHNNCGIATMASYP 326
>gi|38345188|emb|CAE03344.2| OSJNBb0005B05.11 [Oryza sativa Japonica Group]
gi|125589403|gb|EAZ29753.1| hypothetical protein OsJ_13812 [Oryza sativa Japonica Group]
Length = 323
Score = 278 bits (710), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 146/354 (41%), Positives = 211/354 (59%), Gaps = 43/354 (12%)
Query: 4 FSHSKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIE 63
+ +K LL ++ L CS++ + L+ + E WM+++G+ YK
Sbjct: 1 MAMAKALLFAILGCLCLCSAV--------LAARELSDDAAMAARHERWMAQYGRMYKDDA 52
Query: 64 EKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYL--GLKPQFPTRRQP 121
EK RFE+FK N+ I+ N +WLG+N+FAD++++EF++ G P T R P
Sbjct: 53 EKARRFEVFKANVAFIESFNAGNHKFWLGVNQFADLTNDEFRSTKTNKGFIPS--TTRVP 110
Query: 122 SAEFSYRD----VKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNL 177
+ +R+ + ALP ++DWR KG VTP+K+QG CG CWAFS VAA+E
Sbjct: 111 TG---FRNENVNIDALPATMDWRTKGVVTPIKDQGQCGCCWAFSAVAAME---------- 157
Query: 178 TSLSEQELIDCDT-SFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEM 236
EL+DCD + GC GGLMD AFK+I+ +GGL E +YPY +DK + +
Sbjct: 158 ------ELVDCDVHGEDQGCEGGLMDDAFKFIIKNGGLTTESNYPY----AAVDDKFKSV 207
Query: 237 E--VVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHG 294
V +I GY+DVP N+E +L+KA+A+QPVSVA++ FQFY GGV TG CG +LDHG
Sbjct: 208 SNSVASIKGYEDVPANNEAALMKAVANQPVSVAVDGGDMTFQFYKGGVMTGSCGTDLDHG 267
Query: 295 VAAVGYGK-SKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
+ A+GYGK S G+ Y ++KNSWG WGE G++RM+++ G+CG+ S P
Sbjct: 268 IVAIGYGKASDGTKYWLLKNSWGMTWGENGFLRMEKDISDKRGMCGLAMEPSYP 321
>gi|147769019|emb|CAN62459.1| hypothetical protein VITISV_015168 [Vitis vinifera]
Length = 246
Score = 278 bits (710), Expect = 4e-72, Method: Compositional matrix adjust.
Identities = 141/262 (53%), Positives = 171/262 (65%), Gaps = 23/262 (8%)
Query: 88 SYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTP 147
SY L +NEFAD+++EEF K + S F Y +V A+P + DWRKKGAVTP
Sbjct: 4 SYKLSINEFADLTNEEFGTSRNRFKAHICSTEATS--FKYENVTAVPSTXDWRKKGAVTP 61
Query: 148 VKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFK 206
+K+QG CGSCWAFS VAA+EGI Q+ +G L SLSEQEL+DCDTS + GC G
Sbjct: 62 IKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGEDQGCXGA------- 114
Query: 207 YIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSV 266
+YPY +GTC KK I+GY+DVP N+E++L KA+AHQP++V
Sbjct: 115 ------------NYPYAGTDGTCNRKKAAHPAAKINGYEDVPANNEKALQKAVAHQPIAV 162
Query: 267 AIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKS-KGSDYIIVKNSWGPKWGERGYI 325
AI+A G +FQFYS GVFTG CG ELDHGV AVGYG S G Y +VKNSWG WGE GYI
Sbjct: 163 AIDAGGXEFQFYSSGVFTGQCGTELDHGVXAVGYGTSDDGMKYWLVKNSWGTGWGEEGYI 222
Query: 326 RMKRNTGKPEGLCGINKMASIP 347
RM+R+ EGLCGI AS P
Sbjct: 223 RMQRDVTAKEGLCGIAMQASYP 244
>gi|118145|sp|P20721.1|CYSPL_SOLLC RecName: Full=Low-temperature-induced cysteine proteinase; Flags:
Precursor
gi|806314|gb|AAA66308.1| thiol protease, partial [Solanum lycopersicum]
Length = 346
Score = 278 bits (710), Expect = 4e-72, Method: Compositional matrix adjust.
Identities = 125/218 (57%), Positives = 161/218 (73%)
Query: 132 ALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTS 191
+LP+S+DWR+KG + VK+QGSCGSCWAFS VAA+E IN IV+GNL SLSEQEL+DCD S
Sbjct: 17 SLPESIDWREKGVLVGVKDQGSCGSCWAFSAVAAMESINAIVTGNLISLSEQELVDCDRS 76
Query: 192 FNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPEND 251
+N GC+GGLMDYAF++++ +GG+ EEDYPY G C+ ++ +VV I Y+DVP N+
Sbjct: 77 YNEGCDGGLMDYAFEFVIKNGGIDTEEDYPYKERNGVCDQYRKNAKVVKIDSYEDVPVNN 136
Query: 252 EQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIV 311
E++L KA+AHQPVS+A+EA G DFQ Y G+FTG CG +DHGV GYG G DY IV
Sbjct: 137 EKALQKAVAHQPVSIALEAGGRDFQHYKSGIFTGKCGTAVDHGVVIAGYGTENGMDYWIV 196
Query: 312 KNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
+NSWG E GY+R++RN GLCG+ S P+K
Sbjct: 197 RNSWGANCRENGYLRVQRNVSSSSGLCGLAIEPSYPVK 234
>gi|357160095|ref|XP_003578656.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP2-like
[Brachypodium distachyon]
Length = 377
Score = 277 bits (709), Expect = 5e-72, Method: Compositional matrix adjust.
Identities = 147/334 (44%), Positives = 196/334 (58%), Gaps = 25/334 (7%)
Query: 39 TSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT---SYWLGLNE 95
T + + F+ W ++HG+ Y +E+L R ++ N+++I+ N + +Y LG
Sbjct: 44 TILQTMAPRFQRWKAEHGRAYATRDEELRRLRVYARNVRYIEAANGDPAAGLTYQLGETA 103
Query: 96 FADMSHEEFKNKYLGLKPQFP------------TRRQPSAEFSYRDV------KALPKSV 137
+ D++ +EF Y P T R + + + V P SV
Sbjct: 104 YTDLTADEFTAMYTSPSPVLSAHDDEAAGAMMITTRAGAVDAGGQQVYFNVSTAGAPASV 163
Query: 138 DWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCN 197
DWR KGAVT VKNQG CGSCWAFSTVA VEGI+QI +GNL SLSEQEL+DCDT + GC+
Sbjct: 164 DWRAKGAVTEVKNQGRCGSCWAFSTVAVVEGIHQIRTGNLISLSEQELVDCDT-LDYGCD 222
Query: 198 GGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLK 257
GG+ +A ++I ++GG+ E DYPY ++G C K + ISG+ V E SL
Sbjct: 223 GGVSYHALEWIASNGGIATEADYPYTGKDGACVANKLPLHAAAISGFARVATRSEPSLAN 282
Query: 258 ALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAV--GYGKSKGSDYIIVKNSW 315
A+A QPV+V+IEA G +FQ Y GV+ GPCG L+HGV V G + G Y IVKNSW
Sbjct: 283 AVAAQPVAVSIEAGGANFQHYVKGVYNGPCGTRLNHGVTVVGYGEEEGDGEKYWIVKNSW 342
Query: 316 GPKWGERGYIRMKRNT-GKPEGLCGINKMASIPL 348
G KWG+ GY RMK++ GKPEGLCGI S PL
Sbjct: 343 GKKWGDGGYFRMKKDVAGKPEGLCGIAIRPSFPL 376
>gi|302763831|ref|XP_002965337.1| hypothetical protein SELMODRAFT_230602 [Selaginella moellendorffii]
gi|300167570|gb|EFJ34175.1| hypothetical protein SELMODRAFT_230602 [Selaginella moellendorffii]
Length = 343
Score = 277 bits (709), Expect = 5e-72, Method: Compositional matrix adjust.
Identities = 135/306 (44%), Positives = 203/306 (66%), Gaps = 10/306 (3%)
Query: 47 LFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEV-TSYWLGLNEFADMSHEEFK 105
+FE W +KHGK+Y EK R IF + L +I++ N + T++ LGLN+F+D+++ EF+
Sbjct: 40 MFEDWAAKHGKSYSSDLEKARRLMIFSDTLAYIEKHNAQPNTTFTLGLNKFSDLTNAEFR 99
Query: 106 NKYLGL--KPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTV 163
++G +P++ R AE DV +LP S+DWR+KGAVTP+K+QG CGSCWAFS +
Sbjct: 100 AMHVGKFKRPRYQDRLP--AEDEDVDVSSLPTSLDWRQKGAVTPIKDQGDCGSCWAFSAI 157
Query: 164 AAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYL 223
A++E + + + L SLSEQ+L+DCDT + GC+GGLM+ AFK++V +GG+ E YPY
Sbjct: 158 ASIESAHFLATKELVSLSEQQLMDCDT-VDAGCDGGLMETAFKFVVKNGGVTTEASYPYT 216
Query: 224 MEEGTCEDKKEEM--EVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGG 281
G+C K + +V I+G++ V E+ +L+KA++ PV+V+I S +FQ Y G
Sbjct: 217 GSVGSCNANKVAIINKVAEITGFKVVTEDSADALMKAVSKTPVTVSICGSDENFQNYKSG 276
Query: 282 VFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGIN 341
+ +G CG LDHGV +GYG G Y I+KNSWG WGE G+++++R G +G+CG+N
Sbjct: 277 ILSGQCGDSLDHGVLLIGYGTEGGMPYWIIKNSWGTSWGEDGFMKIERKDG--DGICGMN 334
Query: 342 KMASIP 347
+S P
Sbjct: 335 GDSSYP 340
>gi|22093636|dbj|BAC06931.1| putative cysteine proteinase [Oryza sativa Japonica Group]
gi|50510021|dbj|BAD30633.1| putative cysteine proteinase [Oryza sativa Japonica Group]
Length = 352
Score = 277 bits (708), Expect = 6e-72, Method: Compositional matrix adjust.
Identities = 157/356 (44%), Positives = 203/356 (57%), Gaps = 22/356 (6%)
Query: 5 SHSKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIEL-FESWMSKHGKTYKCIE 63
+ SKL +++ SL L L+ + + S +E + WM++HG+TYK
Sbjct: 4 TSSKLQVMAASLLLVVAGGLSTMAKVT------MASRAGTMEARHDKWMAEHGRTYKDAA 57
Query: 64 EKLHRFEIFKENLKHIDQRNKEVT-SYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPS 122
EK RF +FK N+ ID+ N Y L N F D++ EF Y G P +
Sbjct: 58 EKARRFRVFKANVDLIDRSNAAGNKRYRLATNRFTDLTDAEFAAMYTGYNPANTMYAAAN 117
Query: 123 A--EFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSL 180
A S D + P VDWR++GAVT VKNQ SCG CWAFSTVAAVEGI+QI +G L SL
Sbjct: 118 ATTRLSSEDDQ-QPAEVDWRQQGAVTGVKNQRSCGCCWAFSTVAAVEGIHQITTGELVSL 176
Query: 181 SEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCE---DKKEEME 237
SEQ+L+DC + N GC GG +D AF+Y+ SGG+ E Y Y +G C+
Sbjct: 177 SEQQLLDC--ADNGGCTGGSLDNAFQYMANSGGVTTEAAYAYQGAQGACQFDASSSASGV 234
Query: 238 VVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTG-PCGAELDHGVA 296
TISGYQ V NDE SL A+A QPVSVAIE SG F+ Y GVFT CG +LDH VA
Sbjct: 235 AATISGYQRVNPNDEGSLAAAVASQPVSVAIEGSGAMFRHYGSGVFTADSCGTKLDHAVA 294
Query: 297 AVGYGK----SKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPL 348
VGYG S G Y I+KNSWG WG+ GY++++++ G +G CG+ S P+
Sbjct: 295 VVGYGAEADGSGGGGYWIIKNSWGTTWGDGGYMKLEKDVGS-QGACGVAMAPSYPV 349
>gi|302790836|ref|XP_002977185.1| hypothetical protein SELMODRAFT_106228 [Selaginella moellendorffii]
gi|300155161|gb|EFJ21794.1| hypothetical protein SELMODRAFT_106228 [Selaginella moellendorffii]
Length = 299
Score = 277 bits (708), Expect = 6e-72, Method: Compositional matrix adjust.
Identities = 135/304 (44%), Positives = 199/304 (65%), Gaps = 10/304 (3%)
Query: 47 LFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEV-TSYWLGLNEFADMSHEEFK 105
+FE W +KHGK+Y EK R IF + L +I++ N + T++ LGLN+F+D+++ EF+
Sbjct: 1 MFEDWAAKHGKSYSSDSEKARRLMIFSDTLAYIEKHNAQPNTTFTLGLNKFSDLTNAEFR 60
Query: 106 NKYLGL--KPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTV 163
Y+G P++ RR P+ + DV +LP S+DWR++GAVTP+K+QG CGSCWAFS +
Sbjct: 61 ANYVGKFKSPRYQDRR-PAKDVDV-DVSSLPTSLDWRQEGAVTPIKDQGQCGSCWAFSAI 118
Query: 164 AAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYL 223
A++E + + + L SLSEQ+LIDCDT + GC GG + AFK++V +GG+ EE YPY
Sbjct: 119 ASIESAHFLATKELVSLSEQQLIDCDT-VDQGCQGGFPEDAFKFVVENGGVTTEEAYPYT 177
Query: 224 MEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVF 283
G+C K + VV I+GY+DV ++ +L+KA++ PV+V I S +FQ Y G+
Sbjct: 178 GFAGSCNANKNK--VVEITGYKDVTKDSADALMKAVSKTPVTVGICGSDQNFQNYRSGIL 235
Query: 284 TGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKM 343
+G C DH V +GYG G Y I+KNSWG WGE G++++K+ G EG+CG+N
Sbjct: 236 SGQCSNSRDHAVLVIGYGTEGGMPYWIIKNSWGTSWGENGFMKIKKKDG--EGMCGMNGQ 293
Query: 344 ASIP 347
+S P
Sbjct: 294 SSYP 297
>gi|356542171|ref|XP_003539543.1| PREDICTED: LOW QUALITY PROTEIN: KDEL-tailed cysteine endopeptidase
CEP2-like [Glycine max]
Length = 342
Score = 276 bits (707), Expect = 8e-72, Method: Compositional matrix adjust.
Identities = 148/342 (43%), Positives = 204/342 (59%), Gaps = 15/342 (4%)
Query: 9 LLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHR 68
L+L +L ++ AC + +D S S + M +ESW+ K+G+ Y+ +E R
Sbjct: 14 LVLCNLWITASACPAKHNDNS----SDSEVMRM-----RYESWLKKYGQKYRNKDEWEFR 64
Query: 69 FEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYR 128
FEI++ N++ I+ N + SY L N+F D+++EEF+ YL +P R F Y+
Sbjct: 65 FEIYRANVQFIEVYNSQNYSYKLMDNKFVDLTNEEFRRMYLVYQP----RSHLQTRFMYQ 120
Query: 129 DVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDC 188
LPK +DWR +GAVT +K+QG CGSCW+FS VA VE IN+I +G L SLSEQ+LIDC
Sbjct: 121 KHGDLPKRIDWRTRGAVTXIKDQGHCGSCWSFSAVATVEDINKIKTGKLVSLSEQQLIDC 180
Query: 189 DT-SFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDV 247
D + N GCNGG M+ F +I GGL +++YPY +G K V I GY+++
Sbjct: 181 DNRNGNEGCNGGHME-TFTFITKRGGLTTDKNYPYQGSDGDXNKAKVRNHAVAICGYENL 239
Query: 248 PENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSD 307
P ++E L A+AHQP SVA +A G FQ YS G F+G CG +L+H + VGYG+ G
Sbjct: 240 PAHNENMLKAAVAHQPASVATDAGGYAFQLYSKGTFSGSCGKDLNHRMTIVGYGEENGEK 299
Query: 308 YIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
Y +VKNSW G GYIRMKR+ +G CG AS P K
Sbjct: 300 YWLVKNSWANDXGVSGYIRMKRDPKDKDGTCGTAMEASYPDK 341
>gi|242048430|ref|XP_002461961.1| hypothetical protein SORBIDRAFT_02g011230 [Sorghum bicolor]
gi|241925338|gb|EER98482.1| hypothetical protein SORBIDRAFT_02g011230 [Sorghum bicolor]
Length = 380
Score = 276 bits (706), Expect = 9e-72, Method: Compositional matrix adjust.
Identities = 149/333 (44%), Positives = 192/333 (57%), Gaps = 29/333 (8%)
Query: 44 LIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT----SYWLGLNEFADM 99
+IE F+ W + + K+Y + E RF ++ N+ +I+ N E +Y LG + D+
Sbjct: 48 MIERFQRWKAAYNKSYATVAEDRRRFLVYARNMAYIEATNAEAEAAGLTYELGETAYTDL 107
Query: 100 SHEEFKNKYLGLK--PQFP--------------TRRQPSAEFSYRDV-----KALPKSVD 138
+++EF Y Q P TR P V A P SVD
Sbjct: 108 TNQEFMAMYTAAPSPAQLPADEDEDDAAEAVITTRAGPVDAVGQLPVYVNLSTAAPASVD 167
Query: 139 WRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNG 198
WR GAVTPVKNQG CGSCWAFSTVA VEGI QI +G L SLSEQEL+DCDT + GC+G
Sbjct: 168 WRASGAVTPVKNQGRCGSCWAFSTVAVVEGIYQIRTGKLVSLSEQELVDCDT-LDAGCDG 226
Query: 199 GLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKA 258
G+ A ++I ++GGL EEDYPY C K +I+G + V E SL A
Sbjct: 227 GISYRALRWITSNGGLTTEEDYPYTGTTDACNRAKLAHNAASIAGLRRVATRSEASLANA 286
Query: 259 LAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSK--GSDYIIVKNSWG 316
+A QPV+V+IEA G +FQ Y GV+ GPCG L+HGV VGYG+ + G Y I+KNSWG
Sbjct: 287 VAGQPVAVSIEAGGDNFQHYKRGVYNGPCGTSLNHGVTVVGYGQEEEDGDKYWIIKNSWG 346
Query: 317 PKWGERGYIRMKRNT-GKPEGLCGINKMASIPL 348
WG+ GYI+M+++ GKPEGLCGI S PL
Sbjct: 347 ASWGDGGYIKMRKDVAGKPEGLCGIAIRPSFPL 379
>gi|242070333|ref|XP_002450443.1| hypothetical protein SORBIDRAFT_05g005530 [Sorghum bicolor]
gi|241936286|gb|EES09431.1| hypothetical protein SORBIDRAFT_05g005530 [Sorghum bicolor]
Length = 351
Score = 276 bits (706), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 157/356 (44%), Positives = 213/356 (59%), Gaps = 26/356 (7%)
Query: 3 FFSHSKLLLLSLSLS-LFACSSLAHDFSI-VGYSPEHLTSMDKLIELFESWMSKHGKTYK 60
+ + LL L+++ C+ A D S GY E +T+ E WM +HG+TYK
Sbjct: 11 LITAAVALLTVLAIANCIGCAVAARDLSSSTGYGEEAMTAR------HEKWMVEHGRTYK 64
Query: 61 CIEEKLHRFEIFKENLKHIDQRNKEV--TSYWLGLNEFADMSHEEFKNKYLGLKPQFPTR 118
EK RF++FK N +D N Y L +N FADM+H+EF +Y G KP P
Sbjct: 65 DEAEKARRFQVFKANAAFVDTSNAAAGGKKYHLAINRFADMTHDEFMARYTGFKP-LPAT 123
Query: 119 RQPSAEFSYRDVKALP---KSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSG 175
+ F Y +V ++VDWRKKGAVT VKNQ CG CWAFS VAA+EG++QI +G
Sbjct: 124 GKKMPGFKYANVTLSSEDQQAVDWRKKGAVTDVKNQQKCGCCWAFSAVAAIEGMHQINTG 183
Query: 176 NLTSLSEQELIDCDT-SFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKE 234
L SLSEQ+L+DC T NNGC GG M+ AF+Y++ + G+ E YPY +G C++ +
Sbjct: 184 ELVSLSEQQLVDCSTNGNNNGCGGGTMEDAFQYVIGNNGIATEAAYPYTAMQGMCQNVQ- 242
Query: 235 EMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTG-PCGAELDH 293
V + YQ VP +DE +L A+A QPVSVA++A+ +FQFY GGV T CG L+H
Sbjct: 243 --PAVAVRSYQQVPRDDEDALAAAVAGQPVSVAVDAN--NFQFYKGGVMTADSCGTNLNH 298
Query: 294 GVAAVGYGKSK-GSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPL 348
V AVGYG ++ G+ Y ++KN WG WGE GY+R++R G CG+ K AS P+
Sbjct: 299 AVTAVGYGTAEDGTPYWLLKNQWGSTWGEEGYLRLQRGVGA----CGVAKDASYPV 350
>gi|297818854|ref|XP_002877310.1| hypothetical protein ARALYDRAFT_484828 [Arabidopsis lyrata subsp.
lyrata]
gi|297323148|gb|EFH53569.1| hypothetical protein ARALYDRAFT_484828 [Arabidopsis lyrata subsp.
lyrata]
Length = 376
Score = 276 bits (706), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 147/311 (47%), Positives = 202/311 (64%), Gaps = 11/311 (3%)
Query: 47 LFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT-SYWLGLNEFADMSHEEFK 105
++E W+ +HGK Y + EK RF+IFK+NLKHI++ N + SY GLN+F+D++ +EF+
Sbjct: 40 IYERWLVEHGKNYNGLGEKERRFKIFKDNLKHIEEHNSDPNRSYDRGLNQFSDLTVDEFQ 99
Query: 106 NKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTP-VKNQGSCGSCWAFSTVA 164
YLG K + + + + Y++ LP VDWR++GAV P VK QG CGSCWAF+
Sbjct: 100 ASYLGGKIEKKSLSDVAERYQYKEGDILPDEVDWRERGAVVPRVKRQGDCGSCWAFAATG 159
Query: 165 AVEGINQIVSGNLTSLSEQELIDCDTSFNN-GCNGGLMDYAFKYIVASGGLHKEEDYPYL 223
AVEGINQI +G L SLSEQELIDCD +N GC GG +AF++I +GG+ +EDY Y
Sbjct: 160 AVEGINQITTGELLSLSEQELIDCDRGKDNFGCAGGGAVWAFEFIKENGGIVTDEDYGYT 219
Query: 224 MEEGTCEDKKEEME---VVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSG 280
++ T K EM+ VVTI+G++ VP NDE SL KA+++QP+SV I A+ + Y
Sbjct: 220 GDD-TAACKAIEMKTTRVVTINGHEVVPVNDEMSLKKAVSYQPISVMISAA--NMSDYKS 276
Query: 281 GVFTGPCGAEL-DHGVAAVGYGKSKG-SDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLC 338
GV+ GPC DH V VGYG S DY +++NSWGP WGE GY+R++RN +P G C
Sbjct: 277 GVYKGPCSNLWGDHNVLIVGYGTSSDEGDYWLIRNSWGPGWGEGGYLRLQRNFNEPTGKC 336
Query: 339 GINKMASIPLK 349
+ P+K
Sbjct: 337 AVAVAPVYPIK 347
>gi|302790570|ref|XP_002977052.1| hypothetical protein SELMODRAFT_268054 [Selaginella moellendorffii]
gi|300155028|gb|EFJ21661.1| hypothetical protein SELMODRAFT_268054 [Selaginella moellendorffii]
Length = 300
Score = 276 bits (706), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 136/303 (44%), Positives = 197/303 (65%), Gaps = 8/303 (2%)
Query: 47 LFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNK-EVTSYWLGLNEFADMSHEEFK 105
+FE W +KHGK+Y EK R IF + L +I++ N T++ LGLN+F+D+++ EF+
Sbjct: 1 MFEGWAAKHGKSYSSDWEKARRLMIFSDTLAYIEKHNALPNTTFTLGLNKFSDLTNAEFR 60
Query: 106 NKYLG-LKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVA 164
Y+G KP R+P+ + DV +LP S+DWR++GAVTP+K+QG CGSCWAFS +A
Sbjct: 61 ANYVGKFKPPRYQDRRPAKDVDV-DVSSLPTSLDWRQEGAVTPIKDQGQCGSCWAFSAIA 119
Query: 165 AVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLM 224
++E + + + L SLSEQ+LIDCDT + GC GG + AFK++V +GG+ EE YPY
Sbjct: 120 SIESAHFLATKELVSLSEQQLIDCDT-VDQGCQGGFPEDAFKFVVENGGVTTEEAYPYTG 178
Query: 225 EEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFT 284
G+C K + VV I+GY+DV ++ +L+KA++ PV+V I S +FQ Y G+ +
Sbjct: 179 FAGSCNANKNK--VVEITGYKDVTKDSADALMKAVSKTPVTVGICGSDQNFQNYRSGILS 236
Query: 285 GPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMA 344
G C DH V +GYG G Y I+KNSWG WGE G++R+K+ G EG+CG+N +
Sbjct: 237 GHCSNSRDHAVLVIGYGTEGGMPYWIIKNSWGTSWGEDGFMRIKKKDG--EGMCGMNGQS 294
Query: 345 SIP 347
S P
Sbjct: 295 SYP 297
>gi|414875906|tpg|DAA53037.1| TPA: hypothetical protein ZEAMMB73_586844 [Zea mays]
Length = 1039
Score = 276 bits (705), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 126/196 (64%), Positives = 152/196 (77%)
Query: 155 GSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGL 214
GSCWAFST+AAVEGINQIV+G+L SLSEQEL+DCDTS+N GCNGGLMDYAF++I+ +GG+
Sbjct: 713 GSCWAFSTIAAVEGINQIVTGDLISLSEQELVDCDTSYNQGCNGGLMDYAFEFIINNGGI 772
Query: 215 HKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTD 274
E+DYPY +G C+ ++ +VVTI Y+DVP NDE+SL KA+A+QPVSVAIEA+GT
Sbjct: 773 DTEKDYPYKGTDGRCDVNRKNAKVVTIDSYEDVPANDEKSLQKAVANQPVSVAIEAAGTT 832
Query: 275 FQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKP 334
FQ YS G+FTG CG LDHGV VGYG G DY I+KNSWG WGE GY+RM+RN
Sbjct: 833 FQLYSSGIFTGSCGTALDHGVTVVGYGTENGKDYWIMKNSWGSSWGESGYVRMERNIKAS 892
Query: 335 EGLCGINKMASIPLKK 350
G CGI S PLK+
Sbjct: 893 SGKCGIAVEPSYPLKE 908
>gi|302763837|ref|XP_002965340.1| hypothetical protein SELMODRAFT_143126 [Selaginella moellendorffii]
gi|302790566|ref|XP_002977050.1| hypothetical protein SELMODRAFT_232903 [Selaginella moellendorffii]
gi|300155026|gb|EFJ21659.1| hypothetical protein SELMODRAFT_232903 [Selaginella moellendorffii]
gi|300167573|gb|EFJ34178.1| hypothetical protein SELMODRAFT_143126 [Selaginella moellendorffii]
Length = 300
Score = 276 bits (705), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 136/303 (44%), Positives = 197/303 (65%), Gaps = 8/303 (2%)
Query: 47 LFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNK-EVTSYWLGLNEFADMSHEEFK 105
+FE W +KHGK+Y EK R IF + L +I++ N T++ LGLN+F+D+++ EF+
Sbjct: 1 MFEGWAAKHGKSYSSDWEKARRLMIFSDTLAYIEKHNALPNTTFTLGLNKFSDLTNAEFR 60
Query: 106 NKYLG-LKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVA 164
Y+G KP R+P+ + DV +LP S+DWR++GAVTP+K+QG CGSCWAFS +A
Sbjct: 61 ANYVGKFKPPRYQDRRPAKDVDV-DVSSLPTSLDWRQEGAVTPIKDQGQCGSCWAFSAIA 119
Query: 165 AVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLM 224
++E + + + L SLSEQ+LIDCDT + GC GG + AFK++V +GG+ EE YPY
Sbjct: 120 SIESAHFLATKELVSLSEQQLIDCDT-VDQGCQGGFPEDAFKFVVENGGVTTEEAYPYTG 178
Query: 225 EEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFT 284
G+C K + VV I+GY+DV ++ +L+KA++ PV+V I S +FQ Y G+ +
Sbjct: 179 FAGSCNANKNK--VVEITGYKDVTKDSADALMKAVSKTPVTVGICGSDQNFQNYRSGILS 236
Query: 285 GPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMA 344
G C DH V +GYG G Y I+KNSWG WGE G++R+K+ G EG+CG+N +
Sbjct: 237 GHCSNSRDHAVLVIGYGTEGGMPYWIIKNSWGTSWGEDGFMRIKKEDG--EGMCGMNGQS 294
Query: 345 SIP 347
S P
Sbjct: 295 SYP 297
>gi|66270077|gb|AAY43368.1| cysteine protease [Phytophthora infestans]
Length = 510
Score = 275 bits (704), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 151/308 (49%), Positives = 186/308 (60%), Gaps = 13/308 (4%)
Query: 48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGL----NEFADMSHEE 103
F +WM H ++ E R E + N +I + N E + W G+ NEF+ MS EE
Sbjct: 29 FSAWMKTHSVSFSDALEFAKRLENYIANDMYIMEHNLE--NAWTGVKLDHNEFSSMSFEE 86
Query: 104 FKNKYLG-LKPQ--FPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAF 160
FK K G + P+ R + + DV+ +P SVDW+ KG VTPVKNQG CGSCWAF
Sbjct: 87 FKFKMTGYVMPEGYLEQRLASRVDNLWSDVQ-VPDSVDWQDKGGVTPVKNQGMCGSCWAF 145
Query: 161 STVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDY 220
ST AVEG + SG L SLSEQEL+DCD + + GCNGGLMD+AF +I +GG+ E+DY
Sbjct: 146 STTGAVEGAAFVSSGKLVSLSEQELVDCDHNGDMGCNGGLMDHAFAWIEDNGGICSEDDY 205
Query: 221 PYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSG 280
Y + C D + +VV ISG+QDV DE +L A+A QPVSVAIEA FQFY
Sbjct: 206 EYKAKAQVCRDCE---KVVKISGFQDVNPQDEHALKVAVAQQPVSVAIEADQKAFQFYKS 262
Query: 281 GVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGI 340
GVF CG LDHGV AVGYG G + VKNSWG WGE+GYIR+ R P G CGI
Sbjct: 263 GVFNLTCGTRLDHGVLAVGYGSENGQKFWKVKNSWGSSWGEKGYIRLAREENGPAGQCGI 322
Query: 341 NKMASIPL 348
+ S P
Sbjct: 323 ASVPSYPF 330
>gi|301116794|ref|XP_002906125.1| cysteine protease family C01A, putative [Phytophthora infestans
T30-4]
gi|262107474|gb|EEY65526.1| cysteine protease family C01A, putative [Phytophthora infestans
T30-4]
Length = 535
Score = 275 bits (703), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 151/308 (49%), Positives = 186/308 (60%), Gaps = 13/308 (4%)
Query: 48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGL----NEFADMSHEE 103
F +WM H ++ E R E + N +I + N E + W G+ NEF+ MS EE
Sbjct: 29 FSAWMKTHSVSFSDALEFAKRLENYIANDMYIMEHNLE--NAWTGVKLDHNEFSSMSFEE 86
Query: 104 FKNKYLG-LKPQ--FPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAF 160
FK K G + P+ R + + DV+ +P SVDW+ KG VTPVKNQG CGSCWAF
Sbjct: 87 FKFKMTGYVMPEGYLEQRLASRVDNLWSDVQ-VPDSVDWQDKGGVTPVKNQGMCGSCWAF 145
Query: 161 STVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDY 220
ST AVEG + SG L SLSEQEL+DCD + + GCNGGLMD+AF +I +GG+ E+DY
Sbjct: 146 STTGAVEGAAFVSSGKLVSLSEQELVDCDHNGDMGCNGGLMDHAFAWIEDNGGICSEDDY 205
Query: 221 PYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSG 280
Y + C D + +VV ISG+QDV DE +L A+A QPVSVAIEA FQFY
Sbjct: 206 EYKAKAQVCRDCE---KVVKISGFQDVNPQDEHALKVAVAQQPVSVAIEADQKAFQFYKS 262
Query: 281 GVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGI 340
GVF CG LDHGV AVGYG G + VKNSWG WGE+GYIR+ R P G CGI
Sbjct: 263 GVFNLTCGTRLDHGVLAVGYGSENGQKFWKVKNSWGSSWGEKGYIRLAREENGPAGQCGI 322
Query: 341 NKMASIPL 348
+ S P
Sbjct: 323 ASVPSYPF 330
>gi|388497270|gb|AFK36701.1| unknown [Lotus japonicus]
Length = 343
Score = 275 bits (702), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 154/344 (44%), Positives = 217/344 (63%), Gaps = 16/344 (4%)
Query: 11 LLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFE 70
+++L L+AC+ A ++ + + + + + WM ++G++Y E RF+
Sbjct: 7 IIALCTMLWACAYTAMSRTL------YDETSSVVAKTHQQWMLQYGRSYTNDAEMEKRFK 60
Query: 71 IFKENLKHIDQRNKEV--TSYWLGLNEFADMSHEEFKNKYLGL--KPQFPTRRQPSAEFS 126
IF ENL++I++ N SY L LN+F+D+++EEF + GL P P+ A +
Sbjct: 61 IFMENLEYIEKFNNAPGNKSYKLDLNQFSDLTNEEFIASHTGLMIDPSKPSSSSKRASPA 120
Query: 127 YRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELI 186
D+ P S+DWR++GAVT VKNQG+CGSCWAFS VAAVEGI +I +GNL SLSEQ+L+
Sbjct: 121 SLDLSDTPTSLDWREQGAVTDVKNQGNCGSCWAFSAVAAVEGIVKIKNGNLISLSEQQLV 180
Query: 187 DCDTS-FNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQ 245
DC ++ N GC GG MD AF YI + G+ E DY Y GTC++ + ISGY+
Sbjct: 181 DCASNEQNQGCGGGFMDNAFSYITEN-GIASENDYQYRGGAGTCQNNEMITPAARISGYE 239
Query: 246 DVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSK- 304
DVP ++Q LL A++ QPVSVAI A G F Y G+++GPCG+ L+HGV VGYG S+
Sbjct: 240 DVPAGEDQ-LLLAVSQQPVSVAI-AVGQSFHLYKEGIYSGPCGSSLNHGVTLVGYGTSEE 297
Query: 305 -GSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
G+ Y ++KNSWG WGE GY+R+ R +G+ EG CGI AS P
Sbjct: 298 DGTKYWLIKNSWGESWGENGYMRLLRESGQSEGHCGIAVKASHP 341
>gi|42563538|gb|AAS20467.1| cysteine protease-like protein [Pelargonium x hortorum]
Length = 234
Score = 275 bits (702), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 129/198 (65%), Positives = 152/198 (76%), Gaps = 1/198 (0%)
Query: 154 CGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGG 213
CG CWAFST+AAVEGIN IV+G L SLSEQEL+DCD S+N GCNGGLMDYAF++I+ +GG
Sbjct: 1 CGRCWAFSTIAAVEGINHIVTGELISLSEQELVDCDRSYNQGCNGGLMDYAFEFIIKNGG 60
Query: 214 LHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGT 273
+ EEDYPY +GTC+ ++ +VVTI GY+DVPENDE SL KA+A+QPVSVAIEA G
Sbjct: 61 IDSEEDYPYKAVDGTCDPIRKNAKVVTIDGYEDVPENDENSLKKAVAYQPVSVAIEAGGR 120
Query: 274 DFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGK 333
+FQ Y G+FTG CG LDHGVAAVGYG G DY IV+NSWG WGE GYIRM+RN
Sbjct: 121 EFQLYQSGIFTGRCGTALDHGVAAVGYGTENGIDYWIVRNSWGSSWGENGYIRMERNVKT 180
Query: 334 PE-GLCGINKMASIPLKK 350
+ G CGI AS P K+
Sbjct: 181 TKTGKCGIAMEASYPTKE 198
>gi|302763109|ref|XP_002964976.1| hypothetical protein SELMODRAFT_83176 [Selaginella moellendorffii]
gi|302763113|ref|XP_002964978.1| hypothetical protein SELMODRAFT_83554 [Selaginella moellendorffii]
gi|300167209|gb|EFJ33814.1| hypothetical protein SELMODRAFT_83176 [Selaginella moellendorffii]
gi|300167211|gb|EFJ33816.1| hypothetical protein SELMODRAFT_83554 [Selaginella moellendorffii]
Length = 300
Score = 274 bits (701), Expect = 4e-71, Method: Compositional matrix adjust.
Identities = 134/303 (44%), Positives = 197/303 (65%), Gaps = 8/303 (2%)
Query: 47 LFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEV-TSYWLGLNEFADMSHEEFK 105
+FE W +KH K+Y EK R +F + L +I++ N + T++ LGLN+F+D+++ EF+
Sbjct: 1 MFEDWAAKHDKSYSSDWEKARRLMVFSDTLAYIEKHNAQPNTTFTLGLNKFSDLTNAEFR 60
Query: 106 NKYLG-LKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVA 164
Y+G KP R+P+ + DV +LP S+DWR++GAVTP+K+QG CGSCWAFS +A
Sbjct: 61 ANYVGKFKPPRYQDRRPAKDVDV-DVSSLPTSLDWRQEGAVTPIKDQGQCGSCWAFSAIA 119
Query: 165 AVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLM 224
++E + + + L SLSEQ+LIDCDT + GC GG D AFK++V +GG+ EE YPY
Sbjct: 120 SIESAHFLATKELVSLSEQQLIDCDT-VDQGCQGGFPDDAFKFVVENGGVTTEEAYPYTG 178
Query: 225 EEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFT 284
G+C K + VV I+GY+DV ++ +L+KA++ PV+V I S +FQ Y G+ +
Sbjct: 179 FAGSCNTNKNK--VVEITGYKDVTKDSADALMKAVSKTPVTVGICGSDQNFQNYRSGILS 236
Query: 285 GPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMA 344
G C DH V +GYG G Y I+KNSWG WGE G++++K+ G EG+CG+N +
Sbjct: 237 GQCCNSRDHAVLVIGYGTEGGMPYWIIKNSWGTSWGEDGFMKIKKKDG--EGMCGMNGQS 294
Query: 345 SIP 347
S P
Sbjct: 295 SYP 297
>gi|218198967|gb|EEC81394.1| hypothetical protein OsI_24614 [Oryza sativa Indica Group]
Length = 342
Score = 274 bits (701), Expect = 4e-71, Method: Compositional matrix adjust.
Identities = 148/311 (47%), Positives = 186/311 (59%), Gaps = 15/311 (4%)
Query: 49 ESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT-SYWLGLNEFADMSHEEFKNK 107
+ WM++HG+TYK EK RF +FK N+ ID+ N Y L N F D++ EF
Sbjct: 33 DKWMAEHGRTYKDAAEKARRFRVFKANVDLIDRSNAAGNKRYRLATNRFTDLTDAEFAAM 92
Query: 108 YLGLKPQFPTRRQPSA--EFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAA 165
Y G P +A S D + P VDWR++GAVT VKNQ SCG CWAFSTVAA
Sbjct: 93 YTGYNPANTMYAAANATTRLSSEDDQ-QPAEVDWRQQGAVTGVKNQRSCGCCWAFSTVAA 151
Query: 166 VEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLME 225
VEGI+QI +G L SLSEQ+L+DC + N GC GG +D AF+Y+ SGG+ E Y Y
Sbjct: 152 VEGIHQITTGELVSLSEQQLLDC--ADNGGCTGGSLDNAFQYMANSGGVTTEAAYAYQGA 209
Query: 226 EGTCE---DKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGV 282
+G C+ TISGYQ V NDE SL A+A QPVSVAIE SG F+ Y GV
Sbjct: 210 QGACQFDASSSASGVAATISGYQRVNPNDEGSLAAAVASQPVSVAIEGSGAMFRHYGSGV 269
Query: 283 FTG-PCGAELDHGVAAVGYGK----SKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGL 337
FT CG +LDH VA VGYG S G Y I+KNSWG WG+ GY++++++ G +G
Sbjct: 270 FTADSCGTKLDHAVAVVGYGAEADGSGGGGYWIIKNSWGTTWGDGGYMKLEKDVGS-QGA 328
Query: 338 CGINKMASIPL 348
CG+ S P+
Sbjct: 329 CGVAMAPSYPV 339
>gi|356560855|ref|XP_003548702.1| PREDICTED: P34 probable thiol protease-like [Glycine max]
Length = 357
Score = 273 bits (699), Expect = 7e-71, Method: Compositional matrix adjust.
Identities = 155/348 (44%), Positives = 208/348 (59%), Gaps = 15/348 (4%)
Query: 9 LLLLSLSLSLFACSS-LAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLH 67
+ ++L F+ SS +SI+G + + L S D+ I+LF+ W +HG YK ++E
Sbjct: 12 FFFICITLICFSSSSNFPVQYSILGPNLDKLPSQDETIQLFQLWRKEHGLVYKDLKEMAK 71
Query: 68 RFEIFKENLKHIDQRNKEVTS---YWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAE 124
RFEIF NL +I + N + +S Y LGLN FAD S EF+ YL PT P
Sbjct: 72 RFEIFLSNLNYIIEFNAKRSSPSGYLLGLNNFADWSPSEFQEIYLH-SLDMPTDSAPKLN 130
Query: 125 FSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQE 184
A P S+DWR K AVT +KNQGSCGSCWAFS A+EGI+ I +G L SLSEQE
Sbjct: 131 GPLLSCIA-PASLDWRNKVAVTAIKNQGSCGSCWAFSAAGAIEGIHAITTGELISLSEQE 189
Query: 185 LIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEE-GTCEDKKEEMEVVTISG 243
L++CD + GCNGG ++ AF +++++GG+ E +YPY ++ G C K+ TI G
Sbjct: 190 LVNCD-RVSKGCNGGWVNKAFDWVISNGGITLEAEYPYTGKDGGNCNSDKQVPIKATIDG 248
Query: 244 YQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTG-PCGAE---LDHGVAAVG 299
Y+ V ++D LL ++ QP+S+ + A TDFQ Y G+F G C + +H V VG
Sbjct: 249 YEQVEQSD-NGLLCSIVKQPISICLNA--TDFQLYESGIFDGQQCSSSSKYTNHCVLIVG 305
Query: 300 YGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
Y S G DY IVKNSWG KWG GYI +KRNTG P G+CG+N A P
Sbjct: 306 YDSSNGEDYWIVKNSWGTKWGINGYIWIKRNTGLPYGVCGMNAWAYNP 353
>gi|356515116|ref|XP_003526247.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 333
Score = 273 bits (698), Expect = 8e-71, Method: Compositional matrix adjust.
Identities = 153/332 (46%), Positives = 191/332 (57%), Gaps = 34/332 (10%)
Query: 48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNK 107
F+ W+ +G Y+ EE RF I++ N+++I + + SY L N+FAD+++EEF +
Sbjct: 5 FDRWLKXNGXNYEDKEEWEIRFVIYQANVEYIGCKKSQKNSYNLTDNKFADLTNEEFVST 64
Query: 108 YLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCG------------ 155
YLG F TR P F Y + LP S DWRK+GAVT +K+QG+CG
Sbjct: 65 YLG----FATRLIPHTRFKYHEHGNLPXSKDWRKEGAVTDIKDQGNCGKHSTWFSPEISH 120
Query: 156 -----------------SCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSF-NNGCN 197
S WAFS VAAVE IN+I SG L SLSEQEL+D D + N GC
Sbjct: 121 NLRNILTNYNTINFRDISFWAFSVVAAVERINKIKSGKLVSLSEQELVDYDVANKNQGCE 180
Query: 198 GGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLK 257
GGLMD F +I +GGL +DYPY +G+C +K V ISGY+ P DE L
Sbjct: 181 GGLMDTTFAFIKKNGGLTTSKDYPYEGVDGSCNKEKALHHAVNISGYERAPSKDEAMLKV 240
Query: 258 ALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGP 317
A A+QP+SVAI+A G FQ YS GVF+G CG +L+HGV VGY K Y VKNS G
Sbjct: 241 AAANQPISVAIDAGGYAFQLYSQGVFSGVCGKKLNHGVTIVGYDKGTFDKYRTVKNSXGA 300
Query: 318 KWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
WGE GYIRMKR+ G CGI AS PLK
Sbjct: 301 DWGESGYIRMKRDAFDKAGTCGIAMKASYPLK 332
>gi|297851332|ref|XP_002893547.1| peptidase C1A papain family protein [Arabidopsis lyrata subsp.
lyrata]
gi|297339389|gb|EFH69806.1| peptidase C1A papain family protein [Arabidopsis lyrata subsp.
lyrata]
Length = 345
Score = 273 bits (698), Expect = 9e-71, Method: Compositional matrix adjust.
Identities = 139/308 (45%), Positives = 191/308 (62%), Gaps = 10/308 (3%)
Query: 49 ESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQ-RNKEVTSYWLGLNEFADMSHEEFKNK 107
+ WM + Y EK R E+F ENLK I+ N SY LG+N+F D + EEF
Sbjct: 39 QKWMINFSRVYDDEFEKQMRLEVFTENLKFIENFNNMGSQSYKLGVNKFTDWTKEEFLAT 98
Query: 108 YLGLK-----PQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFST 162
+ GL F + + +++ L + DWR +GAVTPVK QG CG CWAFS
Sbjct: 99 HTGLSGINVTSPFEVVNETTPAWNWTVSDVLGTTKDWRNEGAVTPVKYQGECGGCWAFSA 158
Query: 163 VAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPY 222
+AAVEG+ +I GNL SLSEQ+L+DC NNGC GG M AF YIV +GG+ E YPY
Sbjct: 159 IAAVEGLTKIARGNLISLSEQQLLDCAREQNNGCKGGTMIEAFNYIVKNGGVSSENAYPY 218
Query: 223 LMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGV 282
++EG C + ++ + I G+++VP N+E++LL+A++ QPV+V I+AS T F YSGGV
Sbjct: 219 QVKEGPC--RSNDIPAIVIRGFENVPSNNERALLEAVSRQPVAVDIDASETGFIHYSGGV 276
Query: 283 FTG-PCGAELDHGVAAVGYGKSK-GSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGI 340
+ CG ++H V VGYG S+ G Y + KNSWG WGE GYIR++R+ P+G+CG+
Sbjct: 277 YNARDCGTSVNHAVTLVGYGTSQEGIKYWLAKNSWGKTWGENGYIRIRRDVEWPQGMCGV 336
Query: 341 NKMASIPL 348
+ AS P+
Sbjct: 337 AQYASYPV 344
>gi|1085731|pir||S46476 cysteine proteinase (EC 3.4.22.-) III - mountain papaya
gi|926847|gb|AAB32657.1| cysteine proteinase CC-III [Carica candamarcensis=mountain papaya,
Hook, latex, Peptide, 214 aa]
Length = 214
Score = 273 bits (698), Expect = 9e-71, Method: Compositional matrix adjust.
Identities = 130/214 (60%), Positives = 162/214 (75%), Gaps = 6/214 (2%)
Query: 134 PKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFN 193
P+S+DWRKKGAVTPVKNQGSCGSCWAFST+A VEGIN+IV GNLTSLSEQEL+DCD +
Sbjct: 2 PESIDWRKKGAVTPVKNQGSCGSCWAFSTIATVEGINKIVHGNLTSLSEQELVDCDRR-S 60
Query: 194 NGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQ 253
+GC GG + KY+V G+H E++YPY ++ C K ++ +V ISGY+ VP NDE
Sbjct: 61 HGCKGGYQTTSLKYVVDH-GVHTEKEYPYEEKQYKCRAKDKKPPIVKISGYKKVPSNDEI 119
Query: 254 SLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKN 313
SL+KA+A QPVSV +E+ G FQFY G+F GPCG ++DH V AVGYGK DYI++KN
Sbjct: 120 SLIKAIAKQPVSVLVESKGKAFQFYKKGIFGGPCGTKVDHAVTAVGYGK----DYILIKN 175
Query: 314 SWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
SWGP WGE GYI++KR +G EG+CGI K + P
Sbjct: 176 SWGPXWGEXGYIKIKRASGHCEGICGIYKSSYFP 209
>gi|405971603|gb|EKC36430.1| Cathepsin L [Crassostrea gigas]
Length = 360
Score = 273 bits (698), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 159/305 (52%), Positives = 198/305 (64%), Gaps = 19/305 (6%)
Query: 55 HGKTYKCIEEKLHRFEIFKENLKHIDQRNKEV----TSYWLGLNEFADMSHEEFKNKYLG 110
H KTY +EE+ RFEIF+EN++ I++ NK SY+LG+N+F+D+ HEEF KY G
Sbjct: 63 HDKTYDALEEESRRFEIFRENVQKIEEHNKLYHLGKKSYYLGVNQFSDLKHEEFV-KYNG 121
Query: 111 LKPQFPTRRQPSAEFSYRDVKAL--PKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEG 168
LK T + SY L P SVDWRKKG VT VKNQG CGSCW+FST ++EG
Sbjct: 122 LKK---TSLKDGGCSSYLAANNLVEPDSVDWRKKGYVTDVKNQGQCGSCWSFSTTGSLEG 178
Query: 169 INQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEG 227
+ SG L SLSE +L+DC SF N GCNGGLMD AFKYI + GGL EEDYPY ++G
Sbjct: 179 QHFRKSGKLVSLSESQLVDCSQSFGNEGCNGGLMDNAFKYIKSVGGLESEEDYPYKPKQG 238
Query: 228 TCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAIEASGTDFQFYSGGVFTGP 286
TC+ ++ T +G DV E +L KA++ PVSVAI+AS + FQ Y+GGV+ P
Sbjct: 239 TCKFDDTKV-AATDTGCVDVESGSESALKKAVSEVGPVSVAIDASHSSFQSYAGGVYDEP 297
Query: 287 -CGAE-LDHGVAAVGYG-KSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKM 343
C +E LDHGV VGYG +G DY IVKNSWG +WGE GY++M RN + CGI
Sbjct: 298 ECSSEQLDHGVLCVGYGTDDQGQDYWIVKNSWGAEWGEDGYVKMSRN---KKNQCGIATQ 354
Query: 344 ASIPL 348
AS PL
Sbjct: 355 ASYPL 359
>gi|348687948|gb|EGZ27762.1| papain-like cysteine protease C1 [Phytophthora sojae]
Length = 533
Score = 273 bits (697), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 150/306 (49%), Positives = 184/306 (60%), Gaps = 9/306 (2%)
Query: 48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEV--TSYWLGLNEFADMSHEEFK 105
F +WMS HG T+ E R E + N +I + N E T LG N F+ MS +EFK
Sbjct: 28 FSAWMSAHGVTFSDALEFARRLENYIANDMYILEHNAENAWTGVKLGHNAFSHMSFDEFK 87
Query: 106 NKYLGLK-PQ--FPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFST 162
K GL P+ R + + DV+ +P +VDW KG VTPVKNQG CGSCWAFST
Sbjct: 88 FKMTGLVLPEGYLEQRLASRVDGLWSDVE-VPSAVDWVDKGGVTPVKNQGMCGSCWAFST 146
Query: 163 VAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPY 222
AVEG + SG L SLSEQEL+DCD + + GCNGGLMD+AF++I GG+ E+DY Y
Sbjct: 147 TGAVEGATFVSSGKLLSLSEQELVDCDHNGDMGCNGGLMDHAFQWIEDHGGICSEDDYEY 206
Query: 223 LMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGV 282
+ C ++ VV ++G+QDV DE +L A+A QPVSVAIEA FQFY GV
Sbjct: 207 KAKAQVC---RKCDSVVKVTGFQDVNPQDEHALKVAVAQQPVSVAIEADQKAFQFYKSGV 263
Query: 283 FTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINK 342
F CG LDHGV AVGYG G + VKNSWG WGE+GYIR+ R P G CGI
Sbjct: 264 FNLTCGTRLDHGVLAVGYGNDNGQKFWKVKNSWGASWGEQGYIRLAREENGPAGQCGIAS 323
Query: 343 MASIPL 348
+ S P
Sbjct: 324 VPSYPF 329
>gi|354549232|gb|AER27707.1| putative cysteine protease [Phytophthora sp. SH-2011]
Length = 533
Score = 273 bits (697), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 150/306 (49%), Positives = 183/306 (59%), Gaps = 9/306 (2%)
Query: 48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEV--TSYWLGLNEFADMSHEEFK 105
F +WM HG T+ E R E + N +I + N E T LG N F+ MS +EFK
Sbjct: 28 FSAWMGAHGVTFSDALEFARRLENYIVNDMYIMEHNAENAWTGVTLGHNAFSHMSFDEFK 87
Query: 106 NKYLGLK-PQ--FPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFST 162
K GL P+ R + + DV+ +P +VDW KG VTPVKNQG CGSCWAFST
Sbjct: 88 FKMTGLVLPEGYLEQRLASRVDGLWSDVE-VPSAVDWVDKGGVTPVKNQGMCGSCWAFST 146
Query: 163 VAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPY 222
AVEG + SG L SLSEQEL+DCD + + GCNGGLMD+AF++I GG+ E+DY Y
Sbjct: 147 TGAVEGATFVSSGKLPSLSEQELVDCDHNGDMGCNGGLMDHAFQWIEDHGGICSEDDYEY 206
Query: 223 LMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGV 282
+ C +E VV ++G+QDV DE +L A+A QPVSVAIEA FQFY GV
Sbjct: 207 KAKAQVC---RECDSVVKVTGFQDVNPQDEHALKVAVAQQPVSVAIEADQKAFQFYKSGV 263
Query: 283 FTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINK 342
F CG LDHGV AVGYG G + VKNSWG WGE+GYIR+ R P G CGI
Sbjct: 264 FNLTCGTRLDHGVLAVGYGNDNGHKFWKVKNSWGASWGEQGYIRLAREENGPAGQCGIAS 323
Query: 343 MASIPL 348
+ S P
Sbjct: 324 VPSYPF 329
>gi|242093944|ref|XP_002437462.1| hypothetical protein SORBIDRAFT_10g027570 [Sorghum bicolor]
gi|241915685|gb|EER88829.1| hypothetical protein SORBIDRAFT_10g027570 [Sorghum bicolor]
Length = 366
Score = 272 bits (696), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 158/365 (43%), Positives = 206/365 (56%), Gaps = 32/365 (8%)
Query: 7 SKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKL 66
+ L+++ ++LS+ +S + Y+ L S + L L+E W + H + EK
Sbjct: 13 ATLVVVGMALSIAPVAS------AIDYTERDLASEESLWALYERWCA-HYNMARDHGEKT 65
Query: 67 HRFEIFKENLKHIDQRNKE-VTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAE- 124
RF++FKEN + I + N + +Y LGLN F+DM+ EEF G P E
Sbjct: 66 RRFDLFKENARRIYEHNHQGNATYTLGLNRFSDMTDEEFNRSPYGGCLTAPRMSDDEIEE 125
Query: 125 ------------------FSYRDVKALPKSVDWRKKGAVTPVKNQG-SCGSCWAFSTVAA 165
S P +VDWR + AVT VK+QG +CGSCWAFS +AA
Sbjct: 126 LHHHHHQQEDDGSFNLTHGSGGGKLGAPPAVDWRGR-AVTRVKDQGPTCGSCWAFSAIAA 184
Query: 166 VEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLME 225
VEGIN I + NL LSEQ+L+DCD N+GCNGGLM AF ++V + G+ E YPY+
Sbjct: 185 VEGINAIRTRNLVPLSEQQLVDCD-KLNHGCNGGLMTTAFSFVVRNRGVVPEGAYPYMGR 243
Query: 226 EGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTG 285
EG C K VTI GYQ VP D +L+ A+A QPVSVAIEAS +F+ Y GGVF G
Sbjct: 244 EGRC--KHVMAPPVTIYGYQRVPRFDANALMNAVAAQPVSVAIEASSFEFRHYQGGVFNG 301
Query: 286 PCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMAS 345
CG L H AVGYG G + IVKNSWGP WGE GY+R+ RNT +G+CGI S
Sbjct: 302 NCGGRLGHAATAVGYGADAGGPFWIVKNSWGPGWGEGGYVRISRNTPVRQGVCGILTENS 361
Query: 346 IPLKK 350
P+K+
Sbjct: 362 YPVKR 366
>gi|413953666|gb|AFW86315.1| hypothetical protein ZEAMMB73_539008 [Zea mays]
Length = 314
Score = 272 bits (696), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 144/345 (41%), Positives = 198/345 (57%), Gaps = 37/345 (10%)
Query: 11 LLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFE 70
+ +L S+ A A F + L+ ++ E WM+++ + YK EK RF+
Sbjct: 1 MATLKASILAILGFAF-FCGAALAARDLSDDSAMVARHEQWMAQYSRVYKDASEKARRFK 59
Query: 71 IFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYL--GLKPQFPTRRQPSAEFSYR 128
FAD+++ EF++ G K + + F Y
Sbjct: 60 -------------------------FADLTNHEFRSVKTNKGFKS---SNMKILTGFRYE 91
Query: 129 DVKA--LPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELI 186
+V A LP ++DWR KG VTP+K+QG CG C AFS VAA EGI +I +G L SL++QEL+
Sbjct: 92 NVSADALPTTIDWRTKGVVTPIKDQGQCGCCSAFSAVAATEGIVKISTGKLVSLADQELV 151
Query: 187 DCDT-SFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQ 245
DCD + GC GGLMD AFK+I+ +GGL E YPY +G C TI GY+
Sbjct: 152 DCDVHGEDQGCEGGLMDDAFKFIIKNGGLTTESSYPYTAADGKCNSGSNS--AATIKGYE 209
Query: 246 DVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGK-SK 304
DVP NDE +L+KA+A+QPVSVA++ F+FYSGGV TG CG +LDHG+AA+GYGK S
Sbjct: 210 DVPANDEAALMKAMANQPVSVAVDGGDMTFRFYSGGVMTGSCGTDLDHGIAAIGYGKTSD 269
Query: 305 GSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
G+ Y ++KNSWG WGE GY+RM+++ G+CG+ S P K
Sbjct: 270 GTKYWLMKNSWGTTWGENGYLRMEKDISDKRGMCGLAMEPSYPTK 314
>gi|260516672|gb|ACX43963.1| cysteine protease 3, partial [Brachiaria hybrid cultivar]
Length = 319
Score = 272 bits (696), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 151/296 (51%), Positives = 189/296 (63%), Gaps = 11/296 (3%)
Query: 36 EHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEV-TSYWLGLN 94
E + S L ++F ++M ++ K Y E RF FK +++ I N SY +GLN
Sbjct: 30 EEVPSEVMLQDMFTAFMKQYSKAYSHAEFS-SRFNQFKASVETIRLHNTLANASYTMGLN 88
Query: 95 EFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSC 154
EFAD+S EEFK KY G K R + +++V+A P S+DWR AVTP+K+QG C
Sbjct: 89 EFADLSFEEFKGKYFGCKH--VEREFARSNNLHQEVEAAPTSIDWRTSNAVTPIKDQGQC 146
Query: 155 GSCWAFSTVAAVEGINQIVSG--NLTSLSEQELIDCDTSFNN-GCNGGLMDYAFKYIVAS 211
GSCWAFS ++EG ++ G LTSLSEQ+L+DC TS+ N GCNGGLMDYAF+YI+A+
Sbjct: 147 GSCWAFSATGSIEGA-WVLQGKHTLTSLSEQQLVDCSTSYGNAGCNGGLMDYAFEYIIAN 205
Query: 212 GGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAIEA 270
G+ E YPY G C+ K +VVTISG++DV DE S L A+ PVSVAIEA
Sbjct: 206 KGICAESAYPYKGVGGLCQ--KSCTKVVTISGHKDVASGDEASSLNAVGTVGPVSVAIEA 263
Query: 271 SGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIR 326
FQFYS GVF+G CG LDHGV AVGYG + DY IVKNSWG WGE GYIR
Sbjct: 264 DQAGFQFYSSGVFSGTCGHNLDHGVLAVGYGTTGSQDYWIVKNSWGTSWGESGYIR 319
>gi|325303202|tpg|DAA34687.1| TPA_inf: cathepsin L-like cysteine proteinase B [Amblyomma
variegatum]
Length = 337
Score = 272 bits (695), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 153/305 (50%), Positives = 195/305 (63%), Gaps = 15/305 (4%)
Query: 55 HGKTYKCIEEKLHRFEIFKENLKHIDQRNKEV----TSYWLGLNEFADMSHEEFKNKYLG 110
HGK Y+ E+ +R +I+ EN I + N++ SY L +NE+ DM H EF + G
Sbjct: 36 HGKEYQSETEEYYRLKIYMENRMMIARHNEKYANNKVSYKLAMNEYGDMLHHEFVSTRNG 95
Query: 111 LKPQFPTR-RQPSAEFSYRDV--KALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVE 167
+ + ++ RQ S + K LPK+VDWRKKGAVTPVKNQG CGSCWAFST ++E
Sbjct: 96 FRRDYRSKPRQGSFYIEPEGIEDKHLPKTVDWRKKGAVTPVKNQGQCGSCWAFSTTGSLE 155
Query: 168 GINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEE 226
G + SG++ SLSEQ L+DC T+F NNGC GGLMD AFKYI A+GG+ E+ YPY +
Sbjct: 156 GQHFRKSGDMVSLSEQNLVDCSTAFGNNGCEGGLMDNAFKYIKANGGIDTEKSYPYNGTD 215
Query: 227 GTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAIEASGTDFQFYSGGVFTG 285
GTC KK ++ T +G+ D+PE +E L KA+A P+SVAI+AS FQFYS GV+
Sbjct: 216 GTCHFKKSDVG-ATDTGFVDIPEGNEHLLKKAVATVGPISVAIDASHQSFQFYSQGVYDE 274
Query: 286 P-CGAE-LDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKM 343
P C +E LDHGV VGYG DY +VKNSWG WG+ GYI M RN + CGI
Sbjct: 275 PECSSENLDHGVLVVGYGTKDDQDYWLVKNSWGTTWGDGGYIYMTRN---KDNQCGIASS 331
Query: 344 ASIPL 348
AS PL
Sbjct: 332 ASYPL 336
>gi|359483514|ref|XP_003632971.1| PREDICTED: LOW QUALITY PROTEIN: oryzain beta chain-like [Vitis
vinifera]
Length = 340
Score = 271 bits (694), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 150/348 (43%), Positives = 202/348 (58%), Gaps = 26/348 (7%)
Query: 16 LSLFACSSL-----AHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFE 70
+ LF C +L H S P H SM E E WM+++ + YK E+ RF
Sbjct: 1 MCLFVCMTLHIYYLEHRASEATSRPLHEASM---YERHEQWMARYSRNYKDDAEEERRFX 57
Query: 71 IFKENLKHI-------DQRNKEVTSYWLGLNEFADMSHEEFK--NKYLGLKPQFPTRRQP 121
+FK+N+ I + NK LG+N ADM+HEEF+ + P R +
Sbjct: 58 MFKDNVDFIQTFDTAGNMPNK------LGVNALADMTHEEFRASGNTFKIPPNLGLRSET 111
Query: 122 SAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLS 181
++ F +++V +P ++DWRKK VT +KNQ CG CWAFS VAA+EGI ++ + SLS
Sbjct: 112 TS-FRHQNVTRIPSTMDWRKKRTVTHIKNQLQCGGCWAFSAVAAMEGIAKLQTSKSISLS 170
Query: 182 EQELIDCDTSFNN-GCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVT 240
EQEL+DCD +N GC GG MD AFK+I+ + GL+ E Y Y EG C KKE
Sbjct: 171 EQELVDCDIFGSNIGCEGGCMDDAFKFIIQNRGLNSEARYLYKGVEGHCNKKKESSRAAR 230
Query: 241 ISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGY 300
I+ Y+++PE E++LLK +AHQP+SVAI+A G+ FQFY G+ T G +LD+GV GY
Sbjct: 231 INDYENMPEFSEKALLKVVAHQPISVAIDAGGSAFQFYEIGIITXESGNDLDYGVTTDGY 290
Query: 301 GKS-KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
G+S G + +VKNSWG WGE GY RM+R GLCG AS P
Sbjct: 291 GRSADGKKHWLVKNSWGTDWGENGYTRMERGVKATTGLCGFTMQASYP 338
>gi|49387634|dbj|BAD25828.1| putative cysteine proteinase [Oryza sativa Japonica Group]
gi|49388888|dbj|BAD26098.1| putative cysteine proteinase [Oryza sativa Japonica Group]
Length = 358
Score = 271 bits (694), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 145/318 (45%), Positives = 187/318 (58%), Gaps = 20/318 (6%)
Query: 44 LIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRN-KEVTSYWLGLNEFADMSHE 102
+++ F +W H ++Y EE L RF++++ N + ID N + +Y L NEFAD++ E
Sbjct: 43 MMDRFRAWQGAHNRSYPSAEEALQRFDVYRRNAEFIDAVNLRGDLTYQLAENEFADLTEE 102
Query: 103 EFKNKYLGLKP-QFPTRRQP--------SAEFSYR-DVKALPKSVDWRKKGAVTPVKNQG 152
EF Y G P A FSYR DV P SVDWR +GAV P K+Q
Sbjct: 103 EFLATYTGYYAGDGPVDDSVITTGAGDVDASFSYRVDV---PASVDWRAQGAVVPPKSQT 159
Query: 153 S-CGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVAS 211
S C SCWAF T A +E +N I +G L SLSEQ+L+DCD S++ GCN G A+K++V +
Sbjct: 160 STCSSCWAFVTAATIESLNMIKTGKLVSLSEQQLVDCD-SYDGGCNLGSYGRAYKWVVEN 218
Query: 212 GGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEAS 271
GGL E DYPY G C K I+G+ VP +E +L A+A QPV+VAIE
Sbjct: 219 GGLTTEADYPYTARRGPCNRAKSAHHAAKITGFGKVPPRNEAALQAAVARQPVAVAIEV- 277
Query: 272 GTDFQFYSGGVFTGPCGAELDHGVAAVGYGK--SKGSDYIIVKNSWGPKWGERGYIRMKR 329
G+ QFY GGV+TGPCG L H V VGYG S G+ Y +KNSWG WGERGYIR+ R
Sbjct: 278 GSGMQFYKGGVYTGPCGTRLAHAVTVVGYGTDASSGAKYWTIKNSWGQSWGERGYIRILR 337
Query: 330 NTGKPEGLCGINKMASIP 347
+ G P GLCG+ + P
Sbjct: 338 DVGGP-GLCGVTLDIAYP 354
>gi|255557851|ref|XP_002519955.1| cysteine protease, putative [Ricinus communis]
gi|223541001|gb|EEF42559.1| cysteine protease, putative [Ricinus communis]
Length = 321
Score = 271 bits (693), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 148/315 (46%), Positives = 190/315 (60%), Gaps = 28/315 (8%)
Query: 37 HLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT-SYWLGLNE 95
L + D L+E E WM++HG+TY+ EEK RF+IFK NL++ID NK +Y LGLN
Sbjct: 28 QLINEDALVEKHEQWMARHGRTYQDSEEKERRFQIFKSNLEYIDNFNKASNQTYQLGLNN 87
Query: 96 FADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCG 155
FAD+SHEE+ Y R+ P +P+S+DWR GAVTP+KNQ CG
Sbjct: 88 FADLSHEEYVATYTA-------RKMPVE---------VPESIDWRDHGAVTPIKNQYQCG 131
Query: 156 SCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLH 215
CWAFS AAVEGI N SLS Q+L+DC S N GC GG M+ AF YI+ + G+
Sbjct: 132 CCWAFSAAAAVEGI----VANGVSLSAQQLLDC-VSDNQGCKGGWMNNAFNYIIQNQGIA 186
Query: 216 KEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEA-SGTD 274
E DYPY + C + M ISG++DV DE++L++A+A QPVSV I+A S +
Sbjct: 187 LETDYPYQQMQQMCSSR---MAAAQISGFEDVTPKDEEALMRAVAKQPVSVTIDATSNPN 243
Query: 275 FQFYSGGVFTGP-CGAELDHGVAAVGYGKSK-GSDYIIVKNSWGPKWGERGYIRMKRNTG 332
F+ Y GVFT CG H V VGYG S+ G+ Y + KNSWG WGE GY+R++R+ G
Sbjct: 244 FKLYKEGVFTAAGCGNGHSHAVTLVGYGTSEDGTKYWLAKNSWGETWGESGYMRLQRDIG 303
Query: 333 KPEGLCGINKMASIP 347
G CGI AS P
Sbjct: 304 LEGGPCGIALYASYP 318
>gi|115478933|ref|NP_001063060.1| Os09g0381400 [Oryza sativa Japonica Group]
gi|113631293|dbj|BAF24974.1| Os09g0381400 [Oryza sativa Japonica Group]
gi|215678649|dbj|BAG92304.1| unnamed protein product [Oryza sativa Japonica Group]
gi|218202075|gb|EEC84502.1| hypothetical protein OsI_31193 [Oryza sativa Indica Group]
Length = 362
Score = 271 bits (693), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 145/318 (45%), Positives = 187/318 (58%), Gaps = 20/318 (6%)
Query: 44 LIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRN-KEVTSYWLGLNEFADMSHE 102
+++ F +W H ++Y EE L RF++++ N + ID N + +Y L NEFAD++ E
Sbjct: 47 MMDRFRAWQGAHNRSYPSAEEALQRFDVYRRNAEFIDAVNLRGDLTYQLAENEFADLTEE 106
Query: 103 EFKNKYLGLKP-QFPTRRQP--------SAEFSYR-DVKALPKSVDWRKKGAVTPVKNQG 152
EF Y G P A FSYR DV P SVDWR +GAV P K+Q
Sbjct: 107 EFLATYTGYYAGDGPVDDSVITTGAGDVDASFSYRVDV---PASVDWRAQGAVVPPKSQT 163
Query: 153 S-CGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVAS 211
S C SCWAF T A +E +N I +G L SLSEQ+L+DCD S++ GCN G A+K++V +
Sbjct: 164 STCSSCWAFVTAATIESLNMIKTGKLVSLSEQQLVDCD-SYDGGCNLGSYGRAYKWVVEN 222
Query: 212 GGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEAS 271
GGL E DYPY G C K I+G+ VP +E +L A+A QPV+VAIE
Sbjct: 223 GGLTTEADYPYTARRGPCNRAKSAHHAAKITGFGKVPPRNEAALQAAVARQPVAVAIEV- 281
Query: 272 GTDFQFYSGGVFTGPCGAELDHGVAAVGYGK--SKGSDYIIVKNSWGPKWGERGYIRMKR 329
G+ QFY GGV+TGPCG L H V VGYG S G+ Y +KNSWG WGERGYIR+ R
Sbjct: 282 GSGMQFYKGGVYTGPCGTRLAHAVTVVGYGTDASSGAKYWTIKNSWGQSWGERGYIRILR 341
Query: 330 NTGKPEGLCGINKMASIP 347
+ G P GLCG+ + P
Sbjct: 342 DVGGP-GLCGVTLDIAYP 358
>gi|218202077|gb|EEC84504.1| hypothetical protein OsI_31195 [Oryza sativa Indica Group]
Length = 362
Score = 271 bits (693), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 145/318 (45%), Positives = 187/318 (58%), Gaps = 20/318 (6%)
Query: 44 LIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRN-KEVTSYWLGLNEFADMSHE 102
+++ F +W H ++Y EE L RF++++ N + ID N + +Y L NEFAD++ E
Sbjct: 47 MMDRFRAWQGAHNRSYPSAEEALQRFDVYRRNAEFIDAVNLRGDLTYRLAENEFADLTEE 106
Query: 103 EFKNKYLGLKP-QFPTRRQP--------SAEFSYR-DVKALPKSVDWRKKGAVTPVKNQG 152
EF Y G P A FSYR DV P SVDWR +GAV P K+Q
Sbjct: 107 EFLATYTGYYAGDGPVDDSVITTGAGDVDASFSYRVDV---PASVDWRAQGAVVPPKSQT 163
Query: 153 S-CGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVAS 211
S C SCWAF T A +E +N I +G L SLSEQ+L+DCD S++ GCN G A+K++V +
Sbjct: 164 STCSSCWAFVTAATIESLNMIKTGKLVSLSEQQLVDCD-SYDGGCNLGSYGRAYKWVVEN 222
Query: 212 GGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEAS 271
GGL E DYPY G C K I+G+ VP +E +L A+A QPV+VAIE
Sbjct: 223 GGLTTEADYPYTARRGPCNRAKSAHHAAKITGFGKVPPRNEAALQAAVARQPVAVAIEV- 281
Query: 272 GTDFQFYSGGVFTGPCGAELDHGVAAVGYGK--SKGSDYIIVKNSWGPKWGERGYIRMKR 329
G+ QFY GGV+TGPCG L H V VGYG S G+ Y +KNSWG WGERGYIR+ R
Sbjct: 282 GSGMQFYKGGVYTGPCGTRLAHAVTVVGYGTDASSGAKYWTIKNSWGQSWGERGYIRILR 341
Query: 330 NTGKPEGLCGINKMASIP 347
+ G P GLCG+ + P
Sbjct: 342 DVGGP-GLCGVTLDIAYP 358
>gi|125606204|gb|EAZ45240.1| hypothetical protein OsJ_29883 [Oryza sativa Japonica Group]
Length = 350
Score = 271 bits (693), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 148/334 (44%), Positives = 198/334 (59%), Gaps = 28/334 (8%)
Query: 38 LTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFA 97
L D +++ FE WM +HG+ Y EK RFE+++ N++ ++ N Y L N+FA
Sbjct: 21 LARADLMLDRFEQWMIRHGRAYTDAGEKQRRFEVYRRNVELVETFNSMSNGYKLADNKFA 80
Query: 98 DMSHEEFKNKYLGLKPQFPTRR---QPSAEF-----SYRDVKALPKSVDWRKKGAVTPVK 149
D+++EEF+ K LG +P + SA+ S D+ LPKSVDWR KGAV +
Sbjct: 81 DLTNEEFRAKMLGFRPHVTIPQISNTCSADIAMPGESSDDI--LPKSVDWRNKGAV--IN 136
Query: 150 NQGSC---GSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFK 206
C GSCWAFS VAA+EGINQI +G L SLSEQEL+DCD GC GG M +AF+
Sbjct: 137 RWKICVDAGSCWAFSAVAAIEGINQIKNGELVSLSEQELVDCDDE-AVGCGGGYMSWAFE 195
Query: 207 YIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSV 266
++V + GL E YPY G C+ K V I+GY++V + E L +A A QPVSV
Sbjct: 196 FVVGNHGLTTEASYPYHAANGACQAAKLNQSAVAIAGYRNVTPSSEPDLARAAAAQPVSV 255
Query: 267 AIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSK-----------GSDYIIVKNSW 315
A++ FQ Y GV+TGPC A+++HGV VGYG+S+ G Y IVKNSW
Sbjct: 256 AVDGGSFMFQLYGSGVYTGPCTADVNHGVTVVGYGESEPKTDGGGAAKGGEKYWIVKNSW 315
Query: 316 GPKWGERGYIRMKRNT-GKPEGLCGINKMASIPL 348
G +WG+ GYI M+R+ G GLCGI + S P+
Sbjct: 316 GAEWGDAGYILMQRDVAGLASGLCGIALLPSYPV 349
>gi|357507617|ref|XP_003624097.1| Cysteine protease [Medicago truncatula]
gi|355499112|gb|AES80315.1| Cysteine protease [Medicago truncatula]
Length = 340
Score = 271 bits (693), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 148/307 (48%), Positives = 195/307 (63%), Gaps = 10/307 (3%)
Query: 44 LIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT-SYWLGLNEFADMSHE 102
L E ++ W K+ YK E+ +IFK N+ +ID N SY L +N FAD+ E
Sbjct: 35 LSERYKHWKIKYRVIYKDDAEEEKHIQIFKHNVAYIDSFNAAGNKSYKLTINRFADLPTE 94
Query: 103 EFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFST 162
+ + K + PT S+ F Y+++ +P +VDWRK+GAVTPVKNQ CGSCWAFS
Sbjct: 95 PSDDGFKKRKLE-PT---TSSLFKYKNITDIPAAVDWRKRGAVTPVKNQRECGSCWAFSA 150
Query: 163 VAAVEGINQIVSGNLTSLSEQELID-CDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYP 221
V A+EGI QI SGNL SLSEQEL+D +++ NGCNGG + AF++++ +GG+ E YP
Sbjct: 151 VGALEGIQQITSGNLVSLSEQELVDRVRSNWTNGCNGGYLIDAFEFVLENGGIATEASYP 210
Query: 222 YLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGG 281
Y +G + K+ V I Y+ VP N E SLLK +A+QPVSV I+ SG +FYS G
Sbjct: 211 YRGVKGN--NSKKVSRQVQIKSYEQVPRNSEDSLLKVVANQPVSVGIDISGM-IRFYSSG 267
Query: 282 VFTGPCGAELDHGVAAVGYGKSK-GSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGI 340
+FTG CG + +H V VGYG S G+ Y +VKNSWG +WGE+ YIRMKR+ EGLCGI
Sbjct: 268 IFTGECGTKPNHAVIIVGYGTSNDGTKYWLVKNSWGIRWGEKRYIRMKRDIDAKEGLCGI 327
Query: 341 NKMASIP 347
AS P
Sbjct: 328 PMDASYP 334
>gi|424513619|emb|CCO66241.1| predicted protein [Bathycoccus prasinos]
Length = 396
Score = 271 bits (692), Expect = 5e-70, Method: Compositional matrix adjust.
Identities = 146/330 (44%), Positives = 200/330 (60%), Gaps = 26/330 (7%)
Query: 43 KLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT----SYWLGLNEFAD 98
K+ + F++W+ K+ K EE+L R +IF EN + + N + S+++ +N+FA
Sbjct: 67 KIEDAFDAWLVKYDKEIANAEERLKRLKIFGENYLFVLEHNAKYVAGKVSHYVEMNKFAA 126
Query: 99 MSHEEFKNKYLGLKPQFPTRRQPSAE-------FSYRDVKALPKSVDWRKKGAVTPVKNQ 151
+ EE++ K LG K R++ S E + Y V+A P+S+DW +G +T KNQ
Sbjct: 127 HTREEYR-KMLGFKKSL-RRKKDSGEAAKDVSLWEYEGVEA-PESIDWVDEGVITTPKNQ 183
Query: 152 GSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIVA 210
GSCGSCWAFS + AVEGIN I +G L SLSEQEL+ C N GCNGGLMD AF++IV
Sbjct: 184 GSCGSCWAFSAIGAVEGINAIRTGKLVSLSEQELVSCAREGGNQGCNGGLMDNAFEWIVE 243
Query: 211 SGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEA 270
+GG+ E+ Y Y C+ +K + + +I G+ DVP NDE +L KA++ QPVSVAIEA
Sbjct: 244 NGGVDSEKQYQYKASFDDCKTRKTLLHIASIDGFNDVPSNDETALKKAVSQQPVSVAIEA 303
Query: 271 SGTDFQFYSGGVFTG-PCGAELDHGVAAVGYGKSKGSDYII----------VKNSWGPKW 319
FQ Y GGV+ CG +LDHGV VGYG S +I +KNSW +W
Sbjct: 304 DQRSFQLYGGGVYHAEDCGTQLDHGVLVVGYGIDHNSSNVIIPGATKKYWKIKNSWSEQW 363
Query: 320 GERGYIRMKRNTGKPEGLCGINKMASIPLK 349
GE GYIR+ R+ P G+CG+ +MAS P K
Sbjct: 364 GEGGYIRIARDVESPSGMCGVAEMASYPEK 393
>gi|113120267|gb|ABI30273.1| VXH-B, partial [Vasconcellea x heilbornii]
Length = 266
Score = 271 bits (692), Expect = 5e-70, Method: Compositional matrix adjust.
Identities = 136/264 (51%), Positives = 189/264 (71%), Gaps = 5/264 (1%)
Query: 5 SHSKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEE 64
S SKL +++ LS+ S DFSI GYSP+ LTS +KLI LF+SWM ++GK YK I+E
Sbjct: 6 SFSKLFFVAICLSVRMGLSYG-DFSIGGYSPDDLTSTEKLINLFDSWMVEYGKVYKDIDE 64
Query: 65 KLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLG-LKPQFPTRRQPSA 123
K+++FEIFK+NLK+ID+ NK+ +YWLGL F D++++EFK KY+G + + T + +
Sbjct: 65 KIYKFEIFKDNLKYIDETNKKNNTYWLGLTSFTDLTNDEFKEKYVGSISESWSTTEESND 124
Query: 124 E-FSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSE 182
E F Y DV +P S+DWR+KGAVTPV++QGSCGSCW FS+VAAVEGIN+IV+G L SLSE
Sbjct: 125 EGFIYDDVVNIPASIDWRQKGAVTPVRHQGSCGSCWTFSSVAAVEGINKIVTGRLVSLSE 184
Query: 183 QELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTIS 242
QEL+DC+ + GC GG YA +Y VA G+H ++YPY + C ++ + V
Sbjct: 185 QELLDCERR-SYGCRGGFPPYALQY-VAQNGIHLRQNYPYEGVQRQCRARQVQGPKVKTD 242
Query: 243 GYQDVPENDEQSLLKALAHQPVSV 266
G VP N+E++L++A+A+QPVSV
Sbjct: 243 GVGRVPRNNERALIQAIANQPVSV 266
>gi|18396939|ref|NP_564320.1| Papain family cysteine protease [Arabidopsis thaliana]
gi|9502427|gb|AAF88126.1|AC021043_19 Putative cysteine proteinase [Arabidopsis thaliana]
gi|67633400|gb|AAY78625.1| peptidase C1A papain family protein [Arabidopsis thaliana]
gi|332192919|gb|AEE31040.1| Papain family cysteine protease [Arabidopsis thaliana]
Length = 346
Score = 270 bits (691), Expect = 6e-70, Method: Compositional matrix adjust.
Identities = 138/315 (43%), Positives = 194/315 (61%), Gaps = 14/315 (4%)
Query: 44 LIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQ-RNKEVTSYWLGLNEFADMSHE 102
+++ + WM + + Y EK R ++ ENLK I+ N SY LG+NEF D + E
Sbjct: 35 IVDYHQQWMIQFSRVYDDEFEKQLRLQVLTENLKFIESFNNMGNQSYKLGVNEFTDWTKE 94
Query: 103 EFKNKYLGLKP-------QFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCG 155
EF Y GL+ + +P+ ++ DV L + DWR +GAVTPVK+QG CG
Sbjct: 95 EFLATYTGLRGVNVTSPFEVVNETKPAWNWTVSDV--LGTNKDWRNEGAVTPVKSQGECG 152
Query: 156 SCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLH 215
CWAFS +AAVEG+ +I GNL SLSEQ+L+DC NNGC GG AF YI+ G+
Sbjct: 153 GCWAFSAIAAVEGLTKIARGNLISLSEQQLLDCTREQNNGCKGGTFVNAFNYIIKHRGIS 212
Query: 216 KEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDF 275
E +YPY ++EG C + + I G+++VP N+E++LL+A++ QPV+VAI+AS F
Sbjct: 213 SENEYPYQVKEGPC--RSNARPAILIRGFENVPSNNERALLEAVSRQPVAVAIDASEAGF 270
Query: 276 QFYSGGVFTGP-CGAELDHGVAAVGYGKS-KGSDYIIVKNSWGPKWGERGYIRMKRNTGK 333
YSGGV+ CG ++H V VGYG S +G Y + KNSWG WGE GYIR++R+
Sbjct: 271 VHYSGGVYNARNCGTSVNHAVTLVGYGTSPEGMKYWLAKNSWGKTWGENGYIRIRRDVEW 330
Query: 334 PEGLCGINKMASIPL 348
P+G+CG+ + AS P+
Sbjct: 331 PQGMCGVAQYASYPV 345
>gi|242038089|ref|XP_002466439.1| hypothetical protein SORBIDRAFT_01g007820 [Sorghum bicolor]
gi|241920293|gb|EER93437.1| hypothetical protein SORBIDRAFT_01g007820 [Sorghum bicolor]
Length = 353
Score = 270 bits (691), Expect = 6e-70, Method: Compositional matrix adjust.
Identities = 142/312 (45%), Positives = 193/312 (61%), Gaps = 8/312 (2%)
Query: 44 LIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT-SYWLGLNEFADMSHE 102
++ E WM++HG+TY EK R EIF+ N + ID N S+ L N FAD++ E
Sbjct: 43 MVSRHEKWMAEHGRTYTDEAEKARRLEIFRANAEFIDSFNDAGKHSHRLATNRFADLTDE 102
Query: 103 EFKNKYLGLKPQFPTRRQPSAEFSYR----DVKALPKSVDWRKKGAVTPVKNQGSCGSCW 158
EF+ G +P+ + +R + +SVDWR GAVT VK+QG CG CW
Sbjct: 103 EFRAARTGFRPRPAPAAAAGSGGRFRYENFSLADAAQSVDWRAMGAVTGVKDQGECGCCW 162
Query: 159 AFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKE 217
AFS VAAVEG+N+I +G L SLSEQEL+DCD + + GC GGLMD AF++I GGL E
Sbjct: 163 AFSAVAAVEGLNKIRTGRLVSLSEQELVDCDVNGEDQGCEGGLMDDAFQFIERRGGLASE 222
Query: 218 EDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQF 277
YPY ++G+C +I G++DVP N+E +L A+A+QPVSVAI F+F
Sbjct: 223 SGYPYQGDDGSCRSSAAAARAASIRGHEDVPRNNEAALAAAVANQPVSVAINGEDYAFRF 282
Query: 278 YSGGVFTGPCGAELDHGVAAVGYG-KSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEG 336
Y GV G CG +L+H + AVGYG + GS Y ++KNSWG WGE GY+R++R + EG
Sbjct: 283 YDSGVLGGECGTDLNHAITAVGYGTAADGSKYWLMKNSWGTSWGEGGYVRIRRGV-RGEG 341
Query: 337 LCGINKMASIPL 348
+CG+ K+ S P+
Sbjct: 342 VCGLAKLPSYPV 353
>gi|157093355|gb|ABV22332.1| cysteine protease 1 [Noctiluca scintillans]
Length = 338
Score = 270 bits (690), Expect = 7e-70, Method: Compositional matrix adjust.
Identities = 137/296 (46%), Positives = 178/296 (60%), Gaps = 5/296 (1%)
Query: 47 LFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKN 106
+F ++ +K+GK Y I E RF IFK N+ I N ++ LG+NEF D++ EE
Sbjct: 26 MFNNFKTKYGKVYNGINEDAVRFGIFKANVDIIYATNARNLTFALGVNEFTDLTQEELAA 85
Query: 107 KYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAV 166
Y GLKP P + L SVDW +G VTPVKNQG CGSCW+FST A+
Sbjct: 86 SYTGLKPASLWSGLPRLSTHEYNGAPLASSVDWTTQGVVTPVKNQGQCGSCWSFSTTGAL 145
Query: 167 EGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEE 226
EG + +GNL SLSEQ+ +DCDT+ ++GCNGG MD AF + + E YPY +
Sbjct: 146 EGAWALSTGNLVSLSEQQFVDCDTT-DSGCNGGWMDNAFSF-AKKNSICTEGSYPYTATD 203
Query: 227 GTCEDKKEEMEVVT--ISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFT 284
GTC ++ + + GY DV + EQ+++ A+A QPVS+AIEA FQ YS GV T
Sbjct: 204 GTCNLSGCQVGIPQGGVVGYTDVSTDSEQAMMSAVAQQPVSIAIEADQYSFQLYSSGVLT 263
Query: 285 GPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGI 340
CG LDHGV AVGYG G+DY VKNSWG WGE+GY+R++R G G CG+
Sbjct: 264 ASCGTRLDHGVLAVGYGSEAGTDYWKVKNSWGSSWGEQGYVRLQRGKGG-AGECGL 318
>gi|125564712|gb|EAZ10092.1| hypothetical protein OsI_32402 [Oryza sativa Indica Group]
Length = 382
Score = 270 bits (690), Expect = 7e-70, Method: Compositional matrix adjust.
Identities = 149/379 (39%), Positives = 207/379 (54%), Gaps = 34/379 (8%)
Query: 2 AFFSHSKLLLLSLSLSLFACSS--LAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTY 59
+FFS LL+L L + CSS S + + + ++E+F+ W +++ ++Y
Sbjct: 5 SFFSMPCLLIL-LGVFFIGCSSGTARRVTSDTAANTDGEPAATTMMEMFQRWKAEYNRSY 63
Query: 60 KCIEEKLHRFEIFKENLKHIDQRNKEV-TSYWLGLNEFADMSHEEFKNKYLG-------- 110
EE+ R ++ N+++I+ N +Y LG + D++++EF Y
Sbjct: 64 ATPEEERRRLRVYARNVRYIEATNAAAGLAYELGETAYTDLTNDEFMAMYTAPPLRSAAD 123
Query: 111 -----------LKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWA 159
P E + + P SVDWR GAVT VK+QG CGSCWA
Sbjct: 124 DDDDAATTTIITTRAGPVDEHQQPEVYFNESAGAPASVDWRASGAVTEVKDQGRCGSCWA 183
Query: 160 FSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEED 219
FSTVA VEGI +I G L SLSEQEL+DCDT ++GC+GG+ A ++I A+GG+ +D
Sbjct: 184 FSTVAVVEGIQKIKKGKLVSLSEQELVDCDT-LDSGCDGGVSYRALEWITANGGITTRDD 242
Query: 220 YPYL-MEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFY 278
YPY C+ K TI+G + V E SL A A QPV+V+IEA G +FQ Y
Sbjct: 243 YPYTGAAAAACDRAKLGHHAATIAGLRRVATRSEASLQNAAAAQPVAVSIEAGGDNFQHY 302
Query: 279 SGGVFTGPCGAELDHGVAAVGYGK--------SKGSDYIIVKNSWGPKWGERGYIRMKRN 330
GV+ GPCG L+HGV VGYG+ + G Y I+KNSWG WG++GYI+MK++
Sbjct: 303 RKGVYDGPCGTRLNHGVTVVGYGQEEAPVDGSAAGDKYWIIKNSWGKNWGDQGYIKMKKD 362
Query: 331 T-GKPEGLCGINKMASIPL 348
GKPEGLCGI S PL
Sbjct: 363 VAGKPEGLCGIAIRPSFPL 381
>gi|18202415|sp|P82474.1|CPGP2_ZINOF RecName: Full=Zingipain-2; AltName: Full=Cysteine proteinase GP-II
gi|6137410|pdb|1CQD|A Chain A, The 2.1 Angstrom Structure Of A Cysteine Protease With
Proline Specificity From Ginger Rhizome, Zingiber
Officinale
gi|6137411|pdb|1CQD|B Chain B, The 2.1 Angstrom Structure Of A Cysteine Protease With
Proline Specificity From Ginger Rhizome, Zingiber
Officinale
gi|6137412|pdb|1CQD|C Chain C, The 2.1 Angstrom Structure Of A Cysteine Protease With
Proline Specificity From Ginger Rhizome, Zingiber
Officinale
gi|6137413|pdb|1CQD|D Chain D, The 2.1 Angstrom Structure Of A Cysteine Protease With
Proline Specificity From Ginger Rhizome, Zingiber
Officinale
Length = 221
Score = 270 bits (690), Expect = 7e-70, Method: Compositional matrix adjust.
Identities = 126/218 (57%), Positives = 161/218 (73%), Gaps = 2/218 (0%)
Query: 133 LPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSF 192
LP S+DWR+ GAV PVKNQG CGSCWAFSTVAAVEGINQIV+G+L SLSEQ+L+DC T+
Sbjct: 3 LPDSIDWRENGAVVPVKNQGGCGSCWAFSTVAAVEGINQIVTGDLISLSEQQLVDC-TTA 61
Query: 193 NNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDE 252
N+GC GG M+ AF++IV +GG++ EE YPY ++G C + VV+I Y++VP ++E
Sbjct: 62 NHGCRGGWMNPAFQFIVNNGGINSEETYPYRGQDGIC-NSTVNAPVVSIDSYENVPSHNE 120
Query: 253 QSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVK 312
QSL KA+A+QPVSV ++A+G DFQ Y G+FTG C +H + VGYG D+ IVK
Sbjct: 121 QSLQKAVANQPVSVTMDAAGRDFQLYRSGIFTGSCNISANHALTVVGYGTENDKDFWIVK 180
Query: 313 NSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLKK 350
NSWG WGE GYIR +RN P+G CGI + AS P+KK
Sbjct: 181 NSWGKNWGESGYIRAERNIENPDGKCGITRFASYPVKK 218
>gi|167526493|ref|XP_001747580.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163774026|gb|EDQ87660.1| predicted protein [Monosiga brevicollis MX1]
Length = 330
Score = 270 bits (690), Expect = 8e-70, Method: Compositional matrix adjust.
Identities = 139/283 (49%), Positives = 182/283 (64%), Gaps = 11/283 (3%)
Query: 69 FEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYR 128
F NL+ I+ N +S+ +G+ +FAD++ EF + Y+ P TR + +
Sbjct: 48 FRCHLANLRVIEAHNAGNSSFTMGITQFADLTAAEF-SAYVKRFPMNVTRPRNEVWIT-- 104
Query: 129 DVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDC 188
+A + VDWR+K AVT +KNQG CGSCW+FST +VEG + I +G L SLSEQ+L+DC
Sbjct: 105 --EAPLQEVDWRQKNAVTEIKNQGQCGSCWSFSTTGSVEGAHAIATGKLVSLSEQQLMDC 162
Query: 189 DTSF-NNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDV 247
T + N+GCNGGLMDYAF+Y++A+GGL EEDYPY E+G C +KE+ I G+++V
Sbjct: 163 STRYGNHGCNGGLMDYAFEYVIANGGLDTEEDYPYTAEDGKCNTEKEKKHAAEIHGFRNV 222
Query: 248 PENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSD 307
P+ E L A++ PVSVAIEA FQ Y+ GVF G CG LDHGV VGY D
Sbjct: 223 PKEHEDQLAAAVSIGPVSVAIEADQAGFQHYTSGVFDGKCGTSLDHGVLVVGY----SDD 278
Query: 308 YIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLKK 350
Y IVKNSWG WGE GYIR+KR K +G+CGI AS P K+
Sbjct: 279 YWIVKNSWGKSWGEEGYIRLKRGVDK-KGMCGITMQASYPEKR 320
>gi|356517368|ref|XP_003527359.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 332
Score = 270 bits (689), Expect = 9e-70, Method: Compositional matrix adjust.
Identities = 150/310 (48%), Positives = 195/310 (62%), Gaps = 20/310 (6%)
Query: 44 LIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTS-YWLGLNEFADMSHE 102
+ E E M+++GK YK ++ FKEN+ +I+ N Y G+N+FA
Sbjct: 35 MXERHEQRMTRYGKVYKDPPKRX-----FKENVNYIEACNNAANKPYKRGINQFAP---- 85
Query: 103 EFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFST 162
+N++ G R F + +V A P +VD R+KGAVTP+K+QG CG CWAFS
Sbjct: 86 --RNRFKGHMCSSIIR---ITTFKFENVTATPSTVDCRQKGAVTPIKDQGQCGCCWAFSA 140
Query: 163 VAAVEGINQIVSGNLTSLSEQELIDCDT-SFNNGCNGGLMDYAFKYIVASGGLHKEEDYP 221
VAA EGI+ + +G L SLSEQEL+DCDT + GC GGLMD AFK+I+ + GL P
Sbjct: 141 VAATEGIHALSAGKLISLSEQELVDCDTKGVDXGCEGGLMDDAFKFIIQNHGLKHXSQLP 200
Query: 222 -YLMEEGTCEDKKEEMEVVT-ISGYQDVPENDEQS-LLKALAHQPVSVAIEASGTDFQFY 278
Y+ +G C + T I+GY+DVP N+E++ L KA+A+ PVS AI+ASG+DFQFY
Sbjct: 201 LYMGVDGKCNANEAAKNAATIITGYEDVPANNEKAHLQKAVANNPVSEAIDASGSDFQFY 260
Query: 279 SGGVFTGPCGAELDHGVAAVGYGKS-KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGL 337
GVFTG CG ELDHGV AVGYG S G++Y +VKNSWG +WGE GYIRM+R E L
Sbjct: 261 KSGVFTGSCGTELDHGVTAVGYGVSDDGTEYWLVKNSWGTEWGEEGYIRMQRGVDSEEAL 320
Query: 338 CGINKMASIP 347
CGI AS P
Sbjct: 321 CGIAVQASYP 330
>gi|18407678|ref|NP_566867.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|30315950|sp|Q9LXW3.1|CPR2_ARATH RecName: Full=Probable cysteine proteinase At3g43960; Flags:
Precursor
gi|7594557|emb|CAB88124.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|26452289|dbj|BAC43231.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|332644328|gb|AEE77849.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 376
Score = 270 bits (689), Expect = 9e-70, Method: Compositional matrix adjust.
Identities = 145/315 (46%), Positives = 203/315 (64%), Gaps = 11/315 (3%)
Query: 43 KLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT-SYWLGLNEFADMSH 101
+++ ++E W+ ++GK Y + EK RF+IFK+NLK I++ N + SY GLN+F+D++
Sbjct: 36 EVLTMYEQWLVENGKNYNGLGEKERRFKIFKDNLKRIEEHNSDPNRSYERGLNKFSDLTA 95
Query: 102 EEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTP-VKNQGSCGSCWAF 160
+EF+ YLG K + + + + Y++ LP VDWR++GAV P VK QG CGSCWAF
Sbjct: 96 DEFQASYLGGKMEKKSLSDVAERYQYKEGDVLPDEVDWRERGAVVPRVKRQGECGSCWAF 155
Query: 161 STVAAVEGINQIVSGNLTSLSEQELIDCDTSFNN-GCNGGLMDYAFKYIVASGGLHKEED 219
+ AVEGINQI +G L SLSEQELIDCD +N GC GG +AF++I +GG+ +E
Sbjct: 156 AATGAVEGINQITTGELVSLSEQELIDCDRGNDNFGCAGGGAVWAFEFIKENGGIVSDEV 215
Query: 220 YPYLMEEGTCEDKKEEME---VVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQ 276
Y Y E+ T K EM+ VVTI+G++ VP NDE SL KA+A+QP+SV I A+ +
Sbjct: 216 YGYTGED-TAACKAIEMKTTRVVTINGHEVVPVNDEMSLKKAVAYQPISVMISAA--NMS 272
Query: 277 FYSGGVFTGPCGAEL-DHGVAAVGYGKSKG-SDYIIVKNSWGPKWGERGYIRMKRNTGKP 334
Y GV+ G C DH V VGYG S DY +++NSWGP+WGE GY+R++RN +P
Sbjct: 273 DYKSGVYKGACSNLWGDHNVLIVGYGTSSDEGDYWLIRNSWGPEWGEGGYLRLQRNFHEP 332
Query: 335 EGLCGINKMASIPLK 349
G C + P+K
Sbjct: 333 TGKCAVAVAPVYPIK 347
>gi|81542|pir||S02728 actinidain (EC 3.4.22.14) precursor (clone pAC.1) - kiwi fruit
(fragment)
gi|15957|emb|CAA31435.1| actinidin precursor [Actinidia chinensis]
gi|166319|gb|AAA32630.1| actinidin precursor [Actinidia deliciosa]
gi|226542|prf||1601514A actinidin
Length = 302
Score = 270 bits (689), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 134/276 (48%), Positives = 180/276 (65%), Gaps = 4/276 (1%)
Query: 76 LKHIDQRNKEVT-SYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALP 134
L+ ID+ N + SY +GLN+FAD++ EEF++ YLG + + S + R + LP
Sbjct: 1 LRFIDEHNADTNRSYKVGLNQFADLTGEEFRSTYLGFTGG-SNKTKVSNRYEPRVSQVLP 59
Query: 135 KSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNN 194
VDWR GAV +K+QG CG CWAFS +A VEGIN+IV+G L SLSEQELI C + N
Sbjct: 60 SYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSEQELIGCGGTQNT 119
Query: 195 -GCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQ 253
GCNGG + F++I+ +GG++ E+YPY ++G C + + VTI Y +VP N+E
Sbjct: 120 RGCNGGYITDGFQFIINNGGINTGENYPYTAQDGECNLDLQNEKYVTIDTYGNVPYNNEW 179
Query: 254 SLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKN 313
+L A+ +QPVSVA++A+G F+ YS G+FTGPCG +DH V VGYG G DY IV+N
Sbjct: 180 ALQTAVTYQPVSVALDAAGDAFKHYSSGIFTGPCGTAIDHAVTIVGYGTEGGIDYWIVEN 239
Query: 314 SWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
SW WGE GY+R+ RN G G CGI M S P+K
Sbjct: 240 SWDTTWGEEGYMRILRNVGG-AGTCGIATMPSYPVK 274
>gi|388890776|gb|AFK80364.1| cysteine proteinase 3, partial [Acanthamoeba castellanii]
Length = 329
Score = 270 bits (689), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 140/312 (44%), Positives = 188/312 (60%), Gaps = 8/312 (2%)
Query: 39 TSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFAD 98
T+ D L +F WM + K+Y EE + R+ +++EN + I++ N+ + +L +N+F D
Sbjct: 21 TTHDPLTGVFAEWMRDNSKSYS-NEEFVFRWNVWRENQQLIEEHNRSNKTSFLAMNKFGD 79
Query: 99 MSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCW 158
+++ EF + GL + +A L DWR+KGAVT VKNQG CGSCW
Sbjct: 80 LTNAEFNKLFKGLAFDYSFHANKAAAEKAVPAPGLSADFDWRQKGAVTHVKNQGQCGSCW 139
Query: 159 AFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKE 217
+FST + EG N + +G LTSLSEQ LIDC S+ NNGCNGGLMDYAF+YI+ + G+ E
Sbjct: 140 SFSTTGSTEGANFLKTGRLTSLSEQNLIDCSGSYGNNGCNGGLMDYAFEYIINNKGIDTE 199
Query: 218 EDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQF 277
YPY + TC+ +++ Y DV DE +LL A+A +P SVAI+AS FQF
Sbjct: 200 ASYPYQTAQYTCQYNPAN-SGGSLTSYTDVSSGDENALLNAVATEPTSVAIDASHNSFQF 258
Query: 278 YSGGVF--TGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPE 335
YSGGV+ + +LDHGV AVG+G G DY +VKNSWG WG GYI+M RN
Sbjct: 259 YSGGVYYESACSSTQLDHGVLAVGWGTEDGQDYWLVKNSWGADWGLAGYIKMARNRSNN- 317
Query: 336 GLCGINKMASIP 347
CGI AS P
Sbjct: 318 --CGIATSASYP 327
>gi|159485468|ref|XP_001700766.1| cysteine endopeptidase [Chlamydomonas reinhardtii]
gi|158281265|gb|EDP07020.1| cysteine endopeptidase [Chlamydomonas reinhardtii]
Length = 498
Score = 270 bits (689), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 161/343 (46%), Positives = 208/343 (60%), Gaps = 28/343 (8%)
Query: 30 IVGYSPEH--LTSMDKLI-------ELFESWMSKHGKTY-KCIEEKLHRFEIFKENLKHI 79
+VG S H L+S D L F W ++H +TY + E R +F +N++ I
Sbjct: 13 LVGLSCAHALLSSADMLALAQVEPERAFGLWATQHARTYSEGSPEYTRRLGVFADNVRAI 72
Query: 80 DQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLK---PQFPTRRQ-----PSAEFSYRDVK 131
++N+ T L LNE+AD + EEF K LGLK Q R S+ + Y V+
Sbjct: 73 AEQNRRNTGITLALNEYADETWEEFAAKRLGLKISQEQLKAREARSSSSSSSSWRYAQVQ 132
Query: 132 ALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTS 191
P +VDWR K AVT VKNQG CGSCWAFS V ++EG N + +G L +LSEQ+L+DCDT+
Sbjct: 133 T-PAAVDWRAKNAVTQVKNQGQCGSCWAFSAVGSIEGANALATGQLVALSEQQLVDCDTA 191
Query: 192 FNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEG---TCEDKKE-EMEVVTISGYQDV 247
N GC+GGLMD AFKY++ +GG+ EEDY Y G C +K+ + V+I GY+DV
Sbjct: 192 SNMGCSGGLMDDAFKYVLDNGGIDTEEDYSYWSGYGFGFWCNKRKQTDRPAVSIDGYEDV 251
Query: 248 PENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKS-KGS 306
P + E +LLKA+A QPV+VAI AS + QFYS GV C L+HGV AVGY S K
Sbjct: 252 PTS-EPALLKAVAGQPVAVAICAS-ANMQFYSSGVINSCCEG-LNHGVLAVGYDTSDKAQ 308
Query: 307 DYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
Y IVKNSWG WGE+GY R+K G P+GLCGI AS +K
Sbjct: 309 PYWIVKNSWGGSWGEQGYFRLKMGEG-PKGLCGIASAASYAVK 350
>gi|21593501|gb|AAM65468.1| cysteine proteinase [Arabidopsis thaliana]
Length = 376
Score = 270 bits (689), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 145/314 (46%), Positives = 202/314 (64%), Gaps = 11/314 (3%)
Query: 44 LIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT-SYWLGLNEFADMSHE 102
++ ++E W+ ++GK Y + EK RF+IFK+NLK I++ N + SY GLN+F+D++ +
Sbjct: 37 VLTMYEQWLVENGKNYNGLGEKERRFKIFKDNLKRIEEHNSDPNRSYERGLNKFSDLTAD 96
Query: 103 EFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTP-VKNQGSCGSCWAFS 161
EF+ YLG K + + + + Y++ LP VDWR++GAV P VK QG CGSCWAF+
Sbjct: 97 EFQASYLGGKMEKKSLSDVAERYQYKEGDVLPDEVDWRERGAVVPRVKRQGECGSCWAFA 156
Query: 162 TVAAVEGINQIVSGNLTSLSEQELIDCDTSFNN-GCNGGLMDYAFKYIVASGGLHKEEDY 220
AVEGINQI +G L SLSEQELIDCD +N GC GG +AF++I +GG+ +E Y
Sbjct: 157 ATGAVEGINQITTGELVSLSEQELIDCDRGNDNFGCAGGGAVWAFEFIKENGGIVSDEVY 216
Query: 221 PYLMEEGTCEDKKEEME---VVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQF 277
Y E+ T K EM+ VVTI+G++ VP NDE SL KA+A+QP+SV I A+ +
Sbjct: 217 GYTGED-TAACKAIEMKTTRVVTINGHEVVPVNDEMSLKKAVAYQPISVMISAA--NMSD 273
Query: 278 YSGGVFTGPCGAEL-DHGVAAVGYGKSKG-SDYIIVKNSWGPKWGERGYIRMKRNTGKPE 335
Y GV+ G C DH V VGYG S DY +++NSWGP+WGE GY+R++RN +P
Sbjct: 274 YKSGVYKGACSNLWGDHNVLIVGYGTSSDEGDYWLIRNSWGPEWGEGGYLRLQRNFHEPT 333
Query: 336 GLCGINKMASIPLK 349
G C + P+K
Sbjct: 334 GKCAVAVAPVYPIK 347
>gi|326502440|dbj|BAJ95283.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 349
Score = 269 bits (688), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 149/322 (46%), Positives = 202/322 (62%), Gaps = 13/322 (4%)
Query: 36 EHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNK-EVTSYWLGLN 94
E +T ++ E WM++HG+TY EEK R E+F+ N K ID N E +++ L N
Sbjct: 32 EAITVDSAMVSRHEKWMAEHGRTYANEEEKARRLEVFRANAKLIDSFNSAEDSTHRLATN 91
Query: 95 EFADMSHEEFKNKYLGLK-PQFPTRRQPSAEFSYR----DVKALPKSVDWRKKGAVTPVK 149
FAD++ EEF+ GL+ P S +R + S+DWR GAVT VK
Sbjct: 92 RFADLTDEEFRAARTGLRRPPAAAAGAGSGAGGFRYENFSLADAAGSMDWRAMGAVTGVK 151
Query: 150 NQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNN-GCNGGLMDYAFKYI 208
+QGSCG CWAFS VAAVEG+ +I +G L SLSEQ+L+DCD ++ GC GGLMD AF+Y+
Sbjct: 152 DQGSCGCCWAFSAVAAVEGLTKIRTGRLVSLSEQQLVDCDVYGDDEGCAGGLMDNAFEYM 211
Query: 209 VASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAI 268
+ GGL E YPY +G+C + +I GY+DVP N+E +L+ A+AHQPVSVAI
Sbjct: 212 INRGGLTTESSYPYRGTDGSC---RRSASAASIRGYEDVPANNEAALMAAVAHQPVSVAI 268
Query: 269 EASGTDFQFYSGGVFTGP-CGAELDHGVAAVGYGK-SKGSDYIIVKNSWGPKWGERGYIR 326
+ F+FY GV G CG EL+H + AVGYG S G+ Y I+KNSWG WGE GY+R
Sbjct: 269 NGGDSVFRFYDSGVLGGSGCGTELNHAITAVGYGTASDGTKYWIMKNSWGGSWGEGGYVR 328
Query: 327 MKRNTGKPEGLCGINKMASIPL 348
++R + EG+CG+ ++AS P+
Sbjct: 329 IRRGV-RGEGVCGLAQLASYPV 349
>gi|157093357|gb|ABV22333.1| cysteine protease 1 [Noctiluca scintillans]
Length = 338
Score = 269 bits (688), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 138/296 (46%), Positives = 178/296 (60%), Gaps = 5/296 (1%)
Query: 47 LFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKN 106
+F ++ +K+GK Y I E RF IFK N+ I N ++ LG+NEF D++ EEF
Sbjct: 26 MFNNFKTKYGKVYNGINEDAVRFGIFKANVDIIYATNARNLTFALGVNEFTDLTQEEFAA 85
Query: 107 KYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAV 166
Y GLKP P + L SVDW +G VTPVKNQG CGSCW+FST A+
Sbjct: 86 SYTGLKPASLWSGLPRLSTHEYNGAPLASSVDWTTQGVVTPVKNQGQCGSCWSFSTTGAL 145
Query: 167 EGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEE 226
EG + +GNL SLSEQ+ DCDT+ ++GCNGG MD AF + + E YPY +
Sbjct: 146 EGAWALSTGNLVSLSEQQFEDCDTT-DSGCNGGWMDNAFSF-AKKNSICTEGSYPYTATD 203
Query: 227 GTCEDKKEEMEVVT--ISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFT 284
GTC ++ + + GY DV + EQ+++ A+A QPVS+AIEA FQ YS GV T
Sbjct: 204 GTCNLSGCQVGIPQGGVVGYTDVSTDSEQAMMSAVAQQPVSIAIEADQYSFQLYSSGVLT 263
Query: 285 GPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGI 340
CG LDHGV AVGYG G+DY VKNSWG WGE+GY+R++R G G CG+
Sbjct: 264 ASCGTRLDHGVLAVGYGSEAGTDYWKVKNSWGSSWGEQGYVRLQRGKGG-AGECGL 318
>gi|302779822|ref|XP_002971686.1| hypothetical protein SELMODRAFT_16221 [Selaginella moellendorffii]
gi|300160818|gb|EFJ27435.1| hypothetical protein SELMODRAFT_16221 [Selaginella moellendorffii]
Length = 214
Score = 269 bits (688), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 125/215 (58%), Positives = 159/215 (73%), Gaps = 2/215 (0%)
Query: 136 SVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNG 195
SVDWRKKG VT +K+QG CG+CWAFS +AAVEG+ + +G L SLSEQEL+DCDT+ N G
Sbjct: 1 SVDWRKKGGVTEIKDQGDCGNCWAFSAIAAVEGLTFLSTGTLVSLSEQELVDCDTTVNQG 60
Query: 196 CNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSL 255
C+GG+MDYAF+Y++ +GG+ + +YPY + G C+ K + TI+G+Q +P E+ L
Sbjct: 61 CDGGMMDYAFQYMIRNGGITSQSNYPYRAQRGACDKDKVKYHAATINGFQAIPPQSEELL 120
Query: 256 LKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGK-SKGSDYIIVKNS 314
L+A+A+QPVSVAIEA G DFQ YS GVFTG CG+ LDHGVA VGYG + G Y +VKNS
Sbjct: 121 LRAVANQPVSVAIEAGGQDFQLYSSGVFTGECGSNLDHGVAIVGYGTDAGGRQYWLVKNS 180
Query: 315 WGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
WG WGE GY+RM+R G G+CGIN AS P K
Sbjct: 181 WGSGWGESGYVRMERQ-GPGAGVCGINLDASYPTK 214
>gi|357115272|ref|XP_003559414.1| PREDICTED: thiol protease SEN102-like [Brachypodium distachyon]
Length = 360
Score = 269 bits (688), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 145/318 (45%), Positives = 190/318 (59%), Gaps = 19/318 (5%)
Query: 49 ESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRN--------KEVTSYWLGLNEFADMS 100
ESWM++HG+TY EEK R EIF+ N + ID N + V S+ L N FAD++
Sbjct: 44 ESWMAEHGRTYADAEEKARRLEIFRANAERIDSFNSKADAAAGESVDSHRLATNRFADLT 103
Query: 101 HEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKA---LPKSVDWRKKGAVTPVKNQGSCGSC 157
EEF+ GL+ F Y + S+DWR GAVT VK+QGSCG C
Sbjct: 104 DEEFRAARTGLRRPAAVAGAVGGGFRYENFSLQADAAGSMDWRAMGAVTGVKDQGSCGCC 163
Query: 158 WAFSTVAAVEGINQIVSGNLTSLSEQELIDCDT-SFNNGCNGGLMDYAFKYIVASGGLHK 216
WAFS VAA+EG+ +I +G L SLSEQ+L+DCD + GC GGLMD AF+YI GGL
Sbjct: 164 WAFSAVAAMEGLTKIRTGRLVSLSEQQLVDCDVYGDDQGCEGGLMDNAFQYISRQGGLAS 223
Query: 217 EEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQ 276
E YPY E+G +I G++DVP N+E +L+ A+AHQPVSVAI F+
Sbjct: 224 ESAYPYSGEDGGSCRSGRAQPAASIRGHEDVPANNEGALMAAVAHQPVSVAINGGDYVFR 283
Query: 277 FYS----GGVFTGPC-GAELDHGVAAVGYGKS-KGSDYIIVKNSWGPKWGERGYIRMKRN 330
FY G G C ELDH + AVGYG + G+ Y ++KNSWG WGE GY+R++R
Sbjct: 284 FYDRGVLGAGGNGGCESTELDHAITAVGYGMAGDGTGYWLMKNSWGSGWGESGYVRIRRG 343
Query: 331 TGKPEGLCGINKMASIPL 348
+ + EG+CG+ K+AS P+
Sbjct: 344 S-RGEGVCGLAKLASYPV 360
>gi|156398078|ref|XP_001638016.1| predicted protein [Nematostella vectensis]
gi|156225133|gb|EDO45953.1| predicted protein [Nematostella vectensis]
Length = 326
Score = 269 bits (687), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 141/303 (46%), Positives = 191/303 (63%), Gaps = 10/303 (3%)
Query: 50 SWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYL 109
+W S HGK+Y + E+ R I+++NL+ I + N E SY + +N D++ +EF+ YL
Sbjct: 29 AWKSYHGKSYSDVHEERTRMAIWQQNLEKIKRHNAEDHSYKMAMNHLGDLTEDEFRYFYL 88
Query: 110 GLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGI 169
G++ + ++ A + +P SVDW +KG VT VKNQG CGSCWAFST +VEG
Sbjct: 89 GVRAHHNSTKRGWATYMPPSNVKIPSSVDWSQKGYVTGVKNQGQCGSCWAFSTTGSVEGQ 148
Query: 170 NQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGT 228
+ +G+L SLSEQ LIDC S+ NNGC GGLMD AF+YI ++GG+ E YPYL ++G+
Sbjct: 149 HFRKTGSLVSLSEQNLIDCSGSYGNNGCQGGLMDNAFRYIESNGGIDTESSYPYLGQQGS 208
Query: 229 CEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAIEASGTDFQFYSGGVFTGP- 286
C + ++GYQD+P+ EQ+L A+A PVSVA++AS +QFYS GV+ P
Sbjct: 209 CHFSSSHVG-ARVTGYQDIPQGSEQALQSAVATVGPVSVAVDAS--QWQFYSSGVYDNPY 265
Query: 287 CGA-ELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMAS 345
C + +LDHGV +GYG G DY +VKNSWG WG GYI M RN CGI AS
Sbjct: 266 CSSTQLDHGVLVIGYGNYNGQDYWLVKNSWGYSWGVEGYIMMSRNKNNQ---CGIASSAS 322
Query: 346 IPL 348
PL
Sbjct: 323 YPL 325
>gi|194320502|gb|ACF48469.1| cathepsin L [Triatoma brasiliensis]
Length = 330
Score = 268 bits (686), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 154/328 (46%), Positives = 200/328 (60%), Gaps = 15/328 (4%)
Query: 29 SIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEV-- 86
+I+ S H S D E + + + HGKTYK E++ R +IF +N K I+ N +
Sbjct: 9 AIIALSYAH-PSFDIYPEEWHVFKAMHGKTYKNQFEEMFRMKIFMDNKKKIEAHNAKYEQ 67
Query: 87 --TSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGA 144
SY + +N F D+ EFK G K T+R + E + LPK+VDWR+KGA
Sbjct: 68 GEVSYKMMMNHFGDLMVHEFKALMNGFKMSPDTKR--NGELYFPSNSNLPKTVDWRQKGA 125
Query: 145 VTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDY 203
VTPVK+QG CGSCW+FS ++EG + +G L SLSEQ L+DC TS+ NNGC GGLMD
Sbjct: 126 VTPVKDQGQCGSCWSFSATGSLEGQVFLKTGKLVSLSEQNLVDCSTSYGNNGCEGGLMDQ 185
Query: 204 AFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-Q 262
AF+Y+ + G+ E YPY E TC KK ++ T G+ D+P DE++L ALA
Sbjct: 186 AFQYVSDNKGIDTEASYPYEARENTCRFKKNKVG-GTDKGHVDIPAGDEKALQNALATVG 244
Query: 263 PVSVAIEASGTDFQFYSGGVFTGP-CGA-ELDHGVAAVGYGKSKGSDYIIVKNSWGPKWG 320
P+SVAI+A+ FQFYS GV+ P C + +LDHGV AVGYG G DY +VKNSWGP WG
Sbjct: 245 PISVAIDANHGSFQFYSKGVYNEPNCSSYDLDHGVLAVGYGTENGQDYWLVKNSWGPSWG 304
Query: 321 ERGYIRMKRNTGKPEGLCGINKMASIPL 348
E GYI++ RN CGI MAS PL
Sbjct: 305 ENGYIKIARNHSNH---CGIASMASYPL 329
>gi|442754503|gb|JAA69411.1| Putative cathepsin l-like cysteine proteinase b [Ixodes ricinus]
Length = 335
Score = 268 bits (686), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 152/314 (48%), Positives = 196/314 (62%), Gaps = 19/314 (6%)
Query: 48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTS----YWLGLNEFADMSHEE 103
+ ++ +KHGK+Y E++ R +I+ EN I + N++ Y + +NEF DM H E
Sbjct: 27 WSAFKAKHGKSYVSETEEVFRLKIYMENRHKIAKHNEKYARGEVPYSMAMNEFGDMLHHE 86
Query: 104 FKNKYLGLKPQFPTRRQPSAEFSYRDVK-----ALPKSVDWRKKGAVTPVKNQGSCGSCW 158
F + G K + + QP +Y + + +LPK+VDWR KGAVTPVKNQG CGSCW
Sbjct: 87 FVSTRNGFKRNY--KDQPREGSTYLEPENIEDFSLPKTVDWRTKGAVTPVKNQGQCGSCW 144
Query: 159 AFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKE 217
AFS ++EG + SG++ SLSEQ L+DC T F NNGC GGLMD AFKYI A+ G+ E
Sbjct: 145 AFSATGSLEGQHFRKSGSMVSLSEQNLVDCSTDFGNNGCEGGLMDNAFKYIRANKGIDTE 204
Query: 218 EDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAIEASGTDFQ 276
+ YPY +GTC KK + T SG+ D+ E E L KA+A P+SVAI+AS FQ
Sbjct: 205 KSYPYNGTDGTCHFKKSTVG-ATDSGFVDIKEGSETQLKKAVATVGPISVAIDASHESFQ 263
Query: 277 FYSGGVFTGP-CGAE-LDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKP 334
FYS GV+ P C +E LDHGV VGYG G+DY +VKNSWG WG+ GYIRM RN
Sbjct: 264 FYSDGVYDEPECDSESLDHGVLVVGYGTLNGTDYWLVKNSWGTTWGDEGYIRMSRN---K 320
Query: 335 EGLCGINKMASIPL 348
+ CGI AS PL
Sbjct: 321 KNQCGIASSASYPL 334
>gi|357114837|ref|XP_003559200.1| PREDICTED: fruit bromelain-like [Brachypodium distachyon]
Length = 371
Score = 268 bits (685), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 147/354 (41%), Positives = 204/354 (57%), Gaps = 20/354 (5%)
Query: 11 LLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIEL--FESWMSKHGKTYKCIEEKLHR 68
+L L LF + +I+ + H+ +D ++ L F W + H +TY EE+L R
Sbjct: 20 VLMLRGCLFVFLTALPPAAIMTPAAGHVVELDDMLMLDRFVRWQAAHNRTYGDAEERLRR 79
Query: 69 FEIFKENLKHIDQRNKEV-TSYWLGLNEFADMSHEEFKNKYLGL----------KPQFPT 117
F++++ N+++I+ N+ +Y LG N+FAD++ EEF + Y T
Sbjct: 80 FQVYRANIEYIEATNRRGGLTYELGENQFADLTSEEFLSMYASSYDAGDRADDEAALITT 139
Query: 118 RRQPSAEFSYRDVKALPK-SVDWRKKGAVTPVKNQG-SCGSCWAFSTVAAVEGINQIVSG 175
+S D++ALP S DWR KGAVTP KNQG +C SCWAF TVA +EG+ I +G
Sbjct: 140 DVAGDGAWSDGDLEALPPPSWDWRAKGAVTPPKNQGPTCSSCWAFVTVATIEGLTFIKTG 199
Query: 176 NLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEE 235
L SLSEQ+L+DCD ++ GCN G F++++ +GGL E +YPY G C K
Sbjct: 200 KLISLSEQQLVDCDM-YDGGCNTGSYSRGFRWVLENGGLTTEAEYPYTAARGPCNRAKSA 258
Query: 236 MEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGV 295
I+G +P +E + KA+A QPV VAIE G+ QFY GV++GPCG L H V
Sbjct: 259 HHAAKITGQGRIPPQNELVMQKAVAGQPVGVAIEV-GSGMQFYKTGVYSGPCGTNLAHAV 317
Query: 296 AAVGYG--KSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
VGYG + G+ Y IVKNSWG WGERG+IRM+R+ G P GLCGI + P
Sbjct: 318 TVVGYGVDPASGAKYWIVKNSWGQAWGERGFIRMRRDVGGP-GLCGIALDVAYP 370
>gi|156371477|ref|XP_001628790.1| predicted protein [Nematostella vectensis]
gi|156215775|gb|EDO36727.1| predicted protein [Nematostella vectensis]
Length = 330
Score = 268 bits (685), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 143/317 (45%), Positives = 192/317 (60%), Gaps = 11/317 (3%)
Query: 38 LTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFA 97
+ D+ + +++W H K Y + E+ R I+++NLK I + N E S+ L +N
Sbjct: 18 VVKFDEDEQQWQAWKLFHTKKYTTVTEEGARKAIWRDNLKKIQKHNAEGHSFTLAMNHLG 77
Query: 98 DMSHEEFKNKYLGLKPQFP--TRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCG 155
D++ +EF+ Y G++ + T++Q SA + V+ +P +VDWRK+G VTPVKNQG CG
Sbjct: 78 DLTQDEFRYFYTGMRSHYSNYTKKQGSAFLAPSHVQ-VPDTVDWRKEGYVTPVKNQGQCG 136
Query: 156 SCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIVASGGL 214
SCWAFST ++EG N +G L SLSEQ L+DC T++ NNGC GGLMDYAFKYI +GG+
Sbjct: 137 SCWAFSTTGSLEGQNFKKTGKLVSLSEQNLVDCSTAYGNNGCQGGLMDYAFKYIKENGGI 196
Query: 215 HKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAIEASGT 273
EE YPY C +K + V +G+ DV DE++L A P+SVAI+A
Sbjct: 197 DTEESYPYEARNDRCRFQKSNIGAVD-TGFVDVTHGDEEALKTAAGTVGPISVAIDAGHM 255
Query: 274 DFQFYSGGVFT--GPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNT 331
FQFY GV+ G LDHGV VGYG +GSDY +VKNSWG +WG GYI M RN
Sbjct: 256 SFQFYHSGVYNNAGCSSTSLDHGVLVVGYGTYQGSDYWLVKNSWGERWGMEGYIMMSRNK 315
Query: 332 GKPEGLCGINKMASIPL 348
CG+ AS PL
Sbjct: 316 NNQ---CGVATQASYPL 329
>gi|312100382|gb|ADQ27799.1| mitogenic proteinase [Vasconcellea cundinamarcensis]
Length = 214
Score = 268 bits (684), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 125/216 (57%), Positives = 162/216 (75%), Gaps = 6/216 (2%)
Query: 134 PKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFN 193
P+S+DWR+KGAVTPVK+Q CGSCWAFSTVA VEGIN+IV+G L SLSEQEL+DCD +
Sbjct: 2 PESIDWRQKGAVTPVKDQNPCGSCWAFSTVATVEGINKIVTGKLISLSEQELLDCDRR-S 60
Query: 194 NGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQ 253
+GCNGG + +Y+V +G +H E +YPY ++G C K ++ V I+GY+ VP NDE
Sbjct: 61 HGCNGGYQTTSLQYVVDNG-VHTEYEYPYEKKQGNCRAKDKKGLKVQITGYKRVPPNDEI 119
Query: 254 SLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKN 313
SL+K +A+QPVSV IE+ F FY GG++ GPCG LDH V A+GYGK DYI++KN
Sbjct: 120 SLIKVIANQPVSVLIESKDRSFHFYRGGIYKGPCGTRLDHAVTAIGYGK----DYILIKN 175
Query: 314 SWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
SWGP WGE+GYIR+KR +GK EG+CG+ K + P+K
Sbjct: 176 SWGPNWGEKGYIRIKRASGKSEGICGVYKSSYFPIK 211
>gi|2224810|emb|CAB09698.1| cysteine proteinase [Hordeum vulgare subsp. vulgare]
Length = 349
Score = 268 bits (684), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 148/322 (45%), Positives = 201/322 (62%), Gaps = 13/322 (4%)
Query: 36 EHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNK-EVTSYWLGLN 94
E +T ++ E WM++HG+TY EEK R E+F+ N K ID N E +++ L N
Sbjct: 32 EAITVDAAMVSRHEKWMAEHGRTYANEEEKARRLEVFRANAKLIDSFNSAEDSTHRLATN 91
Query: 95 EFADMSHEEFKNKYLGLK-PQFPTRRQPSAEFSYR----DVKALPKSVDWRKKGAVTPVK 149
FAD++ EEF+ GL+ P S +R + S+DWR GAVT VK
Sbjct: 92 RFADLTDEEFRAARTGLRRPPAAAAGAGSGAGGFRYENFSLADAAGSMDWRAMGAVTGVK 151
Query: 150 NQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNN-GCNGGLMDYAFKYI 208
+QGSCG CWAFS VAAVEG+ +I +G L SLSEQ+L+DCD ++ GC GGLMD AF+Y+
Sbjct: 152 DQGSCGCCWAFSAVAAVEGLTKIRTGRLVSLSEQQLVDCDVYGDDEGCAGGLMDNAFEYM 211
Query: 209 VASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAI 268
+ GGL E YPY +G+C + +I GY+DVP N+E +L+ A+AHQPVSVAI
Sbjct: 212 INRGGLTTESSYPYRGTDGSC---RRSASAASIRGYEDVPANNEAALMAAVAHQPVSVAI 268
Query: 269 EASGTDFQFYSGGVFTGP-CGAELDHGVAAVGYG-KSKGSDYIIVKNSWGPKWGERGYIR 326
+ F+FY GV G CG EL+H + A GYG S G+ Y I+KNSWG WGE GY+R
Sbjct: 269 NGGDSVFRFYDSGVLGGSGCGTELNHAITAAGYGTASDGTKYWIMKNSWGGSWGEGGYVR 328
Query: 327 MKRNTGKPEGLCGINKMASIPL 348
++R + EG+CG+ ++AS P+
Sbjct: 329 IRRGV-RGEGVCGLAQLASYPV 349
>gi|346466067|gb|AEO32878.1| hypothetical protein [Amblyomma maculatum]
Length = 358
Score = 267 bits (682), Expect = 7e-69, Method: Compositional matrix adjust.
Identities = 154/305 (50%), Positives = 190/305 (62%), Gaps = 15/305 (4%)
Query: 55 HGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT----SYWLGLNEFADMSHEEFKNKYLG 110
HGK Y E+ +R +I+ EN I + N++ SY L +NEF D+ H EF + G
Sbjct: 57 HGKEYHSETEEYYRLKIYMENRLKIARHNEKYANNKASYKLAMNEFGDLLHHEFVSTRNG 116
Query: 111 LKPQF-PTRRQPSAEFSYRDV--KALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVE 167
K + T R+ S + K LPK+VDWRKKGAVTPVKNQG CGSCWAFST ++E
Sbjct: 117 FKRNYRSTPREGSFYIEPEGIEDKHLPKTVDWRKKGAVTPVKNQGQCGSCWAFSTTGSLE 176
Query: 168 GINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEE 226
G + +G + SLSEQ L+DC F NNGC GGLMD AFKYI A+GG+ E YPY +
Sbjct: 177 GQHFRKTGRMVSLSEQNLVDCSGKFGNNGCEGGLMDNAFKYIKANGGIDTELSYPYNGTD 236
Query: 227 GTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAIEASGTDFQFYSGGVFTG 285
G C +K ++ T +G+ D+PE +EQ L KA+A PVSVAI+AS FQFYS GV+
Sbjct: 237 GICHFEKSDVG-ATDTGFVDIPEGNEQLLKKAVATVGPVSVAIDASHESFQFYSQGVYDE 295
Query: 286 P-CGAE-LDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKM 343
P C +E LDHGV VGYG G DY +VKNSWG WG+ GYI M RN E CGI
Sbjct: 296 PECSSESLDHGVLVVGYGTKDGQDYWLVKNSWGTTWGDDGYIYMTRN---KENQCGIASS 352
Query: 344 ASIPL 348
AS PL
Sbjct: 353 ASYPL 357
>gi|126681066|gb|ABO26562.1| cathepsin L-like cysteine protease [Ixodes ricinus]
Length = 335
Score = 266 bits (681), Expect = 9e-69, Method: Compositional matrix adjust.
Identities = 152/314 (48%), Positives = 195/314 (62%), Gaps = 19/314 (6%)
Query: 48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTS----YWLGLNEFADMSHEE 103
+ ++ +KHGK+Y E++ R +I+ EN I + N++ Y + +NEF DM H E
Sbjct: 27 WSAFKAKHGKSYVSETEEVFRLKIYMENRHKIAKHNEKYARGEVPYSMAMNEFGDMLHHE 86
Query: 104 FKNKYLGLKPQFPTRRQPSAEFSYRDVK-----ALPKSVDWRKKGAVTPVKNQGSCGSCW 158
F + G K + + QP +Y + + +LPK+VDWR KGAVTPVKNQG CGSCW
Sbjct: 87 FVSTRNGFKRNY--KDQPREGSTYLEPENIEDFSLPKTVDWRTKGAVTPVKNQGQCGSCW 144
Query: 159 AFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKE 217
AFS ++EG + SG++ SLSEQ L+ C T F NNGC GGLMD AFKYI A+ G+ E
Sbjct: 145 AFSATGSLEGQHFRKSGSMVSLSEQNLVGCSTDFGNNGCEGGLMDDAFKYIRANKGIDTE 204
Query: 218 EDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAIEASGTDFQ 276
+ YPY +GTC KK + T SG+ D+ E E L KA+A P+SVAI+AS FQ
Sbjct: 205 KSYPYNGTDGTCHFKKSTVG-ATDSGFVDIKEGSETQLKKAVATVGPISVAIDASHESFQ 263
Query: 277 FYSGGVFTGP-CGAE-LDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKP 334
FYS GV+ P C +E LDHGV VGYG G+DY VKNSWG WG+ GYIRM RN
Sbjct: 264 FYSDGVYDEPECDSESLDHGVLVVGYGTLNGTDYWFVKNSWGTTWGDEGYIRMSRN---K 320
Query: 335 EGLCGINKMASIPL 348
+ CGI ASIPL
Sbjct: 321 KNQCGIASSASIPL 334
>gi|413953665|gb|AFW86314.1| hypothetical protein ZEAMMB73_546353 [Zea mays]
Length = 233
Score = 266 bits (681), Expect = 9e-69, Method: Compositional matrix adjust.
Identities = 129/235 (54%), Positives = 163/235 (69%), Gaps = 7/235 (2%)
Query: 119 RQPSAEFSYRDVKA--LPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGN 176
R P+ F Y +V A LP ++DWR KGAVTP+K+QG CG CWAFS VAA EGI +I +G
Sbjct: 2 RIPTG-FRYENVSADALPTTIDWRTKGAVTPIKDQGQCGCCWAFSAVAATEGIVKISTGK 60
Query: 177 LTSLSEQELIDCDT-SFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEE 235
L SL+EQEL+DCD + GC GGLMD AFK+I+ +GGL E YPY +G C K
Sbjct: 61 LVSLAEQELVDCDVHDEDQGCEGGLMDDAFKFIIKNGGLTTESSYPYTAADGKC--KSGS 118
Query: 236 MEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGV 295
TI GY+DVP NDE +L+KA+A+QPVSVA++ FQFYSGGV TG CG +LDHG+
Sbjct: 119 NSAATIKGYEDVPANDEAALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGI 178
Query: 296 AAVGYGK-SKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
AA+GYGK S G+ Y ++KNSWG WGE GY+RM+++ G+CG+ S P K
Sbjct: 179 AAIGYGKTSDGTKYWLMKNSWGTTWGENGYLRMEKDISDKRGMCGLAMEPSYPTK 233
>gi|298709635|emb|CBJ31444.1| Cathepsin L-like proteinase [Ectocarpus siliculosus]
Length = 475
Score = 266 bits (680), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 148/321 (46%), Positives = 191/321 (59%), Gaps = 21/321 (6%)
Query: 44 LIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEE 103
L+ FE W K+G+++ + E H + + I N E Y L N ++ MS +E
Sbjct: 158 LLGFFE-WTYKYGQSWGSVHEAFHALQNYARADDKIALHNHEDAGYTLAHNAYSHMSWQE 216
Query: 104 FKNKY-LGLKPQFPTRRQPSAEFSYRDV-----------KALPKSVDWRKKGAVTPVKNQ 151
F+ + +G P + P AEF+ R +P VDW KGAVTPVKNQ
Sbjct: 217 FREHFSIGKDMVVPPDQLP-AEFALRPRGEKAPKELLRGAPIPDEVDWVAKGAVTPVKNQ 275
Query: 152 GSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVAS 211
GSCGSCW+FST ++EG + I GNL LSEQEL+DCDT ++ GCNGGLMDY+F +I +
Sbjct: 276 GSCGSCWSFSTTGSMEGAHFIKHGNLAVLSEQELVDCDT-YDMGCNGGLMDYSFHWIQQN 334
Query: 212 GGLHKEEDYPYLMEEGTCEDKKEEMEVV---TISGYQDVPENDEQSLLKALAHQPVSVAI 268
GG+ EEDYPY C KK +VV + + DV +DEQ+L++A+A QPVS+AI
Sbjct: 335 GGICSEEDYPYTAAGDLC--KKSTCDVVEGTMVDKWVDVASDDEQALMEAVAQQPVSIAI 392
Query: 269 EASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSK-GSDYIIVKNSWGPKWGERGYIRM 327
EA FQ YSGGV T CG LDHGV VGYG S+ G Y VKNSWGP+WG GYI +
Sbjct: 393 EADQMSFQLYSGGVLTAACGTNLDHGVLLVGYGVSEDGVKYWKVKNSWGPEWGAEGYILL 452
Query: 328 KRNTGKPEGLCGINKMASIPL 348
KR + G CGI + AS P+
Sbjct: 453 KREADQEGGECGILEQASYPV 473
>gi|281204396|gb|EFA78592.1| cysteine proteinase 3 [Polysphondylium pallidum PN500]
Length = 330
Score = 266 bits (679), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 152/327 (46%), Positives = 190/327 (58%), Gaps = 11/327 (3%)
Query: 28 FSIVGY-SPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEV 86
F IVG S L S F +WM + + Y E + R+ FK NL I + N +
Sbjct: 8 FLIVGIASANRLFSEQHYQNQFTNWMVRLDRAYDVFEFQ-DRYNAFKNNLDLIHKWNSQG 66
Query: 87 TSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKA-LPKSVDWRKKGAV 145
S LG+N AD+S+EE++N YLG+K Q +A V A + S+DWR GAV
Sbjct: 67 HSTVLGVNHLADLSNEEYRNLYLGVKVDASRLPQQAASIKLNKVFAPVAASLDWRSSGAV 126
Query: 146 TPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNN-GCNGGLMDYA 204
VK+QG CGSCW+FST ++EG NQI +GN SLSEQ+L+DC + N GCNGGLMD A
Sbjct: 127 GRVKDQGQCGSCWSFSTTGSIEGANQIATGNFASLSEQQLMDCSRDYGNEGCNGGLMDAA 186
Query: 205 FKYIVASGGLHKEEDYPYLMEEG-TCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQP 263
KY++A GGL EE YPY M + TC+ + IS Y DV E L L P
Sbjct: 187 MKYVIAQGGLDTEESYPYTMSDSYTCKFNPANIG-AKISSYIDVQRGSETDLAAKLNKGP 245
Query: 264 VSVAIEASGTDFQFYSGGVFTGP-CGA-ELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGE 321
VSVAI+AS + FQ Y GV+ P C + LDHGV AVGYG S+Y IVKNSWGP WG
Sbjct: 246 VSVAIDASHSSFQLYKSGVYYEPACSSYNLDHGVLAVGYGTEGSSNYWIVKNSWGPNWGL 305
Query: 322 RGYIRMKRNTGKPEGLCGINKMASIPL 348
GYI M ++ CGI+ MASIP+
Sbjct: 306 SGYIWMAKDKSNH---CGISSMASIPV 329
>gi|449532567|ref|XP_004173252.1| PREDICTED: oryzain alpha chain-like [Cucumis sativus]
Length = 321
Score = 266 bits (679), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 126/196 (64%), Positives = 147/196 (75%), Gaps = 1/196 (0%)
Query: 155 GSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGL 214
GSCWAFS+VAAVEGINQIV+G L LSEQEL+DCD SFN GCNGGLMDYAF++I+ +GG+
Sbjct: 13 GSCWAFSSVAAVEGINQIVTGELIPLSEQELVDCDKSFNMGCNGGLMDYAFQFIIGNGGI 72
Query: 215 HKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTD 274
EEDYPY + C+ ++ +VVTI GY+DVPENDE SL KA+A+QPVSVAIEA G
Sbjct: 73 DTEEDYPYKGRDAACDPNRKNAKVVTIDGYEDVPENDESSLKKAVANQPVSVAIEAGGRA 132
Query: 275 FQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGK- 333
FQ Y GVFTG CG +LDHGV AVGYG G+DY IV+NSWG WGE GYIR++RN
Sbjct: 133 FQLYQSGVFTGRCGTDLDHGVVAVGYGTDNGTDYWIVRNSWGKDWGESGYIRLERNVANI 192
Query: 334 PEGLCGINKMASIPLK 349
G CGI S P K
Sbjct: 193 TTGKCGIAVQPSYPTK 208
>gi|255563134|ref|XP_002522571.1| cysteine protease, putative [Ricinus communis]
gi|223538262|gb|EEF39871.1| cysteine protease, putative [Ricinus communis]
Length = 343
Score = 266 bits (679), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 147/322 (45%), Positives = 200/322 (62%), Gaps = 17/322 (5%)
Query: 35 PEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT-SYWLGL 93
P L + + + E E WM++HG+TY EK RF+IFK NL +I+ NK +Y LGL
Sbjct: 27 PRPLLNAEAIAEKHEQWMARHGRTYHDNAEKERRFQIFKNNLDYIENFNKAFNKTYKLGL 86
Query: 94 NEFADMSHEEFKNKYLGLK-----PQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPV 148
N+F+D+S EEF Y G + P T +P+ +Y + +P+S+DWR+ G VT V
Sbjct: 87 NKFSDLSEEEFVTTYNGYEMPTTLPTANTTVKPTFFSNYYNQDEVPESIDWRENGVVTSV 146
Query: 149 KNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYI 208
KNQG CG CWAFS VAAVEGI +GN SLS Q+L+DC N+GC GG M AF+YI
Sbjct: 147 KNQGECGCCWAFSAVAAVEGI----AGNGASLSAQQLLDC-VGDNSGCGGGTMIKAFEYI 201
Query: 209 VASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAI 268
V + G+ + DYPY + C I+GY+ V ++ E++L +A+A QP+SVAI
Sbjct: 202 VQNQGIVSDTDYPYEQTQEMCRSGSN--VAARITGYESVIQS-EEALKRAVAKQPISVAI 258
Query: 269 EA-SGTDFQFYSGGVFTGP-CGAELDHGVAAVGYGKSK-GSDYIIVKNSWGPKWGERGYI 325
+A SG +F+ Y GVF+ CG L H V VGYG ++ G+ Y +VKNSWG +WGE GY+
Sbjct: 259 DASSGPNFKSYISGVFSAEDCGTHLTHAVTLVGYGTTEDGTKYWLVKNSWGEEWGESGYM 318
Query: 326 RMKRNTGKPEGLCGINKMASIP 347
R++R+ G EG CGI AS P
Sbjct: 319 RLQRDVGAMEGPCGIAMQASYP 340
>gi|401397136|ref|XP_003879989.1| cathepsin L, related [Neospora caninum Liverpool]
gi|325114397|emb|CBZ49954.1| cathepsin L, related [Neospora caninum Liverpool]
Length = 415
Score = 265 bits (678), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 141/292 (48%), Positives = 182/292 (62%), Gaps = 8/292 (2%)
Query: 48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNK 107
F S+ + +GK+Y EE R+ IFK NL +I N++ SY L +N F D+S EEF+ K
Sbjct: 119 FGSFRATYGKSYATEEETQKRYAIFKNNLAYIHTHNQQGYSYSLKMNHFGDLSREEFRRK 178
Query: 108 YLGLKPQFPTRRQP---SAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVA 164
YLG + + E +P +VDWR+KG VTPVK+Q CGSCWAFS
Sbjct: 179 YLGYNKSRNLKSNNLGVATELLKVSPSDVPSAVDWREKGCVTPVKDQRDCGSCWAFSATG 238
Query: 165 AVEGINQIVSGNLTSLSEQELIDCDTS-FNNGCNGGLMDYAFKYIVASGGLHKEEDYPYL 223
A+EG + +G L SLSEQEL+DC + N GC+GG M+ AF+Y+V SGGL EE YPYL
Sbjct: 239 ALEGAHCAKTGELLSLSEQELVDCSLAEGNQGCSGGEMNDAFQYVVDSGGLCSEEGYPYL 298
Query: 224 MEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVF 283
+G C K+ +VVTISG++DVP E ++ ALAH PVS+AIEA FQFY GVF
Sbjct: 299 ARDGEC--KRACKKVVTISGFKDVPRKSETAMKAALAHSPVSIAIEADQLPFQFYHEGVF 356
Query: 284 TGPCGAELDHGVAAVGYGKSKGS--DYIIVKNSWGPKWGERGYIRMKRNTGK 333
CG +LDHGV VGYG K + D+ I+KNSWG WG GY+ M + G+
Sbjct: 357 DASCGTDLDHGVLLVGYGTDKETKKDFWIMKNSWGSGWGRDGYMYMAMHKGE 408
>gi|427797099|gb|JAA64001.1| Putative cathepsin l cathepsin l, partial [Rhipicephalus
pulchellus]
Length = 331
Score = 265 bits (678), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 155/305 (50%), Positives = 188/305 (61%), Gaps = 15/305 (4%)
Query: 55 HGKTYKCIEEKLHRFEIFKEN----LKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLG 110
HGK Y+ E+ +R +I+ EN +H ++ K SY L +NEF DM H EF + G
Sbjct: 30 HGKEYESDTEEYYRLKIYMENRLKIARHNEKYAKSQVSYKLAMNEFGDMLHHEFVSTRNG 89
Query: 111 LKPQF-PTRRQPS--AEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVE 167
K + T R+ S E + LPK+VDWRKKGAVTPVKNQG CGSCW+FST ++E
Sbjct: 90 FKRNYRDTPREGSFFVEPEGLEDFHLPKTVDWRKKGAVTPVKNQGQCGSCWSFSTTGSLE 149
Query: 168 GINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEE 226
G + L SLSEQ LIDC SF NNGC GGLMDYAFKYI A+ G+ E+ YPY +
Sbjct: 150 GQHFRKLHKLVSLSEQNLIDCSRSFGNNGCEGGLMDYAFKYIKANKGIDTEQSYPYNATD 209
Query: 227 GTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAIEASGTDFQFYSGGVFTG 285
G C K + T +G+ D+PE DE L KA+A PVSVAI+AS FQFYS GV+
Sbjct: 210 GVCHFNKSAVG-ATDTGFVDIPEGDENKLKKAVATVGPVSVAIDASHESFQFYSEGVYDE 268
Query: 286 P-CGAE-LDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKM 343
P C +E LDHGV VGYG G DY +VKNSWG WG+ GYI M RN + CGI
Sbjct: 269 PECDSEQLDHGVLVVGYGTKDGQDYWLVKNSWGTTWGDGGYIYMSRN---KDNQCGIASA 325
Query: 344 ASIPL 348
AS PL
Sbjct: 326 ASYPL 330
>gi|222641485|gb|EEE69617.1| hypothetical protein OsJ_29194 [Oryza sativa Japonica Group]
Length = 360
Score = 265 bits (676), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 140/306 (45%), Positives = 180/306 (58%), Gaps = 19/306 (6%)
Query: 44 LIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRN-KEVTSYWLGLNEFADMSHE 102
+++ F +W H ++Y EE L RF++++ N + ID N + +Y L NEFAD++ E
Sbjct: 47 MMDRFRAWQGAHNRSYPSAEEALQRFDVYRRNAEFIDAVNLRGDLTYQLAENEFADLTEE 106
Query: 103 EFKNKYLGLKP-QFPTRRQP--------SAEFSYR-DVKALPKSVDWRKKGAVTPVKNQG 152
EF Y G P A FSYR DV P SVDWR +GAV P K+Q
Sbjct: 107 EFLATYTGYYAGDGPVDDSVITTGAGDVDASFSYRVDV---PASVDWRAQGAVVPPKSQT 163
Query: 153 S-CGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVAS 211
S C SCWAF T A +E +N I +G L SLSEQ+L+DCD S++ GCN G A+K++V +
Sbjct: 164 STCSSCWAFVTAATIESLNMIKTGKLVSLSEQQLVDCD-SYDGGCNLGSYGRAYKWVVEN 222
Query: 212 GGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEAS 271
GGL E DYPY G C K I+G+ VP +E +L A+A QPV+VAIE
Sbjct: 223 GGLTTEADYPYTARRGPCNRAKSAHHAAKITGFGKVPPRNEAALQAAVARQPVAVAIEV- 281
Query: 272 GTDFQFYSGGVFTGPCGAELDHGVAAVGYGK--SKGSDYIIVKNSWGPKWGERGYIRMKR 329
G+ QFY GGV+TGPCG L H V VGYG S G+ Y +KNSWG WGERGYIR+ R
Sbjct: 282 GSGMQFYKGGVYTGPCGTRLAHAVTVVGYGTDASSGAKYWTIKNSWGQSWGERGYIRILR 341
Query: 330 NTGKPE 335
+ G P
Sbjct: 342 DVGGPR 347
>gi|308810026|ref|XP_003082322.1| cysteine protease-1 (ISS) [Ostreococcus tauri]
gi|116060790|emb|CAL57268.1| cysteine protease-1 (ISS) [Ostreococcus tauri]
Length = 430
Score = 264 bits (675), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 148/344 (43%), Positives = 193/344 (56%), Gaps = 37/344 (10%)
Query: 39 TSMDKLIELFESWMSKHG--KTYKCIEEKLHRFEIFKENLKHIDQRNK-----EVTSYWL 91
++ + L FE W S+HG + + EE R F EN ++ + N EV S+W+
Sbjct: 89 SNANALARHFERWCSEHGLERYLRDTEEYAKRLATFAENAAYVVEHNALYAIGEV-SHWV 147
Query: 92 GLNEFADMSHEEFKNKYLGLKPQFPTR--------------RQPSAEFSYRDVKALPKSV 137
GLN A + EE++ LG KP+ + Q A + Y V P+++
Sbjct: 148 GLNSLAATTREEYR-ALLGYKPELRSSGDAEMLEATSTDKVEQYKASWEYASVDP-PEAI 205
Query: 138 DWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCN 197
DW + GAVTP KNQG CGSCWAFST AVEGI +I +G L SLSEQE++ C N GCN
Sbjct: 206 DWVELGAVTPPKNQGQCGSCWAFSTTGAVEGITKIRTGRLVSLSEQEMVSCSKQ-NMGCN 264
Query: 198 GGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLK 257
GGLMDYAF++IV +GG+ E YPY E C K ++ V TI G++DVP DE+ L K
Sbjct: 265 GGLMDYAFRWIVKNGGIDSEFQYPYSAEALACNRWKLQLHVATIDGFKDVPPGDEKELEK 324
Query: 258 ALAHQPVSVAIEASGTDFQFYSGGVF-TGPCGAELDHGVAAVGYG-----------KSKG 305
A++ QPVS+AIEA FQ Y GGV+ + CG+++DHGV VGYG +
Sbjct: 325 AVSQQPVSIAIEADTKSFQLYDGGVYDSKECGSQVDHGVLVVGYGFDDTHHNATKHHKRH 384
Query: 306 SDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
+ VKNSWG WGE G+IRM R G CGI S P K
Sbjct: 385 RHFWKVKNSWGGTWGEGGFIRMARRISDETGQCGITTAPSYPTK 428
>gi|238481789|gb|ACR43934.1| cathepsin L-like cysteine proteinase [Haliotis diversicolor
supertexta]
Length = 347
Score = 264 bits (674), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 154/309 (49%), Positives = 199/309 (64%), Gaps = 16/309 (5%)
Query: 50 SWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT----SYWLGLNEFADMSHEEFK 105
S+ +HG+ Y+ EE+ RFEIFK+NL++I++ NK+ + SY+LG+N+FADM +EEF+
Sbjct: 44 SFKKQHGRLYEKHEEEEERFEIFKQNLQYIEEHNKKFSLGQKSYYLGINQFADMKNEEFR 103
Query: 106 NKYLGLKPQFPTRR--QPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTV 163
Y GL+ + R Q S + + A P VDWRKKG VT VKNQG CGSCW+FST
Sbjct: 104 -MYNGLRRDYNYSREVQCSNHLTPEYLVA-PDEVDWRKKGYVTAVKNQGQCGSCWSFSTT 161
Query: 164 AAVEGINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKEEDYPY 222
++EG + SG L SLSEQ+L+DC F N GCNGGLMD AF+YI+ +GG+ EE+YPY
Sbjct: 162 GSLEGQHFHKSGKLVSLSEQQLVDCSGKFGNEGCNGGLMDQAFEYIITNGGIETEEEYPY 221
Query: 223 LMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAIEASGTDFQFYSGG 281
+ C KK E+ T SG DV DE L ++A PVS+AI+AS FQ YSGG
Sbjct: 222 DARQERCHFKKSEV-AATASGCVDVKSGDETDLKNSVAEVGPVSIAIDASHQSFQLYSGG 280
Query: 282 VFTGP-CGA-ELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCG 339
V+ P C + ELDHGV VGYG G DY +VKNSWG WG GY++M RN + CG
Sbjct: 281 VYDEPKCSSTELDHGVLVVGYGTDDGQDYWLVKNSWGTTWGLEGYVKMSRN---QDNQCG 337
Query: 340 INKMASIPL 348
+ AS PL
Sbjct: 338 VATQASYPL 346
>gi|115468686|ref|NP_001057942.1| Os06g0582600 [Oryza sativa Japonica Group]
gi|55296512|dbj|BAD68726.1| putative cysteine proteinase [Oryza sativa Japonica Group]
gi|113595982|dbj|BAF19856.1| Os06g0582600 [Oryza sativa Japonica Group]
gi|215695236|dbj|BAG90427.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 357
Score = 264 bits (674), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 152/356 (42%), Positives = 213/356 (59%), Gaps = 18/356 (5%)
Query: 4 FSHSKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIE 63
FS + +LL+ + ++ C + + + G + + E +E W + HG+TYK
Sbjct: 8 FSLAAILLIII---MYCCPTGLVEAARKGPAAAGGGDDSAMRERYEKWAADHGRTYKDSL 64
Query: 64 EKLHRFEIFKENLKHIDQRNKE--VTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQP 121
EK RFE+F+ N ID N S L N+FAD+++EEF +Y G +P F T
Sbjct: 65 EKARRFEVFRTNALFIDSFNAAGGKKSPRLTTNKFADLTNEEFA-EYYG-RP-FSTPVIG 121
Query: 122 SAEFSYRDVKA--LPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTS 179
+ F Y +V+ +P +++WR +GAVT VKNQ C SCWAFS VAAVEGI+QI S NL +
Sbjct: 122 GSGFMYGNVRTSDVPANINWRDRGAVTQVKNQKDCASCWAFSAVAAVEGIHQIRSHNLVA 181
Query: 180 LSEQELIDCDTSFNN-GCNGGLMDYAFKYIVASGGLHKEEDYPYLMEE-GTCEDKKEEME 237
LS Q+L+DC T NN GCN G MD AF+YI ++GG+ E DYPY GTC + +
Sbjct: 182 LSTQQLLDCSTGRNNHGCNRGDMDEAFRYITSNGGIAAESDYPYEDRALGTCRASGKPV- 240
Query: 238 VVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTG----PCGAELDH 293
+I G+Q VP N+E +LL A+AHQPVSVA++ G QF+S GVF C +L+H
Sbjct: 241 AASIRGFQYVPPNNETALLLAVAHQPVSVALDGVGKVSQFFSSGVFGAMQNETCTTDLNH 300
Query: 294 GVAAVGYGKSK-GSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPL 348
+ AVGYG + G+ Y ++KNSWG WGE GY+++ R+ GLCG+ S P+
Sbjct: 301 AMTAVGYGTDEHGTKYWLMKNSWGTDWGEGGYMKIARDVASNTGLCGLAMQPSYPV 356
>gi|119433808|gb|ABL74967.1| cysteine protease [Acanthamoeba castellanii]
Length = 330
Score = 264 bits (674), Expect = 6e-68, Method: Compositional matrix adjust.
Identities = 142/310 (45%), Positives = 185/310 (59%), Gaps = 9/310 (2%)
Query: 42 DKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSH 101
D L +F WM H K+Y EE + R+ +++EN I + N++ SY+L +N+F D+++
Sbjct: 24 DPLTGVFADWMRTHTKSYSN-EEFVFRWNVWRENYNFIQEENRKNNSYYLTMNKFGDLTN 82
Query: 102 EEFKNKYLGLKPQFPTR-RQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAF 160
EF Y GL + + A LP + DWR+KGAVT VKNQG CGSCW+F
Sbjct: 83 AEFNKVYKGLAFDYSAHILKAKAATPAAPAPGLPANFDWRQKGAVTHVKNQGQCGSCWSF 142
Query: 161 STVAAVEGINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKEED 219
ST + EG N + G L SLSEQ LIDC S+ NNGCNGGLMDYAF+YI+ + G+ E
Sbjct: 143 STTGSTEGANFLKRGTLVSLSEQNLIDCSGSYGNNGCNGGLMDYAFEYIINNKGIDTEAS 202
Query: 220 YPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYS 279
YPY + C +++ Y DV DE +LL A+A +P SVAI+AS FQFYS
Sbjct: 203 YPYETAQYNCRYNPAN-SGGSLTSYTDVSSGDENALLNAVAIEPTSVAIDASHNSFQFYS 261
Query: 280 GGVF--TGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGL 337
GGV+ + +LDHGV AVG+G G DY +VKNSWG WG +GYI+M RN
Sbjct: 262 GGVYYESSCSSTQLDHGVLAVGWGTENGQDYWLVKNSWGADWGLQGYIKMARN---RHNN 318
Query: 338 CGINKMASIP 347
CGI AS P
Sbjct: 319 CGIATAASYP 328
>gi|125592009|gb|EAZ32359.1| hypothetical protein OsJ_16569 [Oryza sativa Japonica Group]
Length = 480
Score = 263 bits (673), Expect = 7e-68, Method: Compositional matrix adjust.
Identities = 151/359 (42%), Positives = 207/359 (57%), Gaps = 28/359 (7%)
Query: 18 LFACSSLAHDFSIVGYSPEHLT-------SMDKLIELFESWMSKHG--KTYKCIEEKLHR 68
+ ++ A D SI+ Y+ EH + + ++ W++++G E R
Sbjct: 15 IVGAATAAPDMSIISYNAEHGARGLEEGPTEAEARAAYDLWLAENGGGSPNALGGEHERR 74
Query: 69 FEIFKENLKHIDQRN---KEVTSYWLGLNEFA---------DMSHEEFKNKYLGLKPQFP 116
F +F +NLK +D N E + LG+N D+ + + + + + P
Sbjct: 75 FLVFWDNLKFVDAHNARADERGGFRLGMNRLRRSHQRGVPRDLPRRQGRREEPRRRGEVP 134
Query: 117 TRRQPSAE---FSYRDVKALPKSVD--WRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQ 171
RR A + + P+ R VK G GSCWAFS V+ VE INQ
Sbjct: 135 PRRGGGAAGVRRLEGEGRRRPRQEPGPMRSFSVHLSVKYFGQ-GSCWAFSAVSTVESINQ 193
Query: 172 IVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCE 230
+V+G + +LSEQEL++C T+ N+GCNGGLMD AF +I+ +GG+ E+DYPY +G C+
Sbjct: 194 LVTGEMITLSEQELVECSTNGQNSGCNGGLMDDAFDFIIKNGGIDTEDDYPYKAVDGKCD 253
Query: 231 DKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAE 290
+E +VV+I G++DVP+NDE+SL KA+AHQPVSVAIEA G +FQ Y GVF+G CG
Sbjct: 254 INRENAKVVSIDGFEDVPQNDEKSLQKAVAHQPVSVAIEAGGREFQLYHSGVFSGRCGTS 313
Query: 291 LDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
LDHGV AVGYG G DY IV+NSWGPKWGE GY+RM+RN G CGI MAS P K
Sbjct: 314 LDHGVVAVGYGTDNGKDYWIVRNSWGPKWGESGYVRMERNINVTTGKCGIAMMASYPTK 372
>gi|255586666|ref|XP_002533962.1| cysteine protease, putative [Ricinus communis]
gi|223526059|gb|EEF28418.1| cysteine protease, putative [Ricinus communis]
Length = 417
Score = 263 bits (672), Expect = 8e-68, Method: Compositional matrix adjust.
Identities = 135/290 (46%), Positives = 193/290 (66%), Gaps = 13/290 (4%)
Query: 8 KLLLLSLSLSLFACSS--LAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEK 65
+ L++ L + C S L ++SIVG L S +++ ELF+ W KH K YK +EE
Sbjct: 7 QFLIIFLLVGPLTCLSFTLPDEYSIVGNDLHELLSEERVKELFQQWKEKHRKVYKHVEEA 66
Query: 66 LHRFEIFKENLKHIDQRNKEV----TSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQP 121
R E F+ NLK++ ++N++ +++ +GLN+FADMS+ EF+ KYL K + P +++
Sbjct: 67 EKRLENFRRNLKYVVEKNQKKKNLGSAHTVGLNKFADMSNVEFRQKYLS-KVKKPIKKRN 125
Query: 122 SAEFSYRDVK----ALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNL 177
+ + R P S+DWRKKG VTPVK+QG CGSCWAFS+ A+EGIN IV+G+L
Sbjct: 126 NNLMTSRQRNLQSCVAPSSLDWRKKGVVTPVKDQGDCGSCWAFSSTGAIEGINAIVTGDL 185
Query: 178 TSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEME 237
SLSEQEL+DCDT+ N GC+GG MDYAF++++ +GG+ E DYPY +GTC KEE +
Sbjct: 186 VSLSEQELMDCDTT-NYGCDGGYMDYAFEWVINNGGIDTEIDYPYTGVDGTCNIAKEETK 244
Query: 238 VVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPC 287
VV++ GY+DV E+D +LL A QP+SV I+ S DFQ Y+ G++ G C
Sbjct: 245 VVSVDGYEDVAESD-SALLCATVQQPISVGIDGSAIDFQLYTSGIYNGSC 293
>gi|391343119|ref|XP_003745860.1| PREDICTED: cathepsin L-like [Metaseiulus occidentalis]
Length = 385
Score = 263 bits (672), Expect = 9e-68, Method: Compositional matrix adjust.
Identities = 150/347 (43%), Positives = 217/347 (62%), Gaps = 12/347 (3%)
Query: 8 KLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLH 67
++ L+ SL + A L +++G + L+ L + +E++ ++H K Y+ E+L
Sbjct: 42 QISLVQTSLRVSAGMKLLAVLAVIGLASA-LSPNPNLNQHWENFKAEHNKKYESFPEELM 100
Query: 68 RFEIFKENLKHIDQRN-KEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFS 126
R IF+EN + I+ N K+ ++LG+N F D++++E++ +YLG + T + S FS
Sbjct: 101 RRLIFEENHQFIEDHNSKKEFDFYLGMNHFGDLTNKEYRERYLGYRRPENTPSKASYIFS 160
Query: 127 YRD-VKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQEL 185
+ ++ +P +DWR +G VTPVKNQG CGSCWAFS V ++EG + +G L SLSEQ L
Sbjct: 161 RAEKIEDVPDQIDWRDQGFVTPVKNQGQCGSCWAFSAVGSLEGQHFKSTGKLVSLSEQNL 220
Query: 186 IDCDT-SFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGY 244
+DC T N+GCNGG MD AF+Y+ + G+ E+ YPY+ +G+C K + + T+ G+
Sbjct: 221 VDCSTPEGNSGCNGGWMDQAFEYVKDNHGIDTEDSYPYVGTDGSCHFKNKSIG-ATLKGF 279
Query: 245 QDVPENDEQSLLKALA-HQPVSVAIEASGTDFQFYSGGVFTGP-CG-AELDHGVAAVGYG 301
DV E DE++L +A+ PVSVAI+AS FQFY GGV+ P C +ELDHGV VGYG
Sbjct: 280 MDVKEGDEEALRQAVGVAGPVSVAIDASSMLFQFYRGGVYNVPWCSTSELDHGVLVVGYG 339
Query: 302 KS-KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
K +G D+ +VKNSWG WG GYI M RN G CGI ASIP
Sbjct: 340 KQFQGKDFWMVKNSWGVGWGIYGYIEMSRNKGNQ---CGIASKASIP 383
>gi|384247445|gb|EIE20932.1| hypothetical protein COCSUDRAFT_18161 [Coccomyxa subellipsoidea
C-169]
Length = 387
Score = 263 bits (672), Expect = 9e-68, Method: Compositional matrix adjust.
Identities = 150/327 (45%), Positives = 186/327 (56%), Gaps = 36/327 (11%)
Query: 57 KTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYW-------------------------- 90
K Y EE R IFK N+ +I N SY
Sbjct: 9 KKYSNEEEAALRLNIFKTNVDYITSVNSAQQSYQASKHFSENTQQTALSSLFLSQLAHTD 68
Query: 91 ----LGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALP-KSVDWRKKGAV 145
LGLNEFAD + EEF + +LGL + SA +R P S++W + GAV
Sbjct: 69 LLPQLGLNEFADQTWEEFSSTHLGLNAGEDGSFRSSANTGFRHADVTPANSINWVEAGAV 128
Query: 146 TPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAF 205
TPVKNQ CGSCWAFST +VEG N + +G+L SLSEQ+L+DCDT + GC GGLMDYAF
Sbjct: 129 TPVKNQAFCGSCWAFSTTGSVEGANFLATGDLVSLSEQQLVDCDTKKDQGCGGGLMDYAF 188
Query: 206 KYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVS 265
YI+ +GGL EEDY Y G C +EE VV+I GY+DVP NDE +L KA++ QPVS
Sbjct: 189 DYIIKNGGLDTEEDYSYWSVGGFCNKLREERTVVSIDGYEDVPVNDEVALAKAVSKQPVS 248
Query: 266 VAIEASGTDFQFYSGGVFT--GPCGAELDHGVAAVGYG-KSKGSDYIIVKNSWGPKWGER 322
VAI AS QFYS GV G C L+HGV A GY G Y +VKNSWG WG +
Sbjct: 249 VAICASEA-MQFYSSGVIAAKGSC-IGLNHGVLAAGYDVDESGKPYWLVKNSWGGTWGMQ 306
Query: 323 GYIRMKRNTGKPEGLCGINKMASIPLK 349
GY+++++++ EG CGI AS P+K
Sbjct: 307 GYMKLEKDSSVKEGACGIAMAASYPVK 333
>gi|330805275|ref|XP_003290610.1| hypothetical protein DICPUDRAFT_98747 [Dictyostelium purpureum]
gi|325079249|gb|EGC32858.1| hypothetical protein DICPUDRAFT_98747 [Dictyostelium purpureum]
Length = 334
Score = 263 bits (672), Expect = 9e-68, Method: Compositional matrix adjust.
Identities = 147/337 (43%), Positives = 193/337 (57%), Gaps = 12/337 (3%)
Query: 14 LSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFK 73
+ L++F SL SI + +L S F WM KH K Y E +++ FK
Sbjct: 1 MRLAVFLIVSLV-ILSINVCAATNLFSAQTYQTSFLGWMKKHNKAYHH-HEFNDKYQTFK 58
Query: 74 ENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTR--RQPSAEFSYRDVK 131
+N+ I N + + LGLN FAD+++EE+K YLG+ R + P ++
Sbjct: 59 DNMDFIHNWNSKESDTVLGLNRFADLTNEEYKKTYLGMSINVNLRANQVPMNGLNFERFT 118
Query: 132 ALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTS 191
P S+DWR+ GAV VK+QG CGSCWAF+T AVEG +QI +GN+ + SEQ L+DC
Sbjct: 119 G-PSSIDWRQNGAVAYVKDQGHCGSCWAFATTGAVEGAHQIKTGNMVTFSEQHLVDCSGR 177
Query: 192 F-NNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPEN 250
+ NNGC+GGLM AFKYI+ + G+ EE YPY + C M ISGY+DVP
Sbjct: 178 YGNNGCDGGLMTSAFKYIIDNDGIATEEAYPYTATQNRCV-YNTTMLGTAISGYKDVPRG 236
Query: 251 DEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFT-GPCGA-ELDHGVAAVGYGKSKGSDY 308
E +L A++ QPV+VAI+AS FQ Y GV+ C + L+HGV AVGYG +G DY
Sbjct: 237 SESALTAAISKQPVAVAIDASPITFQLYKSGVYQEATCSSYRLNHGVLAVGYGTLEGKDY 296
Query: 309 IIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMAS 345
IVKNSW WG +GYI M RN CGI MAS
Sbjct: 297 YIVKNSWAETWGNQGYILMARNANNH---CGIATMAS 330
>gi|320169652|gb|EFW46551.1| cathepsin L2 [Capsaspora owczarzaki ATCC 30864]
Length = 325
Score = 263 bits (672), Expect = 9e-68, Method: Compositional matrix adjust.
Identities = 146/308 (47%), Positives = 187/308 (60%), Gaps = 11/308 (3%)
Query: 48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT-SYWLGLNEFADMSHEEFKN 106
F W + H + Y +E+ R EI+ NL+ I++ N SY LG+NEF D++H EF
Sbjct: 21 FAEWKALHNRQYASAQEEALRQEIYLSNLELINEHNAAGRHSYTLGMNEFGDLAHHEFAA 80
Query: 107 KYLGLKPQFPTRRQPSAEFSYR-DVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAA 165
KYLG++ + A +Y + +LP SVDWR G VTPVKNQG CGSCW+FST +
Sbjct: 81 KYLGVRFNGVNATKSFASSTYLPRMVSLPDSVDWRTAGIVTPVKNQGQCGSCWSFSTTGS 140
Query: 166 VEGINQIVSGNLTSLSEQELIDCDTS-FNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLM 224
VEG + +G L SLSEQ L+DC + N GCNGGLMD AF+YI+ +GG+ E YPY
Sbjct: 141 VEGQHARKTGTLVSLSEQNLVDCSSQEGNEGCNGGLMDDAFEYIIKNGGIDTEASYPYTA 200
Query: 225 EEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAIEASGTDFQFYSGGVF 283
GTC+ + T++ YQD+ E L A+A PVSVAI+AS +FQFY GV+
Sbjct: 201 TTGTCKFNAANIG-ATVASYQDIITGSESDLQNAVATVGPVSVAIDASHINFQFYFTGVY 259
Query: 284 T-GPCG-AELDHGVAAVGYGKS-KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGI 340
C +LDHGV AVGYG S +G DY +VKNSWG WG+ GYI M RN + CGI
Sbjct: 260 NEKKCSTTQLDHGVLAVGYGTSTEGKDYWLVKNSWGATWGKAGYIWMSRN---ADNQCGI 316
Query: 341 NKMASIPL 348
AS PL
Sbjct: 317 ATSASYPL 324
>gi|5853329|gb|AAD54424.1|AF182079_1 thiol protease [Matricaria chamomilla]
Length = 501
Score = 263 bits (671), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 147/347 (42%), Positives = 210/347 (60%), Gaps = 18/347 (5%)
Query: 9 LLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHR 68
L+ L+ + +L +FSI+ + S K+ +LF W HGKTY+ EE+ R
Sbjct: 11 LIFLTYVSYSISTKTLPSEFSILEGQENDILSSAKVSDLFGKWKELHGKTYQHEEEENLR 70
Query: 69 FEIFKENLKHIDQRNKEVTS---YWLGLNEFADMSHEEFKNKYLGLKPQFPTRR------ 119
E FK+++K + ++N E S + +GLN+FAD+S+EEFK Y+ +
Sbjct: 71 LENFKKSVKFVMEKNSERKSELDHTVGLNKFADLSNEEFKEMYMSKVKGSRSNELKMGGV 130
Query: 120 QPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTS 179
+ + S R A P S+DWR KG VTP+K+QG CGSCWAFS ++E N I +G+L
Sbjct: 131 KRNMSVSSRTCDA-PTSLDWRDKGVVTPMKDQGQCGSCWAFSVSGSIESANAIATGDLIR 189
Query: 180 LSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLM---EEGTCEDKKEEM 236
LSEQEL+DCDT ++ GC+GG MD A+++I+ +GGL E+DYPY +G C+ K
Sbjct: 190 LSEQELVDCDT-YDYGCDGGNMDTAYRWIIKNGGLDSEDDYPYTSSNGRDGKCDKTKSAK 248
Query: 237 EVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGA---ELDH 293
VV++ Y +V E++E ++L A+A PV++ I S DFQ Y+GGV+ G C + ++DH
Sbjct: 249 SVVSLDSYVEV-ESNEDAVLCAVATTPVTIGIVGSAYDFQLYTGGVYNGQCSSKPYDIDH 307
Query: 294 GVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGI 340
V VGYG G DY IVKNSWG WG GYI M+RNT G+CG+
Sbjct: 308 AVLIVGYGSQDGKDYWIVKNSWGTYWGLEGYILMERNTDIKNGVCGM 354
>gi|356515062|ref|XP_003526220.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 337
Score = 263 bits (671), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 139/304 (45%), Positives = 186/304 (61%), Gaps = 4/304 (1%)
Query: 49 ESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNK-EVTSYWLGLNEFADMSHEEFKNK 107
E WM++HGK YK EK +IF+ N++ I+ + S+ L N+FAD+ EEFK
Sbjct: 33 EKWMAQHGKVYKDAAEKERCLQIFENNMEFIESFDVCGDKSFNLSTNQFADLHDEEFKAL 92
Query: 108 YL-GLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFS-TVAA 165
G K + F Y +V +P S+DWRK+G VTP+K+QG C SCWAFS VA
Sbjct: 93 LTNGHKKEHSLWTTTETLFRYDNVTKIPASMDWRKRGVVTPIKDQGKCLSCWAFSLCVAT 152
Query: 166 VEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLME 225
+EG++QI++ L LSEQEL+D + GC G ++ AFK+I G + E YPY
Sbjct: 153 IEGLHQIITSELVPLSEQELVDFVKGESEGCYGDYVEDAFKFITKKGRIESETHYPYKGV 212
Query: 226 EGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTG 285
TC+ KKE V I GY+ VP E +LLKA+A+Q VSV++EA + FQFYS G+FTG
Sbjct: 213 NNTCKVKKETHGVAQIKGYKKVPSKSENALLKAVANQLVSVSVEARDSAFQFYSSGIFTG 272
Query: 286 PCGAELDHGVAAVGYGKS-KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMA 344
CG + DH VA YG+S G+ Y + KNSWG +WGE+GYIR+K + EGLCGI K
Sbjct: 273 KCGTDTDHRVALASYGESGDGTKYWLAKNSWGTEWGEKGYIRIKXDIPAKEGLCGIAKYP 332
Query: 345 SIPL 348
P+
Sbjct: 333 YYPI 336
>gi|346469447|gb|AEO34568.1| hypothetical protein [Amblyomma maculatum]
Length = 333
Score = 262 bits (670), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 154/320 (48%), Positives = 199/320 (62%), Gaps = 17/320 (5%)
Query: 40 SMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRN----KEVTSYWLGLNE 95
S + L +E++ S H KTYK E+L RF+IF EN I + N K + SY LG+N+
Sbjct: 19 SQEILRTEWEAFKSTHKKTYKSNVEELLRFKIFTENSLFIAKHNVKYAKGLVSYKLGINQ 78
Query: 96 FADMSHEEF---KNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQG 152
FAD+ EF N Y G + R + + +LPK+VDWRKKGAVTPVK+QG
Sbjct: 79 FADLLPHEFVKMMNGYQG--KRLAGRGSTYLPPANLNDSSLPKTVDWRKKGAVTPVKDQG 136
Query: 153 SCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIVAS 211
CGSCWAFS+ ++EG + + +G L SLSEQ L+DC +++ N GCNGGLMD +F YI A+
Sbjct: 137 QCGSCWAFSSTGSLEGQHFLKTGKLVSLSEQNLVDCSSAYGNQGCNGGLMDNSFNYIKAN 196
Query: 212 GGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAIEA 270
GG+ E+ YPY E+G C KKE++ T +G+ D+ E E+ L KA+A PVSVAI+A
Sbjct: 197 GGIDTEDSYPYEAEDGDCRYKKEDVG-ATDTGFVDIKEGSEKDLQKAVATVGPVSVAIDA 255
Query: 271 SGTDFQFYSGGVFTGP-CGAE-LDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMK 328
S FQ YS GV+ P C +E LDHGV AVGYG G Y +VKNSW WG+ GYI M
Sbjct: 256 SQQSFQLYSEGVYDEPNCSSESLDHGVLAVGYGVKNGKKYWLVKNSWAETWGQDGYILMS 315
Query: 329 RNTGKPEGLCGINKMASIPL 348
R+ CGI AS PL
Sbjct: 316 RDKNNQ---CGIASSASYPL 332
>gi|33348836|gb|AAQ16118.1| cathepsin L-like cysteine proteinase B [Rhipicephalus
haemaphysaloides haemaphysaloides]
Length = 335
Score = 262 bits (670), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 149/306 (48%), Positives = 188/306 (61%), Gaps = 17/306 (5%)
Query: 55 HGKTYKCIEEKLHRFEIFKEN----LKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLG 110
HGK Y E+ +R +I+ EN +H ++ K SY L +NEF D+ H EF + G
Sbjct: 34 HGKDYASDTEEYYRLKIYMENRLKIARHNEKYAKSQVSYKLAMNEFGDLLHHEFVSTRNG 93
Query: 111 LKPQFPTRRQPSAEF----SYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAV 166
K + + + F + D++ LPK+VDWRKKGAVTPVKNQG CGSCWAFST ++
Sbjct: 94 FKRNYRDSPREGSFFVEPEGFEDLQ-LPKTVDWRKKGAVTPVKNQGQCGSCWAFSTTGSL 152
Query: 167 EGINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKEEDYPYLME 225
EG + + L SLSEQ L+DC SF NNGC GGLMD AFKYI ++ G+ E YPY
Sbjct: 153 EGPHFRKTRKLVSLSEQNLVDCSRSFGNNGCEGGLMDNAFKYIKSNKGIDTEWSYPYNAT 212
Query: 226 EGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAIEASGTDFQFYSGGVFT 284
+G C + ++ T +G+ D+PE DE L KA+A PVSVAI+AS FQFYS GV+
Sbjct: 213 DGVCHFNRSDVG-ATDTGFVDIPEGDENKLKKAVAAVGPVSVAIDASHESFQFYSEGVYD 271
Query: 285 GP-CGAE-LDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINK 342
P C +E LDHGV VGYG G DY +VKNSWG WG+ GYI M RN + CGI
Sbjct: 272 EPECSSEQLDHGVLVVGYGTKDGQDYWLVKNSWGTTWGDEGYIYMTRN---KDNQCGIAS 328
Query: 343 MASIPL 348
AS PL
Sbjct: 329 SASYPL 334
>gi|320164780|gb|EFW41679.1| cathepsin L2 [Capsaspora owczarzaki ATCC 30864]
Length = 334
Score = 262 bits (670), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 144/311 (46%), Positives = 187/311 (60%), Gaps = 17/311 (5%)
Query: 48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKE-VTSYWLGLNEFADMSHEEFKN 106
FE+W GK+Y E+++R +++ N +D N + SY LG+N FAD++HEEFK
Sbjct: 30 FEAWKRTFGKSYSDAVEEINRRAVWEANKMLVDAHNGAGIHSYTLGMNIFADLTHEEFKR 89
Query: 107 KYLGLKPQFPTRRQPSAEFS-----YRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFS 161
YLG K +P + FS +V ALP SVDWR G VTPVK+QG CGSCW+FS
Sbjct: 90 FYLGTKVDL---NRPRSNFSSTFIPTANVGALPDSVDWRTAGIVTPVKDQGQCGSCWSFS 146
Query: 162 TVAAVEGINQIVSGNLTSLSEQELIDCDTS-FNNGCNGGLMDYAFKYIVASGGLHKEEDY 220
T +VEG + +G L SLSEQ L+DC + N GCNGGLMD AF+YI+ + G+ E Y
Sbjct: 147 TTGSVEGQHARKTGQLVSLSEQNLVDCSKAQGNQGCNGGLMDDAFQYIITNKGIDTEASY 206
Query: 221 PYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAIEASGTDFQFYS 279
PY ++GTC+ + T+S +QD+ E L A+A PVSVAI+AS FQ Y+
Sbjct: 207 PYTAKDGTCKFNAANVG-ATLSSFQDITRGSESDLQNAVATVGPVSVAIDASKNSFQLYT 265
Query: 280 GGVFT-GPCGA-ELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGL 337
GV+ C + LDHGV A GYG S G+ Y +VKNSWG WG+ GYI M RN
Sbjct: 266 SGVYNEKKCSSTSLDHGVLAAGYGTSNGTPYWLVKNSWGSSWGQAGYIWMSRNANNQ--- 322
Query: 338 CGINKMASIPL 348
CGI AS P+
Sbjct: 323 CGIATSASYPI 333
>gi|110739710|dbj|BAF01762.1| cysteine protease component of protease-inhibitor complex
[Arabidopsis thaliana]
Length = 300
Score = 262 bits (670), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 123/192 (64%), Positives = 145/192 (75%)
Query: 159 AFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEE 218
AFST+ AVEGIN+IV+G+L SLSEQEL+DCDTS+N GCNGGLMDYAF++I+ +GG+ E
Sbjct: 1 AFSTIGAVEGINKIVTGDLISLSEQELVDCDTSYNQGCNGGLMDYAFEFIIKNGGIDTEA 60
Query: 219 DYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFY 278
DYPY +G C+ ++ +VVTI Y+DVPEN E SL KALAHQP+SVAIEA G FQ Y
Sbjct: 61 DYPYKAADGRCDQNRKNAKVVTIDSYEDVPENSEASLKKALAHQPISVAIEAGGRAFQLY 120
Query: 279 SGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLC 338
S GVF G CG ELDHGV AVGYG G Y IV+NSWG +WGE GYI+M RN P G C
Sbjct: 121 SSGVFDGLCGTELDHGVVAVGYGTENGKGYWIVRNSWGNRWGESGYIKMARNIEAPTGKC 180
Query: 339 GINKMASIPLKK 350
GI AS P+KK
Sbjct: 181 GIAMEASYPIKK 192
>gi|4469157|emb|CAB38316.1| chymopapain isoform IV [Carica papaya]
Length = 226
Score = 262 bits (669), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 123/216 (56%), Positives = 154/216 (71%), Gaps = 2/216 (0%)
Query: 134 PKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFN 193
P+S+DWR KGAVTPVKNQG+CGSCWAFST+A VEGIN+IV+GNL LSEQEL+DCD +
Sbjct: 1 PQSIDWRAKGAVTPVKNQGACGSCWAFSTIATVEGINKIVTGNLLELSEQELVDCD-RHS 59
Query: 194 NGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQ 253
GC GG + +Y VA+ G+H + YPY ++ C + V I+GY+ VP N E
Sbjct: 60 YGCKGGYQTTSLQY-VANNGVHTSKVYPYQAKQYKCRATDKPGPKVKITGYKRVPSNCET 118
Query: 254 SLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKN 313
S L ALA+QP+SV +EA G FQ Y GVF GPCG +LDH V AVGYG S G +YII+KN
Sbjct: 119 SFLGALANQPLSVLVEAGGKPFQLYKSGVFDGPCGTKLDHAVTAVGYGTSDGKNYIIIKN 178
Query: 314 SWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
SWGP WGE+GY+R+KR +G +G CG+ K + P K
Sbjct: 179 SWGPNWGEKGYMRLKRQSGNSQGTCGVYKSSYYPFK 214
>gi|66810271|ref|XP_638859.1| cysteine proteinase 3 [Dictyostelium discoideum AX4]
gi|166201983|sp|Q23894.2|CYSP3_DICDI RecName: Full=Cysteine proteinase 3; AltName: Full=Cysteine
proteinase II; Flags: Precursor
gi|60467526|gb|EAL65548.1| cysteine proteinase 3 [Dictyostelium discoideum AX4]
Length = 337
Score = 261 bits (668), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 150/346 (43%), Positives = 202/346 (58%), Gaps = 17/346 (4%)
Query: 10 LLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRF 69
+ LS++L +F L+ F G H D I+ WM + K Y +E + R+
Sbjct: 1 MRLSITL-IFTLIVLSISFISAGNVFSHKQYQDSFID----WMRSNNKAYTH-KEFMPRY 54
Query: 70 EIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPT----RRQPSAEF 125
E FK+N+ ++ N + + LGLN+ AD+S+EE++ YLG + +R
Sbjct: 55 EEFKKNMDYVHNWNSKGSKTVLGLNQHADLSNEEYRLNYLGTRAHIKLNGYHKRNLGLRL 114
Query: 126 SYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQEL 185
+ K P +VDWR+K AVTPVK+QG CGSC++FST +VEG+ I +G L SLSEQ +
Sbjct: 115 NRPQFKQ-PLNVDWREKDAVTPVKDQGQCGSCYSFSTTGSVEGVTAIKTGKLVSLSEQNI 173
Query: 186 IDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGY 244
+DC +SF N GCNGGLM AF+YI+ + GL+ EE YPY M+ +E I+ Y
Sbjct: 174 LDCSSSFGNEGCNGGLMTNAFEYIIKNNGLNSEEQYPYEMKVNDECKFQEGSVAAKITSY 233
Query: 245 QDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGP-CGAE-LDHGVAAVGYGK 302
+++ DE L AL PVSVAI+AS FQ Y+ GV+ P C +E LDHGV AVG G
Sbjct: 234 KEIEAGDENDLQNALLLNPVSVAIDASHNSFQLYTAGVYYEPACSSEDLDHGVLAVGMGT 293
Query: 303 SKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPL 348
G DY IVKNSWGP WG GYI M RN + CGI+ MAS P+
Sbjct: 294 DNGEDYYIVKNSWGPSWGLNGYIHMARN---KDNNCGISTMASYPI 336
>gi|237844793|ref|XP_002371694.1| cathepsin L-like thiolproteinase, putative [Toxoplasma gondii ME49]
gi|50313163|gb|AAT74529.1| toxopain-2 [Toxoplasma gondii]
gi|89242977|gb|ABD64744.1| cathepsin L [Toxoplasma gondii]
gi|95007485|emb|CAJ20707.1| toxopain-2 [Toxoplasma gondii RH]
gi|211969358|gb|EEB04554.1| cathepsin L-like thiolproteinase, putative [Toxoplasma gondii ME49]
gi|221480879|gb|EEE19300.1| cysteine protease, putative [Toxoplasma gondii GT1]
gi|221501596|gb|EEE27366.1| cysteine protease, putative [Toxoplasma gondii VEG]
Length = 422
Score = 261 bits (668), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 142/309 (45%), Positives = 187/309 (60%), Gaps = 8/309 (2%)
Query: 46 ELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFK 105
+ F S+ + + K+Y EEK R+ IFK NL +I N++ SY L +N F D+S +EF+
Sbjct: 115 DAFSSFQAMYAKSYATEEEKQRRYAIFKNNLVYIHTHNQQGYSYSLKMNHFGDLSRDEFR 174
Query: 106 NKYLGLKPQFPTRRQ---PSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFST 162
KYLG K + + E LP VDWR +G VTPVK+Q CGSCWAFST
Sbjct: 175 RKYLGFKKSRNLKSHHLGVATELLNVLPSELPAGVDWRSRGCVTPVKDQRDCGSCWAFST 234
Query: 163 VAAVEGINQIVSGNLTSLSEQELIDCDTS-FNNGCNGGLMDYAFKYIVASGGLHKEEDYP 221
A+EG + +G L SLSEQEL+DC + N C+GG M+ AF+Y++ SGG+ E+ YP
Sbjct: 235 TGALEGAHCAKTGKLVSLSEQELMDCSRAEGNQSCSGGEMNDAFQYVLDSGGICSEDAYP 294
Query: 222 YLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGG 281
YL + C + E +VV I G++DVP E ++ ALA PVS+AIEA FQFY G
Sbjct: 295 YLARDEECRAQSCE-KVVKILGFKDVPRRSEAAMKAALAKSPVSIAIEADQMPFQFYHEG 353
Query: 282 VFTGPCGAELDHGVAAVGYGKSKGS--DYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCG 339
VF CG +LDHGV VGYG K S D+ I+KNSWG WG GY+ M + G+ EG CG
Sbjct: 354 VFDASCGTDLDHGVLLVGYGTDKESKKDFWIMKNSWGTGWGRDGYMYMAMHKGE-EGQCG 412
Query: 340 INKMASIPL 348
+ AS P+
Sbjct: 413 LLLDASFPV 421
>gi|297819566|ref|XP_002877666.1| hypothetical protein ARALYDRAFT_906213 [Arabidopsis lyrata subsp.
lyrata]
gi|297323504|gb|EFH53925.1| hypothetical protein ARALYDRAFT_906213 [Arabidopsis lyrata subsp.
lyrata]
Length = 304
Score = 261 bits (667), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 137/309 (44%), Positives = 192/309 (62%), Gaps = 25/309 (8%)
Query: 45 IELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT-SYWLGLNEFADMSHEE 103
IE E WMS+ + Y EK RFEIFK+NLK ++ N +Y L +N+F+D++ EE
Sbjct: 15 IEKHEQWMSRFNRVYSDDSEKTSRFEIFKKNLKFVESFNMNTNNTYKLDVNKFSDLTDEE 74
Query: 104 FKNKYLGLKPQFPT-RRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFST 162
F+ +Y+GL P+ T Q + F Y +V +S+DWR +GAVTPVK+QG CG CWAF+
Sbjct: 75 FQARYMGLVPEGMTGDSQKTVSFRYENVSETGESMDWRLEGAVTPVKDQGQCGCCWAFAA 134
Query: 163 VAAVEGINQIVSGNLTSLSEQELIDCDTSFNN-GCNGGLMDYAFKYIVASGGLHKEEDYP 221
VAAVEG+ +I +G L SLSEQ+L+DC T+ NN GC+GGL A+ YI + G+ EE+YP
Sbjct: 135 VAAVEGVTKIANGELVSLSEQQLVDCSTANNNMGCDGGLALTAYDYIKENQGITSEENYP 194
Query: 222 YLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGG 281
Y + TC K + TISGY+ VP++DE++LLKA++ G
Sbjct: 195 YQAVQQTC--KSTDPAAATISGYEAVPKDDEEALLKAVSQH------------------G 234
Query: 282 VFTGP-CGAELDHGVAAVGYGKS-KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCG 339
+F CG + H V VGYG S +G Y ++KNSWG WGE GY+R+KR+ +P+G+CG
Sbjct: 235 IFEDEYCGTDSHHAVTIVGYGTSEEGIKYWLLKNSWGESWGENGYMRIKRDVDEPQGMCG 294
Query: 340 INKMASIPL 348
+ A P+
Sbjct: 295 LAHRAYYPV 303
>gi|300122868|emb|CBK23875.2| unnamed protein product [Blastocystis hominis]
Length = 316
Score = 261 bits (667), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 142/304 (46%), Positives = 198/304 (65%), Gaps = 16/304 (5%)
Query: 46 ELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFK 105
+LF+++ +K+GK Y E+ +R ++ N+ I++ N + S+ LG+ FADM++ EF
Sbjct: 25 KLFQTFEAKYGKNYLS-SEREYRKKVLAYNMDWIEKFNSDEHSFTLGMTPFADMTNTEFA 83
Query: 106 NKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAA 165
L + P + + + V+ S+DWR+KGAVTPVKNQGSCGSCWAFS A
Sbjct: 84 TSKLCGCMKKPLNHKQARVLNNMAVE----SIDWREKGAVTPVKNQGSCGSCWAFSATGA 139
Query: 166 VEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLME 225
+EG N + +G L SLSEQ+L+DCDT + GC GG MD AF+Y++ GL EEDYPY +
Sbjct: 140 LEGGNFVATGKLVSLSEQQLVDCDTE-DAGCGGGFMDTAFEYVMKK-GLCTEEDYPYHAK 197
Query: 226 EGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVF-T 284
+ C+D + V++I+GY+DVP ND +L +AL PVSVAI+A FQ Y+GGV +
Sbjct: 198 DEDCKD-DQCTSVISITGYEDVPANDGVALKQALTKAPVSVAIQADSFVFQMYTGGVLDS 256
Query: 285 GPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMK-RNTGKPEGLCGINKM 343
CG L+HGV AVGY K +YIIVKNSWG WG++GY+++ R+ G EG+CGIN
Sbjct: 257 DMCGTSLNHGVLAVGYAK----EYIIVKNSWGASWGDKGYVKIAHRDQG--EGICGINMA 310
Query: 344 ASIP 347
AS P
Sbjct: 311 ASYP 314
>gi|391328505|ref|XP_003738729.1| PREDICTED: cathepsin L-like [Metaseiulus occidentalis]
Length = 323
Score = 261 bits (667), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 145/302 (48%), Positives = 188/302 (62%), Gaps = 15/302 (4%)
Query: 55 HGKTYKCIEEKLHRFEIFKENLKHIDQRNKE----VTSYWLGLNEFADMSHEEFKNKYLG 110
HGK+Y EE R ++F +++ I+ N +T+Y +GLN+F DM+ EEF+N + G
Sbjct: 26 HGKSYGHDEEHFRR-QLFYKSVAKINAHNLRHDLGLTTYRMGLNKFTDMTSEEFRN-FKG 83
Query: 111 LKPQFPTRRQPSAEFSYRDV-KALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGI 169
LK ++ F + +ALP VDWR+KG VTPVKNQG CGSCWAFST ++EG
Sbjct: 84 LKFDATKTKRNGTRFQKELLGEALPTQVDWREKGYVTPVKNQGQCGSCWAFSTTGSLEGQ 143
Query: 170 NQIVSGNLTSLSEQELIDCD-TSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGT 228
+ +G L SLSEQ L+DC NNGCNGGLMD F YI +GG+ EE YPY ++G
Sbjct: 144 HFKATGKLVSLSEQNLVDCSRVEGNNGCNGGLMDNGFTYIQQNGGIDTEESYPYTGKDGD 203
Query: 229 CEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAIEASGTDFQFYSGGVFTGP- 286
C + + + G+ DVP+ DE +L A+A PVSVAI+AS FQ+Y GV+ P
Sbjct: 204 CAFNENSVG-ARVKGFVDVPQRDEAALQAAVASVGPVSVAIDASNDSFQYYKEGVYDEPS 262
Query: 287 CG-AELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMAS 345
C ++LDHGV VGYG G DY +VKNSWGP WG+ GYI+M RN E CGI MAS
Sbjct: 263 CSFSQLDHGVLVVGYGTENGVDYWLVKNSWGPTWGQDGYIKMMRN---KENQCGIASMAS 319
Query: 346 IP 347
P
Sbjct: 320 YP 321
>gi|413944252|gb|AFW76901.1| hypothetical protein ZEAMMB73_101481 [Zea mays]
Length = 232
Score = 261 bits (666), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 125/232 (53%), Positives = 161/232 (69%), Gaps = 6/232 (2%)
Query: 122 SAEFSYRDVK--ALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTS 179
S F Y +V A+P ++DWR GAVTP+K+QG CG CWAFS VAA EGI +I +G L S
Sbjct: 3 STGFRYENVSVDAIPATIDWRTNGAVTPIKDQGQCGCCWAFSAVAATEGIVKISTGKLIS 62
Query: 180 LSEQELIDCDT-SFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEV 238
LSEQEL+DCD + GC GGLMD AFK+I+ +GGL E +YPY +G C K
Sbjct: 63 LSEQELVDCDVYGEDQGCEGGLMDDAFKFIIKNGGLTTESNYPYTAADGKC--KSGSNSA 120
Query: 239 VTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAV 298
I GY+DVP NDE +L+KA+A+QPVSVA++ FQFYSGGV TG CG +LDHG+AA+
Sbjct: 121 ANIKGYEDVPTNDEAALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAI 180
Query: 299 GYGK-SKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
GYGK S G+ Y ++KNSWG WGE GY+RM+++ +G+CG+ S P +
Sbjct: 181 GYGKTSDGTKYWLMKNSWGTTWGENGYLRMEKDISDKKGMCGLAIEPSYPTE 232
>gi|164472556|gb|ABY58967.1| cathepsin L [Toxoplasma gondii]
Length = 421
Score = 261 bits (666), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 142/309 (45%), Positives = 187/309 (60%), Gaps = 8/309 (2%)
Query: 46 ELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFK 105
+ F S+ + + K+Y EEK R+ IFK NL +I N++ SY L +N F D+S +EF+
Sbjct: 114 DAFSSFQAMYAKSYATEEEKQRRYAIFKNNLVYIHTHNQQGYSYSLKMNHFGDLSRDEFR 173
Query: 106 NKYLGLKPQFPTRRQP---SAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFST 162
KYLG K + + E LP VDWR +G VTPVK+Q CGSCWAFST
Sbjct: 174 RKYLGFKKSRNLKSHHLGVATELLNVLPSELPAGVDWRSRGCVTPVKDQRDCGSCWAFST 233
Query: 163 VAAVEGINQIVSGNLTSLSEQELIDCDTS-FNNGCNGGLMDYAFKYIVASGGLHKEEDYP 221
A+EG + +G L SLSEQEL+DC + N C+GG M+ AF+Y++ SGG+ E+ YP
Sbjct: 234 TGALEGAHCAKTGKLVSLSEQELMDCSRAEGNQSCSGGEMNDAFQYVLDSGGICSEDAYP 293
Query: 222 YLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGG 281
YL + C + E +VV I G++DVP E ++ ALA PVS+AIEA FQFY G
Sbjct: 294 YLARDEECRAQSCE-KVVKILGFKDVPRRSEAAMKAALAKSPVSIAIEADQMPFQFYHEG 352
Query: 282 VFTGPCGAELDHGVAAVGYGKSKGS--DYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCG 339
VF CG +LDHGV VGYG K S D+ I+KNSWG WG GY+ M + G+ EG CG
Sbjct: 353 VFDASCGTDLDHGVLLVGYGTDKESKKDFWIMKNSWGTGWGRDGYMYMAMHKGE-EGQCG 411
Query: 340 INKMASIPL 348
+ AS P+
Sbjct: 412 LLLDASFPV 420
>gi|46576360|sp|P60994.1|ERVB_TABDI RecName: Full=Ervatamin-B; Short=ERV-B
gi|30749291|pdb|1IWD|A Chain A, Proposed Amino Acid Sequence And The 1.63 Angstrom X-ray
Crystal Structure Of A Plant Cysteine Protease Ervatamin
B: Insight Into The Structural Basis Of Its Stability
And Substrate Specificity
Length = 215
Score = 261 bits (666), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 121/217 (55%), Positives = 155/217 (71%), Gaps = 3/217 (1%)
Query: 133 LPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSF 192
LP VDWR KGAV +KNQ CGSCWAFS VAAVE IN+I +G L SLSEQEL+DCDT+
Sbjct: 1 LPSFVDWRSKGAVNSIKNQKQCGSCWAFSAVAAVESINKIRTGQLISLSEQELVDCDTA- 59
Query: 193 NNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDE 252
++GCNGG M+ AF+YI+ +GG+ +++YPY +G+C K + VV+I+G+Q V N+E
Sbjct: 60 SHGCNGGWMNNAFQYIITNGGIDTQQNYPYSAVQGSC--KPYRLRVVSINGFQRVTRNNE 117
Query: 253 QSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVK 312
+L A+A QPVSV +EA+G FQ YS G+FTGPCG +HGV VGYG G +Y IV+
Sbjct: 118 SALQSAVASQPVSVTVEAAGAPFQHYSSGIFTGPCGTAQNHGVVIVGYGTQSGKNYWIVR 177
Query: 313 NSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
NSWG WG +GYI M+RN GLCGI ++ S P K
Sbjct: 178 NSWGQNWGNQGYIWMERNVASSAGLCGIAQLPSYPTK 214
>gi|391332597|ref|XP_003740719.1| PREDICTED: cathepsin L-like [Metaseiulus occidentalis]
Length = 330
Score = 261 bits (666), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 157/349 (44%), Positives = 213/349 (61%), Gaps = 32/349 (9%)
Query: 10 LLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRF 69
++LSL+++ VG SP + + D+ ELF+ +H KTY ++ + R
Sbjct: 1 MILSLTVACI----------FVGVSPAAVDAHDEHWELFKR---QHNKTY-LQKQDVGRR 46
Query: 70 EIFKENLKHIDQRNKEV----TSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEF 125
IF+ N+K I+ N +SY LGLN FADM+ +EF+ KY G + F ++
Sbjct: 47 AIFEANIKKINAHNLLYDLGRSSYRLGLNGFADMTPDEFE-KYRGTR--FEANEARVSKL 103
Query: 126 SYRDVKAL--PKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQ 183
+RD +++ P +VDWR +G VTPVKNQG CGSCWAFST A+EG + SG+L SLSEQ
Sbjct: 104 QHRDNRSMHVPDTVDWRTEGYVTPVKNQGVCGSCWAFSTTGALEGQHFRRSGDLVSLSEQ 163
Query: 184 ELIDCDTSFNN-GCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTIS 242
L+DC + N GCNGGLMD AF++I +GGL E+ YPY ++GTC + ++
Sbjct: 164 MLVDCSAVYGNAGCNGGLMDNAFRFIKDAGGLETEKSYPYTGKDGTCHFDARGIG-AKLT 222
Query: 243 GYQDVPENDEQSLLKALA-HQPVSVAIEASGTDFQFYSGGVFTG-PCGA-ELDHGVAAVG 299
G+ DVP DE++L +A PVSVAI+ASG +FQFY GV+ C + LDHGV VG
Sbjct: 223 GFVDVPSRDEEALKEAAGVVGPVSVAIDASGQNFQFYKDGVYDEITCSSTSLDHGVLVVG 282
Query: 300 YGKSK-GSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
YG ++ G DY +VKNSWG WG+ GYI+M RN E CGI MAS P
Sbjct: 283 YGTTRDGKDYWLVKNSWGSSWGQSGYIQMSRN---KENQCGIATMASYP 328
>gi|6650705|gb|AAF21977.1|AF115280_1 thiolproteinase SmTP1 [Sarcocystis muris]
Length = 394
Score = 261 bits (666), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 137/309 (44%), Positives = 185/309 (59%), Gaps = 12/309 (3%)
Query: 48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNK 107
F + H K Y EE+L R+ IFK NL +I N + SY L +N+F D++ EEF+ +
Sbjct: 89 FYQFQRDHNKFYATEEERLKRYAIFKNNLTYIHNHNMQGYSYVLKMNKFGDLTLEEFRQR 148
Query: 108 YLGLKPQFPTRRQPSAEFSYR----DVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTV 163
YLG K P R P E + +P VDWR++G VT VK+QG CGSCWAFS
Sbjct: 149 YLGYKK--PDLRTPPREVDTTLESVEDNDIPTHVDWRQRGCVTSVKDQGDCGSCWAFSAT 206
Query: 164 AAVEGINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKEEDYPY 222
A+EG+ +G L +LS+Q+L+DC N GC+GG M+ AF+Y+V +GG+ E+YPY
Sbjct: 207 GAMEGVYCAKTGKLVNLSQQQLVDCSRFLGNQGCDGGRMEEAFEYVVENGGICSGENYPY 266
Query: 223 LMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALA-HQPVSVAIEASGTDFQFYSGG 281
+ ++G C+ + V TI+GY+ VP E+S+ ALA PVSVAI+A+ FQFY G
Sbjct: 267 MRKDGVCK-SSQCTSVATITGYRSVPRRSEKSMKTALALRSPVSVAIQANQAAFQFYYDG 325
Query: 282 VFTGPCGAELDHGVAAVGYGKSKG--SDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCG 339
+F PCG LDHGV VGY DY I+KNSWG WG+ GY+ M + G P G CG
Sbjct: 326 IFDAPCGTNLDHGVLLVGYSAETAGQGDYWIMKNSWGAAWGKGGYMLMAMHKG-PAGQCG 384
Query: 340 INKMASIPL 348
+ S P+
Sbjct: 385 VLLDGSFPV 393
>gi|330805277|ref|XP_003290611.1| hypothetical protein DICPUDRAFT_81345 [Dictyostelium purpureum]
gi|325079250|gb|EGC32859.1| hypothetical protein DICPUDRAFT_81345 [Dictyostelium purpureum]
Length = 330
Score = 260 bits (665), Expect = 6e-67, Method: Compositional matrix adjust.
Identities = 139/305 (45%), Positives = 186/305 (60%), Gaps = 13/305 (4%)
Query: 48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTS-YWLGLNEFADMSHEEFKN 106
F WM KH ++Y E ++++ FK+N+ I N S LGL +FAD+++EE++
Sbjct: 33 FLGWMKKHDRSYH-HHEFNNKYQAFKDNMDFIHNWNTNKNSKTVLGLTQFADLTNEEYRK 91
Query: 107 KYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAV 166
YLG K + + P S+DWR KGAV+ VK+QG CGSCW+FST +V
Sbjct: 92 IYLGTKVNVAPEKHNFNMIHFTG----PDSIDWRTKGAVSHVKDQGQCGSCWSFSTTGSV 147
Query: 167 EGINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKEEDYPYLME 225
EG +QI +GN+ +LSEQ L+DC F NNGC+GGLM AFK+I++ GG+ E+ YPY
Sbjct: 148 EGAHQIKTGNMVTLSEQNLVDCSGKFGNNGCDGGLMVNAFKFIMSQGGVATEDSYPYNAV 207
Query: 226 EGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTG 285
+G C+ K M ISGY+++ + E L AL QPVS+AI+AS FQ Y GV+
Sbjct: 208 QGKCKFTK-SMVGANISGYKEITQGSELELQAALTKQPVSIAIDASQQSFQLYKSGVYDE 266
Query: 286 P-CGA-ELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKM 343
P C + +LDHGV AVGYG G DY IVKNSW WG+ GYI M RN + CG+ M
Sbjct: 267 PECSSYQLDHGVLAVGYGTENGKDYYIVKNSWADSWGQDGYIFMSRN---AKNQCGVATM 323
Query: 344 ASIPL 348
AS P+
Sbjct: 324 ASYPI 328
>gi|310942960|pdb|3P5W|A Chain A, Actinidin From Actinidia Arguta Planch (Sarusashi)
Length = 220
Score = 260 bits (665), Expect = 7e-67, Method: Compositional matrix adjust.
Identities = 123/218 (56%), Positives = 158/218 (72%), Gaps = 2/218 (0%)
Query: 133 LPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSF 192
LP VDWR GAV +K+QG CGSCWAFST+AAVEGIN+I +G+L SLSEQEL+DC +
Sbjct: 1 LPDYVDWRSSGAVVDIKDQGQCGSCWAFSTIAAVEGINKIATGDLISLSEQELVDCGRTQ 60
Query: 193 NN-GCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPEND 251
N GC+GG M F++I+ +GG++ E +YPY EEG C ++ + V+I Y++VP N+
Sbjct: 61 NTRGCDGGFMTDGFQFIINNGGINTEANYPYTAEEGQCNLDLQQEKYVSIDTYENVPYNN 120
Query: 252 EQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIV 311
E +L A+A+QPVSVA+EA+G +FQ YS G+FTGPCG +DH V VGYG G DY IV
Sbjct: 121 EWALQTAVAYQPVSVALEAAGYNFQHYSSGIFTGPCGTAVDHAVTIVGYGTEGGIDYWIV 180
Query: 312 KNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
KNSWG WGE GY+R++RN G G CGI K AS P+K
Sbjct: 181 KNSWGTTWGEEGYMRIQRNVGGV-GQCGIAKKASYPVK 217
>gi|443724292|gb|ELU12369.1| hypothetical protein CAPTEDRAFT_165495 [Capitella teleta]
Length = 351
Score = 260 bits (664), Expect = 7e-67, Method: Compositional matrix adjust.
Identities = 148/314 (47%), Positives = 196/314 (62%), Gaps = 16/314 (5%)
Query: 46 ELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRN----KEVTSYWLGLNEFADMSH 101
+L++ + + H + Y EE + R E+F+ NLK I+ N + +SY +G+N+FADM
Sbjct: 42 KLWQDFKTVHERNYGETEE-MQRKEVFRNNLKKIEMHNYLHSQGKSSYRMGINQFADMEV 100
Query: 102 EEFKNKYLGLKPQFPTRRQPSAEFSYRDVK---ALPKSVDWRKKGAVTPVKNQGSCGSCW 158
+EF + G + T+ + Y +LP VDWRK+G VTP+K+QG CGSCW
Sbjct: 101 KEFASVVNGFRMNNRTKVRDHLHSHYISPAIPVSLPAEVDWRKEGYVTPIKDQGHCGSCW 160
Query: 159 AFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKE 217
+FST A+EG + +G L SLSEQ LIDC TS+ NNGCNGG+MDYAF+YI + G E
Sbjct: 161 SFSTTGALEGQHFRKTGKLVSLSEQNLIDCSTSYGNNGCNGGVMDYAFQYIKDNDGDDTE 220
Query: 218 EDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAIEASGTDFQ 276
+ YPY +G C KKE + T +GY D+P+ DE+ + +A+A PVSVAI+AS T FQ
Sbjct: 221 DSYPYEAADGPCRFKKEYVG-ATDTGYTDLPKGDEEKMKEAVAMVGPVSVAIDASHTSFQ 279
Query: 277 FYSGGVFTG-PCGAE-LDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKP 334
Y GV+ C E LDHGV VGYG G DY +VKNSWG KWG+ GYI+M RN
Sbjct: 280 MYQSGVYDEVECDPEGLDHGVLVVGYGTELGQDYWLVKNSWGTKWGDEGYIKMSRNKNNQ 339
Query: 335 EGLCGINKMASIPL 348
CGI+ MAS PL
Sbjct: 340 ---CGISSMASYPL 350
>gi|414591548|tpg|DAA42119.1| TPA: hypothetical protein ZEAMMB73_388689, partial [Zea mays]
Length = 229
Score = 260 bits (664), Expect = 8e-67, Method: Compositional matrix adjust.
Identities = 125/197 (63%), Positives = 147/197 (74%), Gaps = 1/197 (0%)
Query: 155 GSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGL 214
GSCWAFS +AAVEG+N+I++G L SLSEQEL+DCD N GC+GGLMDYAF+YI +GG+
Sbjct: 13 GSCWAFSAIAAVEGVNKIMTGKLVSLSEQELVDCDDVDNQGCDGGLMDYAFQYIQRNGGV 72
Query: 215 HKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTD 274
E +YPYL E+ +C KE VTI GY+DVP N+E +L KA+A QPV+VAIEASG D
Sbjct: 73 TTESNYPYLAEQRSCNKAKERSHDVTIDGYEDVPANNEDALQKAVASQPVAVAIEASGQD 132
Query: 275 FQFYSGGVFTGPCGAELDHGVAAVGYGKS-KGSDYIIVKNSWGPKWGERGYIRMKRNTGK 333
FQFYS GVFTG CG +LDHGVAAVGYG + G+ Y VKNSWG WGERGYIRM+R
Sbjct: 133 FQFYSEGVFTGSCGTDLDHGVAAVGYGTTGDGTKYWTVKNSWGEDWGERGYIRMQRGVPD 192
Query: 334 PEGLCGINKMASIPLKK 350
GLCGI S P KK
Sbjct: 193 SRGLCGIAMEPSYPTKK 209
>gi|343978787|gb|AEM76722.1| cathepsin L-like proteinase [Triatoma brasiliensis]
Length = 330
Score = 259 bits (663), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 152/349 (43%), Positives = 205/349 (58%), Gaps = 29/349 (8%)
Query: 8 KLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLH 67
K+LL+++++ +C++ ++ + PE +E++ HGK YK E++
Sbjct: 2 KVLLVAVAVIAVSCANRFYNIN-----PEE----------WETFKVVHGKNYKNQFEEMF 46
Query: 68 RFEIFKENLKHIDQRNKEV----TSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSA 123
R +IF N K I+ N + SY + +N F D+ E K G K T+R+
Sbjct: 47 RRKIFMNNKKRIEAHNAKYEQGEVSYKMKMNHFGDLMSHEIKALMNGFKMTPNTKREGKI 106
Query: 124 EFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQ 183
F D LPKSVDWR+KGAVTPVK+QG CGSCW+FS ++EG + G L SLSEQ
Sbjct: 107 YFPSND--KLPKSVDWRQKGAVTPVKDQGQCGSCWSFSATGSLEGQIFLKKGKLVSLSEQ 164
Query: 184 ELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTIS 242
L+DC + NNGC GGLMD AF+Y+ + G+ E YPY + C KK+++ T
Sbjct: 165 NLMDCSKEYGNNGCEGGLMDKAFQYVSDNKGIDTESSYPYEARDYACRFKKDKVG-GTDK 223
Query: 243 GYQDVPENDEQSLLKALAH-QPVSVAIEASGTDFQFYSGGVFTGP-CGA-ELDHGVAAVG 299
GY D+PE DE++L ALA P+SVAI+AS F FYS GV+ P C + +LDHGV AVG
Sbjct: 224 GYVDIPEGDEKALQNALATVGPISVAIDASHESFHFYSEGVYNEPYCSSYDLDHGVLAVG 283
Query: 300 YGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPL 348
YG G DY +VKNSWGP WGE GYI++ RN CGI MAS P+
Sbjct: 284 YGTENGQDYWLVKNSWGPSWGESGYIKIARNHSNH---CGIASMASYPI 329
>gi|449673497|ref|XP_002169904.2| PREDICTED: cathepsin L-like [Hydra magnipapillata]
Length = 325
Score = 259 bits (662), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 143/302 (47%), Positives = 181/302 (59%), Gaps = 11/302 (3%)
Query: 51 WMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLG 110
W H K Y E+ R+ I+K+N+ I + N + + L +N F DM++ EF+ K G
Sbjct: 30 WKMAHNKAYSHESEENVRYAIWKDNMNRITEYNSKSKNVILRMNHFGDMTNTEFRAKMNG 89
Query: 111 LKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGIN 170
L + Q + F A P +VDWR +G VTPVKNQG CGSCWAFS+ A+EG +
Sbjct: 90 L---LLHKHQNGSTFLVPSHTAAPDAVDWRSEGYVTPVKNQGQCGSCWAFSSTGALEGQH 146
Query: 171 QIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTC 229
+G L SLSEQ L+DC T + NNGCNGGLMD AF YI A+GG+ E YPY ++GTC
Sbjct: 147 FKKTGRLVSLSEQNLVDCSTDYGNNGCNGGLMDNAFSYIKANGGIDTETGYPYEGQDGTC 206
Query: 230 EDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAIEASGTDFQFYSGGVFTGP-C 287
K + +G+ D+PE DE +L +A+A PVSVAI+AS FQFY GV+ P C
Sbjct: 207 RYSKSSIG-ADDTGFVDIPEGDEDALKQAVATVGPVSVAIDASHMSFQFYHSGVYDEPQC 265
Query: 288 G-AELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASI 346
+ LDHGV VGYG G DY +VKNSWG WG GYI M RN + CGI AS
Sbjct: 266 SPSALDHGVLVVGYGTDNGKDYWLVKNSWGTGWGTEGYIYMSRNN---QNQCGIASKASY 322
Query: 347 PL 348
PL
Sbjct: 323 PL 324
>gi|428170119|gb|EKX39047.1| hypothetical protein GUITHDRAFT_154556 [Guillardia theta CCMP2712]
Length = 352
Score = 259 bits (662), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 151/348 (43%), Positives = 199/348 (57%), Gaps = 19/348 (5%)
Query: 18 LFACSSLAHDFSIVGYSPEHLTSMDKLIEL-FESWMSKHGKTYKCIEEKLHRFEIFKENL 76
+ +++A +I +S+D I L F SW +K K Y E L RF +FK N+
Sbjct: 4 ILLLAAIAATCAIPTSPASKTSSVDDEIHLAFISWKNKFEKVYDGAEH-LARFAVFKANM 62
Query: 77 KHIDQRNKEVT----SYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDV-- 130
+ I N ++ + N+FADM+ EEFK LG KP+ +R S ++
Sbjct: 63 EIIRAHNALYELGEETFSMAANQFADMTAEEFKRTVLGYKPELKGKRLLQGLNSGKNCTH 122
Query: 131 ----KALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELI 186
PK++DWR K AVTPVKNQG CGSCW+FST AVEG + L SLSE+EL+
Sbjct: 123 RSNNSTRPKAIDWRTKSAVTPVKNQGQCGSCWSFSTTGAVEGAWVVAGHPLISLSEEELV 182
Query: 187 DCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGT---CEDKKEEMEVVTISG 243
CDT + GCNGGLMD A+ +I+ +GG+ E+ YPY+ GT C +V +IS
Sbjct: 183 QCDTKSDQGCNGGLMDNAYAWIIQNGGIAAEDVYPYISGNGTTGVCHVAFLSKKVASISD 242
Query: 244 YQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTG-PCGAELDHGVAAVGYG- 301
+ D+ DE L AL QPV+VAIEA + FQFY+GGV CG +LDHGV AVGYG
Sbjct: 243 WCDLKPEDESDLELALVQQPVAVAIEADQSSFQFYNGGVLPAKKCGTKLDHGVLAVGYGY 302
Query: 302 -KSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPE-GLCGINKMASIP 347
K Y IVKNSWG +WG+ GYIR+++ K + CGI K AS P
Sbjct: 303 DKKHKMHYWIVKNSWGAEWGDEGYIRLEKMPKKTKHSACGIAKAASYP 350
>gi|157834287|pdb|1YAL|A Chain A, Carica Papaya Chymopapain At 1.7 Angstroms Resolution
Length = 218
Score = 259 bits (661), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 122/216 (56%), Positives = 153/216 (70%), Gaps = 2/216 (0%)
Query: 134 PKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFN 193
P+S+DWR KGAVTPVKNQG+CGS WAFST+A VEGIN+IV+GNL LSEQEL+DCD +
Sbjct: 2 PQSIDWRAKGAVTPVKNQGACGSXWAFSTIATVEGINKIVTGNLLELSEQELVDCD-KHS 60
Query: 194 NGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQ 253
GC GG + +Y VA+ G+H + YPY ++ C + V I+GY+ VP N E
Sbjct: 61 YGCKGGYQTTSLQY-VANNGVHTSKVYPYQAKQYKCRATDKPGPKVKITGYKRVPSNXET 119
Query: 254 SLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKN 313
S L ALA+QP+SV +EA G FQ Y GVF GPCG +LDH V AVGYG S G +YII+KN
Sbjct: 120 SFLGALANQPLSVLVEAGGKPFQLYKSGVFDGPCGTKLDHAVTAVGYGTSDGKNYIIIKN 179
Query: 314 SWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
SWGP WGE+GY+R+KR +G +G CG+ K + P K
Sbjct: 180 SWGPNWGEKGYMRLKRQSGNSQGTCGVYKSSYYPFK 215
>gi|224460525|gb|ACN43674.1| cathepsin L [Paralichthys olivaceus]
Length = 334
Score = 258 bits (660), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 140/312 (44%), Positives = 185/312 (59%), Gaps = 15/312 (4%)
Query: 48 FESWMSKHGKTYKCIEEKLHRFEIFKEN----LKHIDQRNKEVTSYWLGLNEFADMSHEE 103
F +W K G++Y E+ R +I+ N + H ++ ++Y LG+ +AD+ HEE
Sbjct: 26 FHAWKLKFGRSYNSSSEEDKRMQIWLRNREIVMAHNAMADQGHSTYRLGMTFYADLEHEE 85
Query: 104 FKNKYLGLKPQFPTRRQPSAEFSYRDVKA---LPKSVDWRKKGAVTPVKNQGSCGSCWAF 160
FK G+ +P S+ + LP+++DWR+ G VTPVKNQGSCGSCW+F
Sbjct: 86 FKQTVFGVCLGSFNASKPRGGSSFLKMHRFYNLPQTIDWRQWGFVTPVKNQGSCGSCWSF 145
Query: 161 STVAAVEGINQIVSGNLTSLSEQELIDCDTSFNN-GCNGGLMDYAFKYIVASGGLHKEED 219
S+ A+EG N +G L SLSEQEL+DC ++ N GCNGG MD AF+YIV GG+H E+
Sbjct: 146 SSTGALEGQNFRKTGRLVSLSEQELVDCSGNYGNYGCNGGWMDNAFRYIVNKGGIHTEDS 205
Query: 220 YPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALA-HQPVSVAIEASGTDFQFY 278
YPY + G C E+ T +GY D+P +E +L +A+A PVSVAI AS FQ Y
Sbjct: 206 YPYEGQVGQCRANYGEIG-ATCTGYYDIPSGNEHALKEAVATFGPVSVAIHASDQSFQLY 264
Query: 279 SGGVFTGP--CGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEG 336
GV+ P G LDH V VGYG G DY +VKNSWGP WG++GYI+M RN
Sbjct: 265 HSGVYNNPYCSGTALDHAVLIVGYGTEYGQDYWLVKNSWGPAWGDQGYIKMSRNRYNQ-- 322
Query: 337 LCGINKMASIPL 348
CGI AS PL
Sbjct: 323 -CGIASAASFPL 333
>gi|33348834|gb|AAQ16117.1| cathepsin L-like cysteine proteinase A [Rhipicephalus
haemaphysaloides haemaphysaloides]
Length = 332
Score = 258 bits (659), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 154/348 (44%), Positives = 202/348 (58%), Gaps = 25/348 (7%)
Query: 9 LLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHR 68
+L LSL ++ A + A+ I L +E++ + H K+Y+ E+L R
Sbjct: 1 MLRLSLLCAIVAVTVAANSHEI-------------LRTQWEAFKTTHKKSYESHMEELLR 47
Query: 69 FEIFKEN----LKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAE 124
F+IF EN KH + K + SY LG+N+F D+ EF + G + Q +R
Sbjct: 48 FKIFTENSLIIAKHNAKYAKGLVSYKLGMNQFGDLLAHEFAKIFNGYRGQRTSRGSTFMP 107
Query: 125 FSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQE 184
+ + +LP +VDWRKKGAVTPVK+QG CGSCWAFS ++EG + + G L SLSEQ
Sbjct: 108 PANVNDSSLPSTVDWRKKGAVTPVKDQGQCGSCWAFSATGSLEGQHFLKDGELVSLSEQN 167
Query: 185 LIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISG 243
L+DC SF NNGC GGLMD AFKYI A+ G+ EE YPY + C KKE++ T +G
Sbjct: 168 LVDCSQSFGNNGCEGGLMDNAFKYIKANDGIDAEESYPYEAMDDKCRFKKEDVG-ATDTG 226
Query: 244 YQDVPENDEQSLLKALAH-QPVSVAIEASGTDFQFYSGGVFTGP-CGA-ELDHGVAAVGY 300
+ D+ E L KA+A P+SVAI+A + FQ YS GV+ P C + ELDHGV AVGY
Sbjct: 227 FVDIEGGSEDDLKKAVATVGPISVAIDAGHSSFQLYSEGVYDEPECSSEELDHGVLAVGY 286
Query: 301 GKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPL 348
G G Y +VKNSWG WG+ GYI M R+ CGI AS PL
Sbjct: 287 GVKDGKKYWLVKNSWGGSWGDNGYILMSRDKNNQ---CGIASAASYPL 331
>gi|294883322|ref|XP_002770704.1| cathepsin L, putative [Perkinsus marinus ATCC 50983]
gi|239873993|gb|EER02713.1| cathepsin L, putative [Perkinsus marinus ATCC 50983]
Length = 333
Score = 258 bits (658), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 139/292 (47%), Positives = 183/292 (62%), Gaps = 11/292 (3%)
Query: 45 IEL-FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEE 103
+EL F + K GK Y+ EE++ R IF+ NL HI+Q N + SY LG+NE AD++HEE
Sbjct: 24 VELAFMGFQHKFGKNYESKEEEVKRNAIFQANLHHIEQVNAKDLSYKLGVNEHADLTHEE 83
Query: 104 FKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTV 163
F LG + TRR D LP SVDWR K +TPVK+QGSCGSCWAFST
Sbjct: 84 FAALKLG-TLKMSTRRDDKFVIE-ADTTQLPTSVDWRNKNVLTPVKDQGSCGSCWAFSTT 141
Query: 164 AAVEGINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKEEDYPY 222
A+E I +G L SLSEQ+L+DC + + NNGC GGLMD A++YI S GL +E Y Y
Sbjct: 142 GALEAQYAIATGKLLSLSEQQLVDCSSGYGNNGCEGGLMDDAYEYI-KSAGLDQESTYSY 200
Query: 223 LMEEGTCE----DKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFY 278
+ C+ + + + ++G+ + + EQSL+KALA PVSVA+ A+ DF+FY
Sbjct: 201 NGTDDVCQGSLAKRSDGIPAGEVTGFH-MLDKTEQSLMKALADAPVSVAMYAADPDFRFY 259
Query: 279 SGGVF-TGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKR 329
GV+ + C +LDHGV AVGYG GSDY I++NSWG WG+ GY +KR
Sbjct: 260 KSGVYSSATCNGKLDHGVVAVGYGTENGSDYFIIRNSWGSSWGQAGYFYLKR 311
>gi|255563136|ref|XP_002522572.1| cysteine protease, putative [Ricinus communis]
gi|223538263|gb|EEF39872.1| cysteine protease, putative [Ricinus communis]
Length = 340
Score = 258 bits (658), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 157/358 (43%), Positives = 208/358 (58%), Gaps = 27/358 (7%)
Query: 1 MAFFSHSKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYK 60
MA +KL ++ + L + ++ P L D + E E WM++HG+TY+
Sbjct: 1 MALPLQTKLAIVLMILVTWVSQAM----------PRPLIDEDAVAEKHEQWMARHGRTYQ 50
Query: 61 CIEEKLHRFEIFKENLKHIDQRNKEVT-SYWLGLNEFADMSHEEFKNKYLGLK-PQ-FPT 117
EEK RF IFK+NLKHI+ N +Y LGLN FAD++ EEF Y G K P+ PT
Sbjct: 51 DDEEKERRFHIFKKNLKHIENFNNAFNRTYKLGLNHFADLTDEEFLATYTGYKMPKVLPT 110
Query: 118 RRQPSAEFSYRDV---KALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVS 174
+ DV +P+S+DWR +G VTPVKNQG CG CWAFS AAVEGI
Sbjct: 111 ANITTKTTQSSDVLYEANVPESIDWRTRGVVTPVKNQGRCGCCWAFSAAAAVEGI----I 166
Query: 175 GNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKE 234
GN SLS Q+L+DC +NGCNGG MD AF+YI+ + GL YPY + C +
Sbjct: 167 GNGVSLSAQQLLDC-VPDSNGCNGGFMDNAFRYIIQNQGLASATYYPYQLMREMC---RP 222
Query: 235 EMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEA-SGTDFQFYSGGVFTGP-CGAELD 292
ISGY DV DE++L A+A QPVS A++A S +F++Y GG+F CG+ L
Sbjct: 223 SNNAARISGYVDVTPADEETLKSAVARQPVSAAVDATSELNFKYYGGGIFPPQDCGSTLT 282
Query: 293 HGVAAVGYGKS-KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
H + VGYG S +G+ Y ++KNSWG WGE GY+R++R+ G G CGI AS P +
Sbjct: 283 HAITIVGYGTSAEGTKYWLIKNSWGEGWGEGGYMRLQRDVGSYGGACGIALRASYPTR 340
>gi|72008176|ref|XP_780713.1| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
Length = 335
Score = 257 bits (657), Expect = 5e-66, Method: Compositional matrix adjust.
Identities = 144/313 (46%), Positives = 191/313 (61%), Gaps = 14/313 (4%)
Query: 46 ELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT----SYWLGLNEFADMSH 101
E + W ++HGK Y EE+ R I+++NL + + N + +Y LG+N+FAD+ +
Sbjct: 26 EDWNQWKNEHGKRYLSDEEEASRKLIWEKNLDIVIKHNLKYDLGHFTYALGMNQFADLKN 85
Query: 102 EEFKNKYLGLKPQFPTRRQPSAEF-SYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAF 160
EEF G + ++ + F ++ LPK+VDWR KG VTPVK+QG CGSCWAF
Sbjct: 86 EEFVAMMTGFRVNGTSKAAKGSTFLPSNNIGELPKTVDWRTKGYVTPVKDQGQCGSCWAF 145
Query: 161 STVAAVEGINQIVSGNLTSLSEQELIDCD-TSFNNGCNGGLMDYAFKYIVASGGLHKEED 219
ST ++EG + +G L SLSEQ L+DC N GC+GGLMD AF+YI+ +GG+ EE
Sbjct: 146 STTGSLEGQHFKATGKLVSLSEQNLVDCSGKEGNEGCDGGLMDQAFQYIIKAGGIDTEES 205
Query: 220 YPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAIEASGTDFQFY 278
YPY +G C KK + T++GY DV + E +L KA+AH P+SVAI+AS FQ Y
Sbjct: 206 YPYKAVDGECHFKKANIG-ATVTGYTDVTSDSETALQKAVAHIGPISVAIDASHMSFQLY 264
Query: 279 SGGVFTGP-CGAE-LDHGVAAVGYG-KSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPE 335
GV+ P C + LDHGV AVGYG S G+DY IVKNSW WG GY+ M RN +
Sbjct: 265 KSGVYNEPDCSSTLLDHGVLAVGYGTTSDGTDYWIVKNSWAETWGMNGYLWMSRN---KD 321
Query: 336 GLCGINKMASIPL 348
CGI AS PL
Sbjct: 322 NQCGIATQASYPL 334
>gi|113120263|gb|ABI30271.1| VXH-D [Vasconcellea x heilbornii]
Length = 276
Score = 257 bits (657), Expect = 5e-66, Method: Compositional matrix adjust.
Identities = 140/274 (51%), Positives = 190/274 (69%), Gaps = 5/274 (1%)
Query: 5 SHSKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEE 64
S SKLL +++ LS+ S FSIVGYSP+ LTS +KLI LF+SWM ++ K YK I+E
Sbjct: 6 SFSKLLFVAICLSVHMGLSYGA-FSIVGYSPDDLTSTEKLINLFDSWMVEYDKVYKDIDE 64
Query: 65 KLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLG-LKPQFPTRRQPSA 123
K++RFEIFK+NLK+ID+ NK+ +YWLGL F D++++EFK KY+G + + T + +
Sbjct: 65 KIYRFEIFKDNLKYIDETNKKNNTYWLGLTSFTDLTNDEFKEKYVGSISESWSTTEESND 124
Query: 124 E-FSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSE 182
E F Y D +P S+DWR+KGAVTPV+NQG CGSCW FS+VAAVEGIN+IV+G L SLSE
Sbjct: 125 EGFIYDDAVNIPTSIDWRQKGAVTPVRNQGGCGSCWTFSSVAAVEGINKIVTGQLLSLSE 184
Query: 183 QELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTIS 242
QEL+DC+ + GC GG YA +Y VA+ G+H + YPY + C + + V
Sbjct: 185 QELLDCERR-SYGCRGGFPLYALQY-VANSGIHLRQYYPYEGVQRQCRASQAKGPKVKTD 242
Query: 243 GYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQ 276
G VP N+EQ+L++ +A QPVS+ +EA G FQ
Sbjct: 243 GVGRVPRNNEQALIQRIAIQPVSIVVEAKGRAFQ 276
>gi|402770499|gb|AFQ98384.1| cathepsin L, partial [Hyalomma anatolicum anatolicum]
Length = 312
Score = 257 bits (657), Expect = 6e-66, Method: Compositional matrix adjust.
Identities = 147/309 (47%), Positives = 191/309 (61%), Gaps = 12/309 (3%)
Query: 48 FESWMSKHGKTYKCIEEKLHRFEIFKEN----LKHIDQRNKEVTSYWLGLNEFADMSHEE 103
+E++ + H K+Y+ E+L R++IF EN KH + K + SY LG+N+F D+ E
Sbjct: 7 WEAFKTTHKKSYQSKMEELLRYKIFTENSLLIAKHNAKYAKGLVSYKLGMNQFGDLLPHE 66
Query: 104 FKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTV 163
F + G + R + + +LPK+VDWRKKGAVTPVK+QG CGSCWAFS
Sbjct: 67 FAKMFNGYHGERKGRGSTFLPPANVNDSSLPKTVDWRKKGAVTPVKDQGQCGSCWAFSAT 126
Query: 164 AAVEGINQIVSGNLTSLSEQELIDCDTSFNN-GCNGGLMDYAFKYIVASGGLHKEEDYPY 222
++EG + + SG L SLSEQ LIDC SF N GC GGLMD AFKYI A+ G+ EE YPY
Sbjct: 127 GSLEGQHFLKSGKLVSLSEQNLIDCSGSFGNEGCGGGLMDNAFKYIKANDGIDTEESYPY 186
Query: 223 LMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAIEASGTDFQFYSGG 281
+G C KKE++ T +G+ D+ + E L KA+A P+SVAI+AS + FQ YS G
Sbjct: 187 EAMDGDCRFKKEDVG-ATDTGFVDIQQGSEDDLQKAVATVGPISVAIDASHSSFQLYSEG 245
Query: 282 VFTGP-CGA-ELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCG 339
V+ P C + ELDHGV AVGYG G Y +VKNSW WG+ GYI M R+ + CG
Sbjct: 246 VYDEPNCSSEELDHGVLAVGYGVKNGKKYWLVKNSWAETWGDNGYILMSRD---KDNQCG 302
Query: 340 INKMASIPL 348
I AS PL
Sbjct: 303 IASSASYPL 311
>gi|357153071|ref|XP_003576329.1| PREDICTED: fruit bromelain-like [Brachypodium distachyon]
Length = 398
Score = 257 bits (657), Expect = 6e-66, Method: Compositional matrix adjust.
Identities = 141/340 (41%), Positives = 187/340 (55%), Gaps = 38/340 (11%)
Query: 44 LIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTS----YWLGLNEFADM 99
++ F+ WM+ G++Y EE RFE++K N+++I+ N E + + LG F D+
Sbjct: 58 MMGRFQGWMAAQGRSYWTAEETARRFEVYKSNVRYIEAVNAEAATTGLTFELGEGPFTDL 117
Query: 100 SHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKAL-------------------------- 133
+HEEF Y G P P + + D + +
Sbjct: 118 THEEFSALYNGSMP--PPEEEEGDDIQEEDEQVIATVVDGVDVNVAVHTNLSAGGPRPWP 175
Query: 134 PKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFN 193
P+S DWRK GAVTP+K+QG CGSCWAF TVA +EG ++IV GNL SLSEQ+LIDCD + N
Sbjct: 176 PRSRDWRKHGAVTPIKDQGRCGSCWAFPTVATIEGKHKIVRGNLVSLSEQQLIDCDYT-N 234
Query: 194 NGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQ 253
+GC GG + A+++I GGL YPY G C K I+G++ V E
Sbjct: 235 SGCKGGFVIRAYRWIRKIGGLTTSSAYPYKGARGKC--MKRRRAAARIAGWRSVRSRSEV 292
Query: 254 SLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCG-AELDHGVAAVGYGKS--KGSDYII 310
+L+ A+A QPV+V I ASG +FQ Y G+ GPC A L+H V VGYG+ G+ Y I
Sbjct: 293 ALVNAVAGQPVAVYISASGKNFQHYKKGILNGPCDTARLNHAVTVVGYGRQADTGAKYWI 352
Query: 311 VKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLKK 350
VKNSWG WG+ GYI MKR T P G CGI PL K
Sbjct: 353 VKNSWGTTWGQEGYILMKRGTRNPRGQCGIATSPVFPLMK 392
>gi|154183745|gb|ABS70713.1| cathepsin L-like cysteine proteinase [Dermacentor variabilis]
Length = 333
Score = 257 bits (656), Expect = 6e-66, Method: Compositional matrix adjust.
Identities = 146/322 (45%), Positives = 200/322 (62%), Gaps = 19/322 (5%)
Query: 39 TSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKEN----LKHIDQRNKEVTSYWLGLN 94
+S + L +E++ + H K+Y+ E+L RF+IF EN +H ++ + + SY LG+N
Sbjct: 18 SSHEILRTQWEAFKATHKKSYQSNMEELLRFKIFSENSLLVARHNEKYARGLVSYKLGMN 77
Query: 95 EFADMSHEEFKNKYLGLKPQFPTRRQ----PSAEFSYRDVKALPKSVDWRKKGAVTPVKN 150
+F D+ EF + G + R P A +Y +LP+S+DWR+KGAVTPVKN
Sbjct: 78 QFGDLLPHEFARMFNGYRGARTAGRGSTFLPPANVNY---SSLPQSMDWREKGAVTPVKN 134
Query: 151 QGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIV 209
QG CGSCWAFST ++EG + + +G L SLSEQ L+DC +F N+GC GGLMD AF+YI
Sbjct: 135 QGQCGSCWAFSTTGSLEGQHFLKTGVLVSLSEQNLVDCSETFGNHGCEGGLMDNAFQYIK 194
Query: 210 ASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAI 268
A+GG+ E+ YPY E+G C KK+ + T +G+ D+ + E L KA+A PVSVAI
Sbjct: 195 ANGGIDTEKSYPYEAEDGECRFKKQNVG-ATDTGFVDIEQGSEDDLKKAVATVGPVSVAI 253
Query: 269 EASGTDFQFYSGGVFT-GPCGAE-LDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIR 326
+AS + FQ YS GV+ C +E LDHGV VGYG G Y +VKNSW WG+ GYI+
Sbjct: 254 DASHSSFQLYSEGVYDETECSSEQLDHGVLVVGYGVEDGKKYWLVKNSWAESWGDNGYIK 313
Query: 327 MKRNTGKPEGLCGINKMASIPL 348
M R+ + CGI AS PL
Sbjct: 314 MSRD---KDNQCGIASAASYPL 332
>gi|156397875|ref|XP_001637915.1| predicted protein [Nematostella vectensis]
gi|156225031|gb|EDO45852.1| predicted protein [Nematostella vectensis]
Length = 331
Score = 256 bits (655), Expect = 8e-66, Method: Compositional matrix adjust.
Identities = 142/306 (46%), Positives = 181/306 (59%), Gaps = 9/306 (2%)
Query: 48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNK 107
+++W S HGK Y E+ R I++ NLK I N+ S+ L +N DM+ E
Sbjct: 29 WKAWKSFHGKEYPNKNEETMRNFIWQNNLKKIVTHNEGKHSFKLAMNHLGDMTSLEISQT 88
Query: 108 YLGLKPQFPTRRQP-SAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAV 166
LGLK + QP A F + S+DWR KG VTPVKNQG CGSCWAFST A+
Sbjct: 89 LLGLKLKKHAESQPKGATFLPPANVKVVDSIDWRSKGYVTPVKNQGQCGSCWAFSTTGAL 148
Query: 167 EGINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKEEDYPYLME 225
EG + +G L SLSEQ L+DC + NNGC GGLMD AF+YI +GG+ E+ YPYL +
Sbjct: 149 EGQHFRKTGKLVSLSEQNLVDCSGKYGNNGCEGGLMDNAFQYIKENGGIDTEKSYPYLAK 208
Query: 226 EGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAIEASGTDFQFYSGGVFT 284
+G C K + +G+ D+P DE +L +ALA P+S+AI+AS + F FY GV+
Sbjct: 209 DGVCHYNKSAIGAKD-TGFVDIPTGDENALQQALASVGPISIAIDASQSTFHFYHQGVYD 267
Query: 285 GP--CGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINK 342
P LDHGV AVGYG G DY +VKNSWGP WGE GYI++ RN CG+
Sbjct: 268 DPDCSSTRLDHGVLAVGYGTDDGKDYWLVKNSWGPSWGEEGYIKIARND---HDKCGVAS 324
Query: 343 MASIPL 348
AS PL
Sbjct: 325 KASYPL 330
>gi|302831223|ref|XP_002947177.1| hypothetical protein VOLCADRAFT_103269 [Volvox carteri f.
nagariensis]
gi|300267584|gb|EFJ51767.1| hypothetical protein VOLCADRAFT_103269 [Volvox carteri f.
nagariensis]
Length = 514
Score = 256 bits (655), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 160/362 (44%), Positives = 208/362 (57%), Gaps = 54/362 (14%)
Query: 36 EHLTSMDKLI-------ELFESWMSKHGKTYKCIE---EKLHRFEIFKENLKHIDQRNKE 85
E L S D L F W ++G+TY +E E R IF +N++ I + +++
Sbjct: 19 EQLASSDLLALAKVEPHRAFTLWSRQYGRTY--VEQSPEYTRRLSIFSDNVRAIQESHEK 76
Query: 86 VTSYWLGLNEFADMSHEEFKNKYLGLKPQ-----FPTRRQPSAEFSYRDVKAL--PKSVD 138
L LNE+AD++ EEF + LGL+ +RR S ++R A+ PK++D
Sbjct: 77 DPGVTLALNEYADLTWEEFSSTRLGLRIDQDQLDRRSRRSASRRNAWRYAAAVDNPKAID 136
Query: 139 WRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTS------- 191
WR+KGAV VKNQG CGSCWAFST A+EGIN IV+G L SLSEQ+L+DCDT
Sbjct: 137 WREKGAVAEVKNQGQCGSCWAFSTTGAIEGINAIVTGQLQSLSEQQLVDCDTGKRTVTRS 196
Query: 192 -------------------FNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGT---C 229
N GC+GGLMD AFKY++ +GGL E+DY Y G C
Sbjct: 197 KRSCTVILPSYSSNSCRNESNMGCSGGLMDDAFKYVIQNGGLDTEQDYAYWSGYGLGFWC 256
Query: 230 EDKKE-EMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCG 288
+K+ + V+I GY+DVP+ E +LLKA+AHQPV+VAI +G QFYS GV + C
Sbjct: 257 NKRKQTDRPAVSIDGYEDVPQG-EDNLLKAVAHQPVAVAI-CAGASMQFYSRGVIS-TCC 313
Query: 289 AELDHGVAAVGYGKSK-GSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
L+HGV VGY S+ G Y IVKNSWG WGE+GY R+K G+ GLCGI AS P
Sbjct: 314 EGLNHGVLTVGYNVSQDGEKYWIVKNSWGAGWGEQGYFRLKMGVGE-TGLCGIASAASYP 372
Query: 348 LK 349
K
Sbjct: 373 TK 374
>gi|229893789|gb|ACQ90252.1| cathepsin L [Pinctada fucata]
Length = 362
Score = 256 bits (655), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 145/312 (46%), Positives = 197/312 (63%), Gaps = 15/312 (4%)
Query: 46 ELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEV----TSYWLGLNEFADMSH 101
E ++ + + GK Y +EE++ RF+IF++ L+ I++ N++ SY++G+N+F+DMSH
Sbjct: 52 ETWKEFKTLFGKVYDTVEEEIKRFDIFRDTLERIEEHNRKYHMGQKSYYMGVNQFSDMSH 111
Query: 102 EEFKNKYLGLKPQFPTRRQPSAEFSY-RDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAF 160
+E+ ++ GL+ + SY + K L VDWR KG VTPVKNQG CGSCW+F
Sbjct: 112 DEYL-RHNGLRRGNRKYSKGEGCDSYTKSGKQLDDKVDWRDKGYVTPVKNQGQCGSCWSF 170
Query: 161 STVAAVEGINQIVSGNLTSLSEQELIDCDTSFNN-GCNGGLMDYAFKYIVASGGLHKEED 219
ST ++EG + +G L SLSEQ+L+DC +F N GCNGGLMD AF+YI + GGL E+D
Sbjct: 171 STTGSLEGQHFRQTGKLISLSEQQLVDCSGTFGNEGCNGGLMDNAFEYIKSIGGLEGEDD 230
Query: 220 YPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAIEASGTDFQFY 278
YPY ++G C KK + +G DV DE +L ALA P+SVAI+AS FQ Y
Sbjct: 231 YPYTAKQGKCHLKKSLFK-ANDTGCTDVESGDEDALKDALASVGPISVAIDASHASFQSY 289
Query: 279 SGGVFT-GPCGAE-LDHGVAAVGYG-KSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPE 335
GGV+ C ++ LDHGV VGYG + G DY +VKNSWG WGE GYI+M RN +
Sbjct: 290 DGGVYDEEECSSQNLDHGVLTVGYGTEENGGDYWLVKNSWGEMWGEEGYIKMSRN---KD 346
Query: 336 GLCGINKMASIP 347
CGI AS P
Sbjct: 347 NQCGIATQASYP 358
>gi|310942958|pdb|3P5U|A Chain A, Actinidin From Actinidia Arguta Planch (Sarusashi)
gi|310942959|pdb|3P5V|A Chain A, Actinidin From Actinidia Arguta Planch (Sarusashi)
gi|310942961|pdb|3P5X|A Chain A, Actinidin From Actinidia Arguta Planch (Sarusashi)
Length = 220
Score = 256 bits (654), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 122/218 (55%), Positives = 157/218 (72%), Gaps = 2/218 (0%)
Query: 133 LPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSF 192
LP VDWR GAV +K+QG CGS WAFST+AAVEGIN+I +G+L SLSEQEL+DC +
Sbjct: 1 LPDYVDWRSSGAVVDIKDQGQCGSXWAFSTIAAVEGINKIATGDLISLSEQELVDCGRTQ 60
Query: 193 NN-GCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPEND 251
N GC+GG M F++I+ +GG++ E +YPY EEG C ++ + V+I Y++VP N+
Sbjct: 61 NTRGCDGGFMTDGFQFIINNGGINTEANYPYTAEEGQCNLDLQQEKYVSIDTYENVPYNN 120
Query: 252 EQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIV 311
E +L A+A+QPVSVA+EA+G +FQ YS G+FTGPCG +DH V VGYG G DY IV
Sbjct: 121 EWALQTAVAYQPVSVALEAAGYNFQHYSSGIFTGPCGTAVDHAVTIVGYGTEGGIDYWIV 180
Query: 312 KNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
KNSWG WGE GY+R++RN G G CGI K AS P+K
Sbjct: 181 KNSWGTTWGEEGYMRIQRNVGGV-GQCGIAKKASYPVK 217
>gi|116666824|pdb|2BDZ|A Chain A, Mexicain From Jacaratia Mexicana
gi|116666825|pdb|2BDZ|B Chain B, Mexicain From Jacaratia Mexicana
gi|116666826|pdb|2BDZ|C Chain C, Mexicain From Jacaratia Mexicana
gi|116666827|pdb|2BDZ|D Chain D, Mexicain From Jacaratia Mexicana
Length = 214
Score = 256 bits (654), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 117/216 (54%), Positives = 161/216 (74%), Gaps = 6/216 (2%)
Query: 134 PKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFN 193
P+S+DWR+KGAVTPVKNQ CGSCWAFSTVA +EGIN+I++G L SLSEQEL+DC+ +
Sbjct: 2 PESIDWREKGAVTPVKNQNPCGSCWAFSTVATIEGINKIITGQLISLSEQELLDCERR-S 60
Query: 194 NGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQ 253
+GC+GG + +Y+V +G +H E +YPY ++G C K ++ V I+GY+ VP NDE
Sbjct: 61 HGCDGGYQTTSLQYVVDNG-VHTEREYPYEKKQGRCRAKDKKGPKVYITGYKYVPANDEI 119
Query: 254 SLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKN 313
SL++A+A+QPVSV ++ G FQFY GG++ GPCG DH V AVGYGK+ Y+++KN
Sbjct: 120 SLIQAIANQPVSVVTDSRGRGFQFYKGGIYEGPCGTNTDHAVTAVGYGKT----YLLLKN 175
Query: 314 SWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
SWGP WGE+GYIR+KR +G+ +G CG+ + P+K
Sbjct: 176 SWGPNWGEKGYIRIKRASGRSKGTCGVYTSSFFPIK 211
>gi|402770507|gb|AFQ98388.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 256 bits (654), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 151/321 (47%), Positives = 197/321 (61%), Gaps = 18/321 (5%)
Query: 39 TSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKEN----LKHIDQRNKEVTSYWLGLN 94
+S + L +E++ + H KTY+ E+L RF+IF EN KH + K + SY LG+N
Sbjct: 18 SSQEILRTQWEAFKTTHKKTYQSHMEELLRFKIFTENSLIIAKHNAKYAKGLVSYKLGMN 77
Query: 95 EFADMSHEEFK---NKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQ 151
+F D+ EF N Y G + + P A + +LPK+VDWRKKGAVTPVK+Q
Sbjct: 78 QFGDLLAHEFARIFNGYHGSRKSGGSTFLPPANV---NDSSLPKAVDWRKKGAVTPVKDQ 134
Query: 152 GSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIVA 210
G CGSCWAFST ++EG + + +G L SLSEQ L+DC SF NNGC GGLM+ AFKYI A
Sbjct: 135 GQCGSCWAFSTTGSLEGQHFLKNGELVSLSEQNLVDCSQSFGNNGCEGGLMEDAFKYIKA 194
Query: 211 SGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAIE 269
+ G+ E+ YPY +G C KKE++ T +GY ++ E L KA+A P+SVAI+
Sbjct: 195 NDGIDTEKSYPYEAVDGECRFKKEDVG-ATDTGYVEIKAGCEDDLKKAVATVGPISVAID 253
Query: 270 ASGTDFQFYSGGVFTGP-CGAE-LDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRM 327
AS + FQ YS GV+ P C +E LDHGV VGYG G Y +VKNSW WG++GYI M
Sbjct: 254 ASHSSFQLYSEGVYDEPECSSEDLDHGVLVVGYGVKGGKKYWLVKNSWAESWGDQGYILM 313
Query: 328 KRNTGKPEGLCGINKMASIPL 348
R+ CGI AS PL
Sbjct: 314 SRDNNNQ---CGIASQASYPL 331
>gi|326503122|dbj|BAJ99186.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326512552|dbj|BAJ99631.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 389
Score = 256 bits (654), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 146/382 (38%), Positives = 208/382 (54%), Gaps = 43/382 (11%)
Query: 3 FFSHSKLLLLSLSLSLFACSSL------------AHDFSIVGYSPEHLTSMDKLIELFES 50
FS + L+L + L CS L + + S +G H D ++ F
Sbjct: 7 MFSTCRCSSLALCVLLATCSFLMLAGCSSESLTTSSEHSDIGIDKHH----DLMMARFHV 62
Query: 51 WMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTS----YWLGLNEFADMSHEEFKN 106
WM+ ++Y EK HRF++++ N+++I+ N E T+ Y LG F D++ EEF +
Sbjct: 63 WMTVQNRSYPTSSEKAHRFKVYRSNMRYIEALNAEATTSGFTYELGEGPFTDLTDEEFIS 122
Query: 107 KYLGLKP------------QFPTRRQPSAEFS-----YRDVKA-LPKSVDWRKKGAVTPV 148
Y G P Q T S + Y + A P +DWRK+GAVTPV
Sbjct: 123 LYTGKIPDDDHREDGVHDEQIITTHAGSVNGAEGVTVYANFSAGAPIRMDWRKRGAVTPV 182
Query: 149 KNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYI 208
K+QG CGSCWAF TVA +EGI++I G L SLSEQ+L+DCD + GCNGG AF++I
Sbjct: 183 KDQGKCGSCWAFPTVATIEGIHKIKRGRLVSLSEQQLVDCDF-LDGGCNGGWPRNAFQWI 241
Query: 209 VASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAI 268
+ +GG+ Y Y EG C+ ++ I+GY+ V N E S++ +A+QP++ +I
Sbjct: 242 IQNGGITTTSSYTYKAAEGQCKGNRK--PAAKITGYRKVKSNSEVSMVNIVANQPIAASI 299
Query: 269 EASGTDFQFYSGGVFTGPCG-AELDHGVAAVGYG-KSKGSDYIIVKNSWGPKWGERGYIR 326
G FQ Y GG++ GPC ++L+H + VGYG ++ G+ Y IVKNSWG WG +GY+
Sbjct: 300 VVHGGQFQHYKGGIYNGPCATSKLNHVITIVGYGQQAYGAKYWIVKNSWGAAWGNKGYML 359
Query: 327 MKRNTGKPEGLCGINKMASIPL 348
MKR T P G CGI PL
Sbjct: 360 MKRGTKNPLGQCGIAVRPIFPL 381
>gi|413956349|gb|AFW88998.1| hypothetical protein ZEAMMB73_678859 [Zea mays]
Length = 1140
Score = 256 bits (653), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 116/184 (63%), Positives = 141/184 (76%)
Query: 155 GSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGL 214
GSCWAFST+AAVEGINQIV+G+L SLSEQEL+DCDTS+N GCNGGLMDYAF++I+ +GG+
Sbjct: 780 GSCWAFSTIAAVEGINQIVTGDLISLSEQELVDCDTSYNQGCNGGLMDYAFEFIINNGGI 839
Query: 215 HKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTD 274
E+DYPY +G C+ ++ +VVTI Y+DVP NDE+SL KA+A+QPVSVAIEA+GT
Sbjct: 840 DTEKDYPYKGTDGRCDVNRKNAKVVTIDSYEDVPANDEKSLQKAVANQPVSVAIEAAGTT 899
Query: 275 FQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKP 334
FQ YS G+FTG CG LDHGV AVGYG G DY I+KNSWG WGE G +R
Sbjct: 900 FQLYSSGIFTGSCGTALDHGVTAVGYGTENGKDYWIMKNSWGSSWGESGRAPTRRTLAPA 959
Query: 335 EGLC 338
+C
Sbjct: 960 PAVC 963
>gi|23344734|gb|AAN28680.1| cathepsin L [Theromyzon tessulatum]
Length = 351
Score = 256 bits (653), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 154/352 (43%), Positives = 202/352 (57%), Gaps = 17/352 (4%)
Query: 11 LLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMS---KHGKTYKCIEEKLH 67
+L+L C L+ D + ++ ++++ +W +H K Y IEE+
Sbjct: 1 MLTLIFVTLFCCVLSKDLHWESHRDNLYSNFQEVLDAEVAWHKFKLEHNKVYVGIEEESL 60
Query: 68 RFEIFKENLKHIDQRNKEVT----SYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSA 123
R IF N K I N S+ +G+NEFADM+ EF GLKP TR S
Sbjct: 61 RKTIFATNYKFIKDHNALHATGEKSFTVGVNEFADMTVHEFAQMMNGLKPD-STRVSGST 119
Query: 124 EFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQ 183
S LP VDWR KG V+ VKNQGSCGSCWAFST ++EG + +G + LSEQ
Sbjct: 120 YLSPNIDAPLPVEVDWRTKGLVSEVKNQGSCGSCWAFSTTGSLEGQHMRKTGTMVDLSEQ 179
Query: 184 ELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTIS 242
L+DC TS+ N+GCNGGLM AFKYI + G+ EE YPY +G C+ KK ++ T++
Sbjct: 180 NLVDCSTSYGNDGCNGGLMTNAFKYIKDNKGIDTEEAYPYAGRDGDCKFKKNKVG-ATVT 238
Query: 243 GYQDVPENDEQSLLKALAH-QPVSVAIEASGTDFQFYSGGVFTGP-C-GAELDHGVAAVG 299
G+ ++P +E+ L +ALA PVSVAI+A+ F Y GV+ P C A+LDHGV AVG
Sbjct: 239 GFVEIPAGNEKKLQEALATVGPVSVAIDANHQSFMLYKSGVYDEPECDSAQLDHGVLAVG 298
Query: 300 YGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPE---GLCGINKMASIPL 348
YG G DY IVKNSWG WGE+GYIR T P+ G+CGI AS P+
Sbjct: 299 YGSIHGKDYYIVKNSWGTTWGEQGYIRFS-TTAVPDAIGGICGILLDASYPV 349
>gi|340368358|ref|XP_003382719.1| PREDICTED: digestive cysteine proteinase 2-like [Amphimedon
queenslandica]
Length = 329
Score = 256 bits (653), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 141/307 (45%), Positives = 184/307 (59%), Gaps = 11/307 (3%)
Query: 48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRN--KEVTSYWLGLNEFADMSHEEFK 105
F+ W K+ K Y+ E +L R I++ N K ++ N + + + +NEFAD+ EF
Sbjct: 23 FQDWKVKYNKAYETKETELARQVIWESNKKFVENHNANSDKFGFTVAMNEFADLGAGEFA 82
Query: 106 NKYLGLKPQFPTRRQPSA-EFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVA 164
N Y G+ P P+ + + + R AL SVDWRK GAVT VKNQG CG+CWAFS
Sbjct: 83 NIYNGIIPHPPSYNNTNTFKRTVRSTFALADSVDWRKSGAVTGVKNQGKCGACWAFSATG 142
Query: 165 AVEGINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKEEDYPYL 223
A+EG + I +G L SLSEQ+L+DC +SF NNGC GGLMD AF+Y+ G EE YPYL
Sbjct: 143 ALEGQHFINTGTLISLSEQQLMDCSSSFGNNGCKGGLMDNAFRYLETVAGDMTEEAYPYL 202
Query: 224 MEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAIEASGTDFQFYSGGV 282
E GTC E +V Y+D+PE DE +L +A+A P+SV+I + + FQ Y GV
Sbjct: 203 AEVGTCRYNSSEAKVKNTV-YKDIPEGDEDALQEAVATIGPISVSINSEHSSFQLYDQGV 261
Query: 283 FTGPC--GAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGI 340
+ P ++LDHGV +GYG S +DY +VKNSWG WG GYI M RN E CGI
Sbjct: 262 YYEPTCSSSKLDHGVLVIGYGTSDNNDYWLVKNSWGTNWGMDGYIMMSRN---KENNCGI 318
Query: 341 NKMASIP 347
AS P
Sbjct: 319 ATRASYP 325
>gi|402770501|gb|AFQ98385.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 256 bits (653), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 149/321 (46%), Positives = 197/321 (61%), Gaps = 18/321 (5%)
Query: 39 TSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKEN----LKHIDQRNKEVTSYWLGLN 94
+S + L +E++ + H KTY+ E+L RF+IF EN KH + K + SY LG+N
Sbjct: 18 SSQEILRTQWEAFKTTHKKTYQSHMEELLRFKIFTENSLIIAKHNAKYAKGLVSYKLGMN 77
Query: 95 EFADMSHEEFKNKYLGLKPQFPTRRQPSAEF---SYRDVKALPKSVDWRKKGAVTPVKNQ 151
+F D+ EF + G TR+ + F + + +LPK+VDWRKKGAVTPVK+Q
Sbjct: 78 QFGDLLAHEFARIFNG---HHGTRKTGGSTFLPPANVNDSSLPKAVDWRKKGAVTPVKDQ 134
Query: 152 GSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIVA 210
G CGSCWAFS ++EG + + +G L SLSEQ L+DC SF NNGC GGLM+ AFKYI A
Sbjct: 135 GQCGSCWAFSATGSLEGQHFLKNGELVSLSEQNLVDCSQSFGNNGCEGGLMEDAFKYIKA 194
Query: 211 SGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAIE 269
+ G+ E+ YPY +G C KKE++ T +GY ++ E L KA+A P+SVAI+
Sbjct: 195 NDGIDTEKSYPYEAVDGECRFKKEDVG-ATDTGYVEIKAGSEDDLKKAVATVGPISVAID 253
Query: 270 ASGTDFQFYSGGVFTGP-CGAE-LDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRM 327
AS + FQ YS GV+ P C +E LDHGV VGYG G Y +VKNSW WG++GYI M
Sbjct: 254 ASHSSFQLYSEGVYDEPECSSEDLDHGVLVVGYGVKGGKKYWLVKNSWAESWGDQGYILM 313
Query: 328 KRNTGKPEGLCGINKMASIPL 348
R+ CGI AS PL
Sbjct: 314 SRDNNNQ---CGIASQASYPL 331
>gi|357446993|ref|XP_003593772.1| Cysteine proteinase [Medicago truncatula]
gi|355482820|gb|AES64023.1| Cysteine proteinase [Medicago truncatula]
Length = 339
Score = 255 bits (652), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 139/330 (42%), Positives = 195/330 (59%), Gaps = 17/330 (5%)
Query: 31 VGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRN---KEVT 87
+G + + L + DK IE+F+ WM +HG+ YK ++E +F+IF NLK+I + N K
Sbjct: 1 MGPNLDKLPTQDKTIEIFQLWMKEHGRVYKDLDEMAKKFDIFISNLKYITETNAKRKSSN 60
Query: 88 SYWLGLNEFADMSHEEFKNKYL---GLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGA 144
+ LGL F D S EEF+ +YL + T + S + P S+DWR KG
Sbjct: 61 GFLLGLTNFTDWSSEEFQERYLHNIDMPTDIDTMKVNDVHLS---SCSAPSSLDWRSKGV 117
Query: 145 VTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYA 204
V+ +K+Q +CGSCWAFS V A+EGIN I +G L +LSEQEL+DCD + GCN G ++ A
Sbjct: 118 VSDIKDQKNCGSCWAFSAVGAIEGINAITTGKLINLSEQELLDCD-PISGGCNSGWVNKA 176
Query: 205 FKYIVASGGLHKEEDYPYLMEEGTCEDKK-EEMEVVTISGYQDVPENDEQSLLKALAHQP 263
F +++ + G+ + DYPY E+G C+ + + +I+ Y V ++D Q LL A+A QP
Sbjct: 177 FDWVIRNKGVALDNDYPYTAEKGVCKASQIPNSAISSINTYHHVEQSD-QGLLCAVAKQP 235
Query: 264 VSVAIEASGTDFQFYSGGVFTGP-C---GAELDHGVAAVGYGKSKGSDYIIVKNSWGPKW 319
VSV + A DF YS G++ GP C + +H V VGY G DY IVKN WG W
Sbjct: 236 VSVCLYAP-QDFHHYSSGIYDGPNCPVNSKDTNHCVLIVGYDSVDGQDYWIVKNQWGTSW 294
Query: 320 GERGYIRMKRNTGKPEGLCGINKMASIPLK 349
G GY+ +KRNT K G+C IN A P+K
Sbjct: 295 GMEGYMHIKRNTNKKYGVCAINSWAYNPVK 324
>gi|254674508|dbj|BAH86062.1| cysteine protease [Haemaphysalis longicornis]
Length = 333
Score = 255 bits (652), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 151/323 (46%), Positives = 195/323 (60%), Gaps = 21/323 (6%)
Query: 39 TSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENL----KHIDQRNKEVTSYWLGLN 94
+S + L +E++ S+H K Y E+L RF+IF EN KH + K + SY L +N
Sbjct: 18 SSQEILRTEWEAFKSQHNKAYSSHVEELLRFKIFTENTLLVAKHNAKYAKGLVSYKLAMN 77
Query: 95 EFADMSHEEFK---NKYLGL--KPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVK 149
+F D+ EF N Y G K Q PT P+ + +LP +VDWRKKGAVTPVK
Sbjct: 78 KFGDLLPHEFAKMVNGYRGKQNKEQRPTFIPPAN----LNDSSLPTTVDWRKKGAVTPVK 133
Query: 150 NQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYI 208
NQG CGSCWAFST ++EG + +G L SLSEQ L+DC F N GCNGGLMD F+YI
Sbjct: 134 NQGQCGSCWAFSTTGSLEGQHFRKTGKLVSLSEQNLVDCSDDFGNQGCNGGLMDNGFQYI 193
Query: 209 VASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVA 267
A+GG+ EE +PY ++G C+ KK ++ T +G+ D+ + E L KA+A PVSVA
Sbjct: 194 KANGGIDTEESHPYTAQDGDCKFKKADVG-ATDAGFVDIQQGSEDDLKKAVATVGPVSVA 252
Query: 268 IEASGTDFQFYSGGVFTGP--CGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYI 325
I+AS FQ YS GV+ P ++LDHGV VGYG G Y +VKNSWG WG+ GYI
Sbjct: 253 IDASHGSFQLYSQGVYDEPDCSSSQLDHGVLTVGYGVKNGKKYWLVKNSWGGDWGDNGYI 312
Query: 326 RMKRNTGKPEGLCGINKMASIPL 348
M R+ + CGI AS PL
Sbjct: 313 LMSRD---KDNQCGIASSASYPL 332
>gi|356552228|ref|XP_003544471.1| PREDICTED: LOW QUALITY PROTEIN: cysteine proteinase RD21a-like
[Glycine max]
Length = 351
Score = 255 bits (652), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 143/329 (43%), Positives = 194/329 (58%), Gaps = 16/329 (4%)
Query: 10 LLLSLSLSLFACSSLAHDFSIVGYSPEHLTSM-----DKLIELFESWMSKHGKTYKCIEE 64
+ + L +FA SS A D SI+ + H D+++ +FE W+ KH K Y + E
Sbjct: 3 MAIVLLFMVFAVSS-ALDMSIISHDNAHADRATRRTDDEVMSMFEEWLVKHDKVYNALGE 61
Query: 65 KLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGL---KPQFPTRRQP 121
K RF+IFK NL+ ID+RN +Y LGLN FAD+++ E++ YL P+ P
Sbjct: 62 KEKRFQIFKNNLRFIDERNSLNRTYKLGLNVFADLTNAEYRAMYLRTWDDGPRLDLDTPP 121
Query: 122 SAEFSYRDVKALPKSVDWRKKGAVTPVKNQG-SCGSCWAFSTVAAVEGINQIVSGNLTSL 180
+ R +PKSVDWRK+GAVTPVKNQG +C SCWAF+ V AVE + +I +G+L SL
Sbjct: 122 RNHYVPRVGDTIPKSVDWRKEGAVTPVKNQGATCNSCWAFTAVGAVESLVKIKTGDLISL 181
Query: 181 SEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVT 240
SEQE++DC TS + GC GG + + + YI G+ E+DYPY +EG C+ K+ +VT
Sbjct: 182 SEQEVVDCTTSSSRGCGGGDIQHGYIYI-RKNGISLEKDYPYRGDEGKCDSNKKN-AIVT 239
Query: 241 ISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGY 300
I G+ VP E++L +AL D F GVF G CG EL+H + VGY
Sbjct: 240 IDGHGWVPTQLEEALNRALFCYCAYFLY----VDKFFLCQGVFKGKCGTELNHALLLVGY 295
Query: 301 GKSKGSDYIIVKNSWGPKWGERGYIRMKR 329
G K DY I KNS+ KWGE GYIR++R
Sbjct: 296 GTEKDGDYWIAKNSYSDKWGENGYIRIQR 324
>gi|7523482|dbj|BAA94210.1| putative cysteine protease [Oryza sativa Japonica Group]
gi|10800060|dbj|BAB16480.1| putative cysteine protease [Oryza sativa Japonica Group]
Length = 349
Score = 255 bits (652), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 136/308 (44%), Positives = 185/308 (60%), Gaps = 19/308 (6%)
Query: 46 ELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQ-RNKEVTSYWLGLNEFADMSHEEF 104
++FE WM+K GK Y C EK +RF +F++N++ I R + L +N+FAD++++EF
Sbjct: 39 QMFEEWMAKFGKKYPCHGEKEYRFGVFRDNVRFIRSYRPPAGYNSALRVNQFADLTNDEF 98
Query: 105 KNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVA 164
+ + G KP P + P D LP +DWR KGAVT VK+QG+CGSCWAF+ VA
Sbjct: 99 VSTHTGAKPPCP-KDAPRGV----DPIWLPCCIDWRYKGAVTDVKDQGACGSCWAFAAVA 153
Query: 165 AVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLM 224
A+EG+ QI +G LT LSEQEL+DCDT ++GC GG D AF+ + A GG+ E Y Y
Sbjct: 154 AIEGLTQIRTGKLTPLSEQELVDCDTG-SSGCAGGHTDRAFELVAAKGGITAESGYRYEG 212
Query: 225 EEGTCE-DKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVF 283
G C D I G++ VP DE+ L A+A QPV+ I+ASG FQFY GVF
Sbjct: 213 YRGKCRADDALFNHAARIGGHRAVPPGDERQLATAVARQPVTAYIDASGPAFQFYGSGVF 272
Query: 284 TGPCGA---------ELDHGVAAVGYGK--SKGSDYIIVKNSWGPKWGERGYIRMKRNTG 332
GPCG+ +H V VGY + + G Y + KNSWG WGE+GYI ++++
Sbjct: 273 PGPCGSGSGAAAAAPTTNHAVTLVGYCQDGASGKKYWVAKNSWGKTWGEKGYILLEKDVA 332
Query: 333 KPEGLCGI 340
P G CG+
Sbjct: 333 SPHGTCGV 340
>gi|59798093|sp|P84346.1|MEX1_JACME RecName: Full=Mexicain
Length = 214
Score = 255 bits (651), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 117/216 (54%), Positives = 161/216 (74%), Gaps = 6/216 (2%)
Query: 134 PKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFN 193
P+S+DWR+KGAVTPVKNQ CGSCWAFSTVA +EGIN+I++G L SLSEQEL+DC+ +
Sbjct: 2 PESIDWREKGAVTPVKNQNPCGSCWAFSTVATIEGINKIITGQLISLSEQELLDCEYR-S 60
Query: 194 NGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQ 253
+GC+GG + +Y+V +G +H E +YPY ++G C K ++ V I+GY+ VP NDE
Sbjct: 61 HGCDGGYQTPSLQYVVDNG-VHTEREYPYEKKQGRCRAKDKKGPKVYITGYKYVPANDEI 119
Query: 254 SLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKN 313
SL++A+A+QPVSV ++ G FQFY GG++ GPCG DH V AVGYGK+ Y+++KN
Sbjct: 120 SLIQAIANQPVSVVTDSRGRGFQFYKGGIYEGPCGTNTDHAVTAVGYGKT----YLLLKN 175
Query: 314 SWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
SWGP WGE+GYIR+KR +G+ +G CG+ + P+K
Sbjct: 176 SWGPNWGEKGYIRIKRASGRSKGTCGVYTSSFFPIK 211
>gi|4469159|emb|CAB38317.1| chymopapain isoform V [Carica papaya]
Length = 227
Score = 255 bits (651), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 121/216 (56%), Positives = 151/216 (69%), Gaps = 2/216 (0%)
Query: 134 PKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFN 193
P+S+DWR KGAVTPVKNQG+CGSCWAFST+A VEGIN+IV+GNL LSEQEL+DCD +
Sbjct: 2 PQSIDWRAKGAVTPVKNQGACGSCWAFSTIATVEGINKIVTGNLLELSEQELVDCD-KHS 60
Query: 194 NGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQ 253
GC GG + +Y VA+ G+H + YP ++ C + V I+GY+ VP N E
Sbjct: 61 YGCKGGYQTTSLQY-VANNGVHTSKVYPCQAKQYKCRATDKPGPKVKITGYKRVPSNCET 119
Query: 254 SLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKN 313
S L ALA+QP+S +EA G FQ Y GVF GPCG +LDH V AVGYG S G +YII+KN
Sbjct: 120 SFLGALANQPLSFLVEAGGKPFQLYKSGVFDGPCGTKLDHAVTAVGYGTSDGKNYIIIKN 179
Query: 314 SWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
SWGP WGE GY+R+KR +G +G CG+ K + P K
Sbjct: 180 SWGPNWGEEGYMRLKRQSGNSQGTCGVYKSSYYPFK 215
>gi|402770505|gb|AFQ98387.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 255 bits (651), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 149/321 (46%), Positives = 198/321 (61%), Gaps = 18/321 (5%)
Query: 39 TSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKEN----LKHIDQRNKEVTSYWLGLN 94
+S + L +E++ + H KTY+ E+L RF+IF EN KH + K + SY LG+N
Sbjct: 18 SSQEILRTQWEAFKTTHKKTYQSHMEELLRFKIFTENSLIIAKHNAKYAKGLVSYKLGMN 77
Query: 95 EFADMSHEEFKNKYLGLKPQFPTRRQPSAEF---SYRDVKALPKSVDWRKKGAVTPVKNQ 151
+F D+ EF + G + TR+ + F + + +LPK+VDWRKKGAVTPVK+Q
Sbjct: 78 QFGDLLAHEFARIFNGHRG---TRKTGGSTFLPPANVNDSSLPKAVDWRKKGAVTPVKDQ 134
Query: 152 GSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIVA 210
G CGSCWAFS ++EG + + +G L SLSEQ L+DC SF NNGC GGLM+ AFKYI A
Sbjct: 135 GQCGSCWAFSATGSLEGQHFLKNGELVSLSEQNLVDCSQSFGNNGCEGGLMEDAFKYIKA 194
Query: 211 SGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAIE 269
+ G+ E+ YPY +G C KKE++ T +GY ++ E L KA+A P+SVAI+
Sbjct: 195 NDGIDTEKSYPYEAVDGECRFKKEDVG-ATDTGYVEIKAGSEVDLKKAVATVGPISVAID 253
Query: 270 ASGTDFQFYSGGVFTGP-CGAE-LDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRM 327
AS + FQ YS GV+ P C +E LDHGV VGYG G Y +VKNSW WG++GYI M
Sbjct: 254 ASHSSFQLYSEGVYDEPECSSEDLDHGVLVVGYGVKGGKKYWLVKNSWAESWGDQGYILM 313
Query: 328 KRNTGKPEGLCGINKMASIPL 348
R+ CGI AS PL
Sbjct: 314 SRDNNNQ---CGIASQASYPL 331
>gi|390337645|ref|XP_001199228.2| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
Length = 333
Score = 255 bits (651), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 143/312 (45%), Positives = 191/312 (61%), Gaps = 14/312 (4%)
Query: 46 ELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT----SYWLGLNEFADMSH 101
E ++ W ++HGK Y EE+ R I+++NL + + N + +Y LG+N+FAD+ +
Sbjct: 26 EDWKEWKNEHGKRYLSDEEEASRRLIWQKNLDIVIRHNLKYDLGHFTYDLGMNQFADLQN 85
Query: 102 EEFKNKYLGLKPQFPTRRQPSAEF-SYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAF 160
+EF G + ++ + F +V LPK+VDWR KG VTPVK+QG CGSCWAF
Sbjct: 86 KEFVAMMTGFRVNGTSKAAKGSTFLPPNNVGKLPKTVDWRTKGYVTPVKDQGQCGSCWAF 145
Query: 161 STVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDY 220
S ++EG + +G L SLSEQ L+DC N GCNGGLMD AF+YI+ +GG+ EE Y
Sbjct: 146 SATGSLEGQHFKKTGKLVSLSEQNLVDCSDK-NYGCNGGLMDRAFQYIIDAGGIDTEESY 204
Query: 221 PYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAIEASGTDFQFYS 279
PY+ +G C K + T++GY DV E++L KA+AH P+SVAI+AS FQ Y
Sbjct: 205 PYIAMDGNCHFKTANVG-ATVTGYTDVTSGSEKALQKAVAHIGPISVAIDASHFSFQLYQ 263
Query: 280 GGVFTGP-CGAE-LDHGVAAVGYGKS-KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEG 336
GV+ P C + LDHGV AVGYG + G+DY IVKNSW WG GYI M RN +
Sbjct: 264 SGVYNEPGCSSTLLDHGVLAVGYGTTIDGTDYWIVKNSWAETWGMNGYIWMSRN---KDN 320
Query: 337 LCGINKMASIPL 348
CGI AS PL
Sbjct: 321 QCGIATQASYPL 332
>gi|7239343|gb|AAF43193.1|AF228731_1 cathepsin L [Stylonychia lemnae]
Length = 340
Score = 254 bits (650), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 141/317 (44%), Positives = 193/317 (60%), Gaps = 9/317 (2%)
Query: 34 SPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEV--TSYWL 91
S + L + D+ F +MS+ K YK EE R + +K N+ I+ N + TS+ L
Sbjct: 28 SSQSLYTADQDHIDFVHFMSRFSKAYKSKEEFEMRLQQYKSNIAFINNHNSQNDGTSFTL 87
Query: 92 GLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQ 151
G N AD +H+E+K K LG KP+ T ++ +S ++K +P+S+DWR+KGAV VK+Q
Sbjct: 88 GPNHLADYTHDEYK-KMLGYKPRNKTGKEV---YSTPNLKDIPESIDWREKGAVNAVKDQ 143
Query: 152 GSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVAS 211
G CGSCWAFST+A++E I +G L SLSEQ+L+DC + N GCNGG M A YI ++
Sbjct: 144 GQCGSCWAFSTIASLESRYFIETGKLQSLSEQQLVDCSKNGNEGCNGGDMGLAMDYIASA 203
Query: 212 GGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEAS 271
GG+ E+DYPY+ ++ TC + + EV T G+ ++ +L A+A PVSVAIEA
Sbjct: 204 GGVETEKDYPYVGKDQTCAFEASK-EVATDKGHINIVPGKFATLQAAIAEGPVSVAIEAD 262
Query: 272 GTDFQFYSGGVFTGP-CGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRN 330
FQFY G+F CG LDHGVAAVGYG G Y IV+NSW WG +GYI + N
Sbjct: 263 SLFFQFYRSGIFDSSWCGTNLDHGVAAVGYGVDNGKQYYIVRNSWSDSWGLKGYINIIAN 322
Query: 331 TGKPEGLCGINKMASIP 347
G G+CGI +P
Sbjct: 323 -GDGNGMCGIQMEPVVP 338
>gi|59798094|sp|P84347.1|MEX2_JACME RecName: Full=Chymomexicain
Length = 215
Score = 254 bits (650), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 118/216 (54%), Positives = 158/216 (73%), Gaps = 5/216 (2%)
Query: 134 PKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFN 193
P+S+DWR KGAVTPVKNQ CGSCWAFSTVA VEGIN+I +G L SLSEQEL+DCD +
Sbjct: 2 PESIDWRDKGAVTPVKNQNPCGSCWAFSTVATVEGINKIRTGKLISLSEQELLDCDRR-S 60
Query: 194 NGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQ 253
+GC GG + +Y+ +GG+H E++YPY ++G C K+++ V I+GY+ VP NDE
Sbjct: 61 HGCKGGYQTGSIQYVADNGGVHTEKEYPYEKKQGKCRAKEKKGTKVQITGYKRVPANDEI 120
Query: 254 SLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKN 313
SL++ + +QPVSV E+ G FQ Y GG+F GPCG + DH V A+GYGK++ ++ KN
Sbjct: 121 SLIQGIGNQPVSVLHESKGRAFQLYKGGIFNGPCGYKNDHAVTAIGYGKAQ----LLDKN 176
Query: 314 SWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
SWGP WGE+GYI++KR +GK EG CG+ K + P+K
Sbjct: 177 SWGPNWGEKGYIKIKRASGKSEGTCGVYKSSYFPIK 212
>gi|391336140|ref|XP_003742440.1| PREDICTED: cathepsin L-like [Metaseiulus occidentalis]
Length = 330
Score = 254 bits (650), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 147/308 (47%), Positives = 192/308 (62%), Gaps = 15/308 (4%)
Query: 48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNK---EVTSYWLGLNEFADMSHEEF 104
+ ++ S H K+Y+ +E+L R IF++NL I++ N+ + + LG+NEFADM++ EF
Sbjct: 28 WNAFKSTHLKSYRDGQEELIRRFIFEDNLHTIEEFNRVNASLAGFTLGVNEFADMTNTEF 87
Query: 105 KNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVA 164
N LGL + + + F V+ LP VDW +KG VT VKNQG CGSCWAFST
Sbjct: 88 SNMLLGLGGR--NKIAGDSVFESSHVQDLPAEVDWTQKGYVTEVKNQGQCGSCWAFSTTG 145
Query: 165 AVEGINQIVSGNLTSLSEQELIDCDTS-FNNGCNGGLMDYAFKYIVASGGLHKEEDYPYL 223
++EG +G L SLSEQ L+DC TS N GCNGGLMD AF YI +GG+ E YPY
Sbjct: 146 SLEGQVFKKTGKLVSLSEQNLVDCSTSEGNQGCNGGLMDQAFTYIKKNGGIDTEAAYPYT 205
Query: 224 MEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAIEASGTDFQFYSGGV 282
+GTC + ++ T+SG+ DV DE +L +A+A P+SVAI+AS FQFY GGV
Sbjct: 206 GSDGTCRFLENKVG-ATVSGFVDVKSGDENALKEAVATVGPISVAIDASSIFFQFYRGGV 264
Query: 283 FTGP--CGA-ELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCG 339
+ P C + ELDHGV VGYG G DY +VKNSWG WG +GYI+M RN + CG
Sbjct: 265 YN-PWFCSSTELDHGVLVVGYGTEGGKDYWLVKNSWGSSWGLKGYIKMVRN---KKNRCG 320
Query: 340 INKMASIP 347
I AS P
Sbjct: 321 IATQASYP 328
>gi|156399477|ref|XP_001638528.1| predicted protein [Nematostella vectensis]
gi|156225649|gb|EDO46465.1| predicted protein [Nematostella vectensis]
Length = 325
Score = 254 bits (650), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 141/305 (46%), Positives = 182/305 (59%), Gaps = 11/305 (3%)
Query: 48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNK 107
+ +W HGKTY EE L R I+ +NL+ + + N E SY L +N FAD++ EFK +
Sbjct: 27 WHAWKDFHGKTYTGEEEDLRR-AIWNDNLEIVKKHNAENHSYKLDMNHFADLTVTEFKQR 85
Query: 108 YLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVE 167
++G + + + F LP VDWR KG VT VKNQG CGSCWAFS+ ++E
Sbjct: 86 FMGYRA--ASNSTGGSTFLPLSNVQLPAEVDWRDKGFVTAVKNQGQCGSCWAFSSTGSLE 143
Query: 168 GINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEE 226
G + +G L SLSEQ L+DC + NNGC GGLMDYAFKYI + G+ E+ YPY +
Sbjct: 144 GQHFRKTGKLVSLSEQNLVDCSKKYGNNGCEGGLMDYAFKYIKNNDGIDTEQSYPYTARD 203
Query: 227 GTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAIEASGTDFQFYSGGVFTG 285
G C K + T++GY DV E L A+A P+SVAI+A + FQ Y GV++
Sbjct: 204 GQCHFKPGSVG-ATVTGYTDVQRGSEGDLQSAVATVGPISVAIDAGHSSFQLYKTGVYSE 262
Query: 286 P-CGA-ELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKM 343
P C + +LDHGV AVGYG G DY +VKNSWG WG GYI+M RN + CGI
Sbjct: 263 PDCSSTQLDHGVLAVGYGAEDGKDYWLVKNSWGEGWGMNGYIKMSRN---KDNQCGIATQ 319
Query: 344 ASIPL 348
AS PL
Sbjct: 320 ASYPL 324
>gi|219112639|ref|XP_002178071.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217410956|gb|EEC50885.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 360
Score = 254 bits (649), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 141/332 (42%), Positives = 196/332 (59%), Gaps = 28/332 (8%)
Query: 43 KLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKE--VTSYWLGLNEFADMS 100
+L+ F+ W+ H K Y + K+ R I+ N + I+ N + S+ LG NEF+DM+
Sbjct: 29 ELMSKFKGWVDFHQKMYDSHDNKMERLNIWLNNDERIEAHNNQNPTPSFALGHNEFSDMT 88
Query: 101 HEEFKNKYLGLKPQFPTRRQPSAEFSYRDVK------------------ALPKSVDWRKK 142
+EF +Y L P R++ +A+ D LP ++W +
Sbjct: 89 EDEFA-QYFRLGPYASVRQKEAAQAKIMDPDQQISTAERRRLWEEQAPLTLPDYMNWVQA 147
Query: 143 GAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMD 202
GAVTP+KNQG+CGSCWAFST A+EG + +G L +LSEQ LIDCD + GCNGGLMD
Sbjct: 148 GAVTPMKNQGACGSCWAFSTTGALEGAKFLKTGELVALSEQHLIDCD-KVDLGCNGGLMD 206
Query: 203 YAFKYIVASGGLHKEEDYPYLMEEG-TCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH 261
AFK+ ++ GL EE+YPYL ++ TC ++E + + DVP DE++LL A+A
Sbjct: 207 NAFKFDMSEAGLCSEEEYPYLAKQSRTCMTNCTKVEGSGVKTFIDVPPGDEKALLSAIAM 266
Query: 262 QPVSVAIEASGTDFQFYSGGVFT-GPCG--AELDHGVAAVGYGKSKGSD--YIIVKNSWG 316
QP+SVAI+AS FQFY GV T CG A +DHGV AVGYG ++ Y +VKNSWG
Sbjct: 267 QPISVAIQASQFVFQFYKNGVLTDDSCGSRASIDHGVLAVGYGTDVDTNEPYFLVKNSWG 326
Query: 317 PKWGERGYIRMKRNTGKPEGLCGINKMASIPL 348
WG++GY+++ R G+C I KMAS P+
Sbjct: 327 ETWGDKGYVKLGRGGKNEFGMCAILKMASFPV 358
>gi|195429415|ref|XP_002062758.1| GK19626 [Drosophila willistoni]
gi|194158843|gb|EDW73744.1| GK19626 [Drosophila willistoni]
Length = 341
Score = 254 bits (649), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 155/357 (43%), Positives = 207/357 (57%), Gaps = 34/357 (9%)
Query: 8 KLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLH 67
+ L++L ++L A + V YS + + E + ++ +H K Y E+
Sbjct: 2 RFALITLLIALVAMTQ------AVSYS-------ELVREEWNTFKLEHRKNYADSTEETF 48
Query: 68 RFEIFKENLKHIDQRNKEV----TSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSA 123
R +IF EN HI + N+ SY L LN++ADM H EF+ G + + +
Sbjct: 49 RMKIFNENKHHIAKHNQRYATGEVSYKLALNKYADMLHHEFRETMNGFNYTLHKQLRSTD 108
Query: 124 E-------FSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGN 176
E S VK LP +VDWR KGAVT VK+QG CGSCWAFS+ A+EG + SG
Sbjct: 109 ESFTGVTFISPEHVK-LPTAVDWRTKGAVTEVKDQGHCGSCWAFSSTGAIEGQHFRKSGT 167
Query: 177 LTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEE 235
L SLSEQ L+DC T + NNGCNGGLMD AF+Y+ +GG+ E+ Y Y + +C K
Sbjct: 168 LVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYVKDNGGIDTEKSYAYEGIDDSCHFDKNS 227
Query: 236 MEVVTISGYQDVPENDEQSLLKALAH-QPVSVAIEASGTDFQFYSGGVFTGP-CGAE-LD 292
+ T G+ D+P+ +E+ L +A+A PVSVAI+AS FQFYS GV+ P C AE LD
Sbjct: 228 IG-ATDRGFADIPQGNEKKLAQAVATIGPVSVAIDASQQSFQFYSEGVYDEPNCSAENLD 286
Query: 293 HGVAAVGYGKSK-GSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPL 348
HGV VGYG K GSDY +VKNSWG WG++G+I+M RN E CGI +S PL
Sbjct: 287 HGVLVVGYGTEKDGSDYWLVKNSWGTTWGDKGFIKMSRN---KENQCGIASASSYPL 340
>gi|242072388|ref|XP_002446130.1| hypothetical protein SORBIDRAFT_06g002130 [Sorghum bicolor]
gi|241937313|gb|EES10458.1| hypothetical protein SORBIDRAFT_06g002130 [Sorghum bicolor]
Length = 276
Score = 254 bits (649), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 134/281 (47%), Positives = 180/281 (64%), Gaps = 31/281 (11%)
Query: 73 KENLKHIDQRNKEVTS-YWLGLNEFADMSHEEFK-NKYLGLKPQFPTRRQPSAEFSYRD- 129
++N+ ++ N + +WLG+N+FAD++ EEFK NK G KP + P+ F Y +
Sbjct: 19 RDNVAFVESFNANKNNKFWLGVNQFADLTTEEFKANK--GFKPT-SAEKVPTTGFKYENL 75
Query: 130 -VKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDC 188
V ALP +VDWR KGAVTP+KNQG CG CWAFS VAA+EGI ++ +GNL SLS+QEL+DC
Sbjct: 76 SVSALPTAVDWRTKGAVTPIKNQGQCGCCWAFSAVAAMEGIVKLSTGNLISLSKQELVDC 135
Query: 189 DT-SFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDV 247
DT S + GC E PY +G C K TI G++DV
Sbjct: 136 DTHSMDEGC--------------------EVQLPYKAVDGKC--KGGSKSAATIKGHEDV 173
Query: 248 PENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYG-KSKGS 306
P N+E +L+KA+A+QPVSVA++AS F YSGGV TG CG ELDHG+AA+GYG +S G+
Sbjct: 174 PVNNEAALMKAVANQPVSVAVDASDRTFMLYSGGVMTGSCGTELDHGIAAIGYGMESDGT 233
Query: 307 DYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
Y I+KNSWG WGE+G++RM+++ G+CG+ S P
Sbjct: 234 KYWILKNSWGTTWGEKGFLRMEKDITDKRGMCGLAMKPSYP 274
>gi|261289783|ref|XP_002611753.1| hypothetical protein BRAFLDRAFT_236364 [Branchiostoma floridae]
gi|229297125|gb|EEN67763.1| hypothetical protein BRAFLDRAFT_236364 [Branchiostoma floridae]
Length = 307
Score = 254 bits (649), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 144/297 (48%), Positives = 182/297 (61%), Gaps = 15/297 (5%)
Query: 63 EEKLHRFEIFKENLKHIDQRNKEVT----SYWLGLNEFADMSHEEFKNKYLG--LKPQFP 116
+E+ R EIF+ N K I+ N E +YWLG N+FA M+++EF +G L +
Sbjct: 14 KEESRRMEIFENNTKLINLHNNEADLGMHTYWLGHNQFAHMTNDEFVANVIGGCLLDRNA 73
Query: 117 TRRQPSAEFSY-RDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSG 175
++ Y ++ LP +VDWR KG VTPVKNQ CGSCWAFST ++EG +G
Sbjct: 74 SKSTADRVHQYDSNLVELPDTVDWRTKGYVTPVKNQEQCGSCWAFSTTGSLEGQTFKKTG 133
Query: 176 NLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKE 234
L SLSEQ L+DC F N GCNGGLMD AFKYI A+GG+ E+ YPY +G C K
Sbjct: 134 KLVSLSEQNLVDCSGEFGNQGCNGGLMDDAFKYIKANGGIDTEDSYPYEARDGKCRFKPA 193
Query: 235 EMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAIEASGTDFQFYSGGVFTGP-CGA-EL 291
++ T++GY D+ E DE +L +A+A P+SVAI+AS FQ YS GV+ P C + EL
Sbjct: 194 DVG-ATVTGYTDISEGDEGALTQAVATVGPISVAIDASHHTFQMYSHGVYYEPQCSSTEL 252
Query: 292 DHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPL 348
DHGV AVGYG G DY +VKNSWG WG+ GYI M RN CGI AS PL
Sbjct: 253 DHGVLAVGYGTEGGKDYWLVKNSWGEVWGQNGYIMMSRNKNNQ---CGIATSASYPL 306
>gi|402770517|gb|AFQ98393.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 254 bits (649), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 149/321 (46%), Positives = 196/321 (61%), Gaps = 18/321 (5%)
Query: 39 TSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKEN----LKHIDQRNKEVTSYWLGLN 94
+S + L +E++ + H KTY+ E+L RF+IF EN KH + K + SY LG+N
Sbjct: 18 SSQEILRTQWEAFKTTHKKTYQSHMEELLRFKIFTENSLIIAKHNAKYAKGLVSYKLGMN 77
Query: 95 EFADMSHEEFKNKYLGLKPQFPTRRQPSAEF---SYRDVKALPKSVDWRKKGAVTPVKNQ 151
+F D+ EF + G TR+ + F + + +LPK VDWRKKGAVTPVK+Q
Sbjct: 78 QFGDLLAHEFARIFNG---HHGTRKTGGSSFLPPANVNDSSLPKVVDWRKKGAVTPVKDQ 134
Query: 152 GSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIVA 210
G CGSCWAFS ++EG + + +G L SLSEQ L+DC SF NNGC GGLM+ AFKYI A
Sbjct: 135 GQCGSCWAFSATGSLEGQHFLKNGELVSLSEQNLVDCSQSFGNNGCEGGLMEDAFKYIKA 194
Query: 211 SGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAIE 269
+ G+ E+ YPY +G C KKE++ T +GY ++ E L KA+A P+SVAI+
Sbjct: 195 NDGIDTEKSYPYKAVDGECRFKKEDVG-ATDTGYVEIKAGSEVDLKKAVATVGPISVAID 253
Query: 270 ASGTDFQFYSGGVFTGP-CGAE-LDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRM 327
AS + FQ YS GV+ P C +E LDHGV VGYG G Y +VKNSW WG++GYI M
Sbjct: 254 ASHSSFQLYSEGVYDEPECSSEDLDHGVLVVGYGVKGGKKYWLVKNSWAESWGDQGYILM 313
Query: 328 KRNTGKPEGLCGINKMASIPL 348
R+ CGI AS PL
Sbjct: 314 SRDNNNQ---CGIASQASYPL 331
>gi|410898132|ref|XP_003962552.1| PREDICTED: cathepsin L-like [Takifugu rubripes]
Length = 335
Score = 254 bits (648), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 146/312 (46%), Positives = 189/312 (60%), Gaps = 15/312 (4%)
Query: 48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRN----KEVTSYWLGLNEFADMSHEE 103
F +W K G++Y+ E++ R +I+ N K + N + + SY LG+ +FADM +EE
Sbjct: 27 FHAWKLKFGRSYRTPSEEVQRMQIWLNNRKLVLVHNILADQGIKSYRLGMTQFADMDNEE 86
Query: 104 FKNKY-LGLKPQFPTR--RQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAF 160
+K+ LG F T R+ SA F + LP +VDWR KG VT VK+Q CGSCWAF
Sbjct: 87 YKSLISLGCLRAFNTSAPRRGSAFFRLAEGTHLPTTVDWRDKGYVTGVKDQKQCGSCWAF 146
Query: 161 STVAAVEGINQIVSGNLTSLSEQELIDCDTSFNN-GCNGGLMDYAFKYIVASGGLHKEED 219
S ++EG N +G L SLSEQ+L+DC + N GCNGGLMDYAFKYI +GG+ E+
Sbjct: 147 SATGSLEGQNFRKTGKLVSLSEQQLVDCSGDYGNMGCNGGLMDYAFKYIQENGGIDTEKS 206
Query: 220 YPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAIEASGTDFQFY 278
YPY E+G C K E + +GY DV DE +L +A+A PVSV I+AS + FQ Y
Sbjct: 207 YPYEAEDGQCRFKPENVG-AKCTGYVDVTVGDEDALKEAVATIGPVSVGIDASHSSFQLY 265
Query: 279 SGGVFT-GPCGAE-LDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEG 336
GV+ C ++ LDHGV AVGYG G DY +VKNSWG WG+ GYI M RN +
Sbjct: 266 DSGVYDEQDCSSQDLDHGVLAVGYGTDNGQDYWLVKNSWGLGWGQEGYIMMSRN---KDN 322
Query: 337 LCGINKMASIPL 348
CGI AS PL
Sbjct: 323 QCGIATAASYPL 334
>gi|218187750|gb|EEC70177.1| hypothetical protein OsI_00904 [Oryza sativa Indica Group]
gi|222617983|gb|EEE54115.1| hypothetical protein OsJ_00884 [Oryza sativa Japonica Group]
Length = 327
Score = 254 bits (648), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 136/308 (44%), Positives = 185/308 (60%), Gaps = 19/308 (6%)
Query: 46 ELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQ-RNKEVTSYWLGLNEFADMSHEEF 104
++FE WM+K GK Y C EK +RF +F++N++ I R + L +N+FAD++++EF
Sbjct: 17 QMFEEWMAKFGKKYPCHGEKEYRFGVFRDNVRFIRSYRPPAGYNSALRVNQFADLTNDEF 76
Query: 105 KNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVA 164
+ + G KP P + P D LP +DWR KGAVT VK+QG+CGSCWAF+ VA
Sbjct: 77 VSTHTGAKPPCP-KDAPRGV----DPIWLPCCIDWRYKGAVTDVKDQGACGSCWAFAAVA 131
Query: 165 AVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLM 224
A+EG+ QI +G LT LSEQEL+DCDT ++GC GG D AF+ + A GG+ E Y Y
Sbjct: 132 AIEGLTQIRTGKLTPLSEQELVDCDTG-SSGCAGGHTDRAFELVAAKGGITAESGYRYEG 190
Query: 225 EEGTCE-DKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVF 283
G C D I G++ VP DE+ L A+A QPV+ I+ASG FQFY GVF
Sbjct: 191 YRGKCRADDALFNHAARIGGHRAVPPGDERQLATAVARQPVTAYIDASGPAFQFYGSGVF 250
Query: 284 TGPCGA---------ELDHGVAAVGYGK--SKGSDYIIVKNSWGPKWGERGYIRMKRNTG 332
GPCG+ +H V VGY + + G Y + KNSWG WGE+GYI ++++
Sbjct: 251 PGPCGSGSGAAAAAPTTNHAVTLVGYCQDGASGKKYWVAKNSWGKTWGEKGYILLEKDVA 310
Query: 333 KPEGLCGI 340
P G CG+
Sbjct: 311 SPHGTCGV 318
>gi|324983200|gb|ADY68475.1| stem bromelain [Ananas comosus]
Length = 291
Score = 254 bits (648), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 119/263 (45%), Positives = 175/263 (66%), Gaps = 5/263 (1%)
Query: 42 DKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQ-RNKEVTSYWLGLNEFADMS 100
D +++ FE WM+++G+ YK +EK+ RF+IFK N+ HI+ N+ SY LG+N+F DM+
Sbjct: 31 DPMMKRFEEWMAEYGRVYKDNDEKMRRFQIFKNNVNHIETFNNRNGNSYTLGINKFTDMT 90
Query: 101 HEEFKNKYLG-LKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWA 159
+ EF +Y G + ++P F ++ A+ +S+DWR GAVT VK+Q CGSCWA
Sbjct: 91 NNEFVAQYTGGISRPLNIEKEPVVSFDDVNISAVGQSIDWRDYGAVTEVKDQNPCGSCWA 150
Query: 160 FSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEED 219
FS +A VEGI +IV+G L SLSEQE++DC S NGC+GG +D A+ +I+++ G+ E D
Sbjct: 151 FSAIATVEGIYKIVTGYLVSLSEQEVLDCAVS--NGCDGGFVDNAYDFIISNNGVASEAD 208
Query: 220 YPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYS 279
YPY +G C I+GY V NDE S+ A+ +QP++ AI+ASG +FQ+Y+
Sbjct: 209 YPYQAYQGDCA-ANSWPNSAYITGYSYVRSNDESSMKYAVWNQPIAAAIDASGDNFQYYN 267
Query: 280 GGVFTGPCGAELDHGVAAVGYGK 302
GGVF+GPCG L+H + +GYG+
Sbjct: 268 GGVFSGPCGTSLNHAITIIGYGQ 290
>gi|402770515|gb|AFQ98392.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 254 bits (648), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 149/321 (46%), Positives = 196/321 (61%), Gaps = 18/321 (5%)
Query: 39 TSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKEN----LKHIDQRNKEVTSYWLGLN 94
+S + L +E++ + H KTY+ E+L RF+IF EN KH + K + SY LG+N
Sbjct: 18 SSQEILRTQWEAFKTTHKKTYQSHMEELLRFKIFTENSLIIAKHNAKYAKGLVSYKLGMN 77
Query: 95 EFADMSHEEFKNKYLGLKPQFPTRRQPSAEF---SYRDVKALPKSVDWRKKGAVTPVKNQ 151
+F D+ EF + G TR+ + F + + +LPK VDWRKKGAVTPVK+Q
Sbjct: 78 QFGDLLAHEFARIFNG---HHGTRKTGGSSFLPPANVNDSSLPKVVDWRKKGAVTPVKDQ 134
Query: 152 GSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIVA 210
G CGSCWAFS ++EG + + +G L SLSEQ L+DC SF NNGC GGLM+ AFKYI A
Sbjct: 135 GQCGSCWAFSATGSLEGQHFLKNGELVSLSEQNLVDCSQSFGNNGCEGGLMEDAFKYIKA 194
Query: 211 SGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAIE 269
+ G+ E+ YPY +G C KKE++ T +GY ++ E L KA+A P+SVAI+
Sbjct: 195 NDGIDTEKSYPYEAVDGECRFKKEDVG-ATDTGYVEIKAGSEVDLKKAVATVGPISVAID 253
Query: 270 ASGTDFQFYSGGVFTGP-CGAE-LDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRM 327
AS + FQ YS GV+ P C +E LDHGV VGYG G Y +VKNSW WG++GYI M
Sbjct: 254 ASHSSFQLYSEGVYDEPECSSEDLDHGVLVVGYGVKGGKKYWLVKNSWAESWGDQGYILM 313
Query: 328 KRNTGKPEGLCGINKMASIPL 348
R+ CGI AS PL
Sbjct: 314 SRDNNNQ---CGIASQASYPL 331
>gi|156124996|gb|ABU50816.1| Ale o 1 allergen [Aleuroglyphus ovatus]
Length = 337
Score = 254 bits (648), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 141/322 (43%), Positives = 193/322 (59%), Gaps = 13/322 (4%)
Query: 35 PEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEV----TSYW 90
P L + +L FE + S G+ Y E +LHR IF+ NL+ I + N + +++
Sbjct: 20 PSMLLTEGELEAQFEQFKSTFGRVYPSPEIELHRKSIFRANLQFILRHNIDYFNGDSTFS 79
Query: 91 LGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKN 150
+ +N F D+S+EEF+ + G + + + + DV+ALP +VDW KG VTP+KN
Sbjct: 80 VSVNNFTDLSNEEFRATFNGYR-RLAAVSLADSVHADNDVEALPATVDWTTKGVVTPIKN 138
Query: 151 QGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNN-GCNGGLMDYAFKYIV 209
Q CGSCWAFS VA++EG + + +G L SLSEQ L+DC + + GC+GG MDYAFKY++
Sbjct: 139 QQQCGSCWAFSAVASMEGQHALKTGKLVSLSEQNLVDCSAAEGDMGCSGGWMDYAFKYVI 198
Query: 210 ASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAI 268
+ G+ E YPY + +CE K+ + TI + DV DE +L A+A P+SVAI
Sbjct: 199 QNRGIDTEASYPYKAIDESCEFKRNSIG-ATIHSFVDVKTGDESALQNAVASIGPISVAI 257
Query: 269 EASGTDFQFYSGGVFTGP-CGAE-LDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIR 326
+AS FQFYS GV+ P C E LDHGV AVGYG G Y VKNSWG WG++GYI
Sbjct: 258 DASQPSFQFYSSGVYNEPDCSTEILDHGVTAVGYGTLNGVPYWKVKNSWGTSWGQKGYIF 317
Query: 327 MKRNTGKPEGLCGINKMASIPL 348
M RN + CGI AS P+
Sbjct: 318 MSRN---KQNQCGIATKASYPV 336
>gi|402770511|gb|AFQ98390.1| cathepsin L [Rhipicephalus microplus]
gi|402770513|gb|AFQ98391.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 254 bits (648), Expect = 6e-65, Method: Compositional matrix adjust.
Identities = 149/321 (46%), Positives = 196/321 (61%), Gaps = 18/321 (5%)
Query: 39 TSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKEN----LKHIDQRNKEVTSYWLGLN 94
+S + L +E++ + H KTY+ E+L RF+IF EN KH + K + SY LG+N
Sbjct: 18 SSQEILRTQWEAFKTTHKKTYQSHMEELLRFKIFTENSLIIAKHNAKYAKGLVSYKLGMN 77
Query: 95 EFADMSHEEFKNKYLGLKPQFPTRRQPSAEF---SYRDVKALPKSVDWRKKGAVTPVKNQ 151
+F D+ EF + G TR+ + F + + +LPK VDWRKKGAVTPVK+Q
Sbjct: 78 QFGDLLAHEFARIFNG---HHGTRKTGGSSFLPPANVNDSSLPKVVDWRKKGAVTPVKDQ 134
Query: 152 GSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIVA 210
G CGSCWAFS ++EG + + +G L SLSEQ L+DC SF NNGC GGLM+ AFKYI A
Sbjct: 135 GQCGSCWAFSATGSLEGQHFLKNGELVSLSEQNLVDCSQSFGNNGCEGGLMEDAFKYIKA 194
Query: 211 SGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAIE 269
+ G+ E+ YPY +G C KKE++ T +GY ++ E L KA+A P+SVAI+
Sbjct: 195 NDGIDTEKSYPYEAVDGECRFKKEDVG-ATDTGYVEIKAGSEVDLKKAVATVGPISVAID 253
Query: 270 ASGTDFQFYSGGVFTGP-CGAE-LDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRM 327
AS + FQ YS GV+ P C +E LDHGV VGYG G Y +VKNSW WG++GYI M
Sbjct: 254 ASHSSFQLYSEGVYDEPECSSEDLDHGVLVVGYGVKGGKKYWLVKNSWAESWGDQGYILM 313
Query: 328 KRNTGKPEGLCGINKMASIPL 348
R+ CGI AS PL
Sbjct: 314 SRDNNNQ---CGIASQASYPL 331
>gi|340370276|ref|XP_003383672.1| PREDICTED: digestive cysteine proteinase 2-like [Amphimedon
queenslandica]
Length = 327
Score = 254 bits (648), Expect = 6e-65, Method: Compositional matrix adjust.
Identities = 136/306 (44%), Positives = 190/306 (62%), Gaps = 12/306 (3%)
Query: 48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRN--KEVTSYWLGLNEFADMSHEEFK 105
F+ W K+ K Y+ E +L R I++ N K ++ N + + + +NEFAD+ EF
Sbjct: 24 FQDWKVKYNKVYETKETELERQIIWESNKKFVENHNANSDKFGFTVAMNEFADLDAGEFG 83
Query: 106 NKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAA 165
+ GL P+ P+ + + VK +P +VDW++KGAVTP+KNQG CGSCW+FS+ +
Sbjct: 84 RIFNGLLPR-PSSYNSTNIYKPSGVK-VPDTVDWKEKGAVTPIKNQGQCGSCWSFSSTGS 141
Query: 166 VEGINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKEEDYPYLM 224
+EG + I +G L SLSEQ+L+DC T + N+GCNGGLMD +F+Y+ + G E++YPY
Sbjct: 142 LEGQHFINTGTLVSLSEQQLMDCSTKYGNHGCNGGLMDNSFRYLKSVAGDETEDNYPYTA 201
Query: 225 EEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAIEASGTDFQFYSGGV- 282
E G C + VVT Y D+P+ DE SL A+A+ P+SVAI+AS + FQ Y+ GV
Sbjct: 202 ENGVCR-YDSSLAVVTDKSYVDIPQGDEDSLKDAVANVGPISVAIDASHSSFQLYNSGVY 260
Query: 283 FTGPCGA-ELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGIN 341
+ C + +LDHGV A+GYG G DY +VKNSWG WG GYI+M RN CGI
Sbjct: 261 YASTCSSTQLDHGVLAIGYGTEDGKDYWLVKNSWGTSWGMEGYIKMSRNRNNN---CGIA 317
Query: 342 KMASIP 347
AS P
Sbjct: 318 TQASYP 323
>gi|18308182|gb|AAL67857.1|AF462309_1 cysteine proteinase [Acanthamoeba healyi]
Length = 330
Score = 253 bits (647), Expect = 7e-65, Method: Compositional matrix adjust.
Identities = 133/313 (42%), Positives = 183/313 (58%), Gaps = 15/313 (4%)
Query: 42 DKLIELFESWMSKHGKT-YKCI---EEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFA 97
D L +F WM ++ K+ Y+ + EE ++R+ ++++ ++ N++ SY+L +N+F
Sbjct: 24 DPLTGVFAKWMRENTKSNYRFVYSNEEFIYRWNVWRD-----EEHNRQNKSYFLAMNQFG 78
Query: 98 DMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSC 157
D+++ EF + GL + + +P DWR+KGAVT VKNQG CGSC
Sbjct: 79 DLTNAEFNRLFKGLAFDYSKHAKIHTAAPEAPATGIPSEFDWRQKGAVTHVKNQGQCGSC 138
Query: 158 WAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHK 216
W+FST + EG N + +G L SLSEQ LIDC S+ NNGCNGGLMDYAF+YI+ + G+
Sbjct: 139 WSFSTTGSTEGANFLKTGRLVSLSEQNLIDCSVSYGNNGCNGGLMDYAFEYIINNRGIDT 198
Query: 217 EEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQ 276
E YPY + +++GY DV DE +LL A +PVSVAI+AS FQ
Sbjct: 199 EASYPYQTAGPLTCQYNAANKGGSLTGYTDVTSGDENALLNAAVKEPVSVAIDASHNSFQ 258
Query: 277 FYSGGVF--TGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKP 334
FYSGGV+ + +LDHGV VG+G G D+ VKNSWG WG GYI+M RN
Sbjct: 259 FYSGGVYYESACSSTQLDHGVLVVGWGSENGQDFWWVKNSWGASWGLNGYIKMSRNQNNN 318
Query: 335 EGLCGINKMASIP 347
CGI AS P
Sbjct: 319 ---CGIATAASYP 328
>gi|359806985|ref|NP_001241331.1| uncharacterized protein LOC100811719 precursor [Glycine max]
gi|255645733|gb|ACU23360.1| unknown [Glycine max]
Length = 362
Score = 253 bits (647), Expect = 7e-65, Method: Compositional matrix adjust.
Identities = 153/365 (41%), Positives = 209/365 (57%), Gaps = 26/365 (7%)
Query: 1 MAFFSHSKLLLLSLSLSLFACS-SLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTY 59
M +KL + L F CS SLA + + E S +++ +LF++W +H + Y
Sbjct: 1 MMSLQRTKLFPFFIVLVSFTCSLSLAMSSNQL----EQFASEEEVFQLFQAWQKEHKREY 56
Query: 60 KCIEEKLHRFEIFKENLKHIDQRNKE----VTSYWLGLNEFADMSHEEFKNKYLGLKPQF 115
EEK RF+IF+ NL++I++ N + T + LGLN+FADMS EEF YL + +
Sbjct: 57 GNQEEKAKRFQIFQSNLRYINEMNAKRKSPTTQHRLGLNKFADMSPEEFMKTYLK-EIEM 115
Query: 116 P----TRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQ 171
P R+ + D LP SVDWR KGAVT V++QG C S WAFS A+EGIN+
Sbjct: 116 PYSNLESRKKLQKGDDADCDNLPHSVDWRDKGAVTEVRDQGKCQSHWAFSVTGAIEGINK 175
Query: 172 IVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCED 231
IV+GNL SLS Q+++DCD + ++GC GG AF Y++ +GG+ E YPY + GTC
Sbjct: 176 IVTGNLVSLSVQQVVDCDPA-SHGCAGGFYFNAFGYVIENGGIDTEAHYPYTAQNGTC-- 232
Query: 232 KKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGP-CGAE 290
K +VV+I V E++LL ++ QPVSV+I+A+G QFY+GGV+ G C
Sbjct: 233 KANANKVVSIDNLL-VVVGPEEALLCRVSKQPVSVSIDATG--LQFYAGGVYGGENCSKN 289
Query: 291 LDHGVAA---VGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGK--PEGLCGINKMAS 345
VGYG G DY IVKNSWG WGE GY+ +KRN P G+C IN
Sbjct: 290 STKATLVCLIVGYGSVGGEDYWIVKNSWGKDWGEEGYLLIKRNVSDEWPYGVCAINAAPG 349
Query: 346 IPLKK 350
P+ K
Sbjct: 350 FPIIK 354
>gi|156124998|gb|ABU50817.1| Ale o 1 allergen [Aleuroglyphus ovatus]
Length = 337
Score = 253 bits (647), Expect = 8e-65, Method: Compositional matrix adjust.
Identities = 140/322 (43%), Positives = 193/322 (59%), Gaps = 13/322 (4%)
Query: 35 PEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEV----TSYW 90
P L + +L FE + S G+ Y E +LHR IF+ NL+ I + N + +++
Sbjct: 20 PSMLLTEGELEAQFEQFKSTFGRVYPSPEIELHRKSIFRANLQFILRHNIDYFNGDSTFS 79
Query: 91 LGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKN 150
+ +N F D+S+EEF+ + G + + + + DV+ALP +VDW KG VTP+KN
Sbjct: 80 VSVNNFTDLSNEEFRATFNGYR-RLAAVSLADSVHADNDVEALPATVDWTTKGVVTPIKN 138
Query: 151 QGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNN-GCNGGLMDYAFKYIV 209
Q CGSCWAFS VA++EG + + +G L SLSEQ L+DC + + GC+GG MDYAFKY++
Sbjct: 139 QQQCGSCWAFSAVASMEGQHALKTGKLVSLSEQNLVDCSAAEGDMGCSGGWMDYAFKYVI 198
Query: 210 ASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAI 268
+ G+ E YPY + +CE K+ + TI + DV DE +L A+A P+SVAI
Sbjct: 199 QNRGIDTEASYPYKAIDESCEFKRNSVG-ATIHSFVDVKTGDESALQNAVASIGPISVAI 257
Query: 269 EASGTDFQFYSGGVFTGP-CGAE-LDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIR 326
+A+ FQFYS GV+ P C E LDHGV AVGYG G+ Y VKNSWG WG +GYI
Sbjct: 258 DAAQPSFQFYSSGVYNEPDCSTEILDHGVTAVGYGTLNGAPYWKVKNSWGTSWGRKGYIF 317
Query: 327 MKRNTGKPEGLCGINKMASIPL 348
M RN + CGI AS P+
Sbjct: 318 MSRN---KQNQCGIATKASYPV 336
>gi|226509942|ref|NP_001146834.1| cysteine protease precursor [Zea mays]
gi|159506725|gb|ABW97700.1| cysteine protease [Zea mays]
gi|414867308|tpg|DAA45865.1| TPA: cysteine protease [Zea mays]
Length = 352
Score = 253 bits (647), Expect = 8e-65, Method: Compositional matrix adjust.
Identities = 147/349 (42%), Positives = 205/349 (58%), Gaps = 21/349 (6%)
Query: 9 LLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHR 68
+LL SL + A +S G + L +++ F SW + + ++Y EE+ R
Sbjct: 15 ILLACCSLIMLAAASGGGGVDDDGVGGDRL-----MMDRFLSWQATYNRSYPTAEERQRR 69
Query: 69 FEIFKENLKHIDQRNKEVT-SYWLGLNEFADMSHEEFKNKYLGLKPQFPTRR-----QPS 122
F++++ N++HI+ N+ +Y LG N+FAD++ EEF + Y P RR + +
Sbjct: 70 FQVYRRNIEHIEATNRAGNLTYTLGENQFADLTEEEFLDLYT--MKGMPVRRDAGKKRAN 127
Query: 123 AEFSYRDVKALPKSVDWRKKGAVTPVKNQG-SCGSCWAFSTVAAVEGINQIVSGNLTSLS 181
S V A P SVDWR KGAVTP+KNQG SC SCWAF T A +E I +I +G L SLS
Sbjct: 128 VSSSAAAVDA-PTSVDWRSKGAVTPIKNQGPSCSSCWAFVTAATIESITKITTGKLVSLS 186
Query: 182 EQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTI 241
EQELIDCD ++ GCN G +++++ +GGL E +YPY C + TI
Sbjct: 187 EQELIDCD-PYDGGCNLGYFVNGYRWVIQNGGLTTEANYPYQARRYACSRSRAAQHAATI 245
Query: 242 SGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYG 301
S Y +P + Q L +A+A QPV+ AIE G+ QFYSGGVF+G CG ++H + VGYG
Sbjct: 246 SDYVQLPAGEGQ-LQQAVAQQPVAAAIEMGGS-LQFYSGGVFSGQCGTRMNHAITVVGYG 303
Query: 302 --KSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPL 348
S G Y +VKNSWG WGERGY+RM+R+ G+ GLCGI + P+
Sbjct: 304 ADSSSGLKYWLVKNSWGQSWGERGYLRMRRDVGRG-GLCGIALDLAYPV 351
>gi|221090861|ref|XP_002167224.1| PREDICTED: cathepsin L-like [Hydra magnipapillata]
Length = 324
Score = 253 bits (646), Expect = 9e-65, Method: Compositional matrix adjust.
Identities = 142/309 (45%), Positives = 187/309 (60%), Gaps = 16/309 (5%)
Query: 46 ELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFK 105
E + W H K Y E+ R+ I+K+N + I + N + + L +N+F DM++ EFK
Sbjct: 25 ESWIQWKMYHNKVYSHDGEETVRYTIWKDNERRIREHNLKGGDFLLKMNQFGDMTNSEFK 84
Query: 106 --NKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTV 163
N YL K + F + P +VDWR +G VTPVK+QG CGSCWAFST
Sbjct: 85 AFNGYLSHK------HVNGSTFLTPNNFVAPDTVDWRNEGYVTPVKDQGQCGSCWAFSTT 138
Query: 164 AAVEGINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKEEDYPY 222
++EG + +G L SLSEQ L+DC T++ NNGCNGGLMD AF YI + G+ E YPY
Sbjct: 139 GSLEGQHFKKTGKLVSLSEQNLVDCSTAYGNNGCNGGLMDNAFTYIKENKGIDSEASYPY 198
Query: 223 LMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAIEASGTDFQFYSGG 281
E+G C KK + T +G+ D+PE +E L +A+A P+SVAI+AS FQFYS G
Sbjct: 199 TAEDGKCVFKKPSV-AATDTGFVDLPEGNENKLKEAVASVGPISVAIDASHESFQFYSSG 257
Query: 282 VFTGP-CGA-ELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCG 339
V+ P C + ELDHGV VGYG G DY +VKNSW WG++GYI+M+RN + CG
Sbjct: 258 VYNEPSCSSTELDHGVLVVGYGTESGKDYWLVKNSWNTSWGDKGYIKMRRNA---KNQCG 314
Query: 340 INKMASIPL 348
I AS PL
Sbjct: 315 IATKASYPL 323
>gi|18396952|ref|NP_564322.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|332192922|gb|AEE31043.1| cysteine proteinase-like protein [Arabidopsis thaliana]
Length = 334
Score = 253 bits (646), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 131/346 (37%), Positives = 198/346 (57%), Gaps = 21/346 (6%)
Query: 11 LLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFE 70
++S+ A + L+ D I P + +++ + WM++ + YK EK R +
Sbjct: 1 MVSVRSVFVALTILSMDLRISQARPHVTLNEQSIVDYHQQWMTQFSRVYKDESEKEMRLK 60
Query: 71 IFKENLKHIDQ-RNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPT------RRQPSA 123
+FK+NLK I+ N SY LG+NEF D EEF + GL+ + + +PS
Sbjct: 61 VFKKNLKFIENFNNMGNQSYTLGVNEFTDWKTEEFLATHTGLRVNVTSLSELFNKTKPSR 120
Query: 124 EFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQ 183
++ D+ +S DWR +GAVTPVK QG+C + +I NL +LSEQ
Sbjct: 121 NWNMSDIDMEDESKDWRDEGAVTPVKYQGACR-------------LTKISGKNLLTLSEQ 167
Query: 184 ELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISG 243
+LIDCD N GCNGG + AFKYI+ +GG+ E +YPY +++ +C I G
Sbjct: 168 QLIDCDIEKNGGCNGGEFEEAFKYIIKNGGVSLETEYPYQVKKESCRANARRAPHTQIRG 227
Query: 244 YQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTG-PCGAELDHGVAAVGYGK 302
+Q VP ++E++LL+A+ QPVSV I+A F Y GGV+ G CG +++H V VGYG
Sbjct: 228 FQMVPSHNERALLEAVRRQPVSVLIDARADSFGHYKGGVYAGLDCGTDVNHAVTIVGYGT 287
Query: 303 SKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPL 348
G +Y ++KNSWG WGE GY+R++R+ P+G+CGI ++A+ P+
Sbjct: 288 MSGLNYWVLKNSWGESWGENGYMRIRRDVEWPQGMCGIAQVAAYPV 333
>gi|327263389|ref|XP_003216502.1| PREDICTED: cathepsin L1-like [Anolis carolinensis]
Length = 339
Score = 253 bits (645), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 153/354 (43%), Positives = 208/354 (58%), Gaps = 30/354 (8%)
Query: 8 KLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLH 67
K+ L +L+L L AC + +P +++D + +++W + H K Y EE
Sbjct: 2 KVYLCALALFLEACFA----------APSLDSALD---DHWQAWKTWHSKKYHQQEEGWR 48
Query: 68 RFEIFKENLKHIDQRNKEVT----SYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSA 123
R I+++NLK I N + + SY LG+N F DM++EEF+ G K ++ +
Sbjct: 49 RM-IWEKNLKMIQLHNLDHSLGKHSYRLGMNHFGDMTNEEFRQVMNGYKHSKTEKKYRGS 107
Query: 124 EFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQ 183
EF + +PKSVDWR+KG VTPVK+QG CGSCWAFST ++EG + +G L SLSEQ
Sbjct: 108 EFLEPNFLVVPKSVDWREKGYVTPVKDQGQCGSCWAFSTTGSLEGQHFRKTGKLVSLSEQ 167
Query: 184 ELIDCDT-SFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTIS 242
L+DC N GCNGGLMD AF+YI +GG+ EE YPY+ ++ K E +
Sbjct: 168 NLVDCSRPEGNQGCNGGLMDQAFEYIADNGGIDSEESYPYIAKDDEDCLYKSEFNAANDT 227
Query: 243 GYQDVPENDEQSLLKALAH-QPVSVAIEASGTDFQFYSGGVFTGP-CGA-ELDHGVAAVG 299
G+ DVPE E++L+KA+A PVSVAI+AS + FQFY G++ P C + ELDHGV VG
Sbjct: 228 GFVDVPEGHERALMKAVAAVGPVSVAIDASHSTFQFYESGIYYDPDCSSEELDHGVLVVG 287
Query: 300 YGKSKGSD-----YIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPL 348
YG D Y IVKNSW KWG++GYI M ++ CGI AS PL
Sbjct: 288 YGFEGTDDDNKKKYWIVKNSWSDKWGDKGYILMAKDRNNH---CGIATAASYPL 338
>gi|402770509|gb|AFQ98389.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 253 bits (645), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 147/321 (45%), Positives = 197/321 (61%), Gaps = 18/321 (5%)
Query: 39 TSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKEN----LKHIDQRNKEVTSYWLGLN 94
+S + L +E++ + H KTY+ E+L RF+IF E+ +H + K + SY LG+N
Sbjct: 18 SSQEILRTQWEAFKTTHKKTYQSHMEELLRFKIFTESSLIIARHNAKYAKGLVSYKLGMN 77
Query: 95 EFADMSHEEFKNKYLGLKPQFPTRRQPSAEF---SYRDVKALPKSVDWRKKGAVTPVKNQ 151
+F D+ EF + G TR+ + F + + +LPK+VDWRKKGAVTPVK+Q
Sbjct: 78 QFGDLLAHEFARIFNG---HHGTRKTGGSTFLPPANVNDSSLPKAVDWRKKGAVTPVKDQ 134
Query: 152 GSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIVA 210
G CGSCWAFS ++EG + + +G L SLSEQ L+DC SF NNGC GGLM+ AFKYI A
Sbjct: 135 GQCGSCWAFSATGSLEGQHFLKNGELVSLSEQNLVDCSQSFGNNGCEGGLMEDAFKYIKA 194
Query: 211 SGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAIE 269
+ G+ E+ YPY +G C KKE++ T +GY ++ E L KA+A P+SVAI+
Sbjct: 195 NDGIDTEKSYPYEAVDGECRFKKEDVG-ATDTGYVEIKAGSEDDLKKAVATVGPISVAID 253
Query: 270 ASGTDFQFYSGGVFTGP-CGAE-LDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRM 327
AS + FQ YS GV+ P C +E LDHGV VGYG G Y +VKNSW WG++GYI M
Sbjct: 254 ASHSSFQLYSEGVYDEPECSSEDLDHGVLVVGYGVKGGKKYWLVKNSWAESWGDQGYILM 313
Query: 328 KRNTGKPEGLCGINKMASIPL 348
R+ CGI AS PL
Sbjct: 314 SRDNNNQ---CGIASQASYPL 331
>gi|330434686|gb|AEC22811.1| cathepsin L [Macrobrachium nipponense]
Length = 342
Score = 253 bits (645), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 151/322 (46%), Positives = 194/322 (60%), Gaps = 22/322 (6%)
Query: 44 LIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNK----EVTSYWLGLNEFADM 99
++E +ES+ +H K Y+ E+ R +IF EN + I NK +Y LG+N++ DM
Sbjct: 25 VMEEWESFKFEHSKKYESDTEETFRMKIFAENKQKIAAHNKLYHTGSKTYKLGMNKYGDM 84
Query: 100 SHEEFKNKYLGLKPQF------PTRRQPSAEFSY--RDVKALPKSVDWRKKGAVTPVKNQ 151
H EF N G + R A F DV +PKSVDWR+KGAVT VK+Q
Sbjct: 85 LHHEFVNMMNGFRANTSGAGYKANRGFQGAHFVEPPEDV-VMPKSVDWREKGAVTEVKDQ 143
Query: 152 GSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIVA 210
GSCGSCWAFS A+EG + +G+L SLSEQ L+DC + F NNGCNGGLMD AF+YI
Sbjct: 144 GSCGSCWAFSATGALEGQHYRQTGDLVSLSEQNLVDCSSKFGNNGCNGGLMDNAFQYIKV 203
Query: 211 SGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAIE 269
+GG+ E+ YPY E+ C G+ DV E +E +L KA+A PVSVAI+
Sbjct: 204 NGGIDTEKSYPYEAEDEPCRYNPANAG-ADDRGFVDVREGNENALKKAIATIGPVSVAID 262
Query: 270 ASGTDFQFYSGGVFTGP-CGAE-LDHGVAAVGYGKSK-GSDYIIVKNSWGPKWGERGYIR 326
AS FQFY GV++ P C AE LDHGV AVGYG ++ G DY +VKNSW WG++GYI+
Sbjct: 263 ASQDSFQFYQHGVYSDPDCSAENLDHGVLAVGYGTTEDGQDYWLVKNSWSKSWGDQGYIK 322
Query: 327 MKRNTGKPEGLCGINKMASIPL 348
+ RN +CGI AS PL
Sbjct: 323 IARNQNN---MCGIASAASYPL 341
>gi|159792912|gb|ABW98676.1| cathepsin L [Apostichopus japonicus]
Length = 332
Score = 253 bits (645), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 152/320 (47%), Positives = 195/320 (60%), Gaps = 16/320 (5%)
Query: 38 LTSMDK-LIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKE----VTSYWLG 92
T +DK + +E+W H K Y EE+ +R +I+++NL+ + + N E + SY LG
Sbjct: 17 FTIIDKGFDDTWEAWKQTHSKQYT-KEEEDNRRKIWEDNLQKVSKHNTEHSLGLHSYTLG 75
Query: 93 LNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQG 152
+N++AD+ EEF GLK RQ SY +A P SVDWR +G VTPVK+QG
Sbjct: 76 MNKYADLRGEEFVQMMNGLKFDASRERQGIKFLSYAKFQA-PDSVDWRDEGYVTPVKDQG 134
Query: 153 SCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIVAS 211
CGSCWAFST ++EG + +G LTSLSEQ L+DC S+ NNGC GGLMDYAF+YI +
Sbjct: 135 QCGSCWAFSTTGSLEGQHFRSTGVLTSLSEQNLVDCSISYGNNGCEGGLMDYAFQYIKDN 194
Query: 212 GGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKAL-AHQPVSVAIEA 270
G+ E+ YPY E+ TC + + T SGY DV DE +L +A A+ P+SVAI+A
Sbjct: 195 LGIDTEDKYPYEAEDDTCRFSPDNVG-ATDSGYVDVDSGDEDALKEACAANGPISVAIDA 253
Query: 271 SGTDFQFYSGGVFT-GPCGA-ELDHGVAAVGYG-KSKGSDYIIVKNSWGPKWGERGYIRM 327
S FQ Y GV+ C + ELDHGV VGYG S G DY IVKNSWG WG+ GYI M
Sbjct: 254 SHESFQLYESGVYDEESCSSIELDHGVLVVGYGTDSVGGDYWIVKNSWGLSWGQEGYIWM 313
Query: 328 KRNTGKPEGLCGINKMASIP 347
RN + CGI AS P
Sbjct: 314 SRN---KDNQCGIATSASYP 330
>gi|18202414|sp|P82473.1|CPGP1_ZINOF RecName: Full=Zingipain-1; AltName: Full=Cysteine proteinase GP-I
Length = 221
Score = 252 bits (644), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 122/218 (55%), Positives = 151/218 (69%), Gaps = 2/218 (0%)
Query: 133 LPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSF 192
LP S+DWR+KGAV PVKNQG CGSCWAF +AAVEGINQIV+G+L SLSEQ+L+DC T
Sbjct: 3 LPDSIDWREKGAVVPVKNQGGCGSCWAFDAIAAVEGINQIVTGDLISLSEQQLVDCSTR- 61
Query: 193 NNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDE 252
N+GC GG AF+YI+ +GG++ EE YPY GTC D KE VV+I Y++VP NDE
Sbjct: 62 NHGCEGGWPYRAFQYIINNGGINSEEHYPYTGTNGTC-DTKENAHVVSIDSYRNVPSNDE 120
Query: 253 QSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVK 312
+SL KA+A+QPVSV ++A+G DFQ Y G+FTG C +H G DY VK
Sbjct: 121 KSLQKAVANQPVSVTMDAAGRDFQLYRNGIFTGSCNISANHYRTVGGRETENDKDYWTVK 180
Query: 313 NSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLKK 350
NSWG WGE GYIR++RN + G CGI S P+K+
Sbjct: 181 NSWGKNWGESGYIRVERNIAESSGKCGIAISPSYPIKE 218
>gi|219884655|gb|ACL52702.1| unknown [Zea mays]
gi|413916718|gb|AFW56650.1| thiol protease SEN102 [Zea mays]
Length = 349
Score = 252 bits (644), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 137/319 (42%), Positives = 191/319 (59%), Gaps = 19/319 (5%)
Query: 35 PEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLN 94
P H+ +D+ F++W +++ +TY EE RF ++ EN+K I+ N+ +SY LG N
Sbjct: 28 PIHIPLLDR----FQAWQAEYNRTYATPEEFQQRFMVYSENVKFIETMNQPGSSYELGEN 83
Query: 95 EFADMSHEEFKNKYLGLK----PQFPTRRQPSAEFSYR-------DVKALPKSVDWRKKG 143
+FAD++ EEFK+ YL +K P + + R + P SVDWR KG
Sbjct: 84 QFADLTEEEFKDTYL-MKLDNVASSPEAMALTVDTMNRAGTSGGSNTNEAPNSVDWRTKG 142
Query: 144 AVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLM-D 202
AVTPVK+Q CGSCWAF+ VA++EG+++I +G L SLSEQE++DCD NN G
Sbjct: 143 AVTPVKSQQHCGSCWAFAAVASIEGVHKIKTGRLVSLSEQEIVDCDRGGNNHGCHGGHSS 202
Query: 203 YAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQ 262
A +++ +GGL E DYPY+ +G C K I G Q V +E +L A+A +
Sbjct: 203 SAMEWVTRNGGLTTESDYPYVGRQGQCMSDKLGHHAAKIRGRQAVQGKNEGALQHAVAGR 262
Query: 263 PVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYG-KSKGSDYIIVKNSWGPKWGE 321
PV+V+I AS FQFY G+F+GPC +H V VGYG + G Y IVKNSWG +WGE
Sbjct: 263 PVAVSINAS-RAFQFYKRGIFSGPCNTTRNHAVTVVGYGANASGHKYWIVKNSWGERWGE 321
Query: 322 RGYIRMKRNTGKPEGLCGI 340
+GY+RM+R EG+CGI
Sbjct: 322 KGYVRMQRGVRAREGVCGI 340
>gi|29165304|gb|AAO65603.1| cathepsin L precursor [Hydra vulgaris]
Length = 324
Score = 252 bits (644), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 141/309 (45%), Positives = 187/309 (60%), Gaps = 16/309 (5%)
Query: 46 ELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFK 105
E + W H K Y E+ R+ I+K+N + I + N + + L +N+F DM++ EFK
Sbjct: 25 ESWIQWKMYHNKVYSHDGEETVRYTIWKDNERRIREHNLKGGDFILKMNQFGDMTNSEFK 84
Query: 106 --NKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTV 163
N YL K + F + P +VDWR +G VTPVK+QG CGSCWAFST
Sbjct: 85 AFNGYLSHK------HVNGSTFLTPNNFVAPDTVDWRNEGYVTPVKDQGQCGSCWAFSTT 138
Query: 164 AAVEGINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKEEDYPY 222
++EG + +G L SLSEQ L+DC T++ NNGC+GGLMD AF YI + G+ E YPY
Sbjct: 139 GSLEGQHFKKTGKLVSLSEQNLVDCSTAYGNNGCDGGLMDNAFTYIKENKGIDSEASYPY 198
Query: 223 LMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAIEASGTDFQFYSGG 281
E+G C KK + T +G+ D+PE +E L +A+A P+SVAI+AS FQFYS G
Sbjct: 199 TAEDGKCVFKKSSV-AATDTGFVDIPEGNENKLKEAVASVGPISVAIDASHESFQFYSSG 257
Query: 282 VFTGP-CGA-ELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCG 339
V+ P C + ELDHGV VGYG G DY +VKNSW WG++GYI+M+RN + CG
Sbjct: 258 VYNEPSCSSTELDHGVLVVGYGTESGKDYWLVKNSWNTSWGDKGYIKMRRNA---KNQCG 314
Query: 340 INKMASIPL 348
I AS PL
Sbjct: 315 IATKASYPL 323
>gi|7381610|gb|AAF61565.1|AF227957_1 cathepsin L-like proteinase precursor [Rhipicephalus microplus]
Length = 332
Score = 252 bits (644), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 148/321 (46%), Positives = 196/321 (61%), Gaps = 18/321 (5%)
Query: 39 TSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKEN----LKHIDQRNKEVTSYWLGLN 94
+S + L +E++ + H K+Y+ E+L RF+IF EN KH + K + SY LG+N
Sbjct: 18 SSQEILRTQWEAFKTTHKKSYQSHMEELLRFKIFTENSLIIAKHNAKYAKGLVSYKLGMN 77
Query: 95 EFADMSHEEFKNKYLGLKPQFPTRRQPSAEF---SYRDVKALPKSVDWRKKGAVTPVKNQ 151
+F D+ EF + G TR+ + F + + +LPK VDWRKKGAVTPVK+Q
Sbjct: 78 QFGDLLAHEFARIFNG---HHGTRKTGGSTFLPPANVNDSSLPKVVDWRKKGAVTPVKDQ 134
Query: 152 GSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIVA 210
G CGSCWAFS ++EG + + +G L SLSEQ L+DC SF NNGC GGLM+ AFKYI A
Sbjct: 135 GQCGSCWAFSATGSLEGQHFLKNGELVSLSEQNLVDCSQSFGNNGCEGGLMEDAFKYIKA 194
Query: 211 SGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAIE 269
+ G+ E+ YPY +G C KKE++ T +GY ++ E L KA+A P+SVAI+
Sbjct: 195 NDGIDTEKSYPYEAVDGECRFKKEDVG-ATDTGYVEIKAGSEVDLKKAVATVGPISVAID 253
Query: 270 ASGTDFQFYSGGVFTGP-CGAE-LDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRM 327
AS + FQ YS GV+ P C +E LDHGV VGYG G Y +VKNSW WG++GYI M
Sbjct: 254 ASHSSFQLYSEGVYDEPECSSEDLDHGVLVVGYGVKGGKKYWLVKNSWAESWGDQGYILM 313
Query: 328 KRNTGKPEGLCGINKMASIPL 348
R+ CGI AS PL
Sbjct: 314 SRDNNNQ---CGIASQASYPL 331
>gi|402770503|gb|AFQ98386.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 252 bits (643), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 148/321 (46%), Positives = 195/321 (60%), Gaps = 18/321 (5%)
Query: 39 TSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKEN----LKHIDQRNKEVTSYWLGLN 94
+S + L +E++ + H KTY+ E+L RF+IF EN KH + K + SY LG+N
Sbjct: 18 SSQEILRTQWEAFKTTHKKTYQSHMEELLRFKIFTENSLIIAKHNAKYAKGLVSYKLGMN 77
Query: 95 EFADMSHEEFKNKYLGLKPQFPTRRQPSAEF---SYRDVKALPKSVDWRKKGAVTPVKNQ 151
+F D+ EF + G TR+ + F + + +LPK VDWRKKGAVTPVK+Q
Sbjct: 78 QFGDLLAHEFARIFNG---HHGTRKTGGSTFLPPANVNDSSLPKVVDWRKKGAVTPVKDQ 134
Query: 152 GSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIVA 210
G CGSCWAFS ++EG + + +G L SLSEQ L+DC SF NNGC GGLM+ AFKYI
Sbjct: 135 GQCGSCWAFSATGSLEGRHFLKNGELVSLSEQNLVDCSQSFGNNGCEGGLMEDAFKYIKE 194
Query: 211 SGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAIE 269
+ G+ E+ YPY +G C KKE++ T +GY ++ E L KA+A P+SVAI+
Sbjct: 195 NDGIDTEKSYPYEAVDGECRFKKEDVG-ATDTGYVEIKAGSEDDLKKAVATVGPISVAID 253
Query: 270 ASGTDFQFYSGGVFTGP-CGAE-LDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRM 327
AS + FQ YS GV+ P C +E LDHGV VGYG G Y +VKNSW WG++GYI M
Sbjct: 254 ASHSSFQLYSEGVYDEPECSSEDLDHGVLVVGYGVKGGKKYWLVKNSWAESWGDQGYILM 313
Query: 328 KRNTGKPEGLCGINKMASIPL 348
R+ CGI AS PL
Sbjct: 314 SRDNNNQ---CGIASQASYPL 331
>gi|326531188|dbj|BAK04945.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 360
Score = 252 bits (643), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 134/323 (41%), Positives = 191/323 (59%), Gaps = 22/323 (6%)
Query: 44 LIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT-SYWLGLNEFADMSHE 102
+++ F W + H ++Y+ EE+L RF+++++N+++I+ N+ +Y LG N+FAD++ E
Sbjct: 38 MMDRFLMWQATHNQSYRSAEERLRRFQVYRDNVEYIETTNRRGDLTYQLGENQFADLTRE 97
Query: 103 EFKNKYLGLKPQFPTRRQPSAEFSYR---------------DVKALPKSVDWRKKGAVTP 147
EF ++ + + DV P SVDWR KGAV P
Sbjct: 98 EFIARFTSYNGDDDRTGDDDSVITTAAVGGGDPDLWSSGGDDVSLDPPSVDWRAKGAVVP 157
Query: 148 VKNQGSCGSC-WAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFK 206
K+Q S S WAF VA +E ++ I +G L +LSEQ+L+DCD ++ GCN G AF
Sbjct: 158 PKSQSSSCSSSWAFVAVATIESLHAIKTGKLVALSEQQLVDCD-QYDGGCNRGTFRRAFH 216
Query: 207 YIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSV 266
+++ +GGL E +YPY +GTC K + V ISG+ VP ++E ++ A+A QPV+
Sbjct: 217 WVIQNGGLTTEAEYPYTAAQGTCNSAKSDHHVAAISGHASVPGSNELAMKHAVATQPVAA 276
Query: 267 AIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYG--KSKGSDYIIVKNSWGPKWGERGY 324
AIE G+D QFY GV++GPCGA L+H V VGYG +S G Y IVKNSWG WGERGY
Sbjct: 277 AIEL-GSDMQFYKSGVYSGPCGARLEHAVTVVGYGADESTGDKYWIVKNSWGQTWGERGY 335
Query: 325 IRMKRNTGKPEGLCGINKMASIP 347
IRM+R P GLCGI + P
Sbjct: 336 IRMQRKILGP-GLCGIMLDVAYP 357
>gi|326493706|dbj|BAJ85314.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 365
Score = 251 bits (642), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 143/360 (39%), Positives = 196/360 (54%), Gaps = 24/360 (6%)
Query: 1 MAFFSHSKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYK 60
M S + L+L L L+ F + L P ++ E F WM K+ K Y
Sbjct: 1 MKMASSTPYLVLLLCLTTFLQAWLTAATYPPPAPPAFELPESEVRERFSKWMIKYSKHYS 60
Query: 61 CIEEKLHRFEIFKENLKHIDQRNKEVTSYWLG-----------------LNEFADMSHEE 103
C +E+ RF++FK N I Q +++ + +G +N F D+S E
Sbjct: 61 CKQEEEMRFQVFKNNTNSIGQLDRQNPNPGVGGALGPSGSQVHTFQKVSMNRFGDLSPRE 120
Query: 104 FKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTV 163
+Y GL P+ Y K P VDWR GAVT VK+QG+CGSCWAF+ V
Sbjct: 121 VIQQYTGLNTTSFRTASPT-YLPYHSFK--PCCVDWRSSGAVTGVKHQGTCGSCWAFAAV 177
Query: 164 AAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYL 223
AA+EG+N+I +G L SLSEQ L+DCDT + GC GG D A + A GG+ EE YPY
Sbjct: 178 AAIEGMNKIRTGELVSLSEQVLVDCDT-VSTGCGGGHSDSAMALVAARGGITSEERYPYA 236
Query: 224 MEEGTCE-DKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGV 282
+G C+ DK +I G++ VP N+E L A+A QPV+V I+ASG+ FQFYSGG+
Sbjct: 237 GFQGKCDVDKLMFDHQASIKGFKAVPSNNEAQLAIAVAMQPVTVYIDASGSAFQFYSGGI 296
Query: 283 FTGPCGAELDHGVAAVGY--GKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGI 340
+ GPC A ++H V VGY G +G+ Y I KNSW WGE+GY+ + ++ G CG+
Sbjct: 297 YRGPCSANVNHAVTIVGYCEGPGEGNKYWIAKNSWSNDWGEQGYVYLAKDVAWSTGTCGL 356
>gi|125592011|gb|EAZ32361.1| hypothetical protein OsJ_16571 [Oryza sativa Japonica Group]
Length = 416
Score = 251 bits (642), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 140/293 (47%), Positives = 170/293 (58%), Gaps = 35/293 (11%)
Query: 62 IEEKLHRFEIFKENLKHIDQRN---KEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTR 118
I E RF +F +NLK +D N E + LG+N FAD+++ EF+ YLG P R
Sbjct: 46 IGEHERRFRVFWDNLKFVDAHNARADERGGFRLGMNRFADLTNGEFRATYLGTTPAGRGR 105
Query: 119 RQPSAEFSYRDVKALPKSVDWRKKGAVT-PVKNQGSCGSCWAFSTVAAVEGINQIVSGNL 177
R A + + V+ALP SVDWR KGAV PVKNQG CG+ G
Sbjct: 106 RVGEA-YRHDGVEALPDSVDWRDKGAVVAPVKNQGQCGA-----------------GGVR 147
Query: 178 TSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEME 237
+EQ L +MD AF +I +GGL EEDYPY +G C K +
Sbjct: 148 EERAEQRL-----------QRWIMDDAFAFIARNGGLDTEEDYPYTAMDGKCNLAKRSRK 196
Query: 238 VVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAA 297
VV+I G++DVPENDE SL KA+AHQPVSVAI+A G +FQ Y GVFTG CG LDHGV A
Sbjct: 197 VVSIDGFEDVPENDELSLQKAVAHQPVSVAIDAGGREFQLYDSGVFTGRCGTNLDHGVVA 256
Query: 298 VGYGK--SKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPL 348
VGYG + G+ Y V+NSWGP WGE GYIRM+RN G CGI MAS P+
Sbjct: 257 VGYGTDAATGAAYWTVRNSWGPDWGENGYIRMERNVTARTGKCGIAMMASYPI 309
>gi|357446975|ref|XP_003593763.1| Cysteine proteinase [Medicago truncatula]
gi|355482811|gb|AES64014.1| Cysteine proteinase [Medicago truncatula]
Length = 350
Score = 251 bits (642), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 137/308 (44%), Positives = 184/308 (59%), Gaps = 6/308 (1%)
Query: 44 LIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT-SYWLGLNEFADMSHE 102
++E + WM K+ +TY E R +IFKENL++I+ N SY LGLN ++D++ E
Sbjct: 29 VVEAHQQWMMKYERTYTNSSEMEKRKKIFKENLEYIENFNNVGNKSYKLGLNRYSDLTSE 88
Query: 103 EFKNKYLGLK--PQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAF 160
EF + G K Q + S + +P + DWR+KG VT VKNQ CG CWAF
Sbjct: 89 EFIASHTGFKVSDQLSDSKMRSVAIPFNLNDDVPTNFDWREKGVVTDVKNQRQCGCCWAF 148
Query: 161 STVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDY 220
+ VAAVEGI +I +GNL SLSEQ+L+DCD ++GC GG AF I+ S G+ KE+DY
Sbjct: 149 TAVAAVEGIVKIKNGNLISLSEQQLVDCDRQ-SSGCGGGDFVLAFDSIIKSRGIVKEDDY 207
Query: 221 PYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSG 280
PY + + I+GY VP NDEQ LL+A+ QPVSVAI S DF Y G
Sbjct: 208 PYKANDVQTCQLGQIPGAAQINGYFKVPANDEQQLLRAVLQQPVSVAISTS-YDFHHYMG 266
Query: 281 GVFTGPCGAELDHGVAAVGYGKSK-GSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCG 339
GV+ G CG +L+H V +GYG S+ G Y ++KNSWG WGE+GY+++ R + G C
Sbjct: 267 GVYEGSCGPKLNHAVTIIGYGVSEAGKKYWLIKNSWGETWGEKGYMKVLRESSATGGQCS 326
Query: 340 INKMASIP 347
I A+ P
Sbjct: 327 IAVHAAYP 334
>gi|326431661|gb|EGD77231.1| cysteine protease [Salpingoeca sp. ATCC 50818]
Length = 347
Score = 251 bits (642), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 135/312 (43%), Positives = 179/312 (57%), Gaps = 19/312 (6%)
Query: 48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT----SYWLGLNEFADMSHEE 103
FE + K+ K Y+ EE+ R IF+E+L I++ N E +Y +G+NEFAD++ EE
Sbjct: 31 FEEFKDKYNKVYESAEEEARRAAIFQESLDFIEKHNAEAAAGMHTYLVGVNEFADLTREE 90
Query: 104 FKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKS--------VDWRKKGAVTPVKNQGSCG 155
F+ ++ P +R P + D A+ + +DWRK+GAVTPV+NQG CG
Sbjct: 91 FRQHHVTRLPFDDDKRDPVTATLHLDEHAVHAADSNGDSSGIDWRKRGAVTPVRNQGQCG 150
Query: 156 SCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLH 215
+ F+ V AVEG++ I SGNL LS Q++IDC S GC+GG + FKYI +GGL
Sbjct: 151 NPAIFAAVEAVEGMHAISSGNLVELSTQQVIDC--SGTPGCSGGSLVSFFKYIARNGGLD 208
Query: 216 KEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDF 275
DYP G C KE V + GY VP +E L A+ PV+VAIEA F
Sbjct: 209 SAADYPTSGAGGQCNKAKEARHVAKVGGYSVVPPRNETKLAAAVFKMPVAVAIEADTPSF 268
Query: 276 QFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPE 335
Q Y+ GV++GPCG +LDH V VGY +Y IVKNSWG WG++GYI MKR G
Sbjct: 269 QMYTSGVYSGPCGTQLDHAVLVVGY----TDEYWIVKNSWGASWGDQGYIMMKRGVGA-A 323
Query: 336 GLCGINKMASIP 347
G+CGI A P
Sbjct: 324 GICGITLDAMYP 335
>gi|261289785|ref|XP_002611754.1| hypothetical protein BRAFLDRAFT_284341 [Branchiostoma floridae]
gi|229297126|gb|EEN67764.1| hypothetical protein BRAFLDRAFT_284341 [Branchiostoma floridae]
Length = 327
Score = 251 bits (642), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 142/331 (42%), Positives = 195/331 (58%), Gaps = 17/331 (5%)
Query: 28 FSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT 87
F I+ S T+MD E F+ HGK YK +E+ R IF++N + I + N+E
Sbjct: 3 FLILVLSVTMATAMDVEWEAFKL---THGKQYKSPDEENVRRAIFRDNNQMIKEHNQEAA 59
Query: 88 ----SYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALP--KSVDWRK 141
SY++G+N+F D++H E+ +G P +E + L +VDWR+
Sbjct: 60 MGRRSYFMGMNQFGDLAHSEYLELVVG-PGLLPLNLSTPSENVFESTPGLQVDDTVDWRQ 118
Query: 142 KGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGL 200
KGAVTP+K+QG CGSCWAFST ++EG + + +G L SLSEQ L+DC F N GC GGL
Sbjct: 119 KGAVTPIKDQGHCGSCWAFSTTGSLEGQHFMKTGKLVSLSEQNLLDCSRRFGNKGCEGGL 178
Query: 201 MDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALA 260
MD AF+YI ++GG+ EE YPY+ ++ D K T+S Y D+ DE +L++A+
Sbjct: 179 MDQAFRYIKSNGGIDTEECYPYMAKDEKVCDYKTSCSGATLSSYTDIKAMDEMALMQAVG 238
Query: 261 H-QPVSVAIEASGTDFQFYSGGVFTGP-CG-AELDHGVAAVGYGKSKGSDYIIVKNSWGP 317
PVSVAI+AS +FY G++ P C +LDHGV AVGYG G DY +VKNSWG
Sbjct: 239 TVGPVSVAIDASHKSLRFYKSGIYDEPECSRTKLDHGVLAVGYGSMDGMDYWLVKNSWGS 298
Query: 318 KWGERGYIRMKRNTGKPEGLCGINKMASIPL 348
WG+ GY++M RN CGI AS P+
Sbjct: 299 AWGDMGYVKMTRNKNNQ---CGIATKASYPV 326
>gi|356517384|ref|XP_003527367.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 332
Score = 251 bits (641), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 139/283 (49%), Positives = 180/283 (63%), Gaps = 15/283 (5%)
Query: 71 IFKENLKHIDQRNKEVTS-YWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRD 129
+FKEN+ +I+ N Y +N+FA + FK + T F + +
Sbjct: 57 VFKENVNYIEACNNAADKPYKRDINQFA--PKKRFKGHMCSSIIRITT-------FKFEN 107
Query: 130 VKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLS-EQELIDC 188
V A P +VD R+K AVTP+K+QG CG WA S VAA EGI+ + +G L LS EQEL+DC
Sbjct: 108 VTATPSTVDCRQKVAVTPIKDQGQCGCFWALSAVAATEGIHALXAGKLILLSSEQELVDC 167
Query: 189 DTS-FNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTI-SGYQD 246
DT + C GGLMD AFK+I+ + GL+ E +YPY +G C + + TI +GY+D
Sbjct: 168 DTKGVDQDCQGGLMDDAFKFIIQNHGLNTEANYPYKGVDGKCNAYEADKNAATIITGYED 227
Query: 247 VPENDEQS-LLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKS-K 304
VP N+E++ L KA+A+ PVSVAI+ASG+DFQFY GVFTG CG ELDHGV AVGYG S
Sbjct: 228 VPANNEKAHLQKAVANNPVSVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVSDD 287
Query: 305 GSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
G++Y +VKNS G +WGE GYIRM+R E LCGI AS P
Sbjct: 288 GTEYWLVKNSRGTEWGEEGYIRMQRGVDSEEALCGIAVQASYP 330
>gi|388501884|gb|AFK39008.1| unknown [Lotus japonicus]
Length = 151
Score = 251 bits (641), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 117/150 (78%), Positives = 131/150 (87%)
Query: 201 MDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALA 260
MDYAF +IV +GGLHKE+DYPY+MEEGTCE KEE +VVTISGY DVP+N+EQSLLKALA
Sbjct: 1 MDYAFSFIVENGGLHKEDDYPYIMEEGTCEMSKEESQVVTISGYHDVPQNNEQSLLKALA 60
Query: 261 HQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWG 320
+QP+SVAIEASG DFQFYSGGVF G CG +LDHGVAAVGYG SKG DYI VKNSWG KWG
Sbjct: 61 NQPLSVAIEASGRDFQFYSGGVFDGHCGTQLDHGVAAVGYGTSKGLDYITVKNSWGTKWG 120
Query: 321 ERGYIRMKRNTGKPEGLCGINKMASIPLKK 350
E+GYIR +RN GKPEG+CG+ KMAS P KK
Sbjct: 121 EKGYIRFRRNNGKPEGMCGLYKMASYPTKK 150
>gi|116788286|gb|ABK24823.1| unknown [Picea sitchensis]
Length = 294
Score = 251 bits (641), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 126/273 (46%), Positives = 180/273 (65%), Gaps = 13/273 (4%)
Query: 10 LLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRF 69
++L L + L SS+ + + Y+P L S + L+ LF+ W + HGKTY + L RF
Sbjct: 6 MILKLVMLLLVFSSV----TAITYNPRDL-SENGLLSLFDRWCNHHGKTYTAKQRPL-RF 59
Query: 70 EIFKENLKHIDQRNKEVT-SYWLGLNEFADMSHEEFKNKYLGLKPQFPT----RRQPSAE 124
++FKENL +I + N ++WLGLN F+D++ +EF+ + +GL+ P+ RR+P +
Sbjct: 60 QVFKENLFYISEHNSRGNHTFWLGLNAFSDLTSDEFRTQQMGLRGHPPSLKSRRREPKSG 119
Query: 125 FSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQE 184
++ +P S+DWR K AVT VK+QG+CG CWAFS A+EGIN+IV+G+L SLSEQE
Sbjct: 120 L--LELYNIPSSLDWRDKDAVTGVKDQGACGDCWAFSATGAIEGINKIVTGSLVSLSEQE 177
Query: 185 LIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGY 244
L DCDTS+N+GC+GGLMDYAF++++ +GG+ E DYPY + C KK VVTI Y
Sbjct: 178 LCDCDTSYNSGCDGGLMDYAFQWVIVNGGIDTEVDYPYKGVQKACNSKKVNRRVVTIDDY 237
Query: 245 QDVPENDEQSLLKALAHQPVSVAIEASGTDFQF 277
DVP N+E++LL+A+ QPVSV I FQ
Sbjct: 238 IDVPANNERALLQAVVGQPVSVGISGGERAFQL 270
>gi|449469176|ref|XP_004152297.1| PREDICTED: vignain-like [Cucumis sativus]
Length = 340
Score = 251 bits (640), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 138/348 (39%), Positives = 200/348 (57%), Gaps = 19/348 (5%)
Query: 8 KLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLH 67
K L++ + L FA S L F + + S L++L++ W S H + + E
Sbjct: 5 KFLIVFVVLIAFA-SHLCEGFDL---ERKDFESEKSLMQLYKRWSSHH-RISRNAHEMHK 59
Query: 68 RFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAE--- 124
RF+IF++N K + + N S L LN+FAD+S +EF Y + +
Sbjct: 60 RFKIFQDNAKRVFKVNHMGKSLKLRLNQFADLSDDEFSMMYGSNITHYNNLHAKAGGRVG 119
Query: 125 -FSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQ 183
F Y +P S+DWR+KGAV +KNQG C VAAVE I+QI + L SLSEQ
Sbjct: 120 GFMYERAMNIPFSIDWREKGAVNAIKNQGLC-------AVAAVESIHQIKTNELVSLSEQ 172
Query: 184 ELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISG 243
E++DCD GC GG D AF++I+ +GG+ EE+YPY G C + E VTI G
Sbjct: 173 EVVDCDYKVG-GCRGGNYDSAFEFIMQNGGITIEENYPYFAGNGYCRRRGPNSERVTIDG 231
Query: 244 YQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFT--GPCGAELDHGVAAVGYG 301
Y+ VP+N+E +L+KA+AHQPV+V++ +SG+DF+FY G+ CG +DH V VGYG
Sbjct: 232 YECVPQNNEYALMKAVAHQPVAVSVASSGSDFRFYGEGMLREGSFCGYRIDHTVVVVGYG 291
Query: 302 KSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
+ DY I++N +G +WG GY++M+R T P+G+CG+ S P+K
Sbjct: 292 SDEEGDYWIIRNQYGTQWGMNGYMKMQRGTRNPQGVCGMAMQPSFPVK 339
>gi|405971604|gb|EKC36431.1| Cathepsin L [Crassostrea gigas]
Length = 384
Score = 251 bits (640), Expect = 5e-64, Method: Compositional matrix adjust.
Identities = 143/303 (47%), Positives = 188/303 (62%), Gaps = 15/303 (4%)
Query: 55 HGKTYKCIEEKLHRFEIFKENLKHIDQRNKEV----TSYWLGLNEFADMSHEEFKNKYLG 110
H K+Y+ EE+ RFEIF+EN+ I++ NK SY+LG+N+F D+ + EF N + G
Sbjct: 86 HDKSYEDHEEESRRFEIFRENVLRIEKHNKLFHLGKKSYYLGVNQFTDLEYAEFVN-FNG 144
Query: 111 LKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGIN 170
LK + S+ S ++ +P SVDWR KG VT VKNQG+CGSCWAFS ++EG
Sbjct: 145 LKMTNLNNTKCSSHLSANNI-VVPDSVDWRSKGYVTKVKNQGACGSCWAFSATGSLEGQY 203
Query: 171 QIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTC 229
+G L LSE +L+DC SF N GCNGG M+ AFKY+ + GG+ E DYPY + TC
Sbjct: 204 FRKNGKLVPLSESQLVDCSGSFGNEGCNGGFMENAFKYVKSVGGIESESDYPYKARQRTC 263
Query: 230 EDKKEEMEVVTISGYQDVPENDEQSLLKALAHQ-PVSVAIEASGTDFQFYSGGVFTGP-C 287
K ++ + T+SG DV E SL + ++ PVSVAI+A + FQ Y+GGV+ P C
Sbjct: 264 AFDKTKV-IATVSGCVDVESGSESSLKEVVSEVGPVSVAIDAGHSSFQLYAGGVYDEPLC 322
Query: 288 G-AELDHGVAAVGYGKS-KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMAS 345
+ L+HGV VGYG S +G DY IVKNSWG +WG GYI+M RN CGI AS
Sbjct: 323 STSRLNHGVLCVGYGTSLQGKDYWIVKNSWGVRWGVEGYIKMSRNKNNQ---CGIASEAS 379
Query: 346 IPL 348
PL
Sbjct: 380 YPL 382
>gi|9502426|gb|AAF88125.1|AC021043_18 Putative cysteine proteinase [Arabidopsis thaliana]
Length = 365
Score = 251 bits (640), Expect = 5e-64, Method: Compositional matrix adjust.
Identities = 134/364 (36%), Positives = 204/364 (56%), Gaps = 26/364 (7%)
Query: 11 LLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFE 70
++S+ A + L+ D I P + +++ + WM++ + YK EK R +
Sbjct: 1 MVSVRSVFVALTILSMDLRISQARPHVTLNEQSIVDYHQQWMTQFSRVYKDESEKEMRLK 60
Query: 71 IFKENLKHIDQ-RNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPT------RRQPSA 123
+FK+NLK I+ N SY LG+NEF D EEF + GL+ + + +PS
Sbjct: 61 VFKKNLKFIENFNNMGNQSYTLGVNEFTDWKTEEFLATHTGLRVNVTSLSELFNKTKPSR 120
Query: 124 EFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWA------------FSTVAAV----- 166
++ D+ +S DWR +GAVTPVK QG+C ++ + V
Sbjct: 121 NWNMSDIDMEDESKDWRDEGAVTPVKYQGACPEFPTKQIRRNSLVGKQYTKLLGVLSDWG 180
Query: 167 -EGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLME 225
EG+ +I NL +LSEQ+LIDCD N GCNGG + AFKYI+ +GG+ E +YPY ++
Sbjct: 181 DEGLTKISGKNLLTLSEQQLIDCDIEKNGGCNGGEFEEAFKYIIKNGGVSLETEYPYQVK 240
Query: 226 EGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTG 285
+ +C I G+Q VP ++E++LL+A+ QPVSV I+A F Y GGV+ G
Sbjct: 241 KESCRANARRAPHTQIRGFQMVPSHNERALLEAVRRQPVSVLIDARADSFGHYKGGVYAG 300
Query: 286 -PCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMA 344
CG +++H V VGYG G +Y ++KNSWG WGE GY+R++R+ P+G+CGI ++A
Sbjct: 301 LDCGTDVNHAVTIVGYGTMSGLNYWVLKNSWGESWGENGYMRIRRDVEWPQGMCGIAQVA 360
Query: 345 SIPL 348
+ P+
Sbjct: 361 AYPV 364
>gi|357124027|ref|XP_003563708.1| PREDICTED: germination-specific cysteine protease 1-like [Brachypodium distachyon]
Length = 334
Score = 251 bits (640), Expect = 5e-64, Method: Compositional matrix adjust.
Identities = 134/319 (42%), Positives = 188/319 (58%), Gaps = 15/319 (4%)
Query: 44 LIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEV-----TSYWLGLNEFAD 98
+ E +E WM++ G+TYK EK RFE+FK N ID N + L N+FAD
Sbjct: 16 MRERYEKWMAEQGRTYKDSTEKARRFEVFKSNAHFIDSHNAATGPGGKSRPKLTTNKFAD 75
Query: 99 MSHEEFKNKYL-GLKPQF-PTRRQPSAEFSYRDVKA--LPKSVDWRKKGAVTPVKNQGSC 154
++ +EF+N Y+ G + + PT F + V +P S+DWR +GAVT VK+Q C
Sbjct: 76 LTEDEFRNIYVTGHRVNYRPTSLVTDTVFKFGAVSLSDVPPSIDWRARGAVTSVKDQHLC 135
Query: 155 GSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGL 214
CWAFS+ AAVEGI+QI +GN SLS Q+L+DC + N C G +D A++YI SGGL
Sbjct: 136 ACCWAFSSAAAVEGIHQITTGNQVSLSVQQLVDCSNAANEKCKAGEIDKAYEYIARSGGL 195
Query: 215 HKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTD 274
++DYPY GTC ++ V ISG+Q VP +E +LL A+AHQPVSVA++
Sbjct: 196 VADQDYPYEGHSGTCRVYGKQA-VARISGFQYVPARNETALLLAVAHQPVSVALDGLSRA 254
Query: 275 FQFYSGGVFTG---PCGAELDHGVAAVGYGKSK-GSDYIIVKNSWGPKWGERGYIRMKRN 330
Q G+F PC L+H + VGYG + G+ Y ++KNSWG WG++GY++ R+
Sbjct: 255 LQHIGTGIFGSAGEPCTTNLNHAMTIVGYGTDEHGTRYWLMKNSWGSDWGDKGYVKFARD 314
Query: 331 TGKP-EGLCGINKMASIPL 348
G+CG+ AS P+
Sbjct: 315 VASEINGVCGLALEASYPV 333
>gi|226503205|ref|NP_001150062.1| thiol protease SEN102 precursor [Zea mays]
gi|195636390|gb|ACG37663.1| thiol protease SEN102 precursor [Zea mays]
Length = 349
Score = 251 bits (640), Expect = 5e-64, Method: Compositional matrix adjust.
Identities = 137/319 (42%), Positives = 190/319 (59%), Gaps = 19/319 (5%)
Query: 35 PEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLN 94
P H+ +D+ F++W +++ +TY EE RF ++ EN+K I+ N+ +SY LG N
Sbjct: 28 PIHIPLLDR----FQAWQAEYNRTYATPEEFQQRFMVYSENVKFIETMNQPGSSYELGEN 83
Query: 95 EFADMSHEEFKNKYLGLK----PQFPTRRQPSAEFSYR-------DVKALPKSVDWRKKG 143
FAD++ EEFK+ YL +K P + + R + P SVDWR KG
Sbjct: 84 RFADLTEEEFKDTYL-MKLDNVASSPEAMALTVDTMNRAGTSGGSNTNEAPNSVDWRTKG 142
Query: 144 AVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLM-D 202
AVTPVK+Q CGSCWAF+ VA++EG+++I +G L SLSEQE++DCD NN G
Sbjct: 143 AVTPVKSQQHCGSCWAFAAVASIEGVHKIKTGLLVSLSEQEIVDCDRGGNNHGCHGGHSS 202
Query: 203 YAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQ 262
A +++ +GGL E DYPY+ +G C K I G Q V +E +L A+A +
Sbjct: 203 SAMEWVTRNGGLTTESDYPYVGRQGQCMSDKLGHHAAKIRGRQAVQGKNEGALQHAVAGR 262
Query: 263 PVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYG-KSKGSDYIIVKNSWGPKWGE 321
PV+V+I AS FQFY G+F+GPC +H V VGYG + G Y IVKNSWG +WGE
Sbjct: 263 PVAVSINAS-RAFQFYKRGIFSGPCNTTRNHAVTVVGYGANASGHKYWIVKNSWGERWGE 321
Query: 322 RGYIRMKRNTGKPEGLCGI 340
+GY+RM+R EG+CGI
Sbjct: 322 KGYVRMQRGVRAREGVCGI 340
>gi|390337642|ref|XP_780653.3| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
Length = 333
Score = 251 bits (640), Expect = 5e-64, Method: Compositional matrix adjust.
Identities = 143/318 (44%), Positives = 190/318 (59%), Gaps = 14/318 (4%)
Query: 40 SMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT----SYWLGLNE 95
S E + W ++HGK Y EE+ R I+++NL + + N + +Y LG+N+
Sbjct: 20 SFTDFDEDWNEWKNEHGKRYLSDEEEASRRLIWQKNLDIVIKHNLKYDLGHFTYDLGINQ 79
Query: 96 FADMSHEEFKNKYLGLKPQFPTRRQPSAEF-SYRDVKALPKSVDWRKKGAVTPVKNQGSC 154
F D+ +EEF G + ++ + F +V LPK+VDWR KG VTPVK+QG C
Sbjct: 80 FTDLQNEEFVAMMTGFRVSGTSKAAKGSTFLPPNNVGELPKTVDWRTKGYVTPVKDQGQC 139
Query: 155 GSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGL 214
GSCWAFST +VEG + +G L SLSEQ L+DC + + GC+GG MD AF+YI+ +GG+
Sbjct: 140 GSCWAFSTTGSVEGQHFKATGKLVSLSEQNLVDC-SGRDAGCDGGFMDRAFQYIIDAGGI 198
Query: 215 HKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAIEASGT 273
E YPY +G C KK + T++GY DV E++L KA+AH P+SVAI+AS
Sbjct: 199 DTEASYPYKAVDGKCHFKKANVG-ATVTGYTDVTSGSEKALQKAVAHVGPISVAIDASHM 257
Query: 274 DFQFYSGGVFTGP-CGAE-LDHGVAAVGYGKSK-GSDYIIVKNSWGPKWGERGYIRMKRN 330
FQ Y GV+ P C + LDHGV AVGYG S G+DY IVKNSW WG GY+ M RN
Sbjct: 258 SFQHYKSGVYNEPGCDSTVLDHGVLAVGYGTSSDGTDYWIVKNSWAETWGMNGYVWMSRN 317
Query: 331 TGKPEGLCGINKMASIPL 348
+ CGI AS PL
Sbjct: 318 ---KDNQCGIATNASYPL 332
>gi|198427748|ref|XP_002130282.1| PREDICTED: similar to predicted protein [Ciona intestinalis]
Length = 340
Score = 250 bits (639), Expect = 5e-64, Method: Compositional matrix adjust.
Identities = 139/318 (43%), Positives = 186/318 (58%), Gaps = 18/318 (5%)
Query: 45 IELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT----SYWLGLNEFADMS 100
+ LF++W + K Y+ +EE+ + + N I + N + + SY L +NE+ D++
Sbjct: 26 VSLFQTWKNLWKKVYQTVEEEEQKMATWFNNWNKISEHNMQYSLKQKSYRLEMNEYGDLT 85
Query: 101 HEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKA------LPKSVDWRKKGAVTPVKNQGSC 154
EEF + G + +R+ + +Y ++ + LP VDWRK G VTPVKNQG C
Sbjct: 86 SEEFSSMMNGYRNDIRLKRKSTGGSTYLNLLSFGSQIQLPTLVDWRKHGLVTPVKNQGQC 145
Query: 155 GSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDT-SFNNGCNGGLMDYAFKYIVASGG 213
GSCW+FS ++EG ++ +G L SLSEQ LIDC T N+GCNGGLMD AFKYI GG
Sbjct: 146 GSCWSFSATGSLEGQHKKKTGKLVSLSEQNLIDCSTPEGNDGCNGGLMDQAFKYIKIQGG 205
Query: 214 LHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAIEASG 272
+ E YPY ++ TC + T +G+ D+ DE+ L +A A P+SVAI+AS
Sbjct: 206 IDTEAYYPYEAKDDTCRFNITD-SGATDTGFVDIKSGDEEMLKEAAATVGPISVAIDASH 264
Query: 273 TDFQFYSGGVF--TGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRN 330
T FQFYS GV+ T LDHGV VGYG G DY +VKNSWG WGE GYI+M RN
Sbjct: 265 TSFQFYSNGVYSETACSSTMLDHGVLVVGYGTENGKDYWLVKNSWGEGWGEAGYIKMSRN 324
Query: 331 TGKPEGLCGINKMASIPL 348
+ CGI AS PL
Sbjct: 325 ---ADNQCGIATQASYPL 339
>gi|356517398|ref|XP_003527374.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 333
Score = 250 bits (638), Expect = 7e-64, Method: Compositional matrix adjust.
Identities = 146/312 (46%), Positives = 189/312 (60%), Gaps = 23/312 (7%)
Query: 44 LIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTS-YWLGLNEFADMSHE 102
+ E E M+++ K YK E F N+ +I+ N Y G+N+F
Sbjct: 35 MYERHEQRMTRYSKVYKDPPES------FXGNVNYIEACNNAADKPYKXGINQFPP---- 84
Query: 103 EFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTP--VKNQGSCGSCWAF 160
+N++ G R F + +V A P +VD R+KGAVTP VK+QG CG WA
Sbjct: 85 --RNRFKGHMCSSIIR---ITTFKFENVTATPSTVDCRQKGAVTPYTVKDQGQCGCFWAL 139
Query: 161 STVAAVEGINQIVSGNLTSLS-EQELIDCDT-SFNNGCNGGLMDYAFKYIVASGGLHKEE 218
S VAA EGI+ + +G L LS E EL+DCDT + GC GGL D AFK+I+ + GL+ E
Sbjct: 140 SAVAATEGIHALXAGKLILLSXEPELVDCDTKGVDQGCEGGLTDDAFKFIIQNHGLNTEA 199
Query: 219 DYPYLMEEGTCEDKKEEMEVVTI-SGYQDVPENDEQS-LLKALAHQPVSVAIEASGTDFQ 276
+YPY +G C + + TI +GY DVP N+E++ L KA+A+ PVSVAI+ASG+DFQ
Sbjct: 200 NYPYKGVDGKCNANEADKNAATIITGYDDVPANNEKAHLQKAVANNPVSVAIDASGSDFQ 259
Query: 277 FYSGGVFTGPCGAELDHGVAAVGYGKS-KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPE 335
FY GVFTG CG ELDHGV AVGYG S G++Y +VKNS GP+WGE GYIRM+R E
Sbjct: 260 FYKSGVFTGSCGTELDHGVTAVGYGVSDDGTEYWLVKNSRGPEWGEEGYIRMQRGVDSEE 319
Query: 336 GLCGINKMASIP 347
LCGI AS P
Sbjct: 320 ALCGIAVQASYP 331
>gi|326497561|dbj|BAK05870.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 340
Score = 250 bits (638), Expect = 9e-64, Method: Compositional matrix adjust.
Identities = 136/307 (44%), Positives = 183/307 (59%), Gaps = 12/307 (3%)
Query: 44 LIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEV-TSYWLGLNEFADMSHE 102
+++ F W + H ++Y EE+L RFE+++ N+++ID N+ +Y LG N+FAD++ E
Sbjct: 41 MMDRFRQWQATHNRSYLSAEERLRRFEVYRTNVEYIDATNRRGGLTYELGENQFADLTGE 100
Query: 103 EFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGS-CGSCWAFS 161
EF +Y G A+ S P SVDWR KGAVTPVKNQGS C SCWAFS
Sbjct: 101 EFLARYAGGHTGSAITTAAEADGSLEADP--PASVDWRAKGAVTPVKNQGSQCYSCWAFS 158
Query: 162 TVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYP 221
VA +E + I +G L +LSEQ+L+DCD ++ GCN G AF++I+ +GG+ YP
Sbjct: 159 AVATMESLYFIKTGKLVALSEQQLVDCD-KYDGGCNKGYYHRAFQWIMENGGITTAAQYP 217
Query: 222 YLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGG 281
Y G C K VTI+G+ V +N E +L A+A QP+ VAIE QFY G
Sbjct: 218 YKAVRGACSAAK---PAVTITGHLAVAKN-ELALQSAVARQPIGVAIEVP-ISMQFYKSG 272
Query: 282 VFTGPCGAELDHGVAAVGYGK-SKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGI 340
VF+ CG ++ H V VGYG + G Y +VKNSWG WGE GYIRM+R+ G GLCGI
Sbjct: 273 VFSAACGIQMSHAVVTVGYGADASGLKYWLVKNSWGQTWGEAGYIRMRRDVGG-GGLCGI 331
Query: 341 NKMASIP 347
+ P
Sbjct: 332 ALDTAYP 338
>gi|340370270|ref|XP_003383669.1| PREDICTED: cathepsin L1-like [Amphimedon queenslandica]
Length = 326
Score = 249 bits (637), Expect = 9e-64, Method: Compositional matrix adjust.
Identities = 136/306 (44%), Positives = 187/306 (61%), Gaps = 12/306 (3%)
Query: 48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRN--KEVTSYWLGLNEFADMSHEEFK 105
F+ W K+ K Y+ E +L R I++ N K ++ N + + + +NEFAD+ EF
Sbjct: 23 FQDWKVKYNKVYETKETELERQIIWESNKKFVENHNANSDKFGFTVAMNEFADLDAGEFA 82
Query: 106 NKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAA 165
N Y GL P+ P + F V ++ +VDWR+KGAVT VKNQG CGSCW+FS+ +
Sbjct: 83 NIYNGLLPR-PASYNSTKLFKKTGV-SVGDTVDWREKGAVTEVKNQGKCGSCWSFSSTGS 140
Query: 166 VEGINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKEEDYPYLM 224
+EG + + +G L+SLSEQ+L+DC TSF N+GC GGLMD +F+Y+ G EE YPY
Sbjct: 141 LEGQHFLKTGTLSSLSEQQLMDCSTSFGNHGCKGGLMDNSFRYLETVAGDMSEEMYPYTA 200
Query: 225 EEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAIEASGTDFQFYSGGVF 283
E+G C + E + +GY+D+P DE +L +A+A P+SVAI+A FQ Y G++
Sbjct: 201 EDGFCRYRSSEA-IAKDTGYKDIPRGDEDALKEAVATVGPISVAIDAGHRSFQLYHEGIY 259
Query: 284 TGPC--GAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGIN 341
P +LDHGV AVGYG +G +Y +VKNSWGP WG GY+ M RN E CGI
Sbjct: 260 YEPACSSTKLDHGVLAVGYGTGEGEEYWLVKNSWGPSWGNEGYVMMSRNR---ENNCGIA 316
Query: 342 KMASIP 347
AS P
Sbjct: 317 TQASYP 322
>gi|449683741|ref|XP_002155462.2| PREDICTED: cathepsin L-like [Hydra magnipapillata]
Length = 324
Score = 249 bits (637), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 144/324 (44%), Positives = 186/324 (57%), Gaps = 20/324 (6%)
Query: 31 VGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYW 90
+ Y E T D I W H K Y E+ R+ I+K+N + I + N + +
Sbjct: 14 LAYIIERPTEDDSWIR----WKMAHNKAYSHDGEETVRYTIWKDNERRIREHNLQGGDFL 69
Query: 91 LGLNEFADMSHEEFK--NKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPV 148
L +N+F DM++ EFK N YL K + F + P SVDWR +G VTPV
Sbjct: 70 LEMNQFGDMTNNEFKDFNGYLSHK------HVSGSTFLTPNSFVAPDSVDWRNEGYVTPV 123
Query: 149 KNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKY 207
K+QG CGSCWAFST ++EG N +G L SLSEQ L+DC T++ NNGCNGGLMD AF Y
Sbjct: 124 KDQGQCGSCWAFSTTGSLEGQNFKKTGKLVSLSEQNLVDCSTAYGNNGCNGGLMDNAFTY 183
Query: 208 IVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSV 266
I + G+ E YPY ++G C K + T +G+ D+P DE L +A+A P+SV
Sbjct: 184 IKENNGIDSEASYPYTAKDGKCAFTKPNV-AATDTGFVDIPSGDENKLKEAVASVGPISV 242
Query: 267 AIEASGTDFQFYSGGVFT-GPCGA-ELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGY 324
AI+AS FQFY GV+ C + ELDHGV VGYG G DY +VKNSW WG++GY
Sbjct: 243 AIDASHFSFQFYRKGVYNERKCSSTELDHGVLVVGYGTESGKDYWLVKNSWNTSWGDKGY 302
Query: 325 IRMKRNTGKPEGLCGINKMASIPL 348
I+M RN + CGI AS PL
Sbjct: 303 IKMSRNA---KNQCGIATNASYPL 323
>gi|357446979|ref|XP_003593765.1| Cysteine proteinase [Medicago truncatula]
gi|355482813|gb|AES64016.1| Cysteine proteinase [Medicago truncatula]
Length = 364
Score = 249 bits (637), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 141/291 (48%), Positives = 188/291 (64%), Gaps = 7/291 (2%)
Query: 62 IEEKLHRFEIFKENLKHIDQ-RNKEVTSYWLGLNEFADMSHEEFKNKYLGLK--PQFPTR 118
I E R IFK NL++I+ N SY LGLN+++D++ +EF + GLK Q +
Sbjct: 76 ISELEKRKRIFKNNLEYIENFNNAGNKSYKLGLNQYSDLTSDEFLASHTGLKVSKQLSSS 135
Query: 119 RQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLT 178
+ SA + +P + DWR++GAVT VK+QGSCG CWAFS VAAVEG +I +G L
Sbjct: 136 KMRSAAVPFNLNDDVPTNFDWRQQGAVTDVKDQGSCGCCWAFSVVAAVEGAVKINTGELI 195
Query: 179 SLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEV 238
SLSEQ+L+DCD N+GC+GG MD AFKYI+ G+ E DYPY TC+ +
Sbjct: 196 SLSEQQLVDCDER-NSGCHGGNMDSAFKYIIQK-GIVSEADYPYQEGSQTCQLNDQMKFE 253
Query: 239 VTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAV 298
I+ + DVP NDEQ LL+A+A QPVSV IE G +FQ Y G V++G CG ++H V AV
Sbjct: 254 AQITNFIDVPANDEQQLLQAVAQQPVSVGIEV-GDEFQHYMGDVYSGTCGQSMNHAVTAV 312
Query: 299 GYGKSK-GSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPL 348
GYG S+ G+ Y ++KNSWG WGE GY+++ R +G+P G CGI AS P+
Sbjct: 313 GYGVSEDGTKYWLIKNSWGKGWGEEGYMKLLRESGEPGGQCGIAAHASYPI 363
>gi|391338876|ref|XP_003743781.1| PREDICTED: cathepsin L-like isoform 4 [Metaseiulus occidentalis]
Length = 336
Score = 249 bits (636), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 150/328 (45%), Positives = 196/328 (59%), Gaps = 15/328 (4%)
Query: 30 IVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRN----KE 85
+VG + LT ++++ H K Y+ + R +IF +N I + N K
Sbjct: 14 LVGAASAALTLEQLFDAEWQNFKVHHNKKYEGSTVEAFRKKIFLQNTHLIARHNIKHAKG 73
Query: 86 VTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAV 145
T+Y L +N+F DM H EF + GL R + + + +LPKSVDWR+KGAV
Sbjct: 74 ETTYKLKMNQFGDMLHHEFVSTMNGLLRS--NRTYFGSTWIEPESVSLPKSVDWREKGAV 131
Query: 146 TPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYA 204
TPVKNQG CGSCW+FST A+EG +G L SLSEQ LIDC TS+ NNGC GGLMD A
Sbjct: 132 TPVKNQGHCGSCWSFSTTGALEGQLFRKTGELVSLSEQNLIDCSTSYGNNGCGGGLMDNA 191
Query: 205 FKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QP 263
F YI + G+ EE YPY ++G C KE+ +G+ D+P +E++L KALA P
Sbjct: 192 FTYIKENHGIDTEESYPYEGKQGKCRYHKED-SAGRDTGFVDIPSGNERALAKALATIGP 250
Query: 264 VSVAIEASGTDFQFYSGGVFTGP-CGAE-LDHGVAAVGYGKS-KGSDYIIVKNSWGPKWG 320
VSVAI+AS FQFY GV+ P C + LDHGV AVGYG + G DY I+KNSWG +WG
Sbjct: 251 VSVAIDASHESFQFYHEGVYNPPDCDSHSLDHGVLAVGYGTTDDGQDYYIIKNSWGERWG 310
Query: 321 ERGYIRMKRNTGKPEGLCGINKMASIPL 348
+ GY+ M RN+ + CG+ AS PL
Sbjct: 311 QEGYVLMARNS---KNECGVATQASYPL 335
>gi|294938848|ref|XP_002782226.1| cathepsin L, putative [Perkinsus marinus ATCC 50983]
gi|239893730|gb|EER14021.1| cathepsin L, putative [Perkinsus marinus ATCC 50983]
Length = 334
Score = 249 bits (636), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 128/288 (44%), Positives = 176/288 (61%), Gaps = 9/288 (3%)
Query: 48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNK 107
F + K GK Y+ EE++ R IF+ +L +I+Q N + SY LG+NE AD++HEEF
Sbjct: 28 FMGFQHKFGKNYESKEEEIKRNAIFRAHLHYIEQVNAKNLSYKLGVNEHADLTHEEFAAL 87
Query: 108 YLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVE 167
LG + +R D L SVDWR KG +TP+K+QG CGSCWAFS A+E
Sbjct: 88 KLGTSSKMSMKRDDKLVVK-ADTTQLLTSVDWRSKGVLTPIKDQGPCGSCWAFSATGALE 146
Query: 168 GINQIVSGNLTSLSEQELIDCDTSFNN-GCNGGLMDYAFKYIVASGGLHKEEDYPYLMEE 226
I +G L SLSEQ+LIDC +S+ N GC+GGLM+ A+ YI S GL +E YPY+ +
Sbjct: 147 AQYAIATGKLLSLSEQQLIDCSSSYGNEGCSGGLMENAYTYI-KSAGLDQESTYPYIAKN 205
Query: 227 GTC----EDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGV 282
C E + + + ++G+ + + EQ L+KALA PVS+A+ AS DF+FY GV
Sbjct: 206 NACQVSLEKRSDGIPAGEVTGFH-MLDQTEQGLMKALADAPVSIAMYASDPDFRFYQSGV 264
Query: 283 FTG-PCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKR 329
++ C +DHGV AVGYG G DY +++NSWG WG+ GY +KR
Sbjct: 265 YSSKTCHGTIDHGVVAVGYGTENGEDYFVIRNSWGSSWGQDGYFYLKR 312
>gi|391338870|ref|XP_003743778.1| PREDICTED: cathepsin L-like isoform 1 [Metaseiulus occidentalis]
gi|391338872|ref|XP_003743779.1| PREDICTED: cathepsin L-like isoform 2 [Metaseiulus occidentalis]
gi|391338874|ref|XP_003743780.1| PREDICTED: cathepsin L-like isoform 3 [Metaseiulus occidentalis]
Length = 331
Score = 249 bits (636), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 150/328 (45%), Positives = 196/328 (59%), Gaps = 15/328 (4%)
Query: 30 IVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRN----KE 85
+VG + LT ++++ H K Y+ + R +IF +N I + N K
Sbjct: 9 LVGAASAALTLEQLFDAEWQNFKVHHNKKYEGSTVEAFRKKIFLQNTHLIARHNIKHAKG 68
Query: 86 VTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAV 145
T+Y L +N+F DM H EF + GL R + + + +LPKSVDWR+KGAV
Sbjct: 69 ETTYKLKMNQFGDMLHHEFVSTMNGLLRS--NRTYFGSTWIEPESVSLPKSVDWREKGAV 126
Query: 146 TPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYA 204
TPVKNQG CGSCW+FST A+EG +G L SLSEQ LIDC TS+ NNGC GGLMD A
Sbjct: 127 TPVKNQGHCGSCWSFSTTGALEGQLFRKTGELVSLSEQNLIDCSTSYGNNGCGGGLMDNA 186
Query: 205 FKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QP 263
F YI + G+ EE YPY ++G C KE+ +G+ D+P +E++L KALA P
Sbjct: 187 FTYIKENHGIDTEESYPYEGKQGKCRYHKED-SAGRDTGFVDIPSGNERALAKALATIGP 245
Query: 264 VSVAIEASGTDFQFYSGGVFTGP-CGAE-LDHGVAAVGYGKS-KGSDYIIVKNSWGPKWG 320
VSVAI+AS FQFY GV+ P C + LDHGV AVGYG + G DY I+KNSWG +WG
Sbjct: 246 VSVAIDASHESFQFYHEGVYNPPDCDSHSLDHGVLAVGYGTTDDGQDYYIIKNSWGERWG 305
Query: 321 ERGYIRMKRNTGKPEGLCGINKMASIPL 348
+ GY+ M RN+ + CG+ AS PL
Sbjct: 306 QEGYVLMARNS---KNECGVATQASYPL 330
>gi|269784818|ref|NP_001161481.1| cathepsin L1 precursor [Gallus gallus]
Length = 353
Score = 249 bits (635), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 150/358 (41%), Positives = 207/358 (57%), Gaps = 24/358 (6%)
Query: 8 KLLLLSLSLSLFA-----CSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCI 62
++L L+ LS F + L+ + +P +D +L++SW H K Y
Sbjct: 2 RVLFLARRLSRFVNMNVCLTILSLCLGLAFAAPRVDPDLDSHWQLWKSW---HSKDYHER 58
Query: 63 EEKLHRFEIFKENLKHIDQRNKEVT----SYWLGLNEFADMSHEEFKNKYLGLKPQFPTR 118
EE R ++++NLK I+ N + + SY LG+N+F DM+ EEF+ G K + R
Sbjct: 59 EESWRRV-VWEKNLKMIELHNLDHSLGKHSYKLGMNQFGDMTAEEFRQLMNGYKHKKSER 117
Query: 119 RQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLT 178
+ ++F P+SVDWR+KG VTPVK+QG CGSCWAFST A+EG + +G L
Sbjct: 118 KYRGSQFLEPSFLEAPRSVDWREKGYVTPVKDQGQCGSCWAFSTTGALEGQHFRKTGKLV 177
Query: 179 SLSEQELIDCDT-SFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEME 237
SLSEQ L+DC N GCNGGLMD AF+Y+ +GG+ EE YPY ++ K E
Sbjct: 178 SLSEQNLVDCSRPEGNQGCNGGLMDQAFQYVQDNGGIDSEESYPYTAKDDEDCRYKAEYN 237
Query: 238 VVTISGYQDVPENDEQSLLKALAH-QPVSVAIEASGTDFQFYSGGVFTGP-CGAE-LDHG 294
+G+ D+P+ E++L+KA+A PVSVAI+A + FQFY G++ P C +E LDHG
Sbjct: 238 AANDTGFVDIPQGHERALMKAVASVGPVSVAIDAGHSSFQFYQSGIYYEPDCSSEDLDHG 297
Query: 295 VAAVGYG----KSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPL 348
V VGYG G Y IVKNSWG KWG++GYI M ++ + CGI AS PL
Sbjct: 298 VLVVGYGFEGEDVDGKKYWIVKNSWGEKWGDKGYIYMAKDR---KNHCGIATAASYPL 352
>gi|195583187|ref|XP_002081405.1| GD10995 [Drosophila simulans]
gi|194193414|gb|EDX06990.1| GD10995 [Drosophila simulans]
Length = 341
Score = 249 bits (635), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 144/323 (44%), Positives = 197/323 (60%), Gaps = 21/323 (6%)
Query: 42 DKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEV----TSYWLGLNEFA 97
D ++E + ++ +H K Y+ E+ R +IF EN I + N+ S+ L +N++A
Sbjct: 23 DVVMEEWHTFKLEHRKNYQDDTEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYA 82
Query: 98 DMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVK-------ALPKSVDWRKKGAVTPVKN 150
D+ H EF+ G + + + E S++ V LPKSVDWR KGAVT VK+
Sbjct: 83 DLLHHEFRQLMNGFNYTLHKQLRAADE-SFKGVTFISPAHVTLPKSVDWRTKGAVTAVKD 141
Query: 151 QGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIV 209
QG CGSCWAFS+ A+EG + SG L SLSEQ L+DC T + NNGCNGGLMD AF+YI
Sbjct: 142 QGHCGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIK 201
Query: 210 ASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAI 268
+GG+ E+ YPY + +C K + T G+ D+P+ DE+ + +A+A PVSVAI
Sbjct: 202 DNGGIDTEKSYPYEAIDDSCHFNKGTIG-ATDRGFTDIPQGDEKKMAEAVATVGPVSVAI 260
Query: 269 EASGTDFQFYSGGVFTGP-CGAE-LDHGVAAVGYGKSK-GSDYIIVKNSWGPKWGERGYI 325
+AS FQFYS GV+ P C A+ LDHGV VG+G + G DY +VKNSWG WG++G+I
Sbjct: 261 DASHESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESGDDYWLVKNSWGTTWGDKGFI 320
Query: 326 RMKRNTGKPEGLCGINKMASIPL 348
+M RN E CGI +S PL
Sbjct: 321 KMLRN---KENQCGIASASSYPL 340
>gi|94448674|emb|CAI91575.1| cathepsin L2 [Lubomirskia baicalensis]
Length = 324
Score = 249 bits (635), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 140/307 (45%), Positives = 187/307 (60%), Gaps = 11/307 (3%)
Query: 48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKE--VTSYWLGLNEFADMSHEEFK 105
+W ++HGK+Y+ +E++ R ++ N K+ID+ N+ V Y L +N+F D+ + EFK
Sbjct: 22 LRAWKAEHGKSYRNHKEEMLRHVTWQANKKYIDEHNQHAGVFGYTLKMNQFGDLENSEFK 81
Query: 106 NKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAA 165
+ Y G + R+ + R V+ LP SVDW KKG VTPVKNQG CGSCW+FS +
Sbjct: 82 SLYNGYRMSNAPRKGKPFVPAAR-VQDLPASVDWSKKGWVTPVKNQGQCGSCWSFSATGS 140
Query: 166 VEGINQIVSGNLTSLSEQELIDCDTS-FNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLM 224
+EG + +G L SLSEQ L+DC + N+GCNGGLMD AF+Y++ + G+ E YPY
Sbjct: 141 MEGQHFNATGTLMSLSEQNLVDCSAAEGNHGCNGGLMDDAFEYVIKNNGIDTEASYPYRA 200
Query: 225 EEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAIEASGTDFQFYSGGVF 283
+ TC+ ++ TISGY DV ++ E L A+A PVSVAI+AS FQFYS GV+
Sbjct: 201 VDSTCKFNTADVG-ATISGYVDVTKDSESDLQVAVATIGPVSVAIDASHISFQFYSSGVY 259
Query: 284 TG-PCGA-ELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGIN 341
C + LDHGV AVGYG DY +VKNSWG WG GYI M RN CGI
Sbjct: 260 DPLICSSTNLDHGVLAVGYGTDGSKDYWLVKNSWGASWGMSGYIEMVRNHNNK---CGIA 316
Query: 342 KMASIPL 348
AS P+
Sbjct: 317 TSASYPV 323
>gi|294885989|ref|XP_002771502.1| thiolproteinase SmTP1, putative [Perkinsus marinus ATCC 50983]
gi|239875206|gb|EER03318.1| thiolproteinase SmTP1, putative [Perkinsus marinus ATCC 50983]
Length = 337
Score = 249 bits (635), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 138/319 (43%), Positives = 191/319 (59%), Gaps = 26/319 (8%)
Query: 48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNK 107
F + KHGK+Y EE++ R IF +NL +I++ N + SY LG+NE+ D++ EEF
Sbjct: 27 FIGFQKKHGKSYDNKEEEMKRAAIFHDNLNYIEEVNAQNLSYKLGVNEYTDLTLEEFAAL 86
Query: 108 YL-------GLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAF 160
L G+ F P+ LP SVDWRKKG + PVK+QG CGSCWAF
Sbjct: 87 KLSSTDMSEGMGDGFVAGAGPT-------TTTLPTSVDWRKKGVLNPVKDQGYCGSCWAF 139
Query: 161 STVAAVEGINQIVSGNLTSLSEQELIDCDTSFNN-GCNGGLMDYAFKYIVASGGLHKEED 219
S + A+E I +G L SLSEQ+L+DC ++ N GCNGGLMD AF+YI A+ G+ KE
Sbjct: 140 SAIGALEPRYAIATGKLLSLSEQQLVDCAGAYGNEGCNGGLMDKAFEYIKAT-GVDKEST 198
Query: 220 YPYLMEEGTC----EDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDF 275
YPY+ + TC E+K + + V ++G Q + + E++L++ +A PVS+A+ A+ F
Sbjct: 199 YPYVGSDETCQATVENKTDGLPVGEVTGNQMLHQT-EKALMEGVAAAPVSIAMYANLQSF 257
Query: 276 QFYSGGVFTGP-C---GAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNT 331
Q Y GV++ P C G +DHGV AVGYG G DY I++NSWG WG+ GY+ +KR
Sbjct: 258 QHYKSGVYSDPNCNAKGGSIDHGVVAVGYGTENGQDYFIIRNSWGRSWGQDGYVYLKRGV 317
Query: 332 GKPEGLCGINKMASIPLKK 350
G G C I K +P K
Sbjct: 318 GS-FGQCNIYKYMCVPTLK 335
>gi|255522980|gb|ACU12382.1| RE21773p [Drosophila melanogaster]
Length = 375
Score = 249 bits (635), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 144/323 (44%), Positives = 197/323 (60%), Gaps = 21/323 (6%)
Query: 42 DKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEV----TSYWLGLNEFA 97
D ++E + ++ +H K Y+ E+ R +IF EN I + N+ S+ L +N++A
Sbjct: 57 DVVMEEWHTFKLEHRKNYQDETEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYA 116
Query: 98 DMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVK-------ALPKSVDWRKKGAVTPVKN 150
D+ H EF+ G + + + E S++ V LPKSVDWR KGAVT VK+
Sbjct: 117 DLLHHEFRQLMNGFNYTLHKQLRAADE-SFKGVTFISPAHVTLPKSVDWRTKGAVTAVKD 175
Query: 151 QGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIV 209
QG CGSCWAFS+ A+EG + SG L SLSEQ L+DC T + NNGCNGGLMD AF+YI
Sbjct: 176 QGHCGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIK 235
Query: 210 ASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAI 268
+GG+ E+ YPY + +C K + T G+ D+P+ DE+ + +A+A PVSVAI
Sbjct: 236 DNGGIDTEKSYPYEAIDDSCHFNKGTVG-ATDRGFTDIPQGDEKKMAEAVATVGPVSVAI 294
Query: 269 EASGTDFQFYSGGVFTGP-CGAE-LDHGVAAVGYGKSK-GSDYIIVKNSWGPKWGERGYI 325
+AS FQFYS GV+ P C A+ LDHGV VG+G + G DY +VKNSWG WG++G+I
Sbjct: 295 DASHESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESGEDYWLVKNSWGTTWGDKGFI 354
Query: 326 RMKRNTGKPEGLCGINKMASIPL 348
+M RN E CGI +S PL
Sbjct: 355 KMLRN---KENQCGIASASSYPL 374
>gi|66378053|gb|AAY45871.1| cathepsin L-like cysteine proteinase [Longidorus elongatus]
Length = 358
Score = 248 bits (634), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 147/313 (46%), Positives = 189/313 (60%), Gaps = 24/313 (7%)
Query: 54 KHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT----SYWLGLNEFADMSHEEFKNKYL 109
KH K+YK +E+L RF++F N K I+Q N E S+ L LN+FADM++ EF+ +
Sbjct: 49 KHAKSYKTKDEELLRFQVFASNHKVIEQHNIEYEAGQHSFALSLNKFADMTNAEFRQRMN 108
Query: 110 GLKPQFPTRR-----QPSAE----FSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAF 160
G K P +R QP E F D +P SVDWRK+G VT VK+QGSCGSCWAF
Sbjct: 109 GFK--LPAKRKLAKSQPLKEDGMIFEMPDNVTIPDSVDWRKEGYVTKVKDQGSCGSCWAF 166
Query: 161 STVAAVEGINQIVSGNLTSLSEQELIDCDTSFNN-GCNGGLMDYAFKYIVASGGLHKEED 219
S ++EG + +G L SLSEQ L+DCD + ++ GCNGG MD AF+Y+ + G+ E
Sbjct: 167 SATGSLEGQHYKQTGKLVSLSEQNLVDCDVNGDDEGCNGGYMDGAFQYVETNKGIDTEAS 226
Query: 220 YPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQ-PVSVAIEASGTDFQFY 278
YPY +G C K E++ T +G+ D+PE +E L A+A PVSVAI+A+ FQFY
Sbjct: 227 YPYKGRDGRCRFKSEDVG-ATDTGFVDIPEGNETLLEAAIATVGPVSVAIDAASFKFQFY 285
Query: 279 SGGVFTG-PCGAE-LDHGVAAVGYGKSK-GSDYIIVKNSWGPKWGERGYIRMKRNTGKPE 335
S GV+ C E LDHGV AVGY +K G Y IVKNSW WG+ GYI M R +
Sbjct: 286 SHGVYYDRSCSPEYLDHGVLAVGYNSTKDGKQYYIVKNSWSEDWGDDGYILMSR---RKN 342
Query: 336 GLCGINKMASIPL 348
CGI MAS P
Sbjct: 343 NNCGIATMASYPF 355
>gi|24653514|ref|NP_523735.2| cysteine proteinase-1, isoform C [Drosophila melanogaster]
gi|118572624|sp|Q95029.2|CATL_DROME RecName: Full=Cathepsin L; AltName: Full=Cysteine proteinase 1;
Contains: RecName: Full=Cathepsin L heavy chain;
Contains: RecName: Full=Cathepsin L light chain; Flags:
Precursor
gi|21627209|gb|AAM68565.1| cysteine proteinase-1, isoform C [Drosophila melanogaster]
Length = 371
Score = 248 bits (634), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 144/323 (44%), Positives = 197/323 (60%), Gaps = 21/323 (6%)
Query: 42 DKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEV----TSYWLGLNEFA 97
D ++E + ++ +H K Y+ E+ R +IF EN I + N+ S+ L +N++A
Sbjct: 53 DVVMEEWHTFKLEHRKNYQDETEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYA 112
Query: 98 DMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVK-------ALPKSVDWRKKGAVTPVKN 150
D+ H EF+ G + + + E S++ V LPKSVDWR KGAVT VK+
Sbjct: 113 DLLHHEFRQLMNGFNYTLHKQLRAADE-SFKGVTFISPAHVTLPKSVDWRTKGAVTAVKD 171
Query: 151 QGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIV 209
QG CGSCWAFS+ A+EG + SG L SLSEQ L+DC T + NNGCNGGLMD AF+YI
Sbjct: 172 QGHCGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIK 231
Query: 210 ASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAI 268
+GG+ E+ YPY + +C K + T G+ D+P+ DE+ + +A+A PVSVAI
Sbjct: 232 DNGGIDTEKSYPYEAIDDSCHFNKGTVG-ATDRGFTDIPQGDEKKMAEAVATVGPVSVAI 290
Query: 269 EASGTDFQFYSGGVFTGP-CGAE-LDHGVAAVGYGKSK-GSDYIIVKNSWGPKWGERGYI 325
+AS FQFYS GV+ P C A+ LDHGV VG+G + G DY +VKNSWG WG++G+I
Sbjct: 291 DASHESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESGEDYWLVKNSWGTTWGDKGFI 350
Query: 326 RMKRNTGKPEGLCGINKMASIPL 348
+M RN E CGI +S PL
Sbjct: 351 KMLRN---KENQCGIASASSYPL 370
>gi|242040563|ref|XP_002467676.1| hypothetical protein SORBIDRAFT_01g032090 [Sorghum bicolor]
gi|241921530|gb|EER94674.1| hypothetical protein SORBIDRAFT_01g032090 [Sorghum bicolor]
Length = 358
Score = 248 bits (634), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 135/310 (43%), Positives = 192/310 (61%), Gaps = 10/310 (3%)
Query: 44 LIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT-SYWLGLNEFADMSHE 102
+++ F W + + ++Y EE+ RF++++ N++HI+ N+ +Y LG N+FAD++ E
Sbjct: 53 MMDRFLRWQATYNRSYPTAEERQRRFQVYRRNMEHIEATNRAGNLTYTLGENQFADLTEE 112
Query: 103 EFKNKYLGLKPQFPTRRQPS--AEFSYRDVKALPKSVDWRKKGAVTPVKNQG-SCGSCWA 159
EF + Y +K P RR + ++ V P SVDWR +GAVTP+KNQG SC SCWA
Sbjct: 113 EFLDLYT-MKGMPPVRRDAGKKQQANFSSVVDAPTSVDWRSRGAVTPIKNQGPSCSSCWA 171
Query: 160 FSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEED 219
F T A +E I QI +G L SLSEQELIDCD ++ GCN G +K+++ +GGL E +
Sbjct: 172 FVTAATIESITQIRTGKLVSLSEQELIDCD-PYDGGCNLGYFVNGYKWVIQNGGLTTEAN 230
Query: 220 YPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYS 279
YPY C K IS Y+ +P+ E L +A+A QPV+ AIE G+ QFYS
Sbjct: 231 YPYQARRYQCNRSKAGQRAARISNYRQLPQG-EAQLQQAVAQQPVAAAIEMGGS-LQFYS 288
Query: 280 GGVFTGPCGAELDHGVAAVGYGK-SKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLC 338
GGV++G CG ++H + VGYG S G Y +VKNSWG WGERGY+RM+++ + GLC
Sbjct: 289 GGVWSGQCGTRMNHAITVVGYGADSSGVKYWLVKNSWGQTWGERGYLRMRKDV-RQGGLC 347
Query: 339 GINKMASIPL 348
GI + P+
Sbjct: 348 GIALDLAYPI 357
>gi|6630972|gb|AAF19630.1|AF194426_1 cysteine proteinase precursor [Myxine glutinosa]
Length = 324
Score = 248 bits (634), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 137/310 (44%), Positives = 188/310 (60%), Gaps = 14/310 (4%)
Query: 48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRN----KEVTSYWLGLNEFADMSHEE 103
+ESW K+GK+Y E++ R +++ NL+ + Q N + +Y LG+N +AD+ +EE
Sbjct: 19 WESWKGKYGKSYLGRGEEVLRKRVWESNLQIVQQHNVLADQGQANYRLGMNTYADLYNEE 78
Query: 104 FKNKYLGLKPQFPTRRQPSAE-FSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFST 162
F G + Q S + F LP SVDWR +G VTPVK+QG CGSCW+FS
Sbjct: 79 FM-ALKGSSGILQAKDQSSTQTFKPLVGVTLPSSVDWRNQGYVTPVKDQGQCGSCWSFSA 137
Query: 163 VAAVEGINQIVSGNLTSLSEQELIDCDTSFNN-GCNGGLMDYAFKYIVASGGLHKEEDYP 221
++EG + +G L SLSEQ+L+DC S+ N GC+GGLM+ A+ YI +GG+ E YP
Sbjct: 138 TGSLEGQHFAKTGTLVSLSEQQLVDCSWSYGNYGCSGGLMESAYDYIRDAGGVQLESAYP 197
Query: 222 YLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAIEASGTDFQFYSG 280
Y + G C + + V T +G+ +P DEQSL++A+ PV+VAI+ASG DFQ Y
Sbjct: 198 YTAQNGRCHFDQSKA-VATCTGHVAIPSGDEQSLMQAVGTVGPVAVAIDASGYDFQLYES 256
Query: 281 GVF--TGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLC 338
GV+ + + LDHGV A GYG G+DY +VKNSWGP WG +GYI+M RN C
Sbjct: 257 GVYDRSRCSSSSLDHGVLAAGYGTEGGNDYWLVKNSWGPGWGAQGYIKMSRNKSNQ---C 313
Query: 339 GINKMASIPL 348
GI MA PL
Sbjct: 314 GIATMACYPL 323
>gi|413953050|gb|AFW85699.1| hypothetical protein ZEAMMB73_033873 [Zea mays]
Length = 361
Score = 248 bits (634), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 155/363 (42%), Positives = 205/363 (56%), Gaps = 38/363 (10%)
Query: 1 MAFFSHSKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYK 60
MA S S L++ LFACS L + G + T L+E F++W +++ +TY
Sbjct: 3 MATASASLALVM-----LFACSLL-----LAGTAFSDDTIAIPLLERFKAWQAEYNRTYA 52
Query: 61 CIEEKLHRFEIFKENLKHIDQRNKEVT--SYWLGLNEFADMSHEEFKNKYLGLKPQFPTR 118
EE RF ++ ENL+ I N+ T SY LG N+F D++ EEFK+ YL + P
Sbjct: 53 TPEEFQQRFMVYSENLRFIKTMNQLSTGSSYELGENQFTDLTEEEFKDTYLMKLDEQP-- 110
Query: 119 RQPSAEFSYRDVKAL--------------PKSVDWRKKGAVTPVKNQGSCGSCWAFSTVA 164
P+AE V + P SVDWR KGAVTPVKNQ CGSCWAF+TVA
Sbjct: 111 --PAAEAMPPIVGTMSTAGMSNGDNTGEAPNSVDWRTKGAVTPVKNQQQCGSCWAFATVA 168
Query: 165 AVEGINQIVSGNLTSLSEQELIDCDTSFNN-GCNGGLMDYAFKYIVASGGLHKEEDYPYL 223
++EG++QI +G L SLSEQE++DCD N+ GC GG A +++ +GGL E DYPY+
Sbjct: 169 SIEGVHQIKTGRLVSLSEQEIVDCDRGGNDHGCRGGYPRSAMEWVTRNGGLTTESDYPYV 228
Query: 224 MEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVF 283
+ C K I GYQ V +E L +A+A +PV+V I+AS FQFY GVF
Sbjct: 229 GSQRQCMSGKLGHHAARIRGYQAVQRKNEAELERAVAGRPVAVVIDASRA-FQFYKRGVF 287
Query: 284 TGPCG-AELDHGVAAVGYGKSKGS-----DYIIVKNSWGPKWGERGYIRMKRNTGKPEGL 337
+GPC ++H V VGYG + Y IVKNSWG +WGE GY+RM R EG+
Sbjct: 288 SGPCNTTTVNHAVTVVGYGSAGSDSGGGRKYWIVKNSWGQRWGENGYVRMARRVRAREGM 347
Query: 338 CGI 340
C I
Sbjct: 348 CAI 350
>gi|24653516|ref|NP_725347.1| cysteine proteinase-1, isoform A [Drosophila melanogaster]
gi|24653518|ref|NP_725348.1| cysteine proteinase-1, isoform B [Drosophila melanogaster]
gi|1658527|gb|AAB18345.1| cysteine proteinase 1 [Drosophila melanogaster]
gi|2305221|gb|AAB65749.1| cysteine proteinase-1 [Drosophila melanogaster]
gi|7303249|gb|AAF58311.1| cysteine proteinase-1, isoform A [Drosophila melanogaster]
gi|21627210|gb|AAM68566.1| cysteine proteinase-1, isoform B [Drosophila melanogaster]
gi|54650754|gb|AAV36956.1| LP06554p [Drosophila melanogaster]
gi|220951982|gb|ACL88534.1| Cp1-PA [synthetic construct]
Length = 341
Score = 248 bits (634), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 144/323 (44%), Positives = 197/323 (60%), Gaps = 21/323 (6%)
Query: 42 DKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEV----TSYWLGLNEFA 97
D ++E + ++ +H K Y+ E+ R +IF EN I + N+ S+ L +N++A
Sbjct: 23 DVVMEEWHTFKLEHRKNYQDETEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYA 82
Query: 98 DMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVK-------ALPKSVDWRKKGAVTPVKN 150
D+ H EF+ G + + + E S++ V LPKSVDWR KGAVT VK+
Sbjct: 83 DLLHHEFRQLMNGFNYTLHKQLRAADE-SFKGVTFISPAHVTLPKSVDWRTKGAVTAVKD 141
Query: 151 QGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIV 209
QG CGSCWAFS+ A+EG + SG L SLSEQ L+DC T + NNGCNGGLMD AF+YI
Sbjct: 142 QGHCGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIK 201
Query: 210 ASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAI 268
+GG+ E+ YPY + +C K + T G+ D+P+ DE+ + +A+A PVSVAI
Sbjct: 202 DNGGIDTEKSYPYEAIDDSCHFNKGTVG-ATDRGFTDIPQGDEKKMAEAVATVGPVSVAI 260
Query: 269 EASGTDFQFYSGGVFTGP-CGAE-LDHGVAAVGYGKSK-GSDYIIVKNSWGPKWGERGYI 325
+AS FQFYS GV+ P C A+ LDHGV VG+G + G DY +VKNSWG WG++G+I
Sbjct: 261 DASHESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESGEDYWLVKNSWGTTWGDKGFI 320
Query: 326 RMKRNTGKPEGLCGINKMASIPL 348
+M RN E CGI +S PL
Sbjct: 321 KMLRN---KENQCGIASASSYPL 340
>gi|255544115|ref|XP_002513120.1| cysteine protease, putative [Ricinus communis]
gi|223548131|gb|EEF49623.1| cysteine protease, putative [Ricinus communis]
Length = 362
Score = 248 bits (634), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 132/269 (49%), Positives = 175/269 (65%), Gaps = 8/269 (2%)
Query: 19 FACSSLAHDFSIVGYSPEHLTSM---DKLIELFESWMSKHGKTYKCIEEKLHRFEIFKEN 75
+ C + A FSI ++ + + + E E WM+ + + YK EK R++IFKEN
Sbjct: 7 YICITFALFFSIGAWTSQCMARTLQEASMYERHEQWMASYARVYKDANEKQMRYKIFKEN 66
Query: 76 LKHIDQRNKEV-TSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALP 134
++ ID N E SY L +N+FAD+++EEFK+ G K + + + F Y +V A+P
Sbjct: 67 VQRIDSFNSESDKSYKLAVNQFADLTNEEFKSLRNGFKGHMCSAQ--AGHFRYENVTAVP 124
Query: 135 KSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDT-SFN 193
S+DWRKKGAVT +K QG CGSCWAFS VAAVEGI +I +G L SLSEQEL+DCDT S +
Sbjct: 125 ASIDWRKKGAVTQIKEQGQCGSCWAFSAVAAVEGITEIKTGKLISLSEQELVDCDTNSED 184
Query: 194 NGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQ 253
GC GGLMD AFK+I GL E YPY + TC+ K+E I+GY+DVP NDE
Sbjct: 185 QGCQGGLMDDAFKFI-EQHGLASEATYPYDAADSTCKTKEEAKPSAKITGYEDVPANDEA 243
Query: 254 SLLKALAHQPVSVAIEASGTDFQFYSGGV 282
+L A+A+QPVSVAI+A G +FQFYS G+
Sbjct: 244 ALKNAVANQPVSVAIDAGGFEFQFYSSGI 272
>gi|357122137|ref|XP_003562772.1| PREDICTED: fruit bromelain-like [Brachypodium distachyon]
Length = 358
Score = 248 bits (634), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 142/324 (43%), Positives = 197/324 (60%), Gaps = 21/324 (6%)
Query: 44 LIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT-SYWLGLNEFADMSHE 102
+++ F ++ + + +TY EE+L RFE+++ N+ +I+ N+ +Y LG N+FAD++ +
Sbjct: 36 MMDRFRAFQATYNRTYASPEERLRRFEVYRRNVDYIEAMNRRGDLTYELGENQFADLTVQ 95
Query: 103 EFKNKY-----LGLKPQFPTRRQ-------PSAEFS---YRDV--KALPKSVDWRKKGAV 145
EF+ Y + +P RRQ P E Y D +A P SVDWR KGAV
Sbjct: 96 EFRAMYTMPARVDSRPDAWRRRQMITTLAGPVTEDGGSYYSDAWEEAGPTSVDWRSKGAV 155
Query: 146 TPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAF 205
TPVK+QG CG CWAF+TVA +EG+++I +G L SLSEQEL+DCD + + C GGL + A
Sbjct: 156 TPVKDQGGCGCCWAFATVATIEGLHKIKTGQLVSLSEQELVDCDDADDG-CGGGLPEIAM 214
Query: 206 KYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVS 265
+++ +GGL E +YPY + G C+ K I+ Q V N E L +A+A QPV+
Sbjct: 215 EWVAHNGGLTTEANYPYTGKAGKCDRGKASNHAAKIAAAQMVRANSEAELERAVARQPVA 274
Query: 266 VAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGK-SKGSDYIIVKNSWGPKWGERGY 324
VAI A + FY GV++GPC AE DH V VGYG +KG Y I+KNSW WGE+GY
Sbjct: 275 VAINAPDS-LMFYKSGVYSGPCTAEFDHAVTVVGYGADNKGHKYWIIKNSWAETWGEKGY 333
Query: 325 IRMKRNTGKPEGLCGINKMASIPL 348
RM+R EGLCGI AS P+
Sbjct: 334 GRMQRGVAAKEGLCGIATHASYPV 357
>gi|300175452|emb|CBK20763.2| unnamed protein product [Blastocystis hominis]
Length = 313
Score = 248 bits (633), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 141/304 (46%), Positives = 188/304 (61%), Gaps = 19/304 (6%)
Query: 48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFK-N 106
F S+ +++GK Y E+ R ++F N++ + N E Y +G FADM++ EF +
Sbjct: 23 FNSFEARYGKNYINAAERAFRQKVFAYNMEWAQKINSEDHPYTVGATPFADMTNTEFAVS 82
Query: 107 KYLG--LKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVA 164
K G LKP+ P E ++VDWR+KGAVTPVKNQ SCGSCWAFS
Sbjct: 83 KLCGCMLKPKMTKPATPIME-------PAAEAVDWREKGAVTPVKNQASCGSCWAFSATG 135
Query: 165 AVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLM 224
A+EG N + +G L SLSEQ+L+DCD ++GC GGLM YAF+Y G+ KEEDYPY
Sbjct: 136 AMEGRNFVANGELISLSEQQLVDCDHQ-SSGCGGGLMTYAFEY-AKKKGMCKEEDYPYHA 193
Query: 225 EEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVF- 283
+ C+D K VV GY++VP D +L +A++ PVSVA+EA FQ Y+GGV
Sbjct: 194 VDEDCKDDK-CTPVVFPKGYEEVPRFDGAALKQAVSQGPVSVAVEADSIVFQMYTGGVID 252
Query: 284 TGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKM 343
+ CG L+HGV AVGY G+DY IVKNSWG WG++GY+++K T G+CGIN+M
Sbjct: 253 SSACGTSLNHGVLAVGY----GADYWIVKNSWGESWGDKGYLKIKY-TESGAGICGINQM 307
Query: 344 ASIP 347
S P
Sbjct: 308 NSYP 311
>gi|413953051|gb|AFW85700.1| hypothetical protein ZEAMMB73_033873 [Zea mays]
Length = 359
Score = 248 bits (633), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 155/363 (42%), Positives = 205/363 (56%), Gaps = 38/363 (10%)
Query: 1 MAFFSHSKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYK 60
MA S S L++ LFACS L + G + T L+E F++W +++ +TY
Sbjct: 3 MATASASLALVM-----LFACSLL-----LAGTAFSDDTIAIPLLERFKAWQAEYNRTYA 52
Query: 61 CIEEKLHRFEIFKENLKHIDQRNKEVT--SYWLGLNEFADMSHEEFKNKYLGLKPQFPTR 118
EE RF ++ ENL+ I N+ T SY LG N+F D++ EEFK+ YL + P
Sbjct: 53 TPEEFQQRFMVYSENLRFIKTMNQLSTGSSYELGENQFTDLTEEEFKDTYLMKLDEQP-- 110
Query: 119 RQPSAEFSYRDVKAL--------------PKSVDWRKKGAVTPVKNQGSCGSCWAFSTVA 164
P+AE V + P SVDWR KGAVTPVKNQ CGSCWAF+TVA
Sbjct: 111 --PAAEAMPPIVGTMSTAGMSNGDNTGEAPNSVDWRTKGAVTPVKNQQQCGSCWAFATVA 168
Query: 165 AVEGINQIVSGNLTSLSEQELIDCDTSFNN-GCNGGLMDYAFKYIVASGGLHKEEDYPYL 223
++EG++QI +G L SLSEQE++DCD N+ GC GG A +++ +GGL E DYPY+
Sbjct: 169 SIEGVHQIKTGRLVSLSEQEIVDCDRGGNDHGCRGGYPRSAMEWVTRNGGLTTESDYPYV 228
Query: 224 MEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVF 283
+ C K I GYQ V +E L +A+A +PV+V I+AS FQFY GVF
Sbjct: 229 GSQRQCMSGKLGHHAARIRGYQAVQRKNEAELERAVAGRPVAVVIDASRA-FQFYKRGVF 287
Query: 284 TGPCG-AELDHGVAAVGYGKSKGS-----DYIIVKNSWGPKWGERGYIRMKRNTGKPEGL 337
+GPC ++H V VGYG + Y IVKNSWG +WGE GY+RM R EG+
Sbjct: 288 SGPCNTTTVNHAVTVVGYGSAGSDSGGGRKYWIVKNSWGQRWGENGYVRMARRVRAREGM 347
Query: 338 CGI 340
C I
Sbjct: 348 CAI 350
>gi|1483570|emb|CAA68066.1| cathepsin l [Litopenaeus vannamei]
Length = 328
Score = 248 bits (632), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 142/315 (45%), Positives = 189/315 (60%), Gaps = 17/315 (5%)
Query: 44 LIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNK-----EVTSYWLGLNEFAD 98
L + + + ++HG+ Y ++E+ +R +F++N + ID N EVT + L +N+F D
Sbjct: 20 LRQQWRDFKAEHGRRYASVQEERYRLSVFEQNQQFIDDHNARFENGEVT-FTLQMNQFGD 78
Query: 99 MSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCW 158
M+ EEF G P+RR P+A + LPK VDWR KGAVTPVK+Q CGSCW
Sbjct: 79 MTSEEFTATMNGFL-NVPSRR-PTAILRADPDETLPKEVDWRTKGAVTPVKDQKQCGSCW 136
Query: 159 AFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNN-GCNGGLMDYAFKYIVASGGLHKE 217
AFST ++EG + + G L SLSEQ L+DC F N GC GGLMD AF+YI A+ G+ E
Sbjct: 137 AFSTTGSLEGQHFLKDGKLVSLSEQNLVDCSDKFGNMGCMGGLMDQAFRYIKANKGIDTE 196
Query: 218 EDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAIEASGTDFQ 276
+ YPY ++G C + T +GY DV E +L KA+A P+SVAI+AS FQ
Sbjct: 197 DSYPYEAQDGKCRFDASNVG-ATDTGYVDVEHGSESALKKAVATIGPISVAIDASQPSFQ 255
Query: 277 FYSGGVF--TGPCGAELDHGVAAVGYGKS-KGSDYIIVKNSWGPKWGERGYIRMKRNTGK 333
FY GV+ G LDHGV AVGYG++ KG Y +VKNSW WG +GYI+M R+
Sbjct: 256 FYHDGVYYEEGCSSTMLDHGVLAVGYGETEKGEAYWLVKNSWNTSWGNKGYIQMSRD--- 312
Query: 334 PEGLCGINKMASIPL 348
+ CGI AS PL
Sbjct: 313 KKNNCGIASQASYPL 327
>gi|413953046|gb|AFW85695.1| thiol protease SEN102 [Zea mays]
Length = 382
Score = 248 bits (632), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 155/364 (42%), Positives = 206/364 (56%), Gaps = 35/364 (9%)
Query: 2 AFFSHSKLLLLSLSLSL---FACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKT 58
++ H + + + S SL FACS L + G + T L+E F++W +++ +T
Sbjct: 20 SYHIHHNMTMATASASLALMFACSLL-----LAGTAFSDDTIAIPLLERFKAWQAEYNRT 74
Query: 59 YKCIEEKLHRFEIFKENLKHIDQRNKEVT--SYWLGLNEFADMSHEEFKNKYLGLKPQFP 116
Y EE RF I+ EN++ I N+ T SY LG N+F D++ EEFK+ YL + P
Sbjct: 75 YATPEEFQQRFMIYSENVRFIKTMNQLSTGSSYELGENQFTDLTEEEFKDTYLMKLDEQP 134
Query: 117 TRRQPSAEFSYRDVKAL--------------PKSVDWRKKGAVTPVKNQGSCGSCWAFST 162
P+AE V + P SVDWR KGAVT VK+Q CGSCWAF+T
Sbjct: 135 ----PAAEAMPPTVGTMSTAGMSNGNNTGEAPNSVDWRTKGAVTRVKDQQQCGSCWAFAT 190
Query: 163 VAAVEGINQIVSGNLTSLSEQELIDCDTSFN-NGCNGGLMDYAFKYIVASGGLHKEEDYP 221
VA++EG++QI +G L SLSEQE++DCD N NGC GG A +++ +GGL E DYP
Sbjct: 191 VASIEGVHQIKTGRLVSLSEQEIVDCDRGGNDNGCRGGSPRSAMEWVTRNGGLTTESDYP 250
Query: 222 YLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGG 281
Y+ + C K I GYQ V N+E L +A+A QPV+V ++AS FQFY G
Sbjct: 251 YVGSQRQCMSGKLGHHAARIRGYQAVQRNNEAELERAVAGQPVAVFVDAS-RAFQFYKSG 309
Query: 282 VFTGPC-GAELDHGVAAVGYGK----SKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEG 336
VF+GPC ++H V VGYG S G Y IVKNSWG WGE GY+RM R EG
Sbjct: 310 VFSGPCDTTTVNHVVTVVGYGSTGSDSGGRKYWIVKNSWGQGWGENGYVRMARRVRAREG 369
Query: 337 LCGI 340
+C I
Sbjct: 370 MCAI 373
>gi|307175095|gb|EFN65237.1| Cathepsin L [Camponotus floridanus]
Length = 372
Score = 248 bits (632), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 146/308 (47%), Positives = 181/308 (58%), Gaps = 16/308 (5%)
Query: 53 SKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEV----TSYWLGLNEFADMSHEEFKNKY 108
+ H K YK E+ +R +IF +N + I + N++ +Y LG+N++ DM H E N
Sbjct: 68 THHKKVYKSPIEEGYRMKIFLDNKRKIVEHNRKYEMKEVNYKLGMNKYGDMLHHELINTL 127
Query: 109 LGLKPQFPTRRQP--SAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAV 166
G + A F LPKSVDWRKKGAVT +K+QG CGSCWAFS+ A+
Sbjct: 128 NGFNKSVTVSEEQLIGATFIEPANVELPKSVDWRKKGAVTAIKDQGQCGSCWAFSSTGAL 187
Query: 167 EGINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKEEDYPYLME 225
EG + SG L SLSEQ LIDC + NNGCNGGLMDYAF+YI + GL E+ YPY E
Sbjct: 188 EGQHFRQSGVLVSLSEQNLIDCSGKYGNNGCNGGLMDYAFRYIKENKGLDTEKSYPYEAE 247
Query: 226 EGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAIEASGTDFQFYSGGVFT 284
C + + G+ D+PE DE L A+A P+SVAI+AS F FYS GV+
Sbjct: 248 NDQCRYNPKNSGASDV-GFVDIPEGDEDKLKAAVATIGPISVAIDASHESFHFYSEGVYY 306
Query: 285 GP-CG-AELDHGVAAVGYGKSKGS--DYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGI 340
P C A LDHGV VGYG G+ DY +VKNSWG WGE+GYI+M RN E CGI
Sbjct: 307 EPECSPANLDHGVLIVGYGTDSGTGEDYWLVKNSWGETWGEKGYIKMARNK---ENHCGI 363
Query: 341 NKMASIPL 348
AS PL
Sbjct: 364 ASSASYPL 371
>gi|195334204|ref|XP_002033774.1| GM21500 [Drosophila sechellia]
gi|194125744|gb|EDW47787.1| GM21500 [Drosophila sechellia]
Length = 341
Score = 248 bits (632), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 143/323 (44%), Positives = 197/323 (60%), Gaps = 21/323 (6%)
Query: 42 DKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEV----TSYWLGLNEFA 97
D ++E + ++ +H K Y+ E+ R +IF EN I + N+ S+ L +N++A
Sbjct: 23 DVVMEEWHTFKLEHRKNYQDDTEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYA 82
Query: 98 DMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVK-------ALPKSVDWRKKGAVTPVKN 150
D+ H EF+ G + + + E S++ V LPKSVDWR KGAVT VK+
Sbjct: 83 DLLHHEFRQLMNGFNYTLHKQLRAADE-SFKGVTFISPAHVTLPKSVDWRTKGAVTAVKD 141
Query: 151 QGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIV 209
QG CGSCWAFS+ A+EG + SG L SLSEQ L+DC T + NNGCNGGLMD AF+YI
Sbjct: 142 QGHCGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIK 201
Query: 210 ASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAI 268
+GG+ E+ YPY + +C K + T G+ D+P+ DE+ + +A+A PV+VAI
Sbjct: 202 DNGGIDTEKSYPYEAIDDSCHFNKGTIG-ATDRGFTDIPQGDEKKMAEAVATVGPVAVAI 260
Query: 269 EASGTDFQFYSGGVFTGP-CGAE-LDHGVAAVGYGKSK-GSDYIIVKNSWGPKWGERGYI 325
+AS FQFYS GV+ P C A+ LDHGV VG+G + G DY +VKNSWG WG++G+I
Sbjct: 261 DASHESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESGEDYWLVKNSWGTTWGDKGFI 320
Query: 326 RMKRNTGKPEGLCGINKMASIPL 348
+M RN E CGI +S PL
Sbjct: 321 KMLRN---KENQCGIASASSYPL 340
>gi|238816977|gb|ACR56863.1| cathepsin L-like cysteine proteinase [Delia coarctata]
Length = 338
Score = 248 bits (632), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 145/348 (41%), Positives = 204/348 (58%), Gaps = 32/348 (9%)
Query: 16 LSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKEN 75
L+L A + SI D + E ++++ +H K Y E+ R +IF EN
Sbjct: 5 LALLALVAFVQAISIT----------DVIKEEWQTFKMEHRKNYLSEVEERFRMKIFNEN 54
Query: 76 LKHIDQRNKEV----TSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVK 131
I + N+ S+ LGLN++ADM H EFK G R++ A+ + +
Sbjct: 55 RHKIAKHNQLYAQGKVSFKLGLNKYADMLHHEFKETMNGYNHTM--RKELRAQEGFNGIT 112
Query: 132 -------ALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQE 184
+PK+VDWR+ GAVT VK+QG CGSCW+FS+ ++EG + +G L SLSEQ
Sbjct: 113 YISPANVQVPKAVDWRQHGAVTSVKDQGHCGSCWSFSSTGSLEGQHFRKAGVLVSLSEQN 172
Query: 185 LIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISG 243
L+DC T + NNGCNGGLMD AF+YI +GG+ E+ YPY + +C K + T +G
Sbjct: 173 LVDCSTKYGNNGCNGGLMDNAFRYIKDNGGVDTEKSYPYEGIDDSCHFNKATVG-ATDTG 231
Query: 244 YQDVPENDEQSLLKALAHQ-PVSVAIEASGTDFQFYSGGVFTGP-CGAE-LDHGVAAVGY 300
+ D+P+ DE++++KA+A PV+VAI+AS FQ YS GV+ P C ++ LDHGV VGY
Sbjct: 232 FVDIPQGDEEAMMKAVATMGPVAVAIDASNESFQLYSEGVYNDPNCSSDNLDHGVLVVGY 291
Query: 301 GKSK-GSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
G K G DY +VKNSWG WG++GYI+M RN + CGI +S P
Sbjct: 292 GTDKDGQDYWLVKNSWGTTWGDQGYIKMARN---QDNQCGIATASSFP 336
>gi|334332720|ref|XP_001367595.2| PREDICTED: cathepsin L1-like [Monodelphis domestica]
Length = 333
Score = 248 bits (632), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 138/328 (42%), Positives = 196/328 (59%), Gaps = 16/328 (4%)
Query: 28 FSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT 87
+V +PE ++D + W ++H +TY E+ R +++NLK I+ N E +
Sbjct: 12 LGLVAATPEFDQTLDSQ---WHQWKAQHRRTYAANEDGWRR-ATWEKNLKMIEMHNLEYS 67
Query: 88 ----SYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKG 143
S+ LG+N+F DM+ EEFK G +R + + + LPKSVDWR+KG
Sbjct: 68 AGKHSFQLGMNKFGDMTTEEFKQVMNGYNSNGSQKRTKGSLYREPLLAQLPKSVDWREKG 127
Query: 144 AVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTS-FNNGCNGGLMD 202
VTPVKNQG CGSCWAFS ++EG + L SLSEQ L+DC TS NNGC+GGLMD
Sbjct: 128 YVTPVKNQGQCGSCWAFSATGSLEGQWFHKTKKLVSLSEQNLVDCSTSEGNNGCSGGLMD 187
Query: 203 YAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH- 261
AF+Y+ +GG+ E+ YPYL ++ C+ + E ++G+ D+P +E++L+KA+A+
Sbjct: 188 NAFEYVKNNGGIDTEQAYPYLGQDNECK-YRAECSGANVTGFVDIPSMNERALMKAVANV 246
Query: 262 QPVSVAIEASGTDFQFYSGGVFTGP--CGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKW 319
P+SVAI+A FQFY GV+ P ++LDHGV VGYG +Y IVKNSWG +W
Sbjct: 247 GPISVAIDAGNPSFQFYESGVYYEPQCSSSQLDHGVLVVGYGSIGKDEYWIVKNSWGEEW 306
Query: 320 GERGYIRMKRNTGKPEGLCGINKMASIP 347
G++GY+ M + CGI AS P
Sbjct: 307 GKKGYVLMAKFRNNH---CGIATAASYP 331
>gi|194883222|ref|XP_001975702.1| GG20414 [Drosophila erecta]
gi|190658889|gb|EDV56102.1| GG20414 [Drosophila erecta]
Length = 341
Score = 248 bits (632), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 143/327 (43%), Positives = 199/327 (60%), Gaps = 21/327 (6%)
Query: 38 LTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEV----TSYWLGL 93
++ D ++E + ++ +H K Y+ E+ R +IF EN I + N+ S+ L +
Sbjct: 19 ISFADVVMEEWHTFKLEHRKNYQDDTEERFRLKIFNENKHKIAKHNQRYAEGKVSFKLAV 78
Query: 94 NEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVK-------ALPKSVDWRKKGAVT 146
N++AD+ H EF+ G ++ S + S++ V LPKSVDWR KGAVT
Sbjct: 79 NKYADLLHHEFRQLMNGFNYTLH-KQLRSTDDSFKGVTFISPAHVTLPKSVDWRTKGAVT 137
Query: 147 PVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAF 205
VK+QG CGSCWAFS+ A+EG + SG L SLSEQ L+DC T + NNGCNGGLMD AF
Sbjct: 138 AVKDQGHCGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAF 197
Query: 206 KYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPV 264
+YI +GG+ E+ YPY + +C K + T G+ D+P+ DE+ + +A+A PV
Sbjct: 198 RYIKDNGGIDTEKSYPYEAIDDSCHFNKGAIG-ATDRGFTDIPQGDEKKMAEAVATVGPV 256
Query: 265 SVAIEASGTDFQFYSGGVFTGP-CGAE-LDHGVAAVGYGKSK-GSDYIIVKNSWGPKWGE 321
+VAI+AS FQFYS GV+ P C A+ LDHGV VGYG + G DY +VKNSWG WG+
Sbjct: 257 AVAIDASHESFQFYSEGVYNEPQCDAQNLDHGVLVVGYGTDESGDDYWLVKNSWGTTWGD 316
Query: 322 RGYIRMKRNTGKPEGLCGINKMASIPL 348
+G+I+M RN + CGI +S PL
Sbjct: 317 KGFIKMLRN---KDNQCGIASASSYPL 340
>gi|294885991|ref|XP_002771503.1| thiolproteinase SmTP1, putative [Perkinsus marinus ATCC 50983]
gi|239875207|gb|EER03319.1| thiolproteinase SmTP1, putative [Perkinsus marinus ATCC 50983]
Length = 337
Score = 248 bits (632), Expect = 5e-63, Method: Compositional matrix adjust.
Identities = 137/319 (42%), Positives = 191/319 (59%), Gaps = 26/319 (8%)
Query: 48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNK 107
F + KHGK+Y +E++ R IF +NL +I++ N + SY LG+NE+ D++ EEF
Sbjct: 27 FIGFQKKHGKSYDNKDEEMKRAAIFHDNLNYIEEVNAQNLSYKLGVNEYTDLTLEEFAAL 86
Query: 108 YL-------GLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAF 160
L G+ F P+ LP SVDWRKKG + PVK+QG CGSCWAF
Sbjct: 87 KLSSTDMSEGMGDGFVAGAGPT-------TTTLPTSVDWRKKGVLNPVKDQGYCGSCWAF 139
Query: 161 STVAAVEGINQIVSGNLTSLSEQELIDCDTSFNN-GCNGGLMDYAFKYIVASGGLHKEED 219
S + A+E I +G L SLSEQ+L+DC ++ N GCNGGLMD AF+YI A+ G+ KE
Sbjct: 140 SAIGALEPRYAIATGKLLSLSEQQLVDCAGAYGNEGCNGGLMDKAFEYIKAT-GVDKEST 198
Query: 220 YPYLMEEGTC----EDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDF 275
YPY+ + TC E+K + + V ++G Q + + E++L++ +A PVS+A+ A+ F
Sbjct: 199 YPYVGSDETCQATVENKTDGLPVGEVTGNQMLHQT-EKALMEGVAAAPVSIAMYANLQSF 257
Query: 276 QFYSGGVFTGP-C---GAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNT 331
Q Y GV++ P C G +DHGV AVGYG G DY I++NSWG WG+ GY+ +KR
Sbjct: 258 QHYKSGVYSDPNCNAKGGSIDHGVVAVGYGTENGQDYFIIRNSWGRSWGQDGYVYLKRGV 317
Query: 332 GKPEGLCGINKMASIPLKK 350
G G C I K +P K
Sbjct: 318 GS-FGQCNIYKYMCVPTLK 335
>gi|38147395|gb|AAR12010.1| cathepsin L-like proteinase [Triatoma infestans]
Length = 328
Score = 247 bits (631), Expect = 5e-63, Method: Compositional matrix adjust.
Identities = 139/324 (42%), Positives = 190/324 (58%), Gaps = 15/324 (4%)
Query: 33 YSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEV----TS 88
++ H D E + ++ ++ GK+YK E+L R ++KEN + ID+ NK S
Sbjct: 11 FAISHTALHDYFPEEWLAFKAQFGKSYKNSFEELFRMNVYKENQRKIDEHNKRYENGEVS 70
Query: 89 YWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPV 148
Y L +N F D+ EFK K + ++Q S E LP VDWR+KGAVTPV
Sbjct: 71 YKLKMNHFGDLMQHEFKALN---KLKRSAKQQNSGEVFRATGGKLPAKVDWRQKGAVTPV 127
Query: 149 KNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKY 207
K+ G CGSCWAFS+ ++ G + + L SLSEQ+L+DC ++ N+GC+GG+M AF+Y
Sbjct: 128 KDPGQCGSCWAFSSTGSLGGQLFLKNKKLVSLSEQQLVDCSGNYGNDGCDGGIMVQAFQY 187
Query: 208 IVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSV 266
I +GG+ E YPY E+ C K + + T GY D+ + DE +L +A+A P+SV
Sbjct: 188 IKGNGGIDTEGSYPYEAEDDKCRYKTKSV-AGTDKGYVDIAQGDENALKEAVAEIGPISV 246
Query: 267 AIEASGTDFQFYSGGVFTGP--CGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGY 324
AI+A FQFYS G++ P ELDHGV VGYG G DY +VKNSWGP WGE GY
Sbjct: 247 AIDAGNLSFQFYSEGIYDEPFCSNTELDHGVLVVGYGTENGQDYWLVKNSWGPSWGENGY 306
Query: 325 IRMKRNTGKPEGLCGINKMASIPL 348
I++ RN CGI MAS P+
Sbjct: 307 IKIARNHNNH---CGIASMASYPI 327
>gi|226499806|ref|NP_001151335.1| cysteine protease 1 [Zea mays]
gi|195645896|gb|ACG42416.1| cysteine protease 1 precursor [Zea mays]
Length = 258
Score = 247 bits (631), Expect = 5e-63, Method: Compositional matrix adjust.
Identities = 133/265 (50%), Positives = 173/265 (65%), Gaps = 19/265 (7%)
Query: 93 LNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSY-----RDVKALPKSVDWRKKGAVTP 147
LNEFADM+++EF Y GL+P P + A F Y D ++VDWR+KGAVT
Sbjct: 3 LNEFADMTNDEFMAMYTGLRP-VPAGAKKMAGFKYGNVTLSDADDDQQTVDWRQKGAVTG 61
Query: 148 VKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKY 207
+K+Q CG CWAF+ VAAVEGI+QI +GNL SLSEQ+++DCDT NNGCNGG +D AF+Y
Sbjct: 62 IKDQRQCGCCWAFAAVAAVEGIHQITTGNLVSLSEQQVLDCDTDGNNGCNGGYIDNAFQY 121
Query: 208 IVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVA 267
IV +GGL E+ YPY + C+ + V ISGYQDVP DE +L A+A+QPVSVA
Sbjct: 122 IVGNGGLATEDAYPYTAAQAMCQSVQ---PVAAISGYQDVPSGDEAALAAAVANQPVSVA 178
Query: 268 IEASGTDFQFYSGGVFTGP-CGA--ELDHGVAAVGYGKSK-GSDYIIVKNSWGPKWGERG 323
I+A +FQ Y GGV T C L+H V AVGYG ++ G+ Y ++KN WG WGE G
Sbjct: 179 IDAH--NFQLYGGGVMTAASCSTPPNLNHAVTAVGYGTAEDGTPYWLLKNQWGQNWGEGG 236
Query: 324 YIRMKRNTGKPEGLCGINKMASIPL 348
Y+R++R CG+ + AS P+
Sbjct: 237 YLRLERGANA----CGVAQQASYPV 257
>gi|66823245|ref|XP_644977.1| cysteine proteinase 5 precursor [Dictyostelium discoideum AX4]
gi|166201986|sp|P54640.2|CYSP5_DICDI RecName: Full=Cysteine proteinase 5; Flags: Precursor
gi|60473097|gb|EAL71045.1| cysteine proteinase 5 precursor [Dictyostelium discoideum AX4]
Length = 344
Score = 247 bits (631), Expect = 5e-63, Method: Compositional matrix adjust.
Identities = 144/322 (44%), Positives = 180/322 (55%), Gaps = 29/322 (9%)
Query: 48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNK 107
F WM H K+Y EE R+ IFK N+ ++ Q N + + LGLN FAD+++EE++N
Sbjct: 30 FTDWMITHQKSYTS-EEFGARYNIFKANMDYVQQWNSKGSETVLGLNNFADITNEEYRNT 88
Query: 108 YLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVE 167
YLG K + E + A K DWR +GAVTPVKNQG CG CW+FST + E
Sbjct: 89 YLGTKFDASSLIGTQEEKVFTTSSAASK--DWRSEGAVTPVKNQGQCGGCWSFSTTGSTE 146
Query: 168 GINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEG 227
G + G L SLSEQ LIDC T N+GC+GGLM YAF+YI+ + G+ E YPY E G
Sbjct: 147 GAHFQSKGELVSLSEQNLIDCSTE-NSGCDGGLMTYAFEYIINNNGIDTESSYPYKAENG 205
Query: 228 TCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGP- 286
CE K E T+S Y+ V E SL A+ PVSVAI+AS FQ Y+ G++ P
Sbjct: 206 KCEYKSEN-SGATLSSYKTVTAGSESSLESAVNVNPVSVAIDASHQSFQLYTSGIYYEPE 264
Query: 287 CGAE-LDHGVAAVGY-------------------GKSKGSDYIIVKNSWGPKWGERGYIR 326
C +E LDHGV AVGY S ++Y IVKNSWG WG GYI
Sbjct: 265 CSSENLDHGVLAVGYGSGSGSSSGQSSGQSSGNLSASSSNEYWIVKNSWGTSWGIEGYIL 324
Query: 327 MKRNTGKPEGLCGINKMASIPL 348
M RN + CGI AS P+
Sbjct: 325 MSRN---RDNNCGIASSASFPV 343
>gi|326494040|dbj|BAJ85482.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 355
Score = 247 bits (631), Expect = 5e-63, Method: Compositional matrix adjust.
Identities = 135/313 (43%), Positives = 177/313 (56%), Gaps = 18/313 (5%)
Query: 49 ESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT-SYWLGLNEFADMSHEEFKNK 107
E WM+K+G+ Y EKL R E+F N +HID N+ +Y LGLN F+D+++EEF
Sbjct: 42 ERWMAKYGRVYADAAEKLRRQEVFAANARHIDAVNRAGNRTYTLGLNHFSDLTNEEFAQT 101
Query: 108 YLGLKPQ------FPTRRQPSAEFSYRD--VKALPKSVDWRKKGAVTPVKNQGSCGSCWA 159
+LG + Q P P+A + D +++ P SVDWR +GAVTPVK+QG CGSCWA
Sbjct: 102 HLGYRHQPGPGGLRPEDSSPAAAVNVTDAQLQSTPDSVDWRARGAVTPVKHQGHCGSCWA 161
Query: 160 FSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEED 219
F+ VAA EG+ QI +GNL S+SEQ+++DC T + C G ++ A YI ASGGL E
Sbjct: 162 FAAVAATEGLVQIATGNLISMSEQQVLDC-TGGTSSCKSGYVNAALTYITASGGLQTEAA 220
Query: 220 YPYLMEEGTCEDKKEEMEVVTISGYQD--VPENDEQSLLKALAHQPVSVAIEASGTDFQF 277
Y Y E+G C G + DE +L +A QPV+VA+EA DF
Sbjct: 221 YAYSAEQGACRSGGASPNSAAAVGVHRSAMLNGDEGALQVLVAGQPVAVAVEAE-PDFHH 279
Query: 278 YSGGVFTG--PCGAELDHGVAAVGYGK-SKGSDYIIVKNSWGPKWGERGYIRMKRNTGKP 334
Y GV+ G CG +L H V VGYG G Y +VKN WG WGE GY+R+ R G
Sbjct: 280 YKSGVYVGSPSCGQKLHHAVTVVGYGADGDGQGYWVVKNQWGAGWGEVGYMRLTRGNGGN 339
Query: 335 EGLCGINKMASIP 347
CG+ A P
Sbjct: 340 N--CGMATHAYYP 350
>gi|42572491|ref|NP_974341.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|332642714|gb|AEE76235.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 290
Score = 247 bits (631), Expect = 6e-63, Method: Compositional matrix adjust.
Identities = 122/241 (50%), Positives = 163/241 (67%), Gaps = 4/241 (1%)
Query: 47 LFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNK-EVTSYWLGLNEFADMSHEEFK 105
++E W+ ++ K Y + EK RF+IFK+NLK +D+ N ++ +GL FAD+++EEF+
Sbjct: 43 MYEQWLVENRKNYNGLGEKERRFKIFKDNLKFVDEHNSVPDRTFEVGLTRFADLTNEEFR 102
Query: 106 NKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAA 165
YL K + + + Y++ LP VDWR GAV VK+QG+CGSCWAFS V A
Sbjct: 103 AIYLRKKMERTKDSVKTERYLYKEGDVLPDEVDWRANGAVVSVKDQGNCGSCWAFSAVGA 162
Query: 166 VEGINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKEEDYPYLM 224
VEGINQI +G L SLSEQEL+DCD F N GC+GG+M+YAF++I+ +GG+ ++DYPY
Sbjct: 163 VEGINQITTGELISLSEQELVDCDRGFVNAGCDGGIMNYAFEFIMKNGGIETDQDYPYNA 222
Query: 225 EE-GTCE-DKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGV 282
+ G C DK VVTI GY+DVP +DE+SL KA+AHQPVSVAIEAS FQ Y
Sbjct: 223 NDLGLCNADKNNNTRVVTIDGYEDVPRDDEKSLKKAVAHQPVSVAIEASSQAFQLYKSVN 282
Query: 283 F 283
F
Sbjct: 283 F 283
>gi|3097321|dbj|BAA25899.1| Bd 30K [Glycine max]
gi|84371705|gb|ABC56139.1| 34 kDa maturing seed protein [Glycine max]
gi|195957142|gb|ACG59282.1| major allergen Gly m Bd 30K [Glycine max]
gi|223452512|gb|ACM89583.1| maturing seed protein [Glycine max]
gi|226432468|gb|ACO55749.1| Gly m Bd 30K allergen [Glycine max]
gi|320090153|gb|ADW08728.1| P34 allergen [Glycine max]
gi|320090155|gb|ADW08729.1| P34 allergen [Glycine max]
gi|320090157|gb|ADW08730.1| P34 allergen [Glycine max]
gi|320090159|gb|ADW08731.1| P34 allergen [Glycine max]
Length = 379
Score = 247 bits (630), Expect = 6e-63, Method: Compositional matrix adjust.
Identities = 143/344 (41%), Positives = 199/344 (57%), Gaps = 32/344 (9%)
Query: 29 SIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRN---KE 85
SI+ T+ ++ LF+ W S+HG+ Y EE+ R EIFK NL +I N K
Sbjct: 25 SILDLDLTKFTTQKQVSSLFQLWKSEHGRVYHNHEEEAKRLEIFKNNLNYIRDMNANRKS 84
Query: 86 VTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKA-------LPKSVD 138
S+ LGLN+FAD++ +EF KYL Q P + + + +K P S D
Sbjct: 85 PHSHRLGLNKFADITPQEFSKKYL----QAPKDVSQQIKMANKKMKKEQYSCDHPPASWD 140
Query: 139 WRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNG 198
WRKKG +T VK QG CGS WAFS A+E + I +G+L SLSEQEL+DC + GC
Sbjct: 141 WRKKGVITQVKYQGGCGSGWAFSATGAIEAAHAIATGDLVSLSEQELVDC-VEESEGCYN 199
Query: 199 GLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPEND------- 251
G +F++++ GG+ ++DYPY +EG C+ K + + VTI GY+ + +D
Sbjct: 200 GWHYQSFEWVLEHGGIATDDDYPYRAKEGRCKANKIQ-DKVTIDGYETLIMSDESTESET 258
Query: 252 EQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTG-----PCGAELDHGVAAVGYGKSKGS 306
EQ+ L A+ QP+SV+I+A DF Y+GG++ G P G ++H V VGYG + G
Sbjct: 259 EQAFLSAILEQPISVSIDAK--DFHLYTGGIYDGENCTSPYG--INHFVLLVGYGSADGV 314
Query: 307 DYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLKK 350
DY I KNSWG WGE GYI ++RNTG G+CG+N AS P K+
Sbjct: 315 DYWIAKNSWGEDWGEDGYIWIQRNTGNLLGVCGMNYFASYPTKE 358
>gi|195484843|ref|XP_002090843.1| GE12574 [Drosophila yakuba]
gi|194176944|gb|EDW90555.1| GE12574 [Drosophila yakuba]
Length = 341
Score = 247 bits (630), Expect = 6e-63, Method: Compositional matrix adjust.
Identities = 142/323 (43%), Positives = 197/323 (60%), Gaps = 21/323 (6%)
Query: 42 DKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEV----TSYWLGLNEFA 97
D ++E + ++ +H K Y+ E+ R +IF EN I + N+ S+ L +N++A
Sbjct: 23 DVVMEEWHTFKLEHRKNYQDDTEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYA 82
Query: 98 DMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVK-------ALPKSVDWRKKGAVTPVKN 150
D+ H EF+ G + + + + S++ V LPKSVDWR KGAVT VK+
Sbjct: 83 DLLHHEFRQLMNGFNYTLHKQLRATDD-SFKGVTFISPAHVTLPKSVDWRSKGAVTAVKD 141
Query: 151 QGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIV 209
QG CGSCWAFS+ A+EG + SG L SLSEQ L+DC T + NNGCNGGLMD AF+YI
Sbjct: 142 QGHCGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIK 201
Query: 210 ASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAI 268
+GG+ E+ YPY + +C K + T G+ D+P+ DE+ + +A+A PVSVAI
Sbjct: 202 DNGGIDTEKSYPYEAIDDSCHFNKGTIG-ATDRGFTDIPQGDEKKMAEAVATVGPVSVAI 260
Query: 269 EASGTDFQFYSGGVFTGP-CGAE-LDHGVAAVGYGKSK-GSDYIIVKNSWGPKWGERGYI 325
+AS FQFYS GV+ P C A+ LDHGV VG+G + G DY +VKNSWG WG++G+I
Sbjct: 261 DASHESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESGDDYWLVKNSWGTTWGDKGFI 320
Query: 326 RMKRNTGKPEGLCGINKMASIPL 348
+M RN + CGI +S PL
Sbjct: 321 KMLRN---KDNQCGIASASSYPL 340
>gi|357518983|ref|XP_003629780.1| Cysteine proteinase [Medicago truncatula]
gi|355523802|gb|AET04256.1| Cysteine proteinase [Medicago truncatula]
Length = 364
Score = 247 bits (630), Expect = 6e-63, Method: Compositional matrix adjust.
Identities = 148/357 (41%), Positives = 212/357 (59%), Gaps = 29/357 (8%)
Query: 1 MAFFSHSKLLLLSLSLSLFACSSLAHDFSIVGYSP-EHLTSMDKLIELFESWMSKHGKTY 59
M F S L+L+S++ FA SS ++SI + + +S +++ ELF+ W +HG+ Y
Sbjct: 22 MTKFILSFLILISITCLSFALSS---EYSISSHGKLDKFSSDEEVFELFQMWKKEHGRDY 78
Query: 60 KCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYL-GLKPQFPTR 118
EE+ +++ + K T + L LN+FADMS EEF YL ++ Q P+
Sbjct: 79 ANSEEE------------NMNAKRKSQTQHRLSLNKFADMSPEEFSKTYLPKIEMQVPSN 126
Query: 119 RQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLT 178
R + D + LP SVDWR+KGAVT V++QG C S WAFS A+EG+N+IV+GNL
Sbjct: 127 RDNAKLKDDDDCENLPTSVDWREKGAVTEVRDQGDCQSHWAFSVTGAIEGLNKIVTGNLI 186
Query: 179 SLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEV 238
+LS QEL+DCD + + GC GG AF Y++ +GG+ E +YPYL + GTC K+ +V
Sbjct: 187 NLSAQELVDCDPA-SKGCAGGFYFNAFGYVIENGGIDTEANYPYLAKNGTC--KENANKV 243
Query: 239 VTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGP-CGAELDHGVAA 297
V+I V + E++LL + QPVSV+++A+G QFY+GGV+ G C E +
Sbjct: 244 VSIDNLL-VLDGTEEALLCRTSKQPVSVSLDATG--LQFYAGGVYGGENCKKESRNANLV 300
Query: 298 ---VGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGK--PEGLCGINKMASIPLK 349
VGY G DY IVKNSWG WGE+GY+ +KRN + P G+C IN P+K
Sbjct: 301 GLIVGYDSVNGEDYWIVKNSWGKDWGEKGYLFIKRNVFEDWPFGVCAINAAVGYPVK 357
>gi|262410743|gb|ACY66807.1| cathepsin L [Aphis gossypii]
Length = 341
Score = 247 bits (630), Expect = 7e-63, Method: Compositional matrix adjust.
Identities = 150/356 (42%), Positives = 201/356 (56%), Gaps = 35/356 (9%)
Query: 10 LLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESW---MSKHGKTYKCIEEKL 66
+++ L L +FA SS++ +++++IE E W ++ K Y+ ++E+
Sbjct: 3 VVIVLGLVVFAISSVSS------------INLNEVIE--EEWSLFKAQFKKIYEDVKEEA 48
Query: 67 HRFEIFKENLKHIDQRNKEVTS----YWLGLNEFADMSHEEFKNKYLGLKPQFPT----- 117
R +++ +N I + NK + Y L +N F D+ E+K G KP
Sbjct: 49 FRKKVYLDNKLKIARHNKLYETGEETYALEMNHFGDLMQHEYKKMMNGFKPSLAGGDKNF 108
Query: 118 RRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNL 177
+ F + +PK++DWRKKG VTPVKNQG CGSCW+FS ++EG + +G L
Sbjct: 109 TDDDAVTFLKSENVVVPKAIDWRKKGYVTPVKNQGQCGSCWSFSATGSLEGQHFRKTGVL 168
Query: 178 TSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEM 236
SLSEQ LIDC + NNGC GGLMD AFKYI ++ GL E+ YPY E+ C E
Sbjct: 169 VSLSEQNLIDCSRKYGNNGCEGGLMDLAFKYIKSNKGLDTEKSYPYEAEDDKCRYNPEN- 227
Query: 237 EVVTISGYQDVPENDEQSLLKALAH-QPVSVAIEASGTDFQFYSGGVFTGP--CGAELDH 293
T G+ D+PE DE +L+ ALA PVS+AI+AS FQFY GVF P ELDH
Sbjct: 228 SGATDKGFVDIPEGDEDALMHALATVGPVSIAIDASSEKFQFYKKGVFYNPRCSSTELDH 287
Query: 294 GVAAVGYGKS-KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPL 348
GV AVGYG KG DY IVKNSWG WG++GYI M RN + CG+ AS PL
Sbjct: 288 GVLAVGYGTDHKGGDYWIVKNSWGKTWGDQGYIMMARN---KKNNCGVASSASYPL 340
>gi|150261413|pdb|2PNS|A Chain A, 1.9 Angstrom Resolution Crystal Structure Of A Plant
Cysteine Protease Ervatamin-C Refinement With Cdna
Derived Amino Acid Sequence
gi|150261414|pdb|2PNS|B Chain B, 1.9 Angstrom Resolution Crystal Structure Of A Plant
Cysteine Protease Ervatamin-C Refinement With Cdna
Derived Amino Acid Sequence
gi|166007115|pdb|2PRE|A Chain A, Crystal Structure Of Plant Cysteine Protease Ervatamin-C
Complexed With Irreversible Inhibitor E-64 At 2.7 A
Resolution
gi|166007116|pdb|2PRE|B Chain B, Crystal Structure Of Plant Cysteine Protease Ervatamin-C
Complexed With Irreversible Inhibitor E-64 At 2.7 A
Resolution
Length = 208
Score = 247 bits (630), Expect = 8e-63, Method: Compositional matrix adjust.
Identities = 124/217 (57%), Positives = 153/217 (70%), Gaps = 10/217 (4%)
Query: 133 LPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSF 192
LP+ +DWRKKGAVTPVKNQG CGSCWAFSTV+ VE INQI +GNL SLSEQ+L+DC+
Sbjct: 1 LPEQIDWRKKGAVTPVKNQGKCGSCWAFSTVSTVESINQIRTGNLISLSEQQLVDCNKK- 59
Query: 193 NNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDE 252
N+GC GG YA++YI+ +GG+ E +YPY +G C K +VV I GY+ VP +E
Sbjct: 60 NHGCKGGAFVYAYQYIIDNGGIDTEANYPYKAVQGPCRAAK---KVVRIDGYKGVPHCNE 116
Query: 253 QSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVK 312
+L KA+A QP VAI+AS FQ Y G+F+GPCG +L+HGV VGY K DY IV+
Sbjct: 117 NALKKAVASQPSVVAIDASSKQFQHYKSGIFSGPCGTKLNHGVVIVGYWK----DYWIVR 172
Query: 313 NSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
NSWG WGE+GYIRMKR G GLCGI ++ P K
Sbjct: 173 NSWGRYWGEQGYIRMKRVGG--CGLCGIARLPYYPTK 207
>gi|22653679|sp|Q26636.1|CATL_SARPE RecName: Full=Cathepsin L; Contains: RecName: Full=Cathepsin L
heavy chain; Contains: RecName: Full=Cathepsin L light
chain; Flags: Precursor
gi|505140|dbj|BAA03970.1| cathepsin L precursor [Sarcophaga peregrina]
Length = 339
Score = 247 bits (630), Expect = 8e-63, Method: Compositional matrix adjust.
Identities = 148/324 (45%), Positives = 193/324 (59%), Gaps = 18/324 (5%)
Query: 38 LTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEV----TSYWLGL 93
++ +D + E + ++ +H K Y E+ R +IF EN I + N+ SY LGL
Sbjct: 18 ISPLDLIKEEWHTYKLQHRKNYANEVEERFRMKIFNENRHKIAKHNQLFAQGKVSYKLGL 77
Query: 94 NEFADMSHEEFK---NKYLGLKPQFPTRRQPSAEFSYRDVK--ALPKSVDWRKKGAVTPV 148
N++ADM H EFK N Y Q R +Y +PKSVDWR+ GAVT V
Sbjct: 78 NKYADMLHHEFKETMNGYNHTLRQLMRERTGLVGATYIPPAHVTVPKSVDWREHGAVTGV 137
Query: 149 KNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKY 207
K+QG CGSCWAFS+ A+EG + +G L SLSEQ L+DC T + NNGCNGGLMD AF+Y
Sbjct: 138 KDQGHCGSCWAFSSTGALEGQHFRKAGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRY 197
Query: 208 IVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQ-PVSV 266
I +GG+ E+ YPY + +C K + T +G+ D+PE DE+ + KA+A PVSV
Sbjct: 198 IKDNGGIDTEKSYPYEGIDDSCHFNKATIG-ATDTGFVDIPEGDEEKMKKAVATMGPVSV 256
Query: 267 AIEASGTDFQFYSGGVFTGP-CGAE-LDHGVAAVGYGKSK-GSDYIIVKNSWGPKWGERG 323
AI+AS FQ YS GV+ P C + LDHGV VGYG + G DY +VKNSWG WGE+G
Sbjct: 257 AIDASHESFQLYSEGVYNEPECDEQNLDHGVLVVGYGTDESGMDYWLVKNSWGTTWGEQG 316
Query: 324 YIRMKRNTGKPEGLCGINKMASIP 347
YI+M RN CGI +S P
Sbjct: 317 YIKMARNQNNQ---CGIATASSYP 337
>gi|225719058|gb|ACO15375.1| Cathepsin L1 precursor [Caligus clemensi]
Length = 326
Score = 246 bits (629), Expect = 8e-63, Method: Compositional matrix adjust.
Identities = 142/308 (46%), Positives = 181/308 (58%), Gaps = 18/308 (5%)
Query: 51 WMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT----SYWLGLNEFADMSHEEFKN 106
W + HGK Y +E+ RF+IF+EN I Q N+E +Y LG+N F D+ H EF
Sbjct: 26 WKATHGKVYNSADEESLRFKIFQENSLMITQHNEEYRQGFHTYILGMNHFGDLLHSEFLE 85
Query: 107 KYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAV 166
+ G F F++ +P +W KGAVTPVK+QG CGSCWAFS +V
Sbjct: 86 RSNG----FQGGVSGGDVFTFDTNAPVPSYANWTAKGAVTPVKDQGKCGSCWAFSATGSV 141
Query: 167 EGINQIVSGNLTSLSEQELIDCDTSFNN-GCNGGLMDYAFKYIVASGGLHKEEDYPYLME 225
EG + L SLSEQ+L+DC N GC GGLMD AFKY +A+ G+ E+ YPY +
Sbjct: 142 EGQIFLKKKKLMSLSEQQLVDCSGDEGNLGCGGGLMDNAFKYFIANKGIANEKSYPYTAK 201
Query: 226 EGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAIEASGTDFQFYSGGVFT 284
+ C+ KK M V TIS ++DV DE L A+A+ PVSVAI+AS + FQFY GV+
Sbjct: 202 DNDCKYKK-SMSVATISSFKDVKHKDEDQLKMAVANVGPVSVAIDASSSKFQFYESGVYY 260
Query: 285 GP-CGAE-LDHGVAAVGYGKSK--GSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGI 340
C +E LDHGV AVGYG K G D+ +VKNSW WG GYI+M RN + CGI
Sbjct: 261 DENCSSEVLDHGVLAVGYGTDKKSGMDFWLVKNSWAASWGLNGYIKMARN---KDNNCGI 317
Query: 341 NKMASIPL 348
MAS P+
Sbjct: 318 ATMASYPI 325
>gi|294883340|ref|XP_002770717.1| cathepsin L, putative [Perkinsus marinus ATCC 50983]
gi|239874002|gb|EER02722.1| cathepsin L, putative [Perkinsus marinus ATCC 50983]
Length = 333
Score = 246 bits (629), Expect = 8e-63, Method: Compositional matrix adjust.
Identities = 140/298 (46%), Positives = 179/298 (60%), Gaps = 11/298 (3%)
Query: 42 DKLIEL-FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMS 100
++ +EL F + K GK Y+ EE++ R IF+ NL HI+ N + SY LG+NE AD++
Sbjct: 21 EETVELAFMGFQHKFGKNYESKEEEVKRNAIFQANLHHIEHVNAKNLSYKLGVNEHADLT 80
Query: 101 HEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAF 160
HEEF LG + TRR D LP SVDWR K ++PVKNQGSCGSCWAF
Sbjct: 81 HEEFAALKLG-TLEMSTRRDDKFVVE-ADTTQLPTSVDWRNKSVLSPVKNQGSCGSCWAF 138
Query: 161 STVAAVEGINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKEED 219
S A+E I +G L LS QEL+DC +S+ N GC GGLM A+KYI S GL +E
Sbjct: 139 SAAGALEAQYAIATGKLRPLSVQELVDCSSSYGNKGCLGGLMTNAYKYI-KSAGLDQEST 197
Query: 220 YPYLMEEGTC----EDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDF 275
YPY C E K + + ++G + + EQSL+KALA PVS+A+ A +F
Sbjct: 198 YPYKGWNKHCFRSSEKKADGIPAGEVTGSHMLAQT-EQSLMKALAAAPVSLAMYARDRNF 256
Query: 276 QFYSGGVFTG-PCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTG 332
+FY GV++ C E+DHGV AVGYG KGSDY I+KNSWG WG GY +KR G
Sbjct: 257 RFYRSGVYSSTTCNGEIDHGVVAVGYGADKGSDYFILKNSWGSSWGIGGYFYLKRGVG 314
>gi|294883334|ref|XP_002770714.1| cathepsin L, putative [Perkinsus marinus ATCC 50983]
gi|239873999|gb|EER02719.1| cathepsin L, putative [Perkinsus marinus ATCC 50983]
Length = 330
Score = 246 bits (629), Expect = 9e-63, Method: Compositional matrix adjust.
Identities = 138/309 (44%), Positives = 178/309 (57%), Gaps = 7/309 (2%)
Query: 45 IEL-FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEE 103
+EL F + K GK Y+ EE++ R IF+ NL I+Q N + SY LG+NE+AD++HEE
Sbjct: 24 VELAFMGFQHKFGKNYESKEEEVKRNAIFQANLHLIEQVNAKNLSYKLGVNEYADLTHEE 83
Query: 104 FKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTV 163
F LG P + F D LP SVDWR K ++PVK+QGSCGSCWAFS
Sbjct: 84 FAALKLGTLKMRPAEHASLSLFVSADTTQLPTSVDWRNKSVLSPVKDQGSCGSCWAFSAA 143
Query: 164 AAVEGINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKEEDYPY 222
A+E I +G L LSEQ+L+DC + NGC GG M A+KYI S GL +E YPY
Sbjct: 144 GALEAQYAIATGKLRPLSEQQLVDCSHKYGTNGCFGGFMADAYKYI-KSAGLDQESTYPY 202
Query: 223 LMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGV 282
C ++++ + + + D EQSL+KALA PVSVA+ AS F Y GV
Sbjct: 203 KGVNEPCRPREKKADGIPVRFVLDT--KTEQSLMKALADAPVSVAMYASDFLFHLYLSGV 260
Query: 283 FTG-PCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGIN 341
++ C E+DH V AVGYG +GSDY I+KNSWG WG GY +KR G G C I
Sbjct: 261 YSSTTCNGEIDHAVVAVGYGADEGSDYFILKNSWGSSWGMGGYFFLKRGVGG-HGECNIL 319
Query: 342 KMASIPLKK 350
+ +P K
Sbjct: 320 EYMVVPTLK 328
>gi|229366214|gb|ACQ58087.1| Cathepsin L precursor [Anoplopoma fimbria]
Length = 334
Score = 246 bits (629), Expect = 9e-63, Method: Compositional matrix adjust.
Identities = 144/314 (45%), Positives = 184/314 (58%), Gaps = 19/314 (6%)
Query: 48 FESWMSKHGKTYKCIEEKLHRFEIFKEN----LKHIDQRNKEVTSYWLGLNEFADMSHEE 103
F +W + G++Y E+ R EI+ N L H ++ + SY LG+ FADM +EE
Sbjct: 26 FHAWKLQFGRSYNSPAEEAQRKEIWLSNRRLVLVHNIMADQGIKSYRLGMTYFADMENEE 85
Query: 104 FKNKY----LG-LKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCW 158
+K + LG P R+ SA + LP SVDWR+KG VT VK+Q CGSCW
Sbjct: 86 YKRQISQGCLGSFNASLP--RRGSAYLRLPEGADLPNSVDWREKGYVTDVKDQKQCGSCW 143
Query: 159 AFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNN-GCNGGLMDYAFKYIVASGGLHKE 217
AFST ++EG +G L SLSEQ+L+DC + N GC GGLMD AF+YI A+GG+ E
Sbjct: 144 AFSTTGSLEGQTFRKTGKLVSLSEQQLVDCSGDYGNEGCMGGLMDSAFRYIQANGGIDTE 203
Query: 218 EDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAIEASGTDFQ 276
+ YPY E+G C + T +GY DV + DE +L +ALA PVSVAI+AS + FQ
Sbjct: 204 DSYPYEAEDGQCRYNSANIG-ATCTGYVDVKQGDEDALKEALATIGPVSVAIDASHSSFQ 262
Query: 277 FYSGGVFTGP--CGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKP 334
Y GV+ P +ELDHGV AVGYG G DY +VKNSWG WG +GYI M RN
Sbjct: 263 LYESGVYDEPECSSSELDHGVLAVGYGSDNGHDYWLVKNSWGLGWGNKGYIMMTRN---K 319
Query: 335 EGLCGINKMASIPL 348
CGI +S PL
Sbjct: 320 HNQCGIATASSYPL 333
>gi|226531284|ref|NP_001147086.1| thiol protease SEN102 precursor [Zea mays]
gi|195607128|gb|ACG25394.1| thiol protease SEN102 precursor [Zea mays]
Length = 356
Score = 246 bits (629), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 155/355 (43%), Positives = 203/355 (57%), Gaps = 33/355 (9%)
Query: 9 LLLLSLSLSL-FACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLH 67
+ S SL+L FACS L + G + T L+E F++W +++ +TY EE
Sbjct: 3 MATASASLALMFACSLL-----LAGTAFSDDTIAIPLLERFKAWQAEYNRTYATPEEFQQ 57
Query: 68 RFEIFKENLKHIDQRNKEVT--SYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEF 125
RF I+ EN++ I N+ T SY LG N+F D++ EEFK+ YL + P P+AE
Sbjct: 58 RFMIYSENVRFIKTMNQLSTGSSYELGENQFTDLTEEEFKDTYLMKLDEQP----PAAEA 113
Query: 126 SYRDVKAL--------------PKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQ 171
V + P SVDWR KGAVT VK+Q CGSCWAF+TVA++EG++Q
Sbjct: 114 MGPTVGTMSTAGMSNGNNTGEAPNSVDWRTKGAVTRVKDQQQCGSCWAFATVASIEGVHQ 173
Query: 172 IVSGNLTSLSEQELIDCDTSFN-NGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCE 230
I +G L SLSEQE++DCD N NGC GG A +++ +GGL E DYPY+ + C
Sbjct: 174 IKTGRLVSLSEQEIVDCDRGGNDNGCRGGSPRSAMEWVTRNGGLTTESDYPYVGSQRQCM 233
Query: 231 DKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPC-GA 289
K I GYQ V N+E L +A+A +PV+V I+AS FQFY GVF+GPC
Sbjct: 234 SGKLGHHAARIRGYQAVQRNNEAELERAVAERPVAVFIDAS-RAFQFYKSGVFSGPCDTT 292
Query: 290 ELDHGVAAVGYGK----SKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGI 340
++H V VGYG S G Y IVKNSWG WGE GY+RM R EG+C I
Sbjct: 293 TVNHVVTVVGYGSTGSDSGGRKYWIVKNSWGQGWGENGYVRMARRVRAREGMCAI 347
>gi|387015022|gb|AFJ49630.1| Cathepsin L1-like [Crotalus adamanteus]
Length = 338
Score = 246 bits (628), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 145/312 (46%), Positives = 186/312 (59%), Gaps = 17/312 (5%)
Query: 50 SWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT----SYWLGLNEFADMSHEEFK 105
SW S H K Y EE R I+++NLK I+ N + + SY LG+N F DM++EEF+
Sbjct: 30 SWKSWHSKKYHEKEEGWRRM-IWEKNLKMIELHNLDHSLGKHSYRLGMNHFGDMTNEEFR 88
Query: 106 NKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAA 165
G K R+ ++F + PKSVDWR+KG VTPVK+QG CGSCWAFS A
Sbjct: 89 QVMNGFKQSRSQRKYKGSQFLEPNFLQAPKSVDWREKGYVTPVKDQGQCGSCWAFSATGA 148
Query: 166 VEGINQIVSGNLTSLSEQELIDCD-TSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLM 224
+EG + +G L SLSEQ LIDC N GCNGGLMD AF+YI + G+ EE YPY+
Sbjct: 149 LEGQHFRKTGKLVSLSEQNLIDCSGPEGNQGCNGGLMDQAFQYIKDNNGIDSEESYPYIG 208
Query: 225 EEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAIEASGTDFQFYSGGVF 283
++ K E +G+ D+PE E++L+KA+A P+SVAI+AS T FQFY GV+
Sbjct: 209 KDDEDCLYKPEYNSANDTGFVDIPEGRERALMKAVAAVGPISVAIDASHTSFQFYESGVY 268
Query: 284 TGP-CGA-ELDHGVAAVGYGKSKGSD-----YIIVKNSWGPKWGERGYIRMKRNTGKPEG 336
P C + ELDHGV VGYG D Y IVKNSW KWG++GYI M ++
Sbjct: 269 YEPQCNSEELDHGVLVVGYGYEGTDDDNKKRYWIVKNSWSEKWGDQGYIHMAKDRSNN-- 326
Query: 337 LCGINKMASIPL 348
CGI AS P+
Sbjct: 327 -CGIASAASYPM 337
>gi|308322281|gb|ADO28278.1| cathepsin L [Ictalurus furcatus]
Length = 359
Score = 246 bits (628), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 136/311 (43%), Positives = 186/311 (59%), Gaps = 14/311 (4%)
Query: 48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRN----KEVTSYWLGLNEFADMSHEE 103
F+ W K GK YK +EE+ R + ++EN K + N K + SY LG+N FADMS++E
Sbjct: 25 FQEWKQKFGKIYKSVEEESQRKKTWQENHKLVMNHNILADKGIKSYRLGMNYFADMSNQE 84
Query: 104 FKNKYLGLKPQFPTRRQPSAEFSYRDV--KALPKSVDWRKKGAVTPVKNQGSCGSCWAFS 161
++ F SA R V ALP +V+W + G VT V+ Q C SCWAFS
Sbjct: 85 YRQSVFKGCLSFNRTLNHSAATFLRQVGGPALPNTVNWTQMGYVTEVEEQKQCNSCWAFS 144
Query: 162 TVAAVEGINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKEEDY 220
A+EG +G L SLS+Q+L+DC F NNGC GGLM++AF+Y+ +GGLH EE Y
Sbjct: 145 ATGALEGQTFKKTGKLVSLSKQQLVDCSKKFGNNGCKGGLMNWAFEYVKENGGLHTEESY 204
Query: 221 PYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAIEASGTDFQFYS 279
PY ++G+C D + VT +G+ + DE +L +A+A P+SVAI+A+ T FQ Y
Sbjct: 205 PYEAKDGSCRDNLGTVG-VTCTGHVQINSEDENALQEAVATIGPISVAIDANHTSFQLYE 263
Query: 280 GGVFTGP-CG-AELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGL 337
G++ P C +++HGV AVGYG G DY ++KNSWG WG++GYI+M RN
Sbjct: 264 SGLYDEPDCSCTDMNHGVLAVGYGTDDGKDYWLIKNSWGINWGDKGYIKMSRNKNNQ--- 320
Query: 338 CGINKMASIPL 348
CGI AS PL
Sbjct: 321 CGIATAASYPL 331
>gi|443698586|gb|ELT98517.1| hypothetical protein CAPTEDRAFT_128252 [Capitella teleta]
Length = 324
Score = 246 bits (628), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 142/317 (44%), Positives = 189/317 (59%), Gaps = 20/317 (6%)
Query: 40 SMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT----SYWLGLNE 95
++D++ LF++ H KTY E + RF I++ +L I+Q N E ++ LG+NE
Sbjct: 19 ALDEMWTLFKT---THSKTYATEAEDMRRF-IWERHLNMINQHNIEADLGKHTFSLGMNE 74
Query: 96 FADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCG 155
+ D++ E Y + + + F + +PK+VDWR+KG VTPVKNQG CG
Sbjct: 75 YGDLTQHE----YAAMSGYKMAKSSVGSSFLEPENLQVPKTVDWREKGYVTPVKNQGQCG 130
Query: 156 SCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNN-GCNGGLMDYAFKYIVASGGL 214
SCWAFS+ ++EG +G L S+SEQ L+DC N GC+GGLMD AF YI + G+
Sbjct: 131 SCWAFSSTGSLEGQVFRKTGRLPSISEQNLVDCSRDEGNMGCSGGLMDNAFTYIKKNMGI 190
Query: 215 HKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAIEASGT 273
E+ YPY +G C KK + V T SG+ D+P DE +L A+A PVSVAI+AS T
Sbjct: 191 DSEKSYPYEAVDGECRYKKSD-SVTTDSGFVDIPHGDETALRTAVASVGPVSVAIDASHT 249
Query: 274 DFQFYSGGVFT-GPCGA-ELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNT 331
FQFY GV+T C + +LDHGV VGYG G DY +VKNSWG WGE GYI++ RN
Sbjct: 250 SFQFYKTGVYTEANCSSTQLDHGVLVVGYGVENGQDYWLVKNSWGASWGEAGYIKLARNH 309
Query: 332 GKPEGLCGINKMASIPL 348
G CGI AS PL
Sbjct: 310 GNQ---CGIASQASYPL 323
>gi|195056367|ref|XP_001995082.1| GH22826 [Drosophila grimshawi]
gi|193899288|gb|EDV98154.1| GH22826 [Drosophila grimshawi]
Length = 340
Score = 246 bits (628), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 149/324 (45%), Positives = 197/324 (60%), Gaps = 23/324 (7%)
Query: 42 DKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNK-----EVTSYWLGLNEF 96
D + E ++++ +H K Y+ E+ R +IF EN I + N+ EV S+ +GLN++
Sbjct: 22 DVIKEEWQTFKLEHRKQYQDETEERFRLKIFNENKHKIAKHNQLYAAGEV-SFKMGLNKY 80
Query: 97 ADMSHEEFKNKYLGLKPQFPTR-RQPSAEF------SYRDVKALPKSVDWRKKGAVTPVK 149
ADM H EF G + R A F S VK LP+SVDWR KGAVT VK
Sbjct: 81 ADMLHHEFHETMNGFNYTLHKQLRASDATFTGVTFISPEHVK-LPQSVDWRNKGAVTGVK 139
Query: 150 NQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYI 208
+QG CGSCWAFS+ A+EG + +G L SLSEQ L+DC T + NNGCNGGLMD AF+YI
Sbjct: 140 DQGHCGSCWAFSSTGALEGQHFRKTGTLISLSEQNLVDCSTKYGNNGCNGGLMDNAFRYI 199
Query: 209 VASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVA 267
+GG+ E+ YPY + +C K + T G+ D+P+ DE+ L +A+A PVSVA
Sbjct: 200 KDNGGIDTEKSYPYEGIDDSCHFNKGTIG-ATDRGFTDIPQGDEKKLAQAVATIGPVSVA 258
Query: 268 IEASGTDFQFYSGGVFTGP-CGAE-LDHGVAAVGYGKSK-GSDYIIVKNSWGPKWGERGY 324
I+AS FQFYS GV+ P C + LDHGV VGYG + G DY +VKNSWG WG++G+
Sbjct: 259 IDASHESFQFYSTGVYDEPQCDPQNLDHGVLVVGYGTDENGKDYWLVKNSWGTTWGDKGF 318
Query: 325 IRMKRNTGKPEGLCGINKMASIPL 348
I+M RN + CGI +S PL
Sbjct: 319 IKMARN---DDNQCGIATASSYPL 339
>gi|50539796|ref|NP_001002368.1| cathepsin L.1 precursor [Danio rerio]
gi|49900360|gb|AAH75887.1| Cathepsin L.1 [Danio rerio]
Length = 334
Score = 246 bits (628), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 141/312 (45%), Positives = 183/312 (58%), Gaps = 15/312 (4%)
Query: 48 FESWMSKHGKTYKCIEEKLHRFEIFKENLK----HIDQRNKEVTSYWLGLNEFADMSHEE 103
F +W K GK+Y+ EE+ HR + N K H ++ + SY LG+ FADMS+EE
Sbjct: 26 FHAWKLKFGKSYRSAEEESHRQLTWLTNRKLVLVHNMMADQGLKSYRLGMTYFADMSNEE 85
Query: 104 FKNKYLG--LKPQFPTR-RQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAF 160
++ L T+ R S F R +P +VDWR KG VT +K+Q CGSCWAF
Sbjct: 86 YRQLVFRGCLGSMNNTKARGGSTFFRLRKAAVVPDTVDWRDKGYVTDIKDQKQCGSCWAF 145
Query: 161 STVAAVEGINQIVSGNLTSLSEQELIDCDTSFNN-GCNGGLMDYAFKYIVASGGLHKEED 219
S ++EG +G L SLSEQ+L+DC S+ N GC+GGLMD AF+YI A+ GL E+
Sbjct: 146 SATGSLEGQTFRKTGKLVSLSEQQLVDCSGSYGNYGCDGGLMDQAFQYIEANKGLDTEDS 205
Query: 220 YPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAIEASGTDFQFY 278
YPY ++G C + + +GY D+ DE +L +A+A P+SVAI+A + FQ Y
Sbjct: 206 YPYEAQDGECRFNPSTVG-ASCTGYVDIASGDESALQEAVATIGPISVAIDAGHSSFQLY 264
Query: 279 SGGVFTGP--CGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEG 336
S GV+ P +ELDHGV AVGYG S G DY IVKNSWG WG +GYI M RN
Sbjct: 265 SSGVYNEPDCSSSELDHGVLAVGYGSSNGDDYWIVKNSWGLDWGVQGYILMSRNKSNQ-- 322
Query: 337 LCGINKMASIPL 348
CGI AS PL
Sbjct: 323 -CGIATAASYPL 333
>gi|326520387|dbj|BAK07452.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 349
Score = 246 bits (627), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 135/314 (42%), Positives = 184/314 (58%), Gaps = 17/314 (5%)
Query: 44 LIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKE-VTSYWLGLNEFADMSHE 102
+++ F W + H ++Y EE+L RFE+++ N+++ID N+ +Y LG N+FAD++ E
Sbjct: 41 MMDRFRQWQATHNRSYLSAEERLRRFEVYRTNVEYIDATNRRGGLTYELGENQFADLTGE 100
Query: 103 EFKNKYLGLKPQFPTRRQPSAEFSYRDVKA-------LPKSVDWRKKGAVTPVKNQGS-C 154
EF +Y G A+ + + P SVDWR KGAVTPVKNQGS C
Sbjct: 101 EFLARYAGGHTGSAITTAAEADGLWSSGGSDGSLEADPPASVDWRAKGAVTPVKNQGSQC 160
Query: 155 GSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGL 214
SCWAFS VA +E + I +G L +LSEQ+L+DCD ++ GCN G AF++I+ +GG+
Sbjct: 161 YSCWAFSAVATMESLYFIKTGKLVALSEQQLVDCD-KYDGGCNKGYYHRAFQWIMENGGI 219
Query: 215 HKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTD 274
YPY G C K VTI+G+ V +N E +L A+A QP+ VAIE
Sbjct: 220 TTAAQYPYKAVRGACSAAK---PAVTITGHLAVAKN-ELALQSAVARQPIGVAIEVP-IS 274
Query: 275 FQFYSGGVFTGPCGAELDHGVAAVGYGK-SKGSDYIIVKNSWGPKWGERGYIRMKRNTGK 333
QFY GVF+ CG ++ H V VGYG + G Y +VKNSWG WGE GYIRM+R+ G
Sbjct: 275 MQFYKSGVFSAACGIQMSHAVVTVGYGADASGLKYWLVKNSWGQTWGEAGYIRMRRDVGG 334
Query: 334 PEGLCGINKMASIP 347
GLCGI + P
Sbjct: 335 -GGLCGIALDTAYP 347
>gi|326501772|dbj|BAK02675.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 333
Score = 246 bits (627), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 140/315 (44%), Positives = 183/315 (58%), Gaps = 14/315 (4%)
Query: 43 KLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKE----VTSYWLGLNEFAD 98
KL + ++ W + K Y EE + R ++ NL+ + + N + V +YWLG+N++AD
Sbjct: 23 KLNQHWKLWKEANNKRYSDAEEHVRR-ATWEGNLQKVQEHNLQADLGVHTYWLGMNKYAD 81
Query: 99 MSHEEFKNKYLGLKPQFPTRR-QPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSC 157
M+ EF G +R Q FS+ ALP +VDWR KG VT VK+QG CGSC
Sbjct: 82 MTVTEFVKVMNGYNATMRGQRTQDRHTFSFNSKIALPDTVDWRDKGYVTDVKDQGQCGSC 141
Query: 158 WAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNN-GCNGGLMDYAFKYIVASGGLHK 216
WAFST A+EG + +G L SLSEQ L+DC N GCNGGLMD AF+YI + G+
Sbjct: 142 WAFSTTGALEGQHFKQTGKLVSLSEQNLVDCSGKQGNMGCNGGLMDQAFEYIKENNGIDT 201
Query: 217 EEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAIEASGTDF 275
E+ YPY + C K + T +G+ D+ DE +L +A+A P+SVAI+A T F
Sbjct: 202 EDSYPYEAVDNQCRFKAANVG-ATDTGFTDITSKDESALQQAVATVGPISVAIDAGHTSF 260
Query: 276 QFYSGGVFTGP-CG-AELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGK 333
Q Y GV+ P C LDHGV AVGYG G DY +VKNSWG WG++GYI+M RN
Sbjct: 261 QLYKHGVYNEPFCSQTRLDHGVLAVGYGTDSGKDYWLVKNSWGEGWGDKGYIKMTRN--- 317
Query: 334 PEGLCGINKMASIPL 348
CGI AS PL
Sbjct: 318 KRNQCGIATAASYPL 332
>gi|194352764|emb|CAQ00110.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
Length = 406
Score = 246 bits (627), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 149/385 (38%), Positives = 205/385 (53%), Gaps = 47/385 (12%)
Query: 9 LLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDK----LIELFESWMSKHGKTYKCIEE 64
+LL + L L CSS + S V S + D +++ F WM+ H ++Y E
Sbjct: 20 VLLATSCLLLAGCSSESLLTSDVLPSEQSDIDTDNHQDLMMDRFHVWMTVHNRSYSTAGE 79
Query: 65 KLHRFEIFKENLKHIDQRNKEVTS----YWLGLNEFADMSHEEFKNKYLG---------- 110
K RFE+++ N++ I+ N E + Y LG F D+++EEF Y G
Sbjct: 80 KARRFEVYRSNMRFIEAVNAEAATSGLTYELGEGPFTDLTNEEFMELYTGQILEDDQSED 139
Query: 111 --LKPQFPTRRQPSAE--------FSYRDVKA-LPKSVDWRKKGAVTPVKNQGSCGSCWA 159
Q T S + Y + A P S+DWRK+G VTPVKNQ CGSCWA
Sbjct: 140 GDDDEQIITTHAGSIDGLGTHKGATVYANFSASAPTSIDWRKRGVVTPVKNQKQCGSCWA 199
Query: 160 FSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEED 219
F TVA +EGI++I G L SLSEQ+LIDCD +NGC GGL+ AF++I +GG+
Sbjct: 200 FPTVATIEGIHKIKRGTLVSLSEQQLIDCD-YLDNGCKGGLVTRAFQWIKKNGGITSTSS 258
Query: 220 YPYLMEEGTC-EDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFY 278
Y Y G C ++K ++V G++ V N E SL+ A+A+QPV+V+I + + F Y
Sbjct: 259 YKYKAVRGRCLRNRKPAAKIV---GFRKVKSNSEVSLMNAVANQPVAVSISSHSSHFHHY 315
Query: 279 SGGVFTGPCG-AELDHGVAAVGYGKSK------------GSDYIIVKNSWGPKWGERGYI 325
GG++ GPC +L+H V VGYG+ + G+ Y IVKNSWG WG++GYI
Sbjct: 316 KGGIYNGPCSTTKLNHAVTVVGYGQQQQNGADSVHASAPGAKYWIVKNSWGTTWGDKGYI 375
Query: 326 RMKRNTGKPEGLCGINKMASIPLKK 350
MKR T G CGI PL K
Sbjct: 376 LMKRGTKHSSGQCGIATRPVFPLMK 400
>gi|348546019|ref|XP_003460476.1| PREDICTED: cathepsin L-like [Oreochromis niloticus]
gi|348546143|ref|XP_003460538.1| PREDICTED: cathepsin L-like [Oreochromis niloticus]
Length = 334
Score = 246 bits (627), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 142/312 (45%), Positives = 186/312 (59%), Gaps = 15/312 (4%)
Query: 48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRN----KEVTSYWLGLNEFADMSHEE 103
F +W K ++Y E+ HR +I+ N K + N + + SY LG+ FADM +EE
Sbjct: 26 FHAWKLKFERSYHSPSEEAHRRQIWLNNRKFVLVHNILADQGLKSYRLGMTYFADMENEE 85
Query: 104 FKNKY-LGLKPQFPTR--RQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAF 160
+K G F R+ S F + LP +VDWR KG VT VK+Q CGSCWAF
Sbjct: 86 YKRVISQGCLHSFNASLPRRGSTFFRLPEGTDLPDAVDWRDKGYVTDVKDQKQCGSCWAF 145
Query: 161 STVAAVEGINQIVSGNLTSLSEQELIDCDTSFNN-GCNGGLMDYAFKYIVASGGLHKEED 219
S ++EG + +G L SLSEQ+L+DC + N GC GGLMDYAF+YI A+GG+ EE
Sbjct: 146 SATGSLEGQHFRKTGTLVSLSEQQLVDCSGDYGNMGCMGGLMDYAFQYIQANGGIDTEES 205
Query: 220 YPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAIEASGTDFQFY 278
YPY E G C + + T +GY +V + DE +L +A+A P+SV I+AS FQFY
Sbjct: 206 YPYEAENGKCRYNPDNIG-ATSTGYTEVSQGDEDALKEAVATIGPISVGIDASQMSFQFY 264
Query: 279 SGGVFTGP-CGA-ELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEG 336
GV+ P C + ELDHGV AVGYG G+DY +VKNSWG +WG++GYI+M RN
Sbjct: 265 ESGVYNEPDCSSLELDHGVLAVGYGTEDGNDYWLVKNSWGLEWGDKGYIKMSRNKSNQ-- 322
Query: 337 LCGINKMASIPL 348
CGI AS PL
Sbjct: 323 -CGIATAASYPL 333
>gi|84660246|emb|CAI43320.1| cathepsin L [Lubomirskia baicalensis]
gi|85677150|emb|CAI46307.1| cathepsin L [Lubomirskia baicalensis]
Length = 327
Score = 246 bits (627), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 142/310 (45%), Positives = 185/310 (59%), Gaps = 12/310 (3%)
Query: 46 ELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNK--EVTSYWLGLNEFADMSHEE 103
E +ESW +HGK Y E+L R I++ N K++D+ N E + +G+N+FAD+ E
Sbjct: 20 EEWESWKKEHGKVYNSDREELTRHIIWQANRKYVDEHNAHAEKFGFTVGMNQFADLESSE 79
Query: 104 FKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTV 163
F Y G + ++ S FS + V LP SVDWR KG VT +KNQG CGSCWAFS V
Sbjct: 80 FGRLYNGYNNKPSMKKAQSKVFSTK-VGDLPTSVDWRTKGFVTAIKNQGQCGSCWAFSAV 138
Query: 164 AAVEGINQIVSGNLTSLSEQELIDCDTS-FNNGCNGGLMDYAFKYIVASGGLHKEEDYPY 222
A +EG + +G L SLSEQ L+DC T+ N GCNGGLMD AF+Y++ +GG+ E YPY
Sbjct: 139 AGLEGQHFNATGTLVSLSEQNLVDCSTAEGNQGCNGGLMDNAFQYVIKNGGIDTEASYPY 198
Query: 223 LMEEGTCEDKKEEMEVVTISGYQDV-PENDEQSLLKALAHQ-PVSVAIEASGTDFQFYSG 280
+ C+ + T SG+ D+ P E +L A+A P+SVAI+AS T FQ Y
Sbjct: 199 KAVDQKCKFNAANVG-STCSGFSDILPHKSEAALQVAVAVVGPISVAIDASHTSFQLYKS 257
Query: 281 GVFTGPCGAE--LDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLC 338
GV++ ++ LDHGV AVGY S G Y IVKNSWG WG+ GYI M RN C
Sbjct: 258 GVYSESACSQTSLDHGVTAVGYDSSSGVAYWIVKNSWGTTWGQAGYIWMSRNKNNQ---C 314
Query: 339 GINKMASIPL 348
GI AS P+
Sbjct: 315 GIATAASYPI 324
>gi|157132324|ref|XP_001655999.1| cathepsin l [Aedes aegypti]
gi|108881694|gb|EAT45919.1| AAEL002833-PA [Aedes aegypti]
Length = 339
Score = 246 bits (627), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 147/324 (45%), Positives = 193/324 (59%), Gaps = 23/324 (7%)
Query: 44 LIELF-ESWMS---KHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT----SYWLGLNE 95
L EL E W + +H K Y E+ R +I+ +N I + N+ Y L +N+
Sbjct: 19 LYELVKEEWNAFKLQHRKNYDSETEERIRLKIYVQNKHKIAKHNQRFDLGQEKYRLRVNK 78
Query: 96 FADMSHEEFKNKYLGL------KPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVK 149
+AD+ HEEF G K R + F +P +VDWRKKGAVTPVK
Sbjct: 79 YADLLHEEFVQTVNGFNRTDSKKSLKGVRIEEPVTFIEPANVEVPTTVDWRKKGAVTPVK 138
Query: 150 NQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYI 208
+QG CGSCW+FS A+EG + +G L SLSEQ L+DC + NNGCNGG+MDYAF+YI
Sbjct: 139 DQGHCGSCWSFSATGALEGQHFRKTGKLVSLSEQNLVDCSGKYGNNGCNGGMMDYAFQYI 198
Query: 209 VASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVA 267
+GG+ E+ YPY + TC + + T GY D+P+ DE++L KALA PVS+A
Sbjct: 199 KDNGGIDTEKSYPYEAIDDTCHFNPKAVG-ATDKGYVDIPQGDEEALKKALATVGPVSIA 257
Query: 268 IEASGTDFQFYSGGVFTGP-CGAE-LDHGVAAVGYGKS-KGSDYIIVKNSWGPKWGERGY 324
I+AS FQFYS GV+ P C +E LDHGV AVGYG S +G DY +VKNSWG WG++GY
Sbjct: 258 IDASHESFQFYSEGVYYEPQCDSENLDHGVLAVGYGTSEEGEDYWLVKNSWGTTWGDQGY 317
Query: 325 IRMKRNTGKPEGLCGINKMASIPL 348
++M RN + CG+ AS PL
Sbjct: 318 VKMARNR---DNHCGVATCASYPL 338
>gi|380014284|ref|XP_003691169.1| PREDICTED: cathepsin L-like [Apis florea]
Length = 345
Score = 246 bits (627), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 152/336 (45%), Positives = 191/336 (56%), Gaps = 23/336 (6%)
Query: 30 IVGYSPEHLTSMDKLIELFESWMS---KHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEV 86
I ++ H S +L+ + WM+ +H K YK E+ R +IF +N I + N
Sbjct: 9 ITIFATVHAVSFFELVN--QEWMTFKMEHKKAYKSDVEERFRMKIFMDNKHKIAKHNSNY 66
Query: 87 ----TSYWLGLNEFADMSHEEFKNKYLG----LKPQFPTRRQP-SAEFSYRDVKALPKSV 137
SY L +N++ DM H EF N G + Q + R P A F ALPK V
Sbjct: 67 EMKKVSYKLKMNKYGDMLHHEFVNILNGFNKSINTQLRSERMPIGASFIEPANVALPKKV 126
Query: 138 DWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSF-NNGC 196
DWRK+GAVTPVK+QG CGSCW+FS A+EG + +G L SLSEQ LIDC + NNGC
Sbjct: 127 DWRKEGAVTPVKDQGHCGSCWSFSATGALEGQHFRRTGVLVSLSEQNLIDCSGKYGNNGC 186
Query: 197 NGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLL 256
NGGLMD AF+YI + GL E YPY E C + + GY D+P +E+ L
Sbjct: 187 NGGLMDQAFQYIKDNKGLDTEASYPYEAENDKCRYNPANSGAIDV-GYIDIPTGNEKLLK 245
Query: 257 KALAH-QPVSVAIEASGTDFQFYSGGVFTGP-CGA-ELDHGVAAVGYGKSK-GSDYIIVK 312
A+A PVSVAI+AS FQFYS GV+ P C + ELDHGV +GYG ++ G DY +VK
Sbjct: 246 AAVATIGPVSVAIDASHQSFQFYSEGVYYEPECSSEELDHGVLVIGYGTNENGEDYWLVK 305
Query: 313 NSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPL 348
NSWG WG GYI+M RN CGI AS PL
Sbjct: 306 NSWGETWGNNGYIKMARNKLNH---CGIASSASYPL 338
>gi|170041165|ref|XP_001848344.1| cathepsin l [Culex quinquefasciatus]
gi|167864709|gb|EDS28092.1| cathepsin l [Culex quinquefasciatus]
Length = 340
Score = 246 bits (627), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 142/320 (44%), Positives = 198/320 (61%), Gaps = 22/320 (6%)
Query: 46 ELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTS----YWLGLNEFADMSH 101
E + ++ +H K Y E+ R +I+ +N I + N+ + L +N++ D+ H
Sbjct: 25 EEWNAYKLQHRKKYDSETEERLRLKIYVQNKHKIAKHNQRFEQGQEKFRLRVNKYTDLLH 84
Query: 102 EEFK------NKYLGLKPQFPTRR--QPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGS 153
EEF N+ KP + +P +V+ +PK+VDWR+KGAVTPVK+QG
Sbjct: 85 EEFVQTLNGFNRTNAKKPMLKGVKIDEPVTYIEPANVE-VPKTVDWREKGAVTPVKDQGH 143
Query: 154 CGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIVASG 212
CGSCW+FS A+EG + +G L SLSEQ L+DC T + NNGCNGG+MD+AF+YI +G
Sbjct: 144 CGSCWSFSATGALEGQHFRKTGKLVSLSEQNLVDCSTKYGNNGCNGGMMDFAFQYIKDNG 203
Query: 213 GLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQ-PVSVAIEAS 271
G+ E+ YPY + TC + + T G+ D+P+ DE++L+KA+A PVSVAI+AS
Sbjct: 204 GIDTEKAYPYEAIDDTCHYNPKAVG-ATDKGFVDIPQGDEKALMKAIATAGPVSVAIDAS 262
Query: 272 GTDFQFYSGGVFTGP-CGAE-LDHGVAAVGYGKS-KGSDYIIVKNSWGPKWGERGYIRMK 328
FQFYS GV+ P C +E LDHGV AVGYG S +G DY +VKNSWG WG++GY++M
Sbjct: 263 HESFQFYSEGVYYEPQCDSENLDHGVLAVGYGTSEEGEDYWLVKNSWGTTWGDQGYVKMA 322
Query: 329 RNTGKPEGLCGINKMASIPL 348
RN + CGI AS PL
Sbjct: 323 RNR---DNHCGIATAASYPL 339
>gi|1222694|gb|AAA92018.1| CP5 [Dictyostelium discoideum]
Length = 344
Score = 245 bits (626), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 143/322 (44%), Positives = 180/322 (55%), Gaps = 29/322 (9%)
Query: 48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNK 107
F WM H K+Y EE R+ IF N+ ++ Q N + + LGLN FAD+++EE++N
Sbjct: 30 FTDWMITHQKSYTS-EEFGARYNIFTANMDYVQQWNSKGSETVLGLNNFADITNEEYRNT 88
Query: 108 YLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVE 167
YLG K + E + + A K DWR +GAVTPVKNQG CG CW+FST + E
Sbjct: 89 YLGTKFDASSLIGTQEEKVHTNSSAASK--DWRSEGAVTPVKNQGQCGGCWSFSTTGSTE 146
Query: 168 GINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEG 227
G + G L SLSEQ LIDC T N+GC+GGLM YAF+YI+ + G+ E YPY E G
Sbjct: 147 GAHFQSKGELVSLSEQNLIDCSTE-NSGCDGGLMTYAFEYIINNNGIDTESSYPYKAENG 205
Query: 228 TCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGP- 286
CE K E T+S Y+ V E SL A+ PVSVAI+AS FQ Y+ G++ P
Sbjct: 206 KCEYKSEN-SGATLSSYKTVTAGSESSLESAVNVNPVSVAIDASHQSFQLYTSGIYYEPE 264
Query: 287 CGAE-LDHGVAAVGY-------------------GKSKGSDYIIVKNSWGPKWGERGYIR 326
C +E LDHGV AVGY S ++Y IVKNSWG WG GYI
Sbjct: 265 CSSENLDHGVLAVGYGSGSGSSSGQSSGQSSGNLSASSSNEYWIVKNSWGTSWGIEGYIL 324
Query: 327 MKRNTGKPEGLCGINKMASIPL 348
M RN + CGI AS P+
Sbjct: 325 MSRN---RDNNCGIASSASFPV 343
>gi|449275508|gb|EMC84350.1| Cathepsin L1, partial [Columba livia]
Length = 319
Score = 245 bits (626), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 143/320 (44%), Positives = 192/320 (60%), Gaps = 19/320 (5%)
Query: 41 MDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT----SYWLGLNEF 96
+D +L++SW H K Y EE R ++++NLK I+ N + T SY LG+N+F
Sbjct: 6 LDGHWQLWKSW---HNKDYHEREESWRRV-VWEKNLKMIELHNLDHTLGKHSYKLGMNQF 61
Query: 97 ADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGS 156
DM+ EEF+ G + R+ ++F P+SVDWR+KG VTPVK+QG CGS
Sbjct: 62 GDMTTEEFRQLMNGYAHKKSERKYRGSQFLEPSFLEAPRSVDWREKGYVTPVKDQGQCGS 121
Query: 157 CWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDT-SFNNGCNGGLMDYAFKYIVASGGLH 215
CWAFST A+EG + +G L SLSEQ L+DC N GCNGGLMD AF+Y+ +GG+
Sbjct: 122 CWAFSTTGALEGQHFRKTGKLVSLSEQNLVDCSRPEGNQGCNGGLMDQAFQYVQDNGGID 181
Query: 216 KEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAIEASGTD 274
EE YPY ++ K E +G+ D+P+ E++L+KA+A PVSVAI+A +
Sbjct: 182 SEESYPYTAKDDEDCRYKAEYNAANDTGFVDIPQGHERALMKAVAAVGPVSVAIDAGHSS 241
Query: 275 FQFYSGGVFTGP-CGAE-LDHGVAAVGYG----KSKGSDYIIVKNSWGPKWGERGYIRMK 328
FQFY G++ P C +E LDHGV VGYG G Y IVKNSWG KWG++GYI M
Sbjct: 242 FQFYQSGIYYEPDCSSEDLDHGVLVVGYGFEGEDVDGKKYWIVKNSWGEKWGDKGYIYMA 301
Query: 329 RNTGKPEGLCGINKMASIPL 348
++ + CGI AS PL
Sbjct: 302 KDR---KNHCGIATAASYPL 318
>gi|294897727|ref|XP_002776051.1| cathepsin L, putative [Perkinsus marinus ATCC 50983]
gi|239882576|gb|EER07867.1| cathepsin L, putative [Perkinsus marinus ATCC 50983]
Length = 361
Score = 245 bits (626), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 132/295 (44%), Positives = 176/295 (59%), Gaps = 16/295 (5%)
Query: 48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNK 107
F + K GK Y+ EE++ R IF+ NL HI+Q N SY LG+NE+ D++HEEF
Sbjct: 31 FIGFQYKFGKKYESKEEEIKRNAIFQVNLHHIEQINARNLSYKLGVNEYTDLTHEEFAAL 90
Query: 108 YLGLKPQFPTRRQP-------SAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAF 160
LG+ + R+ S+ D L SVDWR K +TP+K+QG CGSCWAF
Sbjct: 91 KLGI-LKMSLRKDDNWISLANSSLLVSADTTQLAASVDWRNKSVLTPIKDQGHCGSCWAF 149
Query: 161 STVAAVEGINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKEED 219
S+ A+E I +G L SLSEQ+L+DC +S+ N+GCNGG M YA+ YI +S G+ +E
Sbjct: 150 SSTGALEAQYAIATGKLLSLSEQQLVDCSSSYGNHGCNGGWMQYAYDYIKSS-GIDQEST 208
Query: 220 YPYLMEEGTCEDKKEEME----VVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDF 275
YPY + TC+ E++ V ++GY + E EQ+L+ L PVSVA+ AS DF
Sbjct: 209 YPYEASDNTCQKSLEKLSDGLPVGEVTGYH-MLEQTEQALMTRLVAAPVSVAMYASDPDF 267
Query: 276 QFYSGGVFTG-PCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKR 329
QFY GV++ C LDH V AVGYG G DY I +NSWG WG+ GY +KR
Sbjct: 268 QFYKSGVYSSDTCNGGLDHAVVAVGYGNENGEDYFIGRNSWGTSWGQDGYFYLKR 322
>gi|33112583|gb|AAP94047.1| cathepsin-L-like cysteine peptidase 03 [Tenebrio molitor]
Length = 337
Score = 245 bits (626), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 145/326 (44%), Positives = 192/326 (58%), Gaps = 17/326 (5%)
Query: 36 EHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNK----EVTSYWL 91
+ ++ D + E + ++ H K Y+ E+ R +IF EN + + NK + S+ L
Sbjct: 15 QAVSFFDLVQEQWGAFKMTHNKQYQSDTEERFRMKIFMENSHTVAKHNKLYAQGLVSFKL 74
Query: 92 GLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVK----ALPKSVDWRKKGAVTP 147
G+N++ADM H EF G R ++ S + LP +DWR KGAVTP
Sbjct: 75 GINKYADMLHHEFVQVLNGFNRTKSGLRSGESDDSVTFLPPANVQLPGQIDWRDKGAVTP 134
Query: 148 VKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFK 206
VK+QG CGSCW+FS ++EG + SG L SLSEQ L+DC F NNGCNGGLMD AF+
Sbjct: 135 VKDQGQCGSCWSFSATGSLEGQHFRKSGKLVSLSEQNLVDCSEKFGNNGCNGGLMDNAFR 194
Query: 207 YIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVS 265
YI A+GG+ E+ YPY E+ C K + + T GY D+ +E L A+A PVS
Sbjct: 195 YIKANGGIDTEQAYPYKAEDEKCHYKPKN-KGATDRGYVDIESGNEDKLQSAVATVGPVS 253
Query: 266 VAIEASGTDFQFYSGGVFTGP-CGA-ELDHGVAAVGYG-KSKGSDYIIVKNSWGPKWGER 322
VAI+AS FQ YSGGV+ P C A +LDHGV VGYG + G+DY +VKNSWG WG++
Sbjct: 254 VAIDASHQSFQLYSGGVYYEPDCSASQLDHGVLVVGYGTEDDGTDYWLVKNSWGKSWGDQ 313
Query: 323 GYIRMKRNTGKPEGLCGINKMASIPL 348
GYI+M RN + CGI AS PL
Sbjct: 314 GYIKMARNR---DNNCGIATEASYPL 336
>gi|281346354|gb|EFB21938.1| hypothetical protein PANDA_009085 [Ailuropoda melanoleuca]
Length = 333
Score = 245 bits (626), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 146/354 (41%), Positives = 205/354 (57%), Gaps = 34/354 (9%)
Query: 6 HSKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEK 65
H L L +L L + +S A F+ + L + W + +GK Y +E+
Sbjct: 2 HPSLFLAALCLGI---ASAAPRFN------------ENLDARWTRWKAANGKLYN-KDEE 45
Query: 66 LHRFEIFKENLKHIDQRNKEVT----SYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQP 121
+ R ++++N+K IDQ N+E + S+ L +N F D+++EEFK GLK Q P +
Sbjct: 46 VWRRAVWEKNMKMIDQHNEEYSQGKHSFILAMNAFGDLTNEEFKQVMNGLKIQNP---RE 102
Query: 122 SAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLS 181
F P SVDWR+KG VTPVK+QG CGSCWAFS A+EG +G L SLS
Sbjct: 103 GNMFQLLPFAETPSSVDWREKGYVTPVKDQGQCGSCWAFSATGALEGQMFRKTGKLVSLS 162
Query: 182 EQELIDCDTSFNN-GCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVT 240
EQ L+DC + N GCNGGLMD AF+Y+ +GGL EE YPYL ++G C+ K E+
Sbjct: 163 EQNLVDCSRAEGNAGCNGGLMDNAFRYVKDNGGLDSEESYPYLAQDGRCKYKPEQ-SAAN 221
Query: 241 ISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGP-CGAE-LDHGVAAV 298
+G+ D+ +++E +L P+SVAI+AS F+FY G++ P C +E LDHGV V
Sbjct: 222 DTGFADIHQDEESLMLSVATVGPISVAIDASLDTFRFYYKGIYYDPNCSSEDLDHGVLVV 281
Query: 299 GYG----KSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPL 348
GYG +++ +Y IVKNSWG +WG +GYI M ++ G CGI AS P+
Sbjct: 282 GYGSDEREAENKNYWIVKNSWGTQWGMQGYILMAKDRGNH---CGIATSASFPI 332
>gi|225718114|gb|ACO14903.1| Cathepsin L precursor [Caligus clemensi]
Length = 336
Score = 245 bits (625), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 145/339 (42%), Positives = 196/339 (57%), Gaps = 17/339 (5%)
Query: 21 CSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKC-IEEKLHRFEIFKENLKHI 79
CS+L ++ + ++ D ++ +ESW HGKTY IEEKL R +I+ EN I
Sbjct: 3 CSTLLLSVLVIASTANAVSFFDVVLSDWESWKLMHGKTYSSSIEEKL-RLKIYMENSLKI 61
Query: 80 DQRNKE----VTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPK 135
+ N E + Y++ +N + D+ H EF G + T ++++ LP
Sbjct: 62 SRHNSEALNGIHPYYMKMNHYGDLLHHEFVAMVNGYQYANKTASLGGTYIPNKNIQ-LPT 120
Query: 136 SVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSF-NN 194
VDWR++GAVTPVKNQG CGSCW+FS A+EG + +G L SLSEQ L+DC F NN
Sbjct: 121 HVDWREEGAVTPVKNQGQCGSCWSFSATGALEGQDFRKTGKLISLSEQNLVDCSRKFGNN 180
Query: 195 GCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQS 254
GC GGLMD+AF YI + G+ E YPY +G C + I G+ D+ + E+
Sbjct: 181 GCEGGLMDFAFTYIRDNKGIDTEASYPYEGIDGHCHYNPKNKGGSDI-GFVDIKKGSEKD 239
Query: 255 LLKALAH-QPVSVAIEASGTDFQFYSGGVFT-GPCGA-ELDHGVAAVGYGKS--KGSDYI 309
L KA+A P+SVAI+AS FQFYS GV+ C + ELDHGV VG+G G DY
Sbjct: 240 LKKAVAGVGPISVAIDASHMSFQFYSHGVYVESKCSSEELDHGVLVVGFGTDSVSGEDYW 299
Query: 310 IVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPL 348
+VKNSW KWG++GYI+M RN E +CGI AS P+
Sbjct: 300 LVKNSWSEKWGDQGYIKMARN---KENMCGIASSASYPV 335
>gi|229367042|gb|ACQ58501.1| Cathepsin L precursor [Anoplopoma fimbria]
Length = 334
Score = 245 bits (625), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 143/314 (45%), Positives = 184/314 (58%), Gaps = 19/314 (6%)
Query: 48 FESWMSKHGKTYKCIEEKLHRFEIFKEN----LKHIDQRNKEVTSYWLGLNEFADMSHEE 103
F +W + G++Y E+ R EI+ N L H ++ + SY LG+ FADM +EE
Sbjct: 26 FHAWKLQFGRSYNSPAEEAQRKEIWLSNRRLVLVHNIMADQGIKSYRLGMTYFADMENEE 85
Query: 104 FKNKY----LG-LKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCW 158
+K + LG P R+ SA + LP SVDWR+KG VT VK+Q CGSCW
Sbjct: 86 YKRQISQGCLGSFNASLP--RRGSAYLRLPEGADLPNSVDWREKGYVTEVKDQKQCGSCW 143
Query: 159 AFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNN-GCNGGLMDYAFKYIVASGGLHKE 217
AFST ++EG +G L SLSEQ+L+DC + N GC GGLMD AF+YI A+GG+ E
Sbjct: 144 AFSTTGSLEGQTFRKTGKLVSLSEQQLVDCSGDYGNEGCMGGLMDSAFRYIQANGGIDTE 203
Query: 218 EDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAIEASGTDFQ 276
+ YPY E+G C + T +GY DV + DE +L +A+A PVSVAI+AS + FQ
Sbjct: 204 DSYPYEAEDGQCRYNSANIG-ATCTGYVDVKQGDEDALKEAVATIGPVSVAIDASHSSFQ 262
Query: 277 FYSGGVFTGP--CGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKP 334
Y GV+ P +ELDHGV AVGYG G DY +VKNSWG WG +GYI M RN
Sbjct: 263 LYESGVYDEPECSSSELDHGVLAVGYGSDNGHDYWLVKNSWGLGWGNKGYIMMTRN---K 319
Query: 335 EGLCGINKMASIPL 348
CGI +S PL
Sbjct: 320 HNQCGIATASSYPL 333
>gi|91992508|gb|ABE72970.1| cathepsin L [Aedes aegypti]
Length = 339
Score = 245 bits (625), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 147/324 (45%), Positives = 193/324 (59%), Gaps = 23/324 (7%)
Query: 44 LIELF-ESWMS---KHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT----SYWLGLNE 95
L EL E W + +H K Y E+ R +I+ +N I + N+ Y L +N+
Sbjct: 19 LYELVKEEWNAFKLQHRKNYDSETEERIRLKIYVQNKHKIAKHNQRFDLGQEKYRLRVNK 78
Query: 96 FADMSHEEFKNKYLGL------KPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVK 149
+AD+ HEEF G K R + F +P +VDWRKKGAVTPVK
Sbjct: 79 YADLLHEEFVQTVNGFNRTDSKKSLKGVRIEEPVTFIEPANVEVPTTVDWRKKGAVTPVK 138
Query: 150 NQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYI 208
+QG CGSCW+FS A+EG + +G L SLSEQ L+DC + NNGCNGG+MDYAF+YI
Sbjct: 139 DQGHCGSCWSFSATGALEGQHFRKTGKLVSLSEQNLVDCSGKYGNNGCNGGMMDYAFQYI 198
Query: 209 VASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVA 267
+GG+ E+ YPY + TC + + T GY D+P+ DE++L KALA PVS+A
Sbjct: 199 KDNGGIDTEKSYPYEAIDDTCHFNPKAVG-ATDKGYVDIPQGDEEALKKALATVGPVSIA 257
Query: 268 IEASGTDFQFYSGGVFTGP-CGAE-LDHGVAAVGYGKS-KGSDYIIVKNSWGPKWGERGY 324
I+AS FQFYS GV+ P C +E LDHGV AVGYG S +G DY +VKNSWG WG++GY
Sbjct: 258 IDASHESFQFYSEGVYYEPQCDSENLDHGVLAVGYGTSEEGEDYWLVKNSWGTTWGDQGY 317
Query: 325 IRMKRNTGKPEGLCGINKMASIPL 348
++M RN + CG+ AS PL
Sbjct: 318 VKMARNH---DNHCGVATCASYPL 338
>gi|30023547|gb|AAO48766.2| cathepsin L-like cysteine proteinase [Tenebrio molitor]
Length = 337
Score = 245 bits (625), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 145/326 (44%), Positives = 191/326 (58%), Gaps = 17/326 (5%)
Query: 36 EHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNK----EVTSYWL 91
+ ++ D + E + ++ H K Y+ E+ R +IF EN + + NK + S+ L
Sbjct: 15 QAVSFFDLVQEQWGAFKMTHNKQYQSETEERFRMKIFMENSHTVAKHNKLYAQGLVSFKL 74
Query: 92 GLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVK----ALPKSVDWRKKGAVTP 147
G+N++ADM H EF G R ++ S + LP +DWR KGAVTP
Sbjct: 75 GINKYADMLHHEFVQVLNGFNRTKSGLRSGESDDSVTFLPPANVQLPGQIDWRDKGAVTP 134
Query: 148 VKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFK 206
VK+QG CGSCW+FS ++EG + SG L SLSEQ L+DC F NNGCNGGLMD AF+
Sbjct: 135 VKDQGQCGSCWSFSATGSLEGQHFRQSGKLVSLSEQNLVDCSEKFGNNGCNGGLMDNAFR 194
Query: 207 YIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVS 265
YI A+GG+ E+ YPY E+ C K + + T GY D+ +E L A+A PVS
Sbjct: 195 YIKANGGIDTEQAYPYKAEDEKCHYKPKN-KGATDRGYVDIESGNEDKLQSAVATVGPVS 253
Query: 266 VAIEASGTDFQFYSGGVFTGP-CGA-ELDHGVAAVGYG-KSKGSDYIIVKNSWGPKWGER 322
VAI+AS FQ YSGGV+ P C A +LDHGV VGYG + G+DY +VKNSWG WG++
Sbjct: 254 VAIDASHQSFQLYSGGVYYEPDCSASQLDHGVLVVGYGTEDDGTDYWLVKNSWGKSWGDQ 313
Query: 323 GYIRMKRNTGKPEGLCGINKMASIPL 348
GYI+M RN CGI AS PL
Sbjct: 314 GYIKMARNRNNN---CGIATEASYPL 336
>gi|357130490|ref|XP_003566881.1| PREDICTED: actinidain-like [Brachypodium distachyon]
Length = 350
Score = 245 bits (625), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 134/300 (44%), Positives = 174/300 (58%), Gaps = 11/300 (3%)
Query: 49 ESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT-SYWLGLNEFADMSHEEFKNK 107
E WM+K G+ Y EK R +F N +++D N+ +Y LGLNEF+D++ EF
Sbjct: 41 EQWMAKFGRVYTDANEKARRQAVFGANARYVDAVNRAGNRTYTLGLNEFSDLTDNEFAKT 100
Query: 108 YLGLKPQFPTRRQPS--AEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAA 165
+LG + P S + Y +PKS DWR KGAVT VK+QG CG CWAF+ VAA
Sbjct: 101 HLGYREFRPETANISKGVDPGYGLAGNIPKSFDWRTKGAVTEVKSQGGCGCCWAFAAVAA 160
Query: 166 VEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLME 225
EG+ +I G L S+SEQ+++DC T NN C GG M+ A Y+ ASGGL EEDY Y E
Sbjct: 161 TEGLVKIAKGTLISMSEQQVLDCTTG-NNTCKGGYMNDALSYVFASGGLQTEEDYEYNAE 219
Query: 226 EGTCEDKKEEMEVVTISGYQDVPENDEQSLLKAL-AHQPVSVAIEASGTDFQFYSGGVFT 284
+G C ++ + +P + + LL+ L A QPV VA+EA GTDF+ Y GGVFT
Sbjct: 220 KGACRRDVTPNPATSVGHAEYMPLDGNEFLLQKLVARQPVVVAVEAYGTDFKNYGGGVFT 279
Query: 285 G--PCGAELDHGVAAVGYGKSKGSD--YIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGI 340
G CG LDH VGYG + G Y +VKN WG WGE GY+R+ R G CG+
Sbjct: 280 GSPSCGQNLDHFFTVVGYGFADGGKQMYWLVKNQWGTSWGESGYMRIAR--GSSARNCGM 337
>gi|161408097|dbj|BAF94152.1| cathepsin L-like cysteine protease 2 [Plautia stali]
Length = 334
Score = 245 bits (625), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 142/304 (46%), Positives = 182/304 (59%), Gaps = 14/304 (4%)
Query: 55 HGKTYKCIEEKLHRFEIFKENLKHIDQRNKEV----TSYWLGLNEFADMSHEEFKNKYLG 110
H K Y E+ +R +IF EN K I++ N S+ L LN ADM E+ + YLG
Sbjct: 34 HRKEYDNELEESYRKKIFLENKKRIEKHNSRYKQGKVSFKLKLNHLADMLIHEYSDVYLG 93
Query: 111 LKPQFPTRRQPSAEFSYRDVK--ALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEG 168
+++ L K VDWR KGAVTPVKNQG CGSCWAFST A+EG
Sbjct: 94 FNKSSKANNNKLQSYTFIPPAHVTLNKEVDWRTKGAVTPVKNQGHCGSCWAFSTTGALEG 153
Query: 169 INQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEG 227
N +G L SLSEQ L+DC S+ NNGC GGLMD AF+YI + G+ E+ YPY E+
Sbjct: 154 QNFRKTGKLVSLSEQNLVDCSGSYGNNGCEGGLMDNAFQYIKENHGIDTEKSYPYEGEDE 213
Query: 228 TCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAIEASGTDFQFYSGGVFTGP 286
TC +K + T SG+ D+ + DE++L++A+A P+SVAI+AS FQFYS GV+ P
Sbjct: 214 TCRFRKTSIG-ATDSGFVDITQGDEEALMQAVATIGPISVAIDASHQSFQFYSEGVYYEP 272
Query: 287 -CGAE-LDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMA 344
C +E LDHGV VGYG Y +VKNSWG +WG+ GYI+M R+ + CGI A
Sbjct: 273 ECSSENLDHGVLVVGYGVEDNQKYWLVKNSWGTQWGDGGYIKMARD---QDNNCGIATQA 329
Query: 345 SIPL 348
S PL
Sbjct: 330 SYPL 333
>gi|52630917|gb|AAU84922.1| putative cathepsin L [Toxoptera citricida]
Length = 341
Score = 245 bits (625), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 150/356 (42%), Positives = 199/356 (55%), Gaps = 35/356 (9%)
Query: 10 LLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESW---MSKHGKTYKCIEEKL 66
+++ L L +FA SS++ +++++IE E W + K Y+ ++E+
Sbjct: 3 VVIVLGLVVFAISSVSS------------INLNEIIE--EEWDLFKVQFKKIYEDVKEEA 48
Query: 67 HRFEIFKENLKHIDQRNKEVTS----YWLGLNEFADMSHEEFKNKYLGLKPQFPT----- 117
R +++ +N I + NK + Y L +N F D+ E+ G KP
Sbjct: 49 FRKKVYLDNKLKIARHNKLYETGEETYALEMNHFGDLMQHEYTKMMNGFKPSLAGGDKNF 108
Query: 118 RRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNL 177
+ F + +PKS+DWRKKG VTPVKNQG CGSCW+FS ++EG + +G L
Sbjct: 109 TDDDAVTFLKSENVVIPKSIDWRKKGYVTPVKNQGQCGSCWSFSATGSLEGQHFRKTGVL 168
Query: 178 TSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEM 236
SLSEQ LIDC + NNGC GGLMD AFKYI ++ GL E+ YPY E+ C E
Sbjct: 169 VSLSEQNLIDCSRKYGNNGCEGGLMDLAFKYIKSNKGLDTEKSYPYEAEDDKCRYNPEN- 227
Query: 237 EVVTISGYQDVPENDEQSLLKALAH-QPVSVAIEASGTDFQFYSGGVFTGP--CGAELDH 293
T G+ D+PE DE +L+ ALA PVS+AI+AS FQFY GVF P ELDH
Sbjct: 228 SGATDKGFVDIPEGDEDALVHALATVGPVSIAIDASSEKFQFYKKGVFYNPRCSSTELDH 287
Query: 294 GVAAVGYGKS-KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPL 348
GV AVGYG KG DY IVKNSWG WG++GYI M RN + CG+ AS PL
Sbjct: 288 GVLAVGYGTDHKGGDYWIVKNSWGKTWGDQGYIMMARN---KKNNCGVASSASYPL 340
>gi|16304178|gb|AAL16954.1|AF426414_1 cathepsin L-like cysteine protease precursor [Delia radicum]
Length = 337
Score = 245 bits (625), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 146/324 (45%), Positives = 200/324 (61%), Gaps = 19/324 (5%)
Query: 38 LTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEV----TSYWLGL 93
++ D + E ++++ +H K + E+ R +IF EN I + N+ S+ LGL
Sbjct: 17 ISYTDVIKEEWQTFKMEHRKNFLSEVEERFRMKIFNENRHKIAKHNQLYAQGKVSFKLGL 76
Query: 94 NEFADMSHEEFKNKYLGLKPQFPT--RRQPSAEFSY---RDVKALPKSVDWRKKGAVTPV 148
N+++DM + EFK G R Q + Y +V+ +PKSVDWR+ GAVT V
Sbjct: 77 NKYSDMLYHEFKETMNGYNHTMRKVLRAQGFSGIIYIPPANVQ-IPKSVDWRQHGAVTAV 135
Query: 149 KNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKY 207
K+QG CGSCWAFS+ AA+EG + +G L SLSEQ L+DC T + NNGCNGGLMD AF+Y
Sbjct: 136 KDQGHCGSCWAFSSTAALEGQHFRKAGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRY 195
Query: 208 IVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQ-PVSV 266
I +GG+ E+ YPY + +C K + T +G+ D+P+ DE++L+KA+A PVSV
Sbjct: 196 IKDNGGIDTEKSYPYEGIDDSCHFTKSGVG-ATDTGFVDIPQGDEEALMKAVATMGPVSV 254
Query: 267 AIEASGTDFQFYSGGVFTGP-CGAE-LDHGVAAVGYGKSK-GSDYIIVKNSWGPKWGERG 323
AI+AS FQ YS GV+ P C A+ LDHGV VGYG K G DY +VKNSWG WG++G
Sbjct: 255 AIDASHESFQLYSEGVYNEPECDAQNLDHGVLVVGYGTDKTGLDYWLVKNSWGTTWGDQG 314
Query: 324 YIRMKRNTGKPEGLCGINKMASIP 347
YI+M RN + CGI +S P
Sbjct: 315 YIKMARN---QDNQCGIATASSYP 335
>gi|301769893|ref|XP_002920368.1| PREDICTED: cathepsin L1-like [Ailuropoda melanoleuca]
Length = 503
Score = 245 bits (625), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 146/354 (41%), Positives = 205/354 (57%), Gaps = 34/354 (9%)
Query: 6 HSKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEK 65
H L L +L L + +S A F+ + L + W + +GK Y +E+
Sbjct: 2 HPSLFLAALCLGI---ASAAPRFN------------ENLDARWTRWKAANGKLYN-KDEE 45
Query: 66 LHRFEIFKENLKHIDQRNKEVT----SYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQP 121
+ R ++++N+K IDQ N+E + S+ L +N F D+++EEFK GLK Q P +
Sbjct: 46 VWRRAVWEKNMKMIDQHNEEYSQGKHSFILAMNAFGDLTNEEFKQVMNGLKIQNP---RE 102
Query: 122 SAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLS 181
F P SVDWR+KG VTPVK+QG CGSCWAFS A+EG +G L SLS
Sbjct: 103 GNMFQLLPFAETPSSVDWREKGYVTPVKDQGQCGSCWAFSATGALEGQMFRKTGKLVSLS 162
Query: 182 EQELIDCDTSFNN-GCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVT 240
EQ L+DC + N GCNGGLMD AF+Y+ +GGL EE YPYL ++G C+ K E+
Sbjct: 163 EQNLVDCSRAEGNAGCNGGLMDNAFRYVKDNGGLDSEESYPYLAQDGRCKYKPEQ-SAAN 221
Query: 241 ISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGP-CGAE-LDHGVAAV 298
+G+ D+ +++E +L P+SVAI+AS F+FY G++ P C +E LDHGV V
Sbjct: 222 DTGFADIHQDEESLMLSVATVGPISVAIDASLDTFRFYYKGIYYDPNCSSEDLDHGVLVV 281
Query: 299 GYG----KSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPL 348
GYG +++ +Y IVKNSWG +WG +GYI M ++ G CGI AS P+
Sbjct: 282 GYGSDEREAENKNYWIVKNSWGTQWGMQGYILMAKDRGNH---CGIATSASFPI 332
Score = 77.4 bits (189), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 43/104 (41%), Positives = 61/104 (58%), Gaps = 6/104 (5%)
Query: 233 KEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGP-CGAE- 290
+ E ++G +VP+ +E +L A PVS AI AS FQF G++ P C +E
Sbjct: 386 RPECSAADVTGPVNVPQQEEAVMLAVAAGGPVSAAIRASLGSFQFCKEGIYYDPNCSSED 445
Query: 291 LDHGVAAVGYG----KSKGSDYIIVKNSWGPKWGERGYIRMKRN 330
LDHGV VGYG +++ +Y IVKNSWG WG +GY+ + R+
Sbjct: 446 LDHGVLVVGYGSDEREAENKNYWIVKNSWGTDWGLQGYMLLVRD 489
>gi|157829826|pdb|1AEC|A Chain A, Crystal Structure Of Actinidin-E-64 Complex+
Length = 218
Score = 244 bits (624), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 117/218 (53%), Positives = 150/218 (68%), Gaps = 2/218 (0%)
Query: 133 LPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSF 192
LP VDWR GAV +K+QG CG CWAFS +A VEGIN+IV+G L SLSEQELIDC +
Sbjct: 1 LPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSEQELIDCGRTQ 60
Query: 193 NN-GCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPEND 251
N GCNGG + F++I+ +GG++ EE+YPY ++G C + + VTI Y++VP N+
Sbjct: 61 NTRGCNGGYITDGFQFIINNGGINTEENYPYTAQDGECNVDLQNEKYVTIDTYENVPYNN 120
Query: 252 EQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIV 311
E +L A+ +QPVSVA++A+G F+ YS G+FTGPCG +DH V VGYG G DY IV
Sbjct: 121 EWALQTAVTYQPVSVALDAAGDAFKQYSSGIFTGPCGTAIDHAVTIVGYGTEGGIDYWIV 180
Query: 312 KNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
KNSW WGE GY+R+ RN G G CGI M S P+K
Sbjct: 181 KNSWDTTWGEEGYMRILRNVGGA-GTCGIATMPSYPVK 217
>gi|2804262|dbj|BAA24442.1| cysteine proteinase [Sitophilus zeamais]
Length = 338
Score = 244 bits (624), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 145/335 (43%), Positives = 195/335 (58%), Gaps = 18/335 (5%)
Query: 28 FSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT 87
+ V S + ++ D + E + S+ +H K Y E+ R +IF EN + + NK +
Sbjct: 7 LAAVVISCQAVSFYDLVQEQWSSFKMQHSKNYDSETEERFRMKIFMENAHKVAKHNKLFS 66
Query: 88 S----YWLGLNEFADMSHEEFKNKYLGL-KPQFPTRRQPSAEFSYRDVK----ALPKSVD 138
+ LGLN++ADM H EF + G K + + + R + LP +VD
Sbjct: 67 QGFVKFKLGLNKYADMLHHEFVSTLNGFNKTKNNILKGSDLNDAVRFISPANVKLPDTVD 126
Query: 139 WRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSF-NNGCN 197
WR KGAVT VK+QG CGSCW+FS ++EG + +G L SLSEQ L+DC + NNGCN
Sbjct: 127 WRDKGAVTEVKDQGHCGSCWSFSATGSLEGQHFRKTGKLVSLSEQNLVDCSGRYGNNGCN 186
Query: 198 GGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLK 257
GGLMD AF+YI +GG+ E+ YPYL E+ C K + T G+ D+ E +E L
Sbjct: 187 GGLMDNAFRYIKDNGGIDTEKSYPYLAEDEKCH-YKAQNSGATDKGFVDIEEANEDDLKA 245
Query: 258 ALAH-QPVSVAIEASGTDFQFYSGGVFTGP-CGA-ELDHGVAAVGYGKS-KGSDYIIVKN 313
A+A PVS+AI+AS FQ YS GV++ P C + ELDHGV VGYG S G DY +VKN
Sbjct: 246 AVATVGPVSIAIDASHETFQLYSDGVYSDPECSSQELDHGVLVVGYGTSDDGQDYWLVKN 305
Query: 314 SWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPL 348
SWGP WG GYI+M RN + +CG+ AS PL
Sbjct: 306 SWGPSWGLNGYIKMARN---QDNMCGVASQASYPL 337
>gi|328776427|ref|XP_625135.3| PREDICTED: cathepsin L-like [Apis mellifera]
Length = 351
Score = 244 bits (624), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 152/332 (45%), Positives = 188/332 (56%), Gaps = 23/332 (6%)
Query: 34 SPEHLTSMDKLIELFESWMS---KHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEV---- 86
S H S +L+ + WM+ +H K YK E+ R +IF +N I + N
Sbjct: 19 SRTHAVSFFELVN--QEWMTFKMEHKKVYKSDVEERFRMKIFMDNKHKIAKHNSNYEMKK 76
Query: 87 TSYWLGLNEFADMSHEEFKNKYLG----LKPQFPTRRQP-SAEFSYRDVKALPKSVDWRK 141
SY L +N++ DM H EF N G + Q + R P A F LPK VDWRK
Sbjct: 77 VSYKLKMNKYGDMLHHEFVNILNGFNKSINTQLRSERLPVGASFIEPANVVLPKKVDWRK 136
Query: 142 KGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGL 200
+GAVTPVK+QG CGSCW+FS A+EG + +G L SLSEQ LIDC + NNGCNGGL
Sbjct: 137 EGAVTPVKDQGHCGSCWSFSATGALEGQHFRRTGVLVSLSEQNLIDCSGKYGNNGCNGGL 196
Query: 201 MDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALA 260
MD AF+YI + GL E YPY E C + + GY D+P DE+ L A+A
Sbjct: 197 MDQAFQYIKDNKGLDTEASYPYEAENDKCRYNPANSGAIDV-GYIDIPTGDEKLLKAAVA 255
Query: 261 H-QPVSVAIEASGTDFQFYSGGVFTGP-CGA-ELDHGVAAVGYGKSK-GSDYIIVKNSWG 316
PVSVAI+AS FQFYS GV+ P C + ELDHGV +GYG ++ G DY +VKNSWG
Sbjct: 256 TIGPVSVAIDASHQSFQFYSEGVYYEPECSSEELDHGVLVIGYGTNENGQDYWLVKNSWG 315
Query: 317 PKWGERGYIRMKRNTGKPEGLCGINKMASIPL 348
WG GYI+M RN CGI AS PL
Sbjct: 316 ETWGNNGYIKMARNKLNH---CGIASSASYPL 344
>gi|297740510|emb|CBI30692.3| unnamed protein product [Vitis vinifera]
Length = 377
Score = 244 bits (624), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 119/220 (54%), Positives = 153/220 (69%), Gaps = 5/220 (2%)
Query: 134 PKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFN 193
P S+DWRKKG VT +K+QG CGSCWAFS+ A+EGIN IV+G+L SLSEQEL+DCDT+ N
Sbjct: 13 PSSLDWRKKGVVTGIKDQGDCGSCWAFSSTGAMEGINAIVTGDLISLSEQELVDCDTT-N 71
Query: 194 NGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQ 253
GC GG MDYAF++++++GG+ E DYPY +GTC KE+ +VV+I GY+DV E+D
Sbjct: 72 YGCEGGYMDYAFEWVISNGGIDSESDYPYTGTDGTCNTTKEDTKVVSIDGYKDVDESD-S 130
Query: 254 SLLKALAHQPVSVAIEASGTDFQFYSGGVFTG---PCGAELDHGVAAVGYGKSKGSDYII 310
+LL A +QP+SV ++ S DFQ Y+ G++ G ++DH V VGYG DY I
Sbjct: 131 ALLCAAVNQPISVGMDGSALDFQLYTSGIYAGDCSDDPDDIDHAVLIVGYGSEDSEDYWI 190
Query: 311 VKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLKK 350
KNSWG WG GY +KRNT P G C IN MAS P K+
Sbjct: 191 CKNSWGTSWGMEGYFYIKRNTDLPYGECAINAMASYPTKE 230
>gi|94421564|gb|ABF18889.1| cathepsin-L [Lygus lineolaris]
Length = 314
Score = 244 bits (624), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 135/290 (46%), Positives = 177/290 (61%), Gaps = 12/290 (4%)
Query: 48 FESWMSKHGKTYKCIEEKLHRFEIF----KENLKHIDQRNKEVTSYWLGLNEFADMSHEE 103
+ES+ +K+GKTY+ E + R I+ ++ ++H + + + SY LGLN FADM + E
Sbjct: 27 WESYKAKYGKTYESNENEAARRTIYFMAKEKVMEHNARFEQGLVSYKLGLNSFADMHNGE 86
Query: 104 FKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTV 163
F+ G + P + S LP SVDWR KGAVTP+KNQG CGSCWAFST
Sbjct: 87 FRKMMNGYRRGTP---RNSVVVHVESNITLPASVDWRTKGAVTPIKNQGQCGSCWAFSTT 143
Query: 164 AAVEGINQIVSGNLTSLSEQELIDCDTS-FNNGCNGGLMDYAFKYIVASGGLHKEEDYPY 222
++EG + + G L SLSEQEL+DC + N+GC+GGLMD AF YI + G+ E+ YPY
Sbjct: 144 GSLEGQHALKKGKLVSLSEQELVDCSAAEGNDGCDGGLMDDAFTYIKKNNGIDTEQSYPY 203
Query: 223 LMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAIEASGTDFQFYSGG 281
E+GTC KK ++ T++G+ DV E L A A P+SVAI+AS DFQ Y G
Sbjct: 204 TGEDGTCSFKKSDV-AATVTGFVDVTSGSESGLQDASATIGPISVAIDASSWDFQLYESG 262
Query: 282 VF-TGPCG-AELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKR 329
V+ C ELDHGV VGYG G+ Y +VKNSWG WG GYI+M R
Sbjct: 263 VYDVSDCSTTELDHGVLVVGYGTDDGTAYWLVKNSWGTDWGHHGYIQMSR 312
>gi|728637|emb|CAA59441.1| cathepsin l [Litopenaeus vannamei]
Length = 326
Score = 244 bits (624), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 142/318 (44%), Positives = 193/318 (60%), Gaps = 24/318 (7%)
Query: 44 LIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNK-----EVTSYWLGLNEFAD 98
L + ++++ ++HG+ Y ++E+ +R +F++N + ID N EVT + L +N+F D
Sbjct: 19 LRQQWQNFKAEHGRRYASVQEERYRLSVFEQNQQFIDDHNARFENGEVT-FTLQMNQFGD 77
Query: 99 MSHEEF---KNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCG 155
M+ EE N +LG PTRR P+A D + LP+ VDWR KGAVTPVK+Q CG
Sbjct: 78 MTSEEIVATMNGFLGA----PTRR-PAAVLKADD-ETLPEKVDWRTKGAVTPVKDQKQCG 131
Query: 156 SCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNN-GCNGGLMDYAFKYIVASGGL 214
SCWAFST ++EG + + G L SLSEQ L+DC F N GC GGLMD AF+YI A+ G+
Sbjct: 132 SCWAFSTTGSLEGQHFLKDGKLVSLSEQNLVDCSDKFGNMGCMGGLMDQAFRYIKANKGI 191
Query: 215 HKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAIEASGT 273
E+ YPY ++G C + T +GY DV E +L KA+A P+SV I+AS +
Sbjct: 192 DTEDSYPYEAQDGKCRFDASNVG-ATDTGYVDVEHGSESALKKAVATIGPISVGIDASQS 250
Query: 274 DFQFYSGGVFTGP-CGAE-LDHGVAAVGYGKSK-GSDYIIVKNSWGPKWGERGYIRMKRN 330
F FY GV+ C + LDHGV AVGYG + G D+ +VKNSW WG++GYI+M RN
Sbjct: 251 TFHFYHTGVYHDDHCSSTMLDHGVLAVGYGSDENGGDFWLVKNSWNTSWGDKGYIKMSRN 310
Query: 331 TGKPEGLCGINKMASIPL 348
CGI AS PL
Sbjct: 311 RNNN---CGIASQASYPL 325
>gi|2765358|emb|CAA74241.1| cathepsin L [Litopenaeus vannamei]
Length = 325
Score = 244 bits (624), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 142/318 (44%), Positives = 193/318 (60%), Gaps = 24/318 (7%)
Query: 44 LIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNK-----EVTSYWLGLNEFAD 98
L + ++++ ++HG+ Y ++E+ +R +F++N + ID N EVT + L +N+F D
Sbjct: 18 LRQQWQNFKAEHGRRYASVQEERYRLSVFEQNQQFIDDHNARFENGEVT-FTLQMNQFGD 76
Query: 99 MSHEEF---KNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCG 155
M+ EE N +LG PTRR P+A D + LP+ VDWR KGAVTPVK+Q CG
Sbjct: 77 MTSEEIVATMNGFLGA----PTRR-PAAVLKADD-ETLPEKVDWRTKGAVTPVKDQKQCG 130
Query: 156 SCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNN-GCNGGLMDYAFKYIVASGGL 214
SCWAFST ++EG + + G L SLSEQ L+DC F N GC GGLMD AF+YI A+ G+
Sbjct: 131 SCWAFSTTGSLEGQHFLKDGKLVSLSEQNLVDCSDKFRNMGCMGGLMDQAFRYIKANKGI 190
Query: 215 HKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAIEASGT 273
E+ YPY ++G C + T +GY DV E +L KA+A P+SV I+AS +
Sbjct: 191 DTEDSYPYEAQDGKCRFDASNVG-ATDTGYVDVEHGSESALKKAVATIGPISVGIDASQS 249
Query: 274 DFQFYSGGVFTGP-CGAE-LDHGVAAVGYGKSK-GSDYIIVKNSWGPKWGERGYIRMKRN 330
F FY GV+ C + LDHGV AVGYG + G D+ +VKNSW WG++GYI+M RN
Sbjct: 250 TFHFYHTGVYHDDHCSSTMLDHGVLAVGYGSDENGGDFWLVKNSWNTSWGDKGYIKMSRN 309
Query: 331 TGKPEGLCGINKMASIPL 348
CGI AS PL
Sbjct: 310 RNNN---CGIASQASYPL 324
>gi|357130486|ref|XP_003566879.1| PREDICTED: fruit bromelain-like [Brachypodium distachyon]
Length = 354
Score = 244 bits (623), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 134/318 (42%), Positives = 186/318 (58%), Gaps = 25/318 (7%)
Query: 49 ESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT-SYWLGLNEFADMSHEEFKNK 107
E WM++ G+ YK +EK R E+F N +H+D N+ +Y LGLN F+D++ EF +
Sbjct: 39 ERWMARFGRAYKDADEKARRQEVFGANARHVDAVNRSGNRTYTLGLNHFSDLTDHEFLQQ 98
Query: 108 YLG-----------LKPQFPTRRQPSAEFSY-RDVKALPKSVDWRKKGAVTPVKNQGSCG 155
+LG L+P+ + +A Y +DV P SVDWR +GAVT +KNQ SCG
Sbjct: 99 HLGYRHHQPGPGGLLRPEDQDMSKATALADYGQDV---PDSVDWRAQGAVTEIKNQRSCG 155
Query: 156 SCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLH 215
SCWAF+ VAA EG+ +I +GNL S+SEQ+++DC T N C+GG ++ A +Y+ ASGGL
Sbjct: 156 SCWAFAAVAATEGLVKIATGNLISMSEQQVLDC-TGGGNTCDGGDINAALRYVAASGGLQ 214
Query: 216 KEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAIEASGTD 274
E Y Y ++G C ++ G + ++ L+ LA QPV+VA+EAS D
Sbjct: 215 PEAAYAYAAQKGACRGASPANSAASVGGARFARLGGDEGALRGLAAGQPVAVALEASEPD 274
Query: 275 FQFYSGGVFTG--PCGAELDHGVAAVGYGK--SKGSDYIIVKNSWGPKWGERGYIRMKRN 330
F+ Y GV+ G CG L+HGV VGYG G +Y +VKN WG WGE+GY+R+ R
Sbjct: 275 FRHYKSGVYAGSASCGRRLNHGVTVVGYGAEDDSGDEYWVVKNQWGTLWGEKGYMRVAR- 333
Query: 331 TGKPEGL-CGINKMASIP 347
G G CGI A P
Sbjct: 334 -GDVAGANCGIASYAYYP 350
>gi|348542776|ref|XP_003458860.1| PREDICTED: cathepsin L-like [Oreochromis niloticus]
Length = 334
Score = 244 bits (623), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 147/314 (46%), Positives = 186/314 (59%), Gaps = 19/314 (6%)
Query: 48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRN----KEVTSYWLGLNEFADMSHEE 103
F +W K GK+Y E+ HR +I+ N KH+ N + SY LG+ FADM +EE
Sbjct: 26 FHAWRLKFGKSYDSPSEESHRKQIWLTNRKHVLMHNILADQGFKSYRLGMTYFADMENEE 85
Query: 104 FKNKY----LG-LKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCW 158
+K LG P R+ S + LP +VDWR++G VT VK+Q CGSCW
Sbjct: 86 YKKLVSRGCLGSFNASLP--RRGSTFLRLPEGIDLPDAVDWREQGYVTGVKDQKQCGSCW 143
Query: 159 AFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNN-GCNGGLMDYAFKYIVASGGLHKE 217
AFS A+EG + +G L SLSEQ+L+DC ++ N GCNGG MD AF+YI A+GG+ E
Sbjct: 144 AFSATGALEGQHFRKTGILVSLSEQQLVDCSGAYGNEGCNGGWMDSAFRYIEANGGIDTE 203
Query: 218 EDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAIEASGTDFQ 276
YPY E+ C + T SGY DV + DE++L +A+A PVSVAI+AS FQ
Sbjct: 204 ASYPYEAEDWLCRYNPASVG-ATCSGYVDVNKYDEEALKEAVATIGPVSVAIDASHASFQ 262
Query: 277 FYSGGVFTGP-CGA-ELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKP 334
FY+ GV+ P C + ELDHGV AVGYG G DY +VKNSWG WGE GYI+M RN
Sbjct: 263 FYTSGVYDEPGCSSIELDHGVLAVGYGTENGHDYWLVKNSWGRGWGEMGYIKMSRN---K 319
Query: 335 EGLCGINKMASIPL 348
CGI AS PL
Sbjct: 320 HNQCGIASAASYPL 333
>gi|146217394|gb|ABQ10739.1| cathepsin L [Penaeus monodon]
Length = 341
Score = 244 bits (623), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 146/320 (45%), Positives = 183/320 (57%), Gaps = 19/320 (5%)
Query: 44 LIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT----SYWLGLNEFADM 99
++E +E++ +H K Y E+ R +IF EN I NK +Y L +N++ DM
Sbjct: 25 VLEEWEAFKLEHSKKYDSEVEESFRMKIFTENKHKIANHNKGFAQGHHTYKLSMNKYGDM 84
Query: 100 SHEEFKNKYLGLKPQFP-----TRRQPSAEFSYRDVKA-LPKSVDWRKKGAVTPVKNQGS 153
H EF + G + R A F D LPK+VDWR KGAVTP+K+QG
Sbjct: 85 LHHEFVSTMNGFRGNHTGGYKNNRAYTGATFIEPDDDVQLPKNVDWRTKGAVTPIKDQGQ 144
Query: 154 CGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIVASG 212
CGSCWAFS A+EG +G L SLSEQ L+DC F NNGCNGGLMD AF+Y+ +G
Sbjct: 145 CGSCWAFSATGALEGQTFRKTGQLVSLSEQNLVDCSRKFGNNGCNGGLMDNAFEYVKENG 204
Query: 213 GLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAIEAS 271
G+ EE YPY E+ C G+ DV E E +L KA+A PVSVAI+AS
Sbjct: 205 GIDTEESYPYDAEDEKCH-YNPRAAGAEDKGFVDVREGSEHALKKAVATVGPVSVAIDAS 263
Query: 272 GTDFQFYSGGVFTGP-CGAE-LDHGVAAVGYG-KSKGSDYIIVKNSWGPKWGERGYIRMK 328
FQFYS GV+ P C E LDHGV VGYG G+DY +VKNSWG WG++GY++M
Sbjct: 264 HESFQFYSHGVYIEPECSPEMLDHGVLVVGYGIDDDGTDYWLVKNSWGTTWGDQGYVKMA 323
Query: 329 RNTGKPEGLCGINKMASIPL 348
RN + CGI AS PL
Sbjct: 324 RNR---DNQCGIASSASFPL 340
>gi|325185016|emb|CCA19507.1| cysteine protease family C01A putative [Albugo laibachii Nc14]
Length = 492
Score = 244 bits (622), Expect = 5e-62, Method: Compositional matrix adjust.
Identities = 135/308 (43%), Positives = 172/308 (55%), Gaps = 36/308 (11%)
Query: 48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNK 107
F SW+ H T+ E R E + N +I N + +S+ LG N F+ +++EEF+ +
Sbjct: 33 FVSWLKTHHLTFSDAFEYAKRLETYIANDIYILTHNLQESSFKLGHNAFSHLTNEEFRQR 92
Query: 108 YLGLKP--QFPTRR------QPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWA 159
+ G K + T+R S F Y D LP+SVDW +KGAVT VKNQG CGSCWA
Sbjct: 93 FNGFKASDDYLTKRLAQSNVASSTNFQYID---LPESVDWVEKGAVTGVKNQGMCGSCWA 149
Query: 160 FSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEED 219
FST A+EG I SG L SLSEQEL+DCD + ++GCNGGLMD+AF +I G+ EED
Sbjct: 150 FSTTGAIEGATFISSGKLVSLSEQELVDCDHNGDHGCNGGLMDHAFSWISEHDGICSEED 209
Query: 220 YPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYS 279
Y Y+ + C K VV+ PV+VAI+A FQFY
Sbjct: 210 YAYIHSQSLCRSCK---PVVS----------------------PVAVAIDAGDRSFQFYQ 244
Query: 280 GGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCG 339
GV+ CG +LDHGV VGYG G Y VKNSWG WGE+GYIR+ R+ G CG
Sbjct: 245 SGVYNKTCGTQLDHGVLTVGYGVEDGQKYWKVKNSWGNSWGEKGYIRLSRDQNGRSGQCG 304
Query: 340 INKMASIP 347
I + S P
Sbjct: 305 IAMVPSYP 312
>gi|334332718|ref|XP_001367502.2| PREDICTED: cathepsin L1-like [Monodelphis domestica]
Length = 333
Score = 244 bits (622), Expect = 5e-62, Method: Compositional matrix adjust.
Identities = 134/308 (43%), Positives = 189/308 (61%), Gaps = 13/308 (4%)
Query: 48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT----SYWLGLNEFADMSHEE 103
+ W ++HGK+Y+ E+ L R +++NLK I++ N+E + S+ L +N+F DMS EE
Sbjct: 29 WHQWKAQHGKSYEANEDSLRR-ATWEKNLKMIERHNQEYSAGKHSFQLRMNKFGDMSTEE 87
Query: 104 FKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTV 163
FK G K RR + + + LP+SVDWR+KG VTPVK QG CG+CW+FS V
Sbjct: 88 FKQVMNGYKSNGSQRRTKGSLYRESLLAQLPESVDWREKGYVTPVKEQGDCGACWSFSAV 147
Query: 164 AAVEGINQIVSGNLTSLSEQELIDCDT-SFNNGCNGGLMDYAFKYIVASGGLHKEEDYPY 222
A+EG +G L SLS Q LIDC NNGC+GG MD AF+Y+ +GG+ EE YPY
Sbjct: 148 GAIEGQWFRKTGKLVSLSIQNLIDCTIPEGNNGCDGGFMDNAFQYVQDNGGIDTEECYPY 207
Query: 223 LMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAIEASGTDFQFYSGG 281
+ ++ C+ K E I+G+ D+P DE++L++A+A P+SV I+++ F+FY G
Sbjct: 208 VAQDTECK-YKPECSGANITGFVDIPSMDERALMEAVATVGPISVGIDSANPSFKFYQSG 266
Query: 282 VFTGP--CGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCG 339
V+ P ++LDHGV VGYG +Y IVKNSWG WG+ GYI M ++ + CG
Sbjct: 267 VYYEPDCSSSQLDHGVLVVGYGSIGKDEYWIVKNSWGEAWGDNGYILMAKDK---DNHCG 323
Query: 340 INKMASIP 347
I AS P
Sbjct: 324 IATEASYP 331
>gi|33112581|gb|AAP94046.1| cathepsin-L-like cysteine peptidase 02 [Tenebrio molitor]
Length = 337
Score = 244 bits (622), Expect = 6e-62, Method: Compositional matrix adjust.
Identities = 144/326 (44%), Positives = 192/326 (58%), Gaps = 17/326 (5%)
Query: 36 EHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNK----EVTSYWL 91
+ ++ D + E + ++ H K Y+ E+ R +IF EN + + NK + S+ L
Sbjct: 15 QAVSFFDLVQEQWGAFKMTHNKQYQSDTEERFRMKIFMENSHTVAKHNKLYAQGLVSFKL 74
Query: 92 GLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVK----ALPKSVDWRKKGAVTP 147
G+N++ADM H EF G R ++ S + LP +DWR KGAVTP
Sbjct: 75 GINKYADMLHHEFVQVLNGFNRTKSGLRSGESDDSVTFLPPANVQLPGQIDWRDKGAVTP 134
Query: 148 VKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFK 206
VK+QG CGSCW+FS ++EG + SG L SLSEQ L+DC F NNGCNGGLMD AF+
Sbjct: 135 VKDQGQCGSCWSFSATGSLEGQHFRKSGKLVSLSEQNLVDCSEKFGNNGCNGGLMDNAFR 194
Query: 207 YIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVS 265
YI A+GG+ E+ YPY E+ C K + + T GY D+ +E L A+A PVS
Sbjct: 195 YIKANGGIDTEQAYPYKAEDEKCHYKPKN-KGATDRGYVDIESGNEDKLQSAVATVGPVS 253
Query: 266 VAIEASGTDFQFYSGGVFTGP-CG-AELDHGVAAVGYG-KSKGSDYIIVKNSWGPKWGER 322
VAI+AS FQ YSGGV+ P C ++LDHGV VGYG + G+DY +VKNSWG WG++
Sbjct: 254 VAIDASHQSFQLYSGGVYYEPECSPSQLDHGVLVVGYGTEDDGTDYWLVKNSWGKSWGDQ 313
Query: 323 GYIRMKRNTGKPEGLCGINKMASIPL 348
GYI+M RN + CGI AS PL
Sbjct: 314 GYIKMARNR---DNNCGIATEASYPL 336
>gi|357216861|gb|AET71138.1| cysteine peptidase isoform b [Sphenophorus levis]
Length = 324
Score = 244 bits (622), Expect = 6e-62, Method: Compositional matrix adjust.
Identities = 140/311 (45%), Positives = 182/311 (58%), Gaps = 23/311 (7%)
Query: 48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKE----VTSYWLGLNEFADMSHEE 103
F+S+ KHGKTYK E+ RF IF+ENL+ I+ N E + SY G+N+FADM+ E
Sbjct: 26 FQSFKLKHGKTYKNQAEETKRFAIFRENLRKIEAHNAEYKQGIHSYTQGINKFADMTRAE 85
Query: 104 FKNKYLGLKPQFPTRRQPSAE--FSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFS 161
FK L Q T+ A F D ++P+S+DWR + VTP+K+Q CGSCWAF+
Sbjct: 86 FKAM---LATQVKTKPSIVATKTFQLADGVSVPESIDWRSRNVVTPIKDQAQCGSCWAFA 142
Query: 162 TVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYP 221
V + EG + +G LT SEQ+L+DC T N GC+GG +D F YI + GL E DYP
Sbjct: 143 VVGSTEGAYALSTGKLTRFSEQQLVDCTTDLNYGCDGGYLDDTFPYI-QTNGLELESDYP 201
Query: 222 YLMEEGTCEDKKEEMEVVT-ISGYQDVPENDEQSLLKALAHQ-PVSVAIEASGTDFQFYS 279
Y +G C E +VVT +S Y VP N EQ+LL+A+ PV++AI A D QFY
Sbjct: 202 YTGYDGYC--SYESSKVVTKVSSYVSVPAN-EQALLEAVGTAGPVAIAINAD--DLQFYF 256
Query: 280 GGVFTGP-CGAE-LDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGL 337
G+ C E LDHGV AVGY G DY ++KNSWG WGE GY R R + +
Sbjct: 257 SGIIDDKYCDPEYLDHGVLAVGYDSENGRDYWLIKNSWGADWGESGYFRFLRG----QNI 312
Query: 338 CGINKMASIPL 348
CG+ + A PL
Sbjct: 313 CGVKEDAVYPL 323
>gi|225707912|gb|ACO09802.1| Cathepsin K precursor [Osmerus mordax]
Length = 331
Score = 244 bits (622), Expect = 6e-62, Method: Compositional matrix adjust.
Identities = 144/336 (42%), Positives = 195/336 (58%), Gaps = 27/336 (8%)
Query: 22 SSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQ 81
S+LAH V E +E+W + H K Y ++E+ R I+++N++ I+
Sbjct: 13 STLAHPMDEVSLDTE-----------WENWKTTHNKEYNGLDEEGIRRAIWEKNMRMIEA 61
Query: 82 RNKEVT----SYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRD-VKALPKS 136
N+E SY LG+N DM+ EE K +GL Q P R F + V+ LPKS
Sbjct: 62 HNQEAALGMHSYELGMNNLGDMTSEEVAEKMMGL--QVPLNRDRGNTFVPDNTVERLPKS 119
Query: 137 VDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGC 196
+D+R+KG VTPVKNQGSCGSCWAFS+V A+EG +G L LS Q L+DC T NNGC
Sbjct: 120 IDYRRKGMVTPVKNQGSCGSCWAFSSVGALEGQLMKTTGKLVDLSPQNLVDCVTE-NNGC 178
Query: 197 NGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLL 256
GG M AF Y+ + G+ E YPY+ ++ TC M + GY+++PE +E++L
Sbjct: 179 GGGYMTNAFNYVRDNQGIDSEAAYPYIGQDETCAYNVSGM-TASCRGYKEIPEGNERALT 237
Query: 257 KALAH-QPVSVAIEASGTDFQFYSGGV-FTGPCGA-ELDHGVAAVGYGKS-KGSDYIIVK 312
A+A PVSV I+A+ + FQFY GV + C +++H V AVGYG + KG Y IVK
Sbjct: 238 VAVAKVGPVSVGIDATLSTFQFYQKGVYYDRNCNKDDINHAVLAVGYGVTPKGKKYWIVK 297
Query: 313 NSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPL 348
NSW WG +GYI M RN G LCGI +AS P+
Sbjct: 298 NSWSESWGNKGYILMARNRGN---LCGIANLASYPI 330
>gi|432936690|ref|XP_004082231.1| PREDICTED: cathepsin L-like [Oryzias latipes]
Length = 334
Score = 244 bits (622), Expect = 6e-62, Method: Compositional matrix adjust.
Identities = 141/314 (44%), Positives = 182/314 (57%), Gaps = 19/314 (6%)
Query: 48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRN----KEVTSYWLGLNEFADMSHEE 103
F +W K G+TY E+ R + + N K + N + + SY LG+ FADM +EE
Sbjct: 26 FHAWRLKFGRTYSSPTEEAQRRQTWLNNRKLVLVHNILADQGIKSYRLGMTYFADMENEE 85
Query: 104 FKNKY----LG-LKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCW 158
+K LG P R+ S F + K LP +VDWR KG VT VK+Q CGSCW
Sbjct: 86 YKRLISQGCLGSFNASLP--RRGSTFFRLPENKDLPAAVDWRDKGYVTDVKDQKQCGSCW 143
Query: 159 AFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNN-GCNGGLMDYAFKYIVASGGLHKE 217
AFS ++EG +G L SLSEQ+L+DC + N GC GGLMD AF+YI A+GG+ E
Sbjct: 144 AFSATGSLEGQTFRKTGKLVSLSEQQLVDCSGDYGNMGCGGGLMDDAFRYIQATGGIDTE 203
Query: 218 EDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAIEASGTDFQ 276
E YPY E+G C K + + T +GY DV DE +L +A+A P+SV I+AS FQ
Sbjct: 204 ESYPYEAEDGECRYKPDAVG-ATCTGYVDVSSGDEDALQEAVATIGPISVGIDASHISFQ 262
Query: 277 FYSGGVFTGP--CGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKP 334
Y G++ P +ELDHGV AVGYG G DY +VKNSWG WG++GYI+M +N
Sbjct: 263 LYESGLYDEPQCSSSELDHGVLAVGYGSENGQDYWLVKNSWGLTWGDQGYIKMSKNKSNQ 322
Query: 335 EGLCGINKMASIPL 348
CGI AS PL
Sbjct: 323 ---CGIATAASYPL 333
>gi|161172356|pdb|3BCN|A Chain A, Crystal Structure Of A Papain-Like Cysteine Protease
Ervatamin-A Complexed With Irreversible Inhibitor E-64
gi|161172357|pdb|3BCN|B Chain B, Crystal Structure Of A Papain-Like Cysteine Protease
Ervatamin-A Complexed With Irreversible Inhibitor E-64
Length = 209
Score = 243 bits (621), Expect = 7e-62, Method: Compositional matrix adjust.
Identities = 123/217 (56%), Positives = 150/217 (69%), Gaps = 10/217 (4%)
Query: 133 LPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSF 192
LP+ VDWR KGAV P+KNQG CGSCWAFSTV VE INQI +GNL SLSEQ+L+DC
Sbjct: 1 LPEHVDWRAKGAVIPLKNQGKCGSCWAFSTVTTVESINQIRTGNLISLSEQQLVDCSKK- 59
Query: 193 NNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDE 252
N+GC GG D A++YI+A+GG+ E +YPY +G C K +VV I G + VP+ +E
Sbjct: 60 NHGCKGGYFDRAYQYIIANGGIDTEANYPYKAFQGPCRAAK---KVVRIDGCKGVPQCNE 116
Query: 253 QSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVK 312
+L A+A QP VAI+AS FQ Y GG+FTGPCG +L+HGV VGYGK DY IV+
Sbjct: 117 NALKNAVASQPSVVAIDASSKQFQHYKGGIFTGPCGTKLNHGVVIVGYGK----DYWIVR 172
Query: 313 NSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
NSWG WGE+GY RMKR G GLCGI ++ P K
Sbjct: 173 NSWGRHWGEQGYTRMKRVGGC--GLCGIARLPFYPTK 207
>gi|209693435|ref|NP_001129410.1| cathepsin L precursor [Acyrthosiphon pisum]
gi|251823771|ref|NP_001156569.1| cathepsin L precursor [Acyrthosiphon pisum]
Length = 341
Score = 243 bits (621), Expect = 7e-62, Method: Compositional matrix adjust.
Identities = 151/356 (42%), Positives = 197/356 (55%), Gaps = 35/356 (9%)
Query: 10 LLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESW---MSKHGKTYKCIEEKL 66
+++ L L FA S+++ +++++IE E W + K Y+ I+E+
Sbjct: 3 VVIVLGLVAFAISTVSS------------INLNEVIE--EEWSLFKIQFKKLYEDIKEET 48
Query: 67 HRFEIFKENLKHIDQRNKEVTS----YWLGLNEFADMSHEEFKNKYLGLKPQFPT----- 117
R +++ +N I + NK S Y L +N F D+ E+ G KP
Sbjct: 49 FRKKVYLDNKLKIARHNKLYESGEETYALEMNHFGDLMQHEYTKMMNGFKPSLAGGDRNF 108
Query: 118 RRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNL 177
+ F + +PKSVDWRKKG VTPVKNQG CGSCW+FS ++EG + +G L
Sbjct: 109 TNDEAVTFLKSENVVIPKSVDWRKKGYVTPVKNQGQCGSCWSFSATGSLEGQHFRKTGVL 168
Query: 178 TSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEM 236
SLSEQ LIDC + NNGC GGLMD AFKYI ++ GL E+ YPY E+ C E
Sbjct: 169 VSLSEQNLIDCSRKYGNNGCEGGLMDLAFKYIKSNKGLDTEKSYPYEAEDDKCRYNPEN- 227
Query: 237 EVVTISGYQDVPENDEQSLLKALAH-QPVSVAIEASGTDFQFYSGGVFTGP--CGAELDH 293
T G+ D+PE DE +L+ ALA PVS+AI+AS FQFY GVF P ELDH
Sbjct: 228 SGATDKGFVDIPEGDEDALMHALATVGPVSIAIDASSEKFQFYKKGVFYNPRCSSTELDH 287
Query: 294 GVAAVGYGKS-KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPL 348
GV AVG+G KG DY IVKNSWG WG+ GYI M RN + CG+ AS PL
Sbjct: 288 GVLAVGFGSDKKGGDYWIVKNSWGKTWGDEGYIMMARN---KKNNCGVASSASYPL 340
>gi|62955291|ref|NP_001017661.1| cathepsin S, b.2 precursor [Danio rerio]
gi|62204682|gb|AAH93339.1| Cathepsin S, b.2 [Danio rerio]
gi|182891354|gb|AAI64362.1| Ctssb.2 protein [Danio rerio]
Length = 330
Score = 243 bits (621), Expect = 8e-62, Method: Compositional matrix adjust.
Identities = 133/313 (42%), Positives = 187/313 (59%), Gaps = 12/313 (3%)
Query: 43 KLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT----SYWLGLNEFAD 98
L + +E W KH K Y C +E++ R E+++ NL+ I N E + SY L +N AD
Sbjct: 22 NLDQHWELWKKKHVKLYSCEDEEVGRRELWERNLELIAIHNLEASMGMHSYDLAINHMAD 81
Query: 99 MSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCW 158
M+ EE + L + P ++P+AE+ +P ++DWR KG VT VKNQG+CGSCW
Sbjct: 82 MTTEEIL-QTLAVTRVPPGFKRPTAEYVSSSFAVVPDTLDWRDKGYVTSVKNQGACGSCW 140
Query: 159 AFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNN-GCNGGLMDYAFKYIVASGGLHKE 217
AFS+V A+EG +G L LS Q L+DC + + N GCNGG M AF+Y++ +GG+ E
Sbjct: 141 AFSSVGALEGQLMKTTGKLVDLSPQNLVDCSSKYGNLGCNGGYMSQAFQYVIDNGGIDSE 200
Query: 218 EDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAIEASGTDFQ 276
YPY +G+C + + Y+ V + DEQ+L +ALA+ PVSVAI+A+ F
Sbjct: 201 SSYPYQGTQGSCRYDPSQ-RAANCTSYKFVSQGDEQALKEALANIGPVSVAIDATRPQFI 259
Query: 277 FYSGGVFTGP-CGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPE 335
FY GV+ P C +++HGV AVGYG G DY +VKNSWG +G+ GYIR+ RN
Sbjct: 260 FYRSGVYDDPSCTQKVNHGVLAVGYGTLSGQDYWLVKNSWGAGFGDGGYIRIARNKNN-- 317
Query: 336 GLCGINKMASIPL 348
+CGI A P+
Sbjct: 318 -MCGIASEACYPI 329
>gi|195124431|ref|XP_002006696.1| GI21205 [Drosophila mojavensis]
gi|193911764|gb|EDW10631.1| GI21205 [Drosophila mojavensis]
Length = 339
Score = 243 bits (621), Expect = 8e-62, Method: Compositional matrix adjust.
Identities = 143/324 (44%), Positives = 197/324 (60%), Gaps = 23/324 (7%)
Query: 42 DKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNK-----EVTSYWLGLNEF 96
D + E + ++ +H KTY+ E+ R +IF EN I + N+ EVT + + +N++
Sbjct: 21 DVIKEEWHTFKLEHRKTYQDETEERFRLKIFNENKHKIAKHNQRYATGEVT-FKMAVNKY 79
Query: 97 ADMSHEEFKNKYLGLKPQFPTRRQPSAE-------FSYRDVKALPKSVDWRKKGAVTPVK 149
ADM H EF+ G + S S VK LPKSVDWR+KGAVT VK
Sbjct: 80 ADMLHHEFRETMNGFNYTLHKELRASDPSFTGITFISPAHVK-LPKSVDWREKGAVTAVK 138
Query: 150 NQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYI 208
+QG CGSCWAFS+ A+EG + +G L SLSEQ L+DC + NNGCNGGLMD AF+YI
Sbjct: 139 DQGHCGSCWAFSSTGALEGQHFRKTGTLVSLSEQNLVDCSAKYGNNGCNGGLMDNAFRYI 198
Query: 209 VASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVA 267
+GG+ E+ YPY + +C K+ + T G+ D+P+ +E+ + +A+A PVSVA
Sbjct: 199 KDNGGIDTEKSYPYEGIDDSCHFNKDSVG-ATDRGFADIPQGNEKKMAEAVATIGPVSVA 257
Query: 268 IEASGTDFQFYSGGVFTGP-CGAE-LDHGVAAVGYGKSK-GSDYIIVKNSWGPKWGERGY 324
I+AS FQFYS G++ P C ++ LDHGV VGYG + G DY +VKNSWG WG++G+
Sbjct: 258 IDASHESFQFYSEGIYNEPECNSQNLDHGVLVVGYGTDESGKDYWLVKNSWGTTWGDKGF 317
Query: 325 IRMKRNTGKPEGLCGINKMASIPL 348
I+M RN + CGI +S PL
Sbjct: 318 IKMARN---EDNQCGIASASSYPL 338
>gi|403333364|gb|EJY65772.1| Cathepsin L [Oxytricha trifallax]
Length = 338
Score = 243 bits (621), Expect = 8e-62, Method: Compositional matrix adjust.
Identities = 143/330 (43%), Positives = 192/330 (58%), Gaps = 25/330 (7%)
Query: 31 VGYSP---EHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRN-KEV 86
V Y+P + T + F ++++K+GK+Y EE R ++FK+NL + N +
Sbjct: 23 VSYNPSATQLYTPITAEDHAFTNFVAKYGKSYGTKEEYDFRSKLFKQNLAKVSMNNARND 82
Query: 87 TSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKAL--PKS--VDWRKK 142
+Y LGLN+FAD + E+K + LG Q + P R++K L PK+ V+W ++
Sbjct: 83 VTYRLGLNKFADYTEAEYK-RLLGFGGQ--KNKNP------RNIKVLGAPKNDGVNWVEQ 133
Query: 143 GAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTS-FNNGCNGGLM 201
GAVTPVK+QG CGSCW+FS A+EG +I G L SLSEQ+L+DC + N GC GG M
Sbjct: 134 GAVTPVKDQGQCGSCWSFSATGAMEGHAKIQFGTLYSLSEQQLVDCSQAEGNEGCGGGWM 193
Query: 202 DYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH 261
D AF+Y+ + L E+ YPY + TC + VV + + DV N+ L AL
Sbjct: 194 DQAFQYVEQT-ALETEDQYPYEAVDDTC--RASSAGVVKVDSFVDVTPNNVNELKAALDK 250
Query: 262 QPVSVAIEASGTDFQFYSGGVFT-GPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWG 320
PVSVAIEA FQFYSGGV CG LDHGV AVGYG G DY +VKNSWG WG
Sbjct: 251 GPVSVAIEADQMVFQFYSGGVINDASCGTTLDHGVLAVGYGNESGQDYFLVKNSWGASWG 310
Query: 321 ERGYIRMKRNTGKPEGLCGINKMASIPLKK 350
E GY+++ P+ +CGI AS P+ K
Sbjct: 311 EEGYVKI---AASPDNICGILSQASYPIMK 337
>gi|328872971|gb|EGG21338.1| cysteine proteinase 5 precursor [Dictyostelium fasciculatum]
Length = 358
Score = 243 bits (621), Expect = 8e-62, Method: Compositional matrix adjust.
Identities = 136/333 (40%), Positives = 188/333 (56%), Gaps = 37/333 (11%)
Query: 48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNK 107
F +WM KH ++Y E R+ ++K+N+ ++++ N + + LGLN ADM+++E++
Sbjct: 30 FTNWMQKHSRSYASHEFNT-RYSVYKKNMDYVNEWNSKGSETVLGLNSLADMTNQEYQAI 88
Query: 108 YLGLKPQFPTRRQPSAEFS-YRDVK-ALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAA 165
YLG K R ++ + + V+ ALP S+DW +GAVT VKNQG CGSCW+FS +
Sbjct: 89 YLGTKTDATARLAAASASASFGKVQGALPASIDWVAQGAVTQVKNQGQCGSCWSFSATGS 148
Query: 166 VEGINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKEEDYPYLM 224
EG +QI + NL +LSEQ LIDC +S+ N+GCNGGLMD AFKYI+A+GG+ E YPY+
Sbjct: 149 TEGAHQISTSNLVALSEQNLIDCSSSYGNDGCNGGLMDNAFKYIIANGGIDTEASYPYVA 208
Query: 225 EEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFT 284
+ C+ T+S Y DV E +L PVSVAI+AS FQ Y GV+
Sbjct: 209 KVQKCKYNPAN-SGATLSSYVDVTSGSESALQSQTVKGPVSVAIDASHQSFQLYDSGVYY 267
Query: 285 GPC--GAELDHGVAAVGYGK---------------------------SKGSDYIIVKNSW 315
P LDHGV VGYG ++G+ + VKNSW
Sbjct: 268 EPACSSTNLDHGVLVVGYGTASANGSSDSDSSAASQSSSSESSDDQATQGAQFWKVKNSW 327
Query: 316 GPKWGERGYIRMKRNTGKPEGLCGINKMASIPL 348
GP+WG GYI+M RN + CGI AS P+
Sbjct: 328 GPEWGLSGYIQMARNR---DNNCGIATTASQPI 357
>gi|67678376|gb|AAH96862.1| Cathepsin S, b.2 [Danio rerio]
Length = 330
Score = 243 bits (620), Expect = 9e-62, Method: Compositional matrix adjust.
Identities = 132/308 (42%), Positives = 185/308 (60%), Gaps = 12/308 (3%)
Query: 48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT----SYWLGLNEFADMSHEE 103
+E W KH K Y C +E++ R E+++ NL+ I N E + SY L +N ADM+ EE
Sbjct: 27 WELWKKKHVKLYSCEDEEVGRRELWERNLELIAIHNLEASMGMHSYDLAINHMADMTTEE 86
Query: 104 FKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTV 163
+ L + P ++P+AE+ +P ++DWR KG VT VKNQG+CGSCWAFS+V
Sbjct: 87 IL-QTLAVTRVPPGFKRPTAEYVSSSFAVVPDTLDWRDKGYVTSVKNQGACGSCWAFSSV 145
Query: 164 AAVEGINQIVSGNLTSLSEQELIDCDTSFNN-GCNGGLMDYAFKYIVASGGLHKEEDYPY 222
A+EG +G L LS Q L+DC + + N GCNGG M AF+Y++ +GG+ E YPY
Sbjct: 146 GALEGQLMKTTGKLVDLSPQNLVDCSSKYGNLGCNGGYMSQAFQYVIDNGGIDSESSYPY 205
Query: 223 LMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAIEASGTDFQFYSGG 281
+G+C + + Y+ V + DEQ+L +ALA+ PVSVAI+A+ F FY G
Sbjct: 206 QGTQGSCRYDPSQ-RAANCTSYKFVSQGDEQALKEALANIGPVSVAIDATRPQFIFYRSG 264
Query: 282 VFTGP-CGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGI 340
V+ P C +++HGV AVGYG G DY +VKNSWG +G+ GYIR+ RN +CGI
Sbjct: 265 VYDDPSCTQKVNHGVLAVGYGTLSGQDYWLVKNSWGAGFGDGGYIRIARNKNN---MCGI 321
Query: 341 NKMASIPL 348
A P+
Sbjct: 322 ASEACYPI 329
>gi|322799749|gb|EFZ20954.1| hypothetical protein SINV_06041 [Solenopsis invicta]
Length = 337
Score = 243 bits (620), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 146/307 (47%), Positives = 182/307 (59%), Gaps = 19/307 (6%)
Query: 55 HGKTYKCIEEKLHRFEIFKENLKHIDQRNK-----EVTSYWLGLNEFADMSHEEFKNKYL 109
H K YK E+ +R +I+ +N + I + N+ EVT Y LG+N++ DM H EF N
Sbjct: 36 HNKVYKSPVEEGYRMKIYMDNKRKIAEHNRKYELNEVT-YKLGMNKYGDMLHHEFVNTLN 94
Query: 110 GLKPQFPT--RRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVE 167
G + S +VK LP VDW K+GAVT VK+QG CGSCWAFS+ A+E
Sbjct: 95 GFNKSVTAGIETEGVTFISPANVK-LPDEVDWTKQGAVTAVKDQGHCGSCWAFSSTGALE 153
Query: 168 GINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEE 226
G + +G L SLSEQ LIDC + NNGCNGGLMDYAF+YI + GL E+ YPY E
Sbjct: 154 GQHFRSTGYLVSLSEQNLIDCSGKYGNNGCNGGLMDYAFQYIKDNKGLDTEKTYPYEAEN 213
Query: 227 GTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAIEASGTDFQFYSGGVFTG 285
C T GY D+P+ DE+ L A+A P+SVAI+AS FQ YS GV+
Sbjct: 214 DRCRYNPRN-SGATDKGYVDIPQGDEEKLKAAVATIGPISVAIDASHESFQLYSEGVYYD 272
Query: 286 P-CGAE-LDHGVAAVGYG--KSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGIN 341
P C AE LDHGV VGYG ++ G DY +VKNSWG WG++GYI+M RN CGI
Sbjct: 273 PDCSAENLDHGVLIVGYGTDETSGHDYWLVKNSWGKTWGQKGYIKMARNKNNH---CGIA 329
Query: 342 KMASIPL 348
AS PL
Sbjct: 330 SSASYPL 336
>gi|307192137|gb|EFN75465.1| Cathepsin L [Harpegnathos saltator]
Length = 339
Score = 243 bits (620), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 145/329 (44%), Positives = 187/329 (56%), Gaps = 18/329 (5%)
Query: 34 SPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEV----TSY 89
+ + ++ + + E + ++ H K Y E+ R +IF EN I N++ SY
Sbjct: 14 AAQAISFFNLVTEEWNTFKVTHRKAYDSKIEESFRMKIFMENWHKIALHNQKYELNEVSY 73
Query: 90 WLGLNEFADMSHEEFKNKYLG----LKPQFPTRRQP-SAEFSYRDVKALPKSVDWRKKGA 144
LG+N++ DM H EF N G + Q +R+P + F +P SVDWR GA
Sbjct: 74 KLGMNKYGDMLHHEFINTLNGFNKSVSAQLRAQRRPIGSRFIEPANVEIPSSVDWRTHGA 133
Query: 145 VTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDY 203
VTP+K+QG CGSCW+FS A+EG + ++G L SLSEQ LIDC + NNGCNGGLMD
Sbjct: 134 VTPIKDQGHCGSCWSFSATGALEGQHYRITGKLVSLSEQNLIDCSGRYGNNGCNGGLMDQ 193
Query: 204 AFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-Q 262
AF+YI + GL E YPY E C T SGY D+PE +E+ L A+A
Sbjct: 194 AFQYIKDNHGLDTEISYPYEAENDKCRYNPRN-NGATDSGYVDIPEGNEKKLKAAVATIG 252
Query: 263 PVSVAIEASGTDFQFYSGGVFTGP-CGAE-LDHGVAAVGYGKSKG-SDYIIVKNSWGPKW 319
PVSVAI+AS FQFY GV+ P C +E LDHGV VGYG DY +VKNSWG W
Sbjct: 253 PVSVAIDASAESFQFYREGVYYEPRCSSENLDHGVLVVGYGTDDNDQDYWLVKNSWGVTW 312
Query: 320 GERGYIRMKRNTGKPEGLCGINKMASIPL 348
G+ GYI+M RN + CGI AS PL
Sbjct: 313 GDEGYIKMARNK---DNHCGIASSASYPL 338
>gi|52076128|dbj|BAD46641.1| putative cysteine proteinase [Oryza sativa Japonica Group]
gi|52076135|dbj|BAD46648.1| putative cysteine proteinase [Oryza sativa Japonica Group]
Length = 374
Score = 243 bits (620), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 146/336 (43%), Positives = 197/336 (58%), Gaps = 31/336 (9%)
Query: 38 LTSMDKLIELFESWMSKHG---KTYKCIEEKLHRFEIFKENLKHI-DQRNKEVTSYWLGL 93
L S + + L++ W +G + + + +K RFE+FK+N ++I D K+ SY LGL
Sbjct: 33 LESEESMWSLYQRWRHVYGAASSSPRDLADKGSRFEVFKKNARYIHDFNRKKGMSYKLGL 92
Query: 94 NEFADMSHEEFKNKYLGLKP------QFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTP 147
N+FAD++ EEF KY G P + T P A + P + DWR+ GAVT
Sbjct: 93 NKFADLTLEEFTAKYTGANPGPITGLKNGTGSPPLAAVA----GDAPPAWDWREHGAVTR 148
Query: 148 VKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKY 207
VK+QG CGSCWAFS V AVEGIN I++GNL +LSEQ+++DC + + C+GG YAF Y
Sbjct: 149 VKDQGPCGSCWAFSVVEAVEGINAIMTGNLLTLSEQQVLDCSGAGD--CSGGYTSYAFDY 206
Query: 208 IVASGGLHKE--------EDY----PYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSL 255
V++G + E+Y Y + C + +V I Y V NDE++L
Sbjct: 207 AVSNGITLDQCFSPPTTGENYFYYPAYEAVQEPCRFDPNKAPIVKIDSYSFVDPNDEEAL 266
Query: 256 LKALAHQ-PVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSK-GSDYIIVKN 313
+A+ Q PVSV IEAS +F Y GGVF+GPCG EL+H V VGY +++ G+ Y IVKN
Sbjct: 267 KQAVYSQGPVSVLIEAS-YEFMIYQGGVFSGPCGTELNHAVLVVGYDETEDGTPYWIVKN 325
Query: 314 SWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
SWG WGE GYIRM RN PEG+CGI P+K
Sbjct: 326 SWGAGWGESGYIRMIRNIPAPEGICGIAMYPIYPIK 361
>gi|194757786|ref|XP_001961143.1| GF13722 [Drosophila ananassae]
gi|190622441|gb|EDV37965.1| GF13722 [Drosophila ananassae]
Length = 417
Score = 243 bits (620), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 143/311 (45%), Positives = 188/311 (60%), Gaps = 21/311 (6%)
Query: 54 KHGKTYKCIEEKLHRFEIFKENLKHIDQRNK----EVTSYWLGLNEFADMSHEEFKNKYL 109
+H K Y E+ R +IF EN I + N+ SY L +N++ADM H EF+
Sbjct: 111 EHRKNYLDETEERFRLKIFNENKHKIAKHNQLWASGKVSYKLAVNKYADMLHHEFRQLMN 170
Query: 110 GLKPQFPTRRQPSAEFSYRDVK-------ALPKSVDWRKKGAVTPVKNQGSCGSCWAFST 162
G + + E S++ V LPKSVDWR KGAVT VK+QG CGSCWAFS+
Sbjct: 171 GFNYTLHKELRAADE-SFKGVTFISPEHVTLPKSVDWRDKGAVTGVKDQGHCGSCWAFSS 229
Query: 163 VAAVEGINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKEEDYP 221
A+EG + SG L SLSEQ L+DC T + NNGCNGGLMD AF+YI +GG+ E+ YP
Sbjct: 230 TGALEGQHYRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGGIDTEKSYP 289
Query: 222 YLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAIEASGTDFQFYSG 280
Y + +C K + T G+ D+P+ +E+ L +A+A PVSVAI+AS FQFYS
Sbjct: 290 YEALDDSCHFNKGTIG-ATDRGFVDIPQGNEKKLAEAVATIGPVSVAIDASHESFQFYSE 348
Query: 281 GVFTGP-CGAE-LDHGVAAVGYGKSK-GSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGL 337
GV+ P C A+ LDHGV VG+G + G DY +VKNSWG WG++G+I+M RN +
Sbjct: 349 GVYVEPACDAQNLDHGVLVVGFGTDESGQDYWLVKNSWGTTWGDKGFIKMLRN---KDNQ 405
Query: 338 CGINKMASIPL 348
CGI +S PL
Sbjct: 406 CGIASASSYPL 416
>gi|169659203|dbj|BAG12786.1| putative cysteine protease [Sorogena stoianovitchae]
Length = 293
Score = 243 bits (619), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 139/295 (47%), Positives = 186/295 (63%), Gaps = 15/295 (5%)
Query: 54 KHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKP 113
++ KTY E+K HR +F E+++ ++ N + SY LGLN+FAD++ EEF + YLGL
Sbjct: 12 EYNKTYGGAEDK-HRLALFAESVRIVETENAKGHSYTLGLNQFADLTTEEFSSLYLGLV- 69
Query: 114 QFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIV 173
+ Q S +D + ++VDWR+KGAVTPVK+Q SCGSCWAFS A+EG
Sbjct: 70 -LENKVQASESVVLQDGDS-EENVDWRQKGAVTPVKDQKSCGSCWAFSATGAMEGALVKS 127
Query: 174 SGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKK 233
+G L +LSEQ+L+DC T NGCNGGLM AF Y++ G E+DYPY +G C+
Sbjct: 128 TGKLINLSEQQLVDCVTKC-NGCNGGLMTAAFDYVLGR-GRATEKDYPYKGVDGRCKQTA 185
Query: 234 EEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDH 293
+ + I GY +VP+N+ ++L A+A P+SVA+ A+GT Q Y GV CG LDH
Sbjct: 186 TDNK---IKGYNNVPQNNYKALKAAVA-SPLSVAVNAAGT-IQRYKSGVIDANCGTRLDH 240
Query: 294 GVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNT-GKPEGLCGINKMASIP 347
GV AVGY +G DY IVKNSWG +GE GY R+K T G+CGIN MA+ P
Sbjct: 241 GVLAVGY---QGEDYWIVKNSWGNGYGENGYFRVKMGTQNGGAGVCGINMMAAQP 292
>gi|213512938|ref|NP_001133871.1| Cathepsin K precursor [Salmo salar]
gi|209155648|gb|ACI34056.1| Cathepsin K precursor [Salmo salar]
gi|223647252|gb|ACN10384.1| Cathepsin K precursor [Salmo salar]
gi|223673129|gb|ACN12746.1| Cathepsin K precursor [Salmo salar]
Length = 331
Score = 243 bits (619), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 146/348 (41%), Positives = 204/348 (58%), Gaps = 27/348 (7%)
Query: 10 LLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRF 69
+LL + LF S LAH P + S+D ++SW + H + Y + E++ R
Sbjct: 1 MLLCGCVLLFLGSVLAH--------PLNEMSLDAQ---WDSWKTTHLREYNGLGEEVIRR 49
Query: 70 EIFKENLKHIDQRNKE----VTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEF 125
I+++N++ I+ N+E + SY LG+N DM+ EE K GL Q P R S +
Sbjct: 50 TIWEKNMRLIEAHNEEAALGIHSYELGMNHLGDMTSEEIAEKLTGL--QVPMNRDRSNTW 107
Query: 126 -SYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQE 184
+V +P+S+D+RKKG VTPVKNQ SCGSCWAFS+ A+EG +G L LS Q
Sbjct: 108 IPDNNVVKIPRSIDYRKKGMVTPVKNQLSCGSCWAFSSAGALEGQLAKTTGKLIDLSPQN 167
Query: 185 LIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGY 244
L+DC T NNGC GG M AF+Y+ +GG+ EE YPYL ++G C M G+
Sbjct: 168 LVDCVTE-NNGCGGGYMTNAFEYVEENGGIDTEEAYPYLGQDGQCAYNASGMG-AQCRGF 225
Query: 245 QDVPENDEQSLLKALAH-QPVSVAIEASGTDFQFYSGGVFTGP-CGA-ELDHGVAAVGYG 301
+++PE DE +L KA+ PV+V I+A+ + FQFY GV+ P C +++H V AVGYG
Sbjct: 226 KEIPEGDEWALTKAVVKVGPVAVGIDATLSTFQFYQRGVYYDPNCNKDDINHAVLAVGYG 285
Query: 302 KS-KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPL 348
++ KG + IVKNSW WG++GYI M RN G CGI +AS P+
Sbjct: 286 QTAKGMKFWIVKNSWSESWGKQGYIMMARNRGNA---CGIANLASYPI 330
>gi|432910512|ref|XP_004078392.1| PREDICTED: cathepsin K-like [Oryzias latipes]
Length = 331
Score = 243 bits (619), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 144/326 (44%), Positives = 190/326 (58%), Gaps = 18/326 (5%)
Query: 34 SPEHLTSMDK--LIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT---- 87
S ++ MD+ L +E W H K Y +EE+ R I+++NL+ I+ N+E
Sbjct: 12 SASVMSQMDETTLDAHWEEWKMTHTKEYITVEEEGIRRAIWEKNLRMIEAHNQEAALGMH 71
Query: 88 SYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYR-DVKALPKSVDWRKKGAVT 146
+Y LG+N+F DM+ EE + GL Q P +P + LPKSVD+RKKG VT
Sbjct: 72 TYTLGMNQFGDMTQEEVVERMTGL--QMPLNPEPRVPMETDGSLIKLPKSVDYRKKGMVT 129
Query: 147 PVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFK 206
VKNQGSCGSCWAFS+V A+EG +GNL LS Q L+DC T N+GC GG M AFK
Sbjct: 130 SVKNQGSCGSCWAFSSVGALEGQLAKKTGNLVDLSPQNLVDCVTE-NDGCGGGYMTNAFK 188
Query: 207 YIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQ-PVS 265
Y+ +GG+ E YPY+ E+ C + I GY++VPE DE +L AL PVS
Sbjct: 189 YVQENGGIDSEAAYPYMGEDQPCRYNVSGL-AAQIKGYKEVPEGDEHALAVALFKAGPVS 247
Query: 266 VAIEASGTDFQFYSGGV-FTGPCGAE-LDHGVAAVGYG-KSKGSDYIIVKNSWGPKWGER 322
V I+AS F +Y G+ F C E ++H V AVGYG +KG + IVKNSWG WG +
Sbjct: 248 VGIDASQNSFLYYQKGIYFDRNCNKEDINHAVLAVGYGVNAKGKKFWIVKNSWGETWGNK 307
Query: 323 GYIRMKRNTGKPEGLCGINKMASIPL 348
GY+ M RN G +CGI +AS P+
Sbjct: 308 GYVLMARNRGN---VCGIANLASYPV 330
>gi|194352770|emb|CAQ00113.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
Length = 310
Score = 243 bits (619), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 136/309 (44%), Positives = 192/309 (62%), Gaps = 18/309 (5%)
Query: 55 HGKTYKCIEEKLHRFEIFKENLKHIDQRNKEV-TSYWLGLNEFADMSHEEFKNKYLG--L 111
GK+Y ++E+L RFE+++ N++ I+ N++ Y LG N+F D++ EEF +Y G
Sbjct: 2 RGKSYPAVDEELRRFEVYRRNVERIEATNRDGGRGYTLGENQFTDLTSEEFLARYTGRFA 61
Query: 112 KPQ--------FPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTV 163
P+ TR E ++ A+P+SVDWR KGAVTPV+NQG C + AF+ +
Sbjct: 62 PPEMTHNGGMLITTRAGDVVEAHRGNLSAVPESVDWRAKGAVTPVRNQGGCEASVAFAAL 121
Query: 164 AAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCN-GGLMDYAFKYIVASGGLHKEEDYPY 222
AAVEG+ QI +G L S+S QEL+DCD S + CN GG A YI +GG+ DYPY
Sbjct: 122 AAVEGLYQIKTGKLVSMSVQELVDCD-SLSTHCNPGGTPAAALSYIQRNGGIAAAADYPY 180
Query: 223 LMEEGTCEDKKEEMEVVTISGYQDVPEND--EQSLLKALAHQPVSVAIEASGTDFQFYSG 280
+EG C + + V++ GY+ +P N+ EQ LL+A+A QPV+VA++AS +FQ Y
Sbjct: 181 TAQEGVC-NTDVPLVAVSLRGYRKLPYNEQSEQKLLEAVAQQPVAVAVDASSFEFQTYKD 239
Query: 281 GVFTGPCGAELDHGVAAVGYGK--SKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLC 338
GVF+GPCG +++H VA VGYGK + G Y I+KNS+G WG GY+ M+R P GLC
Sbjct: 240 GVFSGPCGFQVNHYVAIVGYGKDAATGKKYWIIKNSFGQSWGMDGYMLMERGIVDPRGLC 299
Query: 339 GINKMASIP 347
IN + P
Sbjct: 300 SINSYPAYP 308
>gi|356564325|ref|XP_003550405.1| PREDICTED: cysteine proteinase 15A [Glycine max]
Length = 370
Score = 243 bits (619), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 138/316 (43%), Positives = 189/316 (59%), Gaps = 29/316 (9%)
Query: 48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNK 107
F S+ +K KTY EE HRF +FK NL+ K S G+ +F+D++ EF+ +
Sbjct: 56 FASFKAKFAKTYATKEEHDHRFGVFKSNLRRARLHAKLDPSAVHGVTKFSDLTPAEFRRQ 115
Query: 108 YLGLKP-QFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAV 166
+LGLKP +FP Q + +D LPK DWR KGAVT VK+QG+CGSCW+FST A+
Sbjct: 116 FLGLKPLRFPAHAQKAPILPTKD---LPKDFDWRDKGAVTNVKDQGACGSCWSFSTTGAL 172
Query: 167 EGINQIVSGNLTSLSEQELIDCD--------TSFNNGCNGGLMDYAFKYIVASGGLHKEE 218
EG + + +G L SLSEQ+L+DCD + ++GCNGGLM+ AF+YI+ SGG+ KE+
Sbjct: 173 EGAHYLATGELVSLSEQQLVDCDHVCDPEEYGACDSGCNGGLMNNAFEYILQSGGVQKEK 232
Query: 219 DYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFY 278
DYPY +GTC+ K ++ T+S Y V ++EQ + + P++VAI A Q Y
Sbjct: 233 DYPYTGRDGTCKFDKTKV-AATVSNYSVVSLDEEQIAANLVKNGPLAVAINA--VFMQTY 289
Query: 279 SGGVFTGP--CGAELDHGVAAVGYGKS-------KGSDYIIVKNSWGPKWGERGYIRMKR 329
GGV + P CG LDHGV VGYG+ K Y I+KNSWG WGE GY ++ R
Sbjct: 290 VGGV-SCPYICGKHLDHGVLLVGYGEGAYAPIRFKNKPYWIIKNSWGESWGENGYYKICR 348
Query: 330 NTGKPEGLCGINKMAS 345
+CG++ M S
Sbjct: 349 G----RNVCGVDSMVS 360
>gi|390347681|ref|XP_801784.2| PREDICTED: cathepsin L1-like isoform 2 [Strongylocentrotus
purpuratus]
Length = 336
Score = 243 bits (619), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 135/306 (44%), Positives = 180/306 (58%), Gaps = 13/306 (4%)
Query: 51 WMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT----SYWLGLNEFADMSHEEFKN 106
W H K+Y +L R +++EN+K I+ N + + + LG+NE+ DM E ++
Sbjct: 35 WKIAHTKSYTNDMHELERRLVWEENVKMINMHNLDHSLHKKGFRLGMNEYGDMRLHEVRS 94
Query: 107 KYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAV 166
G K T+ Q S + +++ +P +VDWR KG VTPVKNQG CGSCWAFST ++
Sbjct: 95 TMNGYKSSNVTKVQGSTFLTPSNIQ-VPDTVDWRTKGYVTPVKNQGQCGSCWAFSTTGSL 153
Query: 167 EGINQIVSGNLTSLSEQELIDCD-TSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLME 225
EG + L SLSEQ L+DC T N GC GGLMD F+Y++ + G+ E+ YPY E
Sbjct: 154 EGQTFKKTSKLVSLSEQNLVDCSRTEGNMGCEGGLMDQGFQYVIDNHGIDSEDCYPYDAE 213
Query: 226 EGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAIEASGTDFQFYSGGVFT 284
+ TC K + ++G+ DV DEQ+L++A+A PVSVAI+AS FQ Y GV+
Sbjct: 214 DETCH-YKASCDSAEVTGFTDVTSGDEQALMEAVASVGPVSVAIDASHQSFQLYESGVYD 272
Query: 285 GP--CGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINK 342
P +ELDHGV VGYG G DY +VKNSWG WG GYI+M RN CGI
Sbjct: 273 EPECSSSELDHGVLVVGYGTDGGKDYWLVKNSWGETWGLSGYIKMSRNKSNQ---CGIAT 329
Query: 343 MASIPL 348
AS PL
Sbjct: 330 SASYPL 335
>gi|217323618|gb|ACK38176.1| midgut cysteine peptidase, partial [Sphenophorus levis]
Length = 324
Score = 243 bits (619), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 136/310 (43%), Positives = 181/310 (58%), Gaps = 21/310 (6%)
Query: 48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKE----VTSYWLGLNEFADMSHEE 103
F+S+ KHGKTYK E+ RF IF+ENL+ I+ N E + SY G+N+FADM+ E
Sbjct: 26 FQSFKLKHGKTYKNQAEETKRFAIFRENLRKIEAHNAEYKQGIHSYTQGINKFADMTRAE 85
Query: 104 FKNKYLGLKPQFPTRRQPSAE--FSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFS 161
FK L Q T+ A F D ++P+S+DWR + VTP+K+Q CGSCW+F+
Sbjct: 86 FKAM---LATQVKTKPSIVATKTFQLADGVSVPESIDWRSRNVVTPIKDQAQCGSCWSFA 142
Query: 162 TVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYP 221
V + EG + +G LT SEQ+L+DC T N GC+GG +D F YI + GL E DYP
Sbjct: 143 VVGSTEGAYALSTGKLTRFSEQQLVDCTTDLNYGCDGGYLDDTFPYI-QTNGLELESDYP 201
Query: 222 YLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQ-PVSVAIEASGTDFQFYSG 280
Y +G+C ++ V +S Y VP N EQ+LL+A+ PV++AI A D QFY
Sbjct: 202 YTGYDGSCSYDSSKV-VTKVSSYVSVPAN-EQALLEAVGTAGPVAIAINAD--DLQFYFS 257
Query: 281 GVFTGP-CGAE-LDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLC 338
G+ C E LDHGV AVGY G DY ++KNSWG WGE GY R R + +C
Sbjct: 258 GIIDDKYCDPEWLDHGVLAVGYNSENGLDYWLIKNSWGADWGESGYFRFLRG----QNIC 313
Query: 339 GINKMASIPL 348
G+ + A PL
Sbjct: 314 GVKEDAVYPL 323
>gi|403368476|gb|EJY84073.1| Cathepsin L [Oxytricha trifallax]
Length = 338
Score = 243 bits (619), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 139/310 (44%), Positives = 185/310 (59%), Gaps = 22/310 (7%)
Query: 48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRN-KEVTSYWLGLNEFADMSHEEFKN 106
F ++++K+GK+Y EE R ++FK+NL + N + +Y LGLN+FAD + E+K
Sbjct: 43 FTNFVAKYGKSYGTKEEYDFRSKLFKQNLAKVSMNNVRNDVTYRLGLNKFADYTEAEYK- 101
Query: 107 KYLGLKPQFPTRRQPSAEFSYRDVKAL--PKS--VDWRKKGAVTPVKNQGSCGSCWAFST 162
+ LG Q + P R++K L PK+ V+W ++GAVTPVK+QG CGSCW+FS
Sbjct: 102 RLLGFGGQ--KNKNP------RNIKVLGAPKNDGVNWVEQGAVTPVKDQGQCGSCWSFSA 153
Query: 163 VAAVEGINQIVSGNLTSLSEQELIDCDTS-FNNGCNGGLMDYAFKYIVASGGLHKEEDYP 221
A+EG +I G L SLSEQ+L+DC + N GC GG MD AF+Y+ + L E+ YP
Sbjct: 154 TGAMEGHAKIQFGTLYSLSEQQLVDCSQAEGNEGCGGGWMDQAFQYVEQT-ALETEDQYP 212
Query: 222 YLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGG 281
Y + TC + VV + + DV N+ L AL PVSVAIEA FQFYSGG
Sbjct: 213 YEAVDDTC--RASSAGVVKVDSFVDVTPNNVNELKAALDKGPVSVAIEADQMVFQFYSGG 270
Query: 282 VFT-GPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGI 340
V CG LDHGV AVGYG G DY +VKNSWG WGE GY+++ P+ +CGI
Sbjct: 271 VINDASCGTTLDHGVLAVGYGNESGQDYFLVKNSWGASWGEEGYVKI---AASPDNICGI 327
Query: 341 NKMASIPLKK 350
AS P+ K
Sbjct: 328 LSQASYPIMK 337
>gi|19698257|dbj|BAB86771.1| cathepsin L-like [Engraulis japonicus]
Length = 324
Score = 242 bits (618), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 141/313 (45%), Positives = 176/313 (56%), Gaps = 15/313 (4%)
Query: 44 LIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNK----EVTSYWLGLNEFADM 99
L + F W +K GK+Y +EE+ HR ++ N + I N+ V SY GLN+F+DM
Sbjct: 18 LDQEFNEWKAKFGKSYPSLEEEAHRKGLWLANHQKIQAHNQLADQGVHSYRQGLNQFSDM 77
Query: 100 SHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWA 159
HEEF+ L R S F +V L SVDWR G V+P+KNQG CGSCW+
Sbjct: 78 DHEEFRQTVLTKMDPPKNNRGASEPFRAPNV-GLAASVDWRTSGCVSPIKNQGQCGSCWS 136
Query: 160 FSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNN-GCNGGLMDYAFKYIVASGGLHKEE 218
FS A+E + G L SLSEQ+L+DC + N GCNGG D+AF+Y+ A+GG+ E
Sbjct: 137 FSATGALESQTCLRRGYLPSLSEQQLVDCSGPYGNYGCNGGWPDHAFQYVQANGGIDSES 196
Query: 219 DYPYLMEEGTCEDKKEEMEVVTISGYQDV-PENDEQSLLKALAH-QPVSVAIEASGTDFQ 276
YPY GTC T SGYQDV P E +L +A+ P+S+AI+ASG +Q
Sbjct: 197 YYPYQARVGTCH-YNSAYSAATCSGYQDVTPVGSESALQYYVANVGPLSIAIDASG--WQ 253
Query: 277 FYSGGVFTGP-CGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPE 335
Y GVF P C DH V VGYG G DY +VKNSWG WGE+GYI M RN
Sbjct: 254 SYQSGVFNDPSCSQTADHAVLLVGYGTYNGQDYWLVKNSWGTWWGEQGYIMMARNANNQ- 312
Query: 336 GLCGINKMASIPL 348
CGI AS PL
Sbjct: 313 --CGIANHASYPL 323
>gi|449513868|ref|XP_002191976.2| PREDICTED: cathepsin L1-like [Taeniopygia guttata]
Length = 443
Score = 242 bits (618), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 142/320 (44%), Positives = 192/320 (60%), Gaps = 19/320 (5%)
Query: 41 MDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT----SYWLGLNEF 96
+D +L++SW H K Y EE R ++++NLK I+ N + SY LG+N+F
Sbjct: 130 LDGHWQLWKSW---HRKDYHEREEGWRRV-VWEKNLKMIEIHNLDHALGKHSYKLGMNQF 185
Query: 97 ADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGS 156
DM+ EEF+ G + R+ ++F + P+SVDWR+KG VTPVK+QG CGS
Sbjct: 186 GDMTTEEFRQLMNGYVHKKSERKYRGSQFLEPNFLEAPRSVDWREKGYVTPVKDQGQCGS 245
Query: 157 CWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDT-SFNNGCNGGLMDYAFKYIVASGGLH 215
CWAFST A+EG + +G L SLSEQ L+DC N GCNGGLMD AF+Y+ +GG+
Sbjct: 246 CWAFSTTGALEGQHFRKTGKLVSLSEQNLVDCSRPEGNQGCNGGLMDQAFQYVQDNGGID 305
Query: 216 KEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAIEASGTD 274
EE YPY ++ K E +G+ D+P+ E++L+KA+A PVSVAI+A +
Sbjct: 306 SEESYPYTAKDDEDCRYKAEYNAANDTGFVDIPQGHERALMKAVAAVGPVSVAIDAGHSS 365
Query: 275 FQFYSGGVFTGP-CGAE-LDHGVAAVGYG----KSKGSDYIIVKNSWGPKWGERGYIRMK 328
FQFY G++ P C +E LDHGV VGYG G Y IVKNSWG KWG++GYI M
Sbjct: 366 FQFYQSGIYYEPDCSSEDLDHGVLVVGYGFEGEDVDGKKYWIVKNSWGEKWGDKGYIYMA 425
Query: 329 RNTGKPEGLCGINKMASIPL 348
++ + CGI AS PL
Sbjct: 426 KDR---KNHCGIATAASYPL 442
>gi|403376023|gb|EJY87990.1| Cathepsin L [Oxytricha trifallax]
Length = 343
Score = 242 bits (617), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 139/309 (44%), Positives = 186/309 (60%), Gaps = 16/309 (5%)
Query: 48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRN-KEVTSYWLGLNEFADMSHEEFKN 106
F ++++K+GK+Y EE R+E +++N+ + Q N + ++ LG+N+F D + EE+K
Sbjct: 43 FANYLAKYGKSYGTKEEFQFRYEQYQKNMAKVAQYNGQNGNTFRLGINKFTDYTPEEYK- 101
Query: 107 KYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAV 166
LG KPQ + + E SY + P S+DWR+KGAVTPVK+QG CGSCWAFS A+
Sbjct: 102 VLLGYKPQ---SKPMTLEASYLSEENTPASIDWREKGAVTPVKDQGQCGSCWAFSATGAL 158
Query: 167 EGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEE 226
EG QI + L S+SEQ+L+DC NNGCNGG M AF Y + + E DY Y ++
Sbjct: 159 EGHYQISNNKLISISEQQLVDCSHDGNNGCNGGEMYLAFDY-ASKNKMELESDYVYHAKD 217
Query: 227 GTC--EDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFT 284
C E K +ME +Q VP+N L ALA+ PVSVAIEA FQ Y GG+
Sbjct: 218 EKCSYEASKGKMEA---DHFQRVPKNSPAQLKAALANGPVSVAIEADNEVFQAYDGGILN 274
Query: 285 GP-CGAELDHGVAAVGYGKSKGS--DYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGIN 341
CG LDHGV AVG+G + S DY IVKNSWG WG+ G+I++ G EG+CGI
Sbjct: 275 SKECGTNLDHGVLAVGFGHDEASKQDYFIVKNSWGQYWGDHGFIKIAAVDG--EGICGIQ 332
Query: 342 KMASIPLKK 350
A P+ K
Sbjct: 333 MDAVYPIVK 341
>gi|19698255|dbj|BAB86770.1| cathepsin L-like [Engraulis japonicus]
Length = 324
Score = 242 bits (617), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 142/313 (45%), Positives = 176/313 (56%), Gaps = 15/313 (4%)
Query: 44 LIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNK----EVTSYWLGLNEFADM 99
L + F W +K GK+Y +E++ HR ++ N + I N+ V SY GLN+F+DM
Sbjct: 18 LDQEFNEWKAKFGKSYPSLEKEAHRKGLWLANHQKIQAHNQLADQGVHSYRQGLNQFSDM 77
Query: 100 SHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWA 159
HEEF+ L R S F +V L SVDWR G V+P+KNQG CGSCW+
Sbjct: 78 DHEEFRQTVLTKMDPPKNNRGASEPFRALNV-GLAASVDWRTSGCVSPIKNQGQCGSCWS 136
Query: 160 FSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNN-GCNGGLMDYAFKYIVASGGLHKEE 218
FS A+E + G L SLSEQ+L+DC S+ N GCNGG D AF+YI A+GG+ E
Sbjct: 137 FSATGALESQTCLRRGYLPSLSEQQLVDCSGSYGNYGCNGGWPDQAFQYIQANGGIDSES 196
Query: 219 DYPYLMEEGTCEDKKEEMEVVTISGYQDV-PENDEQSLLKALAH-QPVSVAIEASGTDFQ 276
YPY GTC T SGYQDV P E +L +A+ P+S+AI+ASG +Q
Sbjct: 197 YYPYQARVGTCH-YNSAYSAATCSGYQDVTPVGSESALQYYVANVGPLSIAIDASG--WQ 253
Query: 277 FYSGGVFTGP-CGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPE 335
Y GVF P C DH V VGYG G DY +VKNSWG WGE+GYI M RN
Sbjct: 254 SYQSGVFNDPSCSQTADHAVLLVGYGTYNGQDYWLVKNSWGTWWGEQGYIMMTRNANNQ- 312
Query: 336 GLCGINKMASIPL 348
CGI AS PL
Sbjct: 313 --CGIANHASYPL 323
>gi|21425246|emb|CAD33266.1| cathepsin L [Aphis gossypii]
Length = 341
Score = 242 bits (617), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 151/356 (42%), Positives = 196/356 (55%), Gaps = 35/356 (9%)
Query: 10 LLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESW---MSKHGKTYKCIEEKL 66
+++ L L FA S+++ +++++IE E W + K Y+ I+E+
Sbjct: 3 VVIVLGLVAFAISTVSS------------INLNEVIE--EEWSLFKIQFKKLYEDIKEET 48
Query: 67 HRFEIFKENLKHIDQRNKEVTS----YWLGLNEFADMSHEEFKNKYLGLKPQFPT----- 117
R +++ +N I NK S Y L +N F D+ E+ G KP
Sbjct: 49 FRKKVYLDNKLKIAGHNKLYESGEETYALEMNHFGDLMQHEYTKMMNGFKPSLAGGDRNF 108
Query: 118 RRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNL 177
+ F + +PKSVDWRKKG VTPVKNQG CGSCW+FS ++EG + +G L
Sbjct: 109 TNDEAVTFLKSENVVIPKSVDWRKKGYVTPVKNQGQCGSCWSFSATGSLEGQHFRKTGVL 168
Query: 178 TSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEM 236
SLSEQ LIDC + NNGC GGLMD AFKYI ++ GL E+ YPY E+ C E
Sbjct: 169 VSLSEQNLIDCSRKYGNNGCEGGLMDLAFKYIKSNKGLDTEKSYPYEAEDDKCRYNPEN- 227
Query: 237 EVVTISGYQDVPENDEQSLLKALAH-QPVSVAIEASGTDFQFYSGGVFTGP--CGAELDH 293
T G+ D+PE DE +L+ ALA PVS+AI+AS FQFY GVF P ELDH
Sbjct: 228 SGATDKGFVDIPEGDEDALMHALATVGPVSIAIDASSEKFQFYKKGVFYNPRCSSTELDH 287
Query: 294 GVAAVGYGKS-KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPL 348
GV AVG+G KG DY IVKNSWG WG+ GYI M RN + CG+ AS PL
Sbjct: 288 GVLAVGFGSDKKGGDYWIVKNSWGKTWGDEGYIMMARN---KKNNCGVASSASYPL 340
>gi|46576373|sp|P83654.1|ERVC_TABDI RecName: Full=Ervatamin-C; Short=ERV-C
gi|46014979|pdb|1O0E|A Chain A, 1.9 Angstrom Crystal Structure Of A Plant Cysteine
Protease Ervatamin C
gi|46014980|pdb|1O0E|B Chain B, 1.9 Angstrom Crystal Structure Of A Plant Cysteine
Protease Ervatamin C
Length = 208
Score = 241 bits (616), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 121/217 (55%), Positives = 155/217 (71%), Gaps = 10/217 (4%)
Query: 133 LPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSF 192
LP+ +DWRKKGAVTPVKNQGSCGSCWAFSTV+ VE INQI +GNL SLSEQEL+DCD
Sbjct: 1 LPEQIDWRKKGAVTPVKNQGSCGSCWAFSTVSTVESINQIRTGNLISLSEQELVDCDKK- 59
Query: 193 NNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDE 252
N+GC GG +A++YI+ +GG+ + +YPY +G C+ +VV+I GY VP +E
Sbjct: 60 NHGCLGGAFVFAYQYIINNGGIDTQANYPYKAVQGPCQAAS---KVVSIDGYNGVPFCNE 116
Query: 253 QSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVK 312
+L +A+A QP +VAI+AS FQ YS G+F+GPCG +L+HGV VGY ++Y IV+
Sbjct: 117 XALKQAVAVQPSTVAIDASSAQFQQYSSGIFSGPCGTKLNHGVTIVGY----QANYWIVR 172
Query: 313 NSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
NSWG WGE+GYIRM R G GLCGI ++ P K
Sbjct: 173 NSWGRYWGEKGYIRMLRVGGC--GLCGIARLPYYPTK 207
>gi|443708542|gb|ELU03619.1| hypothetical protein CAPTEDRAFT_17807 [Capitella teleta]
Length = 350
Score = 241 bits (616), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 139/314 (44%), Positives = 187/314 (59%), Gaps = 16/314 (5%)
Query: 46 ELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRN----KEVTSYWLGLNEFADMSH 101
+L++ + + H +TY EE R E+F+ NLK I N + + Y +G+N+FADM
Sbjct: 41 KLWQDFKTVHERTYGETEES-QRKEVFRNNLKKIQAHNHLHEQGKSPYRMGINQFADMEA 99
Query: 102 EEFKNKYLGLKPQFPTRRQPSAEFSYRDVK---ALPKSVDWRKKGAVTPVKNQGSCGSCW 158
EF + G + T + +Y ++P VDWRK+G VTPVKNQG CGSCW
Sbjct: 100 NEFASIMNGFRMNNRTEVRDHLHANYISPAIPVSVPAEVDWRKEGYVTPVKNQGQCGSCW 159
Query: 159 AFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKE 217
AFST ++EG + +G L SLSEQ L+DC TS+ N GCNGG++DYAF+YI + G E
Sbjct: 160 AFSTTGSLEGQHFRKTGKLVSLSEQNLVDCSTSYGNEGCNGGIVDYAFQYIKDNDGDDTE 219
Query: 218 EDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAIEASGTDFQ 276
YPY +GTC K + T +GY D+P+ DE + +A+A PVSVAI+AS + FQ
Sbjct: 220 ACYPYEAVDGTCRFKSVCVG-ATCTGYTDLPKGDEAKMKEAVALVGPVSVAIDASHSSFQ 278
Query: 277 FYSGGVFT-GPCGA-ELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKP 334
Y G++ C +LDH V VGYG +G DY +VKNSWG WG+ GYI+M RN
Sbjct: 279 MYQSGIYVEQECSPKQLDHAVLVVGYGTEQGQDYWLVKNSWGTTWGDEGYIKMARNM--- 335
Query: 335 EGLCGINKMASIPL 348
+ CGI AS PL
Sbjct: 336 DNQCGIASQASYPL 349
>gi|21483184|gb|AAF86584.1| cathepsin L cysteine protease [Haemonchus contortus]
Length = 355
Score = 241 bits (616), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 144/336 (42%), Positives = 202/336 (60%), Gaps = 19/336 (5%)
Query: 26 HDFSIVGYSPEHLTS-MDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNK 84
HD + + + L +D+ ++ + GK+Y+ EE + E F +N+ HI++ NK
Sbjct: 25 HDHGVRVHRQKSLRQKIDEAFNKWDDYKETFGKSYEPEEENDY-MEAFVKNVIHIEEHNK 83
Query: 85 E----VTSYWLGLNEFADMSHEEFK--NKYLGLKPQFPTRRQPSA-EFSYRDVKALPKSV 137
E ++ +GLNE AD+ +++ N Y ++ QF Q + +F +P+SV
Sbjct: 84 EHRLGRKTFEMGLNEIADLPFSQYRKLNGYR-MRRQFGDSMQSNGTKFLVPFNVQIPESV 142
Query: 138 DWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSF-NNGC 196
DWR++G VTPVKNQG CGSCWAFS+ A+EG + +G L SLSEQ L+DC T + N+GC
Sbjct: 143 DWREEGLVTPVKNQGMCGSCWAFSSTGALEGQHARATGKLVSLSEQNLVDCSTKYGNHGC 202
Query: 197 NGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLL 256
NGGLMD AF+YI + G+ E+ YPY+ E C K+ + G+ D+PE DE++L
Sbjct: 203 NGGLMDLAFEYIKENHGVDTEDSYPYVGRETKCHFKRNTVG-ADDKGFVDLPEGDEEALK 261
Query: 257 KALAHQ-PVSVAIEASGTDFQFYSGGV-FTGPCGA-ELDHGVAAVGYGKS-KGSDYIIVK 312
KA+A Q P+S+AI+A FQ Y GV F C + ELDHGV VGYG + DY +VK
Sbjct: 262 KAVATQGPISIAIDAGHRSFQLYKKGVYFDEECSSEELDHGVLLVGYGTDPEAGDYWLVK 321
Query: 313 NSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPL 348
NSWGP WGE+GYIR+ RN CG+ AS PL
Sbjct: 322 NSWGPTWGEKGYIRIARNRNNH---CGVATKASYPL 354
>gi|66812702|ref|XP_640530.1| counting factor associated protein [Dictyostelium discoideum AX4]
gi|74897159|sp|Q54TR1.1|CFAD_DICDI RecName: Full=Counting factor associated protein D; Flags:
Precursor
gi|60468561|gb|EAL66564.1| counting factor associated protein [Dictyostelium discoideum AX4]
Length = 531
Score = 241 bits (616), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 124/311 (39%), Positives = 183/311 (58%), Gaps = 14/311 (4%)
Query: 46 ELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFK 105
LF+ + +++ K Y +E RF FK K I N + +SY LG+N +AD+S++EF
Sbjct: 223 NLFKEYKAQYNKEYSSQDEHDERFINFKAARKIIATHNAKESSYKLGMNHYADLSNKEFN 282
Query: 106 NKYLGLKPQFPTRRQPSAEFSYRD--VKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTV 163
+KP+ A+ + D ++++P +VDWR + VTPVK+QG CGSCW F +
Sbjct: 283 TL---VKPKVARPSVTGADSVHDDESLRSIPSTVDWRNQNCVTPVKDQGICGSCWTFGST 339
Query: 164 AAVEGINQIVSGNLTSLSEQELIDCDT-SFNNGCNGGLMDYAFKYIVASGGLHKEEDYPY 222
++EG N + +G L SLSEQ+L+DC + + GC GG AF+Y++ G L E +YPY
Sbjct: 340 GSLEGTNCVTNGELVSLSEQQLVDCAILTGSQGCGGGFASSAFQYVMEIGSLATESNYPY 399
Query: 223 LMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQ-PVSVAIEASGTDFQFYSGG 281
LM+ G C D+ V+I+GY +V E +L A+A PV++AI+AS DF++Y G
Sbjct: 400 LMQNGLCRDRTVTPSGVSITGYVNVTSGSESALQNAIATTGPVAIAIDASVDDFRYYMSG 459
Query: 282 VFTGPCGA----ELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGL 337
V+ P +LDH V A+GYG +G DY +VKNSW WG GY+ M RN L
Sbjct: 460 VYNNPACKNGLDDLDHEVLAIGYGTYQGQDYFLVKNSWSTNWGMDGYVYMARNDNN---L 516
Query: 338 CGINKMASIPL 348
CG++ A+ P+
Sbjct: 517 CGVSSQATYPI 527
>gi|323451555|gb|EGB07432.1| hypothetical protein AURANDRAFT_2413 [Aureococcus anophagefferens]
Length = 263
Score = 241 bits (616), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 128/270 (47%), Positives = 167/270 (61%), Gaps = 11/270 (4%)
Query: 82 RNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSY---RDVKALPKSVD 138
N + ++Y LG NEF+ M +EF +Y+G + + Y + V A+ VD
Sbjct: 1 HNAKNSTYKLGHNEFSGMFWDEFVAQYVGDATGAKAYMERERNYDYTLAKQVDAVASDVD 60
Query: 139 WRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNG 198
W GAVT VKNQG CGSCW+FST A+EG +I LTSLSEQ L+DCDT+ ++GCNG
Sbjct: 61 WVASGAVTGVKNQGQCGSCWSFSTTGALEGAFEIAGNTLTSLSEQNLVDCDTT-DSGCNG 119
Query: 199 GLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKA 258
GLMD AFK+I ++GG+ E DY Y +GTC+ + +V T+SG+ DVP DE +L A
Sbjct: 120 GLMDNAFKWIQSNGGICSEADYAYTAAKGTCKTTCD--KVATLSGHTDVPSGDEDALKTA 177
Query: 259 LAHQPVSVAIEASGTDFQFYSGGVF-TGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGP 317
+A PVS+AIEA + FQ YS G+ + CG LDHGV VGYG GS+Y VKNSWG
Sbjct: 178 VAIGPVSIAIEADKSVFQSYSSGILDSSACGTNLDHGVLVVGYGTDDGSEYWKVKNSWGT 237
Query: 318 KWGERGYIRMKRNTGKPEGLCGINKMASIP 347
WGE GY+R+ R + +CGI S P
Sbjct: 238 TWGESGYVRIARGS----NICGIASEPSYP 263
>gi|1834307|dbj|BAA09820.1| cysteine proteinase [Spirometra erinaceieuropaei]
gi|1834309|dbj|BAA09821.1| cysteine proteinase [Spirometra erinaceieuropaei]
Length = 336
Score = 241 bits (615), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 135/310 (43%), Positives = 184/310 (59%), Gaps = 13/310 (4%)
Query: 46 ELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNK----EVTSYWLGLNEFADMSH 101
EL+++W K Y EE+LHR F NL I + N+ ++ SY + LN+F+D++
Sbjct: 30 ELWKAWKLAFKKEYFSSEEELHRKRAFFNNLDFIIRHNQRYYQQLESYAVRLNDFSDLTP 89
Query: 102 EEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFS 161
EF +YL L+ T+ + S + LP SV+WR++GAVT VKNQG CGSCW+FS
Sbjct: 90 GEFAERYLCLRGIVLTKLRRKEAVSVPLKENLPDSVNWRERGAVTSVKNQGQCGSCWSFS 149
Query: 162 TVAAVEGINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKEEDY 220
A+EG QI +G L SLSEQ+L+DC + N GCNGGLM AF+Y G+ E DY
Sbjct: 150 ANGAIEGAIQIKTGALRSLSEQQLMDCSWDYGNQGCNGGLMPQAFQY-AQRYGVEAEVDY 208
Query: 221 PYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAIEASGTDFQFYS 279
Y +G C ++++ V ++GY ++PE DE L +A+A P+SV I+A+ F YS
Sbjct: 209 RYTERDGVCR-YRQDLVVANVTGYAELPEGDEGGLQRAVATIGPISVGIDAADPGFMSYS 267
Query: 280 GGVFTGPCGA--ELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGL 337
GVF + +DHGV VGYG G Y +VKNSWG WGE GY++M RN +
Sbjct: 268 HGVFVSKTCSPYAIDHGVLVVGYGAENGDAYWLVKNSWGSSWGEDGYLKMARNRNN---M 324
Query: 338 CGINKMASIP 347
CGI MAS P
Sbjct: 325 CGIASMASYP 334
>gi|21489677|gb|AAM55195.1|AF412313_1 cathepsin L cysteine protease [Haemonchus contortus]
gi|21483192|gb|AAL14224.1| cathepsin L [Haemonchus contortus]
Length = 354
Score = 241 bits (615), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 144/336 (42%), Positives = 202/336 (60%), Gaps = 19/336 (5%)
Query: 26 HDFSIVGYSPEHLTS-MDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNK 84
HD + + + L +D+ ++ + GK+Y+ EE + E F +N+ HI++ NK
Sbjct: 24 HDHGVRVHRQKSLRQKIDEAFNKWDDYKETFGKSYEPDEENDY-MEAFVKNVIHIEEHNK 82
Query: 85 E----VTSYWLGLNEFADMSHEEFK--NKYLGLKPQFPTRRQPSA-EFSYRDVKALPKSV 137
E ++ +GLNE AD+ +++ N Y ++ QF Q + +F +P+SV
Sbjct: 83 EHRLGRKTFEMGLNEIADLPFSQYRKLNGYR-MRRQFGDSLQSNGTKFLVPFNVQIPESV 141
Query: 138 DWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSF-NNGC 196
DWR++G VTPVKNQG CGSCWAFS+ A+EG + +G L SLSEQ L+DC T + N+GC
Sbjct: 142 DWREEGLVTPVKNQGMCGSCWAFSSTGALEGQHARATGKLVSLSEQNLVDCSTKYGNHGC 201
Query: 197 NGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLL 256
NGGLMD AF+YI + G+ E+ YPY+ E C K+ + G+ D+PE DE++L
Sbjct: 202 NGGLMDLAFEYIKENHGVDTEDSYPYVGRETKCHFKRNAVG-ADDKGFVDLPEGDEEALK 260
Query: 257 KALAHQ-PVSVAIEASGTDFQFYSGGV-FTGPCGA-ELDHGVAAVGYGKS-KGSDYIIVK 312
KA+A Q P+S+AI+A FQ Y GV F C + ELDHGV VGYG + DY +VK
Sbjct: 261 KAVATQGPISIAIDAGHRSFQLYKKGVYFDEECSSEELDHGVLLVGYGTDPEAGDYWLVK 320
Query: 313 NSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPL 348
NSWGP WGE+GYIR+ RN CG+ AS PL
Sbjct: 321 NSWGPTWGEKGYIRIARNRNNH---CGVATKASYPL 353
>gi|348525618|ref|XP_003450319.1| PREDICTED: cathepsin S-like [Oreochromis niloticus]
Length = 330
Score = 241 bits (615), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 133/309 (43%), Positives = 182/309 (58%), Gaps = 14/309 (4%)
Query: 48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKE----VTSYWLGLNEFADMSHEE 103
+E W S H + Y + E+ R I+++N++ I+ N+E + S+ +G+N DM+ EE
Sbjct: 27 WEEWKSTHRREYNGLGEEGIRRAIWEKNMRMIEAHNEEAALGIHSFEMGMNHLGDMTSEE 86
Query: 104 FKNKYLGLKPQFPTRRQPSAEFSYRDVKA-LPKSVDWRKKGAVTPVKNQGSCGSCWAFST 162
K GL Q P ++ S + D+ + +PKSVD+RKKG VT VKNQG+CGSCWAFS
Sbjct: 87 VVEKMTGL--QIPMNQERSFTLAMDDMPSKIPKSVDYRKKGMVTSVKNQGACGSCWAFSA 144
Query: 163 VAAVEGINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKEEDYP 221
A+EG +G L LS Q L+DC + N+GCNGG M AF+Y++ + G+ + YP
Sbjct: 145 AGALEGQLAKSTGKLVDLSPQNLVDCSGKYGNHGCNGGFMTRAFQYVIDNHGIDSDASYP 204
Query: 222 YLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAIEASGTDFQFYSG 280
Y + C S YQ +PE DE +L +ALA P+SVAI+A F FY
Sbjct: 205 YTGRDEQCR-YNPATRAANCSSYQFLPEGDENALKQALATIGPISVAIDARRPRFSFYRS 263
Query: 281 GVFTGP-CGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCG 339
GV+ P C E++HGV AVGYG G DY +VKNSWG +G++GYIRM RNTG CG
Sbjct: 264 GVYNDPSCTQEVNHGVLAVGYGSLNGQDYWLVKNSWGSTFGDQGYIRMARNTGNQ---CG 320
Query: 340 INKMASIPL 348
I A P+
Sbjct: 321 IALYACYPV 329
>gi|294890024|ref|XP_002773045.1| cathepsin L, putative [Perkinsus marinus ATCC 50983]
gi|239877748|gb|EER04861.1| cathepsin L, putative [Perkinsus marinus ATCC 50983]
Length = 329
Score = 241 bits (615), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 146/317 (46%), Positives = 184/317 (58%), Gaps = 18/317 (5%)
Query: 42 DKLIEL-FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMS 100
++ +EL F + K GK Y+ EE++ R IF+ NL HI+ N + SY LG+NE AD++
Sbjct: 21 EETVELAFMGFQHKFGKNYESKEEEVKRNAIFQANLHHIEHVNAKNLSYKLGVNEHADLT 80
Query: 101 HEEFKNKYLGLKPQFPTRRQPSAEFSYR-DVKALPKSVDWRKKGAVTPVKNQGSCGSCWA 159
HEEF LG + TRR EF D LP SVDWR K +TPVKNQGSCGS WA
Sbjct: 81 HEEFAALKLG-TLKMSTRRDD--EFVVEADTTQLPTSVDWRNKSVLTPVKNQGSCGSSWA 137
Query: 160 FSTVAAVEGINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKEE 218
FST A+ I +G L SLSEQEL+DC + N+GC GG M A++YI GL +E
Sbjct: 138 FSTTGALGAQYAIATGKLLSLSEQELVDCSLKYGNDGCIGGYMGAAYEYI-NQAGLDQES 196
Query: 219 DYPYLMEEGTC----EDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTD 274
YPY + C E K + + V + + EQSL+KALA PVSV + AS +
Sbjct: 197 TYPYKGWDEPCFRSSEKKADGIPVRFV-----LNTKTEQSLMKALADAPVSVGMYASDPN 251
Query: 275 FQFYSGGVFTG-PCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGK 333
F+FY GV++ C E DH V AVGYG KGSDY I+KNSWG KWG GY +KR G
Sbjct: 252 FRFYRSGVYSSTTCNGETDHAVVAVGYGADKGSDYFILKNSWGSKWGIGGYFFLKRGVGG 311
Query: 334 PEGLCGINKMASIPLKK 350
G C I + +P K
Sbjct: 312 -HGECNILEYMLVPTLK 327
>gi|15128493|dbj|BAB62718.1| plerocercoid growth factor/cysteine protease [Spirometra
erinaceieuropaei]
gi|15130639|dbj|BAB62799.1| plerocercoid growth factor-2/cysteine protease [Spirometra
erinaceieuropaei]
Length = 336
Score = 241 bits (614), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 135/310 (43%), Positives = 184/310 (59%), Gaps = 13/310 (4%)
Query: 46 ELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNK----EVTSYWLGLNEFADMSH 101
EL+++W K Y EE+LHR F NL I + N+ ++ SY + LN+F+D++
Sbjct: 30 ELWKAWKLAFKKEYFSSEEELHRKRAFFNNLDFIIRHNQRYYQQLESYAVRLNDFSDLTP 89
Query: 102 EEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFS 161
EF +YL L+ T+ + S + LP SV+WR++GAVT VKNQG CGSCW+FS
Sbjct: 90 GEFAERYLCLRGIVLTKLRRKEAVSVPLKENLPDSVNWRERGAVTSVKNQGQCGSCWSFS 149
Query: 162 TVAAVEGINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKEEDY 220
A+EG QI +G L SLSEQ+L+DC + N GCNGGLM AF+Y G+ E DY
Sbjct: 150 ANGAIEGAIQIKTGALRSLSEQQLMDCSWDYGNQGCNGGLMPQAFQY-AQRYGVEAEVDY 208
Query: 221 PYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAIEASGTDFQFYS 279
Y +G C ++++ V ++GY ++PE DE L +A+A P+SV I+A+ F YS
Sbjct: 209 RYTERDGVCR-YRQDLVVANVTGYAELPEGDEGGLQRAVATIGPISVGIDAADPGFMSYS 267
Query: 280 GGVFTGPCGA--ELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGL 337
GVF + +DHGV VGYG G Y +VKNSWG WGE GY++M RN +
Sbjct: 268 HGVFVSKTCSPYAIDHGVLVVGYGAENGEAYWLVKNSWGSSWGEGGYVKMARNRNN---M 324
Query: 338 CGINKMASIP 347
CGI MAS P
Sbjct: 325 CGIASMASYP 334
>gi|118429523|gb|ABK91809.1| cathepsin L-like proteinase precursor [Clonorchis sinensis]
Length = 373
Score = 241 bits (614), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 138/319 (43%), Positives = 185/319 (57%), Gaps = 22/319 (6%)
Query: 44 LIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEV----TSYWLGLNEFADM 99
L +++ +M+ + + Y E RF+IF N I + N SY +G+NEF+D
Sbjct: 62 LSSIWKHFMTTYKRNYIDPSEHERRFKIFANNFVRISKHNVRFIQGQVSYTMGINEFSDK 121
Query: 100 SHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKS-VDWRKKGAVTPVKNQGSCGSCW 158
+ EE K + + R S Y + A P S +DWR KGAVTPVKNQG+CGSCW
Sbjct: 122 TDEELK-RLRCFRGSLNASRDGS---KYITIAAPPPSEIDWRNKGAVTPVKNQGNCGSCW 177
Query: 159 AFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKE 217
AFS A+EG N + +GNL SLSEQ+L+DC + + NN CNGGLMD AFKY+ S G+ E
Sbjct: 178 AFSATGAIEGQNFLATGNLVSLSEQQLVDCSSEYGNNACNGGLMDNAFKYVKDSNGIDTE 237
Query: 218 EDYPYLMEEG-----TCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQ-PVSVAIEAS 271
YPY+ E TC +E VV ++GY D+P L +A+ H P+SVAI A
Sbjct: 238 ASYPYVSGETGDANPTCRFNLKE-AVVRVTGYIDLPRGQVSELKQAVGHYGPISVAINAG 296
Query: 272 GTDFQFYSGGVFT-GPCGA-ELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKR 329
F Y GV++ C + +LDHGV VGYG+ G Y ++KNSWGP WGE GY+++ R
Sbjct: 297 LPSFMSYKSGVYSDDQCSSDDLDHGVLLVGYGEENGIPYWLIKNSWGPHWGENGYVKILR 356
Query: 330 NTGKPEGLCGINKMASIPL 348
+ LCG+ MAS PL
Sbjct: 357 DHNN---LCGVASMASYPL 372
>gi|312306194|gb|ADQ73946.1| cathepsin L [Paralithodes camtschaticus]
Length = 324
Score = 241 bits (614), Expect = 5e-61, Method: Compositional matrix adjust.
Identities = 134/309 (43%), Positives = 182/309 (58%), Gaps = 15/309 (4%)
Query: 48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT----SYWLGLNEFADMSHEE 103
F + ++G+ Y +E+ +R ++ +N++ I+ N++ T +Y L +N+F DM++EE
Sbjct: 22 FHQFKVQYGRQYATAQEERYRSSVYDQNMEFIEAHNEQYTNGEVTYMLAINQFGDMTNEE 81
Query: 104 FKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTV 163
GL P +R A RD LP VDWR KGAVTPVK+Q +CGSCWAFS
Sbjct: 82 INAVMNGLLPASESR--GVAVLGGRD-DTLPAEVDWRTKGAVTPVKDQKACGSCWAFSAT 138
Query: 164 AAVEGINQIVSGNLTSLSEQELIDCDT-SFNNGCNGGLMDYAFKYIVASGGLHKEEDYPY 222
++EG + + G L SLSEQ L+DC T ++GC GGLMD+AF YI +GG+ E YPY
Sbjct: 139 GSLEGQHFLKDGKLVSLSEQNLVDCSTKQGDHGCGGGLMDFAFTYIKDNGGIDTEASYPY 198
Query: 223 LMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAIEASGTDFQFYSGG 281
+G C+ T++GY DV + E +L KA+A P+SVAI+AS + F FY G
Sbjct: 199 EATDGKCQYNPAN-SGATVTGYVDVEHDSEDALQKAVATIGPISVAIDASRSTFHFYHKG 257
Query: 282 V-FTGPCGA-ELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCG 339
V + C + LDHGV AVGYG G+DY +VKNSW WG G+I M RN CG
Sbjct: 258 VYYDKECSSTSLDHGVLAVGYGTQDGTDYWLVKNSWNITWGNHGFIEMSRNRNNN---CG 314
Query: 340 INKMASIPL 348
I AS PL
Sbjct: 315 IATQASYPL 323
>gi|50657029|emb|CAH04632.1| cathepsin L [Suberites domuncula]
Length = 324
Score = 241 bits (614), Expect = 5e-61, Method: Compositional matrix adjust.
Identities = 145/347 (41%), Positives = 188/347 (54%), Gaps = 31/347 (8%)
Query: 8 KLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLH 67
K+L++ LSL A S A DF PE + W +H K Y E+L
Sbjct: 2 KVLII---LSLVALSVAAFDF------PEEWVA----------WKQEHSKEYTEELEELR 42
Query: 68 RFEIFKENLKHIDQRNK--EVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEF 125
R I++ N K ID N + Y L +NEF D+S EFK Y G Q R + F
Sbjct: 43 RHTIWQSNKKFIDSHNSVSDKFGYTLEMNEFGDLSGVEFKQIYNGYIMQ--ERANDTKLF 100
Query: 126 SYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQEL 185
+ SVDWR+KG V+ VKNQG CGSCW+FS ++EG + + G L SLSEQ L
Sbjct: 101 TASPYMEPAASVDWRQKGVVSEVKNQGQCGSCWSFSATGSLEGQHALKMGRLVSLSEQNL 160
Query: 186 IDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGY 244
+DC + F N+GC GG+MD AF+Y++++ G+ E YPY ++G C + + S Y
Sbjct: 161 MDCSSRFGNHGCKGGIMDDAFRYVISNHGVDTESSYPYTAKDGYCRFNQNNVGATETS-Y 219
Query: 245 QDVPENDEQSLLKALAH-QPVSVAIEASGTDFQFYSGGVFTGP--CGAELDHGVAAVGYG 301
+D+ E SL +A A P+SVAI+AS FQFY GV+ P + LDHGV VGYG
Sbjct: 220 RDIARGSESSLTQASAQIGPISVAIDASHRSFQFYKNGVYYEPSCSSSRLDHGVLVVGYG 279
Query: 302 KSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPL 348
G DY IVKNSWG +WG GYI M RN CGI AS P+
Sbjct: 280 TEGGQDYFIVKNSWGTRWGMDGYIMMSRNR---RNNCGIASQASYPI 323
>gi|33242878|gb|AAQ01143.1| cathepsin [Branchiostoma lanceolatum]
Length = 334
Score = 241 bits (614), Expect = 5e-61, Method: Compositional matrix adjust.
Identities = 142/314 (45%), Positives = 183/314 (58%), Gaps = 17/314 (5%)
Query: 48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT----SYWLGLNEFADMSHEE 103
+E W +HGK Y+ E+ R IF++N I + N + SY L +N+F DM HEE
Sbjct: 24 WEMWKLQHGKQYETEAEEYSRRFIFEKNTVKIAEHNIRASLGMHSYTLAMNKFGDMHHEE 83
Query: 104 FKNKYLGLKPQFPTRRQPSAEFSYRDVKA-LPKSVDWRKKGAVTPVKNQGSCGSCWAFST 162
F + +G + + +E D LPKSVDWR V+ VK+QG CGSCWAFST
Sbjct: 84 FHQRIMGGCLKIVKKPLLGSEVGDNDDNGTLPKSVDWRNSHMVSEVKDQGECGSCWAFST 143
Query: 163 VAAVEGINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKEEDYP 221
++EG + +G L LSEQ+L+DC F N GC GGLMD AF+YI A+GGL EE YP
Sbjct: 144 TGSLEGQHSNKTGKLVDLSEQQLVDCSKDFGNQGCGGGLMDQAFQYIKANGGLDTEESYP 203
Query: 222 YL-MEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAIEASGTDFQFYS 279
Y ++ C+ + T+ GY+DV +E +L +A+A PVSVAI+A FQFYS
Sbjct: 204 YTATDDKPCKFDNSSVG-ATLVGYKDVKSGNEHALKRAVATVGPVSVAIDAGHESFQFYS 262
Query: 280 GGVFTGP-CGAE-LDHGVAAVGYGKSKGSD---YIIVKNSWGPKWGERGYIRMKRNTGKP 334
GV+ P C E LDHGV AVGYG + + IVKNSWGP WG++GYI M RN
Sbjct: 263 SGVYDEPQCSTEQLDHGVLAVGYGAMNDNSHQAFWIVKNSWGPSWGDQGYIMMSRNKNNQ 322
Query: 335 EGLCGINKMASIPL 348
CGI AS PL
Sbjct: 323 ---CGIATSASYPL 333
>gi|410978262|ref|XP_003995514.1| PREDICTED: cathepsin L1-like [Felis catus]
Length = 333
Score = 241 bits (614), Expect = 5e-61, Method: Compositional matrix adjust.
Identities = 144/354 (40%), Positives = 199/354 (56%), Gaps = 34/354 (9%)
Query: 6 HSKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEK 65
H L L +L L I +P+ S+D EL+ W + HGK Y EE
Sbjct: 2 HPSLFLAALCLG------------IASAAPQLNQSLD---ELWSQWKATHGKLYGMDEEG 46
Query: 66 LHRFEIFKENLKHIDQRNKEVT----SYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQP 121
R E++K+N+K I Q N E + S+ + +N F DM++EEFK GL+ Q + +
Sbjct: 47 WRR-EVWKKNMKMIRQHNWEHSQGKHSFTVAMNGFGDMTNEEFKQVMNGLQMQ---KHKK 102
Query: 122 SAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLS 181
F +P SVDWR+KG VTPVK+QG CGSCWAFS A+EG +G L SLS
Sbjct: 103 GKMFQAPLFAKIPSSVDWREKGYVTPVKDQGPCGSCWAFSATGALEGQMFRKTGKLVSLS 162
Query: 182 EQELIDCDTS-FNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVT 240
EQ L+DC + N GCNGGLM+ AF+Y+ +GGL EE YPY ++ +C+ K ++
Sbjct: 163 EQNLVDCSQAEGNEGCNGGLMNNAFQYVKDNGGLDSEESYPYHAQDESCKYKPQD-SAAN 221
Query: 241 ISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGP-CGAE-LDHGVAAV 298
+G+ D+P+ ++ ++ P+SV I+AS FQFY G++ P C +E LDHGV +
Sbjct: 222 DTGFFDIPQQEKALMVAVATKGPISVGIDASHFTFQFYHEGIYYDPDCSSEDLDHGVLVI 281
Query: 299 GYGKSKGSD----YIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPL 348
GYG G Y IVKNSWG WG GYI+M ++ + CGI MAS P+
Sbjct: 282 GYGTEIGQSINKTYWIVKNSWGANWGIDGYIKMAKDR---KNHCGIATMASFPV 332
>gi|308321226|gb|ADO27765.1| cathepsin S [Ictalurus furcatus]
Length = 329
Score = 241 bits (614), Expect = 5e-61, Method: Compositional matrix adjust.
Identities = 132/307 (42%), Positives = 178/307 (57%), Gaps = 16/307 (5%)
Query: 51 WMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT----SYWLGLNEFADMSHEEFKN 106
W H KTY E+L R EI++ NL+ I N E + +Y LG+N DM+ EE
Sbjct: 29 WKKNHSKTYTSELEELGRREIWERNLRLITVHNLEASLGMHTYDLGMNHMGDMTREEILQ 88
Query: 107 KYLG--LKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVA 164
+ G ++P R P F ++P SVDWR+KG VT VKNQGSCGSCWAFS
Sbjct: 89 MFAGTRVRPNLTRRSSP---FVASAGISVPDSVDWREKGYVTEVKNQGSCGSCWAFSAAG 145
Query: 165 AVEGINQIVSGNLTSLSEQELIDCDTSFNN-GCNGGLMDYAFKYIVASGGLHKEEDYPYL 223
A+EG + +G + SLS Q L+DC + + N GCNGG M AF+Y++ GG+ +E YPY
Sbjct: 146 ALEGQLKRTTGQVKSLSPQNLVDCSSKYGNKGCNGGFMTQAFQYVIDDGGIDSDEAYPYT 205
Query: 224 MEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAIEASGTDFQFYSGGV 282
+G C + + S Y V E DE++L +A+A P+SVAI+A+ F Y GV
Sbjct: 206 AMDGQCRYDQSQ-RAANCSSYNYVSEGDEEALKQAVATIGPISVAIDATRPMFILYHSGV 264
Query: 283 FTGP-CGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGIN 341
++ P C ++HGV VGYG G DY +VKNSWG ++G+ GYIR+ RN G +CGI
Sbjct: 265 YSDPTCTQNVNHGVLVVGYGSLNGEDYWLVKNSWGTRFGDGGYIRIARNKGN---MCGIA 321
Query: 342 KMASIPL 348
A PL
Sbjct: 322 NYACYPL 328
>gi|195381187|ref|XP_002049336.1| GJ20806 [Drosophila virilis]
gi|194144133|gb|EDW60529.1| GJ20806 [Drosophila virilis]
Length = 339
Score = 240 bits (613), Expect = 6e-61, Method: Compositional matrix adjust.
Identities = 145/346 (41%), Positives = 198/346 (57%), Gaps = 26/346 (7%)
Query: 18 LFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLK 77
LFA +L V Y+ D + E ++++ +H K Y E+ R +IF EN
Sbjct: 4 LFALLALVAVAQAVSYA-------DVIKEEWQTFKLEHRKNYVDETEERFRLKIFNENKH 56
Query: 78 HIDQRNKEV----TSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPS------AEFSY 127
I + N+ S+ + +N++ADM H EF G + + S F
Sbjct: 57 KIAKHNQRYASGEVSFKMAVNKYADMLHHEFHTTMNGFNYTLHKQLRASDPSFVGVTFIS 116
Query: 128 RDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELID 187
+ +PKSVDWR KGAVT VK+QG CGSCWAFS+ A+EG + +G L SLSEQ L+D
Sbjct: 117 PEHVKIPKSVDWRSKGAVTEVKDQGHCGSCWAFSSTGALEGQHFRKAGTLISLSEQNLVD 176
Query: 188 CDTSF-NNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQD 246
C T + NNGCNGGLMD AF+YI +GG+ E+ YPY + +C K + T G D
Sbjct: 177 CSTKYGNNGCNGGLMDNAFRYIKDNGGIDTEKSYPYEGIDDSCHFNKATIG-ATDRGSVD 235
Query: 247 VPENDEQSLLKALAH-QPVSVAIEASGTDFQFYSGGVFTGP-CGAE-LDHGVAAVGYGKS 303
+P+ DE+ + +A+A PVSVAI+AS FQFYS G++ P C + LDHGV VGYG
Sbjct: 236 IPQGDEKKMAEAVATIGPVSVAIDASHESFQFYSEGIYNEPQCDPQNLDHGVLVVGYGTD 295
Query: 304 K-GSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPL 348
+ G DY +VKNSWG WG++G+I+M RN + CGI +S PL
Sbjct: 296 ESGQDYWLVKNSWGTTWGDKGFIKMARNA---DNQCGIASASSYPL 338
>gi|125811033|ref|XP_001361727.1| GA25021 [Drosophila pseudoobscura pseudoobscura]
gi|54636904|gb|EAL26307.1| GA25021 [Drosophila pseudoobscura pseudoobscura]
Length = 341
Score = 240 bits (613), Expect = 6e-61, Method: Compositional matrix adjust.
Identities = 141/319 (44%), Positives = 193/319 (60%), Gaps = 24/319 (7%)
Query: 49 ESWMS---KHGKTYKCIEEKLHRFEIFKENLKHIDQRNK----EVTSYWLGLNEFADMSH 101
E W + +H K Y+ E+ R +IF EN I + N+ S+ + +N++ADM H
Sbjct: 27 EEWHTFKLEHRKNYQDETEERFRLKIFNENKHKIAKHNQLWATGAVSFKMAVNKYADMLH 86
Query: 102 EEFKNKYLGLKPQFPTRRQPSAEFSYRDVK-------ALPKSVDWRKKGAVTPVKNQGSC 154
EF + G ++ +A+ S++ V LPK VDWR KGAVT VK+QG C
Sbjct: 87 HEFYSTMNGFNYTLH-KQLRNADESFKGVTFISPEHVTLPKQVDWRTKGAVTDVKDQGHC 145
Query: 155 GSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIVASGG 213
GSCWAFS+ A+EG + SG L SLSEQ L+DC T + NNGCNGGLMD AF+YI +GG
Sbjct: 146 GSCWAFSSTGALEGQHYRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGG 205
Query: 214 LHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAIEASG 272
+ E+ YPY + +C K + T G+ D+P+ +E+ + +A+A PV+VAI+AS
Sbjct: 206 IDTEKSYPYEAIDDSCHFNKGTIG-ATDRGFVDIPQGNEKKMAEAVATIGPVAVAIDASH 264
Query: 273 TDFQFYSGGVFTGP-CGAE-LDHGVAAVGYGKSK-GSDYIIVKNSWGPKWGERGYIRMKR 329
FQFYS GV+ P C A+ LDHGV VG+G + G DY +VKNSWG WG++G+I+M R
Sbjct: 265 ESFQFYSEGVYNEPACDAQNLDHGVLVVGFGTDESGQDYWLVKNSWGTTWGDKGFIKMLR 324
Query: 330 NTGKPEGLCGINKMASIPL 348
N E CGI +S PL
Sbjct: 325 N---KENQCGIASASSYPL 340
>gi|195153545|ref|XP_002017686.1| GL17172 [Drosophila persimilis]
gi|194113482|gb|EDW35525.1| GL17172 [Drosophila persimilis]
Length = 341
Score = 240 bits (613), Expect = 6e-61, Method: Compositional matrix adjust.
Identities = 141/319 (44%), Positives = 193/319 (60%), Gaps = 24/319 (7%)
Query: 49 ESWMS---KHGKTYKCIEEKLHRFEIFKENLKHIDQRNK----EVTSYWLGLNEFADMSH 101
E W + +H K Y+ E+ R +IF EN I + N+ S+ + +N++ADM H
Sbjct: 27 EEWHTFKLEHRKNYQDETEERFRLKIFNENKHKIAKHNQLWATGAVSFKMAVNKYADMLH 86
Query: 102 EEFKNKYLGLKPQFPTRRQPSAEFSYRDVK-------ALPKSVDWRKKGAVTPVKNQGSC 154
EF + G ++ +A+ S++ V LPK VDWR KGAVT VK+QG C
Sbjct: 87 HEFYSTMNGFNYTLH-KQLRNADESFKGVTFISPEHVTLPKQVDWRTKGAVTDVKDQGHC 145
Query: 155 GSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIVASGG 213
GSCWAFS+ A+EG + SG L SLSEQ L+DC T + NNGCNGGLMD AF+YI +GG
Sbjct: 146 GSCWAFSSTGALEGQHYRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGG 205
Query: 214 LHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAIEASG 272
+ E+ YPY + +C K + T G+ D+P+ +E+ + +A+A PV+VAI+AS
Sbjct: 206 IDTEKSYPYEAIDDSCHFNKGSIG-ATDRGFVDIPQGNEKKMAEAVATIGPVAVAIDASH 264
Query: 273 TDFQFYSGGVFTGP-CGAE-LDHGVAAVGYGKSK-GSDYIIVKNSWGPKWGERGYIRMKR 329
FQFYS GV+ P C A+ LDHGV VG+G + G DY +VKNSWG WG++G+I+M R
Sbjct: 265 ESFQFYSEGVYNEPACDAQNLDHGVLVVGFGTDESGEDYWLVKNSWGTTWGDKGFIKMLR 324
Query: 330 NTGKPEGLCGINKMASIPL 348
N E CGI +S PL
Sbjct: 325 N---KENQCGIASASSYPL 340
>gi|33242870|gb|AAQ01139.1| cathepsin [Branchiostoma lanceolatum]
Length = 334
Score = 240 bits (613), Expect = 6e-61, Method: Compositional matrix adjust.
Identities = 140/313 (44%), Positives = 179/313 (57%), Gaps = 15/313 (4%)
Query: 48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT----SYWLGLNEFADMSHEE 103
+E W +HGK Y+ E+ R IF++N I + N + SY L +N+F DM HEE
Sbjct: 24 WEMWKLQHGKQYETEAEEYSRRFIFEKNTVKIAEHNIRASLGMHSYTLAMNKFGDMHHEE 83
Query: 104 FKNKYLGLKPQFPTRRQPSAEFSYRDVKA-LPKSVDWRKKGAVTPVKNQGSCGSCWAFST 162
F + +G + + ++ D LPKSVDWR V+ VK+QG CGSCWAFST
Sbjct: 84 FHQRIMGGCLKIVKKPLLGSDVGDNDDNGTLPKSVDWRNSHMVSEVKDQGECGSCWAFST 143
Query: 163 VAAVEGINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKEEDYP 221
++EG + +G L LSEQ+L+DC F N GC GGLMD AF+YI A+GGL EE YP
Sbjct: 144 TGSLEGQHSNKTGKLVDLSEQQLVDCSKDFGNQGCGGGLMDQAFQYITANGGLDTEESYP 203
Query: 222 YLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAIEASGTDFQFYSG 280
Y + T+ GY+DV +E +L +A+A PVSVAI+A FQFYS
Sbjct: 204 YTATDDEPCKFDNSSVGATLVGYKDVKSGNEHALKRAVATVGPVSVAIDAGHESFQFYSS 263
Query: 281 GVFTGP-CGAE-LDHGVAAVGYGKSKGSD---YIIVKNSWGPKWGERGYIRMKRNTGKPE 335
GV+ P C E LDHGV AVGYG + + IVKNSWGP WG++GYI M RN
Sbjct: 264 GVYDEPQCSTEQLDHGVLAVGYGAMNDNSHQAFWIVKNSWGPSWGDQGYIMMSRNKNNQ- 322
Query: 336 GLCGINKMASIPL 348
CGI AS PL
Sbjct: 323 --CGIATSASYPL 333
>gi|21953244|emb|CAD42716.1| putative cathepsin L [Myzus persicae]
Length = 341
Score = 240 bits (613), Expect = 6e-61, Method: Compositional matrix adjust.
Identities = 149/356 (41%), Positives = 197/356 (55%), Gaps = 35/356 (9%)
Query: 10 LLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESW---MSKHGKTYKCIEEKL 66
+++ L L FA SS++ +++++IE E W + K Y+ I+E+
Sbjct: 3 VVIVLGLVAFAISSVSS------------INLNEVIE--EEWSLFKMQFKKLYEDIKEET 48
Query: 67 HRFEIFKENLKHIDQRNKEVTS----YWLGLNEFADMSHEEFKNKYLGLKPQFPT----- 117
R +++ +N I + NK S Y L +N F D+ E+ G KP
Sbjct: 49 FRKKVYLDNKLKIARHNKLYESGEETYALEMNHFGDLMQHEYSKMMNGFKPSLAGGDSNF 108
Query: 118 RRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNL 177
F + +PKS+DWRKKG VTPVKNQG CGSCW+FS ++EG + +G L
Sbjct: 109 TNDEGVTFLKSENVVIPKSIDWRKKGYVTPVKNQGQCGSCWSFSATGSLEGQHFRKTGVL 168
Query: 178 TSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEM 236
SLSEQ LIDC + NNGC GGLMD AFKYI ++ GL E+ YPY E+ C +
Sbjct: 169 VSLSEQNLIDCSRKYGNNGCEGGLMDLAFKYIKSNKGLDTEKSYPYEAEDDKCRYNPDN- 227
Query: 237 EVVTISGYQDVPENDEQSLLKALAH-QPVSVAIEASGTDFQFYSGGVFTGP--CGAELDH 293
T +G+ D+PE DE++L+ ALA PVS+AI+AS FQFY GVF P ELDH
Sbjct: 228 SGATDNGFVDIPEGDEEALMHALATVGPVSIAIDASSEKFQFYKKGVFYNPRCSSTELDH 287
Query: 294 GVAAVGY-GKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPL 348
GV AVG+ KG DY IVKNSWG WG+ GYI M RN + CG+ AS PL
Sbjct: 288 GVLAVGFRTDKKGGDYWIVKNSWGKTWGDEGYIMMARN---KKNNCGVASSASYPL 340
>gi|313507179|pdb|2ACT|A Chain A, Crystallographic Refinement Of The Structure Of Actinidin
At 1.7 Angstroms Resolution By Fast Fourier
Least-Squares Methods
Length = 220
Score = 240 bits (612), Expect = 7e-61, Method: Compositional matrix adjust.
Identities = 113/218 (51%), Positives = 150/218 (68%), Gaps = 2/218 (0%)
Query: 133 LPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSF 192
LP VDWR GAV +K+QG CG WAFS +A VEGIN+I SG+L SLSEQELIDC +
Sbjct: 1 LPSYVDWRSAGAVVDIKSQGECGGXWAFSAIATVEGINKITSGSLISLSEQELIDCGRTQ 60
Query: 193 NN-GCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPEND 251
N GC+GG + F++I+ GG++ EE+YPY ++G C+ ++ + VTI Y++VP N+
Sbjct: 61 NTRGCDGGYITDGFQFIINDGGINTEENYPYTAQDGDCDVALQDQKYVTIDTYENVPYNN 120
Query: 252 EQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIV 311
E +L A+ +QPVSVA++A+G F+ Y+ G+FTGPCG +DH + VGYG G DY IV
Sbjct: 121 EWALQTAVTYQPVSVALDAAGDAFKQYASGIFTGPCGTAVDHAIVIVGYGTEGGVDYWIV 180
Query: 312 KNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
KNSW WGE GY+R+ RN G G CGI M S P+K
Sbjct: 181 KNSWDTTWGEEGYMRILRNVGGA-GTCGIATMPSYPVK 217
>gi|428186189|gb|EKX55040.1| hypothetical protein GUITHDRAFT_63227 [Guillardia theta CCMP2712]
Length = 344
Score = 240 bits (612), Expect = 7e-61, Method: Compositional matrix adjust.
Identities = 148/330 (44%), Positives = 195/330 (59%), Gaps = 32/330 (9%)
Query: 43 KLIELFESWMSKHGKTY----KCIEEKLHRFEIFKENL----KHIDQRNKEVTSYWLGLN 94
K + + SW+ ++ K + E FE+F++NL KH ++ N+ + SY +GLN
Sbjct: 22 KYLSAWSSWVKEYNKEHWVDPYSSPESTRAFEVFQKNLDMIMKHNEEYNQGLQSYEMGLN 81
Query: 95 EFADMSHEEFKNKYLGLK----PQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKN 150
FA ++ EEF +YLG Q TRR E R +P SVDWR+KGAV VKN
Sbjct: 82 GFAHLTFEEFSAQYLGYGGAEVEQPKTRRAGKHERKSRS--EIPASVDWREKGAVAEVKN 139
Query: 151 QGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIV 209
QG+CGSCWAFS VAA+EG + + SG L SLSEQ+L+DC F N+GC GG MD AF+Y +
Sbjct: 140 QGACGSCWAFSAVAALEGAHFLNSGELISLSEQQLVDCSKKFGNHGCAGGYMDNAFEYWM 199
Query: 210 ASGGL--HKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSV 266
+ G E+DYPY +G C+ + + TISGY DV + +E LL A+A+ PVSV
Sbjct: 200 NNTGHGDDSEKDYPYKGMDGKCKFSADGVR-ATISGYNDVKQGNETDLLDAVANVGPVSV 258
Query: 267 AIEASGTDFQFYSGGVF---TGPCGAELDHGVAAVGYGKS-----KGSDYIIVKNSWGPK 318
AI A G QFY GVF G C L+HGV AVGYG + + DY I+KNSWG
Sbjct: 259 AIHA-GAALQFYLRGVFNGVAGTCFGPLNHGVTAVGYGTASLRFGRKMDYWIIKNSWGMG 317
Query: 319 WGERGYIRMKRNTGKPEGLCGINKMASIPL 348
WGE+G++R R + LCG+ AS PL
Sbjct: 318 WGEKGFVRFARG----KNLCGVANGASYPL 343
>gi|326672302|ref|XP_003199633.1| PREDICTED: cathepsin L1-like [Danio rerio]
gi|157423549|gb|AAI53506.1| Im:6910535 [Danio rerio]
Length = 335
Score = 240 bits (612), Expect = 8e-61, Method: Compositional matrix adjust.
Identities = 135/317 (42%), Positives = 189/317 (59%), Gaps = 16/317 (5%)
Query: 43 KLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT----SYWLGLNEFAD 98
+L + + SW S+HGK+Y + ++ R I++ENL+ I+Q N E + ++ +G+N+F D
Sbjct: 23 QLDDHWNSWKSQHGKSYH-EDVEVGRRMIWEENLRKIEQHNFEYSYGNHTFKMGMNQFGD 81
Query: 99 MSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCW 158
M++EEF+ G K Q P R A F A P+ VDWR++G VTPVK+Q CGSCW
Sbjct: 82 MTNEEFRQAMNGYK-QDPNRTSKGALFMEPSFFAAPQQVDWRQRGYVTPVKDQKQCGSCW 140
Query: 159 AFSTVAAVEGINQIVSGNLTSLSEQELIDCDT-SFNNGCNGGLMDYAFKYIVASGGLHKE 217
+FS+ A+EG +G L S+SEQ L+DC N GCNGG+MD AF+Y+ + GL E
Sbjct: 141 SFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQGNQGCNGGIMDQAFQYVKENKGLDSE 200
Query: 218 EDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAIEASGTDFQ 276
+ YPYL + V I+G+ D+P+ +E +L+ A+A PVSVAI+AS Q
Sbjct: 201 QSYPYLARDDLPCRYDPRFNVAKITGFVDIPKGNELALMNAVAAVGPVSVAIDASHQSLQ 260
Query: 277 FYSGGV-FTGPCGAELDHGVAAVGYGKS----KGSDYIIVKNSWGPKWGERGYIRMKRNT 331
FY G+ + C + LDH V VGYG G+ Y IVKNSW KWG++GYI M ++
Sbjct: 261 FYQSGIYYERACTSRLDHAVLVVGYGYQGADVAGNRYWIVKNSWSDKWGDKGYIYMAKDK 320
Query: 332 GKPEGLCGINKMASIPL 348
CGI MAS PL
Sbjct: 321 NNH---CGIATMASYPL 334
>gi|317135059|gb|ADV03094.1| cathepsin L [Hyriopsis cumingii]
gi|372126672|gb|AEX88474.1| cathepsin L [Hyriopsis schlegelii]
Length = 333
Score = 240 bits (612), Expect = 8e-61, Method: Compositional matrix adjust.
Identities = 134/311 (43%), Positives = 185/311 (59%), Gaps = 18/311 (5%)
Query: 48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT----SYWLGLNEFADMSHEE 103
++ ++ + KTY+ EE + R+ ++K+N I++ N + +YWL +NE+ D+++EE
Sbjct: 30 WQEFVRIYNKTYRAHEEPV-RYSVWKDNFLAINRHNSKADQGFHTYWLAMNEYGDLTNEE 88
Query: 104 FKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTV 163
+ GLK R+ F Y ++ P VDWR KG VTPVKNQG CGSC+AFS
Sbjct: 89 YFRLRTGLKINANIERR-GLVFKYTNLSEYPSEVDWRSKGYVTPVKNQGGCGSCYAFSAT 147
Query: 164 AAVEGINQIVSGNLTSLSEQELIDCDTSF---NNGCNGGLMDYAFKYIVASGGLHKEEDY 220
AVEG + +G L SLSEQ ++DC SF N GC GGLMD +F YI + G+ EE Y
Sbjct: 148 GAVEGQHFRKTGKLVSLSEQNIVDC--SFKEGNKGCRGGLMDKSFTYIKDNNGIDTEEAY 205
Query: 221 PYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAIEASGTDFQFYS 279
PY +G C ++ E+ T+ GY D+PENDE +L A+ P+SVAI+ +F+FY
Sbjct: 206 PYEARDGPCRFRRSEVG-ATVRGYVDLPENDEIALQHAVTTIGPISVAIDGHHFNFRFYH 264
Query: 280 GGVFTGP-CG-AELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGL 337
GVF P C +++HGV VGYG G DY +VKNSWG +WG GYI M RN +
Sbjct: 265 HGVFDNPNCSKTKINHGVLVVGYGTRDGLDYWLVKNSWGERWGAEGYILMSRNN---DNQ 321
Query: 338 CGINKMASIPL 348
C I AS P+
Sbjct: 322 CCITCAASYPI 332
>gi|390368662|ref|XP_780781.2| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
Length = 333
Score = 240 bits (612), Expect = 8e-61, Method: Compositional matrix adjust.
Identities = 138/318 (43%), Positives = 184/318 (57%), Gaps = 14/318 (4%)
Query: 40 SMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT----SYWLGLNE 95
S E + W ++HGK Y EE+ R I+++NL + + N + +Y LG+N+
Sbjct: 20 SFTDFDEDWNQWKNEHGKRYLSDEEEASRKLIWEKNLDIVIKHNLKYDLGHFTYALGMNQ 79
Query: 96 FADMSHEEFKNKYLGLKPQFPTRRQPSAEF-SYRDVKALPKSVDWRKKGAVTPVKNQGSC 154
FAD+ +EEF G + ++ + F +V LPK+VDWR KG VTPVK+QG C
Sbjct: 80 FADLQNEEFVAMMTGFRVNGTSKAAKGSTFLPSNNVDKLPKTVDWRTKGYVTPVKDQGQC 139
Query: 155 GSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGL 214
GSCWAFS ++EG +G L SLSEQ L+DC N GC+GG MD AF+YI+ +GG+
Sbjct: 140 GSCWAFSATGSLEGQQFKKTGKLVSLSEQNLVDCSYR-NYGCHGGFMDRAFQYIIDAGGI 198
Query: 215 HKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAIEASGT 273
E Y Y +G C KK + T++GY DV E++L KA+AH P+SVAI+AS
Sbjct: 199 DTEATYSYRAVDGNCHFKKANVG-ATVTGYTDVTSGSEKALQKAVAHIGPISVAIDASHK 257
Query: 274 DFQFYSGGVFTGP-CG-AELDHGVAAVGYG-KSKGSDYIIVKNSWGPKWGERGYIRMKRN 330
F+FY GV+ P C L H V VGYG S G+DY IVKNSW WG GY+ M RN
Sbjct: 258 FFKFYKSGVYNEPGCSTTRLGHAVLVVGYGTTSDGTDYWIVKNSWAKTWGMNGYLWMSRN 317
Query: 331 TGKPEGLCGINKMASIPL 348
+ CGI AS P+
Sbjct: 318 ---KDNQCGIASEASYPM 332
>gi|219362839|ref|NP_001136636.1| uncharacterized protein LOC100216764 precursor [Zea mays]
gi|194696462|gb|ACF82315.1| unknown [Zea mays]
gi|413934556|gb|AFW69107.1| hypothetical protein ZEAMMB73_554980 [Zea mays]
Length = 361
Score = 240 bits (612), Expect = 8e-61, Method: Compositional matrix adjust.
Identities = 146/356 (41%), Positives = 200/356 (56%), Gaps = 22/356 (6%)
Query: 12 LSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEI 71
++ +L + S S + Y+ L S + L L+E W + H + + EK RF +
Sbjct: 11 MAAALVVVIALSTTPAASAIDYTEHDLASEESLWALYERWCA-HYNMARDLGEKTRRFNL 69
Query: 72 FKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEF----SY 127
FKEN I + N+ +Y LGLN F+DM+ EEF G P +R E +
Sbjct: 70 FKENAHRIYEHNQGNATYTLGLNRFSDMTDEEFSRSPYGRCLFAPVQRISDGENEELQQH 129
Query: 128 RDVK------------ALPKSVDWRKKGAVTPVKNQG-SCGSCWAFSTVAAVEGINQIVS 174
DV LP SVDWR + +VT VK+QG +CGSCWAF+ +AAVEGIN I +
Sbjct: 130 EDVSFNLTHGGATAALGLPPSVDWRGR-SVTRVKDQGLTCGSCWAFAAIAAVEGINAIRT 188
Query: 175 GNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKE 234
+L +LSEQ+L+DCD + ++GC GG + A +IV + G+ E YPY+ +G C +
Sbjct: 189 WSLVTLSEQQLVDCD-NVDHGCAGGWIPSALDFIVRNRGIVPEGTYPYIGTQGRC--RHV 245
Query: 235 EMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHG 294
VTI GY+ V D +L+ A+A QPV+VA+E+S F+ Y GGVF G CG L H
Sbjct: 246 MAPPVTIDGYRRVLPFDVNALMSAVAAQPVAVAMESSAWAFRHYQGGVFNGNCGGRLGHA 305
Query: 295 VAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLKK 350
A VGYG G + IVKNSWGPKWGE GY+R+ RN G+CGI P+K+
Sbjct: 306 AAVVGYGDGAGGPFWIVKNSWGPKWGEGGYVRISRNAPNRLGICGILTQPLYPVKR 361
>gi|156739289|ref|NP_001096592.1| uncharacterized protein LOC569326 precursor [Danio rerio]
gi|156230119|gb|AAI52283.1| Im:6910535 protein [Danio rerio]
Length = 335
Score = 240 bits (612), Expect = 8e-61, Method: Compositional matrix adjust.
Identities = 135/317 (42%), Positives = 188/317 (59%), Gaps = 16/317 (5%)
Query: 43 KLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT----SYWLGLNEFAD 98
+L + + SW S+HGK+Y + ++ R I++ENL+ I+Q N E + ++ +G+N+F D
Sbjct: 23 QLDDHWNSWKSQHGKSYH-EDVEVGRRMIWEENLRKIEQHNFEYSLGNHTFKMGMNQFGD 81
Query: 99 MSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCW 158
M++EEF+ G K Q P R A F A P+ VDWR++G VTPVK+Q CGSCW
Sbjct: 82 MTNEEFRQAMNGYK-QDPNRTSKGALFMEPSFFAAPQQVDWRQRGYVTPVKDQKQCGSCW 140
Query: 159 AFSTVAAVEGINQIVSGNLTSLSEQELIDCDT-SFNNGCNGGLMDYAFKYIVASGGLHKE 217
+FS+ A+EG +G L S+SEQ L+DC N GCNGG+MD AF+Y+ + GL E
Sbjct: 141 SFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQGNQGCNGGIMDQAFQYVKENKGLDSE 200
Query: 218 EDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAIEASGTDFQ 276
+ YPYL + V I+G+ D+P +E +L+ A+A PVSVAI+AS Q
Sbjct: 201 QSYPYLARDDLPCRYDPRFNVAKITGFVDIPRGNELALMNAVAAVGPVSVAIDASHQSLQ 260
Query: 277 FYSGGV-FTGPCGAELDHGVAAVGYGKS----KGSDYIIVKNSWGPKWGERGYIRMKRNT 331
FY G+ + C + LDH V VGYG G+ Y IVKNSW KWG++GYI M ++
Sbjct: 261 FYQSGIYYERACTSRLDHAVLVVGYGYQGADVAGNRYWIVKNSWSDKWGDKGYIYMAKDK 320
Query: 332 GKPEGLCGINKMASIPL 348
CGI MAS PL
Sbjct: 321 NNH---CGIATMASYPL 334
>gi|185135439|ref|NP_001117777.1| procathepsin L precursor [Oncorhynchus mykiss]
gi|14582899|gb|AAK69706.1|AF358668_1 procathepsin L [Oncorhynchus mykiss]
Length = 338
Score = 240 bits (612), Expect = 8e-61, Method: Compositional matrix adjust.
Identities = 150/335 (44%), Positives = 195/335 (58%), Gaps = 26/335 (7%)
Query: 29 SIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT- 87
S V +P + ++ L+++W SKH Y EE R ++++NLK I+ N E T
Sbjct: 14 SAVCAAPRFDSQLEDHWHLWKNWHSKH---YHESEEGWRRM-VWEKNLKKIEIHNLEHTM 69
Query: 88 ---SYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGA 144
SY LG+N F DM++EEF+ G K Q R+ + F + PK+VDWR+KG
Sbjct: 70 GKHSYRLGMNHFGDMTNEEFRQTMNGYK-QTTERKFKGSLFMEPNYLQAPKAVDWREKGY 128
Query: 145 VTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDT-SFNNGCNGGLMDY 203
VTPVK+QGSCGSCWAFST A+EG +G L SLSEQ L+DC N GCNGGLMD
Sbjct: 129 VTPVKDQGSCGSCWAFSTTGAMEGQQFRKTGKLVSLSEQNLVDCSRPEGNEGCNGGLMDQ 188
Query: 204 AFKYIVASGGLHKEEDYPYLMEEGTCEDK---KEEMEVVTISGYQDVPENDEQSLLKALA 260
AF+YI + GL EE YPY+ GT ED K E +G+ D+P E +++KA+A
Sbjct: 189 AFQYIQDNAGLDTEESYPYV---GTDEDPCHYKPEFSAANETGFVDIPSGKEHAMMKAVA 245
Query: 261 H-QPVSVAIEASGTDFQFYSGGV-FTGPCGA-ELDHGVAAVGYG----KSKGSDYIIVKN 313
PVSVAI+A FQFY G+ + C + ELDHGV VGYG G Y IVKN
Sbjct: 246 AVGPVSVAIDAGHESFQFYESGIYYEKECSSEELDHGVLVVGYGFEGEDVDGKKYWIVKN 305
Query: 314 SWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPL 348
SW KWG++GYI M ++ + CGI +S PL
Sbjct: 306 SWSEKWGDKGYIYMAKDR---KNHCGIATASSYPL 337
>gi|449524450|ref|XP_004169236.1| PREDICTED: vignain-like [Cucumis sativus]
Length = 283
Score = 240 bits (612), Expect = 9e-61, Method: Compositional matrix adjust.
Identities = 126/292 (43%), Positives = 176/292 (60%), Gaps = 21/292 (7%)
Query: 67 HRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKY-------LGLKPQFPTRR 119
RF++FK+N KH+ + N S L LN+FADMS +EF Y L + R
Sbjct: 3 RRFKVFKDNAKHVFKVNHMGKSLKLKLNQFADMSDDEFSKTYGSNITYYKNLHAKVGGR- 61
Query: 120 QPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTS 179
F Y +P S+DWRKKGA + C CWAF+ VAAVE I+QI + L S
Sbjct: 62 --VGGFMYERATNIPSSIDWRKKGA------RRMC--CWAFAAVAAVESIHQIRTNELVS 111
Query: 180 LSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVV 239
LSEQE++DCD GC GG AF++I+ +GG+ E +YPY +G C + E V
Sbjct: 112 LSEQEVVDCDYKVG-GCRGGDYISAFEFIMENGGITVENNYPYYAGDGYCRRRGPNNERV 170
Query: 240 TISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFT--GPCGAELDHGVAA 297
TI GY++VP N+E +L+KA+AHQPV+V+I + G+DF+FY G+FT CG +DH V
Sbjct: 171 TIDGYENVPRNNEYALMKAVAHQPVAVSIASRGSDFKFYGEGMFTEENFCGIRIDHTVVV 230
Query: 298 VGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
VGYG + DY I++N +G +WG GY++M+R T P+G+CG+ + P+K
Sbjct: 231 VGYGSDEEGDYWIIRNQYGTQWGMNGYMKMQRGTRSPQGVCGMAMYPAFPVK 282
>gi|440799058|gb|ELR20119.1| cysteine proteinase [Acanthamoeba castellanii str. Neff]
Length = 401
Score = 240 bits (612), Expect = 9e-61, Method: Compositional matrix adjust.
Identities = 129/311 (41%), Positives = 185/311 (59%), Gaps = 17/311 (5%)
Query: 48 FESWMSKHGKTYKCIEEKLHRFEIFKEN---LKHIDQRNKEVTSYWLGLNEFADMSHEEF 104
F WM H K+Y + L RFEI+K N + H ++++ +S+ + +N+F D++ +EF
Sbjct: 95 FTEWMRTHRKSYH-HDHFLPRFEIWKTNNRWITHWNKKHANASSFTVAINQFGDLTSDEF 153
Query: 105 KNKYLGL----KPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAF 160
Y GL P+ + + +++ + +P+S DWR+KG V+ VK+QG CGSCWAF
Sbjct: 154 NRLYNGLHVFSAPKASEKVERPRQWA--NTAGIPESGDWRQKGVVSRVKDQGMCGSCWAF 211
Query: 161 STVAAVEGINQIVSGNLTSLSEQELIDCDTSF--NNGCNGGLMDYAFKYIVASGGLHKEE 218
ST + EGIN I + L LSEQ L+DC T+ N GCNGG MD AF+YI+ + G+ E
Sbjct: 212 STTGSTEGINAITTSRLVPLSEQNLVDCATAAYDNYGCNGGFMDNAFRYIIDNKGIDSEA 271
Query: 219 DYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFY 278
YPY+ +G C + + + +P+ DE++LL A A QP+SV I+A FQFY
Sbjct: 272 SYPYVAADGQCRFNPKTVYGGKGGTLKSLPKGDEKALLVAAARQPISVGIDAGRPSFQFY 331
Query: 279 SGGVFTGP-CGA-ELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEG 336
S GV+ P C + EL+HGV VG+G +G Y +VKNSWG WG GYI+M R+
Sbjct: 332 SKGVYNEPECSSTELNHGVLIVGWGVERGQAYWLVKNSWGQTWGMDGYIKMSRDKNNQ-- 389
Query: 337 LCGINKMASIP 347
CGI +AS P
Sbjct: 390 -CGIATLASYP 399
>gi|158268253|gb|ABW25046.1| cathepsin L-like protease [Strongylus vulgaris]
Length = 354
Score = 240 bits (612), Expect = 9e-61, Method: Compositional matrix adjust.
Identities = 147/354 (41%), Positives = 206/354 (58%), Gaps = 19/354 (5%)
Query: 9 LLLLSLSLSLFAC--SSLAHDFSIVGYSPEHLTS-MDKLIELFESWMSKHGKTYKCIEEK 65
L L+ L S+FA S HD +I + + L +D+ +L++ + GK+Y EE
Sbjct: 5 LSLVLLCASVFASIDSGSRHDHTIRLHRVKSLRQKIDEAFKLWDDYKESFGKSYNKDEEN 64
Query: 66 LHRFEIFKENLKHIDQRNKE----VTSYWLGLNEFADMSHEEFK--NKYLGLKPQFPTRR 119
+ E F +N+ HID+ N+E ++ +GLN AD+ +++ N Y + + +
Sbjct: 65 DY-MEAFVKNVIHIDEHNQEHRLGRKTFEMGLNSIADLPFSQYRKLNGYRHRRNFGDSMQ 123
Query: 120 QPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTS 179
++ +P SVDWR KG VT VKNQG CGSCWAFS A+EG + SG + S
Sbjct: 124 SNGTKWLAPFNVEIPDSVDWRDKGLVTDVKNQGMCGSCWAFSATGALEGQHARASGKMVS 183
Query: 180 LSEQELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEV 238
LSEQ L+DC T + N+GCNGGLMD AF+YI + G+ EE YPY+ E C KK+++
Sbjct: 184 LSEQNLVDCSTKYGNHGCNGGLMDLAFEYIKDNHGIDTEESYPYVGRETKCHFKKKDIGA 243
Query: 239 VTISGYQDVPENDEQSLLKALAHQ-PVSVAIEASGTDFQFYSGGVFTG-PCGA-ELDHGV 295
G+ D+PE DE++L A+A Q P+S+AI+A FQ Y GV+ C + ELDHGV
Sbjct: 244 ED-KGFVDLPEGDEEALKVAVATQGPISIAIDAGHRTFQLYKKGVYYDEECSSEELDHGV 302
Query: 296 AAVGYGKS-KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPL 348
VGYG + DY ++KNSWGP WGE+GYIR+ RN CG+ AS PL
Sbjct: 303 LLVGYGTDPEAGDYWLIKNSWGPGWGEKGYIRIARNRSNH---CGVATKASYPL 353
>gi|351629617|gb|AEQ54772.1| KDEL-tailed cysteine proteinase CP4, partial [Coffea canephora]
Length = 215
Score = 240 bits (612), Expect = 9e-61, Method: Compositional matrix adjust.
Identities = 122/201 (60%), Positives = 148/201 (73%), Gaps = 4/201 (1%)
Query: 152 GSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVAS 211
G CGSCWAFSTV VEGIN+I +G L SLSEQEL+DC+T N GCNGGLM+ A+++I S
Sbjct: 1 GKCGSCWAFSTVVGVEGINKIKTGQLVSLSEQELVDCETD-NEGCNGGLMENAYEFIKKS 59
Query: 212 GGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEAS 271
GG+ E YPY +G+C+ K VTI G++ VP NDE +L+KA+A+QPVSVAI+AS
Sbjct: 60 GGITTERLYPYKARDGSCDSSKMNAPAVTIDGHEMVPANDENALMKAVANQPVSVAIDAS 119
Query: 272 GTDFQFYSGGVFTG-PCGAELDHGVAAVGYGKS-KGSDYIIVKNSWGPKWGERGYIRMKR 329
G+D QFYS GV+TG CG ELDHGVA VGYG + G+ Y IVKNSWG WGE+GYIRM+R
Sbjct: 120 GSDMQFYSEGVYTGDSCGNELDHGVAVVGYGTALDGTKYWIVKNSWGTGWGEQGYIRMQR 179
Query: 330 NTGKPE-GLCGINKMASIPLK 349
E G+CGI AS PLK
Sbjct: 180 GVDAAEGGVCGIAMEASYPLK 200
>gi|118425914|gb|ABK90856.1| cathepsin-L-like cysteine peptidase [Radix peregra]
Length = 324
Score = 239 bits (611), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 146/349 (41%), Positives = 201/349 (57%), Gaps = 36/349 (10%)
Query: 8 KLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLH 67
KL +L+L++S+ A S+ A+ + + +KH KTY E+ +
Sbjct: 3 KLTILALAISVAAASTEAN---------------------WAIFKAKHNKTYSGDEDIIR 41
Query: 68 RFEIFKENLKHIDQRN----KEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSA 123
R+ I++ NL+ I+ N K +++Y+LG N++ADM++EEF+ GL+
Sbjct: 42 RY-IWQTNLQKIEAHNELYAKGLSTYFLGENKYADMTNEEFRRTLSGLRVDKELTPGDFV 100
Query: 124 EFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQ 183
++D +LP +VDWRK+G VT VK+QG CGSCWAFST ++EG + + L SLSE
Sbjct: 101 SGMFKD--SLPTAVDWRKEGYVTEVKDQGQCGSCWAFSTTGSLEGQHFKATKQLVSLSES 158
Query: 184 ELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTIS 242
L+DC + N GCNGGLMD AFKYI + G+ E+ YPY E+ C KK + T
Sbjct: 159 NLVDCSKKWGNQGCNGGLMDNAFKYIADNKGIDTEKSYPYKPEDRKCNFKKANVG-ATDK 217
Query: 243 GYQDVPENDEQSLLKALAH-QPVSVAIEASGTDFQFYSGGVFT-GPCGAE-LDHGVAAVG 299
Y+D+ E +L +A+A P+SVAI+AS FQ YSGGV+ C + LDHGV AVG
Sbjct: 218 LYKDITSGSEDALQEAVATIGPISVAIDASHDSFQLYSGGVYNEKACSTKTLDHGVLAVG 277
Query: 300 YGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPL 348
Y G DY IVKNSWG WG GYI M RN + CGI MAS P+
Sbjct: 278 YDSKNGDDYWIVKNSWGKSWGIDGYIWMSRN---KKNQCGIATMASYPV 323
>gi|355681656|gb|AER96815.1| Cathepsin L precursor [Mustela putorius furo]
Length = 331
Score = 239 bits (611), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 143/332 (43%), Positives = 191/332 (57%), Gaps = 24/332 (7%)
Query: 28 FSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT 87
IV +P+ S+D + W + HGK Y EE R ++++NLK I Q N+E +
Sbjct: 12 LGIVSAAPKLYQSLDAR---WSQWKAAHGKLYDENEEGWRR-AVWEKNLKVIKQHNQEYS 67
Query: 88 ----SYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKG 143
S+ + +N F D+++EEFK GLK Q +R+ F P SVDWRKKG
Sbjct: 68 QGKHSFTMAMNAFGDLTNEEFKQVMNGLKSQ---KRKEGNVFQAPPFAETPSSVDWRKKG 124
Query: 144 AVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTS-FNNGCNGGLMD 202
VTPVKNQG CGSCWAFS A+EG + L SLSEQ L+DC + N GC+GGLMD
Sbjct: 125 YVTPVKNQGPCGSCWAFSATGALEGQMFRKTKRLVSLSEQNLVDCSQAEGNEGCSGGLMD 184
Query: 203 YAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQ 262
YAF+Y+ +GGL EE YPY ++ +C+ K E+ +G+ D+ +E L
Sbjct: 185 YAFQYVKDNGGLDSEESYPYRAQDESCKYKPEQ-SAANDTGFMDIHPEEESLKLAVATVG 243
Query: 263 PVSVAIEASGTDFQFYSGGVFTGP-CGAE-LDHGVAAVGYGKSKGSD-----YIIVKNSW 315
P+S AI+AS + FQFY G++ P C +E LDHG+ VGYG S+G D Y IVKNSW
Sbjct: 244 PISAAIDASLSTFQFYHKGIYYDPDCSSENLDHGILVVGYG-SQGEDSEKQKYWIVKNSW 302
Query: 316 GPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
G WG +GYI M ++ + CGI AS P
Sbjct: 303 GTDWGTQGYILMAKDR---DNHCGIATAASFP 331
>gi|50657027|emb|CAH04631.1| cathepsin H [Suberites domuncula]
Length = 335
Score = 239 bits (611), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 139/312 (44%), Positives = 187/312 (59%), Gaps = 19/312 (6%)
Query: 46 ELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFK 105
+ F+ W KHGK Y EE R ++F +N+ +ID NK+ SY L +NE+ADM+ +EFK
Sbjct: 33 DYFKEWQEKHGKVYSTEEESQSRLKVFMKNVIYIDNHNKQGHSYELEVNEYADMTLDEFK 92
Query: 106 NKYLGLKPQF--PTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTV 163
++YL ++PQ T S YRD PK++DWR KGAVTPVKNQG CGSCW FST
Sbjct: 93 DQYL-MEPQHCSATHSLKSDPPKYRDP---PKAIDWRSKGAVTPVKNQGQCGSCWTFSTT 148
Query: 164 AAVEGINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKEEDYPY 222
+E + + +G L SLSEQ+L+DC +F NNGCNGGL AF+YI +GGL EE YPY
Sbjct: 149 GCLESHHFLKTGQLVSLSEQQLVDCAQAFNNNGCNGGLPSQAFEYIHYNGGLDSEESYPY 208
Query: 223 LMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQ-PVSVAIEASGTDFQFYSGG 281
+ C E+ T+S ++ DE L A+ PVS+A + S DF+FY G
Sbjct: 209 RAHDEKCHFVPSEVS-ATVSNVVNITSKDEMQLYNAVGTVGPVSIAYDVSA-DFRFYKKG 266
Query: 282 VF-TGPCGAE---LDHGVAAVGYGKSK-GSDYIIVKNSWGPKWGERGYIRMKRNTGKPEG 336
V+ + C + ++H V AVGY ++ G DY IVKNSWG K+G GY + R E
Sbjct: 267 VYKSKECKTDPEHVNHAVLAVGYNTTESGEDYWIVKNSWGTKFGINGYFWIARG----EN 322
Query: 337 LCGINKMASIPL 348
+CG+ AS P+
Sbjct: 323 MCGLADCASYPI 334
>gi|161408095|dbj|BAF94151.1| cathepsin L-like cysteine protease 1 [Plautia stali]
Length = 344
Score = 239 bits (611), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 135/312 (43%), Positives = 185/312 (59%), Gaps = 16/312 (5%)
Query: 48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNK-----EVTSYWLGLNEFADMSHE 102
F + S++ K Y + +R +++K+N K + + N+ EVT Y + LN ADM
Sbjct: 23 FTRFKSQYRKDYPSDSVERYRKKVYKQNEKFVREHNERYERGEVT-YKMALNHLADMHPR 81
Query: 103 EFKNKYLGLKPQF-PTRRQPSA-EFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAF 160
EF +LG T + P F + + K VDWR+KGA++PVK+QG CGSCWAF
Sbjct: 82 EFMATFLGFNRSLRATNKVPEGIPFRHNKDAVIQKEVDWRQKGAISPVKDQGHCGSCWAF 141
Query: 161 STVAAVEGINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKEED 219
S+ A+E + G SLSEQ LIDC ++ NNGC GGLM+ AF+Y+ + G+ EE
Sbjct: 142 SSTGALEAHTFLKKGRRVSLSEQNLIDCSLNYGNNGCEGGLMEQAFQYVRDNDGIDTEEA 201
Query: 220 YPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQ-PVSVAIEASGTDFQFY 278
YPY E+ C KK + T +G+ +P DEQ+L++A+A Q P+S+AI+AS FQFY
Sbjct: 202 YPYEGEDSECRFKKNNVG-ATDAGFVTIPSGDEQALMEAVATQGPLSIAIDASNPSFQFY 260
Query: 279 SGGVFTGP--CGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEG 336
S GV+ P A+LDHGV VGYG K Y +VKNSW +WGE GYI+M RN +
Sbjct: 261 SEGVYYEPECSSAQLDHGVLLVGYGVEKDQKYWLVKNSWSEQWGENGYIKMARNK---DN 317
Query: 337 LCGINKMASIPL 348
CGI AS P+
Sbjct: 318 NCGIATQASFPI 329
>gi|149617838|ref|XP_001521715.1| PREDICTED: cathepsin L1-like [Ornithorhynchus anatinus]
Length = 338
Score = 239 bits (611), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 139/327 (42%), Positives = 194/327 (59%), Gaps = 19/327 (5%)
Query: 34 SPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKE----VTSY 89
+P + +D+ +L+++W H K+Y EE R +++ENLK I N E + +Y
Sbjct: 18 APLGDSELDRHWKLWKNW---HQKSYHEAEEGWRR-TVWEENLKAIQLHNLEQSLGLHTY 73
Query: 90 WLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVK 149
LG+N+F D+++EEF+ G + R + F + +P SVDWR G VTPVK
Sbjct: 74 RLGMNQFGDLTNEEFQEILTGERHFSKGNRINGSAFLEANFVQVPTSVDWRDHGYVTPVK 133
Query: 150 NQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCD-TSFNNGCNGGLMDYAFKYI 208
NQG CGSCWAFST A+EG SG L SLSEQ L+DC N GC+GG++D AF+YI
Sbjct: 134 NQGHCGSCWAFSTTGALEGQLFRKSGRLISLSEQNLVDCSWQQGNQGCHGGIVDLAFQYI 193
Query: 209 VASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVA 267
+ + G+ E+ YPY ++ K E ++G+ D+P + E++L+KA+A PVSV
Sbjct: 194 LQNQGIDSEDCYPYTAKDTAQCTFKPECATAPVTGFVDIPPHSEEALMKAVATVGPVSVG 253
Query: 268 IEASGTDFQFYSGGVFTGP-CGAE-LDHGVAAVGYGKSK----GSDYIIVKNSWGPKWGE 321
I+AS T F+FY G+F P C +E LDH V VGYG + G Y IVKNSWG WG+
Sbjct: 254 IDASSTSFRFYQSGIFYDPKCSSESLDHAVLVVGYGYEREDEAGKKYWIVKNSWGKHWGD 313
Query: 322 RGYIRMKRNTGKPEGLCGINKMASIPL 348
RGY+ M ++ G CGI +AS PL
Sbjct: 314 RGYVYMSKDRGNH---CGIATVASYPL 337
>gi|340727787|ref|XP_003402217.1| PREDICTED: cathepsin L-like [Bombus terrestris]
Length = 343
Score = 239 bits (611), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 145/309 (46%), Positives = 179/309 (57%), Gaps = 18/309 (5%)
Query: 54 KHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEV----TSYWLGLNEFADMSHEEFKNKYL 109
+H K YK E+ R +IF +N I + N SY L +N++ DM H EF N
Sbjct: 34 EHNKVYKNDVEERFRMKIFMDNKHKIAKHNGNYEMKKVSYKLKMNKYGDMLHHEFVNTLN 93
Query: 110 G----LKPQFPTRRQP-SAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVA 164
G + Q + R P +A F LPK+VDWR+ GAVTPVK+QG CGSCW+FS
Sbjct: 94 GFNKSINTQLRSERLPIAASFIEPANVVLPKTVDWREHGAVTPVKDQGHCGSCWSFSATG 153
Query: 165 AVEGINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKEEDYPYL 223
A+EG + +G L LSEQ LIDC + NNGCNGGLMD AF+YI + GL E YPY
Sbjct: 154 ALEGQHFRRTGILIPLSEQNLIDCSGKYGNNGCNGGLMDQAFQYIKDNKGLDTEVTYPYE 213
Query: 224 MEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAIEASGTDFQFYSGGV 282
E C + GY D+P+ +E+ L A+A PVSVAI+AS FQFYS GV
Sbjct: 214 AENDKCRYNAANSGARDV-GYVDIPQGNEKKLKAAVATIGPVSVAIDASHQSFQFYSEGV 272
Query: 283 FTGP-CGAE-LDHGVAAVGYGKSK-GSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCG 339
+ P C +E LDHGV AVGYG + G DY +VKNSWG WG+ GYI+M RN CG
Sbjct: 273 YYEPECSSENLDHGVLAVGYGTDENGQDYWLVKNSWGETWGDNGYIKMARNKLNH---CG 329
Query: 340 INKMASIPL 348
I AS PL
Sbjct: 330 IASTASYPL 338
>gi|33242872|gb|AAQ01140.1| cathepsin [Branchiostoma lanceolatum]
Length = 334
Score = 239 bits (611), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 141/314 (44%), Positives = 183/314 (58%), Gaps = 17/314 (5%)
Query: 48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT----SYWLGLNEFADMSHEE 103
+E W +HGK Y+ E+ R IF++N I + N + SY L +N+F DM HEE
Sbjct: 24 WEMWKLQHGKQYETEAEEYSRRFIFEKNTVKIAEHNIRASLGMHSYTLAMNKFGDMHHEE 83
Query: 104 FKNKYLGLKPQFPTRRQPSAEFSYRDVKA-LPKSVDWRKKGAVTPVKNQGSCGSCWAFST 162
F + +G + + ++ D LPKSVDWR V+ VK+QG CGSCWAFST
Sbjct: 84 FHQRIMGGCLKIVKKPLLGSDVGDNDDNGTLPKSVDWRNSHMVSEVKDQGECGSCWAFST 143
Query: 163 VAAVEGINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKEEDYP 221
++EG + +G L LSEQ+L+DC F N GC GGLMD AF+YI A+GGL EE YP
Sbjct: 144 TGSLEGQHSSKTGKLVDLSEQQLVDCSKDFGNQGCGGGLMDQAFQYIKANGGLDTEESYP 203
Query: 222 YL-MEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAIEASGTDFQFYS 279
Y ++ C+ + T+ GY+DV +E +L +A+A PVSVAI+A FQFYS
Sbjct: 204 YTATDDKPCKFDNSSVG-ATLVGYKDVKSGNEHALKRAVATVGPVSVAIDAGHESFQFYS 262
Query: 280 GGVFTGP-CGAE-LDHGVAAVGYGKSKGSD---YIIVKNSWGPKWGERGYIRMKRNTGKP 334
GV+ P C E LDHGV AVGYG + + IVKNSWGP WG++GYI M RN
Sbjct: 263 SGVYDEPQCSTEQLDHGVLAVGYGAMNDNSHQAFWIVKNSWGPSWGDQGYIMMSRNKNNQ 322
Query: 335 EGLCGINKMASIPL 348
CGI AS PL
Sbjct: 323 ---CGIATSASYPL 333
>gi|33242874|gb|AAQ01141.1| cathepsin [Branchiostoma lanceolatum]
Length = 334
Score = 239 bits (611), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 141/314 (44%), Positives = 183/314 (58%), Gaps = 17/314 (5%)
Query: 48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT----SYWLGLNEFADMSHEE 103
+E W +HGK Y+ E+ R IF++N I + N + SY L +N+F DM HEE
Sbjct: 24 WEMWKLQHGKQYETEAEEYSRRFIFEKNTVKIAEHNIRASLGMHSYTLAMNKFGDMHHEE 83
Query: 104 FKNKYLGLKPQFPTRRQPSAEFSYRDVKA-LPKSVDWRKKGAVTPVKNQGSCGSCWAFST 162
F + +G + + ++ D LPKSVDWR V+ VK+QG CGSCWAFST
Sbjct: 84 FHQRIMGGCLKIVKKPLLGSDVGDNDDNGTLPKSVDWRNSHMVSEVKDQGECGSCWAFST 143
Query: 163 VAAVEGINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKEEDYP 221
++EG + +G L LSEQ+L+DC F N GC GGLMD AF+YI A+GGL EE YP
Sbjct: 144 TGSLEGQHSNKTGKLVDLSEQQLVDCSKDFGNQGCGGGLMDQAFQYIKANGGLDTEESYP 203
Query: 222 YL-MEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAIEASGTDFQFYS 279
Y ++ C+ + T+ GY+DV +E +L +A+A PVSVAI+A FQFYS
Sbjct: 204 YTATDDKPCKFDNSSVG-ATLVGYKDVKSGNEHALKRAVATVGPVSVAIDAGHESFQFYS 262
Query: 280 GGVFTGP-CGAE-LDHGVAAVGYGKSKGSD---YIIVKNSWGPKWGERGYIRMKRNTGKP 334
GV+ P C E LDHGV AVGYG + + IVKNSWGP WG++GYI M RN
Sbjct: 263 SGVYDEPQCSTEQLDHGVLAVGYGAMNDNSHQAFWIVKNSWGPSWGDQGYIMMSRNKNNQ 322
Query: 335 EGLCGINKMASIPL 348
CGI AS PL
Sbjct: 323 ---CGIATSASYPL 333
>gi|351721011|ref|NP_001238219.1| P34 probable thiol protease precursor [Glycine max]
gi|1199563|gb|AAB09252.1| 34 kDa maturing seed vacuolar thiol protease precursor [Glycine max]
Length = 379
Score = 239 bits (610), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 140/344 (40%), Positives = 196/344 (56%), Gaps = 32/344 (9%)
Query: 29 SIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRN---KE 85
SI+ T+ ++ LF+ W S+HG+ Y EE+ R EIFK N +I N K
Sbjct: 25 SILDLDLTKFTTQKQVSSLFQLWKSEHGRVYHNHEEEAKRLEIFKNNSNYIRDMNANRKS 84
Query: 86 VTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKA-------LPKSVD 138
S+ LGLN+FAD++ +EF KYL Q P + + + +K P S D
Sbjct: 85 PHSHRLGLNKFADITPQEFSKKYL----QAPKDVSQQIKMANKKMKKEQYSCDHPPASWD 140
Query: 139 WRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNG 198
WRKKG +T VK QG CG WAFS A+E + I +G+L SLSEQEL+DC + G
Sbjct: 141 WRKKGVITQVKYQGGCGRGWAFSATGAIEAAHAIATGDLVSLSEQELVDC-VEESEGSYN 199
Query: 199 GLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPEND------- 251
G +F++++ GG+ ++DYPY +EG C+ K + + VTI GY+ + +D
Sbjct: 200 GWQYQSFEWVLEHGGIATDDDYPYRAKEGRCKANKIQ-DKVTIDGYETLIMSDESTESET 258
Query: 252 EQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTG-----PCGAELDHGVAAVGYGKSKGS 306
EQ+ L A+ QP+SV+I+A DF Y+GG++ G P G ++H V VGYG + G
Sbjct: 259 EQAFLSAILEQPISVSIDAK--DFHLYTGGIYDGENCTSPYG--INHFVLLVGYGSADGV 314
Query: 307 DYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLKK 350
DY I KNSWG WGE GYI ++RNTG G+CG+N AS P K+
Sbjct: 315 DYWIAKNSWGEDWGEDGYIWIQRNTGNLLGVCGMNYFASYPTKE 358
>gi|339765072|gb|AEK01110.1| cathepsin L [Cristaria plicata]
gi|397880684|gb|AFO67888.1| cathepsin L [Cristaria plicata]
Length = 333
Score = 239 bits (610), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 141/327 (43%), Positives = 193/327 (59%), Gaps = 23/327 (7%)
Query: 37 HLTSMDKL----IEL-FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKE----VT 87
HL S D L + + ++ ++ H KTY EE L R+ ++KEN+ I++ N + V
Sbjct: 14 HLKSADGLSVSALNIGWQEFVRTHNKTYSAHEE-LFRYAVWKENVLAINRHNSKADQGVH 72
Query: 88 SYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTP 147
+YWL +NE+ D+++EE+ G R S F Y ++ P+ VDWR+KG VT
Sbjct: 73 TYWLSMNEYGDLTNEEYFRLRTGFIMNGNIERSGSI-FKYTNLSEYPRQVDWRRKGYVTR 131
Query: 148 VKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSF---NNGCNGGLMDYA 204
VK+QG CGSC+AFS A+EG + +G L SLSEQ ++DC SF N GC GGLMD +
Sbjct: 132 VKDQGGCGSCYAFSATGALEGQHFRKTGKLVSLSEQNIVDC--SFKEGNKGCKGGLMDKS 189
Query: 205 FKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QP 263
F YI + G+ KEE YPY +G C ++ E+ T GY D+PENDE +L A+A P
Sbjct: 190 FTYIKNNNGIDKEEAYPYEARDGPCRFRRSEVG-ATDRGYVDLPENDETALRHAVATIGP 248
Query: 264 VSVAIEASGTDFQFYSGGVFTGP-CG-AELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGE 321
+SVAI+ +F+FY GVF P C +++HGV VGYG G DY +VKNSWG WG
Sbjct: 249 ISVAIDGHHFNFRFYDHGVFDNPNCSKTKINHGVLVVGYGTRNGLDYWMVKNSWGRGWGA 308
Query: 322 RGYIRMKRNTGKPEGLCGINKMASIPL 348
+GYI M RN + C I AS P+
Sbjct: 309 KGYILMSRNN---DNQCCIACAASYPI 332
>gi|118140100|gb|ABK63481.1| cathepsin S [Channa argus]
Length = 335
Score = 239 bits (610), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 132/308 (42%), Positives = 175/308 (56%), Gaps = 14/308 (4%)
Query: 48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKE----VTSYWLGLNEFADMSHEE 103
++ W H K Y+ E HR E++++NLK I N E + +Y LG+N+ D++ EE
Sbjct: 34 WQMWKKTHNKMYQNEVEDAHRRELWEKNLKFISMHNLEASMGIHTYELGMNQMGDLTQEE 93
Query: 104 FKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTV 163
Y L+P R P F+ + A P ++DWR G VT VKNQGSCGSCWAFS V
Sbjct: 94 ILKTYATLRPPTDVHRTP---FTRKSGVAAPGAMDWRDLGCVTSVKNQGSCGSCWAFSAV 150
Query: 164 AAVEGINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKEEDYPY 222
A+EG +G L LS Q L+DC + N+GC+GG M AF+Y++ + G+ E YPY
Sbjct: 151 GALEGQLAKTTGKLVDLSPQNLVDCSGKYGNHGCDGGFMTNAFQYVIENQGIESEASYPY 210
Query: 223 LMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAIEASGTDFQFYSGG 281
+ E C EE S Y +PE DE++L +A+A P+SVAI+AS F FYS G
Sbjct: 211 IGLEQQCHYNPEE-SAANCSQYHFLPEKDEEALKEAIATIGPISVAIDASKPTFTFYSSG 269
Query: 282 VFTGP-CGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGI 340
V+ P C ++HGV AVGYG D +VKNSWG +G+ GYIRM RN G CGI
Sbjct: 270 VYDDPTCSEVINHGVLAVGYGTQSTQDSWLVKNSWGTYFGDSGYIRMSRNKGNQ---CGI 326
Query: 341 NKMASIPL 348
PL
Sbjct: 327 ALYGCYPL 334
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.316 0.134 0.406
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,763,333,599
Number of Sequences: 23463169
Number of extensions: 251148388
Number of successful extensions: 674933
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 6006
Number of HSP's successfully gapped in prelim test: 1496
Number of HSP's that attempted gapping in prelim test: 646199
Number of HSP's gapped (non-prelim): 8758
length of query: 350
length of database: 8,064,228,071
effective HSP length: 143
effective length of query: 207
effective length of database: 9,003,962,200
effective search space: 1863820175400
effective search space used: 1863820175400
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 77 (34.3 bits)