BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 018649
(352 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255539310|ref|XP_002510720.1| cysteine protease, putative [Ricinus communis]
gi|223551421|gb|EEF52907.1| cysteine protease, putative [Ricinus communis]
Length = 349
Score = 590 bits (1522), Expect = e-166, Method: Compositional matrix adjust.
Identities = 283/352 (80%), Positives = 314/352 (89%), Gaps = 3/352 (0%)
Query: 1 MALSSQFKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYE 60
M+ SS ++ +SF S FARD SIVGY+PEDLTSNDKLIDLFESW+S+F +VYE
Sbjct: 1 MSPSSYSFLFFLAVSLSFLAYSGFARD-SIVGYAPEDLTSNDKLIDLFESWISRFGRVYE 59
Query: 61 SLDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKD 120
S +EKLERFEIFKDNL HID+TN+K++NYWLGLNEFADL HEEFK +LGLKPDL++R
Sbjct: 60 SAEEKLERFEIFKDNLFHIDDTNKKVRNYWLGLNEFADLSHEEFKNKYLGLKPDLSKRA- 118
Query: 121 QSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLAS 180
Q E+F+YKDV +PKSVDWRKKGAVT VKNQGSCGSCWAFSTVAAVEGINQIVTGNL S
Sbjct: 119 QCPEEFTYKDVA-IPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVTGNLTS 177
Query: 181 LSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVV 240
LSEQELIDCD TYNNGCNGGLMDYAF YIV+ GGLHKEEDYPYIMEEGTC+M K ES+ V
Sbjct: 178 LSEQELIDCDTTYNNGCNGGLMDYAFAYIVANGGLHKEEDYPYIMEEGTCDMRKEESDAV 237
Query: 241 TINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVG 300
TI+GYHDVPQNSE+SLLKALANQPLS+AIEASGRDFQFYSGGV+DGHCGT+LDHGVAAVG
Sbjct: 238 TISGYHDVPQNSEESLLKALANQPLSIAIEASGRDFQFYSGGVFDGHCGTELDHGVAAVG 297
Query: 301 YGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIKKK 352
YG+++GLDYIIVKNSWGPKWGEKGYIRMKR T KPEG+CGI KMASYP KKK
Sbjct: 298 YGTSKGLDYIIVKNSWGPKWGEKGYIRMKRKTSKPEGICGIYKMASYPTKKK 349
>gi|224083362|ref|XP_002306996.1| predicted protein [Populus trichocarpa]
gi|222856445|gb|EEE93992.1| predicted protein [Populus trichocarpa]
Length = 336
Score = 590 bits (1522), Expect = e-166, Method: Compositional matrix adjust.
Identities = 277/337 (82%), Positives = 306/337 (90%), Gaps = 1/337 (0%)
Query: 16 ISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDN 75
+SFF S ARDFSIVGY+PEDLTS D++IDLFESW+SK +K+YES++EK RFEIFKDN
Sbjct: 1 MSFFASSCLARDFSIVGYAPEDLTSRDRIIDLFESWISKHQKIYESIEEKWHRFEIFKDN 60
Query: 76 LRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVDLP 135
L HIDETN+K+ NYWLGLNEFADL HEEFK +LGL DL+ R++ S E+F+YKDV +P
Sbjct: 61 LFHIDETNKKVVNYWLGLNEFADLSHEEFKNKYLGLNVDLSNRRECS-EEFTYKDVSSIP 119
Query: 136 KSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNN 195
KSVDWRKKGAVT VKNQGSCGSCWAFSTVAAVEGINQIVTGNL SLSEQEL+DCD TYNN
Sbjct: 120 KSVDWRKKGAVTDVKNQGSCGSCWAFSTVAAVEGINQIVTGNLTSLSEQELVDCDTTYNN 179
Query: 196 GCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDS 255
GCNGGLMDYAF YI+S GGLHKEEDYPYIMEEGTCEM K ESEVVTI+GYHDVPQNSE+S
Sbjct: 180 GCNGGLMDYAFAYIISNGGLHKEEDYPYIMEEGTCEMRKAESEVVTISGYHDVPQNSEES 239
Query: 256 LLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNS 315
LLKALANQPLSVAI+ASGRDFQFYSGGV+DGHCGT+LDHGVAAVGYGS +GLD+I+VKNS
Sbjct: 240 LLKALANQPLSVAIDASGRDFQFYSGGVFDGHCGTELDHGVAAVGYGSAKGLDFIVVKNS 299
Query: 316 WGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIKKK 352
WG KWGEKG+IRMKRNTGKP GLCGINKMASYP KKK
Sbjct: 300 WGSKWGEKGFIRMKRNTGKPAGLCGINKMASYPTKKK 336
>gi|224065647|ref|XP_002301901.1| predicted protein [Populus trichocarpa]
gi|222843627|gb|EEE81174.1| predicted protein [Populus trichocarpa]
Length = 336
Score = 586 bits (1510), Expect = e-165, Method: Compositional matrix adjust.
Identities = 280/337 (83%), Positives = 303/337 (89%), Gaps = 1/337 (0%)
Query: 16 ISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDN 75
+SFF S ARDFSIVGY+PEDLTS DK+IDLFESW+SK K+YES++EK RFEIFKDN
Sbjct: 1 MSFFANSGLARDFSIVGYTPEDLTSGDKIIDLFESWISKHGKIYESIEEKWLRFEIFKDN 60
Query: 76 LRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVDLP 135
L HIDETN+K+ NYWLGLNEF+DL HEEFK +LGLK D++ R++ S E F+YKDV+ +P
Sbjct: 61 LFHIDETNKKVVNYWLGLNEFSDLSHEEFKNKYLGLKVDMSERRECSQE-FNYKDVMSIP 119
Query: 136 KSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNN 195
KSVDWRKKGAVT VKNQGSCGSCWAFSTVAAVEGINQIVTGNL SLSEQEL+DCD T N
Sbjct: 120 KSVDWRKKGAVTDVKNQGSCGSCWAFSTVAAVEGINQIVTGNLTSLSEQELVDCDTTNNY 179
Query: 196 GCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDS 255
GCNGGLMDYAF YI+S GGLHKE DYPYIMEEGTCEM K ESEVVTI+GYHDVPQNSE+S
Sbjct: 180 GCNGGLMDYAFSYIISNGGLHKEVDYPYIMEEGTCEMRKEESEVVTISGYHDVPQNSEES 239
Query: 256 LLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNS 315
LLKALANQPLSVAIEASGRDFQFYSGGV+DGHCGTQLDHGVAAVGYGST GLDYIIVKNS
Sbjct: 240 LLKALANQPLSVAIEASGRDFQFYSGGVFDGHCGTQLDHGVAAVGYGSTNGLDYIIVKNS 299
Query: 316 WGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIKKK 352
WG KWGEKGYIRMKRNTGKP GLCGINKMASYP KKK
Sbjct: 300 WGSKWGEKGYIRMKRNTGKPAGLCGINKMASYPTKKK 336
>gi|356508490|ref|XP_003522989.1| PREDICTED: xylem cysteine proteinase 2-like [Glycine max]
Length = 349
Score = 571 bits (1472), Expect = e-160, Method: Compositional matrix adjust.
Identities = 277/353 (78%), Positives = 308/353 (87%), Gaps = 5/353 (1%)
Query: 1 MALS-SQFKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVY 59
MA S S+ + SFC+ F +F RDFSIVGYS EDL S DKLI+LFESWMSK K+Y
Sbjct: 1 MAFSFSKALVLACSFCL--FASLAFGRDFSIVGYSSEDLKSMDKLIELFESWMSKHGKIY 58
Query: 60 ESLDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRK 119
+S++EKL RFEIFKDNL+HIDE N+ + NYWLGLNEFADL H+EFK +LGLK D +RR+
Sbjct: 59 QSIEEKLLRFEIFKDNLKHIDERNKVVSNYWLGLNEFADLSHQEFKNKYLGLKVDYSRRR 118
Query: 120 DQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLA 179
+S E+F+YKDV +LPKSVDWRKKGAV VKNQGSCGSCWAFSTVAAVEGINQIVTGNL
Sbjct: 119 -ESPEEFTYKDV-ELPKSVDWRKKGAVAPVKNQGSCGSCWAFSTVAAVEGINQIVTGNLT 176
Query: 180 SLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEV 239
SLSEQELIDCD TYNNGCNGGLMDYAF +IV GGLHKEEDYPYIMEEGTCEMTK E+EV
Sbjct: 177 SLSEQELIDCDRTYNNGCNGGLMDYAFSFIVENGGLHKEEDYPYIMEEGTCEMTKEETEV 236
Query: 240 VTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAV 299
VTI+GYHDVPQN+E SLLKALANQPLSVAIEASGRDFQFYSGGV+DGHCG+ LDHGVAAV
Sbjct: 237 VTISGYHDVPQNNEQSLLKALANQPLSVAIEASGRDFQFYSGGVFDGHCGSDLDHGVAAV 296
Query: 300 GYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIKKK 352
GYG+ +G+DYIIVKNSWG KWGEKGYIRM+RN GKPEG+CGI KMASYP KKK
Sbjct: 297 GYGTAKGVDYIIVKNSWGSKWGEKGYIRMRRNIGKPEGICGIYKMASYPTKKK 349
>gi|357467173|ref|XP_003603871.1| Cysteine proteinase [Medicago truncatula]
gi|355492919|gb|AES74122.1| Cysteine proteinase [Medicago truncatula]
gi|388499154|gb|AFK37643.1| unknown [Medicago truncatula]
Length = 350
Score = 571 bits (1471), Expect = e-160, Method: Compositional matrix adjust.
Identities = 266/345 (77%), Positives = 305/345 (88%), Gaps = 1/345 (0%)
Query: 8 KTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLE 67
KT++++ + F+ +F RDFSIVGYS EDL S DKLI+LFESWMS+ K+YE+++EKL
Sbjct: 7 KTLVLTCSLCLFLSLAFGRDFSIVGYSSEDLKSMDKLIELFESWMSRHGKIYETIEEKLL 66
Query: 68 RFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFS 127
RFE+FKDNL+HID+ N+ + NYWLGLNEFADL H+EFK +LGLK DL++R++ S E+F+
Sbjct: 67 RFEVFKDNLKHIDDRNKVVSNYWLGLNEFADLSHQEFKNKYLGLKVDLSQRRESSEEEFT 126
Query: 128 YKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELI 187
Y+DV DLPKSVDWRKKGAVT VKNQG CGSCWAFSTVAAVEGINQIVTGNL SLSEQELI
Sbjct: 127 YRDV-DLPKSVDWRKKGAVTPVKNQGQCGSCWAFSTVAAVEGINQIVTGNLTSLSEQELI 185
Query: 188 DCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHD 247
DCD TYNNGCNGGLMDYAF +IV GGLHKEEDYPYIMEE TCEM K SEVVTINGYHD
Sbjct: 186 DCDTTYNNGCNGGLMDYAFSFIVKNGGLHKEEDYPYIMEESTCEMKKEVSEVVTINGYHD 245
Query: 248 VPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGL 307
VPQN+E SLLKALANQPLSVAIEASGRDFQFYSGGV+DGHCG++LDHGV+AVGYG+++GL
Sbjct: 246 VPQNNEQSLLKALANQPLSVAIEASGRDFQFYSGGVFDGHCGSELDHGVSAVGYGTSKGL 305
Query: 308 DYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIKKK 352
DYIIVKNSWG KWGEKG+IRMKRN GK EG+CG+ KMASYP KKK
Sbjct: 306 DYIIVKNSWGAKWGEKGFIRMKRNIGKSEGICGLYKMASYPTKKK 350
>gi|356517188|ref|XP_003527271.1| PREDICTED: xylem cysteine proteinase 2-like [Glycine max]
Length = 350
Score = 569 bits (1467), Expect = e-160, Method: Compositional matrix adjust.
Identities = 272/352 (77%), Positives = 303/352 (86%), Gaps = 2/352 (0%)
Query: 1 MALSSQFKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYE 60
MA SS +LI+ F +F RDFSIVGYS EDL S DKLI+LFESWMS+ K+YE
Sbjct: 1 MAFSSSKALVLIACSFCLFASLAFGRDFSIVGYSSEDLKSMDKLIELFESWMSRHGKIYE 60
Query: 61 SLDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKD 120
+++EKL RFEIFKDNL+HIDE N+ + NYWLGLNEFADL H EF +LGLK D +RR+
Sbjct: 61 NIEEKLLRFEIFKDNLKHIDERNKVVSNYWLGLNEFADLSHREFNNKYLGLKVDYSRRR- 119
Query: 121 QSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLAS 180
+S E+F+YKDV +LPKSVDWRKKGAV VKNQGSCGSCWAFSTVAAVEGINQIVTGNL S
Sbjct: 120 ESPEEFTYKDV-ELPKSVDWRKKGAVAPVKNQGSCGSCWAFSTVAAVEGINQIVTGNLTS 178
Query: 181 LSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVV 240
LSEQELIDCD TYNNGCNGGLMDYAF +IV GGLHKEEDYPYIMEEGTCEMTK E++VV
Sbjct: 179 LSEQELIDCDRTYNNGCNGGLMDYAFSFIVENGGLHKEEDYPYIMEEGTCEMTKEETQVV 238
Query: 241 TINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVG 300
TI+GYHDVPQN+E SLLKALANQPLSVAIEASGRDFQFYSGGV+DGHCG+ LDHGVAAVG
Sbjct: 239 TISGYHDVPQNNEQSLLKALANQPLSVAIEASGRDFQFYSGGVFDGHCGSDLDHGVAAVG 298
Query: 301 YGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIKKK 352
YG+ +G+DYI VKNSWG KWGEKGYIRM+RN GKPEG+CGI KMASYP KKK
Sbjct: 299 YGTAKGVDYITVKNSWGSKWGEKGYIRMRRNIGKPEGICGIYKMASYPTKKK 350
>gi|356517184|ref|XP_003527269.1| PREDICTED: xylem cysteine proteinase 2-like [Glycine max]
Length = 350
Score = 567 bits (1462), Expect = e-159, Method: Compositional matrix adjust.
Identities = 270/352 (76%), Positives = 304/352 (86%), Gaps = 4/352 (1%)
Query: 1 MALSSQFKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYE 60
+ S + + SFC+ F +F RDFSIVGYS EDL S DKLI+LFESW+S+ K+Y+
Sbjct: 3 FSTSKALRVLACSFCL--FASFTFGRDFSIVGYSSEDLKSMDKLIELFESWISRHGKIYQ 60
Query: 61 SLDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKD 120
S++EKL RFEIFKDNL+HIDE N+ + NYWLGLNEFADL H+EFK +LGLK D +RR+
Sbjct: 61 SIEEKLHRFEIFKDNLKHIDERNKVVSNYWLGLNEFADLSHQEFKNKYLGLKVDYSRRR- 119
Query: 121 QSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLAS 180
+S E+F+YKDV +LPKSVDWRKKGAVT VKNQGSCGSCWAFSTVAAVEGINQIVTGNL S
Sbjct: 120 ESPEEFTYKDV-ELPKSVDWRKKGAVTQVKNQGSCGSCWAFSTVAAVEGINQIVTGNLTS 178
Query: 181 LSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVV 240
LSEQELIDCD TYNNGCNGGLMDYAF +IV GLHKEEDYPYIMEEGTCEM K E+EVV
Sbjct: 179 LSEQELIDCDRTYNNGCNGGLMDYAFSFIVENDGLHKEEDYPYIMEEGTCEMAKEETEVV 238
Query: 241 TINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVG 300
TI+GYHDVPQN+E SLLKALANQPLSVAIEASGRDFQFYSGGV+DGHCG+ LDHGVAAVG
Sbjct: 239 TISGYHDVPQNNEQSLLKALANQPLSVAIEASGRDFQFYSGGVFDGHCGSDLDHGVAAVG 298
Query: 301 YGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIKKK 352
YG+ +G+DYI VKNSWG KWGEKGYIRM+RN GKPEG+CGI KMASYP KKK
Sbjct: 299 YGTAKGVDYITVKNSWGSKWGEKGYIRMRRNIGKPEGICGIYKMASYPTKKK 350
>gi|255646767|gb|ACU23856.1| unknown [Glycine max]
Length = 350
Score = 565 bits (1457), Expect = e-159, Method: Compositional matrix adjust.
Identities = 270/352 (76%), Positives = 302/352 (85%), Gaps = 2/352 (0%)
Query: 1 MALSSQFKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYE 60
MA SS +LI+ F +F RDFSIVGYS EDL S DKLI+LFESWMS+ K+YE
Sbjct: 1 MAFSSSKALVLIACSFCLFASLAFGRDFSIVGYSSEDLKSMDKLIELFESWMSRHGKIYE 60
Query: 61 SLDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKD 120
+++EKL RFEIFKDNL+HIDE N+ + NYWLGL+EFADL H EF +LGLK D +RR+
Sbjct: 61 NIEEKLLRFEIFKDNLKHIDERNKVVSNYWLGLSEFADLSHREFNNKYLGLKVDYSRRR- 119
Query: 121 QSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLAS 180
+S E+F+YKDV +LPKSVDWRKKGAV VKNQGSCGSCWAFSTVAAVEGINQIVTGNL S
Sbjct: 120 ESPEEFTYKDV-ELPKSVDWRKKGAVAPVKNQGSCGSCWAFSTVAAVEGINQIVTGNLTS 178
Query: 181 LSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVV 240
LSEQELIDCD TYNNGCNGGLMDYAF +IV GGLHKEEDYPYIMEEG CEMTK E++VV
Sbjct: 179 LSEQELIDCDRTYNNGCNGGLMDYAFSFIVENGGLHKEEDYPYIMEEGACEMTKEETQVV 238
Query: 241 TINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVG 300
TI+GYHDVPQN+E SLLKALANQPLSVAIEASGRDFQFYSGGV+DGHCG+ LDHGVAAVG
Sbjct: 239 TISGYHDVPQNNEQSLLKALANQPLSVAIEASGRDFQFYSGGVFDGHCGSDLDHGVAAVG 298
Query: 301 YGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIKKK 352
YG+ +G+DYI VKNSWG KWGEKGYIRM+RN GKPEG+CGI KMASYP KKK
Sbjct: 299 YGTAKGVDYITVKNSWGSKWGEKGYIRMRRNIGKPEGICGIYKMASYPTKKK 350
>gi|449522968|ref|XP_004168497.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
Length = 348
Score = 565 bits (1456), Expect = e-158, Method: Compositional matrix adjust.
Identities = 267/352 (75%), Positives = 309/352 (87%), Gaps = 4/352 (1%)
Query: 1 MALSSQFKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYE 60
MALS K + ++ C+SFF+ +SF +DFSIVGY PEDLTS D+LI+LFE W+S K+YE
Sbjct: 1 MALS---KLLPLAMCMSFFVVTSFGKDFSIVGYWPEDLTSMDRLIELFEEWISNHGKIYE 57
Query: 61 SLDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKD 120
+++EK RFE+FKDNL+HIDETN+K+ +YWLG+NEFADL H+EFK M+LGLK + +R +
Sbjct: 58 TIEEKWHRFEVFKDNLKHIDETNKKVTSYWLGVNEFADLTHQEFKNMYLGLKVESSRTR- 116
Query: 121 QSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLAS 180
QS E+F+YKDVVDLPKSVDWRKKGAVT VKNQGSCGSCWAFSTVAAVEGIN+IV GNL S
Sbjct: 117 QSPEEFTYKDVVDLPKSVDWRKKGAVTRVKNQGSCGSCWAFSTVAAVEGINKIVGGNLTS 176
Query: 181 LSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVV 240
LSEQELIDCD YNNGC+GGLMDYAF +IVS+GGLHKEEDYPY+ E TC+ KGE EVV
Sbjct: 177 LSEQELIDCDRPYNNGCHGGLMDYAFSFIVSSGGLHKEEDYPYLEVESTCDNKKGELEVV 236
Query: 241 TINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVG 300
TI+GY DVP+N+E SL+KALA+QPLSVAIEASGRDFQFYSGGV+DG CGTQLDHGV AVG
Sbjct: 237 TISGYKDVPENNEASLIKALAHQPLSVAIEASGRDFQFYSGGVFDGPCGTQLDHGVTAVG 296
Query: 301 YGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIKKK 352
YGS++G+DYIIVKNSWGPKWGEKGYIRMKRNTGKP GLCGINKMASYP K K
Sbjct: 297 YGSSKGVDYIIVKNSWGPKWGEKGYIRMKRNTGKPAGLCGINKMASYPTKSK 348
>gi|356508487|ref|XP_003522988.1| PREDICTED: xylem cysteine proteinase 2-like [Glycine max]
Length = 349
Score = 565 bits (1455), Expect = e-158, Method: Compositional matrix adjust.
Identities = 270/352 (76%), Positives = 305/352 (86%), Gaps = 3/352 (0%)
Query: 1 MALSSQFKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYE 60
MA SS K + ++ F + A DFSIVGYS EDL S DKLI+LFESWMS+ K+Y+
Sbjct: 1 MAFSSS-KALFLACSFCLFASLAVAGDFSIVGYSSEDLKSMDKLIELFESWMSRHGKIYQ 59
Query: 61 SLDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKD 120
S++EKL RF+IFKDNL+HIDE N+ + NYWLGLNEFADL H+EFK +LGLK D +RR+
Sbjct: 60 SIEEKLHRFDIFKDNLKHIDERNKVVSNYWLGLNEFADLSHQEFKNKYLGLKVDYSRRR- 118
Query: 121 QSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLAS 180
+S E+F+YKD +LPKSVDWRKKGAVT VKNQGSCGSCWAFSTVAAVEGINQIVTGNL S
Sbjct: 119 ESPEEFTYKDF-ELPKSVDWRKKGAVTQVKNQGSCGSCWAFSTVAAVEGINQIVTGNLTS 177
Query: 181 LSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVV 240
LSEQELIDCD TYNNGCNGGLMDYAF +IV GGLHKEEDYPYIMEEGTCEMTK E+EVV
Sbjct: 178 LSEQELIDCDRTYNNGCNGGLMDYAFSFIVENGGLHKEEDYPYIMEEGTCEMTKEETEVV 237
Query: 241 TINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVG 300
TI+GYHDVPQN+E SLLKAL NQPLSVAIEASGRDFQFYSGGV+DGHCG+ LDHGVAAVG
Sbjct: 238 TISGYHDVPQNNEQSLLKALVNQPLSVAIEASGRDFQFYSGGVFDGHCGSDLDHGVAAVG 297
Query: 301 YGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIKKK 352
YG+++G++YIIVKNSWG KWGEKGYIRM+RN GKPEG+CGI KMASYP KKK
Sbjct: 298 YGTSKGVNYIIVKNSWGSKWGEKGYIRMRRNIGKPEGICGIYKMASYPTKKK 349
>gi|224131910|ref|XP_002328138.1| predicted protein [Populus trichocarpa]
gi|222837653|gb|EEE76018.1| predicted protein [Populus trichocarpa]
Length = 349
Score = 564 bits (1453), Expect = e-158, Method: Compositional matrix adjust.
Identities = 267/352 (75%), Positives = 305/352 (86%), Gaps = 3/352 (0%)
Query: 1 MALSSQFKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYE 60
MALS KT ++F S F+ S A DFSIVGYSPE LTS DKL++LFESW+S K Y
Sbjct: 1 MALSV-LKTSFLTFFASLFVCSVLAHDFSIVGYSPEHLTSVDKLVELFESWISGHGKAYN 59
Query: 61 SLDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKD 120
SL+EKL RFE+FK+NL+HID+ N+++ +YWLGLNEFADL HEEFK FLGL P+ R+K
Sbjct: 60 SLEEKLHRFEVFKENLKHIDQRNKEVTSYWLGLNEFADLSHEEFKSKFLGLYPEFPRKK- 118
Query: 121 QSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLAS 180
S EDFSY+DVVDLPKS+DWRKKGAVT VKNQGSCGSCWAFSTVAAVEGINQIV GNL S
Sbjct: 119 -SSEDFSYRDVVDLPKSIDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVAGNLTS 177
Query: 181 LSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVV 240
LSEQ+LIDCD ++NNGCNGGLMDYAF++IV+ GGLHKEEDYPY+MEEGTC+ + E EVV
Sbjct: 178 LSEQQLIDCDTSFNNGCNGGLMDYAFEFIVNNGGLHKEEDYPYLMEEGTCDEKREEMEVV 237
Query: 241 TINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVG 300
TI+GYHDVP+N E SLLKALA+QPLSVAI+ASGRDFQFYSGGV+ G CGT LDHGVAAVG
Sbjct: 238 TISGYHDVPRNDEQSLLKALAHQPLSVAIDASGRDFQFYSGGVFSGPCGTDLDHGVAAVG 297
Query: 301 YGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIKKK 352
YGS+ G+DYIIVKNSWGPKWGE+GY+RMKRNTGKPEGLCGINKMASYP K+K
Sbjct: 298 YGSSSGIDYIIVKNSWGPKWGERGYLRMKRNTGKPEGLCGINKMASYPTKQK 349
>gi|37780041|gb|AAP32193.1| cysteine protease 14 [Trifolium repens]
Length = 351
Score = 564 bits (1453), Expect = e-158, Method: Compositional matrix adjust.
Identities = 263/346 (76%), Positives = 305/346 (88%), Gaps = 2/346 (0%)
Query: 8 KTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLE 67
KT++++ + F+ +F RDFSIVGYS EDL S DKLI+LFESWMS+ K+YE+++EKL
Sbjct: 7 KTLVLTCSLCLFLSLAFGRDFSIVGYSSEDLKSMDKLIELFESWMSRHGKIYETIEEKLL 66
Query: 68 RFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHED-F 126
RFE+FKDNL+HIDE N+ + NYWLGLNEFADL H+EFK +LGLK +L++R++ S+E+ F
Sbjct: 67 RFEVFKDNLKHIDERNKIVSNYWLGLNEFADLSHQEFKNKYLGLKVNLSQRRESSNEEEF 126
Query: 127 SYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQEL 186
+Y+DV DLPKSVDWRKKGAVT VKNQG CGSCWAFSTVAAVEGINQIVTGNL SLSEQEL
Sbjct: 127 TYRDV-DLPKSVDWRKKGAVTPVKNQGQCGSCWAFSTVAAVEGINQIVTGNLTSLSEQEL 185
Query: 187 IDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYH 246
IDCD TYNNGCNGGLMDYAF +IV GGLHKE+DYPYIMEE TCEM K E++VVTINGYH
Sbjct: 186 IDCDTTYNNGCNGGLMDYAFSFIVQNGGLHKEDDYPYIMEESTCEMKKEETQVVTINGYH 245
Query: 247 DVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRG 306
DVPQN+E SLLKALANQPLSVAIEAS RDFQFYSGGV+DGHCG+ LDHGV+AVGYG+++
Sbjct: 246 DVPQNNEQSLLKALANQPLSVAIEASSRDFQFYSGGVFDGHCGSDLDHGVSAVGYGTSKN 305
Query: 307 LDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIKKK 352
LDYIIVKNSWG KWGEKG+IRMKRN GKPEG+CG+ KMASYP KKK
Sbjct: 306 LDYIIVKNSWGAKWGEKGFIRMKRNIGKPEGICGLYKMASYPTKKK 351
>gi|449455625|ref|XP_004145553.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
Length = 351
Score = 562 bits (1449), Expect = e-158, Method: Compositional matrix adjust.
Identities = 265/352 (75%), Positives = 306/352 (86%), Gaps = 1/352 (0%)
Query: 1 MALSSQFKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYE 60
MA S ++ C+SFF+ +SF +DFSIVGY PEDLTS D+LI+LFE W+S K+YE
Sbjct: 1 MAPSPYSFYFFLAMCMSFFVVTSFGKDFSIVGYWPEDLTSMDRLIELFEEWISNHGKIYE 60
Query: 61 SLDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKD 120
+++EK RFE+FKDNL+HIDETN+K+ +YWLG+NEFADL H+EFK M+LGLK + +R +
Sbjct: 61 TIEEKWHRFEVFKDNLKHIDETNKKVTSYWLGVNEFADLTHQEFKNMYLGLKVESSRTR- 119
Query: 121 QSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLAS 180
QS E+F+YKDVVDLPKSVDWRKKGAVT VKNQGSCGSCWAFSTVAAVEGIN+IV GNL S
Sbjct: 120 QSPEEFTYKDVVDLPKSVDWRKKGAVTRVKNQGSCGSCWAFSTVAAVEGINKIVGGNLTS 179
Query: 181 LSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVV 240
LSEQELIDCD YNNGC+GGLMDYAF +IVS+GGLHKEEDYPY+ E TC+ KGE EVV
Sbjct: 180 LSEQELIDCDRPYNNGCHGGLMDYAFSFIVSSGGLHKEEDYPYLEVESTCDNKKGELEVV 239
Query: 241 TINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVG 300
TI+GY DVP+N+E SL+KALA+QPLSVAIEASGRDFQFYSGGV+DG CGTQLDHGV AVG
Sbjct: 240 TISGYKDVPENNEASLIKALAHQPLSVAIEASGRDFQFYSGGVFDGPCGTQLDHGVTAVG 299
Query: 301 YGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIKKK 352
YGS++G+DYIIVKNSWGPKWGEKGYIRMKRNTGKP GLCGINKMASYP K K
Sbjct: 300 YGSSKGVDYIIVKNSWGPKWGEKGYIRMKRNTGKPAGLCGINKMASYPTKSK 351
>gi|359491865|ref|XP_002273243.2| PREDICTED: xylem cysteine proteinase 1-like [Vitis vinifera]
Length = 351
Score = 562 bits (1448), Expect = e-157, Method: Compositional matrix adjust.
Identities = 265/352 (75%), Positives = 301/352 (85%), Gaps = 1/352 (0%)
Query: 1 MALSSQFKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYE 60
MALS L+ ++ F S+FARDFSIVGYSP+DLTS DKL DLFESWMSK K Y
Sbjct: 1 MALSPFSNFFLLFISMAVFAYSAFARDFSIVGYSPDDLTSMDKLTDLFESWMSKHGKSYR 60
Query: 61 SLDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKD 120
S +EKL RFE+F+DNL+HIDETN+K+ +YWLGLNEFADL HEEFK +LGLK +L +R+D
Sbjct: 61 SFEEKLHRFEVFQDNLKHIDETNKKVSSYWLGLNEFADLSHEEFKRKYLGLKIELPKRRD 120
Query: 121 QSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLAS 180
S E+FSYKDV DLPKSVDWRKKGAV HVKNQG+CGSCWAFSTVAAVEGINQIVTGNL +
Sbjct: 121 -SPEEFSYKDVADLPKSVDWRKKGAVAHVKNQGACGSCWAFSTVAAVEGINQIVTGNLTA 179
Query: 181 LSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVV 240
LSEQELIDCD +NNGCNGGLMDYAF +I+S GGL KEEDYPY+MEEGTC K E EVV
Sbjct: 180 LSEQELIDCDKPFNNGCNGGLMDYAFAFIISNGGLRKEEDYPYVMEEGTCGEKKEELEVV 239
Query: 241 TINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVG 300
TI+GYHDVP+++E S LKALANQPLSVAIEAS R FQFYSGG+++GHCGT+LDHGVAAVG
Sbjct: 240 TISGYHDVPEDNEQSFLKALANQPLSVAIEASSRGFQFYSGGIFNGHCGTELDHGVAAVG 299
Query: 301 YGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIKKK 352
YG+++G+DYI VKNSWG KWGEKGYIRMKRN GKPEG+CGI KMASYP K K
Sbjct: 300 YGTSKGVDYITVKNSWGSKWGEKGYIRMKRNVGKPEGICGIYKMASYPTKNK 351
>gi|37780039|gb|AAP32192.1| cysteine protease 14 [Trifolium repens]
Length = 351
Score = 560 bits (1443), Expect = e-157, Method: Compositional matrix adjust.
Identities = 262/346 (75%), Positives = 304/346 (87%), Gaps = 2/346 (0%)
Query: 8 KTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLE 67
KT++++ + F+ +F RDFSIVGYS EDL S DKLI+LFESWMS+ K+YE+++EKL
Sbjct: 7 KTLVLTCSLCLFLSLAFGRDFSIVGYSSEDLKSMDKLIELFESWMSRHGKIYETIEEKLL 66
Query: 68 RFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHED-F 126
RFE+FKDNL+HID+ N+ + NYWLGLNEFADL H+EFK +LGLK DL++R++ S+E+ F
Sbjct: 67 RFEVFKDNLKHIDDRNKIVSNYWLGLNEFADLSHQEFKNKYLGLKVDLSQRRESSNEEEF 126
Query: 127 SYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQEL 186
+Y+DV DLPKSVDWRKKGAVT VKNQG CGSCWAFSTVAAVEGINQIVTGNL SLSEQEL
Sbjct: 127 TYRDV-DLPKSVDWRKKGAVTPVKNQGQCGSCWAFSTVAAVEGINQIVTGNLTSLSEQEL 185
Query: 187 IDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYH 246
IDCD TYNNGCNGGLMDYAF +I GGLHKEEDYPYIMEE TCEM K E++VVTINGYH
Sbjct: 186 IDCDTTYNNGCNGGLMDYAFSFIGQNGGLHKEEDYPYIMEESTCEMKKEETQVVTINGYH 245
Query: 247 DVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRG 306
DVPQN+E SLLKALANQPLSVAIEAS RDFQFYSGGV+DGHCG+ LDHGV+AVGYG+++
Sbjct: 246 DVPQNNEQSLLKALANQPLSVAIEASSRDFQFYSGGVFDGHCGSDLDHGVSAVGYGTSKN 305
Query: 307 LDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIKKK 352
LDYIIVKNSWG KWGEKG+IRMKR+ GKPEG+CG+ KMASYP KKK
Sbjct: 306 LDYIIVKNSWGAKWGEKGFIRMKRDIGKPEGICGLYKMASYPTKKK 351
>gi|30141025|dbj|BAC75926.1| cysteine protease-4 [Helianthus annuus]
Length = 352
Score = 557 bits (1436), Expect = e-156, Method: Compositional matrix adjust.
Identities = 262/348 (75%), Positives = 298/348 (85%), Gaps = 1/348 (0%)
Query: 5 SQFKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDE 64
S KT L +S S+ A +FSI+GY+PEDLTS K+I LFESW++K K+YESLDE
Sbjct: 6 SSKKTSLFLVFVSVLACSALANEFSILGYAPEDLTSIHKVIHLFESWLAKHSKIYESLDE 65
Query: 65 KLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHE 124
KL RFEIF DNL+HID+TN+K+ NYWLGLNEFADL HEEFK FLGLK +L RKD+S E
Sbjct: 66 KLHRFEIFMDNLKHIDDTNKKVSNYWLGLNEFADLTHEEFKNKFLGLKGELPERKDESIE 125
Query: 125 DFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQ 184
+FSY+D VDLPKSVDWRKKGAV VKNQG CGSCWAFSTVAAVEGINQIVTGNL LSEQ
Sbjct: 126 EFSYRDFVDLPKSVDWRKKGAVAPVKNQGQCGSCWAFSTVAAVEGINQIVTGNLTMLSEQ 185
Query: 185 ELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTING 244
ELIDCD T+NNGCNGGLMDYAF Y++ + GLHKEE+YPYIM EGTC+ K SE VTI+G
Sbjct: 186 ELIDCDTTFNNGCNGGLMDYAFAYVMRS-GLHKEEEYPYIMSEGTCDEKKDVSETVTISG 244
Query: 245 YHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGST 304
YHDVP+N+EDS LKALANQP+SVAIEASGRDFQFYSGGV+DGHCGT+LDHGVAAVGYG+T
Sbjct: 245 YHDVPRNNEDSFLKALANQPISVAIEASGRDFQFYSGGVFDGHCGTELDHGVAAVGYGTT 304
Query: 305 RGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIKKK 352
+GLDY+IV+NSWGPKWGEKGYIRMKR TGKP G+CG+ MASYP K+K
Sbjct: 305 KGLDYVIVRNSWGPKWGEKGYIRMKRKTGKPHGMCGLYMMASYPTKQK 352
>gi|297845064|ref|XP_002890413.1| hypothetical protein ARALYDRAFT_472321 [Arabidopsis lyrata subsp. lyrata]
gi|297336255|gb|EFH66672.1| hypothetical protein ARALYDRAFT_472321 [Arabidopsis lyrata subsp. lyrata]
Length = 357
Score = 555 bits (1429), Expect = e-155, Method: Compositional matrix adjust.
Identities = 263/357 (73%), Positives = 306/357 (85%), Gaps = 5/357 (1%)
Query: 1 MALSSQFKTILISFCIS---FFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEK 57
MALSS + + +S + + + D+SIVGYSPEDL S+DKLI+LFE+W+S FEK
Sbjct: 1 MALSSPSRILCFPLALSAATLSLSVAASHDYSIVGYSPEDLESHDKLIELFENWISNFEK 60
Query: 58 VYESLDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLAR 117
YE+++EKL RFE+FKDNL+HIDETN+K+K+YWLGLNEFADL HEEFK+M+LGLK D+ R
Sbjct: 61 AYETVEEKLLRFEVFKDNLKHIDETNKKVKSYWLGLNEFADLSHEEFKKMYLGLKTDIVR 120
Query: 118 R-KDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTG 176
R +++S+ +F+Y+DV +PKSVDWRKKGAV VKNQGSCGSCWAFSTVAAVEGIN+IVTG
Sbjct: 121 RDEERSYAEFAYRDVEAVPKSVDWRKKGAVAEVKNQGSCGSCWAFSTVAAVEGINKIVTG 180
Query: 177 NLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGE 236
NL +LSEQELIDCD TYNNGCNGGLMDYAF+YIV GGL KEEDYPY MEEGTCEM K E
Sbjct: 181 NLTTLSEQELIDCDTTYNNGCNGGLMDYAFEYIVKNGGLRKEEDYPYSMEEGTCEMQKDE 240
Query: 237 SEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSG-GVYDGHCGTQLDHG 295
SE VTI+G+ DVP N E SLLKALA+QPLSVAI+ASGR+FQFYSG V+DG CG LDHG
Sbjct: 241 SETVTIDGHQDVPTNDEKSLLKALAHQPLSVAIDASGREFQFYSGVSVFDGRCGVDLDHG 300
Query: 296 VAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIKKK 352
VAAVGYGS++G DYIIVKNSWGPKWGEKGYIR+KRNTGKPEGLCGINKMAS+P K K
Sbjct: 301 VAAVGYGSSKGSDYIIVKNSWGPKWGEKGYIRLKRNTGKPEGLCGINKMASFPTKTK 357
>gi|18394919|ref|NP_564126.1| Xylem cysteine proteinase 2 [Arabidopsis thaliana]
gi|71153409|sp|Q9LM66.2|XCP2_ARATH RecName: Full=Xylem cysteine proteinase 2; Short=AtXCP2; Flags: Precursor
gi|4836904|gb|AAD30607.1|AC007369_17 Putative cysteine proteinase [Arabidopsis thaliana]
gi|6708183|gb|AAF25832.1|AF191028_1 papain-type cysteine endopeptidase XCP2 [Arabidopsis thaliana]
gi|28466959|gb|AAO44088.1| At1g20850 [Arabidopsis thaliana]
gi|110743795|dbj|BAE99733.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|332191910|gb|AEE30031.1| Xylem cysteine proteinase 2 [Arabidopsis thaliana]
Length = 356
Score = 551 bits (1419), Expect = e-154, Method: Compositional matrix adjust.
Identities = 258/327 (78%), Positives = 293/327 (89%), Gaps = 1/327 (0%)
Query: 27 DFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKI 86
D+SIVGYSPEDL S+DKLI+LFE+W+S FEK YE+++EK RFE+FKDNL+HIDETN+K
Sbjct: 30 DYSIVGYSPEDLESHDKLIELFENWISNFEKAYETVEEKFLRFEVFKDNLKHIDETNKKG 89
Query: 87 KNYWLGLNEFADLRHEEFKEMFLGLKPDLARR-KDQSHEDFSYKDVVDLPKSVDWRKKGA 145
K+YWLGLNEFADL HEEFK+M+LGLK D+ RR +++S+ +F+Y+DV +PKSVDWRKKGA
Sbjct: 90 KSYWLGLNEFADLSHEEFKKMYLGLKTDIVRRDEERSYAEFAYRDVEAVPKSVDWRKKGA 149
Query: 146 VTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYA 205
V VKNQGSCGSCWAFSTVAAVEGIN+IVTGNL +LSEQELIDCD TYNNGCNGGLMDYA
Sbjct: 150 VAEVKNQGSCGSCWAFSTVAAVEGINKIVTGNLTTLSEQELIDCDTTYNNGCNGGLMDYA 209
Query: 206 FQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPL 265
F+YIV GGL KEEDYPY MEEGTCEM K ESE VTING+ DVP N E SLLKALA+QPL
Sbjct: 210 FEYIVKNGGLRKEEDYPYSMEEGTCEMQKDESETVTINGHQDVPTNDEKSLLKALAHQPL 269
Query: 266 SVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGY 325
SVAI+ASGR+FQFYSGGV+DG CG LDHGVAAVGYGS++G DYIIVKNSWGPKWGEKGY
Sbjct: 270 SVAIDASGREFQFYSGGVFDGRCGVDLDHGVAAVGYGSSKGSDYIIVKNSWGPKWGEKGY 329
Query: 326 IRMKRNTGKPEGLCGINKMASYPIKKK 352
IR+KRNTGKPEGLCGINKMAS+P K K
Sbjct: 330 IRLKRNTGKPEGLCGINKMASFPTKTK 356
>gi|22759715|dbj|BAC10906.1| cysteine proteinase [Zinnia elegans]
Length = 352
Score = 550 bits (1417), Expect = e-154, Method: Compositional matrix adjust.
Identities = 258/345 (74%), Positives = 293/345 (84%), Gaps = 1/345 (0%)
Query: 8 KTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLE 67
KT L+ +S S+ A +FSI+GY+PEDLTS K+I LFESW+ K K YESLDEKL
Sbjct: 9 KTSLLFLFVSILACSALAHEFSILGYAPEDLTSIHKVIHLFESWLVKHSKFYESLDEKLH 68
Query: 68 RFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFS 127
RFEIF DNL+HIDETN+K+ NYWLGLNEFADL HEEFK FLG K +LA RKD+S ++F
Sbjct: 69 RFEIFMDNLKHIDETNKKVSNYWLGLNEFADLTHEEFKHKFLGFKGELAERKDESSKEFG 128
Query: 128 YKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELI 187
Y+D VDLPKSVDWRKKGAV VKNQG CGSCWAFSTVAAVEGINQIVTGNL LSEQELI
Sbjct: 129 YRDFVDLPKSVDWRKKGAVAPVKNQGQCGSCWAFSTVAAVEGINQIVTGNLTMLSEQELI 188
Query: 188 DCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHD 247
DCD T+NNGCNGGLMDYAF Y++ + GLHKEE+YPYIM EGTC+ K SE VTI+GYHD
Sbjct: 189 DCDTTFNNGCNGGLMDYAFAYVMRS-GLHKEEEYPYIMSEGTCDEKKDVSEKVTISGYHD 247
Query: 248 VPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGL 307
VP+N E S LKALANQP+SVAIEASGRDFQFYSGGV+DGHCGT+LDHGVAAVGYG+T+GL
Sbjct: 248 VPRNDEASFLKALANQPISVAIEASGRDFQFYSGGVFDGHCGTELDHGVAAVGYGTTKGL 307
Query: 308 DYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIKKK 352
DY+IV+NSWGPKWGEKGYIRMKR +GKP G+CG+ MASYP K+K
Sbjct: 308 DYVIVRNSWGPKWGEKGYIRMKRGSGKPHGMCGLYMMASYPTKQK 352
>gi|121308860|dbj|BAF43527.1| cysteine proteinase [Zinnia elegans]
Length = 352
Score = 548 bits (1411), Expect = e-153, Method: Compositional matrix adjust.
Identities = 257/345 (74%), Positives = 292/345 (84%), Gaps = 1/345 (0%)
Query: 8 KTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLE 67
KT L+ +S S A +FSI+GY+PEDLTS K+I LFESW+ K K YESLDEKL
Sbjct: 9 KTSLLFLFVSILACSPLAHEFSILGYAPEDLTSIHKVIHLFESWLVKHSKFYESLDEKLH 68
Query: 68 RFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFS 127
RFEIF DNL+HIDETN+K+ NYWLGLNEFADL HEEFK FLG K +LA RKD+S ++F
Sbjct: 69 RFEIFMDNLKHIDETNKKVSNYWLGLNEFADLTHEEFKHKFLGFKGELAERKDESSKEFG 128
Query: 128 YKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELI 187
Y+D VDLPKSVDWRKKGAV VKNQG CG+CWAFSTVAAVEGINQIVTGNL LSEQELI
Sbjct: 129 YRDFVDLPKSVDWRKKGAVAPVKNQGQCGNCWAFSTVAAVEGINQIVTGNLTMLSEQELI 188
Query: 188 DCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHD 247
DCD T+NNGCNGGLMDYAF Y++ + GLHKEE+YPYIM EGTC+ K SE VTI+GYHD
Sbjct: 189 DCDTTFNNGCNGGLMDYAFAYVMRS-GLHKEEEYPYIMSEGTCDEKKDVSEKVTISGYHD 247
Query: 248 VPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGL 307
VP+N E S LKALANQP+SVAIEASGRDFQFYSGGV+DGHCGT+LDHGVAAVGYG+T+GL
Sbjct: 248 VPRNDEASFLKALANQPISVAIEASGRDFQFYSGGVFDGHCGTELDHGVAAVGYGTTKGL 307
Query: 308 DYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIKKK 352
DY+IV+NSWGPKWGEKGYIRMKR +GKP G+CG+ MASYP K+K
Sbjct: 308 DYVIVRNSWGPKWGEKGYIRMKRGSGKPHGMCGLYMMASYPTKQK 352
>gi|225428328|ref|XP_002279940.1| PREDICTED: cysteine proteinase-like [Vitis vinifera]
Length = 707
Score = 546 bits (1406), Expect = e-153, Method: Compositional matrix adjust.
Identities = 252/325 (77%), Positives = 288/325 (88%), Gaps = 1/325 (0%)
Query: 26 RDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRK 85
RDFSIVGYSPEDLT DKLI FESW+SK KVY+S++EKL RFE+F++NL HIDE N++
Sbjct: 382 RDFSIVGYSPEDLTCIDKLIARFESWVSKHGKVYKSMEEKLHRFEVFRENLNHIDERNKE 441
Query: 86 IKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGA 145
+ +YWLGLNEFADL HEEFK +LGL+ + R +D S E F Y+DV DLP+SVDWRKKGA
Sbjct: 442 VSSYWLGLNEFADLSHEEFKSKYLGLRAEFPRSRDYSGE-FRYRDVADLPESVDWRKKGA 500
Query: 146 VTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYA 205
VTHVKNQG+CGSCWAFSTVAAVEGINQIVTGNL +LSEQELIDCD T+N+GCNGGLMDYA
Sbjct: 501 VTHVKNQGACGSCWAFSTVAAVEGINQIVTGNLTTLSEQELIDCDTTFNSGCNGGLMDYA 560
Query: 206 FQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPL 265
F +I S GGLHKE+DYPY+MEEGTCE K + ++VTI+GY DVP+ E+SLLKALA+QPL
Sbjct: 561 FAFIASNGGLHKEDDYPYLMEEGTCEEQKEDVDIVTISGYEDVPEKDEESLLKALAHQPL 620
Query: 266 SVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGY 325
SVAIEASGRDFQFYSGGV++G CGT+LDHGVAAVGYGS++GLDYIIVKNSWGPKWGEKGY
Sbjct: 621 SVAIEASGRDFQFYSGGVFNGPCGTELDHGVAAVGYGSSKGLDYIIVKNSWGPKWGEKGY 680
Query: 326 IRMKRNTGKPEGLCGINKMASYPIK 350
IRMKRNTGK EGLCGINKMASYP K
Sbjct: 681 IRMKRNTGKTEGLCGINKMASYPTK 705
>gi|297802418|ref|XP_002869093.1| hypothetical protein ARALYDRAFT_491113 [Arabidopsis lyrata subsp.
lyrata]
gi|297314929|gb|EFH45352.1| hypothetical protein ARALYDRAFT_491113 [Arabidopsis lyrata subsp.
lyrata]
Length = 355
Score = 539 bits (1388), Expect = e-151, Method: Compositional matrix adjust.
Identities = 246/344 (71%), Positives = 294/344 (85%)
Query: 9 TILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLER 68
++L++ S + S+ ARDFSIVGY+PE LTS +KL++LFESWMS+ KVY+S++EK+ R
Sbjct: 12 SLLVAISASALLCSALARDFSIVGYTPEQLTSTEKLLELFESWMSEHSKVYKSVEEKVHR 71
Query: 69 FEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSY 128
FE+F++NL HID+ N +I +YWLGLNEFADL HEEFK +LGL RK Q +F Y
Sbjct: 72 FEVFRENLMHIDQRNNEINSYWLGLNEFADLTHEEFKGRYLGLAKPQFSRKRQPSANFRY 131
Query: 129 KDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELID 188
+D+ DLPKSVDWRKKGAV VK+QG CGSCWAFSTVAAVEGINQI TGNL+SLSEQELID
Sbjct: 132 RDITDLPKSVDWRKKGAVAPVKDQGQCGSCWAFSTVAAVEGINQITTGNLSSLSEQELID 191
Query: 189 CDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDV 248
CD T+N+GCNGGLMDYAFQYI+STGGLHKE+DYPY+MEEG C+ K + E VTI+GY DV
Sbjct: 192 CDTTFNSGCNGGLMDYAFQYIISTGGLHKEDDYPYLMEEGICQEQKEDVERVTISGYEDV 251
Query: 249 PQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLD 308
P+N ++SL+KALA+QP+SVAIEASGRDFQFY GGV++G CGT LDHGVAAVGYGS++G D
Sbjct: 252 PENDDESLVKALAHQPVSVAIEASGRDFQFYKGGVFNGQCGTDLDHGVAAVGYGSSKGSD 311
Query: 309 YIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIKKK 352
Y+IVKNSWGP+WGEKG+IRMKRNTGKPEGLCGINKMASYP K K
Sbjct: 312 YVIVKNSWGPRWGEKGFIRMKRNTGKPEGLCGINKMASYPTKTK 355
>gi|18418684|ref|NP_567983.1| Xylem cysteine proteinase 1 [Arabidopsis thaliana]
gi|71153408|sp|O65493.1|XCP1_ARATH RecName: Full=Xylem cysteine proteinase 1; Short=AtXCP1; Flags:
Precursor
gi|6708181|gb|AAF25831.1|AF191027_1 papain-type cysteine endopeptidase XCP1 [Arabidopsis thaliana]
gi|3080415|emb|CAA18734.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|7270487|emb|CAB80252.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|26449881|dbj|BAC42063.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|28827736|gb|AAO50712.1| unknown protein [Arabidopsis thaliana]
gi|332661101|gb|AEE86501.1| Xylem cysteine proteinase 1 [Arabidopsis thaliana]
Length = 355
Score = 536 bits (1380), Expect = e-150, Method: Compositional matrix adjust.
Identities = 245/344 (71%), Positives = 293/344 (85%)
Query: 9 TILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLER 68
++L++ S + +FARDFSIVGY+PE LT+ DKL++LFESWMS+ K Y+S++EK+ R
Sbjct: 12 SLLVAISASALLCCAFARDFSIVGYTPEHLTNTDKLLELFESWMSEHSKAYKSVEEKVHR 71
Query: 69 FEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSY 128
FE+F++NL HID+ N +I +YWLGLNEFADL HEEFK +LGL RK Q +F Y
Sbjct: 72 FEVFRENLMHIDQRNNEINSYWLGLNEFADLTHEEFKGRYLGLAKPQFSRKRQPSANFRY 131
Query: 129 KDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELID 188
+D+ DLPKSVDWRKKGAV VK+QG CGSCWAFSTVAAVEGINQI TGNL+SLSEQELID
Sbjct: 132 RDITDLPKSVDWRKKGAVAPVKDQGQCGSCWAFSTVAAVEGINQITTGNLSSLSEQELID 191
Query: 189 CDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDV 248
CD T+N+GCNGGLMDYAFQYI+STGGLHKE+DYPY+MEEG C+ K + E VTI+GY DV
Sbjct: 192 CDTTFNSGCNGGLMDYAFQYIISTGGLHKEDDYPYLMEEGICQEQKEDVERVTISGYEDV 251
Query: 249 PQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLD 308
P+N ++SL+KALA+QP+SVAIEASGRDFQFY GGV++G CGT LDHGVAAVGYGS++G D
Sbjct: 252 PENDDESLVKALAHQPVSVAIEASGRDFQFYKGGVFNGKCGTDLDHGVAAVGYGSSKGSD 311
Query: 309 YIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIKKK 352
Y+IVKNSWGP+WGEKG+IRMKRNTGKPEGLCGINKMASYP K K
Sbjct: 312 YVIVKNSWGPRWGEKGFIRMKRNTGKPEGLCGINKMASYPTKTK 355
>gi|255635584|gb|ACU18142.1| unknown [Glycine max]
Length = 345
Score = 534 bits (1376), Expect = e-149, Method: Compositional matrix adjust.
Identities = 264/350 (75%), Positives = 297/350 (84%), Gaps = 6/350 (1%)
Query: 1 MALS-SQFKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVY 59
MA S S+ + SFC+ F +F RDFSIVGYS EDL S DKLI+LFESWMSK K+Y
Sbjct: 1 MAFSFSKALVLACSFCL--FASLAFGRDFSIVGYSSEDLKSMDKLIELFESWMSKHGKIY 58
Query: 60 ESLDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRK 119
+S++EKL RFEIFKDNL+HIDE N+ + NYWLGLNEFADL H+EFK +LGLK D +RR+
Sbjct: 59 QSIEEKLLRFEIFKDNLKHIDERNKVVSNYWLGLNEFADLSHQEFKNKYLGLKVDYSRRR 118
Query: 120 DQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLA 179
+S E+F+YKDV +LPKSVDWRKKGAV VKNQGSCGSCWAFSTVAAVEGINQIVTGNL
Sbjct: 119 -ESPEEFTYKDV-ELPKSVDWRKKGAVAPVKNQGSCGSCWAFSTVAAVEGINQIVTGNLT 176
Query: 180 SLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEV 239
SLSEQELIDCD TY+NGCNGGLMDYAF +IV GGLHKEEDYPYIMEEGTCEMTK E+EV
Sbjct: 177 SLSEQELIDCDRTYSNGCNGGLMDYAFSFIVENGGLHKEEDYPYIMEEGTCEMTKEETEV 236
Query: 240 VTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAV 299
VTI+GYHDVPQN+E SLLKALANQ LSVAIEASGRDFQFYSGGV+DGHCG+ LDHGVAAV
Sbjct: 237 VTISGYHDVPQNNEQSLLKALANQSLSVAIEASGRDFQFYSGGVFDGHCGSDLDHGVAAV 296
Query: 300 GYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPI 349
GYG+ +G+DYIIVKNSWG KWGEKGYIRM R T + G +MASYP+
Sbjct: 297 GYGTAKGVDYIIVKNSWGSKWGEKGYIRM-RGTLETRGNLRYLQMASYPL 345
>gi|449500145|ref|XP_004161017.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
Length = 349
Score = 533 bits (1373), Expect = e-149, Method: Compositional matrix adjust.
Identities = 256/352 (72%), Positives = 288/352 (81%), Gaps = 3/352 (0%)
Query: 1 MALSSQFKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYE 60
MALS+ K LI + FI + A DFSIVGYSPE L S DK I+LFESWMSK K Y
Sbjct: 1 MALSTFSKATLI-LSATLFITYAIAHDFSIVGYSPEHLASMDKTIELFESWMSKHSKTYR 59
Query: 61 SLDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKD 120
S++EKL RFEIF DNL+HIDETN+K+ +YWLGLNEFADL HEEFK +LGL+ + R++
Sbjct: 60 SIEEKLHRFEIFLDNLKHIDETNKKVSSYWLGLNEFADLSHEEFKSKYLGLRVEFPRKR- 118
Query: 121 QSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLAS 180
S FSY DV DLP+SVDWR KGAVT VKNQGSCGSCWAFSTVAAVEGINQIVTGNL S
Sbjct: 119 -SSRGFSYGDVEDLPESVDWRTKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVTGNLTS 177
Query: 181 LSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVV 240
LSEQELIDCD ++NNGC GGLMDYAFQYI+S GL KEEDYPY+MEEG C K + EVV
Sbjct: 178 LSEQELIDCDRSFNNGCYGGLMDYAFQYIMSNSGLRKEEDYPYLMEEGRCIREKEQFEVV 237
Query: 241 TINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVG 300
TI+GY DVP N E SLLKAL++QP+SVAIEAS R+FQFY GG++ G CGTQ+DHGV AVG
Sbjct: 238 TISGYEDVPANDEQSLLKALSHQPVSVAIEASSRNFQFYKGGIFTGRCGTQMDHGVTAVG 297
Query: 301 YGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIKKK 352
YGS+ G DYIIVKNSWGPKWGE GYIRMKRNTGKPEGLCGIN+MASYP K+K
Sbjct: 298 YGSSEGTDYIIVKNSWGPKWGENGYIRMKRNTGKPEGLCGINQMASYPTKEK 349
>gi|449454309|ref|XP_004144898.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
gi|449471311|ref|XP_004153272.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
Length = 349
Score = 532 bits (1371), Expect = e-149, Method: Compositional matrix adjust.
Identities = 256/352 (72%), Positives = 288/352 (81%), Gaps = 3/352 (0%)
Query: 1 MALSSQFKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYE 60
MALS+ K LI + FI + A DFSIVGYSPE L S DK I+LFESWMSK K Y
Sbjct: 1 MALSTFSKATLI-LSATLFITYATAHDFSIVGYSPEHLASMDKTIELFESWMSKHSKAYR 59
Query: 61 SLDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKD 120
S++EKL RFEIF DNL+HIDETN+K+ +YWLGLNEFADL HEEFK +LGL+ + R++
Sbjct: 60 SIEEKLHRFEIFLDNLKHIDETNKKVSSYWLGLNEFADLSHEEFKSKYLGLRVEFPRKR- 118
Query: 121 QSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLAS 180
S FSY DV DLP+SVDWR KGAVT VKNQGSCGSCWAFSTVAAVEGINQIVTGNL S
Sbjct: 119 -SSRGFSYGDVEDLPESVDWRTKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVTGNLTS 177
Query: 181 LSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVV 240
LSEQELIDCD ++NNGC GGLMDYAFQYI+S GL KEEDYPY+MEEG C K + EVV
Sbjct: 178 LSEQELIDCDRSFNNGCYGGLMDYAFQYIMSNSGLRKEEDYPYLMEEGRCIREKEQFEVV 237
Query: 241 TINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVG 300
TI+GY DVP N E SLLKAL++QP+SVAIEAS R+FQFY GG++ G CGTQ+DHGV AVG
Sbjct: 238 TISGYEDVPANDEQSLLKALSHQPVSVAIEASSRNFQFYKGGIFTGRCGTQMDHGVTAVG 297
Query: 301 YGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIKKK 352
YGS+ G DYIIVKNSWGPKWGE GYIRMKRNTGKPEGLCGIN+MASYP K+K
Sbjct: 298 YGSSEGTDYIIVKNSWGPKWGENGYIRMKRNTGKPEGLCGINQMASYPTKEK 349
>gi|641905|gb|AAC49406.1| cysteine proteinase [Zinnia violacea]
Length = 342
Score = 525 bits (1353), Expect = e-146, Method: Compositional matrix adjust.
Identities = 250/335 (74%), Positives = 284/335 (84%), Gaps = 2/335 (0%)
Query: 3 LSSQFKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESL 62
SS+ + + CI F + F+ +FSI+GY+PEDLTS K+I LFES + K K+YES
Sbjct: 5 FSSKKTSAFLCICIGFGM-FGFSHEFSILGYAPEDLTSIHKVIHLFESSLVKHSKIYESF 63
Query: 63 DEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQS 122
DEKL RFEIF DNL+HIDETN+K+ NYWLGLNEFADL HEEFK FLG K +LA RKD+S
Sbjct: 64 DEKLHRFEIFMDNLKHIDETNKKVSNYWLGLNEFADLTHEEFKNKFLGFKGELAERKDES 123
Query: 123 HEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLS 182
E F Y+D VDLPKSVDWRKKGAV+ VKNQG CGSCWAFSTVAAVEGINQIVTGNL LS
Sbjct: 124 IEQFRYRDFVDLPKSVDWRKKGAVSPVKNQGQCGSCWAFSTVAAVEGINQIVTGNLTVLS 183
Query: 183 EQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTI 242
EQELIDCD T+NNGCNGGLMDYAF Y V+ GLHKEE+YPYIM EGTC+ + SE VTI
Sbjct: 184 EQELIDCDTTFNNGCNGGLMDYAFAY-VTRNGLHKEEEYPYIMSEGTCDEKRDASEKVTI 242
Query: 243 NGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYG 302
+GYHDVP+N+EDS LKALANQP+SVAIEASGRDFQFYSGGV+DGHCGT+LDHGVAAVGYG
Sbjct: 243 SGYHDVPRNNEDSFLKALANQPISVAIEASGRDFQFYSGGVFDGHCGTELDHGVAAVGYG 302
Query: 303 STRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEG 337
+++GLDY+IV+NSWGPKWGEKGYIRMKRNTGKP G
Sbjct: 303 TSKGLDYVIVRNSWGPKWGEKGYIRMKRNTGKPMG 337
>gi|297744465|emb|CBI37727.3| unnamed protein product [Vitis vinifera]
Length = 331
Score = 513 bits (1320), Expect = e-143, Method: Compositional matrix adjust.
Identities = 244/325 (75%), Positives = 275/325 (84%), Gaps = 22/325 (6%)
Query: 26 RDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRK 85
RDFSIVGYSPEDLT DKLI FESW+SK KVY+S++EKL RFE+F++NL HIDE N++
Sbjct: 27 RDFSIVGYSPEDLTCIDKLIARFESWVSKHGKVYKSMEEKLHRFEVFRENLNHIDERNKE 86
Query: 86 IKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGA 145
+ +YWLGLNEFADL HEEFK KDV DLP+SVDWRKKGA
Sbjct: 87 VSSYWLGLNEFADLSHEEFKS----------------------KDVADLPESVDWRKKGA 124
Query: 146 VTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYA 205
VTHVKNQG+CGSCWAFSTVAAVEGINQIVTGNL +LSEQELIDCD T+N+GCNGGLMDYA
Sbjct: 125 VTHVKNQGACGSCWAFSTVAAVEGINQIVTGNLTTLSEQELIDCDTTFNSGCNGGLMDYA 184
Query: 206 FQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPL 265
F +I S GGLHKE+DYPY+MEEGTCE K + ++VTI+GY DVP+ E+SLLKALA+QPL
Sbjct: 185 FAFIASNGGLHKEDDYPYLMEEGTCEEQKEDVDIVTISGYEDVPEKDEESLLKALAHQPL 244
Query: 266 SVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGY 325
SVAIEASGRDFQFYSGGV++G CGT+LDHGVAAVGYGS++GLDYIIVKNSWGPKWGEKGY
Sbjct: 245 SVAIEASGRDFQFYSGGVFNGPCGTELDHGVAAVGYGSSKGLDYIIVKNSWGPKWGEKGY 304
Query: 326 IRMKRNTGKPEGLCGINKMASYPIK 350
IRMKRNTGK EGLCGINKMASYP K
Sbjct: 305 IRMKRNTGKTEGLCGINKMASYPTK 329
>gi|297745594|emb|CBI40759.3| unnamed protein product [Vitis vinifera]
Length = 300
Score = 500 bits (1287), Expect = e-139, Method: Compositional matrix adjust.
Identities = 233/301 (77%), Positives = 264/301 (87%), Gaps = 1/301 (0%)
Query: 52 MSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGL 111
MSK K Y S +EKL RFE+F+DNL+HIDETN+K+ +YWLGLNEFADL HEEFK +LGL
Sbjct: 1 MSKHGKSYRSFEEKLHRFEVFQDNLKHIDETNKKVSSYWLGLNEFADLSHEEFKRKYLGL 60
Query: 112 KPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGIN 171
K +L +R+D S E+FSYKDV DLPKSVDWRKKGAV HVKNQG+CGSCWAFSTVAAVEGIN
Sbjct: 61 KIELPKRRD-SPEEFSYKDVADLPKSVDWRKKGAVAHVKNQGACGSCWAFSTVAAVEGIN 119
Query: 172 QIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCE 231
QIVTGNL +LSEQELIDCD +NNGCNGGLMDYAF +I+S GGL KEEDYPY+MEEGTC
Sbjct: 120 QIVTGNLTALSEQELIDCDKPFNNGCNGGLMDYAFAFIISNGGLRKEEDYPYVMEEGTCG 179
Query: 232 MTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQ 291
K E EVVTI+GYHDVP+++E S LKALANQPLSVAIEAS R FQFYSGG+++GHCGT+
Sbjct: 180 EKKEELEVVTISGYHDVPEDNEQSFLKALANQPLSVAIEASSRGFQFYSGGIFNGHCGTE 239
Query: 292 LDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIKK 351
LDHGVAAVGYG+++G+DYI VKNSWG KWGEKGYIRMKRN GKPEG+CGI KMASYP K
Sbjct: 240 LDHGVAAVGYGTSKGVDYITVKNSWGSKWGEKGYIRMKRNVGKPEGICGIYKMASYPTKN 299
Query: 352 K 352
K
Sbjct: 300 K 300
>gi|357130141|ref|XP_003566711.1| PREDICTED: xylem cysteine proteinase 1-like [Brachypodium
distachyon]
Length = 457
Score = 498 bits (1283), Expect = e-138, Method: Compositional matrix adjust.
Identities = 240/329 (72%), Positives = 272/329 (82%), Gaps = 7/329 (2%)
Query: 27 DFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKI 86
DFSIVGYS EDL+SND++I+LFE W++K +K Y S +EKL RFE+FKDNL+HID+ NR++
Sbjct: 129 DFSIVGYSEEDLSSNDRIIELFEKWLAKHQKAYASFEEKLHRFEVFKDNLKHIDKVNREV 188
Query: 87 KNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVV--DLPKSVDWRKKG 144
+YWLGLNEFADL HEEFK +LGL P R +S F Y+DV DLPKSVDWR KG
Sbjct: 189 TSYWLGLNEFADLTHEEFKATYLGLAPPAPAR--ESRGSFKYEDVSADDLPKSVDWRTKG 246
Query: 145 AVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDY 204
AVT VKNQG CGSCWAFSTVAAVEGIN IVTGNL +LSEQELIDC NNGCNGGLMDY
Sbjct: 247 AVTEVKNQGQCGSCWAFSTVAAVEGINAIVTGNLTALSEQELIDCSVDGNNGCNGGLMDY 306
Query: 205 AFQYIVSTGGLHKEEDYPYIMEEGTC-EMTKGESEVVTINGYHDVPQNSEDSLLKALANQ 263
AF YI S+GGLH EE YPY+MEEG+C + K ESE VTI+GY DVP ++E +L+KALA+Q
Sbjct: 307 AFSYIASSGGLHTEEAYPYLMEEGSCGDGKKSESEAVTISGYEDVPAHNEQALIKALAHQ 366
Query: 264 PLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGST--RGLDYIIVKNSWGPKWG 321
P+SVAIEASGR FQFYSGGV+DG CGTQLDHGVAAVGYGS +G DYIIV+NSWG KWG
Sbjct: 367 PVSVAIEASGRHFQFYSGGVFDGPCGTQLDHGVAAVGYGSDKGKGHDYIIVRNSWGAKWG 426
Query: 322 EKGYIRMKRNTGKPEGLCGINKMASYPIK 350
EKGYIRMKR TGK EGLCGINKMASYP K
Sbjct: 427 EKGYIRMKRGTGKGEGLCGINKMASYPTK 455
>gi|255546708|ref|XP_002514413.1| cysteine protease, putative [Ricinus communis]
gi|223546510|gb|EEF48009.1| cysteine protease, putative [Ricinus communis]
Length = 324
Score = 491 bits (1265), Expect = e-136, Method: Compositional matrix adjust.
Identities = 242/342 (70%), Positives = 267/342 (78%), Gaps = 27/342 (7%)
Query: 11 LISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFE 70
L + S I S A DFSIVGYSPE LTS KL +LFESWMSK K YES++EKL R E
Sbjct: 10 LFTIFTSLVICSVVAHDFSIVGYSPEHLTSMHKLTELFESWMSKHGKTYESIEEKLHRLE 69
Query: 71 IFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKD 130
+FKDNL HID NR + YWL LNEFADL HEEFK + RR +
Sbjct: 70 VFKDNLMHIDRRNRDVTTYWLALNEFADLSHEEFKSKLAQI-----RRLE---------- 114
Query: 131 VVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCD 190
KGAV VKNQGSCGSCWAFSTVAAVEGINQIVTGNL SLSEQELIDCD
Sbjct: 115 ------------KGAVAPVKNQGSCGSCWAFSTVAAVEGINQIVTGNLTSLSEQELIDCD 162
Query: 191 NTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQ 250
++N+GCNGGLMDYAF YIV+ GGLHKEEDYPY+MEEGTC+ + E EVVTI+GYHDVP+
Sbjct: 163 TSFNSGCNGGLMDYAFDYIVNNGGLHKEEDYPYLMEEGTCDEKREEMEVVTISGYHDVPE 222
Query: 251 NSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYI 310
N+E+SLLKALA+QPLS+AIEASGRDFQFY GV++G CGT LDHGVAAVGYGS++GLDYI
Sbjct: 223 NNEESLLKALAHQPLSIAIEASGRDFQFYGRGVFNGPCGTDLDHGVAAVGYGSSKGLDYI 282
Query: 311 IVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIKKK 352
IVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP KKK
Sbjct: 283 IVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPTKKK 324
>gi|194352750|emb|CAQ00103.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
gi|326514262|dbj|BAJ92281.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326519402|dbj|BAJ96700.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326524351|dbj|BAK00559.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326531998|dbj|BAK01375.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 356
Score = 491 bits (1264), Expect = e-136, Method: Compositional matrix adjust.
Identities = 242/355 (68%), Positives = 280/355 (78%), Gaps = 13/355 (3%)
Query: 1 MALSSQFKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYE 60
M LS + + C++ R+S DFSIVGYS EDL+SN++L++LFE W++K +K Y
Sbjct: 8 MKLSGALLLLCVGACVA---RNS---DFSIVGYSEEDLSSNERLVELFEKWLAKHQKAYA 61
Query: 61 SLDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKD 120
S +EKL RFE+FKDNL+HID+ NR++ +YWLGLNEFADL H+EFK +LGL ARR
Sbjct: 62 SFEEKLHRFEVFKDNLKHIDKINREVTSYWLGLNEFADLTHDEFKAAYLGLDAAPARRG- 120
Query: 121 QSHEDFSYKDVV--DLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNL 178
S F Y+DV DLPKSVDWRKKGAVT VKNQG CGSCWAFSTVAAVEGIN IVTGNL
Sbjct: 121 -SSRSFRYEDVSASDLPKSVDWRKKGAVTEVKNQGQCGSCWAFSTVAAVEGINAIVTGNL 179
Query: 179 ASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTC-EMTKGES 237
+LSEQELIDC N+GCNGGLMDYAF YI S+GGLH EE YPY+MEEG+C + K ES
Sbjct: 180 TALSEQELIDCSVDGNSGCNGGLMDYAFSYIASSGGLHTEEAYPYLMEEGSCGDGKKAES 239
Query: 238 EVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVA 297
E VTI+GY DVP N E +L+KALA+QP+SVAIEASGR FQFYSGGV+DG CG QLDHGVA
Sbjct: 240 EAVTISGYEDVPANDEQALIKALAHQPVSVAIEASGRHFQFYSGGVFDGPCGAQLDHGVA 299
Query: 298 AVGYGSTRGL--DYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
AVGYGS +G DYIIV+NSWG +WGEKGYIRMKR T EGLCGINKMASYP K
Sbjct: 300 AVGYGSDKGKGHDYIIVRNSWGAQWGEKGYIRMKRGTSNGEGLCGINKMASYPTK 354
>gi|115461667|ref|NP_001054433.1| Os05g0108600 [Oryza sativa Japonica Group]
gi|14719319|gb|AAK73137.1|AC079022_10 putative cysteine proteinase [Oryza sativa]
gi|33151125|gb|AAP97431.1| cysteine protease CP1 [Oryza sativa]
gi|52353572|gb|AAU44138.1| cysteine proteinase CP1 [Oryza sativa Japonica Group]
gi|113577984|dbj|BAF16347.1| Os05g0108600 [Oryza sativa Japonica Group]
gi|125550541|gb|EAY96250.1| hypothetical protein OsI_18148 [Oryza sativa Indica Group]
Length = 358
Score = 482 bits (1241), Expect = e-134, Method: Compositional matrix adjust.
Identities = 232/328 (70%), Positives = 264/328 (80%), Gaps = 5/328 (1%)
Query: 27 DFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKI 86
+FSIVGYS EDL S+D+LI+LFE W++K+ K Y S +EK+ RFE+FKDNL HID+ N+K+
Sbjct: 30 EFSIVGYSEEDLASHDRLIELFEKWVAKYRKAYASFEEKVRRFEVFKDNLNHIDDINKKV 89
Query: 87 KNYWLGLNEFADLRHEEFKEMFLGLKPDLARR--KDQSHEDFSYKDVV--DLPKSVDWRK 142
+YWLGLNEFADL H+EFK +LGL P R K S E+F Y + ++PK +DWRK
Sbjct: 90 TSYWLGLNEFADLTHDEFKATYLGLTPPPTRSNSKHYSSEEFRYGKMSNGEVPKEMDWRK 149
Query: 143 KGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLM 202
K AVT VKNQG CGSCWAFSTVAAVEGIN IVTGNL SLSEQELIDC NNGCNGGLM
Sbjct: 150 KNAVTEVKNQGQCGSCWAFSTVAAVEGINAIVTGNLTSLSEQELIDCSTDGNNGCNGGLM 209
Query: 203 DYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN 262
DYAF YI STGGL EE YPY MEEG C+ KG + VVTI+GY DVP N E +L+KALA+
Sbjct: 210 DYAFSYIASTGGLRTEEAYPYAMEEGDCDEGKGAA-VVTISGYEDVPANDEQALVKALAH 268
Query: 263 QPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGE 322
QP+SVAIEASGR FQFYSGGV+DG CG QLDHGV AVGYG+++G DYIIVKNSWGP WGE
Sbjct: 269 QPVSVAIEASGRHFQFYSGGVFDGPCGEQLDHGVTAVGYGTSKGQDYIIVKNSWGPHWGE 328
Query: 323 KGYIRMKRNTGKPEGLCGINKMASYPIK 350
KGYIRMKR TGK EGLCGINKMASYP K
Sbjct: 329 KGYIRMKRGTGKGEGLCGINKMASYPTK 356
>gi|242086591|ref|XP_002439128.1| hypothetical protein SORBIDRAFT_09g000960 [Sorghum bicolor]
gi|241944413|gb|EES17558.1| hypothetical protein SORBIDRAFT_09g000960 [Sorghum bicolor]
Length = 371
Score = 479 bits (1233), Expect = e-133, Method: Compositional matrix adjust.
Identities = 230/328 (70%), Positives = 262/328 (79%), Gaps = 5/328 (1%)
Query: 28 FSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIK 87
FSIVGYSPEDL +D+LI LFE W++K+ K Y S +EKL RFE+FKDNL HIDE N+K+
Sbjct: 46 FSIVGYSPEDLVHHDRLIKLFEEWVAKYRKAYASFEEKLHRFEVFKDNLHHIDEANKKVT 105
Query: 88 NYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVD--LPKSVDWRKKGA 145
YWLGLN FADL H+EFK +LGL+ ++ S F Y V D +P SVDWRKKGA
Sbjct: 106 TYWLGLNAFADLTHDEFKATYLGLRQPETKKTTDSR--FRYGGVADDDVPASVDWRKKGA 163
Query: 146 VTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYA 205
VT VKNQG CGSCWAFSTVAAVEGINQIVTGNL SLSEQEL+DC NNGCNGG+MD A
Sbjct: 164 VTDVKNQGQCGSCWAFSTVAAVEGINQIVTGNLTSLSEQELVDCSTDGNNGCNGGVMDNA 223
Query: 206 FQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESE-VVTINGYHDVPQNSEDSLLKALANQP 264
F YI S+GGL EE YPY+MEEG C+ + E VVTI+GY DVP N E +L+KALA+QP
Sbjct: 224 FSYIASSGGLRTEEAYPYLMEEGDCDDKARDGEQVVTISGYEDVPANDEQALVKALAHQP 283
Query: 265 LSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKG 324
LSVAIEASGR FQFYSGGV++G CG++LDHGVAAVGYGS++G DYIIVKNSWG WGEKG
Sbjct: 284 LSVAIEASGRHFQFYSGGVFNGPCGSELDHGVAAVGYGSSKGQDYIIVKNSWGSHWGEKG 343
Query: 325 YIRMKRNTGKPEGLCGINKMASYPIKKK 352
YIRMKR TGKPEGLCGINKMASYP K +
Sbjct: 344 YIRMKRGTGKPEGLCGINKMASYPTKDQ 371
>gi|226533314|ref|NP_001150119.1| xylem cysteine proteinase 2 [Zea mays]
gi|195636886|gb|ACG37911.1| xylem cysteine proteinase 2 precursor [Zea mays]
gi|223946183|gb|ACN27175.1| unknown [Zea mays]
gi|413951209|gb|AFW83858.1| Xylem cysteine proteinase 2 [Zea mays]
Length = 385
Score = 478 bits (1230), Expect = e-132, Method: Compositional matrix adjust.
Identities = 234/347 (67%), Positives = 268/347 (77%), Gaps = 23/347 (6%)
Query: 27 DFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKI 86
DFSIVGYS EDL+S++ L +LFE W+S+ + Y SL+EKL RF++FKDNL HIDETNRK+
Sbjct: 38 DFSIVGYSEEDLSSHESLAELFERWLSRHRRAYASLEEKLRRFQVFKDNLHHIDETNRKV 97
Query: 87 KNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDV--------VDLPKSV 138
+YWLGLNEFADL H+EFK +LGL+ + +D ++ LPKSV
Sbjct: 98 SSYWLGLNEFADLTHDEFKATYLGLRSSVGDGGSGIDDDDEPEEEEGYEGVDGASLPKSV 157
Query: 139 DWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCN 198
DWR KGAVT VKNQG CGSCWAFSTVAAVEGINQIVTGNL +LSEQELIDCD NNGCN
Sbjct: 158 DWRSKGAVTGVKNQGQCGSCWAFSTVAAVEGINQIVTGNLTALSEQELIDCDTDGNNGCN 217
Query: 199 GGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTK-------GESE-------VVTING 244
GGLMDYAF YI GGLH EE YPY+MEEGTC+ + G SE VVTI+G
Sbjct: 218 GGLMDYAFSYIAHNGGLHTEEAYPYLMEEGTCQRSSSSEKKWPGSSEDANDDAAVVTISG 277
Query: 245 YHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGS- 303
Y DVP+N+E +LLKALA QP+SVAIEASGR+FQFYSGGV+DG CGTQLDHGVAAVGYG+
Sbjct: 278 YEDVPRNNEQALLKALAQQPVSVAIEASGRNFQFYSGGVFDGPCGTQLDHGVAAVGYGTA 337
Query: 304 TRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
+G DYIIVKNSWGP WGEKGYIRM+R TGK +GLCGINKMASYP K
Sbjct: 338 AKGHDYIIVKNSWGPSWGEKGYIRMRRGTGKRQGLCGINKMASYPTK 384
>gi|194352752|emb|CAQ00104.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
Length = 351
Score = 477 bits (1227), Expect = e-132, Method: Compositional matrix adjust.
Identities = 235/353 (66%), Positives = 278/353 (78%), Gaps = 9/353 (2%)
Query: 3 LSSQFKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESL 62
++S+ ++ C+ + + DFSIVGYS EDL+S+D+L++LFE W++K +K Y S
Sbjct: 1 MASKLSVAVLLLCVGACVARN--SDFSIVGYSEEDLSSHDRLVELFEKWLAKHQKAYASF 58
Query: 63 DEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQS 122
+EKL RFE+FKDNL+ IDE NR++ +YWLGLNEFADL H+EFK +LGL P + S
Sbjct: 59 EEKLHRFEVFKDNLKLIDEINREVTSYWLGLNEFADLTHDEFKTTYLGLSP--PPARRSS 116
Query: 123 HEDFSYKDVV--DLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLAS 180
F Y++V DLPK+VDWRKKGAVT VKNQG CGSCWAFSTVAAVEGIN IVTGNL +
Sbjct: 117 SRSFRYENVAAHDLPKAVDWRKKGAVTDVKNQGQCGSCWAFSTVAAVEGINAIVTGNLTA 176
Query: 181 LSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTC-EMTKGESEV 239
LSEQELIDC N+GCNGG+MDYAF YI S+GGLH EE YPY+MEEG+C + K ESE
Sbjct: 177 LSEQELIDCSVDGNSGCNGGMMDYAFSYIASSGGLHTEEAYPYLMEEGSCGDGKKSESEA 236
Query: 240 VTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAV 299
V+I+GY DVP E +L+KALA+QP+SVAIEASGR FQFYSGGV+DG CG QLDHGVAAV
Sbjct: 237 VSISGYEDVPTKDEQALIKALAHQPVSVAIEASGRHFQFYSGGVFDGPCGAQLDHGVAAV 296
Query: 300 GYGSTRGL--DYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
GYGS +G DYIIVKNSWG KWGEKGYIRMKR TGK EGLCGINKMASYP K
Sbjct: 297 GYGSDKGKGHDYIIVKNSWGGKWGEKGYIRMKRGTGKSEGLCGINKMASYPTK 349
>gi|297598407|ref|NP_001045533.2| Os01g0971400 [Oryza sativa Japonica Group]
gi|15289977|dbj|BAB63672.1| putative cysteine protease CP1 [Oryza sativa Japonica Group]
gi|125529282|gb|EAY77396.1| hypothetical protein OsI_05384 [Oryza sativa Indica Group]
gi|125573472|gb|EAZ14987.1| hypothetical protein OsJ_04922 [Oryza sativa Japonica Group]
gi|215740756|dbj|BAG97412.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215741010|dbj|BAG97505.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215765325|dbj|BAG87022.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215767338|dbj|BAG99566.1| unnamed protein product [Oryza sativa Japonica Group]
gi|255674119|dbj|BAF07447.2| Os01g0971400 [Oryza sativa Japonica Group]
Length = 365
Score = 476 bits (1225), Expect = e-132, Method: Compositional matrix adjust.
Identities = 231/346 (66%), Positives = 270/346 (78%), Gaps = 11/346 (3%)
Query: 15 CISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKD 74
C + + + SIVGYS EDL S+++L++LFE +M+K+ K Y SL+EKL RFE+FKD
Sbjct: 19 CGGACVAVAMPSELSIVGYSEEDLASHERLMELFEKFMAKYRKAYSSLEEKLRRFEVFKD 78
Query: 75 NLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDV--V 132
NL HIDE N+KI YWLGLNEFADL H+EFK +LGL ARR + + + F Y++V
Sbjct: 79 NLNHIDEENKKITGYWLGLNEFADLTHDEFKAAYLGLTLTPARR-NSNDQLFRYEEVEAA 137
Query: 133 DLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNT 192
LPK VDWRKKGAVT VKNQG CGSCWAFSTVAAVEGIN IVTGNL LSEQELIDCD
Sbjct: 138 SLPKEVDWRKKGAVTEVKNQGQCGSCWAFSTVAAVEGINAIVTGNLTRLSEQELIDCDTD 197
Query: 193 YNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESE-------VVTINGY 245
NNGC+GGLMDYAF YI + GGLH EE YPY+MEEGTC E + VTI+GY
Sbjct: 198 GNNGCSGGLMDYAFSYIAANGGLHTEESYPYLMEEGTCRRGSTEGDDDGEAAAAVTISGY 257
Query: 246 HDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGS-T 304
DVP+N+E +LLKALA+QP+SVAIEASGR+FQFYSGGV+DG CGT+LDHGV AVGYG+ +
Sbjct: 258 EDVPRNNEQALLKALAHQPVSVAIEASGRNFQFYSGGVFDGPCGTRLDHGVTAVGYGTAS 317
Query: 305 RGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
+G DYIIVKNSWG WGEKGYIRM+R TGK +GLCGINKMASYP K
Sbjct: 318 KGHDYIIVKNSWGSHWGEKGYIRMRRGTGKHDGLCGINKMASYPTK 363
>gi|8886940|gb|AAF80626.1|AC069251_19 F2D10.37 [Arabidopsis thaliana]
Length = 315
Score = 469 bits (1207), Expect = e-130, Method: Compositional matrix adjust.
Identities = 221/287 (77%), Positives = 254/287 (88%), Gaps = 1/287 (0%)
Query: 26 RDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRK 85
D+SIVGYSPEDL S+DKLI+LFE+W+S FEK YE+++EK RFE+FKDNL+HIDETN+K
Sbjct: 29 HDYSIVGYSPEDLESHDKLIELFENWISNFEKAYETVEEKFLRFEVFKDNLKHIDETNKK 88
Query: 86 IKNYWLGLNEFADLRHEEFKEMFLGLKPDLARR-KDQSHEDFSYKDVVDLPKSVDWRKKG 144
K+YWLGLNEFADL HEEFK+M+LGLK D+ RR +++S+ +F+Y+DV +PKSVDWRKKG
Sbjct: 89 GKSYWLGLNEFADLSHEEFKKMYLGLKTDIVRRDEERSYAEFAYRDVEAVPKSVDWRKKG 148
Query: 145 AVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDY 204
AV VKNQGSCGSCWAFSTVAAVEGIN+IVTGNL +LSEQELIDCD TYNNGCNGGLMDY
Sbjct: 149 AVAEVKNQGSCGSCWAFSTVAAVEGINKIVTGNLTTLSEQELIDCDTTYNNGCNGGLMDY 208
Query: 205 AFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQP 264
AF+YIV GGL KEEDYPY MEEGTCEM K ESE VTING+ DVP N E SLLKALA+QP
Sbjct: 209 AFEYIVKNGGLRKEEDYPYSMEEGTCEMQKDESETVTINGHQDVPTNDEKSLLKALAHQP 268
Query: 265 LSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYII 311
LSVAI+ASGR+FQFYSGGV+DG CG LDHGVAAVGYGS++G DYII
Sbjct: 269 LSVAIDASGREFQFYSGGVFDGRCGVDLDHGVAAVGYGSSKGSDYII 315
>gi|226503129|ref|NP_001149806.1| LOC100283433 precursor [Zea mays]
gi|195634783|gb|ACG36860.1| xylem cysteine proteinase 2 precursor [Zea mays]
gi|219884977|gb|ACL52863.1| unknown [Zea mays]
Length = 377
Score = 462 bits (1189), Expect = e-127, Method: Compositional matrix adjust.
Identities = 228/329 (69%), Positives = 259/329 (78%), Gaps = 11/329 (3%)
Query: 28 FSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRK-I 86
FSIVGYSPEDLT +D+L+ LFE W++K+ K Y S +EKL RFE+FKDNL HIDE NRK +
Sbjct: 52 FSIVGYSPEDLTQHDRLVRLFEEWVAKYRKAYGSFEEKLRRFEVFKDNLHHIDEANRKEV 111
Query: 87 KNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVDL----PKSVDWRK 142
+YWLGLN FADL H+EFK +LGL P K S F Y V D P SVDWRK
Sbjct: 112 TSYWLGLNAFADLTHDEFKATYLGLLP-----KRTSGGRFRYGGVGDGGDEVPASVDWRK 166
Query: 143 KGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLM 202
KGAVT VKNQG CGSCWAFSTVAAVEGINQIVTGNL SLSEQ+L+DC NNGC+GG+M
Sbjct: 167 KGAVTEVKNQGQCGSCWAFSTVAAVEGINQIVTGNLTSLSEQQLVDCSTDGNNGCSGGVM 226
Query: 203 DYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEV-VTINGYHDVPQNSEDSLLKALA 261
D AF +I + GL EE YPY+MEEG C+ + EV VTI+GY DVP N E +L+KALA
Sbjct: 227 DNAFSFIATGAGLRSEEAYPYLMEEGDCDDRARDGEVLVTISGYEDVPANDEQALVKALA 286
Query: 262 NQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWG 321
+QP+SVAIEASGR FQFYSGGV+DG CG++LDHGVAAVGYGS++G DYIIVKNSWG WG
Sbjct: 287 HQPVSVAIEASGRHFQFYSGGVFDGPCGSELDHGVAAVGYGSSKGQDYIIVKNSWGTHWG 346
Query: 322 EKGYIRMKRNTGKPEGLCGINKMASYPIK 350
EKGYIRMKR TGKPEGLCGINKMASYP K
Sbjct: 347 EKGYIRMKRGTGKPEGLCGINKMASYPTK 375
>gi|413942348|gb|AFW74997.1| Xylem cysteine proteinase 2 [Zea mays]
Length = 391
Score = 462 bits (1188), Expect = e-127, Method: Compositional matrix adjust.
Identities = 228/329 (69%), Positives = 259/329 (78%), Gaps = 11/329 (3%)
Query: 28 FSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRK-I 86
FSIVGYSPEDLT +D+L+ LFE W++K+ K Y S +EKL RFE+FKDNL HIDE NRK +
Sbjct: 66 FSIVGYSPEDLTQHDRLVRLFEEWVAKYRKAYGSFEEKLRRFEVFKDNLHHIDEANRKEV 125
Query: 87 KNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVDL----PKSVDWRK 142
+YWLGLN FADL H+EFK +LGL P K S F Y V D P SVDWRK
Sbjct: 126 TSYWLGLNAFADLTHDEFKATYLGLLP-----KRTSGGRFRYGGVGDGGDEVPASVDWRK 180
Query: 143 KGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLM 202
KGAVT VKNQG CGSCWAFSTVAAVEGINQIVTGNL SLSEQ+L+DC NNGC+GG+M
Sbjct: 181 KGAVTEVKNQGQCGSCWAFSTVAAVEGINQIVTGNLTSLSEQQLVDCSTDGNNGCSGGVM 240
Query: 203 DYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEV-VTINGYHDVPQNSEDSLLKALA 261
D AF +I + GL EE YPY+MEEG C+ + EV VTI+GY DVP N E +L+KALA
Sbjct: 241 DNAFSFIATGAGLRSEEAYPYLMEEGDCDDRARDGEVLVTISGYEDVPANDEQALVKALA 300
Query: 262 NQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWG 321
+QP+SVAIEASGR FQFYSGGV+DG CG++LDHGVAAVGYGS++G DYIIVKNSWG WG
Sbjct: 301 HQPVSVAIEASGRHFQFYSGGVFDGPCGSELDHGVAAVGYGSSKGQDYIIVKNSWGTHWG 360
Query: 322 EKGYIRMKRNTGKPEGLCGINKMASYPIK 350
EKGYIRMKR TGKPEGLCGINKMASYP K
Sbjct: 361 EKGYIRMKRGTGKPEGLCGINKMASYPTK 389
>gi|242055753|ref|XP_002457022.1| hypothetical protein SORBIDRAFT_03g047290 [Sorghum bicolor]
gi|241928997|gb|EES02142.1| hypothetical protein SORBIDRAFT_03g047290 [Sorghum bicolor]
Length = 378
Score = 459 bits (1180), Expect = e-126, Method: Compositional matrix adjust.
Identities = 238/375 (63%), Positives = 270/375 (72%), Gaps = 37/375 (9%)
Query: 10 ILISFCISFF---IRSSFAR-DFSIVGYSPEDLTSNDKLIDLFESWMSKFEK-VYESLDE 64
I++ CI + AR DFSIVGYS EDL+S++ L +LFE W+S+ K Y SL+E
Sbjct: 6 IVVVLCIGLLSSCVGLGLARGDFSIVGYSEEDLSSHESLAELFERWLSRHRKGAYASLEE 65
Query: 65 KLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHE 124
KL RFE+FKDNL HIDETNRK+ +YWLGLNEFADL H+EFK +L D H
Sbjct: 66 KLRRFEVFKDNLHHIDETNRKVSSYWLGLNEFADLTHDEFKATYL-GLSPSGGGGDVVHM 124
Query: 125 D--------------------FSYK--DVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFS 162
F Y+ D LPKSVDWR KGAVT VKNQG CGSCWAFS
Sbjct: 125 HHDDDDEEPEEEGSSSSSSFRFRYEGVDAARLPKSVDWRSKGAVTGVKNQGQCGSCWAFS 184
Query: 163 TVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYP 222
TVAAVEGINQIVTGNL +LSEQEL+DCD NNGCNGGLMDYAF YI GGLH EE YP
Sbjct: 185 TVAAVEGINQIVTGNLTALSEQELVDCDTDGNNGCNGGLMDYAFSYIAHNGGLHTEEAYP 244
Query: 223 YIMEEGTCEMTKGES-EVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSG 281
Y+MEEGTC ++G S VVTI+GY DVP+N+E +LLKALA+QP+SVAIEASGR+ QFYSG
Sbjct: 245 YLMEEGTC--SRGSSAAVVTISGYEDVPRNNEQALLKALAHQPVSVAIEASGRNLQFYSG 302
Query: 282 GVYDGHCGTQLDHGVAAVGYGSTRG------LDYIIVKNSWGPKWGEKGYIRMKRNTGKP 335
GV+DG CGTQLDHGVAAVGYG+ DYIIVKNSWGP WGEKGYIRM+R TGK
Sbjct: 303 GVFDGPCGTQLDHGVAAVGYGTAGKDNGHVVADYIIVKNSWGPSWGEKGYIRMRRGTGKR 362
Query: 336 EGLCGINKMASYPIK 350
+GLCGINKM SYP K
Sbjct: 363 QGLCGINKMPSYPTK 377
>gi|115448287|ref|NP_001047923.1| Os02g0715000 [Oryza sativa Japonica Group]
gi|42408029|dbj|BAD09165.1| putative cysteine proteinase [Oryza sativa Japonica Group]
gi|113537454|dbj|BAF09837.1| Os02g0715000 [Oryza sativa Japonica Group]
gi|215737450|dbj|BAG96580.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215765786|dbj|BAG87483.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222623551|gb|EEE57683.1| hypothetical protein OsJ_08138 [Oryza sativa Japonica Group]
Length = 366
Score = 458 bits (1178), Expect = e-126, Method: Compositional matrix adjust.
Identities = 218/352 (61%), Positives = 261/352 (74%), Gaps = 2/352 (0%)
Query: 1 MALSSQFKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYE 60
+A+ S+ + + ++ D S+VGYS EDL +KL+ LF SW K K+Y
Sbjct: 8 LAMDSKLSMLFLLLGFVACSATASHHDPSVVGYSQEDLALPNKLVGLFTSWSVKHSKIYA 67
Query: 61 SLDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKD 120
S EK++R+EIFK NLRHI ETNR+ +YWLGLN FAD+ HEEFK +LGLKP LARR
Sbjct: 68 SPKEKVKRYEIFKRNLRHIVETNRRNGSYWLGLNHFADIAHEEFKASYLGLKPGLARRDA 127
Query: 121 QSH--EDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNL 178
Q H F Y + V+LP +VDWRKKGAVT VKNQG CGSCWAFSTVAAVEGINQIVTG L
Sbjct: 128 QPHGSTTFRYANAVNLPWAVDWRKKGAVTPVKNQGECGSCWAFSTVAAVEGINQIVTGKL 187
Query: 179 ASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESE 238
SLSEQEL+DCDNT+N+GC GGLMD+AF YI+ G++ EEDYPY+MEEG C + S+
Sbjct: 188 VSLSEQELMDCDNTFNHGCRGGLMDFAFAYIMGNQGIYTEEDYPYLMEEGYCREKQPHSK 247
Query: 239 VVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAA 298
V+TI GY DVP NSE SLLKALA+QP+SV I A RDFQFY GG++DG CG Q DH + A
Sbjct: 248 VITITGYEDVPANSETSLLKALAHQPVSVGIAAGSRDFQFYKGGIFDGECGIQPDHALTA 307
Query: 299 VGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
VGYGS G DYII+KNSWG WGE+GY R++R TGKPEG+C I K+ASYP K
Sbjct: 308 VGYGSYYGQDYIIMKNSWGKNWGEQGYFRIRRGTGKPEGVCDIYKIASYPTK 359
>gi|125540888|gb|EAY87283.1| hypothetical protein OsI_08685 [Oryza sativa Indica Group]
Length = 357
Score = 457 bits (1175), Expect = e-126, Method: Compositional matrix adjust.
Identities = 220/347 (63%), Positives = 261/347 (75%), Gaps = 4/347 (1%)
Query: 8 KTILISFCISFFIRSSFA--RDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEK 65
K ++ + F S+ A D S+VGYS EDL +KL+ LF SW K K+Y S EK
Sbjct: 4 KLSMLFLLLGFVACSATASHHDPSVVGYSQEDLALPNKLVGLFTSWSVKHSKIYASPKEK 63
Query: 66 LERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSH-- 123
++R+EIFK NLRHI ETNR+ +YWLGLN FAD+ HEEFK +LGLKP LARR Q H
Sbjct: 64 VKRYEIFKRNLRHIVETNRRNGSYWLGLNHFADIAHEEFKASYLGLKPGLARRDAQPHGS 123
Query: 124 EDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSE 183
F Y + V+LP +VDWRKKGAVT VKNQG CGSCWAFSTVAAVEGINQIVTG L SLSE
Sbjct: 124 TTFRYANAVNLPWAVDWRKKGAVTPVKNQGECGSCWAFSTVAAVEGINQIVTGKLVSLSE 183
Query: 184 QELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTIN 243
QEL+DCDNT+N+GC GGLMD+AF YI+ G++ EEDYPY+MEEG C + S+V+TI
Sbjct: 184 QELMDCDNTFNHGCRGGLMDFAFAYIMGNQGIYTEEDYPYLMEEGYCREKQPHSKVITIT 243
Query: 244 GYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGS 303
GY DVP+NSE SLLKALA+QP+SV I A RDFQFY GG++DG CG Q DH + AVGYGS
Sbjct: 244 GYEDVPENSETSLLKALAHQPVSVGIAAGSRDFQFYKGGIFDGECGIQPDHALTAVGYGS 303
Query: 304 TRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
G DYII+KNSWG WGE+GY R++R TGKPEG+C I K+ASYP K
Sbjct: 304 YYGQDYIIMKNSWGKNWGEQGYFRIRRGTGKPEGVCDIYKIASYPTK 350
>gi|357143305|ref|XP_003572875.1| PREDICTED: xylem cysteine proteinase 1-like [Brachypodium
distachyon]
Length = 473
Score = 443 bits (1140), Expect = e-122, Method: Compositional matrix adjust.
Identities = 210/350 (60%), Positives = 263/350 (75%), Gaps = 1/350 (0%)
Query: 1 MALSSQFKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYE 60
MA+ S+ +S + S+ D S+VGYS EDL KL+DLF SW K K+Y
Sbjct: 1 MAMGSKLSLFFLSLGFVAYSSSASHNDPSVVGYSQEDLALPYKLVDLFSSWSVKHSKIYV 60
Query: 61 SLDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKD 120
S +EK++R+E+FK NL+HI ETNR+ +YWLGLN+FAD+ HEEFK +LGLK +
Sbjct: 61 SPEEKVKRYEVFKQNLKHIVETNRRNGSYWLGLNQFADVAHEEFKSTYLGLKTGM-DGPA 119
Query: 121 QSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLAS 180
++ F Y++ V+LP SVDWRKKGAVT VKNQG CGSCWAFSTVAAVEGINQI TG L S
Sbjct: 120 RAPTAFRYENSVNLPWSVDWRKKGAVTPVKNQGECGSCWAFSTVAAVEGINQIATGKLES 179
Query: 181 LSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVV 240
LSEQEL+DCD T+++GC GG MD+AF YI+ G+H ++DYPY+MEEG C+ + +S+VV
Sbjct: 180 LSEQELMDCDTTFDHGCGGGFMDFAFAYIMGNLGIHTDDDYPYLMEEGYCKEKQPQSKVV 239
Query: 241 TINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVG 300
TI+GY DVP+NSE SLLKALA+QP+SV I A +DFQFY GV++G CGT+LDH + AVG
Sbjct: 240 TISGYEDVPENSEVSLLKALAHQPISVGIAAGSKDFQFYKRGVFEGSCGTELDHALTAVG 299
Query: 301 YGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
YGS+ G DYII+KNSWG WGE+GY R+KR TGKPEG+C I MASYP K
Sbjct: 300 YGSSDGQDYIIMKNSWGKSWGEQGYFRIKRGTGKPEGVCSIYSMASYPTK 349
>gi|222629922|gb|EEE62054.1| hypothetical protein OsJ_16838 [Oryza sativa Japonica Group]
Length = 336
Score = 441 bits (1135), Expect = e-121, Method: Compositional matrix adjust.
Identities = 213/300 (71%), Positives = 239/300 (79%), Gaps = 5/300 (1%)
Query: 55 FEKVYESLDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPD 114
+ K Y S +EK+ RFE+FKDNL HID+ N+K+ +YWLGLNEFADL H+EFK +LGL P
Sbjct: 36 YRKAYASFEEKVRRFEVFKDNLNHIDDINKKVTSYWLGLNEFADLTHDEFKATYLGLTPP 95
Query: 115 LARR--KDQSHEDFSYKDVV--DLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGI 170
R K S E+F Y + ++PK +DWRKK AVT VKNQG CGSCWAFSTVAAVEGI
Sbjct: 96 PTRSNSKHYSSEEFRYGKMSNGEVPKEMDWRKKNAVTEVKNQGQCGSCWAFSTVAAVEGI 155
Query: 171 NQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTC 230
N IVTGNL SLSEQELIDC NNGCNGGLMDYAF YI STGGL EE YPY MEEG C
Sbjct: 156 NAIVTGNLTSLSEQELIDCSTDGNNGCNGGLMDYAFSYIASTGGLRTEEAYPYAMEEGDC 215
Query: 231 EMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGT 290
+ KG + VVTI+GY DVP N E +L+KALA+QP+SVAIEASGR FQFYSGGV+DG CG
Sbjct: 216 DEGKGAA-VVTISGYEDVPANDEQALVKALAHQPVSVAIEASGRHFQFYSGGVFDGPCGE 274
Query: 291 QLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
QLDHGV AVGYG+++G DYIIVKNSWGP WGEKGYIRMKR TGK EGLCGINKMASYP K
Sbjct: 275 QLDHGVTAVGYGTSKGQDYIIVKNSWGPHWGEKGYIRMKRGTGKGEGLCGINKMASYPTK 334
>gi|242066206|ref|XP_002454392.1| hypothetical protein SORBIDRAFT_04g029960 [Sorghum bicolor]
gi|241934223|gb|EES07368.1| hypothetical protein SORBIDRAFT_04g029960 [Sorghum bicolor]
Length = 356
Score = 436 bits (1121), Expect = e-120, Method: Compositional matrix adjust.
Identities = 209/343 (60%), Positives = 254/343 (74%), Gaps = 2/343 (0%)
Query: 10 ILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERF 69
+L+ F +S RD S+VGYS EDL ++L++LF+SW K K+Y S EKL+R+
Sbjct: 7 VLVLFLAFAACSASHHRDPSVVGYSQEDLALPNRLVNLFKSWSVKHRKIYVSPKEKLKRY 66
Query: 70 EIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHED--FS 127
IFK NL HI ETNRK +YWLGLN+FAD+ HEEFK LGLK L+R Q+ F
Sbjct: 67 GIFKQNLMHIAETNRKNGSYWLGLNQFADITHEEFKANHLGLKQGLSRMGAQTRTPTTFR 126
Query: 128 YKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELI 187
Y +LP SVDWR KGAVT VKNQG CGSCWAFS+VAAVEGINQIVTG L SLSEQEL+
Sbjct: 127 YAAAANLPWSVDWRYKGAVTPVKNQGKCGSCWAFSSVAAVEGINQIVTGKLVSLSEQELM 186
Query: 188 DCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHD 247
DCD ++GC GGLMD+AF YI+ + G+H E+DYPY+MEEG C+ + + VVTI GY D
Sbjct: 187 DCDTMLDHGCEGGLMDFAFAYIMGSQGIHAEDDYPYLMEEGYCKEKQPYANVVTITGYED 246
Query: 248 VPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGL 307
VP+NSE SLLKALA+QP+SV I A RDFQFY GGV+DG C +LDH + AVGYGS+ G
Sbjct: 247 VPENSEISLLKALAHQPVSVGIAAGSRDFQFYKGGVFDGSCSDELDHALTAVGYGSSYGQ 306
Query: 308 DYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
+YI +KNSWG WGE+GY+R+K TGKPEG+CGI MASYP+K
Sbjct: 307 NYITMKNSWGKNWGEQGYVRIKMGTGKPEGVCGIYTMASYPVK 349
>gi|244539471|dbj|BAH82657.1| cysteine protease [Lotus japonicus]
Length = 286
Score = 431 bits (1107), Expect = e-118, Method: Compositional matrix adjust.
Identities = 204/251 (81%), Positives = 230/251 (91%), Gaps = 2/251 (0%)
Query: 42 DKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRH 101
DKLI+LFESWMS+ K+YES++EKL RFEIFKDNL+HIDETN+ + NYWLGLNEFADL H
Sbjct: 2 DKLIELFESWMSRHGKIYESIEEKLLRFEIFKDNLKHIDETNKVVSNYWLGLNEFADLSH 61
Query: 102 EEFKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAF 161
EFK+ +LGLK D + R++ S E+F+Y+DV DLPKSVDWRKKGAVT++KNQGSCGSCWAF
Sbjct: 62 HEFKKQYLGLKVDFSTRRESS-EEFTYRDV-DLPKSVDWRKKGAVTNIKNQGSCGSCWAF 119
Query: 162 STVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDY 221
STVAAVEGINQIVTGNL SLSEQELIDCD TYN+GCNGGLMDYAF +IV GGLHKE+DY
Sbjct: 120 STVAAVEGINQIVTGNLTSLSEQELIDCDRTYNSGCNGGLMDYAFSFIVENGGLHKEDDY 179
Query: 222 PYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSG 281
PYIMEEGTCEM+K ES+VVTI+GYHDVPQN+E SLLKALANQPLSVAIEASGRDFQFYSG
Sbjct: 180 PYIMEEGTCEMSKEESQVVTISGYHDVPQNNEQSLLKALANQPLSVAIEASGRDFQFYSG 239
Query: 282 GVYDGHCGTQL 292
GV+DGHCGTQL
Sbjct: 240 GVFDGHCGTQL 250
>gi|116786779|gb|ABK24233.1| unknown [Picea sitchensis]
Length = 463
Score = 414 bits (1063), Expect = e-113, Method: Compositional matrix adjust.
Identities = 205/348 (58%), Positives = 252/348 (72%), Gaps = 6/348 (1%)
Query: 10 ILISFCI---SFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKL 66
IL+ F + S S+ DFSI+GY +DL +D +++L+E W+++ +K Y L EK
Sbjct: 3 ILLLFAVLALSAMAGSASRADFSIIGYDSKDLREDDAIMELYELWLAQHKKAYNGLGEKQ 62
Query: 67 ERFEIFKDNLRHIDE-TNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHED 125
RF +FKDN +I + N+ +Y LGLN+FADL HEEFK +LG K D +R S
Sbjct: 63 NRFSVFKDNFLYIHQHNNQGNPSYKLGLNQFADLSHEEFKATYLGAKLDTKKRLSNSPSP 122
Query: 126 -FSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQ 184
+ Y D DLP+S+DWR+KGAVT VK+QGSCGSCWAFSTVAAVEGINQIVTGNL SLSEQ
Sbjct: 123 RYQYSDGEDLPESIDWREKGAVTAVKDQGSCGSCWAFSTVAAVEGINQIVTGNLTSLSEQ 182
Query: 185 ELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTING 244
EL+DCD +YN GCNGGLMDYAFQ+I++ GGL E+DYPY +G+C+ + + VVTI+
Sbjct: 183 ELVDCDTSYNQGCNGGLMDYAFQFIINNGGLDSEDDYPYKANDGSCDAYRKNAHVVTIDD 242
Query: 245 YHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGST 304
Y DVP+N E SL KA ANQP+SVAIEASGR FQFY GV+ CGTQLDHGV VGYGS
Sbjct: 243 YEDVPENDEKSLKKAAANQPISVAIEASGRAFQFYESGVFTSTCGTQLDHGVTLVGYGSE 302
Query: 305 RGLDYIIVKNSWGPKWGEKGYIRMKRNT-GKPEGLCGINKMASYPIKK 351
G DY IVKNSWG WGEKG+IR++RN G G+CGI ASYP+KK
Sbjct: 303 SGTDYWIVKNSWGKSWGEKGFIRLQRNIEGVSTGMCGIAMEASYPLKK 350
>gi|1208549|gb|AAC49455.1| Pseudotzain [Pseudotsuga menziesii]
Length = 454
Score = 413 bits (1062), Expect = e-113, Method: Compositional matrix adjust.
Identities = 203/348 (58%), Positives = 254/348 (72%), Gaps = 6/348 (1%)
Query: 10 ILISFCI---SFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKL 66
IL+ F + S S+ DFSI+ Y +DL +D +++L+E W+++ +K Y LDEK
Sbjct: 3 ILLLFAVLALSAMAGSASRADFSIISYDSQDLIGDDAIMELYELWLAQHKKAYNGLDEKQ 62
Query: 67 ERFEIFKDNLRHIDE-TNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHED 125
++F +FKDN +I + N+ +Y LGLN+FADL HEEFK +LG K D +R +S
Sbjct: 63 KKFSVFKDNFLYIHQHNNQGNPSYKLGLNQFADLSHEEFKAAYLGTKLDAKKRLSRSPSP 122
Query: 126 -FSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQ 184
+ Y DLP+S+DWR+KGAVT VKNQGSCGSCWAFSTVAAVEGINQIVTGNL SLSEQ
Sbjct: 123 RYQYSVGEDLPESIDWREKGAVTAVKNQGSCGSCWAFSTVAAVEGINQIVTGNLTSLSEQ 182
Query: 185 ELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTING 244
EL+DCD +YN GCNGGLMDYAFQ+I+S GGL E+DYPY G+C+ + + VVTI+
Sbjct: 183 ELVDCDTSYNQGCNGGLMDYAFQFIISNGGLDSEDDYPYKANNGSCDAYRKNAHVVTIDD 242
Query: 245 YHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGST 304
Y DVP+N E SL KA ANQP+SVAIEASGR FQFY GV+ +CGTQLDHGV VGYGS
Sbjct: 243 YEDVPENDEKSLKKAAANQPISVAIEASGRAFQFYESGVFTSNCGTQLDHGVTLVGYGSE 302
Query: 305 RGLDYIIVKNSWGPKWGEKGYIRMKRNT-GKPEGLCGINKMASYPIKK 351
G+DY +VKNSWG WGEKG+I+++RN G G+CGI ASYP+KK
Sbjct: 303 SGIDYWLVKNSWGNSWGEKGFIKLQRNLEGASTGMCGIAMEASYPVKK 350
>gi|62526575|gb|AAX84673.1| cysteine protease CP1 [Manihot esculenta]
Length = 467
Score = 404 bits (1039), Expect = e-110, Method: Compositional matrix adjust.
Identities = 195/357 (54%), Positives = 257/357 (71%), Gaps = 8/357 (2%)
Query: 1 MALSSQFKTILISFCISFFIRSSFARDFSIVGYSPEDLT-----SNDKLIDLFESWMSKF 55
M L + + +SF + S A D SI+ Y T ++D+++ ++E W+ K
Sbjct: 2 MGLFGSSAAMFVLLFLSFTLSS--ASDMSIISYDQTHATKSSWRTDDEVMAIYEEWLVKQ 59
Query: 56 EKVYESLDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDL 115
KVY +L E+ +RF++FKDNLR IDE N + + Y LGLN FADL +EE++ +LG + +
Sbjct: 60 GKVYNALGEREKRFQVFKDNLRFIDEHNSENRTYKLGLNGFADLTNEEYRSTYLGARGGM 119
Query: 116 AR-RKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIV 174
R R ++ + ++ + LP SVDWRK+GAV VK+QGSCGSCWAFST+AAVEGIN+IV
Sbjct: 120 KRNRLRKTSDRYAPRVGESLPDSVDWRKEGAVAEVKDQGSCGSCWAFSTIAAVEGINKIV 179
Query: 175 TGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTK 234
TG+L SLSEQEL+DCD +YN GCNGGLMDYAF++I++ GG+ EEDYPY+ +G C+ +
Sbjct: 180 TGDLISLSEQELVDCDTSYNEGCNGGLMDYAFEFIINNGGIDTEEDYPYLARDGRCDTYR 239
Query: 235 GESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDH 294
++VVTI+ Y DVP NSE +L KA+ANQP+SVAIEA GRDFQFY+ G++ G CGTQLDH
Sbjct: 240 KNAKVVTIDDYEDVPVNSETALQKAVANQPVSVAIEAGGRDFQFYASGIFSGRCGTQLDH 299
Query: 295 GVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIKK 351
GVAAVGYG+ G DY IV+NSWG WGE GY+RM R+ P G+CGI ASYPIKK
Sbjct: 300 GVAAVGYGTENGKDYWIVRNSWGKSWGENGYLRMARSINSPTGICGIAMEASYPIKK 356
>gi|116787404|gb|ABK24495.1| unknown [Picea sitchensis]
gi|224286306|gb|ACN40861.1| unknown [Picea sitchensis]
Length = 452
Score = 404 bits (1038), Expect = e-110, Method: Compositional matrix adjust.
Identities = 196/332 (59%), Positives = 247/332 (74%), Gaps = 4/332 (1%)
Query: 22 SSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDE 81
S+ DFSI+ S +DL +D +++L+E W+++ ++ Y LDEK +RF +FKDN +I E
Sbjct: 18 SASRADFSII--SSKDLREDDAIMELYELWLAEHKRAYNGLDEKQKRFSVFKDNFLYIHE 75
Query: 82 TNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQS-HEDFSYKDVVDLPKSVDW 140
N+ ++Y LGLN+FADL HEEFK +LG K D +R + + Y D DLP+S+DW
Sbjct: 76 HNQGNRSYKLGLNQFADLSHEEFKATYLGAKLDTKKRLSRPPSRRYQYSDGEDLPESIDW 135
Query: 141 RKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGG 200
R+KGAVT VK+QGSCGSCWAFSTVAAVEGINQIVTG+L SLSEQEL+DCD +YN GCNGG
Sbjct: 136 REKGAVTSVKDQGSCGSCWAFSTVAAVEGINQIVTGDLISLSEQELVDCDTSYNQGCNGG 195
Query: 201 LMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKAL 260
LMDYAF++I++ GGL EEDYPY +G+C+ + + VVTI+ Y DVP+N E SL KA
Sbjct: 196 LMDYAFEFIINNGGLDSEEDYPYTAYDGSCDSYRKNAHVVTIDDYEDVPENDEKSLKKAA 255
Query: 261 ANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKW 320
ANQP+SVAIEASGR+FQFY GV+ CGTQLDHGV VGYGS G DY VKNSWG W
Sbjct: 256 ANQPISVAIEASGREFQFYDSGVFTSTCGTQLDHGVTLVGYGSESGTDYWTVKNSWGKSW 315
Query: 321 GEKGYIRMKRNTG-KPEGLCGINKMASYPIKK 351
GE+G+IR++RN G+CGI ASYP+KK
Sbjct: 316 GEEGFIRLQRNIEVASTGMCGIAMEASYPVKK 347
>gi|42573181|ref|NP_974687.1| Xylem cysteine proteinase 1 [Arabidopsis thaliana]
gi|332661102|gb|AEE86502.1| Xylem cysteine proteinase 1 [Arabidopsis thaliana]
Length = 288
Score = 404 bits (1037), Expect = e-110, Method: Compositional matrix adjust.
Identities = 188/277 (67%), Positives = 230/277 (83%), Gaps = 1/277 (0%)
Query: 9 TILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLER 68
++L++ S + +FARDFSIVGY+PE LT+ DKL++LFESWMS+ K Y+S++EK+ R
Sbjct: 12 SLLVAISASALLCCAFARDFSIVGYTPEHLTNTDKLLELFESWMSEHSKAYKSVEEKVHR 71
Query: 69 FEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSY 128
FE+F++NL HID+ N +I +YWLGLNEFADL HEEFK +LGL RK Q +F Y
Sbjct: 72 FEVFRENLMHIDQRNNEINSYWLGLNEFADLTHEEFKGRYLGLAKPQFSRKRQPSANFRY 131
Query: 129 KDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELID 188
+D+ DLPKSVDWRKKGAV VK+QG CGSCWAFSTVAAVEGINQI TGNL+SLSEQELID
Sbjct: 132 RDITDLPKSVDWRKKGAVAPVKDQGQCGSCWAFSTVAAVEGINQITTGNLSSLSEQELID 191
Query: 189 CDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDV 248
CD T+N+GCNGGLMDYAFQYI+STGGLHKE+DYPY+MEEG C+ K + E VTI+GY DV
Sbjct: 192 CDTTFNSGCNGGLMDYAFQYIISTGGLHKEDDYPYLMEEGICQEQKEDVERVTISGYEDV 251
Query: 249 PQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYD 285
P+N ++SL+KALA+QP+SVAIEASGRDFQFY GVY+
Sbjct: 252 PENDDESLVKALAHQPVSVAIEASGRDFQFYK-GVYN 287
>gi|413938554|gb|AFW73105.1| hypothetical protein ZEAMMB73_931917 [Zea mays]
Length = 361
Score = 404 bits (1037), Expect = e-110, Method: Compositional matrix adjust.
Identities = 202/350 (57%), Positives = 245/350 (70%), Gaps = 11/350 (3%)
Query: 9 TILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLER 68
+ + F ++ RD S+VGYS EDL LF SW K K+Y S EKLER
Sbjct: 8 AVFVLFLAFAACSANHHRDPSVVGYSQEDLALPS---SLFRSWSVKHGKLYASPTEKLER 64
Query: 69 FEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARR---KDQSHED 125
+EIFK NL HI ETNRK +YWLGLN+FAD+ HEEFK +LGLK L R + ++
Sbjct: 65 YEIFKQNLMHIAETNRKNGSYWLGLNQFADVAHEEFKASYLGLKRALPRAGAPQTRTPTA 124
Query: 126 FSYKDVV--DLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSE 183
F Y LP SVDWR KGAVT VKNQG CGSCWAFS+VAAVEGINQIVTG L SLSE
Sbjct: 125 FRYAAAAAGSLPWSVDWRYKGAVTPVKNQGKCGSCWAFSSVAAVEGINQIVTGKLVSLSE 184
Query: 184 QELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVT-- 241
QEL+DCD T ++GC GG MD AF Y++ + G+H E+DYPY+MEEG C+ + +T
Sbjct: 185 QELVDCDTTLDHGCEGGTMDLAFAYMMGSQGIHAEDDYPYLMEEGYCKEKQPCVLGITEQ 244
Query: 242 -INGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVG 300
+ G+ DVP+NSE SLLKALA+QP+SV I A RDFQFY GGV+DG C +LDH + AVG
Sbjct: 245 DLTGFEDVPENSEISLLKALAHQPVSVGIAAGSRDFQFYRGGVFDGACSVELDHALTAVG 304
Query: 301 YGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
YGS+ G +YI +KNSWG WGE+GY+R+K TGKPEG+CGI MASYP+K
Sbjct: 305 YGSSYGQNYITMKNSWGKNWGEQGYVRIKMGTGKPEGVCGIYTMASYPVK 354
>gi|109390302|gb|ABG33750.1| cysteine protease [Hevea brasiliensis]
Length = 457
Score = 401 bits (1031), Expect = e-109, Method: Compositional matrix adjust.
Identities = 198/345 (57%), Positives = 246/345 (71%), Gaps = 7/345 (2%)
Query: 14 FCISFFIRS-SFARDFSIVGYSPEDLT-----SNDKLIDLFESWMSKFEKVYESLDEKLE 67
F + FF + S A D SI+ Y T ++D+++ ++E W+ K K Y SL EK
Sbjct: 2 FMLLFFASTLSSASDLSIISYDQSHGTKSSWRTDDEVMAIYEDWLVKHGKAYNSLGEKER 61
Query: 68 RFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFS 127
RFE+FKDNLR IDE N + + Y +GLN FADL +EE++ M+LG + R K + D
Sbjct: 62 RFEVFKDNLRFIDEHNSENRTYRVGLNRFADLTNEEYRSMYLGALSGIRRNKLRKISDRY 121
Query: 128 YKDVVD-LPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQEL 186
V D LP SVDWRK+GAV VK+QGSCGSCWAFS VAAVEGIN+IVTG+L SLSEQEL
Sbjct: 122 TPRVGDSLPDSVDWRKEGAVVGVKDQGSCGSCWAFSAVAAVEGINKIVTGDLISLSEQEL 181
Query: 187 IDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYH 246
+DCDN+YN GCNGGLMDY F++I++ GG+ EEDYPY+ +G C+ + + VV+I+ Y
Sbjct: 182 VDCDNSYNEGCNGGLMDYGFEFIINNGGIDSEEDYPYLARDGRCDTYRKNARVVSIDSYE 241
Query: 247 DVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRG 306
DVP N+E +L KA+ANQP+SVAIEA GRDFQ YS GV+ G CGT LDHGV AVGYG+ G
Sbjct: 242 DVPVNNEAALQKAVANQPVSVAIEAGGRDFQLYSSGVFSGRCGTALDHGVVAVGYGTENG 301
Query: 307 LDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIKK 351
DY IV+NSWG WGE GY+RM RN KP G+CGI ASYPIKK
Sbjct: 302 QDYWIVRNSWGKSWGESGYLRMARNIRKPTGICGIAMEASYPIKK 346
>gi|147790682|emb|CAN61026.1| hypothetical protein VITISV_001146 [Vitis vinifera]
Length = 469
Score = 399 bits (1025), Expect = e-108, Method: Compositional matrix adjust.
Identities = 197/361 (54%), Positives = 252/361 (69%), Gaps = 13/361 (3%)
Query: 1 MALSSQFKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDK--------LIDLFESWM 52
M L ++ + + + S+ A D SI+GY D T DK ++ ++E+W+
Sbjct: 1 MGLCRSSSSMAVFLFLLLGLASASAXDMSIIGY---DETHGDKSSWRTDEDVMAVYEAWL 57
Query: 53 SKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLK 112
+K K Y +L EK RF+IFKDNLR IDE N + + Y +GLN FADL +EE++ M+LG +
Sbjct: 58 AKHGKSYNALGEKERRFQIFKDNLRFIDEHNAENRTYKVGLNRFADLTNEEYRSMYLGTR 117
Query: 113 PDLARRKDQSHED-FSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGIN 171
RR D ++++ LP+SVDWRKKGAV VK+QGSCGSCWAFST+AAVEGIN
Sbjct: 118 TAAKRRSSNKISDRYAFRVGDSLPESVDWRKKGAVVEVKDQGSCGSCWAFSTIAAVEGIN 177
Query: 172 QIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCE 231
+IVTG L SLSEQEL+DCD +YN GCNGGLMDYAF++I++ GG+ EEDYPY +G C+
Sbjct: 178 KIVTGGLISLSEQELVDCDTSYNEGCNGGLMDYAFEFIINNGGIDSEEDYPYKASDGRCD 237
Query: 232 MTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQ 291
+ + VVTI+GY DVP+N E SL KA+ANQP+SVAIEA GR+FQ Y G++ G CGT
Sbjct: 238 QYRKNAXVVTIDGYEDVPENDEKSLEKAVANQPVSVAIEAGGREFQLYQSGIFTGRCGTA 297
Query: 292 LDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTG-KPEGLCGINKMASYPIK 350
LDHGV AVGYG+ G+DY IVKNSWG WGE+GYIRM+R+ G CGI ASYPIK
Sbjct: 298 LDHGVTAVGYGTENGVDYWIVKNSWGASWGEEGYIRMERDLATSATGKCGIAMEASYPIK 357
Query: 351 K 351
K
Sbjct: 358 K 358
>gi|333069454|gb|AEF13978.1| chymopapain [Carica papaya]
Length = 352
Score = 397 bits (1020), Expect = e-108, Method: Compositional matrix adjust.
Identities = 194/350 (55%), Positives = 247/350 (70%), Gaps = 4/350 (1%)
Query: 2 ALSSQFKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYES 61
+SS K I ++ C+ + S A DF VGYS +DLTS ++LI LF+SWM K K+YES
Sbjct: 3 TMSSISKIIFLATCLIIHMSLSSA-DFYTVGYSQDDLTSIERLIQLFDSWMLKHNKIYES 61
Query: 62 LDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLG-LKPDLARRKD 120
+DEK+ RFEIF+DNL +IDETN+K +YWLGLN FADL ++EFK+ ++G + D +
Sbjct: 62 IDEKIYRFEIFRDNLMYIDETNKKNNSYWLGLNGFADLSNDEFKKKYVGSVAEDFTGLEH 121
Query: 121 QSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLAS 180
+EDF+YK V + P+S+DWR KGAVT VKNQGSCGSCWAFST+A VEG+N+IVTGNL
Sbjct: 122 FDNEDFTYKHVTNYPQSIDWRAKGAVTPVKNQGSCGSCWAFSTIATVEGVNKIVTGNLLE 181
Query: 181 LSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVV 240
LSEQEL+DCD ++GC GG + QY V+ G+H + YPY + C T V
Sbjct: 182 LSEQELVDCDKN-SHGCKGGYQTTSLQY-VADNGVHTSKVYPYQAKAMQCRATDKPGPKV 239
Query: 241 TINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVG 300
I GY VP N E S L ALANQPLSV +EA G+ FQ Y GV+DG CGT+LDH V AVG
Sbjct: 240 KITGYKRVPSNCETSFLGALANQPLSVLVEAGGKPFQLYKSGVFDGPCGTKLDHAVTAVG 299
Query: 301 YGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
YG++ G +YII+KNSWGP WGEKGY+R+KR +G +G CG+ K + YP K
Sbjct: 300 YGTSDGKNYIIIKNSWGPNWGEKGYMRLKRQSGNSQGTCGVYKSSYYPFK 349
>gi|2507252|sp|P14080.2|PAPA2_CARPA RecName: Full=Chymopapain; AltName: Full=Papaya proteinase II;
Short=PPII; Flags: Precursor
gi|1332461|emb|CAA66378.1| chymopapain [Carica papaya]
Length = 352
Score = 396 bits (1018), Expect = e-108, Method: Compositional matrix adjust.
Identities = 194/350 (55%), Positives = 248/350 (70%), Gaps = 4/350 (1%)
Query: 2 ALSSQFKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYES 61
+SS K I ++ C+ + S A DF VGYS +DLTS ++LI LF+SWM K K+YES
Sbjct: 3 TMSSISKIIFLATCLIIHMGLSSA-DFYTVGYSQDDLTSIERLIQLFDSWMLKHNKIYES 61
Query: 62 LDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLG-LKPDLARRKD 120
+DEK+ RFEIF+DNL +IDETN+K +YWLGLN FADL ++EFK+ ++G + D +
Sbjct: 62 IDEKIYRFEIFRDNLMYIDETNKKNNSYWLGLNGFADLSNDEFKKKYVGFVAEDFTGLEH 121
Query: 121 QSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLAS 180
+EDF+YK V + P+S+DWR KGAVT VKNQG+CGSCWAFST+A VEGIN+IVTGNL
Sbjct: 122 FDNEDFTYKHVTNYPQSIDWRAKGAVTPVKNQGACGSCWAFSTIATVEGINKIVTGNLLE 181
Query: 181 LSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVV 240
LSEQEL+DCD ++ GC GG + QY V+ G+H + YPY ++ C T V
Sbjct: 182 LSEQELVDCDK-HSYGCKGGYQTTSLQY-VANNGVHTSKVYPYQAKQYKCRATDKPGPKV 239
Query: 241 TINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVG 300
I GY VP N E S L ALANQPLSV +EA G+ FQ Y GV+DG CGT+LDH V AVG
Sbjct: 240 KITGYKRVPSNCETSFLGALANQPLSVLVEAGGKPFQLYKSGVFDGPCGTKLDHAVTAVG 299
Query: 301 YGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
YG++ G +YII+KNSWGP WGEKGY+R+KR +G +G CG+ K + YP K
Sbjct: 300 YGTSDGKNYIIIKNSWGPNWGEKGYMRLKRQSGNSQGTCGVYKSSYYPFK 349
>gi|225458701|ref|XP_002284973.1| PREDICTED: cysteine proteinase RD21a-like [Vitis vinifera]
Length = 467
Score = 396 bits (1017), Expect = e-108, Method: Compositional matrix adjust.
Identities = 194/337 (57%), Positives = 244/337 (72%), Gaps = 13/337 (3%)
Query: 25 ARDFSIVGYSPEDLTSNDK--------LIDLFESWMSKFEKVYESLDEKLERFEIFKDNL 76
A D SI+GY D T DK ++ ++E+W++K K Y +L EK RF+IFKDNL
Sbjct: 23 ALDMSIIGY---DETHGDKSSWRTDEDVMAVYEAWLAKHGKSYNALGEKERRFQIFKDNL 79
Query: 77 RHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHED-FSYKDVVDLP 135
R IDE N + + Y +GLN FADL +EE++ M+LG + RR D ++++ LP
Sbjct: 80 RFIDEHNAENRTYKVGLNRFADLTNEEYRSMYLGTRTAAKRRSSNKISDRYAFRVGDSLP 139
Query: 136 KSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNN 195
+SVDWRKKGAV VK+QGSCGSCWAFST+AAVEGIN+IVTG L SLSEQEL+DCD +YN
Sbjct: 140 ESVDWRKKGAVVEVKDQGSCGSCWAFSTIAAVEGINKIVTGGLISLSEQELVDCDTSYNE 199
Query: 196 GCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDS 255
GCNGGLMDYAF++I++ GG+ EEDYPY +G C+ + ++VVTI+GY DVP+N E S
Sbjct: 200 GCNGGLMDYAFEFIINNGGIDSEEDYPYKASDGRCDQYRKNAKVVTIDGYEDVPENDEKS 259
Query: 256 LLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNS 315
L KA+ANQP+SVAIEA GR+FQ Y G++ G CGT LDHGV AVGYG+ G+DY IVKNS
Sbjct: 260 LEKAVANQPVSVAIEAGGREFQLYQSGIFTGRCGTALDHGVTAVGYGTENGVDYWIVKNS 319
Query: 316 WGPKWGEKGYIRMKRNTG-KPEGLCGINKMASYPIKK 351
WG WGE+GYIRM+R+ G CGI ASYPIKK
Sbjct: 320 WGASWGEEGYIRMERDLATSATGKCGIAMEASYPIKK 356
>gi|146216000|gb|ABQ10202.1| cysteine protease Cp4 [Actinidia deliciosa]
Length = 463
Score = 395 bits (1016), Expect = e-107, Method: Compositional matrix adjust.
Identities = 191/346 (55%), Positives = 243/346 (70%), Gaps = 9/346 (2%)
Query: 9 TILISFCISFFIRSSFARDFSIVGYS---PEDLTSNDKLIDLFESWMSKFEKVYESLDEK 65
L+ C +F S A D SI+ Y P T + + ++E W++ K Y ++ EK
Sbjct: 10 ACLLFLCFAF----SSALDMSIISYDQTHPPQRTDAEAMA-IYEKWLTTHGKAYNAIGEK 64
Query: 66 LERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHED 125
RFEIFKDNLR +DE N +Y +GLN FADL +EE++ MFLG ++ R + D
Sbjct: 65 ERRFEIFKDNLRFVDEHNAVAGSYRVGLNRFADLTNEEYRSMFLGGNMEMKERSASTKSD 124
Query: 126 -FSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQ 184
++++ LP SVDWR+KGAV+ VK+QG CGSCWAFST++AVEGINQIVTG L SLSEQ
Sbjct: 125 RYAFRAGDKLPGSVDWREKGAVSPVKDQGQCGSCWAFSTISAVEGINQIVTGELISLSEQ 184
Query: 185 ELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTING 244
EL+DCD +YN GCNGGLMDY FQ+I++ GG+ EEDYPY +GTC+ + + VV+ING
Sbjct: 185 ELVDCDKSYNMGCNGGLMDYGFQFIINNGGIDTEEDYPYRAVDGTCDQFRKNARVVSING 244
Query: 245 YHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGST 304
Y DVP++ E+SL KA+ANQP+SVAIEA GR FQ Y GV+ GHCGT LDHGV AVGYG+
Sbjct: 245 YEDVPEDDENSLKKAVANQPVSVAIEAGGRAFQLYESGVFTGHCGTNLDHGVVAVGYGTE 304
Query: 305 RGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
G+DY V+NSWGPKWGE GYI+++RN G CGI MASYP K
Sbjct: 305 NGVDYWTVRNSWGPKWGENGYIKLERNINATSGKCGIASMASYPTK 350
>gi|4469153|emb|CAB38314.1| chymopapain isoform II [Carica papaya]
Length = 352
Score = 394 bits (1012), Expect = e-107, Method: Compositional matrix adjust.
Identities = 193/350 (55%), Positives = 247/350 (70%), Gaps = 4/350 (1%)
Query: 2 ALSSQFKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYES 61
+SS K I ++ C+ + S A DF VGYS +DLTS ++LI LF+SWM K K+YES
Sbjct: 3 TMSSISKIIFLATCLIIHMGLSSA-DFYTVGYSQDDLTSIERLIQLFDSWMLKHNKIYES 61
Query: 62 LDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLG-LKPDLARRKD 120
+DEK+ RFEIF+DNL +IDETN+K +YWLGLN FADL ++EFK+ ++G + D +
Sbjct: 62 IDEKIYRFEIFRDNLMYIDETNKKNNSYWLGLNGFADLSNDEFKKKYVGFVAEDFTGLEH 121
Query: 121 QSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLAS 180
+EDF+YK V + P+S+DWR KGAVT VKNQG+CGSCWAFST+A VEGIN+IVTGNL
Sbjct: 122 FDNEDFTYKHVTNYPQSIDWRAKGAVTPVKNQGACGSCWAFSTIATVEGINKIVTGNLLE 181
Query: 181 LSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVV 240
LSEQEL+DCD ++ GC GG + QY V+ G+H + YPY ++ C T V
Sbjct: 182 LSEQELVDCDK-HSYGCKGGYQTTSLQY-VANNGVHTSKVYPYQAKQYKCRATDKPGPKV 239
Query: 241 TINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVG 300
I GY VP N E S L ALANQPLS +EA G+ FQ Y GV+DG CGT+LDH V AVG
Sbjct: 240 KITGYKRVPSNCETSFLGALANQPLSFLVEAGGKPFQLYKSGVFDGPCGTKLDHAVTAVG 299
Query: 301 YGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
YG++ G +YII+KNSWGP WGEKGY+R+KR +G +G CG+ K + YP K
Sbjct: 300 YGTSDGKNYIIIKNSWGPNWGEKGYMRLKRQSGNSQGTCGVYKSSYYPFK 349
>gi|302816222|ref|XP_002989790.1| hypothetical protein SELMODRAFT_184826 [Selaginella moellendorffii]
gi|300142356|gb|EFJ09057.1| hypothetical protein SELMODRAFT_184826 [Selaginella moellendorffii]
Length = 358
Score = 391 bits (1004), Expect = e-106, Method: Compositional matrix adjust.
Identities = 184/310 (59%), Positives = 224/310 (72%), Gaps = 9/310 (2%)
Query: 47 LFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKI-KNYWLGLNEFADLRHEEFK 105
L+E WM +VY + EK RF+IF+DN +I+E NR++ + YWLGLN FAD+ H+EFK
Sbjct: 33 LYEKWMVDHGRVYNGIGEKERRFQIFRDNAEYIEEHNRQVNQTYWLGLNNFADMTHDEFK 92
Query: 106 EMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVA 165
++ G K L+ + F YKD +LP DWR KGAV VKNQG+CGSCWAFSTVA
Sbjct: 93 ALYFGTKVPLS---NTIKSGFRYKDATNLPLDTDWRSKGAVATVKNQGACGSCWAFSTVA 149
Query: 166 AVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIM 225
AVEG+NQIVTG L SLSEQEL+DCD N GCNGGLMD AF++I+ GGL E DYPY
Sbjct: 150 AVEGVNQIVTGELVSLSEQELVDCDKQKNQGCNGGLMDSAFEFIIQNGGLDSEADYPYKA 209
Query: 226 EEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYD 285
G+C+ ++ S VVTI+G+ DVP SE LLKA+ANQP+SVAIEASGR+FQ YSGGVY
Sbjct: 210 VSGSCDESRRNSHVVTIDGFEDVPAESEADLLKAVANQPVSVAIEASGRNFQLYSGGVYT 269
Query: 286 GHCGTQLDHGVAAVGYGSTR-----GLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCG 340
GHCG +LDHGV AVGYG+++ DY IV+NSWG WGE GYIR++RN P G CG
Sbjct: 270 GHCGYELDHGVVAVGYGTSKTPDGVATDYWIVRNSWGDAWGESGYIRLQRNVASPRGKCG 329
Query: 341 INKMASYPIK 350
I MASYP+K
Sbjct: 330 IAMMASYPVK 339
>gi|255538210|ref|XP_002510170.1| cysteine protease, putative [Ricinus communis]
gi|223550871|gb|EEF52357.1| cysteine protease, putative [Ricinus communis]
Length = 469
Score = 390 bits (1003), Expect = e-106, Method: Compositional matrix adjust.
Identities = 192/338 (56%), Positives = 242/338 (71%), Gaps = 9/338 (2%)
Query: 23 SFARDFSIVGYSPEDLT-----SNDKLIDLFESWMSKFEKVYE---SLDEKLERFEIFKD 74
S A D SIV Y LT ++D+++ ++E W+ K K + +L EK RF++FKD
Sbjct: 21 SSALDMSIVSYDQTHLTKSSWRTDDEVMAIYEEWLVKNGKAHSNNNALGEKERRFQVFKD 80
Query: 75 NLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVD- 133
NLR IDE N + ++Y +GLN FADL +EE++ M+LG + R + + V D
Sbjct: 81 NLRFIDEHNSENRSYKVGLNRFADLTNEEYRSMYLGARSGAKRNRLSRSSNRYLPRVGDS 140
Query: 134 LPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTY 193
LP SVDWRK+GAV VK+QGSCGSCWAFST+AAVEGIN+IVTG+L SLSEQEL+DCD +Y
Sbjct: 141 LPDSVDWRKEGAVAEVKDQGSCGSCWAFSTIAAVEGINKIVTGDLISLSEQELVDCDRSY 200
Query: 194 NNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSE 253
N GCNGGLMDYAFQ+I++ GG+ EEDYPY+ +GTC+ + ++VVTI+ Y DVP N E
Sbjct: 201 NEGCNGGLMDYAFQFIINNGGIDSEEDYPYLARDGTCDTYRKNAKVVTIDNYEDVPVNDE 260
Query: 254 DSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVK 313
+L KA+ANQP+SVAIEA GR+FQFY G++ G CGT LDHGVAAVGYG+ G DY IV+
Sbjct: 261 KALQKAVANQPVSVAIEAGGREFQFYQSGIFTGRCGTALDHGVAAVGYGTENGKDYWIVR 320
Query: 314 NSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIKK 351
NSWG WGE GYIRM+RN G CGI SYPIKK
Sbjct: 321 NSWGKSWGESGYIRMERNIATATGKCGIAIEPSYPIKK 358
>gi|224103643|ref|XP_002313136.1| predicted protein [Populus trichocarpa]
gi|222849544|gb|EEE87091.1| predicted protein [Populus trichocarpa]
Length = 477
Score = 390 bits (1003), Expect = e-106, Method: Compositional matrix adjust.
Identities = 196/344 (56%), Positives = 240/344 (69%), Gaps = 12/344 (3%)
Query: 18 FFIRSSFARDFSIVGYS------PEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEI 71
+F+ A D SI+ Y+ PE + + + L+E W+ K+ K Y +L EK RFEI
Sbjct: 15 YFLSVCLAIDMSIIDYNLKHGQVPE--RTEAETLRLYEMWLVKYGKAYNALGEKERRFEI 72
Query: 72 FKDNLRHIDETNR-KIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARR--KDQSHEDFSY 128
FKDNL+ +D+ N +Y LGLN+FADL +EE++ +LG + D RR + +
Sbjct: 73 FKDNLKFVDQHNSVGNPSYKLGLNKFADLSNEEYRAAYLGTRMDGKRRLLGGPKSARYLF 132
Query: 129 KDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELID 188
KD DLP+SVDWR+KGAV VK+QG CGSCWAFSTV AVEGINQIVTGNL SLSEQEL+D
Sbjct: 133 KDGDDLPESVDWREKGAVAPVKDQGQCGSCWAFSTVGAVEGINQIVTGNLTSLSEQELVD 192
Query: 189 CDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDV 248
CD YN GCNGGLMDYAF++I+ GG+ EEDYPY + C+ + + VVTI+GY DV
Sbjct: 193 CDKVYNQGCNGGLMDYAFEFIMKNGGIDTEEDYPYKAVDSMCDPNRKNARVVTIDGYEDV 252
Query: 249 PQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLD 308
PQN E SL KA+ANQP+SVAIEA GR FQ Y GV+ G CGTQLDHGV AVGYG+ G+D
Sbjct: 253 PQNDEKSLRKAVANQPVSVAIEAGGRAFQLYQSGVFTGSCGTQLDHGVVAVGYGTENGVD 312
Query: 309 YIIVKNSWGPKWGEKGYIRMKRNTGKPE-GLCGINKMASYPIKK 351
Y +V+NSWGP WGE GYIRM+RN E G CGI ASYP KK
Sbjct: 313 YWVVRNSWGPAWGENGYIRMERNVASTETGKCGIAMEASYPTKK 356
>gi|4469155|emb|CAB38315.1| chymopapain isoform III [Carica papaya]
Length = 361
Score = 390 bits (1002), Expect = e-106, Method: Compositional matrix adjust.
Identities = 192/350 (54%), Positives = 246/350 (70%), Gaps = 4/350 (1%)
Query: 2 ALSSQFKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYES 61
+SS K I ++ C+ + S A DF VGYS +DLTS ++LI LF+SWM K K+YES
Sbjct: 3 TMSSISKIIFLATCLIIHMGLSSA-DFYTVGYSQDDLTSIERLIQLFDSWMLKHNKIYES 61
Query: 62 LDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLG-LKPDLARRKD 120
+DEK+ RFEIF+DNL +IDETN+K +YWLGLN FADL ++EFK+ ++G + D +
Sbjct: 62 IDEKIYRFEIFRDNLMYIDETNKKNNSYWLGLNGFADLSNDEFKKKYVGFVAEDFTGLEH 121
Query: 121 QSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLAS 180
+EDF+YK V + P+S+DWR KGAVT VKNQG+CGSCWAFST+A VEGIN+IVTGNL
Sbjct: 122 FDNEDFTYKHVTNYPQSIDWRAKGAVTPVKNQGACGSCWAFSTIATVEGINKIVTGNLLE 181
Query: 181 LSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVV 240
LSEQEL+DCD ++ GC GG + QY V+ G+H + YP ++ C T V
Sbjct: 182 LSEQELVDCDK-HSYGCKGGYQTTSLQY-VANNGVHTSKVYPCQAKQYKCRATDKPGPKV 239
Query: 241 TINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVG 300
I GY VP N E S L ALANQPLS +EA G+ FQ Y GV+DG CGT+LDH V AVG
Sbjct: 240 KITGYKRVPSNCETSFLGALANQPLSFLVEAGGKPFQLYKSGVFDGPCGTKLDHAVTAVG 299
Query: 301 YGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
YG++ G +YII+KNSWGP WGEKGY+R+KR +G +G CG+ K + YP K
Sbjct: 300 YGTSDGKNYIIIKNSWGPNWGEKGYMRLKRQSGNSQGTCGVYKSSYYPFK 349
>gi|224136808|ref|XP_002326950.1| predicted protein [Populus trichocarpa]
gi|222835265|gb|EEE73700.1| predicted protein [Populus trichocarpa]
Length = 456
Score = 390 bits (1001), Expect = e-106, Method: Compositional matrix adjust.
Identities = 186/347 (53%), Positives = 250/347 (72%), Gaps = 7/347 (2%)
Query: 10 ILISFCISFFIRSSFARDFSIVGYSPEDLT-----SNDKLIDLFESWMSKFEKVYESLDE 64
+L+ + F + S+F D SI+ Y T ++D+++ ++E W+ K K Y +L E
Sbjct: 1 MLMLLFLVFALSSAF--DMSIISYHQTHATKSSWRTDDEVMAMYEEWLVKHGKNYNALGE 58
Query: 65 KLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHE 124
K +RFEIFKDNL ID+ N + + Y +GLN FADL +EEF+ M+LG + +R ++ +
Sbjct: 59 KEKRFEIFKDNLMFIDQHNSENRTYTVGLNRFADLTNEEFRSMYLGTRTGHKKRLPKTSD 118
Query: 125 DFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQ 184
++ + LP SVDWRK+GAV VK+QG CGSCWAFST+AAVEGIN+IVTG+L +LSEQ
Sbjct: 119 RYAPRVGDSLPDSVDWRKEGAVAEVKDQGGCGSCWAFSTIAAVEGINKIVTGDLIALSEQ 178
Query: 185 ELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTING 244
EL+DCD +YN GCNGGLMDYAF++I++ GG+ E+DYPY+ +G C+ + ++VV+I+
Sbjct: 179 ELVDCDTSYNEGCNGGLMDYAFEFIINNGGIDTEDDYPYLGRDGRCDTYRKNAKVVSIDS 238
Query: 245 YHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGST 304
Y DVP+N E +L KA+ANQP+SVAIE GR+FQ Y+ GV+ G CGT LDHGVAAVGYG+
Sbjct: 239 YEDVPENDETALKKAVANQPVSVAIEGGGRNFQLYNSGVFTGECGTSLDHGVAAVGYGTE 298
Query: 305 RGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIKK 351
+G DY IV+NSWG WGE GYIRM+RN P G CGI SYPIKK
Sbjct: 299 KGKDYWIVRNSWGKSWGESGYIRMERNIASPTGKCGIAIEPSYPIKK 345
>gi|18141283|gb|AAL60579.1|AF454957_1 senescence-associated cysteine protease [Brassica oleracea]
Length = 460
Score = 389 bits (1000), Expect = e-106, Method: Compositional matrix adjust.
Identities = 195/353 (55%), Positives = 251/353 (71%), Gaps = 15/353 (4%)
Query: 9 TILISFCISFFIRSSFARDFSIVGY------SPEDLTSNDKLIDLFESWMSKFEKVYESL 62
TIL+ ++ I S+A D SI+ Y + E+ S+ ++ ++E+WM K K +S
Sbjct: 7 TILL---LAMMIGVSYAADMSIISYDEKHHITAENERSDAEVARIYEAWMEKHGKKAQSN 63
Query: 63 ----DEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARR 118
+EK +RFEIFKDNLR IDE N K +Y LGL FADL +EE++ ++LG K +R
Sbjct: 64 GLVGEEKDQRFEIFKDNLRFIDEHNNKNLSYKLGLTRFADLTNEEYRSIYLGAKSK--KR 121
Query: 119 KDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNL 178
++ + + + +P SVDWRK+GAV VK+QGSCGSCWAFST+ AVEGIN+IVTG+L
Sbjct: 122 VLKTSDRYQPRVGDAIPDSVDWRKEGAVAAVKDQGSCGSCWAFSTIGAVEGINKIVTGDL 181
Query: 179 ASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESE 238
SLSEQEL+DCD +YN GCNGGLMDYAF++I+ GG+ EEDYPY +G C+ T+ ++
Sbjct: 182 ISLSEQELVDCDTSYNQGCNGGLMDYAFEFIIKNGGIDTEEDYPYKAADGRCDQTRKNAK 241
Query: 239 VVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAA 298
VVTI+ Y DVP+N+E +L K LANQP+SVAIEA GR FQ YS GV+DG CGT+LDHGV A
Sbjct: 242 VVTIDAYEDVPENNEAALKKTLANQPISVAIEAGGRAFQLYSSGVFDGICGTELDHGVVA 301
Query: 299 VGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIKK 351
VGYG+ G DY IV+NSWG WGE GYI+M RN +P G CGI ASYPIKK
Sbjct: 302 VGYGTENGKDYWIVRNSWGGSWGESGYIKMARNIAEPTGKCGIAMEASYPIKK 354
>gi|118486542|gb|ABK95110.1| unknown [Populus trichocarpa]
Length = 465
Score = 389 bits (1000), Expect = e-106, Method: Compositional matrix adjust.
Identities = 187/350 (53%), Positives = 248/350 (70%), Gaps = 14/350 (4%)
Query: 2 ALSSQFKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYES 61
ALSS F +IS+ + +SS+ D D+++ ++E W+ K K Y +
Sbjct: 19 ALSSAFDMSIISYHQTHATKSSWRTD--------------DEVMAMYEEWLVKHGKNYNA 64
Query: 62 LDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQ 121
L EK +RFEIFKDNL ID+ N + + Y +GLN FADL +EEF+ M+LG + +R +
Sbjct: 65 LGEKEKRFEIFKDNLMFIDQHNSENRTYTVGLNRFADLTNEEFRSMYLGTRTGHKKRLPK 124
Query: 122 SHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASL 181
+ + ++ + LP SVDWRK+GAV VK+QG CGSCWAFST+AAVEGIN+IVTG+L +L
Sbjct: 125 TSDRYAPRVGDSLPDSVDWRKEGAVAEVKDQGGCGSCWAFSTIAAVEGINKIVTGDLIAL 184
Query: 182 SEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVT 241
SEQEL+DCD +YN GCNGGLMDYAF++I++ GG+ E+DYPY+ +G C+ + ++VV+
Sbjct: 185 SEQELVDCDTSYNEGCNGGLMDYAFEFIINNGGIDTEDDYPYLGRDGRCDTYRKNAKVVS 244
Query: 242 INGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGY 301
I+ Y DVP+N E +L KA+ANQP+SVAIE GR+FQ Y+ GV+ G CGT LDHGVAAVGY
Sbjct: 245 IDSYEDVPENDETALKKAVANQPVSVAIEGGGRNFQLYNSGVFTGECGTSLDHGVAAVGY 304
Query: 302 GSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIKK 351
G+ +G DY IV+NSWG WGE GYIRM+RN P G CGI SYPIKK
Sbjct: 305 GTEKGKDYWIVRNSWGKSWGESGYIRMERNIASPTGKCGIAIEPSYPIKK 354
>gi|386648112|gb|AFJ15103.1| mexicain-like cystein protease, partial [Jacaratia mexicana]
Length = 348
Score = 389 bits (998), Expect = e-105, Method: Compositional matrix adjust.
Identities = 191/350 (54%), Positives = 250/350 (71%), Gaps = 9/350 (2%)
Query: 2 ALSSQFKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYES 61
+ S K I ++ C+ + S A DFSIVGYS +DLTS ++LI LFESWM K ++VY +
Sbjct: 3 TICSISKLIFVATCLIVHVGLSSA-DFSIVGYSQDDLTSTERLIRLFESWMLKHDRVYNN 61
Query: 62 LDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLG-LKPDLARRKD 120
++EK+ RFEIFKDNL +IDETN+K +YWLGLNEF DL H+EFKE ++G + D +
Sbjct: 62 IEEKIHRFEIFKDNLMYIDETNKKNNSYWLGLNEFVDLTHDEFKEKYVGSIGEDFVTIEQ 121
Query: 121 QSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLAS 180
+ E+F YK VVD P+S+DWR KGAVT VK CGSCWAFSTVA VEGIN+IVTG L S
Sbjct: 122 SNDEEFPYKHVVDYPESIDWRDKGAVTPVK-PNPCGSCWAFSTVATVEGINKIVTGKLIS 180
Query: 181 LSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVV 240
LSEQEL+DCD ++GC GG + QY+V G+H E++YPY ++G C + + V
Sbjct: 181 LSEQELLDCDRR-SHGCKGGYQTTSLQYVVDN-GVHTEKEYPYEKKQGKCRAKEKKGTKV 238
Query: 241 TINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVG 300
I GY VP N E SL++A+ANQP+SV +E+ GR FQ Y GG+++G CGT+LDH V A+G
Sbjct: 239 QITGYKRVPANDEISLIQAIANQPVSVLLESKGRAFQLYKGGIFNGPCGTKLDHAVTAIG 298
Query: 301 YGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
YG T YI++KNSWGP WGEKGY+++KR +GK EG CG+ K + +P K
Sbjct: 299 YGKT----YILIKNSWGPNWGEKGYLKIKRASGKSEGTCGVYKSSYFPTK 344
>gi|386648114|gb|AFJ15104.1| mexicain-like cystein protease, partial [Jacaratia mexicana]
Length = 323
Score = 387 bits (994), Expect = e-105, Method: Compositional matrix adjust.
Identities = 186/325 (57%), Positives = 242/325 (74%), Gaps = 7/325 (2%)
Query: 27 DFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKI 86
DF+IVGYS +DLTS ++L+ LFESW + +K+Y+++DEK+ RFEIFKDNL +IDETN+K
Sbjct: 1 DFAIVGYSQDDLTSIERLVRLFESWTLENDKIYKNIDEKIYRFEIFKDNLMYIDETNKKN 60
Query: 87 KNYWLGLNEFADLRHEEFKEMFLG-LKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGA 145
+YWLGLNEFADL H+EFK ++G L D + E+F YK VVD P+S+DWR+KGA
Sbjct: 61 SSYWLGLNEFADLTHDEFKAKYVGSLGEDSTIIEQSDDEEFPYKHVVDYPESIDWRQKGA 120
Query: 146 VTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYA 205
VT VKNQ CGSCWAFSTVA VEGIN+IVTG L SLSEQEL+DCD ++GC GG +
Sbjct: 121 VTPVKNQNPCGSCWAFSTVATVEGINKIVTGKLISLSEQELLDCDRR-SHGCKGGYQTTS 179
Query: 206 FQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPL 265
QY V+ G+H E++YPY ++G C + V I GY VP N+E SL++A+ANQP+
Sbjct: 180 LQY-VADNGVHTEKEYPYEKKQGKCRAKDKKGSKVKITGYKRVPANNEVSLIQAIANQPV 238
Query: 266 SVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGY 325
SV +E+ GR FQFY GG+++G CGT++DH V AVGYG +YI++KNSWGPKWGEKGY
Sbjct: 239 SVVVESKGRAFQFYKGGIFEGPCGTKVDHAVTAVGYGK----NYILIKNSWGPKWGEKGY 294
Query: 326 IRMKRNTGKPEGLCGINKMASYPIK 350
IR+KR +GK +G CG+ + +P K
Sbjct: 295 IRIKRASGKSKGTCGVYSSSYFPTK 319
>gi|146216004|gb|ABQ10204.1| cysteine protease Cp6 [Actinidia deliciosa]
Length = 461
Score = 387 bits (994), Expect = e-105, Method: Compositional matrix adjust.
Identities = 190/326 (58%), Positives = 236/326 (72%), Gaps = 4/326 (1%)
Query: 27 DFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKI 86
D SI+G T +D+++ ++ESW+ K K Y ++ EK +RF+IFKDNLR IDE N +
Sbjct: 26 DMSIIGELSSSRT-DDEVMAMYESWLVKHGKSYNAIGEKEKRFQIFKDNLRFIDEHNAES 84
Query: 87 KNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVV--DLPKSVDWRKKG 144
+ Y +GLN FADL ++E++ M+LG + RR Y V LP SVDWR+KG
Sbjct: 85 RTYKVGLNRFADLTNDEYRSMYLGARTGSRRRLSTQKRSDRYVPVAGESLPDSVDWREKG 144
Query: 145 AVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDY 204
AV VK+QGSCGSCWAFST+AAVEGINQIVTG+L SLSEQEL+DCD +YN GCNGGLMDY
Sbjct: 145 AVVGVKDQGSCGSCWAFSTIAAVEGINQIVTGDLISLSEQELVDCDTSYNEGCNGGLMDY 204
Query: 205 AFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQP 264
AF++I+ GG+ EEDYPY +G C+ + ++VVTI+ Y DVP N+E +L KA+ANQP
Sbjct: 205 AFEFIIKNGGIDTEEDYPYNARDGRCDQYRKNAKVVTIDDYEDVPVNNEQALQKAVANQP 264
Query: 265 LSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKG 324
+SVAIEASG FQFY GV+ G+CGT LDHGV AVGYG+ +DY IVKNSWG WGE G
Sbjct: 265 VSVAIEASGMAFQFYESGVFTGNCGTALDHGVTAVGYGTENSVDYWIVKNSWGSSWGESG 324
Query: 325 YIRMKRNTGKPEGLCGINKMASYPIK 350
YIRM+RNTG G CGI SYPIK
Sbjct: 325 YIRMERNTGA-TGKCGIAVEPSYPIK 349
>gi|302816909|ref|XP_002990132.1| hypothetical protein SELMODRAFT_428615 [Selaginella moellendorffii]
gi|300142145|gb|EFJ08849.1| hypothetical protein SELMODRAFT_428615 [Selaginella moellendorffii]
Length = 358
Score = 386 bits (991), Expect = e-104, Method: Compositional matrix adjust.
Identities = 182/310 (58%), Positives = 223/310 (71%), Gaps = 9/310 (2%)
Query: 47 LFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKI-KNYWLGLNEFADLRHEEFK 105
L+E WM +VY + EK RF+IF+DN +I+E NR++ + YWLGLN FAD+ H+EFK
Sbjct: 33 LYEKWMVDHGRVYNGIGEKERRFQIFRDNAEYIEEHNRQVNQTYWLGLNNFADMTHDEFK 92
Query: 106 EMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVA 165
++ G K L+ + F Y+D +LP DWR KGAV VKNQG+CGSCWAFSTVA
Sbjct: 93 ALYFGTKVPLS---NTIKSGFRYEDATNLPLDTDWRSKGAVATVKNQGACGSCWAFSTVA 149
Query: 166 AVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIM 225
AVEG+NQIVTG L SLSEQEL+DCD N GCNGGLMD AF++I+ GGL E DYPY
Sbjct: 150 AVEGVNQIVTGELVSLSEQELVDCDKQKNQGCNGGLMDSAFEFIIQNGGLDSEADYPYKA 209
Query: 226 EEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYD 285
G+C+ ++ S VVTI+G+ DVP SE LLKA+ANQP+SVAIEASGR+FQ YSGGVY
Sbjct: 210 VSGSCDESRRNSHVVTIDGFEDVPAESEADLLKAVANQPVSVAIEASGRNFQLYSGGVYT 269
Query: 286 GHCGTQLDHGVAAVGYGSTR-----GLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCG 340
GHCG +LDHGV AVGYG+++ DY IV+NSWG WGE GYIR++RN G CG
Sbjct: 270 GHCGYELDHGVVAVGYGTSKTPDGVATDYWIVRNSWGDAWGESGYIRLQRNVASSRGKCG 329
Query: 341 INKMASYPIK 350
I MASYP+K
Sbjct: 330 IAMMASYPVK 339
>gi|18422289|ref|NP_568620.1| Granulin repeat cysteine protease family protein [Arabidopsis
thaliana]
gi|9757832|dbj|BAB08269.1| cysteine protease component of protease-inhibitor complex
[Arabidopsis thaliana]
gi|17065064|gb|AAL32686.1| cysteine protease component of protease-inhibitor complex
[Arabidopsis thaliana]
gi|21387153|gb|AAM47980.1| cysteine protease component of protease-inhibitor complex
[Arabidopsis thaliana]
gi|332007522|gb|AED94905.1| Granulin repeat cysteine protease family protein [Arabidopsis
thaliana]
Length = 463
Score = 384 bits (987), Expect = e-104, Method: Compositional matrix adjust.
Identities = 193/343 (56%), Positives = 243/343 (70%), Gaps = 12/343 (3%)
Query: 19 FIRSSFARDFSIVGY------SPEDLTSNDKLIDLFESWMSKFEKVYESLD----EKLER 68
I S+A D SI+ Y + E S+ ++ ++E+WM + K + + EK +R
Sbjct: 15 MIGVSYAMDMSIISYDENHHITTETSRSDSEVERIYEAWMVEHGKKKMNQNGLGAEKDQR 74
Query: 69 FEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSY 128
FEIFKDNLR IDE N K +Y LGL FADL +EE++ M+LG KP +R ++ + +
Sbjct: 75 FEIFKDNLRFIDEHNTKNLSYKLGLTRFADLTNEEYRSMYLGAKP--TKRVLKTSDRYQA 132
Query: 129 KDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELID 188
+ LP SVDWRK+GAV VK+QGSCGSCWAFST+ AVEGIN+IVTG+L SLSEQEL+D
Sbjct: 133 RVGDALPDSVDWRKEGAVADVKDQGSCGSCWAFSTIGAVEGINKIVTGDLISLSEQELVD 192
Query: 189 CDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDV 248
CD +YN GCNGGLMDYAF++I+ GG+ E DYPY +G C+ + ++VVTI+ Y DV
Sbjct: 193 CDTSYNQGCNGGLMDYAFEFIIKNGGIDTEADYPYKAADGRCDQNRKNAKVVTIDSYEDV 252
Query: 249 PQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLD 308
P+NSE SL KALA+QP+SVAIEA GR FQ YS GV+DG CGT+LDHGV AVGYG+ G D
Sbjct: 253 PENSEASLKKALAHQPISVAIEAGGRAFQLYSSGVFDGLCGTELDHGVVAVGYGTENGKD 312
Query: 309 YIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIKK 351
Y IV+NSWG +WGE GYI+M RN P G CGI ASYPIKK
Sbjct: 313 YWIVRNSWGNRWGESGYIKMARNIEAPTGKCGIAMEASYPIKK 355
>gi|46401612|dbj|BAD16614.1| cysteine proteinase [Dianthus caryophyllus]
Length = 459
Score = 384 bits (987), Expect = e-104, Method: Compositional matrix adjust.
Identities = 190/351 (54%), Positives = 246/351 (70%), Gaps = 6/351 (1%)
Query: 3 LSSQFKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESL 62
+S+ TI + F I FI SS A D SI+ + +D++ L+E+W+ K K Y L
Sbjct: 1 MSTSKSTIFLLFSI-IFIVSSSALDLSIIDRAFN--RPDDEIASLYETWLVKHGKNYNGL 57
Query: 63 DEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPD---LARRK 119
EK RF IFKDNLR +DE N + ++ LGLN FADL +EE++ ++LG +P +AR
Sbjct: 58 GEKQLRFNIFKDNLRFVDERNSENLSFKLGLNRFADLTNEEYRSVYLGTRPRSVAVARSG 117
Query: 120 DQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLA 179
+ ++++ LP+SVDWRKKGAV +K+QGSCGSCWAFS +AAVEG+NQIVTG+L
Sbjct: 118 RSKSDRYAFRAGDTLPESVDWRKKGAVAGIKDQGSCGSCWAFSAIAAVEGVNQIVTGDLI 177
Query: 180 SLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEV 239
SLSEQEL++CD +YN+GC+GGLMDYAF++I+ G+ +EDYPY +G C+ + ++V
Sbjct: 178 SLSEQELVECDTSYNDGCDGGLMDYAFEFIIKNEGIDSDEDYPYTGRDGRCDTNRKNAKV 237
Query: 240 VTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAV 299
VTI+ Y D P E SL KA+ANQP+SVAIE GRDFQ Y GV+ G CGT LDHGVA V
Sbjct: 238 VTIDDYEDSPVYDEKSLQKAVANQPVSVAIEGGGRDFQLYDSGVFTGKCGTALDHGVAVV 297
Query: 300 GYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
GYG+ GLDY IV+NSWG WGE GYIRM+RNT P G+CGI SYPIK
Sbjct: 298 GYGTEDGLDYWIVRNSWGDTWGEGGYIRMQRNTKLPSGICGIAIEPSYPIK 348
>gi|225428879|ref|XP_002285299.1| PREDICTED: cysteine proteinase RD21a-like [Vitis vinifera]
Length = 469
Score = 384 bits (985), Expect = e-104, Method: Compositional matrix adjust.
Identities = 184/328 (56%), Positives = 239/328 (72%), Gaps = 4/328 (1%)
Query: 27 DFSIVGYSPE-DLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRK 85
D SI+ Y + ++ +++ ++E+W+ K K Y +L E+ RFEIFKDNLR I+E N
Sbjct: 32 DMSIISYGDRLEKRTDAEVMAVYEAWLVKHGKSYNALGERERRFEIFKDNLRFIEEHNAV 91
Query: 86 IKNYWLGLNEFADLRHEEFKEMFLGLKPDLAR--RKDQSHEDFSYKDVVDLPKSVDWRKK 143
+ Y +GLN FADL +EE++ +LG + + R R + + +S++ DLP+SVDWR+K
Sbjct: 92 NRTYKVGLNRFADLTNEEYRSRYLGRRDETRRGLRASRVSDRYSFRAGEDLPESVDWREK 151
Query: 144 GAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMD 203
GAV VK+QG+CGSCWAFST+AAVEGINQI TG+L SLSEQEL+DCD +YN GCNGGLMD
Sbjct: 152 GAVVPVKDQGNCGSCWAFSTIAAVEGINQIATGDLISLSEQELVDCDKSYNQGCNGGLMD 211
Query: 204 YAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQ 263
YAF++I++ GG+ EEDYPY + TC+ + + VV+I+GY DVPQN E SL KA+ANQ
Sbjct: 212 YAFEFIINNGGIDSEEDYPYRAADTTCDPNRKNARVVSIDGYEDVPQNDERSLKKAVANQ 271
Query: 264 PLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEK 323
P+SVAIEA GR FQ Y GV+ G CGTQLDHGV AVGYG+ +DY IV+NSWGP WGE
Sbjct: 272 PVSVAIEAGGRAFQLYQSGVFTGQCGTQLDHGVVAVGYGTENSVDYWIVRNSWGPNWGES 331
Query: 324 GYIRMKRN-TGKPEGLCGINKMASYPIK 350
GYI+++RN G G CGI SYPIK
Sbjct: 332 GYIKLERNLAGTETGKCGIAIEPSYPIK 359
>gi|297791625|ref|XP_002863697.1| hypothetical protein ARALYDRAFT_917391 [Arabidopsis lyrata subsp.
lyrata]
gi|297309532|gb|EFH39956.1| hypothetical protein ARALYDRAFT_917391 [Arabidopsis lyrata subsp.
lyrata]
Length = 463
Score = 382 bits (982), Expect = e-104, Method: Compositional matrix adjust.
Identities = 192/343 (55%), Positives = 244/343 (71%), Gaps = 12/343 (3%)
Query: 19 FIRSSFARDFSIVGY------SPEDLTSNDKLIDLFESWMSKFEKVYESLD----EKLER 68
I S+A D SI+ Y S S+ ++ ++E+WM + K + + EK +R
Sbjct: 15 MIGVSYAIDMSIISYDENHHISTVSSRSDAEVERIYEAWMVEHGKKKMNQNGLGAEKDQR 74
Query: 69 FEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSY 128
FEIFKDNLR+IDE N K +Y LGL FADL ++E++ M+LG KP +R ++ + +
Sbjct: 75 FEIFKDNLRYIDEHNTKNLSYKLGLTRFADLTNDEYRSMYLGAKP--VKRVLKTSDRYEA 132
Query: 129 KDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELID 188
+ LP SVDWRK+GAV VK+QGSCGSCWAFST+ AVEGIN+IVTG+L SLSEQEL+D
Sbjct: 133 RVGDALPDSVDWRKEGAVADVKDQGSCGSCWAFSTIGAVEGINKIVTGDLISLSEQELVD 192
Query: 189 CDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDV 248
CD +YN GCNGGLMDYAF++I+ GG+ E DYPY +G C+ + ++VVTI+ Y DV
Sbjct: 193 CDTSYNQGCNGGLMDYAFEFIIKNGGIDTEADYPYKAADGRCDQNRKNAKVVTIDSYEDV 252
Query: 249 PQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLD 308
P+NSE SL KALA+QP+SVAIEA GR FQ YS GV+DG CGT+LDHGV AVGYG+ G D
Sbjct: 253 PENSEASLKKALAHQPISVAIEAGGRAFQLYSSGVFDGICGTELDHGVVAVGYGTENGKD 312
Query: 309 YIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIKK 351
Y IV+NSWG +WGE GYI+M RN +P G CGI ASYPIKK
Sbjct: 313 YWIVRNSWGNRWGESGYIKMARNIAEPTGKCGIAMEASYPIKK 355
>gi|374713651|gb|AEZ65083.1| cysteine protease [Carica papaya]
Length = 467
Score = 382 bits (981), Expect = e-103, Method: Compositional matrix adjust.
Identities = 193/352 (54%), Positives = 244/352 (69%), Gaps = 10/352 (2%)
Query: 8 KTILISFCISFFIRSSFARDFSIVGYSPE-----DLTSNDKLIDLFESWMSKFEKVYESL 62
++ L F + F SS A D SIV Y ++D+++ ++E+W+ K K Y +L
Sbjct: 5 RSSLSLFLLMIFTASS-AVDMSIVSYDQRHADKSSWRTDDEVMAMYEAWLVKHGKAYNAL 63
Query: 63 DEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPD---LARRK 119
EK +RF IFKDNLR IDE N + Y LGLN FADL +EE++ M+LG+KP + R+
Sbjct: 64 GEKEKRFGIFKDNLRFIDEHNSQNLTYRLGLNRFADLTNEEYRSMYLGVKPGATRVTRKV 123
Query: 120 DQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLA 179
+ + F+ + LP +DWRK+GAV VK+QGSCGSCWAFST+AAVEGINQIVTG+L
Sbjct: 124 SRKSDRFAARVGDALPDFIDWRKEGAVVGVKDQGSCGSCWAFSTIAAVEGINQIVTGDLI 183
Query: 180 SLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEV 239
SLSEQEL+DCD +YN GCNGGLMDYAF++I++ GG+ EEDYPY + C+ + + V
Sbjct: 184 SLSEQELVDCDTSYNEGCNGGLMDYAFEFIINNGGIDSEEDYPYRAADQKCDQYRKNANV 243
Query: 240 VTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAV 299
V+I+GY DVP+N E +L KA+A QP+SVAIEA GR FQ Y GV+ G CGT LDHGVAAV
Sbjct: 244 VSIDGYEDVPENDEAALKKAVAKQPVSVAIEAGGRAFQLYQSGVFTGKCGTSLDHGVAAV 303
Query: 300 GYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRN-TGKPEGLCGINKMASYPIK 350
GYG+ G DY IV NSWG WGE GYIRM+RN G G CGI SYPIK
Sbjct: 304 GYGTENGQDYWIVGNSWGKNWGEDGYIRMERNLAGSSSGKCGIAIGPSYPIK 355
>gi|50355611|dbj|BAD29954.1| cysteine protease [Daucus carota]
Length = 474
Score = 382 bits (980), Expect = e-103, Method: Compositional matrix adjust.
Identities = 194/361 (53%), Positives = 250/361 (69%), Gaps = 13/361 (3%)
Query: 2 ALSSQFKTILISFCISFFIRSSFARDFSIVGY-------SPEDLTSNDKLIDLFESWMSK 54
+S + + +++ F ++ F+ S A D SI+ Y SP L ++D+L+ L+ESW+ K
Sbjct: 8 TMSPRPQCLVLFFSLASFLMLSSASDMSIITYDETHGLNSPP-LRTHDQLLSLYESWLVK 66
Query: 55 FEKVYESLDEKLERFEIFKDNLRHIDETNR-KIKNYWLGLNEFADLRHEEFKEMFLGLKP 113
K Y +L EK RF IFKDN+ +D N + ++Y LGLN+FADL ++E++ ++L K
Sbjct: 67 HHKNYNALGEKETRFGIFKDNVGFVDRHNSMRNQSYKLGLNKFADLTNDEYRSLYLSGKM 126
Query: 114 DLARRKDQS---HEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGI 170
RK++ + F ++D LP+SVDWR +GAV VK+QG CGSCWAFSTV AVEGI
Sbjct: 127 MKRERKNEDGFRSDRFVFEDGDHLPESVDWRDRGAVAPVKDQGQCGSCWAFSTVGAVEGI 186
Query: 171 NQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTC 230
N+IVTG L SLSEQEL+DCDN YN GCNGGLMDYAF++IV GG+ E+DYPY +G C
Sbjct: 187 NKIVTGELISLSEQELVDCDNGYNQGCNGGLMDYAFEFIVKNGGIDTEDDYPYKGVDGLC 246
Query: 231 EMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGT 290
+ + ++VVTINGY DVP N E SL KA+A+QP+SVAIEA GR FQ Y GV+ G CGT
Sbjct: 247 DQNRKNAKVVTINGYEDVPHNDEKSLKKAVAHQPVSVAIEAGGRAFQLYESGVFTGQCGT 306
Query: 291 QLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPE-GLCGINKMASYPI 349
+LDHGV AVGYGS G DY IV+NSWGP WGE GYIR++RN G CGI ASYP
Sbjct: 307 ELDHGVVAVGYGSENGKDYWIVRNSWGPDWGESGYIRLERNVASTSTGKCGIAMQASYPT 366
Query: 350 K 350
K
Sbjct: 367 K 367
>gi|168017893|ref|XP_001761481.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162687165|gb|EDQ73549.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 471
Score = 381 bits (978), Expect = e-103, Method: Compositional matrix adjust.
Identities = 186/334 (55%), Positives = 240/334 (71%), Gaps = 4/334 (1%)
Query: 21 RSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHID 80
+ + R +I+ Y +L S+D ++D+F W+ + +VY SL EK RF+IFKDNL +I
Sbjct: 25 QGNVGRADAIMDYEAHELHSDDGMLDVFHQWLERHSRVYHSLSEKQRRFQIFKDNLHYIH 84
Query: 81 ETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDW 140
N++ K+YWLGLN+F+DL H+EF+ ++LG++P ++ + F Y+DVV + VDW
Sbjct: 85 NHNKQEKSYWLGLNKFSDLTHDEFRALYLGIRPAGRAHGLRNGDRFIYEDVV-AEEMVDW 143
Query: 141 RKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGG 200
RKKGAV+ VK+QGSCGSCWAFS + +VEG+N IVTG L SLSEQEL+DCD N GCNGG
Sbjct: 144 RKKGAVSDVKDQGSCGSCWAFSAIGSVEGVNAIVTGELISLSEQELVDCDRGQNQGCNGG 203
Query: 201 LMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGE-SEVVTINGYHDVPQNSEDSLLKA 259
LMDYAF +I+ GG+ EEDYPY +G C+ + E S+VV I+ Y DVP SE SLLKA
Sbjct: 204 LMDYAFDFIIKNGGIDTEEDYPYKATDGQCDEARKETSKVVVIDDYQDVPTKSESSLLKA 263
Query: 260 LANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGS-TRGLDYIIVKNSWGP 318
++ P+SVAIEA GRDFQ Y GGV+ G CGT LDHGV AVGYG+ G++Y IVKNSWGP
Sbjct: 264 VSKNPVSVAIEAGGRDFQHYQGGVFTGPCGTDLDHGVLAVGYGTDDDGVNYWIVKNSWGP 323
Query: 319 KWGEKGYIRMKR-NTGKPEGLCGINKMASYPIKK 351
WGEKGYIRM+R + G CGIN S+PIKK
Sbjct: 324 SWGEKGYIRMERMGSNSTSGKCGINIEPSFPIKK 357
>gi|113120269|gb|ABI30274.1| VS-B, partial [Vasconcellea stipulata]
Length = 341
Score = 379 bits (974), Expect = e-103, Method: Compositional matrix adjust.
Identities = 190/330 (57%), Positives = 242/330 (73%), Gaps = 8/330 (2%)
Query: 8 KTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLE 67
K I + C+S + S A DFSIVGYS +DLTS + I LFESWM K +KVY+++DEK+
Sbjct: 9 KLIFVVTCLSLHLGLSSA-DFSIVGYSQDDLTSIESSIRLFESWMLKHDKVYKTIDEKIY 67
Query: 68 RFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHE-DF 126
RFE FKDNL +IDETN+K +YWLGLNEFADL H+EFKE ++G P+ + +QS + +F
Sbjct: 68 RFETFKDNLMYIDETNKKNNSYWLGLNEFADLTHDEFKEKYVGSIPEDSMIIEQSDDVEF 127
Query: 127 SYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQEL 186
K VVD P+S+DWR+KGAVT VKNQ CGSCWAFSTVA VEGIN+IVTGNL SLSEQEL
Sbjct: 128 PNKHVVDYPESIDWRQKGAVTPVKNQNPCGSCWAFSTVATVEGINKIVTGNLISLSEQEL 187
Query: 187 IDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYH 246
+DCD ++GC GG + +Y+V G+H E++YPY ++G C + V INGY
Sbjct: 188 LDCDRR-SHGCKGGYQTTSLKYVVDN-GVHTEKEYPYEKKQGNCRAKNKKGLKVYINGYK 245
Query: 247 DVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRG 306
VP N E SL+K ++ QP+SV +E+ GR FQFY GGV+ G CGT+LDH V AVGYG
Sbjct: 246 RVPSNDEISLIKTISIQPVSVLVESKGRPFQFYKGGVFGGPCGTKLDHAVTAVGYGK--- 302
Query: 307 LDYIIVKNSWGPKWGEKGYIRMKRNTGKPE 336
DYI++KNSWGPKWG+KGYI++KR +G+ E
Sbjct: 303 -DYILIKNSWGPKWGDKGYIKIKRASGQSE 331
>gi|255555337|ref|XP_002518705.1| cysteine protease, putative [Ricinus communis]
gi|223542086|gb|EEF43630.1| cysteine protease, putative [Ricinus communis]
Length = 471
Score = 379 bits (974), Expect = e-103, Method: Compositional matrix adjust.
Identities = 188/332 (56%), Positives = 239/332 (71%), Gaps = 7/332 (2%)
Query: 27 DFSIVGYSPE-----DLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDE 81
D SIV Y+ + L ++ ++ ++E W+ + K Y +L EK +RFEIFKDNLR IDE
Sbjct: 25 DMSIVDYNIKHGTKYPLRTDSQVRRMYEMWLVEHGKAYNALGEKEKRFEIFKDNLRFIDE 84
Query: 82 TNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARR-KDQSHEDFSYKDVVDLPKSVDW 140
N ++Y +GLN FADL +EE+K MFLG K + R + + +KD DLP++VDW
Sbjct: 85 HNSVDRSYKVGLNRFADLTNEEYKAMFLGTKMERKNRFLGTRSQRYLFKDGDDLPENVDW 144
Query: 141 RKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGG 200
R+KGAV VK+QG CGSCWAFSTV AVEGINQIVTG L SLSEQEL+DCD +YN GCNGG
Sbjct: 145 REKGAVVPVKDQGQCGSCWAFSTVGAVEGINQIVTGELISLSEQELVDCDKSYNQGCNGG 204
Query: 201 LMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKAL 260
LMDYAF++I++ GG+ EEDYPY + C+ + ++VVTI+GY DVP+N E+SL KA+
Sbjct: 205 LMDYAFEFIINNGGIDTEEDYPYKASDNICDPNRKNAKVVTIDGYEDVPENDENSLKKAV 264
Query: 261 ANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKW 320
A+QP+SVAIEA GR FQ Y GV+ G CGT+LDHGV AVGYG+ G++Y IV+NSWG W
Sbjct: 265 AHQPVSVAIEAGGRAFQLYKSGVFTGRCGTELDHGVVAVGYGTENGVNYWIVRNSWGSAW 324
Query: 321 GEKGYIRMKRNTGKPE-GLCGINKMASYPIKK 351
GE GYIRM+RN + G CGI SYP KK
Sbjct: 325 GESGYIRMERNVANTKTGKCGIAIQPSYPTKK 356
>gi|171702831|dbj|BAG16371.1| cysteine protease [Brassica oleracea var. italica]
Length = 441
Score = 379 bits (974), Expect = e-102, Method: Compositional matrix adjust.
Identities = 190/336 (56%), Positives = 237/336 (70%), Gaps = 6/336 (1%)
Query: 19 FIRSSFARDFSIVGYSPEDLT----SNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKD 74
I S A D SI+ Y T S+ ++ L+E W+ K K SL EK RFEIFKD
Sbjct: 9 MIVVSSAMDMSIISYDKNHHTVSSRSDAEVSRLYEEWLVKHGKAQNSLTEKDRRFEIFKD 68
Query: 75 NLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVDL 134
NLR IDE N K +Y LGL +FADL ++E++ M+LG + L R+ +S + + +
Sbjct: 69 NLRFIDEHNGKNLSYRLGLTKFADLTNDEYRSMYLGSR--LKRKATKSSLRYEVRVGDAI 126
Query: 135 PKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYN 194
P+SVDWRK+GAV VK+QGSCGSCWAFST+ AVEGIN+IVTG+L +LSEQEL+DCD +YN
Sbjct: 127 PESVDWRKEGAVAEVKDQGSCGSCWAFSTIGAVEGINKIVTGDLITLSEQELVDCDTSYN 186
Query: 195 NGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSED 254
GCNGGLMDYAF++I++ GG+ EEDYPY +G C+ T+ ++VVTI+ Y DVP NSE+
Sbjct: 187 EGCNGGLMDYAFEFIINNGGIDTEEDYPYKGVDGRCDQTRKNAKVVTIDLYEDVPANSEE 246
Query: 255 SLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKN 314
SL KAL++QP+SVAIE GR FQ Y G++DG CGT LDHGV AVGYG+ G DY IVKN
Sbjct: 247 SLKKALSHQPISVAIEGGGRAFQLYDSGIFDGICGTDLDHGVVAVGYGTENGKDYWIVKN 306
Query: 315 SWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
SWG WGE GYIRM+RN G CGI SYPIK
Sbjct: 307 SWGTSWGESGYIRMERNIASSAGKCGIAVEPSYPIK 342
>gi|171702843|dbj|BAG16377.1| cysteine protease [Brassica rapa var. perviridis]
Length = 431
Score = 379 bits (973), Expect = e-102, Method: Compositional matrix adjust.
Identities = 190/336 (56%), Positives = 236/336 (70%), Gaps = 6/336 (1%)
Query: 19 FIRSSFARDFSIVGYSPEDLT----SNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKD 74
I S A D SI+ Y T S+ ++ L+E W+ K K SL EK RFEIFKD
Sbjct: 9 MIVVSSAMDMSIISYDKNHHTVSSRSDVEVSRLYEEWVVKHGKAQNSLTEKDRRFEIFKD 68
Query: 75 NLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVDL 134
NLR IDE N K +Y LGL +FADL ++E++ M+LG + L R+ ++ + + +
Sbjct: 69 NLRFIDEHNGKNLSYRLGLTKFADLTNDEYRSMYLGSR--LKRKATKTSLRYEARVGDAI 126
Query: 135 PKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYN 194
P+SVDWRK+GAV VK+QGSCGSCWAFST+ AVEGIN+IVTG+L SLSEQEL+DCD +YN
Sbjct: 127 PESVDWRKEGAVAEVKDQGSCGSCWAFSTIGAVEGINKIVTGDLISLSEQELVDCDTSYN 186
Query: 195 NGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSED 254
GCNGGLMDYAF++I+ GG+ EEDYPY +G C+ T+ ++VVTI+ Y DVP NSE+
Sbjct: 187 EGCNGGLMDYAFEFIIKNGGIDTEEDYPYKGVDGRCDQTRKNAKVVTIDSYEDVPANSEE 246
Query: 255 SLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKN 314
SL KAL++QP+SVAIE GR FQ Y G++DG CGT LDHGV AVGYG+ G DY IVKN
Sbjct: 247 SLKKALSHQPISVAIEGGGRAFQLYDSGIFDGICGTDLDHGVVAVGYGTENGKDYWIVKN 306
Query: 315 SWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
SWG WGE GYIRM+RN G CGI SYPIK
Sbjct: 307 SWGTSWGESGYIRMERNIASSAGKCGIAVEPSYPIK 342
>gi|124484387|dbj|BAF46304.1| cysteine proteinase precursor [Ipomoea nil]
Length = 474
Score = 379 bits (973), Expect = e-102, Method: Compositional matrix adjust.
Identities = 189/334 (56%), Positives = 242/334 (72%), Gaps = 10/334 (2%)
Query: 27 DFSIVGYSPED-----LTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDE 81
D SI+ Y + + S D++ ++FESW+ K K Y ++DEK +RF+IF+DNL++IDE
Sbjct: 24 DMSIITYDQQHPAKGLVRSEDEVKEMFESWLVKHGKSYNAVDEKDKRFKIFRDNLKYIDE 83
Query: 82 TNR-KIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVV--DLPKSV 138
N + ++Y LGLN FAD+ +EE++ +LG K D +R +S D Y V LP S+
Sbjct: 84 KNSLENRSYKLGLNRFADITNEEYRTGYLGAKRDASRNMVKSKSD-RYAPVAGDSLPDSI 142
Query: 139 DWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCN 198
DWR+KGAVT VK+QGSCGSCWAFST+AAVEG+NQ+ TGNL SLSEQEL+DCD N GCN
Sbjct: 143 DWREKGAVTGVKDQGSCGSCWAFSTIAAVEGVNQLATGNLISLSEQELVDCDRKINQGCN 202
Query: 199 GGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCE-MTKGESEVVTINGYHDVPQNSEDSLL 257
GG M YAFQ+I+ GG+ EEDYPY ++G C+ + ++V +I+GY +VP N+E SL
Sbjct: 203 GGDMGYAFQFIIKNGGIDSEEDYPYTGKDGKCDSYRQNNAKVASIDGYEEVPVNNEKSLQ 262
Query: 258 KALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWG 317
KA+ANQP+SVAIEA G DFQ YS G++ G CGT LDHGVAAVGYG+ G+DY IVKNSWG
Sbjct: 263 KAVANQPVSVAIEAGGYDFQLYSSGIFTGSCGTDLDHGVAAVGYGTENGVDYWIVKNSWG 322
Query: 318 PKWGEKGYIRMKRNTGKPEGLCGINKMASYPIKK 351
WGEKGY+RM+RN GLCGI ASYP KK
Sbjct: 323 DYWGEKGYVRMQRNVKAKTGLCGIAMEASYPTKK 356
>gi|18141285|gb|AAL60580.1|AF454958_1 senescence-associated cysteine protease [Brassica oleracea]
Length = 485
Score = 379 bits (973), Expect = e-102, Method: Compositional matrix adjust.
Identities = 190/336 (56%), Positives = 237/336 (70%), Gaps = 6/336 (1%)
Query: 19 FIRSSFARDFSIVGYSPEDLT----SNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKD 74
I S A D SI+ Y T S+ ++ L+E W+ K K SL EK RFEIFKD
Sbjct: 15 MIVVSSAMDMSIISYDKNHHTVSSRSDAEVSRLYEEWLVKHGKAQNSLTEKDRRFEIFKD 74
Query: 75 NLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVDL 134
NLR IDE N K +Y LGL +FADL ++E++ M+LG + L R+ +S + + +
Sbjct: 75 NLRFIDEHNGKNLSYRLGLTKFADLTNDEYRSMYLGSR--LKRKATKSSLRYEVRVGDAI 132
Query: 135 PKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYN 194
P+SVDWRK+GAV VK+QGSCGSCWAFST+ AVEGIN+IVTG+L +LSEQEL+DCD +YN
Sbjct: 133 PESVDWRKEGAVAEVKDQGSCGSCWAFSTIGAVEGINKIVTGDLITLSEQELVDCDTSYN 192
Query: 195 NGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSED 254
GCNGGLMDYAF++I++ GG+ EEDYPY +G C+ T+ ++VVTI+ Y DVP NSE+
Sbjct: 193 EGCNGGLMDYAFEFIINNGGIDTEEDYPYKGVDGRCDQTRKNAKVVTIDLYEDVPANSEE 252
Query: 255 SLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKN 314
SL KAL++QP+SVAIE GR FQ Y G++DG CGT LDHGV AVGYG+ G DY IVKN
Sbjct: 253 SLKKALSHQPISVAIEGGGRAFQLYDSGIFDGICGTDLDHGVVAVGYGTENGKDYWIVKN 312
Query: 315 SWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
SWG WGE GYIRM+RN G CGI SYPIK
Sbjct: 313 SWGTSWGESGYIRMERNIASSAGKCGIAVEPSYPIK 348
>gi|168057475|ref|XP_001780740.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162667829|gb|EDQ54449.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 463
Score = 379 bits (972), Expect = e-102, Method: Compositional matrix adjust.
Identities = 187/343 (54%), Positives = 241/343 (70%), Gaps = 6/343 (1%)
Query: 11 LISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFE 70
L+ I+ ++ R +IV Y L S+D ++D+F W+ +VY SL EK RF+
Sbjct: 12 LVLLVIAIGQQADAGRANAIVDYEGNQLHSDDAILDVFHQWLETHSRVYRSLSEKHHRFQ 71
Query: 71 IFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKD 130
IFK+N +I N++ K+YWLGLN+F+DL H+EF+ +LG KP +RK+ +F Y+D
Sbjct: 72 IFKENFLYIHAHNKQQKSYWLGLNKFSDLTHQEFRAQYLGTKPVNRQRKE---ANFMYED 128
Query: 131 VVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCD 190
V PK VDWR KGAVT VK+QG+CGSCWAFS V +VEG+N I TG L SLSEQEL+DCD
Sbjct: 129 VEAEPK-VDWRLKGAVTDVKDQGACGSCWAFSAVGSVEGVNAIKTGELVSLSEQELVDCD 187
Query: 191 NTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQ 250
N GCNGGLMDYAF++I+ GG+ E+DYPY +G C+ + S+VV I+ Y DVP
Sbjct: 188 RKQNQGCNGGLMDYAFEFIIKNGGIDTEKDYPYKARDGRCDEGRRNSKVVVIDDYQDVPT 247
Query: 251 NSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGS-TRGLDY 309
SE +L+KAL P+SVAIEA GRDFQ Y GGV+ G CG++LDHGV AVGYG+ G++Y
Sbjct: 248 QSESALMKALTKNPVSVAIEAGGRDFQHYQGGVFTGPCGSELDHGVLAVGYGTDDDGVNY 307
Query: 310 IIVKNSWGPKWGEKGYIRMKR-NTGKPEGLCGINKMASYPIKK 351
IVKNSWGP WGEKGYIRM+R + +G CGIN AS+PIKK
Sbjct: 308 WIVKNSWGPGWGEKGYIRMERFGSDSTDGKCGINIEASFPIKK 350
>gi|374713649|gb|AEZ65082.1| cysteine protease [Carica papaya]
Length = 471
Score = 377 bits (969), Expect = e-102, Method: Compositional matrix adjust.
Identities = 191/353 (54%), Positives = 244/353 (69%), Gaps = 11/353 (3%)
Query: 10 ILISFCISFFIRSSFARDFSIVGYS-PED-LTSNDK----LIDLFESWMSKFEKVYESLD 63
+ I+ F + S SI+ Y P D L S ++ ++ ++E W+ K K Y ++
Sbjct: 8 LCIAISFLFMVFSLSLASMSIIDYDLPADPLQSTERTEAHMMKMYEHWLVKHGKNYNAIG 67
Query: 64 EKLERFEIFKDNLRHIDETNRKI-KNYWLGLNEFADLRHEEFKEMFLGLKPDLARR-KDQ 121
EK RFEIFKDNLR +DE N + Y LGL +FADL +EE++ M+LG K + + + +
Sbjct: 68 EKERRFEIFKDNLRFVDEQNSVPGRTYKLGLTKFADLTNEEYRAMYLGAKMEKKEKLRTE 127
Query: 122 SHEDFSYK--DVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLA 179
+ + +K + DLP VDWR+KGAVT VK+QG CGSCWAFSTV +VEGINQIVTG+L
Sbjct: 128 RSQRYLHKAGNDDDLPSHVDWREKGAVTEVKDQGQCGSCWAFSTVGSVEGINQIVTGDLI 187
Query: 180 SLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEV 239
SLSEQEL+DCD YN GCNGGLMDYAF++I+ GG+ E DYPY + C+ + + V
Sbjct: 188 SLSEQELVDCDKAYNQGCNGGLMDYAFEFIIKNGGIDSEADYPYRASDNMCDSNRKNAHV 247
Query: 240 VTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAV 299
VTI+GY DVP+N E+SL KA+ANQP+SVAIEA GR+FQ Y GV+ G CGT LDHGV AV
Sbjct: 248 VTIDGYEDVPENDEESLKKAVANQPVSVAIEAGGREFQLYQSGVFTGRCGTNLDHGVVAV 307
Query: 300 GYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPE-GLCGINKMASYPIKK 351
GYG+ G+DY IV+NSWGPKWGE GYIRM+RN + G CGI ASYP KK
Sbjct: 308 GYGTENGIDYWIVRNSWGPKWGESGYIRMERNVASTDTGKCGIAMEASYPTKK 360
>gi|30141019|dbj|BAC75923.1| cysteine protease-1 [Helianthus annuus]
Length = 461
Score = 377 bits (967), Expect = e-102, Method: Compositional matrix adjust.
Identities = 185/348 (53%), Positives = 241/348 (69%), Gaps = 13/348 (3%)
Query: 16 ISFFIRSSF--ARDFSIVGYSP---------EDLTSNDKLIDLFESWMSKFEKVYESLDE 64
+SFF S A D SI+ Y L ++D++ L+ESW+ K K Y +L E
Sbjct: 9 LSFFALISIISAMDMSIINYDATHMSSSSSSAPLRTDDEVNALYESWLVKHGKTYNALGE 68
Query: 65 KLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKP--DLARRKDQS 122
K RF+IFKDNLR IDE N Y LGLN+FADL +EE++ + G+K D +
Sbjct: 69 KDRRFQIFKDNLRFIDEHNSGDHTYKLGLNKFADLTNEEYRMTYTGIKTIDDKKKLSKMK 128
Query: 123 HEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLS 182
+ ++Y+ LP+ VDWR++GAVT VK+QGSCGSCWAFST +VEG+N+IVTG+L S+S
Sbjct: 129 SDRYAYRSGDSLPEYVDWREQGAVTDVKDQGSCGSCWAFSTTGSVEGVNKIVTGDLISVS 188
Query: 183 EQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTI 242
EQEL++CD +YN GCNGGLMDYAF++I+ GG+ EEDYPY ++G C+ K ++VVTI
Sbjct: 189 EQELVNCDTSYNQGCNGGLMDYAFEFIIKNGGIDTEEDYPYTGKDGKCDKNKKNAKVVTI 248
Query: 243 NGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYG 302
+ Y DVP N E SL KA++NQP++VAIEA GRDFQFY+ G++ G CGT LDHGV A GYG
Sbjct: 249 DSYEDVPVNDESSLKKAVSNQPVAVAIEAGGRDFQFYTSGIFTGSCGTALDHGVLAAGYG 308
Query: 303 STRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
+ G DY +VKNSWG +WGE GY++M+RN G CGI ASYPIK
Sbjct: 309 TEDGKDYWLVKNSWGAEWGEGGYLKMERNIADKSGKCGIAMEASYPIK 356
>gi|535473|emb|CAA53377.1| cysteine protease [Vicia sativa]
Length = 368
Score = 377 bits (967), Expect = e-102, Method: Compositional matrix adjust.
Identities = 185/347 (53%), Positives = 241/347 (69%), Gaps = 11/347 (3%)
Query: 9 TILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLER 68
TIL F I S A D + P SND+++ ++E W+ K +KVY L EK +R
Sbjct: 5 TILPFFLFFSLITFSLALDIQL----PTG-RSNDEVMTMYEEWLVKHQKVYNGLREKDQR 59
Query: 69 FEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARR----KDQSHE 124
F+IFKDNL IDE N + Y +GLN+FAD+ +EE+++M+LG + D+ RR K H
Sbjct: 60 FQIFKDNLNFIDEHNAQNYTYIVGLNKFADMTNEEYRDMYLGTRSDIKRRIMKNKITGHR 119
Query: 125 DFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQ 184
++Y LP VDWR KGA+TH+K+QGSCGSCWAFST+A VE IN+IVTG L SLSEQ
Sbjct: 120 -YAYNSGDRLPVHVDWRLKGAITHIKDQGSCGSCWAFSTIATVEAINKIVTGKLVSLSEQ 178
Query: 185 ELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTING 244
EL+DCD +N GCNGGLMDYAF++I+ GG+ ++ YPY EG C+ T+ ++++V+I+G
Sbjct: 179 ELVDCDRAFNEGCNGGLMDYAFEFIIGNGGIDTDQHYPYKGFEGRCDPTRKKAKIVSIDG 238
Query: 245 YHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGST 304
Y DVP N+E++L KA+A+QP+SVAIEASGR Q Y GV+ G CGT LDH V VGYGS
Sbjct: 239 YEDVPSNNENALKKAVAHQPVSVAIEASGRALQLYQSGVFTGKCGTSLDHAVVIVGYGSE 298
Query: 305 RGLDYIIVKNSWGPKWGEKGYIRMKRNT-GKPEGLCGINKMASYPIK 350
GLDY +V+NSWG WGE GY +M+RN G G CGI ASYP+K
Sbjct: 299 NGLDYWLVRNSWGTNWGEDGYFKMERNVKGTHTGKCGIAVEASYPVK 345
>gi|13897890|gb|AAK48495.1|AF259983_1 putative cysteine protease [Ipomoea batatas]
Length = 462
Score = 376 bits (966), Expect = e-102, Method: Compositional matrix adjust.
Identities = 185/315 (58%), Positives = 230/315 (73%), Gaps = 4/315 (1%)
Query: 40 SNDKLIDLFESWMSKFEKVYESLD-EKLERFEIFKDNLRHIDETN-RKIKNYWLGLNEFA 97
S+++++ L+ESW+ + K Y L EK +RFEIFKDNLR+IDE N R ++Y LGLN FA
Sbjct: 41 SDEEVMALYESWLVEHGKSYNGLGGEKDKRFEIFKDNLRYIDEQNSRGDRSYKLGLNRFA 100
Query: 98 DLRHEEFKEMFLGLKPDLARR--KDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSC 155
DL +EE++ +LG K D RR K +S ++ K LP S+DWR+KGAV VK+QGSC
Sbjct: 101 DLTNEEYRSTYLGAKTDARRRIAKTKSDRRYAPKAGGSLPDSIDWREKGAVAEVKDQGSC 160
Query: 156 GSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGL 215
GSCWAFST+AAVEGINQIVTG L SLSEQEL+DCD +YN GCNGGLMDYAF++I+ GG+
Sbjct: 161 GSCWAFSTIAAVEGINQIVTGELISLSEQELVDCDTSYNEGCNGGLMDYAFEFIIKNGGI 220
Query: 216 HKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRD 275
E DYPY G C+ T+ ++VV+I+GY DV E +L +A+A QP+SVAIEA GRD
Sbjct: 221 DTEADYPYTGRYGRCDQTRKNAKVVSIDGYEDVTPYDEAALKEAVAGQPVSVAIEAGGRD 280
Query: 276 FQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKP 335
FQ YS G++ G CGT LDHGV AVGYG+ G+DY IVKNSW WGEKGY+RM+RN
Sbjct: 281 FQLYSSGIFTGSCGTDLDHGVTAVGYGTENGVDYWIVKNSWAASWGEKGYLRMQRNVKDK 340
Query: 336 EGLCGINKMASYPIK 350
GLCGI SYP K
Sbjct: 341 NGLCGIAIEPSYPTK 355
>gi|116781957|gb|ABK22314.1| unknown [Picea sitchensis]
Length = 369
Score = 376 bits (966), Expect = e-102, Method: Compositional matrix adjust.
Identities = 190/338 (56%), Positives = 244/338 (72%), Gaps = 8/338 (2%)
Query: 19 FIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLD--EKLERFEIFKDNL 76
++ S+ A DF+ G++ EDL S L L+++W + + SLD E ERFEIFK+N+
Sbjct: 18 WVLSASASDFT-PGFTDEDLESEKSLRSLYDNWALQ-HRSSRSLDSEEHAERFEIFKENV 75
Query: 77 RHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPK 136
++ID N+K Y LGLN+FADL +EEFK +++G K DL ++ F Y++ LP
Sbjct: 76 KYIDSVNKKDSPYKLGLNKFADLSNEEFKAIYMGTKMDLRGDREVQSGSFMYQNSEPLPA 135
Query: 137 SVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNG 196
S+DWR+KGAV VKNQG CGSCWAFSTVA+VEGIN I TGNL SLSEQ+L+DC +T N+G
Sbjct: 136 SIDWRQKGAVAAVKNQGHCGSCWAFSTVASVEGINYITTGNLVSLSEQQLVDC-STENSG 194
Query: 197 CNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEV--VTINGYHDVPQNSED 254
CNGGLMD AFQYI++ GG+ E++YPY E C TK S+ V I+G+ DVP N+E
Sbjct: 195 CNGGLMDTAFQYIINNGGIVTEDNYPYTAEATECSSTKINSQTTRVVIDGFEDVPANNEQ 254
Query: 255 SLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYG-STRGLDYIIVK 313
+L +A+A+QP+SVAIEASG+DFQFYS GV+ G CGT LDHGV AVGYG S G++Y IV+
Sbjct: 255 ALKEAVAHQPVSVAIEASGQDFQFYSTGVFTGKCGTALDHGVVAVGYGTSPEGINYWIVR 314
Query: 314 NSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIKK 351
NSWGPKWGE+GYIRM++ EG CGI ASYP KK
Sbjct: 315 NSWGPKWGEEGYIRMQQGIEAAEGKCGIAMQASYPTKK 352
>gi|148927382|gb|ABR19827.1| cysteine proteinase [Elaeis guineensis]
Length = 470
Score = 376 bits (965), Expect = e-102, Method: Compositional matrix adjust.
Identities = 184/335 (54%), Positives = 233/335 (69%), Gaps = 10/335 (2%)
Query: 27 DFSIVGYSPEDLT-----SNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDE 81
D SI+ Y S +++ L+E W++K + Y +L EK RFEIFKDN+ ID
Sbjct: 24 DMSIISYDEAHGVRGLERSEEEMRILYEGWLAKHGRAYNALGEKERRFEIFKDNVLFIDA 83
Query: 82 TNRKI----KNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHED-FSYKDVVDLPK 136
N +++ LGLN FAD+ +EE++ ++LG +P RR+ + D + Y DLP+
Sbjct: 84 HNAAADAGHRSFRLGLNRFADMTNEEYRAVYLGTRPAGHRRRARVGSDRYRYNAGEDLPE 143
Query: 137 SVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNG 196
SVDWR KGAV VK+QGSCGSCWAFSTVAAVEGIN+IVTG+L SLSEQEL+DCDN YN G
Sbjct: 144 SVDWRAKGAVAAVKDQGSCGSCWAFSTVAAVEGINKIVTGDLISLSEQELVDCDNGYNQG 203
Query: 197 CNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSL 256
CNGGLMDY F++I++ GG+ EEDYPY +G C+ + ++VV+I+GY DVP N E +L
Sbjct: 204 CNGGLMDYGFEFIINNGGIDTEEDYPYTARDGKCDQYRKNAKVVSIDGYEDVPVNDEKAL 263
Query: 257 LKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSW 316
KA+ANQP+SVAIEA GR+FQ Y G++ G CGT LDHGV AVGYG+ G DY IV+NSW
Sbjct: 264 QKAVANQPVSVAIEAGGREFQLYHSGIFTGRCGTDLDHGVVAVGYGTENGKDYWIVRNSW 323
Query: 317 GPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIKK 351
G WGE GYIRM+RN G CGI SYP KK
Sbjct: 324 GGDWGESGYIRMERNVNTSTGKCGIAIEPSYPTKK 358
>gi|224056176|ref|XP_002298740.1| predicted protein [Populus trichocarpa]
gi|222845998|gb|EEE83545.1| predicted protein [Populus trichocarpa]
Length = 455
Score = 375 bits (964), Expect = e-101, Method: Compositional matrix adjust.
Identities = 189/324 (58%), Positives = 229/324 (70%), Gaps = 6/324 (1%)
Query: 32 GYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNR-KIKNYW 90
G PE + + I +E W+ K + Y +L EK RFEIFKDNL+ IDE N +Y
Sbjct: 11 GQVPERTEAETRRI--YEMWLVKHGRAYNALGEKERRFEIFKDNLKFIDEHNSVGNPSYK 68
Query: 91 LGLNEFADLRHEEFKEMFLGLKPDLARR--KDQSHEDFSYKDVVDLPKSVDWRKKGAVTH 148
LGLN+FADL ++E++ ++LG + D R E + +K+ DLP++VDWR+KGAV
Sbjct: 69 LGLNKFADLSNDEYRSVYLGTRMDGKGRLLGGPKSERYLFKEGDDLPETVDWREKGAVAP 128
Query: 149 VKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQY 208
VK+QG CGSCWAFSTV AVEGINQIVTGNL SLSEQEL+DCD TYN GCNGGLMDYAF +
Sbjct: 129 VKDQGQCGSCWAFSTVGAVEGINQIVTGNLTSLSEQELVDCDKTYNLGCNGGLMDYAFDF 188
Query: 209 IVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVA 268
I+ GG+ EEDYPY + C+ + + VVTI+GY DVPQN E SL KA+ANQP+SVA
Sbjct: 189 IIENGGIDTEEDYPYKAIDSMCDPNRKNARVVTIDGYEDVPQNDEKSLKKAVANQPVSVA 248
Query: 269 IEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRM 328
IEA GR FQ Y GV+ G CGTQLDHGV VGYG+ G+DY IV+NSWGP WGE GYIRM
Sbjct: 249 IEAGGRGFQLYQSGVFTGSCGTQLDHGVVTVGYGTEHGVDYWIVRNSWGPAWGENGYIRM 308
Query: 329 KRNTGKPE-GLCGINKMASYPIKK 351
+R+ E G CGI ASYP KK
Sbjct: 309 ERDVASTETGKCGIAMEASYPTKK 332
>gi|2511689|emb|CAB17074.1| cysteine proteinase precursor [Phaseolus vulgaris]
Length = 364
Score = 375 bits (962), Expect = e-101, Method: Compositional matrix adjust.
Identities = 182/346 (52%), Positives = 237/346 (68%), Gaps = 10/346 (2%)
Query: 9 TILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLER 68
++LI + S A SI+ YS ++++D++E W+ K KVY LDEK +R
Sbjct: 3 SMLIPTLLLLSFTFSHATAMSIINYS------ENEVMDMYEEWLVKHRKVYNGLDEKEKR 56
Query: 69 FEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARR---KDQSHED 125
F++FKDNL I + N + Y LGLN+FAD+ +EE++ M+LG + D RR +
Sbjct: 57 FQVFKDNLGFIQDHNAQNNTYTLGLNKFADITNEEYRAMYLGTRTDAKRRVMKTQNTGHR 116
Query: 126 FSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQE 185
++Y LP VDWR KGAV +K+QG+CGSCWAFSTVAAVEGIN IVTG SLSEQE
Sbjct: 117 YAYNSGDQLPVHVDWRLKGAVGPIKDQGNCGSCWAFSTVAAVEGINNIVTGEFVSLSEQE 176
Query: 186 LIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGY 245
L+DCD Y+ GCNGGLMDYAFQ+I+ GG+ EEDYPY +GTC+ TK +++VV I+GY
Sbjct: 177 LVDCDREYDEGCNGGLMDYAFQFIIQNGGIDTEEDYPYQGIDGTCDQTKKKTKVVQIDGY 236
Query: 246 HDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTR 305
DVP N+E++L KA+++QP+SVAIEASGR Q Y GV+ G CGT LDHGV VGYG+
Sbjct: 237 EDVPSNNENALKKAVSHQPVSVAIEASGRALQLYQSGVFTGKCGTALDHGVVVVGYGTEN 296
Query: 306 GLDYIIVKNSWGPKWGEKGYIRMKRNT-GKPEGLCGINKMASYPIK 350
G+DY +V+NSWG WGE GY +M+RN EG CGI SYP+K
Sbjct: 297 GVDYWLVRNSWGTGWGEDGYFKMERNVRSTSEGKCGIAMDCSYPVK 342
>gi|326493368|dbj|BAJ85145.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 436
Score = 374 bits (959), Expect = e-101, Method: Compositional matrix adjust.
Identities = 181/334 (54%), Positives = 231/334 (69%), Gaps = 12/334 (3%)
Query: 23 SFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDET 82
+ A D SIV Y S +++ ++ WM++ Y ++ E+ RFE F+DNLR+ID+
Sbjct: 21 AAAADMSIVSYGER---SEEEVRRMYAEWMAEHGSTYNAIGEEERRFEAFRDNLRYIDQH 77
Query: 83 NRK----IKNYWLGLNEFADLRHEEFKEMFLG--LKPDLARRKDQSHEDFSYKDVVDLPK 136
N + ++ LGLN FADL +EE++ +LG KPD R+ + + D +LP+
Sbjct: 78 NAAADAGVHSFRLGLNRFADLTNEEYRSTYLGARTKPD---RERKLSARYQAADNDELPE 134
Query: 137 SVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNG 196
SVDWRKKGAV VK+QG CGSCWAFS +AAVEGINQIVTG++ LSEQEL+DCD +YN G
Sbjct: 135 SVDWRKKGAVGAVKDQGGCGSCWAFSAIAAVEGINQIVTGDMIPLSEQELVDCDTSYNQG 194
Query: 197 CNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSL 256
CNGGLMDYAF++I++ GG+ EEDYPY + C+ K ++VVTI+GY DVP NSE SL
Sbjct: 195 CNGGLMDYAFEFIINNGGIDSEEDYPYKERDNRCDANKKNAKVVTIDGYEDVPVNSEKSL 254
Query: 257 LKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSW 316
KA+ANQP+SVAIEA GR FQ Y G++ G CGT LDHGVAAVGYG+ G DY +V+NSW
Sbjct: 255 QKAVANQPISVAIEAGGRAFQLYKSGIFTGTCGTALDHGVAAVGYGTENGKDYWLVRNSW 314
Query: 317 GPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
G WGE GYIRM+RN G CGI SYP K
Sbjct: 315 GSVWGEDGYIRMERNIKASSGKCGIAVEPSYPTK 348
>gi|34223513|gb|AAQ62999.1| oil palm polygalacturonase allergen PEST472 [Elaeis guineensis]
Length = 525
Score = 374 bits (959), Expect = e-101, Method: Compositional matrix adjust.
Identities = 183/335 (54%), Positives = 235/335 (70%), Gaps = 10/335 (2%)
Query: 27 DFSIVGYSPEDLT-----SNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDE 81
D SI+ Y S +++ L+E W++K + +L EK RFEIFKDN+R ID
Sbjct: 24 DMSIISYDEAHGVQGLERSEEEMRLLYEGWLAKHGRADNALGEKERRFEIFKDNVRFIDA 83
Query: 82 TNRKI----KNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHED-FSYKDVVDLPK 136
N +++ LGLN FAD+ +EE++ ++LG +P RR+ + D + Y +LP+
Sbjct: 84 HNAAADSGHRSFRLGLNRFADMTNEEYRTVYLGTRPASHRRRARLGSDRYRYNAGEELPE 143
Query: 137 SVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNG 196
SVDWR KGAVT VK+QGSCGSCWAFST+AAVEGIN+IVTG+L SLSEQEL+DCDN N G
Sbjct: 144 SVDWRDKGAVTTVKDQGSCGSCWAFSTIAAVEGINKIVTGDLISLSEQELVDCDNGQNQG 203
Query: 197 CNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSL 256
CNGGLMDYAF++I++ GG+ EEDYPY +G C+ + ++VV+I+GY DVP N E +L
Sbjct: 204 CNGGLMDYAFEFIINNGGIDTEEDYPYKARDGKCDQYRKNAKVVSIDGYEDVPVNDEKAL 263
Query: 257 LKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSW 316
KA+ANQP+SVAIEA GR+FQ Y G++ G CGT LDHGV AVGYG+ G DY IV+NSW
Sbjct: 264 QKAVANQPVSVAIEAGGREFQLYHSGIFTGRCGTDLDHGVVAVGYGTENGKDYWIVRNSW 323
Query: 317 GPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIKK 351
G WGE GYIRM+RN G CGI +SYP KK
Sbjct: 324 GGDWGESGYIRMERNVNASTGKCGIAMESSYPTKK 358
>gi|356533293|ref|XP_003535200.1| PREDICTED: LOW QUALITY PROTEIN: cysteine proteinase RD21a-like
[Glycine max]
Length = 466
Score = 373 bits (958), Expect = e-101, Method: Compositional matrix adjust.
Identities = 189/335 (56%), Positives = 237/335 (70%), Gaps = 12/335 (3%)
Query: 22 SSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDE 81
+ +A D SI+ Y E T + ++E+W+ K K Y +L EK RF+IFKDNLR I+E
Sbjct: 28 AGWAMDMSIIDYD-ESHTRH-----VYEAWLVKHGKAYNALGEKERRFKIFKDNLRFIEE 81
Query: 82 TN-RKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKD----QSHEDFSYKDVVDLPK 136
N K+Y LGLN+FADL +EE++ MFLG + + K + + ++Y+ +LP
Sbjct: 82 HNGAGDKSYKLGLNKFADLTNEEYRAMFLGTRTRGPKNKAAVVAKKTDRYAYRAGEELPA 141
Query: 137 SVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNG 196
VDWR+KGAVT +K+QG CGSCWAFSTV AVEGINQIVTGNL SLSEQEL+DCD YN G
Sbjct: 142 MVDWREKGAVTPIKDQGQCGSCWAFSTVGAVEGINQIVTGNLTSLSEQELVDCDRGYNMG 201
Query: 197 CNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSL 256
CNGGLMDYAF++IV GG+ EEDYPY ++ TC+ + + VVTI+GY DVP N E SL
Sbjct: 202 CNGGLMDYAFEFIVQNGGIDTEEDYPYHAKDNTCDPNRKNARVVTIDGYEDVPTNDEKSL 261
Query: 257 LKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSW 316
+KA+ANQP+SVAIEA G +FQ Y GV+ G CGT LDHGV AVGYG+ G DY +V+NSW
Sbjct: 262 MKAVANQPVSVAIEAGGMEFQLYQSGVFTGRCGTNLDHGVVAVGYGTENGTDYWLVRNSW 321
Query: 317 GPKWGEKGYIRMKRNTGKPE-GLCGINKMASYPIK 350
G WGE GYI+++RN E G CGI ASYPIK
Sbjct: 322 GSAWGENGYIKLERNVQNTETGKCGIAIEASYPIK 356
>gi|1256830|gb|AAB68374.1| cysteine endopeptidase 1 [Phaseolus vulgaris]
gi|2959418|emb|CAA12118.1| cysteine protease [Phaseolus vulgaris]
Length = 364
Score = 372 bits (956), Expect = e-100, Method: Compositional matrix adjust.
Identities = 181/346 (52%), Positives = 237/346 (68%), Gaps = 10/346 (2%)
Query: 9 TILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLER 68
++LI + S A SI+ YS ++++D++E W+ K KVY LDEK +R
Sbjct: 3 SMLIPTLLLLSFTFSHATAMSIINYS------ENEVMDMYEEWLVKHRKVYNGLDEKEKR 56
Query: 69 FEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARR---KDQSHED 125
F++FKDNL I + N + Y LGLN+FAD+ ++E++ M+LG + D RR +
Sbjct: 57 FQVFKDNLGFIQDHNAQNNTYTLGLNKFADITNKEYRAMYLGTRTDAKRRVMKTQNTGHR 116
Query: 126 FSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQE 185
++Y LP VDWR KGAV +K+QG+CGSCWAFSTVAAVEGIN IVTG SLSEQE
Sbjct: 117 YAYNSGDQLPVHVDWRLKGAVGPIKDQGNCGSCWAFSTVAAVEGINNIVTGEFVSLSEQE 176
Query: 186 LIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGY 245
L+DCD Y+ GCNGGLMDYAFQ+I+ GG+ EEDYPY +GTC+ TK +++VV I+GY
Sbjct: 177 LVDCDREYDEGCNGGLMDYAFQFIIQNGGIDTEEDYPYQGIDGTCDETKKKTKVVQIDGY 236
Query: 246 HDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTR 305
DVP N+E++L KA+++QP+SVAIEASGR Q Y GV+ G CGT LDHGV VGYG+
Sbjct: 237 EDVPSNNENALKKAVSHQPVSVAIEASGRALQLYQSGVFTGKCGTALDHGVVVVGYGTEN 296
Query: 306 GLDYIIVKNSWGPKWGEKGYIRMKRNT-GKPEGLCGINKMASYPIK 350
G+DY +V+NSWG WGE GY +M+RN EG CGI SYP+K
Sbjct: 297 GVDYWLVRNSWGTGWGEDGYFKMERNVRSTSEGKCGIAMDCSYPVK 342
>gi|2414570|emb|CAB16317.1| cysteine proteinase precursor [Nicotiana tabacum]
Length = 374
Score = 371 bits (953), Expect = e-100, Method: Compositional matrix adjust.
Identities = 186/355 (52%), Positives = 245/355 (69%), Gaps = 13/355 (3%)
Query: 8 KTILISFCISFFIRSSFARDFSIVGYSPEDL-------TSNDKLIDLFESWMSKFEKVYE 60
KTI+ + + F S+A D SI+ Y + D++ + +E W+++ + Y
Sbjct: 3 KTIITTLLFALFSSLSYAIDMSIIDYKNNHYARKWTLQSDEDQVKNRYEMWLAEHGRAYN 62
Query: 61 SLDEKLERFEIFKDNLRHID-ETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRK 119
+L EK +RFEIFKDNLR I+ N + Y +GLN+FADL +EE++ M+LG K D RR
Sbjct: 63 ALGEKEKRFEIFKDNLRFIEGHNNSGNRTYKVGLNQFADLTNEEYRTMYLGTKSDARRRF 122
Query: 120 DQSH---EDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTG 176
+S + ++ + +P SVDWRK+GAV +KNQGSCGSCWAFSTVAAVEGINQIVTG
Sbjct: 123 VKSKNPSQRYASRPNELMPHSVDWRKRGAVAPIKNQGSCGSCWAFSTVAAVEGINQIVTG 182
Query: 177 NLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGE 236
+ +LSEQEL+DCD N+GCNGGLMDYAF++I+S GG+ E+ YPY EG C+ +
Sbjct: 183 EMITLSEQELVDCDRVQNSGCNGGLMDYAFEFIISNGGMDTEKHYPYRGVEGRCDPVRKN 242
Query: 237 SEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGV 296
+VV+I+GY DVP+N E +L KA+A+QP+ VAIEASGR FQ YS GV+ G CG ++DHGV
Sbjct: 243 YKVVSIDGYEDVPRN-ERALQKAVAHQPVCVAIEASGRAFQLYSSGVFTGECGEEVDHGV 301
Query: 297 AAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPE-GLCGINKMASYPIK 350
VGYGS G+DY IV+NSWG KWGE GY++M+RN K G CGI ASYP K
Sbjct: 302 VVVGYGSEDGVDYWIVRNSWGTKWGENGYVKMERNVKKSHLGKCGIMTEASYPTK 356
>gi|30141027|dbj|BAC75927.1| cysteine protease-5 [Helianthus annuus]
Length = 365
Score = 371 bits (953), Expect = e-100, Method: Compositional matrix adjust.
Identities = 178/325 (54%), Positives = 234/325 (72%), Gaps = 7/325 (2%)
Query: 33 YSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRK-IKNYWL 91
Y + +++++ + +E W+++ K Y +L EK RF IF DNL+ IDE N ++Y +
Sbjct: 21 YVTSNTRTDEEVRNTYELWLARHGKTYNALGEKESRFRIFADNLKFIDEHNLSGNRSYKV 80
Query: 92 GLNEFADLRHEEFKEMFLGLKPDLARR-----KDQSHEDFSYKDVVDLPKSVDWRKKGAV 146
GLN+FADL +EE++ M+LG K D RR + + ++ ++ P VDWR++GAV
Sbjct: 81 GLNQFADLTNEEYRSMYLGTKVDPYRRIAKMQRGEISRRYAVQENEMFPAKVDWRERGAV 140
Query: 147 THVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAF 206
+ VKNQG CGSCWAFSTVA+VEGIN+IVTG+L SLSEQEL+DCDN YN+GCNGG MDYAF
Sbjct: 141 SPVKNQGGCGSCWAFSTVASVEGINKIVTGDLISLSEQELVDCDNKYNSGCNGGSMDYAF 200
Query: 207 QYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLS 266
Q+IVS GG+ E DYPY C+ + ++++V+I+GY DVP +E +L+KA+A+QP+S
Sbjct: 201 QFIVSNGGIDSESDYPYKGVGAVCDPVRNKAKIVSIDGYEDVPPMNEKALMKAVAHQPVS 260
Query: 267 VAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYI 326
V IEASGR FQ Y+ GV G CGT LDHGV VGYGS G DY IV+NSWGP+WGE GYI
Sbjct: 261 VGIEASGRAFQLYTSGVLTGSCGTNLDHGVVVVGYGSENGKDYWIVRNSWGPEWGEDGYI 320
Query: 327 RMKRN-TGKPEGLCGINKMASYPIK 350
RM+RN P G+CGI MASYPIK
Sbjct: 321 RMERNMVDTPVGMCGITLMASYPIK 345
>gi|148927394|gb|ABR19828.1| cysteine proteinase [Elaeis guineensis]
Length = 469
Score = 371 bits (953), Expect = e-100, Method: Compositional matrix adjust.
Identities = 177/318 (55%), Positives = 227/318 (71%), Gaps = 7/318 (2%)
Query: 40 SNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIK----NYWLGLNE 95
S+D++ L+++W ++ + Y +LDE +R EIF+DNLR ID+ N ++ LGL
Sbjct: 39 SDDEVHRLYQAWKAQHARSYNALDEDEQRLEIFRDNLRFIDQHNAAANAGKYSFRLGLTR 98
Query: 96 FADLRHEEFKEMFLGLKPDLARRKDQS---HEDFSYKDVVDLPKSVDWRKKGAVTHVKNQ 152
FADL +EE++ +LG++ +RR+ S + ++ DLP S+DWR KGAV VK+Q
Sbjct: 99 FADLTNEEYRSTYLGVRTAGSRRRRNSTVGSNRYRFRSSDDLPDSIDWRDKGAVVDVKDQ 158
Query: 153 GSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVST 212
GSCGSCWAFST+AAVEGIN IVTG+L SLSEQEL+DCD YN GCNGGLMDYAF++I+S
Sbjct: 159 GSCGSCWAFSTIAAVEGINHIVTGDLISLSEQELVDCDTYYNQGCNGGLMDYAFEFIISN 218
Query: 213 GGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEAS 272
GG+ +EDYPY +G+C+ + + VVTI+ Y DVP N E SL KA+ANQP+SVAIEA
Sbjct: 219 GGIDTDEDYPYTGRDGSCDQYRKNAHVVTIDSYEDVPINDEKSLQKAVANQPVSVAIEAG 278
Query: 273 GRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNT 332
GR FQ Y G++ G+CGT+LDHGV A+GYGS G Y IVKNSWG WGE GYIRM+RN
Sbjct: 279 GRAFQLYESGIFTGYCGTELDHGVTAIGYGSENGKYYWIVKNSWGSDWGESGYIRMERNI 338
Query: 333 GKPEGLCGINKMASYPIK 350
G CGI ASYPIK
Sbjct: 339 NSATGKCGIAMEASYPIK 356
>gi|413919736|gb|AFW59668.1| cysteine protease 1 [Zea mays]
Length = 469
Score = 371 bits (952), Expect = e-100, Method: Compositional matrix adjust.
Identities = 177/328 (53%), Positives = 233/328 (71%), Gaps = 8/328 (2%)
Query: 28 FSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRK-- 85
SIV Y S ++ ++ WM+ + Y ++ E+ RFE+F+DNLR++D N
Sbjct: 29 MSIVSYGER---SEEEARRMYAEWMAAHGRTYNAVGEEERRFEVFRDNLRYVDAHNAAAD 85
Query: 86 --IKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKK 143
+ ++ LGLN FADL ++E++ +LG++ +R+ + + + D DLP+SVDWR K
Sbjct: 86 AGVHSFRLGLNRFADLTNDEYRATYLGVR-SRPQRERRLGDRYLAGDNEDLPESVDWRAK 144
Query: 144 GAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMD 203
GAV VK+QGSCGSCWAFST+AAVEGINQIVTG++ SLSEQEL+DCD +YN GCNGGLMD
Sbjct: 145 GAVAEVKDQGSCGSCWAFSTIAAVEGINQIVTGDMISLSEQELVDCDTSYNQGCNGGLMD 204
Query: 204 YAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQ 263
YAF++I++ GG+ EEDYPY +G C++ + ++VVTI+ Y DVP NSE SL KA+ANQ
Sbjct: 205 YAFEFIINNGGIDTEEDYPYKGTDGRCDVNRKNAKVVTIDSYEDVPANSEKSLQKAVANQ 264
Query: 264 PLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEK 323
P+SVAIEA GR FQ Y+ G++ G CGT LDHGV AVGYG+ G DY IVKNSWG WGE
Sbjct: 265 PISVAIEAGGRAFQLYNSGIFTGTCGTALDHGVTAVGYGTENGKDYWIVKNSWGSSWGES 324
Query: 324 GYIRMKRNTGKPEGLCGINKMASYPIKK 351
GY+RM+RN G CGI SYP+KK
Sbjct: 325 GYVRMERNIKASSGKCGIAVEPSYPLKK 352
>gi|226496089|ref|NP_001149658.1| cysteine protease 1 precursor [Zea mays]
gi|195629242|gb|ACG36262.1| cysteine protease 1 precursor [Zea mays]
Length = 469
Score = 371 bits (952), Expect = e-100, Method: Compositional matrix adjust.
Identities = 176/328 (53%), Positives = 233/328 (71%), Gaps = 8/328 (2%)
Query: 28 FSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRK-- 85
SIV Y S ++ ++ WM+ + Y ++ E+ RFE+F+DNLR++D N
Sbjct: 29 MSIVSYGER---SEEEARRMYAEWMAAHGRTYNAVGEEERRFEVFRDNLRYVDAHNAAAD 85
Query: 86 --IKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKK 143
+ ++ LGLN FADL ++E++ +LG++ +R+ + + + D DLP+SVDWR K
Sbjct: 86 AGVHSFRLGLNRFADLTNDEYRATYLGVR-SRPQRERRLGDRYLAGDNEDLPESVDWRAK 144
Query: 144 GAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMD 203
GAV +K+QGSCGSCWAFST+AAVEGINQIVTG++ SLSEQEL+DCD +YN GCNGGLMD
Sbjct: 145 GAVAEIKDQGSCGSCWAFSTIAAVEGINQIVTGDMISLSEQELVDCDTSYNQGCNGGLMD 204
Query: 204 YAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQ 263
YAF++I++ GG+ EEDYPY +G C++ + ++VVTI+ Y DVP NSE SL KA+ANQ
Sbjct: 205 YAFEFIINNGGIDTEEDYPYKGTDGRCDVNRKNAKVVTIDSYEDVPANSEKSLQKAVANQ 264
Query: 264 PLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEK 323
P+SVAIEA GR FQ Y+ G++ G CGT LDHGV AVGYG+ G DY IVKNSWG WGE
Sbjct: 265 PISVAIEAGGRAFQLYNSGIFTGTCGTALDHGVTAVGYGTENGKDYWIVKNSWGSSWGES 324
Query: 324 GYIRMKRNTGKPEGLCGINKMASYPIKK 351
GY+RM+RN G CGI SYP+KK
Sbjct: 325 GYVRMERNIKASSGKCGIAVEPSYPLKK 352
>gi|224096714|ref|XP_002310708.1| predicted protein [Populus trichocarpa]
gi|222853611|gb|EEE91158.1| predicted protein [Populus trichocarpa]
Length = 356
Score = 371 bits (952), Expect = e-100, Method: Compositional matrix adjust.
Identities = 182/333 (54%), Positives = 238/333 (71%), Gaps = 4/333 (1%)
Query: 22 SSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDE 81
S F D + + + S+D+++ +++ W+ K K Y L EK +RFEIFK+NLR IDE
Sbjct: 2 SIFNHDDNHLSHDQSSWRSDDEVMSIYKWWLQKHGKAYNRLGEKAKRFEIFKNNLRFIDE 61
Query: 82 TNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSH---EDFSYKDVVDLPKSV 138
N + + Y +GL +FADL ++E++ MFLG + D RR +S E ++YK LP+SV
Sbjct: 62 HNSQNRTYKVGLTKFADLTNQEYRAMFLGTRSDPKRRLMKSKNPSERYAYKAGDKLPESV 121
Query: 139 DWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCN 198
DWR KGAV +K+QGSCGSCWAFSTVAAVEGINQIVTG L SLSEQEL+DCD YN GCN
Sbjct: 122 DWRGKGAVNPIKDQGSCGSCWAFSTVAAVEGINQIVTGELISLSEQELVDCDRFYNAGCN 181
Query: 199 GGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLK 258
GGLMDYAFQ+I++ GGL E+DYPY+ + TC+ K +++ V+I+G+ DV E +L K
Sbjct: 182 GGLMDYAFQFIINNGGLDTEKDYPYLGNDDTCDRDKMKTKAVSIDGFEDVLPFDEKALQK 241
Query: 259 ALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGP 318
A+A+QP+SVAIEASG QFY GV+ G CGT LDHGV VGYG+ +GLDY +V+NSWG
Sbjct: 242 AVAHQPVSVAIEASGMALQFYQSGVFTGECGTALDHGVVVVGYGTEKGLDYWLVRNSWGT 301
Query: 319 KWGEKGYIRMKRNTGKP-EGLCGINKMASYPIK 350
+WGE GYI+M+RN G CGI +SYP+K
Sbjct: 302 EWGEHGYIKMQRNVRDTYTGRCGIAMESSYPVK 334
>gi|544129|sp|P25803.2|CYSEP_PHAVU RecName: Full=Vignain; AltName: Full=Bean endopeptidase; AltName:
Full=Cysteine proteinase EP-C1; Flags: Precursor
gi|20994|emb|CAA44816.1| endopeptidase [Phaseolus vulgaris]
Length = 362
Score = 371 bits (952), Expect = e-100, Method: Compositional matrix adjust.
Identities = 186/345 (53%), Positives = 236/345 (68%), Gaps = 12/345 (3%)
Query: 10 ILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERF 69
+++SF + + +SF + +DL S + L DL+E W S V SL EK +RF
Sbjct: 9 VVLSFSLVLGVANSF-------DFHDKDLASEESLWDLYERWRSH-HTVSRSLGEKHKRF 60
Query: 70 EIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLK---PDLARRKDQSHEDF 126
+FK NL H+ TN+ K Y L LN+FAD+ + EF+ + G K P + R + F
Sbjct: 61 NVFKANLMHVHNTNKMDKPYKLKLNKFADMTNHEFRSTYAGSKVNHPRMFRGTPHENGAF 120
Query: 127 SYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQEL 186
Y+ VV +P SVDWRKKGAVT VK+QG CGSCWAFSTV AVEGINQI T L +LSEQEL
Sbjct: 121 MYEKVVSVPPSVDWRKKGAVTDVKDQGQCGSCWAFSTVVAVEGINQIKTNKLVALSEQEL 180
Query: 187 IDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYH 246
+DCD N GCNGGLM+ AF++I GG+ E +YPY +EGTC+ +K V+I+G+
Sbjct: 181 VDCDKEENQGCNGGLMESAFEFIKQKGGITTESNYPYKAQEGTCDASKVNDLAVSIDGHE 240
Query: 247 DVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGST-R 305
+VP N ED+LLKA+ANQP+SVAI+A G DFQFYS GV+ G C T L+HGVA VGYG+T
Sbjct: 241 NVPANDEDALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGDCSTDLNHGVAIVGYGTTVD 300
Query: 306 GLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
G +Y IV+NSWGP+WGE GYIRM+RN K EGLCGI + SYPIK
Sbjct: 301 GTNYWIVRNSWGPEWGEHGYIRMQRNISKKEGLCGIAMLPSYPIK 345
>gi|111073715|dbj|BAF02546.1| triticain alpha [Triticum aestivum]
gi|388890585|gb|AFK80346.1| cysteine endopeptidase EP alpha [Secale cereale x Triticum durum]
Length = 461
Score = 370 bits (951), Expect = e-100, Method: Compositional matrix adjust.
Identities = 179/330 (54%), Positives = 231/330 (70%), Gaps = 12/330 (3%)
Query: 27 DFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRK- 85
D SIV Y S +++ ++ WMS+ + Y ++ E+ RFE+F+DNLR+ID+ N
Sbjct: 23 DMSIVSYGER---SEEEVRRMYAEWMSEHRRTYNAIGEEERRFEVFRDNLRYIDQHNAAA 79
Query: 86 ---IKNYWLGLNEFADLRHEEFKEMFLG--LKPDLARRKDQSHEDFSYKDVVDLPKSVDW 140
+ ++ LGLN FADL +EE++ +LG KPD R+ ++ D +LP++VDW
Sbjct: 80 DAGLHSFRLGLNRFADLTNEEYRSTYLGARTKPDRERKLSARYQ---ADDNEELPETVDW 136
Query: 141 RKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGG 200
RKKGAV +K+QG CGSCWAFS +AAVEGINQIVTG++ LSEQEL+DCD +YN GCNGG
Sbjct: 137 RKKGAVAAIKDQGGCGSCWAFSAIAAVEGINQIVTGDMIPLSEQELVDCDTSYNEGCNGG 196
Query: 201 LMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKAL 260
LMDYAF++I++ GG+ EEDYPY + C+ K ++VVTI+GY DVP NSE SL KA+
Sbjct: 197 LMDYAFEFIINNGGIDSEEDYPYKERDNRCDANKKNAKVVTIDGYEDVPVNSEKSLQKAV 256
Query: 261 ANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKW 320
ANQP+SVAIEA GR FQ Y G++ G CGT LDHGVAAVGYG+ G DY +V+NSWG W
Sbjct: 257 ANQPISVAIEAGGRAFQLYKSGIFTGTCGTALDHGVAAVGYGTENGKDYWLVRNSWGTVW 316
Query: 321 GEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
GE GYIRM+RN G CGI SYP K
Sbjct: 317 GEDGYIRMERNIKASSGKCGIAVEPSYPTK 346
>gi|5777889|emb|CAB53515.1| cysteine protease [Solanum tuberosum]
Length = 466
Score = 370 bits (951), Expect = e-100, Method: Compositional matrix adjust.
Identities = 185/355 (52%), Positives = 244/355 (68%), Gaps = 6/355 (1%)
Query: 1 MALSSQFKTILISFCISFFIRSSFARDFSIVGYSPEDL--TSNDKLIDLFESWMSKFEKV 58
MA S TI + + F SS A D SI+ Y + S+D++ L+ESW+ + K
Sbjct: 1 MAAHSSTLTISLLLMLIFSTLSS-ASDMSIISYDETHIHHRSDDEVSALYESWLIEHGKS 59
Query: 59 YESLDEKLERFEIFKDNLRHIDETNR-KIKNYWLGLNEFADLRHEEFKEMFLGLKPDLAR 117
Y +L EK +RF+IFKDNL++IDE N ++Y LGL +FADL +EE++ ++LG K R
Sbjct: 60 YNALGEKDKRFQIFKDNLKYIDEQNSVPNQSYKLGLTKFADLTNEEYRSIYLGTKSSGDR 119
Query: 118 RKDQSHEDFSYKDVV--DLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVT 175
RK ++ Y V LP+SVDWR KG + VK+QGSCGSCWAFS VAA+E IN IVT
Sbjct: 120 RKLSKNKSDRYLPKVGDSLPESVDWRDKGVLVGVKDQGSCGSCWAFSAVAAMESINAIVT 179
Query: 176 GNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKG 235
GNL SLSEQEL+DCD +YN GC+GGLMDYAF+++++ GG+ EEDYPY C+ +
Sbjct: 180 GNLISLSEQELVDCDKSYNEGCDGGLMDYAFEFVINNGGIDTEEDYPYKERNDVCDQYRK 239
Query: 236 ESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHG 295
++VV I+ Y DVP N+E +L KA+A+QP+S+AIEA GRD Q Y G++ G CGT +DHG
Sbjct: 240 NAKVVKIDSYEDVPVNNEKALQKAVAHQPVSIAIEAGGRDLQHYKSGIFTGKCGTAVDHG 299
Query: 296 VAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
V A GYGS G+DY IV+NSWG KWGEKGY+R++RN GLCG+ SYP+K
Sbjct: 300 VVAAGYGSENGMDYWIVRNSWGAKWGEKGYLRVQRNVASSSGLCGLATEPSYPVK 354
>gi|18423124|ref|NP_568722.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|75309064|sp|Q9FGR9.1|CEP1_ARATH RecName: Full=KDEL-tailed cysteine endopeptidase CEP1; AltName:
Full=Cysteine proteinase CP56; Short=AtCP56; Flags:
Precursor
gi|9759028|dbj|BAB09397.1| cysteine endopeptidase [Arabidopsis thaliana]
gi|20258850|gb|AAM13907.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|308097832|gb|ADO14465.1| papain [Arabidopsis thaliana]
gi|332008536|gb|AED95919.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 361
Score = 370 bits (950), Expect = e-100, Method: Compositional matrix adjust.
Identities = 187/345 (54%), Positives = 236/345 (68%), Gaps = 10/345 (2%)
Query: 10 ILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERF 69
I+++ C+ + ++ DF +D+ S + L +L+E W S V SL+EK +RF
Sbjct: 5 IVLALCMLMVLETTKGLDFH-----NKDVESENSLWELYERWRS-HHTVARSLEEKAKRF 58
Query: 70 EIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARR---KDQSHEDF 126
+FK N++HI ETN+K K+Y L LN+F D+ EEF+ + G R + ++ + F
Sbjct: 59 NVFKHNVKHIHETNKKDKSYKLKLNKFGDMTSEEFRRTYAGSNIKHHRMFQGEKKATKSF 118
Query: 127 SYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQEL 186
Y +V LP SVDWRK GAVT VKNQG CGSCWAFSTV AVEGINQI T L SLSEQEL
Sbjct: 119 MYANVNTLPTSVDWRKNGAVTPVKNQGQCGSCWAFSTVVAVEGINQIRTKKLTSLSEQEL 178
Query: 187 IDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYH 246
+DCD N GCNGGLMD AF++I GGL E YPY + TC+ K + VV+I+G+
Sbjct: 179 VDCDTNQNQGCNGGLMDLAFEFIKEKGGLTSELVYPYKASDETCDTNKENAPVVSIDGHE 238
Query: 247 DVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGST-R 305
DVP+NSED L+KA+ANQP+SVAI+A G DFQFYS GV+ G CGT+L+HGVA VGYG+T
Sbjct: 239 DVPKNSEDDLMKAVANQPVSVAIDAGGSDFQFYSEGVFTGRCGTELNHGVAVVGYGTTID 298
Query: 306 GLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
G Y IVKNSWG +WGEKGYIRM+R EGLCGI ASYP+K
Sbjct: 299 GTKYWIVKNSWGEEWGEKGYIRMQRGIRHKEGLCGIAMEASYPLK 343
>gi|1345573|emb|CAA40073.1| endopeptidase (EP-C1) [Phaseolus vulgaris]
Length = 361
Score = 370 bits (949), Expect = e-100, Method: Compositional matrix adjust.
Identities = 187/345 (54%), Positives = 238/345 (68%), Gaps = 12/345 (3%)
Query: 10 ILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERF 69
+++SF + + +SF + +DL S + L DL+E W S V SL EK +RF
Sbjct: 8 VVLSFSLVLGVANSF-------DFHDKDLASEESLWDLYERWRSH-HTVSRSLGEKHKRF 59
Query: 70 EIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARR-KDQSHED--F 126
+FK NL H+ TN+ K Y L LN+FAD+ + EF+ + G K + R + HE+ F
Sbjct: 60 NVFKANLMHVHNTNKMDKPYKLKLNKFADMTNHEFRSTYAGSKVNHHRMFRGTPHENGAF 119
Query: 127 SYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQEL 186
Y+ VV +P SVDWRKKGAVT VK+QG CGSCWAFSTV AVEGINQI T L +LSEQEL
Sbjct: 120 MYEKVVSVPPSVDWRKKGAVTDVKDQGQCGSCWAFSTVVAVEGINQIKTNKLVALSEQEL 179
Query: 187 IDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYH 246
+DCD N GCNGGLM+ AF++I GG+ E +YPY +EGTC+ +K V+I+G+
Sbjct: 180 VDCDKEENQGCNGGLMESAFEFIKQKGGITTESNYPYKAQEGTCDASKVNDLAVSIDGHE 239
Query: 247 DVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGST-R 305
+VP N ED+LLKA+ANQP+SVAI+A G DFQFYS GV+ G C T L+HGVA VGYG+T
Sbjct: 240 NVPANDEDALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGDCSTDLNHGVAIVGYGTTVD 299
Query: 306 GLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
G +Y IV+NSWGP+WGE GYIRM+RN K EGLCGI + SYPIK
Sbjct: 300 GTNYWIVRNSWGPEWGEHGYIRMQRNISKKEGLCGIAMLPSYPIK 344
>gi|58531896|gb|AAW78660.1| cysteine protease [Nicotiana tabacum]
Length = 361
Score = 370 bits (949), Expect = e-100, Method: Compositional matrix adjust.
Identities = 193/347 (55%), Positives = 242/347 (69%), Gaps = 10/347 (2%)
Query: 8 KTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLE 67
K L+ F ++ +R + DF ++L + +KL +L+E W S V SLDEK +
Sbjct: 3 KLFLVLFSLALVLRLGESFDFHE-----KELETEEKLWELYERWRSH-HTVSRSLDEKDK 56
Query: 68 RFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARR---KDQSHE 124
RF +FK N+ ++ N+K K Y L LN+FAD+ + EF+ + G K R +++
Sbjct: 57 RFNVFKANVHYVHNFNKKDKPYKLKLNKFADMTNHEFRHHYAGSKIKHHRSFLGASRANG 116
Query: 125 DFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQ 184
F Y +V D+P SVDWRKKGAVT VK+QG CGSCWAFSTV AVEGINQI T L SLSEQ
Sbjct: 117 TFMYANVEDVPPSVDWRKKGAVTPVKDQGKCGSCWAFSTVVAVEGINQIKTNELVSLSEQ 176
Query: 185 ELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTING 244
EL+DCD + N GCNGGLMD AF++I GG++ EE+YPY+ E G C++ K S VV+I+G
Sbjct: 177 ELVDCDTSQNQGCNGGLMDMAFEFIKKKGGINTEENYPYMAEGGECDIQKRNSPVVSIDG 236
Query: 245 YHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGST 304
Y DVP N EDSLLKA+ANQP+SVAI+ASG DFQFYS GV+ G CGT+LDHGVA VGYG+T
Sbjct: 237 YEDVPPNDEDSLLKAVANQPVSVAIQASGSDFQFYSEGVFTGDCGTELDHGVAIVGYGTT 296
Query: 305 -RGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
G Y IV+NSWGP+WGEKGYIRM+R EGLCGI SYPIK
Sbjct: 297 LDGTKYWIVRNSWGPEWGEKGYIRMQREIDAEEGLCGIAMQPSYPIK 343
>gi|357166359|ref|XP_003580684.1| PREDICTED: oryzain alpha chain-like [Brachypodium distachyon]
Length = 456
Score = 369 bits (948), Expect = e-99, Method: Compositional matrix adjust.
Identities = 177/329 (53%), Positives = 233/329 (70%), Gaps = 8/329 (2%)
Query: 27 DFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRK- 85
D SIV Y S +++ ++ WM++ + Y ++ E+ RFE+F+DNLR++D+ N
Sbjct: 24 DMSIVSYGER---SEEEVRRMYVEWMAENGRTYNAIGEEERRFEVFRDNLRYVDQHNAAA 80
Query: 86 ---IKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRK 142
+ ++ LGLN FADL +EE+++ +LG++ R + S + D +LP+SVDWR+
Sbjct: 81 DAGLHSFRLGLNRFADLTNEEYRDTYLGVRTKPVRERRLSGR-YQAADNEELPESVDWRE 139
Query: 143 KGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLM 202
KGAV VK+QG CGSCWAFS +AAVEGINQIVTG++ +LSEQEL+DCD +YN GCNGGLM
Sbjct: 140 KGAVAKVKDQGGCGSCWAFSAIAAVEGINQIVTGDMIALSEQELVDCDTSYNQGCNGGLM 199
Query: 203 DYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN 262
DYAF++I++ GG+ EEDYPY + C+ K ++VVTI+GY DVP NSE SL KA+AN
Sbjct: 200 DYAFEFIINNGGIDSEEDYPYKERDNRCDANKKNAKVVTIDGYEDVPVNSELSLKKAVAN 259
Query: 263 QPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGE 322
QP+SVAIEA GR FQ Y G++ G CGT LDHGV AVGYGS G DY IVKNSWG WGE
Sbjct: 260 QPISVAIEAGGRAFQLYKSGIFTGRCGTALDHGVTAVGYGSENGKDYWIVKNSWGTVWGE 319
Query: 323 KGYIRMKRNTGKPEGLCGINKMASYPIKK 351
GY+R++RN G CGI SYP+KK
Sbjct: 320 DGYVRLERNIKATSGKCGIAIEPSYPLKK 348
>gi|194352754|emb|CAQ00105.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
gi|326513690|dbj|BAJ87864.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326514532|dbj|BAJ96253.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 463
Score = 369 bits (947), Expect = 1e-99, Method: Compositional matrix adjust.
Identities = 181/332 (54%), Positives = 230/332 (69%), Gaps = 12/332 (3%)
Query: 25 ARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNR 84
A D SIV Y S +++ ++ WM++ Y ++ E+ RFE F+DNLR+ID+ N
Sbjct: 23 AADMSIVSYGER---SEEEVRRMYAEWMAEHGSTYNAIGEEERRFEAFRDNLRYIDQHNA 79
Query: 85 K----IKNYWLGLNEFADLRHEEFKEMFLG--LKPDLARRKDQSHEDFSYKDVVDLPKSV 138
+ ++ LGLN FADL +EE++ +LG KPD R+ ++ D +LP+SV
Sbjct: 80 AADAGVHSFRLGLNRFADLTNEEYRSTYLGARTKPDRERKLSARYQ---AADNDELPESV 136
Query: 139 DWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCN 198
DWRKKGAV VK+QG CGSCWAFS +AAVEGINQIVTG++ LSEQEL+DCD +YN GCN
Sbjct: 137 DWRKKGAVGAVKDQGGCGSCWAFSAIAAVEGINQIVTGDMIPLSEQELVDCDTSYNQGCN 196
Query: 199 GGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLK 258
GGLMDYAF++I++ GG+ EEDYPY + C+ K ++VVTI+GY DVP NSE SL K
Sbjct: 197 GGLMDYAFEFIINNGGIDSEEDYPYKERDNRCDANKKNAKVVTIDGYEDVPVNSEKSLQK 256
Query: 259 ALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGP 318
A+ANQP+SVAIEA GR FQ Y G++ G CGT LDHGVAAVGYG+ G DY +V+NSWG
Sbjct: 257 AVANQPISVAIEAGGRAFQLYKSGIFTGTCGTALDHGVAAVGYGTENGKDYWLVRNSWGS 316
Query: 319 KWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
WGE GYIRM+RN G CGI SYP K
Sbjct: 317 VWGEDGYIRMERNIKASSGKCGIAVEPSYPTK 348
>gi|357437715|ref|XP_003589133.1| Cysteine proteinase [Medicago truncatula]
gi|87240770|gb|ABD32628.1| Granulin; Peptidase C1A, papain [Medicago truncatula]
gi|355478181|gb|AES59384.1| Cysteine proteinase [Medicago truncatula]
Length = 474
Score = 369 bits (946), Expect = 1e-99, Method: Compositional matrix adjust.
Identities = 193/359 (53%), Positives = 243/359 (67%), Gaps = 13/359 (3%)
Query: 3 LSSQFKTILISFCISFFIRSSFARDFSIVGYS---PEDLTS---NDKLIDLFESWMSKFE 56
LS K +++ SF + S A D SI+ Y P+ TS N +++ ++E W+ K
Sbjct: 6 LSPAMKLMIVLIISSFTV--SLALDMSIISYDKTHPDKSTSKRTNKEVLTMYEEWLVKHG 63
Query: 57 KVYESLDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLA 116
K Y L EK +RFEIFKDNL+ IDE N Y LGL FADL +EE++ FLG K D
Sbjct: 64 KSYNGLGEKDKRFEIFKDNLKFIDEHNGLNSTYRLGLTRFADLTNEEYRSKFLGTKIDPN 123
Query: 117 RRKDQ---SHEDFSYKDVVD-LPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQ 172
RR + S + V D LP+SVDWRK+GAV VK+Q SCGSCWAFS +AAVEGIN+
Sbjct: 124 RRMKKLGGSKSNRYAPRVGDKLPESVDWRKEGAVVGVKDQASCGSCWAFSAIAAVEGINK 183
Query: 173 IVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEM 232
IVTG+L SLSEQEL+DCD +YN GCNGGLMDYAF++I+S GG+ E+DYPY +G C+
Sbjct: 184 IVTGDLISLSEQELVDCDTSYNEGCNGGLMDYAFEFIISNGGIDSEDDYPYKAVDGRCDQ 243
Query: 233 TKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQL 292
+ ++VVTI+ Y DVP E +L KA+ANQP++VA+E GR+FQ Y GV+ G CGT L
Sbjct: 244 NRKNAKVVTIDDYEDVPAYDELALQKAVANQPIAVAVEGGGREFQLYEYGVFTGRCGTAL 303
Query: 293 DHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPE-GLCGINKMASYPIK 350
DHGVAAVGYG+ G DY IV+NSWG WGE+GYIR++RN G CGI SYPIK
Sbjct: 304 DHGVAAVGYGTENGKDYWIVRNSWGGSWGEQGYIRLERNLASSRAGKCGIAIEPSYPIK 362
>gi|302796898|ref|XP_002980210.1| hypothetical protein SELMODRAFT_153766 [Selaginella moellendorffii]
gi|300151826|gb|EFJ18470.1| hypothetical protein SELMODRAFT_153766 [Selaginella moellendorffii]
Length = 479
Score = 369 bits (946), Expect = 2e-99, Method: Compositional matrix adjust.
Identities = 186/336 (55%), Positives = 230/336 (68%), Gaps = 13/336 (3%)
Query: 27 DFSIV--GYSPEDLTSNDKLIDLFESWMSKFEKVY--------ESLDEKLERFEIFKDNL 76
D+SI+ GY P+DL+S ++L LF+SWM + K Y EK R+ IFKDNL
Sbjct: 34 DYSILDLGYDPQDLSSEERLQALFDSWMLQHGKSYADNALSGDSQAGEKATRYGIFKDNL 93
Query: 77 RHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVV--DL 134
R I N K + Y+LGLN FADL +EEF+ G + D +R + SHE+F Y V DL
Sbjct: 94 RFIHGENEKNQGYFLGLNAFADLTNEEFRAQRHGGRFDRSRER-TSHEEFRYGSVQLKDL 152
Query: 135 PKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYN 194
P S+DWR+KGAV VK+QGSCGSCWAFS VAA+EG+N++ TG L SLSEQEL+DCD +
Sbjct: 153 PDSIDWREKGAVVGVKDQGSCGSCWAFSAVAAIEGVNKLATGELVSLSEQELVDCDKGED 212
Query: 195 NGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSED 254
GCNGGLMDYAF +++ GGL E DYPY C+ +K ++VVTI+GY DVP N E
Sbjct: 213 EGCNGGLMDYAFGFVIKNGGLDTEADYPYKGYGTRCDRSKMNAKVVTIDGYEDVPVNDET 272
Query: 255 SLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKN 314
+LLKA+A+QP+SVAI+A G QFY G++ G CGT LDHGV VGYG G Y I+KN
Sbjct: 273 ALLKAVAHQPVSVAIDAGGSSMQFYRSGIFTGRCGTDLDHGVTNVGYGKEDGKAYWIIKN 332
Query: 315 SWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
SWG WGEKGY++M RNTG GLCGIN ASYP K
Sbjct: 333 SWGSNWGEKGYVKMARNTGLAAGLCGINMEASYPTK 368
>gi|18401614|ref|NP_564497.1| cysteine proteinase RD21a [Arabidopsis thaliana]
gi|1172873|sp|P43297.1|RD21A_ARATH RecName: Full=Cysteine proteinase RD21a; Short=RD21; Flags:
Precursor
gi|12321010|gb|AAG50628.1|AC083835_13 cysteine protease, putative [Arabidopsis thaliana]
gi|435619|dbj|BAA02374.1| thiol protease [Arabidopsis thaliana]
gi|18175926|gb|AAL59952.1| putative cysteine proteinase RD21A [Arabidopsis thaliana]
gi|22136972|gb|AAM91715.1| putative cysteine proteinase RD21A [Arabidopsis thaliana]
gi|332194014|gb|AEE32135.1| cysteine proteinase RD21a [Arabidopsis thaliana]
Length = 462
Score = 369 bits (946), Expect = 2e-99, Method: Compositional matrix adjust.
Identities = 184/342 (53%), Positives = 235/342 (68%), Gaps = 13/342 (3%)
Query: 19 FIRSSFARDFSIVGY------SPEDLTSNDKLIDLFESWMSKFEKVYE--SLDEKLERFE 70
+ S A D SI+ Y S S +++ ++E+W+ K K SL EK RFE
Sbjct: 15 MVAVSSAVDMSIISYDEKHGVSTTGGRSEAEVMSIYEAWLVKHGKAQSQNSLVEKDRRFE 74
Query: 71 IFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKD 130
IFKDNLR +DE N K +Y LGL FADL ++E++ +LG K + +K + Y+
Sbjct: 75 IFKDNLRFVDEHNEKNLSYRLGLTRFADLTNDEYRSKYLGAKME---KKGERRTSLRYEA 131
Query: 131 VV--DLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELID 188
V +LP+S+DWRKKGAV VK+QG CGSCWAFST+ AVEGINQIVTG+L +LSEQEL+D
Sbjct: 132 RVGDELPESIDWRKKGAVAEVKDQGGCGSCWAFSTIGAVEGINQIVTGDLITLSEQELVD 191
Query: 189 CDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDV 248
CD +YN GCNGGLMDYAF++I+ GG+ ++DYPY +GTC+ + ++VVTI+ Y DV
Sbjct: 192 CDTSYNEGCNGGLMDYAFEFIIKNGGIDTDKDYPYKGVDGTCDQIRKNAKVVTIDSYEDV 251
Query: 249 PQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLD 308
P SE+SL KA+A+QP+S+AIEA GR FQ Y G++DG CGTQLDHGV AVGYG+ G D
Sbjct: 252 PTYSEESLKKAVAHQPISIAIEAGGRAFQLYDSGIFDGSCGTQLDHGVVAVGYGTENGKD 311
Query: 309 YIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
Y IV+NSWG WGE GY+RM RN G CGI SYPIK
Sbjct: 312 YWIVRNSWGKSWGESGYLRMARNIASSSGKCGIAIEPSYPIK 353
>gi|357437719|ref|XP_003589135.1| Cysteine proteinase [Medicago truncatula]
gi|355478183|gb|AES59386.1| Cysteine proteinase [Medicago truncatula]
Length = 457
Score = 369 bits (946), Expect = 2e-99, Method: Compositional matrix adjust.
Identities = 193/359 (53%), Positives = 243/359 (67%), Gaps = 13/359 (3%)
Query: 3 LSSQFKTILISFCISFFIRSSFARDFSIVGYS---PEDLTS---NDKLIDLFESWMSKFE 56
LS K +++ SF + S A D SI+ Y P+ TS N +++ ++E W+ K
Sbjct: 6 LSPAMKLMIVLIISSFTV--SLALDMSIISYDKTHPDKSTSKRTNKEVLTMYEEWLVKHG 63
Query: 57 KVYESLDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLA 116
K Y L EK +RFEIFKDNL+ IDE N Y LGL FADL +EE++ FLG K D
Sbjct: 64 KSYNGLGEKDKRFEIFKDNLKFIDEHNGLNSTYRLGLTRFADLTNEEYRSKFLGTKIDPN 123
Query: 117 RRKDQ---SHEDFSYKDVVD-LPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQ 172
RR + S + V D LP+SVDWRK+GAV VK+Q SCGSCWAFS +AAVEGIN+
Sbjct: 124 RRMKKLGGSKSNRYAPRVGDKLPESVDWRKEGAVVGVKDQASCGSCWAFSAIAAVEGINK 183
Query: 173 IVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEM 232
IVTG+L SLSEQEL+DCD +YN GCNGGLMDYAF++I+S GG+ E+DYPY +G C+
Sbjct: 184 IVTGDLISLSEQELVDCDTSYNEGCNGGLMDYAFEFIISNGGIDSEDDYPYKAVDGRCDQ 243
Query: 233 TKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQL 292
+ ++VVTI+ Y DVP E +L KA+ANQP++VA+E GR+FQ Y GV+ G CGT L
Sbjct: 244 NRKNAKVVTIDDYEDVPAYDELALQKAVANQPIAVAVEGGGREFQLYEYGVFTGRCGTAL 303
Query: 293 DHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPE-GLCGINKMASYPIK 350
DHGVAAVGYG+ G DY IV+NSWG WGE+GYIR++RN G CGI SYPIK
Sbjct: 304 DHGVAAVGYGTENGKDYWIVRNSWGGSWGEQGYIRLERNLASSRAGKCGIAIEPSYPIK 362
>gi|14517542|gb|AAK62661.1| F2G19.31/F2G19.31 [Arabidopsis thaliana]
gi|19548039|gb|AAL87383.1| F2G19.31/F2G19.31 [Arabidopsis thaliana]
Length = 462
Score = 369 bits (946), Expect = 2e-99, Method: Compositional matrix adjust.
Identities = 184/342 (53%), Positives = 234/342 (68%), Gaps = 13/342 (3%)
Query: 19 FIRSSFARDFSIVGY------SPEDLTSNDKLIDLFESWMSKFEKVYE--SLDEKLERFE 70
+ S A D SI+ Y S S +++ ++E+W+ K K SL EK RFE
Sbjct: 15 MVTVSSAVDMSIISYDEKHGVSTTGGRSEAEVMSIYEAWLVKHGKAQSQNSLVEKDRRFE 74
Query: 71 IFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKD 130
IFKDNLR +DE N K +Y LGL FADL ++E++ +LG K +K + Y+
Sbjct: 75 IFKDNLRFVDEHNEKNLSYRLGLTRFADLTNDEYRSKYLGAK---MEKKGERRTSLRYEA 131
Query: 131 VV--DLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELID 188
V +LP+S+DWRKKGAV VK+QG CGSCWAFST+ AVEGINQIVTG+L +LSEQEL+D
Sbjct: 132 RVGDELPESIDWRKKGAVAEVKDQGGCGSCWAFSTIGAVEGINQIVTGDLITLSEQELVD 191
Query: 189 CDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDV 248
CD +YN GCNGGLMDYAF++I+ GG+ ++DYPY +GTC+ + ++VVTI+ Y DV
Sbjct: 192 CDTSYNEGCNGGLMDYAFEFIIKNGGIDTDKDYPYKGVDGTCDQIRKNAKVVTIDSYEDV 251
Query: 249 PQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLD 308
P SE+SL KA+A+QP+S+AIEA GR FQ Y G++DG CGTQLDHGV AVGYG+ G D
Sbjct: 252 PTYSEESLKKAVAHQPISIAIEAGGRAFQLYDSGIFDGSCGTQLDHGVVAVGYGTENGKD 311
Query: 309 YIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
Y IV+NSWG WGE GY+RM RN G CGI SYPIK
Sbjct: 312 YWIVRNSWGKSWGESGYLRMARNIASSSGKCGIAIEPSYPIK 353
>gi|62320725|dbj|BAD95392.1| cysteine proteinase RD21A [Arabidopsis thaliana]
Length = 433
Score = 368 bits (945), Expect = 2e-99, Method: Compositional matrix adjust.
Identities = 184/342 (53%), Positives = 235/342 (68%), Gaps = 13/342 (3%)
Query: 19 FIRSSFARDFSIVGY------SPEDLTSNDKLIDLFESWMSKFEKVYE--SLDEKLERFE 70
+ S A D SI+ Y S S +++ ++E+W+ K K SL EK RFE
Sbjct: 15 MVAVSSAVDMSIISYDEKHGVSTTGGRSEAEVMSIYEAWLVKHGKAQSQNSLVEKDRRFE 74
Query: 71 IFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKD 130
IFKDNLR +DE N K +Y LGL FADL ++E++ +LG K + +K + Y+
Sbjct: 75 IFKDNLRFVDEHNEKNLSYRLGLTRFADLTNDEYRSKYLGAKME---KKGERRTSLRYEA 131
Query: 131 VV--DLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELID 188
V +LP+S+DWRKKGAV VK+QG CGSCWAFST+ AVEGINQIVTG+L +LSEQEL+D
Sbjct: 132 RVGDELPESIDWRKKGAVAEVKDQGGCGSCWAFSTIGAVEGINQIVTGDLITLSEQELVD 191
Query: 189 CDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDV 248
CD +YN GCNGGLMDYAF++I+ GG+ ++DYPY +GTC+ + ++VVTI+ Y DV
Sbjct: 192 CDTSYNEGCNGGLMDYAFEFIIKNGGIDTDKDYPYKGVDGTCDQIRKNAKVVTIDSYEDV 251
Query: 249 PQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLD 308
P SE+SL KA+A+QP+S+AIEA GR FQ Y G++DG CGTQLDHGV AVGYG+ G D
Sbjct: 252 PTYSEESLKKAVAHQPISIAIEAGGRAFQLYDSGIFDGSCGTQLDHGVVAVGYGTENGKD 311
Query: 309 YIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
Y IV+NSWG WGE GY+RM RN G CGI SYPIK
Sbjct: 312 YWIVRNSWGKSWGESGYLRMARNIASSSGKCGIAIEPSYPIK 353
>gi|302759380|ref|XP_002963113.1| hypothetical protein SELMODRAFT_270344 [Selaginella moellendorffii]
gi|300169974|gb|EFJ36576.1| hypothetical protein SELMODRAFT_270344 [Selaginella moellendorffii]
Length = 479
Score = 368 bits (945), Expect = 2e-99, Method: Compositional matrix adjust.
Identities = 187/336 (55%), Positives = 230/336 (68%), Gaps = 13/336 (3%)
Query: 27 DFSIV--GYSPEDLTSNDKLIDLFESWMSKFEKVY--------ESLDEKLERFEIFKDNL 76
DFSI+ GY P+DL+S ++L LF+SWM + K Y EK R+ IFKDNL
Sbjct: 34 DFSILDLGYDPQDLSSEERLQALFDSWMLQHGKSYAENALSGDSQAGEKATRYGIFKDNL 93
Query: 77 RHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVV--DL 134
R I N K + Y+LGLN FADL +EEF+ G + D +R + S+E+F Y V DL
Sbjct: 94 RFIHGENEKNQGYFLGLNAFADLTNEEFRAQRHGGRFDRSRER-TSYEEFRYGSVQLKDL 152
Query: 135 PKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYN 194
P S+DWR+KGAV VK+QGSCGSCWAFS VAA+EG+N++ TG L SLSEQEL+DCD +
Sbjct: 153 PDSIDWREKGAVVGVKDQGSCGSCWAFSAVAAIEGVNKLATGELVSLSEQELVDCDKGED 212
Query: 195 NGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSED 254
GCNGGLMDYAF +++ GGL E DYPY C+ +K ++VVTI+GY DVP N E
Sbjct: 213 EGCNGGLMDYAFGFVIKNGGLDTEADYPYKGYGTRCDRSKMNAKVVTIDGYEDVPVNDET 272
Query: 255 SLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKN 314
+LLKA+A+QP+SVAI+A G QFY G++ G CGT LDHGV VGYG G Y I+KN
Sbjct: 273 ALLKAVAHQPVSVAIDAGGSSMQFYRSGIFTGRCGTDLDHGVTNVGYGKEDGKAYWIIKN 332
Query: 315 SWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
SWG WGEKGYI+M RNTG GLCGIN ASYP K
Sbjct: 333 SWGSNWGEKGYIKMARNTGLAAGLCGINMEASYPTK 368
>gi|297852302|ref|XP_002894032.1| F2G19.31/F2G19.31 [Arabidopsis lyrata subsp. lyrata]
gi|297339874|gb|EFH70291.1| F2G19.31/F2G19.31 [Arabidopsis lyrata subsp. lyrata]
Length = 455
Score = 367 bits (943), Expect = 3e-99, Method: Compositional matrix adjust.
Identities = 185/342 (54%), Positives = 238/342 (69%), Gaps = 13/342 (3%)
Query: 19 FIRSSFARDFSIVGY------SPEDLTSNDKLIDLFESWMSKFEKVYE--SLDEKLERFE 70
+ + A D SI+ Y S S+ +++ ++E+W+ K K SL EK RFE
Sbjct: 8 MVAVASAVDMSIISYDEKHGVSTTGGRSDAEVMSIYEAWLVKHGKAQNQNSLVEKDRRFE 67
Query: 71 IFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLA--RRKDQSHEDFSY 128
IFKDNLR ID+ N+K +Y LGL FADL ++E++ +LG K + RR Q +E
Sbjct: 68 IFKDNLRFIDDHNKKNLSYRLGLTRFADLTNDEYRSKYLGAKMEKKGERRTSQRYE---A 124
Query: 129 KDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELID 188
+ +LP+S+DWRKKGAV VK+QGSCGSCWAFST+ AVEGINQIVTG+L +LSEQEL+D
Sbjct: 125 RVGDELPESIDWRKKGAVAEVKDQGSCGSCWAFSTIGAVEGINQIVTGDLITLSEQELVD 184
Query: 189 CDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDV 248
CD +YN GCNGGLMDYAF++I+ GG+ ++DYPY +GTC+ + ++VVTI+ Y DV
Sbjct: 185 CDTSYNEGCNGGLMDYAFEFIIKNGGIDTDKDYPYKGVDGTCDQIRKNAKVVTIDSYEDV 244
Query: 249 PQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLD 308
P SE+SL KA+A+QP+SVAIEA GR FQ Y G++DG CGTQLDHGV AVGYG+ G D
Sbjct: 245 PTYSEESLKKAVAHQPVSVAIEAGGRAFQLYDSGIFDGTCGTQLDHGVVAVGYGTENGKD 304
Query: 309 YIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
Y IV+NSWG WGE GY++M RN G CGI SYPIK
Sbjct: 305 YWIVRNSWGKSWGESGYLKMARNIASSSGKCGIAIEPSYPIK 346
>gi|356559055|ref|XP_003547817.1| PREDICTED: cysteine proteinase RD21a [Glycine max]
Length = 366
Score = 367 bits (943), Expect = 4e-99, Method: Compositional matrix adjust.
Identities = 180/345 (52%), Positives = 233/345 (67%), Gaps = 7/345 (2%)
Query: 11 LISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFE 70
+I+ S + SF +I + + T N+ ++ ++E W+ K +KVY L EK +RF+
Sbjct: 4 IITLVTSTLLFLSFTLSCAIDTSTITNYTDNE-VMTMYEEWLVKHQKVYNGLREKDKRFQ 62
Query: 71 IFKDNLRHIDETNRKIKN-YWLGLNEFADLRHEEFKEMFLGLKPDLARR----KDQSHED 125
+FKDNL I E N N Y LGLN+FAD+ +EE++ M+ G K D RR K H
Sbjct: 63 VFKDNLGFIQEHNNNQNNTYKLGLNQFADMTNEEYRVMYFGTKSDAKRRLMKTKSTGHR- 121
Query: 126 FSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQE 185
++Y LP VDWR KGAV +K+QGSCGSCWAFSTVA VE IN+IVTG SLSEQE
Sbjct: 122 YAYSAGDRLPVHVDWRVKGAVAPIKDQGSCGSCWAFSTVATVEAINKIVTGKFVSLSEQE 181
Query: 186 LIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGY 245
L+DCD YN GCNGGLMDYAF++I+ GG+ ++DYPY +G C+ TK ++VV I+G+
Sbjct: 182 LVDCDRAYNEGCNGGLMDYAFEFIIQNGGIDTDKDYPYRGFDGICDPTKKNAKVVNIDGF 241
Query: 246 HDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTR 305
DVP E++L KA+A+QP+S+AIEASGRD Q Y GV+ G CGT LDHGV VGYGS
Sbjct: 242 EDVPPYDENALKKAVAHQPVSIAIEASGRDLQLYQSGVFTGKCGTSLDHGVVVVGYGSEN 301
Query: 306 GLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
G+DY +V+NSWG WGE GY +M+RN P G CGI ASYP+K
Sbjct: 302 GVDYWLVRNSWGTGWGEDGYFKMQRNVRTPTGKCGITMEASYPVK 346
>gi|449438381|ref|XP_004136967.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
Length = 479
Score = 367 bits (942), Expect = 5e-99, Method: Compositional matrix adjust.
Identities = 182/315 (57%), Positives = 221/315 (70%), Gaps = 8/315 (2%)
Query: 41 NDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLR 100
++++ L+ESW+ K Y ++ EK RFEIFKDNLR IDE NR+ + Y +GL FADL
Sbjct: 55 DEEVAALYESWLVHHGKAYNAIGEKERRFEIFKDNLRFIDEHNRESRTYKVGLTRFADLT 114
Query: 101 HEEFKEMFLG----LKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCG 156
+EE++ FLG KP L+ K + + D DLP VDWRKKGAV VK+QG CG
Sbjct: 115 NEEYRARFLGGRFSRKPRLSAAKSGRYA-AALGD--DLPDDVDWRKKGAVATVKDQGQCG 171
Query: 157 SCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLH 216
SCWAFS+VAAVEGINQIVTG L LSEQEL+DCD ++N GCNGGLMDYAFQ+I+ GG+
Sbjct: 172 SCWAFSSVAAVEGINQIVTGELIPLSEQELVDCDKSFNMGCNGGLMDYAFQFIIGNGGID 231
Query: 217 KEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDF 276
EEDYPY + C+ + ++VVTI+GY DVP+N E SL KA+ANQP+SVAIEA GR F
Sbjct: 232 TEEDYPYKGRDAACDPNRKNAKVVTIDGYEDVPENDESSLKKAVANQPVSVAIEAGGRAF 291
Query: 277 QFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGK-P 335
Q Y GV+ G CGT LDHGV AVGYG+ G DY IV+NSWG WGE GYIR++RN
Sbjct: 292 QLYQSGVFTGRCGTDLDHGVVAVGYGTDNGTDYWIVRNSWGKDWGESGYIRLERNVANIT 351
Query: 336 EGLCGINKMASYPIK 350
G CGI SYP K
Sbjct: 352 TGKCGIAVQPSYPTK 366
>gi|50355617|dbj|BAD29957.1| cysteine protease [Daucus carota]
Length = 437
Score = 367 bits (942), Expect = 5e-99, Method: Compositional matrix adjust.
Identities = 180/335 (53%), Positives = 226/335 (67%), Gaps = 7/335 (2%)
Query: 23 SFARDFSIVGYSPEDLTS----NDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRH 78
+ A D SI+ Y S +D+++ ++ SW+ K K Y +L EK RF+IFKDNLR+
Sbjct: 20 ALASDMSIINYDQTHTNSLIRTDDEVMTMYNSWLVKHGKSYNALGEKETRFQIFKDNLRY 79
Query: 79 IDETNRKI-KNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVV--DLP 135
ID N ++Y LGLN FADL +EE++ +LG K +R K Y V +LP
Sbjct: 80 IDNHNADPDRSYELGLNRFADLTNEEYRAKYLGTKSRESRPKLSKGPSDRYAPVEGEELP 139
Query: 136 KSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNN 195
S+DWR+KGAV VK+QGSCGSCWAFS + AVEGINQI TG L +LSEQEL+DCD +YN
Sbjct: 140 DSIDWREKGAVAAVKDQGSCGSCWAFSAIGAVEGINQITTGELITLSEQELVDCDRSYNE 199
Query: 196 GCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDS 255
GC GGLMDYAF +I+ GG+ + DYPY +GTC K ++VVTI+ Y DVP E +
Sbjct: 200 GCEGGLMDYAFNFIIKNGGIDSDLDYPYTGRDGTCNQNKENAKVVTIDSYEDVPVYDEKA 259
Query: 256 LLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNS 315
L KA ANQP+SVAIEA G DFQ Y G++ G CGT +DHGV VGYGS G+DY IV+NS
Sbjct: 260 LQKAAANQPISVAIEAGGMDFQLYVSGIFTGKCGTAVDHGVVVVGYGSEEGMDYWIVRNS 319
Query: 316 WGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
WG WGE GY++M+RN GK GLCGI SYP+K
Sbjct: 320 WGAAWGEAGYLKMQRNVGKSSGLCGITIEPSYPVK 354
>gi|414585111|tpg|DAA35682.1| TPA: cysteine proteinase Mir3 [Zea mays]
Length = 468
Score = 367 bits (941), Expect = 5e-99, Method: Compositional matrix adjust.
Identities = 175/329 (53%), Positives = 230/329 (69%), Gaps = 8/329 (2%)
Query: 27 DFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRK- 85
D SIV Y S+++ ++ WM+ + Y ++ E+ R+++F+DNLR+ID N
Sbjct: 28 DMSIVSYGER---SDEEARRMYAEWMAAHGRTYNAVGEEERRYQVFRDNLRYIDAHNAAA 84
Query: 86 ---IKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRK 142
+ ++ LGLN FADL ++E++ +LG + R + + D DLP+SVDWR
Sbjct: 85 DAGVHSFRLGLNRFADLTNDEYRATYLGARTRPQRERKLGAR-YHAADNEDLPESVDWRA 143
Query: 143 KGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLM 202
KGAV VK+QGSCGSCWAFST+AAVEGINQIVTG+L SLSEQEL+DCD +YN GCNGGLM
Sbjct: 144 KGAVAEVKDQGSCGSCWAFSTIAAVEGINQIVTGDLISLSEQELVDCDTSYNQGCNGGLM 203
Query: 203 DYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN 262
DYAF++I++ GG+ E+DYPY +G C++ + ++VVTI+ Y DVP N E SL KA+AN
Sbjct: 204 DYAFEFIINNGGIDTEKDYPYKGTDGRCDVNRKNAKVVTIDSYEDVPANDEKSLQKAVAN 263
Query: 263 QPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGE 322
QP+SVAIEA+G FQ YS G++ G CGT LDHGV AVGYG+ G DY IVKNSWG WGE
Sbjct: 264 QPVSVAIEAAGTAFQLYSSGIFTGSCGTALDHGVTAVGYGTENGKDYWIVKNSWGSSWGE 323
Query: 323 KGYIRMKRNTGKPEGLCGINKMASYPIKK 351
GY+RM+RN G CGI SYP+K+
Sbjct: 324 SGYVRMERNIKASSGKCGIAVEPSYPLKE 352
>gi|449448298|ref|XP_004141903.1| PREDICTED: germination-specific cysteine protease 1-like [Cucumis
sativus]
gi|449531757|ref|XP_004172852.1| PREDICTED: germination-specific cysteine protease 1-like [Cucumis
sativus]
Length = 365
Score = 366 bits (940), Expect = 9e-99, Method: Compositional matrix adjust.
Identities = 177/339 (52%), Positives = 236/339 (69%), Gaps = 12/339 (3%)
Query: 16 ISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDN 75
+SFF S A S S+ ++ ++++ W++K K Y +DE+ +RF+IFK+N
Sbjct: 11 LSFFFLSISASALS--------RRSDGEVREIYDLWLAKHGKAYNGIDEREKRFQIFKEN 62
Query: 76 LRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARR---KDQSHEDFSYKDVV 132
L+ ID+ N + + Y +GLN FADL +EE++ ++LG + ARR + ++ ++
Sbjct: 63 LKFIDDHNSENRTYKVGLNMFADLTNEEYRALYLGTRSPPARRVMKAKTASRRYAVNNLD 122
Query: 133 DLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNT 192
LP+S+DWR +GAV VKNQGSCGSCWAFST+AAVEGINQIVTG L SLSEQEL+ CD
Sbjct: 123 RLPESMDWRTRGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGELISLSEQELVSCDKK 182
Query: 193 YNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNS 252
YN+GCNGGLMDYAFQ+I+ GGL EEDYPY +G C+ T+ ++VV+I+ Y DVP N
Sbjct: 183 YNSGCNGGLMDYAFQFIIDNGGLDTEEDYPYEAFDGQCDPTRKNAKVVSIDAYEDVPAND 242
Query: 253 EDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIV 312
E+SL KA+A+QP+SVAIEASG Q Y GV+ G CG+ LDHGV AVGYG G+DY +V
Sbjct: 243 EESLKKAVAHQPVSVAIEASGLALQLYQSGVFTGKCGSALDHGVVAVGYGKENGVDYWLV 302
Query: 313 KNSWGPKWGEKGYIRMKRNTGK-PEGLCGINKMASYPIK 350
+NSWG WGE GY +++RN EG CGI ASYP+K
Sbjct: 303 RNSWGTSWGEDGYFKLERNVKHITEGKCGIAMQASYPVK 341
>gi|358348957|ref|XP_003638507.1| Cysteine proteinase [Medicago truncatula]
gi|355504442|gb|AES85645.1| Cysteine proteinase [Medicago truncatula]
Length = 362
Score = 366 bits (939), Expect = 9e-99, Method: Compositional matrix adjust.
Identities = 186/347 (53%), Positives = 235/347 (67%), Gaps = 10/347 (2%)
Query: 8 KTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLE 67
K +LI I+ + S + DF +D++S++ L DL+E W S V +L+EK +
Sbjct: 5 KLLLIVLSIALVLVVSESFDFH-----DKDVSSDESLWDLYERWRS-HHTVSRNLNEKQK 58
Query: 68 RFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPD---LARRKDQSHE 124
RF +FK N+ H+ TN+ K Y L LN+FAD+ + EFK + G K + + R +
Sbjct: 59 RFNVFKSNVMHVHNTNKMDKPYKLKLNKFADMTNHEFKTTYAGSKVNHHRMFRGTPRVSG 118
Query: 125 DFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQ 184
F Y++ P SVDWRKKGAVT VK+QG CGSCWAFSTV AVEGINQI T L LSEQ
Sbjct: 119 TFMYENFTKAPASVDWRKKGAVTDVKDQGQCGSCWAFSTVVAVEGINQIKTNRLVPLSEQ 178
Query: 185 ELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTING 244
ELIDCDN N GCNGGLM+YAF+YI GG+ E YPY +G+C+ TK V+I+G
Sbjct: 179 ELIDCDNQENQGCNGGLMEYAFEYIKQKGGITTESYYPYTANDGSCDATKENVPAVSIDG 238
Query: 245 YHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGST 304
+ VP N ED+LLKA+ANQP+SVAI+A G DFQFYS GV+ G CG +L+HGVA VGYG+T
Sbjct: 239 HETVPANDEDALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGDCGKELNHGVAIVGYGTT 298
Query: 305 -RGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
G +Y IV+NSWG +WGE+GYIRMKRN EGLCGI ASYP+K
Sbjct: 299 VDGTNYWIVRNSWGAEWGEQGYIRMKRNVSNKEGLCGIAMEASYPVK 345
>gi|350538043|ref|NP_001234324.1| cysteine protease TDI-65 precursor [Solanum lycopersicum]
gi|5726641|gb|AAD48496.1|AF172856_1 cysteine protease TDI-65 [Solanum lycopersicum]
gi|2828252|emb|CAA05894.1| CYP1 [Solanum lycopersicum]
Length = 466
Score = 366 bits (939), Expect = 1e-98, Method: Compositional matrix adjust.
Identities = 181/355 (50%), Positives = 243/355 (68%), Gaps = 6/355 (1%)
Query: 1 MALSSQFKTILISFCISFFIRSSFARDFSIVGYSPEDL--TSNDKLIDLFESWMSKFEKV 58
MA S TI I + F SS A D SI+ Y + ++D++ L+ESW+ + K
Sbjct: 1 MAAHSSTLTISILLMLIFSTLSS-ASDMSIISYDETHIHRRTDDEVSALYESWLIEHGKS 59
Query: 59 YESLDEKLERFEIFKDNLRHIDETNR-KIKNYWLGLNEFADLRHEEFKEMFLGLKPDLAR 117
Y +L EK +RF+IFKDNLR+IDE N ++Y LGL +FADL +EE++ ++LG K R
Sbjct: 60 YNALGEKDKRFQIFKDNLRYIDEQNSVPNQSYKLGLTKFADLTNEEYRSIYLGTKSSGDR 119
Query: 118 RKDQSHEDFSYKDVV--DLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVT 175
+K ++ Y V LP+S+DWR+KG + VK+QGSCGSCWAFS VAA+E IN IVT
Sbjct: 120 KKLSKNKSDRYLPKVGDSLPESIDWREKGVLVGVKDQGSCGSCWAFSAVAAMESINAIVT 179
Query: 176 GNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKG 235
GNL SLSEQEL+DCD +YN GC+GGLMDYAF++++ GG+ EEDYPY G C+ +
Sbjct: 180 GNLISLSEQELVDCDRSYNEGCDGGLMDYAFEFVIKNGGIDTEEDYPYKERNGVCDQYRK 239
Query: 236 ESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHG 295
++VV I+ Y DVP N+E +L KA+A+QP+S+A+EA GRDFQ Y G++ G CGT +DHG
Sbjct: 240 NAKVVKIDSYEDVPVNNEKALQKAVAHQPVSIALEAGGRDFQHYKSGIFTGKCGTAVDHG 299
Query: 296 VAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
V GYG+ G+DY IV+NSWG WGE GY+R++RN GLCG+ SYP+K
Sbjct: 300 VVIAGYGTENGMDYWIVRNSWGANWGENGYLRVQRNVASSSGLCGLAIEPSYPVK 354
>gi|226495425|ref|NP_001148706.1| cysteine protease 1 precursor [Zea mays]
gi|195621544|gb|ACG32602.1| cysteine protease 1 precursor [Zea mays]
Length = 463
Score = 366 bits (939), Expect = 1e-98, Method: Compositional matrix adjust.
Identities = 175/329 (53%), Positives = 229/329 (69%), Gaps = 8/329 (2%)
Query: 27 DFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRK- 85
D SIV Y S ++ ++ WM+ + Y ++ E+ R+++F+DNLR+ID N
Sbjct: 23 DMSIVSYGER---SXEEARRMYAEWMAAHGRTYNAVGEEERRYQVFRDNLRYIDAHNAAA 79
Query: 86 ---IKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRK 142
+ ++ LGLN FADL ++E++ +LG + R + + D DLP+SVDWR
Sbjct: 80 DAGVHSFRLGLNRFADLTNDEYRATYLGARTRPQRERKLGAR-YHAADNEDLPESVDWRA 138
Query: 143 KGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLM 202
KGAV VK+QGSCGSCWAFST+AAVEGINQIVTG+L SLSEQEL+DCD +YN GCNGGLM
Sbjct: 139 KGAVAEVKDQGSCGSCWAFSTIAAVEGINQIVTGDLISLSEQELVDCDTSYNQGCNGGLM 198
Query: 203 DYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN 262
DYAF++I++ GG+ E+DYPY +G C++ + ++VVTI+ Y DVP N E SL KA+AN
Sbjct: 199 DYAFEFIINNGGIDTEKDYPYKGTDGRCDVNRKNAKVVTIDSYEDVPANDEKSLQKAVAN 258
Query: 263 QPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGE 322
QP+SVAIEA+G FQ YS G++ G CGT LDHGV AVGYG+ G DY IVKNSWG WGE
Sbjct: 259 QPVSVAIEAAGTAFQLYSSGIFTGSCGTALDHGVTAVGYGTENGKDYWIVKNSWGSSWGE 318
Query: 323 KGYIRMKRNTGKPEGLCGINKMASYPIKK 351
GY+RM+RN G CGI SYP+K+
Sbjct: 319 SGYVRMERNIKASSGKCGIAVEPSYPLKE 347
>gi|220983358|dbj|BAH11164.1| cysteine protease [Hordeum vulgare]
Length = 462
Score = 365 bits (938), Expect = 1e-98, Method: Compositional matrix adjust.
Identities = 180/332 (54%), Positives = 229/332 (68%), Gaps = 12/332 (3%)
Query: 25 ARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNR 84
A D SIV Y S +++ ++ WM++ Y + E+ RFE F++NLR+ID+ N
Sbjct: 22 AADMSIVFYGER---SEEEVRRMYAEWMAEHHSTYNPIGEEERRFEAFRNNLRYIDQHNA 78
Query: 85 K----IKNYWLGLNEFADLRHEEFKEMFLG--LKPDLARRKDQSHEDFSYKDVVDLPKSV 138
+ ++ LGLN FADL +EE++ +LG KPD R+ ++ D +LP+SV
Sbjct: 79 AADAGVHSFRLGLNRFADLTNEEYRSTYLGARTKPDRERKLSARYQ---AADNDELPESV 135
Query: 139 DWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCN 198
DWRKKGAV VK+QG CGSCWAFS +AAVEGINQIVTG++ LSEQEL+DCD +YN GCN
Sbjct: 136 DWRKKGAVGAVKDQGGCGSCWAFSAIAAVEGINQIVTGDMIPLSEQELVDCDTSYNQGCN 195
Query: 199 GGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLK 258
GGLMDYAF++I++ GG+ EEDYPY + C+ K ++VVTI+GY DVP NSE SL K
Sbjct: 196 GGLMDYAFEFIINNGGIDSEEDYPYKERDNRCDANKKNAKVVTIDGYEDVPVNSEKSLQK 255
Query: 259 ALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGP 318
A+ANQP+SVAIEA GR FQ Y G++ G CGT LDHGVAAVGYG+ G DY +V+NSWG
Sbjct: 256 AVANQPISVAIEAGGRAFQLYKSGIFTGTCGTALDHGVAAVGYGTENGKDYWLVRNSWGS 315
Query: 319 KWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
WGE GYIRM+RN G CGI SYP K
Sbjct: 316 VWGENGYIRMERNIKASSGKCGIAVEPSYPTK 347
>gi|297792329|ref|XP_002864049.1| hypothetical protein ARALYDRAFT_495086 [Arabidopsis lyrata subsp.
lyrata]
gi|297309884|gb|EFH40308.1| hypothetical protein ARALYDRAFT_495086 [Arabidopsis lyrata subsp.
lyrata]
Length = 361
Score = 365 bits (938), Expect = 1e-98, Method: Compositional matrix adjust.
Identities = 185/345 (53%), Positives = 235/345 (68%), Gaps = 10/345 (2%)
Query: 10 ILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERF 69
I+++ C+ + ++ + DF +D+ S D L +L+E W S + SL+EK +RF
Sbjct: 5 IVLALCMLMVLETTKSLDFH-----EKDVESEDSLWELYERWKS-HHTIARSLEEKAKRF 58
Query: 70 EIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARR---KDQSHEDF 126
+FK N++HI ETN+K +Y L LN+F D+ EEF+ + G R + Q+ + F
Sbjct: 59 NVFKHNVKHIHETNKKENSYKLKLNKFGDMTSEEFRRTYAGSNIKHHRMFQGERQTTKSF 118
Query: 127 SYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQEL 186
Y +V LP SVDWRK GAVT VKNQG CGSCWAFSTV AVEGINQI T L SLSEQEL
Sbjct: 119 MYANVDTLPTSVDWRKNGAVTPVKNQGQCGSCWAFSTVVAVEGINQIRTKKLTSLSEQEL 178
Query: 187 IDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYH 246
+DCD N GCNGGLMD AF++I GGL E YPY + TC+ K + VV+I+G+
Sbjct: 179 VDCDTNKNQGCNGGLMDLAFEFIKEKGGLTSELVYPYKASDETCDTNKENAPVVSIDGHE 238
Query: 247 DVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGST-R 305
DVP+NSE L+KA+A+QP+SVAI+A G DFQFYS GV+ G CGT+L+HGVA VGYG+T
Sbjct: 239 DVPKNSEVDLMKAVAHQPVSVAIDAGGSDFQFYSEGVFTGRCGTELNHGVAVVGYGTTID 298
Query: 306 GLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
G Y IVKNSWG +WGEKGYIRM+R EGLCGI ASYP+K
Sbjct: 299 GTKYWIVKNSWGEEWGEKGYIRMQRGIRHKEGLCGIAMEASYPLK 343
>gi|356563584|ref|XP_003550041.1| PREDICTED: cysteine proteinase RD21a-like [Glycine max]
Length = 366
Score = 365 bits (938), Expect = 1e-98, Method: Compositional matrix adjust.
Identities = 179/345 (51%), Positives = 231/345 (66%), Gaps = 7/345 (2%)
Query: 11 LISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFE 70
+++ IS + SF +I + + T N+ ++ ++E W+ K +KVY L EK +RF+
Sbjct: 4 IMTLMISTLLFLSFTLSCAIDTSTITNYTDNE-VMTMYEEWLVKHQKVYNGLGEKDKRFQ 62
Query: 71 IFKDNLRHIDETNRKIKN-YWLGLNEFADLRHEEFKEMFLGLKPDLARR----KDQSHED 125
+FKDNL I E N N Y LGLN+FAD+ +EE++ M+ G K D RR K H
Sbjct: 63 VFKDNLGFIQEHNNNQNNTYKLGLNKFADMTNEEYRVMYFGTKSDAKRRLMKTKSTGHR- 121
Query: 126 FSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQE 185
++Y LP VDWR KGAV +K+QGSCGSCWAFSTVA VE IN+IVTG SLSEQE
Sbjct: 122 YAYSAGDQLPVHVDWRVKGAVAPIKDQGSCGSCWAFSTVATVEAINKIVTGKFVSLSEQE 181
Query: 186 LIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGY 245
L+DCD YN GCNGGLMDYAF++I+ GG+ ++DYPY +G C+ TK ++ V I+GY
Sbjct: 182 LVDCDRAYNQGCNGGLMDYAFEFIIQNGGIDTDKDYPYRGFDGICDPTKKNAKAVNIDGY 241
Query: 246 HDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTR 305
DVP E++L KA+A QP+S+AIEASGR Q Y GV+ G CGT LDHGV VGYGS
Sbjct: 242 EDVPPYDENALKKAVARQPVSIAIEASGRALQLYQSGVFTGECGTSLDHGVVVVGYGSEN 301
Query: 306 GLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
G+DY +V+NSWG WGE GY +M+RN P G CGI ASYP+K
Sbjct: 302 GVDYWLVRNSWGTGWGEDGYFKMQRNVRTPTGKCGITMEASYPVK 346
>gi|182375363|gb|ACB87490.1| mucunain [Mucuna pruriens]
Length = 422
Score = 365 bits (937), Expect = 2e-98, Method: Compositional matrix adjust.
Identities = 178/311 (57%), Positives = 228/311 (73%), Gaps = 6/311 (1%)
Query: 45 IDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEF 104
+ L+E W+ K K Y +L EK +RF+IFKDNLR ID+ N + Y LGLN FADL +EE+
Sbjct: 1 MSLYEQWLVKHGKAYNALGEKDKRFDIFKDNLRFIDDHNADNRTYKLGLNRFADLTNEEY 60
Query: 105 KEMFLGLKPDLARR----KDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWA 160
+ +LG + D RR K QS+ ++ + +LP+SVDWR + AV VK+QG+CGSCWA
Sbjct: 61 RARYLGTRIDPNRRFVKTKTQSNR-YAPRVGDNLPESVDWRNESAVLPVKDQGNCGSCWA 119
Query: 161 FSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEED 220
FST+ AVEGIN+IVTG+L SLSEQEL+DCD +YN GCNGGLMDYA+++I++ GG+ EED
Sbjct: 120 FSTIGAVEGINKIVTGDLISLSEQELVDCDTSYNQGCNGGLMDYAYEFIINNGGIDSEED 179
Query: 221 YPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYS 280
YPY +GTC+ + ++VVTI+ Y DVP N E +L KA+ANQP+SVAIE GR+FQ Y
Sbjct: 180 YPYRAVDGTCDQYRKNAKVVTIDSYEDVPANDELALKKAVANQPVSVAIEGGGREFQLYV 239
Query: 281 GGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPE-GLC 339
GV+ G CGT LDHGV AVGYGS +G DY IV+NSWG WGE+GY+R++RN K G C
Sbjct: 240 SGVFTGRCGTALDHGVVAVGYGSVKGHDYWIVRNSWGASWGEEGYVRLERNLAKSRSGKC 299
Query: 340 GINKMASYPIK 350
GI SYPIK
Sbjct: 300 GIAIEPSYPIK 310
>gi|356553978|ref|XP_003545327.1| PREDICTED: cysteine proteinase RD21a-like [Glycine max]
Length = 496
Score = 365 bits (937), Expect = 2e-98, Method: Compositional matrix adjust.
Identities = 189/347 (54%), Positives = 241/347 (69%), Gaps = 9/347 (2%)
Query: 9 TILISFCISFFIRSSFARDFSIVGYSPEDLT---SNDKLIDLFESWMSKFEKVYESLDEK 65
TIL+ F + F SS A D SI+ Y S+++L+ ++E W+ K KVY +L EK
Sbjct: 40 TILLLFTV--FAVSS-ALDMSIISYDNAHAATSRSDEELMSMYEQWLVKHGKVYNALGEK 96
Query: 66 LERFEIFKDNLRHIDETN-RKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHE 124
+RF+IFKDNLR ID+ N ++ + Y LGLN FADL +EE++ +LG K D RR ++
Sbjct: 97 EKRFQIFKDNLRFIDDHNSQEDRTYKLGLNRFADLTNEEYRAKYLGTKIDPNRRLGKTPS 156
Query: 125 DFSYKDVVD-LPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSE 183
+ V D LP+SVDWRK+GAV VK+QG CGSCWAFS + AVEGIN+IVTG L SLSE
Sbjct: 157 NRYAPRVGDKLPESVDWRKEGAVPPVKDQGGCGSCWAFSAIGAVEGINKIVTGELISLSE 216
Query: 184 QELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTIN 243
QEL+DCD YN GCNGGLMDYAF++I++ GG+ EEDYPY +G C+ + ++VV+I+
Sbjct: 217 QELVDCDTGYNEGCNGGLMDYAFEFIINNGGIDSEEDYPYRGVDGRCDTYRKNAKVVSID 276
Query: 244 GYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGS 303
Y DVP E +L KA+ANQP+SVAIE GR+FQ Y GV+ G CGT LDHGV AVGYG+
Sbjct: 277 DYEDVPAYDELALKKAVANQPVSVAIEGGGREFQLYVSGVFTGRCGTALDHGVVAVGYGT 336
Query: 304 TRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPE-GLCGINKMASYPI 349
G DY IV+NSWGP WGE GYIR++RN G CGI SYP+
Sbjct: 337 ANGHDYWIVRNSWGPSWGEDGYIRLERNLANSRSGKCGIAIEPSYPL 383
>gi|1709576|sp|P05994.3|PAPA4_CARPA RecName: Full=Papaya proteinase 4; AltName: Full=Glycyl
endopeptidase; AltName: Full=Papaya peptidase B;
AltName: Full=Papaya proteinase IV; Short=PPIV; Flags:
Precursor
gi|953176|emb|CAA54974.1| proteinase IV [Carica papaya]
Length = 348
Score = 365 bits (937), Expect = 2e-98, Method: Compositional matrix adjust.
Identities = 188/351 (53%), Positives = 247/351 (70%), Gaps = 5/351 (1%)
Query: 1 MALSSQF-KTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVY 59
MA+ F K + ++ C+ + S+ DFSIVGYS +DLTS ++LI LF SWM K K Y
Sbjct: 1 MAIICSFSKLLFVAICLFGHMSLSYC-DFSIVGYSQDDLTSTERLIQLFNSWMLKHNKNY 59
Query: 60 ESLDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRK 119
+++DEKL RFEIFKDNL++IDE N+ I YWLGLNEF+DL ++EFKE ++G P+ +
Sbjct: 60 KNVDEKLYRFEIFKDNLKYIDERNKMINGYWLGLNEFSDLSNDEFKEKYVGSLPEDYTNQ 119
Query: 120 DQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLA 179
E+F +D+VDLP+SVDWR KGAVT VK+QG C SCWAFSTVA VEGIN+I TGNL
Sbjct: 120 PYD-EEFVNEDIVDLPESVDWRAKGAVTPVKHQGYCESCWAFSTVATVEGINKIKTGNLV 178
Query: 180 SLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEV 239
LSEQEL+DCD + GCN G + QY V+ G+H YPYI ++ TC +
Sbjct: 179 ELSEQELVDCDKQ-SYGCNRGYQSTSLQY-VAQNGIHLRAKYPYIAKQQTCRANQVGGPK 236
Query: 240 VTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAV 299
V NG V N+E SLL A+A+QP+SV +E++GRDFQ Y GG+++G CGT++DH V AV
Sbjct: 237 VKTNGVGRVQSNNEGSLLNAIAHQPVSVVVESAGRDFQNYKGGIFEGSCGTKVDHAVTAV 296
Query: 300 GYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
GYG + G YI++KNSWGP WGE GYIR++R +G G+CG+ + + YPIK
Sbjct: 297 GYGKSGGKGYILIKNSWGPGWGENGYIRIRRASGNSPGVCGVYRSSYYPIK 347
>gi|50355623|dbj|BAD29960.1| cysteine protease [Daucus carota]
Length = 460
Score = 365 bits (937), Expect = 2e-98, Method: Compositional matrix adjust.
Identities = 177/331 (53%), Positives = 232/331 (70%), Gaps = 6/331 (1%)
Query: 25 ARDFSIVGYSPEDL--TSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDET 82
A D SI+ Y +++D ++ +ESW+ K K Y +L EK +RF+IFKDN +IDE
Sbjct: 19 AADMSIITYDQTHAVGSTDDVIMAAYESWLVKHGKSYNALGEKEQRFQIFKDNFLYIDEQ 78
Query: 83 NR-KIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVV--DLPKSVD 139
N K +++ LGLN FADL +EE++ + G++ +R+K S + Y + LP+SVD
Sbjct: 79 NAAKDRSFKLGLNRFADLTNEEYRSKYTGIRTKDSRKK-VSGKSQRYASLAGESLPESVD 137
Query: 140 WRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNG 199
WR+ GAV VK+QG CGSCWAFST++AVEGINQI TG L +LSEQEL+DCD +YN GCNG
Sbjct: 138 WREHGAVASVKDQGQCGSCWAFSTISAVEGINQIATGKLITLSEQELVDCDRSYNEGCNG 197
Query: 200 GLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKA 259
GLMD AFQ+I++ GG+ + DYPY +G C+ + ++VVTI+ Y DVP+ E +L KA
Sbjct: 198 GLMDDAFQFIINNGGIDSDADYPYTGRDGQCDQYRKNAKVVTIDSYEDVPEYDEKALQKA 257
Query: 260 LANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPK 319
ANQP+SVAIEASGRDFQFY G++ G CGT LDHGV VGYG+ G DY IV+NSWG
Sbjct: 258 AANQPISVAIEASGRDFQFYDSGIFTGKCGTDLDHGVVVVGYGTENGKDYWIVRNSWGAD 317
Query: 320 WGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
WGEKGY+RM+R G+CGI SYP+K
Sbjct: 318 WGEKGYLRMERGISSKAGICGITSEPSYPVK 348
>gi|224133760|ref|XP_002321654.1| predicted protein [Populus trichocarpa]
gi|222868650|gb|EEF05781.1| predicted protein [Populus trichocarpa]
Length = 362
Score = 365 bits (937), Expect = 2e-98, Method: Compositional matrix adjust.
Identities = 186/340 (54%), Positives = 231/340 (67%), Gaps = 8/340 (2%)
Query: 18 FFIRSSFARDFSI---VGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKD 74
F+ S A I + + +DL S + L DL+E W S V SLDEK +RF +FK+
Sbjct: 7 LFVALSLALVLGITESLDFHEKDLESEESLWDLYERWRS-HHTVSTSLDEKHKRFNVFKE 65
Query: 75 NLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPD---LARRKDQSHEDFSYKDV 131
N+ H+ +TN+ K Y L LN+FAD+ + EF+ ++ G K + R + + F Y V
Sbjct: 66 NVMHVHKTNKMGKPYKLKLNKFADMTNHEFRSVYAGSKVKHHRMFRGTTRGNGSFMYGKV 125
Query: 132 VDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDN 191
+P SVDWRKKGAVT VK+QG CGSCWAFST+ AVEGIN I T L SLSEQEL+DCD
Sbjct: 126 EKVPTSVDWRKKGAVTAVKDQGQCGSCWAFSTIVAVEGINYIKTNELVSLSEQELVDCDT 185
Query: 192 TYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQN 251
T N GCNGGLM+YAF++I G+ E YPY E+G C+ K + V+I+GY VP+N
Sbjct: 186 TENQGCNGGLMEYAFEFIKKKRGITTESTYPYKAEDGHCDAAKENNPAVSIDGYEKVPEN 245
Query: 252 SEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGST-RGLDYI 310
ED+LLKA ANQP+SVAI+A G DFQFYS GV+ G CGT+LDHGVA VGYG+T G Y
Sbjct: 246 DEDALLKAAANQPVSVAIDAGGSDFQFYSEGVFIGECGTELDHGVAVVGYGTTLDGTKYW 305
Query: 311 IVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
IV+NSWGP+WGEKGYIRM+R EGLCGI ASYPIK
Sbjct: 306 IVRNSWGPEWGEKGYIRMQRGISDKEGLCGIAMEASYPIK 345
>gi|1223922|gb|AAA92063.1| cysteinyl endopeptidase [Vigna radiata]
Length = 362
Score = 365 bits (937), Expect = 2e-98, Method: Compositional matrix adjust.
Identities = 181/322 (56%), Positives = 227/322 (70%), Gaps = 5/322 (1%)
Query: 33 YSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKNYWLG 92
+ +DL S + L DL+E W S V SL EK +RF +FK+N+ H+ TN+ K Y L
Sbjct: 25 FHEKDLASEESLWDLYERWRS-HHTVSRSLTEKHKRFNVFKENVMHVHNTNKMDKPYKLK 83
Query: 93 LNEFADLRHEEFKEMFLGLKPD---LARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHV 149
LN+FAD+ + EF+ + G K + + R + F Y+ V +P SVDWRKKGAVT V
Sbjct: 84 LNKFADMTNHEFRSTYAGSKVNHHKMFRGTQHGNGTFMYEKVGSVPASVDWRKKGAVTDV 143
Query: 150 KNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYI 209
K+QG CGSCWAFSTV AVEGINQI T L SLSEQEL+DCD N GCNGGLM+ AF++I
Sbjct: 144 KDQGQCGSCWAFSTVVAVEGINQIKTDKLVSLSEQELVDCDKEENQGCNGGLMESAFEFI 203
Query: 210 VSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAI 269
GG+ E +YPY +EGTC+ +K V+I+G+ +VP N E++LLKA+ANQP+SVAI
Sbjct: 204 KQKGGITTESNYPYTAQEGTCDASKVNDLAVSIDGHENVPVNDENALLKAVANQPVSVAI 263
Query: 270 EASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGST-RGLDYIIVKNSWGPKWGEKGYIRM 328
+A G DFQFYS GV G C T L+HGVA VGYG+T G +Y IV+NSWGP+WGE+GYIRM
Sbjct: 264 DAGGSDFQFYSEGVLTGDCNTDLNHGVAIVGYGTTVDGTNYWIVRNSWGPEWGEQGYIRM 323
Query: 329 KRNTGKPEGLCGINKMASYPIK 350
+RN K EGLCGI MASYPIK
Sbjct: 324 QRNISKKEGLCGIAMMASYPIK 345
>gi|146215996|gb|ABQ10200.1| cysteine protease Cp2 [Actinidia deliciosa]
Length = 376
Score = 364 bits (935), Expect = 3e-98, Method: Compositional matrix adjust.
Identities = 179/336 (53%), Positives = 235/336 (69%), Gaps = 9/336 (2%)
Query: 24 FARDFSIVGYS--PEDLTS---NDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRH 78
+A SI+ Y+ P +S +++++ ++ W++K K Y + E+ RFEIFKDNL+
Sbjct: 18 YAAHMSIIDYNTNPNHKSSSRTDEEVMGIYAEWLAKHGKAYNGIGERERRFEIFKDNLKF 77
Query: 79 IDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARR---KDQSHEDFSYKDVVDLP 135
+DE N + ++Y +GLN FADL +EE++ MFLG K D RR + ++ +D LP
Sbjct: 78 VDEHNSENRSYKVGLNRFADLTNEEYRSMFLGTKTDSKRRFMKSKSASRRYAVQDSDMLP 137
Query: 136 KSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNN 195
+SVDWR+ GAV +K+QGSCGSCWAFSTVAAVEG+NQI TG + LSEQEL+DCD TY+
Sbjct: 138 ESVDWRESGAVAPIKDQGSCGSCWAFSTVAAVEGVNQIATGEMIQLSEQELVDCDRTYDA 197
Query: 196 GCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDS 255
GCNGGLMDYAF++I++ GG+ EEDYPY +GTC+ + ++VV+IN Y DVP E +
Sbjct: 198 GCNGGLMDYAFEFIINNGGIDTEEDYPYRGVDGTCDPERKNTKVVSINDYEDVPPYDEMA 257
Query: 256 LLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNS 315
L KA+A+QP+SVAIEASGR FQ Y GV+ G CG LDHGV VGYG+ G D+ IV+NS
Sbjct: 258 LKKAVAHQPVSVAIEASGRAFQLYLSGVFTGECGRALDHGVVVVGYGTDNGADHWIVRNS 317
Query: 316 WGPKWGEKGYIRMKRN-TGKPEGLCGINKMASYPIK 350
WG WGE GYIRM+RN G CGI ASYPIK
Sbjct: 318 WGTSWGENGYIRMERNVVDNFGGKCGIAMQASYPIK 353
>gi|363807062|ref|NP_001242584.1| uncharacterized protein LOC100804015 precursor [Glycine max]
gi|255640677|gb|ACU20623.1| unknown [Glycine max]
Length = 366
Score = 364 bits (935), Expect = 3e-98, Method: Compositional matrix adjust.
Identities = 183/357 (51%), Positives = 243/357 (68%), Gaps = 20/357 (5%)
Query: 1 MALSSQFKTIL-ISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVY 59
MA + T+L +SF +S+ I++S +I+ Y+ +++++ ++E W+ + +K Y
Sbjct: 1 MASMTMIYTLLFLSFTLSYAIKTS-----TIINYT------DNEVMAMYEEWLVRHQKGY 49
Query: 60 ESLDEKLERFEIFKDNLRHIDETNRKIKN-YWLGLNEFADLRHEEFKEMFLGLKPDLARR 118
L +K +RF++FKDNL I E N + N Y LGLN+FAD+ +EE++ M+LG K + RR
Sbjct: 50 NELGKKDKRFQVFKDNLGFIQEHNNNLNNTYKLGLNKFADMTNEEYRAMYLGTKSNAKRR 109
Query: 119 ----KDQSHE-DFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQI 173
K H FS +D LP VDWR KGAV +K+QGSCGSCWAFSTVA VE IN+I
Sbjct: 110 LMKTKSTGHRYAFSARD--RLPVHVDWRMKGAVAPIKDQGSCGSCWAFSTVATVEAINKI 167
Query: 174 VTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMT 233
VTG SLSEQEL+DCD YN GCNGGLMDYAF++I+ GG+ ++DYPY +G C+ T
Sbjct: 168 VTGKFVSLSEQELVDCDRAYNEGCNGGLMDYAFEFIIQNGGIDTDKDYPYRGFDGICDPT 227
Query: 234 KGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLD 293
K ++VV I+GY DVP E++L KA+A+QP+SVAIEASGR Q Y GV+ G CGT LD
Sbjct: 228 KKNAKVVNIDGYEDVPPYDENALKKAVAHQPVSVAIEASGRALQLYQSGVFTGKCGTSLD 287
Query: 294 HGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
HGV VGYGS G+DY +V+NSWG WGE GY +M+RN G CGI ASYP+K
Sbjct: 288 HGVVVVGYGSENGVDYWLVRNSWGTGWGEDGYFKMQRNVRTSTGKCGITMEASYPVK 344
>gi|28192373|gb|AAK07730.1| CPR1-like cysteine proteinase [Nicotiana tabacum]
Length = 374
Score = 364 bits (935), Expect = 3e-98, Method: Compositional matrix adjust.
Identities = 185/355 (52%), Positives = 244/355 (68%), Gaps = 13/355 (3%)
Query: 8 KTILISFCISFFIRSSFARDFSIVGYSPEDL-------TSNDKLIDLFESWMSKFEKVYE 60
KTI+ + + S+A D SI+ Y + D++ + +E W+++ + Y
Sbjct: 3 KTIITTLLFALSSSLSYAIDMSIIDYKNNHYARKWTLQSDEDQVKNRYEMWLAEHGRAYN 62
Query: 61 SLDEKLERFEIFKDNLRHIDETNRK-IKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRK 119
+L EK +RFEIFKDNLR I+E N + Y +GLN+FADL +EE++ M+LG K D RR
Sbjct: 63 ALGEKEKRFEIFKDNLRFIEEHNNSGNRTYKVGLNQFADLTNEEYRTMYLGTKSDARRRF 122
Query: 120 DQSH---EDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTG 176
+S + ++ + +P SVDWRK+GAV +KNQGSCGSCWAFSTVAAV GINQIVTG
Sbjct: 123 VKSKNPSQRYASRPNELMPHSVDWRKRGAVAPIKNQGSCGSCWAFSTVAAVGGINQIVTG 182
Query: 177 NLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGE 236
+ +LSEQEL+DCD N+GCNGGLMDYAF++I+S GG+ E+ YPY EG C+ +
Sbjct: 183 EMITLSEQELVDCDRVQNSGCNGGLMDYAFEFIISNGGMDTEKHYPYRGVEGRCDPVRKN 242
Query: 237 SEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGV 296
+VV+I+GY DVP+N E +L KA+A+QP+ VAIEASGR FQ YS GV+ G CG ++DHGV
Sbjct: 243 YKVVSIDGYEDVPRN-ERALQKAVAHQPVCVAIEASGRAFQLYSSGVFTGECGEEVDHGV 301
Query: 297 AAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPE-GLCGINKMASYPIK 350
VGYGS G+DY IV+NSWG KWGE GY++M+RN K G CGI ASYP K
Sbjct: 302 VVVGYGSEDGVDYWIVRNSWGTKWGENGYVKMERNVKKSHLGKCGIMTEASYPTK 356
>gi|89274062|dbj|BAE80740.1| cysteine proteinase [Platycodon grandiflorus]
Length = 462
Score = 364 bits (934), Expect = 4e-98, Method: Compositional matrix adjust.
Identities = 182/351 (51%), Positives = 246/351 (70%), Gaps = 13/351 (3%)
Query: 9 TILISFCISFFIRSSFARDFSIVGY-----SPEDLTSNDKLIDLFESWMSKFEKVYESLD 63
++ I+ + F+ SS A D SI+ Y S ++D+++ ++ESW+ K K Y +L
Sbjct: 7 SMAIALLFALFVASS-ALDMSIINYDATHASKSSWRTDDEVMAMYESWLVKHGKSYNALG 65
Query: 64 EKLERFEIFKDNLRHIDETNRKIK-NYWLGLNEFADLRHEEFKEMFLGLK--PDLARRKD 120
EK +RF+IFKDNLR IDE N + +Y +GLN FADL +EE++ +LG K P L++ K
Sbjct: 66 EKEKRFQIFKDNLRFIDEHNAEENLSYKVGLNRFADLTNEEYRSTYLGAKSKPKLSKVK- 124
Query: 121 QSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLAS 180
+ ++ + LP+SVDWR KGAV +K+QGSCGSCWAFSTV AVEGINQIVTG L +
Sbjct: 125 --SDRYAPRVGDSLPESVDWRAKGAVAPIKDQGSCGSCWAFSTVNAVEGINQIVTGELIT 182
Query: 181 LSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVV 240
LSEQEL+DCD +YN GC+GGLMDY F++I++ GG+ ++DYPY+ + C+ + ++VV
Sbjct: 183 LSEQELVDCDKSYNEGCDGGLMDYGFEFIINNGGIDTDKDYPYLGRDARCDQYRKNAKVV 242
Query: 241 TINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVG 300
TI+ Y DVP N+E++L KA+A+QP+SV IE GR FQFY G++ G CGT LDHGV VG
Sbjct: 243 TIDSYEDVPVNNEEALKKAVASQPVSVGIEGGGRAFQFYDSGIFTGKCGTALDHGVNVVG 302
Query: 301 YGSTRGLDYIIVKNSWGPKWGEKGYIRMKRN-TGKPEGLCGINKMASYPIK 350
YG+ +G DY IV+NSWG WGE GYIRM+RN G G CGI SYP+K
Sbjct: 303 YGTEKGKDYWIVRNSWGSSWGEAGYIRMERNLAGTSVGKCGIAMEPSYPLK 353
>gi|118158|sp|P12412.1|CYSEP_VIGMU RecName: Full=Vignain; AltName: Full=Bean endopeptidase; AltName:
Full=Cysteine proteinase; AltName:
Full=Sulfhydryl-endopeptidase; Short=SH-EP; Contains:
RecName: Full=Vignain-1; Contains: RecName:
Full=Vignain-2; Flags: Precursor
gi|22062|emb|CAA33753.1| sulfhydryl-pre-endopeptidase (AA -20 to 342) [Vigna mungo]
gi|22066|emb|CAA36181.1| sulfhydryl-endopeptidase [Vigna mungo]
Length = 362
Score = 363 bits (933), Expect = 5e-98, Method: Compositional matrix adjust.
Identities = 180/322 (55%), Positives = 226/322 (70%), Gaps = 5/322 (1%)
Query: 33 YSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKNYWLG 92
+ +DL S + L DL+E W S V SL EK +RF +FK N+ H+ TN+ K Y L
Sbjct: 25 FHEKDLESEESLWDLYERWRS-HHTVSRSLGEKHKRFNVFKANVMHVHNTNKMDKPYKLK 83
Query: 93 LNEFADLRHEEFKEMFLGLKPD---LARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHV 149
LN+FAD+ + EF+ + G K + + R F Y+ V +P SVDWRKKGAVT V
Sbjct: 84 LNKFADMTNHEFRSTYAGSKVNHHKMFRGSQHGSGTFMYEKVGSVPASVDWRKKGAVTDV 143
Query: 150 KNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYI 209
K+QG CGSCWAFST+ AVEGINQI T L SLSEQEL+DCD N GCNGGLM+ AF++I
Sbjct: 144 KDQGQCGSCWAFSTIVAVEGINQIKTNKLVSLSEQELVDCDKEENQGCNGGLMESAFEFI 203
Query: 210 VSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAI 269
GG+ E +YPY +EGTC+ +K V+I+G+ +VP N E++LLKA+ANQP+SVAI
Sbjct: 204 KQKGGITTESNYPYTAQEGTCDESKVNDLAVSIDGHENVPVNDENALLKAVANQPVSVAI 263
Query: 270 EASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGST-RGLDYIIVKNSWGPKWGEKGYIRM 328
+A G DFQFYS GV+ G C T L+HGVA VGYG+T G +Y IV+NSWGP+WGE+GYIRM
Sbjct: 264 DAGGSDFQFYSEGVFTGDCNTDLNHGVAIVGYGTTVDGTNYWIVRNSWGPEWGEQGYIRM 323
Query: 329 KRNTGKPEGLCGINKMASYPIK 350
+RN K EGLCGI MASYPIK
Sbjct: 324 QRNISKKEGLCGIAMMASYPIK 345
>gi|109939734|sp|P25776.2|ORYA_ORYSJ RecName: Full=Oryzain alpha chain; Flags: Precursor
gi|78192122|gb|ABB30151.1| oryzain alpha [Oryza sativa Japonica Group]
Length = 458
Score = 363 bits (932), Expect = 7e-98, Method: Compositional matrix adjust.
Identities = 177/329 (53%), Positives = 231/329 (70%), Gaps = 8/329 (2%)
Query: 27 DFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRK- 85
D SIV Y S ++ L+ W ++ K Y ++ E+ R+ F+DNLR+IDE N
Sbjct: 22 DMSIVSYGER---SEEEARRLYAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDEHNAAA 78
Query: 86 ---IKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRK 142
+ ++ LGLN FADL +EE+++ +LGL+ + RR+ + + + D LP+SVDWR
Sbjct: 79 DAGVHSFRLGLNRFADLTNEEYRDTYLGLR-NKPRRERKVSDRYLAADNEALPESVDWRT 137
Query: 143 KGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLM 202
KGAV +K+QG CGSCWAFS +AAVEGINQIVTG+L SLSEQEL+DCD +YN GCNGGLM
Sbjct: 138 KGAVAEIKDQGGCGSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDTSYNEGCNGGLM 197
Query: 203 DYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN 262
DYAF +I++ GG+ E+DYPY ++ C++ + ++VVTI+ Y DV NSE SL KA+AN
Sbjct: 198 DYAFDFIINNGGIDTEDDYPYKGKDERCDVNRKNAKVVTIDSYEDVTPNSETSLQKAVAN 257
Query: 263 QPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGE 322
QP+SVAIEA GR FQ YS G++ G CGT LDHGVAAVGYG+ G DY IV+NSWG WGE
Sbjct: 258 QPVSVAIEAGGRAFQLYSSGIFTGKCGTALDHGVAAVGYGTENGKDYWIVRNSWGKSWGE 317
Query: 323 KGYIRMKRNTGKPEGLCGINKMASYPIKK 351
GY+RM+RN G CGI SYP+KK
Sbjct: 318 SGYVRMERNIKASSGKCGIAVEPSYPLKK 346
>gi|388517427|gb|AFK46775.1| unknown [Medicago truncatula]
Length = 362
Score = 363 bits (931), Expect = 8e-98, Method: Compositional matrix adjust.
Identities = 185/347 (53%), Positives = 234/347 (67%), Gaps = 10/347 (2%)
Query: 8 KTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLE 67
K +LI I+ + S + DF +D++S++ L DL+E W S V +L+EK +
Sbjct: 5 KLLLIVLSIALVLVVSESFDFH-----DKDVSSDESLWDLYERWRS-HHTVSRNLNEKQK 58
Query: 68 RFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPD---LARRKDQSHE 124
RF +FK N+ H+ TN+ K Y L LN+FAD+ + EFK + G K + + R +
Sbjct: 59 RFNVFKSNVMHVHNTNKMDKPYKLKLNKFADMTNHEFKTTYAGTKVNHHRMFRGTPRVSG 118
Query: 125 DFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQ 184
F Y++ P SVDWRKKGAVT VK+QG CGSCWAFSTV AVEGINQI T L LSEQ
Sbjct: 119 TFMYENFTKAPASVDWRKKGAVTDVKDQGQCGSCWAFSTVVAVEGINQIKTNRLVPLSEQ 178
Query: 185 ELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTING 244
ELIDCDN N GCNGGLM+YAF+YI GG+ E YPY +G+C+ TK V+I+G
Sbjct: 179 ELIDCDNQENQGCNGGLMEYAFEYIKQKGGVTTESYYPYTANDGSCDATKENVPTVSIDG 238
Query: 245 YHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGST 304
+ VP N ED+LLKA+ANQP+SVAI+A G DFQFYS GV+ G CG +L+HGVA VGYG+T
Sbjct: 239 HETVPANDEDALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGDCGKELNHGVAIVGYGTT 298
Query: 305 -RGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
G +Y IV+NSWG +WGE+G IRMKRN EGLCGI ASYP+K
Sbjct: 299 VDGTNYWIVRNSWGAEWGEQGCIRMKRNVSNKEGLCGIAMEASYPVK 345
>gi|242071345|ref|XP_002450949.1| hypothetical protein SORBIDRAFT_05g021550 [Sorghum bicolor]
gi|241936792|gb|EES09937.1| hypothetical protein SORBIDRAFT_05g021550 [Sorghum bicolor]
Length = 371
Score = 363 bits (931), Expect = 9e-98, Method: Compositional matrix adjust.
Identities = 181/355 (50%), Positives = 238/355 (67%), Gaps = 19/355 (5%)
Query: 10 ILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKF----EKVYESLDEK 65
+L + ++ + + AR + + ++ +DL S + L L+E W S + + D+K
Sbjct: 6 VLAAVSLALLVLAPPAR--AGIPFTEKDLASEESLRALYEQWRSHYMVSRPAGLQEQDDK 63
Query: 66 LERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLG---------LKPDLA 116
F +FK+N+R+I E N+K +++ L LN+FAD+ +EF+ + L +
Sbjct: 64 ARWFNVFKENVRYIHEANKKGRSFRLALNKFADMTTDEFRRAYAAGSRTRHHRALSSGIR 123
Query: 117 RRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTG 176
R D S F Y +LP +VDWR++GAVT +K+QG CGSCWAFST+AAVEGIN+I TG
Sbjct: 124 RHGDGS---FMYAQAGNLPLAVDWRQRGAVTGIKDQGQCGSCWAFSTIAAVEGINKIRTG 180
Query: 177 NLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGE 236
L SLSEQEL+DCD+ N GCNGGLMDYAFQYI GG+ E +YPY+ E+ +C K
Sbjct: 181 KLVSLSEQELVDCDDVDNQGCNGGLMDYAFQYIKRNGGITTESNYPYLAEQRSCNKAKER 240
Query: 237 SEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGV 296
S VTI+GY DVP N+ED+L KA+ANQP+S+AIEASG+DFQFYS GV+ G CGT+LDHGV
Sbjct: 241 SHDVTIDGYEDVPANNEDALQKAVANQPVSIAIEASGQDFQFYSEGVFTGSCGTELDHGV 300
Query: 297 AAVGYGSTR-GLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
AAVGYG TR G Y IVKNSWG WGE+GYIRM+R +GLCGI SYP K
Sbjct: 301 AAVGYGITRDGTKYWIVKNSWGEDWGERGYIRMQRGISDSQGLCGIAMEPSYPTK 355
>gi|30141021|dbj|BAC75924.1| cysteine protease-2 [Helianthus annuus]
Length = 362
Score = 363 bits (931), Expect = 9e-98, Method: Compositional matrix adjust.
Identities = 183/323 (56%), Positives = 228/323 (70%), Gaps = 8/323 (2%)
Query: 33 YSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKNYWLG 92
+ ++L + D L D++E W KV + EKL RF +FK N+ H+ ETN+ K Y L
Sbjct: 25 FHEKELETEDNLWDMYERWR---HKVATNHGEKLRRFNVFKSNVLHVHETNKMDKPYKLK 81
Query: 93 LNEFADLRHEEFKEMFLGLK---PDLARRKDQS-HEDFSYKDVVDLPKSVDWRKKGAVTH 148
LN+FAD+ + EF+ ++ G K D + + D+S + F Y +V +P SVDWRKKGAV
Sbjct: 82 LNKFADMTNHEFRSVYAGSKIHHHDRSLQGDRSGSKTFMYANVESVPTSVDWRKKGAVAP 141
Query: 149 VKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQY 208
VK+QG CGSCWAFSTVAAVEGIN+I T L SLSEQEL+DCD N GCNGGLMD AF +
Sbjct: 142 VKDQGQCGSCWAFSTVAAVEGINKIKTNELVSLSEQELVDCDTLENQGCNGGLMDLAFDF 201
Query: 209 IVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVA 268
I TGGL +E+ YPY E+G C+ K S VV+I+G+ DVP+N E SL+KA+ANQP++VA
Sbjct: 202 IKKTGGLTREDAYPYAAEDGKCDSNKMNSPVVSIDGHEDVPKNDEQSLMKAVANQPVAVA 261
Query: 269 IEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGST-RGLDYIIVKNSWGPKWGEKGYIR 327
I+A DFQFYS GV+ G CGTQLDHGVAAVGYG+T G Y IV+NSWG +WGEKGYIR
Sbjct: 262 IDAGSSDFQFYSEGVFTGKCGTQLDHGVAAVGYGTTLDGTKYWIVRNSWGSEWGEKGYIR 321
Query: 328 MKRNTGKPEGLCGINKMASYPIK 350
M+R GLCGI ASYPIK
Sbjct: 322 MERGISDKRGLCGIAMEASYPIK 344
>gi|357465603|ref|XP_003603086.1| Cysteine proteinase [Medicago truncatula]
gi|355492134|gb|AES73337.1| Cysteine proteinase [Medicago truncatula]
Length = 474
Score = 363 bits (931), Expect = 9e-98, Method: Compositional matrix adjust.
Identities = 190/369 (51%), Positives = 247/369 (66%), Gaps = 24/369 (6%)
Query: 1 MALSSQFKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDK--------LIDLFESWM 52
M +S +L+ + ++FA D SI+ Y D T +DK + +++E W
Sbjct: 1 MGSNSNRSPMLVILIVFTLFTATFALDMSIISY---DKTHSDKSSRRSDKEVKNIYEEWR 57
Query: 53 SKFEKVYESLD--EKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLG 110
K K+ ++D EK +RFEIFKDNL+ IDE N + + Y +GLN FADL +EE++ +LG
Sbjct: 58 VKHGKLNNNIDGSEKDKRFEIFKDNLKFIDEHNAENRTYKVGLNRFADLSNEEYRSRYLG 117
Query: 111 LKPD-----LARRKDQSHEDFSYKDVV--DLPKSVDWRKKGAVTHVKNQGSCGSCWAFST 163
K D +AR K +S+ Y V LPKSVDWR +GAV VK+QGSCGSCWAFST
Sbjct: 118 TKIDPIGMMMARTKTRSNR---YAPSVGDKLPKSVDWRSQGAVVQVKDQGSCGSCWAFST 174
Query: 164 VAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPY 223
+AAVEGIN+IVTG L SLSEQEL+DCD T N GC+GGLM+YAF++I++ GG+ +EDYPY
Sbjct: 175 IAAVEGINKIVTGELVSLSEQELVDCDRTVNAGCDGGLMEYAFEFIINNGGIDSDEDYPY 234
Query: 224 IMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGV 283
+G C+ K + VV+I+ Y VP E +L KA+ANQP+SVAIEA GR+FQ Y G+
Sbjct: 235 RGVDGKCDQYKKNARVVSIDDYEQVPAYDELALKKAVANQPISVAIEAGGREFQLYVSGI 294
Query: 284 YDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRN-TGKPEGLCGIN 342
+ G CGT LDHGV AVGYG+ G+DY IV+NSWG WGE GY+RM+RN G CGI
Sbjct: 295 FTGKCGTALDHGVTAVGYGTENGVDYWIVRNSWGKSWGESGYVRMERNLAASVAGKCGIV 354
Query: 343 KMASYPIKK 351
+SYPIKK
Sbjct: 355 MQSSYPIKK 363
>gi|222629675|gb|EEE61807.1| hypothetical protein OsJ_16426 [Oryza sativa Japonica Group]
Length = 459
Score = 363 bits (931), Expect = 9e-98, Method: Compositional matrix adjust.
Identities = 177/329 (53%), Positives = 231/329 (70%), Gaps = 8/329 (2%)
Query: 27 DFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRK- 85
D SIV Y S ++ L+ W ++ K Y ++ E+ R+ F+DNLR+IDE N
Sbjct: 23 DMSIVSYGER---SEEEARRLYAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDEHNAAA 79
Query: 86 ---IKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRK 142
+ ++ LGLN FADL +EE+++ +LGL+ + RR+ + + + D LP+SVDWR
Sbjct: 80 DAGVHSFRLGLNRFADLTNEEYRDTYLGLR-NKPRRERKVSDRYLAADNEALPESVDWRT 138
Query: 143 KGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLM 202
KGAV +K+QG CGSCWAFS +AAVEGINQIVTG+L SLSEQEL+DCD +YN GCNGGLM
Sbjct: 139 KGAVAEIKDQGGCGSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDTSYNEGCNGGLM 198
Query: 203 DYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN 262
DYAF +I++ GG+ E+DYPY ++ C++ + ++VVTI+ Y DV NSE SL KA+AN
Sbjct: 199 DYAFDFIINNGGIDTEDDYPYKGKDERCDVNRKNAKVVTIDSYEDVTPNSETSLQKAVAN 258
Query: 263 QPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGE 322
QP+SVAIEA GR FQ YS G++ G CGT LDHGVAAVGYG+ G DY IV+NSWG WGE
Sbjct: 259 QPVSVAIEAGGRAFQLYSSGIFTGKCGTALDHGVAAVGYGTENGKDYWIVRNSWGKSWGE 318
Query: 323 KGYIRMKRNTGKPEGLCGINKMASYPIKK 351
GY+RM+RN G CGI SYP+KK
Sbjct: 319 SGYVRMERNIKASSGKCGIAVEPSYPLKK 347
>gi|445927|prf||1910332A Cys endopeptidase
Length = 362
Score = 362 bits (930), Expect = 1e-97, Method: Compositional matrix adjust.
Identities = 180/322 (55%), Positives = 226/322 (70%), Gaps = 5/322 (1%)
Query: 33 YSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKNYWLG 92
+ +DL S + L DL+E W S V SL EK +RF +FK N+ H+ TN+ K Y L
Sbjct: 25 FHEKDLESEESLWDLYERWRS-HHTVSRSLGEKHKRFNVFKANVMHVHNTNKMDKPYKLK 83
Query: 93 LNEFADLRHEEFKEMFLGLKPD---LARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHV 149
LN+FAD+ + EF+ + G K + + R F Y+ V +P SVDWRKKGAVT V
Sbjct: 84 LNKFADMTNHEFRSTYAGSKVNHHKMFRGSQHGSGTFMYEKVGSVPASVDWRKKGAVTDV 143
Query: 150 KNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYI 209
K+QG CGSCWAFST+ AVEGINQI T L SLSEQEL+DCD N GCNGGLM+ AF++I
Sbjct: 144 KDQGQCGSCWAFSTIVAVEGINQIKTNKLVSLSEQELVDCDKEENQGCNGGLMESAFEFI 203
Query: 210 VSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAI 269
GG+ E +YPY +EGTC+ +K V+I+G+ +VP N E++LLKA+ANQP+SVAI
Sbjct: 204 KQKGGITTESNYPYKAQEGTCDESKVNDLAVSIDGHENVPVNDENALLKAVANQPVSVAI 263
Query: 270 EASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGST-RGLDYIIVKNSWGPKWGEKGYIRM 328
+A G DFQFYS GV+ G C T L+HGVA VGYG+T G +Y IV+NSWGP+WGE+GYIRM
Sbjct: 264 DAGGSDFQFYSEGVFTGDCNTDLNHGVAIVGYGTTVDGTNYWIVRNSWGPEWGEQGYIRM 323
Query: 329 KRNTGKPEGLCGINKMASYPIK 350
+RN K EGLCGI MASYPIK
Sbjct: 324 QRNISKKEGLCGIAMMASYPIK 345
>gi|217073894|gb|ACJ85307.1| unknown [Medicago truncatula]
gi|388507498|gb|AFK41815.1| unknown [Medicago truncatula]
Length = 362
Score = 362 bits (930), Expect = 1e-97, Method: Compositional matrix adjust.
Identities = 185/347 (53%), Positives = 234/347 (67%), Gaps = 10/347 (2%)
Query: 8 KTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLE 67
K +LI I+ + S + DF +D++S++ L DL+E W S V +L+EK +
Sbjct: 5 KLLLIVLSIALVLVVSESFDFH-----DKDVSSDESLWDLYERWRS-HHTVSRNLNEKQK 58
Query: 68 RFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPD---LARRKDQSHE 124
RF +FK N+ H+ TN+ K Y L LN+FAD+ + EFK + G K + + R +
Sbjct: 59 RFNVFKSNVMHVHNTNKMDKPYKLKLNKFADMTNHEFKTTYAGSKVNHHRMFRGTPRVSG 118
Query: 125 DFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQ 184
F Y++ P SVDWRKKGAVT VK+QG CGSCWAFSTV AVEGINQI T L LSEQ
Sbjct: 119 TFMYENFTKAPASVDWRKKGAVTDVKDQGQCGSCWAFSTVVAVEGINQIKTNRLVPLSEQ 178
Query: 185 ELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTING 244
ELIDCDN N GCNGGLM+YAF+YI GG+ E YPY +G+C+ TK V+I+G
Sbjct: 179 ELIDCDNQENQGCNGGLMEYAFEYIKQKGGVTTESYYPYTANDGSCDATKENVPTVSIDG 238
Query: 245 YHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGST 304
+ VP N ED+LLKA+ANQP+SVAI+A G DFQFYS GV+ G CG +L+HGVA VGYG+T
Sbjct: 239 HETVPANDEDALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGDCGKELNHGVAIVGYGTT 298
Query: 305 -RGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
G +Y IV+NSWG +WGE+G IRMKRN EGLCGI ASYP+K
Sbjct: 299 VDGTNYWIVRNSWGAEWGEQGCIRMKRNVSNKEGLCGIAMEASYPVK 345
>gi|162459393|ref|NP_001105993.1| cysteine protease component of protease-inhibitor complex precursor
[Zea mays]
gi|6682829|dbj|BAA88898.1| cysteine protease component of protease-inhibitor complex [Zea
mays]
Length = 465
Score = 362 bits (929), Expect = 1e-97, Method: Compositional matrix adjust.
Identities = 174/329 (52%), Positives = 229/329 (69%), Gaps = 8/329 (2%)
Query: 27 DFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRK- 85
D SIV Y S+++ ++ WM+ + Y ++ E+ R+++F+DNLR+ID N
Sbjct: 26 DMSIVSYGER---SDEEARRMYAEWMAAHGRTYNAVGEEERRYQVFRDNLRYIDAHNAAA 82
Query: 86 ---IKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRK 142
+ ++ LGLN FADL ++E++ +LG + R + + D DLP+SVDWR
Sbjct: 83 DAGVHSFRLGLNRFADLTNDEYRATYLGARTRPQRERKLGAR-YHAADNEDLPESVDWRA 141
Query: 143 KGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLM 202
KGAV VK+QGS GSCWAFST+AAVEGINQIVTG+L SLSEQEL+DCD +YN GCNGGLM
Sbjct: 142 KGAVAEVKDQGSYGSCWAFSTIAAVEGINQIVTGDLISLSEQELVDCDTSYNQGCNGGLM 201
Query: 203 DYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN 262
DYAF++I++ GG+ E+DYPY +G C++ + ++VVTI+ Y DVP N E SL KA+AN
Sbjct: 202 DYAFEFIINNGGIDTEKDYPYKGTDGRCDVNRKNAKVVTIDSYEDVPANDEKSLQKAVAN 261
Query: 263 QPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGE 322
QP+SVAIEA+G FQ YS G++ G CGT LDHGV AVGYG+ G DY IVKNSWG WGE
Sbjct: 262 QPVSVAIEAAGTQFQLYSSGIFTGSCGTALDHGVTAVGYGTENGKDYWIVKNSWGSSWGE 321
Query: 323 KGYIRMKRNTGKPEGLCGINKMASYPIKK 351
GY+RM+RN G CGI SYP+K+
Sbjct: 322 SGYVRMERNIKASSGKCGIAVEPSYPLKE 350
>gi|218195711|gb|EEC78138.1| hypothetical protein OsI_17694 [Oryza sativa Indica Group]
Length = 458
Score = 362 bits (929), Expect = 1e-97, Method: Compositional matrix adjust.
Identities = 177/329 (53%), Positives = 231/329 (70%), Gaps = 8/329 (2%)
Query: 27 DFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRK- 85
D SIV Y S ++ L+ W ++ K Y ++ E+ R+ F+DNLR+IDE N
Sbjct: 22 DMSIVSYGER---SEEEARRLYAEWKAEHGKNYNAVGEEERRYAAFRDNLRYIDEHNAAA 78
Query: 86 ---IKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRK 142
+ ++ LGLN FADL +EE+++ +LGL+ + RR+ + + + D LP+SVDWR
Sbjct: 79 DAGVHSFRLGLNRFADLTNEEYRDTYLGLR-NKPRRERKVSDRYLAADNEALPESVDWRT 137
Query: 143 KGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLM 202
KGAV +K+QG CGSCWAFS +AAVEGINQIVTG+L SLSEQEL+DCD +YN GCNGGLM
Sbjct: 138 KGAVAEIKDQGGCGSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDTSYNEGCNGGLM 197
Query: 203 DYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN 262
DYAF +I++ GG+ E+DYPY ++ C++ + ++VVTI+ Y DV NSE SL KA+AN
Sbjct: 198 DYAFDFIINNGGIDTEDDYPYKGKDERCDVNRKNAKVVTIDSYEDVTPNSETSLQKAVAN 257
Query: 263 QPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGE 322
QP+SVAIEA GR FQ YS G++ G CGT LDHGVAAVGYG+ G DY IV+NSWG WGE
Sbjct: 258 QPVSVAIEAGGRAFQLYSSGIFTGKCGTALDHGVAAVGYGTENGKDYWIVRNSWGKSWGE 317
Query: 323 KGYIRMKRNTGKPEGLCGINKMASYPIKK 351
GY+RM+RN G CGI SYP+KK
Sbjct: 318 SGYVRMERNIKASSGKCGIAVEPSYPLKK 346
>gi|388519351|gb|AFK47737.1| unknown [Medicago truncatula]
Length = 359
Score = 362 bits (929), Expect = 2e-97, Method: Compositional matrix adjust.
Identities = 181/348 (52%), Positives = 234/348 (67%), Gaps = 14/348 (4%)
Query: 9 TILISFCISF-FIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLE 67
+I I+ + F I S A D S + SN++++ ++E W+ K KVY L EK +
Sbjct: 3 SITITSLLFFSLITLSLAMDTS--------MRSNEEVMTMYEEWLVKHHKVYNGLGEKDQ 54
Query: 68 RFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARR----KDQSH 123
RFEIFKDNL IDE N + Y +GLN+FAD +EE++ M+LG K D R K +
Sbjct: 55 RFEIFKDNLGFIDEHNAQNYTYKVGLNKFADTTNEEYRNMYLGTKNDAKRNVMKIKITTG 114
Query: 124 EDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSE 183
+++ LP VDWR KGAV H+K+QGSCGSCWAFST+A VE IN+IVTG L SLSE
Sbjct: 115 HRYAFNSGDRLPVHVDWRSKGAVAHIKDQGSCGSCWAFSTIATVEAINKIVTGKLVSLSE 174
Query: 184 QELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTIN 243
QEL+DCD +N GCNGGLMDYAF++IV GG+ E+DYPY EG C+ T+ ++VV+I+
Sbjct: 175 QELVDCDRAFNEGCNGGLMDYAFEFIVENGGIDTEQDYPYKGFEGRCDPTRKNAKVVSID 234
Query: 244 GYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGS 303
GY DVP +E++L KA+ +QP+SVAIEA GR Q Y GV+ G CGT LDHGV VGYG
Sbjct: 235 GYEDVPAYNENALKKAVFHQPVSVAIEAGGRALQLYQSGVFTGRCGTNLDHGVVVVGYGF 294
Query: 304 TRGLDYIIVKNSWGPKWGEKGYIRMKRNTGK-PEGLCGINKMASYPIK 350
G+DY +V+NSWG WGE GY +++RN K G CGI ASYP+K
Sbjct: 295 ENGVDYWLVRNSWGTNWGEDGYFKLERNVKKINTGKCGIAMQASYPVK 342
>gi|168058022|ref|XP_001781010.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162667567|gb|EDQ54194.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 457
Score = 362 bits (929), Expect = 2e-97, Method: Compositional matrix adjust.
Identities = 177/321 (55%), Positives = 223/321 (69%), Gaps = 6/321 (1%)
Query: 35 PEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKNYWLGLN 94
P D+ + L F +W K KVY + +E+ RF ++KDNL +I + K +YWLGL
Sbjct: 32 PTDVGKDQLLAGQFAAWAHKHGKVYSAAEERAHRFLVWKDNLEYIQRHSEKNLSYWLGLT 91
Query: 95 EFADLRHEEFKEMFLGLKPDLARRKDQSHE---DFSYKDVVDLPKSVDWRKKGAVTHVKN 151
+FADL +EEF+ + G + D +RR + F Y + + PKS+DWR+KGAVT VK+
Sbjct: 92 KFADLTNEEFRRQYTGTRIDRSRRLKKGRNATGSFRYANS-EAPKSIDWREKGAVTSVKD 150
Query: 152 QGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVS 211
QGSCGSCWAFS V +VEGIN I TG+ SLS QEL+DCD YN GCNGGLMDYAF +++
Sbjct: 151 QGSCGSCWAFSAVGSVEGINAIRTGDAISLSVQELVDCDKKYNQGCNGGLMDYAFDFVIQ 210
Query: 212 TGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEA 271
GG+ E+DYPY +G C++ K + VVTI+ Y DVP+N E++L KA+A QP+SVAIEA
Sbjct: 211 NGGIDTEKDYPYQGYDGRCDVNKMNARVVTIDSYEDVPENDEEALKKAVAGQPVSVAIEA 270
Query: 272 SGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRN 331
GRDFQ YSGGV+ G CGT LDHGV AVGYGS +GLDY IVKNSWG WGE GY+RM+RN
Sbjct: 271 GGRDFQLYSGGVFTGRCGTDLDHGVLAVGYGSEKGLDYWIVKNSWGEYWGESGYLRMQRN 330
Query: 332 TGKPE--GLCGINKMASYPIK 350
GLCGIN SY +K
Sbjct: 331 LKDDNGYGLCGINIEPSYAVK 351
>gi|351721126|ref|NP_001237199.1| cysteine proteinase precursor [Glycine max]
gi|31559530|dbj|BAC77523.1| cysteine proteinase [Glycine max]
gi|31559532|dbj|BAC77524.1| cysteine proteinase [Glycine max]
Length = 362
Score = 362 bits (928), Expect = 2e-97, Method: Compositional matrix adjust.
Identities = 185/354 (52%), Positives = 240/354 (67%), Gaps = 13/354 (3%)
Query: 1 MALSSQFKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYE 60
MA+ +F +++S + + +SF + +DL S + L DL+E W S V
Sbjct: 1 MAMK-KFLWVVLSLSLVLGVANSF-------DFHDKDLESEESLWDLYERWRSH-HTVSR 51
Query: 61 SLDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPD---LAR 117
SL +K +RF +FK N+ H+ TN+ K Y L LN+FAD+ + EF+ + G K + + R
Sbjct: 52 SLGDKHKRFNVFKANMMHVHNTNKMDKPYKLKLNKFADMTNHEFRSTYAGSKVNHHRMFR 111
Query: 118 RKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGN 177
+ + F Y+ V +P SVDWRKKGAVT VK+QG CGSCWAFSTV AVEGINQI T
Sbjct: 112 DMPRGNGTFMYEKVGSVPASVDWRKKGAVTDVKDQGHCGSCWAFSTVVAVEGINQIKTNK 171
Query: 178 LASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGES 237
L SLSEQEL+DCD N GCNGGLM+ AFQ+I GG+ E YPY ++GTC+ +K
Sbjct: 172 LVSLSEQELVDCDTEENAGCNGGLMESAFQFIKQKGGITTESYYPYTAQDGTCDASKAND 231
Query: 238 EVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVA 297
V+I+G+ +VP N E++LLKA+ANQP+SVAI+A G DFQFYS GV+ G C T+L+HGVA
Sbjct: 232 LAVSIDGHENVPGNDENALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGDCSTELNHGVA 291
Query: 298 AVGYGST-RGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
VGYG+T G Y IV+NSWGP+WGE GYIRM+RN K EGLCGI +ASYPIK
Sbjct: 292 IVGYGATVDGTSYWIVRNSWGPEWGELGYIRMQRNISKKEGLCGIAMLASYPIK 345
>gi|50355615|dbj|BAD29956.1| cysteine protease [Daucus carota]
Length = 423
Score = 361 bits (927), Expect = 2e-97, Method: Compositional matrix adjust.
Identities = 179/301 (59%), Positives = 220/301 (73%), Gaps = 3/301 (0%)
Query: 52 MSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKI-KNYWLGLNEFADLRHEEFKEMFLG 110
+ K K Y +L K +RFEIFKDNLR IDE N+ + +++ LGLN+FADL +EE+K MFLG
Sbjct: 11 LVKHHKNYNALGAKEKRFEIFKDNLRFIDEHNKGVNQSFKLGLNKFADLSNEEYKSMFLG 70
Query: 111 LKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGI 170
+ + RK + F Y +LP+SVDWR+KGAV VK+QG CGSCWAFSTVAAVEGI
Sbjct: 71 GRM-VRDRKGFESDRFKYGVGDELPQSVDWREKGAVAPVKDQGQCGSCWAFSTVAAVEGI 129
Query: 171 NQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTC 230
NQI TG+L SLSEQEL+DCD +N GCNGG MDYAF++IV GG+ E+DYPY +G C
Sbjct: 130 NQIATGDLISLSEQELVDCDKGFNQGCNGGFMDYAFEFIVKNGGIDTEDDYPYKGVDGQC 189
Query: 231 EMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGT 290
+ + ++VVTING+ DVPQN E SL KA+A+QP+SVAIEA GR FQ Y G+++G CGT
Sbjct: 190 DQNRKNAKVVTINGFEDVPQNDEKSLKKAVAHQPVSVAIEAGGRAFQLYESGIFNGLCGT 249
Query: 291 QLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPE-GLCGINKMASYPI 349
LDHGV AVGYG+ G DY IV+NSWGP WGE GYIR++RN G CGI SYP
Sbjct: 250 DLDHGVVAVGYGTEDGKDYWIVRNSWGPNWGENGYIRLERNVASTNTGKCGIAMQPSYPT 309
Query: 350 K 350
K
Sbjct: 310 K 310
>gi|1709574|sp|P10056.2|PAPA3_CARPA RecName: Full=Caricain; AltName: Full=Papaya peptidase A; AltName:
Full=Papaya proteinase III; Short=PPIII; AltName:
Full=Papaya proteinase omega; Flags: Precursor
gi|18098|emb|CAA46862.1| proteinase omega [Carica papaya]
Length = 348
Score = 361 bits (927), Expect = 3e-97, Method: Compositional matrix adjust.
Identities = 185/348 (53%), Positives = 242/348 (69%), Gaps = 4/348 (1%)
Query: 3 LSSQFKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESL 62
+ S K + ++ C+ + SF DFSIVGYS +DLTS ++LI LF SWM K YE++
Sbjct: 4 IPSISKLLFVAICLFVHMSVSFG-DFSIVGYSQDDLTSTERLIQLFNSWMLNHNKFYENV 62
Query: 63 DEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQS 122
DEKL RFEIFKDNL +IDETN+K +YWLGLNEFADL ++EF E ++G D A +
Sbjct: 63 DEKLYRFEIFKDNLNYIDETNKKNNSYWLGLNEFADLSNDEFNEKYVGSLID-ATIEQSY 121
Query: 123 HEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLS 182
E+F +D V+LP++VDWRKKGAVT V++QGSCGSCWAFS VA VEGIN+I TG L LS
Sbjct: 122 DEEFINEDTVNLPENVDWRKKGAVTPVRHQGSCGSCWAFSAVATVEGINKIRTGKLVELS 181
Query: 183 EQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTI 242
EQEL+DC+ ++GC GG YA +Y V+ G+H YPY ++GTC + +V
Sbjct: 182 EQELVDCERR-SHGCKGGYPPYALEY-VAKNGIHLRSKYPYKAKQGTCRAKQVGGPIVKT 239
Query: 243 NGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYG 302
+G V N+E +LL A+A QP+SV +E+ GR FQ Y GG+++G CGT++DH V AVGYG
Sbjct: 240 SGVGRVQPNNEGNLLNAIAKQPVSVVVESKGRPFQLYKGGIFEGPCGTKVDHAVTAVGYG 299
Query: 303 STRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
+ G YI++KNSWG WGEKGYIR+KR G G+CG+ K + YP K
Sbjct: 300 KSGGKGYILIKNSWGTAWGEKGYIRIKRAPGNSPGVCGLYKSSYYPTK 347
>gi|148907299|gb|ABR16787.1| unknown [Picea sitchensis]
Length = 372
Score = 361 bits (926), Expect = 3e-97, Method: Compositional matrix adjust.
Identities = 183/328 (55%), Positives = 231/328 (70%), Gaps = 9/328 (2%)
Query: 32 GYSPEDLTSNDKLIDLFESWMSKFEKVYE-SLDEKLERFEIFKDNLRHIDETNRKIKNYW 90
G++ E+L S++ L L++ W + DE RFEIFK+N++HID N+K Y
Sbjct: 29 GFTDEELESDESLRGLYDKWALQHRSTRSLDSDEHARRFEIFKENVKHIDSVNKKDGPYK 88
Query: 91 LGLNEFADLRHEEFKEMFLGLKPDLAR--RKDQSHED--FSYKDVVDLPKSVDWRKKGAV 146
LGLN+FADL +EEFK M + K + + R D+ E F Y++ LP S+DWRKKGAV
Sbjct: 89 LGLNKFADLSNEEFKAMHMTTKMEKHKSLRGDRGVESGSFMYQNSKRLPASIDWRKKGAV 148
Query: 147 THVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAF 206
T VKNQG CGSCWAFST+A+VEGIN I TG L SLSEQ+L+DC N GCNGGLMD AF
Sbjct: 149 TPVKNQGQCGSCWAFSTIASVEGINYIKTGKLVSLSEQQLVDCSKE-NAGCNGGLMDNAF 207
Query: 207 QYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVT--INGYHDVPQNSEDSLLKALANQP 264
QYI+ GG+ E++YPY E G C TK ES+ + I+G+ DVP N+E +L KA+A+QP
Sbjct: 208 QYIIDNGGIVTEDEYPYTAEAGECSTTKIESKSIATIIDGFEDVPANNEGALKKAVAHQP 267
Query: 265 LSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYG-STRGLDYIIVKNSWGPKWGEK 323
+S+AIEASG DFQFYS GV+ G CGT+LDHGV VGYG S G++Y IV+NSWGP+WGE+
Sbjct: 268 VSIAIEASGHDFQFYSTGVFTGKCGTELDHGVVVVGYGKSPEGINYWIVRNSWGPEWGEQ 327
Query: 324 GYIRMKRNTGKPEGLCGINKMASYPIKK 351
GYIRM+R EG CGI+ ASYP KK
Sbjct: 328 GYIRMQRGIEATEGKCGISMQASYPTKK 355
>gi|22661|emb|CAA49504.1| papaya proteinase omega [Carica papaya]
Length = 367
Score = 361 bits (926), Expect = 4e-97, Method: Compositional matrix adjust.
Identities = 185/350 (52%), Positives = 244/350 (69%), Gaps = 4/350 (1%)
Query: 3 LSSQFKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESL 62
+ S K + ++ C+ + SF DFSIVGYS +DLTS ++LI LF SWM K YE++
Sbjct: 4 IPSISKLLFVAICLFVHMSVSFG-DFSIVGYSQDDLTSTERLIQLFNSWMLNHNKFYENV 62
Query: 63 DEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQS 122
DEKL RFEIFKDNL +IDETN+K +Y LGLNEFADL ++EF E ++G D A +
Sbjct: 63 DEKLYRFEIFKDNLNYIDETNKKNNSYRLGLNEFADLSNDEFNEKYVGSLID-ATIEQSY 121
Query: 123 HEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLS 182
E+F +D+V+LP++VDWRKKGAVT V++QGSCGSCWAFS VA VEGIN+I TG L LS
Sbjct: 122 DEEFINEDIVNLPENVDWRKKGAVTPVRHQGSCGSCWAFSAVATVEGINKIRTGKLVELS 181
Query: 183 EQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTI 242
EQEL+DC+ ++GC GG YA +Y V+ G+H YPY ++GTC + +V
Sbjct: 182 EQELVDCERR-SHGCKGGYPPYALEY-VAKNGIHLRSKYPYKAKQGTCRAKQVGGPIVKT 239
Query: 243 NGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYG 302
+G V N+E +LL A+A QP+SV +E+ GR FQ Y GG+++G CGT++DH V AVGYG
Sbjct: 240 SGVGRVQPNNEGNLLNAIAKQPVSVVVESKGRPFQLYKGGIFEGPCGTKVDHAVTAVGYG 299
Query: 303 STRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIKKK 352
+ G YI++KNSWG WGEKGYIR+KR G G+CG+ K + YPIK +
Sbjct: 300 KSGGKGYILIKNSWGTAWGEKGYIRIKRAPGNSPGVCGLYKSSYYPIKNR 349
>gi|224081756|ref|XP_002306486.1| predicted protein [Populus trichocarpa]
gi|222855935|gb|EEE93482.1| predicted protein [Populus trichocarpa]
Length = 352
Score = 360 bits (925), Expect = 4e-97, Method: Compositional matrix adjust.
Identities = 178/310 (57%), Positives = 227/310 (73%), Gaps = 4/310 (1%)
Query: 45 IDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEF 104
+ +++ W++K K Y L E+ ERFEIFK+NLR IDE N + Y +GL +FADL +EE+
Sbjct: 1 MSMYKWWLAKHGKAYNGLGEEAERFEIFKNNLRFIDEHNSQNHTYKVGLTKFADLTNEEY 60
Query: 105 KEMFLGLKPDLARRKDQSH---EDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAF 161
+ MFLG + D RR +S E +++K LP+SVDWR KGAV +K+QGSCGSCWAF
Sbjct: 61 RAMFLGTRSDAKRRLMKSKSPSERYAFKAGDKLPESVDWRAKGAVNPIKDQGSCGSCWAF 120
Query: 162 STVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDY 221
STVAAVEGINQIVTG L SLSEQEL+DCD TYN GCNGGLMDYAFQ+I++ GGL E+DY
Sbjct: 121 STVAAVEGINQIVTGELISLSEQELVDCDRTYNAGCNGGLMDYAFQFIINNGGLDTEKDY 180
Query: 222 PYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSG 281
PY+ ++ C+ K +++ V+I+G+ DV E +L KA+A+QP+SVAIEASG QFY
Sbjct: 181 PYVGDDDKCDKDKMKTKAVSIDGFEDVLPYDEKALQKAVAHQPVSVAIEASGMALQFYQS 240
Query: 282 GVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKP-EGLCG 340
GV+ G CGT LDHGV VGY S GLDY +V+NSWG +WGE GYI+M+RN G G CG
Sbjct: 241 GVFTGECGTALDHGVVVVGYASENGLDYWLVRNSWGTEWGEHGYIKMQRNVGDTYTGRCG 300
Query: 341 INKMASYPIK 350
I +SYP+K
Sbjct: 301 IAMESSYPVK 310
>gi|414870137|tpg|DAA48694.1| TPA: vignain [Zea mays]
Length = 484
Score = 360 bits (925), Expect = 5e-97, Method: Compositional matrix adjust.
Identities = 181/326 (55%), Positives = 220/326 (67%), Gaps = 11/326 (3%)
Query: 33 YSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKNYWLG 92
+ EDL S + L L+E W + + L +K RF +FK N+R I E NR+ + Y L
Sbjct: 141 FGAEDLASEEALWALYERWRGR-HALARDLGDKARRFNVFKANVRLIHEFNRRDEPYKLR 199
Query: 93 LNEFADLRHEEFKEMFLGLKPDLAR--RKDQ-----SHEDFSYKDVVDLPKSVDWRKKGA 145
LN F D+ +EF+ + G + R R D+ S F Y D D+P SVDWR+KGA
Sbjct: 200 LNRFGDMTADEFRRHYAGSRVAHHRMFRGDRQGSSASASSFMYADARDVPASVDWRQKGA 259
Query: 146 VTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYA 205
VT VK+QG CGSCWAFST+AAVEGIN I T NL SLSEQ+L+DCD N GCNGGLMDYA
Sbjct: 260 VTDVKDQGQCGSCWAFSTIAAVEGINAIKTKNLTSLSEQQLVDCDTKANAGCNGGLMDYA 319
Query: 206 FQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPL 265
FQYI GG+ E+ YPY + +C+ K + VVTI+GY DVP N E +L KA+A+QP+
Sbjct: 320 FQYIAKHGGVAAEDAYPYRARQASCK--KSPAPVVTIDGYEDVPANDESALKKAVAHQPV 377
Query: 266 SVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTR-GLDYIIVKNSWGPKWGEKG 324
SVAIEASG FQFYS GV+ G CGT+LDHGVAAVGYG T G Y +VKNSWGP+WGEKG
Sbjct: 378 SVAIEASGSHFQFYSEGVFSGRCGTELDHGVAAVGYGVTADGTKYWLVKNSWGPEWGEKG 437
Query: 325 YIRMKRNTGKPEGLCGINKMASYPIK 350
YIRM R+ EG CGI ASYP+K
Sbjct: 438 YIRMARDVAAKEGHCGIAMEASYPVK 463
>gi|225438807|ref|XP_002283263.1| PREDICTED: germination-specific cysteine protease 1-like isoform 1
[Vitis vinifera]
Length = 374
Score = 360 bits (924), Expect = 5e-97, Method: Compositional matrix adjust.
Identities = 176/315 (55%), Positives = 222/315 (70%), Gaps = 4/315 (1%)
Query: 40 SNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADL 99
S ++++ +++ WM+K K Y L EK +RFEIFKDNL+ IDE N + + Y +GLN FADL
Sbjct: 38 SEEEVMGMYQWWMAKHGKAYNGLGEKEKRFEIFKDNLKFIDEHNAQNRTYKVGLNRFADL 97
Query: 100 RHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVD---LPKSVDWRKKGAVTHVKNQGSCG 156
+EE++ ++LG + D RR + V+ LP+SVDWR+ GAV VK+Q SCG
Sbjct: 98 TNEEYRAIYLGTRSDPKRRFAKLKNASPRYAVMPGEVLPESVDWRETGAVNPVKDQRSCG 157
Query: 157 SCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLH 216
SCWAFSTVAAVEGINQIVTG L SLSEQEL+DCD Y+ GCNGGLMDYAF +I+ GGL
Sbjct: 158 SCWAFSTVAAVEGINQIVTGELISLSEQELVDCDTEYDMGCNGGLMDYAFDFIIKNGGLD 217
Query: 217 KEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDF 276
E+DYPY +G C ++ S+VV+I+GY DVP E +L KA+A+QP+SVA+EA GR
Sbjct: 218 TEKDYPYTGFDGECNLSGKSSKVVSIDGYEDVPPFDEKALQKAVAHQPVSVAVEAGGRAL 277
Query: 277 QFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKP- 335
Q Y G++ G CGT LDHG+ AVGYG+ G DY IV+NSWG WGE GYIRM+RN
Sbjct: 278 QLYVSGIFTGECGTALDHGIVAVGYGTENGTDYWIVRNSWGSSWGENGYIRMERNMADAF 337
Query: 336 EGLCGINKMASYPIK 350
G CGI ASYPIK
Sbjct: 338 SGKCGIAMEASYPIK 352
>gi|50355619|dbj|BAD29958.1| cysteine protease [Daucus carota]
Length = 496
Score = 360 bits (924), Expect = 6e-97, Method: Compositional matrix adjust.
Identities = 175/331 (52%), Positives = 229/331 (69%), Gaps = 5/331 (1%)
Query: 25 ARDFSIVGYSPED---LTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDE 81
A D SI+ Y ++D+ LFESW+ K Y +L E+ +RF+IFK+NLR+IDE
Sbjct: 19 ATDMSIITYDETHAVGFKTDDEATTLFESWLVTHGKSYNALGEEEKRFQIFKNNLRYIDE 78
Query: 82 TNR-KIKNYWLGLNEFADLRHEEFKEMFLGLKP-DLARRKDQSHEDFSYKDVVDLPKSVD 139
N + + + LGLN+FADL +EE++ + G+K DL ++ ++ LP+SVD
Sbjct: 79 QNLVEDRGFKLGLNKFADLTNEEYRSKYTGIKSKDLRKKVSAKSGRYATLSGESLPESVD 138
Query: 140 WRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNG 199
WR+ GAV VK+QGSCGSCWAFST++AVEGINQI TG L +LSEQEL+DCD +YN GCNG
Sbjct: 139 WRESGAVATVKDQGSCGSCWAFSTISAVEGINQIATGKLITLSEQELVDCDRSYNEGCNG 198
Query: 200 GLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKA 259
GLMDYAF++I++ GG+ + DYPY +G C+ + ++VVTI+ Y DVP E +L KA
Sbjct: 199 GLMDYAFEFIINNGGIDTDVDYPYTGRDGKCDQYRKNAKVVTIDSYEDVPAYDELALKKA 258
Query: 260 LANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPK 319
ANQP+SVAIEASGRDFQFY G++ G CG LDHGV VGYG+ G DY IV+NSWG
Sbjct: 259 AANQPISVAIEASGRDFQFYDSGIFTGKCGIALDHGVVVVGYGTENGKDYWIVRNSWGAD 318
Query: 320 WGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
WGE GY+RM+R G+CGI SYP+K
Sbjct: 319 WGENGYLRMERGISSKTGICGIAIEPSYPVK 349
>gi|162463464|ref|NP_001104879.1| cysteine proteinase Mir3 precursor [Zea mays]
gi|2425066|gb|AAB88263.1| cysteine proteinase Mir3 [Zea mays]
Length = 480
Score = 360 bits (924), Expect = 6e-97, Method: Compositional matrix adjust.
Identities = 171/328 (52%), Positives = 228/328 (69%), Gaps = 8/328 (2%)
Query: 28 FSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRK-- 85
SIV Y ++++ ++ WM+ + Y ++ + R+++F+DNLR+ID N
Sbjct: 27 MSIVSYGER---TDEEARRMYAEWMAAHGRTYNAVGAEERRYQVFRDNLRYIDAHNAAAD 83
Query: 86 --IKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKK 143
+ ++ LGLN FADL ++E+ +LG + R + + D DLP+SVDWR K
Sbjct: 84 AGVHSFRLGLNRFADLTNDEYPATYLGARTRPQRDRKLGAR-YHAADNEDLPESVDWRAK 142
Query: 144 GAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMD 203
GAV VK+QGSCG+CWAFST+AAVEGINQIVTG+L SLSEQEL+DCD +YN GCNGGLMD
Sbjct: 143 GAVAEVKDQGSCGTCWAFSTIAAVEGINQIVTGDLISLSEQELVDCDTSYNQGCNGGLMD 202
Query: 204 YAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQ 263
YAF++I++ GG+ E+DYPY +G C++ + ++VVTI+ Y DVP N E SL KA+ANQ
Sbjct: 203 YAFEFIINNGGIDTEKDYPYKGTDGRCDVNRKNAKVVTIDSYEDVPANDEKSLQKAVANQ 262
Query: 264 PLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEK 323
P+SVAIEA+G FQ YS G++ G CGT+LDHGV AVGYG+ G DY IVKNSWG WGE
Sbjct: 263 PVSVAIEAAGTAFQLYSSGIFTGSCGTRLDHGVTAVGYGTENGKDYWIVKNSWGSSWGES 322
Query: 324 GYIRMKRNTGKPEGLCGINKMASYPIKK 351
GY+RM+RN G CGI SYP+K+
Sbjct: 323 GYVRMERNIKASSGKCGIAVEPSYPLKE 350
>gi|157093728|gb|ABV22590.1| KDEL-tailed cysteine endopeptidase [Solanum lycopersicum]
Length = 360
Score = 360 bits (924), Expect = 6e-97, Method: Compositional matrix adjust.
Identities = 185/347 (53%), Positives = 234/347 (67%), Gaps = 10/347 (2%)
Query: 8 KTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLE 67
K L+ F ++ +R + DF ++L + +K +L+E W S V SLDEK +
Sbjct: 3 KLFLVLFTLALVLRLGESFDFH-----EKELETEEKFWELYERWRS-HHTVSRSLDEKHK 56
Query: 68 RFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARR---KDQSHE 124
RF +FK N+ ++ N+K K Y L LN+FAD+ + EF++ + G K R +++
Sbjct: 57 RFNVFKANVHYVHNFNKKDKPYKLKLNKFADMTNHEFRQHYAGSKIKHHRTLLGASRANG 116
Query: 125 DFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQ 184
F Y + ++P S+DWRKKGAVT VK+QG CGSCWAFSTV AVEGINQI T L SLSEQ
Sbjct: 117 TFMYANEDNVPPSIDWRKKGAVTPVKDQGQCGSCWAFSTVVAVEGINQIKTKKLVSLSEQ 176
Query: 185 ELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTING 244
EL+DCD T N GCNGGLMD AF +I GG+ EE YPY E+ C++ K + VV+I+G
Sbjct: 177 ELVDCDTTENQGCNGGLMDPAFDFIKKRGGITTEERYPYKAEDDKCDIQKRNTPVVSIDG 236
Query: 245 YHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGST 304
+ DVP N ED+LLKA+ANQP+SVAI+ASG FQFYS GV+ G CGT+LDHGVA VGYG+T
Sbjct: 237 HEDVPPNDEDALLKAVANQPISVAIDASGSQFQFYSEGVFTGECGTELDHGVAIVGYGTT 296
Query: 305 -RGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
G Y IVKNSWG WGEKGYIRM+R EGLCGI SYPIK
Sbjct: 297 VDGTKYWIVKNSWGAGWGEKGYIRMQRKVDAEEGLCGIAMQPSYPIK 343
>gi|3980198|emb|CAA46863.1| thiolprotease [Pisum sativum]
Length = 464
Score = 359 bits (921), Expect = 1e-96, Method: Compositional matrix adjust.
Identities = 186/351 (52%), Positives = 239/351 (68%), Gaps = 12/351 (3%)
Query: 9 TILISFCISFFIRSSFARDFSIVGYS---PEDLT--SNDKLIDLFESWMSKFEKVYESLD 63
TIL I+ S A D I+ Y P+ T +ND+++ ++E W+ K K Y +L
Sbjct: 6 TILF---ITLTFTLSLALDMCIISYDKTHPDKSTPRTNDQVLTMYEEWLVKHGKNYNALG 62
Query: 64 EKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLG--LKPDLARRKDQ 121
EK +RFEIFKDNL IDE N K ++ LGLN FADL +EE++ FLG + P+ RK
Sbjct: 63 EKEKRFEIFKDNLGFIDEHNSKNLSFRLGLNRFADLTNEEYRTRFLGTRINPNRRNRKVN 122
Query: 122 SHEDFSYKDVVD-LPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLAS 180
S + V D LP+SVDWRK+GAV VK+QGSCGSCWAFS +AAVEG+N++ TG+L S
Sbjct: 123 SQTNRYATRVGDKLPESVDWRKEGAVVGVKDQGSCGSCWAFSAIAAVEGVNKLATGDLIS 182
Query: 181 LSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVV 240
LSEQEL+DCD +YN GCNGGLMDYAF++I++ L EEDYPY +G C+ + ++VV
Sbjct: 183 LSEQELVDCDTSYNEGCNGGLMDYAFEFIINMVALTPEEDYPYRAIDGRCDQNRKNAKVV 242
Query: 241 TINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVG 300
+I+ Y DVP E +L KA+ANQ ++VA+E GR+FQ Y GV+ G CGT LDHGVAAVG
Sbjct: 243 SIDQYEDVPAYDEGALKKAVANQVIAVAVEGGGREFQLYDSGVFTGRCGTALDHGVAAVG 302
Query: 301 YGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPE-GLCGINKMASYPIK 350
YG+ G DY IV+NSWG WGE GYIR++RN + G CGI SYPIK
Sbjct: 303 YGTENGKDYWIVRNSWGGSWGEAGYIRLERNLATSKSGKCGIAIEPSYPIK 353
>gi|356564154|ref|XP_003550321.1| PREDICTED: cysteine proteinase RD21a-like [Glycine max]
Length = 476
Score = 358 bits (919), Expect = 2e-96, Method: Compositional matrix adjust.
Identities = 184/347 (53%), Positives = 235/347 (67%), Gaps = 10/347 (2%)
Query: 12 ISFCISFFIRSSFARDFSIVGY------SPEDLTSNDKLIDLFESWMSKFEKVYESLDEK 65
I + F SS A D SI+ Y L + ++L+ ++E W+ K KVY +L EK
Sbjct: 18 IVLLFTVFAVSS-ALDMSIISYDSAHADKAATLRTEEELMSMYEQWLVKHGKVYNALGEK 76
Query: 66 LERFEIFKDNLRHIDETNR-KIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHE 124
+RF+IFKDNLR ID+ N + + Y LGLN FADL +EE++ +LG K D RR ++
Sbjct: 77 EKRFQIFKDNLRFIDDHNSAEDRTYKLGLNRFADLTNEEYRAKYLGTKIDPNRRLGKTPS 136
Query: 125 DFSYKDVVD-LPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSE 183
+ V D LP SVDWRK+GAV VK+QG CGSCWAFS + AVEGIN+IVTG L SLSE
Sbjct: 137 NRYAPRVGDKLPDSVDWRKEGAVPPVKDQGGCGSCWAFSAIGAVEGINKIVTGELISLSE 196
Query: 184 QELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTIN 243
QEL+DCD YN GCNGGLMDYAF++I++ GG+ +EDYPY +G C+ + ++VV+I+
Sbjct: 197 QELVDCDTGYNQGCNGGLMDYAFEFIINNGGIDSDEDYPYRGVDGRCDTYRKNAKVVSID 256
Query: 244 GYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGS 303
Y DVP E +L KA+ANQP+SVAIE GR+FQ Y GV+ G CGT LDHGV AVGYG+
Sbjct: 257 DYEDVPAYDELALKKAVANQPVSVAIEGGGREFQLYVSGVFTGRCGTALDHGVVAVGYGT 316
Query: 304 TRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPE-GLCGINKMASYPI 349
+G DY IV+NSWG WGE GYIR++RN G CGI SYP+
Sbjct: 317 AKGHDYWIVRNSWGSSWGEDGYIRLERNLANSRSGKCGIAIEPSYPL 363
>gi|225456820|ref|XP_002278323.1| PREDICTED: vignain [Vitis vinifera]
Length = 360
Score = 358 bits (919), Expect = 2e-96, Method: Compositional matrix adjust.
Identities = 179/322 (55%), Positives = 224/322 (69%), Gaps = 5/322 (1%)
Query: 33 YSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKNYWLG 92
+ ++L + + L +L+E W S V SLDEK +RF +FK+N+ + E N+K + Y L
Sbjct: 23 FHQKELETEESLWNLYERWRSH-HTVSRSLDEKHKRFNVFKENVNFVHEFNKKDEPYKLK 81
Query: 93 LNEFADLRHEEFKEMFLGLKPD---LARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHV 149
LN+FAD+ + EF+ + G K + + R + F Y+ V +P SVDWRKKGAVT +
Sbjct: 82 LNKFADMTNHEFRSTYAGSKVNHHRMFRGSQHAAGSFMYEKVKSVPPSVDWRKKGAVTPI 141
Query: 150 KNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYI 209
K+QG CGSCWAFSTV AVEGIN I T L SLSEQEL+DCD + N GCNGGLM YAF++I
Sbjct: 142 KDQGQCGSCWAFSTVVAVEGINHIKTNKLVSLSEQELVDCDTSENQGCNGGLMGYAFEFI 201
Query: 210 VSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAI 269
GG+ E+ YPY E+GTC+++K S VV+I+G+ VP N+ED+LLKA ANQP+SVAI
Sbjct: 202 KEKGGITTEQSYPYTAEDGTCDVSKVNSPVVSIDGHETVPPNNEDALLKAAANQPISVAI 261
Query: 270 EASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGST-RGLDYIIVKNSWGPKWGEKGYIRM 328
+A G FQFYS GV+ G CGT LDHGVA VGYG+T G Y IVKNSWG WGE GYIRM
Sbjct: 262 DAGGSAFQFYSEGVFAGRCGTDLDHGVAIVGYGTTLDGTKYWIVKNSWGTDWGENGYIRM 321
Query: 329 KRNTGKPEGLCGINKMASYPIK 350
KR EGLCGI ASYPIK
Sbjct: 322 KRGISAKEGLCGIAVEASYPIK 343
>gi|355344587|gb|AER60490.1| cysteine proteases [Gossypium hirsutum]
Length = 371
Score = 358 bits (918), Expect = 3e-96, Method: Compositional matrix adjust.
Identities = 176/316 (55%), Positives = 230/316 (72%), Gaps = 6/316 (1%)
Query: 40 SNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETN-RKIKNYWLGLNEFAD 98
S+D+++ L++SW+ + K Y + E+ +RFEIFKDNLR IDE N Y LGLN+FAD
Sbjct: 37 SDDEVMGLYKSWVIQHGKAYNGIGEEEKRFEIFKDNLRFIDEHNSNNNTTYKLGLNKFAD 96
Query: 99 LRHEEFKEMFLGLKPDLARRKDQSH---EDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSC 155
L ++E++ FLG + D RR +S ++++ +LP SVDWR GAV+ VK+QGSC
Sbjct: 97 LTNQEYRAKFLGTRTDPRRRLMKSKIPSSRYAHRAGDNLPDSVDWRDHGAVSPVKDQGSC 156
Query: 156 GSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGL 215
GSCWAFST+A VEGIN+IV+G L SLSEQEL+DCD +Y+ GCNGGLMDYAFQ+I+ GG+
Sbjct: 157 GSCWAFSTIATVEGINKIVSGELVSLSEQELVDCDRSYDAGCNGGLMDYAFQFIMDNGGI 216
Query: 216 HKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRD 275
E+DYPY+ C+ TK ++VV+I+GY DVP N+E++L KA+A+QP+S+AIEA GR
Sbjct: 217 DTEKDYPYLGFNNQCDPTKKNAKVVSIDGYEDVP-NNENALKKAVAHQPVSIAIEAGGRA 275
Query: 276 FQFYSGGVYDGHCGTQLDHGVAAVGYGS-TRGLDYIIVKNSWGPKWGEKGYIRMKRNTGK 334
FQ Y GV++G CG LDHGV AVGYG+ G DY IV+NSWG WGE GYIRM+RN
Sbjct: 276 FQLYESGVFNGECGLALDHGVVAVGYGTDDNGQDYWIVRNSWGSNWGENGYIRMERNINA 335
Query: 335 PEGLCGINKMASYPIK 350
G CGI ASYP+K
Sbjct: 336 NTGKCGIAMEASYPVK 351
>gi|168006315|ref|XP_001755855.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162693174|gb|EDQ79528.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 454
Score = 358 bits (918), Expect = 3e-96, Method: Compositional matrix adjust.
Identities = 178/317 (56%), Positives = 222/317 (70%), Gaps = 5/317 (1%)
Query: 37 DLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEF 96
DL + L + F +W K KVY SL+E R+ ++KDNL +I + K ++YWLGL +F
Sbjct: 35 DLGNERLLSEQFGAWAHKHGKVYSSLEEHAHRYMVWKDNLEYIQRHSEKNRSYWLGLTKF 94
Query: 97 ADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCG 156
AD+ ++EF+ + G + D ++R + F Y D + P+SVDWRKKGAVT VK+QGSCG
Sbjct: 95 ADITNDEFRRQYTGTRIDRSKRSKRK-TGFRYADS-EAPESVDWRKKGAVTTVKDQGSCG 152
Query: 157 SCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLH 216
SCWAFS + +VEGIN I TG SLSEQEL+DCD YN GCNGGLMDYAF +I+ GG+
Sbjct: 153 SCWAFSAIGSVEGINAIRTGEAVSLSEQELVDCDLEYNQGCNGGLMDYAFDFILENGGID 212
Query: 217 KEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDF 276
E DYPY +G C+ K + VVTI+GY DVP+N E++L KA+A QP+SVAIEA GRDF
Sbjct: 213 TENDYPYKGLDGRCDNNKKNAHVVTIDGYEDVPENDEEALKKAVAGQPVSVAIEAGGRDF 272
Query: 277 QFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRN---TG 333
Q YSGGV+ G CGT LDHGV AVGYGS LDY IVKNSWG WGE GY+RM+RN +
Sbjct: 273 QLYSGGVFTGECGTDLDHGVLAVGYGSEGSLDYWIVKNSWGEYWGESGYLRMQRNIKDSN 332
Query: 334 KPEGLCGINKMASYPIK 350
GLCGIN SY +K
Sbjct: 333 HQFGLCGINIEPSYAVK 349
>gi|218181|dbj|BAA14402.1| oryzain alpha precursor [Oryza sativa Japonica Group]
Length = 458
Score = 358 bits (918), Expect = 3e-96, Method: Compositional matrix adjust.
Identities = 175/329 (53%), Positives = 229/329 (69%), Gaps = 8/329 (2%)
Query: 27 DFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRK- 85
D SIV Y S ++ L+ W ++ K Y ++ E+ R+ F+DNLR+IDE N
Sbjct: 22 DMSIVSYGER---SEEEARRLYAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDEHNAAA 78
Query: 86 ---IKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRK 142
+ ++ LGLN FADL +EE+++ +LGL+ + RR+ + + + D LP+SVDWR
Sbjct: 79 DAGVHSFRLGLNRFADLTNEEYRDTYLGLR-NKPRRERKVSDRYLAADNEALPESVDWRT 137
Query: 143 KGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLM 202
KGAV +K+QG CGSCWAFS +AAVE INQIVTG+L SLSEQEL+DCD +YN GCNGGLM
Sbjct: 138 KGAVAEIKDQGGCGSCWAFSAIAAVEDINQIVTGDLISLSEQELVDCDTSYNEGCNGGLM 197
Query: 203 DYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN 262
DYAF +I++ GG+ E+DYPY ++ C++ + ++VVTI+ Y DV NSE SL KA+ N
Sbjct: 198 DYAFDFIINNGGIDTEDDYPYKGKDERCDVNRKNAKVVTIDSYEDVTPNSETSLQKAVRN 257
Query: 263 QPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGE 322
QP+SVAIEA GR FQ YS G++ G CGT LDHGVAAVGYG+ G DY IV+NSWG WGE
Sbjct: 258 QPVSVAIEAGGRAFQLYSSGIFTGKCGTALDHGVAAVGYGTENGKDYWIVRNSWGKSWGE 317
Query: 323 KGYIRMKRNTGKPEGLCGINKMASYPIKK 351
GY+RM+RN G CGI SYP+KK
Sbjct: 318 SGYVRMERNIKASSGKCGIAVEPSYPLKK 346
>gi|2511693|emb|CAB17076.1| cysteine proteinase precursor [Phaseolus vulgaris]
Length = 455
Score = 358 bits (918), Expect = 3e-96, Method: Compositional matrix adjust.
Identities = 182/342 (53%), Positives = 237/342 (69%), Gaps = 8/342 (2%)
Query: 16 ISFFIRSSFARDFSIVGYS-----PEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFE 70
+ F SS A D SI+ Y +++++ L+E W+ K K+Y +L EK +RF+
Sbjct: 4 FALFALSS-ALDMSIISYDNAHQDKATWRTDEEVNSLYEEWLVKHGKLYNALGEKDKRFQ 62
Query: 71 IFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHED-FSYK 129
IFKDNLR ID+ N + + Y LGLN FADL +EE++ +LG K D RR ++ + ++ +
Sbjct: 63 IFKDNLRFIDQQNAENRTYKLGLNRFADLTNEEYRARYLGTKIDPNRRLGRTPSNRYAPR 122
Query: 130 DVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDC 189
LP SVDWRK+GAV VK+Q SCGSCWAFS + AVEGIN+IVTG+L SLSEQEL+DC
Sbjct: 123 VGETLPDSVDWRKEGAVVPVKDQASCGSCWAFSAIGAVEGINKIVTGDLISLSEQELVDC 182
Query: 190 DNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVP 249
D YN GCNGGLMDYAF++I+ GG+ EEDYPY +G C+ + ++VV+I+GY DV
Sbjct: 183 DTGYNMGCNGGLMDYAFEFIIKNGGIDSEEDYPYKGVDGRCDEYRKNAKVVSIDGYEDVN 242
Query: 250 QNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDY 309
E +L KA+ANQP+SVA+E GR+FQ YS GV+ G CGT LDHGV AVGYG+ G D+
Sbjct: 243 TYDELALKKAVANQPVSVAVEGGGREFQLYSSGVFTGRCGTALDHGVVAVGYGTDNGHDF 302
Query: 310 IIVKNSWGPKWGEKGYIRMKRNTGKPE-GLCGINKMASYPIK 350
IV+NSWG WGE+GYIR++RN G G CGI SYPIK
Sbjct: 303 WIVRNSWGADWGEEGYIRLERNLGNSRSGKCGIAIEPSYPIK 344
>gi|363814535|ref|NP_001242660.1| uncharacterized protein LOC100807362 precursor [Glycine max]
gi|255636658|gb|ACU18666.1| unknown [Glycine max]
Length = 367
Score = 357 bits (917), Expect = 4e-96, Method: Compositional matrix adjust.
Identities = 177/352 (50%), Positives = 239/352 (67%), Gaps = 8/352 (2%)
Query: 5 SQFKTILISFCISFFIRSSFARDFSIVGY--SPEDLT---SNDKLIDLFESWMSKFEKVY 59
S TILI + S+ D SI+ Y S D + S+++++ ++E W+ K KVY
Sbjct: 6 SLMATILIVLFTVLAVSSAL--DMSIISYDRSHADKSGWKSDEEVMSIYEEWLVKHGKVY 63
Query: 60 ESLDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRK 119
+++EK +RF+IFKDNL I+E N + Y +GLN F+DL +EE++ +LG K D +R
Sbjct: 64 NAVEEKEKRFQIFKDNLNFIEEHNAVNRTYKVGLNRFSDLSNEEYRSKYLGTKIDPSRMM 123
Query: 120 DQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLA 179
+ +S + +LP+SVDWRK+GAV VKNQ C CWAFS +AAVEGIN+IVTGNL
Sbjct: 124 ARPSRRYSPRVADNLPESVDWRKEGAVVRVKNQSECEGCWAFSAIAAVEGINKIVTGNLT 183
Query: 180 SLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEV 239
+LSEQEL+DCD T N GC+GGL+DYAF++I++ GG+ EEDYP+ +G C+ K +
Sbjct: 184 ALSEQELLDCDRTVNAGCSGGLVDYAFEFIINNGGIDTEEDYPFQGADGICDQYKINARA 243
Query: 240 VTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAV 299
VTI+GY VP E +L KA+ANQP+SVAIEA G++FQ Y G++ G CGT +DHGV AV
Sbjct: 244 VTIDGYERVPAYDELALKKAVANQPVSVAIEAYGKEFQLYESGIFTGTCGTSIDHGVTAV 303
Query: 300 GYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGK-PEGLCGINKMASYPIK 350
GYG+ G+DY IVKNSWG WGE GY+ M+RN + G CGI + YPIK
Sbjct: 304 GYGTENGIDYWIVKNSWGENWGEAGYVGMERNIAEDTAGKCGIAILTLYPIK 355
>gi|32396020|gb|AAP41847.1| senescence-associated cysteine protease [Anthurium andraeanum]
Length = 460
Score = 357 bits (916), Expect = 4e-96, Method: Compositional matrix adjust.
Identities = 177/312 (56%), Positives = 216/312 (69%), Gaps = 7/312 (2%)
Query: 47 LFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKN--YWLGLNEFADLRHEEF 104
L+E W+ K Y L EK RFEIF DNLR+ID+ NR N Y LGL FADL +EE+
Sbjct: 37 LYEGWLVGNGKAYNLLGEKERRFEIFWDNLRYIDDHNRAENNHSYTLGLTRFADLTNEEY 96
Query: 105 KEMFLGLKPDLARRKDQSHEDFSYKDVV----DLPKSVDWRKKGAVTHVKNQGSCGSCWA 160
+ +LG+KP R + + +D+ DLP+ VDWR+KGAV +K+QG CGSCWA
Sbjct: 97 RSTYLGVKPGQVRPRRANRAPGRGRDLSANGDDLPQKVDWREKGAVAPIKDQGGCGSCWA 156
Query: 161 FSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEED 220
FSTVAAVEGINQIVTG+L LSEQEL+DCD YN GCNGGLMDYAFQ+I+S GG+ EED
Sbjct: 157 FSTVAAVEGINQIVTGDLIVLSEQELVDCDTAYNEGCNGGLMDYAFQFIISNGGIDTEED 216
Query: 221 YPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYS 280
YPY +G C+ + ++VV+I+ Y DV +N E +L A+A+QP+SVAIE GR FQ Y
Sbjct: 217 YPYKERDGLCDPNRKNAKVVSIDSYEDVLENDEHALKTAVAHQPVSVAIEGGGRSFQLYK 276
Query: 281 GGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRN-TGKPEGLC 339
G++DG CG LDHGV AVGYG+ G DY IV+NSWG WGE GYIRM+RN G C
Sbjct: 277 SGIFDGRCGIDLDHGVVAVGYGTESGKDYWIVRNSWGKSWGEAGYIRMERNLPSSSSGKC 336
Query: 340 GINKMASYPIKK 351
GI SYPIKK
Sbjct: 337 GIAIEPSYPIKK 348
>gi|18141281|gb|AAL60578.1|AF454956_1 senescence-associated cysteine protease [Brassica oleracea]
Length = 445
Score = 357 bits (916), Expect = 5e-96, Method: Compositional matrix adjust.
Identities = 177/313 (56%), Positives = 219/313 (69%), Gaps = 6/313 (1%)
Query: 41 NDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNR-KIKNYWLGLNEFADL 99
N + + +FE W+ + K Y L EK +RFEIF DNL+ + E N ++Y LGL FADL
Sbjct: 30 NPEEVKMFERWLVENHKNYNGLGEKDKRFEIFMDNLKFVQEHNSVPNQSYELGLTRFADL 89
Query: 100 RHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVD-LPKSVDWRKKGAVTHVKNQGSCGSC 158
+EEF+ ++L + + R +D + +V D LP VDWR KGAV VK+QGSCGSC
Sbjct: 90 TNEEFRAIYL--RSKMERTRDSVKSERYLHNVGDKLPDEVDWRAKGAVVPVKDQGSCGSC 147
Query: 159 WAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKE 218
WAFS + AVEGINQI TG L SLSEQEL+DCD +YNNGC GGLMDYAFQ+I+S GG+ E
Sbjct: 148 WAFSAIGAVEGINQIKTGELVSLSEQELVDCDTSYNNGCGGGLMDYAFQFIISNGGIDTE 207
Query: 219 EDYPYI-MEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQ 277
EDYPY ++ C K + VVTI+GY DVP+N E+SL KALANQP+SVAIEA GR FQ
Sbjct: 208 EDYPYTATDDNICNTDKKNTRVVTIDGYEDVPEN-ENSLKKALANQPISVAIEAGGRGFQ 266
Query: 278 FYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEG 337
Y GV+ G CGT LDHGV AVGYG++ G DY I++NSWG WGE GYI+++RN G
Sbjct: 267 LYKSGVFTGTCGTALDHGVVAVGYGTSEGQDYWIIRNSWGSNWGESGYIKLQRNIKDSSG 326
Query: 338 LCGINKMASYPIK 350
CG+ MASYP K
Sbjct: 327 KCGVAMMASYPTK 339
>gi|351726339|ref|NP_001237379.1| cysteine proteinase precursor [Glycine max]
gi|31559526|dbj|BAC77521.1| cysteine proteinase [Glycine max]
gi|31559528|dbj|BAC77522.1| cysteine proteinase [Glycine max]
Length = 362
Score = 357 bits (915), Expect = 6e-96, Method: Compositional matrix adjust.
Identities = 175/319 (54%), Positives = 224/319 (70%), Gaps = 5/319 (1%)
Query: 36 EDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNE 95
+DL S + DL+E W S V SL +K +RF +FK N+ H+ TN+ K Y L LN+
Sbjct: 28 KDLASEESFWDLYERWRS-HHTVSRSLGDKHKRFNVFKANVMHVHNTNKMDKPYKLKLNK 86
Query: 96 FADLRHEEFKEMFLGLKPDLARR---KDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQ 152
FAD+ + EF+ + G K + R + + F Y+ V +P SVDWRK GAVT VK+Q
Sbjct: 87 FADMTNHEFRSTYAGSKVNHHRMFQGTPRGNGTFMYEKVGSVPPSVDWRKNGAVTGVKDQ 146
Query: 153 GSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVST 212
G CGSCWAFSTV AVEGINQI T L SLSEQEL+DCD N GCNGGLM+ AF++I
Sbjct: 147 GQCGSCWAFSTVVAVEGINQIKTNKLVSLSEQELVDCDTKKNAGCNGGLMESAFEFIKQK 206
Query: 213 GGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEAS 272
GG+ E +YPY ++GTC+ +K V+I+G+ +VP N E++LLKA+ANQP+SVAI+A
Sbjct: 207 GGITTESNYPYTAQDGTCDASKANDLAVSIDGHENVPANDENALLKAVANQPVSVAIDAG 266
Query: 273 GRDFQFYSGGVYDGHCGTQLDHGVAAVGYGST-RGLDYIIVKNSWGPKWGEKGYIRMKRN 331
G DFQFYS GV+ G C T+L+HGVA VGYG+T G +Y V+NSWGP+WGE+GYIRM+R+
Sbjct: 267 GSDFQFYSEGVFTGDCSTELNHGVAIVGYGTTVDGTNYWTVRNSWGPEWGEQGYIRMQRS 326
Query: 332 TGKPEGLCGINKMASYPIK 350
K EGLCGI MASYPIK
Sbjct: 327 ISKKEGLCGIAMMASYPIK 345
>gi|255567869|ref|XP_002524912.1| cysteine protease, putative [Ricinus communis]
gi|223535747|gb|EEF37409.1| cysteine protease, putative [Ricinus communis]
Length = 366
Score = 357 bits (915), Expect = 6e-96, Method: Compositional matrix adjust.
Identities = 182/360 (50%), Positives = 244/360 (67%), Gaps = 15/360 (4%)
Query: 1 MALSSQFKTILISFCISFFIRSSFARDFSIVGYS-----PEDLTSNDKLIDLFESWMSKF 55
MAL T+L F S A D SI+ ++ S++++I ++ W++K
Sbjct: 1 MALPISLSTLLF-----LFFTLSSAWDMSILSHNHGHHHQSSWRSDNEVISMYNWWLAKH 55
Query: 56 EKVYESLDEKLERFEIFKDNLRHIDE-TNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPD 114
K Y L E+ +RFEIFK+NLR IDE N K + Y +GL FADL +EE++ FLG K D
Sbjct: 56 SKTYNKLGEREKRFEIFKNNLRFIDEHNNSKNRTYKVGLTRFADLTNEEYRAKFLGTKSD 115
Query: 115 LARRKDQSH---EDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGIN 171
RR +S + +++K LP+S+DWR+ GAV+ +K+QGSCGSCWAFST+AAVEG+N
Sbjct: 116 PKRRLMKSKNPSQRYAFKAGDVLPESIDWRQSGAVSAIKDQGSCGSCWAFSTIAAVEGVN 175
Query: 172 QIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCE 231
+IVTG L SLSEQEL+DCD +YN GCNGGLMD AFQ+I++ GG+ ++DYPY +G C+
Sbjct: 176 KIVTGELISLSEQELVDCDRSYNAGCNGGLMDNAFQFIINNGGIDTDKDYPYQAVDGKCD 235
Query: 232 MTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQ 291
TK +++ VTI+G+ DV E +L KA+A+QP+SVAIEASG QFY GV+ G CG+
Sbjct: 236 TTKVKNKAVTIDGFEDVMAFDEMALQKAVAHQPVSVAIEASGMALQFYQSGVFTGECGSA 295
Query: 292 LDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKP-EGLCGINKMASYPIK 350
LDHGV VGYG+ G+DY +V+NSWG WGE GYI+M+RN G CGI +SYPIK
Sbjct: 296 LDHGVVIVGYGTEDGIDYWLVRNSWGRDWGENGYIKMQRNVVDTFTGKCGIAMESSYPIK 355
>gi|18402225|ref|NP_566633.1| Granulin repeat cysteine protease family protein [Arabidopsis
thaliana]
gi|11994461|dbj|BAB02463.1| cysteine proteinase [Arabidopsis thaliana]
gi|17065298|gb|AAL32803.1| cysteine proteinase [Arabidopsis thaliana]
gi|20260004|gb|AAM13349.1| cysteine proteinase [Arabidopsis thaliana]
gi|332642713|gb|AEE76234.1| Granulin repeat cysteine protease family protein [Arabidopsis
thaliana]
Length = 452
Score = 357 bits (915), Expect = 7e-96, Method: Compositional matrix adjust.
Identities = 180/347 (51%), Positives = 230/347 (66%), Gaps = 4/347 (1%)
Query: 7 FKTILISFCISFFIRSSFARDFSIVGYSPEDLTSND-KLIDLFESWMSKFEKVYESLDEK 65
T + S ++ I S S+ + + T N+ + ++E W+ + K Y L EK
Sbjct: 1 MATSIKSITLALLIFSVLLISLSLGSVTATETTRNEAEARRMYERWLVENRKNYNGLGEK 60
Query: 66 LERFEIFKDNLRHIDE-TNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHE 124
RFEIFKDNL+ ++E ++ + Y +GL FADL ++EF+ ++L K + R + E
Sbjct: 61 ERRFEIFKDNLKFVEEHSSIPNRTYEVGLTRFADLTNDEFRAIYLRSKMERTRVPVKG-E 119
Query: 125 DFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQ 184
+ YK LP ++DWR KGAV VK+QGSCGSCWAFS + AVEGINQI TG L SLSEQ
Sbjct: 120 KYLYKVGDSLPDAIDWRAKGAVNPVKDQGSCGSCWAFSAIGAVEGINQIKTGELISLSEQ 179
Query: 185 ELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEE-GTCEMTKGESEVVTIN 243
EL+DCD +YN+GC GGLMDYAF++I+ GG+ EEDYPYI + C K + VVTI+
Sbjct: 180 ELVDCDTSYNDGCGGGLMDYAFKFIIENGGIDTEEDYPYIATDVNVCNSDKKNTRVVTID 239
Query: 244 GYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGS 303
GY DVPQN E SL KALANQP+SVAIEA GR FQ Y+ GV+ G CGT LDHGV AVGYGS
Sbjct: 240 GYEDVPQNDEKSLKKALANQPISVAIEAGGRAFQLYTSGVFTGTCGTSLDHGVVAVGYGS 299
Query: 304 TRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
G DY IV+NSWG WGE GY +++RN + G CG+ MASYP K
Sbjct: 300 EGGQDYWIVRNSWGSNWGESGYFKLERNIKESSGKCGVAMMASYPTK 346
>gi|57282619|emb|CAE54307.1| cysteine proteinase [Gossypium hirsutum]
Length = 372
Score = 357 bits (915), Expect = 7e-96, Method: Compositional matrix adjust.
Identities = 176/316 (55%), Positives = 230/316 (72%), Gaps = 6/316 (1%)
Query: 40 SNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETN-RKIKNYWLGLNEFAD 98
S+D+++ L++SW+ + K Y + E+ +RFEIFKDNLR IDE N Y LGLN+FAD
Sbjct: 38 SDDEVMGLYKSWVIQHGKAYNGIGEEEKRFEIFKDNLRFIDEHNSNNNTTYKLGLNKFAD 97
Query: 99 LRHEEFKEMFLGLKPDLARRKDQSH---EDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSC 155
L ++E++ FLG + D RR +S ++++ +LP SV+WR GAV+ VK+QGSC
Sbjct: 98 LTNQEYRAKFLGTRTDPRRRLMKSKIPSSRYAHRAGDNLPDSVNWRDHGAVSRVKDQGSC 157
Query: 156 GSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGL 215
GSCWAFS +AAVEGIN+IV+G L SLSEQEL+DCD +Y+ GCNGGLMDYAFQ+I+ GG+
Sbjct: 158 GSCWAFSAIAAVEGINKIVSGELISLSEQELVDCDRSYDAGCNGGLMDYAFQFIIDNGGI 217
Query: 216 HKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRD 275
E+DYPY+ C+ TK ++VV+I+GY DVP N+E++L KA+A+QP+S+AIEA GR
Sbjct: 218 DTEKDYPYLGFNNQCDPTKKNAKVVSIDGYEDVP-NNENALKKAVAHQPVSIAIEAGGRA 276
Query: 276 FQFYSGGVYDGHCGTQLDHGVAAVGYGS-TRGLDYIIVKNSWGPKWGEKGYIRMKRNTGK 334
FQ Y GV++G CG LDHGV AVGYGS G DY IV+NSWG WGE GYIRM+RN
Sbjct: 277 FQLYESGVFNGECGLALDHGVVAVGYGSDDNGQDYWIVRNSWGGNWGENGYIRMERNINA 336
Query: 335 PEGLCGINKMASYPIK 350
G CGI ASYP+K
Sbjct: 337 NTGKCGIAMEASYPVK 352
>gi|374530932|gb|AEP83812.2| cysteine endopeptidase EP8 [Secale cereale x Triticum durum]
Length = 364
Score = 356 bits (914), Expect = 9e-96, Method: Compositional matrix adjust.
Identities = 179/329 (54%), Positives = 225/329 (68%), Gaps = 13/329 (3%)
Query: 31 VGYSPEDLTSNDKLIDLFESWMSKFEKVYESL--DEKLERFEIFKDNLRHIDETNRKIKN 88
+ ++ +DL S + L L+E W S + L D + RF +FK+N R+I E N+K +
Sbjct: 23 IPFTEKDLASEENLRGLYERWRSHYTVSRRGLGADAEERRFNVFKENARYIHEGNKKDRP 82
Query: 89 YWLGLNEFADLRHEEFKEMFLG------LKPDLARRKDQSHEDFSYKDVVDLPKSVDWRK 142
+ L LN+FAD+ +EF+ + G L RR D S F Y D +LP +VDWR+
Sbjct: 83 FRLALNKFADMTTDEFRRTYAGSRVRHHLSLSGGRRGDGS---FRYGDADNLPPAVDWRQ 139
Query: 143 KGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLM 202
KGAVT +K+QG CGSCWAFST+ AVEGIN+I TG L SLSEQEL+DCDN N GC+GGLM
Sbjct: 140 KGAVTAIKDQGQCGSCWAFSTIVAVEGINKIRTGKLVSLSEQELMDCDNVNNQGCDGGLM 199
Query: 203 DYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN 262
DYAFQ+I G+ E +YPY E+G+C++ K ++ VTI+GY DVP N E +L KA+A
Sbjct: 200 DYAFQFI-HKNGITTESNYPYQGEQGSCDLAKEKAHAVTIDGYEDVPANDESALQKAVAG 258
Query: 263 QPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTR-GLDYIIVKNSWGPKWG 321
QP+SVAI+ASG DFQFYS GV+ G C T LDHGVAAVGYG+TR G Y IVKNSWG WG
Sbjct: 259 QPVSVAIDASGNDFQFYSEGVFTGECSTDLDHGVAAVGYGTTRDGTKYWIVKNSWGEDWG 318
Query: 322 EKGYIRMKRNTGKPEGLCGINKMASYPIK 350
EKGYIRM+R + EG CGI ASYP K
Sbjct: 319 EKGYIRMQRGVSQAEGQCGIAMQASYPTK 347
>gi|38345906|emb|CAE04498.2| OSJNBb0059K02.8 [Oryza sativa Japonica Group]
Length = 458
Score = 356 bits (913), Expect = 9e-96, Method: Compositional matrix adjust.
Identities = 175/329 (53%), Positives = 229/329 (69%), Gaps = 8/329 (2%)
Query: 27 DFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRK- 85
D SIV Y S ++ L+ W ++ K Y ++ E+ R+ F+DNLR+IDE N
Sbjct: 22 DMSIVSYGER---SEEEARRLYAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDEHNAAA 78
Query: 86 ---IKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRK 142
+ ++ LGLN FADL +EE+++ +LGL+ + RR+ + + + D LP+SVDWR
Sbjct: 79 DAGVHSFRLGLNRFADLTNEEYRDTYLGLR-NKPRRERKVSDRYLAADNEALPESVDWRT 137
Query: 143 KGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLM 202
KGAV +K+Q GSCWAFS +AAVEGINQIVTG+L SLSEQEL+DCD +YN GCNGGLM
Sbjct: 138 KGAVAEIKDQEVAGSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDTSYNEGCNGGLM 197
Query: 203 DYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN 262
DYAF +I++ GG+ E+DYPY ++ C++ + ++VVTI+ Y DV NSE SL KA+AN
Sbjct: 198 DYAFDFIINNGGIDTEDDYPYKGKDERCDVNRKNAKVVTIDSYEDVTPNSETSLQKAVAN 257
Query: 263 QPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGE 322
QP+SVAIEA GR FQ YS G++ G CGT LDHGVAAVGYG+ G DY IV+NSWG WGE
Sbjct: 258 QPVSVAIEAGGRAFQLYSSGIFTGKCGTALDHGVAAVGYGTENGKDYWIVRNSWGKSWGE 317
Query: 323 KGYIRMKRNTGKPEGLCGINKMASYPIKK 351
GY+RM+RN G CGI SYP+KK
Sbjct: 318 SGYVRMERNIKASSGKCGIAVEPSYPLKK 346
>gi|172052260|gb|ACB70409.1| cysteine protease [Nicotiana tabacum]
Length = 361
Score = 355 bits (912), Expect = 1e-95, Method: Compositional matrix adjust.
Identities = 181/311 (58%), Positives = 221/311 (71%), Gaps = 9/311 (2%)
Query: 46 DLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFK 105
+L+E W S V SLDEK +RF +FK N+ ++ N+K K Y L LN+FAD+ + EF+
Sbjct: 36 ELYERWRS-HHTVSRSLDEKDKRFNVFKANVHYVHNFNKKDKPYKLKLNKFADMTNHEFR 94
Query: 106 EMFLGLKPDLARR---KDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFS 162
+ G K R +++ F Y +P +VDWRKKGAVT VK+QG CGSCWAFS
Sbjct: 95 HHYAGSKIKHHRTFLGASRANGTFMYAHEDSVPPTVDWRKKGAVTPVKDQGKCGSCWAFS 154
Query: 163 TVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYP 222
TV AVEGINQI T L SLSEQEL+DCD + N GCNGGLMD AF++I GG++ EE+YP
Sbjct: 155 TVVAVEGINQIKTNELVSLSEQELVDCDTSQNQGCNGGLMDMAFEFIKKKGGINTEENYP 214
Query: 223 YIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGG 282
Y+ E G C++ K S VV+I+G+ DVP N E SLLKA+ANQP+SVAI+ASG DFQFYS G
Sbjct: 215 YMAEGGECDIQKRNSPVVSIDGHEDVPPNDEGSLLKAVANQPVSVAIQASGSDFQFYSEG 274
Query: 283 VYDGHCGTQLDHGVAAVGYGSTRGLD---YIIVKNSWGPKWGEKGYIRMKRNTGKPEGLC 339
V+ G CGT+LDHGVA VGYG+T LD Y IVKNSWGP+WGEKGYIRM+R EGLC
Sbjct: 275 VFTGDCGTELDHGVAIVGYGTT--LDRTKYWIVKNSWGPEWGEKGYIRMQREIDAEEGLC 332
Query: 340 GINKMASYPIK 350
GI SYPIK
Sbjct: 333 GIAMQPSYPIK 343
>gi|449525012|ref|XP_004169515.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
Length = 459
Score = 355 bits (912), Expect = 2e-95, Method: Compositional matrix adjust.
Identities = 177/341 (51%), Positives = 233/341 (68%), Gaps = 6/341 (1%)
Query: 11 LISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLE-RF 69
+++ FI S A SI+ P+ ++D+++ L++ W +K K++ +L + E RF
Sbjct: 9 IMALLFFLFIALSAASPSSII---PQ--RTDDEVMALYDQWRAKHGKLHNNLGAEPENRF 63
Query: 70 EIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYK 129
IFKDNL+ IDE N + Y LGLN FADL +EE++ +LG K R++++ + +
Sbjct: 64 HIFKDNLKFIDEINAQNLPYRLGLNVFADLTNEEYRSRYLGGKFASGSRRNRTSNRYLPR 123
Query: 130 DVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDC 189
DLP S+DWR KGAV VK+QGSCGSCWAFSTVA+VE INQIVTG+L +LSEQEL+DC
Sbjct: 124 LGDDLPDSIDWRAKGAVAPVKDQGSCGSCWAFSTVASVEAINQIVTGDLIALSEQELVDC 183
Query: 190 DNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVP 249
D +YN GCNGGLMDYAF++I+ GGL EEDYPY + +C K ++VV I+ Y DVP
Sbjct: 184 DRSYNEGCNGGLMDYAFEFIIENGGLDTEEDYPYYGFDSSCIQYKKNAKVVAIDSYEDVP 243
Query: 250 QNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDY 309
N+E +L KA++ Q +SVAIE GR FQ Y G++ G CGT LDHGV VGYGS G+DY
Sbjct: 244 VNNEKALQKAVSKQVVSVAIEGGGRSFQLYQSGIFTGRCGTDLDHGVNVVGYGSEGGVDY 303
Query: 310 IIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
IV+NSWG WGE GY++M+RN P GLCGI SYP K
Sbjct: 304 WIVRNSWGGSWGESGYVKMQRNIASPTGLCGIAMEPSYPTK 344
>gi|226507950|ref|NP_001151278.1| LOC100284911 precursor [Zea mays]
gi|195645488|gb|ACG42212.1| vignain precursor [Zea mays]
Length = 376
Score = 355 bits (911), Expect = 2e-95, Method: Compositional matrix adjust.
Identities = 179/325 (55%), Positives = 219/325 (67%), Gaps = 10/325 (3%)
Query: 33 YSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKNYWLG 92
+ EDL S + L L+E W + + L +K RF +FK N+R I E NR+ + Y L
Sbjct: 34 FGAEDLASEEALWALYERWRGR-HALARDLGDKARRFNVFKANVRLIHEFNRRDEPYKLR 92
Query: 93 LNEFADLRHEEFKEMFLGLKPDLAR--RKDQ----SHEDFSYKDVVDLPKSVDWRKKGAV 146
LN F D+ +EF+ + G + R R D+ + F Y D D+P SVDWR+KGAV
Sbjct: 93 LNRFGDMTADEFRRHYAGSRVAHHRMFRGDRQGSSASASFMYADARDVPASVDWRQKGAV 152
Query: 147 THVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAF 206
T VK+QG CGSCWAFST+AAVEGIN I T NL SLSEQ+L+DCD N GCNGGLMDYAF
Sbjct: 153 TDVKDQGQCGSCWAFSTIAAVEGINAIKTKNLTSLSEQQLVDCDTKANAGCNGGLMDYAF 212
Query: 207 QYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLS 266
QYI GG+ E+ YPY + +C+ K + VVTI+GY DVP N E +L KA+A+QP+S
Sbjct: 213 QYIAKHGGVAAEDAYPYRARQASCK--KSPAPVVTIDGYEDVPANDESALKKAVAHQPVS 270
Query: 267 VAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTR-GLDYIIVKNSWGPKWGEKGY 325
VAIEASG FQFYS GV+ G CGT+LDHGV AVGYG T G Y +VKNSWGP+WGEKGY
Sbjct: 271 VAIEASGSHFQFYSEGVFSGRCGTELDHGVTAVGYGVTADGTKYWLVKNSWGPEWGEKGY 330
Query: 326 IRMKRNTGKPEGLCGINKMASYPIK 350
IRM R+ EG CGI ASYP+K
Sbjct: 331 IRMARDVAAKEGHCGIAMEASYPVK 355
>gi|255540425|ref|XP_002511277.1| cysteine protease, putative [Ricinus communis]
gi|46395620|sp|O65039.1|CYSEP_RICCO RecName: Full=Vignain; AltName: Full=Cysteine endopeptidase; Flags:
Precursor
gi|2944446|gb|AAC62396.1| cysteine endopeptidase precursor [Ricinus communis]
gi|223550392|gb|EEF51879.1| cysteine protease, putative [Ricinus communis]
Length = 360
Score = 353 bits (907), Expect = 5e-95, Method: Compositional matrix adjust.
Identities = 176/309 (56%), Positives = 218/309 (70%), Gaps = 5/309 (1%)
Query: 47 LFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKE 106
L+E W S V SL EK +RF +FK N H+ N+ K Y L LN+FAD+ + EF+
Sbjct: 37 LYERWRS-HHTVSRSLHEKQKRFNVFKHNAMHVHNANKMDKPYKLKLNKFADMTNHEFRN 95
Query: 107 MFLGLKPD---LARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFST 163
+ G K + R + + F Y+ V +P SVDWRKKGAVT VK+QG CGSCWAFST
Sbjct: 96 TYSGSKVKHHRMFRGGPRGNGTFMYEKVDTVPASVDWRKKGAVTSVKDQGQCGSCWAFST 155
Query: 164 VAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPY 223
+ AVEGINQI T L SLSEQEL+DCD N GCNGGLMDYAF++I GG+ E +YPY
Sbjct: 156 IVAVEGINQIKTNKLVSLSEQELVDCDTDQNQGCNGGLMDYAFEFIKQRGGITTEANYPY 215
Query: 224 IMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGV 283
+GTC+++K + V+I+G+ +VP+N E++LLKA+ANQP+SVAI+A G DFQFYS GV
Sbjct: 216 EAYDGTCDVSKENAPAVSIDGHENVPENDENALLKAVANQPVSVAIDAGGSDFQFYSEGV 275
Query: 284 YDGHCGTQLDHGVAAVGYGST-RGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGIN 342
+ G CGT+LDHGVA VGYG+T G Y VKNSWGP+WGEKGYIRM+R EGLCGI
Sbjct: 276 FTGSCGTELDHGVAIVGYGTTIDGTKYWTVKNSWGPEWGEKGYIRMERGISDKEGLCGIA 335
Query: 343 KMASYPIKK 351
ASYPIKK
Sbjct: 336 MEASYPIKK 344
>gi|255547982|ref|XP_002515048.1| cysteine protease, putative [Ricinus communis]
gi|223546099|gb|EEF47602.1| cysteine protease, putative [Ricinus communis]
Length = 359
Score = 353 bits (906), Expect = 6e-95, Method: Compositional matrix adjust.
Identities = 176/323 (54%), Positives = 233/323 (72%), Gaps = 5/323 (1%)
Query: 33 YSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKNYWLG 92
Y EDL S + L +L+E W S V SL EK +RF +FK+NL+HI + N+K + Y L
Sbjct: 25 YKEEDLASEESLWNLYERWRS-HHTVSRSLTEKNQRFNVFKENLKHIHKVNQKDRPYKLR 83
Query: 93 LNEFADLRHEEFKEMFLGLKPDLAR--RKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVK 150
LN+FAD+ + EF + + G K R + F++++ +LP S+DWRK+GAVT VK
Sbjct: 84 LNKFADMTNHEFLQHYGGSKVSHYRMFHGSRRQTGFAHENTSNLPSSIDWRKQGAVTGVK 143
Query: 151 NQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIV 210
+QG CGSCWAFS+VAAVEGIN+I TG L SLSEQEL+DC N+ N+GC+GGLM+ AF +I
Sbjct: 144 DQGKCGSCWAFSSVAAVEGINKIKTGELISLSEQELVDC-NSVNHGCDGGLMEQAFSFIE 202
Query: 211 STGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIE 270
TGGL E +YPY ++G C+ K + +VTI+GY VP+N E +L++A+ANQP+S+AI+
Sbjct: 203 KTGGLTTENNYPYRAKDGYCDSAKMNTPMVTIDGYEMVPENDEHALMQAVANQPVSIAID 262
Query: 271 ASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTR-GLDYIIVKNSWGPKWGEKGYIRMK 329
A G+DFQFYS GVY G CGT+L+HGVA VGYG+T+ G Y IVKNSWG +WGE G+IRM+
Sbjct: 263 AGGQDFQFYSEGVYTGDCGTELNHGVALVGYGATQDGTKYWIVKNSWGSEWGENGFIRMQ 322
Query: 330 RNTGKPEGLCGINKMASYPIKKK 352
R EGLCGI ASYPIK++
Sbjct: 323 RENDVEEGLCGITLEASYPIKQR 345
>gi|90399361|emb|CAJ86180.1| H0212B02.7 [Oryza sativa Indica Group]
Length = 470
Score = 353 bits (906), Expect = 6e-95, Method: Compositional matrix adjust.
Identities = 177/341 (51%), Positives = 231/341 (67%), Gaps = 20/341 (5%)
Query: 27 DFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRK- 85
D SIV Y S ++ L+ W ++ K Y ++ E+ R+ F+DNLR+IDE N
Sbjct: 22 DMSIVSYGER---SEEEARRLYAEWKAEHGKNYNAVGEEERRYAAFRDNLRYIDEHNAAA 78
Query: 86 ---IKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRK 142
+ ++ LGLN FADL +EE+++ +LGL+ + RR+ + + + D LP+SVDWR
Sbjct: 79 DAGVHSFRLGLNRFADLTNEEYRDTYLGLR-NKPRRERKVSDRYLAADNEALPESVDWRT 137
Query: 143 KGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLM 202
KGAV +K+QG CGSCWAFS +AAVEGINQIVTG+L SLSEQEL+DCD +YN GCNGGLM
Sbjct: 138 KGAVAEIKDQGGCGSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDTSYNEGCNGGLM 197
Query: 203 DYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTK------------GESEVVTINGYHDVPQ 250
DYAF +I++ GG+ E+DYPY ++ C++ + ++VVTI+ Y DV
Sbjct: 198 DYAFDFIINNGGIDTEDDYPYKGKDERCDVNRVSFVFFAPLVFQKNAKVVTIDSYEDVTP 257
Query: 251 NSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYI 310
NSE SL KA+ANQP+SVAIEA GR FQ YS G++ G CGT LDHGVAAVGYG+ G DY
Sbjct: 258 NSETSLQKAVANQPVSVAIEAGGRAFQLYSSGIFTGKCGTALDHGVAAVGYGTENGKDYW 317
Query: 311 IVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIKK 351
IV+NSWG WGE GY+RM+RN G CGI SYP+KK
Sbjct: 318 IVRNSWGKSWGESGYVRMERNIKASSGKCGIAVEPSYPLKK 358
>gi|297830592|ref|XP_002883178.1| hypothetical protein ARALYDRAFT_479457 [Arabidopsis lyrata subsp.
lyrata]
gi|297329018|gb|EFH59437.1| hypothetical protein ARALYDRAFT_479457 [Arabidopsis lyrata subsp.
lyrata]
Length = 452
Score = 353 bits (905), Expect = 8e-95, Method: Compositional matrix adjust.
Identities = 180/347 (51%), Positives = 226/347 (65%), Gaps = 4/347 (1%)
Query: 7 FKTILISFCISFFIRSSFARDFSIVGYSPEDLTSND-KLIDLFESWMSKFEKVYESLDEK 65
T + S ++ I S S+ + D T N+ + ++E W+ + K Y L EK
Sbjct: 1 MATPIKSITLALLIFSMLLISLSLGSVTAADTTRNEAEARRMYEQWLVENRKNYNGLGEK 60
Query: 66 LERFEIFKDNLRHIDETNR-KIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHE 124
RFEIF DNL++I+E N + + +GL FADL ++EF+ ++L K + R + E
Sbjct: 61 ETRFEIFTDNLKYIEEHNSVPNQTFEVGLTRFADLTNDEFRAIYLRSKMERTRVPVKG-E 119
Query: 125 DFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQ 184
+ YK LP +DWR KGAV VK+QG+CGSCWAFS + AVEGINQI TG L SLSEQ
Sbjct: 120 RYLYKVGDTLPDQIDWRAKGAVNPVKDQGNCGSCWAFSAIGAVEGINQIKTGELISLSEQ 179
Query: 185 ELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYI-MEEGTCEMTKGESEVVTIN 243
EL+DCD +YN GC GGLMDYAF++I+ GG+ EEDYPY ++ C K S VVTI+
Sbjct: 180 ELVDCDTSYNGGCGGGLMDYAFKFIIENGGIDTEEDYPYTATDDNICNSDKKNSRVVTID 239
Query: 244 GYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGS 303
GY DVPQN E SL KALANQP+SVAIEA GR FQ Y GV+ G CGT LDHGV AVGYGS
Sbjct: 240 GYEDVPQNDEKSLKKALANQPISVAIEAGGRAFQLYKSGVFTGTCGTSLDHGVVAVGYGS 299
Query: 304 TRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
G DY IV+NSWG WGE GY +++RN + G CG+ MASYP K
Sbjct: 300 EGGQDYWIVRNSWGSNWGESGYFKLERNIKESSGKCGVAMMASYPTK 346
>gi|356563155|ref|XP_003549830.1| PREDICTED: vignain-like [Glycine max]
Length = 361
Score = 352 bits (904), Expect = 1e-94, Method: Compositional matrix adjust.
Identities = 182/341 (53%), Positives = 234/341 (68%), Gaps = 9/341 (2%)
Query: 18 FFIRSSFARDFSIVG---YSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKD 74
FF+ SFA + ++ +DL S + L DL+E W S V SLDEK RF +FK
Sbjct: 7 FFVALSFALVLRVAESFEFNEKDLESEEGLWDLYERWRSH-HTVSRSLDEKHNRFNVFKG 65
Query: 75 NLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPD---LARRKDQSHEDFSYKDV 131
N+ H+ +N+ K Y L LN FAD+ + EF+ ++ G K + + R + + F Y++V
Sbjct: 66 NVMHVHSSNKMDKPYKLKLNRFADMTNHEFRSIYAGSKVNHHRMFRGTPRGNGTFMYQNV 125
Query: 132 VDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDN 191
+P SVDWRKKGAVT VK+QG CGSCWAFST+ AVEGINQI T L LSEQEL+DCD
Sbjct: 126 DRVPSSVDWRKKGAVTDVKDQGQCGSCWAFSTIVAVEGINQIKTHKLVPLSEQELVDCDT 185
Query: 192 TYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQN 251
T N GCNGGLM+ AF++I G+ +YPY ++GTC+ +K V+I+G+ +VP N
Sbjct: 186 TQNQGCNGGLMESAFEFIKQY-GITTASNYPYEAKDGTCDASKVNEPAVSIDGHENVPVN 244
Query: 252 SEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTR-GLDYI 310
+E +LLKA+A+QP+SVAIEA G DFQFYS GV+ G+CGT LDHGVA VGYG+T+ G Y
Sbjct: 245 NEAALLKAVAHQPVSVAIEAGGIDFQFYSEGVFTGNCGTALDHGVAIVGYGTTQDGTKYW 304
Query: 311 IVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIKK 351
VKNSWG +WGEKGYIRMKR+ +GLCGI ASYPIKK
Sbjct: 305 TVKNSWGSEWGEKGYIRMKRSISVKKGLCGIAMEASYPIKK 345
>gi|168063167|ref|XP_001783545.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664932|gb|EDQ51634.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 461
Score = 352 bits (904), Expect = 1e-94, Method: Compositional matrix adjust.
Identities = 179/319 (56%), Positives = 221/319 (69%), Gaps = 10/319 (3%)
Query: 37 DLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHI--DETNRKIKNYWLGLN 94
DL + L++ F +W K K Y ++ L RF ++KDNL +I ETNR Y LGL
Sbjct: 43 DLEHENLLLEQFAAWAHKHGKAYHDAEQCLHRFAVWKDNLAYIRHSETNR---TYSLGLT 99
Query: 95 EFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGS 154
+FADL +EEF+ M+ G + D +RR + F Y D + P+SVDWRK GAVT VK+QGS
Sbjct: 100 KFADLTNEEFRRMYTGTRIDRSRRA-KRRTGFRYADS-EAPESVDWRKNGAVTSVKDQGS 157
Query: 155 CGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGG 214
CGSCWAFS V +VEGIN I G SLSEQEL+DCD YN GCNGGLMDYAF +I+ GG
Sbjct: 158 CGSCWAFSAVGSVEGINAIRNGEAVSLSEQELVDCDLEYNQGCNGGLMDYAFDFIIQNGG 217
Query: 215 LHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGR 274
+ E+DYPY +G C+ +K + VVTI+GY DVP+N E++L KA+A QP+SVAIEA GR
Sbjct: 218 IDTEKDYPYKGFDGRCDNSKKNAHVVTIDGYEDVPENDEEALKKAVAGQPVSVAIEAGGR 277
Query: 275 DFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRN--- 331
DFQ Y+ GV+ G CGT LDHGV AVGYG+ G+DY IVKNSWG WGE GY+RMKRN
Sbjct: 278 DFQLYAQGVFSGECGTDLDHGVLAVGYGTEDGVDYWIVKNSWGEYWGESGYLRMKRNMKD 337
Query: 332 TGKPEGLCGINKMASYPIK 350
+ GLCGIN SY +K
Sbjct: 338 SNDGPGLCGINIEPSYAVK 356
>gi|224102377|ref|XP_002312656.1| predicted protein [Populus trichocarpa]
gi|222852476|gb|EEE90023.1| predicted protein [Populus trichocarpa]
Length = 358
Score = 352 bits (904), Expect = 1e-94, Method: Compositional matrix adjust.
Identities = 183/346 (52%), Positives = 232/346 (67%), Gaps = 10/346 (2%)
Query: 8 KTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLE 67
K IL F + R + + D Y+ EDL S ++L DL+E W S V SL EK E
Sbjct: 5 KVILAVFSVVLVFRLADSFD-----YTEEDLASEERLRDLYERWRS-HHTVSRSLAEKQE 58
Query: 68 RFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLAR--RKDQSHED 125
RF +FK+NL+HI + N K + Y L LN FAD+ + EF + + G K R R +
Sbjct: 59 RFNVFKENLKHIHKVNHKDRPYKLKLNSFADMTNHEFLQHYGGSKVSHYRVLRGQRQGTG 118
Query: 126 FSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQE 185
++D LP SVDWRK GAVT +K+QG CGSCWAFSTVAAVEGIN+I TG L SLSEQE
Sbjct: 119 SMHEDTSKLPSSVDWRKNGAVTGIKDQGKCGSCWAFSTVAAVEGINKIKTGELISLSEQE 178
Query: 186 LIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGY 245
L+DCD+ N+GCNGGLM+ AF +I GGL E YPY +E C+ K S VV I+GY
Sbjct: 179 LVDCDSD-NHGCNGGLMEDAFNFIKQIGGLTSENTYPYRAKEEPCDSNKMNSPVVNIDGY 237
Query: 246 HDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTR 305
VP+N E++L+KA+ANQP+++A++A G+D QFYS ++ G CGT+L+HGVA VGYG+T+
Sbjct: 238 EMVPENDENALMKAVANQPVAIAMDAGGKDLQFYSEAIFTGDCGTELNHGVALVGYGTTQ 297
Query: 306 -GLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
G Y IVKNSWG WGEKGYIRM+R EGLCGI ASYP+K
Sbjct: 298 DGTKYWIVKNSWGTDWGEKGYIRMQRGIDAEEGLCGITMEASYPVK 343
>gi|222425026|dbj|BAH20463.1| cysteine protease [Spinacia oleracea]
Length = 473
Score = 352 bits (904), Expect = 1e-94, Method: Compositional matrix adjust.
Identities = 178/341 (52%), Positives = 236/341 (69%), Gaps = 15/341 (4%)
Query: 25 ARDFSIVGYS------PEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRH 78
A D SI+ Y P S+D+++ ++ESW+ + K Y +L EK +RF IFKDNL
Sbjct: 24 AVDMSIISYDHNHNLLPSSSRSDDEVMRIYESWLVQHRKNYNALGEKEKRFAIFKDNLEF 83
Query: 79 IDETNRK-IKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQS-------HEDFSYKD 130
ID+ N + + +GLN+FADL +EEF+ ++LG K + S + + +K+
Sbjct: 84 IDQHNSDDSQTFKVGLNKFADLTNEEFRSVYLGRKKSSSSSPLLSSAKSKVKSDRYLFKE 143
Query: 131 VVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCD 190
+LP++VDWRK GAV VK+QG CGSCWAFST+AAVEGINQIVTG L SLSEQEL+DCD
Sbjct: 144 GDELPEAVDWRKNGAVAKVKDQGQCGSCWAFSTIAAVEGINQIVTGELLSLSEQELVDCD 203
Query: 191 NTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQ 250
+YN+GC+GGLMDYA+++I++ GG+ + DYPY ++G C+ + ++VVTI+ + DVP+
Sbjct: 204 TSYNSGCDGGLMDYAYEFIINNGGIDTDADYPYTAKDGKCDQYRKNAKVVTIDDFEDVPE 263
Query: 251 NSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYI 310
N E +L KA+A+QP+SVAIEA G FQFY GV+ G CG LDHGV AVGYGS G DY
Sbjct: 264 NDEKALQKAVAHQPVSVAIEAGGSTFQFYQSGVFTGKCGADLDHGVVAVGYGSDDGKDYW 323
Query: 311 IVKNSWGPKWGEKGYIRMKRNTGKPE-GLCGINKMASYPIK 350
IV+NSWG WGE GYIRM+RN + G CGI SYPIK
Sbjct: 324 IVRNSWGADWGESGYIRMERNLETVKTGKCGIAIEPSYPIK 364
>gi|255646088|gb|ACU23531.1| unknown [Glycine max]
Length = 362
Score = 352 bits (902), Expect = 2e-94, Method: Compositional matrix adjust.
Identities = 173/322 (53%), Positives = 224/322 (69%), Gaps = 5/322 (1%)
Query: 33 YSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKNYWLG 92
+ +DL S + DL+E W S + V SL +K +RF +FK N+ H+ TN+ K Y L
Sbjct: 25 FHDKDLASEESFWDLYERWRS-YRTVSRSLGDKHKRFNVFKANVMHVHNTNKMDKPYKLK 83
Query: 93 LNEFADLRHEEFKEMFLGLKPDLARR---KDQSHEDFSYKDVVDLPKSVDWRKKGAVTHV 149
LN+FAD+ + EF+ + G K + R + + F Y+ V +P S DWRK GAVT V
Sbjct: 84 LNKFADMTNHEFRSTYAGSKVNHHRMFQGTPRGNGTFMYEKVGSVPPSADWRKNGAVTGV 143
Query: 150 KNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYI 209
K+QG CGSCWAFSTV AVEGINQI T L SLSEQEL+DCD N GCNGGLM+ AF++I
Sbjct: 144 KDQGQCGSCWAFSTVVAVEGINQIKTNKLVSLSEQELVDCDTKKNAGCNGGLMESAFEFI 203
Query: 210 VSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAI 269
GG+ E +YPY ++GTC+ +K V+I+G+ +VP N E++LLKA+ANQP+SVAI
Sbjct: 204 KQKGGITTESNYPYTAQDGTCDASKANDLAVSIDGHENVPANDENALLKAVANQPVSVAI 263
Query: 270 EASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGST-RGLDYIIVKNSWGPKWGEKGYIRM 328
+A G DFQFY GV+ G C T+L+HGVA VGYG+T G +Y V+NSWGP+WGE+GYIRM
Sbjct: 264 DAGGFDFQFYFEGVFTGDCSTELNHGVAIVGYGTTVDGTNYWTVRNSWGPEWGEQGYIRM 323
Query: 329 KRNTGKPEGLCGINKMASYPIK 350
+R+ K EGLCGI MASYPIK
Sbjct: 324 QRSIFKKEGLCGIAMMASYPIK 345
>gi|2098464|pdb|1PCI|A Chain A, Procaricain
gi|2098465|pdb|1PCI|B Chain B, Procaricain
gi|2098466|pdb|1PCI|C Chain C, Procaricain
Length = 322
Score = 352 bits (902), Expect = 2e-94, Method: Compositional matrix adjust.
Identities = 179/324 (55%), Positives = 231/324 (71%), Gaps = 3/324 (0%)
Query: 27 DFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKI 86
DFSIVGYS +DLTS ++LI LF SWM K YE++DEKL RFEIFKDNL +IDETN+K
Sbjct: 1 DFSIVGYSQDDLTSTERLIQLFNSWMLNHNKFYENVDEKLYRFEIFKDNLNYIDETNKKN 60
Query: 87 KNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAV 146
+YWLGLNEFADL ++EF E ++G D A + E+F +D+V+LP++VDWRKKGAV
Sbjct: 61 NSYWLGLNEFADLSNDEFNEKYVGSLID-ATIEQSYDEEFINEDIVNLPENVDWRKKGAV 119
Query: 147 THVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAF 206
T V++QGSCGSCWAFS VA VEGIN+I TG L LSEQEL+DC+ ++GC GG YA
Sbjct: 120 TPVRHQGSCGSCWAFSAVATVEGINKIRTGKLVELSEQELVDCERR-SHGCKGGYPPYAL 178
Query: 207 QYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLS 266
+Y V+ G+H YPY ++GTC + +V +G V N+E +LL A+A QP+S
Sbjct: 179 EY-VAKNGIHLRSKYPYKAKQGTCRAKQVGGPIVKTSGVGRVQPNNEGNLLNAIAKQPVS 237
Query: 267 VAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYI 326
V +E+ GR FQ Y GG+++G CGT++D V AVGYG + G YI++KNSWG WGEKGYI
Sbjct: 238 VVVESKGRPFQLYKGGIFEGPCGTKVDGAVTAVGYGKSGGKGYILIKNSWGTAWGEKGYI 297
Query: 327 RMKRNTGKPEGLCGINKMASYPIK 350
R+KR G G+CG+ K + YP K
Sbjct: 298 RIKRAPGNSPGVCGLYKSSYYPTK 321
>gi|537437|gb|AAC35211.1| cysteine proteinase [Hemerocallis hybrid cultivar]
Length = 359
Score = 351 bits (900), Expect = 3e-94, Method: Compositional matrix adjust.
Identities = 181/324 (55%), Positives = 227/324 (70%), Gaps = 7/324 (2%)
Query: 31 VGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETN-RKIKNY 89
+ + +DL S + L L+E W + V LD+ +RF +FK+N++ I E N +K Y
Sbjct: 24 IPFDEKDLASEESLWSLYEKWRAH-HAVSRDLDDTDKRFNVFKENVKFIHEFNQKKDATY 82
Query: 90 WLGLNEFADLRHEEFKEMFLGLKPD--LARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVT 147
L LN+F D+ ++EF+ + G K D + R + +FSY+ DLP SVDWR+KGAVT
Sbjct: 83 KLALNKFGDMTNQEFRSTYAGSKIDHHMTLRGVKDAGEFSYEKFHDLPTSVDWREKGAVT 142
Query: 148 HVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQ 207
VK+QG CGSCWAFSTV AVEGINQI T L SLSEQ+L+DCD T N+GCNGGLMDYAF
Sbjct: 143 GVKDQGQCGSCWAFSTVVAVEGINQIKTNELVSLSEQQLVDCD-TKNSGCNGGLMDYAFD 201
Query: 208 YIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSV 267
+I + GGL E+ YPY+ E+ +C ++ S VVTI+GY DVP+N+E +L+KA+ANQP+SV
Sbjct: 202 FIKNNGGLSSEDSYPYLAEQKSCG-SEANSAVVTIDGYQDVPRNNEAALMKAVANQPVSV 260
Query: 268 AIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYG-STRGLDYIIVKNSWGPKWGEKGYI 326
AIEASG FQFYS GV+ GHCGT+LDHGVAAVGYG G Y IVKNSWG WGE GYI
Sbjct: 261 AIEASGYAFQFYSQGVFSGHCGTELDHGVAAVGYGVDDDGKKYWIVKNSWGEGWGESGYI 320
Query: 327 RMKRNTGKPEGLCGINKMASYPIK 350
RM+R G CGI ASYPIK
Sbjct: 321 RMERGIKDKRGKCGIAMEASYPIK 344
>gi|242081867|ref|XP_002445702.1| hypothetical protein SORBIDRAFT_07g024430 [Sorghum bicolor]
gi|241942052|gb|EES15197.1| hypothetical protein SORBIDRAFT_07g024430 [Sorghum bicolor]
Length = 372
Score = 351 bits (900), Expect = 3e-94, Method: Compositional matrix adjust.
Identities = 178/327 (54%), Positives = 220/327 (67%), Gaps = 8/327 (2%)
Query: 29 SIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKN 88
S V + EDL S + L L+E W + V L +K RF +FK+N+R I + N++ +
Sbjct: 28 SAVEFGAEDLASEEALWALYERWRGR-HAVARDLGDKARRFNVFKENVRLIHDFNQRDEP 86
Query: 89 YWLGLNEFADLRHEEFKEMFLGLKPDLAR--RKDQ--SHEDFSYKDVVDLPKSVDWRKKG 144
Y L LN F D+ +EF+ + G + R R D+ S F Y DLP SVDWR+KG
Sbjct: 87 YKLRLNRFGDMTADEFRRHYAGSRVAHHRMFRGDRQGSASSFMYAGARDLPTSVDWRQKG 146
Query: 145 AVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDY 204
AVT VK+QG CGSCWAFST+AAVEGIN I T NL SLSEQ+L+DCD N GC+GGLMDY
Sbjct: 147 AVTDVKDQGQCGSCWAFSTIAAVEGINAIKTKNLTSLSEQQLVDCDTKGNAGCDGGLMDY 206
Query: 205 AFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQP 264
AFQYI GG+ E+ YPY + +C+ K + VTI+GY DVP N E +L KA+A+QP
Sbjct: 207 AFQYIAKHGGVAAEDAYPYKARQASCK--KSPAPAVTIDGYEDVPANDESALKKAVAHQP 264
Query: 265 LSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYG-STRGLDYIIVKNSWGPKWGEK 323
+SVAIEASG FQFYS GV+ G CGT+LDHGV AVGYG + G Y +VKNSWGP+WGEK
Sbjct: 265 VSVAIEASGSHFQFYSEGVFAGRCGTELDHGVTAVGYGVAADGTKYWVVKNSWGPEWGEK 324
Query: 324 GYIRMKRNTGKPEGLCGINKMASYPIK 350
GYIRM R+ EG CGI ASYP+K
Sbjct: 325 GYIRMARDVAAKEGHCGIAMEASYPVK 351
>gi|2224808|emb|CAB09697.1| cysteine endopeptidase EP-A [Hordeum vulgare subsp. vulgare]
gi|326502180|dbj|BAK06781.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 365
Score = 351 bits (900), Expect = 4e-94, Method: Compositional matrix adjust.
Identities = 175/326 (53%), Positives = 225/326 (69%), Gaps = 7/326 (2%)
Query: 31 VGYSPEDLTSNDKLIDLFESWMSKFEKVYESL--DEKLERFEIFKDNLRHIDETNRKIKN 88
V ++ +DL S + L L+E W S + L D + RF +FK+N R++ E N++ +
Sbjct: 24 VPFTEKDLASEESLRGLYERWRSHYTVSRRGLGADAEERRFNVFKENARYVHEGNKRDRP 83
Query: 89 YWLGLNEFADLRHEEFKEMFLG--LKPDLA-RRKDQSHEDFSYKDVVDLPKSVDWRKKGA 145
+ L LN+FAD+ +EF+ + G ++ L+ + F Y D +LP +VDWR+KGA
Sbjct: 84 FRLALNKFADMTTDEFRRTYAGSRVRHHLSLSGGRRGDGGFRYADADNLPPAVDWRQKGA 143
Query: 146 VTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYA 205
VT +K+QG CGSCWAFST+ AVEGIN+I TG L SLSEQEL+DCDN N GC GGLMDYA
Sbjct: 144 VTAIKDQGQCGSCWAFSTIVAVEGINKIRTGKLVSLSEQELMDCDNVNNQGCEGGLMDYA 203
Query: 206 FQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPL 265
FQ+I G+ E +YPY E+G+C+ K ++ VTI+GY DVP N E +L KA+A QP+
Sbjct: 204 FQFI-QKNGITTESNYPYQGEQGSCDQAKENAQAVTIDGYEDVPANDESALQKAVAGQPV 262
Query: 266 SVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTR-GLDYIIVKNSWGPKWGEKG 324
SVAI+ASG+DFQFYS GV+ G C T LDHGVAAVGYG+TR G Y IVKNSWG WGEKG
Sbjct: 263 SVAIDASGQDFQFYSEGVFTGECSTDLDHGVAAVGYGATRDGTKYWIVKNSWGEDWGEKG 322
Query: 325 YIRMKRNTGKPEGLCGINKMASYPIK 350
YIRM+R + EGLCGI ASYP K
Sbjct: 323 YIRMQRGVSQTEGLCGIAMQASYPTK 348
>gi|595986|gb|AAA79915.1| cysteine proteinase, partial [Dianthus caryophyllus]
Length = 427
Score = 350 bits (898), Expect = 6e-94, Method: Compositional matrix adjust.
Identities = 169/310 (54%), Positives = 226/310 (72%), Gaps = 9/310 (2%)
Query: 48 FESWMSKFEKVYESLDEKLERFEIFKDNLRHIDE-----TNRKIKNYWLGLNEFADLRHE 102
+SW+ K K Y +L EK +RF IF+DNL ID+ + LGLN+FADL ++
Sbjct: 5 LQSWLVKHRKNYNALGEKEKRFAIFRDNLEFIDQHNNNNNGGGGGEFELGLNKFADLTND 64
Query: 103 EFKEMFLGLK-PDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAF 161
EF+ ++ G+K P+ A + + ++ K+ +LP+SVDWRKKGAV+HVK+QG CGSCWAF
Sbjct: 65 EFRRIYFGVKRPEKA--ESVKSDRYAVKEGDELPESVDWRKKGAVSHVKDQGQCGSCWAF 122
Query: 162 STVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDY 221
S + AVEGIN+IVTG+L +LSEQEL+DCD +YN+GC+GGLMDYAF++I++ GG+ ++DY
Sbjct: 123 SAIGAVEGINKIVTGDLITLSEQELVDCDTSYNSGCDGGLMDYAFRFIINNGGIDTDKDY 182
Query: 222 PYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSG 281
PY +G+C+ + ++VVTI+G DVP N+E +L KA+A+QP+ +AIEA GRDFQ Y
Sbjct: 183 PYKATDGSCDSNRKNAKVVTIDGLEDVPANNEKALQKAVAHQPVRLAIEAGGRDFQLYKS 242
Query: 282 GVYDGHCGTQLDHGVAAVGYGST-RGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCG 340
GV+ G CGT LDHGV AVGYG+T G DY IV+NSWG WGE GYIRM+RNT G CG
Sbjct: 243 GVFTGSCGTSLDHGVVAVGYGTTDDGKDYWIVRNSWGDDWGEDGYIRMERNTESKSGKCG 302
Query: 341 INKMASYPIK 350
I SYP+K
Sbjct: 303 IAIEPSYPVK 312
>gi|357156854|ref|XP_003577598.1| PREDICTED: thiol protease SEN102-like [Brachypodium distachyon]
Length = 368
Score = 350 bits (897), Expect = 7e-94, Method: Compositional matrix adjust.
Identities = 175/336 (52%), Positives = 226/336 (67%), Gaps = 18/336 (5%)
Query: 31 VGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLE----------RFEIFKDNLRHID 80
V ++ +DL S + L L+E W S++ + L RF +FK+N+++I
Sbjct: 21 VPFTEKDLASEESLRGLYERWRSRYTVSPSTPGSGLRGKLADHDPARRFNVFKENVKYIH 80
Query: 81 ETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLAR-----RKDQSHEDFSYKDVVDLP 135
E N+K + + L LN+FAD+ +E + + G + R R+ Q +F+Y D +LP
Sbjct: 81 EANKKDRPFRLALNKFADMTTDELRHSYAGSRVRHHRALSGGRRAQG--NFTYSDAENLP 138
Query: 136 KSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNN 195
+VDWR+KGAVT +K+QG CGSCWAFST+AAVE IN+I TG L SLSEQEL+DCDN +
Sbjct: 139 PAVDWREKGAVTGIKDQGQCGSCWAFSTIAAVESINKIRTGKLVSLSEQELMDCDNVNDQ 198
Query: 196 GCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDS 255
GC+GGLMDYAFQ+I GG+ E +YPY ++ TC+ K + V I+GY DVP N E +
Sbjct: 199 GCDGGLMDYAFQFIQKNGGVTSEANYPYQGQQNTCDQAKENTHDVAIDGYEDVPANDESA 258
Query: 256 LLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTR-GLDYIIVKN 314
L KA+A QP+SVAIEASG+DFQFYS GV+ G C T LDHGVAAVGYG+ R G Y IVKN
Sbjct: 259 LQKAVAYQPVSVAIEASGQDFQFYSEGVFTGQCTTDLDHGVAAVGYGTARDGTKYWIVKN 318
Query: 315 SWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
SWG WGEKGYIRM+R + EGLCGI ASYPIK
Sbjct: 319 SWGLDWGEKGYIRMQRGVSQAEGLCGIAMQASYPIK 354
>gi|334185815|ref|NP_680113.3| putative cysteine proteinase [Arabidopsis thaliana]
gi|75313879|sp|Q9STL4.1|CEP2_ARATH RecName: Full=KDEL-tailed cysteine endopeptidase CEP2; Flags:
Precursor
gi|4678354|emb|CAB41164.1| cysteine endopeptidase-like protein [Arabidopsis thaliana]
gi|332644882|gb|AEE78403.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 361
Score = 350 bits (897), Expect = 7e-94, Method: Compositional matrix adjust.
Identities = 178/349 (51%), Positives = 235/349 (67%), Gaps = 10/349 (2%)
Query: 7 FKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKL 66
K +L+ F S I + A F Y +++ S + L L++ W S V SL+E+
Sbjct: 1 MKKLLLIFLFSLVILQT-ACGFD---YDDKEIESEEGLSTLYDRWRS-HHSVPRSLNERE 55
Query: 67 ERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLAR-----RKDQ 121
+RF +F+ N+ H+ TN+K ++Y L LN+FADL EFK + G R ++
Sbjct: 56 KRFNVFRHNVMHVHNTNKKNRSYKLKLNKFADLTINEFKNAYTGSNIKHHRMLQGPKRGS 115
Query: 122 SHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASL 181
+ ++++ LP SVDWRKKGAVT +KNQG CGSCWAFSTVAAVEGIN+I T L SL
Sbjct: 116 KQFMYDHENLSKLPSSVDWRKKGAVTEIKNQGKCGSCWAFSTVAAVEGINKIKTNKLVSL 175
Query: 182 SEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVT 241
SEQEL+DCD N GCNGGLM+ AF++I GG+ E+ YPY +G C+ +K +VT
Sbjct: 176 SEQELVDCDTKQNEGCNGGLMEIAFEFIKKNGGITTEDSYPYEGIDGKCDASKDNGVLVT 235
Query: 242 INGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGY 301
I+G+ DVP+N E++LLKA+ANQP+SVAI+A DFQFYS GV+ G CGT+L+HGVAAVGY
Sbjct: 236 IDGHEDVPENDENALLKAVANQPVSVAIDAGSSDFQFYSEGVFTGSCGTELNHGVAAVGY 295
Query: 302 GSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
GS RG Y IV+NSWG +WGE GYI+++R +PEG CGI ASYPIK
Sbjct: 296 GSERGKKYWIVRNSWGAEWGEGGYIKIEREIDEPEGRCGIAMEASYPIK 344
>gi|1169186|sp|P43156.1|CYSP_HEMSP RecName: Full=Thiol protease SEN102; Flags: Precursor
gi|396568|emb|CAA52425.1| thiol-protease [Hemerocallis hybrid cultivar]
Length = 360
Score = 350 bits (897), Expect = 7e-94, Method: Compositional matrix adjust.
Identities = 180/343 (52%), Positives = 232/343 (67%), Gaps = 8/343 (2%)
Query: 14 FCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFK 73
F + SF + ++ +DL S D L +L+E W + V LDEK RF +FK
Sbjct: 6 FIALALVALSFLSIAQSIPFTEKDLASEDSLWNLYEKWRTH-HTVARDLDEKNRRFNVFK 64
Query: 74 DNLRHIDETN-RKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRK---DQSHEDFSYK 129
+N++ I E N +K Y L LN+F D+ ++EF+ + G K R + ++ F Y+
Sbjct: 65 ENVKFIHEFNQKKDAPYKLALNKFGDMTNQEFRSKYAGSKIQHHRSQRGIQKNTGSFMYE 124
Query: 130 DVVDLPK-SVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELID 188
+V LP S+DWR KGAVT VK+QG CGSCWAFST+A+VEGINQI TG L SLSEQEL+D
Sbjct: 125 NVGSLPAASIDWRAKGAVTGVKDQGQCGSCWAFSTIASVEGINQIKTGELVSLSEQELVD 184
Query: 189 CDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDV 248
CD +YN GCNGGLMDYAF++I G+ E+ YPY ++GTC S VV+I+G+ DV
Sbjct: 185 CDTSYNEGCNGGLMDYAFEFI-QKNGITTEDSYPYAEQDGTCASNLLNSPVVSIDGHQDV 243
Query: 249 PQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTR-GL 307
P N+E++L++A+ANQP+SV+IEASG FQFYS GV+ G CGT+LDHGVA VGYG+TR G
Sbjct: 244 PANNENALMQAVANQPISVSIEASGYGFQFYSEGVFTGRCGTELDHGVAIVGYGATRDGT 303
Query: 308 DYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
Y IVKNSWG +WGE GYIRM+R G CGI ASYPIK
Sbjct: 304 KYWIVKNSWGEEWGESGYIRMQRGISDKRGKCGIAMEASYPIK 346
>gi|414591545|tpg|DAA42116.1| TPA: hypothetical protein ZEAMMB73_388689 [Zea mays]
Length = 384
Score = 349 bits (896), Expect = 1e-93, Method: Compositional matrix adjust.
Identities = 172/334 (51%), Positives = 224/334 (67%), Gaps = 13/334 (3%)
Query: 31 VGYSPEDLTSNDKLIDLFESWMSKFEKVY----ESLDEKLERFEIFKDNLRHIDETNRKI 86
+ +S DL S + L L+E W S + +V + ++ RF +FK+N R++ E NRK
Sbjct: 24 IPFSERDLASEESLRALYERWRSHYHRVSPRDGDDKQQQARRFNVFKENARYVHEANRKD 83
Query: 87 -KNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKD-------VVDLPKSV 138
+ + L LN+FAD+ +EF+ + G + R + F++ +LP +V
Sbjct: 84 GRPFRLALNKFADMTTDEFRRTYAGSRTRHHRAQLGEARSFAHAQHGRGGSGTTNLPPAV 143
Query: 139 DWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCN 198
DWR +GAVT VK+QG CGSCWAFS +AAVEG+N+I+TG L SLSEQEL+DCD+ N GC+
Sbjct: 144 DWRLRGAVTGVKDQGQCGSCWAFSAIAAVEGVNKIMTGKLVSLSEQELVDCDDVDNQGCD 203
Query: 199 GGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLK 258
GGLMDYAFQYI GG+ E +YPY+ E+ +C K S VTI+GY DVP N+ED+L K
Sbjct: 204 GGLMDYAFQYIQRNGGVTTESNYPYLAEQRSCNKAKERSHDVTIDGYEDVPANNEDALQK 263
Query: 259 ALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGST-RGLDYIIVKNSWG 317
A+A+QP++VAIEASG+DFQFYS GV+ G CGT LDHGVAAVGYG+T G Y VKNSWG
Sbjct: 264 AVASQPVAVAIEASGQDFQFYSEGVFTGSCGTDLDHGVAAVGYGTTGDGTKYWTVKNSWG 323
Query: 318 PKWGEKGYIRMKRNTGKPEGLCGINKMASYPIKK 351
WGE+GYIRM+R GLCGI SYP KK
Sbjct: 324 EDWGERGYIRMQRGVPDSRGLCGIAMEPSYPTKK 357
>gi|359359213|gb|AEV41117.1| putative oryzain beta chain precursor [Oryza officinalis]
Length = 465
Score = 348 bits (893), Expect = 2e-93, Method: Compositional matrix adjust.
Identities = 171/332 (51%), Positives = 225/332 (67%), Gaps = 10/332 (3%)
Query: 27 DFSIVGYSPEDLTSNDKLIDL-----FESWMSKFEKVYESLDEKLERFEIFKDNLRHIDE 81
D SI+ Y+ E + + ++ W+++ + Y +L E RF +F DNLR D
Sbjct: 28 DMSIISYNAEHGARGLERTEAEARAAYDLWLAENGRSYNALGEHERRFRVFWDNLRFADA 87
Query: 82 TNRKIKN--YWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVD 139
N + + + LG+N FADL +EEF+ FLG K + R + E + + V +LP+SVD
Sbjct: 88 HNARADDHGFRLGMNRFADLTNEEFRATFLGAK--VVERSRAAGERYRHDGVEELPESVD 145
Query: 140 WRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCD-NTYNNGCN 198
WR+KGAV VKNQG CGSCWAFS V+ VE INQ+VTG + +LSEQEL++C N N+GCN
Sbjct: 146 WREKGAVAPVKNQGQCGSCWAFSAVSTVESINQLVTGEMITLSEQELVECSTNGQNSGCN 205
Query: 199 GGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLK 258
GGLMD AF +I+ GG+ E+DYPY +G C++ + ++VV+I+G+ DVPQN E SL K
Sbjct: 206 GGLMDDAFDFIIKNGGIDTEDDYPYKAVDGKCDINRENAKVVSIDGFEDVPQNDEKSLQK 265
Query: 259 ALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGP 318
A+A+QP+SVAIEA GR+FQ Y GV+ G CGT LDHGV AVGYG+ G DY IV+NSWGP
Sbjct: 266 AVAHQPVSVAIEAGGREFQLYHSGVFSGRCGTSLDHGVVAVGYGTDNGKDYWIVRNSWGP 325
Query: 319 KWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
KWGE GY+RM+RN G CGI MASYP K
Sbjct: 326 KWGESGYVRMERNINVTTGKCGIAMMASYPTK 357
>gi|40806500|gb|AAR92155.1| putative cysteine protease 2 [Iris x hollandica]
Length = 359
Score = 348 bits (892), Expect = 3e-93, Method: Compositional matrix adjust.
Identities = 174/342 (50%), Positives = 229/342 (66%), Gaps = 6/342 (1%)
Query: 13 SFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIF 72
S ++ + +F + ++ +DL S + L L+E W S V L EK +RF +F
Sbjct: 5 SMLLALVVALAFVGVARTIPFNEKDLASEESLWGLYERWRSH-HTVSRDLSEKNKRFNVF 63
Query: 73 KDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRK---DQSHEDFSYK 129
K+N + I E N+K Y LGLN+FAD+ ++EF+ + G K R + ++ F Y+
Sbjct: 64 KENAKFIHEFNKKDAPYKLGLNKFADMTNQEFRSTYAGSKIHHHRTQRGTPRATGSFMYE 123
Query: 130 DVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDC 189
+V +P SVDWR +GAV VK+QG CGSCWAFST+A+VEGIN+I T L LS Q+L+DC
Sbjct: 124 NVHSIPASVDWRTQGAVAPVKDQGQCGSCWAFSTIASVEGINKIKTNQLVPLSGQQLVDC 183
Query: 190 DNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVP 249
D N GCNGGLMDYAF++I S GG+ E YPY E+G+C ++ + VVTI+GY DVP
Sbjct: 184 DTDQNEGCNGGLMDYAFEFIKSNGGITSESAYPYTAEQGSC-ASESSAPVVTIDGYEDVP 242
Query: 250 QNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTR-GLD 308
N+E +L+KA+ANQ +SVAIEASG FQFYS GV+ G CG +LDHGVA VGYG+TR G
Sbjct: 243 ANNEAALMKAVANQVVSVAIEASGMAFQFYSEGVFTGSCGNELDHGVAVVGYGATRDGTK 302
Query: 309 YIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
Y IV+NSWG +WGEKGYIRM+R GLCGI SYP+K
Sbjct: 303 YWIVRNSWGAEWGEKGYIRMQRGIRARHGLCGIAMEPSYPLK 344
>gi|359359066|gb|AEV40973.1| putative oryzain beta chain precursor [Oryza punctata]
Length = 461
Score = 348 bits (892), Expect = 3e-93, Method: Compositional matrix adjust.
Identities = 169/333 (50%), Positives = 226/333 (67%), Gaps = 11/333 (3%)
Query: 27 DFSIVGYSPEDLTSNDKLIDL-----FESWMSKFEKVYESLDEKLERFEIFKDNLRHIDE 81
D SI+ Y+ E + + ++ W+++ + Y +L E+ RF +F DNL+ +D
Sbjct: 23 DMSIISYNAEHGARGLERTEAEARAAYDLWLAENGRSYNALGERERRFRVFWDNLKFVDA 82
Query: 82 TNRKIK---NYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSV 138
N + + LG+N FADL ++EF+ FLG K + R + E + + V +LP+SV
Sbjct: 83 HNARADEHGGFRLGMNRFADLTNDEFRSTFLGAK--VVERSRAAGERYRHDGVEELPESV 140
Query: 139 DWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCD-NTYNNGC 197
DWR+KGAV VKNQG CGSCWAFS V+ VE INQ+VTG + +LSEQEL++C N N+GC
Sbjct: 141 DWREKGAVAPVKNQGQCGSCWAFSAVSTVESINQLVTGEMITLSEQELVECSTNGQNSGC 200
Query: 198 NGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLL 257
NGGLMD AF +I+ GG+ E+DYPY +G C++ + ++VV+I+G+ DVPQN E SL
Sbjct: 201 NGGLMDDAFDFIIKNGGIDTEDDYPYKAVDGKCDINRENAKVVSIDGFEDVPQNDEKSLQ 260
Query: 258 KALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWG 317
KA+A+QP+SVAIEA GR+FQ Y GV+ G CGT LDHGV AVGYG+ G DY IV+NSWG
Sbjct: 261 KAVAHQPVSVAIEAGGREFQLYHSGVFSGRCGTSLDHGVVAVGYGTDNGKDYWIVRNSWG 320
Query: 318 PKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
PKWGE GY+RM+RN G CGI MASYP K
Sbjct: 321 PKWGESGYVRMERNINATTGKCGIAMMASYPTK 353
>gi|262360187|gb|ACY38051.2| cysteine proteinase C1A [Dactylis glomerata]
Length = 365
Score = 348 bits (892), Expect = 3e-93, Method: Compositional matrix adjust.
Identities = 176/327 (53%), Positives = 224/327 (68%), Gaps = 7/327 (2%)
Query: 31 VGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLE--RFEIFKDNLRHIDETNRKIKN 88
V ++ +DL S + L L+E+W S L + E RF +FK+N+R+I E N+K +
Sbjct: 23 VPFTEKDLASEESLRGLYETWRSHHTVSRRGLGAEAEARRFNVFKENVRYIHEANKKDRP 82
Query: 89 YWLGLNEFADLRHEEFKEMFLGLK----PDLARRKDQSHEDFSYKDVVDLPKSVDWRKKG 144
+ L LN+FAD+ +EF+ + G + L+ + Q F Y D +LP +VDWR+KG
Sbjct: 83 FRLALNKFADMTTDEFRRTYAGSRVRHHRSLSGGRRQGGGSFMYADAENLPAAVDWRQKG 142
Query: 145 AVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDY 204
AVT +K+QG CGSCWAFST+ AVEGIN+I TG L SLSEQEL+DC+ N+GCNGGLMD
Sbjct: 143 AVTPIKDQGQCGSCWAFSTIVAVEGINKIRTGRLVSLSEQELMDCNIGENDGCNGGLMDV 202
Query: 205 AFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQP 264
AFQ+I GG+ E YPY E+ +C+ +K S V+I+GY DVP N E +L KA+ANQP
Sbjct: 203 AFQFIQQNGGITTEASYPYQGEQNSCDQSKENSHDVSIDGYEDVPANDESALQKAVANQP 262
Query: 265 LSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTR-GLDYIIVKNSWGPKWGEK 323
+SVAI+ASG DFQFYS GV+ GT LDHGVAAVGYG+TR G Y IVKNSWG WGEK
Sbjct: 263 VSVAIDASGNDFQFYSEGVFTTDGGTDLDHGVAAVGYGTTRDGTKYWIVKNSWGEDWGEK 322
Query: 324 GYIRMKRNTGKPEGLCGINKMASYPIK 350
GYIRM+R + EGLCGI ASYP K
Sbjct: 323 GYIRMQRGVKQAEGLCGIAMEASYPTK 349
>gi|302781881|ref|XP_002972714.1| hypothetical protein SELMODRAFT_98707 [Selaginella moellendorffii]
gi|300159315|gb|EFJ25935.1| hypothetical protein SELMODRAFT_98707 [Selaginella moellendorffii]
Length = 446
Score = 347 bits (890), Expect = 4e-93, Method: Compositional matrix adjust.
Identities = 175/321 (54%), Positives = 219/321 (68%), Gaps = 12/321 (3%)
Query: 39 TSNDKLIDLFESWMSKFEKVYESLDEKLER-FEIFKDNLRHIDETNRKIKN-YWLGLNEF 96
+S+ L + SW +KF K S + +R FE FK+N R+I+E NR K+ Y LGLN+F
Sbjct: 4 SSDSDLSGEYASWCAKFGKECASSNSLGDRRFETFKENFRYIEEHNRAGKHSYRLGLNQF 63
Query: 97 ADLRHEEFKEMFLGLKPDL-------ARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHV 149
+DL EEF++ FLGL+PDL R E F VDLP SVDWRK GAVT
Sbjct: 64 SDLTSEEFRQRFLGLRPDLIDSPVLKMPRDSDIEEGFQN---VDLPASVDWRKHGAVTAP 120
Query: 150 KNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYI 209
K+QGSCG CWAF+T A+EGINQIVTG L SLSEQELIDCD + GC+GGLM+ A+Q+I
Sbjct: 121 KDQGSCGGCWAFATTGAIEGINQIVTGQLMSLSEQELIDCDKKADKGCDGGLMENAYQFI 180
Query: 210 VSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAI 269
V GGL E DYPY E C M K S VV I+GY +P E +LL+A+A QP+SVAI
Sbjct: 181 VENGGLDTETDYPYHASESHCNMKKLNSRVVAIDGYEAIPDGDEQALLRAVAKQPVSVAI 240
Query: 270 EASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMK 329
E + +DFQ Y+ GV+ GHCG +++HGV VGYG+ GLDY IVKNSW WG+ G+++M+
Sbjct: 241 EGASKDFQHYASGVFTGHCGEEINHGVLIVGYGTEDGLDYWIVKNSWAATWGDGGFVKMQ 300
Query: 330 RNTGKPEGLCGINKMASYPIK 350
RNTGK GLC IN +ASYP+K
Sbjct: 301 RNTGKRGGLCSINTLASYPVK 321
>gi|224133764|ref|XP_002321655.1| predicted protein [Populus trichocarpa]
gi|222868651|gb|EEF05782.1| predicted protein [Populus trichocarpa]
Length = 360
Score = 347 bits (889), Expect = 7e-93, Method: Compositional matrix adjust.
Identities = 172/323 (53%), Positives = 221/323 (68%), Gaps = 5/323 (1%)
Query: 33 YSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKNYWLG 92
+ +DL S + L DL+E W S V SLDEK +RF +F+ N+ H+ TN+ K Y L
Sbjct: 23 FHEKDLESEESLWDLYEKWRSH-HTVSTSLDEKRKRFNVFRANVLHVHNTNKMDKPYKLK 81
Query: 93 LNEFADLRHEEFKEMFLGLK---PDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHV 149
LN+FAD+ + EF+ + K + R + F Y ++ +P S+DWRKKGAVT V
Sbjct: 82 LNKFADMTNHEFRTAYASSKVKHHTMFRGAPLGNGSFMYGNIDKVPASIDWRKKGAVTPV 141
Query: 150 KNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYI 209
K+QG CGSCWAFST+ AVEGIN I T L SLSEQEL+DC+ N+GCNGGLMDYAF++I
Sbjct: 142 KDQGKCGSCWAFSTIVAVEGINFIKTNKLISLSEQELVDCNTGENHGCNGGLMDYAFEFI 201
Query: 210 VSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAI 269
G+ E +YPY ++G C+ K V+I+G+ DV N+E++LLKA+ANQP+SVAI
Sbjct: 202 TKQKGITTEANYPYRAQDGHCDANKANQPAVSIDGHEDVLHNNENALLKAVANQPVSVAI 261
Query: 270 EASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGST-RGLDYIIVKNSWGPKWGEKGYIRM 328
+A G DFQFYS GV+ G CG +LDHGVA VGYG+T G Y IV+NSWGP+WGE+GYIRM
Sbjct: 262 DAGGSDFQFYSEGVFTGECGKELDHGVAIVGYGTTVDGTKYWIVRNSWGPEWGERGYIRM 321
Query: 329 KRNTGKPEGLCGINKMASYPIKK 351
+R GLCGI ASYPIKK
Sbjct: 322 QRGISDRRGLCGIAMEASYPIKK 344
>gi|2224812|emb|CAB09699.1| cysteine endopeptidase EP-A [Hordeum vulgare subsp. vulgare]
Length = 365
Score = 346 bits (888), Expect = 9e-93, Method: Compositional matrix adjust.
Identities = 175/326 (53%), Positives = 224/326 (68%), Gaps = 7/326 (2%)
Query: 31 VGYSPEDLTSNDKLIDLFESWMSKFEKVYESL--DEKLERFEIFKDNLRHIDETNRKIKN 88
V ++ +DL S + L L+E W S + L D + RF +FK N R++ E N++
Sbjct: 24 VPFTEKDLASEESLRGLYERWRSHYTVSRRGLGADAEERRFNVFKQNARYVHEGNKRDMP 83
Query: 89 YWLGLNEFADLRHEEFKEMFLG--LKPDLA-RRKDQSHEDFSYKDVVDLPKSVDWRKKGA 145
+ L LN+FAD+ +EF+ + G ++ L+ + F Y D +LP +VDWR+KGA
Sbjct: 84 FRLALNKFADMTTDEFRRTYAGSRVRHHLSLSGGRRGDGGFRYGDADNLPPAVDWRQKGA 143
Query: 146 VTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYA 205
VT +K+QG CGSCWAFST+ AVEGIN+I TG L SLSEQEL+DCDN N GC+GGLMDYA
Sbjct: 144 VTAIKDQGQCGSCWAFSTIVAVEGINKIRTGKLVSLSEQELMDCDNVNNQGCDGGLMDYA 203
Query: 206 FQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPL 265
FQ+I G+ E +YPY E+G+C+ K ++ VTI+GY DVP N E +L KA+A QP+
Sbjct: 204 FQFI-QKNGITTESNYPYQGEQGSCDQAKENAQAVTIDGYEDVPANDESALQKAVAGQPV 262
Query: 266 SVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTR-GLDYIIVKNSWGPKWGEKG 324
SVAI+ASG+DFQFYS GV+ G C T LDHGVAAVGYG+TR G Y IVKNSWG WGEKG
Sbjct: 263 SVAIDASGQDFQFYSEGVFTGECSTDLDHGVAAVGYGATRDGTKYWIVKNSWGEDWGEKG 322
Query: 325 YIRMKRNTGKPEGLCGINKMASYPIK 350
YIRM+R + EGLCGI ASYP K
Sbjct: 323 YIRMQRGVSQTEGLCGIAMQASYPTK 348
>gi|255538788|ref|XP_002510459.1| cysteine protease, putative [Ricinus communis]
gi|223551160|gb|EEF52646.1| cysteine protease, putative [Ricinus communis]
Length = 422
Score = 346 bits (887), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 169/306 (55%), Positives = 212/306 (69%), Gaps = 3/306 (0%)
Query: 47 LFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRK-IKNYWLGLNEFADLRHEEFK 105
LFESW + K Y S ++KL RF+IF++N + + N + +Y L LN FADL H EFK
Sbjct: 31 LFESWTKEHGKTYTSKEDKLYRFKIFEENYEFVKKHNSQGNSSYTLSLNAFADLTHHEFK 90
Query: 106 EMFLGLKPDLARRKDQSHEDFSYKDVV-DLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTV 164
LGL + S +F D V D+P S+DWRKKGAV+ VK+QG+CG+CW+FS
Sbjct: 91 ASRLGLSA-FSTSGKLSRRNFPLHDFVGDVPISIDWRKKGAVSQVKDQGNCGACWSFSAT 149
Query: 165 AAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYI 224
A+EGIN+IVTG+L SLSEQEL+DCD +YNNGC GGLMDYA+Q+++ G+ EEDYPY
Sbjct: 150 GAIEGINKIVTGSLVSLSEQELVDCDRSYNNGCEGGLMDYAYQFVIENNGIDTEEDYPYQ 209
Query: 225 MEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVY 284
E TC K + VVTI+GY DVPQN+E LLKA+A QP+SV I S R FQ YS G++
Sbjct: 210 AREKTCNKEKLKRHVVTIDGYTDVPQNNEKELLKAVAAQPVSVGICGSERAFQLYSKGIF 269
Query: 285 DGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKM 344
G C T LDH V VGYGS G+DY IVKNSWG WG GY+ M RN+G +GLCGIN +
Sbjct: 270 TGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGTHWGINGYMYMLRNSGNSQGLCGINML 329
Query: 345 ASYPIK 350
AS+P+K
Sbjct: 330 ASFPVK 335
>gi|302812789|ref|XP_002988081.1| hypothetical protein SELMODRAFT_183539 [Selaginella moellendorffii]
gi|300144187|gb|EFJ10873.1| hypothetical protein SELMODRAFT_183539 [Selaginella moellendorffii]
Length = 425
Score = 345 bits (885), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 174/321 (54%), Positives = 219/321 (68%), Gaps = 12/321 (3%)
Query: 39 TSNDKLIDLFESWMSKFEKVYESLDEKLE-RFEIFKDNLRHIDETNRKIKN-YWLGLNEF 96
+S+ L + SW +KF K S + + RFE FK+N R+I+E NR K+ Y LGLN+F
Sbjct: 4 SSDSDLSGEYASWCAKFGKECASSNSLGDHRFETFKENFRYIEEHNRAGKHSYRLGLNQF 63
Query: 97 ADLRHEEFKEMFLGLKPDL-------ARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHV 149
+DL EEF++ FLGL+PDL R E F VDLP SVDWR+ GAVT
Sbjct: 64 SDLTSEEFRQRFLGLRPDLIDSPVLKMPRDSDIEEGFQN---VDLPASVDWRQHGAVTAP 120
Query: 150 KNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYI 209
K+QGSCG CWAF+T A+EGINQIVTG L SLSEQELIDCD + GC+GGLM+ A+Q+I
Sbjct: 121 KDQGSCGGCWAFATTGAIEGINQIVTGQLVSLSEQELIDCDKKADKGCDGGLMENAYQFI 180
Query: 210 VSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAI 269
V GGL E DYPY E C M K S VV I+GY +P+ E +LL A+A QP+SVAI
Sbjct: 181 VENGGLDTETDYPYHASESHCNMKKLNSRVVAIDGYKAIPEGDEQALLLAVAKQPVSVAI 240
Query: 270 EASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMK 329
E + +DFQ Y+ GV+ GHCG +++HGV VGYG+ GLDY IVKNSW WG+ G+++M+
Sbjct: 241 EGASKDFQHYASGVFTGHCGEEINHGVLIVGYGTEDGLDYWIVKNSWAATWGDGGFVKMQ 300
Query: 330 RNTGKPEGLCGINKMASYPIK 350
RNTGK GLC IN +ASYP+K
Sbjct: 301 RNTGKRGGLCSINTLASYPVK 321
>gi|302142276|emb|CBI19479.3| unnamed protein product [Vitis vinifera]
Length = 388
Score = 345 bits (884), Expect = 3e-92, Method: Compositional matrix adjust.
Identities = 169/306 (55%), Positives = 212/306 (69%), Gaps = 32/306 (10%)
Query: 47 LFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKE 106
++E+W++K K Y +L EK RF+IFKDNLR IDE N + + Y +
Sbjct: 3 VYEAWLAKHGKSYNALGEKERRFQIFKDNLRFIDEHNAENRTYKI--------------- 47
Query: 107 MFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAA 166
+ ++++ LP+SVDWRKKGAV VK+QGSCGSCWAFST+AA
Sbjct: 48 ----------------SDRYAFRVGDSLPESVDWRKKGAVVEVKDQGSCGSCWAFSTIAA 91
Query: 167 VEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIME 226
VEGIN+IVTG L SLSEQEL+DCD +YN GCNGGLMDYAF++I++ GG+ EEDYPY
Sbjct: 92 VEGINKIVTGGLISLSEQELVDCDTSYNEGCNGGLMDYAFEFIINNGGIDSEEDYPYKAS 151
Query: 227 EGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDG 286
+G C+ + ++VVTI+GY DVP+N E SL KA+ANQP+SVAIEA GR+FQ Y G++ G
Sbjct: 152 DGRCDQYRKNAKVVTIDGYEDVPENDEKSLEKAVANQPVSVAIEAGGREFQLYQSGIFTG 211
Query: 287 HCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTG-KPEGLCGINKMA 345
CGT LDHGV AVGYG+ G+DY IVKNSWG WGE+GYIRM+R+ G CGI A
Sbjct: 212 RCGTALDHGVTAVGYGTENGVDYWIVKNSWGASWGEEGYIRMERDLATSATGKCGIAMEA 271
Query: 346 SYPIKK 351
SYPIKK
Sbjct: 272 SYPIKK 277
>gi|317106666|dbj|BAJ53169.1| JHL18I08.3 [Jatropha curcas]
Length = 441
Score = 345 bits (884), Expect = 3e-92, Method: Compositional matrix adjust.
Identities = 168/310 (54%), Positives = 211/310 (68%), Gaps = 2/310 (0%)
Query: 43 KLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRK-IKNYWLGLNEFADLRH 101
++ LFE+W + K Y S +EKL R ++F+DN + E N + +Y L LN FADL H
Sbjct: 25 EIAHLFETWCQQHGKTYASQEEKLFRLKVFQDNYDFVTEHNSQGNSSYTLSLNAFADLTH 84
Query: 102 EEFKEMFLGLKPDLARRKDQSHEDFSYKD-VVDLPKSVDWRKKGAVTHVKNQGSCGSCWA 160
EFK LGL + + + D V D+P SVDWRK GAVT VK+QG+CG+CW+
Sbjct: 85 HEFKASRLGLSSAASASLNVDRSNRQIPDFVADVPASVDWRKNGAVTQVKDQGNCGACWS 144
Query: 161 FSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEED 220
FS A+EGIN+IVTG+L SLSEQEL+DCD +YNNGC GG+MDYAFQ+++ G+ EED
Sbjct: 145 FSATGAIEGINKIVTGSLVSLSEQELVDCDKSYNNGCEGGIMDYAFQFVIDNHGIDTEED 204
Query: 221 YPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYS 280
YPY + +C K + VVTI+GY DVPQN+E LLKA+ANQP+SV I S R FQ YS
Sbjct: 205 YPYQGRDRSCNKEKLKRHVVTIDGYVDVPQNNEKELLKAVANQPVSVGICGSERAFQLYS 264
Query: 281 GGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCG 340
G++ G C T LDH V VGYGS G+DY IVKNSWG WG GY+ M+RN+G GLCG
Sbjct: 265 KGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGSYWGMDGYMHMQRNSGSSRGLCG 324
Query: 341 INKMASYPIK 350
IN +ASYP K
Sbjct: 325 INMLASYPKK 334
>gi|129614|sp|P00784.1|PAPA1_CARPA RecName: Full=Papain; AltName: Full=Papaya proteinase I; Short=PPI;
AltName: Allergen=Car p 1; Flags: Precursor
gi|167391|gb|AAB02650.1| papain precursor [Carica papaya]
gi|387885|gb|AAA72774.1| papain [synthetic construct]
gi|225437|prf||1303270A papain
Length = 345
Score = 344 bits (883), Expect = 3e-92, Method: Compositional matrix adjust.
Identities = 174/348 (50%), Positives = 233/348 (66%), Gaps = 7/348 (2%)
Query: 3 LSSQFKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESL 62
+ S K + ++ C+ ++ SF DFSIVGYS DLTS ++LI LFESWM K K+Y+++
Sbjct: 4 IPSISKLLFVAICLFVYMGLSFG-DFSIVGYSQNDLTSTERLIQLFESWMLKHNKIYKNI 62
Query: 63 DEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQS 122
DEK+ RFEIFKDNL++IDETN+K +YWLGLN FAD+ ++EFKE + G + S
Sbjct: 63 DEKIYRFEIFKDNLKYIDETNKKNNSYWLGLNVFADMSNDEFKEKYTGSIAGNYTTTELS 122
Query: 123 HEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLS 182
+E+ V++P+ VDWR+KGAVT VKNQGSCGSCWAFS V +EGI +I TGNL S
Sbjct: 123 YEEVLNDGDVNIPEYVDWRQKGAVTPVKNQGSCGSCWAFSAVVTIEGIIKIRTGNLNEYS 182
Query: 183 EQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTI 242
EQEL+DCD + GCNGG A Q +V+ G+H YPY + C +
Sbjct: 183 EQELLDCDRR-SYGCNGGYPWSALQ-LVAQYGIHYRNTYPYEGVQRYCRSREKGPYAAKT 240
Query: 243 NGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYG 302
+G V +E +LL ++ANQP+SV +EA+G+DFQ Y GG++ G CG ++DH VAAVGYG
Sbjct: 241 DGVRQVQPYNEGALLYSIANQPVSVVLEAAGKDFQLYRGGIFVGPCGNKVDHAVAAVGYG 300
Query: 303 STRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
+YI++KNSWG WGE GYIR+KR TG G+CG+ + YP+K
Sbjct: 301 P----NYILIKNSWGTGWGENGYIRIKRGTGNSYGVCGLYTSSFYPVK 344
>gi|1173630|gb|AAB37233.1| cysteine proteinase [Phalaenopsis sp. SM9108]
Length = 359
Score = 344 bits (883), Expect = 3e-92, Method: Compositional matrix adjust.
Identities = 183/331 (55%), Positives = 227/331 (68%), Gaps = 12/331 (3%)
Query: 29 SIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETN-RKIK 87
+ + + +DL + D L +L+E W S V LDEK +RF +FK+N R+I + N RK
Sbjct: 19 TAIDIADKDLETEDSLWNLYERWRS-HHTVSRDLDEKQKRFNVFKENPRYIHDFNKRKDI 77
Query: 88 NYWLGLNEFADLRHEEFKEMFLGLKPDLAR-----RKDQSHEDFSYK--DVVDLPKSVDW 140
Y L LN+FADL + EF+ + G + + R R+ + F Y+ D LP S+DW
Sbjct: 78 PYKLRLNKFADLTNHEFRSTYAGSRINHHRSLRGSRRGGATNSFMYQSLDSRSLPASIDW 137
Query: 141 RKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGG 200
R+KGAVT VK+QG CGSCWAFSTVAAVEGINQI T L SLSEQELIDCD NNGCNGG
Sbjct: 138 RQKGAVTAVKDQGQCGSCWAFSTVAAVEGINQIKTKKLLSLSEQELIDCDTDENNGCNGG 197
Query: 201 LMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKAL 260
LMDYAF +I GG+ E +YPY E+ C T+ +S VV+I+G+ DVP N EDSLLKA+
Sbjct: 198 LMDYAFDFIKKNGGISSEAEYPYAAEDSYC-ATEKKSHVVSIDGHEDVPANDEDSLLKAV 256
Query: 261 ANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGST-RGLDYIIVKNSWGPK 319
ANQP+S+AIEASG DFQFYS GV+ G GT+LDHGVA VGYG T +G Y IV+NSWG +
Sbjct: 257 ANQPVSIAIEASGYDFQFYSEGVFTGRSGTELDHGVAIVGYGKTQQGTKYWIVRNSWGAE 316
Query: 320 WGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
WGEKGYIR+ + LCG+ ASYPIK
Sbjct: 317 WGEKGYIRISAASDSKR-LCGLAMEASYPIK 346
>gi|4100157|gb|AAD10337.1| cysteine proteinase precursor [Hordeum vulgare]
Length = 365
Score = 344 bits (882), Expect = 4e-92, Method: Compositional matrix adjust.
Identities = 175/326 (53%), Positives = 222/326 (68%), Gaps = 7/326 (2%)
Query: 31 VGYSPEDLTSNDKLIDLFESWMSKFEKVYESL--DEKLERFEIFKDNLRHIDETNRKIKN 88
V + +DL S + L L+E W S + L D RF +FK N R++ E N++
Sbjct: 24 VPLTEKDLASEESLRGLYERWRSHYTVSRRGLGADAGERRFNVFKQNARYVHEGNKRDMP 83
Query: 89 YWLGLNEFADLRHEEFKEMFLG--LKPDLA-RRKDQSHEDFSYKDVVDLPKSVDWRKKGA 145
+ L LN+FAD+ +EF+ + G ++ L+ + F Y D +LP +VDWR+KGA
Sbjct: 84 FRLALNKFADMTTDEFRRTYAGSRVRHHLSLSGGRRGDGGFRYGDADNLPPAVDWRQKGA 143
Query: 146 VTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYA 205
VT +K+QG CGSCWAFST+ AVEGIN+I TG L SLSEQEL+DCDN N GC+GGLMDYA
Sbjct: 144 VTAIKDQGQCGSCWAFSTIVAVEGINKIRTGKLVSLSEQELMDCDNVNNQGCDGGLMDYA 203
Query: 206 FQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPL 265
FQ+I G+ E +YPY E+G+C+ K ++ VTI+GY DVP N E +L KA+A QP+
Sbjct: 204 FQFI-QKNGITTESNYPYQGEQGSCDQAKENAQAVTIDGYEDVPANDESALQKAVAGQPV 262
Query: 266 SVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTR-GLDYIIVKNSWGPKWGEKG 324
SVAI+ASG+DFQFYS GV+ G C T LDHGVAAVGYG+TR G Y IVKNSWG WGEKG
Sbjct: 263 SVAIDASGQDFQFYSEGVFTGECSTDLDHGVAAVGYGATRDGTKYWIVKNSWGEDWGEKG 322
Query: 325 YIRMKRNTGKPEGLCGINKMASYPIK 350
YIRM+R + EGLCGI ASYP K
Sbjct: 323 YIRMQRGVSQTEGLCGIAMQASYPTK 348
>gi|297816028|ref|XP_002875897.1| hypothetical protein ARALYDRAFT_347926 [Arabidopsis lyrata subsp.
lyrata]
gi|297321735|gb|EFH52156.1| hypothetical protein ARALYDRAFT_347926 [Arabidopsis lyrata subsp.
lyrata]
Length = 361
Score = 344 bits (882), Expect = 5e-92, Method: Compositional matrix adjust.
Identities = 176/349 (50%), Positives = 233/349 (66%), Gaps = 10/349 (2%)
Query: 7 FKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKL 66
K +L+ F S I + A F Y +++ S + L L++ W S V SL E+
Sbjct: 1 MKQLLLIFLFSLVILET-ACGFD---YEDKEIESEEGLSKLYDRWRS-HHSVPRSLHERE 55
Query: 67 ERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLAR-----RKDQ 121
+RF +F+ N+ H+ +N+K ++Y L LN+FADL EFK + G K R ++
Sbjct: 56 KRFNVFRHNVMHVHNSNKKNRSYKLKLNKFADLTIHEFKNAYTGSKIKHHRMLQGPKRGS 115
Query: 122 SHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASL 181
+ +++V LP SVDWRKKGAVT +KNQG CGSCWAFSTVAAVEGIN+I T L SL
Sbjct: 116 KQFMYDHENVSKLPSSVDWRKKGAVTEIKNQGKCGSCWAFSTVAAVEGINKIKTNKLVSL 175
Query: 182 SEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVT 241
SEQEL+DCD N GCNGGLM+ AF++I GG+ E+ YPY +G C+ +K +VT
Sbjct: 176 SEQELVDCDTNQNEGCNGGLMEIAFEFIKKNGGITTEDSYPYEGIDGKCDASKDNGVLVT 235
Query: 242 INGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGY 301
I+G+ +VP+N E++LLKA+ANQP+SVAI+A DFQFYS GV+ G CGT+L+HGVA VGY
Sbjct: 236 IDGHENVPENDENALLKAVANQPVSVAIDAGSSDFQFYSEGVFTGDCGTELNHGVATVGY 295
Query: 302 GSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
GS G Y IV+NSWG +WGE GYI+++R +PEG CGI ASYPIK
Sbjct: 296 GSQGGKKYWIVRNSWGTEWGEGGYIKIERGIDEPEGRCGIAMEASYPIK 344
>gi|357162587|ref|XP_003579458.1| PREDICTED: oryzain beta chain-like [Brachypodium distachyon]
Length = 470
Score = 343 bits (881), Expect = 5e-92, Method: Compositional matrix adjust.
Identities = 173/349 (49%), Positives = 234/349 (67%), Gaps = 13/349 (3%)
Query: 16 ISFFIRSSFARDFSIVGYSPEDLTSNDKLID-----LFESWMSKF-EKVYESLDEKLERF 69
+S F + D SI+ Y+ E + + ++ W ++ SL E+ RF
Sbjct: 15 VSGFGACAAGPDMSIISYNAEHGARGLERTEAEARAIYGLWRAEHGSGNSNSLGEEERRF 74
Query: 70 EIFKDNLRHIDETNRKI----KNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSH-- 123
F DNLR +D N + + + LG+N FADL ++EF+ +LG+K RR ++
Sbjct: 75 RAFWDNLRFVDAHNARAAAGEEGFRLGMNRFADLTNDEFRAAYLGVKGAGQRRSARAGVG 134
Query: 124 EDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSE 183
E + + V +LP++VDWR+KGAV VKNQG CGSCWAFS V+AVE INQ+VTG L +LSE
Sbjct: 135 ERYRHDGVEELPEAVDWREKGAVAPVKNQGQCGSCWAFSAVSAVESINQLVTGELVTLSE 194
Query: 184 QELIDCD-NTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTI 242
QEL++CD N +NGCNGGLMD AF +I++ GG+ E+DYPY +G C++ + ++VV+I
Sbjct: 195 QELVECDINGQSNGCNGGLMDDAFDFIINNGGIDTEDDYPYKALDGKCDINRRNAKVVSI 254
Query: 243 NGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYG 302
+G+ DVP+N E SL KA+A+QP+SVAIEA GR+FQ Y GV+ G CGT+LDHGV AVGYG
Sbjct: 255 DGFEDVPENDEKSLQKAVAHQPVSVAIEAGGREFQLYHSGVFTGRCGTELDHGVVAVGYG 314
Query: 303 STRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIKK 351
+ G DY IV+NSWGPKWGE GY+RM+RN G CGI M+SYP KK
Sbjct: 315 TENGKDYWIVRNSWGPKWGEAGYLRMERNINATTGKCGIAMMSSYPTKK 363
>gi|226501480|ref|NP_001150266.1| cysteine protease 1 precursor [Zea mays]
gi|195637948|gb|ACG38442.1| cysteine protease 1 precursor [Zea mays]
Length = 462
Score = 343 bits (880), Expect = 8e-92, Method: Compositional matrix adjust.
Identities = 168/334 (50%), Positives = 227/334 (67%), Gaps = 10/334 (2%)
Query: 28 FSIVGYSPEDLT-----SNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDET 82
SI+ Y+ E + + L+E W+++ + Y +L E+ RF +F DNLR +D
Sbjct: 24 MSIISYNEEHAARGLERTEPEARTLYELWLAEHGRAYNALGERDRRFRVFWDNLRFVDAH 83
Query: 83 NRKIK--NYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQS-HEDFSYKD-VVDLPKSV 138
N + + LG+N+FADL ++EF+ +LG + ARR+ + E + + +LP+SV
Sbjct: 84 NERAAEHGFRLGMNQFADLTNDEFRAAYLGARIPAARRRGTAVGERYRHGGGAEELPESV 143
Query: 139 DWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCD-NTYNNGC 197
DWR+KGAV VKNQG CGSCWAFS V++VE +NQIVTG + +LSEQEL++C + N+GC
Sbjct: 144 DWREKGAVAPVKNQGQCGSCWAFSAVSSVESVNQIVTGEMVTLSEQELVECSTDGGNSGC 203
Query: 198 NGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLL 257
NGGLMD AF +I+ GG+ E DYPY +G C++ + ++VV+I+G+ DVP+N E SL
Sbjct: 204 NGGLMDAAFDFIIKNGGIDTEGDYPYKAVDGKCDINRENAKVVSIDGFEDVPENDEKSLQ 263
Query: 258 KALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWG 317
KA+A+QP+SVAIEA GR+FQ Y GV+ G C T LDHGV AVGYG+ G DY IV+NSWG
Sbjct: 264 KAVAHQPVSVAIEAGGREFQLYKAGVFSGTCTTNLDHGVVAVGYGTENGKDYWIVRNSWG 323
Query: 318 PKWGEKGYIRMKRNTGKPEGLCGINKMASYPIKK 351
KWGE GYIRM+RN G CGI MASYP KK
Sbjct: 324 AKWGEDGYIRMERNVNATTGKCGIAMMASYPTKK 357
>gi|226529105|ref|NP_001150196.1| cysteine protease 1 precursor [Zea mays]
gi|194701798|gb|ACF84983.1| unknown [Zea mays]
gi|194704800|gb|ACF86484.1| unknown [Zea mays]
gi|195637480|gb|ACG38208.1| cysteine protease 1 precursor [Zea mays]
gi|413919895|gb|AFW59827.1| cysteine protease 1 [Zea mays]
Length = 470
Score = 343 bits (879), Expect = 1e-91, Method: Compositional matrix adjust.
Identities = 164/313 (52%), Positives = 218/313 (69%), Gaps = 8/313 (2%)
Query: 47 LFESWMSKFEKVYESLDE----KLERFEIFKDNLRHIDETNRKI--KNYWLGLNEFADLR 100
+++ W+++ + Y +L E + RF +F DNLR +D N + + + LG+N+FADL
Sbjct: 56 MYDLWLAEHGRAYNALGEGEGERDRRFLVFWDNLRFVDAHNERAGARGFRLGMNQFADLT 115
Query: 101 HEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVD-LPKSVDWRKKGAVTHVKNQGSCGSCW 159
++EF+ +LG ARR E + + + LP+SVDWR+KGAV VKNQG CGSCW
Sbjct: 116 NDEFRAAYLGAMVPAARRGAVVGERYRHDGAAEELPESVDWREKGAVAPVKNQGQCGSCW 175
Query: 160 AFSTVAAVEGINQIVTGNLASLSEQELIDCD-NTYNNGCNGGLMDYAFQYIVSTGGLHKE 218
AFS V++VE +NQIVTG + +LSEQEL++C + N+GCNGGLMD AF +I+ GG+ E
Sbjct: 176 AFSAVSSVESVNQIVTGEMVTLSEQELVECSTDGGNSGCNGGLMDAAFDFIIKNGGIDTE 235
Query: 219 EDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQF 278
+DYPY +G C+M + + VV+I+G+ DVP+N E SL KA+A+QP+SVAIEA GR+FQ
Sbjct: 236 DDYPYRAVDGKCDMNRKNARVVSIDGFEDVPENDEKSLQKAVAHQPVSVAIEAGGREFQL 295
Query: 279 YSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGL 338
Y GV+ G C T LDHGV AVGYG+ G DY IV+NSWGPKWGE GYIRM+RN G
Sbjct: 296 YKSGVFSGSCTTNLDHGVVAVGYGAENGKDYWIVRNSWGPKWGEAGYIRMERNVNASTGK 355
Query: 339 CGINKMASYPIKK 351
CGI MASYP KK
Sbjct: 356 CGIAMMASYPTKK 368
>gi|359473128|ref|XP_002285397.2| PREDICTED: vignain-like [Vitis vinifera]
Length = 357
Score = 342 bits (878), Expect = 1e-91, Method: Compositional matrix adjust.
Identities = 177/347 (51%), Positives = 231/347 (66%), Gaps = 10/347 (2%)
Query: 8 KTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLE 67
K IL++ +S + A F + +DL S + L DL+E W S + V L+EK +
Sbjct: 3 KVILVA--LSLVLVFGLAESFD---FDEKDLASEESLWDLYERWRS-YHTVSRDLEEKNK 56
Query: 68 RFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPD---LARRKDQSHE 124
RF +FK+N +H+ + N+ K Y L LN+FAD+ + EF+ + G K + R +
Sbjct: 57 RFNVFKENTKHVHKVNQMDKPYKLKLNKFADMTNHEFRSSYGGSKVKHYRMLRGDRRGTG 116
Query: 125 DFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQ 184
F ++ LP SVDWRKKGAVT +K+QG CGSCWAFSTV VEGINQI T L SLSEQ
Sbjct: 117 GFMHEKTTYLPPSVDWRKKGAVTGIKDQGKCGSCWAFSTVVGVEGINQIKTKELLSLSEQ 176
Query: 185 ELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTING 244
+LIDCD + ++GCNGGLM+ AF++I GG+ E +YPY ++ C+M K + VVTI+G
Sbjct: 177 QLIDCDRSDDHGCNGGLMESAFEFIKKNGGITTENNYPYKAKDERCDMLKMNAPVVTIDG 236
Query: 245 YHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGST 304
+ VP N E +L+KA+A+QP+SVAI+A G D QFYS GV+DG CGT+LDHGVA VGYG+T
Sbjct: 237 HESVPVNDERALMKAVAHQPVSVAIDAGGSDLQFYSEGVFDGECGTELDHGVAIVGYGTT 296
Query: 305 -RGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
G Y IVKNSWG +WGEKGYIRM R EG CGI ASYP+K
Sbjct: 297 LDGTKYWIVKNSWGAEWGEKGYIRMARGIQAAEGQCGIAMEASYPVK 343
>gi|296081395|emb|CBI16828.3| unnamed protein product [Vitis vinifera]
Length = 359
Score = 342 bits (878), Expect = 1e-91, Method: Compositional matrix adjust.
Identities = 177/347 (51%), Positives = 231/347 (66%), Gaps = 10/347 (2%)
Query: 8 KTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLE 67
K IL++ +S + A F + +DL S + L DL+E W S + V L+EK +
Sbjct: 5 KVILVA--LSLVLVFGLAESFD---FDEKDLASEESLWDLYERWRS-YHTVSRDLEEKNK 58
Query: 68 RFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPD---LARRKDQSHE 124
RF +FK+N +H+ + N+ K Y L LN+FAD+ + EF+ + G K + R +
Sbjct: 59 RFNVFKENTKHVHKVNQMDKPYKLKLNKFADMTNHEFRSSYGGSKVKHYRMLRGDRRGTG 118
Query: 125 DFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQ 184
F ++ LP SVDWRKKGAVT +K+QG CGSCWAFSTV VEGINQI T L SLSEQ
Sbjct: 119 GFMHEKTTYLPPSVDWRKKGAVTGIKDQGKCGSCWAFSTVVGVEGINQIKTKELLSLSEQ 178
Query: 185 ELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTING 244
+LIDCD + ++GCNGGLM+ AF++I GG+ E +YPY ++ C+M K + VVTI+G
Sbjct: 179 QLIDCDRSDDHGCNGGLMESAFEFIKKNGGITTENNYPYKAKDERCDMLKMNAPVVTIDG 238
Query: 245 YHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGST 304
+ VP N E +L+KA+A+QP+SVAI+A G D QFYS GV+DG CGT+LDHGVA VGYG+T
Sbjct: 239 HESVPVNDERALMKAVAHQPVSVAIDAGGSDLQFYSEGVFDGECGTELDHGVAIVGYGTT 298
Query: 305 -RGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
G Y IVKNSWG +WGEKGYIRM R EG CGI ASYP+K
Sbjct: 299 LDGTKYWIVKNSWGAEWGEKGYIRMARGIQAAEGQCGIAMEASYPVK 345
>gi|115484973|ref|NP_001067630.1| Os11g0255300 [Oryza sativa Japonica Group]
gi|530335|emb|CAA56844.1| cysteine protease [Oryza sativa Japonica Group]
gi|5761322|dbj|BAA83472.1| cysteine endopeptidase [Oryza sativa Japonica Group]
gi|62732672|gb|AAX94791.1| cysteine proteinase (EC 3.4.22.-) - rice [Oryza sativa Japonica
Group]
gi|62732673|gb|AAX94792.1| cysteine proteinase (EC 3.4.22.-) - rice [Oryza sativa Japonica
Group]
gi|62732674|gb|AAX94793.1| cysteine proteinase (EC 3.4.22.-) - rice [Oryza sativa Japonica
Group]
gi|77549615|gb|ABA92412.1| Thiol protease SEN102 precursor, putative, expressed [Oryza sativa
Japonica Group]
gi|77549616|gb|ABA92413.1| Thiol protease SEN102 precursor, putative, expressed [Oryza sativa
Japonica Group]
gi|77549617|gb|ABA92414.1| Thiol protease SEN102 precursor, putative, expressed [Oryza sativa
Japonica Group]
gi|113644852|dbj|BAF27993.1| Os11g0255300 [Oryza sativa Japonica Group]
gi|125576789|gb|EAZ18011.1| hypothetical protein OsJ_33558 [Oryza sativa Japonica Group]
gi|215701098|dbj|BAG92522.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 378
Score = 342 bits (877), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 174/337 (51%), Positives = 223/337 (66%), Gaps = 17/337 (5%)
Query: 31 VGYSPEDLTSNDKLIDLFESWMSKFE--------KVYESLDEKLERFEIFKDNLRHIDET 82
+ ++ DL+S + L L+E W S++ V E RF +F +N R+I E
Sbjct: 25 IPFTESDLSSEESLRALYERWRSRYTVSRPAASGGVGNDDGEARRRFNVFVENARYIHEA 84
Query: 83 NRKI-KNYWLGLNEFADLRHEEFKEMFLGLKP----DLARRKDQSHEDFSY--KDVVDLP 135
NR+ + + L LN+FAD+ +EF+ + G + L+ + F Y D +LP
Sbjct: 85 NRRGGRPFRLALNKFADMTTDEFRRTYAGSRARHHRSLSGGRGGEGGSFRYGGDDEDNLP 144
Query: 136 KSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNN 195
+VDWR++GAVT +K+QG CGSCWAFSTVAAVEG+N+I TG L +LSEQEL+DCD N
Sbjct: 145 PAVDWRERGAVTGIKDQGQCGSCWAFSTVAAVEGVNKIKTGRLVTLSEQELVDCDTGDNQ 204
Query: 196 GCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDS 255
GC+GGLMDYAFQ+I GG+ E +YPY E+G C K S VTI+GY DVP N E +
Sbjct: 205 GCDGGLMDYAFQFIKRNGGITTESNYPYRAEQGRCNKAKASSHDVTIDGYEDVPANDESA 264
Query: 256 LLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTR-GLDYIIVKN 314
L KA+ANQP++VA+EASG+DFQFYS GV+ G CGT LDHGVAAVGYG TR G Y IVKN
Sbjct: 265 LQKAVANQPVAVAVEASGQDFQFYSEGVFTGECGTDLDHGVAAVGYGITRDGTKYWIVKN 324
Query: 315 SWGPKWGEKGYIRMKRN-TGKPEGLCGINKMASYPIK 350
SWG WGE+GYIRM+R + GLCGI ASYP+K
Sbjct: 325 SWGEDWGERGYIRMQRGVSSDSNGLCGIAMEASYPVK 361
>gi|414584879|tpg|DAA35450.1| TPA: cysteine protease 1 [Zea mays]
Length = 522
Score = 342 bits (877), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 168/334 (50%), Positives = 225/334 (67%), Gaps = 10/334 (2%)
Query: 28 FSIVGYSPEDLT-----SNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDET 82
SI+ Y+ E + + L+E W+++ + Y +L E+ RF +F DNLR +D
Sbjct: 84 MSIISYNEEHAARGLERTEPEARTLYELWLAEHGRAYNALGERDRRFRVFWDNLRFVDAH 143
Query: 83 NRKIK--NYWLGLNEFADLRHEEFKEMFLGLK-PDLARRKDQSHEDFSYKD-VVDLPKSV 138
N + + LG+N+FADL ++EF+ +LG + P RR E + + +LP+SV
Sbjct: 144 NERAAEHGFRLGMNQFADLTNDEFRAAYLGARIPASRRRGTAVGERYRHGGGAEELPESV 203
Query: 139 DWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCD-NTYNNGC 197
DWR+KGAV VKNQG CGSCWAFS V++VE +NQIVTG + +LSEQEL++C + N+GC
Sbjct: 204 DWREKGAVAPVKNQGQCGSCWAFSAVSSVESVNQIVTGEMVTLSEQELVECSTDGGNSGC 263
Query: 198 NGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLL 257
NGGLMD AF +I+ GG+ E DYPY +G C++ + ++VV+I+G+ DVP+N E SL
Sbjct: 264 NGGLMDAAFDFIIKNGGIDTEGDYPYKAVDGKCDINRENAKVVSIDGFEDVPENDEKSLQ 323
Query: 258 KALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWG 317
KA+A+QP+SVAIEA GR+FQ Y GV+ G C T LDHGV AVGYG+ G DY IV+NSWG
Sbjct: 324 KAVAHQPVSVAIEAGGREFQLYKAGVFTGTCTTNLDHGVVAVGYGTENGKDYWIVRNSWG 383
Query: 318 PKWGEKGYIRMKRNTGKPEGLCGINKMASYPIKK 351
KWGE GYIRM+RN G CGI MASYP KK
Sbjct: 384 AKWGEDGYIRMERNVNATTGKCGIAMMASYPTKK 417
>gi|255032|gb|AAB23155.1| COT44=cysteine proteinase homolog [Brassica napus, seedling, rapid
cycling base population CrGC5, Peptide, 328 aa]
Length = 328
Score = 342 bits (876), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 164/295 (55%), Positives = 211/295 (71%), Gaps = 6/295 (2%)
Query: 62 LDEKLERFEIFKDNLRHIDETNRKIKN--YWLGLNEFADLRHEEFKEMFLGLKPDLARR- 118
++++ ERF IFKDNLR ID N KN Y LGL FA+L ++E++ ++LG + + RR
Sbjct: 22 INQQDERFNIFKDNLRFIDLHNENNKNATYKLGLTIFANLTNDEYRSLYLGARTEPVRRI 81
Query: 119 ---KDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVT 175
K+ + + + + V++P +VDWR+KGAV +K+QG+CGSCWAFST AAVEGIN+IVT
Sbjct: 82 TKAKNVNMKYSAAVNDVEVPVTVDWRQKGAVNAIKDQGTCGSCWAFSTAAAVEGINKIVT 141
Query: 176 GNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKG 235
G L SLSEQEL+DCD +YN GCNGGLMDYAFQ+I+ GGL+ E+DYPY G C
Sbjct: 142 GELVSLSEQELVDCDKSYNQGCNGGLMDYAFQFIMKNGGLNTEKDYPYHGTNGKCNSLLK 201
Query: 236 ESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHG 295
S VVTI+GY DVP E +L +A++ QP+SVAI+A GR FQ Y G++ G CGT +DH
Sbjct: 202 NSRVVTIDGYEDVPSKDETALKRAVSYQPVSVAIDAGGRAFQHYQSGIFTGKCGTNMDHA 261
Query: 296 VAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
V AVGYGS G+DY IV+NSWG +WGE GYIRM+RN G CGI ASYP+K
Sbjct: 262 VVAVGYGSENGVDYWIVRNSWGTRWGEDGYIRMERNVASKSGKCGIAIEASYPVK 316
>gi|118127|sp|P25251.1|CYSP4_BRANA RecName: Full=Cysteine proteinase COT44; Flags: Precursor
Length = 328
Score = 342 bits (876), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 164/295 (55%), Positives = 211/295 (71%), Gaps = 6/295 (2%)
Query: 62 LDEKLERFEIFKDNLRHIDETNRKIKN--YWLGLNEFADLRHEEFKEMFLGLKPDLARR- 118
++++ ERF IFKDNLR ID N KN Y LGL FA+L ++E++ ++LG + + RR
Sbjct: 22 INQQDERFNIFKDNLRFIDLHNENNKNATYKLGLTIFANLTNDEYRSLYLGARTEPVRRI 81
Query: 119 ---KDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVT 175
K+ + + + +V ++P +VDWR+KGAV +K+QG+CGSCWAFST AAVEGIN+IVT
Sbjct: 82 TKAKNVNMKYSAAVNVDEVPVTVDWRQKGAVNAIKDQGTCGSCWAFSTAAAVEGINKIVT 141
Query: 176 GNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKG 235
G L SLSEQEL+DCD +YN GCNGGLMDYAFQ+I+ GGL+ E+DYPY G C
Sbjct: 142 GELVSLSEQELVDCDKSYNQGCNGGLMDYAFQFIMKNGGLNTEKDYPYHGTNGKCNSLLK 201
Query: 236 ESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHG 295
S VVTI+GY DVP E +L +A++ QP+SVAI+A GR FQ Y G++ G CGT +DH
Sbjct: 202 NSRVVTIDGYEDVPSKDETALKRAVSYQPVSVAIDAGGRAFQHYQSGIFTGKCGTNMDHA 261
Query: 296 VAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
V AVGYGS G+DY IV+NSWG +WGE GYIRM+RN G CGI ASYP+K
Sbjct: 262 VVAVGYGSENGVDYWIVRNSWGTRWGEDGYIRMERNVASKSGKCGIAIEASYPVK 316
>gi|296090463|emb|CBI40282.3| unnamed protein product [Vitis vinifera]
Length = 386
Score = 341 bits (875), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 168/305 (55%), Positives = 209/305 (68%), Gaps = 32/305 (10%)
Query: 47 LFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKE 106
++E+W+ K K Y +L E+ RFEIFKDNLR I+E N + Y +G
Sbjct: 3 VYEAWLVKHGKSYNALGERERRFEIFKDNLRFIEEHNAVNRTYKVG-------------- 48
Query: 107 MFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAA 166
+ +S++ DLP+SVDWR+KGAV VK+QG+CGSCWAFST+AA
Sbjct: 49 -----------------DRYSFRAGEDLPESVDWREKGAVVPVKDQGNCGSCWAFSTIAA 91
Query: 167 VEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIME 226
VEGINQI TG+L SLSEQEL+DCD +YN GCNGGLMDYAF++I++ GG+ EEDYPY
Sbjct: 92 VEGINQIATGDLISLSEQELVDCDKSYNQGCNGGLMDYAFEFIINNGGIDSEEDYPYRAA 151
Query: 227 EGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDG 286
+ TC+ + + VV+I+GY DVPQN E SL KA+ANQP+SVAIEA GR FQ Y GV+ G
Sbjct: 152 DTTCDPNRKNARVVSIDGYEDVPQNDERSLKKAVANQPVSVAIEAGGRAFQLYQSGVFTG 211
Query: 287 HCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRN-TGKPEGLCGINKMA 345
CGTQLDHGV AVGYG+ +DY IV+NSWGP WGE GYI+++RN G G CGI
Sbjct: 212 QCGTQLDHGVVAVGYGTENSVDYWIVRNSWGPNWGESGYIKLERNLAGTETGKCGIAIEP 271
Query: 346 SYPIK 350
SYPIK
Sbjct: 272 SYPIK 276
>gi|238006338|gb|ACR34204.1| unknown [Zea mays]
Length = 465
Score = 341 bits (875), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 168/334 (50%), Positives = 225/334 (67%), Gaps = 10/334 (2%)
Query: 28 FSIVGYSPEDLT-----SNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDET 82
SI+ Y+ E + + L+E W+++ + Y +L E+ RF +F DNLR +D
Sbjct: 27 MSIISYNEEHAARGLERTEPEARTLYELWLAEHGRAYNALGERDRRFRVFWDNLRFVDAH 86
Query: 83 NRKIK--NYWLGLNEFADLRHEEFKEMFLGLK-PDLARRKDQSHEDFSYKD-VVDLPKSV 138
N + + LG+N+FADL ++EF+ +LG + P RR E + + +LP+SV
Sbjct: 87 NERAAEHGFRLGMNQFADLTNDEFRAAYLGARIPASRRRGTAVGERYRHGGGAEELPESV 146
Query: 139 DWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCD-NTYNNGC 197
DWR+KGAV VKNQG CGSCWAFS V++VE +NQIVTG + +LSEQEL++C + N+GC
Sbjct: 147 DWREKGAVAPVKNQGQCGSCWAFSAVSSVESVNQIVTGEMVTLSEQELVECSTDGGNSGC 206
Query: 198 NGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLL 257
NGGLMD AF +I+ GG+ E DYPY +G C++ + ++VV+I+G+ DVP+N E SL
Sbjct: 207 NGGLMDAAFDFIIKNGGIDTEGDYPYKAVDGKCDINRENAKVVSIDGFEDVPENDEKSLQ 266
Query: 258 KALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWG 317
KA+A+QP+SVAIEA GR+FQ Y GV+ G C T LDHGV AVGYG+ G DY IV+NSWG
Sbjct: 267 KAVAHQPVSVAIEAGGREFQLYKAGVFTGTCTTNLDHGVVAVGYGTENGKDYWIVRNSWG 326
Query: 318 PKWGEKGYIRMKRNTGKPEGLCGINKMASYPIKK 351
KWGE GYIRM+RN G CGI MASYP KK
Sbjct: 327 AKWGEDGYIRMERNVNATTGKCGIAMMASYPTKK 360
>gi|115477767|ref|NP_001062479.1| Os08g0556900 [Oryza sativa Japonica Group]
gi|42407937|dbj|BAD09076.1| putative cysteine proteinase [Oryza sativa Japonica Group]
gi|113624448|dbj|BAF24393.1| Os08g0556900 [Oryza sativa Japonica Group]
gi|125562525|gb|EAZ07973.1| hypothetical protein OsI_30231 [Oryza sativa Indica Group]
gi|215701458|dbj|BAG92882.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 385
Score = 341 bits (875), Expect = 3e-91, Method: Compositional matrix adjust.
Identities = 175/323 (54%), Positives = 217/323 (67%), Gaps = 6/323 (1%)
Query: 33 YSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKNYWLG 92
+ +D+ S + L +L+E W + +V L EK RF +FKDN+R I E NR+ + Y L
Sbjct: 33 FGDKDVASEEALWELYERWRGQ-HRVARDLGEKARRFNVFKDNVRLIHEFNRRDEPYKLR 91
Query: 93 LNEFADLRHEEFKEMFLGLKPD---LARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHV 149
LN F D+ +EF+ + + + R + + F Y DLP +VDWR+KGAV V
Sbjct: 92 LNRFGDMTADEFRRAYASSRVSHHRMFRGRGERRSGFMYAGARDLPAAVDWREKGAVGAV 151
Query: 150 KNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDN-TYNNGCNGGLMDYAFQY 208
K+QG CGSCWAFST+AAVEGIN I T NL +LSEQ+L+DCD T N GC+GGLMD AFQY
Sbjct: 152 KDQGQCGSCWAFSTIAAVEGINAIRTSNLTALSEQQLVDCDTKTGNAGCDGGLMDNAFQY 211
Query: 209 IVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVA 268
I GG+ YPY + +C+ + S VTI+GY DVP NSE +L KA+ANQP+SVA
Sbjct: 212 IAKHGGVAASSAYPYRARQSSCKSSAASSPAVTIDGYEDVPANSESALKKAVANQPVSVA 271
Query: 269 IEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGST-RGLDYIIVKNSWGPKWGEKGYIR 327
IEA G FQFYS GV+ G CGT+LDHGVAAVGYG+T G Y IV+NSWG WGEKGYIR
Sbjct: 272 IEAGGSHFQFYSEGVFAGKCGTELDHGVAAVGYGTTVDGTKYWIVRNSWGADWGEKGYIR 331
Query: 328 MKRNTGKPEGLCGINKMASYPIK 350
MKR+ EGLCGI ASYPIK
Sbjct: 332 MKRDVSAKEGLCGIAMEASYPIK 354
>gi|558563|emb|CAA57538.1| cysteine proteinase [Cicer arietinum]
Length = 325
Score = 340 bits (872), Expect = 6e-91, Method: Compositional matrix adjust.
Identities = 168/311 (54%), Positives = 217/311 (69%), Gaps = 8/311 (2%)
Query: 45 IDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEF 104
+ ++E W+ K +K+Y L EK RF+IFKDNLR IDE N + +Y +GLN+FAD+ +EE+
Sbjct: 1 MTMYEKWLVKHQKMYNGLGEKDTRFQIFKDNLRFIDEHNAQNYSYKVGLNKFADINNEEY 60
Query: 105 KEMFLGLKPDLARR----KDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWA 160
++M+LG K D RR K H +Y V+ K VDWR KGAVTH+K+QGSCGSCWA
Sbjct: 61 RDMYLGTKSDAKRRVMKTKITGHR-ITYNSVIVTVK-VDWRLKGAVTHIKDQGSCGSCWA 118
Query: 161 FSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEED 220
FST+A VE IN+IVTG SLSEQEL+DCD +N GCNGGLMDYAF++I+ GG+ ++D
Sbjct: 119 FSTIATVEAINKIVTGKFVSLSEQELVDCDRAFNEGCNGGLMDYAFEFIIRNGGIDTDQD 178
Query: 221 YPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYS 280
YPY E C+ TK ++VV+I+GY DVP + ++L KA+A+QP+SVAI GR Q Y
Sbjct: 179 YPYNGFERKCDPTKKNAKVVSIDGYEDVP-SYMNALKKAVAHQPVSVAIAGLGRALQLYQ 237
Query: 281 GGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRM-KRNTGKPEGLC 339
GV+ G CGT LDHGV VGYGS G+DY +V+NSWG WGE GY ++ RN C
Sbjct: 238 SGVFTGKCGTDLDHGVVVVGYGSENGVDYWLVRNSWGTNWGEDGYFKIASRNVKSLYRKC 297
Query: 340 GINKMASYPIK 350
GI ASYP+K
Sbjct: 298 GIAMEASYPVK 308
>gi|4731374|gb|AAD28477.1|AF133839_1 papain-like cysteine protease [Sandersonia aurantiaca]
Length = 357
Score = 340 bits (871), Expect = 7e-91, Method: Compositional matrix adjust.
Identities = 178/345 (51%), Positives = 224/345 (64%), Gaps = 14/345 (4%)
Query: 14 FCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFK 73
F + + +F SI +DL S D L L+E W S V LD+K +RF +FK
Sbjct: 5 FPVLLVLALAFGSTLSIP-IKEKDLESEDSLWSLYERWRSH-HAVSRDLDQKQKRFNVFK 62
Query: 74 DNLRHIDETNR-KIKNYWLGLNEFADLRHEEFKEMFLGLKPDL------ARRKDQSHEDF 126
+N++ I E N+ K + L LN+F D+ ++EF+ + G K +R S F
Sbjct: 63 ENVKFIHEFNKNKDVTFKLALNKFGDMTNQEFRAKYAGSKVHHHRTMKGSRHGSGSGAKF 122
Query: 127 SYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQEL 186
Y++ V P S+DWR++GAV VKNQG CGSCWAFS +AAVEGINQIVT L LSEQEL
Sbjct: 123 MYENAV-APPSIDWRERGAVAAVKNQGQCGSCWAFSAIAAVEGINQIVTKELVPLSEQEL 181
Query: 187 IDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYH 246
IDCD N GC+GGLMDYAF++I + GG+ E+ YPY E+ TC K S V I+GY
Sbjct: 182 IDCDTDQNQGCSGGLMDYAFEFIKNNGGITTEDVYPYQAEDATC---KKNSPAVVIDGYE 238
Query: 247 DVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTR- 305
DVP N ED+L+KA+ANQP++VAIEASG FQFYS GV+ G CGT+LDHGVA VGYG+T+
Sbjct: 239 DVPTNDEDALMKAVANQPVAVAIEASGYVFQFYSEGVFTGRCGTELDHGVAVVGYGTTQD 298
Query: 306 GLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
G Y V+NSWG WGE GY+RM+R GLCGI ASYPIK
Sbjct: 299 GTKYWTVRNSWGADWGESGYVRMQRGIKATHGLCGIAMQASYPIK 343
>gi|297802228|ref|XP_002868998.1| cysteine proteinase [Arabidopsis lyrata subsp. lyrata]
gi|297314834|gb|EFH45257.1| cysteine proteinase [Arabidopsis lyrata subsp. lyrata]
Length = 375
Score = 340 bits (871), Expect = 7e-91, Method: Compositional matrix adjust.
Identities = 168/291 (57%), Positives = 211/291 (72%), Gaps = 7/291 (2%)
Query: 67 ERFEIFKDNLRHIDETNRKIKN--YWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHE 124
+RF IFKDNLR ID N K KN Y LGL +F DL +EE++ ++LG + + RR ++
Sbjct: 72 KRFNIFKDNLRFIDLHNEKNKNATYKLGLTKFTDLTNEEYRSLYLGARTEPVRRIAKAKN 131
Query: 125 -DFSYKDVVD---LPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLAS 180
+ Y VD +P++VDWR KGAV +K+QG+CGSCWAFST AAVEGIN+IVTG L S
Sbjct: 132 VNQKYSAAVDGKEVPETVDWRLKGAVNPIKDQGTCGSCWAFSTAAAVEGINKIVTGELIS 191
Query: 181 LSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVV 240
LSEQEL+DCDN+YN GCNGGLMDYAFQ+I+ GGL E+DYPY G C ++VV
Sbjct: 192 LSEQELVDCDNSYNQGCNGGLMDYAFQFIMKNGGLKTEKDYPYRGFGGKCNSFLKNAKVV 251
Query: 241 TINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVG 300
+I+GY DVP E +L +A++ QP+SVAIEA GR FQ Y G++ G+CGT LDH V AVG
Sbjct: 252 SIDGYEDVPTKDETALKRAISLQPVSVAIEAGGRIFQHYQTGIFTGNCGTNLDHAVVAVG 311
Query: 301 YGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPE-GLCGINKMASYPIK 350
YGS G+DY IV+NSWGP+WGE+GYIRM+RN + G CGI ASYP+K
Sbjct: 312 YGSENGVDYWIVRNSWGPRWGEEGYIRMERNLASSKSGKCGIAVEASYPVK 362
>gi|111073717|dbj|BAF02547.1| triticain beta [Triticum aestivum]
Length = 472
Score = 340 bits (871), Expect = 8e-91, Method: Compositional matrix adjust.
Identities = 167/339 (49%), Positives = 226/339 (66%), Gaps = 14/339 (4%)
Query: 27 DFSIVGYSPEDLTSNDKLID-----LFESWMSK----FEKVYESLDEKLERFEIFKDNLR 77
D SI+ Y+ E + + +++ W+++ S+ E+ RF F DNL
Sbjct: 27 DMSIIAYNAEHGARGLERTEAEARAVYDLWLAENGGGSSPNANSIPERERRFRAFWDNLN 86
Query: 78 HIDETNRKI----KNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVD 133
+D N + + Y LG+N FADL ++EF+ +LG+K AR E + + +
Sbjct: 87 FVDAHNARAAAGEEGYRLGMNRFADLTNDEFRAAYLGVKAQRARPGRMVGERYRHDGAEE 146
Query: 134 LPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCD-NT 192
LP++VDWR+KGAV VKNQG CGSCWAFS V+ VE INQIVTG + +LSEQEL++CD N
Sbjct: 147 LPEAVDWREKGAVAPVKNQGQCGSCWAFSAVSTVESINQIVTGEMVTLSEQELVECDTNG 206
Query: 193 YNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNS 252
++GCNGGLMD AF++I+ GG+ E+DYPY +G C++ + ++VV+I+G+ DVP+N
Sbjct: 207 QSSGCNGGLMDDAFEFIIKNGGIDTEDDYPYKAIDGRCDVLRKNAKVVSIDGFEDVPEND 266
Query: 253 EDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIV 312
E SL KA+A+QP+SVAIEA GR+FQ Y GV+ G CGTQLDHGV AVGYG+ G DY IV
Sbjct: 267 EKSLQKAVAHQPVSVAIEAGGREFQLYHSGVFSGRCGTQLDHGVVAVGYGTENGKDYWIV 326
Query: 313 KNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIKK 351
+NSWGP WGE GY+RM+RN G CGI M+SYP KK
Sbjct: 327 RNSWGPNWGESGYLRMERNINVTSGKCGIAMMSSYPTKK 365
>gi|242077600|ref|XP_002448736.1| hypothetical protein SORBIDRAFT_06g032320 [Sorghum bicolor]
gi|241939919|gb|EES13064.1| hypothetical protein SORBIDRAFT_06g032320 [Sorghum bicolor]
Length = 467
Score = 340 bits (871), Expect = 9e-91, Method: Compositional matrix adjust.
Identities = 163/309 (52%), Positives = 214/309 (69%), Gaps = 4/309 (1%)
Query: 47 LFESWMSKF-EKVYESLDEKLERFEIFKDNLRHIDETNRKI--KNYWLGLNEFADLRHEE 103
++E W+ + +V L E RF +F DNLR +D N + + LG+N+FADL ++E
Sbjct: 55 MYELWLVEHGRRVSNVLGEHDSRFRVFWDNLRFVDAHNERAGEHGFRLGMNQFADLTNDE 114
Query: 104 FKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFST 163
F+ +LG + AR + E + + +LP+SVDWR+KGAV VKNQG CGSCWAFS
Sbjct: 115 FRAAYLGARIPAARSGNAVGEMYRHDGAEELPESVDWREKGAVAPVKNQGQCGSCWAFSA 174
Query: 164 VAAVEGINQIVTGNLASLSEQELIDCD-NTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYP 222
V++VE INQIVTG + +LSEQEL++C + N+GCNGGLMD AF +I+ GG+ E+DYP
Sbjct: 175 VSSVESINQIVTGEMVTLSEQELVECSTDGGNSGCNGGLMDAAFNFIIKNGGIDTEDDYP 234
Query: 223 YIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGG 282
Y +G C++ + ++VV+I+ + DVP+N E SL KA+A+QP+SVAIEA GR FQ Y G
Sbjct: 235 YKAVDGKCDINRRNAKVVSIDAFEDVPENDEKSLQKAVAHQPVSVAIEAGGRQFQLYKSG 294
Query: 283 VYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGIN 342
V+ G C T LDHGV AVGYG+ G DY IV+NSWGPKWGE GYIRM+RN G CGI
Sbjct: 295 VFSGSCTTNLDHGVVAVGYGTENGKDYWIVRNSWGPKWGEAGYIRMERNINATTGKCGIA 354
Query: 343 KMASYPIKK 351
MASYP KK
Sbjct: 355 MMASYPTKK 363
>gi|125533982|gb|EAY80530.1| hypothetical protein OsI_35710 [Oryza sativa Indica Group]
Length = 378
Score = 339 bits (870), Expect = 9e-91, Method: Compositional matrix adjust.
Identities = 173/337 (51%), Positives = 221/337 (65%), Gaps = 17/337 (5%)
Query: 31 VGYSPEDLTSNDKLIDLFESWMSKFE--------KVYESLDEKLERFEIFKDNLRHIDET 82
+ ++ DL+S + L L+E W S++ V E RF +F +N R+I E
Sbjct: 25 IPFTESDLSSEESLRALYERWRSRYTVSRPAASGGVGNDDGEARRRFNVFVENARYIHEA 84
Query: 83 NRKI-KNYWLGLNEFADLRHEEFKEMFLGLKP----DLARRKDQSHEDFSY--KDVVDLP 135
NR+ + + L LN+FAD+ +EF+ + G + L + F Y D +LP
Sbjct: 85 NRRGGRPFRLALNKFADMTTDEFRRTYAGSRARHHRSLRGGRGGEGGSFRYGGDDEDNLP 144
Query: 136 KSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNN 195
+VDWR++GAVT +K+QG CGSCWAFS VAAVEG+N+I TG L +LSEQEL+DCD N
Sbjct: 145 PAVDWRERGAVTGIKDQGQCGSCWAFSAVAAVEGVNKIKTGRLVTLSEQELVDCDTGDNQ 204
Query: 196 GCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDS 255
GC+GGLMDYAFQ+I GG+ E +YPY E+G C K S VTI+GY DVP N E +
Sbjct: 205 GCDGGLMDYAFQFIKRNGGITTESNYPYRAEQGRCNKAKASSHDVTIDGYEDVPANDESA 264
Query: 256 LLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTR-GLDYIIVKN 314
L KA+ANQP++VA+EASG+DFQFYS GV+ G CGT LDHGVAAVGYG TR G Y IVKN
Sbjct: 265 LQKAVANQPVAVAVEASGQDFQFYSEGVFTGECGTDLDHGVAAVGYGITRDGTKYWIVKN 324
Query: 315 SWGPKWGEKGYIRMKRN-TGKPEGLCGINKMASYPIK 350
SWG WGE+GYIRM+R + GLCGI ASYP+K
Sbjct: 325 SWGEDWGERGYIRMQRGVSSDSNGLCGIAMEASYPVK 361
>gi|1514953|dbj|BAA11170.1| cysteine proteinase [Oryza sativa (japonica cultivar-group)]
Length = 368
Score = 339 bits (870), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 173/326 (53%), Positives = 216/326 (66%), Gaps = 6/326 (1%)
Query: 29 SIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKN 88
+ + + DL S++ L DL+E W + V EK RF FKDN+R+I E N++
Sbjct: 27 AAIPFDERDLESDEALWDLYERWQ-EHHHVPRHHGEKHRRFGAFKDNVRYIHEHNKRAPG 85
Query: 89 YWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHED---FSYKDVVDLPKSVDWRKKGA 145
Y LN F D+ EEF+ F G + RR + F Y+ V DLP++VDWR+KGA
Sbjct: 86 Y-APLNRFGDMGREEFRATFAGSHANDLRRDGLAAPPLPGFMYEGVRDLPRAVDWRRKGA 144
Query: 146 VTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYA 205
VT VK+QG CGSCWAFSTV +VEGIN I TG L SLSEQELIDCD N+GC GGLM+ A
Sbjct: 145 VTGVKDQGKCGSCWAFSTVVSVEGINAIRTGRLVSLSEQELIDCDTADNSGCQGGLMENA 204
Query: 206 FQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPL 265
F+YI +GG+ E YPY GTC+ + +V I+G+ +VP NSE +L KA+ANQP+
Sbjct: 205 FEYIKHSGGITTESAYPYRAANGTCDAVRARGGLVVIDGHQNVPANSEAALAKAVANQPV 264
Query: 266 SVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTR-GLDYIIVKNSWGPKWGEKG 324
SVAI+A + FQFYS GV+ G CGT LDHGVA VGYG T G +Y IVKNSWG WGE G
Sbjct: 265 SVAIDAGDQSFQFYSDGVFAGDCGTDLDHGVAVVGYGETNDGTEYWIVKNSWGTAWGEGG 324
Query: 325 YIRMKRNTGKPEGLCGINKMASYPIK 350
YIRM+R++G GLCGI ASYP+K
Sbjct: 325 YIRMQRDSGYDGGLCGIAMEASYPVK 350
>gi|50355613|dbj|BAD29955.1| cysteine protease [Daucus carota]
Length = 365
Score = 339 bits (870), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 174/357 (48%), Positives = 229/357 (64%), Gaps = 27/357 (7%)
Query: 1 MALSSQFKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYE 60
MAL+ + + ++ + + +S A S+ + + + + WM+++ +VY+
Sbjct: 1 MALTIKHQCTPLALLFTIGVLASLAAARSL---------NEASMTETHDQWMARYGRVYK 51
Query: 61 SLDEKLERFEIFKDNLRHIDETNR-KIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRK 119
+ +EK R IF++NL++I N+ K Y LG+NEFADL +EEF +R K
Sbjct: 52 TANEKNRRSTIFQENLKYIQTFNKANNKPYKLGVNEFADLTNEEFT---------TSRNK 102
Query: 120 DQSH------EDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQI 173
+SH F Y++V +P ++DWRKKGAVT +KNQG CG CWAFS VAA+EGI Q+
Sbjct: 103 FKSHVCATVTNVFRYENVTAVPATMDWRKKGAVTPIKNQGQCGCCWAFSAVAAMEGITQL 162
Query: 174 VTGNLASLSEQELIDCD-NTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEM 232
TG L SLSEQEL+DCD N + GC GGLMDYAF +I GL E +YPY +GTC
Sbjct: 163 KTGKLISLSEQELVDCDTNGEDQGCEGGLMDYAFDFIQQNHGLSTETNYPYSGTDGTCNA 222
Query: 233 TKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQL 292
K + TI G+ DVP NSE +LLKA+ANQP+SVAI+ASG DFQFYS GV+ G CGT+L
Sbjct: 223 NKEANHAATITGHEDVPANSESALLKAVANQPISVAIDASGSDFQFYSSGVFTGECGTEL 282
Query: 293 DHGVAAVGYGSTR-GLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
DHGV AVGYG+ G Y +VKNSWG WGE+GYI+M+R EGLCGI ASYP
Sbjct: 283 DHGVTAVGYGTAADGTKYWLVKNSWGTSWGEEGYIQMQRGVAAAEGLCGIAMQASYP 339
>gi|195637152|gb|ACG38044.1| vignain precursor [Zea mays]
Length = 377
Score = 339 bits (869), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 169/289 (58%), Positives = 203/289 (70%), Gaps = 9/289 (3%)
Query: 69 FEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLAR--RKDQ----S 122
F +FK N+R I E NR+ + Y L LN F D+ +EF+ + G + R R D+ +
Sbjct: 70 FNVFKANVRLIHEFNRRDEPYKLRLNRFGDMTADEFRRHYAGSRVAHHRMFRGDRQGSSA 129
Query: 123 HEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLS 182
F Y D D+P SVDWR+KGAVT VK+QG CGSCWAFST+AAVEGIN I T NL SLS
Sbjct: 130 SASFMYADARDVPASVDWRQKGAVTDVKDQGQCGSCWAFSTIAAVEGINAIKTKNLTSLS 189
Query: 183 EQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTI 242
EQ+L+DCD N GCNGGLMDYAFQYI GG+ E+ YPY + +C+ K + VVTI
Sbjct: 190 EQQLVDCDTKANAGCNGGLMDYAFQYIAKHGGVAAEDAYPYRARQASCK--KSPAPVVTI 247
Query: 243 NGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYG 302
+GY DVP N E +L KA+A+QP+SVAIEASG FQFYS GV+ G CGT+LDHGVAAVGYG
Sbjct: 248 DGYEDVPANDESALKKAVAHQPVSVAIEASGSHFQFYSEGVFSGRCGTELDHGVAAVGYG 307
Query: 303 STR-GLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
T G Y +VKNSWGP+WGEKGYIRM R+ EG CGI ASYP+K
Sbjct: 308 VTADGTKYWLVKNSWGPEWGEKGYIRMARDVAAKEGHCGIAMEASYPVK 356
>gi|4426617|gb|AAD20453.1| cysteine endopeptidase precursor [Oryza sativa]
Length = 368
Score = 338 bits (868), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 173/326 (53%), Positives = 216/326 (66%), Gaps = 6/326 (1%)
Query: 29 SIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKN 88
+ + + DL S++ L DL+E W + V EK RF FKDN+R+I E N++
Sbjct: 27 AAIPFDERDLESDEALWDLYERWQ-EHHHVPRHHGEKHRRFGAFKDNVRYIHEHNKRAPG 85
Query: 89 YWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHED---FSYKDVVDLPKSVDWRKKGA 145
Y LN F D+ EEF+ F G + RR + F Y+ V DLP++VDWR+KGA
Sbjct: 86 Y-PPLNRFGDMGREEFRATFAGSHANDLRRDGLAAPPLPGFMYEGVRDLPRAVDWRRKGA 144
Query: 146 VTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYA 205
VT VK+QG CGSCWAFSTV +VEGIN I TG L SLSEQELIDCD N+GC GGLM+ A
Sbjct: 145 VTGVKDQGKCGSCWAFSTVVSVEGINAIRTGRLVSLSEQELIDCDTADNSGCQGGLMENA 204
Query: 206 FQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPL 265
F+YI +GG+ E YPY GTC+ + +V I+G+ +VP NSE +L KA+ANQP+
Sbjct: 205 FEYIKHSGGITTESAYPYRAANGTCDAVRARGGLVVIDGHQNVPANSEAALAKAVANQPV 264
Query: 266 SVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTR-GLDYIIVKNSWGPKWGEKG 324
SVAI+A + FQFYS GV+ G CGT LDHGVA VGYG T G +Y IVKNSWG WGE G
Sbjct: 265 SVAIDAGDQSFQFYSDGVFAGDCGTDLDHGVAVVGYGETNDGTEYWIVKNSWGTAWGEGG 324
Query: 325 YIRMKRNTGKPEGLCGINKMASYPIK 350
YIRM+R++G GLCGI ASYP+K
Sbjct: 325 YIRMQRDSGYDGGLCGIAMEASYPVK 350
>gi|186516984|ref|NP_195406.2| cysteine proteinase1 [Arabidopsis thaliana]
gi|15290508|gb|AAK92229.1| cysteine proteinase [Arabidopsis thaliana]
gi|332661313|gb|AEE86713.1| cysteine proteinase1 [Arabidopsis thaliana]
Length = 376
Score = 338 bits (868), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 167/291 (57%), Positives = 210/291 (72%), Gaps = 7/291 (2%)
Query: 67 ERFEIFKDNLRHIDETNRKIKN--YWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHE 124
+RF IFKDNLR ID N KN Y LGL +F DL ++E+++++LG + + ARR ++
Sbjct: 72 KRFNIFKDNLRFIDLHNENNKNATYKLGLTKFTDLTNDEYRKLYLGARTEPARRIAKAKN 131
Query: 125 -DFSYKDVV---DLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLAS 180
+ Y V ++P++VDWR+KGAV +K+QG+CGSCWAFST AAVEGIN+IVTG L S
Sbjct: 132 VNQKYSAAVNGKEVPETVDWRQKGAVNPIKDQGTCGSCWAFSTTAAVEGINKIVTGELIS 191
Query: 181 LSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVV 240
LSEQEL+DCD +YN GCNGGLMDYAFQ+I+ GGL+ E+DYPY G C S VV
Sbjct: 192 LSEQELVDCDKSYNQGCNGGLMDYAFQFIMKNGGLNTEKDYPYRGFGGKCNSFLKNSRVV 251
Query: 241 TINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVG 300
+I+GY DVP E +L KA++ QP+SVAIEA GR FQ Y G++ G CGT LDH V AVG
Sbjct: 252 SIDGYEDVPTKDETALKKAISYQPVSVAIEAGGRIFQHYQSGIFTGSCGTNLDHAVVAVG 311
Query: 301 YGSTRGLDYIIVKNSWGPKWGEKGYIRMKRN-TGKPEGLCGINKMASYPIK 350
YGS G+DY IV+NSWGP+WGE+GYIRM+RN G CGI ASYP+K
Sbjct: 312 YGSENGVDYWIVRNSWGPRWGEEGYIRMERNLAASKSGKCGIAVEASYPVK 362
>gi|46395939|sp|Q94B08.2|GCP1_ARATH RecName: Full=Germination-specific cysteine protease 1; Flags:
Precursor
gi|4006883|emb|CAB16767.1| cysteine proteinase [Arabidopsis thaliana]
gi|7270637|emb|CAB80354.1| cysteine proteinase [Arabidopsis thaliana]
Length = 376
Score = 338 bits (867), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 167/291 (57%), Positives = 210/291 (72%), Gaps = 7/291 (2%)
Query: 67 ERFEIFKDNLRHIDETNRKIKN--YWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHE 124
+RF IFKDNLR ID N KN Y LGL +F DL ++E+++++LG + + ARR ++
Sbjct: 72 KRFNIFKDNLRFIDLHNEDNKNATYKLGLTKFTDLTNDEYRKLYLGARTEPARRIAKAKN 131
Query: 125 -DFSYKDVV---DLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLAS 180
+ Y V ++P++VDWR+KGAV +K+QG+CGSCWAFST AAVEGIN+IVTG L S
Sbjct: 132 VNQKYSAAVNGKEVPETVDWRQKGAVNPIKDQGTCGSCWAFSTTAAVEGINKIVTGELIS 191
Query: 181 LSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVV 240
LSEQEL+DCD +YN GCNGGLMDYAFQ+I+ GGL+ E+DYPY G C S VV
Sbjct: 192 LSEQELVDCDKSYNQGCNGGLMDYAFQFIMKNGGLNTEKDYPYRGFGGKCNSFLKNSRVV 251
Query: 241 TINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVG 300
+I+GY DVP E +L KA++ QP+SVAIEA GR FQ Y G++ G CGT LDH V AVG
Sbjct: 252 SIDGYEDVPTKDETALKKAISYQPVSVAIEAGGRIFQHYQSGIFTGSCGTNLDHAVVAVG 311
Query: 301 YGSTRGLDYIIVKNSWGPKWGEKGYIRMKRN-TGKPEGLCGINKMASYPIK 350
YGS G+DY IV+NSWGP+WGE+GYIRM+RN G CGI ASYP+K
Sbjct: 312 YGSENGVDYWIVRNSWGPRWGEEGYIRMERNLAASKSGKCGIAVEASYPVK 362
>gi|224085750|ref|XP_002307688.1| predicted protein [Populus trichocarpa]
gi|222857137|gb|EEE94684.1| predicted protein [Populus trichocarpa]
Length = 436
Score = 338 bits (867), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 163/306 (53%), Positives = 213/306 (69%), Gaps = 5/306 (1%)
Query: 47 LFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRK-IKNYWLGLNEFADLRHEEFK 105
LFE+W + K Y S +E+ R ++F+DN + + N K +Y L LN FADL H EFK
Sbjct: 28 LFETWCKEHGKSYTSQEERSHRLKVFEDNYDFVTKHNSKGNSSYSLALNAFADLTHHEFK 87
Query: 106 EMFLGLKPDLARRKDQSHEDFSYKDVV-DLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTV 164
LGL A + +H + VV D+P S+DWR KG VT+VK+QGSCG+CW+FS
Sbjct: 88 TSRLGLS---AAPLNLAHRNLEITGVVGDIPASIDWRNKGVVTNVKDQGSCGACWSFSAT 144
Query: 165 AAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYI 224
A+EGIN+IVTG+L SLSEQELI+CD +YN+GC GGLMDYAFQ++++ G+ EEDYPY
Sbjct: 145 GAIEGINKIVTGSLVSLSEQELIECDKSYNDGCGGGLMDYAFQFVINNHGIDTEEDYPYR 204
Query: 225 MEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVY 284
+GTC + + VVTI+ Y DVP+N+E LL+A+A QP+SV I S R FQ YS G++
Sbjct: 205 ARDGTCNKDRMKRRVVTIDKYVDVPENNEKQLLQAVAAQPVSVGICGSERAFQMYSKGIF 264
Query: 285 DGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKM 344
G C T LDH V VGYGS G+DY IVKNSWG WG +GY+ M+RN+G +G+CGIN +
Sbjct: 265 TGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGTGWGMRGYMHMQRNSGNSQGVCGINML 324
Query: 345 ASYPIK 350
ASYP+K
Sbjct: 325 ASYPVK 330
>gi|297603535|ref|NP_001054211.2| Os04g0670200 [Oryza sativa Japonica Group]
gi|109939735|sp|P25777.2|ORYB_ORYSJ RecName: Full=Oryzain beta chain; Flags: Precursor
gi|32488398|emb|CAE02823.1| OSJNBa0043A12.28 [Oryza sativa Japonica Group]
gi|90399163|emb|CAJ86092.1| H0818H01.14 [Oryza sativa Indica Group]
gi|125550169|gb|EAY95991.1| hypothetical protein OsI_17862 [Oryza sativa Indica Group]
gi|215766596|dbj|BAG98700.1| unnamed protein product [Oryza sativa Japonica Group]
gi|255675868|dbj|BAF16125.2| Os04g0670200 [Oryza sativa Japonica Group]
Length = 466
Score = 338 bits (867), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 163/291 (56%), Positives = 207/291 (71%), Gaps = 6/291 (2%)
Query: 64 EKLERFEIFKDNLRHIDETNRKIK---NYWLGLNEFADLRHEEFKEMFLGLKPDLARRKD 120
E RF +F DNL+ +D N + + LG+N FADL +EEF+ FLG K +A R
Sbjct: 70 EHERRFLVFWDNLKFVDAHNARADERGGFRLGMNRFADLTNEEFRATFLGAK--VAERSR 127
Query: 121 QSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLAS 180
+ E + + V +LP+SVDWR+KGAV VKNQG CGSCWAFS V+ VE INQ+VTG + +
Sbjct: 128 AAGERYRHDGVEELPESVDWREKGAVAPVKNQGQCGSCWAFSAVSTVESINQLVTGEMIT 187
Query: 181 LSEQELIDCD-NTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEV 239
LSEQEL++C N N+GCNGGLMD AF +I+ GG+ E+DYPY +G C++ + ++V
Sbjct: 188 LSEQELVECSTNGQNSGCNGGLMDDAFDFIIKNGGIDTEDDYPYKAVDGKCDINRENAKV 247
Query: 240 VTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAV 299
V+I+G+ DVPQN E SL KA+A+QP+SVAIEA GR+FQ Y GV+ G CGT LDHGV AV
Sbjct: 248 VSIDGFEDVPQNDEKSLQKAVAHQPVSVAIEAGGREFQLYHSGVFSGRCGTSLDHGVVAV 307
Query: 300 GYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
GYG+ G DY IV+NSWGPKWGE GY+RM+RN G CGI MASYP K
Sbjct: 308 GYGTDNGKDYWIVRNSWGPKWGESGYVRMERNINVTTGKCGIAMMASYPTK 358
>gi|146215982|gb|ABQ10193.1| actinidin Act2b [Actinidia eriantha]
Length = 378
Score = 337 bits (865), Expect = 4e-90, Method: Compositional matrix adjust.
Identities = 174/352 (49%), Positives = 234/352 (66%), Gaps = 19/352 (5%)
Query: 1 MALSSQFKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYE 60
+++S F + L+ ++ I +S R +ND+++ ++ESW+ + K Y
Sbjct: 8 ISMSLLFFSTLLILSLALDIENSVQR-------------TNDQVMAMYESWLVEQGKSYN 54
Query: 61 SLDEKLERFEIFKDNLRHIDETNRKI-KNYWLGLNEFADLRHEEFKEMFLGLKPDLARRK 119
SLDEK RFEIFK+NLR ID+ N ++Y LGLN FADL EE++ +LGLK + +
Sbjct: 55 SLDEKEMRFEIFKENLRIIDDHNADANRSYSLGLNRFADLTDEEYRSTYLGLK--MGPKT 112
Query: 120 DQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLA 179
D S+E + K LP VDWR GAV VKNQG C SCWAFS V AVEGIN+IVTGNL
Sbjct: 113 DVSNE-YMPKVGEALPDYVDWRTVGAVVGVKNQGLCSSCWAFSAVTAVEGINKIVTGNLI 171
Query: 180 SLSEQELIDCDNTYNN-GCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESE 238
SLSEQEL+DC T GCN GLM AFQ+I++ GG++ E++YPY ++G C ++ +
Sbjct: 172 SLSEQELVDCGRTQRTKGCNRGLMTDAFQFIINNGGINTEDNYPYTAKDGQCNLSLKNQK 231
Query: 239 VVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAA 298
VTI+ Y +VP N+E +L KA+A QP+SV +E+ G F+ Y+ G++ G CGT +DHGV
Sbjct: 232 YVTIDNYKNVPSNNEMALKKAVAYQPVSVGVESEGGKFKLYTSGIFTGFCGTAVDHGVTI 291
Query: 299 VGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
VGYG+ RG+DY IVKNSWG WGE GYIR++RN G G CGI +M SYP+K
Sbjct: 292 VGYGTERGMDYWIVKNSWGTNWGENGYIRIQRNIGG-AGKCGIARMPSYPVK 342
>gi|225446585|ref|XP_002280215.1| PREDICTED: vignain [Vitis vinifera]
Length = 341
Score = 337 bits (865), Expect = 4e-90, Method: Compositional matrix adjust.
Identities = 165/308 (53%), Positives = 215/308 (69%), Gaps = 6/308 (1%)
Query: 44 LIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKI-KNYWLGLNEFADLRHE 102
+ + E WM+++ +VY+ DEK +R++IFKDN+ I+ N+ + K+Y L +NEFADL +E
Sbjct: 35 MYERHEDWMAQYGRVYKDADEKSKRYKIFKDNVARIESFNKAMDKSYKLSINEFADLTNE 94
Query: 103 EFKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFS 162
EF+ K + + S F Y+ V +P +VDWRKKGAVT +K+QG CGSCWAFS
Sbjct: 95 EFRASRNRFKAHICSTEATS---FKYEHVAAVPSTVDWRKKGAVTPIKDQGQCGSCWAFS 151
Query: 163 TVAAVEGINQIVTGNLASLSEQELIDCDNT-YNNGCNGGLMDYAFQYIVSTGGLHKEEDY 221
VAA+EGI Q+ TG L SLSEQEL+DCD + + GCNGGLMD AF++I GL E +Y
Sbjct: 152 AVAAMEGITQLSTGKLISLSEQELVDCDTSGEDQGCNGGLMDDAFKFIEQNHGLATEANY 211
Query: 222 PYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSG 281
PY +GTC K INGY DVP N+E +L KA+A+QP++VAI+A G +FQFYS
Sbjct: 212 PYAGTDGTCNRKKAAHPAAKINGYEDVPANNEKALQKAVAHQPIAVAIDAGGFEFQFYSS 271
Query: 282 GVYDGHCGTQLDHGVAAVGYGST-RGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCG 340
GV+ G CGT+LDHGVAAVGYG++ G+ Y +VKNSWG WGE GYIRM+R+ EGLCG
Sbjct: 272 GVFTGQCGTELDHGVAAVGYGTSDDGMKYWLVKNSWGTGWGEVGYIRMQRDVTAKEGLCG 331
Query: 341 INKMASYP 348
I ASYP
Sbjct: 332 IAMQASYP 339
>gi|115441717|ref|NP_001045138.1| Os01g0907600 [Oryza sativa Japonica Group]
gi|5761329|dbj|BAA83473.1| cysteine endopeptidase [Oryza sativa]
gi|20804884|dbj|BAB92565.1| cysteine endopeptidase [Oryza sativa Japonica Group]
gi|56785107|dbj|BAD82745.1| putative cysteine proteinase [Oryza sativa Japonica Group]
gi|113534669|dbj|BAF07052.1| Os01g0907600 [Oryza sativa Japonica Group]
gi|119395242|gb|ABL74582.1| cysteine endopeptidase [Oryza sativa Japonica Group]
gi|125528777|gb|EAY76891.1| hypothetical protein OsI_04850 [Oryza sativa Indica Group]
gi|125573036|gb|EAZ14551.1| hypothetical protein OsJ_04473 [Oryza sativa Japonica Group]
Length = 371
Score = 337 bits (864), Expect = 4e-90, Method: Compositional matrix adjust.
Identities = 175/328 (53%), Positives = 218/328 (66%), Gaps = 7/328 (2%)
Query: 29 SIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETN-RKIK 87
+ + + DL S++ L DL+E W + V EK RF FKDN+R+I E N R +
Sbjct: 27 AAIPFDERDLESDEALWDLYERWQ-EHHHVPRHHGEKHRRFGAFKDNVRYIHEHNKRGGR 85
Query: 88 NYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHED---FSYKDVVDLPKSVDWRKKG 144
Y L LN F D+ EEF+ F G + RR + F Y+ V DLP++VDWR+KG
Sbjct: 86 GYRLRLNRFGDMGREEFRATFAGSHANDLRRDGLAAPPLPGFMYEGVRDLPRAVDWRRKG 145
Query: 145 AVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDY 204
AVT VK+QG CGSCWAFSTV +VEGIN I TG L SLSEQELIDCD N+GC GGLM+
Sbjct: 146 AVTGVKDQGKCGSCWAFSTVVSVEGINAIRTGRLVSLSEQELIDCDTADNSGCQGGLMEN 205
Query: 205 AFQYIVSTGGLHKEEDYPYIMEEGTCEMTKG-ESEVVTINGYHDVPQNSEDSLLKALANQ 263
AF+YI +GG+ E YPY GTC+ + + +V I+G+ +VP NSE +L KA+ANQ
Sbjct: 206 AFEYIKHSGGITTESAYPYRAANGTCDAVRARRAPLVVIDGHQNVPANSEAALAKAVANQ 265
Query: 264 PLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTR-GLDYIIVKNSWGPKWGE 322
P+SVAI+A + FQFYS GV+ G CGT LDHGVA VGYG T G +Y IVKNSWG WGE
Sbjct: 266 PVSVAIDAGDQSFQFYSDGVFAGDCGTDLDHGVAVVGYGETNDGTEYWIVKNSWGTAWGE 325
Query: 323 KGYIRMKRNTGKPEGLCGINKMASYPIK 350
GYIRM+R++G GLCGI ASYP+K
Sbjct: 326 GGYIRMQRDSGYDGGLCGIAMEASYPVK 353
>gi|3688528|emb|CAA06243.1| pre-pro-TPE4A protein [Pisum sativum]
Length = 360
Score = 337 bits (864), Expect = 5e-90, Method: Compositional matrix adjust.
Identities = 179/330 (54%), Positives = 228/330 (69%), Gaps = 6/330 (1%)
Query: 26 RDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRK 85
R + ++ DL S L DL+E W S V SLDEK RF +FK N+ H+ TN+
Sbjct: 18 RATNTFDFNEHDLDSEKSLWDLYERWRSH-HTVTRSLDEKHNRFNVFKANVMHVHNTNKL 76
Query: 86 IKNYWLGLNEFADLRHEEFKEMFLGLKPDLARR-KDQSHED--FSYKDVVDLPKSVDWRK 142
K Y L LN+FAD+ + EF+ ++ K R + S+E+ F Y++V ++P S+DWRK
Sbjct: 77 DKPYKLKLNKFADMTNYEFRRIYADSKVSHHRMFRGMSNENGTFMYENVKNVPSSIDWRK 136
Query: 143 KGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLM 202
KGAVT VK+QG CGSCWAFST+ AVEGINQI T L SLSEQEL+DCD N GCNGGLM
Sbjct: 137 KGAVTDVKDQGQCGSCWAFSTIVAVEGINQIKTQKLVSLSEQELVDCDTGGNEGCNGGLM 196
Query: 203 DYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN 262
+YAF++I G+ E +YPY ++GTC++ K + V+I+GY +VP N+E +LLKA A
Sbjct: 197 EYAFEFI-KQNGITTESNYPYAAKDGTCDLKKEDKAEVSIDGYENVPINNEAALLKAAAK 255
Query: 263 QPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRG-LDYIIVKNSWGPKWG 321
QP+SVAI+A G +FQFYS GV+ GHCGT L+HGVA VGYG T+ Y IVKNSWG +WG
Sbjct: 256 QPVSVAIDAGGYNFQFYSEGVFSGHCGTDLNHGVAVVGYGVTQDRTKYWIVKNSWGSEWG 315
Query: 322 EKGYIRMKRNTGKPEGLCGINKMASYPIKK 351
E+GYIRM+R EGLCGI ASYPIKK
Sbjct: 316 EQGYIRMQRGISHKEGLCGIAMEASYPIKK 345
>gi|110737959|dbj|BAF00916.1| cysteine proteinase [Arabidopsis thaliana]
Length = 376
Score = 337 bits (864), Expect = 6e-90, Method: Compositional matrix adjust.
Identities = 166/291 (57%), Positives = 209/291 (71%), Gaps = 7/291 (2%)
Query: 67 ERFEIFKDNLRHIDETNRKIKN--YWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHE 124
+RF IFKDNLR ID N KN Y LGL +F DL ++E+++++LG + + ARR ++
Sbjct: 72 KRFNIFKDNLRFIDLHNENNKNATYKLGLTKFTDLTNDEYRKLYLGARTEPARRIAKAKN 131
Query: 125 -DFSYKDVV---DLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLAS 180
+ Y V ++P++VDWR+KGAV +K+QG+CGSCWAFST AAVEGIN+IVTG L S
Sbjct: 132 VNQKYSAAVNGKEVPETVDWRQKGAVNPIKDQGTCGSCWAFSTTAAVEGINKIVTGELIS 191
Query: 181 LSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVV 240
LSEQEL+DCD +YN GCNGGLMDYAFQ+I+ GGL+ E+DYPY G C S VV
Sbjct: 192 LSEQELVDCDKSYNQGCNGGLMDYAFQFIMKNGGLNTEKDYPYRGFGGKCNSFLKNSRVV 251
Query: 241 TINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVG 300
+I+GY DVP E +L KA++ QP+ VAIEA GR FQ Y G++ G CGT LDH V AVG
Sbjct: 252 SIDGYEDVPTKDETALKKAISYQPVRVAIEAGGRIFQHYQSGIFTGSCGTNLDHAVVAVG 311
Query: 301 YGSTRGLDYIIVKNSWGPKWGEKGYIRMKRN-TGKPEGLCGINKMASYPIK 350
YGS G+DY IV+NSWGP+WGE+GYIRM+RN G CGI ASYP+K
Sbjct: 312 YGSENGVDYWIVRNSWGPRWGEEGYIRMERNLAASKSGKCGIAVEASYPVK 362
>gi|113120271|gb|ABI30275.1| VS-A [Vasconcellea stipulata]
Length = 318
Score = 337 bits (863), Expect = 6e-90, Method: Compositional matrix adjust.
Identities = 166/319 (52%), Positives = 228/319 (71%), Gaps = 9/319 (2%)
Query: 1 MALSSQF-KTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVY 59
MA S F K + ++ C+S + S+ FSIVGYSP+DLTS +KLI+LF+SWM +++KVY
Sbjct: 1 MATISSFSKLLFVAICLSVHMGLSYGA-FSIVGYSPDDLTSTEKLINLFDSWMVEYDKVY 59
Query: 60 ESLDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPD-LARR 118
+ +DEK+ RFEIFKDNL++IDETN+K YWLGL F DL ++EFKE ++G P+ +
Sbjct: 60 KDIDEKIYRFEIFKDNLKYIDETNKKNNTYWLGLTSFTDLTNDEFKEKYVGSIPENWSTT 119
Query: 119 KDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNL 178
++ + ++F Y DVV++P S+DWR+KGAVT V+NQGSCGSCW FS+VAAVEGIN+IVTG L
Sbjct: 120 EEPNDKEFIYDDVVNIPASIDWRQKGAVTPVRNQGSCGSCWTFSSVAAVEGINKIVTGQL 179
Query: 179 ASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESE 238
SLSEQEL+DC+ + GC GG YA QY V+ G+H + YPY + C + +
Sbjct: 180 VSLSEQELLDCERR-SYGCRGGFPPYALQY-VANSGIHLRQYYPYEGVQRQCRAAQAKGP 237
Query: 239 VVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAA 298
V +G V +N+E +L++ +A QP+S+ +EA GR FQ Y GG++ G CGT +DH VAA
Sbjct: 238 KVKTDGVGRVQRNNEQALIQRIAIQPVSIVVEAKGRAFQNYRGGIFAGPCGTSIDHAVAA 297
Query: 299 VGYGSTRGLDYIIVKNSWG 317
VGYG+ YI++KNSWG
Sbjct: 298 VGYGN----GYILIKNSWG 312
>gi|160858205|dbj|BAF93840.1| triticain beta 2 [Triticum aestivum]
Length = 469
Score = 336 bits (862), Expect = 8e-90, Method: Compositional matrix adjust.
Identities = 159/296 (53%), Positives = 210/296 (70%), Gaps = 5/296 (1%)
Query: 61 SLDEKLERFEIFKDNLRHIDETNRKI----KNYWLGLNEFADLRHEEFKEMFLGLKPDLA 116
S+ E+ RF F DNLR +D N + + + L +N FADL ++EF+ +LG+K A
Sbjct: 67 SIPERERRFRAFWDNLRFVDAHNARAAAGEEGFRLAMNRFADLTNDEFRAAYLGVKGQRA 126
Query: 117 RRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTG 176
R E + + +LP++VDWR+KGAV VKNQG CGSCWAFS ++ VE INQIVTG
Sbjct: 127 RPGRVVGERYRHDGAEELPEAVDWREKGAVAPVKNQGQCGSCWAFSAISTVESINQIVTG 186
Query: 177 NLASLSEQELIDCD-NTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKG 235
+ +LSEQEL++CD N ++GCNGGLMD AF++I+ GG+ E+DYPY +G C++ +
Sbjct: 187 EMVTLSEQELVECDTNGQSSGCNGGLMDDAFEFIIKNGGIDTEDDYPYKAIDGRCDVLRK 246
Query: 236 ESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHG 295
++VV+I+G+ DVP+N E SL KA+A+QP+SVAIEA GR+FQ Y GV+ G CGTQLDHG
Sbjct: 247 NAKVVSIDGFEDVPENDEKSLQKAVAHQPVSVAIEAGGREFQLYHSGVFSGRCGTQLDHG 306
Query: 296 VAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIKK 351
V AVGYG+ G DY IV+NSWGP WGE GY+RM+RN G CGI M+SYP KK
Sbjct: 307 VVAVGYGTENGKDYWIVRNSWGPNWGEAGYLRMERNINVTSGKCGIAMMSSYPTKK 362
>gi|359485281|ref|XP_002280230.2| PREDICTED: LOW QUALITY PROTEIN: KDEL-tailed cysteine endopeptidase CEP1 [Vitis vinifera]
Length = 341
Score = 336 bits (861), Expect = 1e-89, Method: Compositional matrix adjust.
Identities = 170/353 (48%), Positives = 232/353 (65%), Gaps = 19/353 (5%)
Query: 1 MALSSQFKTILIS--FCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKV 58
MA +Q++ I ++ F ++ + + AR+ + + E WM+++ +V
Sbjct: 1 MASVNQYQYICLALLFVLAAWASQATARNLH-----------EASMYERHEDWMAQYGRV 49
Query: 59 YESLDEKLERFEIFKDNLRHIDETNRKI-KNYWLGLNEFADLRHEEFKEMFLGLKPDLAR 117
Y+ DEK +R++IFKDN+ I+ N+ + K+Y L +NEFADL +EEF K +
Sbjct: 50 YKDADEKSKRYKIFKDNVARIESFNKAMDKSYKLSINEFADLTNEEFGTSRNRFKAHICS 109
Query: 118 RKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGN 177
+ S F Y++V +P ++DWRKKGAVT +K+QG CGSCWAFS VAA+EGI Q+ TG
Sbjct: 110 TEATS---FKYENVTAVPSTIDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGK 166
Query: 178 LASLSEQELIDCDNT-YNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGE 236
L SLSEQEL+DCD + + GCNGGLMD AF++I GL E +YPY +GTC K
Sbjct: 167 LISLSEQELVDCDTSGEDQGCNGGLMDDAFKFIKQNHGLTTEANYPYAGTDGTCNRKKAA 226
Query: 237 SEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGV 296
INGY DVP N+E +L KA+ +QP++VAI+A G +FQFYS GV+ G CGT+LDHGV
Sbjct: 227 HPAAKINGYEDVPANNEKALQKAVVHQPIAVAIDAGGFEFQFYSSGVFTGQCGTELDHGV 286
Query: 297 AAVGYGST-RGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
AAVGYG++ G+ Y +VKNSWG WGE+GYIRM+R+ EGLCGI ASYP
Sbjct: 287 AAVGYGTSDDGMKYWLVKNSWGTGWGEEGYIRMQRDVTAKEGLCGIAMQASYP 339
>gi|113120265|gb|ABI30272.1| VXH-A, partial [Vasconcellea x heilbornii]
Length = 318
Score = 336 bits (861), Expect = 1e-89, Method: Compositional matrix adjust.
Identities = 166/319 (52%), Positives = 227/319 (71%), Gaps = 9/319 (2%)
Query: 1 MALSSQF-KTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVY 59
MA F K + ++ C+S + S+ FSIVGYSP+DLTS +KLI+LF+SWM +++KVY
Sbjct: 1 MATIXSFSKLLFVAICLSVHMGLSYGA-FSIVGYSPDDLTSTEKLINLFDSWMVEYDKVY 59
Query: 60 ESLDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRK 119
+ +DEK+ RFEIFKDNL++IDETN+K YWLGL F DL ++EFKE ++G P+
Sbjct: 60 KDIDEKIYRFEIFKDNLKYIDETNKKNNTYWLGLTSFTDLTNDEFKEKYVGSIPENWSTT 119
Query: 120 DQSHE-DFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNL 178
++S++ +F Y DVV++P S+DWR+KGAVT V+NQGSCGSCW FS+VAAVEGIN+IVTG L
Sbjct: 120 EESNDKEFIYDDVVNIPASIDWRQKGAVTPVRNQGSCGSCWTFSSVAAVEGINKIVTGQL 179
Query: 179 ASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESE 238
SLSEQEL+DC+ + GC GG YA QY V+ G+H + YPY + C + +
Sbjct: 180 VSLSEQELLDCERR-SYGCRGGFPPYALQY-VANSGIHLRQYYPYEGVQRQCRAAQAKGP 237
Query: 239 VVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAA 298
V +G V +N+E +L++ +A QP+S+ +EA GR FQ Y GG++ G CGT +DH VAA
Sbjct: 238 KVKTDGVGRVQRNNEQALIQRIAIQPVSIVVEAKGRAFQNYRGGIFAGPCGTSIDHAVAA 297
Query: 299 VGYGSTRGLDYIIVKNSWG 317
VGYG+ YI++KNSWG
Sbjct: 298 VGYGN----GYILIKNSWG 312
>gi|1174171|gb|AAB41816.1| NTH1 [Pisum sativum]
Length = 367
Score = 335 bits (858), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 170/347 (48%), Positives = 229/347 (65%), Gaps = 12/347 (3%)
Query: 7 FKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKL 66
+IL S + I S + D S SN +++ ++E W+ K +KVY L EK
Sbjct: 1 MASILYSLILFGLITLSLSLDMS-------SGRSNKEVMTMYEKWLVKHQKVYYGLGEKN 53
Query: 67 ERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDF 126
+RF+IFKDNL IDE N +Y +GLNEF+D+ ++E+++ +L + + + +
Sbjct: 54 QRFQIFKDNLIFIDEHNAPNHSYRVGLNEFSDITNKEYRDTYLSRWSNNNIKNKITSVRY 113
Query: 127 SYKDVVD--LPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQ 184
+YK + LP SVDWR GA+T +KNQGSCG+CWAFS VAAVE IN+IVTG+L SLSEQ
Sbjct: 114 AYKAGHNNKLPVSVDWR--GALTPIKNQGSCGACWAFSAVAAVEAINKIVTGSLVSLSEQ 171
Query: 185 ELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTING 244
EL+DCD T N GCNGG A+++IV GGL + DYPY+ + TC K ++VV+ING
Sbjct: 172 ELVDCDRTKNKGCNGGNQVNAYRFIVENGGLDSQIDYPYLGRQSTCNQAKKNTKVVSING 231
Query: 245 YHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGST 304
Y +V +NSE +L++A+ANQP+SV IEA G+DFQ Y GV+ G CGT LDH V VGYGS
Sbjct: 232 YKNVQRNSESALMEAVANQPVSVGIEAYGKDFQLYQSGVFTGSCGTSLDHAVVVVGYGSE 291
Query: 305 RGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPE-GLCGINKMASYPIK 350
G DY +VKNSWG WGE+GY++++RN G CGI A+YP K
Sbjct: 292 NGKDYWLVKNSWGTNWGERGYLKIERNLKNTNTGKCGIAMDATYPTK 338
>gi|146215980|gb|ABQ10192.1| actinidin Act2a [Actinidia deliciosa]
Length = 378
Score = 335 bits (858), Expect = 3e-89, Method: Compositional matrix adjust.
Identities = 168/313 (53%), Positives = 219/313 (69%), Gaps = 6/313 (1%)
Query: 40 SNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKI-KNYWLGLNEFAD 98
+ND+++ ++ESW+ + K Y SLDEK RFEIFK+NLR ID+ N ++Y LGLN FAD
Sbjct: 34 TNDQVMAMYESWLVEHGKSYNSLDEKEMRFEIFKENLRIIDDHNADANRSYSLGLNRFAD 93
Query: 99 LRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSC 158
L EE++ +LGLK + D S++ + K LP VDWR GAV VKNQG C SC
Sbjct: 94 LTDEEYRSTYLGLK--RGPKTDVSNQ-YMPKVGDALPDYVDWRTVGAVVGVKNQGLCSSC 150
Query: 159 WAFSTVAAVEGINQIVTGNLASLSEQELIDCDNT-YNNGCNGGLMDYAFQYIVSTGGLHK 217
WAFS VAAVEGIN+IVTGNL SLSEQEL+DC T GCN GLM AF++I++ GG++
Sbjct: 151 WAFSAVAAVEGINKIVTGNLISLSEQELVDCGRTQITKGCNRGLMTDAFKFIINNGGINT 210
Query: 218 EEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQ 277
E +YPY ++G C ++ + VTI+ Y +VP N+E +L KA+A QP+SV +E+ G F+
Sbjct: 211 ENNYPYTAKDGQCNLSLKNQKYVTIDSYKNVPSNNEMALKKAVAYQPVSVGVESEGGKFK 270
Query: 278 FYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEG 337
Y+ G++ G CGT +DHGV VGYG+ RG+DY IVKNSWG WGE GYIR++RN G G
Sbjct: 271 LYTSGIFTGSCGTAVDHGVTIVGYGTERGMDYWIVKNSWGTNWGESGYIRIQRNIGG-AG 329
Query: 338 LCGINKMASYPIK 350
CGI KM SYP+K
Sbjct: 330 KCGIAKMPSYPVK 342
>gi|218183|dbj|BAA14403.1| oryzain beta precursor [Oryza sativa Japonica Group]
Length = 471
Score = 334 bits (857), Expect = 3e-89, Method: Compositional matrix adjust.
Identities = 162/291 (55%), Positives = 206/291 (70%), Gaps = 6/291 (2%)
Query: 64 EKLERFEIFKDNLRHIDETNRKIK---NYWLGLNEFADLRHEEFKEMFLGLKPDLARRKD 120
E RF +F DNL+ +D N + + LG+N FADL +EEF+ FLG K +A R
Sbjct: 69 EHERRFLVFWDNLKFVDAHNARADEGGGFRLGMNRFADLTNEEFRATFLGAK--VAERSR 126
Query: 121 QSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLAS 180
+ E + + V +LP+SVDWR+KGAV VKNQG CGSCWAFS V+ VE INQ+VTG + +
Sbjct: 127 AAGERYRHDGVEELPESVDWREKGAVAPVKNQGQCGSCWAFSAVSTVESINQLVTGEMIT 186
Query: 181 LSEQELIDCD-NTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEV 239
LSEQEL++C N N+GCNGGLM AF +I+ GG+ E+DYPY +G C++ + ++V
Sbjct: 187 LSEQELVECSTNGQNSGCNGGLMADAFDFIIKNGGIDTEDDYPYKAVDGKCDINRENAKV 246
Query: 240 VTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAV 299
V+I+G+ DVPQN E SL KA+A+QP+SVAIEA GR+FQ Y GV+ G CGT LDHGV AV
Sbjct: 247 VSIDGFEDVPQNDEKSLQKAVAHQPVSVAIEAGGREFQLYHSGVFSGRCGTSLDHGVVAV 306
Query: 300 GYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
GYG+ G DY IV+NSWGPKWGE GY+RM+RN G CGI MASYP K
Sbjct: 307 GYGTDNGKDYWIVRNSWGPKWGESGYVRMERNINVTTGKCGIAMMASYPTK 357
>gi|225446581|ref|XP_002280246.1| PREDICTED: vignain [Vitis vinifera]
Length = 341
Score = 334 bits (857), Expect = 3e-89, Method: Compositional matrix adjust.
Identities = 163/308 (52%), Positives = 215/308 (69%), Gaps = 6/308 (1%)
Query: 44 LIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKI-KNYWLGLNEFADLRHE 102
+ + E WM ++ + Y+ DEK +R++IFKDN+ I+ N+ + K+Y L +NEFADL +E
Sbjct: 35 MYERHEDWMVQYGREYKDADEKSKRYKIFKDNVARIESFNKAMDKSYKLSINEFADLTNE 94
Query: 103 EFKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFS 162
EF+ K + + S F Y++V +P +VDWRKKGAVT +K+QG CGSCWAFS
Sbjct: 95 EFRASRNRFKAHICSTEATS---FKYENVTAVPSTVDWRKKGAVTPIKDQGQCGSCWAFS 151
Query: 163 TVAAVEGINQIVTGNLASLSEQELIDCDNT-YNNGCNGGLMDYAFQYIVSTGGLHKEEDY 221
VAA+EGI Q+ TG L SLSEQEL+DCD + + GC+GGLMD AF++I GL E +Y
Sbjct: 152 AVAAMEGITQLSTGKLISLSEQELVDCDTSGEDQGCSGGLMDDAFKFIEQNHGLTTEANY 211
Query: 222 PYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSG 281
PY +GTC K INGY DVP N+E +L KA+A+QP++VAI+ASG +FQFYS
Sbjct: 212 PYAGTDGTCNRKKAAHPAAKINGYEDVPANNEKALQKAVAHQPIAVAIDASGSEFQFYSS 271
Query: 282 GVYDGHCGTQLDHGVAAVGYGST-RGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCG 340
GV+ G CGT+LDHGVAAVGYG++ G+ Y +VKNSW WGE+GYIRM+R+ EGLCG
Sbjct: 272 GVFTGQCGTELDHGVAAVGYGTSDDGMKYWLVKNSWSTGWGEEGYIRMQRDVTAKEGLCG 331
Query: 341 INKMASYP 348
I ASYP
Sbjct: 332 IAMQASYP 339
>gi|224135841|ref|XP_002327317.1| predicted protein [Populus trichocarpa]
gi|222835687|gb|EEE74122.1| predicted protein [Populus trichocarpa]
Length = 342
Score = 334 bits (857), Expect = 4e-89, Method: Compositional matrix adjust.
Identities = 167/303 (55%), Positives = 205/303 (67%), Gaps = 4/303 (1%)
Query: 49 ESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRK-IKNYWLGLNEFADLRHEEFKEM 107
E WM F KVY EK RFEIFKDN+ +I+ N K Y L +N+FADL +EE K
Sbjct: 39 EQWMETFGKVYADAAEKERRFEIFKDNVEYIESFNTAGNKPYKLSVNKFADLTNEELKVA 98
Query: 108 FLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAV 167
G + L R F Y++V +P ++DWRKKGAVT +K+QG CGSCWAFSTVAA
Sbjct: 99 RNGYRRPLQTRP-MKVTSFKYENVTAVPATMDWRKKGAVTPIKDQGQCGSCWAFSTVAAT 157
Query: 168 EGINQIVTGNLASLSEQELIDCDNT-YNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIME 226
EGINQ+ TG L SLSEQEL+DCD + GC GGLM+ F++I+ G+ E +YPY
Sbjct: 158 EGINQLTTGKLVSLSEQELVDCDTQGEDQGCEGGLMEDGFEFIIKNHGITTEANYPYQAA 217
Query: 227 EGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDG 286
+GTC K S + I GY VP NSE +LLKA+A+QP+SV+I+A G DFQFYS GV+ G
Sbjct: 218 DGTCNSKKEASRIAKITGYESVPANSEAALLKAVASQPISVSIDAGGSDFQFYSSGVFTG 277
Query: 287 HCGTQLDHGVAAVGYGSTR-GLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMA 345
CGT+LDHGV AVGYG T G Y +VKNSWG WGE+GYIRM+R+T EGLCGI +
Sbjct: 278 QCGTELDHGVTAVGYGETSDGTKYWLVKNSWGTSWGEEGYIRMQRDTEAEEGLCGIAMDS 337
Query: 346 SYP 348
SYP
Sbjct: 338 SYP 340
>gi|449469929|ref|XP_004152671.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
gi|449529596|ref|XP_004171784.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
Length = 431
Score = 334 bits (857), Expect = 4e-89, Method: Compositional matrix adjust.
Identities = 163/310 (52%), Positives = 208/310 (67%), Gaps = 3/310 (0%)
Query: 39 TSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDE-TNRKIKNYWLGLNEFA 97
++ + +LFE W ++ K Y S +EKL R +F DN + N +Y L LN +A
Sbjct: 20 SATSNVSELFEIWCTEHGKSYSSAEEKLYRLGVFADNYEFVTHHNNLDNSSYTLSLNSYA 79
Query: 98 DLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGS 157
DL H EFK LG P L + ++ S D+P S+DWRKKGAVT VK+QGSCG+
Sbjct: 80 DLTHHEFKVSRLGFSPALRNFRPVLPQEPSLPR--DVPDSLDWRKKGAVTAVKDQGSCGA 137
Query: 158 CWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHK 217
CW+FS A+EGINQI+TG+L SLSEQELIDCD +YN+GC GGLMDYA+Q+++S G+
Sbjct: 138 CWSFSATGAMEGINQIMTGSLISLSEQELIDCDRSYNSGCGGGLMDYAYQFVISNHGIDT 197
Query: 218 EEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQ 277
E DYPY +G+C K + VVTI+GY D+P N E LL+A+A QP+SV I S R FQ
Sbjct: 198 ENDYPYQARDGSCRKDKLQRNVVTIDGYADIPSNDEGKLLQAVAAQPVSVGICGSERAFQ 257
Query: 278 FYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEG 337
YS G++ G C T LDH V VGYGS G+DY IVKNSWG WG GY+ M+RN+G EG
Sbjct: 258 LYSKGIFSGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKSWGMDGYMHMQRNSGNSEG 317
Query: 338 LCGINKMASY 347
+CGINK+ASY
Sbjct: 318 VCGINKLASY 327
>gi|47524507|gb|AAT34987.1| putative cysteine protease [Gossypium hirsutum]
Length = 344
Score = 334 bits (856), Expect = 4e-89, Method: Compositional matrix adjust.
Identities = 180/348 (51%), Positives = 224/348 (64%), Gaps = 20/348 (5%)
Query: 7 FKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVY--ESLDE 64
F +++SFC S + G S L D + E WMS+ +VY E D
Sbjct: 9 FVALVLSFCFSI----------QLAGLS-RPLLDEDSM--RHEEWMSQHGRVYADEQEDH 55
Query: 65 KLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLK-PDLARRKDQSH 123
K +RF +FK+N+ I+E N K + L +N+FADL +EEF+ + G K P + +
Sbjct: 56 KNKRFNVFKENVERIEEFNDG-KTFKLAINQFADLTNEEFRASYNGFKGPMVLSSQITKP 114
Query: 124 EDFSYKDVVD-LPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLS 182
F Y++V LP SVDWRKKGAVT VKNQG CG CWAFS VAA+EGI QI TG L SLS
Sbjct: 115 TPFRYENVSSALPVSVDWRKKGAVTPVKNQGQCGCCWAFSAVAAIEGITQISTGKLISLS 174
Query: 183 EQELIDCDNT-YNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVT 241
EQEL+DCD ++GC GGLMD AF++I++ GGL E +YPY E+GTC K V+
Sbjct: 175 EQELVDCDTKGIDHGCEGGLMDTAFEFIINNGGLTTESNYPYKGEDGTCNFNKTNPIAVS 234
Query: 242 INGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGY 301
I GY DVP N E +L+KA+A+QP+SVAIEA G DFQFYS GV+ G CGT+LDH V AVGY
Sbjct: 235 ITGYEDVPANDEQALMKAVAHQPVSVAIEAGGSDFQFYSSGVFTGECGTELDHAVTAVGY 294
Query: 302 G-STRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
G S G Y IVKNSWG KWGE GYI M+++ +GLCGI ASYP
Sbjct: 295 GESEDGSKYWIVKNSWGTKWGESGYIEMQKDIKVKQGLCGIAMQASYP 342
>gi|225446583|ref|XP_002280204.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1 [Vitis vinifera]
Length = 341
Score = 334 bits (856), Expect = 5e-89, Method: Compositional matrix adjust.
Identities = 169/353 (47%), Positives = 232/353 (65%), Gaps = 19/353 (5%)
Query: 1 MALSSQFKTILIS--FCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKV 58
MA +Q++ I ++ F ++ + + AR+ + + E WM ++ +
Sbjct: 1 MASVNQYQYICLALLFVLAAWASQATARNLH-----------EASMYERHEDWMVQYGRE 49
Query: 59 YESLDEKLERFEIFKDNLRHIDETNRKI-KNYWLGLNEFADLRHEEFKEMFLGLKPDLAR 117
Y+ DEK +R++IFKDN+ I+ N+ + K+Y L +NEFADL +EEF+ K +
Sbjct: 50 YKDADEKSKRYKIFKDNVARIESFNKAMDKSYKLSINEFADLTNEEFRASRNRFKAHICS 109
Query: 118 RKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGN 177
+ S F Y++V +P +VDWRKKGAVT +K+QG CGSCWAFS VAA+EGI Q+ TG
Sbjct: 110 TEATS---FKYENVTAVPSTVDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGK 166
Query: 178 LASLSEQELIDCDNT-YNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGE 236
L SLSEQEL+DCD + + GC+GGLMD AF++I GL E +YPY +GTC K
Sbjct: 167 LISLSEQELVDCDTSGEDQGCSGGLMDDAFKFIEQNHGLTTEANYPYAGTDGTCNRKKAA 226
Query: 237 SEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGV 296
INGY DVP N+E +L KA+A+QP++VAI+A G +FQFYS GV+ G CGT+LDHGV
Sbjct: 227 HPAAKINGYEDVPANNEKALQKAVAHQPIAVAIDAGGSEFQFYSSGVFTGQCGTELDHGV 286
Query: 297 AAVGYGST-RGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
+AVGYG++ G+ Y +VKNSWG WGE+GYIRM+R+ EGLCGI ASYP
Sbjct: 287 SAVGYGTSDDGMKYWLVKNSWGTGWGEEGYIRMQRDVTAKEGLCGIAMQASYP 339
>gi|414879123|tpg|DAA56254.1| TPA: hypothetical protein ZEAMMB73_708930 [Zea mays]
Length = 368
Score = 334 bits (856), Expect = 5e-89, Method: Compositional matrix adjust.
Identities = 179/324 (55%), Positives = 214/324 (66%), Gaps = 9/324 (2%)
Query: 33 YSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETN-RKIKNYWL 91
+ DL S++ L DL+E W + +V+ EK RF FK+N R I N R + Y L
Sbjct: 27 FDERDLASDEALWDLYERWQT-HHRVHRHHGEKGRRFGTFKENARFIHAHNKRGDRPYRL 85
Query: 92 GLNEFADLRHEEFKEMFLGLKPDLARRKDQSHE---DFSYKDVVDLPKSVDWRKKGAVTH 148
LN F D+ EEF+ F + + RR+ + F Y D DLP+SVDWR+KGAVT
Sbjct: 86 RLNRFGDMGREEFRSGFADSRINDLRREPTAAPAVPGFMYDDATDLPRSVDWRQKGAVTA 145
Query: 149 VKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQY 208
VKNQG CGSCWAFSTV AVEGIN I TG+L SLSEQELIDCD T NGC GGLM+ AF++
Sbjct: 146 VKNQGRCGSCWAFSTVVAVEGINAIRTGSLVSLSEQELIDCD-TDENGCQGGLMENAFEF 204
Query: 209 IVSTGGLHKEEDYPYIMEEGTCEMTKG-ESEVVTINGYHDVPQNSEDSLLKALANQPLSV 267
I S GG+ E YPY GTC+ + VV I+G+ VP SED+L KA+A+QP+SV
Sbjct: 205 IKSHGGITTESAYPYHASNGTCDGARARRGRVVAIDGHQAVPAGSEDALAKAVAHQPVSV 264
Query: 268 AIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYG-STRGLDYIIVKNSWGPKWGEKGYI 326
AI+A G+ QFYS GV+ G CGT LDHGVAAVGYG S G Y IVKNSWGP WGE GYI
Sbjct: 265 AIDAGGQALQFYSEGVFTGDCGTDLDHGVAAVGYGVSDDGTPYWIVKNSWGPSWGEGGYI 324
Query: 327 RMKRNTGKPEGLCGINKMASYPIK 350
RM+R TG GLCGI AS+PIK
Sbjct: 325 RMQRGTGN-GGLCGIAMEASFPIK 347
>gi|449447027|ref|XP_004141271.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
Length = 458
Score = 333 bits (855), Expect = 6e-89, Method: Compositional matrix adjust.
Identities = 175/344 (50%), Positives = 228/344 (66%), Gaps = 13/344 (3%)
Query: 11 LISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLE-RF 69
+++ FI S A SI+ P+ ++D+++ L++ W +K K++ +L + E RF
Sbjct: 9 IMALLFFLFIALSAASPSSII---PQ--RTDDEVMALYDQWRAKHGKLHNNLGAEPENRF 63
Query: 70 EIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYK 129
IFKDNL+ IDE N + Y LGLN FADL +EE++ +LG K R++++ + +
Sbjct: 64 HIFKDNLKFIDEINAQNLPYRLGLNVFADLTNEEYRSRYLGGKFASGSRRNRTSNRYLPR 123
Query: 130 DVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDC 189
DLP S+DWR KGAV VK+QGSCGSCWAFSTVA+VE INQIVTG+L +LSEQEL+DC
Sbjct: 124 LGDDLPDSIDWRAKGAVAPVKDQGSCGSCWAFSTVASVEAINQIVTGDLIALSEQELVDC 183
Query: 190 DNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVP 249
D +YN GCNGGLMDYAF++I+ GGL EEDYPY + +C K + I+GY DVP
Sbjct: 184 DRSYNEGCNGGLMDYAFEFIIENGGLDTEEDYPYYGFDSSCIQYKKNA----IDGYEDVP 239
Query: 250 QNSEDSLLKA---LANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRG 306
N+E +L KA +SVAIE GR FQ Y G++ G CGT LDHGV VGYGS G
Sbjct: 240 VNNEKALQKAVSKQVVSVVSVAIEGGGRSFQLYQSGIFTGRCGTDLDHGVNVVGYGSEGG 299
Query: 307 LDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
+DY IV+NSWG WGE GY++M+RN P GLCGI SYP K
Sbjct: 300 VDYWIVRNSWGGSWGESGYVKMQRNIASPTGLCGIAMEPSYPTK 343
>gi|255564910|ref|XP_002523448.1| cysteine protease, putative [Ricinus communis]
gi|223537276|gb|EEF38907.1| cysteine protease, putative [Ricinus communis]
Length = 341
Score = 333 bits (855), Expect = 6e-89, Method: Compositional matrix adjust.
Identities = 173/351 (49%), Positives = 228/351 (64%), Gaps = 15/351 (4%)
Query: 1 MALSSQFKTILISF-CISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVY 59
MA S+ K + ++ + ++ +++R S D N++ E WM K+ +VY
Sbjct: 1 MATISERKLMFVALLVVGLWVSQAWSR-------SLHDAAMNER----HEMWMVKYGRVY 49
Query: 60 ESLDEKLERFEIFKDNLRHIDETNRK-IKNYWLGLNEFADLRHEEFKEMFLGLKPDLARR 118
+ EK RFEIF++N+ I+ N+ + Y L +NEFADL +EEFK G K +
Sbjct: 50 KDNSEKERRFEIFRNNVEFIESFNKPGNRPYKLDINEFADLTNEEFKASRNGYKRS-SNV 108
Query: 119 KDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNL 178
F Y +V +P S+DWR+KGAVT +K+QG CG CWAFS VAA+EGI ++ TG L
Sbjct: 109 GLSEKSSFRYGNVTAVPTSMDWRQKGAVTPIKDQGQCGCCWAFSAVAAMEGITKLSTGKL 168
Query: 179 ASLSEQELIDCDNT-YNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGES 237
SLSEQEL+DCD + + GC GGLMD AF++I GGL E +YPY +GTC K +
Sbjct: 169 ISLSEQELVDCDTSGEDQGCEGGLMDDAFEFIKQNGGLTTEANYPYQGTDGTCNTNKAGN 228
Query: 238 EVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVA 297
+ I GY DVP NSED+LLKA+A+QP+SVAI+ASG FQFYSGGV+ G CGT+LDHGV
Sbjct: 229 DAAKITGYEDVPANSEDALLKAVASQPVSVAIDASGSAFQFYSGGVFTGDCGTELDHGVT 288
Query: 298 AVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
AVGYG++ G Y +VKNSWG WGE GYIRM+R+ EGLCGI +SYP
Sbjct: 289 AVGYGTSDGTKYWLVKNSWGTSWGEDGYIRMERDIEAKEGLCGIAMQSSYP 339
>gi|30141023|dbj|BAC75925.1| cysteine protease-3 [Helianthus annuus]
Length = 348
Score = 333 bits (854), Expect = 7e-89, Method: Compositional matrix adjust.
Identities = 169/348 (48%), Positives = 230/348 (66%), Gaps = 15/348 (4%)
Query: 8 KTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLE 67
K ++S ++ FI DF+ +DL ++ L DL+E W S+ V + DEK +
Sbjct: 5 KVFVLSISLALFIGVVNCIDFT-----EKDLATDKSLWDLYERWGSQ-HMVSRAPDEKKK 58
Query: 68 RFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMF----LGLKPDLARRKDQSH 123
RF +FK N+ HI+ N+ K Y L LNEFAD+ + EFK F L + +R+
Sbjct: 59 RFNVFKYNVNHINRVNQLGKPYKLKLNEFADMTNHEFKAGFDSKILHFRMLKGKRR---Q 115
Query: 124 EDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSE 183
F++ D P S+DWR GAV +KNQG CGSCWAFST+ VEGIN+I T L SLSE
Sbjct: 116 TPFTHAKTTDPPPSIDWRTNGAVNPIKNQGRCGSCWAFSTIVGVEGINKIKTNQLVSLSE 175
Query: 184 QELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTIN 243
QEL+DC+ T GCNGGLM+ +++I TGG+ E+ YPY G C+++K S VV I+
Sbjct: 176 QELVDCE-TDCEGCNGGLMENGYEFIKETGGVTTEQIYPYFARNGRCDISKRNSPVVKID 234
Query: 244 GYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGS 303
G+ +VP N E ++L+A+ANQP+S+AI+A G +FQFYS GV++G CGT+L+HGVA VGYG+
Sbjct: 235 GFENVPANDESAMLRAVANQPVSIAIDAGGLNFQFYSQGVFNGACGTELNHGVAIVGYGT 294
Query: 304 TR-GLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
T+ G +Y IV+NSWG WGE+GY+RM+R PEGLCG+ ASYPIK
Sbjct: 295 TQDGTNYWIVRNSWGTGWGEQGYVRMQRGVNVPEGLCGLAMDASYPIK 342
>gi|147839728|emb|CAN70559.1| hypothetical protein VITISV_032465 [Vitis vinifera]
Length = 341
Score = 333 bits (854), Expect = 7e-89, Method: Compositional matrix adjust.
Identities = 163/308 (52%), Positives = 215/308 (69%), Gaps = 6/308 (1%)
Query: 44 LIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKI-KNYWLGLNEFADLRHE 102
+ + E WM ++ + Y+ DEK +R++IFKDN+ I+ N+ + K+Y L +NEFADL +E
Sbjct: 35 MYERHEDWMVQYGREYKDADEKSKRYKIFKDNVARIESFNKAMDKSYKLSINEFADLTNE 94
Query: 103 EFKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFS 162
EF+ K + + S F Y++V +P +VDWRKKGAVT +K+QG CGSCWAFS
Sbjct: 95 EFRASRNRFKAHICSTEATS---FKYENVTAVPSTVDWRKKGAVTPIKDQGQCGSCWAFS 151
Query: 163 TVAAVEGINQIVTGNLASLSEQELIDCDNT-YNNGCNGGLMDYAFQYIVSTGGLHKEEDY 221
VAA+EGI Q+ TG L SLSEQEL+DCD + + GC+GGLMD AF++I GL E +Y
Sbjct: 152 AVAAMEGITQLSTGKLISLSEQELVDCDTSGEDQGCSGGLMDDAFKFIEQNHGLTTEANY 211
Query: 222 PYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSG 281
PY +GTC K INGY DVP N+E +L KA+A+QP++VAI+ASG +FQFYS
Sbjct: 212 PYAGTDGTCNRKKAAHPAAKINGYEDVPANNEKALQKAVAHQPIAVAIDASGSEFQFYSS 271
Query: 282 GVYDGHCGTQLDHGVAAVGYGST-RGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCG 340
GV+ G CGT+LDHGVAAVGYG++ G+ Y +VKNSW WGE+GYIRM+R+ EGLCG
Sbjct: 272 GVFTGQCGTELDHGVAAVGYGTSDDGMKYWLVKNSWSTGWGEEGYIRMQRDVTVKEGLCG 331
Query: 341 INKMASYP 348
I ASYP
Sbjct: 332 IAMQASYP 339
>gi|224081320|ref|XP_002306369.1| predicted protein [Populus trichocarpa]
gi|222855818|gb|EEE93365.1| predicted protein [Populus trichocarpa]
Length = 340
Score = 333 bits (853), Expect = 9e-89, Method: Compositional matrix adjust.
Identities = 169/346 (48%), Positives = 222/346 (64%), Gaps = 16/346 (4%)
Query: 5 SQFKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDE 64
SQF + + F + + S AR V + + E WM+++ +VY+ E
Sbjct: 7 SQFICLALLFVLGAWPSKSAARTLQDV-----------SMYERHEQWMAQYGRVYKDDAE 55
Query: 65 KLERFEIFKDNLRHIDETNRKI-KNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSH 123
K R+ IFK+N+ ID N + K+Y LG+N+FADL +EEFK K + +
Sbjct: 56 KETRYNIFKENVARIDAFNSQTGKSYKLGVNQFADLSNEEFKASRNRFKGHMCSPQ---A 112
Query: 124 EDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSE 183
F Y++V +P ++DWRKKGAVT VK+QG CG CWAFS VAA+EGINQ+ TG L SLSE
Sbjct: 113 GPFRYENVSAVPATMDWRKKGAVTPVKDQGQCGCCWAFSAVAAMEGINQLTTGKLISLSE 172
Query: 184 QELIDCDNT-YNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTI 242
QE++DCD + GCNGGLMD AF++I GL E +YPY +GTC K + I
Sbjct: 173 QEVVDCDTKGEDQGCNGGLMDDAFKFIEQNKGLTTEANYPYTGTDGTCNTQKEATHAAKI 232
Query: 243 NGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYG 302
G+ DVP NSE +L+KA+A QP+SVAI+A G +FQFYS G++ G CGTQLDHGV AVGYG
Sbjct: 233 TGFEDVPANSEAALMKAVAKQPVSVAIDAGGFEFQFYSSGIFTGSCGTQLDHGVTAVGYG 292
Query: 303 STRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
+ G Y +VKNSWG +WGE+GYIRM+++ EGLCGI ASYP
Sbjct: 293 ISDGTKYWLVKNSWGAQWGEEGYIRMQKDISAKEGLCGIAMQASYP 338
>gi|409190991|gb|AFV30165.1| cysteine proteinase [Lotus japonicus]
Length = 342
Score = 333 bits (853), Expect = 9e-89, Method: Compositional matrix adjust.
Identities = 169/351 (48%), Positives = 231/351 (65%), Gaps = 14/351 (3%)
Query: 1 MALSSQFKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYE 60
MA +QF + + + + + F + + +D + ++ E WM+++ +VY+
Sbjct: 1 MATKNQFYQVSFALVLCLGLWA-----FQVSSRTLQDASMQER----HEQWMARYGRVYK 51
Query: 61 SLDEKLERFEIFKDNLRHIDETNRK-IKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRK 119
L EK +RF IFK+N+ +I+ +N K Y LG+N+FADL +EEF K ++
Sbjct: 52 DLQEKEKRFSIFKENVNYIEASNNAGDKPYKLGVNQFADLTNEEFIATRNKFKGHMSSSI 111
Query: 120 DQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLA 179
++ F Y++V P +VDWR++GAVT VKNQG+CG CWAFS VAA EGI+++ TGNL
Sbjct: 112 TRT-TTFKYENVT-APSTVDWRQEGAVTPVKNQGTCGCCWAFSAVAATEGIHKLSTGNLV 169
Query: 180 SLSEQELIDCDNT-YNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESE 238
SLSEQEL+DCD + + GC GGLMD AF++I+ GGL+ E YPY +GTC + +
Sbjct: 170 SLSEQELVDCDTSGADQGCQGGLMDDAFKFIIQNGGLNTEAQYPYQGVDGTCNTNEEATH 229
Query: 239 VVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAA 298
V TI GY DVP N+E +L +A+ANQP+S+AI+ASG DFQ Y GV+ G CGTQLDHGVA
Sbjct: 230 VATITGYEDVPSNNEQALQQAVANQPISIAIDASGSDFQNYQSGVFTGSCGTQLDHGVAV 289
Query: 299 VGYG-STRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
VGYG S G Y +VKNSWG WGE+GYIRM+R+ PEGLCG+ SYP
Sbjct: 290 VGYGVSDDGTKYWLVKNSWGADWGEEGYIRMQRDVDAPEGLCGLAMQPSYP 340
>gi|52546920|gb|AAU81593.1| cysteine proteinase [Petunia x hybrida]
Length = 210
Score = 333 bits (853), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 157/205 (76%), Positives = 177/205 (86%), Gaps = 2/205 (0%)
Query: 57 KVYESLDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLK--PD 114
K+YES++EKL RFEIFK+NL+HIDE N+ + NYWLGLNEF+DL H+EFK+M+LGLK D
Sbjct: 6 KIYESIEEKLHRFEIFKENLKHIDERNKIVSNYWLGLNEFSDLSHDEFKKMYLGLKVDHD 65
Query: 115 LARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIV 174
L K QS +DF Y+D VDLPKSVDWRKKGAVT VKNQG CGSCWAFSTVAAVEGINQI
Sbjct: 66 LLNNKKQSQQDFEYRDFVDLPKSVDWRKKGAVTPVKNQGQCGSCWAFSTVAAVEGINQIK 125
Query: 175 TGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTK 234
TGNL SLSEQELIDCD TYNNGCNGGLMDYAFQ+I+S GGLHKE+DYPY+MEEGTC+ +
Sbjct: 126 TGNLTSLSEQELIDCDTTYNNGCNGGLMDYAFQFIISNGGLHKEDDYPYLMEEGTCDEKR 185
Query: 235 GESEVVTINGYHDVPQNSEDSLLKA 259
ESEVVTI+GY DVP N E SLLKA
Sbjct: 186 DESEVVTIDGYRDVPANDEQSLLKA 210
>gi|600111|emb|CAA84378.1| cysteine proteinase [Vicia sativa]
Length = 359
Score = 333 bits (853), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 179/341 (52%), Positives = 232/341 (68%), Gaps = 10/341 (2%)
Query: 18 FFIRSSFARDFSIVG---YSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKD 74
FI S A F++ ++ DL S L +L+E W S V +LDEK RF +FK
Sbjct: 7 LFISLSLALIFTVANTFDFNEHDLESEKSLWNLYERWRSH-HTVTRNLDEKHNRFNVFKA 65
Query: 75 NLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARR-KDQSHED--FSYKDV 131
N+ H+ TN+ K Y L LN+F D+ + EF+ ++ K R + SHE+ F Y++
Sbjct: 66 NVMHVHNTNKLDKPYKLKLNKFGDMTNYEFRRIYADSKISHHRMFRGMSHENGTFMYENA 125
Query: 132 VDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDN 191
VD+P S+DWR KGAVT VK+QG CGSCWAFST+AAVEGINQI T L SLSEQ+L+DCD
Sbjct: 126 VDVPSSIDWRNKGAVTGVKDQGQCGSCWAFSTIAAVEGINQIKTQKLVSLSEQQLVDCDT 185
Query: 192 TYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQN 251
N GCNGGLM+YAF++I G+ E +YPY ++GTC++ K E + V+I+G+ +VP N
Sbjct: 186 EENEGCNGGLMEYAFEFI-KQNGITTESNYPYAAKDGTCDVEK-EDKAVSIDGHENVPIN 243
Query: 252 SEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRG-LDYI 310
+E +LLKA A QP+SVAI+A G +FQFYS GV+ GHC T L+HGVA VGYG T+ Y
Sbjct: 244 NEAALLKAAAKQPVSVAIDAGGYNFQFYSEGVFTGHCDTDLNHGVAIVGYGVTQDRTKYW 303
Query: 311 IVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIKK 351
I+KNSWG +WGE+GYIRM+R EGLCGI ASYPIKK
Sbjct: 304 IMKNSWGSEWGEQGYIRMQRGISSREGLCGIAMEASYPIKK 344
>gi|147788834|emb|CAN64655.1| hypothetical protein VITISV_005140 [Vitis vinifera]
Length = 341
Score = 332 bits (852), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 168/351 (47%), Positives = 231/351 (65%), Gaps = 15/351 (4%)
Query: 1 MALSSQFKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYE 60
MA +Q++ I ++ +S A+ ++ + + E WM+++ +VY+
Sbjct: 1 MASVNQYRYICLALLFVLAAWASHAKARNL---------HEASMYERHEDWMAQYGRVYK 51
Query: 61 SLDEKLERFEIFKDNLRHIDETNRKI-KNYWLGLNEFADLRHEEFKEMFLGLKPDLARRK 119
EK +R++IFKDN+ I+ N+ + K+Y L +NEFADL +EEF+ K + +
Sbjct: 52 DAGEKSKRYKIFKDNVARIESFNKAMNKSYKLSINEFADLTNEEFRASRNRFKAHICSTE 111
Query: 120 DQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLA 179
S F Y+ V +P +VDWRKKGAVT +K+QG CGSCWAFS VAA+EGI Q+ TG L
Sbjct: 112 ATS---FKYEHVXAVPSTVDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLI 168
Query: 180 SLSEQELIDCDNT-YNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESE 238
SLSEQEL+DCD + + GC+GGLMD AF++I GL E +YPY +GTC K
Sbjct: 169 SLSEQELVDCDTSGEDQGCSGGLMDDAFKFIEQNHGLTTEANYPYAGTDGTCNRKKAAHP 228
Query: 239 VVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAA 298
INGY DVP N+E +L KA+A+QP++VAI+A G +FQFYS GV+ G CGT+LDHGV+A
Sbjct: 229 AAKINGYEDVPANNEKALQKAVAHQPIAVAIDAGGFEFQFYSSGVFTGQCGTELDHGVSA 288
Query: 299 VGYGST-RGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
VGYG++ G+ Y +VKNSWG WGE+GYIRM+R+ + EGLCGI ASYP
Sbjct: 289 VGYGTSDDGMKYWLVKNSWGTGWGEEGYIRMQRDVTEKEGLCGIAMQASYP 339
>gi|37780047|gb|AAP32196.1| cysteine protease 8 [Trifolium repens]
Length = 343
Score = 332 bits (852), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 168/343 (48%), Positives = 218/343 (63%), Gaps = 15/343 (4%)
Query: 9 TILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLER 68
++ + FC+ F +R +D + + WMS++ K+Y+ E+ R
Sbjct: 11 SLALVFCLGLFAIQVTSRTLQ-----------DDSMYERHGQWMSQYGKIYKDHQERETR 59
Query: 69 FEIFKDNLRHIDETNRK-IKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFS 127
F+IF +N+ +++ +N K+Y LG+N+FADL +EEF K + ++ F
Sbjct: 60 FKIFTENVNYVEASNADDTKSYKLGINQFADLTNEEFVASRNKFKGHMCSSITRT-TTFK 118
Query: 128 YKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELI 187
Y++V +P +VDWRKKGAVT VKNQG CG CWAFS VAA EGI+++ TG L SLSEQEL+
Sbjct: 119 YENVSAIPSTVDWRKKGAVTPVKNQGQCGCCWAFSAVAATEGIHKLSTGKLISLSEQELV 178
Query: 188 DCDNT-YNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYH 246
DCD + GC GGLMD AF++I+ GL E YPY +GTC K + VTI GY
Sbjct: 179 DCDTKGVDQGCEGGLMDDAFKFIIQNHGLSTEAQYPYEGVDGTCNANKASVQAVTITGYE 238
Query: 247 DVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYG-STR 305
DVP NSE +L KA+ANQP+SVAI+ASG DFQFY GV+ G CGT+LDHGV AVGYG S
Sbjct: 239 DVPANSEQALQKAVANQPISVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVSND 298
Query: 306 GLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
G Y +VKNSWG WGE+GYI M+R EGLCGI ASYP
Sbjct: 299 GTKYWLVKNSWGTDWGEEGYIMMQRGVEAAEGLCGIAMQASYP 341
>gi|118627554|emb|CAL64936.1| putative cysteine protease 8 [Trifolium pratense]
Length = 344
Score = 332 bits (851), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 175/357 (49%), Positives = 223/357 (62%), Gaps = 24/357 (6%)
Query: 1 MALSSQFKTILIS--FCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKV 58
MA ++Q I ++ FC+ F +R +D + + WMS++ K+
Sbjct: 1 MAANNQLYHISLALLFCLGLFAIQVTSRTLQ-----------DDSMYERHGQWMSQYGKI 49
Query: 59 YESLDEKLERFEIFKDNLRHIDETNRK--IKNYWLGLNEFADLRHEEF---KEMFLGLKP 113
Y+ E+ RF+IFK+N+ +I+ N K+Y LG+N+FADL +EEF + F G
Sbjct: 50 YKDHQERETRFKIFKENVNYIETFNNADDTKSYKLGINQFADLTNEEFIASRNKFKGHMC 109
Query: 114 DLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQI 173
R F Y++V +P +VDWRKKGAVT VKNQG CG CWAFS VAA EGI+++
Sbjct: 110 SSIMRT----TSFKYENVSGIPSTVDWRKKGAVTPVKNQGQCGCCWAFSAVAATEGIHKL 165
Query: 174 VTGNLASLSEQELIDCDNT-YNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEM 232
TG L SLSEQEL+DCD + GC GGLMD AF++I+ GL E YPY +GTC
Sbjct: 166 STGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFIIQNHGLSTEAQYPYEGVDGTCNA 225
Query: 233 TKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQL 292
K + VTI GY DVP NSE +L KA+ANQP+SVAI+ASG DFQFY GV+ G CGT+L
Sbjct: 226 NKASVQAVTITGYEDVPANSEQALQKAVANQPISVAIDASGSDFQFYKSGVFTGACGTEL 285
Query: 293 DHGVAAVGYG-STRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
DHGV AVGYG S G Y +VKNSWG WGE+GYI M+R EG+CGI ASYP
Sbjct: 286 DHGVTAVGYGVSNDGTKYWLVKNSWGTDWGEEGYIMMQRGIEAAEGICGIAMQASYP 342
>gi|224076968|ref|XP_002305072.1| predicted protein [Populus trichocarpa]
gi|222848036|gb|EEE85583.1| predicted protein [Populus trichocarpa]
Length = 305
Score = 332 bits (850), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 160/307 (52%), Positives = 213/307 (69%), Gaps = 6/307 (1%)
Query: 44 LIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDE-TNRKIKNYWLGLNEFADLRHE 102
++ E WM++ +VY + EK +R+ IFK+N+ I+ N + Y LG+N+FADL +E
Sbjct: 1 MLKRHEEWMAQHGRVYGDMKEKEKRYLIFKENIERIEAFNNGSDRGYKLGVNKFADLTNE 60
Query: 103 EFKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFS 162
EF+ M+ G K ++ S F Y+++ D+P S+DWR GAVT VK+QG+CG CWAFS
Sbjct: 61 EFRAMYHGYKRQSSKLMSSS---FRYENLSDIPTSMDWRNDGAVTPVKDQGTCGCCWAFS 117
Query: 163 TVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYP 222
TVAA+EGI ++ TGNL SLSEQ+L+DC N GC GGLMD AFQYI+ GGL E++YP
Sbjct: 118 TVAAIEGIIKLQTGNLISLSEQQLVDC-TAGNKGCQGGLMDTAFQYIIRNGGLTSEDNYP 176
Query: 223 YIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGG 282
Y +GTC K S I GY DVPQN+E++LL+A+A QP+SVA++ G DF+FY G
Sbjct: 177 YQGVDGTCSSEKAASTEAQITGYEDVPQNNENALLQAVAKQPVSVAVDGGGNDFRFYKSG 236
Query: 283 VYDGHCGTQLDHGVAAVGYGS-TRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGI 341
V++G CGT L+HGV A+GYG+ + G DY +VKNSWG WGE GY RM+R G EGLCG+
Sbjct: 237 VFEGDCGTNLNHGVTAIGYGTDSDGTDYWLVKNSWGTSWGESGYTRMQRGIGASEGLCGV 296
Query: 342 NKMASYP 348
ASYP
Sbjct: 297 AMDASYP 303
>gi|357471211|ref|XP_003605890.1| Cysteine proteinase [Medicago truncatula]
gi|355506945|gb|AES88087.1| Cysteine proteinase [Medicago truncatula]
Length = 343
Score = 332 bits (850), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 166/314 (52%), Positives = 211/314 (67%), Gaps = 5/314 (1%)
Query: 39 TSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKN--YWLGLNEF 96
T D + + WMS++ KVY+ E+ +RF+IF +N+ +I+ N+ N Y LG+N+F
Sbjct: 29 TLQDDMYERHRQWMSQYGKVYKDSQEREKRFKIFTENVNYIEAFNKGDNNKLYTLGVNQF 88
Query: 97 ADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCG 156
ADL ++EF K + ++ F Y++ +P SVDWRKKGAVT VKNQG CG
Sbjct: 89 ADLTNDEFTSSRNKFKGHMCSSITRT-STFKYENASAIPSSVDWRKKGAVTPVKNQGQCG 147
Query: 157 SCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNT-YNNGCNGGLMDYAFQYIVSTGGL 215
CWAFS VAA EGI+++ TG L SLSEQEL+DCD + GC GGLMD AF++I+ GL
Sbjct: 148 CCWAFSAVAATEGIHKLSTGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFIIQNHGL 207
Query: 216 HKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRD 275
+ E +YPY +GTC KG VTI GY DVP N+E +L KA+ANQP+SVAI+ASG D
Sbjct: 208 NTEANYPYQGVDGTCNANKGSINAVTITGYEDVPTNNEQALQKAVANQPISVAIDASGSD 267
Query: 276 FQFYSGGVYDGHCGTQLDHGVAAVGYG-STRGLDYIIVKNSWGPKWGEKGYIRMKRNTGK 334
FQFY GV+ G CGT+LDHGV AVGYG S G Y +VKNSWG +WGE+GYI M+R
Sbjct: 268 FQFYKSGVFTGSCGTELDHGVTAVGYGVSNDGTKYWLVKNSWGTEWGEEGYIMMQRGVDA 327
Query: 335 PEGLCGINKMASYP 348
EGLCGI ASYP
Sbjct: 328 AEGLCGIAMQASYP 341
>gi|146215990|gb|ABQ10197.1| actinidin Act4a [Actinidia eriantha]
Length = 385
Score = 331 bits (849), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 158/313 (50%), Positives = 219/313 (69%), Gaps = 5/313 (1%)
Query: 40 SNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKI-KNYWLGLNEFAD 98
+ND++I +FESW+ ++ K Y +L EK RFEIFKDNLR +DE N + ++Y +GLN+F+D
Sbjct: 40 TNDEVIAMFESWLVEYGKSYNALGEKERRFEIFKDNLRFVDEHNADVNRSYKVGLNQFSD 99
Query: 99 LRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSC 158
L E+ ++LG K ++ R + + + LP SVDWRKKGAV VKNQG+CGSC
Sbjct: 100 LTDAEYSSIYLGTKFNI--RMTNVSDRYEPRVGDQLPDSVDWRKKGAVLGVKNQGNCGSC 157
Query: 159 WAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIVSTGGLHK 217
W F+++AAVEGIN+IVTGNL SLSEQE++DC Y NNGCNGG + A+Q+I++ GG++
Sbjct: 158 WTFASIAAVEGINKIVTGNLISLSEQEIVDCQRKYPNNGCNGGTLSGAYQFIINNGGINT 217
Query: 218 EEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQ 277
E +YPY +G C+ K + VTI+ Y +VP N+E +L KA+A QP+SV I ++ F+
Sbjct: 218 EANYPYTGRDGVCDQNKKNKKYVTIDRYENVPSNNEKALQKAVAFQPVSVVIASNSTAFK 277
Query: 278 FYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEG 337
Y G+++G CG ++DHGV VGYG+ G DY IV+NSWGP WGE GY+RM+RN G G
Sbjct: 278 SYKSGIFNGPCGPRIDHGVTIVGYGTEGGKDYWIVRNSWGPNWGESGYVRMQRNVGG-SG 336
Query: 338 LCGINKMASYPIK 350
C I + YP+K
Sbjct: 337 KCFIARAPVYPVK 349
>gi|255580659|ref|XP_002531152.1| cysteine protease, putative [Ricinus communis]
gi|223529265|gb|EEF31237.1| cysteine protease, putative [Ricinus communis]
Length = 340
Score = 331 bits (849), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 164/339 (48%), Positives = 229/339 (67%), Gaps = 12/339 (3%)
Query: 12 ISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEI 71
IS + FF+ + ++ + + +D + ++K E WM++F++VY EK R++I
Sbjct: 10 ISLALIFFLGALASQ---AIARTLQDASIHEK----HEEWMTRFKRVYSDAKEKEIRYKI 62
Query: 72 FKDNLRHIDETNRKI-KNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKD 130
FK+N++ I+ N+ K+Y LG+N+FADL +EEFK K + + F Y++
Sbjct: 63 FKENVQRIESFNKASEKSYKLGINQFADLTNEEFKTSRNRFKGHMCSSQAGP---FRYEN 119
Query: 131 VVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCD 190
+ +P S+DWRK+GAVT +K+QG CGSCWAFS VAAVEGI Q+ T L SLSEQEL+DCD
Sbjct: 120 ITAVPSSMDWRKEGAVTAIKDQGQCGSCWAFSAVAAVEGITQLATSKLISLSEQELVDCD 179
Query: 191 NT-YNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVP 249
+ GC GGLMD AF++I GL E +YPY +GTC + + ING+ DVP
Sbjct: 180 TKGEDQGCQGGLMDDAFKFIEQNQGLTTEANYPYEGSDGTCNTKQEANHAAKINGFEDVP 239
Query: 250 QNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDY 309
N+E +L+KA+A QP+SVAI+A G +FQFYS G++ G CGT+LDHGVAAVGYG + G++Y
Sbjct: 240 ANNEGALMKAVAKQPVSVAIDAGGFEFQFYSSGIFTGDCGTELDHGVAAVGYGESNGMNY 299
Query: 310 IIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
+VKNSWG +WGE+GYIRM+++ EGLCGI ASYP
Sbjct: 300 WLVKNSWGTQWGEEGYIRMQKDIDAKEGLCGIAMQASYP 338
>gi|144905104|dbj|BAF56427.1| cysteine proteinase [Lotus japonicus]
Length = 342
Score = 331 bits (849), Expect = 3e-88, Method: Compositional matrix adjust.
Identities = 170/351 (48%), Positives = 232/351 (66%), Gaps = 14/351 (3%)
Query: 1 MALSSQFKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYE 60
MA +QF I + + + + F + + +D + +++ E WM+++ KVY+
Sbjct: 1 MATKNQFYQISFALVLCLGLWA-----FQVSSRTLQDASMHER----HEQWMARYGKVYK 51
Query: 61 SLDEKLERFEIFKDNLRHIDETNRK-IKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRK 119
L EK +RF IF++N+++I+ +N K Y LG+N+F DL ++EF K ++
Sbjct: 52 DLQEKEKRFNIFQENVKYIEASNNAGNKPYKLGVNQFTDLTNKEFIATRNKFKGHMSSSI 111
Query: 120 DQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLA 179
++ F Y++V P +VDWR++GAVT VKNQG+CG CWAFS VAA EGI+++ TGNL
Sbjct: 112 TRT-TTFKYENVT-APSTVDWRQEGAVTPVKNQGTCGCCWAFSAVAATEGIHKLSTGNLV 169
Query: 180 SLSEQELIDCDNT-YNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESE 238
SLSEQEL+DCD + + GC GGLMD AF++I+ GGL+ E YPY +GTC + +
Sbjct: 170 SLSEQELVDCDTSGADQGCQGGLMDDAFKFIIQNGGLNTEAQYPYQGVDGTCNTNEEVTH 229
Query: 239 VVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAA 298
V TI GY DVP N+E +L +A+ANQP+SVAI+ASG DFQ Y GV+ G CGTQLDHGVA
Sbjct: 230 VATITGYEDVPSNNEQALQQAVANQPISVAIDASGSDFQNYQSGVFTGSCGTQLDHGVAV 289
Query: 299 VGYG-STRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
VGYG S G Y +VKNSWG WGE+GYIRM+R+ PEGLCGI SYP
Sbjct: 290 VGYGVSDDGTKYWLVKNSWGEDWGEEGYIRMQRDVEAPEGLCGIAMQPSYP 340
>gi|359359118|gb|AEV41024.1| putative oryzain beta chain precursor [Oryza minuta]
Length = 493
Score = 331 bits (848), Expect = 4e-88, Method: Compositional matrix adjust.
Identities = 169/365 (46%), Positives = 225/365 (61%), Gaps = 43/365 (11%)
Query: 27 DFSIVGYSPEDLTSNDKLIDL-----FESWMSKFEKVYESLDEKLERFEIFKDNLRHIDE 81
D SI+ Y+ E + + ++ W+++ + Y +L E+ RF +F DNL+ +D
Sbjct: 23 DMSIISYNAEHGARGLERTEAEARAAYDLWLAENGRSYNALGERERRFRVFWDNLKFVDA 82
Query: 82 TNRKIK---NYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSV 138
N + + LG+N FADL ++EF+ FLG K R + E + + V +LP+SV
Sbjct: 83 HNARADEHGGFRLGMNRFADLTNDEFRATFLGAK--FVERSRAAGERYRHDGVEELPESV 140
Query: 139 DWRKKGAVTHVKNQGSC--------------------------------GSCWAFSTVAA 166
DWR+KGAV VKNQG C GSCWAFS V+
Sbjct: 141 DWREKGAVAPVKNQGQCVDRIIVWNSMVRIYVVDAGCMLENPLMGLTVQGSCWAFSAVST 200
Query: 167 VEGINQIVTGNLASLSEQELIDCD-NTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIM 225
VE INQ+VTG + +LSEQEL++C N N+GCNGGLMD AF +I+ GG+ E+DYPY
Sbjct: 201 VESINQLVTGEMITLSEQELVECSTNGQNSGCNGGLMDDAFDFIIKNGGIDTEDDYPYKA 260
Query: 226 EEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYD 285
+G C++ + ++VV+I+G+ DVPQN E SL KA+A+QP+SVAIEA GR+FQ Y GV+
Sbjct: 261 VDGKCDINRENAKVVSIDGFEDVPQNDEKSLQKAVAHQPVSVAIEAGGREFQLYHSGVFS 320
Query: 286 GHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMA 345
G CGT LDHGV AVGYG+ G DY IV+NSWGPKWGE GY+RM+RN G CGI MA
Sbjct: 321 GRCGTSLDHGVVAVGYGTDNGKDYWIVRNSWGPKWGESGYVRMERNINATTGKCGIAMMA 380
Query: 346 SYPIK 350
SYP K
Sbjct: 381 SYPTK 385
>gi|18391078|ref|NP_563855.1| xylem bark cysteine peptidase 3 [Arabidopsis thaliana]
gi|110741821|dbj|BAE98853.1| papain-like cysteine peptidase XBCP3 [Arabidopsis thaliana]
gi|111074448|gb|ABH04597.1| At1g09850 [Arabidopsis thaliana]
gi|332190386|gb|AEE28507.1| xylem bark cysteine peptidase 3 [Arabidopsis thaliana]
Length = 437
Score = 331 bits (848), Expect = 4e-88, Method: Compositional matrix adjust.
Identities = 167/311 (53%), Positives = 211/311 (67%), Gaps = 4/311 (1%)
Query: 42 DKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKN--YWLGLNEFADL 99
D + +LF+ W K K Y S +E+ +R +IFKDN + + N I N Y L LN FADL
Sbjct: 26 DDISELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNL-ITNATYSLSLNAFADL 84
Query: 100 RHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCW 159
H EFK LGL A + + S V +P SVDWRKKGAVT+VK+QGSCG+CW
Sbjct: 85 THHEFKASRLGLSVS-APSVIMASKGQSLGGSVKVPDSVDWRKKGAVTNVKDQGSCGACW 143
Query: 160 AFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEE 219
+FS A+EGINQIVTG+L SLSEQELIDCD +YN GCNGGLMDYAF++++ G+ E+
Sbjct: 144 SFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNHGIDTEK 203
Query: 220 DYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFY 279
DYPY +GTC+ K + +VVTI+ Y V N E +L++A+A QP+SV I S R FQ Y
Sbjct: 204 DYPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALMEAVAAQPVSVGICGSERAFQLY 263
Query: 280 SGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLC 339
S G++ G C T LDH V VGYGS G+DY IVKNSWG WG G++ M+RNT +G+C
Sbjct: 264 SSGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKSWGMDGFMHMQRNTENSDGVC 323
Query: 340 GINKMASYPIK 350
GIN +ASYPIK
Sbjct: 324 GINMLASYPIK 334
>gi|40806498|gb|AAR92154.1| putative cysteine protease 1 [Iris x hollandica]
Length = 340
Score = 331 bits (848), Expect = 4e-88, Method: Compositional matrix adjust.
Identities = 169/346 (48%), Positives = 222/346 (64%), Gaps = 11/346 (3%)
Query: 5 SQFKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDE 64
+ FKT+ + ++ I + +A G + L N +++ E WM++ +VY++ E
Sbjct: 2 AAFKTVKLLPALALLIVAIWASQ----GEAGRSLGENKSMLERHEQWMAQHGRVYKNAAE 57
Query: 65 KLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHE 124
K RFEIF+ N+ I+ N + + LG+N+FADL +EEFK LKP K S +
Sbjct: 58 KAHRFEIFRANVERIESFNAENHKFKLGVNQFADLTNEEFKTRNT-LKPS----KMASTK 112
Query: 125 DFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQ 184
F Y++V +P ++DWR KGAVT +K+QG CGSCWAFS VAA EGI ++ TG L SLSEQ
Sbjct: 113 SFKYENVTAVPATMDWRTKGAVTPIKDQGQCGSCWAFSAVAATEGITKLSTGKLISLSEQ 172
Query: 185 ELIDCDNTYNN-GCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTIN 243
E++DCD T ++ GCNGG MD AF+YI+ G+ E +YPY +GTC K S +I
Sbjct: 173 EVVDCDVTSDDQGCNGGEMDDAFEYIIKNKGITTEANYPYKAADGTCNTKKAASHAASIT 232
Query: 244 GYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGS 303
GY DV NSE +LLKA ANQP++VAI+A FQ YS GV+ G CGT LDHGV VGYG+
Sbjct: 233 GYEDVTVNSEAALLKAAANQPIAVAIDAGDFAFQMYSSGVFTGDCGTDLDHGVTLVGYGA 292
Query: 304 TR-GLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
T G Y +VKNSWG WGE GYIRM+R+ EGLCGI ASYP
Sbjct: 293 TSDGTKYWLVKNSWGTSWGEDGYIRMERDVDAKEGLCGIAMDASYP 338
>gi|404312774|pdb|3TNX|A Chain A, Structure Of The Precursor Of A Thermostable Variant Of
Papain At 2.6 Angstroem Resolution
gi|404312775|pdb|3TNX|C Chain C, Structure Of The Precursor Of A Thermostable Variant Of
Papain At 2.6 Angstroem Resolution
gi|428698029|pdb|3USV|A Chain A, Structure Of The Precursor Of A Thermostable Variant Of
Papain At 3.8 A Resolution From A Crystal Soaked At Ph 4
gi|428698030|pdb|3USV|C Chain C, Structure Of The Precursor Of A Thermostable Variant Of
Papain At 3.8 A Resolution From A Crystal Soaked At Ph 4
Length = 363
Score = 330 bits (847), Expect = 5e-88, Method: Compositional matrix adjust.
Identities = 166/324 (51%), Positives = 220/324 (67%), Gaps = 6/324 (1%)
Query: 27 DFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKI 86
DFSIVGYS DLTS ++LI LFESWM K K+Y+++DEK+ RFEIFKDNL++IDETN+K
Sbjct: 45 DFSIVGYSQNDLTSTERLIQLFESWMLKHNKIYKNIDEKIYRFEIFKDNLKYIDETNKKN 104
Query: 87 KNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAV 146
+YWLGLN FAD+ ++EFKE + G + S+E+ V++P+ VDWR+KGAV
Sbjct: 105 NSYWLGLNVFADMSNDEFKEKYTGSIAGNYTTTELSYEEVLNDGDVNIPEYVDWRQKGAV 164
Query: 147 THVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAF 206
T VKNQGSCGS WAFS V+ +E I +I TGNL SEQEL+DCD + GCNGG A
Sbjct: 165 TPVKNQGSCGSAWAFSAVSTIESIIKIRTGNLNEYSEQELLDCDRR-SYGCNGGYPWSAL 223
Query: 207 QYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLS 266
Q +V+ G+H YPY + C + +G V +E +LL ++ANQP+S
Sbjct: 224 Q-LVAQYGIHYRNTYPYEGVQRYCRSREKGPYAAKTDGVRQVQPYNEGALLYSIANQPVS 282
Query: 267 VAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYI 326
V +EA+G+DFQ Y GG++ G CG ++DH VAAVGYG +YI+++NSWG WGE GYI
Sbjct: 283 VVLEAAGKDFQLYRGGIFVGPCGNKVDHAVAAVGYGP----NYILIRNSWGTGWGENGYI 338
Query: 327 RMKRNTGKPEGLCGINKMASYPIK 350
R+KR TG G+CG+ + YP+K
Sbjct: 339 RIKRGTGNSYGVCGLYTSSFYPVK 362
>gi|242074728|ref|XP_002447300.1| hypothetical protein SORBIDRAFT_06g032360 [Sorghum bicolor]
gi|241938483|gb|EES11628.1| hypothetical protein SORBIDRAFT_06g032360 [Sorghum bicolor]
Length = 471
Score = 330 bits (847), Expect = 5e-88, Method: Compositional matrix adjust.
Identities = 162/310 (52%), Positives = 205/310 (66%), Gaps = 6/310 (1%)
Query: 47 LFESWMSKFEKVYE-SLDEKLERFEIFKDNLRHIDETNRKI--KNYWLGLNEFADLRHEE 103
++E WM++ K +L E RF F DNLR +D N + + Y LG+N FADL + E
Sbjct: 51 MYEQWMARHGKAASNALGEHDRRFRAFWDNLRFVDAHNARAGARGYRLGINRFADLTNAE 110
Query: 104 FKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFST 163
F+ +L + E + + V LP+ VDWR+KGAV VKNQG CGSCWAFS
Sbjct: 111 FRAAYLSAGARNGTATAATGERYRHDGVEALPEFVDWRQKGAVAPVKNQGQCGSCWAFSA 170
Query: 164 VAAVEGINQIVTGNLASLSEQELIDCD-NTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYP 222
V AVEGINQIVTG L +LSEQEL+DC N N GC+GG+MD AF +IV GG+ ++DYP
Sbjct: 171 VGAVEGINQIVTGELVTLSEQELVDCSKNGQNGGCDGGMMDDAFAFIVGNGGIDTDKDYP 230
Query: 223 YIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGG 282
Y +G C++ K VV+I+G+ VP+N E SL KA+A+QP++VAIEA GR+FQ Y G
Sbjct: 231 YTARDGKCDVAKRSRHVVSIDGFEGVPRNDEKSLQKAVAHQPVAVAIEAGGREFQLYQSG 290
Query: 283 VYDGHCGTQLDHGVAAVGYGSTR--GLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCG 340
V+ G CGT LDHGV AVGYG+ G DY +V+NSWG WGE GYIRM+RN G G CG
Sbjct: 291 VFTGRCGTSLDHGVVAVGYGTEADGGRDYWLVRNSWGADWGEGGYIRMERNVGARAGKCG 350
Query: 341 INKMASYPIK 350
I ASYP+K
Sbjct: 351 IAMEASYPVK 360
>gi|326507362|dbj|BAK03074.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 473
Score = 330 bits (846), Expect = 5e-88, Method: Compositional matrix adjust.
Identities = 163/341 (47%), Positives = 225/341 (65%), Gaps = 16/341 (4%)
Query: 27 DFSIVGYSPEDLTSNDKLID-----LFESWMSKF----EKVYESLDEKLERFEIFKDNLR 77
D SI+ Y+ E + + +++ W+++ S+ ++ RF F DNLR
Sbjct: 26 DMSIIAYNAEHGARGLERTEAEARAVYDLWLAEHGGGSSPNANSIADRERRFSAFWDNLR 85
Query: 78 HIDETNRKI----KNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQS--HEDFSYKDV 131
+D N + + + L +N FADL ++EF+ +LG+K R + E + +
Sbjct: 86 FVDAHNARAAAGEEGFRLAMNRFADLTNDEFRAAYLGVKGAAERNRAGRVVGERYRHDGA 145
Query: 132 VDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCD- 190
+LP++VDWR+KGAV VKNQG CGSCWAFS V+ VE INQIVTG + +LSEQEL++CD
Sbjct: 146 EELPEAVDWREKGAVAPVKNQGQCGSCWAFSAVSTVESINQIVTGEMVTLSEQELVECDI 205
Query: 191 NTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQ 250
N ++GCNGGLMD AF++I+ GG+ E+DYPY +G C++ + ++VV+I+G+ DVP+
Sbjct: 206 NGQSSGCNGGLMDDAFEFIIKNGGIDTEDDYPYKAVDGRCDVLRKNAKVVSIDGFEDVPE 265
Query: 251 NSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYI 310
N E SL KA+A+ P+SVAIEA GR+FQ Y GV+ G CGTQLDHGV AVGYG+ G DY
Sbjct: 266 NDEKSLQKAVAHHPVSVAIEAGGREFQLYHSGVFSGRCGTQLDHGVVAVGYGTENGKDYW 325
Query: 311 IVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIKK 351
IV+NSWGP WGE GY+RM+RN G CGI M+SYP KK
Sbjct: 326 IVRNSWGPNWGEAGYLRMERNINVTSGKCGIAMMSSYPTKK 366
>gi|204307508|gb|ACI00280.1| triticain beta 2 [Hordeum vulgare]
Length = 473
Score = 330 bits (846), Expect = 5e-88, Method: Compositional matrix adjust.
Identities = 163/341 (47%), Positives = 225/341 (65%), Gaps = 16/341 (4%)
Query: 27 DFSIVGYSPEDLTSNDKLID-----LFESWMSKF----EKVYESLDEKLERFEIFKDNLR 77
D SI+ Y+ E + + +++ W+++ S+ ++ RF F DNLR
Sbjct: 26 DMSIIAYNAEHGARGLERTEAEARAVYDLWLAEHGGGSSPNANSIADRERRFSAFWDNLR 85
Query: 78 HIDETNRKI----KNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQS--HEDFSYKDV 131
+D N + + + L +N FADL ++EF+ +LG+K R + E + +
Sbjct: 86 FVDAHNARAAAGEEGFRLAMNRFADLTNDEFRAAYLGVKGAAERNRAGRVVGERYRHDGA 145
Query: 132 VDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCD- 190
+LP++VDWR+KGAV VKNQG CGSCWAFS V+ VE INQIVTG + +LSEQEL++CD
Sbjct: 146 EELPEAVDWREKGAVAPVKNQGQCGSCWAFSAVSTVESINQIVTGEMVTLSEQELVECDI 205
Query: 191 NTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQ 250
N ++GCNGGLMD AF++I+ GG+ E+DYPY +G C++ + ++VV+I+G+ DVP+
Sbjct: 206 NGQSSGCNGGLMDDAFEFIIKNGGIDTEDDYPYKAVDGRCDVLRKNAKVVSIDGFEDVPE 265
Query: 251 NSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYI 310
N E SL KA+A+ P+SVAIEA GR+FQ Y GV+ G CGTQLDHGV AVGYG+ G DY
Sbjct: 266 NDEKSLQKAVAHHPVSVAIEAGGREFQLYHSGVFSGRCGTQLDHGVVAVGYGTENGKDYW 325
Query: 311 IVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIKK 351
IV+NSWGP WGE GY+RM+RN G CGI M+SYP KK
Sbjct: 326 IVRNSWGPNWGEAGYLRMERNINVTSGKCGIAMMSSYPTKK 366
>gi|194352762|emb|CAQ00109.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
gi|326517250|dbj|BAJ99991.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 367
Score = 330 bits (845), Expect = 7e-88, Method: Compositional matrix adjust.
Identities = 181/331 (54%), Positives = 216/331 (65%), Gaps = 22/331 (6%)
Query: 33 YSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKNYWLG 92
+ DL S D L L+E W + V L EK RF +F++N+R I E NR Y L
Sbjct: 32 FGDHDLASEDSLWALYERWREQ-HTVARDLGEKARRFNVFRENVRLIHEFNRGDAPYKLR 90
Query: 93 LNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKD------------VVDLPKSVDW 140
LN F D+ +EF+ + A + H FS K+ V D+P SVDW
Sbjct: 91 LNRFGDMTADEFRRAY-------ASSRVSHHRMFSLKEGGGGFMHGSAASVRDVPPSVDW 143
Query: 141 RKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGG 200
R+KGAVT VK+QG CGSCWAFST+AAVEGIN I + NL SLSEQ+L+DCD N GCNGG
Sbjct: 144 RQKGAVTAVKDQGQCGSCWAFSTIAAVEGINAIRSKNLTSLSEQQLVDCDTKSNAGCNGG 203
Query: 201 LMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKAL 260
LMDYAFQYI GG+ E+ YPY + + K S VVTI+GY DVP N E +L KA+
Sbjct: 204 LMDYAFQYIAKHGGVAAEDAYPYKARQAS-SCNKKPSAVVTIDGYEDVPANDETALKKAV 262
Query: 261 ANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGST-RGLDYIIVKNSWGPK 319
A QP++VAIEASG FQFYS GV+ G CGT+LDHGVAAVGYG+T G Y IVKNSWGP+
Sbjct: 263 AAQPVAVAIEASGSHFQFYSEGVFAGKCGTELDHGVAAVGYGTTVDGTKYWIVKNSWGPE 322
Query: 320 WGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
WGEKGYIRMKR+ EGLCGI ASYP+K
Sbjct: 323 WGEKGYIRMKRDVKDKEGLCGIAMEASYPVK 353
>gi|356543116|ref|XP_003540009.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 337
Score = 330 bits (845), Expect = 7e-88, Method: Compositional matrix adjust.
Identities = 167/302 (55%), Positives = 202/302 (66%), Gaps = 7/302 (2%)
Query: 49 ESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRK-IKNYWLGLNEFADLRHEEFKEM 107
E WM K+ KVY+ EK +R IFKDN+ I+ N K Y LG+N AD +EEF
Sbjct: 39 EQWMKKYGKVYKDAAEKQKRLLIFKDNVEFIESFNAAGNKPYKLGINHLADQTNEEFVAS 98
Query: 108 FLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAV 167
G K + S F Y++V +P +VDWR+ GAVT VK+QG CGSCWAFSTVAA
Sbjct: 99 HNGYK----HKASHSQTPFKYENVTGVPNAVDWRENGAVTAVKDQGQCGSCWAFSTVAAT 154
Query: 168 EGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEE 227
EGI QI T L SLSEQEL+DCD+ ++GC+GG M+ F++I+ GG+ E +YPY +
Sbjct: 155 EGIYQITTSMLMSLSEQELVDCDSV-DHGCDGGYMEGGFEFIIKNGGISSEANYPYTAVD 213
Query: 228 GTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGH 287
GTC+ K S I GY VP NSED+L KA+ANQP+SV I+A G FQFYS GV+ G
Sbjct: 214 GTCDANKEASPAAQIKGYETVPANSEDALQKAVANQPVSVTIDAGGSAFQFYSSGVFTGQ 273
Query: 288 CGTQLDHGVAAVGYGST-RGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMAS 346
CGTQLDHGV AVGYGST G Y IVKNSWG +WGE+GYIRM+R T EGLCGI AS
Sbjct: 274 CGTQLDHGVTAVGYGSTDDGTQYWIVKNSWGTQWGEEGYIRMQRGTDAQEGLCGIAMDAS 333
Query: 347 YP 348
YP
Sbjct: 334 YP 335
>gi|14600257|gb|AAK71314.1|AF388175_1 papain-like cysteine peptidase XBCP3 [Arabidopsis thaliana]
Length = 437
Score = 330 bits (845), Expect = 7e-88, Method: Compositional matrix adjust.
Identities = 167/311 (53%), Positives = 211/311 (67%), Gaps = 4/311 (1%)
Query: 42 DKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKN--YWLGLNEFADL 99
D + +LF+ W K K Y S +E+ +R +IFKDN + + N I N Y L LN FADL
Sbjct: 26 DDISELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNL-ITNATYSLSLNAFADL 84
Query: 100 RHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCW 159
H EFK LGL A + + S V +P SVDWRKKGAVT+VK+QGSCG+CW
Sbjct: 85 THHEFKASRLGLSVS-APSVIMASKGQSLGGSVKVPDSVDWRKKGAVTNVKDQGSCGACW 143
Query: 160 AFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEE 219
+FS A+EGINQIVTG+L SLSEQELIDCD +YN GCNGGLMDYAF++++ G+ E+
Sbjct: 144 SFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNHGIDTEK 203
Query: 220 DYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFY 279
DYPY +GTC+ K + +VVTI+ Y V N E +L++A+A QP+SV I S R FQ Y
Sbjct: 204 DYPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALMEAVAAQPVSVGICGSERAFQLY 263
Query: 280 SGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLC 339
S G++ G C T LDH V VGYGS G+DY IVKNSWG WG G++ M+RNT +G+C
Sbjct: 264 SRGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKSWGMDGFMHMQRNTENSDGVC 323
Query: 340 GINKMASYPIK 350
GIN +ASYPIK
Sbjct: 324 GINMLASYPIK 334
>gi|224076970|ref|XP_002305073.1| predicted protein [Populus trichocarpa]
gi|222848037|gb|EEE85584.1| predicted protein [Populus trichocarpa]
Length = 340
Score = 330 bits (845), Expect = 8e-88, Method: Compositional matrix adjust.
Identities = 160/313 (51%), Positives = 213/313 (68%), Gaps = 6/313 (1%)
Query: 38 LTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDE-TNRKIKNYWLGLNEF 96
L + ++ E WM++ +VY + EK +R+ IFK+N+ I+ N + Y LG+N+F
Sbjct: 30 LDEQEYMLKRHEEWMAQHGRVYGDMKEKEKRYLIFKENIERIEAFNNGSDRGYKLGVNKF 89
Query: 97 ADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCG 156
ADL +EEF+ M+ G K ++ S F Y+++ D+P S+DWR GAVT VK+QG+CG
Sbjct: 90 ADLTNEEFRAMYHGYKRQSSKLMSSS---FRYENLSDIPTSMDWRNDGAVTPVKDQGTCG 146
Query: 157 SCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLH 216
CWAFSTVAA+EGI ++ TGNL SLSEQ+L+DC N GC GGLMD AFQYI+ GGL
Sbjct: 147 CCWAFSTVAAIEGIIKLQTGNLISLSEQQLVDC-TAGNKGCQGGLMDTAFQYIIRNGGLT 205
Query: 217 KEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDF 276
E++YPY +GTC K S I GY DVPQN+E++LL+A+A QP+SV ++ G DF
Sbjct: 206 SEDNYPYQGVDGTCSSEKAASTEAQITGYEDVPQNNENALLQAVAKQPVSVGVDGGGNDF 265
Query: 277 QFYSGGVYDGHCGTQLDHGVAAVGYGS-TRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKP 335
QFY GV++G CGTQ +H V A+GYG+ G DY +VKNSWG WGE GY+RM+R G
Sbjct: 266 QFYKSGVFNGDCGTQQNHAVTAIGYGTDIDGTDYWLVKNSWGTSWGENGYMRMRRGIGSS 325
Query: 336 EGLCGINKMASYP 348
EGLCG+ ASYP
Sbjct: 326 EGLCGVAMDASYP 338
>gi|118124|sp|P25250.1|CYSP2_HORVU RecName: Full=Cysteine proteinase EP-B 2; Flags: Precursor
gi|1146118|gb|AAA85036.1| cysteine proteinase EPB2 precursor [Hordeum vulgare subsp. vulgare]
Length = 373
Score = 329 bits (844), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 177/332 (53%), Positives = 216/332 (65%), Gaps = 14/332 (4%)
Query: 29 SIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKN 88
S + +DL S + L DL+E W S +V EK RF FK N I N++ +
Sbjct: 27 SAIPMEDKDLESEEALWDLYERWQSA-HRVRRHHAEKHRRFGTFKSNAHFIHSHNKRGDH 85
Query: 89 -YWLGLNEFADLRHEEFKEMFLGLKPDLAR---RKDQSHEDFSYK--DVVDLPKSVDWRK 142
Y L LN F D+ EF+ F+G DL R K S F Y +V DLP SVDWR+
Sbjct: 86 PYRLHLNRFGDMDQAEFRATFVG---DLRRDTPSKPPSVPGFMYAALNVSDLPPSVDWRQ 142
Query: 143 KGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLM 202
KGAVT VK+QG CGSCWAFSTV +VEGIN I TG+L SLSEQELIDCD N+GC GGLM
Sbjct: 143 KGAVTGVKDQGKCGSCWAFSTVVSVEGINAIRTGSLVSLSEQELIDCDTADNDGCQGGLM 202
Query: 203 DYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESE---VVTINGYHDVPQNSEDSLLKA 259
D AF+YI + GGL E YPY GTC + + VV I+G+ DVP NSE+ L +A
Sbjct: 203 DNAFEYIKNNGGLITEAAYPYRAARGTCNVARAAQNSPVVVHIDGHQDVPANSEEDLARA 262
Query: 260 LANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYG-STRGLDYIIVKNSWGP 318
+ANQP+SVA+EASG+ F FYS GV+ G CGT+LDHGVA VGYG + G Y VKNSWGP
Sbjct: 263 VANQPVSVAVEASGKAFMFYSEGVFTGECGTELDHGVAVVGYGVAEDGKAYWTVKNSWGP 322
Query: 319 KWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
WGE+GYIR+++++G GLCGI ASYP+K
Sbjct: 323 SWGEQGYIRVEKDSGASGGLCGIAMEASYPVK 354
>gi|255580657|ref|XP_002531151.1| cysteine protease, putative [Ricinus communis]
gi|223529264|gb|EEF31236.1| cysteine protease, putative [Ricinus communis]
Length = 340
Score = 329 bits (844), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 165/342 (48%), Positives = 225/342 (65%), Gaps = 16/342 (4%)
Query: 9 TILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLER 68
++ + F + + + AR +D + ++K E WMS+F +VY +EK R
Sbjct: 11 SLALIFLLGALVSQAMARTL-------QDASMHEK----HEEWMSRFGRVYNDGNEKEIR 59
Query: 69 FEIFKDNLRHIDETNRKI-KNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFS 127
++IFK+N++ I+ N+ K+Y LG+N+FADL +EEFK K + + F
Sbjct: 60 YKIFKENVQRIESFNKASGKSYKLGINQFADLTNEEFKTSRNRFKGHMCSSQAGP---FR 116
Query: 128 YKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELI 187
Y+++ P S+DWRKKGAVT +K+QG CGSCWAFS VAAVEGI Q+ T L SLSEQEL+
Sbjct: 117 YENLTAAPSSMDWRKKGAVTAIKDQGQCGSCWAFSAVAAVEGITQLATSKLISLSEQELV 176
Query: 188 DCDNT-YNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYH 246
DCD + GC GGLMD AF++I GL E +YPY +GTC + + ING+
Sbjct: 177 DCDTKGEDQGCQGGLMDDAFKFIEQNQGLTTEANYPYEGSDGTCNTKQEANHAAKINGFE 236
Query: 247 DVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRG 306
DVP N+E +L+KA+A QP+SVAI+A G FQFYS G++ G CGT+LDHGVAAVGYG + G
Sbjct: 237 DVPANNEGALMKAVAKQPVSVAIDAGGFGFQFYSSGIFTGDCGTELDHGVAAVGYGESNG 296
Query: 307 LDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
++Y +VKNSWG +WGE+GYIRM+++ EGLCGI ASYP
Sbjct: 297 MNYWLVKNSWGTQWGEEGYIRMQKDIDAKEGLCGIAMQASYP 338
>gi|351629615|gb|AEQ54771.1| KDDL-tailed cysteine proteinase CP4 [Coffea canephora]
Length = 359
Score = 329 bits (843), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 173/347 (49%), Positives = 227/347 (65%), Gaps = 11/347 (3%)
Query: 8 KTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLE 67
K L + ++ + ++ + + + DL S + L DL+E W S V L EK +
Sbjct: 5 KAFLFAVVLAVILVAAMSMEIT-----ERDLASEESLWDLYERWRS-HHTVSRDLSEKRK 58
Query: 68 RFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLG-LKPDLARRKDQSHEDF 126
RF +FK N+ HI + N+K K Y L LN FAD+ + EF+E + +K +++ F
Sbjct: 59 RFNVFKANVHHIHKVNQKDKPYKLKLNSFADMTNHEFREFYSSKVKHYRMLHGSRANTGF 118
Query: 127 SYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQEL 186
+ LP SVDWRK+GAVT VKNQG CGSCWAFSTV VEGIN+I TG L SLSEQEL
Sbjct: 119 MHGKTESLPASVDWRKQGAVTGVKNQGKCGSCWAFSTVVGVEGINKIKTGQLVSLSEQEL 178
Query: 187 IDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYH 246
+DC+ T N GCNGGLM+ A+++I +GG+ E YPY +G+C+ +K + VTI+G+
Sbjct: 179 VDCE-TDNEGCNGGLMENAYEFIKKSGGITTERLYPYKARDGSCDSSKMNAPAVTIDGHE 237
Query: 247 DVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDG-HCGTQLDHGVAAVGYGST- 304
VP N E++L+KA+ANQP+SVAI+ASG D QFYS GVY G CG +LDHGVA VGYG+
Sbjct: 238 MVPANDENALMKAVANQPVSVAIDASGSDMQFYSEGVYAGDSCGNELDHGVAVVGYGTAL 297
Query: 305 RGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPE-GLCGINKMASYPIK 350
G Y IVKNSWG WGE+GYIRM+R E G+CGI ASYP+K
Sbjct: 298 DGTKYWIVKNSWGTGWGEQGYIRMQRGVDAAEGGVCGIAMEASYPLK 344
>gi|307110445|gb|EFN58681.1| hypothetical protein CHLNCDRAFT_56822 [Chlorella variabilis]
Length = 466
Score = 329 bits (843), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 162/310 (52%), Positives = 206/310 (66%), Gaps = 6/310 (1%)
Query: 46 DLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFK 105
+ F+ W+ ++ Y S +E RF+++ DNLR + E N ++WL + +ADL +E++
Sbjct: 38 EAFDFWVQTLKRAYASAEEYERRFDVWLDNLRFVHEYNAGHTSHWLSMGVYADLSQDEYR 97
Query: 106 EMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVA 165
LG DL + F Y+ V PK VDW KGAVT VKNQ CGSCWAFST
Sbjct: 98 SKALGYNADLHEERPLRAAPFLYEGTVP-PKEVDWVAKGAVTPVKNQLLCGSCWAFSTTG 156
Query: 166 AVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIM 225
AVEG + I TG LASLSEQ L+DCD +NGC+GGLMD+AF++I+ GG+ E+DYPY
Sbjct: 157 AVEGASAIATGKLASLSEQMLVDCDRERDNGCHGGLMDFAFEFIMKNGGIDTEDDYPYTA 216
Query: 226 EEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYD 285
EEG C+ K VVTI+ Y DVP N E +L+KA+ANQP+SVAIEA R FQ Y GGV+D
Sbjct: 217 EEGMCQDNKMRRHVVTIDDYQDVPPNDEHALMKAVANQPVSVAIEADQRAFQLYGGGVFD 276
Query: 286 GHCGTQLDHGVAAVGYGS----TRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGI 341
CGT LDHGV VGYG+ T L Y +VKNSWG +WG+KGYIR+ RN G+ EG CG+
Sbjct: 277 AECGTALDHGVLVVGYGTASNGTHHLPYWLVKNSWGAEWGDKGYIRLLRNLGE-EGQCGV 335
Query: 342 NKMASYPIKK 351
AS+PIKK
Sbjct: 336 AMQASFPIKK 345
>gi|357133074|ref|XP_003568153.1| PREDICTED: cysteine proteinase RD21a-like [Brachypodium distachyon]
Length = 565
Score = 329 bits (843), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 163/314 (51%), Positives = 202/314 (64%), Gaps = 10/314 (3%)
Query: 47 LFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKI---------KNYWLGLNEFA 97
LFE+W ++ K Y S E+ R F DN + N +Y L LN FA
Sbjct: 41 LFEAWCAEHGKAYASPGERAARLAAFADNAAFVAAHNAGGGGAGGSNAAPSYTLALNAFA 100
Query: 98 DLRHEEFKEMFLGLKPDLARRKDQSHEDFSYK-DVVDLPKSVDWRKKGAVTHVKNQGSCG 156
DL H EF+ LG R S F+ V +P+++DWR+ GAVT VK+QGSCG
Sbjct: 101 DLTHAEFRAARLGRLAVGGARAPPSEGGFAGSVGVGAVPEALDWRQSGAVTKVKDQGSCG 160
Query: 157 SCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLH 216
+CW+FS A+EGIN+I TG+L SLSEQELIDCD +YN GC GGLMDYA+++++ GG+
Sbjct: 161 ACWSFSATGAIEGINKIKTGSLISLSEQELIDCDRSYNAGCGGGLMDYAYRFVIKNGGID 220
Query: 217 KEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDF 276
E+DYPY +GTC K + VVTI+GY DVP N EDSLL+A+A QP+SV I S R F
Sbjct: 221 TEDDYPYREADGTCNKNKLKRHVVTIDGYSDVPANKEDSLLQAVAQQPISVGICGSARAF 280
Query: 277 QFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPE 336
Q YS G++DG C T LDH V VGYGS G DY IVKNSWG +WG KGY+ M RNTG
Sbjct: 281 QLYSQGIFDGPCPTSLDHAVLIVGYGSEGGKDYWIVKNSWGERWGMKGYMHMHRNTGSSS 340
Query: 337 GLCGINKMASYPIK 350
G+CGIN MAS+P K
Sbjct: 341 GICGINMMASFPTK 354
>gi|357474579|ref|XP_003607574.1| Cysteine protease [Medicago truncatula]
gi|355508629|gb|AES89771.1| Cysteine protease [Medicago truncatula]
Length = 345
Score = 329 bits (843), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 164/304 (53%), Positives = 207/304 (68%), Gaps = 6/304 (1%)
Query: 49 ESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKN--YWLGLNEFADLRHEEFKE 106
E WM + KVY+ L E+ R +IFK+N+ +I+ +N N Y LG+N+FAD+ +EEF
Sbjct: 42 EQWMVHYGKVYKDLQERENRLKIFKENVNYIEASNNAGNNKLYKLGINQFADITNEEFIA 101
Query: 107 MFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAA 166
K + ++ F Y++ +P +VDWRKKGAVT VKNQG CG CWAFS VAA
Sbjct: 102 SRNKFKGHMCSSITKT-STFKYENA-SVPSTVDWRKKGAVTPVKNQGQCGCCWAFSAVAA 159
Query: 167 VEGINQIVTGNLASLSEQELIDCDNT-YNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIM 225
EGI+++ TG L SLSEQEL+DCD + GC GGLMD AF++I+ GLH E YPY
Sbjct: 160 TEGIHKLSTGKLVSLSEQELVDCDTKGVDQGCEGGLMDDAFKFIIQNHGLHTEAQYPYQG 219
Query: 226 EEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYD 285
+GTC + + TI GY DVP N+E++L KA+ANQP+SVAI+ASG DFQFY GV+
Sbjct: 220 VDGTCSANETSTPAATIAGYEDVPANNENALQKAVANQPISVAIDASGSDFQFYKSGVFT 279
Query: 286 GHCGTQLDHGVAAVGYG-STRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKM 344
G CGTQLDHGV AVGYG S G Y +VKNSWG WGE+GYIRM+R+ +GLCGI M
Sbjct: 280 GSCGTQLDHGVTAVGYGISNDGTKYWLVKNSWGNDWGEEGYIRMQRSVDAAQGLCGIAMM 339
Query: 345 ASYP 348
ASYP
Sbjct: 340 ASYP 343
>gi|118120|sp|P25249.1|CYSP1_HORVU RecName: Full=Cysteine proteinase EP-B 1; Flags: Precursor
gi|1146116|gb|AAA85035.1| cysteine proteinase EPB1 precursor [Hordeum vulgare subsp. vulgare]
Length = 371
Score = 328 bits (842), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 177/332 (53%), Positives = 216/332 (65%), Gaps = 14/332 (4%)
Query: 29 SIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKN 88
S + +DL S + L DL+E W S +V EK RF FK N I N++ +
Sbjct: 27 SAIPMEDKDLESEEALWDLYERWQSA-HRVRRHHAEKHRRFGTFKSNAHFIHSHNKRGDH 85
Query: 89 -YWLGLNEFADLRHEEFKEMFLGLKPDLAR---RKDQSHEDFSYK--DVVDLPKSVDWRK 142
Y L LN F D+ EF+ F+G DL R K S F Y +V DLP SVDWR+
Sbjct: 86 PYRLHLNRFGDMDQAEFRATFVG---DLRRDTPAKPPSVPGFMYAALNVSDLPPSVDWRQ 142
Query: 143 KGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLM 202
KGAVT VK+QG CGSCWAFSTV +VEGIN I TG+L SLSEQELIDCD N+GC GGLM
Sbjct: 143 KGAVTGVKDQGKCGSCWAFSTVVSVEGINAIRTGSLVSLSEQELIDCDTADNDGCQGGLM 202
Query: 203 DYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESE---VVTINGYHDVPQNSEDSLLKA 259
D AF+YI + GGL E YPY GTC + + VV I+G+ DVP NSE+ L +A
Sbjct: 203 DNAFEYIKNNGGLITEAAYPYRAARGTCNVARAAQNSPVVVHIDGHQDVPANSEEDLARA 262
Query: 260 LANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYG-STRGLDYIIVKNSWGP 318
+ANQP+SVA+EASG+ F FYS GV+ G CGT+LDHGVA VGYG + G Y VKNSWGP
Sbjct: 263 VANQPVSVAVEASGKAFMFYSEGVFTGDCGTELDHGVAVVGYGVAEDGKAYWTVKNSWGP 322
Query: 319 KWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
WGE+GYIR+++++G GLCGI ASYP+K
Sbjct: 323 SWGEQGYIRVEKDSGASGGLCGIAMEASYPVK 354
>gi|359359215|gb|AEV41119.1| putative cysteine protease [Oryza officinalis]
Length = 499
Score = 328 bits (842), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 169/316 (53%), Positives = 211/316 (66%), Gaps = 13/316 (4%)
Query: 47 LFESWMSKFEKVYES----LDEKLERFEIFKDNLRHIDETNRKIK---NYWLGLNEFADL 99
+++ W+++ +S + E RF +F DNL+ +D N + + LG+N FADL
Sbjct: 64 VYDLWVARHRHGGDSHNGLVGEYERRFRVFWDNLKFVDAHNARADEHGGFRLGMNRFADL 123
Query: 100 RHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTH-VKNQGSCGSC 158
++EF+ +LG P A R E + + V LP SVDWR KGAV VKNQG CGSC
Sbjct: 124 TNDEFRAAYLGTTP--AGRGRHVGEAYRHDGVEVLPDSVDWRDKGAVVAPVKNQGQCGSC 181
Query: 159 WAFSTVAAVEGINQIVTGNLASLSEQELIDC-DNTYNNGCNGGLMDYAFQYIVSTGGLHK 217
WAFS VAAVEGIN+IVTG L SLSEQEL++C N N+GCNGG+MD AF +I GGL
Sbjct: 182 WAFSAVAAVEGINKIVTGELVSLSEQELVECARNGANSGCNGGMMDDAFAFIARNGGLDT 241
Query: 218 EEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQ 277
EEDYPY +G C + K +VV+I+G+ DVP+N E SL KA+A+QP+SVAI+A GR+FQ
Sbjct: 242 EEDYPYTAMDGKCNLAKKSRKVVSIDGFEDVPENDELSLQKAVAHQPVSVAIDAGGREFQ 301
Query: 278 FYSGGVYDGHCGTQLDHGVAAVGYGS--TRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKP 335
Y GV+ G CGT LDHGV AVGYG+ G DY V+NSWGP WGE GYIRM+RN
Sbjct: 302 LYDSGVFTGRCGTSLDHGVVAVGYGTDAATGTDYWTVRNSWGPDWGENGYIRMERNVTAR 361
Query: 336 EGLCGINKMASYPIKK 351
G CGI MASYPIKK
Sbjct: 362 TGKCGIAMMASYPIKK 377
>gi|37780045|gb|AAP32195.1| cysteine protease 5 [Trifolium repens]
Length = 343
Score = 328 bits (842), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 165/316 (52%), Positives = 214/316 (67%), Gaps = 14/316 (4%)
Query: 41 NDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKN--YWLGLNEFAD 98
+D + + E WM+ + KVY++ E+ +R IF +NL++I+ +N N Y LG+N+FAD
Sbjct: 32 DDSIFERHEQWMTHYGKVYKNPQEREKRLRIFTENLKYIEASNNAGNNKPYKLGINQFAD 91
Query: 99 LRHEEF---KEMFLG-LKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGS 154
L +EEF + F G + + R +E+ S +P +VDWRKKGAVT VKNQG
Sbjct: 92 LTNEEFIASRNKFKGHMCSSIIRTTTFKYENTS------VPSTVDWRKKGAVTPVKNQGQ 145
Query: 155 CGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCD-NTYNNGCNGGLMDYAFQYIVSTG 213
CG CWAFS +AA EGI++I TG L SLSEQEL+DCD N + GC GGLMD AF++I+
Sbjct: 146 CGCCWAFSAIAATEGIHKISTGKLVSLSEQELVDCDTNGVDQGCEGGLMDDAFKFIIQNN 205
Query: 214 GLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASG 273
G+ E YPY +GTC+ + + TI GY DVP N+E++L KA+ANQP+SVAI+ASG
Sbjct: 206 GISTEAGYPYQGVDGTCKANEASTSAATITGYEDVPANNENALQKAVANQPISVAIDASG 265
Query: 274 RDFQFYSGGVYDGHCGTQLDHGVAAVGYG-STRGLDYIIVKNSWGPKWGEKGYIRMKRNT 332
DFQFY GV+ G CGT+LDHGV AVGYG S G Y +VKNSWG WGE+GYIRM+R+
Sbjct: 266 SDFQFYKSGVFTGSCGTELDHGVTAVGYGISNDGTKYWLVKNSWGTDWGEEGYIRMQRSI 325
Query: 333 GKPEGLCGINKMASYP 348
EGLCGI ASYP
Sbjct: 326 DAAEGLCGIAMQASYP 341
>gi|194352756|emb|CAQ00106.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
Length = 476
Score = 328 bits (841), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 162/341 (47%), Positives = 225/341 (65%), Gaps = 16/341 (4%)
Query: 27 DFSIVGYSPEDLTSNDKLID-----LFESWMSKF----EKVYESLDEKLERFEIFKDNLR 77
D SI+ Y+ E + + +++ W+++ S+ ++ RF F DNLR
Sbjct: 26 DMSIIAYNAEHGARGLERTEAEARAVYDLWLAEHGGGSSPNANSIADRERRFSAFWDNLR 85
Query: 78 HIDETNRKI----KNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQS--HEDFSYKDV 131
+D N + + + L +N FADL ++EF+ +LG+K R + + + +
Sbjct: 86 FVDAHNARAAAGEEGFRLAMNRFADLTNDEFRAAYLGVKGAAERNRAGRVVGDRYRHDGA 145
Query: 132 VDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCD- 190
+LP++VDWR+KGAV VKNQG CGSCWAFS V+ VE INQIVTG + +LSEQEL++CD
Sbjct: 146 EELPEAVDWREKGAVAPVKNQGQCGSCWAFSAVSTVESINQIVTGEMVTLSEQELVECDI 205
Query: 191 NTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQ 250
N ++GCNGGLMD AF++I+ GG+ E+DYPY +G C++ + ++VV+I+G+ DVP+
Sbjct: 206 NGQSSGCNGGLMDDAFEFIIKNGGIDTEDDYPYKAVDGRCDVLRKNAKVVSIDGFEDVPE 265
Query: 251 NSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYI 310
N E SL KA+A+ P+SVAIEA GR+FQ Y GV+ G CGTQLDHGV AVGYG+ G DY
Sbjct: 266 NDEKSLQKAVAHHPVSVAIEAGGREFQLYHSGVFSGRCGTQLDHGVVAVGYGTENGKDYW 325
Query: 311 IVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIKK 351
IV+NSWGP WGE GY+RM+RN G CGI M+SYP KK
Sbjct: 326 IVRNSWGPNWGEAGYLRMERNINVTSGKCGIAMMSSYPTKK 366
>gi|359359166|gb|AEV41071.1| putative oryzain beta chain precursor [Oryza minuta]
Length = 464
Score = 328 bits (841), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 167/332 (50%), Positives = 220/332 (66%), Gaps = 10/332 (3%)
Query: 27 DFSIVGYSPEDLTSNDKLIDL-----FESWMSKFEKVYESLDEKLERFEIFKDNLRHIDE 81
D SI+ Y+ E + + ++ W+++ + Y +L E RF +F DNLR D
Sbjct: 27 DMSIISYNAEHGARGLERTEAEARAAYDLWLAENGRSYNALGEHERRFRVFWDNLRFADA 86
Query: 82 TNRKIKN--YWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVD 139
N + + + LG+N FADL +EEF+ FLG K + R + E + + V +LP+SVD
Sbjct: 87 HNARADDHGFRLGMNRFADLTNEEFRATFLGAK--VVERSRAAGERYRHDGVEELPESVD 144
Query: 140 WRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNG 199
WR+KGAV VKNQG CGSCWAFS V+ VE INQ+VTG + +LSEQEL++C NG
Sbjct: 145 WREKGAVAPVKNQGQCGSCWAFSAVSTVESINQLVTGEMITLSEQELVECSTNGQNGGCN 204
Query: 200 G-LMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLK 258
G LMD AF +I+ GG+ E+DYPY +G C++ + ++VV+I+G+ DVPQN E SL K
Sbjct: 205 GGLMDDAFDFIIKNGGIDTEDDYPYKAVDGKCDINRENAKVVSIDGFEDVPQNDEKSLQK 264
Query: 259 ALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGP 318
A+A+QP+SVAIEA GR+FQ Y GV+ G CGT LDHGV AVGYG+ G DY IV+NSWGP
Sbjct: 265 AVAHQPVSVAIEAGGREFQLYHSGVFSGRCGTSLDHGVVAVGYGTDNGKDYWIVRNSWGP 324
Query: 319 KWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
KWGE GY+RM+RN G CGI MASYP K
Sbjct: 325 KWGESGYVRMERNINVTTGKCGIAMMASYPTK 356
>gi|242055323|ref|XP_002456807.1| hypothetical protein SORBIDRAFT_03g043220 [Sorghum bicolor]
gi|241928782|gb|EES01927.1| hypothetical protein SORBIDRAFT_03g043220 [Sorghum bicolor]
Length = 369
Score = 328 bits (841), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 177/325 (54%), Positives = 214/325 (65%), Gaps = 10/325 (3%)
Query: 33 YSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETN-RKIKNYWL 91
+ DL S++ L DL+E W + V+ EK RF FK+N+R I N R + Y L
Sbjct: 27 FDERDLASDEALWDLYERWQTH-HHVHRHHGEKGRRFGTFKENVRFIHAHNKRGDRPYRL 85
Query: 92 GLNEFADLRHEEFKEMFLGLKPDLARRKDQSHED----FSYKDVVDLPKSVDWRKKGAVT 147
LN F D+ EEF+ F + + RR + F Y V DLP SVDWRK+GAVT
Sbjct: 86 SLNRFGDMGREEFRSTFADSRINDLRRAESPAAPAVPGFMYDGVTDLPPSVDWRKEGAVT 145
Query: 148 HVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQ 207
VK+QG CGSCWAFSTV +VEGIN I TG+L SLSEQELIDCD T NGC GGLM+ AF+
Sbjct: 146 AVKDQGHCGSCWAFSTVVSVEGINAIRTGSLVSLSEQELIDCD-TDENGCQGGLMENAFE 204
Query: 208 YIVSTGGLHKEEDYPYIMEEGTCEMTKG-ESEVVTINGYHDVPQNSEDSLLKALANQPLS 266
+I S GG+ E YPY GTC+ + ++V+I+G+ VP SED+L KA+ANQP+S
Sbjct: 205 FIKSYGGVTTESAYPYRASNGTCDSVRSRRGQIVSIDGHQMVPTGSEDALAKAVANQPVS 264
Query: 267 VAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYG-STRGLDYIIVKNSWGPKWGEKGY 325
VAI+A G+ FQFYS GV+ G CGT LDHGVAAVGYG S G Y IVKNSWGP WGE GY
Sbjct: 265 VAIDAGGQAFQFYSEGVFTGDCGTDLDHGVAAVGYGVSDDGTAYWIVKNSWGPSWGEGGY 324
Query: 326 IRMKRNTGKPEGLCGINKMASYPIK 350
IRM+R G GLCGI AS+PIK
Sbjct: 325 IRMQRGAGN-GGLCGIAMEASFPIK 348
>gi|42567068|ref|NP_567686.2| putative cysteine proteinase [Arabidopsis thaliana]
gi|332659371|gb|AEE84771.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 356
Score = 328 bits (840), Expect = 3e-87, Method: Compositional matrix adjust.
Identities = 168/344 (48%), Positives = 229/344 (66%), Gaps = 5/344 (1%)
Query: 10 ILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVY-ESLDEKLER 68
+ I F + F+ S+ + + S SN+++ +F+ WMSK K Y +L EK R
Sbjct: 9 MTILFLLIVFVLSAPSSAMDLPATSGGHNRSNEEVEFIFQMWMSKHGKTYTNALGEKERR 68
Query: 69 FEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSY 128
F+ FKDNLR ID+ N K +Y LGL FADL +E++++F G P +R ++ +
Sbjct: 69 FQNFKDNLRFIDQHNAKNLSYQLGLTRFADLTVQEYRDLFPG-SPKPKQRNLKTSRRYVP 127
Query: 129 KDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELID 188
LP+SVDWR++GAV+ +K+QG+C SCWAFSTVAAVEG+N+IVTG L SLSEQEL+D
Sbjct: 128 LAGDQLPESVDWRQEGAVSEIKDQGTCNSCWAFSTVAAVEGLNKIVTGELISLSEQELVD 187
Query: 189 CDNTYNNGCNG-GLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGES-EVVTINGYH 246
C N NNGC G GLMD AFQ++++ GL E+DYPY +G+C + S +V+TI+ Y
Sbjct: 188 C-NLVNNGCYGSGLMDTAFQFLINNNGLDSEKDYPYQGTQGSCNRKQSTSNKVITIDSYE 246
Query: 247 DVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRG 306
DVP N E SL KA+A+QP+SV ++ ++F Y +Y+G CGT LDH + VGYGS G
Sbjct: 247 DVPANDEISLQKAVAHQPVSVGVDKKSQEFMLYRSCIYNGPCGTNLDHALVIVGYGSENG 306
Query: 307 LDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
DY IV+NSWG WG+ GYI++ RN P+GLCGI +ASYPIK
Sbjct: 307 QDYWIVRNSWGTTWGDAGYIKIARNFEDPKGLCGIAMLASYPIK 350
>gi|109119897|dbj|BAE96008.1| cysteine proteinase [Triticum aestivum]
Length = 377
Score = 328 bits (840), Expect = 3e-87, Method: Compositional matrix adjust.
Identities = 175/325 (53%), Positives = 211/325 (64%), Gaps = 12/325 (3%)
Query: 37 DLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRK-IKNYWLGLNE 95
DL S + L DL+E W + +V EK RF FK N+ I N++ + Y L LN
Sbjct: 35 DLESEEALWDLYERWQTA-HRVPRHHAEKHRRFGTFKSNVHFIHSHNKRGDRPYRLRLNR 93
Query: 96 FADLRHEEFKEMFLGLKPDLARR----KDQSHEDFSYK--DVVDLPKSVDWRKKGAVTHV 149
F D+ EF+ F G + RR S F Y +V DLP+SVDWR+KGAVT V
Sbjct: 94 FGDMSQAEFRATFAGSRVSDRRRDGPATPPSVPGFMYAAVNVSDLPRSVDWRQKGAVTGV 153
Query: 150 KNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYI 209
KNQG CGSCWAFSTV +VEGIN I TG L SLSEQELIDCD N+GC GGLMD AF+YI
Sbjct: 154 KNQGKCGSCWAFSTVVSVEGINAIRTGKLVSLSEQELIDCDTADNDGCEGGLMDNAFEYI 213
Query: 210 VSTGGLHKEEDYPYIMEEGTC---EMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLS 266
GGL E YPY GTC ++ K VV I+G+ DVP NSE++L KA+ANQP+S
Sbjct: 214 KKNGGLTTEAAYPYRAANGTCKAAKVAKSSPMVVHIDGHQDVPANSEEALAKAVANQPVS 273
Query: 267 VAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYG-STRGLDYIIVKNSWGPKWGEKGY 325
V I+ASG+ F FYS GV+ G CGT+LDHGVA VGYG + G Y VKNSWGP WGEKGY
Sbjct: 274 VGIDASGKAFMFYSEGVFTGECGTELDHGVAVVGYGVAEDGKAYWTVKNSWGPSWGEKGY 333
Query: 326 IRMKRNTGKPEGLCGINKMASYPIK 350
IR+++++G GLCGI ASY +K
Sbjct: 334 IRVEKDSGAEGGLCGIAMEASYAVK 358
>gi|359359168|gb|AEV41073.1| putative cysteine protease [Oryza minuta]
Length = 499
Score = 328 bits (840), Expect = 3e-87, Method: Compositional matrix adjust.
Identities = 167/297 (56%), Positives = 202/297 (68%), Gaps = 9/297 (3%)
Query: 62 LDEKLERFEIFKDNLRHIDETNRKIK---NYWLGLNEFADLRHEEFKEMFLGLKPDLARR 118
+ E RF +F DNL+ +D N + + LG+N FADL ++EF+ +LG P A R
Sbjct: 83 VGEYERRFRVFWDNLKFVDAHNARADEHGGFRLGMNRFADLTNDEFRAAYLGTTP--AGR 140
Query: 119 KDQSHEDFSYKDVVDLPKSVDWRKKGAVTH-VKNQGSCGSCWAFSTVAAVEGINQIVTGN 177
E + + V LP SVDWR KGAV VKNQG CGSCWAFS VAAVEGIN+IVTG
Sbjct: 141 GRHVGEAYRHDGVEALPDSVDWRDKGAVVAPVKNQGQCGSCWAFSAVAAVEGINKIVTGE 200
Query: 178 LASLSEQELIDC-DNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGE 236
L SLSEQEL++C N N+GCNGG+MD AF +I GGL EEDYPY +G C + K
Sbjct: 201 LVSLSEQELVECARNGANSGCNGGMMDDAFAFIARNGGLDTEEDYPYTAMDGKCNLAKKS 260
Query: 237 SEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGV 296
+VV+I+G+ DVP+N E SL KA+A+QP+SVAI+A GR+FQ Y GV+ G CGT LDHGV
Sbjct: 261 RKVVSIDGFEDVPENDELSLQKAVAHQPVSVAIDAGGREFQLYDSGVFTGRCGTSLDHGV 320
Query: 297 AAVGYGS--TRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIKK 351
AVGYG+ G DY V+NSWGP WGE GYIRM+RN G CGI MASYPIKK
Sbjct: 321 VAVGYGTDAATGTDYWTVRNSWGPDWGENGYIRMERNVTARTGKCGIAMMASYPIKK 377
>gi|357477459|ref|XP_003609015.1| Cysteine proteinase [Medicago truncatula]
gi|355510070|gb|AES91212.1| Cysteine proteinase [Medicago truncatula]
Length = 345
Score = 328 bits (840), Expect = 3e-87, Method: Compositional matrix adjust.
Identities = 165/311 (53%), Positives = 203/311 (65%), Gaps = 5/311 (1%)
Query: 42 DKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKN--YWLGLNEFADL 99
D + + E WMS++ KVY+ E+ ER +IF N+ +I+ N N Y LG+N+FADL
Sbjct: 34 DSMYERHEQWMSQYSKVYKDPQEREERHKIFTANVNYIEVFNNDANNKLYKLGINQFADL 93
Query: 100 RHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCW 159
+EEF K + ++ F Y++V +P +VDWRKKGAVT VKNQG CG CW
Sbjct: 94 TNEEFIASRNKFKGHMCSSIAKT-TTFKYENVSAIPSTVDWRKKGAVTPVKNQGQCGCCW 152
Query: 160 AFSTVAAVEGINQIVTGNLASLSEQELIDCDNT-YNNGCNGGLMDYAFQYIVSTGGLHKE 218
AFS VAA EGI ++ TG L SLSEQEL+DCD + GC GGLMD AF++I+ GL E
Sbjct: 153 AFSAVAATEGITKLSTGKLVSLSEQELVDCDTKGVDQGCEGGLMDDAFKFIIQNHGLSTE 212
Query: 219 EDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQF 278
YPY +GTC K TI GY DVP N+E +L KA+ANQP+SVAI+ASG DFQF
Sbjct: 213 AAYPYQGVDGTCNANKASIHAATITGYEDVPANNEQALQKAVANQPISVAIDASGSDFQF 272
Query: 279 YSGGVYDGHCGTQLDHGVAAVGYG-STRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEG 337
Y GV+ G CGT+LDHGV AVGYG G Y +VKNSWG WGE+GYIRM+R EG
Sbjct: 273 YKSGVFSGSCGTELDHGVTAVGYGVGNDGTKYWLVKNSWGTDWGEEGYIRMQRGVDAAEG 332
Query: 338 LCGINKMASYP 348
LCGI ASYP
Sbjct: 333 LCGIAMQASYP 343
>gi|224093956|ref|XP_002310053.1| predicted protein [Populus trichocarpa]
gi|224147016|ref|XP_002336386.1| predicted protein [Populus trichocarpa]
gi|222834869|gb|EEE73318.1| predicted protein [Populus trichocarpa]
gi|222852956|gb|EEE90503.1| predicted protein [Populus trichocarpa]
Length = 340
Score = 328 bits (840), Expect = 3e-87, Method: Compositional matrix adjust.
Identities = 163/325 (50%), Positives = 216/325 (66%), Gaps = 9/325 (2%)
Query: 30 IVGYSPEDLTS----NDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRK 85
I+G P T+ + + + E WM+++ +VY+ +E+ R+ IFK+N+ ID N +
Sbjct: 17 ILGAWPSKSTARTLLDAPMYERHEQWMTQYGRVYKDDNERATRYSIFKENVARIDAFNSQ 76
Query: 86 I-KNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKG 144
K+Y LG+N+FADL +EEFK K + + F Y++V +P +VDWRK+G
Sbjct: 77 TGKSYKLGVNQFADLTNEEFKASRNRFKGHMCSPQ---AGPFRYENVSAVPSTVDWRKEG 133
Query: 145 AVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNT-YNNGCNGGLMD 203
AVT VK+QG CG CWAFS VAA+EGIN++ TG L SLSEQE++DCD + GCNGGLMD
Sbjct: 134 AVTPVKDQGQCGCCWAFSAVAAMEGINKLTTGKLISLSEQEVVDCDTKGEDQGCNGGLMD 193
Query: 204 YAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQ 263
AF++I GL E +YPY +GTC K I G+ DVP NSE +L+KA+A Q
Sbjct: 194 DAFKFIEQNKGLTTEANYPYKGTDGTCNTNKAAIHAAKITGFEDVPANSEAALMKAVAKQ 253
Query: 264 PLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEK 323
P+SVAI+A G DFQFYS G++ G C TQLDHGV AVGYG + G Y +VKNSWG +WGE+
Sbjct: 254 PVSVAIDAGGSDFQFYSSGIFTGSCDTQLDHGVTAVGYGVSDGSKYWLVKNSWGAQWGEE 313
Query: 324 GYIRMKRNTGKPEGLCGINKMASYP 348
GYIRM+++ EGLCGI ASYP
Sbjct: 314 GYIRMQKDISAKEGLCGIAMQASYP 338
>gi|3451077|emb|CAA20473.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|7269200|emb|CAB79307.1| cysteine proteinase-like protein [Arabidopsis thaliana]
Length = 355
Score = 328 bits (840), Expect = 3e-87, Method: Compositional matrix adjust.
Identities = 167/343 (48%), Positives = 227/343 (66%), Gaps = 4/343 (1%)
Query: 10 ILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVY-ESLDEKLER 68
+ I F + F+ S+ + + S SN+++ +F+ WMSK K Y +L EK R
Sbjct: 9 MTILFLLIVFVLSAPSSAMDLPATSGGHNRSNEEVEFIFQMWMSKHGKTYTNALGEKERR 68
Query: 69 FEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSY 128
F+ FKDNLR ID+ N K +Y LGL FADL +E++++F G P +R ++ +
Sbjct: 69 FQNFKDNLRFIDQHNAKNLSYQLGLTRFADLTVQEYRDLFPG-SPKPKQRNLKTSRRYVP 127
Query: 129 KDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELID 188
LP+SVDWR++GAV+ +K+QG+C SCWAFSTVAAVEG+N+IVTG L SLSEQEL+D
Sbjct: 128 LAGDQLPESVDWRQEGAVSEIKDQGTCNSCWAFSTVAAVEGLNKIVTGELISLSEQELVD 187
Query: 189 CDNTYNNGCNG-GLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHD 247
C N NNGC G GLMD AFQ++++ GL E+DYPY +G+C + V+TI+ Y D
Sbjct: 188 C-NLVNNGCYGSGLMDTAFQFLINNNGLDSEKDYPYQGTQGSCNRKQVHLLVITIDSYED 246
Query: 248 VPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGL 307
VP N E SL KA+A+QP+SV ++ ++F Y +Y+G CGT LDH + VGYGS G
Sbjct: 247 VPANDEISLQKAVAHQPVSVGVDKKSQEFMLYRSCIYNGPCGTNLDHALVIVGYGSENGQ 306
Query: 308 DYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
DY IV+NSWG WG+ GYI++ RN P+GLCGI +ASYPIK
Sbjct: 307 DYWIVRNSWGTTWGDAGYIKIARNFEDPKGLCGIAMLASYPIK 349
>gi|37780051|gb|AAP32198.1| cysteine protease 12 [Trifolium repens]
Length = 343
Score = 328 bits (840), Expect = 3e-87, Method: Compositional matrix adjust.
Identities = 165/316 (52%), Positives = 214/316 (67%), Gaps = 14/316 (4%)
Query: 41 NDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNR--KIKNYWLGLNEFAD 98
+D + + E WM+ + KVY++ E+ +R IF +NL++I+ +N K Y LG+N+FAD
Sbjct: 32 DDSIFERHEQWMTHYGKVYKNPQEREKRLRIFTENLKYIEASNNAGNKKPYKLGINQFAD 91
Query: 99 LRHEEF---KEMFLG-LKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGS 154
L +EEF + F G + + R +E+ S +P +VDWRKKGAVT VKNQG
Sbjct: 92 LTNEEFIASRNKFKGHMCSSIIRTTTFKYENTS------VPSTVDWRKKGAVTPVKNQGQ 145
Query: 155 CGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCD-NTYNNGCNGGLMDYAFQYIVSTG 213
CG CWAFS +AA EGI++I TG L SLSEQEL+DCD N + GC GGLMD AF++I+
Sbjct: 146 CGCCWAFSAIAATEGIHKISTGKLVSLSEQELVDCDTNGVDQGCEGGLMDDAFKFIIQNN 205
Query: 214 GLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASG 273
G+ E YPY +GTC+ + + TI GY DVP N+E++L KA+ANQP+SVAI+ASG
Sbjct: 206 GISTEAGYPYQGVDGTCKANEASTSAATITGYEDVPANNENALQKAVANQPISVAIDASG 265
Query: 274 RDFQFYSGGVYDGHCGTQLDHGVAAVGYG-STRGLDYIIVKNSWGPKWGEKGYIRMKRNT 332
DFQFY GV+ G CGT+LDHGV AVGYG S G Y +VKNSWG WGE+GYIRM+R+
Sbjct: 266 SDFQFYKSGVFTGSCGTELDHGVTAVGYGISNDGTKYWLVKNSWGTDWGEEGYIRMQRSI 325
Query: 333 GKPEGLCGINKMASYP 348
EGLCGI ASYP
Sbjct: 326 DAAEGLCGIAMQASYP 341
>gi|146215984|gb|ABQ10194.1| actinidin Act2c [Actinidia arguta]
Length = 378
Score = 327 bits (839), Expect = 4e-87, Method: Compositional matrix adjust.
Identities = 166/313 (53%), Positives = 216/313 (69%), Gaps = 6/313 (1%)
Query: 40 SNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKI-KNYWLGLNEFAD 98
+ND++ D++ESW+ + K Y SLDEK RFEIFKDNLR ID+ N +++ LGLN FAD
Sbjct: 34 TNDQVRDMYESWLVEQGKSYNSLDEKEMRFEIFKDNLRIIDDHNADANRSFSLGLNRFAD 93
Query: 99 LRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSC 158
L EE++ +LG K + K + DV LP VDWR GAV VKNQG C SC
Sbjct: 94 LTDEEYRSTYLGFKSG-PKAKVSNRYVPKVGDV--LPNYVDWRTVGAVVGVKNQGLCSSC 150
Query: 159 WAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNN-GCNGGLMDYAFQYIVSTGGLHK 217
WAFS VAAVEGIN+I+TGNL SLSEQEL+DC T + GCN G M AFQ+I++ GG++
Sbjct: 151 WAFSAVAAVEGINKIMTGNLLSLSEQELVDCGRTQSTRGCNRGYMTDAFQFIINNGGINT 210
Query: 218 EEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQ 277
E++YPY ++G C + VTI+ Y +VP N+E +L A+A+QP+SV +E+ G F+
Sbjct: 211 EDNYPYTAQDGQCNRYLQNQKYVTIDDYENVPSNNEWALQNAVAHQPVSVGLESEGGKFK 270
Query: 278 FYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEG 337
Y+ G++ +CGT +DHGV VGYG+ RGLDY IVKNSWG WGE GYIR++RN G G
Sbjct: 271 LYTSGIFTQYCGTAIDHGVTIVGYGTERGLDYWIVKNSWGTNWGENGYIRIQRNIGG-AG 329
Query: 338 LCGINKMASYPIK 350
CGI +MASYP+K
Sbjct: 330 KCGIARMASYPVK 342
>gi|24285904|gb|AAL14199.1| cysteine proteinase precursor [Ipomoea batatas]
gi|56961686|gb|AAK15148.2| cysteine proteinase-like protein [Ipomoea batatas]
Length = 341
Score = 327 bits (838), Expect = 5e-87, Method: Compositional matrix adjust.
Identities = 169/325 (52%), Positives = 220/325 (67%), Gaps = 7/325 (2%)
Query: 28 FSIVGYSPEDLTSNDKLIDL-FESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRK- 85
F+ Y T +D L+ + E WM+++ +VYE+ EK +RF IFK+N+ +I+ N+
Sbjct: 18 FATSAYLATSRTLSDSLMVVRHEQWMAQYGRVYENEVEKTKRFNIFKENVEYIESFNKAG 77
Query: 86 IKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGA 145
K Y LG+N FADL ++EFK G K S+ F Y++V +P +VDWR KGA
Sbjct: 78 TKPYKLGINAFADLTNQEFKASRNGYK---LPHDCSSNTPFRYENVSSVPTTVDWRTKGA 134
Query: 146 VTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCD-NTYNNGCNGGLMDY 204
VT VK+QG CG CWAFS VAA+EGI ++ TGNL SLSEQEL+DCD + GC GGLMD
Sbjct: 135 VTPVKDQGQCGCCWAFSAVAAMEGITKLSTGNLISLSEQELVDCDVKGIDQGCEGGLMDD 194
Query: 205 AFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQP 264
AF +I++ GL E +YPY +G+C+ +K + I+GY DVP NSE +L KA+ANQP
Sbjct: 195 AFSFIINNKGLTTESNYPYQGTDGSCKKSKSSNSAAKISGYEDVPANSESALEKAVANQP 254
Query: 265 LSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYG-STRGLDYIIVKNSWGPKWGEK 323
+SVAI+A G DFQFYS GV+ G CGT+LDHGV AVGYG + G Y +VKNSWG WGEK
Sbjct: 255 VSVAIDAGGSDFQFYSSGVFTGECGTELDHGVTAVGYGIAEDGSKYWLVKNSWGTSWGEK 314
Query: 324 GYIRMKRNTGKPEGLCGINKMASYP 348
GYIRM+++ EGLCGI +SYP
Sbjct: 315 GYIRMQKDIEAKEGLCGIAMQSSYP 339
>gi|357474725|ref|XP_003607647.1| Cysteine proteinase [Medicago truncatula]
gi|355508702|gb|AES89844.1| Cysteine proteinase [Medicago truncatula]
Length = 340
Score = 327 bits (838), Expect = 5e-87, Method: Compositional matrix adjust.
Identities = 168/313 (53%), Positives = 209/313 (66%), Gaps = 6/313 (1%)
Query: 38 LTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKI-KNYWLGLNEF 96
L + L + E WMS++ K+Y+ EK +RF IFKDN+ I+ N K Y L +N
Sbjct: 30 LYESPSLQERHEQWMSEYGKLYKDAIEKEKRFMIFKDNVEFIESFNAADNKPYKLSVNHL 89
Query: 97 ADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCG 156
ADL +EFK G K ++ + F Y++V +P++VDWR KGAVT +K+QG CG
Sbjct: 90 ADLTLDEFKASRNGYKK---IDREFATTSFKYENVTAIPEAVDWRVKGAVTPIKDQGQCG 146
Query: 157 SCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNT-YNNGCNGGLMDYAFQYIVSTGGL 215
SCWAFSTVAA+EGINQI TG L SLSEQEL+DCD + GC GGLM+ F++I+ GG+
Sbjct: 147 SCWAFSTVAAIEGINQITTGKLISLSEQELVDCDTKGEDQGCEGGLMEDGFEFIIKNGGI 206
Query: 216 HKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRD 275
E +YPY +G+C T + V I GY VP NSE SLLKA+ANQP+SV+I+AS
Sbjct: 207 TSETNYPYKAADGSCN-TATTAPVAKITGYEKVPVNSEISLLKAVANQPISVSIDASDSS 265
Query: 276 FQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKP 335
F FYS G+Y G CGT+LDHGV AVGYGS G DY IVKNSWG WGEKGYIRM+R
Sbjct: 266 FMFYSSGIYTGECGTELDHGVTAVGYGSANGTDYWIVKNSWGTVWGEKGYIRMQRGIADK 325
Query: 336 EGLCGINKMASYP 348
EGLCGI +SYP
Sbjct: 326 EGLCGIAMDSSYP 338
>gi|13491750|gb|AAK27968.1|AF242372_1 cysteine protease [Ipomoea batatas]
Length = 339
Score = 327 bits (837), Expect = 6e-87, Method: Compositional matrix adjust.
Identities = 168/325 (51%), Positives = 220/325 (67%), Gaps = 7/325 (2%)
Query: 28 FSIVGYSPEDLTSNDKLIDL-FESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRK- 85
F+ Y T +D L+ + E WM+++ +VY++ EK +RF IFK+N+ +I+ N+
Sbjct: 16 FATSAYLATSRTLSDSLMVVRHEQWMAQYGRVYKTEAEKTKRFNIFKENVEYIESFNKAG 75
Query: 86 IKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGA 145
K Y LG+N FADL ++EFK G K S+ F Y++V +P +VDWR KGA
Sbjct: 76 TKPYKLGINAFADLTNQEFKASRNGYK---LPHDCSSNTPFRYENVSSVPTTVDWRTKGA 132
Query: 146 VTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCD-NTYNNGCNGGLMDY 204
VT VK+QG CG CWAFS VAA+EGI ++ TGNL SLSEQEL+DCD + GC GGLMD
Sbjct: 133 VTPVKDQGQCGCCWAFSAVAAMEGITKLSTGNLISLSEQELVDCDVKGTDQGCEGGLMDD 192
Query: 205 AFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQP 264
AF +I++ GL E +YPY +G+C+ +K + I+GY DVP NSE +L KA+ANQP
Sbjct: 193 AFSFIINNKGLTTESNYPYQGTDGSCKKSKSSNSAAKISGYEDVPANSESALEKAVANQP 252
Query: 265 LSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYG-STRGLDYIIVKNSWGPKWGEK 323
+SVAI+A G DFQFYS GV+ G CGT+LDHGV AVGYG + G Y +VKNSWG WGEK
Sbjct: 253 VSVAIDAGGSDFQFYSSGVFTGECGTELDHGVTAVGYGIAEDGSKYWLVKNSWGTSWGEK 312
Query: 324 GYIRMKRNTGKPEGLCGINKMASYP 348
GYIRM+++ EGLCGI +SYP
Sbjct: 313 GYIRMQKDIEAKEGLCGIAMQSSYP 337
>gi|357458911|ref|XP_003599736.1| Cysteine proteinase [Medicago truncatula]
gi|357474719|ref|XP_003607644.1| Cysteine proteinase [Medicago truncatula]
gi|355488784|gb|AES69987.1| Cysteine proteinase [Medicago truncatula]
gi|355508699|gb|AES89841.1| Cysteine proteinase [Medicago truncatula]
Length = 340
Score = 327 bits (837), Expect = 6e-87, Method: Compositional matrix adjust.
Identities = 167/307 (54%), Positives = 204/307 (66%), Gaps = 6/307 (1%)
Query: 44 LIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRK-IKNYWLGLNEFADLRHE 102
L + E WM++ KVYE EK +RF IFKDN+ I+ N + Y L +N ADL +
Sbjct: 36 LQERHEQWMTEHGKVYEDAIEKEKRFMIFKDNVEFIESFNAADNQPYKLSVNHLADLTLD 95
Query: 103 EFKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFS 162
EFK G K ++ + F Y++V +P +VDWR KGAVT +K+QG CGSCWAFS
Sbjct: 96 EFKASRNGYKK---IDREFTTTSFKYENVTAIPAAVDWRVKGAVTPIKDQGQCGSCWAFS 152
Query: 163 TVAAVEGINQIVTGNLASLSEQELIDCDNT-YNNGCNGGLMDYAFQYIVSTGGLHKEEDY 221
TVAA EGINQI TG L SLSEQEL+DCD + GC GGLM+ F++I+ GG+ E +Y
Sbjct: 153 TVAATEGINQITTGKLVSLSEQELVDCDTKGEDQGCEGGLMEDGFEFIIKNGGITSETNY 212
Query: 222 PYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSG 281
PY +G+C T + V I GY VP NSE SLLKA+ANQP+SV+I+AS F FYS
Sbjct: 213 PYKAADGSCN-TATTTPVAKITGYEKVPVNSEKSLLKAVANQPISVSIDASDSSFMFYSS 271
Query: 282 GVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGI 341
G+Y G CGT+LDHGV AVGYGS G DY IVKNSWG WGEKGYIRM+R EGLCGI
Sbjct: 272 GIYTGECGTELDHGVTAVGYGSANGTDYWIVKNSWGTVWGEKGYIRMQRGIAAKEGLCGI 331
Query: 342 NKMASYP 348
+SYP
Sbjct: 332 AMDSSYP 338
>gi|302764466|ref|XP_002965654.1| hypothetical protein SELMODRAFT_230713 [Selaginella moellendorffii]
gi|300166468|gb|EFJ33074.1| hypothetical protein SELMODRAFT_230713 [Selaginella moellendorffii]
Length = 345
Score = 326 bits (836), Expect = 8e-87, Method: Compositional matrix adjust.
Identities = 161/311 (51%), Positives = 210/311 (67%), Gaps = 7/311 (2%)
Query: 42 DKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETN-RKIKNYWLGLNEFADLR 100
D L +++ W+ + K Y S E +RF+IFK+N+ +I+ N R+ ++ LGLN+FADL
Sbjct: 32 DPLWQVYQKWIQEHGKAYNSAHEYKKRFQIFKENVNYINSHNARRNNSHSLGLNKFADLT 91
Query: 101 HEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWA 160
+ EF+ +++G +R HE V D SVDWRKKG VT +K+QG CGSCWA
Sbjct: 92 NSEFRGLYVGR----LQRPAPFHEVGDIALVADTATSVDWRKKGGVTEIKDQGDCGSCWA 147
Query: 161 FSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEED 220
FS VAAVEG+ + TG L SLSEQEL+DCD T N GC+GG+MDYAFQY++ GG+ + +
Sbjct: 148 FSAVAAVEGLTFLSTGTLVSLSEQELVDCDTTVNQGCDGGIMDYAFQYMIRNGGITSQSN 207
Query: 221 YPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYS 280
YPY G C+ K + TING+ +P SE+ LL+A+ANQP+SVAIEA G+DFQ YS
Sbjct: 208 YPYRALRGACDKDKVKYHAATINGFQAIPPQSEELLLRAVANQPVSVAIEAGGQDFQLYS 267
Query: 281 GGVYDGHCGTQLDHGVAAVGYGS-TRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLC 339
GV+ G CG+ LDHGVA VGYG+ G Y +VKNSWG WGE GY+RM+R G G+C
Sbjct: 268 SGVFTGECGSNLDHGVAIVGYGTDAGGRQYWLVKNSWGSGWGESGYVRMERQ-GPGAGVC 326
Query: 340 GINKMASYPIK 350
GIN ASYP K
Sbjct: 327 GINLDASYPTK 337
>gi|224162986|ref|XP_002338508.1| predicted protein [Populus trichocarpa]
gi|222872535|gb|EEF09666.1| predicted protein [Populus trichocarpa]
Length = 306
Score = 326 bits (836), Expect = 9e-87, Method: Compositional matrix adjust.
Identities = 159/307 (51%), Positives = 209/307 (68%), Gaps = 5/307 (1%)
Query: 44 LIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKI-KNYWLGLNEFADLRHE 102
+ + E WM+++ +VY+ +E+ R+ IFK+N+ ID N + K+Y LG+N+FADL +E
Sbjct: 1 MYERHEQWMTQYGRVYKDDNERATRYSIFKENVARIDAFNSQTGKSYKLGVNQFADLTNE 60
Query: 103 EFKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFS 162
EFK K + + F Y++V +P +VDWRK+GAVT VK+QG CG CWAFS
Sbjct: 61 EFKASRNRFKGHMCSPQAGP---FRYENVSAVPSTVDWRKEGAVTPVKDQGQCGCCWAFS 117
Query: 163 TVAAVEGINQIVTGNLASLSEQELIDCDNT-YNNGCNGGLMDYAFQYIVSTGGLHKEEDY 221
VAA+EGIN++ TG L SLSEQE++DCD + GCNGGLMD AF++I GL E +Y
Sbjct: 118 AVAAMEGINKLTTGKLISLSEQEVVDCDTKGEDQGCNGGLMDDAFKFIEQNKGLTTEANY 177
Query: 222 PYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSG 281
PY +GTC K I G+ DVP NSE +L+KA+A QP+SVAI+A G DFQFYS
Sbjct: 178 PYKGTDGTCNTKKSAIHAAKITGFEDVPANSEAALMKAVAKQPVSVAIDAGGSDFQFYSS 237
Query: 282 GVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGI 341
G++ G C TQLDHGV AVGYG + G Y +VKNSWG +WGE+GYIRM+++ EGLCGI
Sbjct: 238 GIFTGSCDTQLDHGVTAVGYGVSDGSKYWLVKNSWGAQWGEEGYIRMQKDISAKEGLCGI 297
Query: 342 NKMASYP 348
ASYP
Sbjct: 298 AMQASYP 304
>gi|388512155|gb|AFK44139.1| unknown [Medicago truncatula]
Length = 340
Score = 326 bits (836), Expect = 1e-86, Method: Compositional matrix adjust.
Identities = 167/313 (53%), Positives = 208/313 (66%), Gaps = 6/313 (1%)
Query: 38 LTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKI-KNYWLGLNEF 96
L + L + E WMS++ K+Y+ EK +RF IFKDN+ I+ N K Y L +N
Sbjct: 30 LYESPSLQERHEQWMSEYGKLYKDAIEKEKRFMIFKDNVEFIESFNAADNKPYKLSVNHL 89
Query: 97 ADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCG 156
ADL +EFK G K ++ + F Y++V +P++VDWR KGAVT +K+QG CG
Sbjct: 90 ADLTLDEFKASRNGYKK---IDREFATTSFKYENVTAIPEAVDWRVKGAVTPIKDQGQCG 146
Query: 157 SCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNT-YNNGCNGGLMDYAFQYIVSTGGL 215
SCWAFSTVAA+EGINQI TG L SLSEQEL+DCD + GC GGLM+ F++I+ GG+
Sbjct: 147 SCWAFSTVAAIEGINQITTGKLISLSEQELVDCDTKGEDQGCEGGLMEDGFEFIIKNGGI 206
Query: 216 HKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRD 275
E +YPY +G+C + V I GY VP NSE SLLKA+ANQP+SV+I+AS
Sbjct: 207 TSETNYPYKAADGSCSAAT-TAPVAKITGYEKVPVNSEISLLKAVANQPISVSIDASDSS 265
Query: 276 FQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKP 335
F FYS G+Y G CGT+LDHGV AVGYGS G DY IVKNSWG WGEKGYIRM+R
Sbjct: 266 FMFYSSGIYTGECGTELDHGVTAVGYGSANGTDYWIVKNSWGTVWGEKGYIRMQRGIADK 325
Query: 336 EGLCGINKMASYP 348
EGLCGI +SYP
Sbjct: 326 EGLCGIAMDSSYP 338
>gi|356543124|ref|XP_003540013.1| PREDICTED: vignain-like [Glycine max]
gi|356543126|ref|XP_003540014.1| PREDICTED: vignain-like [Glycine max]
Length = 337
Score = 326 bits (835), Expect = 1e-86, Method: Compositional matrix adjust.
Identities = 165/302 (54%), Positives = 201/302 (66%), Gaps = 7/302 (2%)
Query: 49 ESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRK-IKNYWLGLNEFADLRHEEFKEM 107
E WM K+ KVY+ EK +R IFKDN+ I+ N + Y L +N AD +EEF
Sbjct: 39 EQWMKKYGKVYKDAAEKQKRLLIFKDNVEFIESFNAAGNRPYKLSINHLADQTNEEFVAS 98
Query: 108 FLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAV 167
G K + S F Y++V +P +VDWR+ GAVT VK+QG CGSCWAFSTVAA
Sbjct: 99 HNGYK----HKGSHSQTPFKYENVTGVPNAVDWRENGAVTAVKDQGQCGSCWAFSTVAAT 154
Query: 168 EGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEE 227
EGI QI T L SLSEQEL+DCD+ ++GC+GG M+ F++I+ GG+ E +YPY +
Sbjct: 155 EGIYQITTSMLMSLSEQELVDCDSV-DHGCDGGYMEGGFEFIIKNGGISSEANYPYTAVD 213
Query: 228 GTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGH 287
GTC+ K S I GY VP NSED+L KA+ANQP+SV I+A G FQFYS GV+ G
Sbjct: 214 GTCDANKEASPAAQIKGYETVPANSEDALQKAVANQPVSVTIDAGGSAFQFYSSGVFTGQ 273
Query: 288 CGTQLDHGVAAVGYGST-RGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMAS 346
CGTQLDHGV AVGYGST G Y IVKNSWG +WGE+GYIRM+R T EGLCGI AS
Sbjct: 274 CGTQLDHGVTAVGYGSTDDGTQYWIVKNSWGTQWGEEGYIRMQRGTDAQEGLCGIAMDAS 333
Query: 347 YP 348
YP
Sbjct: 334 YP 335
>gi|146215986|gb|ABQ10195.1| actinidin Act2d [Actinidia eriantha]
Length = 381
Score = 325 bits (834), Expect = 1e-86, Method: Compositional matrix adjust.
Identities = 169/352 (48%), Positives = 227/352 (64%), Gaps = 19/352 (5%)
Query: 1 MALSSQFKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYE 60
+++S F + L+ + I++S R +ND+++ ++ESW+ + K Y
Sbjct: 10 ISMSLLFFSTLLILSSALDIKNSVQR-------------TNDQVMAMYESWLVEQGKSYN 56
Query: 61 SLDEKLERFEIFKDNLRHIDETNRKI-KNYWLGLNEFADLRHEEFKEMFLGLKPDLARRK 119
SLDEK RFEIFK+NLR ID+ N ++Y LGLN FADL EE++ +LG K + K
Sbjct: 57 SLDEKEMRFEIFKENLRIIDDHNADANRSYSLGLNRFADLTDEEYRSTYLGFK---SGPK 113
Query: 120 DQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLA 179
+ + K V LP VDWR GAV VK+QG C SCWAFS VAAVEGIN+IVTGNL
Sbjct: 114 AKVSNRYVPKVGVVLPNYVDWRTVGAVVGVKDQGLCSSCWAFSAVAAVEGINKIVTGNLI 173
Query: 180 SLSEQELIDCDNTYNN-GCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESE 238
SLSEQEL+DC T GCN G M+ AFQ+I+ GG++ E++YPY ++G C+ +
Sbjct: 174 SLSEQELVDCGRTQRTRGCNRGYMNDAFQFIIDNGGINTEDNYPYTAQDGQCDWYRKNQR 233
Query: 239 VVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAA 298
VTI+ Y +P N+E L A+A QP++V +E+ G F+ Y+ G+Y G+CGT +DHGV
Sbjct: 234 YVTIDNYEQLPANNEWVLQNAVAYQPITVGLESEGGKFKLYTSGIYTGYCGTAIDHGVTI 293
Query: 299 VGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
VGYG+ RGLDY IVKNSWG WGE GYIR++RN G G CGI + SYP+K
Sbjct: 294 VGYGTERGLDYWIVKNSWGTNWGENGYIRIQRNIGGA-GKCGIAMVPSYPVK 344
>gi|356515036|ref|XP_003526207.1| PREDICTED: thiol protease SEN102-like [Glycine max]
Length = 336
Score = 325 bits (834), Expect = 1e-86, Method: Compositional matrix adjust.
Identities = 162/302 (53%), Positives = 203/302 (67%), Gaps = 8/302 (2%)
Query: 49 ESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRK-IKNYWLGLNEFADLRHEEFKEM 107
E WM+++ KVY+ EK +RF+IFKDN+ I+ N K Y LG+N ADL EEFK
Sbjct: 39 EQWMTEYGKVYKDAAEKDKRFQIFKDNVEFIESFNADGNKPYKLGVNHLADLTVEEFKAS 98
Query: 108 FLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAV 167
G K R + S F Y++V +P ++DWR KGAVT +K+QG CGSCWAFST+AA
Sbjct: 99 RNGFK----RPHEFSTTTFKYENVTAIPAAIDWRTKGAVTPIKDQGQCGSCWAFSTIAAT 154
Query: 168 EGINQIVTGNLASLSEQELIDCDNT-YNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIME 226
EGI+QI TG L SLSEQEL+DCD + GC GG M+ F++I+ GG+ E +YPY
Sbjct: 155 EGIHQITTGKLVSLSEQELVDCDTKGVDQGCEGGYMEDGFEFIIKNGGITSETNYPYKAV 214
Query: 227 EGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDG 286
+G C K S V I GY VP NSE +L KA+ANQP+SV+I+A G F FYS G+Y+G
Sbjct: 215 DGKC--NKATSPVAQIKGYEKVPPNSETALQKAVANQPVSVSIDADGAGFMFYSSGIYNG 272
Query: 287 HCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMAS 346
CGT+LDHGV AVGYG+ G DY IVKNSWG +WGEKGY+RM+R GLCGI +S
Sbjct: 273 ECGTELDHGVTAVGYGTANGTDYWIVKNSWGTQWGEKGYVRMQRGIAAKHGLCGIALDSS 332
Query: 347 YP 348
YP
Sbjct: 333 YP 334
>gi|356517348|ref|XP_003527349.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 325 bits (834), Expect = 1e-86, Method: Compositional matrix adjust.
Identities = 168/353 (47%), Positives = 223/353 (63%), Gaps = 17/353 (4%)
Query: 1 MALSSQFKTILIS--FCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKV 58
M +QF I ++ FC F F + + +D + + + E WM ++ KV
Sbjct: 1 MVAKNQFYQISLALLFCSGFLA-------FQVTCRTLQDAS----MYERHEEWMGRYAKV 49
Query: 59 YESLDEKLERFEIFKDNLRHIDETNRKI-KNYWLGLNEFADLRHEEFKEMFLGLKPDLAR 117
Y+ E+ RF+IFK+N+ +I+ N K Y LG+N+FADL +EEF K +
Sbjct: 50 YKDPQERERRFKIFKENVNYIEAFNNAANKPYTLGINQFADLTNEEFIAPRNRFKGHMCS 109
Query: 118 RKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGN 177
++ F Y++V +P +VDWR+KGAVT +K+QG CG CWAFS VAA EGI+ + G
Sbjct: 110 SITRT-TTFKYENVTAIPSTVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALSAGK 168
Query: 178 LASLSEQELIDCDNT-YNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGE 236
L SLSEQE++DCD + GC GG MD AF++I+ GL+ E +YPY +G C
Sbjct: 169 LISLSEQEVVDCDTKGEDQGCAGGFMDGAFKFIIQNHGLNNEPNYPYKAVDGKCNAKAAA 228
Query: 237 SEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGV 296
+ V TI GY DVP N+E +L KA+ANQP+SVAI+ASG DFQFY GV+ G CGT+LDHGV
Sbjct: 229 NHVATITGYEDVPVNNEKALQKAVANQPVSVAIDASGSDFQFYQSGVFTGSCGTELDHGV 288
Query: 297 AAVGYG-STRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
AVGYG S G +Y +VKNSWG +WGE+GYIRM+R EGLCGI MASYP
Sbjct: 289 TAVGYGVSADGTEYWLVKNSWGTEWGEEGYIRMQRGVKAEEGLCGIAMMASYP 341
>gi|356577813|ref|XP_003557017.1| PREDICTED: uncharacterized protein LOC100801364 [Glycine max]
Length = 890
Score = 325 bits (834), Expect = 1e-86, Method: Compositional matrix adjust.
Identities = 164/343 (47%), Positives = 224/343 (65%), Gaps = 15/343 (4%)
Query: 9 TILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLER 68
++ + C++F F + S +D + + + E WM+++ KVY+ E+ +R
Sbjct: 558 SLAMLLCMAFLA-------FQVTCRSLQDAS----MYERHEQWMTRYGKVYKDPQEREKR 606
Query: 69 FEIFKDNLRHIDETNRKI-KNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFS 127
F IFK+N+ +I+ N K Y L +N+FADL +EEF K + ++ F
Sbjct: 607 FRIFKENVNYIEAFNNAANKRYKLAINQFADLTNEEFIAPRNRFKGHMCSSIIRT-TTFK 665
Query: 128 YKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELI 187
Y++V +P +VDWR+KGAVT +K+QG CG CWAFS VAA EGI+ + +G L SLSEQEL+
Sbjct: 666 YENVTAVPSTVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALTSGKLISLSEQELV 725
Query: 188 DCDNT-YNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYH 246
DCD + GC GGLMD AF++++ GL+ E +YPY +G C + ++VVTI GY
Sbjct: 726 DCDTKGVDQGCEGGLMDDAFKFVIQNHGLNTEANYPYKGVDGKCNANEAANDVVTITGYE 785
Query: 247 DVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYG-STR 305
DVP N+E +L KA+ANQP+SVAI+ASG DFQFY GV+ G CGT+LDHGV AVGYG S
Sbjct: 786 DVPANNEKALQKAVANQPVSVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVSND 845
Query: 306 GLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
G +Y +VKNSWG +WGE+GYIRM+R EGLCGI ASYP
Sbjct: 846 GTEYWLVKNSWGTEWGEEGYIRMQRGVDSEEGLCGIAMQASYP 888
>gi|356577763|ref|XP_003556992.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 325 bits (834), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 168/353 (47%), Positives = 223/353 (63%), Gaps = 17/353 (4%)
Query: 1 MALSSQFKTILIS--FCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKV 58
M +QF I ++ FC F F + + +D + + + E WM ++ KV
Sbjct: 1 MVAKNQFYQISLALLFCSGFLT-------FQVTCRTLQDAS----MYERHEEWMGRYAKV 49
Query: 59 YESLDEKLERFEIFKDNLRHIDETNRKI-KNYWLGLNEFADLRHEEFKEMFLGLKPDLAR 117
Y+ E+ RF+IFK+N+ +I+ N K Y LG+N+FADL +EEF K +
Sbjct: 50 YKDPQERERRFKIFKENVNYIEAFNNAANKPYTLGINQFADLTNEEFIAPRNRFKGHMCS 109
Query: 118 RKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGN 177
++ F Y++V +P +VDWR+KGAVT +K+QG CG CWAFS VAA EGI+ + G
Sbjct: 110 SITRT-TTFKYENVTAIPSTVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALSAGK 168
Query: 178 LASLSEQELIDCDNT-YNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGE 236
L SLSEQE++DCD + GC GG MD AF++I+ GL+ E +YPY +G C
Sbjct: 169 LISLSEQEVVDCDTKGEDQGCAGGFMDGAFKFIIQNHGLNNEPNYPYKAVDGKCNAKAAA 228
Query: 237 SEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGV 296
+ V TI GY DVP N+E +L KA+ANQP+SVAI+ASG DFQFY GV+ G CGT+LDHGV
Sbjct: 229 NHVATITGYEDVPVNNEKALQKAVANQPVSVAIDASGSDFQFYQSGVFTGSCGTELDHGV 288
Query: 297 AAVGYG-STRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
AVGYG S G +Y +VKNSWG +WGE+GYIRM+R EGLCGI MASYP
Sbjct: 289 TAVGYGVSADGTEYWLVKNSWGTEWGEEGYIRMQRGVKAEEGLCGIAMMASYP 341
>gi|359359120|gb|AEV41026.1| putative cysteine protease [Oryza minuta]
Length = 464
Score = 325 bits (834), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 167/297 (56%), Positives = 203/297 (68%), Gaps = 9/297 (3%)
Query: 62 LDEKLERFEIFKDNLRHIDETNRKIK---NYWLGLNEFADLRHEEFKEMFLGLKPDLARR 118
+ E RF +F DNL+ +D N + LG+N FADL ++EF+ +LG P A R
Sbjct: 84 VGEYERRFRVFWDNLKFVDAHNAHADEHGGFRLGMNRFADLTNDEFRAAYLGTTP--AGR 141
Query: 119 KDQSHEDFSYKDVVDLPKSVDWRKKGAV-THVKNQGSCGSCWAFSTVAAVEGINQIVTGN 177
E + + V LP SVDWR KGAV + VKNQG CGSCWAFS VAAVEGIN+IVTG
Sbjct: 142 GRHVGEMYRHDGVEALPDSVDWRDKGAVVSPVKNQGQCGSCWAFSAVAAVEGINKIVTGE 201
Query: 178 LASLSEQELIDC-DNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGE 236
L SLSEQEL++C N N+GCNGG+MD AF +I GGL EEDYPY +G C++ K
Sbjct: 202 LVSLSEQELVECARNRGNSGCNGGIMDDAFAFITRNGGLDTEEDYPYTAMDGKCDLAKKS 261
Query: 237 SEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGV 296
+VV+I+G+ DVP+N E SL KA+A+QP+SVAI+A GR+FQ Y GV+ G CGT LDHGV
Sbjct: 262 RKVVSIDGFEDVPENDELSLQKAVAHQPVSVAIDAGGREFQLYDSGVFTGRCGTSLDHGV 321
Query: 297 AAVGYGS--TRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIKK 351
AVGYG+ G DY V+NSWGP WGE GYIRM+RN G CGI MASYPIKK
Sbjct: 322 VAVGYGTDAATGTDYWTVRNSWGPDWGENGYIRMERNVTARTGKCGIAMMASYPIKK 378
>gi|255564908|ref|XP_002523447.1| cysteine protease, putative [Ricinus communis]
gi|223537275|gb|EEF38906.1| cysteine protease, putative [Ricinus communis]
Length = 342
Score = 325 bits (834), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 172/352 (48%), Positives = 226/352 (64%), Gaps = 16/352 (4%)
Query: 1 MALSSQFKTILISF-CISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVY 59
MA S+ K + ++ + + +++R S D N++ E WM+K+ +VY
Sbjct: 1 MATVSENKLMFVALLVVGLWASQAWSR-------SLHDAAMNER----HEMWMAKYGRVY 49
Query: 60 ESLDEKLERFEIFKDNLRHIDETNR-KIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARR 118
+ EK RFEIF++N+ I+ N+ + Y L +NEFADL +EEFK G K
Sbjct: 50 KDNSEKERRFEIFRNNVEFIESFNKLGNRPYKLDINEFADLTNEEFKVSKNGYKRSSGVG 109
Query: 119 KDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNL 178
+ F Y +V +P S+DWR+ GAVT +K+QG CG CWAFS VAA+EGI ++ TG L
Sbjct: 110 LTE-KSSFRYANVTAVPTSMDWRQNGAVTPIKDQGQCGCCWAFSAVAAMEGITKLSTGKL 168
Query: 179 ASLSEQELIDCDNT-YNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGES 237
SLSEQEL+DCD + + GC GGLMD AF++I GGL E +YPY +GTC K +
Sbjct: 169 ISLSEQELVDCDTSGEDQGCEGGLMDDAFEFIKQNGGLTTEANYPYQGTDGTCNTNKAGN 228
Query: 238 EVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVA 297
+ I GY DVP NSED+LLKA+A+QP+SVAI+ASG FQFYSGGV+ G CGT+LDHGV
Sbjct: 229 DAAKITGYEDVPANSEDALLKAVASQPVSVAIDASGSAFQFYSGGVFTGDCGTELDHGVT 288
Query: 298 AVGYGST-RGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
AVGYG++ G Y +VKNSWG WGE GYIRM+R+ EGLCGI SYP
Sbjct: 289 AVGYGTSDDGTKYWLVKNSWGTSWGEDGYIRMERDIEAKEGLCGIAMQPSYP 340
>gi|356545063|ref|XP_003540965.1| PREDICTED: thiol protease SEN102-like [Glycine max]
Length = 361
Score = 325 bits (833), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 164/343 (47%), Positives = 224/343 (65%), Gaps = 15/343 (4%)
Query: 9 TILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLER 68
++ + C++F F + S +D + + + E WM+++ KVY+ E+ +R
Sbjct: 29 SLAMLLCMAFLA-------FQVTCRSLQDAS----MYERHEQWMTRYGKVYKDPQEREKR 77
Query: 69 FEIFKDNLRHIDETNRKI-KNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFS 127
F IFK+N+ +I+ N K Y L +N+FADL +EEF K + ++ F
Sbjct: 78 FRIFKENVNYIEAFNNAANKRYKLAINQFADLTNEEFIAPRNRFKGHMCSSIIRT-TTFK 136
Query: 128 YKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELI 187
Y++V +P +VDWR+KGAVT +K+QG CG CWAFS VAA EGI+ + +G L SLSEQEL+
Sbjct: 137 YENVTAVPSTVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALTSGKLISLSEQELV 196
Query: 188 DCDNT-YNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYH 246
DCD + GC GGLMD AF++++ GL+ E +YPY +G C + ++VVTI GY
Sbjct: 197 DCDTKGVDQGCEGGLMDDAFKFVIQNHGLNTEANYPYKGVDGKCNANEAANDVVTITGYE 256
Query: 247 DVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYG-STR 305
DVP N+E +L KA+ANQP+SVAI+ASG DFQFY GV+ G CGT+LDHGV AVGYG S
Sbjct: 257 DVPANNEKALQKAVANQPVSVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVSND 316
Query: 306 GLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
G +Y +VKNSWG +WGE+GYIRM+R EGLCGI ASYP
Sbjct: 317 GTEYWLVKNSWGTEWGEEGYIRMQRGVDSEEGLCGIAMQASYP 359
>gi|255568345|ref|XP_002525147.1| cysteine protease, putative [Ricinus communis]
gi|223535606|gb|EEF37274.1| cysteine protease, putative [Ricinus communis]
Length = 347
Score = 325 bits (832), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 158/304 (51%), Positives = 212/304 (69%), Gaps = 4/304 (1%)
Query: 48 FESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEM 107
++ W+ ++ + Y++ DE L RF I+ N++ I+ N + ++ L N+FADL ++EF +
Sbjct: 46 YDKWLEQYGRKYDTKDEYLLRFGIYHSNIQFIEYINSQNLSFKLTDNKFADLTNDEFNSI 105
Query: 108 FLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAV 167
+LG + +R++ SH +++ DLP +VDWR+ GAVT +K+QG CGSCWAFS VAAV
Sbjct: 106 YLGYQIRSYKRRNLSH---MHENSTDLPDAVDWRENGAVTPIKDQGQCGSCWAFSAVAAV 162
Query: 168 EGINQIVTGNLASLSEQELIDCD-NTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIME 226
EGIN+I TGNL SLSEQEL+DCD N N GCNGG M+ AF +I S GGL E DYPY
Sbjct: 163 EGINKIKTGNLVSLSEQELVDCDVNGDNKGCNGGFMEKAFTFIKSIGGLTTENDYPYKGT 222
Query: 227 EGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDG 286
+G+CE K ++ V I GY VP N+E+SL A++ QP+SVAI+ASG +FQ YS GV+ G
Sbjct: 223 DGSCEKAKTDNHAVIIGGYETVPANNENSLKVAVSKQPVSVAIDASGYEFQLYSEGVFSG 282
Query: 287 HCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMAS 346
+CG QL+HGV VGYG G Y +VKNSWG WGE GYIRMKR++ +G+CGI S
Sbjct: 283 YCGIQLNHGVTIVGYGDNNGQKYWLVKNSWGKGWGESGYIRMKRDSSDTKGMCGIAMEPS 342
Query: 347 YPIK 350
YPIK
Sbjct: 343 YPIK 346
>gi|2160175|gb|AAB60738.1| Strong similarity to Dianthus cysteine proteinase (gb|U17135)
[Arabidopsis thaliana]
Length = 416
Score = 325 bits (832), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 167/318 (52%), Positives = 211/318 (66%), Gaps = 11/318 (3%)
Query: 42 DKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKN--YWLGLNEFADL 99
D + +LF+ W K K Y S +E+ +R +IFKDN + + N I N Y L LN FADL
Sbjct: 24 DDISELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNL-ITNATYSLSLNAFADL 82
Query: 100 RHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCW 159
H EFK LGL A + + S V +P SVDWRKKGAVT+VK+QGSCG+CW
Sbjct: 83 THHEFKASRLGLSVS-APSVIMASKGQSLGGSVKVPDSVDWRKKGAVTNVKDQGSCGACW 141
Query: 160 AFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEE 219
+FS A+EGINQIVTG+L SLSEQELIDCD +YN GCNGGLMDYAF++++ G+ E+
Sbjct: 142 SFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNHGIDTEK 201
Query: 220 DYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFY 279
DYPY +GTC+ K + +VVTI+ Y V N E +L++A+A QP+SV I S R FQ Y
Sbjct: 202 DYPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALMEAVAAQPVSVGICGSERAFQLY 261
Query: 280 SG-------GVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNT 332
S G++ G C T LDH V VGYGS G+DY IVKNSWG WG G++ M+RNT
Sbjct: 262 SSKFYLLMQGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKSWGMDGFMHMQRNT 321
Query: 333 GKPEGLCGINKMASYPIK 350
+G+CGIN +ASYPIK
Sbjct: 322 ENSDGVCGINMLASYPIK 339
>gi|297843784|ref|XP_002889773.1| hypothetical protein ARALYDRAFT_471096 [Arabidopsis lyrata subsp.
lyrata]
gi|297335615|gb|EFH66032.1| hypothetical protein ARALYDRAFT_471096 [Arabidopsis lyrata subsp.
lyrata]
Length = 439
Score = 325 bits (832), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 166/309 (53%), Positives = 208/309 (67%), Gaps = 6/309 (1%)
Query: 46 DLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKN--YWLGLNEFADLRHEE 103
+LF+ W + K Y S +E+ +R +IFKDN + + N I N Y L LN FADL H E
Sbjct: 30 ELFDDWCQRHGKTYGSEEERQQRIQIFKDNHDFVTQHNL-ITNATYSLSLNAFADLTHHE 88
Query: 104 FKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFST 163
FK LGL A + + S +P SVDWRKKGAVT+VK+QGSCG+CW+FS
Sbjct: 89 FKASRLGLSVS-ASSLIMASKGQSLGGNAKVPDSVDWRKKGAVTNVKDQGSCGACWSFSA 147
Query: 164 VAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPY 223
A+EGINQIVTG+L SLSEQELIDCD +YN GCNGGLMDYAF++++ G+ E+DYPY
Sbjct: 148 TGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNHGIDTEKDYPY 207
Query: 224 IMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYS--G 281
+GTC+ K + +VVTI+ Y V N E +L +A+A QP+SV I S R FQ YS
Sbjct: 208 QERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALREAVAAQPVSVGICGSERAFQLYSRVS 267
Query: 282 GVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGI 341
G++ G C T LDH V VGYGS G+DY IVKNSWG WG G++ M+RNTG EG+CGI
Sbjct: 268 GIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKSWGMDGFMHMQRNTGNSEGICGI 327
Query: 342 NKMASYPIK 350
N +ASYPIK
Sbjct: 328 NMLASYPIK 336
>gi|356509992|ref|XP_003523725.1| PREDICTED: oryzain alpha chain-like [Glycine max]
Length = 439
Score = 325 bits (832), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 160/313 (51%), Positives = 209/313 (66%), Gaps = 9/313 (2%)
Query: 46 DLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKI------KNYWLGLNEFADL 99
+LFE W + K Y S +EKL R ++F+DN + + N+ +Y L LN FADL
Sbjct: 31 ELFEKWCKEHSKTYSSEEEKLYRLKVFEDNYAFVAQHNQNANNNNNNSSYTLSLNAFADL 90
Query: 100 RHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCW 159
H EFK LGL L R K ++ +D++ +P +DWR+ GAVT VK+Q SCG+CW
Sbjct: 91 THHEFKTTRLGLPLTLLRFKRPQNQQ--SRDLLHIPSQIDWRQSGAVTPVKDQASCGACW 148
Query: 160 AFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEE 219
AFS A+EGIN+IVTG+L SLSEQELIDCD +YN+GC GGLMD+A+Q+++ G+ E+
Sbjct: 149 AFSATGAIEGINKIVTGSLVSLSEQELIDCDTSYNSGCGGGLMDFAYQFVIDNKGIDTED 208
Query: 220 DYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFY 279
DYPY + +C K + VTI Y DVP SE+ +LKA+A+QP+SV I S R+FQ Y
Sbjct: 209 DYPYQARQRSCSKDKLKRRAVTIEDYVDVPP-SEEEILKAVASQPVSVGICGSEREFQLY 267
Query: 280 SGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLC 339
S G++ G C T LDH V VGYGS G+DY IVKNSWG WG GYI M RN+G +G+C
Sbjct: 268 SKGIFTGPCSTFLDHAVLIVGYGSENGVDYWIVKNSWGKYWGMNGYIHMIRNSGNSKGIC 327
Query: 340 GINKMASYPIKKK 352
GIN +ASYP+K K
Sbjct: 328 GINTLASYPVKTK 340
>gi|26452046|dbj|BAC43113.1| putative cysteine proteinase RD21A precursor [Arabidopsis thaliana]
Length = 362
Score = 325 bits (832), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 163/309 (52%), Positives = 210/309 (67%), Gaps = 7/309 (2%)
Query: 47 LFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNR-KIKNYWLGLNEFADLRHEEFK 105
++E W+ + K Y L EK RF+IFKDNL+ +DE N + + +GL FADL +EEF+
Sbjct: 43 MYEQWLVENRKNYNGLGEKERRFKIFKDNLKFVDEHNSVPDRTFEVGLTRFADLTNEEFR 102
Query: 106 EMFLGLKPDLARRKDQ-SHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTV 164
++L + + R KD E + YK+ LP VDWR GAV VK+QG+CGSCWAFS V
Sbjct: 103 AIYL--RKKMERNKDSVKTERYLYKEGDVLPDEVDWRANGAVVSVKDQGNCGSCWAFSAV 160
Query: 165 AAVEGINQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIVSTGGLHKEEDYPY 223
AVEGINQI TG L SLSEQEL+DCD + N GC+GG+M+YAF++I+ GG+ ++DYPY
Sbjct: 161 GAVEGINQITTGELISLSEQELVDCDRGFVNAGCDGGIMNYAFEFIMKNGGIETDQDYPY 220
Query: 224 IMEE-GTCEMTKGE-SEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSG 281
+ G C K + VVTI+GY DVP++ E SL KA+A+QP+SVAIEAS + FQ Y
Sbjct: 221 NANDLGLCNADKNNNTRVVTIDGYEDVPRDDEKSLKKAVAHQPVSVAIEASSQAFQLYKS 280
Query: 282 GVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGI 341
GV G CG LDHGV VGYGST G DY I++NSWG WG+ GY++++RN P G CGI
Sbjct: 281 GVMTGTCGISLDHGVVVVGYGSTSGEDYWIIRNSWGLNWGDSGYVKLQRNIDDPFGKCGI 340
Query: 342 NKMASYPIK 350
M SYP K
Sbjct: 341 AMMPSYPTK 349
>gi|30685308|ref|NP_566634.2| putative cysteine proteinase [Arabidopsis thaliana]
gi|30315949|sp|Q9LT77.1|CPR1_ARATH RecName: Full=Probable cysteine proteinase At3g19400; Flags:
Precursor
gi|11994462|dbj|BAB02464.1| cysteine proteinase [Arabidopsis thaliana]
gi|332642715|gb|AEE76236.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 362
Score = 324 bits (831), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 163/309 (52%), Positives = 210/309 (67%), Gaps = 7/309 (2%)
Query: 47 LFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNR-KIKNYWLGLNEFADLRHEEFK 105
++E W+ + K Y L EK RF+IFKDNL+ +DE N + + +GL FADL +EEF+
Sbjct: 43 MYEQWLVENRKNYNGLGEKERRFKIFKDNLKFVDEHNSVPDRTFEVGLTRFADLTNEEFR 102
Query: 106 EMFLGLKPDLARRKDQ-SHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTV 164
++L + + R KD E + YK+ LP VDWR GAV VK+QG+CGSCWAFS V
Sbjct: 103 AIYL--RKKMERTKDSVKTERYLYKEGDVLPDEVDWRANGAVVSVKDQGNCGSCWAFSAV 160
Query: 165 AAVEGINQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIVSTGGLHKEEDYPY 223
AVEGINQI TG L SLSEQEL+DCD + N GC+GG+M+YAF++I+ GG+ ++DYPY
Sbjct: 161 GAVEGINQITTGELISLSEQELVDCDRGFVNAGCDGGIMNYAFEFIMKNGGIETDQDYPY 220
Query: 224 IMEE-GTCEMTKGE-SEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSG 281
+ G C K + VVTI+GY DVP++ E SL KA+A+QP+SVAIEAS + FQ Y
Sbjct: 221 NANDLGLCNADKNNNTRVVTIDGYEDVPRDDEKSLKKAVAHQPVSVAIEASSQAFQLYKS 280
Query: 282 GVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGI 341
GV G CG LDHGV VGYGST G DY I++NSWG WG+ GY++++RN P G CGI
Sbjct: 281 GVMTGTCGISLDHGVVVVGYGSTSGEDYWIIRNSWGLNWGDSGYVKLQRNIDDPFGKCGI 340
Query: 342 NKMASYPIK 350
M SYP K
Sbjct: 341 AMMPSYPTK 349
>gi|357483847|ref|XP_003612210.1| Cysteine proteinase [Medicago truncatula]
gi|355513545|gb|AES95168.1| Cysteine proteinase [Medicago truncatula]
Length = 344
Score = 324 bits (831), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 170/354 (48%), Positives = 222/354 (62%), Gaps = 18/354 (5%)
Query: 1 MALSSQFKTILIS--FCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKV 58
MA ++Q I ++ FC+ + +R + + + E WM+ + KV
Sbjct: 1 MAANNQLYHISLALVFCLGLWAIQVTSRTLQ-----------DGSMHERHERWMNHYGKV 49
Query: 59 YESLDEKLERFEIFKDNLRHIDETNRKIKN--YWLGLNEFADLRHEEFKEMFLGLKPDLA 116
Y+ E+ +RF+IF +N+++I+ N N Y LG+N+FADL +EEF K +
Sbjct: 50 YKDHQEREKRFKIFTENMKYIEAFNNGDNNESYKLGINQFADLTNEEFVASRNKFKGHMC 109
Query: 117 RRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTG 176
++ F Y++V +P +VDWRKKGAVT VKNQG CG CWAFS VAA EGI+++ TG
Sbjct: 110 SSIIRT-TTFKYENVSAIPSTVDWRKKGAVTPVKNQGQCGCCWAFSAVAATEGIHKLSTG 168
Query: 177 NLASLSEQELIDCDNT-YNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKG 235
L SLSEQEL+DCD + GC GGLMD AF++I+ GL+ E YPY +GTC K
Sbjct: 169 KLVSLSEQELVDCDTKGVDQGCEGGLMDDAFKFIIQNHGLNTEAQYPYQGVDGTCNANKA 228
Query: 236 ESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHG 295
+ TI GY DVP N+E +L KA+ANQP+SVAI+ASG DFQFY GV+ G CGT+LDHG
Sbjct: 229 SIQATTITGYEDVPANNEQALQKAVANQPISVAIDASGSDFQFYKSGVFTGSCGTELDHG 288
Query: 296 VAAVGYG-STRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
V AVGYG S G Y +VKNSWG WGE+GYI M+R EGLCGI ASYP
Sbjct: 289 VTAVGYGVSNDGTKYWLVKNSWGTDWGEEGYIMMQRGVEAAEGLCGIAMQASYP 342
>gi|357474573|ref|XP_003607571.1| Cysteine proteinase EP-B [Medicago truncatula]
gi|34329348|gb|AAQ63885.1| putative cysteine proteinase [Medicago truncatula]
gi|355508626|gb|AES89768.1| Cysteine proteinase EP-B [Medicago truncatula]
Length = 345
Score = 324 bits (831), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 171/355 (48%), Positives = 221/355 (62%), Gaps = 19/355 (5%)
Query: 1 MALSSQFK---TILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEK 57
MA ++Q ++ + FC+ F +R L + + + E WM + K
Sbjct: 1 MAANNQLYHSISLALFFCLGLFAIQVTSRT----------LQDDSIIYEKHEQWMVHYGK 50
Query: 58 VYESLDEKLERFEIFKDNLRHIDETNRKIKN--YWLGLNEFADLRHEEFKEMFLGLKPDL 115
VY+ L E+ R +IFK+N+ +I+ +N N Y LG+N+FADL +EEF K +
Sbjct: 51 VYKDLQERENRLKIFKENVNYIEASNNAGNNKLYKLGINQFADLTNEEFIASRNKFKGHM 110
Query: 116 ARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVT 175
++ F Y++ +P +VDWRKKGAVT VKNQG CG CWAFS VAA EGI+++ T
Sbjct: 111 CSSITKT-STFKYENA-SVPSTVDWRKKGAVTPVKNQGQCGCCWAFSAVAATEGIHKLST 168
Query: 176 GNLASLSEQELIDCDNT-YNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTK 234
G L SLSEQEL+DCD + GC GGLMD AF++I+ GL+ E YPY +GTC K
Sbjct: 169 GKLVSLSEQELVDCDTKGVDQGCEGGLMDDAFKFIIQNHGLNTEAQYPYQGVDGTCSANK 228
Query: 235 GESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDH 294
VTI GY DVP N+E +L KA+ANQP+SVAI+ASG DFQFY GV+ G CGT+LDH
Sbjct: 229 ASIHAVTITGYEDVPANNEQALQKAVANQPISVAIDASGSDFQFYKSGVFTGSCGTELDH 288
Query: 295 GVAAVGYG-STRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
GV AVGYG G Y +VKNSWG WGE+GYI+M+R EGLCGI ASYP
Sbjct: 289 GVTAVGYGVGNDGTKYWLVKNSWGTDWGEEGYIKMQRGVDAAEGLCGIAMEASYP 343
>gi|144905112|dbj|BAF56429.1| cysteine proteinase [Lotus japonicus]
Length = 341
Score = 324 bits (831), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 164/307 (53%), Positives = 205/307 (66%), Gaps = 6/307 (1%)
Query: 44 LIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRK-IKNYWLGLNEFADLRHE 102
LI+ E WM+K++KVY+ EK +RF IFKDN+ I+ N K Y LG+N ADL E
Sbjct: 37 LIERHEQWMAKYDKVYKDAAEKEKRFLIFKDNVEFIESFNAAGNKPYKLGVNHLADLTIE 96
Query: 103 EFKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFS 162
EFK GLK + F Y++V +P SVDWRKKGAVT +K+QG CGSCWAFS
Sbjct: 97 EFKASRNGLKRSYDYEVGTT--SFKYENVTAIPASVDWRKKGAVTPIKDQGQCGSCWAFS 154
Query: 163 TVAAVEGINQIVTGNLASLSEQELIDCDNT-YNNGCNGGLMDYAFQYIVSTGGLHKEEDY 221
TVAA EGI++I TG L SLSEQEL+DCD + GC GG M+ F++I+ GG+ E +Y
Sbjct: 155 TVAATEGIHKISTGKLVSLSEQELVDCDRKGTDQGCEGGYMEDGFEFIIKNGGITTEANY 214
Query: 222 PYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSG 281
PY +G+C+ + I GY VP NSE +LLKA+ANQP+SV+I+A+ F FYS
Sbjct: 215 PYKAVDGSCK--NATAPAAQIKGYEKVPVNSEKALLKAVANQPVSVSIDAADGSFMFYSS 272
Query: 282 GVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGI 341
G++ G CGT+LDHGV AVGYG G DY IVKNSWG WGE+GYIRM+R EGLCGI
Sbjct: 273 GIFTGECGTELDHGVTAVGYGRANGTDYWIVKNSWGTVWGEQGYIRMQRGIAAKEGLCGI 332
Query: 342 NKMASYP 348
+SYP
Sbjct: 333 AMDSSYP 339
>gi|144905116|dbj|BAF56430.1| cysteine proteinase [Lotus japonicus]
Length = 341
Score = 324 bits (831), Expect = 4e-86, Method: Compositional matrix adjust.
Identities = 162/310 (52%), Positives = 207/310 (66%), Gaps = 20/310 (6%)
Query: 49 ESWMSKFEKVYESLDEKLERFEIFKDNLRHIDE-TNRKIKNYWLGLNEFADLRHEEFKEM 107
E WM+++ KVY+ EK R +IFK+N++ I+ N K+Y LG+N+FADL +EEFK
Sbjct: 40 EQWMAQYGKVYKDSYEKELRSKIFKENVQRIEAFNNAGNKSYKLGINQFADLTNEEFK-- 97
Query: 108 FLGLKPDLARRKDQSH--------EDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCW 159
AR + + H F Y+ V +P S+DWR+KGAVT +K+QG CG CW
Sbjct: 98 --------ARNRFKGHMCSNSTRTPTFKYEHVTSVPASLDWRQKGAVTPIKDQGQCGCCW 149
Query: 160 AFSTVAAVEGINQIVTGNLASLSEQELIDCDNT-YNNGCNGGLMDYAFQYIVSTGGLHKE 218
AFS VAA EGI ++ TG L SLSEQEL+DCD + GC GGLMD AF++I+ GL+ E
Sbjct: 150 AFSAVAATEGITKLSTGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFIMQNKGLNTE 209
Query: 219 EDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQF 278
YPY + TC + +I G+ DVP NSE +LLKA+ANQP+SVAI+ASG +FQF
Sbjct: 210 AKYPYQGVDATCNANAEAKDAASIKGFEDVPANSESALLKAVANQPISVAIDASGSEFQF 269
Query: 279 YSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGL 338
YS GV+ G CGT+LDHGV AVGYGS G Y +VKNSWG +WGE+GYIRM+R+ EGL
Sbjct: 270 YSSGVFTGSCGTELDHGVTAVGYGSDGGTKYWLVKNSWGEQWGEQGYIRMQRDVAAEEGL 329
Query: 339 CGINKMASYP 348
CG ASYP
Sbjct: 330 CGFAMQASYP 339
>gi|225458143|ref|XP_002280937.1| PREDICTED: cysteine proteinase RD21a [Vitis vinifera]
gi|302142569|emb|CBI19772.3| unnamed protein product [Vitis vinifera]
Length = 436
Score = 324 bits (831), Expect = 4e-86, Method: Compositional matrix adjust.
Identities = 155/306 (50%), Positives = 201/306 (65%), Gaps = 4/306 (1%)
Query: 46 DLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIK-NYWLGLNEFADLRHEEF 104
DLFE+W ++ K Y S +EK R ++F++N + + N +Y L LN FADL H EF
Sbjct: 27 DLFEAWCEQYGKTYSSEEEKASRLKVFEENHAFVTQHNSMANASYTLALNAFADLTHHEF 86
Query: 105 KEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTV 164
K LG P A+ + +P +VDWRK GAVT VK+QG+CG CW+FST
Sbjct: 87 KASRLGFSPGRAQSIRSVGTPVQE---LHVPPAVDWRKSGAVTGVKDQGNCGGCWSFSTT 143
Query: 165 AAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYI 224
A+EGIN+IVTG+L SLSEQEL+DCD +YN+GC GGLMDYA+Q+++ G+ E DYPY+
Sbjct: 144 GAIEGINKIVTGSLVSLSEQELVDCDRSYNSGCEGGLMDYAYQFVIKNQGIDSEADYPYV 203
Query: 225 MEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVY 284
+ C K + +VTI+GY D+P N E LL+ +A QP+SV I S + FQ YS GVY
Sbjct: 204 GMDKPCNKEKLKKHIVTIDGYTDIPPNDEKQLLQVVAKQPVSVGICGSEKTFQLYSKGVY 263
Query: 285 DGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKM 344
G C + LDH V VGYG+ G+D+ IVKNSWG WG +GYI M RN G EG+CGIN +
Sbjct: 264 TGPCSSTLDHAVLIVGYGTEDGVDFWIVKNSWGEHWGMRGYIHMLRNNGTAEGICGINML 323
Query: 345 ASYPIK 350
ASYP K
Sbjct: 324 ASYPAK 329
>gi|226505708|ref|NP_001141813.1| uncharacterized protein LOC100273952 precursor [Zea mays]
gi|194706024|gb|ACF87096.1| unknown [Zea mays]
gi|413945958|gb|AFW78607.1| hypothetical protein ZEAMMB73_489507 [Zea mays]
Length = 460
Score = 324 bits (830), Expect = 4e-86, Method: Compositional matrix adjust.
Identities = 160/318 (50%), Positives = 204/318 (64%), Gaps = 15/318 (4%)
Query: 48 FESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKI--------------KNYWLGL 93
F++W ++ K Y + +E+ R +F DN + N + +Y L L
Sbjct: 36 FDAWCAEHGKAYATPEERAARLAVFADNAAFVAAHNARAGANAAGGGGGGAAPPSYTLAL 95
Query: 94 NEFADLRHEEFKEMFLG-LKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQ 152
N FADL HEEF+ LG + P A R + + +P ++DWRK GAVT VK+Q
Sbjct: 96 NAFADLTHEEFRAARLGRIAPGAALRSRAAPVYWGLGGGAAVPDALDWRKSGAVTKVKDQ 155
Query: 153 GSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVST 212
GSCG+CW+FS A+EGIN+I TG+L SLSEQELIDCD +YN+GC GGLMDYA+++++
Sbjct: 156 GSCGACWSFSATGAMEGINKIKTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYKFVIKN 215
Query: 213 GGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEAS 272
GG+ EEDYPY +GTC K + VVTI+GY DVP N ED LL+A+A QP+SV I S
Sbjct: 216 GGIDTEEDYPYREADGTCNKNKLKKRVVTIDGYTDVPSNKEDLLLQAVAQQPVSVGICGS 275
Query: 273 GRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNT 332
R FQ Y G++DG C T LDH V VGYGS G DY IVKNSWG WG KGY+ M RNT
Sbjct: 276 ARAFQLYYQGIFDGPCPTSLDHAVLIVGYGSEGGKDYWIVKNSWGESWGMKGYMHMHRNT 335
Query: 333 GKPEGLCGINKMASYPIK 350
G +G+CGIN MAS+P K
Sbjct: 336 GDSKGVCGINMMASFPTK 353
>gi|449460678|ref|XP_004148072.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Cucumis
sativus]
Length = 317
Score = 324 bits (830), Expect = 5e-86, Method: Compositional matrix adjust.
Identities = 169/332 (50%), Positives = 215/332 (64%), Gaps = 21/332 (6%)
Query: 24 FARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETN 83
A D+S+ D+ D ++ WM K+ + Y+S +E RF I++ N+++ID N
Sbjct: 1 MAMDYSLGSSCSSDIQ------DRYQKWMDKYGRQYKSREEWERRFTIYQANVQYIDNFN 54
Query: 84 RKIKNYWLGLNEFADLRHEEFKEMFLGLK----PDLARRKDQSHEDFSYKDVVDLPKSVD 139
++ L N FADL +EEFK +LG K PD F Y ++V+LP +VD
Sbjct: 55 SMNHSHTLAENNFADLTNEEFKATYLGYKTVSIPDTC---------FRYGNMVNLPTNVD 105
Query: 140 WRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTY-NNGCN 198
WR++GAVT +KNQG CGSCWAFS VAAVEGIN+I G L SLSEQEL+DCD T N GCN
Sbjct: 106 WRQEGAVTPIKNQGQCGSCWAFSAVAAVEGINKIKAGKLISLSEQELVDCDVTSGNQGCN 165
Query: 199 GGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLK 258
GG M AF++I T GL E +YPY E C K + + V+I+GY VP N E SL
Sbjct: 166 GGYMYKAFEFIKRT-GLTTEIEYPYQGAESACNEQKEKYQFVSISGYEKVPVNDEKSLKA 224
Query: 259 ALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGP 318
A+ANQP+SVAI+A G +FQFYSGG++ G+CG QL+HGVA VGYG T Y +VKNSWG
Sbjct: 225 AVANQPVSVAIDAEGNNFQFYSGGIFSGNCGNQLNHGVAIVGYGETSNQAYWLVKNSWGT 284
Query: 319 KWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
WGE GYIRMKR++ +G CGI MASYP K
Sbjct: 285 DWGESGYIRMKRDSTDRQGTCGIAMMASYPTK 316
>gi|156142226|gb|ABU51882.1| ervatamin-C precursor [Tabernaemontana divaricata]
Length = 365
Score = 323 bits (829), Expect = 6e-86, Method: Compositional matrix adjust.
Identities = 168/348 (48%), Positives = 229/348 (65%), Gaps = 16/348 (4%)
Query: 9 TILISFCISFFIRSSFARDFSIVGYSPEDLT---SNDKLIDLFESWMSKFEKVYESLDEK 65
T+ I + F S+A D S + Y + + +++++ +++E W++K +KVY L E
Sbjct: 3 TLFIISILLFLASFSYAMDISTIEYKYDKSSAWRTDEEVKEIYELWLAKHDKVYSGLVEY 62
Query: 66 LERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQS--- 122
+RFEIFKDNL+ IDE N + Y +GL + DL +EEF+ ++LG + D R ++
Sbjct: 63 EKRFEIFKDNLKFIDEHNSENHTYKMGLTPYTDLTNEEFQAIYLGTRSDTIHRLKRTINI 122
Query: 123 HEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLS 182
E ++Y+ +LP+ +DWRKKGAVT VKNQG CGSCWAFSTV+ VE INQI TGNL SLS
Sbjct: 123 SERYAYEAGDNLPEQIDWRKKGAVTPVKNQGKCGSCWAFSTVSTVESINQIRTGNLISLS 182
Query: 183 EQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTI 242
EQ+L+DC N N+GC GG YA+QYI+ GG+ E +YPY +G C K +VV I
Sbjct: 183 EQQLVDC-NKKNHGCKGGAFVYAYQYIIDNGGIDTEANYPYKAVQGPCRAAK---KVVRI 238
Query: 243 NGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYG 302
+GY VP +E++L KA+A+QP VAI+AS + FQ Y G++ G CGT+L+HGV VGY
Sbjct: 239 DGYKGVPHCNENALKKAVASQPSVVAIDASSKQFQHYKSGIFSGPCGTKLNHGVVIVGYW 298
Query: 303 STRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
DY IV+NSWG WGE+GYIRMKR G GLCGI ++ YP K
Sbjct: 299 K----DYWIVRNSWGRYWGEQGYIRMKRVGGC--GLCGIARLPYYPTK 340
>gi|356515048|ref|XP_003526213.1| PREDICTED: vignain-like [Glycine max]
Length = 350
Score = 323 bits (829), Expect = 6e-86, Method: Compositional matrix adjust.
Identities = 164/306 (53%), Positives = 205/306 (66%), Gaps = 8/306 (2%)
Query: 49 ESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRK-IKNYWLGLNEFADLRHEEFKEM 107
E WM K+ KVY+ EK +R IFKDN+ I+ N K Y L +N AD +EEF
Sbjct: 39 EQWMKKYGKVYKDAAEKQKRLLIFKDNVEFIESFNAAGNKPYKLSINHLADQTNEEFVAS 98
Query: 108 FLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAV 167
G K + S F Y +V D+P +VDWR+ GAVT VK+QG CGSCWAFSTVAA
Sbjct: 99 HNGYK----YKGSHSQTPFKYGNVTDIPTAVDWRQNGAVTAVKDQGQCGSCWAFSTVAAT 154
Query: 168 EGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEE 227
EGI QI TG L SLSEQEL+DCD+ ++GC+GGLM+ F++I+ GG+ E +YPY +
Sbjct: 155 EGIYQISTGMLMSLSEQELVDCDSV-DHGCDGGLMEDGFEFIIKNGGISSEANYPYTAVD 213
Query: 228 GTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGH 287
GTC+ +K S I GY VP NSE++L +A+ANQP+SV+I+A G FQFYS GV+ G
Sbjct: 214 GTCDASKEASPAAQIKGYETVPANSEEALQQAVANQPVSVSIDAGGSGFQFYSSGVFTGQ 273
Query: 288 CGTQLDHGVAAVGYGSTR--GLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMA 345
CGTQLDHGV VGYG+T +Y IVKNSWG +WGE+GYIRM+R EGLCGI A
Sbjct: 274 CGTQLDHGVTVVGYGTTDDGTHEYWIVKNSWGTQWGEEGYIRMQRGIDAQEGLCGIAMDA 333
Query: 346 SYPIKK 351
SYP+ K
Sbjct: 334 SYPMGK 339
>gi|356577811|ref|XP_003557016.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 323 bits (829), Expect = 6e-86, Method: Compositional matrix adjust.
Identities = 164/346 (47%), Positives = 223/346 (64%), Gaps = 21/346 (6%)
Query: 9 TILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLER 68
++ + C++F F + S +D + + + E WM+++ KVY+ E+ +R
Sbjct: 11 SLAMLLCMAFLA-------FQVTCRSLQDAS----MYERHEQWMTRYGKVYKDPQEREKR 59
Query: 69 FEIFKDNLRHIDETNRKI-KNYWLGLNEFADLRHEEF---KEMFLGLKPDLARRKDQSHE 124
F IFK+N+ +I+ N K Y L +N+FADL +EEF + F G R
Sbjct: 60 FRIFKENVNYIEAFNNAANKRYKLAINQFADLTNEEFIAPRNRFKGHMCSSIIRTTT--- 116
Query: 125 DFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQ 184
F Y++V +P +VDWR+KGAVT +K+QG CG CWAFS VAA EGI+ + +G L SLSEQ
Sbjct: 117 -FKYENVTAVPSTVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALTSGKLISLSEQ 175
Query: 185 ELIDCDNT-YNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTIN 243
EL+DCD + GC GGLMD AF++++ GL+ E +YPY +G C + + ++ TI
Sbjct: 176 ELVDCDTKGVDQGCEGGLMDDAFKFVIQNHGLNTEANYPYKGVDGKCNVNEAANDAATIT 235
Query: 244 GYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYG- 302
GY DVP N+E +L KA+ANQP+SVAI+ASG DFQFY GV+ G CGT+LDHGV AVGYG
Sbjct: 236 GYEDVPANNEKALQKAVANQPVSVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGV 295
Query: 303 STRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
S G +Y +VKNSWG +WGE+GYIRM+R EGLCGI ASYP
Sbjct: 296 SNDGTEYWLVKNSWGTEWGEEGYIRMQRGVNSEEGLCGIAMQASYP 341
>gi|255568297|ref|XP_002525123.1| cysteine protease, putative [Ricinus communis]
gi|223535582|gb|EEF37250.1| cysteine protease, putative [Ricinus communis]
Length = 349
Score = 323 bits (828), Expect = 7e-86, Method: Compositional matrix adjust.
Identities = 165/303 (54%), Positives = 205/303 (67%), Gaps = 5/303 (1%)
Query: 49 ESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRK-IKNYWLGLNEFADLRHEEFKEM 107
E WM+K KVY+ EKL RF+IFK N+ I+ N K+Y LG+N+FADL +EEF+
Sbjct: 40 EKWMAKHGKVYKDDKEKLRRFQIFKSNVVFIESFNTAGNKSYMLGINKFADLTNEEFRAF 99
Query: 108 FLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAV 167
+ G K L + + F Y++V LP S+DWR KGAVT +K+QG CGSCWAFS VAA
Sbjct: 100 WNGYKRPLGASRKIT--PFKYENVTALPSSIDWRSKGAVTPIKDQGVCGSCWAFSAVAAT 157
Query: 168 EGINQIVTGNLASLSEQELIDCD-NTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIME 226
EGI+++ TG L SLSEQEL+DCD + GC GGLM AF++I GG+ E +YPY
Sbjct: 158 EGIHKLRTGKLVSLSEQELVDCDVKGQDKGCQGGLMVDAFKFIKRHGGMTSEANYPYQGR 217
Query: 227 EGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDG 286
+G C+ K S V I GY VP+NSE +LLKA+ANQP+SVAI+A FQFY G++ G
Sbjct: 218 DGKCDTKKEASRAVKITGYQAVPKNSEAALLKAVANQPVSVAIDAGSLSFQFYRSGIFTG 277
Query: 287 HCGTQLDHGVAAVGYG-STRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMA 345
CG ++HGVAAVGYG S G Y IVKNSWG +WGEKGYIRMKR+ EGLCGI
Sbjct: 278 ICGKDINHGVAAVGYGRSNSGSKYWIVKNSWGTEWGEKGYIRMKRDVRSKEGLCGIAMEC 337
Query: 346 SYP 348
SYP
Sbjct: 338 SYP 340
>gi|357474527|ref|XP_003607548.1| Cysteine protease [Medicago truncatula]
gi|358347211|ref|XP_003637653.1| Cysteine protease [Medicago truncatula]
gi|355503588|gb|AES84791.1| Cysteine protease [Medicago truncatula]
gi|355508603|gb|AES89745.1| Cysteine protease [Medicago truncatula]
Length = 345
Score = 323 bits (828), Expect = 7e-86, Method: Compositional matrix adjust.
Identities = 164/315 (52%), Positives = 207/315 (65%), Gaps = 6/315 (1%)
Query: 38 LTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKN--YWLGLNE 95
L + + + E WM + KVY+ L E+ R +IFK+N+ +I+ +N N Y LG+N+
Sbjct: 31 LQDDSNIYEKHEQWMVHYGKVYKDLQERENRLKIFKENVNYIEASNNAGNNKLYKLGINQ 90
Query: 96 FADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSC 155
FADL +EEF K + ++ F Y++ +P +VDWRKKGAVT VKNQG C
Sbjct: 91 FADLTNEEFIASRNKFKGHMCSSITKT-STFKYENA-SVPSTVDWRKKGAVTPVKNQGQC 148
Query: 156 GSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNT-YNNGCNGGLMDYAFQYIVSTGG 214
G CWAFS VAA EGI+++ TG L SLSEQEL+DCD + GC GGLMD AF++I+ G
Sbjct: 149 GCCWAFSAVAATEGIHKLSTGKLVSLSEQELVDCDTKGVDQGCEGGLMDDAFKFIIQNHG 208
Query: 215 LHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGR 274
L+ E YPY +GTC K VTI GY DVP N+E +L KA+ANQP+SVAI+ASG
Sbjct: 209 LNTEAQYPYQGVDGTCSANKASIHAVTITGYEDVPANNEQALQKAVANQPISVAIDASGS 268
Query: 275 DFQFYSGGVYDGHCGTQLDHGVAAVGYG-STRGLDYIIVKNSWGPKWGEKGYIRMKRNTG 333
DFQFY GV+ G CGT+LDHGV AVGYG G Y +VKNSWG WGE+GYI+M+R
Sbjct: 269 DFQFYKSGVFTGSCGTELDHGVTAVGYGVGNDGTKYWLVKNSWGTDWGEEGYIKMQRGVD 328
Query: 334 KPEGLCGINKMASYP 348
EGLCGI ASYP
Sbjct: 329 AAEGLCGIAMEASYP 343
>gi|449524070|ref|XP_004169046.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like, partial
[Cucumis sativus]
Length = 314
Score = 323 bits (827), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 168/330 (50%), Positives = 214/330 (64%), Gaps = 21/330 (6%)
Query: 24 FARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETN 83
A D+S+ D+ D ++ WM K+ + Y+S +E RF I++ N+++ID N
Sbjct: 1 MAMDYSLGSSCSSDIQ------DRYQKWMDKYGRQYKSREEWERRFTIYQANVQYIDNFN 54
Query: 84 RKIKNYWLGLNEFADLRHEEFKEMFLGLK----PDLARRKDQSHEDFSYKDVVDLPKSVD 139
++ L N FADL +EEFK +LG K PD F Y ++V+LP +VD
Sbjct: 55 SMNHSHTLAENNFADLTNEEFKATYLGYKTVSIPDTC---------FRYGNMVNLPTNVD 105
Query: 140 WRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTY-NNGCN 198
WR++GAVT +KNQG CGSCWAFS VAAVEGIN+I G L SLSEQEL+DCD T N GCN
Sbjct: 106 WRQEGAVTPIKNQGQCGSCWAFSAVAAVEGINKIKAGKLISLSEQELVDCDVTSGNQGCN 165
Query: 199 GGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLK 258
GG M AF++I T GL E +YPY E C K + + V+I+GY VP N E SL
Sbjct: 166 GGYMYKAFEFIKRT-GLTTEIEYPYQGAESACNEQKEKYQFVSISGYEKVPVNDEKSLKA 224
Query: 259 ALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGP 318
A+ANQP+SVAI+A G +FQFYSGG++ G+CG QL+HGVA VGYG T Y +VKNSWG
Sbjct: 225 AVANQPVSVAIDAEGNNFQFYSGGIFSGNCGNQLNHGVAIVGYGETSNQAYWLVKNSWGT 284
Query: 319 KWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
WGE GYIRMKR++ +G CGI MASYP
Sbjct: 285 DWGESGYIRMKRDSTDKQGTCGIAMMASYP 314
>gi|356543038|ref|XP_003539970.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 323 bits (827), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 168/353 (47%), Positives = 227/353 (64%), Gaps = 17/353 (4%)
Query: 1 MALSSQFKTILIS--FCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKV 58
M +QF I ++ FC+ F+ F + + +D + + + E WM+++ KV
Sbjct: 1 MVAKNQFYHISLALLFCLGFWA-------FQVTSRTLQDAS----MYERHEEWMARYAKV 49
Query: 59 YESLDEKLERFEIFKDNLRHIDETNRKI-KNYWLGLNEFADLRHEEFKEMFLGLKPDLAR 117
Y+ +E+ +RF+IFK+N+ +I+ N K Y LG+N+FADL +EEF K +
Sbjct: 50 YKDPEEREKRFKIFKENVNYIEAFNNAANKPYKLGINQFADLTNEEFIAPRNRFKGHMCS 109
Query: 118 RKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGN 177
++ F Y++V LP +VDWR+KGAVT +K+QG CG CWAFS VAA EGI+ + +G
Sbjct: 110 SITRT-TTFKYENVTALPSTVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALNSGK 168
Query: 178 LASLSEQELIDCDNT-YNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGE 236
L SLSEQE++DCD + GC GG MD AF++I+ GL+ E +YPY +G C +
Sbjct: 169 LISLSEQEVVDCDTKGEDQGCAGGFMDGAFKFIIQNHGLNTEANYPYKAVDGKCNANEAA 228
Query: 237 SEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGV 296
+ TI GY DVP N+E +L KA+ANQP+SVAI+ASG DFQFY GV+ G CGTQLDHGV
Sbjct: 229 NHAATITGYEDVPVNNEKALQKAVANQPVSVAIDASGSDFQFYKTGVFTGSCGTQLDHGV 288
Query: 297 AAVGYG-STRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
AVGYG S G Y +VKNSWG +WGE+GYI M+R EGLCGI MASYP
Sbjct: 289 TAVGYGVSADGTQYWLVKNSWGTEWGEEGYIMMQRGVKAQEGLCGIAMMASYP 341
>gi|356543076|ref|XP_003539989.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 322 bits (826), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 168/353 (47%), Positives = 227/353 (64%), Gaps = 17/353 (4%)
Query: 1 MALSSQFKTILIS--FCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKV 58
M +QF I ++ FC+ F+ F + + +D + + + E WM+++ KV
Sbjct: 1 MVAKNQFYHISLALLFCLGFWA-------FQVTSRTLQDAS----MYERHEEWMARYAKV 49
Query: 59 YESLDEKLERFEIFKDNLRHIDETNRKI-KNYWLGLNEFADLRHEEFKEMFLGLKPDLAR 117
Y+ +E+ +RF+IFK+N+ +I+ N K Y LG+N+FADL +EEF K +
Sbjct: 50 YKDPEEREKRFKIFKENVNYIEAFNNAADKPYKLGINQFADLTNEEFIAPRNKFKGHMCS 109
Query: 118 RKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGN 177
++ F Y++V LP +VDWR+KGAVT +K+QG CG CWAFS VAA EGI+ + +G
Sbjct: 110 SITRT-TTFKYENVTALPSTVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALNSGK 168
Query: 178 LASLSEQELIDCDNT-YNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGE 236
L SLSEQE++DCD + GC GG MD AF++I+ GL+ E +YPY +G C +
Sbjct: 169 LISLSEQEVVDCDTKGEDQGCAGGFMDGAFKFIIQNHGLNTEANYPYKAVDGKCNANEAA 228
Query: 237 SEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGV 296
+ TI GY DVP N+E +L KA+ANQP+SVAI+ASG DFQFY GV+ G CGTQLDHGV
Sbjct: 229 NHAATITGYEDVPVNNEKALQKAVANQPVSVAIDASGSDFQFYKTGVFTGSCGTQLDHGV 288
Query: 297 AAVGYG-STRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
AVGYG S G Y +VKNSWG +WGE+GYI M+R EGLCGI MASYP
Sbjct: 289 TAVGYGVSADGTQYWLVKNSWGTEWGEEGYIMMQRGVKAQEGLCGIAMMASYP 341
>gi|384253406|gb|EIE26881.1| hypothetical protein COCSUDRAFT_21961 [Coccomyxa subellipsoidea
C-169]
Length = 481
Score = 322 bits (826), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 160/305 (52%), Positives = 201/305 (65%), Gaps = 4/305 (1%)
Query: 48 FESWMSKFEKVYESLDEKLER-FEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKE 106
F W+ +K Y+ E+ ER F ++ DNL + N K + LGL FADL H+E+++
Sbjct: 48 FSDWVEHLQKAYKDNVEEYERKFSVWLDNLEFVHSHNEKDSTFKLGLTNFADLTHDEYRQ 107
Query: 107 MFLGLKPDLARRKDQSHED--FSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTV 164
LG +P+L + + F Y D + P S+DWRKKGAVT VKNQ CGSCWAFST
Sbjct: 108 HALGYRPELKGTGLGTGKSTGFQYADY-EAPPSIDWRKKGAVTDVKNQQQCGSCWAFSTT 166
Query: 165 AAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYI 224
+VEG N I +G L SLSEQEL+DCD T ++GC+GGLMD+AF +I+ GG+ E+DY Y
Sbjct: 167 GSVEGANAIYSGELVSLSEQELVDCDVTQDHGCHGGLMDFAFSFIIRNGGIDTEKDYKYK 226
Query: 225 MEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVY 284
++G C + K + VVTI+ Y DVP N E +L KA ANQP+SVAIEA R+FQ Y+GGV+
Sbjct: 227 AQDGVCNIAKEKRHVVTIDSYEDVPPNDESALKKAAANQPISVAIEADQREFQLYAGGVF 286
Query: 285 DGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKM 344
D CGT LDHGV VGYGS G DY IVKNSWG WG+ GYIR+ R G CGI
Sbjct: 287 DAPCGTALDHGVLVVGYGSDNGTDYWIVKNSWGDFWGDSGYIRLARGISNSAGQCGIAMQ 346
Query: 345 ASYPI 349
ASYPI
Sbjct: 347 ASYPI 351
>gi|255568299|ref|XP_002525124.1| cysteine protease, putative [Ricinus communis]
gi|223535583|gb|EEF37251.1| cysteine protease, putative [Ricinus communis]
Length = 342
Score = 322 bits (826), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 168/353 (47%), Positives = 226/353 (64%), Gaps = 18/353 (5%)
Query: 1 MAL--SSQFKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKV 58
MAL QF I + F ++ + + R+ +++ E WM+K KV
Sbjct: 1 MALLCKGQFLLIALFFVLAMWADQASTRELH-----------ESTMVERHEKWMAKHGKV 49
Query: 59 YESLDEKLERFEIFKDNLRHIDETNRKIKN-YWLGLNEFADLRHEEFKEMFLGLKPDLAR 117
Y+ +EKL RF+IFK+N+ I+ +N N Y LG+N FADL +EEF+ + G K L
Sbjct: 50 YKDDEEKLRRFQIFKNNVEFIESSNAAGNNSYMLGINRFADLTNEEFRASWNGYKRPLDA 109
Query: 118 RKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGN 177
+ + F Y++V LP S+DWR+KGAVT +K+Q CGSCWAFS VAA EG++++ TG
Sbjct: 110 SRIVT--PFKYENVTALPYSMDWRRKGAVTSIKDQRECGSCWAFSAVAATEGVHKLRTGK 167
Query: 178 LASLSEQELIDCD-NTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGE 236
L SLSEQEL+DCD + GC GGLM+ AF++I GG+ E +Y Y +G C+ K
Sbjct: 168 LVSLSEQELVDCDVKGEDKGCQGGLMEDAFKFIKRNGGITTEANYAYRGRDGKCDTKKEA 227
Query: 237 SEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGV 296
S V I GY VP+NSE +LLKA+A+QP+SV+I+A FQFY G+Y G CG+ L+HGV
Sbjct: 228 SHVAKITGYQVVPENSEAALLKAVAHQPVSVSIDAGSMSFQFYQSGIYAGSCGSDLNHGV 287
Query: 297 AAVGYG-STRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
AAVGYG S+ G Y IVKNSWGP+WGE+GY+RMKR+ +GLCGI SYP
Sbjct: 288 AAVGYGTSSSGSKYWIVKNSWGPEWGERGYVRMKRDITSRKGLCGIAMDCSYP 340
>gi|242032709|ref|XP_002463749.1| hypothetical protein SORBIDRAFT_01g005350 [Sorghum bicolor]
gi|241917603|gb|EER90747.1| hypothetical protein SORBIDRAFT_01g005350 [Sorghum bicolor]
Length = 381
Score = 322 bits (826), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 175/330 (53%), Positives = 213/330 (64%), Gaps = 15/330 (4%)
Query: 33 YSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRK--IKNYW 90
+ DL S++ L DL+E W + +V+ EK RF FK+N+R I N++ +Y
Sbjct: 31 FDERDLASDEALWDLYERWQTH-HRVHRHHGEKGRRFGTFKENVRFIHAHNKRGDRPSYR 89
Query: 91 LGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHE------DFSYKDVVDLPKSVDWRKKG 144
L LN F D+ EEF+ F + + RR +S F Y D D+P+SVDWR+ G
Sbjct: 90 LRLNRFGDMGPEEFRSTFADSRINDLRRYRESSPAATAVPGFMYDDATDVPRSVDWRQHG 149
Query: 145 AVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDY 204
AVT VKNQG CGSCWAFSTV AVEGIN I TG+L SLSEQEL+DCD T NGC GGLM+
Sbjct: 150 AVTAVKNQGRCGSCWAFSTVVAVEGINAIRTGSLVSLSEQELVDCD-TAENGCQGGLMEN 208
Query: 205 AFQYIVSTGGLHKEEDYPYIMEEGTCE-MTKGESEV-VTINGYHDVPQNSEDSLLKALAN 262
AF +I S GG+ E YPY GTC+ M V V+I+G+ VP SED+L KA+A
Sbjct: 209 AFDFIKSYGGITTESAYPYRASNGTCDGMRARRGRVHVSIDGHQMVPTGSEDALAKAVAR 268
Query: 263 QPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGST--RGLDYIIVKNSWGPKW 320
QP+SVAI+A G+ FQFYS GV+ G CGT LDHGVA VGYG + G Y IVKNSWGP W
Sbjct: 269 QPVSVAIDAGGQAFQFYSEGVFTGDCGTDLDHGVAVVGYGVSDVDGTPYWIVKNSWGPSW 328
Query: 321 GEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
GE GYIRM+R G GLCGI AS+PIK
Sbjct: 329 GEGGYIRMQRGAGN-GGLCGIAMEASFPIK 357
>gi|224121800|ref|XP_002330656.1| predicted protein [Populus trichocarpa]
gi|222872260|gb|EEF09391.1| predicted protein [Populus trichocarpa]
Length = 342
Score = 322 bits (825), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 161/303 (53%), Positives = 202/303 (66%), Gaps = 4/303 (1%)
Query: 49 ESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRK-IKNYWLGLNEFADLRHEEFKEM 107
E WM+ + KVY EK RF+IFK+N+ +I+ N K Y L +N+FAD +E+FK
Sbjct: 39 EQWMATYGKVYVDAAEKERRFKIFKNNVEYIESFNTAGNKPYKLSVNKFADQTNEKFKGA 98
Query: 108 FLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAV 167
G + R F Y++V +P ++DWRKKGAVT +K+QG CGSCWAFSTVAA
Sbjct: 99 RNGYRRPFQTRP-MKVTSFKYENVTAVPATMDWRKKGAVTLIKDQGQCGSCWAFSTVAAT 157
Query: 168 EGINQIVTGNLASLSEQELIDCD-NTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIME 226
EGINQ+ TG L SLSEQEL+DCD + GC GGLM+ F++I+ G+ E +YPY
Sbjct: 158 EGINQLTTGKLVSLSEQELVDCDIQGEDQGCEGGLMEDGFEFIIKNHGITTEANYPYQAA 217
Query: 227 EGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDG 286
+GTC K S + I GY VP NSE LLK +ANQP+SV+I+A G DFQFYS GV+ G
Sbjct: 218 DGTCNSKKQASHIAKITGYESVPANSEAELLKVVANQPISVSIDAGGSDFQFYSSGVFTG 277
Query: 287 HCGTQLDHGVAAVGYGSTR-GLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMA 345
CGT+LDHGV AVGYG T G Y +VKNSWG WGE+GYIRM+R+ EGLCGI +
Sbjct: 278 KCGTELDHGVTAVGYGETSDGTKYWLVKNSWGTSWGEEGYIRMQRDIDTEEGLCGIAMDS 337
Query: 346 SYP 348
SYP
Sbjct: 338 SYP 340
>gi|356515050|ref|XP_003526214.1| PREDICTED: vignain-like [Glycine max]
Length = 344
Score = 322 bits (825), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 165/310 (53%), Positives = 213/310 (68%), Gaps = 6/310 (1%)
Query: 44 LIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRK-IKNYWLGLNEFADLRHE 102
L + E+WM+++ K+Y+ EK +RF+IFKDN+ I+ N K Y LG+N ADL E
Sbjct: 34 LRERHENWMAEYGKIYKDAAEKEKRFQIFKDNVEFIESFNAAGNKPYKLGVNHLADLTLE 93
Query: 103 EFKEMFLGLKPDLARRKDQ-SHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQG-SCGSCWA 160
EFK+ GLK F Y++V D+P+++DWR KGAVT +K+QG CGSCWA
Sbjct: 94 EFKDSRNGLKRTYEFSTTTFKLNGFKYENVTDIPEAIDWRVKGAVTPIKDQGDQCGSCWA 153
Query: 161 FSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEED 220
FSTVAA EGI QI TG L SLSEQEL+DCD+ ++GC+GGLM+ F++I+ GG+ E +
Sbjct: 154 FSTVAATEGIYQISTGMLMSLSEQELVDCDSV-DHGCDGGLMEDGFEFIIKNGGISSEAN 212
Query: 221 YPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYS 280
YPY +GTC+ +K S I GY VP NSE++L +A+ANQP+SV+I+A G FQFYS
Sbjct: 213 YPYTAVDGTCDASKEASPAAQIKGYETVPANSEEALQQAVANQPVSVSIDAGGSGFQFYS 272
Query: 281 GGVYDGHCGTQLDHGVAAVGYGSTR--GLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGL 338
GV+ G CGTQLDHGV VGYG+T +Y IVKNSWG +WGE+GYIRM+R EGL
Sbjct: 273 SGVFTGQCGTQLDHGVTVVGYGTTDDGTHEYWIVKNSWGTQWGEEGYIRMQRGIDALEGL 332
Query: 339 CGINKMASYP 348
CGI ASYP
Sbjct: 333 CGIAMDASYP 342
>gi|115461226|ref|NP_001054213.1| Os04g0670500 [Oryza sativa Japonica Group]
gi|62510688|sp|Q7XR52.2|CYSP1_ORYSJ RecName: Full=Cysteine protease 1; AltName: Full=OsCP1; Flags:
Precursor
gi|38345300|emb|CAE02828.2| OSJNBa0043A12.33 [Oryza sativa Japonica Group]
gi|113565784|dbj|BAF16127.1| Os04g0670500 [Oryza sativa Japonica Group]
gi|215741575|dbj|BAG98070.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 490
Score = 322 bits (825), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 164/295 (55%), Positives = 199/295 (67%), Gaps = 9/295 (3%)
Query: 62 LDEKLERFEIFKDNLRHIDETNRKIK---NYWLGLNEFADLRHEEFKEMFLGLKPDLARR 118
+ E RF +F DNL+ +D N + + LG+N FADL + EF+ +LG P A R
Sbjct: 82 IGEHERRFRVFWDNLKFVDAHNARADERGGFRLGMNRFADLTNGEFRATYLGTTP--AGR 139
Query: 119 KDQSHEDFSYKDVVDLPKSVDWRKKGAVTH-VKNQGSCGSCWAFSTVAAVEGINQIVTGN 177
+ E + + V LP SVDWR KGAV VKNQG CGSCWAFS VAAVEGIN+IVTG
Sbjct: 140 GRRVGEAYRHDGVEALPDSVDWRDKGAVVAPVKNQGQCGSCWAFSAVAAVEGINKIVTGE 199
Query: 178 LASLSEQELIDC-DNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGE 236
L SLSEQEL++C N N+GCNGG+MD AF +I GGL EEDYPY +G C + K
Sbjct: 200 LVSLSEQELVECARNGQNSGCNGGIMDDAFAFIARNGGLDTEEDYPYTAMDGKCNLAKRS 259
Query: 237 SEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGV 296
+VV+I+G+ DVP+N E SL KA+A+QP+SVAI+A GR+FQ Y GV+ G CGT LDHGV
Sbjct: 260 RKVVSIDGFEDVPENDELSLQKAVAHQPVSVAIDAGGREFQLYDSGVFTGRCGTNLDHGV 319
Query: 297 AAVGYGS--TRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPI 349
AVGYG+ G Y V+NSWGP WGE GYIRM+RN G CGI MASYPI
Sbjct: 320 VAVGYGTDAATGAAYWTVRNSWGPDWGENGYIRMERNVTARTGKCGIAMMASYPI 374
>gi|297799636|ref|XP_002867702.1| hypothetical protein ARALYDRAFT_329301 [Arabidopsis lyrata subsp.
lyrata]
gi|297313538|gb|EFH43961.1| hypothetical protein ARALYDRAFT_329301 [Arabidopsis lyrata subsp.
lyrata]
Length = 357
Score = 322 bits (825), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 162/314 (51%), Positives = 214/314 (68%), Gaps = 5/314 (1%)
Query: 40 SNDKLIDLFESWMSKFEKVY-ESLDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFAD 98
SN+++ +F+ WMSK K Y +L EK RF+ FKDNLR ID+ N K +Y LGL FAD
Sbjct: 40 SNEEVGFIFQMWMSKHGKTYTNALGEKERRFQNFKDNLRFIDQHNAKNLSYQLGLTRFAD 99
Query: 99 LRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSC 158
L +E++++F G P +R + + D LP+SVDWR +GAV+ +K+QG+C SC
Sbjct: 100 LTVQEYRDLFPG-SPKPKQRNLRISRRYVPLDGDQLPESVDWRNEGAVSAIKDQGTCNSC 158
Query: 159 WAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNG-GLMDYAFQYIVSTGGLHK 217
WAFSTVAAVEGIN+IVTG L SLSEQEL+DC N NNGC G G MD AFQ++++ GGL
Sbjct: 159 WAFSTVAAVEGINKIVTGELVSLSEQELVDC-NLVNNGCYGSGTMDAAFQFLINNGGLDS 217
Query: 218 EEDYPYIMEEGTCEMTKGES-EVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDF 276
+ DYPY +G C + S +++TI+ Y DVP N E SL KA+A+QP+SV ++ ++F
Sbjct: 218 DTDYPYQGSQGYCNRKESTSNKIITIDSYEDVPANDEISLQKAVAHQPVSVGVDKKSQEF 277
Query: 277 QFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPE 336
Y G+Y+G CGT LDH + VGYGS G DY IV+NSWG WG+ GY +M RN P
Sbjct: 278 MLYRSGIYNGPCGTDLDHALVIVGYGSENGQDYWIVRNSWGTTWGDAGYAKMARNFEYPS 337
Query: 337 GLCGINKMASYPIK 350
G+CGI +ASYP+K
Sbjct: 338 GVCGIAMLASYPVK 351
>gi|124484401|dbj|BAF46311.1| cysteine proteinase precursor [Ipomoea nil]
Length = 339
Score = 322 bits (824), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 167/330 (50%), Positives = 222/330 (67%), Gaps = 7/330 (2%)
Query: 23 SFARDFSIVGYSPEDLTSNDKLIDL-FESWMSKFEKVYESLDEKLERFEIFKDNLRHIDE 81
+ A F+ Y T D L+ + E WM+++ +VY++ EK +R+ IFK+N+ +I+
Sbjct: 11 ALALVFATSAYLATSRTLLDSLMAVRHEQWMAQYGRVYKNEVEKTKRYNIFKENVEYIES 70
Query: 82 TNRK-IKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDW 140
N+ K Y LG+N FADL ++EF G + + S+ F Y++V +P +VDW
Sbjct: 71 FNKAGTKPYKLGINAFADLTNKEFIASRNGY---ILPHECSSNTPFRYENVSAVPTTVDW 127
Query: 141 RKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCD-NTYNNGCNG 199
RKKGAVT VK+QG CG CWAFS VAA+EGI ++ TGNL SLSEQEL+DCD + GC G
Sbjct: 128 RKKGAVTPVKDQGQCGCCWAFSAVAAMEGITKLSTGNLISLSEQELVDCDVKGIDQGCEG 187
Query: 200 GLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKA 259
GLMD AF +I++ GL E +YPY +G+C+ +K + I+GY DVP NSE +L KA
Sbjct: 188 GLMDDAFTFIINNKGLTTESNYPYQGTDGSCKKSKSSNSAAKISGYEDVPANSESALEKA 247
Query: 260 LANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYG-STRGLDYIIVKNSWGP 318
+ANQP+SVAI+A G DFQFYS GV+ G CGT+LDHGV AVGYG + G Y +VKNSWG
Sbjct: 248 VANQPVSVAIDAGGSDFQFYSSGVFTGECGTELDHGVTAVGYGIAEDGSKYWLVKNSWGT 307
Query: 319 KWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
WGEKGYIRM+++ EGLCGI +SYP
Sbjct: 308 SWGEKGYIRMQKDIEAKEGLCGIAMQSSYP 337
>gi|224099295|ref|XP_002334495.1| predicted protein [Populus trichocarpa]
gi|222872550|gb|EEF09681.1| predicted protein [Populus trichocarpa]
Length = 342
Score = 322 bits (824), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 161/303 (53%), Positives = 202/303 (66%), Gaps = 4/303 (1%)
Query: 49 ESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRK-IKNYWLGLNEFADLRHEEFKEM 107
E WM+ + KVY EK RF+IFK+N+ +I+ N K Y L +N+FAD +E+FK
Sbjct: 39 EQWMATYGKVYVDAAEKERRFKIFKNNVEYIESFNTAGNKPYKLSVNKFADQTNEKFKGA 98
Query: 108 FLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAV 167
G + R F Y++V +P ++DWRKKGAVT +K+QG CGSCWAFSTVAA
Sbjct: 99 RNGYRRPFQTRP-MKVTSFKYENVTAVPATMDWRKKGAVTPIKDQGQCGSCWAFSTVAAT 157
Query: 168 EGINQIVTGNLASLSEQELIDCDNT-YNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIME 226
EGINQ+ TG L SLSEQEL+DCDN + GC GGLM+ F++I+ G+ E +YPY
Sbjct: 158 EGINQLTTGKLVSLSEQELVDCDNQGEDQGCEGGLMEDGFEFIIKNHGITTEANYPYQAA 217
Query: 227 EGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDG 286
+GTC K S + I GY VP NSE LLK +ANQP+SV+I+A G DFQFYS GV+ G
Sbjct: 218 DGTCNSKKQASHIAKITGYESVPANSEAELLKVVANQPISVSIDAGGSDFQFYSSGVFTG 277
Query: 287 HCGTQLDHGVAAVGYGSTR-GLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMA 345
CGT+LDHGV AVGYG T G Y +VKNSW WGE+GYIRM+R+ EGLCGI +
Sbjct: 278 KCGTELDHGVTAVGYGETSDGTKYWLVKNSWXTSWGEEGYIRMQRDIDAEEGLCGIAMDS 337
Query: 346 SYP 348
SYP
Sbjct: 338 SYP 340
>gi|90265242|emb|CAH67695.1| H0624F09.3 [Oryza sativa Indica Group]
Length = 494
Score = 322 bits (824), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 164/295 (55%), Positives = 199/295 (67%), Gaps = 9/295 (3%)
Query: 62 LDEKLERFEIFKDNLRHIDETNRKIK---NYWLGLNEFADLRHEEFKEMFLGLKPDLARR 118
+ E RF +F DNL+ +D N + + LG+N FADL + EF+ +LG P A R
Sbjct: 82 IGEHERRFRVFWDNLKFVDAHNARADERGGFRLGMNRFADLTNGEFRATYLGTTP--AGR 139
Query: 119 KDQSHEDFSYKDVVDLPKSVDWRKKGAVTH-VKNQGSCGSCWAFSTVAAVEGINQIVTGN 177
+ E + + V LP SVDWR KGAV VKNQG CGSCWAFS VAAVEGIN+IVTG
Sbjct: 140 GRRVGEAYRHDGVEALPDSVDWRDKGAVVAPVKNQGQCGSCWAFSAVAAVEGINKIVTGE 199
Query: 178 LASLSEQELIDC-DNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGE 236
L SLSEQEL++C N N+GCNGG+MD AF +I GGL EEDYPY +G C + K
Sbjct: 200 LVSLSEQELVECARNGQNSGCNGGIMDDAFAFIARNGGLDTEEDYPYTAMDGKCNLAKRS 259
Query: 237 SEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGV 296
+VV+I+G+ DVP+N E SL KA+A+QP+SVAI+A GR+FQ Y GV+ G CGT LDHGV
Sbjct: 260 RKVVSIDGFEDVPENDELSLQKAVAHQPVSVAIDAGGREFQLYDSGVFTGRCGTNLDHGV 319
Query: 297 AAVGYGS--TRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPI 349
AVGYG+ G Y V+NSWGP WGE GYIRM+RN G CGI MASYPI
Sbjct: 320 VAVGYGTDAATGAAYWTVRNSWGPDWGENGYIRMERNVTARTGKCGIAMMASYPI 374
>gi|413951605|gb|AFW84254.1| hypothetical protein ZEAMMB73_933931 [Zea mays]
Length = 423
Score = 322 bits (824), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 180/354 (50%), Positives = 224/354 (63%), Gaps = 16/354 (4%)
Query: 8 KTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLE 67
KT+L+ + F+ S+ + + DL S++ L DL+E W + +V+ EK
Sbjct: 50 KTLLLVALV--FVSSAAVELCRAIDFDERDLASDEALWDLYERWQTH-HRVHRHHGEKGR 106
Query: 68 RFEIFKDNLRHIDETN-RKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSH--- 123
RF FK+N+R I N R + Y L LN F D+ EEF+ F + + RR+D
Sbjct: 107 RFGTFKENVRFIHAHNKRGDRPYRLRLNRFGDMGREEFRSTFADSRINDLRRQDSPAARA 166
Query: 124 ---EDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLAS 180
F Y D P+SVDWR++GAVT VK+QG CGSCWAFSTV AVEGIN I TG+LAS
Sbjct: 167 GAVPGFMYDSAADPPRSVDWRQEGAVTGVKDQGHCGSCWAFSTVVAVEGINAIRTGSLAS 226
Query: 181 LSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCE---MTKGES 237
LSEQELIDCD T NGC GGLM+ AF++I S GG+ E YPY GTC+ +G
Sbjct: 227 LSEQELIDCD-TDENGCQGGLMENAFEFIKSFGGITTEAAYPYRASNGTCDGDRARRGGG 285
Query: 238 EVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVA 297
VV I+G+ VP SED+L KA+A+QP+SVA++A G+ FQFYS GV+ G CGT LDHGVA
Sbjct: 286 VVVVIDGHQMVPAGSEDALAKAVAHQPVSVAVDAGGQAFQFYSEGVFTGDCGTDLDHGVA 345
Query: 298 AVGYG-STRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
AVGYG G Y IVKNSWG WGE GYIRM+R G GLCGI AS+PIK
Sbjct: 346 AVGYGVGDDGTPYWIVKNSWGTSWGEGGYIRMQRGAGN-GGLCGIAMEASFPIK 398
>gi|18413505|ref|NP_567376.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|30315954|sp|Q9SUT0.1|CPR3_ARATH RecName: Full=Probable cysteine proteinase At4g11310; Flags:
Precursor
gi|5596477|emb|CAB51415.1| drought-inducible cysteine proteinase RD21A precursor-like protein
[Arabidopsis thaliana]
gi|7267830|emb|CAB81232.1| drought-inducible cysteine proteinase RD21A precursor-like protein
[Arabidopsis thaliana]
gi|332657595|gb|AEE82995.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 364
Score = 322 bits (824), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 170/352 (48%), Positives = 223/352 (63%), Gaps = 12/352 (3%)
Query: 8 KTILISFCISFFIRS-SFARDFSIVGYSPEDLTSNDKLID-----LFESWMSKFEKVYES 61
K+ ++ ++ I S + A D S+V Y +D + D +FESWM K KVY S
Sbjct: 5 KSAMLILLVAMVIASCATAIDMSVVSY--DDNNRLHSVFDAEASLIFESWMVKHGKVYGS 62
Query: 62 LDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQ 121
+ EK R IF+DNLR I+ N + +Y LGL FADL E+KE+ G P R
Sbjct: 63 VAEKERRLTIFEDNLRFINNRNAENLSYRLGLTGFADLSLHEYKEVCHGADPRPPRNHVF 122
Query: 122 SHEDFSYKDVVD--LPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLA 179
YK D LPKSVDWR +GAVT VK+QG C SCWAFSTV AVEG+N+IVTG L
Sbjct: 123 MTSSDRYKTSADDVLPKSVDWRNEGAVTEVKDQGHCRSCWAFSTVGAVEGLNKIVTGELV 182
Query: 180 SLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCE-MTKGESE 238
+LSEQ+LI+C N NNGC GG ++ A+++I+ GGL + DYPY G C+ K ++
Sbjct: 183 TLSEQDLINC-NKENNGCGGGKLETAYEFIMKNGGLGTDNDYPYKAVNGVCDGRLKENNK 241
Query: 239 VVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAA 298
V I+GY ++P N E +L+KA+A+QP++ I++S R+FQ Y GV+DG CGT L+HGV
Sbjct: 242 NVMIDGYENLPANDESALMKAVAHQPVTAVIDSSSREFQLYESGVFDGSCGTNLNHGVVV 301
Query: 299 VGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
VGYG+ G DY +VKNS G WGE GY++M RN P GLCGI ASYP+K
Sbjct: 302 VGYGTENGRDYWLVKNSRGITWGEAGYMKMARNIANPRGLCGIAMRASYPLK 353
>gi|312282059|dbj|BAJ33895.1| unnamed protein product [Thellungiella halophila]
Length = 379
Score = 322 bits (824), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 161/307 (52%), Positives = 204/307 (66%), Gaps = 4/307 (1%)
Query: 47 LFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKE 106
+FESW+ K KVY+S+ EK R IFKDNLR I N + Y LGLN FADL E+KE
Sbjct: 63 IFESWIVKHGKVYDSVAEKERRLTIFKDNLRFITNRNSENLGYRLGLNRFADLSLHEYKE 122
Query: 107 MFLGLKPDLARRKDQSHEDFSYKDVVD--LPKSVDWRKKGAVTHVKNQGSCGSCWAFSTV 164
+ G P R YK LPKSVDWR +GAVT VK+QG C SCWAFSTV
Sbjct: 123 ICHGADPKPPRNHVFMSSSDRYKTSAGDVLPKSVDWRNEGAVTEVKDQGHCRSCWAFSTV 182
Query: 165 AAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYI 224
AVEG+N+IVTG L +LSEQ+LI+C N NNGC GG ++ A+++IVS GGL + DYPY
Sbjct: 183 GAVEGLNKIVTGELVTLSEQDLINC-NKENNGCGGGKVETAYEFIVSNGGLGTDNDYPYK 241
Query: 225 MEEGTCEMTKGES-EVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGV 283
G C+ E+ + V I+GY ++P N E +L+KA+A+QP++ I++S R+FQ Y GV
Sbjct: 242 AVNGACDGRLKENIKNVMIDGYENLPANDELALMKAVAHQPVTAVIDSSSREFQLYESGV 301
Query: 284 YDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINK 343
+DG CGT L+HGV VGYG+ G +Y IV+NSWG WGE GY++M RN P GLCGI
Sbjct: 302 FDGRCGTNLNHGVVVVGYGTENGRNYWIVRNSWGNTWGEAGYMKMARNIANPRGLCGIAM 361
Query: 344 MASYPIK 350
SYP+K
Sbjct: 362 RVSYPLK 368
>gi|356517358|ref|XP_003527354.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
gi|356577767|ref|XP_003556994.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 343
Score = 321 bits (823), Expect = 3e-85, Method: Compositional matrix adjust.
Identities = 167/353 (47%), Positives = 222/353 (62%), Gaps = 17/353 (4%)
Query: 1 MALSSQFKTILIS--FCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKV 58
M +QF I ++ FC F F + + +D + + + E WM ++ KV
Sbjct: 1 MVAKNQFYQISLALLFCSGFLA-------FQVTCRTLQDAS----MYERHEEWMGRYAKV 49
Query: 59 YESLDEKLERFEIFKDNLRHIDETNRKI-KNYWLGLNEFADLRHEEFKEMFLGLKPDLAR 117
Y+ E+ RF+IFK+N+ +I+ N K Y LG+N+FADL +EEF K +
Sbjct: 50 YKDPQERERRFKIFKENVNYIEAFNNAANKPYTLGINQFADLTNEEFIAPRNRFKGHMCS 109
Query: 118 RKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGN 177
++ F Y++V +P +VDWR+KGAVT +K+QG CG CWAFS VAA EGI+ + G
Sbjct: 110 SITRT-TTFKYENVTAIPSTVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALSAGK 168
Query: 178 LASLSEQELIDCDNT-YNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGE 236
L SLSEQE++DCD + GC GG MD AF++I+ GL+ E +YPY +G C
Sbjct: 169 LISLSEQEVVDCDTKGEDQGCAGGFMDGAFKFIIQNHGLNNEPNYPYKAVDGKCNAKAAA 228
Query: 237 SEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGV 296
+ V TI GY DVP N+E +L KA+ANQP+SVAI+ASG DFQFY GV+ G CGT+LDHGV
Sbjct: 229 NHVATITGYEDVPVNNEKALQKAVANQPVSVAIDASGSDFQFYQSGVFTGSCGTELDHGV 288
Query: 297 AAVGYG-STRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
AVGYG S G +Y +VKNSWG +WGE+GYIRM+R EGL GI MASYP
Sbjct: 289 TAVGYGVSADGTEYWLVKNSWGTEWGEEGYIRMQRGVKAEEGLXGIAMMASYP 341
>gi|356515040|ref|XP_003526209.1| PREDICTED: thiol protease SEN102-like [Glycine max]
Length = 342
Score = 321 bits (823), Expect = 3e-85, Method: Compositional matrix adjust.
Identities = 162/308 (52%), Positives = 208/308 (67%), Gaps = 4/308 (1%)
Query: 44 LIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRK-IKNYWLGLNEFADLRHE 102
L + E+WM+++ K+Y+ EK +RF+IFKDN+ I+ N K Y LG+N ADL E
Sbjct: 34 LRERHENWMAEYGKMYKDAAEKEKRFQIFKDNVEFIESFNAAGNKPYKLGVNHLADLTLE 93
Query: 103 EFKEMFLGLKPDLARRKDQ-SHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQG-SCGSCWA 160
EFK+ GLK F Y++V D+P+++DWR KGAVT +K+QG CGSCWA
Sbjct: 94 EFKDSRNGLKRTYEFSTTTFKLNGFKYENVTDIPEAIDWRVKGAVTPIKDQGDQCGSCWA 153
Query: 161 FSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEED 220
FST+AA EGI+QI TGNL SLSEQEL+DCD+ ++GC GG M+ F++I+ GG+ E +
Sbjct: 154 FSTIAATEGIHQISTGNLVSLSEQELVDCDSV-DDGCEGGFMEDGFEFIIKNGGITSETN 212
Query: 221 YPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYS 280
YPY +GTC T S V I GY VP SE++L KA+ANQP+SV+I A+ F FYS
Sbjct: 213 YPYKGVDGTCNTTIAASPVAQIKGYEIVPSYSEEALQKAVANQPVSVSIHATNATFMFYS 272
Query: 281 GGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCG 340
G+Y+G CGT LDHGV AVGYG+ G DY IVKNSWG +WGEKGYIRM R G+CG
Sbjct: 273 SGIYNGECGTDLDHGVTAVGYGTENGTDYWIVKNSWGTQWGEKGYIRMHRGIAAKHGICG 332
Query: 341 INKMASYP 348
I +SYP
Sbjct: 333 IALDSSYP 340
>gi|413951606|gb|AFW84255.1| hypothetical protein ZEAMMB73_933931 [Zea mays]
Length = 379
Score = 321 bits (823), Expect = 3e-85, Method: Compositional matrix adjust.
Identities = 179/354 (50%), Positives = 223/354 (62%), Gaps = 16/354 (4%)
Query: 8 KTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLE 67
KT+L+ + F+ S+ + + DL S++ L DL+E W + +V+ EK
Sbjct: 6 KTLLLVALV--FVSSAAVELCRAIDFDERDLASDEALWDLYERWQTH-HRVHRHHGEKGR 62
Query: 68 RFEIFKDNLRHIDETN-RKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSH--- 123
RF FK+N+R I N R + Y L LN F D+ EEF+ F + + RR+D
Sbjct: 63 RFGTFKENVRFIHAHNKRGDRPYRLRLNRFGDMGREEFRSTFADSRINDLRRQDSPAARA 122
Query: 124 ---EDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLAS 180
F Y D P+SVDWR++GAVT VK+QG CGSCWAFSTV AVEGIN I TG+LAS
Sbjct: 123 GAVPGFMYDSAADPPRSVDWRQEGAVTGVKDQGHCGSCWAFSTVVAVEGINAIRTGSLAS 182
Query: 181 LSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGE---S 237
LSEQELIDCD T NGC GGLM+ AF++I S GG+ E YPY GTC+ +
Sbjct: 183 LSEQELIDCD-TDENGCQGGLMENAFEFIKSFGGITTEAAYPYRASNGTCDGDRARRGGG 241
Query: 238 EVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVA 297
VV I+G+ VP SED+L KA+A+QP+SVA++A G+ FQFYS GV+ G CGT LDHGVA
Sbjct: 242 VVVVIDGHQMVPAGSEDALAKAVAHQPVSVAVDAGGQAFQFYSEGVFTGDCGTDLDHGVA 301
Query: 298 AVGYG-STRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
AVGYG G Y IVKNSWG WGE GYIRM+R G GLCGI AS+PIK
Sbjct: 302 AVGYGVGDDGTPYWIVKNSWGTSWGEGGYIRMQRGAGN-GGLCGIAMEASFPIK 354
>gi|356517350|ref|XP_003527350.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
gi|356577765|ref|XP_003556993.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 343
Score = 321 bits (823), Expect = 3e-85, Method: Compositional matrix adjust.
Identities = 169/356 (47%), Positives = 227/356 (63%), Gaps = 23/356 (6%)
Query: 1 MALSSQFKTILIS--FCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKV 58
MA QF I ++ FC+ F F + + +D + + + E WM+++ KV
Sbjct: 1 MATKIQFHHISLALFFCLGFLA-------FQVASRTLQDAS----MYERHEQWMARYGKV 49
Query: 59 YESLDEKLERFEIFKDNLRHIDETNRKI-KNYWLGLNEFADLRHEEF---KEMFLGLKPD 114
Y+ +EK +RF +FK+N+ +I+ N K Y LG+N+FADL EEF + F G
Sbjct: 50 YKDPEEKEKRFRVFKENVNYIEAFNNAANKPYKLGINQFADLTSEEFIVPRNRFNGH--- 106
Query: 115 LARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIV 174
R + F Y++V LP S+DWR+KGAVT +KNQGSCG CWAFS +AA EGI++I
Sbjct: 107 -TRSSNTRTTTFKYENVTVLPDSIDWRQKGAVTPIKNQGSCGCCWAFSAIAATEGIHKIS 165
Query: 175 TGNLASLSEQELIDCDNT-YNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMT 233
TG L SLSEQE++DCD ++GC GG MD AF++I+ G++ E YPY +G C +
Sbjct: 166 TGKLVSLSEQEVVDCDTKGTDHGCEGGYMDGAFKFIIQNHGINTEASYPYKGVDGKCNIK 225
Query: 234 KGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLD 293
+ TI GY DVP N+E +L KA+ANQP+SVAI+ASG DFQFY G++ G CGT+LD
Sbjct: 226 EEAVHAATITGYEDVPINNEKALQKAVANQPVSVAIDASGADFQFYKSGIFTGSCGTELD 285
Query: 294 HGVAAVGYG-STRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
HGV AVGYG + G Y +VKNSWG +WGE+GYI M+R EG+CGI MASYP
Sbjct: 286 HGVTAVGYGENNEGTKYWLVKNSWGTEWGEEGYIMMQRGVKAVEGICGIAMMASYP 341
>gi|297816030|ref|XP_002875898.1| hypothetical protein ARALYDRAFT_485194 [Arabidopsis lyrata subsp.
lyrata]
gi|297321736|gb|EFH52157.1| hypothetical protein ARALYDRAFT_485194 [Arabidopsis lyrata subsp.
lyrata]
Length = 363
Score = 321 bits (822), Expect = 3e-85, Method: Compositional matrix adjust.
Identities = 165/345 (47%), Positives = 219/345 (63%), Gaps = 9/345 (2%)
Query: 11 LISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFE 70
L +SF ++ F + ++L + + + L+E W V + E L+RF
Sbjct: 3 LFFIVLSFLCLLQASKGFD---FDEKELETEENVWKLYERWRDH-HSVTRASHEALKRFN 58
Query: 71 IFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPD---LARRKDQSHEDFS 127
+F+ N+ H+ TN+K K Y L +N FAD+ H EF+ + G + R + F
Sbjct: 59 VFRHNVLHVHRTNKKNKPYKLKVNRFADITHHEFRSSYAGSNVKHHRMLRGPKRGSGGFM 118
Query: 128 YKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELI 187
Y++V +P SVDWR+KGAVT VKNQ CGSCWAFSTVAAVEGIN+I T L SLSEQEL+
Sbjct: 119 YENVTRVPSSVDWREKGAVTEVKNQQDCGSCWAFSTVAAVEGINKIRTNKLVSLSEQELV 178
Query: 188 DCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGT-CEMTKGESEVVTINGYH 246
DCD N GC GGLM+ AF++I + GG+ EE YPY + C + E VTI+G+
Sbjct: 179 DCDTEENQGCAGGLMEPAFEFIKNNGGIKTEETYPYDSNDVQFCRAKSIDGETVTIDGHE 238
Query: 247 DVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTR- 305
VP+N E++LLKA+A+QP+SVAI+A DFQ YS GV+ G CGTQL+HGV VGYG T+
Sbjct: 239 HVPENDEEALLKAVAHQPVSVAIDAGSSDFQLYSEGVFIGECGTQLNHGVVIVGYGETKN 298
Query: 306 GLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
G Y IV+NSWGP+WGE GY+R++R + EG CGI ASYP K
Sbjct: 299 GTKYWIVRNSWGPEWGEGGYVRIERGISENEGRCGIAMEASYPTK 343
>gi|535454|gb|AAA50755.1| cysteine proteinase [Alnus glutinosa]
Length = 340
Score = 321 bits (822), Expect = 4e-85, Method: Compositional matrix adjust.
Identities = 160/303 (52%), Positives = 207/303 (68%), Gaps = 6/303 (1%)
Query: 49 ESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKI-KNYWLGLNEFADLRHEEFKEM 107
E WM+ + +VY+ ++EK +R++IF++N+ I+ +N+ K Y L +N+FADL +EEFK
Sbjct: 39 EEWMASYGRVYKDINEKQKRYKIFEENVALIESSNKDANKPYKLSVNQFADLTNEEFKAS 98
Query: 108 FLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAV 167
K + K S F Y +V +P ++DWR KGAVT VK+QG CG CWAFS VAA
Sbjct: 99 RNRFKGHICSTKSTS---FKYGNVSAVPSAMDWRMKGAVTPVKDQGQCGCCWAFSAVAAT 155
Query: 168 EGINQIVTGNLASLSEQELIDCDNT-YNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIME 226
EGI ++ TG L SLSEQEL+DCD + + GC GGLMD AF +I GL E +YPY
Sbjct: 156 EGITKLTTGELISLSEQELVDCDTSGVDQGCEGGLMDNAFTFIQHNHGLASEANYPYKGV 215
Query: 227 EGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDG 286
+GTC K ING+ DVP NSE++LL A+A+QP+SVAI+A G FQFYS GV+ G
Sbjct: 216 DGTCNTNKQAIHAAEINGFEDVPANSEEALLNAVAHQPVSVAIDAGGSGFQFYSKGVFIG 275
Query: 287 HCGTQLDHGVAAVGYGST-RGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMA 345
CGTQLDHGV AVGYG++ G Y +VKNSWG +WGE+GYIRM+R+ EGLCGI A
Sbjct: 276 ACGTQLDHGVTAVGYGTSDDGTKYWLVKNSWGTQWGEEGYIRMQRDVDAKEGLCGIAMKA 335
Query: 346 SYP 348
SYP
Sbjct: 336 SYP 338
>gi|144905108|dbj|BAF56428.1| cysteine proteinase [Lotus japonicus]
Length = 342
Score = 321 bits (822), Expect = 4e-85, Method: Compositional matrix adjust.
Identities = 167/333 (50%), Positives = 213/333 (63%), Gaps = 22/333 (6%)
Query: 28 FSIVGYSPEDLTSND-KLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDE-TNRK 85
F + + T D L + E WM+++ KVY EK R IFK+N++ I+ N
Sbjct: 18 FGFLAFEANARTLEDVSLKERHEQWMTQYGKVYTDSYEKELRSNIFKENVQRIEAFNNAG 77
Query: 86 IKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSH--------EDFSYKDVVDLPKS 137
K Y LG+N+FADL +EEFK AR + + H F Y+DV +P S
Sbjct: 78 NKPYKLGINQFADLTNEEFK----------ARNRFKGHMCSNSTRTPTFKYEDVSSVPAS 127
Query: 138 VDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNT-YNNG 196
+DWR+KGAVT +K+QG CG CWAFS VAA EGI ++ TG L SLSEQEL+DCD + G
Sbjct: 128 LDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGITKLSTGKLISLSEQELVDCDTKGVDQG 187
Query: 197 CNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSL 256
C GGLMD AF++I+ GL+ E YPY + TC + +I G+ DVP NSE +L
Sbjct: 188 CEGGLMDDAFKFIMQNKGLNTEAKYPYQGVDATCNANAEAKDAASIKGFEDVPANSESAL 247
Query: 257 LKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYG-STRGLDYIIVKNS 315
LKA+ANQP+SVAI+ASG +FQFYS G++ G CGT+LDHGV AVGYG S G Y +VKNS
Sbjct: 248 LKAVANQPISVAIDASGSEFQFYSSGLFTGSCGTELDHGVTAVGYGVSDDGTKYWLVKNS 307
Query: 316 WGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
WG +WGE+GYIRM+R+ EGLCGI ASYP
Sbjct: 308 WGEQWGEEGYIRMQRDVAAEEGLCGIAMQASYP 340
>gi|18408616|ref|NP_566901.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|75313880|sp|Q9STL5.1|CEP3_ARATH RecName: Full=KDEL-tailed cysteine endopeptidase CEP3; Flags:
Precursor
gi|4678353|emb|CAB41163.1| cysteine endopeptidase precursor-like protein [Arabidopsis
thaliana]
gi|26453052|dbj|BAC43602.1| putative cysteine endopeptidase precursor [Arabidopsis thaliana]
gi|332644885|gb|AEE78406.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 364
Score = 321 bits (822), Expect = 4e-85, Method: Compositional matrix adjust.
Identities = 168/349 (48%), Positives = 221/349 (63%), Gaps = 13/349 (3%)
Query: 7 FKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKL 66
F +LISF +++S DF ++L + + + L+E W V + E +
Sbjct: 4 FFIVLISFLS--LLQASKGFDFD-----EKELETEENVWKLYERWRGH-HSVSRASHEAI 55
Query: 67 ERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPD---LARRKDQSH 123
+RF +F+ N+ H+ TN+K K Y L +N FAD+ H EF+ + G + R +
Sbjct: 56 KRFNVFRHNVLHVHRTNKKNKPYKLKINRFADITHHEFRSSYAGSNVKHHRMLRGPKRGS 115
Query: 124 EDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSE 183
F Y++V +P SVDWR+KGAVT VKNQ CGSCWAFSTVAAVEGIN+I T L SLSE
Sbjct: 116 GGFMYENVTRVPSSVDWREKGAVTEVKNQQDCGSCWAFSTVAAVEGINKIRTNKLVSLSE 175
Query: 184 QELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGT-CEMTKGESEVVTI 242
QEL+DCD N GC GGLM+ AF++I + GG+ EE YPY + C E VTI
Sbjct: 176 QELVDCDTEENQGCAGGLMEPAFEFIKNNGGIKTEETYPYDSSDVQFCRANSIGGETVTI 235
Query: 243 NGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYG 302
+G+ VP+N E+ LLKA+A+QP+SVAI+A DFQ YS GV+ G CGTQL+HGV VGYG
Sbjct: 236 DGHEHVPENDEEELLKAVAHQPVSVAIDAGSSDFQLYSEGVFIGECGTQLNHGVVIVGYG 295
Query: 303 STR-GLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
T+ G Y IV+NSWGP+WGE GY+R++R + EG CGI ASYP K
Sbjct: 296 ETKNGTKYWIVRNSWGPEWGEGGYVRIERGISENEGRCGIAMEASYPTK 344
>gi|50355621|dbj|BAD29959.1| cysteine protease [Daucus carota]
Length = 361
Score = 321 bits (822), Expect = 4e-85, Method: Compositional matrix adjust.
Identities = 161/308 (52%), Positives = 206/308 (66%), Gaps = 4/308 (1%)
Query: 44 LIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIK-NYWLGLNEFADLRHE 102
+ + E WM ++ +VY+ EK RF+IF DN++ I+E N+ + +Y L +NEFAD +E
Sbjct: 53 MFERHEQWMIQYGRVYKDEAEKSVRFQIFMDNVKFIEEFNKDGRQSYKLAVNEFADQTNE 112
Query: 103 EFKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFS 162
EF+ G K ++ R Q+ F Y++V +P S+DWRKKGAVT VK+QG CGSCWAFS
Sbjct: 113 EFQASRNGYKMAVSSRPSQTTL-FRYENVTAVPSSMDWRKKGAVTPVKDQGQCGSCWAFS 171
Query: 163 TVAAVEGINQIVTGNLASLSEQELIDCDNT-YNNGCNGGLMDYAFQYIVSTGGLHKEEDY 221
T+AA EGI ++ TG L SLSEQEL+DCD T + GC GG M+ F++IV G+ E Y
Sbjct: 172 TIAATEGITKLKTGKLISLSEQELVDCDKTGEDQGCEGGYMEDGFEFIVKNKGIALEASY 231
Query: 222 PYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSG 281
PY +GTC + S I+GY VP NSE +LLKA+ANQP+SV+I+ASG FQFYS
Sbjct: 232 PYTAADGTCNSKEEASRAAKISGYEKVPANSETALLKAVANQPVSVSIDASGVAFQFYSS 291
Query: 282 GVYDGHCGTQLDHGVAAVGYGSTR-GLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCG 340
GV+ G CGT LDHGV AVGYG T G Y +VKNSWG WG+ GYI M+R GLCG
Sbjct: 292 GVFTGECGTDLDHGVTAVGYGKTSDGTKYWLVKNSWGASWGDSGYIMMQRGVAAKGGLCG 351
Query: 341 INKMASYP 348
I ASYP
Sbjct: 352 IAMDASYP 359
>gi|297809385|ref|XP_002872576.1| hypothetical protein ARALYDRAFT_489965 [Arabidopsis lyrata subsp.
lyrata]
gi|297318413|gb|EFH48835.1| hypothetical protein ARALYDRAFT_489965 [Arabidopsis lyrata subsp.
lyrata]
Length = 371
Score = 320 bits (821), Expect = 5e-85, Method: Compositional matrix adjust.
Identities = 167/357 (46%), Positives = 224/357 (62%), Gaps = 15/357 (4%)
Query: 8 KTILISFCISFFIRS-SFARDFSIVGYSPEDLTSND----------KLIDLFESWMSKFE 56
K+ ++ ++ I S + A D SIV + +N + +FESWM K
Sbjct: 5 KSAMLVLLLAMVISSCATAMDMSIVSSNDNHHVTNGPGRRQGVFDAEATLMFESWMVKHG 64
Query: 57 KVYESLDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLA 116
KVYES+ EK R IF+DNLR I N + +Y LGLN FADL E+ ++ G P
Sbjct: 65 KVYESVAEKERRLTIFEDNLRFITNRNAENLSYRLGLNRFADLSLHEYAQICHGADPRPP 124
Query: 117 RRKD--QSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIV 174
R S + D LPKSVDWR +GAVT VK+QG C SCWAFSTV AVEG+N+IV
Sbjct: 125 RNHVFMTSSNRYKTSDGDVLPKSVDWRNEGAVTEVKDQGQCRSCWAFSTVGAVEGLNKIV 184
Query: 175 TGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTC-EMT 233
TG L +LSEQ+LI+C N NNGC GG ++ A+++I++ GGL + DYPY G C +
Sbjct: 185 TGELVTLSEQDLINC-NKENNGCGGGKVETAYEFIMNNGGLGTDNDYPYKALNGVCNDRL 243
Query: 234 KGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLD 293
K ++ V I+GY ++P N E +L+KA+A+QP++ +++S R+FQ Y+ GV+DG CGT L+
Sbjct: 244 KENNKNVMIDGYENLPANDESALMKAVAHQPVTAVVDSSSREFQLYASGVFDGTCGTNLN 303
Query: 294 HGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
HGV VGYG+ G DY IV+NS G WGE GY++M RN P GLCGI ASYP+K
Sbjct: 304 HGVVVVGYGTENGRDYWIVRNSRGNTWGEAGYMKMARNIANPRGLCGIAMRASYPLK 360
>gi|357167190|ref|XP_003581045.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like
[Brachypodium distachyon]
Length = 415
Score = 320 bits (820), Expect = 6e-85, Method: Compositional matrix adjust.
Identities = 154/332 (46%), Positives = 213/332 (64%), Gaps = 2/332 (0%)
Query: 19 FIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRH 78
F+ + A ++ + DLT + ++ E WM+K+ +VY + EK +R E+FK N+
Sbjct: 82 FLIAILACTCAVSALAARDLTDDLSMVARHEQWMAKYGRVYNDVAEKAQRLEVFKANVAF 141
Query: 79 IDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSV 138
I+ N + L N+FAD+ +EF+ G KP A + + ++ + LP S+
Sbjct: 142 IELVNAGNDKFSLEANQFADMTVDEFRAAHTGYKPVPANKGRTTQFKYANVSLDALPASM 201
Query: 139 DWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCD-NTYNNGC 197
DWR KGAVT +K+QG CG CWAFSTVA+VEGI ++ TG L SLSEQEL+DCD + + GC
Sbjct: 202 DWRAKGAVTPIKDQGQCGCCWAFSTVASVEGIVKLSTGKLISLSEQELVDCDVDGMDQGC 261
Query: 198 NGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLL 257
GGLMD AF++I+ GGL E +YPY + +C K ++V +I GY DVP N E SLL
Sbjct: 262 EGGLMDNAFEFIIDNGGLTTEGNYPYTGTDDSCNSNKESNDVASIKGYEDVPSNDETSLL 321
Query: 258 KALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTR-GLDYIIVKNSW 316
KA+A QP+S+A++ F+FY GGV G CGT+LDHG+AAVGYG T G + ++KNSW
Sbjct: 322 KAVAAQPVSIAVDGGDNLFRFYKGGVLSGACGTELDHGIAAVGYGITSDGTKFWLMKNSW 381
Query: 317 GPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
G WGEKG+IRM+R+ EGLCG+ SYP
Sbjct: 382 GTSWGEKGFIRMERDIADEEGLCGLAMQPSYP 413
>gi|356545118|ref|XP_003540992.1| PREDICTED: thiol protease SEN102-like [Glycine max]
Length = 337
Score = 320 bits (820), Expect = 6e-85, Method: Compositional matrix adjust.
Identities = 170/352 (48%), Positives = 218/352 (61%), Gaps = 21/352 (5%)
Query: 1 MALSSQFK-TILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVY 59
MA +SQ + TI + ++ I +R + + E WM+++ KVY
Sbjct: 1 MAFTSQKQYTIALFLLLALGIPQMMSRKLH-----------ETSMRERHEQWMAEYGKVY 49
Query: 60 ESLDEKLERFEIFKDNLRHIDETNRKI-KNYWLGLNEFADLRHEEFKEMFLGLKPDLARR 118
+ EK +RF IFK N+ I+ N K Y LG+N ADL EEFK GLK R
Sbjct: 50 KDAAEKEKRFLIFKHNVEFIESFNAAANKPYKLGVNHLADLTVEEFKASRNGLK----RP 105
Query: 119 KDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSC-GSCWAFSTVAAVEGINQIVTGN 177
+ S F Y++V +P ++DWR KGAVT +K+QG C GSCWAFSTVAA EGI+QI TG
Sbjct: 106 YELSTTPFKYENVTAIPAAIDWRTKGAVTSIKDQGQCAGSCWAFSTVAATEGIHQITTGK 165
Query: 178 LASLSEQELIDCDNT-YNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGE 236
L SLSEQEL+DCD + GC GG M+ F++I+ GG+ E +YPY +G C K
Sbjct: 166 LVSLSEQELVDCDTKGVDQGCEGGYMEDGFEFIIKNGGITSEANYPYKAVDGKC--NKAT 223
Query: 237 SEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGV 296
S V I GY VP NSE +L KA+ANQP+SV+I+A+G F FYS G+Y+G CGT+LDHGV
Sbjct: 224 SPVAQIKGYEKVPPNSEKTLQKAVANQPVSVSIDANGEGFMFYSSGIYNGECGTELDHGV 283
Query: 297 AAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
AVGYG G DY +VKNSWG +WGEKGY+RM+R GLCGI +SYP
Sbjct: 284 TAVGYGIANGTDYWLVKNSWGTQWGEKGYVRMQRGVAAKHGLCGIALDSSYP 335
>gi|20260334|gb|AAM13065.1| drought-inducible cysteine proteinase RD21A precursor-like protein
[Arabidopsis thaliana]
gi|23197782|gb|AAN15418.1| drought-inducible cysteine proteinase RD21A precursor-like protein
[Arabidopsis thaliana]
Length = 357
Score = 320 bits (820), Expect = 7e-85, Method: Compositional matrix adjust.
Identities = 169/349 (48%), Positives = 218/349 (62%), Gaps = 11/349 (3%)
Query: 10 ILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLID-----LFESWMSKFEKVYESLDE 64
+LI + A D S+V Y +D + D +FESWM K KVY S+ E
Sbjct: 1 MLILLVAMVIASCATAIDMSVVSY--DDNNRLHSVFDAEASLIFESWMVKHGKVYGSVAE 58
Query: 65 KLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHE 124
K R IF+DNLR I+ N + +Y LGL FADL E+KE+ G P R
Sbjct: 59 KERRLTIFEDNLRFINNRNAENLSYRLGLTGFADLSLHEYKEVCHGADPRPPRNHVFMTS 118
Query: 125 DFSYKDVVD--LPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLS 182
YK D LPKSVDWR +GAVT VK+QG C SCWAFSTV AVEG+N+IVTG L +LS
Sbjct: 119 SDRYKTSADDVLPKSVDWRNEGAVTEVKDQGHCRSCWAFSTVGAVEGLNKIVTGELVTLS 178
Query: 183 EQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCE-MTKGESEVVT 241
EQ+LI+C N NNGC GG ++ A+++I+ GGL + DYPY G C+ K ++ V
Sbjct: 179 EQDLINC-NKENNGCGGGKLETAYEFIMKNGGLGTDNDYPYKAVNGVCDGRLKENNKNVM 237
Query: 242 INGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGY 301
I+GY ++P N E +L+KA+A+QP++ I++S R+FQ Y GV+DG CGT L+HGV VGY
Sbjct: 238 IDGYENLPANDESALMKAVAHQPVTAVIDSSSREFQLYESGVFDGSCGTNLNHGVVVVGY 297
Query: 302 GSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
G+ G DY +VKNS G WGE GY++M RN P GLCGI ASYP+K
Sbjct: 298 GTENGRDYWLVKNSRGITWGEAGYMKMARNIANPRGLCGIAMRASYPLK 346
>gi|356542633|ref|XP_003539771.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 341
Score = 320 bits (819), Expect = 8e-85, Method: Compositional matrix adjust.
Identities = 172/352 (48%), Positives = 220/352 (62%), Gaps = 17/352 (4%)
Query: 1 MALSSQFKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDK-LIDLFESWMSKFEKVY 59
MA F L F I F+ + T D + + E WM+ KVY
Sbjct: 1 MAFKKLFHCTLALFLI-----------FAFCAFEANARTLEDAPMRERHEQWMATHGKVY 49
Query: 60 ESLDEKLERFEIFKDNLRHIDE-TNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARR 118
+ EK ++++IF +N++ I+ N K Y LG+N FADL +EEFK + K + +
Sbjct: 50 KHSYEKEQKYQIFMENVQRIEAFNNAGXKPYKLGINHFADLTNEEFKAINR-FKGHVCSK 108
Query: 119 KDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNL 178
+ ++ F Y++V +P S+DWR+KGAVT +K+QG CG CWAFS VAA EGI ++ TG L
Sbjct: 109 RTRT-TTFRYENVTAVPASLDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGITKLRTGKL 167
Query: 179 ASLSEQELIDCDNT-YNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGES 237
SLSEQEL+DCD + GC GGLMD AF++I+ GL E YPY +GTC +
Sbjct: 168 ISLSEQELVDCDTKGVDQGCEGGLMDDAFKFILQNKGLATEAIYPYEGFDGTCNAKADGN 227
Query: 238 EVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVA 297
+I GY DVP NSE +LLKA+ANQP+SVAIEASG FQFYSGGV+ G CGT LDHGV
Sbjct: 228 HAGSIKGYEDVPANSESALLKAVANQPVSVAIEASGFKFQFYSGGVFTGSCGTNLDHGVT 287
Query: 298 AVGYG-STRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
+VGYG G Y +VKNSWG KWGEKGYIRM+R+ EGLCGI +ASYP
Sbjct: 288 SVGYGVGDDGTKYWLVKNSWGVKWGEKGYIRMQRDVAAKEGLCGIAMLASYP 339
>gi|242094000|ref|XP_002437490.1| hypothetical protein SORBIDRAFT_10g028000 [Sorghum bicolor]
gi|241915713|gb|EER88857.1| hypothetical protein SORBIDRAFT_10g028000 [Sorghum bicolor]
Length = 372
Score = 320 bits (819), Expect = 8e-85, Method: Compositional matrix adjust.
Identities = 164/320 (51%), Positives = 216/320 (67%), Gaps = 15/320 (4%)
Query: 40 SNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHID----ETNRKIKNYWLGLNE 95
++D++ ++E+W S+ + S D++L R E+F+DNLR+ID E + + + LGL
Sbjct: 44 ADDEVRRMYEAWKSEHGHGHGS-DDRL-RLEVFRDNLRYIDAHNAEADAGLHTFRLGLTP 101
Query: 96 FADLRHEEFKEMFLGLKPDLARRKDQSH--EDFSYKDVV---DLPKSVDWRKKGAVTHVK 150
FADL EE++ LG + ARR S SY+ DLP ++DWR+ GAVT VK
Sbjct: 102 FADLTLEEYRGRALGFR---ARRGGASRVGSGSSYRPRPRGGDLPDAIDWRELGAVTGVK 158
Query: 151 NQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIV 210
NQ CG CWAFS VAA+EGIN+IVTGNL SLSEQE+IDCD T + GCNGG M AFQ+++
Sbjct: 159 NQEQCGGCWAFSAVAAIEGINEIVTGNLVSLSEQEIIDCD-TQDGGCNGGEMQNAFQFVI 217
Query: 211 STGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIE 270
+ GG+ E DYPY+ + C+ + VVTI+G+ V +E +L +A+ANQP+SVAI+
Sbjct: 218 NNGGIDTEADYPYLGTDAACDANRVNERVVTIDGFVSVATENETALQEAVANQPVSVAID 277
Query: 271 ASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKR 330
ASGR FQ Y+ G+++G CGTQLDHGV AVGYGS G DY IVKNSW WGE GYIR++R
Sbjct: 278 ASGRKFQHYTSGIFNGPCGTQLDHGVTAVGYGSENGKDYWIVKNSWSSSWGEAGYIRIRR 337
Query: 331 NTGKPEGLCGINKMASYPIK 350
N G CGI ASYP+K
Sbjct: 338 NVAAATGKCGIAMDASYPVK 357
>gi|226506492|ref|NP_001140873.1| uncharacterized protein LOC100272949 precursor [Zea mays]
gi|194701540|gb|ACF84854.1| unknown [Zea mays]
Length = 379
Score = 320 bits (819), Expect = 9e-85, Method: Compositional matrix adjust.
Identities = 179/354 (50%), Positives = 222/354 (62%), Gaps = 16/354 (4%)
Query: 8 KTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLE 67
KT+L+ + F+ S+ + + DL S++ L DL+E W + +V+ EK
Sbjct: 6 KTLLLVALV--FVSSAAVELCRAIDFDERDLASDEALWDLYERWQTH-HRVHRHHGEKGR 62
Query: 68 RFEIFKDNLRHIDETN-RKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSH--- 123
RF FK+N+R I N R + Y L LN F D+ EEF+ F + + RR+D
Sbjct: 63 RFGTFKENVRFIHAHNKRGDRPYRLRLNRFGDMGREEFRSTFADSRINDLRRQDSPAARA 122
Query: 124 ---EDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLAS 180
F Y D P+SVDWR++GAVT VK QG CGSCWAFSTV AVEGIN I TG+LAS
Sbjct: 123 GAVPGFMYDSAADPPRSVDWRQEGAVTGVKVQGHCGSCWAFSTVVAVEGINAIRTGSLAS 182
Query: 181 LSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGE---S 237
LSEQELIDCD T NGC GGLM+ AF++I S GG+ E YPY GTC+ +
Sbjct: 183 LSEQELIDCD-TDENGCQGGLMENAFEFIKSFGGITTEAAYPYRASNGTCDGDRARRGGG 241
Query: 238 EVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVA 297
VV I+G+ VP SED+L KA+A+QP+SVA++A G+ FQFYS GV+ G CGT LDHGVA
Sbjct: 242 VVVVIDGHQMVPAGSEDALAKAVAHQPVSVAVDAGGQAFQFYSEGVFTGDCGTDLDHGVA 301
Query: 298 AVGYG-STRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
AVGYG G Y IVKNSWG WGE GYIRM+R G GLCGI AS+PIK
Sbjct: 302 AVGYGVGDDGTPYWIVKNSWGTSWGEGGYIRMQRGAGN-GGLCGIAMEASFPIK 354
>gi|225446589|ref|XP_002280263.1| PREDICTED: vignain [Vitis vinifera]
Length = 339
Score = 319 bits (818), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 159/308 (51%), Positives = 208/308 (67%), Gaps = 8/308 (2%)
Query: 44 LIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKI-KNYWLGLNEFADLRHE 102
+ + E WM+++ ++Y+ +EK +RF+IFKDN+ I+ N+ + K Y L +NEFADL +E
Sbjct: 35 MYERHEDWMARYGRMYKDANEKEKRFKIFKDNVARIESFNKAMDKTYKLSINEFADLTNE 94
Query: 103 EFKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFS 162
EF+ + K + F Y++V +P ++DWRKKGAVT +K+Q CG CWAFS
Sbjct: 95 EFRSLRNRFKAHICSEATT----FKYENVTAVPSTIDWRKKGAVTPIKDQQQCGCCWAFS 150
Query: 163 TVAAVEGINQIVTGNLASLSEQELIDCDNT-YNNGCNGGLMDYAFQYIVSTGGLHKEEDY 221
VAA EGI QI TG L SLSEQEL+DCD N GC+GGLMD AF++I GL E Y
Sbjct: 151 AVAATEGITQITTGKLISLSEQELVDCDTGGENQGCSGGLMDDAFRFI-KIHGLASEATY 209
Query: 222 PYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSG 281
PY ++GTC K I GY DVP N+E +L KA+A+QP++VAI+A G +FQFY+
Sbjct: 210 PYEGDDGTCNSKKEAHPAAKIKGYEDVPANNEKALQKAVAHQPVAVAIDAGGFEFQFYTS 269
Query: 282 GVYDGHCGTQLDHGVAAVGYG-STRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCG 340
GV+ G CGT+LDHGVAAVGYG G+ Y +VKNSWG WGE+GYIRM+R+ EGLCG
Sbjct: 270 GVFTGQCGTELDHGVAAVGYGIGDDGMMYWLVKNSWGTGWGEEGYIRMQRDVTAKEGLCG 329
Query: 341 INKMASYP 348
I ASYP
Sbjct: 330 IAMQASYP 337
>gi|146215978|gb|ABQ10191.1| actinidin Act1c [Actinidia eriantha]
Length = 368
Score = 319 bits (818), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 160/315 (50%), Positives = 214/315 (67%), Gaps = 9/315 (2%)
Query: 40 SNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKI-KNYWLGLNEFAD 98
+ND++ ++ESW+ K K Y SL E+ RFEIFK+ LR IDE N ++Y +GLN+FAD
Sbjct: 30 TNDEVKAMYESWLIKHGKSYNSLGERERRFEIFKETLRFIDEHNADTSRSYKVGLNQFAD 89
Query: 99 LRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVD--LPKSVDWRKKGAVTHVKNQGSCG 156
L +EEF+ +LG R +++ Y+ V LP VDWR +GAV +KNQG CG
Sbjct: 90 LTNEEFRSTYLGF----TRGSNKTKVSNRYEPRVGQVLPDYVDWRSEGAVVDIKNQGQCG 145
Query: 157 SCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNN-GCNGGLMDYAFQYIVSTGGL 215
SCWAFS +AAVEGIN+IVTGNL SLSEQEL+DC T + GC+GG M F++I++ GG+
Sbjct: 146 SCWAFSAIAAVEGINKIVTGNLISLSEQELVDCGRTQSTKGCDGGYMTDGFEFIINNGGI 205
Query: 216 HKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRD 275
+ EE+YPY +EG C++ + VTI+ Y +VP +E +L A+A QP+SVA+E++G
Sbjct: 206 NTEENYPYTAQEGQCDLNLQNEKYVTIDNYENVPYYNEWALQTAVAYQPVSVALESAGDA 265
Query: 276 FQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKP 335
FQ YS G++ G CGT DH V VGYG+ G+DY IVKNSW WGE+GY+R+ RN G
Sbjct: 266 FQHYSSGIFTGPCGTATDHAVTIVGYGTEGGIDYWIVKNSWDTTWGEEGYMRILRNVGGA 325
Query: 336 EGLCGINKMASYPIK 350
G CGI M SYP+K
Sbjct: 326 -GTCGIATMPSYPVK 339
>gi|18413507|ref|NP_567377.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|30315953|sp|Q9SUS9.1|CPR4_ARATH RecName: Full=Probable cysteine proteinase At4g11320; Flags:
Precursor
gi|5596478|emb|CAB51416.1| drought-inducible cysteine proteinase RD21A precursor-like protein
[Arabidopsis thaliana]
gi|7267831|emb|CAB81233.1| drought-inducible cysteine proteinase RD21A precursor-like protein
[Arabidopsis thaliana]
gi|14334764|gb|AAK59560.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|15293257|gb|AAK93739.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|332657596|gb|AEE82996.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 371
Score = 319 bits (817), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 168/357 (47%), Positives = 225/357 (63%), Gaps = 15/357 (4%)
Query: 8 KTILISFCISFFIRS-SFARDFSIVGYSPEDLTS----------NDKLIDLFESWMSKFE 56
K+ ++ F ++ I S + A D S+V + + + + +FESWM K
Sbjct: 5 KSAMLIFLLALVIASCATAMDMSVVSSNDNHHVTAGPGRRQGIFDAEATLMFESWMVKHG 64
Query: 57 KVYESLDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLA 116
KVY+S+ EK R IF+DNLR I N + +Y LGLN FADL E+ E+ G P
Sbjct: 65 KVYDSVAEKERRLTIFEDNLRFITNRNAENLSYRLGLNRFADLSLHEYGEICHGADPRPP 124
Query: 117 RRKD--QSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIV 174
R S + D LPKSVDWR +GAVT VK+QG C SCWAFSTV AVEG+N+IV
Sbjct: 125 RNHVFMTSSNRYKTSDGDVLPKSVDWRNEGAVTEVKDQGLCRSCWAFSTVGAVEGLNKIV 184
Query: 175 TGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCE-MT 233
TG L +LSEQ+LI+C N NNGC GG ++ A+++I++ GGL + DYPY G CE
Sbjct: 185 TGELVTLSEQDLINC-NKENNGCGGGKVETAYEFIMNNGGLGTDNDYPYKALNGVCEGRL 243
Query: 234 KGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLD 293
K +++ V I+GY ++P N E +L+KA+A+QP++ +++S R+FQ Y GV+DG CGT L+
Sbjct: 244 KEDNKNVMIDGYENLPANDEAALMKAVAHQPVTAVVDSSSREFQLYESGVFDGTCGTNLN 303
Query: 294 HGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
HGV VGYG+ G DY IVKNS G WGE GY++M RN P GLCGI ASYP+K
Sbjct: 304 HGVVVVGYGTENGRDYWIVKNSRGDTWGEAGYMKMARNIANPRGLCGIAMRASYPLK 360
>gi|242088413|ref|XP_002440039.1| hypothetical protein SORBIDRAFT_09g024940 [Sorghum bicolor]
gi|241945324|gb|EES18469.1| hypothetical protein SORBIDRAFT_09g024940 [Sorghum bicolor]
Length = 463
Score = 319 bits (817), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 158/314 (50%), Positives = 206/314 (65%), Gaps = 13/314 (4%)
Query: 47 LFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIK---------NYWLGLNEFA 97
LF++W ++ K Y + +E+ R +F DN + N ++ +Y L LN FA
Sbjct: 40 LFDAWCAEHGKAYATPEERAARLAVFADNAAFVAAHNARVNAAGGGGAPPSYTLALNAFA 99
Query: 98 DLRHEEFKEMFLG-LKPDLARRKDQSHEDFSYKD--VVDLPKSVDWRKKGAVTHVKNQGS 154
DL HEEF+ LG + A + + + D + +P ++DWR+ GAVT VK+QGS
Sbjct: 100 DLTHEEFRAARLGRIAAGAAALRSPAAPVYRGLDGGLGAVPDALDWRENGAVTKVKDQGS 159
Query: 155 CGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGG 214
CG+CW+FS A+EGIN+I TG+L SLSEQELIDCD +YN+GC GGLMDYA++++V GG
Sbjct: 160 CGACWSFSATGAMEGINKIKTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYKFVVKNGG 219
Query: 215 LHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGR 274
+ EEDYPY +GTC K + +VTI+GY DVP N ED LL+A+A QP+SV I S R
Sbjct: 220 IDTEEDYPYREADGTCNKNKLKKRIVTIDGYSDVPSNKEDLLLQAVAQQPVSVGICGSAR 279
Query: 275 DFQFYS-GGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTG 333
FQ YS G++DG C T LDH V VGYGS G DY IVKNSWG WG KGY+ M RNTG
Sbjct: 280 AFQLYSQQGIFDGPCPTSLDHAVLIVGYGSEGGKDYWIVKNSWGESWGMKGYMHMHRNTG 339
Query: 334 KPEGLCGINKMASY 347
+G+CGIN MAS+
Sbjct: 340 DSKGVCGINMMASF 353
>gi|357126406|ref|XP_003564878.1| PREDICTED: cysteine proteinase EP-B 1-like [Brachypodium
distachyon]
Length = 377
Score = 318 bits (815), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 169/339 (49%), Positives = 215/339 (63%), Gaps = 18/339 (5%)
Query: 29 SIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKN 88
S + + +DL S + L +L+ W S + EK RF FK N+ I N ++ +
Sbjct: 23 SAIPFDAKDLESEEALWELYTRWQSAHRLPPQHHAEKHRRFGTFKSNVLFIHAHNTRLND 82
Query: 89 ---------YWLGLNEFADLRHEEFKEMFLGLKPDLAR--RKDQSHEDFSYKDVVDLPKS 137
Y L LN F D+ EF+ F G L R R QS F Y V D+P++
Sbjct: 83 TSTNNNGPSYRLRLNRFGDMDQAEFRSTFAG---PLHRHTRPAQSIPGFIYDTVKDIPQA 139
Query: 138 VDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYN-NG 196
VDWR+KGAVT VK+QG CGSCWAFS VA+VEG+N I TG+L SLSEQELIDCD + NG
Sbjct: 140 VDWRQKGAVTGVKDQGKCGSCWAFSAVASVEGLNAIRTGSLVSLSEQELIDCDTGGDDNG 199
Query: 197 CNGGLMDYAFQYIV-STGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDS 255
C GGLM+ AF++I S GGL E YPY GTC +G S V I+G+ VP +E++
Sbjct: 200 CQGGLMESAFEFIAHSAGGLATEAAYPYHASNGTCNANRGSSVSVRIDGHQSVPAGNEEA 259
Query: 256 LLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTR--GLDYIIVK 313
L KA+A+QP+SVAI+A G+ FQFYS GV+ G CG++LDHGVA VGYG G +Y IVK
Sbjct: 260 LAKAVAHQPVSVAIDAGGQAFQFYSEGVFTGDCGSELDHGVAVVGYGVAEEDGKEYWIVK 319
Query: 314 NSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIKKK 352
NSWGP WGE GY+RM+R++G GLCGI ASYP+K +
Sbjct: 320 NSWGPGWGEHGYVRMQRDSGVDGGLCGIAMEASYPVKNE 358
>gi|414587996|tpg|DAA38567.1| TPA: hypothetical protein ZEAMMB73_390779 [Zea mays]
Length = 343
Score = 318 bits (815), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 159/341 (46%), Positives = 217/341 (63%), Gaps = 6/341 (1%)
Query: 13 SFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIF 72
+F + I + A F + +L+ + + + E WM+ + +VY+ EK RFE+F
Sbjct: 6 AFLLLLAILTGCACSFPSPVLAARELSDDAAMAERHERWMAVYGRVYKDAAEKARRFEVF 65
Query: 73 KDNLRHIDETNRKIKN-YWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDV 131
KDNL ++ N KN +WLG+N+FADL EEFK G KP A + + V
Sbjct: 66 KDNLAFVESFNADKKNKFWLGVNQFADLTTEEFKAN-KGFKPISAEEVPTTGFKYENLSV 124
Query: 132 VDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCD- 190
LP +VDWR KGAVT +KNQG CG CWAFS VAA+EGI ++ T NL SLSEQEL+DCD
Sbjct: 125 SALPTAVDWRTKGAVTPIKNQGQCGCCWAFSAVAAMEGIVKLSTDNLVSLSEQELVDCDT 184
Query: 191 NTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQ 250
++ + GC GG MD AF++++ GGL E YPY +G C+ G TI G+ DVP
Sbjct: 185 HSMDEGCEGGWMDSAFEFVIKNGGLATESSYPYKAVDGKCK--GGSKSAATIKGHEDVPP 242
Query: 251 NSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYG-STRGLDY 309
N+E +L+KA+A+QP+SVA++AS R F YSGGV G CGTQLDHG+AA+GYG + G Y
Sbjct: 243 NNEAALMKAVASQPVSVAVDASDRTFMLYSGGVMTGSCGTQLDHGIAAIGYGVESDGTKY 302
Query: 310 IIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
I+KNSWG WGEK ++RM+++ +G+CG+ SYP +
Sbjct: 303 WILKNSWGTTWGEKRFLRMEKDISDKQGMCGLAMKPSYPTE 343
>gi|297794671|ref|XP_002865220.1| senescence-associated gene 12 [Arabidopsis lyrata subsp. lyrata]
gi|297311055|gb|EFH41479.1| senescence-associated gene 12 [Arabidopsis lyrata subsp. lyrata]
Length = 346
Score = 318 bits (815), Expect = 3e-84, Method: Compositional matrix adjust.
Identities = 166/349 (47%), Positives = 220/349 (63%), Gaps = 14/349 (4%)
Query: 7 FKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKL 66
FK + I ++ F SSF S+ +L + I+ WM+K +VY + EK
Sbjct: 3 FKHMQIFLFVAIF--SSFYFSISLSRPLDNELIMQKRHIE----WMTKHGRVYADVKEKS 56
Query: 67 ERFEIFKDNLRHIDETNR--KIKNYWLGLNEFADLRHEEFKEMFLGLK--PDLARRKDQS 122
R+ +FK N+ I+ N + + L +N+FADL ++EF+ M+ G K L+ +
Sbjct: 57 NRYVVFKSNVERIEHLNNIPAGRTFKLAVNQFADLTNDEFRSMYTGFKGVSSLSSQSQTK 116
Query: 123 HEDFSYKDVVD--LPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLAS 180
F Y++V LP SVDWR KGAVT +KNQGSCG CWAFS VAA+EG QI G L S
Sbjct: 117 TTSFRYQNVSSGALPISVDWRTKGAVTPIKNQGSCGCCWAFSAVAAIEGATQIKKGKLIS 176
Query: 181 LSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVV 240
LSEQ+L+DCD T + GC GGLMD AF++I++TGGL E +YPY E+ TC K +
Sbjct: 177 LSEQQLVDCD-TNDFGCEGGLMDTAFEHIMATGGLTTESNYPYKGEDATCNSKKTNPKAT 235
Query: 241 TINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVG 300
+I GY DVP N E +L+KA+A+QP+SV IE G DFQFYS GV+ G C T LDH V A+G
Sbjct: 236 SITGYEDVPVNDEQALMKAVAHQPVSVGIEGGGFDFQFYSSGVFTGECTTYLDHAVTAIG 295
Query: 301 YG-STRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
YG ST G Y I+KNSWG KWGE GY+R++++ +GLCG+ ASYP
Sbjct: 296 YGQSTNGSKYWIIKNSWGTKWGESGYMRIQKDIKDKQGLCGLAMKASYP 344
>gi|224114698|ref|XP_002316833.1| predicted protein [Populus trichocarpa]
gi|222859898|gb|EEE97445.1| predicted protein [Populus trichocarpa]
Length = 305
Score = 318 bits (814), Expect = 3e-84, Method: Compositional matrix adjust.
Identities = 164/307 (53%), Positives = 205/307 (66%), Gaps = 7/307 (2%)
Query: 45 IDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNR-KIKNYWLGLNEFADLRHEE 103
++ E+WM+++ + Y+ EK R IFK+N+ I+ N+ K Y L +NEFADL +EE
Sbjct: 1 MERHETWMAQYGRAYKGHVEKERRLNIFKNNVEFIESFNKVGKKPYKLSVNEFADLTNEE 60
Query: 104 FKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFST 163
F+ G K A S + F Y++V +P ++DWRKKGAVT +K+QG CG CWAFS
Sbjct: 61 FQASRNGYKMS-AHLSSSSTKPFRYENVSAVPSTMDWRKKGAVTPIKDQGQCGCCWAFSA 119
Query: 164 VAAVEGINQIVTGNLASLSEQELIDCDNT-YNNGCNGGLMDYAFQYIVSTGGLHKEEDYP 222
VAA EGI Q+ TG L SLSEQEL+DCD + + GCNGGLMD AF +I+ GL E +YP
Sbjct: 120 VAATEGITQLSTGKLISLSEQELVDCDTSGEDQGCNGGLMDDAFDFIIQNKGLTTEANYP 179
Query: 223 YIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGG 282
Y +G C K ++ I GY DVP NSE +LLKA+ANQP+SVAI+A G FQFYS G
Sbjct: 180 YQGADGACNSGKAAAK---ITGYEDVPANSEAALLKAVANQPVSVAIDAGGSAFQFYSSG 236
Query: 283 VYDGHCGTQLDHGVAAVGYG-STRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGI 341
V+ G CGT LDHGV AVGYG S G Y +VKNSWG WGE GYIRM+R+ EGLCGI
Sbjct: 237 VFTGDCGTDLDHGVTAVGYGMSDDGTKYWLVKNSWGTSWGENGYIRMERDIDAQEGLCGI 296
Query: 342 NKMASYP 348
ASYP
Sbjct: 297 AMEASYP 303
>gi|413944253|gb|AFW76902.1| hypothetical protein ZEAMMB73_056195 [Zea mays]
Length = 340
Score = 318 bits (814), Expect = 3e-84, Method: Compositional matrix adjust.
Identities = 160/349 (45%), Positives = 220/349 (63%), Gaps = 22/349 (6%)
Query: 9 TILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLER 68
+IL +FF ++ A DL + ++ E WM+++ +VY+ EK R
Sbjct: 7 SILAVLSFAFFCGAALA---------ARDLNEDSAMVARHEQWMAQYSRVYKDAAEKARR 57
Query: 69 FEIFKDNLRHIDETNRK-IKNYWLGLNEFADLRHEEFKEMFL--GLKPDLARRKDQSHED 125
FE+FK N++ I+ N + +WLG+N+FADL ++EF+ G KP L D+
Sbjct: 58 FEVFKANVKFIESFNTGGNRKFWLGINQFADLTNDEFRTTKTNKGFKPSL----DKVSTG 113
Query: 126 FSYKDV-VD-LPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSE 183
F Y++V VD +P ++DWR GAVT +K+QG CG CWAFS VAA EGI +I TG L SLSE
Sbjct: 114 FRYENVSVDAIPATIDWRTNGAVTPIKDQGQCGCCWAFSAVAATEGIVKISTGKLISLSE 173
Query: 184 QELIDCD-NTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTI 242
QEL+DCD + + GC GGLMD AF++I+ GGL E +YPY +G C+ G + I
Sbjct: 174 QELVDCDVHGEDQGCEGGLMDDAFKFIIKNGGLTTESNYPYTAADGKCK--SGSNSAANI 231
Query: 243 NGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYG 302
GY DVP N E +L+KA+ANQP+SVA++ FQFYSGGV G CGT LDHG+AA+GYG
Sbjct: 232 KGYEDVPTNDEAALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYG 291
Query: 303 STR-GLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
T G Y ++KNSWG WGE GY+RM+++ +G+CG+ SYP +
Sbjct: 292 KTSDGTKYWLMKNSWGTTWGENGYLRMEKDISDKKGMCGLAMEPSYPTE 340
>gi|10336513|dbj|BAB13759.1| cysteine proteinase [Astragalus sinicus]
Length = 343
Score = 318 bits (814), Expect = 3e-84, Method: Compositional matrix adjust.
Identities = 159/307 (51%), Positives = 206/307 (67%), Gaps = 10/307 (3%)
Query: 49 ESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKNYW-LGLNEFADLRHEEF--- 104
+ WM ++ K+Y E +RF+IFK+N+ +I+ +N++ ++ LG+N+F DL +EEF
Sbjct: 40 QQWMGQYAKIYNDHQEWEKRFQIFKENVNYIETSNKEGGRFYKLGVNQFVDLTNEEFIAP 99
Query: 105 KEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTV 164
+ F G R + + Y++V +P +VDWR+KGAVT VK+QG CG CWAFS V
Sbjct: 100 RNRFKGHMCSSIIRTN----TYKYENVTTVPSNVDWRQKGAVTPVKDQGQCGCCWAFSAV 155
Query: 165 AAVEGINQIVTGNLASLSEQELIDCDNT-YNNGCNGGLMDYAFQYIVSTGGLHKEEDYPY 223
AA EGI+Q+ TG L SLSEQEL+DCD + GC GGLMD AF++I+ GL E YPY
Sbjct: 156 AATEGIHQLSTGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFIIQNHGLDTEAKYPY 215
Query: 224 IMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGV 283
+GTC + TI Y DVP N+E +L KA+ANQP+SVAI+ASG DFQFY+ GV
Sbjct: 216 QGVDGTCNANEASINAATITSYEDVPTNNEQALQKAVANQPISVAIDASGSDFQFYTSGV 275
Query: 284 YDGHCGTQLDHGVAAVGYG-STRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGIN 342
+ G CGT+LDHGV AVGYG S G Y +VKNSWG WGE+GYIRM+R EGLCGI
Sbjct: 276 FTGSCGTELDHGVTAVGYGVSDDGTKYWLVKNSWGTSWGEEGYIRMQRGVDAVEGLCGIA 335
Query: 343 KMASYPI 349
ASYPI
Sbjct: 336 MQASYPI 342
>gi|146215992|gb|ABQ10198.1| actinidin Act4b [Actinidia eriantha]
Length = 379
Score = 317 bits (813), Expect = 5e-84, Method: Compositional matrix adjust.
Identities = 156/329 (47%), Positives = 225/329 (68%), Gaps = 7/329 (2%)
Query: 25 ARDFSIVGYSPE-DLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETN 83
A D SI+ Y+ + + +ND+++ +FESW+ ++ K Y +L EK RFEIFKDNLR +DE N
Sbjct: 24 AFDASIITYAKKWEQRTNDEVMAMFESWLVEYGKSYNALGEKERRFEIFKDNLRFVDEHN 83
Query: 84 RKI-KNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRK 142
+ ++Y +GLN+F+DL EE+ ++LG K D+ R + + + LP S+DWRK
Sbjct: 84 ADVNRSYKVGLNQFSDLTLEEYSSIYLGTKFDM--RMTNVSDRYEPRVGDQLPNSIDWRK 141
Query: 143 KGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDN-TYNNGCNGGL 201
KGAV VKNQG+CGSCW F+ +AAVE INQIVTGNL SLSEQ+++DC + NNGC GG
Sbjct: 142 KGAVLGVKNQGNCGSCWTFAPIAAVEAINQIVTGNLISLSEQQIVDCQRKSPNNGCKGGS 201
Query: 202 MDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALA 261
A+Q+I+ GG++ E +YPY ++G C+ K + + VTI+ Y +VP+ +E +L KA++
Sbjct: 202 RAGAYQFIIDNGGINTEANYPYKAQDGECDEQKNQ-KYVTIDRYENVPRKNEKALQKAVS 260
Query: 262 NQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWG 321
NQ +SV I ++ +F+ Y G++ G CG ++DH V VGYG+ G+DY IV+NSWG WG
Sbjct: 261 NQLVSVGIASNSSEFKAYKSGIFTGPCGAKIDHAVTIVGYGTEGGMDYWIVRNSWGSNWG 320
Query: 322 EKGYIRMKRNTGKPEGLCGINKMASYPIK 350
E GY+RM+RN G G C I +YP+K
Sbjct: 321 ENGYVRMQRNVGNA-GTCFIATSPNYPVK 348
>gi|242072398|ref|XP_002446135.1| hypothetical protein SORBIDRAFT_06g002170 [Sorghum bicolor]
gi|241937318|gb|EES10463.1| hypothetical protein SORBIDRAFT_06g002170 [Sorghum bicolor]
Length = 338
Score = 317 bits (812), Expect = 5e-84, Method: Compositional matrix adjust.
Identities = 154/314 (49%), Positives = 209/314 (66%), Gaps = 6/314 (1%)
Query: 40 SNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKN-YWLGLNEFAD 98
S+ +++ E+WM ++ +VY+ EK RFE+FKDN+ ++ N N +WLG+N+FAD
Sbjct: 28 SDAAMVERHENWMVEYGRVYKDAAEKARRFEVFKDNVAFVESFNTNKNNKFWLGINQFAD 87
Query: 99 LRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSC 158
L EEFK G KP A + + + V LP +VDWR KGAVT +KNQG CG C
Sbjct: 88 LTIEEFKAN-KGFKPISAEKVPTTGFKYENLSVSALPTAVDWRTKGAVTPIKNQGQCGCC 146
Query: 159 WAFSTVAAVEGINQIVTGNLASLSEQELIDCD-NTYNNGCNGGLMDYAFQYIVSTGGLHK 217
WAFS VAA+EGI ++ TGNL SLSEQEL+DCD ++ + GC GG MD AF++++ GGL
Sbjct: 147 WAFSAVAAMEGIVKLSTGNLISLSEQELVDCDTHSMDEGCEGGWMDSAFEFVIKNGGLAT 206
Query: 218 EEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQ 277
YPY +G C+ G TI G+ DVP N E +L+KA+ANQP+SVA++AS R F
Sbjct: 207 VSSYPYKAVDGKCK--GGSKSAATIKGHEDVPVNDEAALMKAVANQPVSVAVDASDRTFM 264
Query: 278 FYSGGVYDGHCGTQLDHGVAAVGYG-STRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPE 336
YSGGV G CGT+LDHG+AA+GYG + G Y I+KNSWG WGEKG++RM+++ +
Sbjct: 265 LYSGGVMTGSCGTELDHGIAAIGYGVESDGTKYWILKNSWGTTWGEKGFLRMEKDISDKQ 324
Query: 337 GLCGINKMASYPIK 350
G+CG+ SYP +
Sbjct: 325 GMCGLAMKPSYPTE 338
>gi|326490904|dbj|BAJ90119.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 457
Score = 317 bits (812), Expect = 5e-84, Method: Compositional matrix adjust.
Identities = 155/313 (49%), Positives = 201/313 (64%), Gaps = 11/313 (3%)
Query: 48 FESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKI-------KNYWLGLNEFADLR 100
FE+W ++ K Y + E+ R F +N + N + +Y L LN FADL
Sbjct: 39 FEAWCAEHGKAYATPGERAARLAAFAENAAFVAAHNDAVASSGPGGPSYTLALNAFADLT 98
Query: 101 HEEFKEMFLG---LKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGS 157
H+EF+ LG + P S F + V +P ++DWR+ GAVT VK+QGSCG+
Sbjct: 99 HDEFRAARLGRLAVGPGPLGAPSPSDGGFEGR-VGAVPDALDWRQSGAVTKVKDQGSCGA 157
Query: 158 CWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHK 217
CW+FS A+EGIN+I TG+L SLSEQELIDCD +YN GC GGLM YA+++++ GG+
Sbjct: 158 CWSFSATGAMEGINKITTGSLLSLSEQELIDCDRSYNTGCGGGLMTYAYKFVIKNGGIDT 217
Query: 218 EEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQ 277
E+DYP+ +GTC K + VVTI+GY +VP + ED LL+A+A QP+SV I S R FQ
Sbjct: 218 EDDYPFREADGTCNKNKLKKHVVTIDGYKEVPSSKEDLLLQAVAQQPISVGICGSARAFQ 277
Query: 278 FYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEG 337
YS G++DG C T LDH V VGYGS G DY IVKNSWG +WG KGY+ M RNTG G
Sbjct: 278 LYSQGIFDGPCPTSLDHAVLIVGYGSEGGKDYWIVKNSWGERWGMKGYMHMHRNTGSSSG 337
Query: 338 LCGINKMASYPIK 350
+CGIN MAS+P K
Sbjct: 338 ICGINMMASFPTK 350
>gi|225443827|ref|XP_002274223.1| PREDICTED: vignain-like [Vitis vinifera]
Length = 340
Score = 317 bits (812), Expect = 6e-84, Method: Compositional matrix adjust.
Identities = 169/351 (48%), Positives = 220/351 (62%), Gaps = 16/351 (4%)
Query: 1 MALSSQFKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYE 60
MAL S+ CI+ I +A + + +++ +++ E WM + + Y+
Sbjct: 1 MALESKI------ICITLLIMGVWASQ--ALSRTLHEVSMSER----HEDWMGLYGRTYK 48
Query: 61 SLDEKLERFEIFKDNLRHIDETNRK-IKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRK 119
+ EK RF+IFK+N+ +I+ N + Y L +NEFAD +EEFK G +R +
Sbjct: 49 DIAEKERRFKIFKENVEYIESVNSAGNRRYKLSINEFADQTNEEFKASRNGYNMS-SRPR 107
Query: 120 DQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLA 179
F Y++V +P S+DWRKKGAVT +K+QG CG CWAFS VAA+EG+ Q+ TG L
Sbjct: 108 SSEITSFRYENVAAVPSSMDWRKKGAVTPIKDQGQCGCCWAFSAVAAMEGVTQLKTGELI 167
Query: 180 SLSEQELIDCDNT-YNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESE 238
SLSEQEL+DCD + + GC GGLMD AF++I+ GGL E +YPY + TC K S
Sbjct: 168 SLSEQELVDCDTSGEDQGCGGGLMDSAFEFIIGNGGLTTEANYPYKGVDATCNKKKAASS 227
Query: 239 VVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAA 298
I Y DVP NSE +LLKA+A P+SVAI+A G DFQFYS GV+ G CGT+LDHGV A
Sbjct: 228 AAKIKNYEDVPANSEAALLKAVAQHPVSVAIDAGGSDFQFYSSGVFTGQCGTELDHGVTA 287
Query: 299 VGYGST-RGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
VGYG T G Y +VKNSWG WGE GYI M+R+ G EGLCGI ASYP
Sbjct: 288 VGYGKTDDGTKYWLVKNSWGTGWGEDGYIWMERDIGADEGLCGIAMEASYP 338
>gi|242072394|ref|XP_002446133.1| hypothetical protein SORBIDRAFT_06g002160 [Sorghum bicolor]
gi|241937316|gb|EES10461.1| hypothetical protein SORBIDRAFT_06g002160 [Sorghum bicolor]
Length = 338
Score = 317 bits (812), Expect = 6e-84, Method: Compositional matrix adjust.
Identities = 155/314 (49%), Positives = 208/314 (66%), Gaps = 6/314 (1%)
Query: 40 SNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKN-YWLGLNEFAD 98
S+ +++ E+WM ++ +VY+ EK RFE FK N+ ++ N KN +WLG+N+FAD
Sbjct: 28 SDAAMVERHENWMVEYGRVYKDAAEKARRFEAFKHNVAFVESFNTNKKNKFWLGVNQFAD 87
Query: 99 LRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSC 158
L EEFK G KP A + + V LP +VDWR KGAVT +KNQG CG C
Sbjct: 88 LTTEEFKAN-KGFKPISAEMVPTTGFKYENLSVSALPTAVDWRTKGAVTPIKNQGQCGCC 146
Query: 159 WAFSTVAAVEGINQIVTGNLASLSEQELIDCD-NTYNNGCNGGLMDYAFQYIVSTGGLHK 217
WAFS VAA+EGI ++ TGNL SLSEQEL+DCD ++ + GC GG MD AF++++ GGL
Sbjct: 147 WAFSAVAAMEGIVKLSTGNLISLSEQELVDCDTHSMDEGCEGGWMDSAFEFVIKNGGLAT 206
Query: 218 EEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQ 277
E YPY +G C+ G TI G+ DVP N E +L+KA+ANQP+SVA++AS R F
Sbjct: 207 ESSYPYKAVDGKCK--GGSKSAATIKGHEDVPVNDEAALMKAVANQPVSVAVDASDRTFM 264
Query: 278 FYSGGVYDGHCGTQLDHGVAAVGYG-STRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPE 336
YSGGV G CGT+LDHG+AA+GYG + G Y I+KNSWG WGEKG++RM+++ +
Sbjct: 265 LYSGGVMTGSCGTELDHGIAAIGYGVESDGTKYWILKNSWGTTWGEKGFLRMEKDISDKQ 324
Query: 337 GLCGINKMASYPIK 350
G+CG+ SYP +
Sbjct: 325 GMCGLAMKPSYPTE 338
>gi|357160591|ref|XP_003578813.1| PREDICTED: vignain-like [Brachypodium distachyon]
Length = 339
Score = 317 bits (811), Expect = 7e-84, Method: Compositional matrix adjust.
Identities = 162/343 (47%), Positives = 220/343 (64%), Gaps = 18/343 (5%)
Query: 10 ILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERF 69
+ I C+ FF AR+ + +DL+ ++ ESWMS++ + Y+ EK +F
Sbjct: 9 LAILGCLCFFASGLAARELN------DDLS----MVARHESWMSQYGRSYKDAAEKDRKF 58
Query: 70 EIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYK 129
E+FK N ID N K +WLG+N+FAD+ +EEFK K ++ FSY+
Sbjct: 59 EVFKANAAFIDSFNAKNHKFWLGINQFADITNEEFK--VTKTNKGFISNKVRASTGFSYE 116
Query: 130 DV-VD-LPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELI 187
+V +D LP ++DWR KGAVT VK+QG CG CWAFS VAA EGI ++ TG L SLSEQEL+
Sbjct: 117 NVSIDALPATIDWRTKGAVTPVKDQGQCGCCWAFSAVAATEGIVKLSTGKLVSLSEQELV 176
Query: 188 DCD-NTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYH 246
DCD + + GC GGLMD AF++I++ GGL +E YPY E+G C+ G TI Y
Sbjct: 177 DCDVHGEDQGCEGGLMDDAFKFIITNGGLTQESSYPYDAEDGKCK--SGSKSAGTIKSYE 234
Query: 247 DVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTR- 305
DVP N+E +L+KA+ANQP+SVA++ FQFYSGGV G CGT LDHG+AA+GYG T
Sbjct: 235 DVPANNEGALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYGVTSD 294
Query: 306 GLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
G Y ++KNSWG WGE G++RM+++ +G+CG+ SYP
Sbjct: 295 GTKYWLMKNSWGTSWGENGFLRMEKDIADKKGMCGLAMEPSYP 337
>gi|357167196|ref|XP_003581047.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like
[Brachypodium distachyon]
Length = 338
Score = 317 bits (811), Expect = 7e-84, Method: Compositional matrix adjust.
Identities = 160/326 (49%), Positives = 207/326 (63%), Gaps = 6/326 (1%)
Query: 28 FSIVGYSPEDLTSNDKLIDL-FESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKI 86
F++ DL +D LI E WM+++ +VY + EK R E+FK N+ I+ N
Sbjct: 12 FALGALGARDLADDDWLIAARHEQWMARYGRVYSDVAEKARRLEVFKANVGFIESVNAGN 71
Query: 87 KNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVV--DLPKSVDWRKKG 144
+WL N+FAD+ +EF+ M G K + K ++ F Y +V DLP SVDWR G
Sbjct: 72 HKFWLEANQFADITKDEFRAMHKGYKMQVIGSKARA-TGFRYANVSIDDLPASVDWRANG 130
Query: 145 AVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCD-NTYNNGCNGGLMD 203
AVT VK+QG CG CWAFSTVA++EGI ++ TG L SLSEQEL+DCD N GC GGLMD
Sbjct: 131 AVTPVKDQGQCGCCWAFSTVASMEGIVKVSTGKLISLSEQELVDCDVGMQNKGCGGGLMD 190
Query: 204 YAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQ 263
AF++IV+ GGL E DYPY +GTC K + +I GY DVP N E SL KA+A Q
Sbjct: 191 NAFEFIVNNGGLDTEADYPYTGADGTCNSNKESNIAASIKGYEDVPANDEASLQKAVAAQ 250
Query: 264 PLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYG-STRGLDYIIVKNSWGPKWGE 322
P+S+A++ F+FY GGV G CGT+LDHGVAAVGYG + G Y +VKNSWG WGE
Sbjct: 251 PVSIAVDGGDDLFRFYKGGVLTGACGTELDHGVAAVGYGVAGDGTKYWLVKNSWGTSWGE 310
Query: 323 KGYIRMKRNTGKPEGLCGINKMASYP 348
G+IR++R+ G+CG+ SYP
Sbjct: 311 DGFIRLERDVADEAGMCGLAMKPSYP 336
>gi|57118007|gb|AAW34135.1| cysteine protease gp2b [Zingiber officinale]
Length = 379
Score = 316 bits (810), Expect = 9e-84, Method: Compositional matrix adjust.
Identities = 160/322 (49%), Positives = 215/322 (66%), Gaps = 13/322 (4%)
Query: 38 LTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETN----RKIKNYWLGL 93
+ S++++ L+ W +K + LD R E+FK+NL+ +D+ N R + LG+
Sbjct: 41 VRSDEEVRMLYLEWRAKNHPAEKYLDLNEYRLEVFKENLQFVDKHNAAADRGEHTFRLGM 100
Query: 94 NEFADLRHEEFKEMFLGLKPDLAR-RKDQS---HEDFSYKDVVDLPKSVDWRKKGAVTHV 149
N FADL +EE++ FL D +R R+ S + ++ DLP S+DWR+KGAV V
Sbjct: 101 NRFADLTNEEYRTRFL---RDFSRLRRSASGKISSRYRLREGDDLPDSIDWREKGAVVPV 157
Query: 150 KNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYI 209
KNQG CGSCWAFSTVAAVEGINQIVTG+L SLSEQ+L+DC T N+GC GG M+ AFQ+I
Sbjct: 158 KNQGGCGSCWAFSTVAAVEGINQIVTGDLISLSEQQLVDC-TTANHGCRGGWMNPAFQFI 216
Query: 210 VSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAI 269
V+ GG++ EE YPY + G C T + VV+I+ Y +VP ++E SL KA+ANQP+SV +
Sbjct: 217 VNNGGINSEETYPYRGQNGICNSTV-NAPVVSIDSYENVPSHNEQSLQKAVANQPVSVTM 275
Query: 270 EASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMK 329
+A+GRDFQ Y G++ G C +H + VGYG+ DY VKNSWG WGE GYIR++
Sbjct: 276 DAAGRDFQLYRSGIFTGSCNISANHALTVVGYGTENDKDYRTVKNSWGKNWGESGYIRVE 335
Query: 330 RNTGKPEGLCGINKMASYPIKK 351
RN G P G CGI + ASYP+KK
Sbjct: 336 RNIGNPNGKCGITRFASYPVKK 357
>gi|15984|emb|CAA34486.1| unnamed protein product [Actinidia deliciosa]
Length = 380
Score = 316 bits (810), Expect = 9e-84, Method: Compositional matrix adjust.
Identities = 164/354 (46%), Positives = 225/354 (63%), Gaps = 15/354 (4%)
Query: 1 MALSSQFKTILISFCISFFIRSSFARDFSIVGYSPEDLT--SNDKLIDLFESWMSKFEKV 58
M L F ++ + F + I S + ++ ++LT +ND++ ++ESW+ K+ K
Sbjct: 1 MGLPKSFVSMSLLFFSTLLILS--------LAFNAKNLTQRTNDEVKAMYESWLIKYGKS 52
Query: 59 YESLDEKLERFEIFKDNLRHIDETNRKI-KNYWLGLNEFADLRHEEFKEMFLGLKPDLAR 117
Y SL E RFEIFK+ LR IDE N ++Y +GLN+FADL EEF+ +LG +
Sbjct: 53 YNSLGEWERRFEIFKETLRFIDEHNADTNRSYKVGLNQFADLTDEEFRSTYLGFTSGSNK 112
Query: 118 RKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGN 177
K + + + V LP VDWR GAV +K+QG CG CWAFS +A VEGIN+IVTG
Sbjct: 113 TKVSNRYEPRFGQV--LPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGV 170
Query: 178 LASLSEQELIDCDNTYNN-GCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGE 236
L SLSEQELIDC T N GCNGG + FQ+I++ GG++ EE+YPY ++G C +
Sbjct: 171 LISLSEQELIDCGRTQNTRGCNGGYITDGFQFIINNGGINTEENYPYTAQDGECNLDLQN 230
Query: 237 SEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGV 296
+ VTI+ Y +VP N+E +L A+ QP+SVA++A+G F+ YS G++ G CGT +DH V
Sbjct: 231 EKYVTIDTYENVPYNNEWALQTAVTYQPVSVALDAAGDAFKHYSSGIFTGPCGTAIDHAV 290
Query: 297 AAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
VGYG+ G+DY IVKNSW WGE+GY+R+ RN G G CGI M SYP+K
Sbjct: 291 TIVGYGTEGGIDYWIVKNSWDTTWGEEGYMRILRNVGG-AGTCGIATMPSYPVK 343
>gi|18422605|ref|NP_568651.1| senescence-associated protein 12 [Arabidopsis thaliana]
gi|13877737|gb|AAK43946.1|AF370131_1 putative senescence-specific cysteine protease SAG12 [Arabidopsis
thaliana]
gi|9758936|dbj|BAB09317.1| senescence-specific cysteine protease [Arabidopsis thaliana]
gi|14532898|gb|AAK64131.1| putative senescence-specific cysteine protease SAG12 [Arabidopsis
thaliana]
gi|332007929|gb|AED95312.1| senescence-associated protein 12 [Arabidopsis thaliana]
Length = 346
Score = 316 bits (810), Expect = 9e-84, Method: Compositional matrix adjust.
Identities = 168/351 (47%), Positives = 218/351 (62%), Gaps = 26/351 (7%)
Query: 7 FKTILISFCISFFIRSSFARDFSIVGYSPED--LTSNDKLIDLFESWMSKFEKVYESLDE 64
F I SFC FSI P D L + I+ WM+K +VY + E
Sbjct: 11 FVAIFSSFC------------FSITLSRPLDNELIMQKRHIE----WMTKHGRVYADVKE 54
Query: 65 KLERFEIFKDNLRHIDETNR--KIKNYWLGLNEFADLRHEEFKEMFLGLK--PDLARRKD 120
+ R+ +FK+N+ I+ N + + L +N+FADL ++EF+ M+ G K L+ +
Sbjct: 55 ENNRYVVFKNNVERIEHLNSIPAGRTFKLAVNQFADLTNDEFRSMYTGFKGVSALSSQSQ 114
Query: 121 QSHEDFSYKDVVD--LPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNL 178
F Y++V LP SVDWRKKGAVT +KNQGSCG CWAFS VAA+EG QI G L
Sbjct: 115 TKMSPFRYQNVSSGALPVSVDWRKKGAVTPIKNQGSCGCCWAFSAVAAIEGATQIKKGKL 174
Query: 179 ASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESE 238
SLSEQ+L+DCD T + GC GGLMD AF++I +TGGL E +YPY E+ TC K +
Sbjct: 175 ISLSEQQLVDCD-TNDFGCEGGLMDTAFEHIKATGGLTTESNYPYKGEDATCNSKKTNPK 233
Query: 239 VVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAA 298
+I GY DVP N E +L+KA+A+QP+SV IE G DFQFYS GV+ G C T LDH V A
Sbjct: 234 ATSITGYEDVPVNDEQALMKAVAHQPVSVGIEGGGFDFQFYSSGVFTGECTTYLDHAVTA 293
Query: 299 VGYG-STRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
+GYG ST G Y I+KNSWG KWGE GY+R++++ +GLCG+ ASYP
Sbjct: 294 IGYGESTNGSKYWIIKNSWGTKWGESGYMRIQKDVKDKQGLCGLAMKASYP 344
>gi|356515086|ref|XP_003526232.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 316 bits (810), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 161/343 (46%), Positives = 221/343 (64%), Gaps = 15/343 (4%)
Query: 9 TILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLER 68
++ + C++F F + + +D + + + E WM+++ KVY+ E+ +R
Sbjct: 11 SLAMLLCMTFLA-------FQVTCRTLQDAS----MYERHEQWMTRYGKVYKDPQEREKR 59
Query: 69 FEIFKDNLRHIDETNRKI-KNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFS 127
F +FK+N+ +I+ N K+Y LG+N+FADL ++EF G K + ++ F
Sbjct: 60 FRVFKENVNYIEAFNNAANKSYKLGINQFADLTNKEFIAPRNGFKGHMCSSIIRT-TTFK 118
Query: 128 YKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELI 187
+++V P +VDWR+KGAVT +K+QG CG CWAFS VAA EGI+ + G L SLSEQEL+
Sbjct: 119 FENVTATPSTVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALSAGKLISLSEQELV 178
Query: 188 DCDNT-YNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYH 246
DCD + GC GGLMD AF++I+ GL+ E +YPY +G C + TI GY
Sbjct: 179 DCDTKGVDQGCEGGLMDDAFKFIIQNHGLNTEANYPYKGVDGKCNANEAAKNAATITGYE 238
Query: 247 DVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYG-STR 305
DVP N+E +L KA+ANQP+SVAI+ASG DFQFY GV+ G CGT+LDHGV AVGYG S
Sbjct: 239 DVPANNEMALQKAVANQPVSVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVSDD 298
Query: 306 GLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
G +Y +VKNSWG +WGE+GYIRM+R EGLCGI ASYP
Sbjct: 299 GTEYWLVKNSWGTEWGEEGYIRMQRGVDSEEGLCGIAMQASYP 341
>gi|302845628|ref|XP_002954352.1| hypothetical protein VOLCADRAFT_76255 [Volvox carteri f.
nagariensis]
gi|300260282|gb|EFJ44502.1| hypothetical protein VOLCADRAFT_76255 [Volvox carteri f.
nagariensis]
Length = 489
Score = 316 bits (809), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 167/351 (47%), Positives = 217/351 (61%), Gaps = 17/351 (4%)
Query: 16 ISFFIRSSFARDFSIVGYSPED----------LTSNDKLIDLFESWMSKFEKVYESLDEK 65
+ F I + VG +PE L + + F+ WM ++ K Y + ++
Sbjct: 3 VRFLIAALLVAASGGVGAAPELQLREQHEKLLLDAKANPMAAFQQWMMQYTKAYANDIKE 62
Query: 66 LE-RFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFK-EMFLGLKPDLARRKDQSH 123
LE RF ++ +NL +I N + ++WL LN FADL +EF+ + K A + QS
Sbjct: 63 LETRFSVWLENLNYILAYNARTTSHWLHLNAFADLTTDEFRNRLGYDFKARQASNRLQSS 122
Query: 124 EDFSYK--DVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASL 181
F Y D LP +DWRKKGAVT VKNQG CGSCWAF+T +VEGIN IVTG LASL
Sbjct: 123 P-FIYDNVDANQLPTEIDWRKKGAVTEVKNQGQCGSCWAFATTGSVEGINAIVTGELASL 181
Query: 182 SEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVT 241
SEQEL+DCD + GC+GGLMDYA+Q+I+ GGL E+DYPY E+G C K VVT
Sbjct: 182 SEQELVDCDTDEDRGCSGGLMDYAYQWIIKNGGLDTEDDYPYTAEDGVCVAAKKNRRVVT 241
Query: 242 INGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVY-DGHCGTQLDHGVAAVG 300
I+GY D+P+N E +L KA A+QP++VAIEA + FQ Y GGVY D CGT L+HGV VG
Sbjct: 242 IDGYVDIPENDEVALKKAAAHQPIAVAIEADAKSFQLYGGGVYDDPTCGTSLNHGVLVVG 301
Query: 301 YGSTRGL-DYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
YG +Y IVKNSWGP+WG+ GYIR++ +G+CGI S+P K
Sbjct: 302 YGKDPHFGNYWIVKNSWGPEWGDNGYIRLRMGAEDVQGMCGIAMAPSFPTK 352
>gi|1046373|gb|AAC49135.1| SAG12 protein [Arabidopsis thaliana]
Length = 346
Score = 316 bits (809), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 169/351 (48%), Positives = 217/351 (61%), Gaps = 26/351 (7%)
Query: 7 FKTILISFCISFFIRSSFARDFSIVGYSPED--LTSNDKLIDLFESWMSKFEKVYESLDE 64
F I SFC FSI P D L + I+ WM+K +VY + E
Sbjct: 11 FVAIFSSFC------------FSITLSRPLDNELIMQKRHIE----WMTKHGRVYADVKE 54
Query: 65 KLERFEIFKDNLRHIDETNR--KIKNYWLGLNEFADLRHEEFKEMFLGLK--PDLARRKD 120
+ R+ +FK+N+ I+ N + + L +N+FADL ++EF M+ G K L+ +
Sbjct: 55 ENNRYVVFKNNVERIEHLNSIPAGRTFKLAVNQFADLTNDEFCSMYTGFKGVSALSSQSQ 114
Query: 121 QSHEDFSYKDVVD--LPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNL 178
F Y++V LP SVDWRKKGAVT +KNQGSCG CWAFS VAA+EG QI G L
Sbjct: 115 TKMSPFRYQNVSSGALPVSVDWRKKGAVTPIKNQGSCGCCWAFSAVAAIEGATQIKKGKL 174
Query: 179 ASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESE 238
SLSEQ+L+DCD T + GC GGLMD AF++I +TGGL E DYPY E+ TC K +
Sbjct: 175 ISLSEQQLVDCD-TNDFGCEGGLMDTAFEHIKATGGLTTESDYPYKGEDATCNSKKTNPK 233
Query: 239 VVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAA 298
+I GY DVP N E +L+KA+A+QP+SV IE G DFQFYS GV+ G C T LDH V A
Sbjct: 234 ATSITGYEDVPVNDEQALMKAVAHQPVSVGIEGGGFDFQFYSSGVFTGECTTYLDHAVTA 293
Query: 299 VGYG-STRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
+GYG ST G Y I+KNSWG KWGE GY+R++++ +GLCG+ ASYP
Sbjct: 294 IGYGESTNGSKYWIIKNSWGTKWGESGYMRIQKDVKDKQGLCGLAMKASYP 344
>gi|356539398|ref|XP_003538185.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 316 bits (809), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 161/303 (53%), Positives = 207/303 (68%), Gaps = 5/303 (1%)
Query: 49 ESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRK-IKNYWLGLNEFADLRHEEFKEM 107
E WM+ KVY EK ++++ FK+N++ I+ N K Y LG+N FADL +EEFK +
Sbjct: 41 EQWMAIHGKVYTHSYEKEQKYQTFKENVQRIEAFNHAGNKPYKLGINHFADLTNEEFKAI 100
Query: 108 FLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAV 167
K + + ++ F Y+++ +P ++DWR++GAVT +K+QG CG CWAFS VAA
Sbjct: 101 NR-FKGHVCSKITRT-PTFRYENMTAVPATLDWRQEGAVTPIKDQGQCGCCWAFSAVAAT 158
Query: 168 EGINQIVTGNLASLSEQELIDCDNT-YNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIME 226
EGI ++ TG L SLSEQEL+DCD + GC GGLMD AF++I+ GL E YPY
Sbjct: 159 EGITKLSTGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFILQNKGLAAEAIYPYEGV 218
Query: 227 EGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDG 286
+GTC + +I GY DVP NSE +LLKA+ANQP+SVAIEASG +FQFYSGGV+ G
Sbjct: 219 DGTCNAKAEGNHATSIKGYEDVPANSESALLKAVANQPVSVAIEASGFEFQFYSGGVFTG 278
Query: 287 HCGTQLDHGVAAVGYG-STRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMA 345
CGT LDHGV AVGYG S G Y +VKNSWG KWG+KGYIRM+R+ EGLCGI +A
Sbjct: 279 SCGTNLDHGVTAVGYGVSDDGTKYWLVKNSWGVKWGDKGYIRMQRDVAAKEGLCGIAMLA 338
Query: 346 SYP 348
SYP
Sbjct: 339 SYP 341
>gi|356517426|ref|XP_003527388.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 343
Score = 315 bits (808), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 161/343 (46%), Positives = 219/343 (63%), Gaps = 15/343 (4%)
Query: 9 TILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLER 68
++ + FC+ F F + + +D + + + WM+++ KVY+ E+ +R
Sbjct: 11 SLALLFCMGFLA-------FQVTCRTLQDAS----MYERHAQWMARYAKVYKDPQEREKR 59
Query: 69 FEIFKDNLRHIDETNRK-IKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFS 127
F IFK+N+ +I+ N K+Y L +N+FADL +EEF K + ++ F
Sbjct: 60 FRIFKENVNYIETFNSADNKSYKLDINQFADLTNEEFIAPRNRFKGHMCSSITRT-TTFK 118
Query: 128 YKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELI 187
Y++V +P +VDWR+KGAVT +K+QG CG CWAFS VAA EGI+ + G L SLSEQE++
Sbjct: 119 YENVTVIPSTVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALNAGKLISLSEQEVV 178
Query: 188 DCDNT-YNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYH 246
DCD + GC GG MD AF++I+ GL+ E +YPY +G C + TI GY
Sbjct: 179 DCDTKGQDQGCAGGFMDGAFKFIIQNHGLNTEPNYPYKAADGKCNAKAAANHAATITGYE 238
Query: 247 DVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYG-STR 305
DVP N+E +L KA+ANQP+SVAI+ASG DFQFY GV+ G CGT+LDHGV AVGYG S
Sbjct: 239 DVPVNNEKALQKAVANQPVSVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVSAD 298
Query: 306 GLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
G +Y +VKNSWG +WGE+GYIRM+R EGLCGI MASYP
Sbjct: 299 GTEYWLVKNSWGTEWGEEGYIRMQRGVKAEEGLCGIAMMASYP 341
>gi|356515044|ref|XP_003526211.1| PREDICTED: LOW QUALITY PROTEIN: thiol protease SEN102-like [Glycine
max]
Length = 337
Score = 315 bits (808), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 157/302 (51%), Positives = 205/302 (67%), Gaps = 7/302 (2%)
Query: 49 ESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKI-KNYWLGLNEFADLRHEEFKEM 107
E+W++++ +VY+ EK E F+IFK+N+ I+ N K Y LG+N FADL EEFK+
Sbjct: 39 ENWIARYGQVYKVAAEK-ETFQIFKENVEFIESFNAAANKPYKLGVNLFADLTLEEFKDF 97
Query: 108 FLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAV 167
GLK + + S F Y++V D+P+++DWR+KGAVT +K+QG CGSCWAFSTVAA
Sbjct: 98 RFGLK----KTHEFSITPFKYENVTDIPEALDWREKGAVTPIKDQGQCGSCWAFSTVAAT 153
Query: 168 EGINQIVTGNLASLSEQELIDCDNT-YNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIME 226
EGI+QI TGNL SL EQEL+ CD + GC GG M+ F++I+ GG+ + +YPY
Sbjct: 154 EGIHQITTGNLVSLXEQELVSCDTKGVDQGCEGGYMEDGFEFIIKNGGITTKANYPYKGV 213
Query: 227 EGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDG 286
GTC T S V I GY VP SE++L KA+ANQP+SV+I+A+ F FY+GG+Y G
Sbjct: 214 NGTCNTTIAASTVAQIKGYETVPSYSEEALQKAVANQPVSVSIDANNGHFMFYAGGIYTG 273
Query: 287 HCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMAS 346
CGT LDHGV AVGYG+T DY IVKNSWG W EKG+IRM+R GLCG+ +S
Sbjct: 274 ECGTDLDHGVTAVGYGTTNETDYWIVKNSWGTGWDEKGFIRMQRGITVKHGLCGVALDSS 333
Query: 347 YP 348
YP
Sbjct: 334 YP 335
>gi|357129125|ref|XP_003566217.1| PREDICTED: thiol protease SEN102-like [Brachypodium distachyon]
Length = 380
Score = 315 bits (807), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 176/334 (52%), Positives = 215/334 (64%), Gaps = 13/334 (3%)
Query: 29 SIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETN-RKIK 87
S + + DL S + L L+E W ++ V L EK RF +F++N R + E N R+
Sbjct: 30 SAMDFGESDLASEESLWALYERWRAR-HTVSRDLAEKSRRFNVFRENARLVHEFNLRRDA 88
Query: 88 NYWLGLNEFADLRHEEFKEMFLG--------LKPDLARRKDQSH-EDFSYKDVVDLPKSV 138
Y L LN FADL +EF+ + KP A D + S+ LP SV
Sbjct: 89 PYKLRLNRFADLTSDEFRRSYASSRVSHHRMFKPRAANNNDDDDDKGSSFTHGGALPTSV 148
Query: 139 DWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCN 198
DWR+KGAVT VK+QG CGSCWAFST+AAVEGIN I T NL SLSEQ+L+DCD N GC+
Sbjct: 149 DWREKGAVTGVKDQGQCGSCWAFSTIAAVEGINAIRTNNLTSLSEQQLVDCDTKTNAGCD 208
Query: 199 GGLMDYAFQYIVSTGGLHKEEDYPY-IMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLL 257
GGLMD AF YI GG+ E+ YPY + +C K + VV+I+GY DVP+N E +L
Sbjct: 209 GGLMDDAFSYIAKHGGVAAEKSYPYRARQSSSCNSKKAAAAVVSIDGYEDVPRNDETALK 268
Query: 258 KALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGST-RGLDYIIVKNSW 316
KA+A QP++VAIEA G FQFYS GV+ G CGT+LDHGVAAVGYG T G Y IVKNSW
Sbjct: 269 KAVAAQPVAVAIEAGGSHFQFYSEGVFAGKCGTELDHGVAAVGYGVTVDGTKYWIVKNSW 328
Query: 317 GPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
G +WGEKGYIRMKR+ EGLCGI ASYP+K
Sbjct: 329 GEEWGEKGYIRMKRDVADKEGLCGIAMEASYPVK 362
>gi|356515046|ref|XP_003526212.1| PREDICTED: thiol protease SEN102-like [Glycine max]
Length = 342
Score = 315 bits (806), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 160/308 (51%), Positives = 206/308 (66%), Gaps = 4/308 (1%)
Query: 44 LIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRK-IKNYWLGLNEFADLRHE 102
L + E+WM+++ K+Y+ EK +RF+IFKDN+ I+ N K Y LG+N ADL E
Sbjct: 34 LRERHENWMAEYGKMYKDAAEKEKRFQIFKDNVEFIESFNAAGNKPYKLGVNHLADLTLE 93
Query: 103 EFKEMFLGLKPDLARRKDQ-SHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQG-SCGSCWA 160
EFK+ GLK F Y++V D+P+++DWR KGAVT +K+QG CG WA
Sbjct: 94 EFKDSRNGLKRTYEFSTTTFKLNGFKYENVTDIPEAIDWRVKGAVTPIKDQGDQCGRFWA 153
Query: 161 FSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEED 220
FST+AA EGI+QI TGNL SLSEQEL+DCD+ ++GC GG M+ F++I+ GG+ E +
Sbjct: 154 FSTIAATEGIHQISTGNLVSLSEQELVDCDSV-DDGCEGGFMEDGFEFIIKNGGITSETN 212
Query: 221 YPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYS 280
YPY +GTC T S V I GY VP SE++L KA+ANQP+SV+I A+ F FYS
Sbjct: 213 YPYKGVDGTCNTTIAASPVAQIKGYEIVPSYSEEALKKAVANQPVSVSIHATNATFMFYS 272
Query: 281 GGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCG 340
G+Y+G CGT LDHGV AVGYG+ G DY IVKNSWG +WGEKGYIRM R G+CG
Sbjct: 273 SGIYNGECGTDLDHGVTAVGYGTENGTDYWIVKNSWGTQWGEKGYIRMHRGIAAKHGICG 332
Query: 341 INKMASYP 348
I +SYP
Sbjct: 333 IALDSSYP 340
>gi|125552927|gb|EAY98636.1| hypothetical protein OsI_20560 [Oryza sativa Indica Group]
Length = 449
Score = 315 bits (806), Expect = 3e-83, Method: Compositional matrix adjust.
Identities = 155/301 (51%), Positives = 196/301 (65%), Gaps = 1/301 (0%)
Query: 48 FESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEM 107
FE+W ++ + Y + E+ R F DN + N +Y L LN FADL H+EF+
Sbjct: 38 FEAWCAEHGRSYATPGERAARLAAFADNAAFVAAHNGAPASYALALNAFADLTHDEFRAA 97
Query: 108 FLGLKPDLARRKDQSHEDFSYKDVVD-LPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAA 166
LG +D V +P +VDWR+ GAVT VK+QGSCG+CW+FS A
Sbjct: 98 RLGRLAAAGPGRDGGAPYLGVDGGVGAVPDAVDWRQSGAVTKVKDQGSCGACWSFSATGA 157
Query: 167 VEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIME 226
+EGIN+I TG+L SLSEQELIDCD +YN+GC GGLMDYA++++V GG+ E DYPY
Sbjct: 158 MEGINKIKTGSLISLSEQELIDCDRSYNSGCGGGLMDYAYKFVVKNGGIDTEADYPYRET 217
Query: 227 EGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDG 286
+GTC K + VVTI+GY DVP N+ED LL+A+A QP+SV I S R FQ YS G++DG
Sbjct: 218 DGTCNKNKLKRRVVTIDGYKDVPANNEDMLLQAVAQQPVSVGICGSARAFQLYSKGIFDG 277
Query: 287 HCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMAS 346
C T LDH + VGYGS G DY IVKNSWG WG KGY+ M RNTG G+CGIN+M S
Sbjct: 278 PCPTSLDHAILIVGYGSEGGKDYWIVKNSWGESWGMKGYMYMHRNTGNSNGVCGINQMPS 337
Query: 347 Y 347
+
Sbjct: 338 F 338
>gi|242072392|ref|XP_002446132.1| hypothetical protein SORBIDRAFT_06g002150 [Sorghum bicolor]
gi|241937315|gb|EES10460.1| hypothetical protein SORBIDRAFT_06g002150 [Sorghum bicolor]
Length = 337
Score = 315 bits (806), Expect = 3e-83, Method: Compositional matrix adjust.
Identities = 155/314 (49%), Positives = 210/314 (66%), Gaps = 7/314 (2%)
Query: 40 SNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKN-YWLGLNEFAD 98
S+ +++ E+WM ++ +VY+ EK RFE FK N+ ++ N KN +WLG+N+FAD
Sbjct: 28 SDAAMVERHENWMVEYGRVYKDAAEKARRFEAFKHNVAFVESFNTNKKNKFWLGVNQFAD 87
Query: 99 LRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSC 158
L EEFK G KP A + + + V LP +VDWR KGAVT +KNQG CG C
Sbjct: 88 LTTEEFKAN-KGFKP-TAEKVPTTGFKYENLSVSALPTAVDWRTKGAVTPIKNQGQCGCC 145
Query: 159 WAFSTVAAVEGINQIVTGNLASLSEQELIDCD-NTYNNGCNGGLMDYAFQYIVSTGGLHK 217
WAFS VAA+EGI ++ TGNL SLSEQEL+DCD ++ + GC GG MD AF++++ GGL
Sbjct: 146 WAFSAVAAMEGIVKLSTGNLISLSEQELVDCDTHSMDEGCEGGWMDSAFEFVIKNGGLAT 205
Query: 218 EEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQ 277
E +YPY +G C+ G TI G+ DVP N+E +L+KA+ANQP+SVA++AS R F
Sbjct: 206 ESNYPYKAVDGKCK--GGSKSAATIKGHEDVPVNNEAALMKAVANQPVSVAVDASDRTFM 263
Query: 278 FYSGGVYDGHCGTQLDHGVAAVGYG-STRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPE 336
YSGGV G CGT+LDHG+AA+GYG + G Y I+KNSWG WGEKG++RM+++
Sbjct: 264 LYSGGVMTGSCGTELDHGIAAIGYGMESDGTKYWILKNSWGTTWGEKGFLRMEKDITDKR 323
Query: 337 GLCGINKMASYPIK 350
G+CG+ SYP +
Sbjct: 324 GMCGLAMKPSYPTE 337
>gi|193806686|sp|A5HII1.1|ACTN_ACTDE RecName: Full=Actinidain; Short=Actinidin; AltName: Full=Allergen
Act d 1; AltName: Allergen=Act d 1; Flags: Precursor
gi|146215974|gb|ABQ10189.1| actinidin Act1a [Actinidia deliciosa]
Length = 380
Score = 315 bits (806), Expect = 3e-83, Method: Compositional matrix adjust.
Identities = 164/354 (46%), Positives = 224/354 (63%), Gaps = 15/354 (4%)
Query: 1 MALSSQFKTILISFCISFFIRSSFARDFSIVGYSPEDLT--SNDKLIDLFESWMSKFEKV 58
M L F ++ + F + I S + ++ ++LT +ND++ ++ESW+ K+ K
Sbjct: 1 MGLPKSFVSMSLLFFSTLLILS--------LAFNAKNLTQRTNDEVKAMYESWLIKYGKS 52
Query: 59 YESLDEKLERFEIFKDNLRHIDETNRKI-KNYWLGLNEFADLRHEEFKEMFLGLKPDLAR 117
Y SL E RFEIFK+ LR IDE N ++Y +GLN+FADL EEF+ +LG +
Sbjct: 53 YNSLGEWERRFEIFKETLRFIDEHNADTNRSYKVGLNQFADLTDEEFRSTYLGFTSGSNK 112
Query: 118 RKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGN 177
K + + V LP VDWR GAV +K+QG CG CWAFS +A VEGIN+IVTG
Sbjct: 113 TKVSNRYEPRVGQV--LPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGV 170
Query: 178 LASLSEQELIDCDNTYNN-GCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGE 236
L SLSEQELIDC T N GCNGG + FQ+I++ GG++ EE+YPY ++G C +
Sbjct: 171 LISLSEQELIDCGRTQNTRGCNGGYITDGFQFIINNGGINTEENYPYTAQDGECNLDLQN 230
Query: 237 SEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGV 296
+ VTI+ Y +VP N+E +L A+ QP+SVA++A+G F+ YS G++ G CGT +DH V
Sbjct: 231 EKYVTIDTYENVPYNNEWALQTAVTYQPVSVALDAAGDAFKHYSSGIFTGPCGTAIDHAV 290
Query: 297 AAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
VGYG+ G+DY IVKNSW WGE+GY+R+ RN G G CGI M SYP+K
Sbjct: 291 TIVGYGTEGGIDYWIVKNSWDTTWGEEGYMRILRNVGG-AGTCGIATMPSYPVK 343
>gi|224116884|ref|XP_002317418.1| predicted protein [Populus trichocarpa]
gi|222860483|gb|EEE98030.1| predicted protein [Populus trichocarpa]
Length = 503
Score = 315 bits (806), Expect = 3e-83, Method: Compositional matrix adjust.
Identities = 160/323 (49%), Positives = 215/323 (66%), Gaps = 10/323 (3%)
Query: 37 DLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIK---NYWLGL 93
+L S + +I++F+ W + +KVYE E +R+ FK NL++I E K + +GL
Sbjct: 39 ELVSEESIIEIFQQWRDRHQKVYEHAAESEKRYRNFKRNLKYIIEKAGKKTAALGHSVGL 98
Query: 94 NEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDV--VDLPKSVDWRKKGAVTHVKN 151
N+FADL +EEFKE++L K + D+ +++ D P S+DWRKKG VT VK+
Sbjct: 99 NKFADLSNEEFKELYLSKVKKPINIKRSTARDWRQRNLQTCDAPSSLDWRKKGVVTAVKD 158
Query: 152 QGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVS 211
QG CGSCW+FST A+EGIN IVTG+L SLSEQEL+DCD T N GC GG MDYAF+++++
Sbjct: 159 QGDCGSCWSFSTTGAIEGINAIVTGDLISLSEQELVDCDTT-NYGCEGGYMDYAFEWVIN 217
Query: 212 TGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEA 271
GG+ E +YPY +GTC TK E +VV+I+GY DV + ++ +LL A QP+SV ++
Sbjct: 218 NGGIDTEANYPYTGVDGTCNTTKEEIKVVSIDGYTDVDE-TDSALLCATVQQPISVGMDG 276
Query: 272 SGRDFQFYSGGVYDGHCG---TQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRM 328
S DFQ Y+GG+YDG C +DH V VGYGS G DY IVKNSWG +WG +GY +
Sbjct: 277 SALDFQLYTGGIYDGDCSDDPNDIDHAVLIVGYGSENGEDYWIVKNSWGTEWGMEGYFYI 336
Query: 329 KRNTGKPEGLCGINKMASYPIKK 351
KRNT P G+C IN ASYP K+
Sbjct: 337 KRNTDLPYGVCAINAEASYPTKE 359
>gi|224076972|ref|XP_002305074.1| predicted protein [Populus trichocarpa]
gi|224106329|ref|XP_002333698.1| predicted protein [Populus trichocarpa]
gi|222837984|gb|EEE76349.1| predicted protein [Populus trichocarpa]
gi|222848038|gb|EEE85585.1| predicted protein [Populus trichocarpa]
Length = 307
Score = 315 bits (806), Expect = 3e-83, Method: Compositional matrix adjust.
Identities = 156/308 (50%), Positives = 208/308 (67%), Gaps = 6/308 (1%)
Query: 44 LIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDE-TNRKIKNYWLGLNEFADLRHE 102
++ E WM++ +VY + EK +R+ IFK+N+ I+ N + Y LG+N+FADL +E
Sbjct: 1 MLKRHEEWMAQHGRVYGDMKEKEKRYLIFKENIERIEAFNNGSDRGYKLGVNKFADLTNE 60
Query: 103 EFKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFS 162
EF+ M G K ++ S F ++++ +P S+DWRK GAVT VK+QG+CG CWAFS
Sbjct: 61 EFRAMHHGYKRQSSKLMSSS---FRHENLSAIPTSMDWRKAGAVTPVKDQGTCGCCWAFS 117
Query: 163 TVAAVEGINQIVTGNLASLSEQELIDCD-NTYNNGCNGGLMDYAFQYIVSTGGLHKEEDY 221
VAA+EGI ++ TG L SLSEQ+L+DCD + GC GGLMD AFQ+I+ GGL E Y
Sbjct: 118 AVAAIEGIIKLKTGKLISLSEQQLVDCDVKGVDQGCGGGLMDNAFQFILRNGGLTSEATY 177
Query: 222 PYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSG 281
PY +GTC+ K S I GY DVP N+E++LL+A+A QP+SVA+E G DFQFY
Sbjct: 178 PYQGVDGTCKSKKTASIEAKITGYEDVPVNNENALLQAVAKQPVSVAVEGGGYDFQFYKS 237
Query: 282 GVYDGHCGTQLDHGVAAVGYGS-TRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCG 340
GV+ G CGT LDH V A+GYG+ + G +Y +VKNSWG WGE GY+RM+R G EGLCG
Sbjct: 238 GVFKGDCGTYLDHAVTAIGYGTNSDGTNYWLVKNSWGTSWGESGYMRMQRGIGAREGLCG 297
Query: 341 INKMASYP 348
+ ASYP
Sbjct: 298 VAMDASYP 305
>gi|356543122|ref|XP_003540012.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 342
Score = 314 bits (804), Expect = 4e-83, Method: Compositional matrix adjust.
Identities = 160/308 (51%), Positives = 202/308 (65%), Gaps = 4/308 (1%)
Query: 44 LIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRK-IKNYWLGLNEFADLRHE 102
+ + E WM K+ KVY+ E +RF IF++N+ I+ N K Y L +N AD +E
Sbjct: 34 MYERHEQWMEKYGKVYKDSAEXEKRFLIFENNVEFIESFNAAGNKPYKLSINHLADQTNE 93
Query: 103 EFKEMFLGLK-PDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAF 161
EF G K + + F Y++V D+P +VDWR+KG T +K+QG CG CWAF
Sbjct: 94 EFMASHKGYKGSHWQGLRITTQTPFKYENVTDIPWAVDWRQKGDATSIKDQGQCGICWAF 153
Query: 162 STVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDY 221
S VAA EGI QI TGNL SLSEQEL+DCD+ ++GC+GGLM++ F++I+ GG+ E +Y
Sbjct: 154 SAVAATEGIYQITTGNLVSLSEQELVDCDSV-DHGCDGGLMEHGFEFIIKNGGISSEANY 212
Query: 222 PYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSG 281
PY GTC+ K S I GY VP N E+ L KA+ANQP+SV+I+A G FQFYS
Sbjct: 213 PYTAVNGTCDTNKEASPGAQIKGYETVPVNCEEELQKAVANQPVSVSIDAGGSAFQFYSS 272
Query: 282 GVYDGHCGTQLDHGVAAVGYGST-RGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCG 340
GV+ G CGTQLDHGV AVGYGST G+ Y IVKNSWG +WGE+GYIRM R EGLCG
Sbjct: 273 GVFTGQCGTQLDHGVTAVGYGSTDDGIQYWIVKNSWGTQWGEEGYIRMLRGIDAQEGLCG 332
Query: 341 INKMASYP 348
I ASYP
Sbjct: 333 IAMDASYP 340
>gi|219687002|dbj|BAH08632.1| daikon cysteine protease RD21 [Raphanus sativus]
Length = 289
Score = 314 bits (804), Expect = 4e-83, Method: Compositional matrix adjust.
Identities = 145/218 (66%), Positives = 174/218 (79%)
Query: 134 LPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTY 193
+P+SVDWRK+GAV VK+QGSCGSCWAFST+ AVEGIN+IVTG+L SLSEQEL+DCD +Y
Sbjct: 3 IPESVDWRKEGAVAAVKDQGSCGSCWAFSTIGAVEGINKIVTGDLISLSEQELVDCDTSY 62
Query: 194 NNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSE 253
N GCNGGLMDYAF++I+ GG+ EEDYPY +G C+ + ++VVTI+ Y DVP+N+E
Sbjct: 63 NQGCNGGLMDYAFEFIIKNGGIDTEEDYPYKAADGRCDQNRKNAKVVTIDAYEDVPENNE 122
Query: 254 DSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVK 313
+L KALANQP+SVAIEA GR FQ YS GV+DG CGT+LDHGV AVGYG+ G DY IV+
Sbjct: 123 AALKKALANQPISVAIEAGGRAFQLYSSGVFDGTCGTELDHGVVAVGYGTENGKDYWIVR 182
Query: 314 NSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIKK 351
NSWG WGE GYI+M RN + G CGI ASYPIKK
Sbjct: 183 NSWGGSWGESGYIKMARNIAEATGKCGIAMEASYPIKK 220
>gi|413953668|gb|AFW86317.1| hypothetical protein ZEAMMB73_339067 [Zea mays]
Length = 433
Score = 314 bits (804), Expect = 5e-83, Method: Compositional matrix adjust.
Identities = 157/340 (46%), Positives = 213/340 (62%), Gaps = 12/340 (3%)
Query: 16 ISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDN 75
IS I +F F + DL+ + ++ E WM+++ +VY+ EK RFE+FK N
Sbjct: 101 ISAIIGFAF---FCGAAMAARDLSDDSVMVARHEQWMAQYSRVYKDASEKARRFEVFKAN 157
Query: 76 LRHIDETNRKIKN-YWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVD- 133
++ I+ N N +WLG+N+FADL ++EF+ L + F Y++V
Sbjct: 158 VQFIESFNAGGNNKFWLGVNQFADLTNDEFRST--KTNKGLKSSNMKIPTGFRYENVSAD 215
Query: 134 -LPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCD-N 191
LP ++DWR KGAVT +K+QG CG CWAFS VAA EGI +I TG L SL+EQEL+DCD +
Sbjct: 216 ALPTTIDWRTKGAVTPIKDQGQCGCCWAFSAVAATEGIVKISTGKLVSLAEQELVDCDVH 275
Query: 192 TYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQN 251
+ GC GGLMD AF++I+ GGL E YPY +G C+ G + TI GY DVP N
Sbjct: 276 GEDQGCEGGLMDDAFKFIIKNGGLTTESSYPYTAADGKCK--SGSNSAATIKGYEDVPAN 333
Query: 252 SEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTR-GLDYI 310
E +L+KA+ANQP+SVA++ FQFYSGGV G CGT LDHG+AA+GYG T G Y
Sbjct: 334 DEAALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYGKTSDGTKYW 393
Query: 311 IVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
++KNSWG WGE GY+RM+++ G+CG+ SYP +
Sbjct: 394 LMKNSWGTTWGENGYLRMEKDISDKRGMCGLAMEPSYPTE 433
>gi|2144501|pir||TAGB actinidain (EC 3.4.22.14) precursor - kiwi fruit
gi|166317|gb|AAA32629.1| actinidin [Actinidia deliciosa]
Length = 380
Score = 314 bits (804), Expect = 5e-83, Method: Compositional matrix adjust.
Identities = 164/354 (46%), Positives = 224/354 (63%), Gaps = 15/354 (4%)
Query: 1 MALSSQFKTILISFCISFFIRSSFARDFSIVGYSPEDLT--SNDKLIDLFESWMSKFEKV 58
M L F ++ + F + I S + ++ ++LT +ND++ ++ESW+ K+ K
Sbjct: 1 MGLPKSFVSMSLLFFSTLLILS--------LAFNAKNLTQRTNDEVKAMYESWLIKYGKS 52
Query: 59 YESLDEKLERFEIFKDNLRHIDETNRKI-KNYWLGLNEFADLRHEEFKEMFLGLKPDLAR 117
Y SL E RFEIFK+ LR IDE N ++Y +GLN+FADL EEF+ +LG +
Sbjct: 53 YNSLGEWERRFEIFKETLRFIDEHNADTNRSYKVGLNQFADLTDEEFRSTYLGFTSGSNK 112
Query: 118 RKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGN 177
K + + V LP VDWR GAV +K+QG CG CWAFS +A VEGIN+IVTG
Sbjct: 113 TKVSNRYEPRVGQV--LPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGV 170
Query: 178 LASLSEQELIDCDNTYNN-GCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGE 236
L SLSEQELIDC T N GCNGG + FQ+I++ GG++ EE+YPY ++G C +
Sbjct: 171 LISLSEQELIDCGRTQNTRGCNGGYITDGFQFIINNGGINTEENYPYTAQDGECNVELQN 230
Query: 237 SEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGV 296
+ VTI+ Y +VP N+E +L A+ QP+SVA++A+G F+ YS G++ G CGT +DH V
Sbjct: 231 EKYVTIDTYENVPYNNEWALQTAVTYQPVSVALDAAGDAFKQYSSGIFTGPCGTAIDHAV 290
Query: 297 AAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
VGYG+ G+DY IVKNSW WGE+GY+R+ RN G G CGI M SYP+K
Sbjct: 291 TIVGYGTEGGIDYWIVKNSWDTTWGEEGYMRILRNVGG-AGTCGIATMPSYPVK 343
>gi|146215988|gb|ABQ10196.1| actinidin Act3a [Actinidia eriantha]
Length = 380
Score = 313 bits (803), Expect = 6e-83, Method: Compositional matrix adjust.
Identities = 168/353 (47%), Positives = 228/353 (64%), Gaps = 13/353 (3%)
Query: 1 MALSSQFKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYE 60
M F ++ + F +F I S FA D I SP L +ND+++ L+ESW+ K+ K Y
Sbjct: 1 MGSPKSFISMSLLFFSTFLIFS-FAIDAKI---SP--LRTNDEVMALYESWLVKYGKSYN 54
Query: 61 SLDEKLERFEIFKDNLRHIDETNRKI-KNYWLGLNEFADLRHEEFKEMFLGLKPDLARRK 119
SL E+ R EIFK+NLR IDE N ++Y +GLN+FADL EE++ +LG K L K
Sbjct: 55 SLGEREMRIEIFKENLRFIDEHNADPNRSYTVGLNQFADLTDEEYRSTYLGFKSSL---K 111
Query: 120 DQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLA 179
+ + + LP VDWR GAV VKNQG C SCWAF+T+A VE INQI+TG+L
Sbjct: 112 SKVSNRYMPQVGEVLPDYVDWRTTGAVVDVKNQGLCSSCWAFATIATVESINQIITGDLI 171
Query: 180 SLSEQELIDCDNT-YNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESE 238
SLSEQEL+DC+ T N GC GG MD A+++I++ GG++ EE+YPYI ++ C+ K
Sbjct: 172 SLSEQELVDCNRTPINEGCKGGFMDDAYEFIINNGGINTEENYPYIGQDDQCDEPKKNQN 231
Query: 239 VVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYD-GHCGTQLDHGVA 297
VTI+ Y VP N E ++ +A+A QP+SVAI+A F+FY G++ G CGT L+H V
Sbjct: 232 YVTIDSYEQVPPNDELAMKRAVAYQPVSVAIDAYCLGFRFYQSGIFTGGSCGTTLNHAVT 291
Query: 298 AVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
+GYG+ G+DY IVKNS+G +WGE GY +++RN G EG CGI YP+K
Sbjct: 292 IIGYGTENGIDYWIVKNSYGTQWGESGYGKVQRNVGG-EGRCGIASYPFYPVK 343
>gi|312451836|gb|ADQ85985.1| actinidin [Actinidia chinensis]
Length = 380
Score = 313 bits (803), Expect = 6e-83, Method: Compositional matrix adjust.
Identities = 164/354 (46%), Positives = 224/354 (63%), Gaps = 15/354 (4%)
Query: 1 MALSSQFKTILISFCISFFIRSSFARDFSIVGYSPEDLT--SNDKLIDLFESWMSKFEKV 58
M L F ++ + F + I S + ++ ++LT +ND++ ++ESW+ K+ K
Sbjct: 1 MGLPKSFVSMSLLFFSTLLILS--------LAFNTKNLTQRTNDEVKAMYESWLIKYGKS 52
Query: 59 YESLDEKLERFEIFKDNLRHIDETNRKI-KNYWLGLNEFADLRHEEFKEMFLGLKPDLAR 117
Y SL E RFEIFK+ LR IDE N ++Y +GLN+FADL EEF+ +LG +
Sbjct: 53 YNSLGEWERRFEIFKETLRFIDEHNADTNRSYKVGLNQFADLTDEEFRSTYLGFTSGSNK 112
Query: 118 RKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGN 177
K + + V LP VDWR GAV +K+QG CG CWAFS +A VEGIN+IVTG
Sbjct: 113 TKVSNRYEPRVGQV--LPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGV 170
Query: 178 LASLSEQELIDCDNTYNN-GCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGE 236
L SLSEQELIDC T N GCNGG + FQ+I++ GG++ EE+YPY ++G C +
Sbjct: 171 LISLSEQELIDCGRTQNTRGCNGGYITDGFQFIINNGGINTEENYPYTAQDGECNVDLQN 230
Query: 237 SEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGV 296
+ VTI+ Y +VP N+E +L A+ QP+SVA++A+G F+ YS G++ G CGT +DH V
Sbjct: 231 EKYVTIDTYENVPYNNEWALQTAVTYQPVSVALDAAGDAFKQYSSGIFTGPCGTAIDHAV 290
Query: 297 AAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
VGYG+ G+DY IVKNSW WGE+GY+R+ RN G G CGI M SYP+K
Sbjct: 291 TIVGYGTEGGIDYWIVKNSWDTTWGEEGYMRILRNVGG-AGTCGIATMPSYPVK 343
>gi|27728675|gb|AAO18731.1| cysteine protease [Gossypium hirsutum]
Length = 389
Score = 313 bits (803), Expect = 7e-83, Method: Compositional matrix adjust.
Identities = 162/330 (49%), Positives = 218/330 (66%), Gaps = 10/330 (3%)
Query: 30 IVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIK-N 88
IV + + S ++++++F+ W K KVY +E +RFE FK NL++I E N K K N
Sbjct: 31 IVEHEIDAFLSEERVLEIFQQWKEKHRKVYRHAEEAEKRFENFKGNLKYILERNAKRKAN 90
Query: 89 YW---LGLNEFADLRHEEFKEMFLG-LKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKG 144
W +GLN+FAD+ +EEF++ +L +K + + S D P S+DWR G
Sbjct: 91 KWEHHVGLNKFADMSNEEFRKAYLSKVKKPINKGITLSRNMRRKVQSCDAPSSLDWRNYG 150
Query: 145 AVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDY 204
VT VK+QGSCGSCWAFS+ A+EGIN +VTG+L SLSEQEL++CD T N GC GG MDY
Sbjct: 151 VVTAVKDQGSCGSCWAFSSTGAMEGINALVTGDLISLSEQELVECD-TSNYGCEGGYMDY 209
Query: 205 AFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQP 264
AF+++++ GG+ E DYPY +GTC TK E++VV+I+GY DV Q S+ +LL A+A QP
Sbjct: 210 AFEWVINNGGIDSESDYPYTGVDGTCNTTKEETKVVSIDGYQDVEQ-SDSALLCAVAQQP 268
Query: 265 LSVAIEASGRDFQFYSGGVYDGHCG---TQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWG 321
+SV I+ S DFQ Y+GG+YDG C +DH V VGYGS +Y IVKNSWG WG
Sbjct: 269 VSVGIDGSAIDFQLYTGGIYDGSCSDDPDDIDHAVLIVGYGSEDSEEYWIVKNSWGTSWG 328
Query: 322 EKGYIRMKRNTGKPEGLCGINKMASYPIKK 351
GY +KR+T P G+C +N MASYP K+
Sbjct: 329 IDGYFYLKRDTDLPYGVCAVNAMASYPTKQ 358
>gi|115464789|ref|NP_001055994.1| Os05g0508300 [Oryza sativa Japonica Group]
gi|48475189|gb|AAT44258.1| hypothetical protein [Oryza sativa Japonica Group]
gi|113579545|dbj|BAF17908.1| Os05g0508300 [Oryza sativa Japonica Group]
Length = 450
Score = 313 bits (802), Expect = 9e-83, Method: Compositional matrix adjust.
Identities = 155/302 (51%), Positives = 196/302 (64%), Gaps = 2/302 (0%)
Query: 48 FESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEM 107
FE+W ++ + Y + E+ R F DN + N +Y L LN FADL H+EF+
Sbjct: 38 FEAWCAEHGRSYATPGERAARLAAFADNAAFVAAHNGAPASYALALNAFADLTHDEFRAA 97
Query: 108 FLGLKPDLARRKDQSHEDFSYKD--VVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVA 165
LG + D V +P +VDWR+ GAVT VK+QGSCG+CW+FS
Sbjct: 98 RLGRLAAAGGPGRDGGAPYLGVDGGVGAVPDAVDWRQSGAVTKVKDQGSCGACWSFSATG 157
Query: 166 AVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIM 225
A+EGIN+I TG+L SLSEQELIDCD +YN+GC GGLMDYA++++V GG+ E DYPY
Sbjct: 158 AMEGINKIKTGSLISLSEQELIDCDRSYNSGCGGGLMDYAYKFVVKNGGIDTEADYPYRE 217
Query: 226 EEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYD 285
+GTC K + VVTI+GY DVP N+ED LL+A+A QP+SV I S R FQ YS G++D
Sbjct: 218 TDGTCNKNKLKRRVVTIDGYKDVPANNEDMLLQAVAQQPVSVGICGSARAFQLYSKGIFD 277
Query: 286 GHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMA 345
G C T LDH + VGYGS G DY IVKNSWG WG KGY+ M RNTG G+CGIN+M
Sbjct: 278 GPCPTSLDHAILIVGYGSEGGKDYWIVKNSWGESWGMKGYMYMHRNTGNSNGVCGINQMP 337
Query: 346 SY 347
S+
Sbjct: 338 SF 339
>gi|357166364|ref|XP_003580686.1| PREDICTED: oryzain alpha chain-like [Brachypodium distachyon]
Length = 360
Score = 313 bits (801), Expect = 9e-83, Method: Compositional matrix adjust.
Identities = 162/327 (49%), Positives = 210/327 (64%), Gaps = 10/327 (3%)
Query: 31 VGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRK----I 86
+ S + S ++ ++ W ++ +E+ R+E F+DNLR+IDE N I
Sbjct: 26 IASSSGQIRSEEETRRMYAEWTAQHGSPI--TNEEEGRYEAFRDNLRYIDEHNAAADAGI 83
Query: 87 KNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYK--DVVDLPKSVDWRKKG 144
++ LGLN FA L +EE++ +LGL+ D Y+ D LP+SVDWR+KG
Sbjct: 84 HSFRLGLNRFAGLTNEEYRAAYLGLRLRSGAVGDLRKPSARYEAADGEALPESVDWREKG 143
Query: 145 AVTHVKNQG-SCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMD 203
AV VK+QG SCGS WAFS +AAVE INQIVTG L SLSEQEL+DCD +YN GC+GGLMD
Sbjct: 144 AVGKVKDQGRSCGSAWAFSAIAAVESINQIVTGELISLSEQELMDCDTSYNAGCDGGLMD 203
Query: 204 YAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQ 263
AF++I+S GG+ +EDYPY +C+ K + VTI+ Y D+ N E SL KA++NQ
Sbjct: 204 DAFEFIISNGGIDTDEDYPYKARNDSCDANKRNRKAVTIDDYEDLRMN-EKSLQKAVSNQ 262
Query: 264 PLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEK 323
P+SVAIEA GRDFQ Y G++ G CGT LDH VGYGS G DY IVK S+G WGE
Sbjct: 263 PVSVAIEAGGRDFQLYKSGIFTGTCGTDLDHATTIVGYGSENGTDYWIVKESYGTSWGES 322
Query: 324 GYIRMKRNTGKPEGLCGINKMASYPIK 350
GY RM+RN + G CGI + SYP+K
Sbjct: 323 GYARMERNIKETSGKCGIAMLPSYPVK 349
>gi|102140014|gb|ABF70145.1| cysteine protease, putative [Musa acuminata]
Length = 373
Score = 313 bits (801), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 157/301 (52%), Positives = 195/301 (64%), Gaps = 4/301 (1%)
Query: 51 WMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLG 110
WM++ + Y+ EK +R IFK N+ +I+ N + Y L N+FADL HEEFK M G
Sbjct: 38 WMARHGRTYKDAAEKEQRLGIFKSNVEYIESFNAGKRKYQLAANQFADLTHEEFKAMHTG 97
Query: 111 LKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGI 170
KP K ++ F + + +P SVDWR KGAVT VK+QG CGSCWAF+ VAAVEGI
Sbjct: 98 FKPSGTGAK-KAGNGFRHGSLSSVPDSVDWRSKGAVTPVKDQGLCGSCWAFTVVAAVEGI 156
Query: 171 NQIVTGNLASLSEQELIDCD-NTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGT 229
+IVTG L SLSEQ+L+DCD + + GC GG MD AF++IV+ GG+ E +YPY +
Sbjct: 157 TKIVTGKLISLSEQQLVDCDVHGKDQGCQGGDMDAAFEFIVNNGGITSEANYPYEEVQRL 216
Query: 230 CEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEA-SGRDFQFYSGGVYDGHC 288
C V TI + DVP N E +L KA+ANQP+SV I+A S DFQ YSGGV+ G C
Sbjct: 217 CNAHNASFVVATIESHEDVPTNDEKALRKAVANQPVSVGIDAGSSLDFQLYSGGVFSGEC 276
Query: 289 GTQLDHGVAAVGYGSTR-GLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASY 347
GT LDH V VGYG+T G Y + KNSWG WGE GYIRM+R+ EGLCGI ASY
Sbjct: 277 GTDLDHAVTVVGYGTTSDGTKYWLAKNSWGETWGENGYIRMERDVAAKEGLCGIAMQASY 336
Query: 348 P 348
P
Sbjct: 337 P 337
>gi|326430490|gb|EGD76060.1| cysteine proteinase [Salpingoeca sp. ATCC 50818]
Length = 448
Score = 313 bits (801), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 163/323 (50%), Positives = 211/323 (65%), Gaps = 22/323 (6%)
Query: 35 PEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNL----RHIDETNRKIKNYW 90
P LT N LF+++ +KF KVYES +E+ RF +F N+ RH E R + +
Sbjct: 19 PLSLTVNKGR--LFDAFKTKFNKVYESAEEEARRFSVFSQNIDFINRHNAEAARGVHTHT 76
Query: 91 LGLNEFADLRHEEFKEMFLGLKP-DLARRKDQSHEDFSYKDVVDLPK--SVDWRKKGAVT 147
+ +N+FADL +EE+++++L P +L R+ Q + +D P SVDWR+KGAVT
Sbjct: 77 VDVNQFADLTNEEYRQLYLRPYPTELLGRERQ-------EVWLDGPNAGSVDWRQKGAVT 129
Query: 148 HVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAF 206
+KNQG CGSCW+FST +VEG + I TGNL SLSEQ+L+DC ++ N GCNGGLMD AF
Sbjct: 130 PIKNQGQCGSCWSFSTTGSVEGAHAIATGNLVSLSEQQLVDCSGSFGNQGCNGGLMDNAF 189
Query: 207 QYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLS 266
+YI+S GGL E+DYPY +G C+ +K V+I+GY DVPQN+ED L A+ P+S
Sbjct: 190 KYIISNGGLDTEQDYPYTARDGVCDKSKESKHAVSISGYKDVPQNNEDQLAAAVEKGPVS 249
Query: 267 VAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYI 326
VAIEA + FQ YS GV+ G CGT LDHGV VGY S DY IVKNSWG WG++GYI
Sbjct: 250 VAIEADQQSFQMYSSGVFSGPCGTNLDHGVLVVGYTS----DYWIVKNSWGASWGDQGYI 305
Query: 327 RMKRNTGKPEGLCGINKMASYPI 349
MKR G+CGI SYPI
Sbjct: 306 MMKRGVSS-AGICGIAMQPSYPI 327
>gi|302143416|emb|CBI21977.3| unnamed protein product [Vitis vinifera]
Length = 297
Score = 313 bits (801), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 157/300 (52%), Positives = 204/300 (68%), Gaps = 8/300 (2%)
Query: 52 MSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKI-KNYWLGLNEFADLRHEEFKEMFLG 110
M+++ ++Y+ +EK +RF+IFKDN+ I+ N+ + K Y L +NEFADL +EEF+ +
Sbjct: 1 MARYGRMYKDANEKEKRFKIFKDNVARIESFNKAMDKTYKLSINEFADLTNEEFRSLRNR 60
Query: 111 LKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGI 170
K + F Y++V +P ++DWRKKGAVT +K+Q CG CWAFS VAA EGI
Sbjct: 61 FKAHICSEATT----FKYENVTAVPSTIDWRKKGAVTPIKDQQQCGCCWAFSAVAATEGI 116
Query: 171 NQIVTGNLASLSEQELIDCDNT-YNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGT 229
QI TG L SLSEQEL+DCD N GC+GGLMD AF++I GL E YPY ++GT
Sbjct: 117 TQITTGKLISLSEQELVDCDTGGENQGCSGGLMDDAFRFI-KIHGLASEATYPYEGDDGT 175
Query: 230 CEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCG 289
C K I GY DVP N+E +L KA+A+QP++VAI+A G +FQFY+ GV+ G CG
Sbjct: 176 CNSKKEAHPAAKIKGYEDVPANNEKALQKAVAHQPVAVAIDAGGFEFQFYTSGVFTGQCG 235
Query: 290 TQLDHGVAAVGYG-STRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
T+LDHGVAAVGYG G+ Y +VKNSWG WGE+GYIRM+R+ EGLCGI ASYP
Sbjct: 236 TELDHGVAAVGYGIGDDGMMYWLVKNSWGTGWGEEGYIRMQRDVTAKEGLCGIAMQASYP 295
>gi|357160599|ref|XP_003578815.1| PREDICTED: vignain-like [Brachypodium distachyon]
Length = 339
Score = 312 bits (800), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 151/304 (49%), Positives = 203/304 (66%), Gaps = 8/304 (2%)
Query: 49 ESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMF 108
E+WM+++ +VY+ EK ++FE+FK N R ID N + +WLG+N+FADL +EEFK
Sbjct: 38 ETWMAQYGRVYKDAAEKAQKFEVFKANARFIDSFNAENHKFWLGINQFADLTNEEFKAT- 96
Query: 109 LGLKPDLARRKDQSHEDFSYKD--VVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAA 166
K + F Y++ + LP S+DWR KGAVT VK+QG CG CWAFS VAA
Sbjct: 97 -KTNKGFISNKARVSTGFKYENLKIEALPTSIDWRTKGAVTPVKDQGQCGCCWAFSAVAA 155
Query: 167 VEGINQIVTGNLASLSEQELIDCD-NTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIM 225
EGI ++ TG L SLSEQEL+DCD + + GC GGLMD AF++I++ GGL +E YPY
Sbjct: 156 TEGIVKLSTGKLVSLSEQELVDCDVHGEDQGCEGGLMDDAFKFIITNGGLTQESSYPYDA 215
Query: 226 EEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYD 285
E+G C+ G TI Y DVP N+E +L+KA+ANQP+SVA++ FQFYSGGV
Sbjct: 216 EDGKCK--SGSKSAGTIKSYEDVPANNEGALMKAVANQPVSVAVDGGDMTFQFYSGGVMT 273
Query: 286 GHCGTQLDHGVAAVGYGSTR-GLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKM 344
G CGT LDHG+AA+GYG T G + ++KNSWG WGE G++RM+++ +G+CG+
Sbjct: 274 GSCGTDLDHGIAAIGYGVTSDGTKFWLMKNSWGTTWGENGFLRMEKDIADKKGMCGLAME 333
Query: 345 ASYP 348
SYP
Sbjct: 334 PSYP 337
>gi|413953667|gb|AFW86316.1| hypothetical protein ZEAMMB73_635707 [Zea mays]
Length = 340
Score = 312 bits (800), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 157/347 (45%), Positives = 217/347 (62%), Gaps = 18/347 (5%)
Query: 9 TILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLER 68
+IL +FF ++ A DL+ + ++ E WM+++ +VY+ EK R
Sbjct: 7 SILAILGFAFFCGAALA---------ARDLSDDSAMVARHEQWMAQYSRVYKDASEKARR 57
Query: 69 FEIFKDNLRHIDETNRKIKN-YWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFS 127
FE+FK N++ I+ N N +WLG+N+FADL ++EF+ + + F
Sbjct: 58 FEVFKANVKFIESFNAGGNNKFWLGVNQFADLTNDEFRS--IKTNKGFKSSNMKIPTGFR 115
Query: 128 YKDV-VD-LPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQE 185
Y++V VD LP ++DWR KGAVT +K+QG CG CWAFS VAA EGI +I TG L SL+EQE
Sbjct: 116 YENVSVDALPTTIDWRTKGAVTPIKDQGQCGCCWAFSAVAATEGIVKISTGKLVSLAEQE 175
Query: 186 LIDCD-NTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTING 244
L+DCD + + GC GGLMD AF++I++ GGL E YPY +G C+ G + TI G
Sbjct: 176 LVDCDVHGEDQGCEGGLMDDAFKFIINNGGLTTESSYPYTAADGKCK--SGSNSAATIKG 233
Query: 245 YHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGST 304
Y DVP N E +L+KA+ANQP+SVA++ FQFYS GV G CGT LDHG+AA+GYG T
Sbjct: 234 YEDVPANDEAALMKAVANQPVSVAVDGGDMTFQFYSSGVMTGSCGTDLDHGIAAIGYGKT 293
Query: 305 R-GLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
G Y ++KNSWG WGE GY+RM+++ G+CG+ SYP +
Sbjct: 294 SDGTKYWLMKNSWGTTWGENGYLRMEKDISDKRGMCGLAMEPSYPTE 340
>gi|307111936|gb|EFN60170.1| hypothetical protein CHLNCDRAFT_59551 [Chlorella variabilis]
Length = 364
Score = 312 bits (800), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 160/317 (50%), Positives = 202/317 (63%), Gaps = 12/317 (3%)
Query: 46 DLFESWMSKFE----KVYESLDEKLER-FEIFKDNLRHIDETNRKIKNYWLGLNEFADLR 100
+ F+ W+ + + Y S E ER F I+ DNLR E N + ++WL + +ADL
Sbjct: 44 EAFDFWVHTVKPPSNRAYASSAEVYERRFNIWLDNLRFAHEYNARHTSHWLSMGVYADLS 103
Query: 101 HEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWA 160
+E++ LG L +++ F YK V P+ VDW GAVT VK+Q CGSCWA
Sbjct: 104 QDEYRSKALGYNAHLHKKRPLRAAPFLYKGTVP-PEEVDWVAGGAVTPVKDQLLCGSCWA 162
Query: 161 FSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEED 220
FST AVEG N I TG L SLSEQ L+DCD Y+ GC GG MD AF +IV+ GG+ E+D
Sbjct: 163 FSTTGAVEGANAIATGKLVSLSEQMLVDCDREYDTGCRGGFMDSAFDFIVNNGGIDTEDD 222
Query: 221 YPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYS 280
YPY E+G C+ + VVTI+GY DVP N E++L+KA+A+QP+SVAIEA FQ Y
Sbjct: 223 YPYRAEDGICQDNRTRRHVVTIDGYQDVPPNDENALMKAVAHQPVSVAIEADQLAFQLYG 282
Query: 281 GGVYDGHCGTQLDHGVAAVGYGS----TRGLDYIIVKNSWGPKWGEKGYIRMKRNTGK-- 334
GGV+D CGT LDH V VGYG+ T L Y +VKNSWG +WGEKGYIR+ RN GK
Sbjct: 283 GGVFDAECGTALDHAVLVVGYGTASNGTHNLPYWLVKNSWGAEWGEKGYIRLLRNLGKDA 342
Query: 335 PEGLCGINKMASYPIKK 351
PEG CG+ AS+PIKK
Sbjct: 343 PEGQCGLAMYASFPIKK 359
>gi|319826926|gb|ADV74756.1| cysteine protease [Lactuca sativa]
Length = 363
Score = 312 bits (800), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 160/318 (50%), Positives = 209/318 (65%), Gaps = 4/318 (1%)
Query: 39 TSNDK-LIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETN-RKIKNYWLGLNEF 96
T ND +I E WM+ ++Y +EK RF+IFK+N+ +ID N R ++Y L +N+F
Sbjct: 45 TLNDPTMIARHEQWMAHHGRIYTDENEKQLRFQIFKNNVAYIDAHNARSDQSYTLEVNKF 104
Query: 97 ADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCG 156
ADL ++EF+ G K F Y +V +P VDWRK+GAVT VK+QG CG
Sbjct: 105 ADLTNDEFRASRNGYKKQPDSDSHVVSGLFRYANVSAVPDEVDWRKEGAVTPVKDQGDCG 164
Query: 157 SCWAFSTVAAVEGINQIVTGNLASLSEQELIDCD-NTYNNGCNGGLMDYAFQYIVSTGGL 215
CWAFS VAA+EGIN++ G L SLSEQEL+DCD + + GC GGLM+ AFQ+I GL
Sbjct: 165 CCWAFSAVAAMEGINKLENGKLVSLSEQELVDCDIDGIDQGCEGGLMENAFQFIEKRKGL 224
Query: 216 HKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRD 275
E YPY E+G C K I+G+ VP N+E +LL+A+ANQP+S+AI+ASG +
Sbjct: 225 AAESVYPYTGEDGICNTKKAAIPAAKISGHEKVPANNEKALLQAVANQPVSIAIDASGYE 284
Query: 276 FQFYSGGVYDGHCGTQLDHGVAAVGYGSTR-GLDYIIVKNSWGPKWGEKGYIRMKRNTGK 334
FQFYSGGV+ G CGT+LDH + AVGYG+T G Y ++KNSWG WGE GYIR+KR++
Sbjct: 285 FQFYSGGVFTGSCGTELDHAITAVGYGATMDGTKYWLMKNSWGASWGENGYIRIKRDSLA 344
Query: 335 PEGLCGINKMASYPIKKK 352
EGLCGI SYP+ K
Sbjct: 345 KEGLCGIAMDPSYPVVSK 362
>gi|146215976|gb|ABQ10190.1| actinidin Act1b [Actinidia arguta]
Length = 380
Score = 312 bits (799), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 158/324 (48%), Positives = 215/324 (66%), Gaps = 7/324 (2%)
Query: 31 VGYSPEDLT--SNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKI-K 87
+ ++ ++LT +ND+L ++ESW++K+ K Y SL E RFEIFK+ LR IDE N +
Sbjct: 23 LAFNAKNLTKRTNDELKAMYESWLTKYGKSYNSLGEWERRFEIFKETLRFIDEHNADTNR 82
Query: 88 NYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVT 147
+Y +GLN+FAD +EEF+ +LG + K + + V LP VDWR GAV
Sbjct: 83 SYRVGLNQFADQTNEEFQSTYLGFTSGSNKMKVSNRYEPRVGQV--LPDYVDWRSAGAVV 140
Query: 148 HVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNN-GCNGGLMDYAF 206
+K+QG CGSCWAFS +A VEGIN+IVTG+L SLSEQEL+DC T N GC+GG + F
Sbjct: 141 DIKSQGQCGSCWAFSAIATVEGINKIVTGDLISLSEQELVDCGRTQNTRGCDGGSITDGF 200
Query: 207 QYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLS 266
Q+I++ GG++ E +YPY E+G C + + +I+ Y +VP N+E +L A+A QP+S
Sbjct: 201 QFIINNGGINTEANYPYTAEDGQCNLDLQNEKYASIDTYENVPYNNEWALQTAVAYQPVS 260
Query: 267 VAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYI 326
VA+EA+G FQ YS G++ G CGT +DH V VGYG+ G+DY IVKNSW WGE+GYI
Sbjct: 261 VALEAAGDAFQHYSSGIFTGPCGTAVDHAVTIVGYGTEGGIDYWIVKNSWDTTWGEEGYI 320
Query: 327 RMKRNTGKPEGLCGINKMASYPIK 350
R+ RN G G CGI SYP+K
Sbjct: 321 RILRNVGG-AGTCGIATKPSYPVK 343
>gi|21666724|gb|AAM73806.1|AF448505_1 cysteine proteinase [Brassica napus]
gi|21666726|gb|AAM73807.1|AF448506_1 cysteine proteinase [Brassica napus]
Length = 343
Score = 312 bits (799), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 153/304 (50%), Positives = 202/304 (66%), Gaps = 6/304 (1%)
Query: 50 SWMSKFEKVYESLDEKLERFEIFKDNLRHIDETN--RKIKNYWLGLNEFADLRHEEFKEM 107
+WM++ +VY +EK R+ +FK N+ I+ N + + L +N+FADL +EEF+ M
Sbjct: 39 AWMTEHGRVYADANEKNNRYVVFKRNVESIERLNEVQYGLTFKLAVNQFADLTNEEFRSM 98
Query: 108 FLGLKPDLARRKDQSHEDFSYKDVVD--LPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVA 165
+ G K + F Y+ V LP SVDWRKKGAVT +K+QGSCGSCWAFS VA
Sbjct: 99 YTGYKGNSVLSSRTKPTSFRYQHVSSDALPISVDWRKKGAVTPIKDQGSCGSCWAFSAVA 158
Query: 166 AVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIM 225
A+EG+ QI G L SLSEQEL+DCD T ++GC GG M+ AF Y ++TGGL E +YPY
Sbjct: 159 AIEGVAQIKKGKLISLSEQELVDCD-TNDDGCMGGYMNSAFNYTMTTGGLTSESNYPYKS 217
Query: 226 EEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYD 285
+GTC + K + +I G+ DVP N E +L+KA+A+ P+S+ I G FQFYS GV+
Sbjct: 218 TDGTCNINKTKQIATSIKGFEDVPANDEKALMKAVAHHPVSIGIAGGGTGFQFYSSGVFS 277
Query: 286 GHCGTQLDHGVAAVGYG-STRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKM 344
G C T LDHGVA VGYG S+ G Y I+KNSWGPKWGE+GY+R+K++T G CG+
Sbjct: 278 GECSTHLDHGVAVVGYGKSSNGSKYWILKNSWGPKWGERGYMRIKKDTKAKHGQCGLAMN 337
Query: 345 ASYP 348
ASYP
Sbjct: 338 ASYP 341
>gi|37780043|gb|AAP32194.1| cysteine protease 1 [Trifolium repens]
Length = 292
Score = 312 bits (799), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 158/289 (54%), Positives = 193/289 (66%), Gaps = 5/289 (1%)
Query: 64 EKLERFEIFKDNLRHIDETNRKIKN--YWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQ 121
E+ +R IF N+ +I+ +N + N Y L +N+FADL +EEF K + +
Sbjct: 3 EREKRLRIFNKNVNYIEASNSAVNNKLYKLSINKFADLTNEEFIASRNKFKGHMCSSIIR 62
Query: 122 SHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASL 181
+ F Y++ +P +VDWRKKGAVT VKNQG CGSCWAFS VAA EGI+Q+ TG L SL
Sbjct: 63 T-TTFKYENASAIPSTVDWRKKGAVTPVKNQGQCGSCWAFSAVAATEGIHQLSTGKLVSL 121
Query: 182 SEQELIDCDNT-YNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVV 240
SEQELIDCD + GC GGLMD AF++I+ GL E YPY +GTC K V
Sbjct: 122 SEQELIDCDTKGVDQGCEGGLMDDAFKFIIQNHGLSTEVQYPYEGVDGTCNANKASIHAV 181
Query: 241 TINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVG 300
TI GY DVP N+E +L KA+ANQP+SVAI+ASG DFQFY+ GV+ G CGT+LDHGV AVG
Sbjct: 182 TITGYEDVPANNELALQKAVANQPISVAIDASGSDFQFYNSGVFTGSCGTELDHGVTAVG 241
Query: 301 YG-STRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
YG G Y +VKNSWG WGE+GYIRM+R EGLCGI ASYP
Sbjct: 242 YGVGNDGTKYWLVKNSWGADWGEEGYIRMQRGIAAAEGLCGIAMQASYP 290
>gi|57118005|gb|AAW34134.1| cysteine protease gp2a [Zingiber officinale]
Length = 381
Score = 312 bits (799), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 158/322 (49%), Positives = 211/322 (65%), Gaps = 13/322 (4%)
Query: 38 LTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETN----RKIKNYWLGL 93
+ S++++ L+ W K + LD R E+FK+NL+ +DE N R + LG+
Sbjct: 43 VRSDEEVRMLYLEWRVKNHPAEKYLDLNEYRLEVFKENLQFVDEHNAAADRGEHTFLLGM 102
Query: 94 NEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFS----YKDVVDLPKSVDWRKKGAVTHV 149
N FADL +EE++ FL D +R + + S ++ DLP S+DWR+ GAV V
Sbjct: 103 NRFADLTNEEYRTRFL---RDFSRLRRSASGKISSRYRLREGDDLPDSIDWRENGAVVPV 159
Query: 150 KNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYI 209
KNQG CGSCWAFSTVAAVEGINQIVTG+L SLSEQ+L+DC T N+GC GG M+ AFQ+I
Sbjct: 160 KNQGGCGSCWAFSTVAAVEGINQIVTGDLISLSEQQLVDC-TTANHGCRGGWMNPAFQFI 218
Query: 210 VSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAI 269
V+ GG++ EE YPY + G C T + VV+I+ Y +VP ++E SL KA+ANQP+SV +
Sbjct: 219 VNNGGINSEETYPYRGQNGICNSTV-NAPVVSIDSYENVPSHNEQSLQKAVANQPVSVTM 277
Query: 270 EASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMK 329
+A+GRDFQ Y G++ G C +H + VGYG+ D+ IVKNSWG WGE GYIR +
Sbjct: 278 DAAGRDFQLYRSGIFTGSCNISANHALTVVGYGTENDKDFWIVKNSWGKNWGESGYIRAE 337
Query: 330 RNTGKPEGLCGINKMASYPIKK 351
RN P G CGI + ASYP+KK
Sbjct: 338 RNIENPNGKCGITRFASYPVKK 359
>gi|194352758|emb|CAQ00107.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
Length = 457
Score = 312 bits (799), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 153/310 (49%), Positives = 199/310 (64%), Gaps = 11/310 (3%)
Query: 48 FESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKI-------KNYWLGLNEFADLR 100
FE+W ++ K Y + E+ R F +N + N + +Y L LN FADL
Sbjct: 39 FEAWCAEHGKAYATPGERAARLAAFAENAAFVAAHNDAVASSGPGGPSYTLALNAFADLT 98
Query: 101 HEEFKEMFLG---LKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGS 157
H+EF+ LG + P S F + V +P ++DWR+ GAVT VK+QGSCG+
Sbjct: 99 HDEFRAARLGRLAVGPGPLGAPSPSDGGFEGR-VGAVPDALDWRQSGAVTKVKDQGSCGA 157
Query: 158 CWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHK 217
CW+FS A+EGIN+I TG+L SLSEQELIDCD +YN GC GGLM YA+++++ GG+
Sbjct: 158 CWSFSATGAMEGINKITTGSLLSLSEQELIDCDRSYNTGCGGGLMTYAYKFVIKNGGIDT 217
Query: 218 EEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQ 277
E+DYP+ +GTC K + VVTI+GY +VP + ED LL+A+A QP+SV I S R FQ
Sbjct: 218 EDDYPFREADGTCNKNKLKKHVVTIDGYKEVPSSKEDLLLQAVAQQPISVGICGSARAFQ 277
Query: 278 FYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEG 337
YS G++DG C T LDH V VGYGS G DY IVKNSWG +WG KGY+ M RNTG G
Sbjct: 278 LYSQGIFDGPCPTSLDHAVLIVGYGSEGGKDYWIVKNSWGERWGMKGYMHMHRNTGSSSG 337
Query: 338 LCGINKMASY 347
+CGIN MAS+
Sbjct: 338 ICGINMMASF 347
>gi|944916|gb|AAA74430.1| cysteine proteinase [Mesembryanthemum crystallinum]
Length = 367
Score = 312 bits (799), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 163/325 (50%), Positives = 217/325 (66%), Gaps = 11/325 (3%)
Query: 31 VGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKNYW 90
+ ++ +DL S++ L DL+E W S + S EK RF +FK+N+++I+E N+ K Y
Sbjct: 27 IDFTDKDLESDETLWDLYERWRSVYTSA-RSFGEKQNRFHVFKENVKYINEVNKMDKPYK 85
Query: 91 LGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVK 150
L LN+F DL EF + K R + F Y++V ++P+S+DWR KGAVT VK
Sbjct: 86 LRLNQFGDLTPSEFARTYANSKIIEGTRNESG--GFMYENV-EVPRSIDWRVKGAVTPVK 142
Query: 151 NQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIV 210
NQG CG CWAFS AAVEGINQI TG L SLSEQ+LIDCD T N+GC GG M AF+YI
Sbjct: 143 NQGRCGGCWAFSAAAAVEGINQITTGQLISLSEQQLIDCD-TQNSGCRGGTMGRAFEYIK 201
Query: 211 STGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIE 270
GG+ E +YPY + G C+ + V+I+GY+++ + SED++LK LA+QP+SVA++
Sbjct: 202 QRGGITSEANYPYKAQAGMCKNNLIQRPTVSIDGYYNI-RRSEDAVLKILAHQPVSVAVD 260
Query: 271 A---SGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTR-GLDYIIVKNSWGPKWGEKGYI 326
A S D+ FY GV+ G CGT+L+HGV AVGYG+T G DY I+KNSWG WGE+GY+
Sbjct: 261 ATTWSSLDWMFYFQGVFTGPCGTKLNHGVTAVGYGTTNDGYDYWIIKNSWGETWGERGYM 320
Query: 327 RMKRNTGKPEGLCGINKMASYPIKK 351
RM R P GLCGI AS+PIK+
Sbjct: 321 RMLRGV-SPYGLCGIAMQASFPIKR 344
>gi|84181681|gb|AAW78661.2| senescence-specific cysteine protease [Nicotiana tabacum]
Length = 349
Score = 312 bits (799), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 157/307 (51%), Positives = 207/307 (67%), Gaps = 9/307 (2%)
Query: 49 ESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNR-KIKNYWLGLNEFADLRHEEFKEM 107
+ W+ EKVY+ L+EK RF+IFK+N+ I+ N + K Y LG N+F+DL +EEF+ +
Sbjct: 43 DQWIVHHEKVYKDLNEKEVRFQIFKENVERIEAFNAGEDKGYKLGFNKFSDLTNEEFRVL 102
Query: 108 FLGLKPD----LARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFST 163
G K + K ++H F Y +V D+P ++DWRKKGAVT +K+Q CG CWAFS
Sbjct: 103 HTGYKRSHPKVMTSSKGKTH--FRYTNVTDIPPTMDWRKKGAVTPIKDQKECGCCWAFSA 160
Query: 164 VAAVEGINQIVTGNLASLSEQELIDCD-NTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYP 222
VAA+EG++Q+ TG L LSEQEL+DCD + GC+GGL+D AF +I+ GL E +YP
Sbjct: 161 VAAMEGLHQLKTGELIPLSEQELVDCDVEGEDEGCSGGLLDTAFDFILKNKGLTTEVNYP 220
Query: 223 YIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGG 282
Y E+G C K I GY DVP NSE +LL+A+ANQP+SVAI+ S DFQFYS G
Sbjct: 221 YKGEDGVCNKKKSALSAAKITGYEDVPANSEKALLQAVANQPVSVAIDGSSFDFQFYSSG 280
Query: 283 VYDGHCGTQLDHGVAAVGYGSTR-GLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGI 341
V+ G C T L+H V AVGYG+T G Y I+KNSWG KWG+ GY+R+KR+ + EGLCG+
Sbjct: 281 VFSGSCSTWLNHAVTAVGYGATTDGTKYWIIKNSWGSKWGDSGYMRIKRDVHEKEGLCGL 340
Query: 342 NKMASYP 348
ASYP
Sbjct: 341 AMDASYP 347
>gi|223946391|gb|ACN27279.1| unknown [Zea mays]
Length = 279
Score = 311 bits (798), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 155/260 (59%), Positives = 184/260 (70%), Gaps = 10/260 (3%)
Query: 99 LRHEEFKEMFLGLKPDLAR--RKDQ-----SHEDFSYKDVVDLPKSVDWRKKGAVTHVKN 151
+ +EF+ + G + R R D+ S F Y D D+P SVDWR+KGAVT VK+
Sbjct: 1 MTADEFRRHYAGSRVAHHRMFRGDRQGSSASASSFMYADARDVPASVDWRQKGAVTDVKD 60
Query: 152 QGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVS 211
QG CGSCWAFST+AAVEGIN I T NL SLSEQ+L+DCD N GCNGGLMDYAFQYI
Sbjct: 61 QGQCGSCWAFSTIAAVEGINAIKTKNLTSLSEQQLVDCDTKANAGCNGGLMDYAFQYIAK 120
Query: 212 TGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEA 271
GG+ E+ YPY + +C+ K + VVTI+GY DVP N E +L KA+A+QP+SVAIEA
Sbjct: 121 HGGVAAEDAYPYRARQASCK--KSPAPVVTIDGYEDVPANDESALKKAVAHQPVSVAIEA 178
Query: 272 SGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTR-GLDYIIVKNSWGPKWGEKGYIRMKR 330
SG FQFYS GV+ G CGT+LDHGVAAVGYG T G Y +VKNSWGP+WGEKGYIRM R
Sbjct: 179 SGSHFQFYSEGVFSGRCGTELDHGVAAVGYGVTADGTKYWLVKNSWGPEWGEKGYIRMAR 238
Query: 331 NTGKPEGLCGINKMASYPIK 350
+ EG CGI ASYP+K
Sbjct: 239 DVAAKEGHCGIAMEASYPVK 258
>gi|312281697|dbj|BAJ33714.1| unnamed protein product [Thellungiella halophila]
Length = 347
Score = 311 bits (798), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 165/354 (46%), Positives = 223/354 (62%), Gaps = 20/354 (5%)
Query: 4 SSQFKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLI--DLFESWMSKFEKVYES 61
S Q + LI IS F SI P D +++LI + WM+K +VY
Sbjct: 3 SKQIQIFLIVSLISSFC-------LSITLSRPLD---DNELIMQKRHDEWMAKHGRVYAD 52
Query: 62 LDEKLERFEIFKDNLRHIDETNR--KIKNYWLGLNEFADLRHEEFKEMFLGLKPD--LAR 117
+ EK R+ +FK N+ I+ N + + L +N+FADL ++EF+ M+ G K L+
Sbjct: 53 MKEKNNRYVVFKRNVERIERLNNVPAGRTFKLAVNQFADLTNDEFRSMYTGYKGGSVLSS 112
Query: 118 RKDQSHEDFSYKDVVD--LPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVT 175
+ F Y++V LP SVDWRKKGAVT +KNQG+CG CWAFS VAA+EG +I
Sbjct: 113 QSGTKTSSFRYQNVSSGALPVSVDWRKKGAVTPIKNQGTCGCCWAFSAVAAIEGATKIKK 172
Query: 176 GNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKG 235
G L SLSEQ+L+DCD T + GC+GGLMD AF++I++TGGL E +YPY ++ TC++
Sbjct: 173 GKLISLSEQQLVDCD-TNDFGCSGGLMDTAFEHIMATGGLTTESNYPYKGKDATCKIKNT 231
Query: 236 ESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHG 295
+ +I GY DVP N E +L+KA+A+QP+S+ IE G DFQFY GV+ G C T LDH
Sbjct: 232 KPTATSITGYEDVPVNDEKALMKAVAHQPVSIGIEGGGFDFQFYGSGVFTGECTTYLDHA 291
Query: 296 VAAVGYG-STRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
V AVGYG S+ G Y I+KNSWG KWGE GY+R+K++ +GLCG+ ASYP
Sbjct: 292 VTAVGYGQSSNGSKYWIIKNSWGTKWGESGYMRIKKDVKDKKGLCGLAMKASYP 345
>gi|310656789|gb|ADP02218.1| Peptidase_C1 domain-containing protein [Triticum aestivum]
Length = 341
Score = 311 bits (798), Expect = 3e-82, Method: Compositional matrix adjust.
Identities = 152/313 (48%), Positives = 206/313 (65%), Gaps = 10/313 (3%)
Query: 44 LIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEE 103
+++ E WM+KF +VY+ EK +RFE+FK N+ I+ N + + +WLG+N+F DL ++E
Sbjct: 33 MVERHEQWMAKFNRVYKDGTEKAQRFEVFKANVAFIESFNAENRKFWLGVNQFTDLTNDE 92
Query: 104 FKEMFL--GLKPDLARRKDQSHEDFSYKDV-VD-LPKSVDWRKKGAVTHVKNQGSCGSCW 159
F+ GLK R + F Y +V +D LP +VDWR KG VT +K+QG CG CW
Sbjct: 93 FRATKTNKGLKMSGGR----APTGFKYSNVSIDALPTAVDWRTKGVVTPIKDQGQCGCCW 148
Query: 160 AFSTVAAVEGINQIVTGNLASLSEQELIDCD-NTYNNGCNGGLMDYAFQYIVSTGGLHKE 218
AFS V A EGI ++ TG L SLSEQEL+DCD + + GC GG MD AF++I+ GGL E
Sbjct: 149 AFSAVVATEGIVKLSTGKLISLSEQELVDCDVHGVDQGCEGGEMDDAFKFIIKNGGLTTE 208
Query: 219 EDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQF 278
+YPY ++G C+ + + V TI GY DVP N E SL+KA+ANQP+SVA++ FQ
Sbjct: 209 ANYPYTAQDGQCKTSIASNSVATIKGYEDVPANDESSLMKAVANQPVSVAVDGGDVIFQH 268
Query: 279 YSGGVYDGHCGTQLDHGVAAVGYGSTR-GLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEG 337
YSGGV G CGT LDHG+AA+GYG T G Y ++KNSWG WGE GY+RM+++ G
Sbjct: 269 YSGGVMTGSCGTDLDHGIAAIGYGMTSDGTKYWLLKNSWGTTWGESGYLRMEKDISDKSG 328
Query: 338 LCGINKMASYPIK 350
+CG+ SYP +
Sbjct: 329 MCGLAMQPSYPTE 341
>gi|359483753|ref|XP_002266308.2| PREDICTED: oryzain alpha chain-like [Vitis vinifera]
Length = 501
Score = 311 bits (798), Expect = 3e-82, Method: Compositional matrix adjust.
Identities = 165/356 (46%), Positives = 223/356 (62%), Gaps = 18/356 (5%)
Query: 6 QFKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEK 65
Q +L + + SS +F I G E+ S +++ +LF W + ++VY+ +E
Sbjct: 7 QLALVLFIWASLACLSSSLPTEFYITG---EEFASEERVRELFHLWKERHKRVYKHAEET 63
Query: 66 LERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLA-------RR 118
+RFEIFK+NL+++ E N K + LG+N+FAD+ +EEFKE +L RR
Sbjct: 64 AKRFEIFKENLKYVIERNSKGHRHTLGMNKFADMSNEEFKEKYLSKIKKPINKKNNYLRR 123
Query: 119 KDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNL 178
Q + + + P S+DWRKKG VT +K+QG CGSCWAFS+ A+EGIN IVTG+L
Sbjct: 124 SMQQKKGTA---SCEAPSSLDWRKKGVVTGIKDQGDCGSCWAFSSTGAMEGINAIVTGDL 180
Query: 179 ASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESE 238
SLSEQEL+DCD T N GC GG MDYAF++++S GG+ E DYPY +GTC TK +++
Sbjct: 181 ISLSEQELVDCDTT-NYGCEGGYMDYAFEWVISNGGIDSESDYPYTGTDGTCNTTKEDTK 239
Query: 239 VVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDG---HCGTQLDHG 295
VV+I+GY DV + S+ +LL A NQP+SV ++ S DFQ Y+ G+Y G +DH
Sbjct: 240 VVSIDGYKDVDE-SDSALLCAAVNQPISVGMDGSALDFQLYTSGIYAGDCSDDPDDIDHA 298
Query: 296 VAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIKK 351
V VGYGS DY I KNSWG WG +GY +KRNT P G C IN MASYP K+
Sbjct: 299 VLIVGYGSEDSEDYWICKNSWGTSWGMEGYFYIKRNTDLPYGECAINAMASYPTKE 354
>gi|413917937|gb|AFW57869.1| hypothetical protein ZEAMMB73_830006 [Zea mays]
Length = 443
Score = 311 bits (797), Expect = 3e-82, Method: Compositional matrix adjust.
Identities = 157/333 (47%), Positives = 206/333 (61%), Gaps = 11/333 (3%)
Query: 17 SFFIRSSFARDFSIVG-YSPEDLTSNDK-LIDLFESWMSKFEKVYESLDEKLERFEIFKD 74
+F + S A ++ G + DL D+ ++ E WM+K+++VY EK RFE+FK
Sbjct: 8 AFVLLSVVAWACALSGSLAARDLADQDQAMVARHEEWMAKYDRVYSDAAEKARRFEVFKA 67
Query: 75 NLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLAR-----RKDQSHEDFSYK 129
N+ I+ N +WL N FADL +EF+ + G +P A R + F Y
Sbjct: 68 NMALIESVNAGNHKFWLEANRFADLTDDEFRATWTGYRPKTAAASSKGRSRTATTGFKYA 127
Query: 130 DVV--DLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELI 187
+V D+P SVDWR KGAVT +KNQG CG CWAFS VA++EG+ ++ TG L SLSEQEL+
Sbjct: 128 NVSLDDVPASVDWRTKGAVTPIKNQGECGCCWAFSAVASMEGVVKLSTGKLVSLSEQELV 187
Query: 188 DCD-NTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYH 246
DCD N + GC GG MD AF +IV GGL E YPY +GTC + + +I GY
Sbjct: 188 DCDVNGMDQGCEGGEMDDAFDFIVGNGGLTTESRYPYTASDGTCNSNEASGDAASIKGYE 247
Query: 247 DVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYG-STR 305
DVP N E SL KA+ANQP+SVA++ F+FY GGV G CGT+LDHG+AAVGYG ++
Sbjct: 248 DVPANDEASLRKAVANQPVSVAVDGGDSHFRFYKGGVLSGACGTELDHGIAAVGYGVASD 307
Query: 306 GLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGL 338
G Y ++KNSWG WGE GYIRM+R+ E L
Sbjct: 308 GTKYWVMKNSWGTSWGEAGYIRMERDIADEEVL 340
>gi|225446523|ref|XP_002275891.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP2 [Vitis vinifera]
Length = 358
Score = 311 bits (797), Expect = 3e-82, Method: Compositional matrix adjust.
Identities = 162/348 (46%), Positives = 218/348 (62%), Gaps = 6/348 (1%)
Query: 7 FKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDL---FESWMSKFEKVYESLD 63
KT + + F + + S +S E ++ D+ +E W+ + + Y++ D
Sbjct: 1 MKTSMFCRNVYFALLIMWTVGVSWSAFSEEHEPMESEMSDMEKRYERWLVQHGRRYKNRD 60
Query: 64 EKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSH 123
E F I++ N+R I+ N + ++ L N+FAD+ +EE+K +++GL RK+QS
Sbjct: 61 EWQRHFGIYQSNVRFINYINAQNFSFTLTDNQFADMTNEEYKALYMGLGTSETSRKNQS- 119
Query: 124 EDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSE 183
F + LP SVDWRK GAVT V+NQG CGSCWAFSTVAAVEGIN+I TG L SLSE
Sbjct: 120 -SFKRERSKVLPISVDWRKMGAVTPVRNQGECGSCWAFSTVAAVEGINKIRTGKLVSLSE 178
Query: 184 QELIDCD-NTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTI 242
QEL+DCD ++ N GCNGG M AF++I GG+ +YPYI E+G C K + VV I
Sbjct: 179 QELLDCDIDSGNEGCNGGYMVNAFKFIKQNGGITTARNYPYIGEQGICNKDKAANHVVKI 238
Query: 243 NGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYG 302
+GY VP N+E L A+A QP+SVAI+A G +FQ YS G+++G CG QL+H V +GYG
Sbjct: 239 SGYETVPPNNEKILQAAVAKQPVSVAIDAGGYEFQLYSKGIFNGFCGKQLNHAVTVIGYG 298
Query: 303 STRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
G Y +VKNSWG WGE GY RM R++ EG+CGI ASYPIK
Sbjct: 299 EDNGKKYWLVKNSWGTGWGEAGYARMIRDSRDDEGICGIAMEASYPIK 346
>gi|4731372|gb|AAD28476.1|AF133838_1 papain-like cysteine protease [Sandersonia aurantiaca]
Length = 370
Score = 311 bits (797), Expect = 3e-82, Method: Compositional matrix adjust.
Identities = 143/247 (57%), Positives = 181/247 (73%)
Query: 105 KEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTV 164
+ + G++ R + + + Y+ LP SVDWR+KGAV +K+QG CGSCWAFST+
Sbjct: 12 RTTYFGVRGAGRRTPGLASDRYRYRAGDALPDSVDWREKGAVVPIKDQGGCGSCWAFSTI 71
Query: 165 AAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYI 224
A+VEGIN+IVTG+L SLSEQEL+DCD TYN+GCNGGLMDYAFQ+I+ GG+ E+DYPY
Sbjct: 72 ASVEGINKIVTGDLISLSEQELVDCDKTYNDGCNGGLMDYAFQFIIDNGGIDTEKDYPYT 131
Query: 225 MEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVY 284
++G C+ + ++VV+IN Y DVP N E +L KA A+QP++VAI+ GR FQ Y+ G++
Sbjct: 132 EQDGRCDSYRKNAKVVSINSYEDVPVNDEQALKKAAASQPIAVAIDGGGRSFQLYNSGIF 191
Query: 285 DGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKM 344
G CGT LDHGV VGYGS G DY IV+NSWG WGEKGYIRM RN P G+CGI
Sbjct: 192 TGKCGTSLDHGVTVVGYGSESGKDYWIVRNSWGESWGEKGYIRMARNIDSPSGICGIAME 251
Query: 345 ASYPIKK 351
ASYPIKK
Sbjct: 252 ASYPIKK 258
>gi|357160569|ref|XP_003578807.1| PREDICTED: vignain-like [Brachypodium distachyon]
Length = 339
Score = 311 bits (797), Expect = 3e-82, Method: Compositional matrix adjust.
Identities = 154/319 (48%), Positives = 207/319 (64%), Gaps = 8/319 (2%)
Query: 34 SPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKNYWLGL 93
+ +L + ++ ESWM ++ +VY+ EK +FE+FK N ID N +WLG+
Sbjct: 23 AARELNDDLSMVARHESWMLQYGRVYKDAAEKASKFEVFKANAGFIDSFNAGNHKFWLGI 82
Query: 94 NEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVV--DLPKSVDWRKKGAVTHVKN 151
N+FAD+ ++EFK K ++ FSY++V LP S+DWR KGAVT VK+
Sbjct: 83 NQFADITNKEFKAT--KTNKGFISNKVRAPTGFSYENVSFDALPASIDWRTKGAVTPVKD 140
Query: 152 QGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCD-NTYNNGCNGGLMDYAFQYIV 210
QG CG CWAFS VAA EGI ++ TG L SLSEQEL+DCD + + GC GGLMD AF++I+
Sbjct: 141 QGQCGCCWAFSAVAATEGIVKLSTGKLVSLSEQELVDCDVHGEDQGCEGGLMDDAFKFII 200
Query: 211 STGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIE 270
S GGL +E YPY E+G C+ G TI Y DVP N+E +L+KA+ANQP+SVA++
Sbjct: 201 SNGGLTQESSYPYDAEDGKCK--SGSKSAGTIKSYEDVPANNEGALMKAVANQPVSVAVD 258
Query: 271 ASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTR-GLDYIIVKNSWGPKWGEKGYIRMK 329
FQFYSGGV G CGT LDHG+AA+GYG T G Y ++KNSWG WGE G++RM+
Sbjct: 259 GGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYGVTSDGTKYWLMKNSWGTSWGENGFLRME 318
Query: 330 RNTGKPEGLCGINKMASYP 348
++ +G+CG+ SYP
Sbjct: 319 KDIADKKGMCGLAMEPSYP 337
>gi|318136892|gb|ADV41672.1| cysteine protease [Nicotiana tabacum]
Length = 349
Score = 311 bits (796), Expect = 4e-82, Method: Compositional matrix adjust.
Identities = 155/307 (50%), Positives = 209/307 (68%), Gaps = 9/307 (2%)
Query: 49 ESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNR-KIKNYWLGLNEFADLRHEEFKEM 107
+ W++ +KVY+ L+EK RF+IFK+N+ I+ N + K Y LG+N+F+DL +E+F+ +
Sbjct: 43 DQWIAHHDKVYKDLNEKEMRFKIFKENVERIEAFNAGEDKGYKLGVNKFSDLTNEKFRVL 102
Query: 108 FLGLKPD----LARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFST 163
G K ++ K ++H F Y +V D+P ++DWRKKGAVT +K+Q CG CWAFS
Sbjct: 103 HTGYKRSHPKVMSSSKPKTH--FRYANVTDIPPTMDWRKKGAVTPIKDQKECGCCWAFSA 160
Query: 164 VAAVEGINQIVTGNLASLSEQELIDCD-NTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYP 222
VAA EG++Q+ TG L LSEQEL+DCD + GC+GGL+D AF +I+ GL E +YP
Sbjct: 161 VAATEGLHQLKTGKLIPLSEQELVDCDVEGEDEGCSGGLLDTAFDFILKNKGLTTEANYP 220
Query: 223 YIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGG 282
Y E+G C K I GY DVP NSE +LL+A+ANQP+SVAI+ S DFQFYS G
Sbjct: 221 YKGEDGVCNKKKSALSAAKIAGYEDVPANSEKALLQAVANQPVSVAIDGSSFDFQFYSSG 280
Query: 283 VYDGHCGTQLDHGVAAVGYGSTR-GLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGI 341
V+ G C T L+H V AVGYG+T G Y I+KNSWG KWG+ GY+R+KR+ + EGLCG+
Sbjct: 281 VFSGSCSTWLNHAVTAVGYGATTDGTKYWIIKNSWGSKWGDSGYMRIKRDVHEKEGLCGL 340
Query: 342 NKMASYP 348
ASYP
Sbjct: 341 AMDASYP 347
>gi|5823018|gb|AAD53011.1|AF089848_1 senescence-specific cysteine protease [Brassica napus]
Length = 346
Score = 310 bits (795), Expect = 5e-82, Method: Compositional matrix adjust.
Identities = 164/357 (45%), Positives = 224/357 (62%), Gaps = 22/357 (6%)
Query: 1 MALSSQFKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLI--DLFESWMSKFEKV 58
MAL +++S SF ++ +R +D+LI + WM++ +
Sbjct: 1 MALEHIKIFLIVSLVSSFCFSTTLSRLL------------DDELIMQKKHDEWMAEHGRT 48
Query: 59 YESLDEKLERFEIFKDNLRHIDETNR--KIKNYWLGLNEFADLRHEEFKEMFLGLKPD-- 114
Y ++EK R+ +FK N+ I+ N + + L +N+FADL ++EF+ M+ G K D
Sbjct: 49 YADMNEKNNRYVVFKRNVERIERLNNVPAGRTFKLAVNQFADLTNDEFRFMYTGYKGDFV 108
Query: 115 LARRKDQSHEDFSYKDVV--DLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQ 172
L + F Y++V LP +VDWRKKGAVT +KNQGSCG CWAFS VAA+EG Q
Sbjct: 109 LFSQSQTKSTSFRYQNVFFGALPIAVDWRKKGAVTPIKNQGSCGCCWAFSAVAAIEGATQ 168
Query: 173 IVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEM 232
I G L SLSEQ+L+DCD T + GC+GGLMD AF++I++TGGL E +YPY E+ C++
Sbjct: 169 IKKGKLISLSEQQLVDCD-TNDFGCSGGLMDTAFEHIMATGGLTTESNYPYKGEDANCKI 227
Query: 233 TKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQL 292
+ +I GY DVP N E++L+KA+A+QP+SV IE G DFQFYS GV+ G C T L
Sbjct: 228 KSTKPSAASITGYEDVPVNDENALMKAVAHQPVSVGIEGGGFDFQFYSSGVFTGECTTYL 287
Query: 293 DHGVAAVGYG-STRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
DH V AVGY S+ G Y I+KNSWG KWGE GY+R+K++ EGLCG+ ASYP
Sbjct: 288 DHAVTAVGYSQSSAGSKYWIIKNSWGTKWGEGGYMRIKKDIKDKEGLCGLAMKASYP 344
>gi|302143380|emb|CBI21941.3| unnamed protein product [Vitis vinifera]
Length = 354
Score = 310 bits (795), Expect = 5e-82, Method: Compositional matrix adjust.
Identities = 162/343 (47%), Positives = 217/343 (63%), Gaps = 8/343 (2%)
Query: 14 FC--ISFFIRSSFARDFSIVGYSPEDLTSNDKLIDL---FESWMSKFEKVYESLDEKLER 68
FC + F + + S +S E ++ D+ +E W+ + + Y++ DE
Sbjct: 2 FCRNVYFALLIMWTVGVSWSAFSEEHEPMESEMSDMEKRYERWLVQHGRRYKNRDEWQRH 61
Query: 69 FEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSY 128
F I++ N+R I+ N + ++ L N+FAD+ +EE+K +++GL RK+QS F
Sbjct: 62 FGIYQSNVRFINYINAQNFSFTLTDNQFADMTNEEYKALYMGLGTSETSRKNQS--SFKR 119
Query: 129 KDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELID 188
+ LP SVDWRK GAVT V+NQG CGSCWAFSTVAAVEGIN+I TG L SLSEQEL+D
Sbjct: 120 ERSKVLPISVDWRKMGAVTPVRNQGECGSCWAFSTVAAVEGINKIRTGKLVSLSEQELLD 179
Query: 189 CD-NTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHD 247
CD ++ N GCNGG M AF++I GG+ +YPYI E+G C K + VV I+GY
Sbjct: 180 CDIDSGNEGCNGGYMVNAFKFIKQNGGITTARNYPYIGEQGICNKDKAANHVVKISGYET 239
Query: 248 VPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGL 307
VP N+E L A+A QP+SVAI+A G +FQ YS G+++G CG QL+H V +GYG G
Sbjct: 240 VPPNNEKILQAAVAKQPVSVAIDAGGYEFQLYSKGIFNGFCGKQLNHAVTVIGYGEDNGK 299
Query: 308 DYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
Y +VKNSWG WGE GY RM R++ EG+CGI ASYPIK
Sbjct: 300 KYWLVKNSWGTGWGEAGYARMIRDSRDDEGICGIAMEASYPIK 342
>gi|190358935|sp|P00785.4|ACTN_ACTCH RecName: Full=Actinidain; Short=Actinidin; AltName: Allergen=Act c
1; Flags: Precursor
gi|12744965|gb|AAK06862.1|AF343446_1 actinidin protease [Actinidia chinensis]
Length = 380
Score = 310 bits (794), Expect = 6e-82, Method: Compositional matrix adjust.
Identities = 163/354 (46%), Positives = 223/354 (62%), Gaps = 15/354 (4%)
Query: 1 MALSSQFKTILISFCISFFIRSSFARDFSIVGYSPEDLT--SNDKLIDLFESWMSKFEKV 58
M L F ++ + F + I S + ++ ++LT +ND++ ++ESW+ K+ K
Sbjct: 1 MGLPKSFVSMSLLFFSTLLILS--------LAFNAKNLTQRTNDEVKAMYESWLIKYGKS 52
Query: 59 YESLDEKLERFEIFKDNLRHIDETNRKI-KNYWLGLNEFADLRHEEFKEMFLGLKPDLAR 117
Y SL E RFEIFK+ LR IDE N ++Y +GLN+FADL EEF+ +L +
Sbjct: 53 YNSLGEWERRFEIFKETLRFIDEHNADTNRSYKVGLNQFADLTDEEFRSTYLRFTSGSNK 112
Query: 118 RKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGN 177
K + + V LP VDWR GAV +K+QG CG CWAFS +A VEGIN+IVTG
Sbjct: 113 TKVSNRYEPRVGQV--LPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGV 170
Query: 178 LASLSEQELIDCDNTYNN-GCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGE 236
L SLSEQELIDC T N GCNGG + FQ+I++ GG++ EE+YPY ++G C +
Sbjct: 171 LISLSEQELIDCGRTQNTRGCNGGYITDGFQFIINNGGINTEENYPYTAQDGECNVDLQN 230
Query: 237 SEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGV 296
+ VTI+ Y +VP N+E +L A+ QP+SVA++A+G F+ YS G++ G CGT +DH V
Sbjct: 231 EKYVTIDTYENVPYNNEWALQTAVTYQPVSVALDAAGDAFKQYSSGIFTGPCGTAVDHAV 290
Query: 297 AAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
VGYG+ G+DY IVKNSW WGE+GY+R+ RN G G CGI M SYP+K
Sbjct: 291 TIVGYGTEGGIDYWIVKNSWDTTWGEEGYMRILRNVGG-AGTCGIATMPSYPVK 343
>gi|297809383|ref|XP_002872575.1| hypothetical protein ARALYDRAFT_911472 [Arabidopsis lyrata subsp.
lyrata]
gi|297318412|gb|EFH48834.1| hypothetical protein ARALYDRAFT_911472 [Arabidopsis lyrata subsp.
lyrata]
Length = 371
Score = 310 bits (794), Expect = 6e-82, Method: Compositional matrix adjust.
Identities = 156/307 (50%), Positives = 202/307 (65%), Gaps = 4/307 (1%)
Query: 47 LFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKE 106
+F+SWM K KVY S+ EK R IF+DNLR I N + +Y LGL +FADL E+ E
Sbjct: 55 IFDSWMVKHGKVYGSVAEKERRLTIFEDNLRFISNRNAENLSYRLGLTQFADLSLHEYGE 114
Query: 107 MFLGLKPDLARRKDQSHEDFSYKDVVD--LPKSVDWRKKGAVTHVKNQGSCGSCWAFSTV 164
+ G P R YK LPKSVDWR +GAVT VK+QG C SCWAFSTV
Sbjct: 115 VCHGADPRPPRNHVFMTSSDRYKTSAGDVLPKSVDWRNEGAVTEVKDQGHCRSCWAFSTV 174
Query: 165 AAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYI 224
AVEG+N+IVTG L +LSEQ+LI+C N NNGC GG ++ A+++I+ GGL + DYPY
Sbjct: 175 GAVEGLNKIVTGELVTLSEQDLINC-NKENNGCGGGKVETAYEFIMKNGGLGTDNDYPYK 233
Query: 225 MEEGTCE-MTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGV 283
G C+ K ++ V I+G+ ++P N E +L+KA+A+QP++ I++S R+FQ Y GV
Sbjct: 234 AVNGVCDGRLKENNKNVMIDGFENLPANDEFALMKAVAHQPVTAVIDSSSREFQLYESGV 293
Query: 284 YDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINK 343
+DG CGT L+HGV VGYG+ G DY +VKNS G WGE GY++M RN P GLCGI
Sbjct: 294 FDGSCGTNLNHGVVVVGYGTENGRDYWLVKNSRGNTWGEAGYMKMARNIANPRGLCGIAM 353
Query: 344 MASYPIK 350
ASYP+K
Sbjct: 354 RASYPLK 360
>gi|357474523|ref|XP_003607546.1| Cysteine proteinase [Medicago truncatula]
gi|358347207|ref|XP_003637651.1| Cysteine proteinase [Medicago truncatula]
gi|355503586|gb|AES84789.1| Cysteine proteinase [Medicago truncatula]
gi|355508601|gb|AES89743.1| Cysteine proteinase [Medicago truncatula]
Length = 345
Score = 310 bits (794), Expect = 7e-82, Method: Compositional matrix adjust.
Identities = 156/304 (51%), Positives = 204/304 (67%), Gaps = 7/304 (2%)
Query: 48 FESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEM 107
F+ W+ + + Y+ DE+ RF I++ N+++I N + +Y L N+FADL +EEF+
Sbjct: 46 FDGWVKRHGRKYKHNDEREVRFGIYQANVQYIQCKNAQKNSYNLTDNKFADLTNEEFQST 105
Query: 108 FLGLKPDLARRKDQSHED-FSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAA 166
++GL L +SH F Y + DLP+S DWRK+GAVT + +QG CG CWAF+ VAA
Sbjct: 106 YMGLSTRL-----RSHNTGFRYDEHGDLPESKDWRKEGAVTEIMDQGQCGGCWAFAAVAA 160
Query: 167 VEGINQIVTGNLASLSEQELIDCD-NTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIM 225
VEGIN+I +G L SLSEQELIDCD + N GC GGLM+ A+ +I+ GGL E+DYPY
Sbjct: 161 VEGINKIKSGKLISLSEQELIDCDVKSGNQGCQGGLMETAYTFIIENGGLTTEQDYPYEG 220
Query: 226 EEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYD 285
+GTC+M K +I+GY +VP ++E L A A+QP+SVAI+A G FQFYS GV+
Sbjct: 221 VDGTCKMEKAAHYAASISGYEEVPADNEAKLKAAAAHQPVSVAIDAGGYSFQFYSEGVFS 280
Query: 286 GHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMA 345
G CG QL+HGV VGYG Y IVKNSWG WGE GYIRMKR+T EG+CGI A
Sbjct: 281 GICGKQLNHGVTVVGYGKETINKYWIVKNSWGADWGESGYIRMKRDTLSKEGMCGIAMQA 340
Query: 346 SYPI 349
SYP+
Sbjct: 341 SYPL 344
>gi|357452075|ref|XP_003596314.1| Cysteine proteinase [Medicago truncatula]
gi|355485362|gb|AES66565.1| Cysteine proteinase [Medicago truncatula]
Length = 341
Score = 310 bits (794), Expect = 8e-82, Method: Compositional matrix adjust.
Identities = 168/317 (52%), Positives = 212/317 (66%), Gaps = 18/317 (5%)
Query: 41 NDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDE-TNRKIKNYWLGLNEFADL 99
ND + ++ E WM + KVY++ EK +RF IFK+N+ +I+ N K+Y LGLN FADL
Sbjct: 32 NDPMYEMHEQWMVQHGKVYKAAHEKQKRFGIFKENVNYIEAFNNVGNKSYKLGLNHFADL 91
Query: 100 RHEEFKEMFLGLKPDLARRKDQSH------EDFSYKDVVDLPKSVDWRKKGAVTHVKNQG 153
+ EF AR K + F YK+V D+P +VDWR++GAVT VKNQG
Sbjct: 92 TNHEFI---------AARNKFNGYLHGSIITTFKYKNVSDVPSAVDWRQEGAVTPVKNQG 142
Query: 154 SCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCD-NTYNNGCNGGLMDYAFQYIVST 212
CG CWAFS VA+ EGI+++ TGNL SLSEQEL+DCD N + GC GGLMD AF++I+
Sbjct: 143 QCGCCWAFSAVASTEGIHKLTTGNLVSLSEQELVDCDTNGEDQGCEGGLMDDAFEFIIQN 202
Query: 213 GGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEAS 272
GL E +YPY +GTC T+ S TI+GY +VP N E +L KA+ANQP+SVAI+AS
Sbjct: 203 NGLSTEAEYPYQGVDGTCNKTEVGSSAATISGYENVPVNDEQALQKAVANQPVSVAIDAS 262
Query: 273 GRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRG-LDYIIVKNSWGPKWGEKGYIRMKRN 331
G DFQFY GV+ G CGT+LDHGVA VGYG +Y +VKNSWG +WGE+GYIRM+R
Sbjct: 263 GSDFQFYKSGVFTGSCGTELDHGVAVVGYGVGEDETEYWLVKNSWGTQWGEEGYIRMQRG 322
Query: 332 TGKPEGLCGINKMASYP 348
EGLCGI SYP
Sbjct: 323 VDASEGLCGIAMQPSYP 339
>gi|449450419|ref|XP_004142960.1| PREDICTED: vignain-like [Cucumis sativus]
Length = 345
Score = 310 bits (793), Expect = 8e-82, Method: Compositional matrix adjust.
Identities = 165/322 (51%), Positives = 217/322 (67%), Gaps = 7/322 (2%)
Query: 33 YSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKNYWLG 92
+ ++L + + L L+E W K + +L EK +RF +FK+N+ H+ N+ K Y L
Sbjct: 26 FDEKELATEESLWQLYERW-GKHHTISRNLKEKHKRFSVFKENVNHVFTVNQMDKPYKLK 84
Query: 93 LNEFADLRHEEFKEMFLGLKPDLARR---KDQSHEDFSYKDVVDLPKSVDWRKKGAVTHV 149
LN+FAD+ + EF + R+ + + F Y+ DLP SVDWR++GAV V
Sbjct: 85 LNKFADMSNYEFVNFYARSNISHYRKLHERRRGAGGFMYEQDTDLPSSVDWRERGAVNAV 144
Query: 150 KNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYI 209
K QG CGSCWAFS+VAAVEGIN+I T L SLSEQEL+DC N N GCNGG M+ AF +I
Sbjct: 145 KEQGRCGSCWAFSSVAAVEGINKIKTNQLLSLSEQELLDC-NYRNKGCNGGFMEIAFDFI 203
Query: 210 VSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAI 269
GG+ E YPY G C ++ S +V I+GY VP+N ED+L++A+ANQP+SVAI
Sbjct: 204 KRNGGIATENSYPYHGSRGLCRSSRISSPIVKIDGYESVPEN-EDALMQAVANQPVSVAI 262
Query: 270 EASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTR-GLDYIIVKNSWGPKWGEKGYIRM 328
+A+GRDFQFYS GV+DG+CGT+L+HGV A+GYG+T G DY +V+NSWG WGE GY+RM
Sbjct: 263 DAAGRDFQFYSQGVFDGYCGTELNHGVVAIGYGTTEDGTDYWLVRNSWGVGWGEDGYVRM 322
Query: 329 KRNTGKPEGLCGINKMASYPIK 350
KR + EGLCGI ASYPIK
Sbjct: 323 KRGVEQAEGLCGIAMEASYPIK 344
>gi|356549192|ref|XP_003542981.1| PREDICTED: cysteine proteinase RD21a-like [Glycine max]
Length = 517
Score = 310 bits (793), Expect = 8e-82, Method: Compositional matrix adjust.
Identities = 165/356 (46%), Positives = 229/356 (64%), Gaps = 12/356 (3%)
Query: 3 LSSQFKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESL 62
L +Q + + + F+ ++SI+ + S + +I+LF+ W + +K+Y S
Sbjct: 5 LKTQLFLLFLVWGSWTFLCYGLPSEYSILALEIDKFPSEEGVIELFQRWKEENKKIYRSP 64
Query: 63 DEKLERFEIFKDNLRHIDETN-RKIKNYW--LGLNEFADLRHEEFKEMFLG-LKPDLARR 118
D++ RFE FK NL++I E N ++I Y LGLN FAD+ +EEFK F +K ++R
Sbjct: 65 DQEKLRFENFKRNLKYIAEKNSKRISPYGQSLGLNRFADMSNEEFKSKFTSKVKKPFSKR 124
Query: 119 KDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNL 178
S +D S +D P S+DWRKKG VT VK+QG CG CWAFS+ A+EGIN IV+G+L
Sbjct: 125 NGLSGKDHSCEDA---PYSLDWRKKGVVTAVKDQGYCGCCWAFSSTGAIEGINAIVSGDL 181
Query: 179 ASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESE 238
SLSE EL+DCD T N+GC+GG MDYAF++++ GG+ E +YPY +GTC + K E++
Sbjct: 182 ISLSEPELVDCDRT-NDGCDGGHMDYAFEWVMHNGGIDTETNYPYSGADGTCNVAKEETK 240
Query: 239 VVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGT---QLDHG 295
V+ I+GY++V Q S+ SLL A QP+S I+ S DFQ Y GG+YDG C + +DH
Sbjct: 241 VIGIDGYYNVEQ-SDRSLLCATVKQPISAGIDGSSWDFQLYIGGIYDGDCSSDPDDIDHA 299
Query: 296 VAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIKK 351
+ VGYGS DY IVKNSWG WG +GYI ++RNT G+C IN MASYP K+
Sbjct: 300 ILVVGYGSEGDEDYWIVKNSWGTSWGMEGYIYIRRNTNLKYGVCAINYMASYPTKE 355
>gi|297733654|emb|CBI14901.3| unnamed protein product [Vitis vinifera]
Length = 273
Score = 309 bits (792), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 152/256 (59%), Positives = 182/256 (71%), Gaps = 4/256 (1%)
Query: 99 LRHEEFKEMFLGLKPD---LARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSC 155
+ + EF+ + G K + + R + F Y+ V +P SVDWRKKGAVT +K+QG C
Sbjct: 1 MTNHEFRSTYAGSKVNHHRMFRGSQHAAGSFMYEKVKSVPPSVDWRKKGAVTPIKDQGQC 60
Query: 156 GSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGL 215
GSCWAFSTV AVEGIN I T L SLSEQEL+DCD + N GCNGGLM YAF++I GG+
Sbjct: 61 GSCWAFSTVVAVEGINHIKTNKLVSLSEQELVDCDTSENQGCNGGLMGYAFEFIKEKGGI 120
Query: 216 HKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRD 275
E+ YPY E+GTC+++K S VV+I+G+ VP N+ED+LLKA ANQP+SVAI+A G
Sbjct: 121 TTEQSYPYTAEDGTCDVSKVNSPVVSIDGHETVPPNNEDALLKAAANQPISVAIDAGGSA 180
Query: 276 FQFYSGGVYDGHCGTQLDHGVAAVGYGST-RGLDYIIVKNSWGPKWGEKGYIRMKRNTGK 334
FQFYS GV+ G CGT LDHGVA VGYG+T G Y IVKNSWG WGE GYIRMKR
Sbjct: 181 FQFYSEGVFAGRCGTDLDHGVAIVGYGTTLDGTKYWIVKNSWGTDWGENGYIRMKRGISA 240
Query: 335 PEGLCGINKMASYPIK 350
EGLCGI ASYPIK
Sbjct: 241 KEGLCGIAVEASYPIK 256
>gi|224079085|ref|XP_002305743.1| predicted protein [Populus trichocarpa]
gi|222848707|gb|EEE86254.1| predicted protein [Populus trichocarpa]
Length = 494
Score = 309 bits (792), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 162/339 (47%), Positives = 220/339 (64%), Gaps = 9/339 (2%)
Query: 20 IRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHI 79
+ SS ++SIVG +L ++ +I++F+ W + +K Y+ +E +RF FK NL++I
Sbjct: 15 VSSSLPSEYSIVGNDFSELPPDESIIEIFQQWRDRHQKAYKHAEEAEKRFGNFKRNLKYI 74
Query: 80 DETNRK--IKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDV--VDLP 135
E K + +GLN+FADL +EEFK+++L + ED S +++ D P
Sbjct: 75 IEKTGKETTLRHRVGLNKFADLSNEEFKQLYLSKVKKPINKTRIDAEDRSRRNLQSCDAP 134
Query: 136 KSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNN 195
S+DWRKKG VT VK+QG CGSCW+FST A+EGIN IVT +L SLSEQEL+DCD T N
Sbjct: 135 SSLDWRKKGVVTAVKDQGDCGSCWSFSTTGAIEGINAIVTSDLISLSEQELVDCDTT-NY 193
Query: 196 GCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDS 255
GC GG MDYAF+++++ GG+ E +YPY +GTC K E +VV+I+GY DV + ++ +
Sbjct: 194 GCEGGYMDYAFEWVINNGGIDTEANYPYTGVDGTCNTAKEEIKVVSIDGYKDVDE-TDSA 252
Query: 256 LLKALANQPLSVAIEASGRDFQFYSGGVY---DGHCGTQLDHGVAAVGYGSTRGLDYIIV 312
LL A A QP+SV I+ S DFQ Y+GG+Y +DH V VGYGS G DY IV
Sbjct: 253 LLCAAAQQPISVGIDGSAIDFQLYTGGIYDGDCSDDPDDIDHAVLIVGYGSENGEDYWIV 312
Query: 313 KNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIKK 351
KNSWG WG +GY +KRNT P G+C IN MASYP K+
Sbjct: 313 KNSWGTSWGIEGYFYIKRNTDLPYGVCAINAMASYPTKE 351
>gi|125551397|gb|EAY97106.1| hypothetical protein OsI_19029 [Oryza sativa Indica Group]
Length = 350
Score = 309 bits (792), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 169/357 (47%), Positives = 215/357 (60%), Gaps = 18/357 (5%)
Query: 1 MALSSQFKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYE 60
M S+ + I CI SS +IV + E L + + E WM++ +VY+
Sbjct: 1 MGAISKPLLLAILCCIVCLYSSSGG---AIVAAARE-LGGDAAMAARHERWMAQHGRVYK 56
Query: 61 SLDEKLERFEIFKDNLRHIDETNRKIKN-YWLGLNEFADLRHEEFKEMFLGLK----PDL 115
EK R E+FK N+ I+ N KN YWLG+N+FADL EEFK K P+
Sbjct: 57 DAAEKARRLEVFKANVAFIESFNAGGKNRYWLGVNQFADLTSEEFKATMTNSKGFSTPNN 116
Query: 116 ARRKDQSHEDFSYKDVVD--LPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQI 173
R F Y++V LP SVDWR KGAVT +K+QG CG CWAFS VAA+EGI ++
Sbjct: 117 GVRVSTG---FKYENVSADALPASVDWRTKGAVTRIKDQGQCGCCWAFSAVAAMEGIVKL 173
Query: 174 VTGNLASLSEQELIDCDNTYNN-GCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEM 232
TG L SLSEQEL+DCD N+ GC GG +D AFQ+I+S GGL E +YPY E+G C+
Sbjct: 174 STGKLISLSEQELVDCDVDGNDQGCEGGEIDGAFQFILSNGGLTAEANYPYTAEDGRCKT 233
Query: 233 TKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQL 292
T +I GY DVP N E SL+KA+A QP+SVA++AS FQFY GGV G CGT L
Sbjct: 234 TAAADVAASIRGYEDVPANDEPSLMKAVAGQPVSVAVDAS--KFQFYGGGVMAGECGTSL 291
Query: 293 DHGVAAVGYGSTR-GLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
DHGV +GYG+ G Y +VKNSWG WGE GY+RM+++ G+CG+ SYP
Sbjct: 292 DHGVTVIGYGAASDGTKYWLVKNSWGTTWGEAGYLRMEKDIDDKRGMCGLAMQPSYP 348
>gi|5823020|gb|AAD53012.1|AF089849_1 senescence-specific cysteine protease [Brassica napus]
Length = 344
Score = 309 bits (791), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 151/303 (49%), Positives = 197/303 (65%), Gaps = 6/303 (1%)
Query: 51 WMSKFEKVYESLDEKLERFEIFKDNLRHIDETN--RKIKNYWLGLNEFADLRHEEFKEMF 108
WM++ +VY +EK R+ +FK N+ I+ N + + L +N+FADL +EEF+ M+
Sbjct: 41 WMTEHGRVYADANEKNNRYAVFKRNVERIERLNDVQSGLTFKLAVNQFADLTNEEFRSMY 100
Query: 109 LGLKPDLARRKDQSHEDFSYKDVVD--LPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAA 166
G K + F Y++V LP SVDWRKKGAVT +K+QG CGSCWAFS VAA
Sbjct: 101 TGFKGNSVLSSRTKPTSFRYQNVSSDALPVSVDWRKKGAVTPIKDQGLCGSCWAFSAVAA 160
Query: 167 VEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIME 226
+EG+ QI G L SLSEQEL+DCD T + GC GGLMD AF Y ++ GGL E +YPY
Sbjct: 161 IEGVAQIKKGKLISLSEQELVDCD-TNDGGCMGGLMDTAFNYTITIGGLTSESNYPYKST 219
Query: 227 EGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDG 286
GTC K + +I G+ DVP N E +L+KA+A+ P+S+ I FQFYS GV+ G
Sbjct: 220 NGTCNFNKTKQIATSIKGFEDVPANDEKALMKAVAHHPVSIGIAGGDIGFQFYSSGVFSG 279
Query: 287 HCGTQLDHGVAAVGYGSTR-GLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMA 345
C T LDHGV AVGYG ++ GL Y I+KNSWGPKWGE+GY+R+K++ G CG+ A
Sbjct: 280 ECTTHLDHGVTAVGYGRSKNGLKYWILKNSWGPKWGERGYMRIKKDIKPKHGQCGLAMNA 339
Query: 346 SYP 348
SYP
Sbjct: 340 SYP 342
>gi|312451845|gb|ADQ85986.1| actinidin [Actinidia chinensis]
Length = 380
Score = 308 bits (790), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 162/354 (45%), Positives = 222/354 (62%), Gaps = 15/354 (4%)
Query: 1 MALSSQFKTILISFCISFFIRSSFARDFSIVGYSPEDLT--SNDKLIDLFESWMSKFEKV 58
M L F ++ + F + I S + ++ ++LT +ND++ ++ESW+ K+ K
Sbjct: 1 MGLPKSFVSMSLLFFSTLLILS--------LAFNAKNLTQRTNDEVKAMYESWLIKYGKS 52
Query: 59 YESLDEKLERFEIFKDNLRHIDETNRKI-KNYWLGLNEFADLRHEEFKEMFLGLKPDLAR 117
Y SL E RFEIFK+ LR IDE N ++Y +GLN+FADL EEF+ +LG +
Sbjct: 53 YNSLGEWERRFEIFKETLRFIDEHNADTNRSYKVGLNQFADLTDEEFRSTYLGFTSGSNK 112
Query: 118 RKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGN 177
K + + V LP VDWR GAV +K+QG CG CWAFS +A VEGIN+IVTG
Sbjct: 113 TKVSNRYEPRVGQV--LPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGV 170
Query: 178 LASLSEQELIDCDNTYNN-GCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGE 236
L SLSEQELIDC T N GCNG + F +I++ GG++ EE+YPY ++G C +
Sbjct: 171 LISLSEQELIDCGRTQNTRGCNGSYITDGFPFIINNGGINTEENYPYTAQDGECNVDLQN 230
Query: 237 SEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGV 296
+ VTI+ Y +VP N+E +L A+ QP+SVA++A+G F+ YS G++ G CGT +DH V
Sbjct: 231 EKYVTIDTYENVPYNNEWALQTAVTYQPVSVALDAAGDAFKQYSSGIFTGPCGTAIDHAV 290
Query: 297 AAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
VGYG+ G+DY IVKNSW WGE+GY+R+ RN G G CGI M SYP+K
Sbjct: 291 TIVGYGTEGGIDYWIVKNSWDTTWGEEGYMRILRNVGG-AGTCGIATMPSYPVK 343
>gi|356515052|ref|XP_003526215.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 339
Score = 308 bits (790), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 150/302 (49%), Positives = 200/302 (66%), Gaps = 4/302 (1%)
Query: 49 ESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRK-IKNYWLGLNEFADLRHEEFKEM 107
E WM+++ K+Y EK +RF+IFK+N++ I+ N K + L +N+FADL +EEFK
Sbjct: 38 EKWMAQYGKLYTDAAEKEKRFQIFKNNVQFIESFNAAGDKPFNLSINQFADLHNEEFKAS 97
Query: 108 FLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAV 167
+ ++ + + + F Y+ + +P ++DWRK+GAVT +K+QG+CGSCWAFSTVAA+
Sbjct: 98 LINVQKKESGVETATETSFRYESITKIPVTMDWRKRGAVTPIKDQGNCGSCWAFSTVAAI 157
Query: 168 EGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEE 227
EGI+QI TG L SLSEQEL+DC + GCN G + AF+++ GGL E YPY
Sbjct: 158 EGIHQITTGKLVSLSEQELVDCVKGKSEGCNFGYKEEAFEFVAKNGGLASEISYPYKANN 217
Query: 228 GTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGH 287
TC + K V I GY +VP NSE +LLKA+ANQP+SV I+A QFYS G++ G
Sbjct: 218 KTCMVKKETQGVAQIKGYENVPSNSEKALLKAVANQPVSVYIDAGA--LQFYSSGIFTGK 275
Query: 288 CGTQLDHGVAAVGYGSTR-GLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMAS 346
CGT +H V +GYG R G Y +VKNSWG KWGEKGYI+MKR+ EGLCGI AS
Sbjct: 276 CGTAPNHAVTVIGYGKARGGAKYWLVKNSWGTKWGEKGYIKMKRDIRAKEGLCGIATNAS 335
Query: 347 YP 348
YP
Sbjct: 336 YP 337
>gi|162459488|ref|NP_001105571.1| maize insect resistance1 precursor [Zea mays]
gi|5731354|gb|AAB70820.2| cysteine protease Mir1 [Zea mays]
Length = 398
Score = 308 bits (789), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 158/330 (47%), Positives = 217/330 (65%), Gaps = 21/330 (6%)
Query: 40 SNDKLIDLFESWMSKFEKVYESLDE--------------KLERFEIFKDNLRHID----E 81
+++++ ++E+W SK + S D+ + R E+F+DNLR+ID E
Sbjct: 46 ADEEVRRMYEAWKSKHGRGGSSNDDCDMAPGDDEQEEEDRRLRLEVFRDNLRYIDAHNAE 105
Query: 82 TNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWR 141
+ + + LGL FADL EE++ LG + R + +S + DLP ++DWR
Sbjct: 106 ADAGLHTFRLGLTPFADLTLEEYRGRVLGFRARGRRSGARYGSGYSVRGG-DLPDAIDWR 164
Query: 142 KKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGL 201
+ GAVT VK+Q CG CWAFS VAA+EG+N I TGNL SLSEQE+IDCD ++GC+GG
Sbjct: 165 QLGAVTEVKDQQQCGGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCD-AQDSGCDGGQ 223
Query: 202 MDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESE-VVTINGYHDVPQNSEDSLLKAL 260
M+ AF++++ GG+ E DYP+I +GTC+ +K ++E V TI+G +V N+E +L +A+
Sbjct: 224 MENAFRFVIGNGGIDTEADYPFIGTDGTCDASKEKNEKVATIDGLVEVASNNETALQEAV 283
Query: 261 ANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKW 320
A QP+SVAI+ASGR FQ YS G+++G CGT LDHGV AVGYGS G DY IVKNSW W
Sbjct: 284 AIQPVSVAIDASGRAFQHYSSGIFNGPCGTSLDHGVTAVGYGSESGKDYWIVKNSWSASW 343
Query: 321 GEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
GE GYIRM+RN +P G CGI ASYP+K
Sbjct: 344 GEAGYIRMRRNVPRPTGKCGIAMDASYPVK 373
>gi|242072572|ref|XP_002446222.1| hypothetical protein SORBIDRAFT_06g005410 [Sorghum bicolor]
gi|241937405|gb|EES10550.1| hypothetical protein SORBIDRAFT_06g005410 [Sorghum bicolor]
Length = 340
Score = 308 bits (789), Expect = 3e-81, Method: Compositional matrix adjust.
Identities = 154/330 (46%), Positives = 209/330 (63%), Gaps = 13/330 (3%)
Query: 28 FSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRK-I 86
F + DL + ++ E WM+++ +VY+ EK +RFE+FK N++ I+ N
Sbjct: 17 FCGAALAARDLNDDSAMVARHEQWMAQYNRVYKDATEKAQRFEVFKANVKFIESFNAGGN 76
Query: 87 KNYWLGLNEFADLRHEEFKEMFL--GLKPDLARRKDQSHEDFSYKDV-VD-LPKSVDWRK 142
+ +WLG+N+FADL ++EF+ G KP + F Y++V VD LP S+DWR
Sbjct: 77 RKFWLGVNQFADLTNDEFRATKTNKGFKPSPVKVP----TGFRYENVSVDALPASIDWRT 132
Query: 143 KGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCD-NTYNNGCNGGL 201
KGAVT +K+QG CG CWAFS VAA EGI +I T L SLSEQEL+DCD + + GC GGL
Sbjct: 133 KGAVTPIKDQGQCGCCWAFSAVAATEGIVKISTDKLISLSEQELVDCDVHGEDQGCEGGL 192
Query: 202 MDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALA 261
MD AF++I+ GGL E YPY +G C+ G + I G+ DVP N E +L+KA+A
Sbjct: 193 MDDAFKFIIKNGGLTTESSYPYTATDGKCK--SGTNSAANIKGFEDVPANDEAALMKAVA 250
Query: 262 NQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTR-GLDYIIVKNSWGPKW 320
NQP+SVA++ FQ YSGGV G CGT LDHG+AA+GYG T G Y ++KNSWG W
Sbjct: 251 NQPVSVAVDGGDMTFQLYSGGVMTGSCGTDLDHGIAAIGYGQTSDGTKYWLLKNSWGTTW 310
Query: 321 GEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
GE GY+RM+++ G+CG+ SYP +
Sbjct: 311 GENGYLRMEKDISDKRGMCGLAMEPSYPTE 340
>gi|159479072|ref|XP_001697622.1| cysteine endopeptidase [Chlamydomonas reinhardtii]
gi|158274232|gb|EDP00016.1| cysteine endopeptidase [Chlamydomonas reinhardtii]
Length = 469
Score = 308 bits (789), Expect = 3e-81, Method: Compositional matrix adjust.
Identities = 153/320 (47%), Positives = 204/320 (63%), Gaps = 7/320 (2%)
Query: 38 LTSNDKLIDLFESWMSKFEKVY-ESLDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEF 96
L + + F+ W + Y + E RF+++ +NL ++ N + ++WL LN
Sbjct: 3 LEAQANPLGAFKEWAQTHSRSYVNDVAEFENRFKVWLENLEYVLAYNARTTSHWLTLNHL 62
Query: 97 ADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDV--VDLPKSVDWRKKGAVTHVKNQGS 154
ADL E+K LG +++ F Y+DV LP ++DWRKK AV VKNQG
Sbjct: 63 ADLSTPEYKSKLLGFDNQARVARNKLKTGFRYEDVDAEALPPAIDWRKKNAVAEVKNQGQ 122
Query: 155 CGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGG 214
CGSCWAF+T +VEGIN IVTG+L SLSEQEL+DCD + GC+GGLMDYA+ +I+ G
Sbjct: 123 CGSCWAFATTGSVEGINAIVTGSLVSLSEQELVDCDTEQDKGCSGGLMDYAYAWIIKNKG 182
Query: 215 LHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGR 274
++ EEDYPY +G C++ K + VVTI+ Y DVP+N E +L KA A+QP++VAIEA +
Sbjct: 183 INTEEDYPYTAMDGQCDVAKMKRRVVTIDSYEDVPENDEVALKKAAAHQPVAVAIEADAK 242
Query: 275 DFQFYSGGVY-DGHCGTQLDHGVAAVGYG---STRGLDYIIVKNSWGPKWGEKGYIRMKR 330
FQ Y GGVY D CGT L+HGV VGYG + G +Y IVKNSWG +WG+ GYIR+K
Sbjct: 243 SFQLYGGGVYDDPTCGTSLNHGVLVVGYGKDVTGSGSNYWIVKNSWGAEWGDAGYIRLKM 302
Query: 331 NTGKPEGLCGINKMASYPIK 350
+ EGLCGI SYP+K
Sbjct: 303 GSTDAEGLCGIAMAPSYPVK 322
>gi|414588007|tpg|DAA38578.1| TPA: hypothetical protein ZEAMMB73_159244 [Zea mays]
Length = 307
Score = 308 bits (788), Expect = 3e-81, Method: Compositional matrix adjust.
Identities = 150/310 (48%), Positives = 203/310 (65%), Gaps = 6/310 (1%)
Query: 44 LIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKN-YWLGLNEFADLRHE 102
+ + E WM+++++VY+ EK RFE+FKDN ++ N KN +WLG+N+FADL E
Sbjct: 1 MAERHERWMAEYDRVYKDAAEKARRFEVFKDNFAFVESFNADKKNKFWLGVNQFADLTTE 60
Query: 103 EFKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFS 162
EFK G KP A + + V LP +VDWR KGAVT +KNQG CG CWAFS
Sbjct: 61 EFKAN-KGFKPISAEEVPTTGFKYENLSVSALPTAVDWRTKGAVTPIKNQGQCGCCWAFS 119
Query: 163 TVAAVEGINQIVTGNLASLSEQELIDCD-NTYNNGCNGGLMDYAFQYIVSTGGLHKEEDY 221
+AA+EGI ++ TGNL SLSEQE +DCD + + GC GG MD AF++++ GGL E Y
Sbjct: 120 AIAAMEGIVKLSTGNLVSLSEQEPVDCDTHNMDEGCEGGWMDNAFEFVIKNGGLATESSY 179
Query: 222 PYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSG 281
PY + +G C+ G TI G+ DVP N+E +L+K +A+QP+SVA++AS R F YSG
Sbjct: 180 PYKVVDGKCK--GGSKSAATIKGHEDVPPNNEAALMKVVASQPVSVAVDASDRTFMLYSG 237
Query: 282 GVYDGHCGTQLDHGVAAVGYG-STRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCG 340
GV G CGTQLDHG+AA+GYG + Y I+KNSWG WGEKG++RM+++ G+C
Sbjct: 238 GVMTGSCGTQLDHGIAAIGYGVESDDTKYWILKNSWGTTWGEKGFLRMEKDISDKRGMCD 297
Query: 341 INKMASYPIK 350
+ SYP +
Sbjct: 298 LAMKPSYPTE 307
>gi|357154164|ref|XP_003576692.1| PREDICTED: vignain-like [Brachypodium distachyon]
Length = 427
Score = 308 bits (788), Expect = 3e-81, Method: Compositional matrix adjust.
Identities = 160/309 (51%), Positives = 200/309 (64%), Gaps = 8/309 (2%)
Query: 48 FESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFK-E 106
FE WM K + Y + EK RFE++K+NL I+E N Y L N+FADL +EEF+ +
Sbjct: 119 FEQWMGKHGRAYANGGEKQRRFEVYKENLALIEEFNSGGHGYTLTDNKFADLTNEEFRAK 178
Query: 107 MFLGLKPDLARRKDQSHEDFSYK-----DVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAF 161
M GL D RR+ H + + + DLPK VDWRKKGAV VKNQGSCGSCWAF
Sbjct: 179 MLGGLGADPDRRRRARHASNALELPGNDNSTDLPKDVDWRKKGAVVEVKNQGSCGSCWAF 238
Query: 162 STVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDY 221
S VAA+EG+NQI G L SLSEQEL+DCD GC GG M +AF+++++ GL E Y
Sbjct: 239 SAVAAMEGLNQIKNGKLVSLSEQELVDCD-AEAVGCAGGFMSWAFEFVMANHGLTTEASY 297
Query: 222 PYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSG 281
PY G C+ K V+I GY +V NSE LLK A QP+SVA++A G FQ Y+G
Sbjct: 298 PYKGINGACQTAKLNESSVSITGYVNVTVNSEAELLKVAAVQPVSVAVDAGGFLFQLYAG 357
Query: 282 GVYDGHCGTQLDHGVAAVGYGST-RGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCG 340
GV+ G C Q++HGV VGYG T + Y IVKNSWGP+WGE GY+ M+R+ G P GLCG
Sbjct: 358 GVFSGPCTAQINHGVTVVGYGETDKAEKYWIVKNSWGPEWGEAGYMLMQRDAGVPTGLCG 417
Query: 341 INKMASYPI 349
I +ASYP+
Sbjct: 418 IAMLASYPV 426
>gi|302143415|emb|CBI21976.3| unnamed protein product [Vitis vinifera]
Length = 322
Score = 307 bits (787), Expect = 4e-81, Method: Compositional matrix adjust.
Identities = 160/353 (45%), Positives = 220/353 (62%), Gaps = 38/353 (10%)
Query: 1 MALSSQFKTILIS--FCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKV 58
MA +Q++ I ++ F ++ + + AR+ + + E WM+++ +V
Sbjct: 1 MASVNQYQYICLALLFVLAAWASQATARNLH-----------EASMYERHEDWMAQYGRV 49
Query: 59 YESLDEKLERFEIFKDNLRHIDETNRKI-KNYWLGLNEFADLRHEEFKEMFLGLKPDLAR 117
Y+ DEK +R++IFKDN+ I+ N+ + K+Y L +NEFADL +EEF K +
Sbjct: 50 YKDADEKSKRYKIFKDNVARIESFNKAMDKSYKLSINEFADLTNEEFGTSRNRFKAHICS 109
Query: 118 RKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGN 177
+ S F Y++V +P ++DWRKKGAVT +K+QG CGSCWAFS VAA+EGI Q+ TG
Sbjct: 110 TEATS---FKYENVTAVPSTIDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGK 166
Query: 178 LASLSEQELIDCDNT-YNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGE 236
L SLSEQEL+DCD + + GCNG +YPY +GTC K
Sbjct: 167 LISLSEQELVDCDTSGEDQGCNGA-------------------NYPYAGTDGTCNRKKAA 207
Query: 237 SEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGV 296
INGY DVP N+E +L KA+ +QP++VAI+A G +FQFYS GV+ G CGT+LDHGV
Sbjct: 208 HPAAKINGYEDVPANNEKALQKAVVHQPIAVAIDAGGFEFQFYSSGVFTGQCGTELDHGV 267
Query: 297 AAVGYGST-RGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
AAVGYG++ G+ Y +VKNSWG WGE+GYIRM+R+ EGLCGI ASYP
Sbjct: 268 AAVGYGTSDDGMKYWLVKNSWGTGWGEEGYIRMQRDVTAKEGLCGIAMQASYP 320
>gi|77554625|gb|ABA97421.1| Vignain precursor, putative [Oryza sativa Japonica Group]
gi|222630746|gb|EEE62878.1| hypothetical protein OsJ_17681 [Oryza sativa Japonica Group]
Length = 350
Score = 307 bits (786), Expect = 5e-81, Method: Compositional matrix adjust.
Identities = 168/359 (46%), Positives = 215/359 (59%), Gaps = 18/359 (5%)
Query: 1 MALSSQFKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYE 60
M S+ + I CI SS +IV + E L + + E WM++ +VY+
Sbjct: 1 MGAISKPLLLAILCCIVCLYSSSGG---AIVAAARE-LGGDAAMAARHERWMAQHGRVYK 56
Query: 61 SLDEKLERFEIFKDNLRHIDETNRKIKN-YWLGLNEFADLRHEEFKEMFLGLK----PDL 115
EK R E+FK N+ I+ N KN YWLG+N+FADL EEFK K P+
Sbjct: 57 DAAEKARRLEVFKANVAFIESFNAGGKNRYWLGVNQFADLTSEEFKATMTNSKGFSTPNN 116
Query: 116 ARRKDQSHEDFSYKDVVD--LPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQI 173
R F Y++V LP SVDWR KGAVT +K+QG CG CWAFS VAA+EG ++
Sbjct: 117 GVRVSTG---FKYENVSADALPASVDWRTKGAVTRIKDQGQCGCCWAFSAVAAMEGFVKL 173
Query: 174 VTGNLASLSEQELIDCDNTYNN-GCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEM 232
TG L SLSEQEL+DCD N+ GC GG +D AFQ+I+S GGL E +YPY E+G C+
Sbjct: 174 STGKLISLSEQELVDCDVDGNDQGCEGGEIDGAFQFILSNGGLTAEANYPYTAEDGRCKT 233
Query: 233 TKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQL 292
T +I GY DVP N E SL+KA+A QP+SVA++AS FQFY GGV G CGT L
Sbjct: 234 TAAADVAASIRGYEDVPANDEPSLMKAVAGQPVSVAVDAS--KFQFYGGGVMAGECGTSL 291
Query: 293 DHGVAAVGYGSTR-GLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
DHGV +GYG+ G Y +VKNSWG WGE GY+RM+++ G+CG+ SYP +
Sbjct: 292 DHGVTVIGYGAASDGTKYWLVKNSWGTTWGEAGYLRMEKDIDDKRGMCGLAMQPSYPTE 350
>gi|224083868|ref|XP_002307151.1| predicted protein [Populus trichocarpa]
gi|222856600|gb|EEE94147.1| predicted protein [Populus trichocarpa]
Length = 298
Score = 307 bits (786), Expect = 6e-81, Method: Compositional matrix adjust.
Identities = 155/308 (50%), Positives = 204/308 (66%), Gaps = 13/308 (4%)
Query: 44 LIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKI-KNYWLGLNEFADLRHE 102
+ + E WM+++ +VY+ EK R+ IFK+N+ ID N + K+Y LG+N+FADL +E
Sbjct: 1 MYERHEQWMAQYGRVYKDDAEKETRYNIFKENVARIDAFNSQTGKSYNLGVNQFADLSNE 60
Query: 103 EFKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFS 162
EFK K + + F Y++V +P ++DWRKKGAVT VK+QG C
Sbjct: 61 EFKASRNRFKGHMCSPQAGP---FRYENVSAVPATMDWRKKGAVTPVKDQGQC------- 110
Query: 163 TVAAVEGINQIVTGNLASLSEQELIDCDNT-YNNGCNGGLMDYAFQYIVSTGGLHKEEDY 221
VAA+EGINQ+ TG L SLSEQE++DCD + GCNGGLMD AF++I GL E +Y
Sbjct: 111 -VAAMEGINQLTTGKLISLSEQEVVDCDTKGEDQGCNGGLMDDAFKFIEQNKGLTTEANY 169
Query: 222 PYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSG 281
PY +GTC K S I G+ DVP NSE +L+KA+A QP+SVAI+A G +FQFYS
Sbjct: 170 PYTGTDGTCNTQKEVSHAAKITGFQDVPANSEAALMKAVAKQPVSVAIDAGGFEFQFYSS 229
Query: 282 GVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGI 341
G++ G CGT+LDHGV AVGYG + G Y +VKNSWG +WGE+GYIRM+++ EGLCGI
Sbjct: 230 GIFTGSCGTELDHGVTAVGYGGSDGTKYWLVKNSWGAQWGEEGYIRMQKDISAKEGLCGI 289
Query: 342 NKMASYPI 349
ASYP
Sbjct: 290 AMQASYPT 297
>gi|18390634|ref|NP_563764.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|8844131|gb|AAF80223.1|AC025290_12 Contains similarity to a cysteine endopeptidase 1 from Phaseolus
vulgaris gb|U52970 and is a member of the papain
cysteine protease family PF|00112 [Arabidopsis thaliana]
gi|332189848|gb|AEE27969.1| cysteine proteinase-like protein [Arabidopsis thaliana]
Length = 343
Score = 306 bits (785), Expect = 7e-81, Method: Compositional matrix adjust.
Identities = 164/343 (47%), Positives = 208/343 (60%), Gaps = 11/343 (3%)
Query: 9 TILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLER 68
T+ + C + D S+ Y P + L FE W+ K+Y DE + R
Sbjct: 11 TLAVLICFVLIASKLCSVDSSV--YDP-----HKTLKQRFEKWLKTHSKLYGGRDEWMLR 63
Query: 69 FEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSY 128
F I++ N++ ID N + L N FAD+ + EFK FLGL R +
Sbjct: 64 FGIYQSNVQLIDYINSLHLPFKLTDNRFADMTNSEFKAHFLGLNTSSLRLHKKQRP--VC 121
Query: 129 KDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELID 188
++P +VDWR +GAVT ++NQG CG CWAFS VAA+EGIN+I TGNL SLSEQ+LID
Sbjct: 122 DPAGNVPDAVDWRTQGAVTPIRNQGKCGGCWAFSAVAAIEGINKIKTGNLVSLSEQQLID 181
Query: 189 CD-NTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHD 247
CD TYN GC+GGLM+ AF++I + GGL E DYPY EGTC+ K +++VVTI GY
Sbjct: 182 CDVGTYNKGCSGGLMETAFEFIKTNGGLATETDYPYTGIEGTCDQEKSKNKVVTIQGYQK 241
Query: 248 VPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGL 307
V QN E SL A A QP+SV I+A G FQ YS GV+ +CGT L+HGV VGYG
Sbjct: 242 VAQN-EASLQIAAAQQPVSVGIDAGGFIFQLYSSGVFTNYCGTNLNHGVTVVGYGVEGDQ 300
Query: 308 DYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
Y IVKNSWG WGE+GYIRM+R + G CGI MASYP++
Sbjct: 301 KYWIVKNSWGTGWGEEGYIRMERGVSEDTGKCGIAMMASYPLQ 343
>gi|125604306|gb|EAZ43631.1| hypothetical protein OsJ_28254 [Oryza sativa Japonica Group]
Length = 369
Score = 306 bits (785), Expect = 7e-81, Method: Compositional matrix adjust.
Identities = 167/320 (52%), Positives = 204/320 (63%), Gaps = 16/320 (5%)
Query: 33 YSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKNYWLG 92
+ +D+ S + L +L+E W + +V L EK RF +FKDN+R I E NR+ + Y L
Sbjct: 33 FGDKDVASEEALWELYERWRGQ-HRVARDLGEKARRFNVFKDNVRLIHEFNRRDEPYKLR 91
Query: 93 LNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQ 152
LN F D+ +E + A + H F + R GAV VK+Q
Sbjct: 92 LNRFGDMTADESAGAY-------ASSRVSHHRMFRGRG------EKAQRLHGAVGAVKDQ 138
Query: 153 GSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDN-TYNNGCNGGLMDYAFQYIVS 211
G CGSCWAFST+AAVEGIN I T NL +LSEQ+L+DCD T N GC+GGLMD AFQYI
Sbjct: 139 GQCGSCWAFSTIAAVEGINAIRTSNLTALSEQQLVDCDTKTGNAGCDGGLMDNAFQYIAK 198
Query: 212 TGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEA 271
GG+ YPY + +C+ + S VTI+GY DVP NSE +L KA+ANQP+SVAIEA
Sbjct: 199 HGGVAASSAYPYRARQSSCKSSAASSPAVTIDGYEDVPANSESALKKAVANQPVSVAIEA 258
Query: 272 SGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGST-RGLDYIIVKNSWGPKWGEKGYIRMKR 330
G FQFYS GV+ G CGT+LDHGVAAVGYG+T G Y IV+NSWG WGEKGYIRMKR
Sbjct: 259 GGSHFQFYSEGVFAGKCGTELDHGVAAVGYGTTVDGTKYWIVRNSWGADWGEKGYIRMKR 318
Query: 331 NTGKPEGLCGINKMASYPIK 350
+ EGLCGI ASYPIK
Sbjct: 319 DVSAKEGLCGIAMEASYPIK 338
>gi|297819568|ref|XP_002877667.1| hypothetical protein ARALYDRAFT_348033 [Arabidopsis lyrata subsp.
lyrata]
gi|297323505|gb|EFH53926.1| hypothetical protein ARALYDRAFT_348033 [Arabidopsis lyrata subsp.
lyrata]
Length = 341
Score = 306 bits (785), Expect = 8e-81, Method: Compositional matrix adjust.
Identities = 156/312 (50%), Positives = 208/312 (66%), Gaps = 10/312 (3%)
Query: 45 IDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKI-KNYWLGLNEFADLRHEE 103
I+ E WMS+F +VY EK RFEIFK NL+ ++ N K Y L +NEF+DL EE
Sbjct: 32 IEKHEQWMSRFHRVYSDDSEKTSRFEIFKKNLKFVESFNMNTNKTYTLDVNEFSDLTDEE 91
Query: 104 FKEMFLGLK-PDLARR--KDQSHE--DFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSC 158
FK + GL P+ R SHE F Y++V + +S+DWR++GAVT VK+Q CG C
Sbjct: 92 FKARYTGLVVPEGMTRMSTTDSHETVSFRYENVGETGESMDWREEGAVTSVKHQQQCGCC 151
Query: 159 WAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKE 218
WAFS VAAVEG+ +I G L SLSEQ+L+DC +T N+GC+GG+M AF YIV G+ E
Sbjct: 152 WAFSAVAAVEGMTKIAKGELVSLSEQQLLDC-STENDGCDGGIMWKAFDYIVENQGITAE 210
Query: 219 EDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQF 278
++YPY + TCE TI+GY VPQN E++LLKA++ QP+SVAIE SG +F
Sbjct: 211 DNYPYQGAQQTCE--SNHVAAATISGYETVPQNDEEALLKAVSQQPVSVAIEGSGYEFIH 268
Query: 279 YSGGVYDGHCGTQLDHGVAAVGYG-STRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEG 337
YSGG+++G CGT L+H V VGYG S G+ Y ++KNSWG WGE GY+R+ R+ P+G
Sbjct: 269 YSGGIFNGECGTHLNHAVTIVGYGVSEEGIKYWLLKNSWGESWGEDGYMRIMRDVDAPQG 328
Query: 338 LCGINKMASYPI 349
+CG+ +A YP+
Sbjct: 329 MCGLASLAYYPV 340
>gi|358345461|ref|XP_003636796.1| Cysteine proteinase [Medicago truncatula]
gi|355502731|gb|AES83934.1| Cysteine proteinase [Medicago truncatula]
Length = 475
Score = 306 bits (785), Expect = 8e-81, Method: Compositional matrix adjust.
Identities = 161/343 (46%), Positives = 221/343 (64%), Gaps = 26/343 (7%)
Query: 16 ISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDN 75
++F ++SI+ + S +++++LF+ W + +K Y +E R E FK N
Sbjct: 19 LTFLSCYGIPSEYSILAFDLNKFPSEEQVVELFQQWKKEHQKFYIHPEEAALRLENFKRN 78
Query: 76 LRHIDETNRKIKN----YWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDV 131
L++I E N ++N + LGLN FAD+ +EEFK F+ K +S +D
Sbjct: 79 LKYIVERN-AMRNSPVGHHLGLNRFADMSNEEFKNKFI--------SKVESCDD------ 123
Query: 132 VDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDN 191
P S+DWRKKG VT VK+QG+CGSCW+FS+ A+EG+N IVTG+L SLSEQEL+DCD
Sbjct: 124 --APYSLDWRKKGVVTGVKDQGNCGSCWSFSSTGAIEGVNAIVTGDLISLSEQELVDCDT 181
Query: 192 TYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQN 251
T N+GC GG MDYAF+++++ GG+ E DYPYI GTC +TK E++VVTI+GY DV Q
Sbjct: 182 T-NDGCEGGYMDYAFEWVINNGGIDTEADYPYIGVGGTCNVTKEETKVVTIDGYTDVTQ- 239
Query: 252 SEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQ---LDHGVAAVGYGSTRGLD 308
S+ +L A QP+SV I+ S DFQ Y+GG+YDG C + +DH V VGYGS D
Sbjct: 240 SDSALFCATVKQPISVGIDGSTLDFQLYTGGIYDGDCSSNPDDIDHAVLIVGYGSDGNQD 299
Query: 309 YIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIKK 351
Y IVKNSWG WG +G+I ++RNT G+C IN MAS+P K+
Sbjct: 300 YWIVKNSWGTSWGIEGFIYIRRNTNLKYGVCAINYMASFPTKE 342
>gi|356543114|ref|XP_003540008.1| PREDICTED: LOW QUALITY PROTEIN: KDEL-tailed cysteine endopeptidase
CEP1-like [Glycine max]
Length = 343
Score = 306 bits (784), Expect = 8e-81, Method: Compositional matrix adjust.
Identities = 158/309 (51%), Positives = 201/309 (65%), Gaps = 5/309 (1%)
Query: 44 LIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRK-IKNYWLGLNEFADLRHE 102
+ + E WM K+ KVY+ E +RF IF++N+ I+ N K Y L +N AD +E
Sbjct: 34 MYERHEQWMEKYGKVYKDSAEMQKRFLIFENNVEFIESFNAAGNKPYKLSINHLADQTNE 93
Query: 103 EFKEMFLGLK-PDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAF 161
EF G K + + F Y++V D+P +VDWR+KG VT +K+Q CG+CWAF
Sbjct: 94 EFMASHKGYKGSHWQGLRITTQTPFKYENVTDIPWAVDWRQKGDVTSIKDQAQCGNCWAF 153
Query: 162 STVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDY 221
S VAA EGI QI TGNL SLSE+EL+DCD+ ++GC+GGLM++ F++I+ GG+ E +Y
Sbjct: 154 SAVAATEGIYQITTGNLVSLSEKELVDCDSV-DHGCDGGLMEHGFEFIIKNGGISSEANY 212
Query: 222 PYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQ-PLSVAIEASGRDFQFYS 280
PY GTC+ K S V I GY VP N E+ L KA+ANQ +SV+I+A G FQFY
Sbjct: 213 PYTAVNGTCDTNKEASPVAQITGYETVPVNCEEELQKAVANQLTMSVSIDAGGSAFQFYP 272
Query: 281 GGVYDGHCGTQLDHGVAAVGYGSTR-GLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLC 339
GV+ G CGTQLDHGV AVGYGST G Y IVKNSWG +WGE+GYIRM R EGLC
Sbjct: 273 SGVFTGQCGTQLDHGVTAVGYGSTDYGTQYWIVKNSWGTQWGEEGYIRMLRGIDAQEGLC 332
Query: 340 GINKMASYP 348
GI ASYP
Sbjct: 333 GIAMDASYP 341
>gi|356515038|ref|XP_003526208.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 339
Score = 306 bits (784), Expect = 9e-81, Method: Compositional matrix adjust.
Identities = 149/302 (49%), Positives = 198/302 (65%), Gaps = 4/302 (1%)
Query: 49 ESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRK-IKNYWLGLNEFADLRHEEFKEM 107
E WM+++ K+Y EK +RF+IFK+N++ I+ N K + L +N+FADL +EEFK
Sbjct: 38 EKWMAQYGKLYTDAAEKEKRFQIFKNNVQFIESFNAAGDKPFNLSINQFADLHNEEFKAS 97
Query: 108 FLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAV 167
+ ++ + + + F Y+ + +P ++DWRK+GAVT +K+QG+CGSCWAFS VAA+
Sbjct: 98 LINVQKKESGVETATETSFRYESITKIPVTMDWRKRGAVTPIKDQGNCGSCWAFSIVAAI 157
Query: 168 EGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEE 227
EGI+QI TG L SLSEQEL+DC + GCN G + AF+++ GGL E YPY
Sbjct: 158 EGIHQITTGKLVSLSEQELVDCVKGKSEGCNFGYKEEAFEFVAKNGGLASEISYPYKANN 217
Query: 228 GTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGH 287
TC + K V I GY +VP NSE +LLKA+ANQP+SV I+A QFYS G++ G
Sbjct: 218 KTCMVKKETQGVAQIKGYENVPSNSEKALLKAVANQPVSVYIDAGA--LQFYSSGIFTGK 275
Query: 288 CGTQLDHGVAAVGYGSTR-GLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMAS 346
CGT +H +GYG R G Y +VKNSWG KWGEKGYIRMKR+ EGLCGI AS
Sbjct: 276 CGTAPNHAATVIGYGKARGGAKYWLVKNSWGTKWGEKGYIRMKRDIRAKEGLCGIATNAS 335
Query: 347 YP 348
YP
Sbjct: 336 YP 337
>gi|224116880|ref|XP_002317417.1| predicted protein [Populus trichocarpa]
gi|118488173|gb|ABK95906.1| unknown [Populus trichocarpa]
gi|222860482|gb|EEE98029.1| predicted protein [Populus trichocarpa]
Length = 498
Score = 306 bits (783), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 159/336 (47%), Positives = 211/336 (62%), Gaps = 11/336 (3%)
Query: 22 SSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDE 81
S ++S V + + + + ++F+ W K +KVY+ +E R FK NL++I E
Sbjct: 24 SGLPGEYSAVSNDLHEGLTEEGITEVFKLWKEKHQKVYKHAEEAERRIGNFKRNLKYIIE 83
Query: 82 TNRKIKN---YWLGLNEFADLRHEEFKEMFLG-LKPDLARRKDQSHEDFSYKDVVDLPKS 137
N K K+ + +GLN+FADL +EEF+EM+L +K + + + H D P S
Sbjct: 84 KNGKRKSGLEHKVGLNKFADLSNEEFREMYLSKVKKPITIEEKRKHRHLQ---TCDAPSS 140
Query: 138 VDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGC 197
+DWR KG VT VK+QG CGSCW+FST A+E IN IVTG+L SLSEQEL+DCD T N GC
Sbjct: 141 LDWRNKGVVTAVKDQGDCGSCWSFSTTGAIEAINAIVTGDLISLSEQELVDCDTTNNYGC 200
Query: 198 NGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLL 257
GG MD AFQ+++ GG+ E DYPY +GTC K E +VV+I GY DV S+ +LL
Sbjct: 201 EGGDMDSAFQWVIGNGGIDTEADYPYTGVDGTCNTAKEEKKVVSIEGYVDV-DPSDSALL 259
Query: 258 KALANQPLSVAIEASGRDFQFYSGGVYDGHCG---TQLDHGVAAVGYGSTRGLDYIIVKN 314
A QP+SV ++ S DFQ Y+GG+YDG C +DH + VGYGS DY IVKN
Sbjct: 260 CATVQQPISVGMDGSALDFQLYTGGIYDGDCSGDPNDIDHAILIVGYGSENDEDYWIVKN 319
Query: 315 SWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
SWG +WG +GY ++RNT KP G+C IN ASYP K
Sbjct: 320 SWGTEWGMEGYFYIRRNTSKPYGVCAINADASYPTK 355
>gi|357160572|ref|XP_003578808.1| PREDICTED: vignain-like [Brachypodium distachyon]
Length = 339
Score = 306 bits (783), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 148/319 (46%), Positives = 208/319 (65%), Gaps = 8/319 (2%)
Query: 34 SPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKNYWLGL 93
+ +L + ++ E+WM ++ +VY+ EK ++FE+FK N I+ N +WLG+
Sbjct: 23 AARELNDDLSMVARHENWMLQYGRVYKDAAEKAQKFEVFKANAEFINSFNAGNHKFWLGI 82
Query: 94 NEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVV--DLPKSVDWRKKGAVTHVKN 151
N+FAD+ +EEFK K + F Y+++ LP ++DWR KGAVT +K+
Sbjct: 83 NQFADITNEEFKAT--KTNKGFISNKVRVPTGFMYENMSFDALPATIDWRTKGAVTPIKD 140
Query: 152 QGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCD-NTYNNGCNGGLMDYAFQYIV 210
QG CG CWAFS VAA+EGI ++ TG L SLSEQEL+DCD + + GC GGLMD AF++I+
Sbjct: 141 QGQCGCCWAFSAVAAMEGIVKLSTGKLVSLSEQELVDCDVHGEDQGCEGGLMDDAFKFII 200
Query: 211 STGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIE 270
GGL +E +YPY +G C+ G S TI Y DVP N+E +L+KA+ANQP+SVA++
Sbjct: 201 KNGGLTQESNYPYDAADGKCK--SGSSSAATIKSYEDVPANNEGALMKAVANQPVSVAVD 258
Query: 271 ASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTR-GLDYIIVKNSWGPKWGEKGYIRMK 329
FQFYSGGV G CGT LDHG+AA+GYG+T G + I+KNSWG WGE G++RM+
Sbjct: 259 GGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYGTTSDGTKFWIMKNSWGTSWGENGFLRME 318
Query: 330 RNTGKPEGLCGINKMASYP 348
++ +G+CG+ SYP
Sbjct: 319 KDIADKKGMCGLAMEPSYP 337
>gi|297843430|ref|XP_002889596.1| hypothetical protein ARALYDRAFT_887827 [Arabidopsis lyrata subsp.
lyrata]
gi|297335438|gb|EFH65855.1| hypothetical protein ARALYDRAFT_887827 [Arabidopsis lyrata subsp.
lyrata]
Length = 343
Score = 306 bits (783), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 163/343 (47%), Positives = 209/343 (60%), Gaps = 11/343 (3%)
Query: 9 TILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLER 68
T+++ C + + S+ Y P + L FE W+ K+Y DE + R
Sbjct: 11 TLVVLICFVLIASKLCSVNSSV--YDP-----HKTLKQRFEKWLKTHSKLYGGRDEWMLR 63
Query: 69 FEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSY 128
F I++ N++ ID N + L N FAD+ + EFK FLGL R +
Sbjct: 64 FGIYQSNVQLIDYINSLHLPFKLTDNRFADMTNSEFKAHFLGLNTSSLRLHKKQRP--VC 121
Query: 129 KDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELID 188
++P +VDWR +GAVT ++NQG CG CWAFS VAA+EGIN+I TGNL SLSEQ+LID
Sbjct: 122 DPAGNVPDAVDWRTQGAVTPIRNQGKCGGCWAFSAVAAIEGINKIKTGNLVSLSEQQLID 181
Query: 189 CD-NTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHD 247
CD TYN GC+GGLM+ AF++I S GGL E DYPY EGTC+ K +++VVTI GY
Sbjct: 182 CDVGTYNKGCSGGLMETAFEFIKSNGGLTTETDYPYTGIEGTCDQEKAKNKVVTIQGYQK 241
Query: 248 VPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGL 307
V QN E SL A A QP+SV I+A G FQ YS GV+ +CGT L+HGV VGYG
Sbjct: 242 VAQN-EASLQIAAAQQPVSVGIDAGGFIFQLYSSGVFTSYCGTNLNHGVTVVGYGVEGDQ 300
Query: 308 DYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
Y IVKNSWG WGE+GYIRM+R + G CGI +ASYP++
Sbjct: 301 KYWIVKNSWGTGWGEEGYIRMERGISEDTGKCGIAMLASYPLQ 343
>gi|359359068|gb|AEV40975.1| putative cysteine protease [Oryza punctata]
Length = 464
Score = 305 bits (782), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 162/297 (54%), Positives = 197/297 (66%), Gaps = 9/297 (3%)
Query: 62 LDEKLERFEIFKDNLRHIDETNRKIKN---YWLGLNEFADLRHEEFKEMFLGLKPDLARR 118
+ E RF +F DNL+ +D N + LG+N FADL ++EF+ +LG P A R
Sbjct: 84 VGEYERRFRVFWDNLKFVDAHNAHADGHGGFRLGMNRFADLTNDEFRAAYLGTTP--AGR 141
Query: 119 KDQSHEDFSYKDVVDLPKSVDWRKKGAV-THVKNQGSCGSCWAFSTVAAVEGINQIVTGN 177
E + + V LP SVDWR KGAV + VKNQG CGSCWAFS VAAVEGIN+IVTG
Sbjct: 142 GRHVGEMYRHDGVEALPDSVDWRDKGAVVSPVKNQGQCGSCWAFSAVAAVEGINKIVTGE 201
Query: 178 LASLSEQELIDCDNTYNNGCNGG-LMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGE 236
L SLSEQEL++C N G +MD AF +I GGL EEDYPY +G C++ K
Sbjct: 202 LVSLSEQELVECARNGGNSGCNGGIMDDAFAFITRNGGLDTEEDYPYTAMDGKCDLAKKS 261
Query: 237 SEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGV 296
+VV+I+G+ DVP+N E SL KA+A+QP+SVAI+A GR+FQ Y GV+ G CGT LDHGV
Sbjct: 262 RKVVSIDGFEDVPENDELSLQKAVAHQPVSVAIDAGGREFQLYDSGVFTGRCGTSLDHGV 321
Query: 297 AAVGYGS--TRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIKK 351
AVGYG+ G DY V+NSWGP WGE GYIRM+RN G CGI MASYPIKK
Sbjct: 322 VAVGYGTDAATGTDYWTVRNSWGPDWGENGYIRMERNVTARTGKCGIAMMASYPIKK 378
>gi|255563110|ref|XP_002522559.1| cysteine protease, putative [Ricinus communis]
gi|223538250|gb|EEF39859.1| cysteine protease, putative [Ricinus communis]
Length = 344
Score = 305 bits (781), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 150/304 (49%), Positives = 202/304 (66%), Gaps = 4/304 (1%)
Query: 49 ESWMSKFEKVYESLDEKLERFEIFKDNLRHIDE-TNRKIKNYWLGLNEFADLRHEEFKEM 107
++WM+++ +VY+ EK +RF+IFK+N+ I+ N K Y LG+N F DL +EEF+
Sbjct: 39 KTWMTQYGRVYKGNVEKEKRFKIFKENVEFIESFNNNGNKPYKLGINAFTDLTNEEFRAS 98
Query: 108 FLGLKPDLARRKDQSH-EDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAA 166
G ++ + + F Y++V +P S+DWR KGAVTH+K+QG CG CWAFS VAA
Sbjct: 99 HNGYTMSMSSHQSSYRTKSFRYENVTAVPPSLDWRTKGAVTHIKDQGQCGCCWAFSAVAA 158
Query: 167 VEGINQIVTGNLASLSEQELIDCDNT-YNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIM 225
+EGI ++ TG L SLSEQEL+DCD + + GC GGLMD AF++I+ GL E +YPY
Sbjct: 159 MEGITKLSTGTLISLSEQELVDCDTSGMDQGCEGGLMDDAFEFIIENNGLTTEANYPYEG 218
Query: 226 EEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYD 285
+G+C K + I GY +VP E++L KA+ANQP+SVAI+A FQ YS G++
Sbjct: 219 VDGSCNTRKAANHAAKITGYENVPAYDEEALRKAVANQPVSVAIDAGESAFQHYSSGIFT 278
Query: 286 GHCGTQLDHGVAAVGYGST-RGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKM 344
G CGT+LDHGV VGYG++ G Y +VKNSWG WGE GYIRM+R+ EGLCGI
Sbjct: 279 GDCGTELDHGVTVVGYGTSDDGTKYWLVKNSWGTSWGEDGYIRMERDIDAKEGLCGIAME 338
Query: 345 ASYP 348
SYP
Sbjct: 339 PSYP 342
>gi|242094002|ref|XP_002437491.1| hypothetical protein SORBIDRAFT_10g028010 [Sorghum bicolor]
gi|241915714|gb|EER88858.1| hypothetical protein SORBIDRAFT_10g028010 [Sorghum bicolor]
Length = 397
Score = 305 bits (781), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 161/338 (47%), Positives = 212/338 (62%), Gaps = 31/338 (9%)
Query: 40 SNDKLIDLFESWMSKFEKVYESLD----EKLERFEIFKDNLRHID----ETNRKIKNYWL 91
+++++ ++E+W SK + + D E R E+F+DNLR+ID E + + + L
Sbjct: 46 ADEEVRRMYEAWKSKHGRPRGNCDMAGDEDRLRLEVFRDNLRYIDAHNAEADAGLHTFRL 105
Query: 92 GLNEFADLRHEEFKEMFLGLKPDLARRK----------------DQSHEDFSYKDVV--D 133
GL FADL EE++ LG + AR + +SH D
Sbjct: 106 GLTPFADLTLEEYRGRALGFR---ARHRGGPSARAAASRVGSGGTRSHHRRPRPRPRCGD 162
Query: 134 LPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTY 193
LP ++DWR+ GAVT VKNQ CG CWAFS VAA+EGIN IVTGNL SLSEQE+IDCD T
Sbjct: 163 LPDAIDWRQLGAVTDVKNQEQCGGCWAFSAVAAIEGINAIVTGNLVSLSEQEIIDCD-TQ 221
Query: 194 NNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESE-VVTINGYHDVPQNS 252
++GCNGG M+ AFQ+++ GG+ E DYP+I +GTC+ K E V I+G+ +V N+
Sbjct: 222 DSGCNGGQMENAFQFVIDNGGIDSEADYPFIATDGTCDANKANDEKVAAIDGFVEVASNN 281
Query: 253 EDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIV 312
E +L +A+A QP+SVAI+A GR FQ YS G+++G CGT LDHGV VGYGS G Y IV
Sbjct: 282 ETALQEAVAIQPVSVAIDAGGRAFQHYSSGIFNGPCGTNLDHGVTVVGYGSENGKAYWIV 341
Query: 313 KNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
KNSW WGE GYIR++RN P G CGI ASYP+K
Sbjct: 342 KNSWSDSWGEAGYIRIRRNVFLPVGKCGIAMDASYPVK 379
>gi|47169030|pdb|1S4V|A Chain A, The 2.0 A Crystal Structure Of The Kdel-Tailed Cysteine
Endopeptidase Functioning In Programmed Cell Death Of
Ricinus Communis Endosperm
gi|47169031|pdb|1S4V|B Chain B, The 2.0 A Crystal Structure Of The Kdel-Tailed Cysteine
Endopeptidase Functioning In Programmed Cell Death Of
Ricinus Communis Endosperm
Length = 229
Score = 305 bits (781), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 144/219 (65%), Positives = 172/219 (78%), Gaps = 1/219 (0%)
Query: 134 LPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTY 193
+P SVDWRKKGAVT VK+QG CGSCWAFST+ AVEGINQI T L SLSEQEL+DCD
Sbjct: 2 VPASVDWRKKGAVTSVKDQGQCGSCWAFSTIVAVEGINQIKTNKLVSLSEQELVDCDTDQ 61
Query: 194 NNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSE 253
N GCNGGLMDYAF++I GG+ E +YPY +GTC+++K + V+I+G+ +VP+N E
Sbjct: 62 NQGCNGGLMDYAFEFIKQRGGITTEANYPYEAYDGTCDVSKENAPAVSIDGHENVPENDE 121
Query: 254 DSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGST-RGLDYIIV 312
++LLKA+ANQP+SVAI+A G DFQFYS GV+ G CGT+LDHGVA VGYG+T G Y V
Sbjct: 122 NALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGSCGTELDHGVAIVGYGTTIDGTKYWTV 181
Query: 313 KNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIKK 351
KNSWGP+WGEKGYIRM+R EGLCGI ASYPIKK
Sbjct: 182 KNSWGPEWGEKGYIRMERGISDKEGLCGIAMEASYPIKK 220
>gi|60100207|gb|AAX13273.1| putative cysteine protease [Oryza sativa Japonica Group]
Length = 349
Score = 305 bits (781), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 154/321 (47%), Positives = 196/321 (61%), Gaps = 9/321 (2%)
Query: 37 DLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKN--YWLGLN 94
DL + E WM+K + Y EK R E+F+DN+ I+ N +WL N
Sbjct: 29 DLVDAAAMAQRHERWMAKHGRAYADDAEKARRLEVFRDNVAFIESVNAAASQHKFWLEEN 88
Query: 95 EFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVV--DLPKSVDWRKKGAVTHVKNQ 152
+FADL + EF+ GL+P + R +++ F Y +V DLP SVDWR KGAV VK+Q
Sbjct: 89 QFADLTNAEFRATRTGLRPS-SSRGNRAPTSFRYANVSTGDLPASVDWRGKGAVNPVKDQ 147
Query: 153 GSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCD-NTYNNGCNGGLMDYAFQYIVS 211
G CG CWAFS VAA+EG ++ TG L SLSEQ+L+ CD + GC GGLMD AF +I+
Sbjct: 148 GDCGCCWAFSAVAAMEGAVKLATGKLVSLSEQQLVSCDVKGEDQGCEGGLMDDAFDFIIK 207
Query: 212 TGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEA 271
GGL E DYPY + C + TI GY DVP N E +LLKA+ANQP+SVAI+
Sbjct: 208 NGGLAAESDYPYTASDDKCATAGAGAAAATIKGYEDVPANDEAALLKAVANQPVSVAIDG 267
Query: 272 SGRDFQFYSGGVYDGH--CGTQLDHGVAAVGYG-STRGLDYIIVKNSWGPKWGEKGYIRM 328
R FQFY GGV G C T+LDH + AVGYG ++ G Y ++KNSWG WGE GY+RM
Sbjct: 268 GDRHFQFYKGGVLSGAAGCATELDHAITAVGYGVASDGTKYWLMKNSWGTSWGEDGYVRM 327
Query: 329 KRNTGKPEGLCGINKMASYPI 349
+R EG+CG+ MASYP
Sbjct: 328 ERGVADKEGVCGLAMMASYPT 348
>gi|171702829|dbj|BAG16370.1| cysteine protease [Brassica oleracea var. italica]
Length = 332
Score = 305 bits (781), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 149/297 (50%), Positives = 198/297 (66%), Gaps = 6/297 (2%)
Query: 50 SWMSKFEKVYESLDEKLERFEIFKDNLRHIDETN--RKIKNYWLGLNEFADLRHEEFKEM 107
+WM++ +VY +EK R+ +FK N+ I+ N + + L +N+FADL +EEF+ M
Sbjct: 33 AWMTEHGRVYADANEKNNRYVVFKRNVESIERLNEVQYGLTFKLAVNQFADLTNEEFRSM 92
Query: 108 FLGLKPDLARRKDQSHEDFSYKDVVD--LPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVA 165
+ G K + F Y+ V LP SVDWRKKGAVT +K+QGSCGSCWAFS VA
Sbjct: 93 YTGYKGNSVLSSRTKPTSFRYQHVSSDALPISVDWRKKGAVTPIKDQGSCGSCWAFSAVA 152
Query: 166 AVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIM 225
A+EG+ QI G L SLSEQEL+DCD T ++GC GG M+ AF Y ++TGGL E +YPY
Sbjct: 153 AIEGVAQIKKGKLISLSEQELVDCD-TNDDGCMGGYMNSAFNYTMTTGGLTSESNYPYKS 211
Query: 226 EEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYD 285
+GTC + K + +I G+ DVP N E +L+KA+A+ P+S+ I G FQFYS GV+
Sbjct: 212 TDGTCNINKTKQIATSIKGFEDVPANDEKALMKAVAHHPVSIGIAGGGTGFQFYSSGVFS 271
Query: 286 GHCGTQLDHGVAAVGYG-STRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGI 341
G C T LDHGVA VGYG S+ G Y I+KNSWGPKWGE+GY+R+K++T G CG+
Sbjct: 272 GECSTHLDHGVAVVGYGKSSNGSKYWILKNSWGPKWGERGYMRIKKDTKAKHGQCGL 328
>gi|449500383|ref|XP_004161083.1| PREDICTED: vignain-like [Cucumis sativus]
Length = 345
Score = 305 bits (780), Expect = 3e-80, Method: Compositional matrix adjust.
Identities = 164/322 (50%), Positives = 216/322 (67%), Gaps = 7/322 (2%)
Query: 33 YSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKNYWLG 92
+ ++L + + L L+E W K + +L EK +RF +FK+N+ H+ N+ K Y L
Sbjct: 26 FDEKELATEESLWQLYERW-GKHHTISRNLKEKHKRFSVFKENVNHVFTVNQMDKPYKLK 84
Query: 93 LNEFADLRHEEFKEMFLGLKPDLARR---KDQSHEDFSYKDVVDLPKSVDWRKKGAVTHV 149
LN+FAD+ + EF + R+ + + F Y+ DLP SVD R++GAV V
Sbjct: 85 LNKFADMSNYEFVNFYARSNISHYRKLHERRRGAGGFMYEQDTDLPSSVDGRERGAVNAV 144
Query: 150 KNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYI 209
K QG CGSCWAFS+VAAVEGIN+I T L SLSEQEL+DC N N GCNGG M+ AF +I
Sbjct: 145 KEQGRCGSCWAFSSVAAVEGINKIKTNQLLSLSEQELLDC-NYRNKGCNGGFMEIAFDFI 203
Query: 210 VSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAI 269
GG+ E YPY G C ++ S +V I+GY VP+N ED+L++A+ANQP+SVAI
Sbjct: 204 KRNGGIATENSYPYHGSRGLCRSSRISSPIVKIDGYESVPEN-EDALMQAVANQPVSVAI 262
Query: 270 EASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTR-GLDYIIVKNSWGPKWGEKGYIRM 328
+A+GRDFQFYS GV+DG+CGT+L+HGV A+GYG+T G DY +V+NSWG WGE GY+RM
Sbjct: 263 DAAGRDFQFYSQGVFDGYCGTELNHGVVAIGYGTTEDGTDYWLVRNSWGVGWGEDGYVRM 322
Query: 329 KRNTGKPEGLCGINKMASYPIK 350
KR + EGLCGI ASYPIK
Sbjct: 323 KRGVEQAEGLCGIAMEASYPIK 344
>gi|358343350|ref|XP_003635767.1| Cysteine proteinase [Medicago truncatula]
gi|355501702|gb|AES82905.1| Cysteine proteinase [Medicago truncatula]
Length = 338
Score = 305 bits (780), Expect = 3e-80, Method: Compositional matrix adjust.
Identities = 155/326 (47%), Positives = 206/326 (63%), Gaps = 10/326 (3%)
Query: 30 IVGYSPEDLTSNDK----LIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRK 85
I PE T N + +E+W+ ++ + Y +E RF+I++ N+++I+ N +
Sbjct: 17 IASACPEIHTKNSTNPAVMKKRYETWLKRYGRHYRDREEWEVRFDIYQSNVQYIEFYNSQ 76
Query: 86 IKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGA 145
+Y L N FAD+ +EEFK +LG P + +F Y +LPKS+DWRKKGA
Sbjct: 77 NYSYKLIDNRFADITNEEFKSTYLGYLPRF-----RVQTEFRYHKHGELPKSIDWRKKGA 131
Query: 146 VTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCD-NTYNNGCNGGLMDY 204
VTHVK+QG CGSCWAFS VAAVEGIN+I T NL SLSEQ+LIDCD + N GC GG M
Sbjct: 132 VTHVKDQGRCGSCWAFSAVAAVEGINKIKTENLVSLSEQQLIDCDIKSGNEGCEGGDMYI 191
Query: 205 AFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQP 264
AF YI GG+ ++YPY +G C +K ++ VTI+GY VP +E L A+A+QP
Sbjct: 192 AFNYIKKHGGIATAKEYPYKGRDGNCNKSKAKNNAVTISGYESVPARNEKMLKAAVAHQP 251
Query: 265 LSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKG 324
+S+A +A G FQFYS G++ G CG L+HG+ VGYG G Y IVKNSW WGE G
Sbjct: 252 VSIATDAGGYAFQFYSKGIFSGSCGKNLNHGMTIVGYGEENGDKYWIVKNSWANDWGESG 311
Query: 325 YIRMKRNTGKPEGLCGINKMASYPIK 350
Y+RMKR+T +G CGI A+YP+K
Sbjct: 312 YVRMKRDTKDKDGTCGIAMDATYPVK 337
>gi|356543118|ref|XP_003540010.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 339
Score = 304 bits (778), Expect = 5e-80, Method: Compositional matrix adjust.
Identities = 156/302 (51%), Positives = 193/302 (63%), Gaps = 7/302 (2%)
Query: 49 ESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRK-IKNYWLGLNEFADLRHEEFKEM 107
E W K+ KVY+ EK +R IFKDN+ I+ N K Y L +N D +EEF
Sbjct: 41 EQWTKKYGKVYKDAAEKQKRLLIFKDNVEFIESFNAAGNKPYKLSINHLTDQTNEEFVAS 100
Query: 108 FLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAV 167
G K + S F Y+++ +P +VDWR+ GAV +K+QG CG+CWAFSTVA
Sbjct: 101 HNGYK----HKGSHSQTPFKYENITGVPNAVDWRENGAVXAMKDQGQCGNCWAFSTVATT 156
Query: 168 EGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEE 227
EGI QI T L SLSEQEL+DCD+ ++GC+GG M+ F++I GG+ E +YPY +
Sbjct: 157 EGIYQITTSMLMSLSEQELVDCDSV-DHGCDGGYMEGGFEFIXKNGGISSEANYPYTAVD 215
Query: 228 GTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGH 287
GT + K S I GY VP NSED+L KA+ANQP+SV I+ G FQF S GV+ G
Sbjct: 216 GTYDANKEASPAAQIKGYETVPANSEDALQKAVANQPVSVTIDVGGSAFQFNSSGVFTGQ 275
Query: 288 CGTQLDHGVAAVGYGST-RGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMAS 346
CGTQLDHGV AVGYGST G Y IVKNSWG +WGE+GYIRM+R T EGLCGI AS
Sbjct: 276 CGTQLDHGVTAVGYGSTDDGTQYWIVKNSWGTQWGEEGYIRMQRGTDAQEGLCGIAMDAS 335
Query: 347 YP 348
YP
Sbjct: 336 YP 337
>gi|357458909|ref|XP_003599735.1| Cysteine proteinase [Medicago truncatula]
gi|357474677|ref|XP_003607623.1| Cysteine proteinase [Medicago truncatula]
gi|355488783|gb|AES69986.1| Cysteine proteinase [Medicago truncatula]
gi|355508678|gb|AES89820.1| Cysteine proteinase [Medicago truncatula]
Length = 342
Score = 304 bits (778), Expect = 5e-80, Method: Compositional matrix adjust.
Identities = 158/310 (50%), Positives = 205/310 (66%), Gaps = 5/310 (1%)
Query: 44 LIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNR-KIKNYWLGLNEFADLRHE 102
L + E WM++F K Y+ EK +RF+IFK+N+ I+ N K + L +N FADL +E
Sbjct: 33 LSNKHEKWMTQFGKSYKDAAEKEKRFQIFKNNVEFIELFNAVGNKPFNLSINHFADLTNE 92
Query: 103 EFKEMFLGLKPDLARRKDQSHE--DFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWA 160
EFK G K L + D +E F Y +V +P S+DWRK+GAVT +KNQGSCGSCWA
Sbjct: 93 EFKASLNGNK-KLHDKFDILNETTSFRYHNVTSVPASMDWRKRGAVTPIKNQGSCGSCWA 151
Query: 161 FSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEED 220
FSTVA++EGI+QI TG L SLSEQELIDC ++GC+GG ++ AF++I GG+ E +
Sbjct: 152 FSTVASIEGIHQITTGELVSLSEQELIDCVRGNSSGCSGGYLEDAFKFIAKKGGMASETN 211
Query: 221 YPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYS 280
YPY + C+ K V I GY VP NSE+ LLKA+ANQP+SV ++A FQFYS
Sbjct: 212 YPYKETDEKCKFKKESKHVAEIKGYEKVPSNSENDLLKAVANQPVSVYVDAGDYVFQFYS 271
Query: 281 GGVYDGHCGTQLDHGVAAVGYG-STRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLC 339
GG++ G CGT DH V VGYG S +Y +VKNSWG WGEKGY+++KRN +GLC
Sbjct: 272 GGIFTGKCGTDTDHVVTIVGYGVSLDYTEYWLVKNSWGTGWGEKGYMKLKRNVDSKKGLC 331
Query: 340 GINKMASYPI 349
GI SYP+
Sbjct: 332 GIATNPSYPV 341
>gi|113120273|gb|ABI30276.1| VXH-C [Vasconcellea x heilbornii]
Length = 282
Score = 303 bits (776), Expect = 8e-80, Method: Compositional matrix adjust.
Identities = 157/277 (56%), Positives = 200/277 (72%), Gaps = 4/277 (1%)
Query: 8 KTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLE 67
K I ++ C+ SFA DFSIVGYS +DLTS +K I LFESWM K +KVY+S++EK+
Sbjct: 9 KLIFVATCLIVRAGLSFA-DFSIVGYSQDDLTSIEKSIRLFESWMLKHDKVYKSMEEKIN 67
Query: 68 RFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHE-DF 126
RFEIFKDNL +IDETN+K +YWLGLNEFADL H+EFK+ ++G P+ +QS + +F
Sbjct: 68 RFEIFKDNLMYIDETNKKNNSYWLGLNEFADLTHDEFKKKYVGSIPEDYTIIEQSDDGEF 127
Query: 127 SYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQEL 186
YK VVD P+SVDWR+KGAVT VK+Q CGSCWAFSTVA VEGIN+IVTG L SLSEQEL
Sbjct: 128 PYKHVVDYPESVDWRQKGAVTPVKDQNPCGSCWAFSTVATVEGINKIVTGKLISLSEQEL 187
Query: 187 IDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYH 246
+DCD ++GC+GG + QY+V G+H E +Y Y ++G C + V INGY
Sbjct: 188 LDCDRR-SHGCDGGYQRTSLQYVVDN-GVHTEYEYQYEKKQGNCRAKNKKGLKVYINGYK 245
Query: 247 DVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGV 283
VP N E SL+K +ANQP+SV +++S R F FY GG+
Sbjct: 246 GVPPNDEISLIKVIANQPVSVLVDSSERAFHFYRGGI 282
>gi|18408828|ref|NP_566920.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|12324451|gb|AAG52191.1|AC012329_18 putative cysteine proteinase; 15366-14136 [Arabidopsis thaliana]
gi|6723404|emb|CAB66413.1| cysteine protease-like protein [Arabidopsis thaliana]
gi|332645009|gb|AEE78530.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 341
Score = 303 bits (776), Expect = 9e-80, Method: Compositional matrix adjust.
Identities = 154/312 (49%), Positives = 205/312 (65%), Gaps = 10/312 (3%)
Query: 45 IDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKI-KNYWLGLNEFADLRHEE 103
++ E WMS+F +VY EK RFEIF +NL+ ++ N K Y L +NEF+DL EE
Sbjct: 32 VEKHEQWMSRFNRVYSDDSEKTSRFEIFTNNLKFVESINMNTNKTYTLDVNEFSDLTDEE 91
Query: 104 FKEMFLGLK-PDLARR--KDQSHE--DFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSC 158
FK + GL P+ R SHE F Y++V + +S+DW ++GAVT VK+Q CG C
Sbjct: 92 FKARYTGLVVPEGMTRISTTDSHETVSFRYENVGETGESMDWIQEGAVTSVKHQQQCGCC 151
Query: 159 WAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKE 218
WAFS VAAVEG+ +I G L SLSEQ+L+DC +T NNGC GG+M AF YI G+ E
Sbjct: 152 WAFSAVAAVEGMTKIANGELVSLSEQQLLDC-STENNGCGGGIMWKAFDYIKENQGITTE 210
Query: 219 EDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQF 278
++YPY + TCE TI+GY VPQN E++LLKA++ QP+SVAIE SG +F
Sbjct: 211 DNYPYQGAQQTCE--SNHLAAATISGYETVPQNDEEALLKAVSQQPVSVAIEGSGYEFIH 268
Query: 279 YSGGVYDGHCGTQLDHGVAAVGYG-STRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEG 337
YSGG+++G CGTQL H V VGYG S G+ Y ++KNSWG WGE GY+R+ R+ P+G
Sbjct: 269 YSGGIFNGECGTQLTHAVTIVGYGVSEEGIKYWLLKNSWGESWGENGYMRIMRDVDSPQG 328
Query: 338 LCGINKMASYPI 349
+CG+ +A YP+
Sbjct: 329 MCGLASLAYYPV 340
>gi|302143411|emb|CBI21972.3| unnamed protein product [Vitis vinifera]
Length = 320
Score = 303 bits (775), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 159/353 (45%), Positives = 217/353 (61%), Gaps = 40/353 (11%)
Query: 1 MALSSQFKTILIS--FCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKV 58
MA +Q++ I ++ F ++ + + AR + + E WM ++ +
Sbjct: 1 MASVNQYQYICLALLFVLAAWASQATARSLH-----------EASMYERHEDWMVQYGRE 49
Query: 59 YESLDEKLERFEIFKDNLRHIDETNRKI-KNYWLGLNEFADLRHEEFKEMFLGLKPDLAR 117
Y+ DEK +R++IFKDN+ I+ N+ + K+Y L +NEFADL +EEF+ K +
Sbjct: 50 YKDADEKSKRYKIFKDNVARIESFNKAMDKSYKLSINEFADLTNEEFRASRNRFKAHICS 109
Query: 118 RKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGN 177
+ S F Y++V +P +VDWRKKGAVT +K+QG CGSCWAFS VAA+EGI Q+ TG
Sbjct: 110 TEATS---FKYENVTAVPSTVDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGK 166
Query: 178 LASLSEQELIDCDNT-YNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGE 236
L SLSEQEL+DCD + + GC +YPY +GTC K
Sbjct: 167 LISLSEQELVDCDTSGEDQGCT---------------------NYPYAGTDGTCNRKKAA 205
Query: 237 SEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGV 296
INGY DVP N+E +L KA+A+QP++VAI+ASG +FQFYS GV+ G CGT+LDHGV
Sbjct: 206 HPAAKINGYEDVPANNEKALQKAVAHQPIAVAIDASGSEFQFYSSGVFTGQCGTELDHGV 265
Query: 297 AAVGYGST-RGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
AAVGYG++ G+ Y +VKNSW WGE+GYIRM+R+ EGLCGI ASYP
Sbjct: 266 AAVGYGTSDDGMKYWLVKNSWSTGWGEEGYIRMQRDVTAKEGLCGIAMQASYP 318
>gi|125547258|gb|EAY93080.1| hypothetical protein OsI_14881 [Oryza sativa Indica Group]
Length = 314
Score = 303 bits (775), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 152/308 (49%), Positives = 194/308 (62%), Gaps = 9/308 (2%)
Query: 49 ESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKN--YWLGLNEFADLRHEEFKE 106
E WM+K + Y EK+ R E+F+DN+ I+ N +WL N+FADL + EF+
Sbjct: 6 ERWMAKHGRAYADDAEKVRRLEVFRDNVAFIESVNAAASQHKFWLEENQFADLTNAEFRA 65
Query: 107 MFLGLKPDLARRKDQSHEDFSYKDVV--DLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTV 164
GL+P + R +++ F Y +V DLP SVDWR KGAV VK+QG CG CWAFS V
Sbjct: 66 TRTGLRPS-SSRGNRAPTSFRYANVSTGDLPASVDWRGKGAVNPVKDQGDCGCCWAFSAV 124
Query: 165 AAVEGINQIVTGNLASLSEQELIDCD-NTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPY 223
AA+EG ++ TG L SLSEQ+L+ CD + GC GGLMD AF +I+ GGL E DYPY
Sbjct: 125 AAMEGAVKLATGKLVSLSEQQLVSCDVKGEDQGCEGGLMDDAFDFIIKNGGLAAESDYPY 184
Query: 224 IMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGV 283
+ C + TI GY DVP N E +LLKA+ANQP+SVAI+ R FQFY GGV
Sbjct: 185 TASDDKCATAGAGAAAATIKGYEDVPANDEAALLKAVANQPVSVAIDGGDRHFQFYKGGV 244
Query: 284 YDGH--CGTQLDHGVAAVGYG-STRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCG 340
G C T+LDH + AVGYG ++ G Y ++KNSWG WGE GY+RM+R EG+CG
Sbjct: 245 LSGAAGCATELDHAITAVGYGVASDGTKYWLMKNSWGTSWGEDGYVRMERGVADKEGVCG 304
Query: 341 INKMASYP 348
+ MASYP
Sbjct: 305 LAMMASYP 312
>gi|413943290|gb|AFW75939.1| maize insect resistance1 [Zea mays]
Length = 435
Score = 302 bits (774), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 157/333 (47%), Positives = 215/333 (64%), Gaps = 23/333 (6%)
Query: 40 SNDKLIDLFESWMSKFEKVYESLDE-------------KLERFEIFKDNLRHIDETNRK- 85
+++++ ++E+W SK + S D+ + R E+F+DNLR+ID+ N +
Sbjct: 76 ADEEVRRMYEAWKSKHGRGGSSNDDCDMAPGDDEQEEDRRLRLEVFRDNLRYIDKHNAEA 135
Query: 86 ---IKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVD----LPKSV 138
+ + LGL FADL +E++ LG + R + Y+ LP ++
Sbjct: 136 DAGLHTFRLGLTPFADLTLDEYRGRVLGFRARARRSGARYGHGHGYRARPRGGDLLPDAI 195
Query: 139 DWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCN 198
DWR+ GAVT VK+Q CG CWAFS VAA+EGIN I TGNL SLSEQE+IDCD ++GC+
Sbjct: 196 DWRQLGAVTEVKDQQQCGGCWAFSAVAAIEGINAIATGNLVSLSEQEIIDCD-AQDSGCD 254
Query: 199 GGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESE-VVTINGYHDVPQNSEDSLL 257
GG M+ AF++++ GG+ E DYP+I +GTC+ +K +E V TI+G +V N+E +L
Sbjct: 255 GGQMENAFRFVIGNGGIDTEADYPFIGTDGTCDASKENNEKVATIDGLVEVASNNETALQ 314
Query: 258 KALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWG 317
+A+A QP+SVAI+ASGR FQ YS G+++G CGT LDHGV AVGYGS G DY IVKNSW
Sbjct: 315 EAVAIQPVSVAIDASGRAFQHYSSGIFNGPCGTSLDHGVTAVGYGSESGKDYWIVKNSWS 374
Query: 318 PKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
WGE GYIRM+RN +P G CGI ASYP+K
Sbjct: 375 ASWGEAGYIRMRRNVPRPTGKCGIAMDASYPVK 407
>gi|302143412|emb|CBI21973.3| unnamed protein product [Vitis vinifera]
Length = 320
Score = 302 bits (774), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 158/353 (44%), Positives = 218/353 (61%), Gaps = 40/353 (11%)
Query: 1 MALSSQFKTILIS--FCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKV 58
MA +Q++ I ++ F ++ + + AR+ + + E WM ++ +
Sbjct: 1 MASVNQYQYICLALLFVLAAWASQATARNLH-----------EASMYERHEDWMVQYGRE 49
Query: 59 YESLDEKLERFEIFKDNLRHIDETNRKI-KNYWLGLNEFADLRHEEFKEMFLGLKPDLAR 117
Y+ DEK +R++IFKDN+ I+ N+ + K+Y L +NEFADL +EEF+ K +
Sbjct: 50 YKDADEKSKRYKIFKDNVARIESFNKAMDKSYKLSINEFADLTNEEFRASRNRFKAHICS 109
Query: 118 RKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGN 177
+ S F Y++V +P +VDWRKKGAVT +K+QG CGSCWAFS VAA+EGI Q+ TG
Sbjct: 110 TEATS---FKYENVTAVPSTVDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGK 166
Query: 178 LASLSEQELIDCDNT-YNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGE 236
L SLSEQEL+DCD + + GC +YPY +GTC K
Sbjct: 167 LISLSEQELVDCDTSGEDQGCT---------------------NYPYAGTDGTCNRKKAA 205
Query: 237 SEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGV 296
INGY DVP N+E +L KA+A+QP++VAI+A G +FQFYS GV+ G CGT+LDHGV
Sbjct: 206 HPAAKINGYEDVPANNEKALQKAVAHQPIAVAIDAGGSEFQFYSSGVFTGQCGTELDHGV 265
Query: 297 AAVGYGST-RGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
+AVGYG++ G+ Y +VKNSWG WGE+GYIRM+R+ EGLCGI ASYP
Sbjct: 266 SAVGYGTSDDGMKYWLVKNSWGTGWGEEGYIRMQRDVTAKEGLCGIAMQASYP 318
>gi|358248896|ref|NP_001239703.1| uncharacterized protein LOC100799247 precursor [Glycine max]
gi|255636729|gb|ACU18700.1| unknown [Glycine max]
Length = 341
Score = 302 bits (774), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 148/305 (48%), Positives = 207/305 (67%), Gaps = 4/305 (1%)
Query: 49 ESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRK-IKNYWLGLNEFADLRHEEFKEM 107
E WM+++ KVY+ EK +RF++FK+N++ I+ N K + L +N+FADL EEFK +
Sbjct: 36 EKWMAQYGKVYKDAAEKEKRFQVFKNNVQFIESFNAAGDKPFNLSINQFADLHDEEFKAL 95
Query: 108 FLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQG-SCGSCWAFSTVAA 166
++ +R + + F Y++V +P ++DWRK+GAVT +K+QG +CGSCWAF+TVA
Sbjct: 96 LNNVQKKASRVETATETSFRYENVTKIPSTMDWRKRGAVTPIKDQGYTCGSCWAFATVAT 155
Query: 167 VEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIME 226
VE ++QI TG L SLSEQEL+DC + GC GG ++ AF++I + GG+ E YPY +
Sbjct: 156 VESLHQITTGELVSLSEQELVDCVRGDSEGCRGGYVENAFEFIANKGGITSEAYYPYKGK 215
Query: 227 EGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDG 286
+ +C++ K V I GY VP NSE +LLKA+ANQP+SV I+A F+FYS G+++
Sbjct: 216 DRSCKVKKETHGVARIIGYESVPSNSEKALLKAVANQPVSVYIDAGAIAFKFYSSGIFEA 275
Query: 287 -HCGTQLDHGVAAVGYGSTR-GLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKM 344
+CGT LDH VA VGYG R G Y +VKNSW WGEKGY+R+KR+ +GLCGI
Sbjct: 276 RNCGTHLDHAVAVVGYGKLRDGTKYWLVKNSWSTAWGEKGYMRIKRDIRAKKGLCGIASN 335
Query: 345 ASYPI 349
ASYPI
Sbjct: 336 ASYPI 340
>gi|38346007|emb|CAD40110.2| OSJNBa0035O13.9 [Oryza sativa Japonica Group]
gi|125589429|gb|EAZ29779.1| hypothetical protein OsJ_13837 [Oryza sativa Japonica Group]
Length = 314
Score = 302 bits (774), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 152/309 (49%), Positives = 193/309 (62%), Gaps = 9/309 (2%)
Query: 49 ESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKN--YWLGLNEFADLRHEEFKE 106
E WM+K + Y EK R E+F+DN+ I+ N +WL N+FADL + EF+
Sbjct: 6 ERWMAKHGRAYADDAEKARRLEVFRDNVAFIESVNAAASQHKFWLEENQFADLTNAEFRA 65
Query: 107 MFLGLKPDLARRKDQSHEDFSYKDVV--DLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTV 164
GL+P + R +++ F Y +V DLP SVDWR KGAV VK+QG CG CWAFS V
Sbjct: 66 TRTGLRPS-SSRGNRAPTSFRYANVSTGDLPASVDWRGKGAVNPVKDQGDCGCCWAFSAV 124
Query: 165 AAVEGINQIVTGNLASLSEQELIDCD-NTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPY 223
AA+EG ++ TG L SLSEQ+L+ CD + GC GGLMD AF +I+ GGL E DYPY
Sbjct: 125 AAMEGAVKLATGKLVSLSEQQLVSCDVKGEDQGCEGGLMDDAFDFIIKNGGLAAESDYPY 184
Query: 224 IMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGV 283
+ C + TI GY DVP N E +LLKA+ANQP+SVAI+ R FQFY GGV
Sbjct: 185 TASDDKCATAGAGAAAATIKGYEDVPANDEAALLKAVANQPVSVAIDGGDRHFQFYKGGV 244
Query: 284 YDGH--CGTQLDHGVAAVGYG-STRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCG 340
G C T+LDH + AVGYG ++ G Y ++KNSWG WGE GY+RM+R EG+CG
Sbjct: 245 LSGAAGCATELDHAITAVGYGVASDGTKYWLMKNSWGTSWGEDGYVRMERGVADKEGVCG 304
Query: 341 INKMASYPI 349
+ MASYP
Sbjct: 305 LAMMASYPT 313
>gi|38345008|emb|CAD40026.2| OSJNBa0052O21.11 [Oryza sativa Japonica Group]
gi|125589414|gb|EAZ29764.1| hypothetical protein OsJ_13822 [Oryza sativa Japonica Group]
Length = 339
Score = 302 bits (774), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 150/321 (46%), Positives = 208/321 (64%), Gaps = 12/321 (3%)
Query: 34 SPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKNYWLGL 93
+ +L+ + + E WM+++ +VY EK RFE+FK N+ I+ N N+WLG+
Sbjct: 23 AARELSDDAAMAARHERWMAQYGRVYRDDAEKARRFEVFKANVAFIESFNAGNHNFWLGV 82
Query: 94 NEFADLRHEEFKEMFL--GLKPDLARRKDQSHEDFSYKDV-VD-LPKSVDWRKKGAVTHV 149
N+FADL ++EF+ M G P R F Y++V +D LP +VDWR KGAVT +
Sbjct: 83 NQFADLTNDEFRWMKTNKGFIPSTTRVP----TGFRYENVNIDALPATVDWRTKGAVTPI 138
Query: 150 KNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCD-NTYNNGCNGGLMDYAFQY 208
K+QG CG CWAFS VAA+EGI ++ TG L SLSEQEL+DCD + + GC GGLMD AF++
Sbjct: 139 KDQGQCGCCWAFSAVAAMEGIVKLSTGKLISLSEQELVDCDVHGEDQGCEGGLMDDAFKF 198
Query: 209 IVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVA 268
I+ GGL E +YPY + C+ + V +I GY DVP N+E +L+KA+ANQP+SVA
Sbjct: 199 IIKNGGLTTESNYPYAAADDKCKSV--SNSVASIKGYEDVPANNEAALMKAVANQPVSVA 256
Query: 269 IEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYG-STRGLDYIIVKNSWGPKWGEKGYIR 327
++ FQFY GGV G CGT LDHG+ A+GYG ++ G Y ++KNSWG WGE G++R
Sbjct: 257 VDGGDMTFQFYKGGVMTGSCGTDLDHGIVAIGYGKASDGTKYWLLKNSWGTTWGENGFLR 316
Query: 328 MKRNTGKPEGLCGINKMASYP 348
M+++ G+CG+ SYP
Sbjct: 317 MEKDISDKRGMCGLAMEPSYP 337
>gi|357113934|ref|XP_003558756.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like
[Brachypodium distachyon]
Length = 346
Score = 301 bits (772), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 148/317 (46%), Positives = 206/317 (64%), Gaps = 9/317 (2%)
Query: 40 SNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADL 99
+++ + E WM++F +VY+ EK R E+FK N+ I+ N + +WLG N+FADL
Sbjct: 33 ADNAMAARHEQWMAQFGRVYKDPAEKAHRLEVFKANVAFIESFNAENHEFWLGANQFADL 92
Query: 100 RHEEFK--EMFLGLKPDLARRKDQSHEDFSYKDV-VD-LPKSVDWRKKGAVTHVKNQGSC 155
++EF+ + G+K R + F Y DV +D LP SVDWR KGAVT +KNQG C
Sbjct: 93 TNDEFRASKTNKGIKQGGVR---DAPTGFKYSDVSIDALPASVDWRTKGAVTPIKNQGQC 149
Query: 156 GSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCD-NTYNNGCNGGLMDYAFQYIVSTGG 214
GSCWAFS VAA EG+ ++ TG L SLSEQEL+DCD + + GC GG MD AF++I+ GG
Sbjct: 150 GSCWAFSAVAATEGVVKLSTGKLVSLSEQELVDCDVHGVDQGCMGGWMDDAFKFIIKNGG 209
Query: 215 LHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGR 274
L E +YPY E+ C+ + + TI GY DVP N E +L+KA+A+QP+SV ++
Sbjct: 210 LTTEANYPYTGEDDKCKSNETVNVAATIKGYEDVPANDESALMKAVAHQPVSVVVDGGDM 269
Query: 275 DFQFYSGGVYDGHCGTQLDHGVAAVGYGST-RGLDYIIVKNSWGPKWGEKGYIRMKRNTG 333
FQ Y+GGV G CG ++DHG+AA+GYG+T G Y ++KNSWG WGEKG++RM ++
Sbjct: 270 TFQLYAGGVMTGSCGVEMDHGIAAIGYGATSNGTKYWLMKNSWGTTWGEKGFLRMAKDIP 329
Query: 334 KPEGLCGINKMASYPIK 350
G+CG+ SYP +
Sbjct: 330 DKRGMCGLAMKPSYPTE 346
>gi|57118011|gb|AAW34137.1| cysteine protease gp3b [Zingiber officinale]
Length = 466
Score = 301 bits (772), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 158/317 (49%), Positives = 206/317 (64%), Gaps = 6/317 (1%)
Query: 40 SNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETN----RKIKNYWLGLNE 95
S++++ +++ W +K R E+FK+NLR +DE N R Y LG+N
Sbjct: 35 SDEEVRIIYQEWRAKHRPAENDQYVGDYRLEVFKENLRFVDEHNAAADRGEHAYRLGMNR 94
Query: 96 FADLRHEEFKEMFLGLKPDLARRKD-QSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGS 154
FADL +EE++ FL L R + + ++ LP S+DWR+KGAV VK+QG
Sbjct: 95 FADLTNEEYRARFLRDLSRLGRSTSGEISNQYRLREGDVLPDSIDWREKGAVVAVKSQGR 154
Query: 155 CGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGG 214
CGSCWAF+ +A VEGINQIVTG+L SLSEQ+L+DC +T N+GC GG AFQYI++ GG
Sbjct: 155 CGSCWAFAAIATVEGINQIVTGDLISLSEQQLVDC-STRNHGCEGGWPYRAFQYIINNGG 213
Query: 215 LHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGR 274
++ EE YPY GTC TKG + VV+I+ Y +VP N E SL KA+ANQP+SV I ASGR
Sbjct: 214 VNSEEHYPYTGTNGTCNTTKGNAHVVSIDSYRNVPSNDEKSLQKAVANQPISVGINASGR 273
Query: 275 DFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGK 334
+FQ Y G++ G C T L+HGV VGYG+ G DY IVKNSWG WG+ GYI M+RN +
Sbjct: 274 NFQLYHSGIFTGSCNTSLNHGVTVVGYGTVNGNDYWIVKNSWGESWGDSGYILMERNIAE 333
Query: 335 PEGLCGINKMASYPIKK 351
G CGI SYPIK+
Sbjct: 334 SSGKCGIAISPSYPIKE 350
>gi|356554921|ref|XP_003545789.1| PREDICTED: LOW QUALITY PROTEIN: thiol protease SEN102-like [Glycine
max]
Length = 439
Score = 301 bits (772), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 151/303 (49%), Positives = 199/303 (65%), Gaps = 4/303 (1%)
Query: 49 ESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKI-KNYWLGLNEFADLRHEEFKEM 107
E WM++ KVY+ E+ +RF IF +N+ +++ N K Y LG+N+F DL ++EF
Sbjct: 136 EQWMTRHGKVYKDPREREKRFRIFNENVNYVEAFNNAANKPYKLGINQFXDLTNQEFIAP 195
Query: 108 FLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAV 167
K + ++ F Y++V +P +VDWR+ GAVT VK+QG CG CWAFS VAA
Sbjct: 196 RNRFKGHMCSSIIRT-TTFKYENVTTVPSTVDWRQNGAVTPVKDQGQCGCCWAFSAVAAT 254
Query: 168 EGINQIVTGNLASLSEQELIDCDNT-YNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIME 226
EGI+ + G L SLSEQEL+DCD + GC GGLMD A+++I+ GL+ E +YPY
Sbjct: 255 EGIHALSGGKLISLSEQELVDCDTKGVDQGCEGGLMDDAYKFIIQNHGLNTEANYPYKGV 314
Query: 227 EGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDG 286
+G C + + TI GY DVP N+E +L KA+ANQP+SVAI+AS DFQFY G + G
Sbjct: 315 DGKCNANEAANHAATITGYEDVPANNEKALQKAVANQPVSVAIDASSSDFQFYKSGAFTG 374
Query: 287 HCGTQLDHGVAAVGYG-STRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMA 345
CGT+LDHGV AVGYG S G Y +VKNSWG +WGE+GYIRM+R EG+CGI A
Sbjct: 375 SCGTELDHGVTAVGYGVSDHGTKYWLVKNSWGTEWGEEGYIRMQRGVDSEEGVCGIAMQA 434
Query: 346 SYP 348
SYP
Sbjct: 435 SYP 437
>gi|171702841|dbj|BAG16376.1| cysteine protease [Brassica rapa var. perviridis]
Length = 333
Score = 301 bits (772), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 147/296 (49%), Positives = 193/296 (65%), Gaps = 6/296 (2%)
Query: 51 WMSKFEKVYESLDEKLERFEIFKDNLRHIDETN--RKIKNYWLGLNEFADLRHEEFKEMF 108
WM++ +VY +EK R+ +FK N+ I+ N + + L +N+FADL +EEF+ M+
Sbjct: 35 WMTEHGRVYADANEKNNRYAVFKRNVERIERLNDVQSGLTFKLAVNQFADLTNEEFRSMY 94
Query: 109 LGLKPDLARRKDQSHEDFSYKDVVD--LPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAA 166
G K + F Y++V LP SVDWRKKGAVT +K+QG CGSCWAFS VAA
Sbjct: 95 TGFKGNSVLSSRTKPTSFRYQNVSSDALPVSVDWRKKGAVTPIKDQGLCGSCWAFSAVAA 154
Query: 167 VEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIME 226
+EG+ QI G L SLSEQEL+DCD T + GC GGLMD AF Y ++ GGL E +YPY
Sbjct: 155 IEGVAQIKKGKLISLSEQELVDCD-TNDGGCMGGLMDTAFNYTITIGGLTSESNYPYKST 213
Query: 227 EGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDG 286
GTC K + +I G+ DVP N E +L+KA+A+ P+S+ I FQFYS GV+ G
Sbjct: 214 NGTCNFNKTKQIATSIKGFEDVPANDEKALMKAVAHHPVSIGIAGGDIGFQFYSSGVFSG 273
Query: 287 HCGTQLDHGVAAVGYGSTR-GLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGI 341
C T LDHGV AVGYG ++ GL Y I+KNSWGPKWGE+GY+R+K++ G CG+
Sbjct: 274 ECTTHLDHGVTAVGYGRSKNGLKYWILKNSWGPKWGERGYMRIKKDIKPKHGQCGL 329
>gi|32396018|gb|AAP41846.1| cysteine protease [Anthurium andraeanum]
Length = 502
Score = 301 bits (772), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 163/345 (47%), Positives = 212/345 (61%), Gaps = 17/345 (4%)
Query: 22 SSFARDFSIVGYSPEDLTSN--DKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHI 79
S A ++ G PED+ + + +LFE WM K KVY EK R+ F NL +
Sbjct: 23 SCSAGEWPSSGQGPEDVGAGGVEGGQELFERWMEKHRKVYAHPGEKARRYANFLSNLAFV 82
Query: 80 DETNRKIK-----NYWLGLNEFADLRHEEFKEMF----LGLKPDLARRKDQSHEDFSYKD 130
+ N + + +G+N FADL +EEF+E++ L K R + +
Sbjct: 83 RKRNAEGRRAPSSGQGVGMNVFADLSNEEFREVYSSRVLRKKAAEGRGARRRAGEGRVVA 142
Query: 131 VVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCD 190
D P S+DWRK+GAVT VKNQG CGSCWAFS+ A+EGIN I TG L SLSEQEL+DCD
Sbjct: 143 GCDAPASLDWRKRGAVTAVKNQGDCGSCWAFSSTGAMEGINAITTGELISLSEQELVDCD 202
Query: 191 NTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIME-EGTCEMTKGESEVVTINGYHDVP 249
T N GC+GG MDYAF+++++ GG+ E +YPY + + C TK E +VV+I+GY DV
Sbjct: 203 TT-NEGCDGGYMDYAFEWVINNGGIDSEANYPYTGQADSVCNTTKEEIKVVSIDGYEDV- 260
Query: 250 QNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQ---LDHGVAAVGYGSTRG 306
SE +LL A QP+SV I+ S DFQ Y+GG+YDG C +DH V VGYG G
Sbjct: 261 ATSESALLCAAVQQPVSVGIDGSSLDFQLYAGGIYDGDCSGNPDDIDHAVLVVGYGQQGG 320
Query: 307 LDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIKK 351
DY IVKNSWG WG +GYI ++RNTG P G+C I+ MASYP K+
Sbjct: 321 TDYWIVKNSWGTDWGMQGYIYIRRNTGLPYGVCAIDAMASYPTKQ 365
>gi|38346003|emb|CAD40112.2| OSJNBa0035O13.5 [Oryza sativa Japonica Group]
gi|125589427|gb|EAZ29777.1| hypothetical protein OsJ_13835 [Oryza sativa Japonica Group]
Length = 339
Score = 301 bits (771), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 151/311 (48%), Positives = 199/311 (63%), Gaps = 12/311 (3%)
Query: 44 LIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEE 103
++ E WM ++ +VY+ EK RFEIFK N+ I+ N +WLG+N+FADL + E
Sbjct: 33 MVARHERWMEQYGRVYKDATEKARRFEIFKANVAFIESFNAGNHKFWLGVNQFADLTNYE 92
Query: 104 FKEMFL--GLKPDLARRKDQSHEDFSYKDV-VD-LPKSVDWRKKGAVTHVKNQGSCGSCW 159
F+ G P R F Y++V +D LP +VDWR KGAVT +K+QG CG CW
Sbjct: 93 FRATKTNKGFIPSTVRVPTT----FRYENVSIDTLPATVDWRTKGAVTPIKDQGQCGCCW 148
Query: 160 AFSTVAAVEGINQIVTGNLASLSEQELIDCD-NTYNNGCNGGLMDYAFQYIVSTGGLHKE 218
AFS VAA+EGI ++ TG L SLSEQEL+DCD + + GC GGLMD AF++I+ GGL E
Sbjct: 149 AFSAVAAMEGIVKLSTGKLISLSEQELVDCDVHGEDQGCEGGLMDDAFKFIIKNGGLTTE 208
Query: 219 EDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQF 278
YPY +G C G + TI GY DVP N+E +L+KA+ANQP+SVA++ FQF
Sbjct: 209 SKYPYTAADGKCN--GGSNSAATIKGYEDVPANNEAALMKAVANQPVSVAVDGGDMTFQF 266
Query: 279 YSGGVYDGHCGTQLDHGVAAVGYGST-RGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEG 337
YSGGV G CGT LDHG+ A+GYG G Y ++KNSWG WGE G++RM+++ G
Sbjct: 267 YSGGVMTGSCGTDLDHGIVAIGYGKDGDGTQYWLLKNSWGTTWGENGFLRMEKDISDKRG 326
Query: 338 LCGINKMASYP 348
+CG+ SYP
Sbjct: 327 MCGLAMEPSYP 337
>gi|414588010|tpg|DAA38581.1| TPA: hypothetical protein ZEAMMB73_156486 [Zea mays]
Length = 347
Score = 300 bits (769), Expect = 5e-79, Method: Compositional matrix adjust.
Identities = 151/327 (46%), Positives = 213/327 (65%), Gaps = 18/327 (5%)
Query: 34 SPEDLTSNDKL--IDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKI----K 87
+ +L +D+L + E WM + +VY+ +K RF +FK N++ I+ N +
Sbjct: 25 AARELGGDDELAMVARHEQWMVQHGRVYKDETDKAHRFLVFKANVKFIESFNAAAAAGNR 84
Query: 88 NYWLGLNEFADLRHEEFKEMFL--GLKPDLARRKDQSHEDFSYKDV-VD-LPKSVDWRKK 143
+WLG+N+FADL ++EF+ G P++ + F Y+++ +D LP++VDWR K
Sbjct: 85 KFWLGVNQFADLTNDEFRATKTNKGFNPNVVKVP----TGFRYQNLSIDALPQTVDWRTK 140
Query: 144 GAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCD-NTYNNGCNGGLM 202
GAVT +K+QG CG CWAFS VAA EGI +I TG L SLSEQEL+DCD + + GCNGG M
Sbjct: 141 GAVTPIKDQGQCGCCWAFSAVAATEGIVKISTGKLTSLSEQELVDCDVHGEDQGCNGGEM 200
Query: 203 DYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN 262
D AF++I+ GGL E +YPY ++G C+ G + TI GY DVP N E +L+KA+A+
Sbjct: 201 DDAFKFIIKNGGLTTESNYPYTAQDGQCK--SGSNGAATIKGYEDVPANDEAALMKAVAS 258
Query: 263 QPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTR-GLDYIIVKNSWGPKWG 321
QP+SVA++ FQFYSGGV G CGT LDHG+AA+GYG T G Y ++KNSWG WG
Sbjct: 259 QPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYGKTSDGTKYWLMKNSWGTTWG 318
Query: 322 EKGYIRMKRNTGKPEGLCGINKMASYP 348
E G++RM+++ +G+CG+ SYP
Sbjct: 319 ENGFLRMEKDIADKKGMCGLAMQPSYP 345
>gi|125547236|gb|EAY93058.1| hypothetical protein OsI_14861 [Oryza sativa Indica Group]
Length = 339
Score = 300 bits (769), Expect = 5e-79, Method: Compositional matrix adjust.
Identities = 149/321 (46%), Positives = 208/321 (64%), Gaps = 12/321 (3%)
Query: 34 SPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKNYWLGL 93
+ +L+ + + E WM+++ +VY EK RFE+FK N+ I+ N N+WLG+
Sbjct: 23 AARELSDDAAMAARHERWMAQYGRVYRDDAEKARRFEVFKANVAFIESFNAGNHNFWLGV 82
Query: 94 NEFADLRHEEFK--EMFLGLKPDLARRKDQSHEDFSYKDV-VD-LPKSVDWRKKGAVTHV 149
N+FADL ++EF+ + G P R F Y++V +D LP +VDWR KGAVT +
Sbjct: 83 NQFADLTNDEFRWTKTNKGFIPSTTRVP----TGFRYENVNIDALPATVDWRTKGAVTPI 138
Query: 150 KNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCD-NTYNNGCNGGLMDYAFQY 208
K+QG CG CWAFS VAA+EGI ++ TG L SLSEQEL+DCD + + GC GGLMD AF++
Sbjct: 139 KDQGQCGCCWAFSAVAAMEGIVKLSTGKLISLSEQELVDCDVHGEDQGCEGGLMDDAFKF 198
Query: 209 IVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVA 268
I+ GGL E +YPY + C+ + V +I GY DVP N+E +L+KA+ANQP+SVA
Sbjct: 199 IIKNGGLTTESNYPYAAADDKCKSV--SNSVASIKGYEDVPANNEAALMKAVANQPVSVA 256
Query: 269 IEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYG-STRGLDYIIVKNSWGPKWGEKGYIR 327
++ FQFY GGV G CGT LDHG+ A+GYG ++ G Y ++KNSWG WGE G++R
Sbjct: 257 VDGGDMTFQFYKGGVMTGSCGTDLDHGIVAIGYGKASDGTKYWLLKNSWGTTWGENGFLR 316
Query: 328 MKRNTGKPEGLCGINKMASYP 348
M+++ G+CG+ SYP
Sbjct: 317 MEKDISDKRGMCGLAMEPSYP 337
>gi|356543112|ref|XP_003540007.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 345
Score = 300 bits (768), Expect = 7e-79, Method: Compositional matrix adjust.
Identities = 151/305 (49%), Positives = 198/305 (64%), Gaps = 5/305 (1%)
Query: 49 ESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRK-IKNYWLGLNEFADLRHEEFKEM 107
E+WM+++ KVY+ EK +RF+IFK+N+ I+ N K + L +N+FADL EEFK +
Sbjct: 39 ENWMAQYGKVYKDAAEKKKRFQIFKNNVHFIESFNTAGDKPFNLSINQFADLHDEEFKAL 98
Query: 108 FL-GLKP--DLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTV 164
G K + ++ F Y V L ++DWRK+GAVT +K+Q CGSCWAFS V
Sbjct: 99 LTNGNKKVRSVVGTATETETSFKYNRVTKLLATMDWRKRGAVTPIKDQRRCGSCWAFSAV 158
Query: 165 AAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYI 224
AA+EGI+QI T L SLSEQEL+DC + GCNGG M+ AF+++ GG+ E YPY
Sbjct: 159 AAIEGIHQITTSKLVSLSEQELVDCVKGESEGCNGGYMEDAFEFVAKKGGIASESYYPYK 218
Query: 225 MEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVY 284
++ +C++ K V I GY VP NSE +L KA+A+QP+SV +EA G FQFYS G++
Sbjct: 219 GKDKSCKVKKETHGVSQIKGYEKVPSNSEKALQKAVAHQPVSVYVEAGGNAFQFYSSGIF 278
Query: 285 DGHCGTQLDHGVAAVGYGSTR-GLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINK 343
G CGT DH + VGYG +R G Y +VKNSWG WGEKGYIRMKR+ EGLCGI
Sbjct: 279 TGKCGTNTDHAITVVGYGKSRGGTKYWLVKNSWGAGWGEKGYIRMKRDIRAKEGLCGIAM 338
Query: 344 MASYP 348
A YP
Sbjct: 339 NAFYP 343
>gi|357452869|ref|XP_003596711.1| Cysteine proteinase [Medicago truncatula]
gi|355485759|gb|AES66962.1| Cysteine proteinase [Medicago truncatula]
Length = 344
Score = 300 bits (768), Expect = 7e-79, Method: Compositional matrix adjust.
Identities = 154/319 (48%), Positives = 210/319 (65%), Gaps = 7/319 (2%)
Query: 38 LTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKN-YWLGLNEF 96
L + +L++ E WM + K Y+ EK +RF+IFK+NL I+ N N + L +N+F
Sbjct: 25 LVISSRLLEKHEQWMEEHGKFYKDAAEKEQRFQIFKENLEFIESFNAAGDNGFNLSINQF 84
Query: 97 ADLRHEEFKEMFL-GLKPDLARRKDQSHED---FSYKDVVDLPKSVDWRKKGAVTHVKNQ 152
D ++EFK +L G K L + E+ F Y++V ++P ++DWR++GAVT +K+Q
Sbjct: 85 GDQTNDEFKANYLNGKKKPLIGVGIAAIEEESVFRYENVTEVPATMDWRERGAVTPIKHQ 144
Query: 153 GSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNT-YNNGCNGGLMDYAFQYIVS 211
CGSCWAF+TVAA+EGI+QI TG L SLSEQEL+DC T +GCNGG ++ A +IV
Sbjct: 145 HLCGSCWAFATVAAIEGIHQITTGRLVSLSEQELVDCVKTNTTDGCNGGYVEDACDFIVK 204
Query: 212 TGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEA 271
GG+ E +YPY +G C + KG V I GY VP N+E +LLKA+ANQP++V I A
Sbjct: 205 KGGITSETNYPYTRVDGKCNVRKGTYNVAKIKGYEHVPANNEKALLKAVANQPIAVYIAA 264
Query: 272 SGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGST-RGLDYIIVKNSWGPKWGEKGYIRMKR 330
+ R FQFYS G+ G CG LDH V VGYG++ G+ Y +VKNSWG KWGEKGYI++KR
Sbjct: 265 TKRAFQFYSSGILKGKCGIDLDHTVTIVGYGTSDDGVKYWLVKNSWGTKWGEKGYIKIKR 324
Query: 331 NTGKPEGLCGINKMASYPI 349
+ EG CGI + +YPI
Sbjct: 325 DVHAKEGSCGIAMVPTYPI 343
>gi|37780049|gb|AAP32197.1| cysteine protease 10 [Trifolium repens]
Length = 272
Score = 300 bits (768), Expect = 7e-79, Method: Compositional matrix adjust.
Identities = 153/269 (56%), Positives = 183/269 (68%), Gaps = 3/269 (1%)
Query: 82 TNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWR 141
+N K Y LG+N+FADL +EEFK K + ++ F Y++ +P +VDWR
Sbjct: 3 SNVNNKLYKLGINKFADLTNEEFKASRNKFKGHMCSSIIRT-TTFKYENASAIPSTVDWR 61
Query: 142 KKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNT-YNNGCNGG 200
KKGAVT VKNQG CGSCWAFS VAA EGI+Q+ TG L SLSEQELIDCD + GC GG
Sbjct: 62 KKGAVTPVKNQGQCGSCWAFSAVAATEGIHQLSTGKLVSLSEQELIDCDTKGVDQGCEGG 121
Query: 201 LMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKAL 260
LMD AF++I+ GL E YPY +GTC + VTI GY DVP N+E +L KA+
Sbjct: 122 LMDDAFKFIIQNHGLSTEVQYPYEGVDGTCNTNEASIHAVTITGYEDVPANNELALQKAV 181
Query: 261 ANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYG-STRGLDYIIVKNSWGPK 319
ANQP+SVAI+ASG DFQFY+ GV+ G CGT+LDHGV AVGYG G Y +VKNSWG
Sbjct: 182 ANQPISVAIDASGSDFQFYNSGVFTGSCGTELDHGVTAVGYGVGNDGTKYWLVKNSWGAD 241
Query: 320 WGEKGYIRMKRNTGKPEGLCGINKMASYP 348
WGE+GYIRM+R EGLCGI ASYP
Sbjct: 242 WGEEGYIRMQRGIDAAEGLCGIAMQASYP 270
>gi|125547256|gb|EAY93078.1| hypothetical protein OsI_14879 [Oryza sativa Indica Group]
Length = 339
Score = 300 bits (767), Expect = 9e-79, Method: Compositional matrix adjust.
Identities = 150/311 (48%), Positives = 199/311 (63%), Gaps = 12/311 (3%)
Query: 44 LIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEE 103
++ E WM ++ +VY+ EK RFEIFK N+ I+ N +WLG+N+FADL + E
Sbjct: 33 MVARHERWMEQYGRVYKDATEKARRFEIFKANVAFIESFNAGNHKFWLGVNQFADLTNYE 92
Query: 104 FKEMFL--GLKPDLARRKDQSHEDFSYKDV-VD-LPKSVDWRKKGAVTHVKNQGSCGSCW 159
F+ G P R F Y++V +D LP +VDWR KGAVT +K+QG CG CW
Sbjct: 93 FRATKTNKGFIPSTVRVPTT----FRYENVSIDTLPATVDWRTKGAVTPIKDQGQCGCCW 148
Query: 160 AFSTVAAVEGINQIVTGNLASLSEQELIDCD-NTYNNGCNGGLMDYAFQYIVSTGGLHKE 218
AFS VAA+EGI ++ TG L SLSEQEL+DCD + + GC GGLMD AF++I+ GGL E
Sbjct: 149 AFSAVAAMEGIVKLSTGKLISLSEQELVDCDVHGEDQGCEGGLMDDAFKFIIKNGGLTTE 208
Query: 219 EDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQF 278
YPY +G C G + TI GY +VP N+E +L+KA+ANQP+SVA++ FQF
Sbjct: 209 SKYPYTAADGKCN--GGSNSAATIKGYEEVPANNEAALMKAVANQPVSVAVDGGDMTFQF 266
Query: 279 YSGGVYDGHCGTQLDHGVAAVGYGST-RGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEG 337
YSGGV G CGT LDHG+ A+GYG G Y ++KNSWG WGE G++RM+++ G
Sbjct: 267 YSGGVMTGSCGTDLDHGIVAIGYGKDGDGTQYWLLKNSWGTTWGENGFLRMEKDISDKRG 326
Query: 338 LCGINKMASYP 348
+CG+ SYP
Sbjct: 327 MCGLAMEPSYP 337
>gi|116309130|emb|CAH66233.1| H0825G02.10 [Oryza sativa Indica Group]
Length = 339
Score = 299 bits (766), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 150/311 (48%), Positives = 198/311 (63%), Gaps = 12/311 (3%)
Query: 44 LIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEE 103
++ E WM ++ +VY+ EK RFEIFK N+ I+ N +WL +N+FADL + E
Sbjct: 33 MVARHERWMEQYGRVYKDATEKARRFEIFKANVAFIESFNAGNHKFWLSVNQFADLTNYE 92
Query: 104 FKEMFL--GLKPDLARRKDQSHEDFSYKDV-VD-LPKSVDWRKKGAVTHVKNQGSCGSCW 159
F+ G P R F Y++V +D LP +VDWR KGAVT +K+QG CG CW
Sbjct: 93 FRATKTNKGFIPSTVRVPTT----FRYENVSIDTLPATVDWRTKGAVTPIKDQGQCGCCW 148
Query: 160 AFSTVAAVEGINQIVTGNLASLSEQELIDCD-NTYNNGCNGGLMDYAFQYIVSTGGLHKE 218
AFS VAA+EGI ++ TG L SLSEQEL+DCD + + GC GGLMD AF++I+ GGL E
Sbjct: 149 AFSAVAAMEGIVKLSTGKLISLSEQELVDCDVHGEDQGCEGGLMDDAFKFIIKNGGLTTE 208
Query: 219 EDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQF 278
YPY +G C G + TI GY DVP N+E +L+KA+ANQP+SVA++ FQF
Sbjct: 209 SKYPYTAADGKCN--GGSNSAATIKGYEDVPANNEAALMKAVANQPVSVAVDGGDMTFQF 266
Query: 279 YSGGVYDGHCGTQLDHGVAAVGYGST-RGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEG 337
YSGGV G CGT LDHG+ A+GYG G Y ++KNSWG WGE G++RM+++ G
Sbjct: 267 YSGGVMTGSCGTDLDHGIVAIGYGKDGDGTQYWLLKNSWGTTWGENGFLRMEKDISDKRG 326
Query: 338 LCGINKMASYP 348
+CG+ SYP
Sbjct: 327 MCGLAMEPSYP 337
>gi|146216002|gb|ABQ10203.1| cysteine protease Cp5 [Actinidia deliciosa]
Length = 509
Score = 299 bits (765), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 157/338 (46%), Positives = 218/338 (64%), Gaps = 16/338 (4%)
Query: 27 DFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRK- 85
+FSIVG P + + +++++LF+ W K KVY+ E ++F+ F+DNLR++ E N +
Sbjct: 31 EFSIVG-RPGESIAEERVVELFKKWTEKHGKVYKHGQEVEKKFQNFRDNLRYVMEKNGER 89
Query: 86 --IKNYWLGLNEFADLRHEEFKEMFLGL--KPD-----LARRKDQSHEDFSYKDVVDLPK 136
+ +GLN+FAD+ +EEF+E+++ KP + RR+ D P
Sbjct: 90 GASGGHLVGLNKFADMSNEEFREVYVSKVKKPTSKRMAIERRRQGKAAAAKAVAACDGPT 149
Query: 137 SVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNG 196
S+DWRK G VT VK+QG CGSCWAFS+ A+EGIN + G+L SLSEQEL+DCD+T N+G
Sbjct: 150 SLDWRKYGIVTGVKDQGDCGSCWAFSSTGAIEGINALANGDLISLSEQELVDCDST-NDG 208
Query: 197 CNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSL 256
C GG MDYAF++++S GG+ E DYPY E+GTC TK E++ V+I+GY DV + E +L
Sbjct: 209 CEGGYMDYAFEWVMSNGGIDTETDYPYTGEDGTCNTTKEETKAVSIDGYEDVAEE-ESAL 267
Query: 257 LKALANQPLSVAIEASGRDFQFYSGGVY---DGHCGTQLDHGVAAVGYGSTRGLDYIIVK 313
A+ QP+SV I+ DFQ Y+GG+Y +DH V VGYG+ G +Y I+K
Sbjct: 268 FCAVLKQPISVGIDGGAIDFQLYTGGIYDGDCSDDPDDIDHAVLVVGYGAESGEEYWIIK 327
Query: 314 NSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIKK 351
NSWG WG KGY +KRNT K G+C IN MASYP K+
Sbjct: 328 NSWGTDWGMKGYAYIKRNTSKDYGVCAINAMASYPTKE 365
>gi|356557743|ref|XP_003547170.1| PREDICTED: LOW QUALITY PROTEIN: xylem cysteine proteinase 1-like
[Glycine max]
Length = 400
Score = 299 bits (765), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 165/361 (45%), Positives = 232/361 (64%), Gaps = 17/361 (4%)
Query: 3 LSSQFKTILISFCISF----FIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKV 58
+ Q KT L I + F+ ++SI+ + S + +++LF+ W + +K+
Sbjct: 1 MGCQLKTHLFLLFIVWGSWSFLCYDLPSEYSILALEIDKFPSEEGVVELFQRWKEENKKI 60
Query: 59 YESLDEKLERFEIFKDNLRHIDETN-RKIKNYW--LGLNEFADLRHEEFKEMFLG-LKPD 114
Y + +E+ RFE FK NL++I E N ++I Y LGLN+FAD+ +EEFK F+ +K
Sbjct: 61 YRNPEEEKLRFENFKRNLKYIVEKNSKRISPYGQSLGLNQFADMSNEEFKSKFMSKVKKP 120
Query: 115 LARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVT-HVKNQGSCGSCWAFSTVAAVEGINQI 173
++R S +D S +D P S+DWRKKG VT VK+QG CGS WAFS+ A+EGIN I
Sbjct: 121 FSKRNGVSSKDHSCEDE---PYSLDWRKKGVVTLAVKDQGYCGSYWAFSSTDAIEGINAI 177
Query: 174 VTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMT 233
VT +L SLSEQEL+DCD+T N+GC+GG MDYAF++++ GG+ E +YPYI +GTC +T
Sbjct: 178 VTADLISLSEQELVDCDST-NDGCDGGXMDYAFEWVMYNGGIDTETNYPYIGADGTCNVT 236
Query: 234 KGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGT--- 290
K +++V+ I+GY+DV Q S+ SLL A QP+S I+ + DFQ Y GG+YDG C +
Sbjct: 237 KEKTKVIGIDGYYDVGQ-SDSSLLCATVKQPISAGIDGTSWDFQLYIGGIYDGDCSSDPD 295
Query: 291 QLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
+DH + VGYGS DY IVKNSW WG +G I +++NT G C IN MASYP K
Sbjct: 296 DIDHAILVVGYGSEGDDDYWIVKNSWRTSWGMEGCIYLRKNTNLKYGXCAINYMASYPTK 355
Query: 351 K 351
+
Sbjct: 356 E 356
>gi|57118009|gb|AAW34136.1| cysteine protease gp3a [Zingiber officinale]
Length = 475
Score = 298 bits (763), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 159/316 (50%), Positives = 202/316 (63%), Gaps = 6/316 (1%)
Query: 40 SNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETN----RKIKNYWLGLNE 95
S++++ +++ W K R E+FK+NLR +DE N R Y LG+N
Sbjct: 44 SDEEVRIIYQEWRVKHRPAENDQYVGDYRLEVFKENLRFVDEHNAAADRGEHAYRLGMNR 103
Query: 96 FADLRHEEFKEMFLGLKPDLARRKD-QSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGS 154
FADL +EE++ FL L R + + ++ LP S+DWR+KGAV VKNQG
Sbjct: 104 FADLTNEEYRARFLRDLSRLGRSTSGEISNQYRLREGDVLPDSIDWREKGAVVAVKNQGR 163
Query: 155 CGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGG 214
CGSCWAF+ +AAVEGINQIVTG+L SLSEQ+L+DC +T N GC GG AFQYI++ GG
Sbjct: 164 CGSCWAFAAIAAVEGINQIVTGDLISLSEQQLVDC-STRNYGCEGGWPYRAFQYIINNGG 222
Query: 215 LHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGR 274
++ EE YPY GTC TK + VV+I+ Y +VP N E SL KA ANQP+SV I+ASGR
Sbjct: 223 VNSEEHYPYTGTNGTCNTTKENAHVVSIDSYRNVPSNDEKSLQKAAANQPISVGIDASGR 282
Query: 275 DFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGK 334
+FQ Y G++ G C T L+HGV VGYG+ G DY IVKNSWG WG GYI M+RN +
Sbjct: 283 NFQLYHSGIFTGSCNTSLNHGVTVVGYGTENGNDYWIVKNSWGENWGNSGYILMERNIAE 342
Query: 335 PEGLCGINKMASYPIK 350
G CGI SYPIK
Sbjct: 343 SSGKCGIAISPSYPIK 358
>gi|357160300|ref|XP_003578721.1| PREDICTED: oryzain beta chain-like [Brachypodium distachyon]
Length = 349
Score = 298 bits (763), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 155/361 (42%), Positives = 212/361 (58%), Gaps = 23/361 (6%)
Query: 1 MALSSQFKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYE 60
MA+ F ++ CI AR+ + +++ E WM++ +VY+
Sbjct: 1 MAIPKVFLLAVVLGCICLCSTVLSARELG-----------DAAMVERHEQWMAQHGRVYK 49
Query: 61 SLDEKLERFEIFKDNLRHIDETNR--KIKNYWLGLNEFADLRHEEFKEM-----FLGLKP 113
EK RFE F++N+ I+ N + +WLG+N+F DL ++EF+ F+ +
Sbjct: 50 DGAEKARRFEAFRNNVVFIESFNAAGNRRKFWLGVNQFTDLTNDEFRATKTNKGFI-KRN 108
Query: 114 DLARRKDQSHEDFSYKDVVD--LPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGIN 171
A K F Y +V LP +VDWR KGAVT +KNQG CG CWAFS VAA EGI
Sbjct: 109 AAAVNKASPTGTFRYSNVSADALPAAVDWRAKGAVTPIKNQGQCGCCWAFSAVAATEGIV 168
Query: 172 QIVTGNLASLSEQELIDCD-NTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTC 230
Q+ TG L LSEQEL+DCD N ++GC GG MD AF++I+ GGL E +YPY ++G C
Sbjct: 169 QLSTGKLVPLSEQELVDCDANGADHGCEGGEMDDAFEFIIKNGGLTSETNYPYTAQDGQC 228
Query: 231 EMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGT 290
+ + V TI GY DVP N E SL+KA+A QP+SVA++ FQ Y+GGV G CGT
Sbjct: 229 KAKNTINSVATIKGYEDVPANDEASLMKAVAAQPVSVAVDGGDMVFQHYAGGVLSGSCGT 288
Query: 291 QLDHGVAAVGYGST-RGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPI 349
LDHG+ AVGYG+ G + ++KNSWG WGE GYIRM+++ G+CG+ SYP
Sbjct: 289 SLDHGIVAVGYGAADDGTKFWLMKNSWGTTWGEDGYIRMEKDVADAGGMCGLAMQPSYPT 348
Query: 350 K 350
+
Sbjct: 349 E 349
>gi|356545116|ref|XP_003540991.1| PREDICTED: vignain-like [Glycine max]
Length = 342
Score = 298 bits (762), Expect = 4e-78, Method: Compositional matrix adjust.
Identities = 150/304 (49%), Positives = 197/304 (64%), Gaps = 6/304 (1%)
Query: 49 ESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRK-IKNYWLGLNEFADLRHEEFKEM 107
E WM+++ KVY+ EK +RF+IFK+N+ I+ + K + L +N+FADL +FK +
Sbjct: 39 EKWMAQYGKVYKDAAEKEKRFQIFKNNVHFIESFHAAGDKPFNLSINQFADLH--KFKAL 96
Query: 108 FLG--LKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVA 165
+ K R + F Y V +P S+DWRK+GAVT +K+QG+C SCWAFSTVA
Sbjct: 97 LINGQKKEHNVRTATATEASFKYDSVTRIPSSLDWRKRGAVTPIKDQGTCRSCWAFSTVA 156
Query: 166 AVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIM 225
+EG++QI G L SLSEQEL+DC + GC GG ++ AF++I GG+ E YPY
Sbjct: 157 TIEGLHQITKGELVSLSEQELVDCVKGDSEGCYGGYVEDAFEFIAKKGGVASETHYPYKG 216
Query: 226 EEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYD 285
TC++ K VV I GY VP NSE +LLKA+A+QP+S +EA G FQFYS G++
Sbjct: 217 VNKTCKVKKETHGVVQIKGYEQVPSNSEKALLKAVAHQPVSAYVEAGGYAFQFYSSGIFT 276
Query: 286 GHCGTQLDHGVAAVGYGSTRGLD-YIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKM 344
G CGT +DH V VGYG RG + Y +VKNSWG +WGEKGYIRMKR+ EGLCGI
Sbjct: 277 GKCGTDIDHSVTVVGYGKARGGNKYWLVKNSWGTEWGEKGYIRMKRDIRAKEGLCGIATG 336
Query: 345 ASYP 348
A YP
Sbjct: 337 ALYP 340
>gi|238007404|gb|ACR34737.1| unknown [Zea mays]
gi|413943289|gb|AFW75938.1| cysteine proteinase Mir2 [Zea mays]
Length = 484
Score = 297 bits (761), Expect = 5e-78, Method: Compositional matrix adjust.
Identities = 156/344 (45%), Positives = 209/344 (60%), Gaps = 26/344 (7%)
Query: 29 SIVGYSPEDLTSNDKLIDLFESWMSKFE----------KVYESLDEKLERFEIFKDNLRH 78
+ V +P +++++ L+E W S+ + + D+ R E+F+ NLR+
Sbjct: 34 AAVTVTPPPERTDEEVRRLYEEWRSEHDAGPRRGATGGSLGPGEDDDARRLEVFRYNLRY 93
Query: 79 ID----ETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHE-----DFSYK 129
ID E + + + LGL FADL EE++ L L R Y
Sbjct: 94 IDAHNAEADAGLHGFRLGLTRFADLTLEEYRARLL-----LGSRGRNGTAVGVVGSRRYL 148
Query: 130 DVV--DLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELI 187
+ LP +VDWR++GAV VK+QG CG+CWAFS VAAVEGIN+IVTG+L SLSEQELI
Sbjct: 149 PLAGEQLPDAVDWRERGAVAEVKDQGQCGACWAFSAVAAVEGINKIVTGSLISLSEQELI 208
Query: 188 DCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHD 247
DCD + GC+GGLMD AF +++ GG+ E DYP+ +GTC++ + VV+I+ +
Sbjct: 209 DCDKFQDQGCDGGLMDNAFVFMIKNGGIDTEADYPFTGHDGTCDLKLKNTRVVSIDSFER 268
Query: 248 VPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGL 307
VP N E +L KA+A+QP+S +IEAS R FQ YS G++DG CGT LDHGV VGYGS G
Sbjct: 269 VPINYERALQKAVAHQPVSASIEASRRAFQLYSSGIFDGRCGTYLDHGVTVVGYGSEGGK 328
Query: 308 DYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIKK 351
DY IVKNSWG +WGE GY+RM RN G CGI YP+K+
Sbjct: 329 DYWIVKNSWGTQWGEAGYVRMARNVRVRAGKCGIAMEPLYPVKE 372
>gi|310656790|gb|ADP02219.1| Peptidase_C1 domain-containing protein [Triticum aestivum]
Length = 419
Score = 296 bits (757), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 145/300 (48%), Positives = 195/300 (65%), Gaps = 10/300 (3%)
Query: 44 LIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEE 103
+++ E WM+KF +VY+ EK +RF+ FK N+ I+ N +WLG+N+F DL ++E
Sbjct: 33 MVEKHEQWMAKFNRVYKDSTEKAQRFKAFKANVAFIESFNTGNHKFWLGVNQFTDLTNDE 92
Query: 104 FKEMFL--GLKPDLARRKDQSHEDFSYKDVVD--LPKSVDWRKKGAVTHVKNQGSCGSCW 159
F+ GLK + AR + F Y +V LP +VDWR KG VT +K+QG CG CW
Sbjct: 93 FRATKTNKGLKRNGARAPTR----FKYNNVSTDALPAAVDWRTKGVVTPIKDQGQCGCCW 148
Query: 160 AFSTVAAVEGINQIVTGNLASLSEQELIDCD-NTYNNGCNGGLMDYAFQYIVSTGGLHKE 218
AFS VAA EGI ++ TG L SLSEQEL+DCD + + GC GG MD AF++I+ GGL E
Sbjct: 149 AFSAVAATEGIVKLSTGKLVSLSEQELVDCDVHGVDQGCEGGEMDNAFKFIIKNGGLTTE 208
Query: 219 EDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQF 278
+YPY ++G C+ + + V TI GY DVP N E SL+KA+ANQP+SVA++ FQ
Sbjct: 209 ANYPYTAQDGQCKTSTTSNSVATIKGYEDVPANDESSLMKAVANQPVSVAVDGGDVIFQH 268
Query: 279 YSGGVYDGHCGTQLDHGVAAVGYGSTR-GLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEG 337
YSGGV G CGT LDHG+ A+GYG T G + ++KNSWG WGE GY+RM+++ G
Sbjct: 269 YSGGVMTGSCGTDLDHGIVAIGYGMTSDGTKFWLLKNSWGTTWGESGYLRMEKDISDKSG 328
>gi|297740489|emb|CBI30671.3| unnamed protein product [Vitis vinifera]
Length = 320
Score = 296 bits (757), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 162/350 (46%), Positives = 209/350 (59%), Gaps = 34/350 (9%)
Query: 1 MALSSQFKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYE 60
MAL S+ CI+ I +A + + +++ +++ E WM + + Y+
Sbjct: 1 MALESKI------ICITLLIMGVWASQ--ALSRTLHEVSMSER----HEDWMGLYGRTYK 48
Query: 61 SLDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKD 120
+ EK RF+IFK+N+ +I+ N K K G N + R E
Sbjct: 49 DIAEKERRFKIFKENVEYIESVN-KFKASRNGYNMSSRPRSSEIT--------------- 92
Query: 121 QSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLAS 180
F Y++V +P S+DWRKKGAVT +K+QG CG CWAFS VAA+EG+ Q+ TG L S
Sbjct: 93 ----SFRYENVAAVPSSMDWRKKGAVTPIKDQGQCGCCWAFSAVAAMEGVTQLKTGELIS 148
Query: 181 LSEQELIDCDNT-YNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEV 239
LSEQEL+DCD + + GC GGLMD AF++I+ GGL E +YPY + TC K S
Sbjct: 149 LSEQELVDCDTSGEDQGCGGGLMDSAFEFIIGNGGLTTEANYPYKGVDATCNKKKAASSA 208
Query: 240 VTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAV 299
I Y DVP NSE +LLKA+A P+SVAI+A G DFQFYS GV+ G CGT+LDHGV AV
Sbjct: 209 AKIKNYEDVPANSEAALLKAVAQHPVSVAIDAGGSDFQFYSSGVFTGQCGTELDHGVTAV 268
Query: 300 GYGST-RGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
GYG T G Y +VKNSWG WGE GYI M+R+ G EGLCGI ASYP
Sbjct: 269 GYGKTDDGTKYWLVKNSWGTGWGEDGYIWMERDIGADEGLCGIAMEASYP 318
>gi|356517308|ref|XP_003527330.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 342
Score = 295 bits (756), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 144/303 (47%), Positives = 199/303 (65%), Gaps = 3/303 (0%)
Query: 49 ESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRK-IKNYWLGLNEFADLRHEEFKEM 107
E WM+++ +VY+ EK +RF++FK+N+ I+ N K + L +N+FADL EEFK +
Sbjct: 38 EKWMAQYGRVYKDAAEKEKRFQVFKNNVHFIESFNAAGDKPFNLSINQFADLNDEEFKAL 97
Query: 108 FLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAV 167
+ ++ + + + F Y+ V +P ++DWRK+GAVT +K+QG CGSCWAFS VAA
Sbjct: 98 LINVQKKASWVETSTETSFRYESVTKIPATIDWRKRGAVTPIKDQGRCGSCWAFSAVAAT 157
Query: 168 EGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEE 227
EGI+QI TG L LSEQEL+DC + GC GG +D AF++I GG+ E YPY
Sbjct: 158 EGIHQITTGKLVPLSEQELVDCVKGESEGCIGGYVDDAFEFIAKKGGIASETHYPYKGVN 217
Query: 228 GTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDG- 286
TC++ K V I GY VP N+E +LLKA+ANQP+SV I+A F++YS G+++
Sbjct: 218 KTCKVKKETHGVAEIKGYEKVPSNNEKALLKAVANQPVSVYIDAGTHAFKYYSSGIFNAR 277
Query: 287 HCGTQLDHGVAAVGYG-STRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMA 345
+CGT +H VA VGYG + G Y +VKNSWG +WGE+GYIR+KR+ EGLCGI K
Sbjct: 278 NCGTDPNHAVAVVGYGKALDGSKYWLVKNSWGTEWGERGYIRIKRDIRAKEGLCGIAKYP 337
Query: 346 SYP 348
YP
Sbjct: 338 YYP 340
>gi|357439999|ref|XP_003590277.1| Cysteine protease [Medicago truncatula]
gi|355479325|gb|AES60528.1| Cysteine protease [Medicago truncatula]
Length = 514
Score = 295 bits (756), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 163/388 (42%), Positives = 225/388 (57%), Gaps = 57/388 (14%)
Query: 16 ISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDN 75
++F ++SI+ + S +++++LF+ W + +K Y +E R E FK N
Sbjct: 20 LTFLSCYGIPSEYSILAFDLNKFPSEEQVVELFQQWKKEHQKFYIHPEEAALRLENFKRN 79
Query: 76 LRHIDETNRKIKN----YWLGLNEFADLRHEEFKEMFLG-LKPDLARRKDQSHEDFSYKD 130
L++I E N ++N + LGLN FAD+ +EEFK F+ +K +++R H D
Sbjct: 80 LKYIVERN-AMRNSPVGHHLGLNRFADMSNEEFKNKFISKVKKPISKRASNLHVKVESCD 138
Query: 131 VVDLPKSVDWRKKGAVTHVKNQGSCG---------------------------------- 156
D P S+DWRKKG VT VK+QG+CG
Sbjct: 139 --DAPYSLDWRKKGVVTGVKDQGNCGKLLYFMHFKSFLVIYILELTTNFPLYSFESQFCI 196
Query: 157 ----------SCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAF 206
SCW+FS+ A+EG+N IVTG+L SLSEQEL+DCD T N+GC GG MDYAF
Sbjct: 197 LEKKKLDFVGSCWSFSSTGAIEGVNAIVTGDLISLSEQELVDCDTT-NDGCEGGYMDYAF 255
Query: 207 QYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLS 266
+++++ GG+ E DYPYI GTC +TK E++VVTI+GY DV Q S+ +L A QP+S
Sbjct: 256 EWVINNGGIDTEADYPYIGVGGTCNVTKEETKVVTIDGYTDVTQ-SDSALFCATVKQPIS 314
Query: 267 VAIEASGRDFQFYSGGVYDGHCGTQ---LDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEK 323
V I+ S DFQ Y+GG+YDG C + +DH V VGYGS DY IVKNSWG WG +
Sbjct: 315 VGIDGSTLDFQLYTGGIYDGDCSSNPDDIDHAVLIVGYGSDGNQDYWIVKNSWGTSWGIE 374
Query: 324 GYIRMKRNTGKPEGLCGINKMASYPIKK 351
G+I ++RNT G+C IN MAS+P K+
Sbjct: 375 GFIYIRRNTNLKYGVCAINYMASFPTKE 402
>gi|297826875|ref|XP_002881320.1| hypothetical protein ARALYDRAFT_321132 [Arabidopsis lyrata subsp.
lyrata]
gi|297327159|gb|EFH57579.1| hypothetical protein ARALYDRAFT_321132 [Arabidopsis lyrata subsp.
lyrata]
Length = 341
Score = 295 bits (755), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 155/353 (43%), Positives = 218/353 (61%), Gaps = 21/353 (5%)
Query: 1 MALSSQFKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYE 60
M L + F + +F IS A ++ + P L + E WM++F +VY
Sbjct: 5 MVLVTIFTILFTTFSISQ------ATSRTVTFHEPSSLEKH-------EQWMARFSRVYR 51
Query: 61 SLDEKLERFEIFKDNLRHIDETNRK-IKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRK 119
EK R ++FK NL+ I+ N+K K+Y LG+NEFAD +EEF + GLK ++
Sbjct: 52 DELEKQMRRDVFKKNLKFIENFNKKGNKSYKLGVNEFADWTNEEFLAIHTGLKGLSSKVV 111
Query: 120 DQ--SHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGN 177
D+ S ++ D+V + K DWR +GAVT VK QG CG CWAFS VAAVEG+ +I GN
Sbjct: 112 DETISSRSWNISDMVGVSK--DWRAEGAVTPVKYQGQCGCCWAFSAVAAVEGVTKIAGGN 169
Query: 178 LASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGES 237
L SLSEQ+L+DCD Y+ GC+GG+M AF YI+ G+ E DY Y +G C +
Sbjct: 170 LVSLSEQQLLDCDREYDRGCDGGIMSDAFNYIIQNRGIASENDYSYQGSDGRCRSSA--R 227
Query: 238 EVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVA 297
I+G+ VP N+E +LL+A++ QP+SV+++A+G F YSGGVYDG CGT +H V
Sbjct: 228 PAARISGFQTVPSNNEQALLEAVSRQPVSVSMDANGDGFMHYSGGVYDGPCGTSSNHAVT 287
Query: 298 AVGYGSTR-GLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPI 349
VGYG+++ G Y + KNSWG WGEKGYIR++R+ P+G+CG+ + A YP+
Sbjct: 288 FVGYGTSQDGTKYWLAKNSWGETWGEKGYIRIRRDVAWPQGMCGVAQYAFYPV 340
>gi|356515056|ref|XP_003526217.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 342
Score = 295 bits (755), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 144/303 (47%), Positives = 199/303 (65%), Gaps = 3/303 (0%)
Query: 49 ESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRK-IKNYWLGLNEFADLRHEEFKEM 107
E WM+++ +VY+ EK +RF++FK+N+ I+ N K + L +N+FADL EEFK +
Sbjct: 38 EKWMAQYGRVYKDAAEKEKRFQVFKNNVHFIESFNAAGDKPFNLSINQFADLNDEEFKAL 97
Query: 108 FLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAV 167
+ ++ + + + F Y+ V +P ++DWRK+GAVT +K+QG CGSCWAFS VAA
Sbjct: 98 LINVQKKASWVETSTQTSFRYESVTKIPATIDWRKRGAVTPIKDQGRCGSCWAFSAVAAT 157
Query: 168 EGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEE 227
EGI+QI TG L LSEQEL+DC + GC GG +D AF++I GG+ E YPY
Sbjct: 158 EGIHQITTGKLVPLSEQELVDCVKGESEGCIGGYVDDAFEFIAKKGGIASETHYPYKGVN 217
Query: 228 GTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYD-G 286
TC++ K V I GY VP N+E +LLKA+ANQP+SV I+A F++YS G+++
Sbjct: 218 KTCKVKKETHGVAEIKGYEKVPSNNEKALLKAVANQPVSVYIDAGTHAFKYYSSGIFNVR 277
Query: 287 HCGTQLDHGVAAVGYG-STRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMA 345
+CGT +H VA VGYG + G Y +VKNSWG +WGE+GYIR+KR+ EGLCGI K
Sbjct: 278 NCGTDPNHAVAVVGYGKALDGSKYWLVKNSWGTEWGERGYIRIKRDIRAKEGLCGIAKYP 337
Query: 346 SYP 348
YP
Sbjct: 338 YYP 340
>gi|422001787|dbj|BAM66994.1| germination-specific cysteine protease 1, partial [Raphanus
sativus]
Length = 235
Score = 295 bits (754), Expect = 3e-77, Method: Compositional matrix adjust.
Identities = 137/218 (62%), Positives = 168/218 (77%), Gaps = 1/218 (0%)
Query: 134 LPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTY 193
LP++VDWR+KGAV +KNQG+CGSCWAFST A VEGIN+IVTG L SLSEQEL+DCD +Y
Sbjct: 4 LPETVDWRQKGAVNAIKNQGTCGSCWAFSTAAVVEGINKIVTGELISLSEQELVDCDKSY 63
Query: 194 NNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSE 253
N GCNGGLMDYAFQ+I+ GGL+ E+DYPY +G C S+VVTI+GY DVP N E
Sbjct: 64 NQGCNGGLMDYAFQFIMKNGGLNTEQDYPYRGSDGKCNSLLKNSKVVTIDGYEDVPTNDE 123
Query: 254 DSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVK 313
+L +A++ QP+SVAI+A GR FQ Y G++ G CGT++DH V AVGYGS G+DY IV+
Sbjct: 124 TALKRAVSYQPVSVAIDAGGRVFQHYQSGIFTGECGTKMDHAVVAVGYGSENGVDYWIVR 183
Query: 314 NSWGPKWGEKGYIRMKRNTGKPE-GLCGINKMASYPIK 350
NSWG KWGE GYIR++RN + G CGI ASYP+K
Sbjct: 184 NSWGQKWGEDGYIRIERNLASSKSGKCGIAIEASYPVK 221
>gi|116309178|emb|CAH66275.1| OSIGBa0147O06.5 [Oryza sativa Indica Group]
Length = 339
Score = 295 bits (754), Expect = 3e-77, Method: Compositional matrix adjust.
Identities = 145/321 (45%), Positives = 206/321 (64%), Gaps = 12/321 (3%)
Query: 34 SPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKNYWLGL 93
+ +L+ + + E WM+++ ++Y+ EK RFE+FK N+ I+ N +WLG+
Sbjct: 23 AARELSDDAAMAARHERWMAQYGRMYKDDAEKARRFEVFKANVAFIESFNAGNHKFWLGV 82
Query: 94 NEFADLRHEEFKEMFL--GLKPDLARRKDQSHEDFSYKDV-VD-LPKSVDWRKKGAVTHV 149
N+FADL ++EF+ G P R F Y++V +D LP ++DWR KG VT +
Sbjct: 83 NQFADLTNDEFRSTKTNKGFIPSTTRVP----TGFRYENVNIDALPATMDWRTKGVVTPI 138
Query: 150 KNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCD-NTYNNGCNGGLMDYAFQY 208
K+QG CG CWAFS VAA+EGI ++ TG L SLSEQEL+DCD + + GC GGLMD AF++
Sbjct: 139 KDQGQCGCCWAFSAVAAMEGIVKLSTGKLISLSEQELVDCDVHGEDQGCEGGLMDDAFKF 198
Query: 209 IVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVA 268
I+ GGL E +YPY + C+ + V +I GY DVP N+E +L+KA+ANQP+SVA
Sbjct: 199 IIKNGGLTTESNYPYAAADDKCKSV--SNSVASIKGYEDVPANNEAALMKAVANQPVSVA 256
Query: 269 IEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYG-STRGLDYIIVKNSWGPKWGEKGYIR 327
++ FQFY GGV G CGT LDHG+ A+GYG ++ G Y ++KNSWG WGE G++R
Sbjct: 257 VDGGDMTFQFYKGGVMTGSCGTDLDHGIVAIGYGKASDGTKYWLLKNSWGTTWGENGFLR 316
Query: 328 MKRNTGKPEGLCGINKMASYP 348
M+++ G+CG+ SYP
Sbjct: 317 MEKDISDKRGMCGLAMEPSYP 337
>gi|330805273|ref|XP_003290609.1| hypothetical protein DICPUDRAFT_92519 [Dictyostelium purpureum]
gi|325079248|gb|EGC32857.1| hypothetical protein DICPUDRAFT_92519 [Dictyostelium purpureum]
Length = 333
Score = 294 bits (753), Expect = 4e-77, Method: Compositional matrix adjust.
Identities = 150/305 (49%), Positives = 200/305 (65%), Gaps = 9/305 (2%)
Query: 48 FESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEM 107
F WM K ++ Y S +E +R++ FK+N+ I + N + + LGL +FADL +EE+K+
Sbjct: 33 FIGWMRKHDRAY-SHEEFTDRYQAFKENMDFIHKWNSQESDTVLGLTKFADLTNEEYKKH 91
Query: 108 FLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAV 167
+LG+K ++ + + + + + P S+DWR+KGAV+ VK+QG CGSCW+FST AV
Sbjct: 92 YLGIKVNVKKNLNAAQKGLKFFKFTG-PDSIDWREKGAVSQVKDQGQCGSCWSFSTTGAV 150
Query: 168 EGINQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIME 226
EG +QI +GN+ SLSEQ L+DC Y N GC GGLM AF+YI+ GG+ E YPY
Sbjct: 151 EGAHQIKSGNMVSLSEQNLVDCSGQYGNQGCEGGLMVNAFEYIIDNGGIATESSYPYTAA 210
Query: 227 EGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYD- 285
+G C+ TK + I GY ++PQ EDSL ALA QP+SVAI+AS FQ YS GVYD
Sbjct: 211 QGRCKFTKSMNGANII-GYKEIPQGEEDSLTAALAKQPVSVAIDASHMSFQLYSSGVYDE 269
Query: 286 GHCGTQ-LDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKM 344
C ++ LDHGV AVGYG+ G DY I+KNSWGP WG+ GYI M RN + CG+ M
Sbjct: 270 PACSSEALDHGVLAVGYGTLEGKDYYIIKNSWGPTWGQDGYIFMSRNA---QNQCGVATM 326
Query: 345 ASYPI 349
ASYPI
Sbjct: 327 ASYPI 331
>gi|18401420|ref|NP_565649.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|4314384|gb|AAD15594.1| cysteine proteinase [Arabidopsis thaliana]
gi|17381154|gb|AAL36389.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|20465849|gb|AAM20029.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|330252901|gb|AEC07995.1| cysteine proteinase-like protein [Arabidopsis thaliana]
Length = 348
Score = 293 bits (749), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 151/316 (47%), Positives = 200/316 (63%), Gaps = 11/316 (3%)
Query: 45 IDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIK-NYWLGLNEFADLRHEE 103
I+ E WM++F +VY EK RF IFK NL + N K Y + +NEF+DL EE
Sbjct: 32 IEKHEQWMARFNRVYSDETEKRNRFNIFKKNLEFVQNFNMNNKITYKVDINEFSDLTDEE 91
Query: 104 FKEMFLGLK-PDLARR-----KDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGS 157
F+ GL P+ R ++ F Y +V D +S+DWR++GAVT VK QG CG
Sbjct: 92 FRATHTGLVVPEAITRISTLSSGKNTVPFRYGNVSDNGESMDWRQEGAVTPVKYQGRCGG 151
Query: 158 CWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHK 217
CWAFS VAAVEGI +I G L SLSEQ+L+DCD YN GC GG+M AF+YI+ G+
Sbjct: 152 CWAFSAVAAVEGITKITKGELVSLSEQQLLDCDRDYNQGCRGGIMSKAFEYIIKNQGITT 211
Query: 218 EEDYPYIMEEGTCEMTKGES---EVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGR 274
E++YPY + TC + S TI+GY VP N+E++LL+A++ QP+SV IE +G
Sbjct: 212 EDNYPYQESQQTCSSSTTLSSSFRAATISGYETVPMNNEEALLQAVSQQPVSVGIEGTGA 271
Query: 275 DFQFYSGGVYDGHCGTQLDHGVAAVGYG-STRGLDYIIVKNSWGPKWGEKGYIRMKRNTG 333
F+ YSGGV++G CGT L H V VGYG S G Y +VKNSWG WGE GY+R+KR+
Sbjct: 272 AFRHYSGGVFNGECGTDLHHAVTIVGYGMSEEGTKYWVVKNSWGETWGENGYMRIKRDVD 331
Query: 334 KPEGLCGINKMASYPI 349
P+G+CG+ +A YP+
Sbjct: 332 APQGMCGLAILAFYPL 347
>gi|313118768|gb|ADR32296.1| C14 cysteine protease [Solanum demissum]
gi|313118770|gb|ADR32297.1| C14 cysteine protease [Solanum demissum]
Length = 217
Score = 292 bits (748), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 130/216 (60%), Positives = 165/216 (76%)
Query: 135 PKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYN 194
P SVDWR KG + VK+QGSCGSCWAFS VAA+E IN IVTGNL SLSEQEL+DCD +YN
Sbjct: 2 PVSVDWRDKGVLVGVKDQGSCGSCWAFSAVAAMESINAIVTGNLISLSEQELVDCDKSYN 61
Query: 195 NGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSED 254
GC+GGLMDYAF+++++ GG+ EEDYPY G C+ + ++VVTI+ Y DVP N+E
Sbjct: 62 EGCDGGLMDYAFEFVINNGGIDTEEDYPYKERNGVCDQYRKNAKVVTIDSYEDVPVNNEK 121
Query: 255 SLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKN 314
+L KA+A+QP+S+A+EA GRDFQ Y G++ G CGT +DHGV GYG+ G+DY IV+N
Sbjct: 122 ALQKAVAHQPVSIALEAGGRDFQHYKSGIFTGKCGTAVDHGVVVAGYGTENGMDYWIVRN 181
Query: 315 SWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
SWG KWGEKGY+R++RN GLCG+ SYP+K
Sbjct: 182 SWGAKWGEKGYLRVQRNVASSSGLCGLAIEPSYPVK 217
>gi|356542631|ref|XP_003539770.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 292 bits (748), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 156/327 (47%), Positives = 207/327 (63%), Gaps = 7/327 (2%)
Query: 28 FSIVGYSPEDLTSND-KLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDE-TNRK 85
F +S T D + + E WM++ KVY+ EK R++IF+ N++ I+ N
Sbjct: 18 FGFWAFSANTRTLEDASMHERHEQWMAQHGKVYKDHHEKELRYKIFQQNVKGIEGFNNAG 77
Query: 86 IKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGA 145
K++ LG+N+FADL EEFK + LK + + ++ F Y+ V +P ++DWR+KGA
Sbjct: 78 NKSHKLGVNQFADLTEEEFKAIN-KLKGYMWSKISRT-STFKYEHVTKVPATLDWRQKGA 135
Query: 146 VTHVKNQG-SCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCD-NTYNNGCNGGLMD 203
VT +K+QG CGSCWAF+ VAA EGI ++ TG L SLSEQELIDCD N N GC G++
Sbjct: 136 VTPIKSQGLKCGSCWAFAAVAATEGITKLTTGELISLSEQELIDCDTNGDNGGCKWGIIQ 195
Query: 204 YAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQ 263
AF++IV GL E YPY +GTC V +I GY DVP N+E +LL A+ANQ
Sbjct: 196 EAFKFIVQNKGLATEASYPYQAVDGTCNAKVESKHVASIKGYEDVPANNETALLNAVANQ 255
Query: 264 PLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYG-STRGLDYIIVKNSWGPKWGE 322
P+SV +++S DF+FYS GV G CGT DH V VGYG S G Y ++KNSWG WGE
Sbjct: 256 PVSVLVDSSDYDFRFYSSGVLSGSCGTTFDHAVTVVGYGVSDDGTKYWLIKNSWGVYWGE 315
Query: 323 KGYIRMKRNTGKPEGLCGINKMASYPI 349
+GYIR+KR+ EG+CGI ASYPI
Sbjct: 316 QGYIRIKRDVAAKEGMCGIAMQASYPI 342
>gi|18403438|ref|NP_565780.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|2342728|gb|AAB67626.1| cysteine proteinase [Arabidopsis thaliana]
gi|330253821|gb|AEC08915.1| cysteine proteinase-like protein [Arabidopsis thaliana]
Length = 345
Score = 292 bits (748), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 155/357 (43%), Positives = 214/357 (59%), Gaps = 21/357 (5%)
Query: 1 MALSSQFKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYE 60
MA T+LI F I + +R + ++D E WM++F + Y
Sbjct: 1 MASIMVLVTVLIILFTGFRISQATSRTV---------IFREQSMVDKHEQWMARFSREYR 51
Query: 61 SLDEKLERFEIFKDNLRHIDETNRK-IKNYWLGLNEFADLRHEEFKEMFLGLK------P 113
EK R ++FK NL+ I+ N+K K+Y LG+NEFAD +EEF + GLK P
Sbjct: 52 DELEKNMRRDVFKKNLKFIENFNKKGNKSYKLGVNEFADWTNEEFLAIHTGLKGLTEVSP 111
Query: 114 DLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQI 173
K S + ++ D+V +S DWR +GAVT VK QG CG CWAFS VAAVEG+ +I
Sbjct: 112 SKVVAKTISSQTWNVSDMV--VESKDWRAEGAVTPVKYQGQCGCCWAFSAVAAVEGVAKI 169
Query: 174 VTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMT 233
GNL SLSEQ+L+DCD Y+ GC+GG+M AF Y+V G+ E DY Y +G C
Sbjct: 170 AGGNLVSLSEQQLLDCDREYDRGCDGGIMSDAFNYVVQNRGIASENDYSYQGSDGGCR-- 227
Query: 234 KGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLD 293
I+G+ VP N+E +LL+A++ QP+SV+++A+G F YSGGVYDG CGT +
Sbjct: 228 SNARPAARISGFQTVPSNNERALLEAVSRQPVSVSMDATGDGFMHYSGGVYDGPCGTSSN 287
Query: 294 HGVAAVGYGSTR-GLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPI 349
H V VGYG+++ G Y + KNSWG WGEKGYIR++R+ P+G+CG+ + A YP+
Sbjct: 288 HAVTFVGYGTSQDGTKYWLAKNSWGETWGEKGYIRIRRDVAWPQGMCGVAQYAFYPV 344
>gi|440793751|gb|ELR14926.1| Cysteine proteinase 5, putative [Acanthamoeba castellanii str.
Neff]
Length = 326
Score = 292 bits (748), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 155/345 (44%), Positives = 210/345 (60%), Gaps = 29/345 (8%)
Query: 9 TILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLER 68
T L++ C++ F+ S+FA S+D L +F WM + +K Y + +E + R
Sbjct: 4 TTLLALCVALFVASTFA-------------VSHDPLTGVFADWMQEHQKSYAN-EEFVYR 49
Query: 69 FEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSY 128
+ ++++N +I+ N + K++ L +N+F DL + EF ++F GL + DQ+ ++
Sbjct: 50 WNVWRENYLYIEAHNHQNKSFHLAMNKFGDLTNAEFNKLFKGL----SITADQAKQESDI 105
Query: 129 KDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELID 188
LP DWR+KGAVTHVKNQG CGSCW+FST + EG N + G L SLSEQ L+D
Sbjct: 106 APAPGLPADFDWRQKGAVTHVKNQGQCGSCWSFSTTGSTEGANFLKHGRLTSLSEQNLVD 165
Query: 189 CDNTY-NNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGES--EVVTINGY 245
C +Y N+GCNGGLMDYAF+YI+ G+ EE YPY +GTC K S E+V+ Y
Sbjct: 166 CSTSYGNHGCNGGLMDYAFEYIIRNKGIDTEESYPYHASQGTCRYNKQHSGGELVS---Y 222
Query: 246 HDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYD--GHCGTQLDHGVAAVGYGS 303
+VP +E +LL A+A QP SVAI+AS FQFY GGVYD ++LDHGV AVG+G
Sbjct: 223 TNVPSGNEGALLNAVATQPTSVAIDASHSSFQFYKGGVYDEPACSSSRLDHGVLAVGWGV 282
Query: 304 TRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
G DY +VKNSWG WG GYI M RN CGI AS+P
Sbjct: 283 RDGKDYWLVKNSWGADWGLSGYIEMSRN---KHNQCGIATAASHP 324
>gi|413919735|gb|AFW59667.1| hypothetical protein ZEAMMB73_680472 [Zea mays]
Length = 344
Score = 292 bits (748), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 138/269 (51%), Positives = 191/269 (71%), Gaps = 8/269 (2%)
Query: 28 FSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRK-- 85
SIV Y S ++ ++ WM+ + Y ++ E+ RFE+F+DNLR++D N
Sbjct: 29 MSIVSYGER---SEEEARRMYAEWMAAHGRTYNAVGEEERRFEVFRDNLRYVDAHNAAAD 85
Query: 86 --IKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKK 143
+ ++ LGLN FADL ++E++ +LG++ +R+ + + + D DLP+SVDWR K
Sbjct: 86 AGVHSFRLGLNRFADLTNDEYRATYLGVRS-RPQRERRLGDRYLAGDNEDLPESVDWRAK 144
Query: 144 GAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMD 203
GAV VK+QGSCGSCWAFST+AAVEGINQIVTG++ SLSEQEL+DCD +YN GCNGGLMD
Sbjct: 145 GAVAEVKDQGSCGSCWAFSTIAAVEGINQIVTGDMISLSEQELVDCDTSYNQGCNGGLMD 204
Query: 204 YAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQ 263
YAF++I++ GG+ EEDYPY +G C++ + ++VVTI+ Y DVP NSE SL KA+ANQ
Sbjct: 205 YAFEFIINNGGIDTEEDYPYKGTDGRCDVNRKNAKVVTIDSYEDVPANSEKSLQKAVANQ 264
Query: 264 PLSVAIEASGRDFQFYSGGVYDGHCGTQL 292
P+SVAIEA GR FQ Y+ G++ G CG +
Sbjct: 265 PISVAIEAGGRAFQLYNSGIFTGTCGNSV 293
>gi|297830594|ref|XP_002883179.1| hypothetical protein ARALYDRAFT_318695 [Arabidopsis lyrata subsp.
lyrata]
gi|297329019|gb|EFH59438.1| hypothetical protein ARALYDRAFT_318695 [Arabidopsis lyrata subsp.
lyrata]
Length = 308
Score = 292 bits (747), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 152/308 (49%), Positives = 200/308 (64%), Gaps = 17/308 (5%)
Query: 47 LFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNR-KIKNYWLGLNEFADLRHEEFK 105
++E W+ + K Y L EK R +IFK+NL+ IDE N + + +GL FADL ++E K
Sbjct: 1 MYERWLVENRKNYNGLGEKERRCKIFKENLKFIDEHNSLPNQTFEVGLTRFADLTNDEPK 60
Query: 106 EMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVA 165
+ +K D + YK+ LP +DWR KGAV VK+QG+CGSCWAFS V
Sbjct: 61 DF---MKADR----------YLYKEGDILPDEIDWRAKGAVVPVKDQGNCGSCWAFSAVG 107
Query: 166 AVEGINQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIVSTGGLHKEEDYPYI 224
AVEGINQI TG L SLS+QELIDCD + N GC GG+M+YAF++I++ GG+ ++DYPY
Sbjct: 108 AVEGINQIKTGELISLSDQELIDCDRGFVNAGCEGGVMNYAFEFIINNGGIESDQDYPYT 167
Query: 225 MEE-GTCEM-TKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGG 282
+ G C K + VV I+GY V QN E SL KA+A+QP+ VAIEAS + F+ Y G
Sbjct: 168 ATDLGVCNADKKNNTRVVKIDGYEYVAQNDEKSLKKAVAHQPVGVAIEASSQAFKLYKSG 227
Query: 283 VYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGIN 342
V+ G CG LDHGV VGYG++ G DY I++NSWG WGE GY++++RN G CG+
Sbjct: 228 VFTGTCGIYLDHGVVVVGYGTSSGEDYWIIRNSWGLNWGENGYVKLQRNIDDSFGKCGVA 287
Query: 343 KMASYPIK 350
M SYP K
Sbjct: 288 MMPSYPTK 295
>gi|242072390|ref|XP_002446131.1| hypothetical protein SORBIDRAFT_06g002140 [Sorghum bicolor]
gi|241937314|gb|EES10459.1| hypothetical protein SORBIDRAFT_06g002140 [Sorghum bicolor]
Length = 328
Score = 292 bits (747), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 147/314 (46%), Positives = 204/314 (64%), Gaps = 16/314 (5%)
Query: 40 SNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKN-YWLGLNEFAD 98
S+ +++ E+WM ++ +VY+ EK RF++FKDN+ ++ N N +WLG+N+FAD
Sbjct: 28 SDAAMVERHENWMVEYGRVYKDAAEKARRFQVFKDNVAFVESFNTNKNNKFWLGVNQFAD 87
Query: 99 LRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSC 158
L EEFK G KP A + + + V LP +VDWR KGAVT +KNQG C
Sbjct: 88 LTTEEFKAN-KGFKP-TAEKVPTTGFKYENLSVSALPTAVDWRTKGAVTPIKNQGQC--- 142
Query: 159 WAFSTVAAVEGINQIVTGNLASLSEQELIDCD-NTYNNGCNGGLMDYAFQYIVSTGGLHK 217
AA+EGI ++ TGNL SLSEQEL+DCD ++ + GC GG MD AF++++ GGL
Sbjct: 143 ------AAMEGIVKLSTGNLISLSEQELVDCDTHSMDEGCEGGWMDSAFEFVIKNGGLAT 196
Query: 218 EEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQ 277
E +YPY +G C+ G TI G+ DVP N+E +L+KA+ANQP+SVA++AS R F
Sbjct: 197 ESNYPYKAVDGKCK--GGSKSAATIKGHEDVPVNNEAALMKAVANQPVSVAVDASDRTFM 254
Query: 278 FYSGGVYDGHCGTQLDHGVAAVGYG-STRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPE 336
YSGGV G CGT+LDHG+AA+GYG + G Y I+KNSWG WGEKG++RM+++
Sbjct: 255 LYSGGVMTGSCGTELDHGIAAIGYGMESDGTKYWILKNSWGTTWGEKGFLRMEKDITDKR 314
Query: 337 GLCGINKMASYPIK 350
G+CG+ SYP +
Sbjct: 315 GMCGLAMKPSYPTE 328
>gi|297826061|ref|XP_002880913.1| hypothetical protein ARALYDRAFT_481640 [Arabidopsis lyrata subsp.
lyrata]
gi|297326752|gb|EFH57172.1| hypothetical protein ARALYDRAFT_481640 [Arabidopsis lyrata subsp.
lyrata]
Length = 347
Score = 291 bits (745), Expect = 3e-76, Method: Compositional matrix adjust.
Identities = 149/315 (47%), Positives = 200/315 (63%), Gaps = 10/315 (3%)
Query: 45 IDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETN-RKIKNYWLGLNEFADLRHEE 103
I+ E WM++F +VY EK RF IFK NL + N K Y L +NEF+DL EE
Sbjct: 32 IEKHEQWMARFNRVYSDESEKRNRFNIFKKNLEFVQSFNMNKNITYKLDVNEFSDLTDEE 91
Query: 104 FKEMFLGLK-PD----LARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSC 158
F+ GL P+ ++ F Y +V D +S+DWR++GAVT VK QG CG C
Sbjct: 92 FRATHTGLVVPEEITGISTLSSDKTVPFRYGNVSDTGESMDWRQEGAVTPVKYQGRCGGC 151
Query: 159 WAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKE 218
WAFS VAAVEGI +I G L SLSEQ+L+DCD YN GC+GG+M AF+YI+ G+ E
Sbjct: 152 WAFSAVAAVEGITKITKGELVSLSEQQLLDCDTDYNQGCHGGIMSKAFEYIIKNQGITTE 211
Query: 219 EDYPYIMEEGTCEMTKGES---EVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRD 275
++YPY + TC + S TI+GY VP N+E++LL+A++ QP+SV IE +G
Sbjct: 212 DNYPYQESQQTCSSSTTLSSSFRAATISGYETVPMNNEEALLQAVSQQPVSVGIEGTGAG 271
Query: 276 FQFYSGGVYDGHCGTQLDHGVAAVGYG-STRGLDYIIVKNSWGPKWGEKGYIRMKRNTGK 334
F+ YSGG+++G CGT L H V VGYG S G Y +VKNSWG WGE G++R+KR+
Sbjct: 272 FRHYSGGIFNGECGTDLHHAVTIVGYGMSEEGTKYWVVKNSWGETWGEDGFMRIKRDVDA 331
Query: 335 PEGLCGINKMASYPI 349
P+G+CG+ +A YP+
Sbjct: 332 PQGMCGLAMLAFYPL 346
>gi|313118764|gb|ADR32294.1| C14 cysteine protease [Solanum stoloniferum]
Length = 217
Score = 291 bits (745), Expect = 3e-76, Method: Compositional matrix adjust.
Identities = 130/216 (60%), Positives = 164/216 (75%)
Query: 135 PKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYN 194
P SVDWR KG + VK+QGSCGSCWAFS VAA+E IN IVTGNL SLSEQEL+DCD +YN
Sbjct: 2 PVSVDWRDKGVLVGVKDQGSCGSCWAFSAVAAMESINAIVTGNLISLSEQELVDCDKSYN 61
Query: 195 NGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSED 254
GC+GGLMDYAF+++++ GG+ EEDYPY G C+ + ++VV I+ Y DVP N+E
Sbjct: 62 QGCDGGLMDYAFEFVINNGGIDSEEDYPYKERNGVCDQYRKNAKVVVIDSYEDVPVNNEK 121
Query: 255 SLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKN 314
+L KA+A+QP+S+A+EA GRDFQ Y G++ G CGT +DHGV A GYG+ GLDY IV+N
Sbjct: 122 ALQKAVAHQPVSIALEAGGRDFQHYKSGIFTGKCGTAVDHGVVAAGYGTENGLDYWIVRN 181
Query: 315 SWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
SWG WGEKGY+R++RN GLCG+ SYP+K
Sbjct: 182 SWGADWGEKGYLRVQRNVASSSGLCGLAIEPSYPVK 217
>gi|313118762|gb|ADR32293.1| C14 cysteine protease [Solanum stoloniferum]
Length = 217
Score = 291 bits (745), Expect = 3e-76, Method: Compositional matrix adjust.
Identities = 129/216 (59%), Positives = 165/216 (76%)
Query: 135 PKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYN 194
P SVDWR KG + VK+QGSCGSCWAFS VAA+E IN IVTGNL SLSEQEL+DCD +YN
Sbjct: 2 PVSVDWRDKGVLVGVKDQGSCGSCWAFSAVAAMESINAIVTGNLISLSEQELVDCDKSYN 61
Query: 195 NGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSED 254
GC+GGLMDYAF+++++ GG+ EEDYPY C+ + ++VV I+ Y DVP N+E
Sbjct: 62 EGCDGGLMDYAFEFVINNGGIDSEEDYPYKERNDVCDQYRKNAKVVKIDSYEDVPVNNEK 121
Query: 255 SLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKN 314
+L KA+A+QP+S+A+EA GRDFQ Y G++ G CGT +DHGV A GYG+ G+DY IV+N
Sbjct: 122 ALQKAVAHQPVSIALEAGGRDFQHYKSGIFTGKCGTAVDHGVVAAGYGTENGMDYWIVRN 181
Query: 315 SWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
SWG KWGEKGY+R++RN + GLCG+ SYP+K
Sbjct: 182 SWGAKWGEKGYLRVQRNIARSSGLCGLATEPSYPVK 217
>gi|356517310|ref|XP_003527331.1| PREDICTED: vignain-like [Glycine max]
Length = 342
Score = 291 bits (745), Expect = 3e-76, Method: Compositional matrix adjust.
Identities = 145/306 (47%), Positives = 200/306 (65%), Gaps = 7/306 (2%)
Query: 49 ESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRK-IKNYWLGLNEFADLRHEEFKEM 107
E WM+++ +VY+ EK +RF++FK+N+ I+ N K + L +N+FADL EEFK +
Sbjct: 38 EKWMAQYGRVYKDAAEKEKRFQVFKNNVHFIESFNAAGDKPFNLSINQFADLNDEEFKAL 97
Query: 108 FLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAV 167
+ ++ + + + F Y+ V +P ++D RK+GAVT +K+QG CGSCWAFS VAA
Sbjct: 98 LINVQKKASWVETSTETSFRYESVTKIPATIDRRKRGAVTPIKDQGRCGSCWAFSAVAAT 157
Query: 168 EGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEE 227
EGI+QI TG L LSEQEL+DC + GC GG +D AF++I GG+ E YPY
Sbjct: 158 EGIHQITTGKLVPLSEQELVDCVKGESEGCIGGYVDDAFEFIAKKGGIASETHYPYKGVN 217
Query: 228 GTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDG- 286
TC++ K V I GY VP N+E +LLKA+ANQP+SV I+A F++YS G+++
Sbjct: 218 KTCKVKKETHGVAEIKGYEKVPSNNEKALLKAVANQPVSVYIDAGTHAFKYYSSGIFNAR 277
Query: 287 HCGTQLDHGVAAVGYGSTRGLD---YIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINK 343
+CGT +H VA VGYG + LD Y +VKNSWG +WGE+GYIR+KR+ EGLCGI K
Sbjct: 278 NCGTDPNHAVAVVGYG--KALDDSKYWLVKNSWGTEWGERGYIRIKRDIRAKEGLCGIAK 335
Query: 344 MASYPI 349
YPI
Sbjct: 336 YPYYPI 341
>gi|116794072|gb|ABK26996.1| unknown [Picea sitchensis]
Length = 367
Score = 291 bits (744), Expect = 4e-76, Method: Compositional matrix adjust.
Identities = 160/364 (43%), Positives = 217/364 (59%), Gaps = 21/364 (5%)
Query: 3 LSSQFKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESL 62
+++ +LIS I + ++ A S Y D+ S + L+ LF+ W+ + K+Y S
Sbjct: 1 MANPLHLLLISATIICLVSAAKAVQHS---YEVGDINSGNGLVRLFDRWLGRHGKLYGSH 57
Query: 63 DEKLERFEIFKDNLRHIDETNRKIKN-YWLGLNEFADLRHEEFKEMFLGLKPDLAR-RKD 120
+EK R +IF+ NL++I N+ + + LGLN+FADL +EEFK + G R R+
Sbjct: 58 EEKARRLQIFRTNLQYIHAHNKNSNSSFRLGLNKFADLTNEEFKTRYFGKNSKQWRDRRR 117
Query: 121 QSHEDFSYKDVVD-----------LPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEG 169
E + V+ + S+DWRKKGAVT VK+Q CGSCWAFST A+EG
Sbjct: 118 TELEGAELRPVLKQTVGSQSSSCSIASSLDWRKKGAVTGVKDQAQCGSCWAFSTTGAIEG 177
Query: 170 INQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGT 229
+N I TG L SLSEQEL+ CD T N GC GG MDYAF +++ GG+ E+DY Y + T
Sbjct: 178 VNFISTGKLVSLSEQELVACDAT-NYGCEGGDMDYAFTWVIQNGGIDTEKDYSYTGVDST 236
Query: 230 CEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCG 289
C K ++V+I+GY DV + + +LL A +QP+SV I+ S DFQ Y+GG+YDG C
Sbjct: 237 CNTNKEAKKIVSIDGYTDVSPD-DSALLCAAGSQPVSVGIDGSAIDFQLYTGGIYDGDCS 295
Query: 290 TQ---LDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMAS 346
+DH V VGY + G DY IVKNSWG WG +GY + RNT P G+C IN MAS
Sbjct: 296 GNPDDIDHAVLVVGYSAKNGKDYWIVKNSWGTDWGLEGYFYILRNTELPYGVCAINAMAS 355
Query: 347 YPIK 350
YP K
Sbjct: 356 YPTK 359
>gi|194719810|emb|CAR31335.1| pro-asclepain f [Gomphocarpus fruticosus subsp. fruticosus]
Length = 340
Score = 291 bits (744), Expect = 5e-76, Method: Compositional matrix adjust.
Identities = 159/343 (46%), Positives = 218/343 (63%), Gaps = 15/343 (4%)
Query: 17 SFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNL 76
SF + SF S + + S+D++I L+E W+ K +K+Y SL EK++RFEIFKDNL
Sbjct: 3 SFVLILSFLLFVSAITCISTNWRSDDEVIALYEEWLVKHQKLYSSLGEKIKRFEIFKDNL 62
Query: 77 RHIDETNRKIK----NYWLGLNEFADLRHEEFKEMFLGLKPDLAR------RKDQSHEDF 126
R+ID+ N K N+ LGLN+FADL +EF ++LG D + D ED
Sbjct: 63 RYIDQQNHYNKVNHMNFTLGLNQFADLTLDEFSSIYLGTSVDYEQIISSNPNHDDVEEDI 122
Query: 127 SYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQEL 186
+DVV+LP SVDWR+KG V ++NQG CGSCW FS VA++E +N I G++ +LSEQEL
Sbjct: 123 LKEDVVELPDSVDWREKGVVFPIRNQGKCGSCWTFSAVASIETLNGIKKGHMIALSEQEL 182
Query: 187 IDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYH 246
+DC+ T + GC GG + AF Y V+ G+ EE YPYI +G C + +VV I+GY
Sbjct: 183 LDCE-TISQGCKGGHYNNAFAY-VAKNGITSEEKYPYIFRQGQCYQ---KEKVVKISGYK 237
Query: 247 DVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRG 306
VP+N+ L A+A Q +SVA++ +DFQFY G++ G CG LDH V VGYGS G
Sbjct: 238 RVPRNNGGQLQSAVAQQVVSVAVKCESKDFQFYDRGIFSGACGPILDHAVNIVGYGSKGG 297
Query: 307 LDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPI 349
+Y I++NSWG WGE GY+R+++N+ EG CGI SYP+
Sbjct: 298 ANYWIMRNSWGTNWGENGYMRIQKNSKHYEGHCGIAMQPSYPV 340
>gi|313118766|gb|ADR32295.1| C14 cysteine protease [Solanum demissum]
gi|313118774|gb|ADR32299.1| C14 cysteine protease [Solanum verrucosum]
gi|313118776|gb|ADR32300.1| C14 cysteine protease [Solanum verrucosum]
gi|313118778|gb|ADR32301.1| C14 cysteine protease [Solanum verrucosum]
Length = 217
Score = 290 bits (742), Expect = 6e-76, Method: Compositional matrix adjust.
Identities = 129/216 (59%), Positives = 164/216 (75%)
Query: 135 PKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYN 194
P SVDWR KG + VK+QGSCGSCWAFS VAA+E IN IVTGNL SLSEQEL+DCD +YN
Sbjct: 2 PVSVDWRDKGVLVGVKDQGSCGSCWAFSAVAAMESINAIVTGNLISLSEQELVDCDKSYN 61
Query: 195 NGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSED 254
GC+GGLMDYAF+++++ GG+ EEDYPY C+ + ++VV I+ Y DVP N+E
Sbjct: 62 EGCDGGLMDYAFEFVINNGGIDSEEDYPYKERNDVCDQYRKNAKVVKIDSYEDVPVNNEK 121
Query: 255 SLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKN 314
+L KA+A+QP+S+A+EA GRDFQ Y G++ G CGT +DHGV A GYG+ G+DY IV+N
Sbjct: 122 ALQKAVAHQPVSIALEAGGRDFQHYKSGIFTGKCGTAVDHGVVAAGYGTENGMDYWIVRN 181
Query: 315 SWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
SWG KWGEKGY+R++RN GLCG+ SYP+K
Sbjct: 182 SWGAKWGEKGYLRVQRNIASSSGLCGLATEPSYPVK 217
>gi|400180443|gb|AFP73358.1| cysteine protease, partial [Solanum habrochaites]
Length = 345
Score = 290 bits (742), Expect = 6e-76, Method: Compositional matrix adjust.
Identities = 161/357 (45%), Positives = 224/357 (62%), Gaps = 25/357 (7%)
Query: 1 MALSSQFKTILIS--FCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKV 58
MA+ +ILI+ F IS F + AR S L+ +++ E WMS+ +V
Sbjct: 1 MAMKVDLMSILITLFFVISMFNSQTRAR-------SQPKLSVSER----HELWMSRHGRV 49
Query: 59 YESLDEKLERFEIFKDNLRHIDETNRKIK-NYWLGLNEFADLRHEEFKEMFLGLK-PD-- 114
Y+ EK ERF IFK+N++ I+ N+ +Y LG+NEFAD+ EEF F GL P+
Sbjct: 50 YKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSEEFLAKFTGLNIPNSY 109
Query: 115 LARRKDQSHEDFSYKDVVD--LPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQ 172
L+ S E F D+ D +P ++DWR+ GAVT VKNQG CG CWAFS V ++EG +
Sbjct: 110 LSPSPMPSTE-FKINDLSDDDMPSNLDWRESGAVTQVKNQGQCGCCWAFSAVGSLEGAYK 168
Query: 173 IVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEM 232
I TGNL SEQEL+DC T N GCNGG M AF +I+ GG+ +E DY Y+ ++ TC
Sbjct: 169 IATGNLMEFSEQELLDC-TTNNYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQYTCR- 226
Query: 233 TKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQL 292
++G++ V I+ Y VP+ E SLL+A+ QP+S+ I AS D QFY+GG YDG C ++
Sbjct: 227 SQGKTAAVQISNYQVVPE-GETSLLQAVTKQPVSIGIAAS-HDLQFYAGGTYDGSCANRI 284
Query: 293 DHGVAAVGYGS-TRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
+H V A+GYG+ +G Y ++KNSWG WGE G++++ R++G P GLC I KM+SYP
Sbjct: 285 NHAVTAIGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPAGLCDIAKMSSYP 341
>gi|195628596|gb|ACG36128.1| vignain precursor [Zea mays]
Length = 362
Score = 290 bits (741), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 152/349 (43%), Positives = 213/349 (61%), Gaps = 21/349 (6%)
Query: 10 ILISFCISFFIRSSFARDFSIVGYSPEDLTSNDK--LIDLFESWMSKFEKVYESLDEKLE 67
I I+ CI ++ + VG + T D+ ++ ++ WM+++ + Y+ EK
Sbjct: 23 IAIADCICQAAVAARVEPSTTVGRT----TGGDEAMMMARYKKWMAQYRRKYKDDAEKAH 78
Query: 68 RFEIFKDNLRHIDETNRK-IKNYWLGLNEFADLRHEEFKEMFLGLKPDLA--RRKDQSHE 124
RF++FK N ID +N K Y LG N+FADL +EF M+ GL+ A Q
Sbjct: 79 RFQVFKANAEFIDRSNAGGKKKYVLGTNQFADLTSKEFAAMYTGLRKPAAVPSGAKQIPA 138
Query: 125 DFSYKDVVDLPK--SVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLS 182
F Y++ L VDWR++GAVT VKNQG CG CWAFS V A+EG+ I TGNL SLS
Sbjct: 139 GFKYQNFTRLDDDVQVDWRQQGAVTPVKNQGQCGCCWAFSAVGAMEGLIMITTGNLVSLS 198
Query: 183 EQELIDCDNT-YNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVT 241
EQ+++DCD + N GCNGG MD AFQY+V+ GG+ E+ YPY +GTC+ + T
Sbjct: 199 EQQILDCDESDGNQGCNGGYMDNAFQYVVNNGGVTTEDAYPYSAVQGTCQNVQ---PAAT 255
Query: 242 INGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGH-CGTQLDHGVAAVG 300
I+G+ D+P E++L A+ANQP+SV ++ FQFY GG+YDG CGT ++H V A+G
Sbjct: 256 ISGFQDLPSGDENALANAVANQPVSVGVDGGSSPFQFYQGGIYDGDGCGTDMNHAVTAIG 315
Query: 301 YGS-TRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
YG+ +G Y I+KNSWG WGE G+++++ G CGI+ MASYP
Sbjct: 316 YGADDQGTQYWILKNSWGTGWGENGFMQLQMGVGA----CGISTMASYP 360
>gi|296082368|emb|CBI21373.3| unnamed protein product [Vitis vinifera]
Length = 245
Score = 290 bits (741), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 138/218 (63%), Positives = 163/218 (74%), Gaps = 1/218 (0%)
Query: 134 LPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTY 193
LP+SVDWR+ GAV VK+Q SCGSCWAFSTVAAVEGINQIVTG L SLSEQEL+DCD Y
Sbjct: 6 LPESVDWRETGAVNPVKDQRSCGSCWAFSTVAAVEGINQIVTGELISLSEQELVDCDTEY 65
Query: 194 NNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSE 253
+ GCNGGLMDYAF +I+ GGL E+DYPY +G C ++ S+VV+I+GY DVP E
Sbjct: 66 DMGCNGGLMDYAFDFIIKNGGLDTEKDYPYTGFDGECNLSGKSSKVVSIDGYEDVPPFDE 125
Query: 254 DSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVK 313
+L KA+A+QP+SVA+EA GR Q Y G++ G CGT LDHG+ AVGYG+ G DY IV+
Sbjct: 126 KALQKAVAHQPVSVAVEAGGRALQLYVSGIFTGECGTALDHGIVAVGYGTENGTDYWIVR 185
Query: 314 NSWGPKWGEKGYIRMKRNTGKP-EGLCGINKMASYPIK 350
NSWG WGE GYIRM+RN G CGI ASYPIK
Sbjct: 186 NSWGSSWGENGYIRMERNMADAFSGKCGIAMEASYPIK 223
>gi|162463334|ref|NP_001104878.1| maize insect resistance2 precursor [Zea mays]
gi|2425064|gb|AAB88262.1| cysteine proteinase Mir2 [Zea mays]
Length = 493
Score = 289 bits (740), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 149/298 (50%), Positives = 193/298 (64%), Gaps = 22/298 (7%)
Query: 68 RFEIFKDNLRHID----ETNRKIKNYWLGLNEFADLRHEEFK-EMFLGLKPD-------L 115
R E+F+DNLR+ID E + + + LGL FADL EE++ + LG + +
Sbjct: 92 RLEVFRDNLRYIDAHNAEADAGLHGFRLGLTRFADLTLEEYRARLLLGSRGRNGTAVGVV 151
Query: 116 ARRKDQSHEDFSYKDVV--DLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQI 173
RR+ Y + LP +VDWR++GAV VK+QG CG CWAFS VAAVEGIN+I
Sbjct: 152 GRRR--------YLPLAGEQLPDAVDWRERGAVAEVKDQGQCGGCWAFSAVAAVEGINKI 203
Query: 174 VTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMT 233
VTG+L SLSEQELIDCD + GC+GGLMD AF +++ GG+ E DYP+ +GTC++
Sbjct: 204 VTGSLISLSEQELIDCDKFQDQGCDGGLMDNAFVFMIKNGGIDTEADYPFTGHDGTCDLK 263
Query: 234 KGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLD 293
+ VV+I+ + VP N E +L KA+A+QP+S +IEAS R FQ YS G++DG CGT LD
Sbjct: 264 LKNTRVVSIDSFERVPINYERALQKAVAHQPVSASIEASRRAFQLYSSGIFDGRCGTYLD 323
Query: 294 HGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIKK 351
HGV VGYGS G DY IVKNSWG +WGE GY+RM RN GI YP+K+
Sbjct: 324 HGVTVVGYGSEGGKDYWIVKNSWGTQWGEAGYVRMARNVRVRPPSAGIAMEPLYPVKE 381
>gi|217072410|gb|ACJ84565.1| unknown [Medicago truncatula]
Length = 328
Score = 289 bits (740), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 137/218 (62%), Positives = 167/218 (76%), Gaps = 1/218 (0%)
Query: 134 LPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTY 193
LP+SVDWRK+GAV VK+Q SCGSCWAFS +AAVEGIN+IVTG+L SLSEQEL+DCD +Y
Sbjct: 24 LPESVDWRKEGAVVGVKDQASCGSCWAFSAIAAVEGINKIVTGDLISLSEQELVDCDTSY 83
Query: 194 NNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSE 253
N GCNGGLMDYAF++I+S GG+ E+DYPY +G C+ + ++VVTI+ Y DVP E
Sbjct: 84 NEGCNGGLMDYAFEFIISNGGIDSEDDYPYKAVDGRCDQNRKNAKVVTIDDYEDVPAYDE 143
Query: 254 DSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVK 313
+L KA+ANQP++VA+E GR+FQ Y GV G CGT LDHGVAAVGYG+ G DY IV+
Sbjct: 144 LALQKAVANQPIAVAVEGGGREFQLYEYGVLTGRCGTALDHGVAAVGYGTENGKDYWIVR 203
Query: 314 NSWGPKWGEKGYIRMKRNTGKPE-GLCGINKMASYPIK 350
NSWG WGE+GYIR++RN G CGI SYPIK
Sbjct: 204 NSWGGSWGEQGYIRLERNLASSRAGKCGIAIEPSYPIK 241
>gi|226508570|ref|NP_001141984.1| uncharacterized protein LOC100274134 precursor [Zea mays]
gi|194706676|gb|ACF87422.1| unknown [Zea mays]
gi|413920745|gb|AFW60677.1| vignain [Zea mays]
Length = 363
Score = 289 bits (739), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 149/350 (42%), Positives = 214/350 (61%), Gaps = 22/350 (6%)
Query: 10 ILISFCISFFIRSSFARDFSIVGYSPEDLTSNDK--LIDLFESWMSKFEKVYESLDEKLE 67
I I+ CI ++ + VG + T D+ ++ ++ WM+++ + Y+ EK
Sbjct: 23 IAIADCICHAAVAARVEPSTTVGRT----TGGDEAMMMARYKKWMAQYRRKYKDDAEKAH 78
Query: 68 RFEIFKDNLRHIDETNRK-IKNYWLGLNEFADLRHEEFKEMFLGLK-----PDLARRKDQ 121
RF++FK N ID +N K Y LG N+FADL +EF M+ GL+ P A++
Sbjct: 79 RFQVFKANAEFIDRSNAGGKKKYVLGTNQFADLTSKEFAAMYTGLRKPAAVPSGAKQIPA 138
Query: 122 SHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASL 181
+ + +D VDWR++GAVT VKNQG CG CWAFS V A+EG+ I TGNL SL
Sbjct: 139 AGSKYQNFTRLDDDVQVDWRQQGAVTPVKNQGQCGCCWAFSAVGAMEGLIMITTGNLVSL 198
Query: 182 SEQELIDCDNT-YNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVV 240
SEQ+++DCD + N GCNGG MD AFQY+++ GG+ E+ YPY +GTC+ +
Sbjct: 199 SEQQILDCDESDGNQGCNGGYMDNAFQYVINNGGVTTEDAYPYSAVQGTCQNVQ---PAA 255
Query: 241 TINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGH-CGTQLDHGVAAV 299
TI+G+ D+P E++L A+ANQP+SV ++ FQFY GG+YDG CGT ++H V A+
Sbjct: 256 TISGFQDLPSGDENALANAVANQPVSVGVDGGSSPFQFYQGGIYDGDGCGTDMNHAVTAI 315
Query: 300 GYGS-TRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
GYG+ +G Y I+KNSWG WGE G+++++ G CGI+ MASYP
Sbjct: 316 GYGADDQGTQYWILKNSWGTGWGENGFMQLQMGVGA----CGISTMASYP 361
>gi|112490572|pdb|2FO5|A Chain A, Crystal Structure Of Recombinant Barley Cysteine
Endoprotease B Isoform 2 (Ep-B2) In Complex With
Leupeptin
gi|112490573|pdb|2FO5|B Chain B, Crystal Structure Of Recombinant Barley Cysteine
Endoprotease B Isoform 2 (Ep-B2) In Complex With
Leupeptin
gi|112490574|pdb|2FO5|C Chain C, Crystal Structure Of Recombinant Barley Cysteine
Endoprotease B Isoform 2 (Ep-B2) In Complex With
Leupeptin
gi|112490575|pdb|2FO5|D Chain D, Crystal Structure Of Recombinant Barley Cysteine
Endoprotease B Isoform 2 (Ep-B2) In Complex With
Leupeptin
Length = 262
Score = 289 bits (739), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 140/224 (62%), Positives = 166/224 (74%), Gaps = 4/224 (1%)
Query: 131 VVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCD 190
V DLP SVDWR+KGAVT VK+QG CGSCWAFSTV +VEGIN I TG+L SLSEQELIDCD
Sbjct: 1 VSDLPPSVDWRQKGAVTGVKDQGKCGSCWAFSTVVSVEGINAIRTGSLVSLSEQELIDCD 60
Query: 191 NTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESE---VVTINGYHD 247
N+GC GGLMD AF+YI + GGL E YPY GTC + + VV I+G+ D
Sbjct: 61 TADNDGCQGGLMDNAFEYIKNNGGLITEAAYPYRAARGTCNVARAAQNSPVVVHIDGHQD 120
Query: 248 VPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTR-G 306
VP NSE+ L +A+ANQP+SVA+EASG+ F FYS GV+ G CGT+LDHGVA VGYG G
Sbjct: 121 VPANSEEDLARAVANQPVSVAVEASGKAFMFYSEGVFTGECGTELDHGVAVVGYGVAEDG 180
Query: 307 LDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
Y VKNSWGP WGE+GYIR+++++G GLCGI ASYP+K
Sbjct: 181 KAYWTVKNSWGPSWGEQGYIRVEKDSGASGGLCGIAMEASYPVK 224
>gi|20334373|gb|AAM19207.1|AF493232_1 cysteine protease [Solanum pimpinellifolium]
gi|400180424|gb|AFP73350.1| cysteine protease [Solanum pimpinellifolium]
gi|400180433|gb|AFP73354.1| cysteine protease [Solanum lycopersicum]
Length = 344
Score = 289 bits (739), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 158/354 (44%), Positives = 222/354 (62%), Gaps = 19/354 (5%)
Query: 1 MALSSQFKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYE 60
MA+ ILI+ FF+ S F + G S L+ +++ E WMS+ +VY+
Sbjct: 1 MAMKVDLMNILITL---FFVISMF--NTQTRGRSQPKLSVSER----HELWMSRHGRVYK 51
Query: 61 SLDEKLERFEIFKDNLRHIDETNRKIK-NYWLGLNEFADLRHEEFKEMFLGLK-PD-LAR 117
EK ERF IFK+N++ I+ N+ +Y LG+NEFAD+ +EF F GL P+
Sbjct: 52 DEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLS 111
Query: 118 RKDQSHEDFSYKDVVD--LPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVT 175
S +F D+ D +P ++DWR+ GAVT VK+QG CG CWAFS V ++EG +I T
Sbjct: 112 PSPMSSTEFKINDLSDDYMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIAT 171
Query: 176 GNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKG 235
GNL SEQEL+DC T N GCNGGLM AF +I+ GG+ +E DY Y+ E+ TC ++
Sbjct: 172 GNLMEFSEQELLDC-TTNNYGCNGGLMTNAFDFIIENGGISRESDYEYLGEQYTCR-SRE 229
Query: 236 ESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHG 295
++ V I+ Y VP+ E SLL+A+ QP+S+ I AS +D QFY+GG YDG+C Q++H
Sbjct: 230 KTAAVQISSYKVVPE-GETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGNCADQINHA 287
Query: 296 VAAVGYGS-TRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
V A+GYG+ G Y ++KNSWG WGE G++++ R++G P GLC I KM+SYP
Sbjct: 288 VTAIGYGTDEEGQKYWLLKNSWGTSWGENGFMKIIRDSGDPSGLCDIAKMSSYP 341
>gi|313118772|gb|ADR32298.1| C14 cysteine protease [Solanum demissum]
Length = 217
Score = 288 bits (738), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 128/216 (59%), Positives = 164/216 (75%)
Query: 135 PKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYN 194
P SVDWR KG + VK+QGSCGSCWAFS VAA+E IN IVTG+L SLSEQEL+DCD +YN
Sbjct: 2 PVSVDWRDKGVLVGVKDQGSCGSCWAFSAVAAMESINAIVTGDLISLSEQELVDCDKSYN 61
Query: 195 NGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSED 254
GC+GGLMDYAF+++++ GG+ EEDYPY C+ + ++VV I+ Y DVP N+E
Sbjct: 62 QGCDGGLMDYAFEFVINNGGIDTEEDYPYKERNDVCDQYRKNAKVVKIDSYEDVPVNNEK 121
Query: 255 SLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKN 314
+L KA+A+QP+S+A+EA GRDFQ Y G++ G CGT +DHGV A GYG+ G+DY IV+N
Sbjct: 122 ALQKAVAHQPVSIALEAGGRDFQHYKSGIFTGKCGTAVDHGVVAAGYGTENGMDYWIVRN 181
Query: 315 SWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
SWG KWGEKGY+R++RN GLCG+ SYP+K
Sbjct: 182 SWGAKWGEKGYLRVQRNIASSSGLCGLATEPSYPVK 217
>gi|313118760|gb|ADR32292.1| C14 cysteine protease [Solanum stoloniferum]
Length = 217
Score = 288 bits (738), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 128/216 (59%), Positives = 163/216 (75%)
Query: 135 PKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYN 194
P SVDWR KG + VK+QGSCGSCWAFS VAA+E IN IVTGNL SLSEQEL+DCD +YN
Sbjct: 2 PVSVDWRDKGVLVGVKDQGSCGSCWAFSAVAAMESINAIVTGNLISLSEQELVDCDKSYN 61
Query: 195 NGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSED 254
GC+GGLMDYAF+++++ GG+ EEDYPY C+ + ++VV I+ Y DVP N+E
Sbjct: 62 EGCDGGLMDYAFEFVINNGGIDSEEDYPYKERNDVCDQYRKNAKVVKIDSYEDVPVNNEK 121
Query: 255 SLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKN 314
+L KA+A+QP+S+A+EA GRDFQ Y G++ G CGT +DHGV A GYG+ G+DY IV+N
Sbjct: 122 ALQKAVAHQPVSIALEAGGRDFQHYKSGIFTGKCGTAVDHGVVAAGYGTENGMDYWIVRN 181
Query: 315 SWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
SWG WGEKGY+R++RN GLCG+ SYP+K
Sbjct: 182 SWGANWGEKGYLRVQRNIASSSGLCGLATEPSYPVK 217
>gi|242092700|ref|XP_002436840.1| hypothetical protein SORBIDRAFT_10g009830 [Sorghum bicolor]
gi|241915063|gb|EER88207.1| hypothetical protein SORBIDRAFT_10g009830 [Sorghum bicolor]
Length = 328
Score = 288 bits (738), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 152/349 (43%), Positives = 210/349 (60%), Gaps = 34/349 (9%)
Query: 9 TILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLER 68
+IL ++FF ++ A DL + ++ E WM ++ +VY+ EK R
Sbjct: 7 SILAILGLAFFCGAALA---------ARDLNDDSAMVARHEQWMVQYSRVYKDTTEKARR 57
Query: 69 FEIFKDNLRHIDETNRK-IKNYWLGLNEFADLRHEEFKEMFL--GLKPDLARRKDQSHED 125
FE+FK N++ I+ N + +WLG+N+FADL ++EF+ G KP +
Sbjct: 58 FEVFKANVKFIESFNAGGNRKFWLGVNQFADLTNDEFRATKTNKGFKPSPVKVP----TG 113
Query: 126 FSYKDV-VD-LPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSE 183
F Y++V VD LP ++DWR KGAVT +K+QG C EGI +I TG L SLSE
Sbjct: 114 FRYENVSVDALPATIDWRTKGAVTPIKDQGQC------------EGIVKISTGKLISLSE 161
Query: 184 QELIDCD-NTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTI 242
QEL+DCD + + GC GGLMD AFQ+I+ GGL E YPY +G C+ G + T+
Sbjct: 162 QELVDCDVHGEDQGCEGGLMDDAFQFIIKNGGLTTESSYPYTAADGKCK--SGSNSAATV 219
Query: 243 NGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYG 302
G+ DVP N E +L+KA+ANQP+SVA++ FQFYSGGV G CGT LDHG+AA+GYG
Sbjct: 220 KGFEDVPANDEAALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYG 279
Query: 303 STR-GLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
T G Y ++KNSWG WGE GY+RM+++ G+CG+ SYPI+
Sbjct: 280 QTSDGTKYWLLKNSWGTTWGENGYLRMEKDISDKRGMCGLAMEPSYPIE 328
>gi|400180355|gb|AFP73316.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 287 bits (735), Expect = 5e-75, Method: Compositional matrix adjust.
Identities = 156/354 (44%), Positives = 222/354 (62%), Gaps = 19/354 (5%)
Query: 1 MALSSQFKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYE 60
MA+ +ILI+ FF+ S F + G S L+ +++ E WMS+ +VY+
Sbjct: 1 MAMKVDLMSILITL---FFVISMF--NTQTRGRSQPKLSVSER----HELWMSRHGRVYK 51
Query: 61 SLDEKLERFEIFKDNLRHIDETNRKIK-NYWLGLNEFADLRHEEFKEMFLGLK-PD-LAR 117
EK ERF IFK+N++ I+ N+ +Y LG+NEFAD+ +EF F GL P+
Sbjct: 52 DEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLS 111
Query: 118 RKDQSHEDFSYKDVVD--LPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVT 175
S +F D+ D +P ++DWR+ GAVT VK+QG CG CWAFS V ++EG +I T
Sbjct: 112 PSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIAT 171
Query: 176 GNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKG 235
GNL SEQEL+DC T N GCNGG M AF +I+ GG+ +E DY Y+ E+ TC ++
Sbjct: 172 GNLMEFSEQELLDC-TTNNYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCR-SQE 229
Query: 236 ESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHG 295
++ V I+ Y VP+ E SLL+A+ QP+S+ I AS +D QFY+GG YDG C +++H
Sbjct: 230 KTAAVQISSYQVVPEG-ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHA 287
Query: 296 VAAVGYGS-TRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
V A+GYG+ +G Y ++KNSWG WGE G++++ R++G P GLC I KM+SYP
Sbjct: 288 VTAIGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|356515080|ref|XP_003526229.1| PREDICTED: vignain-like [Glycine max]
Length = 284
Score = 287 bits (735), Expect = 5e-75, Method: Compositional matrix adjust.
Identities = 145/282 (51%), Positives = 188/282 (66%), Gaps = 10/282 (3%)
Query: 73 KDNLRHIDETNRKI-KNYWLGLNEFADLRHEEF---KEMFLGLKPDLARRKDQSHEDFSY 128
K+N+ +I+ N K Y LG+N+FADL EEF + F G R + F Y
Sbjct: 5 KENVNYIEAFNNAANKPYKLGINQFADLTSEEFIVPRNRFNGH----MRFSNTRTTTFKY 60
Query: 129 KDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELID 188
++V LP S+DWR+KGAVT +KNQGSCG CWAFS +AA EGI++I TG L SLSEQE++D
Sbjct: 61 ENVTVLPDSIDWRQKGAVTPIKNQGSCGCCWAFSAIAATEGIHKISTGKLVSLSEQEVVD 120
Query: 189 CDNT-YNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHD 247
CD ++GC GG MD AF++I+ G++ E YPY +G C + + TI GY D
Sbjct: 121 CDTKGTDHGCEGGYMDGAFKFIIQNHGINTEASYPYKGVDGKCNIKEEAVHATTITGYED 180
Query: 248 VPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYG-STRG 306
VP N+E +L KA+ANQP+SVAI+A G DFQFY G++ G CGT+LDHGV AVGYG + G
Sbjct: 181 VPINNEKALQKAVANQPVSVAIDARGADFQFYKSGIFTGSCGTELDHGVTAVGYGENNEG 240
Query: 307 LDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
Y +VKNSWG +WGE+GY M+R EG+CGI +ASYP
Sbjct: 241 TKYWLVKNSWGTEWGEEGYTMMQRGVKAVEGICGIAMLASYP 282
>gi|400180441|gb|AFP73357.1| cysteine protease [Solanum habrochaites]
Length = 344
Score = 286 bits (733), Expect = 8e-75, Method: Compositional matrix adjust.
Identities = 158/356 (44%), Positives = 220/356 (61%), Gaps = 23/356 (6%)
Query: 1 MALSSQFKTILIS--FCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKV 58
MA+ +ILI+ F IS F + AR S L+ +++ E WMS+ +V
Sbjct: 1 MAMKVDLMSILITLFFVISMFNSQTRAR-------SQPKLSVSER----HELWMSRHGRV 49
Query: 59 YESLDEKLERFEIFKDNLRHIDETNRKIK-NYWLGLNEFADLRHEEFKEMFLGLK-PD-L 115
Y+ EK ERF IFK+N++ I+ N+ +Y LG+NEFAD+ EEF F GL P+
Sbjct: 50 YKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSEEFLAKFTGLNIPNSY 109
Query: 116 ARRKDQSHEDFSYKDVVD--LPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQI 173
S +F D+ D +P ++DWR+ GAVT VKNQG CG CWAFS V ++EG +I
Sbjct: 110 LSPSPMSSTEFKINDISDDDMPSNLDWRESGAVTQVKNQGQCGCCWAFSAVGSLEGAYKI 169
Query: 174 VTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMT 233
TGNL SEQEL+DC T N GCNGG M AF +I GG+ +E DY Y+ ++ TC +
Sbjct: 170 ATGNLMEFSEQELLDC-TTNNYGCNGGFMTNAFDFIRENGGISRESDYEYLGQQYTCR-S 227
Query: 234 KGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLD 293
+ ++ V I+ Y VP+ E SLL+A+ QP+S+ I AS +D QFY+GG YDG C +++
Sbjct: 228 QEKTAAVQISSYQVVPE-GETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCANRIN 285
Query: 294 HGVAAVGYGS-TRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
H V A+GYG+ G Y ++KNSWG WGEKG++++ R+ G P GLC I K++SYP
Sbjct: 286 HAVTAIGYGTDENGQKYWLLKNSWGTSWGEKGFMKIIRDYGNPSGLCDIAKLSSYP 341
>gi|400180407|gb|AFP73342.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 286 bits (733), Expect = 8e-75, Method: Compositional matrix adjust.
Identities = 156/354 (44%), Positives = 221/354 (62%), Gaps = 19/354 (5%)
Query: 1 MALSSQFKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYE 60
MA+ ILI+ FF+ S F + G S L+ +++ E WMS+ +VY+
Sbjct: 1 MAMKVDLMNILITL---FFVISMF--NTQTRGRSQPKLSVSER----HELWMSRHGRVYK 51
Query: 61 SLDEKLERFEIFKDNLRHIDETNRKIK-NYWLGLNEFADLRHEEFKEMFLGLK-PD-LAR 117
EK+ERF IFK+N++ I+ N+ +Y LG+NEFAD+ +EF F GL P+
Sbjct: 52 DEVEKVERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLS 111
Query: 118 RKDQSHEDFSYKDVVD--LPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVT 175
S +F D+ D +P ++DWR+ GAVT VK+QG CG CWAFS V ++EG +I T
Sbjct: 112 PSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIAT 171
Query: 176 GNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKG 235
GNL SEQEL+DC T N GCNGG M AF +I GG+ +E DY Y+ E+ TC ++
Sbjct: 172 GNLMEFSEQELLDC-TTNNYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCR-SQE 229
Query: 236 ESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHG 295
++ V I+ Y VP+ E SLL+A+ QP+S+ I AS +D QFY+GG YDG C +++H
Sbjct: 230 KTAAVQISSYQVVPE-GETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHA 287
Query: 296 VAAVGYGS-TRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
V A+GYG+ +G Y ++KNSWG WGE G++++ R++G P GLC I KM+SYP
Sbjct: 288 VTAIGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|255078398|ref|XP_002502779.1| cysteine endopeptidase [Micromonas sp. RCC299]
gi|226518045|gb|ACO64037.1| cysteine endopeptidase [Micromonas sp. RCC299]
Length = 414
Score = 286 bits (733), Expect = 9e-75, Method: Compositional matrix adjust.
Identities = 153/317 (48%), Positives = 194/317 (61%), Gaps = 12/317 (3%)
Query: 44 LIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKN----YWLGLNEFADL 99
L DLF W K K Y+S +EK R +IF DN + + N + +N +++GLN ADL
Sbjct: 64 LSDLFHEWTQKHGKTYDSEEEKELRLKIFADNHEFVQKHNAEYENGEHTHFVGLNHLADL 123
Query: 100 RHEEFKEMFLGLKPDL-ARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSC 158
+EFK+M LG L A R + Y DV P+ +DW GAVT VKNQ CGSC
Sbjct: 124 TKDEFKKM-LGYNAALRASRAPVDASTWEYADVTP-PEEIDWVASGAVTPVKNQKQCGSC 181
Query: 159 WAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKE 218
WAFST AVEG+N I TG L SLSE+ELI C N GCNGGLMD F++IV+ G+ E
Sbjct: 182 WAFSTTGAVEGVNAIKTGKLISLSEEELISCSTNGNMGCNGGLMDNGFEWIVNNRGIDTE 241
Query: 219 EDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQF 278
+ + Y+ +E C + V I+G+ DVP N EDSL+KA++ QP+SVAIEA + FQ
Sbjct: 242 DGWEYVAKEEKCGFFRRHHRAVAIDGFKDVPSNDEDSLMKAVSQQPVSVAIEADHQSFQL 301
Query: 279 YSGGVYDGH-CGTQLDHGVAAVGYG----STRGLDYIIVKNSWGPKWGEKGYIRMKRNTG 333
Y+GGVY CGT+LDHGV VGYG ST+ + +KNSWGP WGE GYIR+ +
Sbjct: 302 YAGGVYSAKDCGTELDHGVLLVGYGVDPKSTKHKHFWKIKNSWGPAWGEDGYIRIAKGGS 361
Query: 334 KPEGLCGINKMASYPIK 350
EG CG+ SYP K
Sbjct: 362 GVEGQCGVAMQPSYPTK 378
>gi|356521444|ref|XP_003529366.1| PREDICTED: thiol protease SEN102-like [Glycine max]
Length = 340
Score = 286 bits (733), Expect = 9e-75, Method: Compositional matrix adjust.
Identities = 153/315 (48%), Positives = 196/315 (62%), Gaps = 10/315 (3%)
Query: 40 SNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRK-IKNYWLGLNEFAD 98
S + E WM+ ++VY EK R +IFK+NL I++ N + K Y L LN FAD
Sbjct: 30 SESSIATQHEEWMAMHDRVYADSAEKDRRQQIFKENLEFIEKHNNEGKKRYNLSLNSFAD 89
Query: 99 LRHEEFKEMFLGL--KP--DLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGS 154
L +EEF G KP L K F V D+ S+DWRK+GAV +KNQG
Sbjct: 90 LTNEEFVASHTGALYKPPTQLGSFKINHSLGFHKMSVGDIEASLDWRKRGAVNDIKNQGR 149
Query: 155 CGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGG 214
CGSCWAFS VAAVEGINQI G L SLSEQ L+DC + N+GC+G ++ AF YI G
Sbjct: 150 CGSCWAFSAVAAVEGINQIKNGQLVSLSEQNLVDCAS--NDGCHGQYVEKAFDYIRDYG- 206
Query: 215 LHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGR 274
L EE+YPY+ GTC + + + I GY V +E+ LL A+A+QP+SV +EA G+
Sbjct: 207 LANEEEYPYVETVGTC--SGNSNPAIQIRGYQSVTPQNEEQLLTAVASQPVSVLLEAKGQ 264
Query: 275 DFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGK 334
FQFYSGGV+ G CGT+L+H V VGYG Y +++NSWG WGE GY+++ R+TG
Sbjct: 265 GFQFYSGGVFSGECGTELNHAVTIVGYGEEAEGKYWLIRNSWGKSWGEGGYMKLMRDTGN 324
Query: 335 PEGLCGINKMASYPI 349
P+GLCGIN ASYP
Sbjct: 325 PQGLCGINMQASYPF 339
>gi|222632170|gb|EEE64302.1| hypothetical protein OsJ_19139 [Oryza sativa Japonica Group]
Length = 1105
Score = 286 bits (733), Expect = 9e-75, Method: Compositional matrix adjust.
Identities = 133/220 (60%), Positives = 163/220 (74%)
Query: 131 VVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCD 190
V +P +VDWR+ GAVT VK+QGSCG+CW+FS A+EGIN+I TG+L SLSEQELIDCD
Sbjct: 126 VGAVPDAVDWRQSGAVTKVKDQGSCGACWSFSATGAMEGINKIKTGSLISLSEQELIDCD 185
Query: 191 NTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQ 250
+YN+GC GGLMDYA++++V GG+ E DYPY +GTC K + VVTI+GY DVP
Sbjct: 186 RSYNSGCGGGLMDYAYKFVVKNGGIDTEADYPYRETDGTCNKNKLKRRVVTIDGYKDVPA 245
Query: 251 NSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYI 310
N+ED LL+A+A QP+SV I S R FQ YS G++DG C T LDH + VGYGS G DY
Sbjct: 246 NNEDMLLQAVAQQPVSVGICGSARAFQLYSKGIFDGPCPTSLDHAILIVGYGSEGGKDYW 305
Query: 311 IVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
IVKNSWG WG KGY+ M RNTG G+CGIN+M S+P K
Sbjct: 306 IVKNSWGESWGMKGYMYMHRNTGNSNGVCGINQMPSFPTK 345
>gi|400180367|gb|AFP73322.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 286 bits (733), Expect = 9e-75, Method: Compositional matrix adjust.
Identities = 156/354 (44%), Positives = 221/354 (62%), Gaps = 19/354 (5%)
Query: 1 MALSSQFKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYE 60
MA+ ILI+ FF+ S F + G S L+ +++ E WMS+ +VY+
Sbjct: 1 MAMKVDLMNILITL---FFVISMF--NTQTRGRSQPKLSVSER----HELWMSRHGRVYK 51
Query: 61 SLDEKLERFEIFKDNLRHIDETNRKIK-NYWLGLNEFADLRHEEFKEMFLGLK-PD-LAR 117
EK+ERF IFK+N++ I+ N+ +Y LG+NEFAD+ +EF F GL P+
Sbjct: 52 DEVEKVERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLS 111
Query: 118 RKDQSHEDFSYKDVVD--LPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVT 175
S +F D+ D +P ++DWR+ GAVT VK+QG CG CWAFS V ++EG +I T
Sbjct: 112 PSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIAT 171
Query: 176 GNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKG 235
GNL SEQEL+DC T N GCNGG M AF +I+ GG+ +E DY Y+ E+ TC ++
Sbjct: 172 GNLMEFSEQELLDC-TTNNYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCR-SQE 229
Query: 236 ESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHG 295
++ V I+ Y VP+ E SLL+A+ QP+S+ I AS +D QFY+GG YDG C +++H
Sbjct: 230 KTAAVQISSYKVVPE-GETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHA 287
Query: 296 VAAVGYGS-TRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
V A+GYG+ +G Y ++KNSWG WGE G++++ R+ G P GLC I KM+SYP
Sbjct: 288 VTAIGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDYGNPAGLCDIAKMSSYP 341
>gi|218202087|gb|EEC84514.1| hypothetical protein OsI_31214 [Oryza sativa Indica Group]
Length = 348
Score = 286 bits (733), Expect = 9e-75, Method: Compositional matrix adjust.
Identities = 142/314 (45%), Positives = 203/314 (64%), Gaps = 12/314 (3%)
Query: 34 SPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKNYWLGL 93
+ +L+ + + E WM+++ ++Y+ EK RFE+FK N I+ N +WLG+
Sbjct: 23 AARELSDDAAMAARHERWMAQYGRMYKDDAEKARRFEVFKANAAFIESFNAGNHKFWLGV 82
Query: 94 NEFADLRHEEFK--EMFLGLKPDLARRKDQSHEDFSYKDV-VD-LPKSVDWRKKGAVTHV 149
N+FADL ++EF+ + G P R F Y++V +D LP ++DWR KG VT +
Sbjct: 83 NQFADLTNDEFRLTKTNKGFIPSTTRVP----TGFRYENVNIDALPATMDWRTKGVVTPI 138
Query: 150 KNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCD-NTYNNGCNGGLMDYAFQY 208
K+QG CG CWAFS VAA+EGI ++ TG L SLSEQEL+DCD + + GC GGLMD AF++
Sbjct: 139 KDQGQCGCCWAFSAVAAMEGIVKLSTGKLISLSEQELVDCDVHGEDQGCEGGLMDDAFKF 198
Query: 209 IVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVA 268
I+ GGL E +YPY + C+ + V +I GY DVP N+E +L+KA+ANQP+SVA
Sbjct: 199 IIKNGGLTTESNYPYAAADDKCKSV--SNSVASIKGYEDVPANNEAALMKAVANQPVSVA 256
Query: 269 IEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYG-STRGLDYIIVKNSWGPKWGEKGYIR 327
++ FQFY GGV G CGT LDHG+ A+GYG ++ G Y ++KNSWG WGE G++R
Sbjct: 257 VDGDDMTFQFYKGGVMIGSCGTDLDHGIVAIGYGKASDGTKYWLLKNSWGMTWGENGFLR 316
Query: 328 MKRNTGKPEGLCGI 341
M+++ G+CG+
Sbjct: 317 MEKDISDKRGMCGL 330
>gi|400180428|gb|AFP73352.1| cysteine protease [Solanum corneliomuelleri]
Length = 344
Score = 286 bits (732), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 157/356 (44%), Positives = 221/356 (62%), Gaps = 23/356 (6%)
Query: 1 MALSSQFKTILIS--FCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKV 58
MA+ ILI+ F IS F + AR S L+ +++ E WMS+ +V
Sbjct: 1 MAMKVDLMNILITLFFVISMFNTQTRAR-------SQPKLSVSER----HELWMSRHGRV 49
Query: 59 YESLDEKLERFEIFKDNLRHIDETNRKIK-NYWLGLNEFADLRHEEFKEMFLGLK-PD-L 115
Y+ EK ERF IFK+N++ I+ N+ +Y LG+NEFAD+ +EF F GL P+
Sbjct: 50 YKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSY 109
Query: 116 ARRKDQSHEDFSYKDVVD--LPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQI 173
S +F D+ D +P ++DWR+ GAVT VK+QG CG CWAFS V ++EG +I
Sbjct: 110 LSPSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKI 169
Query: 174 VTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMT 233
TGNL SEQEL+DC T N GCNGG M AF +I+ GG+ +E DY Y+ E+ TC +
Sbjct: 170 ATGNLMEFSEQELLDC-TTNNYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCR-S 227
Query: 234 KGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLD 293
+ ++ V I+ Y VP+ E SLL+A+ QP+S+ I AS +D QFY+GG YDG C +++
Sbjct: 228 QEKTAAVQISSYQVVPE-GETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRIN 285
Query: 294 HGVAAVGYGS-TRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
H V A+GYG+ +G Y ++KNSWG WGE G++++ R++G P GLC I KM+SYP
Sbjct: 286 HAVTAIGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|110737404|dbj|BAF00646.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 345
Score = 286 bits (732), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 153/357 (42%), Positives = 212/357 (59%), Gaps = 21/357 (5%)
Query: 1 MALSSQFKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYE 60
MA T+LI F I + +R + ++D E WM++F + Y
Sbjct: 1 MASIMVLVTVLIILFTGFRISQATSRTV---------IFREQSMVDKHEQWMARFSREYR 51
Query: 61 SLDEKLERFEIFKDNLRHIDETNRK-IKNYWLGLNEFADLRHEEFKEMFLGLK------P 113
EK R ++FK NL+ I+ N+K K+Y LG+NEFAD +EEF + GLK P
Sbjct: 52 DELEKNMRRDVFKKNLKFIENFNKKGNKSYKLGVNEFADWTNEEFLAIHTGLKGLTEVSP 111
Query: 114 DLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQI 173
K S + ++ D+V +S DWR +GAVT VK QG CG CWAFS VAAVEG+ +I
Sbjct: 112 SKVVAKTISSQTWNVSDMV--VESKDWRAEGAVTPVKYQGQCGCCWAFSAVAAVEGVAKI 169
Query: 174 VTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMT 233
GNL SLSEQ+L+DCD Y+ C+GG+M AF Y+V G+ E DY Y +G C
Sbjct: 170 AGGNLVSLSEQQLLDCDREYDRDCDGGIMSDAFNYVVQNRGIASENDYSYQGSDGGCR-- 227
Query: 234 KGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLD 293
I+G+ VP N+E +LL+A++ QP+SV+++A+G F YSGGVYDG CGT +
Sbjct: 228 SNARPAARISGFQTVPSNNERALLEAVSRQPVSVSMDATGDGFMHYSGGVYDGPCGTSSN 287
Query: 294 HGVAAVGYGSTR-GLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPI 349
H V VGYG+++ G Y + KNSWG W EKGYIR++R+ P+G+CG+ + A YP+
Sbjct: 288 HAVTFVGYGTSQDGTKYWLAKNSWGETWEEKGYIRIRRDVAWPQGMCGVAQYAFYPV 344
>gi|326430491|gb|EGD76061.1| cathepsin [Salpingoeca sp. ATCC 50818]
Length = 381
Score = 286 bits (732), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 149/299 (49%), Positives = 194/299 (64%), Gaps = 19/299 (6%)
Query: 48 FESWMSKFEKVYESLDEKLERFEIFKDNL----RHIDETNRKIKNYWLGLNEFADLRHEE 103
F+ + + FEK YES +E+ RF IF DNL RH E R + + +G+N+FADL +EE
Sbjct: 20 FDDFKTTFEKQYESPEEEARRFAIFADNLAFIARHNAEAARGLHTHTVGVNQFADLTNEE 79
Query: 104 FKEMFLGLKP-DLARRKDQSHEDFSYKDVVDLPK--SVDWRKKGAVTHVKNQGSCGSCWA 160
+++++L P +L R+ Q + +D P SVDWR+KGAVT +KNQG CGSCW+
Sbjct: 80 YRQLYLRPYPTELLGRERQ-------EVWLDGPNAGSVDWRQKGAVTPIKNQGQCGSCWS 132
Query: 161 FSTVAAVEGINQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIVSTGGLHKEE 219
FST +VEG + I TGNL SLSEQ+L+DC ++ N GCNGGLMD AF+YI+S GGL E+
Sbjct: 133 FSTTGSVEGAHAIATGNLVSLSEQQLVDCSGSFGNQGCNGGLMDNAFKYIISNGGLDTEQ 192
Query: 220 DYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFY 279
DYPY +G C+ +K V+I+GY DVPQN+ED L A+ P+SVAIEA + FQ Y
Sbjct: 193 DYPYTARDGVCDKSKESKHAVSISGYKDVPQNNEDQLAAAVEKGPVSVAIEADQQSFQMY 252
Query: 280 SGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGL 338
S GV+ G CGT LDHGV VGY S DY IVKNSWG W +G + EG+
Sbjct: 253 SSGVFSGPCGTNLDHGVLVVGYTS----DYWIVKNSWGASWVTRGGCHSGEQAVRIEGI 307
>gi|356514419|ref|XP_003525903.1| PREDICTED: LOW QUALITY PROTEIN: cysteine proteinase RD21a-like
[Glycine max]
Length = 343
Score = 286 bits (731), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 156/353 (44%), Positives = 216/353 (61%), Gaps = 34/353 (9%)
Query: 5 SQFKTILISFCISFFIRSSFARDFSIVGY--SPEDLT---SNDKLIDLFESWMSKFEKVY 59
S TI I F + S+ D SI+ Y S D + S+++++ ++E ++K KVY
Sbjct: 6 SSKATIFILFFTVLAVSSAL--DLSIISYDRSHADKSGWRSDEEVMSIYEEXLAKHGKVY 63
Query: 60 ESLDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRK 119
++DE ERF+I K+NL+ +++ N + Y +GLN FAD +R
Sbjct: 64 NAIDEMEERFQISKENLKFVEQHNAGNRTYKVGLNRFADR----------------SRMM 107
Query: 120 DQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLA 179
+ ++ + +L +SVDWRK+GAV VK Q C SC F+ +AAVEGIN+IVTGNL
Sbjct: 108 TRPSSRYAPRVSDNLSESVDWRKEGAVVRVKTQSECESCRTFTVIAAVEGINKIVTGNLT 167
Query: 180 SLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEV 239
+LS DCD T N GC+GGL DYA ++I++ GG+ EEDYP+ G C+ K +
Sbjct: 168 ALS-----DCDRTVNAGCSGGLADYALEFIINNGGIDTEEDYPFQGAVGICDQYK----I 218
Query: 240 VTINGYHDVPQNSEDSLLKALANQPLSVA-IEASGRDFQFYSGGVYDGHCGTQLDHGVAA 298
++GY VP E +L KA+ANQP+SVA IEA G++FQ Y G++ G CGT +DHGV A
Sbjct: 219 NAVDGYERVPAYDELALKKAVANQPVSVAYIEAYGKEFQLYESGIFTGKCGTSIDHGVTA 278
Query: 299 VGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGK-PEGLCGINKMASYPIK 350
VGYG+ G+DY IVKNSWG WGE GY+RM+RNT + G CGI + YPIK
Sbjct: 279 VGYGTENGIDYWIVKNSWGENWGEAGYVRMERNTAEDTAGKCGIAILTLYPIK 331
>gi|400180393|gb|AFP73335.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 286 bits (731), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 155/354 (43%), Positives = 221/354 (62%), Gaps = 19/354 (5%)
Query: 1 MALSSQFKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYE 60
MA+ ILI+ FF+ S F + G S L+ +++ E WMS+ +VY+
Sbjct: 1 MAMKVDLMNILITL---FFVISMF--NTQTRGRSQPKLSVSER----HELWMSRHGRVYK 51
Query: 61 SLDEKLERFEIFKDNLRHIDETNRKIK-NYWLGLNEFADLRHEEFKEMFLGLK-PD-LAR 117
EK+ERF IFK+N++ I+ N+ +Y LG+NEFAD+ +EF F GL P+
Sbjct: 52 DEVEKVERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLS 111
Query: 118 RKDQSHEDFSYKDVVD--LPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVT 175
S + D+ D +P ++DWR+ GAVT VK+QG CG CWAFS V ++EG +I T
Sbjct: 112 PSPMSSTELKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIAT 171
Query: 176 GNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKG 235
GNL SEQEL+DC T N GCNGG M AF +I+ GG+ +E DY Y+ E+ TC ++
Sbjct: 172 GNLMEFSEQELLDC-TTNNYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCR-SQE 229
Query: 236 ESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHG 295
++ V I+ Y VP+ E SLL+A+ QP+S+ I AS +D QFY+GG YDG C +++H
Sbjct: 230 KTAAVQISSYKVVPE-GETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHA 287
Query: 296 VAAVGYGS-TRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
V A+GYG+ +G Y ++KNSWG WGE G++++ R++G P GLC I KM+SYP
Sbjct: 288 VTAIGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|400180389|gb|AFP73333.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 286 bits (731), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 156/354 (44%), Positives = 221/354 (62%), Gaps = 19/354 (5%)
Query: 1 MALSSQFKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYE 60
MA+ ILI+ FF+ S F + G S L+ +++ E WMS+ +VY+
Sbjct: 1 MAMKVDLMNILITL---FFVISMF--NTQTRGRSQPKLSVSER----HELWMSRHGRVYK 51
Query: 61 SLDEKLERFEIFKDNLRHIDETNRKIK-NYWLGLNEFADLRHEEFKEMFLGLK-PD-LAR 117
EK ERF IFK+N++ I+ N+ +Y LG+NEFAD+ +EF F GL P+
Sbjct: 52 DEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLS 111
Query: 118 RKDQSHEDFSYKDVVD--LPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVT 175
S +F D+ D +P ++DWR+ GAVT VK+QG CG CWAFS V ++EG +I T
Sbjct: 112 PSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIAT 171
Query: 176 GNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKG 235
GNL SEQEL+DC T N GCNGG M AF +I+ GG+ +E DY Y+ E+ TC ++
Sbjct: 172 GNLMEFSEQELLDC-TTNNYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCR-SQE 229
Query: 236 ESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHG 295
++ V I+ Y VP+ E SLL+A+ QP+S+ I AS +D QFY+GG YDG C +++H
Sbjct: 230 KTAAVQISSYKVVPE-GETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHA 287
Query: 296 VAAVGYGS-TRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
V A+GYG+ +G Y ++KNSWG WGE G++++ R++G P GLC I KM+SYP
Sbjct: 288 VTAIGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|400180353|gb|AFP73315.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 286 bits (731), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 156/354 (44%), Positives = 221/354 (62%), Gaps = 19/354 (5%)
Query: 1 MALSSQFKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYE 60
MA+ ILI+ FF+ S F + G S L+ +++ E WMS+ +VY+
Sbjct: 1 MAMKVDLMNILITL---FFVISMF--NTQTRGRSQPKLSVSER----HELWMSRHGRVYK 51
Query: 61 SLDEKLERFEIFKDNLRHIDETNRKIK-NYWLGLNEFADLRHEEFKEMFLGLK-PD-LAR 117
EK ERF IFK+N++ I+ N+ +Y LG+NEFAD+ +EF F GL P+
Sbjct: 52 DEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLS 111
Query: 118 RKDQSHEDFSYKDVVD--LPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVT 175
S +F D+ D +P ++DWR+ GAVT VK+QG CG CWAFS V ++EG +I T
Sbjct: 112 PSPMSSTEFIINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIAT 171
Query: 176 GNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKG 235
GNL SEQEL+DC T N GCNGG M AF +I+ GG+ +E DY Y+ E+ TC ++
Sbjct: 172 GNLMEFSEQELLDC-TTNNYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCR-SQE 229
Query: 236 ESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHG 295
++ V I+ Y VP+ E SLL+A+ QP+S+ I AS +D QFY+GG YDG C +++H
Sbjct: 230 KTAAVQISSYQVVPE-GETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHA 287
Query: 296 VAAVGYGS-TRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
V A+GYG+ +G Y ++KNSWG WGE G++++ R++G P GLC I KM+SYP
Sbjct: 288 VTAIGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPAGLCDIAKMSSYP 341
>gi|297851334|ref|XP_002893548.1| peptidase C1A papain family protein [Arabidopsis lyrata subsp.
lyrata]
gi|297339390|gb|EFH69807.1| peptidase C1A papain family protein [Arabidopsis lyrata subsp.
lyrata]
Length = 346
Score = 286 bits (731), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 150/311 (48%), Positives = 202/311 (64%), Gaps = 14/311 (4%)
Query: 49 ESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRK-IKNYWLGLNEFADLRHEEFKEM 107
+ WM++F +VY EK RF++FK NL+ I++ N+K + Y LG+NEFAD EEF
Sbjct: 39 QQWMTRFSRVYSDELEKQMRFDVFKKNLKFIEKFNKKGDRTYKLGVNEFADWTKEEFIAT 98
Query: 108 FLGLK-----PDLARRKDQSHEDFSYKDVVDL--PKSVDWRKKGAVTHVKNQGSCGSCWA 160
GLK P + D+ +++ +V D+ P+ DWR +GAVT VK QG CG CWA
Sbjct: 99 HTGLKGFNGIPS-SEFVDEMIPSWNW-NVSDVAGPEIKDWRYEGAVTPVKYQGQCGCCWA 156
Query: 161 FSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEED 220
FS+VAAVEG+ +IV GNL SLSEQ+L+DCD +NGCNGG+M AF YI+ G+ E
Sbjct: 157 FSSVAAVEGLTKIVGGNLVSLSEQQLLDCDRERDNGCNGGIMSDAFSYIIKNRGIASEAS 216
Query: 221 YPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYS 280
YPY EGTC S I G+ VP N+E +LL+A++ QP+SV+I+A G F YS
Sbjct: 217 YPYQETEGTCRYNAKPS--AWIRGFQTVPSNNERALLEAVSRQPVSVSIDADGPGFMHYS 274
Query: 281 GGVYD-GHCGTQLDHGVAAVGYG-STRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGL 338
GGVYD +CGT ++H V VGYG S G+ Y + KNSWG WGE GYIR++R+ P+G+
Sbjct: 275 GGVYDEPYCGTDVNHAVTFVGYGTSPEGIKYWLAKNSWGETWGENGYIRIRRDVAWPQGM 334
Query: 339 CGINKMASYPI 349
CG+ + A YP+
Sbjct: 335 CGVAQYAFYPV 345
>gi|400180377|gb|AFP73327.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 286 bits (731), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 156/354 (44%), Positives = 221/354 (62%), Gaps = 19/354 (5%)
Query: 1 MALSSQFKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYE 60
MA+ ILI+ FF+ S F + G S +L+ +++ E WMS+ +VY+
Sbjct: 1 MAMKVDLMNILITL---FFVISMF--NTQTRGRSQPELSVSER----HELWMSRHGRVYK 51
Query: 61 SLDEKLERFEIFKDNLRHIDETNRKIK-NYWLGLNEFADLRHEEFKEMFLGLK-PD-LAR 117
EK ERF IFK+N++ I+ N+ +Y LG+NEFAD+ +EF F GL P+
Sbjct: 52 DEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLS 111
Query: 118 RKDQSHEDFSYKDVVD--LPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVT 175
S +F D+ D +P ++DWR+ GAVT VK+QG CG CWAFS V ++EG +I T
Sbjct: 112 PSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIAT 171
Query: 176 GNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKG 235
GNL SEQEL+DC T N GCNGG M AF +I+ GG+ +E DY Y E+ TC ++
Sbjct: 172 GNLMEFSEQELLDC-TTNNYGCNGGFMTNAFDFIIENGGISRESDYEYQGEQYTCR-SQE 229
Query: 236 ESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHG 295
++ V I+ Y VP+ E SLL+A+ QP+S+ I AS +D QFY+GG YDG C +++H
Sbjct: 230 KTAAVQISSYQVVPE-GETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHA 287
Query: 296 VAAVGYGS-TRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
V A+GYG+ +G Y ++KNSWG WGE G++++ R++G P GLC I KM+SYP
Sbjct: 288 VTAIGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|400180453|gb|AFP73363.1| cysteine protease [Solanum chilense]
Length = 344
Score = 285 bits (730), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 157/356 (44%), Positives = 221/356 (62%), Gaps = 23/356 (6%)
Query: 1 MALSSQFKTILIS--FCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKV 58
MA+ ILI+ F IS F + AR S L+ +++ E WMS+ +V
Sbjct: 1 MAMKVDLMNILITLFFVISMFNTQTRAR-------SQPKLSVSER----HELWMSRHGRV 49
Query: 59 YESLDEKLERFEIFKDNLRHIDETNRKIK-NYWLGLNEFADLRHEEFKEMFLGLK-PD-L 115
Y+ EK ERF IFK+N++ I+ N+ +Y LG+NEFAD+ +EF F GL P+
Sbjct: 50 YKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSY 109
Query: 116 ARRKDQSHEDFSYKDVVD--LPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQI 173
S +F D+ D +P ++DWR+ GAVT VK+QG CG CWAFS V ++EG +I
Sbjct: 110 LSPSPVSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKI 169
Query: 174 VTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMT 233
TGNL SEQEL+DC T N GCNGG M AF +I+ GG+ +E DY Y+ E+ TC +
Sbjct: 170 ATGNLMEFSEQELLDC-TTNNYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCR-S 227
Query: 234 KGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLD 293
+ ++ V I+ Y VP+ E SLL+A+ QP+S+ I AS +D QFY+GG YDG C +++
Sbjct: 228 QEKTAAVQISSYQVVPE-GETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRIN 285
Query: 294 HGVAAVGYGS-TRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
H V A+GYG+ +G Y ++KNSWG WGE G++++ R++G P GLC I KM+SYP
Sbjct: 286 HAVTAIGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|400180457|gb|AFP73365.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 285 bits (730), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 156/354 (44%), Positives = 221/354 (62%), Gaps = 19/354 (5%)
Query: 1 MALSSQFKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYE 60
MA+ ILI+ FF+ S F + G S L+ +++ E WMS+ +VY+
Sbjct: 1 MAMKVDLMNILITL---FFVISMF--NTQTRGRSQPKLSVSER----HELWMSRHGRVYK 51
Query: 61 SLDEKLERFEIFKDNLRHIDETNRKIK-NYWLGLNEFADLRHEEFKEMFLGLK-PD-LAR 117
EK ERF IFK+N++ I+ N+ +Y LG+NEFAD+ +EF F GL P+
Sbjct: 52 DEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLS 111
Query: 118 RKDQSHEDFSYKDVVD--LPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVT 175
S +F D+ D +P ++DWR+ GAVT VK+QG CG CWAFS V ++EG +I T
Sbjct: 112 PSPMSSTEFIINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIAT 171
Query: 176 GNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKG 235
GNL SEQEL+DC T N GCNGG M AF +I+ GG+ +E DY Y+ E+ TC ++
Sbjct: 172 GNLMEFSEQELLDC-TTNNYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCR-SQE 229
Query: 236 ESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHG 295
++ V I+ Y VP+ E SLL+A+ QP+S+ I AS +D QFY+GG YDG C +++H
Sbjct: 230 KTAAVQISSYKVVPE-GETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHA 287
Query: 296 VAAVGYGS-TRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
V A+GYG+ +G Y ++KNSWG WGE G++++ R++G P GLC I KM+SYP
Sbjct: 288 VTAIGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|326514800|dbj|BAJ99761.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 291
Score = 285 bits (730), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 148/221 (66%), Positives = 169/221 (76%), Gaps = 2/221 (0%)
Query: 131 VVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCD 190
V D+P SVDWR+KGAVT VK+QG CGSCWAFST+AAVEGIN I T NL SLSEQ+L+DCD
Sbjct: 58 VRDVPSSVDWRQKGAVTAVKDQGQCGSCWAFSTIAAVEGINAIRTKNLTSLSEQQLVDCD 117
Query: 191 NTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQ 250
N GCNGGLMDYAFQYI GG+ E+ YPY + + K S VVTI+GY DVP
Sbjct: 118 TKSNAGCNGGLMDYAFQYIAKHGGVAAEDAYPYKARQAS-SCNKKPSAVVTIDGYEDVPA 176
Query: 251 NSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGST-RGLDY 309
N E +L KA+A QP++VAIEASG FQFYS GV+ G CGT+LDHGVAAVGYG+T G Y
Sbjct: 177 NDETALKKAVAAQPVAVAIEASGSHFQFYSEGVFAGKCGTELDHGVAAVGYGTTVDGTKY 236
Query: 310 IIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
IVKNSWGP+WGEKGYIRMKR+ EGLCGI ASYP+K
Sbjct: 237 WIVKNSWGPEWGEKGYIRMKRDVEDKEGLCGIAMEASYPVK 277
>gi|242092702|ref|XP_002436841.1| hypothetical protein SORBIDRAFT_10g009840 [Sorghum bicolor]
gi|241915064|gb|EER88208.1| hypothetical protein SORBIDRAFT_10g009840 [Sorghum bicolor]
Length = 328
Score = 285 bits (730), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 150/349 (42%), Positives = 209/349 (59%), Gaps = 34/349 (9%)
Query: 9 TILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLER 68
+IL ++FF ++ A DL + ++ E WM ++ +VY+ EK R
Sbjct: 7 SILAILGLAFFCGAALA---------ARDLNDDSAMVARHEQWMVQYSRVYKDTTEKARR 57
Query: 69 FEIFKDNLRHIDETNRK-IKNYWLGLNEFADLRHEEFKEMFL--GLKPDLARRKDQSHED 125
FE+FK N++ I+ N + +WLG+N+FADL ++EF+ G KP +
Sbjct: 58 FEVFKANVKFIESFNAGGNRKFWLGVNQFADLTNDEFRATKTNKGFKPSPVK----VSTG 113
Query: 126 FSYKDV-VD-LPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSE 183
F Y++V VD LP ++DWR KGAVT +K+QG C EGI +I TG L SLSE
Sbjct: 114 FRYENVSVDALPATIDWRTKGAVTPIKDQGQC------------EGIVKISTGKLISLSE 161
Query: 184 QELIDCD-NTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTI 242
QEL+DCD + + GC GGLMD AF++I+ GGL E YPY +G C+ G + T+
Sbjct: 162 QELVDCDVHGEDQGCEGGLMDDAFKFIIKNGGLTTESSYPYTAADGKCK--SGSNSAATV 219
Query: 243 NGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYG 302
G+ DVP N E +L+KA+ANQP+SVA++ FQFYSGGV G CGT LDHG+AA+GYG
Sbjct: 220 KGFEDVPANDEAALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYG 279
Query: 303 STR-GLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
T G Y ++KNSWG WGE GY+RM+++ G+CG+ SYP +
Sbjct: 280 QTSDGTKYWLLKNSWGTTWGENGYLRMEKDISDKRGMCGLAMEPSYPTE 328
>gi|400180381|gb|AFP73329.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 285 bits (729), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 155/354 (43%), Positives = 221/354 (62%), Gaps = 19/354 (5%)
Query: 1 MALSSQFKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYE 60
MA+ ILI+ FF+ S F + G S L+ +++ E WMS+ +VY+
Sbjct: 1 MAMKVDLMNILITL---FFVISMF--NTQTRGRSQPKLSVSER----HELWMSRHGRVYK 51
Query: 61 SLDEKLERFEIFKDNLRHIDETNRKIK-NYWLGLNEFADLRHEEFKEMFLGLK-PD-LAR 117
EK ERF IFK+N++ I+ N+ +Y LG+NEFAD+ +EF F GL P+
Sbjct: 52 DEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLS 111
Query: 118 RKDQSHEDFSYKDVVD--LPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVT 175
S +F D+ D +P ++DWR+ GAVT VK+QG CG CWAFS V ++EG +I T
Sbjct: 112 PSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIAT 171
Query: 176 GNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKG 235
GNL SEQEL+DC T N GCNGG M AF +I+ GG+ +E DY Y+ ++ TC ++
Sbjct: 172 GNLMEFSEQELLDC-TTNNYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQYTCR-SQE 229
Query: 236 ESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHG 295
++ V I+ Y VP+ E SLL+A+ QP+S+ I AS +D QFY+GG YDG C +++H
Sbjct: 230 KTAAVQISSYKVVPE-GETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHA 287
Query: 296 VAAVGYGS-TRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
V A+GYG+ +G Y ++KNSWG WGE G++++ R++G P GLC I KM+SYP
Sbjct: 288 VTAIGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGDPSGLCDITKMSSYP 341
>gi|2463586|dbj|BAA22545.1| FB22 precursor [Ananas comosus]
Length = 340
Score = 285 bits (729), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 148/350 (42%), Positives = 210/350 (60%), Gaps = 18/350 (5%)
Query: 1 MALSSQFKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYE 60
MA Q + + C+ + S+ +RD +D ++ FE WM+++ +VY+
Sbjct: 1 MASKVQLVFLFLFLCVMWASPSAASRD-----------EPSDPMMKRFEEWMAEYGRVYK 49
Query: 61 SLDEKLERFEIFKDNLRHIDE-TNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRK 119
DEK+ RF+IFK+N+ HI+ NR +Y LG+N+F D+ + EF + G+ L ++
Sbjct: 50 DNDEKMRRFQIFKNNVNHIETFNNRNGNSYTLGINKFTDMTNNEFVTQYTGVSLPLNFKR 109
Query: 120 DQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLA 179
+ F ++ + +S+DWR GAVT VK+Q CGSCWAFS +A VEGI +IVTG L
Sbjct: 110 EPVVS-FDDVNISAVGQSIDWRDYGAVTEVKDQNPCGSCWAFSAIATVEGIYKIVTGYLV 168
Query: 180 SLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEV 239
SLSEQE++DC +NGC+GG +D A+ +I+S G+ E DYPY EG C +
Sbjct: 169 SLSEQEVLDC--AVSNGCDGGFVDNAYDFIISNNGVASEADYPYQAYEGDCTANSWPNSA 226
Query: 240 VTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAV 299
I GY V N E S+ A+ NQP++ AI+ASG +FQ+Y+GGV+ G CGT L+H + +
Sbjct: 227 Y-ITGYSYVRSNDESSMKYAVWNQPIAAAIDASGDNFQYYNGGVFSGPCGTSLNHAITII 285
Query: 300 GYGS-TRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
GYG + G Y IVKNSWG WGE+GY+RM R GLCGI YP
Sbjct: 286 GYGQDSSGTQYWIVKNSWGSSWGERGYVRMARGVSS-SGLCGIAMDPLYP 334
>gi|400180365|gb|AFP73321.1| cysteine protease [Solanum peruvianum]
gi|400180395|gb|AFP73336.1| cysteine protease [Solanum peruvianum]
gi|400180405|gb|AFP73341.1| cysteine protease [Solanum peruvianum]
gi|400180409|gb|AFP73343.1| cysteine protease [Solanum peruvianum]
gi|400180411|gb|AFP73344.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 285 bits (729), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 155/354 (43%), Positives = 221/354 (62%), Gaps = 19/354 (5%)
Query: 1 MALSSQFKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYE 60
MA+ ILI+ FF+ S F + G S L+ +++ E WMS+ +VY+
Sbjct: 1 MAMKVDLMNILITL---FFVISMF--NTQTRGRSQPKLSVSER----HELWMSRHGRVYK 51
Query: 61 SLDEKLERFEIFKDNLRHIDETNRKIK-NYWLGLNEFADLRHEEFKEMFLGLK-PD-LAR 117
EK ERF IFK+N++ I+ N+ +Y LG+NEFAD+ +EF F GL P+
Sbjct: 52 DEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLS 111
Query: 118 RKDQSHEDFSYKDVVD--LPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVT 175
S +F D+ D +P ++DWR+ GAVT VK+QG CG CWAFS V ++EG +I T
Sbjct: 112 PSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIAT 171
Query: 176 GNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKG 235
GNL SEQEL+DC T N GCNGG M AF +I+ GG+ +E DY Y+ ++ TC ++
Sbjct: 172 GNLMEFSEQELLDC-TTNNYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQYTCR-SQE 229
Query: 236 ESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHG 295
++ V I+ Y VP+ E SLL+A+ QP+S+ I AS +D QFY+GG YDG C +++H
Sbjct: 230 KTAAVQISSYKVVPE-GETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHA 287
Query: 296 VAAVGYGS-TRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
V A+GYG+ +G Y ++KNSWG WGE G++++ R++G P GLC I KM+SYP
Sbjct: 288 VTAIGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|242068363|ref|XP_002449458.1| hypothetical protein SORBIDRAFT_05g013840 [Sorghum bicolor]
gi|241935301|gb|EES08446.1| hypothetical protein SORBIDRAFT_05g013840 [Sorghum bicolor]
Length = 350
Score = 285 bits (729), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 157/336 (46%), Positives = 210/336 (62%), Gaps = 24/336 (7%)
Query: 25 ARDFSIV--GYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDET 82
ARD S GY E + + WM++ + Y+ EK RF++FK N +D +
Sbjct: 30 ARDLSTSTGGYGEEAMKVR------HQQWMAEHGRTYKDEAEKARRFQVFKANADFVDRS 83
Query: 83 NRKI-KNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQS---HEDFSYKDVVDLPKSV 138
N K+Y L +NEFAD+ ++EF M+ GLKP A K + +E+ + DV ++V
Sbjct: 84 NAAGGKSYELAINEFADMTNDEFVAMYTGLKPVPAGPKKMAGFKYENLTLSDVDQ--QAV 141
Query: 139 DWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCN 198
DWR+KGAVT +KNQG CG CWAF+ VAAVE I+QI TGNL SLSEQ+++DCD NNGCN
Sbjct: 142 DWRQKGAVTGIKNQGQCGCCWAFAAVAAVESIHQITTGNLVSLSEQQVLDCDTDGNNGCN 201
Query: 199 GGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLK 258
GG +D AFQYI+S GGL E+ YPY +GTC+ + VTI+ Y DVP E +L
Sbjct: 202 GGYIDNAFQYIISNGGLATEDAYPYAAAQGTCQSSV--QPAVTISSYQDVPSGDEAALAA 259
Query: 259 ALANQPLSVAIEASGRDFQFYSGGVYDGH-CGT-QLDHGVAAVGYGSTR-GLDYIIVKNS 315
A+ANQP++VAI+A +FQFYS GV CGT L+H V AVGY + G Y ++KN
Sbjct: 260 AVANQPVAVAIDAH-NNFQFYSSGVLTADTCGTPSLNHAVTAVGYSTAEDGTPYWLLKNQ 318
Query: 316 WGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIKK 351
WG WGE GY+R++R T CG+ + ASYP+ +
Sbjct: 319 WGQNWGEGGYLRVERGTNA----CGVAQQASYPVAR 350
>gi|226507844|ref|NP_001148894.1| LOC100282514 precursor [Zea mays]
gi|194703250|gb|ACF85709.1| unknown [Zea mays]
gi|195622994|gb|ACG33327.1| vignain precursor [Zea mays]
Length = 356
Score = 285 bits (729), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 147/330 (44%), Positives = 194/330 (58%), Gaps = 23/330 (6%)
Query: 42 DKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRH 101
D +++ FE WM + ++Y EK R E+++ N+ ++ N Y L N+FADL +
Sbjct: 27 DPMLERFEQWMGRHGRLYADAGEKQRRLEVYRRNVELVETFNSMGNGYRLADNKFADLTN 86
Query: 102 EEFKEMFLGL-KPDLARRKDQSHED----------FSYKDVVDLPKSVDWRKKGAVTHVK 150
EEF+ LG +P S + DLPKSVDWR+KGAV VK
Sbjct: 87 EEFRAKMLGFGRPRSGGGAGHSTAPSTVACIGSGLMGRQGYSDLPKSVDWREKGAVAPVK 146
Query: 151 NQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIV 210
+QG CGSCWAFS VAA+EGINQI G L SLSEQEL+DCD T GC GG M +AF++++
Sbjct: 147 SQGDCGSCWAFSAVAAIEGINQIKNGKLVSLSEQELVDCD-TKAIGCAGGYMSWAFEFVM 205
Query: 211 STGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIE 270
GL E +YPY G C+ K + V+I+GY +V +SE LL+A A QP+SVA++
Sbjct: 206 KNRGLTTERNYPYQGLNGACQTPKLKESAVSISGYMNVTPSSEPDLLRAAAAQPVSVAVD 265
Query: 271 ASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTR-----------GLDYIIVKNSWGPK 319
A +Q Y GGV+ G C +L+HGV VGYG T+ G Y IVKNSWGP+
Sbjct: 266 AGSFVWQLYGGGVFTGPCTAELNHGVTVVGYGETQGDTDGDGSGVPGKKYWIVKNSWGPE 325
Query: 320 WGEKGYIRMKRNTGKPEGLCGINKMASYPI 349
WG+ GYI M+R GLCGI + SYP+
Sbjct: 326 WGDAGYILMQREASVASGLCGIAMLPSYPV 355
>gi|218202389|gb|EEC84816.1| hypothetical protein OsI_31898 [Oryza sativa Indica Group]
Length = 350
Score = 285 bits (729), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 151/331 (45%), Positives = 192/331 (58%), Gaps = 22/331 (6%)
Query: 38 LTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFA 97
LT D ++D FE WM + + Y EK RFE+++ N+ ++ N Y L N+FA
Sbjct: 22 LTRADLMLDRFEQWMIRHGRAYTDSGEKQRRFEVYRRNVELVETFNSMSNGYKLADNKFA 81
Query: 98 DLRHEEFKEMFLGLKPDLARRK-------DQSHEDFSYKDVVDLPKSVDWRKKGAVTHVK 150
DL +EEF+ LG +P + + D + S D+ LPKSVDWRKKGAV VK
Sbjct: 82 DLTNEEFRAKMLGFRPHVTIPQISNTCSADIAMPGESSDDI--LPKSVDWRKKGAVVEVK 139
Query: 151 NQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIV 210
NQG CGSCWAFS VAA+EGINQI G L SLSEQEL+DCD+ GC GG M +AF+++V
Sbjct: 140 NQGDCGSCWAFSAVAAIEGINQIKNGELVSLSEQELVDCDDE-AVGCGGGYMSWAFEFVV 198
Query: 211 STGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIE 270
GL E YPY G C+ K V I GY +V +SE L +A A QP+SVA++
Sbjct: 199 GNHGLTTEASYPYHAANGACQAAKLNQSAVAIAGYRNVTPSSEPDLARAAAAQPVSVAVD 258
Query: 271 ASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTR-----------GLDYIIVKNSWGPK 319
FQ Y GVY G C ++HGV VGYG + G Y IVKNSWG +
Sbjct: 259 GGSFMFQLYGSGVYTGPCTADVNHGVTVVGYGESEPKTDGGGAAKGGEKYWIVKNSWGAE 318
Query: 320 WGEKGYIRMKRN-TGKPEGLCGINKMASYPI 349
WG+ GYI M+R+ G GLCGI + SYP+
Sbjct: 319 WGDAGYILMQRDVAGLASGLCGIALLPSYPV 349
>gi|20334377|gb|AAM19209.1|AF493234_1 cysteine protease [Solanum lycopersicum]
gi|400180431|gb|AFP73353.1| cysteine protease [Solanum lycopersicum]
Length = 345
Score = 285 bits (728), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 154/355 (43%), Positives = 218/355 (61%), Gaps = 20/355 (5%)
Query: 1 MALSSQFKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYE 60
MA+ ILI+ FF+ S F + G S L+ +++ E WMS+ +VY+
Sbjct: 1 MAMKVDLMNILITL---FFVISMF--NTQTRGRSQPKLSVSER----HELWMSRHGRVYK 51
Query: 61 SLDEKLERFEIFKDNLRHIDETNRKIK-NYWLGLNEFADLRHEEFKEMFLGLKPDLARRK 119
EK ERF IFK+N++ I+ N+ +Y LG+NEFAD+ +EF F GL +
Sbjct: 52 DEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLS 111
Query: 120 DQSHEDFSYKDVVDL-----PKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIV 174
+K + DL P ++DWR+ GAVT VK+QG CG CWAFS V ++EG +I
Sbjct: 112 PSPMSSTEFKKINDLSDDYMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIA 171
Query: 175 TGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTK 234
TGNL SEQEL+DC T N GCNGG M AF +I+ GG+ +E DY Y+ ++ TC ++
Sbjct: 172 TGNLMEFSEQELLDC-TTNNYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQYTCR-SQ 229
Query: 235 GESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDH 294
++ V I+ Y VP+ E SLL+A+ QP+S+ I AS +D QFY+GG YDG+C +++H
Sbjct: 230 EKTAAVQISSYQVVPE-GETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGNCADRINH 287
Query: 295 GVAAVGYGS-TRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
V A+GYG+ G Y ++KNSWG WGE GY+++ R++G P GLC I KM+SYP
Sbjct: 288 AVTAIGYGTDEEGQKYWLLKNSWGTSWGENGYMKIIRDSGDPSGLCDIAKMSSYP 342
>gi|400180422|gb|AFP73349.1| cysteine protease [Solanum chmielewskii]
Length = 344
Score = 285 bits (728), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 156/354 (44%), Positives = 220/354 (62%), Gaps = 19/354 (5%)
Query: 1 MALSSQFKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYE 60
MA+ ILI+ FF+ S F + G S L+ +++ E WMS+ +VY+
Sbjct: 1 MAMKVDLMNILITL---FFVISIF--NTQTRGRSQPKLSVSER----HELWMSRHGRVYK 51
Query: 61 SLDEKLERFEIFKDNLRHIDETNRKIK-NYWLGLNEFADLRHEEFKEMFLGLK-PD-LAR 117
EK ERF IFK+N++ I+ N+ +Y LG+NEFAD+ +EF F GL P+
Sbjct: 52 DEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLS 111
Query: 118 RKDQSHEDFSYKDVVD--LPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVT 175
S +F D+ D +P ++DWR+ GAVT VK+QG CG CWAFS V ++EG +I T
Sbjct: 112 PSPMSSTEFKTNDLSDDDMPSNLDWRESGAVTQVKHQGQCGCCWAFSAVGSLEGAYKIAT 171
Query: 176 GNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKG 235
GNL SEQEL+DC T N GCNGG M AF +I+ GG+ +E DY Y+ ++ TC ++
Sbjct: 172 GNLMEFSEQELLDC-TTNNYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQYTCR-SQE 229
Query: 236 ESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHG 295
++ V I+ Y VP+ E SLL+A+ QP+S+ I AS +D QFYSGG YDG C +++H
Sbjct: 230 KTAAVQISSYQVVPE-GETSLLQAVTKQPVSIGIAAS-QDLQFYSGGTYDGSCADRINHA 287
Query: 296 VAAVGYGS-TRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
V A+GYG+ G Y ++KNSWG WGE G++++ R++G P GLC I KM+SYP
Sbjct: 288 VTAIGYGTDEEGQKYWLLKNSWGTSWGENGFMKIIRDSGDPSGLCDIAKMSSYP 341
>gi|400180435|gb|AFP73355.1| cysteine protease [Solanum pennellii]
Length = 344
Score = 285 bits (728), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 156/356 (43%), Positives = 221/356 (62%), Gaps = 23/356 (6%)
Query: 1 MALSSQFKTILIS--FCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKV 58
MA+ +ILI+ F IS F + AR S L+ +++ E WMS+ +V
Sbjct: 1 MAMKVDLMSILITLFFVISMFNTQTRAR-------SQPKLSVSER----HELWMSRHGRV 49
Query: 59 YESLDEKLERFEIFKDNLRHIDETNRKIK-NYWLGLNEFADLRHEEFKEMFLGLK-PD-L 115
Y+ EK ERF IFK+N++ I+ N+ +Y LG+NEFAD+ +EF F GL P+
Sbjct: 50 YKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSY 109
Query: 116 ARRKDQSHEDFSYKDVVD--LPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQI 173
S +F D+ D +P ++DWR+ GAVT VKNQG CG CWAFS V ++EG +I
Sbjct: 110 VSPSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKNQGQCGCCWAFSAVGSLEGAYKI 169
Query: 174 VTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMT 233
TGNL SEQEL+DC T N GCNGG M AF +I GG+ +E DY Y+ ++ TC +
Sbjct: 170 ATGNLMEFSEQELLDC-TTNNYGCNGGFMTNAFDFIKENGGISRESDYEYLGQQYTCR-S 227
Query: 234 KGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLD 293
+ ++ V I+ Y VP+ E SLL+A+ QP+S+ I AS +D QFY+GG YDG C +++
Sbjct: 228 QEKTAAVQISSYQVVPE-GETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCANRIN 285
Query: 294 HGVAAVGYGS-TRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
H V A+GYG+ +G Y ++KNSWG WGE G++++ R++G P GLC I K++SYP
Sbjct: 286 HAVTAIGYGTDEKGQKYWLLKNSWGTSWGEDGFMKIIRDSGNPAGLCDIAKVSSYP 341
>gi|357437717|ref|XP_003589134.1| Cysteine proteinase [Medicago truncatula]
gi|355478182|gb|AES59385.1| Cysteine proteinase [Medicago truncatula]
Length = 299
Score = 285 bits (728), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 149/288 (51%), Positives = 193/288 (67%), Gaps = 12/288 (4%)
Query: 2 ALSSQFKTILISFCISFFIRSSFARDFSIVGYS---PEDLTS---NDKLIDLFESWMSKF 55
LS K +++ SF + S A D SI+ Y P+ TS N +++ ++E W+ K
Sbjct: 5 TLSPAMKLMIVLIISSFTV--SLALDMSIISYDKTHPDKSTSKRTNKEVLTMYEEWLVKH 62
Query: 56 EKVYESLDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDL 115
K Y L EK +RFEIFKDNL+ IDE N Y LGL FADL +EE++ FLG K D
Sbjct: 63 GKSYNGLGEKDKRFEIFKDNLKFIDEHNGLNSTYRLGLTRFADLTNEEYRSKFLGTKIDP 122
Query: 116 ARRKDQ---SHEDFSYKDVVD-LPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGIN 171
RR + S + V D LP+SVDWRK+GAV VK+Q SCGSCWAFS +AAVEGIN
Sbjct: 123 NRRMKKLGGSKSNRYAPRVGDKLPESVDWRKEGAVVGVKDQASCGSCWAFSAIAAVEGIN 182
Query: 172 QIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCE 231
+IVTG+L SLSEQEL+DCD +YN GCNGGLMDYAF++I+S GG+ E+DYPY +G C+
Sbjct: 183 KIVTGDLISLSEQELVDCDTSYNEGCNGGLMDYAFEFIISNGGIDSEDDYPYKAVDGRCD 242
Query: 232 MTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFY 279
+ ++VVTI+ Y DVP E +L KA+ANQP++VA+E GR+FQ Y
Sbjct: 243 QNRKNAKVVTIDDYEDVPAYDELALQKAVANQPIAVAVEGGGREFQLY 290
>gi|357507505|ref|XP_003624041.1| Cysteine proteinase [Medicago truncatula]
gi|355499056|gb|AES80259.1| Cysteine proteinase [Medicago truncatula]
Length = 342
Score = 285 bits (728), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 150/310 (48%), Positives = 201/310 (64%), Gaps = 14/310 (4%)
Query: 44 LIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRK-IKNYWLGLNEFADLRHE 102
L + FE W +K+ VY+ + E+ + F+IFK N+ +ID N K Y L +N F D E
Sbjct: 38 LSERFEYWKTKYGVVYKDVAEQKKHFQIFKHNVAYIDYFNAAGNKPYKLAINRFVDKPIE 97
Query: 103 EFKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFS 162
+ + F F Y++V D+P +VDWRK+GAVT +KNQG CGSCWAFS
Sbjct: 98 DSDDGFE------RTTTTTPTTTFKYENVTDIPATVDWRKRGAVTPIKNQGKCGSCWAFS 151
Query: 163 TVAAVEGINQIVTGNLASLSEQELIDCDNT-YNNGCNGGLMDYAFQYIVSTGGLHKEEDY 221
VAA+EGI +I +GNL SLSEQ+L+DCD + GC+ G M AF++I+ GG+ E +Y
Sbjct: 152 AVAAIEGIQKITSGNLVSLSEQQLVDCDRSGRTKGCDNGNMINAFKFILENGGIATEANY 211
Query: 222 PY-IMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYS 280
PY + +GTC K S V I Y +VP NSEDSLLKA+ANQP+SV I+ G F+FYS
Sbjct: 212 PYKRVVKGTC---KKVSHKVQIKSYEEVPSNSEDSLLKAVANQPVSVGIDMRGM-FKFYS 267
Query: 281 GGVYDGHCGTQLDHGVAAVGYGSTR-GLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLC 339
G++ G CGT+ +H + VGYG+++ G+ Y +VKNSW +WGEKGYIR+KR+ EGLC
Sbjct: 268 SGIFTGECGTKPNHALTIVGYGTSKDGIKYWLVKNSWSKRWGEKGYIRIKRDIDAKEGLC 327
Query: 340 GINKMASYPI 349
GI SYPI
Sbjct: 328 GIAMKPSYPI 337
>gi|414589857|tpg|DAA40428.1| TPA: Vignain [Zea mays]
Length = 377
Score = 285 bits (728), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 147/330 (44%), Positives = 194/330 (58%), Gaps = 23/330 (6%)
Query: 42 DKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRH 101
D +++ FE WM + ++Y EK R E+++ N+ ++ N Y L N+FADL +
Sbjct: 48 DPMLERFEQWMGRHGRLYADAGEKQRRLEVYRRNVELVETFNSMGNGYRLADNKFADLTN 107
Query: 102 EEFKEMFLGL-KPDLARRKDQSHED----------FSYKDVVDLPKSVDWRKKGAVTHVK 150
EEF+ LG +P S + DLPKSVDWR+KGAV VK
Sbjct: 108 EEFRAKMLGFGRPRSGGGAGHSTAPSTVACIGSGLMGRQGYSDLPKSVDWREKGAVAPVK 167
Query: 151 NQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIV 210
+QG CGSCWAFS VAA+EGINQI G L SLSEQEL+DCD T GC GG M +AF++++
Sbjct: 168 SQGDCGSCWAFSAVAAIEGINQIKNGKLVSLSEQELVDCD-TKAIGCAGGYMSWAFEFVM 226
Query: 211 STGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIE 270
GL E +YPY G C+ K + V+I+GY +V +SE LL+A A QP+SVA++
Sbjct: 227 KNRGLTTERNYPYQGLNGACQTPKLKESAVSISGYMNVTPSSEPDLLRAAAAQPVSVAVD 286
Query: 271 ASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTR-----------GLDYIIVKNSWGPK 319
A +Q Y GGV+ G C +L+HGV VGYG T+ G Y IVKNSWGP+
Sbjct: 287 AGSFVWQLYGGGVFTGPCTAELNHGVTVVGYGETQGDTDGDGSGVPGKKYWIVKNSWGPE 346
Query: 320 WGEKGYIRMKRNTGKPEGLCGINKMASYPI 349
WG+ GYI M+R GLCGI + SYP+
Sbjct: 347 WGDAGYILMQREASVASGLCGIAMLPSYPV 376
>gi|400180426|gb|AFP73351.1| cysteine protease [Solanum corneliomuelleri]
Length = 344
Score = 285 bits (728), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 155/354 (43%), Positives = 220/354 (62%), Gaps = 19/354 (5%)
Query: 1 MALSSQFKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYE 60
MA+ ILI+ FF+ S F + G S L+ +++ E WMS+ +VY+
Sbjct: 1 MAMKVDLMNILITL---FFVISMF--NTQTRGRSQPKLSVSER----HELWMSRHGRVYK 51
Query: 61 SLDEKLERFEIFKDNLRHIDETNRKIK-NYWLGLNEFADLRHEEFKEMFLGLK-PD-LAR 117
EK ERF IFK+N++ I+ N+ +Y LG+NEFAD+ +EF F GL P+
Sbjct: 52 DEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLS 111
Query: 118 RKDQSHEDFSYKDVVD--LPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVT 175
S +F D+ D +P ++DWR+ GAVT VK+QG CG CWAFS V ++EG +I T
Sbjct: 112 PSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIAT 171
Query: 176 GNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKG 235
GNL SEQEL+DC T N GCNGG M AF +I+ GG+ +E DY Y+ ++ TC ++
Sbjct: 172 GNLMEFSEQELLDC-TTNNYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQYTCR-SQE 229
Query: 236 ESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHG 295
++ V I+ Y VP+ E SLL+A+ QP+S+ I AS +D QFY+GG YDG C +++H
Sbjct: 230 KTAAVQISSYQVVPE-GETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHA 287
Query: 296 VAAVGYGS-TRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
V A+GYG+ G Y ++KNSWG WGE G++++ R++G P GLC I KM+SYP
Sbjct: 288 VTAIGYGTDENGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|400180385|gb|AFP73331.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 285 bits (728), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 156/354 (44%), Positives = 220/354 (62%), Gaps = 19/354 (5%)
Query: 1 MALSSQFKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYE 60
MA+ ILI+ FF+ S F + G S L+ +++ E WMS+ +VY+
Sbjct: 1 MAMKVDLMNILITL---FFVISMF--NTQTRGRSQPKLSVSER----HELWMSRHGRVYK 51
Query: 61 SLDEKLERFEIFKDNLRHIDETNRKIK-NYWLGLNEFADLRHEEFKEMFLGLK-PD-LAR 117
EK ERF IFK+N++ I+ N+ +Y LG+NEFAD+ +EF F GL P+
Sbjct: 52 DEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLS 111
Query: 118 RKDQSHEDFSYKDVVD--LPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVT 175
S +F D+ D +P ++DWR+ GAVT VK+QG CG CWAFS V ++EG +I T
Sbjct: 112 PSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIAT 171
Query: 176 GNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKG 235
GNL SEQEL+DC T N GCNGG M AF +I GG+ +E DY Y+ E+ TC ++
Sbjct: 172 GNLMEFSEQELLDC-TTNNYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCR-SQE 229
Query: 236 ESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHG 295
++ V I+ Y VP+ E SLL+A+ QP+S+ I AS +D QFY+GG YDG C +++H
Sbjct: 230 KTAAVQISSYQVVPE-GETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHA 287
Query: 296 VAAVGYGS-TRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
V A+GYG+ +G Y ++KNSWG WGE G++++ R++G P GLC I KM+SYP
Sbjct: 288 VTAIGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGDPSGLCDITKMSSYP 341
>gi|400180403|gb|AFP73340.1| cysteine protease [Solanum peruvianum]
gi|400180413|gb|AFP73345.1| cysteine protease [Solanum peruvianum]
gi|400180415|gb|AFP73346.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 284 bits (727), Expect = 4e-74, Method: Compositional matrix adjust.
Identities = 156/354 (44%), Positives = 220/354 (62%), Gaps = 19/354 (5%)
Query: 1 MALSSQFKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYE 60
MA+ ILI+ FF+ S F + G S L+ +++ E WMS+ +VY+
Sbjct: 1 MAMKVDLMNILITL---FFVISMF--NTQTRGRSQPKLSVSER----HELWMSRHGRVYK 51
Query: 61 SLDEKLERFEIFKDNLRHIDETNRKIK-NYWLGLNEFADLRHEEFKEMFLGLK-PD-LAR 117
EK ERF IFK+N++ I+ N+ +Y LG+NEFAD+ +EF F GL P+
Sbjct: 52 DEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLS 111
Query: 118 RKDQSHEDFSYKDVVD--LPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVT 175
S +F D+ D +P ++DWR+ GAVT VK+QG CG CWAFS V ++EG +I T
Sbjct: 112 PSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIAT 171
Query: 176 GNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKG 235
GNL SEQEL+DC T N GCNGG M AF +I GG+ +E DY Y+ E+ TC ++
Sbjct: 172 GNLMEFSEQELLDC-TTNNYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCR-SQE 229
Query: 236 ESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHG 295
++ V I+ Y VP+ E SLL+A+ QP+S+ I AS +D QFY+GG YDG C +++H
Sbjct: 230 KTAAVQISSYQVVPE-GETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHA 287
Query: 296 VAAVGYGS-TRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
V A+GYG+ +G Y ++KNSWG WGE G++++ R++G P GLC I KM+SYP
Sbjct: 288 VTAIGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPAGLCDIAKMSSYP 341
>gi|400180383|gb|AFP73330.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 284 bits (727), Expect = 4e-74, Method: Compositional matrix adjust.
Identities = 155/354 (43%), Positives = 221/354 (62%), Gaps = 19/354 (5%)
Query: 1 MALSSQFKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYE 60
MA+ ILI+ FF+ S F + G S L+ +++ E WMS+ +VY+
Sbjct: 1 MAMKVDLMNILITL---FFVISMF--NTQTRGRSQPKLSVSER----HELWMSRHGRVYK 51
Query: 61 SLDEKLERFEIFKDNLRHIDETNRKIK-NYWLGLNEFADLRHEEFKEMFLGLK-PD-LAR 117
EK ERF IFK+N++ I+ N+ +Y LG+NEFAD+ +EF F GL P+
Sbjct: 52 DEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLS 111
Query: 118 RKDQSHEDFSYKDVVD--LPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVT 175
S +F D+ D +P ++DWR+ GAVT VK+QG CG CWAFS V ++EG +I T
Sbjct: 112 PSPMSSTEFIINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIAT 171
Query: 176 GNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKG 235
GNL SEQEL+DC T N GCNGG M AF +I+ GG+ +E DY Y+ ++ TC ++
Sbjct: 172 GNLMEFSEQELLDC-TTNNYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQYTCR-SQE 229
Query: 236 ESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHG 295
++ V I+ Y VP+ E SLL+A+ QP+S+ I AS +D QFY+GG YDG C +++H
Sbjct: 230 KTAAVQISSYKVVPE-GETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHA 287
Query: 296 VAAVGYGS-TRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
V A+GYG+ +G Y ++KNSWG WGE G++++ R++G P GLC I KM+SYP
Sbjct: 288 VTAIGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|400180357|gb|AFP73317.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 284 bits (726), Expect = 5e-74, Method: Compositional matrix adjust.
Identities = 156/356 (43%), Positives = 221/356 (62%), Gaps = 23/356 (6%)
Query: 1 MALSSQFKTILIS--FCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKV 58
MA+ +ILI+ F IS F + AR S L+ +++ E WMS+ +V
Sbjct: 1 MAMKVDLMSILITLFFVISMFNSQTRAR-------SQPKLSVSER----HELWMSRHGRV 49
Query: 59 YESLDEKLERFEIFKDNLRHIDETNRKIK-NYWLGLNEFADLRHEEFKEMFLGLK-PD-L 115
Y+ EK ERF IFK+N++ I+ N+ +Y LG+NEFAD+ +EF F GL P+
Sbjct: 50 YKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSY 109
Query: 116 ARRKDQSHEDFSYKDVVD--LPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQI 173
S +F D+ D +P ++DWR+ GAVT VK+QG CG CWAFS V ++EG +I
Sbjct: 110 LSPSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKI 169
Query: 174 VTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMT 233
TGNL SEQEL+DC T N GCNGG M AF +I+ GG+ +E DY Y+ ++ TC +
Sbjct: 170 ATGNLMEFSEQELLDC-TTNNYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQYTCR-S 227
Query: 234 KGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLD 293
+ ++ V I+ Y VP+ E SLL+A+ QP+S+ I AS +D QFY+GG YDG C +++
Sbjct: 228 QEKTAAVQISSYKVVPEG-ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRIN 285
Query: 294 HGVAAVGYGS-TRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
H V A+GYG+ +G Y ++KNSWG WGE G++++ R+ G P GLC I KM+SYP
Sbjct: 286 HAVTAIGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDYGNPSGLCDIAKMSSYP 341
>gi|400180455|gb|AFP73364.1| cysteine protease [Solanum peruvianum]
gi|400180459|gb|AFP73366.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 284 bits (726), Expect = 5e-74, Method: Compositional matrix adjust.
Identities = 156/354 (44%), Positives = 220/354 (62%), Gaps = 19/354 (5%)
Query: 1 MALSSQFKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYE 60
MA+ ILI+ FF+ S F + G S L+ +++ E WMS+ +VY+
Sbjct: 1 MAMKVDLMNILITL---FFVISMF--NTQTRGRSQPKLSVSER----HELWMSRHGRVYK 51
Query: 61 SLDEKLERFEIFKDNLRHIDETNRKIK-NYWLGLNEFADLRHEEFKEMFLGLK-PD-LAR 117
EK ERF IFK+N++ I+ N+ +Y LG+NEFAD+ +EF F GL P+
Sbjct: 52 DEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLS 111
Query: 118 RKDQSHEDFSYKDVVD--LPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVT 175
S +F D+ D +P ++DWR+ GAVT VK+QG CG CWAFS V ++EG +I T
Sbjct: 112 PSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIAT 171
Query: 176 GNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKG 235
GNL SEQEL+DC T N GCNGG M AF +I GG+ +E DY Y+ E+ TC ++
Sbjct: 172 GNLMEFSEQELLDC-TTNNYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCR-SQE 229
Query: 236 ESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHG 295
++ V I+ Y VP+ E SLL+A+ QP+S+ I AS +D QFY+GG YDG C +++H
Sbjct: 230 KTAAVQISSYQVVPE-GETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHA 287
Query: 296 VAAVGYGS-TRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
V A+GYG+ +G Y ++KNSWG WGE G++++ R++G P GLC I KM+SYP
Sbjct: 288 VTAIGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGDPSGLCDIAKMSSYP 341
>gi|115479933|ref|NP_001063560.1| Os09g0497500 [Oryza sativa Japonica Group]
gi|113631793|dbj|BAF25474.1| Os09g0497500 [Oryza sativa Japonica Group]
gi|215704298|dbj|BAG93138.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 349
Score = 284 bits (726), Expect = 5e-74, Method: Compositional matrix adjust.
Identities = 150/331 (45%), Positives = 191/331 (57%), Gaps = 22/331 (6%)
Query: 38 LTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFA 97
L D ++D FE WM + + Y EK RFE+++ N+ ++ N Y L N+FA
Sbjct: 21 LARADLMLDRFEQWMIRHGRAYTDAGEKQRRFEVYRRNVELVETFNSMSNGYKLADNKFA 80
Query: 98 DLRHEEFKEMFLGLKPDLARRK-------DQSHEDFSYKDVVDLPKSVDWRKKGAVTHVK 150
DL +EEF+ LG +P + + D + S D+ LPKSVDWRKKGAV VK
Sbjct: 81 DLTNEEFRAKMLGFRPHVTIPQISNTCSADIAMPGESSDDI--LPKSVDWRKKGAVVEVK 138
Query: 151 NQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIV 210
NQG CGSCWAFS VAA+EGINQI G L SLSEQEL+DCD+ GC GG M +AF+++V
Sbjct: 139 NQGDCGSCWAFSAVAAIEGINQIKNGELVSLSEQELVDCDDE-AVGCGGGYMSWAFEFVV 197
Query: 211 STGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIE 270
GL E YPY G C+ K V I GY +V +SE L +A A QP+SVA++
Sbjct: 198 GNHGLTTEASYPYHAANGACQAAKLNQSAVAIAGYRNVTPSSEPDLARAAAAQPVSVAVD 257
Query: 271 ASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTR-----------GLDYIIVKNSWGPK 319
FQ Y GVY G C ++HGV VGYG + G Y IVKNSWG +
Sbjct: 258 GGSFMFQLYGSGVYTGPCTADVNHGVTVVGYGESEPKTDGGGAAKGGEKYWIVKNSWGAE 317
Query: 320 WGEKGYIRMKRN-TGKPEGLCGINKMASYPI 349
WG+ GYI M+R+ G GLCGI + SYP+
Sbjct: 318 WGDAGYILMQRDVAGLASGLCGIALLPSYPV 348
>gi|400180361|gb|AFP73319.1| cysteine protease [Solanum peruvianum]
gi|400180397|gb|AFP73337.1| cysteine protease [Solanum peruvianum]
gi|400180401|gb|AFP73339.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 284 bits (726), Expect = 5e-74, Method: Compositional matrix adjust.
Identities = 156/354 (44%), Positives = 220/354 (62%), Gaps = 19/354 (5%)
Query: 1 MALSSQFKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYE 60
MA+ ILI+ FF+ S F + G S L+ +++ E WMS+ +VY+
Sbjct: 1 MAMKVDLMNILITL---FFVISMF--NTQTRGRSQPKLSVSER----HELWMSRHGRVYK 51
Query: 61 SLDEKLERFEIFKDNLRHIDETNRKIK-NYWLGLNEFADLRHEEFKEMFLGLK-PD-LAR 117
EK ERF IFK+N++ I+ N+ +Y LG+NEFAD+ +EF F GL P+
Sbjct: 52 DEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLS 111
Query: 118 RKDQSHEDFSYKDVVD--LPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVT 175
S +F D+ D +P ++DWR+ GAVT VK+QG CG CWAFS V ++EG +I T
Sbjct: 112 PSPMSSTEFIINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIAT 171
Query: 176 GNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKG 235
GNL SEQEL+DC T N GCNGG M AF +I GG+ +E DY Y+ E+ TC ++
Sbjct: 172 GNLMEFSEQELLDC-TTNNYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCR-SQE 229
Query: 236 ESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHG 295
++ V I+ Y VP+ E SLL+A+ QP+S+ I AS +D QFY+GG YDG C +++H
Sbjct: 230 KTAAVQISSYQVVPE-GETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHA 287
Query: 296 VAAVGYGS-TRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
V A+GYG+ +G Y ++KNSWG WGE G++++ R++G P GLC I KM+SYP
Sbjct: 288 VTAIGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGDPSGLCDITKMSSYP 341
>gi|400180463|gb|AFP73368.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 283 bits (725), Expect = 6e-74, Method: Compositional matrix adjust.
Identities = 157/356 (44%), Positives = 220/356 (61%), Gaps = 23/356 (6%)
Query: 1 MALSSQFKTILIS--FCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKV 58
MA+ ILI+ F IS F + AR S L+ +++ E WMS+ +V
Sbjct: 1 MAMKVDLMNILITLFFVISMFNTQTRAR-------SQPKLSVSER----HELWMSRHGRV 49
Query: 59 YESLDEKLERFEIFKDNLRHIDETNRKIK-NYWLGLNEFADLRHEEFKEMFLGLK-PD-L 115
Y+ EK ERF IFK+N++ I+ N+ +Y LG+NEFAD+ +EF F GL P+
Sbjct: 50 YKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSY 109
Query: 116 ARRKDQSHEDFSYKDVVD--LPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQI 173
S +F D+ D +P ++DWR+ GAVT VK+QG CG CWAFS V ++EG +I
Sbjct: 110 LSPSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKI 169
Query: 174 VTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMT 233
TGNL SEQEL+DC T N GCNGG M AF +I GG+ +E DY Y+ E+ TC +
Sbjct: 170 ATGNLMEFSEQELLDC-TTNNYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCR-S 227
Query: 234 KGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLD 293
+ ++ V I+ Y VP+ E SLL+A+ QP+S+ I AS +D QFY+GG YDG C +++
Sbjct: 228 QEKTAAVQISSYQVVPE-GETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRIN 285
Query: 294 HGVAAVGYGS-TRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
H V A+GYG+ +G Y ++KNSWG WGE G++++ R++G P GLC I KM+SYP
Sbjct: 286 HAVTAIGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGDPSGLCDIAKMSSYP 341
>gi|400180347|gb|AFP73312.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 283 bits (725), Expect = 6e-74, Method: Compositional matrix adjust.
Identities = 154/354 (43%), Positives = 221/354 (62%), Gaps = 19/354 (5%)
Query: 1 MALSSQFKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYE 60
MA+ ILI+ FF+ S F + G S L+ +++ E WMS+ +VY+
Sbjct: 1 MAMKVDLMNILITL---FFVISMF--NTQTRGRSQPKLSVSER----HELWMSRHGRVYK 51
Query: 61 SLDEKLERFEIFKDNLRHIDETNRKIK-NYWLGLNEFADLRHEEFKEMFLGLK-PD-LAR 117
EK ERF IFK+N++ I+ N+ +Y LG+NEFAD+ +EF F GL P+
Sbjct: 52 DEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLS 111
Query: 118 RKDQSHEDFSYKDVVD--LPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVT 175
S +F D+ D +P ++DWR+ GAVT VK+QG CG CWAFS V ++EG +I T
Sbjct: 112 PSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIAT 171
Query: 176 GNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKG 235
GNL SEQEL+DC T N GC+GG M AF +I+ GG+ +E DY Y+ ++ TC ++
Sbjct: 172 GNLMEFSEQELLDC-TTNNYGCDGGFMTNAFDFIIENGGISRESDYEYLGQQYTCR-SQE 229
Query: 236 ESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHG 295
++ V I+ Y VP+ E SLL+A+ QP+S+ I AS +D QFY+GG YDG C +++H
Sbjct: 230 KTAAVQISSYQVVPE-GETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHA 287
Query: 296 VAAVGYGS-TRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
V A+GYG+ +G Y ++KNSWG WGE G++++ R++G P GLC I KM+SYP
Sbjct: 288 VTAIGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|400180371|gb|AFP73324.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 283 bits (725), Expect = 6e-74, Method: Compositional matrix adjust.
Identities = 156/354 (44%), Positives = 220/354 (62%), Gaps = 19/354 (5%)
Query: 1 MALSSQFKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYE 60
MA+ ILI+ FF+ S F + G S L+ +++ E WMS+ +VY+
Sbjct: 1 MAMKVDLMNILITL---FFVISMF--NTQTRGRSQPKLSVSER----HELWMSRHGRVYK 51
Query: 61 SLDEKLERFEIFKDNLRHIDETNRKIK-NYWLGLNEFADLRHEEFKEMFLGLK-PD-LAR 117
EK ERF IFK+N++ I+ N+ +Y LG+NEFAD+ +EF F GL P+
Sbjct: 52 DEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLS 111
Query: 118 RKDQSHEDFSYKDVVD--LPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVT 175
S +F D+ D +P ++DWR+ GAVT VK+QG CG CWAFS V ++EG +I T
Sbjct: 112 PSPMSSTEFIINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIAT 171
Query: 176 GNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKG 235
GNL SEQEL+DC T N GCNGG M AF +I GG+ +E DY Y+ E+ TC ++
Sbjct: 172 GNLMEFSEQELLDC-TTNNYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCR-SQE 229
Query: 236 ESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHG 295
++ V I+ Y VP+ E SLL+A+ QP+S+ I AS +D QFY+GG YDG C +++H
Sbjct: 230 KTAAVQISSYQVVPE-GETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHA 287
Query: 296 VAAVGYGS-TRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
V A+GYG+ +G Y ++KNSWG WGE G++++ R++G P GLC I KM+SYP
Sbjct: 288 VTAIGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|424513619|emb|CCO66241.1| predicted protein [Bathycoccus prasinos]
Length = 396
Score = 283 bits (725), Expect = 6e-74, Method: Compositional matrix adjust.
Identities = 152/333 (45%), Positives = 205/333 (61%), Gaps = 27/333 (8%)
Query: 43 KLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIK----NYWLGLNEFAD 98
K+ D F++W+ K++K + +E+L+R +IF +N + E N K ++++ +N+FA
Sbjct: 67 KIEDAFDAWLVKYDKEIANAEERLKRLKIFGENYLFVLEHNAKYVAGKVSHYVEMNKFAA 126
Query: 99 LRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDV-------VDLPKSVDWRKKGAVTHVKN 151
EE+++M LG K L R+KD + KDV V+ P+S+DW +G +T KN
Sbjct: 127 HTREEYRKM-LGFKKSLRRKKDSGE---AAKDVSLWEYEGVEAPESIDWVDEGVITTPKN 182
Query: 152 QGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIV 210
QGSCGSCWAFS + AVEGIN I TG L SLSEQEL+ C N GCNGGLMD AF++IV
Sbjct: 183 QGSCGSCWAFSAIGAVEGINAIRTGKLVSLSEQELVSCAREGGNQGCNGGLMDNAFEWIV 242
Query: 211 STGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIE 270
GG+ E+ Y Y C+ K + +I+G++DVP N E +L KA++ QP+SVAIE
Sbjct: 243 ENGGVDSEKQYQYKASFDDCKTRKTLLHIASIDGFNDVPSNDETALKKAVSQQPVSVAIE 302
Query: 271 ASGRDFQFYSGGVYDGH-CGTQLDHGVAAVGYGSTRGLDYII----------VKNSWGPK 319
A R FQ Y GGVY CGTQLDHGV VGYG +I +KNSW +
Sbjct: 303 ADQRSFQLYGGGVYHAEDCGTQLDHGVLVVGYGIDHNSSNVIIPGATKKYWKIKNSWSEQ 362
Query: 320 WGEKGYIRMKRNTGKPEGLCGINKMASYPIKKK 352
WGE GYIR+ R+ P G+CG+ +MASYP K K
Sbjct: 363 WGEGGYIRIARDVESPSGMCGVAEMASYPEKTK 395
>gi|400180419|gb|AFP73348.1| cysteine protease [Solanum lycopersicoides]
Length = 343
Score = 283 bits (725), Expect = 7e-74, Method: Compositional matrix adjust.
Identities = 156/355 (43%), Positives = 217/355 (61%), Gaps = 22/355 (6%)
Query: 1 MALSSQFKTILIS--FCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKV 58
MA+ +ILI+ F IS F + AR S L+ +++ E WMS+ +V
Sbjct: 1 MAMKIDLMSILITLFFVISMFNSQTTAR-------SQPKLSVSER----HELWMSRHGRV 49
Query: 59 YESLDEKLERFEIFKDNLRHIDETNRKIK-NYWLGLNEFADLRHEEFKEMFLGLK-PDLA 116
Y+ EK ERF IFK+N++ I+ N+ +Y LG+NEFAD+ EEF F G+ P
Sbjct: 50 YKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGINEFADITSEEFLTKFTGINIPSYL 109
Query: 117 RRKDQSHEDFSYKDVVD--LPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIV 174
S +F D+ D +P ++DWR+ GAVT VKNQG CG CWAFS V ++EG +I
Sbjct: 110 SPSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKNQGQCGCCWAFSAVGSLEGAYKIA 169
Query: 175 TGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTK 234
TGNL SEQEL+DC T N GCNGG M AF +I GG+ E DY Y ++ TC ++
Sbjct: 170 TGNLMEFSEQELLDC-TTNNYGCNGGFMTNAFDFIKENGGISSESDYEYQGQQYTCR-SQ 227
Query: 235 GESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDH 294
++ V I+ Y VP+ E SLL+A+ QP+S+ I AS +D QFY+GG YDG C +++H
Sbjct: 228 EKTAAVQISSYQVVPEG-ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINH 285
Query: 295 GVAAVGYGS-TRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
V A+GYG+ +G Y ++KNSWG WGE G++++ R++G P G C I KM+SYP
Sbjct: 286 AVTAIGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPGGHCDIAKMSSYP 340
>gi|330803818|ref|XP_003289899.1| hypothetical protein DICPUDRAFT_154350 [Dictyostelium purpureum]
gi|325080010|gb|EGC33584.1| hypothetical protein DICPUDRAFT_154350 [Dictyostelium purpureum]
Length = 326
Score = 283 bits (725), Expect = 7e-74, Method: Compositional matrix adjust.
Identities = 159/342 (46%), Positives = 206/342 (60%), Gaps = 25/342 (7%)
Query: 10 ILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERF 69
+ + FC S AR FS Y F++WM K +K Y + DE R+
Sbjct: 5 LALIFCFLIINCCSAARIFSQKQYQTA-----------FQNWMVKHQKSYTN-DEFGSRY 52
Query: 70 EIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYK 129
+F+DN+ + + N+K N LGLN ADL +EEFK+++LG K ++ +K +
Sbjct: 53 SVFQDNMDIVAKWNQKGSNTILGLNVMADLTNEEFKKLYLGTKANVTYKKK------TLV 106
Query: 130 DVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDC 189
V LP SVDWR GAVT VKNQG CG C+AFST +VEGI++I + L LSEQ+++DC
Sbjct: 107 GVSGLPASVDWRANGAVTAVKNQGQCGGCYAFSTTGSVEGIHEITSQQLVPLSEQQILDC 166
Query: 190 DNTY-NNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDV 248
+ NNGC+GGLM +F+YI++ GGL E YPY E G C+ K ++ TI GY +V
Sbjct: 167 SGSEGNNGCDGGLMTNSFEYIIAVGGLDTEASYPYTGEVGKCKFNK-KNIGATITGYKNV 225
Query: 249 PQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGV-YDGHC-GTQLDHGVAAVGYGSTRG 306
SE L A+A QP+SVAI+AS FQ Y+ GV Y+ C TQLDHGV AVGYGS G
Sbjct: 226 ESGSESDLQTAVAAQPVSVAIDASQSSFQLYASGVYYEPECSSTQLDHGVLAVGYGSQSG 285
Query: 307 LDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
DY IVKNSWG WGE G+I M RN + CGI MAS+P
Sbjct: 286 QDYWIVKNSWGADWGENGFILMARN---KDNNCGIATMASFP 324
>gi|400180399|gb|AFP73338.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 283 bits (725), Expect = 7e-74, Method: Compositional matrix adjust.
Identities = 156/354 (44%), Positives = 220/354 (62%), Gaps = 19/354 (5%)
Query: 1 MALSSQFKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYE 60
MA+ ILI+ FF+ S F + G S L+ +++ E WMS+ +VY+
Sbjct: 1 MAMKVDLMNILITL---FFVISMF--NTQTRGRSQPKLSVSER----HELWMSRHGRVYK 51
Query: 61 SLDEKLERFEIFKDNLRHIDETNRKIK-NYWLGLNEFADLRHEEFKEMFLGLK-PD-LAR 117
EK ERF IFK+N++ I+ N+ +Y LG+NEFAD+ +EF F GL P+
Sbjct: 52 DEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLS 111
Query: 118 RKDQSHEDFSYKDVVD--LPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVT 175
S +F D+ D +P ++DWR+ GAVT VK+QG CG CWAFS V ++EG +I T
Sbjct: 112 PSPMSSTEFIINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIAT 171
Query: 176 GNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKG 235
GNL SEQEL+DC T N GCNGG M AF +I GG+ +E DY Y+ E+ TC ++
Sbjct: 172 GNLMEFSEQELLDC-TTNNYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCR-SQE 229
Query: 236 ESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHG 295
++ V I+ Y VP+ E SLL+A+ QP+S+ I AS +D QFY+GG YDG C +++H
Sbjct: 230 KTAAVQISSYQVVPE-GETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHA 287
Query: 296 VAAVGYGS-TRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
V A+GYG+ +G Y ++KNSWG WGE G++++ R++G P GLC I KM+SYP
Sbjct: 288 VTAIGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGDPSGLCDIAKMSSYP 341
>gi|357477225|ref|XP_003608898.1| Cysteine proteinase, partial [Medicago truncatula]
gi|355509953|gb|AES91095.1| Cysteine proteinase, partial [Medicago truncatula]
Length = 260
Score = 283 bits (725), Expect = 7e-74, Method: Compositional matrix adjust.
Identities = 141/262 (53%), Positives = 181/262 (69%), Gaps = 21/262 (8%)
Query: 93 LNEFADLRHEEFKEMFLGLKPDLARR-KDQSHED--FSYKDVVDLPKSVDWRKKGAVTHV 149
LN+FAD+ + EF+ ++ K + R + SH++ F Y++V +P S+DWRK GAVT V
Sbjct: 2 LNKFADMTNYEFRSIYADSKVNHHRMFRGMSHDNGPFMYENVEGVPSSIDWRKIGAVTGV 61
Query: 150 KNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYI 209
K+QG CGSCWAFST+ AVEGINQI T L SLSEQEL+DCD N GCNGGLM+YAF++I
Sbjct: 62 KDQGQCGSCWAFSTIVAVEGINQIKTQKLVSLSEQELVDCDTEVNQGCNGGLMEYAFEFI 121
Query: 210 VSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAI 269
G+ E +YPY ++GTC + K V+I+G+ +VP N+E +LLKA ANQP+SVAI
Sbjct: 122 -KQNGITTETNYPYAAKDGTCNIQKENKPAVSIDGHENVPANNEKALLKAAANQPISVAI 180
Query: 270 EASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMK 329
+A G DFQFYS GV+ GHCGT+L+HGV NSWG +WGE+GYIRM+
Sbjct: 181 DAGGSDFQFYSEGVFTGHCGTELNHGV-----------------NSWGSEWGEQGYIRMQ 223
Query: 330 RNTGKPEGLCGINKMASYPIKK 351
R +GLCGI ASYPIKK
Sbjct: 224 RAISHKQGLCGIAMEASYPIKK 245
>gi|400180437|gb|AFP73356.1| cysteine protease [Solanum pennellii]
Length = 337
Score = 283 bits (724), Expect = 8e-74, Method: Compositional matrix adjust.
Identities = 153/350 (43%), Positives = 217/350 (62%), Gaps = 18/350 (5%)
Query: 1 MALSSQFKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYE 60
MA+ +ILI+ FF+ S F + G S L+ +++ E WMS+ +VY+
Sbjct: 1 MAMKVDLMSILITL---FFVISMF--NTQTRGRSQPKLSVSER----HELWMSRHGRVYK 51
Query: 61 SLDEKLERFEIFKDNLRHIDETNRKIK-NYWLGLNEFADLRHEEFKEMFLGLKPDLARRK 119
EK ERF IFK+N++ I+ N+ +Y LG+NEFAD+ +EF F GL +
Sbjct: 52 DEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLS 111
Query: 120 DQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLA 179
D S D +P ++DWR+ GAVT VKNQG CG CWAFS V ++EG +I TGNL
Sbjct: 112 PSPINDLSDDD---MPSNLDWRESGAVTQVKNQGQCGCCWAFSAVGSLEGAYKIATGNLM 168
Query: 180 SLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEV 239
SEQEL+DC T N GCNGG M AF +I GG+ +E DY Y+ ++ TC ++ ++
Sbjct: 169 EFSEQELLDC-TTNNYGCNGGFMTNAFDFIKENGGISRESDYEYLGQQYTCR-SQEKTAA 226
Query: 240 VTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAV 299
V I+ Y VP+ E SLL+A+ QP+S+ I AS +D QFY+GG YDG C +++H V A+
Sbjct: 227 VQISSYQVVPE-GETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCANRINHAVTAI 284
Query: 300 GYGS-TRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
GYG+ +G Y ++KNSWG WGE G++++ R++G P GLC I K++SYP
Sbjct: 285 GYGTDEKGQKYWLLKNSWGTSWGEDGFMKIIRDSGNPAGLCDIAKVSSYP 334
>gi|400180359|gb|AFP73318.1| cysteine protease [Solanum peruvianum]
gi|400180477|gb|AFP73375.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 283 bits (724), Expect = 9e-74, Method: Compositional matrix adjust.
Identities = 157/356 (44%), Positives = 220/356 (61%), Gaps = 23/356 (6%)
Query: 1 MALSSQFKTILIS--FCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKV 58
MA+ +ILI+ F IS F + AR S L+ +++ E WMS+ +V
Sbjct: 1 MAMKVDLMSILITLFFVISMFNSQTRAR-------SQPKLSVSER----HELWMSRHGRV 49
Query: 59 YESLDEKLERFEIFKDNLRHIDETNRKIK-NYWLGLNEFADLRHEEFKEMFLGLK-PD-L 115
Y+ EK ERF IFK+N++ I+ N+ +Y LG+NEFAD+ +EF F GL P+
Sbjct: 50 YKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSY 109
Query: 116 ARRKDQSHEDFSYKDVVD--LPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQI 173
S +F D+ D +P ++DWR+ GAVT VK+QG CG CWAFS V ++EG +I
Sbjct: 110 LSPSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKI 169
Query: 174 VTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMT 233
TGNL SEQEL+DC T N GCNGG M AF +I GG+ +E DY Y+ E+ TC +
Sbjct: 170 ATGNLMEFSEQELLDC-TTNNYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCR-S 227
Query: 234 KGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLD 293
+ ++ V I+ Y VP+ E SLL+A+ QP+S+ I AS +D QFY+GG YDG C +++
Sbjct: 228 QEKTAAVQISSYQVVPE-GETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRIN 285
Query: 294 HGVAAVGYGS-TRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
H V A+GYG+ +G Y ++KNSWG WGE G++++ R+ G P GLC I KM+SYP
Sbjct: 286 HAVTAIGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDYGNPAGLCDIAKMSSYP 341
>gi|400180447|gb|AFP73360.1| cysteine protease [Solanum chilense]
Length = 345
Score = 283 bits (724), Expect = 9e-74, Method: Compositional matrix adjust.
Identities = 153/355 (43%), Positives = 217/355 (61%), Gaps = 20/355 (5%)
Query: 1 MALSSQFKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYE 60
MA+ ILI+ FF+ S F + G S L+ +++ E WMS+ +VY+
Sbjct: 1 MAMKVDLMNILITL---FFVISMF--NTQTRGRSQPKLSVSER----HELWMSRHGRVYK 51
Query: 61 SLDEKLERFEIFKDNLRHIDETNRKIK-NYWLGLNEFADLRHEEFKEMFLGLKPDLARRK 119
EK ERF IFK+N++ I+ N+ +Y LG+NEFAD+ +EF F GL +
Sbjct: 52 DEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLS 111
Query: 120 DQSHEDFSYKDVVDL-----PKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIV 174
+K + DL P ++DWR+ GAVT VK+QG CG CWAFS V ++EG +I
Sbjct: 112 PSPMSSTEFKKINDLSDDDMPSNLDWRESGAVTQVKHQGQCGCCWAFSAVGSLEGAYKIA 171
Query: 175 TGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTK 234
TG L SEQEL+DC T N GCNGG M AF +I+ GG+ +E DY Y+ E+ TC ++
Sbjct: 172 TGKLMEFSEQELLDC-TTNNYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCR-SQ 229
Query: 235 GESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDH 294
++ V I+ Y VP+ E SLL+A+ QP+S+ I AS +D QFY+GG YDG C +++H
Sbjct: 230 EKTAAVQISSYQVVPE-GETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINH 287
Query: 295 GVAAVGYGS-TRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
V A+GYG+ +G Y ++KNSWG WGE G++++ R++G P GLC I KM+SYP
Sbjct: 288 AVTAIGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 342
>gi|167521499|ref|XP_001745088.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163776702|gb|EDQ90321.1| predicted protein [Monosiga brevicollis MX1]
Length = 294
Score = 283 bits (723), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 148/302 (49%), Positives = 189/302 (62%), Gaps = 16/302 (5%)
Query: 53 SKFEKVYESLDEKLERFEIFKDNLRHIDETN----RKIKNYWLGLNEFADLRHEEFKEMF 108
S + K YES + +R F+ NL I++ N + + +Y +G+NEFADL +EF ++
Sbjct: 3 SDYSKSYESEAVEAKRLAAFEANLEFINKHNAEHAQGLHSYTVGVNEFADLTIDEFMALY 62
Query: 109 LGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVE 168
+ K + + + + +D SVDWR KGAVT +KNQG CGSCW+FST + E
Sbjct: 63 VPSKFNRTMPYNTVYLPATSED------SVDWRTKGAVTPIKNQGQCGSCWSFSTTGSTE 116
Query: 169 GINQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEE 227
G + I TGNL SLSEQ+L+DC ++ N GCNGGLMD AF+YI+S GL EEDYPY ++
Sbjct: 117 GAHAIATGNLVSLSEQQLVDCSGSFGNQGCNGGLMDDAFKYIISNKGLDTEEDYPYTAQD 176
Query: 228 GTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGH 287
GTC K TI+ Y DVP+N+ED L A+A P+SVAIEA FQ Y GV+DG+
Sbjct: 177 GTCNKEKEAKHAATISSYSDVPKNNEDQLAAAVAKGPVSVAIEADQSGFQLYKSGVFDGN 236
Query: 288 CGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASY 347
CGT LDHGV VGY DY IVKNSWG WG +GYI MKR G+CGI SY
Sbjct: 237 CGTNLDHGVLVVGYTD----DYWIVKNSWGTTWGVEGYINMKRGV-SASGICGIAMQPSY 291
Query: 348 PI 349
PI
Sbjct: 292 PI 293
>gi|20334375|gb|AAM19208.1|AF493233_1 cysteine protease [Solanum pennellii]
Length = 337
Score = 283 bits (723), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 153/350 (43%), Positives = 217/350 (62%), Gaps = 18/350 (5%)
Query: 1 MALSSQFKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYE 60
MA+ +ILI+ FF+ S F + G S L+ +++ E WMS+ +VY+
Sbjct: 1 MAMKIDLMSILITL---FFVISMF--NTQTRGRSQPKLSVSER----HELWMSRHGRVYK 51
Query: 61 SLDEKLERFEIFKDNLRHIDETNRKIK-NYWLGLNEFADLRHEEFKEMFLGLKPDLARRK 119
EK ERF IFK+N++ I+ N+ +Y LG+NEFAD+ +EF F GL +
Sbjct: 52 DEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLS 111
Query: 120 DQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLA 179
D S D +P ++DWR+ GAVT VKNQG CG CWAFS V ++EG +I TGNL
Sbjct: 112 PSPINDLSDDD---MPSNLDWRESGAVTQVKNQGQCGCCWAFSAVGSLEGAYKIATGNLM 168
Query: 180 SLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEV 239
SEQEL+DC T N GCNGG M AF +I GG+ +E DY Y+ ++ TC ++ ++
Sbjct: 169 EFSEQELLDC-TTNNYGCNGGFMTNAFDFIKENGGISRESDYEYLGQQYTCR-SQEKTAA 226
Query: 240 VTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAV 299
V I+ Y VP+ E SLL+A+ QP+S+ I AS +D QFY+GG YDG C +++H V A+
Sbjct: 227 VQISSYQVVPE-GETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCANRINHAVTAI 284
Query: 300 GYGS-TRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
GYG+ +G Y ++KNSWG WGE G++++ R++G P GLC I K++SYP
Sbjct: 285 GYGTDEKGQKYWLLKNSWGTSWGEDGFMKIIRDSGNPAGLCDIAKVSSYP 334
>gi|400180369|gb|AFP73323.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 283 bits (723), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 155/354 (43%), Positives = 220/354 (62%), Gaps = 19/354 (5%)
Query: 1 MALSSQFKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYE 60
MA+ ILI+ FF+ S F + G S L+ +++ E WMS+ +VY+
Sbjct: 1 MAMKVDLMNILITL---FFVISMF--NTQTRGRSQPKLSVSER----HELWMSRHGRVYK 51
Query: 61 SLDEKLERFEIFKDNLRHIDETNRKIK-NYWLGLNEFADLRHEEFKEMFLGLK-PD-LAR 117
EK ERF IFK+N++ I+ N+ +Y LG+NEFAD+ +EF F GL P+
Sbjct: 52 DEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLS 111
Query: 118 RKDQSHEDFSYKDVVD--LPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVT 175
S +F D+ D +P ++DWR+ GAVT VK+QG CG CWAFS V ++EG +I T
Sbjct: 112 PSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIAT 171
Query: 176 GNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKG 235
GNL SEQEL+DC T N GCNGG M AF +I GG+ +E DY Y+ ++ TC ++
Sbjct: 172 GNLMEFSEQELLDC-TTNNYGCNGGFMTNAFDFIKENGGISRESDYEYLGQQYTCR-SQE 229
Query: 236 ESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHG 295
++ V I+ Y VP+ E SLL+A+ QP+S+ I AS +D QFY+GG YDG C +++H
Sbjct: 230 KTAAVQISSYKVVPE-GETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHA 287
Query: 296 VAAVGYGS-TRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
V A+GYG+ +G Y ++KNSWG WGE G++++ R++G P GLC I KM+SYP
Sbjct: 288 VTAIGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|400180351|gb|AFP73314.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 283 bits (723), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 156/354 (44%), Positives = 219/354 (61%), Gaps = 19/354 (5%)
Query: 1 MALSSQFKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYE 60
MA+ ILI+ FF+ S F + G S L+ +++ E WMS+ +VY+
Sbjct: 1 MAMKVDLMNILITL---FFVISMF--NTQTRGRSQPKLSVSER----HELWMSRHGRVYK 51
Query: 61 SLDEKLERFEIFKDNLRHIDETNRKIK-NYWLGLNEFADLRHEEFKEMFLGLK-PD-LAR 117
EK ERF IFK+N++ I+ N+ +Y LG+NEFAD+ +EF F GL P+
Sbjct: 52 DEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLS 111
Query: 118 RKDQSHEDFSYKDVVD--LPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVT 175
S +F D+ D +P ++DWR+ GAVT VK+QG CG CWAFS V ++EG +I T
Sbjct: 112 PSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIAT 171
Query: 176 GNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKG 235
GNL SEQEL+DC T N GCNGG M AF +I GG+ +E DY Y+ E+ TC ++
Sbjct: 172 GNLMEFSEQELLDC-TTNNYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCR-SQE 229
Query: 236 ESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHG 295
++ V I+ Y VP+ E SLL+A+ QP+S+ I AS +D QFY+GG YDG C +++H
Sbjct: 230 KTAAVQISSYQVVPE-GETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHA 287
Query: 296 VAAVGYGS-TRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
V A+GYG+ +G Y ++KNSWG WGE G++++ R+ G P GLC I KM+SYP
Sbjct: 288 VTAIGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDYGNPAGLCDIAKMSSYP 341
>gi|400180417|gb|AFP73347.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 283 bits (723), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 154/354 (43%), Positives = 220/354 (62%), Gaps = 19/354 (5%)
Query: 1 MALSSQFKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYE 60
MA+ ILI+ FF+ S F + G S L+ +++ E WMS+ +VY+
Sbjct: 1 MAMKVDLMNILITV---FFVISMF--NTQTRGRSQPKLSVSER----HELWMSRHGRVYK 51
Query: 61 SLDEKLERFEIFKDNLRHIDETNRKIK-NYWLGLNEFADLRHEEFKEMFLGLK-PDLARR 118
EK ERF IFK+N++ I+ N+ +Y LG+NEFAD+ +EF F GL P+
Sbjct: 52 DEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLS 111
Query: 119 KDQSH-EDFSYKDVVD--LPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVT 175
+F D+ D +P ++DWR+ GAVT VK+QG CG CWAFS V ++EG +I T
Sbjct: 112 PSPLSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIAT 171
Query: 176 GNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKG 235
GNL SEQEL+DC T N GCNGG M AF +I+ GG+ +E DY Y+ ++ TC ++
Sbjct: 172 GNLMEFSEQELLDC-TTNNYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQYTCR-SQE 229
Query: 236 ESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHG 295
++ V I+ Y VP+ E SLL+A+ QP+S+ I AS +D QFY+GG YDG C +++H
Sbjct: 230 KTAAVQISSYKVVPE-GETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHA 287
Query: 296 VAAVGYGS-TRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
V A+GYG+ +G Y ++KNSWG WGE G++++ R++G P GLC I KM+SYP
Sbjct: 288 VTAIGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|400180349|gb|AFP73313.1| cysteine protease [Solanum peruvianum]
gi|400180469|gb|AFP73371.1| cysteine protease [Solanum peruvianum]
gi|400180471|gb|AFP73372.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 283 bits (723), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 155/354 (43%), Positives = 219/354 (61%), Gaps = 19/354 (5%)
Query: 1 MALSSQFKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYE 60
MA+ ILI+ FF+ S F + G S L+ +++ E WMS+ +VY+
Sbjct: 1 MAMKVDLMNILITL---FFVISMF--NTQTRGRSQPKLSVSER----HELWMSRHGRVYK 51
Query: 61 SLDEKLERFEIFKDNLRHIDETNRKIK-NYWLGLNEFADLRHEEFKEMFLGLK-PD-LAR 117
EK+ERF IFK+N++ I+ N+ +Y LG+NEFAD+ +EF F GL P+
Sbjct: 52 DEVEKVERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLS 111
Query: 118 RKDQSHEDFSYKDVVD--LPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVT 175
S + D+ D +P ++DWR+ GAVT VK+QG CG CWAFS V ++EG +I T
Sbjct: 112 PSPMSSTELKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIAT 171
Query: 176 GNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKG 235
GNL SEQEL+DC T N GCNGG M AF +I GG+ +E DY Y+ E+ TC ++
Sbjct: 172 GNLMEFSEQELLDC-TTNNYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCR-SQE 229
Query: 236 ESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHG 295
++ V I+ Y VP+ E SLL+A+ QP+S+ I AS +D QFY+GG YDG C +++H
Sbjct: 230 KTAAVQISSYQVVPE-GETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHA 287
Query: 296 VAAVGYGS-TRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
V A+GYG+ +G Y ++KNSWG WGE G++++ R+ G P GLC I KM+SYP
Sbjct: 288 VTAIGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDYGNPAGLCDIAKMSSYP 341
>gi|2342494|dbj|BAA21848.1| bromelain [Ananas comosus]
gi|2463582|dbj|BAA22543.1| FB31 precursor [Ananas comosus]
Length = 352
Score = 283 bits (723), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 147/350 (42%), Positives = 207/350 (59%), Gaps = 17/350 (4%)
Query: 1 MALSSQFKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYE 60
MA Q + + C+ + S+ +RD +D ++ FE WM+++ +VY+
Sbjct: 1 MASKVQLVFLFLFLCVMWASPSAASRD-----------EPSDPMMKRFEEWMAEYGRVYK 49
Query: 61 SLDEKLERFEIFKDNLRHIDE-TNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRK 119
DEK+ RF+IFK+N+ HI+ NR +Y LG+N+F D+ + EF + G +
Sbjct: 50 DNDEKMRRFQIFKNNVNHIETFNNRNGNSYTLGINKFTDMTNNEFVAQYTGGISRPLNIE 109
Query: 120 DQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLA 179
+ F ++ + +S+DWR GAVT VK+Q CGSCWAFS +A VEGI +IVTG L
Sbjct: 110 KEPVVSFDDVNISAVGQSIDWRDYGAVTEVKDQNPCGSCWAFSAIATVEGIYKIVTGYLV 169
Query: 180 SLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEV 239
SLSEQE++DC +NGC+GG +D A+ +I+S G+ E DYPY +G C +
Sbjct: 170 SLSEQEVLDC--AVSNGCDGGFVDNAYDFIISNNGVASEADYPYQAYQGDCAANSWPNSA 227
Query: 240 VTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAV 299
I GY V N E S+ A+ NQP++ AI+ASG +FQ+Y+GGV+ G CGT L+H + +
Sbjct: 228 Y-ITGYSYVRSNDESSMKYAVWNQPIAAAIDASGDNFQYYNGGVFSGPCGTSLNHAITII 286
Query: 300 GYGS-TRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
GYG + G Y IVKNSWG WGE+GYIRM R GLCGI YP
Sbjct: 287 GYGQDSSGTQYWIVKNSWGSSWGERGYIRMARGVSS-SGLCGIAMDPLYP 335
>gi|400180467|gb|AFP73370.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 283 bits (723), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 157/356 (44%), Positives = 219/356 (61%), Gaps = 23/356 (6%)
Query: 1 MALSSQFKTILIS--FCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKV 58
MA+ ILI+ F IS F + AR S L+ +++ E WMS+ +V
Sbjct: 1 MAMKVDLMNILITLFFVISMFNTQTRAR-------SQPKLSVSER----HELWMSRHGRV 49
Query: 59 YESLDEKLERFEIFKDNLRHIDETNRKIK-NYWLGLNEFADLRHEEFKEMFLGLK-PD-L 115
Y+ EK ERF IFK+N++ I+ N+ +Y LG+NEFAD+ +EF F GL P+
Sbjct: 50 YKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSY 109
Query: 116 ARRKDQSHEDFSYKDVVD--LPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQI 173
S +F D+ D +P ++DWR+ GAVT VK+QG CG CWAFS V ++EG +I
Sbjct: 110 LSPSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKI 169
Query: 174 VTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMT 233
TGNL SEQEL+DC T N GCNGG M AF +I GG+ +E DY Y+ E+ TC +
Sbjct: 170 ATGNLMEFSEQELLDC-TTNNYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCR-S 227
Query: 234 KGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLD 293
+ ++ V I+ Y VP+ E SLL+A+ QP+S+ I AS +D QFY+GG YDG C +++
Sbjct: 228 QEKTAAVQISSYQVVPE-GETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRIN 285
Query: 294 HGVAAVGYGS-TRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
H V A+GYG+ +G Y ++KNSWG WGE G++++ R+ G P GLC I KM+SYP
Sbjct: 286 HAVTAIGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDYGNPAGLCDIAKMSSYP 341
>gi|400180461|gb|AFP73367.1| cysteine protease [Solanum peruvianum]
gi|400180473|gb|AFP73373.1| cysteine protease [Solanum peruvianum]
gi|400180475|gb|AFP73374.1| cysteine protease [Solanum peruvianum]
gi|400180479|gb|AFP73376.1| cysteine protease [Solanum peruvianum]
gi|400180481|gb|AFP73377.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 282 bits (721), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 156/354 (44%), Positives = 219/354 (61%), Gaps = 19/354 (5%)
Query: 1 MALSSQFKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYE 60
MA+ ILI+ FF+ S F + G S L+ +++ E WMS+ +VY+
Sbjct: 1 MAMKVDLMNILITL---FFVISMF--NTQTRGRSQPKLSVSER----HELWMSRHGRVYK 51
Query: 61 SLDEKLERFEIFKDNLRHIDETNRKIK-NYWLGLNEFADLRHEEFKEMFLGLK-PD-LAR 117
EK ERF IFK+N++ I+ N+ +Y LG+NEFAD+ +EF F GL P+
Sbjct: 52 DEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLS 111
Query: 118 RKDQSHEDFSYKDVVD--LPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVT 175
S +F D+ D +P ++DWR+ GAVT VK+QG CG CWAFS V ++EG +I T
Sbjct: 112 PSPMSSTEFIINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIAT 171
Query: 176 GNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKG 235
GNL SEQEL+DC T N GCNGG M AF +I GG+ +E DY Y+ E+ TC ++
Sbjct: 172 GNLMEFSEQELLDC-TTNNYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCR-SQE 229
Query: 236 ESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHG 295
++ V I+ Y VP+ E SLL+A+ QP+S+ I AS +D QFY+GG YDG C +++H
Sbjct: 230 KTAAVQISSYQVVPE-GETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHA 287
Query: 296 VAAVGYGS-TRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
V A+GYG+ +G Y ++KNSWG WGE G++++ R+ G P GLC I KM+SYP
Sbjct: 288 VTAIGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDYGNPAGLCDIAKMSSYP 341
>gi|326520659|dbj|BAJ92693.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 289
Score = 282 bits (721), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 139/261 (53%), Positives = 182/261 (69%), Gaps = 12/261 (4%)
Query: 25 ARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNR 84
A D SIV Y S +++ ++ WM++ Y ++ E+ RFE F+DNLR+ID+ N
Sbjct: 23 AADMSIVSYGER---SEEEVRRMYAEWMAEHGSTYNAIGEEERRFEAFRDNLRYIDQHNA 79
Query: 85 K----IKNYWLGLNEFADLRHEEFKEMFLG--LKPDLARRKDQSHEDFSYKDVVDLPKSV 138
+ ++ LGLN FADL +EE++ +LG KPD R+ ++ D +LP+SV
Sbjct: 80 AADAGVHSFRLGLNRFADLTNEEYRSTYLGARTKPDRERKLSARYQ---AADNDELPESV 136
Query: 139 DWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCN 198
DWRKKGAV VK+QG CGSCWAFS +AAVEGINQIVTG++ LSEQEL+DCD +YN GCN
Sbjct: 137 DWRKKGAVGAVKDQGGCGSCWAFSAIAAVEGINQIVTGDMIPLSEQELVDCDTSYNQGCN 196
Query: 199 GGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLK 258
GGLMDYAF++I++ GG+ EEDYPY + C+ K ++VVTI+GY DVP NSE SL K
Sbjct: 197 GGLMDYAFEFIINNGGIDSEEDYPYKERDNRCDANKKNAKVVTIDGYEDVPVNSEKSLQK 256
Query: 259 ALANQPLSVAIEASGRDFQFY 279
A+ANQP+SVAIEA GR FQ Y
Sbjct: 257 AVANQPISVAIEAGGRAFQLY 277
>gi|388890776|gb|AFK80364.1| cysteine proteinase 3, partial [Acanthamoeba castellanii]
Length = 329
Score = 282 bits (721), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 150/314 (47%), Positives = 197/314 (62%), Gaps = 9/314 (2%)
Query: 39 TSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFAD 98
T++D L +F WM K Y S +E + R+ ++++N + I+E NR K +L +N+F D
Sbjct: 21 TTHDPLTGVFAEWMRDNSKSY-SNEEFVFRWNVWRENQQLIEEHNRSNKTSFLAMNKFGD 79
Query: 99 LRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSC 158
L + EF ++F GL D + +++ + + L DWR+KGAVTHVKNQG CGSC
Sbjct: 80 LTNAEFNKLFKGLAFDYSFHANKAAAEKAVP-APGLSADFDWRQKGAVTHVKNQGQCGSC 138
Query: 159 WAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIVSTGGLHK 217
W+FST + EG N + TG L SLSEQ LIDC +Y NNGCNGGLMDYAF+YI++ G+
Sbjct: 139 WSFSTTGSTEGANFLKTGRLTSLSEQNLIDCSGSYGNNGCNGGLMDYAFEYIINNKGIDT 198
Query: 218 EEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQ 277
E YPY + TC+ S ++ Y DV E++LL A+A +P SVAI+AS FQ
Sbjct: 199 EASYPYQTAQYTCQYNPANSG-GSLTSYTDVSSGDENALLNAVATEPTSVAIDASHNSFQ 257
Query: 278 FYSGGV-YDGHC-GTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKP 335
FYSGGV Y+ C TQLDHGV AVG+G+ G DY +VKNSWG WG GYI+M RN
Sbjct: 258 FYSGGVYYESACSSTQLDHGVLAVGWGTEDGQDYWLVKNSWGADWGLAGYIKMARNRSNN 317
Query: 336 EGLCGINKMASYPI 349
CGI ASYP
Sbjct: 318 ---CGIATSASYPT 328
>gi|30690594|ref|NP_564321.2| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|28393492|gb|AAO42167.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|332192920|gb|AEE31041.1| cysteine proteinase-like protein [Arabidopsis thaliana]
Length = 355
Score = 282 bits (721), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 148/311 (47%), Positives = 201/311 (64%), Gaps = 14/311 (4%)
Query: 49 ESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRK-IKNYWLGLNEFADLRHEEFKEM 107
+ WM++F +VY EK RF++FK NL+ I++ N+K + Y LG+NEFAD EEF
Sbjct: 48 QQWMTRFSRVYSDELEKQMRFDVFKKNLKFIEKFNKKGDRTYKLGVNEFADWTREEFIAT 107
Query: 108 FLGLK-----PDLARRKDQSHEDFSYKDVVDLP--KSVDWRKKGAVTHVKNQGSCGSCWA 160
GLK P + D+ +++ +V D+ ++ DWR +GAVT VK QG CG CWA
Sbjct: 108 HTGLKGVNGIPS-SEFVDEMIPSWNW-NVSDVAGRETKDWRYEGAVTPVKYQGQCGCCWA 165
Query: 161 FSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEED 220
FS+VAAVEG+ +IV NL SLSEQ+L+DCD +NGCNGG+M AF YI+ G+ E
Sbjct: 166 FSSVAAVEGLTKIVGNNLVSLSEQQLLDCDRERDNGCNGGIMSDAFSYIIKNRGIASEAS 225
Query: 221 YPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYS 280
YPY EGTC S I G+ VP N+E +LL+A++ QP+SV+I+A G F YS
Sbjct: 226 YPYQAAEGTCRYNGKPS--AWIRGFQTVPSNNERALLEAVSKQPVSVSIDADGPGFMHYS 283
Query: 281 GGVYD-GHCGTQLDHGVAAVGYG-STRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGL 338
GGVYD +CGT ++H V VGYG S G+ Y + KNSWG WGE GYIR++R+ P+G+
Sbjct: 284 GGVYDEPYCGTNVNHAVTFVGYGTSPEGIKYWLAKNSWGETWGENGYIRIRRDVAWPQGM 343
Query: 339 CGINKMASYPI 349
CG+ + A YP+
Sbjct: 344 CGVAQYAFYPV 354
>gi|186701255|gb|ACC91281.1| putative cysteine proteinase [Capsella rubella]
Length = 324
Score = 282 bits (721), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 155/343 (45%), Positives = 212/343 (61%), Gaps = 34/343 (9%)
Query: 10 ILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVY-ESLDEKLER 68
I +S I F + S A D S+ + L SN+++ +F++WMSK K Y +L +K +R
Sbjct: 10 ITLSLLIIFLLPPSSAMDLSV---TSGGLRSNEEVGFIFQTWMSKHGKTYTNALGDKEQR 66
Query: 69 FEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSY 128
F+ FKDNLR ID+ N K +Y LGL +FADL +E++++F G +P ++ + +
Sbjct: 67 FQNFKDNLRFIDQHNAKNLSYRLGLTQFADLTVQEYQDLFSG-RPIQKQKALRVTHRYVP 125
Query: 129 KDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELID 188
LP+SVDWR+KGAV+ +K+QG C VE IN+IVTG L SLSEQEL+D
Sbjct: 126 LAEDQLPQSVDWRQKGAVSEIKDQGRC----------TVESINKIVTGELISLSEQELVD 175
Query: 189 CDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESE-VVTINGYHD 247
C + N+GCNGGLMD AFQ++++ GL + DYPY +G C + S+ V+ I+GY D
Sbjct: 176 C-SIDNHGCNGGLMDSAFQFLINNNGLEYQSDYPYQAVQGYCNHNQNTSKKVIKIDGYED 234
Query: 248 VPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGL 307
VP N+E+SL KA+A+QP G+Y G CGT LDH V VGYG+ G
Sbjct: 235 VPANNENSLQKAVAHQP-----------------GIYTGPCGTDLDHAVVIVGYGTENGQ 277
Query: 308 DYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
DY IV+NSWG WGE GY ++ RN P G+CGI +ASYPIK
Sbjct: 278 DYWIVRNSWGTVWGEAGYAKIARNFENPTGVCGIAMVASYPIK 320
>gi|242093994|ref|XP_002437487.1| hypothetical protein SORBIDRAFT_10g027980 [Sorghum bicolor]
gi|241915710|gb|EER88854.1| hypothetical protein SORBIDRAFT_10g027980 [Sorghum bicolor]
Length = 341
Score = 281 bits (720), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 152/318 (47%), Positives = 204/318 (64%), Gaps = 29/318 (9%)
Query: 40 SNDKLIDLFESWMSKFEKVYE--SLDEKLERFEIFKDNLRHID----ETNRKIKNYWLGL 93
+++++ L+++W S+ + + S+ + L R ++F+DNLR+ID E + + + LGL
Sbjct: 43 ADEEVRQLYKTWKSEHGRPRDGISVADGL-RLKVFRDNLRYIDAHNAEADAGLHTFRLGL 101
Query: 94 NEFADLRHEEFKEMFLG-LKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQ 152
F DL EEF+ LG L L R + + + + DLP +VDWR++GAVT VKNQ
Sbjct: 102 TPFTDLTLEEFRAHALGFLNSTLPR---VASDRYLPRAGDDLPDAVDWRQQGAVTGVKNQ 158
Query: 153 GSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVST 212
CG CWAFS VAA+EGIN+IVT NL SLSEQELIDCD T + GC GG M AFQ+++
Sbjct: 159 LDCGGCWAFSAVAAMEGINKIVTNNLISLSEQELIDCD-TEDYGCQGGEMQKAFQFVIDN 217
Query: 213 GGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEAS 272
GG+ E DYP+I GTC+ + + +VV+I+ Y +VP N E++L KA+ANQP
Sbjct: 218 GGIDTEADYPFIGTNGTCDAIREKRKVVSIDSYENVPTNDEEALQKAVANQP-------- 269
Query: 273 GRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNT 332
G+++G CG LDHGV AVGYGS G D+ IVKNSWG +WGE GYIRMKRN
Sbjct: 270 ---------GIFNGPCGFILDHGVTAVGYGSDNGEDFWIVKNSWGAEWGESGYIRMKRNV 320
Query: 333 GKPEGLCGINKMASYPIK 350
P G CGI ASYP+K
Sbjct: 321 LLPMGKCGIAMYASYPVK 338
>gi|400180379|gb|AFP73328.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 281 bits (720), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 154/354 (43%), Positives = 220/354 (62%), Gaps = 19/354 (5%)
Query: 1 MALSSQFKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYE 60
MA+ ILI+ FF+ + F + G S L+ +++ E WMS+ +VY+
Sbjct: 1 MAMKVDLMNILITL---FFVITMF--NTQTRGRSQPKLSVSER----HELWMSRHGRVYK 51
Query: 61 SLDEKLERFEIFKDNLRHIDETNRKIK-NYWLGLNEFADLRHEEFKEMFLGLK-PD-LAR 117
EK ERF IFK+N++ I+ N+ +Y LG+NEFAD+ +EF F GL P+
Sbjct: 52 DEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLS 111
Query: 118 RKDQSHEDFSYKDVVD--LPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVT 175
S +F D+ D +P ++DWR+ GAVT VK+QG CG CWAFS V ++EG +I T
Sbjct: 112 PSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIAT 171
Query: 176 GNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKG 235
GNL SEQEL+DC T N GC+GG M AF +I GG+ +E DY Y+ E+ TC ++
Sbjct: 172 GNLMEFSEQELLDC-TTNNYGCDGGFMTNAFDFIKENGGISRESDYEYLGEQYTCR-SQE 229
Query: 236 ESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHG 295
++ V I+ Y VP+ E SLL+A+ QP+S+ I AS +D QFY+GG YDG C +++H
Sbjct: 230 KTAAVQISSYQVVPE-GETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHA 287
Query: 296 VAAVGYGS-TRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
V A+GYG+ +G Y ++KNSWG WGE G++++ R++G P GLC I KM+SYP
Sbjct: 288 VTAIGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|242093944|ref|XP_002437462.1| hypothetical protein SORBIDRAFT_10g027570 [Sorghum bicolor]
gi|241915685|gb|EER88829.1| hypothetical protein SORBIDRAFT_10g027570 [Sorghum bicolor]
Length = 366
Score = 281 bits (720), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 157/371 (42%), Positives = 213/371 (57%), Gaps = 25/371 (6%)
Query: 1 MALSSQFKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYE 60
M +++ T + + + + S A S + Y+ DL S + L L+E W + + +
Sbjct: 1 MVRAAEVATTMAATLVVVGMALSIAPVASAIDYTERDLASEESLWALYERWCAHYNMARD 60
Query: 61 SLDEKLERFEIFKDNLRHIDETNRK-IKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRK 119
EK RF++FK+N R I E N + Y LGLN F+D+ EEF G R
Sbjct: 61 H-GEKTRRFDLFKENARRIYEHNHQGNATYTLGLNRFSDMTDEEFNRSPYGGCLTAPRMS 119
Query: 120 DQSHEDF------------------SYKDVVDLPKSVDWRKKGAVTHVKNQG-SCGSCWA 160
D E+ S + P +VDWR + AVT VK+QG +CGSCWA
Sbjct: 120 DDEIEELHHHHHQQEDDGSFNLTHGSGGGKLGAPPAVDWRGR-AVTRVKDQGPTCGSCWA 178
Query: 161 FSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEED 220
FS +AAVEGIN I T NL LSEQ+L+DCD N+GCNGGLM AF ++V G+ E
Sbjct: 179 FSAIAAVEGINAIRTRNLVPLSEQQLVDCDK-LNHGCNGGLMTTAFSFVVRNRGVVPEGA 237
Query: 221 YPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYS 280
YPY+ EG C+ + VTI GY VP+ ++L+ A+A QP+SVAIEAS +F+ Y
Sbjct: 238 YPYMGREGRCKHVM--APPVTIYGYQRVPRFDANALMNAVAAQPVSVAIEASSFEFRHYQ 295
Query: 281 GGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCG 340
GGV++G+CG +L H AVGYG+ G + IVKNSWGP WGE GY+R+ RNT +G+CG
Sbjct: 296 GGVFNGNCGGRLGHAATAVGYGADAGGPFWIVKNSWGPGWGEGGYVRISRNTPVRQGVCG 355
Query: 341 INKMASYPIKK 351
I SYP+K+
Sbjct: 356 ILTENSYPVKR 366
>gi|400180375|gb|AFP73326.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 281 bits (720), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 156/356 (43%), Positives = 220/356 (61%), Gaps = 23/356 (6%)
Query: 1 MALSSQFKTILIS--FCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKV 58
MA+ +ILI+ F IS F + AR S L+ +++ E WMS+ +V
Sbjct: 1 MAMKVDLMSILITLFFVISMFNSQTRAR-------SQPKLSVSER----HELWMSRHGRV 49
Query: 59 YESLDEKLERFEIFKDNLRHIDETNRKIK-NYWLGLNEFADLRHEEFKEMFLGLK-PD-L 115
Y+ EK ERF IFK+N++ I+ N+ +Y LG+NEFAD+ +EF F GL P+
Sbjct: 50 YKDEVEKGERFMIFKENIKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSY 109
Query: 116 ARRKDQSHEDFSYKDVVD--LPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQI 173
S +F D+ D +P ++DWR+ GAVT VK+QG CG CWAFS V ++EG +I
Sbjct: 110 LSPSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKI 169
Query: 174 VTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMT 233
TGNL SEQEL+DC T N GC+GG M AF +I GG+ E DY Y+ E+ TC +
Sbjct: 170 ATGNLMEFSEQELLDC-TTNNYGCDGGFMTNAFDFIKENGGISSESDYEYLGEQYTCR-S 227
Query: 234 KGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLD 293
+ ++ V I+ Y VP+ E SLL+A+ QP+S+ I AS +D QFY+GG YDG C +++
Sbjct: 228 QEKTAAVQISSYQVVPEG-ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRIN 285
Query: 294 HGVAAVGYGS-TRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
H V A+GYG+ +G Y ++KNSWG WGE G++++ R++G P GLC I KM+SYP
Sbjct: 286 HAVTAIGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPAGLCDIAKMSSYP 341
>gi|9502421|gb|AAF88120.1|AC021043_13 Putative cysteine proteinase [Arabidopsis thaliana]
Length = 331
Score = 281 bits (719), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 148/311 (47%), Positives = 201/311 (64%), Gaps = 14/311 (4%)
Query: 49 ESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRK-IKNYWLGLNEFADLRHEEFKEM 107
+ WM++F +VY EK RF++FK NL+ I++ N+K + Y LG+NEFAD EEF
Sbjct: 24 QQWMTRFSRVYSDELEKQMRFDVFKKNLKFIEKFNKKGDRTYKLGVNEFADWTREEFIAT 83
Query: 108 FLGLK-----PDLARRKDQSHEDFSYKDVVDLP--KSVDWRKKGAVTHVKNQGSCGSCWA 160
GLK P + D+ +++ +V D+ ++ DWR +GAVT VK QG CG CWA
Sbjct: 84 HTGLKGVNGIPS-SEFVDEMIPSWNW-NVSDVAGRETKDWRYEGAVTPVKYQGQCGCCWA 141
Query: 161 FSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEED 220
FS+VAAVEG+ +IV NL SLSEQ+L+DCD +NGCNGG+M AF YI+ G+ E
Sbjct: 142 FSSVAAVEGLTKIVGNNLVSLSEQQLLDCDRERDNGCNGGIMSDAFSYIIKNRGIASEAS 201
Query: 221 YPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYS 280
YPY EGTC S I G+ VP N+E +LL+A++ QP+SV+I+A G F YS
Sbjct: 202 YPYQAAEGTCRYNGKPS--AWIRGFQTVPSNNERALLEAVSKQPVSVSIDADGPGFMHYS 259
Query: 281 GGVYD-GHCGTQLDHGVAAVGYG-STRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGL 338
GGVYD +CGT ++H V VGYG S G+ Y + KNSWG WGE GYIR++R+ P+G+
Sbjct: 260 GGVYDEPYCGTNVNHAVTFVGYGTSPEGIKYWLAKNSWGETWGENGYIRIRRDVAWPQGM 319
Query: 339 CGINKMASYPI 349
CG+ + A YP+
Sbjct: 320 CGVAQYAFYPV 330
>gi|400180345|gb|AFP73311.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 281 bits (719), Expect = 4e-73, Method: Compositional matrix adjust.
Identities = 154/354 (43%), Positives = 219/354 (61%), Gaps = 19/354 (5%)
Query: 1 MALSSQFKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYE 60
MA+ ILI+ FF+ S F + G S L+ +++ E WMS+ +VY+
Sbjct: 1 MAMKVDLMNILITL---FFVISMF--NTQTRGRSQPKLSVSER----HELWMSRHGRVYK 51
Query: 61 SLDEKLERFEIFKDNLRHIDETNRKIK-NYWLGLNEFADLRHEEFKEMFLGLK-PD-LAR 117
EK ERF IFK+N++ I+ N+ +Y LG+NEFAD+ +EF F GL P+
Sbjct: 52 DEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLS 111
Query: 118 RKDQSHEDFSYKDVVD--LPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVT 175
S +F D+ D +P ++DWR+ GAVT VK+QG CG CWAFS V ++EG +I T
Sbjct: 112 PSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIAT 171
Query: 176 GNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKG 235
GNL SEQEL+DC T N GC+GG M AF +I+ GG+ +E DY Y+ ++ TC ++
Sbjct: 172 GNLMEFSEQELLDC-TTNNYGCDGGFMTNAFDFIIENGGISRESDYEYLGQQYTCR-SQE 229
Query: 236 ESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHG 295
++ V I+ Y VP+ E SLL+A+ QP+S+ I AS +D QFY+GG YDG C +++H
Sbjct: 230 KTAAVQISSYQVVPE-GETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHA 287
Query: 296 VAAVGYGS-TRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
V A+GYG+ G Y ++KNSWG WGE G++++ R+ G P GLC I KM+SYP
Sbjct: 288 VTAIGYGTDENGQKYWLLKNSWGTSWGENGFMKIIRDYGNPAGLCDIAKMSSYP 341
>gi|75277440|sp|O23791.1|BROM1_ANACO RecName: Full=Fruit bromelain; AltName: Allergen=Ana c 2; Flags:
Precursor
gi|2342496|dbj|BAA21849.1| bromelain [Ananas comosus]
Length = 351
Score = 281 bits (718), Expect = 4e-73, Method: Compositional matrix adjust.
Identities = 143/352 (40%), Positives = 212/352 (60%), Gaps = 22/352 (6%)
Query: 1 MALSSQFKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYE 60
MA Q + + C + S+ +RD ND ++ FE WM+++ +VY+
Sbjct: 1 MASKVQLVFLFLFLCAMWASPSAASRD-----------EPNDPMMKRFEEWMAEYGRVYK 49
Query: 61 SLDEKLERFEIFKDNLRHIDETNRKIKN-YWLGLNEFADLRHEEFKEMFLG--LKPDLAR 117
DEK+ RF+IFK+N++HI+ N + +N Y LG+N+F D+ EF + G L ++ R
Sbjct: 50 DDDEKMRRFQIFKNNVKHIETFNSRNENSYTLGINQFTDMTKSEFVAQYTGVSLPLNIER 109
Query: 118 RKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGN 177
S +D ++ +P+S+DWR GAV VKNQ CGSCW+F+ +A VEGI +I TG
Sbjct: 110 EPVVSFDDV---NISAVPQSIDWRDYGAVNEVKNQNPCGSCWSFAAIATVEGIYKIKTGY 166
Query: 178 LASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGES 237
L SLSEQE++DC +Y GC GG ++ A+ +I+S G+ EE+YPY+ +GTC +
Sbjct: 167 LVSLSEQEVLDCAVSY--GCKGGWVNKAYDFIISNNGVTTEENYPYLAYQGTCNANSFPN 224
Query: 238 EVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVA 297
I GY V +N E S++ A++NQP++ I+AS +FQ+Y+GGV+ G CGT L+H +
Sbjct: 225 SAY-ITGYSYVRRNDERSMMYAVSNQPIAALIDAS-ENFQYYNGGVFSGPCGTSLNHAIT 282
Query: 298 AVGYGS-TRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
+GYG + G Y IV+NSWG WGE GY+RM R G+CGI +P
Sbjct: 283 IIGYGQDSSGTKYWIVRNSWGSSWGEGGYVRMARGVSSSSGVCGIAMAPLFP 334
>gi|400180465|gb|AFP73369.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 281 bits (718), Expect = 4e-73, Method: Compositional matrix adjust.
Identities = 155/354 (43%), Positives = 219/354 (61%), Gaps = 19/354 (5%)
Query: 1 MALSSQFKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYE 60
MA+ ILI+ FF+ S F + G S L+ +++ E WMS+ +VY+
Sbjct: 1 MAMKVDLMNILITL---FFVISMF--NTQTRGRSQPKLSVSER----HELWMSRHGRVYK 51
Query: 61 SLDEKLERFEIFKDNLRHIDETNRKIK-NYWLGLNEFADLRHEEFKEMFLGLK-PD-LAR 117
EK ERF IFK+N++ I+ N+ +Y LG+NEFAD+ +EF F GL P+
Sbjct: 52 DEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLS 111
Query: 118 RKDQSHEDFSYKDVVD--LPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVT 175
S +F D+ D +P ++DWR+ GAVT VK+QG CG CWAFS V ++E +I T
Sbjct: 112 PSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEVAYKIAT 171
Query: 176 GNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKG 235
GNL SEQEL+DC T N GCNGG M AF +I GG+ +E DY Y+ E+ TC ++
Sbjct: 172 GNLMEFSEQELLDC-TTNNYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCR-SQE 229
Query: 236 ESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHG 295
++ V I+ Y VP+ E SLL+A+ QP+S+ I AS +D QFY+GG YDG C +++H
Sbjct: 230 KTAAVQISSYQVVPE-GETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHA 287
Query: 296 VAAVGYGS-TRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
V A+GYG+ +G Y ++KNSWG WGE G++++ R++G P GLC I KM+SYP
Sbjct: 288 VTAIGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPAGLCDIAKMSSYP 341
>gi|400180451|gb|AFP73362.1| cysteine protease [Solanum chilense]
Length = 344
Score = 281 bits (718), Expect = 4e-73, Method: Compositional matrix adjust.
Identities = 154/354 (43%), Positives = 220/354 (62%), Gaps = 19/354 (5%)
Query: 1 MALSSQFKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYE 60
MA+ ILI+ FF+ S F + G S +L+ +++ E WMS+ +VY+
Sbjct: 1 MAMKVDLMNILITL---FFVISMF--NTQTRGRSQPELSVSER----HELWMSRHGRVYK 51
Query: 61 SLDEKLERFEIFKDNLRHIDETNRKIK-NYWLGLNEFADLRHEEFKEMFLGLK-PD-LAR 117
EK ERF IFK+N++ I+ N+ +Y LG+NEFAD+ +EF F GL P+
Sbjct: 52 DEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLS 111
Query: 118 RKDQSHEDFSYKDVVD--LPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVT 175
S +F D+ D +P ++DWR+ GAVT VK+QG CG CWAFS V ++EG +I T
Sbjct: 112 PSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIAT 171
Query: 176 GNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKG 235
GNL SEQEL+DC T N GC+GG M AF +I GG+ E DY Y+ ++ TC ++
Sbjct: 172 GNLMEFSEQELLDC-TTNNYGCDGGFMTNAFDFIKENGGISSESDYEYLGQQYTCR-SQE 229
Query: 236 ESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHG 295
++ V I+ Y VP+ E SLL+A+ QP+S+ I AS +D QFY+GG YDG C +++H
Sbjct: 230 KTAAVQISSYQVVPEG-ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHA 287
Query: 296 VAAVGYGS-TRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
V A+GYG+ +G Y ++KNSWG WGE G++++ R++G P GLC I KM+SYP
Sbjct: 288 VTAIGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGDPSGLCDIAKMSSYP 341
>gi|400180363|gb|AFP73320.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 281 bits (718), Expect = 4e-73, Method: Compositional matrix adjust.
Identities = 155/354 (43%), Positives = 218/354 (61%), Gaps = 19/354 (5%)
Query: 1 MALSSQFKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYE 60
MA+ ILI+ FF+ S F + G S L+ +++ E WMS+ +VY+
Sbjct: 1 MAMKVDLMNILITL---FFVISMF--NTQTRGRSQPKLSVSER----HELWMSRHGRVYK 51
Query: 61 SLDEKLERFEIFKDNLRHIDETNRKIK-NYWLGLNEFADLRHEEFKEMFLGLK-PD-LAR 117
EK ERF IFK+N++ I+ N+ +Y LG+NEFAD+ +EF F GL P+
Sbjct: 52 DEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLS 111
Query: 118 RKDQSHEDFSYKDVVD--LPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVT 175
S + D+ D +P ++DWR+ GAVT VK+QG CG CWAFS V ++EG +I T
Sbjct: 112 PSPMSSTELKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIAT 171
Query: 176 GNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKG 235
GNL SEQEL+DC T N GCNGG M AF +I GG+ +E DY Y+ E+ TC ++
Sbjct: 172 GNLMEFSEQELLDC-TTNNYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCR-SQE 229
Query: 236 ESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHG 295
++ V I+ Y VP+ E SLL+A+ QP+S+ I AS +D QFY+GG YDG C +++H
Sbjct: 230 KTAAVQISSYQVVPE-GETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHA 287
Query: 296 VAAVGYGS-TRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
V A+GYG+ +G Y ++KNSWG WGE G++++ R+ G P GLC I KM+SYP
Sbjct: 288 VTAIGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDYGNPAGLCDIAKMSSYP 341
>gi|400180445|gb|AFP73359.1| cysteine protease, partial [Solanum chilense]
Length = 345
Score = 281 bits (718), Expect = 5e-73, Method: Compositional matrix adjust.
Identities = 154/354 (43%), Positives = 220/354 (62%), Gaps = 19/354 (5%)
Query: 1 MALSSQFKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYE 60
MA+ ILI+ FF+ S F + G S +L+ +++ E WMS+ +VY+
Sbjct: 1 MAMKVDLMNILITL---FFVISMF--NTQTRGRSQPELSVSER----HELWMSRHGRVYK 51
Query: 61 SLDEKLERFEIFKDNLRHIDETNRKIK-NYWLGLNEFADLRHEEFKEMFLGLK-PD-LAR 117
EK ERF IFK+N++ I+ N+ +Y LG+NEFAD+ +EF F GL P+
Sbjct: 52 DEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLS 111
Query: 118 RKDQSHEDFSYKDVVD--LPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVT 175
S +F D+ D +P ++DWR+ GAVT VK+QG CG CWAFS V ++EG +I T
Sbjct: 112 PSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIAT 171
Query: 176 GNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKG 235
GNL SEQEL+DC T N GC+GG M AF +I GG+ E DY Y+ ++ TC ++
Sbjct: 172 GNLMEFSEQELLDC-TTNNYGCDGGFMTNAFDFIKENGGISSESDYEYLGQQYTCR-SQE 229
Query: 236 ESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHG 295
++ V I+ Y VP+ E SLL+A+ QP+S+ I AS +D QFY+GG YDG C +++H
Sbjct: 230 KTAAVQISSYQVVPEG-ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHA 287
Query: 296 VAAVGYGS-TRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
V A+GYG+ +G Y ++KNSWG WGE G++++ R++G P GLC I KM+SYP
Sbjct: 288 VTAIGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGDPSGLCDIAKMSSYP 341
>gi|341850671|gb|AEK97329.1| chromoplast senescence-associated protein 12 [Brassica rapa var.
parachinensis]
Length = 260
Score = 280 bits (717), Expect = 5e-73, Method: Compositional matrix adjust.
Identities = 139/259 (53%), Positives = 181/259 (69%), Gaps = 6/259 (2%)
Query: 95 EFADLRHEEFKEMFLGLKPD--LARRKDQSHEDFSYKDVVD--LPKSVDWRKKGAVTHVK 150
+FA++ ++EF+ M+ G K D L+ + F Y++V LP +VDWRKKGAVT +K
Sbjct: 1 QFAEITNDEFRSMYTGYKGDSVLSSQSQTKSTSFRYQNVSSGALPIAVDWRKKGAVTPIK 60
Query: 151 NQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIV 210
NQGSCG CWAFS VAA+EG QI G L SLSEQ+L+DCD T + GC+GGL+D AF++I+
Sbjct: 61 NQGSCGCCWAFSAVAAIEGATQIKKGKLISLSEQQLVDCD-TNDFGCSGGLIDTAFEHIM 119
Query: 211 STGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIE 270
+TGGL E +YPY E+ TC++ +I GY DVP N E++L+KA+A+QP+SV IE
Sbjct: 120 ATGGLTTESNYPYKGEDATCKIKSTXPSAASITGYEDVPVNDENALMKAVAHQPVSVGIE 179
Query: 271 ASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYG-STRGLDYIIVKNSWGPKWGEKGYIRMK 329
G DFQFYS GV+ G C T LDH V AVGY S+ G Y I+KNSWG KWGE GY+R+K
Sbjct: 180 GGGFDFQFYSSGVFTGECTTYLDHAVTAVGYSQSSAGSKYWIIKNSWGTKWGEGGYMRIK 239
Query: 330 RNTGKPEGLCGINKMASYP 348
++ EGLCG+ ASYP
Sbjct: 240 KDIKDKEGLCGLAMKASYP 258
>gi|242092704|ref|XP_002436842.1| hypothetical protein SORBIDRAFT_10g009850 [Sorghum bicolor]
gi|241915065|gb|EER88209.1| hypothetical protein SORBIDRAFT_10g009850 [Sorghum bicolor]
Length = 296
Score = 280 bits (717), Expect = 5e-73, Method: Compositional matrix adjust.
Identities = 144/314 (45%), Positives = 198/314 (63%), Gaps = 25/314 (7%)
Query: 44 LIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRK-IKNYWLGLNEFADLRHE 102
++ E WM ++ +VY+ EK +RFE+FK N++ I+ N + +WLG+N+FADL ++
Sbjct: 1 MVARHEQWMVQYSRVYKDATEKAQRFEVFKSNVKFIESFNAGGNRKFWLGVNQFADLTND 60
Query: 103 EFKEMFL--GLKPDLARRKDQSHEDFSYKDV-VD-LPKSVDWRKKGAVTHVKNQGSCGSC 158
EF+ G KP + F Y+++ VD LP ++DWR KGAVT +K+QG C
Sbjct: 61 EFRATKTNKGFKPSPVK----VPTGFRYENISVDALPATIDWRTKGAVTPIKDQGQC--- 113
Query: 159 WAFSTVAAVEGINQIVTGNLASLSEQELIDCD-NTYNNGCNGGLMDYAFQYIVSTGGLHK 217
EGI +I TG L SLSEQEL+DCD + + GC GGLMD AF++I+ GGL
Sbjct: 114 ---------EGIVKISTGKLISLSEQELVDCDVHGEDQGCEGGLMDDAFKFIIKKGGLTT 164
Query: 218 EEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQ 277
E YPY +G C+ G + V T+ G+ DVP N E SL+KA+ANQP+SVA++ FQ
Sbjct: 165 ESSYPYTAADGKCK--SGSNSVATVKGFEDVPANDEASLMKAVANQPVSVAVDGGDMTFQ 222
Query: 278 FYSGGVYDGHCGTQLDHGVAAVGYGSTR-GLDYIIVKNSWGPKWGEKGYIRMKRNTGKPE 336
FYSGGV G CGT LDHG+AA+GYG T G Y ++KNSWG WGE GY+RM+++
Sbjct: 223 FYSGGVMTGSCGTDLDHGIAAIGYGQTSDGTKYWLLKNSWGTTWGENGYLRMEKDISDKR 282
Query: 337 GLCGINKMASYPIK 350
G+CG+ SYP +
Sbjct: 283 GMCGLAMEPSYPTE 296
>gi|400180373|gb|AFP73325.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 280 bits (717), Expect = 6e-73, Method: Compositional matrix adjust.
Identities = 155/354 (43%), Positives = 218/354 (61%), Gaps = 19/354 (5%)
Query: 1 MALSSQFKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYE 60
MA+ ILI+ FF+ S F + G S L+ +++ E WMS+ VY+
Sbjct: 1 MAMKVDLMNILITL---FFVISMF--NTQTRGRSQPKLSVSER----HELWMSRHGHVYK 51
Query: 61 SLDEKLERFEIFKDNLRHIDETNRKIK-NYWLGLNEFADLRHEEFKEMFLGLK-PD-LAR 117
EK ERF IFK+N++ I+ N+ +Y LG+NEFAD+ +EF F GL P+
Sbjct: 52 DEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLS 111
Query: 118 RKDQSHEDFSYKDVVD--LPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVT 175
S +F D+ D +P ++DWR+ GAVT VK+QG CG CWAFS V ++EG +I T
Sbjct: 112 PSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGQCGCCWAFSAVGSLEGAYKIAT 171
Query: 176 GNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKG 235
GNL SEQEL+DC T N GC+GG M AF +I GG+ E DY Y+ E+ TC ++
Sbjct: 172 GNLMEFSEQELLDC-TTNNYGCDGGFMTNAFDFIKENGGISSESDYEYLGEQYTCR-SQE 229
Query: 236 ESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHG 295
++ V I+ Y VP+ E SLL+A+ QP+S+ I AS +D QFY+GG YDG C +++H
Sbjct: 230 KTAAVQISSYQVVPE-GETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHA 287
Query: 296 VAAVGYGS-TRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
V A+GYG+ +G Y ++KNSWG WGE G++++ R++G P GLC I KM+SYP
Sbjct: 288 VTAIGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPAGLCDIAKMSSYP 341
>gi|45738078|gb|AAS75836.1| fastuosain precursor [Bromelia fastuosa]
Length = 324
Score = 280 bits (717), Expect = 6e-73, Method: Compositional matrix adjust.
Identities = 141/309 (45%), Positives = 195/309 (63%), Gaps = 7/309 (2%)
Query: 42 DKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDE-TNRKIKNYWLGLNEFADLR 100
D +++ FE WM+++ +VY EK+ RF+IFK+N+ HI+ NR +Y LG+N+F D+
Sbjct: 4 DPMMERFEEWMAEYGRVYNDNAEKMRRFQIFKNNVNHIETFNNRSGNSYTLGVNQFTDMT 63
Query: 101 HEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWA 160
+ EF + G L +D F D+ +P+S+DWR GAVT VKNQGSCGSCWA
Sbjct: 64 NNEFLARYTGASLPLNIERDPV-VSFDDVDISAVPQSIDWRDYGAVTSVKNQGSCGSCWA 122
Query: 161 FSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEED 220
FS +A VEGI +I GNL SLSEQE++DC +Y GC+GG ++ A+ +I+S G+ +
Sbjct: 123 FSAIATVEGIYKIKAGNLISLSEQEVLDCALSY--GCDGGWVNKAYDFIISNNGVTSFAN 180
Query: 221 YPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYS 280
PY +G C ++ I GY V N+E S++ A+ANQP++ I+A G DFQ+Y
Sbjct: 181 LPYKGYKGPCNHNDLPNKAY-ITGYTYVQSNNERSMMIAVANQPIAALIDAGG-DFQYYK 238
Query: 281 GGVYDGHCGTQLDHGVAAVGYGST-RGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLC 339
GV+ G CGT L+H + +GYG T G Y IVKNSWG WGE+GYIRM R+ P GLC
Sbjct: 239 SGVFTGSCGTSLNHAITVIGYGQTSSGTKYWIVKNSWGTSWGERGYIRMARDVSSPYGLC 298
Query: 340 GINKMASYP 348
GI +P
Sbjct: 299 GIAMAPLFP 307
>gi|307103885|gb|EFN52142.1| hypothetical protein CHLNCDRAFT_139276 [Chlorella variabilis]
Length = 388
Score = 280 bits (717), Expect = 6e-73, Method: Compositional matrix adjust.
Identities = 143/307 (46%), Positives = 186/307 (60%), Gaps = 13/307 (4%)
Query: 48 FESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEM 107
F W + Y+S E +R +F +N +H+ E N + L LN+FADL EEF
Sbjct: 46 FSQWQMTHGRSYKSASEARKRQAVFVENAKHVAEQNARNSGLVLALNQFADLTLEEFAAT 105
Query: 108 FLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAV 167
LG P L K+ + F Y D DLP +VDWRKK AVT VKNQ CGSCWAFS AV
Sbjct: 106 HLGYNPSLREGKEHTTTSFQYADANDLPSTVDWRKKNAVTPVKNQAMCGSCWAFSATGAV 165
Query: 168 EGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEE 227
EGIN I TG L SLSEQ+L+DCD+ + GC GGLMD+AF YI GG+ E+DY Y
Sbjct: 166 EGINAIRTGKLVSLSEQQLVDCDSEKDLGCGGGLMDFAFDYITKNGGIDSEDDYSYWGYG 225
Query: 228 GTCEMTK-GESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDG 286
C+ K + VVTI+G+ DVP+N ++L KA+A+QP+S+ ++SG V D
Sbjct: 226 LICQRRKEADRHVVTIDGFEDVPKNDGEALKKAIAHQPVSL----------YHSGVVGDD 275
Query: 287 HCGTQLDHGVAAVGY--GSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKM 344
C L+HGV AVGY GS G + ++KNSWG WGE+G+ R+ + + G CG+ K
Sbjct: 276 ACCQDLNHGVLAVGYDDGSKGGTPHYVIKNSWGEGWGEQGFFRLAAKSSEASGACGVYKA 335
Query: 345 ASYPIKK 351
ASYP+KK
Sbjct: 336 ASYPLKK 342
>gi|400180449|gb|AFP73361.1| cysteine protease [Solanum chilense]
Length = 344
Score = 280 bits (717), Expect = 6e-73, Method: Compositional matrix adjust.
Identities = 154/354 (43%), Positives = 218/354 (61%), Gaps = 19/354 (5%)
Query: 1 MALSSQFKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYE 60
MA+ ILI+ FF+ S F + G S L+ +++ E WMS+ +VY+
Sbjct: 1 MAMKVDLMNILITL---FFVISMF--NTQTRGRSQPKLSVSER----HELWMSRHGRVYK 51
Query: 61 SLDEKLERFEIFKDNLRHIDETNRKIK-NYWLGLNEFADLRHEEFKEMFLGLK-PD-LAR 117
EK ERF IFK N++ I+ N+ +Y LG+NEFAD+ +EF F GL P+
Sbjct: 52 DEVEKGERFMIFKKNMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLS 111
Query: 118 RKDQSHEDFSYKDVVD--LPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVT 175
S +F D+ D +P ++DWR+ GAVT VK+QG CG CWAFS V ++EG +I T
Sbjct: 112 PSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGQCGCCWAFSAVGSLEGAYKIAT 171
Query: 176 GNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKG 235
G L SEQEL+DC T N GCNGG M AF +I+ GG+ +E DY Y+ E+ TC ++
Sbjct: 172 GKLMEFSEQELLDC-TTNNYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCR-SQE 229
Query: 236 ESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHG 295
++ V I+ Y VP+ E SLL+A+ QP+S+ I AS +D QFY+ G YDG C +++H
Sbjct: 230 KTAAVQISSYQVVPE-GETSLLQAVTKQPVSIGIAAS-QDLQFYAEGTYDGSCADRINHA 287
Query: 296 VAAVGYGS-TRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
V A+GYG+ +G Y ++KNSWG WGE G++++ R++G P GLC I KM+SYP
Sbjct: 288 VTAIGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|147772785|emb|CAN62838.1| hypothetical protein VITISV_003391 [Vitis vinifera]
Length = 298
Score = 280 bits (716), Expect = 7e-73, Method: Compositional matrix adjust.
Identities = 150/350 (42%), Positives = 202/350 (57%), Gaps = 56/350 (16%)
Query: 1 MALSSQFKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYE 60
MA ++Q++ + ++ FI +++A + + + E WM+++ ++Y+
Sbjct: 1 MASTNQYQYVSMAL---LFILAAWASQ------ATSRSLHEASMYERHEDWMARYGRMYK 51
Query: 61 SLDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKD 120
+EK +RF+IFKDN+
Sbjct: 52 DANEKEKRFKIFKDNVAQATT--------------------------------------- 72
Query: 121 QSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLAS 180
F Y++V +P ++DWRKKGAVT +K+Q CGSCWAFS VAA EGI QI TG L S
Sbjct: 73 -----FKYENVTAVPSTIDWRKKGAVTPIKDQQQCGSCWAFSAVAATEGITQITTGKLIS 127
Query: 181 LSEQELIDCDNT-YNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEV 239
LSEQEL+DCD N GC+GGL D AF++I GL E YPY ++GTC K
Sbjct: 128 LSEQELVDCDTGGENQGCSGGLXDDAFRFI-XIHGLASEATYPYEGDDGTCNSKKEAHPA 186
Query: 240 VTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAV 299
I GY DVP N+E +L KA+A+QP++VAI+A G +FQFY+ GV+ G CGT+LDHGVAAV
Sbjct: 187 AKIKGYEDVPANNEKALQKAVAHQPVAVAIDAGGFEFQFYTSGVFTGQCGTELDHGVAAV 246
Query: 300 GYG-STRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
GYG G+ Y +VKNSWG WGE+GYIRM+R+ EGLCGI ASYP
Sbjct: 247 GYGIGDDGMXYWLVKNSWGTGWGEEGYIRMQRDVTAKEGLCGIAMQASYP 296
>gi|400180391|gb|AFP73334.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 280 bits (715), Expect = 9e-73, Method: Compositional matrix adjust.
Identities = 156/356 (43%), Positives = 219/356 (61%), Gaps = 23/356 (6%)
Query: 1 MALSSQFKTILIS--FCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKV 58
MA+ +ILI+ F IS F + AR S L+ +++ E WMS+ +V
Sbjct: 1 MAMKVDLMSILITLFFVISMFNSQTRAR-------SQPKLSVSER----HELWMSRHGRV 49
Query: 59 YESLDEKLERFEIFKDNLRHIDETNRKIK-NYWLGLNEFADLRHEEFKEMFLGLK-PD-L 115
Y+ EK ERF IFK+N++ I+ N+ +Y LG+NEFAD+ +EF F GL P+
Sbjct: 50 YKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSY 109
Query: 116 ARRKDQSHEDFSYKDVVD--LPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQI 173
S +F D+ D +P ++DWR+ GAVT VK+QG CG CWAFS V ++EG +I
Sbjct: 110 LSPSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKI 169
Query: 174 VTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMT 233
TGNL SEQEL+DC T N GCNGG M AF +I GG+ +E DY Y+ E+ TC +
Sbjct: 170 ATGNLMEFSEQELLDC-TTNNYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCR-S 227
Query: 234 KGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLD 293
+ ++ V I+ Y VP+ E SLL+A+ QP+S+ I AS +D QF +GG YDG C +++
Sbjct: 228 QEKTAAVQISSYQVVPE-GETSLLQAVTKQPVSIGIAAS-QDLQFCAGGTYDGSCADRIN 285
Query: 294 HGVAAVGYGS-TRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
H V A+GYG+ +G Y ++KNSWG WGE G++++ R+ G P GLC I KM+SYP
Sbjct: 286 HAVTAIGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDYGNPAGLCDIAKMSSYP 341
>gi|212275830|ref|NP_001130503.1| cysteine protease 1 [Zea mays]
gi|194689328|gb|ACF78748.1| unknown [Zea mays]
gi|219886279|gb|ACL53514.1| unknown [Zea mays]
gi|238010470|gb|ACR36270.1| unknown [Zea mays]
gi|413920875|gb|AFW60807.1| cysteine protease 1 [Zea mays]
Length = 354
Score = 280 bits (715), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 157/360 (43%), Positives = 212/360 (58%), Gaps = 33/360 (9%)
Query: 8 KTILISFCISFFIRS-----SFARDFSIV---GYSPEDLTSNDKLIDLFESWMSKFEKVY 59
KT++ ++ I + + ARD S GY E + + WM++ + Y
Sbjct: 9 KTVITFTAVALTILAVTTMMAEARDLSSTSTGGYGEEAMKVR------HQQWMAEHGRTY 62
Query: 60 ESLDEKLERFEIFKDNLRHIDETNRK---IKNYWLGLNEFADLRHEEFKEMFLGLKPDLA 116
EK RF++FK N +D +N K+Y L LNEFAD+ ++EF M+ GL+P A
Sbjct: 63 RDEAEKAHRFQVFKANADFVDASNAAGDDKKSYRLELNEFADMTNDEFMAMYTGLRPVPA 122
Query: 117 RRKDQS---HEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQI 173
K + + + + D D ++VDWR+KGAVT +KNQG CG CWAF+ VAAVEGI+QI
Sbjct: 123 GAKKMAGFKYGNVTLSDADDDQQTVDWRQKGAVTGIKNQGQCGCCWAFAAVAAVEGIHQI 182
Query: 174 VTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMT 233
TGNL SLSEQ+++DCD NNGCNGG +D AFQYIV GGL E+ YPY + C+
Sbjct: 183 TTGNLVSLSEQQVLDCDTDGNNGCNGGYIDNAFQYIVGNGGLGTEDAYPYTAAQAMCQSV 242
Query: 234 KGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYD-GHCGT-- 290
+ V I+GY DVP E +L A+ANQP+SVAI+A +FQ Y GGV C T
Sbjct: 243 Q---PVAAISGYQDVPSGDEAALAAAVANQPVSVAIDA--HNFQLYGGGVMTAASCSTPP 297
Query: 291 QLDHGVAAVGYGSTR-GLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPI 349
L+H V AVGYG+ G Y ++KN WG WGE GY+R++R CG+ + ASYP+
Sbjct: 298 NLNHAVTAVGYGTAEDGTPYWLLKNQWGQNWGEGGYLRLERGANA----CGVAQQASYPV 353
>gi|118145|sp|P20721.1|CYSPL_SOLLC RecName: Full=Low-temperature-induced cysteine proteinase; Flags:
Precursor
gi|806314|gb|AAA66308.1| thiol protease, partial [Solanum lycopersicum]
Length = 346
Score = 279 bits (714), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 125/217 (57%), Positives = 162/217 (74%)
Query: 134 LPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTY 193
LP+S+DWR+KG + VK+QGSCGSCWAFS VAA+E IN IVTGNL SLSEQEL+DCD +Y
Sbjct: 18 LPESIDWREKGVLVGVKDQGSCGSCWAFSAVAAMESINAIVTGNLISLSEQELVDCDRSY 77
Query: 194 NNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSE 253
N GC+GGLMDYAF++++ GG+ EEDYPY G C+ + ++VV I+ Y DVP N+E
Sbjct: 78 NEGCDGGLMDYAFEFVIKNGGIDTEEDYPYKERNGVCDQYRKNAKVVKIDSYEDVPVNNE 137
Query: 254 DSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVK 313
+L KA+A+QP+S+A+EA GRDFQ Y G++ G CGT +DHGV GYG+ G+DY IV+
Sbjct: 138 KALQKAVAHQPVSIALEAGGRDFQHYKSGIFTGKCGTAVDHGVVIAGYGTENGMDYWIVR 197
Query: 314 NSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
NSWG E GY+R++RN GLCG+ SYP+K
Sbjct: 198 NSWGANCRENGYLRVQRNVSSSSGLCGLAIEPSYPVK 234
>gi|400180387|gb|AFP73332.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 279 bits (714), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 154/354 (43%), Positives = 218/354 (61%), Gaps = 19/354 (5%)
Query: 1 MALSSQFKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYE 60
MA+ ILI+ FF+ S F + G S L+ +++ E WMS+ +VY+
Sbjct: 1 MAMKVDLMNILITL---FFVISMF--NTQTRGRSQPKLSVSER----HELWMSRHGRVYK 51
Query: 61 SLDEKLERFEIFKDNLRHIDETNRKIK-NYWLGLNEFADLRHEEFKEMFLGLK-PD-LAR 117
EK+ERF IFK+N++ I+ N+ +Y LG+NEFAD+ +EF F GL P+
Sbjct: 52 DEVEKVERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLS 111
Query: 118 RKDQSHEDFSYKDVVD--LPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVT 175
S + D+ D +P ++DW + GAVT VK+QG CG CWAFS V ++EG +I T
Sbjct: 112 PSPMSSTELKINDLSDDDMPSNLDWIESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIAT 171
Query: 176 GNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKG 235
GNL SEQEL+DC T N GCNGG M AF +I GG+ +E DY Y+ E+ TC ++
Sbjct: 172 GNLMEFSEQELLDC-TTNNYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCR-SQE 229
Query: 236 ESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHG 295
++ V I+ Y VP+ E SLL+A+ QP+S+ I AS +D QFY+GG YDG C +++H
Sbjct: 230 KTAAVQISSYQVVPE-GETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHA 287
Query: 296 VAAVGYGS-TRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
V A+GYG+ +G Y ++KNSWG WGE G++++ R+ G P GLC I KM+SYP
Sbjct: 288 VTAIGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDYGNPAGLCDIAKMSSYP 341
>gi|66735056|gb|AAY53767.1| cysteine protease [Saprolegnia parasitica]
Length = 523
Score = 279 bits (713), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 142/304 (46%), Positives = 192/304 (63%), Gaps = 5/304 (1%)
Query: 48 FESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKN-YWLGLNEFADLRHEEFKE 106
F SWM KF L E + RFE+F N + I+ N+ + + +G NE++ L +EFK+
Sbjct: 28 FLSWMKKFAVKLNPL-EWVHRFEVFILNDQRIEAHNKDASSSFTMGHNEYSHLTFDEFKK 86
Query: 107 MFLGLK--PDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTV 164
+ GL+ P + + + ++ D+P +DW ++G VT VKNQG CGSCWAFST
Sbjct: 87 LRTGLRVSPSYIQSRAKYALMAPAVNMTDVPNEMDWVEQGGVTPVKNQGMCGSCWAFSTT 146
Query: 165 AAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYI 224
A+EG + + L S+SEQEL+DCD+ + GCNGGLMD AF+++ + GL KEEDYPY
Sbjct: 147 GAIEGAAFVSSKQLVSVSEQELVDCDHNGDMGCNGGLMDNAFKWVKTHKGLCKEEDYPYH 206
Query: 225 MEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVY 284
+EGTC + K + V + +HDVP N E +L A+A QP+SVAIEA +FQFY GV+
Sbjct: 207 AKEGTCALKKCKP-VTKVTAFHDVPANDEQALKAAVAKQPVSVAIEADQPEFQFYKSGVF 265
Query: 285 DGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKM 344
D CGT+LDHGV VGYG G Y VKNSWG WG+KGYI++ R G G CG+ +
Sbjct: 266 DKSCGTKLDHGVLVVGYGEEGGKKYWKVKNSWGADWGDKGYIKLAREFGPETGQCGVAMV 325
Query: 345 ASYP 348
SYP
Sbjct: 326 PSYP 329
>gi|194352760|emb|CAQ00108.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
gi|326510977|dbj|BAJ91836.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326523875|dbj|BAJ96948.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326528631|dbj|BAJ97337.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 368
Score = 279 bits (713), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 145/333 (43%), Positives = 194/333 (58%), Gaps = 26/333 (7%)
Query: 42 DKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIK---NYWLGLNEFAD 98
D + F W ++ + Y + +E+ R ++ N+R+I+ TN Y LG + D
Sbjct: 36 DPMAQRFRRWKAEHSRTYATPEEERHRLRVYARNMRYIEATNGDAGAGLTYELGETAYTD 95
Query: 99 LRHEEFKEMFLGLKPDLARRKDQ-------------------SHEDFSYKDVVDLPKSVD 139
L +EF M+ P L+ D + P SVD
Sbjct: 96 LTSDEFTAMYTSRAPPLSDDDDDLPMTMITTRAGPVAAAGGGGWLQVYVNESAGAPASVD 155
Query: 140 WRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNG 199
WR++GAVT VKNQG CGSCWAFSTVA +EGI+QI TG LASLSEQEL+DCD ++GCNG
Sbjct: 156 WRERGAVTAVKNQGQCGSCWAFSTVAVIEGIHQIKTGKLASLSEQELVDCDK-LDHGCNG 214
Query: 200 GLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKA 259
G+ A Q+I S GG+ ++DYPY ++ TC+ K +I+G+ V SE SL A
Sbjct: 215 GVSYRALQWITSNGGITSQDDYPYTAKDDTCDTKKLSHHAASISGFQRVATRSELSLTNA 274
Query: 260 LANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYG--STRGLDYIIVKNSWG 317
+A QP++V+IEA G +FQ Y GVY+G CGT+L+HGV VGYG G Y IVKNSWG
Sbjct: 275 VAMQPVAVSIEAGGANFQHYRNGVYNGPCGTRLNHGVTVVGYGEDEVTGESYWIVKNSWG 334
Query: 318 PKWGEKGYIRMKRN-TGKPEGLCGINKMASYPI 349
KWG+ GY+RMK+ KPEG+CGI S+P+
Sbjct: 335 EKWGDNGYLRMKKGIIDKPEGICGIAIRPSFPL 367
>gi|157093563|gb|ABV22436.1| cysteine proteinase [Oxyrrhis marina]
Length = 329
Score = 278 bits (712), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 141/304 (46%), Positives = 189/304 (62%), Gaps = 5/304 (1%)
Query: 48 FESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEM 107
+E + +KF + Y +E+ ER +F N++ I+E N K Y LG+N+FADL EEF +
Sbjct: 19 WEEFKAKFGESYNGEEEEAERKGVFAQNVQLINEENSKGHTYTLGVNQFADLTVEEFSKT 78
Query: 108 FLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAV 167
++G K + D ++ + LP SVDW +GAVT VKNQG CGSCW+FST ++
Sbjct: 79 YMGFKKPAQKYGDAAYLGRHVYNGEALPTSVDWSSQGAVTPVKNQGQCGSCWSFSTTGSL 138
Query: 168 EGINQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIME 226
EG N+I TG L SLSEQ+ +DC TY N GCNGGLMD AF+Y L E+ YPY
Sbjct: 139 EGANEISTGKLVSLSEQQFVDCAGTYGNQGCNGGLMDSAFKY-AEANALCTEQSYPYKGT 197
Query: 227 EGTCEMTKGESEVV--TINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVY 284
+G+C+ + + + +++GY DV +SE ++ A+A QP+S+AIEA FQ YSGGV
Sbjct: 198 DGSCQASSCSTGLAKGSVSGYKDVSSDSEQDMMSAVAQQPVSIAIEADKSVFQLYSGGVL 257
Query: 285 DGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKM 344
G CG LDHGV AVGYG+ G DY VKNSWG WG GY+ ++R G G CG+
Sbjct: 258 TGACGASLDHGVLAVGYGTLSGTDYWKVKNSWGSTWGMSGYVLLQRGKGG-SGECGLLSE 316
Query: 345 ASYP 348
SYP
Sbjct: 317 PSYP 320
>gi|413933049|gb|AFW67600.1| cysteine protease 1 [Zea mays]
Length = 341
Score = 278 bits (712), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 142/304 (46%), Positives = 189/304 (62%), Gaps = 4/304 (1%)
Query: 49 ESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRK-IKNYWLGLNEFADLRHEEFKEM 107
E WM++ + Y+ EK R E+F+ N ID N ++ L N FADL EEF+
Sbjct: 39 EKWMAEHGRAYKDEAEKARRLEVFRANAELIDSFNAAGTHSHRLATNRFADLTVEEFRAA 98
Query: 108 FLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAV 167
GL+P A + + D +SVDWR GAVT VK+QG+CG CWAFS VAAV
Sbjct: 99 RTGLRPRPAPSAGAGRFRYENFSLADAAQSVDWRAMGAVTGVKDQGACGCCWAFSAVAAV 158
Query: 168 EGINQIVTGNLASLSEQELIDCD-NTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIME 226
EG+N+I TG L SLSEQEL+DCD + + GC+GGLMD AFQ++ GGL E YPY
Sbjct: 159 EGLNKIRTGRLVSLSEQELVDCDVSGVDQGCDGGLMDNAFQFVARRGGLASESGYPYQGR 218
Query: 227 EGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDG 286
+G C + + +I G+ DVP+N+E +L A+ANQP+SVAI F+FY GV G
Sbjct: 219 DGPCRSSAAAARAASIRGHEDVPRNNEAALAAAVANQPVSVAINGEDMAFRFYDSGVLGG 278
Query: 287 HCGTQLDHGVAAVGYGSTR-GLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMA 345
CGT L+H + AVGYG+ G Y ++KNSWG WGE GY+R++R + EG+CG+ K+
Sbjct: 279 ACGTDLNHAITAVGYGTANDGTRYWLMKNSWGASWGEGGYVRIRRGV-RGEGVCGLAKLP 337
Query: 346 SYPI 349
SYP+
Sbjct: 338 SYPV 341
>gi|226502454|ref|NP_001140922.1| hypothetical protein [Zea mays]
gi|223948637|gb|ACN28402.1| unknown [Zea mays]
gi|413920877|gb|AFW60809.1| hypothetical protein ZEAMMB73_830238 [Zea mays]
Length = 354
Score = 278 bits (711), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 155/360 (43%), Positives = 211/360 (58%), Gaps = 33/360 (9%)
Query: 8 KTILISFCISFFIRS-----SFARDFSIV---GYSPEDLTSNDKLIDLFESWMSKFEKVY 59
KT++ ++ I + + ARD S GY E + + WM++ + Y
Sbjct: 9 KTVIAFTAVALTILAVKTMMAEARDLSSTSTGGYGEEAMKVR------HQQWMAEHGRTY 62
Query: 60 ESLDEKLERFEIFKDNLRHIDETNRK---IKNYWLGLNEFADLRHEEFKEMFLGLKPDLA 116
EK RF++FK N +D +N K+Y + LNEFAD+ ++EF M+ GL+P A
Sbjct: 63 RDEAEKAHRFQVFKANADFVDASNAAGDDKKSYRMELNEFADMTNDEFMAMYTGLRPVPA 122
Query: 117 RRKDQS---HEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQI 173
K + + + + D D ++VDWR+KGAVT +KNQG CG CWAF+ VAAVEGI+QI
Sbjct: 123 GAKKMAGFKYGNVTLSDADDNQQTVDWRQKGAVTGIKNQGQCGCCWAFAAVAAVEGIHQI 182
Query: 174 VTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMT 233
TGNL SLSEQ+++DCD NNGCNGG +D AFQYI GGL E+ YPY + C+
Sbjct: 183 TTGNLVSLSEQQVLDCDTEGNNGCNGGYIDNAFQYIAGNGGLATEDAYPYTAAQAMCQSV 242
Query: 234 KGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYD-GHCGT-- 290
+ V I+GY DVP E +L A+ANQP+SVAI+A +FQ Y GGV C T
Sbjct: 243 Q---PVAAISGYQDVPSGDEAALAAAVANQPVSVAIDA--HNFQLYGGGVMTAASCSTPP 297
Query: 291 QLDHGVAAVGYGSTR-GLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPI 349
L+H V AVGYG+ G Y ++KN WG WGE GY+R++R CG+ + ASYP+
Sbjct: 298 NLNHAVTAVGYGTAEDGTPYWLLKNQWGQNWGEGGYLRLERGANA----CGVAQQASYPV 353
>gi|357160095|ref|XP_003578656.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP2-like
[Brachypodium distachyon]
Length = 377
Score = 278 bits (710), Expect = 4e-72, Method: Compositional matrix adjust.
Identities = 149/344 (43%), Positives = 197/344 (57%), Gaps = 24/344 (6%)
Query: 29 SIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIK- 87
+ + D T + F+ W ++ + Y + DE+L R ++ N+R+I+ N
Sbjct: 34 TTTAFEETDPTILQTMAPRFQRWKAEHGRAYATRDEELRRLRVYARNVRYIEAANGDPAA 93
Query: 88 --NYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQS-----------------HEDFSY 128
Y LG + DL +EF M+ P L+ D++ + +
Sbjct: 94 GLTYQLGETAYTDLTADEFTAMYTSPSPVLSAHDDEAAGAMMITTRAGAVDAGGQQVYFN 153
Query: 129 KDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELID 188
P SVDWR KGAVT VKNQG CGSCWAFSTVA VEGI+QI TGNL SLSEQEL+D
Sbjct: 154 VSTAGAPASVDWRAKGAVTEVKNQGRCGSCWAFSTVAVVEGIHQIRTGNLISLSEQELVD 213
Query: 189 CDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDV 248
CD T + GC+GG+ +A ++I S GG+ E DYPY ++G C K I+G+ V
Sbjct: 214 CD-TLDYGCDGGVSYHALEWIASNGGIATEADYPYTGKDGACVANKLPLHAAAISGFARV 272
Query: 249 PQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAV--GYGSTRG 306
SE SL A+A QP++V+IEA G +FQ Y GVY+G CGT+L+HGV V G G
Sbjct: 273 ATRSEPSLANAVAAQPVAVSIEAGGANFQHYVKGVYNGPCGTRLNHGVTVVGYGEEEGDG 332
Query: 307 LDYIIVKNSWGPKWGEKGYIRMKRN-TGKPEGLCGINKMASYPI 349
Y IVKNSWG KWG+ GY RMK++ GKPEGLCGI S+P+
Sbjct: 333 EKYWIVKNSWGKKWGDGGYFRMKKDVAGKPEGLCGIAIRPSFPL 376
>gi|113120267|gb|ABI30273.1| VXH-B, partial [Vasconcellea x heilbornii]
Length = 266
Score = 278 bits (710), Expect = 4e-72, Method: Compositional matrix adjust.
Identities = 139/269 (51%), Positives = 192/269 (71%), Gaps = 5/269 (1%)
Query: 1 MALSSQF-KTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVY 59
MA S F K ++ C+S + S+ DFSI GYSP+DLTS +KLI+LF+SWM ++ KVY
Sbjct: 1 MATISSFSKLFFVAICLSVRMGLSYG-DFSIGGYSPDDLTSTEKLINLFDSWMVEYGKVY 59
Query: 60 ESLDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLG-LKPDLARR 118
+ +DEK+ +FEIFKDNL++IDETN+K YWLGL F DL ++EFKE ++G + +
Sbjct: 60 KDIDEKIYKFEIFKDNLKYIDETNKKNNTYWLGLTSFTDLTNDEFKEKYVGSISESWSTT 119
Query: 119 KDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNL 178
++ + E F Y DVV++P S+DWR+KGAVT V++QGSCGSCW FS+VAAVEGIN+IVTG L
Sbjct: 120 EESNDEGFIYDDVVNIPASIDWRQKGAVTPVRHQGSCGSCWTFSSVAAVEGINKIVTGRL 179
Query: 179 ASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESE 238
SLSEQEL+DC+ + GC GG YA QY V+ G+H ++YPY + C + +
Sbjct: 180 VSLSEQELLDCERR-SYGCRGGFPPYALQY-VAQNGIHLRQNYPYEGVQRQCRARQVQGP 237
Query: 239 VVTINGYHDVPQNSEDSLLKALANQPLSV 267
V +G VP+N+E +L++A+ANQP+SV
Sbjct: 238 KVKTDGVGRVPRNNERALIQAIANQPVSV 266
>gi|330803820|ref|XP_003289900.1| hypothetical protein DICPUDRAFT_80649 [Dictyostelium purpureum]
gi|325080011|gb|EGC33585.1| hypothetical protein DICPUDRAFT_80649 [Dictyostelium purpureum]
Length = 328
Score = 277 bits (709), Expect = 5e-72, Method: Compositional matrix adjust.
Identities = 158/344 (45%), Positives = 204/344 (59%), Gaps = 25/344 (7%)
Query: 10 ILISFCISFFIRS--SFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLE 67
I+++ F I + S AR FS Y F++WM K +K Y + DE
Sbjct: 3 IILALVFCFLIVNCISAARVFSQKQYQTA-----------FQNWMVKHQKSYTN-DEFGS 50
Query: 68 RFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFS 127
R+ IF+DN+ + + N+K + LGLN ADL ++E++ ++LG K + + +
Sbjct: 51 RYTIFQDNMDFVTKWNQKGSDTILGLNSMADLTNQEYQRIYLGTKTTVKK----PNLIIG 106
Query: 128 YKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELI 187
DV P SVDWR GAVT VKNQG CG C++FST +VEGI++I + L SLSEQ+++
Sbjct: 107 VTDVSKAPASVDWRANGAVTAVKNQGQCGGCYSFSTTGSVEGIHEITSKQLVSLSEQQIL 166
Query: 188 DCDNTY-NNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYH 246
DC + NNGC+GGLM +F+YI++ GGL E YPY G C+ K TI GY
Sbjct: 167 DCSGSEGNNGCDGGLMTNSFEYIIAVGGLDTEASYPYEGVVGKCKFNKANIG-ATITGYK 225
Query: 247 DVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGV-YDGHC-GTQLDHGVAAVGYGST 304
+V SE L A+A QP+SVAI+AS FQ YS GV Y+ C TQLDHGV AVGYGS
Sbjct: 226 NVKSGSESDLQTAVAAQPVSVAIDASQNSFQLYSSGVYYEPACSSTQLDHGVLAVGYGSQ 285
Query: 305 RGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
G DY IVKNSWG WGEKG+I M RN CGI MASYP
Sbjct: 286 SGQDYWIVKNSWGADWGEKGFILMARN---KHNNCGIATMASYP 326
>gi|388501884|gb|AFK39008.1| unknown [Lotus japonicus]
Length = 151
Score = 277 bits (708), Expect = 7e-72, Method: Compositional matrix adjust.
Identities = 127/151 (84%), Positives = 140/151 (92%)
Query: 202 MDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALA 261
MDYAF +IV GGLHKE+DYPYIMEEGTCEM+K ES+VVTI+GYHDVPQN+E SLLKALA
Sbjct: 1 MDYAFSFIVENGGLHKEDDYPYIMEEGTCEMSKEESQVVTISGYHDVPQNNEQSLLKALA 60
Query: 262 NQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWG 321
NQPLSVAIEASGRDFQFYSGGV+DGHCGTQLDHGVAAVGYG+++GLDYI VKNSWG KWG
Sbjct: 61 NQPLSVAIEASGRDFQFYSGGVFDGHCGTQLDHGVAAVGYGTSKGLDYITVKNSWGTKWG 120
Query: 322 EKGYIRMKRNTGKPEGLCGINKMASYPIKKK 352
EKGYIR +RN GKPEG+CG+ KMASYP KKK
Sbjct: 121 EKGYIRFRRNNGKPEGMCGLYKMASYPTKKK 151
>gi|2463588|dbj|BAA22546.1| FB1035 precursor [Ananas comosus]
Length = 324
Score = 276 bits (707), Expect = 9e-72, Method: Compositional matrix adjust.
Identities = 136/312 (43%), Positives = 197/312 (63%), Gaps = 11/312 (3%)
Query: 41 NDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKN-YWLGLNEFADL 99
ND ++ FE WM+++ ++Y+ DEK+ RF+IFK+N++HI+ N + N Y LG+N+F D+
Sbjct: 3 NDPMMKRFEEWMAEYGRIYKDNDEKMRRFQIFKNNVKHIETFNSRNGNSYTLGINQFTDM 62
Query: 100 RHEEFKEMFLG--LKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGS 157
EF + G L ++ R S +D ++ +P+S+DWR GAV VKNQ CGS
Sbjct: 63 TKSEFVAQYTGVSLPLNIEREPVVSFDDV---NISAVPQSIDWRDYGAVNEVKNQNPCGS 119
Query: 158 CWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHK 217
CWAF+ +A VEGI +I TG L SLSEQE++DC +Y GC GG ++ A+ +I+S G+
Sbjct: 120 CWAFAAIATVEGIYKIKTGYLVSLSEQEVLDCAVSY--GCKGGWVNKAYDFIISNNGVTT 177
Query: 218 EEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQ 277
EE+YPY +GTC + I GY V +N E S++ A++NQP++ I+AS +FQ
Sbjct: 178 EENYPYQAYQGTCNANSFPNSAY-ITGYSYVRRNDERSMMYAVSNQPIAALIDAS-ENFQ 235
Query: 278 FYSGGVYDGHCGTQLDHGVAAVGYGS-TRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPE 336
+Y+GGV+ G CGT L+H + +GYG + G Y IV+NSWG WGE GY+RM R
Sbjct: 236 YYNGGVFSGPCGTSLNHAITIIGYGQDSSGTKYWIVRNSWGSSWGEGGYVRMARGVSSSS 295
Query: 337 GLCGINKMASYP 348
G CGI +P
Sbjct: 296 GACGIAMSPLFP 307
>gi|350535639|ref|NP_001233949.1| phytophthora-inhibited protease 1 [Solanum lycopersicum]
gi|108937128|gb|ABG23376.1| phytophthora-inhibited protease 1 [Solanum lycopersicum]
Length = 345
Score = 276 bits (706), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 146/332 (43%), Positives = 206/332 (62%), Gaps = 16/332 (4%)
Query: 28 FSIVGYSPEDLTSND----KLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETN 83
FSI+ P +TS + +++ E+WM +VY+ EK RF+ FK+N+ I+ N
Sbjct: 17 FSILSLYPFIVTSRNLKELSMLERHENWMVHHGRVYKDDIEKEHRFKTFKENVEFIESFN 76
Query: 84 RK-IKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHE--DFSYKDVVDLPKSVDW 140
+ + Y L +N++ADL EEF F+GL L +++ + F Y V ++P S+DW
Sbjct: 77 KNGTQRYKLAVNKYADLTTEEFTTSFMGLDTSLLSQQESTATTTSFKYDSVTEVPNSMDW 136
Query: 141 RKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGG 200
RK+G+VT VK+QG CG CWAFS AA+EG QI L SLSEQ+L+DC +T N GC GG
Sbjct: 137 RKRGSVTGVKDQGVCGCCWAFSAAAAIEGAYQIANNELISLSEQQLLDC-STQNKGCEGG 195
Query: 201 LMDYAFQYIVST--GGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLK 258
LM A+ +++ GG+ E +YPY + C+ + VTINGY VP + E SLLK
Sbjct: 196 LMTVAYDFLLQNNGGGITTETNYPYEEAQNVCKTE--QPAAVTINGYEVVPSD-ESSLLK 252
Query: 259 ALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTR--GLDYIIVKNSW 316
A+ NQP+SV I A+ +F Y G+YDG C ++L+H V +GYG++ G Y IVKNSW
Sbjct: 253 AVVNQPISVGI-AANDEFHMYGSGIYDGSCNSRLNHAVTVIGYGTSEEDGTKYWIVKNSW 311
Query: 317 GPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
G WGE+GY+R+ R+ G G CGI K+AS+P
Sbjct: 312 GSDWGEEGYMRIARDVGVDGGHCGIAKVASFP 343
>gi|42563538|gb|AAS20467.1| cysteine protease-like protein [Pelargonium x hortorum]
Length = 234
Score = 276 bits (705), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 128/198 (64%), Positives = 154/198 (77%), Gaps = 1/198 (0%)
Query: 155 CGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGG 214
CG CWAFST+AAVEGIN IVTG L SLSEQEL+DCD +YN GCNGGLMDYAF++I+ GG
Sbjct: 1 CGRCWAFSTIAAVEGINHIVTGELISLSEQELVDCDRSYNQGCNGGLMDYAFEFIIKNGG 60
Query: 215 LHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGR 274
+ EEDYPY +GTC+ + ++VVTI+GY DVP+N E+SL KA+A QP+SVAIEA GR
Sbjct: 61 IDSEEDYPYKAVDGTCDPIRKNAKVVTIDGYEDVPENDENSLKKAVAYQPVSVAIEAGGR 120
Query: 275 DFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGK 334
+FQ Y G++ G CGT LDHGVAAVGYG+ G+DY IV+NSWG WGE GYIRM+RN
Sbjct: 121 EFQLYQSGIFTGRCGTALDHGVAAVGYGTENGIDYWIVRNSWGSSWGENGYIRMERNVKT 180
Query: 335 PE-GLCGINKMASYPIKK 351
+ G CGI ASYP K+
Sbjct: 181 TKTGKCGIAMEASYPTKE 198
>gi|356543010|ref|XP_003539956.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 306
Score = 275 bits (704), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 151/306 (49%), Positives = 185/306 (60%), Gaps = 8/306 (2%)
Query: 48 FESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEM 107
FE W+ + ++ Y+ +E RF I++ NL +I+ N + +Y L N+FADL +EEF
Sbjct: 5 FERWLKQNDRXYKDKEEWEVRFGIYQANLEYIECKNSQEXSYNLTDNKFADLTNEEFVSP 64
Query: 108 FLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAV 167
+LG H F Y + DLP+S DWRK+GAV+ +K+QG+CGSCWAFS VAAV
Sbjct: 65 YLGFGTRFL-----PHTGFMYHEHEDLPESKDWRKEGAVSDIKDQGNCGSCWAFSAVAAV 119
Query: 168 EGINQIVTGNLASLSEQELIDCD-NTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIME 226
EGIN+I +G L SLSEQE DCD N GC GGLMD AF +I GGL +DYPY
Sbjct: 120 EGINKIKSGKLVSLSEQEFRDCDVEDGNQGCEGGLMDTAFAFIKKNGGLTTSKDYPYEGV 179
Query: 227 EGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALA--NQPLSVAIEASGRDFQFYSGGVY 284
+GTC K I+G+ VP N E L A NQ SVAI+A G FQ Y GV+
Sbjct: 180 DGTCNKEKALHHAANISGHVKVPANDEAMLKAKAAAANQXESVAIDAGGHAFQLYLKGVF 239
Query: 285 DGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKM 344
G CG QL+HGV VGYG Y IVKNSWG WGE GYIRMKR+ G CGI
Sbjct: 240 SGICGKQLNHGVTIVGYGKGTSDKYWIVKNSWGADWGESGYIRMKRDAFDKAGTCGIAMQ 299
Query: 345 ASYPIK 350
ASYP+K
Sbjct: 300 ASYPLK 305
>gi|357507617|ref|XP_003624097.1| Cysteine protease [Medicago truncatula]
gi|355499112|gb|AES80315.1| Cysteine protease [Medicago truncatula]
Length = 340
Score = 275 bits (704), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 148/308 (48%), Positives = 196/308 (63%), Gaps = 11/308 (3%)
Query: 44 LIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRK-IKNYWLGLNEFADLRHE 102
L + ++ W K+ +Y+ E+ + +IFK N+ +ID N K+Y L +N FADL E
Sbjct: 35 LSERYKHWKIKYRVIYKDDAEEEKHIQIFKHNVAYIDSFNAAGNKSYKLTINRFADLPTE 94
Query: 103 EFKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFS 162
+ F K + + F YK++ D+P +VDWRK+GAVT VKNQ CGSCWAFS
Sbjct: 95 PSDDGFKKRKLE-----PTTSSLFKYKNITDIPAAVDWRKRGAVTPVKNQRECGSCWAFS 149
Query: 163 TVAAVEGINQIVTGNLASLSEQELID-CDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDY 221
V A+EGI QI +GNL SLSEQEL+D + + NGCNGG + AF++++ GG+ E Y
Sbjct: 150 AVGALEGIQQITSGNLVSLSEQELVDRVRSNWTNGCNGGYLIDAFEFVLENGGIATEASY 209
Query: 222 PYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSG 281
PY +G +K S V I Y VP+NSEDSLLK +ANQP+SV I+ SG +FYS
Sbjct: 210 PYRGVKGN--NSKKVSRQVQIKSYEQVPRNSEDSLLKVVANQPVSVGIDISGM-IRFYSS 266
Query: 282 GVYDGHCGTQLDHGVAAVGYGSTR-GLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCG 340
G++ G CGT+ +H V VGYG++ G Y +VKNSWG +WGEK YIRMKR+ EGLCG
Sbjct: 267 GIFTGECGTKPNHAVIIVGYGTSNDGTKYWLVKNSWGIRWGEKRYIRMKRDIDAKEGLCG 326
Query: 341 INKMASYP 348
I ASYP
Sbjct: 327 IPMDASYP 334
>gi|18396939|ref|NP_564320.1| Papain family cysteine protease [Arabidopsis thaliana]
gi|9502427|gb|AAF88126.1|AC021043_19 Putative cysteine proteinase [Arabidopsis thaliana]
gi|67633400|gb|AAY78625.1| peptidase C1A papain family protein [Arabidopsis thaliana]
gi|332192919|gb|AEE31040.1| Papain family cysteine protease [Arabidopsis thaliana]
Length = 346
Score = 275 bits (702), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 138/315 (43%), Positives = 197/315 (62%), Gaps = 13/315 (4%)
Query: 44 LIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDE-TNRKIKNYWLGLNEFADLRHE 102
++D + WM +F +VY+ EK R ++ +NL+ I+ N ++Y LG+NEF D E
Sbjct: 35 IVDYHQQWMIQFSRVYDDEFEKQLRLQVLTENLKFIESFNNMGNQSYKLGVNEFTDWTKE 94
Query: 103 EFKEMFLGLK------PDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCG 156
EF + GL+ P + + +++ DV+ K DWR +GAVT VK+QG CG
Sbjct: 95 EFLATYTGLRGVNVTSPFEVVNETKPAWNWTVSDVLGTNK--DWRNEGAVTPVKSQGECG 152
Query: 157 SCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLH 216
CWAFS +AAVEG+ +I GNL SLSEQ+L+DC NNGC GG AF YI+ G+
Sbjct: 153 GCWAFSAIAAVEGLTKIARGNLISLSEQQLLDCTREQNNGCKGGTFVNAFNYIIKHRGIS 212
Query: 217 KEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDF 276
E +YPY ++EG C + I G+ +VP N+E +LL+A++ QP++VAI+AS F
Sbjct: 213 SENEYPYQVKEGPCR--SNARPAILIRGFENVPSNNERALLEAVSRQPVAVAIDASEAGF 270
Query: 277 QFYSGGVYDG-HCGTQLDHGVAAVGYG-STRGLDYIIVKNSWGPKWGEKGYIRMKRNTGK 334
YSGGVY+ +CGT ++H V VGYG S G+ Y + KNSWG WGE GYIR++R+
Sbjct: 271 VHYSGGVYNARNCGTSVNHAVTLVGYGTSPEGMKYWLAKNSWGKTWGENGYIRIRRDVEW 330
Query: 335 PEGLCGINKMASYPI 349
P+G+CG+ + ASYP+
Sbjct: 331 PQGMCGVAQYASYPV 345
>gi|226499884|ref|NP_001148278.1| thiol protease SEN102 precursor [Zea mays]
gi|195617112|gb|ACG30386.1| thiol protease SEN102 precursor [Zea mays]
Length = 374
Score = 275 bits (702), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 156/353 (44%), Positives = 203/353 (57%), Gaps = 29/353 (8%)
Query: 23 SFARDFSIVGYSPEDLTSNDK-LIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDE 81
S AR G + ++++D +I+ F+ W + + K Y ++ E+ RF ++ N+ +I+
Sbjct: 24 SSARAHRRAGDTMGSMSNDDSSMIERFQRWKAAYNKSYATVAEERRRFRVYARNMAYIEA 83
Query: 82 TNRKIK----NYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVDL--- 134
TN + + Y LG + DL ++EF M+ P LA+ + VD
Sbjct: 84 TNAEAEAAGLTYELGETAYTDLTNQEFMAMYTA--PALAQLPADESVITTRAGPVDAVGG 141
Query: 135 ---------------PKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLA 179
P SVDWR GAVT VKNQG CGSCWAFSTVA VEGI QI TG L
Sbjct: 142 APGQLPVYVNLSASAPASVDWRASGAVTPVKNQGRCGSCWAFSTVAVVEGIYQIRTGKLV 201
Query: 180 SLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEV 239
SLSEQEL+DCD T ++GC+GG+ A ++I S GG+ E DYPY C K
Sbjct: 202 SLSEQELVDCD-TLDDGCDGGISYRALRWIASNGGITTEADYPYTGTTDACNRAKLSHNA 260
Query: 240 VTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAV 299
V+I G V SE SL A+A QP++V+IEA G +FQ Y GVY+G CGT L+HGV V
Sbjct: 261 VSIAGLRRVATRSEASLANAVAGQPVAVSIEAGGDNFQHYKKGVYNGPCGTNLNHGVTVV 320
Query: 300 GYG--STRGLDYIIVKNSWGPKWGEKGYIRMKRN-TGKPEGLCGINKMASYPI 349
GYG + G Y IVKNSWG WG+ GYIRMK++ GKPEGLCGI SYP+
Sbjct: 321 GYGQEAAAGDRYWIVKNSWGQGWGDDGYIRMKKDVAGKPEGLCGIAIRPSYPL 373
>gi|5853329|gb|AAD54424.1|AF182079_1 thiol protease [Matricaria chamomilla]
Length = 501
Score = 275 bits (702), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 157/364 (43%), Positives = 220/364 (60%), Gaps = 17/364 (4%)
Query: 1 MALSSQFKTILISFC-ISFFIRS-SFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKV 58
MA S+ TILI +S+ I + + +FSI+ D+ S+ K+ DLF W K
Sbjct: 1 MATSNSMITILIFLTYVSYSISTKTLPSEFSILEGQENDILSSAKVSDLFGKWKELHGKT 60
Query: 59 YESLDEKLERFEIFKDNLRHIDETNRKIK---NYWLGLNEFADLRHEEFKEMFL----GL 111
Y+ +E+ R E FK +++ + E N + K ++ +GLN+FADL +EEFKEM++ G
Sbjct: 61 YQHEEEENLRLENFKKSVKFVMEKNSERKSELDHTVGLNKFADLSNEEFKEMYMSKVKGS 120
Query: 112 KPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGIN 171
+ + + D P S+DWR KG VT +K+QG CGSCWAFS ++E N
Sbjct: 121 RSNELKMGGVKRNMSVSSRTCDAPTSLDWRDKGVVTPMKDQGQCGSCWAFSVSGSIESAN 180
Query: 172 QIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIM---EEG 228
I TG+L LSEQEL+DCD TY+ GC+GG MD A+++I+ GGL E+DYPY +G
Sbjct: 181 AIATGDLIRLSEQELVDCD-TYDYGCDGGNMDTAYRWIIKNGGLDSEDDYPYTSSNGRDG 239
Query: 229 TCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHC 288
C+ TK VV+++ Y +V N ED++L A+A P+++ I S DFQ Y+GGVY+G C
Sbjct: 240 KCDKTKSAKSVVSLDSYVEVESN-EDAVLCAVATTPVTIGIVGSAYDFQLYTGGVYNGQC 298
Query: 289 GTQ---LDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMA 345
++ +DH V VGYGS G DY IVKNSWG WG +GYI M+RNT G+CG+
Sbjct: 299 SSKPYDIDHAVLIVGYGSQDGKDYWIVKNSWGTYWGLEGYILMERNTDIKNGVCGMYLEP 358
Query: 346 SYPI 349
YPI
Sbjct: 359 VYPI 362
>gi|2463584|dbj|BAA22544.1| FBSB precursor [Ananas comosus]
Length = 356
Score = 274 bits (701), Expect = 4e-71, Method: Compositional matrix adjust.
Identities = 144/353 (40%), Positives = 212/353 (60%), Gaps = 23/353 (6%)
Query: 1 MALSSQFKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYE 60
MA Q + + C+ + S+ + D +D ++ FE WM ++ +VY+
Sbjct: 1 MAWKVQVVFLFLFLCVMWASPSAASAD-----------EPSDPMMKRFEEWMVEYGRVYK 49
Query: 61 SLDEKLERFEIFKDNLRHIDETNRKIKN-YWLGLNEFADLRHEEFKEMFLG--LKP-DLA 116
DEK+ RF+IFK+N+ HI+ N + +N Y LG+N+F D+ + EF + G +P ++
Sbjct: 50 DNDEKMRRFQIFKNNVNHIETFNSRNENSYTLGINQFTDMTNNEFIAQYTGGISRPLNIE 109
Query: 117 RRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTG 176
R S +D D+ +P+S+DWR GAVT VKNQ CG+CWAF+ +A VE I +I G
Sbjct: 110 REPVVSFDDV---DISAVPQSIDWRDYGAVTSVKNQNPCGACWAFAAIATVESIYKIKKG 166
Query: 177 NLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGE 236
L LSEQ+++DC Y GC GG AF++I+S G+ YPY +GTC+ T G
Sbjct: 167 ILEPLSEQQVLDCAKGY--GCKGGWEFRAFEFIISNKGVASGAIYPYKAAKGTCK-TNGV 223
Query: 237 SEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGV 296
I GY VP+N+E S++ A++ QP++VA++A+ +FQ+Y GV++G CGT L+H V
Sbjct: 224 PNSAYITGYARVPRNNESSMMYAVSKQPITVAVDANA-NFQYYKSGVFNGPCGTSLNHAV 282
Query: 297 AAVGYGS-TRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
A+GYG + G Y IVKNSWG +WGE GYIRM R+ G+CGI + YP
Sbjct: 283 TAIGYGQDSNGKKYWIVKNSWGARWGEAGYIRMARDVSSSSGICGIAIDSLYP 335
>gi|334904467|gb|AEH26024.1| cysteine peptidase [Ananas comosus]
Length = 352
Score = 274 bits (701), Expect = 4e-71, Method: Compositional matrix adjust.
Identities = 143/353 (40%), Positives = 209/353 (59%), Gaps = 23/353 (6%)
Query: 1 MALSSQFKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYE 60
MA Q + + C+ + S+ +RD +D ++ FE WM+++ +VY+
Sbjct: 1 MASKVQLVFLFLFLCVMWASPSAASRD-----------EPSDPMMKRFEEWMAEYGRVYK 49
Query: 61 SLDEKLERFEIFKDNLRHIDETNRKIKN-YWLGLNEFADLRHEEFKEMFLG--LKP-DLA 116
DEK+ RF+IFK+N+ HI+ N N Y LG+N+F D+ EF + G +P ++
Sbjct: 50 DNDEKMRRFQIFKNNVNHIETFNSHNGNSYTLGINQFTDMTKSEFVAQYTGGISRPLNIE 109
Query: 117 RRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTG 176
R S +D ++ +P+S+DWR GAV VKNQ CGSCWAF+ +A VEGI +I TG
Sbjct: 110 REPVVSFDDV---NISAVPQSIDWRDYGAVNEVKNQNPCGSCWAFAAIATVEGIYKIKTG 166
Query: 177 NLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGE 236
L SLSEQE++DC +Y GC GG ++ A+ +I+S G+ EE+YPY +GTC
Sbjct: 167 YLVSLSEQEVLDCAVSY--GCKGGWVNKAYDFIISNNGVTTEENYPYQAYQGTCNANSFP 224
Query: 237 SEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGV 296
+ I GY V +N E S++ A++NQP++ I+AS +FQ+Y+GGV+ G CGT L+H +
Sbjct: 225 NSAY-ITGYSYVRRNDERSMMYAVSNQPIAALIDAS-ENFQYYNGGVFSGPCGTSLNHAI 282
Query: 297 AAVGYGS-TRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
+GYG + G Y IV+NSWG WGE GY+RM R G CGI +P
Sbjct: 283 TIIGYGQDSSGTKYWIVRNSWGSSWGEGGYVRMARGVSSSSGACGIAMSPLFP 335
>gi|303283194|ref|XP_003060888.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226457239|gb|EEH54538.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 422
Score = 274 bits (700), Expect = 5e-71, Method: Compositional matrix adjust.
Identities = 152/319 (47%), Positives = 198/319 (62%), Gaps = 20/319 (6%)
Query: 48 FESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRK----IKNYWLGLNEFADLRHEE 103
F+ W++ K Y E+ +R IF DN + N K++WL LN ADL EE
Sbjct: 70 FDRWLATHGKAYACPKERAKRLAIFADNAEFVRVHNEAHAAGKKSHWLRLNHLADLTREE 129
Query: 104 FKEMFLGLKPDLARRKDQSHE------DFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGS 157
FK M L D ++++ +S ++ Y DV P+++DW +GAVT VKNQG CGS
Sbjct: 130 FKHM---LGYDASKKRVESSSPPVDAANWEYADVTP-PETMDWVSRGAVTPVKNQGQCGS 185
Query: 158 CWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIVSTGGLH 216
CWAFSTV AVEG+ + TG+L SLSEQEL+ C NNGC GGLMD F++IV G+
Sbjct: 186 CWAFSTVGAVEGVVAVKTGDLISLSEQELVSCAKIGGNNGCKGGLMDNGFEWIVENRGVD 245
Query: 217 KEEDYPYIMEEGTCE-MTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRD 275
EED+ Y+ ++ C K ++ +I+G+ DVP+N ED+L KA++ QP++VAIEA R+
Sbjct: 246 DEEDWGYLAKDRRCNWFKKRRAKAASIDGFKDVPRNDEDALKKAVSQQPVAVAIEADHRE 305
Query: 276 FQFYSGGVYDGHCGTQLDHGVAAVGYG----STRGLDYIIVKNSWGPKWGEKGYIRMKRN 331
FQ YSGGV+DG CGT LDHGV VGYG S Y VKNSWG KWGE+GYIR+ R
Sbjct: 306 FQLYSGGVFDGECGTNLDHGVLVVGYGYDGESAGHKHYWTVKNSWGAKWGEEGYIRIARG 365
Query: 332 TGKPEGLCGINKMASYPIK 350
P G CG+ ASYP K
Sbjct: 366 GMGPAGQCGVAMQASYPTK 384
>gi|119433808|gb|ABL74967.1| cysteine protease [Acanthamoeba castellanii]
Length = 330
Score = 274 bits (700), Expect = 5e-71, Method: Compositional matrix adjust.
Identities = 148/311 (47%), Positives = 191/311 (61%), Gaps = 8/311 (2%)
Query: 42 DKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRH 101
D L +F WM K Y S +E + R+ ++++N I E NRK +Y+L +N+F DL +
Sbjct: 24 DPLTGVFADWMRTHTKSY-SNEEFVFRWNVWRENYNFIQEENRKNNSYYLTMNKFGDLTN 82
Query: 102 EEFKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAF 161
EF +++ GL D + ++ LP + DWR+KGAVTHVKNQG CGSCW+F
Sbjct: 83 AEFNKVYKGLAFDYSAHILKAKAATPAAPAPGLPANFDWRQKGAVTHVKNQGQCGSCWSF 142
Query: 162 STVAAVEGINQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIVSTGGLHKEED 220
ST + EG N + G L SLSEQ LIDC +Y NNGCNGGLMDYAF+YI++ G+ E
Sbjct: 143 STTGSTEGANFLKRGTLVSLSEQNLIDCSGSYGNNGCNGGLMDYAFEYIINNKGIDTEAS 202
Query: 221 YPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYS 280
YPY + C S ++ Y DV E++LL A+A +P SVAI+AS FQFYS
Sbjct: 203 YPYETAQYNCRYNPANSG-GSLTSYTDVSSGDENALLNAVAIEPTSVAIDASHNSFQFYS 261
Query: 281 GGV-YDGHC-GTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGL 338
GGV Y+ C TQLDHGV AVG+G+ G DY +VKNSWG WG +GYI+M RN
Sbjct: 262 GGVYYESSCSSTQLDHGVLAVGWGTENGQDYWLVKNSWGADWGLQGYIKMARN---RHNN 318
Query: 339 CGINKMASYPI 349
CGI ASYP
Sbjct: 319 CGIATAASYPT 329
>gi|449530091|ref|XP_004172030.1| PREDICTED: vignain-like [Cucumis sativus]
Length = 351
Score = 273 bits (698), Expect = 9e-71, Method: Compositional matrix adjust.
Identities = 133/324 (41%), Positives = 207/324 (63%), Gaps = 11/324 (3%)
Query: 36 EDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNE 95
+D S L+ L++ W S ++ + +E RF++FK+N +H+ + N K+ L LN+
Sbjct: 29 KDFESEKSLMQLYKRW-SSHHRISRNANEMHNRFKVFKNNAKHVFKVNLMGKSLKLKLNQ 87
Query: 96 FADLRHEEFKEMF---LGLKPDLARRKDQSHED----FSYKDVVDLPKSVDWRKKGAVTH 148
FAD+ +EF+ M+ + DL +K ++ F Y+ ++P S+DWRKKGAV
Sbjct: 88 FADMSDDEFRNMYSSNITYYKDLHAKKIEATGGRIGGFMYEHANNIPSSIDWRKKGAVNA 147
Query: 149 VKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQY 208
+KNQG CGSCWAF+ VAAVE I+QI T L SLSE+E++DCD + GC GG + AF++
Sbjct: 148 IKNQGRCGSCWAFAAVAAVESIHQIKTNELVSLSEEEVLDCDYR-DGGCRGGFYNSAFEF 206
Query: 209 IVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVA 268
++ G+ E++YPY G C G ++ V I+GY +VP+N+E +L+KA+A+QP++VA
Sbjct: 207 MMDNDGVTIEDNYPYYEGNGYCRRRGGRNKRVRIDGYENVPRNNEYALMKAVAHQPVAVA 266
Query: 269 IEASGRDFQFYSGGVY--DGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYI 326
I + G DF+FY GG++ + CG +DH V VGYG+ DY I++N +G +WG GY+
Sbjct: 267 IASGGSDFKFYGGGMFTENDFCGFNIDHTVVVVGYGTDEDGDYWIIRNQYGHRWGMNGYM 326
Query: 327 RMKRNTGKPEGLCGINKMASYPIK 350
+M+R P+G+CG+ +YP+K
Sbjct: 327 KMQRGAHSPQGVCGMAMQPAYPVK 350
>gi|330805277|ref|XP_003290611.1| hypothetical protein DICPUDRAFT_81345 [Dictyostelium purpureum]
gi|325079250|gb|EGC32859.1| hypothetical protein DICPUDRAFT_81345 [Dictyostelium purpureum]
Length = 330
Score = 273 bits (698), Expect = 9e-71, Method: Compositional matrix adjust.
Identities = 148/307 (48%), Positives = 196/307 (63%), Gaps = 16/307 (5%)
Query: 48 FESWMSKFEKVYESLDEKLERFEIFKDNLRHIDE--TNRKIKNYWLGLNEFADLRHEEFK 105
F WM K ++ Y E +++ FKDN+ I TN+ K LGL +FADL +EE++
Sbjct: 33 FLGWMKKHDRSYHH-HEFNNKYQAFKDNMDFIHNWNTNKNSKTV-LGLTQFADLTNEEYR 90
Query: 106 EMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVA 165
+++LG K ++A K +F+ P S+DWR KGAV+HVK+QG CGSCW+FST
Sbjct: 91 KIYLGTKVNVAPEK----HNFNMIHFTG-PDSIDWRTKGAVSHVKDQGQCGSCWSFSTTG 145
Query: 166 AVEGINQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIVSTGGLHKEEDYPYI 224
+VEG +QI TGN+ +LSEQ L+DC + NNGC+GGLM AF++I+S GG+ E+ YPY
Sbjct: 146 SVEGAHQIKTGNMVTLSEQNLVDCSGKFGNNGCDGGLMVNAFKFIMSQGGVATEDSYPYN 205
Query: 225 MEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVY 284
+G C+ TK I+GY ++ Q SE L AL QP+S+AI+AS + FQ Y GVY
Sbjct: 206 AVQGKCKFTKSMVG-ANISGYKEITQGSELELQAALTKQPVSIAIDASQQSFQLYKSGVY 264
Query: 285 D-GHCGT-QLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGIN 342
D C + QLDHGV AVGYG+ G DY IVKNSW WG+ GYI M RN + CG+
Sbjct: 265 DEPECSSYQLDHGVLAVGYGTENGKDYYIVKNSWADSWGQDGYIFMSRN---AKNQCGVA 321
Query: 343 KMASYPI 349
MASYPI
Sbjct: 322 TMASYPI 328
>gi|255635645|gb|ACU18172.1| unknown [Glycine max]
Length = 355
Score = 273 bits (698), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 144/315 (45%), Positives = 203/315 (64%), Gaps = 9/315 (2%)
Query: 40 SNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADL 99
++D+++ +FE W+ K +KVY +L EK +RF+IFK+NLR IDE N + Y LGLN FADL
Sbjct: 37 TDDEVMSMFEEWLVKHDKVYNALGEKEKRFQIFKNNLRFIDERNSLNRTYKLGLNVFADL 96
Query: 100 RHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVV--DLPKSVDWRKKGAVTHVKNQG-SCG 156
+ E++ M+L D R + Y V +PKSVDWRK+GAVT VKNQG +C
Sbjct: 97 TNAEYRAMYLRTWDDGPRLDLDTPPRNRYVPRVGDTIPKSVDWRKEGAVTPVKNQGATCN 156
Query: 157 SCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLH 216
SCWAF+ V AVE + +I TG+L SLSEQE++DC + + GC GG + + + YI G+
Sbjct: 157 SCWAFTAVGAVESLVKIKTGDLISLSEQEVVDCTTSSSRGCGGGDIQHGYIYI-RKNGIS 215
Query: 217 KEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDF 276
E+DYPY +EG C+ K ++ +VTI+G+ VP E++L + +ANQP++V I A +F
Sbjct: 216 LEKDYPYRGDEGKCDSNK-KNAIVTIDGHGWVPTQLEEALKQGIANQPVAVPIPADDYEF 274
Query: 277 QFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPE 336
Q+Y+ GV+ G CGT+L+H + VGYG+ + DY I KNS+ KWGE GYIR++R
Sbjct: 275 QYYTSGVFKGKCGTELNHALLLVGYGAEKDGDYWIAKNSYSDKWGENGYIRIQRKLST-- 332
Query: 337 GLCGINKMASYPIKK 351
C YPI K
Sbjct: 333 --CKFGNGGYYPIIK 345
>gi|302790828|ref|XP_002977181.1| hypothetical protein SELMODRAFT_106402 [Selaginella moellendorffii]
gi|300155157|gb|EFJ21790.1| hypothetical protein SELMODRAFT_106402 [Selaginella moellendorffii]
Length = 337
Score = 273 bits (697), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 140/330 (42%), Positives = 203/330 (61%), Gaps = 17/330 (5%)
Query: 30 IVGYSPEDLTSNDKLID--------LFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDE 81
+VG +P + L D +FE W +K K Y S EK R IF D L +I++
Sbjct: 11 VVGATPFAIARPAALEDGRALEIKNMFEDWAAKHGKSYSSDWEKARRLMIFSDTLAYIEK 70
Query: 82 TNRKIKN-YWLGLNEFADLRHEEFKEMFLGL--KPDLARRKDQSHEDFSYKDVVDLPKSV 138
N + + LGLN+F+DL + EF+ M +G +P R ED DV LP S+
Sbjct: 71 HNAQPNTTFTLGLNKFSDLTNAEFRAMHVGKFKRPRYQDRLPAEDEDV---DVSSLPTSL 127
Query: 139 DWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCN 198
DWR+KGAVT +K+QG CGSCWAFS +A++E + + T L SLSEQ+L+DCD T + GC+
Sbjct: 128 DWRQKGAVTPIKDQGDCGSCWAFSAIASIESAHFLATKELVSLSEQQLMDCD-TVDAGCD 186
Query: 199 GGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLK 258
GGLM+ AF+++V GG+ E YPY G+C K +++V I G+ V ++S D+L+K
Sbjct: 187 GGLMETAFKFVVKNGGVTTEAAYPYTGSVGSCNANKAKNKVAEITGFKVVTEDSADALMK 246
Query: 259 ALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGP 318
A++ P++V+I S +FQ Y G+ G C LDHGV +GYG+ G+ Y I+KNSWG
Sbjct: 247 AVSKTPVTVSICGSDENFQNYKSGILSGKCDDSLDHGVLLIGYGTEGGMPYWIIKNSWGT 306
Query: 319 KWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
WGE G+++++R G +G+CG+N +SYP
Sbjct: 307 SWGEDGFMKIERKDG--DGMCGMNGDSSYP 334
>gi|18202415|sp|P82474.1|CPGP2_ZINOF RecName: Full=Zingipain-2; AltName: Full=Cysteine proteinase GP-II
gi|6137410|pdb|1CQD|A Chain A, The 2.1 Angstrom Structure Of A Cysteine Protease With
Proline Specificity From Ginger Rhizome, Zingiber
Officinale
gi|6137411|pdb|1CQD|B Chain B, The 2.1 Angstrom Structure Of A Cysteine Protease With
Proline Specificity From Ginger Rhizome, Zingiber
Officinale
gi|6137412|pdb|1CQD|C Chain C, The 2.1 Angstrom Structure Of A Cysteine Protease With
Proline Specificity From Ginger Rhizome, Zingiber
Officinale
gi|6137413|pdb|1CQD|D Chain D, The 2.1 Angstrom Structure Of A Cysteine Protease With
Proline Specificity From Ginger Rhizome, Zingiber
Officinale
Length = 221
Score = 273 bits (697), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 128/219 (58%), Positives = 162/219 (73%), Gaps = 2/219 (0%)
Query: 133 DLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNT 192
DLP S+DWR+ GAV VKNQG CGSCWAFSTVAAVEGINQIVTG+L SLSEQ+L+DC T
Sbjct: 2 DLPDSIDWRENGAVVPVKNQGGCGSCWAFSTVAAVEGINQIVTGDLISLSEQQLVDC-TT 60
Query: 193 YNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNS 252
N+GC GG M+ AFQ+IV+ GG++ EE YPY ++G C T + VV+I+ Y +VP ++
Sbjct: 61 ANHGCRGGWMNPAFQFIVNNGGINSEETYPYRGQDGICNSTV-NAPVVSIDSYENVPSHN 119
Query: 253 EDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIV 312
E SL KA+ANQP+SV ++A+GRDFQ Y G++ G C +H + VGYG+ D+ IV
Sbjct: 120 EQSLQKAVANQPVSVTMDAAGRDFQLYRSGIFTGSCNISANHALTVVGYGTENDKDFWIV 179
Query: 313 KNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIKK 351
KNSWG WGE GYIR +RN P+G CGI + ASYP+KK
Sbjct: 180 KNSWGKNWGESGYIRAERNIENPDGKCGITRFASYPVKK 218
>gi|302779822|ref|XP_002971686.1| hypothetical protein SELMODRAFT_16221 [Selaginella moellendorffii]
gi|300160818|gb|EFJ27435.1| hypothetical protein SELMODRAFT_16221 [Selaginella moellendorffii]
Length = 214
Score = 273 bits (697), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 127/215 (59%), Positives = 158/215 (73%), Gaps = 2/215 (0%)
Query: 137 SVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNG 196
SVDWRKKG VT +K+QG CG+CWAFS +AAVEG+ + TG L SLSEQEL+DCD T N G
Sbjct: 1 SVDWRKKGGVTEIKDQGDCGNCWAFSAIAAVEGLTFLSTGTLVSLSEQELVDCDTTVNQG 60
Query: 197 CNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSL 256
C+GG+MDYAFQY++ GG+ + +YPY + G C+ K + TING+ +P SE+ L
Sbjct: 61 CDGGMMDYAFQYMIRNGGITSQSNYPYRAQRGACDKDKVKYHAATINGFQAIPPQSEELL 120
Query: 257 LKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGS-TRGLDYIIVKNS 315
L+A+ANQP+SVAIEA G+DFQ YS GV+ G CG+ LDHGVA VGYG+ G Y +VKNS
Sbjct: 121 LRAVANQPVSVAIEAGGQDFQLYSSGVFTGECGSNLDHGVAIVGYGTDAGGRQYWLVKNS 180
Query: 316 WGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
WG WGE GY+RM+R G G+CGIN ASYP K
Sbjct: 181 WGSGWGESGYVRMERQ-GPGAGVCGINLDASYPTK 214
>gi|260516654|gb|ACX43954.1| cysteine protease 1 [Brachiaria hybrid cultivar]
gi|260516656|gb|ACX43955.1| cysteine protease 1 [Brachiaria hybrid cultivar]
gi|260516658|gb|ACX43956.1| cysteine protease 1 [Brachiaria hybrid cultivar]
gi|260516660|gb|ACX43957.1| cysteine protease 1 [Brachiaria hybrid cultivar]
gi|260516662|gb|ACX43958.1| cysteine protease 2 [Brachiaria hybrid cultivar]
gi|260516664|gb|ACX43959.1| cysteine protease 2 [Brachiaria hybrid cultivar]
gi|260516666|gb|ACX43960.1| cysteine protease 2 [Brachiaria hybrid cultivar]
gi|260516668|gb|ACX43961.1| cysteine protease 2 [Brachiaria hybrid cultivar]
gi|260516670|gb|ACX43962.1| cysteine protease 2 [Brachiaria hybrid cultivar]
Length = 338
Score = 272 bits (696), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 159/332 (47%), Positives = 199/332 (59%), Gaps = 23/332 (6%)
Query: 25 ARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNR 84
AR F +S E++ S L D+F ++M ++ K Y S E RF FK N+ I N
Sbjct: 20 ARQFQSALFS-EEVPSEVMLQDMFTAFMKQYSKAY-SHAEFSSRFNQFKANVETIRLHNT 77
Query: 85 KIK-NYWLGLNEFADLRHEEFKEMFLGLK---PDLARRKDQSHEDFSYKDVVDLPKSVDW 140
+Y +GLNEFADL EEFK + G K + AR + +++V P S+DW
Sbjct: 78 LANASYTMGLNEFADLSFEEFKGKYFGYKHVEREFARSNNL------HQEVEAAPTSIDW 131
Query: 141 RKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTG--NLASLSEQELIDCDNTYNN-GC 197
R AVT +K+QG CGSCWAFS ++EG ++ G L SLSEQ+L+DC +Y N GC
Sbjct: 132 RTSNAVTPIKDQGQCGSCWAFSATGSIEGA-WVLQGKHTLTSLSEQQLVDCSTSYGNAGC 190
Query: 198 NGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLL 257
NGGLMDYAF+YI++ G+ E YPY G C+ K ++VVTI+GY DV E SLL
Sbjct: 191 NGGLMDYAFEYIIANKGICAESAYPYKGVGGLCQ--KSCTKVVTISGYKDVASGDEASLL 248
Query: 258 KALAN-QPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSW 316
A+ P+SVAIEA FQFYS GV+ G CG LDHGV AVGYG+T DY IVKNSW
Sbjct: 249 NAVGTVGPVSVAIEADQAGFQFYSSGVFSGTCGHNLDHGVLAVGYGTTGSQDYWIVKNSW 308
Query: 317 GPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
G WGE GYIRM RN + CGI SYP
Sbjct: 309 GTSWGESGYIRMIRNKNQ----CGIAIQPSYP 336
>gi|194701748|gb|ACF84958.1| unknown [Zea mays]
gi|414589103|tpg|DAA39674.1| TPA: thiol protease SEN102 [Zea mays]
Length = 374
Score = 272 bits (696), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 160/374 (42%), Positives = 208/374 (55%), Gaps = 26/374 (6%)
Query: 1 MALSSQFKTILISFCISFFIRS-SFARDFSIVGYSPEDLTSNDK-LIDLFESWMSKFEKV 58
MA SS+ + ++ F S AR G ++++D +I+ F+ W + + K
Sbjct: 1 MASSSKGSLPCVLLLLAVFHHGCSSARAHRRAGDMERSMSTDDSSMIERFQRWKAAYNKS 60
Query: 59 YESLDEKLERFEIFKDNLRHIDETNRKIK----NYWLGLNEFADLRHEEFKEMFLGLKP- 113
Y ++ E+ RF + N+ +I+ TN + + Y LG + DL ++EF M+ P
Sbjct: 61 YATVAEERRRFRVCARNMAYIEATNAEAEAAGLTYELGETAYTDLTNQEFMAMYTAPAPA 120
Query: 114 DLARRKDQSHEDFSYKDVV---------------DLPKSVDWRKKGAVTHVKNQGSCGSC 158
L + D V P SVDWR GAVT VKNQG CGSC
Sbjct: 121 QLPADESVITTRAGPVDAVGGAPGQLPVYVNLSTSAPASVDWRASGAVTPVKNQGRCGSC 180
Query: 159 WAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKE 218
WAFSTVA VEGI QI TG L SLSEQEL+DCD T ++GC+GG+ A ++I S GG+ E
Sbjct: 181 WAFSTVAVVEGIYQIRTGKLVSLSEQELVDCD-TLDDGCDGGISYRALRWIASNGGITTE 239
Query: 219 EDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQF 278
DYPY C K V+I G V SE SL A+A QP++V+IEA G +FQ
Sbjct: 240 TDYPYTGTTDACNRAKLSHNAVSIAGLRRVATRSEASLANAVAGQPVAVSIEAGGDNFQH 299
Query: 279 YSGGVYDGHCGTQLDHGVAAVGYG--STRGLDYIIVKNSWGPKWGEKGYIRMKRN-TGKP 335
Y GVY+G CGT L+HGV VGYG + G Y IVKNSWG WG+ GYIRMK++ GKP
Sbjct: 300 YKKGVYNGPCGTNLNHGVTVVGYGQEAAGGDRYWIVKNSWGQGWGDDGYIRMKKDVAGKP 359
Query: 336 EGLCGINKMASYPI 349
EGLCGI SYP+
Sbjct: 360 EGLCGIAIRPSYPL 373
>gi|167526493|ref|XP_001747580.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163774026|gb|EDQ87660.1| predicted protein [Monosiga brevicollis MX1]
Length = 330
Score = 272 bits (696), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 138/285 (48%), Positives = 180/285 (63%), Gaps = 14/285 (4%)
Query: 69 FEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSY 128
F NLR I+ N ++ +G+ +FADL EF ++ R +++
Sbjct: 48 FRCHLANLRVIEAHNAGNSSFTMGITQFADLTAAEFSAYVKRFPMNVTRPRNEVW----- 102
Query: 129 KDVVDLP-KSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELI 187
+ + P + VDWR+K AVT +KNQG CGSCW+FST +VEG + I TG L SLSEQ+L+
Sbjct: 103 --ITEAPLQEVDWRQKNAVTEIKNQGQCGSCWSFSTTGSVEGAHAIATGKLVSLSEQQLM 160
Query: 188 DCDNTY-NNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYH 246
DC Y N+GCNGGLMDYAF+Y+++ GGL EEDYPY E+G C K + I+G+
Sbjct: 161 DCSTRYGNHGCNGGLMDYAFEYVIANGGLDTEEDYPYTAEDGKCNTEKEKKHAAEIHGFR 220
Query: 247 DVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRG 306
+VP+ ED L A++ P+SVAIEA FQ Y+ GV+DG CGT LDHGV VGY
Sbjct: 221 NVPKEHEDQLAAAVSIGPVSVAIEADQAGFQHYTSGVFDGKCGTSLDHGVLVVGYSD--- 277
Query: 307 LDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIKK 351
DY IVKNSWG WGE+GYIR+KR K +G+CGI ASYP K+
Sbjct: 278 -DYWIVKNSWGKSWGEEGYIRLKRGVDK-KGMCGITMQASYPEKR 320
>gi|414875906|tpg|DAA53037.1| TPA: hypothetical protein ZEAMMB73_586844 [Zea mays]
Length = 1039
Score = 272 bits (696), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 123/196 (62%), Positives = 151/196 (77%)
Query: 156 GSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGL 215
GSCWAFST+AAVEGINQIVTG+L SLSEQEL+DCD +YN GCNGGLMDYAF++I++ GG+
Sbjct: 713 GSCWAFSTIAAVEGINQIVTGDLISLSEQELVDCDTSYNQGCNGGLMDYAFEFIINNGGI 772
Query: 216 HKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRD 275
E+DYPY +G C++ + ++VVTI+ Y DVP N E SL KA+ANQP+SVAIEA+G
Sbjct: 773 DTEKDYPYKGTDGRCDVNRKNAKVVTIDSYEDVPANDEKSLQKAVANQPVSVAIEAAGTT 832
Query: 276 FQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKP 335
FQ YS G++ G CGT LDHGV VGYG+ G DY I+KNSWG WGE GY+RM+RN
Sbjct: 833 FQLYSSGIFTGSCGTALDHGVTVVGYGTENGKDYWIMKNSWGSSWGESGYVRMERNIKAS 892
Query: 336 EGLCGINKMASYPIKK 351
G CGI SYP+K+
Sbjct: 893 SGKCGIAVEPSYPLKE 908
>gi|356560855|ref|XP_003548702.1| PREDICTED: P34 probable thiol protease-like [Glycine max]
Length = 357
Score = 272 bits (696), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 160/361 (44%), Positives = 212/361 (58%), Gaps = 20/361 (5%)
Query: 5 SQFKTILISF--CISFFIRSS---FARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVY 59
SQ + I F CI+ SS F +SI+G + + L S D+ I LF+ W + VY
Sbjct: 4 SQLSKLFIFFFICITLICFSSSSNFPVQYSILGPNLDKLPSQDETIQLFQLWRKEHGLVY 63
Query: 60 ESLDEKLERFEIFKDNLRHIDETNRKIKN---YWLGLNEFADLRHEEFKEMFLGLKPDLA 116
+ L E +RFEIF NL +I E N K + Y LGLN FAD EF+E++L L
Sbjct: 64 KDLKEMAKRFEIFLSNLNYIIEFNAKRSSPSGYLLGLNNFADWSPSEFQEIYLH---SLD 120
Query: 117 RRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTG 176
D + + P S+DWR K AVT +KNQGSCGSCWAFS A+EGI+ I TG
Sbjct: 121 MPTDSAPKLNGPLLSCIAPASLDWRNKVAVTAIKNQGSCGSCWAFSAAGAIEGIHAITTG 180
Query: 177 NLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEE-GTCEMTKG 235
L SLSEQEL++CD + GCNGG ++ AF +++S GG+ E +YPY ++ G C K
Sbjct: 181 ELISLSEQELVNCDRV-SKGCNGGWVNKAFDWVISNGGITLEAEYPYTGKDGGNCNSDKQ 239
Query: 236 ESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDG-HCGTQ--- 291
TI+GY V Q S++ LL ++ QP+S+ + A+ DFQ Y G++DG C +
Sbjct: 240 VPIKATIDGYEQVEQ-SDNGLLCSIVKQPISICLNAT--DFQLYESGIFDGQQCSSSSKY 296
Query: 292 LDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIKK 351
+H V VGY S+ G DY IVKNSWG KWG GYI +KRNTG P G+CG+N A P +
Sbjct: 297 TNHCVLIVGYDSSNGEDYWIVKNSWGTKWGINGYIWIKRNTGLPYGVCGMNAWAYNPTIR 356
Query: 352 K 352
K
Sbjct: 357 K 357
>gi|13432122|sp|P80884.2|ANAN_ANACO RecName: Full=Ananain; Flags: Precursor
gi|2623956|emb|CAA05487.1| Ananain precursor [Ananas comosus]
Length = 345
Score = 272 bits (695), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 135/310 (43%), Positives = 199/310 (64%), Gaps = 7/310 (2%)
Query: 41 NDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDE-TNRKIKNYWLGLNEFADL 99
+D ++ FE WM+++ +VY+ DEK+ RF+IFK+N+ HI+ NR +Y LG+N+F D+
Sbjct: 30 SDPMMKQFEEWMAEYGRVYKDNDEKMLRFQIFKNNVNHIETFNNRNGNSYTLGINQFTDM 89
Query: 100 RHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCW 159
+ EF + GL L +++ F D+ +P+S+DWR GAVT VKNQG CGSCW
Sbjct: 90 TNNEFVAQYTGLSLPLNIKREPV-VSFDDVDISSVPQSIDWRDSGAVTSVKNQGRCGSCW 148
Query: 160 AFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEE 219
AF+++A VE I +I GNL SLSEQ+++DC +Y GC GG ++ A+ +I+S G+
Sbjct: 149 AFASIATVESIYKIKRGNLVSLSEQQVLDCAVSY--GCKGGWINKAYSFIISNKGVASAA 206
Query: 220 DYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFY 279
YPY +GTC+ T G I Y V +N+E +++ A++NQP++ A++ASG +FQ Y
Sbjct: 207 IYPYKAAKGTCK-TNGVPNSAYITRYTYVQRNNERNMMYAVSNQPIAAALDASG-NFQHY 264
Query: 280 SGGVYDGHCGTQLDHGVAAVGYGS-TRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGL 338
GV+ G CGT+L+H + +GYG + G + IV+NSWG WGE GYIR+ R+ GL
Sbjct: 265 KRGVFTGPCGTRLNHAIVIIGYGQDSSGKKFWIVRNSWGAGWGEGGYIRLARDVSSSFGL 324
Query: 339 CGINKMASYP 348
CGI YP
Sbjct: 325 CGIAMDPLYP 334
>gi|3377948|emb|CAA08860.1| cysteine proteinase precursor, AN8 [Ananas comosus]
Length = 356
Score = 272 bits (695), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 143/353 (40%), Positives = 211/353 (59%), Gaps = 23/353 (6%)
Query: 1 MALSSQFKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYE 60
MA Q + + C+ + S+ + D +D ++ FE WM ++ +VY+
Sbjct: 1 MAWKVQLVFLFLFLCVMWASPSAASAD-----------EPSDPMMKRFEEWMVEYGRVYK 49
Query: 61 SLDEKLERFEIFKDNLRHIDETNRKIKN-YWLGLNEFADLRHEEFKEMFLG--LKP-DLA 116
DEK+ RF+IFK+N+ HI+ N + K+ Y LG+N+F D+ + EF + G +P ++
Sbjct: 50 DNDEKMRRFQIFKNNVNHIETFNSRNKDSYTLGINQFTDMTNNEFVAQYTGGISRPLNIE 109
Query: 117 RRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTG 176
R S +D D+ +P+S+DWR GAVT VKNQ CG+CWAF+ +A VE I +I G
Sbjct: 110 REPVVSFDDV---DISAVPQSIDWRDYGAVTSVKNQNPCGACWAFAAIATVESIYKIKKG 166
Query: 177 NLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGE 236
L LSEQ+++DC Y GC GG AF++I+S G+ YPY +GTC+ T G
Sbjct: 167 ILEPLSEQQVLDCAKGY--GCKGGWEFRAFEFIISNKGVASVAIYPYKAAKGTCK-TNGV 223
Query: 237 SEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGV 296
I GY VP+N+E S++ A++ QP++VA++A+ Q+Y+ GV++G CGT L+H V
Sbjct: 224 PNSAYITGYARVPRNNESSMMYAVSKQPITVAVDANANS-QYYNSGVFNGPCGTSLNHAV 282
Query: 297 AAVGYGS-TRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
A+GYG + G Y IVKNSWG +WGE GYIRM R+ G+CGI + YP
Sbjct: 283 TAIGYGQDSNGKKYWIVKNSWGARWGEAGYIRMARDVSSSSGICGIAIDSLYP 335
>gi|21070926|gb|AAM34401.1|AF377947_7 putative cysteine proteinase [Oryza sativa Japonica Group]
gi|31712050|gb|AAP68356.1| putative cysteine protease [Oryza sativa Japonica Group]
gi|40538988|gb|AAR87245.1| putative cysteine protease [Oryza sativa Japonica Group]
gi|108711126|gb|ABF98921.1| Papain family cysteine protease containing protein, expressed
[Oryza sativa Japonica Group]
gi|125545747|gb|EAY91886.1| hypothetical protein OsI_13535 [Oryza sativa Indica Group]
Length = 350
Score = 271 bits (694), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 148/313 (47%), Positives = 189/313 (60%), Gaps = 17/313 (5%)
Query: 49 ESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIK-----NYWLGLNEFADLRHEE 103
E WM+K K Y+ +EK R E+F+ N + ID N + + L N FADL +E
Sbjct: 43 EKWMAKHGKTYKDEEEKARRLEVFRANAKLIDSFNAAAEKDGGGGHRLATNRFADLTDDE 102
Query: 104 FKEMFLGL-KPDLARRKDQS---HEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCW 159
F+ G +P A +E+FS + P+S+DWR GAVT VK+QGSCG CW
Sbjct: 103 FRAARTGYQRPPAAVAGAGGGFLYENFS---LAAAPQSMDWRAMGAVTGVKDQGSCGCCW 159
Query: 160 AFSTVAAVEGINQIVTGNLASLSEQELIDCD-NTYNNGCNGGLMDYAFQYIVSTGGLHKE 218
AFS VAAVEG+ +I TG L SLSEQEL+DCD + GC GGLMD AFQYI GGL E
Sbjct: 160 AFSAVAAVEGLAKIRTGQLVSLSEQELVDCDVRGEDQGCEGGLMDTAFQYIARRGGLAAE 219
Query: 219 EDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQF 278
YPY + +I G+ DVP N E +L+ A+A QP+SVAI +G F+F
Sbjct: 220 SSYPYRGVD-GACRAAAGRAAASIRGFQDVPSNDEGALMAAVARQPVSVAINGAGYVFRF 278
Query: 279 YSGGVYDGH-CGTQLDHGVAAVGYGSTR-GLDYIIVKNSWGPKWGEKGYIRMKRNTGKPE 336
Y GV G CGT+L+H V AVGYG+ G Y ++KNSWG WGE GY+R++R G+ E
Sbjct: 279 YDRGVLGGAGCGTELNHAVTAVGYGTASDGTGYWLMKNSWGASWGEGGYVRIRRGVGR-E 337
Query: 337 GLCGINKMASYPI 349
G CGI +MASYP+
Sbjct: 338 GACGIAQMASYPV 350
>gi|242038089|ref|XP_002466439.1| hypothetical protein SORBIDRAFT_01g007820 [Sorghum bicolor]
gi|241920293|gb|EER93437.1| hypothetical protein SORBIDRAFT_01g007820 [Sorghum bicolor]
Length = 353
Score = 271 bits (694), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 157/360 (43%), Positives = 211/360 (58%), Gaps = 18/360 (5%)
Query: 1 MALSSQFKTILI--SFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKV 58
M++S T+L+ S C + R+ R+ + E ++ E WM++ +
Sbjct: 1 MSVSRFVLTVLVVASVCTAAAPRALAVRELAG---EEESAAVAAAMVSRHEKWMAEHGRT 57
Query: 59 YESLDEKLERFEIFKDNLRHIDETNRKIKN-YWLGLNEFADLRHEEFKEMFLGLKPDLAR 117
Y EK R EIF+ N ID N K+ + L N FADL EEF+ G +P A
Sbjct: 58 YTDEAEKARRLEIFRANAEFIDSFNDAGKHSHRLATNRFADLTDEEFRAARTGFRPRPAP 117
Query: 118 RKDQS------HEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGIN 171
+E+FS + D +SVDWR GAVT VK+QG CG CWAFS VAAVEG+N
Sbjct: 118 AAAAGSGGRFRYENFS---LADAAQSVDWRAMGAVTGVKDQGECGCCWAFSAVAAVEGLN 174
Query: 172 QIVTGNLASLSEQELIDCD-NTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTC 230
+I TG L SLSEQEL+DCD N + GC GGLMD AFQ+I GGL E YPY ++G+C
Sbjct: 175 KIRTGRLVSLSEQELVDCDVNGEDQGCEGGLMDDAFQFIERRGGLASESGYPYQGDDGSC 234
Query: 231 EMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGT 290
+ + +I G+ DVP+N+E +L A+ANQP+SVAI F+FY GV G CGT
Sbjct: 235 RSSAAAARAASIRGHEDVPRNNEAALAAAVANQPVSVAINGEDYAFRFYDSGVLGGECGT 294
Query: 291 QLDHGVAAVGYGSTR-GLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPI 349
L+H + AVGYG+ G Y ++KNSWG WGE GY+R++R + EG+CG+ K+ SYP+
Sbjct: 295 DLNHAITAVGYGTAADGSKYWLMKNSWGTSWGEGGYVRIRRGV-RGEGVCGLAKLPSYPV 353
>gi|3377950|emb|CAA08861.1| cysteine proteinase precursor, AN11 [Ananas comosus]
Length = 357
Score = 271 bits (693), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 144/354 (40%), Positives = 209/354 (59%), Gaps = 24/354 (6%)
Query: 1 MALSSQFKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYE 60
MA Q + + C+ + S+ +RD +D ++ FE WM+++ +VY+
Sbjct: 1 MASKVQLVFLFLFLCVMWASPSAASRD-----------EPSDPMMKRFEEWMAEYGRVYK 49
Query: 61 SLDEKLERFEIFKDNLRHIDETNRKIKN-YWLGLNEFADLRHEEFKEMFLG--LKPDLAR 117
DEK+ RF+IFK+N+ HI+ N + N Y LG+N+F D+ + EF + G L ++ R
Sbjct: 50 DNDEKMRRFQIFKNNVNHIETFNSRNGNSYTLGINQFTDMTNNEFVAQYTGVSLPLNIER 109
Query: 118 RKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGN 177
S +D D+ +P+S+DWR GAVT VKN CGSCWAF+ +A VE I +I G
Sbjct: 110 EPVVSFDDV---DISAVPQSIDWRNYGAVTSVKNHIPCGSCWAFAAIATVESIYKIKRGY 166
Query: 178 LASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIME--EGTCEMTKG 235
L SLSEQ+++DC +Y GC+GG ++ A+ +I+S G+ YPY +GTC + G
Sbjct: 167 LISLSEQQVLDCAVSY--GCDGGWVNKAYDFIISNKGVASAAIYPYKASQGQGTCRI-NG 223
Query: 236 ESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHG 295
I GY V N+E S++ A++NQP++ +IEASG DFQ Y GV+ G CGT L+H
Sbjct: 224 VPNSAYITGYTRVQSNNERSMMYAVSNQPIAASIEASG-DFQHYKRGVFSGPCGTSLNHA 282
Query: 296 VAAVGYGS-TRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
+ +GYG + G + IV+NSWG WGE+GYIRM R+ GLCGI YP
Sbjct: 283 ITIIGYGQDSSGKKFWIVRNSWGASWGERGYIRMARDVSSSSGLCGIAIRPLYP 336
>gi|297851332|ref|XP_002893547.1| peptidase C1A papain family protein [Arabidopsis lyrata subsp.
lyrata]
gi|297339389|gb|EFH69806.1| peptidase C1A papain family protein [Arabidopsis lyrata subsp.
lyrata]
Length = 345
Score = 271 bits (693), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 142/335 (42%), Positives = 198/335 (59%), Gaps = 13/335 (3%)
Query: 24 FARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDE-T 82
F+ D I + + + WM F +VY+ EK R E+F +NL+ I+
Sbjct: 14 FSMDLKISEATSRVALHEPTIFYYHQKWMINFSRVYDDEFEKQMRLEVFTENLKFIENFN 73
Query: 83 NRKIKNYWLGLNEFADLRHEEFKEMFLGLK------PDLARRKDQSHEDFSYKDVVDLPK 136
N ++Y LG+N+F D EEF GL P + +++ DV+ K
Sbjct: 74 NMGSQSYKLGVNKFTDWTKEEFLATHTGLSGINVTSPFEVVNETTPAWNWTVSDVLGTTK 133
Query: 137 SVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNG 196
DWR +GAVT VK QG CG CWAFS +AAVEG+ +I GNL SLSEQ+L+DC NNG
Sbjct: 134 --DWRNEGAVTPVKYQGECGGCWAFSAIAAVEGLTKIARGNLISLSEQQLLDCAREQNNG 191
Query: 197 CNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSL 256
C GG M AF YIV GG+ E YPY ++EG C + + I G+ +VP N+E +L
Sbjct: 192 CKGGTMIEAFNYIVKNGGVSSENAYPYQVKEGPCR--SNDIPAIVIRGFENVPSNNERAL 249
Query: 257 LKALANQPLSVAIEASGRDFQFYSGGVYDGH-CGTQLDHGVAAVGYGSTR-GLDYIIVKN 314
L+A++ QP++V I+AS F YSGGVY+ CGT ++H V VGYG+++ G+ Y + KN
Sbjct: 250 LEAVSRQPVAVDIDASETGFIHYSGGVYNARDCGTSVNHAVTLVGYGTSQEGIKYWLAKN 309
Query: 315 SWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPI 349
SWG WGE GYIR++R+ P+G+CG+ + ASYP+
Sbjct: 310 SWGKTWGENGYIRIRRDVEWPQGMCGVAQYASYPV 344
>gi|302763831|ref|XP_002965337.1| hypothetical protein SELMODRAFT_230602 [Selaginella moellendorffii]
gi|300167570|gb|EFJ34175.1| hypothetical protein SELMODRAFT_230602 [Selaginella moellendorffii]
Length = 343
Score = 271 bits (692), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 141/332 (42%), Positives = 203/332 (61%), Gaps = 19/332 (5%)
Query: 30 IVGYSPEDLTSNDKLID--------LFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDE 81
+VG +P + L D +FE W +K K Y S EK R IF D L +I++
Sbjct: 15 VVGATPFAIARPAALEDGRALEIKNMFEDWAAKHGKSYSSDLEKARRLMIFSDTLAYIEK 74
Query: 82 TNRKIKN-YWLGLNEFADLRHEEFKEMFLGL--KPDLARRKDQSHEDFSYKDVVDLPKSV 138
N + + LGLN+F+DL + EF+ M +G +P R ED DV LP S+
Sbjct: 75 HNAQPNTTFTLGLNKFSDLTNAEFRAMHVGKFKRPRYQDRLPAEDEDV---DVSSLPTSL 131
Query: 139 DWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCN 198
DWR+KGAVT +K+QG CGSCWAFS +A++E + + T L SLSEQ+L+DCD T + GC+
Sbjct: 132 DWRQKGAVTPIKDQGDCGSCWAFSAIASIESAHFLATKELVSLSEQQLMDCD-TVDAGCD 190
Query: 199 GGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGE--SEVVTINGYHDVPQNSEDSL 256
GGLM+ AF+++V GG+ E YPY G+C K ++V I G+ V ++S D+L
Sbjct: 191 GGLMETAFKFVVKNGGVTTEASYPYTGSVGSCNANKVAIINKVAEITGFKVVTEDSADAL 250
Query: 257 LKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSW 316
+KA++ P++V+I S +FQ Y G+ G CG LDHGV +GYG+ G+ Y I+KNSW
Sbjct: 251 MKAVSKTPVTVSICGSDENFQNYKSGILSGQCGDSLDHGVLLIGYGTEGGMPYWIIKNSW 310
Query: 317 GPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
G WGE G+++++R G +G+CG+N +SYP
Sbjct: 311 GTSWGEDGFMKIERKDG--DGICGMNGDSSYP 340
>gi|260516678|gb|ACX43965.1| cysteine protease 1 [Brachiaria hybrid cultivar]
Length = 338
Score = 270 bits (691), Expect = 6e-70, Method: Compositional matrix adjust.
Identities = 158/332 (47%), Positives = 199/332 (59%), Gaps = 23/332 (6%)
Query: 25 ARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNR 84
AR F +S E++ S L D+F ++M ++ K Y S E RF FK N+ I N
Sbjct: 20 ARQFQSALFS-EEVPSEVMLQDMFTAFMKQYSKAY-SHAEFSSRFNQFKANVETIRLHNT 77
Query: 85 KIK-NYWLGLNEFADLRHEEFKEMFLGLK---PDLARRKDQSHEDFSYKDVVDLPKSVDW 140
+Y +GLNEFADL EEFK + G K + AR + +++V P S+DW
Sbjct: 78 LANASYTMGLNEFADLSFEEFKGKYFGYKHVEREFARSNNL------HQEVEAAPTSIDW 131
Query: 141 RKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTG--NLASLSEQELIDCDNTYNN-GC 197
R AVT +K+QG CGSCWAFS ++EG ++ G L SLSEQ+L+DC +Y + GC
Sbjct: 132 RTSNAVTPIKDQGQCGSCWAFSATGSIEGA-WVLQGKHTLTSLSEQQLVDCSTSYGDAGC 190
Query: 198 NGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLL 257
NGGLMDYAF+YI++ G+ E YPY G C+ K ++VVTI+GY DV E SLL
Sbjct: 191 NGGLMDYAFEYIIANKGICAESAYPYKGVGGLCQ--KSCTKVVTISGYKDVASGDEASLL 248
Query: 258 KALAN-QPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSW 316
A+ P+SVAIEA FQFYS GV+ G CG LDHGV AVGYG+T DY IVKNSW
Sbjct: 249 NAVGTVGPVSVAIEADQAGFQFYSSGVFSGTCGHNLDHGVLAVGYGTTGSQDYWIVKNSW 308
Query: 317 GPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
G WGE GYIRM RN + CGI SYP
Sbjct: 309 GTSWGESGYIRMIRNKNQ----CGIAIQPSYP 336
>gi|18308182|gb|AAL67857.1|AF462309_1 cysteine proteinase [Acanthamoeba healyi]
Length = 330
Score = 270 bits (690), Expect = 8e-70, Method: Compositional matrix adjust.
Identities = 147/318 (46%), Positives = 191/318 (60%), Gaps = 18/318 (5%)
Query: 40 SNDKLIDLFESWM-----SKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKNYWLGLN 94
++D L +F WM S + VY S +E + R+ +++D +E NR+ K+Y+L +N
Sbjct: 22 THDPLTGVFAKWMRENTKSNYRFVY-SNEEFIYRWNVWRD-----EEHNRQNKSYFLAMN 75
Query: 95 EFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGS 154
+F DL + EF +F GL D ++ + H +P DWR+KGAVTHVKNQG
Sbjct: 76 QFGDLTNAEFNRLFKGLAFDYSKHA-KIHTAAPEAPATGIPSEFDWRQKGAVTHVKNQGQ 134
Query: 155 CGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIVSTG 213
CGSCW+FST + EG N + TG L SLSEQ LIDC +Y NNGCNGGLMDYAF+YI++
Sbjct: 135 CGSCWSFSTTGSTEGANFLKTGRLVSLSEQNLIDCSVSYGNNGCNGGLMDYAFEYIINNR 194
Query: 214 GLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASG 273
G+ E YPY ++ ++ GY DV E++LL A +P+SVAI+AS
Sbjct: 195 GIDTEASYPYQTAGPLTCQYNAANKGGSLTGYTDVTSGDENALLNAAVKEPVSVAIDASH 254
Query: 274 RDFQFYSGGV-YDGHC-GTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRN 331
FQFYSGGV Y+ C TQLDHGV VG+GS G D+ VKNSWG WG GYI+M RN
Sbjct: 255 NSFQFYSGGVYYESACSSTQLDHGVLVVGWGSENGQDFWWVKNSWGASWGLNGYIKMSRN 314
Query: 332 TGKPEGLCGINKMASYPI 349
CGI ASYP
Sbjct: 315 QNNN---CGIATAASYPT 329
>gi|356542171|ref|XP_003539543.1| PREDICTED: LOW QUALITY PROTEIN: KDEL-tailed cysteine endopeptidase
CEP2-like [Glycine max]
Length = 342
Score = 270 bits (690), Expect = 8e-70, Method: Compositional matrix adjust.
Identities = 146/344 (42%), Positives = 203/344 (59%), Gaps = 8/344 (2%)
Query: 9 TILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDL-FESWMSKFEKVYESLDEKLE 67
TI + I+ + + S D +S+ +++ + +ESW+ K+ + Y + DE
Sbjct: 4 TITLVAIINLLVLCNLWITASACPAKHNDNSSDSEVMRMRYESWLKKYGQKYRNKDEWEF 63
Query: 68 RFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFS 127
RFEI++ N++ I+ N + +Y L N+F DL +EEF+ M+L +P + F
Sbjct: 64 RFEIYRANVQFIEVYNSQNYSYKLMDNKFVDLTNEEFRRMYLVYQP-----RSHLQTRFM 118
Query: 128 YKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELI 187
Y+ DLPK +DWR +GAVT +K+QG CGSCW+FS VA VE IN+I TG L SLSEQ+LI
Sbjct: 119 YQKHGDLPKRIDWRTRGAVTXIKDQGHCGSCWSFSAVATVEDINKIKTGKLVSLSEQQLI 178
Query: 188 DCDNTY-NNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYH 246
DCDN N GCNGG M+ F +I GGL +++YPY +G K + V I GY
Sbjct: 179 DCDNRNGNEGCNGGHME-TFTFITKRGGLTTDKNYPYQGSDGDXNKAKVRNHAVAICGYE 237
Query: 247 DVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRG 306
++P ++E+ L A+A+QP SVA +A G FQ YS G + G CG L+H + VGYG G
Sbjct: 238 NLPAHNENMLKAAVAHQPASVATDAGGYAFQLYSKGTFSGSCGKDLNHRMTIVGYGEENG 297
Query: 307 LDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
Y +VKNSW G GYIRMKR+ +G CG ASYP K
Sbjct: 298 EKYWLVKNSWANDXGVSGYIRMKRDPKDKDGTCGTAMEASYPDK 341
>gi|413953666|gb|AFW86315.1| hypothetical protein ZEAMMB73_539008 [Zea mays]
Length = 314
Score = 270 bits (689), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 142/346 (41%), Positives = 194/346 (56%), Gaps = 42/346 (12%)
Query: 9 TILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLER 68
+IL +FF ++ A DL+ + ++ E WM+++ +VY+ EK R
Sbjct: 7 SILAILGFAFFCGAALA---------ARDLSDDSAMVARHEQWMAQYSRVYKDASEKARR 57
Query: 69 FEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSY 128
F+ FADL + EF+ + + F Y
Sbjct: 58 FK-------------------------FADLTNHEFRS--VKTNKGFKSSNMKILTGFRY 90
Query: 129 KDVVD--LPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQEL 186
++V LP ++DWR KG VT +K+QG CG C AFS VAA EGI +I TG L SL++QEL
Sbjct: 91 ENVSADALPTTIDWRTKGVVTPIKDQGQCGCCSAFSAVAATEGIVKISTGKLVSLADQEL 150
Query: 187 IDCD-NTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGY 245
+DCD + + GC GGLMD AF++I+ GGL E YPY +G C G + TI GY
Sbjct: 151 VDCDVHGEDQGCEGGLMDDAFKFIIKNGGLTTESSYPYTAADGKCN--SGSNSAATIKGY 208
Query: 246 HDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTR 305
DVP N E +L+KA+ANQP+SVA++ F+FYSGGV G CGT LDHG+AA+GYG T
Sbjct: 209 EDVPANDEAALMKAMANQPVSVAVDGGDMTFRFYSGGVMTGSCGTDLDHGIAAIGYGKTS 268
Query: 306 -GLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
G Y ++KNSWG WGE GY+RM+++ G+CG+ SYP K
Sbjct: 269 DGTKYWLMKNSWGTTWGENGYLRMEKDISDKRGMCGLAMEPSYPTK 314
>gi|113120263|gb|ABI30271.1| VXH-D [Vasconcellea x heilbornii]
Length = 276
Score = 269 bits (688), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 142/279 (50%), Positives = 197/279 (70%), Gaps = 5/279 (1%)
Query: 1 MALSSQF-KTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVY 59
MA S F K + ++ C+S + S+ FSIVGYSP+DLTS +KLI+LF+SWM +++KVY
Sbjct: 1 MATISSFSKLLFVAICLSVHMGLSYGA-FSIVGYSPDDLTSTEKLINLFDSWMVEYDKVY 59
Query: 60 ESLDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLG-LKPDLARR 118
+ +DEK+ RFEIFKDNL++IDETN+K YWLGL F DL ++EFKE ++G + +
Sbjct: 60 KDIDEKIYRFEIFKDNLKYIDETNKKNNTYWLGLTSFTDLTNDEFKEKYVGSISESWSTT 119
Query: 119 KDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNL 178
++ + E F Y D V++P S+DWR+KGAVT V+NQG CGSCW FS+VAAVEGIN+IVTG L
Sbjct: 120 EESNDEGFIYDDAVNIPTSIDWRQKGAVTPVRNQGGCGSCWTFSSVAAVEGINKIVTGQL 179
Query: 179 ASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESE 238
SLSEQEL+DC+ + GC GG YA QY V+ G+H + YPY + C ++ +
Sbjct: 180 LSLSEQELLDCERR-SYGCRGGFPLYALQY-VANSGIHLRQYYPYEGVQRQCRASQAKGP 237
Query: 239 VVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQ 277
V +G VP+N+E +L++ +A QP+S+ +EA GR FQ
Sbjct: 238 KVKTDGVGRVPRNNEQALIQRIAIQPVSIVVEAKGRAFQ 276
>gi|388497270|gb|AFK36701.1| unknown [Lotus japonicus]
Length = 343
Score = 269 bits (688), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 147/306 (48%), Positives = 193/306 (63%), Gaps = 9/306 (2%)
Query: 49 ESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKI--KNYWLGLNEFADLRHEEFKE 106
+ WM ++ + Y + E +RF+IF +NL +I++ N K+Y L LN+F+DL +EEF
Sbjct: 39 QQWMLQYGRSYTNDAEMEKRFKIFMENLEYIEKFNNAPGNKSYKLDLNQFSDLTNEEFIA 98
Query: 107 MFLGLKPDLARRKDQSHEDFSYK-DVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVA 165
GL D ++ S D+ D P S+DWR++GAVT VKNQG+CGSCWAFS VA
Sbjct: 99 SHTGLMIDPSKPSSSSKRASPASLDLSDTPTSLDWREQGAVTDVKNQGNCGSCWAFSAVA 158
Query: 166 AVEGINQIVTGNLASLSEQELIDC-DNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYI 224
AVEGI +I GNL SLSEQ+L+DC N N GC GG MD AF YI G+ E DY Y
Sbjct: 159 AVEGIVKIKNGNLISLSEQQLVDCASNEQNQGCGGGFMDNAFSYITEN-GIASENDYQYR 217
Query: 225 MEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVY 284
GTC+ + + I+GY DVP ED LL A++ QP+SVAI A G+ F Y G+Y
Sbjct: 218 GGAGTCQNNEMITPAARISGYEDVPA-GEDQLLLAVSQQPVSVAI-AVGQSFHLYKEGIY 275
Query: 285 DGHCGTQLDHGVAAVGYGSTR--GLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGIN 342
G CG+ L+HGV VGYG++ G Y ++KNSWG WGE GY+R+ R +G+ EG CGI
Sbjct: 276 SGPCGSSLNHGVTLVGYGTSEEDGTKYWLIKNSWGESWGENGYMRLLRESGQSEGHCGIA 335
Query: 343 KMASYP 348
AS+P
Sbjct: 336 VKASHP 341
>gi|2351107|dbj|BAA21929.1| bromelain [Ananas comosus]
Length = 312
Score = 269 bits (688), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 136/299 (45%), Positives = 187/299 (62%), Gaps = 6/299 (2%)
Query: 52 MSKFEKVYESLDEKLERFEIFKDNLRHIDE-TNRKIKNYWLGLNEFADLRHEEFKEMFLG 110
M+++ +VY+ DEK+ RF+IFK+N+ HI+ NR +Y LG+N+F D+ + EF + G
Sbjct: 1 MAEYGRVYKDNDEKMRRFQIFKNNVNHIETFNNRNGNSYTLGINKFTDMTNNEFVAQYTG 60
Query: 111 LKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGI 170
+ + F ++ + +S+DWR GAVT VK+Q CGSCWAFS +A VEGI
Sbjct: 61 GISRPLNIEKEPVVSFDDVNISAVGQSIDWRDYGAVTEVKDQNPCGSCWAFSAIATVEGI 120
Query: 171 NQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTC 230
+IVTG L SLSEQE++DC +NGC+GG +D A+ +I+S G+ E DYPY +G C
Sbjct: 121 YKIVTGYLVSLSEQEVLDC--AVSNGCDGGFVDNAYDFIISNNGVASEADYPYQAYQGDC 178
Query: 231 EMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGT 290
+ I GY V N E S+ A+ NQP++ AI+ASG +FQ+Y+GGV+ G CGT
Sbjct: 179 AANSWPNSAY-ITGYSYVRSNDESSMKYAVWNQPIAAAIDASGDNFQYYNGGVFSGPCGT 237
Query: 291 QLDHGVAAVGYGS-TRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
L+H + +GYG + G Y IVKNSWG WGE+GYIRM R GLCGI YP
Sbjct: 238 SLNHAITIIGYGQDSSGTQYWIVKNSWGSSWGERGYIRMARGV-SSSGLCGIAMDPLYP 295
>gi|302790570|ref|XP_002977052.1| hypothetical protein SELMODRAFT_268054 [Selaginella moellendorffii]
gi|300155028|gb|EFJ21661.1| hypothetical protein SELMODRAFT_268054 [Selaginella moellendorffii]
Length = 300
Score = 269 bits (687), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 137/304 (45%), Positives = 193/304 (63%), Gaps = 9/304 (2%)
Query: 47 LFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNR-KIKNYWLGLNEFADLRHEEFK 105
+FE W +K K Y S EK R IF D L +I++ N + LGLN+F+DL + EF+
Sbjct: 1 MFEGWAAKHGKSYSSDWEKARRLMIFSDTLAYIEKHNALPNTTFTLGLNKFSDLTNAEFR 60
Query: 106 EMFLG-LKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTV 164
++G KP R +D+ DV LP S+DWR++GAVT +K+QG CGSCWAFS +
Sbjct: 61 ANYVGKFKP--PRYQDRRPAKDVDVDVSSLPTSLDWRQEGAVTPIKDQGQCGSCWAFSAI 118
Query: 165 AAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYI 224
A++E + + T L SLSEQ+LIDCD T + GC GG + AF+++V GG+ EE YPY
Sbjct: 119 ASIESAHFLATKELVSLSEQQLIDCD-TVDQGCQGGFPEDAFKFVVENGGVTTEEAYPYT 177
Query: 225 MEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVY 284
G+C K ++VV I GY DV ++S D+L+KA++ P++V I S ++FQ Y G+
Sbjct: 178 GFAGSCNANK--NKVVEITGYKDVTKDSADALMKAVSKTPVTVGICGSDQNFQNYRSGIL 235
Query: 285 DGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKM 344
GHC DH V +GYG+ G+ Y I+KNSWG WGE G++R+K+ G EG+CG+N
Sbjct: 236 SGHCSNSRDHAVLVIGYGTEGGMPYWIIKNSWGTSWGEDGFMRIKKKDG--EGMCGMNGQ 293
Query: 345 ASYP 348
+SYP
Sbjct: 294 SSYP 297
>gi|4469157|emb|CAB38316.1| chymopapain isoform IV [Carica papaya]
Length = 226
Score = 269 bits (687), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 127/216 (58%), Positives = 155/216 (71%), Gaps = 2/216 (0%)
Query: 135 PKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYN 194
P+S+DWR KGAVT VKNQG+CGSCWAFST+A VEGIN+IVTGNL LSEQEL+DCD ++
Sbjct: 1 PQSIDWRAKGAVTPVKNQGACGSCWAFSTIATVEGINKIVTGNLLELSEQELVDCDR-HS 59
Query: 195 NGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSED 254
GC GG + QY V+ G+H + YPY ++ C T V I GY VP N E
Sbjct: 60 YGCKGGYQTTSLQY-VANNGVHTSKVYPYQAKQYKCRATDKPGPKVKITGYKRVPSNCET 118
Query: 255 SLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKN 314
S L ALANQPLSV +EA G+ FQ Y GV+DG CGT+LDH V AVGYG++ G +YII+KN
Sbjct: 119 SFLGALANQPLSVLVEAGGKPFQLYKSGVFDGPCGTKLDHAVTAVGYGTSDGKNYIIIKN 178
Query: 315 SWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
SWGP WGEKGY+R+KR +G +G CG+ K + YP K
Sbjct: 179 SWGPNWGEKGYMRLKRQSGNSQGTCGVYKSSYYPFK 214
>gi|302763837|ref|XP_002965340.1| hypothetical protein SELMODRAFT_143126 [Selaginella moellendorffii]
gi|302790566|ref|XP_002977050.1| hypothetical protein SELMODRAFT_232903 [Selaginella moellendorffii]
gi|300155026|gb|EFJ21659.1| hypothetical protein SELMODRAFT_232903 [Selaginella moellendorffii]
gi|300167573|gb|EFJ34178.1| hypothetical protein SELMODRAFT_143126 [Selaginella moellendorffii]
Length = 300
Score = 269 bits (687), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 137/304 (45%), Positives = 193/304 (63%), Gaps = 9/304 (2%)
Query: 47 LFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNR-KIKNYWLGLNEFADLRHEEFK 105
+FE W +K K Y S EK R IF D L +I++ N + LGLN+F+DL + EF+
Sbjct: 1 MFEGWAAKHGKSYSSDWEKARRLMIFSDTLAYIEKHNALPNTTFTLGLNKFSDLTNAEFR 60
Query: 106 EMFLG-LKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTV 164
++G KP R +D+ DV LP S+DWR++GAVT +K+QG CGSCWAFS +
Sbjct: 61 ANYVGKFKP--PRYQDRRPAKDVDVDVSSLPTSLDWRQEGAVTPIKDQGQCGSCWAFSAI 118
Query: 165 AAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYI 224
A++E + + T L SLSEQ+LIDCD T + GC GG + AF+++V GG+ EE YPY
Sbjct: 119 ASIESAHFLATKELVSLSEQQLIDCD-TVDQGCQGGFPEDAFKFVVENGGVTTEEAYPYT 177
Query: 225 MEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVY 284
G+C K ++VV I GY DV ++S D+L+KA++ P++V I S ++FQ Y G+
Sbjct: 178 GFAGSCNANK--NKVVEITGYKDVTKDSADALMKAVSKTPVTVGICGSDQNFQNYRSGIL 235
Query: 285 DGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKM 344
GHC DH V +GYG+ G+ Y I+KNSWG WGE G++R+K+ G EG+CG+N
Sbjct: 236 SGHCSNSRDHAVLVIGYGTEGGMPYWIIKNSWGTSWGEDGFMRIKKEDG--EGMCGMNGQ 293
Query: 345 ASYP 348
+SYP
Sbjct: 294 SSYP 297
>gi|330805275|ref|XP_003290610.1| hypothetical protein DICPUDRAFT_98747 [Dictyostelium purpureum]
gi|325079249|gb|EGC32858.1| hypothetical protein DICPUDRAFT_98747 [Dictyostelium purpureum]
Length = 334
Score = 268 bits (686), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 141/304 (46%), Positives = 186/304 (61%), Gaps = 8/304 (2%)
Query: 48 FESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEM 107
F WM K K Y E ++++ FKDN+ I N K + LGLN FADL +EE+K+
Sbjct: 34 FLGWMKKHNKAYHH-HEFNDKYQTFKDNMDFIHNWNSKESDTVLGLNRFADLTNEEYKKT 92
Query: 108 FLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAV 167
+LG+ ++ R +Q + + P S+DWR+ GAV +VK+QG CGSCWAF+T AV
Sbjct: 93 YLGMSINVNLRANQVPMNGLNFERFTGPSSIDWRQNGAVAYVKDQGHCGSCWAFATTGAV 152
Query: 168 EGINQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIME 226
EG +QI TGN+ + SEQ L+DC Y NNGC+GGLM AF+YI+ G+ EE YPY
Sbjct: 153 EGAHQIKTGNMVTFSEQHLVDCSGRYGNNGCDGGLMTSAFKYIIDNDGIATEEAYPYTAT 212
Query: 227 EGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVY-D 285
+ C + I+GY DVP+ SE +L A++ QP++VAI+AS FQ Y GVY +
Sbjct: 213 QNRC-VYNTTMLGTAISGYKDVPRGSESALTAAISKQPVAVAIDASPITFQLYKSGVYQE 271
Query: 286 GHCGT-QLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKM 344
C + +L+HGV AVGYG+ G DY IVKNSW WG +GYI M RN CGI M
Sbjct: 272 ATCSSYRLNHGVLAVGYGTLEGKDYYIVKNSWAETWGNQGYILMARNANNH---CGIATM 328
Query: 345 ASYP 348
ASY
Sbjct: 329 ASYA 332
>gi|320169658|gb|EFW46557.1| cathepsin L2 [Capsaspora owczarzaki ATCC 30864]
Length = 324
Score = 268 bits (686), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 140/306 (45%), Positives = 182/306 (59%), Gaps = 8/306 (2%)
Query: 48 FESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEM 107
F+SW + Y ++ E+ R I++ NL I++ N + +Y L +N+FADL + EF
Sbjct: 22 FDSWKATHGVSYATVGEETARRGIYRANLDFIEKHNSEGHSYKLAVNKFADLTYPEFAAK 81
Query: 108 FLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAV 167
+LGL+ D +V LP SVDWR G VT +K+QG CGSCW+FST +V
Sbjct: 82 YLGLRFDATNATKSFAASTYLPRMVSLPDSVDWRTAGIVTPIKDQGQCGSCWSFSTTGSV 141
Query: 168 EGINQIVTGNLASLSEQELIDCDNTYNN-GCNGGLMDYAFQYIVSTGGLHKEEDYPYIME 226
EG + TG L SLSEQ L+DC + N GCNGGLMD AFQYI+S G+ E YPY +
Sbjct: 142 EGQHARKTGQLVSLSEQNLVDCSSAQGNAGCNGGLMDQAFQYIISNNGIDTESSYPYTAQ 201
Query: 227 EGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVAIEASGRDFQFYSGGVYD 285
+GTC+ T+ Y D+ SE L A+A P+SVAI+AS FQFYS GVY+
Sbjct: 202 DGTCQFNSANVG-ATVASYQDIASGSESDLQNAVATVGPISVAIDASQPSFQFYSSGVYN 260
Query: 286 --GHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINK 343
+QLDHGV AVGYG++ DY +VKNSWG WG+ GYI M RN+ CGI
Sbjct: 261 EPACSSSQLDHGVLAVGYGTSGSSDYWLVKNSWGTSWGQSGYIWMTRNSNNQ---CGIAT 317
Query: 344 MASYPI 349
ASYP+
Sbjct: 318 AASYPL 323
>gi|226504984|ref|NP_001151293.1| cysteine protease 1 precursor [Zea mays]
gi|195645596|gb|ACG42266.1| cysteine protease 1 precursor [Zea mays]
Length = 340
Score = 268 bits (686), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 139/304 (45%), Positives = 188/304 (61%), Gaps = 5/304 (1%)
Query: 49 ESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRK-IKNYWLGLNEFADLRHEEFKEM 107
E WM++ + Y+ EK R E+F+ N ID N ++ L N FADL +EF+
Sbjct: 39 EKWMAEHGRAYKDEAEKARRLEVFRANAELIDSFNAAGTHSHRLATNRFADLTVQEFRAA 98
Query: 108 FLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAV 167
GL+P A + + D +SVDWR GAVT VK+QG+ G CWAFS VAAV
Sbjct: 99 RTGLRPRPAPSAGAGRFRYENFSLADAAQSVDWRAMGAVTGVKDQGASGCCWAFSAVAAV 158
Query: 168 EGINQIVTGNLASLSEQELIDCD-NTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIME 226
EG+N+I TG L SLSEQEL+DCD + + GC+GGLMD AFQ++ GGL E YPY
Sbjct: 159 EGLNKIRTGRLVSLSEQELVDCDVSGVDQGCDGGLMDNAFQFVARRGGLASESGYPYQCR 218
Query: 227 EGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDG 286
+G C + + +I G+ DVP+N+E +L A+A+QP+SVAI F+FY GV G
Sbjct: 219 DGPCRSSA-AAAAASIRGHEDVPRNNEAALAAAVAHQPVSVAINGEDMAFRFYDSGVLGG 277
Query: 287 HCGTQLDHGVAAVGYGSTR-GLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMA 345
CGT L+H + AVGYG+ G Y ++KNSWG WGE GY+R++R + EG+CG+ K+
Sbjct: 278 ACGTDLNHAITAVGYGTAADGTRYWLMKNSWGASWGEGGYVRIRRGV-RGEGVCGLAKLP 336
Query: 346 SYPI 349
SYP+
Sbjct: 337 SYPV 340
>gi|302763109|ref|XP_002964976.1| hypothetical protein SELMODRAFT_83176 [Selaginella moellendorffii]
gi|302763113|ref|XP_002964978.1| hypothetical protein SELMODRAFT_83554 [Selaginella moellendorffii]
gi|300167209|gb|EFJ33814.1| hypothetical protein SELMODRAFT_83176 [Selaginella moellendorffii]
gi|300167211|gb|EFJ33816.1| hypothetical protein SELMODRAFT_83554 [Selaginella moellendorffii]
Length = 300
Score = 268 bits (685), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 135/304 (44%), Positives = 194/304 (63%), Gaps = 9/304 (2%)
Query: 47 LFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKN-YWLGLNEFADLRHEEFK 105
+FE W +K +K Y S EK R +F D L +I++ N + + LGLN+F+DL + EF+
Sbjct: 1 MFEDWAAKHDKSYSSDWEKARRLMVFSDTLAYIEKHNAQPNTTFTLGLNKFSDLTNAEFR 60
Query: 106 EMFLG-LKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTV 164
++G KP R +D+ DV LP S+DWR++GAVT +K+QG CGSCWAFS +
Sbjct: 61 ANYVGKFKP--PRYQDRRPAKDVDVDVSSLPTSLDWRQEGAVTPIKDQGQCGSCWAFSAI 118
Query: 165 AAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYI 224
A++E + + T L SLSEQ+LIDCD T + GC GG D AF+++V GG+ EE YPY
Sbjct: 119 ASIESAHFLATKELVSLSEQQLIDCD-TVDQGCQGGFPDDAFKFVVENGGVTTEEAYPYT 177
Query: 225 MEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVY 284
G+C K ++VV I GY DV ++S D+L+KA++ P++V I S ++FQ Y G+
Sbjct: 178 GFAGSCNTNK--NKVVEITGYKDVTKDSADALMKAVSKTPVTVGICGSDQNFQNYRSGIL 235
Query: 285 DGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKM 344
G C DH V +GYG+ G+ Y I+KNSWG WGE G++++K+ G EG+CG+N
Sbjct: 236 SGQCCNSRDHAVLVIGYGTEGGMPYWIIKNSWGTSWGEDGFMKIKKKDG--EGMCGMNGQ 293
Query: 345 ASYP 348
+SYP
Sbjct: 294 SSYP 297
>gi|81542|pir||S02728 actinidain (EC 3.4.22.14) precursor (clone pAC.1) - kiwi fruit
(fragment)
gi|15957|emb|CAA31435.1| actinidin precursor [Actinidia chinensis]
gi|166319|gb|AAA32630.1| actinidin precursor [Actinidia deliciosa]
gi|226542|prf||1601514A actinidin
Length = 302
Score = 268 bits (685), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 136/277 (49%), Positives = 180/277 (64%), Gaps = 5/277 (1%)
Query: 76 LRHIDETNRKI-KNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVDL 134
LR IDE N ++Y +GLN+FADL EEF+ +LG + K + + V L
Sbjct: 1 LRFIDEHNADTNRSYKVGLNQFADLTGEEFRSTYLGFTGGSNKTKVSNRYEPRVSQV--L 58
Query: 135 PKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYN 194
P VDWR GAV +K+QG CG CWAFS +A VEGIN+IVTG L SLSEQELI C T N
Sbjct: 59 PSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSEQELIGCGGTQN 118
Query: 195 N-GCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSE 253
GCNGG + FQ+I++ GG++ E+YPY ++G C + + VTI+ Y +VP N+E
Sbjct: 119 TRGCNGGYITDGFQFIINNGGINTGENYPYTAQDGECNLDLQNEKYVTIDTYGNVPYNNE 178
Query: 254 DSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVK 313
+L A+ QP+SVA++A+G F+ YS G++ G CGT +DH V VGYG+ G+DY IV+
Sbjct: 179 WALQTAVTYQPVSVALDAAGDAFKHYSSGIFTGPCGTAIDHAVTIVGYGTEGGIDYWIVE 238
Query: 314 NSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
NSW WGE+GY+R+ RN G G CGI M SYP+K
Sbjct: 239 NSWDTTWGEEGYMRILRNVGG-AGTCGIATMPSYPVK 274
>gi|320164780|gb|EFW41679.1| cathepsin L2 [Capsaspora owczarzaki ATCC 30864]
Length = 334
Score = 268 bits (685), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 147/308 (47%), Positives = 184/308 (59%), Gaps = 10/308 (3%)
Query: 48 FESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETN-RKIKNYWLGLNEFADLRHEEFKE 106
FE+W F K Y E++ R +++ N +D N I +Y LG+N FADL HEEFK
Sbjct: 30 FEAWKRTFGKSYSDAVEEINRRAVWEANKMLVDAHNGAGIHSYTLGMNIFADLTHEEFKR 89
Query: 107 MFLGLKPDLAR-RKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVA 165
+LG K DL R R + S +V LP SVDWR G VT VK+QG CGSCW+FST
Sbjct: 90 FYLGTKVDLNRPRSNFSSTFIPTANVGALPDSVDWRTAGIVTPVKDQGQCGSCWSFSTTG 149
Query: 166 AVEGINQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIVSTGGLHKEEDYPYI 224
+VEG + TG L SLSEQ L+DC N GCNGGLMD AFQYI++ G+ E YPY
Sbjct: 150 SVEGQHARKTGQLVSLSEQNLVDCSKAQGNQGCNGGLMDDAFQYIITNKGIDTEASYPYT 209
Query: 225 MEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVAIEASGRDFQFYSGGV 283
++GTC+ T++ + D+ + SE L A+A P+SVAI+AS FQ Y+ GV
Sbjct: 210 AKDGTCKFNAANVG-ATLSSFQDITRGSESDLQNAVATVGPVSVAIDASKNSFQLYTSGV 268
Query: 284 YD-GHC-GTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGI 341
Y+ C T LDHGV A GYG++ G Y +VKNSWG WG+ GYI M RN CGI
Sbjct: 269 YNEKKCSSTSLDHGVLAAGYGTSNGTPYWLVKNSWGSSWGQAGYIWMSRNANNQ---CGI 325
Query: 342 NKMASYPI 349
ASYPI
Sbjct: 326 ATSASYPI 333
>gi|405971603|gb|EKC36430.1| Cathepsin L [Crassostrea gigas]
Length = 360
Score = 268 bits (685), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 151/303 (49%), Positives = 201/303 (66%), Gaps = 16/303 (5%)
Query: 56 EKVYESLDEKLERFEIFKDNLRHIDETNRKI----KNYWLGLNEFADLRHEEFKEMFLGL 111
+K Y++L+E+ RFEIF++N++ I+E N+ K+Y+LG+N+F+DL+HEEF + + GL
Sbjct: 64 DKTYDALEEESRRFEIFRENVQKIEEHNKLYHLGKKSYYLGVNQFSDLKHEEFVK-YNGL 122
Query: 112 KPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGIN 171
K KD + + + P SVDWRKKG VT VKNQG CGSCW+FST ++EG +
Sbjct: 123 KK--TSLKDGGCSSYLAANNLVEPDSVDWRKKGYVTDVKNQGQCGSCWSFSTTGSLEGQH 180
Query: 172 QIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTC 230
+G L SLSE +L+DC ++ N GCNGGLMD AF+YI S GGL EEDYPY ++GTC
Sbjct: 181 FRKSGKLVSLSESQLVDCSQSFGNEGCNGGLMDNAFKYIKSVGGLESEEDYPYKPKQGTC 240
Query: 231 EMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVAIEASGRDFQFYSGGVYD-GHC 288
+ + T G DV SE +L KA++ P+SVAI+AS FQ Y+GGVYD C
Sbjct: 241 KFDDTKV-AATDTGCVDVESGSESALKKAVSEVGPVSVAIDASHSSFQSYAGGVYDEPEC 299
Query: 289 GT-QLDHGVAAVGYGS-TRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMAS 346
+ QLDHGV VGYG+ +G DY IVKNSWG +WGE GY++M RN + CGI AS
Sbjct: 300 SSEQLDHGVLCVGYGTDDQGQDYWIVKNSWGAEWGEDGYVKMSRN---KKNQCGIATQAS 356
Query: 347 YPI 349
YP+
Sbjct: 357 YPL 359
>gi|242049716|ref|XP_002462602.1| hypothetical protein SORBIDRAFT_02g028840 [Sorghum bicolor]
gi|241925979|gb|EER99123.1| hypothetical protein SORBIDRAFT_02g028840 [Sorghum bicolor]
Length = 384
Score = 268 bits (684), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 150/362 (41%), Positives = 196/362 (54%), Gaps = 58/362 (16%)
Query: 42 DKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKN--YWLGLNEFADL 99
D +++ FE WM + ++Y EK R E+++ N+ + ET + N Y L N+FADL
Sbjct: 26 DPMLERFEQWMGRHGRLYADAGEKQRRLEVYRRNVALV-ETFNSMSNGGYRLADNKFADL 84
Query: 100 RHEEFKEMFLGLKPDLARRKDQSHEDFS-------------YKDVVDLPKSVDWRKKGAV 146
+EEF+ LG + H Y D +LPKSVDWR+KGAV
Sbjct: 85 TNEEFRAKMLGFGRPPPHGRATGHTTTPGTVACIGSGLGRRYSD--ELPKSVDWREKGAV 142
Query: 147 THVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAF 206
VKNQG CGSCWAFS VAA+EGINQI G L SLSEQEL+DCD T GC GG M +AF
Sbjct: 143 APVKNQGECGSCWAFSAVAAIEGINQIKNGKLVSLSEQELVDCD-TKAIGCAGGYMSWAF 201
Query: 207 QYIVSTGGLHKEEDYPY----------------------------IMEEGTCEMTKGESE 238
+++++ GL E +YPY G C+ K +
Sbjct: 202 EFVMNNSGLTTERNYPYQGTYAHGNRKTHALPFDCTKGSSTCDSRAGMNGACQTPKLKES 261
Query: 239 VVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAA 298
V+I+GY +V +SE LL+A A QP+SVA++A +Q Y GGV+ G C L+HGV
Sbjct: 262 AVSISGYVNVTASSEPDLLRAAAAQPVSVAVDAGSFVWQLYGGGVFTGPCTADLNHGVTV 321
Query: 299 VGYGSTR-----------GLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASY 347
VGYG T+ G Y IVKNSWGP+WG+ GYI M+R GLCGI + SY
Sbjct: 322 VGYGETQRDTDGDGTGVPGQKYWIVKNSWGPEWGDAGYILMQREASVASGLCGIALLPSY 381
Query: 348 PI 349
P+
Sbjct: 382 PV 383
>gi|302790836|ref|XP_002977185.1| hypothetical protein SELMODRAFT_106228 [Selaginella moellendorffii]
gi|300155161|gb|EFJ21794.1| hypothetical protein SELMODRAFT_106228 [Selaginella moellendorffii]
Length = 299
Score = 268 bits (684), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 134/303 (44%), Positives = 192/303 (63%), Gaps = 7/303 (2%)
Query: 47 LFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKN-YWLGLNEFADLRHEEFK 105
+FE W +K K Y S EK R IF D L +I++ N + + LGLN+F+DL + EF+
Sbjct: 1 MFEDWAAKHGKSYSSDSEKARRLMIFSDTLAYIEKHNAQPNTTFTLGLNKFSDLTNAEFR 60
Query: 106 EMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVA 165
++G K R +D+ DV LP S+DWR++GAVT +K+QG CGSCWAFS +A
Sbjct: 61 ANYVG-KFKSPRYQDRRPAKDVDVDVSSLPTSLDWRQEGAVTPIKDQGQCGSCWAFSAIA 119
Query: 166 AVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIM 225
++E + + T L SLSEQ+LIDCD T + GC GG + AF+++V GG+ EE YPY
Sbjct: 120 SIESAHFLATKELVSLSEQQLIDCD-TVDQGCQGGFPEDAFKFVVENGGVTTEEAYPYTG 178
Query: 226 EEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYD 285
G+C K ++VV I GY DV ++S D+L+KA++ P++V I S ++FQ Y G+
Sbjct: 179 FAGSCNANK--NKVVEITGYKDVTKDSADALMKAVSKTPVTVGICGSDQNFQNYRSGILS 236
Query: 286 GHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMA 345
G C DH V +GYG+ G+ Y I+KNSWG WGE G++++K+ G EG+CG+N +
Sbjct: 237 GQCSNSRDHAVLVIGYGTEGGMPYWIIKNSWGTSWGENGFMKIKKKDG--EGMCGMNGQS 294
Query: 346 SYP 348
SYP
Sbjct: 295 SYP 297
>gi|147769019|emb|CAN62459.1| hypothetical protein VITISV_015168 [Vitis vinifera]
Length = 246
Score = 268 bits (684), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 136/265 (51%), Positives = 171/265 (64%), Gaps = 24/265 (9%)
Query: 87 KNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAV 146
K+Y L +NEFADL +EEF K + + S F Y++V +P + DWRKKGAV
Sbjct: 3 KSYKLSINEFADLTNEEFGTSRNRFKAHICSTEATS---FKYENVTAVPSTXDWRKKGAV 59
Query: 147 THVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNT-YNNGCNGGLMDYA 205
T +K+QG CGSCWAFS VAA+EGI Q+ TG L SLSEQEL+DCD + + GC G
Sbjct: 60 TPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGEDQGCXGA----- 114
Query: 206 FQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPL 265
+YPY +GTC K INGY DVP N+E +L KA+A+QP+
Sbjct: 115 --------------NYPYAGTDGTCNRKKAAHPAAKINGYEDVPANNEKALQKAVAHQPI 160
Query: 266 SVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGST-RGLDYIIVKNSWGPKWGEKG 324
+VAI+A G +FQFYS GV+ G CGT+LDHGV AVGYG++ G+ Y +VKNSWG WGE+G
Sbjct: 161 AVAIDAGGXEFQFYSSGVFTGQCGTELDHGVXAVGYGTSDDGMKYWLVKNSWGTGWGEEG 220
Query: 325 YIRMKRNTGKPEGLCGINKMASYPI 349
YIRM+R+ EGLCGI ASYP
Sbjct: 221 YIRMQRDVTAKEGLCGIAMQASYPT 245
>gi|242048430|ref|XP_002461961.1| hypothetical protein SORBIDRAFT_02g011230 [Sorghum bicolor]
gi|241925338|gb|EER98482.1| hypothetical protein SORBIDRAFT_02g011230 [Sorghum bicolor]
Length = 380
Score = 267 bits (683), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 156/363 (42%), Positives = 203/363 (55%), Gaps = 36/363 (9%)
Query: 15 CISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKD 74
C S +A D +G S +D N +I+ F+ W + + K Y ++ E RF ++
Sbjct: 25 CSSATAHRPYAGD---MGSSTDD---NSPMIERFQRWKAAYNKSYATVAEDRRRFLVYAR 78
Query: 75 NLRHIDETNRKIK----NYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKD 130
N+ +I+ TN + + Y LG + DL ++EF M+ P A+ ED + +
Sbjct: 79 NMAYIEATNAEAEAAGLTYELGETAYTDLTNQEFMAMYTA-APSPAQLPADEDEDDAAEA 137
Query: 131 VVDL---------------------PKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEG 169
V+ P SVDWR GAVT VKNQG CGSCWAFSTVA VEG
Sbjct: 138 VITTRAGPVDAVGQLPVYVNLSTAAPASVDWRASGAVTPVKNQGRCGSCWAFSTVAVVEG 197
Query: 170 INQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGT 229
I QI TG L SLSEQEL+DCD T + GC+GG+ A ++I S GGL EEDYPY
Sbjct: 198 IYQIRTGKLVSLSEQELVDCD-TLDAGCDGGISYRALRWITSNGGLTTEEDYPYTGTTDA 256
Query: 230 CEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCG 289
C K +I G V SE SL A+A QP++V+IEA G +FQ Y GVY+G CG
Sbjct: 257 CNRAKLAHNAASIAGLRRVATRSEASLANAVAGQPVAVSIEAGGDNFQHYKRGVYNGPCG 316
Query: 290 TQLDHGVAAVGYGSTR--GLDYIIVKNSWGPKWGEKGYIRMKRN-TGKPEGLCGINKMAS 346
T L+HGV VGYG G Y I+KNSWG WG+ GYI+M+++ GKPEGLCGI S
Sbjct: 317 TSLNHGVTVVGYGQEEEDGDKYWIIKNSWGASWGDGGYIKMRKDVAGKPEGLCGIAIRPS 376
Query: 347 YPI 349
+P+
Sbjct: 377 FPL 379
>gi|49387634|dbj|BAD25828.1| putative cysteine proteinase [Oryza sativa Japonica Group]
gi|49388888|dbj|BAD26098.1| putative cysteine proteinase [Oryza sativa Japonica Group]
Length = 358
Score = 266 bits (681), Expect = 8e-69, Method: Compositional matrix adjust.
Identities = 140/315 (44%), Positives = 180/315 (57%), Gaps = 13/315 (4%)
Query: 44 LIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETN-RKIKNYWLGLNEFADLRHE 102
++D F +W + Y S +E L+RF++++ N ID N R Y L NEFADL E
Sbjct: 43 MMDRFRAWQGAHNRSYPSAEEALQRFDVYRRNAEFIDAVNLRGDLTYQLAENEFADLTEE 102
Query: 103 EFKEMFLGLKP------DLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGS-C 155
EF + G D D S+ VD+P SVDWR +GAV K+Q S C
Sbjct: 103 EFLATYTGYYAGDGPVDDSVITTGAGDVDASFSYRVDVPASVDWRAQGAVVPPKSQTSTC 162
Query: 156 GSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGL 215
SCWAF T A +E +N I TG L SLSEQ+L+DCD +Y+ GCN G A++++V GGL
Sbjct: 163 SSCWAFVTAATIESLNMIKTGKLVSLSEQQLVDCD-SYDGGCNLGSYGRAYKWVVENGGL 221
Query: 216 HKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRD 275
E DYPY G C K I G+ VP +E +L A+A QP++VAIE G
Sbjct: 222 TTEADYPYTARRGPCNRAKSAHHAAKITGFGKVPPRNEAALQAAVARQPVAVAIEV-GSG 280
Query: 276 FQFYSGGVYDGHCGTQLDHGVAAVGYGS--TRGLDYIIVKNSWGPKWGEKGYIRMKRNTG 333
QFY GGVY G CGT+L H V VGYG+ + G Y +KNSWG WGE+GYIR+ R+ G
Sbjct: 281 MQFYKGGVYTGPCGTRLAHAVTVVGYGTDASSGAKYWTIKNSWGQSWGERGYIRILRDVG 340
Query: 334 KPEGLCGINKMASYP 348
P GLCG+ +YP
Sbjct: 341 GP-GLCGVTLDIAYP 354
>gi|115478933|ref|NP_001063060.1| Os09g0381400 [Oryza sativa Japonica Group]
gi|113631293|dbj|BAF24974.1| Os09g0381400 [Oryza sativa Japonica Group]
gi|215678649|dbj|BAG92304.1| unnamed protein product [Oryza sativa Japonica Group]
gi|218202075|gb|EEC84502.1| hypothetical protein OsI_31193 [Oryza sativa Indica Group]
Length = 362
Score = 266 bits (681), Expect = 9e-69, Method: Compositional matrix adjust.
Identities = 140/315 (44%), Positives = 180/315 (57%), Gaps = 13/315 (4%)
Query: 44 LIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETN-RKIKNYWLGLNEFADLRHE 102
++D F +W + Y S +E L+RF++++ N ID N R Y L NEFADL E
Sbjct: 47 MMDRFRAWQGAHNRSYPSAEEALQRFDVYRRNAEFIDAVNLRGDLTYQLAENEFADLTEE 106
Query: 103 EFKEMFLGLKP------DLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGS-C 155
EF + G D D S+ VD+P SVDWR +GAV K+Q S C
Sbjct: 107 EFLATYTGYYAGDGPVDDSVITTGAGDVDASFSYRVDVPASVDWRAQGAVVPPKSQTSTC 166
Query: 156 GSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGL 215
SCWAF T A +E +N I TG L SLSEQ+L+DCD +Y+ GCN G A++++V GGL
Sbjct: 167 SSCWAFVTAATIESLNMIKTGKLVSLSEQQLVDCD-SYDGGCNLGSYGRAYKWVVENGGL 225
Query: 216 HKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRD 275
E DYPY G C K I G+ VP +E +L A+A QP++VAIE G
Sbjct: 226 TTEADYPYTARRGPCNRAKSAHHAAKITGFGKVPPRNEAALQAAVARQPVAVAIEV-GSG 284
Query: 276 FQFYSGGVYDGHCGTQLDHGVAAVGYGS--TRGLDYIIVKNSWGPKWGEKGYIRMKRNTG 333
QFY GGVY G CGT+L H V VGYG+ + G Y +KNSWG WGE+GYIR+ R+ G
Sbjct: 285 MQFYKGGVYTGPCGTRLAHAVTVVGYGTDASSGAKYWTIKNSWGQSWGERGYIRILRDVG 344
Query: 334 KPEGLCGINKMASYP 348
P GLCG+ +YP
Sbjct: 345 GP-GLCGVTLDIAYP 358
>gi|218202077|gb|EEC84504.1| hypothetical protein OsI_31195 [Oryza sativa Indica Group]
Length = 362
Score = 266 bits (680), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 140/315 (44%), Positives = 180/315 (57%), Gaps = 13/315 (4%)
Query: 44 LIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETN-RKIKNYWLGLNEFADLRHE 102
++D F +W + Y S +E L+RF++++ N ID N R Y L NEFADL E
Sbjct: 47 MMDRFRAWQGAHNRSYPSAEEALQRFDVYRRNAEFIDAVNLRGDLTYRLAENEFADLTEE 106
Query: 103 EFKEMFLGLKP------DLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGS-C 155
EF + G D D S+ VD+P SVDWR +GAV K+Q S C
Sbjct: 107 EFLATYTGYYAGDGPVDDSVITTGAGDVDASFSYRVDVPASVDWRAQGAVVPPKSQTSTC 166
Query: 156 GSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGL 215
SCWAF T A +E +N I TG L SLSEQ+L+DCD +Y+ GCN G A++++V GGL
Sbjct: 167 SSCWAFVTAATIESLNMIKTGKLVSLSEQQLVDCD-SYDGGCNLGSYGRAYKWVVENGGL 225
Query: 216 HKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRD 275
E DYPY G C K I G+ VP +E +L A+A QP++VAIE G
Sbjct: 226 TTEADYPYTARRGPCNRAKSAHHAAKITGFGKVPPRNEAALQAAVARQPVAVAIEV-GSG 284
Query: 276 FQFYSGGVYDGHCGTQLDHGVAAVGYGS--TRGLDYIIVKNSWGPKWGEKGYIRMKRNTG 333
QFY GGVY G CGT+L H V VGYG+ + G Y +KNSWG WGE+GYIR+ R+ G
Sbjct: 285 MQFYKGGVYTGPCGTRLAHAVTVVGYGTDASSGAKYWTIKNSWGQSWGERGYIRILRDVG 344
Query: 334 KPEGLCGINKMASYP 348
P GLCG+ +YP
Sbjct: 345 GP-GLCGVTLDIAYP 358
>gi|302831223|ref|XP_002947177.1| hypothetical protein VOLCADRAFT_103269 [Volvox carteri f.
nagariensis]
gi|300267584|gb|EFJ51767.1| hypothetical protein VOLCADRAFT_103269 [Volvox carteri f.
nagariensis]
Length = 514
Score = 266 bits (680), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 165/360 (45%), Positives = 200/360 (55%), Gaps = 49/360 (13%)
Query: 36 EDLTSNDKLI-------DLFESWMSKFEKVY-ESLDEKLERFEIFKDNLRHIDETNRKIK 87
E L S+D L F W ++ + Y E E R IF DN+R I E++ K
Sbjct: 19 EQLASSDLLALAKVEPHRAFTLWSRQYGRTYVEQSPEYTRRLSIFSDNVRAIQESHEKDP 78
Query: 88 NYWLGLNEFADLRHEEFKEMFLGLKPD------LARRKDQSHEDFSYKDVVDLPKSVDWR 141
L LNE+ADL EEF LGL+ D +RR + Y VD PK++DWR
Sbjct: 79 GVTLALNEYADLTWEEFSSTRLGLRIDQDQLDRRSRRSASRRNAWRYAAAVDNPKAIDWR 138
Query: 142 KKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCD----------- 190
+KGAV VKNQG CGSCWAFST A+EGIN IVTG L SLSEQ+L+DCD
Sbjct: 139 EKGAVAEVKNQGQCGSCWAFSTTGAIEGINAIVTGQLQSLSEQQLVDCDTGKRTVTRSKR 198
Query: 191 ---------------NTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGT---CEM 232
N N GC+GGLMD AF+Y++ GGL E+DY Y G C
Sbjct: 199 SCTVILPSYSSNSCRNESNMGCSGGLMDDAFKYVIQNGGLDTEQDYAYWSGYGLGFWCNK 258
Query: 233 TK-GESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQ 291
K + V+I+GY DVPQ ED+LLKA+A+QP++VAI +G QFYS GV C
Sbjct: 259 RKQTDRPAVSIDGYEDVPQG-EDNLLKAVAHQPVAVAI-CAGASMQFYSRGVIS-TCCEG 315
Query: 292 LDHGVAAVGYG-STRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
L+HGV VGY S G Y IVKNSWG WGE+GY R+K G+ GLCGI ASYP K
Sbjct: 316 LNHGVLTVGYNVSQDGEKYWIVKNSWGAGWGEQGYFRLKMGVGE-TGLCGIASAASYPTK 374
>gi|110739710|dbj|BAF01762.1| cysteine protease component of protease-inhibitor complex
[Arabidopsis thaliana]
Length = 300
Score = 266 bits (680), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 125/192 (65%), Positives = 149/192 (77%)
Query: 160 AFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEE 219
AFST+ AVEGIN+IVTG+L SLSEQEL+DCD +YN GCNGGLMDYAF++I+ GG+ E
Sbjct: 1 AFSTIGAVEGINKIVTGDLISLSEQELVDCDTSYNQGCNGGLMDYAFEFIIKNGGIDTEA 60
Query: 220 DYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFY 279
DYPY +G C+ + ++VVTI+ Y DVP+NSE SL KALA+QP+SVAIEA GR FQ Y
Sbjct: 61 DYPYKAADGRCDQNRKNAKVVTIDSYEDVPENSEASLKKALAHQPISVAIEAGGRAFQLY 120
Query: 280 SGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLC 339
S GV+DG CGT+LDHGV AVGYG+ G Y IV+NSWG +WGE GYI+M RN P G C
Sbjct: 121 SSGVFDGLCGTELDHGVVAVGYGTENGKGYWIVRNSWGNRWGESGYIKMARNIEAPTGKC 180
Query: 340 GINKMASYPIKK 351
GI ASYPIKK
Sbjct: 181 GIAMEASYPIKK 192
>gi|157834287|pdb|1YAL|A Chain A, Carica Papaya Chymopapain At 1.7 Angstroms Resolution
Length = 218
Score = 265 bits (678), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 126/216 (58%), Positives = 154/216 (71%), Gaps = 2/216 (0%)
Query: 135 PKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYN 194
P+S+DWR KGAVT VKNQG+CGS WAFST+A VEGIN+IVTGNL LSEQEL+DCD ++
Sbjct: 2 PQSIDWRAKGAVTPVKNQGACGSXWAFSTIATVEGINKIVTGNLLELSEQELVDCDK-HS 60
Query: 195 NGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSED 254
GC GG + QY V+ G+H + YPY ++ C T V I GY VP N E
Sbjct: 61 YGCKGGYQTTSLQY-VANNGVHTSKVYPYQAKQYKCRATDKPGPKVKITGYKRVPSNXET 119
Query: 255 SLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKN 314
S L ALANQPLSV +EA G+ FQ Y GV+DG CGT+LDH V AVGYG++ G +YII+KN
Sbjct: 120 SFLGALANQPLSVLVEAGGKPFQLYKSGVFDGPCGTKLDHAVTAVGYGTSDGKNYIIIKN 179
Query: 315 SWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
SWGP WGEKGY+R+KR +G +G CG+ K + YP K
Sbjct: 180 SWGPNWGEKGYMRLKRQSGNSQGTCGVYKSSYYPFK 215
>gi|297602242|ref|NP_001052232.2| Os04g0203500 [Oryza sativa Japonica Group]
gi|255675217|dbj|BAF14146.2| Os04g0203500 [Oryza sativa Japonica Group]
Length = 336
Score = 265 bits (678), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 134/319 (42%), Positives = 193/319 (60%), Gaps = 11/319 (3%)
Query: 34 SPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKNYWLGL 93
+ +L+ + + E WM+++ ++Y+ EK RFE+FK N+ I+ N +WLG+
Sbjct: 23 AARELSDDAAMAARHERWMAQYGRMYKDDAEKARRFEVFKANVAFIESFNAGNHKFWLGV 82
Query: 94 NEFADLRHEEFKEMFL--GLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKN 151
N+FADL ++EF+ G P R + D LP ++DWR KG VT +K+
Sbjct: 83 NQFADLTNDEFRSTKTNKGFIPSTTRVPTGFRNENVNIDA--LPATMDWRTKGVVTPIKD 140
Query: 152 QGSCGSCWAFSTVAAVEGINQIVTGNLASLS-EQELIDCDNTYNNGCNGGLMDYAFQYIV 210
QG CG CWAFS VAA+EGI ++ TG L S S + L+ + GC GGLMD AF++I+
Sbjct: 141 QGQCGCCWAFSAVAAMEGIVKLSTGKLISHSLNKSLL---TVMSMGCEGGLMDDAFKFII 197
Query: 211 STGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIE 270
GGL E +YPY + + + V +I GY DVP N+E +L+KA+ANQP+SVA++
Sbjct: 198 KNGGLTTESNYPYAAVDD--KFKSVSNSVASIKGYEDVPANNEAALMKAVANQPVSVAVD 255
Query: 271 ASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYG-STRGLDYIIVKNSWGPKWGEKGYIRMK 329
FQFY GGV G CGT LDHG+ A+GYG ++ G Y ++KNSWG WGE G++RM+
Sbjct: 256 GGDMTFQFYKGGVMTGSCGTDLDHGIVAIGYGKASDGTKYWLLKNSWGMTWGENGFLRME 315
Query: 330 RNTGKPEGLCGINKMASYP 348
++ G+CG+ SYP
Sbjct: 316 KDISDKRGMCGLAMEPSYP 334
>gi|312100382|gb|ADQ27799.1| mitogenic proteinase [Vasconcellea cundinamarcensis]
Length = 214
Score = 265 bits (678), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 127/216 (58%), Positives = 157/216 (72%), Gaps = 6/216 (2%)
Query: 135 PKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYN 194
P+S+DWR+KGAVT VK+Q CGSCWAFSTVA VEGIN+IVTG L SLSEQEL+DCD +
Sbjct: 2 PESIDWRQKGAVTPVKDQNPCGSCWAFSTVATVEGINKIVTGKLISLSEQELLDCDRR-S 60
Query: 195 NGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSED 254
+GCNGG + QY+V G +H E +YPY ++G C + V I GY VP N E
Sbjct: 61 HGCNGGYQTTSLQYVVDNG-VHTEYEYPYEKKQGNCRAKDKKGLKVQITGYKRVPPNDEI 119
Query: 255 SLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKN 314
SL+K +ANQP+SV IE+ R F FY GG+Y G CGT+LDH V A+GYG DYI++KN
Sbjct: 120 SLIKVIANQPVSVLIESKDRSFHFYRGGIYKGPCGTRLDHAVTAIGYGK----DYILIKN 175
Query: 315 SWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
SWGP WGEKGYIR+KR +GK EG+CG+ K + +PIK
Sbjct: 176 SWGPNWGEKGYIRIKRASGKSEGICGVYKSSYFPIK 211
>gi|66270077|gb|AAY43368.1| cysteine protease [Phytophthora infestans]
Length = 510
Score = 265 bits (678), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 147/313 (46%), Positives = 184/313 (58%), Gaps = 22/313 (7%)
Query: 48 FESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKNYWLGL----NEFADLRHEE 103
F +WM + E +R E + N +I E N ++N W G+ NEF+ + EE
Sbjct: 29 FSAWMKTHSVSFSDALEFAKRLENYIANDMYIMEHN--LENAWTGVKLDHNEFSSMSFEE 86
Query: 104 FKEMFLG-------LKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCG 156
FK G L+ LA R D D V +P SVDW+ KG VT VKNQG CG
Sbjct: 87 FKFKMTGYVMPEGYLEQRLASRVDNLWSD------VQVPDSVDWQDKGGVTPVKNQGMCG 140
Query: 157 SCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLH 216
SCWAFST AVEG + +G L SLSEQEL+DCD+ + GCNGGLMD+AF +I GG+
Sbjct: 141 SCWAFSTTGAVEGAAFVSSGKLVSLSEQELVDCDHNGDMGCNGGLMDHAFAWIEDNGGIC 200
Query: 217 KEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDF 276
E+DY Y + C + +VV I+G+ DV E +L A+A QP+SVAIEA + F
Sbjct: 201 SEDDYEYKAKAQVCRDCE---KVVKISGFQDVNPQDEHALKVAVAQQPVSVAIEADQKAF 257
Query: 277 QFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPE 336
QFY GV++ CGT+LDHGV AVGYGS G + VKNSWG WGEKGYIR+ R P
Sbjct: 258 QFYKSGVFNLTCGTRLDHGVLAVGYGSENGQKFWKVKNSWGSSWGEKGYIRLAREENGPA 317
Query: 337 GLCGINKMASYPI 349
G CGI + SYP
Sbjct: 318 GQCGIASVPSYPF 330
>gi|357446993|ref|XP_003593772.1| Cysteine proteinase [Medicago truncatula]
gi|355482820|gb|AES64023.1| Cysteine proteinase [Medicago truncatula]
Length = 339
Score = 265 bits (677), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 144/328 (43%), Positives = 201/328 (61%), Gaps = 12/328 (3%)
Query: 31 VGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKN-- 88
+G + + L + DK I++F+ WM + +VY+ LDE ++F+IF NL++I ETN K K+
Sbjct: 1 MGPNLDKLPTQDKTIEIFQLWMKEHGRVYKDLDEMAKKFDIFISNLKYITETNAKRKSSN 60
Query: 89 -YWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVT 147
+ LGL F D EEF+E +L D+ D + + P S+DWR KG V+
Sbjct: 61 GFLLGLTNFTDWSSEEFQERYLH-NIDMPTDIDTMKVNDVHLSSCSAPSSLDWRSKGVVS 119
Query: 148 HVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQ 207
+K+Q +CGSCWAFS V A+EGIN I TG L +LSEQEL+DCD + GCN G ++ AF
Sbjct: 120 DIKDQKNCGSCWAFSAVGAIEGINAITTGKLINLSEQELLDCD-PISGGCNSGWVNKAFD 178
Query: 208 YIVSTGGLHKEEDYPYIMEEGTCEMTK-GESEVVTINGYHDVPQNSEDSLLKALANQPLS 266
+++ G+ + DYPY E+G C+ ++ S + +IN YH V Q S+ LL A+A QP+S
Sbjct: 179 WVIRNKGVALDNDYPYTAEKGVCKASQIPNSAISSINTYHHVEQ-SDQGLLCAVAKQPVS 237
Query: 267 VAIEASGRDFQFYSGGVYDG-HCGTQ---LDHGVAAVGYGSTRGLDYIIVKNSWGPKWGE 322
V + A +DF YS G+YDG +C +H V VGY S G DY IVKN WG WG
Sbjct: 238 VCLYAP-QDFHHYSSGIYDGPNCPVNSKDTNHCVLIVGYDSVDGQDYWIVKNQWGTSWGM 296
Query: 323 KGYIRMKRNTGKPEGLCGINKMASYPIK 350
+GY+ +KRNT K G+C IN A P+K
Sbjct: 297 EGYMHIKRNTNKKYGVCAINSWAYNPVK 324
>gi|301116794|ref|XP_002906125.1| cysteine protease family C01A, putative [Phytophthora infestans
T30-4]
gi|262107474|gb|EEY65526.1| cysteine protease family C01A, putative [Phytophthora infestans
T30-4]
Length = 535
Score = 265 bits (676), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 147/313 (46%), Positives = 184/313 (58%), Gaps = 22/313 (7%)
Query: 48 FESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKNYWLGL----NEFADLRHEE 103
F +WM + E +R E + N +I E N ++N W G+ NEF+ + EE
Sbjct: 29 FSAWMKTHSVSFSDALEFAKRLENYIANDMYIMEHN--LENAWTGVKLDHNEFSSMSFEE 86
Query: 104 FKEMFLG-------LKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCG 156
FK G L+ LA R D D V +P SVDW+ KG VT VKNQG CG
Sbjct: 87 FKFKMTGYVMPEGYLEQRLASRVDNLWSD------VQVPDSVDWQDKGGVTPVKNQGMCG 140
Query: 157 SCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLH 216
SCWAFST AVEG + +G L SLSEQEL+DCD+ + GCNGGLMD+AF +I GG+
Sbjct: 141 SCWAFSTTGAVEGAAFVSSGKLVSLSEQELVDCDHNGDMGCNGGLMDHAFAWIEDNGGIC 200
Query: 217 KEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDF 276
E+DY Y + C + +VV I+G+ DV E +L A+A QP+SVAIEA + F
Sbjct: 201 SEDDYEYKAKAQVCRDCE---KVVKISGFQDVNPQDEHALKVAVAQQPVSVAIEADQKAF 257
Query: 277 QFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPE 336
QFY GV++ CGT+LDHGV AVGYGS G + VKNSWG WGEKGYIR+ R P
Sbjct: 258 QFYKSGVFNLTCGTRLDHGVLAVGYGSENGQKFWKVKNSWGSSWGEKGYIRLAREENGPA 317
Query: 337 GLCGINKMASYPI 349
G CGI + SYP
Sbjct: 318 GQCGIASVPSYPF 330
>gi|242070333|ref|XP_002450443.1| hypothetical protein SORBIDRAFT_05g005530 [Sorghum bicolor]
gi|241936286|gb|EES09431.1| hypothetical protein SORBIDRAFT_05g005530 [Sorghum bicolor]
Length = 351
Score = 265 bits (676), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 154/357 (43%), Positives = 207/357 (57%), Gaps = 29/357 (8%)
Query: 2 ALSSQFKTILISFCISFFIRSSFARDFSI-VGYSPEDLTSNDKLIDLFESWMSKFEKVYE 60
A + + I+ CI + ARD S GY E +T+ E WM + + Y+
Sbjct: 14 AAVALLTVLAIANCIGCAVA---ARDLSSSTGYGEEAMTAR------HEKWMVEHGRTYK 64
Query: 61 SLDEKLERFEIFKDNLRHIDETNRKI--KNYWLGLNEFADLRHEEFKEMFLGLKPDLARR 118
EK RF++FK N +D +N K Y L +N FAD+ H+EF + G KP A
Sbjct: 65 DEAEKARRFQVFKANAAFVDTSNAAAGGKKYHLAINRFADMTHDEFMARYTGFKPLPATG 124
Query: 119 KDQSHEDFSYKDVV---DLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVT 175
K F Y +V + ++VDWRKKGAVT VKNQ CG CWAFS VAA+EG++QI T
Sbjct: 125 KKMP--GFKYANVTLSSEDQQAVDWRKKGAVTDVKNQQKCGCCWAFSAVAAIEGMHQINT 182
Query: 176 GNLASLSEQELIDCDN-TYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTK 234
G L SLSEQ+L+DC NNGC GG M+ AFQY++ G+ E YPY +G C+ +
Sbjct: 183 GELVSLSEQQLVDCSTNGNNNGCGGGTMEDAFQYVIGNNGIATEAAYPYTAMQGMCQNVQ 242
Query: 235 GESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDG-HCGTQLD 293
V + Y VP++ ED+L A+A QP+SVA++A+ +FQFY GGV CGT L+
Sbjct: 243 ---PAVAVRSYQQVPRDDEDALAAAVAGQPVSVAVDAN--NFQFYKGGVMTADSCGTNLN 297
Query: 294 HGVAAVGYGSTR-GLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPI 349
H V AVGYG+ G Y ++KN WG WGE+GY+R++R G CG+ K ASYP+
Sbjct: 298 HAVTAVGYGTAEDGTPYWLLKNQWGSTWGEEGYLRLQRGVGA----CGVAKDASYPV 350
>gi|157093355|gb|ABV22332.1| cysteine protease 1 [Noctiluca scintillans]
Length = 338
Score = 265 bits (676), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 143/311 (45%), Positives = 187/311 (60%), Gaps = 17/311 (5%)
Query: 47 LFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKE 106
+F ++ +K+ KVY ++E RF IFK N+ I TN + + LG+NEF DL EE
Sbjct: 26 MFNNFKTKYGKVYNGINEDAVRFGIFKANVDIIYATNARNLTFALGVNEFTDLTQEELAA 85
Query: 107 MFLGLKP-----DLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAF 161
+ GLKP L R +HE + L SVDW +G VT VKNQG CGSCW+F
Sbjct: 86 SYTGLKPASLWSGLPRLS--THE----YNGAPLASSVDWTTQGVVTPVKNQGQCGSCWSF 139
Query: 162 STVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDY 221
ST A+EG + TGNL SLSEQ+ +DCD T ++GCNGG MD AF + + E Y
Sbjct: 140 STTGALEGAWALSTGNLVSLSEQQFVDCDTT-DSGCNGGWMDNAFSF-AKKNSICTEGSY 197
Query: 222 PYIMEEGTCEMTKGESEVVT--INGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFY 279
PY +GTC ++ + + + GY DV +SE +++ A+A QP+S+AIEA FQ Y
Sbjct: 198 PYTATDGTCNLSGCQVGIPQGGVVGYTDVSTDSEQAMMSAVAQQPVSIAIEADQYSFQLY 257
Query: 280 SGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLC 339
S GV CGT+LDHGV AVGYGS G DY VKNSWG WGE+GY+R++R G G C
Sbjct: 258 SSGVLTASCGTRLDHGVLAVGYGSEAGTDYWKVKNSWGSSWGEQGYVRLQRGKGG-AGEC 316
Query: 340 G-INKMASYPI 349
G + SYP+
Sbjct: 317 GLLAGPPSYPV 327
>gi|281204396|gb|EFA78592.1| cysteine proteinase 3 [Polysphondylium pallidum PN500]
Length = 330
Score = 264 bits (675), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 147/327 (44%), Positives = 194/327 (59%), Gaps = 10/327 (3%)
Query: 28 FSIVGY-SPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKI 86
F IVG S L S + F +WM + ++ Y+ E +R+ FK+NL I + N +
Sbjct: 8 FLIVGIASANRLFSEQHYQNQFTNWMVRLDRAYDVF-EFQDRYNAFKNNLDLIHKWNSQG 66
Query: 87 KNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAV 146
+ LG+N ADL +EE++ ++LG+K D +R Q+ K + S+DWR GAV
Sbjct: 67 HSTVLGVNHLADLSNEEYRNLYLGVKVDASRLPQQAASIKLNKVFAPVAASLDWRSSGAV 126
Query: 147 THVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNN-GCNGGLMDYA 205
VK+QG CGSCW+FST ++EG NQI TGN ASLSEQ+L+DC Y N GCNGGLMD A
Sbjct: 127 GRVKDQGQCGSCWSFSTTGSIEGANQIATGNFASLSEQQLMDCSRDYGNEGCNGGLMDAA 186
Query: 206 FQYIVSTGGLHKEEDYPYIMEEG-TCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQP 264
+Y+++ GGL EE YPY M + TC+ I+ Y DV + SE L L P
Sbjct: 187 MKYVIAQGGLDTEESYPYTMSDSYTCKFNPANIG-AKISSYIDVQRGSETDLAAKLNKGP 245
Query: 265 LSVAIEASGRDFQFYSGGV-YDGHCGT-QLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGE 322
+SVAI+AS FQ Y GV Y+ C + LDHGV AVGYG+ +Y IVKNSWGP WG
Sbjct: 246 VSVAIDASHSSFQLYKSGVYYEPACSSYNLDHGVLAVGYGTEGSSNYWIVKNSWGPNWGL 305
Query: 323 KGYIRMKRNTGKPEGLCGINKMASYPI 349
GYI M ++ CGI+ MAS P+
Sbjct: 306 SGYIWMAKDKSNH---CGISSMASIPV 329
>gi|22093636|dbj|BAC06931.1| putative cysteine proteinase [Oryza sativa Japonica Group]
gi|50510021|dbj|BAD30633.1| putative cysteine proteinase [Oryza sativa Japonica Group]
Length = 352
Score = 264 bits (675), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 141/311 (45%), Positives = 187/311 (60%), Gaps = 14/311 (4%)
Query: 49 ESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRK-IKNYWLGLNEFADLRHEEFKEM 107
+ WM++ + Y+ EK RF +FK N+ ID +N K Y L N F DL EF M
Sbjct: 43 DKWMAEHGRTYKDAAEKARRFRVFKANVDLIDRSNAAGNKRYRLATNRFTDLTDAEFAAM 102
Query: 108 FLGLKP-DLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAA 166
+ G P + + S +D P VDWR++GAVT VKNQ SCG CWAFSTVAA
Sbjct: 103 YTGYNPANTMYAAANATTRLSSEDD-QQPAEVDWRQQGAVTGVKNQRSCGCCWAFSTVAA 161
Query: 167 VEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIME 226
VEGI+QI TG L SLSEQ+L+DC + N GC GG +D AFQY+ ++GG+ E Y Y
Sbjct: 162 VEGIHQITTGELVSLSEQQLLDCAD--NGGCTGGSLDNAFQYMANSGGVTTEAAYAYQGA 219
Query: 227 EGTCEM---TKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGV 283
+G C+ + TI+GY V N E SL A+A+QP+SVAIE SG F+ Y GV
Sbjct: 220 QGACQFDASSSASGVAATISGYQRVNPNDEGSLAAAVASQPVSVAIEGSGAMFRHYGSGV 279
Query: 284 YDG-HCGTQLDHGVAAVGYGS----TRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGL 338
+ CGT+LDH VA VGYG+ + G Y I+KNSWG WG+ GY++++++ G +G
Sbjct: 280 FTADSCGTKLDHAVAVVGYGAEADGSGGGGYWIIKNSWGTTWGDGGYMKLEKDVGS-QGA 338
Query: 339 CGINKMASYPI 349
CG+ SYP+
Sbjct: 339 CGVAMAPSYPV 349
>gi|157093357|gb|ABV22333.1| cysteine protease 1 [Noctiluca scintillans]
Length = 338
Score = 264 bits (675), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 144/311 (46%), Positives = 187/311 (60%), Gaps = 17/311 (5%)
Query: 47 LFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKE 106
+F ++ +K+ KVY ++E RF IFK N+ I TN + + LG+NEF DL EEF
Sbjct: 26 MFNNFKTKYGKVYNGINEDAVRFGIFKANVDIIYATNARNLTFALGVNEFTDLTQEEFAA 85
Query: 107 MFLGLKP-----DLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAF 161
+ GLKP L R +HE + L SVDW +G VT VKNQG CGSCW+F
Sbjct: 86 SYTGLKPASLWSGLPRLS--THE----YNGAPLASSVDWTTQGVVTPVKNQGQCGSCWSF 139
Query: 162 STVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDY 221
ST A+EG + TGNL SLSEQ+ DCD T ++GCNGG MD AF + + E Y
Sbjct: 140 STTGALEGAWALSTGNLVSLSEQQFEDCDTT-DSGCNGGWMDNAFSF-AKKNSICTEGSY 197
Query: 222 PYIMEEGTCEMTKGESEVVT--INGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFY 279
PY +GTC ++ + + + GY DV +SE +++ A+A QP+S+AIEA FQ Y
Sbjct: 198 PYTATDGTCNLSGCQVGIPQGGVVGYTDVSTDSEQAMMSAVAQQPVSIAIEADQYSFQLY 257
Query: 280 SGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLC 339
S GV CGT+LDHGV AVGYGS G DY VKNSWG WGE+GY+R++R G G C
Sbjct: 258 SSGVLTASCGTRLDHGVLAVGYGSEAGTDYWKVKNSWGSSWGEQGYVRLQRGKGG-AGEC 316
Query: 340 G-INKMASYPI 349
G + SYP+
Sbjct: 317 GLLAGPPSYPV 327
>gi|449532567|ref|XP_004173252.1| PREDICTED: oryzain alpha chain-like [Cucumis sativus]
Length = 321
Score = 264 bits (675), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 125/196 (63%), Positives = 146/196 (74%), Gaps = 1/196 (0%)
Query: 156 GSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGL 215
GSCWAFS+VAAVEGINQIVTG L LSEQEL+DCD ++N GCNGGLMDYAFQ+I+ GG+
Sbjct: 13 GSCWAFSSVAAVEGINQIVTGELIPLSEQELVDCDKSFNMGCNGGLMDYAFQFIIGNGGI 72
Query: 216 HKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRD 275
EEDYPY + C+ + ++VVTI+GY DVP+N E SL KA+ANQP+SVAIEA GR
Sbjct: 73 DTEEDYPYKGRDAACDPNRKNAKVVTIDGYEDVPENDESSLKKAVANQPVSVAIEAGGRA 132
Query: 276 FQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGK- 334
FQ Y GV+ G CGT LDHGV AVGYG+ G DY IV+NSWG WGE GYIR++RN
Sbjct: 133 FQLYQSGVFTGRCGTDLDHGVVAVGYGTDNGTDYWIVRNSWGKDWGESGYIRLERNVANI 192
Query: 335 PEGLCGINKMASYPIK 350
G CGI SYP K
Sbjct: 193 TTGKCGIAVQPSYPTK 208
>gi|242040563|ref|XP_002467676.1| hypothetical protein SORBIDRAFT_01g032090 [Sorghum bicolor]
gi|241921530|gb|EER94674.1| hypothetical protein SORBIDRAFT_01g032090 [Sorghum bicolor]
Length = 358
Score = 264 bits (675), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 147/348 (42%), Positives = 208/348 (59%), Gaps = 12/348 (3%)
Query: 7 FKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKL 66
F IL++ C S ++++ A G N ++D F W + + + Y + +E+
Sbjct: 17 FALILVA-CCSLMLQAAAAAGGGADGVVVGADGDNKLMMDRFLRWQATYNRSYPTAEERQ 75
Query: 67 ERFEIFKDNLRHIDETNRKIK-NYWLGLNEFADLRHEEFKEMFL--GLKPDLARRKDQSH 123
RF++++ N+ HI+ TNR Y LG N+FADL EEF +++ G+ P RR
Sbjct: 76 RRFQVYRRNMEHIEATNRAGNLTYTLGENQFADLTEEEFLDLYTMKGMPP--VRRDAGKK 133
Query: 124 EDFSYKDVVDLPKSVDWRKKGAVTHVKNQG-SCGSCWAFSTVAAVEGINQIVTGNLASLS 182
+ ++ VVD P SVDWR +GAVT +KNQG SC SCWAF T A +E I QI TG L SLS
Sbjct: 134 QQANFSSVVDAPTSVDWRSRGAVTPIKNQGPSCSSCWAFVTAATIESITQIRTGKLVSLS 193
Query: 183 EQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTI 242
EQELIDCD Y+ GCN G +++++ GGL E +YPY C +K I
Sbjct: 194 EQELIDCD-PYDGGCNLGYFVNGYKWVIQNGGLTTEANYPYQARRYQCNRSKAGQRAARI 252
Query: 243 NGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYG 302
+ Y +PQ E L +A+A QP++ AIE G QFYSGGV+ G CGT+++H + VGYG
Sbjct: 253 SNYRQLPQG-EAQLQQAVAQQPVAAAIEMGG-SLQFYSGGVWSGQCGTRMNHAITVVGYG 310
Query: 303 S-TRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPI 349
+ + G+ Y +VKNSWG WGE+GY+RM+++ + GLCGI +YPI
Sbjct: 311 ADSSGVKYWLVKNSWGQTWGERGYLRMRKDV-RQGGLCGIALDLAYPI 357
>gi|159485468|ref|XP_001700766.1| cysteine endopeptidase [Chlamydomonas reinhardtii]
gi|158281265|gb|EDP07020.1| cysteine endopeptidase [Chlamydomonas reinhardtii]
Length = 498
Score = 264 bits (674), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 152/316 (48%), Positives = 189/316 (59%), Gaps = 18/316 (5%)
Query: 48 FESWMSKFEKVY-ESLDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKE 106
F W ++ + Y E E R +F DN+R I E NR+ L LNE+AD EEF
Sbjct: 40 FGLWATQHARTYSEGSPEYTRRLGVFADNVRAIAEQNRRNTGITLALNEYADETWEEFAA 99
Query: 107 MFLGLKPDL-------ARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCW 159
LGLK AR S + Y V P +VDWR K AVT VKNQG CGSCW
Sbjct: 100 KRLGLKISQEQLKAREARSSSSSSSSWRYAQV-QTPAAVDWRAKNAVTQVKNQGQCGSCW 158
Query: 160 AFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEE 219
AFS V ++EG N + TG L +LSEQ+L+DCD N GC+GGLMD AF+Y++ GG+ EE
Sbjct: 159 AFSAVGSIEGANALATGQLVALSEQQLVDCDTASNMGCSGGLMDDAFKYVLDNGGIDTEE 218
Query: 220 DYPYIMEEG---TCEMTK-GESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRD 275
DY Y G C K + V+I+GY DVP SE +LLKA+A QP++VAI AS +
Sbjct: 219 DYSYWSGYGFGFWCNKRKQTDRPAVSIDGYEDVP-TSEPALLKAVAGQPVAVAICASA-N 276
Query: 276 FQFYSGGVYDGHCGTQLDHGVAAVGYGST-RGLDYIIVKNSWGPKWGEKGYIRMKRNTGK 334
QFYS GV + C L+HGV AVGY ++ + Y IVKNSWG WGE+GY R+K G
Sbjct: 277 MQFYSSGVINSCC-EGLNHGVLAVGYDTSDKAQPYWIVKNSWGGSWGEQGYFRLKMGEG- 334
Query: 335 PEGLCGINKMASYPIK 350
P+GLCGI ASY +K
Sbjct: 335 PKGLCGIASAASYAVK 350
>gi|414591548|tpg|DAA42119.1| TPA: hypothetical protein ZEAMMB73_388689, partial [Zea mays]
Length = 229
Score = 264 bits (674), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 126/197 (63%), Positives = 150/197 (76%), Gaps = 1/197 (0%)
Query: 156 GSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGL 215
GSCWAFS +AAVEG+N+I+TG L SLSEQEL+DCD+ N GC+GGLMDYAFQYI GG+
Sbjct: 13 GSCWAFSAIAAVEGVNKIMTGKLVSLSEQELVDCDDVDNQGCDGGLMDYAFQYIQRNGGV 72
Query: 216 HKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRD 275
E +YPY+ E+ +C K S VTI+GY DVP N+ED+L KA+A+QP++VAIEASG+D
Sbjct: 73 TTESNYPYLAEQRSCNKAKERSHDVTIDGYEDVPANNEDALQKAVASQPVAVAIEASGQD 132
Query: 276 FQFYSGGVYDGHCGTQLDHGVAAVGYGST-RGLDYIIVKNSWGPKWGEKGYIRMKRNTGK 334
FQFYS GV+ G CGT LDHGVAAVGYG+T G Y VKNSWG WGE+GYIRM+R
Sbjct: 133 FQFYSEGVFTGSCGTDLDHGVAAVGYGTTGDGTKYWTVKNSWGEDWGERGYIRMQRGVPD 192
Query: 335 PEGLCGINKMASYPIKK 351
GLCGI SYP KK
Sbjct: 193 SRGLCGIAMEPSYPTKK 209
>gi|308810026|ref|XP_003082322.1| cysteine protease-1 (ISS) [Ostreococcus tauri]
gi|116060790|emb|CAL57268.1| cysteine protease-1 (ISS) [Ostreococcus tauri]
Length = 430
Score = 264 bits (674), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 147/342 (42%), Positives = 188/342 (54%), Gaps = 32/342 (9%)
Query: 39 TSNDKLIDLFESWMSK--FEKVYESLDEKLERFEIFKDNLRHIDETNRKIK----NYWLG 92
++ + L FE W S+ E+ +E +R F +N ++ E N ++W+G
Sbjct: 89 SNANALARHFERWCSEHGLERYLRDTEEYAKRLATFAENAAYVVEHNALYAIGEVSHWVG 148
Query: 93 LNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKD------------VVDLPKSVDW 140
LN A EE++ + LG KP+L D + + D VD P+++DW
Sbjct: 149 LNSLAATTREEYRAL-LGYKPELRSSGDAEMLEATSTDKVEQYKASWEYASVDPPEAIDW 207
Query: 141 RKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGG 200
+ GAVT KNQG CGSCWAFST AVEGI +I TG L SLSEQE++ C N GCNGG
Sbjct: 208 VELGAVTPPKNQGQCGSCWAFSTTGAVEGITKIRTGRLVSLSEQEMVSCSKQ-NMGCNGG 266
Query: 201 LMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKAL 260
LMDYAF++IV GG+ E YPY E C K + V TI+G+ DVP E L KA+
Sbjct: 267 LMDYAFRWIVKNGGIDSEFQYPYSAEALACNRWKLQLHVATIDGFKDVPPGDEKELEKAV 326
Query: 261 ANQPLSVAIEASGRDFQFYSGGVYDG-HCGTQLDHGVAAVGYG-----------STRGLD 308
+ QP+S+AIEA + FQ Y GGVYD CG+Q+DHGV VGYG R
Sbjct: 327 SQQPVSIAIEADTKSFQLYDGGVYDSKECGSQVDHGVLVVGYGFDDTHHNATKHHKRHRH 386
Query: 309 YIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
+ VKNSWG WGE G+IRM R G CGI SYP K
Sbjct: 387 FWKVKNSWGGTWGEGGFIRMARRISDETGQCGITTAPSYPTK 428
>gi|125564712|gb|EAZ10092.1| hypothetical protein OsI_32402 [Oryza sativa Indica Group]
Length = 382
Score = 264 bits (674), Expect = 6e-68, Method: Compositional matrix adjust.
Identities = 140/337 (41%), Positives = 192/337 (56%), Gaps = 34/337 (10%)
Query: 44 LIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIK-NYWLGLNEFADLRHE 102
++++F+ W +++ + Y + +E+ R ++ N+R+I+ TN Y LG + DL ++
Sbjct: 48 MMEMFQRWKAEYNRSYATPEEERRRLRVYARNVRYIEATNAAAGLAYELGETAYTDLTND 107
Query: 103 EFKEMFLGLKPDLARRK--------------------DQSHEDFSYKDVVDLPKSVDWRK 142
EF M+ P L + + + + P SVDWR
Sbjct: 108 EFMAMYTA--PPLRSAADDDDDAATTTIITTRAGPVDEHQQPEVYFNESAGAPASVDWRA 165
Query: 143 KGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLM 202
GAVT VK+QG CGSCWAFSTVA VEGI +I G L SLSEQEL+DCD T ++GC+GG+
Sbjct: 166 SGAVTEVKDQGRCGSCWAFSTVAVVEGIQKIKKGKLVSLSEQELVDCD-TLDSGCDGGVS 224
Query: 203 DYAFQYIVSTGGLHKEEDYPYI-MEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALA 261
A ++I + GG+ +DYPY C+ K TI G V SE SL A A
Sbjct: 225 YRALEWITANGGITTRDDYPYTGAAAAACDRAKLGHHAATIAGLRRVATRSEASLQNAAA 284
Query: 262 NQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGY--------GSTRGLDYIIVK 313
QP++V+IEA G +FQ Y GVYDG CGT+L+HGV VGY GS G Y I+K
Sbjct: 285 AQPVAVSIEAGGDNFQHYRKGVYDGPCGTRLNHGVTVVGYGQEEAPVDGSAAGDKYWIIK 344
Query: 314 NSWGPKWGEKGYIRMKRN-TGKPEGLCGINKMASYPI 349
NSWG WG++GYI+MK++ GKPEGLCGI S+P+
Sbjct: 345 NSWGKNWGDQGYIKMKKDVAGKPEGLCGIAIRPSFPL 381
>gi|218198967|gb|EEC81394.1| hypothetical protein OsI_24614 [Oryza sativa Indica Group]
Length = 342
Score = 263 bits (673), Expect = 6e-68, Method: Compositional matrix adjust.
Identities = 141/311 (45%), Positives = 187/311 (60%), Gaps = 14/311 (4%)
Query: 49 ESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRK-IKNYWLGLNEFADLRHEEFKEM 107
+ WM++ + Y+ EK RF +FK N+ ID +N K Y L N F DL EF M
Sbjct: 33 DKWMAEHGRTYKDAAEKARRFRVFKANVDLIDRSNAAGNKRYRLATNRFTDLTDAEFAAM 92
Query: 108 FLGLKP-DLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAA 166
+ G P + + S +D P VDWR++GAVT VKNQ SCG CWAFSTVAA
Sbjct: 93 YTGYNPANTMYAAANATTRLSSEDD-QQPAEVDWRQQGAVTGVKNQRSCGCCWAFSTVAA 151
Query: 167 VEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIME 226
VEGI+QI TG L SLSEQ+L+DC + N GC GG +D AFQY+ ++GG+ E Y Y
Sbjct: 152 VEGIHQITTGELVSLSEQQLLDCAD--NGGCTGGSLDNAFQYMANSGGVTTEAAYAYQGA 209
Query: 227 EGTCEM---TKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGV 283
+G C+ + TI+GY V N E SL A+A+QP+SVAIE SG F+ Y GV
Sbjct: 210 QGACQFDASSSASGVAATISGYQRVNPNDEGSLAAAVASQPVSVAIEGSGAMFRHYGSGV 269
Query: 284 YDG-HCGTQLDHGVAAVGYGS----TRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGL 338
+ CGT+LDH VA VGYG+ + G Y I+KNSWG WG+ GY++++++ G +G
Sbjct: 270 FTADSCGTKLDHAVAVVGYGAEADGSGGGGYWIIKNSWGTTWGDGGYMKLEKDVGS-QGA 328
Query: 339 CGINKMASYPI 349
CG+ SYP+
Sbjct: 329 CGVAMAPSYPV 339
>gi|229893789|gb|ACQ90252.1| cathepsin L [Pinctada fucata]
Length = 362
Score = 263 bits (673), Expect = 7e-68, Method: Compositional matrix adjust.
Identities = 153/328 (46%), Positives = 206/328 (62%), Gaps = 27/328 (8%)
Query: 38 LTSNDKLIDLFESWM---SKFEKVYESLDEKLERFEIFKDNLRHIDETNRKI----KNYW 90
+ ++ +L E+W + F KVY++++E+++RF+IF+D L I+E NRK K+Y+
Sbjct: 41 VKASTRLGPYHETWKEFKTLFGKVYDTVEEEIKRFDIFRDTLERIEEHNRKYHMGQKSYY 100
Query: 91 LGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHE-----DFSYKDVVDLPKSVDWRKKGA 145
+G+N+F+D+ H+E+ L+ + RR ++ + D K L VDWR KG
Sbjct: 101 MGVNQFSDMSHDEY------LRHNGLRRGNRKYSKGEGCDSYTKSGKQLDDKVDWRDKGY 154
Query: 146 VTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNN-GCNGGLMDY 204
VT VKNQG CGSCW+FST ++EG + TG L SLSEQ+L+DC T+ N GCNGGLMD
Sbjct: 155 VTPVKNQGQCGSCWSFSTTGSLEGQHFRQTGKLISLSEQQLVDCSGTFGNEGCNGGLMDN 214
Query: 205 AFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-Q 263
AF+YI S GGL E+DYPY ++G C + K + G DV ED+L ALA+
Sbjct: 215 AFEYIKSIGGLEGEDDYPYTAKQGKCHLKKSLFK-ANDTGCTDVESGDEDALKDALASVG 273
Query: 264 PLSVAIEASGRDFQFYSGGVYD-GHCGTQ-LDHGVAAVGYGSTR-GLDYIIVKNSWGPKW 320
P+SVAI+AS FQ Y GGVYD C +Q LDHGV VGYG+ G DY +VKNSWG W
Sbjct: 274 PISVAIDASHASFQSYDGGVYDEEECSSQNLDHGVLTVGYGTEENGGDYWLVKNSWGEMW 333
Query: 321 GEKGYIRMKRNTGKPEGLCGINKMASYP 348
GE+GYI+M RN + CGI ASYP
Sbjct: 334 GEEGYIKMSRN---KDNQCGIATQASYP 358
>gi|340370276|ref|XP_003383672.1| PREDICTED: digestive cysteine proteinase 2-like [Amphimedon
queenslandica]
Length = 327
Score = 263 bits (672), Expect = 9e-68, Method: Compositional matrix adjust.
Identities = 144/307 (46%), Positives = 189/307 (61%), Gaps = 13/307 (4%)
Query: 48 FESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKNY--WLGLNEFADLRHEEFK 105
F+ W K+ KVYE+ + +LER I++ N + ++ N + + +NEFADL EF
Sbjct: 24 FQDWKVKYNKVYETKETELERQIIWESNKKFVENHNANSDKFGFTVAMNEFADLDAGEFG 83
Query: 106 EMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVA 165
+F GL P R + + V +P +VDW++KGAVT +KNQG CGSCW+FS+
Sbjct: 84 RIFNGLLP---RPSSYNSTNIYKPSGVKVPDTVDWKEKGAVTPIKNQGQCGSCWSFSSTG 140
Query: 166 AVEGINQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIVSTGGLHKEEDYPYI 224
++EG + I TG L SLSEQ+L+DC Y N+GCNGGLMD +F+Y+ S G E++YPY
Sbjct: 141 SLEGQHFINTGTLVSLSEQQLMDCSTKYGNHGCNGGLMDNSFRYLKSVAGDETEDNYPYT 200
Query: 225 MEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVAIEASGRDFQFYSGGV 283
E G C + VVT Y D+PQ EDSL A+AN P+SVAI+AS FQ Y+ GV
Sbjct: 201 AENGVCRYDSSLA-VVTDKSYVDIPQGDEDSLKDAVANVGPISVAIDASHSSFQLYNSGV 259
Query: 284 YDGHC--GTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGI 341
Y TQLDHGV A+GYG+ G DY +VKNSWG WG +GYI+M RN CGI
Sbjct: 260 YYASTCSSTQLDHGVLAIGYGTEDGKDYWLVKNSWGTSWGMEGYIKMSRNRNNN---CGI 316
Query: 342 NKMASYP 348
ASYP
Sbjct: 317 ATQASYP 323
>gi|154183745|gb|ABS70713.1| cathepsin L-like cysteine proteinase [Dermacentor variabilis]
Length = 333
Score = 263 bits (671), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 152/339 (44%), Positives = 206/339 (60%), Gaps = 15/339 (4%)
Query: 19 FIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDN--- 75
+R S F +V + +S++ L +E++ + +K Y+S E+L RF+IF +N
Sbjct: 1 MLRISLLCAFVVVTTAA---SSHEILRTQWEAFKATHKKSYQSNMEELLRFKIFSENSLL 57
Query: 76 -LRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVDL 134
RH ++ R + +Y LG+N+F DL EF MF G + + + + + L
Sbjct: 58 VARHNEKYARGLVSYKLGMNQFGDLLPHEFARMFNGYRGARTAGRGSTFLPPANVNYSSL 117
Query: 135 PKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTY- 193
P+S+DWR+KGAVT VKNQG CGSCWAFST ++EG + + TG L SLSEQ L+DC T+
Sbjct: 118 PQSMDWREKGAVTPVKNQGQCGSCWAFSTTGSLEGQHFLKTGVLVSLSEQNLVDCSETFG 177
Query: 194 NNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSE 253
N+GC GGLMD AFQYI + GG+ E+ YPY E+G C K ++ T G+ D+ Q SE
Sbjct: 178 NHGCEGGLMDNAFQYIKANGGIDTEKSYPYEAEDGECRFKK-QNVGATDTGFVDIEQGSE 236
Query: 254 DSLLKALAN-QPLSVAIEASGRDFQFYSGGVYD-GHCGT-QLDHGVAAVGYGSTRGLDYI 310
D L KA+A P+SVAI+AS FQ YS GVYD C + QLDHGV VGYG G Y
Sbjct: 237 DDLKKAVATVGPVSVAIDASHSSFQLYSEGVYDETECSSEQLDHGVLVVGYGVEDGKKYW 296
Query: 311 IVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPI 349
+VKNSW WG+ GYI+M R+ + CGI ASYP+
Sbjct: 297 LVKNSWAESWGDNGYIKMSRD---KDNQCGIASAASYPL 332
>gi|255563134|ref|XP_002522571.1| cysteine protease, putative [Ricinus communis]
gi|223538262|gb|EEF39871.1| cysteine protease, putative [Ricinus communis]
Length = 343
Score = 262 bits (670), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 158/357 (44%), Positives = 208/357 (58%), Gaps = 26/357 (7%)
Query: 1 MALSSQFKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYE 60
M LS Q ++I+ + S A P L + + + + E WM++ + Y
Sbjct: 1 MPLSLQITKLVITLLMILGTWVSQAM--------PRPLLNAEAIAEKHEQWMARHGRTYH 52
Query: 61 SLDEKLERFEIFKDNLRHIDETNRKI-KNYWLGLNEFADLRHEEFKEMFLG--LKPDLAR 117
EK RF+IFK+NL +I+ N+ K Y LGLN+F+DL EEF + G + L
Sbjct: 53 DNAEKERRFQIFKNNLDYIENFNKAFNKTYKLGLNKFSDLSEEEFVTTYNGYEMPTTLPT 112
Query: 118 RKDQSHEDF--SYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVT 175
F +Y + ++P+S+DWR+ G VT VKNQG CG CWAFS VAAVEGI
Sbjct: 113 ANTTVKPTFFSNYYNQDEVPESIDWRENGVVTSVKNQGECGCCWAFSAVAAVEGI----A 168
Query: 176 GNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKG 235
GN ASLS Q+L+DC N+GC GG M AF+YIV G+ + DYPY E T EM +
Sbjct: 169 GNGASLSAQQLLDCVGD-NSGCGGGTMIKAFEYIVQNQGIVSDTDYPY---EQTQEMCRS 224
Query: 236 ESEVVT-INGYHDVPQNSEDSLLKALANQPLSVAIEA-SGRDFQFYSGGVYDGH-CGTQL 292
S V I GY V Q SE++L +A+A QP+SVAI+A SG +F+ Y GV+ CGT L
Sbjct: 225 GSNVAARITGYESVIQ-SEEALKRAVAKQPISVAIDASSGPNFKSYISGVFSAEDCGTHL 283
Query: 293 DHGVAAVGYGSTR-GLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
H V VGYG+T G Y +VKNSWG +WGE GY+R++R+ G EG CGI ASYP
Sbjct: 284 THAVTLVGYGTTEDGTKYWLVKNSWGEEWGESGYMRLQRDVGAMEGPCGIAMQASYP 340
>gi|328866326|gb|EGG14711.1| hypothetical protein DFA_10969 [Dictyostelium fasciculatum]
Length = 369
Score = 262 bits (670), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 156/374 (41%), Positives = 214/374 (57%), Gaps = 45/374 (12%)
Query: 4 SSQFKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLD 63
S + KTI FC++F + +G P L S+++ F+ W+ +FEK YES
Sbjct: 10 SVRSKTI---FCVTFLGGA--------LGSKPTALFSHEQYTTEFKGWVGQFEKNYES-H 57
Query: 64 EKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARR----- 118
E L RF+IFK N+ +I N K ++ L LN ADL +E++ ++LG K + A R
Sbjct: 58 EFLNRFDIFKKNMDYIKTWNDKSVDHKLELNTLADLTDKEYQRLYLGTKVNGALRVGLNH 117
Query: 119 ---KDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVT 175
+D H + +V D P +VDWRK+GAV+HVKNQG CGSCW+FS+ A+EG + I T
Sbjct: 118 ADERDFGHIKSVFSNVKDNP-NVDWRKQGAVSHVKNQGQCGSCWSFSSTGAIEGAHAIKT 176
Query: 176 GNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTK 234
G + SLSEQ+L+DC Y NNGCNGGLM AF Y++ GGL EE YPY + + M
Sbjct: 177 GEMISLSEQQLVDCSKRYGNNGCNGGLMTLAFDYVIDAGGLESEEAYPYTTTDTSACMFN 236
Query: 235 GESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVAIEASGRDFQFYSGGV-YDGHC-GTQ 291
+ V +I+ + ++ +E L L N P+SVAI+AS R F+FY G+ Y C +Q
Sbjct: 237 STNAVTSISDHQNIRAGNEKHLETVLRNVGPVSVAIDASPRSFRFYKSGIFYAPECSSSQ 296
Query: 292 LDHGVAAVGYGS-----------------TRGLDYIIVKNSWGPKWGEKGYIRMKRNTGK 334
LDHGV AVG+G T+ +Y IVKNSWG WG G+I M +N
Sbjct: 297 LDHGVLAVGFGKGNPESNFENKVSFIHDDTKNNEYYIVKNSWGSDWGSNGFIYMSKNR-- 354
Query: 335 PEGLCGINKMASYP 348
+ CGI MA+YP
Sbjct: 355 -KNNCGIATMATYP 367
>gi|226509942|ref|NP_001146834.1| cysteine protease precursor [Zea mays]
gi|159506725|gb|ABW97700.1| cysteine protease [Zea mays]
gi|414867308|tpg|DAA45865.1| TPA: cysteine protease [Zea mays]
Length = 352
Score = 262 bits (669), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 139/314 (44%), Positives = 195/314 (62%), Gaps = 15/314 (4%)
Query: 44 LIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIK-NYWLGLNEFADLRHE 102
++D F SW + + + Y + +E+ RF++++ N+ HI+ TNR Y LG N+FADL E
Sbjct: 45 MMDRFLSWQATYNRSYPTAEERQRRFQVYRRNIEHIEATNRAGNLTYTLGENQFADLTEE 104
Query: 103 EFKEMF----LGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQG-SCGS 157
EF +++ + ++ D +++ S VD P SVDWR KGAVT +KNQG SC S
Sbjct: 105 EFLDLYTMKGMPVRRDAGKKRANVS---SSAAAVDAPTSVDWRSKGAVTPIKNQGPSCSS 161
Query: 158 CWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHK 217
CWAF T A +E I +I TG L SLSEQELIDCD Y+ GCN G +++++ GGL
Sbjct: 162 CWAFVTAATIESITKITTGKLVSLSEQELIDCD-PYDGGCNLGYFVNGYRWVIQNGGLTT 220
Query: 218 EEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQ 277
E +YPY C ++ TI+ Y +P E L +A+A QP++ AIE G Q
Sbjct: 221 EANYPYQARRYACSRSRAAQHAATISDYVQLPAG-EGQLQQAVAQQPVAAAIEMGG-SLQ 278
Query: 278 FYSGGVYDGHCGTQLDHGVAAVGYG--STRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKP 335
FYSGGV+ G CGT+++H + VGYG S+ GL Y +VKNSWG WGE+GY+RM+R+ G+
Sbjct: 279 FYSGGVFSGQCGTRMNHAITVVGYGADSSSGLKYWLVKNSWGQSWGERGYLRMRRDVGR- 337
Query: 336 EGLCGINKMASYPI 349
GLCGI +YP+
Sbjct: 338 GGLCGIALDLAYPV 351
>gi|4469159|emb|CAB38317.1| chymopapain isoform V [Carica papaya]
Length = 227
Score = 262 bits (669), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 124/216 (57%), Positives = 153/216 (70%), Gaps = 2/216 (0%)
Query: 135 PKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYN 194
P+S+DWR KGAVT VKNQG+CGSCWAFST+A VEGIN+IVTGNL LSEQEL+DCD ++
Sbjct: 2 PQSIDWRAKGAVTPVKNQGACGSCWAFSTIATVEGINKIVTGNLLELSEQELVDCDK-HS 60
Query: 195 NGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSED 254
GC GG + QY V+ G+H + YP ++ C T V I GY VP N E
Sbjct: 61 YGCKGGYQTTSLQY-VANNGVHTSKVYPCQAKQYKCRATDKPGPKVKITGYKRVPSNCET 119
Query: 255 SLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKN 314
S L ALANQPLS +EA G+ FQ Y GV+DG CGT+LDH V AVGYG++ G +YII+KN
Sbjct: 120 SFLGALANQPLSFLVEAGGKPFQLYKSGVFDGPCGTKLDHAVTAVGYGTSDGKNYIIIKN 179
Query: 315 SWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
SWGP WGE+GY+R+KR +G +G CG+ K + YP K
Sbjct: 180 SWGPNWGEEGYMRLKRQSGNSQGTCGVYKSSYYPFK 215
>gi|357115272|ref|XP_003559414.1| PREDICTED: thiol protease SEN102-like [Brachypodium distachyon]
Length = 360
Score = 261 bits (667), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 147/334 (44%), Positives = 197/334 (58%), Gaps = 24/334 (7%)
Query: 36 EDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRK--------IK 87
+D+ + ESWM++ + Y +EK R EIF+ N ID N K +
Sbjct: 31 DDVAVGAAMASRHESWMAEHGRTYADAEEKARRLEIFRANAERIDSFNSKADAAAGESVD 90
Query: 88 NYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQS----HEDFSYKDVVDLPKSVDWRKK 143
++ L N FADL EEF+ GL+ A +E+FS + D S+DWR
Sbjct: 91 SHRLATNRFADLTDEEFRAARTGLRRPAAVAGAVGGGFRYENFSLQ--ADAAGSMDWRAM 148
Query: 144 GAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTY--NNGCNGGL 201
GAVT VK+QGSCG CWAFS VAA+EG+ +I TG L SLSEQ+L+DCD Y + GC GGL
Sbjct: 149 GAVTGVKDQGSCGCCWAFSAVAAMEGLTKIRTGRLVSLSEQQLVDCD-VYGDDQGCEGGL 207
Query: 202 MDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALA 261
MD AFQYI GGL E YPY E+G + +I G+ DVP N+E +L+ A+A
Sbjct: 208 MDNAFQYISRQGGLASESAYPYSGEDGGSCRSGRAQPAASIRGHEDVPANNEGALMAAVA 267
Query: 262 NQPLSVAIEASGRDFQFY----SGGVYDGHC-GTQLDHGVAAVGYG-STRGLDYIIVKNS 315
+QP+SVAI F+FY G +G C T+LDH + AVGYG + G Y ++KNS
Sbjct: 268 HQPVSVAINGGDYVFRFYDRGVLGAGGNGGCESTELDHAITAVGYGMAGDGTGYWLMKNS 327
Query: 316 WGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPI 349
WG WGE GY+R++R + + EG+CG+ K+ASYP+
Sbjct: 328 WGSGWGESGYVRIRRGS-RGEGVCGLAKLASYPV 360
>gi|310942960|pdb|3P5W|A Chain A, Actinidin From Actinidia Arguta Planch (Sarusashi)
Length = 220
Score = 261 bits (667), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 124/218 (56%), Positives = 160/218 (73%), Gaps = 2/218 (0%)
Query: 134 LPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTY 193
LP VDWR GAV +K+QG CGSCWAFST+AAVEGIN+I TG+L SLSEQEL+DC T
Sbjct: 1 LPDYVDWRSSGAVVDIKDQGQCGSCWAFSTIAAVEGINKIATGDLISLSEQELVDCGRTQ 60
Query: 194 NN-GCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNS 252
N GC+GG M FQ+I++ GG++ E +YPY EEG C + + + V+I+ Y +VP N+
Sbjct: 61 NTRGCDGGFMTDGFQFIINNGGINTEANYPYTAEEGQCNLDLQQEKYVSIDTYENVPYNN 120
Query: 253 EDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIV 312
E +L A+A QP+SVA+EA+G +FQ YS G++ G CGT +DH V VGYG+ G+DY IV
Sbjct: 121 EWALQTAVAYQPVSVALEAAGYNFQHYSSGIFTGPCGTAVDHAVTIVGYGTEGGIDYWIV 180
Query: 313 KNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
KNSWG WGE+GY+R++RN G G CGI K ASYP+K
Sbjct: 181 KNSWGTTWGEEGYMRIQRNVGGV-GQCGIAKKASYPVK 217
>gi|297819566|ref|XP_002877666.1| hypothetical protein ARALYDRAFT_906213 [Arabidopsis lyrata subsp.
lyrata]
gi|297323504|gb|EFH53925.1| hypothetical protein ARALYDRAFT_906213 [Arabidopsis lyrata subsp.
lyrata]
Length = 304
Score = 261 bits (667), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 134/308 (43%), Positives = 187/308 (60%), Gaps = 22/308 (7%)
Query: 45 IDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKN-YWLGLNEFADLRHEE 103
I+ E WMS+F +VY EK RFEIFK NL+ ++ N N Y L +N+F+DL EE
Sbjct: 15 IEKHEQWMSRFNRVYSDDSEKTSRFEIFKKNLKFVESFNMNTNNTYKLDVNKFSDLTDEE 74
Query: 104 FKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFST 163
F+ ++GL P+ Q F Y++V + +S+DWR +GAVT VK+QG CG CWAF+
Sbjct: 75 FQARYMGLVPEGMTGDSQKTVSFRYENVSETGESMDWRLEGAVTPVKDQGQCGCCWAFAA 134
Query: 164 VAAVEGINQIVTGNLASLSEQELIDCDNTYNN-GCNGGLMDYAFQYIVSTGGLHKEEDYP 222
VAAVEG+ +I G L SLSEQ+L+DC NN GC+GGL A+ YI G+ EE+YP
Sbjct: 135 VAAVEGVTKIANGELVSLSEQQLVDCSTANNNMGCDGGLALTAYDYIKENQGITSEENYP 194
Query: 223 YIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGG 282
Y + TC+ T + TI+GY VP++ E++LLKA++ G
Sbjct: 195 YQAVQQTCKST--DPAAATISGYEAVPKDDEEALLKAVSQH-----------------GI 235
Query: 283 VYDGHCGTQLDHGVAAVGYGST-RGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGI 341
D +CGT H V VGYG++ G+ Y ++KNSWG WGE GY+R+KR+ +P+G+CG+
Sbjct: 236 FEDEYCGTDSHHAVTIVGYGTSEEGIKYWLLKNSWGESWGENGYMRIKRDVDEPQGMCGL 295
Query: 342 NKMASYPI 349
A YP+
Sbjct: 296 AHRAYYPV 303
>gi|125606204|gb|EAZ45240.1| hypothetical protein OsJ_29883 [Oryza sativa Japonica Group]
Length = 350
Score = 261 bits (666), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 144/332 (43%), Positives = 186/332 (56%), Gaps = 23/332 (6%)
Query: 38 LTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFA 97
L D ++D FE WM + + Y EK RFE+++ N+ ++ N Y L N+FA
Sbjct: 21 LARADLMLDRFEQWMIRHGRAYTDAGEKQRRFEVYRRNVELVETFNSMSNGYKLADNKFA 80
Query: 98 DLRHEEFKEMFLGLKPDLARRK-------DQSHEDFSYKDVVDLPKSVDWRKKGAVTHV- 149
DL +EEF+ LG +P + + D + S D+ LPKSVDWR KGAV +
Sbjct: 81 DLTNEEFRAKMLGFRPHVTIPQISNTCSADIAMPGESSDDI--LPKSVDWRNKGAVINRW 138
Query: 150 KNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYI 209
K GSCWAFS VAA+EGINQI G L SLSEQEL+DCD+ GC GG M +AF+++
Sbjct: 139 KICVDAGSCWAFSAVAAIEGINQIKNGELVSLSEQELVDCDDE-AVGCGGGYMSWAFEFV 197
Query: 210 VSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAI 269
V GL E YPY G C+ K V I GY +V +SE L +A A QP+SVA+
Sbjct: 198 VGNHGLTTEASYPYHAANGACQAAKLNQSAVAIAGYRNVTPSSEPDLARAAAAQPVSVAV 257
Query: 270 EASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTR-----------GLDYIIVKNSWGP 318
+ FQ Y GVY G C ++HGV VGYG + G Y IVKNSWG
Sbjct: 258 DGGSFMFQLYGSGVYTGPCTADVNHGVTVVGYGESEPKTDGGGAAKGGEKYWIVKNSWGA 317
Query: 319 KWGEKGYIRMKRN-TGKPEGLCGINKMASYPI 349
+WG+ GYI M+R+ G GLCGI + SYP+
Sbjct: 318 EWGDAGYILMQRDVAGLASGLCGIALLPSYPV 349
>gi|320169652|gb|EFW46551.1| cathepsin L2 [Capsaspora owczarzaki ATCC 30864]
Length = 325
Score = 261 bits (666), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 147/308 (47%), Positives = 180/308 (58%), Gaps = 10/308 (3%)
Query: 48 FESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKN-YWLGLNEFADLRHEEFKE 106
F W + + Y S E+ R EI+ NL I+E N ++ Y LG+NEF DL H EF
Sbjct: 21 FAEWKALHNRQYASAQEEALRQEIYLSNLELINEHNAAGRHSYTLGMNEFGDLAHHEFAA 80
Query: 107 MFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAA 166
+LG++ + +V LP SVDWR G VT VKNQG CGSCW+FST +
Sbjct: 81 KYLGVRFNGVNATKSFASSTYLPRMVSLPDSVDWRTAGIVTPVKNQGQCGSCWSFSTTGS 140
Query: 167 VEGINQIVTGNLASLSEQELIDCDNTYNN-GCNGGLMDYAFQYIVSTGGLHKEEDYPYIM 225
VEG + TG L SLSEQ L+DC + N GCNGGLMD AF+YI+ GG+ E YPY
Sbjct: 141 VEGQHARKTGTLVSLSEQNLVDCSSQEGNEGCNGGLMDDAFEYIIKNGGIDTEASYPYTA 200
Query: 226 EEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVAIEASGRDFQFYSGGVY 284
GTC+ T+ Y D+ SE L A+A P+SVAI+AS +FQFY GVY
Sbjct: 201 TTGTCKFNAANIG-ATVASYQDIITGSESDLQNAVATVGPVSVAIDASHINFQFYFTGVY 259
Query: 285 D-GHCG-TQLDHGVAAVGYG-STRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGI 341
+ C TQLDHGV AVGYG ST G DY +VKNSWG WG+ GYI M RN + CGI
Sbjct: 260 NEKKCSTTQLDHGVLAVGYGTSTEGKDYWLVKNSWGATWGKAGYIWMSRN---ADNQCGI 316
Query: 342 NKMASYPI 349
ASYP+
Sbjct: 317 ATSASYPL 324
>gi|428170119|gb|EKX39047.1| hypothetical protein GUITHDRAFT_154556 [Guillardia theta CCMP2712]
Length = 352
Score = 261 bits (666), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 149/333 (44%), Positives = 200/333 (60%), Gaps = 19/333 (5%)
Query: 34 SPEDLTS--NDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIK---- 87
SP TS +D++ F SW +KFEKVY+ E L RF +FK N+ I N +
Sbjct: 19 SPASKTSSVDDEIHLAFISWKNKFEKVYDGA-EHLARFAVFKANMEIIRAHNALYELGEE 77
Query: 88 NYWLGLNEFADLRHEEFKEMFLGLKPDLARRKD----QSHEDFSYK-DVVDLPKSVDWRK 142
+ + N+FAD+ EEFK LG KP+L ++ S ++ +++ + PK++DWR
Sbjct: 78 TFSMAANQFADMTAEEFKRTVLGYKPELKGKRLLQGLNSGKNCTHRSNNSTRPKAIDWRT 137
Query: 143 KGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLM 202
K AVT VKNQG CGSCW+FST AVEG + L SLSE+EL+ CD + GCNGGLM
Sbjct: 138 KSAVTPVKNQGQCGSCWSFSTTGAVEGAWVVAGHPLISLSEEELVQCDTKSDQGCNGGLM 197
Query: 203 DYAFQYIVSTGGLHKEEDYPYIMEEGT---CEMTKGESEVVTINGYHDVPQNSEDSLLKA 259
D A+ +I+ GG+ E+ YPYI GT C + +V +I+ + D+ E L A
Sbjct: 198 DNAYAWIIQNGGIAAEDVYPYISGNGTTGVCHVAFLSKKVASISDWCDLKPEDESDLELA 257
Query: 260 LANQPLSVAIEASGRDFQFYSGGVYDG-HCGTQLDHGVAAVGYG--STRGLDYIIVKNSW 316
L QP++VAIEA FQFY+GGV CGT+LDHGV AVGYG + Y IVKNSW
Sbjct: 258 LVQQPVAVAIEADQSSFQFYNGGVLPAKKCGTKLDHGVLAVGYGYDKKHKMHYWIVKNSW 317
Query: 317 GPKWGEKGYIRMKRNTGKPE-GLCGINKMASYP 348
G +WG++GYIR+++ K + CGI K ASYP
Sbjct: 318 GAEWGDEGYIRLEKMPKKTKHSACGIAKAASYP 350
>gi|359483514|ref|XP_003632971.1| PREDICTED: LOW QUALITY PROTEIN: oryzain beta chain-like [Vitis
vinifera]
Length = 340
Score = 261 bits (666), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 139/311 (44%), Positives = 187/311 (60%), Gaps = 9/311 (2%)
Query: 44 LIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHID--ETNRKIKNYWLGLNEFADLRH 101
+ + E WM+++ + Y+ E+ RF +FKDN+ I +T + N LG+N AD+ H
Sbjct: 31 MYERHEQWMARYSRNYKDDAEEERRFXMFKDNVDFIQTFDTAGNMPNK-LGVNALADMTH 89
Query: 102 EEFKEM--FLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCW 159
EEF+ + P+L R + + F +++V +P ++DWRKK VTH+KNQ CG CW
Sbjct: 90 EEFRASGNTFKIPPNLGLRSETT--SFRHQNVTRIPSTMDWRKKRTVTHIKNQLQCGGCW 147
Query: 160 AFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNN-GCNGGLMDYAFQYIVSTGGLHKE 218
AFS VAA+EGI ++ T SLSEQEL+DCD +N GC GG MD AF++I+ GL+ E
Sbjct: 148 AFSAVAAMEGIAKLQTSKSISLSEQELVDCDIFGSNIGCEGGCMDDAFKFIIQNRGLNSE 207
Query: 219 EDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQF 278
Y Y EG C K S IN Y ++P+ SE +LLK +A+QP+SVAI+A G FQF
Sbjct: 208 ARYLYKGVEGHCNKKKESSRAARINDYENMPEFSEKALLKVVAHQPISVAIDAGGSAFQF 267
Query: 279 YSGGVYDGHCGTQLDHGVAAVGYG-STRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEG 337
Y G+ G LD+GV GYG S G + +VKNSWG WGE GY RM+R G
Sbjct: 268 YEIGIITXESGNDLDYGVTTDGYGRSADGKKHWLVKNSWGTDWGENGYTRMERGVKATTG 327
Query: 338 LCGINKMASYP 348
LCG ASYP
Sbjct: 328 LCGFTMQASYP 338
>gi|297818854|ref|XP_002877310.1| hypothetical protein ARALYDRAFT_484828 [Arabidopsis lyrata subsp.
lyrata]
gi|297323148|gb|EFH53569.1| hypothetical protein ARALYDRAFT_484828 [Arabidopsis lyrata subsp.
lyrata]
Length = 376
Score = 261 bits (666), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 147/311 (47%), Positives = 196/311 (63%), Gaps = 10/311 (3%)
Query: 47 LFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKI-KNYWLGLNEFADLRHEEFK 105
++E W+ + K Y L EK RF+IFKDNL+HI+E N ++Y GLN+F+DL +EF+
Sbjct: 40 IYERWLVEHGKNYNGLGEKERRFKIFKDNLKHIEEHNSDPNRSYDRGLNQFSDLTVDEFQ 99
Query: 106 EMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVT-HVKNQGSCGSCWAFSTV 164
+LG K + D + E + YK+ LP VDWR++GAV VK QG CGSCWAF+
Sbjct: 100 ASYLGGKIEKKSLSDVA-ERYQYKEGDILPDEVDWRERGAVVPRVKRQGDCGSCWAFAAT 158
Query: 165 AAVEGINQIVTGNLASLSEQELIDCDNTYNN-GCNGGLMDYAFQYIVSTGGLHKEEDYPY 223
AVEGINQI TG L SLSEQELIDCD +N GC GG +AF++I GG+ +EDY Y
Sbjct: 159 GAVEGINQITTGELLSLSEQELIDCDRGKDNFGCAGGGAVWAFEFIKENGGIVTDEDYGY 218
Query: 224 IMEE-GTCEMTKGE-SEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSG 281
++ C+ + + + VVTING+ VP N E SL KA++ QP+SV I A+ + Y
Sbjct: 219 TGDDTAACKAIEMKTTRVVTINGHEVVPVNDEMSLKKAVSYQPISVMISAA--NMSDYKS 276
Query: 282 GVYDGHCGTQL-DHGVAAVGYGSTRGL-DYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLC 339
GVY G C DH V VGYG++ DY +++NSWGP WGE GY+R++RN +P G C
Sbjct: 277 GVYKGPCSNLWGDHNVLIVGYGTSSDEGDYWLIRNSWGPGWGEGGYLRLQRNFNEPTGKC 336
Query: 340 GINKMASYPIK 350
+ YPIK
Sbjct: 337 AVAVAPVYPIK 347
>gi|326502440|dbj|BAJ95283.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 349
Score = 260 bits (665), Expect = 6e-67, Method: Compositional matrix adjust.
Identities = 146/323 (45%), Positives = 202/323 (62%), Gaps = 14/323 (4%)
Query: 36 EDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNR-KIKNYWLGLN 94
E +T + ++ E WM++ + Y + +EK R E+F+ N + ID N + + L N
Sbjct: 32 EAITVDSAMVSRHEKWMAEHGRTYANEEEKARRLEVFRANAKLIDSFNSAEDSTHRLATN 91
Query: 95 EFADLRHEEFKEMFLGLK--PDLARRKDQSHEDFSYKD--VVDLPKSVDWRKKGAVTHVK 150
FADL EEF+ GL+ P A F Y++ + D S+DWR GAVT VK
Sbjct: 92 RFADLTDEEFRAARTGLRRPPAAAAGAGSGAGGFRYENFSLADAAGSMDWRAMGAVTGVK 151
Query: 151 NQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNN--GCNGGLMDYAFQY 208
+QGSCG CWAFS VAAVEG+ +I TG L SLSEQ+L+DCD Y + GC GGLMD AF+Y
Sbjct: 152 DQGSCGCCWAFSAVAAVEGLTKIRTGRLVSLSEQQLVDCD-VYGDDEGCAGGLMDNAFEY 210
Query: 209 IVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVA 268
+++ GGL E YPY +G+C + + +I GY DVP N+E +L+ A+A+QP+SVA
Sbjct: 211 MINRGGLTTESSYPYRGTDGSCRRS---ASAASIRGYEDVPANNEAALMAAVAHQPVSVA 267
Query: 269 IEASGRDFQFYSGGVYDGH-CGTQLDHGVAAVGYGSTR-GLDYIIVKNSWGPKWGEKGYI 326
I F+FY GV G CGT+L+H + AVGYG+ G Y I+KNSWG WGE GY+
Sbjct: 268 INGGDSVFRFYDSGVLGGSGCGTELNHAITAVGYGTASDGTKYWIMKNSWGGSWGEGGYV 327
Query: 327 RMKRNTGKPEGLCGINKMASYPI 349
R++R + EG+CG+ ++ASYP+
Sbjct: 328 RIRRGV-RGEGVCGLAQLASYPV 349
>gi|6650705|gb|AAF21977.1|AF115280_1 thiolproteinase SmTP1 [Sarcocystis muris]
Length = 394
Score = 260 bits (665), Expect = 6e-67, Method: Compositional matrix adjust.
Identities = 140/311 (45%), Positives = 196/311 (63%), Gaps = 11/311 (3%)
Query: 48 FESWMSKFE----KVYESLDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEE 103
F+S +F+ K Y + +E+L+R+ IFK+NL +I N + +Y L +N+F DL EE
Sbjct: 85 FQSQFYQFQRDHNKFYATEEERLKRYAIFKNNLTYIHNHNMQGYSYVLKMNKFGDLTLEE 144
Query: 104 FKEMFLGLK-PDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFS 162
F++ +LG K PDL + + D+P VDWR++G VT VK+QG CGSCWAFS
Sbjct: 145 FRQRYLGYKKPDLRTPPREVDTTLESVEDNDIPTHVDWRQRGCVTSVKDQGDCGSCWAFS 204
Query: 163 TVAAVEGINQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIVSTGGLHKEEDY 221
A+EG+ TG L +LS+Q+L+DC N GC+GG M+ AF+Y+V GG+ E+Y
Sbjct: 205 ATGAMEGVYCAKTGKLVNLSQQQLVDCSRFLGNQGCDGGRMEEAFEYVVENGGICSGENY 264
Query: 222 PYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALA-NQPLSVAIEASGRDFQFYS 280
PY+ ++G C+ ++ S V TI GY VP+ SE S+ ALA P+SVAI+A+ FQFY
Sbjct: 265 PYMRKDGVCKSSQCTS-VATITGYRSVPRRSEKSMKTALALRSPVSVAIQANQAAFQFYY 323
Query: 281 GGVYDGHCGTQLDHGVAAVGYGS-TRGL-DYIIVKNSWGPKWGEKGYIRMKRNTGKPEGL 338
G++D CGT LDHGV VGY + T G DY I+KNSWG WG+ GY+ M + G P G
Sbjct: 324 DGIFDAPCGTNLDHGVLLVGYSAETAGQGDYWIMKNSWGAAWGKGGYMLMAMHKG-PAGQ 382
Query: 339 CGINKMASYPI 349
CG+ S+P+
Sbjct: 383 CGVLLDGSFPV 393
>gi|224460525|gb|ACN43674.1| cathepsin L [Paralichthys olivaceus]
Length = 334
Score = 260 bits (665), Expect = 7e-67, Method: Compositional matrix adjust.
Identities = 144/312 (46%), Positives = 188/312 (60%), Gaps = 14/312 (4%)
Query: 48 FESWMSKFEKVYESLDEKLERFEIFKDN----LRHIDETNRKIKNYWLGLNEFADLRHEE 103
F +W KF + Y S E+ +R +I+ N + H ++ Y LG+ +ADL HEE
Sbjct: 26 FHAWKLKFGRSYNSSSEEDKRMQIWLRNREIVMAHNAMADQGHSTYRLGMTFYADLEHEE 85
Query: 104 FKEMFLG--LKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAF 161
FK+ G L A + +LP+++DWR+ G VT VKNQGSCGSCW+F
Sbjct: 86 FKQTVFGVCLGSFNASKPRGGSSFLKMHRFYNLPQTIDWRQWGFVTPVKNQGSCGSCWSF 145
Query: 162 STVAAVEGINQIVTGNLASLSEQELIDCDNTYNN-GCNGGLMDYAFQYIVSTGGLHKEED 220
S+ A+EG N TG L SLSEQEL+DC Y N GCNGG MD AF+YIV+ GG+H E+
Sbjct: 146 SSTGALEGQNFRKTGRLVSLSEQELVDCSGNYGNYGCNGGWMDNAFRYIVNKGGIHTEDS 205
Query: 221 YPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVAIEASGRDFQFY 279
YPY + G C GE T GY+D+P +E +L +A+A P+SVAI AS + FQ Y
Sbjct: 206 YPYEGQVGQCRANYGEIG-ATCTGYYDIPSGNEHALKEAVATFGPVSVAIHASDQSFQLY 264
Query: 280 SGGVYDG-HC-GTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEG 337
GVY+ +C GT LDH V VGYG+ G DY +VKNSWGP WG++GYI+M RN
Sbjct: 265 HSGVYNNPYCSGTALDHAVLIVGYGTEYGQDYWLVKNSWGPAWGDQGYIKMSRNR---YN 321
Query: 338 LCGINKMASYPI 349
CGI AS+P+
Sbjct: 322 QCGIASAASFPL 333
>gi|18407678|ref|NP_566867.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|30315950|sp|Q9LXW3.1|CPR2_ARATH RecName: Full=Probable cysteine proteinase At3g43960; Flags:
Precursor
gi|7594557|emb|CAB88124.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|26452289|dbj|BAC43231.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|332644328|gb|AEE77849.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 376
Score = 260 bits (664), Expect = 8e-67, Method: Compositional matrix adjust.
Identities = 148/324 (45%), Positives = 201/324 (62%), Gaps = 10/324 (3%)
Query: 34 SPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKI-KNYWLG 92
+ E + +++ ++E W+ + K Y L EK RF+IFKDNL+ I+E N ++Y G
Sbjct: 27 ATESQRNEGEVLTMYEQWLVENGKNYNGLGEKERRFKIFKDNLKRIEEHNSDPNRSYERG 86
Query: 93 LNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVT-HVKN 151
LN+F+DL +EF+ +LG K + D + E + YK+ LP VDWR++GAV VK
Sbjct: 87 LNKFSDLTADEFQASYLGGKMEKKSLSDVA-ERYQYKEGDVLPDEVDWRERGAVVPRVKR 145
Query: 152 QGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNN-GCNGGLMDYAFQYIV 210
QG CGSCWAF+ AVEGINQI TG L SLSEQELIDCD +N GC GG +AF++I
Sbjct: 146 QGECGSCWAFAATGAVEGINQITTGELVSLSEQELIDCDRGNDNFGCAGGGAVWAFEFIK 205
Query: 211 STGGLHKEEDYPYIMEE-GTCEMTKGE-SEVVTINGYHDVPQNSEDSLLKALANQPLSVA 268
GG+ +E Y Y E+ C+ + + + VVTING+ VP N E SL KA+A QP+SV
Sbjct: 206 ENGGIVSDEVYGYTGEDTAACKAIEMKTTRVVTINGHEVVPVNDEMSLKKAVAYQPISVM 265
Query: 269 IEASGRDFQFYSGGVYDGHCGTQL-DHGVAAVGYGSTRGL-DYIIVKNSWGPKWGEKGYI 326
I A+ + Y GVY G C DH V VGYG++ DY +++NSWGP+WGE GY+
Sbjct: 266 ISAA--NMSDYKSGVYKGACSNLWGDHNVLIVGYGTSSDEGDYWLIRNSWGPEWGEGGYL 323
Query: 327 RMKRNTGKPEGLCGINKMASYPIK 350
R++RN +P G C + YPIK
Sbjct: 324 RLQRNFHEPTGKCAVAVAPVYPIK 347
>gi|21593501|gb|AAM65468.1| cysteine proteinase [Arabidopsis thaliana]
Length = 376
Score = 259 bits (663), Expect = 9e-67, Method: Compositional matrix adjust.
Identities = 148/314 (47%), Positives = 197/314 (62%), Gaps = 10/314 (3%)
Query: 44 LIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKI-KNYWLGLNEFADLRHE 102
++ ++E W+ + K Y L EK RF+IFKDNL+ I+E N ++Y GLN+F+DL +
Sbjct: 37 VLTMYEQWLVENGKNYNGLGEKERRFKIFKDNLKRIEEHNSDPNRSYERGLNKFSDLTAD 96
Query: 103 EFKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVT-HVKNQGSCGSCWAF 161
EF+ +LG K + D + E + YK+ LP VDWR++GAV VK QG CGSCWAF
Sbjct: 97 EFQASYLGGKMEKKSLSDVA-ERYQYKEGDVLPDEVDWRERGAVVPRVKRQGECGSCWAF 155
Query: 162 STVAAVEGINQIVTGNLASLSEQELIDCDNTYNN-GCNGGLMDYAFQYIVSTGGLHKEED 220
+ AVEGINQI TG L SLSEQELIDCD +N GC GG +AF++I GG+ +E
Sbjct: 156 AATGAVEGINQITTGELVSLSEQELIDCDRGNDNFGCAGGGAVWAFEFIKENGGIVSDEV 215
Query: 221 YPYIMEE-GTCEMTKGE-SEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQF 278
Y Y E+ C+ + + + VVTING+ VP N E SL KA+A QP+SV I A+ +
Sbjct: 216 YGYTGEDTAACKAIEMKTTRVVTINGHEVVPVNDEMSLKKAVAYQPISVMISAA--NMSD 273
Query: 279 YSGGVYDGHCGTQL-DHGVAAVGYG-STRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPE 336
Y GVY G C DH V VGYG S+ DY +++NSWGP+WGE GY+R++RN +P
Sbjct: 274 YKSGVYKGACSNLWGDHNVLIVGYGTSSDEGDYWLIRNSWGPEWGEGGYLRLQRNFHEPT 333
Query: 337 GLCGINKMASYPIK 350
G C + YPIK
Sbjct: 334 GKCAVAVAPVYPIK 347
>gi|391336140|ref|XP_003742440.1| PREDICTED: cathepsin L-like [Metaseiulus occidentalis]
Length = 330
Score = 259 bits (662), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 151/300 (50%), Positives = 183/300 (61%), Gaps = 16/300 (5%)
Query: 57 KVYESLDEKLERFEIFKDNLRHIDETNR---KIKNYWLGLNEFADLRHEEFKEMFLGLKP 113
K Y E+L R IF+DNL I+E NR + + LG+NEFAD+ + EF M LGL
Sbjct: 37 KSYRDGQEELIRRFIFEDNLHTIEEFNRVNASLAGFTLGVNEFADMTNTEFSNMLLGLG- 95
Query: 114 DLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQI 173
R K F V DLP VDW +KG VT VKNQG CGSCWAFST ++EG
Sbjct: 96 --GRNKIAGDSVFESSHVQDLPAEVDWTQKGYVTEVKNQGQCGSCWAFSTTGSLEGQVFK 153
Query: 174 VTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEM 232
TG L SLSEQ L+DC + N GCNGGLMD AF YI GG+ E YPY +GTC
Sbjct: 154 KTGKLVSLSEQNLVDCSTSEGNQGCNGGLMDQAFTYIKKNGGIDTEAAYPYTGSDGTCRF 213
Query: 233 TKGESEV-VTINGYHDVPQNSEDSLLKALAN-QPLSVAIEASGRDFQFYSGGVYDG-HC- 288
E++V T++G+ DV E++L +A+A P+SVAI+AS FQFY GGVY+ C
Sbjct: 214 L--ENKVGATVSGFVDVKSGDENALKEAVATVGPISVAIDASSIFFQFYRGGVYNPWFCS 271
Query: 289 GTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
T+LDHGV VGYG+ G DY +VKNSWG WG KGYI+M RN + CGI ASYP
Sbjct: 272 STELDHGVLVVGYGTEGGKDYWLVKNSWGSSWGLKGYIKMVRN---KKNRCGIATQASYP 328
>gi|125525815|gb|EAY73929.1| hypothetical protein OsI_01813 [Oryza sativa Indica Group]
Length = 336
Score = 259 bits (662), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 133/308 (43%), Positives = 183/308 (59%), Gaps = 11/308 (3%)
Query: 45 IDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIK-NYWLGLNEFADLRHEE 103
+ +FE WM+KF K Y+ EK RF IF+DN+ I ++ + +G+N+FADL ++E
Sbjct: 34 MQMFEEWMAKFGKTYKCHGEKEHRFGIFRDNVHFIRGYKPQVTYDSAVGINQFADLTNDE 93
Query: 104 FKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFST 163
F + G KP + + D + P +DWR +GAVT VK+QG+CGSCWAF+
Sbjct: 94 FVATYTGAKPPHPKEAPRP------VDPIWTPCCIDWRFRGAVTGVKDQGACGSCWAFAA 147
Query: 164 VAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPY 223
VAA+EG+ +I TG L LSEQEL+DCD T +NGC GG D AF+ + S GG+ E DY Y
Sbjct: 148 VAAIEGLTKIRTGQLTPLSEQELVDCD-TNSNGCGGGHTDRAFELVASKGGITAESDYRY 206
Query: 224 IMEEGTCEMTKGE-SEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGG 282
+G C + + +I GY VP N E L A+A QP++V I+ASG FQFY G
Sbjct: 207 EGFQGKCRVDDMLFNHAASIGGYRAVPPNDERQLATAVARQPVTVYIDASGPAFQFYKSG 266
Query: 283 VYDGHCGTQLDHGVAAVGY--GSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCG 340
V+ G CG +H V VGY G Y + KNSWG WG++GYI ++++ +P G CG
Sbjct: 267 VFPGPCGASSNHAVTLVGYCQDGASGKKYWVAKNSWGKTWGQQGYILLEKDVLQPHGTCG 326
Query: 341 INKMASYP 348
+ YP
Sbjct: 327 LAVSPFYP 334
>gi|156398078|ref|XP_001638016.1| predicted protein [Nematostella vectensis]
gi|156225133|gb|EDO45953.1| predicted protein [Nematostella vectensis]
Length = 326
Score = 259 bits (661), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 141/304 (46%), Positives = 184/304 (60%), Gaps = 11/304 (3%)
Query: 50 SWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFL 109
+W S K Y + E+ R I++ NL I N + +Y + +N DL +EF+ +L
Sbjct: 29 AWKSYHGKSYSDVHEERTRMAIWQQNLEKIKRHNAEDHSYKMAMNHLGDLTEDEFRYFYL 88
Query: 110 GLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEG 169
G++ K + + V +P SVDW +KG VT VKNQG CGSCWAFST +VEG
Sbjct: 89 GVRAHHNSTK-RGWATYMPPSNVKIPSSVDWSQKGYVTGVKNQGQCGSCWAFSTTGSVEG 147
Query: 170 INQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEG 228
+ TG+L SLSEQ LIDC +Y NNGC GGLMD AF+YI S GG+ E YPY+ ++G
Sbjct: 148 QHFRKTGSLVSLSEQNLIDCSGSYGNNGCQGGLMDNAFRYIESNGGIDTESSYPYLGQQG 207
Query: 229 TCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVAIEASGRDFQFYSGGVYDG- 286
+C + + GY D+PQ SE +L A+A P+SVA++AS +QFYS GVYD
Sbjct: 208 SCHFSSSHVG-ARVTGYQDIPQGSEQALQSAVATVGPVSVAVDAS--QWQFYSSGVYDNP 264
Query: 287 HC-GTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMA 345
+C TQLDHGV +GYG+ G DY +VKNSWG WG +GYI M RN CGI A
Sbjct: 265 YCSSTQLDHGVLVIGYGNYNGQDYWLVKNSWGYSWGVEGYIMMSRNKNNQ---CGIASSA 321
Query: 346 SYPI 349
SYP+
Sbjct: 322 SYPL 325
>gi|357114837|ref|XP_003559200.1| PREDICTED: fruit bromelain-like [Brachypodium distachyon]
Length = 371
Score = 259 bits (661), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 142/320 (44%), Positives = 183/320 (57%), Gaps = 19/320 (5%)
Query: 44 LIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIK-NYWLGLNEFADLRHE 102
++D F W + + Y +E+L RF++++ N+ +I+ TNR+ Y LG N+FADL E
Sbjct: 55 MLDRFVRWQAAHNRTYGDAEERLRRFQVYRANIEYIEATNRRGGLTYELGENQFADLTSE 114
Query: 103 EFKEMFLGLKPDLARRKDQSH-------EDFSYKDVVDL----PKSVDWRKKGAVTHVKN 151
EF M+ R D++ D ++ D DL P S DWR KGAVT KN
Sbjct: 115 EFLSMYASSYDAGDRADDEAALITTDVAGDGAWSDG-DLEALPPPSWDWRAKGAVTPPKN 173
Query: 152 QG-SCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIV 210
QG +C SCWAF TVA +EG+ I TG L SLSEQ+L+DCD Y+ GCN G F++++
Sbjct: 174 QGPTCSSCWAFVTVATIEGLTFIKTGKLISLSEQQLVDCD-MYDGGCNTGSYSRGFRWVL 232
Query: 211 STGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIE 270
GGL E +YPY G C K I G +P +E + KA+A QP+ VAIE
Sbjct: 233 ENGGLTTEAEYPYTAARGPCNRAKSAHHAAKITGQGRIPPQNELVMQKAVAGQPVGVAIE 292
Query: 271 ASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYG--STRGLDYIIVKNSWGPKWGEKGYIRM 328
G QFY GVY G CGT L H V VGYG G Y IVKNSWG WGE+G+IRM
Sbjct: 293 V-GSGMQFYKTGVYSGPCGTNLAHAVTVVGYGVDPASGAKYWIVKNSWGQAWGERGFIRM 351
Query: 329 KRNTGKPEGLCGINKMASYP 348
+R+ G P GLCGI +YP
Sbjct: 352 RRDVGGP-GLCGIALDVAYP 370
>gi|2224810|emb|CAB09698.1| cysteine proteinase [Hordeum vulgare subsp. vulgare]
Length = 349
Score = 258 bits (660), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 145/323 (44%), Positives = 201/323 (62%), Gaps = 14/323 (4%)
Query: 36 EDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNR-KIKNYWLGLN 94
E +T + ++ E WM++ + Y + +EK R E+F+ N + ID N + + L N
Sbjct: 32 EAITVDAAMVSRHEKWMAEHGRTYANEEEKARRLEVFRANAKLIDSFNSAEDSTHRLATN 91
Query: 95 EFADLRHEEFKEMFLGLK--PDLARRKDQSHEDFSYKD--VVDLPKSVDWRKKGAVTHVK 150
FADL EEF+ GL+ P A F Y++ + D S+DWR GAVT VK
Sbjct: 92 RFADLTDEEFRAARTGLRRPPAAAAGAGSGAGGFRYENFSLADAAGSMDWRAMGAVTGVK 151
Query: 151 NQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNN--GCNGGLMDYAFQY 208
+QGSCG CWAFS VAAVEG+ +I TG L SLSEQ+L+DCD Y + GC GGLMD AF+Y
Sbjct: 152 DQGSCGCCWAFSAVAAVEGLTKIRTGRLVSLSEQQLVDCD-VYGDDEGCAGGLMDNAFEY 210
Query: 209 IVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVA 268
+++ GGL E YPY +G+C + + +I GY DVP N+E +L+ A+A+QP+SVA
Sbjct: 211 MINRGGLTTESSYPYRGTDGSCRRS---ASAASIRGYEDVPANNEAALMAAVAHQPVSVA 267
Query: 269 IEASGRDFQFYSGGVYDGH-CGTQLDHGVAAVGYGSTR-GLDYIIVKNSWGPKWGEKGYI 326
I F+FY GV G CGT+L+H + A GYG+ G Y I+KNSWG WGE GY+
Sbjct: 268 INGGDSVFRFYDSGVLGGSGCGTELNHAITAAGYGTASDGTKYWIMKNSWGGSWGEGGYV 327
Query: 327 RMKRNTGKPEGLCGINKMASYPI 349
R++R + EG+CG+ ++ASYP+
Sbjct: 328 RIRRGV-RGEGVCGLAQLASYPV 349
>gi|413953665|gb|AFW86314.1| hypothetical protein ZEAMMB73_546353 [Zea mays]
Length = 233
Score = 258 bits (660), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 124/229 (54%), Positives = 157/229 (68%), Gaps = 6/229 (2%)
Query: 126 FSYKDVVD--LPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSE 183
F Y++V LP ++DWR KGAVT +K+QG CG CWAFS VAA EGI +I TG L SL+E
Sbjct: 7 FRYENVSADALPTTIDWRTKGAVTPIKDQGQCGCCWAFSAVAATEGIVKISTGKLVSLAE 66
Query: 184 QELIDCD-NTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTI 242
QEL+DCD + + GC GGLMD AF++I+ GGL E YPY +G C+ G + TI
Sbjct: 67 QELVDCDVHDEDQGCEGGLMDDAFKFIIKNGGLTTESSYPYTAADGKCK--SGSNSAATI 124
Query: 243 NGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYG 302
GY DVP N E +L+KA+ANQP+SVA++ FQFYSGGV G CGT LDHG+AA+GYG
Sbjct: 125 KGYEDVPANDEAALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYG 184
Query: 303 STR-GLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
T G Y ++KNSWG WGE GY+RM+++ G+CG+ SYP K
Sbjct: 185 KTSDGTKYWLMKNSWGTTWGENGYLRMEKDISDKRGMCGLAMEPSYPTK 233
>gi|125525812|gb|EAY73926.1| hypothetical protein OsI_01810 [Oryza sativa Indica Group]
Length = 319
Score = 258 bits (660), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 133/308 (43%), Positives = 183/308 (59%), Gaps = 11/308 (3%)
Query: 45 IDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIK-NYWLGLNEFADLRHEE 103
+ +FE WM+KF K Y+ EK RF IF+DN+ I ++ + +G+N+FADL ++E
Sbjct: 17 MQMFEEWMAKFGKTYKCHGEKEHRFGIFRDNVHFIRGYKPQVTYDSAVGINQFADLTNDE 76
Query: 104 FKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFST 163
F + G KP + + D + P +DWR +GAVT VK+QG+CGSCWAF+
Sbjct: 77 FVATYTGAKPPHPKEAPRP------VDPIWTPCCIDWRFRGAVTGVKDQGACGSCWAFAA 130
Query: 164 VAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPY 223
VAA+EG+ +I TG L LSEQEL+DCD T +NGC GG D AF+ + S GG+ E DY Y
Sbjct: 131 VAAIEGLTKIRTGQLTPLSEQELVDCD-TNSNGCGGGHTDRAFELVASKGGITAESDYRY 189
Query: 224 IMEEGTCEMTKGE-SEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGG 282
+G C + + +I GY VP N E L A+A QP++V I+ASG FQFY G
Sbjct: 190 EGFQGKCRVDDMLFNHAASIGGYRAVPPNDERQLATAVARQPVTVYIDASGPAFQFYKSG 249
Query: 283 VYDGHCGTQLDHGVAAVGY--GSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCG 340
V+ G CG +H V VGY G Y + KNSWG WG++GYI ++++ +P G CG
Sbjct: 250 VFPGPCGASSNHAVTLVGYCQDGASGKKYWVAKNSWGKTWGQQGYILLEKDVLQPHGTCG 309
Query: 341 INKMASYP 348
+ YP
Sbjct: 310 LAVSPFYP 317
>gi|38345188|emb|CAE03344.2| OSJNBb0005B05.11 [Oryza sativa Japonica Group]
gi|125589403|gb|EAZ29753.1| hypothetical protein OsJ_13812 [Oryza sativa Japonica Group]
Length = 323
Score = 258 bits (660), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 131/319 (41%), Positives = 188/319 (58%), Gaps = 24/319 (7%)
Query: 34 SPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKNYWLGL 93
+ +L+ + + E WM+++ ++Y+ EK RFE+FK N+ I+ N +WLG+
Sbjct: 23 AARELSDDAAMAARHERWMAQYGRMYKDDAEKARRFEVFKANVAFIESFNAGNHKFWLGV 82
Query: 94 NEFADLRHEEFKEMFL--GLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKN 151
N+FADL ++EF+ G P R + D LP ++DWR KG VT +K+
Sbjct: 83 NQFADLTNDEFRSTKTNKGFIPSTTRVPTGFRNENVNIDA--LPATMDWRTKGVVTPIKD 140
Query: 152 QGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCD-NTYNNGCNGGLMDYAFQYIV 210
QG CG CWAFS VAA+E EL+DCD + + GC GGLMD AF++I+
Sbjct: 141 QGQCGCCWAFSAVAAME----------------ELVDCDVHGEDQGCEGGLMDDAFKFII 184
Query: 211 STGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIE 270
GGL E +YPY + + + V +I GY DVP N+E +L+KA+ANQP+SVA++
Sbjct: 185 KNGGLTTESNYPYAAVDD--KFKSVSNSVASIKGYEDVPANNEAALMKAVANQPVSVAVD 242
Query: 271 ASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYG-STRGLDYIIVKNSWGPKWGEKGYIRMK 329
FQFY GGV G CGT LDHG+ A+GYG ++ G Y ++KNSWG WGE G++RM+
Sbjct: 243 GGDMTFQFYKGGVMTGSCGTDLDHGIVAIGYGKASDGTKYWLLKNSWGMTWGENGFLRME 302
Query: 330 RNTGKPEGLCGINKMASYP 348
++ G+CG+ SYP
Sbjct: 303 KDISDKRGMCGLAMEPSYP 321
>gi|53791858|dbj|BAD53944.1| putative cysteine protease [Oryza sativa Japonica Group]
Length = 335
Score = 258 bits (659), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 133/308 (43%), Positives = 183/308 (59%), Gaps = 11/308 (3%)
Query: 45 IDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIK-NYWLGLNEFADLRHEE 103
+ +FE WM+KF K Y+ EK RF IF+DN+ I ++ + +G+N+FADL ++E
Sbjct: 33 MQMFEEWMAKFGKTYKCHGEKEHRFGIFRDNVHFIRGYKPQVTYDSAVGINQFADLTNDE 92
Query: 104 FKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFST 163
F + G KP + + D + P +DWR +GAVT VK+QG+CGSCWAF+
Sbjct: 93 FVATYTGAKPPHPKEAPRP------VDPIWTPCCIDWRFRGAVTGVKDQGACGSCWAFAA 146
Query: 164 VAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPY 223
VAA+EG+ +I TG L LSEQEL+DCD T +NGC GG D AF+ + S GG+ E DY Y
Sbjct: 147 VAAIEGLTKIRTGQLTPLSEQELVDCD-TNSNGCGGGHTDRAFELVASKGGITAESDYRY 205
Query: 224 IMEEGTCEMTKGE-SEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGG 282
+G C + + +I GY VP N E L A+A QP++V I+ASG FQFY G
Sbjct: 206 EGFQGKCRVDDMLFNHAASIGGYRAVPPNDERQLATAVARQPVTVYIDASGPAFQFYKSG 265
Query: 283 VYDGHCGTQLDHGVAAVGY--GSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCG 340
V+ G CG +H V VGY G Y + KNSWG WG++GYI ++++ +P G CG
Sbjct: 266 VFPGPCGASSNHAVTLVGYCQDGASGKKYWLAKNSWGKTWGQQGYILLEKDIVQPHGTCG 325
Query: 341 INKMASYP 348
+ YP
Sbjct: 326 LAVSPFYP 333
>gi|156371477|ref|XP_001628790.1| predicted protein [Nematostella vectensis]
gi|156215775|gb|EDO36727.1| predicted protein [Nematostella vectensis]
Length = 330
Score = 258 bits (659), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 142/344 (41%), Positives = 190/344 (55%), Gaps = 21/344 (6%)
Query: 10 ILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERF 69
+L++ C+ F + S F F D+ +++W K Y ++ E+ R
Sbjct: 3 LLVAACLLFAVASGFVVKF-------------DEDEQQWQAWKLFHTKKYTTVTEEGARK 49
Query: 70 EIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYK 129
I++DNL+ I + N + ++ L +N DL +EF+ + G++ + + F
Sbjct: 50 AIWRDNLKKIQKHNAEGHSFTLAMNHLGDLTQDEFRYFYTGMRSHYSNYTKKQGSAFLAP 109
Query: 130 DVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDC 189
V +P +VDWRK+G VT VKNQG CGSCWAFST ++EG N TG L SLSEQ L+DC
Sbjct: 110 SHVQVPDTVDWRKEGYVTPVKNQGQCGSCWAFSTTGSLEGQNFKKTGKLVSLSEQNLVDC 169
Query: 190 DNTY-NNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDV 248
Y NNGC GGLMDYAF+YI GG+ EE YPY C K V G+ DV
Sbjct: 170 STAYGNNGCQGGLMDYAFKYIKENGGIDTEESYPYEARNDRCRFQKSNIGAVDT-GFVDV 228
Query: 249 PQNSEDSLLKALAN-QPLSVAIEASGRDFQFYSGGVYD--GHCGTQLDHGVAAVGYGSTR 305
E++L A P+SVAI+A FQFY GVY+ G T LDHGV VGYG+ +
Sbjct: 229 THGDEEALKTAAGTVGPISVAIDAGHMSFQFYHSGVYNNAGCSSTSLDHGVLVVGYGTYQ 288
Query: 306 GLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPI 349
G DY +VKNSWG +WG +GYI M RN CG+ ASYP+
Sbjct: 289 GSDYWLVKNSWGERWGMEGYIMMSRNKNNQ---CGVATQASYPL 329
>gi|15290195|dbj|BAB63884.1| putative cysteine protease [Oryza sativa Japonica Group]
gi|125525813|gb|EAY73927.1| hypothetical protein OsI_01811 [Oryza sativa Indica Group]
Length = 342
Score = 258 bits (659), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 133/308 (43%), Positives = 182/308 (59%), Gaps = 11/308 (3%)
Query: 45 IDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIK-NYWLGLNEFADLRHEE 103
+ +FE WM+KF K Y+ EK RF IF+DN+ I ++ + +G+N+FADL ++E
Sbjct: 40 MQMFEEWMAKFGKTYKCHGEKEHRFGIFRDNVHFIRGYKPQVTYDSAVGINQFADLTNDE 99
Query: 104 FKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFST 163
F + G KP + + D + P +DWR +GAVT VK+QG+CGSCWAF+
Sbjct: 100 FVATYTGAKPPHPKEAPRP------VDPIWTPCCIDWRFRGAVTGVKDQGACGSCWAFAA 153
Query: 164 VAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPY 223
VAA+EG+ +I TG L LSEQEL+DCD T +NGC GG D AF+ + S GG+ E DY Y
Sbjct: 154 VAAIEGLTKIRTGQLTPLSEQELVDCD-TNSNGCGGGHTDRAFELVASKGGITAESDYRY 212
Query: 224 IMEEGTCEMTKGE-SEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGG 282
+G C + + I GY VP N E L A+A QP++V I+ASG FQFY G
Sbjct: 213 EGFQGKCRVDDMLFNHAARIGGYRAVPPNDERQLATAVARQPVTVYIDASGPAFQFYKSG 272
Query: 283 VYDGHCGTQLDHGVAAVGY--GSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCG 340
V+ G CG +H V VGY G Y + KNSWG WG++GYI ++++ +P G CG
Sbjct: 273 VFPGPCGASSNHAVTLVGYCQDGASGKKYWVAKNSWGKTWGQQGYILLEKDVLQPHGTCG 332
Query: 341 INKMASYP 348
+ YP
Sbjct: 333 LAVSPFYP 340
>gi|125570286|gb|EAZ11801.1| hypothetical protein OsJ_01675 [Oryza sativa Japonica Group]
Length = 319
Score = 258 bits (659), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 133/308 (43%), Positives = 183/308 (59%), Gaps = 11/308 (3%)
Query: 45 IDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIK-NYWLGLNEFADLRHEE 103
+ +FE WM+KF K Y+ EK RF IF+DN+ I ++ + +G+N+FADL ++E
Sbjct: 17 MQMFEEWMAKFGKTYKCHGEKEHRFGIFRDNVHFIRGYKPQVTYDSAVGINQFADLTNDE 76
Query: 104 FKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFST 163
F + G KP + + D + P +DWR +GAVT VK+QG+CGSCWAF+
Sbjct: 77 FVATYTGAKPPHPKEAPRP------VDPIWTPCCIDWRFRGAVTGVKDQGACGSCWAFAA 130
Query: 164 VAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPY 223
VAA+EG+ +I TG L LSEQEL+DCD T +NGC GG D AF+ + S GG+ E DY Y
Sbjct: 131 VAAIEGLTKIRTGQLTPLSEQELVDCD-TNSNGCGGGHTDRAFELVASKGGITAESDYRY 189
Query: 224 IMEEGTCEMTKGE-SEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGG 282
+G C + + +I GY VP N E L A+A QP++V I+ASG FQFY G
Sbjct: 190 EGFQGKCRVDDMLFNHAASIGGYRAVPPNDERQLATAVARQPVTVYIDASGPAFQFYKSG 249
Query: 283 VYDGHCGTQLDHGVAAVGY--GSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCG 340
V+ G CG +H V VGY G Y + KNSWG WG++GYI ++++ +P G CG
Sbjct: 250 VFPGPCGASSNHAVTLVGYCQDGASGKKYWLAKNSWGKTWGQQGYILLEKDIVQPHGTCG 309
Query: 341 INKMASYP 348
+ YP
Sbjct: 310 LAVSPFYP 317
>gi|359806985|ref|NP_001241331.1| uncharacterized protein LOC100811719 precursor [Glycine max]
gi|255645733|gb|ACU23360.1| unknown [Glycine max]
Length = 362
Score = 258 bits (658), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 155/362 (42%), Positives = 218/362 (60%), Gaps = 29/362 (8%)
Query: 8 KTILISFCISFFIRSSFARDFSIVGYSP--EDLTSNDKLIDLFESWMSKFEKVYESLDEK 65
+T L F F + SF S+ S E S +++ LF++W + ++ Y + +EK
Sbjct: 6 RTKLFPF---FIVLVSFTCSLSLAMSSNQLEQFASEEEVFQLFQAWQKEHKREYGNQEEK 62
Query: 66 LERFEIFKDNLRHIDETNRKIKN----YWLGLNEFADLRHEEF-----KEMFLGLKPDLA 116
+RF+IF+ NLR+I+E N K K+ + LGLN+FAD+ EEF KE+ + +
Sbjct: 63 AKRFQIFQSNLRYINEMNAKRKSPTTQHRLGLNKFADMSPEEFMKTYLKEIEMPYSNLES 122
Query: 117 RRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTG 176
R+K Q +D D +LP SVDWR KGAVT V++QG C S WAFS A+EGIN+IVTG
Sbjct: 123 RKKLQKGDD---ADCDNLPHSVDWRDKGAVTEVRDQGKCQSHWAFSVTGAIEGINKIVTG 179
Query: 177 NLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGE 236
NL SLS Q+++DCD ++GC GG AF Y++ GG+ E YPY + GTC+
Sbjct: 180 NLVSLSVQQVVDCDPA-SHGCAGGFYFNAFGYVIENGGIDTEAHYPYTAQNGTCKANA-- 236
Query: 237 SEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDG-HC---GTQL 292
++VV+I+ V E++LL ++ QP+SV+I+A+G QFY+GGVY G +C T+
Sbjct: 237 NKVVSIDNLL-VVVGPEEALLCRVSKQPVSVSIDATG--LQFYAGGVYGGENCSKNSTKA 293
Query: 293 DHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGK--PEGLCGINKMASYPIK 350
VGYGS G DY IVKNSWG WGE+GY+ +KRN P G+C IN +PI
Sbjct: 294 TLVCLIVGYGSVGGEDYWIVKNSWGKDWGEEGYLLIKRNVSDEWPYGVCAINAAPGFPII 353
Query: 351 KK 352
K+
Sbjct: 354 KE 355
>gi|238481789|gb|ACR43934.1| cathepsin L-like cysteine proteinase [Haliotis diversicolor
supertexta]
Length = 347
Score = 258 bits (658), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 137/287 (47%), Positives = 186/287 (64%), Gaps = 13/287 (4%)
Query: 71 IFKDNLRHIDETNRKI----KNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDF 126
IFK NL++I+E N+K K+Y+LG+N+FAD+++EEF+ M+ GL+ D ++ +
Sbjct: 65 IFKQNLQYIEEHNKKFSLGQKSYYLGINQFADMKNEEFR-MYNGLRRDYNYSREVQCSNH 123
Query: 127 SYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQEL 186
+ + P VDWRKKG VT VKNQG CGSCW+FST ++EG + +G L SLSEQ+L
Sbjct: 124 LTPEYLVAPDEVDWRKKGYVTAVKNQGQCGSCWSFSTTGSLEGQHFHKSGKLVSLSEQQL 183
Query: 187 IDCDNTY-NNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGY 245
+DC + N GCNGGLMD AF+YI++ GG+ EE+YPY + C K E T +G
Sbjct: 184 VDCSGKFGNEGCNGGLMDQAFEYIITNGGIETEEEYPYDARQERCHFKKSEV-AATASGC 242
Query: 246 HDVPQNSEDSLLKALAN-QPLSVAIEASGRDFQFYSGGVYD-GHC-GTQLDHGVAAVGYG 302
DV E L ++A P+S+AI+AS + FQ YSGGVYD C T+LDHGV VGYG
Sbjct: 243 VDVKSGDETDLKNSVAEVGPVSIAIDASHQSFQLYSGGVYDEPKCSSTELDHGVLVVGYG 302
Query: 303 STRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPI 349
+ G DY +VKNSWG WG +GY++M RN + CG+ ASYP+
Sbjct: 303 TDDGQDYWLVKNSWGTTWGLEGYVKMSRNQ---DNQCGVATQASYPL 346
>gi|237844793|ref|XP_002371694.1| cathepsin L-like thiolproteinase, putative [Toxoplasma gondii ME49]
gi|50313163|gb|AAT74529.1| toxopain-2 [Toxoplasma gondii]
gi|89242977|gb|ABD64744.1| cathepsin L [Toxoplasma gondii]
gi|95007485|emb|CAJ20707.1| toxopain-2 [Toxoplasma gondii RH]
gi|211969358|gb|EEB04554.1| cathepsin L-like thiolproteinase, putative [Toxoplasma gondii ME49]
gi|221480879|gb|EEE19300.1| cysteine protease, putative [Toxoplasma gondii GT1]
gi|221501596|gb|EEE27366.1| cysteine protease, putative [Toxoplasma gondii VEG]
Length = 422
Score = 258 bits (658), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 139/312 (44%), Positives = 191/312 (61%), Gaps = 13/312 (4%)
Query: 46 DLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFK 105
D F S+ + + K Y + +EK R+ IFK+NL +I N++ +Y L +N F DL +EF+
Sbjct: 115 DAFSSFQAMYAKSYATEEEKQRRYAIFKNNLVYIHTHNQQGYSYSLKMNHFGDLSRDEFR 174
Query: 106 EMFLGLKPDLARRKDQSHEDFSYKDVV-----DLPKSVDWRKKGAVTHVKNQGSCGSCWA 160
+LG K R +SH +++ +LP VDWR +G VT VK+Q CGSCWA
Sbjct: 175 RKYLGFKKS---RNLKSHHLGVATELLNVLPSELPAGVDWRSRGCVTPVKDQRDCGSCWA 231
Query: 161 FSTVAAVEGINQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIVSTGGLHKEE 219
FST A+EG + TG L SLSEQEL+DC N C+GG M+ AFQY++ +GG+ E+
Sbjct: 232 FSTTGALEGAHCAKTGKLVSLSEQELMDCSRAEGNQSCSGGEMNDAFQYVLDSGGICSED 291
Query: 220 DYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFY 279
YPY+ + C E +VV I G+ DVP+ SE ++ ALA P+S+AIEA FQFY
Sbjct: 292 AYPYLARDEECRAQSCE-KVVKILGFKDVPRRSEAAMKAALAKSPVSIAIEADQMPFQFY 350
Query: 280 SGGVYDGHCGTQLDHGVAAVGYGSTR--GLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEG 337
GV+D CGT LDHGV VGYG+ + D+ I+KNSWG WG GY+ M + G+ EG
Sbjct: 351 HEGVFDASCGTDLDHGVLLVGYGTDKESKKDFWIMKNSWGTGWGRDGYMYMAMHKGE-EG 409
Query: 338 LCGINKMASYPI 349
CG+ AS+P+
Sbjct: 410 QCGLLLDASFPV 421
>gi|348687948|gb|EGZ27762.1| papain-like cysteine protease C1 [Phytophthora sojae]
Length = 533
Score = 258 bits (658), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 144/313 (46%), Positives = 180/313 (57%), Gaps = 22/313 (7%)
Query: 48 FESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKNYW----LGLNEFADLRHEE 103
F +WMS + E R E + N +I E N +N W LG N F+ + +E
Sbjct: 28 FSAWMSAHGVTFSDALEFARRLENYIANDMYILEHN--AENAWTGVKLGHNAFSHMSFDE 85
Query: 104 FKEMFLGL-------KPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCG 156
FK GL + LA R D D V++P +VDW KG VT VKNQG CG
Sbjct: 86 FKFKMTGLVLPEGYLEQRLASRVDGLWSD------VEVPSAVDWVDKGGVTPVKNQGMCG 139
Query: 157 SCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLH 216
SCWAFST AVEG + +G L SLSEQEL+DCD+ + GCNGGLMD+AFQ+I GG+
Sbjct: 140 SCWAFSTTGAVEGATFVSSGKLLSLSEQELVDCDHNGDMGCNGGLMDHAFQWIEDHGGIC 199
Query: 217 KEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDF 276
E+DY Y + C VV + G+ DV E +L A+A QP+SVAIEA + F
Sbjct: 200 SEDDYEYKAKAQVCRKC---DSVVKVTGFQDVNPQDEHALKVAVAQQPVSVAIEADQKAF 256
Query: 277 QFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPE 336
QFY GV++ CGT+LDHGV AVGYG+ G + VKNSWG WGE+GYIR+ R P
Sbjct: 257 QFYKSGVFNLTCGTRLDHGVLAVGYGNDNGQKFWKVKNSWGASWGEQGYIRLAREENGPA 316
Query: 337 GLCGINKMASYPI 349
G CGI + SYP
Sbjct: 317 GQCGIASVPSYPF 329
>gi|222641485|gb|EEE69617.1| hypothetical protein OsJ_29194 [Oryza sativa Japonica Group]
Length = 360
Score = 257 bits (657), Expect = 5e-66, Method: Compositional matrix adjust.
Identities = 134/303 (44%), Positives = 172/303 (56%), Gaps = 12/303 (3%)
Query: 44 LIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETN-RKIKNYWLGLNEFADLRHE 102
++D F +W + Y S +E L+RF++++ N ID N R Y L NEFADL E
Sbjct: 47 MMDRFRAWQGAHNRSYPSAEEALQRFDVYRRNAEFIDAVNLRGDLTYQLAENEFADLTEE 106
Query: 103 EFKEMFLGLKP------DLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGS-C 155
EF + G D D S+ VD+P SVDWR +GAV K+Q S C
Sbjct: 107 EFLATYTGYYAGDGPVDDSVITTGAGDVDASFSYRVDVPASVDWRAQGAVVPPKSQTSTC 166
Query: 156 GSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGL 215
SCWAF T A +E +N I TG L SLSEQ+L+DCD +Y+ GCN G A++++V GGL
Sbjct: 167 SSCWAFVTAATIESLNMIKTGKLVSLSEQQLVDCD-SYDGGCNLGSYGRAYKWVVENGGL 225
Query: 216 HKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRD 275
E DYPY G C K I G+ VP +E +L A+A QP++VAIE G
Sbjct: 226 TTEADYPYTARRGPCNRAKSAHHAAKITGFGKVPPRNEAALQAAVARQPVAVAIEV-GSG 284
Query: 276 FQFYSGGVYDGHCGTQLDHGVAAVGYGS--TRGLDYIIVKNSWGPKWGEKGYIRMKRNTG 333
QFY GGVY G CGT+L H V VGYG+ + G Y +KNSWG WGE+GYIR+ R+ G
Sbjct: 285 MQFYKGGVYTGPCGTRLAHAVTVVGYGTDASSGAKYWTIKNSWGQSWGERGYIRILRDVG 344
Query: 334 KPE 336
P
Sbjct: 345 GPR 347
>gi|310942958|pdb|3P5U|A Chain A, Actinidin From Actinidia Arguta Planch (Sarusashi)
gi|310942959|pdb|3P5V|A Chain A, Actinidin From Actinidia Arguta Planch (Sarusashi)
gi|310942961|pdb|3P5X|A Chain A, Actinidin From Actinidia Arguta Planch (Sarusashi)
Length = 220
Score = 257 bits (657), Expect = 5e-66, Method: Compositional matrix adjust.
Identities = 123/218 (56%), Positives = 159/218 (72%), Gaps = 2/218 (0%)
Query: 134 LPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTY 193
LP VDWR GAV +K+QG CGS WAFST+AAVEGIN+I TG+L SLSEQEL+DC T
Sbjct: 1 LPDYVDWRSSGAVVDIKDQGQCGSXWAFSTIAAVEGINKIATGDLISLSEQELVDCGRTQ 60
Query: 194 NN-GCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNS 252
N GC+GG M FQ+I++ GG++ E +YPY EEG C + + + V+I+ Y +VP N+
Sbjct: 61 NTRGCDGGFMTDGFQFIINNGGINTEANYPYTAEEGQCNLDLQQEKYVSIDTYENVPYNN 120
Query: 253 EDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIV 312
E +L A+A QP+SVA+EA+G +FQ YS G++ G CGT +DH V VGYG+ G+DY IV
Sbjct: 121 EWALQTAVAYQPVSVALEAAGYNFQHYSSGIFTGPCGTAVDHAVTIVGYGTEGGIDYWIV 180
Query: 313 KNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
KNSWG WGE+GY+R++RN G G CGI K ASYP+K
Sbjct: 181 KNSWGTTWGEEGYMRIQRNVGGV-GQCGIAKKASYPVK 217
>gi|164472556|gb|ABY58967.1| cathepsin L [Toxoplasma gondii]
Length = 421
Score = 257 bits (656), Expect = 6e-66, Method: Compositional matrix adjust.
Identities = 139/312 (44%), Positives = 191/312 (61%), Gaps = 13/312 (4%)
Query: 46 DLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFK 105
D F S+ + + K Y + +EK R+ IFK+NL +I N++ +Y L +N F DL +EF+
Sbjct: 114 DAFSSFQAMYAKSYATEEEKQRRYAIFKNNLVYIHTHNQQGYSYSLKMNHFGDLSRDEFR 173
Query: 106 EMFLGLKPDLARRKDQSHEDFSYKDVV-----DLPKSVDWRKKGAVTHVKNQGSCGSCWA 160
+LG K R +SH +++ +LP VDWR +G VT VK+Q CGSCWA
Sbjct: 174 RKYLGFKKS---RNLKSHHLGVATELLNVLPSELPAGVDWRSRGCVTPVKDQRDCGSCWA 230
Query: 161 FSTVAAVEGINQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIVSTGGLHKEE 219
FST A+EG + TG L SLSEQEL+DC N C+GG M+ AFQY++ +GG+ E+
Sbjct: 231 FSTTGALEGAHCAKTGKLVSLSEQELMDCSRAEGNQSCSGGEMNDAFQYVLDSGGICSED 290
Query: 220 DYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFY 279
YPY+ + C E +VV I G+ DVP+ SE ++ ALA P+S+AIEA FQFY
Sbjct: 291 AYPYLARDEECRAQSCE-KVVKILGFKDVPRRSEAAMKAALAKSPVSIAIEADQMPFQFY 349
Query: 280 SGGVYDGHCGTQLDHGVAAVGYGSTR--GLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEG 337
GV+D CGT LDHGV VGYG+ + D+ I+KNSWG WG GY+ M + G+ EG
Sbjct: 350 HEGVFDASCGTDLDHGVLLVGYGTDKESKKDFWIMKNSWGTGWGRDGYMYMAMHKGE-EG 408
Query: 338 LCGINKMASYPI 349
CG+ AS+P+
Sbjct: 409 QCGLLLDASFPV 420
>gi|1085731|pir||S46476 cysteine proteinase (EC 3.4.22.-) III - mountain papaya
gi|926847|gb|AAB32657.1| cysteine proteinase CC-III [Carica candamarcensis=mountain papaya,
Hook, latex, Peptide, 214 aa]
Length = 214
Score = 257 bits (656), Expect = 6e-66, Method: Compositional matrix adjust.
Identities = 121/214 (56%), Positives = 157/214 (73%), Gaps = 6/214 (2%)
Query: 135 PKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYN 194
P+S+DWRKKGAVT VKNQGSCGSCWAFST+A VEGIN+IV GNL SLSEQEL+DCD +
Sbjct: 2 PESIDWRKKGAVTPVKNQGSCGSCWAFSTIATVEGINKIVHGNLTSLSEQELVDCDRR-S 60
Query: 195 NGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSED 254
+GC GG + +Y+V G+H E++YPY ++ C + +V I+GY VP N E
Sbjct: 61 HGCKGGYQTTSLKYVVDH-GVHTEKEYPYEEKQYKCRAKDKKPPIVKISGYKKVPSNDEI 119
Query: 255 SLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKN 314
SL+KA+A QP+SV +E+ G+ FQFY G++ G CGT++DH V AVGYG DYI++KN
Sbjct: 120 SLIKAIAKQPVSVLVESKGKAFQFYKKGIFGGPCGTKVDHAVTAVGYGK----DYILIKN 175
Query: 315 SWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
SWGP WGE GYI++KR +G EG+CGI K + +P
Sbjct: 176 SWGPXWGEXGYIKIKRASGHCEGICGIYKSSYFP 209
>gi|307175095|gb|EFN65237.1| Cathepsin L [Camponotus floridanus]
Length = 372
Score = 256 bits (654), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 152/305 (49%), Positives = 184/305 (60%), Gaps = 15/305 (4%)
Query: 56 EKVYESLDEKLERFEIFKDNLRHIDETNRKIK----NYWLGLNEFADLRHEEFKEMFLGL 111
+KVY+S E+ R +IF DN R I E NRK + NY LG+N++ D+ H E G
Sbjct: 71 KKVYKSPIEEGYRMKIFLDNKRKIVEHNRKYEMKEVNYKLGMNKYGDMLHHELINTLNGF 130
Query: 112 KPDLARRKDQ-SHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGI 170
+ ++Q F V+LPKSVDWRKKGAVT +K+QG CGSCWAFS+ A+EG
Sbjct: 131 NKSVTVSEEQLIGATFIEPANVELPKSVDWRKKGAVTAIKDQGQCGSCWAFSSTGALEGQ 190
Query: 171 NQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGT 229
+ +G L SLSEQ LIDC Y NNGCNGGLMDYAF+YI GL E+ YPY E
Sbjct: 191 HFRQSGVLVSLSEQNLIDCSGKYGNNGCNGGLMDYAFRYIKENKGLDTEKSYPYEAENDQ 250
Query: 230 CEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVAIEASGRDFQFYSGGV-YDGH 287
C S + G+ D+P+ ED L A+A P+SVAI+AS F FYS GV Y+
Sbjct: 251 CRYNPKNSGASDV-GFVDIPEGDEDKLKAAVATIGPISVAIDASHESFHFYSEGVYYEPE 309
Query: 288 CG-TQLDHGVAAVGYG--STRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKM 344
C LDHGV VGYG S G DY +VKNSWG WGEKGYI+M RN E CGI
Sbjct: 310 CSPANLDHGVLIVGYGTDSGTGEDYWLVKNSWGETWGEKGYIKMARNK---ENHCGIASS 366
Query: 345 ASYPI 349
ASYP+
Sbjct: 367 ASYPL 371
>gi|125592009|gb|EAZ32359.1| hypothetical protein OsJ_16569 [Oryza sativa Japonica Group]
Length = 480
Score = 256 bits (654), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 119/196 (60%), Positives = 148/196 (75%), Gaps = 1/196 (0%)
Query: 156 GSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCD-NTYNNGCNGGLMDYAFQYIVSTGG 214
GSCWAFS V+ VE INQ+VTG + +LSEQEL++C N N+GCNGGLMD AF +I+ GG
Sbjct: 177 GSCWAFSAVSTVESINQLVTGEMITLSEQELVECSTNGQNSGCNGGLMDDAFDFIIKNGG 236
Query: 215 LHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGR 274
+ E+DYPY +G C++ + ++VV+I+G+ DVPQN E SL KA+A+QP+SVAIEA GR
Sbjct: 237 IDTEDDYPYKAVDGKCDINRENAKVVSIDGFEDVPQNDEKSLQKAVAHQPVSVAIEAGGR 296
Query: 275 DFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGK 334
+FQ Y GV+ G CGT LDHGV AVGYG+ G DY IV+NSWGPKWGE GY+RM+RN
Sbjct: 297 EFQLYHSGVFSGRCGTSLDHGVVAVGYGTDNGKDYWIVRNSWGPKWGESGYVRMERNINV 356
Query: 335 PEGLCGINKMASYPIK 350
G CGI MASYP K
Sbjct: 357 TTGKCGIAMMASYPTK 372
>gi|7239343|gb|AAF43193.1|AF228731_1 cathepsin L [Stylonychia lemnae]
Length = 340
Score = 256 bits (654), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 143/313 (45%), Positives = 189/313 (60%), Gaps = 11/313 (3%)
Query: 39 TSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKI--KNYWLGLNEF 96
T++ ID F +MS+F K Y+S +E R + +K N+ I+ N + ++ LG N
Sbjct: 34 TADQDHID-FVHFMSRFSKAYKSKEEFEMRLQQYKSNIAFINNHNSQNDGTSFTLGPNHL 92
Query: 97 ADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCG 156
AD H+E+K+M LG KP R E +S ++ D+P+S+DWR+KGAV VK+QG CG
Sbjct: 93 ADYTHDEYKKM-LGYKP----RNKTGKEVYSTPNLKDIPESIDWREKGAVNAVKDQGQCG 147
Query: 157 SCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLH 216
SCWAFST+A++E I TG L SLSEQ+L+DC N GCNGG M A YI S GG+
Sbjct: 148 SCWAFSTIASLESRYFIETGKLQSLSEQQLVDCSKNGNEGCNGGDMGLAMDYIASAGGVE 207
Query: 217 KEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDF 276
E+DYPY+ ++ TC + EV T G+ ++ +L A+A P+SVAIEA F
Sbjct: 208 TEKDYPYVGKDQTCAF-EASKEVATDKGHINIVPGKFATLQAAIAEGPVSVAIEADSLFF 266
Query: 277 QFYSGGVYDGH-CGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKP 335
QFY G++D CGT LDHGVAAVGYG G Y IV+NSW WG KGYI + N G
Sbjct: 267 QFYRSGIFDSSWCGTNLDHGVAAVGYGVDNGKQYYIVRNSWSDSWGLKGYINIIAN-GDG 325
Query: 336 EGLCGINKMASYP 348
G+CGI P
Sbjct: 326 NGMCGIQMEPVVP 338
>gi|449469176|ref|XP_004152297.1| PREDICTED: vignain-like [Cucumis sativus]
Length = 340
Score = 256 bits (654), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 140/347 (40%), Positives = 206/347 (59%), Gaps = 17/347 (4%)
Query: 11 LISFCISFFIRSSFARDFSIVGYSPE--DLTSNDKLIDLFESWMSKFEKVYESLDEKLER 68
++ F I F + +FA G+ E D S L+ L++ W S ++ + E +R
Sbjct: 3 MMKFLIVFVVLIAFASHLC-EGFDLERKDFESEKSLMQLYKRW-SSHHRISRNAHEMHKR 60
Query: 69 FEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMF---LGLKPDLARRKDQSHED 125
F+IF+DN + + + N K+ L LN+FADL +EF M+ + +L +
Sbjct: 61 FKIFQDNAKRVFKVNHMGKSLKLRLNQFADLSDDEFSMMYGSNITHYNNLHAKAGGRVGG 120
Query: 126 FSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQE 185
F Y+ +++P S+DWR+KGAV +KNQG C VAAVE I+QI T L SLSEQE
Sbjct: 121 FMYERAMNIPFSIDWREKGAVNAIKNQGLC-------AVAAVESIHQIKTNELVSLSEQE 173
Query: 186 LIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGY 245
++DCD GC GG D AF++I+ GG+ EE+YPY G C SE VTI+GY
Sbjct: 174 VVDCDYKVG-GCRGGNYDSAFEFIMQNGGITIEENYPYFAGNGYCRRRGPNSERVTIDGY 232
Query: 246 HDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVY--DGHCGTQLDHGVAAVGYGS 303
VPQN+E +L+KA+A+QP++V++ +SG DF+FY G+ CG ++DH V VGYGS
Sbjct: 233 ECVPQNNEYALMKAVAHQPVAVSVASSGSDFRFYGEGMLREGSFCGYRIDHTVVVVGYGS 292
Query: 304 TRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
DY I++N +G +WG GY++M+R T P+G+CG+ S+P+K
Sbjct: 293 DEEGDYWIIRNQYGTQWGMNGYMKMQRGTRNPQGVCGMAMQPSFPVK 339
>gi|46576360|sp|P60994.1|ERVB_TABDI RecName: Full=Ervatamin-B; Short=ERV-B
gi|30749291|pdb|1IWD|A Chain A, Proposed Amino Acid Sequence And The 1.63 Angstrom X-ray
Crystal Structure Of A Plant Cysteine Protease Ervatamin
B: Insight Into The Structural Basis Of Its Stability
And Substrate Specificity
Length = 215
Score = 256 bits (653), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 120/217 (55%), Positives = 156/217 (71%), Gaps = 3/217 (1%)
Query: 134 LPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTY 193
LP VDWR KGAV +KNQ CGSCWAFS VAAVE IN+I TG L SLSEQEL+DCD T
Sbjct: 1 LPSFVDWRSKGAVNSIKNQKQCGSCWAFSAVAAVESINKIRTGQLISLSEQELVDCD-TA 59
Query: 194 NNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSE 253
++GCNGG M+ AFQYI++ GG+ +++YPY +G+C+ + VV+ING+ V +N+E
Sbjct: 60 SHGCNGGWMNNAFQYIITNGGIDTQQNYPYSAVQGSCKPYR--LRVVSINGFQRVTRNNE 117
Query: 254 DSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVK 313
+L A+A+QP+SV +EA+G FQ YS G++ G CGT +HGV VGYG+ G +Y IV+
Sbjct: 118 SALQSAVASQPVSVTVEAAGAPFQHYSSGIFTGPCGTAQNHGVVIVGYGTQSGKNYWIVR 177
Query: 314 NSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
NSWG WG +GYI M+RN GLCGI ++ SYP K
Sbjct: 178 NSWGQNWGNQGYIWMERNVASSAGLCGIAQLPSYPTK 214
>gi|116666824|pdb|2BDZ|A Chain A, Mexicain From Jacaratia Mexicana
gi|116666825|pdb|2BDZ|B Chain B, Mexicain From Jacaratia Mexicana
gi|116666826|pdb|2BDZ|C Chain C, Mexicain From Jacaratia Mexicana
gi|116666827|pdb|2BDZ|D Chain D, Mexicain From Jacaratia Mexicana
Length = 214
Score = 255 bits (652), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 120/216 (55%), Positives = 156/216 (72%), Gaps = 6/216 (2%)
Query: 135 PKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYN 194
P+S+DWR+KGAVT VKNQ CGSCWAFSTVA +EGIN+I+TG L SLSEQEL+DC+ +
Sbjct: 2 PESIDWREKGAVTPVKNQNPCGSCWAFSTVATIEGINKIITGQLISLSEQELLDCERR-S 60
Query: 195 NGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSED 254
+GC+GG + QY+V G +H E +YPY ++G C + V I GY VP N E
Sbjct: 61 HGCDGGYQTTSLQYVVDNG-VHTEREYPYEKKQGRCRAKDKKGPKVYITGYKYVPANDEI 119
Query: 255 SLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKN 314
SL++A+ANQP+SV ++ GR FQFY GG+Y+G CGT DH V AVGYG T Y+++KN
Sbjct: 120 SLIQAIANQPVSVVTDSRGRGFQFYKGGIYEGPCGTNTDHAVTAVGYGKT----YLLLKN 175
Query: 315 SWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
SWGP WGEKGYIR+KR +G+ +G CG+ + +PIK
Sbjct: 176 SWGPNWGEKGYIRIKRASGRSKGTCGVYTSSFFPIK 211
>gi|262410743|gb|ACY66807.1| cathepsin L [Aphis gossypii]
Length = 341
Score = 255 bits (652), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 151/335 (45%), Positives = 202/335 (60%), Gaps = 19/335 (5%)
Query: 28 FSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIK 87
F+I S +L N+ + + + + ++F+K+YE + E+ R +++ DN I N+ +
Sbjct: 12 FAISSVSSINL--NEVIEEEWSLFKAQFKKIYEDVKEEAFRKKVYLDNKLKIARHNKLYE 69
Query: 88 N----YWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHED----FSYKDVVDLPKSVD 139
Y L +N F DL E+K+M G KP LA +D F + V +PK++D
Sbjct: 70 TGEETYALEMNHFGDLMQHEYKKMMNGFKPSLAGGDKNFTDDDAVTFLKSENVVVPKAID 129
Query: 140 WRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTY-NNGCN 198
WRKKG VT VKNQG CGSCW+FS ++EG + TG L SLSEQ LIDC Y NNGC
Sbjct: 130 WRKKGYVTPVKNQGQCGSCWSFSATGSLEGQHFRKTGVLVSLSEQNLIDCSRKYGNNGCE 189
Query: 199 GGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLK 258
GGLMD AF+YI S GL E+ YPY E+ C E+ T G+ D+P+ ED+L+
Sbjct: 190 GGLMDLAFKYIKSNKGLDTEKSYPYEAEDDKCRYNP-ENSGATDKGFVDIPEGDEDALMH 248
Query: 259 ALAN-QPLSVAIEASGRDFQFYSGGV-YDGHC-GTQLDHGVAAVGYGST-RGLDYIIVKN 314
ALA P+S+AI+AS FQFY GV Y+ C T+LDHGV AVGYG+ +G DY IVKN
Sbjct: 249 ALATVGPVSIAIDASSEKFQFYKKGVFYNPRCSSTELDHGVLAVGYGTDHKGGDYWIVKN 308
Query: 315 SWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPI 349
SWG WG++GYI M RN + CG+ ASYP+
Sbjct: 309 SWGKTWGDQGYIMMARNK---KNNCGVASSASYPL 340
>gi|449673497|ref|XP_002169904.2| PREDICTED: cathepsin L-like [Hydra magnipapillata]
Length = 325
Score = 255 bits (652), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 144/303 (47%), Positives = 178/303 (58%), Gaps = 12/303 (3%)
Query: 51 WMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLG 110
W K Y E+ R+ I+KDN+ I E N K KN L +N F D+ + EF+ G
Sbjct: 30 WKMAHNKAYSHESEENVRYAIWKDNMNRITEYNSKSKNVILRMNHFGDMTNTEFRAKMNG 89
Query: 111 LKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGI 170
L K Q+ F P +VDWR +G VT VKNQG CGSCWAFS+ A+EG
Sbjct: 90 L----LLHKHQNGSTFLVPSHTAAPDAVDWRSEGYVTPVKNQGQCGSCWAFSSTGALEGQ 145
Query: 171 NQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGT 229
+ TG L SLSEQ L+DC Y NNGCNGGLMD AF YI + GG+ E YPY ++GT
Sbjct: 146 HFKKTGRLVSLSEQNLVDCSTDYGNNGCNGGLMDNAFSYIKANGGIDTETGYPYEGQDGT 205
Query: 230 CEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVAIEASGRDFQFYSGGVYD-GH 287
C +K S G+ D+P+ ED+L +A+A P+SVAI+AS FQFY GVYD
Sbjct: 206 CRYSK-SSIGADDTGFVDIPEGDEDALKQAVATVGPVSVAIDASHMSFQFYHSGVYDEPQ 264
Query: 288 CG-TQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMAS 346
C + LDHGV VGYG+ G DY +VKNSWG WG +GYI M RN + CGI AS
Sbjct: 265 CSPSALDHGVLVVGYGTDNGKDYWLVKNSWGTGWGTEGYIYMSRNN---QNQCGIASKAS 321
Query: 347 YPI 349
YP+
Sbjct: 322 YPL 324
>gi|356515062|ref|XP_003526220.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 337
Score = 255 bits (652), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 136/304 (44%), Positives = 181/304 (59%), Gaps = 3/304 (0%)
Query: 49 ESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNR-KIKNYWLGLNEFADLRHEEFKEM 107
E WM++ KVY+ EK +IF++N+ I+ + K++ L N+FADL EEFK +
Sbjct: 33 EKWMAQHGKVYKDAAEKERCLQIFENNMEFIESFDVCGDKSFNLSTNQFADLHDEEFKAL 92
Query: 108 FLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFS-TVAA 166
+ F Y +V +P S+DWRK+G VT +K+QG C SCWAFS VA
Sbjct: 93 LTNGHKKEHSLWTTTETLFRYDNVTKIPASMDWRKRGVVTPIKDQGKCLSCWAFSLCVAT 152
Query: 167 VEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIME 226
+EG++QI+T L LSEQEL+D + GC G ++ AF++I G + E YPY
Sbjct: 153 IEGLHQIITSELVPLSEQELVDFVKGESEGCYGDYVEDAFKFITKKGRIESETHYPYKGV 212
Query: 227 EGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDG 286
TC++ K V I GY VP SE++LLKA+ANQ +SV++EA FQFYS G++ G
Sbjct: 213 NNTCKVKKETHGVAQIKGYKKVPSKSENALLKAVANQLVSVSVEARDSAFQFYSSGIFTG 272
Query: 287 HCGTQLDHGVAAVGYG-STRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMA 345
CGT DH VA YG S G Y + KNSWG +WGEKGYIR+K + EGLCGI K
Sbjct: 273 KCGTDTDHRVALASYGESGDGTKYWLAKNSWGTEWGEKGYIRIKXDIPAKEGLCGIAKYP 332
Query: 346 SYPI 349
YPI
Sbjct: 333 YYPI 336
>gi|354549232|gb|AER27707.1| putative cysteine protease [Phytophthora sp. SH-2011]
Length = 533
Score = 255 bits (651), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 143/313 (45%), Positives = 179/313 (57%), Gaps = 22/313 (7%)
Query: 48 FESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKNYW----LGLNEFADLRHEE 103
F +WM + E R E + N +I E N + N W LG N F+ + +E
Sbjct: 28 FSAWMGAHGVTFSDALEFARRLENYIVNDMYIMEHNAE--NAWTGVTLGHNAFSHMSFDE 85
Query: 104 FKEMFLGL-------KPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCG 156
FK GL + LA R D D V++P +VDW KG VT VKNQG CG
Sbjct: 86 FKFKMTGLVLPEGYLEQRLASRVDGLWSD------VEVPSAVDWVDKGGVTPVKNQGMCG 139
Query: 157 SCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLH 216
SCWAFST AVEG + +G L SLSEQEL+DCD+ + GCNGGLMD+AFQ+I GG+
Sbjct: 140 SCWAFSTTGAVEGATFVSSGKLPSLSEQELVDCDHNGDMGCNGGLMDHAFQWIEDHGGIC 199
Query: 217 KEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDF 276
E+DY Y + C VV + G+ DV E +L A+A QP+SVAIEA + F
Sbjct: 200 SEDDYEYKAKAQVCREC---DSVVKVTGFQDVNPQDEHALKVAVAQQPVSVAIEADQKAF 256
Query: 277 QFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPE 336
QFY GV++ CGT+LDHGV AVGYG+ G + VKNSWG WGE+GYIR+ R P
Sbjct: 257 QFYKSGVFNLTCGTRLDHGVLAVGYGNDNGHKFWKVKNSWGASWGEQGYIRLAREENGPA 316
Query: 337 GLCGINKMASYPI 349
G CGI + SYP
Sbjct: 317 GQCGIASVPSYPF 329
>gi|326497561|dbj|BAK05870.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 340
Score = 255 bits (651), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 144/308 (46%), Positives = 181/308 (58%), Gaps = 13/308 (4%)
Query: 44 LIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIK-NYWLGLNEFADLRHE 102
++D F W + + Y S +E+L RFE+++ N+ +ID TNR+ Y LG N+FADL E
Sbjct: 41 MMDRFRQWQATHNRSYLSAEERLRRFEVYRTNVEYIDATNRRGGLTYELGENQFADLTGE 100
Query: 103 EFKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGS-CGSCWAF 161
EF + G A + D S + D P SVDWR KGAVT VKNQGS C SCWAF
Sbjct: 101 EFLARYAGGHTGSAI-TTAAEADGSLE--ADPPASVDWRAKGAVTPVKNQGSQCYSCWAF 157
Query: 162 STVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDY 221
S VA +E + I TG L +LSEQ+L+DCD Y+ GCN G AFQ+I+ GG+ Y
Sbjct: 158 SAVATMESLYFIKTGKLVALSEQQLVDCDK-YDGGCNKGYYHRAFQWIMENGGITTAAQY 216
Query: 222 PYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSG 281
PY G C K VTI G+ V +N E +L A+A QP+ VAIE QFY
Sbjct: 217 PYKAVRGACSAAK---PAVTITGHLAVAKN-ELALQSAVARQPIGVAIEVP-ISMQFYKS 271
Query: 282 GVYDGHCGTQLDHGVAAVGYGS-TRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCG 340
GV+ CG Q+ H V VGYG+ GL Y +VKNSWG WGE GYIRM+R+ G GLCG
Sbjct: 272 GVFSAACGIQMSHAVVTVGYGADASGLKYWLVKNSWGQTWGEAGYIRMRRDVGG-GGLCG 330
Query: 341 INKMASYP 348
I +YP
Sbjct: 331 IALDTAYP 338
>gi|6630972|gb|AAF19630.1|AF194426_1 cysteine proteinase precursor [Myxine glutinosa]
Length = 324
Score = 255 bits (651), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 141/311 (45%), Positives = 189/311 (60%), Gaps = 15/311 (4%)
Query: 48 FESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETN----RKIKNYWLGLNEFADLRHEE 103
+ESW K+ K Y E++ R +++ NL+ + + N + NY LG+N +ADL +EE
Sbjct: 19 WESWKGKYGKSYLGRGEEVLRKRVWESNLQIVQQHNVLADQGQANYRLGMNTYADLYNEE 78
Query: 104 FKEMFLGLKPDLARRKDQSH-EDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFS 162
F M L + + KDQS + F V LP SVDWR +G VT VK+QG CGSCW+FS
Sbjct: 79 F--MALKGSSGILQAKDQSSTQTFKPLVGVTLPSSVDWRNQGYVTPVKDQGQCGSCWSFS 136
Query: 163 TVAAVEGINQIVTGNLASLSEQELIDCDNTYNN-GCNGGLMDYAFQYIVSTGGLHKEEDY 221
++EG + TG L SLSEQ+L+DC +Y N GC+GGLM+ A+ YI GG+ E Y
Sbjct: 137 ATGSLEGQHFAKTGTLVSLSEQQLVDCSWSYGNYGCSGGLMESAYDYIRDAGGVQLESAY 196
Query: 222 PYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVAIEASGRDFQFYS 280
PY + G C + ++ V T G+ +P E SL++A+ P++VAI+ASG DFQ Y
Sbjct: 197 PYTAQNGRCHFDQSKA-VATCTGHVAIPSGDEQSLMQAVGTVGPVAVAIDASGYDFQLYE 255
Query: 281 GGVYD-GHC-GTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGL 338
GVYD C + LDHGV A GYG+ G DY +VKNSWGP WG +GYI+M RN
Sbjct: 256 SGVYDRSRCSSSSLDHGVLAAGYGTEGGNDYWLVKNSWGPGWGAQGYIKMSRNKSNQ--- 312
Query: 339 CGINKMASYPI 349
CGI MA YP+
Sbjct: 313 CGIATMACYPL 323
>gi|198427748|ref|XP_002130282.1| PREDICTED: similar to predicted protein [Ciona intestinalis]
Length = 340
Score = 255 bits (651), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 145/322 (45%), Positives = 193/322 (59%), Gaps = 17/322 (5%)
Query: 41 NDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNR----KIKNYWLGLNEF 96
N + + LF++W + ++KVY++++E+ ++ + +N I E N K K+Y L +NE+
Sbjct: 22 NQQHVSLFQTWKNLWKKVYQTVEEEEQKMATWFNNWNKISEHNMQYSLKQKSYRLEMNEY 81
Query: 97 ADLRHEEFKEMFLGLKPDL-ARRKDQSHEDF----SYKDVVDLPKSVDWRKKGAVTHVKN 151
DL EEF M G + D+ +RK + S+ + LP VDWRK G VT VKN
Sbjct: 82 GDLTSEEFSSMMNGYRNDIRLKRKSTGGSTYLNLLSFGSQIQLPTLVDWRKHGLVTPVKN 141
Query: 152 QGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIV 210
QG CGSCW+FS ++EG ++ TG L SLSEQ LIDC N+GCNGGLMD AF+YI
Sbjct: 142 QGQCGSCWSFSATGSLEGQHKKKTGKLVSLSEQNLIDCSTPEGNDGCNGGLMDQAFKYIK 201
Query: 211 STGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVAI 269
GG+ E YPY ++ TC +S T G+ D+ E+ L +A A P+SVAI
Sbjct: 202 IQGGIDTEAYYPYEAKDDTCRFNITDSG-ATDTGFVDIKSGDEEMLKEAAATVGPISVAI 260
Query: 270 EASGRDFQFYSGGVY-DGHC-GTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIR 327
+AS FQFYS GVY + C T LDHGV VGYG+ G DY +VKNSWG WGE GYI+
Sbjct: 261 DASHTSFQFYSNGVYSETACSSTMLDHGVLVVGYGTENGKDYWLVKNSWGEGWGEAGYIK 320
Query: 328 MKRNTGKPEGLCGINKMASYPI 349
M RN + CGI ASYP+
Sbjct: 321 MSRN---ADNQCGIATQASYPL 339
>gi|357446975|ref|XP_003593763.1| Cysteine proteinase [Medicago truncatula]
gi|355482811|gb|AES64014.1| Cysteine proteinase [Medicago truncatula]
Length = 350
Score = 254 bits (650), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 147/344 (42%), Positives = 200/344 (58%), Gaps = 19/344 (5%)
Query: 11 LISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFE 70
LI FCI ++ + + +++ + WM K+E+ Y + E +R +
Sbjct: 4 LIGFCIILLWACAYP--------TMSRTLTESSVVEAHQQWMMKYERTYTNSSEMEKRKK 55
Query: 71 IFKDNLRHIDE-TNRKIKNYWLGLNEFADLRHEEFKEMFLGLK--PDLARRKDQSHE-DF 126
IFK+NL +I+ N K+Y LGLN ++DL EEF G K L+ K +S F
Sbjct: 56 IFKENLEYIENFNNVGNKSYKLGLNRYSDLTSEEFIASHTGFKVSDQLSDSKMRSVAIPF 115
Query: 127 SYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQEL 186
+ D D+P + DWR+KG VT VKNQ CG CWAF+ VAAVEGI +I GNL SLSEQ+L
Sbjct: 116 NLND--DVPTNFDWREKGVVTDVKNQRQCGCCWAFTAVAAVEGIVKIKNGNLISLSEQQL 173
Query: 187 IDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEE-GTCEMTKGESEVVTINGY 245
+DCD ++GC GG AF I+ + G+ KE+DYPY + TC++ + INGY
Sbjct: 174 VDCDRQ-SSGCGGGDFVLAFDSIIKSRGIVKEDDYPYKANDVQTCQLGQIPG-AAQINGY 231
Query: 246 HDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYG-ST 304
VP N E LL+A+ QP+SVAI S DF Y GGVY+G CG +L+H V +GYG S
Sbjct: 232 FKVPANDEQQLLRAVLQQPVSVAISTS-YDFHHYMGGVYEGSCGPKLNHAVTIIGYGVSE 290
Query: 305 RGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
G Y ++KNSWG WGEKGY+++ R + G C I A+YP
Sbjct: 291 AGKKYWLIKNSWGETWGEKGYMKVLRESSATGGQCSIAVHAAYP 334
>gi|322799749|gb|EFZ20954.1| hypothetical protein SINV_06041 [Solenopsis invicta]
Length = 337
Score = 254 bits (649), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 150/303 (49%), Positives = 178/303 (58%), Gaps = 14/303 (4%)
Query: 57 KVYESLDEKLERFEIFKDNLRHIDETNRKIK----NYWLGLNEFADLRHEEFKEMFLGLK 112
KVY+S E+ R +I+ DN R I E NRK + Y LG+N++ D+ H EF G
Sbjct: 38 KVYKSPVEEGYRMKIYMDNKRKIAEHNRKYELNEVTYKLGMNKYGDMLHHEFVNTLNGFN 97
Query: 113 PDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQ 172
+ + F V LP VDW K+GAVT VK+QG CGSCWAFS+ A+EG +
Sbjct: 98 KSVTAGIETEGVTFISPANVKLPDEVDWTKQGAVTAVKDQGHCGSCWAFSSTGALEGQHF 157
Query: 173 IVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCE 231
TG L SLSEQ LIDC Y NNGCNGGLMDYAFQYI GL E+ YPY E C
Sbjct: 158 RSTGYLVSLSEQNLIDCSGKYGNNGCNGGLMDYAFQYIKDNKGLDTEKTYPYEAENDRCR 217
Query: 232 MTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVAIEASGRDFQFYSGGV-YDGHCG 289
S T GY D+PQ E+ L A+A P+SVAI+AS FQ YS GV YD C
Sbjct: 218 YNPRNSG-ATDKGYVDIPQGDEEKLKAAVATIGPISVAIDASHESFQLYSEGVYYDPDCS 276
Query: 290 TQ-LDHGVAAVGYGS--TRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMAS 346
+ LDHGV VGYG+ T G DY +VKNSWG WG+KGYI+M RN CGI AS
Sbjct: 277 AENLDHGVLIVGYGTDETSGHDYWLVKNSWGKTWGQKGYIKMARNKNNH---CGIASSAS 333
Query: 347 YPI 349
YP+
Sbjct: 334 YPL 336
>gi|348546019|ref|XP_003460476.1| PREDICTED: cathepsin L-like [Oreochromis niloticus]
gi|348546143|ref|XP_003460538.1| PREDICTED: cathepsin L-like [Oreochromis niloticus]
Length = 334
Score = 254 bits (649), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 146/315 (46%), Positives = 188/315 (59%), Gaps = 20/315 (6%)
Query: 48 FESWMSKFEKVYESLDEKLERFEIFKDN----LRHIDETNRKIKNYWLGLNEFADLRHEE 103
F +W KFE+ Y S E+ R +I+ +N L H ++ +K+Y LG+ FAD+ +EE
Sbjct: 26 FHAWKLKFERSYHSPSEEAHRRQIWLNNRKFVLVHNILADQGLKSYRLGMTYFADMENEE 85
Query: 104 FKEM-----FLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSC 158
+K + L RR F + DLP +VDWR KG VT VK+Q CGSC
Sbjct: 86 YKRVISQGCLHSFNASLPRRGSTF---FRLPEGTDLPDAVDWRDKGYVTDVKDQKQCGSC 142
Query: 159 WAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNN-GCNGGLMDYAFQYIVSTGGLHK 217
WAFS ++EG + TG L SLSEQ+L+DC Y N GC GGLMDYAFQYI + GG+
Sbjct: 143 WAFSATGSLEGQHFRKTGTLVSLSEQQLVDCSGDYGNMGCMGGLMDYAFQYIQANGGIDT 202
Query: 218 EEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVAIEASGRDF 276
EE YPY E G C ++ T GY +V Q ED+L +A+A P+SV I+AS F
Sbjct: 203 EESYPYEAENGKCRYNP-DNIGATSTGYTEVSQGDEDALKEAVATIGPISVGIDASQMSF 261
Query: 277 QFYSGGVY-DGHCGT-QLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGK 334
QFY GVY + C + +LDHGV AVGYG+ G DY +VKNSWG +WG+KGYI+M RN
Sbjct: 262 QFYESGVYNEPDCSSLELDHGVLAVGYGTEDGNDYWLVKNSWGLEWGDKGYIKMSRNKSN 321
Query: 335 PEGLCGINKMASYPI 349
CGI ASYP+
Sbjct: 322 Q---CGIATAASYPL 333
>gi|340370270|ref|XP_003383669.1| PREDICTED: cathepsin L1-like [Amphimedon queenslandica]
Length = 326
Score = 254 bits (649), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 138/308 (44%), Positives = 191/308 (62%), Gaps = 13/308 (4%)
Query: 48 FESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKNY--WLGLNEFADLRHEEFK 105
F+ W K+ KVYE+ + +LER I++ N + ++ N + + +NEFADL EF
Sbjct: 23 FQDWKVKYNKVYETKETELERQIIWESNKKFVENHNANSDKFGFTVAMNEFADLDAGEFA 82
Query: 106 EMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVA 165
++ GL P R + K V + +VDWR+KGAVT VKNQG CGSCW+FS+
Sbjct: 83 NIYNGLLP---RPASYNSTKLFKKTGVSVGDTVDWREKGAVTEVKNQGKCGSCWSFSSTG 139
Query: 166 AVEGINQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIVSTGGLHKEEDYPYI 224
++EG + + TG L+SLSEQ+L+DC ++ N+GC GGLMD +F+Y+ + G EE YPY
Sbjct: 140 SLEGQHFLKTGTLSSLSEQQLMDCSTSFGNHGCKGGLMDNSFRYLETVAGDMSEEMYPYT 199
Query: 225 MEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVAIEASGRDFQFYSGGV 283
E+G C E+ + GY D+P+ ED+L +A+A P+SVAI+A R FQ Y G+
Sbjct: 200 AEDGFCRYRSSEA-IAKDTGYKDIPRGDEDALKEAVATVGPISVAIDAGHRSFQLYHEGI 258
Query: 284 -YDGHC-GTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGI 341
Y+ C T+LDHGV AVGYG+ G +Y +VKNSWGP WG +GY+ M RN E CGI
Sbjct: 259 YYEPACSSTKLDHGVLAVGYGTGEGEEYWLVKNSWGPSWGNEGYVMMSRNR---ENNCGI 315
Query: 342 NKMASYPI 349
ASYP
Sbjct: 316 ATQASYPT 323
>gi|294883322|ref|XP_002770704.1| cathepsin L, putative [Perkinsus marinus ATCC 50983]
gi|239873993|gb|EER02713.1| cathepsin L, putative [Perkinsus marinus ATCC 50983]
Length = 333
Score = 254 bits (649), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 138/289 (47%), Positives = 181/289 (62%), Gaps = 11/289 (3%)
Query: 48 FESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEM 107
F + KF K YES +E+++R IF+ NL HI++ N K +Y LG+NE ADL HEEF +
Sbjct: 28 FMGFQHKFGKNYESKEEEVKRNAIFQANLHHIEQVNAKDLSYKLGVNEHADLTHEEFAAL 87
Query: 108 FLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAV 167
LG RR D+ + D LP SVDWR K +T VK+QGSCGSCWAFST A+
Sbjct: 88 KLGTLKMSTRRDDKFVIE---ADTTQLPTSVDWRNKNVLTPVKDQGSCGSCWAFSTTGAL 144
Query: 168 EGINQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIME 226
E I TG L SLSEQ+L+DC + Y NNGC GGLMD A++YI S GL +E Y Y
Sbjct: 145 EAQYAIATGKLLSLSEQQLVDCSSGYGNNGCEGGLMDDAYEYIKSA-GLDQESTYSYNGT 203
Query: 227 EGTCEMTKGESE----VVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGG 282
+ C+ + + + G+H + + +E SL+KALA+ P+SVA+ A+ DF+FY G
Sbjct: 204 DDVCQGSLAKRSDGIPAGEVTGFHMLDK-TEQSLMKALADAPVSVAMYAADPDFRFYKSG 262
Query: 283 VY-DGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKR 330
VY C +LDHGV AVGYG+ G DY I++NSWG WG+ GY +KR
Sbjct: 263 VYSSATCNGKLDHGVVAVGYGTENGSDYFIIRNSWGSSWGQAGYFYLKR 311
>gi|52630917|gb|AAU84922.1| putative cathepsin L [Toxoptera citricida]
Length = 341
Score = 254 bits (649), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 149/317 (47%), Positives = 192/317 (60%), Gaps = 20/317 (6%)
Query: 46 DLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKN----YWLGLNEFADLRH 101
DLF+ +F+K+YE + E+ R +++ DN I N+ + Y L +N F DL
Sbjct: 31 DLFKV---QFKKIYEDVKEEAFRKKVYLDNKLKIARHNKLYETGEETYALEMNHFGDLMQ 87
Query: 102 EEFKEMFLGLKPDLARRKDQSHED----FSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGS 157
E+ +M G KP LA +D F + V +PKS+DWRKKG VT VKNQG CGS
Sbjct: 88 HEYTKMMNGFKPSLAGGDKNFTDDDAVTFLKSENVVIPKSIDWRKKGYVTPVKNQGQCGS 147
Query: 158 CWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIVSTGGLH 216
CW+FS ++EG + TG L SLSEQ LIDC Y NNGC GGLMD AF+YI S GL
Sbjct: 148 CWSFSATGSLEGQHFRKTGVLVSLSEQNLIDCSRKYGNNGCEGGLMDLAFKYIKSNKGLD 207
Query: 217 KEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVAIEASGRD 275
E+ YPY E+ C E+ T G+ D+P+ ED+L+ ALA P+S+AI+AS
Sbjct: 208 TEKSYPYEAEDDKCRYNP-ENSGATDKGFVDIPEGDEDALVHALATVGPVSIAIDASSEK 266
Query: 276 FQFYSGGV-YDGHC-GTQLDHGVAAVGYGST-RGLDYIIVKNSWGPKWGEKGYIRMKRNT 332
FQFY GV Y+ C T+LDHGV AVGYG+ +G DY IVKNSWG WG++GYI M RN
Sbjct: 267 FQFYKKGVFYNPRCSSTELDHGVLAVGYGTDHKGGDYWIVKNSWGKTWGDQGYIMMARNK 326
Query: 333 GKPEGLCGINKMASYPI 349
+ CG+ ASYP+
Sbjct: 327 ---KNNCGVASSASYPL 340
>gi|59798093|sp|P84346.1|MEX1_JACME RecName: Full=Mexicain
Length = 214
Score = 254 bits (648), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 120/216 (55%), Positives = 156/216 (72%), Gaps = 6/216 (2%)
Query: 135 PKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYN 194
P+S+DWR+KGAVT VKNQ CGSCWAFSTVA +EGIN+I+TG L SLSEQEL+DC+ +
Sbjct: 2 PESIDWREKGAVTPVKNQNPCGSCWAFSTVATIEGINKIITGQLISLSEQELLDCEYR-S 60
Query: 195 NGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSED 254
+GC+GG + QY+V G +H E +YPY ++G C + V I GY VP N E
Sbjct: 61 HGCDGGYQTPSLQYVVDNG-VHTEREYPYEKKQGRCRAKDKKGPKVYITGYKYVPANDEI 119
Query: 255 SLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKN 314
SL++A+ANQP+SV ++ GR FQFY GG+Y+G CGT DH V AVGYG T Y+++KN
Sbjct: 120 SLIQAIANQPVSVVTDSRGRGFQFYKGGIYEGPCGTNTDHAVTAVGYGKT----YLLLKN 175
Query: 315 SWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
SWGP WGEKGYIR+KR +G+ +G CG+ + +PIK
Sbjct: 176 SWGPNWGEKGYIRIKRASGRSKGTCGVYTSSFFPIK 211
>gi|219362839|ref|NP_001136636.1| uncharacterized protein LOC100216764 precursor [Zea mays]
gi|194696462|gb|ACF82315.1| unknown [Zea mays]
gi|413934556|gb|AFW69107.1| hypothetical protein ZEAMMB73_554980 [Zea mays]
Length = 361
Score = 254 bits (648), Expect = 6e-65, Method: Compositional matrix adjust.
Identities = 152/367 (41%), Positives = 209/367 (56%), Gaps = 22/367 (5%)
Query: 1 MALSSQFKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYE 60
M +++ T + + + I S S + Y+ DL S + L L+E W + + +
Sbjct: 1 MVRAAEVATTMAAALV-VVIALSTTPAASAIDYTEHDLASEESLWALYERWCAHY-NMAR 58
Query: 61 SLDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLG--LKPDLARR 118
L EK RF +FK+N I E N+ Y LGLN F+D+ EEF G L + R
Sbjct: 59 DLGEKTRRFNLFKENAHRIYEHNQGNATYTLGLNRFSDMTDEEFSRSPYGRCLFAPVQRI 118
Query: 119 KD------QSHEDFSYK-------DVVDLPKSVDWRKKGAVTHVKNQG-SCGSCWAFSTV 164
D Q HED S+ + LP SVDWR + +VT VK+QG +CGSCWAF+ +
Sbjct: 119 SDGENEELQQHEDVSFNLTHGGATAALGLPPSVDWRGR-SVTRVKDQGLTCGSCWAFAAI 177
Query: 165 AAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYI 224
AAVEGIN I T +L +LSEQ+L+DCDN ++GC GG + A +IV G+ E YPYI
Sbjct: 178 AAVEGINAIRTWSLVTLSEQQLVDCDNV-DHGCAGGWIPSALDFIVRNRGIVPEGTYPYI 236
Query: 225 MEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVY 284
+G C VTI+GY V ++L+ A+A QP++VA+E+S F+ Y GGV+
Sbjct: 237 GTQGRCRHVMAPP--VTIDGYRRVLPFDVNALMSAVAAQPVAVAMESSAWAFRHYQGGVF 294
Query: 285 DGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKM 344
+G+CG +L H A VGYG G + IVKNSWGPKWGE GY+R+ RN G+CGI
Sbjct: 295 NGNCGGRLGHAAAVVGYGDGAGGPFWIVKNSWGPKWGEGGYVRISRNAPNRLGICGILTQ 354
Query: 345 ASYPIKK 351
YP+K+
Sbjct: 355 PLYPVKR 361
>gi|346469447|gb|AEO34568.1| hypothetical protein [Amblyomma maculatum]
Length = 333
Score = 253 bits (647), Expect = 7e-65, Method: Compositional matrix adjust.
Identities = 145/310 (46%), Positives = 193/310 (62%), Gaps = 12/310 (3%)
Query: 48 FESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRK----IKNYWLGLNEFADLRHEE 103
+E++ S +K Y+S E+L RF+IF +N I + N K + +Y LG+N+FADL E
Sbjct: 27 WEAFKSTHKKTYKSNVEELLRFKIFTENSLFIAKHNVKYAKGLVSYKLGINQFADLLPHE 86
Query: 104 FKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFST 163
F +M G + + ++ + + LPK+VDWRKKGAVT VK+QG CGSCWAFS+
Sbjct: 87 FVKMMNGYQGKRLAGRGSTYLPPANLNDSSLPKTVDWRKKGAVTPVKDQGQCGSCWAFSS 146
Query: 164 VAAVEGINQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIVSTGGLHKEEDYP 222
++EG + + TG L SLSEQ L+DC + Y N GCNGGLMD +F YI + GG+ E+ YP
Sbjct: 147 TGSLEGQHFLKTGKLVSLSEQNLVDCSSAYGNQGCNGGLMDNSFNYIKANGGIDTEDSYP 206
Query: 223 YIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVAIEASGRDFQFYSG 281
Y E+G C K E T G+ D+ + SE L KA+A P+SVAI+AS + FQ YS
Sbjct: 207 YEAEDGDCRYKK-EDVGATDTGFVDIKEGSEKDLQKAVATVGPVSVAIDASQQSFQLYSE 265
Query: 282 GVYD-GHCGTQ-LDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLC 339
GVYD +C ++ LDHGV AVGYG G Y +VKNSW WG+ GYI M R+ C
Sbjct: 266 GVYDEPNCSSESLDHGVLAVGYGVKNGKKYWLVKNSWAETWGQDGYILMSRDKNNQ---C 322
Query: 340 GINKMASYPI 349
GI ASYP+
Sbjct: 323 GIASSASYPL 332
>gi|298709635|emb|CBJ31444.1| Cathepsin L-like proteinase [Ectocarpus siliculosus]
Length = 475
Score = 253 bits (647), Expect = 7e-65, Method: Compositional matrix adjust.
Identities = 140/323 (43%), Positives = 191/323 (59%), Gaps = 24/323 (7%)
Query: 44 LIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEE 103
L+ FE W K+ + + S+ E + + I N + Y L N ++ + +E
Sbjct: 158 LLGFFE-WTYKYGQSWGSVHEAFHALQNYARADDKIALHNHEDAGYTLAHNAYSHMSWQE 216
Query: 104 FKEMFLGLKPDLARRKDQSHEDFSYKDV-----------VDLPKSVDWRKKGAVTHVKNQ 152
F+E F + D+ DQ +F+ + +P VDW KGAVT VKNQ
Sbjct: 217 FREHF-SIGKDMVVPPDQLPAEFALRPRGEKAPKELLRGAPIPDEVDWVAKGAVTPVKNQ 275
Query: 153 GSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVST 212
GSCGSCW+FST ++EG + I GNLA LSEQEL+DCD TY+ GCNGGLMDY+F +I
Sbjct: 276 GSCGSCWSFSTTGSMEGAHFIKHGNLAVLSEQELVDCD-TYDMGCNGGLMDYSFHWIQQN 334
Query: 213 GGLHKEEDYPY-----IMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSV 267
GG+ EEDYPY + ++ TC++ +G ++ + DV + E +L++A+A QP+S+
Sbjct: 335 GGICSEEDYPYTAAGDLCKKSTCDVVEG----TMVDKWVDVASDDEQALMEAVAQQPVSI 390
Query: 268 AIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYG-STRGLDYIIVKNSWGPKWGEKGYI 326
AIEA FQ YSGGV CGT LDHGV VGYG S G+ Y VKNSWGP+WG +GYI
Sbjct: 391 AIEADQMSFQLYSGGVLTAACGTNLDHGVLLVGYGVSEDGVKYWKVKNSWGPEWGAEGYI 450
Query: 327 RMKRNTGKPEGLCGINKMASYPI 349
+KR + G CGI + ASYP+
Sbjct: 451 LLKREADQEGGECGILEQASYPV 473
>gi|443724292|gb|ELU12369.1| hypothetical protein CAPTEDRAFT_165495 [Capitella teleta]
Length = 351
Score = 253 bits (647), Expect = 8e-65, Method: Compositional matrix adjust.
Identities = 144/315 (45%), Positives = 194/315 (61%), Gaps = 17/315 (5%)
Query: 48 FESWMSKFEKVYE---SLDEKLERFEIFKDNLRHIDETN----RKIKNYWLGLNEFADLR 100
FE F+ V+E E+++R E+F++NL+ I+ N + +Y +G+N+FAD+
Sbjct: 40 FEKLWQDFKTVHERNYGETEEMQRKEVFRNNLKKIEMHNYLHSQGKSSYRMGINQFADME 99
Query: 101 HEEFKEMFLGLK-PDLARRKDQSHEDFSYKDV-VDLPKSVDWRKKGAVTHVKNQGSCGSC 158
+EF + G + + + +D H + + V LP VDWRK+G VT +K+QG CGSC
Sbjct: 100 VKEFASVVNGFRMNNRTKVRDHLHSHYISPAIPVSLPAEVDWRKEGYVTPIKDQGHCGSC 159
Query: 159 WAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIVSTGGLHK 217
W+FST A+EG + TG L SLSEQ LIDC +Y NNGCNGG+MDYAFQYI G
Sbjct: 160 WSFSTTGALEGQHFRKTGKLVSLSEQNLIDCSTSYGNNGCNGGVMDYAFQYIKDNDGDDT 219
Query: 218 EEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVAIEASGRDF 276
E+ YPY +G C K E T GY D+P+ E+ + +A+A P+SVAI+AS F
Sbjct: 220 EDSYPYEAADGPCRFKK-EYVGATDTGYTDLPKGDEEKMKEAVAMVGPVSVAIDASHTSF 278
Query: 277 QFYSGGVYDG-HCGTQ-LDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGK 334
Q Y GVYD C + LDHGV VGYG+ G DY +VKNSWG KWG++GYI+M RN
Sbjct: 279 QMYQSGVYDEVECDPEGLDHGVLVVGYGTELGQDYWLVKNSWGTKWGDEGYIKMSRNKNN 338
Query: 335 PEGLCGINKMASYPI 349
CGI+ MASYP+
Sbjct: 339 ---QCGISSMASYPL 350
>gi|159792912|gb|ABW98676.1| cathepsin L [Apostichopus japonicus]
Length = 332
Score = 253 bits (647), Expect = 8e-65, Method: Compositional matrix adjust.
Identities = 151/312 (48%), Positives = 197/312 (63%), Gaps = 16/312 (5%)
Query: 46 DLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRK----IKNYWLGLNEFADLRH 101
D +E+W K Y + +E+ R +I++DNL+ + + N + + +Y LG+N++ADLR
Sbjct: 26 DTWEAWKQTHSKQY-TKEEEDNRRKIWEDNLQKVSKHNTEHSLGLHSYTLGMNKYADLRG 84
Query: 102 EEFKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAF 161
EEF +M GLK D +R + Q + SY P SVDWR +G VT VK+QG CGSCWAF
Sbjct: 85 EEFVQMMNGLKFDASRER-QGIKFLSYAKF-QAPDSVDWRDEGYVTPVKDQGQCGSCWAF 142
Query: 162 STVAAVEGINQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIVSTGGLHKEED 220
ST ++EG + TG L SLSEQ L+DC +Y NNGC GGLMDYAFQYI G+ E+
Sbjct: 143 STTGSLEGQHFRSTGVLTSLSEQNLVDCSISYGNNGCEGGLMDYAFQYIKDNLGIDTEDK 202
Query: 221 YPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKAL-ANQPLSVAIEASGRDFQFY 279
YPY E+ TC + ++ T +GY DV ED+L +A AN P+SVAI+AS FQ Y
Sbjct: 203 YPYEAEDDTCRFSP-DNVGATDSGYVDVDSGDEDALKEACAANGPISVAIDASHESFQLY 261
Query: 280 SGGVYDGH-CGT-QLDHGVAAVGYGS-TRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPE 336
GVYD C + +LDHGV VGYG+ + G DY IVKNSWG WG++GYI M RN +
Sbjct: 262 ESGVYDEESCSSIELDHGVLVVGYGTDSVGGDYWIVKNSWGLSWGQEGYIWMSRN---KD 318
Query: 337 GLCGINKMASYP 348
CGI ASYP
Sbjct: 319 NQCGIATSASYP 330
>gi|254674508|dbj|BAH86062.1| cysteine protease [Haemaphysalis longicornis]
Length = 333
Score = 253 bits (647), Expect = 8e-65, Method: Compositional matrix adjust.
Identities = 146/319 (45%), Positives = 193/319 (60%), Gaps = 12/319 (3%)
Query: 39 TSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRK----IKNYWLGLN 94
+S + L +E++ S+ K Y S E+L RF+IF +N + + N K + +Y L +N
Sbjct: 18 SSQEILRTEWEAFKSQHNKAYSSHVEELLRFKIFTENTLLVAKHNAKYAKGLVSYKLAMN 77
Query: 95 EFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGS 154
+F DL EF +M G + + + + + + LP +VDWRKKGAVT VKNQG
Sbjct: 78 KFGDLLPHEFAKMVNGYRGKQNKEQRPTFIPPANLNDSSLPTTVDWRKKGAVTPVKNQGQ 137
Query: 155 CGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIVSTG 213
CGSCWAFST ++EG + TG L SLSEQ L+DC + + N GCNGGLMD FQYI + G
Sbjct: 138 CGSCWAFSTTGSLEGQHFRKTGKLVSLSEQNLVDCSDDFGNQGCNGGLMDNGFQYIKANG 197
Query: 214 GLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVAIEAS 272
G+ EE +PY ++G C+ K + T G+ D+ Q SED L KA+A P+SVAI+AS
Sbjct: 198 GIDTEESHPYTAQDGDCKFKKADVG-ATDAGFVDIQQGSEDDLKKAVATVGPVSVAIDAS 256
Query: 273 GRDFQFYSGGVYD-GHC-GTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKR 330
FQ YS GVYD C +QLDHGV VGYG G Y +VKNSWG WG+ GYI M R
Sbjct: 257 HGSFQLYSQGVYDEPDCSSSQLDHGVLTVGYGVKNGKKYWLVKNSWGGDWGDNGYILMSR 316
Query: 331 NTGKPEGLCGINKMASYPI 349
+ + CGI ASYP+
Sbjct: 317 DK---DNQCGIASSASYPL 332
>gi|410898132|ref|XP_003962552.1| PREDICTED: cathepsin L-like [Takifugu rubripes]
Length = 335
Score = 253 bits (647), Expect = 8e-65, Method: Compositional matrix adjust.
Identities = 145/312 (46%), Positives = 189/312 (60%), Gaps = 14/312 (4%)
Query: 48 FESWMSKFEKVYESLDEKLERFEIFKDN----LRHIDETNRKIKNYWLGLNEFADLRHEE 103
F +W KF + Y + E+++R +I+ +N L H ++ IK+Y LG+ +FAD+ +EE
Sbjct: 27 FHAWKLKFGRSYRTPSEEVQRMQIWLNNRKLVLVHNILADQGIKSYRLGMTQFADMDNEE 86
Query: 104 FKEMF-LG-LKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAF 161
+K + LG L+ + F + LP +VDWR KG VT VK+Q CGSCWAF
Sbjct: 87 YKSLISLGCLRAFNTSAPRRGSAFFRLAEGTHLPTTVDWRDKGYVTGVKDQKQCGSCWAF 146
Query: 162 STVAAVEGINQIVTGNLASLSEQELIDCDNTYNN-GCNGGLMDYAFQYIVSTGGLHKEED 220
S ++EG N TG L SLSEQ+L+DC Y N GCNGGLMDYAF+YI GG+ E+
Sbjct: 147 SATGSLEGQNFRKTGKLVSLSEQQLVDCSGDYGNMGCNGGLMDYAFKYIQENGGIDTEKS 206
Query: 221 YPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVAIEASGRDFQFY 279
YPY E+G C K E+ GY DV ED+L +A+A P+SV I+AS FQ Y
Sbjct: 207 YPYEAEDGQCRF-KPENVGAKCTGYVDVTVGDEDALKEAVATIGPVSVGIDASHSSFQLY 265
Query: 280 SGGVYDGH-CGTQ-LDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEG 337
GVYD C +Q LDHGV AVGYG+ G DY +VKNSWG WG++GYI M RN +
Sbjct: 266 DSGVYDEQDCSSQDLDHGVLAVGYGTDNGQDYWLVKNSWGLGWGQEGYIMMSRN---KDN 322
Query: 338 LCGINKMASYPI 349
CGI ASYP+
Sbjct: 323 QCGIATAASYPL 334
>gi|209693435|ref|NP_001129410.1| cathepsin L precursor [Acyrthosiphon pisum]
gi|251823771|ref|NP_001156569.1| cathepsin L precursor [Acyrthosiphon pisum]
Length = 341
Score = 253 bits (646), Expect = 9e-65, Method: Compositional matrix adjust.
Identities = 147/309 (47%), Positives = 188/309 (60%), Gaps = 17/309 (5%)
Query: 54 KFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKN----YWLGLNEFADLRHEEFKEMFL 109
+F+K+YE + E+ R +++ DN I N+ ++ Y L +N F DL E+ +M
Sbjct: 36 QFKKLYEDIKEETFRKKVYLDNKLKIARHNKLYESGEETYALEMNHFGDLMQHEYTKMMN 95
Query: 110 GLKPDLARRKDQSHED----FSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVA 165
G KP LA D F + V +PKSVDWRKKG VT VKNQG CGSCW+FS
Sbjct: 96 GFKPSLAGGDRNFTNDEAVTFLKSENVVIPKSVDWRKKGYVTPVKNQGQCGSCWSFSATG 155
Query: 166 AVEGINQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIVSTGGLHKEEDYPYI 224
++EG + TG L SLSEQ LIDC Y NNGC GGLMD AF+YI S GL E+ YPY
Sbjct: 156 SLEGQHFRKTGVLVSLSEQNLIDCSRKYGNNGCEGGLMDLAFKYIKSNKGLDTEKSYPYE 215
Query: 225 MEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVAIEASGRDFQFYSGGV 283
E+ C E+ T G+ D+P+ ED+L+ ALA P+S+AI+AS FQFY GV
Sbjct: 216 AEDDKCRYNP-ENSGATDKGFVDIPEGDEDALMHALATVGPVSIAIDASSEKFQFYKKGV 274
Query: 284 -YDGHC-GTQLDHGVAAVGYGS-TRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCG 340
Y+ C T+LDHGV AVG+GS +G DY IVKNSWG WG++GYI M RN + CG
Sbjct: 275 FYNPRCSSTELDHGVLAVGFGSDKKGGDYWIVKNSWGKTWGDEGYIMMARNK---KNNCG 331
Query: 341 INKMASYPI 349
+ ASYP+
Sbjct: 332 VASSASYPL 340
>gi|330434686|gb|AEC22811.1| cathepsin L [Macrobrachium nipponense]
Length = 342
Score = 253 bits (645), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 149/321 (46%), Positives = 196/321 (61%), Gaps = 22/321 (6%)
Query: 47 LFESWMS-KFE--KVYESLDEKLERFEIFKDNLRHIDETNRKI----KNYWLGLNEFADL 99
+ E W S KFE K YES E+ R +IF +N + I N+ K Y LG+N++ D+
Sbjct: 25 VMEEWESFKFEHSKKYESDTEETFRMKIFAENKQKIAAHNKLYHTGSKTYKLGMNKYGDM 84
Query: 100 RHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVD------LPKSVDWRKKGAVTHVKNQG 153
H EF M G + + + +++ F V+ +PKSVDWR+KGAVT VK+QG
Sbjct: 85 LHHEFVNMMNGFRANTSGAGYKANRGFQGAHFVEPPEDVVMPKSVDWREKGAVTEVKDQG 144
Query: 154 SCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIVST 212
SCGSCWAFS A+EG + TG+L SLSEQ L+DC + + NNGCNGGLMD AFQYI
Sbjct: 145 SCGSCWAFSATGALEGQHYRQTGDLVSLSEQNLVDCSSKFGNNGCNGGLMDNAFQYIKVN 204
Query: 213 GGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVAIEA 271
GG+ E+ YPY E+ C + G+ DV + +E++L KA+A P+SVAI+A
Sbjct: 205 GGIDTEKSYPYEAEDEPCRYNPANAG-ADDRGFVDVREGNENALKKAIATIGPVSVAIDA 263
Query: 272 SGRDFQFYSGGVY-DGHCGTQ-LDHGVAAVGYGSTR-GLDYIIVKNSWGPKWGEKGYIRM 328
S FQFY GVY D C + LDHGV AVGYG+T G DY +VKNSW WG++GYI++
Sbjct: 264 SQDSFQFYQHGVYSDPDCSAENLDHGVLAVGYGTTEDGQDYWLVKNSWSKSWGDQGYIKI 323
Query: 329 KRNTGKPEGLCGINKMASYPI 349
RN +CGI ASYP+
Sbjct: 324 ARNQNN---MCGIASAASYPL 341
>gi|390337645|ref|XP_001199228.2| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
Length = 333
Score = 253 bits (645), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 146/310 (47%), Positives = 186/310 (60%), Gaps = 13/310 (4%)
Query: 48 FESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIK----NYWLGLNEFADLRHEE 103
++ W ++ K Y S +E+ R I++ NL + N K Y LG+N+FADL+++E
Sbjct: 28 WKEWKNEHGKRYLSDEEEASRRLIWQKNLDIVIRHNLKYDLGHFTYDLGMNQFADLQNKE 87
Query: 104 FKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFST 163
F M G + + + + +V LPK+VDWR KG VT VK+QG CGSCWAFS
Sbjct: 88 FVAMMTGFRVNGTSKAAKGSTFLPPNNVGKLPKTVDWRTKGYVTPVKDQGQCGSCWAFSA 147
Query: 164 VAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPY 223
++EG + TG L SLSEQ L+DC + N GCNGGLMD AFQYI+ GG+ EE YPY
Sbjct: 148 TGSLEGQHFKKTGKLVSLSEQNLVDCSDK-NYGCNGGLMDRAFQYIIDAGGIDTEESYPY 206
Query: 224 IMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVAIEASGRDFQFYSGG 282
I +G C K + T+ GY DV SE +L KA+A+ P+SVAI+AS FQ Y G
Sbjct: 207 IAMDGNCHF-KTANVGATVTGYTDVTSGSEKALQKAVAHIGPISVAIDASHFSFQLYQSG 265
Query: 283 VYD--GHCGTQLDHGVAAVGYGST-RGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLC 339
VY+ G T LDHGV AVGYG+T G DY IVKNSW WG GYI M RN + C
Sbjct: 266 VYNEPGCSSTLLDHGVLAVGYGTTIDGTDYWIVKNSWAETWGMNGYIWMSRN---KDNQC 322
Query: 340 GINKMASYPI 349
GI ASYP+
Sbjct: 323 GIATQASYPL 332
>gi|281203744|gb|EFA77940.1| hypothetical protein PPL_08585 [Polysphondylium pallidum PN500]
Length = 505
Score = 252 bits (644), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 145/349 (41%), Positives = 197/349 (56%), Gaps = 32/349 (9%)
Query: 28 FSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIK 87
F ++ S L S ++ + FE+W+ +FEK Y+ + E +RF IFK N+ + N K
Sbjct: 161 FGLIAISNALLFSEEQYKNEFENWIDRFEKKYD-VSEFKKRFSIFKSNMDFVHSWNSKNS 219
Query: 88 NYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVT 147
LGLN ADL + E+++ +LG +HE + + V +VDWR+KGAV+
Sbjct: 220 QTVLGLNHLADLTNLEYRQFYLGTHKKAVLGTPGNHEVSNLQSVFGDSATVDWRQKGAVS 279
Query: 148 HVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNN-GCNGGLMDYAF 206
+K+QG CGSCW+FST +VEG +QI +GN+ LSEQ L+DC + N GCNGGLMDYAF
Sbjct: 280 PIKDQGQCGSCWSFSTTGSVEGAHQIKSGNMVELSEQNLVDCSTSEGNMGCNGGLMDYAF 339
Query: 207 QYIVSTGGLHKEEDYPYIMEEG-TCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQ-P 264
+YI++ G+ E YPY G TC+ K S TI+ Y ++ SE L A+ N P
Sbjct: 340 EYIITNNGIDTESSYPYTASSGTTCKYNKANSG-ATISSYKNITAGSESDLADAVKNAGP 398
Query: 265 LSVAIEASGRDFQFYSGGV-YDGHCGT-QLDHGVAAVGYGSTR----------------- 305
+SVAI+AS FQ YS G+ YD C + LDHGV VGYGS
Sbjct: 399 VSVAIDASHNSFQLYSHGIYYDASCSSVNLDHGVLVVGYGSGTPDSDSRVHKGSQVRVKV 458
Query: 306 -----GLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPI 349
+Y IVKNSWG WG+KG+I M ++ + CGI ASYPI
Sbjct: 459 PKTDDTKNYWIVKNSWGTSWGDKGFIYMSKDR---DNNCGIASCASYPI 504
>gi|325185016|emb|CCA19507.1| cysteine protease family C01A putative [Albugo laibachii Nc14]
Length = 492
Score = 252 bits (644), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 139/308 (45%), Positives = 174/308 (56%), Gaps = 35/308 (11%)
Query: 48 FESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEM 107
F SW+ + E +R E + N +I N + ++ LG N F+ L +EEF++
Sbjct: 33 FVSWLKTHHLTFSDAFEYAKRLETYIANDIYILTHNLQESSFKLGHNAFSHLTNEEFRQR 92
Query: 108 FLGLKPD-------LARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWA 160
F G K LA+ S +F Y +DLP+SVDW +KGAVT VKNQG CGSCWA
Sbjct: 93 FNGFKASDDYLTKRLAQSNVASSTNFQY---IDLPESVDWVEKGAVTGVKNQGMCGSCWA 149
Query: 161 FSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEED 220
FST A+EG I +G L SLSEQEL+DCD+ ++GCNGGLMD+AF +I G+ EED
Sbjct: 150 FSTTGAIEGATFISSGKLVSLSEQELVDCDHNGDHGCNGGLMDHAFSWISEHDGICSEED 209
Query: 221 YPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYS 280
Y YI + C K VV+ P++VAI+A R FQFY
Sbjct: 210 YAYIHSQSLCRSCK---PVVS----------------------PVAVAIDAGDRSFQFYQ 244
Query: 281 GGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCG 340
GVY+ CGTQLDHGV VGYG G Y VKNSWG WGEKGYIR+ R+ G CG
Sbjct: 245 SGVYNKTCGTQLDHGVLTVGYGVEDGQKYWKVKNSWGNSWGEKGYIRLSRDQNGRSGQCG 304
Query: 341 INKMASYP 348
I + SYP
Sbjct: 305 IAMVPSYP 312
>gi|23344734|gb|AAN28680.1| cathepsin L [Theromyzon tessulatum]
Length = 351
Score = 252 bits (644), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 151/330 (45%), Positives = 198/330 (60%), Gaps = 21/330 (6%)
Query: 36 EDLTSN-DKLIDLFESWMSKFE----KVYESLDEKLERFEIFKDNLRHIDETN----RKI 86
++L SN +++D +W KF+ KVY ++E+ R IF N + I + N
Sbjct: 25 DNLYSNFQEVLDAEVAW-HKFKLEHNKVYVGIEEESLRKTIFATNYKFIKDHNALHATGE 83
Query: 87 KNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAV 146
K++ +G+NEFAD+ EF +M GLKPD R ++ S LP VDWR KG V
Sbjct: 84 KSFTVGVNEFADMTVHEFAQMMNGLKPDSTRVSGSTY--LSPNIDAPLPVEVDWRTKGLV 141
Query: 147 THVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYA 205
+ VKNQGSCGSCWAFST ++EG + TG + LSEQ L+DC +Y N+GCNGGLM A
Sbjct: 142 SEVKNQGSCGSCWAFSTTGSLEGQHMRKTGTMVDLSEQNLVDCSTSYGNDGCNGGLMTNA 201
Query: 206 FQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QP 264
F+YI G+ EE YPY +G C+ K + T+ G+ ++P +E L +ALA P
Sbjct: 202 FKYIKDNKGIDTEEAYPYAGRDGDCKFKKNKVG-ATVTGFVEIPAGNEKKLQEALATVGP 260
Query: 265 LSVAIEASGRDFQFYSGGVYD-GHC-GTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGE 322
+SVAI+A+ + F Y GVYD C QLDHGV AVGYGS G DY IVKNSWG WGE
Sbjct: 261 VSVAIDANHQSFMLYKSGVYDEPECDSAQLDHGVLAVGYGSIHGKDYYIVKNSWGTTWGE 320
Query: 323 KGYIRMKRNTGKPE---GLCGINKMASYPI 349
+GYIR T P+ G+CGI ASYP+
Sbjct: 321 QGYIRFS-TTAVPDAIGGICGILLDASYPV 349
>gi|84660246|emb|CAI43320.1| cathepsin L [Lubomirskia baicalensis]
gi|85677150|emb|CAI46307.1| cathepsin L [Lubomirskia baicalensis]
Length = 327
Score = 252 bits (644), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 150/312 (48%), Positives = 188/312 (60%), Gaps = 13/312 (4%)
Query: 48 FESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKNY--WLGLNEFADLRHEEFK 105
+ESW + KVY S E+L R I++ N +++DE N + + +G+N+FADL EF
Sbjct: 22 WESWKKEHGKVYNSDREELTRHIIWQANRKYVDEHNAHAEKFGFTVGMNQFADLESSEFG 81
Query: 106 EMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVA 165
++ G + +K QS + FS K V DLP SVDWR KG VT +KNQG CGSCWAFS VA
Sbjct: 82 RLYNGYNNKPSMKKAQS-KVFSTK-VGDLPTSVDWRTKGFVTAIKNQGQCGSCWAFSAVA 139
Query: 166 AVEGINQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIVSTGGLHKEEDYPYI 224
+EG + TG L SLSEQ L+DC N GCNGGLMD AFQY++ GG+ E YPY
Sbjct: 140 GLEGQHFNATGTLVSLSEQNLVDCSTAEGNQGCNGGLMDNAFQYVIKNGGIDTEASYPYK 199
Query: 225 MEEGTCEMTKGESEVVTINGYHDV-PQNSEDSLLKALANQ-PLSVAIEASGRDFQFYSGG 282
+ C+ T +G+ D+ P SE +L A+A P+SVAI+AS FQ Y G
Sbjct: 200 AVDQKCKFNAANVG-STCSGFSDILPHKSEAALQVAVAVVGPISVAIDASHTSFQLYKSG 258
Query: 283 VY-DGHCG-TQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCG 340
VY + C T LDHGV AVGY S+ G+ Y IVKNSWG WG+ GYI M RN CG
Sbjct: 259 VYSESACSQTSLDHGVTAVGYDSSSGVAYWIVKNSWGTTWGQAGYIWMSRNKNNQ---CG 315
Query: 341 INKMASYPIKKK 352
I ASYPI K
Sbjct: 316 IATAASYPIVSK 327
>gi|194320502|gb|ACF48469.1| cathepsin L [Triatoma brasiliensis]
Length = 330
Score = 252 bits (644), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 143/303 (47%), Positives = 184/303 (60%), Gaps = 19/303 (6%)
Query: 57 KVYESLDEKLERFEIFKDNLRHIDETNRKIK----NYWLGLNEFADLRHEEFKEMFLGLK 112
K Y++ E++ R +IF DN + I+ N K + +Y + +N F DL EFK + G K
Sbjct: 36 KTYKNQFEEMFRMKIFMDNKKKIEAHNAKYEQGEVSYKMMMNHFGDLMVHEFKALMNGFK 95
Query: 113 --PDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGI 170
PD R + + +LPK+VDWR+KGAVT VK+QG CGSCW+FS ++EG
Sbjct: 96 MSPDTKRNGE-----LYFPSNSNLPKTVDWRQKGAVTPVKDQGQCGSCWSFSATGSLEGQ 150
Query: 171 NQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGT 229
+ TG L SLSEQ L+DC +Y NNGC GGLMD AFQY+ G+ E YPY E T
Sbjct: 151 VFLKTGKLVSLSEQNLVDCSTSYGNNGCEGGLMDQAFQYVSDNKGIDTEASYPYEARENT 210
Query: 230 CEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVAIEASGRDFQFYSGGVY-DGH 287
C K + T G+ D+P E +L ALA P+SVAI+A+ FQFYS GVY + +
Sbjct: 211 CRFKKNKVG-GTDKGHVDIPAGDEKALQNALATVGPISVAIDANHGSFQFYSKGVYNEPN 269
Query: 288 CGT-QLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMAS 346
C + LDHGV AVGYG+ G DY +VKNSWGP WGE GYI++ RN CGI MAS
Sbjct: 270 CSSYDLDHGVLAVGYGTENGQDYWLVKNSWGPSWGENGYIKIARNH---SNHCGIASMAS 326
Query: 347 YPI 349
YP+
Sbjct: 327 YPL 329
>gi|156399477|ref|XP_001638528.1| predicted protein [Nematostella vectensis]
gi|156225649|gb|EDO46465.1| predicted protein [Nematostella vectensis]
Length = 325
Score = 252 bits (644), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 145/306 (47%), Positives = 178/306 (58%), Gaps = 12/306 (3%)
Query: 48 FESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEM 107
+ +W K Y +E L R I+ DNL + + N + +Y L +N FADL EFK+
Sbjct: 27 WHAWKDFHGKTYTGEEEDLRR-AIWNDNLEIVKKHNAENHSYKLDMNHFADLTVTEFKQR 85
Query: 108 FLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAV 167
F+G + A F V LP VDWR KG VT VKNQG CGSCWAFS+ ++
Sbjct: 86 FMGYR---AASNSTGGSTFLPLSNVQLPAEVDWRDKGFVTAVKNQGQCGSCWAFSSTGSL 142
Query: 168 EGINQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIME 226
EG + TG L SLSEQ L+DC Y NNGC GGLMDYAF+YI + G+ E+ YPY
Sbjct: 143 EGQHFRKTGKLVSLSEQNLVDCSKKYGNNGCEGGLMDYAFKYIKNNDGIDTEQSYPYTAR 202
Query: 227 EGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVAIEASGRDFQFYSGGVY- 284
+G C G S T+ GY DV + SE L A+A P+SVAI+A FQ Y GVY
Sbjct: 203 DGQCHFKPG-SVGATVTGYTDVQRGSEGDLQSAVATVGPISVAIDAGHSSFQLYKTGVYS 261
Query: 285 DGHC-GTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINK 343
+ C TQLDHGV AVGYG+ G DY +VKNSWG WG GYI+M RN + CGI
Sbjct: 262 EPDCSSTQLDHGVLAVGYGAEDGKDYWLVKNSWGEGWGMNGYIKMSRN---KDNQCGIAT 318
Query: 344 MASYPI 349
ASYP+
Sbjct: 319 QASYPL 324
>gi|195429415|ref|XP_002062758.1| GK19626 [Drosophila willistoni]
gi|194158843|gb|EDW73744.1| GK19626 [Drosophila willistoni]
Length = 341
Score = 252 bits (644), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 142/307 (46%), Positives = 189/307 (61%), Gaps = 18/307 (5%)
Query: 57 KVYESLDEKLERFEIFKDNLRHIDETNRKIK----NYWLGLNEFADLRHEEFKEMFLGLK 112
K Y E+ R +IF +N HI + N++ +Y L LN++AD+ H EF+E G
Sbjct: 38 KNYADSTEETFRMKIFNENKHHIAKHNQRYATGEVSYKLALNKYADMLHHEFRETMNGFN 97
Query: 113 PDLARRKDQSHEDFS-----YKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAV 167
L ++ + E F+ + V LP +VDWR KGAVT VK+QG CGSCWAFS+ A+
Sbjct: 98 YTLHKQLRSTDESFTGVTFISPEHVKLPTAVDWRTKGAVTEVKDQGHCGSCWAFSSTGAI 157
Query: 168 EGINQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIME 226
EG + +G L SLSEQ L+DC Y NNGCNGGLMD AF+Y+ GG+ E+ Y Y
Sbjct: 158 EGQHFRKSGTLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYVKDNGGIDTEKSYAYEGI 217
Query: 227 EGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVAIEASGRDFQFYSGGVYD 285
+ +C K S T G+ D+PQ +E L +A+A P+SVAI+AS + FQFYS GVYD
Sbjct: 218 DDSCHFDK-NSIGATDRGFADIPQGNEKKLAQAVATIGPVSVAIDASQQSFQFYSEGVYD 276
Query: 286 -GHCGTQ-LDHGVAAVGYGSTR-GLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGIN 342
+C + LDHGV VGYG+ + G DY +VKNSWG WG+KG+I+M RN E CGI
Sbjct: 277 EPNCSAENLDHGVLVVGYGTEKDGSDYWLVKNSWGTTWGDKGFIKMSRN---KENQCGIA 333
Query: 343 KMASYPI 349
+SYP+
Sbjct: 334 SASSYPL 340
>gi|21425246|emb|CAD33266.1| cathepsin L [Aphis gossypii]
Length = 341
Score = 252 bits (644), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 147/309 (47%), Positives = 188/309 (60%), Gaps = 17/309 (5%)
Query: 54 KFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKN----YWLGLNEFADLRHEEFKEMFL 109
+F+K+YE + E+ R +++ DN I N+ ++ Y L +N F DL E+ +M
Sbjct: 36 QFKKLYEDIKEETFRKKVYLDNKLKIAGHNKLYESGEETYALEMNHFGDLMQHEYTKMMN 95
Query: 110 GLKPDLARRKDQSHED----FSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVA 165
G KP LA D F + V +PKSVDWRKKG VT VKNQG CGSCW+FS
Sbjct: 96 GFKPSLAGGDRNFTNDEAVTFLKSENVVIPKSVDWRKKGYVTPVKNQGQCGSCWSFSATG 155
Query: 166 AVEGINQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIVSTGGLHKEEDYPYI 224
++EG + TG L SLSEQ LIDC Y NNGC GGLMD AF+YI S GL E+ YPY
Sbjct: 156 SLEGQHFRKTGVLVSLSEQNLIDCSRKYGNNGCEGGLMDLAFKYIKSNKGLDTEKSYPYE 215
Query: 225 MEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVAIEASGRDFQFYSGGV 283
E+ C E+ T G+ D+P+ ED+L+ ALA P+S+AI+AS FQFY GV
Sbjct: 216 AEDDKCRYNP-ENSGATDKGFVDIPEGDEDALMHALATVGPVSIAIDASSEKFQFYKKGV 274
Query: 284 -YDGHC-GTQLDHGVAAVGYGS-TRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCG 340
Y+ C T+LDHGV AVG+GS +G DY IVKNSWG WG++GYI M RN + CG
Sbjct: 275 FYNPRCSSTELDHGVLAVGFGSDKKGGDYWIVKNSWGKTWGDEGYIMMARNK---KNNCG 331
Query: 341 INKMASYPI 349
+ ASYP+
Sbjct: 332 VASSASYPL 340
>gi|242072388|ref|XP_002446130.1| hypothetical protein SORBIDRAFT_06g002130 [Sorghum bicolor]
gi|241937313|gb|EES10458.1| hypothetical protein SORBIDRAFT_06g002130 [Sorghum bicolor]
Length = 276
Score = 252 bits (643), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 131/281 (46%), Positives = 174/281 (61%), Gaps = 26/281 (9%)
Query: 73 KDNLRHIDETNRKIKN-YWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDV 131
+DN+ ++ N N +WLG+N+FADL EEFK G KP A + + + V
Sbjct: 19 RDNVAFVESFNANKNNKFWLGVNQFADLTTEEFKAN-KGFKPTSAEKVPTTGFKYENLSV 77
Query: 132 VDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCD- 190
LP +VDWR KGAVT +KNQG CG CWAFS VAA+EGI ++ TGNL SLS+QEL+DCD
Sbjct: 78 SALPTAVDWRTKGAVTPIKNQGQCGCCWAFSAVAAMEGIVKLSTGNLISLSKQELVDCDT 137
Query: 191 NTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQ 250
++ + GC E PY +G C+ G TI G+ DVP
Sbjct: 138 HSMDEGC--------------------EVQLPYKAVDGKCK--GGSKSAATIKGHEDVPV 175
Query: 251 NSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYG-STRGLDY 309
N+E +L+KA+ANQP+SVA++AS R F YSGGV G CGT+LDHG+AA+GYG + G Y
Sbjct: 176 NNEAALMKAVANQPVSVAVDASDRTFMLYSGGVMTGSCGTELDHGIAAIGYGMESDGTKY 235
Query: 310 IIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
I+KNSWG WGEKG++RM+++ G+CG+ SYP +
Sbjct: 236 WILKNSWGTTWGEKGFLRMEKDITDKRGMCGLAMKPSYPTE 276
>gi|413944252|gb|AFW76901.1| hypothetical protein ZEAMMB73_101481 [Zea mays]
Length = 232
Score = 252 bits (643), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 124/230 (53%), Positives = 159/230 (69%), Gaps = 8/230 (3%)
Query: 126 FSYKDV-VD-LPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSE 183
F Y++V VD +P ++DWR GAVT +K+QG CG CWAFS VAA EGI +I TG L SLSE
Sbjct: 6 FRYENVSVDAIPATIDWRTNGAVTPIKDQGQCGCCWAFSAVAATEGIVKISTGKLISLSE 65
Query: 184 QELIDCDNTY--NNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVT 241
QEL+DCD Y + GC GGLMD AF++I+ GGL E +YPY +G C+ G +
Sbjct: 66 QELVDCD-VYGEDQGCEGGLMDDAFKFIIKNGGLTTESNYPYTAADGKCK--SGSNSAAN 122
Query: 242 INGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGY 301
I GY DVP N E +L+KA+ANQP+SVA++ FQFYSGGV G CGT LDHG+AA+GY
Sbjct: 123 IKGYEDVPTNDEAALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGY 182
Query: 302 GSTR-GLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
G T G Y ++KNSWG WGE GY+RM+++ +G+CG+ SYP +
Sbjct: 183 GKTSDGTKYWLMKNSWGTTWGENGYLRMEKDISDKKGMCGLAIEPSYPTE 232
>gi|328872971|gb|EGG21338.1| cysteine proteinase 5 precursor [Dictyostelium fasciculatum]
Length = 358
Score = 252 bits (643), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 147/360 (40%), Positives = 197/360 (54%), Gaps = 39/360 (10%)
Query: 24 FARDFSIV---GYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHID 80
FA F IV ++ S + D F +WM K + Y S E R+ ++K N+ +++
Sbjct: 3 FAVIFLIVLMLAFASASSYSEQQYRDSFTNWMQKHSRSYAS-HEFNTRYSVYKKNMDYVN 61
Query: 81 ETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVD-LPKSVD 139
E N K LGLN AD+ ++E++ ++LG K D R + S+ V LP S+D
Sbjct: 62 EWNSKGSETVLGLNSLADMTNQEYQAIYLGTKTDATARLAAASASASFGKVQGALPASID 121
Query: 140 WRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTY-NNGCN 198
W +GAVT VKNQG CGSCW+FS + EG +QI T NL +LSEQ LIDC ++Y N+GCN
Sbjct: 122 WVAQGAVTQVKNQGQCGSCWSFSATGSTEGAHQISTSNLVALSEQNLIDCSSSYGNDGCN 181
Query: 199 GGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLK 258
GGLMD AF+YI++ GG+ E YPY+ + C+ S T++ Y DV SE +L
Sbjct: 182 GGLMDNAFKYIIANGGIDTEASYPYVAKVQKCKYNPANSG-ATLSSYVDVTSGSESALQS 240
Query: 259 ALANQPLSVAIEASGRDFQFYSGGV-YDGHC-GTQLDHGVAAVGYGS------------- 303
P+SVAI+AS + FQ Y GV Y+ C T LDHGV VGYG+
Sbjct: 241 QTVKGPVSVAIDASHQSFQLYDSGVYYEPACSSTNLDHGVLVVGYGTASANGSSDSDSSA 300
Query: 304 --------------TRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPI 349
T+G + VKNSWGP+WG GYI+M RN + CGI AS PI
Sbjct: 301 ASQSSSSESSDDQATQGAQFWKVKNSWGPEWGLSGYIQMARNR---DNNCGIATTASQPI 357
>gi|260516672|gb|ACX43963.1| cysteine protease 3, partial [Brachiaria hybrid cultivar]
Length = 319
Score = 252 bits (643), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 146/311 (46%), Positives = 186/311 (59%), Gaps = 19/311 (6%)
Query: 25 ARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNR 84
AR F E++ S L D+F ++M ++ K Y S E RF FK ++ I N
Sbjct: 20 ARQFQSA-LXSEEVPSEVMLQDMFTAFMKQYSKAY-SHAEFSSRFNQFKASVETIRLHNT 77
Query: 85 KIK-NYWLGLNEFADLRHEEFKEMFLGLKP---DLARRKDQSHEDFSYKDVVDLPKSVDW 140
+Y +GLNEFADL EEFK + G K + AR + +++V P S+DW
Sbjct: 78 LANASYTMGLNEFADLSFEEFKGKYFGCKHVEREFARSNNL------HQEVEAAPTSIDW 131
Query: 141 RKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTG--NLASLSEQELIDCDNTYNN-GC 197
R AVT +K+QG CGSCWAFS ++EG ++ G L SLSEQ+L+DC +Y N GC
Sbjct: 132 RTSNAVTPIKDQGQCGSCWAFSATGSIEGA-WVLQGKHTLTSLSEQQLVDCSTSYGNAGC 190
Query: 198 NGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLL 257
NGGLMDYAF+YI++ G+ E YPY G C+ K ++VVTI+G+ DV E S L
Sbjct: 191 NGGLMDYAFEYIIANKGICAESAYPYKGVGGLCQ--KSCTKVVTISGHKDVASGDEASSL 248
Query: 258 KALAN-QPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSW 316
A+ P+SVAIEA FQFYS GV+ G CG LDHGV AVGYG+T DY IVKNSW
Sbjct: 249 NAVGTVGPVSVAIEADQAGFQFYSSGVFSGTCGHNLDHGVLAVGYGTTGSQDYWIVKNSW 308
Query: 317 GPKWGEKGYIR 327
G WGE GYIR
Sbjct: 309 GTSWGESGYIR 319
>gi|72008176|ref|XP_780713.1| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
Length = 335
Score = 252 bits (643), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 144/311 (46%), Positives = 186/311 (59%), Gaps = 13/311 (4%)
Query: 48 FESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIK----NYWLGLNEFADLRHEE 103
+ W ++ K Y S +E+ R I++ NL + + N K Y LG+N+FADL++EE
Sbjct: 28 WNQWKNEHGKRYLSDEEEASRKLIWEKNLDIVIKHNLKYDLGHFTYALGMNQFADLKNEE 87
Query: 104 FKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFST 163
F M G + + + + ++ +LPK+VDWR KG VT VK+QG CGSCWAFST
Sbjct: 88 FVAMMTGFRVNGTSKAAKGSTFLPSNNIGELPKTVDWRTKGYVTPVKDQGQCGSCWAFST 147
Query: 164 VAAVEGINQIVTGNLASLSEQELIDCDNTYNN-GCNGGLMDYAFQYIVSTGGLHKEEDYP 222
++EG + TG L SLSEQ L+DC N GC+GGLMD AFQYI+ GG+ EE YP
Sbjct: 148 TGSLEGQHFKATGKLVSLSEQNLVDCSGKEGNEGCDGGLMDQAFQYIIKAGGIDTEESYP 207
Query: 223 YIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVAIEASGRDFQFYSG 281
Y +G C K T+ GY DV +SE +L KA+A+ P+SVAI+AS FQ Y
Sbjct: 208 YKAVDGECHFKKANIG-ATVTGYTDVTSDSETALQKAVAHIGPISVAIDASHMSFQLYKS 266
Query: 282 GVY-DGHC-GTQLDHGVAAVGYGSTR-GLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGL 338
GVY + C T LDHGV AVGYG+T G DY IVKNSW WG GY+ M RN +
Sbjct: 267 GVYNEPDCSSTLLDHGVLAVGYGTTSDGTDYWIVKNSWAETWGMNGYLWMSRN---KDNQ 323
Query: 339 CGINKMASYPI 349
CGI ASYP+
Sbjct: 324 CGIATQASYPL 334
>gi|326503122|dbj|BAJ99186.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326512552|dbj|BAJ99631.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 389
Score = 251 bits (642), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 136/332 (40%), Positives = 188/332 (56%), Gaps = 26/332 (7%)
Query: 41 NDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKN----YWLGLNEF 96
+D ++ F WM+ + Y + EK RF++++ N+R+I+ N + Y LG F
Sbjct: 53 HDLMMARFHVWMTVQNRSYPTSSEKAHRFKVYRSNMRYIEALNAEATTSGFTYELGEGPF 112
Query: 97 ADLRHEEFKEMFLGLKPDLARRKDQSHED---FSYKDVVD--------------LPKSVD 139
DL EEF ++ G PD R+D H++ ++ V+ P +D
Sbjct: 113 TDLTDEEFISLYTGKIPDDDHREDGVHDEQIITTHAGSVNGAEGVTVYANFSAGAPIRMD 172
Query: 140 WRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNG 199
WRK+GAVT VK+QG CGSCWAF TVA +EGI++I G L SLSEQ+L+DCD + GCNG
Sbjct: 173 WRKRGAVTPVKDQGKCGSCWAFPTVATIEGIHKIKRGRLVSLSEQQLVDCD-FLDGGCNG 231
Query: 200 GLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKA 259
G AFQ+I+ GG+ Y Y EG C+ + + +T GY V NSE S++
Sbjct: 232 GWPRNAFQWIIQNGGITTTSSYTYKAAEGQCKGNRKPAAKIT--GYRKVKSNSEVSMVNI 289
Query: 260 LANQPLSVAIEASGRDFQFYSGGVYDGHCGT-QLDHGVAAVGYG-STRGLDYIIVKNSWG 317
+ANQP++ +I G FQ Y GG+Y+G C T +L+H + VGYG G Y IVKNSWG
Sbjct: 290 VANQPIAASIVVHGGQFQHYKGGIYNGPCATSKLNHVITIVGYGQQAYGAKYWIVKNSWG 349
Query: 318 PKWGEKGYIRMKRNTGKPEGLCGINKMASYPI 349
WG KGY+ MKR T P G CGI +P+
Sbjct: 350 AAWGNKGYMLMKRGTKNPLGQCGIAVRPIFPL 381
>gi|356517368|ref|XP_003527359.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 332
Score = 251 bits (642), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 143/311 (45%), Positives = 187/311 (60%), Gaps = 21/311 (6%)
Query: 44 LIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKI-KNYWLGLNEFADLRHE 102
+ + E M+++ KVY+ ++ FK+N+ +I+ N K Y G+N+FA
Sbjct: 35 MXERHEQRMTRYGKVYKDPPKRX-----FKENVNYIEACNNAANKPYKRGINQFAP---- 85
Query: 103 EFKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFS 162
+ F G R F +++V P +VD R+KGAVT +K+QG CG CWAFS
Sbjct: 86 --RNRFKGHMCSSIIRITT----FKFENVTATPSTVDCRQKGAVTPIKDQGQCGCCWAFS 139
Query: 163 TVAAVEGINQIVTGNLASLSEQELIDCDNT-YNNGCNGGLMDYAFQYIVSTGGLHKEEDY 221
VAA EGI+ + G L SLSEQEL+DCD + GC GGLMD AF++I+ GL
Sbjct: 140 AVAATEGIHALSAGKLISLSEQELVDCDTKGVDXGCEGGLMDDAFKFIIQNHGLKHXSQL 199
Query: 222 P-YIMEEGTCEMTKGESEVVT-INGYHDVPQNSEDS-LLKALANQPLSVAIEASGRDFQF 278
P Y+ +G C + T I GY DVP N+E + L KA+AN P+S AI+ASG DFQF
Sbjct: 200 PLYMGVDGKCNANEAAKNAATIITGYEDVPANNEKAHLQKAVANNPVSEAIDASGSDFQF 259
Query: 279 YSGGVYDGHCGTQLDHGVAAVGYG-STRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEG 337
Y GV+ G CGT+LDHGV AVGYG S G +Y +VKNSWG +WGE+GYIRM+R E
Sbjct: 260 YKSGVFTGSCGTELDHGVTAVGYGVSDDGTEYWLVKNSWGTEWGEEGYIRMQRGVDSEEA 319
Query: 338 LCGINKMASYP 348
LCGI ASYP
Sbjct: 320 LCGIAVQASYP 330
>gi|326520387|dbj|BAK07452.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 349
Score = 251 bits (642), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 144/316 (45%), Positives = 180/316 (56%), Gaps = 20/316 (6%)
Query: 44 LIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIK-NYWLGLNEFADLRHE 102
++D F W + + Y S +E+L RFE+++ N+ +ID TNR+ Y LG N+FADL E
Sbjct: 41 MMDRFRQWQATHNRSYLSAEERLRRFEVYRTNVEYIDATNRRGGLTYELGENQFADLTGE 100
Query: 103 EFKEMFLGLKPDLARRK--------DQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGS 154
EF + G A D S + D P SVDWR KGAVT VKNQGS
Sbjct: 101 EFLARYAGGHTGSAITTAAEADGLWSSGGSDGSLE--ADPPASVDWRAKGAVTPVKNQGS 158
Query: 155 -CGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTG 213
C SCWAFS VA +E + I TG L +LSEQ+L+DCD Y+ GCN G AFQ+I+ G
Sbjct: 159 QCYSCWAFSAVATMESLYFIKTGKLVALSEQQLVDCDK-YDGGCNKGYYHRAFQWIMENG 217
Query: 214 GLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASG 273
G+ YPY G C K VTI G+ V +N E +L A+A QP+ VAIE
Sbjct: 218 GITTAAQYPYKAVRGACSAAK---PAVTITGHLAVAKN-ELALQSAVARQPIGVAIEVP- 272
Query: 274 RDFQFYSGGVYDGHCGTQLDHGVAAVGYGS-TRGLDYIIVKNSWGPKWGEKGYIRMKRNT 332
QFY GV+ CG Q+ H V VGYG+ GL Y +VKNSWG WGE GYIRM+R+
Sbjct: 273 ISMQFYKSGVFSAACGIQMSHAVVTVGYGADASGLKYWLVKNSWGQTWGEAGYIRMRRDV 332
Query: 333 GKPEGLCGINKMASYP 348
G GLCGI +YP
Sbjct: 333 GG-GGLCGIALDTAYP 347
>gi|348531523|ref|XP_003453258.1| PREDICTED: cathepsin L-like [Oreochromis niloticus]
Length = 341
Score = 251 bits (642), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 145/315 (46%), Positives = 189/315 (60%), Gaps = 20/315 (6%)
Query: 48 FESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETN----RKIKNYWLGLNEFADLRHEE 103
F +W KFEK Y+S ++ +R +I+ +N +H+ N + +K+Y LG+ +FAD+ +EE
Sbjct: 33 FHAWKLKFEKSYDSESDEAQRKQIWLNNRKHVLVHNILADQGLKSYRLGMTQFADMENEE 92
Query: 104 FKEM-----FLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSC 158
+K + L RR F LP +VDWR KG VT+V+NQ CGSC
Sbjct: 93 YKRLVSQGCLHSFNSSLPRRGSTF---FRLPKGTVLPDTVDWRDKGYVTNVQNQMDCGSC 149
Query: 159 WAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNN-GCNGGLMDYAFQYIVSTGGLHK 217
WAFS ++EG + TG L SLS+Q+L+DC + N GCNGGLMD AFQYI + GG+
Sbjct: 150 WAFSATGSLEGQHFRKTGKLVSLSKQQLVDCSGEFGNEGCNGGLMDSAFQYIQANGGIDT 209
Query: 218 EEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVAIEASGRDF 276
EE YPY E+G C +S T GY DV +E++L +A+A P+SVAI+A F
Sbjct: 210 EESYPYEAEDGKCRYNP-KSTGATCTGYVDVQPANEETLKEAVATIGPISVAIDAFHPSF 268
Query: 277 QFYSGGVYD-GHC-GTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGK 334
QFY GVYD C T LDH V AVGYG+ GLDY +VKNS G WGEKGYI+M RN
Sbjct: 269 QFYESGVYDEPDCSSTMLDHAVLAVGYGTENGLDYWLVKNSAGVGWGEKGYIKMSRNKSN 328
Query: 335 PEGLCGINKMASYPI 349
CGI ASYP+
Sbjct: 329 Q---CGIATAASYPL 340
>gi|3097321|dbj|BAA25899.1| Bd 30K [Glycine max]
gi|84371705|gb|ABC56139.1| 34 kDa maturing seed protein [Glycine max]
gi|195957142|gb|ACG59282.1| major allergen Gly m Bd 30K [Glycine max]
gi|223452512|gb|ACM89583.1| maturing seed protein [Glycine max]
gi|226432468|gb|ACO55749.1| Gly m Bd 30K allergen [Glycine max]
gi|320090153|gb|ADW08728.1| P34 allergen [Glycine max]
gi|320090155|gb|ADW08729.1| P34 allergen [Glycine max]
gi|320090157|gb|ADW08730.1| P34 allergen [Glycine max]
gi|320090159|gb|ADW08731.1| P34 allergen [Glycine max]
Length = 379
Score = 251 bits (641), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 141/342 (41%), Positives = 205/342 (59%), Gaps = 25/342 (7%)
Query: 29 SIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKN 88
SI+ T+ ++ LF+ W S+ +VY + +E+ +R EIFK+NL +I + N K+
Sbjct: 25 SILDLDLTKFTTQKQVSSLFQLWKSEHGRVYHNHEEEAKRLEIFKNNLNYIRDMNANRKS 84
Query: 89 ---YWLGLNEFADLRHEEFKEMFLGLKPDLARR-----KDQSHEDFSYKDVVDLPKSVDW 140
+ LGLN+FAD+ +EF + +L D++++ K E +S P S DW
Sbjct: 85 PHSHRLGLNKFADITPQEFSKKYLQAPKDVSQQIKMANKKMKKEQYSCDHP---PASWDW 141
Query: 141 RKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGG 200
RKKG +T VK QG CGS WAFS A+E + I TG+L SLSEQEL+DC + GC G
Sbjct: 142 RKKGVITQVKYQGGCGSGWAFSATGAIEAAHAIATGDLVSLSEQELVDCVEE-SEGCYNG 200
Query: 201 LMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDV-------PQNSE 253
+F++++ GG+ ++DYPY +EG C+ K + +V TI+GY + +E
Sbjct: 201 WHYQSFEWVLEHGGIATDDDYPYRAKEGRCKANKIQDKV-TIDGYETLIMSDESTESETE 259
Query: 254 DSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQ---LDHGVAAVGYGSTRGLDYI 310
+ L A+ QP+SV+I+A +DF Y+GG+YDG T ++H V VGYGS G+DY
Sbjct: 260 QAFLSAILEQPISVSIDA--KDFHLYTGGIYDGENCTSPYGINHFVLLVGYGSADGVDYW 317
Query: 311 IVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIKKK 352
I KNSWG WGE GYI ++RNTG G+CG+N ASYP K++
Sbjct: 318 IAKNSWGEDWGEDGYIWIQRNTGNLLGVCGMNYFASYPTKEE 359
>gi|357130486|ref|XP_003566879.1| PREDICTED: fruit bromelain-like [Brachypodium distachyon]
Length = 354
Score = 251 bits (641), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 138/319 (43%), Positives = 186/319 (58%), Gaps = 24/319 (7%)
Query: 49 ESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRK-IKNYWLGLNEFADLRHEEFKEM 107
E WM++F + Y+ DEK R E+F N RH+D NR + Y LGLN F+DL EF +
Sbjct: 39 ERWMARFGRAYKDADEKARRQEVFGANARHVDAVNRSGNRTYTLGLNHFSDLTDHEFLQQ 98
Query: 108 FLGLK------PDLARRKDQSHEDFSYKDVV-----DLPKSVDWRKKGAVTHVKNQGSCG 156
LG + L R +DQ D S + D+P SVDWR +GAVT +KNQ SCG
Sbjct: 99 HLGYRHHQPGPGGLLRPEDQ---DMSKATALADYGQDVPDSVDWRAQGAVTEIKNQRSCG 155
Query: 157 SCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLH 216
SCWAF+ VAA EG+ +I TGNL S+SEQ+++DC N C+GG ++ A +Y+ ++GGL
Sbjct: 156 SCWAFAAVAATEGLVKIATGNLISMSEQQVLDCTGGGNT-CDGGDINAALRYVAASGGLQ 214
Query: 217 KEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKAL-ANQPLSVAIEASGRD 275
E Y Y ++G C + ++ G ++ L+ L A QP++VA+EAS D
Sbjct: 215 PEAAYAYAAQKGACRGASPANSAASVGGARFARLGGDEGALRGLAAGQPVAVALEASEPD 274
Query: 276 FQFYSGGVYDG--HCGTQLDHGVAAVGYGST--RGLDYIIVKNSWGPKWGEKGYIRMKRN 331
F+ Y GVY G CG +L+HGV VGYG+ G +Y +VKN WG WGEKGY+R+ R
Sbjct: 275 FRHYKSGVYAGSASCGRRLNHGVTVVGYGAEDDSGDEYWVVKNQWGTLWGEKGYMRVAR- 333
Query: 332 TGKPEGL-CGINKMASYPI 349
G G CGI A YP
Sbjct: 334 -GDVAGANCGIASYAYYPT 351
>gi|357153071|ref|XP_003576329.1| PREDICTED: fruit bromelain-like [Brachypodium distachyon]
Length = 398
Score = 251 bits (640), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 144/349 (41%), Positives = 194/349 (55%), Gaps = 33/349 (9%)
Query: 33 YSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKN---- 88
YS D ++ ++ F+ WM+ + Y + +E RFE++K N+R+I+ N +
Sbjct: 47 YSGRDKHNDLLMMGRFQGWMAAQGRSYWTAEETARRFEVYKSNVRYIEAVNAEAATTGLT 106
Query: 89 YWLGLNEFADLRHEEFKEMFLGLKPDLARRK--DQSHED----FSYKDVVDL-------- 134
+ LG F DL HEEF ++ G P + D ED + D VD+
Sbjct: 107 FELGEGPFTDLTHEEFSALYNGSMPPPEEEEGDDIQEEDEQVIATVVDGVDVNVAVHTNL 166
Query: 135 ---------PKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQE 185
P+S DWRK GAVT +K+QG CGSCWAF TVA +EG ++IV GNL SLSEQ+
Sbjct: 167 SAGGPRPWPPRSRDWRKHGAVTPIKDQGRCGSCWAFPTVATIEGKHKIVRGNLVSLSEQQ 226
Query: 186 LIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGY 245
LIDCD T N+GC GG + A+++I GGL YPY G C K I G+
Sbjct: 227 LIDCDYT-NSGCKGGFVIRAYRWIRKIGGLTTSSAYPYKGARGKC--MKRRRAAARIAGW 283
Query: 246 HDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGT-QLDHGVAAVGYG-- 302
V SE +L+ A+A QP++V I ASG++FQ Y G+ +G C T +L+H V VGYG
Sbjct: 284 RSVRSRSEVALVNAVAGQPVAVYISASGKNFQHYKKGILNGPCDTARLNHAVTVVGYGRQ 343
Query: 303 STRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIKK 351
+ G Y IVKNSWG WG++GYI MKR T P G CGI +P+ K
Sbjct: 344 ADTGAKYWIVKNSWGTTWGQEGYILMKRGTRNPRGQCGIATSPVFPLMK 392
>gi|52076128|dbj|BAD46641.1| putative cysteine proteinase [Oryza sativa Japonica Group]
gi|52076135|dbj|BAD46648.1| putative cysteine proteinase [Oryza sativa Japonica Group]
Length = 374
Score = 251 bits (640), Expect = 5e-64, Method: Compositional matrix adjust.
Identities = 148/334 (44%), Positives = 192/334 (57%), Gaps = 22/334 (6%)
Query: 36 EDLTSNDKLIDLFESWMSKFEKVYES---LDEKLERFEIFKDNLRHIDETNRKI-KNYWL 91
+DL S + + L++ W + S L +K RFE+FK N R+I + NRK +Y L
Sbjct: 31 KDLESEESMWSLYQRWRHVYGAASSSPRDLADKGSRFEVFKKNARYIHDFNRKKGMSYKL 90
Query: 92 GLNEFADLRHEEFKEMFLGLKP-DLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVK 150
GLN+FADL EEF + G P + K+ + D P + DWR+ GAVT VK
Sbjct: 91 GLNKFADLTLEEFTAKYTGANPGPITGLKNGTGSPPLAAVAGDAPPAWDWREHGAVTRVK 150
Query: 151 NQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIV 210
+QG CGSCWAFS V AVEGIN I+TGNL +LSEQ+++DC + C+GG YAF Y V
Sbjct: 151 DQGPCGSCWAFSVVEAVEGINAIMTGNLLTLSEQQVLDCSGAGD--CSGGYTSYAFDYAV 208
Query: 211 STGGLHKE--------EDY----PYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLK 258
S G + E+Y Y + C ++ +V I+ Y V N E++L +
Sbjct: 209 SNGITLDQCFSPPTTGENYFYYPAYEAVQEPCRFDPNKAPIVKIDSYSFVDPNDEEALKQ 268
Query: 259 ALANQ-PLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTR-GLDYIIVKNSW 316
A+ +Q P+SV IEAS +F Y GGV+ G CGT+L+H V VGY T G Y IVKNSW
Sbjct: 269 AVYSQGPVSVLIEAS-YEFMIYQGGVFSGPCGTELNHAVLVVGYDETEDGTPYWIVKNSW 327
Query: 317 GPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
G WGE GYIRM RN PEG+CGI YPIK
Sbjct: 328 GAGWGESGYIRMIRNIPAPEGICGIAMYPIYPIK 361
>gi|413956349|gb|AFW88998.1| hypothetical protein ZEAMMB73_678859 [Zea mays]
Length = 1140
Score = 251 bits (640), Expect = 5e-64, Method: Compositional matrix adjust.
Identities = 113/184 (61%), Positives = 139/184 (75%)
Query: 156 GSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGL 215
GSCWAFST+AAVEGINQIVTG+L SLSEQEL+DCD +YN GCNGGLMDYAF++I++ GG+
Sbjct: 780 GSCWAFSTIAAVEGINQIVTGDLISLSEQELVDCDTSYNQGCNGGLMDYAFEFIINNGGI 839
Query: 216 HKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRD 275
E+DYPY +G C++ + ++VVTI+ Y DVP N E SL KA+ANQP+SVAIEA+G
Sbjct: 840 DTEKDYPYKGTDGRCDVNRKNAKVVTIDSYEDVPANDEKSLQKAVANQPVSVAIEAAGTT 899
Query: 276 FQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKP 335
FQ YS G++ G CGT LDHGV AVGYG+ G DY I+KNSWG WGE G +R
Sbjct: 900 FQLYSSGIFTGSCGTALDHGVTAVGYGTENGKDYWIMKNSWGSSWGESGRAPTRRTLAPA 959
Query: 336 EGLC 339
+C
Sbjct: 960 PAVC 963
>gi|391328505|ref|XP_003738729.1| PREDICTED: cathepsin L-like [Metaseiulus occidentalis]
Length = 323
Score = 250 bits (639), Expect = 6e-64, Method: Compositional matrix adjust.
Identities = 143/295 (48%), Positives = 184/295 (62%), Gaps = 15/295 (5%)
Query: 63 DEKLERFEIFKDNLRHIDETNRK----IKNYWLGLNEFADLRHEEFKEMFLGLKPDLARR 118
DE+ R ++F ++ I+ N + + Y +GLN+F D+ EEF+ F GLK D +
Sbjct: 33 DEEHFRRQLFYKSVAKINAHNLRHDLGLTTYRMGLNKFTDMTSEEFRN-FKGLKFDATKT 91
Query: 119 KDQSHEDFSYKDVVD-LPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGN 177
K ++ F + + + LP VDWR+KG VT VKNQG CGSCWAFST ++EG + TG
Sbjct: 92 K-RNGTRFQKELLGEALPTQVDWREKGYVTPVKNQGQCGSCWAFSTTGSLEGQHFKATGK 150
Query: 178 LASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGE 236
L SLSEQ L+DC NNGCNGGLMD F YI GG+ EE YPY ++G C +
Sbjct: 151 LVSLSEQNLVDCSRVEGNNGCNGGLMDNGFTYIQQNGGIDTEESYPYTGKDGDCAFNE-N 209
Query: 237 SEVVTINGYHDVPQNSEDSLLKALAN-QPLSVAIEASGRDFQFYSGGVYD-GHCG-TQLD 293
S + G+ DVPQ E +L A+A+ P+SVAI+AS FQ+Y GVYD C +QLD
Sbjct: 210 SVGARVKGFVDVPQRDEAALQAAVASVGPVSVAIDASNDSFQYYKEGVYDEPSCSFSQLD 269
Query: 294 HGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
HGV VGYG+ G+DY +VKNSWGP WG+ GYI+M RN E CGI MASYP
Sbjct: 270 HGVLVVGYGTENGVDYWLVKNSWGPTWGQDGYIKMMRN---KENQCGIASMASYP 321
>gi|195056367|ref|XP_001995082.1| GH22826 [Drosophila grimshawi]
gi|193899288|gb|EDV98154.1| GH22826 [Drosophila grimshawi]
Length = 340
Score = 250 bits (639), Expect = 6e-64, Method: Compositional matrix adjust.
Identities = 149/318 (46%), Positives = 191/318 (60%), Gaps = 21/318 (6%)
Query: 49 ESWMS-KFEKVYESLDEKLERF--EIFKDNLRHIDETNRKIK----NYWLGLNEFADLRH 101
E W + K E + DE ERF +IF +N I + N+ ++ +GLN++AD+ H
Sbjct: 26 EEWQTFKLEHRKQYQDETEERFRLKIFNENKHKIAKHNQLYAAGEVSFKMGLNKYADMLH 85
Query: 102 EEFKEMFLGLKPDLARRKDQSHEDFS-----YKDVVDLPKSVDWRKKGAVTHVKNQGSCG 156
EF E G L ++ S F+ + V LP+SVDWR KGAVT VK+QG CG
Sbjct: 86 HEFHETMNGFNYTLHKQLRASDATFTGVTFISPEHVKLPQSVDWRNKGAVTGVKDQGHCG 145
Query: 157 SCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIVSTGGL 215
SCWAFS+ A+EG + TG L SLSEQ L+DC Y NNGCNGGLMD AF+YI GG+
Sbjct: 146 SCWAFSSTGALEGQHFRKTGTLISLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGGI 205
Query: 216 HKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVAIEASGR 274
E+ YPY + +C KG + T G+ D+PQ E L +A+A P+SVAI+AS
Sbjct: 206 DTEKSYPYEGIDDSCHFNKG-TIGATDRGFTDIPQGDEKKLAQAVATIGPVSVAIDASHE 264
Query: 275 DFQFYSGGVYD-GHCGTQ-LDHGVAAVGYGS-TRGLDYIIVKNSWGPKWGEKGYIRMKRN 331
FQFYS GVYD C Q LDHGV VGYG+ G DY +VKNSWG WG+KG+I+M RN
Sbjct: 265 SFQFYSTGVYDEPQCDPQNLDHGVLVVGYGTDENGKDYWLVKNSWGTTWGDKGFIKMARN 324
Query: 332 TGKPEGLCGINKMASYPI 349
+ CGI +SYP+
Sbjct: 325 ---DDNQCGIATASSYPL 339
>gi|229366214|gb|ACQ58087.1| Cathepsin L precursor [Anoplopoma fimbria]
Length = 334
Score = 250 bits (638), Expect = 7e-64, Method: Compositional matrix adjust.
Identities = 149/315 (47%), Positives = 183/315 (58%), Gaps = 20/315 (6%)
Query: 48 FESWMSKFEKVYESLDEKLERFEIFKDNLR----HIDETNRKIKNYWLGLNEFADLRHEE 103
F +W +F + Y S E+ +R EI+ N R H ++ IK+Y LG+ FAD+ +EE
Sbjct: 26 FHAWKLQFGRSYNSPAEEAQRKEIWLSNRRLVLVHNIMADQGIKSYRLGMTYFADMENEE 85
Query: 104 FKEMF----LG-LKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSC 158
+K LG L RR + DLP SVDWR+KG VT VK+Q CGSC
Sbjct: 86 YKRQISQGCLGSFNASLPRRGSAY---LRLPEGADLPNSVDWREKGYVTDVKDQKQCGSC 142
Query: 159 WAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNN-GCNGGLMDYAFQYIVSTGGLHK 217
WAFST ++EG TG L SLSEQ+L+DC Y N GC GGLMD AF+YI + GG+
Sbjct: 143 WAFSTTGSLEGQTFRKTGKLVSLSEQQLVDCSGDYGNEGCMGGLMDSAFRYIQANGGIDT 202
Query: 218 EEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVAIEASGRDF 276
E+ YPY E+G C T GY DV Q ED+L +ALA P+SVAI+AS F
Sbjct: 203 EDSYPYEAEDGQCRYNSANIG-ATCTGYVDVKQGDEDALKEALATIGPVSVAIDASHSSF 261
Query: 277 QFYSGGVYD-GHC-GTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGK 334
Q Y GVYD C ++LDHGV AVGYGS G DY +VKNSWG WG KGYI M RN
Sbjct: 262 QLYESGVYDEPECSSSELDHGVLAVGYGSDNGHDYWLVKNSWGLGWGNKGYIMMTRNK-- 319
Query: 335 PEGLCGINKMASYPI 349
CGI +SYP+
Sbjct: 320 -HNQCGIATASSYPL 333
>gi|401397136|ref|XP_003879989.1| cathepsin L, related [Neospora caninum Liverpool]
gi|325114397|emb|CBZ49954.1| cathepsin L, related [Neospora caninum Liverpool]
Length = 415
Score = 250 bits (638), Expect = 7e-64, Method: Compositional matrix adjust.
Identities = 133/294 (45%), Positives = 183/294 (62%), Gaps = 7/294 (2%)
Query: 46 DLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFK 105
+ F S+ + + K Y + +E +R+ IFK+NL +I N++ +Y L +N F DL EEF+
Sbjct: 117 NAFGSFRATYGKSYATEEETQKRYAIFKNNLAYIHTHNQQGYSYSLKMNHFGDLSREEFR 176
Query: 106 EMFLGLKP--DLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFST 163
+LG +L + D+P +VDWR+KG VT VK+Q CGSCWAFS
Sbjct: 177 RKYLGYNKSRNLKSNNLGVATELLKVSPSDVPSAVDWREKGCVTPVKDQRDCGSCWAFSA 236
Query: 164 VAAVEGINQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIVSTGGLHKEEDYP 222
A+EG + TG L SLSEQEL+DC N GC+GG M+ AFQY+V +GGL EE YP
Sbjct: 237 TGALEGAHCAKTGELLSLSEQELVDCSLAEGNQGCSGGEMNDAFQYVVDSGGLCSEEGYP 296
Query: 223 YIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGG 282
Y+ +G C+ + +VVTI+G+ DVP+ SE ++ ALA+ P+S+AIEA FQFY G
Sbjct: 297 YLARDGECK--RACKKVVTISGFKDVPRKSETAMKAALAHSPVSIAIEADQLPFQFYHEG 354
Query: 283 VYDGHCGTQLDHGVAAVGYGSTRGL--DYIIVKNSWGPKWGEKGYIRMKRNTGK 334
V+D CGT LDHGV VGYG+ + D+ I+KNSWG WG GY+ M + G+
Sbjct: 355 VFDASCGTDLDHGVLLVGYGTDKETKKDFWIMKNSWGSGWGRDGYMYMAMHKGE 408
>gi|221090861|ref|XP_002167224.1| PREDICTED: cathepsin L-like [Hydra magnipapillata]
Length = 324
Score = 250 bits (638), Expect = 9e-64, Method: Compositional matrix adjust.
Identities = 142/303 (46%), Positives = 181/303 (59%), Gaps = 13/303 (4%)
Query: 51 WMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLG 110
W KVY E+ R+ I+KDN R I E N K ++ L +N+F D+ + EFK F G
Sbjct: 30 WKMYHNKVYSHDGEETVRYTIWKDNERRIREHNLKGGDFLLKMNQFGDMTNSEFK-AFNG 88
Query: 111 LKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGI 170
K + F + P +VDWR +G VT VK+QG CGSCWAFST ++EG
Sbjct: 89 Y----LSHKHVNGSTFLTPNNFVAPDTVDWRNEGYVTPVKDQGQCGSCWAFSTTGSLEGQ 144
Query: 171 NQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGT 229
+ TG L SLSEQ L+DC Y NNGCNGGLMD AF YI G+ E YPY E+G
Sbjct: 145 HFKKTGKLVSLSEQNLVDCSTAYGNNGCNGGLMDNAFTYIKENKGIDSEASYPYTAEDGK 204
Query: 230 CEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVAIEASGRDFQFYSGGVYD--G 286
C K S T G+ D+P+ +E+ L +A+A+ P+SVAI+AS FQFYS GVY+
Sbjct: 205 CVFKK-PSVAATDTGFVDLPEGNENKLKEAVASVGPISVAIDASHESFQFYSSGVYNEPS 263
Query: 287 HCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMAS 346
T+LDHGV VGYG+ G DY +VKNSW WG+KGYI+M+RN + CGI AS
Sbjct: 264 CSSTELDHGVLVVGYGTESGKDYWLVKNSWNTSWGDKGYIKMRRNA---KNQCGIATKAS 320
Query: 347 YPI 349
YP+
Sbjct: 321 YPL 323
>gi|59798094|sp|P84347.1|MEX2_JACME RecName: Full=Chymomexicain
Length = 215
Score = 250 bits (638), Expect = 9e-64, Method: Compositional matrix adjust.
Identities = 119/216 (55%), Positives = 151/216 (69%), Gaps = 5/216 (2%)
Query: 135 PKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYN 194
P+S+DWR KGAVT VKNQ CGSCWAFSTVA VEGIN+I TG L SLSEQEL+DCD +
Sbjct: 2 PESIDWRDKGAVTPVKNQNPCGSCWAFSTVATVEGINKIRTGKLISLSEQELLDCDRR-S 60
Query: 195 NGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSED 254
+GC GG + QY+ GG+H E++YPY ++G C + + V I GY VP N E
Sbjct: 61 HGCKGGYQTGSIQYVADNGGVHTEKEYPYEKKQGKCRAKEKKGTKVQITGYKRVPANDEI 120
Query: 255 SLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKN 314
SL++ + NQP+SV E+ GR FQ Y GG+++G CG + DH V A+GYG + LD KN
Sbjct: 121 SLIQGIGNQPVSVLHESKGRAFQLYKGGIFNGPCGYKNDHAVTAIGYGKAQLLD----KN 176
Query: 315 SWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
SWGP WGEKGYI++KR +GK EG CG+ K + +PIK
Sbjct: 177 SWGPNWGEKGYIKIKRASGKSEGTCGVYKSSYFPIK 212
>gi|294938848|ref|XP_002782226.1| cathepsin L, putative [Perkinsus marinus ATCC 50983]
gi|239893730|gb|EER14021.1| cathepsin L, putative [Perkinsus marinus ATCC 50983]
Length = 334
Score = 250 bits (638), Expect = 9e-64, Method: Compositional matrix adjust.
Identities = 133/293 (45%), Positives = 187/293 (63%), Gaps = 11/293 (3%)
Query: 45 IDL-FESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEE 103
+DL F + KF K YES +E+++R IF+ +L +I++ N K +Y LG+NE ADL HEE
Sbjct: 24 VDLAFMGFQHKFGKNYESKEEEIKRNAIFRAHLHYIEQVNAKNLSYKLGVNEHADLTHEE 83
Query: 104 FKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFST 163
F + LG ++ ++D + D L SVDWR KG +T +K+QG CGSCWAFS
Sbjct: 84 FAALKLGTSSKMSMKRDD--KLVVKADTTQLLTSVDWRSKGVLTPIKDQGPCGSCWAFSA 141
Query: 164 VAAVEGINQIVTGNLASLSEQELIDCDNTYNN-GCNGGLMDYAFQYIVSTGGLHKEEDYP 222
A+E I TG L SLSEQ+LIDC ++Y N GC+GGLM+ A+ YI S GL +E YP
Sbjct: 142 TGALEAQYAIATGKLLSLSEQQLIDCSSSYGNEGCSGGLMENAYTYIKS-AGLDQESTYP 200
Query: 223 YIMEEGTCEMT-KGESEVVT---INGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQF 278
YI + C+++ + S+ + + G+H + Q +E L+KALA+ P+S+A+ AS DF+F
Sbjct: 201 YIAKNNACQVSLEKRSDGIPAGEVTGFHMLDQ-TEQGLMKALADAPVSIAMYASDPDFRF 259
Query: 279 YSGGVYDGH-CGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKR 330
Y GVY C +DHGV AVGYG+ G DY +++NSWG WG+ GY +KR
Sbjct: 260 YQSGVYSSKTCHGTIDHGVVAVGYGTENGEDYFVIRNSWGSSWGQDGYFYLKR 312
>gi|325303202|tpg|DAA34687.1| TPA_inf: cathepsin L-like cysteine proteinase B [Amblyomma
variegatum]
Length = 337
Score = 249 bits (637), Expect = 9e-64, Method: Compositional matrix adjust.
Identities = 144/304 (47%), Positives = 186/304 (61%), Gaps = 16/304 (5%)
Query: 57 KVYESLDEKLERFEIFKDNLRHIDETNRKIKN----YWLGLNEFADLRHEEFKEMFLGLK 112
K Y+S E+ R +I+ +N I N K N Y L +NE+ D+ H EF G +
Sbjct: 38 KEYQSETEEYYRLKIYMENRMMIARHNEKYANNKVSYKLAMNEYGDMLHHEFVSTRNGFR 97
Query: 113 PDLARRKDQSHEDFSYKDVVD--LPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGI 170
D + Q + + D LPK+VDWRKKGAVT VKNQG CGSCWAFST ++EG
Sbjct: 98 RDYRSKPRQGSFYIEPEGIEDKHLPKTVDWRKKGAVTPVKNQGQCGSCWAFSTTGSLEGQ 157
Query: 171 NQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGT 229
+ +G++ SLSEQ L+DC + NNGC GGLMD AF+YI + GG+ E+ YPY +GT
Sbjct: 158 HFRKSGDMVSLSEQNLVDCSTAFGNNGCEGGLMDNAFKYIKANGGIDTEKSYPYNGTDGT 217
Query: 230 CEMTKGESEV-VTINGYHDVPQNSEDSLLKALAN-QPLSVAIEASGRDFQFYSGGVYD-G 286
C K S+V T G+ D+P+ +E L KA+A P+SVAI+AS + FQFYS GVYD
Sbjct: 218 CHFKK--SDVGATDTGFVDIPEGNEHLLKKAVATVGPISVAIDASHQSFQFYSQGVYDEP 275
Query: 287 HCGTQ-LDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMA 345
C ++ LDHGV VGYG+ DY +VKNSWG WG+ GYI M RN + CGI A
Sbjct: 276 ECSSENLDHGVLVVGYGTKDDQDYWLVKNSWGTTWGDGGYIYMTRN---KDNQCGIASSA 332
Query: 346 SYPI 349
SYP+
Sbjct: 333 SYPL 336
>gi|384247445|gb|EIE20932.1| hypothetical protein COCSUDRAFT_18161 [Coccomyxa subellipsoidea
C-169]
Length = 387
Score = 249 bits (637), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 145/330 (43%), Positives = 184/330 (55%), Gaps = 37/330 (11%)
Query: 55 FEKVYESLDEKLERFEIFKDNLRHIDETNRKIKNYW------------------------ 90
F K Y + +E R IFK N+ +I N ++Y
Sbjct: 7 FNKKYSNEEEAALRLNIFKTNVDYITSVNSAQQSYQASKHFSENTQQTALSSLFLSQLAH 66
Query: 91 ------LGLNEFADLRHEEFKEMFLGLKP-DLARRKDQSHEDFSYKDVVDLPKSVDWRKK 143
LGLNEFAD EEF LGL + + ++ F + DV S++W +
Sbjct: 67 TDLLPQLGLNEFADQTWEEFSSTHLGLNAGEDGSFRSSANTGFRHADVTP-ANSINWVEA 125
Query: 144 GAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMD 203
GAVT VKNQ CGSCWAFST +VEG N + TG+L SLSEQ+L+DCD + GC GGLMD
Sbjct: 126 GAVTPVKNQAFCGSCWAFSTTGSVEGANFLATGDLVSLSEQQLVDCDTKKDQGCGGGLMD 185
Query: 204 YAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQ 263
YAF YI+ GGL EEDY Y G C + E VV+I+GY DVP N E +L KA++ Q
Sbjct: 186 YAFDYIIKNGGLDTEEDYSYWSVGGFCNKLREERTVVSIDGYEDVPVNDEVALAKAVSKQ 245
Query: 264 PLSVAIEASGRDFQFYSGGVY--DGHCGTQLDHGVAAVGYG-STRGLDYIIVKNSWGPKW 320
P+SVAI AS QFYS GV G C L+HGV A GY G Y +VKNSWG W
Sbjct: 246 PVSVAICAS-EAMQFYSSGVIAAKGSC-IGLNHGVLAAGYDVDESGKPYWLVKNSWGGTW 303
Query: 321 GEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
G +GY+++++++ EG CGI ASYP+K
Sbjct: 304 GMQGYMKLEKDSSVKEGACGIAMAASYPVK 333
>gi|442754503|gb|JAA69411.1| Putative cathepsin l-like cysteine proteinase b [Ixodes ricinus]
Length = 335
Score = 249 bits (637), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 145/315 (46%), Positives = 193/315 (61%), Gaps = 20/315 (6%)
Query: 48 FESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKN----YWLGLNEFADLRHEE 103
+ ++ +K K Y S E++ R +I+ +N I + N K Y + +NEF D+ H E
Sbjct: 27 WSAFKAKHGKSYVSETEEVFRLKIYMENRHKIAKHNEKYARGEVPYSMAMNEFGDMLHHE 86
Query: 104 FKEMFLGLKPDLARRKDQSHEDFSY---KDVVD--LPKSVDWRKKGAVTHVKNQGSCGSC 158
F G K + KDQ E +Y +++ D LPK+VDWR KGAVT VKNQG CGSC
Sbjct: 87 FVSTRNGFKRNY---KDQPREGSTYLEPENIEDFSLPKTVDWRTKGAVTPVKNQGQCGSC 143
Query: 159 WAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIVSTGGLHK 217
WAFS ++EG + +G++ SLSEQ L+DC + NNGC GGLMD AF+YI + G+
Sbjct: 144 WAFSATGSLEGQHFRKSGSMVSLSEQNLVDCSTDFGNNGCEGGLMDNAFKYIRANKGIDT 203
Query: 218 EEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVAIEASGRDF 276
E+ YPY +GTC K + T +G+ D+ + SE L KA+A P+SVAI+AS F
Sbjct: 204 EKSYPYNGTDGTCHFKK-STVGATDSGFVDIKEGSETQLKKAVATVGPISVAIDASHESF 262
Query: 277 QFYSGGVYD-GHCGTQ-LDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGK 334
QFYS GVYD C ++ LDHGV VGYG+ G DY +VKNSWG WG++GYIRM RN
Sbjct: 263 QFYSDGVYDEPECDSESLDHGVLVVGYGTLNGTDYWLVKNSWGTTWGDEGYIRMSRN--- 319
Query: 335 PEGLCGINKMASYPI 349
+ CGI ASYP+
Sbjct: 320 KKNQCGIASSASYPL 334
>gi|255586666|ref|XP_002533962.1| cysteine protease, putative [Ricinus communis]
gi|223526059|gb|EEF28418.1| cysteine protease, putative [Ricinus communis]
Length = 417
Score = 249 bits (637), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 128/281 (45%), Positives = 190/281 (67%), Gaps = 14/281 (4%)
Query: 15 CISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKD 74
C+SF + ++SIVG +L S +++ +LF+ W K KVY+ ++E +R E F+
Sbjct: 20 CLSF----TLPDEYSIVGNDLHELLSEERVKELFQQWKEKHRKVYKHVEEAEKRLENFRR 75
Query: 75 NLRHIDETNRKIKN----YWLGLNEFADLRHEEFKEMFLG-LKPDLARRKDQ--SHEDFS 127
NL+++ E N+K KN + +GLN+FAD+ + EF++ +L +K + +R + + +
Sbjct: 76 NLKYVVEKNQKKKNLGSAHTVGLNKFADMSNVEFRQKYLSKVKKPIKKRNNNLMTSRQRN 135
Query: 128 YKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELI 187
+ V P S+DWRKKG VT VK+QG CGSCWAFS+ A+EGIN IVTG+L SLSEQEL+
Sbjct: 136 LQSCV-APSSLDWRKKGVVTPVKDQGDCGSCWAFSSTGAIEGINAIVTGDLVSLSEQELM 194
Query: 188 DCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHD 247
DCD T N GC+GG MDYAF+++++ GG+ E DYPY +GTC + K E++VV+++GY D
Sbjct: 195 DCDTT-NYGCDGGYMDYAFEWVINNGGIDTEIDYPYTGVDGTCNIAKEETKVVSVDGYED 253
Query: 248 VPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHC 288
V + S+ +LL A QP+SV I+ S DFQ Y+ G+Y+G C
Sbjct: 254 VAE-SDSALLCATVQQPISVGIDGSAIDFQLYTSGIYNGSC 293
>gi|402770499|gb|AFQ98384.1| cathepsin L, partial [Hyalomma anatolicum anatolicum]
Length = 312
Score = 249 bits (637), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 147/312 (47%), Positives = 191/312 (61%), Gaps = 17/312 (5%)
Query: 48 FESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRK----IKNYWLGLNEFADLRHEE 103
+E++ + +K Y+S E+L R++IF +N I + N K + +Y LG+N+F DL E
Sbjct: 7 WEAFKTTHKKSYQSKMEELLRYKIFTENSLLIAKHNAKYAKGLVSYKLGMNQFGDLLPHE 66
Query: 104 FKEMFLGLKPDLARRKDQSHEDFSYKDVVD--LPKSVDWRKKGAVTHVKNQGSCGSCWAF 161
F +MF G RK + +V D LPK+VDWRKKGAVT VK+QG CGSCWAF
Sbjct: 67 FAKMFNGYH---GERKGRGSTFLPPANVNDSSLPKTVDWRKKGAVTPVKDQGQCGSCWAF 123
Query: 162 STVAAVEGINQIVTGNLASLSEQELIDCDNTYNN-GCNGGLMDYAFQYIVSTGGLHKEED 220
S ++EG + + +G L SLSEQ LIDC ++ N GC GGLMD AF+YI + G+ EE
Sbjct: 124 SATGSLEGQHFLKSGKLVSLSEQNLIDCSGSFGNEGCGGGLMDNAFKYIKANDGIDTEES 183
Query: 221 YPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVAIEASGRDFQFY 279
YPY +G C K E T G+ D+ Q SED L KA+A P+SVAI+AS FQ Y
Sbjct: 184 YPYEAMDGDCRFKK-EDVGATDTGFVDIQQGSEDDLQKAVATVGPISVAIDASHSSFQLY 242
Query: 280 SGGVYD-GHCGT-QLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEG 337
S GVYD +C + +LDHGV AVGYG G Y +VKNSW WG+ GYI M R+ +
Sbjct: 243 SEGVYDEPNCSSEELDHGVLAVGYGVKNGKKYWLVKNSWAETWGDNGYILMSRDK---DN 299
Query: 338 LCGINKMASYPI 349
CGI ASYP+
Sbjct: 300 QCGIASSASYPL 311
>gi|18202414|sp|P82473.1|CPGP1_ZINOF RecName: Full=Zingipain-1; AltName: Full=Cysteine proteinase GP-I
Length = 221
Score = 249 bits (637), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 122/218 (55%), Positives = 152/218 (69%), Gaps = 2/218 (0%)
Query: 134 LPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTY 193
LP S+DWR+KGAV VKNQG CGSCWAF +AAVEGINQIVTG+L SLSEQ+L+DC +T
Sbjct: 3 LPDSIDWREKGAVVPVKNQGGCGSCWAFDAIAAVEGINQIVTGDLISLSEQQLVDC-STR 61
Query: 194 NNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSE 253
N+GC GG AFQYI++ GG++ EE YPY GTC+ TK + VV+I+ Y +VP N E
Sbjct: 62 NHGCEGGWPYRAFQYIINNGGINSEEHYPYTGTNGTCD-TKENAHVVSIDSYRNVPSNDE 120
Query: 254 DSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVK 313
SL KA+ANQP+SV ++A+GRDFQ Y G++ G C +H G + DY VK
Sbjct: 121 KSLQKAVANQPVSVTMDAAGRDFQLYRNGIFTGSCNISANHYRTVGGRETENDKDYWTVK 180
Query: 314 NSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIKK 351
NSWG WGE GYIR++RN + G CGI SYPIK+
Sbjct: 181 NSWGKNWGESGYIRVERNIAESSGKCGIAISPSYPIKE 218
>gi|229367042|gb|ACQ58501.1| Cathepsin L precursor [Anoplopoma fimbria]
Length = 334
Score = 249 bits (637), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 148/315 (46%), Positives = 183/315 (58%), Gaps = 20/315 (6%)
Query: 48 FESWMSKFEKVYESLDEKLERFEIFKDNLR----HIDETNRKIKNYWLGLNEFADLRHEE 103
F +W +F + Y S E+ +R EI+ N R H ++ IK+Y LG+ FAD+ +EE
Sbjct: 26 FHAWKLQFGRSYNSPAEEAQRKEIWLSNRRLVLVHNIMADQGIKSYRLGMTYFADMENEE 85
Query: 104 FKEMF----LG-LKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSC 158
+K LG L RR + DLP SVDWR+KG VT VK+Q CGSC
Sbjct: 86 YKRQISQGCLGSFNASLPRRGSAY---LRLPEGADLPNSVDWREKGYVTEVKDQKQCGSC 142
Query: 159 WAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNN-GCNGGLMDYAFQYIVSTGGLHK 217
WAFST ++EG TG L SLSEQ+L+DC Y N GC GGLMD AF+YI + GG+
Sbjct: 143 WAFSTTGSLEGQTFRKTGKLVSLSEQQLVDCSGDYGNEGCMGGLMDSAFRYIQANGGIDT 202
Query: 218 EEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVAIEASGRDF 276
E+ YPY E+G C T GY DV Q ED+L +A+A P+SVAI+AS F
Sbjct: 203 EDSYPYEAEDGQCRYNSANIG-ATCTGYVDVKQGDEDALKEAVATIGPVSVAIDASHSSF 261
Query: 277 QFYSGGVYD-GHC-GTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGK 334
Q Y GVYD C ++LDHGV AVGYGS G DY +VKNSWG WG KGYI M RN
Sbjct: 262 QLYESGVYDEPECSSSELDHGVLAVGYGSDNGHDYWLVKNSWGLGWGNKGYIMMTRNK-- 319
Query: 335 PEGLCGINKMASYPI 349
CGI +SYP+
Sbjct: 320 -HNQCGIATASSYPL 333
>gi|297740510|emb|CBI30692.3| unnamed protein product [Vitis vinifera]
Length = 377
Score = 249 bits (637), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 124/223 (55%), Positives = 155/223 (69%), Gaps = 5/223 (2%)
Query: 132 VDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDN 191
+ P S+DWRKKG VT +K+QG CGSCWAFS+ A+EGIN IVTG+L SLSEQEL+DCD
Sbjct: 10 CEAPSSLDWRKKGVVTGIKDQGDCGSCWAFSSTGAMEGINAIVTGDLISLSEQELVDCDT 69
Query: 192 TYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQN 251
T N GC GG MDYAF++++S GG+ E DYPY +GTC TK +++VV+I+GY DV +
Sbjct: 70 T-NYGCEGGYMDYAFEWVISNGGIDSESDYPYTGTDGTCNTTKEDTKVVSIDGYKDVDE- 127
Query: 252 SEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDG---HCGTQLDHGVAAVGYGSTRGLD 308
S+ +LL A NQP+SV ++ S DFQ Y+ G+Y G +DH V VGYGS D
Sbjct: 128 SDSALLCAAVNQPISVGMDGSALDFQLYTSGIYAGDCSDDPDDIDHAVLIVGYGSEDSED 187
Query: 309 YIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIKK 351
Y I KNSWG WG +GY +KRNT P G C IN MASYP K+
Sbjct: 188 YWICKNSWGTSWGMEGYFYIKRNTDLPYGECAINAMASYPTKE 230
>gi|449683741|ref|XP_002155462.2| PREDICTED: cathepsin L-like [Hydra magnipapillata]
Length = 324
Score = 249 bits (636), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 146/323 (45%), Positives = 185/323 (57%), Gaps = 17/323 (5%)
Query: 31 VGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKNYW 90
+ Y E T +D I W K Y E+ R+ I+KDN R I E N + ++
Sbjct: 14 LAYIIERPTEDDSWI----RWKMAHNKAYSHDGEETVRYTIWKDNERRIREHNLQGGDFL 69
Query: 91 LGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVK 150
L +N+F D+ + EFK+ F G K S F + P SVDWR +G VT VK
Sbjct: 70 LEMNQFGDMTNNEFKD-FNGY----LSHKHVSGSTFLTPNSFVAPDSVDWRNEGYVTPVK 124
Query: 151 NQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYI 209
+QG CGSCWAFST ++EG N TG L SLSEQ L+DC Y NNGCNGGLMD AF YI
Sbjct: 125 DQGQCGSCWAFSTTGSLEGQNFKKTGKLVSLSEQNLVDCSTAYGNNGCNGGLMDNAFTYI 184
Query: 210 VSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVA 268
G+ E YPY ++G C TK + T G+ D+P E+ L +A+A+ P+SVA
Sbjct: 185 KENNGIDSEASYPYTAKDGKCAFTK-PNVAATDTGFVDIPSGDENKLKEAVASVGPISVA 243
Query: 269 IEASGRDFQFYSGGVYDGH--CGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYI 326
I+AS FQFY GVY+ T+LDHGV VGYG+ G DY +VKNSW WG+KGYI
Sbjct: 244 IDASHFSFQFYRKGVYNERKCSSTELDHGVLVVGYGTESGKDYWLVKNSWNTSWGDKGYI 303
Query: 327 RMKRNTGKPEGLCGINKMASYPI 349
+M RN + CGI ASYP+
Sbjct: 304 KMSRNA---KNQCGIATNASYPL 323
>gi|156124996|gb|ABU50816.1| Ale o 1 allergen [Aleuroglyphus ovatus]
Length = 337
Score = 249 bits (635), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 147/324 (45%), Positives = 190/324 (58%), Gaps = 16/324 (4%)
Query: 35 PEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKN----YW 90
P L + +L FE + S F +VY S + +L R IF+ NL+ I N N +
Sbjct: 20 PSMLLTEGELEAQFEQFKSTFGRVYPSPEIELHRKSIFRANLQFILRHNIDYFNGDSTFS 79
Query: 91 LGLNEFADLRHEEFKEMFLGLKPDLA-RRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHV 149
+ +N F DL +EEF+ F G + A D H D DV LP +VDW KG VT +
Sbjct: 80 VSVNNFTDLSNEEFRATFNGYRRLAAVSLADSVHAD---NDVEALPATVDWTTKGVVTPI 136
Query: 150 KNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNN-GCNGGLMDYAFQY 208
KNQ CGSCWAFS VA++EG + + TG L SLSEQ L+DC + GC+GG MDYAF+Y
Sbjct: 137 KNQQQCGSCWAFSAVASMEGQHALKTGKLVSLSEQNLVDCSAAEGDMGCSGGWMDYAFKY 196
Query: 209 IVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSV 267
++ G+ E YPY + +CE K S TI+ + DV E +L A+A+ P+SV
Sbjct: 197 VIQNRGIDTEASYPYKAIDESCEF-KRNSIGATIHSFVDVKTGDESALQNAVASIGPISV 255
Query: 268 AIEASGRDFQFYSGGVY-DGHCGTQ-LDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGY 325
AI+AS FQFYS GVY + C T+ LDHGV AVGYG+ G+ Y VKNSWG WG+KGY
Sbjct: 256 AIDASQPSFQFYSSGVYNEPDCSTEILDHGVTAVGYGTLNGVPYWKVKNSWGTSWGQKGY 315
Query: 326 IRMKRNTGKPEGLCGINKMASYPI 349
I M RN + CGI ASYP+
Sbjct: 316 IFMSRNK---QNQCGIATKASYPV 336
>gi|157132324|ref|XP_001655999.1| cathepsin l [Aedes aegypti]
gi|108881694|gb|EAT45919.1| AAEL002833-PA [Aedes aegypti]
Length = 339
Score = 249 bits (635), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 145/318 (45%), Positives = 194/318 (61%), Gaps = 21/318 (6%)
Query: 49 ESWMS---KFEKVYESLDEKLERFEIFKDNLRHIDETNRKI----KNYWLGLNEFADLRH 101
E W + + K Y+S E+ R +I+ N I + N++ + Y L +N++ADL H
Sbjct: 25 EEWNAFKLQHRKNYDSETEERIRLKIYVQNKHKIAKHNQRFDLGQEKYRLRVNKYADLLH 84
Query: 102 EEFKEMFLGL-----KPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCG 156
EEF + G K L + + F V++P +VDWRKKGAVT VK+QG CG
Sbjct: 85 EEFVQTVNGFNRTDSKKSLKGVRIEEPVTFIEPANVEVPTTVDWRKKGAVTPVKDQGHCG 144
Query: 157 SCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIVSTGGL 215
SCW+FS A+EG + TG L SLSEQ L+DC Y NNGCNGG+MDYAFQYI GG+
Sbjct: 145 SCWSFSATGALEGQHFRKTGKLVSLSEQNLVDCSGKYGNNGCNGGMMDYAFQYIKDNGGI 204
Query: 216 HKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVAIEASGR 274
E+ YPY + TC ++ T GY D+PQ E++L KALA P+S+AI+AS
Sbjct: 205 DTEKSYPYEAIDDTCHFNP-KAVGATDKGYVDIPQGDEEALKKALATVGPVSIAIDASHE 263
Query: 275 DFQFYSGGV-YDGHCGTQ-LDHGVAAVGYGST-RGLDYIIVKNSWGPKWGEKGYIRMKRN 331
FQFYS GV Y+ C ++ LDHGV AVGYG++ G DY +VKNSWG WG++GY++M RN
Sbjct: 264 SFQFYSEGVYYEPQCDSENLDHGVLAVGYGTSEEGEDYWLVKNSWGTTWGDQGYVKMARN 323
Query: 332 TGKPEGLCGINKMASYPI 349
+ CG+ ASYP+
Sbjct: 324 R---DNHCGVATCASYPL 338
>gi|24653514|ref|NP_523735.2| cysteine proteinase-1, isoform C [Drosophila melanogaster]
gi|118572624|sp|Q95029.2|CATL_DROME RecName: Full=Cathepsin L; AltName: Full=Cysteine proteinase 1;
Contains: RecName: Full=Cathepsin L heavy chain;
Contains: RecName: Full=Cathepsin L light chain; Flags:
Precursor
gi|21627209|gb|AAM68565.1| cysteine proteinase-1, isoform C [Drosophila melanogaster]
Length = 371
Score = 249 bits (635), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 148/322 (45%), Positives = 195/322 (60%), Gaps = 25/322 (7%)
Query: 47 LFESWMS-KFEKVYESLDEKLERF--EIFKDNLRHIDETNRKIK----NYWLGLNEFADL 99
+ E W + K E DE ERF +IF +N I + N++ ++ L +N++ADL
Sbjct: 55 VMEEWHTFKLEHRKNYQDETEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADL 114
Query: 100 RHEEFKEMFLGLKPDLARRKDQSHEDFSYKDV-------VDLPKSVDWRKKGAVTHVKNQ 152
H EF+++ G L ++ + E S+K V V LPKSVDWR KGAVT VK+Q
Sbjct: 115 LHHEFRQLMNGFNYTLHKQLRAADE--SFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQ 172
Query: 153 GSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIVS 211
G CGSCWAFS+ A+EG + +G L SLSEQ L+DC Y NNGCNGGLMD AF+YI
Sbjct: 173 GHCGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKD 232
Query: 212 TGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVAIE 270
GG+ E+ YPY + +C KG + T G+ D+PQ E + +A+A P+SVAI+
Sbjct: 233 NGGIDTEKSYPYEAIDDSCHFNKG-TVGATDRGFTDIPQGDEKKMAEAVATVGPVSVAID 291
Query: 271 ASGRDFQFYSGGVY-DGHCGTQ-LDHGVAAVGYGSTR-GLDYIIVKNSWGPKWGEKGYIR 327
AS FQFYS GVY + C Q LDHGV VG+G+ G DY +VKNSWG WG+KG+I+
Sbjct: 292 ASHESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESGEDYWLVKNSWGTTWGDKGFIK 351
Query: 328 MKRNTGKPEGLCGINKMASYPI 349
M RN E CGI +SYP+
Sbjct: 352 MLRN---KENQCGIASASSYPL 370
>gi|255522980|gb|ACU12382.1| RE21773p [Drosophila melanogaster]
Length = 375
Score = 249 bits (635), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 148/322 (45%), Positives = 195/322 (60%), Gaps = 25/322 (7%)
Query: 47 LFESWMS-KFEKVYESLDEKLERF--EIFKDNLRHIDETNRKIK----NYWLGLNEFADL 99
+ E W + K E DE ERF +IF +N I + N++ ++ L +N++ADL
Sbjct: 59 VMEEWHTFKLEHRKNYQDETEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADL 118
Query: 100 RHEEFKEMFLGLKPDLARRKDQSHEDFSYKDV-------VDLPKSVDWRKKGAVTHVKNQ 152
H EF+++ G L ++ + E S+K V V LPKSVDWR KGAVT VK+Q
Sbjct: 119 LHHEFRQLMNGFNYTLHKQLRAADE--SFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQ 176
Query: 153 GSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIVS 211
G CGSCWAFS+ A+EG + +G L SLSEQ L+DC Y NNGCNGGLMD AF+YI
Sbjct: 177 GHCGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKD 236
Query: 212 TGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVAIE 270
GG+ E+ YPY + +C KG + T G+ D+PQ E + +A+A P+SVAI+
Sbjct: 237 NGGIDTEKSYPYEAIDDSCHFNKG-TVGATDRGFTDIPQGDEKKMAEAVATVGPVSVAID 295
Query: 271 ASGRDFQFYSGGVY-DGHCGTQ-LDHGVAAVGYGSTR-GLDYIIVKNSWGPKWGEKGYIR 327
AS FQFYS GVY + C Q LDHGV VG+G+ G DY +VKNSWG WG+KG+I+
Sbjct: 296 ASHESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESGEDYWLVKNSWGTTWGDKGFIK 355
Query: 328 MKRNTGKPEGLCGINKMASYPI 349
M RN E CGI +SYP+
Sbjct: 356 MLRN---KENQCGIASASSYPL 374
>gi|115468686|ref|NP_001057942.1| Os06g0582600 [Oryza sativa Japonica Group]
gi|55296512|dbj|BAD68726.1| putative cysteine proteinase [Oryza sativa Japonica Group]
gi|113595982|dbj|BAF19856.1| Os06g0582600 [Oryza sativa Japonica Group]
gi|215695236|dbj|BAG90427.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 357
Score = 249 bits (635), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 136/318 (42%), Positives = 188/318 (59%), Gaps = 12/318 (3%)
Query: 41 NDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRK--IKNYWLGLNEFAD 98
+ + + +E W + + Y+ EK RFE+F+ N ID N K+ L N+FAD
Sbjct: 42 DSAMRERYEKWAADHGRTYKDSLEKARRFEVFRTNALFIDSFNAAGGKKSPRLTTNKFAD 101
Query: 99 LRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSC 158
L +EEF E + +P S + D+P +++WR +GAVT VKNQ C SC
Sbjct: 102 LTNEEFAEYYG--RPFSTPVIGGSGFMYGNVRTSDVPANINWRDRGAVTQVKNQKDCASC 159
Query: 159 WAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNN-GCNGGLMDYAFQYIVSTGGLHK 217
WAFS VAAVEGI+QI + NL +LS Q+L+DC NN GCN G MD AF+YI S GG+
Sbjct: 160 WAFSAVAAVEGIHQIRSHNLVALSTQQLLDCSTGRNNHGCNRGDMDEAFRYITSNGGIAA 219
Query: 218 EEDYPYIMEE-GTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDF 276
E DYPY GTC + G+ +I G+ VP N+E +LL A+A+QP+SVA++ G+
Sbjct: 220 ESDYPYEDRALGTCRAS-GKPVAASIRGFQYVPPNNETALLLAVAHQPVSVALDGVGKVS 278
Query: 277 QFYSGGVYDGH----CGTQLDHGVAAVGYGS-TRGLDYIIVKNSWGPKWGEKGYIRMKRN 331
QF+S GV+ C T L+H + AVGYG+ G Y ++KNSWG WGE GY+++ R+
Sbjct: 279 QFFSSGVFGAMQNETCTTDLNHAMTAVGYGTDEHGTKYWLMKNSWGTDWGEGGYMKIARD 338
Query: 332 TGKPEGLCGINKMASYPI 349
GLCG+ SYP+
Sbjct: 339 VASNTGLCGLAMQPSYPV 356
>gi|432936690|ref|XP_004082231.1| PREDICTED: cathepsin L-like [Oryzias latipes]
Length = 334
Score = 249 bits (635), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 145/315 (46%), Positives = 187/315 (59%), Gaps = 20/315 (6%)
Query: 48 FESWMSKFEKVYESLDEKLERFEIFKDN----LRHIDETNRKIKNYWLGLNEFADLRHEE 103
F +W KF + Y S E+ +R + + +N L H ++ IK+Y LG+ FAD+ +EE
Sbjct: 26 FHAWRLKFGRTYSSPTEEAQRRQTWLNNRKLVLVHNILADQGIKSYRLGMTYFADMENEE 85
Query: 104 FKEMF----LG-LKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSC 158
+K + LG L RR F + DLP +VDWR KG VT VK+Q CGSC
Sbjct: 86 YKRLISQGCLGSFNASLPRRGSTF---FRLPENKDLPAAVDWRDKGYVTDVKDQKQCGSC 142
Query: 159 WAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNN-GCNGGLMDYAFQYIVSTGGLHK 217
WAFS ++EG TG L SLSEQ+L+DC Y N GC GGLMD AF+YI +TGG+
Sbjct: 143 WAFSATGSLEGQTFRKTGKLVSLSEQQLVDCSGDYGNMGCGGGLMDDAFRYIQATGGIDT 202
Query: 218 EEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVAIEASGRDF 276
EE YPY E+G C K ++ T GY DV ED+L +A+A P+SV I+AS F
Sbjct: 203 EESYPYEAEDGECRY-KPDAVGATCTGYVDVSSGDEDALQEAVATIGPISVGIDASHISF 261
Query: 277 QFYSGGVYD-GHC-GTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGK 334
Q Y G+YD C ++LDHGV AVGYGS G DY +VKNSWG WG++GYI+M +N
Sbjct: 262 QLYESGLYDEPQCSSSELDHGVLAVGYGSENGQDYWLVKNSWGLTWGDQGYIKMSKNKSN 321
Query: 335 PEGLCGINKMASYPI 349
CGI ASYP+
Sbjct: 322 Q---CGIATAASYPL 333
>gi|21953244|emb|CAD42716.1| putative cathepsin L [Myzus persicae]
Length = 341
Score = 249 bits (635), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 144/309 (46%), Positives = 187/309 (60%), Gaps = 17/309 (5%)
Query: 54 KFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKN----YWLGLNEFADLRHEEFKEMFL 109
+F+K+YE + E+ R +++ DN I N+ ++ Y L +N F DL E+ +M
Sbjct: 36 QFKKLYEDIKEETFRKKVYLDNKLKIARHNKLYESGEETYALEMNHFGDLMQHEYSKMMN 95
Query: 110 GLKPDLARRKDQSHED----FSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVA 165
G KP LA D F + V +PKS+DWRKKG VT VKNQG CGSCW+FS
Sbjct: 96 GFKPSLAGGDSNFTNDEGVTFLKSENVVIPKSIDWRKKGYVTPVKNQGQCGSCWSFSATG 155
Query: 166 AVEGINQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIVSTGGLHKEEDYPYI 224
++EG + TG L SLSEQ LIDC Y NNGC GGLMD AF+YI S GL E+ YPY
Sbjct: 156 SLEGQHFRKTGVLVSLSEQNLIDCSRKYGNNGCEGGLMDLAFKYIKSNKGLDTEKSYPYE 215
Query: 225 MEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVAIEASGRDFQFYSGGV 283
E+ C S T NG+ D+P+ E++L+ ALA P+S+AI+AS FQFY GV
Sbjct: 216 AEDDKCRYNPDNSG-ATDNGFVDIPEGDEEALMHALATVGPVSIAIDASSEKFQFYKKGV 274
Query: 284 -YDGHC-GTQLDHGVAAVGYGS-TRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCG 340
Y+ C T+LDHGV AVG+ + +G DY IVKNSWG WG++GYI M RN + CG
Sbjct: 275 FYNPRCSSTELDHGVLAVGFRTDKKGGDYWIVKNSWGKTWGDEGYIMMARNK---KNNCG 331
Query: 341 INKMASYPI 349
+ ASYP+
Sbjct: 332 VASSASYPL 340
>gi|340368360|ref|XP_003382720.1| PREDICTED: digestive cysteine proteinase 2-like [Amphimedon
queenslandica]
Length = 326
Score = 248 bits (634), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 140/308 (45%), Positives = 188/308 (61%), Gaps = 15/308 (4%)
Query: 48 FESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKNY--WLGLNEFADLRHEEFK 105
F+ W K+ KVYE+ D +L R I++ N + ++ N + + +NEFADL EF
Sbjct: 23 FQEWKVKYNKVYETKDIELARQVIWESNKKFVENHNANSDKFGFTVAMNEFADLDAAEFA 82
Query: 106 EMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVA 165
+F G + S +DF K V + +VDWR+KGAVT +KNQG CGSCW+FST
Sbjct: 83 SIFNGF----LSLPNNSTKDFYKKTGVKVAATVDWREKGAVTAIKNQGKCGSCWSFSTTG 138
Query: 166 AVEGINQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIVSTGGLHKEEDYPYI 224
++EG + + TG L SLSEQ+ +DC + N+GC GG MD AF+Y+ + G E YPY
Sbjct: 139 SLEGQHFLKTGTLLSLSEQQFVDCSTKFGNHGCKGGTMDNAFRYLETVSGDETEMMYPYT 198
Query: 225 MEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVAIEASGRDFQFYSGGV 283
E+G C+ E + V GY D+P++ ED+L +A+A P+SVAI+A FQ Y GV
Sbjct: 199 AEDGFCKFRSTEGK-VKCEGYKDIPRDDEDALREAVATVGPISVAIDAGHSSFQLYKEGV 257
Query: 284 -YDGHC-GTQLDHGVAAVGYGSTRGL-DYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCG 340
Y+ C T+LDHGV AVGYG+ G +Y +VKNSWGP WG +GYI M RN E CG
Sbjct: 258 YYNPTCSSTKLDHGVLAVGYGTYEGSEEYWLVKNSWGPSWGMEGYIMMSRNR---ENNCG 314
Query: 341 INKMASYP 348
I MASYP
Sbjct: 315 IATMASYP 322
>gi|18396952|ref|NP_564322.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|332192922|gb|AEE31043.1| cysteine proteinase-like protein [Arabidopsis thaliana]
Length = 334
Score = 248 bits (634), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 129/346 (37%), Positives = 198/346 (57%), Gaps = 20/346 (5%)
Query: 11 LISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFE 70
++S F + + D I P + ++D + WM++F +VY+ EK R +
Sbjct: 1 MVSVRSVFVALTILSMDLRISQARPHVTLNEQSIVDYHQQWMTQFSRVYKDESEKEMRLK 60
Query: 71 IFKDNLRHIDE-TNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLAR-----RKDQSHE 124
+FK NL+ I+ N ++Y LG+NEF D + EEF GL+ ++ K +
Sbjct: 61 VFKKNLKFIENFNNMGNQSYTLGVNEFTDWKTEEFLATHTGLRVNVTSLSELFNKTKPSR 120
Query: 125 DFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQ 184
+++ D+ +S DWR +GAVT VK QG+C + +I NL +LSEQ
Sbjct: 121 NWNMSDIDMEDESKDWRDEGAVTPVKYQGACR-------------LTKISGKNLLTLSEQ 167
Query: 185 ELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTING 244
+LIDCD N GCNGG + AF+YI+ GG+ E +YPY +++ +C + I G
Sbjct: 168 QLIDCDIEKNGGCNGGEFEEAFKYIIKNGGVSLETEYPYQVKKESCRANARRAPHTQIRG 227
Query: 245 YHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDG-HCGTQLDHGVAAVGYGS 303
+ VP ++E +LL+A+ QP+SV I+A F Y GGVY G CGT ++H V VGYG+
Sbjct: 228 FQMVPSHNERALLEAVRRQPVSVLIDARADSFGHYKGGVYAGLDCGTDVNHAVTIVGYGT 287
Query: 304 TRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPI 349
GL+Y ++KNSWG WGE GY+R++R+ P+G+CGI ++A+YP+
Sbjct: 288 MSGLNYWVLKNSWGESWGENGYMRIRRDVEWPQGMCGIAQVAAYPV 333
>gi|24653516|ref|NP_725347.1| cysteine proteinase-1, isoform A [Drosophila melanogaster]
gi|24653518|ref|NP_725348.1| cysteine proteinase-1, isoform B [Drosophila melanogaster]
gi|1658527|gb|AAB18345.1| cysteine proteinase 1 [Drosophila melanogaster]
gi|2305221|gb|AAB65749.1| cysteine proteinase-1 [Drosophila melanogaster]
gi|7303249|gb|AAF58311.1| cysteine proteinase-1, isoform A [Drosophila melanogaster]
gi|21627210|gb|AAM68566.1| cysteine proteinase-1, isoform B [Drosophila melanogaster]
gi|54650754|gb|AAV36956.1| LP06554p [Drosophila melanogaster]
gi|220951982|gb|ACL88534.1| Cp1-PA [synthetic construct]
Length = 341
Score = 248 bits (634), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 148/322 (45%), Positives = 195/322 (60%), Gaps = 25/322 (7%)
Query: 47 LFESWMS-KFEKVYESLDEKLERF--EIFKDNLRHIDETNRKIK----NYWLGLNEFADL 99
+ E W + K E DE ERF +IF +N I + N++ ++ L +N++ADL
Sbjct: 25 VMEEWHTFKLEHRKNYQDETEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADL 84
Query: 100 RHEEFKEMFLGLKPDLARRKDQSHEDFSYKDV-------VDLPKSVDWRKKGAVTHVKNQ 152
H EF+++ G L ++ + E S+K V V LPKSVDWR KGAVT VK+Q
Sbjct: 85 LHHEFRQLMNGFNYTLHKQLRAADE--SFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQ 142
Query: 153 GSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIVS 211
G CGSCWAFS+ A+EG + +G L SLSEQ L+DC Y NNGCNGGLMD AF+YI
Sbjct: 143 GHCGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKD 202
Query: 212 TGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVAIE 270
GG+ E+ YPY + +C KG + T G+ D+PQ E + +A+A P+SVAI+
Sbjct: 203 NGGIDTEKSYPYEAIDDSCHFNKG-TVGATDRGFTDIPQGDEKKMAEAVATVGPVSVAID 261
Query: 271 ASGRDFQFYSGGVY-DGHCGTQ-LDHGVAAVGYGSTR-GLDYIIVKNSWGPKWGEKGYIR 327
AS FQFYS GVY + C Q LDHGV VG+G+ G DY +VKNSWG WG+KG+I+
Sbjct: 262 ASHESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESGEDYWLVKNSWGTTWGDKGFIK 321
Query: 328 MKRNTGKPEGLCGINKMASYPI 349
M RN E CGI +SYP+
Sbjct: 322 MLRN---KENQCGIASASSYPL 340
>gi|29165304|gb|AAO65603.1| cathepsin L precursor [Hydra vulgaris]
Length = 324
Score = 248 bits (634), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 141/303 (46%), Positives = 181/303 (59%), Gaps = 13/303 (4%)
Query: 51 WMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLG 110
W KVY E+ R+ I+KDN R I E N K ++ L +N+F D+ + EFK F G
Sbjct: 30 WKMYHNKVYSHDGEETVRYTIWKDNERRIREHNLKGGDFILKMNQFGDMTNSEFK-AFNG 88
Query: 111 LKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGI 170
K + F + P +VDWR +G VT VK+QG CGSCWAFST ++EG
Sbjct: 89 Y----LSHKHVNGSTFLTPNNFVAPDTVDWRNEGYVTPVKDQGQCGSCWAFSTTGSLEGQ 144
Query: 171 NQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGT 229
+ TG L SLSEQ L+DC Y NNGC+GGLMD AF YI G+ E YPY E+G
Sbjct: 145 HFKKTGKLVSLSEQNLVDCSTAYGNNGCDGGLMDNAFTYIKENKGIDSEASYPYTAEDGK 204
Query: 230 CEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVAIEASGRDFQFYSGGVYD--G 286
C K S T G+ D+P+ +E+ L +A+A+ P+SVAI+AS FQFYS GVY+
Sbjct: 205 CVFKK-SSVAATDTGFVDIPEGNENKLKEAVASVGPISVAIDASHESFQFYSSGVYNEPS 263
Query: 287 HCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMAS 346
T+LDHGV VGYG+ G DY +VKNSW WG+KGYI+M+RN + CGI AS
Sbjct: 264 CSSTELDHGVLVVGYGTESGKDYWLVKNSWNTSWGDKGYIKMRRNA---KNQCGIATKAS 320
Query: 347 YPI 349
YP+
Sbjct: 321 YPL 323
>gi|121543825|gb|ABM55577.1| putative cathepsin L-like protease [Maconellicoccus hirsutus]
Length = 341
Score = 248 bits (634), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 151/342 (44%), Positives = 195/342 (57%), Gaps = 19/342 (5%)
Query: 23 SFARDFSIVGYSPEDLTS---NDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHI 79
+FA ++ Y +T+ ND + + +E + ++F K Y + E+ R ++F DN I
Sbjct: 3 AFAFLCCVLIYHSNSVTAVSFNDLIAEEWELFKTQFSKAYNTEIEEKFRMKVFMDNKHKI 62
Query: 80 DETNRKIKN----YWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDV--VD 133
N+ +N Y L +N F DL H EF + G + L R + ++ V
Sbjct: 63 ARHNKLFQNGEVSYELEMNHFGDLLHHEFVKTVNGYRHSLRRVTGDEIDSVTFIPAYNVT 122
Query: 134 LPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTY 193
+P SVDWR +GAVT VKNQG CGSCWAFST ++EG + T L SLSEQ LIDC Y
Sbjct: 123 VPDSVDWRTEGAVTEVKNQGQCGSCWAFSTTGSLEGQHFRNTKQLTSLSEQNLIDCSGKY 182
Query: 194 -NNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNS 252
NNGC+GGLMD AF YI S G+ E+ YPY + C ES T G+ D+PQ
Sbjct: 183 GNNGCSGGLMDNAFAYIKSNKGIDTEQSYPYEGIDDKCRYKPQESG-ATDKGFVDIPQGD 241
Query: 253 EDSLLKALAN-QPLSVAIEASGRDFQFYSGGV-YDGHCGT---QLDHGVAAVGYGSTRGL 307
E+ L A+A P+SVAI+AS + FQFY GV YD CG LDHGV AVGYG+ G
Sbjct: 242 EEKLKLAVATVGPISVAIDASHQSFQFYKKGVYYDKGCGNGEEDLDHGVLAVGYGTENGK 301
Query: 308 DYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPI 349
DY +VKNSWG +WG GYI+M RN CGI ASYP+
Sbjct: 302 DYWLVKNSWGKRWGLDGYIKMARN---KHNHCGIATSASYPL 340
>gi|91992508|gb|ABE72970.1| cathepsin L [Aedes aegypti]
Length = 339
Score = 248 bits (634), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 145/318 (45%), Positives = 194/318 (61%), Gaps = 21/318 (6%)
Query: 49 ESWMS---KFEKVYESLDEKLERFEIFKDNLRHIDETNRKI----KNYWLGLNEFADLRH 101
E W + + K Y+S E+ R +I+ N I + N++ + Y L +N++ADL H
Sbjct: 25 EEWNAFKLQHRKNYDSETEERIRLKIYVQNKHKIAKHNQRFDLGQEKYRLRVNKYADLLH 84
Query: 102 EEFKEMFLGL-----KPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCG 156
EEF + G K L + + F V++P +VDWRKKGAVT VK+QG CG
Sbjct: 85 EEFVQTVNGFNRTDSKKSLKGVRIEEPVTFIEPANVEVPTTVDWRKKGAVTPVKDQGHCG 144
Query: 157 SCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIVSTGGL 215
SCW+FS A+EG + TG L SLSEQ L+DC Y NNGCNGG+MDYAFQYI GG+
Sbjct: 145 SCWSFSATGALEGQHFRKTGKLVSLSEQNLVDCSGKYGNNGCNGGMMDYAFQYIKDNGGI 204
Query: 216 HKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVAIEASGR 274
E+ YPY + TC ++ T GY D+PQ E++L KALA P+S+AI+AS
Sbjct: 205 DTEKSYPYEAIDDTCHFNP-KAVGATDKGYVDIPQGDEEALKKALATVGPVSIAIDASHE 263
Query: 275 DFQFYSGGV-YDGHCGTQ-LDHGVAAVGYGST-RGLDYIIVKNSWGPKWGEKGYIRMKRN 331
FQFYS GV Y+ C ++ LDHGV AVGYG++ G DY +VKNSWG WG++GY++M RN
Sbjct: 264 SFQFYSEGVYYEPQCDSENLDHGVLAVGYGTSEEGEDYWLVKNSWGTTWGDQGYVKMARN 323
Query: 332 TGKPEGLCGINKMASYPI 349
+ CG+ ASYP+
Sbjct: 324 H---DNHCGVATCASYPL 338
>gi|94448674|emb|CAI91575.1| cathepsin L2 [Lubomirskia baicalensis]
Length = 324
Score = 248 bits (633), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 141/308 (45%), Positives = 182/308 (59%), Gaps = 12/308 (3%)
Query: 48 FESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRK--IKNYWLGLNEFADLRHEEFK 105
+W ++ K Y + E++ R ++ N ++IDE N+ + Y L +N+F DL + EFK
Sbjct: 22 LRAWKAEHGKSYRNHKEEMLRHVTWQANKKYIDEHNQHAGVFGYTLKMNQFGDLENSEFK 81
Query: 106 EMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVA 165
++ G + A RK + V DLP SVDW KKG VT VKNQG CGSCW+FS
Sbjct: 82 SLYNGYRMSNAPRKGKPF--VPAARVQDLPASVDWSKKGWVTPVKNQGQCGSCWSFSATG 139
Query: 166 AVEGINQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIVSTGGLHKEEDYPYI 224
++EG + TG L SLSEQ L+DC N+GCNGGLMD AF+Y++ G+ E YPY
Sbjct: 140 SMEGQHFNATGTLMSLSEQNLVDCSAAEGNHGCNGGLMDDAFEYVIKNNGIDTEASYPYR 199
Query: 225 MEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVAIEASGRDFQFYSGGV 283
+ TC+ + TI+GY DV ++SE L A+A P+SVAI+AS FQFYS GV
Sbjct: 200 AVDSTCKFNTADVG-ATISGYVDVTKDSESDLQVAVATIGPVSVAIDASHISFQFYSSGV 258
Query: 284 YDGH--CGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGI 341
YD T LDHGV AVGYG+ DY +VKNSWG WG GYI M RN CGI
Sbjct: 259 YDPLICSSTNLDHGVLAVGYGTDGSKDYWLVKNSWGASWGMSGYIEMVRNHNNK---CGI 315
Query: 342 NKMASYPI 349
ASYP+
Sbjct: 316 ATSASYPV 323
>gi|346466067|gb|AEO32878.1| hypothetical protein [Amblyomma maculatum]
Length = 358
Score = 248 bits (632), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 147/304 (48%), Positives = 181/304 (59%), Gaps = 16/304 (5%)
Query: 57 KVYESLDEKLERFEIFKDNLRHIDETNRKIKN----YWLGLNEFADLRHEEFKEMFLGLK 112
K Y S E+ R +I+ +N I N K N Y L +NEF DL H EF G K
Sbjct: 59 KEYHSETEEYYRLKIYMENRLKIARHNEKYANNKASYKLAMNEFGDLLHHEFVSTRNGFK 118
Query: 113 PDLARRKDQSHEDFSYKDVVD--LPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGI 170
+ + + + D LPK+VDWRKKGAVT VKNQG CGSCWAFST ++EG
Sbjct: 119 RNYRSTPREGSFYIEPEGIEDKHLPKTVDWRKKGAVTPVKNQGQCGSCWAFSTTGSLEGQ 178
Query: 171 NQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGT 229
+ TG + SLSEQ L+DC + NNGC GGLMD AF+YI + GG+ E YPY +G
Sbjct: 179 HFRKTGRMVSLSEQNLVDCSGKFGNNGCEGGLMDNAFKYIKANGGIDTELSYPYNGTDGI 238
Query: 230 CEMTKGESEV-VTINGYHDVPQNSEDSLLKALAN-QPLSVAIEASGRDFQFYSGGVYD-G 286
C K S+V T G+ D+P+ +E L KA+A P+SVAI+AS FQFYS GVYD
Sbjct: 239 CHFEK--SDVGATDTGFVDIPEGNEQLLKKAVATVGPVSVAIDASHESFQFYSQGVYDEP 296
Query: 287 HCGTQ-LDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMA 345
C ++ LDHGV VGYG+ G DY +VKNSWG WG+ GYI M RN E CGI A
Sbjct: 297 ECSSESLDHGVLVVGYGTKDGQDYWLVKNSWGTTWGDDGYIYMTRNK---ENQCGIASSA 353
Query: 346 SYPI 349
SYP+
Sbjct: 354 SYPL 357
>gi|356515116|ref|XP_003526247.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 333
Score = 248 bits (632), Expect = 5e-63, Method: Compositional matrix adjust.
Identities = 145/333 (43%), Positives = 178/333 (53%), Gaps = 35/333 (10%)
Query: 48 FESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEM 107
F+ W+ YE +E RF I++ N+ +I + +Y L N+FADL +EEF
Sbjct: 5 FDRWLKXNGXNYEDKEEWEIRFVIYQANVEYIGCKKSQKNSYNLTDNKFADLTNEEFVST 64
Query: 108 FLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCG----------- 156
+LG L H F Y + +LP S DWRK+GAVT +K+QG+CG
Sbjct: 65 YLGFATRLI-----PHTRFKYHEHGNLPXSKDWRKEGAVTDIKDQGNCGKHSTWFSPEIS 119
Query: 157 ------------------SCWAFSTVAAVEGINQIVTGNLASLSEQELIDCD-NTYNNGC 197
S WAFS VAAVE IN+I +G L SLSEQEL+D D N GC
Sbjct: 120 HNLRNILTNYNTINFRDISFWAFSVVAAVERINKIKSGKLVSLSEQELVDYDVANKNQGC 179
Query: 198 NGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLL 257
GGLMD F +I GGL +DYPY +G+C K V I+GY P E L
Sbjct: 180 EGGLMDTTFAFIKKNGGLTTSKDYPYEGVDGSCNKEKALHHAVNISGYERAPSKDEAMLK 239
Query: 258 KALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWG 317
A ANQP+SVAI+A G FQ YS GV+ G CG +L+HGV VGY Y VKNS G
Sbjct: 240 VAAANQPISVAIDAGGYAFQLYSQGVFSGVCGKKLNHGVTIVGYDKGTFDKYRTVKNSXG 299
Query: 318 PKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
WGE GYIRMKR+ G CGI ASYP+K
Sbjct: 300 ADWGESGYIRMKRDAFDKAGTCGIAMKASYPLK 332
>gi|156397875|ref|XP_001637915.1| predicted protein [Nematostella vectensis]
gi|156225031|gb|EDO45852.1| predicted protein [Nematostella vectensis]
Length = 331
Score = 248 bits (632), Expect = 5e-63, Method: Compositional matrix adjust.
Identities = 137/306 (44%), Positives = 180/306 (58%), Gaps = 8/306 (2%)
Query: 48 FESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEM 107
+++W S K Y + +E+ R I+++NL+ I N ++ L +N D+ E +
Sbjct: 29 WKAWKSFHGKEYPNKNEETMRNFIWQNNLKKIVTHNEGKHSFKLAMNHLGDMTSLEISQT 88
Query: 108 FLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAV 167
LGLK F V + S+DWR KG VT VKNQG CGSCWAFST A+
Sbjct: 89 LLGLKLKKHAESQPKGATFLPPANVKVVDSIDWRSKGYVTPVKNQGQCGSCWAFSTTGAL 148
Query: 168 EGINQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIME 226
EG + TG L SLSEQ L+DC Y NNGC GGLMD AFQYI GG+ E+ YPY+ +
Sbjct: 149 EGQHFRKTGKLVSLSEQNLVDCSGKYGNNGCEGGLMDNAFQYIKENGGIDTEKSYPYLAK 208
Query: 227 EGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVAIEASGRDFQFYSGGVYD 285
+G C K + G+ D+P E++L +ALA+ P+S+AI+AS F FY GVYD
Sbjct: 209 DGVCHYNK-SAIGAKDTGFVDIPTGDENALQQALASVGPISIAIDASQSTFHFYHQGVYD 267
Query: 286 GH--CGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINK 343
T+LDHGV AVGYG+ G DY +VKNSWGP WGE+GYI++ RN CG+
Sbjct: 268 DPDCSSTRLDHGVLAVGYGTDDGKDYWLVKNSWGPSWGEEGYIKIARND---HDKCGVAS 324
Query: 344 MASYPI 349
ASYP+
Sbjct: 325 KASYPL 330
>gi|195124431|ref|XP_002006696.1| GI21205 [Drosophila mojavensis]
gi|193911764|gb|EDW10631.1| GI21205 [Drosophila mojavensis]
Length = 339
Score = 247 bits (631), Expect = 6e-63, Method: Compositional matrix adjust.
Identities = 142/318 (44%), Positives = 191/318 (60%), Gaps = 21/318 (6%)
Query: 49 ESWMS---KFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKN----YWLGLNEFADLRH 101
E W + + K Y+ E+ R +IF +N I + N++ + + +N++AD+ H
Sbjct: 25 EEWHTFKLEHRKTYQDETEERFRLKIFNENKHKIAKHNQRYATGEVTFKMAVNKYADMLH 84
Query: 102 EEFKEMFLGLKPDLARRKDQSHEDFS-----YKDVVDLPKSVDWRKKGAVTHVKNQGSCG 156
EF+E G L + S F+ V LPKSVDWR+KGAVT VK+QG CG
Sbjct: 85 HEFRETMNGFNYTLHKELRASDPSFTGITFISPAHVKLPKSVDWREKGAVTAVKDQGHCG 144
Query: 157 SCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIVSTGGL 215
SCWAFS+ A+EG + TG L SLSEQ L+DC Y NNGCNGGLMD AF+YI GG+
Sbjct: 145 SCWAFSSTGALEGQHFRKTGTLVSLSEQNLVDCSAKYGNNGCNGGLMDNAFRYIKDNGGI 204
Query: 216 HKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVAIEASGR 274
E+ YPY + +C K +S T G+ D+PQ +E + +A+A P+SVAI+AS
Sbjct: 205 DTEKSYPYEGIDDSCHFNK-DSVGATDRGFADIPQGNEKKMAEAVATIGPVSVAIDASHE 263
Query: 275 DFQFYSGGVY-DGHCGTQ-LDHGVAAVGYGSTR-GLDYIIVKNSWGPKWGEKGYIRMKRN 331
FQFYS G+Y + C +Q LDHGV VGYG+ G DY +VKNSWG WG+KG+I+M RN
Sbjct: 264 SFQFYSEGIYNEPECNSQNLDHGVLVVGYGTDESGKDYWLVKNSWGTTWGDKGFIKMARN 323
Query: 332 TGKPEGLCGINKMASYPI 349
+ CGI +SYP+
Sbjct: 324 ---EDNQCGIASASSYPL 338
>gi|348542776|ref|XP_003458860.1| PREDICTED: cathepsin L-like [Oreochromis niloticus]
Length = 334
Score = 247 bits (631), Expect = 6e-63, Method: Compositional matrix adjust.
Identities = 145/315 (46%), Positives = 188/315 (59%), Gaps = 20/315 (6%)
Query: 48 FESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETN----RKIKNYWLGLNEFADLRHEE 103
F +W KF K Y+S E+ R +I+ N +H+ N + K+Y LG+ FAD+ +EE
Sbjct: 26 FHAWRLKFGKSYDSPSEESHRKQIWLTNRKHVLMHNILADQGFKSYRLGMTYFADMENEE 85
Query: 104 FKEMF----LG-LKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSC 158
+K++ LG L RR + +DLP +VDWR++G VT VK+Q CGSC
Sbjct: 86 YKKLVSRGCLGSFNASLPRRGSTF---LRLPEGIDLPDAVDWREQGYVTGVKDQKQCGSC 142
Query: 159 WAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNN-GCNGGLMDYAFQYIVSTGGLHK 217
WAFS A+EG + TG L SLSEQ+L+DC Y N GCNGG MD AF+YI + GG+
Sbjct: 143 WAFSATGALEGQHFRKTGILVSLSEQQLVDCSGAYGNEGCNGGWMDSAFRYIEANGGIDT 202
Query: 218 EEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVAIEASGRDF 276
E YPY E+ C S T +GY DV + E++L +A+A P+SVAI+AS F
Sbjct: 203 EASYPYEAEDWLCRYNPA-SVGATCSGYVDVNKYDEEALKEAVATIGPVSVAIDASHASF 261
Query: 277 QFYSGGVYD--GHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGK 334
QFY+ GVYD G +LDHGV AVGYG+ G DY +VKNSWG WGE GYI+M RN
Sbjct: 262 QFYTSGVYDEPGCSSIELDHGVLAVGYGTENGHDYWLVKNSWGRGWGEMGYIKMSRNK-- 319
Query: 335 PEGLCGINKMASYPI 349
CGI ASYP+
Sbjct: 320 -HNQCGIASAASYPL 333
>gi|66810271|ref|XP_638859.1| cysteine proteinase 3 [Dictyostelium discoideum AX4]
gi|166201983|sp|Q23894.2|CYSP3_DICDI RecName: Full=Cysteine proteinase 3; AltName: Full=Cysteine
proteinase II; Flags: Precursor
gi|60467526|gb|EAL65548.1| cysteine proteinase 3 [Dictyostelium discoideum AX4]
Length = 337
Score = 247 bits (630), Expect = 6e-63, Method: Compositional matrix adjust.
Identities = 152/363 (41%), Positives = 200/363 (55%), Gaps = 41/363 (11%)
Query: 1 MALSSQFKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYE 60
M LS LI ISF S ++ S+ + D F WM K Y
Sbjct: 1 MRLSITLIFTLIVLSISFI--------------SAGNVFSHKQYQDSFIDWMRSNNKAY- 45
Query: 61 SLDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGL--------- 111
+ E + R+E FK N+ ++ N K LGLN+ ADL +EE++ +LG
Sbjct: 46 THKEFMPRYEEFKKNMDYVHNWNSKGSKTVLGLNQHADLSNEEYRLNYLGTRAHIKLNGY 105
Query: 112 -KPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGI 170
K +L R ++ P +VDWR+K AVT VK+QG CGSC++FST +VEG+
Sbjct: 106 HKRNLGLRLNRPQ--------FKQPLNVDWREKDAVTPVKDQGQCGSCYSFSTTGSVEGV 157
Query: 171 NQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIME-EG 228
I TG L SLSEQ ++DC +++ N GCNGGLM AF+YI+ GL+ EE YPY M+
Sbjct: 158 TAIKTGKLVSLSEQNILDCSSSFGNEGCNGGLMTNAFEYIIKNNGLNSEEQYPYEMKVND 217
Query: 229 TCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGV-YDGH 287
C+ +G S I Y ++ E+ L AL P+SVAI+AS FQ Y+ GV Y+
Sbjct: 218 ECKFQEG-SVAAKITSYKEIEAGDENDLQNALLLNPVSVAIDASHNSFQLYTAGVYYEPA 276
Query: 288 CGTQ-LDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMAS 346
C ++ LDHGV AVG G+ G DY IVKNSWGP WG GYI M RN + CGI+ MAS
Sbjct: 277 CSSEDLDHGVLAVGMGTDNGEDYYIVKNSWGPSWGLNGYIHMARN---KDNNCGISTMAS 333
Query: 347 YPI 349
YPI
Sbjct: 334 YPI 336
>gi|156124998|gb|ABU50817.1| Ale o 1 allergen [Aleuroglyphus ovatus]
Length = 337
Score = 247 bits (630), Expect = 6e-63, Method: Compositional matrix adjust.
Identities = 146/324 (45%), Positives = 188/324 (58%), Gaps = 16/324 (4%)
Query: 35 PEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKN----YW 90
P L + +L FE + S F +VY S + +L R IF+ NL+ I N N +
Sbjct: 20 PSMLLTEGELEAQFEQFKSTFGRVYPSPEIELHRKSIFRANLQFILRHNIDYFNGDSTFS 79
Query: 91 LGLNEFADLRHEEFKEMFLGLKPDLA-RRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHV 149
+ +N F DL +EEF+ F G + A D H D DV LP +VDW KG VT +
Sbjct: 80 VSVNNFTDLSNEEFRATFNGYRRLAAVSLADSVHAD---NDVEALPATVDWTTKGVVTPI 136
Query: 150 KNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNN-GCNGGLMDYAFQY 208
KNQ CGSCWAFS VA++EG + + TG L SLSEQ L+DC + GC+GG MDYAF+Y
Sbjct: 137 KNQQQCGSCWAFSAVASMEGQHALKTGKLVSLSEQNLVDCSAAEGDMGCSGGWMDYAFKY 196
Query: 209 IVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSV 267
++ G+ E YPY + +CE K S TI+ + DV E +L A+A+ P+SV
Sbjct: 197 VIQNRGIDTEASYPYKAIDESCEF-KRNSVGATIHSFVDVKTGDESALQNAVASIGPISV 255
Query: 268 AIEASGRDFQFYSGGVY-DGHCGTQ-LDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGY 325
AI+A+ FQFYS GVY + C T+ LDHGV AVGYG+ G Y VKNSWG WG KGY
Sbjct: 256 AIDAAQPSFQFYSSGVYNEPDCSTEILDHGVTAVGYGTLNGAPYWKVKNSWGTSWGRKGY 315
Query: 326 IRMKRNTGKPEGLCGINKMASYPI 349
I M RN + CGI ASYP+
Sbjct: 316 IFMSRNK---QNQCGIATKASYPV 336
>gi|261289785|ref|XP_002611754.1| hypothetical protein BRAFLDRAFT_284341 [Branchiostoma floridae]
gi|229297126|gb|EEN67764.1| hypothetical protein BRAFLDRAFT_284341 [Branchiostoma floridae]
Length = 327
Score = 247 bits (630), Expect = 6e-63, Method: Compositional matrix adjust.
Identities = 136/302 (45%), Positives = 187/302 (61%), Gaps = 13/302 (4%)
Query: 57 KVYESLDEKLERFEIFKDNLRHIDETNRKI----KNYWLGLNEFADLRHEEFKEMFLGLK 112
K Y+S DE+ R IF+DN + I E N++ ++Y++G+N+F DL H E+ E+ +G
Sbjct: 29 KQYKSPDEENVRRAIFRDNNQMIKEHNQEAAMGRRSYFMGMNQFGDLAHSEYLELVVGPG 88
Query: 113 PDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQ 172
S F + + +VDWR+KGAVT +K+QG CGSCWAFST ++EG +
Sbjct: 89 LLPLNLSTPSENVFESTPGLQVDDTVDWRQKGAVTPIKDQGHCGSCWAFSTTGSLEGQHF 148
Query: 173 IVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIM-EEGTC 230
+ TG L SLSEQ L+DC + N GC GGLMD AF+YI S GG+ EE YPY+ +E C
Sbjct: 149 MKTGKLVSLSEQNLLDCSRRFGNKGCEGGLMDQAFRYIKSNGGIDTEECYPYMAKDEKVC 208
Query: 231 EMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVAIEASGRDFQFYSGGVYD-GHC 288
+ K T++ Y D+ E +L++A+ P+SVAI+AS + +FY G+YD C
Sbjct: 209 DY-KTSCSGATLSSYTDIKAMDEMALMQAVGTVGPVSVAIDASHKSLRFYKSGIYDEPEC 267
Query: 289 G-TQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASY 347
T+LDHGV AVGYGS G+DY +VKNSWG WG+ GY++M RN CGI ASY
Sbjct: 268 SRTKLDHGVLAVGYGSMDGMDYWLVKNSWGSAWGDMGYVKMTRNKNNQ---CGIATKASY 324
Query: 348 PI 349
P+
Sbjct: 325 PV 326
>gi|195583187|ref|XP_002081405.1| GD10995 [Drosophila simulans]
gi|194193414|gb|EDX06990.1| GD10995 [Drosophila simulans]
Length = 341
Score = 247 bits (630), Expect = 7e-63, Method: Compositional matrix adjust.
Identities = 144/324 (44%), Positives = 198/324 (61%), Gaps = 22/324 (6%)
Query: 42 DKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIK----NYWLGLNEFA 97
D +++ + ++ + K Y+ E+ R +IF +N I + N++ ++ L +N++A
Sbjct: 23 DVVMEEWHTFKLEHRKNYQDDTEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYA 82
Query: 98 DLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDV-------VDLPKSVDWRKKGAVTHVK 150
DL H EF+++ G L ++ + E S+K V V LPKSVDWR KGAVT VK
Sbjct: 83 DLLHHEFRQLMNGFNYTLHKQLRAADE--SFKGVTFISPAHVTLPKSVDWRTKGAVTAVK 140
Query: 151 NQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYI 209
+QG CGSCWAFS+ A+EG + +G L SLSEQ L+DC Y NNGCNGGLMD AF+YI
Sbjct: 141 DQGHCGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYI 200
Query: 210 VSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVA 268
GG+ E+ YPY + +C KG + T G+ D+PQ E + +A+A P+SVA
Sbjct: 201 KDNGGIDTEKSYPYEAIDDSCHFNKG-TIGATDRGFTDIPQGDEKKMAEAVATVGPVSVA 259
Query: 269 IEASGRDFQFYSGGVY-DGHCGTQ-LDHGVAAVGYGSTR-GLDYIIVKNSWGPKWGEKGY 325
I+AS FQFYS GVY + C Q LDHGV VG+G+ G DY +VKNSWG WG+KG+
Sbjct: 260 IDASHESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESGDDYWLVKNSWGTTWGDKGF 319
Query: 326 IRMKRNTGKPEGLCGINKMASYPI 349
I+M RN E CGI +SYP+
Sbjct: 320 IKMLRN---KENQCGIASASSYPL 340
>gi|150261413|pdb|2PNS|A Chain A, 1.9 Angstrom Resolution Crystal Structure Of A Plant
Cysteine Protease Ervatamin-C Refinement With Cdna
Derived Amino Acid Sequence
gi|150261414|pdb|2PNS|B Chain B, 1.9 Angstrom Resolution Crystal Structure Of A Plant
Cysteine Protease Ervatamin-C Refinement With Cdna
Derived Amino Acid Sequence
gi|166007115|pdb|2PRE|A Chain A, Crystal Structure Of Plant Cysteine Protease Ervatamin-C
Complexed With Irreversible Inhibitor E-64 At 2.7 A
Resolution
gi|166007116|pdb|2PRE|B Chain B, Crystal Structure Of Plant Cysteine Protease Ervatamin-C
Complexed With Irreversible Inhibitor E-64 At 2.7 A
Resolution
Length = 208
Score = 247 bits (630), Expect = 7e-63, Method: Compositional matrix adjust.
Identities = 125/217 (57%), Positives = 153/217 (70%), Gaps = 10/217 (4%)
Query: 134 LPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTY 193
LP+ +DWRKKGAVT VKNQG CGSCWAFSTV+ VE INQI TGNL SLSEQ+L+DC N
Sbjct: 1 LPEQIDWRKKGAVTPVKNQGKCGSCWAFSTVSTVESINQIRTGNLISLSEQQLVDC-NKK 59
Query: 194 NNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSE 253
N+GC GG YA+QYI+ GG+ E +YPY +G C K +VV I+GY VP +E
Sbjct: 60 NHGCKGGAFVYAYQYIIDNGGIDTEANYPYKAVQGPCRAAK---KVVRIDGYKGVPHCNE 116
Query: 254 DSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVK 313
++L KA+A+QP VAI+AS + FQ Y G++ G CGT+L+HGV VGY DY IV+
Sbjct: 117 NALKKAVASQPSVVAIDASSKQFQHYKSGIFSGPCGTKLNHGVVIVGYWK----DYWIVR 172
Query: 314 NSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
NSWG WGE+GYIRMKR G GLCGI ++ YP K
Sbjct: 173 NSWGRYWGEQGYIRMKRVGG--CGLCGIARLPYYPTK 207
>gi|391332597|ref|XP_003740719.1| PREDICTED: cathepsin L-like [Metaseiulus occidentalis]
Length = 330
Score = 247 bits (630), Expect = 7e-63, Method: Compositional matrix adjust.
Identities = 152/331 (45%), Positives = 199/331 (60%), Gaps = 25/331 (7%)
Query: 30 IVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKI--- 86
VG SP + ++D+ +LF+ + K Y + + R IF+ N++ I+ N
Sbjct: 11 FVGVSPAAVDAHDEHWELFKR---QHNKTYLQ-KQDVGRRAIFEANIKKINAHNLLYDLG 66
Query: 87 -KNYWLGLNEFADLRHEEFKEMFLGLK--PDLARRKDQSHEDFSYKDVVDLPKSVDWRKK 143
+Y LGLN FAD+ +EF E + G + + AR H D + +P +VDWR +
Sbjct: 67 RSSYRLGLNGFADMTPDEF-EKYRGTRFEANEARVSKLQHRD---NRSMHVPDTVDWRTE 122
Query: 144 GAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNN-GCNGGLM 202
G VT VKNQG CGSCWAFST A+EG + +G+L SLSEQ L+DC Y N GCNGGLM
Sbjct: 123 GYVTPVKNQGVCGSCWAFSTTGALEGQHFRRSGDLVSLSEQMLVDCSAVYGNAGCNGGLM 182
Query: 203 DYAFQYIVSTGGLHKEEDYPYIMEEGTCEM-TKGESEVVTINGYHDVPQNSEDSLLKALA 261
D AF++I GGL E+ YPY ++GTC +G +T G+ DVP E++L +A
Sbjct: 183 DNAFRFIKDAGGLETEKSYPYTGKDGTCHFDARGIGAKLT--GFVDVPSRDEEALKEAAG 240
Query: 262 -NQPLSVAIEASGRDFQFYSGGVYD--GHCGTQLDHGVAAVGYGSTR-GLDYIIVKNSWG 317
P+SVAI+ASG++FQFY GVYD T LDHGV VGYG+TR G DY +VKNSWG
Sbjct: 241 VVGPVSVAIDASGQNFQFYKDGVYDEITCSSTSLDHGVLVVGYGTTRDGKDYWLVKNSWG 300
Query: 318 PKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
WG+ GYI+M RN E CGI MASYP
Sbjct: 301 SSWGQSGYIQMSRN---KENQCGIATMASYP 328
>gi|38147395|gb|AAR12010.1| cathepsin L-like proteinase [Triatoma infestans]
Length = 328
Score = 247 bits (630), Expect = 8e-63, Method: Compositional matrix adjust.
Identities = 147/323 (45%), Positives = 192/323 (59%), Gaps = 22/323 (6%)
Query: 40 SNDKLIDLF-ESWMS---KFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKN----YWL 91
S+ L D F E W++ +F K Y++ E+L R ++K+N R IDE N++ +N Y L
Sbjct: 14 SHTALHDYFPEEWLAFKAQFGKSYKNSFEELFRMNVYKENQRKIDEHNKRYENGEVSYKL 73
Query: 92 GLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVV-DLPKSVDWRKKGAVTHVK 150
+N F DL EFK + L R Q + ++ LP VDWR+KGAVT VK
Sbjct: 74 KMNHFGDLMQHEFKAL-----NKLKRSAKQQNSGEVFRATGGKLPAKVDWRQKGAVTPVK 128
Query: 151 NQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYI 209
+ G CGSCWAFS+ ++ G + L SLSEQ+L+DC Y N+GC+GG+M AFQYI
Sbjct: 129 DPGQCGSCWAFSSTGSLGGQLFLKNKKLVSLSEQQLVDCSGNYGNDGCDGGIMVQAFQYI 188
Query: 210 VSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVA 268
GG+ E YPY E+ C K +S T GY D+ Q E++L +A+A P+SVA
Sbjct: 189 KGNGGIDTEGSYPYEAEDDKCRY-KTKSVAGTDKGYVDIAQGDENALKEAVAEIGPISVA 247
Query: 269 IEASGRDFQFYSGGVYD-GHCG-TQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYI 326
I+A FQFYS G+YD C T+LDHGV VGYG+ G DY +VKNSWGP WGE GYI
Sbjct: 248 IDAGNLSFQFYSEGIYDEPFCSNTELDHGVLVVGYGTENGQDYWLVKNSWGPSWGENGYI 307
Query: 327 RMKRNTGKPEGLCGINKMASYPI 349
++ RN CGI MASYPI
Sbjct: 308 KIARNHNNH---CGIASMASYPI 327
>gi|427797099|gb|JAA64001.1| Putative cathepsin l cathepsin l, partial [Rhipicephalus
pulchellus]
Length = 331
Score = 246 bits (629), Expect = 8e-63, Method: Compositional matrix adjust.
Identities = 146/309 (47%), Positives = 184/309 (59%), Gaps = 26/309 (8%)
Query: 57 KVYESLDEKLERFEIFKDN----LRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLK 112
K YES E+ R +I+ +N RH ++ + +Y L +NEF D+ H EF G K
Sbjct: 32 KEYESDTEEYYRLKIYMENRLKIARHNEKYAKSQVSYKLAMNEFGDMLHHEFVSTRNGFK 91
Query: 113 P---DLARR-----KDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTV 164
D R + + EDF LPK+VDWRKKGAVT VKNQG CGSCW+FST
Sbjct: 92 RNYRDTPREGSFFVEPEGLEDFH------LPKTVDWRKKGAVTPVKNQGQCGSCWSFSTT 145
Query: 165 AAVEGINQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIVSTGGLHKEEDYPY 223
++EG + L SLSEQ LIDC ++ NNGC GGLMDYAF+YI + G+ E+ YPY
Sbjct: 146 GSLEGQHFRKLHKLVSLSEQNLIDCSRSFGNNGCEGGLMDYAFKYIKANKGIDTEQSYPY 205
Query: 224 IMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVAIEASGRDFQFYSGG 282
+G C K + T G+ D+P+ E+ L KA+A P+SVAI+AS FQFYS G
Sbjct: 206 NATDGVCHFNK-SAVGATDTGFVDIPEGDENKLKKAVATVGPVSVAIDASHESFQFYSEG 264
Query: 283 VYD-GHCGT-QLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCG 340
VYD C + QLDHGV VGYG+ G DY +VKNSWG WG+ GYI M RN + CG
Sbjct: 265 VYDEPECDSEQLDHGVLVVGYGTKDGQDYWLVKNSWGTTWGDGGYIYMSRNK---DNQCG 321
Query: 341 INKMASYPI 349
I ASYP+
Sbjct: 322 IASAASYPL 330
>gi|170041165|ref|XP_001848344.1| cathepsin l [Culex quinquefasciatus]
gi|167864709|gb|EDS28092.1| cathepsin l [Culex quinquefasciatus]
Length = 340
Score = 246 bits (629), Expect = 8e-63, Method: Compositional matrix adjust.
Identities = 144/319 (45%), Positives = 193/319 (60%), Gaps = 22/319 (6%)
Query: 49 ESWMS---KFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKN----YWLGLNEFADLRH 101
E W + + K Y+S E+ R +I+ N I + N++ + + L +N++ DL H
Sbjct: 25 EEWNAYKLQHRKKYDSETEERLRLKIYVQNKHKIAKHNQRFEQGQEKFRLRVNKYTDLLH 84
Query: 102 EEFKEMFLGL------KPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSC 155
EEF + G KP L K + V++PK+VDWR+KGAVT VK+QG C
Sbjct: 85 EEFVQTLNGFNRTNAKKPMLKGVKIDEPVTYIEPANVEVPKTVDWREKGAVTPVKDQGHC 144
Query: 156 GSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIVSTGG 214
GSCW+FS A+EG + TG L SLSEQ L+DC Y NNGCNGG+MD+AFQYI GG
Sbjct: 145 GSCWSFSATGALEGQHFRKTGKLVSLSEQNLVDCSTKYGNNGCNGGMMDFAFQYIKDNGG 204
Query: 215 LHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQ-PLSVAIEASG 273
+ E+ YPY + TC ++ T G+ D+PQ E +L+KA+A P+SVAI+AS
Sbjct: 205 IDTEKAYPYEAIDDTCHYNP-KAVGATDKGFVDIPQGDEKALMKAIATAGPVSVAIDASH 263
Query: 274 RDFQFYSGGV-YDGHCGTQ-LDHGVAAVGYG-STRGLDYIIVKNSWGPKWGEKGYIRMKR 330
FQFYS GV Y+ C ++ LDHGV AVGYG S G DY +VKNSWG WG++GY++M R
Sbjct: 264 ESFQFYSEGVYYEPQCDSENLDHGVLAVGYGTSEEGEDYWLVKNSWGTTWGDQGYVKMAR 323
Query: 331 NTGKPEGLCGINKMASYPI 349
N + CGI ASYP+
Sbjct: 324 NR---DNHCGIATAASYPL 339
>gi|9502426|gb|AAF88125.1|AC021043_18 Putative cysteine proteinase [Arabidopsis thaliana]
Length = 365
Score = 246 bits (629), Expect = 8e-63, Method: Compositional matrix adjust.
Identities = 132/364 (36%), Positives = 204/364 (56%), Gaps = 25/364 (6%)
Query: 11 LISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFE 70
++S F + + D I P + ++D + WM++F +VY+ EK R +
Sbjct: 1 MVSVRSVFVALTILSMDLRISQARPHVTLNEQSIVDYHQQWMTQFSRVYKDESEKEMRLK 60
Query: 71 IFKDNLRHIDE-TNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLAR-----RKDQSHE 124
+FK NL+ I+ N ++Y LG+NEF D + EEF GL+ ++ K +
Sbjct: 61 VFKKNLKFIENFNNMGNQSYTLGVNEFTDWKTEEFLATHTGLRVNVTSLSELFNKTKPSR 120
Query: 125 DFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWA------------FSTVAAV----- 167
+++ D+ +S DWR +GAVT VK QG+C ++ + V
Sbjct: 121 NWNMSDIDMEDESKDWRDEGAVTPVKYQGACPEFPTKQIRRNSLVGKQYTKLLGVLSDWG 180
Query: 168 -EGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIME 226
EG+ +I NL +LSEQ+LIDCD N GCNGG + AF+YI+ GG+ E +YPY ++
Sbjct: 181 DEGLTKISGKNLLTLSEQQLIDCDIEKNGGCNGGEFEEAFKYIIKNGGVSLETEYPYQVK 240
Query: 227 EGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDG 286
+ +C + I G+ VP ++E +LL+A+ QP+SV I+A F Y GGVY G
Sbjct: 241 KESCRANARRAPHTQIRGFQMVPSHNERALLEAVRRQPVSVLIDARADSFGHYKGGVYAG 300
Query: 287 -HCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMA 345
CGT ++H V VGYG+ GL+Y ++KNSWG WGE GY+R++R+ P+G+CGI ++A
Sbjct: 301 LDCGTDVNHAVTIVGYGTMSGLNYWVLKNSWGESWGENGYMRIRRDVEWPQGMCGIAQVA 360
Query: 346 SYPI 349
+YP+
Sbjct: 361 AYPV 364
>gi|428186189|gb|EKX55040.1| hypothetical protein GUITHDRAFT_63227 [Guillardia theta CCMP2712]
Length = 344
Score = 246 bits (629), Expect = 8e-63, Method: Compositional matrix adjust.
Identities = 146/331 (44%), Positives = 204/331 (61%), Gaps = 27/331 (8%)
Query: 40 SNDKLIDLFESWMSKFEKVY----ESLDEKLERFEIFKDNL----RHIDETNRKIKNYWL 91
++ K + + SW+ ++ K + S E FE+F+ NL +H +E N+ +++Y +
Sbjct: 19 AHQKYLSAWSSWVKEYNKEHWVDPYSSPESTRAFEVFQKNLDMIMKHNEEYNQGLQSYEM 78
Query: 92 GLNEFADLRHEEFKEMFLGLK-PDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVK 150
GLN FA L EEF +LG ++ + K + K ++P SVDWR+KGAV VK
Sbjct: 79 GLNGFAHLTFEEFSAQYLGYGGAEVEQPKTRRAGKHERKSRSEIPASVDWREKGAVAEVK 138
Query: 151 NQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYI 209
NQG+CGSCWAFS VAA+EG + + +G L SLSEQ+L+DC + N+GC GG MD AF+Y
Sbjct: 139 NQGACGSCWAFSAVAALEGAHFLNSGELISLSEQQLVDCSKKFGNHGCAGGYMDNAFEYW 198
Query: 210 V-STG-GLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLS 266
+ +TG G E+DYPY +G C+ + + TI+GY+DV Q +E LL A+AN P+S
Sbjct: 199 MNNTGHGDDSEKDYPYKGMDGKCKFS-ADGVRATISGYNDVKQGNETDLLDAVANVGPVS 257
Query: 267 VAIEASGRDFQFYSGGVYDGHCGT---QLDHGVAAVGYGST-----RGLDYIIVKNSWGP 318
VAI A G QFY GV++G GT L+HGV AVGYG+ R +DY I+KNSWG
Sbjct: 258 VAIHA-GAALQFYLRGVFNGVAGTCFGPLNHGVTAVGYGTASLRFGRKMDYWIIKNSWGM 316
Query: 319 KWGEKGYIRMKRNTGKPEGLCGINKMASYPI 349
WGEKG++R R + LCG+ ASYP+
Sbjct: 317 GWGEKGFVRFARG----KNLCGVANGASYPL 343
>gi|195334204|ref|XP_002033774.1| GM21500 [Drosophila sechellia]
gi|194125744|gb|EDW47787.1| GM21500 [Drosophila sechellia]
Length = 341
Score = 246 bits (629), Expect = 9e-63, Method: Compositional matrix adjust.
Identities = 143/324 (44%), Positives = 198/324 (61%), Gaps = 22/324 (6%)
Query: 42 DKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIK----NYWLGLNEFA 97
D +++ + ++ + K Y+ E+ R +IF +N I + N++ ++ L +N++A
Sbjct: 23 DVVMEEWHTFKLEHRKNYQDDTEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYA 82
Query: 98 DLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDV-------VDLPKSVDWRKKGAVTHVK 150
DL H EF+++ G L ++ + E S+K V V LPKSVDWR KGAVT VK
Sbjct: 83 DLLHHEFRQLMNGFNYTLHKQLRAADE--SFKGVTFISPAHVTLPKSVDWRTKGAVTAVK 140
Query: 151 NQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYI 209
+QG CGSCWAFS+ A+EG + +G L SLSEQ L+DC Y NNGCNGGLMD AF+YI
Sbjct: 141 DQGHCGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYI 200
Query: 210 VSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVA 268
GG+ E+ YPY + +C KG + T G+ D+PQ E + +A+A P++VA
Sbjct: 201 KDNGGIDTEKSYPYEAIDDSCHFNKG-TIGATDRGFTDIPQGDEKKMAEAVATVGPVAVA 259
Query: 269 IEASGRDFQFYSGGVY-DGHCGTQ-LDHGVAAVGYGSTR-GLDYIIVKNSWGPKWGEKGY 325
I+AS FQFYS GVY + C Q LDHGV VG+G+ G DY +VKNSWG WG+KG+
Sbjct: 260 IDASHESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESGEDYWLVKNSWGTTWGDKGF 319
Query: 326 IRMKRNTGKPEGLCGINKMASYPI 349
I+M RN E CGI +SYP+
Sbjct: 320 IKMLRN---KENQCGIASASSYPL 340
>gi|348531521|ref|XP_003453257.1| PREDICTED: cathepsin L-like [Oreochromis niloticus]
Length = 333
Score = 246 bits (629), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 145/315 (46%), Positives = 191/315 (60%), Gaps = 21/315 (6%)
Query: 48 FESWMSKFEKVYESLDEKLERFEIFKDN----LRHIDETNRKIKNYWLGLNEFADLRHEE 103
F +W KFEK Y+S E+ R +I+ +N L H ++ +K+Y LG+ +FAD+ +EE
Sbjct: 26 FHAWKLKFEKSYDSDSEEAHRKQIWLNNRKLVLVHNILADQGLKSYRLGMTQFADMENEE 85
Query: 104 FKEMF----LG-LKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSC 158
+K + LG L R + DLP +VDWR KG VT V+NQ CGSC
Sbjct: 86 YKRLVSRGCLGSFNTSLHHRGSTF---LRLPEGTDLPDTVDWRDKGYVTDVQNQMQCGSC 142
Query: 159 WAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIVSTGGLHK 217
WAFS + A+EG N TG L SLS+Q+L+DC ++ N+GCNGG MD+AF+YI +TGG+
Sbjct: 143 WAFSAIGALEGQNFRKTGKLVSLSKQQLVDCSQSFGNHGCNGGWMDWAFKYIQATGGIDT 202
Query: 218 EEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVAIEASGRDF 276
E YPY EEG C E+ T GY DV N ED+L +A+A P+S+A++AS F
Sbjct: 203 EASYPYEAEEGNCHYNP-ETVGATCTGYVDVSPN-EDALKEAVATIGPISIAMDASHESF 260
Query: 277 QFYSGGVYD-GHCGT-QLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGK 334
QFY GVYD C T + H + AVGYG+ G DY +VKNS+G WGEKGYI+M RN
Sbjct: 261 QFYQSGVYDEPSCITSRFSHAMLAVGYGTENGHDYWLVKNSFGLGWGEKGYIKMSRNKSN 320
Query: 335 PEGLCGINKMASYPI 349
CGI ASYP+
Sbjct: 321 Q---CGIASKASYPL 332
>gi|405971604|gb|EKC36431.1| Cathepsin L [Crassostrea gigas]
Length = 384
Score = 246 bits (628), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 144/304 (47%), Positives = 191/304 (62%), Gaps = 18/304 (5%)
Query: 56 EKVYESLDEKLERFEIFKDNLRHIDETNRKI----KNYWLGLNEFADLRHEEFKEMFLGL 111
+K YE +E+ RFEIF++N+ I++ N+ K+Y+LG+N+F DL + EF F GL
Sbjct: 87 DKSYEDHEEESRRFEIFRENVLRIEKHNKLFHLGKKSYYLGVNQFTDLEYAEFVN-FNGL 145
Query: 112 K-PDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGI 170
K +L K SH S ++V +P SVDWR KG VT VKNQG+CGSCWAFS ++EG
Sbjct: 146 KMTNLNNTKCSSH--LSANNIV-VPDSVDWRSKGYVTKVKNQGACGSCWAFSATGSLEGQ 202
Query: 171 NQIVTGNLASLSEQELIDCDNTYNN-GCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGT 229
G L LSE +L+DC ++ N GCNGG M+ AF+Y+ S GG+ E DYPY + T
Sbjct: 203 YFRKNGKLVPLSESQLVDCSGSFGNEGCNGGFMENAFKYVKSVGGIESESDYPYKARQRT 262
Query: 230 CEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVAIEASGRDFQFYSGGVYDGH- 287
C K + + T++G DV SE SL + ++ P+SVAI+A FQ Y+GGVYD
Sbjct: 263 CAFDKTKV-IATVSGCVDVESGSESSLKEVVSEVGPVSVAIDAGHSSFQLYAGGVYDEPL 321
Query: 288 CGT-QLDHGVAAVGYG-STRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMA 345
C T +L+HGV VGYG S +G DY IVKNSWG +WG +GYI+M RN CGI A
Sbjct: 322 CSTSRLNHGVLCVGYGTSLQGKDYWIVKNSWGVRWGVEGYIKMSRNKNNQ---CGIASEA 378
Query: 346 SYPI 349
SYP+
Sbjct: 379 SYPL 382
>gi|219884655|gb|ACL52702.1| unknown [Zea mays]
gi|413916718|gb|AFW56650.1| thiol protease SEN102 [Zea mays]
Length = 349
Score = 246 bits (628), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 132/321 (41%), Positives = 185/321 (57%), Gaps = 20/321 (6%)
Query: 44 LIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEE 103
L+D F++W +++ + Y + +E +RF ++ +N++ I+ N+ +Y LG N+FADL EE
Sbjct: 33 LLDRFQAWQAEYNRTYATPEEFQQRFMVYSENVKFIETMNQPGSSYELGENQFADLTEEE 92
Query: 104 FKEMFL-------------GLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVK 150
FK+ +L L D R S + + P SVDWR KGAVT VK
Sbjct: 93 FKDTYLMKLDNVASSPEAMALTVDTMNRAGTS----GGSNTNEAPNSVDWRTKGAVTPVK 148
Query: 151 NQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLM-DYAFQYI 209
+Q CGSCWAF+ VA++EG+++I TG L SLSEQE++DCD NN G A +++
Sbjct: 149 SQQHCGSCWAFAAVASIEGVHKIKTGRLVSLSEQEIVDCDRGGNNHGCHGGHSSSAMEWV 208
Query: 210 VSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAI 269
GGL E DYPY+ +G C K I G V +E +L A+A +P++V+I
Sbjct: 209 TRNGGLTTESDYPYVGRQGQCMSDKLGHHAAKIRGRQAVQGKNEGALQHAVAGRPVAVSI 268
Query: 270 EASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGS-TRGLDYIIVKNSWGPKWGEKGYIRM 328
AS R FQFY G++ G C T +H V VGYG+ G Y IVKNSWG +WGEKGY+RM
Sbjct: 269 NAS-RAFQFYKRGIFSGPCNTTRNHAVTVVGYGANASGHKYWIVKNSWGERWGEKGYVRM 327
Query: 329 KRNTGKPEGLCGINKMASYPI 349
+R EG+CGI Y +
Sbjct: 328 QRGVRAREGVCGIAIAPFYAV 348
>gi|194757786|ref|XP_001961143.1| GF13722 [Drosophila ananassae]
gi|190622441|gb|EDV37965.1| GF13722 [Drosophila ananassae]
Length = 417
Score = 246 bits (628), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 145/305 (47%), Positives = 192/305 (62%), Gaps = 22/305 (7%)
Query: 62 LDEKLERF--EIFKDNLRHIDETNR-----KIKNYWLGLNEFADLRHEEFKEMFLGLKPD 114
LDE ERF +IF +N I + N+ K+ +Y L +N++AD+ H EF+++ G
Sbjct: 117 LDETEERFRLKIFNENKHKIAKHNQLWASGKV-SYKLAVNKYADMLHHEFRQLMNGFNYT 175
Query: 115 L---ARRKDQSHEDFSY--KDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEG 169
L R D+S + ++ + V LPKSVDWR KGAVT VK+QG CGSCWAFS+ A+EG
Sbjct: 176 LHKELRAADESFKGVTFISPEHVTLPKSVDWRDKGAVTGVKDQGHCGSCWAFSSTGALEG 235
Query: 170 INQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEG 228
+ +G L SLSEQ L+DC Y NNGCNGGLMD AF+YI GG+ E+ YPY +
Sbjct: 236 QHYRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGGIDTEKSYPYEALDD 295
Query: 229 TCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVAIEASGRDFQFYSGGVY-DG 286
+C KG + T G+ D+PQ +E L +A+A P+SVAI+AS FQFYS GVY +
Sbjct: 296 SCHFNKG-TIGATDRGFVDIPQGNEKKLAEAVATIGPVSVAIDASHESFQFYSEGVYVEP 354
Query: 287 HCGTQ-LDHGVAAVGYGSTR-GLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKM 344
C Q LDHGV VG+G+ G DY +VKNSWG WG+KG+I+M RN + CGI
Sbjct: 355 ACDAQNLDHGVLVVGFGTDESGQDYWLVKNSWGTTWGDKGFIKMLRN---KDNQCGIASA 411
Query: 345 ASYPI 349
+SYP+
Sbjct: 412 SSYPL 416
>gi|219112639|ref|XP_002178071.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217410956|gb|EEC50885.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 360
Score = 246 bits (628), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 138/331 (41%), Positives = 197/331 (59%), Gaps = 25/331 (7%)
Query: 43 KLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRK--IKNYWLGLNEFADLR 100
+L+ F+ W+ +K+Y+S D K+ER I+ +N I+ N + ++ LG NEF+D+
Sbjct: 29 ELMSKFKGWVDFHQKMYDSHDNKMERLNIWLNNDERIEAHNNQNPTPSFALGHNEFSDMT 88
Query: 101 HEEFKEMF-LG---------------LKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKG 144
+EF + F LG + PD + + + + LP ++W + G
Sbjct: 89 EDEFAQYFRLGPYASVRQKEAAQAKIMDPDQQISTAERRRLWEEQAPLTLPDYMNWVQAG 148
Query: 145 AVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDY 204
AVT +KNQG+CGSCWAFST A+EG + TG L +LSEQ LIDCD + GCNGGLMD
Sbjct: 149 AVTPMKNQGACGSCWAFSTTGALEGAKFLKTGELVALSEQHLIDCDKV-DLGCNGGLMDN 207
Query: 205 AFQYIVSTGGLHKEEDYPYIMEEG-TCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQ 263
AF++ +S GL EE+YPY+ ++ TC + E + + DVP E +LL A+A Q
Sbjct: 208 AFKFDMSEAGLCSEEEYPYLAKQSRTCMTNCTKVEGSGVKTFIDVPPGDEKALLSAIAMQ 267
Query: 264 PLSVAIEASGRDFQFYSGGVY-DGHCGTQ--LDHGVAAVGYGSTRGLD--YIIVKNSWGP 318
P+SVAI+AS FQFY GV D CG++ +DHGV AVGYG+ + Y +VKNSWG
Sbjct: 268 PISVAIQASQFVFQFYKNGVLTDDSCGSRASIDHGVLAVGYGTDVDTNEPYFLVKNSWGE 327
Query: 319 KWGEKGYIRMKRNTGKPEGLCGINKMASYPI 349
WG+KGY+++ R G+C I KMAS+P+
Sbjct: 328 TWGDKGYVKLGRGGKNEFGMCAILKMASFPV 358
>gi|357122137|ref|XP_003562772.1| PREDICTED: fruit bromelain-like [Brachypodium distachyon]
Length = 358
Score = 246 bits (628), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 140/324 (43%), Positives = 192/324 (59%), Gaps = 20/324 (6%)
Query: 44 LIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIK-NYWLGLNEFADLRHE 102
++D F ++ + + + Y S +E+L RFE+++ N+ +I+ NR+ Y LG N+FADL +
Sbjct: 36 MMDRFRAFQATYNRTYASPEERLRRFEVYRRNVDYIEAMNRRGDLTYELGENQFADLTVQ 95
Query: 103 EFKEMF-----LGLKPDLARRKDQ-------SHEDFS--YKDVVDL--PKSVDWRKKGAV 146
EF+ M+ + +PD RR+ ED Y D + P SVDWR KGAV
Sbjct: 96 EFRAMYTMPARVDSRPDAWRRRQMITTLAGPVTEDGGSYYSDAWEEAGPTSVDWRSKGAV 155
Query: 147 THVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAF 206
T VK+QG CG CWAF+TVA +EG+++I TG L SLSEQEL+D + ++GC GGL + A
Sbjct: 156 TPVKDQGGCGCCWAFATVATIEGLHKIKTGQLVSLSEQELVD-CDDADDGCGGGLPEIAM 214
Query: 207 QYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLS 266
+++ GGL E +YPY + G C+ K + I V NSE L +A+A QP++
Sbjct: 215 EWVAHNGGLTTEANYPYTGKAGKCDRGKASNHAAKIAAAQMVRANSEAELERAVARQPVA 274
Query: 267 VAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGS-TRGLDYIIVKNSWGPKWGEKGY 325
VAI A FY GVY G C + DH V VGYG+ +G Y I+KNSW WGEKGY
Sbjct: 275 VAINAPD-SLMFYKSGVYSGPCTAEFDHAVTVVGYGADNKGHKYWIIKNSWAETWGEKGY 333
Query: 326 IRMKRNTGKPEGLCGINKMASYPI 349
RM+R EGLCGI ASYP+
Sbjct: 334 GRMQRGVAAKEGLCGIATHASYPV 357
>gi|226499806|ref|NP_001151335.1| cysteine protease 1 [Zea mays]
gi|195645896|gb|ACG42416.1| cysteine protease 1 precursor [Zea mays]
Length = 258
Score = 246 bits (628), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 130/264 (49%), Positives = 170/264 (64%), Gaps = 16/264 (6%)
Query: 93 LNEFADLRHEEFKEMFLGLKPDLARRKDQS---HEDFSYKDVVDLPKSVDWRKKGAVTHV 149
LNEFAD+ ++EF M+ GL+P A K + + + + D D ++VDWR+KGAVT +
Sbjct: 3 LNEFADMTNDEFMAMYTGLRPVPAGAKKMAGFKYGNVTLSDADDDQQTVDWRQKGAVTGI 62
Query: 150 KNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYI 209
K+Q CG CWAF+ VAAVEGI+QI TGNL SLSEQ+++DCD NNGCNGG +D AFQYI
Sbjct: 63 KDQRQCGCCWAFAAVAAVEGIHQITTGNLVSLSEQQVLDCDTDGNNGCNGGYIDNAFQYI 122
Query: 210 VSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAI 269
V GGL E+ YPY + C+ + V I+GY DVP E +L A+ANQP+SVAI
Sbjct: 123 VGNGGLATEDAYPYTAAQAMCQSVQ---PVAAISGYQDVPSGDEAALAAAVANQPVSVAI 179
Query: 270 EASGRDFQFYSGGVYD-GHCGT--QLDHGVAAVGYGSTR-GLDYIIVKNSWGPKWGEKGY 325
+A +FQ Y GGV C T L+H V AVGYG+ G Y ++KN WG WGE GY
Sbjct: 180 DA--HNFQLYGGGVMTAASCSTPPNLNHAVTAVGYGTAEDGTPYWLLKNQWGQNWGEGGY 237
Query: 326 IRMKRNTGKPEGLCGINKMASYPI 349
+R++R CG+ + ASYP+
Sbjct: 238 LRLERGANA----CGVAQQASYPV 257
>gi|326501772|dbj|BAK02675.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 333
Score = 246 bits (627), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 138/320 (43%), Positives = 184/320 (57%), Gaps = 13/320 (4%)
Query: 38 LTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRK----IKNYWLGL 93
L + KL ++ W K Y +E + R ++ NL+ + E N + + YWLG+
Sbjct: 18 LAFDAKLNQHWKLWKEANNKRYSDAEEHVRR-ATWEGNLQKVQEHNLQADLGVHTYWLGM 76
Query: 94 NEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQG 153
N++AD+ EF ++ G + ++ Q FS+ + LP +VDWR KG VT VK+QG
Sbjct: 77 NKYADMTVTEFVKVMNGYNATMRGQRTQDRHTFSFNSKIALPDTVDWRDKGYVTDVKDQG 136
Query: 154 SCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNN-GCNGGLMDYAFQYIVST 212
CGSCWAFST A+EG + TG L SLSEQ L+DC N GCNGGLMD AF+YI
Sbjct: 137 QCGSCWAFSTTGALEGQHFKQTGKLVSLSEQNLVDCSGKQGNMGCNGGLMDQAFEYIKEN 196
Query: 213 GGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVAIEA 271
G+ E+ YPY + C K + T G+ D+ E +L +A+A P+SVAI+A
Sbjct: 197 NGIDTEDSYPYEAVDNQCRF-KAANVGATDTGFTDITSKDESALQQAVATVGPISVAIDA 255
Query: 272 SGRDFQFYSGGVY-DGHCG-TQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMK 329
FQ Y GVY + C T+LDHGV AVGYG+ G DY +VKNSWG WG+KGYI+M
Sbjct: 256 GHTSFQLYKHGVYNEPFCSQTRLDHGVLAVGYGTDSGKDYWLVKNSWGEGWGDKGYIKMT 315
Query: 330 RNTGKPEGLCGINKMASYPI 349
RN CGI ASYP+
Sbjct: 316 RN---KRNQCGIATAASYPL 332
>gi|357446979|ref|XP_003593765.1| Cysteine proteinase [Medicago truncatula]
gi|355482813|gb|AES64016.1| Cysteine proteinase [Medicago truncatula]
Length = 364
Score = 246 bits (627), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 144/295 (48%), Positives = 188/295 (63%), Gaps = 10/295 (3%)
Query: 60 ESLDEKLERFEIFKDNLRHIDE-TNRKIKNYWLGLNEFADLRHEEFKEMFLGLK--PDLA 116
+ + E +R IFK+NL +I+ N K+Y LGLN+++DL +EF GLK L+
Sbjct: 74 DKISELEKRKRIFKNNLEYIENFNNAGNKSYKLGLNQYSDLTSDEFLASHTGLKVSKQLS 133
Query: 117 RRKDQSHE-DFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVT 175
K +S F+ D D+P + DWR++GAVT VK+QGSCG CWAFS VAAVEG +I T
Sbjct: 134 SSKMRSAAVPFNLND--DVPTNFDWRQQGAVTDVKDQGSCGCCWAFSVVAAVEGAVKINT 191
Query: 176 GNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKG 235
G L SLSEQ+L+DCD N+GC+GG MD AF+YI+ G+ E DYPY TC++
Sbjct: 192 GELISLSEQQLVDCDER-NSGCHGGNMDSAFKYIIQK-GIVSEADYPYQEGSQTCQLNDQ 249
Query: 236 ESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHG 295
I + DVP N E LL+A+A QP+SV IE G +FQ Y G VY G CG ++H
Sbjct: 250 MKFEAQITNFIDVPANDEQQLLQAVAQQPVSVGIEV-GDEFQHYMGDVYSGTCGQSMNHA 308
Query: 296 VAAVGYG-STRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPI 349
V AVGYG S G Y ++KNSWG WGE+GY+++ R +G+P G CGI ASYPI
Sbjct: 309 VTAVGYGVSEDGTKYWLIKNSWGKGWGEEGYMKLLRESGEPGGQCGIAAHASYPI 363
>gi|66812702|ref|XP_640530.1| counting factor associated protein [Dictyostelium discoideum AX4]
gi|74897159|sp|Q54TR1.1|CFAD_DICDI RecName: Full=Counting factor associated protein D; Flags:
Precursor
gi|60468561|gb|EAL66564.1| counting factor associated protein [Dictyostelium discoideum AX4]
Length = 531
Score = 246 bits (627), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 133/325 (40%), Positives = 187/325 (57%), Gaps = 19/325 (5%)
Query: 38 LTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFA 97
L ++ +LF+ + +++ K Y S DE ERF FK + I N K +Y LG+N +A
Sbjct: 215 LAKEEQASNLFKEYKAQYNKEYSSQDEHDERFINFKAARKIIATHNAKESSYKLGMNHYA 274
Query: 98 DLRHEEFKEMFLGLKPDLARRK----DQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQG 153
DL ++EF + +KP +AR D H+D S + + P +VDWR + VT VK+QG
Sbjct: 275 DLSNKEFNTL---VKPKVARPSVTGADSVHDDESLRSI---PSTVDWRNQNCVTPVKDQG 328
Query: 154 SCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDN-TYNNGCNGGLMDYAFQYIVST 212
CGSCW F + ++EG N + G L SLSEQ+L+DC T + GC GG AFQY++
Sbjct: 329 ICGSCWTFGSTGSLEGTNCVTNGELVSLSEQQLVDCAILTGSQGCGGGFASSAFQYVMEI 388
Query: 213 GGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQ-PLSVAIEA 271
G L E +YPY+M+ G C V+I GY +V SE +L A+A P+++AI+A
Sbjct: 389 GSLATESNYPYLMQNGLCRDRTVTPSGVSITGYVNVTSGSESALQNAIATTGPVAIAIDA 448
Query: 272 SGRDFQFYSGGVYDGHCGTQ----LDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIR 327
S DF++Y GVY+ LDH V A+GYG+ +G DY +VKNSW WG GY+
Sbjct: 449 SVDDFRYYMSGVYNNPACKNGLDDLDHEVLAIGYGTYQGQDYFLVKNSWSTNWGMDGYVY 508
Query: 328 MKRNTGKPEGLCGINKMASYPIKKK 352
M RN LCG++ A+YPI K
Sbjct: 509 MARNDNN---LCGVSSQATYPIPTK 530
>gi|195484843|ref|XP_002090843.1| GE12574 [Drosophila yakuba]
gi|194176944|gb|EDW90555.1| GE12574 [Drosophila yakuba]
Length = 341
Score = 245 bits (626), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 143/324 (44%), Positives = 198/324 (61%), Gaps = 22/324 (6%)
Query: 42 DKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIK----NYWLGLNEFA 97
D +++ + ++ + K Y+ E+ R +IF +N I + N++ ++ L +N++A
Sbjct: 23 DVVMEEWHTFKLEHRKNYQDDTEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYA 82
Query: 98 DLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDV-------VDLPKSVDWRKKGAVTHVK 150
DL H EF+++ G L ++ + D S+K V V LPKSVDWR KGAVT VK
Sbjct: 83 DLLHHEFRQLMNGFNYTLHKQLRAT--DDSFKGVTFISPAHVTLPKSVDWRSKGAVTAVK 140
Query: 151 NQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYI 209
+QG CGSCWAFS+ A+EG + +G L SLSEQ L+DC Y NNGCNGGLMD AF+YI
Sbjct: 141 DQGHCGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYI 200
Query: 210 VSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVA 268
GG+ E+ YPY + +C KG + T G+ D+PQ E + +A+A P+SVA
Sbjct: 201 KDNGGIDTEKSYPYEAIDDSCHFNKG-TIGATDRGFTDIPQGDEKKMAEAVATVGPVSVA 259
Query: 269 IEASGRDFQFYSGGVY-DGHCGTQ-LDHGVAAVGYGSTR-GLDYIIVKNSWGPKWGEKGY 325
I+AS FQFYS GVY + C Q LDHGV VG+G+ G DY +VKNSWG WG+KG+
Sbjct: 260 IDASHESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESGDDYWLVKNSWGTTWGDKGF 319
Query: 326 IRMKRNTGKPEGLCGINKMASYPI 349
I+M RN + CGI +SYP+
Sbjct: 320 IKMLRN---KDNQCGIASASSYPL 340
>gi|255557851|ref|XP_002519955.1| cysteine protease, putative [Ricinus communis]
gi|223541001|gb|EEF42559.1| cysteine protease, putative [Ricinus communis]
Length = 321
Score = 245 bits (626), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 142/352 (40%), Positives = 194/352 (55%), Gaps = 38/352 (10%)
Query: 1 MALSSQFKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYE 60
MALS + K + I+ + F +S A L + D L++ E WM++ + Y+
Sbjct: 1 MALSLE-KKLAIALLVVFSTWASQAM--------ARQLINEDALVEKHEQWMARHGRTYQ 51
Query: 61 SLDEKLERFEIFKDNLRHIDETNRKI-KNYWLGLNEFADLRHEEFKEMFLGLKPDLARRK 119
+EK RF+IFK NL +ID N+ + Y LGLN FADL HEE+ + K
Sbjct: 52 DSEEKERRFQIFKSNLEYIDNFNKASNQTYQLGLNNFADLSHEEYVATYTARKMP----- 106
Query: 120 DQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLA 179
V++P+S+DWR GAVT +KNQ CG CWAFS AAVEGI N
Sbjct: 107 ------------VEVPESIDWRDHGAVTPIKNQYQCGCCWAFSAAAAVEGI----VANGV 150
Query: 180 SLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEV 239
SLS Q+L+DC + N GC GG M+ AF YI+ G+ E DYPY + C ++
Sbjct: 151 SLSAQQLLDCVSD-NQGCKGGWMNNAFNYIIQNQGIALETDYPYQQMQQMCSSRMAAAQ- 208
Query: 240 VTINGYHDVPQNSEDSLLKALANQPLSVAIEA-SGRDFQFYSGGVYDGH-CGTQLDHGVA 297
I+G+ DV E++L++A+A QP+SV I+A S +F+ Y GV+ CG H V
Sbjct: 209 --ISGFEDVTPKDEEALMRAVAKQPVSVTIDATSNPNFKLYKEGVFTAAGCGNGHSHAVT 266
Query: 298 AVGYGSTR-GLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
VGYG++ G Y + KNSWG WGE GY+R++R+ G G CGI ASYP
Sbjct: 267 LVGYGTSEDGTKYWLAKNSWGETWGESGYMRLQRDIGLEGGPCGIALYASYP 318
>gi|313221001|emb|CBY31833.1| unnamed protein product [Oikopleura dioica]
gi|313229611|emb|CBY18426.1| unnamed protein product [Oikopleura dioica]
Length = 362
Score = 245 bits (626), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 147/323 (45%), Positives = 188/323 (58%), Gaps = 20/323 (6%)
Query: 43 KLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRK----IKNYWLGLNEFAD 98
+LI+ F+ W FEK YES+++++ER + N+ HI+E N + K + LG+N++ D
Sbjct: 33 RLINEFKQWKDAFEKEYESIEQEIERMGTWMKNMLHIEEHNFQHSLGKKTFTLGMNKYGD 92
Query: 99 LRHEEFKEMFLGL--KPDLARRKDQSHEDFSYKDVVD-----LPKSVDWRKKGAVTHVKN 151
EEF + G R+ HED Y D VD L KSVDWR+KGAVT VK+
Sbjct: 93 QSSEEFAATYNGFLHAEGQTRKLFGLHEDAFYLDWVDADESKLDKSVDWREKGAVTEVKD 152
Query: 152 QGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIV 210
QG CGSCW+FS A+EG V G L LSEQ L+DC N GCNGGLMD AFQY+
Sbjct: 153 QGQCGSCWSFSATGALEGQMAQVFGKLPDLSEQNLVDCSRPEGNQGCNGGLMDAAFQYVK 212
Query: 211 STGGLHKEEDYPYI-MEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVA 268
GL E+ YPY ++ C K E G+ +P+ +E +L ALA P+SVA
Sbjct: 213 DQDGLDGEDWYPYEGVDNKECRYDKSHRE-ADDTGFKMIPEGNEKALKHALAKVGPVSVA 271
Query: 269 IEASGRDFQFYSGGV-YDGHCGTQ-LDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYI 326
I+AS FQFY GV Y+ +C + LDHGV AVGYG+ G Y +VKNSW WG+ GYI
Sbjct: 272 IDASNPSFQFYQSGVYYEPNCSPENLDHGVLAVGYGTEDGEHYYLVKNSWSEAWGDNGYI 331
Query: 327 RMKRNTGKPEGLCGINKMASYPI 349
+M RN E CGI A YPI
Sbjct: 332 KMARNK---ENHCGIASYAVYPI 351
>gi|157829826|pdb|1AEC|A Chain A, Crystal Structure Of Actinidin-E-64 Complex+
Length = 218
Score = 245 bits (626), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 118/218 (54%), Positives = 152/218 (69%), Gaps = 2/218 (0%)
Query: 134 LPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTY 193
LP VDWR GAV +K+QG CG CWAFS +A VEGIN+IVTG L SLSEQELIDC T
Sbjct: 1 LPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSEQELIDCGRTQ 60
Query: 194 NN-GCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNS 252
N GCNGG + FQ+I++ GG++ EE+YPY ++G C + + VTI+ Y +VP N+
Sbjct: 61 NTRGCNGGYITDGFQFIINNGGINTEENYPYTAQDGECNVDLQNEKYVTIDTYENVPYNN 120
Query: 253 EDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIV 312
E +L A+ QP+SVA++A+G F+ YS G++ G CGT +DH V VGYG+ G+DY IV
Sbjct: 121 EWALQTAVTYQPVSVALDAAGDAFKQYSSGIFTGPCGTAIDHAVTIVGYGTEGGIDYWIV 180
Query: 313 KNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
KNSW WGE+GY+R+ RN G G CGI M SYP+K
Sbjct: 181 KNSWDTTWGEEGYMRILRNVGG-AGTCGIATMPSYPVK 217
>gi|194883222|ref|XP_001975702.1| GG20414 [Drosophila erecta]
gi|190658889|gb|EDV56102.1| GG20414 [Drosophila erecta]
Length = 341
Score = 245 bits (626), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 143/324 (44%), Positives = 198/324 (61%), Gaps = 22/324 (6%)
Query: 42 DKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIK----NYWLGLNEFA 97
D +++ + ++ + K Y+ E+ R +IF +N I + N++ ++ L +N++A
Sbjct: 23 DVVMEEWHTFKLEHRKNYQDDTEERFRLKIFNENKHKIAKHNQRYAEGKVSFKLAVNKYA 82
Query: 98 DLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDV-------VDLPKSVDWRKKGAVTHVK 150
DL H EF+++ G L ++ + D S+K V V LPKSVDWR KGAVT VK
Sbjct: 83 DLLHHEFRQLMNGFNYTLHKQLRST--DDSFKGVTFISPAHVTLPKSVDWRTKGAVTAVK 140
Query: 151 NQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYI 209
+QG CGSCWAFS+ A+EG + +G L SLSEQ L+DC Y NNGCNGGLMD AF+YI
Sbjct: 141 DQGHCGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYI 200
Query: 210 VSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVA 268
GG+ E+ YPY + +C KG + T G+ D+PQ E + +A+A P++VA
Sbjct: 201 KDNGGIDTEKSYPYEAIDDSCHFNKG-AIGATDRGFTDIPQGDEKKMAEAVATVGPVAVA 259
Query: 269 IEASGRDFQFYSGGVY-DGHCGTQ-LDHGVAAVGYGSTR-GLDYIIVKNSWGPKWGEKGY 325
I+AS FQFYS GVY + C Q LDHGV VGYG+ G DY +VKNSWG WG+KG+
Sbjct: 260 IDASHESFQFYSEGVYNEPQCDAQNLDHGVLVVGYGTDESGDDYWLVKNSWGTTWGDKGF 319
Query: 326 IRMKRNTGKPEGLCGINKMASYPI 349
I+M RN + CGI +SYP+
Sbjct: 320 IKMLRN---KDNQCGIASASSYPL 340
>gi|340368358|ref|XP_003382719.1| PREDICTED: digestive cysteine proteinase 2-like [Amphimedon
queenslandica]
Length = 329
Score = 245 bits (625), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 134/307 (43%), Positives = 183/307 (59%), Gaps = 10/307 (3%)
Query: 48 FESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKNY--WLGLNEFADLRHEEFK 105
F+ W K+ K YE+ + +L R I++ N + ++ N + + +NEFADL EF
Sbjct: 23 FQDWKVKYNKAYETKETELARQVIWESNKKFVENHNANSDKFGFTVAMNEFADLGAGEFA 82
Query: 106 EMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVA 165
++ G+ P + + + + L SVDWRK GAVT VKNQG CG+CWAFS
Sbjct: 83 NIYNGIIPHPPSYNNTNTFKRTVRSTFALADSVDWRKSGAVTGVKNQGKCGACWAFSATG 142
Query: 166 AVEGINQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIVSTGGLHKEEDYPYI 224
A+EG + I TG L SLSEQ+L+DC +++ NNGC GGLMD AF+Y+ + G EE YPY+
Sbjct: 143 ALEGQHFINTGTLISLSEQQLMDCSSSFGNNGCKGGLMDNAFRYLETVAGDMTEEAYPYL 202
Query: 225 MEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVAIEASGRDFQFYSGGV 283
E GTC E++V Y D+P+ ED+L +A+A P+SV+I + FQ Y GV
Sbjct: 203 AEVGTCRYNSSEAKVKNT-VYKDIPEGDEDALQEAVATIGPISVSINSEHSSFQLYDQGV 261
Query: 284 -YDGHC-GTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGI 341
Y+ C ++LDHGV +GYG++ DY +VKNSWG WG GYI M RN E CGI
Sbjct: 262 YYEPTCSSSKLDHGVLVIGYGTSDNNDYWLVKNSWGTNWGMDGYIMMSRN---KENNCGI 318
Query: 342 NKMASYP 348
ASYP
Sbjct: 319 ATRASYP 325
>gi|33348836|gb|AAQ16118.1| cathepsin L-like cysteine proteinase B [Rhipicephalus
haemaphysaloides haemaphysaloides]
Length = 335
Score = 245 bits (625), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 145/304 (47%), Positives = 183/304 (60%), Gaps = 16/304 (5%)
Query: 57 KVYESLDEKLERFEIFKDN----LRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLK 112
K Y S E+ R +I+ +N RH ++ + +Y L +NEF DL H EF G K
Sbjct: 36 KDYASDTEEYYRLKIYMENRLKIARHNEKYAKSQVSYKLAMNEFGDLLHHEFVSTRNGFK 95
Query: 113 P---DLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEG 169
D R E ++D+ LPK+VDWRKKGAVT VKNQG CGSCWAFST ++EG
Sbjct: 96 RNYRDSPREGSFFVEPEGFEDL-QLPKTVDWRKKGAVTPVKNQGQCGSCWAFSTTGSLEG 154
Query: 170 INQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEG 228
+ T L SLSEQ L+DC ++ NNGC GGLMD AF+YI S G+ E YPY +G
Sbjct: 155 PHFRKTRKLVSLSEQNLVDCSRSFGNNGCEGGLMDNAFKYIKSNKGIDTEWSYPYNATDG 214
Query: 229 TCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVAIEASGRDFQFYSGGVYD-G 286
C + + T G+ D+P+ E+ L KA+A P+SVAI+AS FQFYS GVYD
Sbjct: 215 VCHFNRSDVG-ATDTGFVDIPEGDENKLKKAVAAVGPVSVAIDASHESFQFYSEGVYDEP 273
Query: 287 HCGT-QLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMA 345
C + QLDHGV VGYG+ G DY +VKNSWG WG++GYI M RN + CGI A
Sbjct: 274 ECSSEQLDHGVLVVGYGTKDGQDYWLVKNSWGTTWGDEGYIYMTRNK---DNQCGIASSA 330
Query: 346 SYPI 349
SYP+
Sbjct: 331 SYPL 334
>gi|281206749|gb|EFA80934.1| counting factor associated protein [Polysphondylium pallidum PN500]
Length = 530
Score = 245 bits (625), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 133/321 (41%), Positives = 188/321 (58%), Gaps = 14/321 (4%)
Query: 37 DLTSNDKLI-DLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNE 95
D+ + DK+ D FE + + ++KVY +E ERF +K N I N + +Y L +N
Sbjct: 215 DIYNKDKMTKDEFEQFKTTYDKVYAHDEEHSERFATYKQNREMIIAHNTQESSYKLAMNH 274
Query: 96 FADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKD-VVDLPKSVDWRKKGAVTHVKNQGS 154
F D+ EEF+ L +KP + R D D ++LP +VDWR++G VT VK+QG
Sbjct: 275 FGDMTAEEFE---LKIKPRVPRPDTNGAHDVHDNDRTINLPATVDWRQQGCVTRVKDQGV 331
Query: 155 CGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNT-YNNGCNGGLMDYAFQYIVSTG 213
CGSCW F + ++EG++ + TG L SLSEQ+L+DC + GCNGG AFQYI++ G
Sbjct: 332 CGSCWTFGSTGSLEGVSCLATGKLVSLSEQQLVDCAYLGQSQGCNGGFASDAFQYIMNFG 391
Query: 214 GLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQ-PLSVAIEAS 272
G+ E YPY+M+ G C+ + + + + Y +V SE +L A+A P+++AI+AS
Sbjct: 392 GIAYESTYPYLMQNGYCKDSSSQLSNIKVKSYVNVTSFSEPALQNAVATVGPVAIAIDAS 451
Query: 273 GRDFQFYSGGV-YDGHCGT---QLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRM 328
DF+FYS GV Y C LDH V AVGYG+ G DY IVKNSW +G +GYI M
Sbjct: 452 APDFRFYSSGVYYSSVCKNGLDDLDHEVLAVGYGTLNGADYWIVKNSWSTHYGAEGYILM 511
Query: 329 KRNTGKPEGLCGINKMASYPI 349
RN G CG+ +YP+
Sbjct: 512 SRNRGNN---CGVASQPTYPV 529
>gi|300122868|emb|CBK23875.2| unnamed protein product [Blastocystis hominis]
Length = 316
Score = 245 bits (625), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 141/306 (46%), Positives = 193/306 (63%), Gaps = 21/306 (6%)
Query: 47 LFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKE 106
LF+++ +K+ K Y S E+ R ++ N+ I++ N ++ LG+ FAD+ + EF
Sbjct: 26 LFQTFEAKYGKNYLS-SEREYRKKVLAYNMDWIEKFNSDEHSFTLGMTPFADMTNTEFAT 84
Query: 107 MFLG--LKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTV 164
L +K L ++ + + + + S+DWR+KGAVT VKNQGSCGSCWAFS
Sbjct: 85 SKLCGCMKKPLNHKQARVLNNMAVE-------SIDWREKGAVTPVKNQGSCGSCWAFSAT 137
Query: 165 AAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYI 224
A+EG N + TG L SLSEQ+L+DCD T + GC GG MD AF+Y++ GL EEDYPY
Sbjct: 138 GALEGGNFVATGKLVSLSEQQLVDCD-TEDAGCGGGFMDTAFEYVMKK-GLCTEEDYPYH 195
Query: 225 MEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVY 284
++ C+ + S V++I GY DVP N +L +AL P+SVAI+A FQ Y+GGV
Sbjct: 196 AKDEDCKDDQCTS-VISITGYEDVPANDGVALKQALTKAPVSVAIQADSFVFQMYTGGVL 254
Query: 285 DG-HCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMK-RNTGKPEGLCGIN 342
D CGT L+HGV AVGY +YIIVKNSWG WG+KGY+++ R+ G EG+CGIN
Sbjct: 255 DSDMCGTSLNHGVLAVGYAK----EYIIVKNSWGASWGDKGYVKIAHRDQG--EGICGIN 308
Query: 343 KMASYP 348
ASYP
Sbjct: 309 MAASYP 314
>gi|391343119|ref|XP_003745860.1| PREDICTED: cathepsin L-like [Metaseiulus occidentalis]
Length = 385
Score = 244 bits (624), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 142/322 (44%), Positives = 198/322 (61%), Gaps = 20/322 (6%)
Query: 38 LTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIK-NYWLGLNEF 96
L+ N L +E++ ++ K YES E+L R IF++N + I++ N K + +++LG+N F
Sbjct: 71 LSPNPNLNQHWENFKAEHNKKYESFPEELMRRLIFEENHQFIEDHNSKKEFDFYLGMNHF 130
Query: 97 ADLRHEEFKEMFLGLKPDLARRKDQSHEDFSY-----KDVVDLPKSVDWRKKGAVTHVKN 151
DL ++E++E +LG RR + + SY + + D+P +DWR +G VT VKN
Sbjct: 131 GDLTNKEYRERYLGY-----RRPENTPSKASYIFSRAEKIEDVPDQIDWRDQGFVTPVKN 185
Query: 152 QGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIV 210
QG CGSCWAFS V ++EG + TG L SLSEQ L+DC N+GCNGG MD AF+Y+
Sbjct: 186 QGQCGSCWAFSAVGSLEGQHFKSTGKLVSLSEQNLVDCSTPEGNSGCNGGWMDQAFEYVK 245
Query: 211 STGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALA-NQPLSVAI 269
G+ E+ YPY+ +G+C K +S T+ G+ DV + E++L +A+ P+SVAI
Sbjct: 246 DNHGIDTEDSYPYVGTDGSCHF-KNKSIGATLKGFMDVKEGDEEALRQAVGVAGPVSVAI 304
Query: 270 EASGRDFQFYSGGVYD-GHCGT-QLDHGVAAVGYGST-RGLDYIIVKNSWGPKWGEKGYI 326
+AS FQFY GGVY+ C T +LDHGV VGYG +G D+ +VKNSWG WG GYI
Sbjct: 305 DASSMLFQFYRGGVYNVPWCSTSELDHGVLVVGYGKQFQGKDFWMVKNSWGVGWGIYGYI 364
Query: 327 RMKRNTGKPEGLCGINKMASYP 348
M RN G CGI AS P
Sbjct: 365 EMSRNKGNQ---CGIASKASIP 383
>gi|16304178|gb|AAL16954.1|AF426414_1 cathepsin L-like cysteine protease precursor [Delia radicum]
Length = 337
Score = 244 bits (624), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 149/317 (47%), Positives = 192/317 (60%), Gaps = 23/317 (7%)
Query: 49 ESWMS-KFEKVYESLDEKLERF--EIFKDNLRHIDETNRKIK----NYWLGLNEFADLRH 101
E W + K E L E ERF +IF +N I + N+ ++ LGLN+++D+ +
Sbjct: 25 EEWQTFKMEHRKNFLSEVEERFRMKIFNENRHKIAKHNQLYAQGKVSFKLGLNKYSDMLY 84
Query: 102 EEFKEMFLGLKPDLARRKDQSHEDFS-----YKDVVDLPKSVDWRKKGAVTHVKNQGSCG 156
EFKE G + RK + FS V +PKSVDWR+ GAVT VK+QG CG
Sbjct: 85 HEFKETMNGYNHTM--RKVLRAQGFSGIIYIPPANVQIPKSVDWRQHGAVTAVKDQGHCG 142
Query: 157 SCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIVSTGGL 215
SCWAFS+ AA+EG + G L SLSEQ L+DC Y NNGCNGGLMD AF+YI GG+
Sbjct: 143 SCWAFSSTAALEGQHFRKAGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGGI 202
Query: 216 HKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQ-PLSVAIEASGR 274
E+ YPY + +C TK T G+ D+PQ E++L+KA+A P+SVAI+AS
Sbjct: 203 DTEKSYPYEGIDDSCHFTK-SGVGATDTGFVDIPQGDEEALMKAVATMGPVSVAIDASHE 261
Query: 275 DFQFYSGGVY-DGHCGTQ-LDHGVAAVGYGSTR-GLDYIIVKNSWGPKWGEKGYIRMKRN 331
FQ YS GVY + C Q LDHGV VGYG+ + GLDY +VKNSWG WG++GYI+M RN
Sbjct: 262 SFQLYSEGVYNEPECDAQNLDHGVLVVGYGTDKTGLDYWLVKNSWGTTWGDQGYIKMARN 321
Query: 332 TGKPEGLCGINKMASYP 348
+ CGI +SYP
Sbjct: 322 Q---DNQCGIATASSYP 335
>gi|33348834|gb|AAQ16117.1| cathepsin L-like cysteine proteinase A [Rhipicephalus
haemaphysaloides haemaphysaloides]
Length = 332
Score = 244 bits (624), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 145/318 (45%), Positives = 188/318 (59%), Gaps = 13/318 (4%)
Query: 40 SNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRK----IKNYWLGLNE 95
S++ L +E++ + +K YES E+L RF+IF +N I + N K + +Y LG+N+
Sbjct: 19 SHEILRTQWEAFKTTHKKSYESHMEELLRFKIFTENSLIIAKHNAKYAKGLVSYKLGMNQ 78
Query: 96 FADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSC 155
F DL EF ++F G + R + D LP +VDWRKKGAVT VK+QG C
Sbjct: 79 FGDLLAHEFAKIFNGYRGQRTSRGSTFMPPANVNDS-SLPSTVDWRKKGAVTPVKDQGQC 137
Query: 156 GSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIVSTGG 214
GSCWAFS ++EG + + G L SLSEQ L+DC ++ NNGC GGLMD AF+YI + G
Sbjct: 138 GSCWAFSATGSLEGQHFLKDGELVSLSEQNLVDCSQSFGNNGCEGGLMDNAFKYIKANDG 197
Query: 215 LHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVAIEASG 273
+ EE YPY + C K E T G+ D+ SED L KA+A P+SVAI+A
Sbjct: 198 IDAEESYPYEAMDDKCRFKK-EDVGATDTGFVDIEGGSEDDLKKAVATVGPISVAIDAGH 256
Query: 274 RDFQFYSGGVYD-GHCGT-QLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRN 331
FQ YS GVYD C + +LDHGV AVGYG G Y +VKNSWG WG+ GYI M R+
Sbjct: 257 SSFQLYSEGVYDEPECSSEELDHGVLAVGYGVKDGKKYWLVKNSWGGSWGDNGYILMSRD 316
Query: 332 TGKPEGLCGINKMASYPI 349
CGI ASYP+
Sbjct: 317 KNNQ---CGIASAASYPL 331
>gi|6630974|gb|AAF19631.1|AF194427_1 cysteine proteinase precursor [Myxine glutinosa]
Length = 324
Score = 244 bits (623), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 138/311 (44%), Positives = 188/311 (60%), Gaps = 15/311 (4%)
Query: 48 FESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETN----RKIKNYWLGLNEFADLRHEE 103
+ESW K+ K Y E++ R +++ NL+ + + N + NY LG+N +ADL +EE
Sbjct: 19 WESWKGKYGKSYLGRGEEVLRKRVWESNLQIVQQHNVLADQGQANYRLGMNTYADLYNEE 78
Query: 104 FKEMFLGLKPDLARRKDQSH-EDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFS 162
F M L L + KD+S + F V LP SVDWR +G VT VK+QG CGSCW FS
Sbjct: 79 F--MALKGSGGLLQAKDKSSTQTFKPLVGVTLPSSVDWRNQGYVTPVKDQGQCGSCWTFS 136
Query: 163 TVAAVEGINQIVTGNLASLSEQELIDCDNTYNN-GCNGGLMDYAFQYIVSTGGLHKEEDY 221
++EG + TGNL SLSEQ+L+DC Y N GCNGGLM+ A+ YI GG+ E Y
Sbjct: 137 ATGSLEGQHFAKTGNLLSLSEQQLVDCAGRYGNYGCNGGLMESAYDYIKGVGGVELESAY 196
Query: 222 PYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVAIEASGRDFQFYS 280
PY +G C+ + + V T GY +P E +L++A+ P++V+I+ASG FQ Y
Sbjct: 197 PYTARDGRCKFDRSKV-VATCKGYVVIPVGDEQALMQAVGTIGPVAVSIDASGYSFQLYE 255
Query: 281 GGVYD-GHC-GTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGL 338
GVYD C T LDHGV AVGYG+ G +Y +VKNSWGP WG++GYI+M ++
Sbjct: 256 SGVYDFRRCSSTNLDHGVLAVGYGTEGGQNYWLVKNSWGPGWGDQGYIKMSKDKNNQ--- 312
Query: 339 CGINKMASYPI 349
CGI + YP+
Sbjct: 313 CGIATDSCYPL 323
>gi|326531188|dbj|BAK04945.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 360
Score = 244 bits (623), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 142/332 (42%), Positives = 187/332 (56%), Gaps = 39/332 (11%)
Query: 44 LIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIK-NYWLGLNEFADLRHE 102
++D F W + + Y S +E+L RF++++DN+ +I+ TNR+ Y LG N+FADL E
Sbjct: 38 MMDRFLMWQATHNQSYRSAEERLRRFQVYRDNVEYIETTNRRGDLTYQLGENQFADLTRE 97
Query: 103 EFKEMFL----------------------GLKPDLARRKDQSHEDFSYKDVVDL-PKSVD 139
EF F G PDL S D V L P SVD
Sbjct: 98 EFIARFTSYNGDDDRTGDDDSVITTAAVGGGDPDLWS---------SGGDDVSLDPPSVD 148
Query: 140 WRKKGAVTHVKNQGSCGSC-WAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCN 198
WR KGAV K+Q S S WAF VA +E ++ I TG L +LSEQ+L+DCD Y+ GCN
Sbjct: 149 WRAKGAVVPPKSQSSSCSSSWAFVAVATIESLHAIKTGKLVALSEQQLVDCDQ-YDGGCN 207
Query: 199 GGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLK 258
G AF +++ GGL E +YPY +GTC K + V I+G+ VP ++E ++
Sbjct: 208 RGTFRRAFHWVIQNGGLTTEAEYPYTAAQGTCNSAKSDHHVAAISGHASVPGSNELAMKH 267
Query: 259 ALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGS--TRGLDYIIVKNSW 316
A+A QP++ AIE G D QFY GVY G CG +L+H V VGYG+ + G Y IVKNSW
Sbjct: 268 AVATQPVAAAIEL-GSDMQFYKSGVYSGPCGARLEHAVTVVGYGADESTGDKYWIVKNSW 326
Query: 317 GPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 348
G WGE+GYIRM+R P GLCGI +YP
Sbjct: 327 GQTWGERGYIRMQRKILGP-GLCGIMLDVAYP 357
>gi|66378053|gb|AAY45871.1| cathepsin L-like cysteine proteinase [Longidorus elongatus]
Length = 358
Score = 244 bits (623), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 147/331 (44%), Positives = 197/331 (59%), Gaps = 19/331 (5%)
Query: 37 DLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIK----NYWLG 92
+L N ++ ++ K K Y++ DE+L RF++F N + I++ N + + ++ L
Sbjct: 32 NLLINHPYYPVWTNFKLKHAKSYKTKDEELLRFQVFASNHKVIEQHNIEYEAGQHSFALS 91
Query: 93 LNEFADLRHEEFKEMFLGLKPDLARRKDQSH---ED---FSYKDVVDLPKSVDWRKKGAV 146
LN+FAD+ + EF++ G K R+ +S ED F D V +P SVDWRK+G V
Sbjct: 92 LNKFADMTNAEFRQRMNGFKLPAKRKLAKSQPLKEDGMIFEMPDNVTIPDSVDWRKEGYV 151
Query: 147 THVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCD-NTYNNGCNGGLMDYA 205
T VK+QGSCGSCWAFS ++EG + TG L SLSEQ L+DCD N + GCNGG MD A
Sbjct: 152 TKVKDQGSCGSCWAFSATGSLEGQHYKQTGKLVSLSEQNLVDCDVNGDDEGCNGGYMDGA 211
Query: 206 FQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QP 264
FQY+ + G+ E YPY +G C K E T G+ D+P+ +E L A+A P
Sbjct: 212 FQYVETNKGIDTEASYPYKGRDGRCRF-KSEDVGATDTGFVDIPEGNETLLEAAIATVGP 270
Query: 265 LSVAIEASGRDFQFYSGGV-YDGHCGTQ-LDHGVAAVGYGSTR-GLDYIIVKNSWGPKWG 321
+SVAI+A+ FQFYS GV YD C + LDHGV AVGY ST+ G Y IVKNSW WG
Sbjct: 271 VSVAIDAASFKFQFYSHGVYYDRSCSPEYLDHGVLAVGYNSTKDGKQYYIVKNSWSEDWG 330
Query: 322 EKGYIRMKRNTGKPEGLCGINKMASYPIKKK 352
+ GYI M R + CGI MASYP ++
Sbjct: 331 DDGYILMSR---RKNNNCGIATMASYPFVQQ 358
>gi|530734|emb|CAA56914.1| cathepsin l [Nephrops norvegicus]
gi|1582620|prf||2119193A cathepsin L-related Cys protease
Length = 324
Score = 244 bits (623), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 136/310 (43%), Positives = 182/310 (58%), Gaps = 16/310 (5%)
Query: 48 FESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKN----YWLGLNEFADLRHEE 103
+E + KF + Y L+E+ R +F DNL++I+E N+K ++ Y L +N+F+DL ++E
Sbjct: 20 WEEFKGKFGRKYVDLEEERYRLNVFLDNLQYIEEFNKKYESGEVTYNLAINQFSDLTNDE 79
Query: 104 FKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFST 163
F M G K L + + F+ D VDWR KG VTHVK+QG CGSCWAFS
Sbjct: 80 FNSMMKGYKTSL---RPKPVAVFTSTDAAPETTEVDWRTKGCVTHVKDQGQCGSCWAFSA 136
Query: 164 VAAVEGINQIVTGNLASLSEQELIDCDNT--YNNGCNGGLMDYAFQYIVSTGGLHKEEDY 221
++EG + + G L SL+EQ+L+DC YN GCNGG ++ AF+YI + GG+ E Y
Sbjct: 137 TGSLEGQHFLKYGELVSLAEQQLVDCAGGIYYNQGCNGGWVNQAFKYIKANGGIDTESSY 196
Query: 222 PYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQ-PLSVAIEASGRDFQFYS 280
PY + TC S T +G+ + Q SE ++ N P+SVAI+A+ R FQ YS
Sbjct: 197 PYEARDNTCRFNS-NSVAATCSGFVSIAQGSESPEVRRTTNTGPISVAIDAAHRSFQSYS 255
Query: 281 GGV-YDGHC-GTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGL 338
GV Y+ C +QLDH V AVGYGS G D+ +VKNSWG WG GYI M RN
Sbjct: 256 SGVYYEPSCSSSQLDHAVLAVGYGSEGGQDFWLVKNSWGTSWGSAGYINMARNRNNN--- 312
Query: 339 CGINKMASYP 348
CGI ASYP
Sbjct: 313 CGIATDASYP 322
>gi|169659203|dbj|BAG12786.1| putative cysteine protease [Sorogena stoianovitchae]
Length = 293
Score = 244 bits (623), Expect = 5e-62, Method: Compositional matrix adjust.
Identities = 142/296 (47%), Positives = 190/296 (64%), Gaps = 16/296 (5%)
Query: 54 KFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKP 113
++ K Y ++K R +F +++R ++ N K +Y LGLN+FADL EEF ++LGL
Sbjct: 12 EYNKTYGGAEDK-HRLALFAESVRIVETENAKGHSYTLGLNQFADLTTEEFSSLYLGL-- 68
Query: 114 DLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQI 173
+ K Q+ E +D D ++VDWR+KGAVT VK+Q SCGSCWAFS A+EG
Sbjct: 69 -VLENKVQASESVVLQDG-DSEENVDWRQKGAVTPVKDQKSCGSCWAFSATGAMEGALVK 126
Query: 174 VTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMT 233
TG L +LSEQ+L+DC T NGCNGGLM AF Y++ G E+DYPY +G C+ T
Sbjct: 127 STGKLINLSEQQLVDCV-TKCNGCNGGLMTAAFDYVLGR-GRATEKDYPYKGVDGRCKQT 184
Query: 234 KGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLD 293
+++ I GY++VPQN+ +L A+A+ PLSVA+ A+G Q Y GV D +CGT+LD
Sbjct: 185 ATDNK---IKGYNNVPQNNYKALKAAVAS-PLSVAVNAAG-TIQRYKSGVIDANCGTRLD 239
Query: 294 HGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNT-GKPEGLCGINKMASYP 348
HGV AVGY +G DY IVKNSWG +GE GY R+K T G+CGIN MA+ P
Sbjct: 240 HGVLAVGY---QGEDYWIVKNSWGNGYGENGYFRVKMGTQNGGAGVCGINMMAAQP 292
>gi|226503205|ref|NP_001150062.1| thiol protease SEN102 precursor [Zea mays]
gi|195636390|gb|ACG37663.1| thiol protease SEN102 precursor [Zea mays]
Length = 349
Score = 244 bits (622), Expect = 5e-62, Method: Compositional matrix adjust.
Identities = 132/321 (41%), Positives = 184/321 (57%), Gaps = 20/321 (6%)
Query: 44 LIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEE 103
L+D F++W +++ + Y + +E +RF ++ +N++ I+ N+ +Y LG N FADL EE
Sbjct: 33 LLDRFQAWQAEYNRTYATPEEFQQRFMVYSENVKFIETMNQPGSSYELGENRFADLTEEE 92
Query: 104 FKEMFL-------------GLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVK 150
FK+ +L L D R S + + P SVDWR KGAVT VK
Sbjct: 93 FKDTYLMKLDNVASSPEAMALTVDTMNRAGTS----GGSNTNEAPNSVDWRTKGAVTPVK 148
Query: 151 NQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLM-DYAFQYI 209
+Q CGSCWAF+ VA++EG+++I TG L SLSEQE++DCD NN G A +++
Sbjct: 149 SQQHCGSCWAFAAVASIEGVHKIKTGLLVSLSEQEIVDCDRGGNNHGCHGGHSSSAMEWV 208
Query: 210 VSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAI 269
GGL E DYPY+ +G C K I G V +E +L A+A +P++V+I
Sbjct: 209 TRNGGLTTESDYPYVGRQGQCMSDKLGHHAAKIRGRQAVQGKNEGALQHAVAGRPVAVSI 268
Query: 270 EASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGS-TRGLDYIIVKNSWGPKWGEKGYIRM 328
AS R FQFY G++ G C T +H V VGYG+ G Y IVKNSWG +WGEKGY+RM
Sbjct: 269 NAS-RAFQFYKRGIFSGPCNTTRNHAVTVVGYGANASGHKYWIVKNSWGERWGEKGYVRM 327
Query: 329 KRNTGKPEGLCGINKMASYPI 349
+R EG+CGI Y +
Sbjct: 328 QRGVRAREGVCGIAIAPFYAV 348
>gi|390337642|ref|XP_780653.3| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
Length = 333
Score = 244 bits (622), Expect = 5e-62, Method: Compositional matrix adjust.
Identities = 142/310 (45%), Positives = 181/310 (58%), Gaps = 13/310 (4%)
Query: 48 FESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIK----NYWLGLNEFADLRHEE 103
+ W ++ K Y S +E+ R I++ NL + + N K Y LG+N+F DL++EE
Sbjct: 28 WNEWKNEHGKRYLSDEEEASRRLIWQKNLDIVIKHNLKYDLGHFTYDLGINQFTDLQNEE 87
Query: 104 FKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFST 163
F M G + + + +V +LPK+VDWR KG VT VK+QG CGSCWAFST
Sbjct: 88 FVAMMTGFRVSGTSKAAKGSTFLPPNNVGELPKTVDWRTKGYVTPVKDQGQCGSCWAFST 147
Query: 164 VAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPY 223
+VEG + TG L SLSEQ L+DC + GC+GG MD AFQYI+ GG+ E YPY
Sbjct: 148 TGSVEGQHFKATGKLVSLSEQNLVDCSGR-DAGCDGGFMDRAFQYIIDAGGIDTEASYPY 206
Query: 224 IMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVAIEASGRDFQFYSGG 282
+G C K T+ GY DV SE +L KA+A+ P+SVAI+AS FQ Y G
Sbjct: 207 KAVDGKCHFKKANVG-ATVTGYTDVTSGSEKALQKAVAHVGPISVAIDASHMSFQHYKSG 265
Query: 283 VYD--GHCGTQLDHGVAAVGYG-STRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLC 339
VY+ G T LDHGV AVGYG S+ G DY IVKNSW WG GY+ M RN + C
Sbjct: 266 VYNEPGCDSTVLDHGVLAVGYGTSSDGTDYWIVKNSWAETWGMNGYVWMSRN---KDNQC 322
Query: 340 GINKMASYPI 349
GI ASYP+
Sbjct: 323 GIATNASYPL 332
>gi|261289783|ref|XP_002611753.1| hypothetical protein BRAFLDRAFT_236364 [Branchiostoma floridae]
gi|229297125|gb|EEN67763.1| hypothetical protein BRAFLDRAFT_236364 [Branchiostoma floridae]
Length = 307
Score = 244 bits (622), Expect = 5e-62, Method: Compositional matrix adjust.
Identities = 139/297 (46%), Positives = 181/297 (60%), Gaps = 16/297 (5%)
Query: 64 EKLERFEIFKDNLR----HIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRK 119
E+ R EIF++N + H +E + + YWLG N+FA + ++EF +G L R
Sbjct: 15 EESRRMEIFENNTKLINLHNNEADLGMHTYWLGHNQFAHMTNDEFVANVIG-GCLLDRNA 73
Query: 120 DQSHEDFSYK---DVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTG 176
+S D ++ ++V+LP +VDWR KG VT VKNQ CGSCWAFST ++EG TG
Sbjct: 74 SKSTADRVHQYDSNLVELPDTVDWRTKGYVTPVKNQEQCGSCWAFSTTGSLEGQTFKKTG 133
Query: 177 NLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKG 235
L SLSEQ L+DC + N GCNGGLMD AF+YI + GG+ E+ YPY +G C K
Sbjct: 134 KLVSLSEQNLVDCSGEFGNQGCNGGLMDDAFKYIKANGGIDTEDSYPYEARDGKCRF-KP 192
Query: 236 ESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVAIEASGRDFQFYSGGV-YDGHC-GTQL 292
T+ GY D+ + E +L +A+A P+SVAI+AS FQ YS GV Y+ C T+L
Sbjct: 193 ADVGATVTGYTDISEGDEGALTQAVATVGPISVAIDASHHTFQMYSHGVYYEPQCSSTEL 252
Query: 293 DHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPI 349
DHGV AVGYG+ G DY +VKNSWG WG+ GYI M RN CGI ASYP+
Sbjct: 253 DHGVLAVGYGTEGGKDYWLVKNSWGEVWGQNGYIMMSRNKNNQ---CGIATSASYPL 306
>gi|2804262|dbj|BAA24442.1| cysteine proteinase [Sitophilus zeamais]
Length = 338
Score = 244 bits (622), Expect = 6e-62, Method: Compositional matrix adjust.
Identities = 142/317 (44%), Positives = 189/317 (59%), Gaps = 20/317 (6%)
Query: 49 ESWMS---KFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKN----YWLGLNEFADLRH 101
E W S + K Y+S E+ R +IF +N + + N+ + LGLN++AD+ H
Sbjct: 25 EQWSSFKMQHSKNYDSETEERFRMKIFMENAHKVAKHNKLFSQGFVKFKLGLNKYADMLH 84
Query: 102 EEFKEMFLGL---KPDLARRKDQSHE-DFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGS 157
EF G K ++ + D + F V LP +VDWR KGAVT VK+QG CGS
Sbjct: 85 HEFVSTLNGFNKTKNNILKGSDLNDAVRFISPANVKLPDTVDWRDKGAVTEVKDQGHCGS 144
Query: 158 CWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIVSTGGLH 216
CW+FS ++EG + TG L SLSEQ L+DC Y NNGCNGGLMD AF+YI GG+
Sbjct: 145 CWSFSATGSLEGQHFRKTGKLVSLSEQNLVDCSGRYGNNGCNGGLMDNAFRYIKDNGGID 204
Query: 217 KEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVAIEASGRD 275
E+ YPY+ E+ C K ++ T G+ D+ + +ED L A+A P+S+AI+AS
Sbjct: 205 TEKSYPYLAEDEKCHY-KAQNSGATDKGFVDIEEANEDDLKAAVATVGPVSIAIDASHET 263
Query: 276 FQFYSGGVY-DGHCGTQ-LDHGVAAVGYGST-RGLDYIIVKNSWGPKWGEKGYIRMKRNT 332
FQ YS GVY D C +Q LDHGV VGYG++ G DY +VKNSWGP WG GYI+M RN
Sbjct: 264 FQLYSDGVYSDPECSSQELDHGVLVVGYGTSDDGQDYWLVKNSWGPSWGLNGYIKMARNQ 323
Query: 333 GKPEGLCGINKMASYPI 349
+ +CG+ ASYP+
Sbjct: 324 ---DNMCGVASQASYPL 337
>gi|351721011|ref|NP_001238219.1| P34 probable thiol protease precursor [Glycine max]
gi|1199563|gb|AAB09252.1| 34 kDa maturing seed vacuolar thiol protease precursor [Glycine
max]
Length = 379
Score = 243 bits (621), Expect = 8e-62, Method: Compositional matrix adjust.
Identities = 138/342 (40%), Positives = 202/342 (59%), Gaps = 25/342 (7%)
Query: 29 SIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKN 88
SI+ T+ ++ LF+ W S+ +VY + +E+ +R EIFK+N +I + N K+
Sbjct: 25 SILDLDLTKFTTQKQVSSLFQLWKSEHGRVYHNHEEEAKRLEIFKNNSNYIRDMNANRKS 84
Query: 89 ---YWLGLNEFADLRHEEFKEMFLGLKPDLARR-----KDQSHEDFSYKDVVDLPKSVDW 140
+ LGLN+FAD+ +EF + +L D++++ K E +S P S DW
Sbjct: 85 PHSHRLGLNKFADITPQEFSKKYLQAPKDVSQQIKMANKKMKKEQYS---CDHPPASWDW 141
Query: 141 RKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGG 200
RKKG +T VK QG CG WAFS A+E + I TG+L SLSEQEL+DC + G G
Sbjct: 142 RKKGVITQVKYQGGCGRGWAFSATGAIEAAHAIATGDLVSLSEQELVDCVEE-SEGSYNG 200
Query: 201 LMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDV-------PQNSE 253
+F++++ GG+ ++DYPY +EG C+ K + +V TI+GY + +E
Sbjct: 201 WQYQSFEWVLEHGGIATDDDYPYRAKEGRCKANKIQDKV-TIDGYETLIMSDESTESETE 259
Query: 254 DSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQ---LDHGVAAVGYGSTRGLDYI 310
+ L A+ QP+SV+I+A +DF Y+GG+YDG T ++H V VGYGS G+DY
Sbjct: 260 QAFLSAILEQPISVSIDA--KDFHLYTGGIYDGENCTSPYGINHFVLLVGYGSADGVDYW 317
Query: 311 IVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIKKK 352
I KNSWG WGE GYI ++RNTG G+CG+N ASYP K++
Sbjct: 318 IAKNSWGEDWGEDGYIWIQRNTGNLLGVCGMNYFASYPTKEE 359
>gi|357167707|ref|XP_003581294.1| PREDICTED: actinidain-like [Brachypodium distachyon]
Length = 358
Score = 243 bits (621), Expect = 8e-62, Method: Compositional matrix adjust.
Identities = 131/318 (41%), Positives = 183/318 (57%), Gaps = 15/318 (4%)
Query: 44 LIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRK-IKNYWLGLNEFADLRHE 102
+ E WM++F + Y EK R E+F N RH+D NR + Y LGLN+F+DL
Sbjct: 38 MASRHERWMARFGRSYTDAGEKARRQEVFGANARHVDAVNRAGNRTYTLGLNQFSDLTDH 97
Query: 103 EFKEMFLGLKPDLARRKDQSHEDFSYKDVV------DLPKSVDWRKKGAVTHVKNQGSCG 156
EF + LG +R E+ D+P SVDWR KGAVT +KNQ SCG
Sbjct: 98 EFLQQHLGYGRHHGQRGLLLPEEEVMPKATALGYGQDMPYSVDWRAKGAVTEIKNQRSCG 157
Query: 157 SCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLH 216
SCWAF+ VAA EG+ +I TGNL S+SEQ+++DC ++ C+ G + A +Y+V++GGL
Sbjct: 158 SCWAFAAVAATEGLVKIATGNLISMSEQQVLDCTGDRSS-CDSGYISDALRYVVTSGGLQ 216
Query: 217 KEEDYPYIMEEGTCEMTK--GESEVVTINGYHDVPQNSEDSLLKAL-ANQPLSVAIEASG 273
+E Y Y ++G C + + ++ G H N ++ L+ L A QP++V +EAS
Sbjct: 217 REAAYAYTGQKGACGSRRFARPNSAASVGGVHMATLNGDEGALQGLAARQPVAVIVEASE 276
Query: 274 RDFQFYSGGVYDG--HCGTQLDHGVAAVGYGSTRGL-DYIIVKNSWGPKWGEKGYIRMKR 330
DF+ YS GVY G CG +L+H + VGYG+ G +Y +VKN WG WGE GY+R+ R
Sbjct: 277 PDFRHYSSGVYAGSASCGRELNHALTVVGYGTENGAGEYWLVKNQWGTWWGENGYMRVAR 336
Query: 331 NTGKPEGLCGINKMASYP 348
G CGI +A YP
Sbjct: 337 RNGAGAN-CGIASVAFYP 353
>gi|326494040|dbj|BAJ85482.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 355
Score = 243 bits (621), Expect = 8e-62, Method: Compositional matrix adjust.
Identities = 136/314 (43%), Positives = 176/314 (56%), Gaps = 17/314 (5%)
Query: 49 ESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRK-IKNYWLGLNEFADLRHEEFKEM 107
E WM+K+ +VY EKL R E+F N RHID NR + Y LGLN F+DL +EEF +
Sbjct: 42 ERWMAKYGRVYADAAEKLRRQEVFAANARHIDAVNRAGNRTYTLGLNHFSDLTNEEFAQT 101
Query: 108 FLGLK----PDLARRKDQSHE---DFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWA 160
LG + P R +D S + + + P SVDWR +GAVT VK+QG CGSCWA
Sbjct: 102 HLGYRHQPGPGGLRPEDSSPAAAVNVTDAQLQSTPDSVDWRARGAVTPVKHQGHCGSCWA 161
Query: 161 FSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEED 220
F+ VAA EG+ QI TGNL S+SEQ+++DC ++ C G ++ A YI ++GGL E
Sbjct: 162 FAAVAATEGLVQIATGNLISMSEQQVLDCTGGTSS-CKSGYVNAALTYITASGGLQTEAA 220
Query: 221 YPYIMEEGTCEMTKGESEVVTINGYHDVPQ-NSEDSLLKAL-ANQPLSVAIEASGRDFQF 278
Y Y E+G C G H N ++ L+ L A QP++VA+EA DF
Sbjct: 221 YAYSAEQGACRSGGASPNSAAAVGVHRSAMLNGDEGALQVLVAGQPVAVAVEAE-PDFHH 279
Query: 279 YSGGVYDG--HCGTQLDHGVAAVGYGS-TRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKP 335
Y GVY G CG +L H V VGYG+ G Y +VKN WG WGE GY+R+ R G
Sbjct: 280 YKSGVYVGSPSCGQKLHHAVTVVGYGADGDGQGYWVVKNQWGAGWGEVGYMRLTRGNGGN 339
Query: 336 EGLCGINKMASYPI 349
CG+ A YP
Sbjct: 340 N--CGMATHAYYPT 351
>gi|46576373|sp|P83654.1|ERVC_TABDI RecName: Full=Ervatamin-C; Short=ERV-C
gi|46014979|pdb|1O0E|A Chain A, 1.9 Angstrom Crystal Structure Of A Plant Cysteine
Protease Ervatamin C
gi|46014980|pdb|1O0E|B Chain B, 1.9 Angstrom Crystal Structure Of A Plant Cysteine
Protease Ervatamin C
Length = 208
Score = 243 bits (620), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 124/217 (57%), Positives = 157/217 (72%), Gaps = 10/217 (4%)
Query: 134 LPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTY 193
LP+ +DWRKKGAVT VKNQGSCGSCWAFSTV+ VE INQI TGNL SLSEQEL+DCD
Sbjct: 1 LPEQIDWRKKGAVTPVKNQGSCGSCWAFSTVSTVESINQIRTGNLISLSEQELVDCDKK- 59
Query: 194 NNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSE 253
N+GC GG +A+QYI++ GG+ + +YPY +G C+ S+VV+I+GY+ VP +E
Sbjct: 60 NHGCLGGAFVFAYQYIINNGGIDTQANYPYKAVQGPCQAA---SKVVSIDGYNGVPFCNE 116
Query: 254 DSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVK 313
+L +A+A QP +VAI+AS FQ YS G++ G CGT+L+HGV VGY + +Y IV+
Sbjct: 117 XALKQAVAVQPSTVAIDASSAQFQQYSSGIFSGPCGTKLNHGVTIVGYQA----NYWIVR 172
Query: 314 NSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
NSWG WGEKGYIRM R G GLCGI ++ YP K
Sbjct: 173 NSWGRYWGEKGYIRMLRVGGC--GLCGIARLPYYPTK 207
>gi|308321226|gb|ADO27765.1| cathepsin S [Ictalurus furcatus]
Length = 329
Score = 243 bits (620), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 135/308 (43%), Positives = 180/308 (58%), Gaps = 17/308 (5%)
Query: 51 WMSKFEKVYESLDEKLERFEIFKDNLR----HIDETNRKIKNYWLGLNEFADLRHEEFKE 106
W K Y S E+L R EI++ NLR H E + + Y LG+N D+ EE +
Sbjct: 29 WKKNHSKTYTSELEELGRREIWERNLRLITVHNLEASLGMHTYDLGMNHMGDMTREEILQ 88
Query: 107 MFLG--LKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTV 164
MF G ++P+L RR F + +P SVDWR+KG VT VKNQGSCGSCWAFS
Sbjct: 89 MFAGTRVRPNLTRRS----SPFVASAGISVPDSVDWREKGYVTEVKNQGSCGSCWAFSAA 144
Query: 165 AAVEGINQIVTGNLASLSEQELIDCDNTYNN-GCNGGLMDYAFQYIVSTGGLHKEEDYPY 223
A+EG + TG + SLS Q L+DC + Y N GCNGG M AFQY++ GG+ +E YPY
Sbjct: 145 GALEGQLKRTTGQVKSLSPQNLVDCSSKYGNKGCNGGFMTQAFQYVIDDGGIDSDEAYPY 204
Query: 224 IMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVAIEASGRDFQFYSGG 282
+G C + + + Y+ V + E++L +A+A P+SVAI+A+ F Y G
Sbjct: 205 TAMDGQCRYDQSQ-RAANCSSYNYVSEGDEEALKQAVATIGPISVAIDATRPMFILYHSG 263
Query: 283 VY-DGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGI 341
VY D C ++HGV VGYGS G DY +VKNSWG ++G+ GYIR+ RN G +CGI
Sbjct: 264 VYSDPTCTQNVNHGVLVVGYGSLNGEDYWLVKNSWGTRFGDGGYIRIARNKGN---MCGI 320
Query: 342 NKMASYPI 349
A YP+
Sbjct: 321 ANYACYPL 328
>gi|255563136|ref|XP_002522572.1| cysteine protease, putative [Ricinus communis]
gi|223538263|gb|EEF39872.1| cysteine protease, putative [Ricinus communis]
Length = 340
Score = 243 bits (619), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 145/358 (40%), Positives = 202/358 (56%), Gaps = 26/358 (7%)
Query: 1 MALSSQFKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYE 60
MAL Q K ++ + ++ + P L D + + E WM++ + Y+
Sbjct: 1 MALPLQTKLAIVLMILVTWVSQAM----------PRPLIDEDAVAEKHEQWMARHGRTYQ 50
Query: 61 SLDEKLERFEIFKDNLRHIDETNRKI-KNYWLGLNEFADLRHEEFKEMFLGLK-PDLARR 118
+EK RF IFK NL+HI+ N + Y LGLN FADL EEF + G K P +
Sbjct: 51 DDEEKERRFHIFKKNLKHIENFNNAFNRTYKLGLNHFADLTDEEFLATYTGYKMPKVLPT 110
Query: 119 KDQSHEDFSYKDVV---DLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVT 175
+ + + DV+ ++P+S+DWR +G VT VKNQG CG CWAFS AAVEGI
Sbjct: 111 ANITTKTTQSSDVLYEANVPESIDWRTRGVVTPVKNQGRCGCCWAFSAAAAVEGI----I 166
Query: 176 GNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKG 235
GN SLS Q+L+DC +NGCNGG MD AF+YI+ GL YPY + EM +
Sbjct: 167 GNGVSLSAQQLLDCVPD-SNGCNGGFMDNAFRYIIQNQGLASATYYPYQLMR---EMCRP 222
Query: 236 ESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGR-DFQFYSGGVYDGH-CGTQLD 293
+ I+GY DV E++L A+A QP+S A++A+ +F++Y GG++ CG+ L
Sbjct: 223 SNNAARISGYVDVTPADEETLKSAVARQPVSAAVDATSELNFKYYGGGIFPPQDCGSTLT 282
Query: 294 HGVAAVGYG-STRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
H + VGYG S G Y ++KNSWG WGE GY+R++R+ G G CGI ASYP +
Sbjct: 283 HAITIVGYGTSAEGTKYWLIKNSWGEGWGEGGYMRLQRDVGSYGGACGIALRASYPTR 340
>gi|126681066|gb|ABO26562.1| cathepsin L-like cysteine protease [Ixodes ricinus]
Length = 335
Score = 243 bits (619), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 143/315 (45%), Positives = 190/315 (60%), Gaps = 20/315 (6%)
Query: 48 FESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKN----YWLGLNEFADLRHEE 103
+ ++ +K K Y S E++ R +I+ +N I + N K Y + +NEF D+ H E
Sbjct: 27 WSAFKAKHGKSYVSETEEVFRLKIYMENRHKIAKHNEKYARGEVPYSMAMNEFGDMLHHE 86
Query: 104 FKEMFLGLKPDLARRKDQSHEDFSY---KDVVD--LPKSVDWRKKGAVTHVKNQGSCGSC 158
F G K + KDQ E +Y +++ D LPK+VDWR KGAVT VKNQG CGSC
Sbjct: 87 FVSTRNGFKRNY---KDQPREGSTYLEPENIEDFSLPKTVDWRTKGAVTPVKNQGQCGSC 143
Query: 159 WAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIVSTGGLHK 217
WAFS ++EG + +G++ SLSEQ L+ C + NNGC GGLMD AF+YI + G+
Sbjct: 144 WAFSATGSLEGQHFRKSGSMVSLSEQNLVGCSTDFGNNGCEGGLMDDAFKYIRANKGIDT 203
Query: 218 EEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVAIEASGRDF 276
E+ YPY +GTC K + T +G+ D+ + SE L KA+A P+SVAI+AS F
Sbjct: 204 EKSYPYNGTDGTCHFKK-STVGATDSGFVDIKEGSETQLKKAVATVGPISVAIDASHESF 262
Query: 277 QFYSGGVYD-GHCGTQ-LDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGK 334
QFYS GVYD C ++ LDHGV VGYG+ G DY VKNSWG WG++GYIRM RN
Sbjct: 263 QFYSDGVYDEPECDSESLDHGVLVVGYGTLNGTDYWFVKNSWGTTWGDEGYIRMSRN--- 319
Query: 335 PEGLCGINKMASYPI 349
+ CGI AS P+
Sbjct: 320 KKNQCGIASSASIPL 334
>gi|402770507|gb|AFQ98388.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 243 bits (619), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 145/321 (45%), Positives = 190/321 (59%), Gaps = 17/321 (5%)
Query: 39 TSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRK----IKNYWLGLN 94
+S + L +E++ + +K Y+S E+L RF+IF +N I + N K + +Y LG+N
Sbjct: 18 SSQEILRTQWEAFKTTHKKTYQSHMEELLRFKIFTENSLIIAKHNAKYAKGLVSYKLGMN 77
Query: 95 EFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVD--LPKSVDWRKKGAVTHVKNQ 152
+F DL EF +F G RK +V D LPK+VDWRKKGAVT VK+Q
Sbjct: 78 QFGDLLAHEFARIFNGYH---GSRKSGGSTFLPPANVNDSSLPKAVDWRKKGAVTPVKDQ 134
Query: 153 GSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIVS 211
G CGSCWAFST ++EG + + G L SLSEQ L+DC ++ NNGC GGLM+ AF+YI +
Sbjct: 135 GQCGSCWAFSTTGSLEGQHFLKNGELVSLSEQNLVDCSQSFGNNGCEGGLMEDAFKYIKA 194
Query: 212 TGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVAIE 270
G+ E+ YPY +G C K E T GY ++ ED L KA+A P+SVAI+
Sbjct: 195 NDGIDTEKSYPYEAVDGECRFKK-EDVGATDTGYVEIKAGCEDDLKKAVATVGPISVAID 253
Query: 271 ASGRDFQFYSGGVYD-GHCGTQ-LDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRM 328
AS FQ YS GVYD C ++ LDHGV VGYG G Y +VKNSW WG++GYI M
Sbjct: 254 ASHSSFQLYSEGVYDEPECSSEDLDHGVLVVGYGVKGGKKYWLVKNSWAESWGDQGYILM 313
Query: 329 KRNTGKPEGLCGINKMASYPI 349
R+ CGI ASYP+
Sbjct: 314 SRDNNNQ---CGIASQASYPL 331
>gi|294883340|ref|XP_002770717.1| cathepsin L, putative [Perkinsus marinus ATCC 50983]
gi|239874002|gb|EER02722.1| cathepsin L, putative [Perkinsus marinus ATCC 50983]
Length = 333
Score = 243 bits (619), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 140/313 (44%), Positives = 183/313 (58%), Gaps = 12/313 (3%)
Query: 28 FSIVGYSPEDLTSNDKLIDL-FESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKI 86
F ++ P +++ ++L F + KF K YES +E+++R IF+ NL HI+ N K
Sbjct: 7 FVLLSILPLVKCLDEETVELAFMGFQHKFGKNYESKEEEVKRNAIFQANLHHIEHVNAKN 66
Query: 87 KNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAV 146
+Y LG+NE ADL HEEF + LG RR D+ + D LP SVDWR K +
Sbjct: 67 LSYKLGVNEHADLTHEEFAALKLGTLEMSTRRDDKFVVE---ADTTQLPTSVDWRNKSVL 123
Query: 147 THVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYA 205
+ VKNQGSCGSCWAFS A+E I TG L LS QEL+DC ++Y N GC GGLM A
Sbjct: 124 SPVKNQGSCGSCWAFSAAGALEAQYAIATGKLRPLSVQELVDCSSSYGNKGCLGGLMTNA 183
Query: 206 FQYIVSTGGLHKEEDYPYIMEEGTC----EMTKGESEVVTINGYHDVPQNSEDSLLKALA 261
++YI S GL +E YPY C E + G H + Q +E SL+KALA
Sbjct: 184 YKYIKSA-GLDQESTYPYKGWNKHCFRSSEKKADGIPAGEVTGSHMLAQ-TEQSLMKALA 241
Query: 262 NQPLSVAIEASGRDFQFYSGGVYDG-HCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKW 320
P+S+A+ A R+F+FY GVY C ++DHGV AVGYG+ +G DY I+KNSWG W
Sbjct: 242 AAPVSLAMYARDRNFRFYRSGVYSSTTCNGEIDHGVVAVGYGADKGSDYFILKNSWGSSW 301
Query: 321 GEKGYIRMKRNTG 333
G GY +KR G
Sbjct: 302 GIGGYFYLKRGVG 314
>gi|118425914|gb|ABK90856.1| cathepsin-L-like cysteine peptidase [Radix peregra]
Length = 324
Score = 243 bits (619), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 146/308 (47%), Positives = 186/308 (60%), Gaps = 22/308 (7%)
Query: 53 SKFEKVYESLDEKLERFEIFKDNLRHIDETN----RKIKNYWLGLNEFADLRHEEFKEMF 108
+K K Y S DE + R I++ NL+ I+ N + + Y+LG N++AD+ +EEF+
Sbjct: 27 AKHNKTY-SGDEDIIRRYIWQTNLQKIEAHNELYAKGLSTYFLGENKYADMTNEEFRRTL 85
Query: 109 LGLKPDLARRKDQSHEDF---SYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVA 165
GL+ D K+ + DF +KD LP +VDWRK+G VT VK+QG CGSCWAFST
Sbjct: 86 SGLRVD----KELTPGDFVSGMFKD--SLPTAVDWRKEGYVTEVKDQGQCGSCWAFSTTG 139
Query: 166 AVEGINQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIVSTGGLHKEEDYPYI 224
++EG + T L SLSE L+DC + N GCNGGLMD AF+YI G+ E+ YPY
Sbjct: 140 SLEGQHFKATKQLVSLSESNLVDCSKKWGNQGCNGGLMDNAFKYIADNKGIDTEKSYPYK 199
Query: 225 MEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVAIEASGRDFQFYSGGV 283
E+ C K T Y D+ SED+L +A+A P+SVAI+AS FQ YSGGV
Sbjct: 200 PEDRKCNFKKANVG-ATDKLYKDITSGSEDALQEAVATIGPISVAIDASHDSFQLYSGGV 258
Query: 284 Y-DGHCGTQ-LDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGI 341
Y + C T+ LDHGV AVGY S G DY IVKNSWG WG GYI M RN + CGI
Sbjct: 259 YNEKACSTKTLDHGVLAVGYDSKNGDDYWIVKNSWGKSWGIDGYIWMSRN---KKNQCGI 315
Query: 342 NKMASYPI 349
MASYP+
Sbjct: 316 ATMASYPV 323
>gi|129353|sp|P22895.1|P34_SOYBN RecName: Full=P34 probable thiol protease; Flags: Precursor
Length = 379
Score = 242 bits (618), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 138/342 (40%), Positives = 202/342 (59%), Gaps = 25/342 (7%)
Query: 29 SIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKN 88
SI+ T+ ++ LF+ W S+ +VY + +E+ +R EIFK+N +I + N K+
Sbjct: 25 SILDLDLTKFTTQKQVSSLFQLWKSEHGRVYHNHEEEAKRLEIFKNNSNYIRDMNANRKS 84
Query: 89 ---YWLGLNEFADLRHEEFKEMFLGLKPDLARR-----KDQSHEDFSYKDVVDLPKSVDW 140
+ LGLN+FAD+ +EF + +L D++++ K E +S P S DW
Sbjct: 85 PHSHRLGLNKFADITPQEFSKKYLQAPKDVSQQIKMANKKMKKEQYS---CDHPPASWDW 141
Query: 141 RKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGG 200
RKKG +T VK QG CG WAFS A+E + I TG+L SLSEQEL+DC + G G
Sbjct: 142 RKKGVITQVKYQGGCGRGWAFSATGAIEAAHAIATGDLVSLSEQELVDCVEE-SEGSYNG 200
Query: 201 LMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDV-------PQNSE 253
+F++++ GG+ ++DYPY +EG C+ K + + VTI+GY + +E
Sbjct: 201 WQYQSFEWVLEHGGIATDDDYPYRAKEGRCKANKIQDK-VTIDGYETLIMSDESTESETE 259
Query: 254 DSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQ---LDHGVAAVGYGSTRGLDYI 310
+ L A+ QP+SV+I+A +DF Y+GG+YDG T ++H V VGYGS G+DY
Sbjct: 260 QAFLSAILEQPISVSIDA--KDFHLYTGGIYDGENCTSPYGINHFVLLVGYGSADGVDYW 317
Query: 311 IVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIKKK 352
I KNSWG WGE GYI ++RNTG G+CG+N ASYP K++
Sbjct: 318 IAKNSWGFDWGEDGYIWIQRNTGNLLGVCGMNYFASYPTKEE 359
>gi|356552228|ref|XP_003544471.1| PREDICTED: LOW QUALITY PROTEIN: cysteine proteinase RD21a-like
[Glycine max]
Length = 351
Score = 242 bits (618), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 138/315 (43%), Positives = 191/315 (60%), Gaps = 13/315 (4%)
Query: 40 SNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADL 99
++D+++ +FE W+ K +KVY +L EK +RF+IFK+NLR IDE N + Y LGLN FADL
Sbjct: 37 TDDEVMSMFEEWLVKHDKVYNALGEKEKRFQIFKNNLRFIDERNSLNRTYKLGLNVFADL 96
Query: 100 RHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVV--DLPKSVDWRKKGAVTHVKNQG-SCG 156
+ E++ M+L D R + Y V +PKSVDWRK+GAVT VKNQG +C
Sbjct: 97 TNAEYRAMYLRTWDDGPRLDLDTPPRNHYVPRVGDTIPKSVDWRKEGAVTPVKNQGATCN 156
Query: 157 SCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLH 216
SCWAF+ V AVE + +I TG+L SLSEQE++DC + + GC GG + + + YI G+
Sbjct: 157 SCWAFTAVGAVESLVKIKTGDLISLSEQEVVDCTTSSSRGCGGGDIQHGYIYI-RKNGIS 215
Query: 217 KEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDF 276
E+DYPY +EG C+ K ++ +VTI+G+ VP E++L +AL D
Sbjct: 216 LEKDYPYRGDEGKCDSNK-KNAIVTIDGHGWVPTQLEEALNRAL----FCYCAYFLYVDK 270
Query: 277 QFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPE 336
F GV+ G CGT+L+H + VGYG+ + DY I KNS+ KWGE GYIR++R
Sbjct: 271 FFLCQGVFKGKCGTELNHALLLVGYGTEKDGDYWIAKNSYSDKWGENGYIRIQRKLST-- 328
Query: 337 GLCGINKMASYPIKK 351
C YPI K
Sbjct: 329 --CKFGNGGYYPIIK 341
>gi|225719058|gb|ACO15375.1| Cathepsin L1 precursor [Caligus clemensi]
Length = 326
Score = 242 bits (618), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 138/309 (44%), Positives = 183/309 (59%), Gaps = 19/309 (6%)
Query: 51 WMSKFEKVYESLDEKLERFEIFKDN----LRHIDETNRKIKNYWLGLNEFADLRHEEFKE 106
W + KVY S DE+ RF+IF++N +H +E + Y LG+N F DL H EF E
Sbjct: 26 WKATHGKVYNSADEESLRFKIFQENSLMITQHNEEYRQGFHTYILGMNHFGDLLHSEFLE 85
Query: 107 MFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAA 166
G + ++ + F++ +P +W KGAVT VK+QG CGSCWAFS +
Sbjct: 86 RSNGFQGGVS-----GGDVFTFDTNAPVPSYANWTAKGAVTPVKDQGKCGSCWAFSATGS 140
Query: 167 VEGINQIVTGNLASLSEQELIDCDNTYNN-GCNGGLMDYAFQYIVSTGGLHKEEDYPYIM 225
VEG + L SLSEQ+L+DC N GC GGLMD AF+Y ++ G+ E+ YPY
Sbjct: 141 VEGQIFLKKKKLMSLSEQQLVDCSGDEGNLGCGGGLMDNAFKYFIANKGIANEKSYPYTA 200
Query: 226 EEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVAIEASGRDFQFYSGGV- 283
++ C+ K S V TI+ + DV ED L A+AN P+SVAI+AS FQFY GV
Sbjct: 201 KDNDCKYKKSMS-VATISSFKDVKHKDEDQLKMAVANVGPVSVAIDASSSKFQFYESGVY 259
Query: 284 YDGHCGTQ-LDHGVAAVGYGSTR--GLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCG 340
YD +C ++ LDHGV AVGYG+ + G+D+ +VKNSW WG GYI+M RN + CG
Sbjct: 260 YDENCSSEVLDHGVLAVGYGTDKKSGMDFWLVKNSWAASWGLNGYIKMARN---KDNNCG 316
Query: 341 INKMASYPI 349
I MASYPI
Sbjct: 317 IATMASYPI 325
>gi|326493706|dbj|BAJ85314.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 365
Score = 242 bits (618), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 136/369 (36%), Positives = 195/369 (52%), Gaps = 27/369 (7%)
Query: 1 MALSSQFKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYE 60
M ++S +++ C++ F+++ P ++ + F WM K+ K Y
Sbjct: 1 MKMASSTPYLVLLLCLTTFLQAWLTAATYPPPAPPAFELPESEVRERFSKWMIKYSKHYS 60
Query: 61 SLDEKLERFEIFKDNLRHIDETNRKIKNYWLG-----------------LNEFADLRHEE 103
E+ RF++FK+N I + +R+ N +G +N F DL E
Sbjct: 61 CKQEEEMRFQVFKNNTNSIGQLDRQNPNPGVGGALGPSGSQVHTFQKVSMNRFGDLSPRE 120
Query: 104 FKEMFLGLKPDLARRKDQSHEDF-SYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFS 162
+ + GL R ++ + S+K P VDWR GAVT VK+QG+CGSCWAF+
Sbjct: 121 VIQQYTGLNTTSFRTASPTYLPYHSFK-----PCCVDWRSSGAVTGVKHQGTCGSCWAFA 175
Query: 163 TVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYP 222
VAA+EG+N+I TG L SLSEQ L+DCD T + GC GG D A + + GG+ EE YP
Sbjct: 176 AVAAIEGMNKIRTGELVSLSEQVLVDCD-TVSTGCGGGHSDSAMALVAARGGITSEERYP 234
Query: 223 YIMEEGTCEMTKGE-SEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSG 281
Y +G C++ K +I G+ VP N+E L A+A QP++V I+ASG FQFYSG
Sbjct: 235 YAGFQGKCDVDKLMFDHQASIKGFKAVPSNNEAQLAIAVAMQPVTVYIDASGSAFQFYSG 294
Query: 282 GVYDGHCGTQLDHGVAAVGY--GSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLC 339
G+Y G C ++H V VGY G G Y I KNSW WGE+GY+ + ++ G C
Sbjct: 295 GIYRGPCSANVNHAVTIVGYCEGPGEGNKYWIAKNSWSNDWGEQGYVYLAKDVAWSTGTC 354
Query: 340 GINKMASYP 348
G+ YP
Sbjct: 355 GLATSPFYP 363
>gi|402770501|gb|AFQ98385.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 242 bits (617), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 145/321 (45%), Positives = 190/321 (59%), Gaps = 17/321 (5%)
Query: 39 TSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRK----IKNYWLGLN 94
+S + L +E++ + +K Y+S E+L RF+IF +N I + N K + +Y LG+N
Sbjct: 18 SSQEILRTQWEAFKTTHKKTYQSHMEELLRFKIFTENSLIIAKHNAKYAKGLVSYKLGMN 77
Query: 95 EFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVD--LPKSVDWRKKGAVTHVKNQ 152
+F DL EF +F G RK +V D LPK+VDWRKKGAVT VK+Q
Sbjct: 78 QFGDLLAHEFARIFNGHH---GTRKTGGSTFLPPANVNDSSLPKAVDWRKKGAVTPVKDQ 134
Query: 153 GSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIVS 211
G CGSCWAFS ++EG + + G L SLSEQ L+DC ++ NNGC GGLM+ AF+YI +
Sbjct: 135 GQCGSCWAFSATGSLEGQHFLKNGELVSLSEQNLVDCSQSFGNNGCEGGLMEDAFKYIKA 194
Query: 212 TGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVAIE 270
G+ E+ YPY +G C K E T GY ++ SED L KA+A P+SVAI+
Sbjct: 195 NDGIDTEKSYPYEAVDGECRFKK-EDVGATDTGYVEIKAGSEDDLKKAVATVGPISVAID 253
Query: 271 ASGRDFQFYSGGVYD-GHCGTQ-LDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRM 328
AS FQ YS GVYD C ++ LDHGV VGYG G Y +VKNSW WG++GYI M
Sbjct: 254 ASHSSFQLYSEGVYDEPECSSEDLDHGVLVVGYGVKGGKKYWLVKNSWAESWGDQGYILM 313
Query: 329 KRNTGKPEGLCGINKMASYPI 349
R+ CGI ASYP+
Sbjct: 314 SRDNNNQ---CGIASQASYPL 331
>gi|290462225|gb|ADD24160.1| Cathepsin L [Lepeophtheirus salmonis]
Length = 334
Score = 242 bits (617), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 147/335 (43%), Positives = 194/335 (57%), Gaps = 23/335 (6%)
Query: 28 FSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNL----RHIDETN 83
S++ + ++ D ++ +ESW +K Y+S E+ R +IF +N RH E
Sbjct: 9 LSVIISTASAVSFFDVVLSDWESWKLTHQKGYDSSVEEKLRLKIFMENSLRISRHNAEAI 68
Query: 84 RKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKK 143
+ Y++ +N + DL H EF M G + K F ++LP+ VDWR++
Sbjct: 69 QGRHTYFMKMNHYGDLLHHEFVAMVNGY---IYNNKTTLGGTFIPSKNINLPEHVDWREE 125
Query: 144 GAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTY-NNGCNGGLM 202
GAVT VKNQG CGSCW+FS ++EG + TG L SLSEQ L+DC Y NNGC GGLM
Sbjct: 126 GAVTPVKNQGQCGSCWSFSATGSLEGQDFRKTGKLISLSEQNLVDCSRKYGNNGCEGGLM 185
Query: 203 DYAFQYIVSTGGLHKEEDYPYIMEEGTCEM---TKGESEVVTINGYHDVPQNSEDSLLKA 259
DYAF+YI G+ E YPY +G C KG S++ G+ D+ + SE L KA
Sbjct: 186 DYAFKYIQDNNGIDTEASYPYEGIDGHCHYDPKNKGGSDI----GFVDIKKGSEKDLQKA 241
Query: 260 LAN-QPLSVAIEASGRDFQFYSGGVY-DGHCGTQ-LDHGVAAVGYGS--TRGLDYIIVKN 314
LA P+SVAI+AS FQFYS GVY + C + LDHGV AVGYG+ G DY +VKN
Sbjct: 242 LATVGPISVAIDASHMSFQFYSHGVYSEKKCSPENLDHGVLAVGYGTDEVTGEDYWLVKN 301
Query: 315 SWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPI 349
SW KWGE GYI+M RN + +CGI ASYP+
Sbjct: 302 SWSEKWGEDGYIKMARN---KDNMCGIASSASYPV 333
>gi|7523482|dbj|BAA94210.1| putative cysteine protease [Oryza sativa Japonica Group]
gi|10800060|dbj|BAB16480.1| putative cysteine protease [Oryza sativa Japonica Group]
Length = 349
Score = 242 bits (617), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 132/315 (41%), Positives = 171/315 (54%), Gaps = 20/315 (6%)
Query: 47 LFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIK-NYWLGLNEFADLRHEEFK 105
+FE WM+KF K Y EK RF +F+DN+R I N L +N+FADL ++EF
Sbjct: 40 MFEEWMAKFGKKYPCHGEKEYRFGVFRDNVRFIRSYRPPAGYNSALRVNQFADLTNDEFV 99
Query: 106 EMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVA 165
G KP + + D + LP +DWR KGAVT VK+QG+CGSCWAF+ VA
Sbjct: 100 STHTGAKPPCPKDAPRG------VDPIWLPCCIDWRYKGAVTDVKDQGACGSCWAFAAVA 153
Query: 166 AVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIM 225
A+EG+ QI TG L LSEQEL+DCD T ++GC GG D AF+ + + GG+ E Y Y
Sbjct: 154 AIEGLTQIRTGKLTPLSEQELVDCD-TGSSGCAGGHTDRAFELVAAKGGITAESGYRYEG 212
Query: 226 EEGTCEMTKGE-SEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVY 284
G C + I G+ VP E L A+A QP++ I+ASG FQFY GV+
Sbjct: 213 YRGKCRADDALFNHAARIGGHRAVPPGDERQLATAVARQPVTAYIDASGPAFQFYGSGVF 272
Query: 285 DGHC---------GTQLDHGVAAVGY--GSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTG 333
G C +H V VGY G Y + KNSWG WGEKGYI ++++
Sbjct: 273 PGPCGSGSGAAAAAPTTNHAVTLVGYCQDGASGKKYWVAKNSWGKTWGEKGYILLEKDVA 332
Query: 334 KPEGLCGINKMASYP 348
P G CG+ YP
Sbjct: 333 SPHGTCGVAVSPFYP 347
>gi|440799058|gb|ELR20119.1| cysteine proteinase [Acanthamoeba castellanii str. Neff]
Length = 401
Score = 242 bits (617), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 138/327 (42%), Positives = 180/327 (55%), Gaps = 17/327 (5%)
Query: 35 PEDLTSNDKLIDL-----FESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKN- 88
P N + ++L F WM K Y D L RFEI+K N R I N+K N
Sbjct: 77 PRQPAPNPRDVELEEQRAFTEWMRTHRKSYHH-DHFLPRFEIWKTNNRWITHWNKKHANA 135
Query: 89 --YWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHE-DFSYKDVVDLPKSVDWRKKGA 145
+ + +N+F DL +EF ++ GL A + + E + + +P+S DWR+KG
Sbjct: 136 SSFTVAINQFGDLTSDEFNRLYNGLHVFSAPKASEKVERPRQWANTAGIPESGDWRQKGV 195
Query: 146 VTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDN-TYNN-GCNGGLMD 203
V+ VK+QG CGSCWAFST + EGIN I T L LSEQ L+DC Y+N GCNGG MD
Sbjct: 196 VSRVKDQGMCGSCWAFSTTGSTEGINAITTSRLVPLSEQNLVDCATAAYDNYGCNGGFMD 255
Query: 204 YAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQ 263
AF+YI+ G+ E YPY+ +G C +P+ E +LL A A Q
Sbjct: 256 NAFRYIIDNKGIDSEASYPYVAADGQCRFNPKTVYGGKGGTLKSLPKGDEKALLVAAARQ 315
Query: 264 PLSVAIEASGRDFQFYSGGVY-DGHC-GTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWG 321
P+SV I+A FQFYS GVY + C T+L+HGV VG+G RG Y +VKNSWG WG
Sbjct: 316 PISVGIDAGRPSFQFYSKGVYNEPECSSTELNHGVLIVGWGVERGQAYWLVKNSWGQTWG 375
Query: 322 EKGYIRMKRNTGKPEGLCGINKMASYP 348
GYI+M R+ CGI +ASYP
Sbjct: 376 MDGYIKMSRDKNN---QCGIATLASYP 399
>gi|194352764|emb|CAQ00110.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
Length = 406
Score = 241 bits (616), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 141/351 (40%), Positives = 183/351 (52%), Gaps = 48/351 (13%)
Query: 42 DKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKN----YWLGLNEFA 97
D ++D F WM+ + Y + EK RFE+++ N+R I+ N + Y LG F
Sbjct: 57 DLMMDRFHVWMTVHNRSYSTAGEKARRFEVYRSNMRFIEAVNAEAATSGLTYELGEGPFT 116
Query: 98 DLRHEEFKEMFLGLK-----------------------PDLARRKDQS-HEDFSYKDVVD 133
DL +EEF E++ G L K + + +FS
Sbjct: 117 DLTNEEFMELYTGQILEDDQSEDGDDDEQIITTHAGSIDGLGTHKGATVYANFS----AS 172
Query: 134 LPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTY 193
P S+DWRK+G VT VKNQ CGSCWAF TVA +EGI++I G L SLSEQ+LIDCD
Sbjct: 173 APTSIDWRKRGVVTPVKNQKQCGSCWAFPTVATIEGIHKIKRGTLVSLSEQQLIDCD-YL 231
Query: 194 NNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSE 253
+NGC GGL+ AFQ+I GG+ Y Y G C + I G+ V NSE
Sbjct: 232 DNGCKGGLVTRAFQWIKKNGGITSTSSYKYKAVRGRC--LRNRKPAAKIVGFRKVKSNSE 289
Query: 254 DSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCG-TQLDHGVAAVGYG---------- 302
SL+ A+ANQP++V+I + F Y GG+Y+G C T+L+H V VGYG
Sbjct: 290 VSLMNAVANQPVAVSISSHSSHFHHYKGGIYNGPCSTTKLNHAVTVVGYGQQQQNGADSV 349
Query: 303 --STRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIKK 351
S G Y IVKNSWG WG+KGYI MKR T G CGI +P+ K
Sbjct: 350 HASAPGAKYWIVKNSWGTTWGDKGYILMKRGTKHSSGQCGIATRPVFPLMK 400
>gi|405966499|gb|EKC31777.1| Cathepsin L [Crassostrea gigas]
Length = 331
Score = 241 bits (616), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 137/301 (45%), Positives = 180/301 (59%), Gaps = 15/301 (4%)
Query: 56 EKVYESLDEKLERFEIFKDNLRHIDETN----RKIKNYWLGLNEFADLRHEEFKEMFLGL 111
+K Y +E++ R I++DN+ +I + N R YWLG NE+AD+ EF+ + G
Sbjct: 36 KKTYSQDEEQMRRL-IWEDNVNYIQKHNLAADRGEHTYWLGQNEYADMTIFEFRAIMNGY 94
Query: 112 KPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGIN 171
K R K + S ++ DLP SVDWRK+G VT +KNQG CGSCW+FS ++EG +
Sbjct: 95 KMSANRTKGDLY--MSPSNIGDLPDSVDWRKEGYVTDIKNQGHCGSCWSFSATGSLEGQH 152
Query: 172 QIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTC 230
+ L SLSEQ L+DC N+GC GGLMD AF+YI S G+ EE YPY + G C
Sbjct: 153 FKASKKLVSLSEQNLVDCSKKEGNHGCQGGLMDNAFRYIESNKGIDTEESYPYTAKNGFC 212
Query: 231 EMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVAIEASGRDFQFYSGGVY-DGHC 288
K E+ T GY D+P ED L +A+A P+SV I+A + FQ Y GVY + C
Sbjct: 213 HF-KAENVGATDTGYVDIPHMQEDKLQEAVATVGPISVGIDAGHKSFQLYREGVYSEPAC 271
Query: 289 -GTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASY 347
++LDHGV AVGYG+ G DY +VKNSWG WG +GY+ M RN +CGI ASY
Sbjct: 272 SSSKLDHGVLAVGYGTESGDDYWLVKNSWGTSWGMQGYVMMARN---KHNMCGIATQASY 328
Query: 348 P 348
P
Sbjct: 329 P 329
>gi|242079875|ref|XP_002444706.1| hypothetical protein SORBIDRAFT_07g026400 [Sorghum bicolor]
gi|241941056|gb|EES14201.1| hypothetical protein SORBIDRAFT_07g026400 [Sorghum bicolor]
Length = 374
Score = 241 bits (615), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 150/342 (43%), Positives = 192/342 (56%), Gaps = 30/342 (8%)
Query: 36 EDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETN-RKIKNYWLGLN 94
+DL S+ + DL+E W S + + L EK RF+ FK N R I+E N R+ ++Y L LN
Sbjct: 38 KDLESDASMWDLYERWCSVYAGSSD-LAEKQRRFDAFKMNARQINEFNKREDESYKLALN 96
Query: 95 EFADLRHEEFKE-MFLGLKPDLARRKDQSHE----DFSYKDVVD-------------LPK 136
+F+ L EEF M+ G P+L + S S D D +P
Sbjct: 97 QFSGLTEEEFNSGMYTGALPELDAGGNISSSVGTSGMSMTDDNDDKLLVSAGGNDDKVPA 156
Query: 137 SVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNG 196
DWR+ GAVT VKNQG CGSCWAFS V +VEGIN I TG L +LSEQE++DC
Sbjct: 157 KWDWRRHGAVTPVKNQGQCGSCWAFSMVGSVEGINAIKTGKLQTLSEQEVLDCSGA--GT 214
Query: 197 CNGGLMDYAFQYIVSTG-GLHKEEDYP----YIMEEGTCEMTKGESEVVTINGYHDVPQN 251
C GG +F + + G L + + P Y+ E+ C + VV ING +
Sbjct: 215 CKGGNTYKSFDHAMRPGLALDHQGNPPYYPAYVAEKKKCRFNPNK-PVVKINGKRMMRNT 273
Query: 252 SEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGST-RGLDYI 310
+E LL ++ QP+SV +EAS + F YS GV+ G CGT L+H V VGYG+T G++Y
Sbjct: 274 NEAELLLRVSKQPVSVVVEAS-QAFSRYSKGVFTGPCGTNLNHAVLVVGYGTTPNGINYW 332
Query: 311 IVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIKKK 352
IVKNSWG WGE GYIRMKRN G GLCGI M YPIK K
Sbjct: 333 IVKNSWGKGWGENGYIRMKRNVGTKAGLCGIYMMPMYPIKNK 374
>gi|94421564|gb|ABF18889.1| cathepsin-L [Lygus lineolaris]
Length = 314
Score = 241 bits (615), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 135/291 (46%), Positives = 174/291 (59%), Gaps = 13/291 (4%)
Query: 48 FESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIK----NYWLGLNEFADLRHEE 103
+ES+ +K+ K YES + + R I+ + E N + + +Y LGLN FAD+ + E
Sbjct: 27 WESYKAKYGKTYESNENEAARRTIYFMAKEKVMEHNARFEQGLVSYKLGLNSFADMHNGE 86
Query: 104 FKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFST 163
F++M G + R H + + + LP SVDWR KGAVT +KNQG CGSCWAFST
Sbjct: 87 FRKMMNGYRRGTPRNSVVVHVESN----ITLPASVDWRTKGAVTPIKNQGQCGSCWAFST 142
Query: 164 VAAVEGINQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIVSTGGLHKEEDYP 222
++EG + + G L SLSEQEL+DC N+GC+GGLMD AF YI G+ E+ YP
Sbjct: 143 TGSLEGQHALKKGKLVSLSEQELVDCSAAEGNDGCDGGLMDDAFTYIKKNNGIDTEQSYP 202
Query: 223 YIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVAIEASGRDFQFYSG 281
Y E+GTC K + T+ G+ DV SE L A A P+SVAI+AS DFQ Y
Sbjct: 203 YTGEDGTCSFKKSDV-AATVTGFVDVTSGSESGLQDASATIGPISVAIDASSWDFQLYES 261
Query: 282 GVYD-GHCG-TQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKR 330
GVYD C T+LDHGV VGYG+ G Y +VKNSWG WG GYI+M R
Sbjct: 262 GVYDVSDCSTTELDHGVLVVGYGTDDGTAYWLVKNSWGTDWGHHGYIQMSR 312
>gi|118429523|gb|ABK91809.1| cathepsin L-like proteinase precursor [Clonorchis sinensis]
Length = 373
Score = 241 bits (615), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 138/323 (42%), Positives = 188/323 (58%), Gaps = 23/323 (7%)
Query: 41 NDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKI----KNYWLGLNEF 96
N L +++ +M+ +++ Y E RF+IF +N I + N + +Y +G+NEF
Sbjct: 59 NFTLSSIWKHFMTTYKRNYIDPSEHERRFKIFANNFVRISKHNVRFIQGQVSYTMGINEF 118
Query: 97 ADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKS-VDWRKKGAVTHVKNQGSC 155
+D EE K + + L +D S Y + P S +DWR KGAVT VKNQG+C
Sbjct: 119 SDKTDEELKRLRC-FRGSLNASRDGS----KYITIAAPPPSEIDWRNKGAVTPVKNQGNC 173
Query: 156 GSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIVSTGG 214
GSCWAFS A+EG N + TGNL SLSEQ+L+DC + Y NN CNGGLMD AF+Y+ + G
Sbjct: 174 GSCWAFSATGAIEGQNFLATGNLVSLSEQQLVDCSSEYGNNACNGGLMDNAFKYVKDSNG 233
Query: 215 LHKEEDYPYIMEEG-----TCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQ-PLSVA 268
+ E YPY+ E TC E+ VV + GY D+P+ L +A+ + P+SVA
Sbjct: 234 IDTEASYPYVSGETGDANPTCRFNLKEA-VVRVTGYIDLPRGQVSELKQAVGHYGPISVA 292
Query: 269 IEASGRDFQFYSGGVY-DGHCGT-QLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYI 326
I A F Y GVY D C + LDHGV VGYG G+ Y ++KNSWGP WGE GY+
Sbjct: 293 INAGLPSFMSYKSGVYSDDQCSSDDLDHGVLLVGYGEENGIPYWLIKNSWGPHWGENGYV 352
Query: 327 RMKRNTGKPEGLCGINKMASYPI 349
++ R+ LCG+ MASYP+
Sbjct: 353 KILRDHNN---LCGVASMASYPL 372
>gi|302758108|ref|XP_002962477.1| hypothetical protein SELMODRAFT_78855 [Selaginella moellendorffii]
gi|300169338|gb|EFJ35940.1| hypothetical protein SELMODRAFT_78855 [Selaginella moellendorffii]
Length = 370
Score = 241 bits (615), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 146/371 (39%), Positives = 203/371 (54%), Gaps = 23/371 (6%)
Query: 1 MALSSQFKTI-LISFCISFFIRSS----FARDFSIVG----YSPEDLTSNDKLIDLFESW 51
MALS + + L++ C + + ++ RD G Y PE+L + +F+ W
Sbjct: 1 MALSRRHVLLALLACCFTLVVVATAFPHHGRDEDREGPNFWYPPEELDPDAGFKFMFDRW 60
Query: 52 MSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGL 111
++ +VY E+ +FE+FK N+R + + R ++ YWLGL+ DL HEEFK
Sbjct: 61 RAEHSRVYAERAEEERKFELFKRNVRMLHDYYRNLRLYWLGLDHLPDLDHEEFKPRLPSR 120
Query: 112 KPDLARRKDQSHEDFSYKDVV----DLPKSVDWRKKGAVTHVKNQGSC-GSCWAFSTVAA 166
R H D + +P++VDWRK+GAVT VK+ G+C G WAF+T A
Sbjct: 121 VASPVLRTKVDHSDERPEPRRPPFPHVPEAVDWRKEGAVTSVKDVGNCTGGGWAFATAGA 180
Query: 167 VEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIME 226
VEG+N+IVTGNL LS QELIDCD YN GC+ G +F YI TG L YPYI +
Sbjct: 181 VEGLNKIVTGNLVELSAQELIDCD-VYNGGCDYGFPQDSFAYIQKTG-LEASASYPYIGK 238
Query: 227 EGTCEMTKGESEVVTINGYHDVPQNS----EDSLLKALANQPLSVAIEASGRDFQFYSGG 282
TC + T+ G P S E+ L +A QP++ I+ S +DF Y+GG
Sbjct: 239 NSTCHIGVFIDGFDTLRGSLCAPGVSASDIEEELKMRVAQQPVTALIDGSSKDFAKYTGG 298
Query: 283 VYDGHCGTQLDHGVAAV---GYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLC 339
++ G C + D G+ AV GYGS G DY I+KNS G KWGE+GY++++R TG G C
Sbjct: 299 IFKGPCHSTGDTGLTAVLIVGYGSDNGDDYWILKNSRGTKWGEQGYMKIQRGTGLYGGRC 358
Query: 340 GINKMASYPIK 350
GIN +P K
Sbjct: 359 GINNYVFFPRK 369
>gi|443708542|gb|ELU03619.1| hypothetical protein CAPTEDRAFT_17807 [Capitella teleta]
Length = 350
Score = 241 bits (615), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 138/315 (43%), Positives = 185/315 (58%), Gaps = 17/315 (5%)
Query: 48 FESWMSKFEKVYESL---DEKLERFEIFKDNLRHIDETNRKIKN----YWLGLNEFADLR 100
FE F+ V+E E+ +R E+F++NL+ I N + Y +G+N+FAD+
Sbjct: 39 FEKLWQDFKTVHERTYGETEESQRKEVFRNNLKKIQAHNHLHEQGKSPYRMGINQFADME 98
Query: 101 HEEFKEMFLGLK-PDLARRKDQSHEDFSYKDV-VDLPKSVDWRKKGAVTHVKNQGSCGSC 158
EF + G + + +D H ++ + V +P VDWRK+G VT VKNQG CGSC
Sbjct: 99 ANEFASIMNGFRMNNRTEVRDHLHANYISPAIPVSVPAEVDWRKEGYVTPVKNQGQCGSC 158
Query: 159 WAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIVSTGGLHK 217
WAFST ++EG + TG L SLSEQ L+DC +Y N GCNGG++DYAFQYI G
Sbjct: 159 WAFSTTGSLEGQHFRKTGKLVSLSEQNLVDCSTSYGNEGCNGGIVDYAFQYIKDNDGDDT 218
Query: 218 EEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALA-NQPLSVAIEASGRDF 276
E YPY +GTC K T GY D+P+ E + +A+A P+SVAI+AS F
Sbjct: 219 EACYPYEAVDGTCRF-KSVCVGATCTGYTDLPKGDEAKMKEAVALVGPVSVAIDASHSSF 277
Query: 277 QFYSGGVY-DGHCGT-QLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGK 334
Q Y G+Y + C QLDH V VGYG+ +G DY +VKNSWG WG++GYI+M RN
Sbjct: 278 QMYQSGIYVEQECSPKQLDHAVLVVGYGTEQGQDYWLVKNSWGTTWGDEGYIKMARNM-- 335
Query: 335 PEGLCGINKMASYPI 349
+ CGI ASYP+
Sbjct: 336 -DNQCGIASQASYPL 349
>gi|294897727|ref|XP_002776051.1| cathepsin L, putative [Perkinsus marinus ATCC 50983]
gi|239882576|gb|EER07867.1| cathepsin L, putative [Perkinsus marinus ATCC 50983]
Length = 361
Score = 241 bits (615), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 131/294 (44%), Positives = 176/294 (59%), Gaps = 13/294 (4%)
Query: 48 FESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEM 107
F + KF K YES +E+++R IF+ NL HI++ N + +Y LG+NE+ DL HEEF +
Sbjct: 31 FIGFQYKFGKKYESKEEEIKRNAIFQVNLHHIEQINARNLSYKLGVNEYTDLTHEEFAAL 90
Query: 108 FLGLKPDLARRKDQ-----SHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFS 162
LG+ R+ D + D L SVDWR K +T +K+QG CGSCWAFS
Sbjct: 91 KLGILKMSLRKDDNWISLANSSLLVSADTTQLAASVDWRNKSVLTPIKDQGHCGSCWAFS 150
Query: 163 TVAAVEGINQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIVSTGGLHKEEDY 221
+ A+E I TG L SLSEQ+L+DC ++Y N+GCNGG M YA+ YI S+ G+ +E Y
Sbjct: 151 STGALEAQYAIATGKLLSLSEQQLVDCSSSYGNHGCNGGWMQYAYDYIKSS-GIDQESTY 209
Query: 222 PYIMEEGTCEMTKGESE----VVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQ 277
PY + TC+ + + V + GYH + Q +E +L+ L P+SVA+ AS DFQ
Sbjct: 210 PYEASDNTCQKSLEKLSDGLPVGEVTGYHMLEQ-TEQALMTRLVAAPVSVAMYASDPDFQ 268
Query: 278 FYSGGVYDGH-CGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKR 330
FY GVY C LDH V AVGYG+ G DY I +NSWG WG+ GY +KR
Sbjct: 269 FYKSGVYSSDTCNGGLDHAVVAVGYGNENGEDYFIGRNSWGTSWGQDGYFYLKR 322
>gi|357124027|ref|XP_003563708.1| PREDICTED: germination-specific cysteine protease 1-like
[Brachypodium distachyon]
Length = 334
Score = 241 bits (615), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 130/323 (40%), Positives = 188/323 (58%), Gaps = 15/323 (4%)
Query: 41 NDKLI-DLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKI-----KNYWLGLN 94
+DK + + +E WM++ + Y+ EK RFE+FK N ID N L N
Sbjct: 12 DDKAMRERYEKWMAEQGRTYKDSTEKARRFEVFKSNAHFIDSHNAATGPGGKSRPKLTTN 71
Query: 95 EFADLRHEEFKEMFL-GLKPDLARRKDQSHEDFSYKDVV--DLPKSVDWRKKGAVTHVKN 151
+FADL +EF+ +++ G + + + F + V D+P S+DWR +GAVT VK+
Sbjct: 72 KFADLTEDEFRNIYVTGHRVNYRPTSLVTDTVFKFGAVSLSDVPPSIDWRARGAVTSVKD 131
Query: 152 QGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVS 211
Q C CWAFS+ AAVEGI+QI TGN SLS Q+L+DC N N C G +D A++YI
Sbjct: 132 QHLCACCWAFSSAAAVEGIHQITTGNQVSLSVQQLVDCSNAANEKCKAGEIDKAYEYIAR 191
Query: 212 TGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEA 271
+GGL ++DYPY GTC + G+ V I+G+ VP +E +LL A+A+QP+SVA++
Sbjct: 192 SGGLVADQDYPYEGHSGTCRV-YGKQAVARISGFQYVPARNETALLLAVAHQPVSVALDG 250
Query: 272 SGRDFQFYSGGVYDGH---CGTQLDHGVAAVGYGS-TRGLDYIIVKNSWGPKWGEKGYIR 327
R Q G++ C T L+H + VGYG+ G Y ++KNSWG WG+KGY++
Sbjct: 251 LSRALQHIGTGIFGSAGEPCTTNLNHAMTIVGYGTDEHGTRYWLMKNSWGSDWGDKGYVK 310
Query: 328 MKRNTGKP-EGLCGINKMASYPI 349
R+ G+CG+ ASYP+
Sbjct: 311 FARDVASEINGVCGLALEASYPV 333
>gi|402770509|gb|AFQ98389.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 241 bits (615), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 143/321 (44%), Positives = 190/321 (59%), Gaps = 17/321 (5%)
Query: 39 TSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDN----LRHIDETNRKIKNYWLGLN 94
+S + L +E++ + +K Y+S E+L RF+IF ++ RH + + + +Y LG+N
Sbjct: 18 SSQEILRTQWEAFKTTHKKTYQSHMEELLRFKIFTESSLIIARHNAKYAKGLVSYKLGMN 77
Query: 95 EFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVD--LPKSVDWRKKGAVTHVKNQ 152
+F DL EF +F G RK +V D LPK+VDWRKKGAVT VK+Q
Sbjct: 78 QFGDLLAHEFARIFNGHH---GTRKTGGSTFLPPANVNDSSLPKAVDWRKKGAVTPVKDQ 134
Query: 153 GSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIVS 211
G CGSCWAFS ++EG + + G L SLSEQ L+DC ++ NNGC GGLM+ AF+YI +
Sbjct: 135 GQCGSCWAFSATGSLEGQHFLKNGELVSLSEQNLVDCSQSFGNNGCEGGLMEDAFKYIKA 194
Query: 212 TGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVAIE 270
G+ E+ YPY +G C K E T GY ++ SED L KA+A P+SVAI+
Sbjct: 195 NDGIDTEKSYPYEAVDGECRFKK-EDVGATDTGYVEIKAGSEDDLKKAVATVGPISVAID 253
Query: 271 ASGRDFQFYSGGVYD-GHCGTQ-LDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRM 328
AS FQ YS GVYD C ++ LDHGV VGYG G Y +VKNSW WG++GYI M
Sbjct: 254 ASHSSFQLYSEGVYDEPECSSEDLDHGVLVVGYGVKGGKKYWLVKNSWAESWGDQGYILM 313
Query: 329 KRNTGKPEGLCGINKMASYPI 349
R+ CGI ASYP+
Sbjct: 314 SRDNNNQ---CGIASQASYPL 331
>gi|33242878|gb|AAQ01143.1| cathepsin [Branchiostoma lanceolatum]
Length = 334
Score = 241 bits (615), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 141/313 (45%), Positives = 179/313 (57%), Gaps = 14/313 (4%)
Query: 48 FESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRK----IKNYWLGLNEFADLRHEE 103
+E W + K YE+ E+ R IF+ N I E N + + +Y L +N+F D+ HEE
Sbjct: 24 WEMWKLQHGKQYETEAEEYSRRFIFEKNTVKIAEHNIRASLGMHSYTLAMNKFGDMHHEE 83
Query: 104 FKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFST 163
F + +G + ++ E D LPKSVDWR V+ VK+QG CGSCWAFST
Sbjct: 84 FHQRIMGGCLKIVKKPLLGSEVGDNDDNGTLPKSVDWRNSHMVSEVKDQGECGSCWAFST 143
Query: 164 VAAVEGINQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIVSTGGLHKEEDYP 222
++EG + TG L LSEQ+L+DC + N GC GGLMD AFQYI + GGL EE YP
Sbjct: 144 TGSLEGQHSNKTGKLVDLSEQQLVDCSKDFGNQGCGGGLMDQAFQYIKANGGLDTEESYP 203
Query: 223 YIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVAIEASGRDFQFYSG 281
Y + S T+ GY DV +E +L +A+A P+SVAI+A FQFYS
Sbjct: 204 YTATDDKPCKFDNSSVGATLVGYKDVKSGNEHALKRAVATVGPVSVAIDAGHESFQFYSS 263
Query: 282 GVYD-GHCGT-QLDHGVAAVGYGSTRG---LDYIIVKNSWGPKWGEKGYIRMKRNTGKPE 336
GVYD C T QLDHGV AVGYG+ + IVKNSWGP WG++GYI M RN
Sbjct: 264 GVYDEPQCSTEQLDHGVLAVGYGAMNDNSHQAFWIVKNSWGPSWGDQGYIMMSRNKNNQ- 322
Query: 337 GLCGINKMASYPI 349
CGI ASYP+
Sbjct: 323 --CGIATSASYPL 333
>gi|218187750|gb|EEC70177.1| hypothetical protein OsI_00904 [Oryza sativa Indica Group]
gi|222617983|gb|EEE54115.1| hypothetical protein OsJ_00884 [Oryza sativa Japonica Group]
Length = 327
Score = 241 bits (614), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 132/315 (41%), Positives = 171/315 (54%), Gaps = 20/315 (6%)
Query: 47 LFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIK-NYWLGLNEFADLRHEEFK 105
+FE WM+KF K Y EK RF +F+DN+R I N L +N+FADL ++EF
Sbjct: 18 MFEEWMAKFGKKYPCHGEKEYRFGVFRDNVRFIRSYRPPAGYNSALRVNQFADLTNDEFV 77
Query: 106 EMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVA 165
G KP + + D + LP +DWR KGAVT VK+QG+CGSCWAF+ VA
Sbjct: 78 STHTGAKPPCPKDAPRG------VDPIWLPCCIDWRYKGAVTDVKDQGACGSCWAFAAVA 131
Query: 166 AVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIM 225
A+EG+ QI TG L LSEQEL+DCD T ++GC GG D AF+ + + GG+ E Y Y
Sbjct: 132 AIEGLTQIRTGKLTPLSEQELVDCD-TGSSGCAGGHTDRAFELVAAKGGITAESGYRYEG 190
Query: 226 EEGTCEMTKGE-SEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVY 284
G C + I G+ VP E L A+A QP++ I+ASG FQFY GV+
Sbjct: 191 YRGKCRADDALFNHAARIGGHRAVPPGDERQLATAVARQPVTAYIDASGPAFQFYGSGVF 250
Query: 285 DGHC---------GTQLDHGVAAVGY--GSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTG 333
G C +H V VGY G Y + KNSWG WGEKGYI ++++
Sbjct: 251 PGPCGSGSGAAAAAPTTNHAVTLVGYCQDGASGKKYWVAKNSWGKTWGEKGYILLEKDVA 310
Query: 334 KPEGLCGINKMASYP 348
P G CG+ YP
Sbjct: 311 SPHGTCGVAVSPFYP 325
>gi|158300877|ref|XP_001689282.1| AGAP011828-PA [Anopheles gambiae str. PEST]
gi|157013372|gb|EDO63348.1| AGAP011828-PA [Anopheles gambiae str. PEST]
Length = 344
Score = 241 bits (614), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 144/322 (44%), Positives = 194/322 (60%), Gaps = 25/322 (7%)
Query: 49 ESWMS---KFEKVYESLDEKLERFEIFKDNLRHIDETNRKI----KNYWLGLNEFADLRH 101
E W + + K Y+S E+ R +I+ N I + N++ + + L +N++ADL H
Sbjct: 26 EEWTAFKLQHRKKYDSETEERIRMKIYVQNKHKIAKHNQRYDLGQEKFRLRVNKYADLLH 85
Query: 102 EEFKEMFLGL------KPDLARRKDQSHED---FSYKDVVDLPKSVDWRKKGAVTHVKNQ 152
EEF G K L R + + E+ + VD+P ++DWR KGAVT VK+Q
Sbjct: 86 EEFVHTLNGFNRSVSGKGQLLRGELKPIEEPVTWIEPANVDVPTAMDWRTKGAVTQVKDQ 145
Query: 153 GSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIVS 211
G CGSCW+FS A+EG + TG L SLSEQ L+DC Y NNGCNGG+MD+AFQYI
Sbjct: 146 GHCGSCWSFSATGALEGQHFRKTGKLVSLSEQNLVDCSQKYGNNGCNGGMMDFAFQYIKD 205
Query: 212 TGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVAIE 270
G+ E+ YPY + C ++ T G+ D+PQ +E +L+KALA P+SVAI+
Sbjct: 206 NKGIDTEKSYPYEAIDDECHYNP-KAVGATDKGFVDIPQGNEKALMKALATVGPVSVAID 264
Query: 271 ASGRDFQFYSGGV-YDGHCGT-QLDHGVAAVGYGSTR-GLDYIIVKNSWGPKWGEKGYIR 327
AS FQFYS GV Y+ C + QLDHGV AVGYG+T G DY +VKNSWG WG++GY++
Sbjct: 265 ASHESFQFYSEGVYYEPQCDSEQLDHGVLAVGYGTTEDGEDYWLVKNSWGTTWGDQGYVK 324
Query: 328 MKRNTGKPEGLCGINKMASYPI 349
M RN + CGI ASYP+
Sbjct: 325 MARNR---DNHCGIATTASYPL 343
>gi|357130488|ref|XP_003566880.1| PREDICTED: fruit bromelain-like [Brachypodium distachyon]
Length = 356
Score = 241 bits (614), Expect = 5e-61, Method: Compositional matrix adjust.
Identities = 135/324 (41%), Positives = 183/324 (56%), Gaps = 33/324 (10%)
Query: 49 ESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKI-KNYWLGLNEFADLRHEEFKEM 107
E WM+KF +VY EK R E+F N R++D NR + Y LGLN+F+DL +EF +
Sbjct: 40 EEWMAKFGRVYTDAQEKARRQEVFGANARYVDAVNRAGNRTYTLGLNKFSDLTDDEFVQT 99
Query: 108 FLGLKPDLARRKDQSHEDFSYKDVV-----DLPKSVDWRKKGAVTHVKNQGSCGSCWAFS 162
LG + E+ S + D+P+SVDWR +GAVT VKNQGSCG CWAF+
Sbjct: 100 HLGYRGHQQGGLRPEEENVSKVAALGYGQADMPESVDWRAQGAVTGVKNQGSCGCCWAFA 159
Query: 163 TVAAVEGINQIVTGNLASLSEQELIDCDNTY-----NNGCNGGLMDYAFQYIVSTGGLHK 217
VAA EG+ +I TGNL S+SEQ+++DC N C+GG +D A +Y+ ++ GL
Sbjct: 160 AVAATEGLVKIATGNLISMSEQQVLDCTGQSPGMGNTNTCDGGHIDDALRYVAASRGLQP 219
Query: 218 EEDYPYIMEEGTCE--------MTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAI 269
E Y Y +G C+ + GE + VT+ Q E L +A QP++V++
Sbjct: 220 EAAYAYTGLQGACQSGFTPNSAASFGEPQTVTL-------QGDEGRLQGLVAGQPIAVSV 272
Query: 270 EASGRDFQFYSGGVYDG---HCGTQLDHGVAAVGYGST-RGLDYIIVKNSWGPKWGEKGY 325
EAS DF+ Y GV+ CG +L+H V VGYGS G +Y +VKN WG WGE GY
Sbjct: 273 EAS-DDFRHYMSGVFTAGTSSCGQRLNHAVTVVGYGSADGGQEYWLVKNQWGTSWGEGGY 331
Query: 326 IRMKRNTGKPEGLCGINKMASYPI 349
+R+ R G P CGI+ A YP
Sbjct: 332 MRIARGNGAPN--CGISAYAYYPT 353
>gi|356517384|ref|XP_003527367.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 332
Score = 241 bits (614), Expect = 5e-61, Method: Compositional matrix adjust.
Identities = 139/306 (45%), Positives = 183/306 (59%), Gaps = 25/306 (8%)
Query: 58 VYESLDEKLERFE---------IFKDNLRHIDETNRKI-KNYWLGLNEFADLRHEEFKEM 107
+YES +++ R+ +FK+N+ +I+ N K Y +N+FA K+
Sbjct: 35 MYESHGQRMTRYSKVDKDPPDXVFKENVNYIEACNNAADKPYKRDINQFAP------KKR 88
Query: 108 FLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAV 167
F G R F +++V P +VD R+K AVT +K+QG CG WA S VAA
Sbjct: 89 FKGHMCSSIIRITT----FKFENVTATPSTVDCRQKVAVTPIKDQGQCGCFWALSAVAAT 144
Query: 168 EGINQIVTGNLASLS-EQELIDCDNT-YNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIM 225
EGI+ + G L LS EQEL+DCD + C GGLMD AF++I+ GL+ E +YPY
Sbjct: 145 EGIHALXAGKLILLSSEQELVDCDTKGVDQDCQGGLMDDAFKFIIQNHGLNTEANYPYKG 204
Query: 226 EEGTCEMTKGESEVVTI-NGYHDVPQNSEDS-LLKALANQPLSVAIEASGRDFQFYSGGV 283
+G C + + TI GY DVP N+E + L KA+AN P+SVAI+ASG DFQFY GV
Sbjct: 205 VDGKCNAYEADKNAATIITGYEDVPANNEKAHLQKAVANNPVSVAIDASGSDFQFYKSGV 264
Query: 284 YDGHCGTQLDHGVAAVGYG-STRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGIN 342
+ G CGT+LDHGV AVGYG S G +Y +VKNS G +WGE+GYIRM+R E LCGI
Sbjct: 265 FTGSCGTELDHGVTAVGYGVSDDGTEYWLVKNSRGTEWGEEGYIRMQRGVDSEEALCGIA 324
Query: 343 KMASYP 348
ASYP
Sbjct: 325 VQASYP 330
>gi|414591039|tpg|DAA41610.1| TPA: hypothetical protein ZEAMMB73_356414 [Zea mays]
Length = 376
Score = 241 bits (614), Expect = 5e-61, Method: Compositional matrix adjust.
Identities = 152/339 (44%), Positives = 186/339 (54%), Gaps = 33/339 (9%)
Query: 36 EDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETN-RKIKNYWLGLN 94
+DL S + + L+E W S V L EK RFE FK N RHI E N RK Y LGLN
Sbjct: 33 KDLESEESMWSLYERWRS-VHTVSRDLREKQSRFEAFKANARHIGEFNKRKDVPYKLGLN 91
Query: 95 EFADLRHEEFKEMFLGLK---PDLARR---------KDQSHEDFSYKDVVDLPKSVDWRK 142
+FADL EEF + G K + A R D+S + V D P + DWR
Sbjct: 92 KFADLTQEEFVSKYTGAKVVDSEAAARLASGVRVSSSDESPPQLA-ASVGDAPDAWDWRD 150
Query: 143 KGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDC----DNTYNNGCN 198
GAVT VK+QG CGSCWAFS V AVE +N IVTGNL +LSEQ+++DC D TY
Sbjct: 151 HGAVTAVKDQGQCGSCWAFSAVGAVESVNAIVTGNLLTLSEQQMLDCSGAGDCTY----- 205
Query: 199 GGLMDYAFQYIVSTG-GLHKEEDYPY-----IMEEGTCEMTKGESEVVTINGYHDVPQNS 252
GG YA Y +S G L + PY + C + VV I+ + +
Sbjct: 206 GGYTYYAMLYAISNGLTLDQCGKTPYYQRYDAQQHLPCRFDAKKPPVVKIDSMYVMNNAD 265
Query: 253 EDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTR-GLDYII 311
E +L +A+ QP+SV I+A G +YS GV+ G CGT L+H V VGYG+T G Y I
Sbjct: 266 EAALKRAVYKQPVSVLIDAGG--IGYYSEGVFTGPCGTSLNHAVLLVGYGATADGTKYWI 323
Query: 312 VKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
VKNSWG WGEKGY R+KR+ G GLCGI YPIK
Sbjct: 324 VKNSWGADWGEKGYFRLKRDVGTQGGLCGITMYPIYPIK 362
>gi|47230018|emb|CAG10432.1| unnamed protein product [Tetraodon nigroviridis]
Length = 294
Score = 241 bits (614), Expect = 5e-61, Method: Compositional matrix adjust.
Identities = 142/298 (47%), Positives = 178/298 (59%), Gaps = 18/298 (6%)
Query: 64 EKLERFEIFKDN----LRHIDETNRKIKNYWLGLNEFADLRHEEFKEMF----LGLKPDL 115
E+ R +I+ N L H ++ IK+Y LG+ +FAD+ +EE+K + LG
Sbjct: 2 EEAARRQIWLSNRKLVLVHNILADQGIKSYRLGMTQFADMDNEEYKRLISLGCLGAFNAS 61
Query: 116 ARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVT 175
A RK + F + LP +VDWR KG VT VK+Q CGSCWAFS ++EG N T
Sbjct: 62 APRKGSAF--FRLAEGTPLPTTVDWRDKGYVTGVKDQKQCGSCWAFSATGSLEGQNYRKT 119
Query: 176 GNLASLSEQELIDCDNTYNN-GCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTK 234
G L SLSEQ+L+DC Y N GC GGLMD AF+YI GG+ EE YPY E+G C K
Sbjct: 120 GKLVSLSEQQLVDCSGDYGNMGCGGGLMDSAFKYIQENGGIDTEESYPYEAEDGKCRF-K 178
Query: 235 GESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVAIEASGRDFQFYSGGVYDG-HCGTQ- 291
++ GY DV ED+L +A+A P+SVAI+AS FQ Y GVYD C ++
Sbjct: 179 PQNIGAKCTGYVDVTAGDEDALKEAVATIGPVSVAIDASHSSFQLYESGVYDELECSSED 238
Query: 292 LDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPI 349
LDHGV AVGYG+ G DY +VKNSWG WG+KGYI M RN CGI MASYP+
Sbjct: 239 LDHGVLAVGYGTDNGQDYWLVKNSWGLGWGQKGYIMMSRN---KHNQCGIASMASYPL 293
>gi|33242870|gb|AAQ01139.1| cathepsin [Branchiostoma lanceolatum]
Length = 334
Score = 241 bits (614), Expect = 5e-61, Method: Compositional matrix adjust.
Identities = 140/313 (44%), Positives = 179/313 (57%), Gaps = 14/313 (4%)
Query: 48 FESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRK----IKNYWLGLNEFADLRHEE 103
+E W + K YE+ E+ R IF+ N I E N + + +Y L +N+F D+ HEE
Sbjct: 24 WEMWKLQHGKQYETEAEEYSRRFIFEKNTVKIAEHNIRASLGMHSYTLAMNKFGDMHHEE 83
Query: 104 FKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFST 163
F + +G + ++ + D LPKSVDWR V+ VK+QG CGSCWAFST
Sbjct: 84 FHQRIMGGCLKIVKKPLLGSDVGDNDDNGTLPKSVDWRNSHMVSEVKDQGECGSCWAFST 143
Query: 164 VAAVEGINQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIVSTGGLHKEEDYP 222
++EG + TG L LSEQ+L+DC + N GC GGLMD AFQYI + GGL EE YP
Sbjct: 144 TGSLEGQHSNKTGKLVDLSEQQLVDCSKDFGNQGCGGGLMDQAFQYITANGGLDTEESYP 203
Query: 223 YIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVAIEASGRDFQFYSG 281
Y + S T+ GY DV +E +L +A+A P+SVAI+A FQFYS
Sbjct: 204 YTATDDEPCKFDNSSVGATLVGYKDVKSGNEHALKRAVATVGPVSVAIDAGHESFQFYSS 263
Query: 282 GVYD-GHCGT-QLDHGVAAVGYGSTRG---LDYIIVKNSWGPKWGEKGYIRMKRNTGKPE 336
GVYD C T QLDHGV AVGYG+ + IVKNSWGP WG++GYI M RN
Sbjct: 264 GVYDEPQCSTEQLDHGVLAVGYGAMNDNSHQAFWIVKNSWGPSWGDQGYIMMSRNKNNQ- 322
Query: 337 GLCGINKMASYPI 349
CGI ASYP+
Sbjct: 323 --CGIATSASYPL 333
>gi|302758762|ref|XP_002962804.1| hypothetical protein SELMODRAFT_78186 [Selaginella moellendorffii]
gi|300169665|gb|EFJ36267.1| hypothetical protein SELMODRAFT_78186 [Selaginella moellendorffii]
Length = 370
Score = 241 bits (614), Expect = 5e-61, Method: Compositional matrix adjust.
Identities = 146/371 (39%), Positives = 204/371 (54%), Gaps = 23/371 (6%)
Query: 1 MALSSQFKTI-LISFCISFFIRSS----FARDFSIVG----YSPEDLTSNDKLIDLFESW 51
MALS + + L++ C + + ++ RD G Y+PE+L + +F+ W
Sbjct: 1 MALSRRHVLLALLACCFTLVVVAAAFPHHGRDEDREGPNFWYAPEELDPDAGFKFMFDRW 60
Query: 52 MSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMFLGL 111
++ +VY E+ +FE+FK N+R + + R ++ YWLGL+ DL HEEFK
Sbjct: 61 RAEHSRVYAERAEEERKFELFKRNVRMLHDYYRNLRLYWLGLDHLPDLDHEEFKPRLPSR 120
Query: 112 KPDLARRKDQSHEDFSYKDVV----DLPKSVDWRKKGAVTHVKNQGSC-GSCWAFSTVAA 166
R H D + +P++VDWRK+GAVT VK+ G+C G WAF+T A
Sbjct: 121 VASPVLRTKVDHSDEPPEPRRPPFPHVPEAVDWRKEGAVTSVKDVGNCTGGGWAFATAGA 180
Query: 167 VEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIME 226
VEG+N+IVTGNL LS QELIDCD YN GC+ G +F YI TG L YPYI +
Sbjct: 181 VEGLNKIVTGNLVELSAQELIDCD-VYNGGCDYGFPQDSFVYIQKTG-LEASASYPYIGK 238
Query: 227 EGTCEMTKGESEVVTINGYHDVPQNS----EDSLLKALANQPLSVAIEASGRDFQFYSGG 282
TC + T+ G P S E+ L +A QP++ I+ S +DF Y+GG
Sbjct: 239 NSTCHIGVFIDGFDTLRGSLCAPGVSASDIEEELKMRVAQQPVTALIDGSSKDFVKYTGG 298
Query: 283 VYDGHCGTQLDHGVAAV---GYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLC 339
++ G C + D G+ AV GYGS G DY I+KNS G KWGE+GY++++R TG G C
Sbjct: 299 IFKGPCHSTGDTGLTAVLIVGYGSDNGDDYWILKNSRGTKWGEQGYMKIQRGTGLYGGRC 358
Query: 340 GINKMASYPIK 350
GIN +P K
Sbjct: 359 GINNYVFFPRK 369
>gi|161172356|pdb|3BCN|A Chain A, Crystal Structure Of A Papain-Like Cysteine Protease
Ervatamin-A Complexed With Irreversible Inhibitor E-64
gi|161172357|pdb|3BCN|B Chain B, Crystal Structure Of A Papain-Like Cysteine Protease
Ervatamin-A Complexed With Irreversible Inhibitor E-64
Length = 209
Score = 240 bits (613), Expect = 6e-61, Method: Compositional matrix adjust.
Identities = 122/217 (56%), Positives = 150/217 (69%), Gaps = 10/217 (4%)
Query: 134 LPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTY 193
LP+ VDWR KGAV +KNQG CGSCWAFSTV VE INQI TGNL SLSEQ+L+DC
Sbjct: 1 LPEHVDWRAKGAVIPLKNQGKCGSCWAFSTVTTVESINQIRTGNLISLSEQQLVDCSKK- 59
Query: 194 NNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSE 253
N+GC GG D A+QYI++ GG+ E +YPY +G C K +VV I+G VPQ +E
Sbjct: 60 NHGCKGGYFDRAYQYIIANGGIDTEANYPYKAFQGPCRAAK---KVVRIDGCKGVPQCNE 116
Query: 254 DSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVK 313
++L A+A+QP VAI+AS + FQ Y GG++ G CGT+L+HGV VGYG DY IV+
Sbjct: 117 NALKNAVASQPSVVAIDASSKQFQHYKGGIFTGPCGTKLNHGVVIVGYGK----DYWIVR 172
Query: 314 NSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
NSWG WGE+GY RMKR G GLCGI ++ YP K
Sbjct: 173 NSWGRHWGEQGYTRMKRVGGC--GLCGIARLPFYPTK 207
>gi|357130490|ref|XP_003566881.1| PREDICTED: actinidain-like [Brachypodium distachyon]
Length = 350
Score = 240 bits (613), Expect = 6e-61, Method: Compositional matrix adjust.
Identities = 130/293 (44%), Positives = 171/293 (58%), Gaps = 12/293 (4%)
Query: 49 ESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRK-IKNYWLGLNEFADLRHEEFKEM 107
E WM+KF +VY +EK R +F N R++D NR + Y LGLNEF+DL EF +
Sbjct: 41 EQWMAKFGRVYTDANEKARRQAVFGANARYVDAVNRAGNRTYTLGLNEFSDLTDNEFAKT 100
Query: 108 FLG---LKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTV 164
LG +P+ A D Y ++PKS DWR KGAVT VK+QG CG CWAF+ V
Sbjct: 101 HLGYREFRPETA--NISKGVDPGYGLAGNIPKSFDWRTKGAVTEVKSQGGCGCCWAFAAV 158
Query: 165 AAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYI 224
AA EG+ +I G L S+SEQ+++DC T NN C GG M+ A Y+ ++GGL EEDY Y
Sbjct: 159 AATEGLVKIAKGTLISMSEQQVLDC-TTGNNTCKGGYMNDALSYVFASGGLQTEEDYEYN 217
Query: 225 MEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKAL-ANQPLSVAIEASGRDFQFYSGGV 283
E+G C + ++ +P + + LL+ L A QP+ VA+EA G DF+ Y GGV
Sbjct: 218 AEKGACRRDVTPNPATSVGHAEYMPLDGNEFLLQKLVARQPVVVAVEAYGTDFKNYGGGV 277
Query: 284 YDG--HCGTQLDHGVAAVGYGSTRGLD--YIIVKNSWGPKWGEKGYIRMKRNT 332
+ G CG LDH VGYG G Y +VKN WG WGE GY+R+ R +
Sbjct: 278 FTGSPSCGQNLDHFFTVVGYGFADGGKQMYWLVKNQWGTSWGESGYMRIARGS 330
>gi|21489677|gb|AAM55195.1|AF412313_1 cathepsin L cysteine protease [Haemonchus contortus]
gi|21483192|gb|AAL14224.1| cathepsin L [Haemonchus contortus]
Length = 354
Score = 240 bits (613), Expect = 7e-61, Method: Compositional matrix adjust.
Identities = 139/318 (43%), Positives = 190/318 (59%), Gaps = 15/318 (4%)
Query: 42 DKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRK----IKNYWLGLNEFA 97
D+ + ++ + F K YE DE+ + E F N+ HI+E N++ K + +GLNE A
Sbjct: 41 DEAFNKWDDYKETFGKSYEP-DEENDYMEAFVKNVIHIEEHNKEHRLGRKTFEMGLNEIA 99
Query: 98 DLRHEEFKEMF-LGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCG 156
DL +++++ ++ + F V +P+SVDWR++G VT VKNQG CG
Sbjct: 100 DLPFSQYRKLNGYRMRRQFGDSLQSNGTKFLVPFNVQIPESVDWREEGLVTPVKNQGMCG 159
Query: 157 SCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIVSTGGL 215
SCWAFS+ A+EG + TG L SLSEQ L+DC Y N+GCNGGLMD AF+YI G+
Sbjct: 160 SCWAFSSTGALEGQHARATGKLVSLSEQNLVDCSTKYGNHGCNGGLMDLAFEYIKENHGV 219
Query: 216 HKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQ-PLSVAIEASGR 274
E+ YPY+ E C K + G+ D+P+ E++L KA+A Q P+S+AI+A R
Sbjct: 220 DTEDSYPYVGRETKCHF-KRNAVGADDKGFVDLPEGDEEALKKAVATQGPISIAIDAGHR 278
Query: 275 DFQFYSGGVY-DGHCGT-QLDHGVAAVGYGS-TRGLDYIIVKNSWGPKWGEKGYIRMKRN 331
FQ Y GVY D C + +LDHGV VGYG+ DY +VKNSWGP WGEKGYIR+ RN
Sbjct: 279 SFQLYKKGVYFDEECSSEELDHGVLLVGYGTDPEAGDYWLVKNSWGPTWGEKGYIRIARN 338
Query: 332 TGKPEGLCGINKMASYPI 349
CG+ ASYP+
Sbjct: 339 RNNH---CGVATKASYPL 353
>gi|238816977|gb|ACR56863.1| cathepsin L-like cysteine proteinase [Delia coarctata]
Length = 338
Score = 240 bits (613), Expect = 7e-61, Method: Compositional matrix adjust.
Identities = 141/317 (44%), Positives = 194/317 (61%), Gaps = 22/317 (6%)
Query: 49 ESWMS-KFEKVYESLDEKLERF--EIFKDNLRHIDETNRKIK----NYWLGLNEFADLRH 101
E W + K E L E ERF +IF +N I + N+ ++ LGLN++AD+ H
Sbjct: 25 EEWQTFKMEHRKNYLSEVEERFRMKIFNENRHKIAKHNQLYAQGKVSFKLGLNKYADMLH 84
Query: 102 EEFKEMFLGLKPDLARRKDQSHEDFS-----YKDVVDLPKSVDWRKKGAVTHVKNQGSCG 156
EFKE G + R++ ++ E F+ V +PK+VDWR+ GAVT VK+QG CG
Sbjct: 85 HEFKETMNGYNHTM-RKELRAQEGFNGITYISPANVQVPKAVDWRQHGAVTSVKDQGHCG 143
Query: 157 SCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIVSTGGL 215
SCW+FS+ ++EG + G L SLSEQ L+DC Y NNGCNGGLMD AF+YI GG+
Sbjct: 144 SCWSFSSTGSLEGQHFRKAGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGGV 203
Query: 216 HKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQ-PLSVAIEASGR 274
E+ YPY + +C K + T G+ D+PQ E++++KA+A P++VAI+AS
Sbjct: 204 DTEKSYPYEGIDDSCHFNKA-TVGATDTGFVDIPQGDEEAMMKAVATMGPVAVAIDASNE 262
Query: 275 DFQFYSGGVY-DGHCGT-QLDHGVAAVGYGSTR-GLDYIIVKNSWGPKWGEKGYIRMKRN 331
FQ YS GVY D +C + LDHGV VGYG+ + G DY +VKNSWG WG++GYI+M RN
Sbjct: 263 SFQLYSEGVYNDPNCSSDNLDHGVLVVGYGTDKDGQDYWLVKNSWGTTWGDQGYIKMARN 322
Query: 332 TGKPEGLCGINKMASYP 348
+ CGI +S+P
Sbjct: 323 ---QDNQCGIATASSFP 336
>gi|19698257|dbj|BAB86771.1| cathepsin L-like [Engraulis japonicus]
Length = 324
Score = 240 bits (612), Expect = 8e-61, Method: Compositional matrix adjust.
Identities = 139/310 (44%), Positives = 178/310 (57%), Gaps = 16/310 (5%)
Query: 48 FESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNR----KIKNYWLGLNEFADLRHEE 103
F W +KF K Y SL+E+ R ++ N + I N+ + +Y GLN+F+D+ HEE
Sbjct: 22 FNEWKAKFGKSYPSLEEEAHRKGLWLANHQKIQAHNQLADQGVHSYRQGLNQFSDMDHEE 81
Query: 104 FKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFST 163
F++ L K D + + E F +V L SVDWR G V+ +KNQG CGSCW+FS
Sbjct: 82 FRQTVL-TKMDPPKNNRGASEPFRAPNV-GLAASVDWRTSGCVSPIKNQGQCGSCWSFSA 139
Query: 164 VAAVEGINQIVTGNLASLSEQELIDCDNTYNN-GCNGGLMDYAFQYIVSTGGLHKEEDYP 222
A+E + G L SLSEQ+L+DC Y N GCNGG D+AFQY+ + GG+ E YP
Sbjct: 140 TGALESQTCLRRGYLPSLSEQQLVDCSGPYGNYGCNGGWPDHAFQYVQANGGIDSESYYP 199
Query: 223 YIMEEGTCEMTKGESEVVTINGYHDV-PQNSEDSLLKALAN-QPLSVAIEASGRDFQFYS 280
Y GTC S T +GY DV P SE +L +AN PLS+AI+ASG +Q Y
Sbjct: 200 YQARVGTCHYNSAYS-AATCSGYQDVTPVGSESALQYYVANVGPLSIAIDASG--WQSYQ 256
Query: 281 GGVY-DGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLC 339
GV+ D C DH V VGYG+ G DY +VKNSWG WGE+GYI M RN C
Sbjct: 257 SGVFNDPSCSQTADHAVLLVGYGTYNGQDYWLVKNSWGTWWGEQGYIMMARNANN---QC 313
Query: 340 GINKMASYPI 349
GI ASYP+
Sbjct: 314 GIANHASYPL 323
>gi|33242872|gb|AAQ01140.1| cathepsin [Branchiostoma lanceolatum]
Length = 334
Score = 240 bits (612), Expect = 8e-61, Method: Compositional matrix adjust.
Identities = 140/313 (44%), Positives = 179/313 (57%), Gaps = 14/313 (4%)
Query: 48 FESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRK----IKNYWLGLNEFADLRHEE 103
+E W + K YE+ E+ R IF+ N I E N + + +Y L +N+F D+ HEE
Sbjct: 24 WEMWKLQHGKQYETEAEEYSRRFIFEKNTVKIAEHNIRASLGMHSYTLAMNKFGDMHHEE 83
Query: 104 FKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFST 163
F + +G + ++ + D LPKSVDWR V+ VK+QG CGSCWAFST
Sbjct: 84 FHQRIMGGCLKIVKKPLLGSDVGDNDDNGTLPKSVDWRNSHMVSEVKDQGECGSCWAFST 143
Query: 164 VAAVEGINQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIVSTGGLHKEEDYP 222
++EG + TG L LSEQ+L+DC + N GC GGLMD AFQYI + GGL EE YP
Sbjct: 144 TGSLEGQHSSKTGKLVDLSEQQLVDCSKDFGNQGCGGGLMDQAFQYIKANGGLDTEESYP 203
Query: 223 YIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVAIEASGRDFQFYSG 281
Y + S T+ GY DV +E +L +A+A P+SVAI+A FQFYS
Sbjct: 204 YTATDDKPCKFDNSSVGATLVGYKDVKSGNEHALKRAVATVGPVSVAIDAGHESFQFYSS 263
Query: 282 GVYD-GHCGT-QLDHGVAAVGYGSTRG---LDYIIVKNSWGPKWGEKGYIRMKRNTGKPE 336
GVYD C T QLDHGV AVGYG+ + IVKNSWGP WG++GYI M RN
Sbjct: 264 GVYDEPQCSTEQLDHGVLAVGYGAMNDNSHQAFWIVKNSWGPSWGDQGYIMMSRNKNNQ- 322
Query: 337 GLCGINKMASYPI 349
CGI ASYP+
Sbjct: 323 --CGIATSASYPL 333
>gi|402770505|gb|AFQ98387.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 240 bits (612), Expect = 8e-61, Method: Compositional matrix adjust.
Identities = 144/321 (44%), Positives = 190/321 (59%), Gaps = 17/321 (5%)
Query: 39 TSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRK----IKNYWLGLN 94
+S + L +E++ + +K Y+S E+L RF+IF +N I + N K + +Y LG+N
Sbjct: 18 SSQEILRTQWEAFKTTHKKTYQSHMEELLRFKIFTENSLIIAKHNAKYAKGLVSYKLGMN 77
Query: 95 EFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVD--LPKSVDWRKKGAVTHVKNQ 152
+F DL EF +F G + RK +V D LPK+VDWRKKGAVT VK+Q
Sbjct: 78 QFGDLLAHEFARIFNGHR---GTRKTGGSTFLPPANVNDSSLPKAVDWRKKGAVTPVKDQ 134
Query: 153 GSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIVS 211
G CGSCWAFS ++EG + + G L SLSEQ L+DC ++ NNGC GGLM+ AF+YI +
Sbjct: 135 GQCGSCWAFSATGSLEGQHFLKNGELVSLSEQNLVDCSQSFGNNGCEGGLMEDAFKYIKA 194
Query: 212 TGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVAIE 270
G+ E+ YPY +G C K E T GY ++ SE L KA+A P+SVAI+
Sbjct: 195 NDGIDTEKSYPYEAVDGECRFKK-EDVGATDTGYVEIKAGSEVDLKKAVATVGPISVAID 253
Query: 271 ASGRDFQFYSGGVYD-GHCGTQ-LDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRM 328
AS FQ YS GVYD C ++ LDHGV VGYG G Y +VKNSW WG++GYI M
Sbjct: 254 ASHSSFQLYSEGVYDEPECSSEDLDHGVLVVGYGVKGGKKYWLVKNSWAESWGDQGYILM 313
Query: 329 KRNTGKPEGLCGINKMASYPI 349
R+ CGI ASYP+
Sbjct: 314 SRDNNNQ---CGIASQASYPL 331
>gi|158268255|gb|ABW25047.1| cathepsin L-like protease [Strongylus vulgaris]
Length = 354
Score = 240 bits (612), Expect = 8e-61, Method: Compositional matrix adjust.
Identities = 146/352 (41%), Positives = 199/352 (56%), Gaps = 18/352 (5%)
Query: 11 LISFCISFF--IRSSFARDFSIVGYSPEDLTSN-DKLIDLFESWMSKFEKVYESLDEKLE 67
L+ C S F I S RD +I + + L D+ L++ + F K Y DE+ +
Sbjct: 7 LVLLCASVFASIDSGSRRDHTIRLHRVKSLRQKIDEAFKLWDDYKEAFGKSYNK-DEEND 65
Query: 68 RFEIFKDNLRHIDETNRK----IKNYWLGLNEFADLRHEEFKEMF-LGLKPDLARRKDQS 122
E F N+ HIDE N++ K + +GLN ADL +++++ + + +
Sbjct: 66 YMEAFVKNVIHIDEHNQEHRLGRKTFEMGLNSIADLPFSQYRKLNGYRHRRNFGDSMQSN 125
Query: 123 HEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLS 182
+ V++P SVDWR KG VT VKNQG CGSCWAFS A+EG + +G + SLS
Sbjct: 126 GTKWLAPFNVEIPDSVDWRDKGLVTDVKNQGMCGSCWAFSATGALEGQHARASGKMVSLS 185
Query: 183 EQELIDCDNTY-NNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVT 241
EQ L+DC Y N+GCNGGLMD AF+YI G+ EE YPY+ E C K +
Sbjct: 186 EQNLVDCSTKYGNHGCNGGLMDLAFEYIKDNHGIDTEESYPYVGRETKCHFKKKDIGAED 245
Query: 242 INGYHDVPQNSEDSLLKALANQ-PLSVAIEASGRDFQFYSGGV-YDGHCGT-QLDHGVAA 298
G+ D+P+ E++L A+A Q P+S+AI+A R FQ Y GV YD C + +LDHGV
Sbjct: 246 -KGFVDLPEGDEEALKVAVATQGPISIAIDAGHRTFQLYKKGVYYDEECSSEELDHGVLL 304
Query: 299 VGYGS-TRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPI 349
VGYG+ DY ++KNSWGP WGEKGYIR+ RN CG+ ASYP+
Sbjct: 305 VGYGTDPEAGDYWLIKNSWGPGWGEKGYIRIARNRSNH---CGVATKASYPL 353
>gi|294885991|ref|XP_002771503.1| thiolproteinase SmTP1, putative [Perkinsus marinus ATCC 50983]
gi|239875207|gb|EER03319.1| thiolproteinase SmTP1, putative [Perkinsus marinus ATCC 50983]
Length = 337
Score = 240 bits (612), Expect = 8e-61, Method: Compositional matrix adjust.
Identities = 138/323 (42%), Positives = 188/323 (58%), Gaps = 17/323 (5%)
Query: 43 KLIDL------FESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEF 96
K +DL F + K K Y++ DE+++R IF DNL +I+E N + +Y LG+NE+
Sbjct: 16 KAVDLETSSLAFIGFQKKHGKSYDNKDEEMKRAAIFHDNLNYIEEVNAQNLSYKLGVNEY 75
Query: 97 ADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCG 156
DL EEF + L D++ + LP SVDWRKKG + VK+QG CG
Sbjct: 76 TDLTLEEFAALKLS-STDMSEGMGDGFVAGAGPTTTTLPTSVDWRKKGVLNPVKDQGYCG 134
Query: 157 SCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNN-GCNGGLMDYAFQYIVSTGGL 215
SCWAFS + A+E I TG L SLSEQ+L+DC Y N GCNGGLMD AF+YI +T G+
Sbjct: 135 SCWAFSAIGALEPRYAIATGKLLSLSEQQLVDCAGAYGNEGCNGGLMDKAFEYIKAT-GV 193
Query: 216 HKEEDYPYIMEEGTCEMT---KGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEAS 272
KE YPY+ + TC+ T K + V + + +E +L++ +A P+S+A+ A+
Sbjct: 194 DKESTYPYVGSDETCQATVENKTDGLPVGEVTGNQMLHQTEKALMEGVAAAPVSIAMYAN 253
Query: 273 GRDFQFYSGGVY-DGHC---GTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRM 328
+ FQ Y GVY D +C G +DHGV AVGYG+ G DY I++NSWG WG+ GY+ +
Sbjct: 254 LQSFQHYKSGVYSDPNCNAKGGSIDHGVVAVGYGTENGQDYFIIRNSWGRSWGQDGYVYL 313
Query: 329 KRNTGKPEGLCGINKMASYPIKK 351
KR G G C I K P K
Sbjct: 314 KRGVGS-FGQCNIYKYMCVPTLK 335
>gi|402770503|gb|AFQ98386.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 240 bits (612), Expect = 8e-61, Method: Compositional matrix adjust.
Identities = 145/321 (45%), Positives = 188/321 (58%), Gaps = 17/321 (5%)
Query: 39 TSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRK----IKNYWLGLN 94
+S + L +E++ + +K Y+S E+L RF+IF +N I + N K + +Y LG+N
Sbjct: 18 SSQEILRTQWEAFKTTHKKTYQSHMEELLRFKIFTENSLIIAKHNAKYAKGLVSYKLGMN 77
Query: 95 EFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVD--LPKSVDWRKKGAVTHVKNQ 152
+F DL EF +F G RK +V D LPK VDWRKKGAVT VK+Q
Sbjct: 78 QFGDLLAHEFARIFNGHH---GTRKTGGSTFLPPANVNDSSLPKVVDWRKKGAVTPVKDQ 134
Query: 153 GSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIVS 211
G CGSCWAFS ++EG + + G L SLSEQ L+DC ++ NNGC GGLM+ AF+YI
Sbjct: 135 GQCGSCWAFSATGSLEGRHFLKNGELVSLSEQNLVDCSQSFGNNGCEGGLMEDAFKYIKE 194
Query: 212 TGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVAIE 270
G+ E+ YPY +G C K E T GY ++ SED L KA+A P+SVAI+
Sbjct: 195 NDGIDTEKSYPYEAVDGECRFKK-EDVGATDTGYVEIKAGSEDDLKKAVATVGPISVAID 253
Query: 271 ASGRDFQFYSGGVYD-GHCGTQ-LDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRM 328
AS FQ YS GVYD C ++ LDHGV VGYG G Y +VKNSW WG++GYI M
Sbjct: 254 ASHSSFQLYSEGVYDEPECSSEDLDHGVLVVGYGVKGGKKYWLVKNSWAESWGDQGYILM 313
Query: 329 KRNTGKPEGLCGINKMASYPI 349
R+ CGI ASYP+
Sbjct: 314 SRDNNNQ---CGIASQASYPL 331
>gi|17062058|gb|AAL34984.1|AF320565_1 cathepsine L-like cysteine protease [Rhodnius prolixus]
Length = 316
Score = 240 bits (612), Expect = 9e-61, Method: Compositional matrix adjust.
Identities = 142/306 (46%), Positives = 183/306 (59%), Gaps = 25/306 (8%)
Query: 57 KVYESLDEKLERFEIFKDNLRHIDETNRKIK----NYWLGLNEFADLRHEEFKEMFLGLK 112
K Y + E++ R ++F DN + IDE N K + +Y + +N DL EFK + G K
Sbjct: 22 KNYRNQFEEIFRMKVFIDNKKKIDEHNAKYELGEASYKMKMNHLGDLMVHEFKALMNGFK 81
Query: 113 --PDLARRKD---QSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAV 167
P+ R S+E+ LPKSVDWR++GAVT VK+QG CGSCW+FS ++
Sbjct: 82 KTPNAERNGKIYVPSNEN--------LPKSVDWRQRGAVTPVKDQGHCGSCWSFSATGSL 133
Query: 168 EGINQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIME 226
EG + TG L SLSEQ L+DC TY N+GC GGLM+ AFQY+ G+ E YPY
Sbjct: 134 EGQLFLKTGRLVSLSEQNLVDCSKTYGNSGCEGGLMNQAFQYVRDNKGIDTEASYPYEAR 193
Query: 227 EGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVAIEASGRDFQFYSGGVY- 284
E C K + T GY D+ + SE L A+A P+SV I+AS FQFYS GVY
Sbjct: 194 ENNCRF-KEDKVGGTDKGYVDILEASEKDLQSAVATVGPISVRIDASHESFQFYSEGVYK 252
Query: 285 DGHCG-TQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINK 343
+ +C +QLDHGV VGYG+ G DY +VKNSWGP WGE GYI++ RN + CGI
Sbjct: 253 EQYCSPSQLDHGVLTVGYGTENGQDYWLVKNSWGPSWGESGYIKIARNH---KNHCGIAS 309
Query: 344 MASYPI 349
MASYP+
Sbjct: 310 MASYPV 315
>gi|443685370|gb|ELT89004.1| hypothetical protein CAPTEDRAFT_95613, partial [Capitella teleta]
Length = 295
Score = 240 bits (612), Expect = 9e-61, Method: Compositional matrix adjust.
Identities = 135/296 (45%), Positives = 181/296 (61%), Gaps = 14/296 (4%)
Query: 64 EKLERFEIFKDNLRHIDETN----RKIKNYWLGLNEFADLRHEEFKEMFLGLK-PDLARR 118
E+ +R E+F++N++ I N + + +G+N+F+D+ +EF + G + + +
Sbjct: 3 EENQRKEVFRNNIKKIQMHNYLHEQGKSPFTMGINQFSDMDEKEFSTIMNGFRMNNRTKV 62
Query: 119 KDQSHEDFSYKDV-VDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGN 177
+D H + + V +P VDWRKKG VT VKNQG CGSCWAFS + A+EG + TG
Sbjct: 63 RDHLHSHYISPAIPVSVPAEVDWRKKGYVTPVKNQGQCGSCWAFSAIGALEGQHFRKTGK 122
Query: 178 LASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGE 236
L SLSEQ L+DC +Y NNGCNGG+MDYAF+YI G E YPY +G C K E
Sbjct: 123 LVSLSEQNLVDCSKSYGNNGCNGGVMDYAFKYIKDNDGDDTEACYPYEAVDGMCRF-KRE 181
Query: 237 SEVVTINGYHDVPQNSEDSLLKALA-NQPLSVAIEASGRDFQFYSGGVY-DGHCGT-QLD 293
T GY D+P +E + +A+A P+SVAI+AS F Y GGVY + C QLD
Sbjct: 182 CVGATCRGYTDLPWGNEVKMKEAVALVGPVSVAIDASHSSFMSYKGGVYVEKECSPYQLD 241
Query: 294 HGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPI 349
HGV VGYG+ +GLDY +VKNSWG WG++GYI+M RN CGI MA YP+
Sbjct: 242 HGVLVVGYGTEQGLDYWLVKNSWGTTWGDQGYIKMARNM---HNHCGIASMACYPL 294
>gi|343978787|gb|AEM76722.1| cathepsin L-like proteinase [Triatoma brasiliensis]
Length = 330
Score = 239 bits (611), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 142/303 (46%), Positives = 180/303 (59%), Gaps = 19/303 (6%)
Query: 57 KVYESLDEKLERFEIFKDNLRHIDETNRKIK----NYWLGLNEFADLRHEEFKEMFLGLK 112
K Y++ E++ R +IF +N + I+ N K + +Y + +N F DL E K + G K
Sbjct: 36 KNYKNQFEEMFRRKIFMNNKKRIEAHNAKYEQGEVSYKMKMNHFGDLMSHEIKALMNGFK 95
Query: 113 --PDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGI 170
P+ R + F D LPKSVDWR+KGAVT VK+QG CGSCW+FS ++EG
Sbjct: 96 MTPNTKR---EGKIYFPSND--KLPKSVDWRQKGAVTPVKDQGQCGSCWSFSATGSLEGQ 150
Query: 171 NQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGT 229
+ G L SLSEQ L+DC Y NNGC GGLMD AFQY+ G+ E YPY +
Sbjct: 151 IFLKKGKLVSLSEQNLMDCSKEYGNNGCEGGLMDKAFQYVSDNKGIDTESSYPYEARDYA 210
Query: 230 CEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVAIEASGRDFQFYSGGVY-DGH 287
C K + T GY D+P+ E +L ALA P+SVAI+AS F FYS GVY + +
Sbjct: 211 CRFKK-DKVGGTDKGYVDIPEGDEKALQNALATVGPISVAIDASHESFHFYSEGVYNEPY 269
Query: 288 CGT-QLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMAS 346
C + LDHGV AVGYG+ G DY +VKNSWGP WGE GYI++ RN CGI MAS
Sbjct: 270 CSSYDLDHGVLAVGYGTENGQDYWLVKNSWGPSWGESGYIKIARNHSNH---CGIASMAS 326
Query: 347 YPI 349
YPI
Sbjct: 327 YPI 329
>gi|33242874|gb|AAQ01141.1| cathepsin [Branchiostoma lanceolatum]
Length = 334
Score = 239 bits (611), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 140/313 (44%), Positives = 179/313 (57%), Gaps = 14/313 (4%)
Query: 48 FESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRK----IKNYWLGLNEFADLRHEE 103
+E W + K YE+ E+ R IF+ N I E N + + +Y L +N+F D+ HEE
Sbjct: 24 WEMWKLQHGKQYETEAEEYSRRFIFEKNTVKIAEHNIRASLGMHSYTLAMNKFGDMHHEE 83
Query: 104 FKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFST 163
F + +G + ++ + D LPKSVDWR V+ VK+QG CGSCWAFST
Sbjct: 84 FHQRIMGGCLKIVKKPLLGSDVGDNDDNGTLPKSVDWRNSHMVSEVKDQGECGSCWAFST 143
Query: 164 VAAVEGINQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIVSTGGLHKEEDYP 222
++EG + TG L LSEQ+L+DC + N GC GGLMD AFQYI + GGL EE YP
Sbjct: 144 TGSLEGQHSNKTGKLVDLSEQQLVDCSKDFGNQGCGGGLMDQAFQYIKANGGLDTEESYP 203
Query: 223 YIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVAIEASGRDFQFYSG 281
Y + S T+ GY DV +E +L +A+A P+SVAI+A FQFYS
Sbjct: 204 YTATDDKPCKFDNSSVGATLVGYKDVKSGNEHALKRAVATVGPVSVAIDAGHESFQFYSS 263
Query: 282 GVYD-GHCGT-QLDHGVAAVGYGSTRG---LDYIIVKNSWGPKWGEKGYIRMKRNTGKPE 336
GVYD C T QLDHGV AVGYG+ + IVKNSWGP WG++GYI M RN
Sbjct: 264 GVYDEPQCSTEQLDHGVLAVGYGAMNDNSHQAFWIVKNSWGPSWGDQGYIMMSRNKNNQ- 322
Query: 337 GLCGINKMASYPI 349
CGI ASYP+
Sbjct: 323 --CGIATSASYPL 333
>gi|387015022|gb|AFJ49630.1| Cathepsin L1-like [Crotalus adamanteus]
Length = 338
Score = 239 bits (611), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 152/353 (43%), Positives = 198/353 (56%), Gaps = 30/353 (8%)
Query: 11 LISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFE 70
+I CI + SF F+ G P L D + SW S K Y +E R
Sbjct: 1 MIYLCI---LALSFGASFAAPGLDP-------ALNDHWLSWKSWHSKKYHEKEEGWRRM- 49
Query: 71 IFKDNLRHIDETNRK----IKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDF 126
I++ NL+ I+ N +Y LG+N F D+ +EEF+++ G K ++RK + + F
Sbjct: 50 IWEKNLKMIELHNLDHSLGKHSYRLGMNHFGDMTNEEFRQVMNGFKQSRSQRKYKGSQ-F 108
Query: 127 SYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQEL 186
+ + PKSVDWR+KG VT VK+QG CGSCWAFS A+EG + TG L SLSEQ L
Sbjct: 109 LEPNFLQAPKSVDWREKGYVTPVKDQGQCGSCWAFSATGALEGQHFRKTGKLVSLSEQNL 168
Query: 187 IDCDNTY-NNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGY 245
IDC N GCNGGLMD AFQYI G+ EE YPYI ++ + K E G+
Sbjct: 169 IDCSGPEGNQGCNGGLMDQAFQYIKDNNGIDSEESYPYIGKDDEDCLYKPEYNSANDTGF 228
Query: 246 HDVPQNSEDSLLKALAN-QPLSVAIEASGRDFQFYSGGV-YDGHCGT-QLDHGVAAVGYG 302
D+P+ E +L+KA+A P+SVAI+AS FQFY GV Y+ C + +LDHGV VGYG
Sbjct: 229 VDIPEGRERALMKAVAAVGPISVAIDASHTSFQFYESGVYYEPQCNSEELDHGVLVVGYG 288
Query: 303 STRGLD------YIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPI 349
G D Y IVKNSW KWG++GYI M ++ CGI ASYP+
Sbjct: 289 -YEGTDDDNKKRYWIVKNSWSEKWGDQGYIHMAKDRSNN---CGIASAASYPM 337
>gi|33242882|gb|AAQ01145.1| cathepsin [Branchiostoma lanceolatum]
Length = 334
Score = 239 bits (611), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 140/313 (44%), Positives = 178/313 (56%), Gaps = 14/313 (4%)
Query: 48 FESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRK----IKNYWLGLNEFADLRHEE 103
+E W + K YE+ E+ R IF+ N I E N + + +Y L +N+F D+ HEE
Sbjct: 24 WEMWKLQHGKQYETEAEEYSRRFIFEKNTVKIAEHNIRASLGMHSYTLAMNKFGDMHHEE 83
Query: 104 FKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFST 163
F + +G + ++ E D LPKSVDWR V+ VK+QG CG CWAFST
Sbjct: 84 FHQRIMGGCLKIVKKPLLGSEVGDSDDNGTLPKSVDWRNSHMVSEVKDQGECGPCWAFST 143
Query: 164 VAAVEGINQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIVSTGGLHKEEDYP 222
++EG + TG L LSEQ+L+DC + N GC GGLMD AFQYI + GGL EE YP
Sbjct: 144 TGSLEGQHSNKTGKLVDLSEQQLVDCSKDFGNQGCGGGLMDQAFQYIPANGGLDTEESYP 203
Query: 223 YIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVAIEASGRDFQFYSG 281
Y + S T+ GY DV +E +L +A+A P+SVAI+A FQFYS
Sbjct: 204 YTATDDKPCKFDNSSVGATLVGYKDVKSGNEHALKRAVATVGPVSVAIDAGHESFQFYSS 263
Query: 282 GVYD-GHCGT-QLDHGVAAVGYGSTRG---LDYIIVKNSWGPKWGEKGYIRMKRNTGKPE 336
GVYD C T QLDHGV AVGYG+ + IVKNSWGP WG++GYI M RN
Sbjct: 264 GVYDEPQCSTEQLDHGVLAVGYGAMNDNSHQAFWIVKNSWGPSWGDQGYIMMSRNKNNQ- 322
Query: 337 GLCGINKMASYPI 349
CGI ASYP+
Sbjct: 323 --CGIATSASYPL 333
>gi|403333364|gb|EJY65772.1| Cathepsin L [Oxytricha trifallax]
Length = 338
Score = 239 bits (611), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 141/325 (43%), Positives = 193/325 (59%), Gaps = 24/325 (7%)
Query: 33 YSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETN-RKIKNYWL 91
Y+P +T+ D F ++++K+ K Y + +E R ++FK NL + N R Y L
Sbjct: 33 YTP--ITAEDHA---FTNFVAKYGKSYGTKEEYDFRSKLFKQNLAKVSMNNARNDVTYRL 87
Query: 92 GLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKS--VDWRKKGAVTHV 149
GLN+FAD E+K + +K+++ + V+ PK+ V+W ++GAVT V
Sbjct: 88 GLNKFADYTEAEYKRLL-----GFGGQKNKNPRNIK---VLGAPKNDGVNWVEQGAVTPV 139
Query: 150 KNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNN-GCNGGLMDYAFQY 208
K+QG CGSCW+FS A+EG +I G L SLSEQ+L+DC N GC GG MD AFQY
Sbjct: 140 KDQGQCGSCWSFSATGAMEGHAKIQFGTLYSLSEQQLVDCSQAEGNEGCGGGWMDQAFQY 199
Query: 209 IVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVA 268
+ T L E+ YPY + TC + + VV ++ + DV N+ + L AL P+SVA
Sbjct: 200 VEQTA-LETEDQYPYEAVDDTCRASS--AGVVKVDSFVDVTPNNVNELKAALDKGPVSVA 256
Query: 269 IEASGRDFQFYSGGVY-DGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIR 327
IEA FQFYSGGV D CGT LDHGV AVGYG+ G DY +VKNSWG WGE+GY++
Sbjct: 257 IEADQMVFQFYSGGVINDASCGTTLDHGVLAVGYGNESGQDYFLVKNSWGASWGEEGYVK 316
Query: 328 MKRNTGKPEGLCGINKMASYPIKKK 352
+ P+ +CGI ASYPI K+
Sbjct: 317 I---AASPDNICGILSQASYPIMKQ 338
>gi|33242880|gb|AAQ01144.1| cathepsin [Branchiostoma lanceolatum]
Length = 334
Score = 239 bits (611), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 140/313 (44%), Positives = 179/313 (57%), Gaps = 14/313 (4%)
Query: 48 FESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRK----IKNYWLGLNEFADLRHEE 103
+E W + K YE+ E+ R IF+ N I E N + + +Y L +N+F D+ HEE
Sbjct: 24 WEMWKLQHGKQYETEAEEYSRRFIFEKNTVKIAEHNIRASLGMHSYTLAMNKFGDMHHEE 83
Query: 104 FKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFST 163
F + +G + ++ E D LPKSVDWR V+ VK+QG CGSCWAFST
Sbjct: 84 FHQRIMGGCLKIVKKPLLGSEVGDNDDNGTLPKSVDWRNSHMVSEVKDQGECGSCWAFST 143
Query: 164 VAAVEGINQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIVSTGGLHKEEDYP 222
++EG + TG L LSEQ+L+DC + N GC GGLMD AFQYI + GGL EE YP
Sbjct: 144 TGSLEGQHSNKTGKLVDLSEQQLVDCSKDFGNQGCGGGLMDQAFQYIKANGGLDTEESYP 203
Query: 223 YIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVAIEASGRDFQFYSG 281
Y + S T+ GY DV ++E +L +A+A P+SVAI+A FQFYS
Sbjct: 204 YTATDDKPCKFDNSSVGATLIGYKDVKSSNEHALKRAVATVGPVSVAIDAGHESFQFYSS 263
Query: 282 GVYD-GHCGT-QLDHGVAAVGYGSTRG---LDYIIVKNSWGPKWGEKGYIRMKRNTGKPE 336
GVYD C T QLDHGV VGYG+ + IVKNSWGP WG++GYI M RN
Sbjct: 264 GVYDEPQCSTEQLDHGVLVVGYGAMNDNSHQAFWIVKNSWGPNWGDQGYIMMSRNKNNQ- 322
Query: 337 GLCGINKMASYPI 349
CGI ASYP+
Sbjct: 323 --CGIATSASYPL 333
>gi|391338876|ref|XP_003743781.1| PREDICTED: cathepsin L-like isoform 4 [Metaseiulus occidentalis]
Length = 336
Score = 239 bits (610), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 149/332 (44%), Positives = 186/332 (56%), Gaps = 22/332 (6%)
Query: 30 IVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIK-- 87
+VG + LT ++++ K YE + R +IF N I N K
Sbjct: 14 LVGAASAALTLEQLFDAEWQNFKVHHNKKYEGSTVEAFRKKIFLQNTHLIARHNIKHAKG 73
Query: 88 --NYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGA 145
Y L +N+F D+ H EF GL L + + + V LPKSVDWR+KGA
Sbjct: 74 ETTYKLKMNQFGDMLHHEFVSTMNGL---LRSNRTYFGSTWIEPESVSLPKSVDWREKGA 130
Query: 146 VTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDY 204
VT VKNQG CGSCW+FST A+EG TG L SLSEQ LIDC +Y NNGC GGLMD
Sbjct: 131 VTPVKNQGHCGSCWSFSTTGALEGQLFRKTGELVSLSEQNLIDCSTSYGNNGCGGGLMDN 190
Query: 205 AFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-Q 263
AF YI G+ EE YPY ++G C K E G+ D+P +E +L KALA
Sbjct: 191 AFTYIKENHGIDTEESYPYEGKQGKCRYHK-EDSAGRDTGFVDIPSGNERALAKALATIG 249
Query: 264 PLSVAIEASGRDFQFYSGGVY-----DGHCGTQLDHGVAAVGYGST-RGLDYIIVKNSWG 317
P+SVAI+AS FQFY GVY D H LDHGV AVGYG+T G DY I+KNSWG
Sbjct: 250 PVSVAIDASHESFQFYHEGVYNPPDCDSH---SLDHGVLAVGYGTTDDGQDYYIIKNSWG 306
Query: 318 PKWGEKGYIRMKRNTGKPEGLCGINKMASYPI 349
+WG++GY+ M RN+ + CG+ ASYP+
Sbjct: 307 ERWGQEGYVLMARNS---KNECGVATQASYPL 335
>gi|391338870|ref|XP_003743778.1| PREDICTED: cathepsin L-like isoform 1 [Metaseiulus occidentalis]
gi|391338872|ref|XP_003743779.1| PREDICTED: cathepsin L-like isoform 2 [Metaseiulus occidentalis]
gi|391338874|ref|XP_003743780.1| PREDICTED: cathepsin L-like isoform 3 [Metaseiulus occidentalis]
Length = 331
Score = 239 bits (610), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 149/332 (44%), Positives = 186/332 (56%), Gaps = 22/332 (6%)
Query: 30 IVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIK-- 87
+VG + LT ++++ K YE + R +IF N I N K
Sbjct: 9 LVGAASAALTLEQLFDAEWQNFKVHHNKKYEGSTVEAFRKKIFLQNTHLIARHNIKHAKG 68
Query: 88 --NYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGA 145
Y L +N+F D+ H EF GL L + + + V LPKSVDWR+KGA
Sbjct: 69 ETTYKLKMNQFGDMLHHEFVSTMNGL---LRSNRTYFGSTWIEPESVSLPKSVDWREKGA 125
Query: 146 VTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDY 204
VT VKNQG CGSCW+FST A+EG TG L SLSEQ LIDC +Y NNGC GGLMD
Sbjct: 126 VTPVKNQGHCGSCWSFSTTGALEGQLFRKTGELVSLSEQNLIDCSTSYGNNGCGGGLMDN 185
Query: 205 AFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-Q 263
AF YI G+ EE YPY ++G C K E G+ D+P +E +L KALA
Sbjct: 186 AFTYIKENHGIDTEESYPYEGKQGKCRYHK-EDSAGRDTGFVDIPSGNERALAKALATIG 244
Query: 264 PLSVAIEASGRDFQFYSGGVY-----DGHCGTQLDHGVAAVGYGST-RGLDYIIVKNSWG 317
P+SVAI+AS FQFY GVY D H LDHGV AVGYG+T G DY I+KNSWG
Sbjct: 245 PVSVAIDASHESFQFYHEGVYNPPDCDSH---SLDHGVLAVGYGTTDDGQDYYIIKNSWG 301
Query: 318 PKWGEKGYIRMKRNTGKPEGLCGINKMASYPI 349
+WG++GY+ M RN+ + CG+ ASYP+
Sbjct: 302 ERWGQEGYVLMARNS---KNECGVATQASYPL 330
>gi|21483184|gb|AAF86584.1| cathepsin L cysteine protease [Haemonchus contortus]
Length = 355
Score = 239 bits (610), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 138/318 (43%), Positives = 190/318 (59%), Gaps = 15/318 (4%)
Query: 42 DKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRK----IKNYWLGLNEFA 97
D+ + ++ + F K YE +E+ + E F N+ HI+E N++ K + +GLNE A
Sbjct: 42 DEAFNKWDDYKETFGKSYEP-EEENDYMEAFVKNVIHIEEHNKEHRLGRKTFEMGLNEIA 100
Query: 98 DLRHEEFKEMF-LGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCG 156
DL +++++ ++ + F V +P+SVDWR++G VT VKNQG CG
Sbjct: 101 DLPFSQYRKLNGYRMRRQFGDSMQSNGTKFLVPFNVQIPESVDWREEGLVTPVKNQGMCG 160
Query: 157 SCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIVSTGGL 215
SCWAFS+ A+EG + TG L SLSEQ L+DC Y N+GCNGGLMD AF+YI G+
Sbjct: 161 SCWAFSSTGALEGQHARATGKLVSLSEQNLVDCSTKYGNHGCNGGLMDLAFEYIKENHGV 220
Query: 216 HKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQ-PLSVAIEASGR 274
E+ YPY+ E C K + G+ D+P+ E++L KA+A Q P+S+AI+A R
Sbjct: 221 DTEDSYPYVGRETKCHF-KRNTVGADDKGFVDLPEGDEEALKKAVATQGPISIAIDAGHR 279
Query: 275 DFQFYSGGVY-DGHCGT-QLDHGVAAVGYGS-TRGLDYIIVKNSWGPKWGEKGYIRMKRN 331
FQ Y GVY D C + +LDHGV VGYG+ DY +VKNSWGP WGEKGYIR+ RN
Sbjct: 280 SFQLYKKGVYFDEECSSEELDHGVLLVGYGTDPEAGDYWLVKNSWGPTWGEKGYIRIARN 339
Query: 332 TGKPEGLCGINKMASYPI 349
CG+ ASYP+
Sbjct: 340 RNNH---CGVATKASYPL 354
>gi|356517398|ref|XP_003527374.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 333
Score = 239 bits (610), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 141/308 (45%), Positives = 181/308 (58%), Gaps = 24/308 (7%)
Query: 49 ESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKI-KNYWLGLNEFADLRHEEFKEM 107
E M+++ KVY+ E F N+ +I+ N K Y G+N+F +
Sbjct: 40 EQRMTRYSKVYKDPPES------FXGNVNYIEACNNAADKPYKXGINQFPP------RNR 87
Query: 108 FLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTH--VKNQGSCGSCWAFSTVA 165
F G R F +++V P +VD R+KGAVT VK+QG CG WA S VA
Sbjct: 88 FKGHMCSSIIRITT----FKFENVTATPSTVDCRQKGAVTPYTVKDQGQCGCFWALSAVA 143
Query: 166 AVEGINQIVTGNLASLS-EQELIDCDNT-YNNGCNGGLMDYAFQYIVSTGGLHKEEDYPY 223
A EGI+ + G L LS E EL+DCD + GC GGL D AF++I+ GL+ E +YPY
Sbjct: 144 ATEGIHALXAGKLILLSXEPELVDCDTKGVDQGCEGGLTDDAFKFIIQNHGLNTEANYPY 203
Query: 224 IMEEGTCEMTKGESEVVTI-NGYHDVPQNSEDS-LLKALANQPLSVAIEASGRDFQFYSG 281
+G C + + TI GY DVP N+E + L KA+AN P+SVAI+ASG DFQFY
Sbjct: 204 KGVDGKCNANEADKNAATIITGYDDVPANNEKAHLQKAVANNPVSVAIDASGSDFQFYKS 263
Query: 282 GVYDGHCGTQLDHGVAAVGYG-STRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCG 340
GV+ G CGT+LDHGV AVGYG S G +Y +VKNS GP+WGE+GYIRM+R E LCG
Sbjct: 264 GVFTGSCGTELDHGVTAVGYGVSDDGTEYWLVKNSRGPEWGEEGYIRMQRGVDSEEALCG 323
Query: 341 INKMASYP 348
I ASYP
Sbjct: 324 IAVQASYP 331
>gi|22653679|sp|Q26636.1|CATL_SARPE RecName: Full=Cathepsin L; Contains: RecName: Full=Cathepsin L
heavy chain; Contains: RecName: Full=Cathepsin L light
chain; Flags: Precursor
gi|505140|dbj|BAA03970.1| cathepsin L precursor [Sarcophaga peregrina]
Length = 339
Score = 239 bits (610), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 145/328 (44%), Positives = 191/328 (58%), Gaps = 22/328 (6%)
Query: 34 SPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIK----NY 89
SP DL + + ++ + K Y + E+ R +IF +N I + N+ +Y
Sbjct: 19 SPLDLIKEE-----WHTYKLQHRKNYANEVEERFRMKIFNENRHKIAKHNQLFAQGKVSY 73
Query: 90 WLGLNEFADLRHEEFKEMFLG----LKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGA 145
LGLN++AD+ H EFKE G L+ + R + V +PKSVDWR+ GA
Sbjct: 74 KLGLNKYADMLHHEFKETMNGYNHTLRQLMRERTGLVGATYIPPAHVTVPKSVDWREHGA 133
Query: 146 VTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDY 204
VT VK+QG CGSCWAFS+ A+EG + G L SLSEQ L+DC Y NNGCNGGLMD
Sbjct: 134 VTGVKDQGHCGSCWAFSSTGALEGQHFRKAGVLVSLSEQNLVDCSTKYGNNGCNGGLMDN 193
Query: 205 AFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQ- 263
AF+YI GG+ E+ YPY + +C K + T G+ D+P+ E+ + KA+A
Sbjct: 194 AFRYIKDNGGIDTEKSYPYEGIDDSCHFNKA-TIGATDTGFVDIPEGDEEKMKKAVATMG 252
Query: 264 PLSVAIEASGRDFQFYSGGVY-DGHCGTQ-LDHGVAAVGYGSTR-GLDYIIVKNSWGPKW 320
P+SVAI+AS FQ YS GVY + C Q LDHGV VGYG+ G+DY +VKNSWG W
Sbjct: 253 PVSVAIDASHESFQLYSEGVYNEPECDEQNLDHGVLVVGYGTDESGMDYWLVKNSWGTTW 312
Query: 321 GEKGYIRMKRNTGKPEGLCGINKMASYP 348
GE+GYI+M RN CGI +SYP
Sbjct: 313 GEQGYIKMARNQNNQ---CGIATASSYP 337
>gi|326431661|gb|EGD77231.1| cysteine protease [Salpingoeca sp. ATCC 50818]
Length = 347
Score = 239 bits (609), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 134/312 (42%), Positives = 173/312 (55%), Gaps = 18/312 (5%)
Query: 48 FESWMSKFEKVYESLDEKLERFEIFKDNL----RHIDETNRKIKNYWLGLNEFADLRHEE 103
FE + K+ KVYES +E+ R IF+++L +H E + Y +G+NEFADL EE
Sbjct: 31 FEEFKDKYNKVYESAEEEARRAAIFQESLDFIEKHNAEAAAGMHTYLVGVNEFADLTREE 90
Query: 104 FKEMFLGLKPDLARRKDQS----HEDFSYKDVVDL---PKSVDWRKKGAVTHVKNQGSCG 156
F++ + P ++D H D D +DWRK+GAVT V+NQG CG
Sbjct: 91 FRQHHVTRLPFDDDKRDPVTATLHLDEHAVHAADSNGDSSGIDWRKRGAVTPVRNQGQCG 150
Query: 157 SCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLH 216
+ F+ V AVEG++ I +GNL LS Q++IDC T GC+GG + F+YI GGL
Sbjct: 151 NPAIFAAVEAVEGMHAISSGNLVELSTQQVIDCSGT--PGCSGGSLVSFFKYIARNGGLD 208
Query: 217 KEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDF 276
DYP G C K V + GY VP +E L A+ P++VAIEA F
Sbjct: 209 SAADYPTSGAGGQCNKAKEARHVAKVGGYSVVPPRNETKLAAAVFKMPVAVAIEADTPSF 268
Query: 277 QFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPE 336
Q Y+ GVY G CGTQLDH V VGY +Y IVKNSWG WG++GYI MKR G
Sbjct: 269 QMYTSGVYSGPCGTQLDHAVLVVGYTD----EYWIVKNSWGASWGDQGYIMMKRGVGA-A 323
Query: 337 GLCGINKMASYP 348
G+CGI A YP
Sbjct: 324 GICGITLDAMYP 335
>gi|388509526|gb|AFK42829.1| unknown [Lotus japonicus]
Length = 333
Score = 239 bits (609), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 138/318 (43%), Positives = 175/318 (55%), Gaps = 22/318 (6%)
Query: 45 IDLFESWM---SKFEKVYESLDEKLERFE-------IFKDNLRHIDETNRKIKNYWLGLN 94
++L W + F K Y + +E R I + NL H + + Y LGLN
Sbjct: 22 VELDSHWALFKTTFGKQYSTAEEITRRLAWEANVAIIRQHNLEH----DLGLHTYTLGLN 77
Query: 95 EFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGS 154
+ADL + EF ++ GL+ + ++ K + + V+LP SVDWR KG VT +K+QG
Sbjct: 78 NYADLTNAEFNQVMNGLRVNASQTKSANRRTYVAPVGVELPTSVDWRTKGYVTPIKDQGQ 137
Query: 155 CGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNN-GCNGGLMDYAFQYIVSTG 213
CGSCWAFS+ ++EG + TG L SLSEQ L DC N GCNGGLMD AF YI
Sbjct: 138 CGSCWAFSSTGSLEGQHFAKTGQLVSLSEQNLTDCSQKQGNMGCNGGLMDQAFTYIKENN 197
Query: 214 GLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVAIEAS 272
G+ E YPY + C K T GY D+ Q E++L A+A P+SVAI+AS
Sbjct: 198 GIDTESSYPYKAVDEKCHF-KAADVGATDTGYTDIAQQDENALQSAIATVGPISVAIDAS 256
Query: 273 GRDFQFYSGGVYDGHC--GTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKR 330
FQ Y G Y+ TQLDHGV AVGY S G DY IVKNSWG WG+KGYI M R
Sbjct: 257 HSSFQLYRSGAYNERACSATQLDHGVLAVGYDSEDGKDYYIVKNSWGTSWGQKGYIWMTR 316
Query: 331 NTGKPEGLCGINKMASYP 348
N CGI M++YP
Sbjct: 317 NKNNQ---CGIATMSTYP 331
>gi|413953046|gb|AFW85695.1| thiol protease SEN102 [Zea mays]
Length = 382
Score = 239 bits (609), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 145/343 (42%), Positives = 196/343 (57%), Gaps = 18/343 (5%)
Query: 24 FARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETN 83
FA + G + D T L++ F++W +++ + Y + +E +RF I+ +N+R I N
Sbjct: 40 FACSLLLAGTAFSDDTIAIPLLERFKAWQAEYNRTYATPEEFQQRFMIYSENVRFIKTMN 99
Query: 84 R--KIKNYWLGLNEFADLRHEEFKE---MFLGLKPDLARRKDQSHEDFSY------KDVV 132
+ +Y LG N+F DL EEFK+ M L +P A + S +
Sbjct: 100 QLSTGSSYELGENQFTDLTEEEFKDTYLMKLDEQPPAAEAMPPTVGTMSTAGMSNGNNTG 159
Query: 133 DLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNT 192
+ P SVDWR KGAVT VK+Q CGSCWAF+TVA++EG++QI TG L SLSEQE++DCD
Sbjct: 160 EAPNSVDWRTKGAVTRVKDQQQCGSCWAFATVASIEGVHQIKTGRLVSLSEQEIVDCDRG 219
Query: 193 YN-NGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQN 251
N NGC GG A +++ GGL E DYPY+ + C K I GY V +N
Sbjct: 220 GNDNGCRGGSPRSAMEWVTRNGGLTTESDYPYVGSQRQCMSGKLGHHAARIRGYQAVQRN 279
Query: 252 SEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHC-GTQLDHGVAAVGYGST----RG 306
+E L +A+A QP++V ++AS R FQFY GV+ G C T ++H V VGYGST G
Sbjct: 280 NEAELERAVAGQPVAVFVDAS-RAFQFYKSGVFSGPCDTTTVNHVVTVVGYGSTGSDSGG 338
Query: 307 LDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPI 349
Y IVKNSWG WGE GY+RM R EG+C I YP+
Sbjct: 339 RKYWIVKNSWGQGWGENGYVRMARRVRAREGMCAIAIEPYYPV 381
>gi|403368476|gb|EJY84073.1| Cathepsin L [Oxytricha trifallax]
Length = 338
Score = 239 bits (609), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 137/310 (44%), Positives = 186/310 (60%), Gaps = 19/310 (6%)
Query: 48 FESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETN-RKIKNYWLGLNEFADLRHEEFKE 106
F ++++K+ K Y + +E R ++FK NL + N R Y LGLN+FAD E+K
Sbjct: 43 FTNFVAKYGKSYGTKEEYDFRSKLFKQNLAKVSMNNVRNDVTYRLGLNKFADYTEAEYKR 102
Query: 107 MFLGLKPDLARRKDQSHEDFSYKDVVDLPKS--VDWRKKGAVTHVKNQGSCGSCWAFSTV 164
+ +K+++ + V+ PK+ V+W ++GAVT VK+QG CGSCW+FS
Sbjct: 103 LL-----GFGGQKNKNPRNIK---VLGAPKNDGVNWVEQGAVTPVKDQGQCGSCWSFSAT 154
Query: 165 AAVEGINQIVTGNLASLSEQELIDCDNTYNN-GCNGGLMDYAFQYIVSTGGLHKEEDYPY 223
A+EG +I G L SLSEQ+L+DC N GC GG MD AFQY+ T L E+ YPY
Sbjct: 155 GAMEGHAKIQFGTLYSLSEQQLVDCSQAEGNEGCGGGWMDQAFQYVEQTA-LETEDQYPY 213
Query: 224 IMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGV 283
+ TC + + VV ++ + DV N+ + L AL P+SVAIEA FQFYSGGV
Sbjct: 214 EAVDDTCRASS--AGVVKVDSFVDVTPNNVNELKAALDKGPVSVAIEADQMVFQFYSGGV 271
Query: 284 Y-DGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGIN 342
D CGT LDHGV AVGYG+ G DY +VKNSWG WGE+GY+++ P+ +CGI
Sbjct: 272 INDASCGTTLDHGVLAVGYGNESGQDYFLVKNSWGASWGEEGYVKI---AASPDNICGIL 328
Query: 343 KMASYPIKKK 352
ASYPI K+
Sbjct: 329 SQASYPIMKQ 338
>gi|55740406|gb|AAV63979.1| cathepsin L1 precursor [Artemia parthenogenetica]
Length = 338
Score = 239 bits (609), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 141/323 (43%), Positives = 192/323 (59%), Gaps = 17/323 (5%)
Query: 38 LTSNDKLIDLFESWMSKFEKVYES-LDEKLERFEIFKDNLRHIDETN----RKIKNYWLG 92
L+ + L D + + + +K Y S L+EKL R +I+ +N + + N + K+Y +
Sbjct: 21 LSLTNLLADEWHLFKATHKKEYPSQLEEKL-RMKIYLENKHKVAKHNILYEKGEKSYQVA 79
Query: 93 LNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDV--VDLPKSVDWRKKGAVTHVK 150
+N+F DL H EF+ + G + + ++ F++ + V++P+SVDWR+KGA+T VK
Sbjct: 80 MNKFGDLLHHEFRSIMNGYQHK-KQNSSRAESTFTFMEPANVEVPESVDWREKGAITPVK 138
Query: 151 NQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYI 209
+QG CGSCWAFS+ A+EG TG L SLSEQ LIDC Y N GCNGGLMD AFQYI
Sbjct: 139 DQGQCGSCWAFSSTGALEGQTFRKTGKLVSLSEQNLIDCSGKYGNEGCNGGLMDQAFQYI 198
Query: 210 VSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVA 268
G+ E YPY E+G C V G+ D+P ED L A+A P+SVA
Sbjct: 199 KDNKGIDTENTYPYEAEDGVCRYNPRNRGAVD-RGFVDIPSGEEDKLKAAVATVGPVSVA 257
Query: 269 IEASGRDFQFYS-GGVYDGHCGT-QLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYI 326
I+AS FQFYS G Y+ C + LDHGV VGYGS G DY +VKNSW WG++GYI
Sbjct: 258 IDASHESFQFYSKGXYYEPSCDSDDLDHGVLVVGYGSDNGEDYWLVKNSWSEHWGDEGYI 317
Query: 327 RMKRNTGKPEGLCGINKMASYPI 349
++ RN + CG+ ASYP+
Sbjct: 318 KIARNR---KNHCGVATAASYPL 337
>gi|1311024|pdb|1GEC|E Chain E, Glycyl Endopeptidase-complex With
Benzyloxycarbonyl-leucine-valine- Glycine-methylene
Covalently Bound To Cysteine 25
Length = 216
Score = 239 bits (609), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 122/218 (55%), Positives = 155/218 (71%), Gaps = 4/218 (1%)
Query: 134 LPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCD-NT 192
LP+SVDWR KGAVT VK+QG C SCWAFSTVA VEGIN+I TGNL LSEQEL+DCD +
Sbjct: 1 LPESVDWRAKGAVTPVKHQGYCESCWAFSTVATVEGINKIKTGNLVELSEQELVDCDLQS 60
Query: 193 YNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNS 252
Y GCN G + QY V+ G+H YPYI ++ TC + V NG V N+
Sbjct: 61 Y--GCNRGYQSTSLQY-VAQNGIHLRAKYPYIAKQQTCRANQVGGPKVKTNGVGRVQSNN 117
Query: 253 EDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIV 312
E SLL A+A+QP+SV +E++GRDFQ Y GG+++G CGT++DH V AVGYG + G YI++
Sbjct: 118 EGSLLNAIAHQPVSVVVESAGRDFQNYKGGIFEGSCGTKVDHAVTAVGYGKSGGKGYILI 177
Query: 313 KNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
KNSWGP WGE GYIR++R +G G+CG+ + + YPIK
Sbjct: 178 KNSWGPGWGENGYIRIRRASGNSPGVCGVYRSSYYPIK 215
>gi|195153545|ref|XP_002017686.1| GL17172 [Drosophila persimilis]
gi|194113482|gb|EDW35525.1| GL17172 [Drosophila persimilis]
Length = 341
Score = 238 bits (608), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 143/318 (44%), Positives = 188/318 (59%), Gaps = 21/318 (6%)
Query: 49 ESWMS-KFEKVYESLDEKLERF--EIFKDNLRHIDETNR----KIKNYWLGLNEFADLRH 101
E W + K E DE ERF +IF +N I + N+ ++ + +N++AD+ H
Sbjct: 27 EEWHTFKLEHRKNYQDETEERFRLKIFNENKHKIAKHNQLWATGAVSFKMAVNKYADMLH 86
Query: 102 EEFKEMFLGLKPDLARRKDQSHEDFS-----YKDVVDLPKSVDWRKKGAVTHVKNQGSCG 156
EF G L ++ + E F + V LPK VDWR KGAVT VK+QG CG
Sbjct: 87 HEFYSTMNGFNYTLHKQLRNADESFKGVTFISPEHVTLPKQVDWRTKGAVTDVKDQGHCG 146
Query: 157 SCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIVSTGGL 215
SCWAFS+ A+EG + +G L SLSEQ L+DC Y NNGCNGGLMD AF+YI GG+
Sbjct: 147 SCWAFSSTGALEGQHYRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGGI 206
Query: 216 HKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVAIEASGR 274
E+ YPY + +C KG S T G+ D+PQ +E + +A+A P++VAI+AS
Sbjct: 207 DTEKSYPYEAIDDSCHFNKG-SIGATDRGFVDIPQGNEKKMAEAVATIGPVAVAIDASHE 265
Query: 275 DFQFYSGGVY-DGHCGTQ-LDHGVAAVGYGSTR-GLDYIIVKNSWGPKWGEKGYIRMKRN 331
FQFYS GVY + C Q LDHGV VG+G+ G DY +VKNSWG WG+KG+I+M RN
Sbjct: 266 SFQFYSEGVYNEPACDAQNLDHGVLVVGFGTDESGEDYWLVKNSWGTTWGDKGFIKMLRN 325
Query: 332 TGKPEGLCGINKMASYPI 349
E CGI +SYP+
Sbjct: 326 ---KENQCGIASASSYPL 340
>gi|125592011|gb|EAZ32361.1| hypothetical protein OsJ_16571 [Oryza sativa Japonica Group]
Length = 416
Score = 238 bits (608), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 134/294 (45%), Positives = 167/294 (56%), Gaps = 36/294 (12%)
Query: 62 LDEKLERFEIFKDNLRHIDETNRKIK---NYWLGLNEFADLRHEEFKEMFLGLKPDLARR 118
+ E RF +F DNL+ +D N + + LG+N FADL + EF+ +LG P A R
Sbjct: 46 IGEHERRFRVFWDNLKFVDAHNARADERGGFRLGMNRFADLTNGEFRATYLGTTP--AGR 103
Query: 119 KDQSHEDFSYKDVVDLPKSVDWRKKGAVTH-VKNQGSCGSCWAFSTVAAVEGINQIVTGN 177
+ E + + V LP SVDWR KGAV VKNQG CG+ G
Sbjct: 104 GRRVGEAYRHDGVEALPDSVDWRDKGAVVAPVKNQGQCGA-----------------GGV 146
Query: 178 LASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGES 237
+EQ L +MD AF +I GGL EEDYPY +G C + K
Sbjct: 147 REERAEQRL-----------QRWIMDDAFAFIARNGGLDTEEDYPYTAMDGKCNLAKRSR 195
Query: 238 EVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVA 297
+VV+I+G+ DVP+N E SL KA+A+QP+SVAI+A GR+FQ Y GV+ G CGT LDHGV
Sbjct: 196 KVVSIDGFEDVPENDELSLQKAVAHQPVSVAIDAGGREFQLYDSGVFTGRCGTNLDHGVV 255
Query: 298 AVGYGS--TRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPI 349
AVGYG+ G Y V+NSWGP WGE GYIRM+RN G CGI MASYPI
Sbjct: 256 AVGYGTDAATGAAYWTVRNSWGPDWGENGYIRMERNVTARTGKCGIAMMASYPI 309
>gi|281204231|gb|EFA78427.1| cysteine proteinase 3 [Polysphondylium pallidum PN500]
Length = 329
Score = 238 bits (608), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 139/328 (42%), Positives = 189/328 (57%), Gaps = 14/328 (4%)
Query: 28 FSIVGYSP-EDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKI 86
F IVG + L + + F +WM ++ Y++ + + R+ FKDNL I N
Sbjct: 8 FMIVGLAAGSRLFAEKHYQNQFTNWMVVQDRQYDAYEFR-TRYSAFKDNLDFIHRWNAVN 66
Query: 87 KNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHE-DFSYKDVVDLPKSVDWRKKGA 145
K LG FADL +EE++ ++LG+ D + Q D Y+ V ++DWR GA
Sbjct: 67 KETELGATVFADLTNEEYRAVYLGMNVDASNFAAQPATLDQVYQPV---RSTLDWRNNGA 123
Query: 146 VTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDY 204
V VK+QG CGSCWAFST AVEG +QI TGN SLSEQ+L+DC +Y N+GC GGLMD
Sbjct: 124 VGRVKDQGQCGSCWAFSTTGAVEGAHQIATGNFVSLSEQQLMDCSRSYGNHGCQGGLMDS 183
Query: 205 AFQYIVSTGGLHKEEDYPYIMEEG-TCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQ 263
A YIV GG++ EE YPY M + TC+ + ++GY ++ + SE L L
Sbjct: 184 AMSYIVKQGGINTEESYPYEMRDSYTCKYNPANNG-AKLSGYSNIKRGSEADLAAKLNIG 242
Query: 264 PLSVAIEASGRDFQFYSGGV-YDGHC-GTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWG 321
P+++A++AS FQ Y GV YD C T L HGV AVGYG+ Y IVKNSWG +WG
Sbjct: 243 PVAIALDASHSSFQLYKSGVFYDPACSSTSLSHGVLAVGYGTEGSSAYWIVKNSWGTRWG 302
Query: 322 EKGYIRMKRNTGKPEGLCGINKMASYPI 349
+ GYI + ++ CG+ M+S PI
Sbjct: 303 DAGYIWIAKDRNNH---CGVATMSSIPI 327
>gi|294885989|ref|XP_002771502.1| thiolproteinase SmTP1, putative [Perkinsus marinus ATCC 50983]
gi|239875206|gb|EER03318.1| thiolproteinase SmTP1, putative [Perkinsus marinus ATCC 50983]
Length = 337
Score = 238 bits (608), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 137/323 (42%), Positives = 188/323 (58%), Gaps = 17/323 (5%)
Query: 43 KLIDL------FESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEF 96
K +DL F + K K Y++ +E+++R IF DNL +I+E N + +Y LG+NE+
Sbjct: 16 KAVDLEAAGLAFIGFQKKHGKSYDNKEEEMKRAAIFHDNLNYIEEVNAQNLSYKLGVNEY 75
Query: 97 ADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCG 156
DL EEF + L D++ + LP SVDWRKKG + VK+QG CG
Sbjct: 76 TDLTLEEFAALKLS-STDMSEGMGDGFVAGAGPTTTTLPTSVDWRKKGVLNPVKDQGYCG 134
Query: 157 SCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNN-GCNGGLMDYAFQYIVSTGGL 215
SCWAFS + A+E I TG L SLSEQ+L+DC Y N GCNGGLMD AF+YI +T G+
Sbjct: 135 SCWAFSAIGALEPRYAIATGKLLSLSEQQLVDCAGAYGNEGCNGGLMDKAFEYIKAT-GV 193
Query: 216 HKEEDYPYIMEEGTCEMT---KGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEAS 272
KE YPY+ + TC+ T K + V + + +E +L++ +A P+S+A+ A+
Sbjct: 194 DKESTYPYVGSDETCQATVENKTDGLPVGEVTGNQMLHQTEKALMEGVAAAPVSIAMYAN 253
Query: 273 GRDFQFYSGGVY-DGHC---GTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRM 328
+ FQ Y GVY D +C G +DHGV AVGYG+ G DY I++NSWG WG+ GY+ +
Sbjct: 254 LQSFQHYKSGVYSDPNCNAKGGSIDHGVVAVGYGTENGQDYFIIRNSWGRSWGQDGYVYL 313
Query: 329 KRNTGKPEGLCGINKMASYPIKK 351
KR G G C I K P K
Sbjct: 314 KRGVGS-FGQCNIYKYMCVPTLK 335
>gi|402770517|gb|AFQ98393.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 238 bits (607), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 144/321 (44%), Positives = 188/321 (58%), Gaps = 17/321 (5%)
Query: 39 TSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRK----IKNYWLGLN 94
+S + L +E++ + +K Y+S E+L RF+IF +N I + N K + +Y LG+N
Sbjct: 18 SSQEILRTQWEAFKTTHKKTYQSHMEELLRFKIFTENSLIIAKHNAKYAKGLVSYKLGMN 77
Query: 95 EFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVD--LPKSVDWRKKGAVTHVKNQ 152
+F DL EF +F G RK +V D LPK VDWRKKGAVT VK+Q
Sbjct: 78 QFGDLLAHEFARIFNGHH---GTRKTGGSSFLPPANVNDSSLPKVVDWRKKGAVTPVKDQ 134
Query: 153 GSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIVS 211
G CGSCWAFS ++EG + + G L SLSEQ L+DC ++ NNGC GGLM+ AF+YI +
Sbjct: 135 GQCGSCWAFSATGSLEGQHFLKNGELVSLSEQNLVDCSQSFGNNGCEGGLMEDAFKYIKA 194
Query: 212 TGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVAIE 270
G+ E+ YPY +G C K E T GY ++ SE L KA+A P+SVAI+
Sbjct: 195 NDGIDTEKSYPYKAVDGECRFKK-EDVGATDTGYVEIKAGSEVDLKKAVATVGPISVAID 253
Query: 271 ASGRDFQFYSGGVYD-GHCGTQ-LDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRM 328
AS FQ YS GVYD C ++ LDHGV VGYG G Y +VKNSW WG++GYI M
Sbjct: 254 ASHSSFQLYSEGVYDEPECSSEDLDHGVLVVGYGVKGGKKYWLVKNSWAESWGDQGYILM 313
Query: 329 KRNTGKPEGLCGINKMASYPI 349
R+ CGI ASYP+
Sbjct: 314 SRDNNNQ---CGIASQASYPL 331
>gi|195381187|ref|XP_002049336.1| GJ20806 [Drosophila virilis]
gi|194144133|gb|EDW60529.1| GJ20806 [Drosophila virilis]
Length = 339
Score = 238 bits (607), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 141/318 (44%), Positives = 186/318 (58%), Gaps = 21/318 (6%)
Query: 49 ESWMS-KFEKVYESLDEKLERF--EIFKDNLRHIDETNRKIKN----YWLGLNEFADLRH 101
E W + K E +DE ERF +IF +N I + N++ + + + +N++AD+ H
Sbjct: 25 EEWQTFKLEHRKNYVDETEERFRLKIFNENKHKIAKHNQRYASGEVSFKMAVNKYADMLH 84
Query: 102 EEFKEMFLGLKPDLARRKDQSHEDFS-----YKDVVDLPKSVDWRKKGAVTHVKNQGSCG 156
EF G L ++ S F + V +PKSVDWR KGAVT VK+QG CG
Sbjct: 85 HEFHTTMNGFNYTLHKQLRASDPSFVGVTFISPEHVKIPKSVDWRSKGAVTEVKDQGHCG 144
Query: 157 SCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIVSTGGL 215
SCWAFS+ A+EG + G L SLSEQ L+DC Y NNGCNGGLMD AF+YI GG+
Sbjct: 145 SCWAFSSTGALEGQHFRKAGTLISLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGGI 204
Query: 216 HKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVAIEASGR 274
E+ YPY + +C K + T G D+PQ E + +A+A P+SVAI+AS
Sbjct: 205 DTEKSYPYEGIDDSCHFNKA-TIGATDRGSVDIPQGDEKKMAEAVATIGPVSVAIDASHE 263
Query: 275 DFQFYSGGVY-DGHCGTQ-LDHGVAAVGYGSTR-GLDYIIVKNSWGPKWGEKGYIRMKRN 331
FQFYS G+Y + C Q LDHGV VGYG+ G DY +VKNSWG WG+KG+I+M RN
Sbjct: 264 SFQFYSEGIYNEPQCDPQNLDHGVLVVGYGTDESGQDYWLVKNSWGTTWGDKGFIKMARN 323
Query: 332 TGKPEGLCGINKMASYPI 349
+ CGI +SYP+
Sbjct: 324 A---DNQCGIASASSYPL 338
>gi|255544115|ref|XP_002513120.1| cysteine protease, putative [Ricinus communis]
gi|223548131|gb|EEF49623.1| cysteine protease, putative [Ricinus communis]
Length = 362
Score = 238 bits (607), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 126/270 (46%), Positives = 173/270 (64%), Gaps = 9/270 (3%)
Query: 19 FIRSSFARDFSIVGYSPEDLTSNDKLIDLFE---SWMSKFEKVYESLDEKLERFEIFKDN 75
+I +FA FSI ++ + + + ++E WM+ + +VY+ +EK R++IFK+N
Sbjct: 7 YICITFALFFSIGAWTSQCMARTLQEASMYERHEQWMASYARVYKDANEKQMRYKIFKEN 66
Query: 76 LRHIDETNRKI-KNYWLGLNEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVDL 134
++ ID N + K+Y L +N+FADL +EEFK + G K + + F Y++V +
Sbjct: 67 VQRIDSFNSESDKSYKLAVNQFADLTNEEFKSLRNGFKGHMCSAQAG---HFRYENVTAV 123
Query: 135 PKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCD-NTY 193
P S+DWRKKGAVT +K QG CGSCWAFS VAAVEGI +I TG L SLSEQEL+DCD N+
Sbjct: 124 PASIDWRKKGAVTQIKEQGQCGSCWAFSAVAAVEGITEIKTGKLISLSEQELVDCDTNSE 183
Query: 194 NNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSE 253
+ GC GGLMD AF++I GL E YPY + TC+ + I GY DVP N E
Sbjct: 184 DQGCQGGLMDDAFKFI-EQHGLASEATYPYDAADSTCKTKEEAKPSAKITGYEDVPANDE 242
Query: 254 DSLLKALANQPLSVAIEASGRDFQFYSGGV 283
+L A+ANQP+SVAI+A G +FQFYS G+
Sbjct: 243 AALKNAVANQPVSVAIDAGGFEFQFYSSGI 272
>gi|324983200|gb|ADY68475.1| stem bromelain [Ananas comosus]
Length = 291
Score = 238 bits (607), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 122/303 (40%), Positives = 180/303 (59%), Gaps = 15/303 (4%)
Query: 1 MALSSQFKTILISFCISFFIRSSFARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYE 60
MA Q + + C+ + S+ +RD +D ++ FE WM+++ +VY+
Sbjct: 1 MASKVQLVFLFLFLCVMWASPSAASRD-----------EPSDPMMKRFEEWMAEYGRVYK 49
Query: 61 SLDEKLERFEIFKDNLRHIDE-TNRKIKNYWLGLNEFADLRHEEFKEMFLGLKPDLARRK 119
DEK+ RF+IFK+N+ HI+ NR +Y LG+N+F D+ + EF + G +
Sbjct: 50 DNDEKMRRFQIFKNNVNHIETFNNRNGNSYTLGINKFTDMTNNEFVAQYTGGISRPLNIE 109
Query: 120 DQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLA 179
+ F ++ + +S+DWR GAVT VK+Q CGSCWAFS +A VEGI +IVTG L
Sbjct: 110 KEPVVSFDDVNISAVGQSIDWRDYGAVTEVKDQNPCGSCWAFSAIATVEGIYKIVTGYLV 169
Query: 180 SLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEV 239
SLSEQE++DC +NGC+GG +D A+ +I+S G+ E DYPY +G C +
Sbjct: 170 SLSEQEVLDC--AVSNGCDGGFVDNAYDFIISNNGVASEADYPYQAYQGDCAANSWPNSA 227
Query: 240 VTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAV 299
I GY V N E S+ A+ NQP++ AI+ASG +FQ+Y+GGV+ G CGT L+H + +
Sbjct: 228 Y-ITGYSYVRSNDESSMKYAVWNQPIAAAIDASGDNFQYYNGGVFSGPCGTSLNHAITII 286
Query: 300 GYG 302
GYG
Sbjct: 287 GYG 289
>gi|125811033|ref|XP_001361727.1| GA25021 [Drosophila pseudoobscura pseudoobscura]
gi|54636904|gb|EAL26307.1| GA25021 [Drosophila pseudoobscura pseudoobscura]
Length = 341
Score = 238 bits (607), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 142/318 (44%), Positives = 188/318 (59%), Gaps = 21/318 (6%)
Query: 49 ESWMS-KFEKVYESLDEKLERF--EIFKDNLRHIDETNR----KIKNYWLGLNEFADLRH 101
E W + K E DE ERF +IF +N I + N+ ++ + +N++AD+ H
Sbjct: 27 EEWHTFKLEHRKNYQDETEERFRLKIFNENKHKIAKHNQLWATGAVSFKMAVNKYADMLH 86
Query: 102 EEFKEMFLGLKPDLARRKDQSHEDFS-----YKDVVDLPKSVDWRKKGAVTHVKNQGSCG 156
EF G L ++ + E F + V LPK VDWR KGAVT VK+QG CG
Sbjct: 87 HEFYSTMNGFNYTLHKQLRNADESFKGVTFISPEHVTLPKQVDWRTKGAVTDVKDQGHCG 146
Query: 157 SCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIVSTGGL 215
SCWAFS+ A+EG + +G L SLSEQ L+DC Y NNGCNGGLMD AF+YI GG+
Sbjct: 147 SCWAFSSTGALEGQHYRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGGI 206
Query: 216 HKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVAIEASGR 274
E+ YPY + +C KG + T G+ D+PQ +E + +A+A P++VAI+AS
Sbjct: 207 DTEKSYPYEAIDDSCHFNKG-TIGATDRGFVDIPQGNEKKMAEAVATIGPVAVAIDASHE 265
Query: 275 DFQFYSGGVY-DGHCGTQ-LDHGVAAVGYGSTR-GLDYIIVKNSWGPKWGEKGYIRMKRN 331
FQFYS GVY + C Q LDHGV VG+G+ G DY +VKNSWG WG+KG+I+M RN
Sbjct: 266 SFQFYSEGVYNEPACDAQNLDHGVLVVGFGTDESGQDYWLVKNSWGTTWGDKGFIKMLRN 325
Query: 332 TGKPEGLCGINKMASYPI 349
E CGI +SYP+
Sbjct: 326 ---KENQCGIASASSYPL 340
>gi|116788286|gb|ABK24823.1| unknown [Picea sitchensis]
Length = 294
Score = 238 bits (607), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 120/254 (47%), Positives = 164/254 (64%), Gaps = 8/254 (3%)
Query: 29 SIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETN-RKIK 87
+ + Y+P DL+ N L+ LF+ W + K Y + L RF++FK+NL +I E N R
Sbjct: 21 TAITYNPRDLSENG-LLSLFDRWCNHHGKTYTAKQRPL-RFQVFKENLFYISEHNSRGNH 78
Query: 88 NYWLGLNEFADLRHEEFKEMFLGLK---PDLARRKDQSHEDFSYKDVVDLPKSVDWRKKG 144
+WLGLN F+DL +EF+ +GL+ P L R+ + ++ ++P S+DWR K
Sbjct: 79 TFWLGLNAFSDLTSDEFRTQQMGLRGHPPSLKSRRREPKSGL--LELYNIPSSLDWRDKD 136
Query: 145 AVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDY 204
AVT VK+QG+CG CWAFS A+EGIN+IVTG+L SLSEQEL DCD +YN+GC+GGLMDY
Sbjct: 137 AVTGVKDQGACGDCWAFSATGAIEGINKIVTGSLVSLSEQELCDCDTSYNSGCDGGLMDY 196
Query: 205 AFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQP 264
AFQ+++ GG+ E DYPY + C K VVTI+ Y DVP N+E +LL+A+ QP
Sbjct: 197 AFQWVIVNGGIDTEVDYPYKGVQKACNSKKVNRRVVTIDDYIDVPANNERALLQAVVGQP 256
Query: 265 LSVAIEASGRDFQF 278
+SV I R FQ
Sbjct: 257 VSVGISGGERAFQL 270
>gi|161408097|dbj|BAF94152.1| cathepsin L-like cysteine protease 2 [Plautia stali]
Length = 334
Score = 238 bits (607), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 144/317 (45%), Positives = 188/317 (59%), Gaps = 13/317 (4%)
Query: 42 DKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIK----NYWLGLNEFA 97
D D F + K Y++ E+ R +IF +N + I++ N + K ++ L LN A
Sbjct: 21 DLSADEFTLFKKFHRKEYDNELEESYRKKIFLENKKRIEKHNSRYKQGKVSFKLKLNHLA 80
Query: 98 DLRHEEFKEMFLGL-KPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCG 156
D+ E+ +++LG K A F V L K VDWR KGAVT VKNQG CG
Sbjct: 81 DMLIHEYSDVYLGFNKSSKANNNKLQSYTFIPPAHVTLNKEVDWRTKGAVTPVKNQGHCG 140
Query: 157 SCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIVSTGGL 215
SCWAFST A+EG N TG L SLSEQ L+DC +Y NNGC GGLMD AFQYI G+
Sbjct: 141 SCWAFSTTGALEGQNFRKTGKLVSLSEQNLVDCSGSYGNNGCEGGLMDNAFQYIKENHGI 200
Query: 216 HKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVAIEASGR 274
E+ YPY E+ TC K S T +G+ D+ Q E++L++A+A P+SVAI+AS +
Sbjct: 201 DTEKSYPYEGEDETCRFRK-TSIGATDSGFVDITQGDEEALMQAVATIGPISVAIDASHQ 259
Query: 275 DFQFYSGGV-YDGHCGTQ-LDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNT 332
FQFYS GV Y+ C ++ LDHGV VGYG Y +VKNSWG +WG+ GYI+M R+
Sbjct: 260 SFQFYSEGVYYEPECSSENLDHGVLVVGYGVEDNQKYWLVKNSWGTQWGDGGYIKMARD- 318
Query: 333 GKPEGLCGINKMASYPI 349
+ CGI ASYP+
Sbjct: 319 --QDNNCGIATQASYPL 333
>gi|402770515|gb|AFQ98392.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 238 bits (606), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 144/321 (44%), Positives = 188/321 (58%), Gaps = 17/321 (5%)
Query: 39 TSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRK----IKNYWLGLN 94
+S + L +E++ + +K Y+S E+L RF+IF +N I + N K + +Y LG+N
Sbjct: 18 SSQEILRTQWEAFKTTHKKTYQSHMEELLRFKIFTENSLIIAKHNAKYAKGLVSYKLGMN 77
Query: 95 EFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVD--LPKSVDWRKKGAVTHVKNQ 152
+F DL EF +F G RK +V D LPK VDWRKKGAVT VK+Q
Sbjct: 78 QFGDLLAHEFARIFNGHH---GTRKTGGSSFLPPANVNDSSLPKVVDWRKKGAVTPVKDQ 134
Query: 153 GSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIVS 211
G CGSCWAFS ++EG + + G L SLSEQ L+DC ++ NNGC GGLM+ AF+YI +
Sbjct: 135 GQCGSCWAFSATGSLEGQHFLKNGELVSLSEQNLVDCSQSFGNNGCEGGLMEDAFKYIKA 194
Query: 212 TGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVAIE 270
G+ E+ YPY +G C K E T GY ++ SE L KA+A P+SVAI+
Sbjct: 195 NDGIDTEKSYPYEAVDGECRFKK-EDVGATDTGYVEIKAGSEVDLKKAVATVGPISVAID 253
Query: 271 ASGRDFQFYSGGVYD-GHCGTQ-LDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRM 328
AS FQ YS GVYD C ++ LDHGV VGYG G Y +VKNSW WG++GYI M
Sbjct: 254 ASHSSFQLYSEGVYDEPECSSEDLDHGVLVVGYGVKGGKKYWLVKNSWAESWGDQGYILM 313
Query: 329 KRNTGKPEGLCGINKMASYPI 349
R+ CGI ASYP+
Sbjct: 314 SRDNNNQ---CGIASQASYPL 331
>gi|414887427|tpg|DAA63441.1| TPA: hypothetical protein ZEAMMB73_713985 [Zea mays]
Length = 355
Score = 238 bits (606), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 131/319 (41%), Positives = 185/319 (57%), Gaps = 28/319 (8%)
Query: 44 LIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIK-NYWLGLNEFADLRHE 102
++D F +W + + + Y + E+L RFE+++ N+ I+ TNR+ + +Y L F DL E
Sbjct: 36 MMDRFRAWQATYNRSYLTAAERLRRFEVYRQNMELIEATNRRAELSYQLSETPFTDLTSE 95
Query: 103 EF---KEMFLGLKPDLARRKDQ----SH-----------EDFSYKDVVDLPKSVDWRKKG 144
EF M L A R+ + +H +Y +D+P+SVDWR KG
Sbjct: 96 EFLATHTMSTRLHASEAARRHRELITTHAGPVSDGGRQWNRRNYTTDLDVPESVDWRTKG 155
Query: 145 AVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDY 204
AVT VK+QG+CG CW+F+TVAA+EG+++I TG L SLSEQE++DC + NNGC+GG
Sbjct: 156 AVTTVKDQGACGGCWSFATVAAIEGLHKIRTGQLVSLSEQEVLDCSSPPNNGCHGGNPAA 215
Query: 205 AFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQP 264
A ++ + GGL E DYPY +G C++ K + V I G V QN+E +L A+A QP
Sbjct: 216 AIDWVSANGGLTTESDYPYEGRQGKCKLDKARNHVAKIRGRKLVDQNNEAALEVAVAQQP 275
Query: 265 LSVAIEASGRDFQFYSGGVYDGHCGTQ-LDHGVAAVGYGS-TRGLDYIIVKNSWGPKWGE 322
++V + Q Y GV+ G C + L+H V VGYG+ + G Y IVKNSWG KWGE
Sbjct: 276 VAVGMNVHPIQ-QHYKSGVFHGPCDPEDLNHAVTMVGYGAESGGRKYWIVKNSWGEKWGE 334
Query: 323 KGYIR------MKRNTGKP 335
KGY R R +G P
Sbjct: 335 KGYFRGFASRGASRTSGAP 353
>gi|19698255|dbj|BAB86770.1| cathepsin L-like [Engraulis japonicus]
Length = 324
Score = 238 bits (606), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 139/310 (44%), Positives = 178/310 (57%), Gaps = 16/310 (5%)
Query: 48 FESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNR----KIKNYWLGLNEFADLRHEE 103
F W +KF K Y SL+++ R ++ N + I N+ + +Y GLN+F+D+ HEE
Sbjct: 22 FNEWKAKFGKSYPSLEKEAHRKGLWLANHQKIQAHNQLADQGVHSYRQGLNQFSDMDHEE 81
Query: 104 FKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFST 163
F++ L K D + + E F +V L SVDWR G V+ +KNQG CGSCW+FS
Sbjct: 82 FRQTVL-TKMDPPKNNRGASEPFRALNV-GLAASVDWRTSGCVSPIKNQGQCGSCWSFSA 139
Query: 164 VAAVEGINQIVTGNLASLSEQELIDCDNTYNN-GCNGGLMDYAFQYIVSTGGLHKEEDYP 222
A+E + G L SLSEQ+L+DC +Y N GCNGG D AFQYI + GG+ E YP
Sbjct: 140 TGALESQTCLRRGYLPSLSEQQLVDCSGSYGNYGCNGGWPDQAFQYIQANGGIDSESYYP 199
Query: 223 YIMEEGTCEMTKGESEVVTINGYHDV-PQNSEDSLLKALAN-QPLSVAIEASGRDFQFYS 280
Y GTC S T +GY DV P SE +L +AN PLS+AI+ASG +Q Y
Sbjct: 200 YQARVGTCHYNSAYS-AATCSGYQDVTPVGSESALQYYVANVGPLSIAIDASG--WQSYQ 256
Query: 281 GGVY-DGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLC 339
GV+ D C DH V VGYG+ G DY +VKNSWG WGE+GYI M RN C
Sbjct: 257 SGVFNDPSCSQTADHAVLLVGYGTYNGQDYWLVKNSWGTWWGEQGYIMMTRNANNQ---C 313
Query: 340 GINKMASYPI 349
GI ASYP+
Sbjct: 314 GIANHASYPL 323
>gi|318816588|ref|NP_001187996.1| cathepsin L precursor [Ictalurus punctatus]
gi|308324547|gb|ADO29408.1| cathepsin L [Ictalurus punctatus]
Length = 334
Score = 238 bits (606), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 137/312 (43%), Positives = 185/312 (59%), Gaps = 14/312 (4%)
Query: 48 FESWMSKFEKVYESLDEKLERFEIFKDN----LRHIDETNRKIKNYWLGLNEFADLRHEE 103
F SW KF K+Y+S++E+ +R + +N L H ++ IK+Y LG+ FAD+ ++E
Sbjct: 26 FHSWKLKFGKIYKSVEEESQRKNTWLENRKLVLVHNMLADQGIKSYRLGMTYFADMDNQE 85
Query: 104 FKE-MFLGLKPDLARRKDQSHEDFSYK-DVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAF 161
+++ +F G R K F + LP +VDWR KG V VK+Q +CGSCWAF
Sbjct: 86 YRQSVFKGCLGSFNRTKGHRASTFLLQAGGAVLPDTVDWRDKGYVAEVKDQKNCGSCWAF 145
Query: 162 STVAAVEGINQIVTGNLASLSEQELIDCDNTYNN-GCNGGLMDYAFQYIVSTGGLHKEED 220
S ++EG TG L SLSEQ+L+DC Y N GC GGLMD AF+YI G+ EE
Sbjct: 146 SATGSLEGQTFRKTGKLVSLSEQQLVDCSGKYGNMGCGGGLMDLAFEYIEDNKGIDTEES 205
Query: 221 YPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVAIEASGRDFQFY 279
YPY +G C K + T GY D+ E++L KA+AN P+SVAI+A FQ Y
Sbjct: 206 YPYEATDGDCRF-KPATVGATCTGYVDINSEDENALQKAVANIGPISVAIDAGHISFQLY 264
Query: 280 SGGVY-DGHCGTQ-LDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEG 337
G+Y + +C ++ LDHGV AVGYG+ DY +VKNSWG WG++GYI+M RN
Sbjct: 265 GSGIYNEPNCSSEDLDHGVLAVGYGTDNQQDYWLVKNSWGLDWGDQGYIKMTRNKNNQ-- 322
Query: 338 LCGINKMASYPI 349
CGI ASYP+
Sbjct: 323 -CGIATAASYPL 333
>gi|225707912|gb|ACO09802.1| Cathepsin K precursor [Osmerus mordax]
Length = 331
Score = 238 bits (606), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 139/310 (44%), Positives = 182/310 (58%), Gaps = 15/310 (4%)
Query: 48 FESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRK----IKNYWLGLNEFADLRHEE 103
+E+W + K Y LDE+ R I++ N+R I+ N++ + +Y LG+N D+ EE
Sbjct: 28 WENWKTTHNKEYNGLDEEGIRRAIWEKNMRMIEAHNQEAALGMHSYELGMNNLGDMTSEE 87
Query: 104 FKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFST 163
E +GL+ L R D+ + V LPKS+D+R+KG VT VKNQGSCGSCWAFS+
Sbjct: 88 VAEKMMGLQVPLNR--DRGNTFVPDNTVERLPKSIDYRRKGMVTPVKNQGSCGSCWAFSS 145
Query: 164 VAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPY 223
V A+EG TG L LS Q L+DC T NNGC GG M AF Y+ G+ E YPY
Sbjct: 146 VGALEGQLMKTTGKLVDLSPQNLVDCV-TENNGCGGGYMTNAFNYVRDNQGIDSEAAYPY 204
Query: 224 IMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVAIEASGRDFQFYSGG 282
I ++ TC + GY ++P+ +E +L A+A P+SV I+A+ FQFY G
Sbjct: 205 IGQDETCAYNV-SGMTASCRGYKEIPEGNERALTVAVAKVGPVSVGIDATLSTFQFYQKG 263
Query: 283 V-YDGHCGT-QLDHGVAAVGYGST-RGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLC 339
V YD +C ++H V AVGYG T +G Y IVKNSW WG KGYI M RN G LC
Sbjct: 264 VYYDRNCNKDDINHAVLAVGYGVTPKGKKYWIVKNSWSESWGNKGYILMARNRGN---LC 320
Query: 340 GINKMASYPI 349
GI +ASYPI
Sbjct: 321 GIANLASYPI 330
>gi|226531284|ref|NP_001147086.1| thiol protease SEN102 precursor [Zea mays]
gi|195607128|gb|ACG25394.1| thiol protease SEN102 precursor [Zea mays]
Length = 356
Score = 238 bits (606), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 145/343 (42%), Positives = 196/343 (57%), Gaps = 18/343 (5%)
Query: 24 FARDFSIVGYSPEDLTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETN 83
FA + G + D T L++ F++W +++ + Y + +E +RF I+ +N+R I N
Sbjct: 14 FACSLLLAGTAFSDDTIAIPLLERFKAWQAEYNRTYATPEEFQQRFMIYSENVRFIKTMN 73
Query: 84 R--KIKNYWLGLNEFADLRHEEFKE---MFLGLKPDLARRKDQSHEDFSY------KDVV 132
+ +Y LG N+F DL EEFK+ M L +P A + S +
Sbjct: 74 QLSTGSSYELGENQFTDLTEEEFKDTYLMKLDEQPPAAEAMGPTVGTMSTAGMSNGNNTG 133
Query: 133 DLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNT 192
+ P SVDWR KGAVT VK+Q CGSCWAF+TVA++EG++QI TG L SLSEQE++DCD
Sbjct: 134 EAPNSVDWRTKGAVTRVKDQQQCGSCWAFATVASIEGVHQIKTGRLVSLSEQEIVDCDRG 193
Query: 193 YN-NGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQN 251
N NGC GG A +++ GGL E DYPY+ + C K I GY V +N
Sbjct: 194 GNDNGCRGGSPRSAMEWVTRNGGLTTESDYPYVGSQRQCMSGKLGHHAARIRGYQAVQRN 253
Query: 252 SEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHC-GTQLDHGVAAVGYGST----RG 306
+E L +A+A +P++V I+AS R FQFY GV+ G C T ++H V VGYGST G
Sbjct: 254 NEAELERAVAERPVAVFIDAS-RAFQFYKSGVFSGPCDTTTVNHVVTVVGYGSTGSDSGG 312
Query: 307 LDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPI 349
Y IVKNSWG WGE GY+RM R EG+C I YP+
Sbjct: 313 RKYWIVKNSWGQGWGENGYVRMARRVRAREGMCAIAIEPYYPV 355
>gi|158268253|gb|ABW25046.1| cathepsin L-like protease [Strongylus vulgaris]
Length = 354
Score = 238 bits (606), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 145/352 (41%), Positives = 198/352 (56%), Gaps = 18/352 (5%)
Query: 11 LISFCISFF--IRSSFARDFSIVGYSPEDLTSN-DKLIDLFESWMSKFEKVYESLDEKLE 67
L+ C S F I S D +I + + L D+ L++ + F K Y DE+ +
Sbjct: 7 LVLLCASVFASIDSGSRHDHTIRLHRVKSLRQKIDEAFKLWDDYKESFGKSYNK-DEEND 65
Query: 68 RFEIFKDNLRHIDETNRK----IKNYWLGLNEFADLRHEEFKEMF-LGLKPDLARRKDQS 122
E F N+ HIDE N++ K + +GLN ADL +++++ + + +
Sbjct: 66 YMEAFVKNVIHIDEHNQEHRLGRKTFEMGLNSIADLPFSQYRKLNGYRHRRNFGDSMQSN 125
Query: 123 HEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLS 182
+ V++P SVDWR KG VT VKNQG CGSCWAFS A+EG + +G + SLS
Sbjct: 126 GTKWLAPFNVEIPDSVDWRDKGLVTDVKNQGMCGSCWAFSATGALEGQHARASGKMVSLS 185
Query: 183 EQELIDCDNTY-NNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVT 241
EQ L+DC Y N+GCNGGLMD AF+YI G+ EE YPY+ E C K +
Sbjct: 186 EQNLVDCSTKYGNHGCNGGLMDLAFEYIKDNHGIDTEESYPYVGRETKCHFKKKDIGAED 245
Query: 242 INGYHDVPQNSEDSLLKALANQ-PLSVAIEASGRDFQFYSGGV-YDGHCGT-QLDHGVAA 298
G+ D+P+ E++L A+A Q P+S+AI+A R FQ Y GV YD C + +LDHGV
Sbjct: 246 -KGFVDLPEGDEEALKVAVATQGPISIAIDAGHRTFQLYKKGVYYDEECSSEELDHGVLL 304
Query: 299 VGYGS-TRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPI 349
VGYG+ DY ++KNSWGP WGEKGYIR+ RN CG+ ASYP+
Sbjct: 305 VGYGTDPEAGDYWLIKNSWGPGWGEKGYIRIARNRSNH---CGVATKASYPL 353
>gi|50657029|emb|CAH04632.1| cathepsin L [Suberites domuncula]
Length = 324
Score = 238 bits (606), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 133/306 (43%), Positives = 179/306 (58%), Gaps = 13/306 (4%)
Query: 50 SWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKN--YWLGLNEFADLRHEEFKEM 107
+W + K Y E+L R I++ N + ID N Y L +NEF DL EFK++
Sbjct: 25 AWKQEHSKEYTEELEELRRHTIWQSNKKFIDSHNSVSDKFGYTLEMNEFGDLSGVEFKQI 84
Query: 108 FLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAV 167
+ G + + + + F+ ++ SVDWR+KG V+ VKNQG CGSCW+FS ++
Sbjct: 85 YNGY---IMQERANDTKLFTASPYMEPAASVDWRQKGVVSEVKNQGQCGSCWSFSATGSL 141
Query: 168 EGINQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIME 226
EG + + G L SLSEQ L+DC + + N+GC GG+MD AF+Y++S G+ E YPY +
Sbjct: 142 EGQHALKMGRLVSLSEQNLMDCSSRFGNHGCKGGIMDDAFRYVISNHGVDTESSYPYTAK 201
Query: 227 EGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVAIEASGRDFQFYSGGV-Y 284
+G C + T Y D+ + SE SL +A A P+SVAI+AS R FQFY GV Y
Sbjct: 202 DGYCRFNQNNVG-ATETSYRDIARGSESSLTQASAQIGPISVAIDASHRSFQFYKNGVYY 260
Query: 285 DGHC-GTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINK 343
+ C ++LDHGV VGYG+ G DY IVKNSWG +WG GYI M RN CGI
Sbjct: 261 EPSCSSSRLDHGVLVVGYGTEGGQDYFIVKNSWGTRWGMDGYIMMSRNR---RNNCGIAS 317
Query: 344 MASYPI 349
ASYPI
Sbjct: 318 QASYPI 323
>gi|293334761|ref|NP_001168296.1| uncharacterized protein LOC100382061 [Zea mays]
gi|223947281|gb|ACN27724.1| unknown [Zea mays]
Length = 322
Score = 238 bits (606), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 131/319 (41%), Positives = 185/319 (57%), Gaps = 28/319 (8%)
Query: 44 LIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIK-NYWLGLNEFADLRHE 102
++D F +W + + + Y + E+L RFE+++ N+ I+ TNR+ + +Y L F DL E
Sbjct: 3 MMDRFRAWQATYNRSYLTAAERLRRFEVYRQNMELIEATNRRAELSYQLSETPFTDLTSE 62
Query: 103 EF---KEMFLGLKPDLARRKDQ----SH-----------EDFSYKDVVDLPKSVDWRKKG 144
EF M L A R+ + +H +Y +D+P+SVDWR KG
Sbjct: 63 EFLATHTMSTRLHASEAARRHRELITTHAGPVSDGGRQWNRRNYTTDLDVPESVDWRTKG 122
Query: 145 AVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDY 204
AVT VK+QG+CG CW+F+TVAA+EG+++I TG L SLSEQE++DC + NNGC+GG
Sbjct: 123 AVTTVKDQGACGGCWSFATVAAIEGLHKIRTGQLVSLSEQEVLDCSSPPNNGCHGGNPAA 182
Query: 205 AFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQP 264
A ++ + GGL E DYPY +G C++ K + V I G V QN+E +L A+A QP
Sbjct: 183 AIDWVSANGGLTTESDYPYEGRQGKCKLDKARNHVAKIRGRKLVDQNNEAALEVAVAQQP 242
Query: 265 LSVAIEASGRDFQFYSGGVYDGHCGTQ-LDHGVAAVGYGS-TRGLDYIIVKNSWGPKWGE 322
++V + Q Y GV+ G C + L+H V VGYG+ + G Y IVKNSWG KWGE
Sbjct: 243 VAVGMNVHPIQ-QHYKSGVFHGPCDPEDLNHAVTMVGYGAESGGRKYWIVKNSWGEKWGE 301
Query: 323 KGYIR------MKRNTGKP 335
KGY R R +G P
Sbjct: 302 KGYFRGFASRGASRTSGAP 320
>gi|151573014|gb|ABS17682.1| cathepsin L-1 [Artemia salina]
Length = 334
Score = 238 bits (606), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 139/322 (43%), Positives = 188/322 (58%), Gaps = 15/322 (4%)
Query: 38 LTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETN----RKIKNYWLGL 93
L+ + L D + + + +K Y S E+ R +I+ +N + + N + K+Y + +
Sbjct: 17 LSLTNLLADEWHLFKATHKKEYPSQLEEKFRMKIYLENKHKVAKHNILYEKGEKSYHVAM 76
Query: 94 NEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDV--VDLPKSVDWRKKGAVTHVKN 151
N+F DL H EF+ + G + + ++ F++ + V +P+SVDWR+KGA+T VK+
Sbjct: 77 NKFGDLLHHEFRSIMNGYQHK-KQNSSRAESTFTFMEPANVTVPESVDWREKGAITPVKD 135
Query: 152 QGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIV 210
QG CGSCWAFS+ A+EG TG L SLSEQ LIDC Y N GCNGGLMD AFQYI
Sbjct: 136 QGQCGSCWAFSSTGALEGQTFRKTGKLVSLSEQNLIDCSGKYGNEGCNGGLMDQAFQYIK 195
Query: 211 STGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVAI 269
G+ E YPY E+ C V G+ D+P ED L A+A P+SVAI
Sbjct: 196 DNKGIDTENTYPYEAEDDVCRYNPRNRGAVD-RGFVDIPSGEEDKLKAAVATVGPVSVAI 254
Query: 270 EASGRDFQFYSGGV-YDGHCGT-QLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIR 327
+AS FQFYS GV Y+ C + LDHGV VGYGS G DY +VKNSW WG++GYI+
Sbjct: 255 DASHESFQFYSKGVYYEPSCDSDDLDHGVLVVGYGSDNGKDYWLVKNSWSEHWGDEGYIK 314
Query: 328 MKRNTGKPEGLCGINKMASYPI 349
M RN + CG+ ASYP+
Sbjct: 315 MARNR---KNHCGVASAASYPL 333
>gi|402770511|gb|AFQ98390.1| cathepsin L [Rhipicephalus microplus]
gi|402770513|gb|AFQ98391.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 238 bits (606), Expect = 5e-60, Method: Compositional matrix adjust.
Identities = 144/321 (44%), Positives = 188/321 (58%), Gaps = 17/321 (5%)
Query: 39 TSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRK----IKNYWLGLN 94
+S + L +E++ + +K Y+S E+L RF+IF +N I + N K + +Y LG+N
Sbjct: 18 SSQEILRTQWEAFKTTHKKTYQSHMEELLRFKIFTENSLIIAKHNAKYAKGLVSYKLGMN 77
Query: 95 EFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVD--LPKSVDWRKKGAVTHVKNQ 152
+F DL EF +F G RK +V D LPK VDWRKKGAVT VK+Q
Sbjct: 78 QFGDLLAHEFARIFNGHH---GTRKTGGSSFLPPANVNDSSLPKVVDWRKKGAVTPVKDQ 134
Query: 153 GSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIVS 211
G CGSCWAFS ++EG + + G L SLSEQ L+DC ++ NNGC GGLM+ AF+YI +
Sbjct: 135 GQCGSCWAFSATGSLEGQHFLKNGELVSLSEQNLVDCSQSFGNNGCEGGLMEDAFKYIKA 194
Query: 212 TGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVAIE 270
G+ E+ YPY +G C K E T GY ++ SE L KA+A P+SVAI+
Sbjct: 195 NDGIDTEKSYPYEAVDGECRFKK-EDVGATDTGYVEIKAGSEVDLKKAVATVGPISVAID 253
Query: 271 ASGRDFQFYSGGVYD-GHCGTQ-LDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRM 328
AS FQ YS GVYD C ++ LDHGV VGYG G Y +VKNSW WG++GYI M
Sbjct: 254 ASHSSFQLYSEGVYDEPECSSEDLDHGVLVVGYGVKGGKKYWLVKNSWAESWGDQGYILM 313
Query: 329 KRNTGKPEGLCGINKMASYPI 349
R+ CGI ASYP+
Sbjct: 314 SRDNNNQ---CGIASQASYPL 331
>gi|313507179|pdb|2ACT|A Chain A, Crystallographic Refinement Of The Structure Of Actinidin
At 1.7 Angstroms Resolution By Fast Fourier
Least-Squares Methods
Length = 220
Score = 238 bits (606), Expect = 5e-60, Method: Compositional matrix adjust.
Identities = 112/218 (51%), Positives = 153/218 (70%), Gaps = 2/218 (0%)
Query: 134 LPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTY 193
LP VDWR GAV +K+QG CG WAFS +A VEGIN+I +G+L SLSEQELIDC T
Sbjct: 1 LPSYVDWRSAGAVVDIKSQGECGGXWAFSAIATVEGINKITSGSLISLSEQELIDCGRTQ 60
Query: 194 NN-GCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNS 252
N GC+GG + FQ+I++ GG++ EE+YPY ++G C++ + + VTI+ Y +VP N+
Sbjct: 61 NTRGCDGGYITDGFQFIINDGGINTEENYPYTAQDGDCDVALQDQKYVTIDTYENVPYNN 120
Query: 253 EDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIV 312
E +L A+ QP+SVA++A+G F+ Y+ G++ G CGT +DH + VGYG+ G+DY IV
Sbjct: 121 EWALQTAVTYQPVSVALDAAGDAFKQYASGIFTGPCGTAVDHAIVIVGYGTEGGVDYWIV 180
Query: 313 KNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
KNSW WGE+GY+R+ RN G G CGI M SYP+K
Sbjct: 181 KNSWDTTWGEEGYMRILRNVGG-AGTCGIATMPSYPVK 217
>gi|33242876|gb|AAQ01142.1| cathepsin [Branchiostoma lanceolatum]
Length = 334
Score = 238 bits (606), Expect = 5e-60, Method: Compositional matrix adjust.
Identities = 139/313 (44%), Positives = 178/313 (56%), Gaps = 14/313 (4%)
Query: 48 FESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRK----IKNYWLGLNEFADLRHEE 103
+E W + K YE+ E+ R I + N I E N + + +Y L +N+F D+ HEE
Sbjct: 24 WEMWKLQHGKQYETEAEEYSRRFILEKNTVKIAEHNIRASLGMHSYTLAMNKFGDMHHEE 83
Query: 104 FKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFST 163
F + +G + ++ + D LPKSVDWR V+ VK+QG CGSCWAFST
Sbjct: 84 FHQRIMGGCLKIVKKPLLGSDVGDNDDNGTLPKSVDWRNSHMVSEVKDQGECGSCWAFST 143
Query: 164 VAAVEGINQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIVSTGGLHKEEDYP 222
++EG + TG L LSEQ+L+DC + N GC GGLMD AFQYI + GGL EE YP
Sbjct: 144 TGSLEGQHSNKTGKLVDLSEQQLVDCSKDFGNQGCGGGLMDQAFQYIKANGGLDTEESYP 203
Query: 223 YIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVAIEASGRDFQFYSG 281
Y + S T+ GY DV +E +L +A+A P+SVAI+A FQFYS
Sbjct: 204 YTATDDKPCKFDNSSVGATLVGYKDVKSGNEHALKRAVATVGPVSVAIDAGHESFQFYSS 263
Query: 282 GVYD-GHCGT-QLDHGVAAVGYGSTRG---LDYIIVKNSWGPKWGEKGYIRMKRNTGKPE 336
GVYD C T QLDHGV AVGYG+ + IVKNSWGP WG++GYI M RN
Sbjct: 264 GVYDEPQCSTEQLDHGVLAVGYGAMNDNSHQAFWIVKNSWGPSWGDQGYIMMSRNKNNQ- 322
Query: 337 GLCGINKMASYPI 349
CGI ASYP+
Sbjct: 323 --CGIATSASYPL 333
>gi|351629617|gb|AEQ54772.1| KDEL-tailed cysteine proteinase CP4, partial [Coffea canephora]
Length = 215
Score = 238 bits (606), Expect = 5e-60, Method: Compositional matrix adjust.
Identities = 120/201 (59%), Positives = 148/201 (73%), Gaps = 4/201 (1%)
Query: 153 GSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVST 212
G CGSCWAFSTV VEGIN+I TG L SLSEQEL+DC+ T N GCNGGLM+ A+++I +
Sbjct: 1 GKCGSCWAFSTVVGVEGINKIKTGQLVSLSEQELVDCE-TDNEGCNGGLMENAYEFIKKS 59
Query: 213 GGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEAS 272
GG+ E YPY +G+C+ +K + VTI+G+ VP N E++L+KA+ANQP+SVAI+AS
Sbjct: 60 GGITTERLYPYKARDGSCDSSKMNAPAVTIDGHEMVPANDENALMKAVANQPVSVAIDAS 119
Query: 273 GRDFQFYSGGVYDG-HCGTQLDHGVAAVGYGST-RGLDYIIVKNSWGPKWGEKGYIRMKR 330
G D QFYS GVY G CG +LDHGVA VGYG+ G Y IVKNSWG WGE+GYIRM+R
Sbjct: 120 GSDMQFYSEGVYTGDSCGNELDHGVAVVGYGTALDGTKYWIVKNSWGTGWGEQGYIRMQR 179
Query: 331 NTGKPE-GLCGINKMASYPIK 350
E G+CGI ASYP+K
Sbjct: 180 GVDAAEGGVCGIAMEASYPLK 200
>gi|312381833|gb|EFR27483.1| hypothetical protein AND_05794 [Anopheles darlingi]
Length = 344
Score = 237 bits (605), Expect = 5e-60, Method: Compositional matrix adjust.
Identities = 144/323 (44%), Positives = 191/323 (59%), Gaps = 26/323 (8%)
Query: 49 ESWMS---KFEKVYESLDEKLERFEIFKDNLRHIDETNRKI----KNYWLGLNEFADLRH 101
E W + + K Y+S E+ R +I+ N I + N++ + + L +N++ADL H
Sbjct: 25 EEWNAFKLQHRKKYDSESEERIRMKIYVQNKHKIAKHNQRYDLGQEKFRLRVNKYADLLH 84
Query: 102 EEFKEMFLGLKPD-------LARRKDQSHED---FSYKDVVDLPKSVDWRKKGAVTHVKN 151
EEF G L R + + E+ + VD+P ++DWR+KGAVT VK+
Sbjct: 85 EEFVHTLNGFNRSAAAGSKLLGREQLMTIEEPITWIEPANVDVPTTIDWREKGAVTPVKD 144
Query: 152 QGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIV 210
QG CGSCW+FS A+EG + TG L SLSEQ L+DC Y NNGCNGGLMD AFQY+
Sbjct: 145 QGHCGSCWSFSATGALEGQHFRKTGKLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFQYVK 204
Query: 211 STGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVAI 269
G+ E+ YPY + C ++ T G+ D+PQ E +L KALA P+SVAI
Sbjct: 205 DNKGIDTEKAYPYEAIDDECHYNP-KAIGATDKGFVDIPQGDEKALKKALATVGPVSVAI 263
Query: 270 EASGRDFQFYSGGV-YDGHCGT-QLDHGVAAVGYGSTR-GLDYIIVKNSWGPKWGEKGYI 326
+AS FQFYS GV Y+ C + QLDHGV AVGYG+T G DY +VKNSWG WG++GY+
Sbjct: 264 DASHESFQFYSEGVYYEPQCDSEQLDHGVLAVGYGTTEDGEDYWLVKNSWGTTWGDQGYV 323
Query: 327 RMKRNTGKPEGLCGINKMASYPI 349
+M RN E CGI ASYP+
Sbjct: 324 KMARNR---ENHCGIATTASYPL 343
>gi|42572491|ref|NP_974341.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|332642714|gb|AEE76235.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 290
Score = 237 bits (605), Expect = 5e-60, Method: Compositional matrix adjust.
Identities = 123/238 (51%), Positives = 163/238 (68%), Gaps = 7/238 (2%)
Query: 47 LFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNR-KIKNYWLGLNEFADLRHEEFK 105
++E W+ + K Y L EK RF+IFKDNL+ +DE N + + +GL FADL +EEF+
Sbjct: 43 MYEQWLVENRKNYNGLGEKERRFKIFKDNLKFVDEHNSVPDRTFEVGLTRFADLTNEEFR 102
Query: 106 EMFLGLKPDLARRKDQ-SHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTV 164
++L + + R KD E + YK+ LP VDWR GAV VK+QG+CGSCWAFS V
Sbjct: 103 AIYL--RKKMERTKDSVKTERYLYKEGDVLPDEVDWRANGAVVSVKDQGNCGSCWAFSAV 160
Query: 165 AAVEGINQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIVSTGGLHKEEDYPY 223
AVEGINQI TG L SLSEQEL+DCD + N GC+GG+M+YAF++I+ GG+ ++DYPY
Sbjct: 161 GAVEGINQITTGELISLSEQELVDCDRGFVNAGCDGGIMNYAFEFIMKNGGIETDQDYPY 220
Query: 224 IMEE-GTCEMTK-GESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFY 279
+ G C K + VVTI+GY DVP++ E SL KA+A+QP+SVAIEAS + FQ Y
Sbjct: 221 NANDLGLCNADKNNNTRVVTIDGYEDVPRDDEKSLKKAVAHQPVSVAIEASSQAFQLY 278
>gi|294890024|ref|XP_002773045.1| cathepsin L, putative [Perkinsus marinus ATCC 50983]
gi|239877748|gb|EER04861.1| cathepsin L, putative [Perkinsus marinus ATCC 50983]
Length = 329
Score = 237 bits (605), Expect = 5e-60, Method: Compositional matrix adjust.
Identities = 141/306 (46%), Positives = 175/306 (57%), Gaps = 8/306 (2%)
Query: 48 FESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEM 107
F + KF K YES +E+++R IF+ NL HI+ N K +Y LG+NE ADL HEEF +
Sbjct: 28 FMGFQHKFGKNYESKEEEVKRNAIFQANLHHIEHVNAKNLSYKLGVNEHADLTHEEFAAL 87
Query: 108 FLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAV 167
LG RR D E D LP SVDWR K +T VKNQGSCGS WAFST A+
Sbjct: 88 KLGTLKMSTRRDD---EFVVEADTTQLPTSVDWRNKSVLTPVKNQGSCGSSWAFSTTGAL 144
Query: 168 EGINQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIME 226
I TG L SLSEQEL+DC Y N+GC GG M A++YI + GL +E YPY
Sbjct: 145 GAQYAIATGKLLSLSEQELVDCSLKYGNDGCIGGYMGAAYEYI-NQAGLDQESTYPYKGW 203
Query: 227 EGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDG 286
+ C E + I + +E SL+KALA+ P+SV + AS +F+FY GVY
Sbjct: 204 DEPC-FRSSEKKADGIPVRFVLNTKTEQSLMKALADAPVSVGMYASDPNFRFYRSGVYSS 262
Query: 287 -HCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMA 345
C + DH V AVGYG+ +G DY I+KNSWG KWG GY +KR G G C I +
Sbjct: 263 TTCNGETDHAVVAVGYGADKGSDYFILKNSWGSKWGIGGYFFLKRGVGG-HGECNILEYM 321
Query: 346 SYPIKK 351
P K
Sbjct: 322 LVPTLK 327
>gi|332375975|gb|AEE63128.1| unknown [Dendroctonus ponderosae]
Length = 338
Score = 237 bits (605), Expect = 5e-60, Method: Compositional matrix adjust.
Identities = 141/317 (44%), Positives = 187/317 (58%), Gaps = 20/317 (6%)
Query: 49 ESWMS---KFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKN----YWLGLNEFADLRH 101
E W S + +K YES E+ R +IF DN + + N+ + Y L +N++ DL H
Sbjct: 25 EQWNSFKVQHKKQYESETEERFRMKIFMDNSHKVAKHNKLFEQGLYPYKLAMNKYGDLLH 84
Query: 102 EEFKEMFLGL---KPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSC 158
EF + G K L R + Q F VD+P +VDWR++GAVT VK+QG CGSC
Sbjct: 85 HEFVGLLNGFNRTKTYLKRGELQDSITFIEPAHVDIPDTVDWRQEGAVTPVKDQGHCGSC 144
Query: 159 WAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIVSTGGLHK 217
W+FS A+EG + T L SLSEQ L+DC + + NNGCNGGLMD AF+YI + GG+
Sbjct: 145 WSFSATGALEGQHFRQTKKLVSLSEQNLVDCSSRFGNNGCNGGLMDNAFRYIKNNGGIDT 204
Query: 218 EEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVAIEASGRDF 276
E YPY+ E+ + ++ T G+ D+P ED L A+A P+S+AI+AS F
Sbjct: 205 EAAYPYMGEDEKFRYS-AKNRGATDKGFVDIPSGDEDKLKAAVATVGPISIAIDASHESF 263
Query: 277 QFYSGGVY-DGHC-GTQLDHGVAAVGYGSTR--GLDYIIVKNSWGPKWGEKGYIRMKRNT 332
Q YS GVY D C T+LDHGV VGYG+ G+DY +VKNSWG WG GYI+M RN
Sbjct: 264 QLYSNGVYSDPTCSSTELDHGVLVVGYGTDEKTGMDYWLVKNSWGDTWGLDGYIKMARNQ 323
Query: 333 GKPEGLCGINKMASYPI 349
+ CG+ ASYP+
Sbjct: 324 ---DNQCGVATQASYPL 337
>gi|328776427|ref|XP_625135.3| PREDICTED: cathepsin L-like [Apis mellifera]
Length = 351
Score = 237 bits (605), Expect = 6e-60, Method: Compositional matrix adjust.
Identities = 145/317 (45%), Positives = 181/317 (57%), Gaps = 20/317 (6%)
Query: 49 ESWMS---KFEKVYESLDEKLERFEIFKDNLRHIDETNR----KIKNYWLGLNEFADLRH 101
+ WM+ + +KVY+S E+ R +IF DN I + N K +Y L +N++ D+ H
Sbjct: 32 QEWMTFKMEHKKVYKSDVEERFRMKIFMDNKHKIAKHNSNYEMKKVSYKLKMNKYGDMLH 91
Query: 102 EEFKEMFLG----LKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGS 157
EF + G + L + F V LPK VDWRK+GAVT VK+QG CGS
Sbjct: 92 HEFVNILNGFNKSINTQLRSERLPVGASFIEPANVVLPKKVDWRKEGAVTPVKDQGHCGS 151
Query: 158 CWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIVSTGGLH 216
CW+FS A+EG + TG L SLSEQ LIDC Y NNGCNGGLMD AFQYI GL
Sbjct: 152 CWSFSATGALEGQHFRRTGVLVSLSEQNLIDCSGKYGNNGCNGGLMDQAFQYIKDNKGLD 211
Query: 217 KEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVAIEASGRD 275
E YPY E C S + + GY D+P E L A+A P+SVAI+AS +
Sbjct: 212 TEASYPYEAENDKCRYNPANSGAIDV-GYIDIPTGDEKLLKAAVATIGPVSVAIDASHQS 270
Query: 276 FQFYSGGV-YDGHCGT-QLDHGVAAVGYGSTR-GLDYIIVKNSWGPKWGEKGYIRMKRNT 332
FQFYS GV Y+ C + +LDHGV +GYG+ G DY +VKNSWG WG GYI+M RN
Sbjct: 271 FQFYSEGVYYEPECSSEELDHGVLVIGYGTNENGQDYWLVKNSWGETWGNNGYIKMARNK 330
Query: 333 GKPEGLCGINKMASYPI 349
CGI ASYP+
Sbjct: 331 ---LNHCGIASSASYPL 344
>gi|317135059|gb|ADV03094.1| cathepsin L [Hyriopsis cumingii]
gi|372126672|gb|AEX88474.1| cathepsin L [Hyriopsis schlegelii]
Length = 333
Score = 237 bits (605), Expect = 6e-60, Method: Compositional matrix adjust.
Identities = 134/312 (42%), Positives = 185/312 (59%), Gaps = 19/312 (6%)
Query: 48 FESWMSKFEKVYESLDEKLERFEIFKDNL----RHIDETNRKIKNYWLGLNEFADLRHEE 103
++ ++ + K Y + +E + R+ ++KDN RH + ++ YWL +NE+ DL +EE
Sbjct: 30 WQEFVRIYNKTYRAHEEPV-RYSVWKDNFLAINRHNSKADQGFHTYWLAMNEYGDLTNEE 88
Query: 104 FKEMFLGLK--PDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAF 161
+ + GLK ++ RR F Y ++ + P VDWR KG VT VKNQG CGSC+AF
Sbjct: 89 YFRLRTGLKINANIERRGLV----FKYTNLSEYPSEVDWRSKGYVTPVKNQGGCGSCYAF 144
Query: 162 STVAAVEGINQIVTGNLASLSEQELIDCD-NTYNNGCNGGLMDYAFQYIVSTGGLHKEED 220
S AVEG + TG L SLSEQ ++DC N GC GGLMD +F YI G+ EE
Sbjct: 145 SATGAVEGQHFRKTGKLVSLSEQNIVDCSFKEGNKGCRGGLMDKSFTYIKDNNGIDTEEA 204
Query: 221 YPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVAIEASGRDFQFY 279
YPY +G C + E T+ GY D+P+N E +L A+ P+SVAI+ +F+FY
Sbjct: 205 YPYEARDGPCRFRRSEVG-ATVRGYVDLPENDEIALQHAVTTIGPISVAIDGHHFNFRFY 263
Query: 280 SGGVYDG-HCG-TQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEG 337
GV+D +C T+++HGV VGYG+ GLDY +VKNSWG +WG +GYI M RN +
Sbjct: 264 HHGVFDNPNCSKTKINHGVLVVGYGTRDGLDYWLVKNSWGERWGAEGYILMSRNN---DN 320
Query: 338 LCGINKMASYPI 349
C I ASYPI
Sbjct: 321 QCCITCAASYPI 332
>gi|449524450|ref|XP_004169236.1| PREDICTED: vignain-like [Cucumis sativus]
Length = 283
Score = 237 bits (605), Expect = 6e-60, Method: Compositional matrix adjust.
Identities = 120/289 (41%), Positives = 181/289 (62%), Gaps = 14/289 (4%)
Query: 67 ERFEIFKDNLRHIDETNRKIKNYWLGLNEFADLRHEEFKEMF---LGLKPDLARRKDQSH 123
RF++FKDN +H+ + N K+ L LN+FAD+ +EF + + + +L +
Sbjct: 3 RRFKVFKDNAKHVFKVNHMGKSLKLKLNQFADMSDDEFSKTYGSNITYYKNLHAKVGGRV 62
Query: 124 EDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSE 183
F Y+ ++P S+DWRKKGA + C CWAF+ VAAVE I+QI T L SLSE
Sbjct: 63 GGFMYERATNIPSSIDWRKKGA------RRMC--CWAFAAVAAVESIHQIRTNELVSLSE 114
Query: 184 QELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTIN 243
QE++DCD GC GG AF++I+ GG+ E +YPY +G C +E VTI+
Sbjct: 115 QEVVDCDYKVG-GCRGGDYISAFEFIMENGGITVENNYPYYAGDGYCRRRGPNNERVTID 173
Query: 244 GYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVY--DGHCGTQLDHGVAAVGY 301
GY +VP+N+E +L+KA+A+QP++V+I + G DF+FY G++ + CG ++DH V VGY
Sbjct: 174 GYENVPRNNEYALMKAVAHQPVAVSIASRGSDFKFYGEGMFTEENFCGIRIDHTVVVVGY 233
Query: 302 GSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
GS DY I++N +G +WG GY++M+R T P+G+CG+ ++P+K
Sbjct: 234 GSDEEGDYWIIRNQYGTQWGMNGYMKMQRGTRSPQGVCGMAMYPAFPVK 282
>gi|390368662|ref|XP_780781.2| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
Length = 333
Score = 237 bits (604), Expect = 7e-60, Method: Compositional matrix adjust.
Identities = 139/311 (44%), Positives = 181/311 (58%), Gaps = 15/311 (4%)
Query: 48 FESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIK----NYWLGLNEFADLRHEE 103
+ W ++ K Y S +E+ R I++ NL + + N K Y LG+N+FADL++EE
Sbjct: 28 WNQWKNEHGKRYLSDEEEASRKLIWEKNLDIVIKHNLKYDLGHFTYALGMNQFADLQNEE 87
Query: 104 FKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFST 163
F M G + + + + +V LPK+VDWR KG VT VK+QG CGSCWAFS
Sbjct: 88 FVAMMTGFRVNGTSKAAKGSTFLPSNNVDKLPKTVDWRTKGYVTPVKDQGQCGSCWAFSA 147
Query: 164 VAAVEGINQIVTGNLASLSEQELIDCDNTYNN-GCNGGLMDYAFQYIVSTGGLHKEEDYP 222
++EG TG L SLSEQ L+DC +Y N GC+GG MD AFQYI+ GG+ E Y
Sbjct: 148 TGSLEGQQFKKTGKLVSLSEQNLVDC--SYRNYGCHGGFMDRAFQYIIDAGGIDTEATYS 205
Query: 223 YIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVAIEASGRDFQFYSG 281
Y +G C K T+ GY DV SE +L KA+A+ P+SVAI+AS + F+FY
Sbjct: 206 YRAVDGNCHFKKANVG-ATVTGYTDVTSGSEKALQKAVAHIGPISVAIDASHKFFKFYKS 264
Query: 282 GVYD--GHCGTQLDHGVAAVGYGSTR-GLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGL 338
GVY+ G T+L H V VGYG+T G DY IVKNSW WG GY+ M RN +
Sbjct: 265 GVYNEPGCSTTRLGHAVLVVGYGTTSDGTDYWIVKNSWAKTWGMNGYLWMSRN---KDNQ 321
Query: 339 CGINKMASYPI 349
CGI ASYP+
Sbjct: 322 CGIASEASYPM 332
>gi|318054062|ref|NP_001187179.1| cathepsin S precursor [Ictalurus punctatus]
gi|190351079|gb|ACE75948.1| cathepsin S [Ictalurus punctatus]
Length = 329
Score = 237 bits (604), Expect = 7e-60, Method: Compositional matrix adjust.
Identities = 136/318 (42%), Positives = 182/318 (57%), Gaps = 18/318 (5%)
Query: 42 DKLIDL-FESWMSKFEKVYESLDEKLERFEIFKDNLR----HIDETNRKIKNYWLGLNEF 96
D+ +D+ + W K Y S E+L R EI++ NLR H E + + Y LG+N
Sbjct: 19 DQSLDMHWLMWKKNHSKTYTSELEELGRREIWERNLRLITVHNLEASLGMHTYDLGMNHM 78
Query: 97 ADLRHEEFKEMFLGLK--PDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGS 154
D+ EE +MF G + P+L RR F + +P SVDWR+KG VT VKNQGS
Sbjct: 79 GDMAREEILQMFAGTRVPPNLTRRSST----FVASSGISVPDSVDWREKGYVTEVKNQGS 134
Query: 155 CGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNN-GCNGGLMDYAFQYIVSTG 213
CGSCWAFS A+EG + TG + SLS Q L+DC + Y N GCNGG M AFQY++ G
Sbjct: 135 CGSCWAFSAAGALEGQLKRTTGQVKSLSPQNLVDCSSKYGNKGCNGGFMTEAFQYVIDNG 194
Query: 214 GLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVAIEAS 272
G+ +E YPY +G C + + + Y+ V Q E++L +A+A P+SVAI+A+
Sbjct: 195 GIDSDEAYPYTAMDGQCRYDQAQ-RAANCSSYNYVSQGDEEALKQAVATIGPISVAIDAT 253
Query: 273 GRDFQFYSGGVYDGHCGTQ-LDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRN 331
F Y GVY+ T V VGYGS G DY +VKNSWGP++G+ GYIR+ RN
Sbjct: 254 RPMFILYHSGVYNDQTSTPWFTFWVQDVGYGSLNGEDYWLVKNSWGPRFGDGGYIRIARN 313
Query: 332 TGKPEGLCGINKMASYPI 349
G +CGI A YP+
Sbjct: 314 KGN---MCGIANYACYPL 328
>gi|30023547|gb|AAO48766.2| cathepsin L-like cysteine proteinase [Tenebrio molitor]
Length = 337
Score = 237 bits (604), Expect = 7e-60, Method: Compositional matrix adjust.
Identities = 136/305 (44%), Positives = 188/305 (61%), Gaps = 16/305 (5%)
Query: 57 KVYESLDEKLERFEIFKDNLRHIDETNR----KIKNYWLGLNEFADLRHEEFKEMFLGL- 111
K Y+S E+ R +IF +N + + N+ + ++ LG+N++AD+ H EF ++ G
Sbjct: 36 KQYQSETEERFRMKIFMENSHTVAKHNKLYAQGLVSFKLGINKYADMLHHEFVQVLNGFN 95
Query: 112 KPDLARRKDQSHEDFSY--KDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEG 169
+ R +S + ++ V LP +DWR KGAVT VK+QG CGSCW+FS ++EG
Sbjct: 96 RTKSGLRSGESDDSVTFLPPANVQLPGQIDWRDKGAVTPVKDQGQCGSCWSFSATGSLEG 155
Query: 170 INQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEG 228
+ +G L SLSEQ L+DC + NNGCNGGLMD AF+YI + GG+ E+ YPY E+
Sbjct: 156 QHFRQSGKLVSLSEQNLVDCSEKFGNNGCNGGLMDNAFRYIKANGGIDTEQAYPYKAEDE 215
Query: 229 TCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVAIEASGRDFQFYSGGV-YDG 286
C K +++ T GY D+ +ED L A+A P+SVAI+AS + FQ YSGGV Y+
Sbjct: 216 KCHY-KPKNKGATDRGYVDIESGNEDKLQSAVATVGPVSVAIDASHQSFQLYSGGVYYEP 274
Query: 287 HC-GTQLDHGVAAVGYGST-RGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKM 344
C +QLDHGV VGYG+ G DY +VKNSWG WG++GYI+M RN CGI
Sbjct: 275 DCSASQLDHGVLVVGYGTEDDGTDYWLVKNSWGKSWGDQGYIKMARNRNNN---CGIATE 331
Query: 345 ASYPI 349
ASYP+
Sbjct: 332 ASYPL 336
>gi|157833553|pdb|1PPO|A Chain A, Determination Of The Structure Of Papaya Protease Omega
gi|1460162|prf||1411165A:PDB=1PPO thiol proteinase omega
Length = 216
Score = 236 bits (603), Expect = 8e-60, Method: Compositional matrix adjust.
Identities = 118/217 (54%), Positives = 153/217 (70%), Gaps = 2/217 (0%)
Query: 134 LPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTY 193
LP++VDWRKKGAVT V++QGSCGSCWAFS VA VEGIN+I TG L LSEQEL+DC+
Sbjct: 1 LPENVDWRKKGAVTPVRHQGSCGSCWAFSAVATVEGINKIRTGKLVELSEQELVDCERR- 59
Query: 194 NNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSE 253
++GC GG YA +Y V+ G+H YPY ++GTC + +V +G V N+E
Sbjct: 60 SHGCKGGYPPYALEY-VAKNGIHLRSKYPYKAKQGTCRAKQVGGPIVKTSGVGRVQPNNE 118
Query: 254 DSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVAAVGYGSTRGLDYIIVK 313
+LL A+A QP+SV +E+ GR FQ Y GG+++G CGT++DH V AVGYG + G YI++K
Sbjct: 119 GNLLNAIAKQPVSVVVESKGRPFQLYKGGIFEGPCGTKVDHAVTAVGYGKSGGKGYILIK 178
Query: 314 NSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
NSWG WGEKGYIR+KR G G+CG+ K + YP K
Sbjct: 179 NSWGTAWGEKGYIRIKRAPGNSPGVCGLYKSSYYPTK 215
>gi|55740402|gb|AAV63977.1| cathepsin L precursor [Artemia franciscana]
Length = 338
Score = 236 bits (603), Expect = 8e-60, Method: Compositional matrix adjust.
Identities = 138/322 (42%), Positives = 189/322 (58%), Gaps = 15/322 (4%)
Query: 38 LTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETN----RKIKNYWLGL 93
L+ + L D + + + +K Y S E+ R +I+ +N + + N + K+Y + +
Sbjct: 21 LSLTNLLADEWHLFKATHKKEYPSQLEEKFRMKIYLENKHKVAKHNILYEKGEKSYQVAM 80
Query: 94 NEFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDV--VDLPKSVDWRKKGAVTHVKN 151
N+F DL H EF+ + G + + ++ F++ + V++P+SVDWR+KGA+T VK+
Sbjct: 81 NKFGDLLHHEFRSIMNGYQHK-KQNSSRAESTFTFMEPANVEVPESVDWREKGAITPVKD 139
Query: 152 QGSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIV 210
QG CGSCWAFS+ A+EG TG L SLSEQ LIDC Y N GCNGGLMD AFQYI
Sbjct: 140 QGQCGSCWAFSSTGALEGQTFRKTGKLISLSEQNLIDCSGKYGNEGCNGGLMDQAFQYIK 199
Query: 211 STGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVAI 269
G+ E YPY E+ C V G+ D+P ED L A+A P+SVAI
Sbjct: 200 DNKGIDTENTYPYEAEDDVCRYNPRNRGAVD-RGFVDIPSGEEDKLKAAVATVGPVSVAI 258
Query: 270 EASGRDFQFYSGGV-YDGHCGT-QLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIR 327
+AS FQFYS GV Y+ C + LDHGV VGYGS G DY +VKNSW WG++GYI+
Sbjct: 259 DASHESFQFYSKGVYYEPSCDSDDLDHGVLVVGYGSDNGKDYWLVKNSWSEHWGDEGYIK 318
Query: 328 MKRNTGKPEGLCGINKMASYPI 349
+ RN + CG+ ASYP+
Sbjct: 319 IARNR---KNHCGVATAASYPL 337
>gi|33112581|gb|AAP94046.1| cathepsin-L-like cysteine peptidase 02 [Tenebrio molitor]
Length = 337
Score = 236 bits (603), Expect = 8e-60, Method: Compositional matrix adjust.
Identities = 136/305 (44%), Positives = 189/305 (61%), Gaps = 16/305 (5%)
Query: 57 KVYESLDEKLERFEIFKDNLRHIDETNR----KIKNYWLGLNEFADLRHEEFKEMFLGL- 111
K Y+S E+ R +IF +N + + N+ + ++ LG+N++AD+ H EF ++ G
Sbjct: 36 KQYQSDTEERFRMKIFMENSHTVAKHNKLYAQGLVSFKLGINKYADMLHHEFVQVLNGFN 95
Query: 112 KPDLARRKDQSHEDFSY--KDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEG 169
+ R +S + ++ V LP +DWR KGAVT VK+QG CGSCW+FS ++EG
Sbjct: 96 RTKSGLRSGESDDSVTFLPPANVQLPGQIDWRDKGAVTPVKDQGQCGSCWSFSATGSLEG 155
Query: 170 INQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEG 228
+ +G L SLSEQ L+DC + NNGCNGGLMD AF+YI + GG+ E+ YPY E+
Sbjct: 156 QHFRKSGKLVSLSEQNLVDCSEKFGNNGCNGGLMDNAFRYIKANGGIDTEQAYPYKAEDE 215
Query: 229 TCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVAIEASGRDFQFYSGGV-YDG 286
C K +++ T GY D+ +ED L A+A P+SVAI+AS + FQ YSGGV Y+
Sbjct: 216 KCHY-KPKNKGATDRGYVDIESGNEDKLQSAVATVGPVSVAIDASHQSFQLYSGGVYYEP 274
Query: 287 HCG-TQLDHGVAAVGYGST-RGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKM 344
C +QLDHGV VGYG+ G DY +VKNSWG WG++GYI+M RN + CGI
Sbjct: 275 ECSPSQLDHGVLVVGYGTEDDGTDYWLVKNSWGKSWGDQGYIKMARNR---DNNCGIATE 331
Query: 345 ASYPI 349
ASYP+
Sbjct: 332 ASYPL 336
>gi|33112583|gb|AAP94047.1| cathepsin-L-like cysteine peptidase 03 [Tenebrio molitor]
Length = 337
Score = 236 bits (603), Expect = 8e-60, Method: Compositional matrix adjust.
Identities = 136/305 (44%), Positives = 189/305 (61%), Gaps = 16/305 (5%)
Query: 57 KVYESLDEKLERFEIFKDNLRHIDETNR----KIKNYWLGLNEFADLRHEEFKEMFLGL- 111
K Y+S E+ R +IF +N + + N+ + ++ LG+N++AD+ H EF ++ G
Sbjct: 36 KQYQSDTEERFRMKIFMENSHTVAKHNKLYAQGLVSFKLGINKYADMLHHEFVQVLNGFN 95
Query: 112 KPDLARRKDQSHEDFSY--KDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVAAVEG 169
+ R +S + ++ V LP +DWR KGAVT VK+QG CGSCW+FS ++EG
Sbjct: 96 RTKSGLRSGESDDSVTFLPPANVQLPGQIDWRDKGAVTPVKDQGQCGSCWSFSATGSLEG 155
Query: 170 INQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEG 228
+ +G L SLSEQ L+DC + NNGCNGGLMD AF+YI + GG+ E+ YPY E+
Sbjct: 156 QHFRKSGKLVSLSEQNLVDCSEKFGNNGCNGGLMDNAFRYIKANGGIDTEQAYPYKAEDE 215
Query: 229 TCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVAIEASGRDFQFYSGGV-YDG 286
C K +++ T GY D+ +ED L A+A P+SVAI+AS + FQ YSGGV Y+
Sbjct: 216 KCHY-KPKNKGATDRGYVDIESGNEDKLQSAVATVGPVSVAIDASHQSFQLYSGGVYYEP 274
Query: 287 HC-GTQLDHGVAAVGYGST-RGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKM 344
C +QLDHGV VGYG+ G DY +VKNSWG WG++GYI+M RN + CGI
Sbjct: 275 DCSASQLDHGVLVVGYGTEDDGTDYWLVKNSWGKSWGDQGYIKMARNR---DNNCGIATE 331
Query: 345 ASYPI 349
ASYP+
Sbjct: 332 ASYPL 336
>gi|308322281|gb|ADO28278.1| cathepsin L [Ictalurus furcatus]
Length = 359
Score = 236 bits (603), Expect = 9e-60, Method: Compositional matrix adjust.
Identities = 130/313 (41%), Positives = 192/313 (61%), Gaps = 17/313 (5%)
Query: 48 FESWMSKFEKVYESLDEKLERFEIFKDN----LRHIDETNRKIKNYWLGLNEFADLRHEE 103
F+ W KF K+Y+S++E+ +R + +++N + H ++ IK+Y LG+N FAD+ ++E
Sbjct: 25 FQEWKQKFGKIYKSVEEESQRKKTWQENHKLVMNHNILADKGIKSYRLGMNYFADMSNQE 84
Query: 104 FKEMFLGLKPDLARRKDQSHEDFSYKDVVD---LPKSVDWRKKGAVTHVKNQGSCGSCWA 160
+++ K L+ + +H ++ V LP +V+W + G VT V+ Q C SCWA
Sbjct: 85 YRQSVF--KGCLSFNRTLNHSAATFLRQVGGPALPNTVNWTQMGYVTEVEEQKQCNSCWA 142
Query: 161 FSTVAAVEGINQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIVSTGGLHKEE 219
FS A+EG TG L SLS+Q+L+DC + NNGC GGLM++AF+Y+ GGLH EE
Sbjct: 143 FSATGALEGQTFKKTGKLVSLSKQQLVDCSKKFGNNGCKGGLMNWAFEYVKENGGLHTEE 202
Query: 220 DYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVAIEASGRDFQF 278
YPY ++G+C G + VT G+ + E++L +A+A P+SVAI+A+ FQ
Sbjct: 203 SYPYEAKDGSCRDNLG-TVGVTCTGHVQINSEDENALQEAVATIGPISVAIDANHTSFQL 261
Query: 279 YSGGVYD-GHCG-TQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPE 336
Y G+YD C T ++HGV AVGYG+ G DY ++KNSWG WG+KGYI+M RN
Sbjct: 262 YESGLYDEPDCSCTDMNHGVLAVGYGTDDGKDYWLIKNSWGINWGDKGYIKMSRNKNNQ- 320
Query: 337 GLCGINKMASYPI 349
CGI ASYP+
Sbjct: 321 --CGIATAASYPL 331
>gi|7381610|gb|AAF61565.1|AF227957_1 cathepsin L-like proteinase precursor [Rhipicephalus microplus]
Length = 332
Score = 236 bits (603), Expect = 9e-60, Method: Compositional matrix adjust.
Identities = 144/321 (44%), Positives = 188/321 (58%), Gaps = 17/321 (5%)
Query: 39 TSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRK----IKNYWLGLN 94
+S + L +E++ + +K Y+S E+L RF+IF +N I + N K + +Y LG+N
Sbjct: 18 SSQEILRTQWEAFKTTHKKSYQSHMEELLRFKIFTENSLIIAKHNAKYAKGLVSYKLGMN 77
Query: 95 EFADLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVD--LPKSVDWRKKGAVTHVKNQ 152
+F DL EF +F G RK +V D LPK VDWRKKGAVT VK+Q
Sbjct: 78 QFGDLLAHEFARIFNGHH---GTRKTGGSTFLPPANVNDSSLPKVVDWRKKGAVTPVKDQ 134
Query: 153 GSCGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIVS 211
G CGSCWAFS ++EG + + G L SLSEQ L+DC ++ NNGC GGLM+ AF+YI +
Sbjct: 135 GQCGSCWAFSATGSLEGQHFLKNGELVSLSEQNLVDCSQSFGNNGCEGGLMEDAFKYIKA 194
Query: 212 TGGLHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVAIE 270
G+ E+ YPY +G C K E T GY ++ SE L KA+A P+SVAI+
Sbjct: 195 NDGIDTEKSYPYEAVDGECRFKK-EDVGATDTGYVEIKAGSEVDLKKAVATVGPISVAID 253
Query: 271 ASGRDFQFYSGGVYD-GHCGTQ-LDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRM 328
AS FQ YS GVYD C ++ LDHGV VGYG G Y +VKNSW WG++GYI M
Sbjct: 254 ASHSSFQLYSEGVYDEPECSSEDLDHGVLVVGYGVKGGKKYWLVKNSWAESWGDQGYILM 313
Query: 329 KRNTGKPEGLCGINKMASYPI 349
R+ CGI ASYP+
Sbjct: 314 SRDNNNQ---CGIASQASYPL 331
>gi|52546918|gb|AAU81592.1| cysteine proteinase, partial [Petunia x hybrida]
Length = 196
Score = 236 bits (603), Expect = 9e-60, Method: Compositional matrix adjust.
Identities = 112/174 (64%), Positives = 134/174 (77%), Gaps = 1/174 (0%)
Query: 178 LASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYPYIMEEGTCEMTKGES 237
L SLSEQEL+DCDN N GCNGGLMD AF +I GG+ EE+YPY+ +G C++ K +
Sbjct: 5 LVSLSEQELVDCDNGENQGCNGGLMDLAFDFIKKKGGITTEENYPYMAADGKCDLKKRNT 64
Query: 238 EVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGRDFQFYSGGVYDGHCGTQLDHGVA 297
VV+I+G+ DVP N E+SLLKA+ANQP+SVAIEASG DFQFYS GV+ G CGT+LDHGVA
Sbjct: 65 PVVSIDGHEDVPPNDEESLLKAVANQPVSVAIEASGSDFQFYSEGVFTGDCGTELDHGVA 124
Query: 298 AVGYGST-RGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYPIK 350
VGYG+T G Y V+NSWGP+WGEKGYIRM+R+ EGLCGI SYPIK
Sbjct: 125 IVGYGTTLDGTKYWTVRNSWGPEWGEKGYIRMQRDIDAEEGLCGIAMQPSYPIK 178
>gi|225718114|gb|ACO14903.1| Cathepsin L precursor [Caligus clemensi]
Length = 336
Score = 236 bits (603), Expect = 9e-60, Method: Compositional matrix adjust.
Identities = 142/321 (44%), Positives = 185/321 (57%), Gaps = 22/321 (6%)
Query: 42 DKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNL----RHIDETNRKIKNYWLGLNEFA 97
D ++ +ESW K Y S E+ R +I+ +N RH E I Y++ +N +
Sbjct: 24 DVVLSDWESWKLMHGKTYSSSIEEKLRLKIYMENSLKISRHNSEALNGIHPYYMKMNHYG 83
Query: 98 DLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGS 157
DL H EF M G + A + + + LP VDWR++GAVT VKNQG CGS
Sbjct: 84 DLLHHEFVAMVNGYQ--YANKTASLGGTYIPNKNIQLPTHVDWREEGAVTPVKNQGQCGS 141
Query: 158 CWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIVSTGGLH 216
CW+FS A+EG + TG L SLSEQ L+DC + NNGC GGLMD+AF YI G+
Sbjct: 142 CWSFSATGALEGQDFRKTGKLISLSEQNLVDCSRKFGNNGCEGGLMDFAFTYIRDNKGID 201
Query: 217 KEEDYPYIMEEGTCEM---TKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVAIEAS 272
E YPY +G C KG S++ G+ D+ + SE L KA+A P+SVAI+AS
Sbjct: 202 TEASYPYEGIDGHCHYNPKNKGGSDI----GFVDIKKGSEKDLKKAVAGVGPISVAIDAS 257
Query: 273 GRDFQFYSGGVY-DGHCGT-QLDHGVAAVGYG--STRGLDYIIVKNSWGPKWGEKGYIRM 328
FQFYS GVY + C + +LDHGV VG+G S G DY +VKNSW KWG++GYI+M
Sbjct: 258 HMSFQFYSHGVYVESKCSSEELDHGVLVVGFGTDSVSGEDYWLVKNSWSEKWGDQGYIKM 317
Query: 329 KRNTGKPEGLCGINKMASYPI 349
RN E +CGI ASYP+
Sbjct: 318 ARN---KENMCGIASSASYPV 335
>gi|327263389|ref|XP_003216502.1| PREDICTED: cathepsin L1-like [Anolis carolinensis]
Length = 339
Score = 236 bits (603), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 143/320 (44%), Positives = 188/320 (58%), Gaps = 20/320 (6%)
Query: 44 LIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRK----IKNYWLGLNEFADL 99
L D +++W + K Y +E R I++ NL+ I N +Y LG+N F D+
Sbjct: 25 LDDHWQAWKTWHSKKYHQQEEGWRRM-IWEKNLKMIQLHNLDHSLGKHSYRLGMNHFGDM 83
Query: 100 RHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCW 159
+EEF+++ G K +K + E F + + +PKSVDWR+KG VT VK+QG CGSCW
Sbjct: 84 TNEEFRQVMNGYKHSKTEKKYRGSE-FLEPNFLVVPKSVDWREKGYVTPVKDQGQCGSCW 142
Query: 160 AFSTVAAVEGINQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIVSTGGLHKE 218
AFST ++EG + TG L SLSEQ L+DC N GCNGGLMD AF+YI GG+ E
Sbjct: 143 AFSTTGSLEGQHFRKTGKLVSLSEQNLVDCSRPEGNQGCNGGLMDQAFEYIADNGGIDSE 202
Query: 219 EDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVAIEASGRDFQ 277
E YPYI ++ + K E G+ DVP+ E +L+KA+A P+SVAI+AS FQ
Sbjct: 203 ESYPYIAKDDEDCLYKSEFNAANDTGFVDVPEGHERALMKAVAAVGPVSVAIDASHSTFQ 262
Query: 278 FYSGGV-YDGHCGT-QLDHGVAAVGYGSTRGLD------YIIVKNSWGPKWGEKGYIRMK 329
FY G+ YD C + +LDHGV VGYG G D Y IVKNSW KWG+KGYI M
Sbjct: 263 FYESGIYYDPDCSSEELDHGVLVVGYG-FEGTDDDNKKKYWIVKNSWSDKWGDKGYILMA 321
Query: 330 RNTGKPEGLCGINKMASYPI 349
++ CGI ASYP+
Sbjct: 322 KDRNNH---CGIATAASYPL 338
>gi|50539796|ref|NP_001002368.1| cathepsin L.1 precursor [Danio rerio]
gi|49900360|gb|AAH75887.1| Cathepsin L.1 [Danio rerio]
Length = 334
Score = 236 bits (603), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 136/312 (43%), Positives = 182/312 (58%), Gaps = 14/312 (4%)
Query: 48 FESWMSKFEKVYESLDEKLERFEIFKDN----LRHIDETNRKIKNYWLGLNEFADLRHEE 103
F +W KF K Y S +E+ R + N L H ++ +K+Y LG+ FAD+ +EE
Sbjct: 26 FHAWKLKFGKSYRSAEEESHRQLTWLTNRKLVLVHNMMADQGLKSYRLGMTYFADMSNEE 85
Query: 104 FKEM-FLGLKPDLARRKDQSHEDF-SYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAF 161
++++ F G + K + F + +P +VDWR KG VT +K+Q CGSCWAF
Sbjct: 86 YRQLVFRGCLGSMNNTKARGGSTFFRLRKAAVVPDTVDWRDKGYVTDIKDQKQCGSCWAF 145
Query: 162 STVAAVEGINQIVTGNLASLSEQELIDCDNTYNN-GCNGGLMDYAFQYIVSTGGLHKEED 220
S ++EG TG L SLSEQ+L+DC +Y N GC+GGLMD AFQYI + GL E+
Sbjct: 146 SATGSLEGQTFRKTGKLVSLSEQQLVDCSGSYGNYGCDGGLMDQAFQYIEANKGLDTEDS 205
Query: 221 YPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVAIEASGRDFQFY 279
YPY ++G C + + GY D+ E +L +A+A P+SVAI+A FQ Y
Sbjct: 206 YPYEAQDGECRFNP-STVGASCTGYVDIASGDESALQEAVATIGPISVAIDAGHSSFQLY 264
Query: 280 SGGVY-DGHC-GTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEG 337
S GVY + C ++LDHGV AVGYGS+ G DY IVKNSWG WG +GYI M RN
Sbjct: 265 SSGVYNEPDCSSSELDHGVLAVGYGSSNGDDYWIVKNSWGLDWGVQGYILMSRNKSNQ-- 322
Query: 338 LCGINKMASYPI 349
CGI ASYP+
Sbjct: 323 -CGIATAASYPL 333
>gi|339765072|gb|AEK01110.1| cathepsin L [Cristaria plicata]
gi|397880684|gb|AFO67888.1| cathepsin L [Cristaria plicata]
Length = 333
Score = 236 bits (603), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 133/310 (42%), Positives = 187/310 (60%), Gaps = 15/310 (4%)
Query: 48 FESWMSKFEKVYESLDEKLERFEIFKDNL----RHIDETNRKIKNYWLGLNEFADLRHEE 103
++ ++ K Y S E+L R+ ++K+N+ RH + ++ + YWL +NE+ DL +EE
Sbjct: 30 WQEFVRTHNKTY-SAHEELFRYAVWKENVLAINRHNSKADQGVHTYWLSMNEYGDLTNEE 88
Query: 104 FKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFST 163
+ + G + ++S F Y ++ + P+ VDWR+KG VT VK+QG CGSC+AFS
Sbjct: 89 YFRLRTGFI--MNGNIERSGSIFKYTNLSEYPRQVDWRRKGYVTRVKDQGGCGSCYAFSA 146
Query: 164 VAAVEGINQIVTGNLASLSEQELIDCD-NTYNNGCNGGLMDYAFQYIVSTGGLHKEEDYP 222
A+EG + TG L SLSEQ ++DC N GC GGLMD +F YI + G+ KEE YP
Sbjct: 147 TGALEGQHFRKTGKLVSLSEQNIVDCSFKEGNKGCKGGLMDKSFTYIKNNNGIDKEEAYP 206
Query: 223 YIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVAIEASGRDFQFYSG 281
Y +G C + E T GY D+P+N E +L A+A P+SVAI+ +F+FY
Sbjct: 207 YEARDGPCRFRRSEVG-ATDRGYVDLPENDETALRHAVATIGPISVAIDGHHFNFRFYDH 265
Query: 282 GVYDG-HCG-TQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLC 339
GV+D +C T+++HGV VGYG+ GLDY +VKNSWG WG KGYI M RN + C
Sbjct: 266 GVFDNPNCSKTKINHGVLVVGYGTRNGLDYWMVKNSWGRGWGAKGYILMSRNN---DNQC 322
Query: 340 GINKMASYPI 349
I ASYPI
Sbjct: 323 CIACAASYPI 332
>gi|380014284|ref|XP_003691169.1| PREDICTED: cathepsin L-like [Apis florea]
Length = 345
Score = 236 bits (602), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 144/317 (45%), Positives = 181/317 (57%), Gaps = 20/317 (6%)
Query: 49 ESWMS---KFEKVYESLDEKLERFEIFKDNLRHIDETNR----KIKNYWLGLNEFADLRH 101
+ WM+ + +K Y+S E+ R +IF DN I + N K +Y L +N++ D+ H
Sbjct: 26 QEWMTFKMEHKKAYKSDVEERFRMKIFMDNKHKIAKHNSNYEMKKVSYKLKMNKYGDMLH 85
Query: 102 EEFKEMFLG----LKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGS 157
EF + G + L + F V LPK VDWRK+GAVT VK+QG CGS
Sbjct: 86 HEFVNILNGFNKSINTQLRSERMPIGASFIEPANVALPKKVDWRKEGAVTPVKDQGHCGS 145
Query: 158 CWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIVSTGGLH 216
CW+FS A+EG + TG L SLSEQ LIDC Y NNGCNGGLMD AFQYI GL
Sbjct: 146 CWSFSATGALEGQHFRRTGVLVSLSEQNLIDCSGKYGNNGCNGGLMDQAFQYIKDNKGLD 205
Query: 217 KEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVAIEASGRD 275
E YPY E C S + + GY D+P +E L A+A P+SVAI+AS +
Sbjct: 206 TEASYPYEAENDKCRYNPANSGAIDV-GYIDIPTGNEKLLKAAVATIGPVSVAIDASHQS 264
Query: 276 FQFYSGGV-YDGHCGT-QLDHGVAAVGYGSTR-GLDYIIVKNSWGPKWGEKGYIRMKRNT 332
FQFYS GV Y+ C + +LDHGV +GYG+ G DY +VKNSWG WG GYI+M RN
Sbjct: 265 FQFYSEGVYYEPECSSEELDHGVLVIGYGTNENGEDYWLVKNSWGETWGNNGYIKMARNK 324
Query: 333 GKPEGLCGINKMASYPI 349
CGI ASYP+
Sbjct: 325 ---LNHCGIASSASYPL 338
>gi|269784818|ref|NP_001161481.1| cathepsin L1 precursor [Gallus gallus]
Length = 353
Score = 236 bits (602), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 141/320 (44%), Positives = 188/320 (58%), Gaps = 20/320 (6%)
Query: 42 DKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRK----IKNYWLGLNEFA 97
D L++SW SK Y +E R +++ NL+ I+ N +Y LG+N+F
Sbjct: 41 DSHWQLWKSWHSK---DYHEREESWRRV-VWEKNLKMIELHNLDHSLGKHSYKLGMNQFG 96
Query: 98 DLRHEEFKEMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGS 157
D+ EEF+++ G K + RK + + F ++ P+SVDWR+KG VT VK+QG CGS
Sbjct: 97 DMTAEEFRQLMNGYKHKKSERKYRGSQ-FLEPSFLEAPRSVDWREKGYVTPVKDQGQCGS 155
Query: 158 CWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTY-NNGCNGGLMDYAFQYIVSTGGLH 216
CWAFST A+EG + TG L SLSEQ L+DC N GCNGGLMD AFQY+ GG+
Sbjct: 156 CWAFSTTGALEGQHFRKTGKLVSLSEQNLVDCSRPEGNQGCNGGLMDQAFQYVQDNGGID 215
Query: 217 KEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVAIEASGRD 275
EE YPY ++ K E G+ D+PQ E +L+KA+A+ P+SVAI+A
Sbjct: 216 SEESYPYTAKDDEDCRYKAEYNAANDTGFVDIPQGHERALMKAVASVGPVSVAIDAGHSS 275
Query: 276 FQFYSGGV-YDGHCGTQ-LDHGVAAVGYG----STRGLDYIIVKNSWGPKWGEKGYIRMK 329
FQFY G+ Y+ C ++ LDHGV VGYG G Y IVKNSWG KWG+KGYI M
Sbjct: 276 FQFYQSGIYYEPDCSSEDLDHGVLVVGYGFEGEDVDGKKYWIVKNSWGEKWGDKGYIYMA 335
Query: 330 RNTGKPEGLCGINKMASYPI 349
++ + CGI ASYP+
Sbjct: 336 KDR---KNHCGIATAASYPL 352
>gi|300175245|emb|CBK20556.2| unnamed protein product [Blastocystis hominis]
Length = 325
Score = 236 bits (602), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 132/315 (41%), Positives = 189/315 (60%), Gaps = 11/315 (3%)
Query: 39 TSNDKL-IDL-FESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEF 96
T+ D L ++L F ++ KF K Y +E+ R +F +NL+ +D N K ++ LG+ F
Sbjct: 13 TAKDTLSVELQFAAFEKKFGKTYVGEEERRFRMSVFSNNLKIVDYYNSKQSSFVLGITPF 72
Query: 97 ADLRHEEFKEMFLGLKP--DLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGS 154
DL ++EF+E F A+ + S + +D LP+S+DWR K V+ VK+Q +
Sbjct: 73 IDLSNDEFRERFASNTAFEKKAKSVESSSSQQTSQDYSSLPRSIDWRAKNTVSSVKDQKN 132
Query: 155 CGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGG 214
CG+CWAF+ VA++EG+ TG + S Q+L+DCD + + GC+GGLM YA++Y+++ G
Sbjct: 133 CGACWAFAAVASIEGVYAQKTGKILDFSPQQLVDCDYS-SLGCSGGLMTYAYEYVMNN-G 190
Query: 215 LHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGR 274
+ E DYPY +G+C K V +I GY++VP S LLKA P+SVAI A
Sbjct: 191 ISLESDYPYKASQGSC---KKVDFVTSIMGYYEVPVGSTYELLKATTKNPVSVAIGADSI 247
Query: 275 DFQFYSGGVY-DGHCGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTG 333
FQ Y+ G+ + CGT L+HGV VGY ++IVKNSWG WGEKGYIR+ +
Sbjct: 248 FFQLYTSGILAEELCGTTLNHGVLLVGYELDTATPFLIVKNSWGASWGEKGYIRLALSDS 307
Query: 334 KPEGLCGINKMASYP 348
G CGIN MASYP
Sbjct: 308 YA-GTCGINLMASYP 321
>gi|300175452|emb|CBK20763.2| unnamed protein product [Blastocystis hominis]
Length = 313
Score = 236 bits (602), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 137/315 (43%), Positives = 190/315 (60%), Gaps = 20/315 (6%)
Query: 38 LTSNDKLIDLFESWMSKFEKVYESLDEKLERFEIFKDNLRHIDETNRKIKNYWLGLNEFA 97
L ++ + + F S+ +++ K Y + E+ R ++F N+ + N + Y +G FA
Sbjct: 13 LATSLRYENTFNSFEARYGKNYINAAERAFRQKVFAYNMEWAQKINSEDHPYTVGATPFA 72
Query: 98 DLRHEEFKEMFLG---LKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGS 154
D+ + EF L LKP + + E + ++VDWR+KGAVT VKNQ S
Sbjct: 73 DMTNTEFAVSKLCGCMLKPKMTKPATPIMEPAA--------EAVDWREKGAVTPVKNQAS 124
Query: 155 CGSCWAFSTVAAVEGINQIVTGNLASLSEQELIDCDNTYNNGCNGGLMDYAFQYIVSTGG 214
CGSCWAFS A+EG N + G L SLSEQ+L+DCD+ ++GC GGLM YAF+Y G
Sbjct: 125 CGSCWAFSATGAMEGRNFVANGELISLSEQQLVDCDHQ-SSGCGGGLMTYAFEY-AKKKG 182
Query: 215 LHKEEDYPYIMEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALANQPLSVAIEASGR 274
+ KEEDYPY + C+ K + VV GY +VP+ +L +A++ P+SVA+EA
Sbjct: 183 MCKEEDYPYHAVDEDCKDDKC-TPVVFPKGYEEVPRFDGAALKQAVSQGPVSVAVEADSI 241
Query: 275 DFQFYSGGVYDGH-CGTQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTG 333
FQ Y+GGV D CGT L+HGV AVGYG+ DY IVKNSWG WG+KGY+++K T
Sbjct: 242 VFQMYTGGVIDSSACGTSLNHGVLAVGYGA----DYWIVKNSWGESWGDKGYLKIKY-TE 296
Query: 334 KPEGLCGINKMASYP 348
G+CGIN+M SYP
Sbjct: 297 SGAGICGINQMNSYP 311
>gi|82796372|gb|ABB91778.1| cathepsin L [Hymeniacidon perlevis]
Length = 323
Score = 236 bits (601), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 137/308 (44%), Positives = 177/308 (57%), Gaps = 13/308 (4%)
Query: 48 FESWMSKFEKVYESLDEKLERFEIFKDNLRHID--ETNRKIKNYWLGLNEFADLRHEEFK 105
+E W ++ K Y E+L R++I++ N + I+ N + LG+N+F DL EF
Sbjct: 22 WEDWKNEHNKKYSDDLEELTRYKIWQGNQKIIEVHNANSDKFGFTLGMNKFGDLESHEFA 81
Query: 106 EMFLGLKPDLARRKDQSHEDFSYKDVVDLPKSVDWRKKGAVTHVKNQGSCGSCWAFSTVA 165
EMF G + + + S + F +VDWR KGAVT VKNQG CGSCWAFST
Sbjct: 82 EMFNGY---MMQARSNSTKVFVADPNYKADPTVDWRTKGAVTGVKNQGQCGSCWAFSTTG 138
Query: 166 AVEGINQIVTGNLASLSEQELIDCDNTYNN-GCNGGLMDYAFQYIVSTGGLHKEEDYPYI 224
++EG + + TG L SLSEQ L+DC N GCNGGLMD AF+YI GG+ E YPY
Sbjct: 139 SLEGQHFLKTGKLVSLSEQNLVDCSGKEGNEGCNGGLMDQAFEYIKKNGGIDTEASYPYQ 198
Query: 225 MEEGTCEMTKGESEVVTINGYHDVPQNSEDSLLKALAN-QPLSVAIEASGRDFQFYSGGV 283
+ C K T GY D+ + E++L++A+ P+SVAI+AS FQ Y GV
Sbjct: 199 AHDERCRF-KASDVGATCTGYVDIKREDENALMQAVEKIGPVSVAIDASHSSFQLYRSGV 257
Query: 284 -YDGHCG-TQLDHGVAAVGYGSTRGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGI 341
Y+ C T LDHGV A+GYG+ G DY +VKNSWG WG +GYI M RN CGI
Sbjct: 258 YYERECSQTALDHGVLAIGYGTEGGSDYWLVKNSWGTDWGMEGYIMMSRNRNNN---CGI 314
Query: 342 NKMASYPI 349
ASYP
Sbjct: 315 ATEASYPT 322
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.317 0.136 0.412
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,743,084,280
Number of Sequences: 23463169
Number of extensions: 252437393
Number of successful extensions: 700808
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 5930
Number of HSP's successfully gapped in prelim test: 1449
Number of HSP's that attempted gapping in prelim test: 672494
Number of HSP's gapped (non-prelim): 8653
length of query: 352
length of database: 8,064,228,071
effective HSP length: 143
effective length of query: 209
effective length of database: 9,003,962,200
effective search space: 1881828099800
effective search space used: 1881828099800
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 77 (34.3 bits)