BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 023507
(281 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|302143414|emb|CBI21975.3| unnamed protein product [Vitis vinifera]
Length = 286
Score = 265 bits (678), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 134/280 (47%), Positives = 188/280 (67%), Gaps = 26/280 (9%)
Query: 21 IITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEK 80
++ L + ASQ ++R+ E S+ E HE WMAQ+GR YKD EK R KIFK+N+ IE
Sbjct: 14 LLFFLAAWASQA-TARNLLEASMYERHEDWMAQYGRVYKDADEKSKRYKIFKDNVARIES 72
Query: 81 ANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHR-STTSSTFKYQNLSMTDVPTS 139
NK +++YKL N+F+DLTN+EFRA +K +H ST +++FKY++++ VP++
Sbjct: 73 FNKAMDKSYKLSINEFADLTNEEFRASRNRFK----AHICSTEATSFKYEHVAA--VPST 126
Query: 140 LDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGC 199
+DWR KGAVTPIK+Q +CG CWAF+AVAA+EGIT++ +G LI LSEQ+L+DC T N+
Sbjct: 127 VDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTKQNHAN 186
Query: 200 LGGSREKA--------------FAYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYW 245
+ +KA F + + G+F G CGT+LDH V VG+GT++DG YW
Sbjct: 187 NEKALQKAVAHQPIAVAIDAGGFEFQFYSSGVFTGQCGTELDHGVAAVGYGTSDDGMKYW 246
Query: 246 LIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 281
L+KNSWG WG+ GY+++ RD EGLCGI ++SYP A
Sbjct: 247 LVKNSWGTGWGEVGYIRMQRDVTAKEGLCGIAMQASYPTA 286
>gi|400180441|gb|AFP73357.1| cysteine protease [Solanum habrochaites]
Length = 344
Score = 262 bits (670), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 139/343 (40%), Positives = 195/343 (56%), Gaps = 76/343 (22%)
Query: 10 SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+ K++ + I + ++S + +RS + SV E HE WM++HGR YKDE+EK R
Sbjct: 2 AMKVDLMSILITLFFVISMFNSQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N+F+D+T++EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSEEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
FK ++S D+P++LDWR+ GAVT +KNQ +CGCCWAF+AV ++EG KI +GNL++ SE
Sbjct: 120 FKINDISDDDMPSNLDWRESGAVTQVKNQGQCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLLDCSTNGNNGCLGG---------------SREKAFAYIIQNQ--------------- 215
Q+LLDC+TN N GC GG SRE + Y+ Q
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIRENGGISRESDYEYLGQQYTCRSQEKTAAVQISS 238
Query: 216 -----------------------------------GIFNGVCGTQLDHAVTIVGFGTTED 240
G ++G C +++HAVT +G+GT E+
Sbjct: 239 YQVVPEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCANRINHAVTAIGYGTDEN 298
Query: 241 GANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 279
G YWL+KNSWG +WG+ G+MKI+RD GLC I SSYP
Sbjct: 299 GQKYWLLKNSWGTSWGEKGFMKIIRDYGNPSGLCDIAKLSSYP 341
>gi|302143412|emb|CBI21973.3| unnamed protein product [Vitis vinifera]
Length = 320
Score = 261 bits (667), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 139/316 (43%), Positives = 194/316 (61%), Gaps = 60/316 (18%)
Query: 19 FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
++ +L + ASQ ++R+ HE S+ E HE WM Q+GR YKD EK R KIFK+N+ I
Sbjct: 12 LALLFVLAAWASQA-TARNLHEASMYERHEDWMVQYGREYKDADEKSKRYKIFKDNVARI 70
Query: 79 EKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHR-STTSSTFKYQNLSMTDVP 137
E NK +++YKL N+F+DLTN+EFRA +K +H ST +++FKY+N+ T VP
Sbjct: 71 ESFNKAMDKSYKLSINEFADLTNEEFRASRNRFK----AHICSTEATSFKYENV--TAVP 124
Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-N 196
+++DWR KGAVTPIK+Q +CG CWAF+AVAA+EGIT++ +G LI LSEQ+L+DC T+G +
Sbjct: 125 STVDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGED 184
Query: 197 NGC-----------------------------LGGSREKAFAYIIQNQ------------ 215
GC + + EKA + +Q
Sbjct: 185 QGCTNYPYAGTDGTCNRKKAAHPAAKINGYEDVPANNEKALQKAVAHQPIAVAIDAGGSE 244
Query: 216 ------GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD--- 266
G+F G CGT+LDH V+ VG+GT++DG YWL+KNSWG WG+ GY+++ RD
Sbjct: 245 FQFYSSGVFTGQCGTELDHGVSAVGYGTSDDGMKYWLVKNSWGTGWGEEGYIRMQRDVTA 304
Query: 267 -EGLCGIGTRSSYPLA 281
EGLCGI ++SYP A
Sbjct: 305 KEGLCGIAMQASYPTA 320
>gi|400180435|gb|AFP73355.1| cysteine protease [Solanum pennellii]
Length = 344
Score = 261 bits (667), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 140/343 (40%), Positives = 193/343 (56%), Gaps = 76/343 (22%)
Query: 10 SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+ K++ + I + ++S + +RS + SV E HE WM++HGR YKDE+EK R
Sbjct: 2 AMKVDLMSILITLFFVISMFNTQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N+F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYVSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
FK +LS D+P++LDWR+ GAVT +KNQ +CGCCWAF+AV ++EG KI +GNL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKNQGQCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLLDCSTNGNNGCLGG---------------SREKAFAYIIQNQ--------------- 215
Q+LLDC+TN N GC GG SRE + Y+ Q
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGQQYTCRSQEKTAAVQISS 238
Query: 216 -----------------------------------GIFNGVCGTQLDHAVTIVGFGTTED 240
G ++G C +++HAVT +G+GT E
Sbjct: 239 YQVVPEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCANRINHAVTAIGYGTDEK 298
Query: 241 GANYWLIKNSWGNTWGDAGYMKIVRDE----GLCGIGTRSSYP 279
G YWL+KNSWG +WG+ G+MKI+RD GLC I SSYP
Sbjct: 299 GQKYWLLKNSWGTSWGEDGFMKIIRDSGNPAGLCDIAKVSSYP 341
>gi|302143411|emb|CBI21972.3| unnamed protein product [Vitis vinifera]
Length = 320
Score = 261 bits (666), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 139/316 (43%), Positives = 192/316 (60%), Gaps = 60/316 (18%)
Query: 19 FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
++ +L + ASQ ++RS HE S+ E HE WM Q+GR YKD EK R KIFK+N+ I
Sbjct: 12 LALLFVLAAWASQA-TARSLHEASMYERHEDWMVQYGREYKDADEKSKRYKIFKDNVARI 70
Query: 79 EKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHR-STTSSTFKYQNLSMTDVP 137
E NK +++YKL N+F+DLTN+EFRA +K +H ST +++FKY+N+ T VP
Sbjct: 71 ESFNKAMDKSYKLSINEFADLTNEEFRASRNRFK----AHICSTEATSFKYENV--TAVP 124
Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-N 196
+++DWR KGAVTPIK+Q +CG CWAF+AVAA+EGIT++ +G LI LSEQ+L+DC T+G +
Sbjct: 125 STVDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGED 184
Query: 197 NGC-----------------------------LGGSREKAFAYIIQNQ------------ 215
GC + + EKA + +Q
Sbjct: 185 QGCTNYPYAGTDGTCNRKKAAHPAAKINGYEDVPANNEKALQKAVAHQPIAVAIDASGSE 244
Query: 216 ------GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD--- 266
G+F G CGT+LDH V VG+GT++DG YWL+KNSW WG+ GY+++ RD
Sbjct: 245 FQFYSSGVFTGQCGTELDHGVAAVGYGTSDDGMKYWLVKNSWSTGWGEEGYIRMQRDVTA 304
Query: 267 -EGLCGIGTRSSYPLA 281
EGLCGI ++SYP A
Sbjct: 305 KEGLCGIAMQASYPTA 320
>gi|400180443|gb|AFP73358.1| cysteine protease, partial [Solanum habrochaites]
Length = 345
Score = 261 bits (666), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 139/343 (40%), Positives = 193/343 (56%), Gaps = 76/343 (22%)
Query: 10 SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+ K++ + I + ++S + +RS + SV E HE WM++HGR YKDE+EK R
Sbjct: 2 AMKVDLMSILITLFFVISMFNSQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N+F+D+T++EF A +TG +P SPS +T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSEEFLAKFTGLNIPNSYLSPSPMPSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
FK +LS D+P++LDWR+ GAVT +KNQ +CGCCWAF+AV ++EG KI +GNL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKNQGQCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLLDCSTNGNNGCLGG---------------SREKAFAYIIQNQ--------------- 215
Q+LLDC+TN N GC GG SRE + Y+ Q
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQYTCRSQGKTAAVQISN 238
Query: 216 -----------------------------------GIFNGVCGTQLDHAVTIVGFGTTED 240
G ++G C +++HAVT +G+GT E
Sbjct: 239 YQVVPEGETSLLQAVTKQPVSIGIAASHDLQFYAGGTYDGSCANRINHAVTAIGYGTDEK 298
Query: 241 GANYWLIKNSWGNTWGDAGYMKIVRDE----GLCGIGTRSSYP 279
G YWL+KNSWG +WG+ G+MKI+RD GLC I SSYP
Sbjct: 299 GQKYWLLKNSWGTSWGENGFMKIIRDSGNPAGLCDIAKMSSYP 341
>gi|400180357|gb|AFP73317.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 259 bits (661), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 139/343 (40%), Positives = 192/343 (55%), Gaps = 76/343 (22%)
Query: 10 SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+ K++ + I + ++S + +RS + SV E HE WM++HGR YKDE+EK R
Sbjct: 2 AMKVDLMSILITLFFVISMFNSQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N+F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
FK +LS D+P++LDWR+ GAVT +K+Q CGCCWAF+AV ++EG KI +GNL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLLDCSTNGNNGCLGG---------------SREKAFAYIIQNQ--------------- 215
Q+LLDC+TN N GC GG SRE + Y+ Q
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQYTCRSQEKTAAVQISS 238
Query: 216 -----------------------------------GIFNGVCGTQLDHAVTIVGFGTTED 240
G ++G C +++HAVT +G+GT E
Sbjct: 239 YKVVPEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGYGTDEK 298
Query: 241 GANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 279
G YWL+KNSWG +WG+ G+MKI+RD GLC I SSYP
Sbjct: 299 GQKYWLLKNSWGTSWGENGFMKIIRDYGNPSGLCDIAKMSSYP 341
>gi|400180422|gb|AFP73349.1| cysteine protease [Solanum chmielewskii]
Length = 344
Score = 258 bits (660), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 139/343 (40%), Positives = 193/343 (56%), Gaps = 76/343 (22%)
Query: 10 SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+ K++ + I + ++S + RS + SV E HE WM++HGR YKDE+EK R
Sbjct: 2 AMKVDLMNILITLFFVISIFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N+F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
FK +LS D+P++LDWR+ GAVT +K+Q +CGCCWAF+AV ++EG KI +GNL++ SE
Sbjct: 120 FKTNDLSDDDMPSNLDWRESGAVTQVKHQGQCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLLDCSTNGNNGCLGG---------------SREKAFAYIIQNQ--------------- 215
Q+LLDC+TN N GC GG SRE + Y+ Q
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQYTCRSQEKTAAVQISS 238
Query: 216 -----------------------------------GIFNGVCGTQLDHAVTIVGFGTTED 240
G ++G C +++HAVT +G+GT E+
Sbjct: 239 YQVVPEGETSLLQAVTKQPVSIGIAASQDLQFYSGGTYDGSCADRINHAVTAIGYGTDEE 298
Query: 241 GANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 279
G YWL+KNSWG +WG+ G+MKI+RD GLC I SSYP
Sbjct: 299 GQKYWLLKNSWGTSWGENGFMKIIRDSGDPSGLCDIAKMSSYP 341
>gi|400180426|gb|AFP73351.1| cysteine protease [Solanum corneliomuelleri]
Length = 344
Score = 258 bits (659), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 139/343 (40%), Positives = 192/343 (55%), Gaps = 76/343 (22%)
Query: 10 SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+ K++ + I + ++S + RS + SV E HE WM++HGR YKDE+EK R
Sbjct: 2 AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N+F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
FK +LS D+P++LDWR+ GAVT +K+Q CGCCWAF+AV ++EG KI +GNL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLLDCSTNGNNGCLGG---------------SREKAFAYIIQNQ--------------- 215
Q+LLDC+TN N GC GG SRE + Y+ Q
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQYTCRSQEKTAAVQISS 238
Query: 216 -----------------------------------GIFNGVCGTQLDHAVTIVGFGTTED 240
G ++G C +++HAVT +G+GT E+
Sbjct: 239 YQVVPEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGYGTDEN 298
Query: 241 GANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 279
G YWL+KNSWG +WG+ G+MKI+RD GLC I SSYP
Sbjct: 299 GQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|400180345|gb|AFP73311.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 258 bits (658), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 139/343 (40%), Positives = 192/343 (55%), Gaps = 76/343 (22%)
Query: 10 SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+ K++ + I + ++S + RS + SV E HE WM++HGR YKDE+EK R
Sbjct: 2 AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N+F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
FK +LS D+P++LDWR+ GAVT +K+Q CGCCWAF+AV ++EG KI +GNL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLLDCSTNGNNGCLGG---------------SREKAFAYIIQNQ--------------- 215
Q+LLDC+TN N GC GG SRE + Y+ Q
Sbjct: 180 QELLDCTTN-NYGCDGGFMTNAFDFIIENGGISRESDYEYLGQQYTCRSQEKTAAVQISS 238
Query: 216 -----------------------------------GIFNGVCGTQLDHAVTIVGFGTTED 240
G ++G C +++HAVT +G+GT E+
Sbjct: 239 YQVVPEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGYGTDEN 298
Query: 241 GANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 279
G YWL+KNSWG +WG+ G+MKI+RD GLC I SSYP
Sbjct: 299 GQKYWLLKNSWGTSWGENGFMKIIRDYGNPAGLCDIAKMSSYP 341
>gi|302143415|emb|CBI21976.3| unnamed protein product [Vitis vinifera]
Length = 322
Score = 258 bits (658), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 139/318 (43%), Positives = 193/318 (60%), Gaps = 62/318 (19%)
Query: 19 FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
++ +L + ASQ ++R+ HE S+ E HE WMAQ+GR YKD EK R KIFK+N+ I
Sbjct: 12 LALLFVLAAWASQA-TARNLHEASMYERHEDWMAQYGRVYKDADEKSKRYKIFKDNVARI 70
Query: 79 EKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHR-STTSSTFKYQNLSMTDVP 137
E NK +++YKL N+F+DLTN+EF +K +H ST +++FKY+N+ T VP
Sbjct: 71 ESFNKAMDKSYKLSINEFADLTNEEFGTSRNRFK----AHICSTEATSFKYENV--TAVP 124
Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-N 196
+++DWR KGAVTPIK+Q +CG CWAF+AVAA+EGIT++ +G LI LSEQ+L+DC T+G +
Sbjct: 125 STIDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGED 184
Query: 197 NGCLGG-----------SREKA-------------------------------------- 207
GC G +R+KA
Sbjct: 185 QGCNGANYPYAGTDGTCNRKKAAHPAAKINGYEDVPANNEKALQKAVVHQPIAVAIDAGG 244
Query: 208 FAYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD- 266
F + + G+F G CGT+LDH V VG+GT++DG YWL+KNSWG WG+ GY+++ RD
Sbjct: 245 FEFQFYSSGVFTGQCGTELDHGVAAVGYGTSDDGMKYWLVKNSWGTGWGEEGYIRMQRDV 304
Query: 267 ---EGLCGIGTRSSYPLA 281
EGLCGI ++SYP A
Sbjct: 305 TAKEGLCGIAMQASYPTA 322
>gi|400180365|gb|AFP73321.1| cysteine protease [Solanum peruvianum]
gi|400180395|gb|AFP73336.1| cysteine protease [Solanum peruvianum]
gi|400180405|gb|AFP73341.1| cysteine protease [Solanum peruvianum]
gi|400180409|gb|AFP73343.1| cysteine protease [Solanum peruvianum]
gi|400180411|gb|AFP73344.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 257 bits (657), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 139/343 (40%), Positives = 191/343 (55%), Gaps = 76/343 (22%)
Query: 10 SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+ K++ + I + ++S + RS + SV E HE WM++HGR YKDE+EK R
Sbjct: 2 AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N+F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
FK +LS D+P++LDWR+ GAVT +K+Q CGCCWAF+AV ++EG KI +GNL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLLDCSTNGNNGCLGG---------------SREKAFAYIIQNQ--------------- 215
Q+LLDC+TN N GC GG SRE + Y+ Q
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQYTCRSQEKTAAVQISS 238
Query: 216 -----------------------------------GIFNGVCGTQLDHAVTIVGFGTTED 240
G ++G C +++HAVT +G+GT E
Sbjct: 239 YKVVPEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGYGTDEK 298
Query: 241 GANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 279
G YWL+KNSWG +WG+ G+MKI+RD GLC I SSYP
Sbjct: 299 GQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|400180369|gb|AFP73323.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 257 bits (657), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 139/343 (40%), Positives = 191/343 (55%), Gaps = 76/343 (22%)
Query: 10 SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+ K++ + I + ++S + RS + SV E HE WM++HGR YKDE+EK R
Sbjct: 2 AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N+F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
FK +LS D+P++LDWR+ GAVT +K+Q CGCCWAF+AV ++EG KI +GNL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLLDCSTNGNNGCLGG---------------SREKAFAYIIQNQ--------------- 215
Q+LLDC+TN N GC GG SRE + Y+ Q
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGQQYTCRSQEKTAAVQISS 238
Query: 216 -----------------------------------GIFNGVCGTQLDHAVTIVGFGTTED 240
G ++G C +++HAVT +G+GT E
Sbjct: 239 YKVVPEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGYGTDEK 298
Query: 241 GANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 279
G YWL+KNSWG +WG+ G+MKI+RD GLC I SSYP
Sbjct: 299 GQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|400180347|gb|AFP73312.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 257 bits (657), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 139/343 (40%), Positives = 191/343 (55%), Gaps = 76/343 (22%)
Query: 10 SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+ K++ + I + ++S + RS + SV E HE WM++HGR YKDE+EK R
Sbjct: 2 AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N+F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
FK +LS D+P++LDWR+ GAVT +K+Q CGCCWAF+AV ++EG KI +GNL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLLDCSTNGNNGCLGG---------------SREKAFAYIIQNQ--------------- 215
Q+LLDC+TN N GC GG SRE + Y+ Q
Sbjct: 180 QELLDCTTN-NYGCDGGFMTNAFDFIIENGGISRESDYEYLGQQYTCRSQEKTAAVQISS 238
Query: 216 -----------------------------------GIFNGVCGTQLDHAVTIVGFGTTED 240
G ++G C +++HAVT +G+GT E
Sbjct: 239 YQVVPEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGYGTDEK 298
Query: 241 GANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 279
G YWL+KNSWG +WG+ G+MKI+RD GLC I SSYP
Sbjct: 299 GQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|400180381|gb|AFP73329.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 256 bits (655), Expect = 6e-66, Method: Compositional matrix adjust.
Identities = 139/343 (40%), Positives = 191/343 (55%), Gaps = 76/343 (22%)
Query: 10 SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+ K++ + I + ++S + RS + SV E HE WM++HGR YKDE+EK R
Sbjct: 2 AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N+F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
FK +LS D+P++LDWR+ GAVT +K+Q CGCCWAF+AV ++EG KI +GNL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLLDCSTNGNNGCLGG---------------SREKAFAYIIQNQ--------------- 215
Q+LLDC+TN N GC GG SRE + Y+ Q
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQYTCRSQEKTAAVQISS 238
Query: 216 -----------------------------------GIFNGVCGTQLDHAVTIVGFGTTED 240
G ++G C +++HAVT +G+GT E
Sbjct: 239 YKVVPEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGYGTDEK 298
Query: 241 GANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 279
G YWL+KNSWG +WG+ G+MKI+RD GLC I SSYP
Sbjct: 299 GQKYWLLKNSWGTSWGENGFMKIIRDSGDPSGLCDITKMSSYP 341
>gi|400180383|gb|AFP73330.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 254 bits (649), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 138/343 (40%), Positives = 190/343 (55%), Gaps = 76/343 (22%)
Query: 10 SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+ K++ + I + ++S + RS + SV E HE WM++HGR YKDE+EK R
Sbjct: 2 AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N+F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
F +LS D+P++LDWR+ GAVT +K+Q CGCCWAF+AV ++EG KI +GNL++ SE
Sbjct: 120 FIINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLLDCSTNGNNGCLGG---------------SREKAFAYIIQNQ--------------- 215
Q+LLDC+TN N GC GG SRE + Y+ Q
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQYTCRSQEKTAAVQISS 238
Query: 216 -----------------------------------GIFNGVCGTQLDHAVTIVGFGTTED 240
G ++G C +++HAVT +G+GT E
Sbjct: 239 YKVVPEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGYGTDEK 298
Query: 241 GANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 279
G YWL+KNSWG +WG+ G+MKI+RD GLC I SSYP
Sbjct: 299 GQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|297819566|ref|XP_002877666.1| hypothetical protein ARALYDRAFT_906213 [Arabidopsis lyrata subsp.
lyrata]
gi|297323504|gb|EFH53925.1| hypothetical protein ARALYDRAFT_906213 [Arabidopsis lyrata subsp.
lyrata]
Length = 304
Score = 253 bits (647), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 134/296 (45%), Positives = 178/296 (60%), Gaps = 56/296 (18%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
E S +E HE+WM++ R Y D+ EK R +IFK+NL+++E N N TYKL N+FSDL
Sbjct: 11 EASAIEKHEQWMSRFNRVYSDDSEKTSRFEIFKKNLKFVESFNMNTNNTYKLDVNKFSDL 70
Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGC 159
T++EF+A Y G + S + +F+Y+N+S T S+DWR +GAVTP+K+Q +CGC
Sbjct: 71 TDEEFQARYMGLVPEGMTGDSQKTVSFRYENVSETG--ESMDWRLEGAVTPVKDQGQCGC 128
Query: 160 CWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNN-GCLGGSREKAFAYIIQNQGI- 217
CWAFAAVAAVEG+TKI +G L+ LSEQQL+DCST NN GC GG A+ YI +NQGI
Sbjct: 129 CWAFAAVAAVEGVTKIANGELVSLSEQQLVDCSTANNNMGCDGGLALTAYDYIKENQGIT 188
Query: 218 ------FNGV------------------------------------------CGTQLDHA 229
+ V CGT HA
Sbjct: 189 SEENYPYQAVQQTCKSTDPAAATISGYEAVPKDDEEALLKAVSQHGIFEDEYCGTDSHHA 248
Query: 230 VTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 281
VTIVG+GT+E+G YWL+KNSWG +WG+ GYM+I RD +G+CG+ R+ YP+A
Sbjct: 249 VTIVGYGTSEEGIKYWLLKNSWGESWGENGYMRIKRDVDEPQGMCGLAHRAYYPVA 304
>gi|400180463|gb|AFP73368.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 253 bits (645), Expect = 8e-65, Method: Compositional matrix adjust.
Identities = 139/342 (40%), Positives = 192/342 (56%), Gaps = 74/342 (21%)
Query: 10 SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+ K++ + I + ++S + +RS + SV E HE WM++HGR YKDE+EK R
Sbjct: 2 AMKVDLMNILITLFFVISMFNTQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N+F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
FK +LS D+P++LDWR+ GAVT +K+Q CGCCWAF+AV ++EG KI +GNL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLLDCSTN--GNNGCL------------GGSREKAFAY--------------------- 210
Q+LLDC+TN G NG G SRE + Y
Sbjct: 180 QELLDCTTNNYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQEKTAAVQISSY 239
Query: 211 ----------------------IIQNQ-------GIFNGVCGTQLDHAVTIVGFGTTEDG 241
I +Q G ++G C +++HAVT +G+GT E G
Sbjct: 240 QVVPEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGYGTDEKG 299
Query: 242 ANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 279
YWL+KNSWG +WG+ G+MKI+RD GLC I SSYP
Sbjct: 300 QKYWLLKNSWGTSWGENGFMKIIRDSGDPSGLCDIAKMSSYP 341
>gi|400180417|gb|AFP73347.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 252 bits (644), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 134/341 (39%), Positives = 188/341 (55%), Gaps = 72/341 (21%)
Query: 10 SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+ K++ + I + ++S + RS + SV E HE WM++HGR YKDE+EK R
Sbjct: 2 AMKVDLMNILITVFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTS--STFK 127
IFKEN+++IE NK GN +YKLG N+F+D+T+ EF A +TG +P+ + + FK
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPLSSTEFK 121
Query: 128 YQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQ 187
+LS D+P++LDWR+ GAVT +K+Q CGCCWAF+AV ++EG KI +GNL++ SEQ+
Sbjct: 122 INDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQE 181
Query: 188 LLDCSTNGNNGCLGG---------------SREKAFAYIIQNQ----------------- 215
LLDC+TN N GC GG SRE + Y+ Q
Sbjct: 182 LLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQYTCRSQEKTAAVQISSYK 240
Query: 216 ---------------------------------GIFNGVCGTQLDHAVTIVGFGTTEDGA 242
G ++G C +++HAVT +G+GT E G
Sbjct: 241 VVPEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGYGTDEKGQ 300
Query: 243 NYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 279
YWL+KNSWG +WG+ G+MKI+RD GLC I SSYP
Sbjct: 301 KYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|20334377|gb|AAM19209.1|AF493234_1 cysteine protease [Solanum lycopersicum]
gi|400180431|gb|AFP73353.1| cysteine protease [Solanum lycopersicum]
Length = 345
Score = 251 bits (642), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 138/343 (40%), Positives = 190/343 (55%), Gaps = 75/343 (21%)
Query: 10 SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+ K++ + I + ++S + RS + SV E HE WM++HGR YKDE+EK R
Sbjct: 2 AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N+F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFK 121
Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
K +LS +P++LDWR+ GAVT +K+Q CGCCWAF+AV ++EG KI +GNL++ SE
Sbjct: 122 -KINDLSDDYMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 180
Query: 186 QQLLDCSTNGNNGCLGG---------------SREKAFAYIIQNQ--------------- 215
Q+LLDC+TN N GC GG SRE + Y+ Q
Sbjct: 181 QELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQYTCRSQEKTAAVQISS 239
Query: 216 -----------------------------------GIFNGVCGTQLDHAVTIVGFGTTED 240
G ++G C +++HAVT +G+GT E+
Sbjct: 240 YQVVPEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGNCADRINHAVTAIGYGTDEE 299
Query: 241 GANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 279
G YWL+KNSWG +WG+ GYMKI+RD GLC I SSYP
Sbjct: 300 GQKYWLLKNSWGTSWGENGYMKIIRDSGDPSGLCDIAKMSSYP 342
>gi|20334373|gb|AAM19207.1|AF493232_1 cysteine protease [Solanum pimpinellifolium]
gi|400180424|gb|AFP73350.1| cysteine protease [Solanum pimpinellifolium]
gi|400180433|gb|AFP73354.1| cysteine protease [Solanum lycopersicum]
Length = 344
Score = 251 bits (641), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 140/342 (40%), Positives = 192/342 (56%), Gaps = 74/342 (21%)
Query: 10 SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+ K++ + I + ++S + RS + SV E HE WM++HGR YKDE+EK R
Sbjct: 2 AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N+F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
FK +LS +P++LDWR+ GAVT +K+Q CGCCWAF+AV ++EG KI +GNL++ SE
Sbjct: 120 FKINDLSDDYMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLLDCSTN--GNNGCL------------GGSREKAFAY--------------------- 210
Q+LLDC+TN G NG L G SRE + Y
Sbjct: 180 QELLDCTTNNYGCNGGLMTNAFDFIIENGGISRESDYEYLGEQYTCRSREKTAAVQISSY 239
Query: 211 ----------------------IIQNQ-------GIFNGVCGTQLDHAVTIVGFGTTEDG 241
I +Q G ++G C Q++HAVT +G+GT E+G
Sbjct: 240 KVVPEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGNCADQINHAVTAIGYGTDEEG 299
Query: 242 ANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 279
YWL+KNSWG +WG+ G+MKI+RD GLC I SSYP
Sbjct: 300 QKYWLLKNSWGTSWGENGFMKIIRDSGDPSGLCDIAKMSSYP 341
>gi|20334375|gb|AAM19208.1|AF493233_1 cysteine protease [Solanum pennellii]
Length = 337
Score = 251 bits (641), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 136/339 (40%), Positives = 189/339 (55%), Gaps = 75/339 (22%)
Query: 10 SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+ KI+ + I + ++S + RS + SV E HE WM++HGR YKDE+EK R
Sbjct: 2 AMKIDLMSILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQ 129
IFKEN+++IE NK GN +YKLG N+F+D+T+ EF A +TG +P+ S+ S +
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPN-SYLSPS----PIN 116
Query: 130 NLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLL 189
+LS D+P++LDWR+ GAVT +KNQ +CGCCWAF+AV ++EG KI +GNL++ SEQ+LL
Sbjct: 117 DLSDDDMPSNLDWRESGAVTQVKNQGQCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELL 176
Query: 190 DCSTNGNNGCLGG---------------SREKAFAYIIQNQ------------------- 215
DC+TN N GC GG SRE + Y+ Q
Sbjct: 177 DCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGQQYTCRSQEKTAAVQISSYQVV 235
Query: 216 -------------------------------GIFNGVCGTQLDHAVTIVGFGTTEDGANY 244
G ++G C +++HAVT +G+GT E G Y
Sbjct: 236 PEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCANRINHAVTAIGYGTDEKGQKY 295
Query: 245 WLIKNSWGNTWGDAGYMKIVRDE----GLCGIGTRSSYP 279
WL+KNSWG +WG+ G+MKI+RD GLC I SSYP
Sbjct: 296 WLLKNSWGTSWGEDGFMKIIRDSGNPAGLCDIAKVSSYP 334
>gi|400180437|gb|AFP73356.1| cysteine protease [Solanum pennellii]
Length = 337
Score = 250 bits (639), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 135/339 (39%), Positives = 189/339 (55%), Gaps = 75/339 (22%)
Query: 10 SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+ K++ + I + ++S + RS + SV E HE WM++HGR YKDE+EK R
Sbjct: 2 AMKVDLMSILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQ 129
IFKEN+++IE NK GN +YKLG N+F+D+T+ EF A +TG +P+ S+ S +
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPN-SYLSPS----PIN 116
Query: 130 NLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLL 189
+LS D+P++LDWR+ GAVT +KNQ +CGCCWAF+AV ++EG KI +GNL++ SEQ+LL
Sbjct: 117 DLSDDDMPSNLDWRESGAVTQVKNQGQCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELL 176
Query: 190 DCSTNGNNGCLGG---------------SREKAFAYIIQNQ------------------- 215
DC+TN N GC GG SRE + Y+ Q
Sbjct: 177 DCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGQQYTCRSQEKTAAVQISSYQVV 235
Query: 216 -------------------------------GIFNGVCGTQLDHAVTIVGFGTTEDGANY 244
G ++G C +++HAVT +G+GT E G Y
Sbjct: 236 PEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCANRINHAVTAIGYGTDEKGQKY 295
Query: 245 WLIKNSWGNTWGDAGYMKIVRDE----GLCGIGTRSSYP 279
WL+KNSWG +WG+ G+MKI+RD GLC I SSYP
Sbjct: 296 WLLKNSWGTSWGEDGFMKIIRDSGNPAGLCDIAKVSSYP 334
>gi|400180449|gb|AFP73361.1| cysteine protease [Solanum chilense]
Length = 344
Score = 250 bits (638), Expect = 5e-64, Method: Compositional matrix adjust.
Identities = 135/342 (39%), Positives = 191/342 (55%), Gaps = 74/342 (21%)
Query: 10 SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+ K++ + I + ++S + RS + SV E HE WM++HGR YKDE+EK R
Sbjct: 2 AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFK+N+++IE NK GN +YKLG N+F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKKNMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
FK +LS D+P++LDWR+ GAVT +K+Q +CGCCWAF+AV ++EG KI +G L++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGQCGCCWAFSAVGSLEGAYKIATGKLMEFSE 179
Query: 186 QQLLDCSTN--GNNGCL------------GGSREKAFAYIIQN----------------- 214
Q+LLDC+TN G NG G SRE + Y+ +
Sbjct: 180 QELLDCTTNNYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCRSQEKTAAVQISSY 239
Query: 215 ---------------------------------QGIFNGVCGTQLDHAVTIVGFGTTEDG 241
+G ++G C +++HAVT +G+GT E G
Sbjct: 240 QVVPEGETSLLQAVTKQPVSIGIAASQDLQFYAEGTYDGSCADRINHAVTAIGYGTDEKG 299
Query: 242 ANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 279
YWL+KNSWG +WG+ G+MKI+RD GLC I SSYP
Sbjct: 300 QKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|400180361|gb|AFP73319.1| cysteine protease [Solanum peruvianum]
gi|400180397|gb|AFP73337.1| cysteine protease [Solanum peruvianum]
gi|400180401|gb|AFP73339.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 248 bits (633), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 138/342 (40%), Positives = 190/342 (55%), Gaps = 74/342 (21%)
Query: 10 SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+ K++ + I + ++S + RS + SV E HE WM++HGR YKDE+EK R
Sbjct: 2 AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N+F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
F +LS D+P++LDWR+ GAVT +K+Q CGCCWAF+AV ++EG KI +GNL++ SE
Sbjct: 120 FIINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLLDCSTN--GNNGCL------------GGSREKAFAY--------------------- 210
Q+LLDC+TN G NG G SRE + Y
Sbjct: 180 QELLDCTTNNYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQEKTAAVQISSY 239
Query: 211 ----------------------IIQNQ-------GIFNGVCGTQLDHAVTIVGFGTTEDG 241
I +Q G ++G C +++HAVT +G+GT E G
Sbjct: 240 QVVPEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGYGTDEKG 299
Query: 242 ANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 279
YWL+KNSWG +WG+ G+MKI+RD GLC I SSYP
Sbjct: 300 QKYWLLKNSWGTSWGENGFMKIIRDSGDPSGLCDITKMSSYP 341
>gi|242072398|ref|XP_002446135.1| hypothetical protein SORBIDRAFT_06g002170 [Sorghum bicolor]
gi|241937318|gb|EES10463.1| hypothetical protein SORBIDRAFT_06g002170 [Sorghum bicolor]
Length = 338
Score = 246 bits (628), Expect = 9e-63, Method: Compositional matrix adjust.
Identities = 129/338 (38%), Positives = 185/338 (54%), Gaps = 74/338 (21%)
Query: 13 INTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFK 72
+++ + I S S V+++R + ++VE HE WM ++GR YKD EK R ++FK
Sbjct: 2 VSSKAFLLAILGCASLCSSVLAARELSDAAMVERHENWMVEYGRVYKDAAEKARRFEVFK 61
Query: 73 ENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
+N+ ++E N N + LG NQF+DLT +EF+A G+K S TT FKY+NLS
Sbjct: 62 DNVAFVESFNTNKNNKFWLGINQFADLTIEEFKA-NKGFKPISAEKVPTTG--FKYENLS 118
Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
++ +PT++DWR KGAVTPIKNQ +CGCCWAF+AVAA+EGI K+ +GNLI LSEQ+L+DC
Sbjct: 119 VSALPTAVDWRTKGAVTPIKNQGQCGCCWAFSAVAAMEGIVKLSTGNLISLSEQELVDCD 178
Query: 193 TNG------------------NNG----------------CLGGSR-------------- 204
T+ NG C GGS+
Sbjct: 179 THSMDEGCEGGWMDSAFEFVIKNGGLATVSSYPYKAVDGKCKGGSKSAATIKGHEDVPVN 238
Query: 205 -EKAFAYIIQNQ------------------GIFNGVCGTQLDHAVTIVGFGTTEDGANYW 245
E A + NQ G+ G CGT+LDH + +G+G DG YW
Sbjct: 239 DEAALMKAVANQPVSVAVDASDRTFMLYSGGVMTGSCGTELDHGIAAIGYGVESDGTKYW 298
Query: 246 LIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 279
++KNSWG TWG+ G++++ +D +G+CG+ + SYP
Sbjct: 299 ILKNSWGTTWGEKGFLRMEKDISDKQGMCGLAMKPSYP 336
>gi|242072392|ref|XP_002446132.1| hypothetical protein SORBIDRAFT_06g002150 [Sorghum bicolor]
gi|241937315|gb|EES10460.1| hypothetical protein SORBIDRAFT_06g002150 [Sorghum bicolor]
Length = 337
Score = 243 bits (620), Expect = 6e-62, Method: Compositional matrix adjust.
Identities = 128/337 (37%), Positives = 183/337 (54%), Gaps = 76/337 (22%)
Query: 15 TTPMFIIITL-LVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKE 73
T+ F++ L S S V+++R + ++VE HE WM ++GR YKD EK R + FK
Sbjct: 3 TSKAFLLAILGCASLCSSVLAARELSDAAMVERHENWMVEYGRVYKDAAEKARRFEAFKH 62
Query: 74 NLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
N+ ++E N + LG NQF+DLT +EF+A G+K P+ ++ FKY+NLS+
Sbjct: 63 NVAFVESFNTNKKNKFWLGVNQFADLTTEEFKA-NKGFK---PTAEKVPTTGFKYENLSV 118
Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST 193
+ +PT++DWR KGAVTPIKNQ +CGCCWAF+AVAA+EGI K+ +GNLI LSEQ+L+DC T
Sbjct: 119 SALPTAVDWRTKGAVTPIKNQGQCGCCWAFSAVAAMEGIVKLSTGNLISLSEQELVDCDT 178
Query: 194 NG------------------NNG----------------CLGGSR--------------- 204
+ NG C GGS+
Sbjct: 179 HSMDEGCEGGWMDSAFEFVIKNGGLATESNYPYKAVDGKCKGGSKSAATIKGHEDVPVNN 238
Query: 205 EKAFAYIIQNQ------------------GIFNGVCGTQLDHAVTIVGFGTTEDGANYWL 246
E A + NQ G+ G CGT+LDH + +G+G DG YW+
Sbjct: 239 EAALMKAVANQPVSVAVDASDRTFMLYSGGVMTGSCGTELDHGIAAIGYGMESDGTKYWI 298
Query: 247 IKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 279
+KNSWG TWG+ G++++ +D G+CG+ + SYP
Sbjct: 299 LKNSWGTTWGEKGFLRMEKDITDKRGMCGLAMKPSYP 335
>gi|242072394|ref|XP_002446133.1| hypothetical protein SORBIDRAFT_06g002160 [Sorghum bicolor]
gi|241937316|gb|EES10461.1| hypothetical protein SORBIDRAFT_06g002160 [Sorghum bicolor]
Length = 338
Score = 243 bits (620), Expect = 8e-62, Method: Compositional matrix adjust.
Identities = 130/337 (38%), Positives = 183/337 (54%), Gaps = 75/337 (22%)
Query: 15 TTPMFIIITL-LVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKE 73
T+ F++ L S S V+++R + ++VE HE WM ++GR YKD EK R + FK
Sbjct: 3 TSKAFLLAILGCASLCSSVLAARELSDAAMVERHENWMVEYGRVYKDAAEKARRFEAFKH 62
Query: 74 NLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
N+ ++E N + LG NQF+DLT +EF+A G+K S TT FKY+NLS+
Sbjct: 63 NVAFVESFNTNKKNKFWLGVNQFADLTTEEFKA-NKGFKPISAEMVPTTG--FKYENLSV 119
Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST 193
+ +PT++DWR KGAVTPIKNQ +CGCCWAF+AVAA+EGI K+ +GNLI LSEQ+L+DC T
Sbjct: 120 SALPTAVDWRTKGAVTPIKNQGQCGCCWAFSAVAAMEGIVKLSTGNLISLSEQELVDCDT 179
Query: 194 NG------------------NNG----------------CLGGSR--------------- 204
+ NG C GGS+
Sbjct: 180 HSMDEGCEGGWMDSAFEFVIKNGGLATESSYPYKAVDGKCKGGSKSAATIKGHEDVPVND 239
Query: 205 EKAFAYIIQNQ------------------GIFNGVCGTQLDHAVTIVGFGTTEDGANYWL 246
E A + NQ G+ G CGT+LDH + +G+G DG YW+
Sbjct: 240 EAALMKAVANQPVSVAVDASDRTFMLYSGGVMTGSCGTELDHGIAAIGYGVESDGTKYWI 299
Query: 247 IKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 279
+KNSWG TWG+ G++++ +D +G+CG+ + SYP
Sbjct: 300 LKNSWGTTWGEKGFLRMEKDISDKQGMCGLAMKPSYP 336
>gi|413953667|gb|AFW86316.1| hypothetical protein ZEAMMB73_635707 [Zea mays]
Length = 340
Score = 242 bits (618), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 126/333 (37%), Positives = 177/333 (53%), Gaps = 73/333 (21%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
+ I+ C + + + + + ++V HE+WMAQ+ R YKD EK R ++FK N+++
Sbjct: 8 ILAILGFAFFCGAALAARDLSDDSAMVARHEQWMAQYSRVYKDASEKARRFEVFKANVKF 67
Query: 78 IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
IE N GN + LG NQF+DLTNDEFR++ T S + + T F+Y+N+S+ +P
Sbjct: 68 IESFNAGGNNKFWLGVNQFADLTNDEFRSIKTNKGFKSSNMKIPTG--FRYENVSVDALP 125
Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-- 195
T++DWR KGAVTPIK+Q +CGCCWAF+AVAA EGI KI +G L+ L+EQ+L+DC +G
Sbjct: 126 TTIDWRTKGAVTPIKDQGQCGCCWAFSAVAATEGIVKISTGKLVSLAEQELVDCDVHGED 185
Query: 196 ----------------NNG----------------CLGGSR---------------EKAF 208
NNG C GS E A
Sbjct: 186 QGCEGGLMDDAFKFIINNGGLTTESSYPYTAADGKCKSGSNSAATIKGYEDVPANDEAAL 245
Query: 209 AYIIQNQ------------------GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNS 250
+ NQ G+ G CGT LDH + +G+G T DG YWL+KNS
Sbjct: 246 MKAVANQPVSVAVDGGDMTFQFYSSGVMTGSCGTDLDHGIAAIGYGKTSDGTKYWLMKNS 305
Query: 251 WGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 279
WG TWG+ GY+++ +D G+CG+ SYP
Sbjct: 306 WGTTWGENGYLRMEKDISDKRGMCGLAMEPSYP 338
>gi|18403438|ref|NP_565780.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|2342728|gb|AAB67626.1| cysteine proteinase [Arabidopsis thaliana]
gi|330253821|gb|AEC08915.1| cysteine proteinase-like protein [Arabidopsis thaliana]
Length = 345
Score = 242 bits (618), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 135/342 (39%), Positives = 192/342 (56%), Gaps = 78/342 (22%)
Query: 18 MFIIITLLVSCASQVVSSRST------HEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIF 71
+ +++T+L+ + S++T EQS+V+ HE+WMA+ R Y+DELEK MR +F
Sbjct: 4 IMVLVTVLIILFTGFRISQATSRTVIFREQSMVDKHEQWMARFSREYRDELEKNMRRDVF 63
Query: 72 KENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYK-MPSPSHRSTTSSTFKYQN 130
K+NL++IE NK+GN++YKLG N+F+D TN+EF A++TG K + S + T Q
Sbjct: 64 KKNLKFIENFNKKGNKSYKLGVNEFADWTNEEFLAIHTGLKGLTEVSPSKVVAKTISSQT 123
Query: 131 LSMTD-VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLL 189
+++D V S DWR +GAVTP+K Q +CGCCWAF+AVAAVEG+ KI GNL+ LSEQQLL
Sbjct: 124 WNVSDMVVESKDWRAEGAVTPVKYQGQCGCCWAFSAVAAVEGVAKIAGGNLVSLSEQQLL 183
Query: 190 --------------------------------DCSTNGNNG-CLGGSR------------ 204
D S G++G C +R
Sbjct: 184 DCDREYDRGCDGGIMSDAFNYVVQNRGIASENDYSYQGSDGGCRSNARPAARISGFQTVP 243
Query: 205 ---EKAFAYIIQNQ------------------GIFNGVCGTQLDHAVTIVGFGTTEDGAN 243
E+A + Q G+++G CGT +HAVT VG+GT++DG
Sbjct: 244 SNNERALLEAVSRQPVSVSMDATGDGFMHYSGGVYDGPCGTSSNHAVTFVGYGTSQDGTK 303
Query: 244 YWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 281
YWL KNSWG TWG+ GY++I RD +G+CG+ + YP+A
Sbjct: 304 YWLAKNSWGETWGEKGYIRIRRDVAWPQGMCGVAQYAFYPVA 345
>gi|297826875|ref|XP_002881320.1| hypothetical protein ARALYDRAFT_321132 [Arabidopsis lyrata subsp.
lyrata]
gi|297327159|gb|EFH57579.1| hypothetical protein ARALYDRAFT_321132 [Arabidopsis lyrata subsp.
lyrata]
Length = 341
Score = 241 bits (615), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 135/334 (40%), Positives = 185/334 (55%), Gaps = 72/334 (21%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
+F I+ S + + + HE S +E HE+WMA+ R Y+DELEK+MR +FK+NL++
Sbjct: 10 IFTILFTTFSISQATSRTVTFHEPSSLEKHEQWMARFSRVYRDELEKQMRRDVFKKNLKF 69
Query: 78 IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
IE NK+GN++YKLG N+F+D TN+EF A++TG K S T S+ + M V
Sbjct: 70 IENFNKKGNKSYKLGVNEFADWTNEEFLAIHTGLKGLSSKVVDETISSRSWNISDMVGV- 128
Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLL-------- 189
S DWR +GAVTP+K Q +CGCCWAF+AVAAVEG+TKI GNL+ LSEQQLL
Sbjct: 129 -SKDWRAEGAVTPVKYQGQCGCCWAFSAVAAVEGVTKIAGGNLVSLSEQQLLDCDREYDR 187
Query: 190 ------------------------DCSTNGNNG-CLGGSR---------------EKAFA 209
D S G++G C +R E+A
Sbjct: 188 GCDGGIMSDAFNYIIQNRGIASENDYSYQGSDGRCRSSARPAARISGFQTVPSNNEQALL 247
Query: 210 YIIQNQ------------------GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSW 251
+ Q G+++G CGT +HAVT VG+GT++DG YWL KNSW
Sbjct: 248 EAVSRQPVSVSMDANGDGFMHYSGGVYDGPCGTSSNHAVTFVGYGTSQDGTKYWLAKNSW 307
Query: 252 GNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 281
G TWG+ GY++I RD +G+CG+ + YP+A
Sbjct: 308 GETWGEKGYIRIRRDVAWPQGMCGVAQYAFYPVA 341
>gi|195628596|gb|ACG36128.1| vignain precursor [Zea mays]
Length = 362
Score = 239 bits (611), Expect = 7e-61, Method: Compositional matrix adjust.
Identities = 125/327 (38%), Positives = 176/327 (53%), Gaps = 69/327 (21%)
Query: 22 ITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKA 81
+ V ++ V + E ++ ++KWMAQ+ R YKD+ EK R ++FK N E+I+++
Sbjct: 34 VAARVEPSTTVGRTTGGDEAMMMARYKKWMAQYRRKYKDDAEKAHRFQVFKANAEFIDRS 93
Query: 82 NKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPS--PSHRSTTSSTFKYQNLSMTDVPTS 139
N G + Y LGTNQF+DLT+ EF A+YTG + P+ PS + FKYQN + D
Sbjct: 94 NAGGKKKYVLGTNQFADLTSKEFAAMYTGLRKPAAVPSGAKQIPAGFKYQNFTRLDDDVQ 153
Query: 140 LDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-TNGNNG 198
+DWR +GAVTP+KNQ +CGCCWAF+AV A+EG+ I +GNL+ LSEQQ+LDC ++GN G
Sbjct: 154 VDWRQQGAVTPVKNQGQCGCCWAFSAVGAMEGLIMITTGNLVSLSEQQILDCDESDGNQG 213
Query: 199 CLGG-----------------------------------------------SREKAFAYI 211
C GG E A A
Sbjct: 214 CNGGYMDNAFQYVVNNGGVTTEDAYPYSAVQGTCQNVQPAATISGFQDLPSGDENALANA 273
Query: 212 IQNQ------------------GIFNGV-CGTQLDHAVTIVGFGTTEDGANYWLIKNSWG 252
+ NQ GI++G CGT ++HAVT +G+G + G YW++KNSWG
Sbjct: 274 VANQPVSVGVDGGSSPFQFYQGGIYDGDGCGTDMNHAVTAIGYGADDQGTQYWILKNSWG 333
Query: 253 NTWGDAGYMKIVRDEGLCGIGTRSSYP 279
WG+ G+M++ G CGI T +SYP
Sbjct: 334 TGWGENGFMQLQMGVGACGISTMASYP 360
>gi|356542631|ref|XP_003539770.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 239 bits (610), Expect = 9e-61, Method: Compositional matrix adjust.
Identities = 139/338 (41%), Positives = 193/338 (57%), Gaps = 82/338 (24%)
Query: 20 IIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIE 79
+ + LL + ++R+ + S+ E HE+WMAQHG+ YKD EKE+R KIF++N++ IE
Sbjct: 12 LALLLLFGFWAFSANTRTLEDASMHERHEQWMAQHGKVYKDHHEKELRYKIFQQNVKGIE 71
Query: 80 KANKEGNRTYKLGTNQFSDLTNDEFRAL--YTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
N GN+++KLG NQF+DLT +EF+A+ GY S +STFKY+++ T VP
Sbjct: 72 GFNNAGNKSHKLGVNQFADLTEEEFKAINKLKGYMWSKISR----TSTFKYEHV--TKVP 125
Query: 138 TSLDWRDKGAVTPIKNQK-ECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGN 196
+LDWR KGAVTPIK+Q +CG CWAFAAVAA EGITK+ +G LI LSEQ+L+DC TNG+
Sbjct: 126 ATLDWRQKGAVTPIKSQGLKCGSCWAFAAVAATEGITKLTTGELISLSEQELIDCDTNGD 185
Query: 197 N-GCLGGSREKAFAYIIQNQGI----------FNGVCGTQLD--HAVTIVGF-------- 235
N GC G ++AF +I+QN+G+ +G C +++ H +I G+
Sbjct: 186 NGGCKWGIIQEAFKFIVQNKGLATEASYPYQAVDGTCNAKVESKHVASIKGYEDVPANNE 245
Query: 236 -----------------------------------GTT-------------EDGANYWLI 247
GTT +DG YWLI
Sbjct: 246 TALLNAVANQPVSVLVDSSDYDFRFYSSGVLSGSCGTTFDHAVTVVGYGVSDDGTKYWLI 305
Query: 248 KNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 281
KNSWG WG+ GY++I RD EG+CGI ++SYP+A
Sbjct: 306 KNSWGVYWGEQGYIRIKRDVAAKEGMCGIAMQASYPIA 343
>gi|110737404|dbj|BAF00646.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 345
Score = 239 bits (610), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 133/342 (38%), Positives = 189/342 (55%), Gaps = 78/342 (22%)
Query: 18 MFIIITLLVSCASQVVSSRST------HEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIF 71
+ +++T+L+ + S++T EQS+V+ HE+WMA+ R Y+DELEK MR +F
Sbjct: 4 IMVLVTVLIILFTGFRISQATSRTVIFREQSMVDKHEQWMARFSREYRDELEKNMRRDVF 63
Query: 72 KENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYK-MPSPSHRSTTSSTFKYQN 130
K+NL++IE NK+GN++YKLG N+F+D TN+EF A++TG K + S + T Q
Sbjct: 64 KKNLKFIENFNKKGNKSYKLGVNEFADWTNEEFLAIHTGLKGLTEVSPSKVVAKTISSQT 123
Query: 131 LSMTD-VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLL 189
+++D V S DWR +GAVTP+K Q +CGCCWAF+AVAAVEG+ KI GNL+ LSEQQLL
Sbjct: 124 WNVSDMVVESKDWRAEGAVTPVKYQGQCGCCWAFSAVAAVEGVAKIAGGNLVSLSEQQLL 183
Query: 190 --------DCS-------------------------TNGNNGCLGGSR------------ 204
DC + GC +R
Sbjct: 184 DCDREYDRDCDGGIMSDAFNYVVQNRGIASENDYSYQGSDGGCRSNARPAARISGFQTVP 243
Query: 205 ---EKAFAYIIQNQ------------------GIFNGVCGTQLDHAVTIVGFGTTEDGAN 243
E+A + Q G+++G CGT +HAVT VG+GT++DG
Sbjct: 244 SNNERALLEAVSRQPVSVSMDATGDGFMHYSGGVYDGPCGTSSNHAVTFVGYGTSQDGTK 303
Query: 244 YWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 281
YWL KNSWG TW + GY++I RD +G+CG+ + YP+A
Sbjct: 304 YWLAKNSWGETWEEKGYIRIRRDVAWPQGMCGVAQYAFYPVA 345
>gi|318136892|gb|ADV41672.1| cysteine protease [Nicotiana tabacum]
Length = 349
Score = 239 bits (610), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 134/326 (41%), Positives = 183/326 (56%), Gaps = 74/326 (22%)
Query: 30 SQVVSSRS-THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRT 88
SQV SSR +E S+ H++W+A H + YKD EKEMR KIFKEN+E IE N ++
Sbjct: 24 SQVASSRPINYEASMRARHDQWIAHHDKVYKDLNEKEMRFKIFKENVERIEAFNAGEDKG 83
Query: 89 YKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAV 148
YKLG N+FSDLTN++FR L+TGYK P S++ ++ ++TD+P ++DWR KGAV
Sbjct: 84 YKLGVNKFSDLTNEKFRVLHTGYKRSHPKVMSSSKPKTHFRYANVTDIPPTMDWRKKGAV 143
Query: 149 TPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKA 207
TPIK+QKECGCCWAF+AVAA EG+ ++++G LI LSEQ+L+DC G + GC GG + A
Sbjct: 144 TPIKDQKECGCCWAFSAVAATEGLHQLKTGKLIPLSEQELVDCDVEGEDEGCSGGLLDTA 203
Query: 208 FAYIIQNQGIF----------NGVCGTQLD--HAVTIVGFGTTE---------------- 239
F +I++N+G+ +GVC + A I G+
Sbjct: 204 FDFILKNKGLTTEANYPYKGEDGVCNKKKSALSAAKIAGYEDVPANSEKALLQAVANQPV 263
Query: 240 ----DGANY----------------WL--------------------IKNSWGNTWGDAG 259
DG+++ WL IKNSWG+ WGD+G
Sbjct: 264 SVAIDGSSFDFQFYSSGVFSGSCSTWLNHAVTAVGYGATTDGTKYWIIKNSWGSKWGDSG 323
Query: 260 YMKIVRD----EGLCGIGTRSSYPLA 281
YM+I RD EGLCG+ +SYP A
Sbjct: 324 YMRIKRDVHEKEGLCGLAMDASYPTA 349
>gi|356577813|ref|XP_003557017.1| PREDICTED: uncharacterized protein LOC100801364 [Glycine max]
Length = 890
Score = 239 bits (609), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 134/324 (41%), Positives = 175/324 (54%), Gaps = 80/324 (24%)
Query: 33 VSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLG 92
V+ RS + S+ E HE+WM ++G+ YKD E+E R +IFKEN+ YIE N N+ YKL
Sbjct: 572 VTCRSLQDASMYERHEQWMTRYGKVYKDPQEREKRFRIFKENVNYIEAFNNAANKRYKLA 631
Query: 93 TNQFSDLTNDEFRALYTGYK--MPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTP 150
NQF+DLTN+EF A +K M S R+TT FKY+N+ T VP+++DWR KGAVTP
Sbjct: 632 INQFADLTNEEFIAPRNRFKGHMCSSIIRTTT---FKYENV--TAVPSTVDWRQKGAVTP 686
Query: 151 IKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLL--------------------- 189
IK+Q +CGCCWAF+AVAA EGI + SG LI LSEQ+L+
Sbjct: 687 IKDQGQCGCCWAFSAVAATEGIHALTSGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFK 746
Query: 190 -------------------DCSTNGNNGC-----------LGGSREKAFAYIIQNQ---- 215
D N N + + EKA + NQ
Sbjct: 747 FVIQNHGLNTEANYPYKGVDGKCNANEAANDVVTITGYEDVPANNEKALQKAVANQPVSV 806
Query: 216 --------------GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYM 261
G+F G CGT+LDH VT VG+G + DG YWL+KNSWG WG+ GY+
Sbjct: 807 AIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVSNDGTEYWLVKNSWGTEWGEEGYI 866
Query: 262 KIVR----DEGLCGIGTRSSYPLA 281
++ R +EGLCGI ++SYP A
Sbjct: 867 RMQRGVDSEEGLCGIAMQASYPTA 890
>gi|125547256|gb|EAY93078.1| hypothetical protein OsI_14879 [Oryza sativa Indica Group]
Length = 339
Score = 238 bits (607), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 129/335 (38%), Positives = 177/335 (52%), Gaps = 74/335 (22%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
+F I++ L C++ + + + ++V HE+WM Q+GR YKD EK R +IFK N+ +
Sbjct: 8 LFAILSCLCLCSAVLAAREQSDHAAMVARHERWMEQYGRVYKDATEKARRFEIFKANVAF 67
Query: 78 IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
IE N GN + LG NQF+DLTN EFRA T + R T TF+Y+N+S+ +P
Sbjct: 68 IESFNA-GNHKFWLGVNQFADLTNYEFRATKTNKGFIPSTVRVPT--TFRYENVSIDTLP 124
Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-- 195
++DWR KGAVTPIK+Q +CGCCWAF+AVAA+EGI K+ +G LI LSEQ+L+DC +G
Sbjct: 125 ATVDWRTKGAVTPIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLISLSEQELVDCDVHGED 184
Query: 196 ----------------NNG----------------CLGGSR---------------EKAF 208
NG C GGS E A
Sbjct: 185 QGCEGGLMDDAFKFIIKNGGLTTESKYPYTAADGKCNGGSNSAATIKGYEEVPANNEAAL 244
Query: 209 AYIIQNQ------------------GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNS 250
+ NQ G+ G CGT LDH + +G+G DG YWL+KNS
Sbjct: 245 MKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIVAIGYGKDGDGTQYWLLKNS 304
Query: 251 WGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 281
WG TWG+ G++++ +D G+CG+ SYP A
Sbjct: 305 WGTTWGENGFLRMEKDISDKRGMCGLAMEPSYPTA 339
>gi|38346003|emb|CAD40112.2| OSJNBa0035O13.5 [Oryza sativa Japonica Group]
gi|125589427|gb|EAZ29777.1| hypothetical protein OsJ_13835 [Oryza sativa Japonica Group]
Length = 339
Score = 238 bits (607), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 129/335 (38%), Positives = 177/335 (52%), Gaps = 74/335 (22%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
+F I++ L C++ + + + ++V HE+WM Q+GR YKD EK R +IFK N+ +
Sbjct: 8 LFAILSCLCLCSAVLAAREQSDHAAMVARHERWMEQYGRVYKDATEKARRFEIFKANVAF 67
Query: 78 IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
IE N GN + LG NQF+DLTN EFRA T + R T TF+Y+N+S+ +P
Sbjct: 68 IESFNA-GNHKFWLGVNQFADLTNYEFRATKTNKGFIPSTVRVPT--TFRYENVSIDTLP 124
Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-- 195
++DWR KGAVTPIK+Q +CGCCWAF+AVAA+EGI K+ +G LI LSEQ+L+DC +G
Sbjct: 125 ATVDWRTKGAVTPIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLISLSEQELVDCDVHGED 184
Query: 196 ----------------NNG----------------CLGGSR---------------EKAF 208
NG C GGS E A
Sbjct: 185 QGCEGGLMDDAFKFIIKNGGLTTESKYPYTAADGKCNGGSNSAATIKGYEDVPANNEAAL 244
Query: 209 AYIIQNQ------------------GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNS 250
+ NQ G+ G CGT LDH + +G+G DG YWL+KNS
Sbjct: 245 MKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIVAIGYGKDGDGTQYWLLKNS 304
Query: 251 WGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 281
WG TWG+ G++++ +D G+CG+ SYP A
Sbjct: 305 WGTTWGENGFLRMEKDISDKRGMCGLAMEPSYPTA 339
>gi|356515052|ref|XP_003526215.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 339
Score = 236 bits (602), Expect = 9e-60, Method: Compositional matrix adjust.
Identities = 127/331 (38%), Positives = 178/331 (53%), Gaps = 72/331 (21%)
Query: 19 FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
++I+ L+++ + V SR E E HEKWMAQ+G+ Y D EKE R +IFK N+++I
Sbjct: 9 YLILFLILTVWTFHVMSRRLSEVCTSERHEKWMAQYGKLYTDAAEKEKRFQIFKNNVQFI 68
Query: 79 EKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPT 138
E N G++ + L NQF+DL N+EF+A + + T ++F+Y+ S+T +P
Sbjct: 69 ESFNAAGDKPFNLSINQFADLHNEEFKASLINVQKKESGVETATETSFRYE--SITKIPV 126
Query: 139 SLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDC------- 191
++DWR +GAVTPIK+Q CG CWAF+ VAA+EGI +I +G L+ LSEQ+L+DC
Sbjct: 127 TMDWRKRGAVTPIKDQGNCGSCWAFSTVAAIEGIHQITTGKLVSLSEQELVDCVKGKSEG 186
Query: 192 -------------STNG-------------NNGCL-----------------GGSREKAF 208
+ NG N C+ + EKA
Sbjct: 187 CNFGYKEEAFEFVAKNGGLASEISYPYKANNKTCMVKKETQGVAQIKGYENVPSNSEKAL 246
Query: 209 AYIIQNQ----------------GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWG 252
+ NQ GIF G CGT +HAVT++G+G GA YWL+KNSWG
Sbjct: 247 LKAVANQPVSVYIDAGALQFYSSGIFTGKCGTAPNHAVTVIGYGKARGGAKYWLVKNSWG 306
Query: 253 NTWGDAGYMKIVRD----EGLCGIGTRSSYP 279
WG+ GY+K+ RD EGLCGI T +SYP
Sbjct: 307 TKWGEKGYIKMKRDIRAKEGLCGIATNASYP 337
>gi|116309130|emb|CAH66233.1| H0825G02.10 [Oryza sativa Indica Group]
Length = 339
Score = 235 bits (600), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 128/335 (38%), Positives = 176/335 (52%), Gaps = 74/335 (22%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
+F I++ L C++ + + + ++V HE+WM Q+GR YKD EK R +IFK N+ +
Sbjct: 8 LFAILSCLCLCSAVLAAREQSDHAAMVARHERWMEQYGRVYKDATEKARRFEIFKANVAF 67
Query: 78 IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
IE N GN + L NQF+DLTN EFRA T + R T TF+Y+N+S+ +P
Sbjct: 68 IESFNA-GNHKFWLSVNQFADLTNYEFRATKTNKGFIPSTVRVPT--TFRYENVSIDTLP 124
Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-- 195
++DWR KGAVTPIK+Q +CGCCWAF+AVAA+EGI K+ +G LI LSEQ+L+DC +G
Sbjct: 125 ATVDWRTKGAVTPIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLISLSEQELVDCDVHGED 184
Query: 196 ----------------NNG----------------CLGGSR---------------EKAF 208
NG C GGS E A
Sbjct: 185 QGCEGGLMDDAFKFIIKNGGLTTESKYPYTAADGKCNGGSNSAATIKGYEDVPANNEAAL 244
Query: 209 AYIIQNQ------------------GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNS 250
+ NQ G+ G CGT LDH + +G+G DG YWL+KNS
Sbjct: 245 MKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIVAIGYGKDGDGTQYWLLKNS 304
Query: 251 WGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 281
WG TWG+ G++++ +D G+CG+ SYP A
Sbjct: 305 WGTTWGENGFLRMEKDISDKRGMCGLAMEPSYPTA 339
>gi|10336513|dbj|BAB13759.1| cysteine proteinase [Astragalus sinicus]
Length = 343
Score = 234 bits (598), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 131/324 (40%), Positives = 179/324 (55%), Gaps = 80/324 (24%)
Query: 33 VSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLG 92
V+SR+ + S+ E H++WM Q+ + Y D E E R +IFKEN+ YIE +NKEG R YKLG
Sbjct: 25 VTSRTLQDASMYERHQQWMGQYAKIYNDHQEWEKRFQIFKENVNYIETSNKEGGRFYKLG 84
Query: 93 TNQFSDLTNDEFRALYTGYK--MPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTP 150
NQF DLTN+EF A +K M S R+ +T+KY+N+ T VP+++DWR KGAVTP
Sbjct: 85 VNQFVDLTNEEFIAPRNRFKGHMCSSIIRT---NTYKYENV--TTVPSNVDWRQKGAVTP 139
Query: 151 IKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQ------------------------ 186
+K+Q +CGCCWAF+AVAA EGI ++ +G LI LSEQ
Sbjct: 140 VKDQGQCGCCWAFSAVAATEGIHQLSTGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFK 199
Query: 187 ----------------QLLDCSTNGNNGCLGGS-----------REKAFAYIIQNQ---- 215
Q +D + N N + + E+A + NQ
Sbjct: 200 FIIQNHGLDTEAKYPYQGVDGTCNANEASINAATITSYEDVPTNNEQALQKAVANQPISV 259
Query: 216 --------------GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYM 261
G+F G CGT+LDH VT VG+G ++DG YWL+KNSWG +WG+ GY+
Sbjct: 260 AIDASGSDFQFYTSGVFTGSCGTELDHGVTAVGYGVSDDGTKYWLVKNSWGTSWGEEGYI 319
Query: 262 KIVRD----EGLCGIGTRSSYPLA 281
++ R EGLCGI ++SYP+A
Sbjct: 320 RMQRGVDAVEGLCGIAMQASYPIA 343
>gi|224081320|ref|XP_002306369.1| predicted protein [Populus trichocarpa]
gi|222855818|gb|EEE93365.1| predicted protein [Populus trichocarpa]
Length = 340
Score = 234 bits (596), Expect = 4e-59, Method: Compositional matrix adjust.
Identities = 134/347 (38%), Positives = 188/347 (54%), Gaps = 83/347 (23%)
Query: 11 FKINTTPMFIIITLLVSCAS--QVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRL 68
++ FI + LL + ++R+ + S+ E HE+WMAQ+GR YKD+ EKE R
Sbjct: 1 MRLTKQSQFICLALLFVLGAWPSKSAARTLQDVSMYERHEQWMAQYGRVYKDDAEKETRY 60
Query: 69 KIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYK--MPSPSHRSTTSSTF 126
IFKEN+ I+ N + ++YKLG NQF+DL+N+EF+A +K M SP + F
Sbjct: 61 NIFKENVARIDAFNSQTGKSYKLGVNQFADLSNEEFKASRNRFKGHMCSPQ-----AGPF 115
Query: 127 KYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQ 186
+Y+N+S VP ++DWR KGAVTP+K+Q +CGCCWAF+AVAA+EGI ++ +G LI LSEQ
Sbjct: 116 RYENVSA--VPATMDWRKKGAVTPVKDQGQCGCCWAFSAVAAMEGINQLTTGKLISLSEQ 173
Query: 187 QLLDCSTNG-NNGCLGGSREKAFAYIIQNQGI----------FNGVCGTQLD--HAVTIV 233
+++DC T G + GC GG + AF +I QN+G+ +G C TQ + HA I
Sbjct: 174 EVVDCDTKGEDQGCNGGLMDDAFKFIEQNKGLTTEANYPYTGTDGTCNTQKEATHAAKIT 233
Query: 234 GF-------------------------------------------GTTED---------- 240
GF GT D
Sbjct: 234 GFEDVPANSEAALMKAVAKQPVSVAIDAGGFEFQFYSSGIFTGSCGTQLDHGVTAVGYGI 293
Query: 241 --GANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 281
G YWL+KNSWG WG+ GY+++ +D EGLCGI ++SYP A
Sbjct: 294 SDGTKYWLVKNSWGAQWGEEGYIRMQKDISAKEGLCGIAMQASYPSA 340
>gi|357471211|ref|XP_003605890.1| Cysteine proteinase [Medicago truncatula]
gi|355506945|gb|AES88087.1| Cysteine proteinase [Medicago truncatula]
Length = 343
Score = 234 bits (596), Expect = 4e-59, Method: Compositional matrix adjust.
Identities = 135/326 (41%), Positives = 176/326 (53%), Gaps = 84/326 (25%)
Query: 33 VSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANK-EGNRTYKL 91
V+SR T + + E H +WM+Q+G+ YKD E+E R KIF EN+ YIE NK + N+ Y L
Sbjct: 25 VTSR-TLQDDMYERHRQWMSQYGKVYKDSQEREKRFKIFTENVNYIEAFNKGDNNKLYTL 83
Query: 92 GTNQFSDLTNDEF---RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAV 148
G NQF+DLTNDEF R + G+ S T +STFKY+N S +P+S+DWR KGAV
Sbjct: 84 GVNQFADLTNDEFTSSRNKFKGHMCSSI----TRTSTFKYENASA--IPSSVDWRKKGAV 137
Query: 149 TPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLL------------------- 189
TP+KNQ +CGCCWAF+AVAA EGI K+ +G LI LSEQ+L+
Sbjct: 138 TPVKNQGQCGCCWAFSAVAATEGIHKLSTGKLISLSEQELVDCDTKGVDQGCEGGLMDDA 197
Query: 190 ---------------------DCSTNGNNGCLGG-----------SREKAFAYIIQNQ-- 215
D + N N G + + E+A + NQ
Sbjct: 198 FKFIIQNHGLNTEANYPYQGVDGTCNANKGSINAVTITGYEDVPTNNEQALQKAVANQPI 257
Query: 216 ----------------GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAG 259
G+F G CGT+LDH VT VG+G + DG YWL+KNSWG WG+ G
Sbjct: 258 SVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVSNDGTKYWLVKNSWGTEWGEEG 317
Query: 260 YMKIVRD----EGLCGIGTRSSYPLA 281
Y+ + R EGLCGI ++SYP A
Sbjct: 318 YIMMQRGVDAAEGLCGIAMQASYPTA 343
>gi|226508570|ref|NP_001141984.1| uncharacterized protein LOC100274134 precursor [Zea mays]
gi|194706676|gb|ACF87422.1| unknown [Zea mays]
gi|413920745|gb|AFW60677.1| vignain [Zea mays]
Length = 363
Score = 233 bits (595), Expect = 5e-59, Method: Compositional matrix adjust.
Identities = 124/328 (37%), Positives = 177/328 (53%), Gaps = 70/328 (21%)
Query: 22 ITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKA 81
+ V ++ V + E ++ ++KWMAQ+ R YKD+ EK R ++FK N E+I+++
Sbjct: 34 VAARVEPSTTVGRTTGGDEAMMMARYKKWMAQYRRKYKDDAEKAHRFQVFKANAEFIDRS 93
Query: 82 NKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPS--PS-HRSTTSSTFKYQNLSMTDVPT 138
N G + Y LGTNQF+DLT+ EF A+YTG + P+ PS + ++ KYQN + D
Sbjct: 94 NAGGKKKYVLGTNQFADLTSKEFAAMYTGLRKPAAVPSGAKQIPAAGSKYQNFTRLDDDV 153
Query: 139 SLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-TNGNN 197
+DWR +GAVTP+KNQ +CGCCWAF+AV A+EG+ I +GNL+ LSEQQ+LDC ++GN
Sbjct: 154 QVDWRQQGAVTPVKNQGQCGCCWAFSAVGAMEGLIMITTGNLVSLSEQQILDCDESDGNQ 213
Query: 198 GCLGG-----------------------------------------------SREKAFAY 210
GC GG E A A
Sbjct: 214 GCNGGYMDNAFQYVINNGGVTTEDAYPYSAVQGTCQNVQPAATISGFQDLPSGDENALAN 273
Query: 211 IIQNQ------------------GIFNGV-CGTQLDHAVTIVGFGTTEDGANYWLIKNSW 251
+ NQ GI++G CGT ++HAVT +G+G + G YW++KNSW
Sbjct: 274 AVANQPVSVGVDGGSSPFQFYQGGIYDGDGCGTDMNHAVTAIGYGADDQGTQYWILKNSW 333
Query: 252 GNTWGDAGYMKIVRDEGLCGIGTRSSYP 279
G WG+ G+M++ G CGI T +SYP
Sbjct: 334 GTGWGENGFMQLQMGVGACGISTMASYP 361
>gi|357160599|ref|XP_003578815.1| PREDICTED: vignain-like [Brachypodium distachyon]
Length = 339
Score = 233 bits (595), Expect = 5e-59, Method: Compositional matrix adjust.
Identities = 126/340 (37%), Positives = 178/340 (52%), Gaps = 74/340 (21%)
Query: 13 INTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFK 72
I + I+ L C+S + + + S+ HE WMAQ+GR YKD EK + ++FK
Sbjct: 3 IPKASILAILGCLCFCSSVLAARELNDDLSMAARHETWMAQYGRVYKDAAEKAQKFEVFK 62
Query: 73 ENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
N +I+ N E N + LG NQF+DLTN+EF+A T S +++ S+ FKY+NL
Sbjct: 63 ANARFIDSFNAE-NHKFWLGINQFADLTNEEFKATKTNKGFIS--NKARVSTGFKYENLK 119
Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
+ +PTS+DWR KGAVTP+K+Q +CGCCWAF+AVAA EGI K+ +G L+ LSEQ+L+DC
Sbjct: 120 IEALPTSIDWRTKGAVTPVKDQGQCGCCWAFSAVAATEGIVKLSTGKLVSLSEQELVDCD 179
Query: 193 ---------------------TNG-------------NNGCLGGSR-------------- 204
TNG + C GS+
Sbjct: 180 VHGEDQGCEGGLMDDAFKFIITNGGLTQESSYPYDAEDGKCKSGSKSAGTIKSYEDVPAN 239
Query: 205 -EKAFAYIIQNQ------------------GIFNGVCGTQLDHAVTIVGFGTTEDGANYW 245
E A + NQ G+ G CGT LDH + +G+G T DG +W
Sbjct: 240 NEGALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYGVTSDGTKFW 299
Query: 246 LIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 281
L+KNSWG TWG+ G++++ +D +G+CG+ SYP A
Sbjct: 300 LMKNSWGTTWGENGFLRMEKDIADKKGMCGLAMEPSYPTA 339
>gi|242070333|ref|XP_002450443.1| hypothetical protein SORBIDRAFT_05g005530 [Sorghum bicolor]
gi|241936286|gb|EES09431.1| hypothetical protein SORBIDRAFT_05g005530 [Sorghum bicolor]
Length = 351
Score = 233 bits (595), Expect = 5e-59, Method: Compositional matrix adjust.
Identities = 129/352 (36%), Positives = 186/352 (52%), Gaps = 81/352 (23%)
Query: 9 GSFKINTTPM------FIIITLLVSCASQVVSSRSTH------EQSVVEIHEKWMAQHGR 56
S+ N P+ + + + +C V++R E+++ HEKWM +HGR
Sbjct: 2 ASYIANNKPLITAAVALLTVLAIANCIGCAVAARDLSSSTGYGEEAMTARHEKWMVEHGR 61
Query: 57 SYKDELEKEMRLKIFKENLEYIEKANKE-GNRTYKLGTNQFSDLTNDEFRALYTGYKMPS 115
+YKDE EK R ++FK N +++ +N G + Y L N+F+D+T+DEF A YTG+K P
Sbjct: 62 TYKDEAEKARRFQVFKANAAFVDTSNAAAGGKKYHLAINRFADMTHDEFMARYTGFK-PL 120
Query: 116 PSHRSTTSSTFKYQNLSMT-DVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITK 174
P+ FKY N++++ + ++DWR KGAVT +KNQ++CGCCWAF+AVAA+EG+ +
Sbjct: 121 PATGKKMPG-FKYANVTLSSEDQQAVDWRKKGAVTDVKNQQKCGCCWAFSAVAAIEGMHQ 179
Query: 175 IRSGNLIQLSEQQLLDCSTN-------------------GNNGC---------------- 199
I +G L+ LSEQQL+DCSTN GNNG
Sbjct: 180 INTGELVSLSEQQLVDCSTNGNNNGCGGGTMEDAFQYVIGNNGIATEAAYPYTAMQGMCQ 239
Query: 200 -------------LGGSREKAFAYIIQNQGIFNGV-----------------CGTQLDHA 229
+ E A A + Q + V CGT L+HA
Sbjct: 240 NVQPAVAVRSYQQVPRDDEDALAAAVAGQPVSVAVDANNFQFYKGGVMTADSCGTNLNHA 299
Query: 230 VTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDEGLCGIGTRSSYPLA 281
VT VG+GT EDG YWL+KN WG+TWG+ GY+++ R G CG+ +SYP+A
Sbjct: 300 VTAVGYGTAEDGTPYWLLKNQWGSTWGEEGYLRLQRGVGACGVAKDASYPVA 351
>gi|356515038|ref|XP_003526208.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 339
Score = 233 bits (594), Expect = 7e-59, Method: Compositional matrix adjust.
Identities = 125/331 (37%), Positives = 177/331 (53%), Gaps = 72/331 (21%)
Query: 19 FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
++I+ L+++ + V SR E E HEKWMAQ+G+ Y D EKE R +IFK N+++I
Sbjct: 9 YLILFLILTVWTFHVMSRRLSEVCTSERHEKWMAQYGKLYTDAAEKEKRFQIFKNNVQFI 68
Query: 79 EKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPT 138
E N G++ + L NQF+DL N+EF+A + + T ++F+Y+ S+T +P
Sbjct: 69 ESFNAAGDKPFNLSINQFADLHNEEFKASLINVQKKESGVETATETSFRYE--SITKIPV 126
Query: 139 SLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDC------- 191
++DWR +GAVTPIK+Q CG CWAF+ VAA+EGI +I +G L+ LSEQ+L+DC
Sbjct: 127 TMDWRKRGAVTPIKDQGNCGSCWAFSIVAAIEGIHQITTGKLVSLSEQELVDCVKGKSEG 186
Query: 192 -------------STNG-------------NNGCL-----------------GGSREKAF 208
+ NG N C+ + EKA
Sbjct: 187 CNFGYKEEAFEFVAKNGGLASEISYPYKANNKTCMVKKETQGVAQIKGYENVPSNSEKAL 246
Query: 209 AYIIQNQ----------------GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWG 252
+ NQ GIF G CGT +HA T++G+G GA YWL+KNSWG
Sbjct: 247 LKAVANQPVSVYIDAGALQFYSSGIFTGKCGTAPNHAATVIGYGKARGGAKYWLVKNSWG 306
Query: 253 NTWGDAGYMKIVRD----EGLCGIGTRSSYP 279
WG+ GY+++ RD EGLCGI T +SYP
Sbjct: 307 TKWGEKGYIRMKRDIRAKEGLCGIATNASYP 337
>gi|125551397|gb|EAY97106.1| hypothetical protein OsI_19029 [Oryza sativa Indica Group]
Length = 350
Score = 231 bits (588), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 131/346 (37%), Positives = 185/346 (53%), Gaps = 79/346 (22%)
Query: 15 TTPMFIIITLLVSC------ASQVVSSRSTH-EQSVVEIHEKWMAQHGRSYKDELEKEMR 67
+ P+ + I + C + V ++R + ++ HE+WMAQHGR YKD EK R
Sbjct: 5 SKPLLLAILCCIVCLYSSSGGAIVAAARELGGDAAMAARHERWMAQHGRVYKDAAEKARR 64
Query: 68 LKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYK-MPSPSHRSTTSSTF 126
L++FK N+ +IE N G Y LG NQF+DLT++EF+A T K +P++ S+ F
Sbjct: 65 LEVFKANVAFIESFNAGGKNRYWLGVNQFADLTSEEFKATMTNSKGFSTPNNGVRVSTGF 124
Query: 127 KYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQ 186
KY+N+S +P S+DWR KGAVT IK+Q +CGCCWAF+AVAA+EGI K+ +G LI LSEQ
Sbjct: 125 KYENVSADALPASVDWRTKGAVTRIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLISLSEQ 184
Query: 187 QLLDCSTNGNN-GCLGGSREKAFAYIIQNQGIF----------NGVCGT--QLDHAVTIV 233
+L+DC +GN+ GC GG + AF +I+ N G+ +G C T D A +I
Sbjct: 185 ELVDCDVDGNDQGCEGGEIDGAFQFILSNGGLTAEANYPYTAEDGRCKTTAAADVAASIR 244
Query: 234 GF-----------------------------------------GTTED------------ 240
G+ GT+ D
Sbjct: 245 GYEDVPANDEPSLMKAVAGQPVSVAVDASKFQFYGGGVMAGECGTSLDHGVTVIGYGAAS 304
Query: 241 -GANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 281
G YWL+KNSWG TWG+AGY+++ +D G+CG+ + SYP A
Sbjct: 305 DGTKYWLVKNSWGTTWGEAGYLRMEKDIDDKRGMCGLAMQPSYPTA 350
>gi|357446977|ref|XP_003593764.1| Cysteine proteinase [Medicago truncatula]
gi|355482812|gb|AES64015.1| Cysteine proteinase [Medicago truncatula]
Length = 286
Score = 230 bits (586), Expect = 6e-58, Method: Compositional matrix adjust.
Identities = 122/266 (45%), Positives = 168/266 (63%), Gaps = 12/266 (4%)
Query: 22 ITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKA 81
I LL +CA +S R+ E SVVE H++WM ++ R+Y + E E R KIFKENLEYIE
Sbjct: 9 IILLWACAYPTMS-RTLTESSVVEAHQQWMMKYERTYTNSSEMEKRKKIFKENLEYIENF 67
Query: 82 NKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLD 141
N GN++YKLG N++SDLT++EF A +TG+K+ S S NL+ DVPT+ D
Sbjct: 68 NNVGNKSYKLGLNRYSDLTSEEFIASHTGFKVSDQLSDSKMRSVAIPFNLN-DDVPTNFD 126
Query: 142 WRDKGAVTPIKNQKECGCCWAFAAVAAVE----GITKIRSGNLIQLSEQQLLDCSTNGNN 197
WR+KG VT +KNQ++CGCCWAF AVAAVE G +I + +++Q L +
Sbjct: 127 WREKGVVTDVKNQRQCGCCWAFTAVAAVEGQIPGAAQINGYFKVPANDEQQLLRAVLQQP 186
Query: 198 GCLGGSREKAFAYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGD 257
+ S F + + G++ G CG +L+HAVTI+G+G +E G YWLIKNSWG TWG+
Sbjct: 187 VSVAISTSYDFHHYM--GGVYEGSCGPKLNHAVTIIGYGVSEAGKKYWLIKNSWGETWGE 244
Query: 258 AGYMKIVRDE----GLCGIGTRSSYP 279
GYMK++R+ G C I ++YP
Sbjct: 245 KGYMKVLRESSATGGQCSIAVHAAYP 270
>gi|413953668|gb|AFW86317.1| hypothetical protein ZEAMMB73_339067 [Zea mays]
Length = 433
Score = 229 bits (584), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 123/330 (37%), Positives = 171/330 (51%), Gaps = 73/330 (22%)
Query: 21 IITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEK 80
II C + + + + + +V HE+WMAQ+ R YKD EK R ++FK N+++IE
Sbjct: 104 IIGFAFFCGAAMAARDLSDDSVMVARHEQWMAQYSRVYKDASEKARRFEVFKANVQFIES 163
Query: 81 ANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSL 140
N GN + LG NQF+DLTNDEFR+ T + S + + T F+Y+N+S +PT++
Sbjct: 164 FNAGGNNKFWLGVNQFADLTNDEFRSTKTNKGLKSSNMKIPTG--FRYENVSADALPTTI 221
Query: 141 DWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQ------------- 187
DWR KGAVTPIK+Q +CGCCWAF+AVAA EGI KI +G L+ L+EQ+
Sbjct: 222 DWRTKGAVTPIKDQGQCGCCWAFSAVAATEGIVKISTGKLVSLAEQELVDCDVHGEDQGC 281
Query: 188 ---LLDCS------------------TNGNNGCLGGSR---------------EKAFAYI 211
L+D + T + C GS E A
Sbjct: 282 EGGLMDDAFKFIIKNGGLTTESSYPYTAADGKCKSGSNSAATIKGYEDVPANDEAALMKA 341
Query: 212 IQNQ------------------GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGN 253
+ NQ G+ G CGT LDH + +G+G T DG YWL+KNSWG
Sbjct: 342 VANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYGKTSDGTKYWLMKNSWGT 401
Query: 254 TWGDAGYMKIVRD----EGLCGIGTRSSYP 279
TWG+ GY+++ +D G+CG+ SYP
Sbjct: 402 TWGENGYLRMEKDISDKRGMCGLAMEPSYP 431
>gi|77554625|gb|ABA97421.1| Vignain precursor, putative [Oryza sativa Japonica Group]
gi|222630746|gb|EEE62878.1| hypothetical protein OsJ_17681 [Oryza sativa Japonica Group]
Length = 350
Score = 228 bits (581), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 129/344 (37%), Positives = 183/344 (53%), Gaps = 79/344 (22%)
Query: 15 TTPMFIIITLLVSC------ASQVVSSRSTH-EQSVVEIHEKWMAQHGRSYKDELEKEMR 67
+ P+ + I + C + V ++R + ++ HE+WMAQHGR YKD EK R
Sbjct: 5 SKPLLLAILCCIVCLYSSSGGAIVAAARELGGDAAMAARHERWMAQHGRVYKDAAEKARR 64
Query: 68 LKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYK-MPSPSHRSTTSSTF 126
L++FK N+ +IE N G Y LG NQF+DLT++EF+A T K +P++ S+ F
Sbjct: 65 LEVFKANVAFIESFNAGGKNRYWLGVNQFADLTSEEFKATMTNSKGFSTPNNGVRVSTGF 124
Query: 127 KYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQ 186
KY+N+S +P S+DWR KGAVT IK+Q +CGCCWAF+AVAA+EG K+ +G LI LSEQ
Sbjct: 125 KYENVSADALPASVDWRTKGAVTRIKDQGQCGCCWAFSAVAAMEGFVKLSTGKLISLSEQ 184
Query: 187 QLLDCSTNGNN-GCLGGSREKAFAYIIQNQGIF----------NGVCGT--QLDHAVTIV 233
+L+DC +GN+ GC GG + AF +I+ N G+ +G C T D A +I
Sbjct: 185 ELVDCDVDGNDQGCEGGEIDGAFQFILSNGGLTAEANYPYTAEDGRCKTTAAADVAASIR 244
Query: 234 GF-----------------------------------------GTTED------------ 240
G+ GT+ D
Sbjct: 245 GYEDVPANDEPSLMKAVAGQPVSVAVDASKFQFYGGGVMAGECGTSLDHGVTVIGYGAAS 304
Query: 241 -GANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 279
G YWL+KNSWG TWG+AGY+++ +D G+CG+ + SYP
Sbjct: 305 DGTKYWLVKNSWGTTWGEAGYLRMEKDIDDKRGMCGLAMQPSYP 348
>gi|357160569|ref|XP_003578807.1| PREDICTED: vignain-like [Brachypodium distachyon]
Length = 339
Score = 228 bits (581), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 124/340 (36%), Positives = 174/340 (51%), Gaps = 74/340 (21%)
Query: 13 INTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFK 72
I + I+ L C+S + + + S+V HE WM Q+GR YKD EK + ++FK
Sbjct: 3 IPKASLLAILGCLCFCSSVLAARELNDDLSMVARHESWMLQYGRVYKDAAEKASKFEVFK 62
Query: 73 ENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
N +I+ N GN + LG NQF+D+TN EF+A T S R+ T F Y+N+S
Sbjct: 63 ANAGFIDSFNA-GNHKFWLGINQFADITNKEFKATKTNKGFISNKVRAPTG--FSYENVS 119
Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
+P S+DWR KGAVTP+K+Q +CGCCWAF+AVAA EGI K+ +G L+ LSEQ+L+DC
Sbjct: 120 FDALPASIDWRTKGAVTPVKDQGQCGCCWAFSAVAATEGIVKLSTGKLVSLSEQELVDCD 179
Query: 193 TNG------------------NNG----------------CLGGSR-------------- 204
+G +NG C GS+
Sbjct: 180 VHGEDQGCEGGLMDDAFKFIISNGGLTQESSYPYDAEDGKCKSGSKSAGTIKSYEDVPAN 239
Query: 205 -EKAFAYIIQNQ------------------GIFNGVCGTQLDHAVTIVGFGTTEDGANYW 245
E A + NQ G+ G CGT LDH + +G+G T DG YW
Sbjct: 240 NEGALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYGVTSDGTKYW 299
Query: 246 LIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 281
L+KNSWG +WG+ G++++ +D +G+CG+ SYP A
Sbjct: 300 LMKNSWGTSWGENGFLRMEKDIADKKGMCGLAMEPSYPTA 339
>gi|84181681|gb|AAW78661.2| senescence-specific cysteine protease [Nicotiana tabacum]
Length = 349
Score = 228 bits (580), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 132/341 (38%), Positives = 180/341 (52%), Gaps = 78/341 (22%)
Query: 19 FIIITLLVSC----ASQVVSSRS-THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKE 73
++ + L C +SQV SR +E ++ H++W+ H + YKD EKE+R +IFKE
Sbjct: 9 YLCLALFFICLGLWSSQVALSRPINYEATMRARHDQWIVHHEKVYKDLNEKEVRFQIFKE 68
Query: 74 NLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
N+E IE N ++ YKLG N+FSDLTN+EFR L+TGYK P +++ ++ ++
Sbjct: 69 NVERIEAFNAGEDKGYKLGFNKFSDLTNEEFRVLHTGYKRSHPKVMTSSKGKTHFRYTNV 128
Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAA------------------------V 169
TD+P ++DWR KGAVTPIK+QKECGCCWAF+AVAA V
Sbjct: 129 TDIPPTMDWRKKGAVTPIKDQKECGCCWAFSAVAAMEGLHQLKTGELIPLSEQELVDCDV 188
Query: 170 EGITKIRSGNLIQLSEQQLL----------------DCSTNGNNGCLGGSR--------- 204
EG + SG L+ + +L D N L ++
Sbjct: 189 EGEDEGCSGGLLDTAFDFILKNKGLTTEVNYPYKGEDGVCNKKKSALSAAKITGYEDVPA 248
Query: 205 --EKAFAYIIQNQ------------------GIFNGVCGTQLDHAVTIVGFGTTEDGANY 244
EKA + NQ G+F+G C T L+HAVT VG+G T DG Y
Sbjct: 249 NSEKALLQAVANQPVSVAIDGSSFDFQFYSSGVFSGSCSTWLNHAVTAVGYGATTDGTKY 308
Query: 245 WLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 281
W+IKNSWG+ WGD+GYM+I RD EGLCG+ +SYP A
Sbjct: 309 WIIKNSWGSKWGDSGYMRIKRDVHEKEGLCGLAMDASYPTA 349
>gi|255563110|ref|XP_002522559.1| cysteine protease, putative [Ricinus communis]
gi|223538250|gb|EEF39859.1| cysteine protease, putative [Ricinus communis]
Length = 344
Score = 228 bits (580), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 117/232 (50%), Positives = 157/232 (67%), Gaps = 16/232 (6%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
+ ++ LLV+ + SRS HE S+ H+ WM Q+GR YK +EKE R KIFKEN+E+
Sbjct: 9 LVLMAMLLVTLWASQSWSRSLHEASMELRHKTWMTQYGRVYKGNVEKEKRFKIFKENVEF 68
Query: 78 IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRST-TSSTFKYQNLSMTDV 136
IE N GN+ YKLG N F+DLTN+EFRA + GY M SH+S+ + +F+Y+N+ T V
Sbjct: 69 IESFNNNGNKPYKLGINAFTDLTNEEFRASHNGYTMSMSSHQSSYRTKSFRYENV--TAV 126
Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG- 195
P SLDWR KGAVT IK+Q +CGCCWAF+AVAA+EGITK+ +G LI LSEQ+L+DC T+G
Sbjct: 127 PPSLDWRTKGAVTHIKDQGQCGCCWAFSAVAAMEGITKLSTGTLISLSEQELVDCDTSGM 186
Query: 196 NNGCLGGSREKAFAYIIQNQGI----------FNGVCGTQ--LDHAVTIVGF 235
+ GC GG + AF +II+N G+ +G C T+ +HA I G+
Sbjct: 187 DQGCEGGLMDDAFEFIIENNGLTTEANYPYEGVDGSCNTRKAANHAAKITGY 238
Score = 100 bits (250), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 45/81 (55%), Positives = 58/81 (71%), Gaps = 6/81 (7%)
Query: 205 EKAFAYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIV 264
E AF + + GIF G CGT+LDH VT+VG+GT++DG YWL+KNSWG +WG+ GY+++
Sbjct: 266 ESAFQHY--SSGIFTGDCGTELDHGVTVVGYGTSDDGTKYWLVKNSWGTSWGEDGYIRME 323
Query: 265 RD----EGLCGIGTRSSYPLA 281
RD EGLCGI SYP A
Sbjct: 324 RDIDAKEGLCGIAMEPSYPTA 344
>gi|297819568|ref|XP_002877667.1| hypothetical protein ARALYDRAFT_348033 [Arabidopsis lyrata subsp.
lyrata]
gi|297323505|gb|EFH53926.1| hypothetical protein ARALYDRAFT_348033 [Arabidopsis lyrata subsp.
lyrata]
Length = 341
Score = 226 bits (577), Expect = 6e-57, Method: Compositional matrix adjust.
Identities = 134/343 (39%), Positives = 191/343 (55%), Gaps = 79/343 (23%)
Query: 15 TTPMFIIITLLVSCASQVVSSRS-THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKE 73
T+ +F ++ +++S + +SR E S +E HE+WM++ R Y D+ EK R +IFK+
Sbjct: 2 TSIIFFLLAIILSSRTSGATSRGGLFEASAIEKHEQWMSRFHRVYSDDSEKTSRFEIFKK 61
Query: 74 NLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHR-STTSS----TFKY 128
NL+++E N N+TY L N+FSDLT++EF+A YTG +P R STT S +F+Y
Sbjct: 62 NLKFVESFNMNTNKTYTLDVNEFSDLTDEEFKARYTGLVVPEGMTRMSTTDSHETVSFRY 121
Query: 129 QNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQL 188
+N+ T S+DWR++GAVT +K+Q++CGCCWAF+AVAAVEG+TKI G L+ LSEQQL
Sbjct: 122 ENVGETG--ESMDWREEGAVTSVKHQQQCGCCWAFSAVAAVEGMTKIAKGELVSLSEQQL 179
Query: 189 LDCSTNGNNGCLGGSREKAFAYIIQNQGI-------FNGV---CGTQLDHAVTIVGFGTT 238
LDCST N+GC GG KAF YI++NQGI + G C + A TI G+ T
Sbjct: 180 LDCSTE-NDGCDGGIMWKAFDYIVENQGITAEDNYPYQGAQQTCESNHVAAATISGYETV 238
Query: 239 E--------------------DGANYWLI------------------------------- 247
+G+ Y I
Sbjct: 239 PQNDEEALLKAVSQQPVSVAIEGSGYEFIHYSGGIFNGECGTHLNHAVTIVGYGVSEEGI 298
Query: 248 -----KNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 281
KNSWG +WG+ GYM+I+RD +G+CG+ + + YP+A
Sbjct: 299 KYWLLKNSWGESWGEDGYMRIMRDVDAPQGMCGLASLAYYPVA 341
>gi|356543114|ref|XP_003540008.1| PREDICTED: LOW QUALITY PROTEIN: KDEL-tailed cysteine endopeptidase
CEP1-like [Glycine max]
Length = 343
Score = 226 bits (577), Expect = 7e-57, Method: Compositional matrix adjust.
Identities = 132/331 (39%), Positives = 179/331 (54%), Gaps = 84/331 (25%)
Query: 28 CASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNR 87
C SQV SR H+ S+ E HE+WM ++G+ YKD E + R IF+ N+E+IE N GN+
Sbjct: 20 CTSQV-KSRKLHDASMYERHEQWMEKYGKVYKDSAEMQKRFLIFENNVEFIESFNAAGNK 78
Query: 88 TYKLGTNQFSDLTNDEFRALYTGYKMPSPSH----RSTTSSTFKYQNLSMTDVPTSLDWR 143
YKL N +D TN+EF A + GYK SH R TT + FKY+N+ TD+P ++DWR
Sbjct: 79 PYKLSINHLADQTNEEFMASHKGYK---GSHWQGLRITTQTPFKYENV--TDIPWAVDWR 133
Query: 144 DKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGS 203
KG VT IK+Q +CG CWAF+AVAA EGI +I +GNL+ LSE++L+DC + ++GC GG
Sbjct: 134 QKGDVTSIKDQAQCGNCWAFSAVAATEGIYQITTGNLVSLSEKELVDCDSV-DHGCDGGL 192
Query: 204 REKAFAYIIQNQGI----------FNGVCGT----------------------------- 224
E F +II+N GI NG C T
Sbjct: 193 MEHGFEFIIKNGGISSEANYPYTAVNGTCDTNKEASPVAQITGYETVPVNCEEELQKAVA 252
Query: 225 -QLDHAVTI-----------------------------VGFGTTEDGANYWLIKNSWGNT 254
QL +V+I VG+G+T+ G YW++KNSWG
Sbjct: 253 NQLTMSVSIDAGGSAFQFYPSGVFTGQCGTQLDHGVTAVGYGSTDYGTQYWIVKNSWGTQ 312
Query: 255 WGDAGYMKIVR----DEGLCGIGTRSSYPLA 281
WG+ GY++++R EGLCGI +SYP A
Sbjct: 313 WGEEGYIRMLRGIDAQEGLCGIAMDASYPTA 343
>gi|413944253|gb|AFW76902.1| hypothetical protein ZEAMMB73_056195 [Zea mays]
Length = 340
Score = 226 bits (577), Expect = 7e-57, Method: Compositional matrix adjust.
Identities = 125/335 (37%), Positives = 173/335 (51%), Gaps = 77/335 (22%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
+ +++ C + + + + ++V HE+WMAQ+ R YKD EK R ++FK N+++
Sbjct: 8 ILAVLSFAFFCGAALAARDLNEDSAMVARHEQWMAQYSRVYKDAAEKARRFEVFKANVKF 67
Query: 78 IEKANKEGNRTYKLGTNQFSDLTNDEFRALYT--GYKMPSPSHRSTTSSTFKYQNLSMTD 135
IE N GNR + LG NQF+DLTNDEFR T G+K PS ST F+Y+N+S+
Sbjct: 68 IESFNTGGNRKFWLGINQFADLTNDEFRTTKTNKGFK-PSLDKVSTG---FRYENVSVDA 123
Query: 136 VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLS----------- 184
+P ++DWR GAVTPIK+Q +CGCCWAF+AVAA EGI KI +G LI LS
Sbjct: 124 IPATIDWRTNGAVTPIKDQGQCGCCWAFSAVAATEGIVKISTGKLISLSEQELVDCDVHG 183
Query: 185 -----EQQLLDCS------------------TNGNNGCLGGSR---------------EK 206
E L+D + T + C GS E
Sbjct: 184 EDQGCEGGLMDDAFKFIIKNGGLTTESNYPYTAADGKCKSGSNSAANIKGYEDVPTNDEA 243
Query: 207 AFAYIIQNQ------------------GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIK 248
A + NQ G+ G CGT LDH + +G+G T DG YWL+K
Sbjct: 244 ALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYGKTSDGTKYWLMK 303
Query: 249 NSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 279
NSWG TWG+ GY+++ +D +G+CG+ SYP
Sbjct: 304 NSWGTTWGENGYLRMEKDISDKKGMCGLAMEPSYP 338
>gi|357160591|ref|XP_003578813.1| PREDICTED: vignain-like [Brachypodium distachyon]
Length = 339
Score = 226 bits (576), Expect = 9e-57, Method: Compositional matrix adjust.
Identities = 122/340 (35%), Positives = 177/340 (52%), Gaps = 74/340 (21%)
Query: 13 INTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFK 72
I + I+ L AS + + + S+V HE WM+Q+GRSYKD EK+ + ++FK
Sbjct: 3 IPNASLLAILGCLCFFASGLAARELNDDLSMVARHESWMSQYGRSYKDAAEKDRKFEVFK 62
Query: 73 ENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
N +I+ N + N + LG NQF+D+TN+EF+ T S R++T F Y+N+S
Sbjct: 63 ANAAFIDSFNAK-NHKFWLGINQFADITNEEFKVTKTNKGFISNKVRASTG--FSYENVS 119
Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
+ +P ++DWR KGAVTP+K+Q +CGCCWAF+AVAA EGI K+ +G L+ LSEQ+L+DC
Sbjct: 120 IDALPATIDWRTKGAVTPVKDQGQCGCCWAFSAVAATEGIVKLSTGKLVSLSEQELVDCD 179
Query: 193 ---------------------TNG-------------NNGCLGGSR-------------- 204
TNG + C GS+
Sbjct: 180 VHGEDQGCEGGLMDDAFKFIITNGGLTQESSYPYDAEDGKCKSGSKSAGTIKSYEDVPAN 239
Query: 205 -EKAFAYIIQNQ------------------GIFNGVCGTQLDHAVTIVGFGTTEDGANYW 245
E A + NQ G+ G CGT LDH + +G+G T DG YW
Sbjct: 240 NEGALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYGVTSDGTKYW 299
Query: 246 LIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 281
L+KNSWG +WG+ G++++ +D +G+CG+ SYP A
Sbjct: 300 LMKNSWGTSWGENGFLRMEKDIADKKGMCGLAMEPSYPTA 339
>gi|224076970|ref|XP_002305073.1| predicted protein [Populus trichocarpa]
gi|222848037|gb|EEE85584.1| predicted protein [Populus trichocarpa]
Length = 340
Score = 226 bits (575), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 114/220 (51%), Positives = 150/220 (68%), Gaps = 13/220 (5%)
Query: 12 KINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIF 71
K NT + +L + A+++ ++ +++ HE+WMAQHGR Y D EKE R IF
Sbjct: 5 KCNTRIFLPFLLILAAWATKIACRPLDEQEYMLKRHEEWMAQHGRVYGDMKEKEKRYLIF 64
Query: 72 KENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNL 131
KEN+E IE N +R YKLG N+F+DLTN+EFRA+Y GYK S SS+F+Y+NL
Sbjct: 65 KENIERIEAFNNGSDRGYKLGVNKFADLTNEEFRAMYHGYKRQSS---KLMSSSFRYENL 121
Query: 132 SMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDC 191
S D+PTS+DWR+ GAVTP+K+Q CGCCWAF+ VAA+EGI K+++GNLI LSEQQL+DC
Sbjct: 122 S--DIPTSMDWRNDGAVTPVKDQGTCGCCWAFSTVAAIEGIIKLQTGNLISLSEQQLVDC 179
Query: 192 STNGNNGCLGGSREKAFAYIIQNQGI-------FNGVCGT 224
T GN GC GG + AF YII+N G+ + GV GT
Sbjct: 180 -TAGNKGCQGGLMDTAFQYIIRNGGLTSEDNYPYQGVDGT 218
Score = 94.0 bits (232), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 48/122 (39%), Positives = 69/122 (56%), Gaps = 14/122 (11%)
Query: 164 AAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIFNGVCG 223
A + E + + L+Q +Q + +G GG+ + G+FNG CG
Sbjct: 229 AQITGYEDVPQNNENALLQAVAKQPVSVGVDG-----GGND-----FQFYKSGVFNGDCG 278
Query: 224 TQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR----DEGLCGIGTRSSYP 279
TQ +HAVT +G+GT DG +YWL+KNSWG +WG+ GYM++ R EGLCG+ +SYP
Sbjct: 279 TQQNHAVTAIGYGTDIDGTDYWLVKNSWGTSWGENGYMRMRRGIGSSEGLCGVAMDASYP 338
Query: 280 LA 281
A
Sbjct: 339 TA 340
>gi|414588007|tpg|DAA38578.1| TPA: hypothetical protein ZEAMMB73_159244 [Zea mays]
Length = 307
Score = 225 bits (573), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 117/308 (37%), Positives = 167/308 (54%), Gaps = 74/308 (24%)
Query: 43 VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTND 102
+ E HE+WMA++ R YKD EK R ++FK+N ++E N + + LG NQF+DLT +
Sbjct: 1 MAERHERWMAEYDRVYKDAAEKARRFEVFKDNFAFVESFNADKKNKFWLGVNQFADLTTE 60
Query: 103 EFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWA 162
EF+A G+K S TT FKY+NLS++ +PT++DWR KGAVTPIKNQ +CGCCWA
Sbjct: 61 EFKA-NKGFKPISAEEVPTTG--FKYENLSVSALPTAVDWRTKGAVTPIKNQGQCGCCWA 117
Query: 163 FAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG------------------NNG------ 198
F+A+AA+EGI K+ +GNL+ LSEQ+ +DC T+ NG
Sbjct: 118 FSAIAAMEGIVKLSTGNLVSLSEQEPVDCDTHNMDEGCEGGWMDNAFEFVIKNGGLATES 177
Query: 199 ----------CLGGSR---------------EKAFAYIIQNQ------------------ 215
C GGS+ E A ++ +Q
Sbjct: 178 SYPYKVVDGKCKGGSKSAATIKGHEDVPPNNEAALMKVVASQPVSVAVDASDRTFMLYSG 237
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCG 271
G+ G CGTQLDH + +G+G D YW++KNSWG TWG+ G++++ +D G+C
Sbjct: 238 GVMTGSCGTQLDHGIAAIGYGVESDDTKYWILKNSWGTTWGEKGFLRMEKDISDKRGMCD 297
Query: 272 IGTRSSYP 279
+ + SYP
Sbjct: 298 LAMKPSYP 305
>gi|3377950|emb|CAA08861.1| cysteine proteinase precursor, AN11 [Ananas comosus]
Length = 357
Score = 225 bits (573), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 120/332 (36%), Positives = 171/332 (51%), Gaps = 73/332 (21%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
+F+ + L V AS +SR +++ E+WMA++GR YKD EK R +IFK N+ +
Sbjct: 8 VFLFLFLCVMWASPSAASRDEPSDPMMKRFEEWMAEYGRVYKDNDEKMRRFQIFKNNVNH 67
Query: 78 IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
IE N +Y LG NQF+D+TN+EF A YTG +P R S + ++ ++ VP
Sbjct: 68 IETFNSRNGNSYTLGINQFTDMTNNEFVAQYTGVSLPLNIEREPVVS---FDDVDISAVP 124
Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS----- 192
S+DWR+ GAVT +KN CG CWAFAA+A VE I KI+ G LI LSEQQ+LDC+
Sbjct: 125 QSIDWRNYGAVTSVKNHIPCGSCWAFAAIATVESIYKIKRGYLISLSEQQVLDCAVSYGC 184
Query: 193 ----------------------------TNGNNGC----------------LGGSREKAF 208
+ G C + + E++
Sbjct: 185 DGGWVNKAYDFIISNKGVASAAIYPYKASQGQGTCRINGVPNSAYITGYTRVQSNNERSM 244
Query: 209 AYIIQNQ-----------------GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSW 251
Y + NQ G+F+G CGT L+HA+TI+G+G G +W+++NSW
Sbjct: 245 MYAVSNQPIAASIEASGDFQHYKRGVFSGPCGTSLNHAITIIGYGQDSSGKKFWIVRNSW 304
Query: 252 GNTWGDAGYMKIVRD----EGLCGIGTRSSYP 279
G +WG+ GY+++ RD GLCGI R YP
Sbjct: 305 GASWGERGYIRMARDVSSSSGLCGIAIRPLYP 336
>gi|242068363|ref|XP_002449458.1| hypothetical protein SORBIDRAFT_05g013840 [Sorghum bicolor]
gi|241935301|gb|EES08446.1| hypothetical protein SORBIDRAFT_05g013840 [Sorghum bicolor]
Length = 350
Score = 223 bits (569), Expect = 5e-56, Method: Compositional matrix adjust.
Identities = 109/243 (44%), Positives = 157/243 (64%), Gaps = 13/243 (5%)
Query: 10 SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+F + ++T++V S+ E+++ H++WMA+HGR+YKDE EK R +
Sbjct: 12 TFTAAALMILAVMTMVVEARDLSTSTGGYGEEAMKVRHQQWMAEHGRTYKDEAEKARRFQ 71
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQ 129
+FK N ++++++N G ++Y+L N+F+D+TNDEF A+YTG K P P+ + FKY+
Sbjct: 72 VFKANADFVDRSNAAGGKSYELAINEFADMTNDEFVAMYTGLK-PVPAGPKKMAG-FKYE 129
Query: 130 NLSMTDVPT-SLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQL 188
NL+++DV ++DWR KGAVT IKNQ +CGCCWAFAAVAAVE I +I +GNL+ LSEQQ+
Sbjct: 130 NLTLSDVDQQAVDWRQKGAVTGIKNQGQCGCCWAFAAVAAVESIHQITTGNLVSLSEQQV 189
Query: 189 LDCSTNGNNGCLGGSREKAFAYIIQNQGI----------FNGVCGTQLDHAVTIVGFGTT 238
LDC T+GNNGC GG + AF YII N G+ G C + + AVTI +
Sbjct: 190 LDCDTDGNNGCNGGYIDNAFQYIISNGGLATEDAYPYAAAQGTCQSSVQPAVTISSYQDV 249
Query: 239 EDG 241
G
Sbjct: 250 PSG 252
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 32/61 (52%), Positives = 43/61 (70%), Gaps = 1/61 (1%)
Query: 222 CGT-QLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDEGLCGIGTRSSYPL 280
CGT L+HAVT VG+ T EDG YWL+KN WG WG+ GY+++ R CG+ ++SYP+
Sbjct: 289 CGTPSLNHAVTAVGYSTAEDGTPYWLLKNQWGQNWGEGGYLRVERGTNACGVAQQASYPV 348
Query: 281 A 281
A
Sbjct: 349 A 349
>gi|225443827|ref|XP_002274223.1| PREDICTED: vignain-like [Vitis vinifera]
Length = 340
Score = 223 bits (568), Expect = 7e-56, Method: Compositional matrix adjust.
Identities = 112/202 (55%), Positives = 148/202 (73%), Gaps = 7/202 (3%)
Query: 19 FIIITLLVSC--ASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLE 76
I ITLL+ ASQ +S R+ HE S+ E HE WM +GR+YKD EKE R KIFKEN+E
Sbjct: 7 IICITLLIMGVWASQALS-RTLHEVSMSERHEDWMGLYGRTYKDIAEKERRFKIFKENVE 65
Query: 77 YIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDV 136
YIE N GNR YKL N+F+D TN+EF+A GY M S RS+ ++F+Y+N++ V
Sbjct: 66 YIESVNSAGNRRYKLSINEFADQTNEEFKASRNGYNMSSRP-RSSEITSFRYENVAA--V 122
Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG- 195
P+S+DWR KGAVTPIK+Q +CGCCWAF+AVAA+EG+T++++G LI LSEQ+L+DC T+G
Sbjct: 123 PSSMDWRKKGAVTPIKDQGQCGCCWAFSAVAAMEGVTQLKTGELISLSEQELVDCDTSGE 182
Query: 196 NNGCLGGSREKAFAYIIQNQGI 217
+ GC GG + AF +II N G+
Sbjct: 183 DQGCGGGLMDSAFEFIIGNGGL 204
Score = 94.4 bits (233), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 41/72 (56%), Positives = 51/72 (70%), Gaps = 4/72 (5%)
Query: 214 NQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR----DEGL 269
+ G+F G CGT+LDH VT VG+G T+DG YWL+KNSWG WG+ GY+ + R DEGL
Sbjct: 269 SSGVFTGQCGTELDHGVTAVGYGKTDDGTKYWLVKNSWGTGWGEDGYIWMERDIGADEGL 328
Query: 270 CGIGTRSSYPLA 281
CGI +SYP A
Sbjct: 329 CGIAMEASYPTA 340
>gi|18408828|ref|NP_566920.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|12324451|gb|AAG52191.1|AC012329_18 putative cysteine proteinase; 15366-14136 [Arabidopsis thaliana]
gi|6723404|emb|CAB66413.1| cysteine protease-like protein [Arabidopsis thaliana]
gi|332645009|gb|AEE78530.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 341
Score = 223 bits (567), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 136/343 (39%), Positives = 188/343 (54%), Gaps = 79/343 (23%)
Query: 15 TTPMFIIITLLVSCASQVVSSRS-THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKE 73
T+ +F ++ +L+S + V+SR E S VE HE+WM++ R Y D+ EK R +IF
Sbjct: 2 TSIVFFLLAILLSSRTSGVTSRGGLFEASAVEKHEQWMSRFNRVYSDDSEKTSRFEIFTN 61
Query: 74 NLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHR-STTSS----TFKY 128
NL+++E N N+TY L N+FSDLT++EF+A YTG +P R STT S +F+Y
Sbjct: 62 NLKFVESINMNTNKTYTLDVNEFSDLTDEEFKARYTGLVVPEGMTRISTTDSHETVSFRY 121
Query: 129 QNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQL 188
+N+ T S+DW +GAVT +K+Q++CGCCWAF+AVAAVEG+TKI +G L+ LSEQQL
Sbjct: 122 ENVGETG--ESMDWIQEGAVTSVKHQQQCGCCWAFSAVAAVEGMTKIANGELVSLSEQQL 179
Query: 189 LDCSTNGNNGCLGGSREKAFAYIIQNQGI-------FNGV---CGTQLDHAVTIVGFGTT 238
LDCST NNGC GG KAF YI +NQGI + G C + A TI G+ T
Sbjct: 180 LDCSTE-NNGCGGGIMWKAFDYIKENQGITTEDNYPYQGAQQTCESNHLAAATISGYETV 238
Query: 239 E--------------------DGANYWLI------------------------------- 247
+G+ Y I
Sbjct: 239 PQNDEEALLKAVSQQPVSVAIEGSGYEFIHYSGGIFNGECGTQLTHAVTIVGYGVSEEGI 298
Query: 248 -----KNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 281
KNSWG +WG+ GYM+I+RD +G+CG+ + + YP+A
Sbjct: 299 KYWLLKNSWGESWGENGYMRIMRDVDSPQGMCGLASLAYYPVA 341
>gi|124484401|dbj|BAF46311.1| cysteine proteinase precursor [Ipomoea nil]
Length = 339
Score = 222 bits (566), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 104/205 (50%), Positives = 147/205 (71%), Gaps = 6/205 (2%)
Query: 14 NTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKE 73
N+ + I + L+ + ++ + +SR+ + + HE+WMAQ+GR YK+E+EK R IFKE
Sbjct: 4 NSLKLLIALALVFATSAYLATSRTLLDSLMAVRHEQWMAQYGRVYKNEVEKTKRYNIFKE 63
Query: 74 NLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
N+EYIE NK G + YKLG N F+DLTN EF A GY +P H ++++ F+Y+N+S
Sbjct: 64 NVEYIESFNKAGTKPYKLGINAFADLTNKEFIASRNGYILP---HECSSNTPFRYENVSA 120
Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST 193
VPT++DWR KGAVTP+K+Q +CGCCWAF+AVAA+EGITK+ +GNLI LSEQ+L+DC
Sbjct: 121 --VPTTVDWRKKGAVTPVKDQGQCGCCWAFSAVAAMEGITKLSTGNLISLSEQELVDCDV 178
Query: 194 NG-NNGCLGGSREKAFAYIIQNQGI 217
G + GC GG + AF +II N+G+
Sbjct: 179 KGIDQGCEGGLMDDAFTFIINNKGL 203
Score = 97.8 bits (242), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 41/72 (56%), Positives = 54/72 (75%), Gaps = 4/72 (5%)
Query: 214 NQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGL 269
+ G+F G CGT+LDH VT VG+G EDG+ YWL+KNSWG +WG+ GY+++ +D EGL
Sbjct: 268 SSGVFTGECGTELDHGVTAVGYGIAEDGSKYWLVKNSWGTSWGEKGYIRMQKDIEAKEGL 327
Query: 270 CGIGTRSSYPLA 281
CGI +SSYP A
Sbjct: 328 CGIAMQSSYPSA 339
>gi|255568299|ref|XP_002525124.1| cysteine protease, putative [Ricinus communis]
gi|223535583|gb|EEF37251.1| cysteine protease, putative [Ricinus communis]
Length = 342
Score = 221 bits (564), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 115/245 (46%), Positives = 157/245 (64%), Gaps = 19/245 (7%)
Query: 19 FIIITLLVSCA--SQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLE 76
F++I L A + S+R HE ++VE HEKWMA+HG+ YKD+ EK R +IFK N+E
Sbjct: 9 FLLIALFFVLAMWADQASTRELHESTMVERHEKWMAKHGKVYKDDEEKLRRFQIFKNNVE 68
Query: 77 YIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDV 136
+IE +N GN +Y LG N+F+DLTN+EFRA + GYK P + R T FKY+N+ T +
Sbjct: 69 FIESSNAAGNNSYMLGINRFADLTNEEFRASWNGYKRPLDASRIVT--PFKYENV--TAL 124
Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG- 195
P S+DWR KGAVT IK+Q+ECG CWAF+AVAA EG+ K+R+G L+ LSEQ+L+DC G
Sbjct: 125 PYSMDWRRKGAVTSIKDQRECGSCWAFSAVAATEGVHKLRTGKLVSLSEQELVDCDVKGE 184
Query: 196 NNGCLGGSREKAFAYIIQNQGIF----------NGVCGTQLD--HAVTIVGFGTTEDGAN 243
+ GC GG E AF +I +N GI +G C T+ + H I G+ + +
Sbjct: 185 DKGCQGGLMEDAFKFIKRNGGITTEANYAYRGRDGKCDTKKEASHVAKITGYQVVPENSE 244
Query: 244 YWLIK 248
L+K
Sbjct: 245 AALLK 249
Score = 84.3 bits (207), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 35/79 (44%), Positives = 51/79 (64%), Gaps = 4/79 (5%)
Query: 207 AFAYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD 266
+ ++ GI+ G CG+ L+H V VG+GT+ G+ YW++KNSWG WG+ GY+++ RD
Sbjct: 264 SMSFQFYQSGIYAGSCGSDLNHGVAAVGYGTSSSGSKYWIVKNSWGPEWGERGYVRMKRD 323
Query: 267 ----EGLCGIGTRSSYPLA 281
+GLCGI SYP A
Sbjct: 324 ITSRKGLCGIAMDCSYPTA 342
>gi|24285904|gb|AAL14199.1| cysteine proteinase precursor [Ipomoea batatas]
gi|56961686|gb|AAK15148.2| cysteine proteinase-like protein [Ipomoea batatas]
Length = 341
Score = 221 bits (562), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 102/190 (53%), Positives = 143/190 (75%), Gaps = 6/190 (3%)
Query: 29 ASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRT 88
++ + +SR+ + +V HE+WMAQ+GR Y++E+EK R IFKEN+EYIE NK G +
Sbjct: 21 SAYLATSRTLSDSLMVVRHEQWMAQYGRVYENEVEKTKRFNIFKENVEYIESFNKAGTKP 80
Query: 89 YKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAV 148
YKLG N F+DLTN EF+A GYK+P H ++++ F+Y+N+S VPT++DWR KGAV
Sbjct: 81 YKLGINAFADLTNQEFKASRNGYKLP---HDCSSNTPFRYENVS--SVPTTVDWRTKGAV 135
Query: 149 TPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKA 207
TP+K+Q +CGCCWAF+AVAA+EGITK+ +GNLI LSEQ+L+DC G + GC GG + A
Sbjct: 136 TPVKDQGQCGCCWAFSAVAAMEGITKLSTGNLISLSEQELVDCDVKGIDQGCEGGLMDDA 195
Query: 208 FAYIIQNQGI 217
F++II N+G+
Sbjct: 196 FSFIINNKGL 205
Score = 97.8 bits (242), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 41/70 (58%), Positives = 53/70 (75%), Gaps = 4/70 (5%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCG 271
G+F G CGT+LDH VT VG+G EDG+ YWL+KNSWG +WG+ GY+++ +D EGLCG
Sbjct: 272 GVFTGECGTELDHGVTAVGYGIAEDGSKYWLVKNSWGTSWGEKGYIRMQKDIEAKEGLCG 331
Query: 272 IGTRSSYPLA 281
I +SSYP A
Sbjct: 332 IAMQSSYPSA 341
>gi|13432122|sp|P80884.2|ANAN_ANACO RecName: Full=Ananain; Flags: Precursor
gi|2623956|emb|CAA05487.1| Ananain precursor [Ananas comosus]
Length = 345
Score = 221 bits (562), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 120/330 (36%), Positives = 168/330 (50%), Gaps = 71/330 (21%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
+F+ + L V AS +S +++ E+WMA++GR YKD EK +R +IFK N+ +
Sbjct: 8 VFLFLFLCVMWASPSAASCDEPSDPMMKQFEEWMAEYGRVYKDNDEKMLRFQIFKNNVNH 67
Query: 78 IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
IE N +Y LG NQF+D+TN+EF A YTG +P R S + ++ ++ VP
Sbjct: 68 IETFNNRNGNSYTLGINQFTDMTNNEFVAQYTGLSLPLNIKREPVVS---FDDVDISSVP 124
Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLD------- 190
S+DWRD GAVT +KNQ CG CWAFA++A VE I KI+ GNL+ LSEQQ+LD
Sbjct: 125 QSIDWRDSGAVTSVKNQGRCGSCWAFASIATVESIYKIKRGNLVSLSEQQVLDCAVSYGC 184
Query: 191 --------------------------------CSTNGNNGCLGGSR--------EKAFAY 210
C TNG +R E+ Y
Sbjct: 185 KGGWINKAYSFIISNKGVASAAIYPYKAAKGTCKTNGVPNSAYITRYTYVQRNNERNMMY 244
Query: 211 IIQNQ-----------------GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGN 253
+ NQ G+F G CGT+L+HA+ I+G+G G +W+++NSWG
Sbjct: 245 AVSNQPIAAALDASGNFQHYKRGVFTGPCGTRLNHAIVIIGYGQDSSGKKFWIVRNSWGA 304
Query: 254 TWGDAGYMKIVRDE----GLCGIGTRSSYP 279
WG+ GY+++ RD GLCGI YP
Sbjct: 305 GWGEGGYIRLARDVSSSFGLCGIAMDPLYP 334
>gi|13491750|gb|AAK27968.1|AF242372_1 cysteine protease [Ipomoea batatas]
Length = 339
Score = 220 bits (561), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 103/190 (54%), Positives = 141/190 (74%), Gaps = 6/190 (3%)
Query: 29 ASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRT 88
++ + +SR+ + +V HE+WMAQ+GR YK E EK R IFKEN+EYIE NK G +
Sbjct: 19 SAYLATSRTLSDSLMVVRHEQWMAQYGRVYKTEAEKTKRFNIFKENVEYIESFNKAGTKP 78
Query: 89 YKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAV 148
YKLG N F+DLTN EF+A GYK+P H ++++ F+Y+N+S VPT++DWR KGAV
Sbjct: 79 YKLGINAFADLTNQEFKASRNGYKLP---HDCSSNTPFRYENVS--SVPTTVDWRTKGAV 133
Query: 149 TPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKA 207
TP+K+Q +CGCCWAF+AVAA+EGITK+ +GNLI LSEQ+L+DC G + GC GG + A
Sbjct: 134 TPVKDQGQCGCCWAFSAVAAMEGITKLSTGNLISLSEQELVDCDVKGTDQGCEGGLMDDA 193
Query: 208 FAYIIQNQGI 217
F++II N+G+
Sbjct: 194 FSFIINNKGL 203
Score = 97.8 bits (242), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 41/70 (58%), Positives = 53/70 (75%), Gaps = 4/70 (5%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCG 271
G+F G CGT+LDH VT VG+G EDG+ YWL+KNSWG +WG+ GY+++ +D EGLCG
Sbjct: 270 GVFTGECGTELDHGVTAVGYGIAEDGSKYWLVKNSWGTSWGEKGYIRMQKDIEAKEGLCG 329
Query: 272 IGTRSSYPLA 281
I +SSYP A
Sbjct: 330 IAMQSSYPSA 339
>gi|75277440|sp|O23791.1|BROM1_ANACO RecName: Full=Fruit bromelain; AltName: Allergen=Ana c 2; Flags:
Precursor
gi|2342496|dbj|BAA21849.1| bromelain [Ananas comosus]
Length = 351
Score = 220 bits (561), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 117/330 (35%), Positives = 173/330 (52%), Gaps = 71/330 (21%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
+F+ + L AS +SR +++ E+WMA++GR YKD+ EK R +IFK N+++
Sbjct: 8 VFLFLFLCAMWASPSAASRDEPNDPMMKRFEEWMAEYGRVYKDDDEKMRRFQIFKNNVKH 67
Query: 78 IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
IE N +Y LG NQF+D+T EF A YTG +P R S + +++++ VP
Sbjct: 68 IETFNSRNENSYTLGINQFTDMTKSEFVAQYTGVSLPLNIEREPVVS---FDDVNISAVP 124
Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN--- 194
S+DWRD GAV +KNQ CG CW+FAA+A VEGI KI++G L+ LSEQ++LDC+ +
Sbjct: 125 QSIDWRDYGAVNEVKNQNPCGSCWSFAAIATVEGIYKIKTGYLVSLSEQEVLDCAVSYGC 184
Query: 195 -------------GNNG---------------CLGGS----------------REKAFAY 210
NNG C S E++ Y
Sbjct: 185 KGGWVNKAYDFIISNNGVTTEENYPYLAYQGTCNANSFPNSAYITGYSYVRRNDERSMMY 244
Query: 211 IIQNQ-----------------GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGN 253
+ NQ G+F+G CGT L+HA+TI+G+G G YW+++NSWG+
Sbjct: 245 AVSNQPIAALIDASENFQYYNGGVFSGPCGTSLNHAITIIGYGQDSSGTKYWIVRNSWGS 304
Query: 254 TWGDAGYMKIVR----DEGLCGIGTRSSYP 279
+WG+ GY+++ R G+CGI +P
Sbjct: 305 SWGEGGYVRMARGVSSSSGVCGIAMAPLFP 334
>gi|242072390|ref|XP_002446131.1| hypothetical protein SORBIDRAFT_06g002140 [Sorghum bicolor]
gi|241937314|gb|EES10459.1| hypothetical protein SORBIDRAFT_06g002140 [Sorghum bicolor]
Length = 328
Score = 219 bits (559), Expect = 8e-55, Method: Compositional matrix adjust.
Identities = 121/337 (35%), Positives = 177/337 (52%), Gaps = 85/337 (25%)
Query: 15 TTPMFIIITL-LVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKE 73
T+ F++ L S S V+++R + ++VE HE WM ++GR YKD EK R ++FK+
Sbjct: 3 TSKAFLLAILGCASLCSSVLAARELSDAAMVERHENWMVEYGRVYKDAAEKARRFQVFKD 62
Query: 74 NLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
N+ ++E N N + LG NQF+DLT +EF+A G+K P+ ++ FKY+NLS+
Sbjct: 63 NVAFVESFNTNKNNKFWLGVNQFADLTTEEFKA-NKGFK---PTAEKVPTTGFKYENLSV 118
Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST 193
+ +PT++DWR KGAVTPIKNQ +C AA+EGI K+ +GNLI LSEQ+L+DC T
Sbjct: 119 SALPTAVDWRTKGAVTPIKNQGQC---------AAMEGIVKLSTGNLISLSEQELVDCDT 169
Query: 194 NG------------------NNG----------------CLGGSR--------------- 204
+ NG C GGS+
Sbjct: 170 HSMDEGCEGGWMDSAFEFVIKNGGLATESNYPYKAVDGKCKGGSKSAATIKGHEDVPVNN 229
Query: 205 EKAFAYIIQNQ------------------GIFNGVCGTQLDHAVTIVGFGTTEDGANYWL 246
E A + NQ G+ G CGT+LDH + +G+G DG YW+
Sbjct: 230 EAALMKAVANQPVSVAVDASDRTFMLYSGGVMTGSCGTELDHGIAAIGYGMESDGTKYWI 289
Query: 247 IKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 279
+KNSWG TWG+ G++++ +D G+CG+ + SYP
Sbjct: 290 LKNSWGTTWGEKGFLRMEKDITDKRGMCGLAMKPSYP 326
>gi|255563136|ref|XP_002522572.1| cysteine protease, putative [Ricinus communis]
gi|223538263|gb|EEF39872.1| cysteine protease, putative [Ricinus communis]
Length = 340
Score = 219 bits (557), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 109/204 (53%), Positives = 139/204 (68%), Gaps = 7/204 (3%)
Query: 16 TPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENL 75
T + I++ +LV+ SQ + E +V E HE+WMA+HGR+Y+D+ EKE R IFK+NL
Sbjct: 7 TKLAIVLMILVTWVSQAMPRPLIDEDAVAEKHEQWMARHGRTYQDDEEKERRFHIFKKNL 66
Query: 76 EYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPS--PSHRSTTSSTFKYQNLSM 133
++IE N NRTYKLG N F+DLT++EF A YTGYKMP P+ TT +T L
Sbjct: 67 KHIENFNNAFNRTYKLGLNHFADLTDEEFLATYTGYKMPKVLPTANITTKTTQSSDVLYE 126
Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST 193
+VP S+DWR +G VTP+KNQ CGCCWAF+A AAVEGI GN + LS QQLLDC
Sbjct: 127 ANVPESIDWRTRGVVTPVKNQGRCGCCWAFSAAAAVEGII----GNGVSLSAQQLLDCVP 182
Query: 194 NGNNGCLGGSREKAFAYIIQNQGI 217
+ +NGC GG + AF YIIQNQG+
Sbjct: 183 D-SNGCNGGFMDNAFRYIIQNQGL 205
Score = 66.6 bits (161), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 39/69 (56%), Positives = 48/69 (69%), Gaps = 5/69 (7%)
Query: 216 GIFNGV-CGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE----GLC 270
GIF CG+ L HA+TIVG+GT+ +G YWLIKNSWG WG+ GYM++ RD G C
Sbjct: 270 GIFPPQDCGSTLTHAITIVGYGTSAEGTKYWLIKNSWGEGWGEGGYMRLQRDVGSYGGAC 329
Query: 271 GIGTRSSYP 279
GI R+SYP
Sbjct: 330 GIALRASYP 338
>gi|224076968|ref|XP_002305072.1| predicted protein [Populus trichocarpa]
gi|222848036|gb|EEE85583.1| predicted protein [Populus trichocarpa]
Length = 305
Score = 218 bits (554), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 109/189 (57%), Positives = 137/189 (72%), Gaps = 13/189 (6%)
Query: 43 VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTND 102
+++ HE+WMAQHGR Y D EKE R IFKEN+E IE N +R YKLG N+F+DLTN+
Sbjct: 1 MLKRHEEWMAQHGRVYGDMKEKEKRYLIFKENIERIEAFNNGSDRGYKLGVNKFADLTNE 60
Query: 103 EFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWA 162
EFRA+Y GYK S SS+F+Y+NLS D+PTS+DWR+ GAVTP+K+Q CGCCWA
Sbjct: 61 EFRAMYHGYKRQSSK---LMSSSFRYENLS--DIPTSMDWRNDGAVTPVKDQGTCGCCWA 115
Query: 163 FAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI----- 217
F+ VAA+EGI K+++GNLI LSEQQL+DC T GN GC GG + AF YII+N G+
Sbjct: 116 FSTVAAIEGIIKLQTGNLISLSEQQLVDC-TAGNKGCQGGLMDTAFQYIIRNGGLTSEDN 174
Query: 218 --FNGVCGT 224
+ GV GT
Sbjct: 175 YPYQGVDGT 183
Score = 89.4 bits (220), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 44/120 (36%), Positives = 68/120 (56%), Gaps = 14/120 (11%)
Query: 164 AAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIFNGVCG 223
A + E + + L+Q +Q + + +G GG+ + + G+F G CG
Sbjct: 194 AQITGYEDVPQNNENALLQAVAKQPVSVAVDG-----GGNDFRFY-----KSGVFEGDCG 243
Query: 224 TQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR----DEGLCGIGTRSSYP 279
T L+H VT +G+GT DG +YWL+KNSWG +WG++GY ++ R EGLCG+ +SYP
Sbjct: 244 TNLNHGVTAIGYGTDSDGTDYWLVKNSWGTSWGESGYTRMQRGIGASEGLCGVAMDASYP 303
>gi|2463586|dbj|BAA22545.1| FB22 precursor [Ananas comosus]
Length = 340
Score = 216 bits (551), Expect = 6e-54, Method: Compositional matrix adjust.
Identities = 121/330 (36%), Positives = 173/330 (52%), Gaps = 71/330 (21%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
+F+ + L V AS +SR +++ E+WMA++GR YKD EK R +IFK N+ +
Sbjct: 8 VFLFLFLCVMWASPSAASRDEPSDPMMKRFEEWMAEYGRVYKDNDEKMRRFQIFKNNVNH 67
Query: 78 IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
IE N +Y LG N+F+D+TN+EF YTG +P R S + +++++ V
Sbjct: 68 IETFNNRNGNSYTLGINKFTDMTNNEFVTQYTGVSLPLNFKREPVVS---FDDVNISAVG 124
Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-TNG- 195
S+DWRD GAVT +K+Q CG CWAF+A+A VEGI KI +G L+ LSEQ++LDC+ +NG
Sbjct: 125 QSIDWRDYGAVTEVKDQNPCGSCWAFSAIATVEGIYKIVTGYLVSLSEQEVLDCAVSNGC 184
Query: 196 --------------NNG---------------CLGGS----------------REKAFAY 210
NNG C S E + Y
Sbjct: 185 DGGFVDNAYDFIISNNGVASEADYPYQAYEGDCTANSWPNSAYITGYSYVRSNDESSMKY 244
Query: 211 IIQNQ------------------GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWG 252
+ NQ G+F+G CGT L+HA+TI+G+G G YW++KNSWG
Sbjct: 245 AVWNQPIAAAIDASGDNFQYYNGGVFSGPCGTSLNHAITIIGYGQDSSGTQYWIVKNSWG 304
Query: 253 NTWGDAGYMKIVR---DEGLCGIGTRSSYP 279
++WG+ GY+++ R GLCGI YP
Sbjct: 305 SSWGERGYVRMARGVSSSGLCGIAMDPLYP 334
>gi|255564910|ref|XP_002523448.1| cysteine protease, putative [Ricinus communis]
gi|223537276|gb|EEF38907.1| cysteine protease, putative [Ricinus communis]
Length = 341
Score = 216 bits (551), Expect = 7e-54, Method: Compositional matrix adjust.
Identities = 110/201 (54%), Positives = 143/201 (71%), Gaps = 5/201 (2%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
MF+ + ++ SQ S RS H+ ++ E HE WM ++GR YKD EKE R +IF+ N+E+
Sbjct: 10 MFVALLVVGLWVSQAWS-RSLHDAAMNERHEMWMVKYGRVYKDNSEKERRFEIFRNNVEF 68
Query: 78 IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
IE NK GNR YKL N+F+DLTN+EF+A GYK S S SS F+Y N+ T VP
Sbjct: 69 IESFNKPGNRPYKLDINEFADLTNEEFKASRNGYKRSSNVGLSEKSS-FRYGNV--TAVP 125
Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-N 196
TS+DWR KGAVTPIK+Q +CGCCWAF+AVAA+EGITK+ +G LI LSEQ+L+DC T+G +
Sbjct: 126 TSMDWRQKGAVTPIKDQGQCGCCWAFSAVAAMEGITKLSTGKLISLSEQELVDCDTSGED 185
Query: 197 NGCLGGSREKAFAYIIQNQGI 217
GC GG + AF +I QN G+
Sbjct: 186 QGCEGGLMDDAFEFIKQNGGL 206
Score = 95.1 bits (235), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 43/77 (55%), Positives = 56/77 (72%), Gaps = 5/77 (6%)
Query: 209 AYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-- 266
A+ + G+F G CGT+LDH VT VG+GT+ DG YWL+KNSWG +WG+ GY+++ RD
Sbjct: 266 AFQFYSGGVFTGDCGTELDHGVTAVGYGTS-DGTKYWLVKNSWGTSWGEDGYIRMERDIE 324
Query: 267 --EGLCGIGTRSSYPLA 281
EGLCGI +SSYP A
Sbjct: 325 AKEGLCGIAMQSSYPTA 341
>gi|3377948|emb|CAA08860.1| cysteine proteinase precursor, AN8 [Ananas comosus]
Length = 356
Score = 216 bits (550), Expect = 8e-54, Method: Compositional matrix adjust.
Identities = 124/331 (37%), Positives = 168/331 (50%), Gaps = 72/331 (21%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
+F+ + L V AS +S +++ E+WM ++GR YKD EK R +IFK N+ +
Sbjct: 8 VFLFLFLCVMWASPSAASADEPSDPMMKRFEEWMVEYGRVYKDNDEKMRRFQIFKNNVNH 67
Query: 78 IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTG-YKMPSPSHRSTTSSTFKYQNLSMTDV 136
IE N +Y LG NQF+D+TN+EF A YTG P R S + ++ ++ V
Sbjct: 68 IETFNSRNKDSYTLGINQFTDMTNNEFVAQYTGGISRPLNIEREPVVS---FDDVDISAV 124
Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLD------ 190
P S+DWRD GAVT +KNQ CG CWAFAA+A VE I KI+ G L LSEQQ+LD
Sbjct: 125 PQSIDWRDYGAVTSVKNQNPCGACWAFAAIATVESIYKIKKGILEPLSEQQVLDCAKGYG 184
Query: 191 ---------------------------------CSTNG--NNGCLGG------SREKAFA 209
C TNG N+ + G + E +
Sbjct: 185 CKGGWEFRAFEFIISNKGVASVAIYPYKAAKGTCKTNGVPNSAYITGYARVPRNNESSMM 244
Query: 210 YIIQ-----------------NQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWG 252
Y + N G+FNG CGT L+HAVT +G+G +G YW++KNSWG
Sbjct: 245 YAVSKQPITVAVDANANSQYYNSGVFNGPCGTSLNHAVTAIGYGQDSNGKKYWIVKNSWG 304
Query: 253 NTWGDAGYMKIVRD----EGLCGIGTRSSYP 279
WG+AGY+++ RD G+CGI S YP
Sbjct: 305 ARWGEAGYIRMARDVSSSSGICGIAIDSLYP 335
>gi|2463584|dbj|BAA22544.1| FBSB precursor [Ananas comosus]
Length = 356
Score = 216 bits (550), Expect = 9e-54, Method: Compositional matrix adjust.
Identities = 124/331 (37%), Positives = 168/331 (50%), Gaps = 72/331 (21%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
+F+ + L V AS +S +++ E+WM ++GR YKD EK R +IFK N+ +
Sbjct: 8 VFLFLFLCVMWASPSAASADEPSDPMMKRFEEWMVEYGRVYKDNDEKMRRFQIFKNNVNH 67
Query: 78 IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTG-YKMPSPSHRSTTSSTFKYQNLSMTDV 136
IE N +Y LG NQF+D+TN+EF A YTG P R S + ++ ++ V
Sbjct: 68 IETFNSRNENSYTLGINQFTDMTNNEFIAQYTGGISRPLNIEREPVVS---FDDVDISAV 124
Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLD------ 190
P S+DWRD GAVT +KNQ CG CWAFAA+A VE I KI+ G L LSEQQ+LD
Sbjct: 125 PQSIDWRDYGAVTSVKNQNPCGACWAFAAIATVESIYKIKKGILEPLSEQQVLDCAKGYG 184
Query: 191 ---------------------------------CSTNG--NNGCLGG------SREKAFA 209
C TNG N+ + G + E +
Sbjct: 185 CKGGWEFRAFEFIISNKGVASGAIYPYKAAKGTCKTNGVPNSAYITGYARVPRNNESSMM 244
Query: 210 YIIQNQ-----------------GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWG 252
Y + Q G+FNG CGT L+HAVT +G+G +G YW++KNSWG
Sbjct: 245 YAVSKQPITVAVDANANFQYYKSGVFNGPCGTSLNHAVTAIGYGQDSNGKKYWIVKNSWG 304
Query: 253 NTWGDAGYMKIVRD----EGLCGIGTRSSYP 279
WG+AGY+++ RD G+CGI S YP
Sbjct: 305 ARWGEAGYIRMARDVSSSSGICGIAIDSLYP 335
>gi|255564908|ref|XP_002523447.1| cysteine protease, putative [Ricinus communis]
gi|223537275|gb|EEF38906.1| cysteine protease, putative [Ricinus communis]
Length = 342
Score = 215 bits (548), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 109/201 (54%), Positives = 143/201 (71%), Gaps = 5/201 (2%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
MF+ + ++ ASQ S RS H+ ++ E HE WMA++GR YKD EKE R +IF+ N+E+
Sbjct: 10 MFVALLVVGLWASQAWS-RSLHDAAMNERHEMWMAKYGRVYKDNSEKERRFEIFRNNVEF 68
Query: 78 IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
IE NK GNR YKL N+F+DLTN+EF+ GYK S T S+F+Y N+ T VP
Sbjct: 69 IESFNKLGNRPYKLDINEFADLTNEEFKVSKNGYKRSSGVGL-TEKSSFRYANV--TAVP 125
Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-N 196
TS+DWR GAVTPIK+Q +CGCCWAF+AVAA+EGITK+ +G LI LSEQ+L+DC T+G +
Sbjct: 126 TSMDWRQNGAVTPIKDQGQCGCCWAFSAVAAMEGITKLSTGKLISLSEQELVDCDTSGED 185
Query: 197 NGCLGGSREKAFAYIIQNQGI 217
GC GG + AF +I QN G+
Sbjct: 186 QGCEGGLMDDAFEFIKQNGGL 206
Score = 99.0 bits (245), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 42/77 (54%), Positives = 56/77 (72%), Gaps = 4/77 (5%)
Query: 209 AYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-- 266
A+ + G+F G CGT+LDH VT VG+GT++DG YWL+KNSWG +WG+ GY+++ RD
Sbjct: 266 AFQFYSGGVFTGDCGTELDHGVTAVGYGTSDDGTKYWLVKNSWGTSWGEDGYIRMERDIE 325
Query: 267 --EGLCGIGTRSSYPLA 281
EGLCGI + SYP A
Sbjct: 326 AKEGLCGIAMQPSYPTA 342
>gi|535454|gb|AAA50755.1| cysteine proteinase [Alnus glutinosa]
Length = 340
Score = 215 bits (548), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 114/231 (49%), Positives = 155/231 (67%), Gaps = 20/231 (8%)
Query: 19 FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
+++ L + ASQ+ ++RS + S+ E HE+WMA +GR YKD EK+ R KIF+EN+ I
Sbjct: 10 LVVMVTLGALASQLAAARSLQDASMRERHEEWMASYGRVYKDINEKQKRYKIFEENVALI 69
Query: 79 EKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHR-STTSSTFKYQNLSMTDVP 137
E +NK+ N+ YKL NQF+DLTN+EF+A +K H ST S++FKY N+S VP
Sbjct: 70 ESSNKDANKPYKLSVNQFADLTNEEFKASRNRFK----GHICSTKSTSFKYGNVSA--VP 123
Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-N 196
+++DWR KGAVTP+K+Q +CGCCWAF+AVAA EGITK+ +G LI LSEQ+L+DC T+G +
Sbjct: 124 SAMDWRMKGAVTPVKDQGQCGCCWAFSAVAATEGITKLTTGELISLSEQELVDCDTSGVD 183
Query: 197 NGCLGGSREKAFAYIIQNQGI-------FNGVCGT-----QLDHAVTIVGF 235
GC GG + AF +I N G+ + GV GT Q HA I GF
Sbjct: 184 QGCEGGLMDNAFTFIQHNHGLASEANYPYKGVDGTCNTNKQAIHAAEINGF 234
Score = 100 bits (250), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 42/72 (58%), Positives = 55/72 (76%), Gaps = 4/72 (5%)
Query: 214 NQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGL 269
++G+F G CGTQLDH VT VG+GT++DG YWL+KNSWG WG+ GY+++ RD EGL
Sbjct: 269 SKGVFIGACGTQLDHGVTAVGYGTSDDGTKYWLVKNSWGTQWGEEGYIRMQRDVDAKEGL 328
Query: 270 CGIGTRSSYPLA 281
CGI ++SYP A
Sbjct: 329 CGIAMKASYPTA 340
>gi|356517350|ref|XP_003527350.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
gi|356577765|ref|XP_003556993.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 343
Score = 215 bits (548), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 128/325 (39%), Positives = 175/325 (53%), Gaps = 82/325 (25%)
Query: 33 VSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLG 92
V+SR+ + S+ E HE+WMA++G+ YKD EKE R ++FKEN+ YIE N N+ YKLG
Sbjct: 25 VASRTLQDASMYERHEQWMARYGKVYKDPEEKEKRFRVFKENVNYIEAFNNAANKPYKLG 84
Query: 93 TNQFSDLTNDEF---RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVT 149
NQF+DLT++EF R + G+ +T ++TFKY+N+++ +P S+DWR KGAVT
Sbjct: 85 INQFADLTSEEFIVPRNRFNGHT----RSSNTRTTTFKYENVTV--LPDSIDWRQKGAVT 138
Query: 150 PIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAF 208
PIKNQ CGCCWAF+A+AA EGI KI +G L+ LSEQ+++DC T G ++GC GG + AF
Sbjct: 139 PIKNQGSCGCCWAFSAIAATEGIHKISTGKLVSLSEQEVVDCDTKGTDHGCEGGYMDGAF 198
Query: 209 AYIIQNQGI-------FNGVCGT-----QLDHAVTIVGF--------------------- 235
+IIQN GI + GV G + HA TI G+
Sbjct: 199 KFIIQNHGINTEASYPYKGVDGKCNIKEEAVHAATITGYEDVPINNEKALQKAVANQPVS 258
Query: 236 -GTTEDGANYWLIK----------------------------------NSWGNTWGDAGY 260
GA++ K NSWG WG+ GY
Sbjct: 259 VAIDASGADFQFYKSGIFTGSCGTELDHGVTAVGYGENNEGTKYWLVKNSWGTEWGEEGY 318
Query: 261 MKIVRD----EGLCGIGTRSSYPLA 281
+ + R EG+CGI +SYP A
Sbjct: 319 IMMQRGVKAVEGICGIAMMASYPTA 343
>gi|409190991|gb|AFV30165.1| cysteine proteinase [Lotus japonicus]
Length = 342
Score = 215 bits (547), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 116/232 (50%), Positives = 154/232 (66%), Gaps = 22/232 (9%)
Query: 19 FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
F ++ L A QV SSR+ + S+ E HE+WMA++GR YKD EKE R IFKEN+ YI
Sbjct: 12 FALVLCLGLWAFQV-SSRTLQDASMQERHEQWMARYGRVYKDLQEKEKRFSIFKENVNYI 70
Query: 79 EKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYK--MPSPSHRSTTSSTFKYQNLSMTDV 136
E +N G++ YKLG NQF+DLTN+EF A +K M S R+TT FKY+N++
Sbjct: 71 EASNNAGDKPYKLGVNQFADLTNEEFIATRNKFKGHMSSSITRTTT---FKYENVT---A 124
Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG- 195
P+++DWR +GAVTP+KNQ CGCCWAF+AVAA EGI K+ +GNL+ LSEQ+L+DC T+G
Sbjct: 125 PSTVDWRQEGAVTPVKNQGTCGCCWAFSAVAATEGIHKLSTGNLVSLSEQELVDCDTSGA 184
Query: 196 NNGCLGGSREKAFAYIIQNQGI-------FNGVCGT-----QLDHAVTIVGF 235
+ GC GG + AF +IIQN G+ + GV GT + H TI G+
Sbjct: 185 DQGCQGGLMDDAFKFIIQNGGLNTEAQYPYQGVDGTCNTNEEATHVATITGY 236
Score = 96.3 bits (238), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 39/70 (55%), Positives = 51/70 (72%), Gaps = 4/70 (5%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCG 271
G+F G CGTQLDH V +VG+G ++DG YWL+KNSWG WG+ GY+++ RD EGLCG
Sbjct: 273 GVFTGSCGTQLDHGVAVVGYGVSDDGTKYWLVKNSWGADWGEEGYIRMQRDVDAPEGLCG 332
Query: 272 IGTRSSYPLA 281
+ + SYP A
Sbjct: 333 LAMQPSYPTA 342
>gi|400180419|gb|AFP73348.1| cysteine protease [Solanum lycopersicoides]
Length = 343
Score = 215 bits (547), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 105/209 (50%), Positives = 146/209 (69%), Gaps = 2/209 (0%)
Query: 10 SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+ KI+ + I + ++S + ++RS + SV E HE WM++HGR YKDE+EK R
Sbjct: 2 AMKIDLMSILITLFFVISMFNSQTTARSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSST-FKY 128
IFKEN+++IE NK GN +YKLG N+F+D+T++EF +TG +PS S SST FK
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGINEFADITSEEFLTKFTGINIPSYLSPSPMSSTEFKI 121
Query: 129 QNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQL 188
+LS D+P++LDWR+ GAVT +KNQ +CGCCWAF+AV ++EG KI +GNL++ SEQ+L
Sbjct: 122 NDLSDDDMPSNLDWRESGAVTQVKNQGQCGCCWAFSAVGSLEGAYKIATGNLMEFSEQEL 181
Query: 189 LDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
LDC+TN N GC GG AF +I +N GI
Sbjct: 182 LDCTTN-NYGCNGGFMTNAFDFIKENGGI 209
Score = 84.0 bits (206), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 35/68 (51%), Positives = 47/68 (69%), Gaps = 4/68 (5%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE----GLCG 271
G ++G C +++HAVT +G+GT E G YWL+KNSWG +WG+ G+MKI+RD G C
Sbjct: 273 GTYDGSCADRINHAVTAIGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPGGHCD 332
Query: 272 IGTRSSYP 279
I SSYP
Sbjct: 333 IAKMSSYP 340
>gi|255568297|ref|XP_002525123.1| cysteine protease, putative [Ricinus communis]
gi|223535582|gb|EEF37250.1| cysteine protease, putative [Ricinus communis]
Length = 349
Score = 215 bits (547), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 131/343 (38%), Positives = 177/343 (51%), Gaps = 81/343 (23%)
Query: 12 KINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIF 71
KI +F ++ + CA Q +SR HE + HEKWMA+HG+ YKD+ EK R +IF
Sbjct: 8 KILPIALFFVLAM---CADQA-ASRELHELEMTGRHEKWMAKHGKVYKDDKEKLRRFQIF 63
Query: 72 KENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNL 131
K N+ +IE N GN++Y LG N+F+DLTN+EFRA + GYK P + R T FKY+N+
Sbjct: 64 KSNVVFIESFNTAGNKSYMLGINKFADLTNEEFRAFWNGYKRPLGASRKIT--PFKYENV 121
Query: 132 SMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDC 191
T +P+S+DWR KGAVTPIK+Q CG CWAF+AVAA EGI K+R+G L+ LSEQ+L+DC
Sbjct: 122 --TALPSSIDWRSKGAVTPIKDQGVCGSCWAFSAVAATEGIHKLRTGKLVSLSEQELVDC 179
Query: 192 STNGNN-GCLGGSREKAFAYIIQNQGIF----------NGVCGTQLD--HAVTIVGFGTT 238
G + GC GG AF +I ++ G+ +G C T+ + AV I G+
Sbjct: 180 DVKGQDKGCQGGLMVDAFKFIKRHGGMTSEANYPYQGRDGKCDTKKEASRAVKITGYQAV 239
Query: 239 EDGANYWLIK----------------------------------------------NSWG 252
+ L+K NS
Sbjct: 240 PKNSEAALLKAVANQPVSVAIDAGSLSFQFYRSGIFTGICGKDINHGVAAVGYGRSNSGS 299
Query: 253 NTW----------GDAGYMKIVRD----EGLCGIGTRSSYPLA 281
W G+ GY+++ RD EGLCGI SYP A
Sbjct: 300 KYWIVKNSWGTEWGEKGYIRMKRDVRSKEGLCGIAMECSYPTA 342
>gi|334904467|gb|AEH26024.1| cysteine peptidase [Ananas comosus]
Length = 352
Score = 215 bits (547), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 119/331 (35%), Positives = 170/331 (51%), Gaps = 72/331 (21%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
+F+ + L V AS +SR +++ E+WMA++GR YKD EK R +IFK N+ +
Sbjct: 8 VFLFLFLCVMWASPSAASRDEPSDPMMKRFEEWMAEYGRVYKDNDEKMRRFQIFKNNVNH 67
Query: 78 IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTG-YKMPSPSHRSTTSSTFKYQNLSMTDV 136
IE N +Y LG NQF+D+T EF A YTG P R S + +++++ V
Sbjct: 68 IETFNSHNGNSYTLGINQFTDMTKSEFVAQYTGGISRPLNIEREPVVS---FDDVNISAV 124
Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-- 194
P S+DWRD GAV +KNQ CG CWAFAA+A VEGI KI++G L+ LSEQ++LDC+ +
Sbjct: 125 PQSIDWRDYGAVNEVKNQNPCGSCWAFAAIATVEGIYKIKTGYLVSLSEQEVLDCAVSYG 184
Query: 195 --------------GNNG---------------CLGGS----------------REKAFA 209
NNG C S E++
Sbjct: 185 CKGGWVNKAYDFIISNNGVTTEENYPYQAYQGTCNANSFPNSAYITGYSYVRRNDERSMM 244
Query: 210 YIIQNQ-----------------GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWG 252
Y + NQ G+F+G CGT L+HA+TI+G+G G YW+++NSWG
Sbjct: 245 YAVSNQPIAALIDASENFQYYNGGVFSGPCGTSLNHAITIIGYGQDSSGTKYWIVRNSWG 304
Query: 253 NTWGDAGYMKIVR----DEGLCGIGTRSSYP 279
++WG+ GY+++ R G CGI +P
Sbjct: 305 SSWGEGGYVRMARGVSSSSGACGIAMSPLFP 335
>gi|144905116|dbj|BAF56430.1| cysteine proteinase [Lotus japonicus]
Length = 341
Score = 214 bits (545), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 106/201 (52%), Positives = 145/201 (72%), Gaps = 9/201 (4%)
Query: 20 IIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIE 79
+ + L+ S ++R+ + S+ E HE+WMAQ+G+ YKD EKE+R KIFKEN++ IE
Sbjct: 12 LTLLLVFGFLSFEANARTLEDASMHERHEQWMAQYGKVYKDSYEKELRSKIFKENVQRIE 71
Query: 80 KANKEGNRTYKLGTNQFSDLTNDEFRAL--YTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
N GN++YKLG NQF+DLTN+EF+A + G+ + ST + TFKY+++ T VP
Sbjct: 72 AFNNAGNKSYKLGINQFADLTNEEFKARNRFKGHMCSN----STRTPTFKYEHV--TSVP 125
Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-N 196
SLDWR KGAVTPIK+Q +CGCCWAF+AVAA EGITK+ +G LI LSEQ+L+DC T G +
Sbjct: 126 ASLDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGITKLSTGKLISLSEQELVDCDTKGVD 185
Query: 197 NGCLGGSREKAFAYIIQNQGI 217
GC GG + AF +I+QN+G+
Sbjct: 186 QGCEGGLMDDAFKFIMQNKGL 206
Score = 88.6 bits (218), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 38/70 (54%), Positives = 50/70 (71%), Gaps = 5/70 (7%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCG 271
G+F G CGT+LDH VT VG+G+ + G YWL+KNSWG WG+ GY+++ RD EGLCG
Sbjct: 273 GVFTGSCGTELDHGVTAVGYGS-DGGTKYWLVKNSWGEQWGEQGYIRMQRDVAAEEGLCG 331
Query: 272 IGTRSSYPLA 281
++SYP A
Sbjct: 332 FAMQASYPTA 341
>gi|144905104|dbj|BAF56427.1| cysteine proteinase [Lotus japonicus]
Length = 342
Score = 214 bits (544), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 115/232 (49%), Positives = 155/232 (66%), Gaps = 22/232 (9%)
Query: 19 FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
F ++ L A QV SSR+ + S+ E HE+WMA++G+ YKD EKE R IF+EN++YI
Sbjct: 12 FALVLCLGLWAFQV-SSRTLQDASMHERHEQWMARYGKVYKDLQEKEKRFNIFQENVKYI 70
Query: 79 EKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYK--MPSPSHRSTTSSTFKYQNLSMTDV 136
E +N GN+ YKLG NQF+DLTN EF A +K M S R+TT FKY+N++
Sbjct: 71 EASNNAGNKPYKLGVNQFTDLTNKEFIATRNKFKGHMSSSITRTTT---FKYENVT---A 124
Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG- 195
P+++DWR +GAVTP+KNQ CGCCWAF+AVAA EGI K+ +GNL+ LSEQ+L+DC T+G
Sbjct: 125 PSTVDWRQEGAVTPVKNQGTCGCCWAFSAVAATEGIHKLSTGNLVSLSEQELVDCDTSGA 184
Query: 196 NNGCLGGSREKAFAYIIQNQGI-------FNGVCGT-----QLDHAVTIVGF 235
+ GC GG + AF +IIQN G+ + GV GT ++ H TI G+
Sbjct: 185 DQGCQGGLMDDAFKFIIQNGGLNTEAQYPYQGVDGTCNTNEEVTHVATITGY 236
Score = 97.8 bits (242), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 40/70 (57%), Positives = 51/70 (72%), Gaps = 4/70 (5%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCG 271
G+F G CGTQLDH V +VG+G ++DG YWL+KNSWG WG+ GY+++ RD EGLCG
Sbjct: 273 GVFTGSCGTQLDHGVAVVGYGVSDDGTKYWLVKNSWGEDWGEEGYIRMQRDVEAPEGLCG 332
Query: 272 IGTRSSYPLA 281
I + SYP A
Sbjct: 333 IAMQPSYPTA 342
>gi|225446589|ref|XP_002280263.1| PREDICTED: vignain [Vitis vinifera]
Length = 339
Score = 214 bits (544), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 106/192 (55%), Positives = 142/192 (73%), Gaps = 8/192 (4%)
Query: 21 IITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEK 80
++ +L + ASQ +SRS HE S+ E HE WMA++GR YKD EKE R KIFK+N+ IE
Sbjct: 14 LLFILAAWASQA-TSRSLHEASMYERHEDWMARYGRMYKDANEKEKRFKIFKDNVARIES 72
Query: 81 ANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSL 140
NK ++TYKL N+F+DLTN+EFR+L +K +H + ++TFKY+N+ T VP+++
Sbjct: 73 FNKAMDKTYKLSINEFADLTNEEFRSLRNRFK----AHICSEATTFKYENV--TAVPSTI 126
Query: 141 DWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGC 199
DWR KGAVTPIK+Q++CGCCWAF+AVAA EGIT+I +G LI LSEQ+L+DC T G N GC
Sbjct: 127 DWRKKGAVTPIKDQQQCGCCWAFSAVAATEGITQITTGKLISLSEQELVDCDTGGENQGC 186
Query: 200 LGGSREKAFAYI 211
GG + AF +I
Sbjct: 187 SGGLMDDAFRFI 198
Score = 94.4 bits (233), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 40/78 (51%), Positives = 52/78 (66%), Gaps = 4/78 (5%)
Query: 208 FAYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD- 266
F + G+F G CGT+LDH V VG+G +DG YWL+KNSWG WG+ GY+++ RD
Sbjct: 262 FEFQFYTSGVFTGQCGTELDHGVAAVGYGIGDDGMMYWLVKNSWGTGWGEEGYIRMQRDV 321
Query: 267 ---EGLCGIGTRSSYPLA 281
EGLCGI ++SYP A
Sbjct: 322 TAKEGLCGIAMQASYPTA 339
>gi|400180377|gb|AFP73327.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 213 bits (543), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 106/212 (50%), Positives = 145/212 (68%), Gaps = 7/212 (3%)
Query: 10 SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+ K++ + I + ++S + RS E SV E HE WM++HGR YKDE+EK R
Sbjct: 2 AMKVDLMNILITLFFVISMFNTQTRGRSQPELSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N+F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
FK +LS D+P++LDWR+ GAVT +K+Q CGCCWAF+AV ++EG KI +GNL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
Q+LLDC+TN N GC GG AF +II+N GI
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIIENGGI 210
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 36/68 (52%), Positives = 48/68 (70%), Gaps = 4/68 (5%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCG 271
G ++G C +++HAVT +G+GT E G YWL+KNSWG +WG+ G+MKI+RD GLC
Sbjct: 274 GTYDGSCADRINHAVTAIGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCD 333
Query: 272 IGTRSSYP 279
I SSYP
Sbjct: 334 IAKMSSYP 341
>gi|312281697|dbj|BAJ33714.1| unnamed protein product [Thellungiella halophila]
Length = 347
Score = 213 bits (543), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 106/205 (51%), Positives = 141/205 (68%), Gaps = 6/205 (2%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVV--EIHEKWMAQHGRSYKDELEKEMRLKIFKENL 75
+F+I++L+ S + SR + ++ + H++WMA+HGR Y D EK R +FK N+
Sbjct: 8 IFLIVSLISSFCLSITLSRPLDDNELIMQKRHDEWMAKHGRVYADMKEKNNRYVVFKRNV 67
Query: 76 EYIEKANK-EGNRTYKLGTNQFSDLTNDEFRALYTGYKMPS--PSHRSTTSSTFKYQNLS 132
E IE+ N RT+KL NQF+DLTNDEFR++YTGYK S S T +S+F+YQN+S
Sbjct: 68 ERIERLNNVPAGRTFKLAVNQFADLTNDEFRSMYTGYKGGSVLSSQSGTKTSSFRYQNVS 127
Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
+P S+DWR KGAVTPIKNQ CGCCWAF+AVAA+EG TKI+ G LI LSEQQL+DC
Sbjct: 128 SGALPVSVDWRKKGAVTPIKNQGTCGCCWAFSAVAAIEGATKIKKGKLISLSEQQLVDCD 187
Query: 193 TNGNNGCLGGSREKAFAYIIQNQGI 217
TN + GC GG + AF +I+ G+
Sbjct: 188 TN-DFGCSGGLMDTAFEHIMATGGL 211
Score = 91.3 bits (225), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 38/77 (49%), Positives = 53/77 (68%), Gaps = 4/77 (5%)
Query: 207 AFAYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD 266
F + G+F G C T LDHAVT VG+G + +G+ YW+IKNSWG WG++GYM+I +D
Sbjct: 269 GFDFQFYGSGVFTGECTTYLDHAVTAVGYGQSSNGSKYWIIKNSWGTKWGESGYMRIKKD 328
Query: 267 ----EGLCGIGTRSSYP 279
+GLCG+ ++SYP
Sbjct: 329 VKDKKGLCGLAMKASYP 345
>gi|50355613|dbj|BAD29955.1| cysteine protease [Daucus carota]
Length = 365
Score = 213 bits (543), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 112/239 (46%), Positives = 160/239 (66%), Gaps = 22/239 (9%)
Query: 10 SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+ K TP+ ++ T+ V + + ++RS +E S+ E H++WMA++GR YK EK R
Sbjct: 4 TIKHQCTPLALLFTIGV--LASLAAARSLNEASMTETHDQWMARYGRVYKTANEKNRRST 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHR-STTSSTFKY 128
IF+ENL+YI+ NK N+ YKLG N+F+DLTN+EF +K SH +T ++ F+Y
Sbjct: 62 IFQENLKYIQTFNKANNKPYKLGVNEFADLTNEEFTTSRNKFK----SHVCATVTNVFRY 117
Query: 129 QNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQL 188
+N+ T VP ++DWR KGAVTPIKNQ +CGCCWAF+AVAA+EGIT++++G LI LSEQ+L
Sbjct: 118 ENV--TAVPATMDWRKKGAVTPIKNQGQCGCCWAFSAVAAMEGITQLKTGKLISLSEQEL 175
Query: 189 LDCSTNG-NNGCLGGSREKAFAYIIQNQGI-------FNGVCGT-----QLDHAVTIVG 234
+DC TNG + GC GG + AF +I QN G+ ++G GT + +HA TI G
Sbjct: 176 VDCDTNGEDQGCEGGLMDYAFDFIQQNHGLSTETNYPYSGTDGTCNANKEANHAATITG 234
Score = 94.7 bits (234), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 40/70 (57%), Positives = 51/70 (72%), Gaps = 4/70 (5%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCG 271
G+F G CGT+LDH VT VG+GT DG YWL+KNSWG +WG+ GY+++ R EGLCG
Sbjct: 272 GVFTGECGTELDHGVTAVGYGTAADGTKYWLVKNSWGTSWGEEGYIQMQRGVAAAEGLCG 331
Query: 272 IGTRSSYPLA 281
I ++SYP A
Sbjct: 332 IAMQASYPTA 341
>gi|50355621|dbj|BAD29959.1| cysteine protease [Daucus carota]
Length = 361
Score = 213 bits (542), Expect = 7e-53, Method: Compositional matrix adjust.
Identities = 101/201 (50%), Positives = 146/201 (72%), Gaps = 5/201 (2%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
M + LL + A Q +SR+ E S+ E HE+WM Q+GR YKDE EK +R +IF +N+++
Sbjct: 29 MIAALILLGAWACQA-TSRTLPEASMFERHEQWMIQYGRVYKDEAEKSVRFQIFMDNVKF 87
Query: 78 IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
IE+ NK+G ++YKL N+F+D TN+EF+A GYKM + S R + ++ F+Y+N+ T VP
Sbjct: 88 IEEFNKDGRQSYKLAVNEFADQTNEEFQASRNGYKM-AVSSRPSQTTLFRYENV--TAVP 144
Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-N 196
+S+DWR KGAVTP+K+Q +CG CWAF+ +AA EGITK+++G LI LSEQ+L+DC G +
Sbjct: 145 SSMDWRKKGAVTPVKDQGQCGSCWAFSTIAATEGITKLKTGKLISLSEQELVDCDKTGED 204
Query: 197 NGCLGGSREKAFAYIIQNQGI 217
GC GG E F +I++N+GI
Sbjct: 205 QGCEGGYMEDGFEFIVKNKGI 225
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 41/77 (53%), Positives = 51/77 (66%), Gaps = 4/77 (5%)
Query: 209 AYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR--- 265
A+ + G+F G CGT LDH VT VG+G T DG YWL+KNSWG +WGD+GY+ + R
Sbjct: 285 AFQFYSSGVFTGECGTDLDHGVTAVGYGKTSDGTKYWLVKNSWGASWGDSGYIMMQRGVA 344
Query: 266 -DEGLCGIGTRSSYPLA 281
GLCGI +SYP A
Sbjct: 345 AKGGLCGIAMDASYPTA 361
>gi|222636309|gb|EEE66441.1| hypothetical protein OsJ_22818 [Oryza sativa Japonica Group]
Length = 318
Score = 213 bits (541), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 116/317 (36%), Positives = 171/317 (53%), Gaps = 61/317 (19%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
+ ++ L + A ++SR+ ++ H+KWMA+HGR+YKD EK R ++FK N++
Sbjct: 6 LLVVAGGLSTMAKVTMASRAGTMEAR---HDKWMAEHGRTYKDAAEKARRFRVFKANVDL 62
Query: 78 IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD-- 135
I+++N GN+ Y+L TN+F+DLT+ EF A+YTGY + + + ++T LS D
Sbjct: 63 IDRSNAAGNKRYRLATNRFTDLTDAEFAAMYTGYNPANTMYAAANATT----RLSSEDDQ 118
Query: 136 VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQL-------SEQQL 188
P +DWR +GAVT +KNQ+ CGCCWAF+ VAAVEGI +I +G L+ L S +
Sbjct: 119 QPAEVDWRQQGAVTGVKNQRSCGCCWAFSTVAAVEGIHQITTGELVSLTWPTAAASPPRR 178
Query: 189 LDCSTNGNNGC--------------------LGGSREKAFAYIIQNQ------------- 215
C + + E + A + +Q
Sbjct: 179 AYAYQGAQGACQFDASSSASGVAATISGYQRVNPNDEGSLAAAVASQPVSVAIEGSGAMF 238
Query: 216 -----GIFNG-VCGTQLDHAVTIVGFGTTEDGA---NYWLIKNSWGNTWGDAGYMKIVRD 266
G+F CGT+LDHAV +VG+G DG+ YW+IKNSWG TWGD GYMK+ +D
Sbjct: 239 RHYGSGVFTADSCGTKLDHAVAVVGYGAEADGSGGGGYWIIKNSWGTTWGDGGYMKLEKD 298
Query: 267 ---EGLCGIGTRSSYPL 280
+G CG+ SYP+
Sbjct: 299 VGSQGACGVAMAPSYPV 315
>gi|400180428|gb|AFP73352.1| cysteine protease [Solanum corneliomuelleri]
Length = 344
Score = 213 bits (541), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 105/212 (49%), Positives = 146/212 (68%), Gaps = 7/212 (3%)
Query: 10 SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+ K++ + I + ++S + +RS + SV E HE WM++HGR YKDE+EK R
Sbjct: 2 AMKVDLMNILITLFFVISMFNTQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N+F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
FK +LS D+P++LDWR+ GAVT +K+Q CGCCWAF+AV ++EG KI +GNL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
Q+LLDC+TN N GC GG AF +II+N GI
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIIENGGI 210
Score = 86.7 bits (213), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 36/68 (52%), Positives = 48/68 (70%), Gaps = 4/68 (5%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCG 271
G ++G C +++HAVT +G+GT E G YWL+KNSWG +WG+ G+MKI+RD GLC
Sbjct: 274 GTYDGSCADRINHAVTAIGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCD 333
Query: 272 IGTRSSYP 279
I SSYP
Sbjct: 334 IAKMSSYP 341
>gi|357167190|ref|XP_003581045.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like
[Brachypodium distachyon]
Length = 415
Score = 212 bits (540), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 102/201 (50%), Positives = 141/201 (70%), Gaps = 5/201 (2%)
Query: 19 FIIITLLVSCASQVVSSRS-THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
F+I L +CA +++R T + S+V HE+WMA++GR Y D EK RL++FK N+ +
Sbjct: 82 FLIAILACTCAVSALAARDLTDDLSMVARHEQWMAKYGRVYNDVAEKAQRLEVFKANVAF 141
Query: 78 IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
IE N GN + L NQF+D+T DEFRA +TGYK P P+++ T+ FKY N+S+ +P
Sbjct: 142 IELVNA-GNDKFSLEANQFADMTVDEFRAAHTGYK-PVPANKGRTTQ-FKYANVSLDALP 198
Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-N 196
S+DWR KGAVTPIK+Q +CGCCWAF+ VA+VEGI K+ +G LI LSEQ+L+DC +G +
Sbjct: 199 ASMDWRAKGAVTPIKDQGQCGCCWAFSTVASVEGIVKLSTGKLISLSEQELVDCDVDGMD 258
Query: 197 NGCLGGSREKAFAYIIQNQGI 217
GC GG + AF +II N G+
Sbjct: 259 QGCEGGLMDNAFEFIIDNGGL 279
Score = 89.7 bits (221), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 35/70 (50%), Positives = 50/70 (71%), Gaps = 4/70 (5%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCG 271
G+ +G CGT+LDH + VG+G T DG +WL+KNSWG +WG+ G++++ RD EGLCG
Sbjct: 346 GVLSGACGTELDHGIAAVGYGITSDGTKFWLMKNSWGTSWGEKGFIRMERDIADEEGLCG 405
Query: 272 IGTRSSYPLA 281
+ + SYP A
Sbjct: 406 LAMQPSYPTA 415
>gi|400180355|gb|AFP73316.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 212 bits (540), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 105/212 (49%), Positives = 145/212 (68%), Gaps = 7/212 (3%)
Query: 10 SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+ K++ + I + ++S + RS + SV E HE WM++HGR YKDE+EK R
Sbjct: 2 AMKVDLMSILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N+F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
FK +LS D+P++LDWR+ GAVT +K+Q CGCCWAF+AV ++EG KI +GNL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
Q+LLDC+TN N GC GG AF +II+N GI
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIIENGGI 210
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 36/68 (52%), Positives = 48/68 (70%), Gaps = 4/68 (5%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCG 271
G ++G C +++HAVT +G+GT E G YWL+KNSWG +WG+ G+MKI+RD GLC
Sbjct: 274 GTYDGSCADRINHAVTAIGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCD 333
Query: 272 IGTRSSYP 279
I SSYP
Sbjct: 334 IAKMSSYP 341
>gi|400180453|gb|AFP73363.1| cysteine protease [Solanum chilense]
Length = 344
Score = 212 bits (540), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 105/212 (49%), Positives = 146/212 (68%), Gaps = 7/212 (3%)
Query: 10 SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+ K++ + I + ++S + +RS + SV E HE WM++HGR YKDE+EK R
Sbjct: 2 AMKVDLMNILITLFFVISMFNTQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N+F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPVSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
FK +LS D+P++LDWR+ GAVT +K+Q CGCCWAF+AV ++EG KI +GNL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
Q+LLDC+TN N GC GG AF +II+N GI
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIIENGGI 210
Score = 86.7 bits (213), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 36/68 (52%), Positives = 48/68 (70%), Gaps = 4/68 (5%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCG 271
G ++G C +++HAVT +G+GT E G YWL+KNSWG +WG+ G+MKI+RD GLC
Sbjct: 274 GTYDGSCADRINHAVTAIGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCD 333
Query: 272 IGTRSSYP 279
I SSYP
Sbjct: 334 IAKMSSYP 341
>gi|400180389|gb|AFP73333.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 212 bits (539), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 105/212 (49%), Positives = 145/212 (68%), Gaps = 7/212 (3%)
Query: 10 SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+ K++ + I + ++S + RS + SV E HE WM++HGR YKDE+EK R
Sbjct: 2 AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N+F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
FK +LS D+P++LDWR+ GAVT +K+Q CGCCWAF+AV ++EG KI +GNL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
Q+LLDC+TN N GC GG AF +II+N GI
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIIENGGI 210
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 36/68 (52%), Positives = 48/68 (70%), Gaps = 4/68 (5%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCG 271
G ++G C +++HAVT +G+GT E G YWL+KNSWG +WG+ G+MKI+RD GLC
Sbjct: 274 GTYDGSCADRINHAVTAIGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCD 333
Query: 272 IGTRSSYP 279
I SSYP
Sbjct: 334 IAKMSSYP 341
>gi|400180367|gb|AFP73322.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 212 bits (539), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 105/212 (49%), Positives = 145/212 (68%), Gaps = 7/212 (3%)
Query: 10 SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+ K++ + I + ++S + RS + SV E HE WM++HGR YKDE+EK R
Sbjct: 2 AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKVERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N+F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
FK +LS D+P++LDWR+ GAVT +K+Q CGCCWAF+AV ++EG KI +GNL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
Q+LLDC+TN N GC GG AF +II+N GI
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIIENGGI 210
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 36/68 (52%), Positives = 48/68 (70%), Gaps = 4/68 (5%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCG 271
G ++G C +++HAVT +G+GT E G YWL+KNSWG +WG+ G+MKI+RD GLC
Sbjct: 274 GTYDGSCADRINHAVTAIGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDYGNPAGLCD 333
Query: 272 IGTRSSYP 279
I SSYP
Sbjct: 334 IAKMSSYP 341
>gi|224114698|ref|XP_002316833.1| predicted protein [Populus trichocarpa]
gi|222859898|gb|EEE97445.1| predicted protein [Populus trichocarpa]
Length = 305
Score = 211 bits (538), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 100/175 (57%), Positives = 134/175 (76%), Gaps = 4/175 (2%)
Query: 44 VEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDE 103
+E HE WMAQ+GR+YK +EKE RL IFK N+E+IE NK G + YKL N+F+DLTN+E
Sbjct: 1 MERHETWMAQYGRAYKGHVEKERRLNIFKNNVEFIESFNKVGKKPYKLSVNEFADLTNEE 60
Query: 104 FRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAF 163
F+A GYKM S S+++ F+Y+N+S VP+++DWR KGAVTPIK+Q +CGCCWAF
Sbjct: 61 FQASRNGYKM-SAHLSSSSTKPFRYENVSA--VPSTMDWRKKGAVTPIKDQGQCGCCWAF 117
Query: 164 AAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFAYIIQNQGI 217
+AVAA EGIT++ +G LI LSEQ+L+DC T+G + GC GG + AF +IIQN+G+
Sbjct: 118 SAVAATEGITQLSTGKLISLSEQELVDCDTSGEDQGCNGGLMDDAFDFIIQNKGL 172
Score = 97.1 bits (240), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 41/77 (53%), Positives = 54/77 (70%), Gaps = 4/77 (5%)
Query: 209 AYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-- 266
A+ + G+F G CGT LDH VT VG+G ++DG YWL+KNSWG +WG+ GY+++ RD
Sbjct: 229 AFQFYSSGVFTGDCGTDLDHGVTAVGYGMSDDGTKYWLVKNSWGTSWGENGYIRMERDID 288
Query: 267 --EGLCGIGTRSSYPLA 281
EGLCGI +SYP A
Sbjct: 289 AQEGLCGIAMEASYPTA 305
>gi|356542633|ref|XP_003539771.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 341
Score = 211 bits (538), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 112/234 (47%), Positives = 154/234 (65%), Gaps = 21/234 (8%)
Query: 15 TTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKEN 74
T +F+I CA + ++R+ + + E HE+WMA HG+ YK EKE + +IF EN
Sbjct: 10 TLALFLIFAF---CAFEA-NARTLEDAPMRERHEQWMATHGKVYKHSYEKEQKYQIFMEN 65
Query: 75 LEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT 134
++ IE N G + YKLG N F+DLTN+EF+A+ +K S R+ T+ TF+Y+N+ T
Sbjct: 66 VQRIEAFNNAGXKPYKLGINHFADLTNEEFKAI-NRFKGHVCSKRTRTT-TFRYENV--T 121
Query: 135 DVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN 194
VP SLDWR KGAVTPIK+Q +CGCCWAF+AVAA EGITK+R+G LI LSEQ+L+DC T
Sbjct: 122 AVPASLDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGITKLRTGKLISLSEQELVDCDTK 181
Query: 195 G-NNGCLGGSREKAFAYIIQNQGI----------FNGVCGTQLD--HAVTIVGF 235
G + GC GG + AF +I+QN+G+ F+G C + D HA +I G+
Sbjct: 182 GVDQGCEGGLMDDAFKFILQNKGLATEAIYPYEGFDGTCNAKADGNHAGSIKGY 235
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 41/78 (52%), Positives = 52/78 (66%), Gaps = 4/78 (5%)
Query: 208 FAYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD- 266
F + + G+F G CGT LDH VT VG+G +DG YWL+KNSWG WG+ GY+++ RD
Sbjct: 264 FKFQFYSGGVFTGSCGTNLDHGVTSVGYGVGDDGTKYWLVKNSWGVKWGEKGYIRMQRDV 323
Query: 267 ---EGLCGIGTRSSYPLA 281
EGLCGI +SYP A
Sbjct: 324 AAKEGLCGIAMLASYPSA 341
>gi|2342494|dbj|BAA21848.1| bromelain [Ananas comosus]
gi|2463582|dbj|BAA22543.1| FB31 precursor [Ananas comosus]
Length = 352
Score = 211 bits (538), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 121/331 (36%), Positives = 173/331 (52%), Gaps = 72/331 (21%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
+F+ + L V AS +SR +++ E+WMA++GR YKD EK R +IFK N+ +
Sbjct: 8 VFLFLFLCVMWASPSAASRDEPSDPMMKRFEEWMAEYGRVYKDNDEKMRRFQIFKNNVNH 67
Query: 78 IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTG-YKMPSPSHRSTTSSTFKYQNLSMTDV 136
IE N +Y LG N+F+D+TN+EF A YTG P + S + +++++ V
Sbjct: 68 IETFNNRNGNSYTLGINKFTDMTNNEFVAQYTGGISRPLNIEKEPVVS---FDDVNISAV 124
Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-TNG 195
S+DWRD GAVT +K+Q CG CWAF+A+A VEGI KI +G L+ LSEQ++LDC+ +NG
Sbjct: 125 GQSIDWRDYGAVTEVKDQNPCGSCWAFSAIATVEGIYKIVTGYLVSLSEQEVLDCAVSNG 184
Query: 196 ---------------NNG---------------CLGGS----------------REKAFA 209
NNG C S E +
Sbjct: 185 CDGGFVDNAYDFIISNNGVASEADYPYQAYQGDCAANSWPNSAYITGYSYVRSNDESSMK 244
Query: 210 YIIQNQ------------------GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSW 251
Y + NQ G+F+G CGT L+HA+TI+G+G G YW++KNSW
Sbjct: 245 YAVWNQPIAAAIDASGDNFQYYNGGVFSGPCGTSLNHAITIIGYGQDSSGTQYWIVKNSW 304
Query: 252 GNTWGDAGYMKIVR---DEGLCGIGTRSSYP 279
G++WG+ GY+++ R GLCGI YP
Sbjct: 305 GSSWGERGYIRMARGVSSSGLCGIAMDPLYP 335
>gi|357160300|ref|XP_003578721.1| PREDICTED: oryzain beta chain-like [Brachypodium distachyon]
Length = 349
Score = 211 bits (538), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 101/210 (48%), Positives = 145/210 (69%), Gaps = 9/210 (4%)
Query: 17 PMFIIITLLVSC---ASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKE 73
P ++ +++ C S V+S+R + ++VE HE+WMAQHGR YKD EK R + F+
Sbjct: 4 PKVFLLAVVLGCICLCSTVLSARELGDAAMVERHEQWMAQHGRVYKDGAEKARRFEAFRN 63
Query: 74 NLEYIEKANKEGNR-TYKLGTNQFSDLTNDEFRALYT--GY--KMPSPSHRSTTSSTFKY 128
N+ +IE N GNR + LG NQF+DLTNDEFRA T G+ + + ++++ + TF+Y
Sbjct: 64 NVVFIESFNAAGNRRKFWLGVNQFTDLTNDEFRATKTNKGFIKRNAAAVNKASPTGTFRY 123
Query: 129 QNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQL 188
N+S +P ++DWR KGAVTPIKNQ +CGCCWAF+AVAA EGI ++ +G L+ LSEQ+L
Sbjct: 124 SNVSADALPAAVDWRAKGAVTPIKNQGQCGCCWAFSAVAATEGIVQLSTGKLVPLSEQEL 183
Query: 189 LDCSTNG-NNGCLGGSREKAFAYIIQNQGI 217
+DC NG ++GC GG + AF +II+N G+
Sbjct: 184 VDCDANGADHGCEGGEMDDAFEFIIKNGGL 213
Score = 82.8 bits (203), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 32/68 (47%), Positives = 47/68 (69%), Gaps = 4/68 (5%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCG 271
G+ +G CGT LDH + VG+G +DG +WL+KNSWG TWG+ GY+++ +D G+CG
Sbjct: 280 GVLSGSCGTSLDHGIVAVGYGAADDGTKFWLMKNSWGTTWGEDGYIRMEKDVADAGGMCG 339
Query: 272 IGTRSSYP 279
+ + SYP
Sbjct: 340 LAMQPSYP 347
>gi|224076972|ref|XP_002305074.1| predicted protein [Populus trichocarpa]
gi|224106329|ref|XP_002333698.1| predicted protein [Populus trichocarpa]
gi|222837984|gb|EEE76349.1| predicted protein [Populus trichocarpa]
gi|222848038|gb|EEE85585.1| predicted protein [Populus trichocarpa]
Length = 307
Score = 211 bits (537), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 121/312 (38%), Positives = 163/312 (52%), Gaps = 78/312 (25%)
Query: 43 VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTND 102
+++ HE+WMAQHGR Y D EKE R IFKEN+E IE N +R YKLG N+F+DLTN+
Sbjct: 1 MLKRHEEWMAQHGRVYGDMKEKEKRYLIFKENIERIEAFNNGSDRGYKLGVNKFADLTNE 60
Query: 103 EFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCW- 161
EFRA++ GYK S SS+F+++NLS +PTS+DWR GAVTP+K+Q CGCCW
Sbjct: 61 EFRAMHHGYKRQSSK---LMSSSFRHENLSA--IPTSMDWRKAGAVTPVKDQGTCGCCWA 115
Query: 162 --AFAAVAA---------------------VEGITKIRSGNLIQLSEQQLL--------- 189
A AA+ V+G+ + G L+ + Q +L
Sbjct: 116 FSAVAAIEGIIKLKTGKLISLSEQQLVDCDVKGVDQGCGGGLMDNAFQFILRNGGLTSEA 175
Query: 190 ---------------------------DCSTNGNNGCLGGSREKAFAYIIQ--------- 213
D N N L ++ + ++
Sbjct: 176 TYPYQGVDGTCKSKKTASIEAKITGYEDVPVNNENALLQAVAKQPVSVAVEGGGYDFQFY 235
Query: 214 NQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGL 269
G+F G CGT LDHAVT +G+GT DG NYWL+KNSWG +WG++GYM++ R EGL
Sbjct: 236 KSGVFKGDCGTYLDHAVTAIGYGTNSDGTNYWLVKNSWGTSWGESGYMRMQRGIGAREGL 295
Query: 270 CGIGTRSSYPLA 281
CG+ +SYP A
Sbjct: 296 CGVAMDASYPTA 307
>gi|357474573|ref|XP_003607571.1| Cysteine proteinase EP-B [Medicago truncatula]
gi|34329348|gb|AAQ63885.1| putative cysteine proteinase [Medicago truncatula]
gi|355508626|gb|AES89768.1| Cysteine proteinase EP-B [Medicago truncatula]
Length = 345
Score = 211 bits (536), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 114/218 (52%), Positives = 148/218 (67%), Gaps = 19/218 (8%)
Query: 33 VSSRSTHEQSVV-EIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGN-RTYK 90
V+SR+ + S++ E HE+WM +G+ YKD E+E RLKIFKEN+ YIE +N GN + YK
Sbjct: 26 VTSRTLQDDSIIYEKHEQWMVHYGKVYKDLQERENRLKIFKENVNYIEASNNAGNNKLYK 85
Query: 91 LGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTP 150
LG NQF+DLTN+EF A +K S T +STFKY+N S VP+++DWR KGAVTP
Sbjct: 86 LGINQFADLTNEEFIASRNKFKGHMCS-SITKTSTFKYENAS---VPSTVDWRKKGAVTP 141
Query: 151 IKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFA 209
+KNQ +CGCCWAF+AVAA EGI K+ +G L+ LSEQ+L+DC T G + GC GG + AF
Sbjct: 142 VKNQGQCGCCWAFSAVAATEGIHKLSTGKLVSLSEQELVDCDTKGVDQGCEGGLMDDAFK 201
Query: 210 YIIQNQGI-------FNGVCGT-----QLDHAVTIVGF 235
+IIQN G+ + GV GT HAVTI G+
Sbjct: 202 FIIQNHGLNTEAQYPYQGVDGTCSANKASIHAVTITGY 239
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 40/70 (57%), Positives = 48/70 (68%), Gaps = 4/70 (5%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCG 271
G+F G CGT+LDH VT VG+G DG YWL+KNSWG WG+ GY+K+ R EGLCG
Sbjct: 276 GVFTGSCGTELDHGVTAVGYGVGNDGTKYWLVKNSWGTDWGEEGYIKMQRGVDAAEGLCG 335
Query: 272 IGTRSSYPLA 281
I +SYP A
Sbjct: 336 IAMEASYPTA 345
>gi|400180451|gb|AFP73362.1| cysteine protease [Solanum chilense]
Length = 344
Score = 211 bits (536), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 105/212 (49%), Positives = 144/212 (67%), Gaps = 7/212 (3%)
Query: 10 SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+ K++ + I + ++S + RS E SV E HE WM++HGR YKDE+EK R
Sbjct: 2 AMKVDLMNILITLFFVISMFNTQTRGRSQPELSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N+F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
FK +LS D+P++LDWR+ GAVT +K+Q CGCCWAF+AV ++EG KI +GNL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
Q+LLDC+TN N GC GG AF +I +N GI
Sbjct: 180 QELLDCTTN-NYGCDGGFMTNAFDFIKENGGI 210
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 36/68 (52%), Positives = 48/68 (70%), Gaps = 4/68 (5%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCG 271
G ++G C +++HAVT +G+GT E G YWL+KNSWG +WG+ G+MKI+RD GLC
Sbjct: 274 GTYDGSCADRINHAVTAIGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGDPSGLCD 333
Query: 272 IGTRSSYP 279
I SSYP
Sbjct: 334 IAKMSSYP 341
>gi|400180445|gb|AFP73359.1| cysteine protease, partial [Solanum chilense]
Length = 345
Score = 211 bits (536), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 105/212 (49%), Positives = 144/212 (67%), Gaps = 7/212 (3%)
Query: 10 SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+ K++ + I + ++S + RS E SV E HE WM++HGR YKDE+EK R
Sbjct: 2 AMKVDLMNILITLFFVISMFNTQTRGRSQPELSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N+F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
FK +LS D+P++LDWR+ GAVT +K+Q CGCCWAF+AV ++EG KI +GNL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
Q+LLDC+TN N GC GG AF +I +N GI
Sbjct: 180 QELLDCTTN-NYGCDGGFMTNAFDFIKENGGI 210
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 36/68 (52%), Positives = 48/68 (70%), Gaps = 4/68 (5%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCG 271
G ++G C +++HAVT +G+GT E G YWL+KNSWG +WG+ G+MKI+RD GLC
Sbjct: 274 GTYDGSCADRINHAVTAIGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGDPSGLCD 333
Query: 272 IGTRSSYP 279
I SSYP
Sbjct: 334 IAKMSSYP 341
>gi|2463588|dbj|BAA22546.1| FB1035 precursor [Ananas comosus]
Length = 324
Score = 211 bits (536), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 112/300 (37%), Positives = 158/300 (52%), Gaps = 71/300 (23%)
Query: 48 EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRAL 107
E+WMA++GR YKD EK R +IFK N+++IE N +Y LG NQF+D+T EF A
Sbjct: 11 EEWMAEYGRIYKDNDEKMRRFQIFKNNVKHIETFNSRNGNSYTLGINQFTDMTKSEFVAQ 70
Query: 108 YTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVA 167
YTG +P R S + +++++ VP S+DWRD GAV +KNQ CG CWAFAA+A
Sbjct: 71 YTGVSLPLNIEREPVVS---FDDVNISAVPQSIDWRDYGAVNEVKNQNPCGSCWAFAAIA 127
Query: 168 AVEGITKIRSGNLIQLSEQQLLDCSTN----------------GNNG------------- 198
VEGI KI++G L+ LSEQ++LDC+ + NNG
Sbjct: 128 TVEGIYKIKTGYLVSLSEQEVLDCAVSYGCKGGWVNKAYDFIISNNGVTTEENYPYQAYQ 187
Query: 199 --CLGGS----------------REKAFAYIIQNQ-----------------GIFNGVCG 223
C S E++ Y + NQ G+F+G CG
Sbjct: 188 GTCNANSFPNSAYITGYSYVRRNDERSMMYAVSNQPIAALIDASENFQYYNGGVFSGPCG 247
Query: 224 TQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR----DEGLCGIGTRSSYP 279
T L+HA+TI+G+G G YW+++NSWG++WG+ GY+++ R G CGI +P
Sbjct: 248 TSLNHAITIIGYGQDSSGTKYWIVRNSWGSSWGEGGYVRMARGVSSSSGACGIAMSPLFP 307
>gi|400180359|gb|AFP73318.1| cysteine protease [Solanum peruvianum]
gi|400180477|gb|AFP73375.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 211 bits (536), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 104/212 (49%), Positives = 145/212 (68%), Gaps = 7/212 (3%)
Query: 10 SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+ K++ + I + ++S + +RS + SV E HE WM++HGR YKDE+EK R
Sbjct: 2 AMKVDLMSILITLFFVISMFNSQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N+F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
FK +LS D+P++LDWR+ GAVT +K+Q CGCCWAF+AV ++EG KI +GNL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
Q+LLDC+TN N GC GG AF +I +N GI
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIKENGGI 210
Score = 86.7 bits (213), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 36/68 (52%), Positives = 48/68 (70%), Gaps = 4/68 (5%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCG 271
G ++G C +++HAVT +G+GT E G YWL+KNSWG +WG+ G+MKI+RD GLC
Sbjct: 274 GTYDGSCADRINHAVTAIGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDYGNPAGLCD 333
Query: 272 IGTRSSYP 279
I SSYP
Sbjct: 334 IAKMSSYP 341
>gi|400180375|gb|AFP73326.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 210 bits (535), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 104/212 (49%), Positives = 145/212 (68%), Gaps = 7/212 (3%)
Query: 10 SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+ K++ + I + ++S + +RS + SV E HE WM++HGR YKDE+EK R
Sbjct: 2 AMKVDLMSILITLFFVISMFNSQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N+F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENIKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
FK +LS D+P++LDWR+ GAVT +K+Q CGCCWAF+AV ++EG KI +GNL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
Q+LLDC+TN N GC GG AF +I +N GI
Sbjct: 180 QELLDCTTN-NYGCDGGFMTNAFDFIKENGGI 210
Score = 87.0 bits (214), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 36/68 (52%), Positives = 48/68 (70%), Gaps = 4/68 (5%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE----GLCG 271
G ++G C +++HAVT +G+GT E G YWL+KNSWG +WG+ G+MKI+RD GLC
Sbjct: 274 GTYDGSCADRINHAVTAIGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPAGLCD 333
Query: 272 IGTRSSYP 279
I SSYP
Sbjct: 334 IAKMSSYP 341
>gi|255563134|ref|XP_002522571.1| cysteine protease, putative [Ricinus communis]
gi|223538262|gb|EEF39871.1| cysteine protease, putative [Ricinus communis]
Length = 343
Score = 210 bits (535), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 106/203 (52%), Positives = 138/203 (67%), Gaps = 10/203 (4%)
Query: 20 IIITLLV---SCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLE 76
++ITLL+ + SQ + + +++ E HE+WMA+HGR+Y D EKE R +IFK NL+
Sbjct: 10 LVITLLMILGTWVSQAMPRPLLNAEAIAEKHEQWMARHGRTYHDNAEKERRFQIFKNNLD 69
Query: 77 YIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPS--PSHRSTTSSTFKYQNLSMT 134
YIE NK N+TYKLG N+FSDL+ +EF Y GY+MP+ P+ +T TF +
Sbjct: 70 YIENFNKAFNKTYKLGLNKFSDLSEEEFVTTYNGYEMPTTLPTANTTVKPTFFSNYYNQD 129
Query: 135 DVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN 194
+VP S+DWR+ G VT +KNQ ECGCCWAF+AVAAVEGI +GN LS QQLLDC
Sbjct: 130 EVPESIDWRENGVVTSVKNQGECGCCWAFSAVAAVEGI----AGNGASLSAQQLLDC-VG 184
Query: 195 GNNGCLGGSREKAFAYIIQNQGI 217
N+GC GG+ KAF YI+QNQGI
Sbjct: 185 DNSGCGGGTMIKAFEYIVQNQGI 207
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 41/69 (59%), Positives = 52/69 (75%), Gaps = 5/69 (7%)
Query: 216 GIFNGV-CGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLC 270
G+F+ CGT L HAVT+VG+GTTEDG YWL+KNSWG WG++GYM++ RD EG C
Sbjct: 272 GVFSAEDCGTHLTHAVTLVGYGTTEDGTKYWLVKNSWGEEWGESGYMRLQRDVGAMEGPC 331
Query: 271 GIGTRSSYP 279
GI ++SYP
Sbjct: 332 GIAMQASYP 340
>gi|242072572|ref|XP_002446222.1| hypothetical protein SORBIDRAFT_06g005410 [Sorghum bicolor]
gi|241937405|gb|EES10550.1| hypothetical protein SORBIDRAFT_06g005410 [Sorghum bicolor]
Length = 340
Score = 210 bits (535), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 107/231 (46%), Positives = 147/231 (63%), Gaps = 17/231 (7%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
+ I+ L + C + + + + ++V HE+WMAQ+ R YKD EK R ++FK N+++
Sbjct: 8 ILAILGLALFCGAALAARDLNDDSAMVARHEQWMAQYNRVYKDATEKAQRFEVFKANVKF 67
Query: 78 IEKANKEGNRTYKLGTNQFSDLTNDEFRALYT--GYKMPSPSHRSTTSSTFKYQNLSMTD 135
IE N GNR + LG NQF+DLTNDEFRA T G+K PSP T F+Y+N+S+
Sbjct: 68 IESFNAGGNRKFWLGVNQFADLTNDEFRATKTNKGFK-PSPVKVPTG---FRYENVSVDA 123
Query: 136 VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG 195
+P S+DWR KGAVTPIK+Q +CGCCWAF+AVAA EGI KI + LI LSEQ+L+DC +G
Sbjct: 124 LPASIDWRTKGAVTPIKDQGQCGCCWAFSAVAATEGIVKISTDKLISLSEQELVDCDVHG 183
Query: 196 -NNGCLGGSREKAFAYIIQNQGI----------FNGVCGTQLDHAVTIVGF 235
+ GC GG + AF +II+N G+ +G C + + A I GF
Sbjct: 184 EDQGCEGGLMDDAFKFIIKNGGLTTESSYPYTATDGKCKSGTNSAANIKGF 234
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 33/74 (44%), Positives = 48/74 (64%), Gaps = 4/74 (5%)
Query: 210 YIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD--- 266
+ + + G+ G CGT LDH + +G+G T DG YWL+KNSWG TWG+ GY+++ +D
Sbjct: 265 FQLYSGGVMTGSCGTDLDHGIAAIGYGQTSDGTKYWLLKNSWGTTWGENGYLRMEKDISD 324
Query: 267 -EGLCGIGTRSSYP 279
G+CG+ SYP
Sbjct: 325 KRGMCGLAMEPSYP 338
>gi|400180391|gb|AFP73334.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 210 bits (535), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 104/212 (49%), Positives = 145/212 (68%), Gaps = 7/212 (3%)
Query: 10 SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+ K++ + I + ++S + +RS + SV E HE WM++HGR YKDE+EK R
Sbjct: 2 AMKVDLMSILITLFFVISMFNSQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N+F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
FK +LS D+P++LDWR+ GAVT +K+Q CGCCWAF+AV ++EG KI +GNL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
Q+LLDC+TN N GC GG AF +I +N GI
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIKENGGI 210
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 36/68 (52%), Positives = 48/68 (70%), Gaps = 4/68 (5%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCG 271
G ++G C +++HAVT +G+GT E G YWL+KNSWG +WG+ G+MKI+RD GLC
Sbjct: 274 GTYDGSCADRINHAVTAIGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDYGNPAGLCD 333
Query: 272 IGTRSSYP 279
I SSYP
Sbjct: 334 IAKMSSYP 341
>gi|37780051|gb|AAP32198.1| cysteine protease 12 [Trifolium repens]
Length = 343
Score = 210 bits (534), Expect = 7e-52, Method: Compositional matrix adjust.
Identities = 113/219 (51%), Positives = 147/219 (67%), Gaps = 22/219 (10%)
Query: 33 VSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNR-TYKL 91
V+SR+ + S+ E HE+WM +G+ YK+ E+E RL+IF ENL+YIE +N GN+ YKL
Sbjct: 25 VTSRTLQDDSIFERHEQWMTHYGKVYKNPQEREKRLRIFTENLKYIEASNNAGNKKPYKL 84
Query: 92 GTNQFSDLTNDEFRALYTGYK--MPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVT 149
G NQF+DLTN+EF A +K M S R+TT FKY+N T VP+++DWR KGAVT
Sbjct: 85 GINQFADLTNEEFIASRNKFKGHMCSSIIRTTT---FKYEN---TSVPSTVDWRKKGAVT 138
Query: 150 PIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAF 208
P+KNQ +CGCCWAF+A+AA EGI KI +G L+ LSEQ+L+DC TNG + GC GG + AF
Sbjct: 139 PVKNQGQCGCCWAFSAIAATEGIHKISTGKLVSLSEQELVDCDTNGVDQGCEGGLMDDAF 198
Query: 209 AYIIQNQGI-------FNGVCGT-----QLDHAVTIVGF 235
+IIQN GI + GV GT A TI G+
Sbjct: 199 KFIIQNNGISTEAGYPYQGVDGTCKANEASTSAATITGY 237
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 39/70 (55%), Positives = 50/70 (71%), Gaps = 4/70 (5%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCG 271
G+F G CGT+LDH VT VG+G + DG YWL+KNSWG WG+ GY+++ R EGLCG
Sbjct: 274 GVFTGSCGTELDHGVTAVGYGISNDGTKYWLVKNSWGTDWGEEGYIRMQRSIDAAEGLCG 333
Query: 272 IGTRSSYPLA 281
I ++SYP A
Sbjct: 334 IAMQASYPTA 343
>gi|37780045|gb|AAP32195.1| cysteine protease 5 [Trifolium repens]
Length = 343
Score = 210 bits (534), Expect = 7e-52, Method: Compositional matrix adjust.
Identities = 113/219 (51%), Positives = 147/219 (67%), Gaps = 22/219 (10%)
Query: 33 VSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGN-RTYKL 91
V+SR+ + S+ E HE+WM +G+ YK+ E+E RL+IF ENL+YIE +N GN + YKL
Sbjct: 25 VTSRTLQDDSIFERHEQWMTHYGKVYKNPQEREKRLRIFTENLKYIEASNNAGNNKPYKL 84
Query: 92 GTNQFSDLTNDEFRALYTGYK--MPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVT 149
G NQF+DLTN+EF A +K M S R+TT FKY+N T VP+++DWR KGAVT
Sbjct: 85 GINQFADLTNEEFIASRNKFKGHMCSSIIRTTT---FKYEN---TSVPSTVDWRKKGAVT 138
Query: 150 PIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAF 208
P+KNQ +CGCCWAF+A+AA EGI KI +G L+ LSEQ+L+DC TNG + GC GG + AF
Sbjct: 139 PVKNQGQCGCCWAFSAIAATEGIHKISTGKLVSLSEQELVDCDTNGVDQGCEGGLMDDAF 198
Query: 209 AYIIQNQGI-------FNGVCGT-----QLDHAVTIVGF 235
+IIQN GI + GV GT A TI G+
Sbjct: 199 KFIIQNNGISTEAGYPYQGVDGTCKANEASTSAATITGY 237
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 39/70 (55%), Positives = 50/70 (71%), Gaps = 4/70 (5%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCG 271
G+F G CGT+LDH VT VG+G + DG YWL+KNSWG WG+ GY+++ R EGLCG
Sbjct: 274 GVFTGSCGTELDHGVTAVGYGISNDGTKYWLVKNSWGTDWGEEGYIRMQRSIDAAEGLCG 333
Query: 272 IGTRSSYPLA 281
I ++SYP A
Sbjct: 334 IAMQASYPTA 343
>gi|400180467|gb|AFP73370.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 209 bits (533), Expect = 7e-52, Method: Compositional matrix adjust.
Identities = 104/212 (49%), Positives = 145/212 (68%), Gaps = 7/212 (3%)
Query: 10 SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+ K++ + I + ++S + +RS + SV E HE WM++HGR YKDE+EK R
Sbjct: 2 AMKVDLMNILITLFFVISMFNTQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N+F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
FK +LS D+P++LDWR+ GAVT +K+Q CGCCWAF+AV ++EG KI +GNL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
Q+LLDC+TN N GC GG AF +I +N GI
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIKENGGI 210
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 36/68 (52%), Positives = 48/68 (70%), Gaps = 4/68 (5%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCG 271
G ++G C +++HAVT +G+GT E G YWL+KNSWG +WG+ G+MKI+RD GLC
Sbjct: 274 GTYDGSCADRINHAVTAIGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDYGNPAGLCD 333
Query: 272 IGTRSSYP 279
I SSYP
Sbjct: 334 IAKMSSYP 341
>gi|357474527|ref|XP_003607548.1| Cysteine protease [Medicago truncatula]
gi|358347211|ref|XP_003637653.1| Cysteine protease [Medicago truncatula]
gi|355503588|gb|AES84791.1| Cysteine protease [Medicago truncatula]
gi|355508603|gb|AES89745.1| Cysteine protease [Medicago truncatula]
Length = 345
Score = 209 bits (533), Expect = 7e-52, Method: Compositional matrix adjust.
Identities = 114/221 (51%), Positives = 146/221 (66%), Gaps = 18/221 (8%)
Query: 29 ASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGN-R 87
A QV S + ++ E HE+WM +G+ YKD E+E RLKIFKEN+ YIE +N GN +
Sbjct: 23 AIQVTSRTLQDDSNIYEKHEQWMVHYGKVYKDLQERENRLKIFKENVNYIEASNNAGNNK 82
Query: 88 TYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGA 147
YKLG NQF+DLTN+EF A +K S T +STFKY+N S VP+++DWR KGA
Sbjct: 83 LYKLGINQFADLTNEEFIASRNKFKGHMCS-SITKTSTFKYENAS---VPSTVDWRKKGA 138
Query: 148 VTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREK 206
VTP+KNQ +CGCCWAF+AVAA EGI K+ +G L+ LSEQ+L+DC T G + GC GG +
Sbjct: 139 VTPVKNQGQCGCCWAFSAVAATEGIHKLSTGKLVSLSEQELVDCDTKGVDQGCEGGLMDD 198
Query: 207 AFAYIIQNQGI-------FNGVCGT-----QLDHAVTIVGF 235
AF +IIQN G+ + GV GT HAVTI G+
Sbjct: 199 AFKFIIQNHGLNTEAQYPYQGVDGTCSANKASIHAVTITGY 239
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 40/70 (57%), Positives = 48/70 (68%), Gaps = 4/70 (5%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCG 271
G+F G CGT+LDH VT VG+G DG YWL+KNSWG WG+ GY+K+ R EGLCG
Sbjct: 276 GVFTGSCGTELDHGVTAVGYGVGNDGTKYWLVKNSWGTDWGEEGYIKMQRGVDAAEGLCG 335
Query: 272 IGTRSSYPLA 281
I +SYP A
Sbjct: 336 IAMEASYPTA 345
>gi|319826926|gb|ADV74756.1| cysteine protease [Lactuca sativa]
Length = 363
Score = 209 bits (533), Expect = 8e-52, Method: Compositional matrix adjust.
Identities = 118/321 (36%), Positives = 168/321 (52%), Gaps = 75/321 (23%)
Query: 33 VSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLG 92
+SR+ ++ +++ HE+WMA HGR Y DE EK++R +IFK N+ YI+ N +++Y L
Sbjct: 41 ATSRTLNDPTMIARHEQWMAHHGRIYTDENEKQLRFQIFKNNVAYIDAHNARSDQSYTLE 100
Query: 93 TNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIK 152
N+F+DLTNDEFRA GYK S S F+Y N+S VP +DWR +GAVTP+K
Sbjct: 101 VNKFADLTNDEFRASRNGYKKQPDSDSHVVSGLFRYANVSA--VPDEVDWRKEGAVTPVK 158
Query: 153 NQKECGCCW---AFAAVAA---------------------VEGITKIRSGNLIQ-----L 183
+Q +CGCCW A AA+ ++GI + G L++ +
Sbjct: 159 DQGDCGCCWAFSAVAAMEGINKLENGKLVSLSEQELVDCDIDGIDQGCEGGLMENAFQFI 218
Query: 184 SEQQLLDCST----NGNNGC------------------LGGSREKAFAYIIQNQ------ 215
+++ L + G +G + + EKA + NQ
Sbjct: 219 EKRKGLAAESVYPYTGEDGICNTKKAAIPAAKISGHEKVPANNEKALLQAVANQPVSIAI 278
Query: 216 ------------GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKI 263
G+F G CGT+LDHA+T VG+G T DG YWL+KNSWG +WG+ GY++I
Sbjct: 279 DASGYEFQFYSGGVFTGSCGTELDHAITAVGYGATMDGTKYWLMKNSWGASWGENGYIRI 338
Query: 264 VRD----EGLCGIGTRSSYPL 280
RD EGLCGI SYP+
Sbjct: 339 KRDSLAKEGLCGIAMDPSYPV 359
>gi|400180393|gb|AFP73335.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 209 bits (533), Expect = 8e-52, Method: Compositional matrix adjust.
Identities = 104/212 (49%), Positives = 144/212 (67%), Gaps = 7/212 (3%)
Query: 10 SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+ K++ + I + ++S + RS + SV E HE WM++HGR YKDE+EK R
Sbjct: 2 AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKVERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N+F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
K +LS D+P++LDWR+ GAVT +K+Q CGCCWAF+AV ++EG KI +GNL++ SE
Sbjct: 120 LKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
Q+LLDC+TN N GC GG AF +II+N GI
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIIENGGI 210
Score = 87.0 bits (214), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 36/68 (52%), Positives = 48/68 (70%), Gaps = 4/68 (5%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCG 271
G ++G C +++HAVT +G+GT E G YWL+KNSWG +WG+ G+MKI+RD GLC
Sbjct: 274 GTYDGSCADRINHAVTAIGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCD 333
Query: 272 IGTRSSYP 279
I SSYP
Sbjct: 334 IAKMSSYP 341
>gi|37780047|gb|AAP32196.1| cysteine protease 8 [Trifolium repens]
Length = 343
Score = 209 bits (533), Expect = 9e-52, Method: Compositional matrix adjust.
Identities = 109/218 (50%), Positives = 147/218 (67%), Gaps = 20/218 (9%)
Query: 33 VSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLG 92
V+SR+ + S+ E H +WM+Q+G+ YKD E+E R KIF EN+ Y+E +N + ++YKLG
Sbjct: 25 VTSRTLQDDSMYERHGQWMSQYGKIYKDHQERETRFKIFTENVNYVEASNADDTKSYKLG 84
Query: 93 TNQFSDLTNDEFRALYTGYK--MPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTP 150
NQF+DLTN+EF A +K M S R+TT FKY+N+S +P+++DWR KGAVTP
Sbjct: 85 INQFADLTNEEFVASRNKFKGHMCSSITRTTT---FKYENVSA--IPSTVDWRKKGAVTP 139
Query: 151 IKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFA 209
+KNQ +CGCCWAF+AVAA EGI K+ +G LI LSEQ+L+DC T G + GC GG + AF
Sbjct: 140 VKNQGQCGCCWAFSAVAATEGIHKLSTGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFK 199
Query: 210 YIIQNQGI-------FNGVCGT-----QLDHAVTIVGF 235
+IIQN G+ + GV GT AVTI G+
Sbjct: 200 FIIQNHGLSTEAQYPYEGVDGTCNANKASVQAVTITGY 237
Score = 92.0 bits (227), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 39/70 (55%), Positives = 49/70 (70%), Gaps = 4/70 (5%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCG 271
G+F G CGT+LDH VT VG+G + DG YWL+KNSWG WG+ GY+ + R EGLCG
Sbjct: 274 GVFTGSCGTELDHGVTAVGYGVSNDGTKYWLVKNSWGTDWGEEGYIMMQRGVEAAEGLCG 333
Query: 272 IGTRSSYPLA 281
I ++SYP A
Sbjct: 334 IAMQASYPTA 343
>gi|400180455|gb|AFP73364.1| cysteine protease [Solanum peruvianum]
gi|400180459|gb|AFP73366.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 209 bits (532), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 104/212 (49%), Positives = 144/212 (67%), Gaps = 7/212 (3%)
Query: 10 SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+ K++ + I + ++S + RS + SV E HE WM++HGR YKDE+EK R
Sbjct: 2 AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N+F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
FK +LS D+P++LDWR+ GAVT +K+Q CGCCWAF+AV ++EG KI +GNL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
Q+LLDC+TN N GC GG AF +I +N GI
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIKENGGI 210
Score = 86.7 bits (213), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 36/68 (52%), Positives = 48/68 (70%), Gaps = 4/68 (5%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCG 271
G ++G C +++HAVT +G+GT E G YWL+KNSWG +WG+ G+MKI+RD GLC
Sbjct: 274 GTYDGSCADRINHAVTAIGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGDPSGLCD 333
Query: 272 IGTRSSYP 279
I SSYP
Sbjct: 334 IAKMSSYP 341
>gi|400180385|gb|AFP73331.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 209 bits (532), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 104/212 (49%), Positives = 144/212 (67%), Gaps = 7/212 (3%)
Query: 10 SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+ K++ + I + ++S + RS + SV E HE WM++HGR YKDE+EK R
Sbjct: 2 AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N+F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
FK +LS D+P++LDWR+ GAVT +K+Q CGCCWAF+AV ++EG KI +GNL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
Q+LLDC+TN N GC GG AF +I +N GI
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIKENGGI 210
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 36/68 (52%), Positives = 48/68 (70%), Gaps = 4/68 (5%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCG 271
G ++G C +++HAVT +G+GT E G YWL+KNSWG +WG+ G+MKI+RD GLC
Sbjct: 274 GTYDGSCADRINHAVTAIGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGDPSGLCD 333
Query: 272 IGTRSSYP 279
I SSYP
Sbjct: 334 ITKMSSYP 341
>gi|5823018|gb|AAD53011.1|AF089848_1 senescence-specific cysteine protease [Brassica napus]
Length = 346
Score = 209 bits (532), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 124/337 (36%), Positives = 166/337 (49%), Gaps = 75/337 (22%)
Query: 18 MFIIITLLVS-CASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLE 76
+F+I++L+ S C S +S E + + H++WMA+HGR+Y D EK R +FK N+E
Sbjct: 8 IFLIVSLVSSFCFSTTLSRLLDDELIMQKKHDEWMAEHGRTYADMNEKNNRYVVFKRNVE 67
Query: 77 YIEKANK-EGNRTYKLGTNQFSDLTNDEFRALYTGYK--MPSPSHRSTTSSTFKYQNLSM 133
IE+ N RT+KL NQF+DLTNDEFR +YTGYK S T S++F+YQN+
Sbjct: 68 RIERLNNVPAGRTFKLAVNQFADLTNDEFRFMYTGYKGDFVLFSQSQTKSTSFRYQNVFF 127
Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCW-------------------------------- 161
+P ++DWR KGAVTPIKNQ CGCCW
Sbjct: 128 GALPIAVDWRKKGAVTPIKNQGSCGCCWAFSAVAAIEGATQIKKGKLISLSEQQLVDCDT 187
Query: 162 ------------AFAAVAAVEGIT--------------KIRSGNLIQLSEQQLLDCSTNG 195
AF + A G+T KI+S S D N
Sbjct: 188 NDFGCSGGLMDTAFEHIMATGGLTTESNYPYKGEDANCKIKSTKPSAASITGYEDVPVND 247
Query: 196 NNGCLG---------GSREKAFAYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWL 246
N + G F + + G+F G C T LDHAVT VG+ + G+ YW+
Sbjct: 248 ENALMKAVAHQPVSVGIEGGGFDFQFYSSGVFTGECTTYLDHAVTAVGYSQSSAGSKYWI 307
Query: 247 IKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 279
IKNSWG WG+ GYM+I +D EGLCG+ ++SYP
Sbjct: 308 IKNSWGTKWGEGGYMRIKKDIKDKEGLCGLAMKASYP 344
>gi|400180403|gb|AFP73340.1| cysteine protease [Solanum peruvianum]
gi|400180413|gb|AFP73345.1| cysteine protease [Solanum peruvianum]
gi|400180415|gb|AFP73346.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 209 bits (532), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 104/212 (49%), Positives = 144/212 (67%), Gaps = 7/212 (3%)
Query: 10 SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+ K++ + I + ++S + RS + SV E HE WM++HGR YKDE+EK R
Sbjct: 2 AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N+F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
FK +LS D+P++LDWR+ GAVT +K+Q CGCCWAF+AV ++EG KI +GNL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
Q+LLDC+TN N GC GG AF +I +N GI
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIKENGGI 210
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 36/68 (52%), Positives = 48/68 (70%), Gaps = 4/68 (5%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE----GLCG 271
G ++G C +++HAVT +G+GT E G YWL+KNSWG +WG+ G+MKI+RD GLC
Sbjct: 274 GTYDGSCADRINHAVTAIGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPAGLCD 333
Query: 272 IGTRSSYP 279
I SSYP
Sbjct: 334 IAKMSSYP 341
>gi|400180353|gb|AFP73315.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 209 bits (532), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 104/212 (49%), Positives = 144/212 (67%), Gaps = 7/212 (3%)
Query: 10 SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+ K++ + I + ++S + RS + SV E HE WM++HGR YKDE+EK R
Sbjct: 2 AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N+F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
F +LS D+P++LDWR+ GAVT +K+Q CGCCWAF+AV ++EG KI +GNL++ SE
Sbjct: 120 FIINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
Q+LLDC+TN N GC GG AF +II+N GI
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIIENGGI 210
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 36/68 (52%), Positives = 48/68 (70%), Gaps = 4/68 (5%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE----GLCG 271
G ++G C +++HAVT +G+GT E G YWL+KNSWG +WG+ G+MKI+RD GLC
Sbjct: 274 GTYDGSCADRINHAVTAIGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPAGLCD 333
Query: 272 IGTRSSYP 279
I SSYP
Sbjct: 334 IAKMSSYP 341
>gi|400180407|gb|AFP73342.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 209 bits (532), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 104/212 (49%), Positives = 144/212 (67%), Gaps = 7/212 (3%)
Query: 10 SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+ K++ + I + ++S + RS + SV E HE WM++HGR YKDE+EK R
Sbjct: 2 AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKVERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N+F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
FK +LS D+P++LDWR+ GAVT +K+Q CGCCWAF+AV ++EG KI +GNL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
Q+LLDC+TN N GC GG AF +I +N GI
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIKENGGI 210
Score = 87.0 bits (214), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 36/68 (52%), Positives = 48/68 (70%), Gaps = 4/68 (5%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCG 271
G ++G C +++HAVT +G+GT E G YWL+KNSWG +WG+ G+MKI+RD GLC
Sbjct: 274 GTYDGSCADRINHAVTAIGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCD 333
Query: 272 IGTRSSYP 279
I SSYP
Sbjct: 334 IAKMSSYP 341
>gi|400180457|gb|AFP73365.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 209 bits (532), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 104/212 (49%), Positives = 144/212 (67%), Gaps = 7/212 (3%)
Query: 10 SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+ K++ + I + ++S + RS + SV E HE WM++HGR YKDE+EK R
Sbjct: 2 AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N+F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
F +LS D+P++LDWR+ GAVT +K+Q CGCCWAF+AV ++EG KI +GNL++ SE
Sbjct: 120 FIINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
Q+LLDC+TN N GC GG AF +II+N GI
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIIENGGI 210
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 36/68 (52%), Positives = 48/68 (70%), Gaps = 4/68 (5%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCG 271
G ++G C +++HAVT +G+GT E G YWL+KNSWG +WG+ G+MKI+RD GLC
Sbjct: 274 GTYDGSCADRINHAVTAIGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCD 333
Query: 272 IGTRSSYP 279
I SSYP
Sbjct: 334 IAKMSSYP 341
>gi|400180351|gb|AFP73314.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 209 bits (531), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 104/212 (49%), Positives = 144/212 (67%), Gaps = 7/212 (3%)
Query: 10 SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+ K++ + I + ++S + RS + SV E HE WM++HGR YKDE+EK R
Sbjct: 2 AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N+F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
FK +LS D+P++LDWR+ GAVT +K+Q CGCCWAF+AV ++EG KI +GNL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
Q+LLDC+TN N GC GG AF +I +N GI
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIKENGGI 210
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 36/68 (52%), Positives = 48/68 (70%), Gaps = 4/68 (5%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCG 271
G ++G C +++HAVT +G+GT E G YWL+KNSWG +WG+ G+MKI+RD GLC
Sbjct: 274 GTYDGSCADRINHAVTAIGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDYGNPAGLCD 333
Query: 272 IGTRSSYP 279
I SSYP
Sbjct: 334 IAKMSSYP 341
>gi|400180379|gb|AFP73328.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 208 bits (529), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 103/212 (48%), Positives = 144/212 (67%), Gaps = 7/212 (3%)
Query: 10 SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+ K++ + I + +++ + RS + SV E HE WM++HGR YKDE+EK R
Sbjct: 2 AMKVDLMNILITLFFVITMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N+F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
FK +LS D+P++LDWR+ GAVT +K+Q CGCCWAF+AV ++EG KI +GNL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
Q+LLDC+TN N GC GG AF +I +N GI
Sbjct: 180 QELLDCTTN-NYGCDGGFMTNAFDFIKENGGI 210
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 36/68 (52%), Positives = 48/68 (70%), Gaps = 4/68 (5%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCG 271
G ++G C +++HAVT +G+GT E G YWL+KNSWG +WG+ G+MKI+RD GLC
Sbjct: 274 GTYDGSCADRINHAVTAIGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCD 333
Query: 272 IGTRSSYP 279
I SSYP
Sbjct: 334 IAKMSSYP 341
>gi|297851334|ref|XP_002893548.1| peptidase C1A papain family protein [Arabidopsis lyrata subsp.
lyrata]
gi|297339390|gb|EFH69807.1| peptidase C1A papain family protein [Arabidopsis lyrata subsp.
lyrata]
Length = 346
Score = 208 bits (529), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 105/206 (50%), Positives = 139/206 (67%), Gaps = 7/206 (3%)
Query: 18 MFIIITLLVSC--ASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENL 75
MF+ +T+L SQ S + HE V E H++WM + R Y DELEK+MR +FK+NL
Sbjct: 7 MFVSLTILSMSLKVSQATSRVTFHEPIVAEHHQQWMTRFSRVYSDELEKQMRFDVFKKNL 66
Query: 76 EYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYK----MPSPSHRSTTSSTFKYQNL 131
++IEK NK+G+RTYKLG N+F+D T +EF A +TG K +PS ++ + N+
Sbjct: 67 KFIEKFNKKGDRTYKLGVNEFADWTKEEFIATHTGLKGFNGIPSSEFVDEMIPSWNW-NV 125
Query: 132 SMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDC 191
S P DWR +GAVTP+K Q +CGCCWAF++VAAVEG+TKI GNL+ LSEQQLLDC
Sbjct: 126 SDVAGPEIKDWRYEGAVTPVKYQGQCGCCWAFSSVAAVEGLTKIVGGNLVSLSEQQLLDC 185
Query: 192 STNGNNGCLGGSREKAFAYIIQNQGI 217
+NGC GG AF+YII+N+GI
Sbjct: 186 DRERDNGCNGGIMSDAFSYIIKNRGI 211
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 34/64 (53%), Positives = 47/64 (73%), Gaps = 4/64 (6%)
Query: 222 CGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSS 277
CGT ++HAVT VG+GT+ +G YWL KNSWG TWG+ GY++I RD +G+CG+ +
Sbjct: 283 CGTDVNHAVTFVGYGTSPEGIKYWLAKNSWGETWGENGYIRIRRDVAWPQGMCGVAQYAF 342
Query: 278 YPLA 281
YP+A
Sbjct: 343 YPVA 346
>gi|115461667|ref|NP_001054433.1| Os05g0108600 [Oryza sativa Japonica Group]
gi|14719319|gb|AAK73137.1|AC079022_10 putative cysteine proteinase [Oryza sativa]
gi|33151125|gb|AAP97431.1| cysteine protease CP1 [Oryza sativa]
gi|52353572|gb|AAU44138.1| cysteine proteinase CP1 [Oryza sativa Japonica Group]
gi|113577984|dbj|BAF16347.1| Os05g0108600 [Oryza sativa Japonica Group]
gi|125550541|gb|EAY96250.1| hypothetical protein OsI_18148 [Oryza sativa Indica Group]
Length = 358
Score = 208 bits (529), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 122/317 (38%), Positives = 178/317 (56%), Gaps = 78/317 (24%)
Query: 37 STHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQF 96
++H++ ++E+ EKW+A++ ++Y EK R ++FK+NL +I+ NK+ +Y LG N+F
Sbjct: 42 ASHDR-LIELFEKWVAKYRKAYASFEEKVRRFEVFKDNLNHIDDINKK-VTSYWLGLNEF 99
Query: 97 SDLTNDEFRALYTGYKMPSPSHRST---TSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKN 153
+DLT+DEF+A Y G P P+ ++ +S F+Y +S +VP +DWR K AVT +KN
Sbjct: 100 ADLTHDEFKATYLGL-TPPPTRSNSKHYSSEEFRYGKMSNGEVPKEMDWRKKNAVTEVKN 158
Query: 154 QKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGN---NGCL---------- 200
Q +CG CWAF+ VAAVEGI I +GNL LSEQ+L+DCST+GN NG L
Sbjct: 159 QGQCGSCWAFSTVAAVEGINAIVTGNLTSLSEQELIDCSTDGNNGCNGGLMDYAFSYIAS 218
Query: 201 -GGSR-EKAFAYIIQ--------------------------------------------- 213
GG R E+A+ Y ++
Sbjct: 219 TGGLRTEEAYPYAMEEGDCDEGKGAAVVTISGYEDVPANDEQALVKALAHQPVSVAIEAS 278
Query: 214 -------NQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR- 265
+ G+F+G CG QLDH VT VG+GT++ G +Y ++KNSWG WG+ GY+++ R
Sbjct: 279 GRHFQFYSGGVFDGPCGEQLDHGVTAVGYGTSK-GQDYIIVKNSWGPHWGEKGYIRMKRG 337
Query: 266 ---DEGLCGIGTRSSYP 279
EGLCGI +SYP
Sbjct: 338 TGKGEGLCGINKMASYP 354
>gi|400180373|gb|AFP73325.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 208 bits (529), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 103/212 (48%), Positives = 144/212 (67%), Gaps = 7/212 (3%)
Query: 10 SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+ K++ + I + ++S + RS + SV E HE WM++HG YKDE+EK R
Sbjct: 2 AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGHVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N+F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
FK +LS D+P++LDWR+ GAVT +K+Q +CGCCWAF+AV ++EG KI +GNL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGQCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
Q+LLDC+TN N GC GG AF +I +N GI
Sbjct: 180 QELLDCTTN-NYGCDGGFMTNAFDFIKENGGI 210
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 36/68 (52%), Positives = 48/68 (70%), Gaps = 4/68 (5%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE----GLCG 271
G ++G C +++HAVT +G+GT E G YWL+KNSWG +WG+ G+MKI+RD GLC
Sbjct: 274 GTYDGSCADRINHAVTAIGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPAGLCD 333
Query: 272 IGTRSSYP 279
I SSYP
Sbjct: 334 IAKMSSYP 341
>gi|357452075|ref|XP_003596314.1| Cysteine proteinase [Medicago truncatula]
gi|355485362|gb|AES66565.1| Cysteine proteinase [Medicago truncatula]
Length = 341
Score = 207 bits (528), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 107/218 (49%), Positives = 144/218 (66%), Gaps = 23/218 (10%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
+F+ + LL S +SR+ + E+HE+WM QHG+ YK EK+ R IFKEN+ Y
Sbjct: 14 LFLCLGLL----SFQATSRTLQNDPMYEMHEQWMVQHGKVYKAAHEKQKRFGIFKENVNY 69
Query: 78 IEKANKEGNRTYKLGTNQFSDLTNDEF---RALYTGYKMPSPSHRSTTSSTFKYQNLSMT 134
IE N GN++YKLG N F+DLTN EF R + GY + +TFKY+N+S
Sbjct: 70 IEAFNNVGNKSYKLGLNHFADLTNHEFIAARNKFNGYL------HGSIITTFKYKNVS-- 121
Query: 135 DVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN 194
DVP+++DWR +GAVTP+KNQ +CGCCWAF+AVA+ EGI K+ +GNL+ LSEQ+L+DC TN
Sbjct: 122 DVPSAVDWRQEGAVTPVKNQGQCGCCWAFSAVASTEGIHKLTTGNLVSLSEQELVDCDTN 181
Query: 195 G-NNGCLGGSREKAFAYIIQNQGI-------FNGVCGT 224
G + GC GG + AF +IIQN G+ + GV GT
Sbjct: 182 GEDQGCEGGLMDDAFEFIIQNNGLSTEAEYPYQGVDGT 219
Score = 77.4 bits (189), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 38/70 (54%), Positives = 48/70 (68%), Gaps = 4/70 (5%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR----DEGLCG 271
G+F G CGT+LDH V +VG+G ED YWL+KNSWG WG+ GY+++ R EGLCG
Sbjct: 272 GVFTGSCGTELDHGVAVVGYGVGEDETEYWLVKNSWGTQWGEEGYIRMQRGVDASEGLCG 331
Query: 272 IGTRSSYPLA 281
I + SYP A
Sbjct: 332 IAMQPSYPTA 341
>gi|310656789|gb|ADP02218.1| Peptidase_C1 domain-containing protein [Triticum aestivum]
Length = 341
Score = 207 bits (527), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 99/201 (49%), Positives = 136/201 (67%), Gaps = 4/201 (1%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
+ I+ + C+S V+S+R + ++VE HE+WMA+ R YKD EK R ++FK N+ +
Sbjct: 8 LLAIVGCICLCSSAVLSARELGDTAMVERHEQWMAKFNRVYKDGTEKAQRFEVFKANVAF 67
Query: 78 IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
IE N E NR + LG NQF+DLTNDEFRA T + R+ T FKY N+S+ +P
Sbjct: 68 IESFNAE-NRKFWLGVNQFTDLTNDEFRATKTNKGLKMSGGRAPTG--FKYSNVSIDALP 124
Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-N 196
T++DWR KG VTPIK+Q +CGCCWAF+AV A EGI K+ +G LI LSEQ+L+DC +G +
Sbjct: 125 TAVDWRTKGVVTPIKDQGQCGCCWAFSAVVATEGIVKLSTGKLISLSEQELVDCDVHGVD 184
Query: 197 NGCLGGSREKAFAYIIQNQGI 217
GC GG + AF +II+N G+
Sbjct: 185 QGCEGGEMDDAFKFIIKNGGL 205
Score = 86.3 bits (212), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 33/68 (48%), Positives = 47/68 (69%), Gaps = 4/68 (5%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCG 271
G+ G CGT LDH + +G+G T DG YWL+KNSWG TWG++GY+++ +D G+CG
Sbjct: 272 GVMTGSCGTDLDHGIAAIGYGMTSDGTKYWLLKNSWGTTWGESGYLRMEKDISDKSGMCG 331
Query: 272 IGTRSSYP 279
+ + SYP
Sbjct: 332 LAMQPSYP 339
>gi|297826061|ref|XP_002880913.1| hypothetical protein ARALYDRAFT_481640 [Arabidopsis lyrata subsp.
lyrata]
gi|297326752|gb|EFH57172.1| hypothetical protein ARALYDRAFT_481640 [Arabidopsis lyrata subsp.
lyrata]
Length = 347
Score = 207 bits (527), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 109/209 (52%), Positives = 140/209 (66%), Gaps = 8/209 (3%)
Query: 15 TTPMFIIITLLVSCASQVVSSRS-THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKE 73
++ + I+T+ +S + + +SR E S +E HE+WMA+ R Y DE EK R IFK+
Sbjct: 2 SSTIIFILTIFLSYRTSLATSRGGLFEASPIEKHEQWMARFNRVYSDESEKRNRFNIFKK 61
Query: 74 NLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSP-SHRSTTSS----TFKY 128
NLE+++ N N TYKL N+FSDLT++EFRA +TG +P + ST SS F+Y
Sbjct: 62 NLEFVQSFNMNKNITYKLDVNEFSDLTDEEFRATHTGLVVPEEITGISTLSSDKTVPFRY 121
Query: 129 QNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQL 188
N+S D S+DWR +GAVTP+K Q CG CWAF+AVAAVEGITKI G L+ LSEQQL
Sbjct: 122 GNVS--DTGESMDWRQEGAVTPVKYQGRCGGCWAFSAVAAVEGITKITKGELVSLSEQQL 179
Query: 189 LDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
LDC T+ N GC GG KAF YII+NQGI
Sbjct: 180 LDCDTDYNQGCHGGIMSKAFEYIIKNQGI 208
Score = 97.1 bits (240), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 41/70 (58%), Positives = 53/70 (75%), Gaps = 4/70 (5%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCG 271
GIFNG CGT L HAVTIVG+G +E+G YW++KNSWG TWG+ G+M+I RD +G+CG
Sbjct: 278 GIFNGECGTDLHHAVTIVGYGMSEEGTKYWVVKNSWGETWGEDGFMRIKRDVDAPQGMCG 337
Query: 272 IGTRSSYPLA 281
+ + YPLA
Sbjct: 338 LAMLAFYPLA 347
>gi|144905108|dbj|BAF56428.1| cysteine proteinase [Lotus japonicus]
Length = 342
Score = 207 bits (526), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 113/253 (44%), Positives = 158/253 (62%), Gaps = 25/253 (9%)
Query: 11 FKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKI 70
I++ + ++ L A+ +R+ + S+ E HE+WM Q+G+ Y D EKE+R I
Sbjct: 7 LNISSLALLLVFGFLAFEAN----ARTLEDVSLKERHEQWMTQYGKVYTDSYEKELRSNI 62
Query: 71 FKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRAL--YTGYKMPSPSHRSTTSSTFKY 128
FKEN++ IE N GN+ YKLG NQF+DLTN+EF+A + G+ + ST + TFKY
Sbjct: 63 FKENVQRIEAFNNAGNKPYKLGINQFADLTNEEFKARNRFKGHMCSN----STRTPTFKY 118
Query: 129 QNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQL 188
+++S VP SLDWR KGAVTPIK+Q +CGCCWAF+AVAA EGITK+ +G LI LSEQ+L
Sbjct: 119 EDVS--SVPASLDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGITKLSTGKLISLSEQEL 176
Query: 189 LDCSTNG-NNGCLGGSREKAFAYIIQNQGI-------FNGVCGT-----QLDHAVTIVGF 235
+DC T G + GC GG + AF +I+QN+G+ + GV T + A +I GF
Sbjct: 177 VDCDTKGVDQGCEGGLMDDAFKFIMQNKGLNTEAKYPYQGVDATCNANAEAKDAASIKGF 236
Query: 236 GTTEDGANYWLIK 248
+ L+K
Sbjct: 237 EDVPANSESALLK 249
Score = 97.8 bits (242), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 40/70 (57%), Positives = 52/70 (74%), Gaps = 4/70 (5%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCG 271
G+F G CGT+LDH VT VG+G ++DG YWL+KNSWG WG+ GY+++ RD EGLCG
Sbjct: 273 GLFTGSCGTELDHGVTAVGYGVSDDGTKYWLVKNSWGEQWGEEGYIRMQRDVAAEEGLCG 332
Query: 272 IGTRSSYPLA 281
I ++SYP A
Sbjct: 333 IAMQASYPTA 342
>gi|400180447|gb|AFP73360.1| cysteine protease [Solanum chilense]
Length = 345
Score = 207 bits (526), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 103/212 (48%), Positives = 144/212 (67%), Gaps = 6/212 (2%)
Query: 10 SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+ K++ + I + ++S + RS + SV E HE WM++HGR YKDE+EK R
Sbjct: 2 AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N+F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFK 121
Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
K +LS D+P++LDWR+ GAVT +K+Q +CGCCWAF+AV ++EG KI +G L++ SE
Sbjct: 122 -KINDLSDDDMPSNLDWRESGAVTQVKHQGQCGCCWAFSAVGSLEGAYKIATGKLMEFSE 180
Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
Q+LLDC+TN N GC GG AF +II+N GI
Sbjct: 181 QELLDCTTN-NYGCNGGFMTNAFDFIIENGGI 211
Score = 86.7 bits (213), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 36/68 (52%), Positives = 48/68 (70%), Gaps = 4/68 (5%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCG 271
G ++G C +++HAVT +G+GT E G YWL+KNSWG +WG+ G+MKI+RD GLC
Sbjct: 275 GTYDGSCADRINHAVTAIGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCD 334
Query: 272 IGTRSSYP 279
I SSYP
Sbjct: 335 IAKMSSYP 342
>gi|400180363|gb|AFP73320.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 207 bits (526), Expect = 6e-51, Method: Compositional matrix adjust.
Identities = 103/212 (48%), Positives = 143/212 (67%), Gaps = 7/212 (3%)
Query: 10 SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+ K++ + I + ++S + RS + SV E HE WM++HGR YKDE+EK R
Sbjct: 2 AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N+F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
K +LS D+P++LDWR+ GAVT +K+Q CGCCWAF+AV ++EG KI +GNL++ SE
Sbjct: 120 LKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
Q+LLDC+TN N GC GG AF +I +N GI
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIKENGGI 210
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 36/68 (52%), Positives = 48/68 (70%), Gaps = 4/68 (5%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCG 271
G ++G C +++HAVT +G+GT E G YWL+KNSWG +WG+ G+MKI+RD GLC
Sbjct: 274 GTYDGSCADRINHAVTAIGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDYGNPAGLCD 333
Query: 272 IGTRSSYP 279
I SSYP
Sbjct: 334 IAKMSSYP 341
>gi|357507617|ref|XP_003624097.1| Cysteine protease [Medicago truncatula]
gi|355499112|gb|AES80315.1| Cysteine protease [Medicago truncatula]
Length = 340
Score = 207 bits (526), Expect = 6e-51, Method: Compositional matrix adjust.
Identities = 120/308 (38%), Positives = 168/308 (54%), Gaps = 77/308 (25%)
Query: 42 SVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTN 101
++ E ++ W ++ YKD+ E+E ++IFK N+ YI+ N GN++YKL N+F+DL
Sbjct: 34 TLSERYKHWKIKYRVIYKDDAEEEKHIQIFKHNVAYIDSFNAAGNKSYKLTINRFADLPT 93
Query: 102 DEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCW 161
+ + K+ TTSS FKY+N+ TD+P ++DWR +GAVTP+KNQ+ECG CW
Sbjct: 94 EPSDDGFKKRKL-----EPTTSSLFKYKNI--TDIPAAVDWRKRGAVTPVKNQRECGSCW 146
Query: 162 AFAAVAAVEGITKIRSGNLIQLSEQQLLDCS----TNG-NNGCL-----------GGSRE 205
AF+AV A+EGI +I SGNL+ LSEQ+L+D TNG N G L G + E
Sbjct: 147 AFSAVGALEGIQQITSGNLVSLSEQELVDRVRSNWTNGCNGGYLIDAFEFVLENGGIATE 206
Query: 206 KAFAY---------------------------------IIQNQ----------------- 215
++ Y ++ NQ
Sbjct: 207 ASYPYRGVKGNNSKKVSRQVQIKSYEQVPRNSEDSLLKVVANQPVSVGIDISGMIRFYSS 266
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCG 271
GIF G CGT+ +HAV IVG+GT+ DG YWL+KNSWG WG+ Y+++ RD EGLCG
Sbjct: 267 GIFTGECGTKPNHAVIIVGYGTSNDGTKYWLVKNSWGIRWGEKRYIRMKRDIDAKEGLCG 326
Query: 272 IGTRSSYP 279
I +SYP
Sbjct: 327 IPMDASYP 334
>gi|30141021|dbj|BAC75924.1| cysteine protease-2 [Helianthus annuus]
Length = 362
Score = 207 bits (526), Expect = 6e-51, Method: Compositional matrix adjust.
Identities = 114/317 (35%), Positives = 167/317 (52%), Gaps = 82/317 (25%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
E ++ +++E+W + ++ ++L R +FK N+ ++ + NK ++ YKL N+F+D+
Sbjct: 33 EDNLWDMYERWRHKVATNHGEKLR---RFNVFKSNVLHVHETNKM-DKPYKLKLNKFADM 88
Query: 100 TNDEFRALYTGYKMP----SPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQK 155
TN EFR++Y G K+ S + S TF Y N+ VPTS+DWR KGAV P+K+Q
Sbjct: 89 TNHEFRSVYAGSKIHHHDRSLQGDRSGSKTFMYANVE--SVPTSVDWRKKGAVAPVKDQG 146
Query: 156 ECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGG------------- 202
+CG CWAF+ VAAVEGI KI++ L+ LSEQ+L+DC T N GC GG
Sbjct: 147 QCGSCWAFSTVAAVEGINKIKTNELVSLSEQELVDCDTLENQGCNGGLMDLAFDFIKKTG 206
Query: 203 --SREKAFAYIIQN---------------------------------------------- 214
+RE A+ Y ++
Sbjct: 207 GLTREDAYPYAAEDGKCDSNKMNSPVVSIDGHEDVPKNDEQSLMKAVANQPVAVAIDAGS 266
Query: 215 -------QGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR-- 265
+G+F G CGTQLDH V VG+GTT DG YW+++NSWG+ WG+ GY+++ R
Sbjct: 267 SDFQFYSEGVFTGKCGTQLDHGVAAVGYGTTLDGTKYWIVRNSWGSEWGEKGYIRMERGI 326
Query: 266 --DEGLCGIGTRSSYPL 280
GLCGI +SYP+
Sbjct: 327 SDKRGLCGIAMEASYPI 343
>gi|400180349|gb|AFP73313.1| cysteine protease [Solanum peruvianum]
gi|400180469|gb|AFP73371.1| cysteine protease [Solanum peruvianum]
gi|400180471|gb|AFP73372.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 207 bits (526), Expect = 6e-51, Method: Compositional matrix adjust.
Identities = 103/212 (48%), Positives = 143/212 (67%), Gaps = 7/212 (3%)
Query: 10 SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+ K++ + I + ++S + RS + SV E HE WM++HGR YKDE+EK R
Sbjct: 2 AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKVERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N+F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
K +LS D+P++LDWR+ GAVT +K+Q CGCCWAF+AV ++EG KI +GNL++ SE
Sbjct: 120 LKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
Q+LLDC+TN N GC GG AF +I +N GI
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIKENGGI 210
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 36/68 (52%), Positives = 48/68 (70%), Gaps = 4/68 (5%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCG 271
G ++G C +++HAVT +G+GT E G YWL+KNSWG +WG+ G+MKI+RD GLC
Sbjct: 274 GTYDGSCADRINHAVTAIGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDYGNPAGLCD 333
Query: 272 IGTRSSYP 279
I SSYP
Sbjct: 334 IAKMSSYP 341
>gi|242072388|ref|XP_002446130.1| hypothetical protein SORBIDRAFT_06g002130 [Sorghum bicolor]
gi|241937313|gb|EES10458.1| hypothetical protein SORBIDRAFT_06g002130 [Sorghum bicolor]
Length = 276
Score = 207 bits (526), Expect = 6e-51, Method: Compositional matrix adjust.
Identities = 105/259 (40%), Positives = 149/259 (57%), Gaps = 54/259 (20%)
Query: 72 KENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNL 131
++N+ ++E N N + LG NQF+DLT +EF+A G+K S TT FKY+NL
Sbjct: 19 RDNVAFVESFNANKNNKFWLGVNQFADLTTEEFKA-NKGFKPTSAEKVPTTG--FKYENL 75
Query: 132 SMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDC 191
S++ +PT++DWR KGAVTPIKNQ +CGCCWAF+AVAA+EGI K+ +GNLI LS+Q+L+DC
Sbjct: 76 SVSALPTAVDWRTKGAVTPIKNQGQCGCCWAFSAVAAMEGIVKLSTGNLISLSKQELVDC 135
Query: 192 STNG--------------NNGCLGGSR---------------EKAFAYIIQNQ------- 215
T+ + C GGS+ E A + NQ
Sbjct: 136 DTHSMDEGCEVQLPYKAVDGKCKGGSKSAATIKGHEDVPVNNEAALMKAVANQPVSVAVD 195
Query: 216 -----------GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIV 264
G+ G CGT+LDH + +G+G DG YW++KNSWG TWG+ G++++
Sbjct: 196 ASDRTFMLYSGGVMTGSCGTELDHGIAAIGYGMESDGTKYWILKNSWGTTWGEKGFLRME 255
Query: 265 RD----EGLCGIGTRSSYP 279
+D G+CG+ + SYP
Sbjct: 256 KDITDKRGMCGLAMKPSYP 274
>gi|400180399|gb|AFP73338.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 206 bits (525), Expect = 8e-51, Method: Compositional matrix adjust.
Identities = 103/212 (48%), Positives = 143/212 (67%), Gaps = 7/212 (3%)
Query: 10 SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+ K++ + I + ++S + RS + SV E HE WM++HGR YKDE+EK R
Sbjct: 2 AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N+F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
F +LS D+P++LDWR+ GAVT +K+Q CGCCWAF+AV ++EG KI +GNL++ SE
Sbjct: 120 FIINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
Q+LLDC+TN N GC GG AF +I +N GI
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIKENGGI 210
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 36/68 (52%), Positives = 48/68 (70%), Gaps = 4/68 (5%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCG 271
G ++G C +++HAVT +G+GT E G YWL+KNSWG +WG+ G+MKI+RD GLC
Sbjct: 274 GTYDGSCADRINHAVTAIGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGDPSGLCD 333
Query: 272 IGTRSSYP 279
I SSYP
Sbjct: 334 IAKMSSYP 341
>gi|400180461|gb|AFP73367.1| cysteine protease [Solanum peruvianum]
gi|400180473|gb|AFP73373.1| cysteine protease [Solanum peruvianum]
gi|400180475|gb|AFP73374.1| cysteine protease [Solanum peruvianum]
gi|400180479|gb|AFP73376.1| cysteine protease [Solanum peruvianum]
gi|400180481|gb|AFP73377.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 206 bits (524), Expect = 8e-51, Method: Compositional matrix adjust.
Identities = 103/212 (48%), Positives = 143/212 (67%), Gaps = 7/212 (3%)
Query: 10 SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+ K++ + I + ++S + RS + SV E HE WM++HGR YKDE+EK R
Sbjct: 2 AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N+F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
F +LS D+P++LDWR+ GAVT +K+Q CGCCWAF+AV ++EG KI +GNL++ SE
Sbjct: 120 FIINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
Q+LLDC+TN N GC GG AF +I +N GI
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIKENGGI 210
Score = 86.3 bits (212), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 36/68 (52%), Positives = 48/68 (70%), Gaps = 4/68 (5%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCG 271
G ++G C +++HAVT +G+GT E G YWL+KNSWG +WG+ G+MKI+RD GLC
Sbjct: 274 GTYDGSCADRINHAVTAIGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDYGNPAGLCD 333
Query: 272 IGTRSSYP 279
I SSYP
Sbjct: 334 IAKMSSYP 341
>gi|400180371|gb|AFP73324.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 206 bits (524), Expect = 8e-51, Method: Compositional matrix adjust.
Identities = 103/212 (48%), Positives = 143/212 (67%), Gaps = 7/212 (3%)
Query: 10 SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+ K++ + I + ++S + RS + SV E HE WM++HGR YKDE+EK R
Sbjct: 2 AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N+F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
F +LS D+P++LDWR+ GAVT +K+Q CGCCWAF+AV ++EG KI +GNL++ SE
Sbjct: 120 FIINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
Q+LLDC+TN N GC GG AF +I +N GI
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIKENGGI 210
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 36/68 (52%), Positives = 48/68 (70%), Gaps = 4/68 (5%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCG 271
G ++G C +++HAVT +G+GT E G YWL+KNSWG +WG+ G+MKI+RD GLC
Sbjct: 274 GTYDGSCADRINHAVTAIGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCD 333
Query: 272 IGTRSSYP 279
I SSYP
Sbjct: 334 IAKMSSYP 341
>gi|18401420|ref|NP_565649.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|4314384|gb|AAD15594.1| cysteine proteinase [Arabidopsis thaliana]
gi|17381154|gb|AAL36389.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|20465849|gb|AAM20029.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|330252901|gb|AEC07995.1| cysteine proteinase-like protein [Arabidopsis thaliana]
Length = 348
Score = 206 bits (524), Expect = 9e-51, Method: Compositional matrix adjust.
Identities = 106/209 (50%), Positives = 138/209 (66%), Gaps = 9/209 (4%)
Query: 16 TPMFIIITLLVSCASQVVSSR-STHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKEN 74
+ + I+T+ +S + + +SR S E S +E HE+WMA+ R Y DE EK R IFK+N
Sbjct: 3 STIIFILTIFLSYRTSLATSRGSLFEASAIEKHEQWMARFNRVYSDETEKRNRFNIFKKN 62
Query: 75 LEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSST------FKY 128
LE+++ N TYK+ N+FSDLT++EFRA +TG +P R +T S+ F+Y
Sbjct: 63 LEFVQNFNMNNKITYKVDINEFSDLTDEEFRATHTGLVVPEAITRISTLSSGKNTVPFRY 122
Query: 129 QNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQL 188
N+S D S+DWR +GAVTP+K Q CG CWAF+AVAAVEGITKI G L+ LSEQQL
Sbjct: 123 GNVS--DNGESMDWRQEGAVTPVKYQGRCGGCWAFSAVAAVEGITKITKGELVSLSEQQL 180
Query: 189 LDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
LDC + N GC GG KAF YII+NQGI
Sbjct: 181 LDCDRDYNQGCRGGIMSKAFEYIIKNQGI 209
Score = 97.8 bits (242), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 41/70 (58%), Positives = 53/70 (75%), Gaps = 4/70 (5%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCG 271
G+FNG CGT L HAVTIVG+G +E+G YW++KNSWG TWG+ GYM+I RD +G+CG
Sbjct: 279 GVFNGECGTDLHHAVTIVGYGMSEEGTKYWVVKNSWGETWGENGYMRIKRDVDAPQGMCG 338
Query: 272 IGTRSSYPLA 281
+ + YPLA
Sbjct: 339 LAILAFYPLA 348
>gi|212275830|ref|NP_001130503.1| cysteine protease 1 [Zea mays]
gi|194689328|gb|ACF78748.1| unknown [Zea mays]
gi|219886279|gb|ACL53514.1| unknown [Zea mays]
gi|238010470|gb|ACR36270.1| unknown [Zea mays]
gi|413920875|gb|AFW60807.1| cysteine protease 1 [Zea mays]
Length = 354
Score = 206 bits (523), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 99/208 (47%), Positives = 146/208 (70%), Gaps = 10/208 (4%)
Query: 18 MFIIITLLVSCASQVVSSRSTH---EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKEN 74
+ I+ + ++ +SS ST E+++ H++WMA+HGR+Y+DE EK R ++FK N
Sbjct: 19 LTILAVTTMMAEARDLSSTSTGGYGEEAMKVRHQQWMAEHGRTYRDEAEKAHRFQVFKAN 78
Query: 75 LEYIEKANKEGN--RTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
++++ +N G+ ++Y+L N+F+D+TNDEF A+YTG + P P+ + FKY N++
Sbjct: 79 ADFVDASNAAGDDKKSYRLELNEFADMTNDEFMAMYTGLR-PVPAGAKKMAG-FKYGNVT 136
Query: 133 MTDVPT---SLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLL 189
++D ++DWR KGAVT IKNQ +CGCCWAFAAVAAVEGI +I +GNL+ LSEQQ+L
Sbjct: 137 LSDADDDQQTVDWRQKGAVTGIKNQGQCGCCWAFAAVAAVEGIHQITTGNLVSLSEQQVL 196
Query: 190 DCSTNGNNGCLGGSREKAFAYIIQNQGI 217
DC T+GNNGC GG + AF YI+ N G+
Sbjct: 197 DCDTDGNNGCNGGYIDNAFQYIVGNGGL 224
Score = 78.6 bits (192), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 30/56 (53%), Positives = 41/56 (73%)
Query: 226 LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDEGLCGIGTRSSYPLA 281
L+HAVT VG+GT EDG YWL+KN WG WG+ GY+++ R CG+ ++SYP+A
Sbjct: 299 LNHAVTAVGYGTAEDGTPYWLLKNQWGQNWGEGGYLRLERGANACGVAQQASYPVA 354
>gi|400180465|gb|AFP73369.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 206 bits (523), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 103/212 (48%), Positives = 143/212 (67%), Gaps = 7/212 (3%)
Query: 10 SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+ K++ + I + ++S + RS + SV E HE WM++HGR YKDE+EK R
Sbjct: 2 AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N+F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
FK +LS D+P++LDWR+ GAVT +K+Q CGCCWAF+AV ++E KI +GNL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEVAYKIATGNLMEFSE 179
Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
Q+LLDC+TN N GC GG AF +I +N GI
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIKENGGI 210
Score = 87.0 bits (214), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 36/68 (52%), Positives = 48/68 (70%), Gaps = 4/68 (5%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE----GLCG 271
G ++G C +++HAVT +G+GT E G YWL+KNSWG +WG+ G+MKI+RD GLC
Sbjct: 274 GTYDGSCADRINHAVTAIGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPAGLCD 333
Query: 272 IGTRSSYP 279
I SSYP
Sbjct: 334 IAKMSSYP 341
>gi|357474579|ref|XP_003607574.1| Cysteine protease [Medicago truncatula]
gi|355508629|gb|AES89771.1| Cysteine protease [Medicago truncatula]
Length = 345
Score = 205 bits (522), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 107/202 (52%), Positives = 141/202 (69%), Gaps = 14/202 (6%)
Query: 33 VSSRSTHEQSVV-EIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGN-RTYK 90
V+SR+ + S++ E HE+WM +G+ YKD E+E RLKIFKEN+ YIE +N GN + YK
Sbjct: 26 VTSRTLQDDSIIYEKHEQWMVHYGKVYKDLQERENRLKIFKENVNYIEASNNAGNNKLYK 85
Query: 91 LGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTP 150
LG NQF+D+TN+EF A +K S T +STFKY+N S VP+++DWR KGAVTP
Sbjct: 86 LGINQFADITNEEFIASRNKFKGHMCS-SITKTSTFKYENAS---VPSTVDWRKKGAVTP 141
Query: 151 IKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFA 209
+KNQ +CGCCWAF+AVAA EGI K+ +G L+ LSEQ+L+DC T G + GC GG + AF
Sbjct: 142 VKNQGQCGCCWAFSAVAATEGIHKLSTGKLVSLSEQELVDCDTKGVDQGCEGGLMDDAFK 201
Query: 210 YIIQNQGI-------FNGVCGT 224
+IIQN G+ + GV GT
Sbjct: 202 FIIQNHGLHTEAQYPYQGVDGT 223
Score = 95.1 bits (235), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 40/70 (57%), Positives = 50/70 (71%), Gaps = 4/70 (5%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCG 271
G+F G CGTQLDH VT VG+G + DG YWL+KNSWGN WG+ GY+++ R +GLCG
Sbjct: 276 GVFTGSCGTQLDHGVTAVGYGISNDGTKYWLVKNSWGNDWGEEGYIRMQRSVDAAQGLCG 335
Query: 272 IGTRSSYPLA 281
I +SYP A
Sbjct: 336 IAMMASYPTA 345
>gi|357477459|ref|XP_003609015.1| Cysteine proteinase [Medicago truncatula]
gi|355510070|gb|AES91212.1| Cysteine proteinase [Medicago truncatula]
Length = 345
Score = 205 bits (521), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 114/231 (49%), Positives = 149/231 (64%), Gaps = 18/231 (7%)
Query: 19 FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
I L CA QV +SRS S+ E HE+WM+Q+ + YKD E+E R KIF N+ YI
Sbjct: 13 LTFIFCLGLCAIQV-TSRSLQVDSMYERHEQWMSQYSKVYKDPQEREERHKIFTANVNYI 71
Query: 79 EKANKEGN-RTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
E N + N + YKLG NQF+DLTN+EF A +K S + T+ TFKY+N+S +P
Sbjct: 72 EVFNNDANNKLYKLGINQFADLTNEEFIASRNKFKGHMCSSIAKTT-TFKYENVSA--IP 128
Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-N 196
+++DWR KGAVTP+KNQ +CGCCWAF+AVAA EGITK+ +G L+ LSEQ+L+DC T G +
Sbjct: 129 STVDWRKKGAVTPVKNQGQCGCCWAFSAVAATEGITKLSTGKLVSLSEQELVDCDTKGVD 188
Query: 197 NGCLGGSREKAFAYIIQNQGI-------FNGVCGT-----QLDHAVTIVGF 235
GC GG + AF +IIQN G+ + GV GT HA TI G+
Sbjct: 189 QGCEGGLMDDAFKFIIQNHGLSTEAAYPYQGVDGTCNANKASIHAATITGY 239
>gi|225446581|ref|XP_002280246.1| PREDICTED: vignain [Vitis vinifera]
Length = 341
Score = 205 bits (521), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 104/201 (51%), Positives = 144/201 (71%), Gaps = 9/201 (4%)
Query: 19 FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
++ +L + ASQ ++RS HE S+ E HE WM Q+GR YKD EK R KIFK+N+ I
Sbjct: 12 LALLFVLAAWASQA-TARSLHEASMYERHEDWMVQYGREYKDADEKSKRYKIFKDNVARI 70
Query: 79 EKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHR-STTSSTFKYQNLSMTDVP 137
E NK +++YKL N+F+DLTN+EFRA +K +H ST +++FKY+N+ T VP
Sbjct: 71 ESFNKAMDKSYKLSINEFADLTNEEFRASRNRFK----AHICSTEATSFKYENV--TAVP 124
Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-N 196
+++DWR KGAVTPIK+Q +CG CWAF+AVAA+EGIT++ +G LI LSEQ+L+DC T+G +
Sbjct: 125 STVDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGED 184
Query: 197 NGCLGGSREKAFAYIIQNQGI 217
GC GG + AF +I QN G+
Sbjct: 185 QGCSGGLMDDAFKFIEQNHGL 205
Score = 95.1 bits (235), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 39/70 (55%), Positives = 51/70 (72%), Gaps = 4/70 (5%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCG 271
G+F G CGT+LDH V VG+GT++DG YWL+KNSW WG+ GY+++ RD EGLCG
Sbjct: 272 GVFTGQCGTELDHGVAAVGYGTSDDGMKYWLVKNSWSTGWGEEGYIRMQRDVTAKEGLCG 331
Query: 272 IGTRSSYPLA 281
I ++SYP A
Sbjct: 332 IAMQASYPTA 341
>gi|18418684|ref|NP_567983.1| Xylem cysteine proteinase 1 [Arabidopsis thaliana]
gi|71153408|sp|O65493.1|XCP1_ARATH RecName: Full=Xylem cysteine proteinase 1; Short=AtXCP1; Flags:
Precursor
gi|6708181|gb|AAF25831.1|AF191027_1 papain-type cysteine endopeptidase XCP1 [Arabidopsis thaliana]
gi|3080415|emb|CAA18734.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|7270487|emb|CAB80252.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|26449881|dbj|BAC42063.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|28827736|gb|AAO50712.1| unknown protein [Arabidopsis thaliana]
gi|332661101|gb|AEE86501.1| Xylem cysteine proteinase 1 [Arabidopsis thaliana]
Length = 355
Score = 205 bits (521), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 116/313 (37%), Positives = 169/313 (53%), Gaps = 74/313 (23%)
Query: 38 THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
T+ ++E+ E WM++H ++YK EK R ++F+ENL +I++ N E N +Y LG N+F+
Sbjct: 42 TNTDKLLELFESWMSEHSKAYKSVEEKVHRFEVFRENLMHIDQRNNEIN-SYWLGLNEFA 100
Query: 98 DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKEC 157
DLT++EF+ Y G P S + S+ F+Y+++ TD+P S+DWR KGAV P+K+Q +C
Sbjct: 101 DLTHEEFKGRYLGLAKPQFSRKRQPSANFRYRDI--TDLPKSVDWRKKGAVAPVKDQGQC 158
Query: 158 GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
G CWAF+ VAAVEGI +I +GNL LSEQ+L+DC T N+GC GG + AF YII G+
Sbjct: 159 GSCWAFSTVAAVEGINQITTGNLSSLSEQELIDCDTTFNSGCNGGLMDYAFQYIISTGGL 218
Query: 218 FN----------GVCGTQLDHA--VTIVGF------------------------------ 235
G+C Q + VTI G+
Sbjct: 219 HKEDDYPYLMEEGICQEQKEDVERVTISGYEDVPENDDESLVKALAHQPVSVAIEASGRD 278
Query: 236 -------------GTTED------------GANYWLIKNSWGNTWGDAGYMKIVRD---- 266
GT D G++Y ++KNSWG WG+ G++++ R+
Sbjct: 279 FQFYKGGVFNGKCGTDLDHGVAAVGYGSSKGSDYVIVKNSWGPRWGEKGFIRMKRNTGKP 338
Query: 267 EGLCGIGTRSSYP 279
EGLCGI +SYP
Sbjct: 339 EGLCGINKMASYP 351
>gi|357467173|ref|XP_003603871.1| Cysteine proteinase [Medicago truncatula]
gi|355492919|gb|AES74122.1| Cysteine proteinase [Medicago truncatula]
gi|388499154|gb|AFK37643.1| unknown [Medicago truncatula]
Length = 350
Score = 205 bits (521), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 114/309 (36%), Positives = 174/309 (56%), Gaps = 77/309 (24%)
Query: 43 VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTND 102
++E+ E WM++HG+ Y+ EK +R ++FK+NL++I+ NK + Y LG N+F+DL++
Sbjct: 43 LIELFESWMSRHGKIYETIEEKLLRFEVFKDNLKHIDDRNKVVS-NYWLGLNEFADLSHQ 101
Query: 103 EFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWA 162
EF+ Y G K+ R ++ F Y+++ D+P S+DWR KGAVTP+KNQ +CG CWA
Sbjct: 102 EFKNKYLGLKVDLSQRRESSEEEFTYRDV---DLPKSVDWRKKGAVTPVKNQGQCGSCWA 158
Query: 163 FAAVAAVEGITKIRSGNLIQLSEQQLLDCST---NGNNGCL------------GGSREKA 207
F+ VAAVEGI +I +GNL LSEQ+L+DC T NG NG L G +E+
Sbjct: 159 FSTVAAVEGINQIVTGNLTSLSEQELIDCDTTYNNGCNGGLMDYAFSFIVKNGGLHKEED 218
Query: 208 FAYIIQ-----------------------------------NQ----------------- 215
+ YI++ NQ
Sbjct: 219 YPYIMEESTCEMKKEVSEVVTINGYHDVPQNNEQSLLKALANQPLSVAIEASGRDFQFYS 278
Query: 216 -GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLC 270
G+F+G CG++LDH V+ VG+GT++ G +Y ++KNSWG WG+ G++++ R+ EG+C
Sbjct: 279 GGVFDGHCGSELDHGVSAVGYGTSK-GLDYIIVKNSWGAKWGEKGFIRMKRNIGKSEGIC 337
Query: 271 GIGTRSSYP 279
G+ +SYP
Sbjct: 338 GLYKMASYP 346
>gi|413917937|gb|AFW57869.1| hypothetical protein ZEAMMB73_830006 [Zea mays]
Length = 443
Score = 204 bits (520), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 100/212 (47%), Positives = 143/212 (67%), Gaps = 11/212 (5%)
Query: 16 TPMFIIITLLV-SCA---SQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIF 71
+ F++++++ +CA S + +Q++V HE+WMA++ R Y D EK R ++F
Sbjct: 6 SSAFVLLSVVAWACALSGSLAARDLADQDQAMVARHEEWMAKYDRVYSDAAEKARRFEVF 65
Query: 72 KENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYK----MPSPSHRSTTSST-F 126
K N+ IE N GN + L N+F+DLT+DEFRA +TGY+ S RS T++T F
Sbjct: 66 KANMALIESVNA-GNHKFWLEANRFADLTDDEFRATWTGYRPKTAAASSKGRSRTATTGF 124
Query: 127 KYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQ 186
KY N+S+ DVP S+DWR KGAVTPIKNQ ECGCCWAF+AVA++EG+ K+ +G L+ LSEQ
Sbjct: 125 KYANVSLDDVPASVDWRTKGAVTPIKNQGECGCCWAFSAVASMEGVVKLSTGKLVSLSEQ 184
Query: 187 QLLDCSTNG-NNGCLGGSREKAFAYIIQNQGI 217
+L+DC NG + GC GG + AF +I+ N G+
Sbjct: 185 ELVDCDVNGMDQGCEGGEMDDAFDFIVGNGGL 216
Score = 76.6 bits (187), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 27/51 (52%), Positives = 39/51 (76%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD 266
G+ +G CGT+LDH + VG+G DG YW++KNSWG +WG+AGY+++ RD
Sbjct: 283 GVLSGACGTELDHGIAAVGYGVASDGTKYWVMKNSWGTSWGEAGYIRMERD 333
>gi|356543038|ref|XP_003539970.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 204 bits (520), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 102/188 (54%), Positives = 134/188 (71%), Gaps = 8/188 (4%)
Query: 33 VSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLG 92
V+SR+ + S+ E HE+WMA++ + YKD E+E R KIFKEN+ YIE N N+ YKLG
Sbjct: 25 VTSRTLQDASMYERHEEWMARYAKVYKDPEEREKRFKIFKENVNYIEAFNNAANKPYKLG 84
Query: 93 TNQFSDLTNDEFRALYTGYK--MPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTP 150
NQF+DLTN+EF A +K M S R+TT FKY+N+ T +P+++DWR KGAVTP
Sbjct: 85 INQFADLTNEEFIAPRNRFKGHMCSSITRTTT---FKYENV--TALPSTVDWRQKGAVTP 139
Query: 151 IKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFA 209
IK+Q +CGCCWAF+AVAA EGI + SG LI LSEQ+++DC T G + GC GG + AF
Sbjct: 140 IKDQGQCGCCWAFSAVAATEGIHALNSGKLISLSEQEVVDCDTKGEDQGCAGGFMDGAFK 199
Query: 210 YIIQNQGI 217
+IIQN G+
Sbjct: 200 FIIQNHGL 207
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 40/70 (57%), Positives = 48/70 (68%), Gaps = 4/70 (5%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR----DEGLCG 271
G+F G CGTQLDH VT VG+G + DG YWL+KNSWG WG+ GY+ + R EGLCG
Sbjct: 274 GVFTGSCGTQLDHGVTAVGYGVSADGTQYWLVKNSWGTEWGEEGYIMMQRGVKAQEGLCG 333
Query: 272 IGTRSSYPLA 281
I +SYP A
Sbjct: 334 IAMMASYPTA 343
>gi|297802418|ref|XP_002869093.1| hypothetical protein ARALYDRAFT_491113 [Arabidopsis lyrata subsp.
lyrata]
gi|297314929|gb|EFH45352.1| hypothetical protein ARALYDRAFT_491113 [Arabidopsis lyrata subsp.
lyrata]
Length = 355
Score = 204 bits (519), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 116/313 (37%), Positives = 168/313 (53%), Gaps = 74/313 (23%)
Query: 38 THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
T + ++E+ E WM++H + YK EK R ++F+ENL +I++ N E N +Y LG N+F+
Sbjct: 42 TSTEKLLELFESWMSEHSKVYKSVEEKVHRFEVFRENLMHIDQRNNEIN-SYWLGLNEFA 100
Query: 98 DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKEC 157
DLT++EF+ Y G P S + S+ F+Y+++ TD+P S+DWR KGAV P+K+Q +C
Sbjct: 101 DLTHEEFKGRYLGLAKPQFSRKRQPSANFRYRDI--TDLPKSVDWRKKGAVAPVKDQGQC 158
Query: 158 GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
G CWAF+ VAAVEGI +I +GNL LSEQ+L+DC T N+GC GG + AF YII G+
Sbjct: 159 GSCWAFSTVAAVEGINQITTGNLSSLSEQELIDCDTTFNSGCNGGLMDYAFQYIISTGGL 218
Query: 218 FN----------GVCGTQLDHA--VTIVGF------------------------------ 235
G+C Q + VTI G+
Sbjct: 219 HKEDDYPYLMEEGICQEQKEDVERVTISGYEDVPENDDESLVKALAHQPVSVAIEASGRD 278
Query: 236 -------------GTTED------------GANYWLIKNSWGNTWGDAGYMKIVRD---- 266
GT D G++Y ++KNSWG WG+ G++++ R+
Sbjct: 279 FQFYKGGVFNGQCGTDLDHGVAAVGYGSSKGSDYVIVKNSWGPRWGEKGFIRMKRNTGKP 338
Query: 267 EGLCGIGTRSSYP 279
EGLCGI +SYP
Sbjct: 339 EGLCGINKMASYP 351
>gi|40806498|gb|AAR92154.1| putative cysteine protease 1 [Iris x hollandica]
Length = 340
Score = 204 bits (519), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 114/239 (47%), Positives = 153/239 (64%), Gaps = 24/239 (10%)
Query: 13 INTTPMFIIITLLVSC--ASQVVSSRSTHE-QSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
T + + LL+ ASQ + RS E +S++E HE+WMAQHGR YK+ EK R +
Sbjct: 4 FKTVKLLPALALLIVAIWASQGEAGRSLGENKSMLERHEQWMAQHGRVYKNAAEKAHRFE 63
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQ 129
IF+ N+E IE N E N +KLG NQF+DLTN+EF+ T PS ++T S FKY+
Sbjct: 64 IFRANVERIESFNAE-NHKFKLGVNQFADLTNEEFKTRNT----LKPSKMASTKS-FKYE 117
Query: 130 NLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLL 189
N+ T VP ++DWR KGAVTPIK+Q +CG CWAF+AVAA EGITK+ +G LI LSEQ+++
Sbjct: 118 NV--TAVPATMDWRTKGAVTPIKDQGQCGSCWAFSAVAATEGITKLSTGKLISLSEQEVV 175
Query: 190 DCS-TNGNNGCLGGSREKAFAYIIQNQGI----------FNGVCGTQ--LDHAVTIVGF 235
DC T+ + GC GG + AF YII+N+GI +G C T+ HA +I G+
Sbjct: 176 DCDVTSDDQGCNGGEMDDAFEYIIKNKGITTEANYPYKAADGTCNTKKAASHAASITGY 234
Score = 102 bits (254), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 48/122 (39%), Positives = 71/122 (58%), Gaps = 14/122 (11%)
Query: 164 AAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIFNGVCG 223
A++ E +T L++ + Q + + + + FA+ + + G+F G CG
Sbjct: 229 ASITGYEDVTVNSEAALLKAAANQPIAVAIDAGD----------FAFQMYSSGVFTGDCG 278
Query: 224 TQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 279
T LDH VT+VG+G T DG YWL+KNSWG +WG+ GY+++ RD EGLCGI +SYP
Sbjct: 279 TDLDHGVTLVGYGATSDGTKYWLVKNSWGTSWGEDGYIRMERDVDAKEGLCGIAMDASYP 338
Query: 280 LA 281
A
Sbjct: 339 TA 340
>gi|118627554|emb|CAL64936.1| putative cysteine protease 8 [Trifolium pratense]
Length = 344
Score = 204 bits (519), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 110/219 (50%), Positives = 147/219 (67%), Gaps = 21/219 (9%)
Query: 33 VSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEK-ANKEGNRTYKL 91
V+SR+ + S+ E H +WM+Q+G+ YKD E+E R KIFKEN+ YIE N + ++YKL
Sbjct: 25 VTSRTLQDDSMYERHGQWMSQYGKIYKDHQERETRFKIFKENVNYIETFNNADDTKSYKL 84
Query: 92 GTNQFSDLTNDEFRALYTGYK--MPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVT 149
G NQF+DLTN+EF A +K M S R+T+ FKY+N+S +P+++DWR KGAVT
Sbjct: 85 GINQFADLTNEEFIASRNKFKGHMCSSIMRTTS---FKYENVS--GIPSTVDWRKKGAVT 139
Query: 150 PIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAF 208
P+KNQ +CGCCWAF+AVAA EGI K+ +G LI LSEQ+L+DC T G + GC GG + AF
Sbjct: 140 PVKNQGQCGCCWAFSAVAATEGIHKLSTGKLISLSEQELVDCDTKGVDQGCEGGLMDDAF 199
Query: 209 AYIIQNQGI-------FNGVCGT-----QLDHAVTIVGF 235
+IIQN G+ + GV GT AVTI G+
Sbjct: 200 KFIIQNHGLSTEAQYPYEGVDGTCNANKASVQAVTITGY 238
Score = 91.7 bits (226), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 38/70 (54%), Positives = 49/70 (70%), Gaps = 4/70 (5%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCG 271
G+F G CGT+LDH VT VG+G + DG YWL+KNSWG WG+ GY+ + R EG+CG
Sbjct: 275 GVFTGACGTELDHGVTAVGYGVSNDGTKYWLVKNSWGTDWGEEGYIMMQRGIEAAEGICG 334
Query: 272 IGTRSSYPLA 281
I ++SYP A
Sbjct: 335 IAMQASYPTA 344
>gi|242092702|ref|XP_002436841.1| hypothetical protein SORBIDRAFT_10g009840 [Sorghum bicolor]
gi|241915064|gb|EER88208.1| hypothetical protein SORBIDRAFT_10g009840 [Sorghum bicolor]
Length = 328
Score = 204 bits (519), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 120/335 (35%), Positives = 165/335 (49%), Gaps = 89/335 (26%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
+ I+ L C + + + + ++V HE+WM Q+ R YKD EK R ++FK N+++
Sbjct: 8 ILAILGLAFFCGAALAARDLNDDSAMVARHEQWMVQYSRVYKDTTEKARRFEVFKANVKF 67
Query: 78 IEKANKEGNRTYKLGTNQFSDLTNDEFRALYT--GYKMPSPSHRSTTSSTFKYQNLSMTD 135
IE N GNR + LG NQF+DLTNDEFRA T G+K PSP ST F+Y+N+S+
Sbjct: 68 IESFNAGGNRKFWLGVNQFADLTNDEFRATKTNKGFK-PSPVKVSTG---FRYENVSVDA 123
Query: 136 VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQ-------- 187
+P ++DWR KGAVTPIK+Q +C EGI KI +G LI LSEQ+
Sbjct: 124 LPATIDWRTKGAVTPIKDQGQC------------EGIVKISTGKLISLSEQELVDCDVHG 171
Query: 188 --------LLDCS------------------TNGNNGCLGGSR---------------EK 206
L+D + T + C GS E
Sbjct: 172 EDQGCEGGLMDDAFKFIIKNGGLTTESSYPYTAADGKCKSGSNSAATVKGFEDVPANDEA 231
Query: 207 AFAYIIQNQ------------------GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIK 248
A + NQ G+ G CGT LDH + +G+G T DG YWL+K
Sbjct: 232 ALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYGQTSDGTKYWLLK 291
Query: 249 NSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 279
NSWG TWG+ GY+++ +D G+CG+ SYP
Sbjct: 292 NSWGTTWGENGYLRMEKDISDKRGMCGLAMEPSYP 326
>gi|350535639|ref|NP_001233949.1| phytophthora-inhibited protease 1 [Solanum lycopersicum]
gi|108937128|gb|ABG23376.1| phytophthora-inhibited protease 1 [Solanum lycopersicum]
Length = 345
Score = 204 bits (518), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 106/240 (44%), Positives = 149/240 (62%), Gaps = 14/240 (5%)
Query: 8 SGSFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMR 67
S F N T + ++ ++L S +V+SR+ E S++E HE WM HGR YKD++EKE R
Sbjct: 3 SNFFLKNITVVLLLFSIL-SLYPFIVTSRNLKELSMLERHENWMVHHGRVYKDDIEKEHR 61
Query: 68 LKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFK 127
K FKEN+E+IE NK G + YKL N+++DLT +EF + G S + +T++T
Sbjct: 62 FKTFKENVEFIESFNKNGTQRYKLAVNKYADLTTEEFTTSFMGLDTSLLSQQESTATTTS 121
Query: 128 YQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQ 187
++ S+T+VP S+DWR +G+VT +K+Q CGCCWAF+A AA+EG +I + LI LSEQQ
Sbjct: 122 FKYDSVTEVPNSMDWRKRGSVTGVKDQGVCGCCWAFSAAAAIEGAYQIANNELISLSEQQ 181
Query: 188 LLDCSTNGNNGCLGGSREKAFAYIIQNQG------------IFNGVCGTQLDHAVTIVGF 235
LLDCST N GC GG A+ +++QN G VC T+ AVTI G+
Sbjct: 182 LLDCSTQ-NKGCEGGLMTVAYDFLLQNNGGGITTETNYPYEEAQNVCKTEQPAAVTINGY 240
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 37/71 (52%), Positives = 54/71 (76%), Gaps = 5/71 (7%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTE-DGANYWLIKNSWGNTWGDAGYMKIVRDEGL----C 270
GI++G C ++L+HAVT++G+GT+E DG YW++KNSWG+ WG+ GYM+I RD G+ C
Sbjct: 275 GIYDGSCNSRLNHAVTVIGYGTSEEDGTKYWIVKNSWGSDWGEEGYMRIARDVGVDGGHC 334
Query: 271 GIGTRSSYPLA 281
GI +S+P A
Sbjct: 335 GIAKVASFPTA 345
>gi|388497270|gb|AFK36701.1| unknown [Lotus japonicus]
Length = 343
Score = 204 bits (518), Expect = 5e-50, Method: Compositional matrix adjust.
Identities = 110/202 (54%), Positives = 142/202 (70%), Gaps = 5/202 (2%)
Query: 17 PMFIIITLLVSCASQVVSSRSTHEQS--VVEIHEKWMAQHGRSYKDELEKEMRLKIFKEN 74
P+ + T+L +CA +S E S V + H++WM Q+GRSY ++ E E R KIF EN
Sbjct: 6 PIIALCTMLWACAYTAMSRTLYDETSSVVAKTHQQWMLQYGRSYTNDAEMEKRFKIFMEN 65
Query: 75 LEYIEK-ANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
LEYIEK N GN++YKL NQFSDLTN+EF A +TG M PS S++S +L +
Sbjct: 66 LEYIEKFNNAPGNKSYKLDLNQFSDLTNEEFIASHTGL-MIDPSKPSSSSKRASPASLDL 124
Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST 193
+D PTSLDWR++GAVT +KNQ CG CWAF+AVAAVEGI KI++GNLI LSEQQL+DC++
Sbjct: 125 SDTPTSLDWREQGAVTDVKNQGNCGSCWAFSAVAAVEGIVKIKNGNLISLSEQQLVDCAS 184
Query: 194 N-GNNGCLGGSREKAFAYIIQN 214
N N GC GG + AF+YI +N
Sbjct: 185 NEQNQGCGGGFMDNAFSYITEN 206
Score = 90.5 bits (223), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 38/76 (50%), Positives = 58/76 (76%), Gaps = 5/76 (6%)
Query: 209 AYIIQNQGIFNGVCGTQLDHAVTIVGFGTTE-DGANYWLIKNSWGNTWGDAGYMKIVRD- 266
++ + +GI++G CG+ L+H VT+VG+GT+E DG YWLIKNSWG +WG+ GYM+++R+
Sbjct: 266 SFHLYKEGIYSGPCGSSLNHGVTLVGYGTSEEDGTKYWLIKNSWGESWGENGYMRLLRES 325
Query: 267 ---EGLCGIGTRSSYP 279
EG CGI ++S+P
Sbjct: 326 GQSEGHCGIAVKASHP 341
>gi|310656790|gb|ADP02219.1| Peptidase_C1 domain-containing protein [Triticum aestivum]
Length = 419
Score = 204 bits (518), Expect = 5e-50, Method: Compositional matrix adjust.
Identities = 99/201 (49%), Positives = 133/201 (66%), Gaps = 4/201 (1%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
+ II + C+S V+S+R + ++VE HE+WMA+ R YKD EK R K FK N+ +
Sbjct: 8 LLAIIGSICLCSSTVLSARELGDAAMVEKHEQWMAKFNRVYKDSTEKAQRFKAFKANVAF 67
Query: 78 IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
IE N GN + LG NQF+DLTNDEFRA T + R+ T FKY N+S +P
Sbjct: 68 IESFNT-GNHKFWLGVNQFTDLTNDEFRATKTNKGLKRNGARAPTR--FKYNNVSTDALP 124
Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-N 196
++DWR KG VTPIK+Q +CGCCWAF+AVAA EGI K+ +G L+ LSEQ+L+DC +G +
Sbjct: 125 AAVDWRTKGVVTPIKDQGQCGCCWAFSAVAATEGIVKLSTGKLVSLSEQELVDCDVHGVD 184
Query: 197 NGCLGGSREKAFAYIIQNQGI 217
GC GG + AF +II+N G+
Sbjct: 185 QGCEGGEMDNAFKFIIKNGGL 205
Score = 73.6 bits (179), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 26/51 (50%), Positives = 38/51 (74%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD 266
G+ G CGT LDH + +G+G T DG +WL+KNSWG TWG++GY+++ +D
Sbjct: 272 GVMTGSCGTDLDHGIVAIGYGMTSDGTKFWLLKNSWGTTWGESGYLRMEKD 322
>gi|356559055|ref|XP_003547817.1| PREDICTED: cysteine proteinase RD21a [Glycine max]
Length = 366
Score = 204 bits (518), Expect = 5e-50, Method: Compositional matrix adjust.
Identities = 120/340 (35%), Positives = 171/340 (50%), Gaps = 74/340 (21%)
Query: 13 INTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFK 72
+ +T +F+ TL SCA + + + V+ ++E+W+ +H + Y EK+ R ++FK
Sbjct: 8 VTSTLLFLSFTL--SCAIDTSTITNYTDNEVMTMYEEWLVKHQKVYNGLREKDKRFQVFK 65
Query: 73 ENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSST-FKYQNL 131
+NL +I++ N N TYKLG NQF+D+TN+E+R +Y G K + T ST +Y
Sbjct: 66 DNLGFIQEHNNNQNNTYKLGLNQFADMTNEEYRVMYFGTKSDAKRRLMKTKSTGHRYAYS 125
Query: 132 SMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDC 191
+ +P +DWR KGAV PIK+Q CG CWAF+ VA VE I KI +G + LSEQ+L+DC
Sbjct: 126 AGDRLPVHVDWRVKGAVAPIKDQGSCGSCWAFSTVATVEAINKIVTGKFVSLSEQELVDC 185
Query: 192 STNGNNGCLGGSREKAFAYIIQNQGI----------FNGVCGTQLDHA--VTIVGF---- 235
N GC GG + AF +IIQN GI F+G+C +A V I GF
Sbjct: 186 DRAYNEGCNGGLMDYAFEFIIQNGGIDTDKDYPYRGFDGICDPTKKNAKVVNIDGFEDVP 245
Query: 236 ---------------------------------------GTTED------------GANY 244
GT+ D G +Y
Sbjct: 246 PYDENALKKAVAHQPVSIAIEASGRDLQLYQSGVFTGKCGTSLDHGVVVVGYGSENGVDY 305
Query: 245 WLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 280
WL++NSWG WG+ GY K+ R+ G CGI +SYP+
Sbjct: 306 WLVRNSWGTGWGEDGYFKMQRNVRTPTGKCGITMEASYPV 345
>gi|356515086|ref|XP_003526232.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 203 bits (517), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 104/203 (51%), Positives = 141/203 (69%), Gaps = 12/203 (5%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
M + +T L A QV + R+ + S+ E HE+WM ++G+ YKD E+E R ++FKEN+ Y
Sbjct: 14 MLLCMTFL---AFQV-TCRTLQDASMYERHEQWMTRYGKVYKDPQEREKRFRVFKENVNY 69
Query: 78 IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYK--MPSPSHRSTTSSTFKYQNLSMTD 135
IE N N++YKLG NQF+DLTN EF A G+K M S R+TT FK++N++ T
Sbjct: 70 IEAFNNAANKSYKLGINQFADLTNKEFIAPRNGFKGHMCSSIIRTTT---FKFENVTAT- 125
Query: 136 VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG 195
P+++DWR KGAVTPIK+Q +CGCCWAF+AVAA EGI + +G LI LSEQ+L+DC T G
Sbjct: 126 -PSTVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALSAGKLISLSEQELVDCDTKG 184
Query: 196 -NNGCLGGSREKAFAYIIQNQGI 217
+ GC GG + AF +IIQN G+
Sbjct: 185 VDQGCEGGLMDDAFKFIIQNHGL 207
Score = 94.7 bits (234), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 39/70 (55%), Positives = 52/70 (74%), Gaps = 4/70 (5%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR----DEGLCG 271
G+F G CGT+LDH VT VG+G ++DG YWL+KNSWG WG+ GY+++ R +EGLCG
Sbjct: 274 GVFTGSCGTELDHGVTAVGYGVSDDGTEYWLVKNSWGTEWGEEGYIRMQRGVDSEEGLCG 333
Query: 272 IGTRSSYPLA 281
I ++SYP A
Sbjct: 334 IAMQASYPTA 343
>gi|225446583|ref|XP_002280204.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1 [Vitis vinifera]
Length = 341
Score = 203 bits (517), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 103/201 (51%), Positives = 144/201 (71%), Gaps = 9/201 (4%)
Query: 19 FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
++ +L + ASQ ++R+ HE S+ E HE WM Q+GR YKD EK R KIFK+N+ I
Sbjct: 12 LALLFVLAAWASQA-TARNLHEASMYERHEDWMVQYGREYKDADEKSKRYKIFKDNVARI 70
Query: 79 EKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHR-STTSSTFKYQNLSMTDVP 137
E NK +++YKL N+F+DLTN+EFRA +K +H ST +++FKY+N+ T VP
Sbjct: 71 ESFNKAMDKSYKLSINEFADLTNEEFRASRNRFK----AHICSTEATSFKYENV--TAVP 124
Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-N 196
+++DWR KGAVTPIK+Q +CG CWAF+AVAA+EGIT++ +G LI LSEQ+L+DC T+G +
Sbjct: 125 STVDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGED 184
Query: 197 NGCLGGSREKAFAYIIQNQGI 217
GC GG + AF +I QN G+
Sbjct: 185 QGCSGGLMDDAFKFIEQNHGL 205
Score = 97.8 bits (242), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 40/70 (57%), Positives = 53/70 (75%), Gaps = 4/70 (5%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCG 271
G+F G CGT+LDH V+ VG+GT++DG YWL+KNSWG WG+ GY+++ RD EGLCG
Sbjct: 272 GVFTGQCGTELDHGVSAVGYGTSDDGMKYWLVKNSWGTGWGEEGYIRMQRDVTAKEGLCG 331
Query: 272 IGTRSSYPLA 281
I ++SYP A
Sbjct: 332 IAMQASYPTA 341
>gi|147839728|emb|CAN70559.1| hypothetical protein VITISV_032465 [Vitis vinifera]
Length = 341
Score = 203 bits (517), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 103/201 (51%), Positives = 143/201 (71%), Gaps = 9/201 (4%)
Query: 19 FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
++ +L + ASQ ++R HE S+ E HE WM Q+GR YKD EK R KIFK+N+ I
Sbjct: 12 LALLFVLAAWASQA-TARXLHEASMYERHEDWMVQYGREYKDADEKSKRYKIFKDNVARI 70
Query: 79 EKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHR-STTSSTFKYQNLSMTDVP 137
E NK +++YKL N+F+DLTN+EFRA +K +H ST +++FKY+N+ T VP
Sbjct: 71 ESFNKAMDKSYKLSINEFADLTNEEFRASRNRFK----AHICSTEATSFKYENV--TAVP 124
Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-N 196
+++DWR KGAVTPIK+Q +CG CWAF+AVAA+EGIT++ +G LI LSEQ+L+DC T+G +
Sbjct: 125 STVDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGED 184
Query: 197 NGCLGGSREKAFAYIIQNQGI 217
GC GG + AF +I QN G+
Sbjct: 185 QGCSGGLMDDAFKFIEQNHGL 205
Score = 95.1 bits (235), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 39/70 (55%), Positives = 51/70 (72%), Gaps = 4/70 (5%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCG 271
G+F G CGT+LDH V VG+GT++DG YWL+KNSW WG+ GY+++ RD EGLCG
Sbjct: 272 GVFTGQCGTELDHGVAAVGYGTSDDGMKYWLVKNSWSTGWGEEGYIRMQRDVTVKEGLCG 331
Query: 272 IGTRSSYPLA 281
I ++SYP A
Sbjct: 332 IAMQASYPTA 341
>gi|400180387|gb|AFP73332.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 203 bits (517), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 102/212 (48%), Positives = 142/212 (66%), Gaps = 7/212 (3%)
Query: 10 SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+ K++ + I + ++S + RS + SV E HE WM++HGR YKDE+EK R
Sbjct: 2 AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKVERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N+F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
K +LS D+P++LDW + GAVT +K+Q CGCCWAF+AV ++EG KI +GNL++ SE
Sbjct: 120 LKINDLSDDDMPSNLDWIESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
Q+LLDC+TN N GC GG AF +I +N GI
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIKENGGI 210
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 36/68 (52%), Positives = 48/68 (70%), Gaps = 4/68 (5%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCG 271
G ++G C +++HAVT +G+GT E G YWL+KNSWG +WG+ G+MKI+RD GLC
Sbjct: 274 GTYDGSCADRINHAVTAIGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDYGNPAGLCD 333
Query: 272 IGTRSSYP 279
I SSYP
Sbjct: 334 IAKMSSYP 341
>gi|38345188|emb|CAE03344.2| OSJNBb0005B05.11 [Oryza sativa Japonica Group]
gi|125589403|gb|EAZ29753.1| hypothetical protein OsJ_13812 [Oryza sativa Japonica Group]
Length = 323
Score = 203 bits (516), Expect = 7e-50, Method: Compositional matrix adjust.
Identities = 111/319 (34%), Positives = 162/319 (50%), Gaps = 58/319 (18%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
+F I+ L C++ + + + + ++ HE+WMAQ+GR YKD+ EK R ++FK N+ +
Sbjct: 8 LFAILGCLCLCSAVLAARELSDDAAMAARHERWMAQYGRMYKDDAEKARRFEVFKANVAF 67
Query: 78 IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
IE N GN + LG NQF+DLTNDEFR+ T + R T F+ +N+++ +P
Sbjct: 68 IESFNA-GNHKFWLGVNQFADLTNDEFRSTKTNKGFIPSTTRVPTG--FRNENVNIDALP 124
Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-- 195
++DWR KG VTPIK+Q +CGCCWAF+AVAA+E + Q E L+D +
Sbjct: 125 ATMDWRTKGVVTPIKDQGQCGCCWAFSAVAAMEELVDCDVHGEDQGCEGGLMDDAFKFII 184
Query: 196 NNGCL-------------------------------GGSREKAFAYIIQNQ--------- 215
NG L + E A + NQ
Sbjct: 185 KNGGLTTESNYPYAAVDDKFKSVSNSVASIKGYEDVPANNEAALMKAVANQPVSVAVDGG 244
Query: 216 ---------GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD 266
G+ G CGT LDH + +G+G DG YWL+KNSWG TWG+ G++++ +D
Sbjct: 245 DMTFQFYKGGVMTGSCGTDLDHGIVAIGYGKASDGTKYWLLKNSWGMTWGENGFLRMEKD 304
Query: 267 ----EGLCGIGTRSSYPLA 281
G+CG+ SYP A
Sbjct: 305 ISDKRGMCGLAMEPSYPTA 323
>gi|356543076|ref|XP_003539989.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 203 bits (516), Expect = 7e-50, Method: Compositional matrix adjust.
Identities = 101/188 (53%), Positives = 134/188 (71%), Gaps = 8/188 (4%)
Query: 33 VSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLG 92
V+SR+ + S+ E HE+WMA++ + YKD E+E R KIFKEN+ YIE N ++ YKLG
Sbjct: 25 VTSRTLQDASMYERHEEWMARYAKVYKDPEEREKRFKIFKENVNYIEAFNNAADKPYKLG 84
Query: 93 TNQFSDLTNDEFRALYTGYK--MPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTP 150
NQF+DLTN+EF A +K M S R+TT FKY+N+ T +P+++DWR KGAVTP
Sbjct: 85 INQFADLTNEEFIAPRNKFKGHMCSSITRTTT---FKYENV--TALPSTVDWRQKGAVTP 139
Query: 151 IKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFA 209
IK+Q +CGCCWAF+AVAA EGI + SG LI LSEQ+++DC T G + GC GG + AF
Sbjct: 140 IKDQGQCGCCWAFSAVAATEGIHALNSGKLISLSEQEVVDCDTKGEDQGCAGGFMDGAFK 199
Query: 210 YIIQNQGI 217
+IIQN G+
Sbjct: 200 FIIQNHGL 207
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 40/70 (57%), Positives = 48/70 (68%), Gaps = 4/70 (5%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR----DEGLCG 271
G+F G CGTQLDH VT VG+G + DG YWL+KNSWG WG+ GY+ + R EGLCG
Sbjct: 274 GVFTGSCGTQLDHGVTAVGYGVSADGTQYWLVKNSWGTEWGEEGYIMMQRGVKAQEGLCG 333
Query: 272 IGTRSSYPLA 281
I +SYP A
Sbjct: 334 IAMMASYPTA 343
>gi|242092700|ref|XP_002436840.1| hypothetical protein SORBIDRAFT_10g009830 [Sorghum bicolor]
gi|241915063|gb|EER88207.1| hypothetical protein SORBIDRAFT_10g009830 [Sorghum bicolor]
Length = 328
Score = 203 bits (516), Expect = 7e-50, Method: Compositional matrix adjust.
Identities = 119/336 (35%), Positives = 165/336 (49%), Gaps = 89/336 (26%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
+ I+ L C + + + + ++V HE+WM Q+ R YKD EK R ++FK N+++
Sbjct: 8 ILAILGLAFFCGAALAARDLNDDSAMVARHEQWMVQYSRVYKDTTEKARRFEVFKANVKF 67
Query: 78 IEKANKEGNRTYKLGTNQFSDLTNDEFRALYT--GYKMPSPSHRSTTSSTFKYQNLSMTD 135
IE N GNR + LG NQF+DLTNDEFRA T G+K PSP T F+Y+N+S+
Sbjct: 68 IESFNAGGNRKFWLGVNQFADLTNDEFRATKTNKGFK-PSPVKVPTG---FRYENVSVDA 123
Query: 136 VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQ-------- 187
+P ++DWR KGAVTPIK+Q +C EGI KI +G LI LSEQ+
Sbjct: 124 LPATIDWRTKGAVTPIKDQGQC------------EGIVKISTGKLISLSEQELVDCDVHG 171
Query: 188 --------LLDCS------------------TNGNNGCLGGSR---------------EK 206
L+D + T + C GS E
Sbjct: 172 EDQGCEGGLMDDAFQFIIKNGGLTTESSYPYTAADGKCKSGSNSAATVKGFEDVPANDEA 231
Query: 207 AFAYIIQNQ------------------GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIK 248
A + NQ G+ G CGT LDH + +G+G T DG YWL+K
Sbjct: 232 ALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYGQTSDGTKYWLLK 291
Query: 249 NSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 280
NSWG TWG+ GY+++ +D G+CG+ SYP+
Sbjct: 292 NSWGTTWGENGYLRMEKDISDKRGMCGLAMEPSYPI 327
>gi|356545063|ref|XP_003540965.1| PREDICTED: thiol protease SEN102-like [Glycine max]
Length = 361
Score = 203 bits (516), Expect = 8e-50, Method: Compositional matrix adjust.
Identities = 105/220 (47%), Positives = 145/220 (65%), Gaps = 8/220 (3%)
Query: 1 MVLIFERSGSFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKD 60
+L F + K + + + + L ++ + V+ RS + S+ E HE+WM ++G+ YKD
Sbjct: 11 FLLFFASTMVAKNHFCHISLAMLLCMAFLAFQVTCRSLQDASMYERHEQWMTRYGKVYKD 70
Query: 61 ELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYK--MPSPSH 118
E+E R +IFKEN+ YIE N N+ YKL NQF+DLTN+EF A +K M S
Sbjct: 71 PQEREKRFRIFKENVNYIEAFNNAANKRYKLAINQFADLTNEEFIAPRNRFKGHMCSSII 130
Query: 119 RSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSG 178
R+TT FKY+N+ T VP+++DWR KGAVTPIK+Q +CGCCWAF+AVAA EGI + SG
Sbjct: 131 RTTT---FKYENV--TAVPSTVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALTSG 185
Query: 179 NLIQLSEQQLLDCSTNG-NNGCLGGSREKAFAYIIQNQGI 217
LI LSEQ+L+DC T G + GC GG + AF ++IQN G+
Sbjct: 186 KLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFVIQNHGL 225
Score = 94.4 bits (233), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 39/70 (55%), Positives = 51/70 (72%), Gaps = 4/70 (5%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR----DEGLCG 271
G+F G CGT+LDH VT VG+G + DG YWL+KNSWG WG+ GY+++ R +EGLCG
Sbjct: 292 GVFTGSCGTELDHGVTAVGYGVSNDGTEYWLVKNSWGTEWGEEGYIRMQRGVDSEEGLCG 351
Query: 272 IGTRSSYPLA 281
I ++SYP A
Sbjct: 352 IAMQASYPTA 361
>gi|30141025|dbj|BAC75926.1| cysteine protease-4 [Helianthus annuus]
Length = 352
Score = 202 bits (514), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 118/348 (33%), Positives = 178/348 (51%), Gaps = 83/348 (23%)
Query: 11 FKINTTPMFIIITLLVSCASQV--------VSSRSTHEQSVVEIHEKWMAQHGRSYKDEL 62
F T +F++ +++C++ T V+ + E W+A+H + Y+
Sbjct: 5 FSSKKTSLFLVFVSVLACSALANEFSILGYAPEDLTSIHKVIHLFESWLAKHSKIYESLD 64
Query: 63 EKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTT 122
EK R +IF +NL++I+ NK+ + Y LG N+F+DLT++EF+ + G K P + +
Sbjct: 65 EKLHRFEIFMDNLKHIDDTNKKVS-NYWLGLNEFADLTHEEFKNKFLGLKGELPERKDES 123
Query: 123 SSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQ 182
F Y++ D+P S+DWR KGAV P+KNQ +CG CWAF+ VAAVEGI +I +GNL
Sbjct: 124 IEEFSYRDF--VDLPKSVDWRKKGAVAPVKNQGQCGSCWAFSTVAAVEGINQIVTGNLTM 181
Query: 183 LSEQQLLDCST---NGNNGCL-----------GGSREKAFAYI----------------- 211
LSEQ+L+DC T NG NG L G +E+ + YI
Sbjct: 182 LSEQELIDCDTTFNNGCNGGLMDYAFAYVMRSGLHKEEEYPYIMSEGTCDEKKDVSETVT 241
Query: 212 ------------------IQNQ------------------GIFNGVCGTQLDHAVTIVGF 235
+ NQ G+F+G CGT+LDH V VG+
Sbjct: 242 ISGYHDVPRNNEDSFLKALANQPISVAIEASGRDFQFYSGGVFDGHCGTELDHGVAAVGY 301
Query: 236 GTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 279
GTT+ G +Y +++NSWG WG+ GY+++ R G+CG+ +SYP
Sbjct: 302 GTTK-GLDYVIVRNSWGPKWGEKGYIRMKRKTGKPHGMCGLYMMASYP 348
>gi|102140014|gb|ABF70145.1| cysteine protease, putative [Musa acuminata]
Length = 373
Score = 202 bits (514), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 99/179 (55%), Positives = 128/179 (71%), Gaps = 5/179 (2%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
+ S+ E H +WMA+HGR+YKD EKE RL IFK N+EYIE N G R Y+L NQF+DL
Sbjct: 28 DASMAERHVEWMARHGRTYKDAAEKEQRLGIFKSNVEYIESFNA-GKRKYQLAANQFADL 86
Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGC 159
T++EF+A++TG+K PS + + F++ +LS VP S+DWR KGAVTP+K+Q CG
Sbjct: 87 THEEFKAMHTGFK-PSGTGAKKAGNGFRHGSLS--SVPDSVDWRSKGAVTPVKDQGLCGS 143
Query: 160 CWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFAYIIQNQGI 217
CWAF VAAVEGITKI +G LI LSEQQL+DC +G + GC GG + AF +I+ N GI
Sbjct: 144 CWAFTVVAAVEGITKIVTGKLISLSEQQLVDCDVHGKDQGCQGGDMDAAFEFIVNNGGI 202
Score = 101 bits (251), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 44/76 (57%), Positives = 57/76 (75%), Gaps = 4/76 (5%)
Query: 210 YIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD--- 266
+ + + G+F+G CGT LDHAVT+VG+GTT DG YWL KNSWG TWG+ GY+++ RD
Sbjct: 264 FQLYSGGVFSGECGTDLDHAVTVVGYGTTSDGTKYWLAKNSWGETWGENGYIRMERDVAA 323
Query: 267 -EGLCGIGTRSSYPLA 281
EGLCGI ++SYP A
Sbjct: 324 KEGLCGIAMQASYPTA 339
>gi|30690594|ref|NP_564321.2| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|28393492|gb|AAO42167.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|332192920|gb|AEE31041.1| cysteine proteinase-like protein [Arabidopsis thaliana]
Length = 355
Score = 202 bits (514), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 102/211 (48%), Positives = 141/211 (66%), Gaps = 9/211 (4%)
Query: 15 TTPMFIIITLLVSC----ASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKI 70
T+ +F++++L + SQ S + HE V E H++WM + R Y DELEK+MR +
Sbjct: 11 TSILFMLVSLTILSMNLKVSQATSRVTFHEPIVAEHHQQWMTRFSRVYSDELEKQMRFDV 70
Query: 71 FKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYK----MPSPSHRSTTSSTF 126
FK+NL++IEK NK+G+RTYKLG N+F+D T +EF A +TG K +PS ++
Sbjct: 71 FKKNLKFIEKFNKKGDRTYKLGVNEFADWTREEFIATHTGLKGVNGIPSSEFVDEMIPSW 130
Query: 127 KYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQ 186
+ N+S + DWR +GAVTP+K Q +CGCCWAF++VAAVEG+TKI NL+ LSEQ
Sbjct: 131 NW-NVSDVAGRETKDWRYEGAVTPVKYQGQCGCCWAFSSVAAVEGLTKIVGNNLVSLSEQ 189
Query: 187 QLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
QLLDC +NGC GG AF+YII+N+GI
Sbjct: 190 QLLDCDRERDNGCNGGIMSDAFSYIIKNRGI 220
Score = 82.0 bits (201), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 34/64 (53%), Positives = 47/64 (73%), Gaps = 4/64 (6%)
Query: 222 CGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSS 277
CGT ++HAVT VG+GT+ +G YWL KNSWG TWG+ GY++I RD +G+CG+ +
Sbjct: 292 CGTNVNHAVTFVGYGTSPEGIKYWLAKNSWGETWGENGYIRIRRDVAWPQGMCGVAQYAF 351
Query: 278 YPLA 281
YP+A
Sbjct: 352 YPVA 355
>gi|414587996|tpg|DAA38567.1| TPA: hypothetical protein ZEAMMB73_390779 [Zea mays]
Length = 343
Score = 202 bits (514), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 97/211 (45%), Positives = 145/211 (68%), Gaps = 9/211 (4%)
Query: 13 INTTPMFIIITLLVSCA----SQVVSSRS-THEQSVVEIHEKWMAQHGRSYKDELEKEMR 67
+++ +++ +L CA S V+++R + + ++ E HE+WMA +GR YKD EK R
Sbjct: 2 VSSRAFLLLLAILTGCACSFPSPVLAARELSDDAAMAERHERWMAVYGRVYKDAAEKARR 61
Query: 68 LKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFK 127
++FK+NL ++E N + + LG NQF+DLT +EF+A G+K S TT FK
Sbjct: 62 FEVFKDNLAFVESFNADKKNKFWLGVNQFADLTTEEFKA-NKGFKPISAEEVPTTG--FK 118
Query: 128 YQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQ 187
Y+NLS++ +PT++DWR KGAVTPIKNQ +CGCCWAF+AVAA+EGI K+ + NL+ LSEQ+
Sbjct: 119 YENLSVSALPTAVDWRTKGAVTPIKNQGQCGCCWAFSAVAAMEGIVKLSTDNLVSLSEQE 178
Query: 188 LLDCSTNG-NNGCLGGSREKAFAYIIQNQGI 217
L+DC T+ + GC GG + AF ++I+N G+
Sbjct: 179 LVDCDTHSMDEGCEGGWMDSAFEFVIKNGGL 209
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 30/74 (40%), Positives = 50/74 (67%), Gaps = 4/74 (5%)
Query: 210 YIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD--- 266
+++ + G+ G CGTQLDH + +G+G DG YW++KNSWG TWG+ ++++ +D
Sbjct: 268 FMLYSGGVMTGSCGTQLDHGIAAIGYGVESDGTKYWILKNSWGTTWGEKRFLRMEKDISD 327
Query: 267 -EGLCGIGTRSSYP 279
+G+CG+ + SYP
Sbjct: 328 KQGMCGLAMKPSYP 341
>gi|226502454|ref|NP_001140922.1| hypothetical protein [Zea mays]
gi|223948637|gb|ACN28402.1| unknown [Zea mays]
gi|413920877|gb|AFW60809.1| hypothetical protein ZEAMMB73_830238 [Zea mays]
Length = 354
Score = 202 bits (514), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 98/208 (47%), Positives = 144/208 (69%), Gaps = 10/208 (4%)
Query: 18 MFIIITLLVSCASQVVSSRSTH---EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKEN 74
+ I+ + ++ +SS ST E+++ H++WMA+HGR+Y+DE EK R ++FK N
Sbjct: 19 LTILAVKTMMAEARDLSSTSTGGYGEEAMKVRHQQWMAEHGRTYRDEAEKAHRFQVFKAN 78
Query: 75 LEYIEKANKEGN--RTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
++++ +N G+ ++Y++ N+F+D+TNDEF A+YTG + P P+ + FKY N++
Sbjct: 79 ADFVDASNAAGDDKKSYRMELNEFADMTNDEFMAMYTGLR-PVPAGAKKMAG-FKYGNVT 136
Query: 133 MTDVP---TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLL 189
++D ++DWR KGAVT IKNQ +CGCCWAFAAVAAVEGI +I +GNL+ LSEQQ+L
Sbjct: 137 LSDADDNQQTVDWRQKGAVTGIKNQGQCGCCWAFAAVAAVEGIHQITTGNLVSLSEQQVL 196
Query: 190 DCSTNGNNGCLGGSREKAFAYIIQNQGI 217
DC T GNNGC GG + AF YI N G+
Sbjct: 197 DCDTEGNNGCNGGYIDNAFQYIAGNGGL 224
Score = 79.0 bits (193), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 30/56 (53%), Positives = 41/56 (73%)
Query: 226 LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDEGLCGIGTRSSYPLA 281
L+HAVT VG+GT EDG YWL+KN WG WG+ GY+++ R CG+ ++SYP+A
Sbjct: 299 LNHAVTAVGYGTAEDGTPYWLLKNQWGQNWGEGGYLRLERGANACGVAQQASYPVA 354
>gi|413947586|gb|AFW80235.1| hypothetical protein ZEAMMB73_542371 [Zea mays]
Length = 264
Score = 202 bits (513), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 95/211 (45%), Positives = 145/211 (68%), Gaps = 9/211 (4%)
Query: 13 INTTPMFIIITLLVSCA----SQVVSSRS-THEQSVVEIHEKWMAQHGRSYKDELEKEMR 67
+++ +++ +L+ C S V+++R + + ++ E HE+WMA++GR YKD +K R
Sbjct: 2 VSSKAFLLLLAVLIGCVCSFPSPVLAARELSDDAAMAERHERWMAEYGRVYKDAADKARR 61
Query: 68 LKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFK 127
++FK+N ++E N + + LG NQF+DLT + F+A G+K S TT FK
Sbjct: 62 FEVFKDNFAFVESFNADKKNKFWLGVNQFADLTTEAFKA-NKGFKPISAEKAPTTG--FK 118
Query: 128 YQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQ 187
Y+NLS++ +PT++DWR KGAVTPIKNQ +CGCCWAF+AVAAVEGI K+ +GNL+ LSEQ+
Sbjct: 119 YENLSISALPTAVDWRTKGAVTPIKNQGQCGCCWAFSAVAAVEGIVKLSTGNLVSLSEQE 178
Query: 188 LLDCSTNG-NNGCLGGSREKAFAYIIQNQGI 217
L+DC T+ + GC GG + AF ++I+N G+
Sbjct: 179 LVDCDTHSMDEGCEGGWMDSAFEFVIKNGGL 209
>gi|357483847|ref|XP_003612210.1| Cysteine proteinase [Medicago truncatula]
gi|355513545|gb|AES95168.1| Cysteine proteinase [Medicago truncatula]
Length = 344
Score = 202 bits (513), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 105/203 (51%), Positives = 140/203 (68%), Gaps = 16/203 (7%)
Query: 33 VSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANK-EGNRTYKL 91
V+SR+ + S+ E HE+WM +G+ YKD E+E R KIF EN++YIE N + N +YKL
Sbjct: 25 VTSRTLQDGSMHERHERWMNHYGKVYKDHQEREKRFKIFTENMKYIEAFNNGDNNESYKL 84
Query: 92 GTNQFSDLTNDEFRALYTGYK--MPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVT 149
G NQF+DLTN+EF A +K M S R+TT FKY+N+S +P+++DWR KGAVT
Sbjct: 85 GINQFADLTNEEFVASRNKFKGHMCSSIIRTTT---FKYENVSA--IPSTVDWRKKGAVT 139
Query: 150 PIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAF 208
P+KNQ +CGCCWAF+AVAA EGI K+ +G L+ LSEQ+L+DC T G + GC GG + AF
Sbjct: 140 PVKNQGQCGCCWAFSAVAATEGIHKLSTGKLVSLSEQELVDCDTKGVDQGCEGGLMDDAF 199
Query: 209 AYIIQNQGI-------FNGVCGT 224
+IIQN G+ + GV GT
Sbjct: 200 KFIIQNHGLNTEAQYPYQGVDGT 222
Score = 91.7 bits (226), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 39/70 (55%), Positives = 49/70 (70%), Gaps = 4/70 (5%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCG 271
G+F G CGT+LDH VT VG+G + DG YWL+KNSWG WG+ GY+ + R EGLCG
Sbjct: 275 GVFTGSCGTELDHGVTAVGYGVSNDGTKYWLVKNSWGTDWGEEGYIMMQRGVEAAEGLCG 334
Query: 272 IGTRSSYPLA 281
I ++SYP A
Sbjct: 335 IAMQASYPTA 344
>gi|356577811|ref|XP_003557016.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 202 bits (513), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 101/188 (53%), Positives = 132/188 (70%), Gaps = 8/188 (4%)
Query: 33 VSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLG 92
V+ RS + S+ E HE+WM ++G+ YKD E+E R +IFKEN+ YIE N N+ YKL
Sbjct: 25 VTCRSLQDASMYERHEQWMTRYGKVYKDPQEREKRFRIFKENVNYIEAFNNAANKRYKLA 84
Query: 93 TNQFSDLTNDEFRALYTGYK--MPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTP 150
NQF+DLTN+EF A +K M S R+TT FKY+N+ T VP+++DWR KGAVTP
Sbjct: 85 INQFADLTNEEFIAPRNRFKGHMCSSIIRTTT---FKYENV--TAVPSTVDWRQKGAVTP 139
Query: 151 IKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFA 209
IK+Q +CGCCWAF+AVAA EGI + SG LI LSEQ+L+DC T G + GC GG + AF
Sbjct: 140 IKDQGQCGCCWAFSAVAATEGIHALTSGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFK 199
Query: 210 YIIQNQGI 217
++IQN G+
Sbjct: 200 FVIQNHGL 207
Score = 94.4 bits (233), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 39/70 (55%), Positives = 51/70 (72%), Gaps = 4/70 (5%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR----DEGLCG 271
G+F G CGT+LDH VT VG+G + DG YWL+KNSWG WG+ GY+++ R +EGLCG
Sbjct: 274 GVFTGSCGTELDHGVTAVGYGVSNDGTEYWLVKNSWGTEWGEEGYIRMQRGVNSEEGLCG 333
Query: 272 IGTRSSYPLA 281
I ++SYP A
Sbjct: 334 IAMQASYPTA 343
>gi|357446975|ref|XP_003593763.1| Cysteine proteinase [Medicago truncatula]
gi|355482811|gb|AES64014.1| Cysteine proteinase [Medicago truncatula]
Length = 350
Score = 202 bits (513), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 106/196 (54%), Positives = 138/196 (70%), Gaps = 3/196 (1%)
Query: 22 ITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKA 81
I LL +CA +S R+ E SVVE H++WM ++ R+Y + E E R KIFKENLEYIE
Sbjct: 9 IILLWACAYPTMS-RTLTESSVVEAHQQWMMKYERTYTNSSEMEKRKKIFKENLEYIENF 67
Query: 82 NKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLD 141
N GN++YKLG N++SDLT++EF A +TG+K+ S S NL+ DVPT+ D
Sbjct: 68 NNVGNKSYKLGLNRYSDLTSEEFIASHTGFKVSDQLSDSKMRSVAIPFNLN-DDVPTNFD 126
Query: 142 WRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLG 201
WR+KG VT +KNQ++CGCCWAF AVAAVEGI KI++GNLI LSEQQL+DC ++GC G
Sbjct: 127 WREKGVVTDVKNQRQCGCCWAFTAVAAVEGIVKIKNGNLISLSEQQLVDCDRQ-SSGCGG 185
Query: 202 GSREKAFAYIIQNQGI 217
G AF II+++GI
Sbjct: 186 GDFVLAFDSIIKSRGI 201
Score = 89.0 bits (219), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 36/68 (52%), Positives = 49/68 (72%), Gaps = 4/68 (5%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE----GLCG 271
G++ G CG +L+HAVTI+G+G +E G YWLIKNSWG TWG+ GYMK++R+ G C
Sbjct: 267 GVYEGSCGPKLNHAVTIIGYGVSEAGKKYWLIKNSWGETWGEKGYMKVLRESSATGGQCS 326
Query: 272 IGTRSSYP 279
I ++YP
Sbjct: 327 IAVHAAYP 334
>gi|255557851|ref|XP_002519955.1| cysteine protease, putative [Ricinus communis]
gi|223541001|gb|EEF42559.1| cysteine protease, putative [Ricinus communis]
Length = 321
Score = 201 bits (512), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 104/201 (51%), Positives = 132/201 (65%), Gaps = 27/201 (13%)
Query: 20 IIITLLV---SCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLE 76
+ I LLV + ASQ ++ + +E ++VE HE+WMA+HGR+Y+D EKE R +IFK NLE
Sbjct: 9 LAIALLVVFSTWASQAMARQLINEDALVEKHEQWMARHGRTYQDSEEKERRFQIFKSNLE 68
Query: 77 YIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDV 136
YI+ NK N+TY+LG N F+DL+++E+ A YT KMP +V
Sbjct: 69 YIDNFNKASNQTYQLGLNNFADLSHEEYVATYTARKMP-------------------VEV 109
Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGN 196
P S+DWRD GAVTPIKNQ +CGCCWAF+A AAVEGI N + LS QQLLDC ++ N
Sbjct: 110 PESIDWRDHGAVTPIKNQYQCGCCWAFSAAAAVEGIV----ANGVSLSAQQLLDCVSD-N 164
Query: 197 NGCLGGSREKAFAYIIQNQGI 217
GC GG AF YIIQNQGI
Sbjct: 165 QGCKGGWMNNAFNYIIQNQGI 185
>gi|414588010|tpg|DAA38581.1| TPA: hypothetical protein ZEAMMB73_156486 [Zea mays]
Length = 347
Score = 201 bits (512), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 107/240 (44%), Positives = 148/240 (61%), Gaps = 23/240 (9%)
Query: 17 PMFIIITLL----VSCASQVVSSR---STHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
P +++ +L C++ V+++R E ++V HE+WM QHGR YKDE +K R
Sbjct: 4 PKALLLAILGCGVCLCSAAVLAARELGGDDELAMVARHEQWMVQHGRVYKDETDKAHRFL 63
Query: 70 IFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTF 126
+FK N+++IE N GNR + LG NQF+DLTNDEFRA T + T F
Sbjct: 64 VFKANVKFIESFNAAAAAGNRKFWLGVNQFADLTNDEFRATKTNKGFNPNVVKVPTG--F 121
Query: 127 KYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQ 186
+YQNLS+ +P ++DWR KGAVTPIK+Q +CGCCWAF+AVAA EGI KI +G L LSEQ
Sbjct: 122 RYQNLSIDALPQTVDWRTKGAVTPIKDQGQCGCCWAFSAVAATEGIVKISTGKLTSLSEQ 181
Query: 187 QLLDCSTNG-NNGCLGGSREKAFAYIIQNQGIF----------NGVCGTQLDHAVTIVGF 235
+L+DC +G + GC GG + AF +II+N G+ +G C + + A TI G+
Sbjct: 182 ELVDCDVHGEDQGCNGGEMDDAFKFIIKNGGLTTESNYPYTAQDGQCKSGSNGAATIKGY 241
Score = 84.7 bits (208), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 33/70 (47%), Positives = 48/70 (68%), Gaps = 4/70 (5%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCG 271
G+ G CGT LDH + +G+G T DG YWL+KNSWG TWG+ G++++ +D +G+CG
Sbjct: 278 GVMTGSCGTDLDHGIAAIGYGKTSDGTKYWLMKNSWGTTWGENGFLRMEKDIADKKGMCG 337
Query: 272 IGTRSSYPLA 281
+ + SYP A
Sbjct: 338 LAMQPSYPTA 347
>gi|356517358|ref|XP_003527354.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
gi|356577767|ref|XP_003556994.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 343
Score = 201 bits (512), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 105/218 (48%), Positives = 141/218 (64%), Gaps = 20/218 (9%)
Query: 33 VSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLG 92
V+ R+ + S+ E HE+WM ++ + YKD E+E R KIFKEN+ YIE N N+ Y LG
Sbjct: 25 VTCRTLQDASMYERHEEWMGRYAKVYKDPQERERRFKIFKENVNYIEAFNNAANKPYTLG 84
Query: 93 TNQFSDLTNDEFRALYTGYK--MPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTP 150
NQF+DLTN+EF A +K M S R+TT FKY+N+ T +P+++DWR KGAVTP
Sbjct: 85 INQFADLTNEEFIAPRNRFKGHMCSSITRTTT---FKYENV--TAIPSTVDWRQKGAVTP 139
Query: 151 IKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFA 209
IK+Q +CGCCWAF+AVAA EGI + +G LI LSEQ+++DC T G + GC GG + AF
Sbjct: 140 IKDQGQCGCCWAFSAVAATEGIHALSAGKLISLSEQEVVDCDTKGEDQGCAGGFMDGAFK 199
Query: 210 YIIQNQGIFN----------GVCGTQ--LDHAVTIVGF 235
+IIQN G+ N G C + +H TI G+
Sbjct: 200 FIIQNHGLNNEPNYPYKAVDGKCNAKAAANHVATITGY 237
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 38/70 (54%), Positives = 49/70 (70%), Gaps = 4/70 (5%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR----DEGLCG 271
G+F G CGT+LDH VT VG+G + DG YWL+KNSWG WG+ GY+++ R +EGL G
Sbjct: 274 GVFTGSCGTELDHGVTAVGYGVSADGTEYWLVKNSWGTEWGEEGYIRMQRGVKAEEGLXG 333
Query: 272 IGTRSSYPLA 281
I +SYP A
Sbjct: 334 IAMMASYPTA 343
>gi|356517348|ref|XP_003527349.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 201 bits (511), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 105/218 (48%), Positives = 141/218 (64%), Gaps = 20/218 (9%)
Query: 33 VSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLG 92
V+ R+ + S+ E HE+WM ++ + YKD E+E R KIFKEN+ YIE N N+ Y LG
Sbjct: 25 VTCRTLQDASMYERHEEWMGRYAKVYKDPQERERRFKIFKENVNYIEAFNNAANKPYTLG 84
Query: 93 TNQFSDLTNDEFRALYTGYK--MPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTP 150
NQF+DLTN+EF A +K M S R+TT FKY+N+ T +P+++DWR KGAVTP
Sbjct: 85 INQFADLTNEEFIAPRNRFKGHMCSSITRTTT---FKYENV--TAIPSTVDWRQKGAVTP 139
Query: 151 IKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFA 209
IK+Q +CGCCWAF+AVAA EGI + +G LI LSEQ+++DC T G + GC GG + AF
Sbjct: 140 IKDQGQCGCCWAFSAVAATEGIHALSAGKLISLSEQEVVDCDTKGEDQGCAGGFMDGAFK 199
Query: 210 YIIQNQGIFN----------GVCGTQ--LDHAVTIVGF 235
+IIQN G+ N G C + +H TI G+
Sbjct: 200 FIIQNHGLNNEPNYPYKAVDGKCNAKAAANHVATITGY 237
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 39/70 (55%), Positives = 50/70 (71%), Gaps = 4/70 (5%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR----DEGLCG 271
G+F G CGT+LDH VT VG+G + DG YWL+KNSWG WG+ GY+++ R +EGLCG
Sbjct: 274 GVFTGSCGTELDHGVTAVGYGVSADGTEYWLVKNSWGTEWGEEGYIRMQRGVKAEEGLCG 333
Query: 272 IGTRSSYPLA 281
I +SYP A
Sbjct: 334 IAMMASYPTA 343
>gi|356577763|ref|XP_003556992.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 201 bits (511), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 105/218 (48%), Positives = 141/218 (64%), Gaps = 20/218 (9%)
Query: 33 VSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLG 92
V+ R+ + S+ E HE+WM ++ + YKD E+E R KIFKEN+ YIE N N+ Y LG
Sbjct: 25 VTCRTLQDASMYERHEEWMGRYAKVYKDPQERERRFKIFKENVNYIEAFNNAANKPYTLG 84
Query: 93 TNQFSDLTNDEFRALYTGYK--MPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTP 150
NQF+DLTN+EF A +K M S R+TT FKY+N+ T +P+++DWR KGAVTP
Sbjct: 85 INQFADLTNEEFIAPRNRFKGHMCSSITRTTT---FKYENV--TAIPSTVDWRQKGAVTP 139
Query: 151 IKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFA 209
IK+Q +CGCCWAF+AVAA EGI + +G LI LSEQ+++DC T G + GC GG + AF
Sbjct: 140 IKDQGQCGCCWAFSAVAATEGIHALSAGKLISLSEQEVVDCDTKGEDQGCAGGFMDGAFK 199
Query: 210 YIIQNQGIFN----------GVCGTQ--LDHAVTIVGF 235
+IIQN G+ N G C + +H TI G+
Sbjct: 200 FIIQNHGLNNEPNYPYKAVDGKCNAKAAANHVATITGY 237
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 39/70 (55%), Positives = 50/70 (71%), Gaps = 4/70 (5%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR----DEGLCG 271
G+F G CGT+LDH VT VG+G + DG YWL+KNSWG WG+ GY+++ R +EGLCG
Sbjct: 274 GVFTGSCGTELDHGVTAVGYGVSADGTEYWLVKNSWGTEWGEEGYIRMQRGVKAEEGLCG 333
Query: 272 IGTRSSYPLA 281
I +SYP A
Sbjct: 334 IAMMASYPTA 343
>gi|47524507|gb|AAT34987.1| putative cysteine protease [Gossypium hirsutum]
Length = 344
Score = 201 bits (511), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 105/207 (50%), Positives = 146/207 (70%), Gaps = 11/207 (5%)
Query: 18 MFIIITLLVS-CASQVVS--SRSTHEQSVVEIHEKWMAQHGRSYKDELE--KEMRLKIFK 72
+F+ + L++S C S ++ SR ++ + HE+WM+QHGR Y DE E K R +FK
Sbjct: 6 IFLFVALVLSFCFSIQLAGLSRPLLDEDSMR-HEEWMSQHGRVYADEQEDHKNKRFNVFK 64
Query: 73 ENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSP-SHRSTTSSTFKYQNL 131
EN+E IE+ N +T+KL NQF+DLTN+EFRA Y G+K P S + T + F+Y+N+
Sbjct: 65 ENVERIEEFND--GKTFKLAINQFADLTNEEFRASYNGFKGPMVLSSQITKPTPFRYENV 122
Query: 132 SMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDC 191
S + +P S+DWR KGAVTP+KNQ +CGCCWAF+AVAA+EGIT+I +G LI LSEQ+L+DC
Sbjct: 123 S-SALPVSVDWRKKGAVTPVKNQGQCGCCWAFSAVAAIEGITQISTGKLISLSEQELVDC 181
Query: 192 STNG-NNGCLGGSREKAFAYIIQNQGI 217
T G ++GC GG + AF +II N G+
Sbjct: 182 DTKGIDHGCEGGLMDTAFEFIINNGGL 208
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 39/70 (55%), Positives = 55/70 (78%), Gaps = 4/70 (5%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCG 271
G+F G CGT+LDHAVT VG+G +EDG+ YW++KNSWG WG++GY+++ +D +GLCG
Sbjct: 275 GVFTGECGTELDHAVTAVGYGESEDGSKYWIVKNSWGTKWGESGYIEMQKDIKVKQGLCG 334
Query: 272 IGTRSSYPLA 281
I ++SYP A
Sbjct: 335 IAMQASYPTA 344
>gi|147788834|emb|CAN64655.1| hypothetical protein VITISV_005140 [Vitis vinifera]
Length = 341
Score = 201 bits (510), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 103/203 (50%), Positives = 142/203 (69%), Gaps = 10/203 (4%)
Query: 19 FIIITLLVSCASQV--VSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLE 76
+I + LL A+ +R+ HE S+ E HE WMAQ+GR YKD EK R KIFK+N+
Sbjct: 9 YICLALLFVLAAWASHAKARNLHEASMYERHEDWMAQYGRVYKDAGEKSKRYKIFKDNVA 68
Query: 77 YIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHR-STTSSTFKYQNLSMTD 135
IE NK N++YKL N+F+DLTN+EFRA +K +H ST +++FKY+++
Sbjct: 69 RIESFNKAMNKSYKLSINEFADLTNEEFRASRNRFK----AHICSTEATSFKYEHVXA-- 122
Query: 136 VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG 195
VP+++DWR KGAVTPIK+Q +CG CWAF+AVAA+EGIT++ +G LI LSEQ+L+DC T+G
Sbjct: 123 VPSTVDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSG 182
Query: 196 -NNGCLGGSREKAFAYIIQNQGI 217
+ GC GG + AF +I QN G+
Sbjct: 183 EDQGCSGGLMDDAFKFIEQNHGL 205
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 41/78 (52%), Positives = 56/78 (71%), Gaps = 4/78 (5%)
Query: 208 FAYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD- 266
F + + G+F G CGT+LDH V+ VG+GT++DG YWL+KNSWG WG+ GY+++ RD
Sbjct: 264 FEFQFYSSGVFTGQCGTELDHGVSAVGYGTSDDGMKYWLVKNSWGTGWGEEGYIRMQRDV 323
Query: 267 ---EGLCGIGTRSSYPLA 281
EGLCGI ++SYP A
Sbjct: 324 TEKEGLCGIAMQASYPTA 341
>gi|356515036|ref|XP_003526207.1| PREDICTED: thiol protease SEN102-like [Glycine max]
Length = 336
Score = 201 bits (510), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 96/186 (51%), Positives = 131/186 (70%), Gaps = 7/186 (3%)
Query: 33 VSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLG 92
V R HE S+ E HE+WM ++G+ YKD EK+ R +IFK+N+E+IE N +GN+ YKLG
Sbjct: 24 VMCRKLHETSMRERHEQWMTEYGKVYKDAAEKDKRFQIFKDNVEFIESFNADGNKPYKLG 83
Query: 93 TNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIK 152
N +DLT +EF+A G+K P H +T +TFKY+N+ T +P ++DWR KGAVTPIK
Sbjct: 84 VNHLADLTVEEFKASRNGFKRP---HEFST-TTFKYENV--TAIPAAIDWRTKGAVTPIK 137
Query: 153 NQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFAYI 211
+Q +CG CWAF+ +AA EGI +I +G L+ LSEQ+L+DC T G + GC GG E F +I
Sbjct: 138 DQGQCGSCWAFSTIAATEGIHQITTGKLVSLSEQELVDCDTKGVDQGCEGGYMEDGFEFI 197
Query: 212 IQNQGI 217
I+N GI
Sbjct: 198 IKNGGI 203
Score = 87.4 bits (215), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 47/130 (36%), Positives = 71/130 (54%), Gaps = 18/130 (13%)
Query: 159 CCWAFAAVAAVEGITKIRSGNLIQLSE---QQLLDCSTNGNNGCLGGSREKAFAYIIQNQ 215
C A + VA ++G K+ + L + Q + S + + ++ +
Sbjct: 218 CNKATSPVAQIKGYEKVPPNSETALQKAVANQPVSVSIDADGA----------GFMFYSS 267
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR----DEGLCG 271
GI+NG CGT+LDH VT VG+GT +G +YW++KNSWG WG+ GY+++ R GLCG
Sbjct: 268 GIYNGECGTELDHGVTAVGYGTA-NGTDYWIVKNSWGTQWGEKGYVRMQRGIAAKHGLCG 326
Query: 272 IGTRSSYPLA 281
I SSYP +
Sbjct: 327 IALDSSYPTS 336
>gi|38345008|emb|CAD40026.2| OSJNBa0052O21.11 [Oryza sativa Japonica Group]
gi|125589414|gb|EAZ29764.1| hypothetical protein OsJ_13822 [Oryza sativa Japonica Group]
Length = 339
Score = 201 bits (510), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 93/201 (46%), Positives = 137/201 (68%), Gaps = 4/201 (1%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
+F I+ L C++ + + + + ++ HE+WMAQ+GR Y+D+ EK R ++FK N+ +
Sbjct: 8 LFAILGCLCLCSAVLAARELSDDAAMAARHERWMAQYGRVYRDDAEKARRFEVFKANVAF 67
Query: 78 IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
IE N GN + LG NQF+DLTNDEFR + T + R T F+Y+N+++ +P
Sbjct: 68 IESFNA-GNHNFWLGVNQFADLTNDEFRWMKTNKGFIPSTTRVPTG--FRYENVNIDALP 124
Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-N 196
++DWR KGAVTPIK+Q +CGCCWAF+AVAA+EGI K+ +G LI LSEQ+L+DC +G +
Sbjct: 125 ATVDWRTKGAVTPIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLISLSEQELVDCDVHGED 184
Query: 197 NGCLGGSREKAFAYIIQNQGI 217
GC GG + AF +II+N G+
Sbjct: 185 QGCEGGLMDDAFKFIIKNGGL 205
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 39/124 (31%), Positives = 65/124 (52%), Gaps = 17/124 (13%)
Query: 165 AVAAVEGITKIRSGN---LIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIFNGV 221
+VA+++G + + N L++ Q + + +G + + G+ G
Sbjct: 226 SVASIKGYEDVPANNEAALMKAVANQPVSVAVDGGD----------MTFQFYKGGVMTGS 275
Query: 222 CGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSS 277
CGT LDH + +G+G DG YWL+KNSWG TWG+ G++++ +D G+CG+ S
Sbjct: 276 CGTDLDHGIVAIGYGKASDGTKYWLLKNSWGTTWGENGFLRMEKDISDKRGMCGLAMEPS 335
Query: 278 YPLA 281
YP A
Sbjct: 336 YPTA 339
>gi|356539398|ref|XP_003538185.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 200 bits (509), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 103/217 (47%), Positives = 145/217 (66%), Gaps = 21/217 (9%)
Query: 34 SSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGT 93
++R+ + + E HE+WMA HG+ Y EKE + + FKEN++ IE N GN+ YKLG
Sbjct: 27 NARTLEDAPMRERHEQWMAIHGKVYTHSYEKEQKYQTFKENVQRIEAFNHAGNKPYKLGI 86
Query: 94 NQFSDLTNDEFRAL--YTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPI 151
N F+DLTN+EF+A+ + G+ + T + TF+Y+N MT VP +LDWR +GAVTPI
Sbjct: 87 NHFADLTNEEFKAINRFKGH----VCSKITRTPTFRYEN--MTAVPATLDWRQEGAVTPI 140
Query: 152 KNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFAY 210
K+Q +CGCCWAF+AVAA EGITK+ +G LI LSEQ+L+DC T G + GC GG + AF +
Sbjct: 141 KDQGQCGCCWAFSAVAATEGITKLSTGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKF 200
Query: 211 IIQNQGI-------FNGVCGT-----QLDHAVTIVGF 235
I+QN+G+ + GV GT + +HA +I G+
Sbjct: 201 ILQNKGLAAEAIYPYEGVDGTCNAKAEGNHATSIKGY 237
Score = 96.3 bits (238), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 42/78 (53%), Positives = 53/78 (67%), Gaps = 4/78 (5%)
Query: 208 FAYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD- 266
F + + G+F G CGT LDH VT VG+G ++DG YWL+KNSWG WGD GY+++ RD
Sbjct: 266 FEFQFYSGGVFTGSCGTNLDHGVTAVGYGVSDDGTKYWLVKNSWGVKWGDKGYIRMQRDV 325
Query: 267 ---EGLCGIGTRSSYPLA 281
EGLCGI +SYP A
Sbjct: 326 AAKEGLCGIAMLASYPNA 343
>gi|116309178|emb|CAH66275.1| OSIGBa0147O06.5 [Oryza sativa Indica Group]
Length = 339
Score = 200 bits (509), Expect = 5e-49, Method: Compositional matrix adjust.
Identities = 93/201 (46%), Positives = 136/201 (67%), Gaps = 4/201 (1%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
+F I+ L C++ + + + + ++ HE+WMAQ+GR YKD+ EK R ++FK N+ +
Sbjct: 8 LFAILGCLCLCSAVLAARELSDDAAMAARHERWMAQYGRMYKDDAEKARRFEVFKANVAF 67
Query: 78 IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
IE N GN + LG NQF+DLTNDEFR+ T + R T F+Y+N+++ +P
Sbjct: 68 IESFNA-GNHKFWLGVNQFADLTNDEFRSTKTNKGFIPSTTRVPTG--FRYENVNIDALP 124
Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-N 196
++DWR KG VTPIK+Q +CGCCWAF+AVAA+EGI K+ +G LI LSEQ+L+DC +G +
Sbjct: 125 ATMDWRTKGVVTPIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLISLSEQELVDCDVHGED 184
Query: 197 NGCLGGSREKAFAYIIQNQGI 217
GC GG + AF +II+N G+
Sbjct: 185 QGCEGGLMDDAFKFIIKNGGL 205
Score = 84.0 bits (206), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 39/124 (31%), Positives = 65/124 (52%), Gaps = 17/124 (13%)
Query: 165 AVAAVEGITKIRSGN---LIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIFNGV 221
+VA+++G + + N L++ Q + + +G + + G+ G
Sbjct: 226 SVASIKGYEDVPANNEAALMKAVANQPVSVAVDGGD----------MTFQFYKGGVMTGS 275
Query: 222 CGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSS 277
CGT LDH + +G+G DG YWL+KNSWG TWG+ G++++ +D G+CG+ S
Sbjct: 276 CGTDLDHGIVAIGYGKASDGTKYWLLKNSWGTTWGENGFLRMEKDISDKRGMCGLAMEPS 335
Query: 278 YPLA 281
YP A
Sbjct: 336 YPTA 339
>gi|325185016|emb|CCA19507.1| cysteine protease family C01A putative [Albugo laibachii Nc14]
Length = 492
Score = 200 bits (509), Expect = 6e-49, Method: Compositional matrix adjust.
Identities = 109/281 (38%), Positives = 147/281 (52%), Gaps = 51/281 (18%)
Query: 50 WMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYT 109
W+ H ++ D E RL+ + N YI N + ++KLG N FS LTN+EFR +
Sbjct: 36 WLKTHHLTFSDAFEYAKRLETYIANDIYILTHNLQ-ESSFKLGHNAFSHLTNEEFRQRFN 94
Query: 110 GYKMPSP--SHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVA 167
G+K + R S+ N D+P S+DW +KGAVT +KNQ CG CWAF+
Sbjct: 95 GFKASDDYLTKRLAQSNVASSTNFQYIDLPESVDWVEKGAVTGVKNQGMCGSCWAFSTTG 154
Query: 168 AVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI---------- 217
A+EG T I SG L+ LSEQ+L+DC NG++GC GG + AF++I ++ GI
Sbjct: 155 AIEGATFISSGKLVSLSEQELVDCDHNGDHGCNGGLMDHAFSWISEHDGICSEEDYAYIH 214
Query: 218 ---------------------------------FNGVCGTQLDHAVTIVGFGTTEDGANY 244
+N CGTQLDH V VG+G EDG Y
Sbjct: 215 SQSLCRSCKPVVSPVAVAIDAGDRSFQFYQSGVYNKTCGTQLDHGVLTVGYG-VEDGQKY 273
Query: 245 WLIKNSWGNTWGDAGYMKIVRDE----GLCGIGTRSSYPLA 281
W +KNSWGN+WG+ GY+++ RD+ G CGI SYP A
Sbjct: 274 WKVKNSWGNSWGEKGYIRLSRDQNGRSGQCGIAMVPSYPTA 314
>gi|224135841|ref|XP_002327317.1| predicted protein [Populus trichocarpa]
gi|222835687|gb|EEE74122.1| predicted protein [Populus trichocarpa]
Length = 342
Score = 200 bits (508), Expect = 6e-49, Method: Compositional matrix adjust.
Identities = 96/200 (48%), Positives = 131/200 (65%), Gaps = 4/200 (2%)
Query: 19 FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
F L++ + V+SR E S+ HE+WM G+ Y D EKE R +IFK+N+EYI
Sbjct: 10 FFAFILILGMWAYEVASRELQEPSMSARHEQWMETFGKVYADAAEKERRFEIFKDNVEYI 69
Query: 79 EKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPT 138
E N GN+ YKL N+F+DLTN+E + GY+ P + R ++FKY+N+ T VP
Sbjct: 70 ESFNTAGNKPYKLSVNKFADLTNEELKVARNGYRRPLQT-RPMKVTSFKYENV--TAVPA 126
Query: 139 SLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NN 197
++DWR KGAVTPIK+Q +CG CWAF+ VAA EGI ++ +G L+ LSEQ+L+DC T G +
Sbjct: 127 TMDWRKKGAVTPIKDQGQCGSCWAFSTVAATEGINQLTTGKLVSLSEQELVDCDTQGEDQ 186
Query: 198 GCLGGSREKAFAYIIQNQGI 217
GC GG E F +II+N GI
Sbjct: 187 GCEGGLMEDGFEFIIKNHGI 206
Score = 98.6 bits (244), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 42/70 (60%), Positives = 51/70 (72%), Gaps = 4/70 (5%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCG 271
G+F G CGT+LDH VT VG+G T DG YWL+KNSWG +WG+ GY+++ RD EGLCG
Sbjct: 273 GVFTGQCGTELDHGVTAVGYGETSDGTKYWLVKNSWGTSWGEEGYIRMQRDTEAEEGLCG 332
Query: 272 IGTRSSYPLA 281
I SSYP A
Sbjct: 333 IAMDSSYPTA 342
>gi|30141023|dbj|BAC75925.1| cysteine protease-3 [Helianthus annuus]
Length = 348
Score = 200 bits (508), Expect = 6e-49, Method: Compositional matrix adjust.
Identities = 112/316 (35%), Positives = 165/316 (52%), Gaps = 82/316 (25%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
++S+ +++E+W +QH S + EK+ R +FK N+ +I + N+ G + YKL N+F+D+
Sbjct: 33 DKSLWDLYERWGSQHMVSRAPD-EKKKRFNVFKYNVNHINRVNQLG-KPYKLKLNEFADM 90
Query: 100 TNDEFRALYTG----YKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQK 155
TN EF+A + ++M R T + + TD P S+DWR GAV PIKNQ
Sbjct: 91 TNHEFKAGFDSKILHFRMLKGKRRQTP-----FTHAKTTDPPPSIDWRTNGAVNPIKNQG 145
Query: 156 ECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN--GNNGCL------------G 201
CG CWAF+ + VEGI KI++ L+ LSEQ+L+DC T+ G NG L G
Sbjct: 146 RCGSCWAFSTIVGVEGINKIKTNQLVSLSEQELVDCETDCEGCNGGLMENGYEFIKETGG 205
Query: 202 GSREKAFAYIIQN----------------------------------------------- 214
+ E+ + Y +N
Sbjct: 206 VTTEQIYPYFARNGRCDISKRNSPVVKIDGFENVPANDESAMLRAVANQPVSIAIDAGGL 265
Query: 215 ------QGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR--- 265
QG+FNG CGT+L+H V IVG+GTT+DG NYW+++NSWG WG+ GY+++ R
Sbjct: 266 NFQFYSQGVFNGACGTELNHGVAIVGYGTTQDGTNYWIVRNSWGTGWGEQGYVRMQRGVN 325
Query: 266 -DEGLCGIGTRSSYPL 280
EGLCG+ +SYP+
Sbjct: 326 VPEGLCGLAMDASYPI 341
>gi|37780039|gb|AAP32192.1| cysteine protease 14 [Trifolium repens]
Length = 351
Score = 200 bits (508), Expect = 6e-49, Method: Compositional matrix adjust.
Identities = 115/310 (37%), Positives = 173/310 (55%), Gaps = 78/310 (25%)
Query: 43 VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTND 102
++E+ E WM++HG+ Y+ EK +R ++FK+NL++I+ NK + Y LG N+F+DL++
Sbjct: 43 LIELFESWMSRHGKIYETIEEKLLRFEVFKDNLKHIDDRNKIVS-NYWLGLNEFADLSHQ 101
Query: 103 EFRALYTGYKMPSPSHR-STTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCW 161
EF+ Y G K+ R S+ F Y+++ D+P S+DWR KGAVTP+KNQ +CG CW
Sbjct: 102 EFKNKYLGLKVDLSQRRESSNEEEFTYRDV---DLPKSVDWRKKGAVTPVKNQGQCGSCW 158
Query: 162 AFAAVAAVEGITKIRSGNLIQLSEQQLLDCST---NGNNGCL------------GGSREK 206
AF+ VAAVEGI +I +GNL LSEQ+L+DC T NG NG L G +E+
Sbjct: 159 AFSTVAAVEGINQIVTGNLTSLSEQELIDCDTTYNNGCNGGLMDYAFSFIGQNGGLHKEE 218
Query: 207 AFAYIIQ-----------------------------------NQ---------------- 215
+ YI++ NQ
Sbjct: 219 DYPYIMEESTCEMKKEETQVVTINGYHDVPQNNEQSLLKALANQPLSVAIEASSRDFQFY 278
Query: 216 --GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGL 269
G+F+G CG+ LDH V+ VG+GT+++ +Y ++KNSWG WG+ G++++ RD EG+
Sbjct: 279 SGGVFDGHCGSDLDHGVSAVGYGTSKN-LDYIIVKNSWGAKWGEKGFIRMKRDIGKPEGI 337
Query: 270 CGIGTRSSYP 279
CG+ +SYP
Sbjct: 338 CGLYKMASYP 347
>gi|414879123|tpg|DAA56254.1| TPA: hypothetical protein ZEAMMB73_708930 [Zea mays]
Length = 368
Score = 200 bits (508), Expect = 7e-49, Method: Compositional matrix adjust.
Identities = 109/315 (34%), Positives = 159/315 (50%), Gaps = 77/315 (24%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
++++ +++E+W H R ++ EK R FKEN +I NK G+R Y+L N+F D+
Sbjct: 35 DEALWDLYERWQTHH-RVHRHHGEKGRRFGTFKENARFIHAHNKRGDRPYRLRLNRFGDM 93
Query: 100 TNDEFRALYTGYKMPSPSHRSTTSST---FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKE 156
+EFR+ + ++ T + F Y + TD+P S+DWR KGAVT +KNQ
Sbjct: 94 GREEFRSGFADSRINDLRREPTAAPAVPGFMYDD--ATDLPRSVDWRQKGAVTAVKNQGR 151
Query: 157 CGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGN-------------------- 196
CG CWAF+ V AVEGI IR+G+L+ LSEQ+L+DC T+ N
Sbjct: 152 CGSCWAFSTVVAVEGINAIRTGSLVSLSEQELIDCDTDENGCQGGLMENAFEFIKSHGGI 211
Query: 197 -----------NGCLGGSR-------------------EKAFAYIIQNQ----------- 215
NG G+R E A A + +Q
Sbjct: 212 TTESAYPYHASNGTCDGARARRGRVVAIDGHQAVPAGSEDALAKAVAHQPVSVAIDAGGQ 271
Query: 216 -------GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR--- 265
G+F G CGT LDH V VG+G ++DG YW++KNSWG +WG+ GY+++ R
Sbjct: 272 ALQFYSEGVFTGDCGTDLDHGVAAVGYGVSDDGTPYWIVKNSWGPSWGEGGYIRMQRGTG 331
Query: 266 DEGLCGIGTRSSYPL 280
+ GLCGI +S+P+
Sbjct: 332 NGGLCGIAMEASFPI 346
>gi|363814535|ref|NP_001242660.1| uncharacterized protein LOC100807362 precursor [Glycine max]
gi|255636658|gb|ACU18666.1| unknown [Glycine max]
Length = 367
Score = 199 bits (507), Expect = 8e-49, Method: Compositional matrix adjust.
Identities = 122/343 (35%), Positives = 175/343 (51%), Gaps = 83/343 (24%)
Query: 18 MFIIITLLVSCASQVVSSRSTH--------EQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+F ++ + + ++S +H ++ V+ I+E+W+ +HG+ Y EKE R +
Sbjct: 15 LFTVLAVSSALDMSIISYDRSHADKSGWKSDEEVMSIYEEWLVKHGKVYNAVEEKEKRFQ 74
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQ 129
IFK+NL +IE+ N NRTYK+G N+FSDL+N+E+R+ Y G K+ PS R + +Y
Sbjct: 75 IFKDNLNFIEEHNAV-NRTYKVGLNRFSDLSNEEYRSKYLGTKI-DPS-RMMARPSRRYS 131
Query: 130 NLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLL 189
++P S+DWR +GAV +KNQ EC CWAF+A+AAVEGI KI +GNL LSEQ+LL
Sbjct: 132 PRVADNLPESVDWRKEGAVVRVKNQSECEGCWAFSAIAAVEGINKIVTGNLTALSEQELL 191
Query: 190 DCSTNGNNGCLGGSREKAFAYIIQNQGI----------FNGVCGTQL--DHAVTIVGF-- 235
DC N GC GG + AF +II N GI +G+C AVTI G+
Sbjct: 192 DCDRTVNAGCSGGLVDYAFEFIINNGGIDTEEDYPFQGADGICDQYKINARAVTIDGYER 251
Query: 236 -----------------------------------------GTTED------------GA 242
GT+ D G
Sbjct: 252 VPAYDELALKKAVANQPVSVAIEAYGKEFQLYESGIFTGTCGTSIDHGVTAVGYGTENGI 311
Query: 243 NYWLIKNSWGNTWGDAGYMKIVRD-----EGLCGIGTRSSYPL 280
+YW++KNSWG WG+AGY+ + R+ G CGI + YP+
Sbjct: 312 DYWIVKNSWGENWGEAGYVGMERNIAEDTAGKCGIAILTLYPI 354
>gi|297794671|ref|XP_002865220.1| senescence-associated gene 12 [Arabidopsis lyrata subsp. lyrata]
gi|297311055|gb|EFH41479.1| senescence-associated gene 12 [Arabidopsis lyrata subsp. lyrata]
Length = 346
Score = 199 bits (507), Expect = 8e-49, Method: Compositional matrix adjust.
Identities = 100/204 (49%), Positives = 135/204 (66%), Gaps = 5/204 (2%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEI-HEKWMAQHGRSYKDELEKEMRLKIFKENLE 76
+F+ + + S + SR + +++ H +WM +HGR Y D EK R +FK N+E
Sbjct: 8 IFLFVAIFSSFYFSISLSRPLDNELIMQKRHIEWMTKHGRVYADVKEKSNRYVVFKSNVE 67
Query: 77 YIEKANK-EGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSP--SHRSTTSSTFKYQNLSM 133
IE N RT+KL NQF+DLTNDEFR++YTG+K S S T +++F+YQN+S
Sbjct: 68 RIEHLNNIPAGRTFKLAVNQFADLTNDEFRSMYTGFKGVSSLSSQSQTKTTSFRYQNVSS 127
Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST 193
+P S+DWR KGAVTPIKNQ CGCCWAF+AVAA+EG T+I+ G LI LSEQQL+DC T
Sbjct: 128 GALPISVDWRTKGAVTPIKNQGSCGCCWAFSAVAAIEGATQIKKGKLISLSEQQLVDCDT 187
Query: 194 NGNNGCLGGSREKAFAYIIQNQGI 217
N + GC GG + AF +I+ G+
Sbjct: 188 N-DFGCEGGLMDTAFEHIMATGGL 210
Score = 91.3 bits (225), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 37/76 (48%), Positives = 54/76 (71%), Gaps = 4/76 (5%)
Query: 208 FAYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD- 266
F + + G+F G C T LDHAVT +G+G + +G+ YW+IKNSWG WG++GYM+I +D
Sbjct: 269 FDFQFYSSGVFTGECTTYLDHAVTAIGYGQSTNGSKYWIIKNSWGTKWGESGYMRIQKDI 328
Query: 267 ---EGLCGIGTRSSYP 279
+GLCG+ ++SYP
Sbjct: 329 KDKQGLCGLAMKASYP 344
>gi|356563584|ref|XP_003550041.1| PREDICTED: cysteine proteinase RD21a-like [Glycine max]
Length = 366
Score = 199 bits (507), Expect = 8e-49, Method: Compositional matrix adjust.
Identities = 118/338 (34%), Positives = 170/338 (50%), Gaps = 74/338 (21%)
Query: 15 TTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKEN 74
+T +F+ TL SCA + + + V+ ++E+W+ +H + Y EK+ R ++FK+N
Sbjct: 10 STLLFLSFTL--SCAIDTSTITNYTDNEVMTMYEEWLVKHQKVYNGLGEKDKRFQVFKDN 67
Query: 75 LEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSST-FKYQNLSM 133
L +I++ N N TYKLG N+F+D+TN+E+R +Y G K + T ST +Y +
Sbjct: 68 LGFIQEHNNNQNNTYKLGLNKFADMTNEEYRVMYFGTKSDAKRRLMKTKSTGHRYAYSAG 127
Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST 193
+P +DWR KGAV PIK+Q CG CWAF+ VA VE I KI +G + LSEQ+L+DC
Sbjct: 128 DQLPVHVDWRVKGAVAPIKDQGSCGSCWAFSTVATVEAINKIVTGKFVSLSEQELVDCDR 187
Query: 194 NGNNGCLGGSREKAFAYIIQNQGI----------FNGVCGTQLDH--AVTIVGF------ 235
N GC GG + AF +IIQN GI F+G+C + AV I G+
Sbjct: 188 AYNQGCNGGLMDYAFEFIIQNGGIDTDKDYPYRGFDGICDPTKKNAKAVNIDGYEDVPPY 247
Query: 236 -------------------------------------GTTED------------GANYWL 246
GT+ D G +YWL
Sbjct: 248 DENALKKAVARQPVSIAIEASGRALQLYQSGVFTGECGTSLDHGVVVVGYGSENGVDYWL 307
Query: 247 IKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 280
++NSWG WG+ GY K+ R+ G CGI +SYP+
Sbjct: 308 VRNSWGTGWGEDGYFKMQRNVRTPTGKCGITMEASYPV 345
>gi|359485281|ref|XP_002280230.2| PREDICTED: LOW QUALITY PROTEIN: KDEL-tailed cysteine endopeptidase
CEP1 [Vitis vinifera]
Length = 341
Score = 199 bits (507), Expect = 9e-49, Method: Compositional matrix adjust.
Identities = 102/201 (50%), Positives = 143/201 (71%), Gaps = 9/201 (4%)
Query: 19 FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
++ +L + ASQ ++R+ HE S+ E HE WMAQ+GR YKD EK R KIFK+N+ I
Sbjct: 12 LALLFVLAAWASQA-TARNLHEASMYERHEDWMAQYGRVYKDADEKSKRYKIFKDNVARI 70
Query: 79 EKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHR-STTSSTFKYQNLSMTDVP 137
E NK +++YKL N+F+DLTN+EF +K +H ST +++FKY+N+ T VP
Sbjct: 71 ESFNKAMDKSYKLSINEFADLTNEEFGTSRNRFK----AHICSTEATSFKYENV--TAVP 124
Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-N 196
+++DWR KGAVTPIK+Q +CG CWAF+AVAA+EGIT++ +G LI LSEQ+L+DC T+G +
Sbjct: 125 STIDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGED 184
Query: 197 NGCLGGSREKAFAYIIQNQGI 217
GC GG + AF +I QN G+
Sbjct: 185 QGCNGGLMDDAFKFIKQNHGL 205
Score = 98.6 bits (244), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 41/78 (52%), Positives = 55/78 (70%), Gaps = 4/78 (5%)
Query: 208 FAYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD- 266
F + + G+F G CGT+LDH V VG+GT++DG YWL+KNSWG WG+ GY+++ RD
Sbjct: 264 FEFQFYSSGVFTGQCGTELDHGVAAVGYGTSDDGMKYWLVKNSWGTGWGEEGYIRMQRDV 323
Query: 267 ---EGLCGIGTRSSYPLA 281
EGLCGI ++SYP A
Sbjct: 324 TAKEGLCGIAMQASYPTA 341
>gi|22759715|dbj|BAC10906.1| cysteine proteinase [Zinnia elegans]
Length = 352
Score = 199 bits (506), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 120/357 (33%), Positives = 180/357 (50%), Gaps = 87/357 (24%)
Query: 1 MVLIFERSGSFKINTTPMFIIITLLVSCASQ-------VVSSRSTHEQSVVEIHEKWMAQ 53
M IF T+ +F+ +++L A T V+ + E W+ +
Sbjct: 1 MAFIFSSK-----KTSLLFLFVSILACSALAHEFSILGYAPEDLTSIHKVIHLFESWLVK 55
Query: 54 HGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKM 113
H + Y+ EK R +IF +NL++I++ NK+ + Y LG N+F+DLT++EF+ + G+K
Sbjct: 56 HSKFYESLDEKLHRFEIFMDNLKHIDETNKKVS-NYWLGLNEFADLTHEEFKHKFLGFKG 114
Query: 114 PSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGIT 173
+ +S F Y++ D+P S+DWR KGAV P+KNQ +CG CWAF+ VAAVEGI
Sbjct: 115 ELAERKDESSKEFGYRDF--VDLPKSVDWRKKGAVAPVKNQGQCGSCWAFSTVAAVEGIN 172
Query: 174 KIRSGNLIQLSEQQLLDCST---NGNNGCL-----------GGSREKAFAYI-------- 211
+I +GNL LSEQ+L+DC T NG NG L G +E+ + YI
Sbjct: 173 QIVTGNLTMLSEQELIDCDTTFNNGCNGGLMDYAFAYVMRSGLHKEEEYPYIMSEGTCDE 232
Query: 212 ---------------------------IQNQ------------------GIFNGVCGTQL 226
+ NQ G+F+G CGT+L
Sbjct: 233 KKDVSEKVTISGYHDVPRNDEASFLKALANQPISVAIEASGRDFQFYSGGVFDGHCGTEL 292
Query: 227 DHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR----DEGLCGIGTRSSYP 279
DH V VG+GTT+ G +Y +++NSWG WG+ GY+++ R G+CG+ +SYP
Sbjct: 293 DHGVAAVGYGTTK-GLDYVIVRNSWGPKWGEKGYIRMKRGSGKPHGMCGLYMMASYP 348
>gi|357130486|ref|XP_003566879.1| PREDICTED: fruit bromelain-like [Brachypodium distachyon]
Length = 354
Score = 199 bits (506), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 118/318 (37%), Positives = 156/318 (49%), Gaps = 80/318 (25%)
Query: 42 SVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTN 101
+V HE+WMA+ GR+YKD EK R ++F N +++ N+ GNRTY LG N FSDLT+
Sbjct: 33 TVASRHERWMARFGRAYKDADEKARRQEVFGANARHVDAVNRSGNRTYTLGLNHFSDLTD 92
Query: 102 DEFRALYTGYK--MPSPSH--RSTTSSTFKYQNLS--MTDVPTSLDWRDKGAVTPIKNQK 155
EF + GY+ P P R K L+ DVP S+DWR +GAVT IKNQ+
Sbjct: 93 HEFLQQHLGYRHHQPGPGGLLRPEDQDMSKATALADYGQDVPDSVDWRAQGAVTEIKNQR 152
Query: 156 ECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLD-------CSTNGNNGCL-------G 201
CG CWAFAAVAA EG+ KI +GNLI +SEQQ+LD C N L G
Sbjct: 153 SCGSCWAFAAVAATEGLVKIATGNLISMSEQQVLDCTGGGNTCDGGDINAALRYVAASGG 212
Query: 202 GSREKAFAYIIQ--------------------------NQGIFNGV-------------- 221
E A+AY Q ++G G+
Sbjct: 213 LQPEAAYAYAAQKGACRGASPANSAASVGGARFARLGGDEGALRGLAAGQPVAVALEASE 272
Query: 222 ----------------CGTQLDHAVTIVGFGTTED-GANYWLIKNSWGNTWGDAGYMKIV 264
CG +L+H VT+VG+G +D G YW++KN WG WG+ GYM++
Sbjct: 273 PDFRHYKSGVYAGSASCGRRLNHGVTVVGYGAEDDSGDEYWVVKNQWGTLWGEKGYMRVA 332
Query: 265 RDE---GLCGIGTRSSYP 279
R + CGI + + YP
Sbjct: 333 RGDVAGANCGIASYAYYP 350
>gi|125547236|gb|EAY93058.1| hypothetical protein OsI_14861 [Oryza sativa Indica Group]
Length = 339
Score = 199 bits (506), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 93/201 (46%), Positives = 136/201 (67%), Gaps = 4/201 (1%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
+F I+ L C++ + + + + ++ HE+WMAQ+GR Y+D+ EK R ++FK N+ +
Sbjct: 8 LFAILGCLCLCSAVLAARELSDDAAMAARHERWMAQYGRVYRDDAEKARRFEVFKANVAF 67
Query: 78 IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
IE N GN + LG NQF+DLTNDEFR T + R T F+Y+N+++ +P
Sbjct: 68 IESFNA-GNHNFWLGVNQFADLTNDEFRWTKTNKGFIPSTTRVPTG--FRYENVNIDALP 124
Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-N 196
++DWR KGAVTPIK+Q +CGCCWAF+AVAA+EGI K+ +G LI LSEQ+L+DC +G +
Sbjct: 125 ATVDWRTKGAVTPIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLISLSEQELVDCDVHGED 184
Query: 197 NGCLGGSREKAFAYIIQNQGI 217
GC GG + AF +II+N G+
Sbjct: 185 QGCEGGLMDDAFKFIIKNGGL 205
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 39/124 (31%), Positives = 65/124 (52%), Gaps = 17/124 (13%)
Query: 165 AVAAVEGITKIRSGN---LIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIFNGV 221
+VA+++G + + N L++ Q + + +G + + G+ G
Sbjct: 226 SVASIKGYEDVPANNEAALMKAVANQPVSVAVDGGD----------MTFQFYKGGVMTGS 275
Query: 222 CGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSS 277
CGT LDH + +G+G DG YWL+KNSWG TWG+ G++++ +D G+CG+ S
Sbjct: 276 CGTDLDHGIVAIGYGKASDGTKYWLLKNSWGTTWGENGFLRMEKDISDKRGMCGLAMEPS 335
Query: 278 YPLA 281
YP A
Sbjct: 336 YPTA 339
>gi|302790836|ref|XP_002977185.1| hypothetical protein SELMODRAFT_106228 [Selaginella moellendorffii]
gi|300155161|gb|EFJ21794.1| hypothetical protein SELMODRAFT_106228 [Selaginella moellendorffii]
Length = 299
Score = 199 bits (506), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 113/301 (37%), Positives = 159/301 (52%), Gaps = 71/301 (23%)
Query: 46 IHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFR 105
+ E W A+HG+SY + EK RL IF + L YIEK N + N T+ LG N+FSDLTN EFR
Sbjct: 1 MFEDWAAKHGKSYSSDSEKARRLMIFSDTLAYIEKHNAQPNTTFTLGLNKFSDLTNAEFR 60
Query: 106 ALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAA 165
A Y G K SP ++ + K ++ ++ +PTSLDWR +GAVTPIK+Q +CG CWAF+A
Sbjct: 61 ANYVG-KFKSPRYQDRRPA--KDVDVDVSSLPTSLDWRQEGAVTPIKDQGQCGSCWAFSA 117
Query: 166 VAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI-------- 217
+A++E + + L+ LSEQQL+DC T + GC GG E AF ++++N G+
Sbjct: 118 IASIESAHFLATKELVSLSEQQLIDCDTV-DQGCQGGFPEDAFKFVVENGGVTTEEAYPY 176
Query: 218 --FNGVCGTQLDHAVTIVGF-------------------------GTTEDGANY------ 244
F G C + V I G+ G+ ++ NY
Sbjct: 177 TGFAGSCNANKNKVVEITGYKDVTKDSADALMKAVSKTPVTVGICGSDQNFQNYRSGILS 236
Query: 245 ------------------------WLIKNSWGNTWGDAGYMKIVRD--EGLCGIGTRSSY 278
W+IKNSWG +WG+ G+MKI + EG+CG+ +SSY
Sbjct: 237 GQCSNSRDHAVLVIGYGTEGGMPYWIIKNSWGTSWGENGFMKIKKKDGEGMCGMNGQSSY 296
Query: 279 P 279
P
Sbjct: 297 P 297
>gi|255568345|ref|XP_002525147.1| cysteine protease, putative [Ricinus communis]
gi|223535606|gb|EEF37274.1| cysteine protease, putative [Ricinus communis]
Length = 347
Score = 199 bits (506), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 113/306 (36%), Positives = 164/306 (53%), Gaps = 78/306 (25%)
Query: 47 HEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRA 106
++KW+ Q+GR Y + E +R I+ N+++IE N + N ++KL N+F+DLTNDEF +
Sbjct: 46 YDKWLEQYGRKYDTKDEYLLRFGIYHSNIQFIEYINSQ-NLSFKLTDNKFADLTNDEFNS 104
Query: 107 LYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAV 166
+Y GY++ RS + + + TD+P ++DWR+ GAVTPIK+Q +CG CWAF+AV
Sbjct: 105 IYLGYQI-----RSYKRRNLSHMHENSTDLPDAVDWRENGAVTPIKDQGQCGSCWAFSAV 159
Query: 167 AAVEGITKIRSGNLIQLSEQQLLDCSTNGNN-GCLGGSREKAFAYI-------IQNQGIF 218
AAVEGI KI++GNL+ LSEQ+L+DC NG+N GC GG EKAF +I +N +
Sbjct: 160 AAVEGINKIKTGNLVSLSEQELVDCDVNGDNKGCNGGFMEKAFTFIKSIGGLTTENDYPY 219
Query: 219 NGVCGT-----QLDHAVTIVGFGTTE--------------------DGANY--------- 244
G G+ +HAV I G+ T D + Y
Sbjct: 220 KGTDGSCEKAKTDNHAVIIGGYETVPANNENSLKVAVSKQPVSVAIDASGYEFQLYSEGV 279
Query: 245 --------------------------WLIKNSWGNTWGDAGYMKIVRD----EGLCGIGT 274
WL+KNSWG WG++GY+++ RD +G+CGI
Sbjct: 280 FSGYCGIQLNHGVTIVGYGDNNGQKYWLVKNSWGKGWGESGYIRMKRDSSDTKGMCGIAM 339
Query: 275 RSSYPL 280
SYP+
Sbjct: 340 EPSYPI 345
>gi|255580657|ref|XP_002531151.1| cysteine protease, putative [Ricinus communis]
gi|223529264|gb|EEF31236.1| cysteine protease, putative [Ricinus communis]
Length = 340
Score = 199 bits (505), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 109/229 (47%), Positives = 149/229 (65%), Gaps = 21/229 (9%)
Query: 21 IITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEK 80
+I LL + SQ ++ R+ + S+ E HE+WM++ GR Y D EKE+R KIFKEN++ IE
Sbjct: 14 LIFLLGALVSQAMA-RTLQDASMHEKHEEWMSRFGRVYNDGNEKEIRYKIFKENVQRIES 72
Query: 81 ANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHR-STTSSTFKYQNLSMTDVPTS 139
NK ++YKLG NQF+DLTN+EF+ +K H S+ + F+Y+NL T P+S
Sbjct: 73 FNKASGKSYKLGINQFADLTNEEFKTSRNRFK----GHMCSSQAGPFRYENL--TAAPSS 126
Query: 140 LDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNG 198
+DWR KGAVT IK+Q +CG CWAF+AVAAVEGIT++ + LI LSEQ+L+DC T G + G
Sbjct: 127 MDWRKKGAVTAIKDQGQCGSCWAFSAVAAVEGITQLATSKLISLSEQELVDCDTKGEDQG 186
Query: 199 CLGGSREKAFAYIIQNQGIF----------NGVCGT--QLDHAVTIVGF 235
C GG + AF +I QNQG+ +G C T + +HA I GF
Sbjct: 187 CQGGLMDDAFKFIEQNQGLTTEANYPYEGSDGTCNTKQEANHAAKINGF 235
Score = 91.3 bits (225), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 40/78 (51%), Positives = 54/78 (69%), Gaps = 5/78 (6%)
Query: 208 FAYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD- 266
F + + GIF G CGT+LDH V VG+G + +G NYWL+KNSWG WG+ GY+++ +D
Sbjct: 264 FGFQFYSSGIFTGDCGTELDHGVAAVGYGES-NGMNYWLVKNSWGTQWGEEGYIRMQKDI 322
Query: 267 ---EGLCGIGTRSSYPLA 281
EGLCGI ++SYP A
Sbjct: 323 DAKEGLCGIAMQASYPTA 340
>gi|9502421|gb|AAF88120.1|AC021043_13 Putative cysteine proteinase [Arabidopsis thaliana]
Length = 331
Score = 198 bits (504), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 99/192 (51%), Positives = 131/192 (68%), Gaps = 5/192 (2%)
Query: 30 SQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTY 89
SQ S + HE V E H++WM + R Y DELEK+MR +FK+NL++IEK NK+G+RTY
Sbjct: 6 SQATSRVTFHEPIVAEHHQQWMTRFSRVYSDELEKQMRFDVFKKNLKFIEKFNKKGDRTY 65
Query: 90 KLGTNQFSDLTNDEFRALYTGYK----MPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDK 145
KLG N+F+D T +EF A +TG K +PS ++ + N+S + DWR +
Sbjct: 66 KLGVNEFADWTREEFIATHTGLKGVNGIPSSEFVDEMIPSWNW-NVSDVAGRETKDWRYE 124
Query: 146 GAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSRE 205
GAVTP+K Q +CGCCWAF++VAAVEG+TKI NL+ LSEQQLLDC +NGC GG
Sbjct: 125 GAVTPVKYQGQCGCCWAFSSVAAVEGLTKIVGNNLVSLSEQQLLDCDRERDNGCNGGIMS 184
Query: 206 KAFAYIIQNQGI 217
AF+YII+N+GI
Sbjct: 185 DAFSYIIKNRGI 196
Score = 82.0 bits (201), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 34/64 (53%), Positives = 47/64 (73%), Gaps = 4/64 (6%)
Query: 222 CGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSS 277
CGT ++HAVT VG+GT+ +G YWL KNSWG TWG+ GY++I RD +G+CG+ +
Sbjct: 268 CGTNVNHAVTFVGYGTSPEGIKYWLAKNSWGETWGENGYIRIRRDVAWPQGMCGVAQYAF 327
Query: 278 YPLA 281
YP+A
Sbjct: 328 YPVA 331
>gi|224099295|ref|XP_002334495.1| predicted protein [Populus trichocarpa]
gi|222872550|gb|EEF09681.1| predicted protein [Populus trichocarpa]
Length = 342
Score = 198 bits (504), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 103/243 (42%), Positives = 144/243 (59%), Gaps = 16/243 (6%)
Query: 19 FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
F L++ + V+SR E + HE+WMA +G+ Y D EKE R KIFK N+EYI
Sbjct: 10 FFAFILILGMWAFEVASRELQESYMSARHEQWMATYGKVYVDAAEKERRFKIFKNNVEYI 69
Query: 79 EKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPT 138
E N GN+ YKL N+F+D TN++F+ GY+ P + R ++FKY+N+ T VP
Sbjct: 70 ESFNTAGNKPYKLSVNKFADQTNEKFKGARNGYRRPFQT-RPMKVTSFKYENV--TAVPA 126
Query: 139 SLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NN 197
++DWR KGAVTPIK+Q +CG CWAF+ VAA EGI ++ +G L+ LSEQ+L+DC G +
Sbjct: 127 TMDWRKKGAVTPIKDQGQCGSCWAFSTVAATEGINQLTTGKLVSLSEQELVDCDNQGEDQ 186
Query: 198 GCLGGSREKAFAYIIQNQGI----------FNGVCGT--QLDHAVTIVGFGTTEDGANYW 245
GC GG E F +II+N GI +G C + Q H I G+ + +
Sbjct: 187 GCEGGLMEDGFEFIIKNHGITTEANYPYQAADGTCNSKKQASHIAKITGYESVPANSEAE 246
Query: 246 LIK 248
L+K
Sbjct: 247 LLK 249
Score = 96.3 bits (238), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 49/122 (40%), Positives = 67/122 (54%), Gaps = 14/122 (11%)
Query: 164 AAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIFNGVCG 223
A + E + L+++ Q + S + GGS + + G+F G CG
Sbjct: 231 AKITGYESVPANSEAELLKVVANQPISVSIDA-----GGSD-----FQFYSSGVFTGKCG 280
Query: 224 TQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 279
T+LDH VT VG+G T DG YWL+KNSW +WG+ GY+++ RD EGLCGI SSYP
Sbjct: 281 TELDHGVTAVGYGETSDGTKYWLVKNSWXTSWGEEGYIRMQRDIDAEEGLCGIAMDSSYP 340
Query: 280 LA 281
A
Sbjct: 341 TA 342
>gi|242093994|ref|XP_002437487.1| hypothetical protein SORBIDRAFT_10g027980 [Sorghum bicolor]
gi|241915710|gb|EER88854.1| hypothetical protein SORBIDRAFT_10g027980 [Sorghum bicolor]
Length = 341
Score = 198 bits (504), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 112/300 (37%), Positives = 166/300 (55%), Gaps = 65/300 (21%)
Query: 40 EQSVVEIHEKWMAQHGRSYKD-ELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQ 95
++ V ++++ W ++HGR + +RLK+F++NL YI+ N E G T++LG
Sbjct: 44 DEEVRQLYKTWKSEHGRPRDGISVADGLRLKVFRDNLRYIDAHNAEADAGLHTFRLGLTP 103
Query: 96 FSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQK 155
F+DLT +EFRA G+ + S R + +Y + D+P ++DWR +GAVT +KNQ
Sbjct: 104 FTDLTLEEFRAHALGF-LNSTLPRVASD---RYLPRAGDDLPDAVDWRQQGAVTGVKNQL 159
Query: 156 ECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQN- 214
+CG CWAF+AVAA+EGI KI + NLI LSEQ+L+DC T + GC GG +KAF ++I N
Sbjct: 160 DCGGCWAFSAVAAMEGINKIVTNNLISLSEQELIDCDTE-DYGCQGGEMQKAFQFVIDNG 218
Query: 215 --------------------------------------------------QGIFNGVCGT 224
GIFNG CG
Sbjct: 219 GIDTEADYPFIGTNGTCDAIREKRKVVSIDSYENVPTNDEEALQKAVANQPGIFNGPCGF 278
Query: 225 QLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 280
LDH VT VG+G +++G ++W++KNSWG WG++GY+++ R+ G CGI +SYP+
Sbjct: 279 ILDHGVTAVGYG-SDNGEDFWIVKNSWGAEWGESGYIRMKRNVLLPMGKCGIAMYASYPV 337
>gi|363807062|ref|NP_001242584.1| uncharacterized protein LOC100804015 precursor [Glycine max]
gi|255640677|gb|ACU20623.1| unknown [Glycine max]
Length = 366
Score = 198 bits (504), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 117/337 (34%), Positives = 171/337 (50%), Gaps = 74/337 (21%)
Query: 16 TPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENL 75
T +F+ TL + + + + + +E V+ ++E+W+ +H + Y + +K+ R ++FK+NL
Sbjct: 9 TLLFLSFTLSYAIKTSTIINYTDNE--VMAMYEEWLVRHQKGYNELGKKDKRFQVFKDNL 66
Query: 76 EYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD 135
+I++ N N TYKLG N+F+D+TN+E+RA+Y G K + T ST S D
Sbjct: 67 GFIQEHNNNLNNTYKLGLNKFADMTNEEYRAMYLGTKSNAKRRLMKTKSTGHRYAFSARD 126
Query: 136 -VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN 194
+P +DWR KGAV PIK+Q CG CWAF+ VA VE I KI +G + LSEQ+L+DC
Sbjct: 127 RLPVHVDWRMKGAVAPIKDQGSCGSCWAFSTVATVEAINKIVTGKFVSLSEQELVDCDRA 186
Query: 195 GNNGCLGGSREKAFAYIIQNQGI----------FNGVCGTQLDHA--VTIVGF------- 235
N GC GG + AF +IIQN GI F+G+C +A V I G+
Sbjct: 187 YNEGCNGGLMDYAFEFIIQNGGIDTDKDYPYRGFDGICDPTKKNAKVVNIDGYEDVPPYD 246
Query: 236 ------------------------------------GTTED------------GANYWLI 247
GT+ D G +YWL+
Sbjct: 247 ENALKKAVAHQPVSVAIEASGRALQLYQSGVFTGKCGTSLDHGVVVVGYGSENGVDYWLV 306
Query: 248 KNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 280
+NSWG WG+ GY K+ R+ G CGI +SYP+
Sbjct: 307 RNSWGTGWGEDGYFKMQRNVRTSTGKCGITMEASYPV 343
>gi|218202087|gb|EEC84514.1| hypothetical protein OsI_31214 [Oryza sativa Indica Group]
Length = 348
Score = 198 bits (504), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 93/201 (46%), Positives = 134/201 (66%), Gaps = 4/201 (1%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
+F I+ L C++ + + + + ++ HE+WMAQ+GR YKD+ EK R ++FK N +
Sbjct: 8 LFAILGCLCLCSAVLAARELSDDAAMAARHERWMAQYGRMYKDDAEKARRFEVFKANAAF 67
Query: 78 IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
IE N GN + LG NQF+DLTNDEFR T + R T F+Y+N+++ +P
Sbjct: 68 IESFNA-GNHKFWLGVNQFADLTNDEFRLTKTNKGFIPSTTRVPTG--FRYENVNIDALP 124
Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-N 196
++DWR KG VTPIK+Q +CGCCWAF+AVAA+EGI K+ +G LI LSEQ+L+DC +G +
Sbjct: 125 ATMDWRTKGVVTPIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLISLSEQELVDCDVHGED 184
Query: 197 NGCLGGSREKAFAYIIQNQGI 217
GC GG + AF +II+N G+
Sbjct: 185 QGCEGGLMDDAFKFIIKNGGL 205
Score = 74.3 bits (181), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 35/115 (30%), Positives = 62/115 (53%), Gaps = 17/115 (14%)
Query: 165 AVAAVEGITKIRSGN---LIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIFNGV 221
+VA+++G + + N L++ Q + + +G++ + G+ G
Sbjct: 226 SVASIKGYEDVPANNEAALMKAVANQPVSVAVDGDD----------MTFQFYKGGVMIGS 275
Query: 222 CGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGI 272
CGT LDH + +G+G DG YWL+KNSWG TWG+ G++++ +D G+CG+
Sbjct: 276 CGTDLDHGIVAIGYGKASDGTKYWLLKNSWGMTWGENGFLRMEKDISDKRGMCGL 330
>gi|356508490|ref|XP_003522989.1| PREDICTED: xylem cysteine proteinase 2-like [Glycine max]
Length = 349
Score = 198 bits (503), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 115/309 (37%), Positives = 170/309 (55%), Gaps = 78/309 (25%)
Query: 43 VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTND 102
++E+ E WM++HG+ Y+ EK +R +IFK+NL++I++ NK + Y LG N+F+DL++
Sbjct: 43 LIELFESWMSKHGKIYQSIEEKLLRFEIFKDNLKHIDERNKVVS-NYWLGLNEFADLSHQ 101
Query: 103 EFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWA 162
EF+ Y G K+ S R + F Y+++ ++P S+DWR KGAV P+KNQ CG CWA
Sbjct: 102 EFKNKYLGLKVDY-SRRRESPEEFTYKDV---ELPKSVDWRKKGAVAPVKNQGSCGSCWA 157
Query: 163 FAAVAAVEGITKIRSGNLIQLSEQQLLDCS---TNGNNGCL------------GGSREKA 207
F+ VAAVEGI +I +GNL LSEQ+L+DC NG NG L G +E+
Sbjct: 158 FSTVAAVEGINQIVTGNLTSLSEQELIDCDRTYNNGCNGGLMDYAFSFIVENGGLHKEED 217
Query: 208 FAYIIQ-----------------------------------NQ----------------- 215
+ YI++ NQ
Sbjct: 218 YPYIMEEGTCEMTKEETEVVTISGYHDVPQNNEQSLLKALANQPLSVAIEASGRDFQFYS 277
Query: 216 -GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLC 270
G+F+G CG+ LDH V VG+GT + G +Y ++KNSWG+ WG+ GY+++ R+ EG+C
Sbjct: 278 GGVFDGHCGSDLDHGVAAVGYGTAK-GVDYIIVKNSWGSKWGEKGYIRMRRNIGKPEGIC 336
Query: 271 GIGTRSSYP 279
GI +SYP
Sbjct: 337 GIYKMASYP 345
>gi|37780041|gb|AAP32193.1| cysteine protease 14 [Trifolium repens]
Length = 351
Score = 198 bits (503), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 114/310 (36%), Positives = 173/310 (55%), Gaps = 78/310 (25%)
Query: 43 VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTND 102
++E+ E WM++HG+ Y+ EK +R ++FK+NL++I++ NK + Y LG N+F+DL++
Sbjct: 43 LIELFESWMSRHGKIYETIEEKLLRFEVFKDNLKHIDERNKIVS-NYWLGLNEFADLSHQ 101
Query: 103 EFRALYTGYKMPSPSHR-STTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCW 161
EF+ Y G K+ R S+ F Y+++ D+P S+DWR KGAVTP+KNQ +CG CW
Sbjct: 102 EFKNKYLGLKVNLSQRRESSNEEEFTYRDV---DLPKSVDWRKKGAVTPVKNQGQCGSCW 158
Query: 162 AFAAVAAVEGITKIRSGNLIQLSEQQLLDCST---NGNNGCL------------GGSREK 206
AF+ VAAVEGI +I +GNL LSEQ+L+DC T NG NG L G +E
Sbjct: 159 AFSTVAAVEGINQIVTGNLTSLSEQELIDCDTTYNNGCNGGLMDYAFSFIVQNGGLHKED 218
Query: 207 AFAYIIQ-----------------------------------NQ---------------- 215
+ YI++ NQ
Sbjct: 219 DYPYIMEESTCEMKKEETQVVTINGYHDVPQNNEQSLLKALANQPLSVAIEASSRDFQFY 278
Query: 216 --GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGL 269
G+F+G CG+ LDH V+ VG+GT+++ +Y ++KNSWG WG+ G++++ R+ EG+
Sbjct: 279 SGGVFDGHCGSDLDHGVSAVGYGTSKN-LDYIIVKNSWGAKWGEKGFIRMKRNIGKPEGI 337
Query: 270 CGIGTRSSYP 279
CG+ +SYP
Sbjct: 338 CGLYKMASYP 347
>gi|121308860|dbj|BAF43527.1| cysteine proteinase [Zinnia elegans]
Length = 352
Score = 197 bits (502), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 112/310 (36%), Positives = 166/310 (53%), Gaps = 75/310 (24%)
Query: 41 QSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLT 100
V+ + E W+ +H + Y+ EK R +IF +NL++I++ NK+ + Y LG N+F+DLT
Sbjct: 43 HKVIHLFESWLVKHSKFYESLDEKLHRFEIFMDNLKHIDETNKKVS-NYWLGLNEFADLT 101
Query: 101 NDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCC 160
++EF+ + G+K + +S F Y++ D+P S+DWR KGAV P+KNQ +CG C
Sbjct: 102 HEEFKHKFLGFKGELAERKDESSKEFGYRDF--VDLPKSVDWRKKGAVAPVKNQGQCGNC 159
Query: 161 WAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST---NGNNGCL-----------GGSREK 206
WAF+ VAAVEGI +I +GNL LSEQ+L+DC T NG NG L G +E+
Sbjct: 160 WAFSTVAAVEGINQIVTGNLTMLSEQELIDCDTTFNNGCNGGLMDYAFAYVMRSGLHKEE 219
Query: 207 AFAYI-----------------------------------IQNQ---------------- 215
+ YI + NQ
Sbjct: 220 EYPYIMSEGTCDEKKDVSEKVTISGYHDVPRNDEASFLKALANQPISVAIEASGRDFQFY 279
Query: 216 --GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR----DEGL 269
G+F+G CGT+LDH V VG+GTT+ G +Y +++NSWG WG+ GY+++ R G+
Sbjct: 280 SGGVFDGHCGTELDHGVAAVGYGTTK-GLDYVIVRNSWGPKWGEKGYIRMKRGSGKPHGM 338
Query: 270 CGIGTRSSYP 279
CG+ +SYP
Sbjct: 339 CGLYMMASYP 348
>gi|357113934|ref|XP_003558756.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like
[Brachypodium distachyon]
Length = 346
Score = 197 bits (501), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 96/203 (47%), Positives = 135/203 (66%), Gaps = 5/203 (2%)
Query: 18 MFIIITLLVSCASQVVSSR--STHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENL 75
+ I+ L C++ V+++R + ++ HE+WMAQ GR YKD EK RL++FK N+
Sbjct: 10 LVAIVGCLCLCSTAVLAARELGDADNAMAARHEQWMAQFGRVYKDPAEKAHRLEVFKANV 69
Query: 76 EYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD 135
+IE N E N + LG NQF+DLTNDEFRA T + R + FKY ++S+
Sbjct: 70 AFIESFNAE-NHEFWLGANQFADLTNDEFRASKTNKGIKQGGVRDAPTG-FKYSDVSIDA 127
Query: 136 VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG 195
+P S+DWR KGAVTPIKNQ +CG CWAF+AVAA EG+ K+ +G L+ LSEQ+L+DC +G
Sbjct: 128 LPASVDWRTKGAVTPIKNQGQCGSCWAFSAVAATEGVVKLSTGKLVSLSEQELVDCDVHG 187
Query: 196 -NNGCLGGSREKAFAYIIQNQGI 217
+ GC+GG + AF +II+N G+
Sbjct: 188 VDQGCMGGWMDDAFKFIIKNGGL 210
Score = 82.0 bits (201), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 29/68 (42%), Positives = 46/68 (67%), Gaps = 4/68 (5%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCG 271
G+ G CG ++DH + +G+G T +G YWL+KNSWG TWG+ G++++ +D G+CG
Sbjct: 277 GVMTGSCGVEMDHGIAAIGYGATSNGTKYWLMKNSWGTTWGEKGFLRMAKDIPDKRGMCG 336
Query: 272 IGTRSSYP 279
+ + SYP
Sbjct: 337 LAMKPSYP 344
>gi|225446585|ref|XP_002280215.1| PREDICTED: vignain [Vitis vinifera]
Length = 341
Score = 197 bits (501), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 101/199 (50%), Positives = 143/199 (71%), Gaps = 9/199 (4%)
Query: 21 IITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEK 80
++ L + ASQ ++R+ E S+ E HE WMAQ+GR YKD EK R KIFK+N+ IE
Sbjct: 14 LLFFLAAWASQA-TARNLLEASMYERHEDWMAQYGRVYKDADEKSKRYKIFKDNVARIES 72
Query: 81 ANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHR-STTSSTFKYQNLSMTDVPTS 139
NK +++YKL N+F+DLTN+EFRA +K +H ST +++FKY++++ VP++
Sbjct: 73 FNKAMDKSYKLSINEFADLTNEEFRASRNRFK----AHICSTEATSFKYEHVAA--VPST 126
Query: 140 LDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNG 198
+DWR KGAVTPIK+Q +CG CWAF+AVAA+EGIT++ +G LI LSEQ+L+DC T+G + G
Sbjct: 127 VDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGEDQG 186
Query: 199 CLGGSREKAFAYIIQNQGI 217
C GG + AF +I QN G+
Sbjct: 187 CNGGLMDDAFKFIEQNHGL 205
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 41/78 (52%), Positives = 55/78 (70%), Gaps = 4/78 (5%)
Query: 208 FAYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD- 266
F + + G+F G CGT+LDH V VG+GT++DG YWL+KNSWG WG+ GY+++ RD
Sbjct: 264 FEFQFYSSGVFTGQCGTELDHGVAAVGYGTSDDGMKYWLVKNSWGTGWGEVGYIRMQRDV 323
Query: 267 ---EGLCGIGTRSSYPLA 281
EGLCGI ++SYP A
Sbjct: 324 TAKEGLCGIAMQASYPTA 341
>gi|356515048|ref|XP_003526213.1| PREDICTED: vignain-like [Glycine max]
Length = 350
Score = 197 bits (501), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 100/200 (50%), Positives = 136/200 (68%), Gaps = 8/200 (4%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
+ ++ LL C SQV+S R+ HE S+ E HE+WM ++G+ YKD EK+ RL IFK+N+E+
Sbjct: 10 ILALVLLLSICTSQVMS-RNLHEASMSERHEQWMKKYGKVYKDAAEKQKRLLIFKDNVEF 68
Query: 78 IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
IE N GN+ YKL N +D TN+EF A + GYK SH T FKY N+ TD+P
Sbjct: 69 IESFNAAGNKPYKLSINHLADQTNEEFVASHNGYKYKG-SHSQT---PFKYGNV--TDIP 122
Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNN 197
T++DWR GAVT +K+Q +CG CWAF+ VAA EGI +I +G L+ LSEQ+L+DC + ++
Sbjct: 123 TAVDWRQNGAVTAVKDQGQCGSCWAFSTVAATEGIYQISTGMLMSLSEQELVDCDSV-DH 181
Query: 198 GCLGGSREKAFAYIIQNQGI 217
GC GG E F +II+N GI
Sbjct: 182 GCDGGLMEDGFEFIIKNGGI 201
Score = 94.4 bits (233), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 40/71 (56%), Positives = 53/71 (74%), Gaps = 5/71 (7%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGAN-YWLIKNSWGNTWGDAGYMKIVR----DEGLC 270
G+F G CGTQLDH VT+VG+GTT+DG + YW++KNSWG WG+ GY+++ R EGLC
Sbjct: 268 GVFTGQCGTQLDHGVTVVGYGTTDDGTHEYWIVKNSWGTQWGEEGYIRMQRGIDAQEGLC 327
Query: 271 GIGTRSSYPLA 281
GI +SYP+
Sbjct: 328 GIAMDASYPMG 338
>gi|18422605|ref|NP_568651.1| senescence-associated protein 12 [Arabidopsis thaliana]
gi|13877737|gb|AAK43946.1|AF370131_1 putative senescence-specific cysteine protease SAG12 [Arabidopsis
thaliana]
gi|9758936|dbj|BAB09317.1| senescence-specific cysteine protease [Arabidopsis thaliana]
gi|14532898|gb|AAK64131.1| putative senescence-specific cysteine protease SAG12 [Arabidopsis
thaliana]
gi|332007929|gb|AED95312.1| senescence-associated protein 12 [Arabidopsis thaliana]
Length = 346
Score = 197 bits (501), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 102/204 (50%), Positives = 133/204 (65%), Gaps = 5/204 (2%)
Query: 18 MFIIITLLVS-CASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLE 76
+F+ + + S C S +S +E + + H +WM +HGR Y D E+ R +FK N+E
Sbjct: 8 IFLFVAIFSSFCFSITLSRPLDNELIMQKRHIEWMTKHGRVYADVKEENNRYVVFKNNVE 67
Query: 77 YIEKANK-EGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSP--SHRSTTSSTFKYQNLSM 133
IE N RT+KL NQF+DLTNDEFR++YTG+K S S T S F+YQN+S
Sbjct: 68 RIEHLNSIPAGRTFKLAVNQFADLTNDEFRSMYTGFKGVSALSSQSQTKMSPFRYQNVSS 127
Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST 193
+P S+DWR KGAVTPIKNQ CGCCWAF+AVAA+EG T+I+ G LI LSEQQL+DC T
Sbjct: 128 GALPVSVDWRKKGAVTPIKNQGSCGCCWAFSAVAAIEGATQIKKGKLISLSEQQLVDCDT 187
Query: 194 NGNNGCLGGSREKAFAYIIQNQGI 217
N + GC GG + AF +I G+
Sbjct: 188 N-DFGCEGGLMDTAFEHIKATGGL 210
Score = 91.3 bits (225), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 37/76 (48%), Positives = 54/76 (71%), Gaps = 4/76 (5%)
Query: 208 FAYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD- 266
F + + G+F G C T LDHAVT +G+G + +G+ YW+IKNSWG WG++GYM+I +D
Sbjct: 269 FDFQFYSSGVFTGECTTYLDHAVTAIGYGESTNGSKYWIIKNSWGTKWGESGYMRIQKDV 328
Query: 267 ---EGLCGIGTRSSYP 279
+GLCG+ ++SYP
Sbjct: 329 KDKQGLCGLAMKASYP 344
>gi|144905112|dbj|BAF56429.1| cysteine proteinase [Lotus japonicus]
Length = 341
Score = 197 bits (501), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 106/230 (46%), Positives = 139/230 (60%), Gaps = 17/230 (7%)
Query: 19 FIIITLLVSCASQVVSSRSTHEQ--SVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLE 76
+ + LL++ V SR HE S++E HE+WMA++ + YKD EKE R IFK+N+E
Sbjct: 11 ILALFLLLAVGISRVISRELHETETSLIERHEQWMAKYDKVYKDAAEKEKRFLIFKDNVE 70
Query: 77 YIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDV 136
+IE N GN+ YKLG N +DLT +EF+A G K TTS FKY+N+ T +
Sbjct: 71 FIESFNAAGNKPYKLGVNHLADLTIEEFKASRNGLKRSYDYEVGTTS--FKYENV--TAI 126
Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG- 195
P S+DWR KGAVTPIK+Q +CG CWAF+ VAA EGI KI +G L+ LSEQ+L+DC G
Sbjct: 127 PASVDWRKKGAVTPIKDQGQCGSCWAFSTVAATEGIHKISTGKLVSLSEQELVDCDRKGT 186
Query: 196 NNGCLGGSREKAFAYIIQNQGI----------FNGVCGTQLDHAVTIVGF 235
+ GC GG E F +II+N GI +G C A I G+
Sbjct: 187 DQGCEGGYMEDGFEFIIKNGGITTEANYPYKAVDGSCKNATAPAAQIKGY 236
Score = 90.5 bits (223), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 49/130 (37%), Positives = 72/130 (55%), Gaps = 18/130 (13%)
Query: 159 CCWAFAAVAAVEGITKIRSGN---LIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQ 215
C A A A ++G K+ + L++ Q + S + +G +++ +
Sbjct: 223 CKNATAPAAQIKGYEKVPVNSEKALLKAVANQPVSVSIDAADG----------SFMFYSS 272
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR----DEGLCG 271
GIF G CGT+LDH VT VG+G +G +YW++KNSWG WG+ GY+++ R EGLCG
Sbjct: 273 GIFTGECGTELDHGVTAVGYGRA-NGTDYWIVKNSWGTVWGEQGYIRMQRGIAAKEGLCG 331
Query: 272 IGTRSSYPLA 281
I SSYP A
Sbjct: 332 IAMDSSYPTA 341
>gi|255580659|ref|XP_002531152.1| cysteine protease, putative [Ricinus communis]
gi|223529265|gb|EEF31237.1| cysteine protease, putative [Ricinus communis]
Length = 340
Score = 197 bits (500), Expect = 5e-48, Method: Compositional matrix adjust.
Identities = 107/229 (46%), Positives = 148/229 (64%), Gaps = 21/229 (9%)
Query: 21 IITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEK 80
+I L + ASQ ++ R+ + S+ E HE+WM + R Y D EKE+R KIFKEN++ IE
Sbjct: 14 LIFFLGALASQAIA-RTLQDASIHEKHEEWMTRFKRVYSDAKEKEIRYKIFKENVQRIES 72
Query: 81 ANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHR-STTSSTFKYQNLSMTDVPTS 139
NK ++YKLG NQF+DLTN+EF+ +K H S+ + F+Y+N+ T VP+S
Sbjct: 73 FNKASEKSYKLGINQFADLTNEEFKTSRNRFK----GHMCSSQAGPFRYENI--TAVPSS 126
Query: 140 LDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNG 198
+DWR +GAVT IK+Q +CG CWAF+AVAAVEGIT++ + LI LSEQ+L+DC T G + G
Sbjct: 127 MDWRKEGAVTAIKDQGQCGSCWAFSAVAAVEGITQLATSKLISLSEQELVDCDTKGEDQG 186
Query: 199 CLGGSREKAFAYIIQNQGIF----------NGVCGT--QLDHAVTIVGF 235
C GG + AF +I QNQG+ +G C T + +HA I GF
Sbjct: 187 CQGGLMDDAFKFIEQNQGLTTEANYPYEGSDGTCNTKQEANHAAKINGF 235
Score = 91.3 bits (225), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 40/78 (51%), Positives = 54/78 (69%), Gaps = 5/78 (6%)
Query: 208 FAYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD- 266
F + + GIF G CGT+LDH V VG+G + +G NYWL+KNSWG WG+ GY+++ +D
Sbjct: 264 FEFQFYSSGIFTGDCGTELDHGVAAVGYGES-NGMNYWLVKNSWGTQWGEEGYIRMQKDI 322
Query: 267 ---EGLCGIGTRSSYPLA 281
EGLCGI ++SYP A
Sbjct: 323 DAKEGLCGIAMQASYPTA 340
>gi|147769019|emb|CAN62459.1| hypothetical protein VITISV_015168 [Vitis vinifera]
Length = 246
Score = 197 bits (500), Expect = 5e-48, Method: Compositional matrix adjust.
Identities = 107/251 (42%), Positives = 148/251 (58%), Gaps = 61/251 (24%)
Query: 86 NRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHR-STTSSTFKYQNLSMTDVPTSLDWRD 144
+++YKL N+F+DLTN+EF +K +H ST +++FKY+N+ T VP++ DWR
Sbjct: 2 DKSYKLSINEFADLTNEEFGTSRNRFK----AHICSTEATSFKYENV--TAVPSTXDWRK 55
Query: 145 KGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLG-- 201
KGAVTPIK+Q +CG CWAF+AVAA+EGIT++ +G LI LSEQ+L+DC T+G + GC G
Sbjct: 56 KGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGEDQGCXGAN 115
Query: 202 -----------------------------GSREKAFAYIIQNQ----------------- 215
+ EKA + +Q
Sbjct: 116 YPYAGTDGTCNRKKAAHPAAKINGYEDVPANNEKALQKAVAHQPIAVAIDAGGXEFQFYS 175
Query: 216 -GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLC 270
G+F G CGT+LDH V VG+GT++DG YWL+KNSWG WG+ GY+++ RD EGLC
Sbjct: 176 SGVFTGQCGTELDHGVXAVGYGTSDDGMKYWLVKNSWGTGWGEEGYIRMQRDVTAKEGLC 235
Query: 271 GIGTRSSYPLA 281
GI ++SYP A
Sbjct: 236 GIAMQASYPTA 246
>gi|356517426|ref|XP_003527388.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 343
Score = 197 bits (500), Expect = 6e-48, Method: Compositional matrix adjust.
Identities = 96/188 (51%), Positives = 133/188 (70%), Gaps = 8/188 (4%)
Query: 33 VSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLG 92
V+ R+ + S+ E H +WMA++ + YKD E+E R +IFKEN+ YIE N N++YKL
Sbjct: 25 VTCRTLQDASMYERHAQWMARYAKVYKDPQEREKRFRIFKENVNYIETFNSADNKSYKLD 84
Query: 93 TNQFSDLTNDEFRALYTGYK--MPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTP 150
NQF+DLTN+EF A +K M S R+TT FKY+N+++ +P+++DWR KGAVTP
Sbjct: 85 INQFADLTNEEFIAPRNRFKGHMCSSITRTTT---FKYENVTV--IPSTVDWRQKGAVTP 139
Query: 151 IKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFA 209
IK+Q +CGCCWAF+AVAA EGI + +G LI LSEQ+++DC T G + GC GG + AF
Sbjct: 140 IKDQGQCGCCWAFSAVAATEGIHALNAGKLISLSEQEVVDCDTKGQDQGCAGGFMDGAFK 199
Query: 210 YIIQNQGI 217
+IIQN G+
Sbjct: 200 FIIQNHGL 207
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 39/70 (55%), Positives = 50/70 (71%), Gaps = 4/70 (5%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR----DEGLCG 271
G+F G CGT+LDH VT VG+G + DG YWL+KNSWG WG+ GY+++ R +EGLCG
Sbjct: 274 GVFTGSCGTELDHGVTAVGYGVSADGTEYWLVKNSWGTEWGEEGYIRMQRGVKAEEGLCG 333
Query: 272 IGTRSSYPLA 281
I +SYP A
Sbjct: 334 IAMMASYPTA 343
>gi|224065647|ref|XP_002301901.1| predicted protein [Populus trichocarpa]
gi|222843627|gb|EEE81174.1| predicted protein [Populus trichocarpa]
Length = 336
Score = 196 bits (499), Expect = 8e-48, Method: Compositional matrix adjust.
Identities = 118/314 (37%), Positives = 170/314 (54%), Gaps = 77/314 (24%)
Query: 38 THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
T ++++ E W+++HG+ Y+ EK +R +IFK+NL +I++ NK+ Y LG N+FS
Sbjct: 24 TSGDKIIDLFESWISKHGKIYESIEEKWLRFEIFKDNLFHIDETNKK-VVNYWLGLNEFS 82
Query: 98 DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKEC 157
DL+++EF+ Y G K+ S R S F Y+++ +P S+DWR KGAVT +KNQ C
Sbjct: 83 DLSHEEFKNKYLGLKV-DMSERRECSQEFNYKDV--MSIPKSVDWRKKGAVTDVKNQGSC 139
Query: 158 GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN---GNNGCL------------GG 202
G CWAF+ VAAVEGI +I +GNL LSEQ+L+DC T G NG L G
Sbjct: 140 GSCWAFSTVAAVEGINQIVTGNLTSLSEQELVDCDTTNNYGCNGGLMDYAFSYIISNGGL 199
Query: 203 SREKAFAYIIQ-----------------------------------NQ------------ 215
+E + YI++ NQ
Sbjct: 200 HKEVDYPYIMEEGTCEMRKEESEVVTISGYHDVPQNSEESLLKALANQPLSVAIEASGRD 259
Query: 216 ------GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD--- 266
G+F+G CGTQLDH V VG+G+T +G +Y ++KNSWG+ WG+ GY+++ R+
Sbjct: 260 FQFYSGGVFDGHCGTQLDHGVAAVGYGST-NGLDYIIVKNSWGSKWGEKGYIRMKRNTGK 318
Query: 267 -EGLCGIGTRSSYP 279
GLCGI +SYP
Sbjct: 319 PAGLCGINKMASYP 332
>gi|356508487|ref|XP_003522988.1| PREDICTED: xylem cysteine proteinase 2-like [Glycine max]
Length = 349
Score = 196 bits (498), Expect = 9e-48, Method: Compositional matrix adjust.
Identities = 116/309 (37%), Positives = 168/309 (54%), Gaps = 78/309 (25%)
Query: 43 VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTND 102
++E+ E WM++HG+ Y+ EK R IFK+NL++I++ NK + Y LG N+F+DL++
Sbjct: 43 LIELFESWMSRHGKIYQSIEEKLHRFDIFKDNLKHIDERNKVVS-NYWLGLNEFADLSHQ 101
Query: 103 EFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWA 162
EF+ Y G K+ S R + F Y++ ++P S+DWR KGAVT +KNQ CG CWA
Sbjct: 102 EFKNKYLGLKVDY-SRRRESPEEFTYKDF---ELPKSVDWRKKGAVTQVKNQGSCGSCWA 157
Query: 163 FAAVAAVEGITKIRSGNLIQLSEQQLLDCS---TNGNNGCL------------GGSREKA 207
F+ VAAVEGI +I +GNL LSEQ+L+DC NG NG L G +E+
Sbjct: 158 FSTVAAVEGINQIVTGNLTSLSEQELIDCDRTYNNGCNGGLMDYAFSFIVENGGLHKEED 217
Query: 208 FAYIIQ-----------------------------------NQ----------------- 215
+ YI++ NQ
Sbjct: 218 YPYIMEEGTCEMTKEETEVVTISGYHDVPQNNEQSLLKALVNQPLSVAIEASGRDFQFYS 277
Query: 216 -GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLC 270
G+F+G CG+ LDH V VG+GT++ G NY ++KNSWG+ WG+ GY+++ R+ EG+C
Sbjct: 278 GGVFDGHCGSDLDHGVAAVGYGTSK-GVNYIIVKNSWGSKWGEKGYIRMRRNIGKPEGIC 336
Query: 271 GIGTRSSYP 279
GI +SYP
Sbjct: 337 GIYKMASYP 345
>gi|255539310|ref|XP_002510720.1| cysteine protease, putative [Ricinus communis]
gi|223551421|gb|EEF52907.1| cysteine protease, putative [Ricinus communis]
Length = 349
Score = 196 bits (498), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 117/313 (37%), Positives = 164/313 (52%), Gaps = 76/313 (24%)
Query: 38 THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
T ++++ E W+++ GR Y+ EK R +IFK+NL +I+ NK+ R Y LG N+F+
Sbjct: 38 TSNDKLIDLFESWISRFGRVYESAEEKLERFEIFKDNLFHIDDTNKK-VRNYWLGLNEFA 96
Query: 98 DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKEC 157
DL+++EF+ Y G K P S R+ F Y++++ +P S+DWR KGAVTP+KNQ C
Sbjct: 97 DLSHEEFKNKYLGLK-PDLSKRAQCPEEFTYKDVA---IPKSVDWRKKGAVTPVKNQGSC 152
Query: 158 GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
G CWAF+ VAAVEGI +I +GNL LSEQ+L+DC T NNGC GG + AFAYI+ N G+
Sbjct: 153 GSCWAFSTVAAVEGINQIVTGNLTSLSEQELIDCDTTYNNGCNGGLMDYAFAYIVANGGL 212
Query: 218 FN----------GVCGTQLDH--AVTIVGF------------------------------ 235
G C + + AVTI G+
Sbjct: 213 HKEEDYPYIMEEGTCDMRKEESDAVTISGYHDVPQNSEESLLKALANQPLSIAIEASGRD 272
Query: 236 -------------GTTED------------GANYWLIKNSWGNTWGDAGYMKIVRD---- 266
GT D G +Y ++KNSWG WG+ GY+++ R
Sbjct: 273 FQFYSGGVFDGHCGTELDHGVAAVGYGTSKGLDYIIVKNSWGPKWGEKGYIRMKRKTSKP 332
Query: 267 EGLCGIGTRSSYP 279
EG+CGI +SYP
Sbjct: 333 EGICGIYKMASYP 345
>gi|302763127|ref|XP_002964985.1| hypothetical protein SELMODRAFT_406652 [Selaginella moellendorffii]
gi|300167218|gb|EFJ33823.1| hypothetical protein SELMODRAFT_406652 [Selaginella moellendorffii]
Length = 320
Score = 196 bits (497), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 114/306 (37%), Positives = 164/306 (53%), Gaps = 56/306 (18%)
Query: 14 NTTPMFIIITLLVSCASQVVSSRSTHEQS----VVEIHEKWMAQHGRSYKDELEKEMRLK 69
N + +I+ ++V A ++ + E + + E W A+HG+SY + EK R+
Sbjct: 4 NMIALILILLVVVGAAPFAIARPAALEDDRALEIKNMFEDWAAKHGKSYSSDWEKARRMT 63
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQ 129
IF + L YIEK N N T+ LG N+FSDLTN EFRA Y G K P ++ + K
Sbjct: 64 IFSDTLAYIEKHNALPNTTFTLGLNKFSDLTNAEFRANYVG-KFKPPRYQDRRPA--KDV 120
Query: 130 NLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLL 189
++ ++ +PTSLDWR +GAVTPIK+Q +CG CWAF+A+A++E + + L+ LSEQQL+
Sbjct: 121 DVDVSSLPTSLDWRQEGAVTPIKDQGQCGSCWAFSAIASIESAHFLATNQLVSLSEQQLI 180
Query: 190 DCSTNGNNGC---------LGGS------------------REKAFAYI----------- 211
DC T + GC L GS ++KA A +
Sbjct: 181 DCDTV-DEGCQEEAYPYTGLAGSCNANKNKVAEITGFNVVTKDKADALMKAVSKTPVTVG 239
Query: 212 -------IQN--QGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMK 262
QN GI +G C DH V ++G+G TE G YW+IKNSWG +WG+ G+MK
Sbjct: 240 ICGSDQNFQNYRSGILSGQCCNSRDHVVLVIGYG-TEGGMPYWIIKNSWGTSWGEDGFMK 298
Query: 263 IVRDEG 268
I + +G
Sbjct: 299 IEKKDG 304
>gi|356554921|ref|XP_003545789.1| PREDICTED: LOW QUALITY PROTEIN: thiol protease SEN102-like [Glycine
max]
Length = 439
Score = 196 bits (497), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 102/219 (46%), Positives = 139/219 (63%), Gaps = 10/219 (4%)
Query: 4 IFERSGSFKINTTPMFIIITLLVSCA--SQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDE 61
IF+R + I + +L+ A + V+ + + S+ E HE+WM +HG+ YKD
Sbjct: 90 IFKRDSTMVAKNHFYHISLAMLLCTAFLAFQVTCCTLQDASMYERHEQWMTRHGKVYKDP 149
Query: 62 LEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYK--MPSPSHR 119
E+E R +IF EN+ Y+E N N+ YKLG NQF DLTN EF A +K M S R
Sbjct: 150 REREKRFRIFNENVNYVEAFNNAANKPYKLGINQFXDLTNQEFIAPRNRFKGHMCSSIIR 209
Query: 120 STTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGN 179
+TT FKY+N+ T VP+++DWR GAVTP+K+Q +CGCCWAF+AVAA EGI + G
Sbjct: 210 TTT---FKYENV--TTVPSTVDWRQNGAVTPVKDQGQCGCCWAFSAVAATEGIHALSGGK 264
Query: 180 LIQLSEQQLLDCSTNG-NNGCLGGSREKAFAYIIQNQGI 217
LI LSEQ+L+DC T G + GC GG + A+ +IIQN G+
Sbjct: 265 LISLSEQELVDCDTKGVDQGCEGGLMDDAYKFIIQNHGL 303
Score = 89.4 bits (220), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 37/70 (52%), Positives = 50/70 (71%), Gaps = 4/70 (5%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR----DEGLCG 271
G F G CGT+LDH VT VG+G ++ G YWL+KNSWG WG+ GY+++ R +EG+CG
Sbjct: 370 GAFTGSCGTELDHGVTAVGYGVSDHGTKYWLVKNSWGTEWGEEGYIRMQRGVDSEEGVCG 429
Query: 272 IGTRSSYPLA 281
I ++SYP A
Sbjct: 430 IAMQASYPTA 439
>gi|129614|sp|P00784.1|PAPA1_CARPA RecName: Full=Papain; AltName: Full=Papaya proteinase I; Short=PPI;
AltName: Allergen=Car p 1; Flags: Precursor
gi|167391|gb|AAB02650.1| papain precursor [Carica papaya]
gi|387885|gb|AAA72774.1| papain [synthetic construct]
gi|225437|prf||1303270A papain
Length = 345
Score = 196 bits (497), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 126/336 (37%), Positives = 174/336 (51%), Gaps = 82/336 (24%)
Query: 18 MFIIITLLVSCASQVVSSRS--THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENL 75
+F+ + L S V S++ T + ++++ E WM +H + YK+ EK R +IFK+NL
Sbjct: 17 LFVYMGLSFGDFSIVGYSQNDLTSTERLIQLFESWMLKHNKIYKNIDEKIYRFEIFKDNL 76
Query: 76 EYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD 135
+YI++ NK+ N +Y LG N F+D++NDEF+ YTG + ++ +T S + N +
Sbjct: 77 KYIDETNKK-NNSYWLGLNVFADMSNDEFKEKYTG--SIAGNYTTTELSYEEVLNDGDVN 133
Query: 136 VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG 195
+P +DWR KGAVTP+KNQ CG CWAF+AV +EGI KIR+GNL + SEQ+LLDC
Sbjct: 134 IPEYVDWRQKGAVTPVKNQGSCGSCWAFSAVVTIEGIIKIRTGNLNEYSEQELLDCDRR- 192
Query: 196 NNGCLGG------------------------------SREK------------------- 206
+ GC GG SREK
Sbjct: 193 SYGCNGGYPWSALQLVAQYGIHYRNTYPYEGVQRYCRSREKGPYAAKTDGVRQVQPYNEG 252
Query: 207 AFAYIIQNQ------------------GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIK 248
A Y I NQ GIF G CG ++DHAV VG+ G NY LIK
Sbjct: 253 ALLYSIANQPVSVVLEAAGKDFQLYRGGIFVGPCGNKVDHAVAAVGY-----GPNYILIK 307
Query: 249 NSWGNTWGDAGYMKIVR----DEGLCGIGTRSSYPL 280
NSWG WG+ GY++I R G+CG+ T S YP+
Sbjct: 308 NSWGTGWGENGYIRIKRGTGNSYGVCGLYTSSFYPV 343
>gi|356543116|ref|XP_003540009.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 337
Score = 196 bits (497), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 99/201 (49%), Positives = 138/201 (68%), Gaps = 10/201 (4%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
+ ++ LL C SQV+S R HE S+ E HE+WM ++G+ YKD EK+ RL IFK+N+E+
Sbjct: 10 ILALVLLLSICTSQVMS-RYLHEASMSERHEQWMKKYGKVYKDAAEKQKRLLIFKDNVEF 68
Query: 78 IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSST-FKYQNLSMTDV 136
IE N GN+ YKLG N +D TN+EF A + GYK H+++ S T FKY+N+ T V
Sbjct: 69 IESFNAAGNKPYKLGINHLADQTNEEFVASHNGYK-----HKASHSQTPFKYENV--TGV 121
Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGN 196
P ++DWR+ GAVT +K+Q +CG CWAF+ VAA EGI +I + L+ LSEQ+L+DC + +
Sbjct: 122 PNAVDWRENGAVTAVKDQGQCGSCWAFSTVAATEGIYQITTSMLMSLSEQELVDCDSV-D 180
Query: 197 NGCLGGSREKAFAYIIQNQGI 217
+GC GG E F +II+N GI
Sbjct: 181 HGCDGGYMEGGFEFIIKNGGI 201
Score = 97.1 bits (240), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 41/77 (53%), Positives = 54/77 (70%), Gaps = 4/77 (5%)
Query: 209 AYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR--- 265
A+ + G+F G CGTQLDH VT VG+G+T+DG YW++KNSWG WG+ GY+++ R
Sbjct: 261 AFQFYSSGVFTGQCGTQLDHGVTAVGYGSTDDGTQYWIVKNSWGTQWGEEGYIRMQRGTD 320
Query: 266 -DEGLCGIGTRSSYPLA 281
EGLCGI +SYP A
Sbjct: 321 AQEGLCGIAMDASYPTA 337
>gi|356517188|ref|XP_003527271.1| PREDICTED: xylem cysteine proteinase 2-like [Glycine max]
Length = 350
Score = 195 bits (496), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 115/309 (37%), Positives = 169/309 (54%), Gaps = 78/309 (25%)
Query: 43 VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTND 102
++E+ E WM++HG+ Y++ EK +R +IFK+NL++I++ NK + Y LG N+F+DL++
Sbjct: 44 LIELFESWMSRHGKIYENIEEKLLRFEIFKDNLKHIDERNKVVS-NYWLGLNEFADLSHR 102
Query: 103 EFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWA 162
EF Y G K+ S R + F Y+++ ++P S+DWR KGAV P+KNQ CG CWA
Sbjct: 103 EFNNKYLGLKVDY-SRRRESPEEFTYKDV---ELPKSVDWRKKGAVAPVKNQGSCGSCWA 158
Query: 163 FAAVAAVEGITKIRSGNLIQLSEQQLLDCS---TNGNNGCL------------GGSREKA 207
F+ VAAVEGI +I +GNL LSEQ+L+DC NG NG L G +E+
Sbjct: 159 FSTVAAVEGINQIVTGNLTSLSEQELIDCDRTYNNGCNGGLMDYAFSFIVENGGLHKEED 218
Query: 208 FAYIIQ-----------------------------------NQ----------------- 215
+ YI++ NQ
Sbjct: 219 YPYIMEEGTCEMTKEETQVVTISGYHDVPQNNEQSLLKALANQPLSVAIEASGRDFQFYS 278
Query: 216 -GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLC 270
G+F+G CG+ LDH V VG+GT + G +Y +KNSWG+ WG+ GY+++ R+ EG+C
Sbjct: 279 GGVFDGHCGSDLDHGVAAVGYGTAK-GVDYITVKNSWGSKWGEKGYIRMRRNIGKPEGIC 337
Query: 271 GIGTRSSYP 279
GI +SYP
Sbjct: 338 GIYKMASYP 346
>gi|302763837|ref|XP_002965340.1| hypothetical protein SELMODRAFT_143126 [Selaginella moellendorffii]
gi|302790566|ref|XP_002977050.1| hypothetical protein SELMODRAFT_232903 [Selaginella moellendorffii]
gi|300155026|gb|EFJ21659.1| hypothetical protein SELMODRAFT_232903 [Selaginella moellendorffii]
gi|300167573|gb|EFJ34178.1| hypothetical protein SELMODRAFT_143126 [Selaginella moellendorffii]
Length = 300
Score = 195 bits (496), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 111/301 (36%), Positives = 158/301 (52%), Gaps = 71/301 (23%)
Query: 46 IHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFR 105
+ E W A+HG+SY + EK RL IF + L YIEK N N T+ LG N+FSDLTN EFR
Sbjct: 1 MFEGWAAKHGKSYSSDWEKARRLMIFSDTLAYIEKHNALPNTTFTLGLNKFSDLTNAEFR 60
Query: 106 ALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAA 165
A Y G K P ++ + K ++ ++ +PTSLDWR +GAVTPIK+Q +CG CWAF+A
Sbjct: 61 ANYVG-KFKPPRYQDRRPA--KDVDVDVSSLPTSLDWRQEGAVTPIKDQGQCGSCWAFSA 117
Query: 166 VAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI-------- 217
+A++E + + L+ LSEQQL+DC T + GC GG E AF ++++N G+
Sbjct: 118 IASIESAHFLATKELVSLSEQQLIDCDTV-DQGCQGGFPEDAFKFVVENGGVTTEEAYPY 176
Query: 218 --FNGVCGTQLDHAVTIVGF-------------------------GTTEDGANY------ 244
F G C + V I G+ G+ ++ NY
Sbjct: 177 TGFAGSCNANKNKVVEITGYKDVTKDSADALMKAVSKTPVTVGICGSDQNFQNYRSGILS 236
Query: 245 ------------------------WLIKNSWGNTWGDAGYMKIVRD--EGLCGIGTRSSY 278
W+IKNSWG +WG+ G+M+I ++ EG+CG+ +SSY
Sbjct: 237 GHCSNSRDHAVLVIGYGTEGGMPYWIIKNSWGTSWGEDGFMRIKKEDGEGMCGMNGQSSY 296
Query: 279 P 279
P
Sbjct: 297 P 297
>gi|357130141|ref|XP_003566711.1| PREDICTED: xylem cysteine proteinase 1-like [Brachypodium
distachyon]
Length = 457
Score = 195 bits (495), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 101/219 (46%), Positives = 140/219 (63%), Gaps = 16/219 (7%)
Query: 43 VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTND 102
++E+ EKW+A+H ++Y EK R ++FK+NL++I+K N+E +Y LG N+F+DLT++
Sbjct: 146 IIELFEKWLAKHQKAYASFEEKLHRFEVFKDNLKHIDKVNREVT-SYWLGLNEFADLTHE 204
Query: 103 EFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWA 162
EF+A Y G P+P+ S S FKY+++S D+P S+DWR KGAVT +KNQ +CG CWA
Sbjct: 205 EFKATYLGLAPPAPARESRGS--FKYEDVSADDLPKSVDWRTKGAVTEVKNQGQCGSCWA 262
Query: 163 FAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIF---- 218
F+ VAAVEGI I +GNL LSEQ+L+DCS +GNNGC GG + AF+YI + G+
Sbjct: 263 FSTVAAVEGINAIVTGNLTALSEQELIDCSVDGNNGCNGGLMDYAFSYIASSGGLHTEEA 322
Query: 219 ------NGVCG---TQLDHAVTIVGFGTTEDGANYWLIK 248
G CG AVTI G+ LIK
Sbjct: 323 YPYLMEEGSCGDGKKSESEAVTISGYEDVPAHNEQALIK 361
Score = 77.8 bits (190), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 34/69 (49%), Positives = 48/69 (69%), Gaps = 5/69 (7%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTE-DGANYWLIKNSWGNTWGDAGYMKIVR----DEGLC 270
G+F+G CGTQLDH V VG+G+ + G +Y +++NSWG WG+ GY+++ R EGLC
Sbjct: 385 GVFDGPCGTQLDHGVAAVGYGSDKGKGHDYIIVRNSWGAKWGEKGYIRMKRGTGKGEGLC 444
Query: 271 GIGTRSSYP 279
GI +SYP
Sbjct: 445 GINKMASYP 453
>gi|326502440|dbj|BAJ95283.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 349
Score = 195 bits (495), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 89/183 (48%), Positives = 128/183 (69%), Gaps = 3/183 (1%)
Query: 38 THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
T + ++V HEKWMA+HGR+Y +E EK RL++F+ N + I+ N + T++L TN+F+
Sbjct: 35 TVDSAMVSRHEKWMAEHGRTYANEEEKARRLEVFRANAKLIDSFNSAEDSTHRLATNRFA 94
Query: 98 DLTNDEFRALYTGYKMP--SPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQK 155
DLT++EFRA TG + P + + + + F+Y+N S+ D S+DWR GAVT +K+Q
Sbjct: 95 DLTDEEFRAARTGLRRPPAAAAGAGSGAGGFRYENFSLADAAGSMDWRAMGAVTGVKDQG 154
Query: 156 ECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNN-GCLGGSREKAFAYIIQN 214
CGCCWAF+AVAAVEG+TKIR+G L+ LSEQQL+DC G++ GC GG + AF Y+I
Sbjct: 155 SCGCCWAFSAVAAVEGLTKIRTGRLVSLSEQQLVDCDVYGDDEGCAGGLMDNAFEYMINR 214
Query: 215 QGI 217
G+
Sbjct: 215 GGL 217
Score = 69.7 bits (169), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 46/130 (35%), Positives = 72/130 (55%), Gaps = 17/130 (13%)
Query: 158 GCCWAFAAVAAVEGITKIRSGN---LIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQN 214
G C A+ A++ G + + N L+ Q + + NG + + +
Sbjct: 230 GSCRRSASAASIRGYEDVPANNEAALMAAVAHQPVSVAINGGDS----------VFRFYD 279
Query: 215 QGIFNGV-CGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKI---VRDEGLC 270
G+ G CGT+L+HA+T VG+GT DG YW++KNSWG +WG+ GY++I VR EG+C
Sbjct: 280 SGVLGGSGCGTELNHAITAVGYGTASDGTKYWIMKNSWGGSWGEGGYVRIRRGVRGEGVC 339
Query: 271 GIGTRSSYPL 280
G+ +SYP+
Sbjct: 340 GLAQLASYPV 349
>gi|356543124|ref|XP_003540013.1| PREDICTED: vignain-like [Glycine max]
gi|356543126|ref|XP_003540014.1| PREDICTED: vignain-like [Glycine max]
Length = 337
Score = 195 bits (495), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 99/201 (49%), Positives = 137/201 (68%), Gaps = 10/201 (4%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
+ ++ LL C SQV+S R+ HE S+ E HE+WM ++G+ YKD EK+ RL IFK+N+E+
Sbjct: 10 ILALVLLLSICTSQVMS-RNLHEASMSERHEQWMKKYGKVYKDAAEKQKRLLIFKDNVEF 68
Query: 78 IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSST-FKYQNLSMTDV 136
IE N GNR YKL N +D TN+EF A + GYK H+ + S T FKY+N+ T V
Sbjct: 69 IESFNAAGNRPYKLSINHLADQTNEEFVASHNGYK-----HKGSHSQTPFKYENV--TGV 121
Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGN 196
P ++DWR+ GAVT +K+Q +CG CWAF+ VAA EGI +I + L+ LSEQ+L+DC + +
Sbjct: 122 PNAVDWRENGAVTAVKDQGQCGSCWAFSTVAATEGIYQITTSMLMSLSEQELVDCDSV-D 180
Query: 197 NGCLGGSREKAFAYIIQNQGI 217
+GC GG E F +II+N GI
Sbjct: 181 HGCDGGYMEGGFEFIIKNGGI 201
Score = 97.1 bits (240), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 41/77 (53%), Positives = 54/77 (70%), Gaps = 4/77 (5%)
Query: 209 AYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR--- 265
A+ + G+F G CGTQLDH VT VG+G+T+DG YW++KNSWG WG+ GY+++ R
Sbjct: 261 AFQFYSSGVFTGQCGTQLDHGVTAVGYGSTDDGTQYWIVKNSWGTQWGEEGYIRMQRGTD 320
Query: 266 -DEGLCGIGTRSSYPLA 281
EGLCGI +SYP A
Sbjct: 321 AQEGLCGIAMDASYPTA 337
>gi|2224810|emb|CAB09698.1| cysteine proteinase [Hordeum vulgare subsp. vulgare]
Length = 349
Score = 195 bits (495), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 89/183 (48%), Positives = 128/183 (69%), Gaps = 3/183 (1%)
Query: 38 THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
T + ++V HEKWMA+HGR+Y +E EK RL++F+ N + I+ N + T++L TN+F+
Sbjct: 35 TVDAAMVSRHEKWMAEHGRTYANEEEKARRLEVFRANAKLIDSFNSAEDSTHRLATNRFA 94
Query: 98 DLTNDEFRALYTGYKMP--SPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQK 155
DLT++EFRA TG + P + + + + F+Y+N S+ D S+DWR GAVT +K+Q
Sbjct: 95 DLTDEEFRAARTGLRRPPAAAAGAGSGAGGFRYENFSLADAAGSMDWRAMGAVTGVKDQG 154
Query: 156 ECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNN-GCLGGSREKAFAYIIQN 214
CGCCWAF+AVAAVEG+TKIR+G L+ LSEQQL+DC G++ GC GG + AF Y+I
Sbjct: 155 SCGCCWAFSAVAAVEGLTKIRTGRLVSLSEQQLVDCDVYGDDEGCAGGLMDNAFEYMINR 214
Query: 215 QGI 217
G+
Sbjct: 215 GGL 217
Score = 67.8 bits (164), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 45/130 (34%), Positives = 71/130 (54%), Gaps = 17/130 (13%)
Query: 158 GCCWAFAAVAAVEGITKIRSGN---LIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQN 214
G C A+ A++ G + + N L+ Q + + NG + + +
Sbjct: 230 GSCRRSASAASIRGYEDVPANNEAALMAAVAHQPVSVAINGGDSV----------FRFYD 279
Query: 215 QGIFNGV-CGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKI---VRDEGLC 270
G+ G CGT+L+HA+T G+GT DG YW++KNSWG +WG+ GY++I VR EG+C
Sbjct: 280 SGVLGGSGCGTELNHAITAAGYGTASDGTKYWIMKNSWGGSWGEGGYVRIRRGVRGEGVC 339
Query: 271 GIGTRSSYPL 280
G+ +SYP+
Sbjct: 340 GLAQLASYPV 349
>gi|302763109|ref|XP_002964976.1| hypothetical protein SELMODRAFT_83176 [Selaginella moellendorffii]
gi|302763113|ref|XP_002964978.1| hypothetical protein SELMODRAFT_83554 [Selaginella moellendorffii]
gi|300167209|gb|EFJ33814.1| hypothetical protein SELMODRAFT_83176 [Selaginella moellendorffii]
gi|300167211|gb|EFJ33816.1| hypothetical protein SELMODRAFT_83554 [Selaginella moellendorffii]
Length = 300
Score = 194 bits (494), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 110/301 (36%), Positives = 158/301 (52%), Gaps = 71/301 (23%)
Query: 46 IHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFR 105
+ E W A+H +SY + EK RL +F + L YIEK N + N T+ LG N+FSDLTN EFR
Sbjct: 1 MFEDWAAKHDKSYSSDWEKARRLMVFSDTLAYIEKHNAQPNTTFTLGLNKFSDLTNAEFR 60
Query: 106 ALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAA 165
A Y G K P ++ + K ++ ++ +PTSLDWR +GAVTPIK+Q +CG CWAF+A
Sbjct: 61 ANYVG-KFKPPRYQDRRPA--KDVDVDVSSLPTSLDWRQEGAVTPIKDQGQCGSCWAFSA 117
Query: 166 VAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI-------- 217
+A++E + + L+ LSEQQL+DC T + GC GG + AF ++++N G+
Sbjct: 118 IASIESAHFLATKELVSLSEQQLIDCDTV-DQGCQGGFPDDAFKFVVENGGVTTEEAYPY 176
Query: 218 --FNGVCGTQLDHAVTIVGF-------------------------GTTEDGANY------ 244
F G C T + V I G+ G+ ++ NY
Sbjct: 177 TGFAGSCNTNKNKVVEITGYKDVTKDSADALMKAVSKTPVTVGICGSDQNFQNYRSGILS 236
Query: 245 ------------------------WLIKNSWGNTWGDAGYMKIVRD--EGLCGIGTRSSY 278
W+IKNSWG +WG+ G+MKI + EG+CG+ +SSY
Sbjct: 237 GQCCNSRDHAVLVIGYGTEGGMPYWIIKNSWGTSWGEDGFMKIKKKDGEGMCGMNGQSSY 296
Query: 279 P 279
P
Sbjct: 297 P 297
>gi|2351107|dbj|BAA21929.1| bromelain [Ananas comosus]
Length = 312
Score = 194 bits (494), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 111/297 (37%), Positives = 156/297 (52%), Gaps = 70/297 (23%)
Query: 51 MAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTG 110
MA++GR YKD EK R +IFK N+ +IE N +Y LG N+F+D+TN+EF A YTG
Sbjct: 1 MAEYGRVYKDNDEKMRRFQIFKNNVNHIETFNNRNGNSYTLGINKFTDMTNNEFVAQYTG 60
Query: 111 YKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVE 170
+ P + + +++++ V S+DWRD GAVT +K+Q CG CWAF+A+A VE
Sbjct: 61 -GISRPLNIEK-EPVVSFDDVNISAVGQSIDWRDYGAVTEVKDQNPCGSCWAFSAIATVE 118
Query: 171 GITKIRSGNLIQLSEQQLLDCS-TNG---------------NNG---------------C 199
GI KI +G L+ LSEQ++LDC+ +NG NNG C
Sbjct: 119 GIYKIVTGYLVSLSEQEVLDCAVSNGCDGGFVDNAYDFIISNNGVASEADYPYQAYQGDC 178
Query: 200 LGGS----------------REKAFAYIIQNQ------------------GIFNGVCGTQ 225
S E + Y + NQ G+F+G CGT
Sbjct: 179 AANSWPNSAYITGYSYVRSNDESSMKYAVWNQPIAAAIDASGDNFQYYNGGVFSGPCGTS 238
Query: 226 LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR---DEGLCGIGTRSSYP 279
L+HA+TI+G+G G YW++KNSWG++WG+ GY+++ R GLCGI YP
Sbjct: 239 LNHAITIIGYGQDSSGTQYWIVKNSWGSSWGERGYIRMARGVSSSGLCGIAMDPLYP 295
>gi|302764466|ref|XP_002965654.1| hypothetical protein SELMODRAFT_230713 [Selaginella moellendorffii]
gi|300166468|gb|EFJ33074.1| hypothetical protein SELMODRAFT_230713 [Selaginella moellendorffii]
Length = 345
Score = 194 bits (494), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 114/307 (37%), Positives = 156/307 (50%), Gaps = 79/307 (25%)
Query: 45 EIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEF 104
++++KW+ +HG++Y E + R +IFKEN+ YI N N ++ LG N+F+DLTN EF
Sbjct: 36 QVYQKWIQEHGKAYNSAHEYKKRFQIFKENVNYINSHNARRNNSHSLGLNKFADLTNSEF 95
Query: 105 RALYTG-YKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAF 163
R LY G + P+P H + + D TS+DWR KG VT IK+Q +CG CWAF
Sbjct: 96 RGLYVGRLQRPAPFHEVGDIAL-------VADTATSVDWRKKGGVTEIKDQGDCGSCWAF 148
Query: 164 AAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI------ 217
+AVAAVEG+T + +G L+ LSEQ+L+DC T N GC GG + AF Y+I+N GI
Sbjct: 149 SAVAAVEGLTFLSTGTLVSLSEQELVDCDTTVNQGCDGGIMDYAFQYMIRNGGITSQSNY 208
Query: 218 ----FNGVCGTQ--LDHAVTIVGFG----------------------------------- 236
G C HA TI GF
Sbjct: 209 PYRALRGACDKDKVKYHAATINGFQAIPPQSEELLLRAVANQPVSVAIEAGGQDFQLYSS 268
Query: 237 ---TTEDGAN------------------YWLIKNSWGNTWGDAGYMKIVRD---EGLCGI 272
T E G+N YWL+KNSWG+ WG++GY+++ R G+CGI
Sbjct: 269 GVFTGECGSNLDHGVAIVGYGTDAGGRQYWLVKNSWGSGWGESGYVRMERQGPGAGVCGI 328
Query: 273 GTRSSYP 279
+SYP
Sbjct: 329 NLDASYP 335
>gi|21070926|gb|AAM34401.1|AF377947_7 putative cysteine proteinase [Oryza sativa Japonica Group]
gi|31712050|gb|AAP68356.1| putative cysteine protease [Oryza sativa Japonica Group]
gi|40538988|gb|AAR87245.1| putative cysteine protease [Oryza sativa Japonica Group]
gi|108711126|gb|ABF98921.1| Papain family cysteine protease containing protein, expressed
[Oryza sativa Japonica Group]
gi|125545747|gb|EAY91886.1| hypothetical protein OsI_13535 [Oryza sativa Indica Group]
Length = 350
Score = 194 bits (494), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 90/176 (51%), Positives = 122/176 (69%), Gaps = 6/176 (3%)
Query: 47 HEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEK----ANKEGNRTYKLGTNQFSDLTND 102
HEKWMA+HG++YKDE EK RL++F+ N + I+ A K+G ++L TN+F+DLT+D
Sbjct: 42 HEKWMAKHGKTYKDEEEKARRLEVFRANAKLIDSFNAAAEKDGGGGHRLATNRFADLTDD 101
Query: 103 EFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWA 162
EFRA TGY+ P P+ + F Y+N S+ P S+DWR GAVT +K+Q CGCCWA
Sbjct: 102 EFRAARTGYQRP-PAAVAGAGGGFLYENFSLAAAPQSMDWRAMGAVTGVKDQGSCGCCWA 160
Query: 163 FAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFAYIIQNQGI 217
F+AVAAVEG+ KIR+G L+ LSEQ+L+DC G + GC GG + AF YI + G+
Sbjct: 161 FSAVAAVEGLAKIRTGQLVSLSEQELVDCDVRGEDQGCEGGLMDTAFQYIARRGGL 216
Score = 88.2 bits (217), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 43/107 (40%), Positives = 62/107 (57%), Gaps = 14/107 (13%)
Query: 178 GNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIFNGV-CGTQLDHAVTIVGFG 236
G L+ +Q + + NG + + ++G+ G CGT+L+HAVT VG+G
Sbjct: 254 GALMAAVARQPVSVAING----------AGYVFRFYDRGVLGGAGCGTELNHAVTAVGYG 303
Query: 237 TTEDGANYWLIKNSWGNTWGDAGYMKIVRD---EGLCGIGTRSSYPL 280
T DG YWL+KNSWG +WG+ GY++I R EG CGI +SYP+
Sbjct: 304 TASDGTGYWLMKNSWGASWGEGGYVRIRRGVGREGACGIAQMASYPV 350
>gi|18396939|ref|NP_564320.1| Papain family cysteine protease [Arabidopsis thaliana]
gi|9502427|gb|AAF88126.1|AC021043_19 Putative cysteine proteinase [Arabidopsis thaliana]
gi|67633400|gb|AAY78625.1| peptidase C1A papain family protein [Arabidopsis thaliana]
gi|332192919|gb|AEE31040.1| Papain family cysteine protease [Arabidopsis thaliana]
Length = 346
Score = 194 bits (494), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 100/206 (48%), Positives = 137/206 (66%), Gaps = 12/206 (5%)
Query: 42 SVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTN 101
S+V+ H++WM Q R Y DE EK++RL++ ENL++IE N GN++YKLG N+F+D T
Sbjct: 34 SIVDYHQQWMIQFSRVYDDEFEKQLRLQVLTENLKFIESFNNMGNQSYKLGVNEFTDWTK 93
Query: 102 DEFRALYTGYK-MPSPSHRSTTSSTFKYQNLSMTDV-PTSLDWRDKGAVTPIKNQKECGC 159
+EF A YTG + + S + T N +++DV T+ DWR++GAVTP+K+Q ECG
Sbjct: 94 EEFLATYTGLRGVNVTSPFEVVNETKPAWNWTVSDVLGTNKDWRNEGAVTPVKSQGECGG 153
Query: 160 CWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIFN 219
CWAF+A+AAVEG+TKI GNLI LSEQQLLDC+ NNGC GG+ AF YII+++GI +
Sbjct: 154 CWAFSAIAAVEGLTKIARGNLISLSEQQLLDCTREQNNGCKGGTFVNAFNYIIKHRGISS 213
Query: 220 ----------GVCGTQLDHAVTIVGF 235
G C + A+ I GF
Sbjct: 214 ENEYPYQVKEGPCRSNARPAILIRGF 239
Score = 87.0 bits (214), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 45/118 (38%), Positives = 69/118 (58%), Gaps = 12/118 (10%)
Query: 169 VEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIFNGV-CGTQLD 227
+ G + S N E+ LL+ + + E F + + G++N CGT ++
Sbjct: 236 IRGFENVPSNN-----ERALLEAVSRQPVAVAIDASEAGFVHY--SGGVYNARNCGTSVN 288
Query: 228 HAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 281
HAVT+VG+GT+ +G YWL KNSWG TWG+ GY++I RD +G+CG+ +SYP+A
Sbjct: 289 HAVTLVGYGTSPEGMKYWLAKNSWGKTWGENGYIRIRRDVEWPQGMCGVAQYASYPVA 346
>gi|356515056|ref|XP_003526217.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 342
Score = 194 bits (493), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 90/199 (45%), Positives = 133/199 (66%), Gaps = 2/199 (1%)
Query: 19 FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
++I+ L++S + V SR E E HEKWMAQ+GR YKD EKE R ++FK N+ +I
Sbjct: 9 YLILFLVLSVWTSHVMSRRLSEACTSERHEKWMAQYGRVYKDAAEKEKRFQVFKNNVHFI 68
Query: 79 EKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPT 138
E N G++ + L NQF+DL ++EF+AL + + ++T ++F+Y+ S+T +P
Sbjct: 69 ESFNAAGDKPFNLSINQFADLNDEEFKALLINVQKKASWVETSTQTSFRYE--SVTKIPA 126
Query: 139 SLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNG 198
++DWR +GAVTPIK+Q CG CWAF+AVAA EGI +I +G L+ LSEQ+L+DC + G
Sbjct: 127 TIDWRKRGAVTPIKDQGRCGSCWAFSAVAATEGIHQITTGKLVPLSEQELVDCVKGESEG 186
Query: 199 CLGGSREKAFAYIIQNQGI 217
C+GG + AF +I + GI
Sbjct: 187 CIGGYVDDAFEFIAKKGGI 205
Score = 90.5 bits (223), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 52/121 (42%), Positives = 66/121 (54%), Gaps = 12/121 (9%)
Query: 166 VAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIFNGV-CGT 224
VA ++G K+ S N E+ LL N + AF Y + GIFN CGT
Sbjct: 229 VAEIKGYEKVPSNN-----EKALLKAVANQPVSVYIDAGTHAFKYY--SSGIFNVRNCGT 281
Query: 225 QLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 280
+HAV +VG+G DG+ YWL+KNSWG WG+ GY++I RD EGLCGI YP
Sbjct: 282 DPNHAVAVVGYGKALDGSKYWLVKNSWGTEWGERGYIRIKRDIRAKEGLCGIAKYPYYPT 341
Query: 281 A 281
A
Sbjct: 342 A 342
>gi|302790570|ref|XP_002977052.1| hypothetical protein SELMODRAFT_268054 [Selaginella moellendorffii]
gi|300155028|gb|EFJ21661.1| hypothetical protein SELMODRAFT_268054 [Selaginella moellendorffii]
Length = 300
Score = 194 bits (493), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 111/301 (36%), Positives = 157/301 (52%), Gaps = 71/301 (23%)
Query: 46 IHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFR 105
+ E W A+HG+SY + EK RL IF + L YIEK N N T+ LG N+FSDLTN EFR
Sbjct: 1 MFEGWAAKHGKSYSSDWEKARRLMIFSDTLAYIEKHNALPNTTFTLGLNKFSDLTNAEFR 60
Query: 106 ALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAA 165
A Y G K P ++ + K ++ ++ +PTSLDWR +GAVTPIK+Q +CG CWAF+A
Sbjct: 61 ANYVG-KFKPPRYQDRRPA--KDVDVDVSSLPTSLDWRQEGAVTPIKDQGQCGSCWAFSA 117
Query: 166 VAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI-------- 217
+A++E + + L+ LSEQQL+DC T + GC GG E AF ++++N G+
Sbjct: 118 IASIESAHFLATKELVSLSEQQLIDCDTV-DQGCQGGFPEDAFKFVVENGGVTTEEAYPY 176
Query: 218 --FNGVCGTQLDHAVTIVGF-------------------------GTTEDGANY------ 244
F G C + V I G+ G+ ++ NY
Sbjct: 177 TGFAGSCNANKNKVVEITGYKDVTKDSADALMKAVSKTPVTVGICGSDQNFQNYRSGILS 236
Query: 245 ------------------------WLIKNSWGNTWGDAGYMKIVRD--EGLCGIGTRSSY 278
W+IKNSWG +WG+ G+M+I + EG+CG+ +SSY
Sbjct: 237 GHCSNSRDHAVLVIGYGTEGGMPYWIIKNSWGTSWGEDGFMRIKKKDGEGMCGMNGQSSY 296
Query: 279 P 279
P
Sbjct: 297 P 297
>gi|224121800|ref|XP_002330656.1| predicted protein [Populus trichocarpa]
gi|222872260|gb|EEF09391.1| predicted protein [Populus trichocarpa]
Length = 342
Score = 194 bits (493), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 102/243 (41%), Positives = 143/243 (58%), Gaps = 16/243 (6%)
Query: 19 FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
F L++ + V+SR E + HE+WMA +G+ Y D EKE R KIFK N+EYI
Sbjct: 10 FFAFILILGMWAFEVASRELQESYMSARHEQWMATYGKVYVDAAEKERRFKIFKNNVEYI 69
Query: 79 EKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPT 138
E N GN+ YKL N+F+D TN++F+ GY+ P + R ++FKY+N+ T VP
Sbjct: 70 ESFNTAGNKPYKLSVNKFADQTNEKFKGARNGYRRPFQT-RPMKVTSFKYENV--TAVPA 126
Query: 139 SLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NN 197
++DWR KGAVT IK+Q +CG CWAF+ VAA EGI ++ +G L+ LSEQ+L+DC G +
Sbjct: 127 TMDWRKKGAVTLIKDQGQCGSCWAFSTVAATEGINQLTTGKLVSLSEQELVDCDIQGEDQ 186
Query: 198 GCLGGSREKAFAYIIQNQGI----------FNGVCGT--QLDHAVTIVGFGTTEDGANYW 245
GC GG E F +II+N GI +G C + Q H I G+ + +
Sbjct: 187 GCEGGLMEDGFEFIIKNHGITTEANYPYQAADGTCNSKKQASHIAKITGYESVPANSEAE 246
Query: 246 LIK 248
L+K
Sbjct: 247 LLK 249
Score = 98.6 bits (244), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 50/122 (40%), Positives = 68/122 (55%), Gaps = 14/122 (11%)
Query: 164 AAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIFNGVCG 223
A + E + L+++ Q + S + GGS + + G+F G CG
Sbjct: 231 AKITGYESVPANSEAELLKVVANQPISVSIDA-----GGSD-----FQFYSSGVFTGKCG 280
Query: 224 TQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 279
T+LDH VT VG+G T DG YWL+KNSWG +WG+ GY+++ RD EGLCGI SSYP
Sbjct: 281 TELDHGVTAVGYGETSDGTKYWLVKNSWGTSWGEEGYIRMQRDIDTEEGLCGIAMDSSYP 340
Query: 280 LA 281
A
Sbjct: 341 TA 342
>gi|22093636|dbj|BAC06931.1| putative cysteine proteinase [Oryza sativa Japonica Group]
gi|50510021|dbj|BAD30633.1| putative cysteine proteinase [Oryza sativa Japonica Group]
Length = 352
Score = 194 bits (493), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 91/212 (42%), Positives = 139/212 (65%), Gaps = 10/212 (4%)
Query: 8 SGSFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMR 67
S ++ + +++ +S ++V + ++ ++ H+KWMA+HGR+YKD EK R
Sbjct: 5 SSKLQVMAASLLLVVAGGLSTMAKV--TMASRAGTMEARHDKWMAEHGRTYKDAAEKARR 62
Query: 68 LKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFK 127
++FK N++ I+++N GN+ Y+L TN+F+DLT+ EF A+YTGY + + + ++T
Sbjct: 63 FRVFKANVDLIDRSNAAGNKRYRLATNRFTDLTDAEFAAMYTGYNPANTMYAAANATT-- 120
Query: 128 YQNLSMTD--VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
LS D P +DWR +GAVT +KNQ+ CGCCWAF+ VAAVEGI +I +G L+ LSE
Sbjct: 121 --RLSSEDDQQPAEVDWRQQGAVTGVKNQRSCGCCWAFSTVAAVEGIHQITTGELVSLSE 178
Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
QQLLDC+ NG GC GGS + AF Y+ + G+
Sbjct: 179 QQLLDCADNG--GCTGGSLDNAFQYMANSGGV 208
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 38/72 (52%), Positives = 49/72 (68%), Gaps = 7/72 (9%)
Query: 216 GIFNG-VCGTQLDHAVTIVGFGTTEDGAN---YWLIKNSWGNTWGDAGYMKIVRD---EG 268
G+F CGT+LDHAV +VG+G DG+ YW+IKNSWG TWGD GYMK+ +D +G
Sbjct: 278 GVFTADSCGTKLDHAVAVVGYGAEADGSGGGGYWIIKNSWGTTWGDGGYMKLEKDVGSQG 337
Query: 269 LCGIGTRSSYPL 280
CG+ SYP+
Sbjct: 338 ACGVAMAPSYPV 349
>gi|242038089|ref|XP_002466439.1| hypothetical protein SORBIDRAFT_01g007820 [Sorghum bicolor]
gi|241920293|gb|EER93437.1| hypothetical protein SORBIDRAFT_01g007820 [Sorghum bicolor]
Length = 353
Score = 194 bits (493), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 94/218 (43%), Positives = 136/218 (62%), Gaps = 15/218 (6%)
Query: 15 TTPMFIIITLLVSCASQVVSSRS-------------THEQSVVEIHEKWMAQHGRSYKDE 61
+ F++ L+V+ + R+ ++V HEKWMA+HGR+Y DE
Sbjct: 2 SVSRFVLTVLVVASVCTAAAPRALAVRELAGEEESAAVAAAMVSRHEKWMAEHGRTYTDE 61
Query: 62 LEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRST 121
EK RL+IF+ N E+I+ N G +++L TN+F+DLT++EFRA TG++ +
Sbjct: 62 AEKARRLEIFRANAEFIDSFNDAGKHSHRLATNRFADLTDEEFRAARTGFRPRPAPAAAA 121
Query: 122 TSST-FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNL 180
S F+Y+N S+ D S+DWR GAVT +K+Q ECGCCWAF+AVAAVEG+ KIR+G L
Sbjct: 122 GSGGRFRYENFSLADAAQSVDWRAMGAVTGVKDQGECGCCWAFSAVAAVEGLNKIRTGRL 181
Query: 181 IQLSEQQLLDCSTNG-NNGCLGGSREKAFAYIIQNQGI 217
+ LSEQ+L+DC NG + GC GG + AF +I + G+
Sbjct: 182 VSLSEQELVDCDVNGEDQGCEGGLMDDAFQFIERRGGL 219
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 38/76 (50%), Positives = 54/76 (71%), Gaps = 3/76 (3%)
Query: 208 FAYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKI---V 264
+A+ + G+ G CGT L+HA+T VG+GT DG+ YWL+KNSWG +WG+ GY++I V
Sbjct: 278 YAFRFYDSGVLGGECGTDLNHAITAVGYGTAADGSKYWLMKNSWGTSWGEGGYVRIRRGV 337
Query: 265 RDEGLCGIGTRSSYPL 280
R EG+CG+ SYP+
Sbjct: 338 RGEGVCGLAKLPSYPV 353
>gi|222629922|gb|EEE62054.1| hypothetical protein OsJ_16838 [Oryza sativa Japonica Group]
Length = 336
Score = 194 bits (493), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 116/298 (38%), Positives = 162/298 (54%), Gaps = 77/298 (25%)
Query: 56 RSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPS 115
++Y EK R ++FK+NL +I+ NK+ +Y LG N+F+DLT+DEF+A Y G P
Sbjct: 38 KAYASFEEKVRRFEVFKDNLNHIDDINKK-VTSYWLGLNEFADLTHDEFKATYLGL-TPP 95
Query: 116 PSHRST---TSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGI 172
P+ ++ +S F+Y +S +VP +DWR K AVT +KNQ +CG CWAF+ VAAVEGI
Sbjct: 96 PTRSNSKHYSSEEFRYGKMSNGEVPKEMDWRKKNAVTEVKNQGQCGSCWAFSTVAAVEGI 155
Query: 173 TKIRSGNLIQLSEQQLLDCSTNGN---NGCL-----------GGSR-EKAFAYIIQ---- 213
I +GNL LSEQ+L+DCST+GN NG L GG R E+A+ Y ++
Sbjct: 156 NAIVTGNLTSLSEQELIDCSTDGNNGCNGGLMDYAFSYIASTGGLRTEEAYPYAMEEGDC 215
Query: 214 ------------------------------------------------NQGIFNGVCGTQ 225
+ G+F+G CG Q
Sbjct: 216 DEGKGAAVVTISGYEDVPANDEQALVKALAHQPVSVAIEASGRHFQFYSGGVFDGPCGEQ 275
Query: 226 LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR----DEGLCGIGTRSSYP 279
LDH VT VG+GT++ G +Y ++KNSWG WG+ GY+++ R EGLCGI +SYP
Sbjct: 276 LDHGVTAVGYGTSK-GQDYIIVKNSWGPHWGEKGYIRMKRGTGKGEGLCGINKMASYP 332
>gi|218198967|gb|EEC81394.1| hypothetical protein OsI_24614 [Oryza sativa Indica Group]
Length = 342
Score = 194 bits (492), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 92/202 (45%), Positives = 135/202 (66%), Gaps = 11/202 (5%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
+ ++ L + A ++SR+ ++ H+KWMA+HGR+YKD EK R ++FK N++
Sbjct: 6 LLVVAGGLSTMAKVTMASRAGTMEAR---HDKWMAEHGRTYKDAAEKARRFRVFKANVDL 62
Query: 78 IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD-- 135
I+++N GN+ Y+L TN+F+DLT+ EF A+YTGY + + + ++T LS D
Sbjct: 63 IDRSNAAGNKRYRLATNRFTDLTDAEFAAMYTGYNPANTMYAAANATT----RLSSEDDQ 118
Query: 136 VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG 195
P +DWR +GAVT +KNQ+ CGCCWAF+ VAAVEGI +I +G L+ LSEQQLLDC+ NG
Sbjct: 119 QPAEVDWRQQGAVTGVKNQRSCGCCWAFSTVAAVEGIHQITTGELVSLSEQQLLDCADNG 178
Query: 196 NNGCLGGSREKAFAYIIQNQGI 217
GC GGS + AF Y+ + G+
Sbjct: 179 --GCTGGSLDNAFQYMANSGGV 198
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 38/72 (52%), Positives = 49/72 (68%), Gaps = 7/72 (9%)
Query: 216 GIFNG-VCGTQLDHAVTIVGFGTTEDGAN---YWLIKNSWGNTWGDAGYMKIVRD---EG 268
G+F CGT+LDHAV +VG+G DG+ YW+IKNSWG TWGD GYMK+ +D +G
Sbjct: 268 GVFTADSCGTKLDHAVAVVGYGAEADGSGGGGYWIIKNSWGTTWGDGGYMKLEKDVGSQG 327
Query: 269 LCGIGTRSSYPL 280
CG+ SYP+
Sbjct: 328 ACGVAMAPSYPV 339
>gi|255544115|ref|XP_002513120.1| cysteine protease, putative [Ricinus communis]
gi|223548131|gb|EEF49623.1| cysteine protease, putative [Ricinus communis]
Length = 362
Score = 194 bits (492), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 97/182 (53%), Positives = 130/182 (71%), Gaps = 8/182 (4%)
Query: 35 SRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTN 94
+R+ E S+ E HE+WMA + R YKD EK+MR KIFKEN++ I+ N E +++YKL N
Sbjct: 27 ARTLQEASMYERHEQWMASYARVYKDANEKQMRYKIFKENVQRIDSFNSESDKSYKLAVN 86
Query: 95 QFSDLTNDEFRALYTGYKMPSPSHR-STTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKN 153
QF+DLTN+EF++L G+K H S + F+Y+N+ T VP S+DWR KGAVT IK
Sbjct: 87 QFADLTNEEFKSLRNGFK----GHMCSAQAGHFRYENV--TAVPASIDWRKKGAVTQIKE 140
Query: 154 QKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFAYII 212
Q +CG CWAF+AVAAVEGIT+I++G LI LSEQ+L+DC TN + GC GG + AF +I
Sbjct: 141 QGQCGSCWAFSAVAAVEGITEIKTGKLISLSEQELVDCDTNSEDQGCQGGLMDDAFKFIE 200
Query: 213 QN 214
Q+
Sbjct: 201 QH 202
>gi|357458911|ref|XP_003599736.1| Cysteine proteinase [Medicago truncatula]
gi|357474719|ref|XP_003607644.1| Cysteine proteinase [Medicago truncatula]
gi|355488784|gb|AES69987.1| Cysteine proteinase [Medicago truncatula]
gi|355508699|gb|AES89841.1| Cysteine proteinase [Medicago truncatula]
Length = 340
Score = 194 bits (492), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 96/187 (51%), Positives = 128/187 (68%), Gaps = 7/187 (3%)
Query: 33 VSSRSTHEQ-SVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKL 91
V SR +E S+ E HE+WM +HG+ Y+D +EKE R IFK+N+E+IE N N+ YKL
Sbjct: 25 VMSRKLYESLSLQERHEQWMTEHGKVYEDAIEKEKRFMIFKDNVEFIESFNAADNQPYKL 84
Query: 92 GTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPI 151
N +DLT DEF+A GYK R T+++FKY+N+ T +P ++DWR KGAVTPI
Sbjct: 85 SVNHLADLTLDEFKASRNGYK---KIDREFTTTSFKYENV--TAIPAAVDWRVKGAVTPI 139
Query: 152 KNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFAY 210
K+Q +CG CWAF+ VAA EGI +I +G L+ LSEQ+L+DC T G + GC GG E F +
Sbjct: 140 KDQGQCGSCWAFSTVAATEGINQITTGKLVSLSEQELVDCDTKGEDQGCEGGLMEDGFEF 199
Query: 211 IIQNQGI 217
II+N GI
Sbjct: 200 IIKNGGI 206
Score = 88.2 bits (217), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 38/77 (49%), Positives = 54/77 (70%), Gaps = 5/77 (6%)
Query: 209 AYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR--- 265
+++ + GI+ G CGT+LDH VT VG+G+ +G +YW++KNSWG WG+ GY+++ R
Sbjct: 265 SFMFYSSGIYTGECGTELDHGVTAVGYGSA-NGTDYWIVKNSWGTVWGEKGYIRMQRGIA 323
Query: 266 -DEGLCGIGTRSSYPLA 281
EGLCGI SSYP A
Sbjct: 324 AKEGLCGIAMDSSYPTA 340
>gi|297851332|ref|XP_002893547.1| peptidase C1A papain family protein [Arabidopsis lyrata subsp.
lyrata]
gi|297339389|gb|EFH69806.1| peptidase C1A papain family protein [Arabidopsis lyrata subsp.
lyrata]
Length = 345
Score = 194 bits (492), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 101/218 (46%), Positives = 139/218 (63%), Gaps = 12/218 (5%)
Query: 30 SQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTY 89
S+ S + HE ++ H+KWM R Y DE EK+MRL++F ENL++IE N G+++Y
Sbjct: 21 SEATSRVALHEPTIFYYHQKWMINFSRVYDDEFEKQMRLEVFTENLKFIENFNNMGSQSY 80
Query: 90 KLGTNQFSDLTNDEFRALYTGYK-MPSPSHRSTTSSTFKYQNLSMTDV-PTSLDWRDKGA 147
KLG N+F+D T +EF A +TG + S + T N +++DV T+ DWR++GA
Sbjct: 81 KLGVNKFTDWTKEEFLATHTGLSGINVTSPFEVVNETTPAWNWTVSDVLGTTKDWRNEGA 140
Query: 148 VTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKA 207
VTP+K Q ECG CWAF+A+AAVEG+TKI GNLI LSEQQLLDC+ NNGC GG+ +A
Sbjct: 141 VTPVKYQGECGGCWAFSAIAAVEGLTKIARGNLISLSEQQLLDCAREQNNGCKGGTMIEA 200
Query: 208 FAYIIQNQGIFN----------GVCGTQLDHAVTIVGF 235
F YI++N G+ + G C + A+ I GF
Sbjct: 201 FNYIVKNGGVSSENAYPYQVKEGPCRSNDIPAIVIRGF 238
Score = 87.8 bits (216), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 38/77 (49%), Positives = 57/77 (74%), Gaps = 5/77 (6%)
Query: 210 YIIQNQGIFNGV-CGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-- 266
+I + G++N CGT ++HAVT+VG+GT+++G YWL KNSWG TWG+ GY++I RD
Sbjct: 269 FIHYSGGVYNARDCGTSVNHAVTLVGYGTSQEGIKYWLAKNSWGKTWGENGYIRIRRDVE 328
Query: 267 --EGLCGIGTRSSYPLA 281
+G+CG+ +SYP+A
Sbjct: 329 WPQGMCGVAQYASYPVA 345
>gi|255646767|gb|ACU23856.1| unknown [Glycine max]
Length = 350
Score = 193 bits (491), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 114/309 (36%), Positives = 169/309 (54%), Gaps = 78/309 (25%)
Query: 43 VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTND 102
++E+ E WM++HG+ Y++ EK +R +IFK+NL++I++ NK + Y LG ++F+DL++
Sbjct: 44 LIELFESWMSRHGKIYENIEEKLLRFEIFKDNLKHIDERNKVVS-NYWLGLSEFADLSHR 102
Query: 103 EFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWA 162
EF Y G K+ S R + F Y+++ ++P S+DWR KGAV P+KNQ CG CWA
Sbjct: 103 EFNNKYLGLKVDY-SRRRESPEEFTYKDV---ELPKSVDWRKKGAVAPVKNQGSCGSCWA 158
Query: 163 FAAVAAVEGITKIRSGNLIQLSEQQLLDCS---TNGNNGCL------------GGSREKA 207
F+ VAAVEGI +I +GNL LSEQ+L+DC NG NG L G +E+
Sbjct: 159 FSTVAAVEGINQIVTGNLTSLSEQELIDCDRTYNNGCNGGLMDYAFSFIVENGGLHKEED 218
Query: 208 FAYIIQ-----------------------------------NQ----------------- 215
+ YI++ NQ
Sbjct: 219 YPYIMEEGACEMTKEETQVVTISGYHDVPQNNEQSLLKALANQPLSVAIEASGRDFQFYS 278
Query: 216 -GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLC 270
G+F+G CG+ LDH V VG+GT + G +Y +KNSWG+ WG+ GY+++ R+ EG+C
Sbjct: 279 GGVFDGHCGSDLDHGVAAVGYGTAK-GVDYITVKNSWGSKWGEKGYIRMRRNIGKPEGIC 337
Query: 271 GIGTRSSYP 279
GI +SYP
Sbjct: 338 GIYKMASYP 346
>gi|413933049|gb|AFW67600.1| cysteine protease 1 [Zea mays]
Length = 341
Score = 193 bits (491), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 92/204 (45%), Positives = 134/204 (65%), Gaps = 7/204 (3%)
Query: 19 FIIITLLVS----CASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKEN 74
F++ L+V C + + + ++ HEKWMA+HGR+YKDE EK RL++F+ N
Sbjct: 6 FLLAVLVVGSAVLCTAAAPRALAAAAAAMASRHEKWMAEHGRAYKDEAEKARRLEVFRAN 65
Query: 75 LEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT 134
E I+ N G +++L TN+F+DLT +EFRA TG + P P+ S + F+Y+N S+
Sbjct: 66 AELIDSFNAAGTHSHRLATNRFADLTVEEFRAARTGLR-PRPAP-SAGAGRFRYENFSLA 123
Query: 135 DVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN 194
D S+DWR GAVT +K+Q CGCCWAF+AVAAVEG+ KIR+G L+ LSEQ+L+DC +
Sbjct: 124 DAAQSVDWRAMGAVTGVKDQGACGCCWAFSAVAAVEGLNKIRTGRLVSLSEQELVDCDVS 183
Query: 195 G-NNGCLGGSREKAFAYIIQNQGI 217
G + GC GG + AF ++ + G+
Sbjct: 184 GVDQGCDGGLMDNAFQFVARRGGL 207
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 38/76 (50%), Positives = 52/76 (68%), Gaps = 3/76 (3%)
Query: 208 FAYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKI---V 264
A+ + G+ G CGT L+HA+T VG+GT DG YWL+KNSWG +WG+ GY++I V
Sbjct: 266 MAFRFYDSGVLGGACGTDLNHAITAVGYGTANDGTRYWLMKNSWGASWGEGGYVRIRRGV 325
Query: 265 RDEGLCGIGTRSSYPL 280
R EG+CG+ SYP+
Sbjct: 326 RGEGVCGLAKLPSYPV 341
>gi|1046373|gb|AAC49135.1| SAG12 protein [Arabidopsis thaliana]
Length = 346
Score = 193 bits (491), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 101/204 (49%), Positives = 132/204 (64%), Gaps = 5/204 (2%)
Query: 18 MFIIITLLVS-CASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLE 76
+F+ + + S C S +S +E + + H +WM +HGR Y D E+ R +FK N+E
Sbjct: 8 IFLFVAIFSSFCFSITLSRPLDNELIMQKRHIEWMTKHGRVYADVKEENNRYVVFKNNVE 67
Query: 77 YIEKANK-EGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSP--SHRSTTSSTFKYQNLSM 133
IE N RT+KL NQF+DLTNDEF ++YTG+K S S T S F+YQN+S
Sbjct: 68 RIEHLNSIPAGRTFKLAVNQFADLTNDEFCSMYTGFKGVSALSSQSQTKMSPFRYQNVSS 127
Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST 193
+P S+DWR KGAVTPIKNQ CGCCWAF+AVAA+EG T+I+ G LI LSEQQL+DC T
Sbjct: 128 GALPVSVDWRKKGAVTPIKNQGSCGCCWAFSAVAAIEGATQIKKGKLISLSEQQLVDCDT 187
Query: 194 NGNNGCLGGSREKAFAYIIQNQGI 217
N + GC GG + AF +I G+
Sbjct: 188 N-DFGCEGGLMDTAFEHIKATGGL 210
Score = 91.3 bits (225), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 37/76 (48%), Positives = 54/76 (71%), Gaps = 4/76 (5%)
Query: 208 FAYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD- 266
F + + G+F G C T LDHAVT +G+G + +G+ YW+IKNSWG WG++GYM+I +D
Sbjct: 269 FDFQFYSSGVFTGECTTYLDHAVTAIGYGESTNGSKYWIIKNSWGTKWGESGYMRIQKDV 328
Query: 267 ---EGLCGIGTRSSYP 279
+GLCG+ ++SYP
Sbjct: 329 KDKQGLCGLAMKASYP 344
>gi|356517308|ref|XP_003527330.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 342
Score = 193 bits (491), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 89/199 (44%), Positives = 133/199 (66%), Gaps = 2/199 (1%)
Query: 19 FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
++I+ L+++ + V SR E E HEKWMAQ+GR YKD EKE R ++FK N+ +I
Sbjct: 9 YLILFLVLAVWTSHVMSRRLSEACTSERHEKWMAQYGRVYKDAAEKEKRFQVFKNNVHFI 68
Query: 79 EKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPT 138
E N G++ + L NQF+DL ++EF+AL + + ++T ++F+Y+ S+T +P
Sbjct: 69 ESFNAAGDKPFNLSINQFADLNDEEFKALLINVQKKASWVETSTETSFRYE--SVTKIPA 126
Query: 139 SLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNG 198
++DWR +GAVTPIK+Q CG CWAF+AVAA EGI +I +G L+ LSEQ+L+DC + G
Sbjct: 127 TIDWRKRGAVTPIKDQGRCGSCWAFSAVAATEGIHQITTGKLVPLSEQELVDCVKGESEG 186
Query: 199 CLGGSREKAFAYIIQNQGI 217
C+GG + AF +I + GI
Sbjct: 187 CIGGYVDDAFEFIAKKGGI 205
Score = 92.0 bits (227), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 52/121 (42%), Positives = 66/121 (54%), Gaps = 12/121 (9%)
Query: 166 VAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIFNGV-CGT 224
VA ++G K+ S N E+ LL N + AF Y + GIFN CGT
Sbjct: 229 VAEIKGYEKVPSNN-----EKALLKAVANQPVSVYIDAGTHAFKYY--SSGIFNARNCGT 281
Query: 225 QLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 280
+HAV +VG+G DG+ YWL+KNSWG WG+ GY++I RD EGLCGI YP
Sbjct: 282 DPNHAVAVVGYGKALDGSKYWLVKNSWGTEWGERGYIRIKRDIRAKEGLCGIAKYPYYPT 341
Query: 281 A 281
A
Sbjct: 342 A 342
>gi|388512155|gb|AFK44139.1| unknown [Medicago truncatula]
Length = 340
Score = 193 bits (491), Expect = 7e-47, Method: Compositional matrix adjust.
Identities = 96/187 (51%), Positives = 129/187 (68%), Gaps = 7/187 (3%)
Query: 33 VSSRSTHEQ-SVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKL 91
V SR +E S+ E HE+WM+++G+ YKD +EKE R IFK+N+E+IE N N+ YKL
Sbjct: 25 VMSRKLYESPSLQERHEQWMSEYGKLYKDAIEKEKRFMIFKDNVEFIESFNAADNKPYKL 84
Query: 92 GTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPI 151
N +DLT DEF+A GYK R +++FKY+N+ T +P ++DWR KGAVTPI
Sbjct: 85 SVNHLADLTLDEFKASRNGYK---KIDREFATTSFKYENV--TAIPEAVDWRVKGAVTPI 139
Query: 152 KNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFAY 210
K+Q +CG CWAF+ VAA+EGI +I +G LI LSEQ+L+DC T G + GC GG E F +
Sbjct: 140 KDQGQCGSCWAFSTVAAIEGINQITTGKLISLSEQELVDCDTKGEDQGCEGGLMEDGFEF 199
Query: 211 IIQNQGI 217
II+N GI
Sbjct: 200 IIKNGGI 206
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 38/77 (49%), Positives = 54/77 (70%), Gaps = 5/77 (6%)
Query: 209 AYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR--- 265
+++ + GI+ G CGT+LDH VT VG+G+ +G +YW++KNSWG WG+ GY+++ R
Sbjct: 265 SFMFYSSGIYTGECGTELDHGVTAVGYGSA-NGTDYWIVKNSWGTVWGEKGYIRMQRGIA 323
Query: 266 -DEGLCGIGTRSSYPLA 281
EGLCGI SSYP A
Sbjct: 324 DKEGLCGIAMDSSYPTA 340
>gi|356521444|ref|XP_003529366.1| PREDICTED: thiol protease SEN102-like [Glycine max]
Length = 340
Score = 193 bits (490), Expect = 7e-47, Method: Compositional matrix adjust.
Identities = 104/229 (45%), Positives = 140/229 (61%), Gaps = 18/229 (7%)
Query: 19 FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
F ++ L C + SSR+ E S+ HE+WMA H R Y D EK+ R +IFKENLE+I
Sbjct: 13 FFMLFLTCICRA---SSRTLSESSIATQHEEWMAMHDRVYADSAEKDRRQQIFKENLEFI 69
Query: 79 EKANKEGNRTYKLGTNQFSDLTNDEFRALYTG--YKMPSPSHRSTTSSTFKYQNLSMTDV 136
EK N EG + Y L N F+DLTN+EF A +TG YK P+ + + + +S+ D+
Sbjct: 70 EKHNNEGKKRYNLSLNSFADLTNEEFVASHTGALYKPPTQLGSFKINHSLGFHKMSVGDI 129
Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGN 196
SLDWR +GAV IKNQ CG CWAF+AVAAVEGI +I++G L+ LSEQ L+DC++ N
Sbjct: 130 EASLDWRKRGAVNDIKNQGRCGSCWAFSAVAAVEGINQIKNGQLVSLSEQNLVDCAS--N 187
Query: 197 NGCLGGSREKAFAYIIQNQGIFN----------GVCGTQLDHAVTIVGF 235
+GC G EKAF Y I++ G+ N G C + A+ I G+
Sbjct: 188 DGCHGQYVEKAFDY-IRDYGLANEEEYPYVETVGTCSGNSNPAIQIRGY 235
Score = 94.0 bits (232), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 46/101 (45%), Positives = 68/101 (67%), Gaps = 7/101 (6%)
Query: 184 SEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGAN 243
+E+QLL + L ++ + F + + G+F+G CGT+L+HAVTIVG+G +G
Sbjct: 242 NEEQLLTAVASQPVSVLLEAKGQGFQFY--SGGVFSGECGTELNHAVTIVGYGEEAEG-K 298
Query: 244 YWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 280
YWLI+NSWG +WG+ GYMK++RD +GLCGI ++SYP
Sbjct: 299 YWLIRNSWGKSWGEGGYMKLMRDTGNPQGLCGINMQASYPF 339
>gi|357474725|ref|XP_003607647.1| Cysteine proteinase [Medicago truncatula]
gi|355508702|gb|AES89844.1| Cysteine proteinase [Medicago truncatula]
Length = 340
Score = 193 bits (490), Expect = 7e-47, Method: Compositional matrix adjust.
Identities = 96/187 (51%), Positives = 129/187 (68%), Gaps = 7/187 (3%)
Query: 33 VSSRSTHEQ-SVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKL 91
V SR +E S+ E HE+WM+++G+ YKD +EKE R IFK+N+E+IE N N+ YKL
Sbjct: 25 VMSRKLYESPSLQERHEQWMSEYGKLYKDAIEKEKRFMIFKDNVEFIESFNAADNKPYKL 84
Query: 92 GTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPI 151
N +DLT DEF+A GYK R +++FKY+N+ T +P ++DWR KGAVTPI
Sbjct: 85 SVNHLADLTLDEFKASRNGYK---KIDREFATTSFKYENV--TAIPEAVDWRVKGAVTPI 139
Query: 152 KNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFAY 210
K+Q +CG CWAF+ VAA+EGI +I +G LI LSEQ+L+DC T G + GC GG E F +
Sbjct: 140 KDQGQCGSCWAFSTVAAIEGINQITTGKLISLSEQELVDCDTKGEDQGCEGGLMEDGFEF 199
Query: 211 IIQNQGI 217
II+N GI
Sbjct: 200 IIKNGGI 206
Score = 88.2 bits (217), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 38/77 (49%), Positives = 54/77 (70%), Gaps = 5/77 (6%)
Query: 209 AYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR--- 265
+++ + GI+ G CGT+LDH VT VG+G+ +G +YW++KNSWG WG+ GY+++ R
Sbjct: 265 SFMFYSSGIYTGECGTELDHGVTAVGYGSA-NGTDYWIVKNSWGTVWGEKGYIRMQRGIA 323
Query: 266 -DEGLCGIGTRSSYPLA 281
EGLCGI SSYP A
Sbjct: 324 DKEGLCGIAMDSSYPTA 340
>gi|21666724|gb|AAM73806.1|AF448505_1 cysteine proteinase [Brassica napus]
gi|21666726|gb|AAM73807.1|AF448506_1 cysteine proteinase [Brassica napus]
Length = 343
Score = 193 bits (490), Expect = 8e-47, Method: Compositional matrix adjust.
Identities = 96/201 (47%), Positives = 136/201 (67%), Gaps = 2/201 (0%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
+F+I++L+ S + SR E ++ + H WM +HGR Y D EK R +FK N+E
Sbjct: 8 IFLIVSLVSSFSLSTTLSRPLDEVTMQKRHAAWMTEHGRVYADANEKNNRYVVFKRNVES 67
Query: 78 IEKANK-EGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDV 136
IE+ N+ + T+KL NQF+DLTN+EFR++YTGYK S T ++F+YQ++S +
Sbjct: 68 IERLNEVQYGLTFKLAVNQFADLTNEEFRSMYTGYKGNSVLSSRTKPTSFRYQHVSSDAL 127
Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGN 196
P S+DWR KGAVTPIK+Q CG CWAF+AVAA+EG+ +I+ G LI LSEQ+L+DC TN +
Sbjct: 128 PISVDWRKKGAVTPIKDQGSCGSCWAFSAVAAIEGVAQIKKGKLISLSEQELVDCDTN-D 186
Query: 197 NGCLGGSREKAFAYIIQNQGI 217
+GC+GG AF Y + G+
Sbjct: 187 DGCMGGYMNSAFNYTMTTGGL 207
Score = 84.0 bits (206), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 33/68 (48%), Positives = 47/68 (69%), Gaps = 4/68 (5%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCG 271
G+F+G C T LDH V +VG+G + +G+ YW++KNSWG WG+ GYM+I +D G CG
Sbjct: 274 GVFSGECSTHLDHGVAVVGYGKSSNGSKYWILKNSWGPKWGERGYMRIKKDTKAKHGQCG 333
Query: 272 IGTRSSYP 279
+ +SYP
Sbjct: 334 LAMNASYP 341
>gi|171702829|dbj|BAG16370.1| cysteine protease [Brassica oleracea var. italica]
Length = 332
Score = 193 bits (490), Expect = 8e-47, Method: Compositional matrix adjust.
Identities = 96/201 (47%), Positives = 136/201 (67%), Gaps = 2/201 (0%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
+F+I++L+ S + SR E ++ + H WM +HGR Y D EK R +FK N+E
Sbjct: 2 IFLIVSLVSSFSLSTTLSRPLDEVTMQKRHAAWMTEHGRVYADANEKNNRYVVFKRNVES 61
Query: 78 IEKANK-EGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDV 136
IE+ N+ + T+KL NQF+DLTN+EFR++YTGYK S T ++F+YQ++S +
Sbjct: 62 IERLNEVQYGLTFKLAVNQFADLTNEEFRSMYTGYKGNSVLSSRTKPTSFRYQHVSSDAL 121
Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGN 196
P S+DWR KGAVTPIK+Q CG CWAF+AVAA+EG+ +I+ G LI LSEQ+L+DC TN +
Sbjct: 122 PISVDWRKKGAVTPIKDQGSCGSCWAFSAVAAIEGVAQIKKGKLISLSEQELVDCDTN-D 180
Query: 197 NGCLGGSREKAFAYIIQNQGI 217
+GC+GG AF Y + G+
Sbjct: 181 DGCMGGYMNSAFNYTMTTGGL 201
Score = 76.6 bits (187), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 30/61 (49%), Positives = 43/61 (70%), Gaps = 4/61 (6%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCG 271
G+F+G C T LDH V +VG+G + +G+ YW++KNSWG WG+ GYM+I +D G CG
Sbjct: 268 GVFSGECSTHLDHGVAVVGYGKSSNGSKYWILKNSWGPKWGERGYMRIKKDTKAKHGQCG 327
Query: 272 I 272
+
Sbjct: 328 L 328
>gi|449521046|ref|XP_004167542.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP2-like [Cucumis
sativus]
Length = 297
Score = 193 bits (490), Expect = 9e-47, Method: Compositional matrix adjust.
Identities = 105/301 (34%), Positives = 167/301 (55%), Gaps = 39/301 (12%)
Query: 13 INTTPMFIIITLLVSCASQVVSSRSTH------EQSVVEIHEKWMAQHGRSYKDELEKEM 66
+ I+ +L++ S + S E+S+++++++W + H R ++ E
Sbjct: 1 MTMMKFLIVFVVLIAFTSHLCESFELEGKDFESERSLMQLYKRWSSHH-RISRNAHEMHK 59
Query: 67 RLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTF 126
R KIF++N +++ + N G ++ KL NQF+DL++DEF +Y + F
Sbjct: 60 RFKIFQDNAKHVFRVNHMG-KSLKLRLNQFADLSDDEFSMMYGSNITHYNGLHANRVGEF 118
Query: 127 KYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQ 186
Y+ ++P+S+DWR KGAV IKNQ CG CWAFAAVAAVE I +I++ L+ LSEQ
Sbjct: 119 MYERA--MNIPSSIDWRQKGAVNAIKNQGHCGSCWAFAAVAAVESIHQIKTNELVSLSEQ 176
Query: 187 QLLDCSTNGNNGCLGGSREKAFAYIIQNQGI--------FNG---------------VCG 223
+++DC GC GG+ AF +I+QN GI F G CG
Sbjct: 177 EVVDCDYK-VGGCRGGNYNSAFEFIMQNGGITIEENYPYFAGNGYCRRRGGMLREDSFCG 235
Query: 224 TQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR----DEGLCGIGTRSSYP 279
++DH V +VG+G+ E+G +YW+I+N +G WG GYMK+ R +G+CG+ + S+P
Sbjct: 236 YRIDHTVVVVGYGSDEEG-DYWIIRNQYGTQWGMNGYMKMQRGTRNPQGVCGMAMQPSFP 294
Query: 280 L 280
+
Sbjct: 295 V 295
>gi|359491865|ref|XP_002273243.2| PREDICTED: xylem cysteine proteinase 1-like [Vitis vinifera]
Length = 351
Score = 192 bits (489), Expect = 9e-47, Method: Compositional matrix adjust.
Identities = 113/313 (36%), Positives = 167/313 (53%), Gaps = 75/313 (23%)
Query: 38 THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
T + ++ E WM++HG+SY+ EK R ++F++NL++I++ NK+ + +Y LG N+F+
Sbjct: 39 TSMDKLTDLFESWMSKHGKSYRSFEEKLHRFEVFQDNLKHIDETNKKVS-SYWLGLNEFA 97
Query: 98 DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKEC 157
DL+++EF+ Y G K+ P R + F Y++++ D+P S+DWR KGAV +KNQ C
Sbjct: 98 DLSHEEFKRKYLGLKIELPKRRDSPEE-FSYKDVA--DLPKSVDWRKKGAVAHVKNQGAC 154
Query: 158 GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
G CWAF+ VAAVEGI +I +GNL LSEQ+L+DC NNGC GG + AFA+II N G+
Sbjct: 155 GSCWAFSTVAAVEGINQIVTGNLTALSEQELIDCDKPFNNGCNGGLMDYAFAFIISNGGL 214
Query: 218 FN----------GVCGTQLD--HAVTIVGF------------------------------ 235
G CG + + VTI G+
Sbjct: 215 RKEEDYPYVMEEGTCGEKKEELEVVTISGYHDVPEDNEQSFLKALANQPLSVAIEASSRG 274
Query: 236 -------------GTTED------------GANYWLIKNSWGNTWGDAGYMKIVRD---- 266
GT D G +Y +KNSWG+ WG+ GY+++ R+
Sbjct: 275 FQFYSGGIFNGHCGTELDHGVAAVGYGTSKGVDYITVKNSWGSKWGEKGYIRMKRNVGKP 334
Query: 267 EGLCGIGTRSSYP 279
EG+CGI +SYP
Sbjct: 335 EGICGIYKMASYP 347
>gi|224093956|ref|XP_002310053.1| predicted protein [Populus trichocarpa]
gi|224147016|ref|XP_002336386.1| predicted protein [Populus trichocarpa]
gi|222834869|gb|EEE73318.1| predicted protein [Populus trichocarpa]
gi|222852956|gb|EEE90503.1| predicted protein [Populus trichocarpa]
Length = 340
Score = 192 bits (489), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 102/234 (43%), Positives = 147/234 (62%), Gaps = 24/234 (10%)
Query: 19 FIIITLLVSCAS--QVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLE 76
F+ + LL + ++R+ + + E HE+WM Q+GR YKD+ E+ R IFKEN+
Sbjct: 9 FVCLALLFILGAWPSKSTARTLLDAPMYERHEQWMTQYGRVYKDDNERATRYSIFKENVA 68
Query: 77 YIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYK--MPSPSHRSTTSSTFKYQNLSMT 134
I+ N + ++YKLG NQF+DLTN+EF+A +K M SP + F+Y+N+S
Sbjct: 69 RIDAFNSQTGKSYKLGVNQFADLTNEEFKASRNRFKGHMCSPQ-----AGPFRYENVSA- 122
Query: 135 DVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN 194
VP+++DWR +GAVTP+K+Q +CGCCWAF+AVAA+EGI K+ +G LI LSEQ+++DC T
Sbjct: 123 -VPSTVDWRKEGAVTPVKDQGQCGCCWAFSAVAAMEGINKLTTGKLISLSEQEVVDCDTK 181
Query: 195 G-NNGCLGGSREKAFAYIIQNQGI----------FNGVCGTQLD--HAVTIVGF 235
G + GC GG + AF +I QN+G+ +G C T HA I GF
Sbjct: 182 GEDQGCNGGLMDDAFKFIEQNKGLTTEANYPYKGTDGTCNTNKAAIHAAKITGF 235
Score = 89.7 bits (221), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 40/70 (57%), Positives = 51/70 (72%), Gaps = 5/70 (7%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCG 271
GIF G C TQLDH VT VG+G + DG+ YWL+KNSWG WG+ GY+++ +D EGLCG
Sbjct: 272 GIFTGSCDTQLDHGVTAVGYGVS-DGSKYWLVKNSWGAQWGEEGYIRMQKDISAKEGLCG 330
Query: 272 IGTRSSYPLA 281
I ++SYP A
Sbjct: 331 IAMQASYPTA 340
>gi|357160572|ref|XP_003578808.1| PREDICTED: vignain-like [Brachypodium distachyon]
Length = 339
Score = 192 bits (489), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 94/206 (45%), Positives = 130/206 (63%), Gaps = 4/206 (1%)
Query: 13 INTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFK 72
I + I+ L C S + + + S+V HE WM Q+GR YKD EK + ++FK
Sbjct: 3 IPKASLLAILGCLCLCGSVLAARELNDDLSMVARHENWMLQYGRVYKDAAEKAQKFEVFK 62
Query: 73 ENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
N E+I N GN + LG NQF+D+TN+EF+A T S R T F Y+N+S
Sbjct: 63 ANAEFINSFNA-GNHKFWLGINQFADITNEEFKATKTNKGFISNKVRVPTG--FMYENMS 119
Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
+P ++DWR KGAVTPIK+Q +CGCCWAF+AVAA+EGI K+ +G L+ LSEQ+L+DC
Sbjct: 120 FDALPATIDWRTKGAVTPIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLVSLSEQELVDCD 179
Query: 193 TNG-NNGCLGGSREKAFAYIIQNQGI 217
+G + GC GG + AF +II+N G+
Sbjct: 180 VHGEDQGCEGGLMDDAFKFIIKNGGL 205
Score = 84.0 bits (206), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 37/122 (30%), Positives = 65/122 (53%), Gaps = 14/122 (11%)
Query: 164 AAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIFNGVCG 223
A + + E + G L++ Q + + +G + + + G+ G CG
Sbjct: 228 ATIKSYEDVPANNEGALMKAVANQPVSVAVDGGD----------MTFQFYSGGVMTGSCG 277
Query: 224 TQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 279
T LDH + +G+GTT DG +W++KNSWG +WG+ G++++ +D +G+CG+ SYP
Sbjct: 278 TDLDHGIAAIGYGTTSDGTKFWIMKNSWGTSWGENGFLRMEKDIADKKGMCGLAMEPSYP 337
Query: 280 LA 281
A
Sbjct: 338 TA 339
>gi|5823020|gb|AAD53012.1|AF089849_1 senescence-specific cysteine protease [Brassica napus]
Length = 344
Score = 192 bits (488), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 97/202 (48%), Positives = 137/202 (67%), Gaps = 3/202 (1%)
Query: 18 MFIIITLLVSCASQVVSSRST-HEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLE 76
+F+I++L+ S + + SR E ++ + H +WM +HGR Y D EK R +FK N+E
Sbjct: 8 IFLIVSLVSSFSLSITLSRPLLDEVAMQKRHAEWMTEHGRVYADANEKNNRYAVFKRNVE 67
Query: 77 YIEKANK-EGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD 135
IE+ N + T+KL NQF+DLTN+EFR++YTG+K S T ++F+YQN+S
Sbjct: 68 RIERLNDVQSGLTFKLAVNQFADLTNEEFRSMYTGFKGNSVLSSRTKPTSFRYQNVSSDA 127
Query: 136 VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG 195
+P S+DWR KGAVTPIK+Q CG CWAF+AVAA+EG+ +I+ G LI LSEQ+L+DC TN
Sbjct: 128 LPVSVDWRKKGAVTPIKDQGLCGSCWAFSAVAAIEGVAQIKKGKLISLSEQELVDCDTN- 186
Query: 196 NNGCLGGSREKAFAYIIQNQGI 217
+ GC+GG + AF Y I G+
Sbjct: 187 DGGCMGGLMDTAFNYTITIGGL 208
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 34/68 (50%), Positives = 47/68 (69%), Gaps = 4/68 (5%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCG 271
G+F+G C T LDH VT VG+G +++G YW++KNSWG WG+ GYM+I +D G CG
Sbjct: 275 GVFSGECTTHLDHGVTAVGYGRSKNGLKYWILKNSWGPKWGERGYMRIKKDIKPKHGQCG 334
Query: 272 IGTRSSYP 279
+ +SYP
Sbjct: 335 LAMNASYP 342
>gi|125570285|gb|EAZ11800.1| hypothetical protein OsJ_01674 [Oryza sativa Japonica Group]
Length = 289
Score = 192 bits (488), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 104/286 (36%), Positives = 154/286 (53%), Gaps = 34/286 (11%)
Query: 18 MFIIITLLV--SCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENL 75
+ ++ TL+ + + + + + +++ E+WMA+ G++YK EKE R IF++N+
Sbjct: 12 LLVVCTLMALQAMGADAYYNNGSDDGVTMQMFEEWMAKFGKTYKCHGEKEHRFGIFRDNV 71
Query: 76 EYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD 135
+I + +G NQF+DLTNDEF A YTG K P P + +
Sbjct: 72 HFIRGYKPQVTYDSAVGINQFADLTNDEFVATYTGAKPPHPKEAP--------RPVDPIW 123
Query: 136 VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQ-LLDCSTN 194
P +DWR +GAVT +K+Q CG CWAFAAVAA+EG+TKIR+G L LS+ + L++
Sbjct: 124 TPCCIDWRFRGAVTGVKDQGACGSCWAFAAVAAIEGLTKIRTGQLTPLSDARTLVELRNQ 183
Query: 195 GNNGCLGGSREKAF--------------AYIIQNQGIFNGVCGTQLDHAVTIVGFGTTED 240
G G+ ++AF Y G+F G CG +HAVT+VG+ +D
Sbjct: 184 HATGAAAGTPDRAFELVASTRADSRRHATYPFYKSGVFPGPCGASSNHAVTLVGY--CQD 241
Query: 241 GAN---YWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 279
GA+ YW+ KNSWG TWG GY+ + +D G CG+ YP
Sbjct: 242 GASGKKYWVAKNSWGKTWGQQGYILLEKDVLQPHGTCGLAVSPFYP 287
>gi|224096714|ref|XP_002310708.1| predicted protein [Populus trichocarpa]
gi|222853611|gb|EEE91158.1| predicted protein [Populus trichocarpa]
Length = 356
Score = 192 bits (488), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 120/325 (36%), Positives = 167/325 (51%), Gaps = 85/325 (26%)
Query: 38 THEQS-------VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYK 90
+H+QS V+ I++ W+ +HG++Y EK R +IFK NL +I++ N + NRTYK
Sbjct: 12 SHDQSSWRSDDEVMSIYKWWLQKHGKAYNRLGEKAKRFEIFKNNLRFIDEHNSQ-NRTYK 70
Query: 91 LGTNQFSDLTNDEFRALYTGYKMPSPSHR--STTSSTFKYQNLSMTDVPTSLDWRDKGAV 148
+G +F+DLTN E+RA++ G + P R + + + +Y + +P S+DWR KGAV
Sbjct: 71 VGLTKFADLTNQEYRAMFLGTR-SDPKRRLMKSKNPSERYAYKAGDKLPESVDWRGKGAV 129
Query: 149 TPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQ------QLLDCSTNG------- 195
PIK+Q CG CWAF+ VAAVEGI +I +G LI LSEQ + + NG
Sbjct: 130 NPIKDQGSCGSCWAFSTVAAVEGINQIVTGELISLSEQELVDCDRFYNAGCNGGLMDYAF 189
Query: 196 ----NNGCLGGSR---------------------------------EKAFAYIIQNQ--- 215
NNG L + EKA + +Q
Sbjct: 190 QFIINNGGLDTEKDYPYLGNDDTCDRDKMKTKAVSIDGFEDVLPFDEKALQKAVAHQPVS 249
Query: 216 ---------------GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGY 260
G+F G CGT LDH V +VG+GT E G +YWL++NSWG WG+ GY
Sbjct: 250 VAIEASGMALQFYQSGVFTGECGTALDHGVVVVGYGT-EKGLDYWLVRNSWGTEWGEHGY 308
Query: 261 MKI---VRD--EGLCGIGTRSSYPL 280
+K+ VRD G CGI SSYP+
Sbjct: 309 IKMQRNVRDTYTGRCGIAMESSYPV 333
>gi|171702841|dbj|BAG16376.1| cysteine protease [Brassica rapa var. perviridis]
Length = 333
Score = 192 bits (488), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 97/202 (48%), Positives = 137/202 (67%), Gaps = 3/202 (1%)
Query: 18 MFIIITLLVSCASQVVSSRST-HEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLE 76
+F+I++L+ S + + SR E ++ + H +WM +HGR Y D EK R +FK N+E
Sbjct: 2 IFLIVSLVSSFSLSITLSRPLLDEVAMQKRHAEWMTEHGRVYADANEKNNRYAVFKRNVE 61
Query: 77 YIEKANK-EGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD 135
IE+ N + T+KL NQF+DLTN+EFR++YTG+K S T ++F+YQN+S
Sbjct: 62 RIERLNDVQSGLTFKLAVNQFADLTNEEFRSMYTGFKGNSVLSSRTKPTSFRYQNVSSDA 121
Query: 136 VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG 195
+P S+DWR KGAVTPIK+Q CG CWAF+AVAA+EG+ +I+ G LI LSEQ+L+DC TN
Sbjct: 122 LPVSVDWRKKGAVTPIKDQGLCGSCWAFSAVAAIEGVAQIKKGKLISLSEQELVDCDTN- 180
Query: 196 NNGCLGGSREKAFAYIIQNQGI 217
+ GC+GG + AF Y I G+
Sbjct: 181 DGGCMGGLMDTAFNYTITIGGL 202
Score = 75.5 bits (184), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 31/61 (50%), Positives = 43/61 (70%), Gaps = 4/61 (6%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCG 271
G+F+G C T LDH VT VG+G +++G YW++KNSWG WG+ GYM+I +D G CG
Sbjct: 269 GVFSGECTTHLDHGVTAVGYGRSKNGLKYWILKNSWGPKWGERGYMRIKKDIKPKHGQCG 328
Query: 272 I 272
+
Sbjct: 329 L 329
>gi|60100207|gb|AAX13273.1| putative cysteine protease [Oryza sativa Japonica Group]
Length = 349
Score = 192 bits (488), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 87/178 (48%), Positives = 126/178 (70%), Gaps = 3/178 (1%)
Query: 42 SVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNR-TYKLGTNQFSDLT 100
++ + HE+WMA+HGR+Y D+ EK RL++F++N+ +IE N ++ + L NQF+DLT
Sbjct: 35 AMAQRHERWMAKHGRAYADDAEKARRLEVFRDNVAFIESVNAAASQHKFWLEENQFADLT 94
Query: 101 NDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCC 160
N EFRA TG + PS S + ++F+Y N+S D+P S+DWR KGAV P+K+Q +CGCC
Sbjct: 95 NAEFRATRTGLR-PSSSRGNRAPTSFRYANVSTGDLPASVDWRGKGAVNPVKDQGDCGCC 153
Query: 161 WAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFAYIIQNQGI 217
WAF+AVAA+EG K+ +G L+ LSEQQL+ C G + GC GG + AF +II+N G+
Sbjct: 154 WAFSAVAAMEGAVKLATGKLVSLSEQQLVSCDVKGEDQGCEGGLMDDAFDFIIKNGGL 211
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 35/72 (48%), Positives = 49/72 (68%), Gaps = 6/72 (8%)
Query: 216 GIFNGV--CGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR----DEGL 269
G+ +G C T+LDHA+T VG+G DG YWL+KNSWG +WG+ GY+++ R EG+
Sbjct: 278 GVLSGAAGCATELDHAITAVGYGVASDGTKYWLMKNSWGTSWGEDGYVRMERGVADKEGV 337
Query: 270 CGIGTRSSYPLA 281
CG+ +SYP A
Sbjct: 338 CGLAMMASYPTA 349
>gi|449448298|ref|XP_004141903.1| PREDICTED: germination-specific cysteine protease 1-like [Cucumis
sativus]
gi|449531757|ref|XP_004172852.1| PREDICTED: germination-specific cysteine protease 1-like [Cucumis
sativus]
Length = 365
Score = 192 bits (487), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 102/238 (42%), Positives = 147/238 (61%), Gaps = 17/238 (7%)
Query: 15 TTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKEN 74
TT + ++ +S ++ +S RS E V EI++ W+A+HG++Y E+E R +IFKEN
Sbjct: 5 TTSLALLSFFFLSISASALSRRSDGE--VREIYDLWLAKHGKAYNGIDEREKRFQIFKEN 62
Query: 75 LEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTF--KYQNLS 132
L++I+ N E NRTYK+G N F+DLTN+E+RALY G + P P+ R + T +Y +
Sbjct: 63 LKFIDDHNSE-NRTYKVGLNMFADLTNEEYRALYLGTRSP-PARRVMKAKTASRRYAVNN 120
Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
+ +P S+DWR +GAV P+KNQ CG CWAF+ +AAVEGI +I +G LI LSEQ+L+ C
Sbjct: 121 LDRLPESMDWRTRGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGELISLSEQELVSCD 180
Query: 193 TNGNNGCLGGSREKAFAYIIQNQGI----------FNGVCGTQLDHAVTIVGFGTTED 240
N+GC GG + AF +II N G+ F+G C +A +V ED
Sbjct: 181 KKYNSGCNGGLMDYAFQFIIDNGGLDTEEDYPYEAFDGQCDPTRKNA-KVVSIDAYED 237
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 36/78 (46%), Positives = 51/78 (65%), Gaps = 6/78 (7%)
Query: 208 FAYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD- 266
A + G+F G CG+ LDH V VG+G E+G +YWL++NSWG +WG+ GY K+ R+
Sbjct: 264 LALQLYQSGVFTGKCGSALDHGVVAVGYGK-ENGVDYWLVRNSWGTSWGEDGYFKLERNV 322
Query: 267 ----EGLCGIGTRSSYPL 280
EG CGI ++SYP+
Sbjct: 323 KHITEGKCGIAMQASYPV 340
>gi|356545118|ref|XP_003540992.1| PREDICTED: thiol protease SEN102-like [Glycine max]
Length = 337
Score = 192 bits (487), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 97/200 (48%), Positives = 131/200 (65%), Gaps = 8/200 (4%)
Query: 20 IIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIE 79
I + LL++ + SR HE S+ E HE+WMA++G+ YKD EKE R IFK N+E+IE
Sbjct: 11 IALFLLLALGIPQMMSRKLHETSMRERHEQWMAEYGKVYKDAAEKEKRFLIFKHNVEFIE 70
Query: 80 KANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTS 139
N N+ YKLG N +DLT +EF+A G K P +++ FKY+N+ T +P +
Sbjct: 71 SFNAAANKPYKLGVNHLADLTVEEFKASRNGLKRP----YELSTTPFKYENV--TAIPAA 124
Query: 140 LDWRDKGAVTPIKNQKEC-GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NN 197
+DWR KGAVT IK+Q +C G CWAF+ VAA EGI +I +G L+ LSEQ+L+DC T G +
Sbjct: 125 IDWRTKGAVTSIKDQGQCAGSCWAFSTVAATEGIHQITTGKLVSLSEQELVDCDTKGVDQ 184
Query: 198 GCLGGSREKAFAYIIQNQGI 217
GC GG E F +II+N GI
Sbjct: 185 GCEGGYMEDGFEFIIKNGGI 204
Score = 88.2 bits (217), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 49/130 (37%), Positives = 70/130 (53%), Gaps = 18/130 (13%)
Query: 159 CCWAFAAVAAVEGITKIRSGN---LIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQ 215
C A + VA ++G K+ + L + Q + S + N ++ +
Sbjct: 219 CNKATSPVAQIKGYEKVPPNSEKTLQKAVANQPVSVSIDANGE----------GFMFYSS 268
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR----DEGLCG 271
GI+NG CGT+LDH VT VG+G +G +YWL+KNSWG WG+ GY+++ R GLCG
Sbjct: 269 GIYNGECGTELDHGVTAVGYGIA-NGTDYWLVKNSWGTQWGEKGYVRMQRGVAAKHGLCG 327
Query: 272 IGTRSSYPLA 281
I SSYP A
Sbjct: 328 IALDSSYPTA 337
>gi|356517184|ref|XP_003527269.1| PREDICTED: xylem cysteine proteinase 2-like [Glycine max]
Length = 350
Score = 191 bits (486), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 114/309 (36%), Positives = 168/309 (54%), Gaps = 78/309 (25%)
Query: 43 VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTND 102
++E+ E W+++HG+ Y+ EK R +IFK+NL++I++ NK + Y LG N+F+DL++
Sbjct: 44 LIELFESWISRHGKIYQSIEEKLHRFEIFKDNLKHIDERNKVVS-NYWLGLNEFADLSHQ 102
Query: 103 EFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWA 162
EF+ Y G K+ S R + F Y+++ ++P S+DWR KGAVT +KNQ CG CWA
Sbjct: 103 EFKNKYLGLKVDY-SRRRESPEEFTYKDV---ELPKSVDWRKKGAVTQVKNQGSCGSCWA 158
Query: 163 FAAVAAVEGITKIRSGNLIQLSEQQLLDCS---TNGNNGCL------------GGSREKA 207
F+ VAAVEGI +I +GNL LSEQ+L+DC NG NG L G +E+
Sbjct: 159 FSTVAAVEGINQIVTGNLTSLSEQELIDCDRTYNNGCNGGLMDYAFSFIVENDGLHKEED 218
Query: 208 FAYIIQ-----------------------------------NQ----------------- 215
+ YI++ NQ
Sbjct: 219 YPYIMEEGTCEMAKEETEVVTISGYHDVPQNNEQSLLKALANQPLSVAIEASGRDFQFYS 278
Query: 216 -GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLC 270
G+F+G CG+ LDH V VG+GT + G +Y +KNSWG+ WG+ GY+++ R+ EG+C
Sbjct: 279 GGVFDGHCGSDLDHGVAAVGYGTAK-GVDYITVKNSWGSKWGEKGYIRMRRNIGKPEGIC 337
Query: 271 GIGTRSSYP 279
GI +SYP
Sbjct: 338 GIYKMASYP 346
>gi|38346007|emb|CAD40110.2| OSJNBa0035O13.9 [Oryza sativa Japonica Group]
gi|125589429|gb|EAZ29779.1| hypothetical protein OsJ_13837 [Oryza sativa Japonica Group]
Length = 314
Score = 191 bits (486), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 87/177 (49%), Positives = 125/177 (70%), Gaps = 3/177 (1%)
Query: 43 VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNR-TYKLGTNQFSDLTN 101
+ + HE+WMA+HGR+Y D+ EK RL++F++N+ +IE N ++ + L NQF+DLTN
Sbjct: 1 MAQRHERWMAKHGRAYADDAEKARRLEVFRDNVAFIESVNAAASQHKFWLEENQFADLTN 60
Query: 102 DEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCW 161
EFRA TG + PS S + ++F+Y N+S D+P S+DWR KGAV P+K+Q +CGCCW
Sbjct: 61 AEFRATRTGLR-PSSSRGNRAPTSFRYANVSTGDLPASVDWRGKGAVNPVKDQGDCGCCW 119
Query: 162 AFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFAYIIQNQGI 217
AF+AVAA+EG K+ +G L+ LSEQQL+ C G + GC GG + AF +II+N G+
Sbjct: 120 AFSAVAAMEGAVKLATGKLVSLSEQQLVSCDVKGEDQGCEGGLMDDAFDFIIKNGGL 176
Score = 82.0 bits (201), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 35/72 (48%), Positives = 49/72 (68%), Gaps = 6/72 (8%)
Query: 216 GIFNGV--CGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR----DEGL 269
G+ +G C T+LDHA+T VG+G DG YWL+KNSWG +WG+ GY+++ R EG+
Sbjct: 243 GVLSGAAGCATELDHAITAVGYGVASDGTKYWLMKNSWGTSWGEDGYVRMERGVADKEGV 302
Query: 270 CGIGTRSSYPLA 281
CG+ +SYP A
Sbjct: 303 CGLAMMASYPTA 314
>gi|224162986|ref|XP_002338508.1| predicted protein [Populus trichocarpa]
gi|222872535|gb|EEF09666.1| predicted protein [Populus trichocarpa]
Length = 306
Score = 191 bits (486), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 98/208 (47%), Positives = 137/208 (65%), Gaps = 22/208 (10%)
Query: 43 VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTND 102
+ E HE+WM Q+GR YKD+ E+ R IFKEN+ I+ N + ++YKLG NQF+DLTN+
Sbjct: 1 MYERHEQWMTQYGRVYKDDNERATRYSIFKENVARIDAFNSQTGKSYKLGVNQFADLTNE 60
Query: 103 EFRALYTGYK--MPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCC 160
EF+A +K M SP + F+Y+N+S VP+++DWR +GAVTP+K+Q +CGCC
Sbjct: 61 EFKASRNRFKGHMCSPQ-----AGPFRYENVSA--VPSTVDWRKEGAVTPVKDQGQCGCC 113
Query: 161 WAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFAYIIQNQGI-- 217
WAF+AVAA+EGI K+ +G LI LSEQ+++DC T G + GC GG + AF +I QN+G+
Sbjct: 114 WAFSAVAAMEGINKLTTGKLISLSEQEVVDCDTKGEDQGCNGGLMDDAFKFIEQNKGLTT 173
Query: 218 --------FNGVCGTQLD--HAVTIVGF 235
+G C T+ HA I GF
Sbjct: 174 EANYPYKGTDGTCNTKKSAIHAAKITGF 201
Score = 89.7 bits (221), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 40/70 (57%), Positives = 51/70 (72%), Gaps = 5/70 (7%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCG 271
GIF G C TQLDH VT VG+G + DG+ YWL+KNSWG WG+ GY+++ +D EGLCG
Sbjct: 238 GIFTGSCDTQLDHGVTAVGYGVS-DGSKYWLVKNSWGAQWGEEGYIRMQKDISAKEGLCG 296
Query: 272 IGTRSSYPLA 281
I ++SYP A
Sbjct: 297 IAMQASYPTA 306
>gi|242086591|ref|XP_002439128.1| hypothetical protein SORBIDRAFT_09g000960 [Sorghum bicolor]
gi|241944413|gb|EES17558.1| hypothetical protein SORBIDRAFT_09g000960 [Sorghum bicolor]
Length = 371
Score = 191 bits (485), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 90/180 (50%), Positives = 128/180 (71%), Gaps = 3/180 (1%)
Query: 38 THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
H ++++ E+W+A++ ++Y EK R ++FK+NL +I++ANK+ TY LG N F+
Sbjct: 57 VHHDRLIKLFEEWVAKYRKAYASFEEKLHRFEVFKDNLHHIDEANKKVT-TYWLGLNAFA 115
Query: 98 DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKEC 157
DLT+DEF+A Y G + P + TT S F+Y ++ DVP S+DWR KGAVT +KNQ +C
Sbjct: 116 DLTHDEFKATYLGLRQPET--KKTTDSRFRYGGVADDDVPASVDWRKKGAVTDVKNQGQC 173
Query: 158 GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
G CWAF+ VAAVEGI +I +GNL LSEQ+L+DCST+GNNGC GG + AF+YI + G+
Sbjct: 174 GSCWAFSTVAAVEGINQIVTGNLTSLSEQELVDCSTDGNNGCNGGVMDNAFSYIASSGGL 233
Score = 79.0 bits (193), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 34/68 (50%), Positives = 50/68 (73%), Gaps = 5/68 (7%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR----DEGLCG 271
G+FNG CG++LDH V VG+G+++ G +Y ++KNSWG+ WG+ GY+++ R EGLCG
Sbjct: 301 GVFNGPCGSELDHGVAAVGYGSSK-GQDYIIVKNSWGSHWGEKGYIRMKRGTGKPEGLCG 359
Query: 272 IGTRSSYP 279
I +SYP
Sbjct: 360 INKMASYP 367
>gi|356543122|ref|XP_003540012.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 342
Score = 191 bits (484), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 102/211 (48%), Positives = 134/211 (63%), Gaps = 21/211 (9%)
Query: 28 CASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNR 87
C SQV SR H+ S+ E HE+WM ++G+ YKD E E R IF+ N+E+IE N GN+
Sbjct: 20 CTSQV-KSRKLHDASMYERHEQWMEKYGKVYKDSAEXEKRFLIFENNVEFIESFNAAGNK 78
Query: 88 TYKLGTNQFSDLTNDEFRALYTGYKMPSPSH----RSTTSSTFKYQNLSMTDVPTSLDWR 143
YKL N +D TN+EF A + GYK SH R TT + FKY+N+ TD+P ++DWR
Sbjct: 79 PYKLSINHLADQTNEEFMASHKGYK---GSHWQGLRITTQTPFKYENV--TDIPWAVDWR 133
Query: 144 DKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGS 203
KG T IK+Q +CG CWAF+AVAA EGI +I +GNL+ LSEQ+L+DC + ++GC GG
Sbjct: 134 QKGDATSIKDQGQCGICWAFSAVAATEGIYQITTGNLVSLSEQELVDCDSV-DHGCDGGL 192
Query: 204 REKAFAYIIQNQGI----------FNGVCGT 224
E F +II+N GI NG C T
Sbjct: 193 MEHGFEFIIKNGGISSEANYPYTAVNGTCDT 223
Score = 97.1 bits (240), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 41/77 (53%), Positives = 55/77 (71%), Gaps = 4/77 (5%)
Query: 209 AYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR--- 265
A+ + G+F G CGTQLDH VT VG+G+T+DG YW++KNSWG WG+ GY++++R
Sbjct: 266 AFQFYSSGVFTGQCGTQLDHGVTAVGYGSTDDGIQYWIVKNSWGTQWGEEGYIRMLRGID 325
Query: 266 -DEGLCGIGTRSSYPLA 281
EGLCGI +SYP A
Sbjct: 326 AQEGLCGIAMDASYPTA 342
>gi|125547258|gb|EAY93080.1| hypothetical protein OsI_14881 [Oryza sativa Indica Group]
Length = 314
Score = 191 bits (484), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 87/177 (49%), Positives = 125/177 (70%), Gaps = 3/177 (1%)
Query: 43 VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNR-TYKLGTNQFSDLTN 101
+ + HE+WMA+HGR+Y D+ EK RL++F++N+ +IE N ++ + L NQF+DLTN
Sbjct: 1 MAQRHERWMAKHGRAYADDAEKVRRLEVFRDNVAFIESVNAAASQHKFWLEENQFADLTN 60
Query: 102 DEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCW 161
EFRA TG + PS S + ++F+Y N+S D+P S+DWR KGAV P+K+Q +CGCCW
Sbjct: 61 AEFRATRTGLR-PSSSRGNRAPTSFRYANVSTGDLPASVDWRGKGAVNPVKDQGDCGCCW 119
Query: 162 AFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFAYIIQNQGI 217
AF+AVAA+EG K+ +G L+ LSEQQL+ C G + GC GG + AF +II+N G+
Sbjct: 120 AFSAVAAMEGAVKLATGKLVSLSEQQLVSCDVKGEDQGCEGGLMDDAFDFIIKNGGL 176
Score = 82.0 bits (201), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 35/72 (48%), Positives = 49/72 (68%), Gaps = 6/72 (8%)
Query: 216 GIFNGV--CGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR----DEGL 269
G+ +G C T+LDHA+T VG+G DG YWL+KNSWG +WG+ GY+++ R EG+
Sbjct: 243 GVLSGAAGCATELDHAITAVGYGVASDGTKYWLMKNSWGTSWGEDGYVRMERGVADKEGV 302
Query: 270 CGIGTRSSYPLA 281
CG+ +SYP A
Sbjct: 303 CGLAMMASYPTA 314
>gi|194352750|emb|CAQ00103.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
gi|326514262|dbj|BAJ92281.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326519402|dbj|BAJ96700.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326524351|dbj|BAK00559.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326531998|dbj|BAK01375.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 356
Score = 191 bits (484), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 99/221 (44%), Positives = 142/221 (64%), Gaps = 16/221 (7%)
Query: 41 QSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLT 100
+ +VE+ EKW+A+H ++Y EK R ++FK+NL++I+K N+E +Y LG N+F+DLT
Sbjct: 43 ERLVELFEKWLAKHQKAYASFEEKLHRFEVFKDNLKHIDKINREVT-SYWLGLNEFADLT 101
Query: 101 NDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCC 160
+DEF+A Y G + + R +S +F+Y+++S +D+P S+DWR KGAVT +KNQ +CG C
Sbjct: 102 HDEFKAAYLG--LDAAPARRGSSRSFRYEDVSASDLPKSVDWRKKGAVTEVKNQGQCGSC 159
Query: 161 WAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIF-- 218
WAF+ VAAVEGI I +GNL LSEQ+L+DCS +GN+GC GG + AF+YI + G+
Sbjct: 160 WAFSTVAAVEGINAIVTGNLTALSEQELIDCSVDGNSGCNGGLMDYAFSYIASSGGLHTE 219
Query: 219 --------NGVCG---TQLDHAVTIVGFGTTEDGANYWLIK 248
G CG AVTI G+ LIK
Sbjct: 220 EAYPYLMEEGSCGDGKKAESEAVTISGYEDVPANDEQALIK 260
Score = 75.5 bits (184), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 33/69 (47%), Positives = 47/69 (68%), Gaps = 5/69 (7%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTE-DGANYWLIKNSWGNTWGDAGYMKIVR----DEGLC 270
G+F+G CG QLDH V VG+G+ + G +Y +++NSWG WG+ GY+++ R EGLC
Sbjct: 284 GVFDGPCGAQLDHGVAAVGYGSDKGKGHDYIIVRNSWGAQWGEKGYIRMKRGTSNGEGLC 343
Query: 271 GIGTRSSYP 279
GI +SYP
Sbjct: 344 GINKMASYP 352
>gi|357167196|ref|XP_003581047.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like
[Brachypodium distachyon]
Length = 338
Score = 190 bits (483), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 89/202 (44%), Positives = 131/202 (64%), Gaps = 5/202 (2%)
Query: 19 FIIITLLVSCASQVVSSRSTHEQS--VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLE 76
F+ ++ + A + +R + + HE+WMA++GR Y D EK RL++FK N+
Sbjct: 3 FLFALVVCTFALGALGARDLADDDWLIAARHEQWMARYGRVYSDVAEKARRLEVFKANVG 62
Query: 77 YIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDV 136
+IE N GN + L NQF+D+T DEFRA++ GYKM ++ + F+Y N+S+ D+
Sbjct: 63 FIESVNA-GNHKFWLEANQFADITKDEFRAMHKGYKMQVIGSKARATG-FRYANVSIDDL 120
Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-G 195
P S+DWR GAVTP+K+Q +CGCCWAF+ VA++EGI K+ +G LI LSEQ+L+DC
Sbjct: 121 PASVDWRANGAVTPVKDQGQCGCCWAFSTVASMEGIVKVSTGKLISLSEQELVDCDVGMQ 180
Query: 196 NNGCLGGSREKAFAYIIQNQGI 217
N GC GG + AF +I+ N G+
Sbjct: 181 NKGCGGGLMDNAFEFIVNNGGL 202
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 34/70 (48%), Positives = 47/70 (67%), Gaps = 4/70 (5%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCG 271
G+ G CGT+LDH V VG+G DG YWL+KNSWG +WG+ G++++ RD G+CG
Sbjct: 269 GVLTGACGTELDHGVAAVGYGVAGDGTKYWLVKNSWGTSWGEDGFIRLERDVADEAGMCG 328
Query: 272 IGTRSSYPLA 281
+ + SYP A
Sbjct: 329 LAMKPSYPTA 338
>gi|356515044|ref|XP_003526211.1| PREDICTED: LOW QUALITY PROTEIN: thiol protease SEN102-like [Glycine
max]
Length = 337
Score = 190 bits (483), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 102/218 (46%), Positives = 137/218 (62%), Gaps = 18/218 (8%)
Query: 20 IIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIE 79
+ + LL+S V SR HE S+ E HE W+A++G+ YK EKE +IFKEN+E+IE
Sbjct: 11 LALFLLLSIEISQVMSRKLHETSLREEHENWIARYGQVYKVAAEKET-FQIFKENVEFIE 69
Query: 80 KANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTS 139
N N+ YKLG N F+DLT +EF+ G K +H + + FKY+N+ TD+P +
Sbjct: 70 SFNAAANKPYKLGVNLFADLTLEEFKDFRFGLK---KTHEFSITP-FKYENV--TDIPEA 123
Query: 140 LDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNG 198
LDWR+KGAVTPIK+Q +CG CWAF+ VAA EGI +I +GNL+ L EQ+L+ C T G + G
Sbjct: 124 LDWREKGAVTPIKDQGQCGSCWAFSTVAATEGIHQITTGNLVSLXEQELVSCDTKGVDQG 183
Query: 199 CLGGSREKAFAYIIQNQGI----------FNGVCGTQL 226
C GG E F +II+N GI NG C T +
Sbjct: 184 CEGGYMEDGFEFIIKNGGITTKANYPYKGVNGTCNTTI 221
Score = 81.3 bits (199), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 46/130 (35%), Positives = 68/130 (52%), Gaps = 18/130 (13%)
Query: 157 CGCCWAFAAVAAVEGITKIRSGN---LIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQ 213
C A + VA ++G + S + L + Q + S + NNG ++
Sbjct: 217 CNTTIAASTVAQIKGYETVPSYSEEALQKAVANQPVSVSIDANNG----------HFMFY 266
Query: 214 NQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR----DEGL 269
GI+ G CGT LDH VT VG+GTT + +YW++KNSWG W + G++++ R GL
Sbjct: 267 AGGIYTGECGTDLDHGVTAVGYGTTNE-TDYWIVKNSWGTGWDEKGFIRMQRGITVKHGL 325
Query: 270 CGIGTRSSYP 279
CG+ SSYP
Sbjct: 326 CGVALDSSYP 335
>gi|125606655|gb|EAZ45691.1| hypothetical protein OsJ_30364 [Oryza sativa Japonica Group]
Length = 326
Score = 190 bits (482), Expect = 7e-46, Method: Compositional matrix adjust.
Identities = 103/295 (34%), Positives = 152/295 (51%), Gaps = 55/295 (18%)
Query: 40 EQSVVEIHEKWMAQHG---RSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQF 96
E+S+ ++++W +G S +D +K R ++FK+N YI N++ +YKLG N+F
Sbjct: 19 EESMWSLYQRWRHVYGAASSSPRDLADKGSRFEVFKKNARYIHDFNRKKGMSYKLGLNKF 78
Query: 97 SDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKE 156
+DLT +EF A YTG P P + D P + DWR+ GAVT +K+Q
Sbjct: 79 ADLTLEEFTAKYTGAN-PGPITGLKNGTGSPPLAAVAGDAPPAWDWREHGAVTRVKDQGP 137
Query: 157 CGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNN------------------- 197
CG CWAF+ V AVEGI +I +GN + LSEQQ T G N
Sbjct: 138 CGSCWAFSVVEAVEGINEIMTGNFLTLSEQQCFSPPTTGENYFYYPAYEAVQEPCRFDPN 197
Query: 198 ----------------------------GCLGGSREKAFAYIIQNQGIFNGVCGTQLDHA 229
G + E ++ ++I G+F+G CGT+L+HA
Sbjct: 198 KAPIVKIDSYSFVDPNDEEALKQAVYSQGPVSVLIEASYEFMIYQGGVFSGPCGTELNHA 257
Query: 230 VTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 280
V +VG+ TEDG YW++KNSWG WG++GY++++R+ EG+CGI YP+
Sbjct: 258 VLVVGYDETEDGTPYWIVKNSWGAGWGESGYIRMIRNIPAPEGICGIAMYPIYPI 312
>gi|302816222|ref|XP_002989790.1| hypothetical protein SELMODRAFT_184826 [Selaginella moellendorffii]
gi|300142356|gb|EFJ09057.1| hypothetical protein SELMODRAFT_184826 [Selaginella moellendorffii]
Length = 358
Score = 190 bits (482), Expect = 7e-46, Method: Compositional matrix adjust.
Identities = 97/208 (46%), Positives = 133/208 (63%), Gaps = 17/208 (8%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
++S ++EKWM HGR Y EKE R +IF++N EYIE+ N++ N+TY LG N F+D+
Sbjct: 27 DRSFRALYEKWMVDHGRVYNGIGEKERRFQIFRDNAEYIEEHNRQVNQTYWLGLNNFADM 86
Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGC 159
T+DEF+ALY G K+P + T S F+Y++ T++P DWR KGAV +KNQ CG
Sbjct: 87 THDEFKALYFGTKVPLSN---TIKSGFRYKD--ATNLPLDTDWRSKGAVATVKNQGACGS 141
Query: 160 CWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI-- 217
CWAF+ VAAVEG+ +I +G L+ LSEQ+L+DC N GC GG + AF +IIQN G+
Sbjct: 142 CWAFSTVAAVEGVNQIVTGELVSLSEQELVDCDKQKNQGCNGGLMDSAFEFIIQNGGLDS 201
Query: 218 -----FNGVCGT-----QLDHAVTIVGF 235
+ V G+ + H VTI GF
Sbjct: 202 EADYPYKAVSGSCDESRRNSHVVTIDGF 229
Score = 75.5 bits (184), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 33/73 (45%), Positives = 50/73 (68%), Gaps = 8/73 (10%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGT--TEDGA--NYWLIKNSWGNTWGDAGYMKIVRD----E 267
G++ G CG +LDH V VG+GT T DG +YW+++NSWG+ WG++GY+++ R+
Sbjct: 266 GVYTGHCGYELDHGVVAVGYGTSKTPDGVATDYWIVRNSWGDAWGESGYIRLQRNVASPR 325
Query: 268 GLCGIGTRSSYPL 280
G CGI +SYP+
Sbjct: 326 GKCGIAMMASYPV 338
>gi|167345242|gb|ABZ69061.1| cysteine protease [Pinus sylvestris]
Length = 214
Score = 189 bits (481), Expect = 8e-46, Method: Compositional matrix adjust.
Identities = 89/187 (47%), Positives = 133/187 (71%), Gaps = 3/187 (1%)
Query: 32 VVSSRSTHEQ-SVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYK 90
++S++ E +++E++E W+A+H ++Y EK+ R +FK+N YI + N +GNR+YK
Sbjct: 26 IISNKDLREDDAIMELYELWLAEHKKAYNGLDEKQKRFTVFKDNFLYIHEHN-QGNRSYK 84
Query: 91 LGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTP 150
LG NQF+DL+++EF+A Y G K+ + R S + +YQ D+P S+DWR+KGAV P
Sbjct: 85 LGLNQFADLSHEEFKATYLGAKLDTKK-RLLRSPSPRYQYSDGEDLPKSIDWREKGAVAP 143
Query: 151 IKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAY 210
+K+Q +CG CWAF+ VAAVEGI +I +G+LI LSEQ+L+DC T+ N GC GG R+ AF +
Sbjct: 144 VKDQGQCGSCWAFSTVAAVEGINQIVTGDLISLSEQELVDCDTSYNQGCNGGLRDYAFEF 203
Query: 211 IIQNQGI 217
II N G+
Sbjct: 204 IINNGGL 210
>gi|374530932|gb|AEP83812.2| cysteine endopeptidase EP8 [Secale cereale x Triticum durum]
Length = 364
Score = 189 bits (481), Expect = 8e-46, Method: Compositional matrix adjust.
Identities = 108/316 (34%), Positives = 158/316 (50%), Gaps = 79/316 (25%)
Query: 40 EQSVVEIHEKWMAQHGRSYKD--ELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
E+++ ++E+W + + S + +E R +FKEN YI + NK+ +R ++L N+F+
Sbjct: 33 EENLRGLYERWRSHYTVSRRGLGADAEERRFNVFKENARYIHEGNKK-DRPFRLALNKFA 91
Query: 98 DLTNDEFRALYTGYKMP---SPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQ 154
D+T DEFR Y G ++ S S +F+Y + ++P ++DWR KGAVT IK+Q
Sbjct: 92 DMTTDEFRRTYAGSRVRHHLSLSGGRRGDGSFRYGDAD--NLPPAVDWRQKGAVTAIKDQ 149
Query: 155 KECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST--------------------- 193
+CG CWAF+ + AVEGI KIR+G L+ LSEQ+L+DC
Sbjct: 150 GQCGSCWAFSTIVAVEGINKIRTGKLVSLSEQELMDCDNVNNQGCDGGLMDYAFQFIHKN 209
Query: 194 ----------NGNNGCLGGSREKAFAYIIQ------------------------------ 213
G G ++EKA A I
Sbjct: 210 GITTESNYPYQGEQGSCDLAKEKAHAVTIDGYEDVPANDESALQKAVAGQPVSVAIDASG 269
Query: 214 ------NQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD- 266
++G+F G C T LDH V VG+GTT DG YW++KNSWG WG+ GY+++ R
Sbjct: 270 NDFQFYSEGVFTGECSTDLDHGVAAVGYGTTRDGTKYWIVKNSWGEDWGEKGYIRMQRGV 329
Query: 267 ---EGLCGIGTRSSYP 279
EG CGI ++SYP
Sbjct: 330 SQAEGQCGIAMQASYP 345
>gi|186701255|gb|ACC91281.1| putative cysteine proteinase [Capsella rubella]
Length = 324
Score = 189 bits (481), Expect = 8e-46, Method: Compositional matrix adjust.
Identities = 117/325 (36%), Positives = 167/325 (51%), Gaps = 75/325 (23%)
Query: 15 TTPMFIIITLLVSCASQ--VVSSRSTHEQSVVEIHEKWMAQHGRSYKDEL-EKEMRLKIF 71
T + II L S A V S + V I + WM++HG++Y + L +KE R + F
Sbjct: 11 TLSLLIIFLLPPSSAMDLSVTSGGLRSNEEVGFIFQTWMSKHGKTYTNALGDKEQRFQNF 70
Query: 72 KENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNL 131
K+NL +I++ N + N +Y+LG QF+DLT E++ L++G + + T +Y L
Sbjct: 71 KDNLRFIDQHNAK-NLSYRLGLTQFADLTVQEYQDLFSGRPI---QKQKALRVTHRYVPL 126
Query: 132 SMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDC 191
+ +P S+DWR KGAV+ IK+Q C VE I KI +G LI LSEQ+L+DC
Sbjct: 127 AEDQLPQSVDWRQKGAVSEIKDQGRC----------TVESINKIVTGELISLSEQELVDC 176
Query: 192 STNGNNGCLGGSREKAFAYIIQNQ------------------------------------ 215
S + N+GC GG + AF ++I N
Sbjct: 177 SID-NHGCNGGLMDSAFQFLINNNGLEYQSDYPYQAVQGYCNHNQNTSKKVIKIDGYEDV 235
Query: 216 ----------------GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAG 259
GI+ G CGT LDHAV IVG+G TE+G +YW+++NSWG WG+AG
Sbjct: 236 PANNENSLQKAVAHQPGIYTGPCGTDLDHAVVIVGYG-TENGQDYWIVRNSWGTVWGEAG 294
Query: 260 YMKIVRD----EGLCGIGTRSSYPL 280
Y KI R+ G+CGI +SYP+
Sbjct: 295 YAKIARNFENPTGVCGIAMVASYPI 319
>gi|356515040|ref|XP_003526209.1| PREDICTED: thiol protease SEN102-like [Glycine max]
Length = 342
Score = 189 bits (481), Expect = 9e-46, Method: Compositional matrix adjust.
Identities = 100/216 (46%), Positives = 138/216 (63%), Gaps = 13/216 (6%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
M + L SQV+ R H+ ++ E HE WMA++G+ YKD EKE R +IFK+N+E+
Sbjct: 10 MLALFLFLAVGISQVMP-RKLHQTALRERHENWMAEYGKMYKDAAEKEKRFQIFKDNVEF 68
Query: 78 IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTS-STFKYQNLSMTDV 136
IE N GN+ YKLG N +DLT +EF+ G K +T + FKY+N+ TD+
Sbjct: 69 IESFNAAGNKPYKLGVNHLADLTLEEFKDSRNGLKRTYEFSTTTFKLNGFKYENV--TDI 126
Query: 137 PTSLDWRDKGAVTPIKNQ-KECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG 195
P ++DWR KGAVTPIK+Q +CG CWAF+ +AA EGI +I +GNL+ LSEQ+L+DC +
Sbjct: 127 PEAIDWRVKGAVTPIKDQGDQCGSCWAFSTIAATEGIHQISTGNLVSLSEQELVDCDSV- 185
Query: 196 NNGCLGGSREKAFAYIIQNQGI-------FNGVCGT 224
++GC GG E F +II+N GI + GV GT
Sbjct: 186 DDGCEGGFMEDGFEFIIKNGGITSETNYPYKGVDGT 221
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 39/76 (51%), Positives = 53/76 (69%), Gaps = 5/76 (6%)
Query: 210 YIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR---- 265
++ + GI+NG CGT LDH VT VG+GT E+G +YW++KNSWG WG+ GY+++ R
Sbjct: 268 FMFYSSGIYNGECGTDLDHGVTAVGYGT-ENGTDYWIVKNSWGTQWGEKGYIRMHRGIAA 326
Query: 266 DEGLCGIGTRSSYPLA 281
G+CGI SSYP A
Sbjct: 327 KHGICGIALDSSYPTA 342
>gi|302816909|ref|XP_002990132.1| hypothetical protein SELMODRAFT_428615 [Selaginella moellendorffii]
gi|300142145|gb|EFJ08849.1| hypothetical protein SELMODRAFT_428615 [Selaginella moellendorffii]
Length = 358
Score = 189 bits (481), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 96/202 (47%), Positives = 130/202 (64%), Gaps = 17/202 (8%)
Query: 46 IHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFR 105
++EKWM HGR Y EKE R +IF++N EYIE+ N++ N+TY LG N F+D+T+DEF+
Sbjct: 33 LYEKWMVDHGRVYNGIGEKERRFQIFRDNAEYIEEHNRQVNQTYWLGLNNFADMTHDEFK 92
Query: 106 ALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAA 165
ALY G K+P + T S F+Y++ T++P DWR KGAV +KNQ CG CWAF+
Sbjct: 93 ALYFGTKVPLSN---TIKSGFRYED--ATNLPLDTDWRSKGAVATVKNQGACGSCWAFST 147
Query: 166 VAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI-------F 218
VAAVEG+ +I +G L+ LSEQ+L+DC N GC GG + AF +IIQN G+ +
Sbjct: 148 VAAVEGVNQIVTGELVSLSEQELVDCDKQKNQGCNGGLMDSAFEFIIQNGGLDSEADYPY 207
Query: 219 NGVCGT-----QLDHAVTIVGF 235
V G+ + H VTI GF
Sbjct: 208 KAVSGSCDESRRNSHVVTIDGF 229
Score = 75.5 bits (184), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 33/73 (45%), Positives = 50/73 (68%), Gaps = 8/73 (10%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGT--TEDGA--NYWLIKNSWGNTWGDAGYMKIVRD----E 267
G++ G CG +LDH V VG+GT T DG +YW+++NSWG+ WG++GY+++ R+
Sbjct: 266 GVYTGHCGYELDHGVVAVGYGTSKTPDGVATDYWIVRNSWGDAWGESGYIRLQRNVASSR 325
Query: 268 GLCGIGTRSSYPL 280
G CGI +SYP+
Sbjct: 326 GKCGIAMMASYPV 338
>gi|357452869|ref|XP_003596711.1| Cysteine proteinase [Medicago truncatula]
gi|355485759|gb|AES66962.1| Cysteine proteinase [Medicago truncatula]
Length = 344
Score = 189 bits (480), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 102/209 (48%), Positives = 134/209 (64%), Gaps = 16/209 (7%)
Query: 14 NTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKE 73
N +F I+TL S V+SSR ++E HE+WM +HG+ YKD EKE R +IFKE
Sbjct: 11 NILTLFFILTLWTSL---VISSR------LLEKHEQWMEEHGKFYKDAAEKEQRFQIFKE 61
Query: 74 NLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALY-TGYKMPSPS---HRSTTSSTFKYQ 129
NLE+IE N G+ + L NQF D TNDEF+A Y G K P S F+Y+
Sbjct: 62 NLEFIESFNAAGDNGFNLSINQFGDQTNDEFKANYLNGKKKPLIGVGIAAIEEESVFRYE 121
Query: 130 NLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLL 189
N+ T+VP ++DWR++GAVTPIK+Q CG CWAFA VAA+EGI +I +G L+ LSEQ+L+
Sbjct: 122 NV--TEVPATMDWRERGAVTPIKHQHLCGSCWAFATVAAIEGIHQITTGRLVSLSEQELV 179
Query: 190 DC-STNGNNGCLGGSREKAFAYIIQNQGI 217
DC TN +GC GG E A +I++ GI
Sbjct: 180 DCVKTNTTDGCNGGYVEDACDFIVKKGGI 208
Score = 97.4 bits (241), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 50/120 (41%), Positives = 68/120 (56%), Gaps = 11/120 (9%)
Query: 166 VAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIFNGVCGTQ 225
VA ++G + + N E+ LL N + ++AF + + GI G CG
Sbjct: 232 VAKIKGYEHVPANN-----EKALLKAVANQPIAVYIAATKRAFQFY--SSGILKGKCGID 284
Query: 226 LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 281
LDH VTIVG+GT++DG YWL+KNSWG WG+ GY+KI RD EG CGI +YP+
Sbjct: 285 LDHTVTIVGYGTSDDGVKYWLVKNSWGTKWGEKGYIKIKRDVHAKEGSCGIAMVPTYPIV 344
>gi|404312774|pdb|3TNX|A Chain A, Structure Of The Precursor Of A Thermostable Variant Of
Papain At 2.6 Angstroem Resolution
gi|404312775|pdb|3TNX|C Chain C, Structure Of The Precursor Of A Thermostable Variant Of
Papain At 2.6 Angstroem Resolution
gi|428698029|pdb|3USV|A Chain A, Structure Of The Precursor Of A Thermostable Variant Of
Papain At 3.8 A Resolution From A Crystal Soaked At Ph 4
gi|428698030|pdb|3USV|C Chain C, Structure Of The Precursor Of A Thermostable Variant Of
Papain At 3.8 A Resolution From A Crystal Soaked At Ph 4
Length = 363
Score = 189 bits (479), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 118/314 (37%), Positives = 163/314 (51%), Gaps = 80/314 (25%)
Query: 38 THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
T + ++++ E WM +H + YK+ EK R +IFK+NL+YI++ NK+ N +Y LG N F+
Sbjct: 57 TSTERLIQLFESWMLKHNKIYKNIDEKIYRFEIFKDNLKYIDETNKK-NNSYWLGLNVFA 115
Query: 98 DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKEC 157
D++NDEF+ YTG + ++ +T S + N ++P +DWR KGAVTP+KNQ C
Sbjct: 116 DMSNDEFKEKYTG--SIAGNYTTTELSYEEVLNDGDVNIPEYVDWRQKGAVTPVKNQGSC 173
Query: 158 GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGG--------------- 202
G WAF+AV+ +E I KIR+GNL + SEQ+LLDC + GC GG
Sbjct: 174 GSAWAFSAVSTIESIIKIRTGNLNEYSEQELLDCDRR-SYGCNGGYPWSALQLVAQYGIH 232
Query: 203 ---------------SREK-------------------AFAYIIQNQ------------- 215
SREK A Y I NQ
Sbjct: 233 YRNTYPYEGVQRYCRSREKGPYAAKTDGVRQVQPYNEGALLYSIANQPVSVVLEAAGKDF 292
Query: 216 -----GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR----D 266
GIF G CG ++DHAV VG+ G NY LI+NSWG WG+ GY++I R
Sbjct: 293 QLYRGGIFVGPCGNKVDHAVAAVGY-----GPNYILIRNSWGTGWGENGYIRIKRGTGNS 347
Query: 267 EGLCGIGTRSSYPL 280
G+CG+ T S YP+
Sbjct: 348 YGVCGLYTSSFYPV 361
>gi|226504984|ref|NP_001151293.1| cysteine protease 1 precursor [Zea mays]
gi|195645596|gb|ACG42266.1| cysteine protease 1 precursor [Zea mays]
Length = 340
Score = 189 bits (479), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 91/204 (44%), Positives = 132/204 (64%), Gaps = 7/204 (3%)
Query: 19 FIIITLLVS----CASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKEN 74
F++ L+V C + + + ++ HEKWMA+HGR+YKDE EK RL++F+ N
Sbjct: 6 FLLAVLVVGSAVLCTAAAPRALAAAAAAMASRHEKWMAEHGRAYKDEAEKARRLEVFRAN 65
Query: 75 LEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT 134
E I+ N G +++L TN+F+DLT EFRA TG + P P+ S + F+Y+N S+
Sbjct: 66 AELIDSFNAAGTHSHRLATNRFADLTVQEFRAARTGLR-PRPAP-SAGAGRFRYENFSLA 123
Query: 135 DVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN 194
D S+DWR GAVT +K+Q GCCWAF+AVAAVEG+ KIR+G L+ LSEQ+L+DC +
Sbjct: 124 DAAQSVDWRAMGAVTGVKDQGASGCCWAFSAVAAVEGLNKIRTGRLVSLSEQELVDCDVS 183
Query: 195 G-NNGCLGGSREKAFAYIIQNQGI 217
G + GC GG + AF ++ + G+
Sbjct: 184 GVDQGCDGGLMDNAFQFVARRGGL 207
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 38/76 (50%), Positives = 52/76 (68%), Gaps = 3/76 (3%)
Query: 208 FAYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKI---V 264
A+ + G+ G CGT L+HA+T VG+GT DG YWL+KNSWG +WG+ GY++I V
Sbjct: 265 MAFRFYDSGVLGGACGTDLNHAITAVGYGTAADGTRYWLMKNSWGASWGEGGYVRIRRGV 324
Query: 265 RDEGLCGIGTRSSYPL 280
R EG+CG+ SYP+
Sbjct: 325 RGEGVCGLAKLPSYPV 340
>gi|224083362|ref|XP_002306996.1| predicted protein [Populus trichocarpa]
gi|222856445|gb|EEE93992.1| predicted protein [Populus trichocarpa]
Length = 336
Score = 189 bits (479), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 116/327 (35%), Positives = 177/327 (54%), Gaps = 81/327 (24%)
Query: 25 LVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE 84
+V A + ++SR ++++ E W+++H + Y+ EK R +IFK+NL +I++ NK+
Sbjct: 15 IVGYAPEDLTSR----DRIIDLFESWISKHQKIYESIEEKWHRFEIFKDNLFHIDETNKK 70
Query: 85 GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRD 144
Y LG N+F+DL+++EF+ Y G + S+R S F Y+++S +P S+DWR
Sbjct: 71 -VVNYWLGLNEFADLSHEEFKNKYLGLNV-DLSNRRECSEEFTYKDVS--SIPKSVDWRK 126
Query: 145 KGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST---NGNNGCL- 200
KGAVT +KNQ CG CWAF+ VAAVEGI +I +GNL LSEQ+L+DC T NG NG L
Sbjct: 127 KGAVTDVKNQGSCGSCWAFSTVAAVEGINQIVTGNLTSLSEQELVDCDTTYNNGCNGGLM 186
Query: 201 -----------GGSREKAFAYIIQ-----------------------------------N 214
G +E+ + YI++ N
Sbjct: 187 DYAFAYIISNGGLHKEEDYPYIMEEGTCEMRKAESEVVTISGYHDVPQNSEESLLKALAN 246
Query: 215 Q------------------GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWG 256
Q G+F+G CGT+LDH V VG+G+ + G ++ ++KNSWG+ WG
Sbjct: 247 QPLSVAIDASGRDFQFYSGGVFDGHCGTELDHGVAAVGYGSAK-GLDFIVVKNSWGSKWG 305
Query: 257 DAGYMKIVRD----EGLCGIGTRSSYP 279
+ G++++ R+ GLCGI +SYP
Sbjct: 306 EKGFIRMKRNTGKPAGLCGINKMASYP 332
>gi|302143416|emb|CBI21977.3| unnamed protein product [Vitis vinifera]
Length = 297
Score = 188 bits (478), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 92/162 (56%), Positives = 122/162 (75%), Gaps = 7/162 (4%)
Query: 51 MAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTG 110
MA++GR YKD EKE R KIFK+N+ IE NK ++TYKL N+F+DLTN+EFR+L
Sbjct: 1 MARYGRMYKDANEKEKRFKIFKDNVARIESFNKAMDKTYKLSINEFADLTNEEFRSLRNR 60
Query: 111 YKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVE 170
+K +H + ++TFKY+N+ T VP+++DWR KGAVTPIK+Q++CGCCWAF+AVAA E
Sbjct: 61 FK----AHICSEATTFKYENV--TAVPSTIDWRKKGAVTPIKDQQQCGCCWAFSAVAATE 114
Query: 171 GITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFAYI 211
GIT+I +G LI LSEQ+L+DC T G N GC GG + AF +I
Sbjct: 115 GITQITTGKLISLSEQELVDCDTGGENQGCSGGLMDDAFRFI 156
Score = 94.0 bits (232), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 40/78 (51%), Positives = 52/78 (66%), Gaps = 4/78 (5%)
Query: 208 FAYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD- 266
F + G+F G CGT+LDH V VG+G +DG YWL+KNSWG WG+ GY+++ RD
Sbjct: 220 FEFQFYTSGVFTGQCGTELDHGVAAVGYGIGDDGMMYWLVKNSWGTGWGEEGYIRMQRDV 279
Query: 267 ---EGLCGIGTRSSYPLA 281
EGLCGI ++SYP A
Sbjct: 280 TAKEGLCGIAMQASYPTA 297
>gi|357458909|ref|XP_003599735.1| Cysteine proteinase [Medicago truncatula]
gi|357474677|ref|XP_003607623.1| Cysteine proteinase [Medicago truncatula]
gi|355488783|gb|AES69986.1| Cysteine proteinase [Medicago truncatula]
gi|355508678|gb|AES89820.1| Cysteine proteinase [Medicago truncatula]
Length = 342
Score = 188 bits (478), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 96/202 (47%), Positives = 130/202 (64%), Gaps = 6/202 (2%)
Query: 17 PMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLE 76
PMF+I T + V+SSR E + HEKWM Q G+SYKD EKE R +IFK N+E
Sbjct: 10 PMFLIFTTWM--LPYVMSSR-VLEPYLSNKHEKWMTQFGKSYKDAAEKEKRFQIFKNNVE 66
Query: 77 YIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTG-YKMPSPSHRSTTSSTFKYQNLSMTD 135
+IE N GN+ + L N F+DLTN+EF+A G K+ +++F+Y N+ T
Sbjct: 67 FIELFNAVGNKPFNLSINHFADLTNEEFKASLNGNKKLHDKFDILNETTSFRYHNV--TS 124
Query: 136 VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG 195
VP S+DWR +GAVTPIKNQ CG CWAF+ VA++EGI +I +G L+ LSEQ+L+DC
Sbjct: 125 VPASMDWRKRGAVTPIKNQGSCGSCWAFSTVASIEGIHQITTGELVSLSEQELIDCVRGN 184
Query: 196 NNGCLGGSREKAFAYIIQNQGI 217
++GC GG E AF +I + G+
Sbjct: 185 SSGCSGGYLEDAFKFIAKKGGM 206
Score = 95.5 bits (236), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 52/120 (43%), Positives = 67/120 (55%), Gaps = 11/120 (9%)
Query: 166 VAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIFNGVCGTQ 225
VA ++G K+ S SE LL N + + F + + GIF G CGT
Sbjct: 230 VAEIKGYEKVPSN-----SENDLLKAVANQPVSVYVDAGDYVFQFY--SGGIFTGKCGTD 282
Query: 226 LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 281
DH VTIVG+G + D YWL+KNSWG WG+ GYMK+ R+ +GLCGI T SYP+A
Sbjct: 283 TDHVVTIVGYGVSLDYTEYWLVKNSWGTGWGEKGYMKLKRNVDSKKGLCGIATNPSYPVA 342
>gi|356517310|ref|XP_003527331.1| PREDICTED: vignain-like [Glycine max]
Length = 342
Score = 188 bits (477), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 88/199 (44%), Positives = 132/199 (66%), Gaps = 2/199 (1%)
Query: 19 FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
++I+ L+++ + V SR E E HEKWMAQ+GR YKD EKE R ++FK N+ +I
Sbjct: 9 YLILFLVLAVWTSHVMSRRLSEACTSERHEKWMAQYGRVYKDAAEKEKRFQVFKNNVHFI 68
Query: 79 EKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPT 138
E N G++ + L NQF+DL ++EF+AL + + ++T ++F+Y+ S+T +P
Sbjct: 69 ESFNAAGDKPFNLSINQFADLNDEEFKALLINVQKKASWVETSTETSFRYE--SVTKIPA 126
Query: 139 SLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNG 198
++D R +GAVTPIK+Q CG CWAF+AVAA EGI +I +G L+ LSEQ+L+DC + G
Sbjct: 127 TIDRRKRGAVTPIKDQGRCGSCWAFSAVAATEGIHQITTGKLVPLSEQELVDCVKGESEG 186
Query: 199 CLGGSREKAFAYIIQNQGI 217
C+GG + AF +I + GI
Sbjct: 187 CIGGYVDDAFEFIAKKGGI 205
Score = 90.9 bits (224), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 51/121 (42%), Positives = 66/121 (54%), Gaps = 12/121 (9%)
Query: 166 VAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIFNGV-CGT 224
VA ++G K+ S N E+ LL N + AF Y + GIFN CGT
Sbjct: 229 VAEIKGYEKVPSNN-----EKALLKAVANQPVSVYIDAGTHAFKYY--SSGIFNARNCGT 281
Query: 225 QLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 280
+HAV +VG+G D + YWL+KNSWG WG+ GY++I RD EGLCGI YP+
Sbjct: 282 DPNHAVAVVGYGKALDDSKYWLVKNSWGTEWGERGYIRIKRDIRAKEGLCGIAKYPYYPI 341
Query: 281 A 281
A
Sbjct: 342 A 342
>gi|297740489|emb|CBI30671.3| unnamed protein product [Vitis vinifera]
Length = 320
Score = 187 bits (476), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 101/202 (50%), Positives = 134/202 (66%), Gaps = 27/202 (13%)
Query: 19 FIIITLLVSC--ASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLE 76
I ITLL+ ASQ +S R+ HE S+ E HE WM +GR+YKD EKE R KIFKEN+E
Sbjct: 7 IICITLLIMGVWASQALS-RTLHEVSMSERHEDWMGLYGRTYKDIAEKERRFKIFKENVE 65
Query: 77 YIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDV 136
YIE NK F+A GY M S RS+ ++F+Y+N++ V
Sbjct: 66 YIESVNK--------------------FKASRNGYNMSS-RPRSSEITSFRYENVAA--V 102
Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG- 195
P+S+DWR KGAVTPIK+Q +CGCCWAF+AVAA+EG+T++++G LI LSEQ+L+DC T+G
Sbjct: 103 PSSMDWRKKGAVTPIKDQGQCGCCWAFSAVAAMEGVTQLKTGELISLSEQELVDCDTSGE 162
Query: 196 NNGCLGGSREKAFAYIIQNQGI 217
+ GC GG + AF +II N G+
Sbjct: 163 DQGCGGGLMDSAFEFIIGNGGL 184
Score = 94.0 bits (232), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 41/72 (56%), Positives = 51/72 (70%), Gaps = 4/72 (5%)
Query: 214 NQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR----DEGL 269
+ G+F G CGT+LDH VT VG+G T+DG YWL+KNSWG WG+ GY+ + R DEGL
Sbjct: 249 SSGVFTGQCGTELDHGVTAVGYGKTDDGTKYWLVKNSWGTGWGEDGYIWMERDIGADEGL 308
Query: 270 CGIGTRSSYPLA 281
CGI +SYP A
Sbjct: 309 CGIAMEASYPTA 320
>gi|310656788|gb|ADP02217.1| Peptidase_C1 domain-containing protein [Triticum aestivum]
Length = 294
Score = 187 bits (475), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 104/291 (35%), Positives = 153/291 (52%), Gaps = 35/291 (12%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
+ I+ + C+S V+S+R + ++VE HE+WM + R YKD EK ++FK N+ +
Sbjct: 8 LLAILGCICLCSSTVMSARELADAAMVERHEQWMVKFNRVYKDNAEKVRWFEVFKANVAF 67
Query: 78 IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
IE N N + LG NQF+DLTNDEF+A T + S R+ T FKY N+S +P
Sbjct: 68 IESFNAR-NHKFWLGVNQFTDLTNDEFKATKTNKGLKRTSSRAPTR--FKYNNVSTDALP 124
Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNN 197
T++DWR KGA+TPIK+Q +C AF + + +T N ++ S NN
Sbjct: 125 TAVDWRTKGAITPIKDQGQCD-GQAFKFIIKIGSLTS--EANYPYTAQDGQCKTSIASNN 181
Query: 198 GC-------LGGSREKAFAYIIQNQ------------------GIFNGVCGTQLDHAVTI 232
+ + E + + NQ G G CGT LDH +
Sbjct: 182 VATIKGYEDVPANDESSLMKAVANQPVSVAVDGGDAIFQHYSGGAMTGSCGTDLDHGIAA 241
Query: 233 VGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 279
+G+G T DG YWL+KNSWG TWG++GY+++ +D G+CG+ + SYP
Sbjct: 242 IGYGMTSDGTKYWLLKNSWGTTWGESGYLRMEKDISDKSGMCGLAMQPSYP 292
>gi|242055753|ref|XP_002457022.1| hypothetical protein SORBIDRAFT_03g047290 [Sorghum bicolor]
gi|241928997|gb|EES02142.1| hypothetical protein SORBIDRAFT_03g047290 [Sorghum bicolor]
Length = 378
Score = 187 bits (475), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 117/339 (34%), Positives = 175/339 (51%), Gaps = 98/339 (28%)
Query: 37 STHEQSVVEIHEKWMAQHGRSYKDELEKEMR-LKIFKENLEYIEKANKEGNRTYKLGTNQ 95
S+HE S+ E+ E+W+++H + LE+++R ++FK+NL +I++ N++ + +Y LG N+
Sbjct: 39 SSHE-SLAELFERWLSRHRKGAYASLEEKLRRFEVFKDNLHHIDETNRKVS-SYWLGLNE 96
Query: 96 FSDLTNDEFRALYTGYKMPSPS------HRSTTSST-------------FKYQNLSMTDV 136
F+DLT+DEF+A Y G H F+Y+ + +
Sbjct: 97 FADLTHDEFKATYLGLSPSGGGGDVVHMHHDDDDEEPEEEGSSSSSSFRFRYEGVDAARL 156
Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGN 196
P S+DWR KGAVT +KNQ +CG CWAF+ VAAVEGI +I +GNL LSEQ+L+DC T+GN
Sbjct: 157 PKSVDWRSKGAVTGVKNQGQCGSCWAFSTVAAVEGINQIVTGNLTALSEQELVDCDTDGN 216
Query: 197 ---NGCL------------GGSREKAFAYIIQ---------------------------- 213
NG L G E+A+ Y+++
Sbjct: 217 NGCNGGLMDYAFSYIAHNGGLHTEEAYPYLMEEGTCSRGSSAAVVTISGYEDVPRNNEQA 276
Query: 214 ------------------------NQGIFNGVCGTQLDHAVTIVGFGTT--EDG---ANY 244
+ G+F+G CGTQLDH V VG+GT ++G A+Y
Sbjct: 277 LLKALAHQPVSVAIEASGRNLQFYSGGVFDGPCGTQLDHGVAAVGYGTAGKDNGHVVADY 336
Query: 245 WLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 279
++KNSWG +WG+ GY+++ R +GLCGI SYP
Sbjct: 337 IIVKNSWGPSWGEKGYIRMRRGTGKRQGLCGINKMPSYP 375
>gi|42573181|ref|NP_974687.1| Xylem cysteine proteinase 1 [Arabidopsis thaliana]
gi|332661102|gb|AEE86502.1| Xylem cysteine proteinase 1 [Arabidopsis thaliana]
Length = 288
Score = 187 bits (475), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 95/223 (42%), Positives = 140/223 (62%), Gaps = 15/223 (6%)
Query: 38 THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
T+ ++E+ E WM++H ++YK EK R ++F+ENL +I++ N E N +Y LG N+F+
Sbjct: 42 TNTDKLLELFESWMSEHSKAYKSVEEKVHRFEVFRENLMHIDQRNNEIN-SYWLGLNEFA 100
Query: 98 DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKEC 157
DLT++EF+ Y G P S + S+ F+Y+++ TD+P S+DWR KGAV P+K+Q +C
Sbjct: 101 DLTHEEFKGRYLGLAKPQFSRKRQPSANFRYRDI--TDLPKSVDWRKKGAVAPVKDQGQC 158
Query: 158 GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
G CWAF+ VAAVEGI +I +GNL LSEQ+L+DC T N+GC GG + AF YII G+
Sbjct: 159 GSCWAFSTVAAVEGINQITTGNLSSLSEQELIDCDTTFNSGCNGGLMDYAFQYIISTGGL 218
Query: 218 F----------NGVCGTQLDHA--VTIVGFGTTEDGANYWLIK 248
G+C Q + VTI G+ + + L+K
Sbjct: 219 HKEDDYPYLMEEGICQEQKEDVERVTISGYEDVPENDDESLVK 261
>gi|30141027|dbj|BAC75927.1| cysteine protease-5 [Helianthus annuus]
Length = 365
Score = 186 bits (473), Expect = 8e-45, Method: Compositional matrix adjust.
Identities = 88/190 (46%), Positives = 128/190 (67%), Gaps = 7/190 (3%)
Query: 33 VSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLG 92
V+S + ++ V +E W+A+HG++Y EKE R +IF +NL++I++ N GNR+YK+G
Sbjct: 22 VTSNTRTDEEVRNTYELWLARHGKTYNALGEKESRFRIFADNLKFIDEHNLSGNRSYKVG 81
Query: 93 TNQFSDLTNDEFRALYTG-----YKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGA 147
NQF+DLTN+E+R++Y G Y+ + R S + Q M P +DWR++GA
Sbjct: 82 LNQFADLTNEEYRSMYLGTKVDPYRRIAKMQRGEISRRYAVQENEM--FPAKVDWRERGA 139
Query: 148 VTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKA 207
V+P+KNQ CG CWAF+ VA+VEGI KI +G+LI LSEQ+L+DC N+GC GGS + A
Sbjct: 140 VSPVKNQGGCGSCWAFSTVASVEGINKIVTGDLISLSEQELVDCDNKYNSGCNGGSMDYA 199
Query: 208 FAYIIQNQGI 217
F +I+ N GI
Sbjct: 200 FQFIVSNGGI 209
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 33/77 (42%), Positives = 52/77 (67%), Gaps = 6/77 (7%)
Query: 209 AYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE- 267
A+ + G+ G CGT LDH V +VG+G+ E+G +YW+++NSWG WG+ GY+++ R+
Sbjct: 269 AFQLYTSGVLTGSCGTNLDHGVVVVGYGS-ENGKDYWIVRNSWGPEWGEDGYIRMERNMV 327
Query: 268 ----GLCGIGTRSSYPL 280
G+CGI +SYP+
Sbjct: 328 DTPVGMCGITLMASYPI 344
>gi|255635584|gb|ACU18142.1| unknown [Glycine max]
Length = 345
Score = 186 bits (472), Expect = 9e-45, Method: Compositional matrix adjust.
Identities = 111/309 (35%), Positives = 166/309 (53%), Gaps = 77/309 (24%)
Query: 43 VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTND 102
++E+ E WM++HG+ Y+ EK +R +IFK+NL++I++ NK + Y LG N+F+DL++
Sbjct: 43 LIELFESWMSKHGKIYQSIEEKLLRFEIFKDNLKHIDERNKVVS-NYWLGLNEFADLSHQ 101
Query: 103 EFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWA 162
EF+ Y G K+ S R + F Y+++ ++P S+DWR KGAV P+KNQ CG CWA
Sbjct: 102 EFKNKYLGLKVDY-SRRRESPEEFTYKDV---ELPKSVDWRKKGAVAPVKNQGSCGSCWA 157
Query: 163 FAAVAAVEGITKIRSGNLIQLSEQQLLDCS---TNGNNGCL------------GGSREKA 207
F+ VAAVEGI +I +GNL LSEQ+L+DC +NG NG L G +E+
Sbjct: 158 FSTVAAVEGINQIVTGNLTSLSEQELIDCDRTYSNGCNGGLMDYAFSFIVENGGLHKEED 217
Query: 208 FAYIIQ-----------------------------------NQ----------------- 215
+ YI++ NQ
Sbjct: 218 YPYIMEEGTCEMTKEETEVVTISGYHDVPQNNEQSLLKALANQSLSVAIEASGRDFQFYS 277
Query: 216 -GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKI---VRDEGLCG 271
G+F+G CG+ LDH V VG+GT + G +Y ++KNSWG+ WG+ GY+++ + G
Sbjct: 278 GGVFDGHCGSDLDHGVAAVGYGTAK-GVDYIIVKNSWGSKWGEKGYIRMRGTLETRGNLR 336
Query: 272 IGTRSSYPL 280
+SYPL
Sbjct: 337 YLQMASYPL 345
>gi|356564154|ref|XP_003550321.1| PREDICTED: cysteine proteinase RD21a-like [Glycine max]
Length = 476
Score = 186 bits (472), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 95/228 (41%), Positives = 137/228 (60%), Gaps = 20/228 (8%)
Query: 9 GSFKINTTP----------MFIIITLLVSCASQVVSSRSTH---------EQSVVEIHEK 49
GS I T+P +F + + + ++S S H E+ ++ ++E+
Sbjct: 2 GSSSITTSPATMTMAAIVLLFTVFAVSSALDMSIISYDSAHADKAATLRTEEELMSMYEQ 61
Query: 50 WMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYT 109
W+ +HG+ Y EKE R +IFK+NL +I+ N +RTYKLG N+F+DLTN+E+RA Y
Sbjct: 62 WLVKHGKVYNALGEKEKRFQIFKDNLRFIDDHNSAEDRTYKLGLNRFADLTNEEYRAKYL 121
Query: 110 GYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAV 169
G K+ P+ R + + +Y +P S+DWR +GAV P+K+Q CG CWAF+A+ AV
Sbjct: 122 GTKI-DPNRRLGKTPSNRYAPRVGDKLPDSVDWRKEGAVPPVKDQGGCGSCWAFSAIGAV 180
Query: 170 EGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
EGI KI +G LI LSEQ+L+DC T N GC GG + AF +II N GI
Sbjct: 181 EGINKIVTGELISLSEQELVDCDTGYNQGCNGGLMDYAFEFIINNGGI 228
Score = 80.9 bits (198), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 38/85 (44%), Positives = 54/85 (63%), Gaps = 10/85 (11%)
Query: 201 GGSREKAFAYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGY 260
GG RE + + G+F G CGT LDH V VG+GT + G +YW+++NSWG++WG+ GY
Sbjct: 284 GGGRE----FQLYVSGVFTGRCGTALDHGVVAVGYGTAK-GHDYWIVRNSWGSSWGEDGY 338
Query: 261 MKIVRD-----EGLCGIGTRSSYPL 280
+++ R+ G CGI SYPL
Sbjct: 339 IRLERNLANSRSGKCGIAIEPSYPL 363
>gi|359483514|ref|XP_003632971.1| PREDICTED: LOW QUALITY PROTEIN: oryzain beta chain-like [Vitis
vinifera]
Length = 340
Score = 186 bits (472), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 90/204 (44%), Positives = 138/204 (67%), Gaps = 6/204 (2%)
Query: 18 MFIIITLLVSCASQVVS---SRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKEN 74
+F+ +TL + S SR HE S+ E HE+WMA++ R+YKD+ E+E R +FK+N
Sbjct: 3 LFVCMTLHIYYLEHRASEATSRPLHEASMYERHEQWMARYSRNYKDDAEEERRFXMFKDN 62
Query: 75 LEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT 134
+++I+ + GN KLG N +D+T++EFRA +K+P + +++F++QN+ T
Sbjct: 63 VDFIQTFDTAGNMPNKLGVNALADMTHEEFRASGNTFKIPPNLGLRSETTSFRHQNV--T 120
Query: 135 DVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN 194
+P+++DWR K VT IKNQ +CG CWAF+AVAA+EGI K+++ I LSEQ+L+DC
Sbjct: 121 RIPSTMDWRKKRTVTHIKNQLQCGGCWAFSAVAAMEGIAKLQTSKSISLSEQELVDCDIF 180
Query: 195 GNN-GCLGGSREKAFAYIIQNQGI 217
G+N GC GG + AF +IIQN+G+
Sbjct: 181 GSNIGCEGGCMDDAFKFIIQNRGL 204
Score = 68.6 bits (166), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 38/107 (35%), Positives = 54/107 (50%), Gaps = 6/107 (5%)
Query: 179 NLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIFNGVCGTQLDHAVTIVGFGTT 238
N+ + SE+ LL + + AF + GI G LD+ VT G+G +
Sbjct: 236 NMPEFSEKALLKVVAHQPISVAIDAGGSAFQFY--EIGIITXESGNDLDYGVTTDGYGRS 293
Query: 239 EDGANYWLIKNSWGNTWGDAGYMKIVR----DEGLCGIGTRSSYPLA 281
DG +WL+KNSWG WG+ GY ++ R GLCG ++SYP A
Sbjct: 294 ADGKKHWLVKNSWGTDWGENGYTRMERGVKATTGLCGFTMQASYPTA 340
>gi|146216004|gb|ABQ10204.1| cysteine protease Cp6 [Actinidia deliciosa]
Length = 461
Score = 186 bits (472), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 90/187 (48%), Positives = 132/187 (70%), Gaps = 3/187 (1%)
Query: 31 QVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYK 90
++ SSR+ E V+ ++E W+ +HG+SY EKE R +IFK+NL +I++ N E +RTYK
Sbjct: 32 ELSSSRTDDE--VMAMYESWLVKHGKSYNAIGEKEKRFQIFKDNLRFIDEHNAE-SRTYK 88
Query: 91 LGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTP 150
+G N+F+DLTNDE+R++Y G + S ST + +Y ++ +P S+DWR+KGAV
Sbjct: 89 VGLNRFADLTNDEYRSMYLGARTGSRRRLSTQKRSDRYVPVAGESLPDSVDWREKGAVVG 148
Query: 151 IKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAY 210
+K+Q CG CWAF+ +AAVEGI +I +G+LI LSEQ+L+DC T+ N GC GG + AF +
Sbjct: 149 VKDQGSCGSCWAFSTIAAVEGINQIVTGDLISLSEQELVDCDTSYNEGCNGGLMDYAFEF 208
Query: 211 IIQNQGI 217
II+N GI
Sbjct: 209 IIKNGGI 215
Score = 84.0 bits (206), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 36/76 (47%), Positives = 52/76 (68%), Gaps = 4/76 (5%)
Query: 208 FAYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE 267
A+ G+F G CGT LDH VT VG+GT E+ +YW++KNSWG++WG++GY+++ R+
Sbjct: 274 MAFQFYESGVFTGNCGTALDHGVTAVGYGT-ENSVDYWIVKNSWGSSWGESGYIRMERNT 332
Query: 268 GL---CGIGTRSSYPL 280
G CGI SYP+
Sbjct: 333 GATGKCGIAVEPSYPI 348
>gi|1208549|gb|AAC49455.1| Pseudotzain [Pseudotsuga menziesii]
Length = 454
Score = 186 bits (471), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 86/178 (48%), Positives = 125/178 (70%), Gaps = 1/178 (0%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
+ +++E++E W+AQH ++Y EK+ + +FK+N YI + N +GN +YKLG NQF+DL
Sbjct: 37 DDAIMELYELWLAQHKKAYNGLDEKQKKFSVFKDNFLYIHQHNNQGNPSYKLGLNQFADL 96
Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGC 159
+++EF+A Y G K+ + R + S + +YQ D+P S+DWR+KGAVT +KNQ CG
Sbjct: 97 SHEEFKAAYLGTKLDA-KKRLSRSPSPRYQYSVGEDLPESIDWREKGAVTAVKNQGSCGS 155
Query: 160 CWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
CWAF+ VAAVEGI +I +GNL LSEQ+L+DC T+ N GC GG + AF +II N G+
Sbjct: 156 CWAFSTVAAVEGINQIVTGNLTSLSEQELVDCDTSYNQGCNGGLMDYAFQFIISNGGL 213
Score = 87.8 bits (216), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 38/77 (49%), Positives = 54/77 (70%), Gaps = 6/77 (7%)
Query: 209 AYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-- 266
A+ G+F CGTQLDH VT+VG+G+ E G +YWL+KNSWGN+WG+ G++K+ R+
Sbjct: 273 AFQFYESGVFTSNCGTQLDHGVTLVGYGS-ESGIDYWLVKNSWGNSWGEKGFIKLQRNLE 331
Query: 267 ---EGLCGIGTRSSYPL 280
G+CGI +SYP+
Sbjct: 332 GASTGMCGIAMEASYPV 348
>gi|116787404|gb|ABK24495.1| unknown [Picea sitchensis]
gi|224286306|gb|ACN40861.1| unknown [Picea sitchensis]
Length = 452
Score = 186 bits (471), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 96/214 (44%), Positives = 140/214 (65%), Gaps = 15/214 (7%)
Query: 32 VVSSRSTHEQ-SVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYK 90
++SS+ E +++E++E W+A+H R+Y EK+ R +FK+N YI + N +GNR+YK
Sbjct: 26 IISSKDLREDDAIMELYELWLAEHKRAYNGLDEKQKRFSVFKDNFLYIHEHN-QGNRSYK 84
Query: 91 LGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTP 150
LG NQF+DL+++EF+A Y G K+ + S S +YQ D+P S+DWR+KGAVT
Sbjct: 85 LGLNQFADLSHEEFKATYLGAKLDTKKRLSRPPSR-RYQYSDGEDLPESIDWREKGAVTS 143
Query: 151 IKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAY 210
+K+Q CG CWAF+ VAAVEGI +I +G+LI LSEQ+L+DC T+ N GC GG + AF +
Sbjct: 144 VKDQGSCGSCWAFSTVAAVEGINQIVTGDLISLSEQELVDCDTSYNQGCNGGLMDYAFEF 203
Query: 211 IIQNQGI----------FNGVCGT--QLDHAVTI 232
II N G+ ++G C + + H VTI
Sbjct: 204 IINNGGLDSEEDYPYTAYDGSCDSYRKNAHVVTI 237
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 34/72 (47%), Positives = 51/72 (70%), Gaps = 6/72 (8%)
Query: 214 NQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-----EG 268
+ G+F CGTQLDH VT+VG+G+ E G +YW +KNSWG +WG+ G++++ R+ G
Sbjct: 275 DSGVFTSTCGTQLDHGVTLVGYGS-ESGTDYWTVKNSWGKSWGEEGFIRLQRNIEVASTG 333
Query: 269 LCGIGTRSSYPL 280
+CGI +SYP+
Sbjct: 334 MCGIAMEASYPV 345
>gi|116786779|gb|ABK24233.1| unknown [Picea sitchensis]
Length = 463
Score = 186 bits (471), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 86/178 (48%), Positives = 125/178 (70%), Gaps = 1/178 (0%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
+ +++E++E W+AQH ++Y EK+ R +FK+N YI + N +GN +YKLG NQF+DL
Sbjct: 37 DDAIMELYELWLAQHKKAYNGLGEKQNRFSVFKDNFLYIHQHNNQGNPSYKLGLNQFADL 96
Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGC 159
+++EF+A Y G K+ + R + S + +YQ D+P S+DWR+KGAVT +K+Q CG
Sbjct: 97 SHEEFKATYLGAKLDT-KKRLSNSPSPRYQYSDGEDLPESIDWREKGAVTAVKDQGSCGS 155
Query: 160 CWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
CWAF+ VAAVEGI +I +GNL LSEQ+L+DC T+ N GC GG + AF +II N G+
Sbjct: 156 CWAFSTVAAVEGINQIVTGNLTSLSEQELVDCDTSYNQGCNGGLMDYAFQFIINNGGL 213
Score = 86.3 bits (212), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 36/77 (46%), Positives = 53/77 (68%), Gaps = 6/77 (7%)
Query: 209 AYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-- 266
A+ G+F CGTQLDH VT+VG+G+ E G +YW++KNSWG +WG+ G++++ R+
Sbjct: 273 AFQFYESGVFTSTCGTQLDHGVTLVGYGS-ESGTDYWIVKNSWGKSWGEKGFIRLQRNIE 331
Query: 267 ---EGLCGIGTRSSYPL 280
G+CGI +SYPL
Sbjct: 332 GVSTGMCGIAMEASYPL 348
>gi|356533293|ref|XP_003535200.1| PREDICTED: LOW QUALITY PROTEIN: cysteine proteinase RD21a-like
[Glycine max]
Length = 466
Score = 185 bits (470), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 85/180 (47%), Positives = 124/180 (68%), Gaps = 2/180 (1%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
E ++E W+ +HG++Y EKE R KIFK+NL +IE+ N G+++YKLG N+F+DL
Sbjct: 41 ESHTRHVYEAWLVKHGKAYNALGEKERRFKIFKDNLRFIEEHNGAGDKSYKLGLNKFADL 100
Query: 100 TNDEFRALYTGYKMPSPSHRSTTSS--TFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKEC 157
TN+E+RA++ G + P +++ + T +Y + ++P +DWR+KGAVTPIK+Q +C
Sbjct: 101 TNEEYRAMFLGTRTRGPKNKAAVVAKKTDRYAYRAGEELPAMVDWREKGAVTPIKDQGQC 160
Query: 158 GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
G CWAF+ V AVEGI +I +GNL LSEQ+L+DC N GC GG + AF +I+QN GI
Sbjct: 161 GSCWAFSTVGAVEGINQIVTGNLTSLSEQELVDCDRGYNMGCNGGLMDYAFEFIVQNGGI 220
Score = 82.0 bits (201), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 36/76 (47%), Positives = 51/76 (67%), Gaps = 6/76 (7%)
Query: 210 YIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR---- 265
+ + G+F G CGT LDH V VG+GT E+G +YWL++NSWG+ WG+ GY+K+ R
Sbjct: 281 FQLYQSGVFTGRCGTNLDHGVVAVGYGT-ENGTDYWLVRNSWGSAWGENGYIKLERNVQN 339
Query: 266 -DEGLCGIGTRSSYPL 280
+ G CGI +SYP+
Sbjct: 340 TETGKCGIAIEASYPI 355
>gi|357474523|ref|XP_003607546.1| Cysteine proteinase [Medicago truncatula]
gi|358347207|ref|XP_003637651.1| Cysteine proteinase [Medicago truncatula]
gi|355503586|gb|AES84789.1| Cysteine proteinase [Medicago truncatula]
gi|355508601|gb|AES89743.1| Cysteine proteinase [Medicago truncatula]
Length = 345
Score = 185 bits (470), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 128/350 (36%), Positives = 179/350 (51%), Gaps = 92/350 (26%)
Query: 13 INTTPMFIIITLLVSCASQVVSSRS----THEQ--SVVEIHEK----WMAQHGRSYKDEL 62
I TT +FI++ L +C V++S S TH+Q S VE +K W+ +HGR YK
Sbjct: 5 ILTTTIFILLMLCNTC---VIASESECPPTHKQKSSDVEAMKKRFDGWVKRHGRKYKHND 61
Query: 63 EKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTT 122
E+E+R I++ N++YI+ N + N +Y L N+F+DLTN+EF++ Y G SH
Sbjct: 62 EREVRFGIYQANVQYIQCKNAQKN-SYNLTDNKFADLTNEEFQSTYMGLSTRLRSH---- 116
Query: 123 SSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQ 182
++ F+Y D+P S DWR +GAVT I +Q +CG CWAFAAVAAVEGI KI+SG LI
Sbjct: 117 NTGFRYDEHG--DLPESKDWRKEGAVTEIMDQGQCGGCWAFAAVAAVEGINKIKSGKLIS 174
Query: 183 LSEQQLLDCST-NGNNGCLGGSREKAFAYIIQNQGI-------FNGVCGT-----QLDHA 229
LSEQ+L+DC +GN GC GG E A+ +II+N G+ + GV GT +A
Sbjct: 175 LSEQELIDCDVKSGNQGCQGGLMETAYTFIIENGGLTTEQDYPYEGVDGTCKMEKAAHYA 234
Query: 230 VTIVGF-------------------------------------------------GTTED 240
+I G+ G T
Sbjct: 235 ASISGYEEVPADNEAKLKAAAAHQPVSVAIDAGGYSFQFYSEGVFSGICGKQLNHGVTVV 294
Query: 241 GANYWLIKNSW------GNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 280
G I W G WG++GY+++ RD EG+CGI ++SYPL
Sbjct: 295 GYGKETINKYWIVKNSWGADWGESGYIRMKRDTLSKEGMCGIAMQASYPL 344
>gi|356515050|ref|XP_003526214.1| PREDICTED: vignain-like [Glycine max]
Length = 344
Score = 185 bits (470), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 96/202 (47%), Positives = 132/202 (65%), Gaps = 6/202 (2%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
M + L SQV+ R H+ ++ E HE WMA++G+ YKD EKE R +IFK+N+E+
Sbjct: 10 MLALFLFLAVGISQVMP-RKLHQTALRERHENWMAEYGKIYKDAAEKEKRFQIFKDNVEF 68
Query: 78 IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTS-STFKYQNLSMTDV 136
IE N GN+ YKLG N +DLT +EF+ G K +T + FKY+N+ TD+
Sbjct: 69 IESFNAAGNKPYKLGVNHLADLTLEEFKDSRNGLKRTYEFSTTTFKLNGFKYENV--TDI 126
Query: 137 PTSLDWRDKGAVTPIKNQ-KECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG 195
P ++DWR KGAVTPIK+Q +CG CWAF+ VAA EGI +I +G L+ LSEQ+L+DC +
Sbjct: 127 PEAIDWRVKGAVTPIKDQGDQCGSCWAFSTVAATEGIYQISTGMLMSLSEQELVDCDSV- 185
Query: 196 NNGCLGGSREKAFAYIIQNQGI 217
++GC GG E F +II+N GI
Sbjct: 186 DHGCDGGLMEDGFEFIIKNGGI 207
Score = 93.6 bits (231), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 41/71 (57%), Positives = 53/71 (74%), Gaps = 5/71 (7%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGAN-YWLIKNSWGNTWGDAGYMKIVRD----EGLC 270
G+F G CGTQLDH VT+VG+GTT+DG + YW++KNSWG WG+ GY+++ R EGLC
Sbjct: 274 GVFTGQCGTQLDHGVTVVGYGTTDDGTHEYWIVKNSWGTQWGEEGYIRMQRGIDALEGLC 333
Query: 271 GIGTRSSYPLA 281
GI +SYP A
Sbjct: 334 GIAMDASYPTA 344
>gi|359359066|gb|AEV40973.1| putative oryzain beta chain precursor [Oryza punctata]
Length = 461
Score = 185 bits (469), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 107/314 (34%), Positives = 169/314 (53%), Gaps = 78/314 (24%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRT--YKLGTNQFS 97
E ++ W+A++GRSY E+E R ++F +NL++++ N + ++LG N+F+
Sbjct: 42 EAEARAAYDLWLAENGRSYNALGERERRFRVFWDNLKFVDAHNARADEHGGFRLGMNRFA 101
Query: 98 DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKEC 157
DLTNDEFR+ + G K+ S ++ +Y++ + ++P S+DWR+KGAV P+KNQ +C
Sbjct: 102 DLTNDEFRSTFLGAKVVERSR----AAGERYRHDGVEELPESVDWREKGAVAPVKNQGQC 157
Query: 158 GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFAYIIQNQG 216
G CWAF+AV+ VE I ++ +G +I LSEQ+L++CSTNG N+GC GG + AF +II+N G
Sbjct: 158 GSCWAFSAVSTVESINQLVTGEMITLSEQELVECSTNGQNSGCNGGLMDDAFDFIIKNGG 217
Query: 217 I----------FNGVCGTQLDHA--VTIVGF----------------------------- 235
I +G C ++A V+I GF
Sbjct: 218 IDTEDDYPYKAVDGKCDINRENAKVVSIDGFEDVPQNDEKSLQKAVAHQPVSVAIEAGGR 277
Query: 236 --------------GTTED------------GANYWLIKNSWGNTWGDAGYMKIVRD--- 266
GT+ D G +YW+++NSWG WG++GY+++ R+
Sbjct: 278 EFQLYHSGVFSGRCGTSLDHGVVAVGYGTDNGKDYWIVRNSWGPKWGESGYVRMERNINA 337
Query: 267 -EGLCGIGTRSSYP 279
G CGI +SYP
Sbjct: 338 TTGKCGIAMMASYP 351
>gi|356515046|ref|XP_003526212.1| PREDICTED: thiol protease SEN102-like [Glycine max]
Length = 342
Score = 184 bits (468), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 99/216 (45%), Positives = 137/216 (63%), Gaps = 13/216 (6%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
M + L SQV+ R H+ ++ E HE WMA++G+ YKD EKE R +IFK+N+E+
Sbjct: 10 MLALFLFLAVGISQVMP-RKLHQTALRERHENWMAEYGKMYKDAAEKEKRFQIFKDNVEF 68
Query: 78 IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTS-STFKYQNLSMTDV 136
IE N GN+ YKLG N +DLT +EF+ G K +T + FKY+N+ TD+
Sbjct: 69 IESFNAAGNKPYKLGVNHLADLTLEEFKDSRNGLKRTYEFSTTTFKLNGFKYENV--TDI 126
Query: 137 PTSLDWRDKGAVTPIKNQ-KECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG 195
P ++DWR KGAVTPIK+Q +CG WAF+ +AA EGI +I +GNL+ LSEQ+L+DC +
Sbjct: 127 PEAIDWRVKGAVTPIKDQGDQCGRFWAFSTIAATEGIHQISTGNLVSLSEQELVDCDSV- 185
Query: 196 NNGCLGGSREKAFAYIIQNQGI-------FNGVCGT 224
++GC GG E F +II+N GI + GV GT
Sbjct: 186 DDGCEGGFMEDGFEFIIKNGGITSETNYPYKGVDGT 221
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 39/76 (51%), Positives = 53/76 (69%), Gaps = 5/76 (6%)
Query: 210 YIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR---- 265
++ + GI+NG CGT LDH VT VG+GT E+G +YW++KNSWG WG+ GY+++ R
Sbjct: 268 FMFYSSGIYNGECGTDLDHGVTAVGYGT-ENGTDYWIVKNSWGTQWGEKGYIRMHRGIAA 326
Query: 266 DEGLCGIGTRSSYPLA 281
G+CGI SSYP A
Sbjct: 327 KHGICGIALDSSYPTA 342
>gi|225458701|ref|XP_002284973.1| PREDICTED: cysteine proteinase RD21a-like [Vitis vinifera]
Length = 467
Score = 184 bits (468), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 91/208 (43%), Positives = 135/208 (64%), Gaps = 10/208 (4%)
Query: 18 MFIIITLLVSCASQVVSSRSTH--------EQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+F+++ L + ++ TH ++ V+ ++E W+A+HG+SY EKE R +
Sbjct: 14 LFLLLGLASALDMSIIGYDETHGDKSSWRTDEDVMAVYEAWLAKHGKSYNALGEKERRFQ 73
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQ 129
IFK+NL +I++ N E NRTYK+G N+F+DLTN+E+R++Y G + + RS+ + +Y
Sbjct: 74 IFKDNLRFIDEHNAE-NRTYKVGLNRFADLTNEEYRSMYLGTRTAA-KRRSSNKISDRYA 131
Query: 130 NLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLL 189
+P S+DWR KGAV +K+Q CG CWAF+ +AAVEGI KI +G LI LSEQ+L+
Sbjct: 132 FRVGDSLPESVDWRKKGAVVEVKDQGSCGSCWAFSTIAAVEGINKIVTGGLISLSEQELV 191
Query: 190 DCSTNGNNGCLGGSREKAFAYIIQNQGI 217
DC T+ N GC GG + AF +II N GI
Sbjct: 192 DCDTSYNEGCNGGLMDYAFEFIINNGGI 219
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 41/84 (48%), Positives = 55/84 (65%), Gaps = 10/84 (11%)
Query: 202 GSREKAFAYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYM 261
G RE + + GIF G CGT LDH VT VG+GT E+G +YW++KNSWG +WG+ GY+
Sbjct: 276 GGRE----FQLYQSGIFTGRCGTALDHGVTAVGYGT-ENGVDYWIVKNSWGASWGEEGYI 330
Query: 262 KIVRD-----EGLCGIGTRSSYPL 280
++ RD G CGI +SYP+
Sbjct: 331 RMERDLATSATGKCGIAMEASYPI 354
>gi|297745594|emb|CBI40759.3| unnamed protein product [Vitis vinifera]
Length = 300
Score = 184 bits (468), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 110/300 (36%), Positives = 161/300 (53%), Gaps = 75/300 (25%)
Query: 51 MAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTG 110
M++HG+SY+ EK R ++F++NL++I++ NK+ + +Y LG N+F+DL+++EF+ Y G
Sbjct: 1 MSKHGKSYRSFEEKLHRFEVFQDNLKHIDETNKKVS-SYWLGLNEFADLSHEEFKRKYLG 59
Query: 111 YKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVE 170
K+ P R + F Y++++ D+P S+DWR KGAV +KNQ CG CWAF+ VAAVE
Sbjct: 60 LKIELPKRRDSPEE-FSYKDVA--DLPKSVDWRKKGAVAHVKNQGACGSCWAFSTVAAVE 116
Query: 171 GITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIFN----------G 220
GI +I +GNL LSEQ+L+DC NNGC GG + AFA+II N G+ G
Sbjct: 117 GINQIVTGNLTALSEQELIDCDKPFNNGCNGGLMDYAFAFIISNGGLRKEEDYPYVMEEG 176
Query: 221 VCGTQLD--HAVTIVGF------------------------------------------- 235
CG + + VTI G+
Sbjct: 177 TCGEKKEELEVVTISGYHDVPEDNEQSFLKALANQPLSVAIEASSRGFQFYSGGIFNGHC 236
Query: 236 GTTED------------GANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 279
GT D G +Y +KNSWG+ WG+ GY+++ R+ EG+CGI +SYP
Sbjct: 237 GTELDHGVAAVGYGTSKGVDYITVKNSWGSKWGEKGYIRMKRNVGKPEGICGIYKMASYP 296
>gi|356553978|ref|XP_003545327.1| PREDICTED: cysteine proteinase RD21a-like [Glycine max]
Length = 496
Score = 184 bits (467), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 89/188 (47%), Positives = 126/188 (67%), Gaps = 3/188 (1%)
Query: 30 SQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTY 89
+ +SRS E ++ ++E+W+ +HG+ Y EKE R +IFK+NL +I+ N + +RTY
Sbjct: 64 AHAATSRSDEE--LMSMYEQWLVKHGKVYNALGEKEKRFQIFKDNLRFIDDHNSQEDRTY 121
Query: 90 KLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVT 149
KLG N+F+DLTN+E+RA Y G K+ P+ R + + +Y +P S+DWR +GAV
Sbjct: 122 KLGLNRFADLTNEEYRAKYLGTKI-DPNRRLGKTPSNRYAPRVGDKLPESVDWRKEGAVP 180
Query: 150 PIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFA 209
P+K+Q CG CWAF+A+ AVEGI KI +G LI LSEQ+L+DC T N GC GG + AF
Sbjct: 181 PVKDQGGCGSCWAFSAIGAVEGINKIVTGELISLSEQELVDCDTGYNEGCNGGLMDYAFE 240
Query: 210 YIIQNQGI 217
+II N GI
Sbjct: 241 FIINNGGI 248
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 38/85 (44%), Positives = 53/85 (62%), Gaps = 10/85 (11%)
Query: 201 GGSREKAFAYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGY 260
GG RE + + G+F G CGT LDH V VG+GT +G +YW+++NSWG +WG+ GY
Sbjct: 304 GGGRE----FQLYVSGVFTGRCGTALDHGVVAVGYGTA-NGHDYWIVRNSWGPSWGEDGY 358
Query: 261 MKIVRD-----EGLCGIGTRSSYPL 280
+++ R+ G CGI SYPL
Sbjct: 359 IRLERNLANSRSGKCGIAIEPSYPL 383
>gi|115468686|ref|NP_001057942.1| Os06g0582600 [Oryza sativa Japonica Group]
gi|55296512|dbj|BAD68726.1| putative cysteine proteinase [Oryza sativa Japonica Group]
gi|113595982|dbj|BAF19856.1| Os06g0582600 [Oryza sativa Japonica Group]
gi|215695236|dbj|BAG90427.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 357
Score = 184 bits (466), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 102/217 (47%), Positives = 137/217 (63%), Gaps = 11/217 (5%)
Query: 8 SGSFKINTTPMFIII----TLLVSCASQVVSSRSTHEQSVV-EIHEKWMAQHGRSYKDEL 62
S SF + + II+ T LV A + ++ + S + E +EKW A HGR+YKD L
Sbjct: 5 SSSFSLAAILLIIIMYCCPTGLVEAARKGPAAAGGGDDSAMRERYEKWAADHGRTYKDSL 64
Query: 63 EKEMRLKIFKENLEYIEKANKEG-NRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRST 121
EK R ++F+ N +I+ N G ++ +L TN+F+DLTN+EF A Y G +P
Sbjct: 65 EKARRFEVFRTNALFIDSFNAAGGKKSPRLTTNKFADLTNEEF-AEYYGRPFSTPV---I 120
Query: 122 TSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLI 181
S F Y N+ +DVP +++WRD+GAVT +KNQK+C CWAF+AVAAVEGI +IRS NL+
Sbjct: 121 GGSGFMYGNVRTSDVPANINWRDRGAVTQVKNQKDCASCWAFSAVAAVEGIHQIRSHNLV 180
Query: 182 QLSEQQLLDCSTNGNN-GCLGGSREKAFAYIIQNQGI 217
LS QQLLDCST NN GC G ++AF YI N GI
Sbjct: 181 ALSTQQLLDCSTGRNNHGCNRGDMDEAFRYITSNGGI 217
Score = 86.3 bits (212), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 40/76 (52%), Positives = 50/76 (65%), Gaps = 8/76 (10%)
Query: 214 NQGIF----NGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD--- 266
+ G+F N C T L+HA+T VG+GT E G YWL+KNSWG WG+ GYMKI RD
Sbjct: 282 SSGVFGAMQNETCTTDLNHAMTAVGYGTDEHGTKYWLMKNSWGTDWGEGGYMKIARDVAS 341
Query: 267 -EGLCGIGTRSSYPLA 281
GLCG+ + SYP+A
Sbjct: 342 NTGLCGLAMQPSYPVA 357
>gi|2511693|emb|CAB17076.1| cysteine proteinase precursor [Phaseolus vulgaris]
Length = 455
Score = 183 bits (465), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 91/208 (43%), Positives = 134/208 (64%), Gaps = 10/208 (4%)
Query: 18 MFIIITLLVSCASQVVSSRSTH--------EQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+F + L + ++S + H ++ V ++E+W+ +HG+ Y EK+ R +
Sbjct: 3 LFALFALSSALDMSIISYDNAHQDKATWRTDEEVNSLYEEWLVKHGKLYNALGEKDKRFQ 62
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQ 129
IFK+NL +I++ N E NRTYKLG N+F+DLTN+E+RA Y G K+ P+ R + + +Y
Sbjct: 63 IFKDNLRFIDQQNAE-NRTYKLGLNRFADLTNEEYRARYLGTKI-DPNRRLGRTPSNRYA 120
Query: 130 NLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLL 189
+P S+DWR +GAV P+K+Q CG CWAF+A+ AVEGI KI +G+LI LSEQ+L+
Sbjct: 121 PRVGETLPDSVDWRKEGAVVPVKDQASCGSCWAFSAIGAVEGINKIVTGDLISLSEQELV 180
Query: 190 DCSTNGNNGCLGGSREKAFAYIIQNQGI 217
DC T N GC GG + AF +II+N GI
Sbjct: 181 DCDTGYNMGCNGGLMDYAFEFIIKNGGI 208
Score = 78.2 bits (191), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 36/85 (42%), Positives = 54/85 (63%), Gaps = 10/85 (11%)
Query: 201 GGSREKAFAYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGY 260
GG RE + + + G+F G CGT LDH V VG+GT ++G ++W+++NSWG WG+ GY
Sbjct: 264 GGGRE----FQLYSSGVFTGRCGTALDHGVVAVGYGT-DNGHDFWIVRNSWGADWGEEGY 318
Query: 261 MKIVRD-----EGLCGIGTRSSYPL 280
+++ R+ G CGI SYP+
Sbjct: 319 IRLERNLGNSRSGKCGIAIEPSYPI 343
>gi|147790682|emb|CAN61026.1| hypothetical protein VITISV_001146 [Vitis vinifera]
Length = 469
Score = 183 bits (465), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 94/218 (43%), Positives = 139/218 (63%), Gaps = 16/218 (7%)
Query: 14 NTTPMFIIITLLVSCAS------QVVSSRSTH--------EQSVVEIHEKWMAQHGRSYK 59
+++ M + + LL+ AS ++ TH ++ V+ ++E W+A+HG+SY
Sbjct: 6 SSSSMAVFLFLLLGLASASAXDMSIIGYDETHGDKSSWRTDEDVMAVYEAWLAKHGKSYN 65
Query: 60 DELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHR 119
EKE R +IFK+NL +I++ N E NRTYK+G N+F+DLTN+E+R++Y G + + R
Sbjct: 66 ALGEKERRFQIFKDNLRFIDEHNAE-NRTYKVGLNRFADLTNEEYRSMYLGTRTAA-KRR 123
Query: 120 STTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGN 179
S+ + +Y +P S+DWR KGAV +K+Q CG CWAF+ +AAVEGI KI +G
Sbjct: 124 SSNKISDRYAFRVGDSLPESVDWRKKGAVVEVKDQGSCGSCWAFSTIAAVEGINKIVTGG 183
Query: 180 LIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
LI LSEQ+L+DC T+ N GC GG + AF +II N GI
Sbjct: 184 LISLSEQELVDCDTSYNEGCNGGLMDYAFEFIINNGGI 221
Score = 86.3 bits (212), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 41/84 (48%), Positives = 55/84 (65%), Gaps = 10/84 (11%)
Query: 202 GSREKAFAYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYM 261
G RE + + GIF G CGT LDH VT VG+GT E+G +YW++KNSWG +WG+ GY+
Sbjct: 278 GGRE----FQLYQSGIFTGRCGTALDHGVTAVGYGT-ENGVDYWIVKNSWGASWGEEGYI 332
Query: 262 KIVRD-----EGLCGIGTRSSYPL 280
++ RD G CGI +SYP+
Sbjct: 333 RMERDLATSATGKCGIAMEASYPI 356
>gi|13897890|gb|AAK48495.1|AF259983_1 putative cysteine protease [Ipomoea batatas]
Length = 462
Score = 182 bits (463), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 86/182 (47%), Positives = 125/182 (68%), Gaps = 1/182 (0%)
Query: 37 STHEQSVVEIHEKWMAQHGRSYKD-ELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQ 95
S ++ V+ ++E W+ +HG+SY EK+ R +IFK+NL YI++ N G+R+YKLG N+
Sbjct: 39 SRSDEEVMALYESWLVEHGKSYNGLGGEKDKRFEIFKDNLRYIDEQNSRGDRSYKLGLNR 98
Query: 96 FSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQK 155
F+DLTN+E+R+ Y G K + + T S +Y + +P S+DWR+KGAV +K+Q
Sbjct: 99 FADLTNEEYRSTYLGAKTDARRRIAKTKSDRRYAPKAGGSLPDSIDWREKGAVAEVKDQG 158
Query: 156 ECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQ 215
CG CWAF+ +AAVEGI +I +G LI LSEQ+L+DC T+ N GC GG + AF +II+N
Sbjct: 159 SCGSCWAFSTIAAVEGINQIVTGELISLSEQELVDCDTSYNEGCNGGLMDYAFEFIIKNG 218
Query: 216 GI 217
GI
Sbjct: 219 GI 220
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 37/74 (50%), Positives = 51/74 (68%), Gaps = 5/74 (6%)
Query: 210 YIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD--- 266
+ + + GIF G CGT LDH VT VG+GT E+G +YW++KNSW +WG+ GY+++ R+
Sbjct: 281 FQLYSSGIFTGSCGTDLDHGVTAVGYGT-ENGVDYWIVKNSWAASWGEKGYLRMQRNVKD 339
Query: 267 -EGLCGIGTRSSYP 279
GLCGI SYP
Sbjct: 340 KNGLCGIAIEPSYP 353
>gi|641905|gb|AAC49406.1| cysteine proteinase [Zinnia violacea]
Length = 342
Score = 182 bits (463), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 104/295 (35%), Positives = 159/295 (53%), Gaps = 71/295 (24%)
Query: 41 QSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLT 100
V+ + E + +H + Y+ EK R +IF +NL++I++ NK+ + Y LG N+F+DLT
Sbjct: 43 HKVIHLFESSLVKHSKIYESFDEKLHRFEIFMDNLKHIDETNKKVS-NYWLGLNEFADLT 101
Query: 101 NDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCC 160
++EF+ + G+K + + F+Y++ D+P S+DWR KGAV+P+KNQ +CG C
Sbjct: 102 HEEFKNKFLGFKGELAERKDESIEQFRYRDF--VDLPKSVDWRKKGAVSPVKNQGQCGSC 159
Query: 161 WAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST---NGNNGCL-----------GGSREK 206
WAF+ VAAVEGI +I +GNL LSEQ+L+DC T NG NG L G +E+
Sbjct: 160 WAFSTVAAVEGINQIVTGNLTVLSEQELIDCDTTFNNGCNGGLMDYAFAYVTRNGLHKEE 219
Query: 207 AFAYI-----------------------------------IQNQ---------------- 215
+ YI + NQ
Sbjct: 220 EYPYIMSEGTCDEKRDASEKVTISGYHDVPRNNEDSFLKALANQPISVAIEASGRDFQFY 279
Query: 216 --GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDEG 268
G+F+G CGT+LDH V VG+GT++ G +Y +++NSWG WG+ GY+++ R+ G
Sbjct: 280 SGGVFDGHCGTELDHGVAAVGYGTSK-GLDYVIVRNSWGPKWGEKGYIRMKRNTG 333
>gi|242066206|ref|XP_002454392.1| hypothetical protein SORBIDRAFT_04g029960 [Sorghum bicolor]
gi|241934223|gb|EES07368.1| hypothetical protein SORBIDRAFT_04g029960 [Sorghum bicolor]
Length = 356
Score = 182 bits (462), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 113/311 (36%), Positives = 165/311 (53%), Gaps = 76/311 (24%)
Query: 43 VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTND 102
+V + + W +H + Y EK R IFK+NL +I + N++ N +Y LG NQF+D+T++
Sbjct: 41 LVNLFKSWSVKHRKIYVSPKEKLKRYGIFKQNLMHIAETNRK-NGSYWLGLNQFADITHE 99
Query: 103 EFRALYTGYK--MPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCC 160
EF+A + G K + ++ T +TF+Y + ++P S+DWR KGAVTP+KNQ +CG C
Sbjct: 100 EFKANHLGLKQGLSRMGAQTRTPTTFRYA--AAANLPWSVDWRYKGAVTPVKNQGKCGSC 157
Query: 161 WAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIF-- 218
WAF++VAAVEGI +I +G L+ LSEQ+L+DC T ++GC GG + AFAYI+ +QGI
Sbjct: 158 WAFSSVAAVEGINQIVTGKLVSLSEQELMDCDTMLDHGCEGGLMDFAFAYIMGSQGIHAE 217
Query: 219 --------NGVCGTQLDHA--VTIVGF--------------------------------- 235
G C + +A VTI G+
Sbjct: 218 DDYPYLMEEGYCKEKQPYANVVTITGYEDVPENSEISLLKALAHQPVSVGIAAGSRDFQF 277
Query: 236 --GTTEDGA--------------------NYWLIKNSWGNTWGDAGYMKIV----RDEGL 269
G DG+ NY +KNSWG WG+ GY++I + EG+
Sbjct: 278 YKGGVFDGSCSDELDHALTAVGYGSSYGQNYITMKNSWGKNWGEQGYVRIKMGTGKPEGV 337
Query: 270 CGIGTRSSYPL 280
CGI T +SYP+
Sbjct: 338 CGIYTMASYPV 348
>gi|357160095|ref|XP_003578656.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP2-like
[Brachypodium distachyon]
Length = 377
Score = 182 bits (461), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 117/334 (35%), Positives = 154/334 (46%), Gaps = 98/334 (29%)
Query: 41 QSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE--GNRTYKLGTNQFSD 98
Q++ ++W A+HGR+Y E+ RL+++ N+ YIE AN + TY+LG ++D
Sbjct: 47 QTMAPRFQRWKAEHGRAYATRDEELRRLRVYARNVRYIEAANGDPAAGLTYQLGETAYTD 106
Query: 99 LTNDEFRALYTGYKMPSP---SHRSTTSSTFK---------------YQNLSMTDVPTSL 140
LT DEF A+YT PSP +H + Y N+S P S+
Sbjct: 107 LTADEFTAMYTS---PSPVLSAHDDEAAGAMMITTRAGAVDAGGQQVYFNVSTAGAPASV 163
Query: 141 DWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCL 200
DWR KGAVT +KNQ CG CWAF+ VA VEGI +IR+GNLI LSEQ+L+DC T + GC
Sbjct: 164 DWRAKGAVTEVKNQGRCGSCWAFSTVAVVEGIHQIRTGNLISLSEQELVDCDTL-DYGCD 222
Query: 201 GGSREKAFAYIIQNQGIF----------NGVC-GTQLD-HAVTIVGFGTT---------- 238
GG A +I N GI +G C +L HA I GF
Sbjct: 223 GGVSYHALEWIASNGGIATEADYPYTGKDGACVANKLPLHAAAISGFARVATRSEPSLAN 282
Query: 239 ------------EDGANY-----------------------------------WLIKNSW 251
GAN+ W++KNSW
Sbjct: 283 AVAAQPVAVSIEAGGANFQHYVKGVYNGPCGTRLNHGVTVVGYGEEEGDGEKYWIVKNSW 342
Query: 252 GNTWGDAGYMKIVRD-----EGLCGIGTRSSYPL 280
G WGD GY ++ +D EGLCGI R S+PL
Sbjct: 343 GKKWGDGGYFRMKKDVAGKPEGLCGIAIRPSFPL 376
>gi|297598407|ref|NP_001045533.2| Os01g0971400 [Oryza sativa Japonica Group]
gi|15289977|dbj|BAB63672.1| putative cysteine protease CP1 [Oryza sativa Japonica Group]
gi|125529282|gb|EAY77396.1| hypothetical protein OsI_05384 [Oryza sativa Indica Group]
gi|125573472|gb|EAZ14987.1| hypothetical protein OsJ_04922 [Oryza sativa Japonica Group]
gi|215740756|dbj|BAG97412.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215741010|dbj|BAG97505.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215765325|dbj|BAG87022.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215767338|dbj|BAG99566.1| unnamed protein product [Oryza sativa Japonica Group]
gi|255674119|dbj|BAF07447.2| Os01g0971400 [Oryza sativa Japonica Group]
Length = 365
Score = 182 bits (461), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 86/181 (47%), Positives = 129/181 (71%), Gaps = 3/181 (1%)
Query: 37 STHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQF 96
++HE+ ++E+ EK+MA++ ++Y EK R ++FK+NL +I++ NK+ Y LG N+F
Sbjct: 43 ASHER-LMELFEKFMAKYRKAYSSLEEKLRRFEVFKDNLNHIDEENKK-ITGYWLGLNEF 100
Query: 97 SDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKE 156
+DLT+DEF+A Y G + +P+ R++ F+Y+ + +P +DWR KGAVT +KNQ +
Sbjct: 101 ADLTHDEFKAAYLGLTL-TPARRNSNDQLFRYEEVEAASLPKEVDWRKKGAVTEVKNQGQ 159
Query: 157 CGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQG 216
CG CWAF+ VAAVEGI I +GNL +LSEQ+L+DC T+GNNGC GG + AF+YI N G
Sbjct: 160 CGSCWAFSTVAAVEGINAIVTGNLTRLSEQELIDCDTDGNNGCSGGLMDYAFSYIAANGG 219
Query: 217 I 217
+
Sbjct: 220 L 220
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 35/68 (51%), Positives = 49/68 (72%), Gaps = 4/68 (5%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR----DEGLCG 271
G+F+G CGT+LDH VT VG+GT G +Y ++KNSWG+ WG+ GY+++ R +GLCG
Sbjct: 294 GVFDGPCGTRLDHGVTAVGYGTASKGHDYIIVKNSWGSHWGEKGYIRMRRGTGKHDGLCG 353
Query: 272 IGTRSSYP 279
I +SYP
Sbjct: 354 INKMASYP 361
>gi|359359213|gb|AEV41117.1| putative oryzain beta chain precursor [Oryza officinalis]
Length = 465
Score = 182 bits (461), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 107/313 (34%), Positives = 167/313 (53%), Gaps = 77/313 (24%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN-KEGNRTYKLGTNQFSD 98
E ++ W+A++GRSY E E R ++F +NL + + N + + ++LG N+F+D
Sbjct: 47 EAEARAAYDLWLAENGRSYNALGEHERRFRVFWDNLRFADAHNARADDHGFRLGMNRFAD 106
Query: 99 LTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECG 158
LTN+EFRA + G K+ S ++ +Y++ + ++P S+DWR+KGAV P+KNQ +CG
Sbjct: 107 LTNEEFRATFLGAKVVERSR----AAGERYRHDGVEELPESVDWREKGAVAPVKNQGQCG 162
Query: 159 CCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFAYIIQNQGI 217
CWAF+AV+ VE I ++ +G +I LSEQ+L++CSTNG N+GC GG + AF +II+N GI
Sbjct: 163 SCWAFSAVSTVESINQLVTGEMITLSEQELVECSTNGQNSGCNGGLMDDAFDFIIKNGGI 222
Query: 218 ----------FNGVCGTQLDHA--VTIVGF------------------------------ 235
+G C ++A V+I GF
Sbjct: 223 DTEDDYPYKAVDGKCDINRENAKVVSIDGFEDVPQNDEKSLQKAVAHQPVSVAIEAGGRE 282
Query: 236 -------------GTTED------------GANYWLIKNSWGNTWGDAGYMKIVRD---- 266
GT+ D G +YW+++NSWG WG++GY+++ R+
Sbjct: 283 FQLYHSGVFSGRCGTSLDHGVVAVGYGTDNGKDYWIVRNSWGPKWGESGYVRMERNINVT 342
Query: 267 EGLCGIGTRSSYP 279
G CGI +SYP
Sbjct: 343 TGKCGIAMMASYP 355
>gi|2414570|emb|CAB16317.1| cysteine proteinase precursor [Nicotiana tabacum]
Length = 374
Score = 181 bits (460), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 91/183 (49%), Positives = 121/183 (66%), Gaps = 5/183 (2%)
Query: 38 THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
+ E V +E W+A+HGR+Y EKE R +IFK+NL +IE N GNRTYK+G NQF+
Sbjct: 41 SDEDQVKNRYEMWLAEHGRAYNALGEKEKRFEIFKDNLRFIEGHNNSGNRTYKVGLNQFA 100
Query: 98 DLTNDEFRALYTGYKMPSPSHRSTTSS---TFKYQNLSMTDVPTSLDWRDKGAVTPIKNQ 154
DLTN+E+R +Y G K S + R S + +Y + +P S+DWR +GAV PIKNQ
Sbjct: 101 DLTNEEYRTMYLGTK--SDARRRFVKSKNPSQRYASRPNELMPHSVDWRKRGAVAPIKNQ 158
Query: 155 KECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQN 214
CG CWAF+ VAAVEGI +I +G +I LSEQ+L+DC N+GC GG + AF +II N
Sbjct: 159 GSCGSCWAFSTVAAVEGINQIVTGEMITLSEQELVDCDRVQNSGCNGGLMDYAFEFIISN 218
Query: 215 QGI 217
G+
Sbjct: 219 GGM 221
Score = 82.8 bits (203), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 35/76 (46%), Positives = 53/76 (69%), Gaps = 6/76 (7%)
Query: 209 AYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE- 267
A+ + + G+F G CG ++DH V +VG+G+ EDG +YW+++NSWG WG+ GY+K+ R+
Sbjct: 280 AFQLYSSGVFTGECGEEVDHGVVVVGYGS-EDGVDYWIVRNSWGTKWGENGYVKMERNVK 338
Query: 268 ----GLCGIGTRSSYP 279
G CGI T +SYP
Sbjct: 339 KSHLGKCGIMTEASYP 354
>gi|535473|emb|CAA53377.1| cysteine protease [Vicia sativa]
Length = 368
Score = 181 bits (460), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 113/336 (33%), Positives = 164/336 (48%), Gaps = 76/336 (22%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
F +IT ++ Q+ + RS E V+ ++E+W+ +H + Y EK+ R +IFK+NL +
Sbjct: 12 FFSLITFSLALDIQLPTGRSNDE--VMTMYEEWLVKHQKVYNGLREKDQRFQIFKDNLNF 69
Query: 78 IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSST-FKYQNLSMTDV 136
I++ N + N TY +G N+F+D+TN+E+R +Y G + T +Y S +
Sbjct: 70 IDEHNAQ-NYTYIVGLNKFADMTNEEYRDMYLGTRSDIKRRIMKNKITGHRYAYNSGDRL 128
Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGN 196
P +DWR KGA+T IK+Q CG CWAF+ +A VE I KI +G L+ LSEQ+L+DC N
Sbjct: 129 PVHVDWRLKGAITHIKDQGSCGSCWAFSTIATVEAINKIVTGKLVSLSEQELVDCDRAFN 188
Query: 197 NGCLGGSREKAFAYIIQNQGI----------FNGVCGTQLDHA--VTIVGF--------- 235
GC GG + AF +II N GI F G C A V+I G+
Sbjct: 189 EGCNGGLMDYAFEFIIGNGGIDTDQHYPYKGFEGRCDPTRKKAKIVSIDGYEDVPSNNEN 248
Query: 236 ----------------------------------GTTED------------GANYWLIKN 249
GT+ D G +YWL++N
Sbjct: 249 ALKKAVAHQPVSVAIEASGRALQLYQSGVFTGKCGTSLDHAVVIVGYGSENGLDYWLVRN 308
Query: 250 SWGNTWGDAGYMKIVRD-----EGLCGIGTRSSYPL 280
SWG WG+ GY K+ R+ G CGI +SYP+
Sbjct: 309 SWGTNWGEDGYFKMERNVKGTHTGKCGIAVEASYPV 344
>gi|357166364|ref|XP_003580686.1| PREDICTED: oryzain alpha chain-like [Brachypodium distachyon]
Length = 360
Score = 181 bits (460), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 111/331 (33%), Positives = 167/331 (50%), Gaps = 83/331 (25%)
Query: 25 LVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN-- 82
+ S + Q+ S E+ ++ +W AQHG +E +E R + F++NL YI++ N
Sbjct: 26 IASSSGQIRS-----EEETRRMYAEWTAQHGSPITNE--EEGRYEAFRDNLRYIDEHNAA 78
Query: 83 -KEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLD 141
G +++LG N+F+ LTN+E+RA Y G ++ S + + +Y+ +P S+D
Sbjct: 79 ADAGIHSFRLGLNRFAGLTNEEYRAAYLGLRLRSGAVGDLRKPSARYEAADGEALPESVD 138
Query: 142 WRDKGAVTPIKNQ-KECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST------- 193
WR+KGAV +K+Q + CG WAF+A+AAVE I +I +G LI LSEQ+L+DC T
Sbjct: 139 WREKGAVGKVKDQGRSCGSAWAFSAIAAVESINQIVTGELISLSEQELMDCDTSYNAGCD 198
Query: 194 -------------NG-------------NNGCLGGSR----------------EKAFAYI 211
NG N+ C R EK+
Sbjct: 199 GGLMDDAFEFIISNGGIDTDEDYPYKARNDSCDANKRNRKAVTIDDYEDLRMNEKSLQKA 258
Query: 212 IQNQ------------------GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGN 253
+ NQ GIF G CGT LDHA TIVG+G +E+G +YW++K S+G
Sbjct: 259 VSNQPVSVAIEAGGRDFQLYKSGIFTGTCGTDLDHATTIVGYG-SENGTDYWIVKESYGT 317
Query: 254 TWGDAGYMKIVRD----EGLCGIGTRSSYPL 280
+WG++GY ++ R+ G CGI SYP+
Sbjct: 318 SWGESGYARMERNIKETSGKCGIAMLPSYPV 348
>gi|255635645|gb|ACU18172.1| unknown [Glycine max]
Length = 355
Score = 181 bits (459), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 110/338 (32%), Positives = 162/338 (47%), Gaps = 77/338 (22%)
Query: 18 MFIIITLLVSCASQVVSSRSTH--------EQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+F++ + + ++S + H + V+ + E+W+ +H + Y EKE R +
Sbjct: 8 LFMVFAVSSALDMSIISHDNAHADRATRRTDDEVMSMFEEWLVKHDKVYNALGEKEKRFQ 67
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQ 129
IFK NL +I++ N NRTYKLG N F+DLTN E+RA+Y P T +Y
Sbjct: 68 IFKNNLRFIDERNSL-NRTYKLGLNVFADLTNAEYRAMYLRTWDDGPRLDLDTPPRNRYV 126
Query: 130 NLSMTDVPTSLDWRDKGAVTPIKNQ-KECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQL 188
+P S+DWR +GAVTP+KNQ C CWAF AV AVE + KI++G+LI LSEQ++
Sbjct: 127 PRVGDTIPKSVDWRKEGAVTPVKNQGATCNSCWAFTAVGAVESLVKIKTGDLISLSEQEV 186
Query: 189 LD----------------------------------------CSTNGNN--------GCL 200
+D C +N N G +
Sbjct: 187 VDCTTSSSRGCGGGDIQHGYIYIRKNGISLEKDYPYRGDEGKCDSNKKNAIVTIDGHGWV 246
Query: 201 GGSREKAFAYIIQNQ------------------GIFNGVCGTQLDHAVTIVGFGTTEDGA 242
E+A I NQ G+F G CGT+L+HA+ +VG+G +DG
Sbjct: 247 PTQLEEALKQGIANQPVAVPIPADDYEFQYYTSGVFKGKCGTELNHALLLVGYGAEKDG- 305
Query: 243 NYWLIKNSWGNTWGDAGYMKIVRDEGLCGIGTRSSYPL 280
+YW+ KNS+ + WG+ GY++I R C G YP+
Sbjct: 306 DYWIAKNSYSDKWGENGYIRIQRKLSTCKFGNGGYYPI 343
>gi|46395939|sp|Q94B08.2|GCP1_ARATH RecName: Full=Germination-specific cysteine protease 1; Flags:
Precursor
gi|4006883|emb|CAB16767.1| cysteine proteinase [Arabidopsis thaliana]
gi|7270637|emb|CAB80354.1| cysteine proteinase [Arabidopsis thaliana]
Length = 376
Score = 180 bits (457), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 111/321 (34%), Positives = 165/321 (51%), Gaps = 81/321 (25%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDEL----EKEMRLKIFKENLEYIEKANKEG-NRTYKLGTN 94
++ V I+ +W A+HG++ + +++ R IFK+NL +I+ N++ N TYKLG
Sbjct: 42 DEEVRSIYLQWSAEHGKTNNNNNGIINDQDKRFNIFKDNLRFIDLHNEDNKNATYKLGLT 101
Query: 95 QFSDLTNDEFRALYTGYKMPSPSHRSTTSSTF--KYQN-LSMTDVPTSLDWRDKGAVTPI 151
+F+DLTNDE+R LY G + P+ R + KY ++ +VP ++DWR KGAV PI
Sbjct: 102 KFTDLTNDEYRKLYLGART-EPARRIAKAKNVNQKYSAAVNGKEVPETVDWRQKGAVNPI 160
Query: 152 KNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYI 211
K+Q CG CWAF+ AAVEGI KI +G LI LSEQ+L+DC + N GC GG + AF +I
Sbjct: 161 KDQGTCGSCWAFSTTAAVEGINKIVTGELISLSEQELVDCDKSYNQGCNGGLMDYAFQFI 220
Query: 212 IQNQGI----------FNGVCGTQLDHA--VTIVGF------------------------ 235
++N G+ F G C + L ++ V+I G+
Sbjct: 221 MKNGGLNTEKDYPYRGFGGKCNSFLKNSRVVSIDGYEDVPTKDETALKKAISYQPVSVAI 280
Query: 236 -------------------GTTED------------GANYWLIKNSWGNTWGDAGYMKIV 264
GT D G +YW+++NSWG WG+ GY+++
Sbjct: 281 EAGGRIFQHYQSGIFTGSCGTNLDHAVVAVGYGSENGVDYWIVRNSWGPRWGEEGYIRME 340
Query: 265 RD-----EGLCGIGTRSSYPL 280
R+ G CGI +SYP+
Sbjct: 341 RNLAASKSGKCGIAVEASYPV 361
>gi|5777889|emb|CAB53515.1| cysteine protease [Solanum tuberosum]
Length = 466
Score = 180 bits (457), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 110/350 (31%), Positives = 169/350 (48%), Gaps = 76/350 (21%)
Query: 7 RSGSFKINTTPMFIIITLLVSCASQVVSSRSTH-----EQSVVEIHEKWMAQHGRSYKDE 61
S + I+ M I TL + ++S TH + V ++E W+ +HG+SY
Sbjct: 4 HSSTLTISLLLMLIFSTLSSASDMSIISYDETHIHHRSDDEVSALYESWLIEHGKSYNAL 63
Query: 62 LEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRST 121
EK+ R +IFK+NL+YI++ N N++YKLG +F+DLTN+E+R++Y G K + +
Sbjct: 64 GEKDKRFQIFKDNLKYIDEQNSVPNQSYKLGLTKFADLTNEEYRSIYLGTKSSGDRRKLS 123
Query: 122 TSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLI 181
+ + +Y +P S+DWRDKG + +K+Q CG CWAF+AVAA+E I I +GNLI
Sbjct: 124 KNKSDRYLPKVGDSLPESVDWRDKGVLVGVKDQGSCGSCWAFSAVAAMESINAIVTGNLI 183
Query: 182 QLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI----------FNGVC--------- 222
LSEQ+L+DC + N GC GG + AF ++I N GI N VC
Sbjct: 184 SLSEQELVDCDKSYNEGCDGGLMDYAFEFVINNGGIDTEEDYPYKERNDVCDQYRKNAKV 243
Query: 223 ---------------------------------GTQLDHAVTIVGFGTTEDGANYWLIKN 249
G L H + + G ++ ++
Sbjct: 244 VKIDSYEDVPVNNEKALQKAVAHQPVSIAIEAGGRDLQHYKSGIFTGKCGTAVDHGVVAA 303
Query: 250 SWG-----------NTW----GDAGYMKIVRD----EGLCGIGTRSSYPL 280
+G N+W G+ GY+++ R+ GLCG+ T SYP+
Sbjct: 304 GYGSENGMDYWIVRNSWGAKWGEKGYLRVQRNVASSSGLCGLATEPSYPV 353
>gi|224056176|ref|XP_002298740.1| predicted protein [Populus trichocarpa]
gi|222845998|gb|EEE83545.1| predicted protein [Populus trichocarpa]
Length = 455
Score = 180 bits (457), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 84/178 (47%), Positives = 119/178 (66%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
E I+E W+ +HGR+Y EKE R +IFK+NL++I++ N GN +YKLG N+F+DL
Sbjct: 18 EAETRRIYEMWLVKHGRAYNALGEKERRFEIFKDNLKFIDEHNSVGNPSYKLGLNKFADL 77
Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGC 159
+NDE+R++Y G +M + +Y D+P ++DWR+KGAV P+K+Q +CG
Sbjct: 78 SNDEYRSVYLGTRMDGKGRLLGGPKSERYLFKEGDDLPETVDWREKGAVAPVKDQGQCGS 137
Query: 160 CWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
CWAF+ V AVEGI +I +GNL LSEQ+L+DC N GC GG + AF +II+N GI
Sbjct: 138 CWAFSTVGAVEGINQIVTGNLTSLSEQELVDCDKTYNLGCNGGLMDYAFDFIIENGGI 195
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 36/76 (47%), Positives = 49/76 (64%), Gaps = 6/76 (7%)
Query: 209 AYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-- 266
+ + G+F G CGTQLDH V VG+GT E G +YW+++NSWG WG+ GY+++ RD
Sbjct: 255 GFQLYQSGVFTGSCGTQLDHGVVTVGYGT-EHGVDYWIVRNSWGPAWGENGYIRMERDVA 313
Query: 267 ---EGLCGIGTRSSYP 279
G CGI +SYP
Sbjct: 314 STETGKCGIAMEASYP 329
>gi|50355615|dbj|BAD29956.1| cysteine protease [Daucus carota]
Length = 423
Score = 180 bits (457), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 106/301 (35%), Positives = 158/301 (52%), Gaps = 75/301 (24%)
Query: 51 MAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTG 110
+ +H ++Y KE R +IFK+NL +I++ NK N+++KLG N+F+DL+N+E+++++ G
Sbjct: 11 LVKHHKNYNALGAKEKRFEIFKDNLRFIDEHNKGVNQSFKLGLNKFADLSNEEYKSMFLG 70
Query: 111 YKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVE 170
+M + S FKY ++P S+DWR+KGAV P+K+Q +CG CWAF+ VAAVE
Sbjct: 71 GRMVR-DRKGFESDRFKYG--VGDELPQSVDWREKGAVAPVKDQGQCGSCWAFSTVAAVE 127
Query: 171 GITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI----------FNG 220
GI +I +G+LI LSEQ+L+DC N GC GG + AF +I++N GI +G
Sbjct: 128 GINQIATGDLISLSEQELVDCDKGFNQGCNGGFMDYAFEFIVKNGGIDTEDDYPYKGVDG 187
Query: 221 VCGTQLDHA--VTIVGF------------------------------------------- 235
C +A VTI GF
Sbjct: 188 QCDQNRKNAKVVTINGFEDVPQNDEKSLKKAVAHQPVSVAIEAGGRAFQLYESGIFNGLC 247
Query: 236 GTTED------------GANYWLIKNSWGNTWGDAGYMKIVR-----DEGLCGIGTRSSY 278
GT D G +YW+++NSWG WG+ GY+++ R + G CGI + SY
Sbjct: 248 GTDLDHGVVAVGYGTEDGKDYWIVRNSWGPNWGENGYIRLERNVASTNTGKCGIAMQPSY 307
Query: 279 P 279
P
Sbjct: 308 P 308
>gi|255646092|gb|ACU23533.1| unknown [Glycine max]
Length = 208
Score = 180 bits (456), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 87/172 (50%), Positives = 120/172 (69%), Gaps = 8/172 (4%)
Query: 34 SSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGT 93
++R+ + + E HE+WMA HG+ Y EKE + + FKEN++ IE N GN+ YKLG
Sbjct: 27 NARTLEDAPMRERHEQWMAIHGKVYTHSYEKEQKYQTFKENVQRIEAFNHAGNKPYKLGI 86
Query: 94 NQFSDLTNDEFRAL--YTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPI 151
N F+DLTN+EF+A+ + G+ + T + TF+Y+N MT VP +LDWR +GAVTPI
Sbjct: 87 NHFADLTNEEFKAINRFKGH----VCSKITRTPTFRYEN--MTAVPATLDWRQEGAVTPI 140
Query: 152 KNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGS 203
K+Q +CGCCWAF+AVAA EGITK+ +G LI LSEQ+L+DC T G + + GS
Sbjct: 141 KDQGQCGCCWAFSAVAATEGITKLSTGKLISLSEQELVDCDTKGVDQVVKGS 192
>gi|442754503|gb|JAA69411.1| Putative cathepsin l-like cysteine proteinase b [Ixodes ricinus]
Length = 335
Score = 180 bits (456), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 109/304 (35%), Positives = 153/304 (50%), Gaps = 76/304 (25%)
Query: 52 AQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEFRALY 108
A+HG+SY E E+ RLKI+ EN I K N++ G Y + N+F D+ + EF +
Sbjct: 32 AKHGKSYVSETEEVFRLKIYMENRHKIAKHNEKYARGEVPYSMAMNEFGDMLHHEFVSTR 91
Query: 109 TGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAA 168
G+K S+ + +N+ +P ++DWR KGAVTP+KNQ +CG CWAF+A +
Sbjct: 92 NGFKRNYKDQPREGSTYLEPENIEDFSLPKTVDWRTKGAVTPVKNQGQCGSCWAFSATGS 151
Query: 169 VEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGI-------FNG 220
+EG +SG+++ LSEQ L+DCST+ GNNGC GG + AF YI N+GI +NG
Sbjct: 152 LEGQHFRKSGSMVSLSEQNLVDCSTDFGNNGCEGGLMDNAFKYIRANKGIDTEKSYPYNG 211
Query: 221 VCG------------------------TQLDHAVTIV----------------------- 233
G TQL AV V
Sbjct: 212 TDGTCHFKKSTVGATDSGFVDIKEGSETQLKKAVATVGPISVAIDASHESFQFYSDGVYD 271
Query: 234 ----------------GFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRS 276
G+GT +G +YWL+KNSWG TWGD GY+++ R+ + CGI + +
Sbjct: 272 EPECDSESLDHGVLVVGYGTL-NGTDYWLVKNSWGTTWGDEGYIRMSRNKKNQCGIASSA 330
Query: 277 SYPL 280
SYPL
Sbjct: 331 SYPL 334
>gi|124484387|dbj|BAF46304.1| cysteine proteinase precursor [Ipomoea nil]
Length = 474
Score = 180 bits (456), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 85/178 (47%), Positives = 121/178 (67%), Gaps = 1/178 (0%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
E V E+ E W+ +HG+SY EK+ R KIF++NL+YI++ N NR+YKLG N+F+D+
Sbjct: 43 EDEVKEMFESWLVKHGKSYNAVDEKDKRFKIFRDNLKYIDEKNSLENRSYKLGLNRFADI 102
Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGC 159
TN+E+R Y G K + S S + +Y ++ +P S+DWR+KGAVT +K+Q CG
Sbjct: 103 TNEEYRTGYLGAKRDA-SRNMVKSKSDRYAPVAGDSLPDSIDWREKGAVTGVKDQGSCGS 161
Query: 160 CWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
CWAF+ +AAVEG+ ++ +GNLI LSEQ+L+DC N GC GG AF +II+N GI
Sbjct: 162 CWAFSTIAAVEGVNQLATGNLISLSEQELVDCDRKINQGCNGGDMGYAFQFIIKNGGI 219
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 37/77 (48%), Positives = 53/77 (68%), Gaps = 5/77 (6%)
Query: 207 AFAYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD 266
+ + + + GIF G CGT LDH V VG+GT E+G +YW++KNSWG+ WG+ GY+++ R+
Sbjct: 278 GYDFQLYSSGIFTGSCGTDLDHGVAAVGYGT-ENGVDYWIVKNSWGDYWGEKGYVRMQRN 336
Query: 267 ----EGLCGIGTRSSYP 279
GLCGI +SYP
Sbjct: 337 VKAKTGLCGIAMEASYP 353
>gi|225438807|ref|XP_002283263.1| PREDICTED: germination-specific cysteine protease 1-like isoform 1
[Vitis vinifera]
Length = 374
Score = 179 bits (455), Expect = 8e-43, Method: Compositional matrix adjust.
Identities = 84/179 (46%), Positives = 127/179 (70%), Gaps = 2/179 (1%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
E+ V+ +++ WMA+HG++Y EKE R +IFK+NL++I++ N + NRTYK+G N+F+DL
Sbjct: 39 EEEVMGMYQWWMAKHGKAYNGLGEKEKRFEIFKDNLKFIDEHNAQ-NRTYKVGLNRFADL 97
Query: 100 TNDEFRALYTGYKM-PSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECG 158
TN+E+RA+Y G + P +++ +Y + +P S+DWR+ GAV P+K+Q+ CG
Sbjct: 98 TNEEYRAIYLGTRSDPKRRFAKLKNASPRYAVMPGEVLPESVDWRETGAVNPVKDQRSCG 157
Query: 159 CCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
CWAF+ VAAVEGI +I +G LI LSEQ+L+DC T + GC GG + AF +II+N G+
Sbjct: 158 SCWAFSTVAAVEGINQIVTGELISLSEQELVDCDTEYDMGCNGGLMDYAFDFIIKNGGL 216
Score = 80.9 bits (198), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 34/70 (48%), Positives = 50/70 (71%), Gaps = 6/70 (8%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-----EGLC 270
GIF G CGT LDH + VG+GT E+G +YW+++NSWG++WG+ GY+++ R+ G C
Sbjct: 283 GIFTGECGTALDHGIVAVGYGT-ENGTDYWIVRNSWGSSWGENGYIRMERNMADAFSGKC 341
Query: 271 GIGTRSSYPL 280
GI +SYP+
Sbjct: 342 GIAMEASYPI 351
>gi|45738078|gb|AAS75836.1| fastuosain precursor [Bromelia fastuosa]
Length = 324
Score = 179 bits (455), Expect = 8e-43, Method: Compositional matrix adjust.
Identities = 86/175 (49%), Positives = 117/175 (66%), Gaps = 5/175 (2%)
Query: 43 VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTND 102
++E E+WMA++GR Y D EK R +IFK N+ +IE N +Y LG NQF+D+TN+
Sbjct: 6 MMERFEEWMAEYGRVYNDNAEKMRRFQIFKNNVNHIETFNNRSGNSYTLGVNQFTDMTNN 65
Query: 103 EFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWA 162
EF A YTG +P R S + ++ ++ VP S+DWRD GAVT +KNQ CG CWA
Sbjct: 66 EFLARYTGASLPLNIERDPVVS---FDDVDISAVPQSIDWRDYGAVTSVKNQGSCGSCWA 122
Query: 163 FAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
F+A+A VEGI KI++GNLI LSEQ++LDC+ + GC GG KA+ +II N G+
Sbjct: 123 FSAIATVEGIYKIKAGNLISLSEQEVLDCAL--SYGCDGGWVNKAYDFIISNNGV 175
Score = 84.3 bits (207), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 33/68 (48%), Positives = 47/68 (69%), Gaps = 4/68 (5%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCG 271
G+F G CGT L+HA+T++G+G T G YW++KNSWG +WG+ GY+++ RD GLCG
Sbjct: 240 GVFTGSCGTSLNHAITVIGYGQTSSGTKYWIVKNSWGTSWGERGYIRMARDVSSPYGLCG 299
Query: 272 IGTRSSYP 279
I +P
Sbjct: 300 IAMAPLFP 307
>gi|125571164|gb|EAZ12679.1| hypothetical protein OsJ_02594 [Oryza sativa Japonica Group]
Length = 250
Score = 179 bits (455), Expect = 8e-43, Method: Compositional matrix adjust.
Identities = 90/182 (49%), Positives = 116/182 (63%), Gaps = 6/182 (3%)
Query: 41 QSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEG-NRTYKLGTNQFSDL 99
S+ HE+WMA+ GR+Y D EK R+++F N E ++ AN+ G +RTY LG NQFSDL
Sbjct: 37 HSMAARHERWMARFGRAYADAAEKARRMEVFAANAERVDAANRAGGDRTYTLGLNQFSDL 96
Query: 100 TNDEFRALYTGYK--MPSPSHRS--TTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQK 155
T+DEF + GY P PSHR + TDVP S+DWR +GAVT +KNQ+
Sbjct: 97 TDDEFAQTHLGYSWAPPPPSHRHGHRAENGTAAAAADDTDVPDSVDWRARGAVTEVKNQR 156
Query: 156 ECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQ 215
CG CWAFAAVAA EG+ ++ +GNL+ LSEQQ+LDC T G N C GG A YI +
Sbjct: 157 SCGSCWAFAAVAATEGLVQLATGNLVSLSEQQVLDC-TGGANTCSGGDVSAALRYIAASG 215
Query: 216 GI 217
G+
Sbjct: 216 GL 217
>gi|186516984|ref|NP_195406.2| cysteine proteinase1 [Arabidopsis thaliana]
gi|15290508|gb|AAK92229.1| cysteine proteinase [Arabidopsis thaliana]
gi|332661313|gb|AEE86713.1| cysteine proteinase1 [Arabidopsis thaliana]
Length = 376
Score = 179 bits (455), Expect = 9e-43, Method: Compositional matrix adjust.
Identities = 111/321 (34%), Positives = 164/321 (51%), Gaps = 81/321 (25%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDEL----EKEMRLKIFKENLEYIEKANKEG-NRTYKLGTN 94
++ V I+ +W A+HG++ + +++ R IFK+NL +I+ N+ N TYKLG
Sbjct: 42 DEEVRSIYLQWSAEHGKTNNNNNGIINDQDKRFNIFKDNLRFIDLHNENNKNATYKLGLT 101
Query: 95 QFSDLTNDEFRALYTGYKMPSPSHRSTTSSTF--KYQN-LSMTDVPTSLDWRDKGAVTPI 151
+F+DLTNDE+R LY G + P+ R + KY ++ +VP ++DWR KGAV PI
Sbjct: 102 KFTDLTNDEYRKLYLGART-EPARRIAKAKNVNQKYSAAVNGKEVPETVDWRQKGAVNPI 160
Query: 152 KNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYI 211
K+Q CG CWAF+ AAVEGI KI +G LI LSEQ+L+DC + N GC GG + AF +I
Sbjct: 161 KDQGTCGSCWAFSTTAAVEGINKIVTGELISLSEQELVDCDKSYNQGCNGGLMDYAFQFI 220
Query: 212 IQNQGI----------FNGVCGTQLDHA--VTIVGF------------------------ 235
++N G+ F G C + L ++ V+I G+
Sbjct: 221 MKNGGLNTEKDYPYRGFGGKCNSFLKNSRVVSIDGYEDVPTKDETALKKAISYQPVSVAI 280
Query: 236 -------------------GTTED------------GANYWLIKNSWGNTWGDAGYMKIV 264
GT D G +YW+++NSWG WG+ GY+++
Sbjct: 281 EAGGRIFQHYQSGIFTGSCGTNLDHAVVAVGYGSENGVDYWIVRNSWGPRWGEEGYIRME 340
Query: 265 RD-----EGLCGIGTRSSYPL 280
R+ G CGI +SYP+
Sbjct: 341 RNLAASKSGKCGIAVEASYPV 361
>gi|110737959|dbj|BAF00916.1| cysteine proteinase [Arabidopsis thaliana]
Length = 376
Score = 179 bits (455), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 111/321 (34%), Positives = 164/321 (51%), Gaps = 81/321 (25%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDEL----EKEMRLKIFKENLEYIEKANKEG-NRTYKLGTN 94
++ V I+ +W A+HG++ + +++ R IFK+NL +I+ N+ N TYKLG
Sbjct: 42 DEEVRSIYLQWSAEHGKTNNNNNGIINDQDKRFNIFKDNLRFIDLHNENNKNATYKLGLT 101
Query: 95 QFSDLTNDEFRALYTGYKMPSPSHRSTTSSTF--KYQN-LSMTDVPTSLDWRDKGAVTPI 151
+F+DLTNDE+R LY G + P+ R + KY ++ +VP ++DWR KGAV PI
Sbjct: 102 KFTDLTNDEYRKLYLGART-EPARRIAKAKNVNQKYSAAVNGKEVPETVDWRQKGAVNPI 160
Query: 152 KNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYI 211
K+Q CG CWAF+ AAVEGI KI +G LI LSEQ+L+DC + N GC GG + AF +I
Sbjct: 161 KDQGTCGSCWAFSTTAAVEGINKIVTGELISLSEQELVDCDKSYNQGCNGGLMDYAFQFI 220
Query: 212 IQNQGI----------FNGVCGTQLDHA--VTIVGF------------------------ 235
++N G+ F G C + L ++ V+I G+
Sbjct: 221 MKNGGLNTEKDYPYRGFGGKCNSFLKNSRVVSIDGYEDVPTKDETALKKAISYQPVRVAI 280
Query: 236 -------------------GTTED------------GANYWLIKNSWGNTWGDAGYMKIV 264
GT D G +YW+++NSWG WG+ GY+++
Sbjct: 281 EAGGRIFQHYQSGIFTGSCGTNLDHAVVAVGYGSENGVDYWIVRNSWGPRWGEEGYIRME 340
Query: 265 RD-----EGLCGIGTRSSYPL 280
R+ G CGI +SYP+
Sbjct: 341 RNLAASKSGKCGIAVEASYPV 361
>gi|194352752|emb|CAQ00104.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
Length = 351
Score = 179 bits (455), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 88/181 (48%), Positives = 132/181 (72%), Gaps = 4/181 (2%)
Query: 37 STHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQF 96
S+H++ +VE+ EKW+A+H ++Y EK R ++FK+NL+ I++ N+E +Y LG N+F
Sbjct: 35 SSHDR-LVELFEKWLAKHQKAYASFEEKLHRFEVFKDNLKLIDEINRE-VTSYWLGLNEF 92
Query: 97 SDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKE 156
+DLT+DEF+ Y G + P R ++S +F+Y+N++ D+P ++DWR KGAVT +KNQ +
Sbjct: 93 ADLTHDEFKTTYLG--LSPPPARRSSSRSFRYENVAAHDLPKAVDWRKKGAVTDVKNQGQ 150
Query: 157 CGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQG 216
CG CWAF+ VAAVEGI I +GNL LSEQ+L+DCS +GN+GC GG + AF+YI + G
Sbjct: 151 CGSCWAFSTVAAVEGINAIVTGNLTALSEQELIDCSVDGNSGCNGGMMDYAFSYIASSGG 210
Query: 217 I 217
+
Sbjct: 211 L 211
Score = 76.6 bits (187), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 34/69 (49%), Positives = 47/69 (68%), Gaps = 5/69 (7%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTE-DGANYWLIKNSWGNTWGDAGYMKIVR----DEGLC 270
G+F+G CG QLDH V VG+G+ + G +Y ++KNSWG WG+ GY+++ R EGLC
Sbjct: 279 GVFDGPCGAQLDHGVAAVGYGSDKGKGHDYIIVKNSWGGKWGEKGYIRMKRGTGKSEGLC 338
Query: 271 GIGTRSSYP 279
GI +SYP
Sbjct: 339 GINKMASYP 347
>gi|28192373|gb|AAK07730.1| CPR1-like cysteine proteinase [Nicotiana tabacum]
Length = 374
Score = 179 bits (454), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 90/183 (49%), Positives = 121/183 (66%), Gaps = 5/183 (2%)
Query: 38 THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
+ E V +E W+A+HGR+Y EKE R +IFK+NL +IE+ N GNRTYK+G NQF+
Sbjct: 41 SDEDQVKNRYEMWLAEHGRAYNALGEKEKRFEIFKDNLRFIEEHNNSGNRTYKVGLNQFA 100
Query: 98 DLTNDEFRALYTGYKMPSPSHRSTTSS---TFKYQNLSMTDVPTSLDWRDKGAVTPIKNQ 154
DLTN+E+R +Y G K S + R S + +Y + +P S+DWR +GAV PIKNQ
Sbjct: 101 DLTNEEYRTMYLGTK--SDARRRFVKSKNPSQRYASRPNELMPHSVDWRKRGAVAPIKNQ 158
Query: 155 KECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQN 214
CG CWAF+ VAAV GI +I +G +I LSEQ+L+DC N+GC GG + AF +II N
Sbjct: 159 GSCGSCWAFSTVAAVGGINQIVTGEMITLSEQELVDCDRVQNSGCNGGLMDYAFEFIISN 218
Query: 215 QGI 217
G+
Sbjct: 219 GGM 221
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 35/76 (46%), Positives = 53/76 (69%), Gaps = 6/76 (7%)
Query: 209 AYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE- 267
A+ + + G+F G CG ++DH V +VG+G+ EDG +YW+++NSWG WG+ GY+K+ R+
Sbjct: 280 AFQLYSSGVFTGECGEEVDHGVVVVGYGS-EDGVDYWIVRNSWGTKWGENGYVKMERNVK 338
Query: 268 ----GLCGIGTRSSYP 279
G CGI T +SYP
Sbjct: 339 KSHLGKCGIMTEASYP 354
>gi|297802228|ref|XP_002868998.1| cysteine proteinase [Arabidopsis lyrata subsp. lyrata]
gi|297314834|gb|EFH45257.1| cysteine proteinase [Arabidopsis lyrata subsp. lyrata]
Length = 375
Score = 179 bits (454), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 111/321 (34%), Positives = 161/321 (50%), Gaps = 81/321 (25%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDEL----EKEMRLKIFKENLEYIEKAN-KEGNRTYKLGTN 94
++ V I+ +W A HG++ + +++ R IFK+NL +I+ N K N TYKLG
Sbjct: 42 DEEVRSIYLQWSADHGKTNNNNNGIINDQDKRFNIFKDNLRFIDLHNEKNKNATYKLGLT 101
Query: 95 QFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD---VPTSLDWRDKGAVTPI 151
+F+DLTN+E+R+LY G + P R + + + D VP ++DWR KGAV PI
Sbjct: 102 KFTDLTNEEYRSLYLGART-EPVRRIAKAKNVNQKYSAAVDGKEVPETVDWRLKGAVNPI 160
Query: 152 KNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYI 211
K+Q CG CWAF+ AAVEGI KI +G LI LSEQ+L+DC + N GC GG + AF +I
Sbjct: 161 KDQGTCGSCWAFSTAAAVEGINKIVTGELISLSEQELVDCDNSYNQGCNGGLMDYAFQFI 220
Query: 212 IQNQGI----------FNGVCGTQLDHA--VTIVGF------------------------ 235
++N G+ F G C + L +A V+I G+
Sbjct: 221 MKNGGLKTEKDYPYRGFGGKCNSFLKNAKVVSIDGYEDVPTKDETALKRAISLQPVSVAI 280
Query: 236 -------------------GTTED------------GANYWLIKNSWGNTWGDAGYMKIV 264
GT D G +YW+++NSWG WG+ GY+++
Sbjct: 281 EAGGRIFQHYQTGIFTGNCGTNLDHAVVAVGYGSENGVDYWIVRNSWGPRWGEEGYIRME 340
Query: 265 RD-----EGLCGIGTRSSYPL 280
R+ G CGI +SYP+
Sbjct: 341 RNLASSKSGKCGIAVEASYPV 361
>gi|224131910|ref|XP_002328138.1| predicted protein [Populus trichocarpa]
gi|222837653|gb|EEE76018.1| predicted protein [Populus trichocarpa]
Length = 349
Score = 179 bits (454), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 86/175 (49%), Positives = 123/175 (70%), Gaps = 5/175 (2%)
Query: 43 VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTND 102
+VE+ E W++ HG++Y EK R ++FKENL++I++ NKE +Y LG N+F+DL+++
Sbjct: 43 LVELFESWISGHGKAYNSLEEKLHRFEVFKENLKHIDQRNKEVT-SYWLGLNEFADLSHE 101
Query: 103 EFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWA 162
EF++ + G P R +S F Y+++ D+P S+DWR KGAVTP+KNQ CG CWA
Sbjct: 102 EFKSKFLGLYPEFP--RKKSSEDFSYRDV--VDLPKSIDWRKKGAVTPVKNQGSCGSCWA 157
Query: 163 FAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
F+ VAAVEGI +I +GNL LSEQQL+DC T+ NNGC GG + AF +I+ N G+
Sbjct: 158 FSTVAAVEGINQIVAGNLTSLSEQQLIDCDTSFNNGCNGGLMDYAFEFIVNNGGL 212
Score = 77.4 bits (189), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 34/68 (50%), Positives = 48/68 (70%), Gaps = 5/68 (7%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCG 271
G+F+G CGT LDH V VG+G++ G +Y ++KNSWG WG+ GY+++ R+ EGLCG
Sbjct: 279 GVFSGPCGTDLDHGVAAVGYGSSS-GIDYIIVKNSWGPKWGERGYLRMKRNTGKPEGLCG 337
Query: 272 IGTRSSYP 279
I +SYP
Sbjct: 338 INKMASYP 345
>gi|115438530|ref|NP_001043562.1| Os01g0613500 [Oryza sativa Japonica Group]
gi|11034572|dbj|BAB17096.1| cysteine proteinase-like [Oryza sativa Japonica Group]
gi|113533093|dbj|BAF05476.1| Os01g0613500 [Oryza sativa Japonica Group]
gi|215697766|dbj|BAG91959.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 360
Score = 179 bits (453), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 90/182 (49%), Positives = 116/182 (63%), Gaps = 6/182 (3%)
Query: 41 QSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEG-NRTYKLGTNQFSDL 99
S+ HE+WMA+ GR+Y D EK R+++F N E ++ AN+ G +RTY LG NQFSDL
Sbjct: 37 HSMAARHERWMARFGRAYADAAEKARRMEVFAANAERVDAANRAGGDRTYTLGLNQFSDL 96
Query: 100 TNDEFRALYTGYKM--PSPSHRS--TTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQK 155
T+DEF + GY P PSHR + TDVP S+DWR +GAVT +KNQ+
Sbjct: 97 TDDEFAQTHLGYSWAPPPPSHRHGHRAENGTAAAAADDTDVPDSVDWRARGAVTEVKNQR 156
Query: 156 ECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQ 215
CG CWAFAAVAA EG+ ++ +GNL+ LSEQQ+LDC T G N C GG A YI +
Sbjct: 157 SCGSCWAFAAVAATEGLVQLATGNLVSLSEQQVLDC-TGGANTCSGGDVSAALRYIAASG 215
Query: 216 GI 217
G+
Sbjct: 216 GL 217
Score = 60.5 bits (145), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 34/70 (48%), Positives = 45/70 (64%), Gaps = 6/70 (8%)
Query: 216 GIFNG--VCGTQLDHAVTIVGFGTTEDGA-NYWLIKNSWGNTWGDAGYMKIVRD---EGL 269
G++ G CG +L+HAVT+VG+G DG YWL+KN WG WG+ GYM++ R G
Sbjct: 287 GVYAGSAACGRRLNHAVTVVGYGAAADGGGEYWLVKNQWGTWWGEGGYMRVARGGAAGGN 346
Query: 270 CGIGTRSSYP 279
CGI T + YP
Sbjct: 347 CGIATYAFYP 356
>gi|326503122|dbj|BAJ99186.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326512552|dbj|BAJ99631.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 389
Score = 179 bits (453), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 122/355 (34%), Positives = 164/355 (46%), Gaps = 100/355 (28%)
Query: 24 LLVSCASQVVSSRSTHEQSVVEIHEK--------WMAQHGRSYKDELEKEMRLKIFKENL 75
+L C+S+ +++ S H ++ H WM RSY EK R K+++ N+
Sbjct: 29 MLAGCSSESLTTSSEHSDIGIDKHHDLMMARFHVWMTVQNRSYPTSSEKAHRFKVYRSNM 88
Query: 76 EYIEKANKEGNR---TYKLGTNQFSDLTNDEFRALYTGYKMPSPSHR----------STT 122
YIE N E TY+LG F+DLT++EF +LYTG K+P HR +T
Sbjct: 89 RYIEALNAEATTSGFTYELGEGPFTDLTDEEFISLYTG-KIPDDDHREDGVHDEQIITTH 147
Query: 123 SSTFK-------YQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKI 175
+ + Y N S P +DWR +GAVTP+K+Q +CG CWAF VA +EGI KI
Sbjct: 148 AGSVNGAEGVTVYANFS-AGAPIRMDWRKRGAVTPVKDQGKCGSCWAFPTVATIEGIHKI 206
Query: 176 RSGNLIQLSEQQL-----LDCSTNG-----------NNG----------------CLGGS 203
+ G L+ LSEQQL LD NG NG C G
Sbjct: 207 KRGRLVSLSEQQLVDCDFLDGGCNGGWPRNAFQWIIQNGGITTTSSYTYKAAEGQCKGNR 266
Query: 204 R---------------EKAFAYIIQNQ------------------GIFNGVCGT-QLDHA 229
+ E + I+ NQ GI+NG C T +L+H
Sbjct: 267 KPAAKITGYRKVKSNSEVSMVNIVANQPIAASIVVHGGQFQHYKGGIYNGPCATSKLNHV 326
Query: 230 VTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE----GLCGIGTRSSYPL 280
+TIVG+G GA YW++KNSWG WG+ GYM + R G CGI R +PL
Sbjct: 327 ITIVGYGQQAYGAKYWIVKNSWGAAWGNKGYMLMKRGTKNPLGQCGIAVRPIFPL 381
>gi|125526835|gb|EAY74949.1| hypothetical protein OsI_02845 [Oryza sativa Indica Group]
Length = 360
Score = 179 bits (453), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 90/182 (49%), Positives = 116/182 (63%), Gaps = 6/182 (3%)
Query: 41 QSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEG-NRTYKLGTNQFSDL 99
S+ HE+WMA+ GR+Y D EK R+++F N E ++ AN+ G +RTY LG NQFSDL
Sbjct: 37 HSMAARHERWMARFGRAYADAAEKARRMEVFAANAERVDAANRAGGDRTYTLGLNQFSDL 96
Query: 100 TNDEFRALYTGYKM--PSPSHRS--TTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQK 155
T+DEF + GY P PSHR + TDVP S+DWR +GAVT +KNQ+
Sbjct: 97 TDDEFARTHLGYSWAPPPPSHRHGHRAENGTAAAAADDTDVPDSVDWRARGAVTEVKNQR 156
Query: 156 ECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQ 215
CG CWAFAAVAA EG+ ++ +GNL+ LSEQQ+LDC T G N C GG A YI +
Sbjct: 157 SCGSCWAFAAVAATEGLVQLATGNLVSLSEQQVLDC-TGGANTCSGGDVSAALRYIAASG 215
Query: 216 GI 217
G+
Sbjct: 216 GL 217
Score = 60.5 bits (145), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 34/70 (48%), Positives = 45/70 (64%), Gaps = 6/70 (8%)
Query: 216 GIFNG--VCGTQLDHAVTIVGFGTTEDGA-NYWLIKNSWGNTWGDAGYMKIVRD---EGL 269
G++ G CG +L+HAVT+VG+G DG YWL+KN WG WG+ GYM++ R G
Sbjct: 287 GVYAGSAACGRRLNHAVTVVGYGAAADGGGEYWLVKNQWGTWWGEGGYMRVARGGAAGGN 346
Query: 270 CGIGTRSSYP 279
CGI T + YP
Sbjct: 347 CGIATYAFYP 356
>gi|358347416|ref|XP_003637753.1| Cysteine proteinase [Medicago truncatula]
gi|355503688|gb|AES84891.1| Cysteine proteinase [Medicago truncatula]
Length = 323
Score = 178 bits (452), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 116/302 (38%), Positives = 154/302 (50%), Gaps = 62/302 (20%)
Query: 35 SRSTHEQSVV-EIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGT 93
SR+ E S+ + HE+WM GR+Y D++EKE R KIF +NLEYIE N+ GN TY+LG
Sbjct: 21 SRTLLESSIAAKTHEQWMKDFGRTYADDVEKEKRFKIFAKNLEYIENFNRAGNETYELGL 80
Query: 94 NQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD-----------VPTSLDW 142
NQF DLT EF + YT + S +S N+S +P S+DW
Sbjct: 81 NQFLDLTKKEFTSKYTCANLKGKLESSMVASVAALFNVSKISTNNSLKGKRKPIPESIDW 140
Query: 143 RDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS---------- 192
R+ GAVT +K Q C CWAFA +AAVEGI +I++ L+ LS ++ +
Sbjct: 141 REGGAVTSVKRQGACASCWAFATLAAVEGIVQIKNRELVSLSASGIVKFAYDYIKKNEIA 200
Query: 193 -------TNGNNGCLG-GSREKAFAYIIQNQ-----------------GIF-NGVCGT-- 224
T CL S E+ ++ Q GIF +G CG
Sbjct: 201 SEADYPYTEKEGKCLSIRSGEENLLEVVAQQPVTVLIATNENFVNYKGGIFGSGPCGPIE 260
Query: 225 --QLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIG-TRSS 277
QL HAVT++GF T E YWLIKNS+G +WG+ GYMK+ R +CG+ T S
Sbjct: 261 SLQLTHAVTVIGF-TNE----YWLIKNSYGESWGEKGYMKLKRKGDSHHTVCGLSMTASI 315
Query: 278 YP 279
YP
Sbjct: 316 YP 317
>gi|357130490|ref|XP_003566881.1| PREDICTED: actinidain-like [Brachypodium distachyon]
Length = 350
Score = 178 bits (451), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 86/197 (43%), Positives = 120/197 (60%), Gaps = 2/197 (1%)
Query: 21 IITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEK 80
++ L+ + V+++ +V HE+WMA+ GR Y D EK R +F N Y++
Sbjct: 14 LLVLVATAVFHAVAAQGEAGLTVAARHEQWMAKFGRVYTDANEKARRQAVFGANARYVDA 73
Query: 81 ANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSL 140
N+ GNRTY LG N+FSDLT++EF + GY+ P + + L+ ++P S
Sbjct: 74 VNRAGNRTYTLGLNEFSDLTDNEFAKTHLGYREFRPETANISKGVDPGYGLA-GNIPKSF 132
Query: 141 DWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCL 200
DWR KGAVT +K+Q CGCCWAFAAVAA EG+ KI G LI +SEQQ+LDC+T GNN C
Sbjct: 133 DWRTKGAVTEVKSQGGCGCCWAFAAVAATEGLVKIAKGTLISMSEQQVLDCTT-GNNTCK 191
Query: 201 GGSREKAFAYIIQNQGI 217
GG A +Y+ + G+
Sbjct: 192 GGYMNDALSYVFASGGL 208
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 29/66 (43%), Positives = 40/66 (60%), Gaps = 3/66 (4%)
Query: 216 GIFNGV--CGTQLDHAVTIVGFGTTEDGAN-YWLIKNSWGNTWGDAGYMKIVRDEGLCGI 272
G+F G CG LDH T+VG+G + G YWL+KN WG +WG++GYM+I R
Sbjct: 276 GVFTGSPSCGQNLDHFFTVVGYGFADGGKQMYWLVKNQWGTSWGESGYMRIARGSSARNC 335
Query: 273 GTRSSY 278
G ++Y
Sbjct: 336 GMTNNY 341
>gi|50355617|dbj|BAD29957.1| cysteine protease [Daucus carota]
Length = 437
Score = 178 bits (451), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 84/192 (43%), Positives = 125/192 (65%), Gaps = 5/192 (2%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
+ V+ ++ W+ +HG+SY EKE R +IFK+NL YI+ N + +R+Y+LG N+F+DL
Sbjct: 42 DDEVMTMYNSWLVKHGKSYNALGEKETRFQIFKDNLRYIDNHNADPDRSYELGLNRFADL 101
Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGC 159
TN+E+RA Y G K + + + +Y + ++P S+DWR+KGAV +K+Q CG
Sbjct: 102 TNEEYRAKYLGTKSRESRPKLSKGPSDRYAPVEGEELPDSIDWREKGAVAAVKDQGSCGS 161
Query: 160 CWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIFN 219
CWAF+A+ AVEGI +I +G LI LSEQ+L+DC + N GC GG + AF +II+N GI
Sbjct: 162 CWAFSAIGAVEGINQITTGELITLSEQELVDCDRSYNEGCEGGLMDYAFNFIIKNGGI-- 219
Query: 220 GVCGTQLDHAVT 231
+ LD+ T
Sbjct: 220 ---DSDLDYPYT 228
Score = 84.0 bits (206), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 36/69 (52%), Positives = 50/69 (72%), Gaps = 5/69 (7%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCG 271
GIF G CGT +DH V +VG+G+ E+G +YW+++NSWG WG+AGY+K+ R+ GLCG
Sbjct: 286 GIFTGKCGTAVDHGVVVVGYGS-EEGMDYWIVRNSWGAAWGEAGYLKMQRNVGKSSGLCG 344
Query: 272 IGTRSSYPL 280
I SYP+
Sbjct: 345 ITIEPSYPV 353
>gi|356543118|ref|XP_003540010.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 339
Score = 178 bits (451), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 94/203 (46%), Positives = 133/203 (65%), Gaps = 12/203 (5%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQS--VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENL 75
+ ++ LL C SQV+S R+ HE S + E HE+W ++G+ YKD EK+ RL IFK+N+
Sbjct: 10 ILALVLLLPICISQVMS-RNLHEASXCMSERHEQWTKKYGKVYKDAAEKQKRLLIFKDNV 68
Query: 76 EYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSST-FKYQNLSMT 134
E+IE N GN+ YKL N +D TN+EF A + GYK H+ + S T FKY+N+ T
Sbjct: 69 EFIESFNAAGNKPYKLSINHLTDQTNEEFVASHNGYK-----HKGSHSQTPFKYENI--T 121
Query: 135 DVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN 194
VP ++DWR+ GAV +K+Q +CG CWAF+ VA EGI +I + L+ LSEQ+L+DC +
Sbjct: 122 GVPNAVDWRENGAVXAMKDQGQCGNCWAFSTVATTEGIYQITTSMLMSLSEQELVDCDSV 181
Query: 195 GNNGCLGGSREKAFAYIIQNQGI 217
++GC GG E F +I +N GI
Sbjct: 182 -DHGCDGGYMEGGFEFIXKNGGI 203
Score = 97.4 bits (241), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 44/86 (51%), Positives = 58/86 (67%), Gaps = 9/86 (10%)
Query: 200 LGGSREKAFAYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAG 259
+GGS A+ + G+F G CGTQLDH VT VG+G+T+DG YW++KNSWG WG+ G
Sbjct: 259 VGGS-----AFQFNSSGVFTGQCGTQLDHGVTAVGYGSTDDGTQYWIVKNSWGTQWGEEG 313
Query: 260 YMKIVR----DEGLCGIGTRSSYPLA 281
Y+++ R EGLCGI +SYP A
Sbjct: 314 YIRMQRGTDAQEGLCGIAMDASYPTA 339
>gi|449500145|ref|XP_004161017.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
Length = 349
Score = 178 bits (451), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 89/201 (44%), Positives = 133/201 (66%), Gaps = 8/201 (3%)
Query: 20 IIITLLVSCASQVVSSRSTHEQSV---VEIHEKWMAQHGRSYKDELEKEMRLKIFKENLE 76
+ IT ++ +V H S+ +E+ E WM++H ++Y+ EK R +IF +NL+
Sbjct: 17 LFITYAIAHDFSIVGYSPEHLASMDKTIELFESWMSKHSKTYRSIEEKLHRFEIFLDNLK 76
Query: 77 YIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDV 136
+I++ NK+ + +Y LG N+F+DL+++EF++ Y G ++ P RS S F Y ++ D+
Sbjct: 77 HIDETNKKVS-SYWLGLNEFADLSHEEFKSKYLGLRVEFPRKRS--SRGFSYGDVE--DL 131
Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGN 196
P S+DWR KGAVTP+KNQ CG CWAF+ VAAVEGI +I +GNL LSEQ+L+DC + N
Sbjct: 132 PESVDWRTKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVTGNLTSLSEQELIDCDRSFN 191
Query: 197 NGCLGGSREKAFAYIIQNQGI 217
NGC GG + AF YI+ N G+
Sbjct: 192 NGCYGGLMDYAFQYIMSNSGL 212
>gi|357115272|ref|XP_003559414.1| PREDICTED: thiol protease SEN102-like [Brachypodium distachyon]
Length = 360
Score = 178 bits (451), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 86/180 (47%), Positives = 119/180 (66%), Gaps = 10/180 (5%)
Query: 47 HEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNR-------TYKLGTNQFSDL 99
HE WMA+HGR+Y D EK RL+IF+ N E I+ N + + +++L TN+F+DL
Sbjct: 43 HESWMAEHGRTYADAEEKARRLEIFRANAERIDSFNSKADAAAGESVDSHRLATNRFADL 102
Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM-TDVPTSLDWRDKGAVTPIKNQKECG 158
T++EFRA TG + P+ F+Y+N S+ D S+DWR GAVT +K+Q CG
Sbjct: 103 TDEEFRAARTGLRRPAAVA-GAVGGGFRYENFSLQADAAGSMDWRAMGAVTGVKDQGSCG 161
Query: 159 CCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNN-GCLGGSREKAFAYIIQNQGI 217
CCWAF+AVAA+EG+TKIR+G L+ LSEQQL+DC G++ GC GG + AF YI + G+
Sbjct: 162 CCWAFSAVAAMEGLTKIRTGRLVSLSEQQLVDCDVYGDDQGCEGGLMDNAFQYISRQGGL 221
Score = 77.8 bits (190), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 32/60 (53%), Positives = 45/60 (75%), Gaps = 3/60 (5%)
Query: 224 TQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKI---VRDEGLCGIGTRSSYPL 280
T+LDHA+T VG+G DG YWL+KNSWG+ WG++GY++I R EG+CG+ +SYP+
Sbjct: 301 TELDHAITAVGYGMAGDGTGYWLMKNSWGSGWGESGYVRIRRGSRGEGVCGLAKLASYPV 360
>gi|116781957|gb|ABK22314.1| unknown [Picea sitchensis]
Length = 369
Score = 178 bits (451), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 91/179 (50%), Positives = 119/179 (66%), Gaps = 5/179 (2%)
Query: 40 EQSVVEIHEKWMAQHGRSYK-DELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSD 98
E+S+ +++ W QH S D E R +IFKEN++YI+ NK+ + YKLG N+F+D
Sbjct: 39 EKSLRSLYDNWALQHRSSRSLDSEEHAERFEIFKENVKYIDSVNKK-DSPYKLGLNKFAD 97
Query: 99 LTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECG 158
L+N+EF+A+Y G KM R S +F YQN +P S+DWR KGAV +KNQ CG
Sbjct: 98 LSNEEFKAIYMGTKMDLRGDREVQSGSFMYQN--SEPLPASIDWRQKGAVAAVKNQGHCG 155
Query: 159 CCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
CWAF+ VA+VEGI I +GNL+ LSEQQL+DCST N+GC GG + AF YII N GI
Sbjct: 156 SCWAFSTVASVEGINYITTGNLVSLSEQQLVDCSTE-NSGCNGGLMDTAFQYIINNGGI 213
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 35/68 (51%), Positives = 47/68 (69%), Gaps = 4/68 (5%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMK----IVRDEGLCG 271
G+F G CGT LDH V VG+GT+ +G NYW+++NSWG WG+ GY++ I EG CG
Sbjct: 282 GVFTGKCGTALDHGVVAVGYGTSPEGINYWIVRNSWGPKWGEEGYIRMQQGIEAAEGKCG 341
Query: 272 IGTRSSYP 279
I ++SYP
Sbjct: 342 IAMQASYP 349
>gi|374713649|gb|AEZ65082.1| cysteine protease [Carica papaya]
Length = 471
Score = 178 bits (451), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 88/208 (42%), Positives = 134/208 (64%), Gaps = 4/208 (1%)
Query: 11 FKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKI 70
F ++ M II L + + S E +++++E W+ +HG++Y EKE R +I
Sbjct: 19 FSLSLASMSIIDYDL---PADPLQSTERTEAHMMKMYEHWLVKHGKNYNAIGEKERRFEI 75
Query: 71 FKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSH-RSTTSSTFKYQ 129
FK+NL ++++ N RTYKLG +F+DLTN+E+RA+Y G KM R+ S + ++
Sbjct: 76 FKDNLRFVDEQNSVPGRTYKLGLTKFADLTNEEYRAMYLGAKMEKKEKLRTERSQRYLHK 135
Query: 130 NLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLL 189
+ D+P+ +DWR+KGAVT +K+Q +CG CWAF+ V +VEGI +I +G+LI LSEQ+L+
Sbjct: 136 AGNDDDLPSHVDWREKGAVTEVKDQGQCGSCWAFSTVGSVEGINQIVTGDLISLSEQELV 195
Query: 190 DCSTNGNNGCLGGSREKAFAYIIQNQGI 217
DC N GC GG + AF +II+N GI
Sbjct: 196 DCDKAYNQGCNGGLMDYAFEFIIKNGGI 223
Score = 82.0 bits (201), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 38/84 (45%), Positives = 53/84 (63%), Gaps = 10/84 (11%)
Query: 201 GGSREKAFAYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGY 260
G RE + + G+F G CGT LDH V VG+GT E+G +YW+++NSWG WG++GY
Sbjct: 279 AGGRE----FQLYQSGVFTGRCGTNLDHGVVAVGYGT-ENGIDYWIVRNSWGPKWGESGY 333
Query: 261 MKIVR-----DEGLCGIGTRSSYP 279
+++ R D G CGI +SYP
Sbjct: 334 IRMERNVASTDTGKCGIAMEASYP 357
>gi|350538043|ref|NP_001234324.1| cysteine protease TDI-65 precursor [Solanum lycopersicum]
gi|5726641|gb|AAD48496.1|AF172856_1 cysteine protease TDI-65 [Solanum lycopersicum]
gi|2828252|emb|CAA05894.1| CYP1 [Solanum lycopersicum]
Length = 466
Score = 178 bits (451), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 95/249 (38%), Positives = 142/249 (57%), Gaps = 16/249 (6%)
Query: 7 RSGSFKINTTPMFIIITLLVSCASQVVSSRSTH-----EQSVVEIHEKWMAQHGRSYKDE 61
S + I+ M I TL + ++S TH + V ++E W+ +HG+SY
Sbjct: 4 HSSTLTISILLMLIFSTLSSASDMSIISYDETHIHRRTDDEVSALYESWLIEHGKSYNAL 63
Query: 62 LEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRST 121
EK+ R +IFK+NL YI++ N N++YKLG +F+DLTN+E+R++Y G K + +
Sbjct: 64 GEKDKRFQIFKDNLRYIDEQNSVPNQSYKLGLTKFADLTNEEYRSIYLGTKSSGDRKKLS 123
Query: 122 TSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLI 181
+ + +Y +P S+DWR+KG + +K+Q CG CWAF+AVAA+E I I +GNLI
Sbjct: 124 KNKSDRYLPKVGDSLPESIDWREKGVLVGVKDQGSCGSCWAFSAVAAMESINAIVTGNLI 183
Query: 182 QLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI----------FNGVCGTQLDHAVT 231
LSEQ+L+DC + N GC GG + AF ++I+N GI NGVC Q
Sbjct: 184 SLSEQELVDCDRSYNEGCDGGLMDYAFEFVIKNGGIDTEEDYPYKERNGVC-DQYRKNAK 242
Query: 232 IVGFGTTED 240
+V + ED
Sbjct: 243 VVKIDSYED 251
Score = 81.6 bits (200), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 34/69 (49%), Positives = 48/69 (69%), Gaps = 5/69 (7%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCG 271
GIF G CGT +DH V I G+GT E+G +YW+++NSWG WG+ GY+++ R+ GLCG
Sbjct: 286 GIFTGKCGTAVDHGVVIAGYGT-ENGMDYWIVRNSWGANWGENGYLRVQRNVASSSGLCG 344
Query: 272 IGTRSSYPL 280
+ SYP+
Sbjct: 345 LAIEPSYPV 353
>gi|225428879|ref|XP_002285299.1| PREDICTED: cysteine proteinase RD21a-like [Vitis vinifera]
Length = 469
Score = 177 bits (450), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 83/178 (46%), Positives = 122/178 (68%), Gaps = 1/178 (0%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
+ V+ ++E W+ +HG+SY E+E R +IFK+NL +IE+ N NRTYK+G N+F+DL
Sbjct: 47 DAEVMAVYEAWLVKHGKSYNALGERERRFEIFKDNLRFIEEHNAV-NRTYKVGLNRFADL 105
Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGC 159
TN+E+R+ Y G + + + + +Y + D+P S+DWR+KGAV P+K+Q CG
Sbjct: 106 TNEEYRSRYLGRRDETRRGLRASRVSDRYSFRAGEDLPESVDWREKGAVVPVKDQGNCGS 165
Query: 160 CWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
CWAF+ +AAVEGI +I +G+LI LSEQ+L+DC + N GC GG + AF +II N GI
Sbjct: 166 CWAFSTIAAVEGINQIATGDLISLSEQELVDCDKSYNQGCNGGLMDYAFEFIINNGGI 223
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 36/77 (46%), Positives = 51/77 (66%), Gaps = 6/77 (7%)
Query: 209 AYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR--- 265
A+ + G+F G CGTQLDH V VG+GT E+ +YW+++NSWG WG++GY+K+ R
Sbjct: 283 AFQLYQSGVFTGQCGTQLDHGVVAVGYGT-ENSVDYWIVRNSWGPNWGESGYIKLERNLA 341
Query: 266 --DEGLCGIGTRSSYPL 280
+ G CGI SYP+
Sbjct: 342 GTETGKCGIAIEPSYPI 358
>gi|224136808|ref|XP_002326950.1| predicted protein [Populus trichocarpa]
gi|222835265|gb|EEE73700.1| predicted protein [Populus trichocarpa]
Length = 456
Score = 177 bits (449), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 89/208 (42%), Positives = 131/208 (62%), Gaps = 11/208 (5%)
Query: 18 MFIIITLLVSCASQVVSSRSTH--------EQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+F++ L + ++S TH + V+ ++E+W+ +HG++Y EKE R +
Sbjct: 5 LFLVFALSSAFDMSIISYHQTHATKSSWRTDDEVMAMYEEWLVKHGKNYNALGEKEKRFE 64
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQ 129
IFK+NL +I++ N E NRTY +G N+F+DLTN+EFR++Y G + TS +Y
Sbjct: 65 IFKDNLMFIDQHNSE-NRTYTVGLNRFADLTNEEFRSMYLGTRTGHKKRLPKTSD--RYA 121
Query: 130 NLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLL 189
+P S+DWR +GAV +K+Q CG CWAF+ +AAVEGI KI +G+LI LSEQ+L+
Sbjct: 122 PRVGDSLPDSVDWRKEGAVAEVKDQGGCGSCWAFSTIAAVEGINKIVTGDLIALSEQELV 181
Query: 190 DCSTNGNNGCLGGSREKAFAYIIQNQGI 217
DC T+ N GC GG + AF +II N GI
Sbjct: 182 DCDTSYNEGCNGGLMDYAFEFIINNGGI 209
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 38/84 (45%), Positives = 54/84 (64%), Gaps = 9/84 (10%)
Query: 201 GGSREKAFAYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGY 260
GG R + + N G+F G CGT LDH V VG+GT E G +YW+++NSWG +WG++GY
Sbjct: 265 GGGRN----FQLYNSGVFTGECGTSLDHGVAAVGYGT-EKGKDYWIVRNSWGKSWGESGY 319
Query: 261 MKIVRD----EGLCGIGTRSSYPL 280
+++ R+ G CGI SYP+
Sbjct: 320 IRMERNIASPTGKCGIAIEPSYPI 343
>gi|146215996|gb|ABQ10200.1| cysteine protease Cp2 [Actinidia deliciosa]
Length = 376
Score = 177 bits (448), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 87/198 (43%), Positives = 132/198 (66%), Gaps = 9/198 (4%)
Query: 35 SRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTN 94
S S ++ V+ I+ +W+A+HG++Y E+E R +IFK+NL+++++ N E NR+YK+G N
Sbjct: 35 SSSRTDEEVMGIYAEWLAKHGKAYNGIGERERRFEIFKDNLKFVDEHNSE-NRSYKVGLN 93
Query: 95 QFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD-VPTSLDWRDKGAVTPIKN 153
+F+DLTN+E+R+++ G K S + S + + +D +P S+DWR+ GAV PIK+
Sbjct: 94 RFADLTNEEYRSMFLGTKTDSKRRFMKSKSASRRYAVQDSDMLPESVDWRESGAVAPIKD 153
Query: 154 QKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQ 213
Q CG CWAF+ VAAVEG+ +I +G +IQLSEQ+L+DC + GC GG + AF +II
Sbjct: 154 QGSCGSCWAFSTVAAVEGVNQIATGEMIQLSEQELVDCDRTYDAGCNGGLMDYAFEFIIN 213
Query: 214 NQGI-------FNGVCGT 224
N GI + GV GT
Sbjct: 214 NGGIDTEEDYPYRGVDGT 231
Score = 79.0 bits (193), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 33/77 (42%), Positives = 54/77 (70%), Gaps = 6/77 (7%)
Query: 209 AYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-- 266
A+ + G+F G CG LDH V +VG+GT ++GA++W+++NSWG +WG+ GY+++ R+
Sbjct: 277 AFQLYLSGVFTGECGRALDHGVVVVGYGT-DNGADHWIVRNSWGTSWGENGYIRMERNVV 335
Query: 267 ---EGLCGIGTRSSYPL 280
G CGI ++SYP+
Sbjct: 336 DNFGGKCGIAMQASYPI 352
>gi|18402225|ref|NP_566633.1| Granulin repeat cysteine protease family protein [Arabidopsis
thaliana]
gi|11994461|dbj|BAB02463.1| cysteine proteinase [Arabidopsis thaliana]
gi|17065298|gb|AAL32803.1| cysteine proteinase [Arabidopsis thaliana]
gi|20260004|gb|AAM13349.1| cysteine proteinase [Arabidopsis thaliana]
gi|332642713|gb|AEE76234.1| Granulin repeat cysteine protease family protein [Arabidopsis
thaliana]
Length = 452
Score = 177 bits (448), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 90/210 (42%), Positives = 133/210 (63%), Gaps = 5/210 (2%)
Query: 10 SFKINTTPMFIIITLLVSCASQVVSSRST--HEQSVVEIHEKWMAQHGRSYKDELEKEMR 67
S K T + I LL+S + V++ T +E ++E+W+ ++ ++Y EKE R
Sbjct: 4 SIKSITLALLIFSVLLISLSLGSVTATETTRNEAEARRMYERWLVENRKNYNGLGEKERR 63
Query: 68 LKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFK 127
+IFK+NL+++E+ + NRTY++G +F+DLTNDEFRA+Y KM K
Sbjct: 64 FEIFKDNLKFVEEHSSIPNRTYEVGLTRFADLTNDEFRAIYLRSKM---ERTRVPVKGEK 120
Query: 128 YQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQ 187
Y +P ++DWR KGAV P+K+Q CG CWAF+A+ AVEGI +I++G LI LSEQ+
Sbjct: 121 YLYKVGDSLPDAIDWRAKGAVNPVKDQGSCGSCWAFSAIGAVEGINQIKTGELISLSEQE 180
Query: 188 LLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
L+DC T+ N+GC GG + AF +II+N GI
Sbjct: 181 LVDCDTSYNDGCGGGLMDYAFKFIIENGGI 210
Score = 81.3 bits (199), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 34/75 (45%), Positives = 50/75 (66%), Gaps = 5/75 (6%)
Query: 209 AYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-- 266
A+ + G+F G CGT LDH V VG+G+ E G +YW+++NSWG+ WG++GY K+ R+
Sbjct: 271 AFQLYTSGVFTGTCGTSLDHGVVAVGYGS-EGGQDYWIVRNSWGSNWGESGYFKLERNIK 329
Query: 267 --EGLCGIGTRSSYP 279
G CG+ +SYP
Sbjct: 330 ESSGKCGVAMMASYP 344
>gi|449454309|ref|XP_004144898.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
gi|449471311|ref|XP_004153272.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
Length = 349
Score = 177 bits (448), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 89/201 (44%), Positives = 132/201 (65%), Gaps = 8/201 (3%)
Query: 20 IIITLLVSCASQVVSSRSTHEQSV---VEIHEKWMAQHGRSYKDELEKEMRLKIFKENLE 76
+ IT + +V H S+ +E+ E WM++H ++Y+ EK R +IF +NL+
Sbjct: 17 LFITYATAHDFSIVGYSPEHLASMDKTIELFESWMSKHSKAYRSIEEKLHRFEIFLDNLK 76
Query: 77 YIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDV 136
+I++ NK+ + +Y LG N+F+DL+++EF++ Y G ++ P RS S F Y ++ D+
Sbjct: 77 HIDETNKKVS-SYWLGLNEFADLSHEEFKSKYLGLRVEFPRKRS--SRGFSYGDVE--DL 131
Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGN 196
P S+DWR KGAVTP+KNQ CG CWAF+ VAAVEGI +I +GNL LSEQ+L+DC + N
Sbjct: 132 PESVDWRTKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVTGNLTSLSEQELIDCDRSFN 191
Query: 197 NGCLGGSREKAFAYIIQNQGI 217
NGC GG + AF YI+ N G+
Sbjct: 192 NGCYGGLMDYAFQYIMSNSGL 212
>gi|358248896|ref|NP_001239703.1| uncharacterized protein LOC100799247 precursor [Glycine max]
gi|255636729|gb|ACU18700.1| unknown [Glycine max]
Length = 341
Score = 177 bits (448), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 81/174 (46%), Positives = 117/174 (67%), Gaps = 3/174 (1%)
Query: 45 EIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEF 104
E HEKWMAQ+G+ YKD EKE R ++FK N+++IE N G++ + L NQF+DL ++EF
Sbjct: 33 ERHEKWMAQYGKVYKDAAEKEKRFQVFKNNVQFIESFNAAGDKPFNLSINQFADLHDEEF 92
Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQ-KECGCCWAF 163
+AL + + + T ++F+Y+N+ T +P+++DWR +GAVTPIK+Q CG CWAF
Sbjct: 93 KALLNNVQKKASRVETATETSFRYENV--TKIPSTMDWRKRGAVTPIKDQGYTCGSCWAF 150
Query: 164 AAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
A VA VE + +I +G L+ LSEQ+L+DC + GC GG E AF +I GI
Sbjct: 151 ATVATVESLHQITTGELVSLSEQELVDCVRGDSEGCRGGYVENAFEFIANKGGI 204
Score = 94.7 bits (234), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 42/80 (52%), Positives = 53/80 (66%), Gaps = 5/80 (6%)
Query: 207 AFAYIIQNQGIFNGV-CGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR 265
A A+ + GIF CGT LDHAV +VG+G DG YWL+KNSW WG+ GYM+I R
Sbjct: 262 AIAFKFYSSGIFEARNCGTHLDHAVAVVGYGKLRDGTKYWLVKNSWSTAWGEKGYMRIKR 321
Query: 266 D----EGLCGIGTRSSYPLA 281
D +GLCGI + +SYP+A
Sbjct: 322 DIRAKKGLCGIASNASYPIA 341
>gi|297602242|ref|NP_001052232.2| Os04g0203500 [Oryza sativa Japonica Group]
gi|255675217|dbj|BAF14146.2| Os04g0203500 [Oryza sativa Japonica Group]
Length = 336
Score = 176 bits (447), Expect = 9e-42, Method: Compositional matrix adjust.
Identities = 87/200 (43%), Positives = 128/200 (64%), Gaps = 5/200 (2%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
+F I+ L C++ + + + + ++ HE+WMAQ+GR YKD+ EK R ++FK N+ +
Sbjct: 8 LFAILGCLCLCSAVLAARELSDDAAMAARHERWMAQYGRMYKDDAEKARRFEVFKANVAF 67
Query: 78 IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
IE N GN + LG NQF+DLTNDEFR+ T + R T F+ +N+++ +P
Sbjct: 68 IESFN-AGNHKFWLGVNQFADLTNDEFRSTKTNKGFIPSTTRVPTG--FRNENVNIDALP 124
Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNN 197
++DWR KG VTPIK+Q +CGCCWAF+AVAA+EGI K+ +G LI S + L T +
Sbjct: 125 ATMDWRTKGVVTPIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLISHSLNKSL--LTVMSM 182
Query: 198 GCLGGSREKAFAYIIQNQGI 217
GC GG + AF +II+N G+
Sbjct: 183 GCEGGLMDDAFKFIIKNGGL 202
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 39/124 (31%), Positives = 65/124 (52%), Gaps = 17/124 (13%)
Query: 165 AVAAVEGITKIRSGN---LIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIFNGV 221
+VA+++G + + N L++ Q + + +G + + G+ G
Sbjct: 223 SVASIKGYEDVPANNEAALMKAVANQPVSVAVDGGD----------MTFQFYKGGVMTGS 272
Query: 222 CGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSS 277
CGT LDH + +G+G DG YWL+KNSWG TWG+ G++++ +D G+CG+ S
Sbjct: 273 CGTDLDHGIVAIGYGKASDGTKYWLLKNSWGMTWGENGFLRMEKDISDKRGMCGLAMEPS 332
Query: 278 YPLA 281
YP A
Sbjct: 333 YPTA 336
>gi|224103643|ref|XP_002313136.1| predicted protein [Populus trichocarpa]
gi|222849544|gb|EEE87091.1| predicted protein [Populus trichocarpa]
Length = 477
Score = 176 bits (446), Expect = 9e-42, Method: Compositional matrix adjust.
Identities = 80/178 (44%), Positives = 119/178 (66%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
E + ++E W+ ++G++Y EKE R +IFK+NL+++++ N GN +YKLG N+F+DL
Sbjct: 42 EAETLRLYEMWLVKYGKAYNALGEKERRFEIFKDNLKFVDQHNSVGNPSYKLGLNKFADL 101
Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGC 159
+N+E+RA Y G +M + +Y D+P S+DWR+KGAV P+K+Q +CG
Sbjct: 102 SNEEYRAAYLGTRMDGKRRLLGGPKSARYLFKDGDDLPESVDWREKGAVAPVKDQGQCGS 161
Query: 160 CWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
CWAF+ V AVEGI +I +GNL LSEQ+L+DC N GC GG + AF +I++N GI
Sbjct: 162 CWAFSTVGAVEGINQIVTGNLTSLSEQELVDCDKVYNQGCNGGLMDYAFEFIMKNGGI 219
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 36/76 (47%), Positives = 51/76 (67%), Gaps = 6/76 (7%)
Query: 209 AYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR--- 265
A+ + G+F G CGTQLDH V VG+GT E+G +YW+++NSWG WG+ GY+++ R
Sbjct: 279 AFQLYQSGVFTGSCGTQLDHGVVAVGYGT-ENGVDYWVVRNSWGPAWGENGYIRMERNVA 337
Query: 266 --DEGLCGIGTRSSYP 279
+ G CGI +SYP
Sbjct: 338 STETGKCGIAMEASYP 353
>gi|297809385|ref|XP_002872576.1| hypothetical protein ARALYDRAFT_489965 [Arabidopsis lyrata subsp.
lyrata]
gi|297318413|gb|EFH48835.1| hypothetical protein ARALYDRAFT_489965 [Arabidopsis lyrata subsp.
lyrata]
Length = 371
Score = 176 bits (446), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 117/356 (32%), Positives = 174/356 (48%), Gaps = 93/356 (26%)
Query: 16 TPMFIIITLLV--SCAS----QVVSSRSTHE--------QSVVE-----IHEKWMAQHGR 56
+ M +++ +V SCA+ +VSS H Q V + + E WM +HG+
Sbjct: 6 SAMLVLLLAMVISSCATAMDMSIVSSNDNHHVTNGPGRRQGVFDAEATLMFESWMVKHGK 65
Query: 57 SYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSP 116
Y+ EKE RL IF++NL +I N E N +Y+LG N+F+DL+ E+ + G P
Sbjct: 66 VYESVAEKERRLTIFEDNLRFITNRNAE-NLSYRLGLNRFADLSLHEYAQICHGADPRPP 124
Query: 117 SHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIR 176
+ +S+ +Y+ +P S+DWR++GAVT +K+Q +C CWAF+ V AVEG+ KI
Sbjct: 125 RNHVFMTSSNRYKTSDGDVLPKSVDWRNEGAVTEVKDQGQCRSCWAFSTVGAVEGLNKIV 184
Query: 177 SGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI----------FNGVCGTQL 226
+G L+ LSEQ L++C+ NNGC GG E A+ +I+ N G+ NGVC +L
Sbjct: 185 TGELVTLSEQDLINCNKE-NNGCGGGKVETAYEFIMNNGGLGTDNDYPYKALNGVCNDRL 243
Query: 227 ---DHAVTIVGF----------------------------------------GTTEDGAN 243
+ V I G+ GT N
Sbjct: 244 KENNKNVMIDGYENLPANDESALMKAVAHQPVTAVVDSSSREFQLYASGVFDGTCGTNLN 303
Query: 244 YWLI---------KNSW------GNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 280
+ ++ ++ W GNTWG+AGYMK+ R+ GLCGI R+SYPL
Sbjct: 304 HGVVVVGYGTENGRDYWIVRNSRGNTWGEAGYMKMARNIANPRGLCGIAMRASYPL 359
>gi|356545116|ref|XP_003540991.1| PREDICTED: vignain-like [Glycine max]
Length = 342
Score = 176 bits (446), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 90/210 (42%), Positives = 133/210 (63%), Gaps = 6/210 (2%)
Query: 8 SGSFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMR 67
S S K N +F+++T+ S QV+S R + S V+ HEKWMAQ+G+ YKD EKE R
Sbjct: 3 SFSQKKNILVVFLVLTVWTS---QVMSRRLSEAYSSVK-HEKWMAQYGKVYKDAAEKEKR 58
Query: 68 LKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFK 127
+IFK N+ +IE + G++ + L NQF+DL +F+AL + + R+ T++
Sbjct: 59 FQIFKNNVHFIESFHAAGDKPFNLSINQFADL--HKFKALLINGQKKEHNVRTATATEAS 116
Query: 128 YQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQ 187
++ S+T +P+SLDWR +GAVTPIK+Q C CWAF+ VA +EG+ +I G L+ LSEQ+
Sbjct: 117 FKYDSVTRIPSSLDWRKRGAVTPIKDQGTCRSCWAFSTVATIEGLHQITKGELVSLSEQE 176
Query: 188 LLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
L+DC + GC GG E AF +I + G+
Sbjct: 177 LVDCVKGDSEGCYGGYVEDAFEFIAKKGGV 206
Score = 94.4 bits (233), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 40/78 (51%), Positives = 53/78 (67%), Gaps = 4/78 (5%)
Query: 208 FAYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD- 266
+A+ + GIF G CGT +DH+VT+VG+G G YWL+KNSWG WG+ GY+++ RD
Sbjct: 265 YAFQFYSSGIFTGKCGTDIDHSVTVVGYGKARGGNKYWLVKNSWGTEWGEKGYIRMKRDI 324
Query: 267 ---EGLCGIGTRSSYPLA 281
EGLCGI T + YP A
Sbjct: 325 RAKEGLCGIATGALYPTA 342
>gi|46401612|dbj|BAD16614.1| cysteine proteinase [Dianthus caryophyllus]
Length = 459
Score = 176 bits (446), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 87/204 (42%), Positives = 139/204 (68%), Gaps = 5/204 (2%)
Query: 18 MFIIITLLVSCA---SQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKEN 74
+F II ++ S A S + + + + + ++E W+ +HG++Y EK++R IFK+N
Sbjct: 11 LFSIIFIVSSSALDLSIIDRAFNRPDDEIASLYETWLVKHGKNYNGLGEKQLRFNIFKDN 70
Query: 75 LEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPS-HRSTTSSTFKYQNLSM 133
L ++++ N E N ++KLG N+F+DLTN+E+R++Y G + S + RS S + +Y +
Sbjct: 71 LRFVDERNSE-NLSFKLGLNRFADLTNEEYRSVYLGTRPRSVAVARSGRSKSDRYAFRAG 129
Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST 193
+P S+DWR KGAV IK+Q CG CWAF+A+AAVEG+ +I +G+LI LSEQ+L++C T
Sbjct: 130 DTLPESVDWRKKGAVAGIKDQGSCGSCWAFSAIAAVEGVNQIVTGDLISLSEQELVECDT 189
Query: 194 NGNNGCLGGSREKAFAYIIQNQGI 217
+ N+GC GG + AF +II+N+GI
Sbjct: 190 SYNDGCDGGLMDYAFEFIIKNEGI 213
Score = 89.7 bits (221), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 39/84 (46%), Positives = 58/84 (69%), Gaps = 9/84 (10%)
Query: 201 GGSREKAFAYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGY 260
GG R+ + + + G+F G CGT LDH V +VG+GT EDG +YW+++NSWG+TWG+ GY
Sbjct: 269 GGGRD----FQLYDSGVFTGKCGTALDHGVAVVGYGT-EDGLDYWIVRNSWGDTWGEGGY 323
Query: 261 MKIVRD----EGLCGIGTRSSYPL 280
+++ R+ G+CGI SYP+
Sbjct: 324 IRMQRNTKLPSGICGIAIEPSYPI 347
>gi|302763831|ref|XP_002965337.1| hypothetical protein SELMODRAFT_230602 [Selaginella moellendorffii]
gi|300167570|gb|EFJ34175.1| hypothetical protein SELMODRAFT_230602 [Selaginella moellendorffii]
Length = 343
Score = 176 bits (445), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 90/203 (44%), Positives = 129/203 (63%), Gaps = 11/203 (5%)
Query: 21 IITLLVSCASQVVSSR--STHEQSVVEIH---EKWMAQHGRSYKDELEKEMRLKIFKENL 75
+I L+V A+ +R + + +EI E W A+HG+SY +LEK RL IF + L
Sbjct: 10 LILLVVVGATPFAIARPAALEDGRALEIKNMFEDWAAKHGKSYSSDLEKARRLMIFSDTL 69
Query: 76 EYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTG-YKMPSPSHRSTTSSTFKYQNLSMT 134
YIEK N + N T+ LG N+FSDLTN EFRA++ G +K P R +++ ++
Sbjct: 70 AYIEKHNAQPNTTFTLGLNKFSDLTNAEFRAMHVGKFKRPRYQDRLPAED----EDVDVS 125
Query: 135 DVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN 194
+PTSLDWR KGAVTPIK+Q +CG CWAF+A+A++E + + L+ LSEQQL+DC T
Sbjct: 126 SLPTSLDWRQKGAVTPIKDQGDCGSCWAFSAIASIESAHFLATKELVSLSEQQLMDCDTV 185
Query: 195 GNNGCLGGSREKAFAYIIQNQGI 217
+ GC GG E AF ++++N G+
Sbjct: 186 -DAGCDGGLMETAFKFVVKNGGV 207
Score = 80.5 bits (197), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 36/66 (54%), Positives = 47/66 (71%), Gaps = 3/66 (4%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD--EGLCGIG 273
GI +G CG LDH V ++G+GT E G YW+IKNSWG +WG+ G+MKI R +G+CG+
Sbjct: 276 GILSGQCGDSLDHGVLLIGYGT-EGGMPYWIIKNSWGTSWGEDGFMKIERKDGDGICGMN 334
Query: 274 TRSSYP 279
SSYP
Sbjct: 335 GDSSYP 340
>gi|226501480|ref|NP_001150266.1| cysteine protease 1 precursor [Zea mays]
gi|195637948|gb|ACG38442.1| cysteine protease 1 precursor [Zea mays]
Length = 462
Score = 175 bits (444), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 86/210 (40%), Positives = 132/210 (62%), Gaps = 14/210 (6%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN-KEGNRTYKLGTNQFSD 98
E ++E W+A+HGR+Y E++ R ++F +NL +++ N + ++LG NQF+D
Sbjct: 42 EPEARTLYELWLAEHGRAYNALGERDRRFRVFWDNLRFVDAHNERAAEHGFRLGMNQFAD 101
Query: 99 LTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECG 158
LTNDEFRA Y G ++P+ R T ++P S+DWR+KGAV P+KNQ +CG
Sbjct: 102 LTNDEFRAAYLGARIPAARRRGTAVGERYRHGGGAEELPESVDWREKGAVAPVKNQGQCG 161
Query: 159 CCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGI 217
CWAF+AV++VE + +I +G ++ LSEQ+L++CST+ GN+GC GG + AF +II+N GI
Sbjct: 162 SCWAFSAVSSVESVNQIVTGEMVTLSEQELVECSTDGGNSGCNGGLMDAAFDFIIKNGGI 221
Query: 218 ----------FNGVCGTQLDHA--VTIVGF 235
+G C ++A V+I GF
Sbjct: 222 DTEGDYPYKAVDGKCDINRENAKVVSIDGF 251
Score = 76.6 bits (187), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 36/82 (43%), Positives = 52/82 (63%), Gaps = 9/82 (10%)
Query: 202 GSREKAFAYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYM 261
G RE + + G+F+G C T LDH V VG+GT E+G +YW+++NSWG WG+ GY+
Sbjct: 278 GGRE----FQLYKAGVFSGTCTTNLDHGVVAVGYGT-ENGKDYWIVRNSWGAKWGEDGYI 332
Query: 262 KIVRD----EGLCGIGTRSSYP 279
++ R+ G CGI +SYP
Sbjct: 333 RMERNVNATTGKCGIAMMASYP 354
>gi|238006338|gb|ACR34204.1| unknown [Zea mays]
Length = 465
Score = 175 bits (444), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 86/210 (40%), Positives = 132/210 (62%), Gaps = 14/210 (6%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN-KEGNRTYKLGTNQFSD 98
E ++E W+A+HGR+Y E++ R ++F +NL +++ N + ++LG NQF+D
Sbjct: 45 EPEARTLYELWLAEHGRAYNALGERDRRFRVFWDNLRFVDAHNERAAEHGFRLGMNQFAD 104
Query: 99 LTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECG 158
LTNDEFRA Y G ++P+ R T ++P S+DWR+KGAV P+KNQ +CG
Sbjct: 105 LTNDEFRAAYLGARIPASRRRGTAVGERYRHGGGAEELPESVDWREKGAVAPVKNQGQCG 164
Query: 159 CCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGI 217
CWAF+AV++VE + +I +G ++ LSEQ+L++CST+ GN+GC GG + AF +II+N GI
Sbjct: 165 SCWAFSAVSSVESVNQIVTGEMVTLSEQELVECSTDGGNSGCNGGLMDAAFDFIIKNGGI 224
Query: 218 ----------FNGVCGTQLDHA--VTIVGF 235
+G C ++A V+I GF
Sbjct: 225 DTEGDYPYKAVDGKCDINRENAKVVSIDGF 254
Score = 76.3 bits (186), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 36/82 (43%), Positives = 51/82 (62%), Gaps = 9/82 (10%)
Query: 202 GSREKAFAYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYM 261
G RE + + G+F G C T LDH V VG+GT E+G +YW+++NSWG WG+ GY+
Sbjct: 281 GGRE----FQLYKAGVFTGTCTTNLDHGVVAVGYGT-ENGKDYWIVRNSWGAKWGEDGYI 335
Query: 262 KIVRD----EGLCGIGTRSSYP 279
++ R+ G CGI +SYP
Sbjct: 336 RMERNVNATTGKCGIAMMASYP 357
>gi|357124027|ref|XP_003563708.1| PREDICTED: germination-specific cysteine protease 1-like
[Brachypodium distachyon]
Length = 334
Score = 175 bits (444), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 84/188 (44%), Positives = 122/188 (64%), Gaps = 5/188 (2%)
Query: 35 SRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE----GNRTYK 90
S + ++++ E +EKWMA+ GR+YKD EK R ++FK N +I+ N G K
Sbjct: 8 STAGDDKAMRERYEKWMAEQGRTYKDSTEKARRFEVFKSNAHFIDSHNAATGPGGKSRPK 67
Query: 91 LGTNQFSDLTNDEFRALY-TGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVT 149
L TN+F+DLT DEFR +Y TG+++ T + FK+ +S++DVP S+DWR +GAVT
Sbjct: 68 LTTNKFADLTEDEFRNIYVTGHRVNYRPTSLVTDTVFKFGAVSLSDVPPSIDWRARGAVT 127
Query: 150 PIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFA 209
+K+Q C CCWAF++ AAVEGI +I +GN + LS QQL+DCS N C G +KA+
Sbjct: 128 SVKDQHLCACCWAFSSAAAVEGIHQITTGNQVSLSVQQLVDCSNAANEKCKAGEIDKAYE 187
Query: 210 YIIQNQGI 217
YI ++ G+
Sbjct: 188 YIARSGGL 195
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 39/74 (52%), Positives = 49/74 (66%), Gaps = 8/74 (10%)
Query: 216 GIFNGV---CGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-----E 267
GIF C T L+HA+TIVG+GT E G YWL+KNSWG+ WGD GY+K RD
Sbjct: 261 GIFGSAGEPCTTNLNHAMTIVGYGTDEHGTRYWLMKNSWGSDWGDKGYVKFARDVASEIN 320
Query: 268 GLCGIGTRSSYPLA 281
G+CG+ +SYP+A
Sbjct: 321 GVCGLALEASYPVA 334
>gi|50355611|dbj|BAD29954.1| cysteine protease [Daucus carota]
Length = 474
Score = 175 bits (443), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 97/250 (38%), Positives = 145/250 (58%), Gaps = 19/250 (7%)
Query: 1 MVLIFERSGSFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKD 60
+VL F + +++ IIT + R TH+Q ++ ++E W+ +H ++Y
Sbjct: 16 LVLFFSLASFLMLSSASDMSIITYDETHGLNSPPLR-THDQ-LLSLYESWLVKHHKNYNA 73
Query: 61 ELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRS 120
EKE R IFK+N+ ++++ N N++YKLG N+F+DLTNDE+R+LY KM ++
Sbjct: 74 LGEKETRFGIFKDNVGFVDRHNSMRNQSYKLGLNKFADLTNDEYRSLYLSGKMMKRERKN 133
Query: 121 TTSSTFKYQNLSMTD---VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRS 177
F+ D +P S+DWRD+GAV P+K+Q +CG CWAF+ V AVEGI KI +
Sbjct: 134 EDG--FRSDRFVFEDGDHLPESVDWRDRGAVAPVKDQGQCGSCWAFSTVGAVEGINKIVT 191
Query: 178 GNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI----------FNGVCGTQLD 227
G LI LSEQ+L+DC N GC GG + AF +I++N GI +G+C
Sbjct: 192 GELISLSEQELVDCDNGYNQGCNGGLMDYAFEFIVKNGGIDTEDDYPYKGVDGLCDQNRK 251
Query: 228 HA--VTIVGF 235
+A VTI G+
Sbjct: 252 NAKVVTINGY 261
Score = 81.3 bits (199), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 34/76 (44%), Positives = 53/76 (69%), Gaps = 6/76 (7%)
Query: 209 AYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-- 266
A+ + G+F G CGT+LDH V VG+G+ E+G +YW+++NSWG WG++GY+++ R+
Sbjct: 291 AFQLYESGVFTGQCGTELDHGVVAVGYGS-ENGKDYWIVRNSWGPDWGESGYIRLERNVA 349
Query: 267 ---EGLCGIGTRSSYP 279
G CGI ++SYP
Sbjct: 350 STSTGKCGIAMQASYP 365
>gi|414584879|tpg|DAA35450.1| TPA: cysteine protease 1 [Zea mays]
Length = 522
Score = 175 bits (443), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 86/210 (40%), Positives = 132/210 (62%), Gaps = 14/210 (6%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN-KEGNRTYKLGTNQFSD 98
E ++E W+A+HGR+Y E++ R ++F +NL +++ N + ++LG NQF+D
Sbjct: 102 EPEARTLYELWLAEHGRAYNALGERDRRFRVFWDNLRFVDAHNERAAEHGFRLGMNQFAD 161
Query: 99 LTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECG 158
LTNDEFRA Y G ++P+ R T ++P S+DWR+KGAV P+KNQ +CG
Sbjct: 162 LTNDEFRAAYLGARIPASRRRGTAVGERYRHGGGAEELPESVDWREKGAVAPVKNQGQCG 221
Query: 159 CCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGI 217
CWAF+AV++VE + +I +G ++ LSEQ+L++CST+ GN+GC GG + AF +II+N GI
Sbjct: 222 SCWAFSAVSSVESVNQIVTGEMVTLSEQELVECSTDGGNSGCNGGLMDAAFDFIIKNGGI 281
Query: 218 ----------FNGVCGTQLDHA--VTIVGF 235
+G C ++A V+I GF
Sbjct: 282 DTEGDYPYKAVDGKCDINRENAKVVSIDGF 311
Score = 76.6 bits (187), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 36/82 (43%), Positives = 51/82 (62%), Gaps = 9/82 (10%)
Query: 202 GSREKAFAYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYM 261
G RE + + G+F G C T LDH V VG+GT E+G +YW+++NSWG WG+ GY+
Sbjct: 338 GGRE----FQLYKAGVFTGTCTTNLDHGVVAVGYGT-ENGKDYWIVRNSWGAKWGEDGYI 392
Query: 262 KIVRD----EGLCGIGTRSSYP 279
++ R+ G CGI +SYP
Sbjct: 393 RMERNVNATTGKCGIAMMASYP 414
>gi|413953666|gb|AFW86315.1| hypothetical protein ZEAMMB73_539008 [Zea mays]
Length = 314
Score = 175 bits (443), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 107/333 (32%), Positives = 150/333 (45%), Gaps = 99/333 (29%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
+ I+ C + + + + + ++V HE+WMAQ+ R YKD EK R K
Sbjct: 8 ILAILGFAFFCGAALAARDLSDDSAMVARHEQWMAQYSRVYKDASEKARRFK-------- 59
Query: 78 IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
F+DLTN EFR++ T S + + T F+Y+N+S +P
Sbjct: 60 ------------------FADLTNHEFRSVKTNKGFKSSNMKILTG--FRYENVSADALP 99
Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLI---------------- 181
T++DWR KG VTPIK+Q +CGCC AF+AVAA EGI KI +G L+
Sbjct: 100 TTIDWRTKGVVTPIKDQGQCGCCSAFSAVAATEGIVKISTGKLVSLADQELVDCDVHGED 159
Query: 182 QLSEQQLLDCS------------------TNGNNGCLGGSR---------------EKAF 208
Q E L+D + T + C GS E A
Sbjct: 160 QGCEGGLMDDAFKFIIKNGGLTTESSYPYTAADGKCNSGSNSAATIKGYEDVPANDEAAL 219
Query: 209 AYIIQNQ------------------GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNS 250
+ NQ G+ G CGT LDH + +G+G T DG YWL+KNS
Sbjct: 220 MKAMANQPVSVAVDGGDMTFRFYSGGVMTGSCGTDLDHGIAAIGYGKTSDGTKYWLMKNS 279
Query: 251 WGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 279
WG TWG+ GY+++ +D G+CG+ SYP
Sbjct: 280 WGTTWGENGYLRMEKDISDKRGMCGLAMEPSYP 312
>gi|18141285|gb|AAL60580.1|AF454958_1 senescence-associated cysteine protease [Brassica oleracea]
Length = 485
Score = 174 bits (442), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 88/217 (40%), Positives = 137/217 (63%), Gaps = 19/217 (8%)
Query: 13 INTTPMFIIITLLVSCAS------------QVVSSRSTHEQSVVEIHEKWMAQHGRSYKD 60
+N+ + + +T++V ++ VSSRS E V ++E+W+ +HG++
Sbjct: 4 LNSATVILFLTMIVVSSAMDMSIISYDKNHHTVSSRSDAE--VSRLYEEWLVKHGKAQNS 61
Query: 61 ELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRS 120
EK+ R +IFK+NL +I++ N + N +Y+LG +F+DLTNDE+R++Y G ++ R
Sbjct: 62 LTEKDRRFEIFKDNLRFIDEHNGK-NLSYRLGLTKFADLTNDEYRSMYLGSRL----KRK 116
Query: 121 TTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNL 180
T S+ +Y+ +P S+DWR +GAV +K+Q CG CWAF+ + AVEGI KI +G+L
Sbjct: 117 ATKSSLRYEVRVGDAIPESVDWRKEGAVAEVKDQGSCGSCWAFSTIGAVEGINKIVTGDL 176
Query: 181 IQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
I LSEQ+L+DC T+ N GC GG + AF +II N GI
Sbjct: 177 ITLSEQELVDCDTSYNEGCNGGLMDYAFEFIINNGGI 213
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 40/84 (47%), Positives = 58/84 (69%), Gaps = 9/84 (10%)
Query: 201 GGSREKAFAYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGY 260
GG R A+ + + GIF+G+CGT LDH V VG+GT E+G +YW++KNSWG +WG++GY
Sbjct: 269 GGGR----AFQLYDSGIFDGICGTDLDHGVVAVGYGT-ENGKDYWIVKNSWGTSWGESGY 323
Query: 261 MKIVRD----EGLCGIGTRSSYPL 280
+++ R+ G CGI SYP+
Sbjct: 324 IRMERNIASSAGKCGIAVEPSYPI 347
>gi|118127|sp|P25251.1|CYSP4_BRANA RecName: Full=Cysteine proteinase COT44; Flags: Precursor
Length = 328
Score = 174 bits (442), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 109/316 (34%), Positives = 161/316 (50%), Gaps = 80/316 (25%)
Query: 44 VEIHEKWMAQHGRSYKDEL----EKEMRLKIFKENLEYIEKANKEG-NRTYKLGTNQFSD 98
+ I+ +W +HG+S + +++ R IFK+NL +I+ N+ N TYKLG F++
Sbjct: 1 MSIYLRWSLEHGKSNSNSNGIINQQDERFNIFKDNLRFIDLHNENNKNATYKLGLTIFAN 60
Query: 99 LTNDEFRALYTGYKMPSPSHRSTTSST--FKYQN-LSMTDVPTSLDWRDKGAVTPIKNQK 155
LTNDE+R+LY G + P R T + KY +++ +VP ++DWR KGAV IK+Q
Sbjct: 61 LTNDEYRSLYLGART-EPVRRITKAKNVNMKYSAAVNVDEVPVTVDWRQKGAVNAIKDQG 119
Query: 156 ECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQ 215
CG CWAF+ AAVEGI KI +G L+ LSEQ+L+DC + N GC GG + AF +I++N
Sbjct: 120 TCGSCWAFSTAAAVEGINKIVTGELVSLSEQELVDCDKSYNQGCNGGLMDYAFQFIMKNG 179
Query: 216 GI----------FNGVCGTQLDHA--VTIVGF---------------------------- 235
G+ NG C + L ++ VTI G+
Sbjct: 180 GLNTEKDYPYHGTNGKCNSLLKNSRVVTIDGYEDVPSKDETALKRAVSYQPVSVAIDAGG 239
Query: 236 ---------------GTTED------------GANYWLIKNSWGNTWGDAGYMKIVRD-- 266
GT D G +YW+++NSWG WG+ GY+++ R+
Sbjct: 240 RAFQHYQSGIFTGKCGTNMDHAVVAVGYGSENGVDYWIVRNSWGTRWGEDGYIRMERNVA 299
Query: 267 --EGLCGIGTRSSYPL 280
G CGI +SYP+
Sbjct: 300 SKSGKCGIAIEASYPV 315
>gi|50355623|dbj|BAD29960.1| cysteine protease [Daucus carota]
Length = 460
Score = 174 bits (442), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 81/191 (42%), Positives = 126/191 (65%), Gaps = 6/191 (3%)
Query: 32 VVSSRSTH-----EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGN 86
+++ TH + ++ +E W+ +HG+SY EKE R +IFK+N YI++ N +
Sbjct: 24 IITYDQTHAVGSTDDVIMAAYESWLVKHGKSYNALGEKEQRFQIFKDNFLYIDEQNAAKD 83
Query: 87 RTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKG 146
R++KLG N+F+DLTN+E+R+ YTG + S + + + +Y +L+ +P S+DWR+ G
Sbjct: 84 RSFKLGLNRFADLTNEEYRSKYTGIRTKD-SRKKVSGKSQRYASLAGESLPESVDWREHG 142
Query: 147 AVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREK 206
AV +K+Q +CG CWAF+ ++AVEGI +I +G LI LSEQ+L+DC + N GC GG +
Sbjct: 143 AVASVKDQGQCGSCWAFSTISAVEGINQIATGKLITLSEQELVDCDRSYNEGCNGGLMDD 202
Query: 207 AFAYIIQNQGI 217
AF +II N GI
Sbjct: 203 AFQFIINNGGI 213
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 35/71 (49%), Positives = 50/71 (70%), Gaps = 5/71 (7%)
Query: 214 NQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR----DEGL 269
+ GIF G CGT LDH V +VG+GT E+G +YW+++NSWG WG+ GY+++ R G+
Sbjct: 278 DSGIFTGKCGTDLDHGVVVVGYGT-ENGKDYWIVRNSWGADWGEKGYLRMERGISSKAGI 336
Query: 270 CGIGTRSSYPL 280
CGI + SYP+
Sbjct: 337 CGITSEPSYPV 347
>gi|297852302|ref|XP_002894032.1| F2G19.31/F2G19.31 [Arabidopsis lyrata subsp. lyrata]
gi|297339874|gb|EFH70291.1| F2G19.31/F2G19.31 [Arabidopsis lyrata subsp. lyrata]
Length = 455
Score = 174 bits (442), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 99/251 (39%), Positives = 146/251 (58%), Gaps = 28/251 (11%)
Query: 18 MFIIITLLVSCASQV----VSSRSTH---------EQSVVEIHEKWMAQHGRSYKDE--L 62
M I+ +V+ AS V +S H + V+ I+E W+ +HG++ +
Sbjct: 1 MVILFLAMVAVASAVDMSIISYDEKHGVSTTGGRSDAEVMSIYEAWLVKHGKAQNQNSLV 60
Query: 63 EKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTT 122
EK+ R +IFK+NL +I+ NK+ N +Y+LG +F+DLTNDE+R+ Y G KM R T+
Sbjct: 61 EKDRRFEIFKDNLRFIDDHNKK-NLSYRLGLTRFADLTNDEYRSKYLGAKMEKKGERRTS 119
Query: 123 SSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQ 182
+Y+ ++P S+DWR KGAV +K+Q CG CWAF+ + AVEGI +I +G+LI
Sbjct: 120 Q---RYEARVGDELPESIDWRKKGAVAEVKDQGSCGSCWAFSTIGAVEGINQIVTGDLIT 176
Query: 183 LSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI-------FNGVCGT--QLDHAVTIV 233
LSEQ+L+DC T+ N GC GG + AF +II+N GI + GV GT Q+ +V
Sbjct: 177 LSEQELVDCDTSYNEGCNGGLMDYAFEFIIKNGGIDTDKDYPYKGVDGTCDQIRKNAKVV 236
Query: 234 GFGTTEDGANY 244
+ ED Y
Sbjct: 237 TIDSYEDVPTY 247
Score = 90.9 bits (224), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 38/76 (50%), Positives = 55/76 (72%), Gaps = 5/76 (6%)
Query: 209 AYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-- 266
A+ + + GIF+G CGTQLDH V VG+GT E+G +YW+++NSWG +WG++GY+K+ R+
Sbjct: 271 AFQLYDSGIFDGTCGTQLDHGVVAVGYGT-ENGKDYWIVRNSWGKSWGESGYLKMARNIA 329
Query: 267 --EGLCGIGTRSSYPL 280
G CGI SYP+
Sbjct: 330 SSSGKCGIAIEPSYPI 345
>gi|255626679|gb|ACU13684.1| unknown [Glycine max]
Length = 229
Score = 174 bits (442), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 89/221 (40%), Positives = 130/221 (58%), Gaps = 13/221 (5%)
Query: 13 INTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFK 72
+ +T +F+ TL SCA + + + V+ ++E+W+ +H + Y EK+ R ++FK
Sbjct: 8 VTSTLLFLSFTL--SCAIDTSTITNYTDNEVMTMYEEWLVKHQKVYNGLREKDKRFQVFK 65
Query: 73 ENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSST-FKYQNL 131
+NL +I++ N N TYKLG NQF+D+TN+E+R +Y G K + T ST +Y
Sbjct: 66 DNLGFIQEHNNNQNNTYKLGLNQFADMTNEEYRVMYFGTKSDAKRRLMKTKSTGHRYAYS 125
Query: 132 SMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDC 191
+ +P +DWR KGAV PIK+Q CG CWAF+ VA VE KI +G + LSEQ+L+DC
Sbjct: 126 AGDRLPVHVDWRVKGAVAPIKDQGSCGSCWAFSTVATVEATNKIVTGKFVSLSEQELVDC 185
Query: 192 STNGNNGCLGGSREKAFAYIIQNQGI----------FNGVC 222
N C GG + AF +IIQN GI F+G+C
Sbjct: 186 DRAYNERCNGGLMDYAFEFIIQNGGIDTDKDYPYRGFDGIC 226
>gi|1173630|gb|AAB37233.1| cysteine proteinase [Phalaenopsis sp. SM9108]
Length = 359
Score = 174 bits (441), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 88/208 (42%), Positives = 128/208 (61%), Gaps = 9/208 (4%)
Query: 18 MFIIITLLVSCASQVVSSRSTH---EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKEN 74
+ ++ + L S A+ + E S+ ++E+W + H S +D EK+ R +FKEN
Sbjct: 6 LILVASFLASVAATAIDIADKDLETEDSLWNLYERWRSHHTVS-RDLDEKQKRFNVFKEN 64
Query: 75 LEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSP-----SHRSTTSSTFKYQ 129
YI NK + YKL N+F+DLTN EFR+ Y G ++ S R +++F YQ
Sbjct: 65 PRYIHDFNKRKDIPYKLRLNKFADLTNHEFRSTYAGSRINHHRSLRGSRRGGATNSFMYQ 124
Query: 130 NLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLL 189
+L +P S+DWR KGAVT +K+Q +CG CWAF+ VAAVEGI +I++ L+ LSEQ+L+
Sbjct: 125 SLDSRSLPASIDWRQKGAVTAVKDQGQCGSCWAFSTVAAVEGINQIKTKKLLSLSEQELI 184
Query: 190 DCSTNGNNGCLGGSREKAFAYIIQNQGI 217
DC T+ NNGC GG + AF +I +N GI
Sbjct: 185 DCDTDENNGCNGGLMDYAFDFIKKNGGI 212
Score = 81.6 bits (200), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 32/76 (42%), Positives = 50/76 (65%), Gaps = 3/76 (3%)
Query: 208 FAYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKI---V 264
+ + ++G+F G GT+LDH V IVG+G T+ G YW+++NSWG WG+ GY++I
Sbjct: 270 YDFQFYSEGVFTGRSGTELDHGVAIVGYGKTQQGTKYWIVRNSWGAEWGEKGYIRISAAS 329
Query: 265 RDEGLCGIGTRSSYPL 280
+ LCG+ +SYP+
Sbjct: 330 DSKRLCGLAMEASYPI 345
>gi|171702831|dbj|BAG16371.1| cysteine protease [Brassica oleracea var. italica]
Length = 441
Score = 174 bits (441), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 85/187 (45%), Positives = 125/187 (66%), Gaps = 7/187 (3%)
Query: 31 QVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYK 90
VSSRS E V ++E+W+ +HG++ EK+ R +IFK+NL +I++ N + N +Y+
Sbjct: 28 HTVSSRSDAE--VSRLYEEWLVKHGKAQNSLTEKDRRFEIFKDNLRFIDEHNGK-NLSYR 84
Query: 91 LGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTP 150
LG +F+DLTNDE+R++Y G ++ R T S+ +Y+ +P S+DWR +GAV
Sbjct: 85 LGLTKFADLTNDEYRSMYLGSRL----KRKATKSSLRYEVRVGDAIPESVDWRKEGAVAE 140
Query: 151 IKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAY 210
+K+Q CG CWAF+ + AVEGI KI +G+LI LSEQ+L+DC T+ N GC GG + AF +
Sbjct: 141 VKDQGSCGSCWAFSTIGAVEGINKIVTGDLITLSEQELVDCDTSYNEGCNGGLMDYAFEF 200
Query: 211 IIQNQGI 217
II N GI
Sbjct: 201 IINNGGI 207
Score = 88.6 bits (218), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 40/84 (47%), Positives = 58/84 (69%), Gaps = 9/84 (10%)
Query: 201 GGSREKAFAYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGY 260
GG R A+ + + GIF+G+CGT LDH V VG+GT E+G +YW++KNSWG +WG++GY
Sbjct: 263 GGGR----AFQLYDSGIFDGICGTDLDHGVVAVGYGT-ENGKDYWIVKNSWGTSWGESGY 317
Query: 261 MKIVRD----EGLCGIGTRSSYPL 280
+++ R+ G CGI SYP+
Sbjct: 318 IRMERNIASSAGKCGIAVEPSYPI 341
>gi|449525012|ref|XP_004169515.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
Length = 459
Score = 174 bits (441), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 88/201 (43%), Positives = 132/201 (65%), Gaps = 6/201 (2%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKD-ELEKEMRLKIFKENLE 76
F+ I L + S ++ R+ E V+ ++++W A+HG+ + + E E R IFK+NL+
Sbjct: 14 FFLFIALSAASPSSIIPQRTDDE--VMALYDQWRAKHGKLHNNLGAEPENRFHIFKDNLK 71
Query: 77 YIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDV 136
+I++ N + N Y+LG N F+DLTN+E+R+ Y G K S S R+ TS+ +Y D+
Sbjct: 72 FIDEINAQ-NLPYRLGLNVFADLTNEEYRSRYLGGKFASGSRRNRTSN--RYLPRLGDDL 128
Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGN 196
P S+DWR KGAV P+K+Q CG CWAF+ VA+VE I +I +G+LI LSEQ+L+DC + N
Sbjct: 129 PDSIDWRAKGAVAPVKDQGSCGSCWAFSTVASVEAINQIVTGDLIALSEQELVDCDRSYN 188
Query: 197 NGCLGGSREKAFAYIIQNQGI 217
GC GG + AF +II+N G+
Sbjct: 189 EGCNGGLMDYAFEFIIENGGL 209
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 39/83 (46%), Positives = 55/83 (66%), Gaps = 9/83 (10%)
Query: 201 GGSREKAFAYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGY 260
GG R ++ + GIF G CGT LDH V +VG+G+ E G +YW+++NSWG +WG++GY
Sbjct: 265 GGGR----SFQLYQSGIFTGRCGTDLDHGVNVVGYGS-EGGVDYWIVRNSWGGSWGESGY 319
Query: 261 MKIVRD----EGLCGIGTRSSYP 279
+K+ R+ GLCGI SYP
Sbjct: 320 VKMQRNIASPTGLCGIAMEPSYP 342
>gi|297603535|ref|NP_001054211.2| Os04g0670200 [Oryza sativa Japonica Group]
gi|109939735|sp|P25777.2|ORYB_ORYSJ RecName: Full=Oryzain beta chain; Flags: Precursor
gi|32488398|emb|CAE02823.1| OSJNBa0043A12.28 [Oryza sativa Japonica Group]
gi|90399163|emb|CAJ86092.1| H0818H01.14 [Oryza sativa Indica Group]
gi|125550169|gb|EAY95991.1| hypothetical protein OsI_17862 [Oryza sativa Indica Group]
gi|215766596|dbj|BAG98700.1| unnamed protein product [Oryza sativa Japonica Group]
gi|255675868|dbj|BAF16125.2| Os04g0670200 [Oryza sativa Japonica Group]
Length = 466
Score = 174 bits (441), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 106/316 (33%), Positives = 167/316 (52%), Gaps = 80/316 (25%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDEL--EKEMRLKIFKENLEYIEKANKEGNRT--YKLGTNQ 95
E ++ W+A++G + L E E R +F +NL++++ N + ++LG N+
Sbjct: 45 EAEARAAYDLWLAENGGGSPNALGGEHERRFLVFWDNLKFVDAHNARADERGGFRLGMNR 104
Query: 96 FSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQK 155
F+DLTN+EFRA + G K+ + RS + +Y++ + ++P S+DWR+KGAV P+KNQ
Sbjct: 105 FADLTNEEFRATFLGAKV---AERSRAAGE-RYRHDGVEELPESVDWREKGAVAPVKNQG 160
Query: 156 ECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFAYIIQN 214
+CG CWAF+AV+ VE I ++ +G +I LSEQ+L++CSTNG N+GC GG + AF +II+N
Sbjct: 161 QCGSCWAFSAVSTVESINQLVTGEMITLSEQELVECSTNGQNSGCNGGLMDDAFDFIIKN 220
Query: 215 QGI----------FNGVCGTQLDHA--VTIVGF--------------------------- 235
GI +G C ++A V+I GF
Sbjct: 221 GGIDTEDDYPYKAVDGKCDINRENAKVVSIDGFEDVPQNDEKSLQKAVAHQPVSVAIEAG 280
Query: 236 ----------------GTTED------------GANYWLIKNSWGNTWGDAGYMKIVRD- 266
GT+ D G +YW+++NSWG WG++GY+++ R+
Sbjct: 281 GREFQLYHSGVFSGRCGTSLDHGVVAVGYGTDNGKDYWIVRNSWGPKWGESGYVRMERNI 340
Query: 267 ---EGLCGIGTRSSYP 279
G CGI +SYP
Sbjct: 341 NVTTGKCGIAMMASYP 356
>gi|326494040|dbj|BAJ85482.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 355
Score = 174 bits (441), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 82/176 (46%), Positives = 110/176 (62%), Gaps = 6/176 (3%)
Query: 47 HEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRA 106
HE+WMA++GR Y D EK R ++F N +I+ N+ GNRTY LG N FSDLTN+EF
Sbjct: 41 HERWMAKYGRVYADAAEKLRRQEVFAANARHIDAVNRAGNRTYTLGLNHFSDLTNEEFAQ 100
Query: 107 LYTGYK-MPSPS----HRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCW 161
+ GY+ P P S+ ++ + + P S+DWR +GAVTP+K+Q CG CW
Sbjct: 101 THLGYRHQPGPGGLRPEDSSPAAAVNVTDAQLQSTPDSVDWRARGAVTPVKHQGHCGSCW 160
Query: 162 AFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
AFAAVAA EG+ +I +GNLI +SEQQ+LDC T G + C G A YI + G+
Sbjct: 161 AFAAVAATEGLVQIATGNLISMSEQQVLDC-TGGTSSCKSGYVNAALTYITASGGL 215
Score = 74.7 bits (182), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 47/161 (29%), Positives = 73/161 (45%), Gaps = 6/161 (3%)
Query: 121 TTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNL 180
T+S Y N ++T + S + + A Q C A AA G+ + +
Sbjct: 194 TSSCKSGYVNAALTYITASGGLQTEAAYAYSAEQGACRSGGASPNSAAAVGVHR----SA 249
Query: 181 IQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIFNGVCGTQLDHAVTIVGFGTTED 240
+ ++ L G + E F + + + CG +L HAVT+VG+G D
Sbjct: 250 MLNGDEGALQVLVAGQPVAVAVEAEPDFHHYKSGVYVGSPSCGQKLHHAVTVVGYGADGD 309
Query: 241 GANYWLIKNSWGNTWGDAGYMKIVRDEGL--CGIGTRSSYP 279
G YW++KN WG WG+ GYM++ R G CG+ T + YP
Sbjct: 310 GQGYWVVKNQWGAGWGEVGYMRLTRGNGGNNCGMATHAYYP 350
>gi|449447027|ref|XP_004141271.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
Length = 458
Score = 174 bits (440), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 88/201 (43%), Positives = 132/201 (65%), Gaps = 6/201 (2%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKD-ELEKEMRLKIFKENLE 76
F+ I L + S ++ R+ E V+ ++++W A+HG+ + + E E R IFK+NL+
Sbjct: 14 FFLFIALSAASPSSIIPQRTDDE--VMALYDQWRAKHGKLHNNLGAEPENRFHIFKDNLK 71
Query: 77 YIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDV 136
+I++ N + N Y+LG N F+DLTN+E+R+ Y G K S S R+ TS+ +Y D+
Sbjct: 72 FIDEINAQ-NLPYRLGLNVFADLTNEEYRSRYLGGKFASGSRRNRTSN--RYLPRLGDDL 128
Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGN 196
P S+DWR KGAV P+K+Q CG CWAF+ VA+VE I +I +G+LI LSEQ+L+DC + N
Sbjct: 129 PDSIDWRAKGAVAPVKDQGSCGSCWAFSTVASVEAINQIVTGDLIALSEQELVDCDRSYN 188
Query: 197 NGCLGGSREKAFAYIIQNQGI 217
GC GG + AF +II+N G+
Sbjct: 189 EGCNGGLMDYAFEFIIENGGL 209
Score = 84.3 bits (207), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 39/83 (46%), Positives = 55/83 (66%), Gaps = 9/83 (10%)
Query: 201 GGSREKAFAYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGY 260
GG R ++ + GIF G CGT LDH V +VG+G+ E G +YW+++NSWG +WG++GY
Sbjct: 264 GGGR----SFQLYQSGIFTGRCGTDLDHGVNVVGYGS-EGGVDYWIVRNSWGGSWGESGY 318
Query: 261 MKIVRD----EGLCGIGTRSSYP 279
+K+ R+ GLCGI SYP
Sbjct: 319 VKMQRNIASPTGLCGIAMEPSYP 341
>gi|255032|gb|AAB23155.1| COT44=cysteine proteinase homolog [Brassica napus, seedling, rapid
cycling base population CrGC5, Peptide, 328 aa]
Length = 328
Score = 173 bits (439), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 109/316 (34%), Positives = 160/316 (50%), Gaps = 80/316 (25%)
Query: 44 VEIHEKWMAQHGRSYKDEL----EKEMRLKIFKENLEYIEKANKEG-NRTYKLGTNQFSD 98
+ I+ +W +HG+S + +++ R IFK+NL +I+ N+ N TYKLG F++
Sbjct: 1 MSIYLRWSLEHGKSNSNSNGIINQQDERFNIFKDNLRFIDLHNENNKNATYKLGLTIFAN 60
Query: 99 LTNDEFRALYTGYKMPSPSHRSTTSST--FKYQN-LSMTDVPTSLDWRDKGAVTPIKNQK 155
LTNDE+R+LY G + P R T + KY ++ +VP ++DWR KGAV IK+Q
Sbjct: 61 LTNDEYRSLYLGART-EPVRRITKAKNVNMKYSAAVNDVEVPVTVDWRQKGAVNAIKDQG 119
Query: 156 ECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQ 215
CG CWAF+ AAVEGI KI +G L+ LSEQ+L+DC + N GC GG + AF +I++N
Sbjct: 120 TCGSCWAFSTAAAVEGINKIVTGELVSLSEQELVDCDKSYNQGCNGGLMDYAFQFIMKNG 179
Query: 216 GI----------FNGVCGTQLDHA--VTIVGF---------------------------- 235
G+ NG C + L ++ VTI G+
Sbjct: 180 GLNTEKDYPYHGTNGKCNSLLKNSRVVTIDGYEDVPSKDETALKRAVSYQPVSVAIDAGG 239
Query: 236 ---------------GTTED------------GANYWLIKNSWGNTWGDAGYMKIVRD-- 266
GT D G +YW+++NSWG WG+ GY+++ R+
Sbjct: 240 RAFQHYQSGIFTGKCGTNMDHAVVAVGYGSENGVDYWIVRNSWGTRWGEDGYIRMERNVA 299
Query: 267 --EGLCGIGTRSSYPL 280
G CGI +SYP+
Sbjct: 300 SKSGKCGIAIEASYPV 315
>gi|62526575|gb|AAX84673.1| cysteine protease CP1 [Manihot esculenta]
Length = 467
Score = 173 bits (439), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 89/208 (42%), Positives = 130/208 (62%), Gaps = 10/208 (4%)
Query: 18 MFIIITLLVSCASQVVSSRSTH--------EQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+F+ TL + ++S TH + V+ I+E+W+ + G+ Y E+E R +
Sbjct: 15 LFLSFTLSSASDMSIISYDQTHATKSSWRTDDEVMAIYEEWLVKQGKVYNALGEREKRFQ 74
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQ 129
+FK+NL +I++ N E NRTYKLG N F+DLTN+E+R+ Y G + +R +S +Y
Sbjct: 75 VFKDNLRFIDEHNSE-NRTYKLGLNGFADLTNEEYRSTYLGARGGMKRNRLRKTSD-RYA 132
Query: 130 NLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLL 189
+P S+DWR +GAV +K+Q CG CWAF+ +AAVEGI KI +G+LI LSEQ+L+
Sbjct: 133 PRVGESLPDSVDWRKEGAVAEVKDQGSCGSCWAFSTIAAVEGINKIVTGDLISLSEQELV 192
Query: 190 DCSTNGNNGCLGGSREKAFAYIIQNQGI 217
DC T+ N GC GG + AF +II N GI
Sbjct: 193 DCDTSYNEGCNGGLMDYAFEFIINNGGI 220
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 36/69 (52%), Positives = 51/69 (73%), Gaps = 5/69 (7%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCG 271
GIF+G CGTQLDH V VG+GT E+G +YW+++NSWG +WG+ GY+++ R G+CG
Sbjct: 287 GIFSGRCGTQLDHGVAAVGYGT-ENGKDYWIVRNSWGKSWGENGYLRMARSINSPTGICG 345
Query: 272 IGTRSSYPL 280
I +SYP+
Sbjct: 346 IAMEASYPI 354
>gi|146215978|gb|ABQ10191.1| actinidin Act1c [Actinidia eriantha]
Length = 368
Score = 173 bits (439), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 89/206 (43%), Positives = 133/206 (64%), Gaps = 9/206 (4%)
Query: 13 INTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFK 72
++ + +F L++S A + ++ T+++ V ++E W+ +HG+SY E+E R +IFK
Sbjct: 8 VSMSLLFFSTLLILSLA---LDAKRTNDE-VKAMYESWLIKHGKSYNSLGERERRFEIFK 63
Query: 73 ENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
E L +I++ N + +R+YK+G NQF+DLTN+EFR+ Y G+ S + T + +Y+
Sbjct: 64 ETLRFIDEHNADTSRSYKVGLNQFADLTNEEFRSTYLGFTRGS----NKTKVSNRYEPRV 119
Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
+P +DWR +GAV IKNQ +CG CWAF+A+AAVEGI KI +GNLI LSEQ+L+DC
Sbjct: 120 GQVLPDYVDWRSEGAVVDIKNQGQCGSCWAFSAIAAVEGINKIVTGNLISLSEQELVDCG 179
Query: 193 -TNGNNGCLGGSREKAFAYIIQNQGI 217
T GC GG F +II N GI
Sbjct: 180 RTQSTKGCDGGYMTDGFEFIINNGGI 205
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 41/68 (60%), Positives = 50/68 (73%), Gaps = 4/68 (5%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD---EGLCGI 272
GIF G CGT DHAVTIVG+GT E G +YW++KNSW TWG+ GYM+I+R+ G CGI
Sbjct: 272 GIFTGPCGTATDHAVTIVGYGT-EGGIDYWIVKNSWDTTWGEEGYMRILRNVGGAGTCGI 330
Query: 273 GTRSSYPL 280
T SYP+
Sbjct: 331 ATMPSYPV 338
>gi|18396952|ref|NP_564322.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|332192922|gb|AEE31043.1| cysteine proteinase-like protein [Arabidopsis thaliana]
Length = 334
Score = 173 bits (439), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 114/335 (34%), Positives = 173/335 (51%), Gaps = 70/335 (20%)
Query: 13 INTTPMFIIITLLVS--CASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKI 70
++ +F+ +T+L SQ + +EQS+V+ H++WM Q R YKDE EKEMRLK+
Sbjct: 2 VSVRSVFVALTILSMDLRISQARPHVTLNEQSIVDYHQQWMTQFSRVYKDESEKEMRLKV 61
Query: 71 FKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQN 130
FK+NL++IE N GN++Y LG N+F+D +EF A +TG ++ S + T +N
Sbjct: 62 FKKNLKFIENFNNMGNQSYTLGVNEFTDWKTEEFLATHTGLRVNVTSLSELFNKTKPSRN 121
Query: 131 LSMTDVPT---SLDWRDKGAVTPI-------------KN--------------QKECGCC 160
+M+D+ S DWRD+GAVTP+ KN +K GC
Sbjct: 122 WNMSDIDMEDESKDWRDEGAVTPVKYQGACRLTKISGKNLLTLSEQQLIDCDIEKNGGCN 181
Query: 161 W-----AFAAVAAVEGI------------------------TKIRSGNLI-QLSEQQLLD 190
AF + G+ T+IR ++ +E+ LL+
Sbjct: 182 GGEFEEAFKYIIKNGGVSLETEYPYQVKKESCRANARRAPHTQIRGFQMVPSHNERALLE 241
Query: 191 CSTNGNNGCLGGSREKAFAYIIQNQGIFNGV-CGTQLDHAVTIVGFGTTEDGANYWLIKN 249
L +R +F + G++ G+ CGT ++HAVTIVG+GT G NYW++KN
Sbjct: 242 AVRRQPVSVLIDARADSFGHY--KGGVYAGLDCGTDVNHAVTIVGYGTM-SGLNYWVLKN 298
Query: 250 SWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 280
SWG +WG+ GYM+I RD +G+CGI ++YP+
Sbjct: 299 SWGESWGENGYMRIRRDVEWPQGMCGIAQVAAYPV 333
>gi|171702843|dbj|BAG16377.1| cysteine protease [Brassica rapa var. perviridis]
Length = 431
Score = 173 bits (439), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 84/187 (44%), Positives = 126/187 (67%), Gaps = 7/187 (3%)
Query: 31 QVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYK 90
VSSRS E V ++E+W+ +HG++ EK+ R +IFK+NL +I++ N + N +Y+
Sbjct: 28 HTVSSRSDVE--VSRLYEEWVVKHGKAQNSLTEKDRRFEIFKDNLRFIDEHNGK-NLSYR 84
Query: 91 LGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTP 150
LG +F+DLTNDE+R++Y G ++ R T ++ +Y+ +P S+DWR +GAV
Sbjct: 85 LGLTKFADLTNDEYRSMYLGSRL----KRKATKTSLRYEARVGDAIPESVDWRKEGAVAE 140
Query: 151 IKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAY 210
+K+Q CG CWAF+ + AVEGI KI +G+LI LSEQ+L+DC T+ N GC GG + AF +
Sbjct: 141 VKDQGSCGSCWAFSTIGAVEGINKIVTGDLISLSEQELVDCDTSYNEGCNGGLMDYAFEF 200
Query: 211 IIQNQGI 217
II+N GI
Sbjct: 201 IIKNGGI 207
Score = 89.0 bits (219), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 40/84 (47%), Positives = 58/84 (69%), Gaps = 9/84 (10%)
Query: 201 GGSREKAFAYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGY 260
GG R A+ + + GIF+G+CGT LDH V VG+GT E+G +YW++KNSWG +WG++GY
Sbjct: 263 GGGR----AFQLYDSGIFDGICGTDLDHGVVAVGYGT-ENGKDYWIVKNSWGTSWGESGY 317
Query: 261 MKIVRD----EGLCGIGTRSSYPL 280
+++ R+ G CGI SYP+
Sbjct: 318 IRMERNIASSAGKCGIAVEPSYPI 341
>gi|62320725|dbj|BAD95392.1| cysteine proteinase RD21A [Arabidopsis thaliana]
Length = 433
Score = 173 bits (439), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 92/216 (42%), Positives = 134/216 (62%), Gaps = 15/216 (6%)
Query: 40 EQSVVEIHEKWMAQHGR--SYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
E V+ I+E W+ +HG+ S +EK+ R +IFK+NL ++++ N E N +Y+LG +F+
Sbjct: 43 EAEVMSIYEAWLVKHGKAQSQNSLVEKDRRFEIFKDNLRFVDEHN-EKNLSYRLGLTRFA 101
Query: 98 DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKEC 157
DLTNDE+R+ Y G KM R T+ +Y+ ++P S+DWR KGAV +K+Q C
Sbjct: 102 DLTNDEYRSKYLGAKMEKKGERRTS---LRYEARVGDELPESIDWRKKGAVAEVKDQGGC 158
Query: 158 GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
G CWAF+ + AVEGI +I +G+LI LSEQ+L+DC T+ N GC GG + AF +II+N GI
Sbjct: 159 GSCWAFSTIGAVEGINQIVTGDLITLSEQELVDCDTSYNEGCNGGLMDYAFEFIIKNGGI 218
Query: 218 -------FNGVCGT--QLDHAVTIVGFGTTEDGANY 244
+ GV GT Q+ +V + ED Y
Sbjct: 219 DTDKDYPYKGVDGTCDQIRKNAKVVTIDSYEDVPTY 254
Score = 89.0 bits (219), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 37/76 (48%), Positives = 55/76 (72%), Gaps = 5/76 (6%)
Query: 209 AYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-- 266
A+ + + GIF+G CGTQLDH V VG+GT E+G +YW+++NSWG +WG++GY+++ R+
Sbjct: 278 AFQLYDSGIFDGSCGTQLDHGVVAVGYGT-ENGKDYWIVRNSWGKSWGESGYLRMARNIA 336
Query: 267 --EGLCGIGTRSSYPL 280
G CGI SYP+
Sbjct: 337 SSSGKCGIAIEPSYPI 352
>gi|148927382|gb|ABR19827.1| cysteine proteinase [Elaeis guineensis]
Length = 470
Score = 173 bits (438), Expect = 8e-41, Method: Compositional matrix adjust.
Identities = 85/185 (45%), Positives = 123/185 (66%), Gaps = 6/185 (3%)
Query: 36 RSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN---KEGNRTYKLG 92
RS E + ++E W+A+HGR+Y EKE R +IFK+N+ +I+ N G+R+++LG
Sbjct: 41 RSEEEMRI--LYEGWLAKHGRAYNALGEKERRFEIFKDNVLFIDAHNAAADAGHRSFRLG 98
Query: 93 TNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIK 152
N+F+D+TN+E+RA+Y G + P+ R + +Y+ + D+P S+DWR KGAV +K
Sbjct: 99 LNRFADMTNEEYRAVYLGTR-PAGHRRRARVGSDRYRYNAGEDLPESVDWRAKGAVAAVK 157
Query: 153 NQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYII 212
+Q CG CWAF+ VAAVEGI KI +G+LI LSEQ+L+DC N GC GG + F +II
Sbjct: 158 DQGSCGSCWAFSTVAAVEGINKIVTGDLISLSEQELVDCDNGYNQGCNGGLMDYGFEFII 217
Query: 213 QNQGI 217
N GI
Sbjct: 218 NNGGI 222
Score = 80.1 bits (196), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 38/82 (46%), Positives = 53/82 (64%), Gaps = 9/82 (10%)
Query: 202 GSREKAFAYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYM 261
G RE + + + GIF G CGT LDH V VG+GT E+G +YW+++NSWG WG++GY+
Sbjct: 279 GGRE----FQLYHSGIFTGRCGTDLDHGVVAVGYGT-ENGKDYWIVRNSWGGDWGESGYI 333
Query: 262 KIVRD----EGLCGIGTRSSYP 279
++ R+ G CGI SYP
Sbjct: 334 RMERNVNTSTGKCGIAIEPSYP 355
>gi|225428328|ref|XP_002279940.1| PREDICTED: cysteine proteinase-like [Vitis vinifera]
Length = 707
Score = 173 bits (438), Expect = 8e-41, Method: Compositional matrix adjust.
Identities = 83/170 (48%), Positives = 120/170 (70%), Gaps = 4/170 (2%)
Query: 48 EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRAL 107
E W+++HG+ YK EK R ++F+ENL +I++ NKE + +Y LG N+F+DL+++EF++
Sbjct: 405 ESWVSKHGKVYKSMEEKLHRFEVFRENLNHIDERNKEVS-SYWLGLNEFADLSHEEFKSK 463
Query: 108 YTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVA 167
Y G + P R S F+Y++++ D+P S+DWR KGAVT +KNQ CG CWAF+ VA
Sbjct: 464 YLGLRAEFPRSRDY-SGEFRYRDVA--DLPESVDWRKKGAVTHVKNQGACGSCWAFSTVA 520
Query: 168 AVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
AVEGI +I +GNL LSEQ+L+DC T N+GC GG + AFA+I N G+
Sbjct: 521 AVEGINQIVTGNLTTLSEQELIDCDTTFNSGCNGGLMDYAFAFIASNGGL 570
Score = 80.5 bits (197), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 35/68 (51%), Positives = 50/68 (73%), Gaps = 5/68 (7%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCG 271
G+FNG CGT+LDH V VG+G+++ G +Y ++KNSWG WG+ GY+++ R+ EGLCG
Sbjct: 637 GVFNGPCGTELDHGVAAVGYGSSK-GLDYIIVKNSWGPKWGEKGYIRMKRNTGKTEGLCG 695
Query: 272 IGTRSSYP 279
I +SYP
Sbjct: 696 INKMASYP 703
>gi|355344587|gb|AER60490.1| cysteine proteases [Gossypium hirsutum]
Length = 371
Score = 173 bits (438), Expect = 8e-41, Method: Compositional matrix adjust.
Identities = 93/236 (39%), Positives = 138/236 (58%), Gaps = 15/236 (6%)
Query: 22 ITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKA 81
I+ L + SS + V+ +++ W+ QHG++Y E+E R +IFK+NL +I++
Sbjct: 20 ISTLTLNQNHPSSSSWRSDDEVMGLYKSWVIQHGKAYNGIGEEEKRFEIFKDNLRFIDEH 79
Query: 82 NKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTS--STFKYQNLSMTDVPTS 139
N N TYKLG N+F+DLTN E+RA + G + P R S + +Y + + ++P S
Sbjct: 80 NSNNNTTYKLGLNKFADLTNQEYRAKFLGTRT-DPRRRLMKSKIPSSRYAHRAGDNLPDS 138
Query: 140 LDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGC 199
+DWRD GAV+P+K+Q CG CWAF+ +A VEGI KI SG L+ LSEQ+L+DC + + GC
Sbjct: 139 VDWRDHGAVSPVKDQGSCGSCWAFSTIATVEGINKIVSGELVSLSEQELVDCDRSYDAGC 198
Query: 200 LGGSREKAFAYIIQNQGI----------FNGVCGTQLDHA--VTIVGFGTTEDGAN 243
GG + AF +I+ N GI FN C +A V+I G+ + N
Sbjct: 199 NGGLMDYAFQFIMDNGGIDTEKDYPYLGFNNQCDPTKKNAKVVSIDGYEDVPNNEN 254
Score = 85.1 bits (209), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 34/76 (44%), Positives = 52/76 (68%), Gaps = 4/76 (5%)
Query: 209 AYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR--- 265
A+ + G+FNG CG LDH V VG+GT ++G +YW+++NSWG+ WG+ GY+++ R
Sbjct: 275 AFQLYESGVFNGECGLALDHGVVAVGYGTDDNGQDYWIVRNSWGSNWGENGYIRMERNIN 334
Query: 266 -DEGLCGIGTRSSYPL 280
+ G CGI +SYP+
Sbjct: 335 ANTGKCGIAMEASYPV 350
>gi|18401614|ref|NP_564497.1| cysteine proteinase RD21a [Arabidopsis thaliana]
gi|1172873|sp|P43297.1|RD21A_ARATH RecName: Full=Cysteine proteinase RD21a; Short=RD21; Flags:
Precursor
gi|12321010|gb|AAG50628.1|AC083835_13 cysteine protease, putative [Arabidopsis thaliana]
gi|435619|dbj|BAA02374.1| thiol protease [Arabidopsis thaliana]
gi|18175926|gb|AAL59952.1| putative cysteine proteinase RD21A [Arabidopsis thaliana]
gi|22136972|gb|AAM91715.1| putative cysteine proteinase RD21A [Arabidopsis thaliana]
gi|332194014|gb|AEE32135.1| cysteine proteinase RD21a [Arabidopsis thaliana]
Length = 462
Score = 173 bits (438), Expect = 8e-41, Method: Compositional matrix adjust.
Identities = 92/216 (42%), Positives = 134/216 (62%), Gaps = 15/216 (6%)
Query: 40 EQSVVEIHEKWMAQHGR--SYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
E V+ I+E W+ +HG+ S +EK+ R +IFK+NL ++++ N E N +Y+LG +F+
Sbjct: 43 EAEVMSIYEAWLVKHGKAQSQNSLVEKDRRFEIFKDNLRFVDEHN-EKNLSYRLGLTRFA 101
Query: 98 DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKEC 157
DLTNDE+R+ Y G KM R T+ +Y+ ++P S+DWR KGAV +K+Q C
Sbjct: 102 DLTNDEYRSKYLGAKMEKKGERRTS---LRYEARVGDELPESIDWRKKGAVAEVKDQGGC 158
Query: 158 GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
G CWAF+ + AVEGI +I +G+LI LSEQ+L+DC T+ N GC GG + AF +II+N GI
Sbjct: 159 GSCWAFSTIGAVEGINQIVTGDLITLSEQELVDCDTSYNEGCNGGLMDYAFEFIIKNGGI 218
Query: 218 -------FNGVCGT--QLDHAVTIVGFGTTEDGANY 244
+ GV GT Q+ +V + ED Y
Sbjct: 219 DTDKDYPYKGVDGTCDQIRKNAKVVTIDSYEDVPTY 254
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 37/76 (48%), Positives = 55/76 (72%), Gaps = 5/76 (6%)
Query: 209 AYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-- 266
A+ + + GIF+G CGTQLDH V VG+GT E+G +YW+++NSWG +WG++GY+++ R+
Sbjct: 278 AFQLYDSGIFDGSCGTQLDHGVVAVGYGT-ENGKDYWIVRNSWGKSWGESGYLRMARNIA 336
Query: 267 --EGLCGIGTRSSYPL 280
G CGI SYP+
Sbjct: 337 SSSGKCGIAIEPSYPI 352
>gi|14517542|gb|AAK62661.1| F2G19.31/F2G19.31 [Arabidopsis thaliana]
gi|19548039|gb|AAL87383.1| F2G19.31/F2G19.31 [Arabidopsis thaliana]
Length = 462
Score = 173 bits (438), Expect = 9e-41, Method: Compositional matrix adjust.
Identities = 92/216 (42%), Positives = 134/216 (62%), Gaps = 15/216 (6%)
Query: 40 EQSVVEIHEKWMAQHGR--SYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
E V+ I+E W+ +HG+ S +EK+ R +IFK+NL ++++ N E N +Y+LG +F+
Sbjct: 43 EAEVMSIYEAWLVKHGKAQSQNSLVEKDRRFEIFKDNLRFVDEHN-EKNLSYRLGLTRFA 101
Query: 98 DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKEC 157
DLTNDE+R+ Y G KM R T+ +Y+ ++P S+DWR KGAV +K+Q C
Sbjct: 102 DLTNDEYRSKYLGAKMEKKGERRTS---LRYEARVGDELPESIDWRKKGAVAEVKDQGGC 158
Query: 158 GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
G CWAF+ + AVEGI +I +G+LI LSEQ+L+DC T+ N GC GG + AF +II+N GI
Sbjct: 159 GSCWAFSTIGAVEGINQIVTGDLITLSEQELVDCDTSYNEGCNGGLMDYAFEFIIKNGGI 218
Query: 218 -------FNGVCGT--QLDHAVTIVGFGTTEDGANY 244
+ GV GT Q+ +V + ED Y
Sbjct: 219 DTDKDYPYKGVDGTCDQIRKNAKVVTIDSYEDVPTY 254
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 37/76 (48%), Positives = 55/76 (72%), Gaps = 5/76 (6%)
Query: 209 AYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-- 266
A+ + + GIF+G CGTQLDH V VG+GT E+G +YW+++NSWG +WG++GY+++ R+
Sbjct: 278 AFQLYDSGIFDGSCGTQLDHGVVAVGYGT-ENGKDYWIVRNSWGKSWGESGYLRMARNIA 336
Query: 267 --EGLCGIGTRSSYPL 280
G CGI SYP+
Sbjct: 337 SSSGKCGIAIEPSYPI 352
>gi|226529105|ref|NP_001150196.1| cysteine protease 1 precursor [Zea mays]
gi|194701798|gb|ACF84983.1| unknown [Zea mays]
gi|194704800|gb|ACF86484.1| unknown [Zea mays]
gi|195637480|gb|ACG38208.1| cysteine protease 1 precursor [Zea mays]
gi|413919895|gb|AFW59827.1| cysteine protease 1 [Zea mays]
Length = 470
Score = 173 bits (438), Expect = 9e-41, Method: Compositional matrix adjust.
Identities = 80/184 (43%), Positives = 127/184 (69%), Gaps = 7/184 (3%)
Query: 40 EQSVVEIHEKWMAQHGRSY----KDELEKEMRLKIFKENLEYIEKAN-KEGNRTYKLGTN 94
E V +++ W+A+HGR+Y + E E++ R +F +NL +++ N + G R ++LG N
Sbjct: 50 EPEVRAMYDLWLAEHGRAYNALGEGEGERDRRFLVFWDNLRFVDAHNERAGARGFRLGMN 109
Query: 95 QFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQ 154
QF+DLTNDEFRA Y G +P+ + +++ + ++P S+DWR+KGAV P+KNQ
Sbjct: 110 QFADLTNDEFRAAYLGAMVPAARRGAVVGERYRHDG-AAEELPESVDWREKGAVAPVKNQ 168
Query: 155 KECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQ 213
+CG CWAF+AV++VE + +I +G ++ LSEQ+L++CST+ GN+GC GG + AF +II+
Sbjct: 169 GQCGSCWAFSAVSSVESVNQIVTGEMVTLSEQELVECSTDGGNSGCNGGLMDAAFDFIIK 228
Query: 214 NQGI 217
N GI
Sbjct: 229 NGGI 232
Score = 76.6 bits (187), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 36/82 (43%), Positives = 52/82 (63%), Gaps = 9/82 (10%)
Query: 202 GSREKAFAYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYM 261
G RE + + G+F+G C T LDH V VG+G E+G +YW+++NSWG WG+AGY+
Sbjct: 289 GGRE----FQLYKSGVFSGSCTTNLDHGVVAVGYGA-ENGKDYWIVRNSWGPKWGEAGYI 343
Query: 262 KIVRD----EGLCGIGTRSSYP 279
++ R+ G CGI +SYP
Sbjct: 344 RMERNVNASTGKCGIAMMASYP 365
>gi|218183|dbj|BAA14403.1| oryzain beta precursor [Oryza sativa Japonica Group]
Length = 471
Score = 173 bits (438), Expect = 9e-41, Method: Compositional matrix adjust.
Identities = 106/316 (33%), Positives = 166/316 (52%), Gaps = 80/316 (25%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDEL--EKEMRLKIFKENLEYIEKANKEGNRT--YKLGTNQ 95
E ++ W+A++G + L E E R +F +NL++++ N + ++LG N+
Sbjct: 44 EAEARAAYDLWLAENGGGSPNALGGEHERRFLVFWDNLKFVDAHNARADEGGGFRLGMNR 103
Query: 96 FSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQK 155
F+DLTN+EFRA + G K+ + RS + +Y++ + ++P S+DWR+KGAV P+KNQ
Sbjct: 104 FADLTNEEFRATFLGAKV---AERSRAAGE-RYRHDGVEELPESVDWREKGAVAPVKNQG 159
Query: 156 ECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFAYIIQN 214
+CG CWAF+AV+ VE I ++ +G +I LSEQ+L++CSTNG N+GC GG AF +II+N
Sbjct: 160 QCGSCWAFSAVSTVESINQLVTGEMITLSEQELVECSTNGQNSGCNGGLMADAFDFIIKN 219
Query: 215 QGI----------FNGVCGTQLDHA--VTIVGF--------------------------- 235
GI +G C ++A V+I GF
Sbjct: 220 GGIDTEDDYPYKAVDGKCDINRENAKVVSIDGFEDVPQNDEKSLQKAVAHQPVSVAIEAG 279
Query: 236 ----------------GTTED------------GANYWLIKNSWGNTWGDAGYMKIVRD- 266
GT+ D G +YW+++NSWG WG++GY+++ R+
Sbjct: 280 GREFQLYHSGVFSGRCGTSLDHGVVAVGYGTDNGKDYWIVRNSWGPKWGESGYVRMERNI 339
Query: 267 ---EGLCGIGTRSSYP 279
G CGI +SYP
Sbjct: 340 NVTTGKCGIAMMASYP 355
>gi|302790828|ref|XP_002977181.1| hypothetical protein SELMODRAFT_106402 [Selaginella moellendorffii]
gi|300155157|gb|EFJ21790.1| hypothetical protein SELMODRAFT_106402 [Selaginella moellendorffii]
Length = 337
Score = 173 bits (438), Expect = 9e-41, Method: Compositional matrix adjust.
Identities = 89/203 (43%), Positives = 128/203 (63%), Gaps = 11/203 (5%)
Query: 21 IITLLVSCASQVVSSR--STHEQSVVEIH---EKWMAQHGRSYKDELEKEMRLKIFKENL 75
+I L+V A+ +R + + +EI E W A+HG+SY + EK RL IF + L
Sbjct: 6 LILLVVVGATPFAIARPAALEDGRALEIKNMFEDWAAKHGKSYSSDWEKARRLMIFSDTL 65
Query: 76 EYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTG-YKMPSPSHRSTTSSTFKYQNLSMT 134
YIEK N + N T+ LG N+FSDLTN EFRA++ G +K P R +++ ++
Sbjct: 66 AYIEKHNAQPNTTFTLGLNKFSDLTNAEFRAMHVGKFKRPRYQDRLPAED----EDVDVS 121
Query: 135 DVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN 194
+PTSLDWR KGAVTPIK+Q +CG CWAF+A+A++E + + L+ LSEQQL+DC T
Sbjct: 122 SLPTSLDWRQKGAVTPIKDQGDCGSCWAFSAIASIESAHFLATKELVSLSEQQLMDCDTV 181
Query: 195 GNNGCLGGSREKAFAYIIQNQGI 217
+ GC GG E AF ++++N G+
Sbjct: 182 -DAGCDGGLMETAFKFVVKNGGV 203
Score = 78.2 bits (191), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 35/66 (53%), Positives = 46/66 (69%), Gaps = 3/66 (4%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD--EGLCGIG 273
GI +G C LDH V ++G+GT E G YW+IKNSWG +WG+ G+MKI R +G+CG+
Sbjct: 270 GILSGKCDDSLDHGVLLIGYGT-EGGMPYWIIKNSWGTSWGEDGFMKIERKDGDGMCGMN 328
Query: 274 TRSSYP 279
SSYP
Sbjct: 329 GDSSYP 334
>gi|326520659|dbj|BAJ92693.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 289
Score = 172 bits (437), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 83/199 (41%), Positives = 127/199 (63%), Gaps = 6/199 (3%)
Query: 22 ITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKA 81
++L + +VS E+ V ++ +WMA+HG +Y E+E R + F++NL YI++
Sbjct: 18 VSLAAAADMSIVSYGERSEEEVRRMYAEWMAEHGSTYNAIGEEERRFEAFRDNLRYIDQH 77
Query: 82 N---KEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPT 138
N G +++LG N+F+DLTN+E+R+ Y G + R ++ +YQ ++P
Sbjct: 78 NAAADAGVHSFRLGLNRFADLTNEEYRSTYLGARTKPDRERKLSA---RYQAADNDELPE 134
Query: 139 SLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNG 198
S+DWR KGAV +K+Q CG CWAF+A+AAVEGI +I +G++I LSEQ+L+DC T+ N G
Sbjct: 135 SVDWRKKGAVGAVKDQGGCGSCWAFSAIAAVEGINQIVTGDMIPLSEQELVDCDTSYNQG 194
Query: 199 CLGGSREKAFAYIIQNQGI 217
C GG + AF +II N GI
Sbjct: 195 CNGGLMDYAFEFIINNGGI 213
>gi|226503129|ref|NP_001149806.1| LOC100283433 precursor [Zea mays]
gi|195634783|gb|ACG36860.1| xylem cysteine proteinase 2 precursor [Zea mays]
gi|219884977|gb|ACL52863.1| unknown [Zea mays]
Length = 377
Score = 172 bits (437), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 85/182 (46%), Positives = 122/182 (67%), Gaps = 7/182 (3%)
Query: 38 THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
T +V + E+W+A++ ++Y EK R ++FK+NL +I++AN++ +Y LG N F+
Sbjct: 63 TQHDRLVRLFEEWVAKYRKAYGSFEEKLRRFEVFKDNLHHIDEANRKEVTSYWLGLNAFA 122
Query: 98 DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDV--PTSLDWRDKGAVTPIKNQK 155
DLT+DEF+A Y G +P + T+ F+Y + P S+DWR KGAVT +KNQ
Sbjct: 123 DLTHDEFKATYLGL-LP----KRTSGGRFRYGGVGDGGDEVPASVDWRKKGAVTEVKNQG 177
Query: 156 ECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQ 215
+CG CWAF+ VAAVEGI +I +GNL LSEQQL+DCST+GNNGC GG + AF++I
Sbjct: 178 QCGSCWAFSTVAAVEGINQIVTGNLTSLSEQQLVDCSTDGNNGCSGGVMDNAFSFIATGA 237
Query: 216 GI 217
G+
Sbjct: 238 GL 239
Score = 76.3 bits (186), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 33/68 (48%), Positives = 49/68 (72%), Gaps = 5/68 (7%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR----DEGLCG 271
G+F+G CG++LDH V VG+G+++ G +Y ++KNSWG WG+ GY+++ R EGLCG
Sbjct: 307 GVFDGPCGSELDHGVAAVGYGSSK-GQDYIIVKNSWGTHWGEKGYIRMKRGTGKPEGLCG 365
Query: 272 IGTRSSYP 279
I +SYP
Sbjct: 366 INKMASYP 373
>gi|34223513|gb|AAQ62999.1| oil palm polygalacturonase allergen PEST472 [Elaeis guineensis]
Length = 525
Score = 172 bits (437), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 82/181 (45%), Positives = 123/181 (67%), Gaps = 4/181 (2%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN---KEGNRTYKLGTNQF 96
E+ + ++E W+A+HGR+ EKE R +IFK+N+ +I+ N G+R+++LG N+F
Sbjct: 43 EEEMRLLYEGWLAKHGRADNALGEKERRFEIFKDNVRFIDAHNAAADSGHRSFRLGLNRF 102
Query: 97 SDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKE 156
+D+TN+E+R +Y G + P+ R + +Y+ + ++P S+DWRDKGAVT +K+Q
Sbjct: 103 ADMTNEEYRTVYLGTR-PASHRRRARLGSDRYRYNAGEELPESVDWRDKGAVTTVKDQGS 161
Query: 157 CGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQG 216
CG CWAF+ +AAVEGI KI +G+LI LSEQ+L+DC N GC GG + AF +II N G
Sbjct: 162 CGSCWAFSTIAAVEGINKIVTGDLISLSEQELVDCDNGQNQGCNGGLMDYAFEFIINNGG 221
Query: 217 I 217
I
Sbjct: 222 I 222
Score = 82.0 bits (201), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 39/82 (47%), Positives = 54/82 (65%), Gaps = 9/82 (10%)
Query: 202 GSREKAFAYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYM 261
G RE + + + GIF G CGT LDH V VG+GT E+G +YW+++NSWG WG++GY+
Sbjct: 279 GGRE----FQLYHSGIFTGRCGTDLDHGVVAVGYGT-ENGKDYWIVRNSWGGDWGESGYI 333
Query: 262 KIVRD----EGLCGIGTRSSYP 279
++ R+ G CGI SSYP
Sbjct: 334 RMERNVNASTGKCGIAMESSYP 355
>gi|324983200|gb|ADY68475.1| stem bromelain [Ananas comosus]
Length = 291
Score = 172 bits (437), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 87/201 (43%), Positives = 127/201 (63%), Gaps = 6/201 (2%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
+F+ + L V AS +SR +++ E+WMA++GR YKD EK R +IFK N+ +
Sbjct: 8 VFLFLFLCVMWASPSAASRDEPSDPMMKRFEEWMAEYGRVYKDNDEKMRRFQIFKNNVNH 67
Query: 78 IEKANKEGNRTYKLGTNQFSDLTNDEFRALYT-GYKMPSPSHRSTTSSTFKYQNLSMTDV 136
IE N +Y LG N+F+D+TN+EF A YT G P + S + +++++ V
Sbjct: 68 IETFNNRNGNSYTLGINKFTDMTNNEFVAQYTGGISRPLNIEKEPVVS---FDDVNISAV 124
Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGN 196
S+DWRD GAVT +K+Q CG CWAF+A+A VEGI KI +G L+ LSEQ++LDC+ +
Sbjct: 125 GQSIDWRDYGAVTEVKDQNPCGSCWAFSAIATVEGIYKIVTGYLVSLSEQEVLDCAV--S 182
Query: 197 NGCLGGSREKAFAYIIQNQGI 217
NGC GG + A+ +II N G+
Sbjct: 183 NGCDGGFVDNAYDFIISNNGV 203
>gi|413942348|gb|AFW74997.1| Xylem cysteine proteinase 2 [Zea mays]
Length = 391
Score = 172 bits (437), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 85/182 (46%), Positives = 122/182 (67%), Gaps = 7/182 (3%)
Query: 38 THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
T +V + E+W+A++ ++Y EK R ++FK+NL +I++AN++ +Y LG N F+
Sbjct: 77 TQHDRLVRLFEEWVAKYRKAYGSFEEKLRRFEVFKDNLHHIDEANRKEVTSYWLGLNAFA 136
Query: 98 DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDV--PTSLDWRDKGAVTPIKNQK 155
DLT+DEF+A Y G +P + T+ F+Y + P S+DWR KGAVT +KNQ
Sbjct: 137 DLTHDEFKATYLGL-LP----KRTSGGRFRYGGVGDGGDEVPASVDWRKKGAVTEVKNQG 191
Query: 156 ECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQ 215
+CG CWAF+ VAAVEGI +I +GNL LSEQQL+DCST+GNNGC GG + AF++I
Sbjct: 192 QCGSCWAFSTVAAVEGINQIVTGNLTSLSEQQLVDCSTDGNNGCSGGVMDNAFSFIATGA 251
Query: 216 GI 217
G+
Sbjct: 252 GL 253
Score = 76.3 bits (186), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 33/68 (48%), Positives = 49/68 (72%), Gaps = 5/68 (7%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR----DEGLCG 271
G+F+G CG++LDH V VG+G+++ G +Y ++KNSWG WG+ GY+++ R EGLCG
Sbjct: 321 GVFDGPCGSELDHGVAAVGYGSSK-GQDYIIVKNSWGTHWGEKGYIRMKRGTGKPEGLCG 379
Query: 272 IGTRSSYP 279
I +SYP
Sbjct: 380 INKMASYP 387
>gi|118486542|gb|ABK95110.1| unknown [Populus trichocarpa]
Length = 465
Score = 172 bits (437), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 87/194 (44%), Positives = 125/194 (64%), Gaps = 11/194 (5%)
Query: 32 VVSSRSTH--------EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANK 83
++S TH + V+ ++E+W+ +HG++Y EKE R +IFK+NL +I++ N
Sbjct: 28 IISYHQTHATKSSWRTDDEVMAMYEEWLVKHGKNYNALGEKEKRFEIFKDNLMFIDQHNS 87
Query: 84 EGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWR 143
E NRTY +G N+F+DLTN+EFR++Y G + TS +Y +P S+DWR
Sbjct: 88 E-NRTYTVGLNRFADLTNEEFRSMYLGTRTGHKKRLPKTSD--RYAPRVGDSLPDSVDWR 144
Query: 144 DKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGS 203
+GAV +K+Q CG CWAF+ +AAVEGI KI +G+LI LSEQ+L+DC T+ N GC GG
Sbjct: 145 KEGAVAEVKDQGGCGSCWAFSTIAAVEGINKIVTGDLIALSEQELVDCDTSYNEGCNGGL 204
Query: 204 REKAFAYIIQNQGI 217
+ AF +II N GI
Sbjct: 205 MDYAFEFIINNGGI 218
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 38/84 (45%), Positives = 54/84 (64%), Gaps = 9/84 (10%)
Query: 201 GGSREKAFAYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGY 260
GG R + + N G+F G CGT LDH V VG+GT E G +YW+++NSWG +WG++GY
Sbjct: 274 GGGRN----FQLYNSGVFTGECGTSLDHGVAAVGYGT-EKGKDYWIVRNSWGKSWGESGY 328
Query: 261 MKIVRD----EGLCGIGTRSSYPL 280
+++ R+ G CGI SYP+
Sbjct: 329 IRMERNIASPTGKCGIAIEPSYPI 352
>gi|255567869|ref|XP_002524912.1| cysteine protease, putative [Ricinus communis]
gi|223535747|gb|EEF37409.1| cysteine protease, putative [Ricinus communis]
Length = 366
Score = 172 bits (437), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 95/243 (39%), Positives = 140/243 (57%), Gaps = 23/243 (9%)
Query: 15 TTPMFIIITLLVSCASQVVSSRSTH--------EQSVVEIHEKWMAQHGRSYKDELEKEM 66
+T +F+ TL + ++S H + V+ ++ W+A+H ++Y E+E
Sbjct: 8 STLLFLFFTLSSAWDMSILSHNHGHHHQSSWRSDNEVISMYNWWLAKHSKTYNKLGEREK 67
Query: 67 RLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHR--STTSS 124
R +IFK NL +I++ N NRTYK+G +F+DLTN+E+RA + G K P R + +
Sbjct: 68 RFEIFKNNLRFIDEHNNSKNRTYKVGLTRFADLTNEEYRAKFLGTK-SDPKRRLMKSKNP 126
Query: 125 TFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLS 184
+ +Y + +P S+DWR GAV+ IK+Q CG CWAF+ +AAVEG+ KI +G LI LS
Sbjct: 127 SQRYAFKAGDVLPESIDWRQSGAVSAIKDQGSCGSCWAFSTIAAVEGVNKIVTGELISLS 186
Query: 185 EQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI----------FNGVCGTQ--LDHAVTI 232
EQ+L+DC + N GC GG + AF +II N GI +G C T + AVTI
Sbjct: 187 EQELVDCDRSYNAGCNGGLMDNAFQFIINNGGIDTDKDYPYQAVDGKCDTTKVKNKAVTI 246
Query: 233 VGF 235
GF
Sbjct: 247 DGF 249
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 38/70 (54%), Positives = 49/70 (70%), Gaps = 6/70 (8%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-----EGLC 270
G+F G CG+ LDH V IVG+GT EDG +YWL++NSWG WG+ GY+K+ R+ G C
Sbjct: 286 GVFTGECGSALDHGVVIVGYGT-EDGIDYWLVRNSWGRDWGENGYIKMQRNVVDTFTGKC 344
Query: 271 GIGTRSSYPL 280
GI SSYP+
Sbjct: 345 GIAMESSYPI 354
>gi|182375363|gb|ACB87490.1| mucunain [Mucuna pruriens]
Length = 422
Score = 172 bits (437), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 81/175 (46%), Positives = 121/175 (69%), Gaps = 2/175 (1%)
Query: 44 VEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDE 103
+ ++E+W+ +HG++Y EK+ R IFK+NL +I+ N + NRTYKLG N+F+DLTN+E
Sbjct: 1 MSLYEQWLVKHGKAYNALGEKDKRFDIFKDNLRFIDDHNAD-NRTYKLGLNRFADLTNEE 59
Query: 104 FRALYTGYKM-PSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWA 162
+RA Y G ++ P+ T + + +Y ++P S+DWR++ AV P+K+Q CG CWA
Sbjct: 60 YRARYLGTRIDPNRRFVKTKTQSNRYAPRVGDNLPESVDWRNESAVLPVKDQGNCGSCWA 119
Query: 163 FAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
F+ + AVEGI KI +G+LI LSEQ+L+DC T+ N GC GG + A+ +II N GI
Sbjct: 120 FSTIGAVEGINKIVTGDLISLSEQELVDCDTSYNQGCNGGLMDYAYEFIINNGGI 174
Score = 77.4 bits (189), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 36/85 (42%), Positives = 53/85 (62%), Gaps = 10/85 (11%)
Query: 201 GGSREKAFAYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGY 260
GG RE + + G+F G CGT LDH V VG+G+ + G +YW+++NSWG +WG+ GY
Sbjct: 230 GGGRE----FQLYVSGVFTGRCGTALDHGVVAVGYGSVK-GHDYWIVRNSWGASWGEEGY 284
Query: 261 MKIVRD-----EGLCGIGTRSSYPL 280
+++ R+ G CGI SYP+
Sbjct: 285 VRLERNLAKSRSGKCGIAIEPSYPI 309
>gi|125540888|gb|EAY87283.1| hypothetical protein OsI_08685 [Oryza sativa Indica Group]
Length = 357
Score = 172 bits (437), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 100/260 (38%), Positives = 146/260 (56%), Gaps = 37/260 (14%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEK--------------WMAQHGRSYKDELE 63
M ++ V+C++ + S H+ SVV ++ W +H + Y E
Sbjct: 7 MLFLLLGFVACSA----TASHHDPSVVGYSQEDLALPNKLVGLFTSWSVKHSKIYASPKE 62
Query: 64 KEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTT- 122
K R +IFK NL +I + N+ N +Y LG N F+D+ ++EF+A Y G K P + R
Sbjct: 63 KVKRYEIFKRNLRHIVETNRR-NGSYWLGLNHFADIAHEEFKASYLGLK-PGLARRDAQP 120
Query: 123 --SSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNL 180
S+TF+Y N ++P ++DWR KGAVTP+KNQ ECG CWAF+ VAAVEGI +I +G L
Sbjct: 121 HGSTTFRYAN--AVNLPWAVDWRKKGAVTPVKNQGECGSCWAFSTVAAVEGINQIVTGKL 178
Query: 181 IQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIF----------NGVCGTQLDHA- 229
+ LSEQ+L+DC N+GC GG + AFAYI+ NQGI+ G C + H+
Sbjct: 179 VSLSEQELMDCDNTFNHGCRGGLMDFAFAYIMGNQGIYTEEDYPYLMEEGYCREKQPHSK 238
Query: 230 -VTIVGFGTTEDGANYWLIK 248
+TI G+ + + L+K
Sbjct: 239 VITITGYEDVPENSETSLLK 258
Score = 71.2 bits (173), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 39/86 (45%), Positives = 51/86 (59%), Gaps = 9/86 (10%)
Query: 198 GCLGGSREKAFAYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGD 257
G GSR+ F GIF+G CG Q DHA+T VG+G+ G +Y ++KNSWG WG+
Sbjct: 268 GIAAGSRDFQF----YKGGIFDGECGIQPDHALTAVGYGSYY-GQDYIIMKNSWGKNWGE 322
Query: 258 AGYMKIVR----DEGLCGIGTRSSYP 279
GY +I R EG+C I +SYP
Sbjct: 323 QGYFRIRRGTGKPEGVCDIYKIASYP 348
>gi|126681066|gb|ABO26562.1| cathepsin L-like cysteine protease [Ixodes ricinus]
Length = 335
Score = 172 bits (436), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 106/304 (34%), Positives = 150/304 (49%), Gaps = 76/304 (25%)
Query: 52 AQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEFRALY 108
A+HG+SY E E+ RLKI+ EN I K N++ G Y + N+F D+ + EF +
Sbjct: 32 AKHGKSYVSETEEVFRLKIYMENRHKIAKHNEKYARGEVPYSMAMNEFGDMLHHEFVSTR 91
Query: 109 TGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAA 168
G+K S+ + +N+ +P ++DWR KGAVTP+KNQ +CG CWAF+A +
Sbjct: 92 NGFKRNYKDQPREGSTYLEPENIEDFSLPKTVDWRTKGAVTPVKNQGQCGSCWAFSATGS 151
Query: 169 VEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGI-------FNG 220
+EG +SG+++ LSEQ L+ CST+ GNNGC GG + AF YI N+GI +NG
Sbjct: 152 LEGQHFRKSGSMVSLSEQNLVGCSTDFGNNGCEGGLMDDAFKYIRANKGIDTEKSYPYNG 211
Query: 221 VCG------------------------TQLDHAVTIV----------------------- 233
G TQL AV V
Sbjct: 212 TDGTCHFKKSTVGATDSGFVDIKEGSETQLKKAVATVGPISVAIDASHESFQFYSDGVYD 271
Query: 234 ----------------GFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRS 276
G+GT +G +YW +KNSWG TWGD GY+++ R+ + CGI + +
Sbjct: 272 EPECDSESLDHGVLVVGYGTL-NGTDYWFVKNSWGTTWGDEGYIRMSRNKKNQCGIASSA 330
Query: 277 SYPL 280
S PL
Sbjct: 331 SIPL 334
>gi|326493368|dbj|BAJ85145.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 436
Score = 172 bits (436), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 83/199 (41%), Positives = 127/199 (63%), Gaps = 6/199 (3%)
Query: 22 ITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKA 81
++L + +VS E+ V ++ +WMA+HG +Y E+E R + F++NL YI++
Sbjct: 18 VSLAAAADMSIVSYGERSEEEVRRMYAEWMAEHGSTYNAIGEEERRFEAFRDNLRYIDQH 77
Query: 82 N---KEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPT 138
N G +++LG N+F+DLTN+E+R+ Y G + R ++ +YQ ++P
Sbjct: 78 NAAADAGVHSFRLGLNRFADLTNEEYRSTYLGARTKPDRERKLSA---RYQAADNDELPE 134
Query: 139 SLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNG 198
S+DWR KGAV +K+Q CG CWAF+A+AAVEGI +I +G++I LSEQ+L+DC T+ N G
Sbjct: 135 SVDWRKKGAVGAVKDQGGCGSCWAFSAIAAVEGINQIVTGDMIPLSEQELVDCDTSYNQG 194
Query: 199 CLGGSREKAFAYIIQNQGI 217
C GG + AF +II N GI
Sbjct: 195 CNGGLMDYAFEFIINNGGI 213
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 37/75 (49%), Positives = 50/75 (66%), Gaps = 5/75 (6%)
Query: 209 AYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-- 266
A+ + GIF G CGT LDH V VG+GT E+G +YWL++NSWG+ WG+ GY+++ R+
Sbjct: 273 AFQLYKSGIFTGTCGTALDHGVAAVGYGT-ENGKDYWLVRNSWGSVWGEDGYIRMERNIK 331
Query: 267 --EGLCGIGTRSSYP 279
G CGI SYP
Sbjct: 332 ASSGKCGIAVEPSYP 346
>gi|148927394|gb|ABR19828.1| cysteine proteinase [Elaeis guineensis]
Length = 469
Score = 172 bits (436), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 84/182 (46%), Positives = 120/182 (65%), Gaps = 4/182 (2%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNR---TYKLGTNQF 96
+ V +++ W AQH RSY E E RL+IF++NL +I++ N N +++LG +F
Sbjct: 40 DDEVHRLYQAWKAQHARSYNALDEDEQRLEIFRDNLRFIDQHNAAANAGKYSFRLGLTRF 99
Query: 97 SDLTNDEFRALYTGYKMP-SPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQK 155
+DLTN+E+R+ Y G + S R++T + +Y+ S D+P S+DWRDKGAV +K+Q
Sbjct: 100 ADLTNEEYRSTYLGVRTAGSRRRRNSTVGSNRYRFRSSDDLPDSIDWRDKGAVVDVKDQG 159
Query: 156 ECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQ 215
CG CWAF+ +AAVEGI I +G+LI LSEQ+L+DC T N GC GG + AF +II N
Sbjct: 160 SCGSCWAFSTIAAVEGINHIVTGDLISLSEQELVDCDTYYNQGCNGGLMDYAFEFIISNG 219
Query: 216 GI 217
GI
Sbjct: 220 GI 221
Score = 83.6 bits (205), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 36/76 (47%), Positives = 54/76 (71%), Gaps = 5/76 (6%)
Query: 209 AYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-- 266
A+ + GIF G CGT+LDH VT +G+G+ E+G YW++KNSWG+ WG++GY+++ R+
Sbjct: 281 AFQLYESGIFTGYCGTELDHGVTAIGYGS-ENGKYYWIVKNSWGSDWGESGYIRMERNIN 339
Query: 267 --EGLCGIGTRSSYPL 280
G CGI +SYP+
Sbjct: 340 SATGKCGIAMEASYPI 355
>gi|58201350|gb|AAW66796.1| cysteine protease [Pinus taeda]
gi|58201354|gb|AAW66798.1| cysteine protease [Pinus taeda]
gi|58201358|gb|AAW66800.1| cysteine protease [Pinus taeda]
gi|58201378|gb|AAW66810.1| cysteine protease [Pinus taeda]
gi|58201390|gb|AAW66816.1| cysteine protease [Pinus taeda]
gi|58201396|gb|AAW66819.1| cysteine protease [Pinus taeda]
gi|58201404|gb|AAW66823.1| cysteine protease [Pinus taeda]
gi|58201408|gb|AAW66825.1| cysteine protease [Pinus taeda]
Length = 193
Score = 172 bits (436), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 80/169 (47%), Positives = 120/169 (71%), Gaps = 3/169 (1%)
Query: 32 VVSSRSTHEQ-SVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYK 90
++S++ E +++E++E W+A+H ++Y EK+ R +FK+N YI + N +GNR+YK
Sbjct: 26 IISNKDLREDDAIMELYELWLAEHKKAYNGLDEKQKRFTVFKDNFLYIHEHN-QGNRSYK 84
Query: 91 LGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTP 150
LG NQF+DL+++EF+A Y G K+ + R S + +YQ D+P S+DWR+KGAV P
Sbjct: 85 LGLNQFADLSHEEFKATYLGAKLDTKK-RLLRSPSPRYQYSDGEDLPKSIDWREKGAVAP 143
Query: 151 IKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGC 199
+K+Q CG CWAF+ VAAVEGI +I +G+LI LSEQ+L+DC T+ N GC
Sbjct: 144 VKDQGACGSCWAFSTVAAVEGINQIVTGDLISLSEQELVDCDTSYNQGC 192
>gi|356543112|ref|XP_003540007.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 345
Score = 172 bits (436), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 86/188 (45%), Positives = 122/188 (64%), Gaps = 5/188 (2%)
Query: 33 VSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLG 92
+ SR E E HE WMAQ+G+ YKD EK+ R +IFK N+ +IE N G++ + L
Sbjct: 24 IMSRRLFEACTSERHENWMAQYGKVYKDAAEKKKRFQIFKNNVHFIESFNTAGDKPFNLS 83
Query: 93 TNQFSDLTNDEFRALYTG--YKMPSPSHRST-TSSTFKYQNLSMTDVPTSLDWRDKGAVT 149
NQF+DL ++EF+AL T K+ S +T T ++FKY + T + ++DWR +GAVT
Sbjct: 84 INQFADLHDEEFKALLTNGNKKVRSVVGTATETETSFKYNRV--TKLLATMDWRKRGAVT 141
Query: 150 PIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFA 209
PIK+Q+ CG CWAF+AVAA+EGI +I + L+ LSEQ+L+DC + GC GG E AF
Sbjct: 142 PIKDQRRCGSCWAFSAVAAIEGIHQITTSKLVSLSEQELVDCVKGESEGCNGGYMEDAFE 201
Query: 210 YIIQNQGI 217
++ + GI
Sbjct: 202 FVAKKGGI 209
Score = 91.7 bits (226), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 39/77 (50%), Positives = 51/77 (66%), Gaps = 4/77 (5%)
Query: 209 AYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-- 266
A+ + GIF G CGT DHA+T+VG+G + G YWL+KNSWG WG+ GY+++ RD
Sbjct: 269 AFQFYSSGIFTGKCGTNTDHAITVVGYGKSRGGTKYWLVKNSWGAGWGEKGYIRMKRDIR 328
Query: 267 --EGLCGIGTRSSYPLA 281
EGLCGI + YP A
Sbjct: 329 AKEGLCGIAMNAFYPTA 345
>gi|116788286|gb|ABK24823.1| unknown [Picea sitchensis]
Length = 294
Score = 172 bits (436), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 84/200 (42%), Positives = 126/200 (63%), Gaps = 2/200 (1%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
+ +++ + S + + R E ++ + ++W HG++Y + ++ +R ++FKENL Y
Sbjct: 10 LVMLLLVFSSVTAITYNPRDLSENGLLSLFDRWCNHHGKTYTAK-QRPLRFQVFKENLFY 68
Query: 78 IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
I + N GN T+ LG N FSDLT+DEFR G + PS +S K L + ++P
Sbjct: 69 ISEHNSRGNHTFWLGLNAFSDLTSDEFRTQQMGLRGHPPSLKSRRREP-KSGLLELYNIP 127
Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNN 197
+SLDWRDK AVT +K+Q CG CWAF+A A+EGI KI +G+L+ LSEQ+L DC T+ N+
Sbjct: 128 SSLDWRDKDAVTGVKDQGACGDCWAFSATGAIEGINKIVTGSLVSLSEQELCDCDTSYNS 187
Query: 198 GCLGGSREKAFAYIIQNQGI 217
GC GG + AF ++I N GI
Sbjct: 188 GCDGGLMDYAFQWVIVNGGI 207
>gi|449524070|ref|XP_004169046.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like, partial
[Cucumis sativus]
Length = 314
Score = 172 bits (435), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 87/176 (49%), Positives = 119/176 (67%), Gaps = 9/176 (5%)
Query: 37 STHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQF 96
S+ + + ++KWM ++GR YK E E R I++ N++YI+ N N ++ L N F
Sbjct: 9 SSCSSDIQDRYQKWMDKYGRQYKSREEWERRFTIYQANVQYIDNFNSM-NHSHTLAENNF 67
Query: 97 SDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKE 156
+DLTN+EF+A Y GYK S + F+Y N M ++PT++DWR +GAVTPIKNQ +
Sbjct: 68 ADLTNEEFKATYLGYKTVS-----IPDTCFRYGN--MVNLPTNVDWRQEGAVTPIKNQGQ 120
Query: 157 CGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-TNGNNGCLGGSREKAFAYI 211
CG CWAF+AVAAVEGI KI++G LI LSEQ+L+DC T+GN GC GG KAF +I
Sbjct: 121 CGSCWAFSAVAAVEGINKIKAGKLISLSEQELVDCDVTSGNQGCNGGYMYKAFEFI 176
Score = 84.0 bits (206), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 51/158 (32%), Positives = 83/158 (52%), Gaps = 22/158 (13%)
Query: 128 YQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQ 187
++ + T + T +++ +GA + QKE + F +++ E + +L Q
Sbjct: 173 FEFIKRTGLTTEIEYPYQGAESACNEQKEK---YQFVSISGYEKVPVNDEKSLKAAVANQ 229
Query: 188 LLDCSTN--GNNGCLGGSREKAFAYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYW 245
+ + + GNN + + GIF+G CG QL+H V IVG+G T + A YW
Sbjct: 230 PVSVAIDAEGNN------------FQFYSGGIFSGNCGNQLNHGVAIVGYGETSNQA-YW 276
Query: 246 LIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 279
L+KNSWG WG++GY+++ RD +G CGI +SYP
Sbjct: 277 LVKNSWGTDWGESGYIRMKRDSTDKQGTCGIAMMASYP 314
>gi|449460678|ref|XP_004148072.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Cucumis
sativus]
Length = 317
Score = 172 bits (435), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 87/176 (49%), Positives = 119/176 (67%), Gaps = 9/176 (5%)
Query: 37 STHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQF 96
S+ + + ++KWM ++GR YK E E R I++ N++YI+ N N ++ L N F
Sbjct: 9 SSCSSDIQDRYQKWMDKYGRQYKSREEWERRFTIYQANVQYIDNFNSM-NHSHTLAENNF 67
Query: 97 SDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKE 156
+DLTN+EF+A Y GYK S + F+Y N M ++PT++DWR +GAVTPIKNQ +
Sbjct: 68 ADLTNEEFKATYLGYKTVS-----IPDTCFRYGN--MVNLPTNVDWRQEGAVTPIKNQGQ 120
Query: 157 CGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-TNGNNGCLGGSREKAFAYI 211
CG CWAF+AVAAVEGI KI++G LI LSEQ+L+DC T+GN GC GG KAF +I
Sbjct: 121 CGSCWAFSAVAAVEGINKIKAGKLISLSEQELVDCDVTSGNQGCNGGYMYKAFEFI 176
Score = 83.6 bits (205), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 51/158 (32%), Positives = 83/158 (52%), Gaps = 22/158 (13%)
Query: 128 YQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQ 187
++ + T + T +++ +GA + QKE + F +++ E + +L Q
Sbjct: 173 FEFIKRTGLTTEIEYPYQGAESACNEQKEK---YQFVSISGYEKVPVNDEKSLKAAVANQ 229
Query: 188 LLDCSTN--GNNGCLGGSREKAFAYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYW 245
+ + + GNN + + GIF+G CG QL+H V IVG+G T + A YW
Sbjct: 230 PVSVAIDAEGNN------------FQFYSGGIFSGNCGNQLNHGVAIVGYGETSNQA-YW 276
Query: 246 LIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 279
L+KNSWG WG++GY+++ RD +G CGI +SYP
Sbjct: 277 LVKNSWGTDWGESGYIRMKRDSTDRQGTCGIAMMASYP 314
>gi|226507844|ref|NP_001148894.1| LOC100282514 precursor [Zea mays]
gi|194703250|gb|ACF85709.1| unknown [Zea mays]
gi|195622994|gb|ACG33327.1| vignain precursor [Zea mays]
Length = 356
Score = 172 bits (435), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 92/229 (40%), Positives = 137/229 (59%), Gaps = 27/229 (11%)
Query: 28 CASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNR 87
C + +V+ ++E E+WM +HGR Y D EK+ RL++++ N+E +E N GN
Sbjct: 18 CGAALVA----RADPMLERFEQWMGRHGRLYADAGEKQRRLEVYRRNVELVETFNSMGN- 72
Query: 88 TYKLGTNQFSDLTNDEFRALYTGYKMPSP---SHRSTTSSTFKYQNLSM------TDVPT 138
Y+L N+F+DLTN+EFRA G+ P + ST ST + +D+P
Sbjct: 73 GYRLADNKFADLTNEEFRAKMLGFGRPRSGGGAGHSTAPSTVACIGSGLMGRQGYSDLPK 132
Query: 139 SLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNG 198
S+DWR+KGAV P+K+Q +CG CWAF+AVAA+EGI +I++G L+ LSEQ+L+DC T G
Sbjct: 133 SVDWREKGAVAPVKSQGDCGSCWAFSAVAAIEGINQIKNGKLVSLSEQELVDCDTKA-IG 191
Query: 199 CLGGSREKAFAYIIQNQGI----------FNGVCGTQ--LDHAVTIVGF 235
C GG AF ++++N+G+ NG C T + AV+I G+
Sbjct: 192 CAGGYMSWAFEFVMKNRGLTTERNYPYQGLNGACQTPKLKESAVSISGY 240
>gi|242049716|ref|XP_002462602.1| hypothetical protein SORBIDRAFT_02g028840 [Sorghum bicolor]
gi|241925979|gb|EER99123.1| hypothetical protein SORBIDRAFT_02g028840 [Sorghum bicolor]
Length = 384
Score = 172 bits (435), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 82/188 (43%), Positives = 119/188 (63%), Gaps = 18/188 (9%)
Query: 43 VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTND 102
++E E+WM +HGR Y D EK+ RL++++ N+ +E N N Y+L N+F+DLTN+
Sbjct: 28 MLERFEQWMGRHGRLYADAGEKQRRLEVYRRNVALVETFNSMSNGGYRLADNKFADLTNE 87
Query: 103 EFRALYTGYKMPSPSHRSTTSSTF-------------KYQNLSMTDVPTSLDWRDKGAVT 149
EFRA G+ P P R+T +T +Y + ++P S+DWR+KGAV
Sbjct: 88 EFRAKMLGFGRPPPHGRATGHTTTPGTVACIGSGLGRRYSD----ELPKSVDWREKGAVA 143
Query: 150 PIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFA 209
P+KNQ ECG CWAF+AVAA+EGI +I++G L+ LSEQ+L+DC T GC GG AF
Sbjct: 144 PVKNQGECGSCWAFSAVAAIEGINQIKNGKLVSLSEQELVDCDTKA-IGCAGGYMSWAFE 202
Query: 210 YIIQNQGI 217
+++ N G+
Sbjct: 203 FVMNNSGL 210
Score = 76.3 bits (186), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 36/88 (40%), Positives = 50/88 (56%), Gaps = 14/88 (15%)
Query: 207 AFAYIIQNQGIFNGVCGTQLDHAVTIVGFGTTE----------DGANYWLIKNSWGNTWG 256
+F + + G+F G C L+H VT+VG+G T+ G YW++KNSWG WG
Sbjct: 296 SFVWQLYGGGVFTGPCTADLNHGVTVVGYGETQRDTDGDGTGVPGQKYWIVKNSWGPEWG 355
Query: 257 DAGYMKIVRD----EGLCGIGTRSSYPL 280
DAGY+ + R+ GLCGI SYP+
Sbjct: 356 DAGYILMQREASVASGLCGIALLPSYPV 383
>gi|52546920|gb|AAU81593.1| cysteine proteinase [Petunia x hybrida]
Length = 210
Score = 172 bits (435), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 91/197 (46%), Positives = 128/197 (64%), Gaps = 17/197 (8%)
Query: 53 QHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYK 112
QHG+ Y+ EK R +IFKENL++I++ NK + Y LG N+FSDL++DEF+ +Y G K
Sbjct: 3 QHGKIYESIEEKLHRFEIFKENLKHIDERNKIVS-NYWLGLNEFSDLSHDEFKKMYLGLK 61
Query: 113 MPSP--SHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVE 170
+ +++ + F+Y++ D+P S+DWR KGAVTP+KNQ +CG CWAF+ VAAVE
Sbjct: 62 VDHDLLNNKKQSQQDFEYRDF--VDLPKSVDWRKKGAVTPVKNQGQCGSCWAFSTVAAVE 119
Query: 171 GITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIF----------NG 220
GI +I++GNL LSEQ+L+DC T NNGC GG + AF +II N G+ G
Sbjct: 120 GINQIKTGNLTSLSEQELIDCDTTYNNGCNGGLMDYAFQFIISNGGLHKEDDYPYLMEEG 179
Query: 221 VCGTQLDHA--VTIVGF 235
C + D + VTI G+
Sbjct: 180 TCDEKRDESEVVTIDGY 196
>gi|359484377|ref|XP_003633102.1| PREDICTED: thiol protease aleurain-like isoform 2 [Vitis vinifera]
Length = 318
Score = 172 bits (435), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 106/258 (41%), Positives = 138/258 (53%), Gaps = 40/258 (15%)
Query: 53 QHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYK 112
++G+SYK E ++R +IF ENL+ I N++G Y L NQF+D T +EFR G
Sbjct: 69 RYGKSYKTVDEIKLRFEIFSENLKLIRSTNRKG-LPYTLAVNQFADWTWEEFRRHRLGAA 127
Query: 113 MPSPSHRSTTSSTFKYQNLSMTDV--PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVE 170
S+T K N +TDV P + DWR+ G V+PIK+Q CG CW F+ A+E
Sbjct: 128 Q-------NCSATLK-GNHKLTDVILPETKDWREDGIVSPIKDQGHCGSCWTFSTTGALE 179
Query: 171 GITKIRSGNLIQLSEQQLLDCSTNGNN-GCLGGSREKAFAYIIQNQGI----------FN 219
G I LSEQQL+DC+ NN GC GG +AF YI N G+ +
Sbjct: 180 AAYAQAFGKGISLSEQQLVDCAGAFNNFGCHGGLPSQAFEYIKYNGGLDTEEAYPYTGLD 239
Query: 220 GVC-------GTQ----------LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMK 262
G C G Q ++HAV VG+G EDG YWLIKNSWG WGD GY K
Sbjct: 240 GTCKFSSENIGVQVLDSVNITLDVNHAVLAVGYG-VEDGVAYWLIKNSWGENWGDNGYFK 298
Query: 263 IVRDEGLCGIGTRSSYPL 280
+ + +CG+ T SSYP+
Sbjct: 299 MELGKNMCGVATCSSYPV 316
>gi|115448287|ref|NP_001047923.1| Os02g0715000 [Oryza sativa Japonica Group]
gi|42408029|dbj|BAD09165.1| putative cysteine proteinase [Oryza sativa Japonica Group]
gi|113537454|dbj|BAF09837.1| Os02g0715000 [Oryza sativa Japonica Group]
gi|215737450|dbj|BAG96580.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215765786|dbj|BAG87483.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222623551|gb|EEE57683.1| hypothetical protein OsJ_08138 [Oryza sativa Japonica Group]
Length = 366
Score = 172 bits (435), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 98/247 (39%), Positives = 141/247 (57%), Gaps = 37/247 (14%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEK--------------WMAQHGRSYKDELE 63
M ++ V+C++ + S H+ SVV ++ W +H + Y E
Sbjct: 16 MLFLLLGFVACSA----TASHHDPSVVGYSQEDLALPNKLVGLFTSWSVKHSKIYASPKE 71
Query: 64 KEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTT- 122
K R +IFK NL +I + N+ N +Y LG N F+D+ ++EF+A Y G K P + R
Sbjct: 72 KVKRYEIFKRNLRHIVETNRR-NGSYWLGLNHFADIAHEEFKASYLGLK-PGLARRDAQP 129
Query: 123 --SSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNL 180
S+TF+Y N ++P ++DWR KGAVTP+KNQ ECG CWAF+ VAAVEGI +I +G L
Sbjct: 130 HGSTTFRYAN--AVNLPWAVDWRKKGAVTPVKNQGECGSCWAFSTVAAVEGINQIVTGKL 187
Query: 181 IQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIF----------NGVCGTQLDHA- 229
+ LSEQ+L+DC N+GC GG + AFAYI+ NQGI+ G C + H+
Sbjct: 188 VSLSEQELMDCDNTFNHGCRGGLMDFAFAYIMGNQGIYTEEDYPYLMEEGYCREKQPHSK 247
Query: 230 -VTIVGF 235
+TI G+
Sbjct: 248 VITITGY 254
Score = 71.2 bits (173), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 39/86 (45%), Positives = 51/86 (59%), Gaps = 9/86 (10%)
Query: 198 GCLGGSREKAFAYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGD 257
G GSR+ F GIF+G CG Q DHA+T VG+G+ G +Y ++KNSWG WG+
Sbjct: 277 GIAAGSRDFQF----YKGGIFDGECGIQPDHALTAVGYGSYY-GQDYIIMKNSWGKNWGE 331
Query: 258 AGYMKIVR----DEGLCGIGTRSSYP 279
GY +I R EG+C I +SYP
Sbjct: 332 QGYFRIRRGTGKPEGVCDIYKIASYP 357
>gi|195379512|ref|XP_002048522.1| GJ11311 [Drosophila virilis]
gi|194155680|gb|EDW70864.1| GJ11311 [Drosophila virilis]
Length = 311
Score = 172 bits (435), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 105/318 (33%), Positives = 162/318 (50%), Gaps = 68/318 (21%)
Query: 20 IIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIE 79
+ + L++ + Q ++S + E E HGRSY + E+ +R +IF++N + I+
Sbjct: 4 VCVVLVLIGSVQSIASDAVLEAEWKSFKE----MHGRSYAGDSEELLRRRIFEDNKKLID 59
Query: 80 KAN---KEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDV 136
N + G TYK+G N+F+DL EF + +M +R+ ++ + Y+ + +
Sbjct: 60 THNARYEAGKETYKMGVNEFTDLLPSEFVS-----RMMGSLNRTAVTADYIYEPSANLQI 114
Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG- 195
P S+DWR KGAV+P+KNQ CG CW FAAV +EG + +R+ +++LSEQ LLDCS++
Sbjct: 115 PESIDWRTKGAVSPVKNQGTCGSCWTFAAVGTLEGQSFLRTKRMVELSEQNLLDCSSHPP 174
Query: 196 --NNGCLGGSREKAFAYIIQNQ-------------------------------------- 215
N+GC G A Y+ NQ
Sbjct: 175 YRNHGCQRGYPYDALRYVKDNQGLDTRSSYPYQGVQGRCRFRREHVGVRIKGVATVRSGD 234
Query: 216 -----------GIFNGVC-GTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKI 263
GI+N C G+ HAV +VG+G + G +YWL+KNSWGN WG+AGY ++
Sbjct: 235 ERALQAAVAEKGIYNQPCSGSTYFHAVVLVGYG-RDRGHDYWLLKNSWGN-WGEAGYFRM 292
Query: 264 VRD-EGLCGIGTRSSYPL 280
R+ LC I + +YPL
Sbjct: 293 ARNSRNLCYIASWPTYPL 310
>gi|297818854|ref|XP_002877310.1| hypothetical protein ARALYDRAFT_484828 [Arabidopsis lyrata subsp.
lyrata]
gi|297323148|gb|EFH53569.1| hypothetical protein ARALYDRAFT_484828 [Arabidopsis lyrata subsp.
lyrata]
Length = 376
Score = 171 bits (434), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 89/190 (46%), Positives = 123/190 (64%), Gaps = 7/190 (3%)
Query: 32 VVSSRSTH--EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTY 89
VV++ +H E V I+E+W+ +HG++Y EKE R KIFK+NL++IE+ N + NR+Y
Sbjct: 24 VVTATESHRNEAEVRTIYERWLVEHGKNYNGLGEKERRFKIFKDNLKHIEEHNSDPNRSY 83
Query: 90 KLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVT 149
G NQFSDLT DEF+A Y G K+ +S + +YQ +P +DWR++GAV
Sbjct: 84 DRGLNQFSDLTVDEFQASYLGGKI---EKKSLSDVAERYQYKEGDILPDEVDWRERGAVV 140
Query: 150 P-IKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNN-GCLGGSREKA 207
P +K Q +CG CWAFAA AVEGI +I +G L+ LSEQ+L+DC +N GC GG A
Sbjct: 141 PRVKRQGDCGSCWAFAATGAVEGINQITTGELLSLSEQELIDCDRGKDNFGCAGGGAVWA 200
Query: 208 FAYIIQNQGI 217
F +I +N GI
Sbjct: 201 FEFIKENGGI 210
Score = 66.6 bits (161), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 29/70 (41%), Positives = 42/70 (60%), Gaps = 5/70 (7%)
Query: 216 GIFNGVCGTQL-DHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLC 270
G++ G C DH V IVG+GT+ D +YWLI+NSWG WG+ GY+++ R+ G C
Sbjct: 277 GVYKGPCSNLWGDHNVLIVGYGTSSDEGDYWLIRNSWGPGWGEGGYLRLQRNFNEPTGKC 336
Query: 271 GIGTRSSYPL 280
+ YP+
Sbjct: 337 AVAVAPVYPI 346
>gi|58201366|gb|AAW66804.1| cysteine protease [Pinus taeda]
gi|58201368|gb|AAW66805.1| cysteine protease [Pinus taeda]
gi|58201392|gb|AAW66817.1| cysteine protease [Pinus taeda]
gi|58201394|gb|AAW66818.1| cysteine protease [Pinus taeda]
gi|58201398|gb|AAW66820.1| cysteine protease [Pinus taeda]
Length = 193
Score = 171 bits (434), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 80/169 (47%), Positives = 120/169 (71%), Gaps = 3/169 (1%)
Query: 32 VVSSRSTHEQ-SVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYK 90
++S++ E +++E++E W+A+H ++Y EK+ R +FK+N YI + N +GNR+YK
Sbjct: 26 IISNKDLREDDAIMELYELWVAEHKKAYNGLDEKQKRFTVFKDNFLYIHEHN-QGNRSYK 84
Query: 91 LGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTP 150
LG NQF+DL+++EF+A Y G K+ + R S + +YQ D+P S+DWR+KGAV P
Sbjct: 85 LGLNQFADLSHEEFKATYLGAKLDTKK-RLLRSPSPRYQYSDGEDLPKSIDWREKGAVAP 143
Query: 151 IKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGC 199
+K+Q CG CWAF+ VAAVEGI +I +G+LI LSEQ+L+DC T+ N GC
Sbjct: 144 VKDQGACGSCWAFSTVAAVEGINQIVTGDLISLSEQELVDCDTSYNQGC 192
>gi|18413507|ref|NP_567377.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|30315953|sp|Q9SUS9.1|CPR4_ARATH RecName: Full=Probable cysteine proteinase At4g11320; Flags:
Precursor
gi|5596478|emb|CAB51416.1| drought-inducible cysteine proteinase RD21A precursor-like protein
[Arabidopsis thaliana]
gi|7267831|emb|CAB81233.1| drought-inducible cysteine proteinase RD21A precursor-like protein
[Arabidopsis thaliana]
gi|14334764|gb|AAK59560.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|15293257|gb|AAK93739.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|332657596|gb|AEE82996.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 371
Score = 171 bits (434), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 114/352 (32%), Positives = 169/352 (48%), Gaps = 91/352 (25%)
Query: 18 MFIIITLLVSCAS----QVVSSRSTHE--------QSVVE-----IHEKWMAQHGRSYKD 60
+F++ ++ SCA+ VVSS H Q + + + E WM +HG+ Y
Sbjct: 10 IFLLALVIASCATAMDMSVVSSNDNHHVTAGPGRRQGIFDAEATLMFESWMVKHGKVYDS 69
Query: 61 ELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRS 120
EKE RL IF++NL +I N E N +Y+LG N+F+DL+ E+ + G P +
Sbjct: 70 VAEKERRLTIFEDNLRFITNRNAE-NLSYRLGLNRFADLSLHEYGEICHGADPRPPRNHV 128
Query: 121 TTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNL 180
+S+ +Y+ +P S+DWR++GAVT +K+Q C CWAF+ V AVEG+ KI +G L
Sbjct: 129 FMTSSNRYKTSDGDVLPKSVDWRNEGAVTEVKDQGLCRSCWAFSTVGAVEGLNKIVTGEL 188
Query: 181 IQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI----------FNGVCGTQL---D 227
+ LSEQ L++C+ NNGC GG E A+ +I+ N G+ NGVC +L +
Sbjct: 189 VTLSEQDLINCNKE-NNGCGGGKVETAYEFIMNNGGLGTDNDYPYKALNGVCEGRLKEDN 247
Query: 228 HAVTIVGF----------------------------------------GTTEDGANYWLI 247
V I G+ GT N+ ++
Sbjct: 248 KNVMIDGYENLPANDEAALMKAVAHQPVTAVVDSSSREFQLYESGVFDGTCGTNLNHGVV 307
Query: 248 KNSWG---------------NTWGDAGYMKIVRD----EGLCGIGTRSSYPL 280
+G +TWG+AGYMK+ R+ GLCGI R+SYPL
Sbjct: 308 VVGYGTENGRDYWIVKNSRGDTWGEAGYMKMARNIANPRGLCGIAMRASYPL 359
>gi|156142226|gb|ABU51882.1| ervatamin-C precursor [Tabernaemontana divaricata]
Length = 365
Score = 171 bits (434), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 86/180 (47%), Positives = 125/180 (69%), Gaps = 5/180 (2%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
++ V EI+E W+A+H + Y +E E R +IFK+NL++I++ N E N TYK+G ++DL
Sbjct: 38 DEEVKEIYELWLAKHDKVYSGLVEYEKRFEIFKDNLKFIDEHNSE-NHTYKMGLTPYTDL 96
Query: 100 TNDEFRALYTGYKMPSPSHR--STTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKEC 157
TN+EF+A+Y G + + HR T + + +Y + ++P +DWR KGAVTP+KNQ +C
Sbjct: 97 TNEEFQAIYLGTRSDT-IHRLKRTINISERYAYEAGDNLPEQIDWRKKGAVTPVKNQGKC 155
Query: 158 GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
G CWAF+ V+ VE I +IR+GNLI LSEQQL+DC+ N+GC GG+ A+ YII N GI
Sbjct: 156 GSCWAFSTVSTVESINQIRTGNLISLSEQQLVDCNKK-NHGCKGGAFVYAYQYIIDNGGI 214
Score = 68.9 bits (167), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 33/66 (50%), Positives = 42/66 (63%), Gaps = 7/66 (10%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGY--MKIVRDEGLCGIG 273
GIF+G CGT+L+H V IVG+ +YW+++NSWG WG+ GY MK V GLCGI
Sbjct: 278 GIFSGPCGTKLNHGVVIVGYWK-----DYWIVRNSWGRYWGEQGYIRMKRVGGCGLCGIA 332
Query: 274 TRSSYP 279
YP
Sbjct: 333 RLPYYP 338
>gi|30141019|dbj|BAC75923.1| cysteine protease-1 [Helianthus annuus]
Length = 461
Score = 171 bits (434), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 83/215 (38%), Positives = 133/215 (61%), Gaps = 13/215 (6%)
Query: 15 TTPMFIIITLLVSCASQVVSSRSTH------------EQSVVEIHEKWMAQHGRSYKDEL 62
T F +I+++ + +++ +TH + V ++E W+ +HG++Y
Sbjct: 8 TLSFFALISIISAMDMSIINYDATHMSSSSSSAPLRTDDEVNALYESWLVKHGKTYNALG 67
Query: 63 EKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTT 122
EK+ R +IFK+NL +I++ N G+ TYKLG N+F+DLTN+E+R YTG K + +
Sbjct: 68 EKDRRFQIFKDNLRFIDEHNS-GDHTYKLGLNKFADLTNEEYRMTYTGIKTIDDKKKLSK 126
Query: 123 SSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQ 182
+ +Y S +P +DWR++GAVT +K+Q CG CWAF+ +VEG+ KI +G+LI
Sbjct: 127 MKSDRYAYRSGDSLPEYVDWREQGAVTDVKDQGSCGSCWAFSTTGSVEGVNKIVTGDLIS 186
Query: 183 LSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
+SEQ+L++C T+ N GC GG + AF +II+N GI
Sbjct: 187 VSEQELVNCDTSYNQGCNGGLMDYAFEFIIKNGGI 221
Score = 84.0 bits (206), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 38/69 (55%), Positives = 47/69 (68%), Gaps = 5/69 (7%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCG 271
GIF G CGT LDH V G+GT EDG +YWL+KNSWG WG+ GY+K+ R+ G CG
Sbjct: 288 GIFTGSCGTALDHGVLAAGYGT-EDGKDYWLVKNSWGAEWGEGGYLKMERNIADKSGKCG 346
Query: 272 IGTRSSYPL 280
I +SYP+
Sbjct: 347 IAMEASYPI 355
>gi|414589857|tpg|DAA40428.1| TPA: Vignain [Zea mays]
Length = 377
Score = 171 bits (434), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 92/229 (40%), Positives = 137/229 (59%), Gaps = 27/229 (11%)
Query: 28 CASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNR 87
C + +V+ ++E E+WM +HGR Y D EK+ RL++++ N+E +E N GN
Sbjct: 39 CGAALVA----RADPMLERFEQWMGRHGRLYADAGEKQRRLEVYRRNVELVETFNSMGN- 93
Query: 88 TYKLGTNQFSDLTNDEFRALYTGYKMPSP---SHRSTTSSTFKYQNLSM------TDVPT 138
Y+L N+F+DLTN+EFRA G+ P + ST ST + +D+P
Sbjct: 94 GYRLADNKFADLTNEEFRAKMLGFGRPRSGGGAGHSTAPSTVACIGSGLMGRQGYSDLPK 153
Query: 139 SLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNG 198
S+DWR+KGAV P+K+Q +CG CWAF+AVAA+EGI +I++G L+ LSEQ+L+DC T G
Sbjct: 154 SVDWREKGAVAPVKSQGDCGSCWAFSAVAAIEGINQIKNGKLVSLSEQELVDCDTKA-IG 212
Query: 199 CLGGSREKAFAYIIQNQGI----------FNGVCGTQ--LDHAVTIVGF 235
C GG AF ++++N+G+ NG C T + AV+I G+
Sbjct: 213 CAGGYMSWAFEFVMKNRGLTTERNYPYQGLNGACQTPKLKESAVSISGY 261
>gi|58201360|gb|AAW66801.1| cysteine protease [Pinus taeda]
Length = 193
Score = 171 bits (433), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 80/169 (47%), Positives = 120/169 (71%), Gaps = 3/169 (1%)
Query: 32 VVSSRSTHEQ-SVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYK 90
++S++ E +++E++E W+A+H ++Y EK+ R +FK+N YI + N +GNR+YK
Sbjct: 26 IISNKDLREDDAIMELYELWLAEHKKAYNGLDEKQKRFTVFKDNFLYIHEHN-QGNRSYK 84
Query: 91 LGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTP 150
LG NQF+DL+++EF+A Y G K+ + R S + +YQ D+P S+DWR+KGAV P
Sbjct: 85 LGLNQFADLSHEEFKATYLGAKLDTKK-RLLRSPSPRYQYSDGEDLPKSIDWREKGAVVP 143
Query: 151 IKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGC 199
+K+Q CG CWAF+ VAAVEGI +I +G+LI LSEQ+L+DC T+ N GC
Sbjct: 144 VKDQGACGSCWAFSTVAAVEGINQIVTGDLISLSEQELVDCDTSYNQGC 192
>gi|357437717|ref|XP_003589134.1| Cysteine proteinase [Medicago truncatula]
gi|355478182|gb|AES59385.1| Cysteine proteinase [Medicago truncatula]
Length = 299
Score = 171 bits (433), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 93/220 (42%), Positives = 139/220 (63%), Gaps = 10/220 (4%)
Query: 1 MVLIFERSGSFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKD 60
+VLI SF ++ II+ + + S R+ E V+ ++E+W+ +HG+SY
Sbjct: 14 IVLII---SSFTVSLALDMSIISYDKTHPDKSTSKRTNKE--VLTMYEEWLVKHGKSYNG 68
Query: 61 ELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRS 120
EK+ R +IFK+NL++I++ N N TY+LG +F+DLTN+E+R+ + G K+ P+ R
Sbjct: 69 LGEKDKRFEIFKDNLKFIDEHNGL-NSTYRLGLTRFADLTNEEYRSKFLGTKI-DPNRRM 126
Query: 121 TT---SSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRS 177
S + +Y +P S+DWR +GAV +K+Q CG CWAF+A+AAVEGI KI +
Sbjct: 127 KKLGGSKSNRYAPRVGDKLPESVDWRKEGAVVGVKDQASCGSCWAFSAIAAVEGINKIVT 186
Query: 178 GNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
G+LI LSEQ+L+DC T+ N GC GG + AF +II N GI
Sbjct: 187 GDLISLSEQELVDCDTSYNEGCNGGLMDYAFEFIISNGGI 226
>gi|58201346|gb|AAW66794.1| cysteine protease [Pinus taeda]
gi|58201348|gb|AAW66795.1| cysteine protease [Pinus taeda]
gi|58201362|gb|AAW66802.1| cysteine protease [Pinus taeda]
gi|58201364|gb|AAW66803.1| cysteine protease [Pinus taeda]
gi|58201370|gb|AAW66806.1| cysteine protease [Pinus taeda]
gi|58201372|gb|AAW66807.1| cysteine protease [Pinus taeda]
gi|58201374|gb|AAW66808.1| cysteine protease [Pinus taeda]
gi|58201380|gb|AAW66811.1| cysteine protease [Pinus taeda]
gi|58201382|gb|AAW66812.1| cysteine protease [Pinus taeda]
gi|58201384|gb|AAW66813.1| cysteine protease [Pinus taeda]
gi|58201386|gb|AAW66814.1| cysteine protease [Pinus taeda]
gi|58201402|gb|AAW66822.1| cysteine protease [Pinus taeda]
Length = 193
Score = 171 bits (433), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 80/169 (47%), Positives = 120/169 (71%), Gaps = 3/169 (1%)
Query: 32 VVSSRSTHEQ-SVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYK 90
++S++ E +++E++E W+A+H ++Y EK+ R +FK+N YI + N +GNR+YK
Sbjct: 26 IISNKDLREDDAIMELYELWLAEHKKAYNGLDEKQKRFTVFKDNFLYIHEHN-QGNRSYK 84
Query: 91 LGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTP 150
LG NQF+DL+++EF+A Y G K+ + R S + +YQ D+P S+DWR+KGAV P
Sbjct: 85 LGLNQFADLSHEEFKATYLGAKLDTKK-RLLRSPSPRYQYSDGEDLPKSIDWREKGAVAP 143
Query: 151 IKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGC 199
+K+Q CG CWAF+ VAAVEGI +I +G+LI LSEQ+L+DC T+ N GC
Sbjct: 144 VKDQGACGSCWAFSTVAAVEGINQIVTGDLISLSEQELVDCDTSYNQGC 192
>gi|58201352|gb|AAW66797.1| cysteine protease [Pinus taeda]
Length = 192
Score = 171 bits (433), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 80/169 (47%), Positives = 120/169 (71%), Gaps = 3/169 (1%)
Query: 32 VVSSRSTHEQ-SVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYK 90
++S++ E +++E++E W+A+H ++Y EK+ R +FK+N YI + N +GNR+YK
Sbjct: 25 IISNKDLREDDAIMELYELWLAEHKKAYNGLDEKQKRFTVFKDNFLYIHEHN-QGNRSYK 83
Query: 91 LGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTP 150
LG NQF+DL+++EF+A Y G K+ + R S + +YQ D+P S+DWR+KGAV P
Sbjct: 84 LGLNQFADLSHEEFKATYLGAKLDTKK-RLLRSPSPRYQYSDGEDLPKSIDWREKGAVAP 142
Query: 151 IKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGC 199
+K+Q CG CWAF+ VAAVEGI +I +G+LI LSEQ+L+DC T+ N GC
Sbjct: 143 VKDQGACGSCWAFSTVAAVEGINQIVTGDLISLSEQELVDCDTSYNQGC 191
>gi|297830592|ref|XP_002883178.1| hypothetical protein ARALYDRAFT_479457 [Arabidopsis lyrata subsp.
lyrata]
gi|297329018|gb|EFH59437.1| hypothetical protein ARALYDRAFT_479457 [Arabidopsis lyrata subsp.
lyrata]
Length = 452
Score = 171 bits (433), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 87/205 (42%), Positives = 128/205 (62%), Gaps = 5/205 (2%)
Query: 15 TTPMFIIITLLVSCASQVVSSRST--HEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFK 72
T + I LL+S + V++ T +E ++E+W+ ++ ++Y EKE R +IF
Sbjct: 9 TLALLIFSMLLISLSLGSVTAADTTRNEAEARRMYEQWLVENRKNYNGLGEKETRFEIFT 68
Query: 73 ENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
+NL+YIE+ N N+T+++G +F+DLTNDEFRA+Y KM +Y
Sbjct: 69 DNLKYIEEHNSVPNQTFEVGLTRFADLTNDEFRAIYLRSKM---ERTRVPVKGERYLYKV 125
Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
+P +DWR KGAV P+K+Q CG CWAF+A+ AVEGI +I++G LI LSEQ+L+DC
Sbjct: 126 GDTLPDQIDWRAKGAVNPVKDQGNCGSCWAFSAIGAVEGINQIKTGELISLSEQELVDCD 185
Query: 193 TNGNNGCLGGSREKAFAYIIQNQGI 217
T+ N GC GG + AF +II+N GI
Sbjct: 186 TSYNGGCGGGLMDYAFKFIIENGGI 210
Score = 81.3 bits (199), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 34/75 (45%), Positives = 50/75 (66%), Gaps = 5/75 (6%)
Query: 209 AYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-- 266
A+ + G+F G CGT LDH V VG+G+ E G +YW+++NSWG+ WG++GY K+ R+
Sbjct: 271 AFQLYKSGVFTGTCGTSLDHGVVAVGYGS-EGGQDYWIVRNSWGSNWGESGYFKLERNIK 329
Query: 267 --EGLCGIGTRSSYP 279
G CG+ +SYP
Sbjct: 330 ESSGKCGVAMMASYP 344
>gi|57282619|emb|CAE54307.1| cysteine proteinase [Gossypium hirsutum]
Length = 372
Score = 171 bits (433), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 91/218 (41%), Positives = 133/218 (61%), Gaps = 15/218 (6%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
+ V+ +++ W+ QHG++Y E+E R +IFK+NL +I++ N N TYKLG N+F+DL
Sbjct: 39 DDEVMGLYKSWVIQHGKAYNGIGEEEKRFEIFKDNLRFIDEHNSNNNTTYKLGLNKFADL 98
Query: 100 TNDEFRALYTGYKMPSPSHRSTTS--STFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKEC 157
TN E+RA + G + P R S + +Y + + ++P S++WRD GAV+ +K+Q C
Sbjct: 99 TNQEYRAKFLGTRT-DPRRRLMKSKIPSSRYAHRAGDNLPDSVNWRDHGAVSRVKDQGSC 157
Query: 158 GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
G CWAF+A+AAVEGI KI SG LI LSEQ+L+DC + + GC GG + AF +II N GI
Sbjct: 158 GSCWAFSAIAAVEGINKIVSGELISLSEQELVDCDRSYDAGCNGGLMDYAFQFIIDNGGI 217
Query: 218 ----------FNGVCGTQLDHA--VTIVGFGTTEDGAN 243
FN C +A V+I G+ + N
Sbjct: 218 DTEKDYPYLGFNNQCDPTKKNAKVVSIDGYEDVPNNEN 255
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 33/76 (43%), Positives = 51/76 (67%), Gaps = 4/76 (5%)
Query: 209 AYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR--- 265
A+ + G+FNG CG LDH V VG+G+ ++G +YW+++NSWG WG+ GY+++ R
Sbjct: 276 AFQLYESGVFNGECGLALDHGVVAVGYGSDDNGQDYWIVRNSWGGNWGENGYIRMERNIN 335
Query: 266 -DEGLCGIGTRSSYPL 280
+ G CGI +SYP+
Sbjct: 336 ANTGKCGIAMEASYPV 351
>gi|109390302|gb|ABG33750.1| cysteine protease [Hevea brasiliensis]
Length = 457
Score = 171 bits (433), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 86/208 (41%), Positives = 128/208 (61%), Gaps = 10/208 (4%)
Query: 18 MFIIITLLVSCASQVVSSRSTH--------EQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+F TL + ++S +H + V+ I+E W+ +HG++Y EKE R +
Sbjct: 5 LFFASTLSSASDLSIISYDQSHGTKSSWRTDDEVMAIYEDWLVKHGKAYNSLGEKERRFE 64
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQ 129
+FK+NL +I++ N E NRTY++G N+F+DLTN+E+R++Y G + + +Y
Sbjct: 65 VFKDNLRFIDEHNSE-NRTYRVGLNRFADLTNEEYRSMYLG-ALSGIRRNKLRKISDRYT 122
Query: 130 NLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLL 189
+P S+DWR +GAV +K+Q CG CWAF+AVAAVEGI KI +G+LI LSEQ+L+
Sbjct: 123 PRVGDSLPDSVDWRKEGAVVGVKDQGSCGSCWAFSAVAAVEGINKIVTGDLISLSEQELV 182
Query: 190 DCSTNGNNGCLGGSREKAFAYIIQNQGI 217
DC + N GC GG + F +II N GI
Sbjct: 183 DCDNSYNEGCNGGLMDYGFEFIINNGGI 210
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 34/75 (45%), Positives = 55/75 (73%), Gaps = 5/75 (6%)
Query: 210 YIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD--- 266
+ + + G+F+G CGT LDH V VG+GT E+G +YW+++NSWG +WG++GY+++ R+
Sbjct: 271 FQLYSSGVFSGRCGTALDHGVVAVGYGT-ENGQDYWIVRNSWGKSWGESGYLRMARNIRK 329
Query: 267 -EGLCGIGTRSSYPL 280
G+CGI +SYP+
Sbjct: 330 PTGICGIAMEASYPI 344
>gi|242077600|ref|XP_002448736.1| hypothetical protein SORBIDRAFT_06g032320 [Sorghum bicolor]
gi|241939919|gb|EES13064.1| hypothetical protein SORBIDRAFT_06g032320 [Sorghum bicolor]
Length = 467
Score = 171 bits (432), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 82/181 (45%), Positives = 125/181 (69%), Gaps = 5/181 (2%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDEL-EKEMRLKIFKENLEYIEKAN-KEGNRTYKLGTNQFS 97
E V ++E W+ +HGR + L E + R ++F +NL +++ N + G ++LG NQF+
Sbjct: 49 EAEVRAMYELWLVEHGRRVSNVLGEHDSRFRVFWDNLRFVDAHNERAGEHGFRLGMNQFA 108
Query: 98 DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKEC 157
DLTNDEFRA Y G ++P+ RS + Y++ ++P S+DWR+KGAV P+KNQ +C
Sbjct: 109 DLTNDEFRAAYLGARIPAA--RSGNAVGEMYRHDGAEELPESVDWREKGAVAPVKNQGQC 166
Query: 158 GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQG 216
G CWAF+AV++VE I +I +G ++ LSEQ+L++CST+ GN+GC GG + AF +II+N G
Sbjct: 167 GSCWAFSAVSSVESINQIVTGEMVTLSEQELVECSTDGGNSGCNGGLMDAAFNFIIKNGG 226
Query: 217 I 217
I
Sbjct: 227 I 227
Score = 78.2 bits (191), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 34/74 (45%), Positives = 50/74 (67%), Gaps = 5/74 (6%)
Query: 210 YIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD--- 266
+ + G+F+G C T LDH V VG+GT E+G +YW+++NSWG WG+AGY+++ R+
Sbjct: 288 FQLYKSGVFSGSCTTNLDHGVVAVGYGT-ENGKDYWIVRNSWGPKWGEAGYIRMERNINA 346
Query: 267 -EGLCGIGTRSSYP 279
G CGI +SYP
Sbjct: 347 TTGKCGIAMMASYP 360
>gi|413938554|gb|AFW73105.1| hypothetical protein ZEAMMB73_931917 [Zea mays]
Length = 361
Score = 171 bits (432), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 87/215 (40%), Positives = 131/215 (60%), Gaps = 15/215 (6%)
Query: 17 PMFIIITLLVSCASQVVSSRSTHEQSVV-----------EIHEKWMAQHGRSYKDELEKE 65
P + ++ A S+ + SVV + W +HG+ Y EK
Sbjct: 3 PKLAVAVFVLFLAFAACSANHHRDPSVVGYSQEDLALPSSLFRSWSVKHGKLYASPTEKL 62
Query: 66 MRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSP---SHRSTT 122
R +IFK+NL +I + N++ N +Y LG NQF+D+ ++EF+A Y G K P + ++ T
Sbjct: 63 ERYEIFKQNLMHIAETNRK-NGSYWLGLNQFADVAHEEFKASYLGLKRALPRAGAPQTRT 121
Query: 123 SSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQ 182
+ F+Y + +P S+DWR KGAVTP+KNQ +CG CWAF++VAAVEGI +I +G L+
Sbjct: 122 PTAFRYAAAAAGSLPWSVDWRYKGAVTPVKNQGKCGSCWAFSSVAAVEGINQIVTGKLVS 181
Query: 183 LSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
LSEQ+L+DC T ++GC GG+ + AFAY++ +QGI
Sbjct: 182 LSEQELVDCDTTLDHGCEGGTMDLAFAYMMGSQGI 216
Score = 77.0 bits (188), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 39/87 (44%), Positives = 55/87 (63%), Gaps = 9/87 (10%)
Query: 198 GCLGGSREKAFAYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGD 257
G GSR+ F G+F+G C +LDHA+T VG+G++ G NY +KNSWG WG+
Sbjct: 272 GIAAGSRDFQF----YRGGVFDGACSVELDHALTAVGYGSSY-GQNYITMKNSWGKNWGE 326
Query: 258 AGYMKIV----RDEGLCGIGTRSSYPL 280
GY++I + EG+CGI T +SYP+
Sbjct: 327 QGYVRIKMGTGKPEGVCGIYTMASYPV 353
>gi|224083868|ref|XP_002307151.1| predicted protein [Populus trichocarpa]
gi|222856600|gb|EEE94147.1| predicted protein [Populus trichocarpa]
Length = 298
Score = 171 bits (432), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 93/208 (44%), Positives = 130/208 (62%), Gaps = 30/208 (14%)
Query: 43 VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTND 102
+ E HE+WMAQ+GR YKD+ EKE R IFKEN+ I+ N + ++Y LG NQF+DL+N+
Sbjct: 1 MYERHEQWMAQYGRVYKDDAEKETRYNIFKENVARIDAFNSQTGKSYNLGVNQFADLSNE 60
Query: 103 EFRALYTGYK--MPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCC 160
EF+A +K M SP + F+Y+N+S VP ++DWR KGAVTP+K+Q +C
Sbjct: 61 EFKASRNRFKGHMCSPQ-----AGPFRYENVSA--VPATMDWRKKGAVTPVKDQGQC--- 110
Query: 161 WAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFAYIIQNQGI-- 217
VAA+EGI ++ +G LI LSEQ+++DC T G + GC GG + AF +I QN+G+
Sbjct: 111 -----VAAMEGINQLTTGKLISLSEQEVVDCDTKGEDQGCNGGLMDDAFKFIEQNKGLTT 165
Query: 218 --------FNGVCGTQ--LDHAVTIVGF 235
+G C TQ + HA I GF
Sbjct: 166 EANYPYTGTDGTCNTQKEVSHAAKITGF 193
Score = 92.0 bits (227), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 41/78 (52%), Positives = 54/78 (69%), Gaps = 5/78 (6%)
Query: 208 FAYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD- 266
F + + GIF G CGT+LDH VT VG+G + DG YWL+KNSWG WG+ GY+++ +D
Sbjct: 222 FEFQFYSSGIFTGSCGTELDHGVTAVGYGGS-DGTKYWLVKNSWGAQWGEEGYIRMQKDI 280
Query: 267 ---EGLCGIGTRSSYPLA 281
EGLCGI ++SYP A
Sbjct: 281 SAKEGLCGIAMQASYPTA 298
>gi|255555337|ref|XP_002518705.1| cysteine protease, putative [Ricinus communis]
gi|223542086|gb|EEF43630.1| cysteine protease, putative [Ricinus communis]
Length = 471
Score = 171 bits (432), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 81/178 (45%), Positives = 122/178 (68%), Gaps = 2/178 (1%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
+ V ++E W+ +HG++Y EKE R +IFK+NL +I++ N +R+YK+G N+F+DL
Sbjct: 44 DSQVRRMYEMWLVEHGKAYNALGEKEKRFEIFKDNLRFIDEHNSV-DRSYKVGLNRFADL 102
Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGC 159
TN+E++A++ G KM +R + + +Y D+P ++DWR+KGAV P+K+Q +CG
Sbjct: 103 TNEEYKAMFLGTKMER-KNRFLGTRSQRYLFKDGDDLPENVDWREKGAVVPVKDQGQCGS 161
Query: 160 CWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
CWAF+ V AVEGI +I +G LI LSEQ+L+DC + N GC GG + AF +II N GI
Sbjct: 162 CWAFSTVGAVEGINQIVTGELISLSEQELVDCDKSYNQGCNGGLMDYAFEFIINNGGI 219
Score = 84.0 bits (206), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 36/76 (47%), Positives = 53/76 (69%), Gaps = 6/76 (7%)
Query: 209 AYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-- 266
A+ + G+F G CGT+LDH V VG+GT E+G NYW+++NSWG+ WG++GY+++ R+
Sbjct: 279 AFQLYKSGVFTGRCGTELDHGVVAVGYGT-ENGVNYWIVRNSWGSAWGESGYIRMERNVA 337
Query: 267 ---EGLCGIGTRSSYP 279
G CGI + SYP
Sbjct: 338 NTKTGKCGIAIQPSYP 353
>gi|194352754|emb|CAQ00105.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
gi|326513690|dbj|BAJ87864.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326514532|dbj|BAJ96253.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 463
Score = 171 bits (432), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 83/199 (41%), Positives = 127/199 (63%), Gaps = 6/199 (3%)
Query: 22 ITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKA 81
++L + +VS E+ V ++ +WMA+HG +Y E+E R + F++NL YI++
Sbjct: 18 VSLAAAADMSIVSYGERSEEEVRRMYAEWMAEHGSTYNAIGEEERRFEAFRDNLRYIDQH 77
Query: 82 N---KEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPT 138
N G +++LG N+F+DLTN+E+R+ Y G + R ++ +YQ ++P
Sbjct: 78 NAAADAGVHSFRLGLNRFADLTNEEYRSTYLGARTKPDRERKLSA---RYQAADNDELPE 134
Query: 139 SLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNG 198
S+DWR KGAV +K+Q CG CWAF+A+AAVEGI +I +G++I LSEQ+L+DC T+ N G
Sbjct: 135 SVDWRKKGAVGAVKDQGGCGSCWAFSAIAAVEGINQIVTGDMIPLSEQELVDCDTSYNQG 194
Query: 199 CLGGSREKAFAYIIQNQGI 217
C GG + AF +II N GI
Sbjct: 195 CNGGLMDYAFEFIINNGGI 213
Score = 84.0 bits (206), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 37/75 (49%), Positives = 50/75 (66%), Gaps = 5/75 (6%)
Query: 209 AYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-- 266
A+ + GIF G CGT LDH V VG+GT E+G +YWL++NSWG+ WG+ GY+++ R+
Sbjct: 273 AFQLYKSGIFTGTCGTALDHGVAAVGYGT-ENGKDYWLVRNSWGSVWGEDGYIRMERNIK 331
Query: 267 --EGLCGIGTRSSYP 279
G CGI SYP
Sbjct: 332 ASSGKCGIAVEPSYP 346
>gi|255546708|ref|XP_002514413.1| cysteine protease, putative [Ricinus communis]
gi|223546510|gb|EEF48009.1| cysteine protease, putative [Ricinus communis]
Length = 324
Score = 171 bits (432), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 112/347 (32%), Positives = 170/347 (48%), Gaps = 111/347 (31%)
Query: 13 INTTPMFIIITLLVSCAS-----QVVSSRSTHEQS---VVEIHEKWMAQHGRSYKDELEK 64
+++ +F I T LV C+ +V H S + E+ E WM++HG++Y+ EK
Sbjct: 5 VSSIFLFTIFTSLVICSVVAHDFSIVGYSPEHLTSMHKLTELFESWMSKHGKTYESIEEK 64
Query: 65 EMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSS 124
RL++FK+NL +I++ N++ TY L N+F+DL+++EF+ S
Sbjct: 65 LHRLEVFKDNLMHIDRRNRDVT-TYWLALNEFADLSHEEFK-----------------SK 106
Query: 125 TFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLS 184
+ + L +KGAV P+KNQ CG CWAF+ VAAVEGI +I +GNL LS
Sbjct: 107 LAQIRRL------------EKGAVAPVKNQGSCGSCWAFSTVAAVEGINQIVTGNLTSLS 154
Query: 185 EQQLLDCSTN---GNNGCL------------GGSREKAFAYIIQN--------------- 214
EQ+L+DC T+ G NG L G +E+ + Y+++
Sbjct: 155 EQELIDCDTSFNSGCNGGLMDYAFDYIVNNGGLHKEEDYPYLMEEGTCDEKREEMEVVTI 214
Query: 215 --------------------------------------QGIFNGVCGTQLDHAVTIVGFG 236
+G+FNG CGT LDH V VG+G
Sbjct: 215 SGYHDVPENNEESLLKALAHQPLSIAIEASGRDFQFYGRGVFNGPCGTDLDHGVAAVGYG 274
Query: 237 TTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 279
+++ G +Y ++KNSWG WG+ GY+++ R+ EGLCGI +SYP
Sbjct: 275 SSK-GLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 320
>gi|384253406|gb|EIE26881.1| hypothetical protein COCSUDRAFT_21961 [Coccomyxa subellipsoidea
C-169]
Length = 481
Score = 171 bits (432), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 104/305 (34%), Positives = 151/305 (49%), Gaps = 78/305 (25%)
Query: 50 WMAQHGRSYKDELEK-EMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALY 108
W+ ++YKD +E+ E + ++ +NLE++ N E + T+KLG F+DLT+DE+R
Sbjct: 51 WVEHLQKAYKDNVEEYERKFSVWLDNLEFVHSHN-EKDSTFKLGLTNFADLTHDEYRQHA 109
Query: 109 TGYKMPSPSHRSTTSSTFKYQNLSMTD--VPTSLDWRDKGAVTPIKNQKECGCCWAFAAV 166
GY+ P + T T K D P S+DWR KGAVT +KNQ++CG CWAF+
Sbjct: 110 LGYR---PELKGTGLGTGKSTGFQYADYEAPPSIDWRKKGAVTDVKNQQQCGSCWAFSTT 166
Query: 167 AAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIF-------- 218
+VEG I SG L+ LSEQ+L+DC ++GC GG + AF++II+N GI
Sbjct: 167 GSVEGANAIYSGELVSLSEQELVDCDVTQDHGCHGGLMDFAFSFIIRNGGIDTEKDYKYK 226
Query: 219 --NGVC--GTQLDHAVTIVGF--------------------------------------- 235
+GVC + H VTI +
Sbjct: 227 AQDGVCNIAKEKRHVVTIDSYEDVPPNDESALKKAAANQPISVAIEADQREFQLYAGGVF 286
Query: 236 ----GTTED------------GANYWLIKNSWGNTWGDAGYMKIVR----DEGLCGIGTR 275
GT D G +YW++KNSWG+ WGD+GY+++ R G CGI +
Sbjct: 287 DAPCGTALDHGVLVVGYGSDNGTDYWIVKNSWGDFWGDSGYIRLARGISNSAGQCGIAMQ 346
Query: 276 SSYPL 280
+SYP+
Sbjct: 347 ASYPI 351
>gi|414585111|tpg|DAA35682.1| TPA: cysteine proteinase Mir3 [Zea mays]
Length = 468
Score = 170 bits (431), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 83/189 (43%), Positives = 119/189 (62%), Gaps = 6/189 (3%)
Query: 32 VVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN---KEGNRT 88
+VS ++ ++ +WMA HGR+Y E+E R ++F++NL YI+ N G +
Sbjct: 31 IVSYGERSDEEARRMYAEWMAAHGRTYNAVGEEERRYQVFRDNLRYIDAHNAAADAGVHS 90
Query: 89 YKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAV 148
++LG N+F+DLTNDE+RA Y G + R + +Y D+P S+DWR KGAV
Sbjct: 91 FRLGLNRFADLTNDEYRATYLGARTRPQRERKLGA---RYHAADNEDLPESVDWRAKGAV 147
Query: 149 TPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAF 208
+K+Q CG CWAF+ +AAVEGI +I +G+LI LSEQ+L+DC T+ N GC GG + AF
Sbjct: 148 AEVKDQGSCGSCWAFSTIAAVEGINQIVTGDLISLSEQELVDCDTSYNQGCNGGLMDYAF 207
Query: 209 AYIIQNQGI 217
+II N GI
Sbjct: 208 EFIINNGGI 216
Score = 88.2 bits (217), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 39/76 (51%), Positives = 55/76 (72%), Gaps = 5/76 (6%)
Query: 209 AYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-- 266
A+ + + GIF G CGT LDH VT VG+GT E+G +YW++KNSWG++WG++GY+++ R+
Sbjct: 276 AFQLYSSGIFTGSCGTALDHGVTAVGYGT-ENGKDYWIVKNSWGSSWGESGYVRMERNIK 334
Query: 267 --EGLCGIGTRSSYPL 280
G CGI SYPL
Sbjct: 335 ASSGKCGIAVEPSYPL 350
>gi|449438381|ref|XP_004136967.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
Length = 479
Score = 170 bits (431), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 82/178 (46%), Positives = 121/178 (67%), Gaps = 2/178 (1%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
++ V ++E W+ HG++Y EKE R +IFK+NL +I++ N+E +RTYK+G +F+DL
Sbjct: 55 DEEVAALYESWLVHHGKAYNAIGEKERRFEIFKDNLRFIDEHNRE-SRTYKVGLTRFADL 113
Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGC 159
TN+E+RA + G + S R + + + +Y D+P +DWR KGAV +K+Q +CG
Sbjct: 114 TNEEYRARFLGGRF-SRKPRLSAAKSGRYAAALGDDLPDDVDWRKKGAVATVKDQGQCGS 172
Query: 160 CWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
CWAF++VAAVEGI +I +G LI LSEQ+L+DC + N GC GG + AF +II N GI
Sbjct: 173 CWAFSSVAAVEGINQIVTGELIPLSEQELVDCDKSFNMGCNGGLMDYAFQFIIGNGGI 230
Score = 80.9 bits (198), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 34/76 (44%), Positives = 51/76 (67%), Gaps = 6/76 (7%)
Query: 209 AYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-- 266
A+ + G+F G CGT LDH V VG+GT ++G +YW+++NSWG WG++GY+++ R+
Sbjct: 290 AFQLYQSGVFTGRCGTDLDHGVVAVGYGT-DNGTDYWIVRNSWGKDWGESGYIRLERNVA 348
Query: 267 ---EGLCGIGTRSSYP 279
G CGI + SYP
Sbjct: 349 NITTGKCGIAVQPSYP 364
>gi|146215992|gb|ABQ10198.1| actinidin Act4b [Actinidia eriantha]
Length = 379
Score = 170 bits (431), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 83/176 (47%), Positives = 116/176 (65%), Gaps = 5/176 (2%)
Query: 43 VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTND 102
V+ + E W+ ++G+SY EKE R +IFK+NL ++++ N + NR+YK+G NQFSDLT +
Sbjct: 44 VMAMFESWLVEYGKSYNALGEKERRFEIFKDNLRFVDEHNADVNRSYKVGLNQFSDLTLE 103
Query: 103 EFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWA 162
E+ ++Y G K T+ + +Y+ +P S+DWR KGAV +KNQ CG CW
Sbjct: 104 EYSSIYLGTKF----DMRMTNVSDRYEPRVGDQLPNSIDWRKKGAVLGVKNQGNCGSCWT 159
Query: 163 FAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFAYIIQNQGI 217
FA +AAVE I +I +GNLI LSEQQ++DC NNGC GGSR A+ +II N GI
Sbjct: 160 FAPIAAVEAINQIVTGNLISLSEQQIVDCQRKSPNNGCKGGSRAGAYQFIIDNGGI 215
Score = 81.3 bits (199), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 35/68 (51%), Positives = 50/68 (73%), Gaps = 4/68 (5%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR---DEGLCGI 272
GIF G CG ++DHAVTIVG+GT E G +YW+++NSWG+ WG+ GY+++ R + G C I
Sbjct: 281 GIFTGPCGAKIDHAVTIVGYGT-EGGMDYWIVRNSWGSNWGENGYVRMQRNVGNAGTCFI 339
Query: 273 GTRSSYPL 280
T +YP+
Sbjct: 340 ATSPNYPV 347
>gi|146215984|gb|ABQ10194.1| actinidin Act2c [Actinidia arguta]
Length = 378
Score = 170 bits (431), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 87/206 (42%), Positives = 126/206 (61%), Gaps = 6/206 (2%)
Query: 13 INTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFK 72
I+ + +F L++S A +V+S V +++E W+ + G+SY EKEMR +IFK
Sbjct: 8 ISMSLLFFSTLLILSSALDIVNSAQRTNDQVRDMYESWLVEQGKSYNSLDEKEMRFEIFK 67
Query: 73 ENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
+NL I+ N + NR++ LG N+F+DLT++E+R+ Y G+K P + + K ++
Sbjct: 68 DNLRIIDDHNADANRSFSLGLNRFADLTDEEYRSTYLGFK-SGPKAKVSNRYVPKVGDV- 125
Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
+P +DWR GAV +KNQ C CWAF+AVAAVEGI KI +GNL+ LSEQ+L+DC
Sbjct: 126 ---LPNYVDWRTVGAVVGVKNQGLCSSCWAFSAVAAVEGINKIMTGNLLSLSEQELVDCG 182
Query: 193 -TNGNNGCLGGSREKAFAYIIQNQGI 217
T GC G AF +II N GI
Sbjct: 183 RTQSTRGCNRGYMTDAFQFIINNGGI 208
Score = 81.6 bits (200), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 37/68 (54%), Positives = 48/68 (70%), Gaps = 4/68 (5%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD---EGLCGI 272
GIF CGT +DH VTIVG+GT E G +YW++KNSWG WG+ GY++I R+ G CGI
Sbjct: 275 GIFTQYCGTAIDHGVTIVGYGT-ERGLDYWIVKNSWGTNWGENGYIRIQRNIGGAGKCGI 333
Query: 273 GTRSSYPL 280
+SYP+
Sbjct: 334 ARMASYPV 341
>gi|2511689|emb|CAB17074.1| cysteine proteinase precursor [Phaseolus vulgaris]
Length = 364
Score = 170 bits (431), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 88/216 (40%), Positives = 132/216 (61%), Gaps = 10/216 (4%)
Query: 17 PMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLE 76
P ++++ S A+ +S + E V++++E+W+ +H + Y EKE R ++FK+NL
Sbjct: 7 PTLLLLSFTFSHAT-AMSIINYSENEVMDMYEEWLVKHRKVYNGLDEKEKRFQVFKDNLG 65
Query: 77 YIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSST-FKYQNLSMTD 135
+I+ N + N TY LG N+F+D+TN+E+RA+Y G + + T +T +Y S
Sbjct: 66 FIQDHNAQ-NNTYTLGLNKFADITNEEYRAMYLGTRTDAKRRVMKTQNTGHRYAYNSGDQ 124
Query: 136 VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG 195
+P +DWR KGAV PIK+Q CG CWAF+ VAAVEGI I +G + LSEQ+L+DC
Sbjct: 125 LPVHVDWRLKGAVGPIKDQGNCGSCWAFSTVAAVEGINNIVTGEFVSLSEQELVDCDREY 184
Query: 196 NNGCLGGSREKAFAYIIQNQGI-------FNGVCGT 224
+ GC GG + AF +IIQN GI + G+ GT
Sbjct: 185 DEGCNGGLMDYAFQFIIQNGGIDTEEDYPYQGIDGT 220
Score = 83.6 bits (205), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 37/70 (52%), Positives = 48/70 (68%), Gaps = 6/70 (8%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-----EGLC 270
G+F G CGT LDH V +VG+GT E+G +YWL++NSWG WG+ GY K+ R+ EG C
Sbjct: 273 GVFTGKCGTALDHGVVVVGYGT-ENGVDYWLVRNSWGTGWGEDGYFKMERNVRSTSEGKC 331
Query: 271 GIGTRSSYPL 280
GI SYP+
Sbjct: 332 GIAMDCSYPV 341
>gi|89274062|dbj|BAE80740.1| cysteine proteinase [Platycodon grandiflorus]
Length = 462
Score = 170 bits (431), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 86/189 (45%), Positives = 118/189 (62%), Gaps = 3/189 (1%)
Query: 29 ASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRT 88
A+ S + V+ ++E W+ +HG+SY EKE R +IFK+NL +I++ N E N +
Sbjct: 32 ATHASKSSWRTDDEVMAMYESWLVKHGKSYNALGEKEKRFQIFKDNLRFIDEHNAEENLS 91
Query: 89 YKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAV 148
YK+G N+F+DLTN+E+R+ Y G K P S +Y +P S+DWR KGAV
Sbjct: 92 YKVGLNRFADLTNEEYRSTYLGAK-SKPKLSKVKSD--RYAPRVGDSLPESVDWRAKGAV 148
Query: 149 TPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAF 208
PIK+Q CG CWAF+ V AVEGI +I +G LI LSEQ+L+DC + N GC GG + F
Sbjct: 149 APIKDQGSCGSCWAFSTVNAVEGINQIVTGELITLSEQELVDCDKSYNEGCDGGLMDYGF 208
Query: 209 AYIIQNQGI 217
+II N GI
Sbjct: 209 EFIINNGGI 217
Score = 86.3 bits (212), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 42/88 (47%), Positives = 57/88 (64%), Gaps = 10/88 (11%)
Query: 198 GCLGGSREKAFAYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGD 257
G GG R A+ + GIF G CGT LDH V +VG+GT E G +YW+++NSWG++WG+
Sbjct: 270 GIEGGGR----AFQFYDSGIFTGKCGTALDHGVNVVGYGT-EKGKDYWIVRNSWGSSWGE 324
Query: 258 AGYMKIVRD-----EGLCGIGTRSSYPL 280
AGY+++ R+ G CGI SYPL
Sbjct: 325 AGYIRMERNLAGTSVGKCGIAMEPSYPL 352
>gi|146215980|gb|ABQ10192.1| actinidin Act2a [Actinidia deliciosa]
Length = 378
Score = 170 bits (431), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 90/206 (43%), Positives = 123/206 (59%), Gaps = 6/206 (2%)
Query: 13 INTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFK 72
I+ + +F L++S A + +S V+ ++E W+ +HG+SY EKEMR +IFK
Sbjct: 8 ISKSLLFFSTLLILSSAIDIENSVQRTNDQVMAMYESWLVEHGKSYNSLDEKEMRFEIFK 67
Query: 73 ENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
ENL I+ N + NR+Y LG N+F+DLT++E+R+ Y G K T + +Y
Sbjct: 68 ENLRIIDDHNADANRSYSLGLNRFADLTDEEYRSTYLGLK-----RGPKTDVSNQYMPKV 122
Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
+P +DWR GAV +KNQ C CWAF+AVAAVEGI KI +GNLI LSEQ+L+DC
Sbjct: 123 GDALPDYVDWRTVGAVVGVKNQGLCSSCWAFSAVAAVEGINKIVTGNLISLSEQELVDCG 182
Query: 193 -TNGNNGCLGGSREKAFAYIIQNQGI 217
T GC G AF +II N GI
Sbjct: 183 RTQITKGCNRGLMTDAFKFIINNGGI 208
Score = 85.1 bits (209), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 39/82 (47%), Positives = 53/82 (64%), Gaps = 4/82 (4%)
Query: 202 GSREKAFAYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYM 261
G + + + GIF G CGT +DH VTIVG+GT E G +YW++KNSWG WG++GY+
Sbjct: 261 GVESEGGKFKLYTSGIFTGSCGTAVDHGVTIVGYGT-ERGMDYWIVKNSWGTNWGESGYI 319
Query: 262 KIVRD---EGLCGIGTRSSYPL 280
+I R+ G CGI SYP+
Sbjct: 320 RIQRNIGGAGKCGIAKMPSYPV 341
>gi|242074728|ref|XP_002447300.1| hypothetical protein SORBIDRAFT_06g032360 [Sorghum bicolor]
gi|241938483|gb|EES11628.1| hypothetical protein SORBIDRAFT_06g032360 [Sorghum bicolor]
Length = 471
Score = 170 bits (431), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 83/181 (45%), Positives = 122/181 (67%), Gaps = 5/181 (2%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDEL-EKEMRLKIFKENLEYIEKAN-KEGNRTYKLGTNQFS 97
E V ++E+WMA+HG++ + L E + R + F +NL +++ N + G R Y+LG N+F+
Sbjct: 45 EAQVRAMYEQWMARHGKAASNALGEHDRRFRAFWDNLRFVDAHNARAGARGYRLGINRFA 104
Query: 98 DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKEC 157
DLTN EFRA Y + + +T ++ +Y++ + +P +DWR KGAV P+KNQ +C
Sbjct: 105 DLTNAEFRAAY--LSAGARNGTATAATGERYRHDGVEALPEFVDWRQKGAVAPVKNQGQC 162
Query: 158 GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFAYIIQNQG 216
G CWAF+AV AVEGI +I +G L+ LSEQ+L+DCS NG N GC GG + AFA+I+ N G
Sbjct: 163 GSCWAFSAVGAVEGINQIVTGELVTLSEQELVDCSKNGQNGGCDGGMMDDAFAFIVGNGG 222
Query: 217 I 217
I
Sbjct: 223 I 223
Score = 81.3 bits (199), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 38/84 (45%), Positives = 52/84 (61%), Gaps = 9/84 (10%)
Query: 202 GSREKAFAYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGA-NYWLIKNSWGNTWGDAGY 260
G RE + + G+F G CGT LDH V VG+GT DG +YWL++NSWG WG+ GY
Sbjct: 280 GGRE----FQLYQSGVFTGRCGTSLDHGVVAVGYGTEADGGRDYWLVRNSWGADWGEGGY 335
Query: 261 MKIVRD----EGLCGIGTRSSYPL 280
+++ R+ G CGI +SYP+
Sbjct: 336 IRMERNVGARAGKCGIAMEASYPV 359
>gi|3980198|emb|CAA46863.1| thiolprotease [Pisum sativum]
Length = 464
Score = 170 bits (430), Expect = 7e-40, Method: Compositional matrix adjust.
Identities = 86/209 (41%), Positives = 134/209 (64%), Gaps = 12/209 (5%)
Query: 16 TPMFIIITLLVSCASQ--VVSSRSTHEQS--------VVEIHEKWMAQHGRSYKDELEKE 65
T +FI +T +S A ++S TH V+ ++E+W+ +HG++Y EKE
Sbjct: 6 TILFITLTFTLSLALDMCIISYDKTHPDKSTPRTNDQVLTMYEEWLVKHGKNYNALGEKE 65
Query: 66 MRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKM-PSPSHRSTTSS 124
R +IFK+NL +I++ N + N +++LG N+F+DLTN+E+R + G ++ P+ +R S
Sbjct: 66 KRFEIFKDNLGFIDEHNSK-NLSFRLGLNRFADLTNEEYRTRFLGTRINPNRRNRKVNSQ 124
Query: 125 TFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLS 184
T +Y +P S+DWR +GAV +K+Q CG CWAF+A+AAVEG+ K+ +G+LI LS
Sbjct: 125 TNRYATRVGDKLPESVDWRKEGAVVGVKDQGSCGSCWAFSAIAAVEGVNKLATGDLISLS 184
Query: 185 EQQLLDCSTNGNNGCLGGSREKAFAYIIQ 213
EQ+L+DC T+ N GC GG + AF +II
Sbjct: 185 EQELVDCDTSYNEGCNGGLMDYAFEFIIN 213
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 39/85 (45%), Positives = 56/85 (65%), Gaps = 10/85 (11%)
Query: 201 GGSREKAFAYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGY 260
GG RE + + + G+F G CGT LDH V VG+GT E+G +YW+++NSWG +WG+AGY
Sbjct: 273 GGGRE----FQLYDSGVFTGRCGTALDHGVAAVGYGT-ENGKDYWIVRNSWGGSWGEAGY 327
Query: 261 MKIVRD-----EGLCGIGTRSSYPL 280
+++ R+ G CGI SYP+
Sbjct: 328 IRLERNLATSKSGKCGIAIEPSYPI 352
>gi|357437715|ref|XP_003589133.1| Cysteine proteinase [Medicago truncatula]
gi|87240770|gb|ABD32628.1| Granulin; Peptidase C1A, papain [Medicago truncatula]
gi|355478181|gb|AES59384.1| Cysteine proteinase [Medicago truncatula]
Length = 474
Score = 170 bits (430), Expect = 8e-40, Method: Compositional matrix adjust.
Identities = 93/220 (42%), Positives = 139/220 (63%), Gaps = 10/220 (4%)
Query: 1 MVLIFERSGSFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKD 60
+VLI SF ++ II+ + + S R+ E V+ ++E+W+ +HG+SY
Sbjct: 14 IVLIIS---SFTVSLALDMSIISYDKTHPDKSTSKRTNKE--VLTMYEEWLVKHGKSYNG 68
Query: 61 ELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRS 120
EK+ R +IFK+NL++I++ N N TY+LG +F+DLTN+E+R+ + G K+ P+ R
Sbjct: 69 LGEKDKRFEIFKDNLKFIDEHNGL-NSTYRLGLTRFADLTNEEYRSKFLGTKI-DPNRRM 126
Query: 121 TT---SSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRS 177
S + +Y +P S+DWR +GAV +K+Q CG CWAF+A+AAVEGI KI +
Sbjct: 127 KKLGGSKSNRYAPRVGDKLPESVDWRKEGAVVGVKDQASCGSCWAFSAIAAVEGINKIVT 186
Query: 178 GNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
G+LI LSEQ+L+DC T+ N GC GG + AF +II N GI
Sbjct: 187 GDLISLSEQELVDCDTSYNEGCNGGLMDYAFEFIISNGGI 226
Score = 82.0 bits (201), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 38/85 (44%), Positives = 54/85 (63%), Gaps = 10/85 (11%)
Query: 201 GGSREKAFAYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGY 260
GG RE + + G+F G CGT LDH V VG+GT E+G +YW+++NSWG +WG+ GY
Sbjct: 282 GGGRE----FQLYEYGVFTGRCGTALDHGVAAVGYGT-ENGKDYWIVRNSWGGSWGEQGY 336
Query: 261 MKIVRD-----EGLCGIGTRSSYPL 280
+++ R+ G CGI SYP+
Sbjct: 337 IRLERNLASSRAGKCGIAIEPSYPI 361
>gi|297791625|ref|XP_002863697.1| hypothetical protein ARALYDRAFT_917391 [Arabidopsis lyrata subsp.
lyrata]
gi|297309532|gb|EFH39956.1| hypothetical protein ARALYDRAFT_917391 [Arabidopsis lyrata subsp.
lyrata]
Length = 463
Score = 169 bits (429), Expect = 8e-40, Method: Compositional matrix adjust.
Identities = 97/222 (43%), Positives = 139/222 (62%), Gaps = 16/222 (7%)
Query: 1 MVLIFERSG-SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYK 59
M+L+ G S+ I+ + II+ + VSSRS E V I+E WM +HG+
Sbjct: 9 MILLLAMIGVSYAIDMS----IISYDENHHISTVSSRSDAE--VERIYEAWMVEHGKKKM 62
Query: 60 DE----LEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPS 115
++ EK+ R +IFK+NL YI++ N + N +YKLG +F+DLTNDE+R++Y G K
Sbjct: 63 NQNGLGAEKDQRFEIFKDNLRYIDEHNTK-NLSYKLGLTRFADLTNDEYRSMYLGAK--- 118
Query: 116 PSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKI 175
P R +S +Y+ +P S+DWR +GAV +K+Q CG CWAF+ + AVEGI KI
Sbjct: 119 PVKRVLKTSD-RYEARVGDALPDSVDWRKEGAVADVKDQGSCGSCWAFSTIGAVEGINKI 177
Query: 176 RSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
+G+LI LSEQ+L+DC T+ N GC GG + AF +II+N GI
Sbjct: 178 VTGDLISLSEQELVDCDTSYNQGCNGGLMDYAFEFIIKNGGI 219
Score = 90.9 bits (224), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 37/76 (48%), Positives = 57/76 (75%), Gaps = 5/76 (6%)
Query: 209 AYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-- 266
A+ + + G+F+G+CGT+LDH V VG+GT E+G +YW+++NSWGN WG++GY+K+ R+
Sbjct: 279 AFQLYSSGVFDGICGTELDHGVVAVGYGT-ENGKDYWIVRNSWGNRWGESGYIKMARNIA 337
Query: 267 --EGLCGIGTRSSYPL 280
G CGI +SYP+
Sbjct: 338 EPTGKCGIAMEASYPI 353
>gi|244539471|dbj|BAH82657.1| cysteine protease [Lotus japonicus]
Length = 286
Score = 169 bits (429), Expect = 9e-40, Method: Compositional matrix adjust.
Identities = 83/175 (47%), Positives = 123/175 (70%), Gaps = 5/175 (2%)
Query: 43 VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTND 102
++E+ E WM++HG+ Y+ EK +R +IFK+NL++I++ NK + Y LG N+F+DL++
Sbjct: 4 LIELFESWMSRHGKIYESIEEKLLRFEIFKDNLKHIDETNKVVS-NYWLGLNEFADLSHH 62
Query: 103 EFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWA 162
EF+ Y G K+ + R + S F Y+++ D+P S+DWR KGAVT IKNQ CG CWA
Sbjct: 63 EFKKQYLGLKVDFSTRRES-SEEFTYRDV---DLPKSVDWRKKGAVTNIKNQGSCGSCWA 118
Query: 163 FAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
F+ VAAVEGI +I +GNL LSEQ+L+DC N+GC GG + AF++I++N G+
Sbjct: 119 FSTVAAVEGINQIVTGNLTSLSEQELIDCDRTYNSGCNGGLMDYAFSFIVENGGL 173
>gi|226533314|ref|NP_001150119.1| xylem cysteine proteinase 2 [Zea mays]
gi|195636886|gb|ACG37911.1| xylem cysteine proteinase 2 precursor [Zea mays]
gi|223946183|gb|ACN27175.1| unknown [Zea mays]
gi|413951209|gb|AFW83858.1| Xylem cysteine proteinase 2 [Zea mays]
Length = 385
Score = 169 bits (429), Expect = 9e-40, Method: Compositional matrix adjust.
Identities = 85/187 (45%), Positives = 123/187 (65%), Gaps = 8/187 (4%)
Query: 37 STHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQF 96
S+HE S+ E+ E+W+++H R+Y EK R ++FK+NL +I++ N++ + +Y LG N+F
Sbjct: 50 SSHE-SLAELFERWLSRHRRAYASLEEKLRRFQVFKDNLHHIDETNRKVS-SYWLGLNEF 107
Query: 97 SDLTNDEFRALYTGYK------MPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTP 150
+DLT+DEF+A Y G + Y+ + +P S+DWR KGAVT
Sbjct: 108 ADLTHDEFKATYLGLRSSVGDGGSGIDDDDEPEEEEGYEGVDGASLPKSVDWRSKGAVTG 167
Query: 151 IKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAY 210
+KNQ +CG CWAF+ VAAVEGI +I +GNL LSEQ+L+DC T+GNNGC GG + AF+Y
Sbjct: 168 VKNQGQCGSCWAFSTVAAVEGINQIVTGNLTALSEQELIDCDTDGNNGCNGGLMDYAFSY 227
Query: 211 IIQNQGI 217
I N G+
Sbjct: 228 IAHNGGL 234
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 35/68 (51%), Positives = 48/68 (70%), Gaps = 4/68 (5%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCG 271
G+F+G CGTQLDH V VG+GT G +Y ++KNSWG +WG+ GY+++ R +GLCG
Sbjct: 315 GVFDGPCGTQLDHGVAAVGYGTAAKGHDYIIVKNSWGPSWGEKGYIRMRRGTGKRQGLCG 374
Query: 272 IGTRSSYP 279
I +SYP
Sbjct: 375 INKMASYP 382
>gi|146216000|gb|ABQ10202.1| cysteine protease Cp4 [Actinidia deliciosa]
Length = 463
Score = 169 bits (429), Expect = 9e-40, Method: Compositional matrix adjust.
Identities = 81/205 (39%), Positives = 130/205 (63%), Gaps = 7/205 (3%)
Query: 18 MFIIITLLVSCASQVVSSRSTH-----EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFK 72
+F+ + ++S TH + + I+EKW+ HG++Y EKE R +IFK
Sbjct: 13 LFLCFAFSSALDMSIISYDQTHPPQRTDAEAMAIYEKWLTTHGKAYNAIGEKERRFEIFK 72
Query: 73 ENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
+NL ++++ N +Y++G N+F+DLTN+E+R+++ G M RS ++ + +Y +
Sbjct: 73 DNLRFVDEHNAVAG-SYRVGLNRFADLTNEEYRSMFLGGNMEM-KERSASTKSDRYAFRA 130
Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
+P S+DWR+KGAV+P+K+Q +CG CWAF+ ++AVEGI +I +G LI LSEQ+L+DC
Sbjct: 131 GDKLPGSVDWREKGAVSPVKDQGQCGSCWAFSTISAVEGINQIVTGELISLSEQELVDCD 190
Query: 193 TNGNNGCLGGSREKAFAYIIQNQGI 217
+ N GC GG + F +II N GI
Sbjct: 191 KSYNMGCNGGLMDYGFQFIINNGGI 215
Score = 80.5 bits (197), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 36/75 (48%), Positives = 50/75 (66%), Gaps = 5/75 (6%)
Query: 209 AYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-- 266
A+ + G+F G CGT LDH V VG+GT E+G +YW ++NSWG WG+ GY+K+ R+
Sbjct: 275 AFQLYESGVFTGHCGTNLDHGVVAVGYGT-ENGVDYWTVRNSWGPKWGENGYIKLERNIN 333
Query: 267 --EGLCGIGTRSSYP 279
G CGI + +SYP
Sbjct: 334 ATSGKCGIASMASYP 348
>gi|449522968|ref|XP_004168497.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
Length = 348
Score = 169 bits (429), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 83/180 (46%), Positives = 122/180 (67%), Gaps = 4/180 (2%)
Query: 38 THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
T ++E+ E+W++ HG+ Y+ EK R ++FK+NL++I++ NK+ +Y LG N+F+
Sbjct: 36 TSMDRLIELFEEWISNHGKIYETIEEKWHRFEVFKDNLKHIDETNKKVT-SYWLGVNEFA 94
Query: 98 DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKEC 157
DLT+ EF+ +Y G K+ S R + F Y+++ D+P S+DWR KGAVT +KNQ C
Sbjct: 95 DLTHQEFKNMYLGLKVESSRTRQSPEE-FTYKDV--VDLPKSVDWRKKGAVTRVKNQGSC 151
Query: 158 GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
G CWAF+ VAAVEGI KI GNL LSEQ+L+DC NNGC GG + AF++I+ + G+
Sbjct: 152 GSCWAFSTVAAVEGINKIVGGNLTSLSEQELIDCDRPYNNGCHGGLMDYAFSFIVSSGGL 211
Score = 79.3 bits (194), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 35/68 (51%), Positives = 50/68 (73%), Gaps = 5/68 (7%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCG 271
G+F+G CGTQLDH VT VG+G+++ G +Y ++KNSWG WG+ GY+++ R+ GLCG
Sbjct: 278 GVFDGPCGTQLDHGVTAVGYGSSK-GVDYIIVKNSWGPKWGEKGYIRMKRNTGKPAGLCG 336
Query: 272 IGTRSSYP 279
I +SYP
Sbjct: 337 INKMASYP 344
>gi|357437719|ref|XP_003589135.1| Cysteine proteinase [Medicago truncatula]
gi|355478183|gb|AES59386.1| Cysteine proteinase [Medicago truncatula]
Length = 457
Score = 169 bits (429), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 93/220 (42%), Positives = 139/220 (63%), Gaps = 10/220 (4%)
Query: 1 MVLIFERSGSFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKD 60
+VLI SF ++ II+ + + S R+ E V+ ++E+W+ +HG+SY
Sbjct: 14 IVLIIS---SFTVSLALDMSIISYDKTHPDKSTSKRTNKE--VLTMYEEWLVKHGKSYNG 68
Query: 61 ELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRS 120
EK+ R +IFK+NL++I++ N N TY+LG +F+DLTN+E+R+ + G K+ P+ R
Sbjct: 69 LGEKDKRFEIFKDNLKFIDEHNGL-NSTYRLGLTRFADLTNEEYRSKFLGTKI-DPNRRM 126
Query: 121 TT---SSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRS 177
S + +Y +P S+DWR +GAV +K+Q CG CWAF+A+AAVEGI KI +
Sbjct: 127 KKLGGSKSNRYAPRVGDKLPESVDWRKEGAVVGVKDQASCGSCWAFSAIAAVEGINKIVT 186
Query: 178 GNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
G+LI LSEQ+L+DC T+ N GC GG + AF +II N GI
Sbjct: 187 GDLISLSEQELVDCDTSYNEGCNGGLMDYAFEFIISNGGI 226
Score = 82.0 bits (201), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 38/85 (44%), Positives = 54/85 (63%), Gaps = 10/85 (11%)
Query: 201 GGSREKAFAYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGY 260
GG RE + + G+F G CGT LDH V VG+GT E+G +YW+++NSWG +WG+ GY
Sbjct: 282 GGGRE----FQLYEYGVFTGRCGTALDHGVAAVGYGT-ENGKDYWIVRNSWGGSWGEQGY 336
Query: 261 MKIVRD-----EGLCGIGTRSSYPL 280
+++ R+ G CGI SYP+
Sbjct: 337 IRLERNLASSRAGKCGIAIEPSYPI 361
>gi|1256830|gb|AAB68374.1| cysteine endopeptidase 1 [Phaseolus vulgaris]
gi|2959418|emb|CAA12118.1| cysteine protease [Phaseolus vulgaris]
Length = 364
Score = 169 bits (429), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 88/216 (40%), Positives = 131/216 (60%), Gaps = 10/216 (4%)
Query: 17 PMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLE 76
P ++++ S A+ +S + E V++++E+W+ +H + Y EKE R ++FK+NL
Sbjct: 7 PTLLLLSFTFSHAT-AMSIINYSENEVMDMYEEWLVKHRKVYNGLDEKEKRFQVFKDNLG 65
Query: 77 YIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSST-FKYQNLSMTD 135
+I+ N + N TY LG N+F+D+TN E+RA+Y G + + T +T +Y S
Sbjct: 66 FIQDHNAQ-NNTYTLGLNKFADITNKEYRAMYLGTRTDAKRRVMKTQNTGHRYAYNSGDQ 124
Query: 136 VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG 195
+P +DWR KGAV PIK+Q CG CWAF+ VAAVEGI I +G + LSEQ+L+DC
Sbjct: 125 LPVHVDWRLKGAVGPIKDQGNCGSCWAFSTVAAVEGINNIVTGEFVSLSEQELVDCDREY 184
Query: 196 NNGCLGGSREKAFAYIIQNQGI-------FNGVCGT 224
+ GC GG + AF +IIQN GI + G+ GT
Sbjct: 185 DEGCNGGLMDYAFQFIIQNGGIDTEEDYPYQGIDGT 220
Score = 83.6 bits (205), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 37/70 (52%), Positives = 48/70 (68%), Gaps = 6/70 (8%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-----EGLC 270
G+F G CGT LDH V +VG+GT E+G +YWL++NSWG WG+ GY K+ R+ EG C
Sbjct: 273 GVFTGKCGTALDHGVVVVGYGT-ENGVDYWLVRNSWGTGWGEDGYFKMERNVRSTSEGKC 331
Query: 271 GIGTRSSYPL 280
GI SYP+
Sbjct: 332 GIAMDCSYPV 341
>gi|167521499|ref|XP_001745088.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163776702|gb|EDQ90321.1| predicted protein [Monosiga brevicollis MX1]
Length = 294
Score = 169 bits (429), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 109/299 (36%), Positives = 149/299 (49%), Gaps = 78/299 (26%)
Query: 52 AQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEFRALY 108
+ + +SY+ E + RL F+ NLE+I K N E G +Y +G N+F+DLT DEF ALY
Sbjct: 3 SDYSKSYESEAVEAKRLAAFEANLEFINKHNAEHAQGLHSYTVGVNEFADLTIDEFMALY 62
Query: 109 TGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAA 168
+PS +R+ +T Y + D S+DWR KGAVTPIKNQ +CG CW+F+ +
Sbjct: 63 ----VPSKFNRTMPYNTV-YLPATSED---SVDWRTKGAVTPIKNQGQCGSCWSFSTTGS 114
Query: 169 VEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIF--------- 218
EG I +GNL+ LSEQQL+DCS + GN GC GG + AF YII N+G+
Sbjct: 115 TEGAHAIATGNLVSLSEQQLVDCSGSFGNQGCNGGLMDDAFKYIISNKGLDTEEDYPYTA 174
Query: 219 -NGVCGTQLD--HAVTIVGF---------------------------------------- 235
+G C + + HA TI +
Sbjct: 175 QDGTCNKEKEAKHAATISSYSDVPKNNEDQLAAAVAKGPVSVAIEADQSGFQLYKSGVFD 234
Query: 236 ---GTTEDGA--------NYWLIKNSWGNTWGDAGYMKIVRD---EGLCGIGTRSSYPL 280
GT D +YW++KNSWG TWG GY+ + R G+CGI + SYP+
Sbjct: 235 GNCGTNLDHGVLVVGYTDDYWIVKNSWGTTWGVEGYINMKRGVSASGICGIAMQPSYPI 293
>gi|21593501|gb|AAM65468.1| cysteine proteinase [Arabidopsis thaliana]
Length = 376
Score = 169 bits (429), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 89/210 (42%), Positives = 128/210 (60%), Gaps = 5/210 (2%)
Query: 10 SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
SF+ ++ + +S + +E V+ ++E+W+ ++G++Y EKE R K
Sbjct: 4 SFRTLALLTLSVLLISISLGVVTATESQRNEGGVLTMYEQWLVENGKNYNGLGEKERRFK 63
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQ 129
IFK+NL+ IE+ N + NR+Y+ G N+FSDLT DEF+A Y G KM +S + +YQ
Sbjct: 64 IFKDNLKRIEEHNSDPNRSYERGLNKFSDLTADEFQASYLGGKM---EKKSLSDVAERYQ 120
Query: 130 NLSMTDVPTSLDWRDKGAVTP-IKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQL 188
+P +DWR++GAV P +K Q ECG CWAFAA AVEGI +I +G L+ LSEQ+L
Sbjct: 121 YKEGDVLPDEVDWRERGAVVPRVKRQGECGSCWAFAATGAVEGINQITTGELVSLSEQEL 180
Query: 189 LDCST-NGNNGCLGGSREKAFAYIIQNQGI 217
+DC N N GC GG AF +I +N GI
Sbjct: 181 IDCDRGNDNFGCAGGGAVWAFEFIKENGGI 210
Score = 67.8 bits (164), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 29/70 (41%), Positives = 42/70 (60%), Gaps = 5/70 (7%)
Query: 216 GIFNGVCGTQL-DHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLC 270
G++ G C DH V IVG+GT+ D +YWLI+NSWG WG+ GY+++ R+ G C
Sbjct: 277 GVYKGACSNLWGDHNVLIVGYGTSSDEGDYWLIRNSWGPEWGEGGYLRLQRNFHEPTGKC 336
Query: 271 GIGTRSSYPL 280
+ YP+
Sbjct: 337 AVAVAPVYPI 346
>gi|226495425|ref|NP_001148706.1| cysteine protease 1 precursor [Zea mays]
gi|195621544|gb|ACG32602.1| cysteine protease 1 precursor [Zea mays]
Length = 463
Score = 169 bits (429), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 83/189 (43%), Positives = 118/189 (62%), Gaps = 6/189 (3%)
Query: 32 VVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN---KEGNRT 88
+VS + ++ +WMA HGR+Y E+E R ++F++NL YI+ N G +
Sbjct: 26 IVSYGERSXEEARRMYAEWMAAHGRTYNAVGEEERRYQVFRDNLRYIDAHNAAADAGVHS 85
Query: 89 YKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAV 148
++LG N+F+DLTNDE+RA Y G + R + +Y D+P S+DWR KGAV
Sbjct: 86 FRLGLNRFADLTNDEYRATYLGARTRPQRERKLGA---RYHAADNEDLPESVDWRAKGAV 142
Query: 149 TPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAF 208
+K+Q CG CWAF+ +AAVEGI +I +G+LI LSEQ+L+DC T+ N GC GG + AF
Sbjct: 143 AEVKDQGSCGSCWAFSTIAAVEGINQIVTGDLISLSEQELVDCDTSYNQGCNGGLMDYAF 202
Query: 209 AYIIQNQGI 217
+II N GI
Sbjct: 203 EFIINNGGI 211
Score = 88.2 bits (217), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 39/76 (51%), Positives = 55/76 (72%), Gaps = 5/76 (6%)
Query: 209 AYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-- 266
A+ + + GIF G CGT LDH VT VG+GT E+G +YW++KNSWG++WG++GY+++ R+
Sbjct: 271 AFQLYSSGIFTGSCGTALDHGVTAVGYGT-ENGKDYWIVKNSWGSSWGESGYVRMERNIK 329
Query: 267 --EGLCGIGTRSSYPL 280
G CGI SYPL
Sbjct: 330 ASSGKCGIAVEPSYPL 345
>gi|449455625|ref|XP_004145553.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
Length = 351
Score = 169 bits (429), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 83/180 (46%), Positives = 122/180 (67%), Gaps = 4/180 (2%)
Query: 38 THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
T ++E+ E+W++ HG+ Y+ EK R ++FK+NL++I++ NK+ +Y LG N+F+
Sbjct: 39 TSMDRLIELFEEWISNHGKIYETIEEKWHRFEVFKDNLKHIDETNKKVT-SYWLGVNEFA 97
Query: 98 DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKEC 157
DLT+ EF+ +Y G K+ S R + F Y+++ D+P S+DWR KGAVT +KNQ C
Sbjct: 98 DLTHQEFKNMYLGLKVESSRTRQSPEE-FTYKDV--VDLPKSVDWRKKGAVTRVKNQGSC 154
Query: 158 GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
G CWAF+ VAAVEGI KI GNL LSEQ+L+DC NNGC GG + AF++I+ + G+
Sbjct: 155 GSCWAFSTVAAVEGINKIVGGNLTSLSEQELIDCDRPYNNGCHGGLMDYAFSFIVSSGGL 214
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 35/68 (51%), Positives = 50/68 (73%), Gaps = 5/68 (7%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCG 271
G+F+G CGTQLDH VT VG+G+++ G +Y ++KNSWG WG+ GY+++ R+ GLCG
Sbjct: 281 GVFDGPCGTQLDHGVTAVGYGSSK-GVDYIIVKNSWGPKWGEKGYIRMKRNTGKPAGLCG 339
Query: 272 IGTRSSYP 279
I +SYP
Sbjct: 340 INKMASYP 347
>gi|18407678|ref|NP_566867.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|30315950|sp|Q9LXW3.1|CPR2_ARATH RecName: Full=Probable cysteine proteinase At3g43960; Flags:
Precursor
gi|7594557|emb|CAB88124.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|26452289|dbj|BAC43231.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|332644328|gb|AEE77849.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 376
Score = 169 bits (429), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 89/210 (42%), Positives = 128/210 (60%), Gaps = 5/210 (2%)
Query: 10 SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
SF+ ++ + +S + +E V+ ++E+W+ ++G++Y EKE R K
Sbjct: 4 SFRTLALLTLSVLLISISLGVVTATESQRNEGEVLTMYEQWLVENGKNYNGLGEKERRFK 63
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQ 129
IFK+NL+ IE+ N + NR+Y+ G N+FSDLT DEF+A Y G KM +S + +YQ
Sbjct: 64 IFKDNLKRIEEHNSDPNRSYERGLNKFSDLTADEFQASYLGGKM---EKKSLSDVAERYQ 120
Query: 130 NLSMTDVPTSLDWRDKGAVTP-IKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQL 188
+P +DWR++GAV P +K Q ECG CWAFAA AVEGI +I +G L+ LSEQ+L
Sbjct: 121 YKEGDVLPDEVDWRERGAVVPRVKRQGECGSCWAFAATGAVEGINQITTGELVSLSEQEL 180
Query: 189 LDCST-NGNNGCLGGSREKAFAYIIQNQGI 217
+DC N N GC GG AF +I +N GI
Sbjct: 181 IDCDRGNDNFGCAGGGAVWAFEFIKENGGI 210
Score = 68.2 bits (165), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 29/70 (41%), Positives = 42/70 (60%), Gaps = 5/70 (7%)
Query: 216 GIFNGVCGTQL-DHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLC 270
G++ G C DH V IVG+GT+ D +YWLI+NSWG WG+ GY+++ R+ G C
Sbjct: 277 GVYKGACSNLWGDHNVLIVGYGTSSDEGDYWLIRNSWGPEWGEGGYLRLQRNFHEPTGKC 336
Query: 271 GIGTRSSYPL 280
+ YP+
Sbjct: 337 AVAVAPVYPI 346
>gi|358343350|ref|XP_003635767.1| Cysteine proteinase [Medicago truncatula]
gi|355501702|gb|AES82905.1| Cysteine proteinase [Medicago truncatula]
Length = 338
Score = 169 bits (429), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 90/210 (42%), Positives = 133/210 (63%), Gaps = 11/210 (5%)
Query: 11 FKINTTPMFIIITL--LVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRL 68
K T +I+ L + S ++ + ST+ + + +E W+ ++GR Y+D E E+R
Sbjct: 1 MKTTITLSIVILNLWIIASACPEIHTKNSTNPAVMKKRYETWLKRYGRHYRDREEWEVRF 60
Query: 69 KIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKY 128
I++ N++YIE N + N +YKL N+F+D+TN+EF++ Y GY +P R + F+Y
Sbjct: 61 DIYQSNVQYIEFYNSQ-NYSYKLIDNRFADITNEEFKSTYLGY-LP----RFRVQTEFRY 114
Query: 129 QNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQL 188
++P S+DWR KGAVT +K+Q CG CWAF+AVAAVEGI KI++ NL+ LSEQQL
Sbjct: 115 HKHG--ELPKSIDWRKKGAVTHVKDQGRCGSCWAFSAVAAVEGINKIKTENLVSLSEQQL 172
Query: 189 LDCS-TNGNNGCLGGSREKAFAYIIQNQGI 217
+DC +GN GC GG AF YI ++ GI
Sbjct: 173 IDCDIKSGNEGCEGGDMYIAFNYIKKHGGI 202
Score = 84.0 bits (206), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 35/78 (44%), Positives = 55/78 (70%), Gaps = 5/78 (6%)
Query: 207 AFAYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD 266
+A+ ++GIF+G CG L+H +TIVG+G E+G YW++KNSW N WG++GY+++ RD
Sbjct: 260 GYAFQFYSKGIFSGSCGKNLNHGMTIVGYGE-ENGDKYWIVKNSWANDWGESGYVRMKRD 318
Query: 267 ----EGLCGIGTRSSYPL 280
+G CGI ++YP+
Sbjct: 319 TKDKDGTCGIAMDATYPV 336
>gi|58201356|gb|AAW66799.1| cysteine protease [Pinus taeda]
gi|58201376|gb|AAW66809.1| cysteine protease [Pinus taeda]
gi|58201388|gb|AAW66815.1| cysteine protease [Pinus taeda]
gi|58201400|gb|AAW66821.1| cysteine protease [Pinus taeda]
gi|58201406|gb|AAW66824.1| cysteine protease [Pinus taeda]
gi|167345244|gb|ABZ69062.1| cysteine protease [Pinus taeda]
Length = 193
Score = 169 bits (428), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 79/169 (46%), Positives = 120/169 (71%), Gaps = 3/169 (1%)
Query: 32 VVSSRSTHEQ-SVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYK 90
++S++ E +++E++E W+A+H ++Y EK+ R +FK+N YI + N +GN++YK
Sbjct: 26 IISNKDLREDDAIMELYELWLAEHKKAYNGLDEKQKRFTVFKDNFLYIHEHN-QGNQSYK 84
Query: 91 LGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTP 150
LG NQF+DL+++EF+A Y G K+ + R S + +YQ D+P S+DWR+KGAV P
Sbjct: 85 LGLNQFADLSHEEFKATYLGAKLDTKK-RLLRSPSPRYQYSDGEDLPKSIDWREKGAVAP 143
Query: 151 IKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGC 199
+K+Q CG CWAF+ VAAVEGI +I +G+LI LSEQ+L+DC T+ N GC
Sbjct: 144 VKDQGACGSCWAFSTVAAVEGINQIVTGDLISLSEQELVDCDTSYNQGC 192
>gi|307103885|gb|EFN52142.1| hypothetical protein CHLNCDRAFT_139276 [Chlorella variabilis]
Length = 388
Score = 169 bits (428), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 103/296 (34%), Positives = 144/296 (48%), Gaps = 67/296 (22%)
Query: 49 KWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALY 108
+W HGRSYK E R +F EN +++ + N N L NQF+DLT +EF A +
Sbjct: 48 QWQMTHGRSYKSASEARKRQAVFVENAKHVAEQNAR-NSGLVLALNQFADLTLEEFAATH 106
Query: 109 TGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAA 168
GY + T+++F+Y + D+P+++DWR K AVTP+KNQ CG CWAF+A A
Sbjct: 107 LGYNPSLREGKEHTTTSFQYAD--ANDLPSTVDWRKKNAVTPVKNQAMCGSCWAFSATGA 164
Query: 169 VEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI----------F 218
VEGI IR+G L+ LSEQQL+DC + + GC GG + AF YI +N GI +
Sbjct: 165 VEGINAIRTGKLVSLSEQQLVDCDSEKDLGCGGGLMDFAFDYITKNGGIDSEDDYSYWGY 224
Query: 219 NGVCGTQLD---HAVTIVGF-------------------------GTTED---------- 240
+C + + H VTI GF G D
Sbjct: 225 GLICQRRKEADRHVVTIDGFEDVPKNDGEALKKAIAHQPVSLYHSGVVGDDACCQDLNHG 284
Query: 241 ------------GANYWLIKNSWGNTWGDAGYMKIV----RDEGLCGIGTRSSYPL 280
G +++IKNSWG WG+ G+ ++ G CG+ +SYPL
Sbjct: 285 VLAVGYDDGSKGGTPHYVIKNSWGEGWGEQGFFRLAAKSSEASGACGVYKAASYPL 340
>gi|146215990|gb|ABQ10197.1| actinidin Act4a [Actinidia eriantha]
Length = 385
Score = 169 bits (428), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 82/176 (46%), Positives = 118/176 (67%), Gaps = 5/176 (2%)
Query: 43 VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTND 102
V+ + E W+ ++G+SY EKE R +IFK+NL ++++ N + NR+YK+G NQFSDLT+
Sbjct: 44 VIAMFESWLVEYGKSYNALGEKERRFEIFKDNLRFVDEHNADVNRSYKVGLNQFSDLTDA 103
Query: 103 EFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWA 162
E+ ++Y G K + T+ + +Y+ +P S+DWR KGAV +KNQ CG CW
Sbjct: 104 EYSSIYLGTKF----NIRMTNVSDRYEPRVGDQLPDSVDWRKKGAVLGVKNQGNCGSCWT 159
Query: 163 FAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGI 217
FA++AAVEGI KI +GNLI LSEQ+++DC NNGC GG+ A+ +II N GI
Sbjct: 160 FASIAAVEGINKIVTGNLISLSEQEIVDCQRKYPNNGCNGGTLSGAYQFIINNGGI 215
Score = 78.6 bits (192), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 34/68 (50%), Positives = 48/68 (70%), Gaps = 4/68 (5%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD---EGLCGI 272
GIFNG CG ++DH VTIVG+GT E G +YW+++NSWG WG++GY+++ R+ G C I
Sbjct: 282 GIFNGPCGPRIDHGVTIVGYGT-EGGKDYWIVRNSWGPNWGESGYVRMQRNVGGSGKCFI 340
Query: 273 GTRSSYPL 280
YP+
Sbjct: 341 ARAPVYPV 348
>gi|8886940|gb|AAF80626.1|AC069251_19 F2D10.37 [Arabidopsis thaliana]
Length = 315
Score = 169 bits (428), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 87/206 (42%), Positives = 132/206 (64%), Gaps = 15/206 (7%)
Query: 38 THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
+H++ ++E+ E W++ ++Y+ EK +R ++FK+NL++I++ NK+G ++Y LG N+F+
Sbjct: 43 SHDK-LIELFENWISNFEKAYETVEEKFLRFEVFKDNLKHIDETNKKG-KSYWLGLNEFA 100
Query: 98 DLTNDEFRALYTGYKMPSPSHRSTTS-STFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKE 156
DL+++EF+ +Y G K S + F Y+++ VP S+DWR KGAV +KNQ
Sbjct: 101 DLSHEEFKKMYLGLKTDIVRRDEERSYAEFAYRDVEA--VPKSVDWRKKGAVAEVKNQGS 158
Query: 157 CGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQG 216
CG CWAF+ VAAVEGI KI +GNL LSEQ+L+DC T NNGC GG + AF YI++N G
Sbjct: 159 CGSCWAFSTVAAVEGINKIVTGNLTTLSEQELIDCDTTYNNGCNGGLMDYAFEYIVKNGG 218
Query: 217 IF----------NGVCGTQLDHAVTI 232
+ G C Q D + T+
Sbjct: 219 LRKEEDYPYSMEEGTCEMQKDESETV 244
>gi|357130488|ref|XP_003566880.1| PREDICTED: fruit bromelain-like [Brachypodium distachyon]
Length = 356
Score = 169 bits (428), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 86/179 (48%), Positives = 112/179 (62%), Gaps = 8/179 (4%)
Query: 47 HEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRA 106
HE+WMA+ GR Y D EK R ++F N Y++ N+ GNRTY LG N+FSDLT+DEF
Sbjct: 39 HEEWMAKFGRVYTDAQEKARRQEVFGANARYVDAVNRAGNRTYTLGLNKFSDLTDDEFVQ 98
Query: 107 LYTGYK-MPSPSHRSTTSSTFKYQNLS--MTDVPTSLDWRDKGAVTPIKNQKECGCCWAF 163
+ GY+ R + K L D+P S+DWR +GAVT +KNQ CGCCWAF
Sbjct: 99 THLGYRGHQQGGLRPEEENVSKVAALGYGQADMPESVDWRAQGAVTGVKNQGSCGCCWAF 158
Query: 164 AAVAAVEGITKIRSGNLIQLSEQQLLDCSTN----GN-NGCLGGSREKAFAYIIQNQGI 217
AAVAA EG+ KI +GNLI +SEQQ+LDC+ GN N C GG + A Y+ ++G+
Sbjct: 159 AAVAATEGLVKIATGNLISMSEQQVLDCTGQSPGMGNTNTCDGGHIDDALRYVAASRGL 217
Score = 77.4 bits (189), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 31/60 (51%), Positives = 42/60 (70%), Gaps = 2/60 (3%)
Query: 222 CGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDEGL--CGIGTRSSYP 279
CG +L+HAVT+VG+G+ + G YWL+KN WG +WG+ GYM+I R G CGI + YP
Sbjct: 293 CGQRLNHAVTVVGYGSADGGQEYWLVKNQWGTSWGEGGYMRIARGNGAPNCGISAYAYYP 352
>gi|403342666|gb|EJY70658.1| Cysteine protease [Oxytricha trifallax]
Length = 367
Score = 169 bits (428), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 101/317 (31%), Positives = 155/317 (48%), Gaps = 78/317 (24%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
+ S+ ++++H RS+ + E + RL IF++ E ++ N +++YKL N+FSD+
Sbjct: 50 DPSIQTAFNNFVSRHQRSFLTQEEYKARLAIFRDTFEAVQLHNSLESKSYKLAINKFSDM 109
Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFK---YQNLSMTDVPTSLDWRDKGAVTPIKNQ-K 155
+ DEF + ++ ++P+ S+ ++ + + P SLDWRDKGAV P+ Q K
Sbjct: 110 SKDEF-SKFSSLQLPAEDDEEEESNQYQEDDDDDDLLLGAPQSLDWRDKGAVNPVFEQTK 168
Query: 156 ECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYI--- 211
CG C+A + AVE KI++G L++LS+QQ+LDC+ GN GC GG A+ Y+
Sbjct: 169 LCGACYAMSTTGAVEAAYKIKTGKLVELSKQQILDCAGRYGNAGCSGGYMVNAYKYMVEN 228
Query: 212 ---------------------------------------------IQNQGIFNGV----- 221
IQNQ + GV
Sbjct: 229 KLMLHKDYPYVNKNQKCQVDTTKTVTGIKGYTSLPANDPVALFNAIQNQPVSVGVQSSKV 288
Query: 222 --------------CGTQLDHAVTIVGFGTTE-DGANYWLIKNSWGNTWGDAGYMKIVRD 266
CG +DHA+ ++G+G + G +YWL+KNSWG WGD GY+KI+RD
Sbjct: 289 LFHQYKSGVLDDSRCGQAIDHAMLLIGYGNDKASGKDYWLVKNSWGEDWGDLGYVKILRD 348
Query: 267 E----GLCGIGTRSSYP 279
G+CGI SYP
Sbjct: 349 MNRGGGICGINRLGSYP 365
>gi|413919735|gb|AFW59667.1| hypothetical protein ZEAMMB73_680472 [Zea mays]
Length = 344
Score = 169 bits (427), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 82/189 (43%), Positives = 117/189 (61%), Gaps = 6/189 (3%)
Query: 32 VVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN---KEGNRT 88
+VS E+ ++ +WMA HGR+Y E+E R ++F++NL Y++ N G +
Sbjct: 31 IVSYGERSEEEARRMYAEWMAAHGRTYNAVGEEERRFEVFRDNLRYVDAHNAAADAGVHS 90
Query: 89 YKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAV 148
++LG N+F+DLTNDE+RA Y G + R N D+P S+DWR KGAV
Sbjct: 91 FRLGLNRFADLTNDEYRATYLGVRSRPQRERRLGDRYLAGDN---EDLPESVDWRAKGAV 147
Query: 149 TPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAF 208
+K+Q CG CWAF+ +AAVEGI +I +G++I LSEQ+L+DC T+ N GC GG + AF
Sbjct: 148 AEVKDQGSCGSCWAFSTIAAVEGINQIVTGDMISLSEQELVDCDTSYNQGCNGGLMDYAF 207
Query: 209 AYIIQNQGI 217
+II N GI
Sbjct: 208 EFIINNGGI 216
>gi|113120269|gb|ABI30274.1| VS-B, partial [Vasconcellea stipulata]
Length = 341
Score = 169 bits (427), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 83/174 (47%), Positives = 113/174 (64%), Gaps = 3/174 (1%)
Query: 41 QSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLT 100
+S + + E WM +H + YK EK R + FK+NL YI++ NK+ N +Y LG N+F+DLT
Sbjct: 42 ESSIRLFESWMLKHDKVYKTIDEKIYRFETFKDNLMYIDETNKK-NNSYWLGLNEFADLT 100
Query: 101 NDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCC 160
+DEF+ Y G +P S S ++ N + D P S+DWR KGAVTP+KNQ CG C
Sbjct: 101 HDEFKEKYVG-SIPEDSMIIEQSDDVEFPNKHVVDYPESIDWRQKGAVTPVKNQNPCGSC 159
Query: 161 WAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQN 214
WAF+ VA VEGI KI +GNLI LSEQ+LLDC ++GC GG + + Y++ N
Sbjct: 160 WAFSTVATVEGINKIVTGNLISLSEQELLDCDRR-SHGCKGGYQTTSLKYVVDN 212
Score = 68.9 bits (167), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 32/53 (60%), Positives = 37/53 (69%), Gaps = 5/53 (9%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDEG 268
G+F G CGT+LDHAVT VG+G +Y LIKNSWG WGD GY+KI R G
Sbjct: 281 GVFGGPCGTKLDHAVTAVGYG-----KDYILIKNSWGPKWGDKGYIKIKRASG 328
>gi|359483753|ref|XP_002266308.2| PREDICTED: oryzain alpha chain-like [Vitis vinifera]
Length = 501
Score = 169 bits (427), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 115/353 (32%), Positives = 163/353 (46%), Gaps = 85/353 (24%)
Query: 9 GSFKINTTPMFIIITLLVSCASQ------VVSSRSTHEQSVVEIHEKWMAQHGRSYKDEL 62
GS KI + I L +S + E+ V E+ W +H R YK
Sbjct: 2 GSQKIQLALVLFIWASLACLSSSLPTEFYITGEEFASEERVRELFHLWKERHKRVYKHAE 61
Query: 63 EKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRST- 121
E R +IFKENL+Y+ + N +G+R + LG N+F+D++N+EF+ Y + ++
Sbjct: 62 ETAKRFEIFKENLKYVIERNSKGHR-HTLGMNKFADMSNEEFKEKYLSKIKKPINKKNNY 120
Query: 122 -TSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNL 180
S + + + + P+SLDWR KG VT IK+Q +CG CWAF++ A+EGI I +G+L
Sbjct: 121 LRRSMQQKKGTASCEAPSSLDWRKKGVVTGIKDQGDCGSCWAFSSTGAMEGINAIVTGDL 180
Query: 181 IQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQ--------------------------- 213
I LSEQ+L+DC T N GC GG + AF ++I
Sbjct: 181 ISLSEQELVDCDTT-NYGCEGGYMDYAFEWVISNGGIDSESDYPYTGTDGTCNTTKEDTK 239
Query: 214 ----------------------NQGIFNGVCGTQLD---------------------HAV 230
NQ I G+ G+ LD HAV
Sbjct: 240 VVSIDGYKDVDESDSALLCAAVNQPISVGMDGSALDFQLYTSGIYAGDCSDDPDDIDHAV 299
Query: 231 TIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDEGL----CGIGTRSSYP 279
IVG+G+ ED +YW+ KNSWG +WG GY I R+ L C I +SYP
Sbjct: 300 LIVGYGS-EDSEDYWICKNSWGTSWGMEGYFYIKRNTDLPYGECAINAMASYP 351
>gi|18394919|ref|NP_564126.1| Xylem cysteine proteinase 2 [Arabidopsis thaliana]
gi|71153409|sp|Q9LM66.2|XCP2_ARATH RecName: Full=Xylem cysteine proteinase 2; Short=AtXCP2; Flags:
Precursor
gi|4836904|gb|AAD30607.1|AC007369_17 Putative cysteine proteinase [Arabidopsis thaliana]
gi|6708183|gb|AAF25832.1|AF191028_1 papain-type cysteine endopeptidase XCP2 [Arabidopsis thaliana]
gi|28466959|gb|AAO44088.1| At1g20850 [Arabidopsis thaliana]
gi|110743795|dbj|BAE99733.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|332191910|gb|AEE30031.1| Xylem cysteine proteinase 2 [Arabidopsis thaliana]
Length = 356
Score = 168 bits (426), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 87/206 (42%), Positives = 132/206 (64%), Gaps = 15/206 (7%)
Query: 38 THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
+H++ ++E+ E W++ ++Y+ EK +R ++FK+NL++I++ NK+G ++Y LG N+F+
Sbjct: 43 SHDK-LIELFENWISNFEKAYETVEEKFLRFEVFKDNLKHIDETNKKG-KSYWLGLNEFA 100
Query: 98 DLTNDEFRALYTGYKMPSPSHRSTTS-STFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKE 156
DL+++EF+ +Y G K S + F Y+++ VP S+DWR KGAV +KNQ
Sbjct: 101 DLSHEEFKKMYLGLKTDIVRRDEERSYAEFAYRDVEA--VPKSVDWRKKGAVAEVKNQGS 158
Query: 157 CGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQG 216
CG CWAF+ VAAVEGI KI +GNL LSEQ+L+DC T NNGC GG + AF YI++N G
Sbjct: 159 CGSCWAFSTVAAVEGINKIVTGNLTTLSEQELIDCDTTYNNGCNGGLMDYAFEYIVKNGG 218
Query: 217 IFN----------GVCGTQLDHAVTI 232
+ G C Q D + T+
Sbjct: 219 LRKEEDYPYSMEEGTCEMQKDESETV 244
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 32/68 (47%), Positives = 49/68 (72%), Gaps = 5/68 (7%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCG 271
G+F+G CG LDH V VG+G+++ G++Y ++KNSWG WG+ GY+++ R+ EGLCG
Sbjct: 286 GVFDGRCGVDLDHGVAAVGYGSSK-GSDYIIVKNSWGPKWGEKGYIRLKRNTGKPEGLCG 344
Query: 272 IGTRSSYP 279
I +S+P
Sbjct: 345 INKMASFP 352
>gi|226496089|ref|NP_001149658.1| cysteine protease 1 precursor [Zea mays]
gi|195629242|gb|ACG36262.1| cysteine protease 1 precursor [Zea mays]
Length = 469
Score = 168 bits (426), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 83/189 (43%), Positives = 118/189 (62%), Gaps = 6/189 (3%)
Query: 32 VVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN---KEGNRT 88
+VS E+ ++ +WMA HGR+Y E+E R ++F++NL Y++ N G +
Sbjct: 31 IVSYGERSEEEARRMYAEWMAAHGRTYNAVGEEERRFEVFRDNLRYVDAHNAAADAGVHS 90
Query: 89 YKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAV 148
++LG N+F+DLTNDE+RA Y G + R +Y D+P S+DWR KGAV
Sbjct: 91 FRLGLNRFADLTNDEYRATYLGVRSRPQRERRLGD---RYLAGDNEDLPESVDWRAKGAV 147
Query: 149 TPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAF 208
IK+Q CG CWAF+ +AAVEGI +I +G++I LSEQ+L+DC T+ N GC GG + AF
Sbjct: 148 AEIKDQGSCGSCWAFSTIAAVEGINQIVTGDMISLSEQELVDCDTSYNQGCNGGLMDYAF 207
Query: 209 AYIIQNQGI 217
+II N GI
Sbjct: 208 EFIINNGGI 216
Score = 90.9 bits (224), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 40/76 (52%), Positives = 55/76 (72%), Gaps = 5/76 (6%)
Query: 209 AYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-- 266
A+ + N GIF G CGT LDH VT VG+GT E+G +YW++KNSWG++WG++GY+++ R+
Sbjct: 276 AFQLYNSGIFTGTCGTALDHGVTAVGYGT-ENGKDYWIVKNSWGSSWGESGYVRMERNIK 334
Query: 267 --EGLCGIGTRSSYPL 280
G CGI SYPL
Sbjct: 335 ASSGKCGIAVEPSYPL 350
>gi|255522980|gb|ACU12382.1| RE21773p [Drosophila melanogaster]
Length = 375
Score = 168 bits (426), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 106/317 (33%), Positives = 154/317 (48%), Gaps = 83/317 (26%)
Query: 46 IHEKWMA---QHGRSYKDELEKEMRLKIFKENLEYIEKANK---EGNRTYKLGTNQFSDL 99
+ E+W +H ++Y+DE E+ RLKIF EN I K N+ EG ++KL N+++DL
Sbjct: 59 VMEEWHTFKLEHRKNYQDETEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADL 118
Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFK---YQNLSMTDVPTSLDWRDKGAVTPIKNQKE 156
+ EFR L G+ +FK + + + +P S+DWR KGAVT +K+Q
Sbjct: 119 LHHEFRQLMNGFNYTLHKQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQGH 178
Query: 157 CGCCWAFAAVAAVEGITKIRSGNLIQLSEQ------------------------------ 186
CG CWAF++ A+EG +SG L+ LSEQ
Sbjct: 179 CGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNG 238
Query: 187 ----------QLLDCSTNGNNGCLGGSREKAFAYIIQ----------------------- 213
+ +D S + N G +G + ++ F I Q
Sbjct: 239 GIDTEKSYPYEAIDDSCHFNKGTVGAT-DRGFTDIPQGDEKKMAEAVATVGPVSVAIDAS 297
Query: 214 -------NQGIFN-GVCGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIV 264
++G++N C Q LDH V +VGFGT E G +YWL+KNSWG TWGD G++K++
Sbjct: 298 HESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESGEDYWLVKNSWGTTWGDKGFIKML 357
Query: 265 RD-EGLCGIGTRSSYPL 280
R+ E CGI + SSYPL
Sbjct: 358 RNKENQCGIASASSYPL 374
>gi|24653514|ref|NP_523735.2| cysteine proteinase-1, isoform C [Drosophila melanogaster]
gi|118572624|sp|Q95029.2|CATL_DROME RecName: Full=Cathepsin L; AltName: Full=Cysteine proteinase 1;
Contains: RecName: Full=Cathepsin L heavy chain;
Contains: RecName: Full=Cathepsin L light chain; Flags:
Precursor
gi|21627209|gb|AAM68565.1| cysteine proteinase-1, isoform C [Drosophila melanogaster]
Length = 371
Score = 168 bits (426), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 106/317 (33%), Positives = 154/317 (48%), Gaps = 83/317 (26%)
Query: 46 IHEKWMA---QHGRSYKDELEKEMRLKIFKENLEYIEKANK---EGNRTYKLGTNQFSDL 99
+ E+W +H ++Y+DE E+ RLKIF EN I K N+ EG ++KL N+++DL
Sbjct: 55 VMEEWHTFKLEHRKNYQDETEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADL 114
Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFK---YQNLSMTDVPTSLDWRDKGAVTPIKNQKE 156
+ EFR L G+ +FK + + + +P S+DWR KGAVT +K+Q
Sbjct: 115 LHHEFRQLMNGFNYTLHKQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQGH 174
Query: 157 CGCCWAFAAVAAVEGITKIRSGNLIQLSEQ------------------------------ 186
CG CWAF++ A+EG +SG L+ LSEQ
Sbjct: 175 CGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNG 234
Query: 187 ----------QLLDCSTNGNNGCLGGSREKAFAYIIQ----------------------- 213
+ +D S + N G +G + ++ F I Q
Sbjct: 235 GIDTEKSYPYEAIDDSCHFNKGTVGAT-DRGFTDIPQGDEKKMAEAVATVGPVSVAIDAS 293
Query: 214 -------NQGIFN-GVCGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIV 264
++G++N C Q LDH V +VGFGT E G +YWL+KNSWG TWGD G++K++
Sbjct: 294 HESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESGEDYWLVKNSWGTTWGDKGFIKML 353
Query: 265 RD-EGLCGIGTRSSYPL 280
R+ E CGI + SSYPL
Sbjct: 354 RNKENQCGIASASSYPL 370
>gi|168063167|ref|XP_001783545.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664932|gb|EDQ51634.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 461
Score = 168 bits (426), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 90/209 (43%), Positives = 125/209 (59%), Gaps = 18/209 (8%)
Query: 39 HEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSD 98
HE ++E W +HG++Y D + R ++K+NL YI + E NRTY LG +F+D
Sbjct: 46 HENLLLEQFAAWAHKHGKAYHDAEQCLHRFAVWKDNLAYIRHS--ETNRTYSLGLTKFAD 103
Query: 99 LTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECG 158
LTN+EFR +YTG ++ S R+ + F+Y + ++ P S+DWR GAVT +K+Q CG
Sbjct: 104 LTNEEFRRMYTGTRIDR-SRRAKRRTGFRYAD---SEAPESVDWRKNGAVTSVKDQGSCG 159
Query: 159 CCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI- 217
CWAF+AV +VEGI IR+G + LSEQ+L+DC N GC GG + AF +IIQN GI
Sbjct: 160 SCWAFSAVGSVEGINAIRNGEAVSLSEQELVDCDLEYNQGCNGGLMDYAFDFIIQNGGID 219
Query: 218 ---------FNGVCGTQLD--HAVTIVGF 235
F+G C H VTI G+
Sbjct: 220 TEKDYPYKGFDGRCDNSKKNAHVVTIDGY 248
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 37/73 (50%), Positives = 50/73 (68%), Gaps = 8/73 (10%)
Query: 215 QGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-------E 267
QG+F+G CGT LDH V VG+GT EDG +YW++KNSWG WG++GY+++ R+
Sbjct: 284 QGVFSGECGTDLDHGVLAVGYGT-EDGVDYWIVKNSWGEYWGESGYLRMKRNMKDSNDGP 342
Query: 268 GLCGIGTRSSYPL 280
GLCGI SY +
Sbjct: 343 GLCGINIEPSYAV 355
>gi|297816028|ref|XP_002875897.1| hypothetical protein ARALYDRAFT_347926 [Arabidopsis lyrata subsp.
lyrata]
gi|297321735|gb|EFH52156.1| hypothetical protein ARALYDRAFT_347926 [Arabidopsis lyrata subsp.
lyrata]
Length = 361
Score = 168 bits (426), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 91/232 (39%), Positives = 135/232 (58%), Gaps = 23/232 (9%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHG--RSYKDELEKEMRLKIFKENL 75
+F ++ L +C E+ + +++++W + H RS E+E R +F+ N+
Sbjct: 9 LFSLVILETACGFDYEDKEIESEEGLSKLYDRWRSHHSVPRSLH---EREKRFNVFRHNV 65
Query: 76 EYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHR-----STTSSTFKYQN 130
++ +NK+ NR+YKL N+F+DLT EF+ YTG K+ HR S F Y +
Sbjct: 66 MHVHNSNKK-NRSYKLKLNKFADLTIHEFKNAYTGSKIKH--HRMLQGPKRGSKQFMYDH 122
Query: 131 LSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLD 190
+++ +P+S+DWR KGAVT IKNQ +CG CWAF+ VAAVEGI KI++ L+ LSEQ+L+D
Sbjct: 123 ENVSKLPSSVDWRKKGAVTEIKNQGKCGSCWAFSTVAAVEGINKIKTNKLVSLSEQELVD 182
Query: 191 CSTNGNNGCLGGSREKAFAYIIQNQGI----------FNGVCGTQLDHAVTI 232
C TN N GC GG E AF +I +N GI +G C D+ V +
Sbjct: 183 CDTNQNEGCNGGLMEIAFEFIKKNGGITTEDSYPYEGIDGKCDASKDNGVLV 234
Score = 80.5 bits (197), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 34/70 (48%), Positives = 48/70 (68%), Gaps = 5/70 (7%)
Query: 215 QGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR----DEGLC 270
+G+F G CGT+L+H V VG+G+ + G YW+++NSWG WG+ GY+KI R EG C
Sbjct: 275 EGVFTGDCGTELNHGVATVGYGS-QGGKKYWIVRNSWGTEWGEGGYIKIERGIDEPEGRC 333
Query: 271 GIGTRSSYPL 280
GI +SYP+
Sbjct: 334 GIAMEASYPI 343
>gi|297845064|ref|XP_002890413.1| hypothetical protein ARALYDRAFT_472321 [Arabidopsis lyrata subsp.
lyrata]
gi|297336255|gb|EFH66672.1| hypothetical protein ARALYDRAFT_472321 [Arabidopsis lyrata subsp.
lyrata]
Length = 357
Score = 168 bits (426), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 92/245 (37%), Positives = 143/245 (58%), Gaps = 28/245 (11%)
Query: 13 INTTPMFIIITLLVSCASQVVSSRSTHEQSVV--------------EIHEKWMAQHGRSY 58
+++ + L +S A+ +S ++H+ S+V E+ E W++ ++Y
Sbjct: 3 LSSPSRILCFPLALSAATLSLSVAASHDYSIVGYSPEDLESHDKLIELFENWISNFEKAY 62
Query: 59 KDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSH 118
+ EK +R ++FK+NL++I++ NK+ ++Y LG N+F+DL+++EF+ +Y G K
Sbjct: 63 ETVEEKLLRFEVFKDNLKHIDETNKK-VKSYWLGLNEFADLSHEEFKKMYLGLKTDIVRR 121
Query: 119 RSTTS-STFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRS 177
S + F Y+++ VP S+DWR KGAV +KNQ CG CWAF+ VAAVEGI KI +
Sbjct: 122 DEERSYAEFAYRDVEA--VPKSVDWRKKGAVAEVKNQGSCGSCWAFSTVAAVEGINKIVT 179
Query: 178 GNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIFN----------GVCGTQLD 227
GNL LSEQ+L+DC T NNGC GG + AF YI++N G+ G C Q D
Sbjct: 180 GNLTTLSEQELIDCDTTYNNGCNGGLMDYAFEYIVKNGGLRKEEDYPYSMEEGTCEMQKD 239
Query: 228 HAVTI 232
+ T+
Sbjct: 240 ESETV 244
>gi|146215982|gb|ABQ10193.1| actinidin Act2b [Actinidia eriantha]
Length = 378
Score = 168 bits (425), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 89/206 (43%), Positives = 122/206 (59%), Gaps = 6/206 (2%)
Query: 13 INTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFK 72
I+ + +F L++S A + +S V+ ++E W+ + G+SY EKEMR +IFK
Sbjct: 8 ISMSLLFFSTLLILSLALDIENSVQRTNDQVMAMYESWLVEQGKSYNSLDEKEMRFEIFK 67
Query: 73 ENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
ENL I+ N + NR+Y LG N+F+DLT++E+R+ Y G KM + S +Y
Sbjct: 68 ENLRIIDDHNADANRSYSLGLNRFADLTDEEYRSTYLGLKMGPKTDVSN-----EYMPKV 122
Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
+P +DWR GAV +KNQ C CWAF+AV AVEGI KI +GNLI LSEQ+L+DC
Sbjct: 123 GEALPDYVDWRTVGAVVGVKNQGLCSSCWAFSAVTAVEGINKIVTGNLISLSEQELVDCG 182
Query: 193 -TNGNNGCLGGSREKAFAYIIQNQGI 217
T GC G AF +II N GI
Sbjct: 183 RTQRTKGCNRGLMTDAFQFIINNGGI 208
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 39/82 (47%), Positives = 52/82 (63%), Gaps = 4/82 (4%)
Query: 202 GSREKAFAYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYM 261
G + + + GIF G CGT +DH VTIVG+GT E G +YW++KNSWG WG+ GY+
Sbjct: 261 GVESEGGKFKLYTSGIFTGFCGTAVDHGVTIVGYGT-ERGMDYWIVKNSWGTNWGENGYI 319
Query: 262 KIVRD---EGLCGIGTRSSYPL 280
+I R+ G CGI SYP+
Sbjct: 320 RIQRNIGGAGKCGIARMPSYPV 341
>gi|357166359|ref|XP_003580684.1| PREDICTED: oryzain alpha chain-like [Brachypodium distachyon]
Length = 456
Score = 168 bits (425), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 81/189 (42%), Positives = 124/189 (65%), Gaps = 6/189 (3%)
Query: 32 VVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN---KEGNRT 88
+VS E+ V ++ +WMA++GR+Y E+E R ++F++NL Y+++ N G +
Sbjct: 27 IVSYGERSEEEVRRMYVEWMAENGRTYNAIGEEERRFEVFRDNLRYVDQHNAAADAGLHS 86
Query: 89 YKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAV 148
++LG N+F+DLTN+E+R Y G + R + +YQ ++P S+DWR+KGAV
Sbjct: 87 FRLGLNRFADLTNEEYRDTYLGVRTKPVRERRLSG---RYQAADNEELPESVDWREKGAV 143
Query: 149 TPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAF 208
+K+Q CG CWAF+A+AAVEGI +I +G++I LSEQ+L+DC T+ N GC GG + AF
Sbjct: 144 AKVKDQGGCGSCWAFSAIAAVEGINQIVTGDMIALSEQELVDCDTSYNQGCNGGLMDYAF 203
Query: 209 AYIIQNQGI 217
+II N GI
Sbjct: 204 EFIINNGGI 212
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 38/76 (50%), Positives = 51/76 (67%), Gaps = 5/76 (6%)
Query: 209 AYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-- 266
A+ + GIF G CGT LDH VT VG+G+ E+G +YW++KNSWG WG+ GY+++ R+
Sbjct: 272 AFQLYKSGIFTGRCGTALDHGVTAVGYGS-ENGKDYWIVKNSWGTVWGEDGYVRLERNIK 330
Query: 267 --EGLCGIGTRSSYPL 280
G CGI SYPL
Sbjct: 331 ATSGKCGIAIEPSYPL 346
>gi|50355619|dbj|BAD29958.1| cysteine protease [Daucus carota]
Length = 496
Score = 168 bits (425), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 83/192 (43%), Positives = 120/192 (62%), Gaps = 6/192 (3%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
+ + E W+ HG+SY E+E R +IFK NL YI++ N +R +KLG N+F+DL
Sbjct: 38 DDEATTLFESWLVTHGKSYNALGEEEKRFQIFKNNLRYIDEQNLVEDRGFKLGLNKFADL 97
Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGC 159
TN+E+R+ YTG K + ++ + +Y LS +P S+DWR+ GAV +K+Q CG
Sbjct: 98 TNEEYRSKYTGIKSKD-LRKKVSAKSGRYATLSGESLPESVDWRESGAVATVKDQGSCGS 156
Query: 160 CWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIFN 219
CWAF+ ++AVEGI +I +G LI LSEQ+L+DC + N GC GG + AF +II N GI
Sbjct: 157 CWAFSTISAVEGINQIATGKLITLSEQELVDCDRSYNEGCNGGLMDYAFEFIINNGGI-- 214
Query: 220 GVCGTQLDHAVT 231
T +D+ T
Sbjct: 215 ---DTDVDYPYT 223
Score = 80.5 bits (197), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 34/71 (47%), Positives = 48/71 (67%), Gaps = 5/71 (7%)
Query: 214 NQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR----DEGL 269
+ GIF G CG LDH V +VG+GT E+G +YW+++NSWG WG+ GY+++ R G+
Sbjct: 279 DSGIFTGKCGIALDHGVVVVGYGT-ENGKDYWIVRNSWGADWGENGYLRMERGISSKTGI 337
Query: 270 CGIGTRSSYPL 280
CGI SYP+
Sbjct: 338 CGIAIEPSYPV 348
>gi|413919736|gb|AFW59668.1| cysteine protease 1 [Zea mays]
Length = 469
Score = 168 bits (425), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 82/189 (43%), Positives = 118/189 (62%), Gaps = 6/189 (3%)
Query: 32 VVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN---KEGNRT 88
+VS E+ ++ +WMA HGR+Y E+E R ++F++NL Y++ N G +
Sbjct: 31 IVSYGERSEEEARRMYAEWMAAHGRTYNAVGEEERRFEVFRDNLRYVDAHNAAADAGVHS 90
Query: 89 YKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAV 148
++LG N+F+DLTNDE+RA Y G + R +Y D+P S+DWR KGAV
Sbjct: 91 FRLGLNRFADLTNDEYRATYLGVRSRPQRERRLGD---RYLAGDNEDLPESVDWRAKGAV 147
Query: 149 TPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAF 208
+K+Q CG CWAF+ +AAVEGI +I +G++I LSEQ+L+DC T+ N GC GG + AF
Sbjct: 148 AEVKDQGSCGSCWAFSTIAAVEGINQIVTGDMISLSEQELVDCDTSYNQGCNGGLMDYAF 207
Query: 209 AYIIQNQGI 217
+II N GI
Sbjct: 208 EFIINNGGI 216
Score = 90.9 bits (224), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 40/76 (52%), Positives = 55/76 (72%), Gaps = 5/76 (6%)
Query: 209 AYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-- 266
A+ + N GIF G CGT LDH VT VG+GT E+G +YW++KNSWG++WG++GY+++ R+
Sbjct: 276 AFQLYNSGIFTGTCGTALDHGVTAVGYGT-ENGKDYWIVKNSWGSSWGESGYVRMERNIK 334
Query: 267 --EGLCGIGTRSSYPL 280
G CGI SYPL
Sbjct: 335 ASSGKCGIAVEPSYPL 350
>gi|302796898|ref|XP_002980210.1| hypothetical protein SELMODRAFT_153766 [Selaginella moellendorffii]
gi|300151826|gb|EFJ18470.1| hypothetical protein SELMODRAFT_153766 [Selaginella moellendorffii]
Length = 479
Score = 168 bits (425), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 88/211 (41%), Positives = 127/211 (60%), Gaps = 17/211 (8%)
Query: 38 THEQSVVEIHEKWMAQHGRSYKDEL--------EKEMRLKIFKENLEYIEKANKEGNRTY 89
+ E+ + + + WM QHG+SY D EK R IFK+NL +I N E N+ Y
Sbjct: 48 SSEERLQALFDSWMLQHGKSYADNALSGDSQAGEKATRYGIFKDNLRFIHGEN-EKNQGY 106
Query: 90 KLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVT 149
LG N F+DLTN+EFRA G + R T+ F+Y ++ + D+P S+DWR+KGAV
Sbjct: 107 FLGLNAFADLTNEEFRAQRHGGRFDRSRER-TSHEEFRYGSVQLKDLPDSIDWREKGAVV 165
Query: 150 PIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFA 209
+K+Q CG CWAF+AVAA+EG+ K+ +G L+ LSEQ+L+DC + GC GG + AF
Sbjct: 166 GVKDQGSCGSCWAFSAVAAIEGVNKLATGELVSLSEQELVDCDKGEDEGCNGGLMDYAFG 225
Query: 210 YIIQNQGIFNGVCGTQLDHAVTIVGFGTTED 240
++I+N G+ T+ D+ G+GT D
Sbjct: 226 FVIKNGGL-----DTEADYPYK--GYGTRCD 249
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 40/68 (58%), Positives = 48/68 (70%), Gaps = 5/68 (7%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCG 271
GIF G CGT LDH VT VG+G EDG YW+IKNSWG+ WG+ GY+K+ R+ GLCG
Sbjct: 300 GIFTGRCGTDLDHGVTNVGYGK-EDGKAYWIIKNSWGSNWGEKGYVKMARNTGLAAGLCG 358
Query: 272 IGTRSSYP 279
I +SYP
Sbjct: 359 INMEASYP 366
>gi|194883222|ref|XP_001975702.1| GG20414 [Drosophila erecta]
gi|190658889|gb|EDV56102.1| GG20414 [Drosophila erecta]
Length = 341
Score = 168 bits (425), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 105/317 (33%), Positives = 156/317 (49%), Gaps = 83/317 (26%)
Query: 46 IHEKWMA---QHGRSYKDELEKEMRLKIFKENLEYIEKANK---EGNRTYKLGTNQFSDL 99
+ E+W +H ++Y+D+ E+ RLKIF EN I K N+ EG ++KL N+++DL
Sbjct: 25 VMEEWHTFKLEHRKNYQDDTEERFRLKIFNENKHKIAKHNQRYAEGKVSFKLAVNKYADL 84
Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFK---YQNLSMTDVPTSLDWRDKGAVTPIKNQKE 156
+ EFR L G+ +T +FK + + + +P S+DWR KGAVT +K+Q
Sbjct: 85 LHHEFRQLMNGFNYTLHKQLRSTDDSFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQGH 144
Query: 157 CGCCWAFAAVAAVEGITKIRSGNLIQLSEQQL---------------------------- 188
CG CWAF++ A+EG +SG L+ LSEQ L
Sbjct: 145 CGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNG 204
Query: 189 ------------LDCSTNGNNGCLGGSREKAFAYIIQ----------------------- 213
+D S + N G +G + ++ F I Q
Sbjct: 205 GIDTEKSYPYEAIDDSCHFNKGAIGAT-DRGFTDIPQGDEKKMAEAVATVGPVAVAIDAS 263
Query: 214 -------NQGIFN-GVCGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIV 264
++G++N C Q LDH V +VG+GT E G +YWL+KNSWG TWGD G++K++
Sbjct: 264 HESFQFYSEGVYNEPQCDAQNLDHGVLVVGYGTDESGDDYWLVKNSWGTTWGDKGFIKML 323
Query: 265 RD-EGLCGIGTRSSYPL 280
R+ + CGI + SSYPL
Sbjct: 324 RNKDNQCGIASASSYPL 340
>gi|357446979|ref|XP_003593765.1| Cysteine proteinase [Medicago truncatula]
gi|355482813|gb|AES64016.1| Cysteine proteinase [Medicago truncatula]
Length = 364
Score = 167 bits (424), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 86/151 (56%), Positives = 104/151 (68%), Gaps = 2/151 (1%)
Query: 63 EKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTT 122
E E R +IFK NLEYIE N GN++YKLG NQ+SDLT+DEF A +TG K+ S
Sbjct: 78 ELEKRKRIFKNNLEYIENFNNAGNKSYKLGLNQYSDLTSDEFLASHTGLKVSKQLSSSKM 137
Query: 123 SSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQ 182
S NL+ DVPT+ DWR +GAVT +K+Q CGCCWAF+ VAAVEG KI +G LI
Sbjct: 138 RSAAVPFNLN-DDVPTNFDWRQQGAVTDVKDQGSCGCCWAFSVVAAVEGAVKINTGELIS 196
Query: 183 LSEQQLLDCSTNGNNGCLGGSREKAFAYIIQ 213
LSEQQL+DC N+GC GG+ + AF YIIQ
Sbjct: 197 LSEQQLVDCDER-NSGCHGGNMDSAFKYIIQ 226
Score = 89.0 bits (219), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 36/68 (52%), Positives = 49/68 (72%), Gaps = 4/68 (5%)
Query: 217 IFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE----GLCGI 272
+++G CG ++HAVT VG+G +EDG YWLIKNSWG WG+ GYMK++R+ G CGI
Sbjct: 296 VYSGTCGQSMNHAVTAVGYGVSEDGTKYWLIKNSWGKGWGEEGYMKLLRESGEPGGQCGI 355
Query: 273 GTRSSYPL 280
+SYP+
Sbjct: 356 AAHASYPI 363
>gi|2804262|dbj|BAA24442.1| cysteine proteinase [Sitophilus zeamais]
Length = 338
Score = 167 bits (424), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 107/339 (31%), Positives = 171/339 (50%), Gaps = 82/339 (24%)
Query: 22 ITLLVSCASQVVSSRSTHEQSVVEIHEKWMA---QHGRSYKDELEKEMRLKIFKENLEYI 78
+ L + A+ V+S ++ +V+ E+W + QH ++Y E E+ R+KIF EN +
Sbjct: 1 MKLFLILAAVVISCQAVSFYDLVQ--EQWSSFKMQHSKNYDSETEERFRMKIFMENAHKV 58
Query: 79 EKANK---EGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPS--HRSTTSSTFKYQNLSM 133
K NK +G +KLG N+++D+ + EF + G+ + S + ++ + +
Sbjct: 59 AKHNKLFSQGFVKFKLGLNKYADMLHHEFVSTLNGFNKTKNNILKGSDLNDAVRFISPAN 118
Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS- 192
+P ++DWRDKGAVT +K+Q CG CW+F+A ++EG ++G L+ LSEQ L+DCS
Sbjct: 119 VKLPDTVDWRDKGAVTEVKDQGHCGSCWSFSATGSLEGQHFRKTGKLVSLSEQNLVDCSG 178
Query: 193 ---TNGNNGCL------------GGSREKAFAYII---------QNQGIFN--------- 219
NG NG L G EK++ Y+ QN G +
Sbjct: 179 RYGNNGCNGGLMDNAFRYIKDNGGIDTEKSYPYLAEDEKCHYKAQNSGATDKGFVDIEEA 238
Query: 220 -------------------------------GV-----CGTQ-LDHAVTIVGFGTTEDGA 242
GV C +Q LDH V +VG+GT++DG
Sbjct: 239 NEDDLKAAVATVGPVSIAIDASHETFQLYSDGVYSDPECSSQELDHGVLVVGYGTSDDGQ 298
Query: 243 NYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 280
+YWL+KNSWG +WG GY+K+ R+ + +CG+ +++SYPL
Sbjct: 299 DYWLVKNSWGPSWGLNGYIKMARNQDNMCGVASQASYPL 337
>gi|111073715|dbj|BAF02546.1| triticain alpha [Triticum aestivum]
gi|388890585|gb|AFK80346.1| cysteine endopeptidase EP alpha [Secale cereale x Triticum durum]
Length = 461
Score = 167 bits (424), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 81/189 (42%), Positives = 124/189 (65%), Gaps = 6/189 (3%)
Query: 32 VVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN---KEGNRT 88
+VS E+ V ++ +WM++H R+Y E+E R ++F++NL YI++ N G +
Sbjct: 26 IVSYGERSEEEVRRMYAEWMSEHRRTYNAIGEEERRFEVFRDNLRYIDQHNAAADAGLHS 85
Query: 89 YKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAV 148
++LG N+F+DLTN+E+R+ Y G + R ++ +YQ ++P ++DWR KGAV
Sbjct: 86 FRLGLNRFADLTNEEYRSTYLGARTKPDRERKLSA---RYQADDNEELPETVDWRKKGAV 142
Query: 149 TPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAF 208
IK+Q CG CWAF+A+AAVEGI +I +G++I LSEQ+L+DC T+ N GC GG + AF
Sbjct: 143 AAIKDQGGCGSCWAFSAIAAVEGINQIVTGDMIPLSEQELVDCDTSYNEGCNGGLMDYAF 202
Query: 209 AYIIQNQGI 217
+II N GI
Sbjct: 203 EFIINNGGI 211
Score = 84.0 bits (206), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 37/75 (49%), Positives = 49/75 (65%), Gaps = 5/75 (6%)
Query: 209 AYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-- 266
A+ + GIF G CGT LDH V VG+GT E+G +YWL++NSWG WG+ GY+++ R+
Sbjct: 271 AFQLYKSGIFTGTCGTALDHGVAAVGYGT-ENGKDYWLVRNSWGTVWGEDGYIRMERNIK 329
Query: 267 --EGLCGIGTRSSYP 279
G CGI SYP
Sbjct: 330 ASSGKCGIAVEPSYP 344
>gi|146215986|gb|ABQ10195.1| actinidin Act2d [Actinidia eriantha]
Length = 381
Score = 167 bits (424), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 88/206 (42%), Positives = 123/206 (59%), Gaps = 6/206 (2%)
Query: 13 INTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFK 72
I+ + +F L++S A + +S V+ ++E W+ + G+SY EKEMR +IFK
Sbjct: 10 ISMSLLFFSTLLILSSALDIKNSVQRTNDQVMAMYESWLVEQGKSYNSLDEKEMRFEIFK 69
Query: 73 ENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
ENL I+ N + NR+Y LG N+F+DLT++E+R+ Y G+K + S +Y
Sbjct: 70 ENLRIIDDHNADANRSYSLGLNRFADLTDEEYRSTYLGFKSGPKAKVSN-----RYVPKV 124
Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
+P +DWR GAV +K+Q C CWAF+AVAAVEGI KI +GNLI LSEQ+L+DC
Sbjct: 125 GVVLPNYVDWRTVGAVVGVKDQGLCSSCWAFSAVAAVEGINKIVTGNLISLSEQELVDCG 184
Query: 193 -TNGNNGCLGGSREKAFAYIIQNQGI 217
T GC G AF +II N GI
Sbjct: 185 RTQRTRGCNRGYMNDAFQFIIDNGGI 210
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 37/68 (54%), Positives = 48/68 (70%), Gaps = 4/68 (5%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD---EGLCGI 272
GI+ G CGT +DH VTIVG+GT E G +YW++KNSWG WG+ GY++I R+ G CGI
Sbjct: 277 GIYTGYCGTAIDHGVTIVGYGT-ERGLDYWIVKNSWGTNWGENGYIRIQRNIGGAGKCGI 335
Query: 273 GTRSSYPL 280
SYP+
Sbjct: 336 AMVPSYPV 343
>gi|195484843|ref|XP_002090843.1| GE12574 [Drosophila yakuba]
gi|194176944|gb|EDW90555.1| GE12574 [Drosophila yakuba]
Length = 341
Score = 167 bits (424), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 105/317 (33%), Positives = 155/317 (48%), Gaps = 83/317 (26%)
Query: 46 IHEKWMA---QHGRSYKDELEKEMRLKIFKENLEYIEKANK---EGNRTYKLGTNQFSDL 99
+ E+W +H ++Y+D+ E+ RLKIF EN I K N+ EG ++KL N+++DL
Sbjct: 25 VMEEWHTFKLEHRKNYQDDTEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADL 84
Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFK---YQNLSMTDVPTSLDWRDKGAVTPIKNQKE 156
+ EFR L G+ T +FK + + + +P S+DWR KGAVT +K+Q
Sbjct: 85 LHHEFRQLMNGFNYTLHKQLRATDDSFKGVTFISPAHVTLPKSVDWRSKGAVTAVKDQGH 144
Query: 157 CGCCWAFAAVAAVEGITKIRSGNLIQLSEQ------------------------------ 186
CG CWAF++ A+EG +SG L+ LSEQ
Sbjct: 145 CGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNG 204
Query: 187 ----------QLLDCSTNGNNGCLGGSREKAFAYIIQ----------------------- 213
+ +D S + N G +G + ++ F I Q
Sbjct: 205 GIDTEKSYPYEAIDDSCHFNKGTIGAT-DRGFTDIPQGDEKKMAEAVATVGPVSVAIDAS 263
Query: 214 -------NQGIFN-GVCGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIV 264
++G++N C Q LDH V +VGFGT E G +YWL+KNSWG TWGD G++K++
Sbjct: 264 HESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESGDDYWLVKNSWGTTWGDKGFIKML 323
Query: 265 RD-EGLCGIGTRSSYPL 280
R+ + CGI + SSYPL
Sbjct: 324 RNKDNQCGIASASSYPL 340
>gi|255538210|ref|XP_002510170.1| cysteine protease, putative [Ricinus communis]
gi|223550871|gb|EEF52357.1| cysteine protease, putative [Ricinus communis]
Length = 469
Score = 167 bits (424), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 85/197 (43%), Positives = 130/197 (65%), Gaps = 13/197 (6%)
Query: 32 VVSSRSTH--------EQSVVEIHEKWMAQHGRSYKDEL---EKEMRLKIFKENLEYIEK 80
+VS TH + V+ I+E+W+ ++G+++ + EKE R ++FK+NL +I++
Sbjct: 28 IVSYDQTHLTKSSWRTDDEVMAIYEEWLVKNGKAHSNNNALGEKERRFQVFKDNLRFIDE 87
Query: 81 ANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSL 140
N E NR+YK+G N+F+DLTN+E+R++Y G + + +R + SS +Y +P S+
Sbjct: 88 HNSE-NRSYKVGLNRFADLTNEEYRSMYLGARSGAKRNRLSRSSN-RYLPRVGDSLPDSV 145
Query: 141 DWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCL 200
DWR +GAV +K+Q CG CWAF+ +AAVEGI KI +G+LI LSEQ+L+DC + N GC
Sbjct: 146 DWRKEGAVAEVKDQGSCGSCWAFSTIAAVEGINKIVTGDLISLSEQELVDCDRSYNEGCN 205
Query: 201 GGSREKAFAYIIQNQGI 217
GG + AF +II N GI
Sbjct: 206 GGLMDYAFQFIINNGGI 222
Score = 82.0 bits (201), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 39/83 (46%), Positives = 53/83 (63%), Gaps = 9/83 (10%)
Query: 202 GSREKAFAYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYM 261
G RE F GIF G CGT LDH V VG+GT E+G +YW+++NSWG +WG++GY+
Sbjct: 279 GGREFQF----YQSGIFTGRCGTALDHGVAAVGYGT-ENGKDYWIVRNSWGKSWGESGYI 333
Query: 262 KIVRD----EGLCGIGTRSSYPL 280
++ R+ G CGI SYP+
Sbjct: 334 RMERNIATATGKCGIAIEPSYPI 356
>gi|125525815|gb|EAY73929.1| hypothetical protein OsI_01813 [Oryza sativa Indica Group]
Length = 336
Score = 167 bits (423), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 90/233 (38%), Positives = 131/233 (56%), Gaps = 24/233 (10%)
Query: 18 MFIIITLLV--SCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENL 75
+ ++ TL+ + A+ + + + +++ E+WMA+ G++YK EKE R IF++N+
Sbjct: 6 LLVVCTLMALQAMAASAYYNNGSDDGVTMQMFEEWMAKFGKTYKCHGEKEHRFGIFRDNV 65
Query: 76 EYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD 135
+I + +G NQF+DLTNDEF A YTG K P P + +
Sbjct: 66 HFIRGYKPQVTYDSAVGINQFADLTNDEFVATYTGAKPPHPKEAP--------RPVDPIW 117
Query: 136 VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG 195
P +DWR +GAVT +K+Q CG CWAFAAVAA+EG+TKIR+G L LSEQ+L+DC TN
Sbjct: 118 TPCCIDWRFRGAVTGVKDQGACGSCWAFAAVAAIEGLTKIRTGQLTPLSEQELVDCDTN- 176
Query: 196 NNGCLGGSREKAFAYIIQNQGI----------FNGVCGTQ---LDHAVTIVGF 235
+NGC GG ++AF + GI F G C +HA +I G+
Sbjct: 177 SNGCGGGHTDRAFELVASKGGITAESDYRYEGFQGKCRVDDMLFNHAASIGGY 229
>gi|194757786|ref|XP_001961143.1| GF13722 [Drosophila ananassae]
gi|190622441|gb|EDV37965.1| GF13722 [Drosophila ananassae]
Length = 417
Score = 167 bits (423), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 104/307 (33%), Positives = 147/307 (47%), Gaps = 80/307 (26%)
Query: 53 QHGRSYKDELEKEMRLKIFKENLEYIEKANK---EGNRTYKLGTNQFSDLTNDEFRALYT 109
+H ++Y DE E+ RLKIF EN I K N+ G +YKL N+++D+ + EFR L
Sbjct: 111 EHRKNYLDETEERFRLKIFNENKHKIAKHNQLWASGKVSYKLAVNKYADMLHHEFRQLMN 170
Query: 110 GYKMPSPSHRSTTSSTFK---YQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAV 166
G+ +FK + + +P S+DWRDKGAVT +K+Q CG CWAF++
Sbjct: 171 GFNYTLHKELRAADESFKGVTFISPEHVTLPKSVDWRDKGAVTGVKDQGHCGSCWAFSST 230
Query: 167 AAVEGITKIRSGNLIQLSEQQL-------------------------------------- 188
A+EG +SG L+ LSEQ L
Sbjct: 231 GALEGQHYRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGGIDTEKSYPY 290
Query: 189 --LDCSTNGNNGCLGGSREKAFAYIIQ------------------------------NQG 216
LD S + N G +G + ++ F I Q ++G
Sbjct: 291 EALDDSCHFNKGTIGAT-DRGFVDIPQGNEKKLAEAVATIGPVSVAIDASHESFQFYSEG 349
Query: 217 IF-NGVCGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIG 273
++ C Q LDH V +VGFGT E G +YWL+KNSWG TWGD G++K++R+ + CGI
Sbjct: 350 VYVEPACDAQNLDHGVLVVGFGTDESGQDYWLVKNSWGTTWGDKGFIKMLRNKDNQCGIA 409
Query: 274 TRSSYPL 280
+ SSYPL
Sbjct: 410 SASSYPL 416
>gi|18422289|ref|NP_568620.1| Granulin repeat cysteine protease family protein [Arabidopsis
thaliana]
gi|9757832|dbj|BAB08269.1| cysteine protease component of protease-inhibitor complex
[Arabidopsis thaliana]
gi|17065064|gb|AAL32686.1| cysteine protease component of protease-inhibitor complex
[Arabidopsis thaliana]
gi|21387153|gb|AAM47980.1| cysteine protease component of protease-inhibitor complex
[Arabidopsis thaliana]
gi|332007522|gb|AED94905.1| Granulin repeat cysteine protease family protein [Arabidopsis
thaliana]
Length = 463
Score = 167 bits (423), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 89/221 (40%), Positives = 136/221 (61%), Gaps = 21/221 (9%)
Query: 13 INTTPMFIIITLL-VSCA-----------SQVVSSRSTHEQSVVEIHEKWMAQHGRSYKD 60
+ +PM +++ ++ VS A + + S + V I+E WM +HG+ +
Sbjct: 4 LKLSPMILLLAMIGVSYAMDMSIISYDENHHITTETSRSDSEVERIYEAWMVEHGKKKMN 63
Query: 61 E----LEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSP 116
+ EK+ R +IFK+NL +I++ N + N +YKLG +F+DLTN+E+R++Y G K P
Sbjct: 64 QNGLGAEKDQRFEIFKDNLRFIDEHNTK-NLSYKLGLTRFADLTNEEYRSMYLGAK---P 119
Query: 117 SHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIR 176
+ R +S +YQ +P S+DWR +GAV +K+Q CG CWAF+ + AVEGI KI
Sbjct: 120 TKRVLKTSD-RYQARVGDALPDSVDWRKEGAVADVKDQGSCGSCWAFSTIGAVEGINKIV 178
Query: 177 SGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
+G+LI LSEQ+L+DC T+ N GC GG + AF +II+N GI
Sbjct: 179 TGDLISLSEQELVDCDTSYNQGCNGGLMDYAFEFIIKNGGI 219
Score = 90.5 bits (223), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 37/76 (48%), Positives = 57/76 (75%), Gaps = 5/76 (6%)
Query: 209 AYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-- 266
A+ + + G+F+G+CGT+LDH V VG+GT E+G +YW+++NSWGN WG++GY+K+ R+
Sbjct: 279 AFQLYSSGVFDGLCGTELDHGVVAVGYGT-ENGKDYWIVRNSWGNRWGESGYIKMARNIE 337
Query: 267 --EGLCGIGTRSSYPL 280
G CGI +SYP+
Sbjct: 338 APTGKCGIAMEASYPI 353
>gi|148907299|gb|ABR16787.1| unknown [Picea sitchensis]
Length = 372
Score = 167 bits (423), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 88/184 (47%), Positives = 122/184 (66%), Gaps = 11/184 (5%)
Query: 40 EQSVVEIHEKWMAQHGRSYK--DELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
++S+ +++KW QH RS + D E R +IFKEN+++I+ NK+ + YKLG N+F+
Sbjct: 38 DESLRGLYDKWALQH-RSTRSLDSDEHARRFEIFKENVKHIDSVNKK-DGPYKLGLNKFA 95
Query: 98 DLTNDEFRALYTGYKMPSPS----HRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKN 153
DL+N+EF+A++ KM R S +F YQN +P S+DWR KGAVTP+KN
Sbjct: 96 DLSNEEFKAMHMTTKMEKHKSLRGDRGVESGSFMYQNSKR--LPASIDWRKKGAVTPVKN 153
Query: 154 QKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQ 213
Q +CG CWAF+ +A+VEGI I++G L+ LSEQQL+DCS N GC GG + AF YII
Sbjct: 154 QGQCGSCWAFSTIASVEGINYIKTGKLVSLSEQQLVDCSKE-NAGCNGGLMDNAFQYIID 212
Query: 214 NQGI 217
N GI
Sbjct: 213 NGGI 216
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 34/68 (50%), Positives = 49/68 (72%), Gaps = 4/68 (5%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR----DEGLCG 271
G+F G CGT+LDH V +VG+G + +G NYW+++NSWG WG+ GY+++ R EG CG
Sbjct: 285 GVFTGKCGTELDHGVVVVGYGKSPEGINYWIVRNSWGPEWGEQGYIRMQRGIEATEGKCG 344
Query: 272 IGTRSSYP 279
I ++SYP
Sbjct: 345 ISMQASYP 352
>gi|53791858|dbj|BAD53944.1| putative cysteine protease [Oryza sativa Japonica Group]
Length = 335
Score = 167 bits (423), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 91/236 (38%), Positives = 133/236 (56%), Gaps = 24/236 (10%)
Query: 15 TTPMFIIITLLV--SCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFK 72
T+ + ++ TL+ + A+ + + + +++ E+WMA+ G++YK EKE R IF+
Sbjct: 2 TSIVLLVCTLMALQAMAASAYYNNGSDDGVTMQMFEEWMAKFGKTYKCHGEKEHRFGIFR 61
Query: 73 ENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
+N+ +I + +G NQF+DLTNDEF A YTG K P P + +
Sbjct: 62 DNVHFIRGYKPQVTYDSAVGINQFADLTNDEFVATYTGAKPPHPKEAP--------RPVD 113
Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
P +DWR +GAVT +K+Q CG CWAFAAVAA+EG+TKIR+G L LSEQ+L+DC
Sbjct: 114 PIWTPCCIDWRFRGAVTGVKDQGACGSCWAFAAVAAIEGLTKIRTGQLTPLSEQELVDCD 173
Query: 193 TNGNNGCLGGSREKAFAYIIQNQGI----------FNGVCGTQ---LDHAVTIVGF 235
TN +NGC GG ++AF + GI F G C +HA +I G+
Sbjct: 174 TN-SNGCGGGHTDRAFELVASKGGITAESDYRYEGFQGKCRVDDMLFNHAASIGGY 228
>gi|32396020|gb|AAP41847.1| senescence-associated cysteine protease [Anthurium andraeanum]
Length = 460
Score = 167 bits (422), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 87/189 (46%), Positives = 123/189 (65%), Gaps = 5/189 (2%)
Query: 32 VVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANK-EGNRTYK 90
+V+ R+ E+ V ++E W+ +G++Y EKE R +IF +NL YI+ N+ E N +Y
Sbjct: 25 IVAERT--EEEVRLLYEGWLVGNGKAYNLLGEKERRFEIFWDNLRYIDDHNRAENNHSYT 82
Query: 91 LGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT--DVPTSLDWRDKGAV 148
LG +F+DLTN+E+R+ Y G K R + + ++LS D+P +DWR+KGAV
Sbjct: 83 LGLTRFADLTNEEYRSTYLGVKPGQVRPRRANRAPGRGRDLSANGDDLPQKVDWREKGAV 142
Query: 149 TPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAF 208
PIK+Q CG CWAF+ VAAVEGI +I +G+LI LSEQ+L+DC T N GC GG + AF
Sbjct: 143 APIKDQGGCGSCWAFSTVAAVEGINQIVTGDLIVLSEQELVDCDTAYNEGCNGGLMDYAF 202
Query: 209 AYIIQNQGI 217
+II N GI
Sbjct: 203 QFIISNGGI 211
Score = 81.3 bits (199), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 38/85 (44%), Positives = 54/85 (63%), Gaps = 10/85 (11%)
Query: 201 GGSREKAFAYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGY 260
GG R ++ + GIF+G CG LDH V VG+GT E G +YW+++NSWG +WG+AGY
Sbjct: 267 GGGR----SFQLYKSGIFDGRCGIDLDHGVVAVGYGT-ESGKDYWIVRNSWGKSWGEAGY 321
Query: 261 MKIVRD-----EGLCGIGTRSSYPL 280
+++ R+ G CGI SYP+
Sbjct: 322 IRMERNLPSSSSGKCGIAIEPSYPI 346
>gi|4731374|gb|AAD28477.1|AF133839_1 papain-like cysteine protease [Sandersonia aurantiaca]
Length = 357
Score = 167 bits (422), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 86/209 (41%), Positives = 127/209 (60%), Gaps = 12/209 (5%)
Query: 17 PMFIIITLLV-SCASQVVSSRSTH-EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKEN 74
P+ +++ L S S + + E S+ ++E+W + H S +D +K+ R +FKEN
Sbjct: 6 PVLLVLALAFGSTLSIPIKEKDLESEDSLWSLYERWRSHHAVS-RDLDQKQKRFNVFKEN 64
Query: 75 LEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYK------MPSPSHRSTTSSTFKY 128
+++I + NK + T+KL N+F D+TN EFRA Y G K M H S + + F Y
Sbjct: 65 VKFIHEFNKNKDVTFKLALNKFGDMTNQEFRAKYAGSKVHHHRTMKGSRHGSGSGAKFMY 124
Query: 129 QNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQL 188
+N P S+DWR++GAV +KNQ +CG CWAF+A+AAVEGI +I + L+ LSEQ+L
Sbjct: 125 ENAV---APPSIDWRERGAVAAVKNQGQCGSCWAFSAIAAVEGINQIVTKELVPLSEQEL 181
Query: 189 LDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
+DC T+ N GC GG + AF +I N GI
Sbjct: 182 IDCDTDQNQGCSGGLMDYAFEFIKNNGGI 210
Score = 94.4 bits (233), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 39/81 (48%), Positives = 58/81 (71%), Gaps = 6/81 (7%)
Query: 206 KAFAYIIQ--NQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKI 263
+A Y+ Q ++G+F G CGT+LDH V +VG+GTT+DG YW ++NSWG WG++GY+++
Sbjct: 262 EASGYVFQFYSEGVFTGRCGTELDHGVAVVGYGTTQDGTKYWTVRNSWGADWGESGYVRM 321
Query: 264 VR----DEGLCGIGTRSSYPL 280
R GLCGI ++SYP+
Sbjct: 322 QRGIKATHGLCGIAMQASYPI 342
>gi|242092704|ref|XP_002436842.1| hypothetical protein SORBIDRAFT_10g009850 [Sorghum bicolor]
gi|241915065|gb|EER88209.1| hypothetical protein SORBIDRAFT_10g009850 [Sorghum bicolor]
Length = 296
Score = 167 bits (422), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 91/206 (44%), Positives = 124/206 (60%), Gaps = 29/206 (14%)
Query: 43 VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTND 102
+V HE+WM Q+ R YKD EK R ++FK N+++IE N GNR + LG NQF+DLTND
Sbjct: 1 MVARHEQWMVQYSRVYKDATEKAQRFEVFKSNVKFIESFNAGGNRKFWLGVNQFADLTND 60
Query: 103 EFRALYT--GYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCC 160
EFRA T G+K PSP T F+Y+N+S+ +P ++DWR KGAVTPIK+Q +C
Sbjct: 61 EFRATKTNKGFK-PSPVKVPT---GFRYENISVDALPATIDWRTKGAVTPIKDQGQC--- 113
Query: 161 WAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFAYIIQNQGI-- 217
EGI KI +G LI LSEQ+L+DC +G + GC GG + AF +II+ G+
Sbjct: 114 ---------EGIVKISTGKLISLSEQELVDCDVHGEDQGCEGGLMDDAFKFIIKKGGLTT 164
Query: 218 --------FNGVCGTQLDHAVTIVGF 235
+G C + + T+ GF
Sbjct: 165 ESSYPYTAADGKCKSGSNSVATVKGF 190
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 39/122 (31%), Positives = 62/122 (50%), Gaps = 14/122 (11%)
Query: 162 AFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIFNGV 221
+ A V E + +L++ Q + + +G + + + G+ G
Sbjct: 183 SVATVKGFEDVPANDEASLMKAVANQPVSVAVDGGD----------MTFQFYSGGVMTGS 232
Query: 222 CGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSS 277
CGT LDH + +G+G T DG YWL+KNSWG TWG+ GY+++ +D G+CG+ S
Sbjct: 233 CGTDLDHGIAAIGYGQTSDGTKYWLLKNSWGTTWGENGYLRMEKDISDKRGMCGLAMEPS 292
Query: 278 YP 279
YP
Sbjct: 293 YP 294
>gi|225446523|ref|XP_002275891.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP2 [Vitis vinifera]
Length = 358
Score = 167 bits (422), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 89/205 (43%), Positives = 124/205 (60%), Gaps = 18/205 (8%)
Query: 47 HEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRA 106
+E+W+ QHGR YK+ E + I++ N+ +I N + N ++ L NQF+D+TN+E++A
Sbjct: 45 YERWLVQHGRRYKNRDEWQRHFGIYQSNVRFINYINAQ-NFSFTLTDNQFADMTNEEYKA 103
Query: 107 LYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAV 166
LY G S ++ S+FK + + +P S+DWR GAVTP++NQ ECG CWAF+ V
Sbjct: 104 LYMGLGTSETSRKN--QSSFKRERSKV--LPISVDWRKMGAVTPVRNQGECGSCWAFSTV 159
Query: 167 AAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIF------- 218
AAVEGI KIR+G L+ LSEQ+LLDC + GN GC GG AF +I QN GI
Sbjct: 160 AAVEGINKIRTGKLVSLSEQELLDCDIDSGNEGCNGGYMVNAFKFIKQNGGITTARNYPY 219
Query: 219 ---NGVCGTQ--LDHAVTIVGFGTT 238
G+C +H V I G+ T
Sbjct: 220 IGEQGICNKDKAANHVVKISGYETV 244
Score = 95.5 bits (236), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 39/78 (50%), Positives = 58/78 (74%), Gaps = 5/78 (6%)
Query: 207 AFAYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR- 265
+ + + ++GIFNG CG QL+HAVT++G+G ++G YWL+KNSWG WG+AGY +++R
Sbjct: 269 GYEFQLYSKGIFNGFCGKQLNHAVTVIGYGE-DNGKKYWLVKNSWGTGWGEAGYARMIRD 327
Query: 266 ---DEGLCGIGTRSSYPL 280
DEG+CGI +SYP+
Sbjct: 328 SRDDEGICGIAMEASYPI 345
>gi|37780043|gb|AAP32194.1| cysteine protease 1 [Trifolium repens]
Length = 292
Score = 167 bits (422), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 93/189 (49%), Positives = 124/189 (65%), Gaps = 21/189 (11%)
Query: 63 EKEMRLKIFKENLEYIEKANKE-GNRTYKLGTNQFSDLTNDEFRALYTGYK--MPSPSHR 119
E+E RL+IF +N+ YIE +N N+ YKL N+F+DLTN+EF A +K M S R
Sbjct: 3 EREKRLRIFNKNVNYIEASNSAVNNKLYKLSINKFADLTNEEFIASRNKFKGHMCSSIIR 62
Query: 120 STTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGN 179
+TT FKY+N S +P+++DWR KGAVTP+KNQ +CG CWAF+AVAA EGI ++ +G
Sbjct: 63 TTT---FKYENASA--IPSTVDWRKKGAVTPVKNQGQCGSCWAFSAVAATEGIHQLSTGK 117
Query: 180 LIQLSEQQLLDCSTNG-NNGCLGGSREKAFAYIIQNQGI-------FNGVCGT-----QL 226
L+ LSEQ+L+DC T G + GC GG + AF +IIQN G+ + GV GT
Sbjct: 118 LVSLSEQELIDCDTKGVDQGCEGGLMDDAFKFIIQNHGLSTEVQYPYEGVDGTCNANKAS 177
Query: 227 DHAVTIVGF 235
HAVTI G+
Sbjct: 178 IHAVTITGY 186
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 40/72 (55%), Positives = 50/72 (69%), Gaps = 4/72 (5%)
Query: 214 NQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGL 269
N G+F G CGT+LDH VT VG+G DG YWL+KNSWG WG+ GY+++ R EGL
Sbjct: 221 NSGVFTGSCGTELDHGVTAVGYGVGNDGTKYWLVKNSWGADWGEEGYIRMQRGIAAAEGL 280
Query: 270 CGIGTRSSYPLA 281
CGI ++SYP A
Sbjct: 281 CGIAMQASYPTA 292
>gi|222425026|dbj|BAH20463.1| cysteine protease [Spinacia oleracea]
Length = 473
Score = 167 bits (422), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 84/191 (43%), Positives = 127/191 (66%), Gaps = 5/191 (2%)
Query: 32 VVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKL 91
++ S S + V+ I+E W+ QH ++Y EKE R IFK+NLE+I++ N + ++T+K+
Sbjct: 38 LLPSSSRSDDEVMRIYESWLVQHRKNYNALGEKEKRFAIFKDNLEFIDQHNSDDSQTFKV 97
Query: 92 GTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFK-----YQNLSMTDVPTSLDWRDKG 146
G N+F+DLTN+EFR++Y G K S S +S+ K Y ++P ++DWR G
Sbjct: 98 GLNKFADLTNEEFRSVYLGRKKSSSSSPLLSSAKSKVKSDRYLFKEGDELPEAVDWRKNG 157
Query: 147 AVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREK 206
AV +K+Q +CG CWAF+ +AAVEGI +I +G L+ LSEQ+L+DC T+ N+GC GG +
Sbjct: 158 AVAKVKDQGQCGSCWAFSTIAAVEGINQIVTGELLSLSEQELVDCDTSYNSGCDGGLMDY 217
Query: 207 AFAYIIQNQGI 217
A+ +II N GI
Sbjct: 218 AYEFIINNGGI 228
Score = 77.8 bits (190), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 32/70 (45%), Positives = 47/70 (67%), Gaps = 6/70 (8%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-----EGLC 270
G+F G CG LDH V VG+G+ +DG +YW+++NSWG WG++GY+++ R+ G C
Sbjct: 295 GVFTGKCGADLDHGVVAVGYGS-DDGKDYWIVRNSWGADWGESGYIRMERNLETVKTGKC 353
Query: 271 GIGTRSSYPL 280
GI SYP+
Sbjct: 354 GIAIEPSYPI 363
>gi|195334204|ref|XP_002033774.1| GM21500 [Drosophila sechellia]
gi|194125744|gb|EDW47787.1| GM21500 [Drosophila sechellia]
Length = 341
Score = 167 bits (422), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 105/317 (33%), Positives = 154/317 (48%), Gaps = 83/317 (26%)
Query: 46 IHEKWMA---QHGRSYKDELEKEMRLKIFKENLEYIEKANK---EGNRTYKLGTNQFSDL 99
+ E+W +H ++Y+D+ E+ RLKIF EN I K N+ EG ++KL N+++DL
Sbjct: 25 VMEEWHTFKLEHRKNYQDDTEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADL 84
Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFK---YQNLSMTDVPTSLDWRDKGAVTPIKNQKE 156
+ EFR L G+ +FK + + + +P S+DWR KGAVT +K+Q
Sbjct: 85 LHHEFRQLMNGFNYTLHKQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQGH 144
Query: 157 CGCCWAFAAVAAVEGITKIRSGNLIQLSEQ------------------------------ 186
CG CWAF++ A+EG +SG L+ LSEQ
Sbjct: 145 CGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNG 204
Query: 187 ----------QLLDCSTNGNNGCLGGSREKAFAYIIQ----------------------- 213
+ +D S + N G +G + ++ F I Q
Sbjct: 205 GIDTEKSYPYEAIDDSCHFNKGTIGAT-DRGFTDIPQGDEKKMAEAVATVGPVAVAIDAS 263
Query: 214 -------NQGIFN-GVCGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIV 264
++G++N C Q LDH V +VGFGT E G +YWL+KNSWG TWGD G++K++
Sbjct: 264 HESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESGEDYWLVKNSWGTTWGDKGFIKML 323
Query: 265 RD-EGLCGIGTRSSYPL 280
R+ E CGI + SSYPL
Sbjct: 324 RNKENQCGIASASSYPL 340
>gi|18141283|gb|AAL60579.1|AF454957_1 senescence-associated cysteine protease [Brassica oleracea]
Length = 460
Score = 167 bits (422), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 87/222 (39%), Positives = 133/222 (59%), Gaps = 18/222 (8%)
Query: 9 GSFKINTTPMFIIITLLVSCASQVVSSRSTH---------EQSVVEIHEKWMAQHGRSYK 59
GS K+ + ++I + + ++S H + V I+E WM +HG+ +
Sbjct: 2 GSVKVTILLLAMMIGVSYAADMSIISYDEKHHITAENERSDAEVARIYEAWMEKHGKKAQ 61
Query: 60 DE----LEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPS 115
EK+ R +IFK+NL +I++ N + N +YKLG +F+DLTN+E+R++Y G K
Sbjct: 62 SNGLVGEEKDQRFEIFKDNLRFIDEHNNK-NLSYKLGLTRFADLTNEEYRSIYLGAK--- 117
Query: 116 PSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKI 175
S + ++ +YQ +P S+DWR +GAV +K+Q CG CWAF+ + AVEGI KI
Sbjct: 118 -SKKRVLKTSDRYQPRVGDAIPDSVDWRKEGAVAAVKDQGSCGSCWAFSTIGAVEGINKI 176
Query: 176 RSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
+G+LI LSEQ+L+DC T+ N GC GG + AF +II+N GI
Sbjct: 177 VTGDLISLSEQELVDCDTSYNQGCNGGLMDYAFEFIIKNGGI 218
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 36/76 (47%), Positives = 57/76 (75%), Gaps = 5/76 (6%)
Query: 209 AYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-- 266
A+ + + G+F+G+CGT+LDH V VG+GT E+G +YW+++NSWG +WG++GY+K+ R+
Sbjct: 278 AFQLYSSGVFDGICGTELDHGVVAVGYGT-ENGKDYWIVRNSWGGSWGESGYIKMARNIA 336
Query: 267 --EGLCGIGTRSSYPL 280
G CGI +SYP+
Sbjct: 337 EPTGKCGIAMEASYPI 352
>gi|326430490|gb|EGD76060.1| cysteine proteinase [Salpingoeca sp. ATCC 50818]
Length = 448
Score = 166 bits (421), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 106/308 (34%), Positives = 151/308 (49%), Gaps = 80/308 (25%)
Query: 46 IHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNR---TYKLGTNQFSDLTND 102
+ + + + + Y+ E+ R +F +N+++I + N E R T+ + NQF+DLTN+
Sbjct: 29 LFDAFKTKFNKVYESAEEEARRFSVFSQNIDFINRHNAEAARGVHTHTVDVNQFADLTNE 88
Query: 103 EFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPT--SLDWRDKGAVTPIKNQKECGCC 160
E+R LY P P T + + D P S+DWR KGAVTPIKNQ +CG C
Sbjct: 89 EYRQLYL---RPYP-----TELLGRERQEVWLDGPNAGSVDWRQKGAVTPIKNQGQCGSC 140
Query: 161 WAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGI-- 217
W+F+ +VEG I +GNL+ LSEQQL+DCS + GN GC GG + AF YII N G+
Sbjct: 141 WSFSTTGSVEGAHAIATGNLVSLSEQQLVDCSGSFGNQGCNGGLMDNAFKYIISNGGLDT 200
Query: 218 --------FNGVC--GTQLDHAVTIVGF-------------------------------- 235
+GVC + HAV+I G+
Sbjct: 201 EQDYPYTARDGVCDKSKESKHAVSISGYKDVPQNNEDQLAAAVEKGPVSVAIEADQQSFQ 260
Query: 236 -----------GTTEDG--------ANYWLIKNSWGNTWGDAGYMKIVR---DEGLCGIG 273
GT D ++YW++KNSWG +WGD GY+ + R G+CGI
Sbjct: 261 MYSSGVFSGPCGTNLDHGVLVVGYTSDYWIVKNSWGASWGDQGYIMMKRGVSSAGICGIA 320
Query: 274 TRSSYPLA 281
+ SYP+A
Sbjct: 321 MQPSYPIA 328
>gi|195583187|ref|XP_002081405.1| GD10995 [Drosophila simulans]
gi|194193414|gb|EDX06990.1| GD10995 [Drosophila simulans]
Length = 341
Score = 166 bits (421), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 105/317 (33%), Positives = 154/317 (48%), Gaps = 83/317 (26%)
Query: 46 IHEKWMA---QHGRSYKDELEKEMRLKIFKENLEYIEKANK---EGNRTYKLGTNQFSDL 99
+ E+W +H ++Y+D+ E+ RLKIF EN I K N+ EG ++KL N+++DL
Sbjct: 25 VMEEWHTFKLEHRKNYQDDTEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADL 84
Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFK---YQNLSMTDVPTSLDWRDKGAVTPIKNQKE 156
+ EFR L G+ +FK + + + +P S+DWR KGAVT +K+Q
Sbjct: 85 LHHEFRQLMNGFNYTLHKQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQGH 144
Query: 157 CGCCWAFAAVAAVEGITKIRSGNLIQLSEQ------------------------------ 186
CG CWAF++ A+EG +SG L+ LSEQ
Sbjct: 145 CGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNG 204
Query: 187 ----------QLLDCSTNGNNGCLGGSREKAFAYIIQ----------------------- 213
+ +D S + N G +G + ++ F I Q
Sbjct: 205 GIDTEKSYPYEAIDDSCHFNKGTIGAT-DRGFTDIPQGDEKKMAEAVATVGPVSVAIDAS 263
Query: 214 -------NQGIFN-GVCGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIV 264
++G++N C Q LDH V +VGFGT E G +YWL+KNSWG TWGD G++K++
Sbjct: 264 HESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESGDDYWLVKNSWGTTWGDKGFIKML 323
Query: 265 RD-EGLCGIGTRSSYPL 280
R+ E CGI + SSYPL
Sbjct: 324 RNKENQCGIASASSYPL 340
>gi|302143380|emb|CBI21941.3| unnamed protein product [Vitis vinifera]
Length = 354
Score = 166 bits (421), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 89/205 (43%), Positives = 124/205 (60%), Gaps = 18/205 (8%)
Query: 47 HEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRA 106
+E+W+ QHGR YK+ E + I++ N+ +I N + N ++ L NQF+D+TN+E++A
Sbjct: 41 YERWLVQHGRRYKNRDEWQRHFGIYQSNVRFINYINAQ-NFSFTLTDNQFADMTNEEYKA 99
Query: 107 LYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAV 166
LY G S ++ S+FK + + +P S+DWR GAVTP++NQ ECG CWAF+ V
Sbjct: 100 LYMGLGTSETSRKN--QSSFKRERSKV--LPISVDWRKMGAVTPVRNQGECGSCWAFSTV 155
Query: 167 AAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIF------- 218
AAVEGI KIR+G L+ LSEQ+LLDC + GN GC GG AF +I QN GI
Sbjct: 156 AAVEGINKIRTGKLVSLSEQELLDCDIDSGNEGCNGGYMVNAFKFIKQNGGITTARNYPY 215
Query: 219 ---NGVCGTQ--LDHAVTIVGFGTT 238
G+C +H V I G+ T
Sbjct: 216 IGEQGICNKDKAANHVVKISGYETV 240
Score = 95.1 bits (235), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 39/78 (50%), Positives = 58/78 (74%), Gaps = 5/78 (6%)
Query: 207 AFAYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR- 265
+ + + ++GIFNG CG QL+HAVT++G+G ++G YWL+KNSWG WG+AGY +++R
Sbjct: 265 GYEFQLYSKGIFNGFCGKQLNHAVTVIGYGE-DNGKKYWLVKNSWGTGWGEAGYARMIRD 323
Query: 266 ---DEGLCGIGTRSSYPL 280
DEG+CGI +SYP+
Sbjct: 324 SRDDEGICGIAMEASYPI 341
>gi|255078398|ref|XP_002502779.1| cysteine endopeptidase [Micromonas sp. RCC299]
gi|226518045|gb|ACO64037.1| cysteine endopeptidase [Micromonas sp. RCC299]
Length = 414
Score = 166 bits (421), Expect = 9e-39, Method: Compositional matrix adjust.
Identities = 80/180 (44%), Positives = 119/180 (66%), Gaps = 8/180 (4%)
Query: 42 SVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSD 98
S+ ++ +W +HG++Y E EKE+RLKIF +N E+++K N E G T+ +G N +D
Sbjct: 63 SLSDLFHEWTQKHGKTYDSEEEKELRLKIFADNHEFVQKHNAEYENGEHTHFVGLNHLAD 122
Query: 99 LTNDEFRALYTGYKMPSPSHRSTT-SSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKEC 157
LT DEF+ + GY + R+ +ST++Y +++ P +DW GAVTP+KNQK+C
Sbjct: 123 LTKDEFKKML-GYNAALRASRAPVDASTWEYADVT---PPEEIDWVASGAVTPVKNQKQC 178
Query: 158 GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
G CWAF+ AVEG+ I++G LI LSE++L+ CSTNGN GC GG + F +I+ N+GI
Sbjct: 179 GSCWAFSTTGAVEGVNAIKTGKLISLSEEELISCSTNGNMGCNGGLMDNGFEWIVNNRGI 238
Score = 67.0 bits (162), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 38/140 (27%), Positives = 69/140 (49%), Gaps = 21/140 (15%)
Query: 151 IKNQKECGCCWAFAAVAAVEGITKIRSGN---LIQLSEQQLLDCSTNGNNGCLGGSREKA 207
+ +++CG A++G + S + L++ QQ + + ++
Sbjct: 247 VAKEEKCGFFRRHHRAVAIDGFKDVPSNDEDSLMKAVSQQPVSVAIEADHQ--------- 297
Query: 208 FAYIIQNQGIFNGV-CGTQLDHAVTIVGFGTTEDGA---NYWLIKNSWGNTWGDAGYMKI 263
++ + G+++ CGT+LDH V +VG+G ++W IKNSWG WG+ GY++I
Sbjct: 298 -SFQLYAGGVYSAKDCGTELDHGVLLVGYGVDPKSTKHKHFWKIKNSWGPAWGEDGYIRI 356
Query: 264 VRD----EGLCGIGTRSSYP 279
+ EG CG+ + SYP
Sbjct: 357 AKGGSGVEGQCGVAMQPSYP 376
>gi|374713651|gb|AEZ65083.1| cysteine protease [Carica papaya]
Length = 467
Score = 166 bits (420), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 86/212 (40%), Positives = 131/212 (61%), Gaps = 14/212 (6%)
Query: 18 MFIIITLLVSCASQ--VVSSRSTH--------EQSVVEIHEKWMAQHGRSYKDELEKEMR 67
+F+++ S A +VS H + V+ ++E W+ +HG++Y EKE R
Sbjct: 10 LFLLMIFTASSAVDMSIVSYDQRHADKSSWRTDDEVMAMYEAWLVKHGKAYNALGEKEKR 69
Query: 68 LKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSH--RSTTSST 125
IFK+NL +I++ N + N TY+LG N+F+DLTN+E+R++Y G K P + R + +
Sbjct: 70 FGIFKDNLRFIDEHNSQ-NLTYRLGLNRFADLTNEEYRSMYLGVK-PGATRVTRKVSRKS 127
Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
++ +P +DWR +GAV +K+Q CG CWAF+ +AAVEGI +I +G+LI LSE
Sbjct: 128 DRFAARVGDALPDFIDWRKEGAVVGVKDQGSCGSCWAFSTIAAVEGINQIVTGDLISLSE 187
Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
Q+L+DC T+ N GC GG + AF +II N GI
Sbjct: 188 QELVDCDTSYNEGCNGGLMDYAFEFIINNGGI 219
Score = 78.6 bits (192), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 35/77 (45%), Positives = 49/77 (63%), Gaps = 6/77 (7%)
Query: 209 AYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-- 266
A+ + G+F G CGT LDH V VG+GT E+G +YW++ NSWG WG+ GY+++ R+
Sbjct: 279 AFQLYQSGVFTGKCGTSLDHGVAAVGYGT-ENGQDYWIVGNSWGKNWGEDGYIRMERNLA 337
Query: 267 ---EGLCGIGTRSSYPL 280
G CGI SYP+
Sbjct: 338 GSSSGKCGIAIGPSYPI 354
>gi|18141281|gb|AAL60578.1|AF454956_1 senescence-associated cysteine protease [Brassica oleracea]
Length = 445
Score = 166 bits (420), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 80/186 (43%), Positives = 124/186 (66%), Gaps = 4/186 (2%)
Query: 33 VSSRSTHEQSV-VEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKL 91
V++++ H V++ E+W+ ++ ++Y EK+ R +IF +NL+++++ N N++Y+L
Sbjct: 22 VTAKADHRNPEEVKMFERWLVENHKNYNGLGEKDKRFEIFMDNLKFVQEHNSVPNQSYEL 81
Query: 92 GTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPI 151
G +F+DLTN+EFRA+Y KM R + S N+ +P +DWR KGAV P+
Sbjct: 82 GLTRFADLTNEEFRAIYLRSKMERT--RDSVKSERYLHNVG-DKLPDEVDWRAKGAVVPV 138
Query: 152 KNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYI 211
K+Q CG CWAF+A+ AVEGI +I++G L+ LSEQ+L+DC T+ NNGC GG + AF +I
Sbjct: 139 KDQGSCGSCWAFSAIGAVEGINQIKTGELVSLSEQELVDCDTSYNNGCGGGLMDYAFQFI 198
Query: 212 IQNQGI 217
I N GI
Sbjct: 199 ISNGGI 204
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 35/74 (47%), Positives = 51/74 (68%), Gaps = 5/74 (6%)
Query: 210 YIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD--- 266
+ + G+F G CGT LDH V VG+GT+E G +YW+I+NSWG+ WG++GY+K+ R+
Sbjct: 265 FQLYKSGVFTGTCGTALDHGVVAVGYGTSE-GQDYWIIRNSWGSNWGESGYIKLQRNIKD 323
Query: 267 -EGLCGIGTRSSYP 279
G CG+ +SYP
Sbjct: 324 SSGKCGVAMMASYP 337
>gi|162459393|ref|NP_001105993.1| cysteine protease component of protease-inhibitor complex precursor
[Zea mays]
gi|6682829|dbj|BAA88898.1| cysteine protease component of protease-inhibitor complex [Zea
mays]
Length = 465
Score = 166 bits (419), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 82/189 (43%), Positives = 118/189 (62%), Gaps = 6/189 (3%)
Query: 32 VVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN---KEGNRT 88
+VS ++ ++ +WMA HGR+Y E+E R ++F++NL YI+ N G +
Sbjct: 29 IVSYGERSDEEARRMYAEWMAAHGRTYNAVGEEERRYQVFRDNLRYIDAHNAAADAGVHS 88
Query: 89 YKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAV 148
++LG N+F+DLTNDE+RA Y G + R + +Y D+P S+DWR KGAV
Sbjct: 89 FRLGLNRFADLTNDEYRATYLGARTRPQRERKLGA---RYHAADNEDLPESVDWRAKGAV 145
Query: 149 TPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAF 208
+K+Q G CWAF+ +AAVEGI +I +G+LI LSEQ+L+DC T+ N GC GG + AF
Sbjct: 146 AEVKDQGSYGSCWAFSTIAAVEGINQIVTGDLISLSEQELVDCDTSYNQGCNGGLMDYAF 205
Query: 209 AYIIQNQGI 217
+II N GI
Sbjct: 206 EFIINNGGI 214
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 38/75 (50%), Positives = 54/75 (72%), Gaps = 5/75 (6%)
Query: 210 YIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD--- 266
+ + + GIF G CGT LDH VT VG+GT E+G +YW++KNSWG++WG++GY+++ R+
Sbjct: 275 FQLYSSGIFTGSCGTALDHGVTAVGYGT-ENGKDYWIVKNSWGSSWGESGYVRMERNIKA 333
Query: 267 -EGLCGIGTRSSYPL 280
G CGI SYPL
Sbjct: 334 SSGKCGIAVEPSYPL 348
>gi|302759380|ref|XP_002963113.1| hypothetical protein SELMODRAFT_270344 [Selaginella moellendorffii]
gi|300169974|gb|EFJ36576.1| hypothetical protein SELMODRAFT_270344 [Selaginella moellendorffii]
Length = 479
Score = 166 bits (419), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 87/211 (41%), Positives = 127/211 (60%), Gaps = 17/211 (8%)
Query: 38 THEQSVVEIHEKWMAQHGRSYKDEL--------EKEMRLKIFKENLEYIEKANKEGNRTY 89
+ E+ + + + WM QHG+SY + EK R IFK+NL +I N E N+ Y
Sbjct: 48 SSEERLQALFDSWMLQHGKSYAENALSGDSQAGEKATRYGIFKDNLRFIHGEN-EKNQGY 106
Query: 90 KLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVT 149
LG N F+DLTN+EFRA G + R T+ F+Y ++ + D+P S+DWR+KGAV
Sbjct: 107 FLGLNAFADLTNEEFRAQRHGGRFDRSRER-TSYEEFRYGSVQLKDLPDSIDWREKGAVV 165
Query: 150 PIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFA 209
+K+Q CG CWAF+AVAA+EG+ K+ +G L+ LSEQ+L+DC + GC GG + AF
Sbjct: 166 GVKDQGSCGSCWAFSAVAAIEGVNKLATGELVSLSEQELVDCDKGEDEGCNGGLMDYAFG 225
Query: 210 YIIQNQGIFNGVCGTQLDHAVTIVGFGTTED 240
++I+N G+ T+ D+ G+GT D
Sbjct: 226 FVIKNGGL-----DTEADYPYK--GYGTRCD 249
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 40/68 (58%), Positives = 48/68 (70%), Gaps = 5/68 (7%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCG 271
GIF G CGT LDH VT VG+G EDG YW+IKNSWG+ WG+ GY+K+ R+ GLCG
Sbjct: 300 GIFTGRCGTDLDHGVTNVGYGK-EDGKAYWIIKNSWGSNWGEKGYIKMARNTGLAAGLCG 358
Query: 272 IGTRSSYP 279
I +SYP
Sbjct: 359 INMEASYP 366
>gi|348500228|ref|XP_003437675.1| PREDICTED: pro-cathepsin H-like [Oreochromis niloticus]
Length = 276
Score = 166 bits (419), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 98/257 (38%), Positives = 138/257 (53%), Gaps = 36/257 (14%)
Query: 48 EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRAL 107
++W+++H + Y E E RL +FK+N +E+ N GN ++ +G NQFSD+T +EF+
Sbjct: 31 KQWISEHNKVYGTE-EYHHRLHVFKQNKRTVEQHNA-GNHSFTMGLNQFSDMTFEEFKKF 88
Query: 108 YTGYKMPSPSHRSTTSSTFKYQNLSMT-DVPTSLDWRDKGA-VTPIKNQKECGCCWAFAA 165
Y + PS T S K ++ T P +DWR KG VTP+K+Q CG CW F+
Sbjct: 89 YL-FTQPS------TCSVIKGSHVKRTGPYPEFVDWRMKGDFVTPVKDQGFCGSCWTFST 141
Query: 166 VAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQ------------ 213
+E + I +G LIQLSEQQLLDCS N N + EKA +
Sbjct: 142 TGCLESVNAIATGKLIQLSEQQLLDCSRNFN---VVKYDEKAMVDAVARLNPITSCFDVT 198
Query: 214 ------NQGIFNGV-CGTQLD---HAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKI 263
+G+++ C D HAV VG+GT ++G YW++KNSWG WG GY I
Sbjct: 199 AEFKHYKEGVYSSTQCKNTTDKVNHAVLAVGYGTEKNGVPYWIVKNSWGTAWGKDGYFLI 258
Query: 264 VRDEGLCGIGTRSSYPL 280
R + +CG+ SSYPL
Sbjct: 259 ERGKNMCGLAACSSYPL 275
>gi|146215988|gb|ABQ10196.1| actinidin Act3a [Actinidia eriantha]
Length = 380
Score = 166 bits (419), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 86/206 (41%), Positives = 129/206 (62%), Gaps = 6/206 (2%)
Query: 13 INTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFK 72
I+ + +F L+ S A S V+ ++E W+ ++G+SY E+EMR++IFK
Sbjct: 8 ISMSLLFFSTFLIFSFAIDAKISPLRTNDEVMALYESWLVKYGKSYNSLGEREMRIEIFK 67
Query: 73 ENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
ENL +I++ N + NR+Y +G NQF+DLT++E+R+ Y G+K S +S S+ + Q
Sbjct: 68 ENLRFIDEHNADPNRSYTVGLNQFADLTDEEYRSTYLGFKS---SLKSKVSNRYMPQVGE 124
Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
+ +P +DWR GAV +KNQ C CWAFA +A VE I +I +G+LI LSEQ+L+DC+
Sbjct: 125 V--LPDYVDWRTTGAVVDVKNQGLCSSCWAFATIATVESINQIITGDLISLSEQELVDCN 182
Query: 193 -TNGNNGCLGGSREKAFAYIIQNQGI 217
T N GC GG + A+ +II N GI
Sbjct: 183 RTPINEGCKGGFMDDAYEFIINNGGI 208
Score = 77.8 bits (190), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 37/69 (53%), Positives = 51/69 (73%), Gaps = 5/69 (7%)
Query: 216 GIFNG-VCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD---EGLCG 271
GIF G CGT L+HAVTI+G+GT E+G +YW++KNS+G WG++GY K+ R+ EG CG
Sbjct: 275 GIFTGGSCGTTLNHAVTIIGYGT-ENGIDYWIVKNSYGTQWGESGYGKVQRNVGGEGRCG 333
Query: 272 IGTRSSYPL 280
I + YP+
Sbjct: 334 IASYPFYPV 342
>gi|389986405|gb|AFL46179.1| INT9 protein, partial [Solanum lycopersicum]
Length = 189
Score = 166 bits (419), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 85/187 (45%), Positives = 117/187 (62%), Gaps = 13/187 (6%)
Query: 61 ELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRS 120
E+EKE R K FKEN+E+IE NK G + YKL N+++DLT +EF + G S +
Sbjct: 2 EIEKEHRFKTFKENVEFIESFNKNGTQRYKLAINKYADLTTEEFTTSFMGLDTSLLSQQE 61
Query: 121 TTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNL 180
+T++T ++ S+T+VP S+DWR +G+VT +K+Q CGCCWAF+A AA+EG +I + L
Sbjct: 62 STATTTSFKYDSVTEVPNSMDWRKRGSVTGVKDQGVCGCCWAFSAAAAIEGAYQIANNEL 121
Query: 181 IQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQG------------IFNGVCGTQLDH 228
I LSEQQLLDCST N GC GG A+ +++QN G VC T+
Sbjct: 122 ISLSEQQLLDCSTQ-NKGCEGGLMTVAYDFLLQNNGGGITTETNYPYEEAQNVCKTEQPA 180
Query: 229 AVTIVGF 235
AVTI G+
Sbjct: 181 AVTINGY 187
>gi|357465603|ref|XP_003603086.1| Cysteine proteinase [Medicago truncatula]
gi|355492134|gb|AES73337.1| Cysteine proteinase [Medicago truncatula]
Length = 474
Score = 165 bits (418), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 92/223 (41%), Positives = 134/223 (60%), Gaps = 21/223 (9%)
Query: 14 NTTPMFIII------TLLVSCASQVVSSRSTH--------EQSVVEIHEKWMAQHGR--S 57
N +PM +I+ T + ++S TH ++ V I+E+W +HG+ +
Sbjct: 6 NRSPMLVILIVFTLFTATFALDMSIISYDKTHSDKSSRRSDKEVKNIYEEWRVKHGKLNN 65
Query: 58 YKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPS 117
D EK+ R +IFK+NL++I++ N E NRTYK+G N+F+DL+N+E+R+ Y G K+
Sbjct: 66 NIDGSEKDKRFEIFKDNLKFIDEHNAE-NRTYKVGLNRFADLSNEEYRSRYLGTKIDPIG 124
Query: 118 H---RSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITK 174
R+ T S +Y +P S+DWR +GAV +K+Q CG CWAF+ +AAVEGI K
Sbjct: 125 MMMARTKTRSN-RYAPSVGDKLPKSVDWRSQGAVVQVKDQGSCGSCWAFSTIAAVEGINK 183
Query: 175 IRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
I +G L+ LSEQ+L+DC N GC GG E AF +II N GI
Sbjct: 184 IVTGELVSLSEQELVDCDRTVNAGCDGGLMEYAFEFIINNGGI 226
Score = 84.0 bits (206), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 37/70 (52%), Positives = 52/70 (74%), Gaps = 6/70 (8%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-----EGLC 270
GIF G CGT LDH VT VG+GT E+G +YW+++NSWG +WG++GY+++ R+ G C
Sbjct: 293 GIFTGKCGTALDHGVTAVGYGT-ENGVDYWIVRNSWGKSWGESGYVRMERNLAASVAGKC 351
Query: 271 GIGTRSSYPL 280
GI +SSYP+
Sbjct: 352 GIVMQSSYPI 361
>gi|357143305|ref|XP_003572875.1| PREDICTED: xylem cysteine proteinase 1-like [Brachypodium
distachyon]
Length = 473
Score = 165 bits (418), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 83/177 (46%), Positives = 118/177 (66%), Gaps = 8/177 (4%)
Query: 43 VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTND 102
+V++ W +H + Y EK R ++FK+NL++I + N+ N +Y LG NQF+D+ ++
Sbjct: 44 LVDLFSSWSVKHSKIYVSPEEKVKRYEVFKQNLKHIVETNRR-NGSYWLGLNQFADVAHE 102
Query: 103 EFRALYTGYK--MPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCC 160
EF++ Y G K M P+ T F+Y+N ++P S+DWR KGAVTP+KNQ ECG C
Sbjct: 103 EFKSTYLGLKTGMDGPARAPTA---FRYEN--SVNLPWSVDWRKKGAVTPVKNQGECGSC 157
Query: 161 WAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
WAF+ VAAVEGI +I +G L LSEQ+L+DC T ++GC GG + AFAYI+ N GI
Sbjct: 158 WAFSTVAAVEGINQIATGKLESLSEQELMDCDTTFDHGCGGGFMDFAFAYIMGNLGI 214
Score = 80.9 bits (198), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 39/86 (45%), Positives = 57/86 (66%), Gaps = 9/86 (10%)
Query: 198 GCLGGSREKAFAYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGD 257
G GS++ F +G+F G CGT+LDHA+T VG+G++ DG +Y ++KNSWG +WG+
Sbjct: 267 GIAAGSKDFQF----YKRGVFEGSCGTELDHALTAVGYGSS-DGQDYIIMKNSWGKSWGE 321
Query: 258 AGYMKIVR----DEGLCGIGTRSSYP 279
GY +I R EG+C I + +SYP
Sbjct: 322 QGYFRIKRGTGKPEGVCSIYSMASYP 347
>gi|356515080|ref|XP_003526229.1| PREDICTED: vignain-like [Glycine max]
Length = 284
Score = 165 bits (418), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 89/180 (49%), Positives = 119/180 (66%), Gaps = 22/180 (12%)
Query: 72 KENLEYIEKANKEGNRTYKLGTNQFSDLTNDEF---RALYTGYKMPSPSHRSTTSSTFKY 128
KEN+ YIE N N+ YKLG NQF+DLT++EF R + G+ S +T ++TFKY
Sbjct: 5 KENVNYIEAFNNAANKPYKLGINQFADLTSEEFIVPRNRFNGHMRFS----NTRTTTFKY 60
Query: 129 QNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQL 188
+N+++ +P S+DWR KGAVTPIKNQ CGCCWAF+A+AA EGI KI +G L+ LSEQ++
Sbjct: 61 ENVTV--LPDSIDWRQKGAVTPIKNQGSCGCCWAFSAIAATEGIHKISTGKLVSLSEQEV 118
Query: 189 LDCSTNG-NNGCLGGSREKAFAYIIQNQGI-------FNGVCGT-----QLDHAVTIVGF 235
+DC T G ++GC GG + AF +IIQN GI + GV G + HA TI G+
Sbjct: 119 VDCDTKGTDHGCEGGYMDGAFKFIIQNHGINTEASYPYKGVDGKCNIKEEAVHATTITGY 178
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 38/70 (54%), Positives = 46/70 (65%), Gaps = 4/70 (5%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCG 271
GIF G CGT+LDH VT VG+G +G YWL+KNSWG WG+ GY + R EG+CG
Sbjct: 215 GIFTGSCGTELDHGVTAVGYGENNEGTKYWLVKNSWGTEWGEEGYTMMQRGVKAVEGICG 274
Query: 272 IGTRSSYPLA 281
I +SYP A
Sbjct: 275 IAMLASYPTA 284
>gi|162463464|ref|NP_001104879.1| cysteine proteinase Mir3 precursor [Zea mays]
gi|2425066|gb|AAB88263.1| cysteine proteinase Mir3 [Zea mays]
Length = 480
Score = 165 bits (417), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 81/189 (42%), Positives = 117/189 (61%), Gaps = 6/189 (3%)
Query: 32 VVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN---KEGNRT 88
+VS ++ ++ +WMA HGR+Y +E R ++F++NL YI+ N G +
Sbjct: 29 IVSYGERTDEEARRMYAEWMAAHGRTYNAVGAEERRYQVFRDNLRYIDAHNAAADAGVHS 88
Query: 89 YKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAV 148
++LG N+F+DLTNDE+ A Y G + R + +Y D+P S+DWR KGAV
Sbjct: 89 FRLGLNRFADLTNDEYPATYLGARTRPQRDRKLGA---RYHAADNEDLPESVDWRAKGAV 145
Query: 149 TPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAF 208
+K+Q CG CWAF+ +AAVEGI +I +G+LI LSEQ+L+DC T+ N GC GG + AF
Sbjct: 146 AEVKDQGSCGTCWAFSTIAAVEGINQIVTGDLISLSEQELVDCDTSYNQGCNGGLMDYAF 205
Query: 209 AYIIQNQGI 217
+II N GI
Sbjct: 206 EFIINNGGI 214
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 39/76 (51%), Positives = 56/76 (73%), Gaps = 5/76 (6%)
Query: 209 AYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-- 266
A+ + + GIF G CGT+LDH VT VG+GT E+G +YW++KNSWG++WG++GY+++ R+
Sbjct: 274 AFQLYSSGIFTGSCGTRLDHGVTAVGYGT-ENGKDYWIVKNSWGSSWGESGYVRMERNIK 332
Query: 267 --EGLCGIGTRSSYPL 280
G CGI SYPL
Sbjct: 333 ASSGKCGIAVEPSYPL 348
>gi|15290195|dbj|BAB63884.1| putative cysteine protease [Oryza sativa Japonica Group]
gi|125525813|gb|EAY73927.1| hypothetical protein OsI_01811 [Oryza sativa Indica Group]
Length = 342
Score = 165 bits (417), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 89/233 (38%), Positives = 129/233 (55%), Gaps = 24/233 (10%)
Query: 18 MFIIITLLV--SCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENL 75
+ ++ TL+ + + + + + +++ E+WMA+ G++YK EKE R IF++N+
Sbjct: 12 LLVVCTLMALQAMGADAYYNNGSDDGVTMQMFEEWMAKFGKTYKCHGEKEHRFGIFRDNV 71
Query: 76 EYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD 135
+I + +G NQF+DLTNDEF A YTG K P P + +
Sbjct: 72 HFIRGYKPQVTYDSAVGINQFADLTNDEFVATYTGAKPPHPKEAP--------RPVDPIW 123
Query: 136 VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG 195
P +DWR +GAVT +K+Q CG CWAFAAVAA+EG+TKIR+G L LSEQ+L+DC TN
Sbjct: 124 TPCCIDWRFRGAVTGVKDQGACGSCWAFAAVAAIEGLTKIRTGQLTPLSEQELVDCDTN- 182
Query: 196 NNGCLGGSREKAFAYIIQNQGI----------FNGVCGTQ---LDHAVTIVGF 235
+NGC GG ++AF + GI F G C +HA I G+
Sbjct: 183 SNGCGGGHTDRAFELVASKGGITAESDYRYEGFQGKCRVDDMLFNHAARIGGY 235
>gi|125533982|gb|EAY80530.1| hypothetical protein OsI_35710 [Oryza sativa Indica Group]
Length = 378
Score = 165 bits (417), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 80/194 (41%), Positives = 120/194 (61%), Gaps = 16/194 (8%)
Query: 38 THEQSVVEIHEKWMAQH--------GRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTY 89
+ E+S+ ++E+W +++ G D+ E R +F EN YI +AN+ G R +
Sbjct: 33 SSEESLRALYERWRSRYTVSRPAASGGVGNDDGEARRRFNVFVENARYIHEANRRGGRPF 92
Query: 90 KLGTNQFSDLTNDEFRALYTGYKMPSPSHRS------TTSSTFKYQNLSMTDVPTSLDWR 143
+L N+F+D+T DEFR Y G + + HRS +F+Y ++P ++DWR
Sbjct: 93 RLALNKFADMTTDEFRRTYAGSR--ARHHRSLRGGRGGEGGSFRYGGDDEDNLPPAVDWR 150
Query: 144 DKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGS 203
++GAVT IK+Q +CG CWAF+AVAAVEG+ KI++G L+ LSEQ+L+DC T N GC GG
Sbjct: 151 ERGAVTGIKDQGQCGSCWAFSAVAAVEGVNKIKTGRLVTLSEQELVDCDTGDNQGCDGGL 210
Query: 204 REKAFAYIIQNQGI 217
+ AF +I +N GI
Sbjct: 211 MDYAFQFIKRNGGI 224
Score = 87.8 bits (216), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 36/72 (50%), Positives = 48/72 (66%), Gaps = 5/72 (6%)
Query: 214 NQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR-----DEG 268
++G+F G CGT LDH V VG+G T DG YW++KNSWG WG+ GY+++ R G
Sbjct: 289 SEGVFTGECGTDLDHGVAAVGYGITRDGTKYWIVKNSWGEDWGERGYIRMQRGVSSDSNG 348
Query: 269 LCGIGTRSSYPL 280
LCGI +SYP+
Sbjct: 349 LCGIAMEASYPV 360
>gi|109119897|dbj|BAE96008.1| cysteine proteinase [Triticum aestivum]
Length = 377
Score = 165 bits (417), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 78/183 (42%), Positives = 116/183 (63%), Gaps = 7/183 (3%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
E+++ +++E+W H R + EK R FK N+ +I NK G+R Y+L N+F D+
Sbjct: 39 EEALWDLYERWQTAH-RVPRHHAEKHRRFGTFKSNVHFIHSHNKRGDRPYRLRLNRFGDM 97
Query: 100 TNDEFRALYTGYKMPSPSHRSTTSST-----FKYQNLSMTDVPTSLDWRDKGAVTPIKNQ 154
+ EFRA + G ++ S R ++ F Y ++++D+P S+DWR KGAVT +KNQ
Sbjct: 98 SQAEFRATFAGSRV-SDRRRDGPATPPSVPGFMYAAVNVSDLPRSVDWRQKGAVTGVKNQ 156
Query: 155 KECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQN 214
+CG CWAF+ V +VEGI IR+G L+ LSEQ+L+DC T N+GC GG + AF YI +N
Sbjct: 157 GKCGSCWAFSTVVSVEGINAIRTGKLVSLSEQELIDCDTADNDGCEGGLMDNAFEYIKKN 216
Query: 215 QGI 217
G+
Sbjct: 217 GGL 219
Score = 91.7 bits (226), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 36/76 (47%), Positives = 53/76 (69%), Gaps = 4/76 (5%)
Query: 209 AYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE- 267
A++ ++G+F G CGT+LDH V +VG+G EDG YW +KNSWG +WG+ GY+++ +D
Sbjct: 282 AFMFYSEGVFTGECGTELDHGVAVVGYGVAEDGKAYWTVKNSWGPSWGEKGYIRVEKDSG 341
Query: 268 ---GLCGIGTRSSYPL 280
GLCGI +SY +
Sbjct: 342 AEGGLCGIAMEASYAV 357
>gi|125525812|gb|EAY73926.1| hypothetical protein OsI_01810 [Oryza sativa Indica Group]
Length = 319
Score = 164 bits (416), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 88/220 (40%), Positives = 124/220 (56%), Gaps = 22/220 (10%)
Query: 29 ASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRT 88
A+ + + + +++ E+WMA+ G++YK EKE R IF++N+ +I +
Sbjct: 2 AASAYYNNGSDDGVTMQMFEEWMAKFGKTYKCHGEKEHRFGIFRDNVHFIRGYKPQVTYD 61
Query: 89 YKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAV 148
+G NQF+DLTNDEF A YTG K P P + + P +DWR +GAV
Sbjct: 62 SAVGINQFADLTNDEFVATYTGAKPPHPKEAP--------RPVDPIWTPCCIDWRFRGAV 113
Query: 149 TPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAF 208
T +K+Q CG CWAFAAVAA+EG+TKIR+G L LSEQ+L+DC TN +NGC GG ++AF
Sbjct: 114 TGVKDQGACGSCWAFAAVAAIEGLTKIRTGQLTPLSEQELVDCDTN-SNGCGGGHTDRAF 172
Query: 209 AYIIQNQGI----------FNGVCGTQ---LDHAVTIVGF 235
+ GI F G C +HA +I G+
Sbjct: 173 ELVASKGGITAESDYRYEGFQGKCRVDDMLFNHAASIGGY 212
>gi|334185815|ref|NP_680113.3| putative cysteine proteinase [Arabidopsis thaliana]
gi|75313879|sp|Q9STL4.1|CEP2_ARATH RecName: Full=KDEL-tailed cysteine endopeptidase CEP2; Flags:
Precursor
gi|4678354|emb|CAB41164.1| cysteine endopeptidase-like protein [Arabidopsis thaliana]
gi|332644882|gb|AEE78403.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 361
Score = 164 bits (416), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 89/232 (38%), Positives = 133/232 (57%), Gaps = 23/232 (9%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHG--RSYKDELEKEMRLKIFKENL 75
+F ++ L +C E+ + ++++W + H RS E+E R +F+ N+
Sbjct: 9 LFSLVILQTACGFDYDDKEIESEEGLSTLYDRWRSHHSVPRSLN---EREKRFNVFRHNV 65
Query: 76 EYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTG-----YKMPSPSHRSTTSSTFKYQN 130
++ NK+ NR+YKL N+F+DLT +EF+ YTG ++M R + + ++N
Sbjct: 66 MHVHNTNKK-NRSYKLKLNKFADLTINEFKNAYTGSNIKHHRMLQGPKRGSKQFMYDHEN 124
Query: 131 LSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLD 190
LS +P+S+DWR KGAVT IKNQ +CG CWAF+ VAAVEGI KI++ L+ LSEQ+L+D
Sbjct: 125 LSK--LPSSVDWRKKGAVTEIKNQGKCGSCWAFSTVAAVEGINKIKTNKLVSLSEQELVD 182
Query: 191 CSTNGNNGCLGGSREKAFAYIIQNQGI----------FNGVCGTQLDHAVTI 232
C T N GC GG E AF +I +N GI +G C D+ V +
Sbjct: 183 CDTKQNEGCNGGLMEIAFEFIKKNGGITTEDSYPYEGIDGKCDASKDNGVLV 234
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 35/70 (50%), Positives = 49/70 (70%), Gaps = 5/70 (7%)
Query: 215 QGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLC 270
+G+F G CGT+L+H V VG+G+ E G YW+++NSWG WG+ GY+KI R+ EG C
Sbjct: 275 EGVFTGSCGTELNHGVAAVGYGS-ERGKKYWIVRNSWGAEWGEGGYIKIEREIDEPEGRC 333
Query: 271 GIGTRSSYPL 280
GI +SYP+
Sbjct: 334 GIAMEASYPI 343
>gi|125570286|gb|EAZ11801.1| hypothetical protein OsJ_01675 [Oryza sativa Japonica Group]
Length = 319
Score = 164 bits (416), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 88/220 (40%), Positives = 124/220 (56%), Gaps = 22/220 (10%)
Query: 29 ASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRT 88
A+ + + + +++ E+WMA+ G++YK EKE R IF++N+ +I +
Sbjct: 2 AASAYYNNGSDDGVTMQMFEEWMAKFGKTYKCHGEKEHRFGIFRDNVHFIRGYKPQVTYD 61
Query: 89 YKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAV 148
+G NQF+DLTNDEF A YTG K P P + + P +DWR +GAV
Sbjct: 62 SAVGINQFADLTNDEFVATYTGAKPPHPKEAP--------RPVDPIWTPCCIDWRFRGAV 113
Query: 149 TPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAF 208
T +K+Q CG CWAFAAVAA+EG+TKIR+G L LSEQ+L+DC TN +NGC GG ++AF
Sbjct: 114 TGVKDQGACGSCWAFAAVAAIEGLTKIRTGQLTPLSEQELVDCDTN-SNGCGGGHTDRAF 172
Query: 209 AYIIQNQGI----------FNGVCGTQ---LDHAVTIVGF 235
+ GI F G C +HA +I G+
Sbjct: 173 ELVASKGGITAESDYRYEGFQGKCRVDDMLFNHAASIGGY 212
>gi|118124|sp|P25250.1|CYSP2_HORVU RecName: Full=Cysteine proteinase EP-B 2; Flags: Precursor
gi|1146118|gb|AAA85036.1| cysteine proteinase EPB2 precursor [Hordeum vulgare subsp. vulgare]
Length = 373
Score = 164 bits (416), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 77/180 (42%), Positives = 114/180 (63%), Gaps = 3/180 (1%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
E+++ +++E+W + H R + EK R FK N +I NK G+ Y+L N+F D+
Sbjct: 39 EEALWDLYERWQSAH-RVRRHHAEKHRRFGTFKSNAHFIHSHNKRGDHPYRLHLNRFGDM 97
Query: 100 TNDEFRALYTG-YKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECG 158
EFRA + G + +PS + + F Y L+++D+P S+DWR KGAVT +K+Q +CG
Sbjct: 98 DQAEFRATFVGDLRRDTPS-KPPSVPGFMYAALNVSDLPPSVDWRQKGAVTGVKDQGKCG 156
Query: 159 CCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIF 218
CWAF+ V +VEGI IR+G+L+ LSEQ+L+DC T N+GC GG + AF YI N G+
Sbjct: 157 SCWAFSTVVSVEGINAIRTGSLVSLSEQELIDCDTADNDGCQGGLMDNAFEYIKNNGGLI 216
Score = 95.5 bits (236), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 37/76 (48%), Positives = 54/76 (71%), Gaps = 4/76 (5%)
Query: 209 AYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE- 267
A++ ++G+F G CGT+LDH V +VG+G EDG YW +KNSWG +WG+ GY+++ +D
Sbjct: 278 AFMFYSEGVFTGECGTELDHGVAVVGYGVAEDGKAYWTVKNSWGPSWGEQGYIRVEKDSG 337
Query: 268 ---GLCGIGTRSSYPL 280
GLCGI +SYP+
Sbjct: 338 ASGGLCGIAMEASYPV 353
>gi|348565223|ref|XP_003468403.1| PREDICTED: cathepsin L1-like [Cavia porcellus]
Length = 333
Score = 164 bits (415), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 112/342 (32%), Positives = 165/342 (48%), Gaps = 88/342 (25%)
Query: 16 TPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENL 75
TP F++ L + +VS+ +Q++ ++W A HGR Y E+ R ++++NL
Sbjct: 2 TPSFVLAALCLG----IVSALPKLDQTLDAQWDQWKAAHGRLYGLN-EEGWRRAVWEKNL 56
Query: 76 EYIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
IE N E G ++ LG N F D+TN+EFR + G++ H+ + YQ
Sbjct: 57 RMIELHNGEYSQGRHSFTLGMNHFGDMTNEEFRQVMNGFQ-----HQKHKTGKM-YQEPL 110
Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
+ +P S+DWR+KG VT +KNQ +CG CWAF+A ++EG ++GNL+ LSEQ L+DCS
Sbjct: 111 LLQLPKSVDWREKGYVTEVKNQGQCGSCWAFSATGSLEGQMFHKTGNLVSLSEQNLVDCS 170
Query: 193 ----TNGNNGCL------------GGSREKAFAYIIQN---------------------- 214
G NG L G EK++ Y+ ++
Sbjct: 171 RPQGNQGCNGGLMDFAFQYVKDNKGLEAEKSYPYVGKDGECKYKPELSAANDTGFVDVPQ 230
Query: 215 ------------------------------QGIF--NGVCGTQLDHAVTIVGFGT--TED 240
+GI+ G L+H V +VG+GT +E
Sbjct: 231 REKVVQKALATVGPLSVAIDAGLQSFQFYKEGIYYDPGCSSRDLNHGVLLVGYGTDASET 290
Query: 241 G-ANYWLIKNSWGNTWGDAGYMKIVRDE-GLCGIGTRSSYPL 280
G +YWLIKNSWG TWG GY+KI R+ CG+ T +SYPL
Sbjct: 291 GKGDYWLIKNSWGTTWGADGYVKIARNRNNHCGVATAASYPL 332
>gi|168017893|ref|XP_001761481.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162687165|gb|EDQ73549.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 471
Score = 164 bits (415), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 78/178 (43%), Positives = 115/178 (64%), Gaps = 4/178 (2%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
+ ++++ +W+ +H R Y EK+ R +IFK+NL YI NK+ ++Y LG N+FSDL
Sbjct: 45 DDGMLDVFHQWLERHSRVYHSLSEKQRRFQIFKDNLHYIHNHNKQ-EKSYWLGLNKFSDL 103
Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGC 159
T+DEFRALY G + +H F Y+++ ++ +DWR KGAV+ +K+Q CG
Sbjct: 104 THDEFRALYLGIRPAGRAHGLRNGDRFIYEDVVAEEM---VDWRKKGAVSDVKDQGSCGS 160
Query: 160 CWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
CWAF+A+ +VEG+ I +G LI LSEQ+L+DC N GC GG + AF +II+N GI
Sbjct: 161 CWAFSAIGSVEGVNAIVTGELISLSEQELVDCDRGQNQGCNGGLMDYAFDFIIKNGGI 218
Score = 84.3 bits (207), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 35/70 (50%), Positives = 47/70 (67%), Gaps = 5/70 (7%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR-----DEGLC 270
G+F G CGT LDH V VG+GT +DG NYW++KNSWG +WG+ GY+++ R G C
Sbjct: 286 GVFTGPCGTDLDHGVLAVGYGTDDDGVNYWIVKNSWGPSWGEKGYIRMERMGSNSTSGKC 345
Query: 271 GIGTRSSYPL 280
GI S+P+
Sbjct: 346 GINIEPSFPI 355
>gi|118120|sp|P25249.1|CYSP1_HORVU RecName: Full=Cysteine proteinase EP-B 1; Flags: Precursor
gi|1146116|gb|AAA85035.1| cysteine proteinase EPB1 precursor [Hordeum vulgare subsp. vulgare]
Length = 371
Score = 164 bits (415), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 75/179 (41%), Positives = 110/179 (61%), Gaps = 1/179 (0%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
E+++ +++E+W + H R + EK R FK N +I NK G+ Y+L N+F D+
Sbjct: 39 EEALWDLYERWQSAH-RVRRHHAEKHRRFGTFKSNAHFIHSHNKRGDHPYRLHLNRFGDM 97
Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGC 159
EFRA + G + + F Y L+++D+P S+DWR KGAVT +K+Q +CG
Sbjct: 98 DQAEFRATFVGDLRRDTPAKPPSVPGFMYAALNVSDLPPSVDWRQKGAVTGVKDQGKCGS 157
Query: 160 CWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIF 218
CWAF+ V +VEGI IR+G+L+ LSEQ+L+DC T N+GC GG + AF YI N G+
Sbjct: 158 CWAFSTVVSVEGINAIRTGSLVSLSEQELIDCDTADNDGCQGGLMDNAFEYIKNNGGLI 216
Score = 94.7 bits (234), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 37/76 (48%), Positives = 54/76 (71%), Gaps = 4/76 (5%)
Query: 209 AYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE- 267
A++ ++G+F G CGT+LDH V +VG+G EDG YW +KNSWG +WG+ GY+++ +D
Sbjct: 278 AFMFYSEGVFTGDCGTELDHGVAVVGYGVAEDGKAYWTVKNSWGPSWGEQGYIRVEKDSG 337
Query: 268 ---GLCGIGTRSSYPL 280
GLCGI +SYP+
Sbjct: 338 ASGGLCGIAMEASYPV 353
>gi|317106666|dbj|BAJ53169.1| JHL18I08.3 [Jatropha curcas]
Length = 441
Score = 164 bits (415), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 74/175 (42%), Positives = 112/175 (64%), Gaps = 1/175 (0%)
Query: 43 VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTND 102
+ + E W QHG++Y + EK RLK+F++N +++ + N +GN +Y L N F+DLT+
Sbjct: 26 IAHLFETWCQQHGKTYASQEEKLFRLKVFQDNYDFVTEHNSQGNSSYTLSLNAFADLTHH 85
Query: 103 EFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWA 162
EF+A G + + + S + + + DVP S+DWR GAVT +K+Q CG CW+
Sbjct: 86 EFKASRLGLSSAASASLNVDRSNRQIPDF-VADVPASVDWRKNGAVTQVKDQGNCGACWS 144
Query: 163 FAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
F+A A+EGI KI +G+L+ LSEQ+L+DC + NNGC GG + AF ++I N GI
Sbjct: 145 FSATGAIEGINKIVTGSLVSLSEQELVDCDKSYNNGCEGGIMDYAFQFVIDNHGI 199
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 49/120 (40%), Positives = 69/120 (57%), Gaps = 16/120 (13%)
Query: 166 VAAVEGITKIRSGNLIQLSEQQLLDCSTNG--NNGCLGGSREKAFAYIIQNQGIFNGVCG 223
V ++G + N E++LL N + G G R A+ + ++GIF G C
Sbjct: 223 VVTIDGYVDVPQNN-----EKELLKAVANQPVSVGICGSER----AFQLYSKGIFTGPCS 273
Query: 224 TQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 279
T LDHAV IVG+G+ E+G +YW++KNSWG+ WG GYM + R+ GLCGI +SYP
Sbjct: 274 TSLDHAVLIVGYGS-ENGVDYWIVKNSWGSYWGMDGYMHMQRNSGSSRGLCGINMLASYP 332
>gi|312451836|gb|ADQ85985.1| actinidin [Actinidia chinensis]
Length = 380
Score = 164 bits (414), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 86/206 (41%), Positives = 127/206 (61%), Gaps = 5/206 (2%)
Query: 13 INTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFK 72
++ + +F L++S A + V ++E W+ ++G+SY E E R +IFK
Sbjct: 8 VSMSLLFFSTLLILSLAFNTKNLTQRTNDEVKAMYESWLIKYGKSYNSLGEWERRFEIFK 67
Query: 73 ENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
E L +I++ N + NR+YK+G NQF+DLT++EFR+ Y G+ S S+++ S+ +Y+
Sbjct: 68 ETLRFIDEHNADTNRSYKVGLNQFADLTDEEFRSTYLGFT--SGSNKTKVSN--RYEPRV 123
Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
+P+ +DWR GAV IK+Q ECG CWAF+A+A VEGI KI +G LI LSEQ+L+DC
Sbjct: 124 GQVLPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSEQELIDCG 183
Query: 193 -TNGNNGCLGGSREKAFAYIIQNQGI 217
T GC GG F +II N GI
Sbjct: 184 RTQNTRGCNGGYITDGFQFIINNGGI 209
Score = 90.1 bits (222), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 41/68 (60%), Positives = 51/68 (75%), Gaps = 4/68 (5%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD---EGLCGI 272
GIF G CGT +DHAVTIVG+GT E G +YW++KNSW TWG+ GYM+I+R+ G CGI
Sbjct: 276 GIFTGPCGTAIDHAVTIVGYGT-EGGIDYWIVKNSWDTTWGEEGYMRILRNVGGAGTCGI 334
Query: 273 GTRSSYPL 280
T SYP+
Sbjct: 335 ATMPSYPV 342
>gi|193806686|sp|A5HII1.1|ACTN_ACTDE RecName: Full=Actinidain; Short=Actinidin; AltName: Full=Allergen
Act d 1; AltName: Allergen=Act d 1; Flags: Precursor
gi|146215974|gb|ABQ10189.1| actinidin Act1a [Actinidia deliciosa]
Length = 380
Score = 164 bits (414), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 86/206 (41%), Positives = 127/206 (61%), Gaps = 5/206 (2%)
Query: 13 INTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFK 72
++ + +F L++S A + V ++E W+ ++G+SY E E R +IFK
Sbjct: 8 VSMSLLFFSTLLILSLAFNAKNLTQRTNDEVKAMYESWLIKYGKSYNSLGEWERRFEIFK 67
Query: 73 ENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
E L +I++ N + NR+YK+G NQF+DLT++EFR+ Y G+ S S+++ S+ +Y+
Sbjct: 68 ETLRFIDEHNADTNRSYKVGLNQFADLTDEEFRSTYLGFT--SGSNKTKVSN--RYEPRV 123
Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
+P+ +DWR GAV IK+Q ECG CWAF+A+A VEGI KI +G LI LSEQ+L+DC
Sbjct: 124 GQVLPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSEQELIDCG 183
Query: 193 -TNGNNGCLGGSREKAFAYIIQNQGI 217
T GC GG F +II N GI
Sbjct: 184 RTQNTRGCNGGYITDGFQFIINNGGI 209
Score = 90.1 bits (222), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 41/68 (60%), Positives = 51/68 (75%), Gaps = 4/68 (5%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD---EGLCGI 272
GIF G CGT +DHAVTIVG+GT E G +YW++KNSW TWG+ GYM+I+R+ G CGI
Sbjct: 276 GIFTGPCGTAIDHAVTIVGYGT-EGGIDYWIVKNSWDTTWGEEGYMRILRNVGGAGTCGI 334
Query: 273 GTRSSYPL 280
T SYP+
Sbjct: 335 ATMPSYPV 342
>gi|2144501|pir||TAGB actinidain (EC 3.4.22.14) precursor - kiwi fruit
gi|166317|gb|AAA32629.1| actinidin [Actinidia deliciosa]
Length = 380
Score = 164 bits (414), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 86/206 (41%), Positives = 127/206 (61%), Gaps = 5/206 (2%)
Query: 13 INTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFK 72
++ + +F L++S A + V ++E W+ ++G+SY E E R +IFK
Sbjct: 8 VSMSLLFFSTLLILSLAFNAKNLTQRTNDEVKAMYESWLIKYGKSYNSLGEWERRFEIFK 67
Query: 73 ENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
E L +I++ N + NR+YK+G NQF+DLT++EFR+ Y G+ S S+++ S+ +Y+
Sbjct: 68 ETLRFIDEHNADTNRSYKVGLNQFADLTDEEFRSTYLGFT--SGSNKTKVSN--RYEPRV 123
Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
+P+ +DWR GAV IK+Q ECG CWAF+A+A VEGI KI +G LI LSEQ+L+DC
Sbjct: 124 GQVLPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSEQELIDCG 183
Query: 193 -TNGNNGCLGGSREKAFAYIIQNQGI 217
T GC GG F +II N GI
Sbjct: 184 RTQNTRGCNGGYITDGFQFIINNGGI 209
Score = 90.1 bits (222), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 41/68 (60%), Positives = 51/68 (75%), Gaps = 4/68 (5%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD---EGLCGI 272
GIF G CGT +DHAVTIVG+GT E G +YW++KNSW TWG+ GYM+I+R+ G CGI
Sbjct: 276 GIFTGPCGTAIDHAVTIVGYGT-EGGIDYWIVKNSWDTTWGEEGYMRILRNVGGAGTCGI 334
Query: 273 GTRSSYPL 280
T SYP+
Sbjct: 335 ATMPSYPV 342
>gi|15984|emb|CAA34486.1| unnamed protein product [Actinidia deliciosa]
Length = 380
Score = 164 bits (414), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 86/206 (41%), Positives = 127/206 (61%), Gaps = 5/206 (2%)
Query: 13 INTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFK 72
++ + +F L++S A + V ++E W+ ++G+SY E E R +IFK
Sbjct: 8 VSMSLLFFSTLLILSLAFNAKNLTQRTNDEVKAMYESWLIKYGKSYNSLGEWERRFEIFK 67
Query: 73 ENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
E L +I++ N + NR+YK+G NQF+DLT++EFR+ Y G+ S S+++ S+ +Y+
Sbjct: 68 ETLRFIDEHNADTNRSYKVGLNQFADLTDEEFRSTYLGFT--SGSNKTKVSN--RYEPRF 123
Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
+P+ +DWR GAV IK+Q ECG CWAF+A+A VEGI KI +G LI LSEQ+L+DC
Sbjct: 124 GQVLPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSEQELIDCG 183
Query: 193 -TNGNNGCLGGSREKAFAYIIQNQGI 217
T GC GG F +II N GI
Sbjct: 184 RTQNTRGCNGGYITDGFQFIINNGGI 209
Score = 90.1 bits (222), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 41/68 (60%), Positives = 51/68 (75%), Gaps = 4/68 (5%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD---EGLCGI 272
GIF G CGT +DHAVTIVG+GT E G +YW++KNSW TWG+ GYM+I+R+ G CGI
Sbjct: 276 GIFTGPCGTAIDHAVTIVGYGT-EGGIDYWIVKNSWDTTWGEEGYMRILRNVGGAGTCGI 334
Query: 273 GTRSSYPL 280
T SYP+
Sbjct: 335 ATMPSYPV 342
>gi|220983358|dbj|BAH11164.1| cysteine protease [Hordeum vulgare]
Length = 462
Score = 163 bits (413), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 79/181 (43%), Positives = 118/181 (65%), Gaps = 6/181 (3%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN---KEGNRTYKLGTNQF 96
E+ V ++ +WMA+H +Y E+E R + F+ NL YI++ N G +++LG N+F
Sbjct: 35 EEEVRRMYAEWMAEHHSTYNPIGEEERRFEAFRNNLRYIDQHNAAADAGVHSFRLGLNRF 94
Query: 97 SDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKE 156
+DLTN+E+R+ Y G + R ++ +YQ ++P S+DWR KGAV +K+Q
Sbjct: 95 ADLTNEEYRSTYLGARTKPDRERKLSA---RYQAADNDELPESVDWRKKGAVGAVKDQGG 151
Query: 157 CGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQG 216
CG CWAF+A+AAVEGI +I +G++I LSEQ+L+DC T+ N GC GG + AF +II N G
Sbjct: 152 CGSCWAFSAIAAVEGINQIVTGDMIPLSEQELVDCDTSYNQGCNGGLMDYAFEFIINNGG 211
Query: 217 I 217
I
Sbjct: 212 I 212
Score = 84.0 bits (206), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 37/75 (49%), Positives = 50/75 (66%), Gaps = 5/75 (6%)
Query: 209 AYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-- 266
A+ + GIF G CGT LDH V VG+GT E+G +YWL++NSWG+ WG+ GY+++ R+
Sbjct: 272 AFQLYKSGIFTGTCGTALDHGVAAVGYGT-ENGKDYWLVRNSWGSVWGENGYIRMERNIK 330
Query: 267 --EGLCGIGTRSSYP 279
G CGI SYP
Sbjct: 331 ASSGKCGIAVEPSYP 345
>gi|1174171|gb|AAB41816.1| NTH1 [Pisum sativum]
Length = 367
Score = 163 bits (413), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 86/200 (43%), Positives = 126/200 (63%), Gaps = 7/200 (3%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
+F +ITL +S + S RS E V+ ++EKW+ +H + Y EK R +IFK+NL +
Sbjct: 10 LFGLITLSLSL--DMSSGRSNKE--VMTMYEKWLVKHQKVYYGLGEKNQRFQIFKDNLIF 65
Query: 78 IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
I++ N N +Y++G N+FSD+TN E+R Y + TS + Y+ +P
Sbjct: 66 IDEHNAP-NHSYRVGLNEFSDITNKEYRDTYLSRWSNNNIKNKITSVRYAYKAGHNNKLP 124
Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNN 197
S+DWR GA+TPIKNQ CG CWAF+AVAAVE I KI +G+L+ LSEQ+L+DC N
Sbjct: 125 VSVDWR--GALTPIKNQGSCGACWAFSAVAAVEAINKIVTGSLVSLSEQELVDCDRTKNK 182
Query: 198 GCLGGSREKAFAYIIQNQGI 217
GC GG++ A+ +I++N G+
Sbjct: 183 GCNGGNQVNAYRFIVENGGL 202
Score = 85.1 bits (209), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 37/69 (53%), Positives = 49/69 (71%), Gaps = 6/69 (8%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR-----DEGLC 270
G+F G CGT LDHAV +VG+G+ E+G +YWL+KNSWG WG+ GY+KI R + G C
Sbjct: 269 GVFTGSCGTSLDHAVVVVGYGS-ENGKDYWLVKNSWGTNWGERGYLKIERNLKNTNTGKC 327
Query: 271 GIGTRSSYP 279
GI ++YP
Sbjct: 328 GIAMDATYP 336
>gi|115484973|ref|NP_001067630.1| Os11g0255300 [Oryza sativa Japonica Group]
gi|530335|emb|CAA56844.1| cysteine protease [Oryza sativa Japonica Group]
gi|5761322|dbj|BAA83472.1| cysteine endopeptidase [Oryza sativa Japonica Group]
gi|62732672|gb|AAX94791.1| cysteine proteinase (EC 3.4.22.-) - rice [Oryza sativa Japonica
Group]
gi|62732673|gb|AAX94792.1| cysteine proteinase (EC 3.4.22.-) - rice [Oryza sativa Japonica
Group]
gi|62732674|gb|AAX94793.1| cysteine proteinase (EC 3.4.22.-) - rice [Oryza sativa Japonica
Group]
gi|77549615|gb|ABA92412.1| Thiol protease SEN102 precursor, putative, expressed [Oryza sativa
Japonica Group]
gi|77549616|gb|ABA92413.1| Thiol protease SEN102 precursor, putative, expressed [Oryza sativa
Japonica Group]
gi|77549617|gb|ABA92414.1| Thiol protease SEN102 precursor, putative, expressed [Oryza sativa
Japonica Group]
gi|113644852|dbj|BAF27993.1| Os11g0255300 [Oryza sativa Japonica Group]
gi|125576789|gb|EAZ18011.1| hypothetical protein OsJ_33558 [Oryza sativa Japonica Group]
gi|215701098|dbj|BAG92522.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 378
Score = 163 bits (413), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 79/194 (40%), Positives = 120/194 (61%), Gaps = 16/194 (8%)
Query: 38 THEQSVVEIHEKWMAQH--------GRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTY 89
+ E+S+ ++E+W +++ G D+ E R +F EN YI +AN+ G R +
Sbjct: 33 SSEESLRALYERWRSRYTVSRPAASGGVGNDDGEARRRFNVFVENARYIHEANRRGGRPF 92
Query: 90 KLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSS------TFKYQNLSMTDVPTSLDWR 143
+L N+F+D+T DEFR Y G + + HRS + +F+Y ++P ++DWR
Sbjct: 93 RLALNKFADMTTDEFRRTYAGSR--ARHHRSLSGGRGGEGGSFRYGGDDEDNLPPAVDWR 150
Query: 144 DKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGS 203
++GAVT IK+Q +CG CWAF+ VAAVEG+ KI++G L+ LSEQ+L+DC T N GC GG
Sbjct: 151 ERGAVTGIKDQGQCGSCWAFSTVAAVEGVNKIKTGRLVTLSEQELVDCDTGDNQGCDGGL 210
Query: 204 REKAFAYIIQNQGI 217
+ AF +I +N GI
Sbjct: 211 MDYAFQFIKRNGGI 224
Score = 87.8 bits (216), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 36/72 (50%), Positives = 48/72 (66%), Gaps = 5/72 (6%)
Query: 214 NQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR-----DEG 268
++G+F G CGT LDH V VG+G T DG YW++KNSWG WG+ GY+++ R G
Sbjct: 289 SEGVFTGECGTDLDHGVAAVGYGITRDGTKYWIVKNSWGEDWGERGYIRMQRGVSSDSNG 348
Query: 269 LCGIGTRSSYPL 280
LCGI +SYP+
Sbjct: 349 LCGIAMEASYPV 360
>gi|359359166|gb|AEV41071.1| putative oryzain beta chain precursor [Oryza minuta]
Length = 464
Score = 163 bits (413), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 105/313 (33%), Positives = 164/313 (52%), Gaps = 77/313 (24%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN-KEGNRTYKLGTNQFSD 98
E ++ W+A++GRSY E E R ++F +NL + + N + + ++LG N+F+D
Sbjct: 46 EAEARAAYDLWLAENGRSYNALGEHERRFRVFWDNLRFADAHNARADDHGFRLGMNRFAD 105
Query: 99 LTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECG 158
LTN+EFRA + G K+ S ++ +Y++ + ++P S+DWR+KGAV P+KNQ +CG
Sbjct: 106 LTNEEFRATFLGAKVVERSR----AAGERYRHDGVEELPESVDWREKGAVAPVKNQGQCG 161
Query: 159 CCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGS-REKAFAYIIQNQGI 217
CWAF+AV+ VE I ++ +G +I LSEQ+L++CSTNG NG G + AF +II+N GI
Sbjct: 162 SCWAFSAVSTVESINQLVTGEMITLSEQELVECSTNGQNGGCNGGLMDDAFDFIIKNGGI 221
Query: 218 ----------FNGVCGTQLDHA--VTIVGF------------------------------ 235
+G C ++A V+I GF
Sbjct: 222 DTEDDYPYKAVDGKCDINRENAKVVSIDGFEDVPQNDEKSLQKAVAHQPVSVAIEAGGRE 281
Query: 236 -------------GTTED------------GANYWLIKNSWGNTWGDAGYMKIVRD---- 266
GT+ D G +YW+++NSWG WG++GY+++ R+
Sbjct: 282 FQLYHSGVFSGRCGTSLDHGVVAVGYGTDNGKDYWIVRNSWGPKWGESGYVRMERNINVT 341
Query: 267 EGLCGIGTRSSYP 279
G CGI +SYP
Sbjct: 342 TGKCGIAMMASYP 354
>gi|57118009|gb|AAW34136.1| cysteine protease gp3a [Zingiber officinale]
Length = 475
Score = 163 bits (412), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 92/209 (44%), Positives = 129/209 (61%), Gaps = 10/209 (4%)
Query: 13 INTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFK 72
++ P I+TL A + RS E + I+++W +H + D+ + RL++FK
Sbjct: 21 VSVVPPLDILTLSKQ-AWAAPAGRSDEEVRI--IYQEWRVKHRPAENDQYVGDYRLEVFK 77
Query: 73 ENLEYIEKANKEGNR---TYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQ 129
ENL ++++ N +R Y+LG N+F+DLTN+E+RA + + S RST+
Sbjct: 78 ENLRFVDEHNAAADRGEHAYRLGMNRFADLTNEEYRARFL--RDLSRLGRSTSGEISNQY 135
Query: 130 NLSMTDV-PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQL 188
L DV P S+DWR+KGAV +KNQ CG CWAFAA+AAVEGI +I +G+LI LSEQQL
Sbjct: 136 RLREGDVLPDSIDWREKGAVVAVKNQGRCGSCWAFAAIAAVEGINQIVTGDLISLSEQQL 195
Query: 189 LDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
+DCST N GC GG +AF YII N G+
Sbjct: 196 VDCSTR-NYGCEGGWPYRAFQYIINNGGV 223
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 35/75 (46%), Positives = 51/75 (68%), Gaps = 5/75 (6%)
Query: 210 YIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD--- 266
+ + + GIF G C T L+H VT+VG+GT E+G +YW++KNSWG WG++GY+ + R+
Sbjct: 284 FQLYHSGIFTGSCNTSLNHGVTVVGYGT-ENGNDYWIVKNSWGENWGNSGYILMERNIAE 342
Query: 267 -EGLCGIGTRSSYPL 280
G CGI SYP+
Sbjct: 343 SSGKCGIAISPSYPI 357
>gi|167345238|gb|ABZ69059.1| cysteine protease [Pinus radiata]
gi|167345240|gb|ABZ69060.1| cysteine protease [Pinus radiata]
Length = 185
Score = 163 bits (412), Expect = 9e-38, Method: Compositional matrix adjust.
Identities = 76/162 (46%), Positives = 115/162 (70%), Gaps = 3/162 (1%)
Query: 32 VVSSRSTHEQ-SVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYK 90
++S++ E +++E++E W+A+H ++Y EK+ R +FK+N YI + N +GNR+YK
Sbjct: 26 IISNKDLREDDAIMELYELWLAEHKKAYNGLDEKQKRFTVFKDNFLYIHEHN-QGNRSYK 84
Query: 91 LGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTP 150
LG NQF+DL+++EF+A Y G K+ + R S + +YQ D+P S+DWR+KGAV P
Sbjct: 85 LGLNQFADLSHEEFKATYLGAKLDTKK-RLLRSPSPRYQYSDGEDLPKSIDWREKGAVAP 143
Query: 151 IKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
+K+Q CG CWAF+ VAAVEGI +I +G+LI LSEQ+L+DC
Sbjct: 144 VKDQGACGSCWAFSTVAAVEGINQIVTGDLISLSEQELVDCD 185
>gi|222629675|gb|EEE61807.1| hypothetical protein OsJ_16426 [Oryza sativa Japonica Group]
Length = 459
Score = 162 bits (411), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 82/189 (43%), Positives = 117/189 (61%), Gaps = 6/189 (3%)
Query: 32 VVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN---KEGNRT 88
+VS E+ ++ +W A+HG+SY E+E R F++NL YI++ N G +
Sbjct: 26 IVSYGERSEEEARRLYAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDEHNAAADAGVHS 85
Query: 89 YKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAV 148
++LG N+F+DLTN+E+R Y G + R + N ++ P S+DWR KGAV
Sbjct: 86 FRLGLNRFADLTNEEYRDTYLGLRNKPRRERKVSDRYLAADNEAL---PESVDWRTKGAV 142
Query: 149 TPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAF 208
IK+Q CG CWAF+A+AAVEGI +I +G+LI LSEQ+L+DC T+ N GC GG + AF
Sbjct: 143 AEIKDQGGCGSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDTSYNEGCNGGLMDYAF 202
Query: 209 AYIIQNQGI 217
+II N GI
Sbjct: 203 DFIINNGGI 211
Score = 85.1 bits (209), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 37/76 (48%), Positives = 53/76 (69%), Gaps = 5/76 (6%)
Query: 209 AYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-- 266
A+ + + GIF G CGT LDH V VG+GT E+G +YW+++NSWG +WG++GY+++ R+
Sbjct: 271 AFQLYSSGIFTGKCGTALDHGVAAVGYGT-ENGKDYWIVRNSWGKSWGESGYVRMERNIK 329
Query: 267 --EGLCGIGTRSSYPL 280
G CGI SYPL
Sbjct: 330 ASSGKCGIAVEPSYPL 345
>gi|109939734|sp|P25776.2|ORYA_ORYSJ RecName: Full=Oryzain alpha chain; Flags: Precursor
gi|78192122|gb|ABB30151.1| oryzain alpha [Oryza sativa Japonica Group]
Length = 458
Score = 162 bits (411), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 82/189 (43%), Positives = 117/189 (61%), Gaps = 6/189 (3%)
Query: 32 VVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN---KEGNRT 88
+VS E+ ++ +W A+HG+SY E+E R F++NL YI++ N G +
Sbjct: 25 IVSYGERSEEEARRLYAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDEHNAAADAGVHS 84
Query: 89 YKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAV 148
++LG N+F+DLTN+E+R Y G + R + N ++ P S+DWR KGAV
Sbjct: 85 FRLGLNRFADLTNEEYRDTYLGLRNKPRRERKVSDRYLAADNEAL---PESVDWRTKGAV 141
Query: 149 TPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAF 208
IK+Q CG CWAF+A+AAVEGI +I +G+LI LSEQ+L+DC T+ N GC GG + AF
Sbjct: 142 AEIKDQGGCGSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDTSYNEGCNGGLMDYAF 201
Query: 209 AYIIQNQGI 217
+II N GI
Sbjct: 202 DFIINNGGI 210
Score = 85.1 bits (209), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 37/76 (48%), Positives = 53/76 (69%), Gaps = 5/76 (6%)
Query: 209 AYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-- 266
A+ + + GIF G CGT LDH V VG+GT E+G +YW+++NSWG +WG++GY+++ R+
Sbjct: 270 AFQLYSSGIFTGKCGTALDHGVAAVGYGT-ENGKDYWIVRNSWGKSWGESGYVRMERNIK 328
Query: 267 --EGLCGIGTRSSYPL 280
G CGI SYPL
Sbjct: 329 ASSGKCGIAVEPSYPL 344
>gi|40806500|gb|AAR92155.1| putative cysteine protease 2 [Iris x hollandica]
Length = 359
Score = 162 bits (411), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 82/181 (45%), Positives = 120/181 (66%), Gaps = 7/181 (3%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
E+S+ ++E+W + H S +D EK R +FKEN ++I + NK+ + YKLG N+F+D+
Sbjct: 33 EESLWGLYERWRSHHTVS-RDLSEKNKRFNVFKENAKFIHEFNKK-DAPYKLGLNKFADM 90
Query: 100 TNDEFRALYTGYKMPSP-SHRSTTSST--FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKE 156
TN EFR+ Y G K+ + R T +T F Y+N+ +P S+DWR +GAV P+K+Q +
Sbjct: 91 TNQEFRSTYAGSKIHHHRTQRGTPRATGSFMYENVH--SIPASVDWRTQGAVAPVKDQGQ 148
Query: 157 CGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQG 216
CG CWAF+ +A+VEGI KI++ L+ LS QQL+DC T+ N GC GG + AF +I N G
Sbjct: 149 CGSCWAFSTIASVEGINKIKTNQLVPLSGQQLVDCDTDQNEGCNGGLMDYAFEFIKSNGG 208
Query: 217 I 217
I
Sbjct: 209 I 209
Score = 90.9 bits (224), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 36/77 (46%), Positives = 50/77 (64%), Gaps = 4/77 (5%)
Query: 208 FAYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD- 266
A+ ++G+F G CG +LDH V +VG+G T DG YW+++NSWG WG+ GY+++ R
Sbjct: 267 MAFQFYSEGVFTGSCGNELDHGVAVVGYGATRDGTKYWIVRNSWGAEWGEKGYIRMQRGI 326
Query: 267 ---EGLCGIGTRSSYPL 280
GLCGI SYPL
Sbjct: 327 RARHGLCGIAMEPSYPL 343
>gi|9502426|gb|AAF88125.1|AC021043_18 Putative cysteine proteinase [Arabidopsis thaliana]
Length = 365
Score = 162 bits (410), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 93/231 (40%), Positives = 130/231 (56%), Gaps = 29/231 (12%)
Query: 13 INTTPMFIIITLLVS--CASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKI 70
++ +F+ +T+L SQ + +EQS+V+ H++WM Q R YKDE EKEMRLK+
Sbjct: 2 VSVRSVFVALTILSMDLRISQARPHVTLNEQSIVDYHQQWMTQFSRVYKDESEKEMRLKV 61
Query: 71 FKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQN 130
FK+NL++IE N GN++Y LG N+F+D +EF A +TG ++ S + T +N
Sbjct: 62 FKKNLKFIENFNNMGNQSYTLGVNEFTDWKTEEFLATHTGLRVNVTSLSELFNKTKPSRN 121
Query: 131 LSMTDVPT---SLDWRDKGAVTPIKNQKECGCCWAFAAVA-------------------- 167
+M+D+ S DWRD+GAVTP+K Q G C F
Sbjct: 122 WNMSDIDMEDESKDWRDEGAVTPVKYQ---GACPEFPTKQIRRNSLVGKQYTKLLGVLSD 178
Query: 168 -AVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
EG+TKI NL+ LSEQQL+DC N GC GG E+AF YII+N G+
Sbjct: 179 WGDEGLTKISGKNLLTLSEQQLIDCDIEKNGGCNGGEFEEAFKYIIKNGGV 229
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 46/114 (40%), Positives = 71/114 (62%), Gaps = 9/114 (7%)
Query: 173 TKIRSGNLI-QLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIFNGV-CGTQLDHAV 230
T+IR ++ +E+ LL+ L +R +F + G++ G+ CGT ++HAV
Sbjct: 254 TQIRGFQMVPSHNERALLEAVRRQPVSVLIDARADSFGHY--KGGVYAGLDCGTDVNHAV 311
Query: 231 TIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 280
TIVG+GT G NYW++KNSWG +WG+ GYM+I RD +G+CGI ++YP+
Sbjct: 312 TIVGYGTM-SGLNYWVLKNSWGESWGENGYMRIRRDVEWPQGMCGIAQVAAYPV 364
>gi|255538788|ref|XP_002510459.1| cysteine protease, putative [Ricinus communis]
gi|223551160|gb|EEF52646.1| cysteine protease, putative [Ricinus communis]
Length = 422
Score = 162 bits (410), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 82/205 (40%), Positives = 125/205 (60%), Gaps = 5/205 (2%)
Query: 13 INTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFK 72
+N +ITLL + S + + ++ E W +HG++Y + +K R KIF+
Sbjct: 1 MNFLSALFLITLLFF---NLSISSFSSSSDISKLFESWTKEHGKTYTSKEDKLYRFKIFE 57
Query: 73 ENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
EN E+++K N +GN +Y L N F+DLT+ EF+A G S S + + F +
Sbjct: 58 ENYEFVKKHNSQGNSSYTLSLNAFADLTHHEFKASRLGLSAFSTSGK-LSRRNFPLHDF- 115
Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
+ DVP S+DWR KGAV+ +K+Q CG CW+F+A A+EGI KI +G+L+ LSEQ+L+DC
Sbjct: 116 VGDVPISIDWRKKGAVSQVKDQGNCGACWSFSATGAIEGINKIVTGSLVSLSEQELVDCD 175
Query: 193 TNGNNGCLGGSREKAFAYIIQNQGI 217
+ NNGC GG + A+ ++I+N GI
Sbjct: 176 RSYNNGCEGGLMDYAYQFVIENNGI 200
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 38/76 (50%), Positives = 55/76 (72%), Gaps = 5/76 (6%)
Query: 209 AYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-- 266
A+ + ++GIF G C T LDHAV IVG+G+ E+G +YW++KNSWG WG GYM ++R+
Sbjct: 260 AFQLYSKGIFTGPCSTSLDHAVLIVGYGS-ENGVDYWIVKNSWGTHWGINGYMYMLRNSG 318
Query: 267 --EGLCGIGTRSSYPL 280
+GLCGI +S+P+
Sbjct: 319 NSQGLCGINMLASFPV 334
>gi|537437|gb|AAC35211.1| cysteine proteinase [Hemerocallis hybrid cultivar]
Length = 359
Score = 162 bits (410), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 89/212 (41%), Positives = 131/212 (61%), Gaps = 23/212 (10%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
E+S+ ++EKW A H S +D + + R +FKEN+++I + N++ + TYKL N+F D+
Sbjct: 34 EESLWSLYEKWRAHHAVS-RDLDDTDKRFNVFKENVKFIHEFNQKKDATYKLALNKFGDM 92
Query: 100 TNDEFRALYTGYKMPSPSHRST-----TSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQ 154
TN EFR+ Y G K+ H T + F Y+ D+PTS+DWR+KGAVT +K+Q
Sbjct: 93 TNQEFRSTYAGSKI---DHHMTLRGVKDAGEFSYEKFH--DLPTSVDWREKGAVTGVKDQ 147
Query: 155 KECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQN 214
+CG CWAF+ V AVEGI +I++ L+ LSEQQL+DC T N+GC GG + AF +I N
Sbjct: 148 GQCGSCWAFSTVVAVEGINQIKTNELVSLSEQQLVDCDTK-NSGCNGGLMDYAFDFIKNN 206
Query: 215 QGIFN----------GVCGTQLDHA-VTIVGF 235
G+ + CG++ + A VTI G+
Sbjct: 207 GGLSSEDSYPYLAEQKSCGSEANSAVVTIDGY 238
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 47/137 (34%), Positives = 74/137 (54%), Gaps = 18/137 (13%)
Query: 151 IKNQKECGCCWAFAAVAAVEGITKIRSGN---LIQLSEQQLLDCSTNGNNGCLGGSREKA 207
+ QK CG A +AV ++G + N L++ Q + + +
Sbjct: 218 LAEQKSCGS-EANSAVVTIDGYQDVPRNNEAALMKAVANQPVSVAIEAS----------G 266
Query: 208 FAYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR-- 265
+A+ +QG+F+G CGT+LDH V VG+G +DG YW++KNSWG WG++GY+++ R
Sbjct: 267 YAFQFYSQGVFSGHCGTELDHGVAAVGYGVDDDGKKYWIVKNSWGEGWGESGYIRMERGI 326
Query: 266 --DEGLCGIGTRSSYPL 280
G CGI +SYP+
Sbjct: 327 KDKRGKCGIAMEASYPI 343
>gi|356545079|ref|XP_003540973.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 330
Score = 162 bits (410), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 101/295 (34%), Positives = 138/295 (46%), Gaps = 77/295 (26%)
Query: 29 ASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRT 88
ASQV + R+ + S+ E HE+WM+++G+ YKD E+E R +IFKEN+ YIE +N +
Sbjct: 5 ASQV-TCRTLQDASMYERHEEWMSRYGKVYKDPREREKRFRIFKENMNYIETSNNVAIKP 63
Query: 89 YKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAV 148
KL NQF+DL N+EF A +K + TF + P KGAV
Sbjct: 64 XKLVINQFADLNNEEFIAPRNIFKGMILCRFLSRKHTFPF--------PYVFLGHKKGAV 115
Query: 149 TPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGG----- 202
TP+K+Q CG CWAF VA+ EGI + +G LI LSEQ+L+DC T G + GC G
Sbjct: 116 TPVKDQGHCGFCWAFYDVASTEGILALTAGKLISLSEQELVDCDTKGVDQGCECGLMDDA 175
Query: 203 --------------------------------------------SREKAFAYIIQNQ--- 215
+ EKA ++ NQ
Sbjct: 176 FKFIIQNHGVXDANYPYKGVDGKCNANEEANPAATITGXEDVPANNEKALQKVVANQPVF 235
Query: 216 ---------------GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTW 255
G+F G C T+L+H VT +G+G + DG YWL+KNS W
Sbjct: 236 VAIDACDSDFQFYKSGVFTGSCETELNHGVTTMGYGVSHDGTQYWLVKNSXETEW 290
>gi|224133760|ref|XP_002321654.1| predicted protein [Populus trichocarpa]
gi|222868650|gb|EEF05781.1| predicted protein [Populus trichocarpa]
Length = 362
Score = 162 bits (409), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 84/206 (40%), Positives = 124/206 (60%), Gaps = 9/206 (4%)
Query: 19 FIIITLLVSCASQVVSSRSTHE------QSVVEIHEKWMAQHGRSYKDELEKEMRLKIFK 72
F+ + L ++ + S HE +S+ +++E+W + H S + EK R +FK
Sbjct: 6 FLFVALSLALVLGITESLDFHEKDLESEESLWDLYERWRSHHTVSTSLD-EKHKRFNVFK 64
Query: 73 ENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPS-HRSTTSSTFKYQNL 131
EN+ ++ K NK G + YKL N+F+D+TN EFR++Y G K+ R TT +
Sbjct: 65 ENVMHVHKTNKMG-KPYKLKLNKFADMTNHEFRSVYAGSKVKHHRMFRGTTRGNGSFMYG 123
Query: 132 SMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDC 191
+ VPTS+DWR KGAVT +K+Q +CG CWAF+ + AVEGI I++ L+ LSEQ+L+DC
Sbjct: 124 KVEKVPTSVDWRKKGAVTAVKDQGQCGSCWAFSTIVAVEGINYIKTNELVSLSEQELVDC 183
Query: 192 STNGNNGCLGGSREKAFAYIIQNQGI 217
T N GC GG E AF +I + +GI
Sbjct: 184 DTTENQGCNGGLMEYAFEFIKKKRGI 209
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 37/71 (52%), Positives = 52/71 (73%), Gaps = 4/71 (5%)
Query: 214 NQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR----DEGL 269
++G+F G CGT+LDH V +VG+GTT DG YW+++NSWG WG+ GY+++ R EGL
Sbjct: 274 SEGVFIGECGTELDHGVAVVGYGTTLDGTKYWIVRNSWGPEWGEKGYIRMQRGISDKEGL 333
Query: 270 CGIGTRSSYPL 280
CGI +SYP+
Sbjct: 334 CGIAMEASYPI 344
>gi|312451845|gb|ADQ85986.1| actinidin [Actinidia chinensis]
Length = 380
Score = 162 bits (409), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 85/206 (41%), Positives = 126/206 (61%), Gaps = 5/206 (2%)
Query: 13 INTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFK 72
++ + +F L++S A + V ++E W+ ++G+SY E E R +IFK
Sbjct: 8 VSMSLLFFSTLLILSLAFNAKNLTQRTNDEVKAMYESWLIKYGKSYNSLGEWERRFEIFK 67
Query: 73 ENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
E L +I++ N + NR+YK+G NQF+DLT++EFR+ Y G+ S S+++ S+ +Y+
Sbjct: 68 ETLRFIDEHNADTNRSYKVGLNQFADLTDEEFRSTYLGFT--SGSNKTKVSN--RYEPRV 123
Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
+P+ +DWR GAV IK+Q ECG CWAF+A+A VEGI KI +G LI LSEQ+L+DC
Sbjct: 124 GQVLPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSEQELIDCG 183
Query: 193 -TNGNNGCLGGSREKAFAYIIQNQGI 217
T GC G F +II N GI
Sbjct: 184 RTQNTRGCNGSYITDGFPFIINNGGI 209
Score = 90.1 bits (222), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 41/68 (60%), Positives = 51/68 (75%), Gaps = 4/68 (5%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD---EGLCGI 272
GIF G CGT +DHAVTIVG+GT E G +YW++KNSW TWG+ GYM+I+R+ G CGI
Sbjct: 276 GIFTGPCGTAIDHAVTIVGYGT-EGGIDYWIVKNSWDTTWGEEGYMRILRNVGGAGTCGI 334
Query: 273 GTRSSYPL 280
T SYP+
Sbjct: 335 ATMPSYPV 342
>gi|168057475|ref|XP_001780740.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162667829|gb|EDQ54449.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 463
Score = 162 bits (409), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 82/208 (39%), Positives = 124/208 (59%), Gaps = 18/208 (8%)
Query: 21 IITLLVSCASQVVSSRST-----------HEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
++ L+++ Q + R+ + +++++ +W+ H R Y+ EK R +
Sbjct: 12 LVLLVIAIGQQADAGRANAIVDYEGNQLHSDDAILDVFHQWLETHSRVYRSLSEKHHRFQ 71
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQ 129
IFKEN YI NK+ ++Y LG N+FSDLT+ EFRA Y G K P +R + F Y+
Sbjct: 72 IFKENFLYIHAHNKQ-QKSYWLGLNKFSDLTHQEFRAQYLGTK---PVNRQRKEANFMYE 127
Query: 130 NLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLL 189
++ + +DWR KGAVT +K+Q CG CWAF+AV +VEG+ I++G L+ LSEQ+L+
Sbjct: 128 DV---EAEPKVDWRLKGAVTDVKDQGACGSCWAFSAVGSVEGVNAIKTGELVSLSEQELV 184
Query: 190 DCSTNGNNGCLGGSREKAFAYIIQNQGI 217
DC N GC GG + AF +II+N GI
Sbjct: 185 DCDRKQNQGCNGGLMDYAFEFIIKNGGI 212
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 34/70 (48%), Positives = 49/70 (70%), Gaps = 5/70 (7%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR-----DEGLC 270
G+F G CG++LDH V VG+GT +DG NYW++KNSWG WG+ GY+++ R +G C
Sbjct: 279 GVFTGPCGSELDHGVLAVGYGTDDDGVNYWIVKNSWGPGWGEKGYIRMERFGSDSTDGKC 338
Query: 271 GIGTRSSYPL 280
GI +S+P+
Sbjct: 339 GINIEASFPI 348
>gi|154550449|gb|ABS83496.1| cysteine protease [Pinus pinaster]
Length = 187
Score = 162 bits (409), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 76/164 (46%), Positives = 116/164 (70%), Gaps = 3/164 (1%)
Query: 32 VVSSRSTHEQ-SVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYK 90
++S++ E +++E++E W+A+H ++Y EK+ R +FK+N YI + N +GNR+YK
Sbjct: 26 IISNKDLREDDAIMELYELWLAEHKKAYNGLDEKQKRFTVFKDNFLYIHEHN-QGNRSYK 84
Query: 91 LGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTP 150
LG N+F+DL+++EF+A Y G K+ + R S + +YQ D+P S+DWR KGAV P
Sbjct: 85 LGLNKFADLSHEEFKATYLGAKLDTKK-RLLRSPSPRYQYSDGEDLPKSIDWRVKGAVAP 143
Query: 151 IKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN 194
+K+Q CG CWAF+ VAAVEGI +I +G+LI LSEQ+L+DC T+
Sbjct: 144 VKDQGSCGSCWAFSTVAAVEGINQIVTGDLISLSEQELVDCDTS 187
>gi|190358935|sp|P00785.4|ACTN_ACTCH RecName: Full=Actinidain; Short=Actinidin; AltName: Allergen=Act c
1; Flags: Precursor
gi|12744965|gb|AAK06862.1|AF343446_1 actinidin protease [Actinidia chinensis]
Length = 380
Score = 161 bits (408), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 85/206 (41%), Positives = 126/206 (61%), Gaps = 5/206 (2%)
Query: 13 INTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFK 72
++ + +F L++S A + V ++E W+ ++G+SY E E R +IFK
Sbjct: 8 VSMSLLFFSTLLILSLAFNAKNLTQRTNDEVKAMYESWLIKYGKSYNSLGEWERRFEIFK 67
Query: 73 ENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
E L +I++ N + NR+YK+G NQF+DLT++EFR+ Y + S S+++ S+ +Y+
Sbjct: 68 ETLRFIDEHNADTNRSYKVGLNQFADLTDEEFRSTYL--RFTSGSNKTKVSN--RYEPRV 123
Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
+P+ +DWR GAV IK+Q ECG CWAF+A+A VEGI KI +G LI LSEQ+L+DC
Sbjct: 124 GQVLPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSEQELIDCG 183
Query: 193 -TNGNNGCLGGSREKAFAYIIQNQGI 217
T GC GG F +II N GI
Sbjct: 184 RTQNTRGCNGGYITDGFQFIINNGGI 209
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 41/68 (60%), Positives = 51/68 (75%), Gaps = 4/68 (5%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD---EGLCGI 272
GIF G CGT +DHAVTIVG+GT E G +YW++KNSW TWG+ GYM+I+R+ G CGI
Sbjct: 276 GIFTGPCGTAVDHAVTIVGYGT-EGGIDYWIVKNSWDTTWGEEGYMRILRNVGGAGTCGI 334
Query: 273 GTRSSYPL 280
T SYP+
Sbjct: 335 ATMPSYPV 342
>gi|18408616|ref|NP_566901.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|75313880|sp|Q9STL5.1|CEP3_ARATH RecName: Full=KDEL-tailed cysteine endopeptidase CEP3; Flags:
Precursor
gi|4678353|emb|CAB41163.1| cysteine endopeptidase precursor-like protein [Arabidopsis
thaliana]
gi|26453052|dbj|BAC43602.1| putative cysteine endopeptidase precursor [Arabidopsis thaliana]
gi|332644885|gb|AEE78406.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 364
Score = 161 bits (408), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 83/183 (45%), Positives = 119/183 (65%), Gaps = 11/183 (6%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
E++V +++E+W H S + E R +F+ N+ ++ + NK+ N+ YKL N+F+D+
Sbjct: 31 EENVWKLYERWRGHHSVS-RASHEAIKRFNVFRHNVLHVHRTNKK-NKPYKLKINRFADI 88
Query: 100 TNDEFRALYTG-----YKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQ 154
T+ EFR+ Y G ++M R S F Y+N+ T VP+S+DWR+KGAVT +KNQ
Sbjct: 89 THHEFRSSYAGSNVKHHRMLRGPKRG--SGGFMYENV--TRVPSSVDWREKGAVTEVKNQ 144
Query: 155 KECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQN 214
++CG CWAF+ VAAVEGI KIR+ L+ LSEQ+L+DC T N GC GG E AF +I N
Sbjct: 145 QDCGSCWAFSTVAAVEGINKIRTNKLVSLSEQELVDCDTEENQGCAGGLMEPAFEFIKNN 204
Query: 215 QGI 217
GI
Sbjct: 205 GGI 207
Score = 88.6 bits (218), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 36/74 (48%), Positives = 53/74 (71%), Gaps = 4/74 (5%)
Query: 210 YIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR---- 265
+ + ++G+F G CGTQL+H V IVG+G T++G YW+++NSWG WG+ GY++I R
Sbjct: 269 FQLYSEGVFIGECGTQLNHGVVIVGYGETKNGTKYWIVRNSWGPEWGEGGYVRIERGISE 328
Query: 266 DEGLCGIGTRSSYP 279
+EG CGI +SYP
Sbjct: 329 NEGRCGIAMEASYP 342
>gi|90399361|emb|CAJ86180.1| H0212B02.7 [Oryza sativa Indica Group]
Length = 470
Score = 161 bits (408), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 81/189 (42%), Positives = 117/189 (61%), Gaps = 6/189 (3%)
Query: 32 VVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN---KEGNRT 88
+VS E+ ++ +W A+HG++Y E+E R F++NL YI++ N G +
Sbjct: 25 IVSYGERSEEEARRLYAEWKAEHGKNYNAVGEEERRYAAFRDNLRYIDEHNAAADAGVHS 84
Query: 89 YKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAV 148
++LG N+F+DLTN+E+R Y G + R + N ++ P S+DWR KGAV
Sbjct: 85 FRLGLNRFADLTNEEYRDTYLGLRNKPRRERKVSDRYLAADNEAL---PESVDWRTKGAV 141
Query: 149 TPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAF 208
IK+Q CG CWAF+A+AAVEGI +I +G+LI LSEQ+L+DC T+ N GC GG + AF
Sbjct: 142 AEIKDQGGCGSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDTSYNEGCNGGLMDYAF 201
Query: 209 AYIIQNQGI 217
+II N GI
Sbjct: 202 DFIINNGGI 210
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 37/76 (48%), Positives = 53/76 (69%), Gaps = 5/76 (6%)
Query: 209 AYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-- 266
A+ + + GIF G CGT LDH V VG+GT E+G +YW+++NSWG +WG++GY+++ R+
Sbjct: 282 AFQLYSSGIFTGKCGTALDHGVAAVGYGT-ENGKDYWIVRNSWGKSWGESGYVRMERNIK 340
Query: 267 --EGLCGIGTRSSYPL 280
G CGI SYPL
Sbjct: 341 ASSGKCGIAVEPSYPL 356
>gi|297845822|ref|XP_002890792.1| hypothetical protein ARALYDRAFT_473117 [Arabidopsis lyrata subsp.
lyrata]
gi|297336634|gb|EFH67051.1| hypothetical protein ARALYDRAFT_473117 [Arabidopsis lyrata subsp.
lyrata]
Length = 322
Score = 161 bits (408), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 83/191 (43%), Positives = 119/191 (62%), Gaps = 16/191 (8%)
Query: 30 SQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTY 89
SQ + +EQS+V+ H++WM Q R Y+DE EKEMRL++FK+NL++IE N GN++Y
Sbjct: 21 SQARPHVTLNEQSIVDYHQQWMTQFSRVYQDESEKEMRLQVFKKNLKFIENFNNMGNQSY 80
Query: 90 KLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPT---SLDWRDKG 146
+G N+F+D T +EF A +TG ++ + + T +N +++D+ S DWRD+G
Sbjct: 81 TVGVNEFTDWTIEEFLATHTGLRVNVTTLSELFNETMPSRNWNISDIDIDDESKDWRDEG 140
Query: 147 AVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREK 206
AV P+K Q C G+TKI NL+ LSEQQL+DC T N GC GG E+
Sbjct: 141 AVIPVKVQGAC-------------GLTKISGKNLLTLSEQQLIDCDTEKNTGCDGGGIEE 187
Query: 207 AFAYIIQNQGI 217
AF YII+N G+
Sbjct: 188 AFKYIIKNGGV 198
Score = 55.1 bits (131), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 43/133 (32%), Positives = 68/133 (51%), Gaps = 24/133 (18%)
Query: 154 QKECGCCWAFAAVAAVEGITKIRSGNLI-QLSEQQLLDCSTNGNNGCLGGSREKAFAYII 212
Q + G C A A A T+IR ++ +E+ LL+ L +R +F
Sbjct: 207 QVKKGSCRANARSATQ---TQIRGFEMVPSHNERALLEAVRRQPVSVLIDARADSFK--T 261
Query: 213 QNQGIFNGV-CGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----E 267
G++ G+ CGT ++HAVT VG+GT +I+ +WG+ GYM+I RD +
Sbjct: 262 YKGGVYAGLDCGTDVNHAVTFVGYGT--------MIQ-----SWGENGYMRIRRDVEWPQ 308
Query: 268 GLCGIGTRSSYPL 280
G+CGI ++YP+
Sbjct: 309 GMCGIAQVAAYPI 321
>gi|218195711|gb|EEC78138.1| hypothetical protein OsI_17694 [Oryza sativa Indica Group]
Length = 458
Score = 161 bits (407), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 81/189 (42%), Positives = 117/189 (61%), Gaps = 6/189 (3%)
Query: 32 VVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN---KEGNRT 88
+VS E+ ++ +W A+HG++Y E+E R F++NL YI++ N G +
Sbjct: 25 IVSYGERSEEEARRLYAEWKAEHGKNYNAVGEEERRYAAFRDNLRYIDEHNAAADAGVHS 84
Query: 89 YKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAV 148
++LG N+F+DLTN+E+R Y G + R + N ++ P S+DWR KGAV
Sbjct: 85 FRLGLNRFADLTNEEYRDTYLGLRNKPRRERKVSDRYLAADNEAL---PESVDWRTKGAV 141
Query: 149 TPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAF 208
IK+Q CG CWAF+A+AAVEGI +I +G+LI LSEQ+L+DC T+ N GC GG + AF
Sbjct: 142 AEIKDQGGCGSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDTSYNEGCNGGLMDYAF 201
Query: 209 AYIIQNQGI 217
+II N GI
Sbjct: 202 DFIINNGGI 210
Score = 85.1 bits (209), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 37/76 (48%), Positives = 53/76 (69%), Gaps = 5/76 (6%)
Query: 209 AYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-- 266
A+ + + GIF G CGT LDH V VG+GT E+G +YW+++NSWG +WG++GY+++ R+
Sbjct: 270 AFQLYSSGIFTGKCGTALDHGVAAVGYGT-ENGKDYWIVRNSWGKSWGESGYVRMERNIK 328
Query: 267 --EGLCGIGTRSSYPL 280
G CGI SYPL
Sbjct: 329 ASSGKCGIAVEPSYPL 344
>gi|224081756|ref|XP_002306486.1| predicted protein [Populus trichocarpa]
gi|222855935|gb|EEE93482.1| predicted protein [Populus trichocarpa]
Length = 352
Score = 161 bits (407), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 78/175 (44%), Positives = 116/175 (66%), Gaps = 2/175 (1%)
Query: 44 VEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDE 103
+ +++ W+A+HG++Y E+ R +IFK NL +I++ N + N TYK+G +F+DLTN+E
Sbjct: 1 MSMYKWWLAKHGKAYNGLGEEAERFEIFKNNLRFIDEHNSQ-NHTYKVGLTKFADLTNEE 59
Query: 104 FRALYTGYKMPSPSH-RSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWA 162
+RA++ G + + + S + +Y + +P S+DWR KGAV PIK+Q CG CWA
Sbjct: 60 YRAMFLGTRSDAKRRLMKSKSPSERYAFKAGDKLPESVDWRAKGAVNPIKDQGSCGSCWA 119
Query: 163 FAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
F+ VAAVEGI +I +G LI LSEQ+L+DC N GC GG + AF +II N G+
Sbjct: 120 FSTVAAVEGINQIVTGELISLSEQELVDCDRTYNAGCNGGLMDYAFQFIINNGGL 174
Score = 80.9 bits (198), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 35/70 (50%), Positives = 48/70 (68%), Gaps = 6/70 (8%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-----EGLC 270
G+F G CGT LDH V +VG+ +E+G +YWL++NSWG WG+ GY+K+ R+ G C
Sbjct: 241 GVFTGECGTALDHGVVVVGY-ASENGLDYWLVRNSWGTEWGEHGYIKMQRNVGDTYTGRC 299
Query: 271 GIGTRSSYPL 280
GI SSYP+
Sbjct: 300 GIAMESSYPV 309
>gi|24653516|ref|NP_725347.1| cysteine proteinase-1, isoform A [Drosophila melanogaster]
gi|24653518|ref|NP_725348.1| cysteine proteinase-1, isoform B [Drosophila melanogaster]
gi|1658527|gb|AAB18345.1| cysteine proteinase 1 [Drosophila melanogaster]
gi|2305221|gb|AAB65749.1| cysteine proteinase-1 [Drosophila melanogaster]
gi|7303249|gb|AAF58311.1| cysteine proteinase-1, isoform A [Drosophila melanogaster]
gi|21627210|gb|AAM68566.1| cysteine proteinase-1, isoform B [Drosophila melanogaster]
gi|54650754|gb|AAV36956.1| LP06554p [Drosophila melanogaster]
gi|220951982|gb|ACL88534.1| Cp1-PA [synthetic construct]
Length = 341
Score = 161 bits (407), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 104/317 (32%), Positives = 153/317 (48%), Gaps = 83/317 (26%)
Query: 46 IHEKWMA---QHGRSYKDELEKEMRLKIFKENLEYIEKANK---EGNRTYKLGTNQFSDL 99
+ E+W +H ++Y+DE E+ RLKIF EN I K N+ EG ++KL N+++DL
Sbjct: 25 VMEEWHTFKLEHRKNYQDETEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADL 84
Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFK---YQNLSMTDVPTSLDWRDKGAVTPIKNQKE 156
+ EFR L G+ +FK + + + +P S+DWR KGAVT +K+Q
Sbjct: 85 LHHEFRQLMNGFNYTLHKQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQGH 144
Query: 157 CGCCWAFAAVAAVEGITKIRSG--------NLIQLSEQ---------------------- 186
CG CWAF++ A+EG +SG NL+ S +
Sbjct: 145 CGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNG 204
Query: 187 ----------QLLDCSTNGNNGCLGGSREKAFAYIIQ----------------------- 213
+ +D S + N G +G + ++ F I Q
Sbjct: 205 GIDTEKSYPYEAIDDSCHFNKGTVGAT-DRGFTDIPQGDEKKMAEAVATVGPVSVAIDAS 263
Query: 214 -------NQGIFN-GVCGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIV 264
++G++N C Q LDH V +VGFGT E G +YWL+KNSWG TWGD G++K++
Sbjct: 264 HESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESGEDYWLVKNSWGTTWGDKGFIKML 323
Query: 265 RD-EGLCGIGTRSSYPL 280
R+ E CGI + SSYPL
Sbjct: 324 RNKENQCGIASASSYPL 340
>gi|297792329|ref|XP_002864049.1| hypothetical protein ARALYDRAFT_495086 [Arabidopsis lyrata subsp.
lyrata]
gi|297309884|gb|EFH40308.1| hypothetical protein ARALYDRAFT_495086 [Arabidopsis lyrata subsp.
lyrata]
Length = 361
Score = 160 bits (406), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 88/212 (41%), Positives = 126/212 (59%), Gaps = 21/212 (9%)
Query: 19 FIIITLLVSCASQVVSSRSTHEQSVV------EIHEKWMAQH--GRSYKDELEKEMRLKI 70
FI++ L + + S HE+ V E++E+W + H RS + EK R +
Sbjct: 4 FIVLALCMLMVLETTKSLDFHEKDVESEDSLWELYERWKSHHTIARSLE---EKAKRFNV 60
Query: 71 FKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHR-----STTSST 125
FK N+++I + NK+ N +YKL N+F D+T++EFR Y G + HR T+ +
Sbjct: 61 FKHNVKHIHETNKKEN-SYKLKLNKFGDMTSEEFRRTYAGSNIKH--HRMFQGERQTTKS 117
Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
F Y N+ +PTS+DWR GAVTP+KNQ +CG CWAF+ V AVEGI +IR+ L LSE
Sbjct: 118 FMYANVDT--LPTSVDWRKNGAVTPVKNQGQCGSCWAFSTVVAVEGINQIRTKKLTSLSE 175
Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
Q+L+DC TN N GC GG + AF +I + G+
Sbjct: 176 QELVDCDTNKNQGCNGGLMDLAFEFIKEKGGL 207
Score = 94.4 bits (233), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 54/151 (35%), Positives = 82/151 (54%), Gaps = 23/151 (15%)
Query: 143 RDKGAVT-----PIKNQKE-CGCCWAFAAVAAVEG---ITKIRSGNLIQLSEQQLLDCST 193
++KG +T P K E C A V +++G + K +L++ Q + +
Sbjct: 202 KEKGGLTSELVYPYKASDETCDTNKENAPVVSIDGHEDVPKNSEVDLMKAVAHQPVSVAI 261
Query: 194 NGNNGCLGGSREKAFAYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGN 253
+ GGS + ++G+F G CGT+L+H V +VG+GTT DG YW++KNSWG
Sbjct: 262 DA-----GGS-----DFQFYSEGVFTGRCGTELNHGVAVVGYGTTIDGTKYWIVKNSWGE 311
Query: 254 TWGDAGYMKIVR----DEGLCGIGTRSSYPL 280
WG+ GY+++ R EGLCGI +SYPL
Sbjct: 312 EWGEKGYIRMQRGIRHKEGLCGIAMEASYPL 342
>gi|218202389|gb|EEC84816.1| hypothetical protein OsI_31898 [Oryza sativa Indica Group]
Length = 350
Score = 160 bits (406), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 87/213 (40%), Positives = 127/213 (59%), Gaps = 17/213 (7%)
Query: 38 THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
T +++ E+WM +HGR+Y D EK+ R ++++ N+E +E N N YKL N+F+
Sbjct: 23 TRADLMLDRFEQWMIRHGRAYTDSGEKQRRFEVYRRNVELVETFNSMSN-GYKLADNKFA 81
Query: 98 DLTNDEFRALYTGYK--MPSPSHRSTTSSTFKYQNLSMTDV-PTSLDWRDKGAVTPIKNQ 154
DLTN+EFRA G++ + P +T S+ S D+ P S+DWR KGAV +KNQ
Sbjct: 82 DLTNEEFRAKMLGFRPHVTIPQISNTCSADIAMPGESSDDILPKSVDWRKKGAVVEVKNQ 141
Query: 155 KECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQN 214
+CG CWAF+AVAA+EGI +I++G L+ LSEQ+L+DC GC GG AF +++ N
Sbjct: 142 GDCGSCWAFSAVAAIEGINQIKNGELVSLSEQELVDCDDEA-VGCGGGYMSWAFEFVVGN 200
Query: 215 QGI----------FNGVC-GTQLDH-AVTIVGF 235
G+ NG C +L+ AV I G+
Sbjct: 201 HGLTTEASYPYHAANGACQAAKLNQSAVAIAGY 233
Score = 76.6 bits (187), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 40/131 (30%), Positives = 64/131 (48%), Gaps = 25/131 (19%)
Query: 165 AVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIFNGVCGT 224
A+A +T +L + + Q + + +G +F + + G++ G C
Sbjct: 229 AIAGYRNVTPSSEPDLARAAAAQPVSVAVDGG----------SFMFQLYGSGVYTGPCTA 278
Query: 225 QLDHAVTIVGFGTTEDGAN----------YWLIKNSWGNTWGDAGYMKIVRD-----EGL 269
++H VT+VG+G +E + YW++KNSWG WGDAGY+ + RD GL
Sbjct: 279 DVNHGVTVVGYGESEPKTDGGGAAKGGEKYWIVKNSWGAEWGDAGYILMQRDVAGLASGL 338
Query: 270 CGIGTRSSYPL 280
CGI SYP+
Sbjct: 339 CGIALLPSYPV 349
>gi|168006315|ref|XP_001755855.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162693174|gb|EDQ79528.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 454
Score = 160 bits (406), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 87/209 (41%), Positives = 125/209 (59%), Gaps = 17/209 (8%)
Query: 39 HEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSD 98
+E+ + E W +HG+ Y E R ++K+NLEYI++ + E NR+Y LG +F+D
Sbjct: 38 NERLLSEQFGAWAHKHGKVYSSLEEHAHRYMVWKDNLEYIQR-HSEKNRSYWLGLTKFAD 96
Query: 99 LTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECG 158
+TNDEFR YTG ++ S RS + F+Y + ++ P S+DWR KGAVT +K+Q CG
Sbjct: 97 ITNDEFRRQYTGTRIDR-SKRSKRKTGFRYAD---SEAPESVDWRKKGAVTTVKDQGSCG 152
Query: 159 CCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI- 217
CWAF+A+ +VEGI IR+G + LSEQ+L+DC N GC GG + AF +I++N GI
Sbjct: 153 SCWAFSAIGSVEGINAIRTGEAVSLSEQELVDCDLEYNQGCNGGLMDYAFDFILENGGID 212
Query: 218 ---------FNGVCGTQLD--HAVTIVGF 235
+G C H VTI G+
Sbjct: 213 TENDYPYKGLDGRCDNNKKNAHVVTIDGY 241
Score = 71.6 bits (174), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 33/78 (42%), Positives = 49/78 (62%), Gaps = 8/78 (10%)
Query: 210 YIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD--- 266
+ + + G+F G CGT LDH V VG+G+ E +YW++KNSWG WG++GY+++ R+
Sbjct: 272 FQLYSGGVFTGECGTDLDHGVLAVGYGS-EGSLDYWIVKNSWGEYWGESGYLRMQRNIKD 330
Query: 267 ----EGLCGIGTRSSYPL 280
GLCGI SY +
Sbjct: 331 SNHQFGLCGINIEPSYAV 348
>gi|357167707|ref|XP_003581294.1| PREDICTED: actinidain-like [Brachypodium distachyon]
Length = 358
Score = 160 bits (406), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 84/179 (46%), Positives = 108/179 (60%), Gaps = 13/179 (7%)
Query: 47 HEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRA 106
HE+WMA+ GRSY D EK R ++F N +++ N+ GNRTY LG NQFSDLT+ EF
Sbjct: 42 HERWMARFGRSYTDAGEKARRQEVFGANARHVDAVNRAGNRTYTLGLNQFSDLTDHEFLQ 101
Query: 107 LYTGYK-------MPSPSHRSTTSST-FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECG 158
+ GY + P +T Y D+P S+DWR KGAVT IKNQ+ CG
Sbjct: 102 QHLGYGRHHGQRGLLLPEEEVMPKATALGYGQ----DMPYSVDWRAKGAVTEIKNQRSCG 157
Query: 159 CCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
CWAFAAVAA EG+ KI +GNLI +SEQQ+LDC T + C G A Y++ + G+
Sbjct: 158 SCWAFAAVAATEGLVKIATGNLISMSEQQVLDC-TGDRSSCDSGYISDALRYVVTSGGL 215
Score = 73.9 bits (180), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 38/101 (37%), Positives = 57/101 (56%), Gaps = 15/101 (14%)
Query: 194 NGNNGCLGG-SREKAFAYIIQ---------NQGIFNGV--CGTQLDHAVTIVGFGTTEDG 241
NG+ G L G + + A I++ + G++ G CG +L+HA+T+VG+GT
Sbjct: 253 NGDEGALQGLAARQPVAVIVEASEPDFRHYSSGVYAGSASCGRELNHALTVVGYGTENGA 312
Query: 242 ANYWLIKNSWGNTWGDAGYMKIVRDEGL---CGIGTRSSYP 279
YWL+KN WG WG+ GYM++ R G CGI + + YP
Sbjct: 313 GEYWLVKNQWGTWWGENGYMRVARRNGAGANCGIASVAFYP 353
>gi|356545112|ref|XP_003540989.1| PREDICTED: LOW QUALITY PROTEIN: KDEL-tailed cysteine endopeptidase
CEP1-like [Glycine max]
Length = 400
Score = 160 bits (406), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 82/195 (42%), Positives = 125/195 (64%), Gaps = 3/195 (1%)
Query: 26 VSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEG 85
V+C Q S+S E E HEKWMAQ+G+ Y+D E E R +IFK N+++IE N G
Sbjct: 95 VTCGRQC-RSKSRLEACTSERHEKWMAQYGKVYEDAAEMEKRFQIFKNNVQFIESFNVAG 153
Query: 86 NRTYKLGTNQFSDLTNDEFRALY-TGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRD 144
++ + + NQF DL ++EF+AL G + S +T ++F+Y ++ +T++P ++D R
Sbjct: 154 DKPFNIRINQFPDLHDEEFKALLINGQRKVSGVETATEETSFRYGSV-VTNIPATMDGRK 212
Query: 145 KGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSR 204
KG VTPIK+Q G CWA +AVAA+EGI +I + L+ LS+Q+L+D + GC+GG
Sbjct: 213 KGVVTPIKDQGIIGSCWALSAVAAIEGIHQITTSKLMFLSKQKLVDSVKGESEGCIGGYV 272
Query: 205 EKAFAYIIQNQGIFN 219
E AF +I++ GI +
Sbjct: 273 EDAFEFIVKKGGILS 287
Score = 60.5 bits (145), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 39/113 (34%), Positives = 51/113 (45%), Gaps = 8/113 (7%)
Query: 150 PIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFA 209
P K C +VA ++G K+ S N ++ LL N AF
Sbjct: 292 PYKGVNXCKVEKETHSVAHIKGYEKVPSNN-----KKALLKVVANQPVSVYIDVGAHAFK 346
Query: 210 YIIQNQGIFNGV-CGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYM 261
Y + IFN CG+ +H V +VG+G DGA YW +KNSWG WG YM
Sbjct: 347 YY--SSEIFNARNCGSDPNHVVAVVGYGKALDGAKYWPVKNSWGTEWGGKWYM 397
>gi|388519351|gb|AFK47737.1| unknown [Medicago truncatula]
Length = 359
Score = 160 bits (406), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 85/210 (40%), Positives = 129/210 (61%), Gaps = 8/210 (3%)
Query: 10 SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
S I + F +ITL ++ + S RS E V+ ++E+W+ +H + Y EK+ R +
Sbjct: 3 SITITSLLFFSLITLSLAMDT---SMRSNEE--VMTMYEEWLVKHHKVYNGLGEKDQRFE 57
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSH--RSTTSSTFK 127
IFK+NL +I++ N + N TYK+G N+F+D TN+E+R +Y G K + + + ++ +
Sbjct: 58 IFKDNLGFIDEHNAQ-NYTYKVGLNKFADTTNEEYRNMYLGTKNDAKRNVMKIKITTGHR 116
Query: 128 YQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQ 187
Y S +P +DWR KGAV IK+Q CG CWAF+ +A VE I KI +G L+ LSEQ+
Sbjct: 117 YAFNSGDRLPVHVDWRSKGAVAHIKDQGSCGSCWAFSTIATVEAINKIVTGKLVSLSEQE 176
Query: 188 LLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
L+DC N GC GG + AF +I++N GI
Sbjct: 177 LVDCDRAFNEGCNGGLMDYAFEFIVENGGI 206
Score = 81.6 bits (200), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 35/70 (50%), Positives = 48/70 (68%), Gaps = 6/70 (8%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR-----DEGLC 270
G+F G CGT LDH V +VG+G E+G +YWL++NSWG WG+ GY K+ R + G C
Sbjct: 273 GVFTGRCGTNLDHGVVVVGYGF-ENGVDYWLVRNSWGTNWGEDGYFKLERNVKKINTGKC 331
Query: 271 GIGTRSSYPL 280
GI ++SYP+
Sbjct: 332 GIAMQASYPV 341
>gi|58531896|gb|AAW78660.1| cysteine protease [Nicotiana tabacum]
Length = 361
Score = 160 bits (406), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 82/178 (46%), Positives = 115/178 (64%), Gaps = 11/178 (6%)
Query: 45 EIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEF 104
E++E+W + H S + EK+ R +FK N+ Y+ NK+ ++ YKL N+F+D+TN EF
Sbjct: 36 ELYERWRSHHTVSRSLD-EKDKRFNVFKANVHYVHNFNKK-DKPYKLKLNKFADMTNHEF 93
Query: 105 RALYTGYKMPSPSHRS-----TTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGC 159
R Y G K+ HRS + TF Y N+ DVP S+DWR KGAVTP+K+Q +CG
Sbjct: 94 RHHYAGSKIKH--HRSFLGASRANGTFMYANVE--DVPPSVDWRKKGAVTPVKDQGKCGS 149
Query: 160 CWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
CWAF+ V AVEGI +I++ L+ LSEQ+L+DC T+ N GC GG + AF +I + GI
Sbjct: 150 CWAFSTVVAVEGINQIKTNELVSLSEQELVDCDTSQNQGCNGGLMDMAFEFIKKKGGI 207
Score = 94.4 bits (233), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 38/71 (53%), Positives = 53/71 (74%), Gaps = 4/71 (5%)
Query: 214 NQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR----DEGL 269
++G+F G CGT+LDH V IVG+GTT DG YW+++NSWG WG+ GY+++ R +EGL
Sbjct: 272 SEGVFTGDCGTELDHGVAIVGYGTTLDGTKYWIVRNSWGPEWGEKGYIRMQREIDAEEGL 331
Query: 270 CGIGTRSSYPL 280
CGI + SYP+
Sbjct: 332 CGIAMQPSYPI 342
>gi|113120265|gb|ABI30272.1| VXH-A, partial [Vasconcellea x heilbornii]
Length = 318
Score = 160 bits (405), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 85/184 (46%), Positives = 117/184 (63%), Gaps = 12/184 (6%)
Query: 38 THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
T + ++ + + WM ++ + YKD EK R +IFK+NL+YI++ NK+ N TY LG F+
Sbjct: 39 TSTEKLINLFDSWMVEYDKVYKDIDEKIYRFEIFKDNLKYIDETNKK-NNTYWLGLTSFT 97
Query: 98 DLTNDEFRALYTGYKMPSPSHRSTTSST----FKYQNLSMTDVPTSLDWRDKGAVTPIKN 153
DLTNDEF+ Y G P + STT + F Y ++ ++P S+DWR KGAVTP++N
Sbjct: 98 DLTNDEFKEKYVG---SIPENWSTTEESNDKEFIYDDV--VNIPASIDWRQKGAVTPVRN 152
Query: 154 QKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQ 213
Q CG CW F++VAAVEGI KI +G L+ LSEQ+LLDC + GC GG A Y +
Sbjct: 153 QGSCGSCWTFSSVAAVEGINKIVTGQLVSLSEQELLDCERR-SYGCRGGFPPYALQY-VA 210
Query: 214 NQGI 217
N GI
Sbjct: 211 NSGI 214
Score = 47.4 bits (111), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 34/101 (33%), Positives = 48/101 (47%), Gaps = 16/101 (15%)
Query: 154 QKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQ 213
Q++C A +G+ +++ N EQ L+ + ++ +AF Q
Sbjct: 226 QRQCRAAQAKGPKVKTDGVGRVQRNN-----EQALIQRIAIQPVSIVVEAKGRAF----Q 276
Query: 214 NQ--GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWG 252
N GIF G CGT +DHAV VG+G Y LIKNSWG
Sbjct: 277 NYRGGIFAGPCGTSIDHAVAAVGYGN-----GYILIKNSWG 312
>gi|387015020|gb|AFJ49629.1| Cathepsin H [Crotalus adamanteus]
Length = 337
Score = 160 bits (405), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 99/309 (32%), Positives = 142/309 (45%), Gaps = 79/309 (25%)
Query: 45 EIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEF 104
++ + W +QH R+Y+ E E RL+IF +N + I+K N GN ++++G NQFSD+T EF
Sbjct: 34 QLFKAWASQHRRAYRSEEEFRHRLQIFLDNKQKIDKHNA-GNSSFRMGLNQFSDMTFTEF 92
Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGA-VTPIKNQKECGCCWAF 163
R Y P + S T F S P ++DWR KG V+P+KNQ CG CW F
Sbjct: 93 RKKYL---WQEPQNCSATMGNFP---RSAGPCPKAIDWRKKGKFVSPVKNQGSCGSCWTF 146
Query: 164 AAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNN-GCLGGSREKAFAYIIQNQGIF---- 218
+ +E I++G L+ L+EQQL+DC+ N NN GC GG +AF YI+ N+G+
Sbjct: 147 STTGCLESAIAIKTGKLLNLAEQQLIDCAQNFNNFGCSGGLPSQAFEYILYNKGLMDEEA 206
Query: 219 ------NGVCGTQLDHAVTI--------------------------VGFGTTEDGANY-- 244
NG C Q AV + F ED +Y
Sbjct: 207 YPYRAQNGTCKFQPQKAVAFIKDVVNISLYDEQGLVQAVGTYNPVSIAFEVREDFVHYQE 266
Query: 245 --------------------------------WLIKNSWGNTWGDAGYMKIVRDEGLCGI 272
W++KNSWG +WG GY I R + +CG+
Sbjct: 267 GVYTSTDCDKTPDKVNHAVLAVGYGEEGGVPFWIVKNSWGTSWGLDGYFNIERGKNMCGL 326
Query: 273 GTRSSYPLA 281
+S+P+
Sbjct: 327 ADCASFPVP 335
>gi|195153545|ref|XP_002017686.1| GL17172 [Drosophila persimilis]
gi|194113482|gb|EDW35525.1| GL17172 [Drosophila persimilis]
Length = 341
Score = 160 bits (405), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 108/347 (31%), Positives = 161/347 (46%), Gaps = 91/347 (26%)
Query: 16 TPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMA---QHGRSYKDELEKEMRLKIFK 72
T + + + LV+ A Q VS I E+W +H ++Y+DE E+ RLKIF
Sbjct: 3 TALILPLLALVAVA-QAVSYAEV-------IQEEWHTFKLEHRKNYQDETEERFRLKIFN 54
Query: 73 ENLEYIEKANK---EGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFK-- 127
EN I K N+ G ++K+ N+++D+ + EF + G+ +FK
Sbjct: 55 ENKHKIAKHNQLWATGAVSFKMAVNKYADMLHHEFYSTMNGFNYTLHKQLRNADESFKGV 114
Query: 128 -YQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQ 186
+ + +P +DWR KGAVT +K+Q CG CWAF++ A+EG +SG L+ LSEQ
Sbjct: 115 TFISPEHVTLPKQVDWRTKGAVTDVKDQGHCGSCWAFSSTGALEGQHYRKSGVLVSLSEQ 174
Query: 187 ----------------------------------------QLLDCSTNGNNGCLGGSREK 206
+ +D S + N G +G + ++
Sbjct: 175 NLVDCSTKYGNNGCNGGLMDNAFRYIKDNGGIDTEKSYPYEAIDDSCHFNKGSIGAT-DR 233
Query: 207 AFAYIIQ------------------------------NQGIFN-GVCGTQ-LDHAVTIVG 234
F I Q ++G++N C Q LDH V +VG
Sbjct: 234 GFVDIPQGNEKKMAEAVATIGPVAVAIDASHESFQFYSEGVYNEPACDAQNLDHGVLVVG 293
Query: 235 FGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 280
FGT E G +YWL+KNSWG TWGD G++K++R+ E CGI + SSYPL
Sbjct: 294 FGTDESGEDYWLVKNSWGTTWGDKGFIKMLRNKENQCGIASASSYPL 340
>gi|125811033|ref|XP_001361727.1| GA25021 [Drosophila pseudoobscura pseudoobscura]
gi|54636904|gb|EAL26307.1| GA25021 [Drosophila pseudoobscura pseudoobscura]
Length = 341
Score = 160 bits (405), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 108/347 (31%), Positives = 161/347 (46%), Gaps = 91/347 (26%)
Query: 16 TPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMA---QHGRSYKDELEKEMRLKIFK 72
T + + + LV+ A Q VS I E+W +H ++Y+DE E+ RLKIF
Sbjct: 3 TALILPLLALVAVA-QAVSYAEV-------IQEEWHTFKLEHRKNYQDETEERFRLKIFN 54
Query: 73 ENLEYIEKANK---EGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFK-- 127
EN I K N+ G ++K+ N+++D+ + EF + G+ +FK
Sbjct: 55 ENKHKIAKHNQLWATGAVSFKMAVNKYADMLHHEFYSTMNGFNYTLHKQLRNADESFKGV 114
Query: 128 -YQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQ 186
+ + +P +DWR KGAVT +K+Q CG CWAF++ A+EG +SG L+ LSEQ
Sbjct: 115 TFISPEHVTLPKQVDWRTKGAVTDVKDQGHCGSCWAFSSTGALEGQHYRKSGVLVSLSEQ 174
Query: 187 ----------------------------------------QLLDCSTNGNNGCLGGSREK 206
+ +D S + N G +G + ++
Sbjct: 175 NLVDCSTKYGNNGCNGGLMDNAFRYIKDNGGIDTEKSYPYEAIDDSCHFNKGTIGAT-DR 233
Query: 207 AFAYIIQ------------------------------NQGIFN-GVCGTQ-LDHAVTIVG 234
F I Q ++G++N C Q LDH V +VG
Sbjct: 234 GFVDIPQGNEKKMAEAVATIGPVAVAIDASHESFQFYSEGVYNEPACDAQNLDHGVLVVG 293
Query: 235 FGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 280
FGT E G +YWL+KNSWG TWGD G++K++R+ E CGI + SSYPL
Sbjct: 294 FGTDESGQDYWLVKNSWGTTWGDKGFIKMLRNKENQCGIASASSYPL 340
>gi|595986|gb|AAA79915.1| cysteine proteinase, partial [Dianthus caryophyllus]
Length = 427
Score = 160 bits (405), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 80/174 (45%), Positives = 117/174 (67%), Gaps = 7/174 (4%)
Query: 48 EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEK----ANKEGNRTYKLGTNQFSDLTNDE 103
+ W+ +H ++Y EKE R IF++NLE+I++ N G ++LG N+F+DLTNDE
Sbjct: 6 QSWLVKHRKNYNALGEKEKRFAIFRDNLEFIDQHNNNNNGGGGGEFELGLNKFADLTNDE 65
Query: 104 FRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAF 163
FR +Y G K P ++ + + +Y ++P S+DWR KGAV+ +K+Q +CG CWAF
Sbjct: 66 FRRIYFGVKRP---EKAESVKSDRYAVKEGDELPESVDWRKKGAVSHVKDQGQCGSCWAF 122
Query: 164 AAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
+A+ AVEGI KI +G+LI LSEQ+L+DC T+ N+GC GG + AF +II N GI
Sbjct: 123 SAIGAVEGINKIVTGDLITLSEQELVDCDTSYNSGCDGGLMDYAFRFIINNGGI 176
Score = 87.8 bits (216), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 35/75 (46%), Positives = 51/75 (68%), Gaps = 4/75 (5%)
Query: 210 YIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD--- 266
+ + G+F G CGT LDH V VG+GTT+DG +YW+++NSWG+ WG+ GY+++ R+
Sbjct: 237 FQLYKSGVFTGSCGTSLDHGVVAVGYGTTDDGKDYWIVRNSWGDDWGEDGYIRMERNTES 296
Query: 267 -EGLCGIGTRSSYPL 280
G CGI SYP+
Sbjct: 297 KSGKCGIAIEPSYPV 311
>gi|115479933|ref|NP_001063560.1| Os09g0497500 [Oryza sativa Japonica Group]
gi|113631793|dbj|BAF25474.1| Os09g0497500 [Oryza sativa Japonica Group]
gi|215704298|dbj|BAG93138.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 349
Score = 160 bits (405), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 86/208 (41%), Positives = 126/208 (60%), Gaps = 17/208 (8%)
Query: 43 VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTND 102
+++ E+WM +HGR+Y D EK+ R ++++ N+E +E N N YKL N+F+DLTN+
Sbjct: 27 MLDRFEQWMIRHGRAYTDAGEKQRRFEVYRRNVELVETFNSMSN-GYKLADNKFADLTNE 85
Query: 103 EFRALYTGYK--MPSPSHRSTTSSTFKYQNLSMTDV-PTSLDWRDKGAVTPIKNQKECGC 159
EFRA G++ + P +T S+ S D+ P S+DWR KGAV +KNQ +CG
Sbjct: 86 EFRAKMLGFRPHVTIPQISNTCSADIAMPGESSDDILPKSVDWRKKGAVVEVKNQGDCGS 145
Query: 160 CWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI-- 217
CWAF+AVAA+EGI +I++G L+ LSEQ+L+DC GC GG AF +++ N G+
Sbjct: 146 CWAFSAVAAIEGINQIKNGELVSLSEQELVDCDDEA-VGCGGGYMSWAFEFVVGNHGLTT 204
Query: 218 --------FNGVC-GTQLDH-AVTIVGF 235
NG C +L+ AV I G+
Sbjct: 205 EASYPYHAANGACQAAKLNQSAVAIAGY 232
Score = 76.6 bits (187), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 40/131 (30%), Positives = 64/131 (48%), Gaps = 25/131 (19%)
Query: 165 AVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIFNGVCGT 224
A+A +T +L + + Q + + +G + F + + G++ G C
Sbjct: 228 AIAGYRNVTPSSEPDLARAAAAQPVSVAVDGGS----------FMFQLYGSGVYTGPCTA 277
Query: 225 QLDHAVTIVGFGTTEDGAN----------YWLIKNSWGNTWGDAGYMKIVRD-----EGL 269
++H VT+VG+G +E + YW++KNSWG WGDAGY+ + RD GL
Sbjct: 278 DVNHGVTVVGYGESEPKTDGGGAAKGGEKYWIVKNSWGAEWGDAGYILMQRDVAGLASGL 337
Query: 270 CGIGTRSSYPL 280
CGI SYP+
Sbjct: 338 CGIALLPSYPV 348
>gi|444514070|gb|ELV10520.1| Cathepsin L1 [Tupaia chinensis]
Length = 450
Score = 160 bits (404), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 106/337 (31%), Positives = 154/337 (45%), Gaps = 81/337 (24%)
Query: 14 NTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKE 73
N P + L + AS +S +Q++ W + H R Y E+ R ++++
Sbjct: 124 NMNPSLFLAALCLGIASATPNS----DQNLDTSWHHWKSTHRRLYGKN-EEGWRRAVWEK 178
Query: 74 NLEYIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQN 130
N++ IE N E G + +G N F D+TN+EFR + G++ + + + F
Sbjct: 179 NMKMIEMHNHEYSNGKHGFTMGMNAFGDMTNEEFRQVMNGFR----NQKQKSGKVFHAPL 234
Query: 131 LSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLD 190
L P S+DWR+KG VTP+KNQ +CG CWAF+A A+EG ++G LI LSEQ L+D
Sbjct: 235 L--LQAPKSVDWREKGFVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLISLSEQNLVD 292
Query: 191 CS-TNGNNGCLGGSREKAFAYIIQNQGI----------FNGVCGTQLDHAVT-------- 231
CS GN GC GG + AF YI N G+ +G C + + AV
Sbjct: 293 CSRRQGNLGCQGGLMDNAFQYIKDNGGLDSEESYPYKGMDGTCQYKAEWAVANDTGFEKA 352
Query: 232 ---------------------------------------------IVGFGTTEDGAN--Y 244
+VG+G + +N Y
Sbjct: 353 LMKAVASVGPISVAIDAGHASFQFYKDGIYYEPDCSSENLDHGVLVVGYGVEKRNSNDKY 412
Query: 245 WLIKNSWGNTWGDAGYMKIVRDE-GLCGIGTRSSYPL 280
WLIKNSWG WG GY+KI +D CG+ + +SYP+
Sbjct: 413 WLIKNSWGEQWGANGYVKIAKDRNNHCGVASAASYPV 449
>gi|356543010|ref|XP_003539956.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 306
Score = 160 bits (404), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 91/200 (45%), Positives = 120/200 (60%), Gaps = 21/200 (10%)
Query: 48 EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRAL 107
E+W+ Q+ R YKD+ E E+R I++ NLEYIE N + +Y L N+F+DLTN+EF +
Sbjct: 6 ERWLKQNDRXYKDKEEWEVRFGIYQANLEYIECKNSQ-EXSYNLTDNKFADLTNEEFVSP 64
Query: 108 YTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVA 167
Y G+ R + F Y D+P S DWR +GAV+ IK+Q CG CWAF+AVA
Sbjct: 65 YLGFGT-----RFLPHTGFMYH--EHEDLPESKDWRKEGAVSDIKDQGNCGSCWAFSAVA 117
Query: 168 AVEGITKIRSGNLIQLSEQQLLDCST-NGNNGCLGGSREKAFAYIIQNQGI-------FN 219
AVEGI KI+SG L+ LSEQ+ DC +GN GC GG + AFA+I +N G+ +
Sbjct: 118 AVEGINKIKSGKLVSLSEQEFRDCDVEDGNQGCEGGLMDTAFAFIKKNGGLTTSKDYPYE 177
Query: 220 GVCGT-----QLDHAVTIVG 234
GV GT L HA I G
Sbjct: 178 GVDGTCNKEKALHHAANISG 197
Score = 82.8 bits (203), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 38/77 (49%), Positives = 55/77 (71%), Gaps = 7/77 (9%)
Query: 209 AYIIQNQGIFNGVCGTQLDHAVTIVGFGT-TEDGANYWLIKNSWGNTWGDAGYMKIVRD- 266
A+ + +G+F+G+CG QL+H VTIVG+G T D YW++KNSWG WG++GY+++ RD
Sbjct: 230 AFQLYLKGVFSGICGKQLNHGVTIVGYGKGTSD--KYWIVKNSWGADWGESGYIRMKRDA 287
Query: 267 ---EGLCGIGTRSSYPL 280
G CGI ++SYPL
Sbjct: 288 FDKAGTCGIAMQASYPL 304
>gi|356515062|ref|XP_003526220.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 337
Score = 160 bits (404), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 79/172 (45%), Positives = 113/172 (65%), Gaps = 3/172 (1%)
Query: 47 HEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRA 106
HEKWMAQHG+ YKD EKE L+IF+ N+E+IE + G++++ L TNQF+DL ++EF+A
Sbjct: 32 HEKWMAQHGKVYKDAAEKERCLQIFENNMEFIESFDVCGDKSFNLSTNQFADLHDEEFKA 91
Query: 107 LYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFA-A 165
L T S +TT + F+Y N+ T +P S+DWR +G VTPIK+Q +C CWAF+
Sbjct: 92 LLTNGHKKEHSLWTTTETLFRYDNV--TKIPASMDWRKRGVVTPIKDQGKCLSCWAFSLC 149
Query: 166 VAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
VA +EG+ +I + L+ LSEQ+L+D + GC G E AF +I + I
Sbjct: 150 VATIEGLHQIITSELVPLSEQELVDFVKGESEGCYGDYVEDAFKFITKKGRI 201
Score = 89.0 bits (219), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 49/120 (40%), Positives = 63/120 (52%), Gaps = 11/120 (9%)
Query: 166 VAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIFNGVCGTQ 225
VA ++G K+ S SE LL N +R+ AF + + GIF G CGT
Sbjct: 225 VAQIKGYKKVPSK-----SENALLKAVANQLVSVSVEARDSAFQFY--SSGIFTGKCGTD 277
Query: 226 LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 281
DH V + +G + DG YWL KNSWG WG+ GY++I D EGLCGI YP+A
Sbjct: 278 TDHRVALASYGESGDGTKYWLAKNSWGTEWGEKGYIRIKXDIPAKEGLCGIAKYPYYPIA 337
>gi|297816030|ref|XP_002875898.1| hypothetical protein ARALYDRAFT_485194 [Arabidopsis lyrata subsp.
lyrata]
gi|297321736|gb|EFH52157.1| hypothetical protein ARALYDRAFT_485194 [Arabidopsis lyrata subsp.
lyrata]
Length = 363
Score = 160 bits (404), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 82/183 (44%), Positives = 119/183 (65%), Gaps = 11/183 (6%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
E++V +++E+W H + + E R +F+ N+ ++ + NK+ N+ YKL N+F+D+
Sbjct: 30 EENVWKLYERWRDHHSVT-RASHEALKRFNVFRHNVLHVHRTNKK-NKPYKLKVNRFADI 87
Query: 100 TNDEFRALYTG-----YKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQ 154
T+ EFR+ Y G ++M R S F Y+N+ T VP+S+DWR+KGAVT +KNQ
Sbjct: 88 THHEFRSSYAGSNVKHHRMLRGPKRG--SGGFMYENV--TRVPSSVDWREKGAVTEVKNQ 143
Query: 155 KECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQN 214
++CG CWAF+ VAAVEGI KIR+ L+ LSEQ+L+DC T N GC GG E AF +I N
Sbjct: 144 QDCGSCWAFSTVAAVEGINKIRTNKLVSLSEQELVDCDTEENQGCAGGLMEPAFEFIKNN 203
Query: 215 QGI 217
GI
Sbjct: 204 GGI 206
Score = 89.0 bits (219), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 36/74 (48%), Positives = 53/74 (71%), Gaps = 4/74 (5%)
Query: 210 YIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR---- 265
+ + ++G+F G CGTQL+H V IVG+G T++G YW+++NSWG WG+ GY++I R
Sbjct: 268 FQLYSEGVFIGECGTQLNHGVVIVGYGETKNGTKYWIVRNSWGPEWGEGGYVRIERGISE 327
Query: 266 DEGLCGIGTRSSYP 279
+EG CGI +SYP
Sbjct: 328 NEGRCGIAMEASYP 341
>gi|57118011|gb|AAW34137.1| cysteine protease gp3b [Zingiber officinale]
Length = 466
Score = 160 bits (404), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 84/182 (46%), Positives = 120/182 (65%), Gaps = 7/182 (3%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNR---TYKLGTNQF 96
++ V I+++W A+H + D+ + RL++FKENL ++++ N +R Y+LG N+F
Sbjct: 36 DEEVRIIYQEWRAKHRPAENDQYVGDYRLEVFKENLRFVDEHNAAADRGEHAYRLGMNRF 95
Query: 97 SDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDV-PTSLDWRDKGAVTPIKNQK 155
+DLTN+E+RA + + S RST+ L DV P S+DWR+KGAV +K+Q
Sbjct: 96 ADLTNEEYRARFL--RDLSRLGRSTSGEISNQYRLREGDVLPDSIDWREKGAVVAVKSQG 153
Query: 156 ECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQ 215
CG CWAFAA+A VEGI +I +G+LI LSEQQL+DCST N+GC GG +AF YII N
Sbjct: 154 RCGSCWAFAAIATVEGINQIVTGDLISLSEQQLVDCSTR-NHGCEGGWPYRAFQYIINNG 212
Query: 216 GI 217
G+
Sbjct: 213 GV 214
Score = 79.7 bits (195), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 35/75 (46%), Positives = 49/75 (65%), Gaps = 5/75 (6%)
Query: 210 YIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYM----KIVR 265
+ + + GIF G C T L+H VT+VG+GT +G +YW++KNSWG +WGD+GY+ I
Sbjct: 275 FQLYHSGIFTGSCNTSLNHGVTVVGYGTV-NGNDYWIVKNSWGESWGDSGYILMERNIAE 333
Query: 266 DEGLCGIGTRSSYPL 280
G CGI SYP+
Sbjct: 334 SSGKCGIAISPSYPI 348
>gi|218181|dbj|BAA14402.1| oryzain alpha precursor [Oryza sativa Japonica Group]
Length = 458
Score = 160 bits (404), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 81/189 (42%), Positives = 116/189 (61%), Gaps = 6/189 (3%)
Query: 32 VVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN---KEGNRT 88
+VS E+ ++ +W A+HG+SY E+E R F++NL YI++ N G +
Sbjct: 25 IVSYGERSEEEARRLYAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDEHNAAADAGVHS 84
Query: 89 YKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAV 148
++LG N+F+DLTN+E+R Y G + R + N ++ P S+DWR KGAV
Sbjct: 85 FRLGLNRFADLTNEEYRDTYLGLRNKPRRERKVSDRYLAADNEAL---PESVDWRTKGAV 141
Query: 149 TPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAF 208
IK+Q CG CWAF+A+AAVE I +I +G+LI LSEQ+L+DC T+ N GC GG + AF
Sbjct: 142 AEIKDQGGCGSCWAFSAIAAVEDINQIVTGDLISLSEQELVDCDTSYNEGCNGGLMDYAF 201
Query: 209 AYIIQNQGI 217
+II N GI
Sbjct: 202 DFIINNGGI 210
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 37/76 (48%), Positives = 53/76 (69%), Gaps = 5/76 (6%)
Query: 209 AYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-- 266
A+ + + GIF G CGT LDH V VG+GT E+G +YW+++NSWG +WG++GY+++ R+
Sbjct: 270 AFQLYSSGIFTGKCGTALDHGVAAVGYGT-ENGKDYWIVRNSWGKSWGESGYVRMERNIK 328
Query: 267 --EGLCGIGTRSSYPL 280
G CGI SYPL
Sbjct: 329 ASSGKCGIAVEPSYPL 344
>gi|38345906|emb|CAE04498.2| OSJNBb0059K02.8 [Oryza sativa Japonica Group]
Length = 458
Score = 160 bits (404), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 81/189 (42%), Positives = 117/189 (61%), Gaps = 6/189 (3%)
Query: 32 VVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN---KEGNRT 88
+VS E+ ++ +W A+HG+SY E+E R F++NL YI++ N G +
Sbjct: 25 IVSYGERSEEEARRLYAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDEHNAAADAGVHS 84
Query: 89 YKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAV 148
++LG N+F+DLTN+E+R Y G + R + N ++ P S+DWR KGAV
Sbjct: 85 FRLGLNRFADLTNEEYRDTYLGLRNKPRRERKVSDRYLAADNEAL---PESVDWRTKGAV 141
Query: 149 TPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAF 208
IK+Q+ G CWAF+A+AAVEGI +I +G+LI LSEQ+L+DC T+ N GC GG + AF
Sbjct: 142 AEIKDQEVAGSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDTSYNEGCNGGLMDYAF 201
Query: 209 AYIIQNQGI 217
+II N GI
Sbjct: 202 DFIINNGGI 210
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 37/76 (48%), Positives = 53/76 (69%), Gaps = 5/76 (6%)
Query: 209 AYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-- 266
A+ + + GIF G CGT LDH V VG+GT E+G +YW+++NSWG +WG++GY+++ R+
Sbjct: 270 AFQLYSSGIFTGKCGTALDHGVAAVGYGT-ENGKDYWIVRNSWGKSWGESGYVRMERNIK 328
Query: 267 --EGLCGIGTRSSYPL 280
G CGI SYPL
Sbjct: 329 ASSGKCGIAVEPSYPL 344
>gi|3451077|emb|CAA20473.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|7269200|emb|CAB79307.1| cysteine proteinase-like protein [Arabidopsis thaliana]
Length = 355
Score = 160 bits (404), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 89/210 (42%), Positives = 134/210 (63%), Gaps = 13/210 (6%)
Query: 16 TPMFIIITLLVSCASQVVSSRST---HEQSVVE---IHEKWMAQHGRSYKDEL-EKEMRL 68
T +F++I ++S S + +T H +S E I + WM++HG++Y + L EKE R
Sbjct: 10 TILFLLIVFVLSAPSSAMDLPATSGGHNRSNEEVEFIFQMWMSKHGKTYTNALGEKERRF 69
Query: 69 KIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKY 128
+ FK+NL +I++ N + N +Y+LG +F+DLT E+R L+ G P P R+ +S +Y
Sbjct: 70 QNFKDNLRFIDQHNAK-NLSYQLGLTRFADLTVQEYRDLFPG--SPKPKQRNLKTSR-RY 125
Query: 129 QNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQL 188
L+ +P S+DWR +GAV+ IK+Q C CWAF+ VAAVEG+ KI +G LI LSEQ+L
Sbjct: 126 VPLAGDQLPESVDWRQEGAVSEIKDQGTCNSCWAFSTVAAVEGLNKIVTGELISLSEQEL 185
Query: 189 LDCSTNGNNGCLG-GSREKAFAYIIQNQGI 217
+DC+ NNGC G G + AF ++I N G+
Sbjct: 186 VDCNL-VNNGCYGSGLMDTAFQFLINNNGL 214
Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 42/83 (50%), Positives = 61/83 (73%), Gaps = 5/83 (6%)
Query: 202 GSREKAFAYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYM 261
G +K+ +++ I+NG CGT LDHA+ IVG+G+ E+G +YW+++NSWG TWGDAGY+
Sbjct: 267 GVDKKSQEFMLYRSCIYNGPCGTNLDHALVIVGYGS-ENGQDYWIVRNSWGTTWGDAGYI 325
Query: 262 KIVRD----EGLCGIGTRSSYPL 280
KI R+ +GLCGI +SYP+
Sbjct: 326 KIARNFEDPKGLCGIAMLASYPI 348
>gi|42567068|ref|NP_567686.2| putative cysteine proteinase [Arabidopsis thaliana]
gi|332659371|gb|AEE84771.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 356
Score = 159 bits (403), Expect = 9e-37, Method: Compositional matrix adjust.
Identities = 89/210 (42%), Positives = 134/210 (63%), Gaps = 13/210 (6%)
Query: 16 TPMFIIITLLVSCASQVVSSRST---HEQSVVE---IHEKWMAQHGRSYKDEL-EKEMRL 68
T +F++I ++S S + +T H +S E I + WM++HG++Y + L EKE R
Sbjct: 10 TILFLLIVFVLSAPSSAMDLPATSGGHNRSNEEVEFIFQMWMSKHGKTYTNALGEKERRF 69
Query: 69 KIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKY 128
+ FK+NL +I++ N + N +Y+LG +F+DLT E+R L+ G P P R+ +S +Y
Sbjct: 70 QNFKDNLRFIDQHNAK-NLSYQLGLTRFADLTVQEYRDLFPG--SPKPKQRNLKTSR-RY 125
Query: 129 QNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQL 188
L+ +P S+DWR +GAV+ IK+Q C CWAF+ VAAVEG+ KI +G LI LSEQ+L
Sbjct: 126 VPLAGDQLPESVDWRQEGAVSEIKDQGTCNSCWAFSTVAAVEGLNKIVTGELISLSEQEL 185
Query: 189 LDCSTNGNNGCLG-GSREKAFAYIIQNQGI 217
+DC+ NNGC G G + AF ++I N G+
Sbjct: 186 VDCNL-VNNGCYGSGLMDTAFQFLINNNGL 214
Score = 96.3 bits (238), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 42/83 (50%), Positives = 61/83 (73%), Gaps = 5/83 (6%)
Query: 202 GSREKAFAYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYM 261
G +K+ +++ I+NG CGT LDHA+ IVG+G+ E+G +YW+++NSWG TWGDAGY+
Sbjct: 268 GVDKKSQEFMLYRSCIYNGPCGTNLDHALVIVGYGS-ENGQDYWIVRNSWGTTWGDAGYI 326
Query: 262 KIVRD----EGLCGIGTRSSYPL 280
KI R+ +GLCGI +SYP+
Sbjct: 327 KIARNFEDPKGLCGIAMLASYPI 349
>gi|113120271|gb|ABI30275.1| VS-A [Vasconcellea stipulata]
Length = 318
Score = 159 bits (403), Expect = 9e-37, Method: Compositional matrix adjust.
Identities = 85/184 (46%), Positives = 116/184 (63%), Gaps = 12/184 (6%)
Query: 38 THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
T + ++ + + WM ++ + YKD EK R +IFK+NL+YI++ NK+ N TY LG F+
Sbjct: 39 TSTEKLINLFDSWMVEYDKVYKDIDEKIYRFEIFKDNLKYIDETNKK-NNTYWLGLTSFT 97
Query: 98 DLTNDEFRALYTGYKMPSPSHRSTTSST----FKYQNLSMTDVPTSLDWRDKGAVTPIKN 153
DLTNDEF+ Y G P + STT F Y ++ ++P S+DWR KGAVTP++N
Sbjct: 98 DLTNDEFKEKYVG---SIPENWSTTEEPNDKEFIYDDV--VNIPASIDWRQKGAVTPVRN 152
Query: 154 QKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQ 213
Q CG CW F++VAAVEGI KI +G L+ LSEQ+LLDC + GC GG A Y +
Sbjct: 153 QGSCGSCWTFSSVAAVEGINKIVTGQLVSLSEQELLDCERR-SYGCRGGFPPYALQY-VA 210
Query: 214 NQGI 217
N GI
Sbjct: 211 NSGI 214
Score = 47.4 bits (111), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 34/101 (33%), Positives = 48/101 (47%), Gaps = 16/101 (15%)
Query: 154 QKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQ 213
Q++C A +G+ +++ N EQ L+ + ++ +AF Q
Sbjct: 226 QRQCRAAQAKGPKVKTDGVGRVQRNN-----EQALIQRIAIQPVSIVVEAKGRAF----Q 276
Query: 214 NQ--GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWG 252
N GIF G CGT +DHAV VG+G Y LIKNSWG
Sbjct: 277 NYRGGIFAGPCGTSIDHAVAAVGYGN-----GYILIKNSWG 312
>gi|22661|emb|CAA49504.1| papaya proteinase omega [Carica papaya]
Length = 367
Score = 159 bits (403), Expect = 9e-37, Method: Compositional matrix adjust.
Identities = 76/177 (42%), Positives = 117/177 (66%), Gaps = 5/177 (2%)
Query: 38 THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
T + ++++ WM H + Y++ EK R +IFK+NL YI++ NK+ N +Y+LG N+F+
Sbjct: 39 TSTERLIQLFNSWMLNHNKFYENVDEKLYRFEIFKDNLNYIDETNKK-NNSYRLGLNEFA 97
Query: 98 DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKEC 157
DL+NDEF Y G + + +S ++ N + ++P ++DWR KGAVTP+++Q C
Sbjct: 98 DLSNDEFNEKYVGSLIDATIEQSYDE---EFINEDIVNLPENVDWRKKGAVTPVRHQGSC 154
Query: 158 GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQN 214
G CWAF+AVA VEGI KIR+G L++LSEQ+L+DC ++GC GG A Y+ +N
Sbjct: 155 GSCWAFSAVATVEGINKIRTGKLVELSEQELVDCERR-SHGCKGGYPPYALEYVAKN 210
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 35/79 (44%), Positives = 47/79 (59%), Gaps = 5/79 (6%)
Query: 206 KAFAYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR 265
K + + GIF G CGT++DHAVT VG+G + LIKNSWG WG+ GY++I R
Sbjct: 269 KGRPFQLYKGGIFEGPCGTKVDHAVTAVGYGKSGGKGYI-LIKNSWGTAWGEKGYIRIKR 327
Query: 266 ----DEGLCGIGTRSSYPL 280
G+CG+ S YP+
Sbjct: 328 APGNSPGVCGLYKSSYYPI 346
>gi|146215976|gb|ABQ10190.1| actinidin Act1b [Actinidia arguta]
Length = 380
Score = 159 bits (403), Expect = 9e-37, Method: Compositional matrix adjust.
Identities = 83/201 (41%), Positives = 123/201 (61%), Gaps = 5/201 (2%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
+F L++S A + + ++E W+ ++G+SY E E R +IFKE L +
Sbjct: 13 LFFSTLLVLSLAFNAKNLTKRTNDELKAMYESWLTKYGKSYNSLGEWERRFEIFKETLRF 72
Query: 78 IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
I++ N + NR+Y++G NQF+D TN+EF++ Y G+ S S++ S+ +Y+ +P
Sbjct: 73 IDEHNADTNRSYRVGLNQFADQTNEEFQSTYLGFT--SGSNKMKVSN--RYEPRVGQVLP 128
Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-TNGN 196
+DWR GAV IK+Q +CG CWAF+A+A VEGI KI +G+LI LSEQ+L+DC T
Sbjct: 129 DYVDWRSAGAVVDIKSQGQCGSCWAFSAIATVEGINKIVTGDLISLSEQELVDCGRTQNT 188
Query: 197 NGCLGGSREKAFAYIIQNQGI 217
GC GGS F +II N GI
Sbjct: 189 RGCDGGSITDGFQFIINNGGI 209
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 40/68 (58%), Positives = 52/68 (76%), Gaps = 4/68 (5%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD---EGLCGI 272
GIF G CGT +DHAVTIVG+GT E G +YW++KNSW TWG+ GY++I+R+ G CGI
Sbjct: 276 GIFTGPCGTAVDHAVTIVGYGT-EGGIDYWIVKNSWDTTWGEEGYIRILRNVGGAGTCGI 334
Query: 273 GTRSSYPL 280
T+ SYP+
Sbjct: 335 ATKPSYPV 342
>gi|262360187|gb|ACY38051.2| cysteine proteinase C1A [Dactylis glomerata]
Length = 365
Score = 159 bits (403), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 83/187 (44%), Positives = 118/187 (63%), Gaps = 15/187 (8%)
Query: 40 EQSVVEIHEKWMAQHG---RSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQF 96
E+S+ ++E W + H R E E R +FKEN+ YI +ANK+ +R ++L N+F
Sbjct: 33 EESLRGLYETWRSHHTVSRRGLGAEAEAR-RFNVFKENVRYIHEANKK-DRPFRLALNKF 90
Query: 97 SDLTNDEFRALYTGYKMPSPSHRS------TTSSTFKYQNLSMTDVPTSLDWRDKGAVTP 150
+D+T DEFR Y G ++ HRS +F Y + ++P ++DWR KGAVTP
Sbjct: 91 ADMTTDEFRRTYAGSRVRH--HRSLSGGRRQGGGSFMYADAE--NLPAAVDWRQKGAVTP 146
Query: 151 IKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAY 210
IK+Q +CG CWAF+ + AVEGI KIR+G L+ LSEQ+L+DC+ N+GC GG + AF +
Sbjct: 147 IKDQGQCGSCWAFSTIVAVEGINKIRTGRLVSLSEQELMDCNIGENDGCNGGLMDVAFQF 206
Query: 211 IIQNQGI 217
I QN GI
Sbjct: 207 IQQNGGI 213
Score = 84.3 bits (207), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 36/70 (51%), Positives = 47/70 (67%), Gaps = 4/70 (5%)
Query: 214 NQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGL 269
++G+F GT LDH V VG+GTT DG YW++KNSWG WG+ GY+++ R EGL
Sbjct: 278 SEGVFTTDGGTDLDHGVAAVGYGTTRDGTKYWIVKNSWGEDWGEKGYIRMQRGVKQAEGL 337
Query: 270 CGIGTRSSYP 279
CGI +SYP
Sbjct: 338 CGIAMEASYP 347
>gi|113120267|gb|ABI30273.1| VXH-B, partial [Vasconcellea x heilbornii]
Length = 266
Score = 159 bits (402), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 80/178 (44%), Positives = 114/178 (64%), Gaps = 5/178 (2%)
Query: 38 THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
T + ++ + + WM ++G+ YKD EK + +IFK+NL+YI++ NK+ N TY LG F+
Sbjct: 39 TSTEKLINLFDSWMVEYGKVYKDIDEKIYKFEIFKDNLKYIDETNKKNN-TYWLGLTSFT 97
Query: 98 DLTNDEFRALYTGYKMPS-PSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKE 156
DLTNDEF+ Y G S + + F Y ++ ++P S+DWR KGAVTP+++Q
Sbjct: 98 DLTNDEFKEKYVGSISESWSTTEESNDEGFIYDDV--VNIPASIDWRQKGAVTPVRHQGS 155
Query: 157 CGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQN 214
CG CW F++VAAVEGI KI +G L+ LSEQ+LLDC + GC GG A Y+ QN
Sbjct: 156 CGSCWTFSSVAAVEGINKIVTGRLVSLSEQELLDCERR-SYGCRGGFPPYALQYVAQN 212
>gi|113120273|gb|ABI30276.1| VXH-C [Vasconcellea x heilbornii]
Length = 282
Score = 159 bits (402), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 79/171 (46%), Positives = 109/171 (63%), Gaps = 3/171 (1%)
Query: 44 VEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDE 103
+ + E WM +H + YK EK R +IFK+NL YI++ NK+ N +Y LG N+F+DLT+DE
Sbjct: 45 IRLFESWMLKHDKVYKSMEEKINRFEIFKDNLMYIDETNKKNN-SYWLGLNEFADLTHDE 103
Query: 104 FRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAF 163
F+ Y G +P S ++ + D P S+DWR KGAVTP+K+Q CG CWAF
Sbjct: 104 FKKKYVG-SIPEDYTIIEQSDDGEFPYKHVVDYPESVDWRQKGAVTPVKDQNPCGSCWAF 162
Query: 164 AAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQN 214
+ VA VEGI KI +G LI LSEQ+LLDC ++GC GG + + Y++ N
Sbjct: 163 STVATVEGINKIVTGKLISLSEQELLDCDRR-SHGCDGGYQRTSLQYVVDN 212
>gi|357162587|ref|XP_003579458.1| PREDICTED: oryzain beta chain-like [Brachypodium distachyon]
Length = 470
Score = 159 bits (402), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 78/183 (42%), Positives = 115/183 (62%), Gaps = 5/183 (2%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDEL-EKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQ 95
E I+ W A+HG + L E+E R + F +NL +++ N G ++LG N+
Sbjct: 45 EAEARAIYGLWRAEHGSGNSNSLGEEERRFRAFWDNLRFVDAHNARAAAGEEGFRLGMNR 104
Query: 96 FSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQK 155
F+DLTNDEFRA Y G K + +Y++ + ++P ++DWR+KGAV P+KNQ
Sbjct: 105 FADLTNDEFRAAYLGVKGAGQRRSARAGVGERYRHDGVEELPEAVDWREKGAVAPVKNQG 164
Query: 156 ECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFAYIIQN 214
+CG CWAF+AV+AVE I ++ +G L+ LSEQ+L++C NG +NGC GG + AF +II N
Sbjct: 165 QCGSCWAFSAVSAVESINQLVTGELVTLSEQELVECDINGQSNGCNGGLMDDAFDFIINN 224
Query: 215 QGI 217
GI
Sbjct: 225 GGI 227
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 39/82 (47%), Positives = 55/82 (67%), Gaps = 9/82 (10%)
Query: 202 GSREKAFAYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYM 261
G RE + + + G+F G CGT+LDH V VG+GT E+G +YW+++NSWG WG+AGY+
Sbjct: 284 GGRE----FQLYHSGVFTGRCGTELDHGVVAVGYGT-ENGKDYWIVRNSWGPKWGEAGYL 338
Query: 262 KIVRD----EGLCGIGTRSSYP 279
++ R+ G CGI SSYP
Sbjct: 339 RMERNINATTGKCGIAMMSSYP 360
>gi|224085750|ref|XP_002307688.1| predicted protein [Populus trichocarpa]
gi|222857137|gb|EEE94684.1| predicted protein [Populus trichocarpa]
Length = 436
Score = 159 bits (401), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 73/177 (41%), Positives = 112/177 (63%), Gaps = 12/177 (6%)
Query: 45 EIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEF 104
++ E W +HG+SY + E+ RLK+F++N +++ K N +GN +Y L N F+DLT+ EF
Sbjct: 27 QLFETWCKEHGKSYTSQEERSHRLKVFEDNYDFVTKHNSKGNSSYSLALNAFADLTHHEF 86
Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLSMT----DVPTSLDWRDKGAVTPIKNQKECGCC 160
+ G S ++NL +T D+P S+DWR+KG VT +K+Q CG C
Sbjct: 87 KTSRLGL--------SAAPLNLAHRNLEITGVVGDIPASIDWRNKGVVTNVKDQGSCGAC 138
Query: 161 WAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
W+F+A A+EGI KI +G+L+ LSEQ+L++C + N+GC GG + AF ++I N GI
Sbjct: 139 WSFSATGAIEGINKIVTGSLVSLSEQELIECDKSYNDGCGGGLMDYAFQFVINNHGI 195
Score = 83.6 bits (205), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 38/76 (50%), Positives = 54/76 (71%), Gaps = 5/76 (6%)
Query: 209 AYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-- 266
A+ + ++GIF G C T LDHAV IVG+G+ E+G +YW++KNSWG WG GYM + R+
Sbjct: 255 AFQMYSKGIFTGPCSTSLDHAVLIVGYGS-ENGVDYWIVKNSWGTGWGMRGYMHMQRNSG 313
Query: 267 --EGLCGIGTRSSYPL 280
+G+CGI +SYP+
Sbjct: 314 NSQGVCGINMLASYPV 329
>gi|358348957|ref|XP_003638507.1| Cysteine proteinase [Medicago truncatula]
gi|355504442|gb|AES85645.1| Cysteine proteinase [Medicago truncatula]
Length = 362
Score = 159 bits (401), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 87/214 (40%), Positives = 124/214 (57%), Gaps = 17/214 (7%)
Query: 15 TTPMFIIITLLVSCASQVVSSRSTHEQSVV------EIHEKWMAQHGRSYKDELEKEMRL 68
TT ++I L ++ V S H++ V +++E+W + H S ++ EK+ R
Sbjct: 2 TTKKLLLIVLSIALVLVVSESFDFHDKDVSSDESLWDLYERWRSHHTVS-RNLNEKQKRF 60
Query: 69 KIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHR-----STTS 123
+FK N+ ++ NK ++ YKL N+F+D+TN EF+ Y G K+ HR S
Sbjct: 61 NVFKSNVMHVHNTNKM-DKPYKLKLNKFADMTNHEFKTTYAGSKVNH--HRMFRGTPRVS 117
Query: 124 STFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQL 183
TF Y+N T P S+DWR KGAVT +K+Q +CG CWAF+ V AVEGI +I++ L+ L
Sbjct: 118 GTFMYENF--TKAPASVDWRKKGAVTDVKDQGQCGSCWAFSTVVAVEGINQIKTNRLVPL 175
Query: 184 SEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
SEQ+L+DC N GC GG E AF YI Q GI
Sbjct: 176 SEQELIDCDNQENQGCNGGLMEYAFEYIKQKGGI 209
>gi|356542171|ref|XP_003539543.1| PREDICTED: LOW QUALITY PROTEIN: KDEL-tailed cysteine endopeptidase
CEP2-like [Glycine max]
Length = 342
Score = 159 bits (401), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 94/242 (38%), Positives = 137/242 (56%), Gaps = 30/242 (12%)
Query: 15 TTPMFIIITLLVSCASQVVSS--------RSTHEQSVVEIHEKWMAQHGRSYKDELEKEM 66
T + II LLV C + +S S+ + + +E W+ ++G+ Y+++ E E
Sbjct: 4 TITLVAIINLLVLCNLWITASACPAKHNDNSSDSEVMRMRYESWLKKYGQKYRNKDEWEF 63
Query: 67 RLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTF 126
R +I++ N+++IE N + N +YKL N+F DLTN+EFR +Y Y+ RS + F
Sbjct: 64 RFEIYRANVQFIEVYNSQ-NYSYKLMDNKFVDLTNEEFRRMYLVYQP-----RSHLQTRF 117
Query: 127 KYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQ 186
YQ D+P +DWR +GAVT IK+Q CG CW+F+AVA VE I KI++G L+ LSEQ
Sbjct: 118 MYQ--KHGDLPKRIDWRTRGAVTXIKDQGHCGSCWSFSAVATVEDINKIKTGKLVSLSEQ 175
Query: 187 QLLDCST-NGNNGCLGGSREKAFAYIIQNQGI-------FNGVCGTQ-----LDHAVTIV 233
QL+DC NGN GC GG E F +I + G+ + G G +HAV I
Sbjct: 176 QLIDCDNRNGNEGCNGGHME-TFTFITKRGGLTTDKNYPYQGSDGDXNKAKVRNHAVAIC 234
Query: 234 GF 235
G+
Sbjct: 235 GY 236
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 34/77 (44%), Positives = 51/77 (66%), Gaps = 5/77 (6%)
Query: 207 AFAYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD 266
+A+ + ++G F+G CG L+H +TIVG+G E+G YWL+KNSW N G +GY+++ RD
Sbjct: 264 GYAFQLYSKGTFSGSCGKDLNHRMTIVGYGE-ENGEKYWLVKNSWANDXGVSGYIRMKRD 322
Query: 267 ----EGLCGIGTRSSYP 279
+G CG +SYP
Sbjct: 323 PKDKDGTCGTAMEASYP 339
>gi|57118007|gb|AAW34135.1| cysteine protease gp2b [Zingiber officinale]
Length = 379
Score = 158 bits (400), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 91/224 (40%), Positives = 135/224 (60%), Gaps = 19/224 (8%)
Query: 24 LLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANK 83
L +S V RS E V ++ +W A++ + K E RL++FKENL++++K N
Sbjct: 30 LTLSKQGGAVPVRSDEE--VRMLYLEWRAKNHPAEKYLDLNEYRLEVFKENLQFVDKHNA 87
Query: 84 EGNR---TYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSS-TFKYQNLSMTDVPTS 139
+R T++LG N+F+DLTN+E+R + + S RS + + +Y+ D+P S
Sbjct: 88 AADRGEHTFRLGMNRFADLTNEEYRTRF--LRDFSRLRRSASGKISSRYRLREGDDLPDS 145
Query: 140 LDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGC 199
+DWR+KGAV P+KNQ CG CWAF+ VAAVEGI +I +G+LI LSEQQL+DC+T N+GC
Sbjct: 146 IDWREKGAVVPVKNQGGCGSCWAFSTVAAVEGINQIVTGDLISLSEQQLVDCTT-ANHGC 204
Query: 200 LGGSREKAFAYIIQNQGIF----------NGVCGTQLDHAVTIV 233
GG AF +I+ N GI NG+C + ++ V +
Sbjct: 205 RGGWMNPAFQFIVNNGGINSEETYPYRGQNGICNSTVNAPVVSI 248
Score = 68.2 bits (165), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 31/69 (44%), Positives = 45/69 (65%), Gaps = 5/69 (7%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCG 271
GIF G C +HA+T+VG+GT D +Y +KNSWG WG++GY+++ R+ G CG
Sbjct: 288 GIFTGSCNISANHALTVVGYGTEND-KDYRTVKNSWGKNWGESGYIRVERNIGNPNGKCG 346
Query: 272 IGTRSSYPL 280
I +SYP+
Sbjct: 347 ITRFASYPV 355
>gi|18423124|ref|NP_568722.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|75309064|sp|Q9FGR9.1|CEP1_ARATH RecName: Full=KDEL-tailed cysteine endopeptidase CEP1; AltName:
Full=Cysteine proteinase CP56; Short=AtCP56; Flags:
Precursor
gi|9759028|dbj|BAB09397.1| cysteine endopeptidase [Arabidopsis thaliana]
gi|20258850|gb|AAM13907.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|308097832|gb|ADO14465.1| papain [Arabidopsis thaliana]
gi|332008536|gb|AED95919.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 361
Score = 158 bits (399), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 85/210 (40%), Positives = 127/210 (60%), Gaps = 17/210 (8%)
Query: 19 FIIITLLVSCASQVVSSRSTH------EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFK 72
FI++ L + + H E S+ E++E+W + H + E EK R +FK
Sbjct: 4 FIVLALCMLMVLETTKGLDFHNKDVESENSLWELYERWRSHHTVARSLE-EKAKRFNVFK 62
Query: 73 ENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTG-----YKMPSPSHRSTTSSTFK 127
N+++I + NK+ +++YKL N+F D+T++EFR Y G ++M ++T S F
Sbjct: 63 HNVKHIHETNKK-DKSYKLKLNKFGDMTSEEFRRTYAGSNIKHHRMFQGEKKATKS--FM 119
Query: 128 YQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQ 187
Y N++ +PTS+DWR GAVTP+KNQ +CG CWAF+ V AVEGI +IR+ L LSEQ+
Sbjct: 120 YANVNT--LPTSVDWRKNGAVTPVKNQGQCGSCWAFSTVVAVEGINQIRTKKLTSLSEQE 177
Query: 188 LLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
L+DC TN N GC GG + AF +I + G+
Sbjct: 178 LVDCDTNQNQGCNGGLMDLAFEFIKEKGGL 207
Score = 94.7 bits (234), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 54/151 (35%), Positives = 82/151 (54%), Gaps = 23/151 (15%)
Query: 143 RDKGAVT-----PIKNQKE-CGCCWAFAAVAAVEG---ITKIRSGNLIQLSEQQLLDCST 193
++KG +T P K E C A V +++G + K +L++ Q + +
Sbjct: 202 KEKGGLTSELVYPYKASDETCDTNKENAPVVSIDGHEDVPKNSEDDLMKAVANQPVSVAI 261
Query: 194 NGNNGCLGGSREKAFAYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGN 253
+ GGS + ++G+F G CGT+L+H V +VG+GTT DG YW++KNSWG
Sbjct: 262 DA-----GGS-----DFQFYSEGVFTGRCGTELNHGVAVVGYGTTIDGTKYWIVKNSWGE 311
Query: 254 TWGDAGYMKIVR----DEGLCGIGTRSSYPL 280
WG+ GY+++ R EGLCGI +SYPL
Sbjct: 312 EWGEKGYIRMQRGIRHKEGLCGIAMEASYPL 342
>gi|217073894|gb|ACJ85307.1| unknown [Medicago truncatula]
gi|388507498|gb|AFK41815.1| unknown [Medicago truncatula]
Length = 362
Score = 158 bits (399), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 86/214 (40%), Positives = 124/214 (57%), Gaps = 17/214 (7%)
Query: 15 TTPMFIIITLLVSCASQVVSSRSTHEQSVV------EIHEKWMAQHGRSYKDELEKEMRL 68
TT ++I L ++ V S H++ V +++E+W + H S ++ EK+ R
Sbjct: 2 TTKKLLLIVLSIALVLVVSESFDFHDKDVSSDESLWDLYERWRSHHTVS-RNLNEKQKRF 60
Query: 69 KIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHR-----STTS 123
+FK N+ ++ NK ++ YKL N+F+D+TN EF+ Y G K+ HR S
Sbjct: 61 NVFKSNVMHVHNTNKM-DKPYKLKLNKFADMTNHEFKTTYAGSKVNH--HRMFRGTPRVS 117
Query: 124 STFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQL 183
TF Y+N T P S+DWR KGAVT +K+Q +CG CWAF+ V AVEGI +I++ L+ L
Sbjct: 118 GTFMYENF--TKAPASVDWRKKGAVTDVKDQGQCGSCWAFSTVVAVEGINQIKTNRLVPL 175
Query: 184 SEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
SEQ+L+DC N GC GG E AF YI Q G+
Sbjct: 176 SEQELIDCDNQENQGCNGGLMEYAFEYIKQKGGV 209
>gi|414887428|tpg|DAA63442.1| TPA: hypothetical protein ZEAMMB73_713985 [Zea mays]
Length = 313
Score = 158 bits (399), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 92/260 (35%), Positives = 134/260 (51%), Gaps = 52/260 (20%)
Query: 50 WMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYT 109
W A + RSY E+ R +++++N+E IE N+ +Y+L F+DLT++EF A +T
Sbjct: 43 WQATYNRSYLTAAERLRRFEVYRQNMELIEATNRRAELSYQLSETPFTDLTSEEFLATHT 102
Query: 110 GYKMPSPSHRSTTSSTFKYQNLSMT---------------------DVPTSLDWRDKGAV 148
M + H S + +++ L T DVP S+DWR KGAV
Sbjct: 103 ---MSTRLHASEAAR--RHRELITTHAGPVSDGGRQWNRRNYTTDLDVPESVDWRTKGAV 157
Query: 149 TPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQ---LLDCSTNGNNGCLG---- 201
T +K+Q CG CW+FA VAA+EG+ KIR+G L+ LSEQ+ LD + N G
Sbjct: 158 TTVKDQGACGGCWSFATVAAIEGLHKIRTGQLVSLSEQEGKCKLDKARNHVAKIRGRKLV 217
Query: 202 -GSREKAFAYIIQNQ-----------------GIFNGVCGTQ-LDHAVTIVGFGTTEDGA 242
+ E A + Q G+F+G C + L+HAVT+VG+G G
Sbjct: 218 DQNNEAALEVAVAQQPVAVGMNVHPIQQHYKSGVFHGPCDPEDLNHAVTMVGYGAESGGR 277
Query: 243 NYWLIKNSWGNTWGDAGYMK 262
YW++KNSWG WG+ GY +
Sbjct: 278 KYWIVKNSWGEKWGEKGYFR 297
>gi|388517427|gb|AFK46775.1| unknown [Medicago truncatula]
Length = 362
Score = 158 bits (399), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 86/214 (40%), Positives = 124/214 (57%), Gaps = 17/214 (7%)
Query: 15 TTPMFIIITLLVSCASQVVSSRSTHEQSVV------EIHEKWMAQHGRSYKDELEKEMRL 68
TT ++I L ++ V S H++ V +++E+W + H S ++ EK+ R
Sbjct: 2 TTKKLLLIVLSIALVLVVSESFDFHDKDVSSDESLWDLYERWRSHHTVS-RNLNEKQKRF 60
Query: 69 KIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHR-----STTS 123
+FK N+ ++ NK ++ YKL N+F+D+TN EF+ Y G K+ HR S
Sbjct: 61 NVFKSNVMHVHNTNKM-DKPYKLKLNKFADMTNHEFKTTYAGTKVNH--HRMFRGTPRVS 117
Query: 124 STFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQL 183
TF Y+N T P S+DWR KGAVT +K+Q +CG CWAF+ V AVEGI +I++ L+ L
Sbjct: 118 GTFMYENF--TKAPASVDWRKKGAVTDVKDQGQCGSCWAFSTVVAVEGINQIKTNRLVPL 175
Query: 184 SEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
SEQ+L+DC N GC GG E AF YI Q G+
Sbjct: 176 SEQELIDCDNQENQGCNGGLMEYAFEYIKQKGGV 209
>gi|242081867|ref|XP_002445702.1| hypothetical protein SORBIDRAFT_07g024430 [Sorghum bicolor]
gi|241942052|gb|EES15197.1| hypothetical protein SORBIDRAFT_07g024430 [Sorghum bicolor]
Length = 372
Score = 157 bits (398), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 79/182 (43%), Positives = 114/182 (62%), Gaps = 8/182 (4%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
E+++ ++E+W +H + +D +K R +FKEN+ I N+ + YKL N+F D+
Sbjct: 40 EEALWALYERWRGRHAVA-RDLGDKARRFNVFKENVRLIHDFNQR-DEPYKLRLNRFGDM 97
Query: 100 TNDEFRALYTGYKMPSP----SHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQK 155
T DEFR Y G ++ R ++S+F Y D+PTS+DWR KGAVT +K+Q
Sbjct: 98 TADEFRRHYAGSRVAHHRMFRGDRQGSASSFMY--AGARDLPTSVDWRQKGAVTDVKDQG 155
Query: 156 ECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQ 215
+CG CWAF+ +AAVEGI I++ NL LSEQQL+DC T GN GC GG + AF YI ++
Sbjct: 156 QCGSCWAFSTIAAVEGINAIKTKNLTSLSEQQLVDCDTKGNAGCDGGLMDYAFQYIAKHG 215
Query: 216 GI 217
G+
Sbjct: 216 GV 217
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 37/71 (52%), Positives = 50/71 (70%), Gaps = 4/71 (5%)
Query: 214 NQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGL 269
++G+F G CGT+LDH VT VG+G DG YW++KNSWG WG+ GY+++ RD EG
Sbjct: 280 SEGVFAGRCGTELDHGVTAVGYGVAADGTKYWVVKNSWGPEWGEKGYIRMARDVAAKEGH 339
Query: 270 CGIGTRSSYPL 280
CGI +SYP+
Sbjct: 340 CGIAMEASYPV 350
>gi|356517368|ref|XP_003527359.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 332
Score = 157 bits (397), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 89/200 (44%), Positives = 125/200 (62%), Gaps = 19/200 (9%)
Query: 19 FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
F ++ + A QV + R+ + S+ E HE+ M ++G+ YKD ++ FKEN+ YI
Sbjct: 12 FAMLLCMAFLAFQV-TCRTLQDASMXERHEQRMTRYGKVYKDPPKRX-----FKENVNYI 65
Query: 79 EKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPT 138
E N N+ YK G NQF+ R + G+ M S R TT FK++N++ T P+
Sbjct: 66 EACNNAANKPYKRGINQFAP------RNRFKGH-MCSSIIRITT---FKFENVTAT--PS 113
Query: 139 SLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NN 197
++D R KGAVTPIK+Q +CGCCWAF+AVAA EGI + +G LI LSEQ+L+DC T G +
Sbjct: 114 TVDCRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALSAGKLISLSEQELVDCDTKGVDX 173
Query: 198 GCLGGSREKAFAYIIQNQGI 217
GC GG + AF +IIQN G+
Sbjct: 174 GCEGGLMDDAFKFIIQNHGL 193
Score = 92.8 bits (229), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 38/70 (54%), Positives = 51/70 (72%), Gaps = 4/70 (5%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR----DEGLCG 271
G+F G CGT+LDH VT VG+G ++DG YWL+KNSWG WG+ GY+++ R +E LCG
Sbjct: 263 GVFTGSCGTELDHGVTAVGYGVSDDGTEYWLVKNSWGTEWGEEGYIRMQRGVDSEEALCG 322
Query: 272 IGTRSSYPLA 281
I ++SYP A
Sbjct: 323 IAVQASYPSA 332
>gi|157093728|gb|ABV22590.1| KDEL-tailed cysteine endopeptidase [Solanum lycopersicum]
Length = 360
Score = 157 bits (397), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 82/183 (44%), Positives = 114/183 (62%), Gaps = 11/183 (6%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
E+ E++E+W + H S + EK R +FK N+ Y+ NK+ ++ YKL N+F+D+
Sbjct: 31 EEKFWELYERWRSHHTVSRSLD-EKHKRFNVFKANVHYVHNFNKK-DKPYKLKLNKFADM 88
Query: 100 TNDEFRALYTGYKMPSPSHR-----STTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQ 154
TN EFR Y G K+ HR S + TF Y N +VP S+DWR KGAVTP+K+Q
Sbjct: 89 TNHEFRQHYAGSKIKH--HRTLLGASRANGTFMYANED--NVPPSIDWRKKGAVTPVKDQ 144
Query: 155 KECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQN 214
+CG CWAF+ V AVEGI +I++ L+ LSEQ+L+DC T N GC GG + AF +I +
Sbjct: 145 GQCGSCWAFSTVVAVEGINQIKTKKLVSLSEQELVDCDTTENQGCNGGLMDPAFDFIKKR 204
Query: 215 QGI 217
GI
Sbjct: 205 GGI 207
>gi|297809383|ref|XP_002872575.1| hypothetical protein ARALYDRAFT_911472 [Arabidopsis lyrata subsp.
lyrata]
gi|297318412|gb|EFH48834.1| hypothetical protein ARALYDRAFT_911472 [Arabidopsis lyrata subsp.
lyrata]
Length = 371
Score = 157 bits (397), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 93/263 (35%), Positives = 141/263 (53%), Gaps = 32/263 (12%)
Query: 16 TPMFIIITLLVSCAS----QVVSSRSTHEQSVVE-------------IHEKWMAQHGRSY 58
T + ++ ++ SCA+ VVSS + H + I + WM +HG+ Y
Sbjct: 8 TLILLVAMVITSCATAMDMSVVSSNNNHHLTTSPGRLHSGFDAEASLIFDSWMVKHGKVY 67
Query: 59 KDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSH 118
EKE RL IF++NL +I N E N +Y+LG QF+DL+ E+ + G P +
Sbjct: 68 GSVAEKERRLTIFEDNLRFISNRNAE-NLSYRLGLTQFADLSLHEYGEVCHGADPRPPRN 126
Query: 119 RSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSG 178
+S+ +Y+ + +P S+DWR++GAVT +K+Q C CWAF+ V AVEG+ KI +G
Sbjct: 127 HVFMTSSDRYKTSAGDVLPKSVDWRNEGAVTEVKDQGHCRSCWAFSTVGAVEGLNKIVTG 186
Query: 179 NLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI----------FNGVCGTQL-- 226
L+ LSEQ L++C+ NNGC GG E A+ +I++N G+ NGVC +L
Sbjct: 187 ELVTLSEQDLINCNKE-NNGCGGGKVETAYEFIMKNGGLGTDNDYPYKAVNGVCDGRLKE 245
Query: 227 -DHAVTIVGFGTTEDGANYWLIK 248
+ V I GF + L+K
Sbjct: 246 NNKNVMIDGFENLPANDEFALMK 268
Score = 94.7 bits (234), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 42/75 (56%), Positives = 56/75 (74%), Gaps = 5/75 (6%)
Query: 210 YIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD--- 266
+ + G+F+G CGT L+H V +VG+GT E+G +YWL+KNS GNTWG+AGYMK+ R+
Sbjct: 286 FQLYESGVFDGSCGTNLNHGVVVVGYGT-ENGRDYWLVKNSRGNTWGEAGYMKMARNIAN 344
Query: 267 -EGLCGIGTRSSYPL 280
GLCGI R+SYPL
Sbjct: 345 PRGLCGIAMRASYPL 359
>gi|224116884|ref|XP_002317418.1| predicted protein [Populus trichocarpa]
gi|222860483|gb|EEE98030.1| predicted protein [Populus trichocarpa]
Length = 503
Score = 157 bits (397), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 86/210 (40%), Positives = 131/210 (62%), Gaps = 15/210 (7%)
Query: 24 LLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI-EKAN 82
++V+ S++VS E+S++EI ++W +H + Y+ E E R + FK NL+YI EKA
Sbjct: 32 IVVNDFSELVS-----EESIIEIFQQWRDRHQKVYEHAAESEKRYRNFKRNLKYIIEKAG 86
Query: 83 KE-GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLD 141
K+ + +G N+F+DL+N+EF+ LY + + +T+ ++ +NL D P+SLD
Sbjct: 87 KKTAALGHSVGLNKFADLSNEEFKELYLSKVKKPINIKRSTARDWRQRNLQTCDAPSSLD 146
Query: 142 WRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLG 201
WR KG VT +K+Q +CG CW+F+ A+EGI I +G+LI LSEQ+L+DC T N GC G
Sbjct: 147 WRKKGVVTAVKDQGDCGSCWSFSTTGAIEGINAIVTGDLISLSEQELVDCDTT-NYGCEG 205
Query: 202 GSREKAFAYIIQNQGI-------FNGVCGT 224
G + AF ++I N GI + GV GT
Sbjct: 206 GYMDYAFEWVINNGGIDTEANYPYTGVDGT 235
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 41/123 (33%), Positives = 65/123 (52%), Gaps = 20/123 (16%)
Query: 166 VAAVEGITKIRSGN--LIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIFNGVCG 223
V +++G T + + L+ + QQ + +G+ A + + GI++G C
Sbjct: 245 VVSIDGYTDVDETDSALLCATVQQPISVGMDGS----------ALDFQLYTGGIYDGDCS 294
Query: 224 ---TQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE----GLCGIGTRS 276
+DHAV IVG+G+ E+G +YW++KNSWG WG GY I R+ G+C I +
Sbjct: 295 DDPNDIDHAVLIVGYGS-ENGEDYWIVKNSWGTEWGMEGYFYIKRNTDLPYGVCAINAEA 353
Query: 277 SYP 279
SYP
Sbjct: 354 SYP 356
>gi|356552228|ref|XP_003544471.1| PREDICTED: LOW QUALITY PROTEIN: cysteine proteinase RD21a-like
[Glycine max]
Length = 351
Score = 157 bits (397), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 81/206 (39%), Positives = 120/206 (58%), Gaps = 10/206 (4%)
Query: 18 MFIIITLLVSCASQVVSSRSTH--------EQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+F++ + + ++S + H + V+ + E+W+ +H + Y EKE R +
Sbjct: 8 LFMVFAVSSALDMSIISHDNAHADRATRRTDDEVMSMFEEWLVKHDKVYNALGEKEKRFQ 67
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQ 129
IFK NL +I++ N NRTYKLG N F+DLTN E+RA+Y P T Y
Sbjct: 68 IFKNNLRFIDERNSL-NRTYKLGLNVFADLTNAEYRAMYLRTWDDGPRLDLDTPPRNHYV 126
Query: 130 NLSMTDVPTSLDWRDKGAVTPIKNQ-KECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQL 188
+P S+DWR +GAVTP+KNQ C CWAF AV AVE + KI++G+LI LSEQ++
Sbjct: 127 PRVGDTIPKSVDWRKEGAVTPVKNQGATCNSCWAFTAVGAVESLVKIKTGDLISLSEQEV 186
Query: 189 LDCSTNGNNGCLGGSREKAFAYIIQN 214
+DC+T+ + GC GG + + YI +N
Sbjct: 187 VDCTTSSSRGCGGGDIQHGYIYIRKN 212
Score = 74.7 bits (182), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 31/66 (46%), Positives = 46/66 (69%), Gaps = 1/66 (1%)
Query: 215 QGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDEGLCGIGT 274
QG+F G CGT+L+HA+ +VG+GT +DG +YW+ KNS+ + WG+ GY++I R C G
Sbjct: 275 QGVFKGKCGTELNHALLLVGYGTEKDG-DYWIAKNSYSDKWGENGYIRIQRKLSTCKFGN 333
Query: 275 RSSYPL 280
YP+
Sbjct: 334 GGYYPI 339
>gi|209732052|gb|ACI66895.1| Cathepsin H precursor [Salmo salar]
Length = 275
Score = 157 bits (397), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 105/281 (37%), Positives = 145/281 (51%), Gaps = 32/281 (11%)
Query: 21 IITLLVSCASQVVSSRSTHEQSVVEIHEK-WMAQHGRSYKDELEKEMRLKIFKENLEYIE 79
I+ LL+ V S E+ E H K WM+Q+ + Y E E RL+IF EN I+
Sbjct: 3 ILVLLIFAVYSVSSLYPFSEEG--EYHFKLWMSQYNKVYDME-EYYHRLQIFIENKRRID 59
Query: 80 KANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTS 139
N EGN + +G NQFSDLT EFR + + P + S T + N P S
Sbjct: 60 YHN-EGNHKFTMGLNQFSDLTFAEFRK---SFLLTEPQNCSATKGSHVSSN---GPYPES 112
Query: 140 LDWRDKG-AVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN---- 194
+DWR KG VT +KNQ CG CW F+ +E +T I +G L+QLSEQQL+DC+
Sbjct: 113 VDWRKKGNYVTAVKNQGSCGSCWTFSTTGCLESVTAIATGKLLQLSEQQLVDCAQAFNIT 172
Query: 195 -----GNNGCLGGSREKAFAYIIQNQGI-FNGVCGTQ---------LDHAVTIVGFGTTE 239
G + + AY + + + ++G T ++HAV VG+G E
Sbjct: 173 KYDEMGMVDAVARFNPVSLAYEVTSDFMHYDGGVYTSKECHNTTDTVNHAVLAVGYG-EE 231
Query: 240 DGANYWLIKNSWGNTWGDAGYMKIVRDEGLCGIGTRSSYPL 280
G YW++KNSWG++WG GY I R + +CG+ SSYPL
Sbjct: 232 KGTPYWIVKNSWGSSWGMKGYFFIERGKNMCGLAACSSYPL 272
>gi|449452572|ref|XP_004144033.1| PREDICTED: thiol protease aleurain-like [Cucumis sativus]
gi|449500499|ref|XP_004161114.1| PREDICTED: thiol protease aleurain-like [Cucumis sativus]
Length = 356
Score = 157 bits (397), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 113/351 (32%), Positives = 162/351 (46%), Gaps = 80/351 (22%)
Query: 1 MVLIFERSGSFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKD 60
+VL +GS ++ P+ ++ L +VV R + ++ ++G+ Y+
Sbjct: 13 LVLSCAVAGSVFDDSNPIRMVSDRLRELELEVV--RVLGQVPHALRFARFAHRYGKKYET 70
Query: 61 ELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRS 120
E ++R IF E+LE I+ NK+G +YKLG NQF+D T +EFR G + ++
Sbjct: 71 AEEMKLRFGIFLESLELIKSTNKQG-LSYKLGVNQFADWTWEEFRKHRLG------AAQN 123
Query: 121 TTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNL 180
+++T L+ T +P S DWR G V+P+K+Q CG CW F+ A+E G
Sbjct: 124 CSATTKGSHKLTDTALPESKDWRKDGIVSPVKDQGHCGSCWTFSTTGALEAAYAQAHGKG 183
Query: 181 IQLSEQQLLDCSTNGNN-GCLGGSREKAFAYIIQNQGI-------FNGVCGT-------- 224
I LSEQQL+DC NN GC GG +AF YI N G+ + GV G+
Sbjct: 184 ISLSEQQLVDCGRGFNNFGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGVDGSCKFVPENV 243
Query: 225 ----------------QLDHAVTIV-----------GF-------------GTT------ 238
+L HAV V GF G+T
Sbjct: 244 GVQVIDSVNITLGAEDELKHAVAFVRPVSVAFEVVSGFRLYSKGVYTSNSCGSTPMDVNH 303
Query: 239 ---------EDGANYWLIKNSWGNTWGDAGYMKIVRDEGLCGIGTRSSYPL 280
EDG YWLIKNSWG WGD GY K+ + +CG+ T +SYP+
Sbjct: 304 AVLAVGYGVEDGIPYWLIKNSWGGNWGDNGYFKMEMGKNMCGVATCASYPI 354
>gi|225706086|gb|ACO08889.1| Cathepsin S precursor [Osmerus mordax]
Length = 333
Score = 157 bits (396), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 102/312 (32%), Positives = 149/312 (47%), Gaps = 80/312 (25%)
Query: 44 VEIH-EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDL 99
+++H + W QHG++YK E+E+ R ++++ NL+ I N E G TY LG N D+
Sbjct: 26 LDLHWQMWKKQHGKNYKTEVEELGRREVWERNLQLISLHNLEASMGMHTYDLGMNHMGDM 85
Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGC 159
T +E + K+P+ R ++ + S T VP ++DWR KG VT +KNQ CG
Sbjct: 86 TEEEILQSFASLKVPADLKREPSA----FVASSGTPVPDTVDWRQKGYVTQVKNQGSCGS 141
Query: 160 CWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGI- 217
CWAF++V A+EG +G L+ LS Q L+DCS+ GN GC GG +AF Y+I N+GI
Sbjct: 142 CWAFSSVGALEGQLMRTTGKLLDLSPQNLVDCSSKYGNKGCNGGFMSEAFQYVIDNKGID 201
Query: 218 ------FNGVCG------------------------TQLDHAVTIVG------------- 234
+ GV G T L AV ++G
Sbjct: 202 SDTSYPYQGVQGTCHYNPSYRSANCTRYSFLPEGDETTLKQAVAMIGPISVAIDATRPSF 261
Query: 235 -------------------------FGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE-G 268
+GT DG +YWL+KNSWG +G+ GY+++ R+
Sbjct: 262 ILWRSGVYNDLTCTQKINHAVLVVGYGTL-DGQDYWLVKNSWGTRFGENGYIRMSRNRNN 320
Query: 269 LCGIGTRSSYPL 280
CGI YP+
Sbjct: 321 QCGIALYGCYPI 332
>gi|2098464|pdb|1PCI|A Chain A, Procaricain
gi|2098465|pdb|1PCI|B Chain B, Procaricain
gi|2098466|pdb|1PCI|C Chain C, Procaricain
Length = 322
Score = 157 bits (396), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 76/177 (42%), Positives = 116/177 (65%), Gaps = 5/177 (2%)
Query: 38 THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
T + ++++ WM H + Y++ EK R +IFK+NL YI++ NK+ N +Y LG N+F+
Sbjct: 13 TSTERLIQLFNSWMLNHNKFYENVDEKLYRFEIFKDNLNYIDETNKK-NNSYWLGLNEFA 71
Query: 98 DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKEC 157
DL+NDEF Y G + + +S ++ N + ++P ++DWR KGAVTP+++Q C
Sbjct: 72 DLSNDEFNEKYVGSLIDATIEQSYDE---EFINEDIVNLPENVDWRKKGAVTPVRHQGSC 128
Query: 158 GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQN 214
G CWAF+AVA VEGI KIR+G L++LSEQ+L+DC ++GC GG A Y+ +N
Sbjct: 129 GSCWAFSAVATVEGINKIRTGKLVELSEQELVDCERR-SHGCKGGYPPYALEYVAKN 184
Score = 57.4 bits (137), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 34/78 (43%), Positives = 45/78 (57%), Gaps = 5/78 (6%)
Query: 206 KAFAYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR 265
K + + GIF G CGT++D AVT VG+G + LIKNSWG WG+ GY++I R
Sbjct: 243 KGRPFQLYKGGIFEGPCGTKVDGAVTAVGYGKSGGKGYI-LIKNSWGTAWGEKGYIRIKR 301
Query: 266 ----DEGLCGIGTRSSYP 279
G+CG+ S YP
Sbjct: 302 APGNSPGVCGLYKSSYYP 319
>gi|1169186|sp|P43156.1|CYSP_HEMSP RecName: Full=Thiol protease SEN102; Flags: Precursor
gi|396568|emb|CAA52425.1| thiol-protease [Hemerocallis hybrid cultivar]
Length = 360
Score = 157 bits (396), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 84/209 (40%), Positives = 127/209 (60%), Gaps = 15/209 (7%)
Query: 17 PMFIIITLL----VSCASQVVSSRS--THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKI 70
P FI + L+ +S A + + E S+ ++EKW H + +D EK R +
Sbjct: 4 PKFIALALVALSFLSIAQSIPFTEKDLASEDSLWNLYEKWRTHHTVA-RDLDEKNRRFNV 62
Query: 71 FKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRS-----TTSST 125
FKEN+++I + N++ + YKL N+F D+TN EFR+ Y G K+ HRS + +
Sbjct: 63 FKENVKFIHEFNQKKDAPYKLALNKFGDMTNQEFRSKYAGSKIQH--HRSQRGIQKNTGS 120
Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
F Y+N+ S+DWR KGAVT +K+Q +CG CWAF+ +A+VEGI +I++G L+ LSE
Sbjct: 121 FMYENVGSLPA-ASIDWRAKGAVTGVKDQGQCGSCWAFSTIASVEGINQIKTGELVSLSE 179
Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQN 214
Q+L+DC T+ N GC GG + AF +I +N
Sbjct: 180 QELVDCDTSYNEGCNGGLMDYAFEFIQKN 208
Score = 91.7 bits (226), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 36/77 (46%), Positives = 52/77 (67%), Gaps = 4/77 (5%)
Query: 208 FAYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR-- 265
+ + ++G+F G CGT+LDH V IVG+G T DG YW++KNSWG WG++GY+++ R
Sbjct: 269 YGFQFYSEGVFTGRCGTELDHGVAIVGYGATRDGTKYWIVKNSWGEEWGESGYIRMQRGI 328
Query: 266 --DEGLCGIGTRSSYPL 280
G CGI +SYP+
Sbjct: 329 SDKRGKCGIAMEASYPI 345
>gi|6978721|ref|NP_037071.1| pro-cathepsin H precursor [Rattus norvegicus]
gi|115729|sp|P00786.1|CATH_RAT RecName: Full=Pro-cathepsin H; Contains: RecName: Full=Cathepsin H
mini chain; Contains: RecName: Full=Cathepsin H;
Contains: RecName: Full=Cathepsin H heavy chain;
Contains: RecName: Full=Cathepsin H light chain; Flags:
Precursor
gi|55886|emb|CAA68699.1| cathepsin H pre-pro-peptide [Rattus norvegicus]
gi|55391460|gb|AAH85352.1| Cathepsin H [Rattus norvegicus]
gi|149018921|gb|EDL77562.1| cathepsin H, isoform CRA_a [Rattus norvegicus]
gi|226475|prf||1514114A cathepsin H
Length = 333
Score = 157 bits (396), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 108/339 (31%), Positives = 154/339 (45%), Gaps = 85/339 (25%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHE----KWMAQHGRSYKDELEKEMRLKIFKE 73
M+ + LL + A ++S+ +T E +V I + WM QH ++Y E RL++F
Sbjct: 1 MWTALPLLCAGA-WLLSAGATAELTVNAIEKFHFTSWMKQHQKTYSSR-EYSHRLQVFAN 58
Query: 74 NLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
N I+ N+ N T+K+G NQFSD++ F + Y P + S T S +
Sbjct: 59 NWRKIQAHNQR-NHTFKMGLNQFSDMS---FAEIKHKYLWSEPQNCSATKSNYL---RGT 111
Query: 134 TDVPTSLDWRDKG-AVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
P+S+DWR KG V+P+KNQ CG CW F+ A+E I SG ++ L+EQQL+DC+
Sbjct: 112 GPYPSSMDWRKKGNVVSPVKNQGACGSCWTFSTTGALESAVAIASGKMMTLAEQQLVDCA 171
Query: 193 TNGNN-GCLGGSREKAFAYIIQNQGIF----------NGVCGTQLDHAVTIV-------- 233
N NN GC GG +AF YI+ N+GI NG C + AV V
Sbjct: 172 QNFNNHGCQGGLPSQAFEYILYNKGIMGEDSYPYIGKNGQCKFNPEKAVAFVKNVVNITL 231
Query: 234 ------------------GFGTTED----------------------------------G 241
F TED G
Sbjct: 232 NDEAAMVEAVALYNPVSFAFEVTEDFMMYKSGVYSSNSCHKTPDKVNHAVLAVGYGEQNG 291
Query: 242 ANYWLIKNSWGNTWGDAGYMKIVRDEGLCGIGTRSSYPL 280
YW++KNSWG+ WG+ GY I R + +CG+ +SYP+
Sbjct: 292 LLYWIVKNSWGSNWGNNGYFLIERGKNMCGLAACASYPI 330
>gi|156397875|ref|XP_001637915.1| predicted protein [Nematostella vectensis]
gi|156225031|gb|EDO45852.1| predicted protein [Nematostella vectensis]
Length = 331
Score = 156 bits (395), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 100/304 (32%), Positives = 146/304 (48%), Gaps = 78/304 (25%)
Query: 50 WMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYT 109
W + HG+ Y ++ E+ MR I++ NL+ I N EG ++KL N D+T+ E
Sbjct: 32 WKSFHGKEYPNKNEETMRNFIWQNNLKKIVTHN-EGKHSFKLAMNHLGDMTSLEISQTLL 90
Query: 110 GYKMPSPSHRSTTSSTF-KYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAA 168
G K+ + +TF N+ + D S+DWR KG VTP+KNQ +CG CWAF+ A
Sbjct: 91 GLKLKKHAESQPKGATFLPPANVKVVD---SIDWRSKGYVTPVKNQGQCGSCWAFSTTGA 147
Query: 169 VEGITKIRSGNLIQLSEQQLLDCS----TNGNNGCL------------GGSREKAFAYII 212
+EG ++G L+ LSEQ L+DCS NG G L G EK++ Y+
Sbjct: 148 LEGQHFRKTGKLVSLSEQNLVDCSGKYGNNGCEGGLMDNAFQYIKENGGIDTEKSYPYLA 207
Query: 213 QN-----------------------------------------------------QGIFN 219
++ QG+++
Sbjct: 208 KDGVCHYNKSAIGAKDTGFVDIPTGDENALQQALASVGPISIAIDASQSTFHFYHQGVYD 267
Query: 220 G--VCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR-DEGLCGIGTRS 276
T+LDH V VG+GT +DG +YWL+KNSWG +WG+ GY+KI R D CG+ +++
Sbjct: 268 DPDCSSTRLDHGVLAVGYGT-DDGKDYWLVKNSWGPSWGEEGYIKIARNDHDKCGVASKA 326
Query: 277 SYPL 280
SYPL
Sbjct: 327 SYPL 330
>gi|168058022|ref|XP_001781010.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162667567|gb|EDQ54194.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 457
Score = 156 bits (395), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 80/172 (46%), Positives = 112/172 (65%), Gaps = 9/172 (5%)
Query: 50 WMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYT 109
W +HG+ Y E+ R ++K+NLEYI++ + E N +Y LG +F+DLTN+EFR YT
Sbjct: 48 WAHKHGKVYSAAEERAHRFLVWKDNLEYIQR-HSEKNLSYWLGLTKFADLTNEEFRRQYT 106
Query: 110 GYKMPSPSH----RSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAA 165
G ++ R+ T S F+Y N ++ P S+DWR+KGAVT +K+Q CG CWAF+A
Sbjct: 107 GTRIDRSRRLKKGRNATGS-FRYAN---SEAPKSIDWREKGAVTSVKDQGSCGSCWAFSA 162
Query: 166 VAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
V +VEGI IR+G+ I LS Q+L+DC N GC GG + AF ++IQN GI
Sbjct: 163 VGSVEGINAIRTGDAISLSVQELVDCDKKYNQGCNGGLMDYAFDFVIQNGGI 214
Score = 75.1 bits (183), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 34/71 (47%), Positives = 47/71 (66%), Gaps = 7/71 (9%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD------EGL 269
G+F G CGT LDH V VG+G+ E G +YW++KNSWG WG++GY+++ R+ GL
Sbjct: 281 GVFTGRCGTDLDHGVLAVGYGS-EKGLDYWIVKNSWGEYWGESGYLRMQRNLKDDNGYGL 339
Query: 270 CGIGTRSSYPL 280
CGI SY +
Sbjct: 340 CGINIEPSYAV 350
>gi|1709576|sp|P05994.3|PAPA4_CARPA RecName: Full=Papaya proteinase 4; AltName: Full=Glycyl
endopeptidase; AltName: Full=Papaya peptidase B;
AltName: Full=Papaya proteinase IV; Short=PPIV; Flags:
Precursor
gi|953176|emb|CAA54974.1| proteinase IV [Carica papaya]
Length = 348
Score = 156 bits (395), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 78/177 (44%), Positives = 114/177 (64%), Gaps = 5/177 (2%)
Query: 38 THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
T + ++++ WM +H ++YK+ EK R +IFK+NL+YI++ NK N Y LG N+FS
Sbjct: 39 TSTERLIQLFNSWMLKHNKNYKNVDEKLYRFEIFKDNLKYIDERNKMIN-GYWLGLNEFS 97
Query: 98 DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKEC 157
DL+NDEF+ Y G P + ++ N + D+P S+DWR KGAVTP+K+Q C
Sbjct: 98 DLSNDEFKEKYVG---SLPEDYTNQPYDEEFVNEDIVDLPESVDWRAKGAVTPVKHQGYC 154
Query: 158 GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQN 214
CWAF+ VA VEGI KI++GNL++LSEQ+L+DC + GC G + + Y+ QN
Sbjct: 155 ESCWAFSTVATVEGINKIKTGNLVELSEQELVDCDKQ-SYGCNRGYQSTSLQYVAQN 210
Score = 60.8 bits (146), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 34/69 (49%), Positives = 44/69 (63%), Gaps = 5/69 (7%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR----DEGLCG 271
GIF G CGT++DHAVT VG+G + LIKNSWG WG+ GY++I R G+CG
Sbjct: 279 GIFEGSCGTKVDHAVTAVGYGKSGGKGYI-LIKNSWGPGWGENGYIRIRRASGNSPGVCG 337
Query: 272 IGTRSSYPL 280
+ S YP+
Sbjct: 338 VYRSSYYPI 346
>gi|1709574|sp|P10056.2|PAPA3_CARPA RecName: Full=Caricain; AltName: Full=Papaya peptidase A; AltName:
Full=Papaya proteinase III; Short=PPIII; AltName:
Full=Papaya proteinase omega; Flags: Precursor
gi|18098|emb|CAA46862.1| proteinase omega [Carica papaya]
Length = 348
Score = 156 bits (395), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 76/177 (42%), Positives = 115/177 (64%), Gaps = 5/177 (2%)
Query: 38 THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
T + ++++ WM H + Y++ EK R +IFK+NL YI++ NK+ N +Y LG N+F+
Sbjct: 39 TSTERLIQLFNSWMLNHNKFYENVDEKLYRFEIFKDNLNYIDETNKK-NNSYWLGLNEFA 97
Query: 98 DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKEC 157
DL+NDEF Y G + + +S ++ N ++P ++DWR KGAVTP+++Q C
Sbjct: 98 DLSNDEFNEKYVGSLIDATIEQSYDE---EFINEDTVNLPENVDWRKKGAVTPVRHQGSC 154
Query: 158 GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQN 214
G CWAF+AVA VEGI KIR+G L++LSEQ+L+DC ++GC GG A Y+ +N
Sbjct: 155 GSCWAFSAVATVEGINKIRTGKLVELSEQELVDCERR-SHGCKGGYPPYALEYVAKN 210
Score = 61.6 bits (148), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 35/78 (44%), Positives = 46/78 (58%), Gaps = 5/78 (6%)
Query: 206 KAFAYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR 265
K + + GIF G CGT++DHAVT VG+G + LIKNSWG WG+ GY++I R
Sbjct: 269 KGRPFQLYKGGIFEGPCGTKVDHAVTAVGYGKSGGKGYI-LIKNSWGTAWGEKGYIRIKR 327
Query: 266 ----DEGLCGIGTRSSYP 279
G+CG+ S YP
Sbjct: 328 APGNSPGVCGLYKSSYYP 345
>gi|255547982|ref|XP_002515048.1| cysteine protease, putative [Ricinus communis]
gi|223546099|gb|EEF47602.1| cysteine protease, putative [Ricinus communis]
Length = 359
Score = 156 bits (395), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 82/180 (45%), Positives = 124/180 (68%), Gaps = 7/180 (3%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
E+S+ ++E+W + H S + EK R +FKENL++I K N++ +R YKL N+F+D+
Sbjct: 33 EESLWNLYERWRSHHTVS-RSLTEKNQRFNVFKENLKHIHKVNQK-DRPYKLRLNKFADM 90
Query: 100 TNDEFRALYTGYKMPSPS--HRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKEC 157
TN EF Y G K+ H S + F ++N S ++P+S+DWR +GAVT +K+Q +C
Sbjct: 91 TNHEFLQHYGGSKVSHYRMFHGSRRQTGFAHENTS--NLPSSIDWRKQGAVTGVKDQGKC 148
Query: 158 GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
G CWAF++VAAVEGI KI++G LI LSEQ+L+DC++ N+GC GG E+AF++I + G+
Sbjct: 149 GSCWAFSSVAAVEGINKIKTGELISLSEQELVDCNSV-NHGCDGGLMEQAFSFIEKTGGL 207
Score = 89.0 bits (219), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 34/70 (48%), Positives = 53/70 (75%), Gaps = 4/70 (5%)
Query: 215 QGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR----DEGLC 270
+G++ G CGT+L+H V +VG+G T+DG YW++KNSWG+ WG+ G++++ R +EGLC
Sbjct: 273 EGVYTGDCGTELNHGVALVGYGATQDGTKYWIVKNSWGSEWGENGFIRMQRENDVEEGLC 332
Query: 271 GIGTRSSYPL 280
GI +SYP+
Sbjct: 333 GITLEASYPI 342
>gi|356514419|ref|XP_003525903.1| PREDICTED: LOW QUALITY PROTEIN: cysteine proteinase RD21a-like
[Glycine max]
Length = 343
Score = 156 bits (395), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 108/349 (30%), Positives = 165/349 (47%), Gaps = 97/349 (27%)
Query: 9 GSFKINTTPMFIIITLLVSCAS----QVVSSRSTH--------EQSVVEIHEKWMAQHGR 56
G+ + + +FI+ +++ +S ++S +H ++ V+ I+E+ +A+HG+
Sbjct: 2 GTNRSSKATIFILFFTVLAVSSALDLSIISYDRSHADKSGWRSDEEVMSIYEEXLAKHGK 61
Query: 57 SYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSP 116
Y E E R +I KENL+++E+ N GNRTYK+G N+F+D + M P
Sbjct: 62 VYNAIDEMEERFQISKENLKFVEQHNA-GNRTYKVGLNRFADRSR----------MMTRP 110
Query: 117 SHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIR 176
S R Y ++ S+DWR +GAV +K Q EC C F +AAVEGI KI
Sbjct: 111 SSR--------YAPRVSDNLSESVDWRKEGAVVRVKTQSECESCRTFTVIAAVEGINKIV 162
Query: 177 SGNLIQLSE-QQLLDCSTNG-----------NNGCLGGSREKAF---------------- 208
+GNL LS+ + ++ +G NNG + + F
Sbjct: 163 TGNLTALSDCDRTVNAGCSGGLADYALEFIINNGGIDTEEDYPFQGAVGICDQYKINAVD 222
Query: 209 ------------------------AYI--------IQNQGIFNGVCGTQLDHAVTIVGFG 236
AYI + GIF G CGT +DH VT VG+G
Sbjct: 223 GYERVPAYDELALKKAVANQPVSVAYIEAYGKEFQLYESGIFTGKCGTSIDHGVTAVGYG 282
Query: 237 TTEDGANYWLIKNSWGNTWGDAGYMKIVRD-----EGLCGIGTRSSYPL 280
T E+G +YW++KNSWG WG+AGY+++ R+ G CGI + YP+
Sbjct: 283 T-ENGIDYWIVKNSWGENWGEAGYVRMERNTAEDTAGKCGIAILTLYPI 330
>gi|333069454|gb|AEF13978.1| chymopapain [Carica papaya]
Length = 352
Score = 156 bits (395), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 77/180 (42%), Positives = 115/180 (63%), Gaps = 9/180 (5%)
Query: 38 THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
T + ++++ + WM +H + Y+ EK R +IF++NL YI++ NK+ N +Y LG N F+
Sbjct: 39 TSIERLIQLFDSWMLKHNKIYESIDEKIYRFEIFRDNLMYIDETNKK-NNSYWLGLNGFA 97
Query: 98 DLTNDEFRALYTGY---KMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQ 154
DL+NDEF+ Y G H T+K+ +T+ P S+DWR KGAVTP+KNQ
Sbjct: 98 DLSNDEFKKKYVGSVAEDFTGLEHFDNEDFTYKH----VTNYPQSIDWRAKGAVTPVKNQ 153
Query: 155 KECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQN 214
CG CWAF+ +A VEG+ KI +GNL++LSEQ+L+DC N ++GC GG + + Y+ N
Sbjct: 154 GSCGSCWAFSTIATVEGVNKIVTGNLLELSEQELVDCDKN-SHGCKGGYQTTSLQYVADN 212
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 38/73 (52%), Positives = 50/73 (68%), Gaps = 5/73 (6%)
Query: 212 IQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR----DE 267
+ G+F+G CGT+LDHAVT VG+GT+ DG NY +IKNSWG WG+ GYM++ R +
Sbjct: 277 LYKSGVFDGPCGTKLDHAVTAVGYGTS-DGKNYIIIKNSWGPNWGEKGYMRLKRQSGNSQ 335
Query: 268 GLCGIGTRSSYPL 280
G CG+ S YP
Sbjct: 336 GTCGVYKSSYYPF 348
>gi|224079085|ref|XP_002305743.1| predicted protein [Populus trichocarpa]
gi|222848707|gb|EEE86254.1| predicted protein [Populus trichocarpa]
Length = 494
Score = 156 bits (395), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 82/194 (42%), Positives = 122/194 (62%), Gaps = 11/194 (5%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI-EKANKEGNRTYKLGTNQFSD 98
++S++EI ++W +H ++YK E E R FK NL+YI EK KE +++G N+F+D
Sbjct: 36 DESIIEIFQQWRDRHQKAYKHAEEAEKRFGNFKRNLKYIIEKTGKETTLRHRVGLNKFAD 95
Query: 99 LTNDEFRALYTGYKMPSPSHRSTTSSTFK-YQNLSMTDVPTSLDWRDKGAVTPIKNQKEC 157
L+N+EF+ LY K+ P +++ + + +NL D P+SLDWR KG VT +K+Q +C
Sbjct: 96 LSNEEFKQLYLS-KVKKPINKTRIDAEDRSRRNLQSCDAPSSLDWRKKGVVTAVKDQGDC 154
Query: 158 GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
G CW+F+ A+EGI I + +LI LSEQ+L+DC T N GC GG + AF ++I N GI
Sbjct: 155 GSCWSFSTTGAIEGINAIVTSDLISLSEQELVDCDTT-NYGCEGGYMDYAFEWVINNGGI 213
Query: 218 -------FNGVCGT 224
+ GV GT
Sbjct: 214 DTEANYPYTGVDGT 227
Score = 61.2 bits (147), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 28/56 (50%), Positives = 38/56 (67%), Gaps = 5/56 (8%)
Query: 228 HAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE----GLCGIGTRSSYP 279
HAV IVG+G+ E+G +YW++KNSWG +WG GY I R+ G+C I +SYP
Sbjct: 294 HAVLIVGYGS-ENGEDYWIVKNSWGTSWGIEGYFYIKRNTDLPYGVCAINAMASYP 348
>gi|2160175|gb|AAB60738.1| Strong similarity to Dianthus cysteine proteinase (gb|U17135)
[Arabidopsis thaliana]
Length = 416
Score = 156 bits (394), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 76/176 (43%), Positives = 114/176 (64%), Gaps = 5/176 (2%)
Query: 43 VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTND 102
+ E+ + W +HG++Y E E++ R++IFK+N +++ + N N TY L N F+DLT+
Sbjct: 26 ISELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADLTHH 85
Query: 103 EFRALYTGYKMPSPSHRSTTSSTFKYQNLS-MTDVPTSLDWRDKGAVTPIKNQKECGCCW 161
EF+A G + +PS + K Q+L VP S+DWR KGAVT +K+Q CG CW
Sbjct: 86 EFKASRLGLSVSAPSVIMAS----KGQSLGGSVKVPDSVDWRKKGAVTNVKDQGSCGACW 141
Query: 162 AFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
+F+A A+EGI +I +G+LI LSEQ+L+DC + N GC GG + AF ++I+N GI
Sbjct: 142 SFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNHGI 197
Score = 83.6 bits (205), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 39/85 (45%), Positives = 58/85 (68%), Gaps = 10/85 (11%)
Query: 205 EKAF-----AYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAG 259
E+AF + + QGIF+G C T LDHAV IVG+G+ ++G +YW++KNSWG +WG G
Sbjct: 255 ERAFQLYSSKFYLLMQGIFSGPCSTSLDHAVLIVGYGS-QNGVDYWIVKNSWGKSWGMDG 313
Query: 260 YMKIVRD----EGLCGIGTRSSYPL 280
+M + R+ +G+CGI +SYP+
Sbjct: 314 FMHMQRNTENSDGVCGINMLASYPI 338
>gi|944916|gb|AAA74430.1| cysteine proteinase [Mesembryanthemum crystallinum]
Length = 367
Score = 156 bits (394), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 85/180 (47%), Positives = 117/180 (65%), Gaps = 12/180 (6%)
Query: 40 EQSVVEIHEKWMAQH--GRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
++++ +++E+W + + RS+ EK+ R +FKEN++YI + NK ++ YKL NQF
Sbjct: 37 DETLWDLYERWRSVYTSARSFG---EKQNRFHVFKENVKYINEVNKM-DKPYKLRLNQFG 92
Query: 98 DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKEC 157
DLT EF Y K+ + S F Y+N+ +VP S+DWR KGAVTP+KNQ C
Sbjct: 93 DLTPSEFARTYANSKIIEGTRNE--SGGFMYENV---EVPRSIDWRVKGAVTPVKNQGRC 147
Query: 158 GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
G CWAF+A AAVEGI +I +G LI LSEQQL+DC T N+GC GG+ +AF YI Q GI
Sbjct: 148 GGCWAFSAAAAVEGINQITTGQLISLSEQQLIDCDTQ-NSGCRGGTMGRAFEYIKQRGGI 206
Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 40/69 (57%), Positives = 54/69 (78%), Gaps = 3/69 (4%)
Query: 215 QGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR---DEGLCG 271
QG+F G CGT+L+H VT VG+GTT DG +YW+IKNSWG TWG+ GYM+++R GLCG
Sbjct: 274 QGVFTGPCGTKLNHGVTAVGYGTTNDGYDYWIIKNSWGETWGERGYMRMLRGVSPYGLCG 333
Query: 272 IGTRSSYPL 280
I ++S+P+
Sbjct: 334 IAMQASFPI 342
>gi|413951606|gb|AFW84255.1| hypothetical protein ZEAMMB73_933931 [Zea mays]
Length = 379
Score = 156 bits (394), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 81/218 (37%), Positives = 123/218 (56%), Gaps = 16/218 (7%)
Query: 12 KINTTPMFIIITLLVSCASQVVSSRSTHEQSVV------EIHEKWMAQHGRSYKDELEKE 65
+++ T + + + + S A ++ + E+ + +++E+W H R ++ EK
Sbjct: 3 QVSKTLLLVALVFVSSAAVELCRAIDFDERDLASDEALWDLYERWQTHH-RVHRHHGEKG 61
Query: 66 MRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKM------PSPSHR 119
R FKEN+ +I NK G+R Y+L N+F D+ +EFR+ + ++ SP+ R
Sbjct: 62 RRFGTFKENVRFIHAHNKRGDRPYRLRLNRFGDMGREEFRSTFADSRINDLRRQDSPAAR 121
Query: 120 STTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGN 179
+ F Y S D P S+DWR +GAVT +K+Q CG CWAF+ V AVEGI IR+G+
Sbjct: 122 AGAVPGFMYD--SAADPPRSVDWRQEGAVTGVKDQGHCGSCWAFSTVVAVEGINAIRTGS 179
Query: 180 LIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
L LSEQ+L+DC T+ NGC GG E AF +I GI
Sbjct: 180 LASLSEQELIDCDTD-ENGCQGGLMENAFEFIKSFGGI 216
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 35/75 (46%), Positives = 52/75 (69%), Gaps = 3/75 (4%)
Query: 209 AYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR--- 265
A+ ++G+F G CGT LDH V VG+G +DG YW++KNSWG +WG+ GY+++ R
Sbjct: 279 AFQFYSEGVFTGDCGTDLDHGVAAVGYGVGDDGTPYWIVKNSWGTSWGEGGYIRMQRGAG 338
Query: 266 DEGLCGIGTRSSYPL 280
+ GLCGI +S+P+
Sbjct: 339 NGGLCGIAMEASFPI 353
>gi|189525868|ref|XP_001341714.2| PREDICTED: cathepsin L1-like isoform 1 [Danio rerio]
Length = 336
Score = 155 bits (393), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 81/202 (40%), Positives = 124/202 (61%), Gaps = 9/202 (4%)
Query: 20 IIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIE 79
++ LLV+ + V + S+ + + + W +QHG+SY +++E R+ I++ENL IE
Sbjct: 1 MMFALLVTLSISAVFAASSIDIQLDDHWNSWKSQHGKSYHEDVEVGRRM-IWEENLRKIE 59
Query: 80 KANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDV 136
+ N E GN T+K+G NQF D+TN+EFR GYK + TS + S
Sbjct: 60 QHNFEYSYGNHTFKMGMNQFGDMTNEEFRQAMNGYKHDP----NQTSQGPLFMEPSFFAA 115
Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-TNG 195
P +DWR +G VTP+K+QK+CG CW+F++ A+EG ++G LI +SEQ L+DCS G
Sbjct: 116 PQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQG 175
Query: 196 NNGCLGGSREKAFAYIIQNQGI 217
N GC GG ++AF Y+ +N+G+
Sbjct: 176 NQGCNGGLMDQAFQYVKENKGL 197
Score = 66.6 bits (161), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 28/61 (45%), Positives = 42/61 (68%), Gaps = 4/61 (6%)
Query: 224 TQLDHAVTIVGFG---TTEDGANYWLIKNSWGNTWGDAGYMKIVRDE-GLCGIGTRSSYP 279
++LDHAV +VG+G G YW++KNSW + WGD GY+ + +D+ CG+ T++SYP
Sbjct: 275 SRLDHAVLVVGYGYQGADVAGNRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGVATKASYP 334
Query: 280 L 280
L
Sbjct: 335 L 335
>gi|4469153|emb|CAB38314.1| chymopapain isoform II [Carica papaya]
Length = 352
Score = 155 bits (393), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 77/180 (42%), Positives = 115/180 (63%), Gaps = 9/180 (5%)
Query: 38 THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
T + ++++ + WM +H + Y+ EK R +IF++NL YI++ NK+ N +Y LG N F+
Sbjct: 39 TSIERLIQLFDSWMLKHNKIYESIDEKIYRFEIFRDNLMYIDETNKK-NNSYWLGLNGFA 97
Query: 98 DLTNDEFRALYTGY---KMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQ 154
DL+NDEF+ Y G+ H T+K+ +T+ P S+DWR KGAVTP+KNQ
Sbjct: 98 DLSNDEFKKKYVGFVAEDFTGLEHFDNEDFTYKH----VTNYPQSIDWRAKGAVTPVKNQ 153
Query: 155 KECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQN 214
CG CWAF+ +A VEGI KI +GNL++LSEQ+L+DC + + GC GG + + Y+ N
Sbjct: 154 GACGSCWAFSTIATVEGINKIVTGNLLELSEQELVDCDKH-SYGCKGGYQTTSLQYVANN 212
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 38/73 (52%), Positives = 50/73 (68%), Gaps = 5/73 (6%)
Query: 212 IQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----E 267
+ G+F+G CGT+LDHAVT VG+GT+ DG NY +IKNSWG WG+ GYM++ R +
Sbjct: 277 LYKSGVFDGPCGTKLDHAVTAVGYGTS-DGKNYIIIKNSWGPNWGEKGYMRLKRQSGNSQ 335
Query: 268 GLCGIGTRSSYPL 280
G CG+ S YP
Sbjct: 336 GTCGVYKSSYYPF 348
>gi|2507252|sp|P14080.2|PAPA2_CARPA RecName: Full=Chymopapain; AltName: Full=Papaya proteinase II;
Short=PPII; Flags: Precursor
gi|1332461|emb|CAA66378.1| chymopapain [Carica papaya]
Length = 352
Score = 155 bits (393), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 77/180 (42%), Positives = 115/180 (63%), Gaps = 9/180 (5%)
Query: 38 THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
T + ++++ + WM +H + Y+ EK R +IF++NL YI++ NK+ N +Y LG N F+
Sbjct: 39 TSIERLIQLFDSWMLKHNKIYESIDEKIYRFEIFRDNLMYIDETNKK-NNSYWLGLNGFA 97
Query: 98 DLTNDEFRALYTGY---KMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQ 154
DL+NDEF+ Y G+ H T+K+ +T+ P S+DWR KGAVTP+KNQ
Sbjct: 98 DLSNDEFKKKYVGFVAEDFTGLEHFDNEDFTYKH----VTNYPQSIDWRAKGAVTPVKNQ 153
Query: 155 KECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQN 214
CG CWAF+ +A VEGI KI +GNL++LSEQ+L+DC + + GC GG + + Y+ N
Sbjct: 154 GACGSCWAFSTIATVEGINKIVTGNLLELSEQELVDCDKH-SYGCKGGYQTTSLQYVANN 212
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 38/73 (52%), Positives = 50/73 (68%), Gaps = 5/73 (6%)
Query: 212 IQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----E 267
+ G+F+G CGT+LDHAVT VG+GT+ DG NY +IKNSWG WG+ GYM++ R +
Sbjct: 277 LYKSGVFDGPCGTKLDHAVTAVGYGTS-DGKNYIIIKNSWGPNWGEKGYMRLKRQSGNSQ 335
Query: 268 GLCGIGTRSSYPL 280
G CG+ S YP
Sbjct: 336 GTCGVYKSSYYPF 348
>gi|18391078|ref|NP_563855.1| xylem bark cysteine peptidase 3 [Arabidopsis thaliana]
gi|110741821|dbj|BAE98853.1| papain-like cysteine peptidase XBCP3 [Arabidopsis thaliana]
gi|111074448|gb|ABH04597.1| At1g09850 [Arabidopsis thaliana]
gi|332190386|gb|AEE28507.1| xylem bark cysteine peptidase 3 [Arabidopsis thaliana]
Length = 437
Score = 155 bits (393), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 76/176 (43%), Positives = 114/176 (64%), Gaps = 5/176 (2%)
Query: 43 VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTND 102
+ E+ + W +HG++Y E E++ R++IFK+N +++ + N N TY L N F+DLT+
Sbjct: 28 ISELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADLTHH 87
Query: 103 EFRALYTGYKMPSPSHRSTTSSTFKYQNLS-MTDVPTSLDWRDKGAVTPIKNQKECGCCW 161
EF+A G + +PS + K Q+L VP S+DWR KGAVT +K+Q CG CW
Sbjct: 88 EFKASRLGLSVSAPSVIMAS----KGQSLGGSVKVPDSVDWRKKGAVTNVKDQGSCGACW 143
Query: 162 AFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
+F+A A+EGI +I +G+LI LSEQ+L+DC + N GC GG + AF ++I+N GI
Sbjct: 144 SFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNHGI 199
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 36/76 (47%), Positives = 55/76 (72%), Gaps = 5/76 (6%)
Query: 209 AYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-- 266
A+ + + GIF+G C T LDHAV IVG+G+ ++G +YW++KNSWG +WG G+M + R+
Sbjct: 259 AFQLYSSGIFSGPCSTSLDHAVLIVGYGS-QNGVDYWIVKNSWGKSWGMDGFMHMQRNTE 317
Query: 267 --EGLCGIGTRSSYPL 280
+G+CGI +SYP+
Sbjct: 318 NSDGVCGINMLASYPI 333
>gi|544129|sp|P25803.2|CYSEP_PHAVU RecName: Full=Vignain; AltName: Full=Bean endopeptidase; AltName:
Full=Cysteine proteinase EP-C1; Flags: Precursor
gi|20994|emb|CAA44816.1| endopeptidase [Phaseolus vulgaris]
Length = 362
Score = 155 bits (393), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 84/210 (40%), Positives = 119/210 (56%), Gaps = 9/210 (4%)
Query: 15 TTPMFIIITLLVSCASQVVSSRSTH------EQSVVEIHEKWMAQHGRSYKDELEKEMRL 68
T + + L S V +S H E+S+ +++E+W + H S + EK R
Sbjct: 2 ATKKLLWVVLSFSLVLGVANSFDFHDKDLASEESLWDLYERWRSHHTVS-RSLGEKHKRF 60
Query: 69 KIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPS-HRSTTSSTFK 127
+FK NL ++ NK ++ YKL N+F+D+TN EFR+ Y G K+ P R T
Sbjct: 61 NVFKANLMHVHNTNKM-DKPYKLKLNKFADMTNHEFRSTYAGSKVNHPRMFRGTPHENGA 119
Query: 128 YQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQ 187
+ + VP S+DWR KGAVT +K+Q +CG CWAF+ V AVEGI +I++ L+ LSEQ+
Sbjct: 120 FMYEKVVSVPPSVDWRKKGAVTDVKDQGQCGSCWAFSTVVAVEGINQIKTNKLVALSEQE 179
Query: 188 LLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
L+DC N GC GG E AF +I Q GI
Sbjct: 180 LVDCDKEENQGCNGGLMESAFEFIKQKGGI 209
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 37/70 (52%), Positives = 50/70 (71%), Gaps = 4/70 (5%)
Query: 215 QGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLC 270
+G+F G C T L+H V IVG+GTT DG NYW+++NSWG WG+ GY+++ R+ EGLC
Sbjct: 275 EGVFTGDCSTDLNHGVAIVGYGTTVDGTNYWIVRNSWGPEWGEHGYIRMQRNISKKEGLC 334
Query: 271 GIGTRSSYPL 280
GI SYP+
Sbjct: 335 GIAMLPSYPI 344
>gi|194352764|emb|CAQ00110.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
Length = 406
Score = 155 bits (393), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 90/229 (39%), Positives = 117/229 (51%), Gaps = 34/229 (14%)
Query: 39 HEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEG---NRTYKLGTNQ 95
H+ +++ WM H RSY EK R ++++ N+ +IE N E TY+LG
Sbjct: 55 HQDLMMDRFHVWMTVHNRSYSTAGEKARRFEVYRSNMRFIEAVNAEAATSGLTYELGEGP 114
Query: 96 FSDLTNDEFRALYTGYKMP-------------------SPSHRSTTSSTFKYQNLSMTDV 136
F+DLTN+EF LYTG + S T Y N S +
Sbjct: 115 FTDLTNEEFMELYTGQILEDDQSEDGDDDEQIITTHAGSIDGLGTHKGATVYANFSAS-A 173
Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGN 196
PTS+DWR +G VTP+KNQK+CG CWAF VA +EGI KI+ G L+ LSEQQL+DC +
Sbjct: 174 PTSIDWRKRGVVTPVKNQKQCGSCWAFPTVATIEGIHKIKRGTLVSLSEQQLIDCDYL-D 232
Query: 197 NGCLGGSREKAFAYIIQNQGI----------FNGVCGTQLDHAVTIVGF 235
NGC GG +AF +I +N GI G C A IVGF
Sbjct: 233 NGCKGGLVTRAFQWIKKNGGITSTSSYKYKAVRGRCLRNRKPAAKIVGF 281
Score = 76.6 bits (187), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 38/81 (46%), Positives = 49/81 (60%), Gaps = 16/81 (19%)
Query: 216 GIFNGVCGT-QLDHAVTIVGFG-----------TTEDGANYWLIKNSWGNTWGDAGYMKI 263
GI+NG C T +L+HAVT+VG+G + GA YW++KNSWG TWGD GY+ +
Sbjct: 318 GIYNGPCSTTKLNHAVTVVGYGQQQQNGADSVHASAPGAKYWIVKNSWGTTWGDKGYILM 377
Query: 264 VR----DEGLCGIGTRSSYPL 280
R G CGI TR +PL
Sbjct: 378 KRGTKHSSGQCGIATRPVFPL 398
>gi|4469155|emb|CAB38315.1| chymopapain isoform III [Carica papaya]
Length = 361
Score = 155 bits (393), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 77/180 (42%), Positives = 115/180 (63%), Gaps = 9/180 (5%)
Query: 38 THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
T + ++++ + WM +H + Y+ EK R +IF++NL YI++ NK+ N +Y LG N F+
Sbjct: 39 TSIERLIQLFDSWMLKHNKIYESIDEKIYRFEIFRDNLMYIDETNKK-NNSYWLGLNGFA 97
Query: 98 DLTNDEFRALYTGY---KMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQ 154
DL+NDEF+ Y G+ H T+K+ +T+ P S+DWR KGAVTP+KNQ
Sbjct: 98 DLSNDEFKKKYVGFVAEDFTGLEHFDNEDFTYKH----VTNYPQSIDWRAKGAVTPVKNQ 153
Query: 155 KECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQN 214
CG CWAF+ +A VEGI KI +GNL++LSEQ+L+DC + + GC GG + + Y+ N
Sbjct: 154 GACGSCWAFSTIATVEGINKIVTGNLLELSEQELVDCDKH-SYGCKGGYQTTSLQYVANN 212
Score = 82.8 bits (203), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 38/73 (52%), Positives = 50/73 (68%), Gaps = 5/73 (6%)
Query: 212 IQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----E 267
+ G+F+G CGT+LDHAVT VG+GT+ DG NY +IKNSWG WG+ GYM++ R +
Sbjct: 277 LYKSGVFDGPCGTKLDHAVTAVGYGTS-DGKNYIIIKNSWGPNWGEKGYMRLKRQSGNSQ 335
Query: 268 GLCGIGTRSSYPL 280
G CG+ S YP
Sbjct: 336 GTCGVYKSSYYPF 348
>gi|293342579|ref|XP_001065885.2| PREDICTED: cathepsin L1 [Rattus norvegicus]
gi|293354415|ref|XP_225137.5| PREDICTED: cathepsin L1 [Rattus norvegicus]
gi|149039747|gb|EDL93863.1| rCG24278 [Rattus norvegicus]
Length = 330
Score = 155 bits (392), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 82/209 (39%), Positives = 124/209 (59%), Gaps = 15/209 (7%)
Query: 16 TPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENL 75
TP+F++ TL + ++S+ TH+ S + E+W +HG++Y E + R +++ N+
Sbjct: 2 TPIFLLATLCLG----MISAAPTHDPSFDTVWEEWKTKHGKTYNTNEEGQKR-AVWENNM 56
Query: 76 EYIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
+ I N++ G + L N F DLTN EFR L TG++ P + F
Sbjct: 57 KMINLHNEDYLKGKHGFSLEMNAFGDLTNTEFRELMTGFQSMGPKETTIFREPF------ 110
Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
+ D+P SLDWR+ G VTP+KNQ +CG CWAF+AV ++EG ++G L+ LSEQ L+DCS
Sbjct: 111 LGDIPKSLDWREHGYVTPVKNQGQCGSCWAFSAVGSLEGQIFKKTGKLVSLSEQNLVDCS 170
Query: 193 -TNGNNGCLGGSREKAFAYIIQNQGIFNG 220
+ GN GC GG E AF Y+ +N+G+ G
Sbjct: 171 WSYGNLGCNGGLMEFAFQYVKENRGLDTG 199
>gi|14600257|gb|AAK71314.1|AF388175_1 papain-like cysteine peptidase XBCP3 [Arabidopsis thaliana]
Length = 437
Score = 155 bits (392), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 76/176 (43%), Positives = 114/176 (64%), Gaps = 5/176 (2%)
Query: 43 VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTND 102
+ E+ + W +HG++Y E E++ R++IFK+N +++ + N N TY L N F+DLT+
Sbjct: 28 ISELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADLTHH 87
Query: 103 EFRALYTGYKMPSPSHRSTTSSTFKYQNLS-MTDVPTSLDWRDKGAVTPIKNQKECGCCW 161
EF+A G + +PS + K Q+L VP S+DWR KGAVT +K+Q CG CW
Sbjct: 88 EFKASRLGLSVSAPSVIMAS----KGQSLGGSVKVPDSVDWRKKGAVTNVKDQGSCGACW 143
Query: 162 AFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
+F+A A+EGI +I +G+LI LSEQ+L+DC + N GC GG + AF ++I+N GI
Sbjct: 144 SFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNHGI 199
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 36/76 (47%), Positives = 56/76 (73%), Gaps = 5/76 (6%)
Query: 209 AYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-- 266
A+ + ++GIF+G C T LDHAV IVG+G+ ++G +YW++KNSWG +WG G+M + R+
Sbjct: 259 AFQLYSRGIFSGPCSTSLDHAVLIVGYGS-QNGVDYWIVKNSWGKSWGMDGFMHMQRNTE 317
Query: 267 --EGLCGIGTRSSYPL 280
+G+CGI +SYP+
Sbjct: 318 NSDGVCGINMLASYPI 333
>gi|156739275|ref|NP_001096585.1| cathepsin L1-like precursor [Danio rerio]
gi|156230123|gb|AAI52285.1| MGC174857 protein [Danio rerio]
Length = 335
Score = 155 bits (392), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 86/204 (42%), Positives = 126/204 (61%), Gaps = 13/204 (6%)
Query: 20 IIITLLVS-CASQVVSSRSTHEQSVVEIH-EKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
++ LLV+ C S V ++ S Q ++ H W +QHG+SY ++LE R+ I++ENL
Sbjct: 1 MMFALLVTLCISAVFTAPSIDIQ--LDDHWNSWKSQHGKSYHEDLEVGRRM-IWEENLRK 57
Query: 78 IEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT 134
IE+ N E GN T+K+G NQF D+TN+EFR GYK + TS + S
Sbjct: 58 IEQHNFEYSYGNHTFKMGMNQFGDMTNEEFRQAMNGYKHDP----NRTSQGPLFMEPSFF 113
Query: 135 DVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-T 193
P +DWR +G VTP+K+QK+CG CW+F++ A+EG ++G LI +SEQ L+DCS
Sbjct: 114 AAPQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRP 173
Query: 194 NGNNGCLGGSREKAFAYIIQNQGI 217
GN GC GG ++AF Y+ +N+G+
Sbjct: 174 QGNQGCNGGIMDQAFQYVKENKGL 197
Score = 68.2 bits (165), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 30/67 (44%), Positives = 43/67 (64%), Gaps = 4/67 (5%)
Query: 218 FNGVCGTQLDHAVTIVGFG---TTEDGANYWLIKNSWGNTWGDAGYMKIVRDE-GLCGIG 273
+ C ++LDHAV +VG+G G YW++KNSW + WGD GY+ + +D+ CGI
Sbjct: 268 YERACTSRLDHAVLVVGYGYQGADVAGNRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGIA 327
Query: 274 TRSSYPL 280
T +SYPL
Sbjct: 328 TMASYPL 334
>gi|600111|emb|CAA84378.1| cysteine proteinase [Vicia sativa]
Length = 359
Score = 155 bits (392), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 90/211 (42%), Positives = 128/211 (60%), Gaps = 23/211 (10%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
E+S+ ++E+W + H + ++ EK R +FK N+ ++ NK ++ YKL N+F D+
Sbjct: 33 EKSLWNLYERWRSHHTVT-RNLDEKHNRFNVFKANVMHVHNTNKL-DKPYKLKLNKFGDM 90
Query: 100 TNDEFRALYTGYKMPSPSHR-----STTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQ 154
TN EFR +Y K+ HR S + TF Y+N DVP+S+DWR+KGAVT +K+Q
Sbjct: 91 TNYEFRRIYADSKISH--HRMFRGMSHENGTFMYEN--AVDVPSSIDWRNKGAVTGVKDQ 146
Query: 155 KECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQN 214
+CG CWAF+ +AAVEGI +I++ L+ LSEQQL+DC T N GC GG E AF +I QN
Sbjct: 147 GQCGSCWAFSTIAAVEGINQIKTQKLVSLSEQQLVDCDTEENEGCNGGLMEYAFEFIKQN 206
Query: 215 QGIF----------NGVCGTQL-DHAVTIVG 234
GI +G C + D AV+I G
Sbjct: 207 -GITTESNYPYAAKDGTCDVEKEDKAVSIDG 236
Score = 86.3 bits (212), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 35/78 (44%), Positives = 52/78 (66%), Gaps = 4/78 (5%)
Query: 207 AFAYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR- 265
+ + ++G+F G C T L+H V IVG+G T+D YW++KNSWG+ WG+ GY+++ R
Sbjct: 265 GYNFQFYSEGVFTGHCDTDLNHGVAIVGYGVTQDRTKYWIMKNSWGSEWGEQGYIRMQRG 324
Query: 266 ---DEGLCGIGTRSSYPL 280
EGLCGI +SYP+
Sbjct: 325 ISSREGLCGIAMEASYPI 342
>gi|357507505|ref|XP_003624041.1| Cysteine proteinase [Medicago truncatula]
gi|355499056|gb|AES80259.1| Cysteine proteinase [Medicago truncatula]
Length = 342
Score = 155 bits (392), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 84/181 (46%), Positives = 116/181 (64%), Gaps = 17/181 (9%)
Query: 42 SVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL-- 99
S+ E E W ++G YKD E++ +IFK N+ YI+ N GN+ YKL N+F D
Sbjct: 37 SLSERFEYWKTKYGVVYKDVAEQKKHFQIFKHNVAYIDYFNAAGNKPYKLAINRFVDKPI 96
Query: 100 --TNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKEC 157
++D F + +T ++TFKY+N+ TD+P ++DWR +GAVTPIKNQ +C
Sbjct: 97 EDSDDGFER----------TTTTTPTTTFKYENV--TDIPATVDWRKRGAVTPIKNQGKC 144
Query: 158 GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGN-NGCLGGSREKAFAYIIQNQG 216
G CWAF+AVAA+EGI KI SGNL+ LSEQQL+DC +G GC G+ AF +I++N G
Sbjct: 145 GSCWAFSAVAAIEGIQKITSGNLVSLSEQQLVDCDRSGRTKGCDNGNMINAFKFILENGG 204
Query: 217 I 217
I
Sbjct: 205 I 205
Score = 95.1 bits (235), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 40/69 (57%), Positives = 52/69 (75%), Gaps = 4/69 (5%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCG 271
GIF G CGT+ +HA+TIVG+GT++DG YWL+KNSW WG+ GY++I RD EGLCG
Sbjct: 269 GIFTGECGTKPNHALTIVGYGTSKDGIKYWLVKNSWSKRWGEKGYIRIKRDIDAKEGLCG 328
Query: 272 IGTRSSYPL 280
I + SYP+
Sbjct: 329 IAMKPSYPI 337
>gi|326672297|ref|XP_003199631.1| PREDICTED: cathepsin L1-like [Danio rerio]
Length = 336
Score = 155 bits (392), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 81/202 (40%), Positives = 124/202 (61%), Gaps = 9/202 (4%)
Query: 20 IIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIE 79
++ LLV+ V + S+ + + + W +QHG+SY +++E R+ I++ENL IE
Sbjct: 1 MMFALLVTLCISAVFAASSIDIQLDDHWNSWKSQHGKSYHEDVEVGRRM-IWEENLRKIE 59
Query: 80 KANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDV 136
+ N E GN T+K+G NQF D+TN+EFR GYK + TS + S
Sbjct: 60 QHNFEYSYGNHTFKMGMNQFGDMTNEEFRHAMNGYKHDP----NQTSQGPLFMEPSFFAA 115
Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-TNG 195
P +DWR +G VTP+K+QK+CG CW+F++ A+EG ++G LI +SEQ L+DCS +G
Sbjct: 116 PQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPHG 175
Query: 196 NNGCLGGSREKAFAYIIQNQGI 217
N GC GG ++AF Y+ +N+G+
Sbjct: 176 NQGCNGGLMDQAFQYVKENKGL 197
Score = 65.5 bits (158), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 29/61 (47%), Positives = 41/61 (67%), Gaps = 4/61 (6%)
Query: 224 TQLDHAVTIVGFG---TTEDGANYWLIKNSWGNTWGDAGYMKIVRDE-GLCGIGTRSSYP 279
++LDHAV +VG+G G YW++KNSW + WGD GY+ + +D+ CGI T +SYP
Sbjct: 275 SRLDHAVLVVGYGYQGADVAGNRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGIATMASYP 334
Query: 280 L 280
L
Sbjct: 335 L 335
>gi|18141289|gb|AAL60582.1|AF454960_1 senescence-associated cysteine protease [Brassica oleracea]
Length = 359
Score = 155 bits (392), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 111/332 (33%), Positives = 155/332 (46%), Gaps = 90/332 (27%)
Query: 31 QVVSSRSTH-EQSVVEI---------HEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEK 80
++VS R E+SVV+I ++ ++G+ Y++ E ++R IFKENL+ I
Sbjct: 34 RMVSDRLREVEESVVQILGQSRHVISFARFAHRYGKRYENAEEMKLRFSIFKENLDLIRS 93
Query: 81 ANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSL 140
NK+G +YKLG NQF+D+T EF+ G + + +T T K L+ +P +
Sbjct: 94 TNKKG-LSYKLGVNQFADMTWQEFQRTKLG---AAQNCSATLKGTHK---LTGEALPETK 146
Query: 141 DWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNN-GC 199
DWR+ G V+P+K+Q CG CW F+ A+E G I LSEQQL+DC+ NN GC
Sbjct: 147 DWREDGIVSPVKDQGGCGSCWTFSTTGALEAAYHQAFGKGISLSEQQLVDCAGAFNNYGC 206
Query: 200 LGGSREKAFAYIIQNQGI-------FNGVCGT------------------------QLDH 228
GG +AF YI N G+ + G GT +L H
Sbjct: 207 NGGLPSQAFEYIKSNGGLDTEEAYPYTGEDGTCKYSAENVGVEVLDSVNITLGAEDELKH 266
Query: 229 AVTIV----------------------------------------GFGTTEDGANYWLIK 248
AV +V G+G EDG YWLIK
Sbjct: 267 AVGLVRPVSIAFEVIHSFRLYKSGVYSDSHCGQTPMDVNHAVLAVGYG-IEDGVPYWLIK 325
Query: 249 NSWGNTWGDAGYMKIVRDEGLCGIGTRSSYPL 280
NSWG WGD GY K+ + +CGI T +SYP+
Sbjct: 326 NSWGADWGDKGYFKMEMGKNMCGIATCASYPV 357
>gi|413951605|gb|AFW84254.1| hypothetical protein ZEAMMB73_933931 [Zea mays]
Length = 423
Score = 155 bits (392), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 81/218 (37%), Positives = 123/218 (56%), Gaps = 16/218 (7%)
Query: 12 KINTTPMFIIITLLVSCASQVVSSRSTHEQSVV------EIHEKWMAQHGRSYKDELEKE 65
+++ T + + + + S A ++ + E+ + +++E+W H R ++ EK
Sbjct: 47 QVSKTLLLVALVFVSSAAVELCRAIDFDERDLASDEALWDLYERWQTHH-RVHRHHGEKG 105
Query: 66 MRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKM------PSPSHR 119
R FKEN+ +I NK G+R Y+L N+F D+ +EFR+ + ++ SP+ R
Sbjct: 106 RRFGTFKENVRFIHAHNKRGDRPYRLRLNRFGDMGREEFRSTFADSRINDLRRQDSPAAR 165
Query: 120 STTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGN 179
+ F Y S D P S+DWR +GAVT +K+Q CG CWAF+ V AVEGI IR+G+
Sbjct: 166 AGAVPGFMYD--SAADPPRSVDWRQEGAVTGVKDQGHCGSCWAFSTVVAVEGINAIRTGS 223
Query: 180 LIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
L LSEQ+L+DC T+ NGC GG E AF +I GI
Sbjct: 224 LASLSEQELIDCDTD-ENGCQGGLMENAFEFIKSFGGI 260
Score = 89.0 bits (219), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 35/75 (46%), Positives = 52/75 (69%), Gaps = 3/75 (4%)
Query: 209 AYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR--- 265
A+ ++G+F G CGT LDH V VG+G +DG YW++KNSWG +WG+ GY+++ R
Sbjct: 323 AFQFYSEGVFTGDCGTDLDHGVAAVGYGVGDDGTPYWIVKNSWGTSWGEGGYIRMQRGAG 382
Query: 266 DEGLCGIGTRSSYPL 280
+ GLCGI +S+P+
Sbjct: 383 NGGLCGIAMEASFPI 397
>gi|18413505|ref|NP_567376.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|30315954|sp|Q9SUT0.1|CPR3_ARATH RecName: Full=Probable cysteine proteinase At4g11310; Flags:
Precursor
gi|5596477|emb|CAB51415.1| drought-inducible cysteine proteinase RD21A precursor-like protein
[Arabidopsis thaliana]
gi|7267830|emb|CAB81232.1| drought-inducible cysteine proteinase RD21A precursor-like protein
[Arabidopsis thaliana]
gi|332657595|gb|AEE82995.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 364
Score = 155 bits (392), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 87/233 (37%), Positives = 134/233 (57%), Gaps = 24/233 (10%)
Query: 16 TPMFIIITLLV--SCASQVVSSRSTHE-----QSVVE-----IHEKWMAQHGRSYKDELE 63
+ M I++ +V SCA+ + S +++ SV + I E WM +HG+ Y E
Sbjct: 6 SAMLILLVAMVIASCATAIDMSVVSYDDNNRLHSVFDAEASLIFESWMVKHGKVYGSVAE 65
Query: 64 KEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTS 123
KE RL IF++NL +I N E N +Y+LG F+DL+ E++ + G P + +
Sbjct: 66 KERRLTIFEDNLRFINNRNAE-NLSYRLGLTGFADLSLHEYKEVCHGADPRPPRNHVFMT 124
Query: 124 STFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQL 183
S+ +Y+ + +P S+DWR++GAVT +K+Q C CWAF+ V AVEG+ KI +G L+ L
Sbjct: 125 SSDRYKTSADDVLPKSVDWRNEGAVTEVKDQGHCRSCWAFSTVGAVEGLNKIVTGELVTL 184
Query: 184 SEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI----------FNGVCGTQL 226
SEQ L++C+ NNGC GG E A+ +I++N G+ NGVC +L
Sbjct: 185 SEQDLINCNKE-NNGCGGGKLETAYEFIMKNGGLGTDNDYPYKAVNGVCDGRL 236
Score = 91.7 bits (226), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 41/75 (54%), Positives = 55/75 (73%), Gaps = 5/75 (6%)
Query: 210 YIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD--- 266
+ + G+F+G CGT L+H V +VG+GT E+G +YWL+KNS G TWG+AGYMK+ R+
Sbjct: 279 FQLYESGVFDGSCGTNLNHGVVVVGYGT-ENGRDYWLVKNSRGITWGEAGYMKMARNIAN 337
Query: 267 -EGLCGIGTRSSYPL 280
GLCGI R+SYPL
Sbjct: 338 PRGLCGIAMRASYPL 352
>gi|386648114|gb|AFJ15104.1| mexicain-like cystein protease, partial [Jacaratia mexicana]
Length = 323
Score = 155 bits (392), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 81/175 (46%), Positives = 112/175 (64%), Gaps = 5/175 (2%)
Query: 41 QSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLT 100
+ +V + E W ++ + YK+ EK R +IFK+NL YI++ NK+ N +Y LG N+F+DLT
Sbjct: 16 ERLVRLFESWTLENDKIYKNIDEKIYRFEIFKDNLMYIDETNKK-NSSYWLGLNEFADLT 74
Query: 101 NDEFRALYTG-YKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGC 159
+DEF+A Y G S + F Y+++ D P S+DWR KGAVTP+KNQ CG
Sbjct: 75 HDEFKAKYVGSLGEDSTIIEQSDDEEFPYKHV--VDYPESIDWRQKGAVTPVKNQNPCGS 132
Query: 160 CWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQN 214
CWAF+ VA VEGI KI +G LI LSEQ+LLDC ++GC GG + + Y+ N
Sbjct: 133 CWAFSTVATVEGINKIVTGKLISLSEQELLDCDRR-SHGCKGGYQTTSLQYVADN 186
Score = 73.9 bits (180), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 46/132 (34%), Positives = 67/132 (50%), Gaps = 16/132 (12%)
Query: 152 KNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYI 211
K Q +C + + G ++ + N E L+ N + S+ +AF +
Sbjct: 198 KKQGKCRAKDKKGSKVKITGYKRVPANN-----EVSLIQAIANQPVSVVVESKGRAFQFY 252
Query: 212 IQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR----DE 267
GIF G CGT++DHAVT VG+G NY LIKNSWG WG+ GY++I R +
Sbjct: 253 --KGGIFEGPCGTKVDHAVTAVGYG-----KNYILIKNSWGPKWGEKGYIRIKRASGKSK 305
Query: 268 GLCGIGTRSSYP 279
G CG+ + S +P
Sbjct: 306 GTCGVYSSSYFP 317
>gi|351629615|gb|AEQ54771.1| KDDL-tailed cysteine proteinase CP4 [Coffea canephora]
Length = 359
Score = 155 bits (391), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 83/203 (40%), Positives = 126/203 (62%), Gaps = 13/203 (6%)
Query: 20 IIITLLVSCASQVVSSRS-THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
++ +LV+ S ++ R E+S+ +++E+W + H S +D EK R +FK N+ +I
Sbjct: 12 VLAVILVAAMSMEITERDLASEESLWDLYERWRSHHTVS-RDLSEKRKRFNVFKANVHHI 70
Query: 79 EKANKEGNRTYKLGTNQFSDLTNDEFRALYTG----YKMPSPSHRSTTSSTFKYQNLSMT 134
K N++ ++ YKL N F+D+TN EFR Y+ Y+M S +T K ++L
Sbjct: 71 HKVNQK-DKPYKLKLNSFADMTNHEFREFYSSKVKHYRMLHGSRANTGFMHGKTESL--- 126
Query: 135 DVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN 194
P S+DWR +GAVT +KNQ +CG CWAF+ V VEGI KI++G L+ LSEQ+L+DC T+
Sbjct: 127 --PASVDWRKQGAVTGVKNQGKCGSCWAFSTVVGVEGINKIKTGQLVSLSEQELVDCETD 184
Query: 195 GNNGCLGGSREKAFAYIIQNQGI 217
N GC GG E A+ +I ++ GI
Sbjct: 185 -NEGCNGGLMENAYEFIKKSGGI 206
Score = 82.0 bits (201), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 34/73 (46%), Positives = 50/73 (68%), Gaps = 6/73 (8%)
Query: 214 NQGIFNG-VCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR-----DE 267
++G++ G CG +LDH V +VG+GT DG YW++KNSWG WG+ GY+++ R +
Sbjct: 271 SEGVYAGDSCGNELDHGVAVVGYGTALDGTKYWIVKNSWGTGWGEQGYIRMQRGVDAAEG 330
Query: 268 GLCGIGTRSSYPL 280
G+CGI +SYPL
Sbjct: 331 GVCGIAMEASYPL 343
>gi|157787177|ref|NP_001099150.1| cathepsin L1-like precursor [Danio rerio]
gi|157422879|gb|AAI53505.1| MGC174152 protein [Danio rerio]
Length = 336
Score = 155 bits (391), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 81/202 (40%), Positives = 123/202 (60%), Gaps = 9/202 (4%)
Query: 20 IIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIE 79
++ LLV+ V + S+ + + + W +QHG+SY +++E R+ I++ENL IE
Sbjct: 1 MMFALLVTLCISAVFAASSIDIQLDDHWNSWKSQHGKSYHEDVEVGRRM-IWEENLRKIE 59
Query: 80 KANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDV 136
+ N E GN T+K+G NQF D+TN+EFR GYK + TS + S
Sbjct: 60 QHNFEYSYGNHTFKMGMNQFGDMTNEEFRHAMNGYKHDP----NQTSQGPLFMEPSFFAA 115
Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-TNG 195
P +DWR +G VTP+K+QK+CG CW+F++ A+EG ++G LI +SEQ L+DCS G
Sbjct: 116 PQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQG 175
Query: 196 NNGCLGGSREKAFAYIIQNQGI 217
N GC GG ++AF Y+ +N+G+
Sbjct: 176 NQGCNGGLMDQAFQYVKENKGL 197
Score = 65.5 bits (158), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 29/61 (47%), Positives = 41/61 (67%), Gaps = 4/61 (6%)
Query: 224 TQLDHAVTIVGFG---TTEDGANYWLIKNSWGNTWGDAGYMKIVRDE-GLCGIGTRSSYP 279
++LDHAV +VG+G G YW++KNSW + WGD GY+ + +D+ CGI T +SYP
Sbjct: 275 SRLDHAVLVVGYGYQGADVAGNRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGIATMASYP 334
Query: 280 L 280
L
Sbjct: 335 L 335
>gi|7523482|dbj|BAA94210.1| putative cysteine protease [Oryza sativa Japonica Group]
gi|10800060|dbj|BAB16480.1| putative cysteine protease [Oryza sativa Japonica Group]
Length = 349
Score = 155 bits (391), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 75/173 (43%), Positives = 105/173 (60%), Gaps = 9/173 (5%)
Query: 45 EIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEF 104
++ E+WMA+ G+ Y EKE R +F++N+ +I L NQF+DLTNDEF
Sbjct: 39 QMFEEWMAKFGKKYPCHGEKEYRFGVFRDNVRFIRSYRPPAGYNSALRVNQFADLTNDEF 98
Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFA 164
+ +TG K P P + + +P +DWR KGAVT +K+Q CG CWAFA
Sbjct: 99 VSTHTGAKPPCPKDAP--------RGVDPIWLPCCIDWRYKGAVTDVKDQGACGSCWAFA 150
Query: 165 AVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
AVAA+EG+T+IR+G L LSEQ+L+DC T G++GC GG ++AF + GI
Sbjct: 151 AVAAIEGLTQIRTGKLTPLSEQELVDCDT-GSSGCAGGHTDRAFELVAAKGGI 202
>gi|20260334|gb|AAM13065.1| drought-inducible cysteine proteinase RD21A precursor-like protein
[Arabidopsis thaliana]
gi|23197782|gb|AAN15418.1| drought-inducible cysteine proteinase RD21A precursor-like protein
[Arabidopsis thaliana]
Length = 357
Score = 155 bits (391), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 84/229 (36%), Positives = 132/229 (57%), Gaps = 22/229 (9%)
Query: 18 MFIIITLLVSCASQVVSSRSTHE-----QSVVE-----IHEKWMAQHGRSYKDELEKEMR 67
+ ++ ++ SCA+ + S +++ SV + I E WM +HG+ Y EKE R
Sbjct: 3 ILLVAMVIASCATAIDMSVVSYDDNNRLHSVFDAEASLIFESWMVKHGKVYGSVAEKERR 62
Query: 68 LKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFK 127
L IF++NL +I N E N +Y+LG F+DL+ E++ + G P + +S+ +
Sbjct: 63 LTIFEDNLRFINNRNAE-NLSYRLGLTGFADLSLHEYKEVCHGADPRPPRNHVFMTSSDR 121
Query: 128 YQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQ 187
Y+ + +P S+DWR++GAVT +K+Q C CWAF+ V AVEG+ KI +G L+ LSEQ
Sbjct: 122 YKTSADDVLPKSVDWRNEGAVTEVKDQGHCRSCWAFSTVGAVEGLNKIVTGELVTLSEQD 181
Query: 188 LLDCSTNGNNGCLGGSREKAFAYIIQNQGI----------FNGVCGTQL 226
L++C+ NNGC GG E A+ +I++N G+ NGVC +L
Sbjct: 182 LINCNKE-NNGCGGGKLETAYEFIMKNGGLGTDNDYPYKAVNGVCDGRL 229
Score = 91.3 bits (225), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 41/75 (54%), Positives = 55/75 (73%), Gaps = 5/75 (6%)
Query: 210 YIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD--- 266
+ + G+F+G CGT L+H V +VG+GT E+G +YWL+KNS G TWG+AGYMK+ R+
Sbjct: 272 FQLYESGVFDGSCGTNLNHGVVVVGYGT-ENGRDYWLVKNSRGITWGEAGYMKMARNIAN 330
Query: 267 -EGLCGIGTRSSYPL 280
GLCGI R+SYPL
Sbjct: 331 PRGLCGIAMRASYPL 345
>gi|147772785|emb|CAN62838.1| hypothetical protein VITISV_003391 [Vitis vinifera]
Length = 298
Score = 155 bits (391), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 85/192 (44%), Positives = 112/192 (58%), Gaps = 49/192 (25%)
Query: 21 IITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEK 80
++ +L + ASQ +SRS HE S+ E HE WMA++GR YKD EKE R KIFK+N+
Sbjct: 14 LLFILAAWASQA-TSRSLHEASMYERHEDWMARYGRMYKDANEKEKRFKIFKDNV----- 67
Query: 81 ANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSL 140
++TFKY+N+ T VP+++
Sbjct: 68 ----------------------------------------AQATTFKYENV--TAVPSTI 85
Query: 141 DWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGC 199
DWR KGAVTPIK+Q++CG CWAF+AVAA EGIT+I +G LI LSEQ+L+DC T G N GC
Sbjct: 86 DWRKKGAVTPIKDQQQCGSCWAFSAVAATEGITQITTGKLISLSEQELVDCDTGGENQGC 145
Query: 200 LGGSREKAFAYI 211
GG + AF +I
Sbjct: 146 SGGLXDDAFRFI 157
Score = 94.4 bits (233), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 40/78 (51%), Positives = 52/78 (66%), Gaps = 4/78 (5%)
Query: 208 FAYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD- 266
F + G+F G CGT+LDH V VG+G +DG YWL+KNSWG WG+ GY+++ RD
Sbjct: 221 FEFQFYTSGVFTGQCGTELDHGVAAVGYGIGDDGMXYWLVKNSWGTGWGEEGYIRMQRDV 280
Query: 267 ---EGLCGIGTRSSYPLA 281
EGLCGI ++SYP A
Sbjct: 281 TAKEGLCGIAMQASYPTA 298
>gi|312282059|dbj|BAJ33895.1| unnamed protein product [Thellungiella halophila]
Length = 379
Score = 155 bits (391), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 79/191 (41%), Positives = 114/191 (59%), Gaps = 12/191 (6%)
Query: 46 IHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFR 105
I E W+ +HG+ Y EKE RL IFK+NL +I N E N Y+LG N+F+DL+ E++
Sbjct: 63 IFESWIVKHGKVYDSVAEKERRLTIFKDNLRFITNRNSE-NLGYRLGLNRFADLSLHEYK 121
Query: 106 ALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAA 165
+ G P + SS+ +Y+ + +P S+DWR++GAVT +K+Q C CWAF+
Sbjct: 122 EICHGADPKPPRNHVFMSSSDRYKTSAGDVLPKSVDWRNEGAVTEVKDQGHCRSCWAFST 181
Query: 166 VAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI-------- 217
V AVEG+ KI +G L+ LSEQ L++C+ NNGC GG E A+ +I+ N G+
Sbjct: 182 VGAVEGLNKIVTGELVTLSEQDLINCNKE-NNGCGGGKVETAYEFIVSNGGLGTDNDYPY 240
Query: 218 --FNGVCGTQL 226
NG C +L
Sbjct: 241 KAVNGACDGRL 251
Score = 97.8 bits (242), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 42/75 (56%), Positives = 56/75 (74%), Gaps = 5/75 (6%)
Query: 210 YIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD--- 266
+ + G+F+G CGT L+H V +VG+GT E+G NYW+++NSWGNTWG+AGYMK+ R+
Sbjct: 294 FQLYESGVFDGRCGTNLNHGVVVVGYGT-ENGRNYWIVRNSWGNTWGEAGYMKMARNIAN 352
Query: 267 -EGLCGIGTRSSYPL 280
GLCGI R SYPL
Sbjct: 353 PRGLCGIAMRVSYPL 367
>gi|226507950|ref|NP_001151278.1| LOC100284911 precursor [Zea mays]
gi|195645488|gb|ACG42212.1| vignain precursor [Zea mays]
Length = 376
Score = 155 bits (391), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 79/186 (42%), Positives = 115/186 (61%), Gaps = 14/186 (7%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
E+++ ++E+W +H + +D +K R +FK N+ I + N+ + YKL N+F D+
Sbjct: 42 EEALWALYERWRGRHALA-RDLGDKARRFNVFKANVRLIHEFNRR-DEPYKLRLNRFGDM 99
Query: 100 TNDEFRALYTGYKMPSPSHR--------STTSSTFKYQNLSMTDVPTSLDWRDKGAVTPI 151
T DEFR Y G ++ HR S+ S++F Y + DVP S+DWR KGAVT +
Sbjct: 100 TADEFRRHYAGSRVAH--HRMFRGDRQGSSASASFMYAD--ARDVPASVDWRQKGAVTDV 155
Query: 152 KNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYI 211
K+Q +CG CWAF+ +AAVEGI I++ NL LSEQQL+DC T N GC GG + AF YI
Sbjct: 156 KDQGQCGSCWAFSTIAAVEGINAIKTKNLTSLSEQQLVDCDTKANAGCNGGLMDYAFQYI 215
Query: 212 IQNQGI 217
++ G+
Sbjct: 216 AKHGGV 221
Score = 96.3 bits (238), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 39/71 (54%), Positives = 52/71 (73%), Gaps = 4/71 (5%)
Query: 214 NQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGL 269
++G+F+G CGT+LDH VT VG+G T DG YWL+KNSWG WG+ GY+++ RD EG
Sbjct: 284 SEGVFSGRCGTELDHGVTAVGYGVTADGTKYWLVKNSWGPEWGEKGYIRMARDVAAKEGH 343
Query: 270 CGIGTRSSYPL 280
CGI +SYP+
Sbjct: 344 CGIAMEASYPV 354
>gi|194320502|gb|ACF48469.1| cathepsin L [Triatoma brasiliensis]
Length = 330
Score = 155 bits (391), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 82/198 (41%), Positives = 122/198 (61%), Gaps = 9/198 (4%)
Query: 24 LLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN- 82
LLV+ A +S E + A HG++YK++ E+ R+KIF +N + IE N
Sbjct: 4 LLVAVAIIALSYAHPSFDIYPEEWHVFKAMHGKTYKNQFEEMFRMKIFMDNKKKIEAHNA 63
Query: 83 --KEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSL 140
++G +YK+ N F DL EF+AL G+KM SP + F S +++P ++
Sbjct: 64 KYEQGEVSYKMMMNHFGDLMVHEFKALMNGFKM-SPDTKRNGELYFP----SNSNLPKTV 118
Query: 141 DWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGC 199
DWR KGAVTP+K+Q +CG CW+F+A ++EG +++G L+ LSEQ L+DCST+ GNNGC
Sbjct: 119 DWRQKGAVTPVKDQGQCGSCWSFSATGSLEGQVFLKTGKLVSLSEQNLVDCSTSYGNNGC 178
Query: 200 LGGSREKAFAYIIQNQGI 217
GG ++AF Y+ N+GI
Sbjct: 179 EGGLMDQAFQYVSDNKGI 196
Score = 72.4 bits (176), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 34/75 (45%), Positives = 50/75 (66%), Gaps = 4/75 (5%)
Query: 209 AYIIQNQGIFN--GVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD 266
++ ++G++N LDH V VG+GT E+G +YWL+KNSWG +WG+ GY+KI R+
Sbjct: 256 SFQFYSKGVYNEPNCSSYDLDHGVLAVGYGT-ENGQDYWLVKNSWGPSWGENGYIKIARN 314
Query: 267 EG-LCGIGTRSSYPL 280
CGI + +SYPL
Sbjct: 315 HSNHCGIASMASYPL 329
>gi|195124431|ref|XP_002006696.1| GI21205 [Drosophila mojavensis]
gi|193911764|gb|EDW10631.1| GI21205 [Drosophila mojavensis]
Length = 339
Score = 155 bits (391), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 105/319 (32%), Positives = 154/319 (48%), Gaps = 87/319 (27%)
Query: 46 IHEKWMA---QHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDL 99
I E+W +H ++Y+DE E+ RLKIF EN I K N+ G T+K+ N+++D+
Sbjct: 23 IKEEWHTFKLEHRKTYQDETEERFRLKIFNENKHKIAKHNQRYATGEVTFKMAVNKYADM 82
Query: 100 TNDEFRAL-----YTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQ 154
+ EFR YT +K S S T TF + + +P S+DWR+KGAVT +K+Q
Sbjct: 83 LHHEFRETMNGFNYTLHKELRASDPSFTGITF--ISPAHVKLPKSVDWREKGAVTAVKDQ 140
Query: 155 KECGCCWAFAAVAAVEG--------ITKIRSGNLIQLSEQ-------------------- 186
CG CWAF++ A+EG + + NL+ S +
Sbjct: 141 GHCGSCWAFSSTGALEGQHFRKTGTLVSLSEQNLVDCSAKYGNNGCNGGLMDNAFRYIKD 200
Query: 187 ------------QLLDCSTNGNNGCLGGSREKAFAYIIQ--------------------- 213
+ +D S + N +G + ++ FA I Q
Sbjct: 201 NGGIDTEKSYPYEGIDDSCHFNKDSVGAT-DRGFADIPQGNEKKMAEAVATIGPVSVAID 259
Query: 214 ---------NQGIFNGV-CGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMK 262
++GI+N C +Q LDH V +VG+GT E G +YWL+KNSWG TWGD G++K
Sbjct: 260 ASHESFQFYSEGIYNEPECNSQNLDHGVLVVGYGTDESGKDYWLVKNSWGTTWGDKGFIK 319
Query: 263 IVRDE-GLCGIGTRSSYPL 280
+ R+E CGI + SSYPL
Sbjct: 320 MARNEDNQCGIASASSYPL 338
>gi|154183745|gb|ABS70713.1| cathepsin L-like cysteine proteinase [Dermacentor variabilis]
Length = 333
Score = 154 bits (390), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 83/200 (41%), Positives = 124/200 (62%), Gaps = 8/200 (4%)
Query: 22 ITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKA 81
I+LL CA VV++ ++ + + E + A H +SY+ +E+ +R KIF EN + +
Sbjct: 4 ISLL--CAFVVVTTAASSHEILRTQWEAFKATHKKSYQSNMEELLRFKIFSENSLLVARH 61
Query: 82 NKEGNR---TYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPT 138
N++ R +YKLG NQF DL EF ++ GY+ + R +T N++ + +P
Sbjct: 62 NEKYARGLVSYKLGMNQFGDLLPHEFARMFNGYRGARTAGRGST--FLPPANVNYSSLPQ 119
Query: 139 SLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-TNGNN 197
S+DWR+KGAVTP+KNQ +CG CWAF+ ++EG +++G L+ LSEQ L+DCS T GN+
Sbjct: 120 SMDWREKGAVTPVKNQGQCGSCWAFSTTGSLEGQHFLKTGVLVSLSEQNLVDCSETFGNH 179
Query: 198 GCLGGSREKAFAYIIQNQGI 217
GC GG + AF YI N GI
Sbjct: 180 GCEGGLMDNAFQYIKANGGI 199
Score = 76.6 bits (187), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 35/75 (46%), Positives = 53/75 (70%), Gaps = 4/75 (5%)
Query: 209 AYIIQNQGIFNGV-CGT-QLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD 266
++ + ++G+++ C + QLDH V +VG+G EDG YWL+KNSW +WGD GY+K+ RD
Sbjct: 259 SFQLYSEGVYDETECSSEQLDHGVLVVGYGV-EDGKKYWLVKNSWAESWGDNGYIKMSRD 317
Query: 267 -EGLCGIGTRSSYPL 280
+ CGI + +SYPL
Sbjct: 318 KDNQCGIASAASYPL 332
>gi|414870137|tpg|DAA48694.1| TPA: vignain [Zea mays]
Length = 484
Score = 154 bits (390), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 79/187 (42%), Positives = 115/187 (61%), Gaps = 15/187 (8%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
E+++ ++E+W +H + +D +K R +FK N+ I + N+ + YKL N+F D+
Sbjct: 149 EEALWALYERWRGRHALA-RDLGDKARRFNVFKANVRLIHEFNRR-DEPYKLRLNRFGDM 206
Query: 100 TNDEFRALYTGYKMPSPSHR---------STTSSTFKYQNLSMTDVPTSLDWRDKGAVTP 150
T DEFR Y G ++ HR S ++S+F Y + DVP S+DWR KGAVT
Sbjct: 207 TADEFRRHYAGSRVAH--HRMFRGDRQGSSASASSFMYAD--ARDVPASVDWRQKGAVTD 262
Query: 151 IKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAY 210
+K+Q +CG CWAF+ +AAVEGI I++ NL LSEQQL+DC T N GC GG + AF Y
Sbjct: 263 VKDQGQCGSCWAFSTIAAVEGINAIKTKNLTSLSEQQLVDCDTKANAGCNGGLMDYAFQY 322
Query: 211 IIQNQGI 217
I ++ G+
Sbjct: 323 IAKHGGV 329
Score = 96.3 bits (238), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 38/71 (53%), Positives = 51/71 (71%), Gaps = 4/71 (5%)
Query: 214 NQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGL 269
++G+F+G CGT+LDH V VG+G T DG YWL+KNSWG WG+ GY+++ RD EG
Sbjct: 392 SEGVFSGRCGTELDHGVAAVGYGVTADGTKYWLVKNSWGPEWGEKGYIRMARDVAAKEGH 451
Query: 270 CGIGTRSSYPL 280
CGI +SYP+
Sbjct: 452 CGIAMEASYPV 462
>gi|157311713|ref|NP_001098585.1| uncharacterized protein LOC564979 precursor [Danio rerio]
gi|156230121|gb|AAI52284.1| Wu:fa26c03 protein [Danio rerio]
Length = 336
Score = 154 bits (390), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 81/202 (40%), Positives = 123/202 (60%), Gaps = 9/202 (4%)
Query: 20 IIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIE 79
++ LLV+ V + S+ + + + W +QHG+SY +++E R+ I++ENL IE
Sbjct: 1 MMFALLVTLCISAVFAASSIDIQLDDHWNSWKSQHGKSYHEDVEVGRRM-IWEENLRKIE 59
Query: 80 KANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDV 136
+ N E GN T+K+G NQF D+TN+EFR GYK + TS + S
Sbjct: 60 QHNFEYSYGNHTFKMGMNQFGDMTNEEFRQAMNGYKHDP----NRTSQGPLFMEPSFFAA 115
Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-TNG 195
P +DWR +G VTP+K+QK+CG CW+F++ A+EG ++G LI +SEQ L+DCS G
Sbjct: 116 PQQVDWRQRGFVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQG 175
Query: 196 NNGCLGGSREKAFAYIIQNQGI 217
N GC GG ++AF Y+ +N+G+
Sbjct: 176 NQGCNGGLMDQAFQYVKENKGL 197
Score = 65.1 bits (157), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 28/61 (45%), Positives = 41/61 (67%), Gaps = 4/61 (6%)
Query: 224 TQLDHAVTIVGFG---TTEDGANYWLIKNSWGNTWGDAGYMKIVRDE-GLCGIGTRSSYP 279
++LDHAV +VG+G G YW++KNSW + WGD GY+ + +D+ CG+ T +SYP
Sbjct: 275 SRLDHAVLVVGYGYQGADVAGNRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGVATSASYP 334
Query: 280 L 280
L
Sbjct: 335 L 335
>gi|326672302|ref|XP_003199633.1| PREDICTED: cathepsin L1-like [Danio rerio]
gi|157423549|gb|AAI53506.1| Im:6910535 [Danio rerio]
Length = 335
Score = 154 bits (390), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 85/204 (41%), Positives = 126/204 (61%), Gaps = 13/204 (6%)
Query: 20 IIITLLVS-CASQVVSSRSTHEQSVVEIH-EKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
++ LLV+ C S V ++ S Q ++ H W +QHG+SY +++E R+ I++ENL
Sbjct: 1 MMFALLVTLCISAVFTAPSIDIQ--LDDHWNSWKSQHGKSYHEDVEVGRRM-IWEENLRK 57
Query: 78 IEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT 134
IE+ N E GN T+K+G NQF D+TN+EFR GYK + TS + S
Sbjct: 58 IEQHNFEYSYGNHTFKMGMNQFGDMTNEEFRQAMNGYKQDP----NRTSKGALFMEPSFF 113
Query: 135 DVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-T 193
P +DWR +G VTP+K+QK+CG CW+F++ A+EG ++G LI +SEQ L+DCS
Sbjct: 114 AAPQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRP 173
Query: 194 NGNNGCLGGSREKAFAYIIQNQGI 217
GN GC GG ++AF Y+ +N+G+
Sbjct: 174 QGNQGCNGGIMDQAFQYVKENKGL 197
Score = 68.2 bits (165), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 30/67 (44%), Positives = 43/67 (64%), Gaps = 4/67 (5%)
Query: 218 FNGVCGTQLDHAVTIVGFG---TTEDGANYWLIKNSWGNTWGDAGYMKIVRDE-GLCGIG 273
+ C ++LDHAV +VG+G G YW++KNSW + WGD GY+ + +D+ CGI
Sbjct: 268 YERACTSRLDHAVLVVGYGYQGADVAGNRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGIA 327
Query: 274 TRSSYPL 280
T +SYPL
Sbjct: 328 TMASYPL 334
>gi|156739281|ref|NP_001096588.1| cathepsin L1-like precursor [Danio rerio]
gi|166158351|ref|NP_001107526.1| uncharacterized protein LOC100135391 precursor [Xenopus (Silurana)
tropicalis]
gi|326672305|ref|XP_003199634.1| PREDICTED: cathepsin L1-like [Danio rerio]
gi|156230096|gb|AAI52237.1| MGC174155 protein [Danio rerio]
gi|163916362|gb|AAI57707.1| LOC100135391 protein [Xenopus (Silurana) tropicalis]
Length = 335
Score = 154 bits (390), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 81/202 (40%), Positives = 123/202 (60%), Gaps = 9/202 (4%)
Query: 20 IIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIE 79
++ LLV+ V + S+ + + + W +QHG+SY +++E R+ I++ENL IE
Sbjct: 1 MMFALLVTLCISAVFAASSIDIQLDDHWNSWKSQHGKSYHEDVEVGRRM-IWEENLRKIE 59
Query: 80 KANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDV 136
+ N E GN T+K+G NQF D+TN+EFR GYK + TS + S
Sbjct: 60 QHNFEYSYGNHTFKMGMNQFGDMTNEEFRQAMNGYKHDP----NRTSQGPLFMEPSFFAA 115
Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-TNG 195
P +DWR +G VTP+K+QK+CG CW+F++ A+EG ++G LI +SEQ L+DCS G
Sbjct: 116 PQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQG 175
Query: 196 NNGCLGGSREKAFAYIIQNQGI 217
N GC GG ++AF Y+ +N+G+
Sbjct: 176 NQGCNGGIMDQAFQYVKENKGL 197
Score = 68.2 bits (165), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 30/67 (44%), Positives = 43/67 (64%), Gaps = 4/67 (5%)
Query: 218 FNGVCGTQLDHAVTIVGFG---TTEDGANYWLIKNSWGNTWGDAGYMKIVRDE-GLCGIG 273
+ C ++LDHAV +VG+G G YW++KNSW + WGD GY+ + +D+ CGI
Sbjct: 268 YERACTSRLDHAVLVVGYGYQGADVAGNRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGIA 327
Query: 274 TRSSYPL 280
T +SYPL
Sbjct: 328 TMASYPL 334
>gi|55740406|gb|AAV63979.1| cathepsin L1 precursor [Artemia parthenogenetica]
Length = 338
Score = 154 bits (390), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 80/201 (39%), Positives = 125/201 (62%), Gaps = 6/201 (2%)
Query: 21 IITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEK 80
+I LL + Q+ ++ S E H + A H + Y +LE+++R+KI+ EN + K
Sbjct: 6 LIFLLAAVLVQLSAALSLTNLLADEWH-LFKATHKKEYPSQLEEKLRMKIYLENKHKVAK 64
Query: 81 AN---KEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
N ++G ++Y++ N+F DL + EFR++ GY+ + S STF + + +VP
Sbjct: 65 HNILYEKGEKSYQVAMNKFGDLLHHEFRSIMNGYQHKK-QNSSRAESTFTFMEPANVEVP 123
Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GN 196
S+DWR+KGA+TP+K+Q +CG CWAF++ A+EG T ++G L+ LSEQ L+DCS GN
Sbjct: 124 ESVDWREKGAITPVKDQGQCGSCWAFSSTGALEGQTFRKTGKLVSLSEQNLIDCSGKYGN 183
Query: 197 NGCLGGSREKAFAYIIQNQGI 217
GC GG ++AF YI N+GI
Sbjct: 184 EGCNGGLMDQAFQYIKDNKGI 204
Score = 71.2 bits (173), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 30/56 (53%), Positives = 42/56 (75%), Gaps = 2/56 (3%)
Query: 226 LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 280
LDH V +VG+G+ ++G +YWL+KNSW WGD GY+KI R+ + CG+ T +SYPL
Sbjct: 283 LDHGVLVVGYGS-DNGEDYWLVKNSWSEHWGDEGYIKIARNRKNHCGVATAASYPL 337
>gi|351721126|ref|NP_001237199.1| cysteine proteinase precursor [Glycine max]
gi|31559530|dbj|BAC77523.1| cysteine proteinase [Glycine max]
gi|31559532|dbj|BAC77524.1| cysteine proteinase [Glycine max]
Length = 362
Score = 154 bits (390), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 87/212 (41%), Positives = 123/212 (58%), Gaps = 21/212 (9%)
Query: 19 FIIITLLVSCASQVVSSRSTH------EQSVVEIHEKWMAQH--GRSYKDELEKEMRLKI 70
F+ + L +S V +S H E+S+ +++E+W + H RS D K R +
Sbjct: 6 FLWVVLSLSLVLGVANSFDFHDKDLESEESLWDLYERWRSHHTVSRSLGD---KHKRFNV 62
Query: 71 FKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHR-----STTSST 125
FK N+ ++ NK ++ YKL N+F+D+TN EFR+ Y G K+ HR + T
Sbjct: 63 FKANMMHVHNTNKM-DKPYKLKLNKFADMTNHEFRSTYAGSKVNH--HRMFRDMPRGNGT 119
Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
F Y+ + VP S+DWR KGAVT +K+Q CG CWAF+ V AVEGI +I++ L+ LSE
Sbjct: 120 FMYEKVG--SVPASVDWRKKGAVTDVKDQGHCGSCWAFSTVVAVEGINQIKTNKLVSLSE 177
Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
Q+L+DC T N GC GG E AF +I Q GI
Sbjct: 178 QELVDCDTEENAGCNGGLMESAFQFIKQKGGI 209
>gi|414591546|tpg|DAA42117.1| TPA: hypothetical protein ZEAMMB73_388689 [Zea mays]
gi|414591547|tpg|DAA42118.1| TPA: hypothetical protein ZEAMMB73_388689 [Zea mays]
Length = 268
Score = 154 bits (390), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 73/189 (38%), Positives = 116/189 (61%), Gaps = 13/189 (6%)
Query: 40 EQSVVEIHEKWMAQH----GRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQ 95
E+S+ ++E+W + + R D+ ++ R +FKEN Y+ +AN++ R ++L N+
Sbjct: 34 EESLRALYERWRSHYHRVSPRDGDDKQQQARRFNVFKENARYVHEANRKDGRPFRLALNK 93
Query: 96 FSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS-------MTDVPTSLDWRDKGAV 148
F+D+T DEFR Y G + + HR+ + + T++P ++DWR +GAV
Sbjct: 94 FADMTTDEFRRTYAGSR--TRHHRAQLGEARSFAHAQHGRGGSGTTNLPPAVDWRLRGAV 151
Query: 149 TPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAF 208
T +K+Q +CG CWAF+A+AAVEG+ KI +G L+ LSEQ+L+DC N GC GG + AF
Sbjct: 152 TGVKDQGQCGSCWAFSAIAAVEGVNKIMTGKLVSLSEQELVDCDDVDNQGCDGGLMDYAF 211
Query: 209 AYIIQNQGI 217
YI +N G+
Sbjct: 212 QYIQRNGGV 220
>gi|218187750|gb|EEC70177.1| hypothetical protein OsI_00904 [Oryza sativa Indica Group]
gi|222617983|gb|EEE54115.1| hypothetical protein OsJ_00884 [Oryza sativa Japonica Group]
Length = 327
Score = 154 bits (390), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 75/173 (43%), Positives = 105/173 (60%), Gaps = 9/173 (5%)
Query: 45 EIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEF 104
++ E+WMA+ G+ Y EKE R +F++N+ +I L NQF+DLTNDEF
Sbjct: 17 QMFEEWMAKFGKKYPCHGEKEYRFGVFRDNVRFIRSYRPPAGYNSALRVNQFADLTNDEF 76
Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFA 164
+ +TG K P P + + +P +DWR KGAVT +K+Q CG CWAFA
Sbjct: 77 VSTHTGAKPPCPKDAP--------RGVDPIWLPCCIDWRYKGAVTDVKDQGACGSCWAFA 128
Query: 165 AVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
AVAA+EG+T+IR+G L LSEQ+L+DC T G++GC GG ++AF + GI
Sbjct: 129 AVAAIEGLTQIRTGKLTPLSEQELVDCDT-GSSGCAGGHTDRAFELVAAKGGI 180
>gi|194352762|emb|CAQ00109.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
gi|326517250|dbj|BAJ99991.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 367
Score = 154 bits (390), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 80/181 (44%), Positives = 110/181 (60%), Gaps = 5/181 (2%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
E S+ ++E+W QH + +D EK R +F+EN+ I + N+ G+ YKL N+F D+
Sbjct: 40 EDSLWALYERWREQHTVA-RDLGEKARRFNVFRENVRLIHEFNR-GDAPYKLRLNRFGDM 97
Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFKYQN---LSMTDVPTSLDWRDKGAVTPIKNQKE 156
T DEFR Y ++ S + + S+ DVP S+DWR KGAVT +K+Q +
Sbjct: 98 TADEFRRAYASSRVSHHRMFSLKEGGGGFMHGSAASVRDVPPSVDWRQKGAVTAVKDQGQ 157
Query: 157 CGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQG 216
CG CWAF+ +AAVEGI IRS NL LSEQQL+DC T N GC GG + AF YI ++ G
Sbjct: 158 CGSCWAFSTIAAVEGINAIRSKNLTSLSEQQLVDCDTKSNAGCNGGLMDYAFQYIAKHGG 217
Query: 217 I 217
+
Sbjct: 218 V 218
Score = 94.7 bits (234), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 39/71 (54%), Positives = 52/71 (73%), Gaps = 4/71 (5%)
Query: 214 NQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGL 269
++G+F G CGT+LDH V VG+GTT DG YW++KNSWG WG+ GY+++ RD EGL
Sbjct: 282 SEGVFAGKCGTELDHGVAAVGYGTTVDGTKYWIVKNSWGPEWGEKGYIRMKRDVKDKEGL 341
Query: 270 CGIGTRSSYPL 280
CGI +SYP+
Sbjct: 342 CGIAMEASYPV 352
>gi|414591545|tpg|DAA42116.1| TPA: hypothetical protein ZEAMMB73_388689 [Zea mays]
Length = 384
Score = 154 bits (390), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 73/189 (38%), Positives = 116/189 (61%), Gaps = 13/189 (6%)
Query: 40 EQSVVEIHEKWMAQH----GRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQ 95
E+S+ ++E+W + + R D+ ++ R +FKEN Y+ +AN++ R ++L N+
Sbjct: 34 EESLRALYERWRSHYHRVSPRDGDDKQQQARRFNVFKENARYVHEANRKDGRPFRLALNK 93
Query: 96 FSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS-------MTDVPTSLDWRDKGAV 148
F+D+T DEFR Y G + + HR+ + + T++P ++DWR +GAV
Sbjct: 94 FADMTTDEFRRTYAGSR--TRHHRAQLGEARSFAHAQHGRGGSGTTNLPPAVDWRLRGAV 151
Query: 149 TPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAF 208
T +K+Q +CG CWAF+A+AAVEG+ KI +G L+ LSEQ+L+DC N GC GG + AF
Sbjct: 152 TGVKDQGQCGSCWAFSAIAAVEGVNKIMTGKLVSLSEQELVDCDDVDNQGCDGGLMDYAF 211
Query: 209 AYIIQNQGI 217
YI +N G+
Sbjct: 212 QYIQRNGGV 220
Score = 87.8 bits (216), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 37/70 (52%), Positives = 46/70 (65%), Gaps = 4/70 (5%)
Query: 214 NQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR----DEGL 269
++G+F G CGT LDH V VG+GTT DG YW +KNSWG WG+ GY+++ R GL
Sbjct: 285 SEGVFTGSCGTDLDHGVAAVGYGTTGDGTKYWTVKNSWGEDWGERGYIRMQRGVPDSRGL 344
Query: 270 CGIGTRSSYP 279
CGI SYP
Sbjct: 345 CGIAMEPSYP 354
>gi|359359168|gb|AEV41073.1| putative cysteine protease [Oryza minuta]
Length = 499
Score = 154 bits (390), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 82/186 (44%), Positives = 122/186 (65%), Gaps = 12/186 (6%)
Query: 40 EQSVVEIHEKWMAQH---GRSYKDEL-EKEMRLKIFKENLEYIEKANKEGNRT--YKLGT 93
E +++ W+A+H G S+ + E E R ++F +NL++++ N + ++LG
Sbjct: 58 EAEARAVYDLWVARHRHGGGSHNGLVGEYERRFRVFWDNLKFVDAHNARADEHGGFRLGM 117
Query: 94 NQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAV-TPIK 152
N+F+DLTNDEFRA Y G P+ R + Y++ + +P S+DWRDKGAV P+K
Sbjct: 118 NRFADLTNDEFRAAYLG-TTPAGRGRHVGEA---YRHDGVEALPDSVDWRDKGAVVAPVK 173
Query: 153 NQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFAYI 211
NQ +CG CWAF+AVAAVEGI KI +G L+ LSEQ+L++C+ NG N+GC GG + AFA+I
Sbjct: 174 NQGQCGSCWAFSAVAAVEGINKIVTGELVSLSEQELVECARNGANSGCNGGMMDDAFAFI 233
Query: 212 IQNQGI 217
+N G+
Sbjct: 234 ARNGGL 239
Score = 75.5 bits (184), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 36/84 (42%), Positives = 51/84 (60%), Gaps = 9/84 (10%)
Query: 202 GSREKAFAYIIQNQGIFNGVCGTQLDHAVTIVGFGT-TEDGANYWLIKNSWGNTWGDAGY 260
G RE + + + G+F G CGT LDH V VG+GT G +YW ++NSWG WG+ GY
Sbjct: 296 GGRE----FQLYDSGVFTGRCGTSLDHGVVAVGYGTDAATGTDYWTVRNSWGPDWGENGY 351
Query: 261 MKIVRD----EGLCGIGTRSSYPL 280
+++ R+ G CGI +SYP+
Sbjct: 352 IRMERNVTARTGKCGIAMMASYPI 375
>gi|194719810|emb|CAR31335.1| pro-asclepain f [Gomphocarpus fruticosus subsp. fruticosus]
Length = 340
Score = 154 bits (390), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 80/207 (38%), Positives = 122/207 (58%), Gaps = 14/207 (6%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
+ I+ LL A +S+ + V+ ++E+W+ +H + Y EK R +IFK+NL Y
Sbjct: 5 VLILSFLLFVSAITCISTNWRSDDEVIALYEEWLVKHQKLYSSLGEKIKRFEIFKDNLRY 64
Query: 78 IEKAN---KEGNRTYKLGTNQFSDLTNDEFRALYTGYKM-------PSPSHRSTTSSTFK 127
I++ N K + + LG NQF+DLT DEF ++Y G + +P+H K
Sbjct: 65 IDQQNHYNKVNHMNFTLGLNQFADLTLDEFSSIYLGTSVDYEQIISSNPNHDDVEEDILK 124
Query: 128 YQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQ 187
+ ++P S+DWR+KG V PI+NQ +CG CW F+AVA++E + I+ G++I LSEQ+
Sbjct: 125 ---EDVVELPDSVDWREKGVVFPIRNQGKCGSCWTFSAVASIETLNGIKKGHMIALSEQE 181
Query: 188 LLDCSTNGNNGCLGGSREKAFAYIIQN 214
LLDC T + GC GG AFAY+ +N
Sbjct: 182 LLDCETI-SQGCKGGHYNNAFAYVAKN 207
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 38/71 (53%), Positives = 53/71 (74%), Gaps = 5/71 (7%)
Query: 214 NQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGL 269
++GIF+G CG LDHAV IVG+G+ + GANYW+++NSWG WG+ GYM+I ++ EG
Sbjct: 271 DRGIFSGACGPILDHAVNIVGYGS-KGGANYWIMRNSWGTNWGENGYMRIQKNSKHYEGH 329
Query: 270 CGIGTRSSYPL 280
CGI + SYP+
Sbjct: 330 CGIAMQPSYPV 340
>gi|3688528|emb|CAA06243.1| pre-pro-TPE4A protein [Pisum sativum]
Length = 360
Score = 154 bits (389), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 79/180 (43%), Positives = 117/180 (65%), Gaps = 11/180 (6%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
E+S+ +++E+W + H + + EK R +FK N+ ++ NK ++ YKL N+F+D+
Sbjct: 33 EKSLWDLYERWRSHHTVT-RSLDEKHNRFNVFKANVMHVHNTNKL-DKPYKLKLNKFADM 90
Query: 100 TNDEFRALYTGYKMPSPSHR-----STTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQ 154
TN EFR +Y K+ HR S + TF Y+N+ +VP+S+DWR KGAVT +K+Q
Sbjct: 91 TNYEFRRIYADSKVSH--HRMFRGMSNENGTFMYENVK--NVPSSIDWRKKGAVTDVKDQ 146
Query: 155 KECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQN 214
+CG CWAF+ + AVEGI +I++ L+ LSEQ+L+DC T GN GC GG E AF +I QN
Sbjct: 147 GQCGSCWAFSTIVAVEGINQIKTQKLVSLSEQELVDCDTGGNEGCNGGLMEYAFEFIKQN 206
>gi|391343119|ref|XP_003745860.1| PREDICTED: cathepsin L-like [Metaseiulus occidentalis]
Length = 385
Score = 154 bits (389), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 79/173 (45%), Positives = 108/173 (62%), Gaps = 7/173 (4%)
Query: 48 EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRAL 107
E + A+H + Y+ E+ MR IF+EN ++IE N + + LG N F DLTN E+R
Sbjct: 82 ENFKAEHNKKYESFPEELMRRLIFEENHQFIEDHNSKKEFDFYLGMNHFGDLTNKEYRER 141
Query: 108 YTGYKMP--SPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAA 165
Y GY+ P +PS S S + + DVP +DWRD+G VTP+KNQ +CG CWAF+A
Sbjct: 142 YLGYRRPENTPSKASYIFSRAE----KIEDVPDQIDWRDQGFVTPVKNQGQCGSCWAFSA 197
Query: 166 VAAVEGITKIRSGNLIQLSEQQLLDCST-NGNNGCLGGSREKAFAYIIQNQGI 217
V ++EG +G L+ LSEQ L+DCST GN+GC GG ++AF Y+ N GI
Sbjct: 198 VGSLEGQHFKSTGKLVSLSEQNLVDCSTPEGNSGCNGGWMDQAFEYVKDNHGI 250
Score = 63.2 bits (152), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 29/67 (43%), Positives = 46/67 (68%), Gaps = 3/67 (4%)
Query: 216 GIFN-GVCGT-QLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDEG-LCGI 272
G++N C T +LDH V +VG+G G ++W++KNSWG WG GY+++ R++G CGI
Sbjct: 317 GVYNVPWCSTSELDHGVLVVGYGKQFQGKDFWMVKNSWGVGWGIYGYIEMSRNKGNQCGI 376
Query: 273 GTRSSYP 279
+++S P
Sbjct: 377 ASKASIP 383
>gi|359359120|gb|AEV41026.1| putative cysteine protease [Oryza minuta]
Length = 464
Score = 154 bits (389), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 82/186 (44%), Positives = 123/186 (66%), Gaps = 12/186 (6%)
Query: 40 EQSVVEIHEKWMAQH---GRSYKDEL-EKEMRLKIFKENLEYIEKANKEGNRT--YKLGT 93
E +++ W+A+H G S+ + E E R ++F +NL++++ N + ++LG
Sbjct: 59 EAEARAVYDLWVARHRHGGGSHNGFVGEYERRFRVFWDNLKFVDAHNAHADEHGGFRLGM 118
Query: 94 NQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAV-TPIK 152
N+F+DLTNDEFRA Y G +P+ R Y++ + +P S+DWRDKGAV +P+K
Sbjct: 119 NRFADLTNDEFRAAYLG---TTPAGRGRHVGEM-YRHDGVEALPDSVDWRDKGAVVSPVK 174
Query: 153 NQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYI 211
NQ +CG CWAF+AVAAVEGI KI +G L+ LSEQ+L++C+ N GN+GC GG + AFA+I
Sbjct: 175 NQGQCGSCWAFSAVAAVEGINKIVTGELVSLSEQELVECARNRGNSGCNGGIMDDAFAFI 234
Query: 212 IQNQGI 217
+N G+
Sbjct: 235 TRNGGL 240
Score = 75.1 bits (183), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 36/84 (42%), Positives = 51/84 (60%), Gaps = 9/84 (10%)
Query: 202 GSREKAFAYIIQNQGIFNGVCGTQLDHAVTIVGFGT-TEDGANYWLIKNSWGNTWGDAGY 260
G RE + + + G+F G CGT LDH V VG+GT G +YW ++NSWG WG+ GY
Sbjct: 297 GGRE----FQLYDSGVFTGRCGTSLDHGVVAVGYGTDAATGTDYWTVRNSWGPDWGENGY 352
Query: 261 MKIVRD----EGLCGIGTRSSYPL 280
+++ R+ G CGI +SYP+
Sbjct: 353 IRMERNVTARTGKCGIAMMASYPI 376
>gi|359359215|gb|AEV41119.1| putative cysteine protease [Oryza officinalis]
Length = 499
Score = 154 bits (389), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 82/186 (44%), Positives = 122/186 (65%), Gaps = 12/186 (6%)
Query: 40 EQSVVEIHEKWMAQH---GRSYKDEL-EKEMRLKIFKENLEYIEKANKEGNRT--YKLGT 93
E +++ W+A+H G S+ + E E R ++F +NL++++ N + ++LG
Sbjct: 58 EAEARAVYDLWVARHRHGGDSHNGLVGEYERRFRVFWDNLKFVDAHNARADEHGGFRLGM 117
Query: 94 NQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAV-TPIK 152
N+F+DLTNDEFRA Y G P+ R + Y++ + +P S+DWRDKGAV P+K
Sbjct: 118 NRFADLTNDEFRAAYLG-TTPAGRGRHVGEA---YRHDGVEVLPDSVDWRDKGAVVAPVK 173
Query: 153 NQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFAYI 211
NQ +CG CWAF+AVAAVEGI KI +G L+ LSEQ+L++C+ NG N+GC GG + AFA+I
Sbjct: 174 NQGQCGSCWAFSAVAAVEGINKIVTGELVSLSEQELVECARNGANSGCNGGMMDDAFAFI 233
Query: 212 IQNQGI 217
+N G+
Sbjct: 234 ARNGGL 239
Score = 75.5 bits (184), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 36/84 (42%), Positives = 51/84 (60%), Gaps = 9/84 (10%)
Query: 202 GSREKAFAYIIQNQGIFNGVCGTQLDHAVTIVGFGT-TEDGANYWLIKNSWGNTWGDAGY 260
G RE + + + G+F G CGT LDH V VG+GT G +YW ++NSWG WG+ GY
Sbjct: 296 GGRE----FQLYDSGVFTGRCGTSLDHGVVAVGYGTDAATGTDYWTVRNSWGPDWGENGY 351
Query: 261 MKIVRD----EGLCGIGTRSSYPL 280
+++ R+ G CGI +SYP+
Sbjct: 352 IRMERNVTARTGKCGIAMMASYPI 375
>gi|1706261|sp|Q10717.1|CYSP2_MAIZE RecName: Full=Cysteine proteinase 2; Flags: Precursor
gi|644490|dbj|BAA08245.1| cysteine proteinase [Zea mays]
Length = 360
Score = 154 bits (389), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 98/303 (32%), Positives = 143/303 (47%), Gaps = 76/303 (25%)
Query: 49 KWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALY 108
++ ++G+SY+ E R +IF E+L+ + N++G +Y+LG N+F+D++ +EFRA
Sbjct: 61 RFAVRYGKSYESAAEVHKRFRIFSESLQLVRSTNRKG-LSYRLGINRFADMSWEEFRAT- 118
Query: 109 TGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAA 168
++ + + S T + + +P + DWR+ G V+P+KNQ CG CW F+ A
Sbjct: 119 ---RLGAAQNCSATLTGNHRMRAAAVALPETKDWREDGIVSPVKNQGHCGSCWTFSTTGA 175
Query: 169 VEGITKIRSGNLIQLSEQQLLDCSTNGNN-GCLGGSREKAFAYIIQNQGI---------- 217
+E +G I LSEQQL+DC NN GC GG +AF YI N G+
Sbjct: 176 LEAAYTQATGKPISLSEQQLVDCGFAFNNFGCNGGLPSQAFEYIKYNGGLDTEESYPYQG 235
Query: 218 FNGVCG---------------------TQLDHAVTIV-----------GF---------- 235
NG+C +L AV +V GF
Sbjct: 236 VNGICKFKNENVGVKVLDSVNITLGAEDELKDAVGLVRPVSVAFEVITGFRLYKSGVYTS 295
Query: 236 ---GTT---------------EDGANYWLIKNSWGNTWGDAGYMKIVRDEGLCGIGTRSS 277
GTT EDG YWLIKNSWG WGD GY K+ + +CG+ T +S
Sbjct: 296 DHCGTTPMDVNHAVLAVGYGVEDGVPYWLIKNSWGADWGDEGYFKMEMGKNMCGVATCAS 355
Query: 278 YPL 280
YP+
Sbjct: 356 YPI 358
>gi|194689248|gb|ACF78708.1| unknown [Zea mays]
gi|414885653|tpg|DAA61667.1| TPA: cysteine protease2 [Zea mays]
Length = 360
Score = 154 bits (389), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 98/303 (32%), Positives = 143/303 (47%), Gaps = 76/303 (25%)
Query: 49 KWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALY 108
++ ++G+SY+ E R +IF E+L+ + N++G +Y+LG N+F+D++ +EFRA
Sbjct: 61 RFAVRYGKSYESAAEVHKRFRIFSESLQLVRSTNRKG-LSYRLGINRFADMSWEEFRAT- 118
Query: 109 TGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAA 168
++ + + S T + + +P + DWR+ G V+P+KNQ CG CW F+ A
Sbjct: 119 ---RLGAAQNCSATLTGNHRMRAAAVALPETKDWREDGIVSPVKNQGHCGSCWTFSTTGA 175
Query: 169 VEGITKIRSGNLIQLSEQQLLDCSTNGNN-GCLGGSREKAFAYIIQNQGI---------- 217
+E +G I LSEQQL+DC NN GC GG +AF YI N G+
Sbjct: 176 LEAAYTQATGKPISLSEQQLIDCGFAFNNFGCNGGLPSQAFEYIKYNGGLDTEESYPYQG 235
Query: 218 FNGVCG---------------------TQLDHAVTIV-----------GF---------- 235
NG+C +L AV +V GF
Sbjct: 236 VNGICKFKNENVGVKVLDSVNITLGAEDELKDAVGLVRPVSVAFEVITGFRLYKSGVYTS 295
Query: 236 ---GTT---------------EDGANYWLIKNSWGNTWGDAGYMKIVRDEGLCGIGTRSS 277
GTT EDG YWLIKNSWG WGD GY K+ + +CG+ T +S
Sbjct: 296 DHCGTTPMDVNHAVLAVGYGVEDGVPYWLIKNSWGADWGDEGYFKMEMGKNMCGVATCAS 355
Query: 278 YPL 280
YP+
Sbjct: 356 YPI 358
>gi|225456820|ref|XP_002278323.1| PREDICTED: vignain [Vitis vinifera]
Length = 360
Score = 154 bits (389), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 80/183 (43%), Positives = 115/183 (62%), Gaps = 11/183 (6%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
E+S+ ++E+W + H S + EK R +FKEN+ ++ + NK+ + YKL N+F+D+
Sbjct: 31 EESLWNLYERWRSHHTVS-RSLDEKHKRFNVFKENVNFVHEFNKK-DEPYKLKLNKFADM 88
Query: 100 TNDEFRALYTGYKMPSPSHR-----STTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQ 154
TN EFR+ Y G K+ HR + +F Y+ + VP S+DWR KGAVTPIK+Q
Sbjct: 89 TNHEFRSTYAGSKVNH--HRMFRGSQHAAGSFMYEKVK--SVPPSVDWRKKGAVTPIKDQ 144
Query: 155 KECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQN 214
+CG CWAF+ V AVEGI I++ L+ LSEQ+L+DC T+ N GC GG AF +I +
Sbjct: 145 GQCGSCWAFSTVVAVEGINHIKTNKLVSLSEQELVDCDTSENQGCNGGLMGYAFEFIKEK 204
Query: 215 QGI 217
GI
Sbjct: 205 GGI 207
Score = 95.5 bits (236), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 49/131 (37%), Positives = 73/131 (55%), Gaps = 17/131 (12%)
Query: 157 CGCCWAFAAVAAVEGITKIRSGN---LIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQ 213
C + V +++G + N L++ + Q + + + GGS A+
Sbjct: 222 CDVSKVNSPVVSIDGHETVPPNNEDALLKAAANQPISVAIDA-----GGS-----AFQFY 271
Query: 214 NQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR----DEGL 269
++G+F G CGT LDH V IVG+GTT DG YW++KNSWG WG+ GY+++ R EGL
Sbjct: 272 SEGVFAGRCGTDLDHGVAIVGYGTTLDGTKYWIVKNSWGTDWGENGYIRMKRGISAKEGL 331
Query: 270 CGIGTRSSYPL 280
CGI +SYP+
Sbjct: 332 CGIAVEASYPI 342
>gi|156739289|ref|NP_001096592.1| uncharacterized protein LOC569326 precursor [Danio rerio]
gi|156230119|gb|AAI52283.1| Im:6910535 protein [Danio rerio]
Length = 335
Score = 154 bits (389), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 85/205 (41%), Positives = 125/205 (60%), Gaps = 14/205 (6%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIH-EKWMAQHGRSYKDELEKEMRLKIFKENLE 76
MF ++ L C S V ++ S Q ++ H W +QHG+SY +++E R+ I++ENL
Sbjct: 2 MFALLITL--CISAVFTAPSIDIQ--LDDHWNSWKSQHGKSYHEDVEVGRRM-IWEENLR 56
Query: 77 YIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
IE+ N E GN T+K+G NQF D+TN+EFR GYK + TS + S
Sbjct: 57 KIEQHNFEYSLGNHTFKMGMNQFGDMTNEEFRQAMNGYKQDP----NRTSKGALFMEPSF 112
Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS- 192
P +DWR +G VTP+K+QK+CG CW+F++ A+EG ++G LI +SEQ L+DCS
Sbjct: 113 FAAPQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSR 172
Query: 193 TNGNNGCLGGSREKAFAYIIQNQGI 217
GN GC GG ++AF Y+ +N+G+
Sbjct: 173 PQGNQGCNGGIMDQAFQYVKENKGL 197
Score = 68.2 bits (165), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 30/67 (44%), Positives = 43/67 (64%), Gaps = 4/67 (5%)
Query: 218 FNGVCGTQLDHAVTIVGFG---TTEDGANYWLIKNSWGNTWGDAGYMKIVRDE-GLCGIG 273
+ C ++LDHAV +VG+G G YW++KNSW + WGD GY+ + +D+ CGI
Sbjct: 268 YERACTSRLDHAVLVVGYGYQGADVAGNRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGIA 327
Query: 274 TRSSYPL 280
T +SYPL
Sbjct: 328 TMASYPL 334
>gi|226506492|ref|NP_001140873.1| uncharacterized protein LOC100272949 precursor [Zea mays]
gi|194701540|gb|ACF84854.1| unknown [Zea mays]
Length = 379
Score = 154 bits (389), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 81/218 (37%), Positives = 122/218 (55%), Gaps = 16/218 (7%)
Query: 12 KINTTPMFIIITLLVSCASQVVSSRSTHEQSVV------EIHEKWMAQHGRSYKDELEKE 65
+++ T + + + + S A ++ + E+ + +++E+W H R ++ EK
Sbjct: 3 QVSKTLLLVALVFVSSAAVELCRAIDFDERDLASDEALWDLYERWQTHH-RVHRHHGEKG 61
Query: 66 MRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKM------PSPSHR 119
R FKEN+ +I NK G+R Y+L N+F D+ +EFR+ + ++ SP+ R
Sbjct: 62 RRFGTFKENVRFIHAHNKRGDRPYRLRLNRFGDMGREEFRSTFADSRINDLRRQDSPAAR 121
Query: 120 STTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGN 179
+ F Y S D P S+DWR +GAVT +K Q CG CWAF+ V AVEGI IR+G+
Sbjct: 122 AGAVPGFMYD--SAADPPRSVDWRQEGAVTGVKVQGHCGSCWAFSTVVAVEGINAIRTGS 179
Query: 180 LIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
L LSEQ+L+DC T+ NGC GG E AF +I GI
Sbjct: 180 LASLSEQELIDCDTD-ENGCQGGLMENAFEFIKSFGGI 216
Score = 88.2 bits (217), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 35/75 (46%), Positives = 52/75 (69%), Gaps = 3/75 (4%)
Query: 209 AYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR--- 265
A+ ++G+F G CGT LDH V VG+G +DG YW++KNSWG +WG+ GY+++ R
Sbjct: 279 AFQFYSEGVFTGDCGTDLDHGVAAVGYGVGDDGTPYWIVKNSWGTSWGEGGYIRMQRGAG 338
Query: 266 DEGLCGIGTRSSYPL 280
+ GLCGI +S+P+
Sbjct: 339 NGGLCGIAMEASFPI 353
>gi|30685308|ref|NP_566634.2| putative cysteine proteinase [Arabidopsis thaliana]
gi|30315949|sp|Q9LT77.1|CPR1_ARATH RecName: Full=Probable cysteine proteinase At3g19400; Flags:
Precursor
gi|11994462|dbj|BAB02464.1| cysteine proteinase [Arabidopsis thaliana]
gi|332642715|gb|AEE76236.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 362
Score = 154 bits (389), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 77/180 (42%), Positives = 117/180 (65%), Gaps = 4/180 (2%)
Query: 39 HEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSD 98
+E V ++E+W+ ++ ++Y EKE R KIFK+NL+++++ N +RT+++G +F+D
Sbjct: 36 NETEVRLMYEQWLVENRKNYNGLGEKERRFKIFKDNLKFVDEHNSVPDRTFEVGLTRFAD 95
Query: 99 LTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECG 158
LTN+EFRA+Y KM + S + + Y+ + +P +DWR GAV +K+Q CG
Sbjct: 96 LTNEEFRAIYLRKKMER-TKDSVKTERYLYKEGDV--LPDEVDWRANGAVVSVKDQGNCG 152
Query: 159 CCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGI 217
CWAF+AV AVEGI +I +G LI LSEQ+L+DC N GC GG AF +I++N GI
Sbjct: 153 SCWAFSAVGAVEGINQITTGELISLSEQELVDCDRGFVNAGCDGGIMNYAFEFIMKNGGI 212
Score = 79.0 bits (193), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 35/75 (46%), Positives = 48/75 (64%), Gaps = 5/75 (6%)
Query: 209 AYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-- 266
A+ + G+ G CG LDH V +VG+G+T G +YW+I+NSWG WGD+GY+K+ R+
Sbjct: 274 AFQLYKSGVMTGTCGISLDHGVVVVGYGSTS-GEDYWIIRNSWGLNWGDSGYVKLQRNID 332
Query: 267 --EGLCGIGTRSSYP 279
G CGI SYP
Sbjct: 333 DPFGKCGIAMMPSYP 347
>gi|357158628|ref|XP_003578189.1| PREDICTED: thiol protease aleurain-like [Brachypodium distachyon]
Length = 363
Score = 154 bits (389), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 102/306 (33%), Positives = 147/306 (48%), Gaps = 83/306 (27%)
Query: 49 KWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALY 108
++ ++G+SY+ E + R +IF E+LE + N++G +Y+LG N++SD++ +EF+A
Sbjct: 64 RFAVRYGKSYESAAEVQRRFRIFSESLEEVRSTNQKG-LSYRLGINRYSDMSWEEFQASR 122
Query: 109 TGYKMPSPSHRSTTSSTFKYQNLSMTD---VPTSLDWRDKGAVTPIKNQKECGCCWAFAA 165
G T S+T + N M D +P + DWR+ G V+P+K+Q CG CW F+
Sbjct: 123 LGAAQ-------TCSATLR-GNHRMQDANALPETKDWREDGIVSPVKDQSHCGSCWTFST 174
Query: 166 VAAVEGITKIRSGNLIQLSEQQLLDCSTNGNN-GCLGGSREKAFAYIIQNQGI------- 217
A+E +G I LSEQQL+DC+ NN GC GG +AF YI N G+
Sbjct: 175 TGALEAAYTQATGKNISLSEQQLVDCAGAYNNFGCNGGLPSQAFEYIKYNGGLDTEESYP 234
Query: 218 ---FNGVCG---------------------TQLDHAVTIV-----------GF------- 235
NGVC +L +AV +V GF
Sbjct: 235 YKGVNGVCHYKPENAAVQVLDSVNITLNAEDELQNAVGLVRPVSVAFEVINGFRQYKSGV 294
Query: 236 ------GTTEDGAN---------------YWLIKNSWGNTWGDAGYMKIVRDEGLCGIGT 274
GTT D N YWLIKNSWG +WGD GY K+ R + +C + T
Sbjct: 295 YTSDHCGTTPDDVNHAVLAVGYGVENGTPYWLIKNSWGESWGDKGYFKMERGKNMCAVAT 354
Query: 275 RSSYPL 280
+SYP+
Sbjct: 355 CASYPI 360
>gi|195624522|gb|ACG34091.1| thiol protease aleurain precursor [Zea mays]
Length = 360
Score = 154 bits (389), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 98/303 (32%), Positives = 143/303 (47%), Gaps = 76/303 (25%)
Query: 49 KWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALY 108
++ ++G+SY+ E R +IF E+L+ + N++G +Y+LG N+F+D++ +EFRA
Sbjct: 61 RFAVRYGKSYESAAEVHKRFRIFSESLQLVRSTNRKG-LSYRLGINRFADMSWEEFRAT- 118
Query: 109 TGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAA 168
++ + + S T + + +P + DWR+ G V+P+KNQ CG CW F+ A
Sbjct: 119 ---RLGAAQNCSATLTGNHRMRAAAVALPETKDWREDGIVSPVKNQGHCGSCWTFSTTGA 175
Query: 169 VEGITKIRSGNLIQLSEQQLLDCSTNGNN-GCLGGSREKAFAYIIQNQGI---------- 217
+E +G I LSEQQL+DC NN GC GG +AF YI N G+
Sbjct: 176 LEAAYTQATGKPISLSEQQLIDCGFAFNNFGCNGGLPSQAFEYIKYNGGLDTEESYPYQG 235
Query: 218 FNGVCG---------------------TQLDHAVTIV-----------GF---------- 235
NG+C +L AV +V GF
Sbjct: 236 VNGICKFKNENVGFKVLDSVNITLGAEDELKDAVGLVRPVSVAFEVITGFRLYKSGVYTS 295
Query: 236 ---GTT---------------EDGANYWLIKNSWGNTWGDAGYMKIVRDEGLCGIGTRSS 277
GTT EDG YWLIKNSWG WGD GY K+ + +CG+ T +S
Sbjct: 296 DHCGTTPMDVNHAVLAVGYGVEDGVPYWLIKNSWGADWGDEGYFKMEMGKNMCGVATCAS 355
Query: 278 YPL 280
YP+
Sbjct: 356 YPI 358
>gi|224116880|ref|XP_002317417.1| predicted protein [Populus trichocarpa]
gi|118488173|gb|ABK95906.1| unknown [Populus trichocarpa]
gi|222860482|gb|EEE98029.1| predicted protein [Populus trichocarpa]
Length = 498
Score = 154 bits (389), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 80/194 (41%), Positives = 118/194 (60%), Gaps = 13/194 (6%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN--KEGNRTYKLGTNQFS 97
E+ + E+ + W +H + YK E E R+ FK NL+YI + N ++ +K+G N+F+
Sbjct: 43 EEGITEVFKLWKEKHQKVYKHAEEAERRIGNFKRNLKYIIEKNGKRKSGLEHKVGLNKFA 102
Query: 98 DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKEC 157
DL+N+EFR +Y K+ P T K+++L D P+SLDWR+KG VT +K+Q +C
Sbjct: 103 DLSNEEFREMYLS-KVKKPI---TIEEKRKHRHLQTCDAPSSLDWRNKGVVTAVKDQGDC 158
Query: 158 GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
G CW+F+ A+E I I +G+LI LSEQ+L+DC T N GC GG + AF ++I N GI
Sbjct: 159 GSCWSFSTTGAIEAINAIVTGDLISLSEQELVDCDTTNNYGCEGGDMDSAFQWVIGNGGI 218
Query: 218 -------FNGVCGT 224
+ GV GT
Sbjct: 219 DTEADYPYTGVDGT 232
Score = 66.2 bits (160), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 39/123 (31%), Positives = 63/123 (51%), Gaps = 20/123 (16%)
Query: 166 VAAVEGITKIRSGN--LIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIFNGVCG 223
V ++EG + + L+ + QQ + +G+ A + + GI++G C
Sbjct: 242 VVSIEGYVDVDPSDSALLCATVQQPISVGMDGS----------ALDFQLYTGGIYDGDCS 291
Query: 224 ---TQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE----GLCGIGTRS 276
+DHA+ IVG+G+ E+ +YW++KNSWG WG GY I R+ G+C I +
Sbjct: 292 GDPNDIDHAILIVGYGS-ENDEDYWIVKNSWGTEWGMEGYFYIRRNTSKPYGVCAINADA 350
Query: 277 SYP 279
SYP
Sbjct: 351 SYP 353
>gi|357129125|ref|XP_003566217.1| PREDICTED: thiol protease SEN102-like [Brachypodium distachyon]
Length = 380
Score = 154 bits (389), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 80/186 (43%), Positives = 115/186 (61%), Gaps = 10/186 (5%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
E+S+ ++E+W A+H S +D EK R +F+EN + + N + YKL N+F+DL
Sbjct: 42 EESLWALYERWRARHTVS-RDLAEKSRRFNVFRENARLVHEFNLRRDAPYKLRLNRFADL 100
Query: 100 TNDEFRALYTG-----YKMPSPSHRSTTSSTFKYQNLSMTD---VPTSLDWRDKGAVTPI 151
T+DEFR Y ++M P + + + S T +PTS+DWR+KGAVT +
Sbjct: 101 TSDEFRRSYASSRVSHHRMFKP-RAANNNDDDDDKGSSFTHGGALPTSVDWREKGAVTGV 159
Query: 152 KNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYI 211
K+Q +CG CWAF+ +AAVEGI IR+ NL LSEQQL+DC T N GC GG + AF+YI
Sbjct: 160 KDQGQCGSCWAFSTIAAVEGINAIRTNNLTSLSEQQLVDCDTKTNAGCDGGLMDDAFSYI 219
Query: 212 IQNQGI 217
++ G+
Sbjct: 220 AKHGGV 225
Score = 94.7 bits (234), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 38/71 (53%), Positives = 51/71 (71%), Gaps = 4/71 (5%)
Query: 214 NQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGL 269
++G+F G CGT+LDH V VG+G T DG YW++KNSWG WG+ GY+++ RD EGL
Sbjct: 291 SEGVFAGKCGTELDHGVAAVGYGVTVDGTKYWIVKNSWGEEWGEKGYIRMKRDVADKEGL 350
Query: 270 CGIGTRSSYPL 280
CGI +SYP+
Sbjct: 351 CGIAMEASYPV 361
>gi|242040563|ref|XP_002467676.1| hypothetical protein SORBIDRAFT_01g032090 [Sorghum bicolor]
gi|241921530|gb|EER94674.1| hypothetical protein SORBIDRAFT_01g032090 [Sorghum bicolor]
Length = 358
Score = 154 bits (388), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 75/170 (44%), Positives = 104/170 (61%), Gaps = 4/170 (2%)
Query: 49 KWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALY 108
+W A + RSY E++ R ++++ N+E+IE N+ GN TY LG NQF+DLT +EF LY
Sbjct: 59 RWQATYNRSYPTAEERQRRFQVYRRNMEHIEATNRAGNLTYTLGENQFADLTEEEFLDLY 118
Query: 109 TGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQ-KECGCCWAFAAVA 167
T MP + + S+ D PTS+DWR +GAVTPIKNQ C CWAF A
Sbjct: 119 TMKGMPPVRRDAGKKQQANFS--SVVDAPTSVDWRSRGAVTPIKNQGPSCSSCWAFVTAA 176
Query: 168 AVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
+E IT+IR+G L+ LSEQ+L+DC + GC G + ++IQN G+
Sbjct: 177 TIESITQIRTGKLVSLSEQELIDCDPY-DGGCNLGYFVNGYKWVIQNGGL 225
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 34/68 (50%), Positives = 50/68 (73%), Gaps = 3/68 (4%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKI---VRDEGLCGI 272
G+++G CGT+++HA+T+VG+G G YWL+KNSWG TWG+ GY+++ VR GLCGI
Sbjct: 290 GVWSGQCGTRMNHAITVVGYGADSSGVKYWLVKNSWGQTWGERGYLRMRKDVRQGGLCGI 349
Query: 273 GTRSSYPL 280
+YP+
Sbjct: 350 ALDLAYPI 357
>gi|357154164|ref|XP_003576692.1| PREDICTED: vignain-like [Brachypodium distachyon]
Length = 427
Score = 154 bits (388), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 84/204 (41%), Positives = 117/204 (57%), Gaps = 18/204 (8%)
Query: 48 EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRAL 107
E+WM +HGR+Y + EK+ R +++KENL IE+ N G Y L N+F+DLTN+EFRA
Sbjct: 120 EQWMGKHGRAYANGGEKQRRFEVYKENLALIEEFNS-GGHGYTLTDNKFADLTNEEFRAK 178
Query: 108 YTGYKMPSPSHRSTTSSTFKYQNL----SMTDVPTSLDWRDKGAVTPIKNQKECGCCWAF 163
G P R L + TD+P +DWR KGAV +KNQ CG CWAF
Sbjct: 179 MLGGLGADPDRRRRARHASNALELPGNDNSTDLPKDVDWRKKGAVVEVKNQGSCGSCWAF 238
Query: 164 AAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI------ 217
+AVAA+EG+ +I++G L+ LSEQ+L+DC GC GG AF +++ N G+
Sbjct: 239 SAVAAMEGLNQIKNGKLVSLSEQELVDCDAEA-VGCAGGFMSWAFEFVMANHGLTTEASY 297
Query: 218 ----FNGVCGTQL--DHAVTIVGF 235
NG C T + +V+I G+
Sbjct: 298 PYKGINGACQTAKLNESSVSITGY 321
>gi|26452046|dbj|BAC43113.1| putative cysteine proteinase RD21A precursor [Arabidopsis thaliana]
Length = 362
Score = 154 bits (388), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 77/180 (42%), Positives = 117/180 (65%), Gaps = 4/180 (2%)
Query: 39 HEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSD 98
+E V ++E+W+ ++ ++Y EKE R KIFK+NL+++++ N +RT+++G +F+D
Sbjct: 36 NETEVRLMYEQWLVENRKNYNGLGEKERRFKIFKDNLKFVDEHNSVPDRTFEVGLTRFAD 95
Query: 99 LTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECG 158
LTN+EFRA+Y KM + S + + Y+ + +P +DWR GAV +K+Q CG
Sbjct: 96 LTNEEFRAIYLRKKMER-NKDSVKTERYLYKEGDV--LPDEVDWRANGAVVSVKDQGNCG 152
Query: 159 CCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGI 217
CWAF+AV AVEGI +I +G LI LSEQ+L+DC N GC GG AF +I++N GI
Sbjct: 153 SCWAFSAVGAVEGINQITTGELISLSEQELVDCDRGFVNAGCDGGIMNYAFEFIMKNGGI 212
Score = 79.0 bits (193), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 35/75 (46%), Positives = 48/75 (64%), Gaps = 5/75 (6%)
Query: 209 AYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-- 266
A+ + G+ G CG LDH V +VG+G+T G +YW+I+NSWG WGD+GY+K+ R+
Sbjct: 274 AFQLYKSGVMTGTCGISLDHGVVVVGYGSTS-GEDYWIIRNSWGLNWGDSGYVKLQRNID 332
Query: 267 --EGLCGIGTRSSYP 279
G CGI SYP
Sbjct: 333 DPFGKCGIAMMPSYP 347
>gi|94480716|emb|CAI91577.1| cathepsin L [Aphrocallistes vastus]
Length = 329
Score = 154 bits (388), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 81/194 (41%), Positives = 120/194 (61%), Gaps = 5/194 (2%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
M ++ LL A V + E+ E W ++ RSY L++E+R KI+ N+ Y
Sbjct: 1 MKLVFLLLGLFAGACVCLQCETEEVQDFAWEGWKLKYNRSYG--LDEELRKKIWANNMLY 58
Query: 78 IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
+++ N EG+ +YKL NQF+DLTN E+R +Y GY + R F+ + + D+P
Sbjct: 59 VKEFNAEGH-SYKLAANQFADLTNLEYRQIYLGYDNEARLSRKREGKVFQ-RKMKDEDLP 116
Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GN 196
T++DWR KG VTP+KNQ +CG CW+F+A ++EG I+SG L+ SEQ+L+DCST+ GN
Sbjct: 117 TTVDWRSKGVVTPVKNQGQCGSCWSFSATGSLEGQYAIKSGKLVSFSEQELVDCSTSLGN 176
Query: 197 NGCLGGSREKAFAY 210
+GC GG + AF Y
Sbjct: 177 HGCQGGLMDYAFKY 190
Score = 70.9 bits (172), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 32/56 (57%), Positives = 40/56 (71%), Gaps = 1/56 (1%)
Query: 224 TQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDEGLCGIGTRSSYP 279
T+LDH V +VG+GT ++G +YWLIKNSWG WG GY KI CGI T++SYP
Sbjct: 273 TKLDHGVLVVGYGT-DNGVDYWLIKNSWGMAWGMDGYFKIEMKSDKCGICTQASYP 327
>gi|242071345|ref|XP_002450949.1| hypothetical protein SORBIDRAFT_05g021550 [Sorghum bicolor]
gi|241936792|gb|EES09937.1| hypothetical protein SORBIDRAFT_05g021550 [Sorghum bicolor]
Length = 371
Score = 154 bits (388), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 78/187 (41%), Positives = 118/187 (63%), Gaps = 11/187 (5%)
Query: 40 EQSVVEIHEKW----MAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQ 95
E+S+ ++E+W M +++ +K +FKEN+ YI +ANK+G R+++L N+
Sbjct: 35 EESLRALYEQWRSHYMVSRPAGLQEQDDKARWFNVFKENVRYIHEANKKG-RSFRLALNK 93
Query: 96 FSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT-----DVPTSLDWRDKGAVTP 150
F+D+T DEFR Y + HR+ +S ++ + S ++P ++DWR +GAVT
Sbjct: 94 FADMTTDEFRRAYAAGSR-TRHHRALSSGIRRHGDGSFMYAQAGNLPLAVDWRQRGAVTG 152
Query: 151 IKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAY 210
IK+Q +CG CWAF+ +AAVEGI KIR+G L+ LSEQ+L+DC N GC GG + AF Y
Sbjct: 153 IKDQGQCGSCWAFSTIAAVEGINKIRTGKLVSLSEQELVDCDDVDNQGCNGGLMDYAFQY 212
Query: 211 IIQNQGI 217
I +N GI
Sbjct: 213 IKRNGGI 219
Score = 89.4 bits (220), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 36/70 (51%), Positives = 48/70 (68%), Gaps = 4/70 (5%)
Query: 214 NQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR----DEGL 269
++G+F G CGT+LDH V VG+G T DG YW++KNSWG WG+ GY+++ R +GL
Sbjct: 284 SEGVFTGSCGTELDHGVAAVGYGITRDGTKYWIVKNSWGEDWGERGYIRMQRGISDSQGL 343
Query: 270 CGIGTRSSYP 279
CGI SYP
Sbjct: 344 CGIAMEPSYP 353
>gi|297799636|ref|XP_002867702.1| hypothetical protein ARALYDRAFT_329301 [Arabidopsis lyrata subsp.
lyrata]
gi|297313538|gb|EFH43961.1| hypothetical protein ARALYDRAFT_329301 [Arabidopsis lyrata subsp.
lyrata]
Length = 357
Score = 154 bits (388), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 85/198 (42%), Positives = 127/198 (64%), Gaps = 10/198 (5%)
Query: 25 LVSCASQVVSSRSTHEQSVVE---IHEKWMAQHGRSYKDEL-EKEMRLKIFKENLEYIEK 80
+ S A + ++ H +S E I + WM++HG++Y + L EKE R + FK+NL +I++
Sbjct: 23 VPSSAIDLPATSGGHNRSNEEVGFIFQMWMSKHGKTYTNALGEKERRFQNFKDNLRFIDQ 82
Query: 81 ANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSL 140
N + N +Y+LG +F+DLT E+R L+ G P P R+ S +Y L +P S+
Sbjct: 83 HNAK-NLSYQLGLTRFADLTVQEYRDLFPG--SPKPKQRNLRISR-RYVPLDGDQLPESV 138
Query: 141 DWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCL 200
DWR++GAV+ IK+Q C CWAF+ VAAVEGI KI +G L+ LSEQ+L+DC+ NNGC
Sbjct: 139 DWRNEGAVSAIKDQGTCNSCWAFSTVAAVEGINKIVTGELVSLSEQELVDCNLV-NNGCY 197
Query: 201 G-GSREKAFAYIIQNQGI 217
G G+ + AF ++I N G+
Sbjct: 198 GSGTMDAAFQFLINNGGL 215
Score = 94.7 bits (234), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 41/83 (49%), Positives = 60/83 (72%), Gaps = 5/83 (6%)
Query: 202 GSREKAFAYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYM 261
G +K+ +++ GI+NG CGT LDHA+ IVG+G+ E+G +YW+++NSWG TWGDAGY
Sbjct: 269 GVDKKSQEFMLYRSGIYNGPCGTDLDHALVIVGYGS-ENGQDYWIVRNSWGTTWGDAGYA 327
Query: 262 KIVRD----EGLCGIGTRSSYPL 280
K+ R+ G+CGI +SYP+
Sbjct: 328 KMARNFEYPSGVCGIAMLASYPV 350
>gi|42572491|ref|NP_974341.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|332642714|gb|AEE76235.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 290
Score = 153 bits (387), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 77/180 (42%), Positives = 117/180 (65%), Gaps = 4/180 (2%)
Query: 39 HEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSD 98
+E V ++E+W+ ++ ++Y EKE R KIFK+NL+++++ N +RT+++G +F+D
Sbjct: 36 NETEVRLMYEQWLVENRKNYNGLGEKERRFKIFKDNLKFVDEHNSVPDRTFEVGLTRFAD 95
Query: 99 LTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECG 158
LTN+EFRA+Y KM + S + + Y+ + +P +DWR GAV +K+Q CG
Sbjct: 96 LTNEEFRAIYLRKKMER-TKDSVKTERYLYKEGDV--LPDEVDWRANGAVVSVKDQGNCG 152
Query: 159 CCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGI 217
CWAF+AV AVEGI +I +G LI LSEQ+L+DC N GC GG AF +I++N GI
Sbjct: 153 SCWAFSAVGAVEGINQITTGELISLSEQELVDCDRGFVNAGCDGGIMNYAFEFIMKNGGI 212
>gi|162138968|ref|NP_001104662.1| uncharacterized protein LOC567623 precursor [Danio rerio]
gi|158254065|gb|AAI54241.1| Zgc:174153 protein [Danio rerio]
Length = 336
Score = 153 bits (387), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 86/205 (41%), Positives = 124/205 (60%), Gaps = 14/205 (6%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIH-EKWMAQHGRSYKDELEKEMRLKIFKENLE 76
MF +I L C S V ++ S Q ++ H W +QHG+SY +++E R+ I++ENL
Sbjct: 2 MFALIITL--CISAVFTAPSIDIQ--LDDHWNSWKSQHGKSYHEDVEVGRRM-IWEENLR 56
Query: 77 YIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
IE+ N E GN T+K+G NQF D+TN+EFR GYK + TS + S
Sbjct: 57 KIEQHNFEYSYGNHTFKMGMNQFGDMTNEEFRQAMNGYKHDP----NRTSQGPLFMEPSF 112
Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS- 192
P +DWR +G VTP+K+QK+CG CW+F++ A+EG ++G LI +SEQ L+DCS
Sbjct: 113 FAAPQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSR 172
Query: 193 TNGNNGCLGGSREKAFAYIIQNQGI 217
GN GC GG + AF Y+ +N+G+
Sbjct: 173 PQGNQGCNGGLMDLAFQYVKENKGL 197
Score = 66.2 bits (160), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 28/61 (45%), Positives = 42/61 (68%), Gaps = 4/61 (6%)
Query: 224 TQLDHAVTIVGFG---TTEDGANYWLIKNSWGNTWGDAGYMKIVRDE-GLCGIGTRSSYP 279
++LDHAV +VG+G G YW++KNSW + WGD GY+ + +D+ CG+ T++SYP
Sbjct: 275 SRLDHAVLVVGYGYQGADVAGNRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGVATKASYP 334
Query: 280 L 280
L
Sbjct: 335 L 335
>gi|225458143|ref|XP_002280937.1| PREDICTED: cysteine proteinase RD21a [Vitis vinifera]
gi|302142569|emb|CBI19772.3| unnamed protein product [Vitis vinifera]
Length = 436
Score = 153 bits (387), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 78/187 (41%), Positives = 114/187 (60%), Gaps = 7/187 (3%)
Query: 32 VVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKL 91
V S + S ++ E W Q+G++Y E EK RLK+F+EN ++ + N N +Y L
Sbjct: 14 AVHSSVSEASSTADLFEAWCEQYGKTYSSEEEKASRLKVFEENHAFVTQHNSMANASYTL 73
Query: 92 GTNQFSDLTNDEFRALYTGYKMPSPSH-RSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTP 150
N F+DLT+ EF+A G+ SP +S S Q L VP ++DWR GAVT
Sbjct: 74 ALNAFADLTHHEFKASRLGF---SPGRAQSIRSVGTPVQEL---HVPPAVDWRKSGAVTG 127
Query: 151 IKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAY 210
+K+Q CG CW+F+ A+EGI KI +G+L+ LSEQ+L+DC + N+GC GG + A+ +
Sbjct: 128 VKDQGNCGGCWSFSTTGAIEGINKIVTGSLVSLSEQELVDCDRSYNSGCEGGLMDYAYQF 187
Query: 211 IIQNQGI 217
+I+NQGI
Sbjct: 188 VIKNQGI 194
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 45/121 (37%), Positives = 71/121 (58%), Gaps = 18/121 (14%)
Query: 166 VAAVEGITKIRSGN---LIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIFNGVC 222
+ ++G T I + L+Q+ +Q + G+ EK F + ++G++ G C
Sbjct: 218 IVTIDGYTDIPPNDEKQLLQVVAKQPVSVGICGS--------EKTFQ--LYSKGVYTGPC 267
Query: 223 GTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSY 278
+ LDHAV IVG+GT EDG ++W++KNSWG WG GY+ ++R+ EG+CGI +SY
Sbjct: 268 SSTLDHAVLIVGYGT-EDGVDFWIVKNSWGEHWGMRGYIHMLRNNGTAEGICGINMLASY 326
Query: 279 P 279
P
Sbjct: 327 P 327
>gi|255636047|gb|ACU18368.1| unknown [Glycine max]
Length = 227
Score = 153 bits (387), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 87/212 (41%), Positives = 123/212 (58%), Gaps = 21/212 (9%)
Query: 19 FIIITLLVSCASQVVSSRSTH------EQSVVEIHEKWMAQH--GRSYKDELEKEMRLKI 70
F+ + L +S V +S H E+S+ +++E+W + H RS D K R +
Sbjct: 6 FLWVVLSLSLVLGVANSFDFHDKDLESEESLWDLYERWRSHHTVSRSLGD---KHKRFNV 62
Query: 71 FKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHR-----STTSST 125
FK N+ ++ NK ++ YKL N+F+D+TN EFR+ Y G K+ HR + T
Sbjct: 63 FKANVMHVHNTNKM-DKPYKLKLNKFADMTNHEFRSTYAGSKVNH--HRMFRDMPRGNGT 119
Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
F Y+ + VP S+DWR KGAVT +K+Q CG CWAF+ V AVEGI +I++ L+ LSE
Sbjct: 120 FMYEKVG--SVPASVDWRKKGAVTDVKDQGHCGSCWAFSTVVAVEGINQIKTNKLVSLSE 177
Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
Q+L+DC T N GC GG E AF +I Q GI
Sbjct: 178 QELVDCDTEENAGCNGGLMESAFQFIKQKGGI 209
>gi|115477767|ref|NP_001062479.1| Os08g0556900 [Oryza sativa Japonica Group]
gi|42407937|dbj|BAD09076.1| putative cysteine proteinase [Oryza sativa Japonica Group]
gi|113624448|dbj|BAF24393.1| Os08g0556900 [Oryza sativa Japonica Group]
gi|125562525|gb|EAZ07973.1| hypothetical protein OsI_30231 [Oryza sativa Indica Group]
gi|215701458|dbj|BAG92882.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 385
Score = 153 bits (387), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 81/185 (43%), Positives = 113/185 (61%), Gaps = 14/185 (7%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
E+++ E++E+W QH R +D EK R +FK+N+ I + N+ + YKL N+F D+
Sbjct: 41 EEALWELYERWRGQH-RVARDLGEKARRFNVFKDNVRLIHEFNRR-DEPYKLRLNRFGDM 98
Query: 100 TNDEFRALYTGYKMPSPSH------RSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKN 153
T DEFR Y ++ SH R S F Y D+P ++DWR+KGAV +K+
Sbjct: 99 TADEFRRAYASSRV---SHHRMFRGRGERRSGFMY--AGARDLPAAVDWREKGAVGAVKD 153
Query: 154 QKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYII 212
Q +CG CWAF+ +AAVEGI IR+ NL LSEQQL+DC T GN GC GG + AF YI
Sbjct: 154 QGQCGSCWAFSTIAAVEGINAIRTSNLTALSEQQLVDCDTKTGNAGCDGGLMDNAFQYIA 213
Query: 213 QNQGI 217
++ G+
Sbjct: 214 KHGGV 218
Score = 94.4 bits (233), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 38/71 (53%), Positives = 52/71 (73%), Gaps = 4/71 (5%)
Query: 214 NQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGL 269
++G+F G CGT+LDH V VG+GTT DG YW+++NSWG WG+ GY+++ RD EGL
Sbjct: 283 SEGVFAGKCGTELDHGVAAVGYGTTVDGTKYWIVRNSWGADWGEKGYIRMKRDVSAKEGL 342
Query: 270 CGIGTRSSYPL 280
CGI +SYP+
Sbjct: 343 CGIAMEASYPI 353
>gi|47224192|emb|CAG13112.1| unnamed protein product [Tetraodon nigroviridis]
Length = 327
Score = 153 bits (387), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 106/314 (33%), Positives = 143/314 (45%), Gaps = 82/314 (26%)
Query: 40 EQSVVEIHEK-WMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSD 98
E+ E H K WMA H ++Y + E RL+IF EN IEK N GN ++ +G NQFSD
Sbjct: 21 ERDADEQHFKSWMALHNKAYSVQ-EFHQRLQIFTENKRRIEKHNG-GNHSFTMGLNQFSD 78
Query: 99 LTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKG-AVTPIKNQKEC 157
+T EFR + P + S T ++ N + P S+DWR KG VTP+KNQ C
Sbjct: 79 MTFAEFRKRFL---WSEPQNCSATKGSYMKTN---SPQPESIDWRTKGNYVTPVKNQGAC 132
Query: 158 GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNN-GCLGGSREKAFAYIIQNQG 216
G CW F+ +E +T I +G L+ LSEQQL+DC+ + NN GC GG +AF YI N+G
Sbjct: 133 GSCWTFSTTGCLESVTAINTGKLVPLSEQQLVDCAWDFNNHGCNGGLPSQAFEYIKYNKG 192
Query: 217 I----------FNGVCGTQLDHAVTIV--------------------------GFGTTED 240
+ F G C + + A V F T+D
Sbjct: 193 LMTESGYPYTAFEGKCKYKPELAAAFVKNVVNITAYDEKGMEDAVATHNPVSFAFEVTDD 252
Query: 241 GANY-----------------------------------WLIKNSWGNTWGDAGYMKIVR 265
+Y W++KNSWG WG+ GY I R
Sbjct: 253 FMHYKGGVYSSSRCHKTTDKVNHAVLAVGYGNNNSSVPYWIVKNSWGPYWGENGYFLIER 312
Query: 266 DEGLCGIGTRSSYP 279
+ +CG+ SSYP
Sbjct: 313 GKNMCGLAACSSYP 326
>gi|1223922|gb|AAA92063.1| cysteinyl endopeptidase [Vigna radiata]
Length = 362
Score = 153 bits (387), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 79/181 (43%), Positives = 113/181 (62%), Gaps = 7/181 (3%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
E+S+ +++E+W + H S + EK R +FKEN+ ++ NK ++ YKL N+F+D+
Sbjct: 33 EESLWDLYERWRSHHTVS-RSLTEKHKRFNVFKENVMHVHNTNKM-DKPYKLKLNKFADM 90
Query: 100 TNDEFRALYTGYKMPSPSHRSTT---SSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKE 156
TN EFR+ Y G K+ T + TF Y+ + VP S+DWR KGAVT +K+Q +
Sbjct: 91 TNHEFRSTYAGSKVNHHKMFRGTQHGNGTFMYEKVG--SVPASVDWRKKGAVTDVKDQGQ 148
Query: 157 CGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQG 216
CG CWAF+ V AVEGI +I++ L+ LSEQ+L+DC N GC GG E AF +I Q G
Sbjct: 149 CGSCWAFSTVVAVEGINQIKTDKLVSLSEQELVDCDKEENQGCNGGLMESAFEFIKQKGG 208
Query: 217 I 217
I
Sbjct: 209 I 209
>gi|281346354|gb|EFB21938.1| hypothetical protein PANDA_009085 [Ailuropoda melanoleuca]
Length = 333
Score = 153 bits (387), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 97/309 (31%), Positives = 154/309 (49%), Gaps = 84/309 (27%)
Query: 49 KWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEFR 105
+W A +G+ Y + E+ R ++++N++ I++ N+E G ++ L N F DLTN+EF+
Sbjct: 31 RWKAANGKLYNKD-EEVWRRAVWEKNMKMIDQHNEEYSQGKHSFILAMNAFGDLTNEEFK 89
Query: 106 ALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAA 165
+ G K+ +P + +Q L + P+S+DWR+KG VTP+K+Q +CG CWAF+A
Sbjct: 90 QVMNGLKIQNPREGNM------FQLLPFAETPSSVDWREKGYVTPVKDQGQCGSCWAFSA 143
Query: 166 VAAVEGITKIRSGNLIQLSEQQLLDCS----TNGNNGCL------------GGSREKAFA 209
A+EG ++G L+ LSEQ L+DCS G NG L G E+++
Sbjct: 144 TGALEGQMFRKTGKLVSLSEQNLVDCSRAEGNAGCNGGLMDNAFRYVKDNGGLDSEESYP 203
Query: 210 YIIQN----------------------------------------------------QGI 217
Y+ Q+ +GI
Sbjct: 204 YLAQDGRCKYKPEQSAANDTGFADIHQDEESLMLSVATVGPISVAIDASLDTFRFYYKGI 263
Query: 218 F-NGVCGTQ-LDHAVTIVGFGTTEDGA---NYWLIKNSWGNTWGDAGYMKIVRDEG-LCG 271
+ + C ++ LDH V +VG+G+ E A NYW++KNSWG WG GY+ + +D G CG
Sbjct: 264 YYDPNCSSEDLDHGVLVVGYGSDEREAENKNYWIVKNSWGTQWGMQGYILMAKDRGNHCG 323
Query: 272 IGTRSSYPL 280
I T +S+P+
Sbjct: 324 IATSASFPI 332
>gi|172052260|gb|ACB70409.1| cysteine protease [Nicotiana tabacum]
Length = 361
Score = 153 bits (387), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 79/178 (44%), Positives = 113/178 (63%), Gaps = 11/178 (6%)
Query: 45 EIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEF 104
E++E+W + H S + EK+ R +FK N+ Y+ NK+ ++ YKL N+F+D+TN EF
Sbjct: 36 ELYERWRSHHTVS-RSLDEKDKRFNVFKANVHYVHNFNKK-DKPYKLKLNKFADMTNHEF 93
Query: 105 RALYTGYKMPSPSHR-----STTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGC 159
R Y G K+ HR S + TF Y + VP ++DWR KGAVTP+K+Q +CG
Sbjct: 94 RHHYAGSKIKH--HRTFLGASRANGTFMYAHED--SVPPTVDWRKKGAVTPVKDQGKCGS 149
Query: 160 CWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
CWAF+ V AVEGI +I++ L+ LSEQ+L+DC T+ N GC GG + AF +I + GI
Sbjct: 150 CWAFSTVVAVEGINQIKTNELVSLSEQELVDCDTSQNQGCNGGLMDMAFEFIKKKGGI 207
Score = 92.0 bits (227), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 38/71 (53%), Positives = 52/71 (73%), Gaps = 4/71 (5%)
Query: 214 NQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR----DEGL 269
++G+F G CGT+LDH V IVG+GTT D YW++KNSWG WG+ GY+++ R +EGL
Sbjct: 272 SEGVFTGDCGTELDHGVAIVGYGTTLDRTKYWIVKNSWGPEWGEKGYIRMQREIDAEEGL 331
Query: 270 CGIGTRSSYPL 280
CGI + SYP+
Sbjct: 332 CGIAMQPSYPI 342
>gi|312282841|dbj|BAJ34286.1| unnamed protein product [Thellungiella halophila]
Length = 358
Score = 153 bits (387), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 108/323 (33%), Positives = 147/323 (45%), Gaps = 91/323 (28%)
Query: 40 EQSVVEI---------HEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYK 90
E+SVV+I ++ ++G+ Y++ E ++R IFKENL+ I NK+ +YK
Sbjct: 43 EESVVQILGQSRHVLSFARFTHRYGKKYQNAEEIKLRFSIFKENLDLIRSTNKK-RLSYK 101
Query: 91 LGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFK-YQNLSMTDVPTSLDWRDKGAVT 149
LG NQF+DLT EF+ G S+T K L+ +P + DWR+ G V+
Sbjct: 102 LGVNQFADLTWQEFQRNKLGAAQ-------NCSATLKGSHKLTEAALPETKDWREDGIVS 154
Query: 150 PIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNN-GCLGGSREKAF 208
P+K+Q CG CW F+ A+E G I LSEQQL+DC+ NN GC GG +AF
Sbjct: 155 PVKDQGGCGSCWTFSTTGALEAAYHQAFGKGISLSEQQLVDCAGAFNNYGCNGGLPSQAF 214
Query: 209 AYIIQNQGI-------FNGVCGT------------------------QLDHAVTIV---- 233
YI N G+ + G GT +L HAV +V
Sbjct: 215 EYIKSNGGLDTEEAYPYTGKDGTCKYSAENVGVQVLDSVNITLGAEDELKHAVGLVRPVS 274
Query: 234 ------------------------------------GFGTTEDGANYWLIKNSWGNTWGD 257
G+G EDG YWLIKNSWG WGD
Sbjct: 275 IAFEVVKSFRLYKSGVYTDSHCGNTPMDVNHAVLAVGYG-IEDGVPYWLIKNSWGADWGD 333
Query: 258 AGYMKIVRDEGLCGIGTRSSYPL 280
GY K+ + +CGI T +SYP+
Sbjct: 334 KGYFKMEMGKNMCGIATCASYPV 356
>gi|301769893|ref|XP_002920368.1| PREDICTED: cathepsin L1-like [Ailuropoda melanoleuca]
Length = 503
Score = 153 bits (387), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 97/309 (31%), Positives = 154/309 (49%), Gaps = 84/309 (27%)
Query: 49 KWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEFR 105
+W A +G+ Y + E+ R ++++N++ I++ N+E G ++ L N F DLTN+EF+
Sbjct: 31 RWKAANGKLYNKD-EEVWRRAVWEKNMKMIDQHNEEYSQGKHSFILAMNAFGDLTNEEFK 89
Query: 106 ALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAA 165
+ G K+ +P + +Q L + P+S+DWR+KG VTP+K+Q +CG CWAF+A
Sbjct: 90 QVMNGLKIQNPREGNM------FQLLPFAETPSSVDWREKGYVTPVKDQGQCGSCWAFSA 143
Query: 166 VAAVEGITKIRSGNLIQLSEQQLLDCS----TNGNNGCL------------GGSREKAFA 209
A+EG ++G L+ LSEQ L+DCS G NG L G E+++
Sbjct: 144 TGALEGQMFRKTGKLVSLSEQNLVDCSRAEGNAGCNGGLMDNAFRYVKDNGGLDSEESYP 203
Query: 210 YIIQN----------------------------------------------------QGI 217
Y+ Q+ +GI
Sbjct: 204 YLAQDGRCKYKPEQSAANDTGFADIHQDEESLMLSVATVGPISVAIDASLDTFRFYYKGI 263
Query: 218 F-NGVCGTQ-LDHAVTIVGFGTTEDGA---NYWLIKNSWGNTWGDAGYMKIVRDEG-LCG 271
+ + C ++ LDH V +VG+G+ E A NYW++KNSWG WG GY+ + +D G CG
Sbjct: 264 YYDPNCSSEDLDHGVLVVGYGSDEREAENKNYWIVKNSWGTQWGMQGYILMAKDRGNHCG 323
Query: 272 IGTRSSYPL 280
I T +S+P+
Sbjct: 324 IATSASFPI 332
Score = 58.5 bits (140), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 30/59 (50%), Positives = 39/59 (66%), Gaps = 6/59 (10%)
Query: 226 LDHAVTIVGFGTTEDGA---NYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 280
LDH V +VG+G+ E A NYW++KNSWG WG GYM +VRD + C I T S+P+
Sbjct: 446 LDHGVLVVGYGSDEREAENKNYWIVKNSWGTDWGLQGYMLLVRDWDNHCEITT--SFPV 502
>gi|449469929|ref|XP_004152671.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
gi|449529596|ref|XP_004171784.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
Length = 431
Score = 153 bits (386), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 76/176 (43%), Positives = 110/176 (62%), Gaps = 4/176 (2%)
Query: 42 SVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTN 101
+V E+ E W +HG+SY EK RL +F +N E++ N N +Y L N ++DLT+
Sbjct: 24 NVSELFEIWCTEHGKSYSSAEEKLYRLGVFADNYEFVTHHNNLDNSSYTLSLNSYADLTH 83
Query: 102 DEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCW 161
EF+ G+ SP+ R+ + +L DVP SLDWR KGAVT +K+Q CG CW
Sbjct: 84 HEFKVSRLGF---SPALRNFRPVLPQEPSLPR-DVPDSLDWRKKGAVTAVKDQGSCGACW 139
Query: 162 AFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
+F+A A+EGI +I +G+LI LSEQ+L+DC + N+GC GG + A+ ++I N GI
Sbjct: 140 SFSATGAMEGINQIMTGSLISLSEQELIDCDRSYNSGCGGGLMDYAYQFVISNHGI 195
Score = 84.3 bits (207), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 49/120 (40%), Positives = 70/120 (58%), Gaps = 18/120 (15%)
Query: 166 VAAVEGITKIRS---GNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIFNGVC 222
V ++G I S G L+Q Q + G+ E+AF + ++GIF+G C
Sbjct: 219 VVTIDGYADIPSNDEGKLLQAVAAQPVSVGICGS--------ERAFQ--LYSKGIFSGPC 268
Query: 223 GTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSY 278
T LDHAV IVG+G+ E+G +YW++KNSWG +WG GYM + R+ EG+CGI +SY
Sbjct: 269 STSLDHAVLIVGYGS-ENGVDYWIVKNSWGKSWGMDGYMHMQRNSGNSEGVCGINKLASY 327
>gi|55740402|gb|AAV63977.1| cathepsin L precursor [Artemia franciscana]
Length = 338
Score = 153 bits (386), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 81/201 (40%), Positives = 124/201 (61%), Gaps = 6/201 (2%)
Query: 21 IITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEK 80
+I LL + Q+ ++ S E H + A H + Y +LE++ R+KI+ EN + K
Sbjct: 6 LIFLLGAVLVQLSAALSLTNLLADEWH-LFKATHKKEYPSQLEEKFRMKIYLENKHKVAK 64
Query: 81 AN---KEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
N ++G ++Y++ N+F DL + EFR++ GY+ + S STF + + +VP
Sbjct: 65 HNILYEKGEKSYQVAMNKFGDLLHHEFRSIMNGYQHKK-QNSSRAESTFTFMEPANVEVP 123
Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GN 196
S+DWR+KGA+TP+K+Q +CG CWAF++ A+EG T ++G LI LSEQ L+DCS GN
Sbjct: 124 ESVDWREKGAITPVKDQGQCGSCWAFSSTGALEGQTFRKTGKLISLSEQNLIDCSGKYGN 183
Query: 197 NGCLGGSREKAFAYIIQNQGI 217
GC GG ++AF YI N+GI
Sbjct: 184 EGCNGGLMDQAFQYIKDNKGI 204
Score = 71.2 bits (173), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 30/56 (53%), Positives = 42/56 (75%), Gaps = 2/56 (3%)
Query: 226 LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 280
LDH V +VG+G+ ++G +YWL+KNSW WGD GY+KI R+ + CG+ T +SYPL
Sbjct: 283 LDHGVLVVGYGS-DNGKDYWLVKNSWSEHWGDEGYIKIARNRKNHCGVATAASYPL 337
>gi|242044818|ref|XP_002460280.1| hypothetical protein SORBIDRAFT_02g025920 [Sorghum bicolor]
gi|241923657|gb|EER96801.1| hypothetical protein SORBIDRAFT_02g025920 [Sorghum bicolor]
Length = 363
Score = 153 bits (386), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 97/303 (32%), Positives = 143/303 (47%), Gaps = 76/303 (25%)
Query: 49 KWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALY 108
++ ++G+SY+ E + R +IF E+L+ + N++G +Y+LG N+FSD++ +EFRA
Sbjct: 64 RFAVRYGKSYESAAEVQKRFRIFSESLQLVRSTNRKG-LSYRLGINRFSDMSWEEFRAT- 121
Query: 109 TGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAA 168
++ + + S T + + +P + DWR+ G V+P+KNQ CG CW F+ A
Sbjct: 122 ---RLGAAQNCSATLAGNHRMRAAAVALPKTKDWREDGIVSPVKNQGHCGSCWTFSTTGA 178
Query: 169 VEGITKIRSGNLIQLSEQQLLDCSTNGNN-GCLGGSREKAFAYIIQNQGI---------- 217
+E +G I LSEQQL+DC NN GC GG +AF YI N G+
Sbjct: 179 LEAAYTQATGKPISLSEQQLVDCGKPFNNFGCNGGLPSQAFEYIKYNGGLDTEESYPYKG 238
Query: 218 FNGVCG---------------------TQLDHAVTIV-----------GF---------- 235
NG+C +L AV +V GF
Sbjct: 239 VNGICDFKAENVGVKVLDSVNITLGAEDELKDAVALVRPVSVAFQVVNGFRQYKSGVYTS 298
Query: 236 ---GTT---------------EDGANYWLIKNSWGNTWGDAGYMKIVRDEGLCGIGTRSS 277
G T E+G YWLIKNSWG WGD GY K+ + +CG+ T +S
Sbjct: 299 DSCGNTPMDVNHAVLAVGYGVENGVPYWLIKNSWGADWGDKGYFKMEMGKNMCGVATCAS 358
Query: 278 YPL 280
YP+
Sbjct: 359 YPI 361
>gi|111073719|dbj|BAF02548.1| triticain gamma [Triticum aestivum]
Length = 365
Score = 153 bits (386), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 102/306 (33%), Positives = 143/306 (46%), Gaps = 83/306 (27%)
Query: 49 KWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALY 108
++ ++G+SY+ E R +IF E+LE + N++G +Y+LG N+FSD++ +EF+A
Sbjct: 66 RFAVRYGKSYESAAEVRRRFRIFSESLEEVRSTNRKG-LSYRLGINRFSDMSWEEFQATR 124
Query: 109 TGYKMPSPSHRSTTSSTFKYQNLSMTD---VPTSLDWRDKGAVTPIKNQKECGCCWAFAA 165
G T S+T +L M D +P + DWR+ G V+P+K+Q CG CW F+
Sbjct: 125 LGAAQ-------TCSATLAGNHL-MRDAAALPETKDWREDGIVSPVKDQSHCGSCWTFST 176
Query: 166 VAAVEGITKIRSGNLIQLSEQQLLDCSTNGNN-GCLGGSREKAFAYIIQNQGI------- 217
A+E +G I LSEQQL+DC+ NN GC GG +AF YI N GI
Sbjct: 177 TGALEAAYTQATGKNISLSEQQLVDCAGGFNNFGCSGGLPSQAFEYIKYNGGIDTEESYP 236
Query: 218 ---FNGVCGTQLDHAVTIV--------------------------------GF------- 235
NGVC + ++AV V GF
Sbjct: 237 YKGVNGVCHYKAENAVVQVLDSVNITLNAEDELKNAVGLVRPVSVAFEVINGFRQYKSGV 296
Query: 236 ------GTTEDGAN---------------YWLIKNSWGNTWGDAGYMKIVRDEGLCGIGT 274
GTT D N YWLIKNSWG WGD GY K+ + +C + T
Sbjct: 297 YSSDHCGTTPDDVNHAVLAVGYGVENGVPYWLIKNSWGADWGDNGYFKMEMGKNMCAVAT 356
Query: 275 RSSYPL 280
+SYP+
Sbjct: 357 CASYPI 362
>gi|149617838|ref|XP_001521715.1| PREDICTED: cathepsin L1-like [Ornithorhynchus anatinus]
Length = 338
Score = 153 bits (386), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 92/210 (43%), Positives = 124/210 (59%), Gaps = 11/210 (5%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEK-WMAQHGRSYKDELEKEMRLKIFKENLE 76
M +++ L+ C VS+ S ++ H K W H +SY E E+ R +++ENL+
Sbjct: 1 MNLLVCLVSLCWGLAVSAPLG--DSELDRHWKLWKNWHQKSYH-EAEEGWRRTVWEENLK 57
Query: 77 YIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
I+ N E G TY+LG NQF DLTN+EF+ + TG + S +R S+ + +
Sbjct: 58 AIQLHNLEQSLGLHTYRLGMNQFGDLTNEEFQEILTGERHFSKGNRINGSA---FLEANF 114
Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS- 192
VPTS+DWRD G VTP+KNQ CG CWAF+ A+EG +SG LI LSEQ L+DCS
Sbjct: 115 VQVPTSVDWRDHGYVTPVKNQGHCGSCWAFSTTGALEGQLFRKSGRLISLSEQNLVDCSW 174
Query: 193 TNGNNGCLGGSREKAFAYIIQNQGIFNGVC 222
GN GC GG + AF YI+QNQGI + C
Sbjct: 175 QQGNQGCHGGIVDLAFQYILQNQGIDSEDC 204
Score = 68.9 bits (167), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 36/71 (50%), Positives = 47/71 (66%), Gaps = 6/71 (8%)
Query: 216 GIF-NGVCGTQ-LDHAVTIVGFG---TTEDGANYWLIKNSWGNTWGDAGYMKIVRDEG-L 269
GIF + C ++ LDHAV +VG+G E G YW++KNSWG WGD GY+ + +D G
Sbjct: 267 GIFYDPKCSSESLDHAVLVVGYGYEREDEAGKKYWIVKNSWGKHWGDRGYVYMSKDRGNH 326
Query: 270 CGIGTRSSYPL 280
CGI T +SYPL
Sbjct: 327 CGIATVASYPL 337
>gi|348551380|ref|XP_003461508.1| PREDICTED: pro-cathepsin H-like [Cavia porcellus]
Length = 335
Score = 152 bits (385), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 106/330 (32%), Positives = 146/330 (44%), Gaps = 84/330 (25%)
Query: 24 LLVSCASQVVSSRSTHEQSVVEIHEK-WMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN 82
LL+S + S+ + S+ E H K WM QH + Y + E R + F N + I N
Sbjct: 14 LLLSLGHTISVSKLS---SLEEFHFKSWMMQHQKQYSAK-EHHHRQQTFARNWKKINAHN 69
Query: 83 KEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDW 142
K GN T+K+ NQFSD++ DE + Y P + S T S + PTS+DW
Sbjct: 70 K-GNHTFKMALNQFSDMSFDEIKRKYL---WSEPQNCSATKSNY---FRGTGPYPTSVDW 122
Query: 143 RDKGA-VTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNN-GCL 200
R KG V+P+KNQ CG CW F+ A+E I SG ++ L+EQQL+DC+ + NN GC
Sbjct: 123 RKKGNFVSPVKNQGACGSCWTFSTTGALESAVAIASGKMLSLAEQQLVDCAQDFNNHGCE 182
Query: 201 GGSREKAFAYIIQNQGIF----------NGVCGTQLDHAVTIV----------------- 233
GG +AF YI+ N+GI +G C Q A+ V
Sbjct: 183 GGLPSQAFEYILYNKGIMGEDTYPYQGKDGHCRFQPQKAIAFVKDVVNITLNDEEAMVEA 242
Query: 234 ---------GFGTTED----------------------------------GANYWLIKNS 250
F TED G YW++KNS
Sbjct: 243 VALYNPVSFAFEVTEDFISYQSGIYSSTSCHKTPDKVNHAVLAVGYGVQNGVPYWIVKNS 302
Query: 251 WGNTWGDAGYMKIVRDEGLCGIGTRSSYPL 280
WG WG GY I R + +CG+ +S+P+
Sbjct: 303 WGTAWGQDGYFLIERGKNMCGLAACASFPI 332
>gi|194352760|emb|CAQ00108.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
gi|326510977|dbj|BAJ91836.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326523875|dbj|BAJ96948.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326528631|dbj|BAJ97337.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 368
Score = 152 bits (385), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 104/326 (31%), Positives = 145/326 (44%), Gaps = 96/326 (29%)
Query: 49 KWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNR--TYKLGTNQFSDLTNDEFRA 106
+W A+H R+Y E+ RL+++ N+ YIE N + TY+LG ++DLT+DEF A
Sbjct: 44 RWKAEHSRTYATPEEERHRLRVYARNMRYIEATNGDAGAGLTYELGETAYTDLTSDEFTA 103
Query: 107 LYTGYKMPS-------PSHRSTTSSTFK-----------YQNLSMTDVPTSLDWRDKGAV 148
+YT P P TT + Y N S P S+DWR++GAV
Sbjct: 104 MYTSRAPPLSDDDDDLPMTMITTRAGPVAAAGGGGWLQVYVNES-AGAPASVDWRERGAV 162
Query: 149 TPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAF 208
T +KNQ +CG CWAF+ VA +EGI +I++G L LSEQ+L+DC ++GC GG +A
Sbjct: 163 TAVKNQGQCGSCWAFSTVAVIEGIHQIKTGKLASLSEQELVDCD-KLDHGCNGGVSYRAL 221
Query: 209 AYIIQNQGIF----------NGVCGTQL--DHAVTIVGF--------------------- 235
+I N GI + C T+ HA +I GF
Sbjct: 222 QWITSNGGITSQDDYPYTAKDDTCDTKKLSHHAASISGFQRVATRSELSLTNAVAMQPVA 281
Query: 236 -GTTEDGANYWLIKN-----------------------------------SWGNTWGDAG 259
GAN+ +N SWG WGD G
Sbjct: 282 VSIEAGGANFQHYRNGVYNGPCGTRLNHGVTVVGYGEDEVTGESYWIVKNSWGEKWGDNG 341
Query: 260 YMK-----IVRDEGLCGIGTRSSYPL 280
Y++ I + EG+CGI R S+PL
Sbjct: 342 YLRMKKGIIDKPEGICGIAIRPSFPL 367
>gi|356563155|ref|XP_003549830.1| PREDICTED: vignain-like [Glycine max]
Length = 361
Score = 152 bits (384), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 78/179 (43%), Positives = 115/179 (64%), Gaps = 11/179 (6%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
E+ + +++E+W + H S + EK R +FK N+ ++ +NK ++ YKL N+F+D+
Sbjct: 33 EEGLWDLYERWRSHHTVS-RSLDEKHNRFNVFKGNVMHVHSSNKM-DKPYKLKLNRFADM 90
Query: 100 TNDEFRALYTGYKMPSPSHR-----STTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQ 154
TN EFR++Y G K+ HR + TF YQN+ VP+S+DWR KGAVT +K+Q
Sbjct: 91 TNHEFRSIYAGSKVNH--HRMFRGTPRGNGTFMYQNVDR--VPSSVDWRKKGAVTDVKDQ 146
Query: 155 KECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQ 213
+CG CWAF+ + AVEGI +I++ L+ LSEQ+L+DC T N GC GG E AF +I Q
Sbjct: 147 GQCGSCWAFSTIVAVEGINQIKTHKLVPLSEQELVDCDTTQNQGCNGGLMESAFEFIKQ 205
Score = 93.6 bits (231), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 38/71 (53%), Positives = 52/71 (73%), Gaps = 4/71 (5%)
Query: 214 NQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGL 269
++G+F G CGT LDH V IVG+GTT+DG YW +KNSWG+ WG+ GY+++ R +GL
Sbjct: 273 SEGVFTGNCGTALDHGVAIVGYGTTQDGTKYWTVKNSWGSEWGEKGYIRMKRSISVKKGL 332
Query: 270 CGIGTRSSYPL 280
CGI +SYP+
Sbjct: 333 CGIAMEASYPI 343
>gi|115441717|ref|NP_001045138.1| Os01g0907600 [Oryza sativa Japonica Group]
gi|5761329|dbj|BAA83473.1| cysteine endopeptidase [Oryza sativa]
gi|20804884|dbj|BAB92565.1| cysteine endopeptidase [Oryza sativa Japonica Group]
gi|56785107|dbj|BAD82745.1| putative cysteine proteinase [Oryza sativa Japonica Group]
gi|113534669|dbj|BAF07052.1| Os01g0907600 [Oryza sativa Japonica Group]
gi|119395242|gb|ABL74582.1| cysteine endopeptidase [Oryza sativa Japonica Group]
gi|125528777|gb|EAY76891.1| hypothetical protein OsI_04850 [Oryza sativa Indica Group]
gi|125573036|gb|EAZ14551.1| hypothetical protein OsJ_04473 [Oryza sativa Japonica Group]
Length = 371
Score = 152 bits (384), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 77/194 (39%), Positives = 112/194 (57%), Gaps = 7/194 (3%)
Query: 28 CASQVVSSRSTH-EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGN 86
CA+ R ++++ +++E+W H + EK R FK+N+ YI + NK G
Sbjct: 26 CAAIPFDERDLESDEALWDLYERWQEHH-HVPRHHGEKHRRFGAFKDNVRYIHEHNKRGG 84
Query: 87 RTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSST---FKYQNLSMTDVPTSLDWR 143
R Y+L N+F D+ +EFRA + G + F Y+ + D+P ++DWR
Sbjct: 85 RGYRLRLNRFGDMGREEFRATFAGSHANDLRRDGLAAPPLPGFMYEGVR--DLPRAVDWR 142
Query: 144 DKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGS 203
KGAVT +K+Q +CG CWAF+ V +VEGI IR+G L+ LSEQ+L+DC T N+GC GG
Sbjct: 143 RKGAVTGVKDQGKCGSCWAFSTVVSVEGINAIRTGRLVSLSEQELIDCDTADNSGCQGGL 202
Query: 204 REKAFAYIIQNQGI 217
E AF YI + GI
Sbjct: 203 MENAFEYIKHSGGI 216
Score = 91.3 bits (225), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 37/76 (48%), Positives = 51/76 (67%), Gaps = 4/76 (5%)
Query: 209 AYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE- 267
++ + G+F G CGT LDH V +VG+G T DG YW++KNSWG WG+ GY+++ RD
Sbjct: 277 SFQFYSDGVFAGDCGTDLDHGVAVVGYGETNDGTEYWIVKNSWGTAWGEGGYIRMQRDSG 336
Query: 268 ---GLCGIGTRSSYPL 280
GLCGI +SYP+
Sbjct: 337 YDGGLCGIAMEASYPV 352
>gi|558563|emb|CAA57538.1| cysteine proteinase [Cicer arietinum]
Length = 325
Score = 152 bits (384), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 74/172 (43%), Positives = 107/172 (62%), Gaps = 1/172 (0%)
Query: 46 IHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFR 105
++EKW+ +H + Y EK+ R +IFK+NL +I++ N + N +YK+G N+F+D+ N+E+R
Sbjct: 3 MYEKWLVKHQKMYNGLGEKDTRFQIFKDNLRFIDEHNAQ-NYSYKVGLNKFADINNEEYR 61
Query: 106 ALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAA 165
+Y G K + T T + V +DWR KGAVT IK+Q CG CWAF+
Sbjct: 62 DMYLGTKSDAKRRVMKTKITGHRITYNSVIVTVKVDWRLKGAVTHIKDQGSCGSCWAFST 121
Query: 166 VAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
+A VE I KI +G + LSEQ+L+DC N GC GG + AF +II+N GI
Sbjct: 122 IATVEAINKIVTGKFVSLSEQELVDCDRAFNEGCNGGLMDYAFEFIIRNGGI 173
Score = 77.4 bits (189), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 34/70 (48%), Positives = 45/70 (64%), Gaps = 6/70 (8%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE-----GLC 270
G+F G CGT LDH V +VG+G+ E+G +YWL++NSWG WG+ GY KI C
Sbjct: 239 GVFTGKCGTDLDHGVVVVGYGS-ENGVDYWLVRNSWGTNWGEDGYFKIASRNVKSLYRKC 297
Query: 271 GIGTRSSYPL 280
GI +SYP+
Sbjct: 298 GIAMEASYPV 307
>gi|226509942|ref|NP_001146834.1| cysteine protease precursor [Zea mays]
gi|159506725|gb|ABW97700.1| cysteine protease [Zea mays]
gi|414867308|tpg|DAA45865.1| TPA: cysteine protease [Zea mays]
Length = 352
Score = 152 bits (384), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 78/173 (45%), Positives = 105/173 (60%), Gaps = 11/173 (6%)
Query: 50 WMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYT 109
W A + RSY E++ R ++++ N+E+IE N+ GN TY LG NQF+DLT +EF LYT
Sbjct: 52 WQATYNRSYPTAEERQRRFQVYRRNIEHIEATNRAGNLTYTLGENQFADLTEEEFLDLYT 111
Query: 110 GYKMP----SPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQ-KECGCCWAFA 164
MP + R+ SS+ + D PTS+DWR KGAVTPIKNQ C CWAF
Sbjct: 112 MKGMPVRRDAGKKRANVSSS-----AAAVDAPTSVDWRSKGAVTPIKNQGPSCSSCWAFV 166
Query: 165 AVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
A +E ITKI +G L+ LSEQ+L+DC + GC G + ++IQN G+
Sbjct: 167 TAATIESITKITTGKLVSLSEQELIDCDPY-DGGCNLGYFVNGYRWVIQNGGL 218
Score = 82.8 bits (203), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 34/69 (49%), Positives = 51/69 (73%), Gaps = 4/69 (5%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGT-TEDGANYWLIKNSWGNTWGDAGYMKIVRD---EGLCG 271
G+F+G CGT+++HA+T+VG+G + G YWL+KNSWG +WG+ GY+++ RD GLCG
Sbjct: 283 GVFSGQCGTRMNHAITVVGYGADSSSGLKYWLVKNSWGQSWGERGYLRMRRDVGRGGLCG 342
Query: 272 IGTRSSYPL 280
I +YP+
Sbjct: 343 IALDLAYPV 351
>gi|194352756|emb|CAQ00106.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
Length = 476
Score = 152 bits (384), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 73/186 (39%), Positives = 115/186 (61%), Gaps = 8/186 (4%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDEL----EKEMRLKIFKENLEYIEKANKE---GNRTYKLG 92
E +++ W+A+HG ++E R F +NL +++ N G ++L
Sbjct: 45 EAEARAVYDLWLAEHGGGSSPNANSIADRERRFSAFWDNLRFVDAHNARAAAGEEGFRLA 104
Query: 93 TNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIK 152
N+F+DLTNDEFRA Y G K + +R+ +Y++ ++P ++DWR+KGAV P+K
Sbjct: 105 MNRFADLTNDEFRAAYLGVKGAAERNRAGRVVGDRYRHDGAEELPEAVDWREKGAVAPVK 164
Query: 153 NQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFAYI 211
NQ +CG CWAF+AV+ VE I +I +G ++ LSEQ+L++C NG ++GC GG + AF +I
Sbjct: 165 NQGQCGSCWAFSAVSTVESINQIVTGEMVTLSEQELVECDINGQSSGCNGGLMDDAFEFI 224
Query: 212 IQNQGI 217
I+N GI
Sbjct: 225 IKNGGI 230
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 40/82 (48%), Positives = 56/82 (68%), Gaps = 9/82 (10%)
Query: 202 GSREKAFAYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYM 261
G RE + + + G+F+G CGTQLDH V VG+GT E+G +YW+++NSWG WG+AGY+
Sbjct: 287 GGRE----FQLYHSGVFSGRCGTQLDHGVVAVGYGT-ENGKDYWIVRNSWGPNWGEAGYL 341
Query: 262 KIVRD----EGLCGIGTRSSYP 279
++ R+ G CGI SSYP
Sbjct: 342 RMERNINVTSGKCGIAMMSSYP 363
>gi|113603|sp|P05167.1|ALEU_HORVU RecName: Full=Thiol protease aleurain; Flags: Precursor
gi|19021|emb|CAA28804.1| aleurain [Hordeum vulgare]
Length = 362
Score = 152 bits (384), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 105/306 (34%), Positives = 143/306 (46%), Gaps = 83/306 (27%)
Query: 49 KWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALY 108
++ ++G+SY+ E R +IF E+LE + N++G Y+LG N+FSD++ +EF+A
Sbjct: 63 RFAVRYGKSYESAAEVRRRFRIFSESLEEVRSTNRKG-LPYRLGINRFSDMSWEEFQATR 121
Query: 109 TGYKMPSPSHRSTTSSTFKYQNLSMTD---VPTSLDWRDKGAVTPIKNQKECGCCWAFAA 165
G T S+T +L M D +P + DWR+ G V+P+KNQ CG CW F+
Sbjct: 122 LGAAQ-------TCSATLAGNHL-MRDAAALPETKDWREDGIVSPVKNQAHCGSCWTFST 173
Query: 166 VAAVEGITKIRSGNLIQLSEQQLLDCSTNGNN-GCLGGSREKAFAYIIQNQGI------- 217
A+E +G I LSEQQL+DC+ NN GC GG +AF YI N GI
Sbjct: 174 TGALEAAYTQATGKNISLSEQQLVDCAGGFNNFGCNGGLPSQAFEYIKYNGGIDTEESYP 233
Query: 218 ---FNGVCG---------------------TQLDHAVTIV-----------GF------- 235
NGVC +L +AV +V GF
Sbjct: 234 YKGVNGVCHYKAENAAVQVLDSVNITLNAEDELKNAVGLVRPVSVAFQVIDGFRQYKSGV 293
Query: 236 ------GTTEDGAN---------------YWLIKNSWGNTWGDAGYMKIVRDEGLCGIGT 274
GTT D N YWLIKNSWG WGD GY K+ + +C I T
Sbjct: 294 YTSDHCGTTPDDVNHAVLAVGYGVENGVPYWLIKNSWGADWGDNGYFKMEMGKNMCAIAT 353
Query: 275 RSSYPL 280
+SYP+
Sbjct: 354 CASYPV 359
>gi|242032709|ref|XP_002463749.1| hypothetical protein SORBIDRAFT_01g005350 [Sorghum bicolor]
gi|241917603|gb|EER90747.1| hypothetical protein SORBIDRAFT_01g005350 [Sorghum bicolor]
Length = 381
Score = 152 bits (384), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 80/187 (42%), Positives = 113/187 (60%), Gaps = 15/187 (8%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNR-TYKLGTNQFSD 98
++++ +++E+W H R ++ EK R FKEN+ +I NK G+R +Y+L N+F D
Sbjct: 39 DEALWDLYERWQTHH-RVHRHHGEKGRRFGTFKENVRFIHAHNKRGDRPSYRLRLNRFGD 97
Query: 99 LTNDEFRALYTG--------YKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTP 150
+ +EFR+ + Y+ SP+ +T F Y + TDVP S+DWR GAVT
Sbjct: 98 MGPEEFRSTFADSRINDLRRYRESSPA--ATAVPGFMYDD--ATDVPRSVDWRQHGAVTA 153
Query: 151 IKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAY 210
+KNQ CG CWAF+ V AVEGI IR+G+L+ LSEQ+L+DC T NGC GG E AF +
Sbjct: 154 VKNQGRCGSCWAFSTVVAVEGINAIRTGSLVSLSEQELVDCDT-AENGCQGGLMENAFDF 212
Query: 211 IIQNQGI 217
I GI
Sbjct: 213 IKSYGGI 219
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 35/76 (46%), Positives = 54/76 (71%), Gaps = 4/76 (5%)
Query: 209 AYIIQNQGIFNGVCGTQLDHAVTIVGFGTTE-DGANYWLIKNSWGNTWGDAGYMKIVR-- 265
A+ ++G+F G CGT LDH V +VG+G ++ DG YW++KNSWG +WG+ GY+++ R
Sbjct: 281 AFQFYSEGVFTGDCGTDLDHGVAVVGYGVSDVDGTPYWIVKNSWGPSWGEGGYIRMQRGA 340
Query: 266 -DEGLCGIGTRSSYPL 280
+ GLCGI +S+P+
Sbjct: 341 GNGGLCGIAMEASFPI 356
>gi|156398078|ref|XP_001638016.1| predicted protein [Nematostella vectensis]
gi|156225133|gb|EDO45953.1| predicted protein [Nematostella vectensis]
Length = 326
Score = 152 bits (384), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 83/201 (41%), Positives = 119/201 (59%), Gaps = 10/201 (4%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
+F+++ +LV+ +S+ S R + V W + HG+SY D E+ R+ I+++NLE
Sbjct: 3 VFLVLCVLVA-SSRGWSVRFGQDSEWV----AWKSYHGKSYSDVHEERTRMAIWQQNLEK 57
Query: 78 IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
I++ N E + +YK+ N DLT DEFR Y G + H ST Y S +P
Sbjct: 58 IKRHNAE-DHSYKMAMNHLGDLTEDEFRYFYLGVR---AHHNSTKRGWATYMPPSNVKIP 113
Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-TNGN 196
+S+DW KG VT +KNQ +CG CWAF+ +VEG ++G+L+ LSEQ L+DCS + GN
Sbjct: 114 SSVDWSQKGYVTGVKNQGQCGSCWAFSTTGSVEGQHFRKTGSLVSLSEQNLIDCSGSYGN 173
Query: 197 NGCLGGSREKAFAYIIQNQGI 217
NGC GG + AF YI N GI
Sbjct: 174 NGCQGGLMDNAFRYIESNGGI 194
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 29/58 (50%), Positives = 42/58 (72%), Gaps = 2/58 (3%)
Query: 224 TQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE-GLCGIGTRSSYPL 280
TQLDH V ++G+G +G +YWL+KNSWG +WG GY+ + R++ CGI + +SYPL
Sbjct: 269 TQLDHGVLVIGYGNY-NGQDYWLVKNSWGYSWGVEGYIMMSRNKNNQCGIASSASYPL 325
>gi|33348834|gb|AAQ16117.1| cathepsin L-like cysteine proteinase A [Rhipicephalus
haemaphysaloides haemaphysaloides]
Length = 332
Score = 152 bits (384), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 82/202 (40%), Positives = 121/202 (59%), Gaps = 9/202 (4%)
Query: 21 IITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEK 80
++ L + CA V+ + + + E + H +SY+ +E+ +R KIF EN I K
Sbjct: 1 MLRLSLLCAIVAVTVAANSHEILRTQWEAFKTTHKKSYESHMEELLRFKIFTENSLIIAK 60
Query: 81 ANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTF-KYQNLSMTDV 136
N + G +YKLG NQF DL EF ++ GY+ R++ STF N++ + +
Sbjct: 61 HNAKYAKGLVSYKLGMNQFGDLLAHEFAKIFNGYR----GQRTSRGSTFMPPANVNDSSL 116
Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-G 195
P+++DWR KGAVTP+K+Q +CG CWAF+A ++EG ++ G L+ LSEQ L+DCS + G
Sbjct: 117 PSTVDWRKKGAVTPVKDQGQCGSCWAFSATGSLEGQHFLKDGELVSLSEQNLVDCSQSFG 176
Query: 196 NNGCLGGSREKAFAYIIQNQGI 217
NNGC GG + AF YI N GI
Sbjct: 177 NNGCEGGLMDNAFKYIKANDGI 198
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 31/57 (54%), Positives = 41/57 (71%), Gaps = 2/57 (3%)
Query: 225 QLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE-GLCGIGTRSSYPL 280
+LDH V VG+G +DG YWL+KNSWG +WGD GY+ + RD+ CGI + +SYPL
Sbjct: 276 ELDHGVLAVGYGV-KDGKKYWLVKNSWGGSWGDNGYILMSRDKNNQCGIASAASYPL 331
>gi|326497561|dbj|BAK05870.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 340
Score = 152 bits (384), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 89/238 (37%), Positives = 128/238 (53%), Gaps = 25/238 (10%)
Query: 14 NTTPMFIIITLLVSCASQVVSSRS-----THEQSVVEIHEKWMAQHGRSYKDELEKEMRL 68
T P+ I+ LL + S + +++ +W A H RSY E+ R
Sbjct: 7 GTRPVIPILVLLTGGLFAAFPAASGGRVDAGDMLMMDRFRQWQATHNRSYLSAEERLRRF 66
Query: 69 KIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKY 128
++++ N+EYI+ N+ G TY+LG NQF+DLT +EF A Y G H + +T
Sbjct: 67 EVYRTNVEYIDATNRRGGLTYELGENQFADLTGEEFLARYAG------GHTGSAITTAAE 120
Query: 129 QNLSM-TDVPTSLDWRDKGAVTPIKNQ-KECGCCWAFAAVAAVEGITKIRSGNLIQLSEQ 186
+ S+ D P S+DWR KGAVTP+KNQ +C CWAF+AVA +E + I++G L+ LSEQ
Sbjct: 121 ADGSLEADPPASVDWRAKGAVTPVKNQGSQCYSCWAFSAVATMESLYFIKTGKLVALSEQ 180
Query: 187 QLLDCSTNGNNGCLGGSREKAFAYIIQNQGI----------FNGVCGTQLDHAVTIVG 234
QL+DC + GC G +AF +I++N GI G C + AVTI G
Sbjct: 181 QLVDCDKY-DGGCNKGYYHRAFQWIMENGGITTAAQYPYKAVRGAC-SAAKPAVTITG 236
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 35/67 (52%), Positives = 46/67 (68%), Gaps = 3/67 (4%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD---EGLCGI 272
G+F+ CG Q+ HAV VG+G G YWL+KNSWG TWG+AGY+++ RD GLCGI
Sbjct: 272 GVFSAACGIQMSHAVVTVGYGADASGLKYWLVKNSWGQTWGEAGYIRMRRDVGGGGLCGI 331
Query: 273 GTRSSYP 279
++YP
Sbjct: 332 ALDTAYP 338
>gi|296081395|emb|CBI16828.3| unnamed protein product [Vitis vinifera]
Length = 359
Score = 152 bits (384), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 78/179 (43%), Positives = 115/179 (64%), Gaps = 3/179 (1%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
E+S+ +++E+W + H S +D EK R +FKEN +++ K N+ ++ YKL N+F+D+
Sbjct: 33 EESLWDLYERWRSYHTVS-RDLEEKNKRFNVFKENTKHVHKVNQM-DKPYKLKLNKFADM 90
Query: 100 TNDEFRALYTGYKMPSPSH-RSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECG 158
TN EFR+ Y G K+ R T + + T +P S+DWR KGAVT IK+Q +CG
Sbjct: 91 TNHEFRSSYGGSKVKHYRMLRGDRRGTGGFMHEKTTYLPPSVDWRKKGAVTGIKDQGKCG 150
Query: 159 CCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
CWAF+ V VEGI +I++ L+ LSEQQL+DC + ++GC GG E AF +I +N GI
Sbjct: 151 SCWAFSTVVGVEGINQIKTKELLSLSEQQLIDCDRSDDHGCNGGLMESAFEFIKKNGGI 209
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 38/71 (53%), Positives = 52/71 (73%), Gaps = 4/71 (5%)
Query: 214 NQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGL 269
++G+F+G CGT+LDH V IVG+GTT DG YW++KNSWG WG+ GY+++ R EG
Sbjct: 274 SEGVFDGECGTELDHGVAIVGYGTTLDGTKYWIVKNSWGAEWGEKGYIRMARGIQAAEGQ 333
Query: 270 CGIGTRSSYPL 280
CGI +SYP+
Sbjct: 334 CGIAMEASYPV 344
>gi|37780049|gb|AAP32197.1| cysteine protease 10 [Trifolium repens]
Length = 272
Score = 152 bits (383), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 82/165 (49%), Positives = 109/165 (66%), Gaps = 20/165 (12%)
Query: 86 NRTYKLGTNQFSDLTNDEFRALYTGYK--MPSPSHRSTTSSTFKYQNLSMTDVPTSLDWR 143
N+ YKLG N+F+DLTN+EF+A +K M S R+TT FKY+N S +P+++DWR
Sbjct: 7 NKLYKLGINKFADLTNEEFKASRNKFKGHMCSSIIRTTT---FKYENASA--IPSTVDWR 61
Query: 144 DKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGG 202
KGAVTP+KNQ +CG CWAF+AVAA EGI ++ +G L+ LSEQ+L+DC T G + GC GG
Sbjct: 62 KKGAVTPVKNQGQCGSCWAFSAVAATEGIHQLSTGKLVSLSEQELIDCDTKGVDQGCEGG 121
Query: 203 SREKAFAYIIQNQGI----------FNGVCGTQLD--HAVTIVGF 235
+ AF +IIQN G+ +G C T HAVTI G+
Sbjct: 122 LMDDAFKFIIQNHGLSTEVQYPYEGVDGTCNTNEASIHAVTITGY 166
Score = 92.8 bits (229), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 40/72 (55%), Positives = 50/72 (69%), Gaps = 4/72 (5%)
Query: 214 NQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGL 269
N G+F G CGT+LDH VT VG+G DG YWL+KNSWG WG+ GY+++ R EGL
Sbjct: 201 NSGVFTGSCGTELDHGVTAVGYGVGNDGTKYWLVKNSWGADWGEEGYIRMQRGIDAAEGL 260
Query: 270 CGIGTRSSYPLA 281
CGI ++SYP A
Sbjct: 261 CGIAMQASYPTA 272
>gi|226499806|ref|NP_001151335.1| cysteine protease 1 [Zea mays]
gi|195645896|gb|ACG42416.1| cysteine protease 1 precursor [Zea mays]
Length = 258
Score = 152 bits (383), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 72/127 (56%), Positives = 96/127 (75%), Gaps = 5/127 (3%)
Query: 94 NQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPT---SLDWRDKGAVTP 150
N+F+D+TNDEF A+YTG + P P+ + FKY N++++D ++DWR KGAVT
Sbjct: 4 NEFADMTNDEFMAMYTGLR-PVPAGAKKMAG-FKYGNVTLSDADDDQQTVDWRQKGAVTG 61
Query: 151 IKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAY 210
IK+Q++CGCCWAFAAVAAVEGI +I +GNL+ LSEQQ+LDC T+GNNGC GG + AF Y
Sbjct: 62 IKDQRQCGCCWAFAAVAAVEGIHQITTGNLVSLSEQQVLDCDTDGNNGCNGGYIDNAFQY 121
Query: 211 IIQNQGI 217
I+ N G+
Sbjct: 122 IVGNGGL 128
Score = 78.2 bits (191), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 30/56 (53%), Positives = 41/56 (73%)
Query: 226 LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDEGLCGIGTRSSYPLA 281
L+HAVT VG+GT EDG YWL+KN WG WG+ GY+++ R CG+ ++SYP+A
Sbjct: 203 LNHAVTAVGYGTAEDGTPYWLLKNQWGQNWGEGGYLRLERGANACGVAQQASYPVA 258
>gi|189525870|ref|XP_001923796.1| PREDICTED: cathepsin L1 [Danio rerio]
Length = 335
Score = 152 bits (383), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 79/202 (39%), Positives = 122/202 (60%), Gaps = 9/202 (4%)
Query: 20 IIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIE 79
++ LLV+ V + + + + + W +QHG+SY +++E R+ I++ENL IE
Sbjct: 1 MMFALLVTLYISAVFAAPSIDIQLDDHWNSWKSQHGKSYHEDVEVGRRM-IWEENLRKIE 59
Query: 80 KANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDV 136
+ N E GN T+K+G NQF D+TN+EFR GYK + TS +
Sbjct: 60 QHNFEYSLGNHTFKMGMNQFGDMTNEEFRQAMNGYKHDP----NRTSQGPLFMEPKFFAA 115
Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-TNG 195
P +DWR +G VTP+K+QK+CG CW+F++ A+EG ++G LI +SEQ L+DCS +G
Sbjct: 116 PQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPHG 175
Query: 196 NNGCLGGSREKAFAYIIQNQGI 217
N GC GG ++AF Y+ +N+G+
Sbjct: 176 NQGCNGGLMDQAFQYVKENKGL 197
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 31/67 (46%), Positives = 43/67 (64%), Gaps = 4/67 (5%)
Query: 218 FNGVCGTQLDHAVTIVGFG---TTEDGANYWLIKNSWGNTWGDAGYMKIVRDE-GLCGIG 273
+ C +QLDHAV +VG+G G YW++KNSW + WGD GY+ + +D+ CGI
Sbjct: 268 YERACTSQLDHAVLVVGYGYQGADVAGNRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGIA 327
Query: 274 TRSSYPL 280
T +SYPL
Sbjct: 328 TMASYPL 334
>gi|359473128|ref|XP_002285397.2| PREDICTED: vignain-like [Vitis vinifera]
Length = 357
Score = 152 bits (383), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 78/179 (43%), Positives = 115/179 (64%), Gaps = 3/179 (1%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
E+S+ +++E+W + H S +D EK R +FKEN +++ K N+ ++ YKL N+F+D+
Sbjct: 31 EESLWDLYERWRSYHTVS-RDLEEKNKRFNVFKENTKHVHKVNQM-DKPYKLKLNKFADM 88
Query: 100 TNDEFRALYTGYKMPSPSH-RSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECG 158
TN EFR+ Y G K+ R T + + T +P S+DWR KGAVT IK+Q +CG
Sbjct: 89 TNHEFRSSYGGSKVKHYRMLRGDRRGTGGFMHEKTTYLPPSVDWRKKGAVTGIKDQGKCG 148
Query: 159 CCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
CWAF+ V VEGI +I++ L+ LSEQQL+DC + ++GC GG E AF +I +N GI
Sbjct: 149 SCWAFSTVVGVEGINQIKTKELLSLSEQQLIDCDRSDDHGCNGGLMESAFEFIKKNGGI 207
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 38/71 (53%), Positives = 52/71 (73%), Gaps = 4/71 (5%)
Query: 214 NQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGL 269
++G+F+G CGT+LDH V IVG+GTT DG YW++KNSWG WG+ GY+++ R EG
Sbjct: 272 SEGVFDGECGTELDHGVAIVGYGTTLDGTKYWIVKNSWGAEWGEKGYIRMARGIQAAEGQ 331
Query: 270 CGIGTRSSYPL 280
CGI +SYP+
Sbjct: 332 CGIAMEASYPV 342
>gi|269954686|ref|NP_954599.2| uncharacterized protein LOC218275 precursor [Mus musculus]
Length = 330
Score = 152 bits (383), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 90/235 (38%), Positives = 134/235 (57%), Gaps = 26/235 (11%)
Query: 16 TPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENL 75
TP+F++ TL + VVS+ H+ S+ + E+W +H ++Y E + R +++ N+
Sbjct: 2 TPVFLLATLCLG----VVSAAPAHDPSLDAVWEEWKTKHRKTYNMNEEAQKR-AVWENNM 56
Query: 76 EYIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
+ I N++ G + L N F DLTN EFR L TG++ S H+ T +Q
Sbjct: 57 KMIGLHNEDYLKGKHGFNLEMNAFGDLTNTEFRELMTGFQ--SMGHKEMTI----FQEPL 110
Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
+ DVP S+DWRD G VTP+K+Q CG CWAF+AV ++EG ++G L+ LSEQ L+DCS
Sbjct: 111 LGDVPKSVDWRDHGYVTPVKDQGHCGSCWAFSAVGSLEGQIFRKTGKLVPLSEQNLMDCS 170
Query: 193 -TNGNNGCLGGSREKAFAYIIQNQGI----------FNGVCGTQLDH-AVTIVGF 235
+ GN GC GG E AF Y+ +N+G+ ++G C + AV I GF
Sbjct: 171 WSYGNVGCNGGLMELAFQYVKENRGLDTRESYAYEAWDGPCRYDPKYSAVNITGF 225
>gi|30388235|gb|AAH51665.1| CDNA sequence BC051665 [Mus musculus]
Length = 330
Score = 152 bits (383), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 90/235 (38%), Positives = 134/235 (57%), Gaps = 26/235 (11%)
Query: 16 TPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENL 75
TP+F++ TL + VVS+ H+ S+ + E+W +H ++Y E + R +++ N+
Sbjct: 2 TPVFLLATLCLG----VVSAAPAHDPSLDAVWEEWKTKHRKTYSMNEEAQKR-AVWENNM 56
Query: 76 EYIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
+ I N++ G + L N F DLTN EFR L TG++ S H+ T +Q
Sbjct: 57 KMIGLHNEDYLKGKHGFNLEMNAFGDLTNTEFRELMTGFQ--SMGHKEMTI----FQEPL 110
Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
+ DVP S+DWRD G VTP+K+Q CG CWAF+AV ++EG ++G L+ LSEQ L+DCS
Sbjct: 111 LGDVPKSVDWRDHGYVTPVKDQGHCGSCWAFSAVGSLEGQIFRKTGKLVPLSEQNLMDCS 170
Query: 193 -TNGNNGCLGGSREKAFAYIIQNQGI----------FNGVCGTQLDH-AVTIVGF 235
+ GN GC GG E AF Y+ +N+G+ ++G C + AV I GF
Sbjct: 171 WSYGNVGCNGGLMELAFQYVKENRGLDTRESYAYEAWDGPCRYDPKYSAVNITGF 225
>gi|255586666|ref|XP_002533962.1| cysteine protease, putative [Ricinus communis]
gi|223526059|gb|EEF28418.1| cysteine protease, putative [Ricinus communis]
Length = 417
Score = 152 bits (383), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 93/241 (38%), Positives = 135/241 (56%), Gaps = 26/241 (10%)
Query: 9 GSFKINTTPMFIIITLLV---SCASQVVSSRST----------HEQSVVEIHEKWMAQHG 55
GS KI F+II LLV +C S + + E+ V E+ ++W +H
Sbjct: 2 GSQKIQ----FLIIFLLVGPLTCLSFTLPDEYSIVGNDLHELLSEERVKELFQQWKEKHR 57
Query: 56 RSYKDELEKEMRLKIFKENLEYI-EKANKEGN--RTYKLGTNQFSDLTNDEFRALYTGYK 112
+ YK E E RL+ F+ NL+Y+ EK K+ N + +G N+F+D++N EFR Y
Sbjct: 58 KVYKHVEEAEKRLENFRRNLKYVVEKNQKKKNLGSAHTVGLNKFADMSNVEFRQKYLSKV 117
Query: 113 MPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGI 172
R+ T + +NL P+SLDWR KG VTP+K+Q +CG CWAF++ A+EGI
Sbjct: 118 KKPIKKRNNNLMTSRQRNLQSCVAPSSLDWRKKGVVTPVKDQGDCGSCWAFSSTGAIEGI 177
Query: 173 TKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIFNGVCGTQLDHAVTI 232
I +G+L+ LSEQ+L+DC T N GC GG + AF ++I N GI T++D+ T
Sbjct: 178 NAIVTGDLVSLSEQELMDCDTT-NYGCDGGYMDYAFEWVINNGGI-----DTEIDYPYTG 231
Query: 233 V 233
V
Sbjct: 232 V 232
>gi|148709355|gb|EDL41301.1| cDNA sequence BC051665 [Mus musculus]
Length = 349
Score = 152 bits (383), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 90/235 (38%), Positives = 134/235 (57%), Gaps = 26/235 (11%)
Query: 16 TPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENL 75
TP+F++ TL + VVS+ H+ S+ + E+W +H ++Y E + R +++ N+
Sbjct: 21 TPVFLLATLCLG----VVSAAPAHDPSLDAVWEEWKTKHRKTYNMNEEAQKR-AVWENNM 75
Query: 76 EYIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
+ I N++ G + L N F DLTN EFR L TG++ S H+ T +Q
Sbjct: 76 KMIGLHNEDYLKGKHGFNLEMNAFGDLTNTEFRELMTGFQ--SMGHKEMTI----FQEPL 129
Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
+ DVP S+DWRD G VTP+K+Q CG CWAF+AV ++EG ++G L+ LSEQ L+DCS
Sbjct: 130 LGDVPKSVDWRDHGYVTPVKDQGHCGSCWAFSAVGSLEGQIFRKTGKLVPLSEQNLMDCS 189
Query: 193 -TNGNNGCLGGSREKAFAYIIQNQGI----------FNGVCGTQLDH-AVTIVGF 235
+ GN GC GG E AF Y+ +N+G+ ++G C + AV I GF
Sbjct: 190 WSYGNVGCNGGLMELAFQYVKENRGLDTRESYAYEAWDGPCRYDPKYSAVNITGF 244
>gi|74211558|dbj|BAE26509.1| unnamed protein product [Mus musculus]
Length = 338
Score = 152 bits (383), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 90/235 (38%), Positives = 134/235 (57%), Gaps = 26/235 (11%)
Query: 16 TPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENL 75
TP+F++ TL + VVS+ H+ S+ + E+W +H ++Y E + R +++ N+
Sbjct: 10 TPVFLLATLCLG----VVSAAPAHDPSLDAVWEEWKTKHRKTYNMNEEAQKR-AVWENNM 64
Query: 76 EYIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
+ I N++ G + L N F DLTN EFR L TG++ S H+ T +Q
Sbjct: 65 KMIGLHNEDYLKGKHGFNLEMNAFGDLTNTEFRELMTGFQ--SMGHKEMTI----FQEPL 118
Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
+ DVP S+DWRD G VTP+K+Q CG CWAF+AV ++EG ++G L+ LSEQ L+DCS
Sbjct: 119 LGDVPKSVDWRDHGYVTPVKDQGHCGSCWAFSAVGSLEGQIFRKTGKLVPLSEQNLMDCS 178
Query: 193 -TNGNNGCLGGSREKAFAYIIQNQGI----------FNGVCGTQLDH-AVTIVGF 235
+ GN GC GG E AF Y+ +N+G+ ++G C + AV I GF
Sbjct: 179 WSYGNVGCNGGLMELAFQYVKENRGLDTRESYAYEAWDGPCRYDPKYSAVNITGF 233
>gi|294938848|ref|XP_002782226.1| cathepsin L, putative [Perkinsus marinus ATCC 50983]
gi|239893730|gb|EER14021.1| cathepsin L, putative [Perkinsus marinus ATCC 50983]
Length = 334
Score = 152 bits (383), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 105/328 (32%), Positives = 155/328 (47%), Gaps = 82/328 (25%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
M III+ L+ +V + E +V + + G++Y+ + E+ R IF+ +L Y
Sbjct: 1 MMIIISFLLLSILPLV--KCLEEGTVDLAFMGFQHKFGKNYESKEEEIKRNAIFRAHLHY 58
Query: 78 IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
IE+ N + N +YKLG N+ +DLT++EF AL K+ + S S T +
Sbjct: 59 IEQVNAK-NLSYKLGVNEHADLTHEEFAAL----KLGTSSKMSMKRDDKLVVKADTTQLL 113
Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GN 196
TS+DWR KG +TPIK+Q CG CWAF+A A+E I +G L+ LSEQQL+DCS++ GN
Sbjct: 114 TSVDWRSKGVLTPIKDQGPCGSCWAFSATGALEAQYAIATGKLLSLSEQQLIDCSSSYGN 173
Query: 197 NGCLGG--------------SREKAFAYIIQN---------------------------- 214
GC GG +E + YI +N
Sbjct: 174 EGCSGGLMENAYTYIKSAGLDQESTYPYIAKNNACQVSLEKRSDGIPAGEVTGFHMLDQT 233
Query: 215 -QGIFNGV----------------------------CGTQLDHAVTIVGFGTTEDGANYW 245
QG+ + C +DH V VG+GT E+G +Y+
Sbjct: 234 EQGLMKALADAPVSIAMYASDPDFRFYQSGVYSSKTCHGTIDHGVVAVGYGT-ENGEDYF 292
Query: 246 LIKNSWGNTWGDAGYMKIVRDEGLCGIG 273
+I+NSWG++WG GY + R G+ G G
Sbjct: 293 VIRNSWGSSWGQDGYFYLKR--GVSGYG 318
>gi|402770505|gb|AFQ98387.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 152 bits (383), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 83/202 (41%), Positives = 122/202 (60%), Gaps = 9/202 (4%)
Query: 21 IITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEK 80
++ L V CA V+ ++ ++ + E + H ++Y+ +E+ +R KIF EN I K
Sbjct: 1 MLRLSVLCAIVAVTVAASSQEILRTQWEAFKTTHKKTYQSHMEELLRFKIFTENSLIIAK 60
Query: 81 ANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTF-KYQNLSMTDV 136
N + G +YKLG NQF DL EF ++ G++ R T STF N++ + +
Sbjct: 61 HNAKYAKGLVSYKLGMNQFGDLLAHEFARIFNGHR----GTRKTGGSTFLPPANVNDSSL 116
Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-G 195
P ++DWR KGAVTP+K+Q +CG CWAF+A ++EG +++G L+ LSEQ L+DCS + G
Sbjct: 117 PKAVDWRKKGAVTPVKDQGQCGSCWAFSATGSLEGQHFLKNGELVSLSEQNLVDCSQSFG 176
Query: 196 NNGCLGGSREKAFAYIIQNQGI 217
NNGC GG E AF YI N GI
Sbjct: 177 NNGCEGGLMEDAFKYIKANDGI 198
Score = 67.4 bits (163), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 31/75 (41%), Positives = 51/75 (68%), Gaps = 4/75 (5%)
Query: 209 AYIIQNQGIFNGV-CGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD 266
++ + ++G+++ C ++ LDH V +VG+G + G YWL+KNSW +WGD GY+ + RD
Sbjct: 258 SFQLYSEGVYDEPECSSEDLDHGVLVVGYGV-KGGKKYWLVKNSWAESWGDQGYILMSRD 316
Query: 267 -EGLCGIGTRSSYPL 280
CGI +++SYPL
Sbjct: 317 NNNQCGIASQASYPL 331
>gi|1345573|emb|CAA40073.1| endopeptidase (EP-C1) [Phaseolus vulgaris]
Length = 361
Score = 152 bits (383), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 83/210 (39%), Positives = 118/210 (56%), Gaps = 9/210 (4%)
Query: 15 TTPMFIIITLLVSCASQVVSSRSTH------EQSVVEIHEKWMAQHGRSYKDELEKEMRL 68
T + + L S V +S H E+S+ +++E+W + H S + EK R
Sbjct: 1 ATKKLLWVVLSFSLVLGVANSFDFHDKDLASEESLWDLYERWRSHHTVS-RSLGEKHKRF 59
Query: 69 KIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPS-HRSTTSSTFK 127
+FK NL ++ NK ++ YKL N+F+D+TN EFR+ Y G K+ R T
Sbjct: 60 NVFKANLMHVHNTNKM-DKPYKLKLNKFADMTNHEFRSTYAGSKVNHHRMFRGTPHENGA 118
Query: 128 YQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQ 187
+ + VP S+DWR KGAVT +K+Q +CG CWAF+ V AVEGI +I++ L+ LSEQ+
Sbjct: 119 FMYEKVVSVPPSVDWRKKGAVTDVKDQGQCGSCWAFSTVVAVEGINQIKTNKLVALSEQE 178
Query: 188 LLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
L+DC N GC GG E AF +I Q GI
Sbjct: 179 LVDCDKEENQGCNGGLMESAFEFIKQKGGI 208
Score = 89.7 bits (221), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 37/70 (52%), Positives = 50/70 (71%), Gaps = 4/70 (5%)
Query: 215 QGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLC 270
+G+F G C T L+H V IVG+GTT DG NYW+++NSWG WG+ GY+++ R+ EGLC
Sbjct: 274 EGVFTGDCSTDLNHGVAIVGYGTTVDGTNYWIVRNSWGPEWGEHGYIRMQRNISKKEGLC 333
Query: 271 GIGTRSSYPL 280
GI SYP+
Sbjct: 334 GIAMLPSYPI 343
>gi|401397136|ref|XP_003879989.1| cathepsin L, related [Neospora caninum Liverpool]
gi|325114397|emb|CBZ49954.1| cathepsin L, related [Neospora caninum Liverpool]
Length = 415
Score = 152 bits (383), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 79/195 (40%), Positives = 111/195 (56%), Gaps = 12/195 (6%)
Query: 52 AQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGY 111
A +G+SY E E + R IFK NL YI N++G +Y L N F DL+ +EFR Y GY
Sbjct: 124 ATYGKSYATEEETQKRYAIFKNNLAYIHTHNQQG-YSYSLKMNHFGDLSREEFRRKYLGY 182
Query: 112 KMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEG 171
+ + +S +DVP+++DWR+KG VTP+K+Q++CG CWAF+A A+EG
Sbjct: 183 NKSRNLKSNNLGVATELLKVSPSDVPSAVDWREKGCVTPVKDQRDCGSCWAFSATGALEG 242
Query: 172 ITKIRSGNLIQLSEQQLLDCS-TNGNNGCLGGSREKAFAYIIQNQGIF----------NG 220
++G L+ LSEQ+L+DCS GN GC GG AF Y++ + G+ +G
Sbjct: 243 AHCAKTGELLSLSEQELVDCSLAEGNQGCSGGEMNDAFQYVVDSGGLCSEEGYPYLARDG 302
Query: 221 VCGTQLDHAVTIVGF 235
C VTI GF
Sbjct: 303 ECKRACKKVVTISGF 317
Score = 63.9 bits (154), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 26/62 (41%), Positives = 40/62 (64%), Gaps = 1/62 (1%)
Query: 208 FAYIIQNQGIFNGVCGTQLDHAVTIVGFGT-TEDGANYWLIKNSWGNTWGDAGYMKIVRD 266
+ ++G+F+ CGT LDH V +VG+GT E ++W++KNSWG+ WG GYM +
Sbjct: 346 LPFQFYHEGVFDASCGTDLDHGVLLVGYGTDKETKKDFWIMKNSWGSGWGRDGYMYMAMH 405
Query: 267 EG 268
+G
Sbjct: 406 KG 407
>gi|6851030|emb|CAB71032.1| cysteine protease [Lolium multiflorum]
Length = 359
Score = 152 bits (383), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 104/306 (33%), Positives = 145/306 (47%), Gaps = 83/306 (27%)
Query: 49 KWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALY 108
++ +HG+SY E + R +IF E+L+ + N++G +YKLG N+FSD+T +EF+A
Sbjct: 60 RFAVRHGKSYGSAAEVQRRFRIFSESLDEVRSTNRKG-LSYKLGINRFSDMTWEEFQATK 118
Query: 109 TGYKMPSPSHRSTTSSTFKYQNLSMTD---VPTSLDWRDKGAVTPIKNQKECGCCWAFAA 165
G T S+T +L M D +P + DWR+ G V+P+K+Q CG CW F+
Sbjct: 119 LGAAQ-------TCSATLAGNHL-MRDANALPETKDWRETGIVSPVKDQASCGSCWTFST 170
Query: 166 VAAVEGITKIRSGNLIQLSEQQLLDCSTNGNN-GCLGGSREKAFAYIIQNQGI------- 217
A+E +G I LSEQQL+DC+ NN GC GG +AF YI N GI
Sbjct: 171 TGALEAAYTQATGKNISLSEQQLVDCAGAYNNFGCNGGLPSQAFEYIKYNGGIDTEESYP 230
Query: 218 ---FNGVCG---------------------TQLDHAVTIV-----------GF------- 235
NGVC +L +AV +V GF
Sbjct: 231 YKGVNGVCKYRPENAAVQVADSVNITLNAEDELKNAVGLVRPVSVAFEVIDGFKQYKSGV 290
Query: 236 ------GTT---------------EDGANYWLIKNSWGNTWGDAGYMKIVRDEGLCGIGT 274
GTT E+G YWLIKNSWG WG+ GY K+ + +C + T
Sbjct: 291 YTSDHCGTTPDDVNHAVLAVGYGVENGVPYWLIKNSWGADWGEDGYFKMEMGKNMCAVAT 350
Query: 275 RSSYPL 280
+SYP+
Sbjct: 351 CASYPI 356
>gi|386648112|gb|AFJ15103.1| mexicain-like cystein protease, partial [Jacaratia mexicana]
Length = 348
Score = 152 bits (383), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 80/178 (44%), Positives = 111/178 (62%), Gaps = 6/178 (3%)
Query: 38 THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
T + ++ + E WM +H R Y + EK R +IFK+NL YI++ NK+ N +Y LG N+F
Sbjct: 39 TSTERLIRLFESWMLKHDRVYNNIEEKIHRFEIFKDNLMYIDETNKK-NNSYWLGLNEFV 97
Query: 98 DLTNDEFRALYTG-YKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKE 156
DLT+DEF+ Y G + + F Y+++ D P S+DWRDKGAVTP+K
Sbjct: 98 DLTHDEFKEKYVGSIGEDFVTIEQSNDEEFPYKHV--VDYPESIDWRDKGAVTPVK-PNP 154
Query: 157 CGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQN 214
CG CWAF+ VA VEGI KI +G LI LSEQ+LLDC ++GC GG + + Y++ N
Sbjct: 155 CGSCWAFSTVATVEGINKIVTGKLISLSEQELLDCDRR-SHGCKGGYQTTSLQYVVDN 211
Score = 79.0 bits (193), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 40/78 (51%), Positives = 49/78 (62%), Gaps = 9/78 (11%)
Query: 206 KAFAYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR 265
K A+ + GIFNG CGT+LDHAVT +G+G T Y LIKNSWG WG+ GY+KI R
Sbjct: 270 KGRAFQLYKGGIFNGPCGTKLDHAVTAIGYGKT-----YILIKNSWGPNWGEKGYLKIKR 324
Query: 266 ----DEGLCGIGTRSSYP 279
EG CG+ S +P
Sbjct: 325 ASGKSEGTCGVYKSSYFP 342
>gi|326507362|dbj|BAK03074.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 473
Score = 152 bits (383), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 73/186 (39%), Positives = 115/186 (61%), Gaps = 8/186 (4%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDEL----EKEMRLKIFKENLEYIEKANKE---GNRTYKLG 92
E +++ W+A+HG ++E R F +NL +++ N G ++L
Sbjct: 45 EAEARAVYDLWLAEHGGGSSPNANSIADRERRFSAFWDNLRFVDAHNARAAAGEEGFRLA 104
Query: 93 TNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIK 152
N+F+DLTNDEFRA Y G K + +R+ +Y++ ++P ++DWR+KGAV P+K
Sbjct: 105 MNRFADLTNDEFRAAYLGVKGAAERNRAGRVVGERYRHDGAEELPEAVDWREKGAVAPVK 164
Query: 153 NQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFAYI 211
NQ +CG CWAF+AV+ VE I +I +G ++ LSEQ+L++C NG ++GC GG + AF +I
Sbjct: 165 NQGQCGSCWAFSAVSTVESINQIVTGEMVTLSEQELVECDINGQSSGCNGGLMDDAFEFI 224
Query: 212 IQNQGI 217
I+N GI
Sbjct: 225 IKNGGI 230
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 40/82 (48%), Positives = 56/82 (68%), Gaps = 9/82 (10%)
Query: 202 GSREKAFAYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYM 261
G RE + + + G+F+G CGTQLDH V VG+GT E+G +YW+++NSWG WG+AGY+
Sbjct: 287 GGRE----FQLYHSGVFSGRCGTQLDHGVVAVGYGT-ENGKDYWIVRNSWGPNWGEAGYL 341
Query: 262 KIVRD----EGLCGIGTRSSYP 279
++ R+ G CGI SSYP
Sbjct: 342 RMERNINVTSGKCGIAMMSSYP 363
>gi|204307508|gb|ACI00280.1| triticain beta 2 [Hordeum vulgare]
Length = 473
Score = 152 bits (383), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 73/186 (39%), Positives = 115/186 (61%), Gaps = 8/186 (4%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDEL----EKEMRLKIFKENLEYIEKANKE---GNRTYKLG 92
E +++ W+A+HG ++E R F +NL +++ N G ++L
Sbjct: 45 EAEARAVYDLWLAEHGGGSSPNANSIADRERRFSAFWDNLRFVDAHNARAAAGEEGFRLA 104
Query: 93 TNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIK 152
N+F+DLTNDEFRA Y G K + +R+ +Y++ ++P ++DWR+KGAV P+K
Sbjct: 105 MNRFADLTNDEFRAAYLGVKGAAERNRAGRVVGERYRHDGAEELPEAVDWREKGAVAPVK 164
Query: 153 NQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFAYI 211
NQ +CG CWAF+AV+ VE I +I +G ++ LSEQ+L++C NG ++GC GG + AF +I
Sbjct: 165 NQGQCGSCWAFSAVSTVESINQIVTGEMVTLSEQELVECDINGQSSGCNGGLMDDAFEFI 224
Query: 212 IQNQGI 217
I+N GI
Sbjct: 225 IKNGGI 230
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 40/82 (48%), Positives = 56/82 (68%), Gaps = 9/82 (10%)
Query: 202 GSREKAFAYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYM 261
G RE + + + G+F+G CGTQLDH V VG+GT E+G +YW+++NSWG WG+AGY+
Sbjct: 287 GGRE----FQLYHSGVFSGRCGTQLDHGVVAVGYGT-ENGKDYWIVRNSWGPNWGEAGYL 341
Query: 262 KIVRD----EGLCGIGTRSSYP 279
++ R+ G CGI SSYP
Sbjct: 342 RMERNINVTSGKCGIAMMSSYP 363
>gi|58332124|ref|NP_001011214.1| cathepsin S, gene 2 precursor [Xenopus (Silurana) tropicalis]
gi|56556518|gb|AAH87770.1| cathepsin S [Xenopus (Silurana) tropicalis]
Length = 332
Score = 151 bits (382), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 103/327 (31%), Positives = 153/327 (46%), Gaps = 76/327 (23%)
Query: 25 LVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE 84
LV+ S ++ + S + ++ + W+ H +SYKD E+ R I++E L++I N E
Sbjct: 8 LVALLSLLIPAHSAPDPTLDTHWQLWVKTHQKSYKDTEEERTRRTIWEETLKFITVHNLE 67
Query: 85 ---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLD 141
G TY++G N D+T +E A TGY + S S + T + + P S+D
Sbjct: 68 YSLGLHTYEVGMNHLGDMTGEEVAATMTGY---TGSGGSLANMTEVPKEIQEAQPPASID 124
Query: 142 WRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-TNGNNGCL 200
WR + VTP+++Q CGCC+AF+AV A+E K + G L LS Q+L+DCS T GNNGC
Sbjct: 125 WRTQACVTPVRDQGPCGCCYAFSAVGAMECQAKRKRGMLFTLSPQELVDCSDTEGNNGCK 184
Query: 201 GGSREKAFAYIIQN---------------------------QGIFNGVCGTQ--LDHAVT 231
GG AF Y++++ I N G + L HAV
Sbjct: 185 GGRLMSAFTYMMKHGVMEENAYRYTGQEADCRRQVHTGLKVTAIHNVAAGDENVLMHAVG 244
Query: 232 IVG--------------------------------------FGTTEDGANYWLIKNSWGN 253
VG +GT ++G +YWL+KNSWG
Sbjct: 245 TVGPVSVNIDSNRKGFRLYKSGVYYDPYCTTNLDHSVLVVGYGT-DNGNDYWLVKNSWGA 303
Query: 254 TWGDAGYMKIVRDE-GLCGIGTRSSYP 279
+GD GY+K+ R+ CGI + +P
Sbjct: 304 GYGDKGYIKMARNRNNHCGIAQEAVFP 330
>gi|57118005|gb|AAW34134.1| cysteine protease gp2a [Zingiber officinale]
Length = 381
Score = 151 bits (382), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 88/225 (39%), Positives = 132/225 (58%), Gaps = 19/225 (8%)
Query: 23 TLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN 82
L +S V RS E V ++ +W ++ + K E RL++FKENL+++++ N
Sbjct: 31 VLTLSKQGGAVPVRSDEE--VRMLYLEWRVKNHPAEKYLDLNEYRLEVFKENLQFVDEHN 88
Query: 83 KEGNR---TYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSS-TFKYQNLSMTDVPT 138
+R T+ LG N+F+DLTN+E+R + + S RS + + +Y+ D+P
Sbjct: 89 AAADRGEHTFLLGMNRFADLTNEEYRTRF--LRDFSRLRRSASGKISSRYRLREGDDLPD 146
Query: 139 SLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNG 198
S+DWR+ GAV P+KNQ CG CWAF+ VAAVEGI +I +G+LI LSEQQL+DC+T N+G
Sbjct: 147 SIDWRENGAVVPVKNQGGCGSCWAFSTVAAVEGINQIVTGDLISLSEQQLVDCTT-ANHG 205
Query: 199 CLGGSREKAFAYIIQNQGIF----------NGVCGTQLDHAVTIV 233
C GG AF +I+ N GI NG+C + ++ V +
Sbjct: 206 CRGGWMNPAFQFIVNNGGINSEETYPYRGQNGICNSTVNAPVVSI 250
Score = 71.6 bits (174), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 31/69 (44%), Positives = 47/69 (68%), Gaps = 5/69 (7%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCG 271
GIF G C +HA+T+VG+GT E+ ++W++KNSWG WG++GY++ R+ G CG
Sbjct: 290 GIFTGSCNISANHALTVVGYGT-ENDKDFWIVKNSWGKNWGESGYIRAERNIENPNGKCG 348
Query: 272 IGTRSSYPL 280
I +SYP+
Sbjct: 349 ITRFASYPV 357
>gi|297843784|ref|XP_002889773.1| hypothetical protein ARALYDRAFT_471096 [Arabidopsis lyrata subsp.
lyrata]
gi|297335615|gb|EFH66032.1| hypothetical protein ARALYDRAFT_471096 [Arabidopsis lyrata subsp.
lyrata]
Length = 439
Score = 151 bits (382), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 75/174 (43%), Positives = 112/174 (64%), Gaps = 5/174 (2%)
Query: 45 EIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEF 104
E+ + W +HG++Y E E++ R++IFK+N +++ + N N TY L N F+DLT+ EF
Sbjct: 30 ELFDDWCQRHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADLTHHEF 89
Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLS-MTDVPTSLDWRDKGAVTPIKNQKECGCCWAF 163
+A G + + S + K Q+L VP S+DWR KGAVT +K+Q CG CW+F
Sbjct: 90 KASRLGLSVSASSLIMAS----KGQSLGGNAKVPDSVDWRKKGAVTNVKDQGSCGACWSF 145
Query: 164 AAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
+A A+EGI +I +G+LI LSEQ+L+DC + N GC GG + AF ++I+N GI
Sbjct: 146 SATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNHGI 199
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 39/80 (48%), Positives = 56/80 (70%), Gaps = 5/80 (6%)
Query: 205 EKAFAYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIV 264
E+AF + GIF+G C T LDHAV IVG+G+ ++G +YW++KNSWG +WG G+M +
Sbjct: 257 ERAFQLYSRVSGIFSGPCSTSLDHAVLIVGYGS-QNGVDYWIVKNSWGKSWGMDGFMHMQ 315
Query: 265 RD----EGLCGIGTRSSYPL 280
R+ EG+CGI +SYP+
Sbjct: 316 RNTGNSEGICGINMLASYPI 335
>gi|115479391|ref|NP_001063289.1| Os09g0442300 [Oryza sativa Japonica Group]
gi|115510968|sp|P25778.2|ORYC_ORYSJ RecName: Full=Oryzain gamma chain; Flags: Precursor
gi|51535997|dbj|BAD38077.1| putative oryzain gamma chain precursor [Oryza sativa Japonica
Group]
gi|113631522|dbj|BAF25203.1| Os09g0442300 [Oryza sativa Japonica Group]
gi|215694919|dbj|BAG90110.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 362
Score = 151 bits (381), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 95/303 (31%), Positives = 142/303 (46%), Gaps = 77/303 (25%)
Query: 49 KWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALY 108
++ +HG+ Y D E + R +IF E+LE + N+ G Y+LG N+F+D++ +EF+A
Sbjct: 64 RFAVRHGKRYGDAAEVQRRFRIFSESLELVRSTNRRG-LPYRLGINRFADMSWEEFQASR 122
Query: 109 TGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAA 168
G + + + ++ +P + DWR+ G V+P+K+Q CG CW F+ +
Sbjct: 123 LG-----AAQNCSATLAGNHRMRDAAALPETKDWREDGIVSPVKDQGHCGSCWTFSTTGS 177
Query: 169 VEGITKIRSGNLIQLSEQQLLDCSTNGNN-GCLGGSREKAFAYIIQNQGI---------- 217
+E +G + LSEQQL+DC+T NN GC GG +AF YI N G+
Sbjct: 178 LEAAYTQATGKPVSLSEQQLVDCATAYNNFGCSGGLPSQAFEYIKYNGGLDTEEAYPYTG 237
Query: 218 FNGVCG---------------------TQLDHAVTIV-----------GF---------- 235
NG+C +L +AV +V GF
Sbjct: 238 VNGICHYKPENVGVKVLDSVNITLGAEDELKNAVGLVRPVSVAFQVINGFRMYKSGVYTS 297
Query: 236 ---GTT---------------EDGANYWLIKNSWGNTWGDAGYMKIVRDEGLCGIGTRSS 277
GT+ E+G YWLIKNSWG WGD GY K+ + +CGI T +S
Sbjct: 298 DHCGTSPMDVNHAVLAVGYGVENGVPYWLIKNSWGADWGDNGYFKMEMGKNMCGIATCAS 357
Query: 278 YPL 280
YP+
Sbjct: 358 YPI 360
>gi|402770501|gb|AFQ98385.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 151 bits (381), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 83/202 (41%), Positives = 121/202 (59%), Gaps = 9/202 (4%)
Query: 21 IITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEK 80
++ L V CA V+ ++ ++ + E + H ++Y+ +E+ +R KIF EN I K
Sbjct: 1 MLRLSVLCAIAAVTVAASSQEILRTQWEAFKTTHKKTYQSHMEELLRFKIFTENSLIIAK 60
Query: 81 ANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTF-KYQNLSMTDV 136
N + G +YKLG NQF DL EF ++ G+ R T STF N++ + +
Sbjct: 61 HNAKYAKGLVSYKLGMNQFGDLLAHEFARIFNGHH----GTRKTGGSTFLPPANVNDSSL 116
Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-G 195
P ++DWR KGAVTP+K+Q +CG CWAF+A ++EG +++G L+ LSEQ L+DCS + G
Sbjct: 117 PKAVDWRKKGAVTPVKDQGQCGSCWAFSATGSLEGQHFLKNGELVSLSEQNLVDCSQSFG 176
Query: 196 NNGCLGGSREKAFAYIIQNQGI 217
NNGC GG E AF YI N GI
Sbjct: 177 NNGCEGGLMEDAFKYIKANDGI 198
Score = 67.4 bits (163), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 31/75 (41%), Positives = 51/75 (68%), Gaps = 4/75 (5%)
Query: 209 AYIIQNQGIFNGV-CGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD 266
++ + ++G+++ C ++ LDH V +VG+G + G YWL+KNSW +WGD GY+ + RD
Sbjct: 258 SFQLYSEGVYDEPECSSEDLDHGVLVVGYGV-KGGKKYWLVKNSWAESWGDQGYILMSRD 316
Query: 267 -EGLCGIGTRSSYPL 280
CGI +++SYPL
Sbjct: 317 NNNQCGIASQASYPL 331
>gi|111073717|dbj|BAF02547.1| triticain beta [Triticum aestivum]
Length = 472
Score = 151 bits (381), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 76/186 (40%), Positives = 116/186 (62%), Gaps = 10/186 (5%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDEL----EKEMRLKIFKENLEYIEKANKE---GNRTYKLG 92
E +++ W+A++G E+E R + F +NL +++ N G Y+LG
Sbjct: 46 EAEARAVYDLWLAENGGGSSPNANSIPERERRFRAFWDNLNFVDAHNARAAAGEEGYRLG 105
Query: 93 TNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIK 152
N+F+DLTNDEFRA Y G K + R +Y++ ++P ++DWR+KGAV P+K
Sbjct: 106 MNRFADLTNDEFRAAYLGVK--AQRARPGRMVGERYRHDGAEELPEAVDWREKGAVAPVK 163
Query: 153 NQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFAYI 211
NQ +CG CWAF+AV+ VE I +I +G ++ LSEQ+L++C TNG ++GC GG + AF +I
Sbjct: 164 NQGQCGSCWAFSAVSTVESINQIVTGEMVTLSEQELVECDTNGQSSGCNGGLMDDAFEFI 223
Query: 212 IQNQGI 217
I+N GI
Sbjct: 224 IKNGGI 229
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 39/82 (47%), Positives = 56/82 (68%), Gaps = 9/82 (10%)
Query: 202 GSREKAFAYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYM 261
G RE + + + G+F+G CGTQLDH V VG+GT E+G +YW+++NSWG WG++GY+
Sbjct: 286 GGRE----FQLYHSGVFSGRCGTQLDHGVVAVGYGT-ENGKDYWIVRNSWGPNWGESGYL 340
Query: 262 KIVRD----EGLCGIGTRSSYP 279
++ R+ G CGI SSYP
Sbjct: 341 RMERNINVTSGKCGIAMMSSYP 362
>gi|218202220|gb|EEC84647.1| hypothetical protein OsI_31538 [Oryza sativa Indica Group]
Length = 363
Score = 151 bits (381), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 95/303 (31%), Positives = 142/303 (46%), Gaps = 77/303 (25%)
Query: 49 KWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALY 108
++ +HG+ Y D E + R +IF E+LE + N+ G Y+LG N+F+D++ +EF+A
Sbjct: 65 RFAVRHGKRYGDAAEVQRRFRIFSESLELVRSTNRRG-LPYRLGINRFADMSWEEFQASR 123
Query: 109 TGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAA 168
G + + + ++ +P + DWR+ G V+P+K+Q CG CW F+ +
Sbjct: 124 LG-----AAQNCSATLAGNHRMRDAAALPETKDWREDGIVSPVKDQGHCGSCWTFSTTGS 178
Query: 169 VEGITKIRSGNLIQLSEQQLLDCSTNGNN-GCLGGSREKAFAYIIQNQGI---------- 217
+E +G + LSEQQL+DC+T NN GC GG +AF YI N G+
Sbjct: 179 LEAAYTQATGKPVSLSEQQLVDCATAYNNFGCSGGLPSQAFEYIKYNGGLDTEEAYPYTG 238
Query: 218 FNGVCG---------------------TQLDHAVTIV-----------GF---------- 235
NG+C +L +AV +V GF
Sbjct: 239 VNGICHYKPENVGVKVLDSVNITLGAEDELKNAVGLVRPVSVAFQVINGFRMYKSGVYTS 298
Query: 236 ---GTT---------------EDGANYWLIKNSWGNTWGDAGYMKIVRDEGLCGIGTRSS 277
GT+ E+G YWLIKNSWG WGD GY K+ + +CGI T +S
Sbjct: 299 DHCGTSPMDVNHAVLAVGYGVENGVPYWLIKNSWGADWGDNGYFKMEMGKNMCGIATCAS 358
Query: 278 YPL 280
YP+
Sbjct: 359 YPI 361
>gi|5081735|gb|AAD39513.1|AF147207_1 cathepsin L-like protease precursor [Artemia franciscana]
Length = 338
Score = 151 bits (381), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 81/201 (40%), Positives = 123/201 (61%), Gaps = 6/201 (2%)
Query: 21 IITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEK 80
+I LL + Q+ ++ S E H + A H + Y +LE++ R+KI+ EN + K
Sbjct: 6 LIFLLGAVLVQLSAALSLTNLLADEWH-LFKATHKKEYPSQLEEKFRMKIYLENKHKVAK 64
Query: 81 AN---KEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
N ++G ++Y++ N+F DL + EFR++ GY+ + S STF + + +VP
Sbjct: 65 HNILYEKGEKSYQVAMNKFGDLLHHEFRSIMNGYQHKK-QNSSRAESTFTFMEPANVEVP 123
Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GN 196
S+DWR KGA+TP+K+Q +CG CWAF++ A+EG T ++G LI LSEQ L+DCS GN
Sbjct: 124 ESVDWRVKGAITPVKDQGQCGSCWAFSSTGALEGQTFRKTGKLISLSEQNLIDCSGKYGN 183
Query: 197 NGCLGGSREKAFAYIIQNQGI 217
GC GG ++AF YI N+GI
Sbjct: 184 EGCNGGLMDQAFQYIKDNKGI 204
Score = 72.0 bits (175), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 31/56 (55%), Positives = 42/56 (75%), Gaps = 2/56 (3%)
Query: 226 LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 280
LDH V +VG+G+ ++G +YWL+KNSW WGD GY+KI R+ + CGI T +SYPL
Sbjct: 283 LDHGVLVVGYGS-DNGKDYWLVKNSWSEHWGDEGYIKIARNRKNHCGIATAASYPL 337
>gi|149392541|gb|ABR26073.1| oryzain gamma chain precursor [Oryza sativa Indica Group]
Length = 367
Score = 151 bits (381), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 95/303 (31%), Positives = 142/303 (46%), Gaps = 77/303 (25%)
Query: 49 KWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALY 108
++ +HG+ Y D E + R +IF E+LE + N+ G Y+LG N+F+D++ +EF+A
Sbjct: 69 RFAVRHGKRYGDAAEVQRRFRIFSESLELVRSTNRRG-LPYRLGINRFADMSWEEFQASR 127
Query: 109 TGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAA 168
G + + + ++ +P + DWR+ G V+P+K+Q CG CW F+ +
Sbjct: 128 LG-----AAQNCSATLAGNHRMRDAAALPETKDWREDGIVSPVKDQGHCGSCWTFSTTGS 182
Query: 169 VEGITKIRSGNLIQLSEQQLLDCSTNGNN-GCLGGSREKAFAYIIQNQGI---------- 217
+E +G + LSEQQL+DC+T NN GC GG +AF YI N G+
Sbjct: 183 LEAAYTQATGKPVSLSEQQLVDCATAYNNFGCSGGLPSQAFEYIKYNGGLDTEEAYPYTG 242
Query: 218 FNGVCG---------------------TQLDHAVTIV-----------GF---------- 235
NG+C +L +AV +V GF
Sbjct: 243 VNGICHYKPENVGVKVLDSVNITLGAEDELKNAVGLVRPVSVAFQVINGFRMYKSGVYTS 302
Query: 236 ---GTT---------------EDGANYWLIKNSWGNTWGDAGYMKIVRDEGLCGIGTRSS 277
GT+ E+G YWLIKNSWG WGD GY K+ + +CGI T +S
Sbjct: 303 DHCGTSPMDVNHAVLAVGYGVENGVPYWLIKNSWGADWGDNGYFKMEMGKNMCGIATCAS 362
Query: 278 YPL 280
YP+
Sbjct: 363 YPI 365
>gi|156399477|ref|XP_001638528.1| predicted protein [Nematostella vectensis]
gi|156225649|gb|EDO46465.1| predicted protein [Nematostella vectensis]
Length = 325
Score = 151 bits (381), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 84/200 (42%), Positives = 116/200 (58%), Gaps = 17/200 (8%)
Query: 19 FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
++ L+ C S++ R H W HG++Y E E+++R I+ +NLE +
Sbjct: 8 LLVAVLIAQCFSELSQDRQWH---------AWKDFHGKTYTGE-EEDLRRAIWNDNLEIV 57
Query: 79 EKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPT 138
+K N E N +YKL N F+DLT EF+ + GY+ S ST STF LS +P
Sbjct: 58 KKHNAE-NHSYKLDMNHFADLTVTEFKQRFMGYRAAS---NSTGGSTF--LPLSNVQLPA 111
Query: 139 SLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNN 197
+DWRDKG VT +KNQ +CG CWAF++ ++EG ++G L+ LSEQ L+DCS GNN
Sbjct: 112 EVDWRDKGFVTAVKNQGQCGSCWAFSSTGSLEGQHFRKTGKLVSLSEQNLVDCSKKYGNN 171
Query: 198 GCLGGSREKAFAYIIQNQGI 217
GC GG + AF YI N GI
Sbjct: 172 GCEGGLMDYAFKYIKNNDGI 191
Score = 73.6 bits (179), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 34/58 (58%), Positives = 43/58 (74%), Gaps = 2/58 (3%)
Query: 224 TQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 280
TQLDH V VG+G EDG +YWL+KNSWG WG GY+K+ R+ + CGI T++SYPL
Sbjct: 268 TQLDHGVLAVGYGA-EDGKDYWLVKNSWGEGWGMNGYIKMSRNKDNQCGIATQASYPL 324
>gi|18858809|ref|NP_571273.1| cathepsin L, 1 b precursor [Danio rerio]
gi|1752664|emb|CAA69623.1| cathepsin L [Danio rerio]
Length = 336
Score = 151 bits (381), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 79/202 (39%), Positives = 121/202 (59%), Gaps = 9/202 (4%)
Query: 20 IIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIE 79
++ LLV+ V + + + + + W +QHG+SY +++E R+ I++ENL IE
Sbjct: 1 MMFALLVTLYISAVFAAPSIDIQLDDHWNSWKSQHGKSYHEDVEVGRRM-IWEENLRKIE 59
Query: 80 KANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDV 136
+ N E GN T+K+G NQF D+TN+EFR GY + TS + S
Sbjct: 60 QHNFEYSYGNHTFKMGMNQFGDMTNEEFRQAMNGYTHDP----NQTSQGPLFMEPSFFAA 115
Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-TNG 195
P +DWR +G VTP+K+QK+CG CW+F++ A+EG ++G LI +SEQ L+DCS G
Sbjct: 116 PQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQG 175
Query: 196 NNGCLGGSREKAFAYIIQNQGI 217
N GC GG ++AF Y+ +N+G+
Sbjct: 176 NQGCNGGLMDQAFQYVKENKGL 197
Score = 66.2 bits (160), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 28/61 (45%), Positives = 42/61 (68%), Gaps = 4/61 (6%)
Query: 224 TQLDHAVTIVGFG---TTEDGANYWLIKNSWGNTWGDAGYMKIVRDE-GLCGIGTRSSYP 279
++LDHAV +VG+G G YW++KNSW + WGD GY+ + +D+ CG+ T++SYP
Sbjct: 275 SRLDHAVLVVGYGYQGADVAGNRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGVATKASYP 334
Query: 280 L 280
L
Sbjct: 335 L 335
>gi|341850671|gb|AEK97329.1| chromoplast senescence-associated protein 12 [Brassica rapa var.
parachinensis]
Length = 260
Score = 151 bits (381), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 93/258 (36%), Positives = 125/258 (48%), Gaps = 73/258 (28%)
Query: 95 QFSDLTNDEFRALYTGYKMPS--PSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIK 152
QF+++TNDEFR++YTGYK S S T S++F+YQN+S +P ++DWR KGAVTPIK
Sbjct: 1 QFAEITNDEFRSMYTGYKGDSVLSSQSQTKSTSFRYQNVSSGALPIAVDWRKKGAVTPIK 60
Query: 153 NQKECGCCWAF---AAVAAVEGITKIR-------------------SGNLIQLSEQQLL- 189
NQ CGCCWAF AA+ I K + SG LI + + ++
Sbjct: 61 NQGSCGCCWAFSAVAAIEGATQIKKGKLISLSEQQLVDCDTNDFGCSGGLIDTAFEHIMA 120
Query: 190 -----------------------------------DCSTNGNNGCLG---------GSRE 205
D N N + G
Sbjct: 121 TGGLTTESNYPYKGEDATCKIKSTXPSAASITGYEDVPVNDENALMKAVAHQPVSVGIEG 180
Query: 206 KAFAYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR 265
F + + G+F G C T LDHAVT VG+ + G+ YW+IKNSWG WG+ GYM+I +
Sbjct: 181 GGFDFQFYSSGVFTGECTTYLDHAVTAVGYSQSSAGSKYWIIKNSWGTKWGEGGYMRIKK 240
Query: 266 D----EGLCGIGTRSSYP 279
D EGLCG+ ++SYP
Sbjct: 241 DIKDKEGLCGLAMKASYP 258
>gi|326516056|dbj|BAJ88051.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 362
Score = 150 bits (380), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 105/306 (34%), Positives = 142/306 (46%), Gaps = 83/306 (27%)
Query: 49 KWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALY 108
++ +G+SY+ E R +IF E+LE + N++G Y+LG N+FSD++ +EF+A
Sbjct: 63 RFAVGYGKSYESAAEVRRRFRIFSESLEEVRSTNRKG-LPYRLGINRFSDMSWEEFQATR 121
Query: 109 TGYKMPSPSHRSTTSSTFKYQNLSMTD---VPTSLDWRDKGAVTPIKNQKECGCCWAFAA 165
G T S+T +L M D +P + DWR+ G V+P+KNQ CG CW F+
Sbjct: 122 LGAAQ-------TCSATLAGNHL-MRDAAALPETKDWREDGIVSPVKNQAHCGSCWTFST 173
Query: 166 VAAVEGITKIRSGNLIQLSEQQLLDCSTNGNN-GCLGGSREKAFAYIIQNQGI------- 217
A+E +G I LSEQQL+DC+ NN GC GG +AF YI N GI
Sbjct: 174 TGALEAAYTQATGKNISLSEQQLVDCAGGFNNFGCNGGLPSQAFEYIKYNGGIDTEESYP 233
Query: 218 ---FNGVCG---------------------TQLDHAVTIV-----------GF------- 235
NGVC +L +AV +V GF
Sbjct: 234 YKGVNGVCHYKAENAAVQVLDSVNITLNAEDELKNAVGLVRPVSVAFQVIDGFRQYKSGV 293
Query: 236 ------GTTEDGAN---------------YWLIKNSWGNTWGDAGYMKIVRDEGLCGIGT 274
GTT D N YWLIKNSWG WGD GY K+ + +C I T
Sbjct: 294 YTSDHCGTTPDDVNHAVLAVGYGVENGVPYWLIKNSWGADWGDNGYFKMEMGKNMCAIAT 353
Query: 275 RSSYPL 280
+SYP+
Sbjct: 354 CASYPV 359
>gi|151573014|gb|ABS17682.1| cathepsin L-1 [Artemia salina]
Length = 334
Score = 150 bits (380), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 80/201 (39%), Positives = 122/201 (60%), Gaps = 6/201 (2%)
Query: 21 IITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEK 80
+I LL + Q+ ++ S E H + A H + Y +LE++ R+KI+ EN + K
Sbjct: 2 LIFLLGAVLVQLSAALSLTNLLADEWH-LFKATHKKEYPSQLEEKFRMKIYLENKHKVAK 60
Query: 81 AN---KEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
N ++G ++Y + N+F DL + EFR++ GY+ + S STF + + VP
Sbjct: 61 HNILYEKGEKSYHVAMNKFGDLLHHEFRSIMNGYQHKK-QNSSRAESTFTFMEPANVTVP 119
Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GN 196
S+DWR+KGA+TP+K+Q +CG CWAF++ A+EG T ++G L+ LSEQ L+DCS GN
Sbjct: 120 ESVDWREKGAITPVKDQGQCGSCWAFSSTGALEGQTFRKTGKLVSLSEQNLIDCSGKYGN 179
Query: 197 NGCLGGSREKAFAYIIQNQGI 217
GC GG ++AF YI N+GI
Sbjct: 180 EGCNGGLMDQAFQYIKDNKGI 200
Score = 68.9 bits (167), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 28/56 (50%), Positives = 42/56 (75%), Gaps = 2/56 (3%)
Query: 226 LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 280
LDH V +VG+G+ ++G +YWL+KNSW WGD GY+K+ R+ + CG+ + +SYPL
Sbjct: 279 LDHGVLVVGYGS-DNGKDYWLVKNSWSEHWGDEGYIKMARNRKNHCGVASAASYPL 333
>gi|7239343|gb|AAF43193.1|AF228731_1 cathepsin L [Stylonychia lemnae]
Length = 340
Score = 150 bits (380), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 100/325 (30%), Positives = 161/325 (49%), Gaps = 85/325 (26%)
Query: 30 SQVVSSRS--THEQSVVE-IHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGN 86
S+ SS+S T +Q ++ +H +M++ ++YK + E EMRL+ +K N+ +I N + +
Sbjct: 24 SETSSSQSLYTADQDHIDFVH--FMSRFSKAYKSKEEFEMRLQQYKSNIAFINNHNSQND 81
Query: 87 RT-YKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDK 145
T + LG N +D T+DE++ + GYK + + + Y ++ D+P S+DWR+K
Sbjct: 82 GTSFTLGPNHLADYTHDEYKKM-LGYKPRNKTGKEV------YSTPNLKDIPESIDWREK 134
Query: 146 GAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG----NNGCLG 201
GAV +K+Q +CG CWAF+ +A++E I +G L LSEQQL+DCS NG N G +G
Sbjct: 135 GAVNAVKDQGQCGSCWAFSTIASLESRYFIETGKLQSLSEQQLVDCSKNGNEGCNGGDMG 194
Query: 202 GSR-----------EKAFAYIIQNQ----------------------------------- 215
+ EK + Y+ ++Q
Sbjct: 195 LAMDYIASAGGVETEKDYPYVGKDQTCAFEASKEVATDKGHINIVPGKFATLQAAIAEGP 254
Query: 216 -----------------GIFNGV-CGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGD 257
GIF+ CGT LDH V VG+G ++G Y++++NSW ++WG
Sbjct: 255 VSVAIEADSLFFQFYRSGIFDSSWCGTNLDHGVAAVGYGV-DNGKQYYIVRNSWSDSWGL 313
Query: 258 AGYMKIVRD---EGLCGIGTRSSYP 279
GY+ I+ + G+CGI P
Sbjct: 314 KGYINIIANGDGNGMCGIQMEPVVP 338
>gi|242094000|ref|XP_002437490.1| hypothetical protein SORBIDRAFT_10g028000 [Sorghum bicolor]
gi|241915713|gb|EER88857.1| hypothetical protein SORBIDRAFT_10g028000 [Sorghum bicolor]
Length = 372
Score = 150 bits (380), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 72/181 (39%), Positives = 112/181 (61%), Gaps = 6/181 (3%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQF 96
+ V ++E W ++HG + + +RL++F++NL YI+ N E G T++LG F
Sbjct: 45 DDEVRRMYEAWKSEHGHGHGSD--DRLRLEVFRDNLRYIDAHNAEADAGLHTFRLGLTPF 102
Query: 97 SDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKE 156
+DLT +E+R G++ S + D+P ++DWR+ GAVT +KNQ++
Sbjct: 103 ADLTLEEYRGRALGFRARRGGASRVGSGSSYRPRPRGGDLPDAIDWRELGAVTGVKNQEQ 162
Query: 157 CGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQG 216
CG CWAF+AVAA+EGI +I +GNL+ LSEQ+++DC T + GC GG + AF ++I N G
Sbjct: 163 CGGCWAFSAVAAIEGINEIVTGNLVSLSEQEIIDCDTQ-DGGCNGGEMQNAFQFVINNGG 221
Query: 217 I 217
I
Sbjct: 222 I 222
Score = 88.2 bits (217), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 39/69 (56%), Positives = 53/69 (76%), Gaps = 5/69 (7%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCG 271
GIFNG CGTQLDH VT VG+G+ E+G +YW++KNSW ++WG+AGY++I R+ G CG
Sbjct: 289 GIFNGPCGTQLDHGVTAVGYGS-ENGKDYWIVKNSWSSSWGEAGYIRIRRNVAAATGKCG 347
Query: 272 IGTRSSYPL 280
I +SYP+
Sbjct: 348 IAMDASYPV 356
>gi|160858205|dbj|BAF93840.1| triticain beta 2 [Triticum aestivum]
Length = 469
Score = 150 bits (380), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 76/186 (40%), Positives = 117/186 (62%), Gaps = 10/186 (5%)
Query: 40 EQSVVEIHEKWMAQHGR-SYKDE---LEKEMRLKIFKENLEYIEKANKE---GNRTYKLG 92
E +++ W+A+HG SY + E+E R + F +NL +++ N G ++L
Sbjct: 43 EAEARAVYDLWLAEHGGGSYPNANSIPERERRFRAFWDNLRFVDAHNARAAAGEEGFRLA 102
Query: 93 TNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIK 152
N+F+DLTNDEFRA Y G K R +Y++ ++P ++DWR+KGAV P+K
Sbjct: 103 MNRFADLTNDEFRAAYLGVK--GQRARPGRVVGERYRHDGAEELPEAVDWREKGAVAPVK 160
Query: 153 NQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFAYI 211
NQ +CG CWAF+A++ VE I +I +G ++ LSEQ+L++C TNG ++GC GG + AF +I
Sbjct: 161 NQGQCGSCWAFSAISTVESINQIVTGEMVTLSEQELVECDTNGQSSGCNGGLMDDAFEFI 220
Query: 212 IQNQGI 217
I+N GI
Sbjct: 221 IKNGGI 226
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 40/82 (48%), Positives = 56/82 (68%), Gaps = 9/82 (10%)
Query: 202 GSREKAFAYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYM 261
G RE + + + G+F+G CGTQLDH V VG+GT E+G +YW+++NSWG WG+AGY+
Sbjct: 283 GGRE----FQLYHSGVFSGRCGTQLDHGVVAVGYGT-ENGKDYWIVRNSWGPNWGEAGYL 337
Query: 262 KIVRD----EGLCGIGTRSSYP 279
++ R+ G CGI SSYP
Sbjct: 338 RMERNINVTSGKCGIAMMSSYP 359
>gi|320169658|gb|EFW46557.1| cathepsin L2 [Capsaspora owczarzaki ATCC 30864]
Length = 324
Score = 150 bits (380), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 78/172 (45%), Positives = 109/172 (63%), Gaps = 6/172 (3%)
Query: 48 EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRAL 107
+ W A HG SY E+ R I++ NL++IEK N EG+ +YKL N+F+DLT EF A
Sbjct: 23 DSWKATHGVSYATVGEETARRGIYRANLDFIEKHNSEGH-SYKLAVNKFADLTYPEFAAK 81
Query: 108 YTGYKMPSP-SHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAV 166
Y G + + + +S +ST+ + M +P S+DWR G VTPIK+Q +CG CW+F+
Sbjct: 82 YLGLRFDATNATKSFAASTYLPR---MVSLPDSVDWRTAGIVTPIKDQGQCGSCWSFSTT 138
Query: 167 AAVEGITKIRSGNLIQLSEQQLLDCST-NGNNGCLGGSREKAFAYIIQNQGI 217
+VEG ++G L+ LSEQ L+DCS+ GN GC GG ++AF YII N GI
Sbjct: 139 GSVEGQHARKTGQLVSLSEQNLVDCSSAQGNAGCNGGLMDQAFQYIISNNGI 190
Score = 68.6 bits (166), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 32/68 (47%), Positives = 46/68 (67%), Gaps = 4/68 (5%)
Query: 216 GIFN--GVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE-GLCGI 272
G++N +QLDH V VG+GT+ ++YWL+KNSWG +WG +GY+ + R+ CGI
Sbjct: 257 GVYNEPACSSSQLDHGVLAVGYGTS-GSSDYWLVKNSWGTSWGQSGYIWMTRNSNNQCGI 315
Query: 273 GTRSSYPL 280
T +SYPL
Sbjct: 316 ATAASYPL 323
>gi|7381610|gb|AAF61565.1|AF227957_1 cathepsin L-like proteinase precursor [Rhipicephalus microplus]
Length = 332
Score = 150 bits (380), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 84/202 (41%), Positives = 120/202 (59%), Gaps = 9/202 (4%)
Query: 21 IITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEK 80
++ L V CA V+ ++ ++ + E + H +SY+ +E+ +R KIF EN I K
Sbjct: 1 MLRLSVLCAIVAVTVAASSQEILRTQWEAFKTTHKKSYQSHMEELLRFKIFTENSLIIAK 60
Query: 81 ANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTF-KYQNLSMTDV 136
N + G +YKLG NQF DL EF ++ G+ R T STF N++ + +
Sbjct: 61 HNAKYAKGLVSYKLGMNQFGDLLAHEFARIFNGHH----GTRKTGGSTFLPPANVNDSSL 116
Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-G 195
P +DWR KGAVTP+K+Q +CG CWAF+A ++EG +++G L+ LSEQ L+DCS + G
Sbjct: 117 PKVVDWRKKGAVTPVKDQGQCGSCWAFSATGSLEGQHFLKNGELVSLSEQNLVDCSQSFG 176
Query: 196 NNGCLGGSREKAFAYIIQNQGI 217
NNGC GG E AF YI N GI
Sbjct: 177 NNGCEGGLMEDAFKYIKANDGI 198
Score = 67.4 bits (163), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 31/75 (41%), Positives = 51/75 (68%), Gaps = 4/75 (5%)
Query: 209 AYIIQNQGIFNGV-CGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD 266
++ + ++G+++ C ++ LDH V +VG+G + G YWL+KNSW +WGD GY+ + RD
Sbjct: 258 SFQLYSEGVYDEPECSSEDLDHGVLVVGYGV-KGGKKYWLVKNSWAESWGDQGYILMSRD 316
Query: 267 -EGLCGIGTRSSYPL 280
CGI +++SYPL
Sbjct: 317 NNNQCGIASQASYPL 331
>gi|402770507|gb|AFQ98388.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 150 bits (380), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 81/201 (40%), Positives = 120/201 (59%), Gaps = 7/201 (3%)
Query: 21 IITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEK 80
++ L V CA V+ ++ ++ + E + H ++Y+ +E+ +R KIF EN I K
Sbjct: 1 MLRLSVLCAIVAVTVAASSQEILRTQWEAFKTTHKKTYQSHMEELLRFKIFTENSLIIAK 60
Query: 81 ANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
N + G +YKLG NQF DL EF ++ GY S +S S+ N++ + +P
Sbjct: 61 HNAKYAKGLVSYKLGMNQFGDLLAHEFARIFNGYH---GSRKSGGSTFLPPANVNDSSLP 117
Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GN 196
++DWR KGAVTP+K+Q +CG CWAF+ ++EG +++G L+ LSEQ L+DCS + GN
Sbjct: 118 KAVDWRKKGAVTPVKDQGQCGSCWAFSTTGSLEGQHFLKNGELVSLSEQNLVDCSQSFGN 177
Query: 197 NGCLGGSREKAFAYIIQNQGI 217
NGC GG E AF YI N GI
Sbjct: 178 NGCEGGLMEDAFKYIKANDGI 198
Score = 67.0 bits (162), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 31/75 (41%), Positives = 51/75 (68%), Gaps = 4/75 (5%)
Query: 209 AYIIQNQGIFNGV-CGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD 266
++ + ++G+++ C ++ LDH V +VG+G + G YWL+KNSW +WGD GY+ + RD
Sbjct: 258 SFQLYSEGVYDEPECSSEDLDHGVLVVGYGV-KGGKKYWLVKNSWAESWGDQGYILMSRD 316
Query: 267 -EGLCGIGTRSSYPL 280
CGI +++SYPL
Sbjct: 317 NNNQCGIASQASYPL 331
>gi|297744465|emb|CBI37727.3| unnamed protein product [Vitis vinifera]
Length = 331
Score = 150 bits (380), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 83/200 (41%), Positives = 117/200 (58%), Gaps = 37/200 (18%)
Query: 48 EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRAL 107
E W+++HG+ YK EK R ++F+ENL +I++ NKE + +Y LG N+F+DL+++EF++
Sbjct: 50 ESWVSKHGKVYKSMEEKLHRFEVFRENLNHIDERNKEVS-SYWLGLNEFADLSHEEFKSK 108
Query: 108 YTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVA 167
+ D+P S+DWR KGAVT +KNQ CG CWAF+ VA
Sbjct: 109 ------------------------DVADLPESVDWRKKGAVTHVKNQGACGSCWAFSTVA 144
Query: 168 AVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIFN-------- 219
AVEGI +I +GNL LSEQ+L+DC T N+GC GG + AFA+I N G+
Sbjct: 145 AVEGINQIVTGNLTTLSEQELIDCDTTFNSGCNGGLMDYAFAFIASNGGLHKEDDYPYLM 204
Query: 220 --GVCGTQLDHA--VTIVGF 235
G C Q + VTI G+
Sbjct: 205 EEGTCEEQKEDVDIVTISGY 224
Score = 80.1 bits (196), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 35/68 (51%), Positives = 50/68 (73%), Gaps = 5/68 (7%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCG 271
G+FNG CGT+LDH V VG+G+++ G +Y ++KNSWG WG+ GY+++ R+ EGLCG
Sbjct: 261 GVFNGPCGTELDHGVAAVGYGSSK-GLDYIIVKNSWGPKWGEKGYIRMKRNTGKTEGLCG 319
Query: 272 IGTRSSYP 279
I +SYP
Sbjct: 320 INKMASYP 327
>gi|118158|sp|P12412.1|CYSEP_VIGMU RecName: Full=Vignain; AltName: Full=Bean endopeptidase; AltName:
Full=Cysteine proteinase; AltName:
Full=Sulfhydryl-endopeptidase; Short=SH-EP; Contains:
RecName: Full=Vignain-1; Contains: RecName:
Full=Vignain-2; Flags: Precursor
gi|22062|emb|CAA33753.1| sulfhydryl-pre-endopeptidase (AA -20 to 342) [Vigna mungo]
gi|22066|emb|CAA36181.1| sulfhydryl-endopeptidase [Vigna mungo]
Length = 362
Score = 150 bits (379), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 79/183 (43%), Positives = 113/183 (61%), Gaps = 11/183 (6%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
E+S+ +++E+W + H S + EK R +FK N+ ++ NK ++ YKL N+F+D+
Sbjct: 33 EESLWDLYERWRSHHTVS-RSLGEKHKRFNVFKANVMHVHNTNKM-DKPYKLKLNKFADM 90
Query: 100 TNDEFRALYTG-----YKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQ 154
TN EFR+ Y G +KM S S TF Y+ + VP S+DWR KGAVT +K+Q
Sbjct: 91 TNHEFRSTYAGSKVNHHKMFRGSQHG--SGTFMYEKVG--SVPASVDWRKKGAVTDVKDQ 146
Query: 155 KECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQN 214
+CG CWAF+ + AVEGI +I++ L+ LSEQ+L+DC N GC GG E AF +I Q
Sbjct: 147 GQCGSCWAFSTIVAVEGINQIKTNKLVSLSEQELVDCDKEENQGCNGGLMESAFEFIKQK 206
Query: 215 QGI 217
GI
Sbjct: 207 GGI 209
Score = 90.9 bits (224), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 37/70 (52%), Positives = 51/70 (72%), Gaps = 4/70 (5%)
Query: 215 QGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLC 270
+G+F G C T L+H V IVG+GTT DG NYW+++NSWG WG+ GY+++ R+ EGLC
Sbjct: 275 EGVFTGDCNTDLNHGVAIVGYGTTVDGTNYWIVRNSWGPEWGEQGYIRMQRNISKKEGLC 334
Query: 271 GIGTRSSYPL 280
GI +SYP+
Sbjct: 335 GIAMMASYPI 344
>gi|445927|prf||1910332A Cys endopeptidase
Length = 362
Score = 150 bits (379), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 79/183 (43%), Positives = 113/183 (61%), Gaps = 11/183 (6%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
E+S+ +++E+W + H S + EK R +FK N+ ++ NK ++ YKL N+F+D+
Sbjct: 33 EESLWDLYERWRSHHTVS-RSLGEKHKRFNVFKANVMHVHNTNKM-DKPYKLKLNKFADM 90
Query: 100 TNDEFRALYTG-----YKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQ 154
TN EFR+ Y G +KM S S TF Y+ + VP S+DWR KGAVT +K+Q
Sbjct: 91 TNHEFRSTYAGSKVNHHKMFRGSQHG--SGTFMYEKVG--SVPASVDWRKKGAVTDVKDQ 146
Query: 155 KECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQN 214
+CG CWAF+ + AVEGI +I++ L+ LSEQ+L+DC N GC GG E AF +I Q
Sbjct: 147 GQCGSCWAFSTIVAVEGINQIKTNKLVSLSEQELVDCDKEENQGCNGGLMESAFEFIKQK 206
Query: 215 QGI 217
GI
Sbjct: 207 GGI 209
>gi|255540425|ref|XP_002511277.1| cysteine protease, putative [Ricinus communis]
gi|46395620|sp|O65039.1|CYSEP_RICCO RecName: Full=Vignain; AltName: Full=Cysteine endopeptidase; Flags:
Precursor
gi|2944446|gb|AAC62396.1| cysteine endopeptidase precursor [Ricinus communis]
gi|223550392|gb|EEF51879.1| cysteine protease, putative [Ricinus communis]
Length = 360
Score = 150 bits (379), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 80/202 (39%), Positives = 119/202 (58%), Gaps = 21/202 (10%)
Query: 46 IHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFR 105
++E+W + H S + EK+ R +FK N ++ ANK ++ YKL N+F+D+TN EFR
Sbjct: 37 LYERWRSHHTVS-RSLHEKQKRFNVFKHNAMHVHNANKM-DKPYKLKLNKFADMTNHEFR 94
Query: 106 ALYTGYKMPSPSHR-----STTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCC 160
Y+G K+ HR + TF Y+ + VP S+DWR KGAVT +K+Q +CG C
Sbjct: 95 NTYSGSKVKH--HRMFRGGPRGNGTFMYEKVDT--VPASVDWRKKGAVTSVKDQGQCGSC 150
Query: 161 WAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI--- 217
WAF+ + AVEGI +I++ L+ LSEQ+L+DC T+ N GC GG + AF +I Q GI
Sbjct: 151 WAFSTIVAVEGINQIKTNKLVSLSEQELVDCDTDQNQGCNGGLMDYAFEFIKQRGGITTE 210
Query: 218 -------FNGVCGTQLDHAVTI 232
++G C ++A +
Sbjct: 211 ANYPYEAYDGTCDVSKENAPAV 232
>gi|402770503|gb|AFQ98386.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 150 bits (379), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 83/202 (41%), Positives = 121/202 (59%), Gaps = 9/202 (4%)
Query: 21 IITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEK 80
++ L V CA V+ ++ ++ + E + H ++Y+ +E+ +R KIF EN I K
Sbjct: 1 MLRLSVLCAIVAVTVAASSQEILRTQWEAFKTTHKKTYQSHMEELLRFKIFTENSLIIAK 60
Query: 81 ANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTF-KYQNLSMTDV 136
N + G +YKLG NQF DL EF ++ G+ R T STF N++ + +
Sbjct: 61 HNAKYAKGLVSYKLGMNQFGDLLAHEFARIFNGHH----GTRKTGGSTFLPPANVNDSSL 116
Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-G 195
P +DWR KGAVTP+K+Q +CG CWAF+A ++EG +++G L+ LSEQ L+DCS + G
Sbjct: 117 PKVVDWRKKGAVTPVKDQGQCGSCWAFSATGSLEGRHFLKNGELVSLSEQNLVDCSQSFG 176
Query: 196 NNGCLGGSREKAFAYIIQNQGI 217
NNGC GG E AF YI +N GI
Sbjct: 177 NNGCEGGLMEDAFKYIKENDGI 198
Score = 67.4 bits (163), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 31/75 (41%), Positives = 51/75 (68%), Gaps = 4/75 (5%)
Query: 209 AYIIQNQGIFNGV-CGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD 266
++ + ++G+++ C ++ LDH V +VG+G + G YWL+KNSW +WGD GY+ + RD
Sbjct: 258 SFQLYSEGVYDEPECSSEDLDHGVLVVGYGV-KGGKKYWLVKNSWAESWGDQGYILMSRD 316
Query: 267 -EGLCGIGTRSSYPL 280
CGI +++SYPL
Sbjct: 317 NNNQCGIASQASYPL 331
>gi|395535909|ref|XP_003769963.1| PREDICTED: cathepsin S [Sarcophilus harrisii]
Length = 347
Score = 150 bits (379), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 84/221 (38%), Positives = 129/221 (58%), Gaps = 13/221 (5%)
Query: 3 LIFERSGSFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDEL 62
+IF+ S S N M ++I + ++CAS R H+ + E W +G+ Y+++
Sbjct: 1 MIFQDSKSSPANLLRMKVVIWMFLACASTTAYLR--HDPMLDNHWELWKKTYGKQYEEQN 58
Query: 63 EKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHR 119
++ R I+++NL+++ N E G +Y L N SD+T++E +L + ++P+ R
Sbjct: 59 QEVTRRLIWEKNLKFVTLHNLEHSMGLHSYDLSMNHLSDMTSEEVASLMSSLRIPNQWSR 118
Query: 120 STTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGN 179
+TT Y+ S +P S+DWRDKG VT +K Q CG CWAF+AV A+E K+++G
Sbjct: 119 NTT-----YRLNSNQKLPDSVDWRDKGCVTEVKYQGTCGSCWAFSAVGALEAQLKLKTGK 173
Query: 180 LIQLSEQQLLDCSTN---GNNGCLGGSREKAFAYIIQNQGI 217
L+ LS Q L+DCSTN N+GC GG +AF YII N GI
Sbjct: 174 LVSLSAQNLVDCSTNEKYENHGCNGGCMTEAFQYIIDNNGI 214
>gi|317135059|gb|ADV03094.1| cathepsin L [Hyriopsis cumingii]
gi|372126672|gb|AEX88474.1| cathepsin L [Hyriopsis schlegelii]
Length = 333
Score = 150 bits (379), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 111/342 (32%), Positives = 156/342 (45%), Gaps = 89/342 (26%)
Query: 16 TPMFIIITLLVSCASQV-VSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKEN 74
T + + L + A + VS+ + Q V I+ K H E+ +R ++K+N
Sbjct: 5 TVHIVTVFLCLKVADGLSVSALNIGWQEFVRIYNKTYRAH--------EEPVRYSVWKDN 56
Query: 75 LEYIEKANK---EGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNL 131
I + N +G TY L N++ DLTN+E+ L TG K+ + R FKY NL
Sbjct: 57 FLAINRHNSKADQGFHTYWLAMNEYGDLTNEEYFRLRTGLKINANIERRGL--VFKYTNL 114
Query: 132 SMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDC 191
S + P+ +DWR KG VTP+KNQ CG C+AF+A AVEG ++G L+ LSEQ ++DC
Sbjct: 115 S--EYPSEVDWRSKGYVTPVKNQGGCGSCYAFSATGAVEGQHFRKTGKLVSLSEQNIVDC 172
Query: 192 S-TNGNNGCLGGSREKAFAYIIQNQGI----------FNGVCGTQ--------------- 225
S GN GC GG +K+F YI N GI +G C +
Sbjct: 173 SFKEGNKGCRGGLMDKSFTYIKDNNGIDTEEAYPYEARDGPCRFRRSEVGATVRGYVDLP 232
Query: 226 ------LDHAVTIVG---------------------------------------FGTTED 240
L HAVT +G +GT D
Sbjct: 233 ENDEIALQHAVTTIGPISVAIDGHHFNFRFYHHGVFDNPNCSKTKINHGVLVVGYGT-RD 291
Query: 241 GANYWLIKNSWGNTWGDAGYMKIVR-DEGLCGIGTRSSYPLA 281
G +YWL+KNSWG WG GY+ + R ++ C I +SYP+
Sbjct: 292 GLDYWLVKNSWGERWGAEGYILMSRNNDNQCCITCAASYPIV 333
>gi|16444924|dbj|BAB70669.1| cysteine proteinase [Daucus carota]
Length = 208
Score = 150 bits (378), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 78/180 (43%), Positives = 118/180 (65%), Gaps = 7/180 (3%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
++S+ +++E+W + H S +D EK++R +FK N+++I K N + N+ YKL N+F+D+
Sbjct: 33 DESLWDLYERWRSHHTVS-RDLTEKQIRFNVFKTNVKHIHKVN-QMNKPYKLEVNKFADM 90
Query: 100 TNDEFRALYTGYKMPS-PSHRSTTSST-FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKEC 157
T EFR Y G K+ S R + T F ++N +P+S+DWR GAVTPIKNQ C
Sbjct: 91 TYHEFRNSYGGSKVKHFRSLRGDRARTGFMHENTK--HLPSSVDWRKHGAVTPIKNQGRC 148
Query: 158 GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
G CWAF+A+ VEGI KI++ L+ LSEQ+L+DC ++ N GC GG E A +I ++ G+
Sbjct: 149 GSCWAFSAIVGVEGINKIKTNQLVSLSEQELVDCESD-NQGCNGGLMENALEFIKRSGGV 207
>gi|151573016|gb|ABS17683.1| cathepsin L-1 [Artemia persimilis]
Length = 334
Score = 150 bits (378), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 79/201 (39%), Positives = 123/201 (61%), Gaps = 6/201 (2%)
Query: 21 IITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEK 80
+I LL + Q+ ++ S E H + A H + Y +LE++ R+KI+ EN + K
Sbjct: 2 LIFLLGAVFVQLSAALSLTNLLADEWH-LFKATHKKEYPSQLEEKFRMKIYLENKHKVAK 60
Query: 81 AN---KEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
N ++G ++Y++ N+F DL + EFR++ GY+ + S STF + + +VP
Sbjct: 61 HNILFEKGEKSYQVAMNKFGDLLHHEFRSIMNGYQHKK-QNSSRAESTFTFMEPANVEVP 119
Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GN 196
S+DWR+KGA+TP+K+Q +CG CWAF++ A+EG T ++G L+ L EQ L+DCS GN
Sbjct: 120 ESVDWREKGAITPVKDQGQCGPCWAFSSTGALEGQTFRKTGKLVSLREQNLIDCSGKYGN 179
Query: 197 NGCLGGSREKAFAYIIQNQGI 217
GC GG ++AF YI N+GI
Sbjct: 180 EGCNGGLMDQAFQYIKDNKGI 200
Score = 71.2 bits (173), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 30/56 (53%), Positives = 42/56 (75%), Gaps = 2/56 (3%)
Query: 226 LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 280
LDH V +VG+G+ ++G +YWL+KNSW WGD GY+KI R+ + CG+ T +SYPL
Sbjct: 279 LDHGVLVVGYGS-DNGKDYWLVKNSWSEHWGDQGYIKIARNRKNHCGVATAASYPL 333
>gi|339765072|gb|AEK01110.1| cathepsin L [Cristaria plicata]
gi|397880684|gb|AFO67888.1| cathepsin L [Cristaria plicata]
Length = 333
Score = 150 bits (378), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 110/342 (32%), Positives = 154/342 (45%), Gaps = 88/342 (25%)
Query: 15 TTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKEN 74
+ P+ I+ L S VS+ + Q V H K + H E+ R ++KEN
Sbjct: 5 SIPIVIVFLHLKSADGLSVSALNIGWQEFVRTHNKTYSAH--------EELFRYAVWKEN 56
Query: 75 LEYIEKANK---EGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNL 131
+ I + N +G TY L N++ DLTN+E+ L TG+ M RS S FKY NL
Sbjct: 57 VLAINRHNSKADQGVHTYWLSMNEYGDLTNEEYFRLRTGFIMNGNIERS--GSIFKYTNL 114
Query: 132 SMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDC 191
S + P +DWR KG VT +K+Q CG C+AF+A A+EG ++G L+ LSEQ ++DC
Sbjct: 115 S--EYPRQVDWRRKGYVTRVKDQGGCGSCYAFSATGALEGQHFRKTGKLVSLSEQNIVDC 172
Query: 192 S-TNGNNGCLGGSREKAFAYIIQNQGI----------FNGVCG----------------- 223
S GN GC GG +K+F YI N GI +G C
Sbjct: 173 SFKEGNKGCKGGLMDKSFTYIKNNNGIDKEEAYPYEARDGPCRFRRSEVGATDRGYVDLP 232
Query: 224 ----TQLDHAVTIVG---------------------------------------FGTTED 240
T L HAV +G +GT +
Sbjct: 233 ENDETALRHAVATIGPISVAIDGHHFNFRFYDHGVFDNPNCSKTKINHGVLVVGYGT-RN 291
Query: 241 GANYWLIKNSWGNTWGDAGYMKIVR-DEGLCGIGTRSSYPLA 281
G +YW++KNSWG WG GY+ + R ++ C I +SYP+
Sbjct: 292 GLDYWMVKNSWGRGWGAKGYILMSRNNDNQCCIACAASYPIV 333
>gi|224133764|ref|XP_002321655.1| predicted protein [Populus trichocarpa]
gi|222868651|gb|EEF05782.1| predicted protein [Populus trichocarpa]
Length = 360
Score = 150 bits (378), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 76/181 (41%), Positives = 114/181 (62%), Gaps = 7/181 (3%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
E+S+ +++EKW + H S + EK R +F+ N+ ++ NK ++ YKL N+F+D+
Sbjct: 31 EESLWDLYEKWRSHHTVSTSLD-EKRKRFNVFRANVLHVHNTNKM-DKPYKLKLNKFADM 88
Query: 100 TNDEFRALYTGYKMPSPSH---RSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKE 156
TN EFR Y K+ + + +F Y N+ VP S+DWR KGAVTP+K+Q +
Sbjct: 89 TNHEFRTAYASSKVKHHTMFRGAPLGNGSFMYGNID--KVPASIDWRKKGAVTPVKDQGK 146
Query: 157 CGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQG 216
CG CWAF+ + AVEGI I++ LI LSEQ+L+DC+T N+GC GG + AF +I + +G
Sbjct: 147 CGSCWAFSTIVAVEGINFIKTNKLISLSEQELVDCNTGENHGCNGGLMDYAFEFITKQKG 206
Query: 217 I 217
I
Sbjct: 207 I 207
>gi|121531600|gb|ABM55485.1| digestive cysteine protease intestain [Leptinotarsa decemlineata]
Length = 326
Score = 150 bits (378), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 102/311 (32%), Positives = 144/311 (46%), Gaps = 82/311 (26%)
Query: 47 HEKWMA---QHGRSYKDELEKEMRLKIFKENLEYIEKAN---KEGNRTYKLGTNQFSDLT 100
++W+A HG++YK+ LE++ R IF+ NL I++ N +G TY LG +F+DLT
Sbjct: 20 EDQWIAFKQTHGKTYKNLLEEKTRFGIFQRNLIKIKEHNARYDKGEETYLLGVTRFADLT 79
Query: 101 NDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCC 160
++EF+ + G P R + T ++L +VP S+DW +KGAV +K+Q CG C
Sbjct: 80 HEEFKDILKGQIKNKP--RLNATPTVFPEDL---EVPDSIDWTEKGAVLEVKDQNPCGSC 134
Query: 161 WAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGC---------------LGGSR 204
WAF+A A+EG I + I LSEQQLLDCS GN C G
Sbjct: 135 WAFSATGALEGQNAILNNVKISLSEQQLLDCSAAYGNGNCKEGGDMSAAFEYVRDYGIQS 194
Query: 205 EKAFAYI--------------------------------------------------IQN 214
EK++ YI +
Sbjct: 195 EKSYPYIRKQTECQYDASKTILKIKGYKNVTTSEEGLRKAVGAIGPISIAMNSDPLQLYY 254
Query: 215 QGIFNGV-CGTQLDHAVTIVGFGTTEDGA---NYWLIKNSWGNTWGDAGYMKIVRD-EGL 269
GI +G C LDH V +VG+G + +W +KNSWG WG+ GY +I RD L
Sbjct: 255 SGIISGKGCSHDLDHGVLVVGYGKASQWSGETKFWRVKNSWGKIWGENGYFRIKRDANNL 314
Query: 270 CGIGTRSSYPL 280
CGI +YP+
Sbjct: 315 CGIADDPTYPV 325
>gi|169659203|dbj|BAG12786.1| putative cysteine protease [Sorogena stoianovitchae]
Length = 293
Score = 150 bits (378), Expect = 9e-34, Method: Compositional matrix adjust.
Identities = 97/283 (34%), Positives = 136/283 (48%), Gaps = 77/283 (27%)
Query: 63 EKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTT 122
E + RL +F E++ +E N +G+ +Y LG NQF+DLT +EF +LY G + + S +
Sbjct: 21 EDKHRLALFAESVRIVETENAKGH-SYTLGLNQFADLTTEEFSSLYLGLVLENKVQASES 79
Query: 123 SSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQ 182
L D ++DWR KGAVTP+K+QK CG CWAF+A A+EG +G LI
Sbjct: 80 VV------LQDGDSEENVDWRQKGAVTPVKDQKSCGSCWAFSATGAMEGALVKSTGKLIN 133
Query: 183 LSEQQLLDCST--NGNNGCL-----------GGSREKAFAY------------------- 210
LSEQQL+DC T NG NG L G + EK + Y
Sbjct: 134 LSEQQLVDCVTKCNGCNGGLMTAAFDYVLGRGRATEKDYPYKGVDGRCKQTATDNKIKGY 193
Query: 211 --IIQNQ---------------------------GIFNGVCGTQLDHAVTIVGFGTTEDG 241
+ QN G+ + CGT+LDH V VG+ G
Sbjct: 194 NNVPQNNYKALKAAVASPLSVAVNAAGTIQRYKSGVIDANCGTRLDHGVLAVGY----QG 249
Query: 242 ANYWLIKNSWGNTWGDAGYMKIVR-----DEGLCGIGTRSSYP 279
+YW++KNSWGN +G+ GY ++ G+CGI ++ P
Sbjct: 250 EDYWIVKNSWGNGYGENGYFRVKMGTQNGGAGVCGINMMAAQP 292
>gi|242094002|ref|XP_002437491.1| hypothetical protein SORBIDRAFT_10g028010 [Sorghum bicolor]
gi|241915714|gb|EER88858.1| hypothetical protein SORBIDRAFT_10g028010 [Sorghum bicolor]
Length = 397
Score = 149 bits (377), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 79/198 (39%), Positives = 117/198 (59%), Gaps = 21/198 (10%)
Query: 40 EQSVVEIHEKWMAQHGRSYKD----ELEKEMRLKIFKENLEYIEKANKE---GNRTYKLG 92
++ V ++E W ++HGR + E +RL++F++NL YI+ N E G T++LG
Sbjct: 47 DEEVRRMYEAWKSKHGRPRGNCDMAGDEDRLRLEVFRDNLRYIDAHNAEADAGLHTFRLG 106
Query: 93 TNQFSDLTNDEFRALYTGYKMP---SPSHRSTTSSTFKYQNLSMT----------DVPTS 139
F+DLT +E+R G++ PS R+ S S D+P +
Sbjct: 107 LTPFADLTLEEYRGRALGFRARHRGGPSARAAASRVGSGGTRSHHRRPRPRPRCGDLPDA 166
Query: 140 LDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGC 199
+DWR GAVT +KNQ++CG CWAF+AVAA+EGI I +GNL+ LSEQ+++DC T ++GC
Sbjct: 167 IDWRQLGAVTDVKNQEQCGGCWAFSAVAAIEGINAIVTGNLVSLSEQEIIDCDTQ-DSGC 225
Query: 200 LGGSREKAFAYIIQNQGI 217
GG E AF ++I N GI
Sbjct: 226 NGGQMENAFQFVIDNGGI 243
Score = 90.9 bits (224), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 50/119 (42%), Positives = 74/119 (62%), Gaps = 12/119 (10%)
Query: 166 VAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIFNGVCGTQ 225
VAA++G ++ S N L E + + + GG +AF + + GIFNG CGT
Sbjct: 268 VAAIDGFVEVASNNETALQEAVAIQPVSVAIDA--GG---RAFQHY--SSGIFNGPCGTN 320
Query: 226 LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 280
LDH VT+VG+G+ E+G YW++KNSW ++WG+AGY++I R+ G CGI +SYP+
Sbjct: 321 LDHGVTVVGYGS-ENGKAYWIVKNSWSDSWGEAGYIRIRRNVFLPVGKCGIAMDASYPV 378
>gi|81294188|gb|AAI08032.1| Cathepsin L, 1 b [Danio rerio]
Length = 336
Score = 149 bits (377), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 79/202 (39%), Positives = 120/202 (59%), Gaps = 9/202 (4%)
Query: 20 IIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIE 79
++ LLV+ V + + + + + W +QHG+SY +++E R+ I++ENL IE
Sbjct: 1 MMFALLVTLYISAVFAAPSIDIQLDDHWNSWKSQHGKSYHEDVEVGRRM-IWEENLRKIE 59
Query: 80 KANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDV 136
+ N E GN T+K+G NQF D+TN+EFR GY + TS + S
Sbjct: 60 QHNFEYSYGNHTFKMGMNQFGDMTNEEFRQAMNGYTHDP----NQTSQGPLFMEPSFFAA 115
Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-TNG 195
P +DWR +G VTP+K+QK+CG CW+F++ A+EG ++G LI +SEQ L+DCS G
Sbjct: 116 PQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQG 175
Query: 196 NNGCLGGSREKAFAYIIQNQGI 217
N GC GG + AF Y+ +N+G+
Sbjct: 176 NQGCNGGLMDLAFQYVKENKGL 197
Score = 66.2 bits (160), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 28/61 (45%), Positives = 42/61 (68%), Gaps = 4/61 (6%)
Query: 224 TQLDHAVTIVGFG---TTEDGANYWLIKNSWGNTWGDAGYMKIVRDE-GLCGIGTRSSYP 279
++LDHAV +VG+G G YW++KNSW + WGD GY+ + +D+ CG+ T++SYP
Sbjct: 275 SRLDHAVLVVGYGYQGADVAGNRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGVATKASYP 334
Query: 280 L 280
L
Sbjct: 335 L 335
>gi|227018328|gb|ACP18830.1| cysteine proteinase 1 [Chrysomela tremula]
Length = 323
Score = 149 bits (377), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 83/197 (42%), Positives = 118/197 (59%), Gaps = 12/197 (6%)
Query: 22 ITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKA 81
+ L+++ A+ VV+ + +Q E+ + HG++YK E+++R IF++ L I
Sbjct: 1 MKLIIAFAAFVVAINAASDQ---ELWADFKKAHGKTYKSLREEKLRFNIFQDTLREIAAH 57
Query: 82 N---KEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPT 138
N + G TY L NQFSD+T++EFRA+ PS + NL++ P
Sbjct: 58 NAKYESGESTYYLAINQFSDITDEEFRAMLMKNVESRPSLED-----MEIANLTVGAAPE 112
Query: 139 SLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNN 197
S+DWR +GAV PI+NQ++CG CWAF+AVAAVEG I+SG+ LS QQL+DCST GN+
Sbjct: 113 SIDWRTEGAVLPIRNQEDCGSCWAFSAVAAVEGQAAIKSGSKTPLSVQQLVDCSTEGGNS 172
Query: 198 GCLGGSREKAFAYIIQN 214
GC GG AF YI N
Sbjct: 173 GCNGGLMNGAFDYIKAN 189
Score = 71.6 bits (174), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 34/68 (50%), Positives = 47/68 (69%), Gaps = 4/68 (5%)
Query: 216 GIFNGVC--GTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE-GLCGI 272
GIFN + G LDH VT VG+GT ++G YW +KNSWG +WG+ GY+++ RD CGI
Sbjct: 255 GIFNNILCLGFGLDHGVTAVGYGT-DNGKKYWPVKNSWGESWGEEGYIRMARDTLHNCGI 313
Query: 273 GTRSSYPL 280
++SYP+
Sbjct: 314 NQQASYPI 321
>gi|402770517|gb|AFQ98393.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 149 bits (376), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 80/201 (39%), Positives = 120/201 (59%), Gaps = 7/201 (3%)
Query: 21 IITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEK 80
++ L V CA V+ ++ ++ + E + H ++Y+ +E+ +R KIF EN I K
Sbjct: 1 MLRLSVLCAIAAVTVAASSQEILRTQWEAFKTTHKKTYQSHMEELLRFKIFTENSLIIAK 60
Query: 81 ANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
N + G +YKLG NQF DL EF ++ G+ + ++ SS N++ + +P
Sbjct: 61 HNAKYAKGLVSYKLGMNQFGDLLAHEFARIFNGHH---GTRKTGGSSFLPPANVNDSSLP 117
Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GN 196
+DWR KGAVTP+K+Q +CG CWAF+A ++EG +++G L+ LSEQ L+DCS + GN
Sbjct: 118 KVVDWRKKGAVTPVKDQGQCGSCWAFSATGSLEGQHFLKNGELVSLSEQNLVDCSQSFGN 177
Query: 197 NGCLGGSREKAFAYIIQNQGI 217
NGC GG E AF YI N GI
Sbjct: 178 NGCEGGLMEDAFKYIKANDGI 198
Score = 67.4 bits (163), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 31/75 (41%), Positives = 51/75 (68%), Gaps = 4/75 (5%)
Query: 209 AYIIQNQGIFNGV-CGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD 266
++ + ++G+++ C ++ LDH V +VG+G + G YWL+KNSW +WGD GY+ + RD
Sbjct: 258 SFQLYSEGVYDEPECSSEDLDHGVLVVGYGV-KGGKKYWLVKNSWAESWGDQGYILMSRD 316
Query: 267 -EGLCGIGTRSSYPL 280
CGI +++SYPL
Sbjct: 317 NNNQCGIASQASYPL 331
>gi|402770515|gb|AFQ98392.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 149 bits (376), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 80/201 (39%), Positives = 120/201 (59%), Gaps = 7/201 (3%)
Query: 21 IITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEK 80
++ L V CA V+ ++ ++ + E + H ++Y+ +E+ +R KIF EN I K
Sbjct: 1 MLRLSVLCAIAAVTVAASSQEILRTQWEAFKTTHKKTYQSHMEELLRFKIFTENSLIIAK 60
Query: 81 ANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
N + G +YKLG NQF DL EF ++ G+ + ++ SS N++ + +P
Sbjct: 61 HNAKYAKGLVSYKLGMNQFGDLLAHEFARIFNGHH---GTRKTGGSSFLPPANVNDSSLP 117
Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GN 196
+DWR KGAVTP+K+Q +CG CWAF+A ++EG +++G L+ LSEQ L+DCS + GN
Sbjct: 118 KVVDWRKKGAVTPVKDQGQCGSCWAFSATGSLEGQHFLKNGELVSLSEQNLVDCSQSFGN 177
Query: 197 NGCLGGSREKAFAYIIQNQGI 217
NGC GG E AF YI N GI
Sbjct: 178 NGCEGGLMEDAFKYIKANDGI 198
Score = 67.4 bits (163), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 31/75 (41%), Positives = 51/75 (68%), Gaps = 4/75 (5%)
Query: 209 AYIIQNQGIFNGV-CGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD 266
++ + ++G+++ C ++ LDH V +VG+G + G YWL+KNSW +WGD GY+ + RD
Sbjct: 258 SFQLYSEGVYDEPECSSEDLDHGVLVVGYGV-KGGKKYWLVKNSWAESWGDQGYILMSRD 316
Query: 267 -EGLCGIGTRSSYPL 280
CGI +++SYPL
Sbjct: 317 NNNQCGIASQASYPL 331
>gi|343978787|gb|AEM76722.1| cathepsin L-like proteinase [Triatoma brasiliensis]
Length = 330
Score = 149 bits (376), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 82/199 (41%), Positives = 117/199 (58%), Gaps = 9/199 (4%)
Query: 23 TLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN 82
LLV+ A VS + E E + HG++YK++ E+ R KIF N + IE N
Sbjct: 3 VLLVAVAVIAVSCANRFYNINPEEWETFKVVHGKNYKNQFEEMFRRKIFMNNKKRIEAHN 62
Query: 83 ---KEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTS 139
++G +YK+ N F DL + E +AL G+KM +P+ + F S +P S
Sbjct: 63 AKYEQGEVSYKMKMNHFGDLMSHEIKALMNGFKM-TPNTKREGKIYFP----SNDKLPKS 117
Query: 140 LDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNG 198
+DWR KGAVTP+K+Q +CG CW+F+A ++EG ++ G L+ LSEQ L+DCS GNNG
Sbjct: 118 VDWRQKGAVTPVKDQGQCGSCWSFSATGSLEGQIFLKKGKLVSLSEQNLMDCSKEYGNNG 177
Query: 199 CLGGSREKAFAYIIQNQGI 217
C GG +KAF Y+ N+GI
Sbjct: 178 CEGGLMDKAFQYVSDNKGI 196
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 34/75 (45%), Positives = 53/75 (70%), Gaps = 4/75 (5%)
Query: 209 AYIIQNQGIFN-GVCGT-QLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD 266
++ ++G++N C + LDH V VG+GT E+G +YWL+KNSWG +WG++GY+KI R+
Sbjct: 256 SFHFYSEGVYNEPYCSSYDLDHGVLAVGYGT-ENGQDYWLVKNSWGPSWGESGYIKIARN 314
Query: 267 EG-LCGIGTRSSYPL 280
CGI + +SYP+
Sbjct: 315 HSNHCGIASMASYPI 329
>gi|351726339|ref|NP_001237379.1| cysteine proteinase precursor [Glycine max]
gi|31559526|dbj|BAC77521.1| cysteine proteinase [Glycine max]
gi|31559528|dbj|BAC77522.1| cysteine proteinase [Glycine max]
Length = 362
Score = 149 bits (376), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 80/185 (43%), Positives = 112/185 (60%), Gaps = 15/185 (8%)
Query: 40 EQSVVEIHEKWMAQH--GRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
E+S +++E+W + H RS D K R +FK N+ ++ NK ++ YKL N+F+
Sbjct: 33 EESFWDLYERWRSHHTVSRSLGD---KHKRFNVFKANVMHVHNTNKM-DKPYKLKLNKFA 88
Query: 98 DLTNDEFRALYTGYKMPSPSHR-----STTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIK 152
D+TN EFR+ Y G K+ HR + TF Y+ + VP S+DWR GAVT +K
Sbjct: 89 DMTNHEFRSTYAGSKVNH--HRMFQGTPRGNGTFMYEKVG--SVPPSVDWRKNGAVTGVK 144
Query: 153 NQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYII 212
+Q +CG CWAF+ V AVEGI +I++ L+ LSEQ+L+DC T N GC GG E AF +I
Sbjct: 145 DQGQCGSCWAFSTVVAVEGINQIKTNKLVSLSEQELVDCDTKKNAGCNGGLMESAFEFIK 204
Query: 213 QNQGI 217
Q GI
Sbjct: 205 QKGGI 209
Score = 89.4 bits (220), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 37/70 (52%), Positives = 50/70 (71%), Gaps = 4/70 (5%)
Query: 215 QGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLC 270
+G+F G C T+L+H V IVG+GTT DG NYW ++NSWG WG+ GY+++ R EGLC
Sbjct: 275 EGVFTGDCSTELNHGVAIVGYGTTVDGTNYWTVRNSWGPEWGEQGYIRMQRSISKKEGLC 334
Query: 271 GIGTRSSYPL 280
GI +SYP+
Sbjct: 335 GIAMMASYPI 344
>gi|330376140|gb|AEC13302.1| cathepsin H [Gallus gallus]
Length = 329
Score = 149 bits (376), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 100/308 (32%), Positives = 136/308 (44%), Gaps = 80/308 (25%)
Query: 45 EIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEF 104
++ + WM QHGR Y E E RL++F N +IE N GN ++++ NQFSD+T EF
Sbjct: 27 QLFKAWMLQHGRRYGAG-EYERRLRVFVGNKRHIEGHNA-GNSSFQMALNQFSDMTFAEF 84
Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGA-VTPIKNQKECGCCWAF 163
+ LY P + S T F S P ++DWR KG VTP+KNQ CG CW F
Sbjct: 85 KKLYL---WSEPQNCSATRGNFL---RSDGPCPEAVDWRKKGNFVTPVKNQGPCGSCWTF 138
Query: 164 AAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNN-GCLGGSREKAFAYIIQNQGIF---- 218
+ +E I +G L+ L+EQ L+DC+ NN GC GG +AF YI+ N+G+
Sbjct: 139 STTGCLESAIAIATGKLLSLAEQLLVDCAQAFNNHGCSGGLPSQAFEYILYNKGLMGEDA 198
Query: 219 ------NGVCGTQLDHAVTIV--------------------------GFGTTEDGANY-- 244
NG C Q D A+ V F T D +Y
Sbjct: 199 YPYRAQNGTCKFQPDKAIAFVKDVINITQYDEAGMVEAVGKHNPVSFAFEVTSDFMHYRK 258
Query: 245 --------------------------------WLIKNSWGNTWGDAGYMKIVRDEGLCGI 272
W++KNSWG WG GY I R + +CG+
Sbjct: 259 GVYSNPRCEHTPDKVNHAVLAVGYGEEDGRPYWIVKNSWGPLWGMDGYFLIERGKNMCGL 318
Query: 273 GTRSSYPL 280
+SYP+
Sbjct: 319 AACASYPV 326
>gi|115715524|ref|XP_780580.2| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
Length = 334
Score = 149 bits (375), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 84/203 (41%), Positives = 125/203 (61%), Gaps = 10/203 (4%)
Query: 19 FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
++ + L+ +C VVSS S E +W +HG+ Y + E+ R I+++NL+ +
Sbjct: 3 YLSVLLVAAC---VVSSLSMSFIDFDEDWNQWKNEHGKRYLSDEEEASRRLIWQKNLDIV 59
Query: 79 EKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD 135
K N + G+ TY LG NQF+DL N+EF +L G++ S ++T STF + ++ D
Sbjct: 60 IKHNLKYDLGHFTYDLGMNQFADLKNEEFVSLMNGFR--GNSSKATRGSTFLPPS-NVFD 116
Query: 136 VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-TN 194
+PT +DWR KG VTP+KNQ +CG CWAF+A ++EG ++G L+ LSEQ L+DCS
Sbjct: 117 MPTMVDWRTKGYVTPVKNQLQCGSCWAFSATGSLEGQHFKKTGKLVSLSEQNLVDCSGKE 176
Query: 195 GNNGCLGGSREKAFAYIIQNQGI 217
GN GC GG ++AF YI+ GI
Sbjct: 177 GNMGCEGGLMDQAFQYILDVGGI 199
Score = 64.3 bits (155), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 30/75 (40%), Positives = 46/75 (61%), Gaps = 3/75 (4%)
Query: 209 AYIIQNQGIFN--GVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD 266
++ + G++N T LDH V VG+GT+ DG +Y+ +SWG WG GY+ + R+
Sbjct: 259 SFQLYKSGVYNEPACSSTLLDHGVLAVGYGTSSDGTDYFFFFHSWGAAWGMNGYLWMSRN 318
Query: 267 -EGLCGIGTRSSYPL 280
+ CGI T++SYPL
Sbjct: 319 KDNQCGIATKASYPL 333
>gi|357156854|ref|XP_003577598.1| PREDICTED: thiol protease SEN102-like [Brachypodium distachyon]
Length = 368
Score = 149 bits (375), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 75/191 (39%), Positives = 120/191 (62%), Gaps = 16/191 (8%)
Query: 40 EQSVVEIHEKWMAQH-------GRSYKDEL---EKEMRLKIFKENLEYIEKANKEGNRTY 89
E+S+ ++E+W +++ G + +L + R +FKEN++YI +ANK+ +R +
Sbjct: 31 EESLRGLYERWRSRYTVSPSTPGSGLRGKLADHDPARRFNVFKENVKYIHEANKK-DRPF 89
Query: 90 KLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD---VPTSLDWRDKG 146
+L N+F+D+T DE R Y G ++ HR+ + N + +D +P ++DWR+KG
Sbjct: 90 RLALNKFADMTTDELRHSYAGSRVRH--HRALSGGRRAQGNFTYSDAENLPPAVDWREKG 147
Query: 147 AVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREK 206
AVT IK+Q +CG CWAF+ +AAVE I KIR+G L+ LSEQ+L+DC + GC GG +
Sbjct: 148 AVTGIKDQGQCGSCWAFSTIAAVESINKIRTGKLVSLSEQELMDCDNVNDQGCDGGLMDY 207
Query: 207 AFAYIIQNQGI 217
AF +I +N G+
Sbjct: 208 AFQFIQKNGGV 218
Score = 87.8 bits (216), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 36/71 (50%), Positives = 49/71 (69%), Gaps = 4/71 (5%)
Query: 214 NQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGL 269
++G+F G C T LDH V VG+GT DG YW++KNSWG WG+ GY+++ R EGL
Sbjct: 283 SEGVFTGQCTTDLDHGVAAVGYGTARDGTKYWIVKNSWGLDWGEKGYIRMQRGVSQAEGL 342
Query: 270 CGIGTRSSYPL 280
CGI ++SYP+
Sbjct: 343 CGIAMQASYPI 353
>gi|293342577|ref|XP_001065834.2| PREDICTED: cathepsin L1 [Rattus norvegicus]
gi|293354413|ref|XP_573976.3| PREDICTED: cathepsin L1 [Rattus norvegicus]
gi|149039745|gb|EDL93861.1| rCG24317, isoform CRA_a [Rattus norvegicus]
Length = 330
Score = 149 bits (375), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 85/235 (36%), Positives = 127/235 (54%), Gaps = 26/235 (11%)
Query: 16 TPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENL 75
TP+F++ TL + ++S+ TH+ S + E+W +HG++Y E + R +++ N+
Sbjct: 2 TPIFLLATLCLG----MISAAPTHDPSFDTVWEEWKTKHGKTYNTNEEGQKR-AVWENNM 56
Query: 76 EYIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
+ I N++ G + L N F DLTN EFR L TG++ F
Sbjct: 57 KMINLHNEDYLKGKHGFSLEMNAFGDLTNTEFRELMTGFQGQKTKMMKVFPEPF------ 110
Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
+ DVP ++DWR G VTP+KNQ CG CWAF+AV ++EG ++G L+ LSEQ L+DCS
Sbjct: 111 LGDVPKTVDWRKHGYVTPVKNQGPCGSCWAFSAVGSLEGQVFRKTGKLVPLSEQNLVDCS 170
Query: 193 -TNGNNGCLGGSREKAFAYIIQNQGI----------FNGVCGTQLDH-AVTIVGF 235
++GN GC GG + AF Y+ N G+ NG C + A +VGF
Sbjct: 171 WSHGNKGCDGGLPDFAFQYVKDNGGLDTSVSYPYEALNGTCRYNPKYSAAKVVGF 225
>gi|302845628|ref|XP_002954352.1| hypothetical protein VOLCADRAFT_76255 [Volvox carteri f.
nagariensis]
gi|300260282|gb|EFJ44502.1| hypothetical protein VOLCADRAFT_76255 [Volvox carteri f.
nagariensis]
Length = 489
Score = 149 bits (375), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 85/213 (39%), Positives = 125/213 (58%), Gaps = 16/213 (7%)
Query: 19 FIIITLLVSCASQVVSS-----RSTHEQSVVEIH-------EKWMAQHGRSYKDEL-EKE 65
F+I LLV+ + V ++ R HE+ +++ ++WM Q+ ++Y +++ E E
Sbjct: 5 FLIAALLVAASGGVGAAPELQLREQHEKLLLDAKANPMAAFQQWMMQYTKAYANDIKELE 64
Query: 66 MRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFR-ALYTGYKMPSPSHRSTTSS 124
R ++ ENL YI N ++ L N F+DLT DEFR L +K S+R SS
Sbjct: 65 TRFSVWLENLNYILAYNAR-TTSHWLHLNAFADLTTDEFRNRLGYDFKARQASNR-LQSS 122
Query: 125 TFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLS 184
F Y N+ +PT +DWR KGAVT +KNQ +CG CWAFA +VEGI I +G L LS
Sbjct: 123 PFIYDNVDANQLPTEIDWRKKGAVTEVKNQGQCGSCWAFATTGSVEGINAIVTGELASLS 182
Query: 185 EQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
EQ+L+DC T+ + GC GG + A+ +II+N G+
Sbjct: 183 EQELVDCDTDEDRGCSGGLMDYAYQWIIKNGGL 215
Score = 68.6 bits (166), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 31/78 (39%), Positives = 48/78 (61%), Gaps = 5/78 (6%)
Query: 207 AFAYIIQNQGIFNG-VCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKI-- 263
A ++ + G+++ CGT L+H V +VG+G NYW++KNSWG WGD GY+++
Sbjct: 273 AKSFQLYGGGVYDDPTCGTSLNHGVLVVGYGKDPHFGNYWIVKNSWGPEWGDNGYIRLRM 332
Query: 264 -VRD-EGLCGIGTRSSYP 279
D +G+CGI S+P
Sbjct: 333 GAEDVQGMCGIAMAPSFP 350
>gi|148927396|gb|ABR19829.1| cysteine proteinase [Elaeis guineensis]
Length = 358
Score = 149 bits (375), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 102/306 (33%), Positives = 140/306 (45%), Gaps = 84/306 (27%)
Query: 49 KWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALY 108
++ ++G+ Y+ E ++R IF ENLE I N+ G YKLG N+++D++ +EFRA
Sbjct: 60 RFAHRYGKRYQSVEEMKLRFAIFMENLELIRSTNRRG-LPYKLGINRYADMSWEEFRASR 118
Query: 109 TGYKMPSPSHRSTTSSTFKYQNLSMTD--VPTSLDWRDKGAVTPIKNQKECGCCWAFAAV 166
G S+T K N MTD +P + DWR+ G V+P+K+Q CG CW F+
Sbjct: 119 LGAA-------QNCSATLK-GNHKMTDELLPKTKDWREDGIVSPVKDQGSCGSCWTFSTT 170
Query: 167 AAVEGITKIRSGNLIQLSEQQLLDCSTNGNN-GCLGGSREKAFAYIIQNQGI-------- 217
A+E +G I LSEQQL+DC+ NN GC GG +AF YI N G+
Sbjct: 171 GALEAAYTQATGKGISLSEQQLVDCAYAFNNFGCNGGLPSQAFEYIKYNGGLDTEESYPY 230
Query: 218 --FNGVCG---------------------TQLDHAVTIV--------------------- 233
NG C +L HAV +V
Sbjct: 231 AGVNGFCHFKPENVGVKVVESVNITLGAEDELLHAVGLVRPVSIAFEVVSGFRFYKGGVY 290
Query: 234 -------------------GFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDEGLCGIGT 274
G+G E+G YWLIKNSWG WG GY K+ + +CGI T
Sbjct: 291 TSDTCGRTQMDVNHAVLAVGYG-VENGVPYWLIKNSWGEEWGVDGYFKMELGKNMCGIAT 349
Query: 275 RSSYPL 280
+SYP+
Sbjct: 350 CASYPI 355
>gi|402770511|gb|AFQ98390.1| cathepsin L [Rhipicephalus microplus]
gi|402770513|gb|AFQ98391.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 149 bits (375), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 80/201 (39%), Positives = 120/201 (59%), Gaps = 7/201 (3%)
Query: 21 IITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEK 80
++ L V CA V+ ++ ++ + E + H ++Y+ +E+ +R KIF EN I K
Sbjct: 1 MLRLSVLCAIVAVTVAASSQEILRTQWEAFKTTHKKTYQSHMEELLRFKIFTENSLIIAK 60
Query: 81 ANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
N + G +YKLG NQF DL EF ++ G+ + ++ SS N++ + +P
Sbjct: 61 HNAKYAKGLVSYKLGMNQFGDLLAHEFARIFNGHH---GTRKTGGSSFLPPANVNDSSLP 117
Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GN 196
+DWR KGAVTP+K+Q +CG CWAF+A ++EG +++G L+ LSEQ L+DCS + GN
Sbjct: 118 KVVDWRKKGAVTPVKDQGQCGSCWAFSATGSLEGQHFLKNGELVSLSEQNLVDCSQSFGN 177
Query: 197 NGCLGGSREKAFAYIIQNQGI 217
NGC GG E AF YI N GI
Sbjct: 178 NGCEGGLMEDAFKYIKANDGI 198
Score = 67.4 bits (163), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 31/75 (41%), Positives = 51/75 (68%), Gaps = 4/75 (5%)
Query: 209 AYIIQNQGIFNGV-CGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD 266
++ + ++G+++ C ++ LDH V +VG+G + G YWL+KNSW +WGD GY+ + RD
Sbjct: 258 SFQLYSEGVYDEPECSSEDLDHGVLVVGYGV-KGGKKYWLVKNSWAESWGDQGYILMSRD 316
Query: 267 -EGLCGIGTRSSYPL 280
CGI +++SYPL
Sbjct: 317 NNNQCGIASQASYPL 331
>gi|444519959|gb|ELV12909.1| Cathepsin L1 [Tupaia chinensis]
Length = 333
Score = 148 bits (374), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 78/190 (41%), Positives = 116/190 (61%), Gaps = 11/190 (5%)
Query: 32 VVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRT 88
+ S+ TH+QS+ E +W A+HG+ Y E+ +R ++++NL+ IE+ N E G T
Sbjct: 14 IASAAPTHDQSLDEQWNQWTAEHGKVYSTG-EESLRRAVWEKNLKMIEQHNLEYSQGKHT 72
Query: 89 YKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAV 148
+ +G N F D+TN++FR + TG++ + + F Q +VP S+DWR+KG V
Sbjct: 73 FTMGMNAFGDMTNEDFRQMMTGFQ----NQKYNKGEVF--QPPQPLEVPESVDWREKGYV 126
Query: 149 TPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-TNGNNGCLGGSREKA 207
TP+KNQ CG CWAF+A A+EG ++G L+ LSEQ L+DCS N+GC GG KA
Sbjct: 127 TPVKNQHRCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSQPQHNSGCKGGLVIKA 186
Query: 208 FAYIIQNQGI 217
F Y+ N G+
Sbjct: 187 FQYVKDNGGL 196
Score = 63.5 bits (153), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 27/59 (45%), Positives = 41/59 (69%), Gaps = 4/59 (6%)
Query: 226 LDHAVTIVGFGTTEDGAN---YWLIKNSWGNTWGDAGYMKIVRDE-GLCGIGTRSSYPL 280
L+HAV +VG+G ++G+N YWL+KNSWG WG GY+ + +D+ CGI + + YP+
Sbjct: 274 LNHAVLVVGYGVMQEGSNNNTYWLVKNSWGERWGVGGYIMMAKDKNNHCGIASDALYPI 332
>gi|224102377|ref|XP_002312656.1| predicted protein [Populus trichocarpa]
gi|222852476|gb|EEE90023.1| predicted protein [Populus trichocarpa]
Length = 358
Score = 148 bits (374), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 81/182 (44%), Positives = 115/182 (63%), Gaps = 11/182 (6%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
E+ + +++E+W + H S + EK+ R +FKENL++I K N + +R YKL N F+D+
Sbjct: 33 EERLRDLYERWRSHHTVS-RSLAEKQERFNVFKENLKHIHKVNHK-DRPYKLKLNSFADM 90
Query: 100 TNDEFRALYTGYKMPS----PSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQK 155
TN EF Y G K+ R T S + + +P+S+DWR GAVT IK+Q
Sbjct: 91 TNHEFLQHYGGSKVSHYRVLRGQRQGTGSMHE----DTSKLPSSVDWRKNGAVTGIKDQG 146
Query: 156 ECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQ 215
+CG CWAF+ VAAVEGI KI++G LI LSEQ+L+DC ++ N+GC GG E AF +I Q
Sbjct: 147 KCGSCWAFSTVAAVEGINKIKTGELISLSEQELVDCDSD-NHGCNGGLMEDAFNFIKQIG 205
Query: 216 GI 217
G+
Sbjct: 206 GL 207
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 37/71 (52%), Positives = 53/71 (74%), Gaps = 4/71 (5%)
Query: 214 NQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR----DEGL 269
++ IF G CGT+L+H V +VG+GTT+DG YW++KNSWG WG+ GY+++ R +EGL
Sbjct: 272 SEAIFTGDCGTELNHGVALVGYGTTQDGTKYWIVKNSWGTDWGEKGYIRMQRGIDAEEGL 331
Query: 270 CGIGTRSSYPL 280
CGI +SYP+
Sbjct: 332 CGITMEASYPV 342
>gi|301609086|ref|XP_002934107.1| PREDICTED: cathepsin S-like [Xenopus (Silurana) tropicalis]
Length = 332
Score = 148 bits (374), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 101/326 (30%), Positives = 155/326 (47%), Gaps = 74/326 (22%)
Query: 25 LVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE 84
LV+ S ++ + S + ++ + W+ H ++YKD E+ R I++E L++I N E
Sbjct: 8 LVALLSLLIPAHSAPDPTLDTHWQLWVKTHQKTYKDTEEERTRRTIWEETLKFITVHNLE 67
Query: 85 ---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLD 141
G TY++G N+ D+T +E A TGY + S S + + + + P S+D
Sbjct: 68 YSLGLHTYEVGMNRLGDMTGEEVAATMTGY---TGSDHSLANMSHVPKEILEALPPASID 124
Query: 142 WRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-TNGNNGCL 200
WR KG VTP++ Q +CG C+AF+ V A+E K ++G L+ LS Q+L+DCS T GNNGC+
Sbjct: 125 WRTKGCVTPVRKQGDCGSCYAFSTVGAMECQVKKKTGRLVILSPQELVDCSYTEGNNGCI 184
Query: 201 GGSREKAFAYI---------------------------IQNQGIFNGVCGTQ--LDHAVT 231
G+ AF Y+ ++ I+N G + L +AV
Sbjct: 185 NGNLLSAFRYMEKYGIMEESSYRYTGQEADCRRRVHTGLKVNAIYNVTDGDENVLKNAVG 244
Query: 232 IV------------GFGTTEDGA-------------------------NYWLIKNSWGNT 254
V GF T + G +YWL+KNSWG
Sbjct: 245 TVGPVSVAVDARQNGFRTYKSGVYFDPYCTLHVTHAVLVVGYGREYGNDYWLVKNSWGVD 304
Query: 255 WGDAGYMKIVRDE-GLCGIGTRSSYP 279
WGD GY+K+ R+ CGI ++ YP
Sbjct: 305 WGDQGYVKMARNRNSHCGIANQAFYP 330
>gi|326520387|dbj|BAK07452.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 349
Score = 148 bits (374), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 88/241 (36%), Positives = 127/241 (52%), Gaps = 22/241 (9%)
Query: 14 NTTPMFIIITLLVSCASQVVSSRS-----THEQSVVEIHEKWMAQHGRSYKDELEKEMRL 68
T P+ I+ LL + S + +++ +W A H RSY E+ R
Sbjct: 7 GTRPVIPILVLLTGGLFAAFPAASGGRVDAGDMLMMDRFRQWQATHNRSYLSAEERLRRF 66
Query: 69 KIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALY----TGYKMPSPSHRSTTSS 124
++++ N+EYI+ N+ G TY+LG NQF+DLT +EF A Y TG + + + S
Sbjct: 67 EVYRTNVEYIDATNRRGGLTYELGENQFADLTGEEFLARYAGGHTGSAITTAAEADGLWS 126
Query: 125 TFKYQNLSMTDVPTSLDWRDKGAVTPIKNQ-KECGCCWAFAAVAAVEGITKIRSGNLIQL 183
+ D P S+DWR KGAVTP+KNQ +C CWAF+AVA +E + I++G L+ L
Sbjct: 127 SGGSDGSLEADPPASVDWRAKGAVTPVKNQGSQCYSCWAFSAVATMESLYFIKTGKLVAL 186
Query: 184 SEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI----------FNGVCGTQLDHAVTIV 233
SEQQL+DC + GC G +AF +I++N GI G C + AVTI
Sbjct: 187 SEQQLVDCDKY-DGGCNKGYYHRAFQWIMENGGITTAAQYPYKAVRGAC-SAAKPAVTIT 244
Query: 234 G 234
G
Sbjct: 245 G 245
Score = 85.1 bits (209), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 35/67 (52%), Positives = 46/67 (68%), Gaps = 3/67 (4%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD---EGLCGI 272
G+F+ CG Q+ HAV VG+G G YWL+KNSWG TWG+AGY+++ RD GLCGI
Sbjct: 281 GVFSAACGIQMSHAVVTVGYGADASGLKYWLVKNSWGQTWGEAGYIRMRRDVGGGGLCGI 340
Query: 273 GTRSSYP 279
++YP
Sbjct: 341 ALDTAYP 347
>gi|330793420|ref|XP_003284782.1| hypothetical protein DICPUDRAFT_28222 [Dictyostelium purpureum]
gi|325085276|gb|EGC38686.1| hypothetical protein DICPUDRAFT_28222 [Dictyostelium purpureum]
Length = 347
Score = 148 bits (374), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 83/189 (43%), Positives = 110/189 (58%), Gaps = 14/189 (7%)
Query: 35 SRSTHEQSVVEIHEK-----WMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTY 89
S +T +Q E+ + WM Q+ R Y E E R IFK N++Y+++ N +G+ T
Sbjct: 13 SFATAKQQFSELQYRNAFTNWMIQNQRHYASE-EFAARYNIFKANMDYVQEWNSKGSETV 71
Query: 90 KLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVT 149
LG N F+D+TN EFR++Y G S +T + S+DWR KGAVT
Sbjct: 72 -LGLNTFADITNQEFRSIYLGTPFDGSSIINTETEKI------FAAPAASIDWRTKGAVT 124
Query: 150 PIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-TNGNNGCLGGSREKAF 208
PIKNQ++CG CW+F+ + EG T I GNL LSEQ L+DCS + GNNGC GG AF
Sbjct: 125 PIKNQQQCGGCWSFSTTGSTEGATAIAKGNLPSLSEQNLIDCSGSYGNNGCNGGLMTLAF 184
Query: 209 AYIIQNQGI 217
YII N+GI
Sbjct: 185 EYIINNKGI 193
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 28/76 (36%), Positives = 40/76 (52%), Gaps = 20/76 (26%)
Query: 224 TQLDHAVTIVGFGTTE----------------DGA---NYWLIKNSWGNTWGDAGYMKIV 264
T LDH V +VG+ + +GA NYW++KNSWG +WG GY+ +
Sbjct: 270 TSLDHGVLVVGYASGSGSGSGSGSGSGSGLAVEGASSGNYWIVKNSWGTSWGIEGYILMS 329
Query: 265 RDE-GLCGIGTRSSYP 279
+D CGI T +S+P
Sbjct: 330 KDRNNNCGIATMASFP 345
>gi|225431287|ref|XP_002275759.1| PREDICTED: cysteine proteinase RD19a isoform 1 [Vitis vinifera]
gi|297735094|emb|CBI17456.3| unnamed protein product [Vitis vinifera]
Length = 367
Score = 148 bits (374), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 107/357 (29%), Positives = 155/357 (43%), Gaps = 106/357 (29%)
Query: 18 MFIIITLLVSCASQVVSSRSTH----EQSVVEIHEKWMAQH---------GRSYKDELEK 64
+F+I TLL S A +SS + Q V E + A+H G++Y E
Sbjct: 10 LFLIPTLLFSAAVSDISSDESDDLLIRQVVPEGDDLLSAEHQFGLFKAKFGKTYSTVEEH 69
Query: 65 EMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYK---MPSPSHRST 121
+ R +F+ NL + ++ + + G +FSDLT DEFR Y G K +P+ + ++
Sbjct: 70 DYRFSVFEANLRRARR-HQLLDPSAVHGVTRFSDLTPDEFRRDYLGLKPLRLPADAQKAP 128
Query: 122 TSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLI 181
L D+PT DWRD GAVTP+K+Q CG CW+F+A+ A+EG + +GNLI
Sbjct: 129 I--------LPTNDLPTDFDWRDHGAVTPVKDQGSCGSCWSFSAIGALEGAHFLTTGNLI 180
Query: 182 QLSEQQLLDCSTNGN--------NGCLGG---------------SREKAFAYIIQNQG-- 216
+SEQQL+DC + GC GG RE+ + YI ++G
Sbjct: 181 SMSEQQLVDCDHECDPEEYGACDQGCNGGLMTSAFEYILKAGGVEREETYPYIGSDRGSC 240
Query: 217 ----------------------------IFNG----------------------VCGTQL 226
+ NG +C L
Sbjct: 241 KFNKSQIVASVSNFSVVSLDEDQIAANMVKNGPLAVGINAVFMQTYMKGVSCPYICSRNL 300
Query: 227 DHAVTIVGFGTTEDGA------NYWLIKNSWGNTWGDAGYMKIVRDEGLCGIGTRSS 277
DH V +VG+G+ YW+IKNSWG +WG+ GY KI R CG+ + S
Sbjct: 301 DHGVVLVGYGSAGYAPIRFKEKPYWIIKNSWGESWGEDGYYKICRGHNACGVDSMVS 357
>gi|402770509|gb|AFQ98389.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 148 bits (373), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 81/202 (40%), Positives = 121/202 (59%), Gaps = 9/202 (4%)
Query: 21 IITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEK 80
++ L V CA V+ ++ ++ + E + H ++Y+ +E+ +R KIF E+ I +
Sbjct: 1 MLRLSVLCAIAAVTVAASSQEILRTQWEAFKTTHKKTYQSHMEELLRFKIFTESSLIIAR 60
Query: 81 ANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTF-KYQNLSMTDV 136
N + G +YKLG NQF DL EF ++ G+ R T STF N++ + +
Sbjct: 61 HNAKYAKGLVSYKLGMNQFGDLLAHEFARIFNGHH----GTRKTGGSTFLPPANVNDSSL 116
Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-G 195
P ++DWR KGAVTP+K+Q +CG CWAF+A ++EG +++G L+ LSEQ L+DCS + G
Sbjct: 117 PKAVDWRKKGAVTPVKDQGQCGSCWAFSATGSLEGQHFLKNGELVSLSEQNLVDCSQSFG 176
Query: 196 NNGCLGGSREKAFAYIIQNQGI 217
NNGC GG E AF YI N GI
Sbjct: 177 NNGCEGGLMEDAFKYIKANDGI 198
Score = 67.4 bits (163), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 31/75 (41%), Positives = 51/75 (68%), Gaps = 4/75 (5%)
Query: 209 AYIIQNQGIFNGV-CGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD 266
++ + ++G+++ C ++ LDH V +VG+G + G YWL+KNSW +WGD GY+ + RD
Sbjct: 258 SFQLYSEGVYDEPECSSEDLDHGVLVVGYGV-KGGKKYWLVKNSWAESWGDQGYILMSRD 316
Query: 267 -EGLCGIGTRSSYPL 280
CGI +++SYPL
Sbjct: 317 NNNQCGIASQASYPL 331
>gi|359806140|ref|NP_001241450.1| uncharacterized protein LOC100778716 precursor [Glycine max]
gi|255639509|gb|ACU20049.1| unknown [Glycine max]
Length = 366
Score = 148 bits (373), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 101/332 (30%), Positives = 147/332 (44%), Gaps = 94/332 (28%)
Query: 31 QVVSSRSTHEQSVVEIH-EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTY 89
QVV H E H + + G++Y + E + R +IFK NL K++++ + +
Sbjct: 34 QVVPDAEDHHLLNAEHHFSAFKTKFGKTYATQEEHDHRFRIFKNNL-LRAKSHQKLDPSA 92
Query: 90 KLGTNQFSDLTNDEFRALYTGYK---MPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKG 146
G +FSDLT EFR + G K +PS + ++ L D+PT DWR+ G
Sbjct: 93 VHGVTRFSDLTPAEFRRQFLGLKPLRLPSDAQKAPI--------LPTNDLPTDFDWREHG 144
Query: 147 AVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGN--------NG 198
AVT +KNQ CG CW+F+AV A+EG + +G L+ LSEQQL+DC + +G
Sbjct: 145 AVTGVKNQGSCGSCWSFSAVGALEGAHFLSTGELVSLSEQQLVDCDHECDPEERGACDSG 204
Query: 199 CLGG---------------SREKAFAYIIQNQG--------------------------- 216
C GG REK + Y +++G
Sbjct: 205 CNGGLMTTAFEYTLQAGGLMREKDYPYTGRDRGPCKFDKSKVAASVANFSVVSLDEEQIA 264
Query: 217 ---IFNG----------------------VCGTQLDHAVTIVGFGTTEDGA------NYW 245
+ NG +CG LDH V +VG+G+ YW
Sbjct: 265 ANLVQNGPLAVGINAVFMQTYIGGVSCPYICGKHLDHGVLLVGYGSGAYAPIRFKEKPYW 324
Query: 246 LIKNSWGNTWGDAGYMKIVRDEGLCGIGTRSS 277
+IKNSWG +WG+ GY KI R +CG+ + S
Sbjct: 325 IIKNSWGESWGEEGYYKICRGRNVCGVDSMVS 356
>gi|320169652|gb|EFW46551.1| cathepsin L2 [Capsaspora owczarzaki ATCC 30864]
Length = 325
Score = 148 bits (373), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 75/171 (43%), Positives = 103/171 (60%), Gaps = 5/171 (2%)
Query: 49 KWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALY 108
+W A H R Y E+ +R +I+ NLE I + N G +Y LG N+F DL + EF A Y
Sbjct: 23 EWKALHNRQYASAQEEALRQEIYLSNLELINEHNAAGRHSYTLGMNEFGDLAHHEFAAKY 82
Query: 109 TGYKMPSP-SHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVA 167
G + + +S SST+ + M +P S+DWR G VTP+KNQ +CG CW+F+
Sbjct: 83 LGVRFNGVNATKSFASSTYLPR---MVSLPDSVDWRTAGIVTPVKNQGQCGSCWSFSTTG 139
Query: 168 AVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGI 217
+VEG ++G L+ LSEQ L+DCS+ GN GC GG + AF YII+N GI
Sbjct: 140 SVEGQHARKTGTLVSLSEQNLVDCSSQEGNEGCNGGLMDDAFEYIIKNGGI 190
Score = 78.2 bits (191), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 34/58 (58%), Positives = 44/58 (75%), Gaps = 1/58 (1%)
Query: 224 TQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 280
TQLDH V VG+GT+ +G +YWL+KNSWG TWG AGY+ + R+ + CGI T +SYPL
Sbjct: 267 TQLDHGVLAVGYGTSTEGKDYWLVKNSWGATWGKAGYIWMSRNADNQCGIATSASYPL 324
>gi|218202077|gb|EEC84504.1| hypothetical protein OsI_31195 [Oryza sativa Indica Group]
Length = 362
Score = 148 bits (373), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 87/258 (33%), Positives = 130/258 (50%), Gaps = 37/258 (14%)
Query: 11 FKINTTPMFIIITLLVSC----ASQVVSSRSTH-------EQSVVEIHEKWMAQHGRSYK 59
F +P + + LL SC A+ ++ +R+T + +++ W H RSY
Sbjct: 4 FMACASPPVLTLALLASCGALLATSMLPARATAGSCLDVGDMVMMDRFRAWQGAHNRSYP 63
Query: 60 DELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKM---PSP 116
E R +++ N E+I+ N G+ TY+L N+F+DLT +EF A YTGY P
Sbjct: 64 SAEEALQRFDVYRRNAEFIDAVNLRGDLTYRLAENEFADLTEEEFLATYTGYYAGDGPVD 123
Query: 117 SHRSTT-----SSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKE-CGCCWAFAAVAAVE 170
TT ++F Y+ DVP S+DWR +GAV P K+Q C CWAF A +E
Sbjct: 124 DSVITTGAGDVDASFSYR----VDVPASVDWRAQGAVVPPKSQTSTCSSCWAFVTAATIE 179
Query: 171 GITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIFN----------G 220
+ I++G L+ LSEQQL+DC + + GC GS +A+ ++++N G+ G
Sbjct: 180 SLNMIKTGKLVSLSEQQLVDCDSY-DGGCNLGSYGRAYKWVVENGGLTTEADYPYTARRG 238
Query: 221 VC--GTQLDHAVTIVGFG 236
C HA I GFG
Sbjct: 239 PCNRAKSAHHAAKITGFG 256
Score = 84.3 bits (207), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 37/68 (54%), Positives = 49/68 (72%), Gaps = 4/68 (5%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGT-TEDGANYWLIKNSWGNTWGDAGYMKIVRD---EGLCG 271
G++ G CGT+L HAVT+VG+GT GA YW IKNSWG +WG+ GY++I+RD GLCG
Sbjct: 291 GVYTGPCGTRLAHAVTVVGYGTDASSGAKYWTIKNSWGQSWGERGYIRILRDVGGPGLCG 350
Query: 272 IGTRSSYP 279
+ +YP
Sbjct: 351 VTLDIAYP 358
>gi|116794072|gb|ABK26996.1| unknown [Picea sitchensis]
Length = 367
Score = 148 bits (373), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 83/210 (39%), Positives = 117/210 (55%), Gaps = 15/210 (7%)
Query: 22 ITLLVSCASQVVSSRSTHE----QSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
I LVS A V S + +V + ++W+ +HG+ Y EK RL+IF+ NL+Y
Sbjct: 14 IICLVSAAKAVQHSYEVGDINSGNGLVRLFDRWLGRHGKLYGSHEEKARRLQIFRTNLQY 73
Query: 78 IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGY--KMPSPSHRSTTSSTFKYQNLSMT- 134
I NK N +++LG N+F+DLTN+EF+ Y G K R+ L T
Sbjct: 74 IHAHNKNSNSSFRLGLNKFADLTNEEFKTRYFGKNSKQWRDRRRTELEGAELRPVLKQTV 133
Query: 135 -------DVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQ 187
+ +SLDWR KGAVT +K+Q +CG CWAF+ A+EG+ I +G L+ LSEQ+
Sbjct: 134 GSQSSSCSIASSLDWRKKGAVTGVKDQAQCGSCWAFSTTGAIEGVNFISTGKLVSLSEQE 193
Query: 188 LLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
L+ C N GC GG + AF ++IQN GI
Sbjct: 194 LVACDAT-NYGCEGGDMDYAFTWVIQNGGI 222
Score = 64.7 bits (156), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 31/80 (38%), Positives = 48/80 (60%), Gaps = 8/80 (10%)
Query: 207 AFAYIIQNQGIFNGVCG---TQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKI 263
A + + GI++G C +DHAV +VG+ + ++G +YW++KNSWG WG GY I
Sbjct: 279 AIDFQLYTGGIYDGDCSGNPDDIDHAVLVVGY-SAKNGKDYWIVKNSWGTDWGLEGYFYI 337
Query: 264 VRDE----GLCGIGTRSSYP 279
+R+ G+C I +SYP
Sbjct: 338 LRNTELPYGVCAINAMASYP 357
>gi|6650705|gb|AAF21977.1|AF115280_1 thiolproteinase SmTP1 [Sarcocystis muris]
Length = 394
Score = 148 bits (373), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 82/197 (41%), Positives = 114/197 (57%), Gaps = 20/197 (10%)
Query: 54 HGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKM 113
H + Y E E+ R IFK NL YI N +G +Y L N+F DLT +EFR Y GYK
Sbjct: 96 HNKFYATEEERLKRYAIFKNNLTYIHNHNMQG-YSYVLKMNKFGDLTLEEFRQRYLGYKK 154
Query: 114 P---SPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVE 170
P +P T+ +++ D+PT +DWR +G VT +K+Q +CG CWAF+A A+E
Sbjct: 155 PDLRTPPREVDTT----LESVEDNDIPTHVDWRQRGCVTSVKDQGDCGSCWAFSATGAME 210
Query: 171 GITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIF----------N 219
G+ ++G L+ LS+QQL+DCS GN GC GG E+AF Y+++N GI +
Sbjct: 211 GVYCAKTGKLVNLSQQQLVDCSRFLGNQGCDGGRMEEAFEYVVENGGICSGENYPYMRKD 270
Query: 220 GVC-GTQLDHAVTIVGF 235
GVC +Q TI G+
Sbjct: 271 GVCKSSQCTSVATITGY 287
Score = 68.6 bits (166), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 32/77 (41%), Positives = 44/77 (57%), Gaps = 4/77 (5%)
Query: 209 AYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDG-ANYWLIKNSWGNTWGDAGYMKIVRDE 267
A+ GIF+ CGT LDH V +VG+ G +YW++KNSWG WG GYM + +
Sbjct: 318 AFQFYYDGIFDAPCGTNLDHGVLLVGYSAETAGQGDYWIMKNSWGAAWGKGGYMLMAMHK 377
Query: 268 ---GLCGIGTRSSYPLA 281
G CG+ S+P+A
Sbjct: 378 GPAGQCGVLLDGSFPVA 394
>gi|146152090|gb|ABQ08058.1| cathepsin L [Misgurnus mizolepis]
Length = 337
Score = 148 bits (373), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 84/204 (41%), Positives = 118/204 (57%), Gaps = 10/204 (4%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
M+ + L C S V ++ S +Q + + E+W HG++Y E E+ R I+++NL
Sbjct: 1 MWTYLALFTLCLSGVFAAPSLDKQ-LDDHWEQWKTWHGKNYH-EKEEGWRRMIWEKNLRK 58
Query: 78 IEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT 134
I+ N E G TY+LG N F D+ ++EFR + GYK + R S F N
Sbjct: 59 IQFHNLEHSMGIHTYRLGMNHFGDMNHEEFRQVMNGYK--HKTERKFKGSLFMEPNF--L 114
Query: 135 DVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-T 193
+VP+ LDWR+KG VTP+K+Q ECG CWAF+ A+EG + G L+ LSEQ L+DCS
Sbjct: 115 EVPSKLDWREKGYVTPVKDQGECGSCWAFSTTGAMEGQMFRKQGKLVSLSEQNLVDCSRP 174
Query: 194 NGNNGCLGGSREKAFAYIIQNQGI 217
GN GC GG ++AF YI N G+
Sbjct: 175 EGNEGCNGGLMDQAFQYIKDNNGL 198
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 29/61 (47%), Positives = 40/61 (65%), Gaps = 4/61 (6%)
Query: 225 QLDHAVTIVGFG---TTEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 280
+LDH V +VG+G DG YW++KNSW +WGD GY+ + +D + CGI T +SYPL
Sbjct: 277 ELDHGVLVVGYGFEGEDVDGKKYWIVKNSWSESWGDKGYIYMAKDRKNHCGIATAASYPL 336
Query: 281 A 281
Sbjct: 337 V 337
>gi|72008176|ref|XP_780713.1| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
Length = 335
Score = 148 bits (373), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 82/203 (40%), Positives = 125/203 (61%), Gaps = 9/203 (4%)
Query: 19 FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
++ + L+ +C VVSS S E +W +HG+ Y + E+ R I+++NL+ +
Sbjct: 3 YLSVLLVAAC---VVSSLSMSFTDFDEDWNQWKNEHGKRYLSDEEEASRKLIWEKNLDIV 59
Query: 79 EKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD 135
K N + G+ TY LG NQF+DL N+EF A+ TG+++ S ++ STF N ++ +
Sbjct: 60 IKHNLKYDLGHFTYALGMNQFADLKNEEFVAMMTGFRVNGTS-KAAKGSTFLPSN-NIGE 117
Query: 136 VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-TN 194
+P ++DWR KG VTP+K+Q +CG CWAF+ ++EG +G L+ LSEQ L+DCS
Sbjct: 118 LPKTVDWRTKGYVTPVKDQGQCGSCWAFSTTGSLEGQHFKATGKLVSLSEQNLVDCSGKE 177
Query: 195 GNNGCLGGSREKAFAYIIQNQGI 217
GN GC GG ++AF YII+ GI
Sbjct: 178 GNEGCDGGLMDQAFQYIIKAGGI 200
Score = 74.3 bits (181), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 34/76 (44%), Positives = 49/76 (64%), Gaps = 3/76 (3%)
Query: 208 FAYIIQNQGIFN--GVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR 265
++ + G++N T LDH V VG+GTT DG +YW++KNSW TWG GY+ + R
Sbjct: 259 MSFQLYKSGVYNEPDCSSTLLDHGVLAVGYGTTSDGTDYWIVKNSWAETWGMNGYLWMSR 318
Query: 266 D-EGLCGIGTRSSYPL 280
+ + CGI T++SYPL
Sbjct: 319 NKDNQCGIATQASYPL 334
>gi|18390634|ref|NP_563764.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|8844131|gb|AAF80223.1|AC025290_12 Contains similarity to a cysteine endopeptidase 1 from Phaseolus
vulgaris gb|U52970 and is a member of the papain
cysteine protease family PF|00112 [Arabidopsis thaliana]
gi|332189848|gb|AEE27969.1| cysteine proteinase-like protein [Arabidopsis thaliana]
Length = 343
Score = 148 bits (373), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 89/238 (37%), Positives = 131/238 (55%), Gaps = 26/238 (10%)
Query: 15 TTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKEN 74
T + I L+ S V SS +++ + EKW+ H + Y E +R I++ N
Sbjct: 11 TLAVLICFVLIASKLCSVDSSVYDPHKTLKQRFEKWLKTHSKLYGGRDEWMLRFGIYQSN 70
Query: 75 LEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT 134
++ I+ N + +KL N+F+D+TN EF+A + G +T+S + +
Sbjct: 71 VQLIDYINSL-HLPFKLTDNRFADMTNSEFKAHFLGL--------NTSSLRLHKKQRPVC 121
Query: 135 D----VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLD 190
D VP ++DWR +GAVTPI+NQ +CG CWAF+AVAA+EGI KI++GNL+ LSEQQL+D
Sbjct: 122 DPAGNVPDAVDWRTQGAVTPIRNQGKCGGCWAFSAVAAIEGINKIKTGNLVSLSEQQLID 181
Query: 191 CSTNG-NNGCLGGSREKAFAYIIQNQGI-------FNGVCGT-----QLDHAVTIVGF 235
C N GC GG E AF +I N G+ + G+ GT + VTI G+
Sbjct: 182 CDVGTYNKGCSGGLMETAFEFIKTNGGLATETDYPYTGIEGTCDQEKSKNKVVTIQGY 239
Score = 79.0 bits (193), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 35/78 (44%), Positives = 49/78 (62%), Gaps = 5/78 (6%)
Query: 207 AFAYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR- 265
F + + + G+F CGT L+H VT+VG+G D YW++KNSWG WG+ GY+++ R
Sbjct: 266 GFIFQLYSSGVFTNYCGTNLNHGVTVVGYGVEGD-QKYWIVKNSWGTGWGEEGYIRMERG 324
Query: 266 ---DEGLCGIGTRSSYPL 280
D G CGI +SYPL
Sbjct: 325 VSEDTGKCGIAMMASYPL 342
>gi|351700981|gb|EHB03900.1| Cathepsin H [Heterocephalus glaber]
Length = 334
Score = 148 bits (373), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 98/305 (32%), Positives = 134/305 (43%), Gaps = 80/305 (26%)
Query: 48 EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRAL 107
+ WM QH + Y + E R +IF N I NK GN T+K+ NQFSD++ DE +
Sbjct: 35 KSWMMQHQKEYSTK-EYHHRQQIFASNWRKINAHNK-GNHTFKMALNQFSDMSFDEIKRK 92
Query: 108 YTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGA-VTPIKNQKECGCCWAFAAV 166
Y P + S T S + PTS+DWR KG V+ +KNQ CG CW F+
Sbjct: 93 YL---WSEPQNCSATKSNYF---RGTGPYPTSVDWRKKGNFVSAVKNQGACGSCWTFSTT 146
Query: 167 AAVEGITKIRSGNLIQLSEQQLLDCSTNGNN-GCLGGSREKAFAYIIQNQGIF------- 218
A+E I SG ++ L+EQQL+DC+ + NN GC GG +AF YI+ N+GI
Sbjct: 147 GALESAVAIASGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYILYNKGIMGEDTYPY 206
Query: 219 ---NGVCGTQLDHAVTIV--------------------------GFGTTED--------- 240
+G C Q A+ V + TED
Sbjct: 207 EGKDGHCRFQPQKAIAFVKDIVNITLNDEEAMVEAVALYNPVSFAYEVTEDFMSYKRGIY 266
Query: 241 -------------------------GANYWLIKNSWGNTWGDAGYMKIVRDEGLCGIGTR 275
G YW++KNSWG WG+ GY I R + +CG+
Sbjct: 267 SSTSCHKTPDKVNHAVLAVGYGVDHGVPYWIVKNSWGTQWGNNGYFLIERGKNMCGLAAC 326
Query: 276 SSYPL 280
+SYP+
Sbjct: 327 ASYPI 331
>gi|115478933|ref|NP_001063060.1| Os09g0381400 [Oryza sativa Japonica Group]
gi|113631293|dbj|BAF24974.1| Os09g0381400 [Oryza sativa Japonica Group]
gi|215678649|dbj|BAG92304.1| unnamed protein product [Oryza sativa Japonica Group]
gi|218202075|gb|EEC84502.1| hypothetical protein OsI_31193 [Oryza sativa Indica Group]
Length = 362
Score = 147 bits (372), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 87/258 (33%), Positives = 130/258 (50%), Gaps = 37/258 (14%)
Query: 11 FKINTTPMFIIITLLVSC----ASQVVSSRSTH-------EQSVVEIHEKWMAQHGRSYK 59
F +P + + LL SC A+ ++ +R+T + +++ W H RSY
Sbjct: 4 FMACASPPVLTLALLASCGALLATSMLPARATAGSCLDVGDMVMMDRFRAWQGAHNRSYP 63
Query: 60 DELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKM---PSP 116
E R +++ N E+I+ N G+ TY+L N+F+DLT +EF A YTGY P
Sbjct: 64 SAEEALQRFDVYRRNAEFIDAVNLRGDLTYQLAENEFADLTEEEFLATYTGYYAGDGPVD 123
Query: 117 SHRSTT-----SSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKE-CGCCWAFAAVAAVE 170
TT ++F Y+ DVP S+DWR +GAV P K+Q C CWAF A +E
Sbjct: 124 DSVITTGAGDVDASFSYR----VDVPASVDWRAQGAVVPPKSQTSTCSSCWAFVTAATIE 179
Query: 171 GITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIFN----------G 220
+ I++G L+ LSEQQL+DC + + GC GS +A+ ++++N G+ G
Sbjct: 180 SLNMIKTGKLVSLSEQQLVDCDSY-DGGCNLGSYGRAYKWVVENGGLTTEADYPYTARRG 238
Query: 221 VC--GTQLDHAVTIVGFG 236
C HA I GFG
Sbjct: 239 PCNRAKSAHHAAKITGFG 256
Score = 84.3 bits (207), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 37/68 (54%), Positives = 49/68 (72%), Gaps = 4/68 (5%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGT-TEDGANYWLIKNSWGNTWGDAGYMKIVRD---EGLCG 271
G++ G CGT+L HAVT+VG+GT GA YW IKNSWG +WG+ GY++I+RD GLCG
Sbjct: 291 GVYTGPCGTRLAHAVTVVGYGTDASSGAKYWTIKNSWGQSWGERGYIRILRDVGGPGLCG 350
Query: 272 IGTRSSYP 279
+ +YP
Sbjct: 351 VTLDIAYP 358
>gi|90265242|emb|CAH67695.1| H0624F09.3 [Oryza sativa Indica Group]
Length = 494
Score = 147 bits (372), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 75/159 (47%), Positives = 108/159 (67%), Gaps = 8/159 (5%)
Query: 63 EKEMRLKIFKENLEYIEKANKEGNRT--YKLGTNQFSDLTNDEFRALYTGYKMPSPSHRS 120
E E R ++F +NL++++ N + ++LG N+F+DLTN EFRA Y G P+ R
Sbjct: 84 EHERRFRVFWDNLKFVDAHNARADERGGFRLGMNRFADLTNGEFRATYLG-TTPAGRGRR 142
Query: 121 TTSSTFKYQNLSMTDVPTSLDWRDKGAV-TPIKNQKECGCCWAFAAVAAVEGITKIRSGN 179
+ Y++ + +P S+DWRDKGAV P+KNQ +CG CWAF+AVAAVEGI KI +G
Sbjct: 143 VGEA---YRHDGVEALPDSVDWRDKGAVVAPVKNQGQCGSCWAFSAVAAVEGINKIVTGE 199
Query: 180 LIQLSEQQLLDCSTNG-NNGCLGGSREKAFAYIIQNQGI 217
L+ LSEQ+L++C+ NG N+GC GG + AFA+I +N G+
Sbjct: 200 LVSLSEQELVECARNGQNSGCNGGIMDDAFAFIARNGGL 238
Score = 76.3 bits (186), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 37/84 (44%), Positives = 51/84 (60%), Gaps = 9/84 (10%)
Query: 202 GSREKAFAYIIQNQGIFNGVCGTQLDHAVTIVGFGT-TEDGANYWLIKNSWGNTWGDAGY 260
G RE + + + G+F G CGT LDH V VG+GT GA YW ++NSWG WG+ GY
Sbjct: 295 GGRE----FQLYDSGVFTGRCGTNLDHGVVAVGYGTDAATGAAYWTVRNSWGPDWGENGY 350
Query: 261 MKIVRD----EGLCGIGTRSSYPL 280
+++ R+ G CGI +SYP+
Sbjct: 351 IRMERNVTARTGKCGIAMMASYPI 374
>gi|115461226|ref|NP_001054213.1| Os04g0670500 [Oryza sativa Japonica Group]
gi|62510688|sp|Q7XR52.2|CYSP1_ORYSJ RecName: Full=Cysteine protease 1; AltName: Full=OsCP1; Flags:
Precursor
gi|38345300|emb|CAE02828.2| OSJNBa0043A12.33 [Oryza sativa Japonica Group]
gi|113565784|dbj|BAF16127.1| Os04g0670500 [Oryza sativa Japonica Group]
gi|215741575|dbj|BAG98070.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 490
Score = 147 bits (372), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 75/159 (47%), Positives = 108/159 (67%), Gaps = 8/159 (5%)
Query: 63 EKEMRLKIFKENLEYIEKANKEGNRT--YKLGTNQFSDLTNDEFRALYTGYKMPSPSHRS 120
E E R ++F +NL++++ N + ++LG N+F+DLTN EFRA Y G P+ R
Sbjct: 84 EHERRFRVFWDNLKFVDAHNARADERGGFRLGMNRFADLTNGEFRATYLG-TTPAGRGRR 142
Query: 121 TTSSTFKYQNLSMTDVPTSLDWRDKGAV-TPIKNQKECGCCWAFAAVAAVEGITKIRSGN 179
+ Y++ + +P S+DWRDKGAV P+KNQ +CG CWAF+AVAAVEGI KI +G
Sbjct: 143 VGEA---YRHDGVEALPDSVDWRDKGAVVAPVKNQGQCGSCWAFSAVAAVEGINKIVTGE 199
Query: 180 LIQLSEQQLLDCSTNG-NNGCLGGSREKAFAYIIQNQGI 217
L+ LSEQ+L++C+ NG N+GC GG + AFA+I +N G+
Sbjct: 200 LVSLSEQELVECARNGQNSGCNGGIMDDAFAFIARNGGL 238
Score = 76.3 bits (186), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 37/84 (44%), Positives = 51/84 (60%), Gaps = 9/84 (10%)
Query: 202 GSREKAFAYIIQNQGIFNGVCGTQLDHAVTIVGFGT-TEDGANYWLIKNSWGNTWGDAGY 260
G RE + + + G+F G CGT LDH V VG+GT GA YW ++NSWG WG+ GY
Sbjct: 295 GGRE----FQLYDSGVFTGRCGTNLDHGVVAVGYGTDAATGAAYWTVRNSWGPDWGENGY 350
Query: 261 MKIVRD----EGLCGIGTRSSYPL 280
+++ R+ G CGI +SYP+
Sbjct: 351 IRMERNVTARTGKCGIAMMASYPI 374
>gi|357114837|ref|XP_003559200.1| PREDICTED: fruit bromelain-like [Brachypodium distachyon]
Length = 371
Score = 147 bits (372), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 84/224 (37%), Positives = 120/224 (53%), Gaps = 37/224 (16%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHE--------KWMAQHGRSYKDELEKEMRLK 69
+F+ +T L A +++ + H VVE+ + +W A H R+Y D E+ R +
Sbjct: 27 LFVFLTALPPAA--IMTPAAGH---VVELDDMLMLDRFVRWQAAHNRTYGDAEERLRRFQ 81
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQ 129
+++ N+EYIE N+ G TY+LG NQF+DLT++EF ++Y S
Sbjct: 82 VYRANIEYIEATNRRGGLTYELGENQFADLTSEEFLSMYA-------SSYDAGDRADDEA 134
Query: 130 NLSMTDV---------------PTSLDWRDKGAVTPIKNQ-KECGCCWAFAAVAAVEGIT 173
L TDV P S DWR KGAVTP KNQ C CWAF VA +EG+T
Sbjct: 135 ALITTDVAGDGAWSDGDLEALPPPSWDWRAKGAVTPPKNQGPTCSSCWAFVTVATIEGLT 194
Query: 174 KIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
I++G LI LSEQQL+DC + GC GS + F ++++N G+
Sbjct: 195 FIKTGKLISLSEQQLVDCDMY-DGGCNTGSYSRGFRWVLENGGL 237
Score = 79.3 bits (194), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 34/68 (50%), Positives = 47/68 (69%), Gaps = 4/68 (5%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGT-TEDGANYWLIKNSWGNTWGDAGYMKIVRD---EGLCG 271
G+++G CGT L HAVT+VG+G GA YW++KNSWG WG+ G++++ RD GLCG
Sbjct: 303 GVYSGPCGTNLAHAVTVVGYGVDPASGAKYWIVKNSWGQAWGERGFIRMRRDVGGPGLCG 362
Query: 272 IGTRSSYP 279
I +YP
Sbjct: 363 IALDVAYP 370
>gi|41688064|dbj|BAD08618.1| cathepsin L preproprotein [Cyprinus carpio]
Length = 337
Score = 147 bits (372), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 84/205 (40%), Positives = 119/205 (58%), Gaps = 12/205 (5%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIH-EKWMAQHGRSYKDELEKEMRLKIFKENLE 76
M + + C S V ++ + +Q ++ H E+W HG+ Y E E+ R ++++NL+
Sbjct: 1 MRVFLAAFALCLSAVFAAPTLDKQ--LDNHWEQWKNWHGKKYH-EKEEGWRRMVWEKNLQ 57
Query: 77 YIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
IE N E G TY+LG N+F D+T++EFR + GYK R S F N
Sbjct: 58 KIELHNLEHSMGTHTYRLGMNRFGDMTHEEFRQVMNGYK--HKKERRFRGSLFMEPNF-- 113
Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS- 192
+VP SLDWR+KG VTP+K+Q ECG CWAF+ A+EG ++G L+ LSEQ L+DCS
Sbjct: 114 LEVPNSLDWREKGYVTPVKDQGECGSCWAFSTTGAMEGQMFRKTGKLVSLSEQNLVDCSR 173
Query: 193 TNGNNGCLGGSREKAFAYIIQNQGI 217
GN GC GG ++AF YI G+
Sbjct: 174 PEGNEGCNGGLMDQAFQYIKDQNGL 198
Score = 63.9 bits (154), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 29/60 (48%), Positives = 37/60 (61%), Gaps = 4/60 (6%)
Query: 225 QLDHAVTIVGFG---TTEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 280
+LDH V VG+G DG YW++KNSW WGD GY+ + +D CGI T +SYPL
Sbjct: 277 ELDHGVLAVGYGFEGEDVDGKKYWIVKNSWSENWGDKGYVYMAKDRHNHCGIATAASYPL 336
>gi|222641485|gb|EEE69617.1| hypothetical protein OsJ_29194 [Oryza sativa Japonica Group]
Length = 360
Score = 147 bits (372), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 87/258 (33%), Positives = 130/258 (50%), Gaps = 37/258 (14%)
Query: 11 FKINTTPMFIIITLLVSC----ASQVVSSRSTH-------EQSVVEIHEKWMAQHGRSYK 59
F +P + + LL SC A+ ++ +R+T + +++ W H RSY
Sbjct: 4 FMACASPPVLTLALLASCGALLATSMLPARATAGSCLDVGDMVMMDRFRAWQGAHNRSYP 63
Query: 60 DELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKM---PSP 116
E R +++ N E+I+ N G+ TY+L N+F+DLT +EF A YTGY P
Sbjct: 64 SAEEALQRFDVYRRNAEFIDAVNLRGDLTYQLAENEFADLTEEEFLATYTGYYAGDGPVD 123
Query: 117 SHRSTT-----SSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKE-CGCCWAFAAVAAVE 170
TT ++F Y+ DVP S+DWR +GAV P K+Q C CWAF A +E
Sbjct: 124 DSVITTGAGDVDASFSYR----VDVPASVDWRAQGAVVPPKSQTSTCSSCWAFVTAATIE 179
Query: 171 GITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIFN----------G 220
+ I++G L+ LSEQQL+DC + + GC GS +A+ ++++N G+ G
Sbjct: 180 SLNMIKTGKLVSLSEQQLVDCDSY-DGGCNLGSYGRAYKWVVENGGLTTEADYPYTARRG 238
Query: 221 VC--GTQLDHAVTIVGFG 236
C HA I GFG
Sbjct: 239 PCNRAKSAHHAAKITGFG 256
Score = 77.8 bits (190), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 32/54 (59%), Positives = 42/54 (77%), Gaps = 1/54 (1%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGT-TEDGANYWLIKNSWGNTWGDAGYMKIVRDEG 268
G++ G CGT+L HAVT+VG+GT GA YW IKNSWG +WG+ GY++I+RD G
Sbjct: 291 GVYTGPCGTRLAHAVTVVGYGTDASSGAKYWTIKNSWGQSWGERGYIRILRDVG 344
>gi|291410711|ref|XP_002721635.1| PREDICTED: cathepsin H [Oryctolagus cuniculus]
Length = 333
Score = 147 bits (372), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 98/305 (32%), Positives = 134/305 (43%), Gaps = 80/305 (26%)
Query: 48 EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRAL 107
+ WM+QH + Y E E RL+ F N I A+ GN T+++G NQFSD++ E +
Sbjct: 34 KSWMSQHHKKYSAE-EYPRRLQTFVRNWRKI-NAHNNGNHTFQMGLNQFSDMSFAEIKHK 91
Query: 108 YTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGA-VTPIKNQKECGCCWAFAAV 166
Y P + S T S + P+S+DWR KG V+P+KNQ CG CW F+
Sbjct: 92 YL---WTEPQNCSATKSNYL---RGTGPYPSSVDWRKKGNFVSPVKNQGACGSCWTFSTT 145
Query: 167 AAVEGITKIRSGNLIQLSEQQLLDCSTNGNN-GCLGGSREKAFAYIIQNQGI-------- 217
A+E I G ++ L+EQQL+DC+ N NN GC GG +AF YI+ N+GI
Sbjct: 146 GALESAVAIAGGKMLSLAEQQLVDCAQNFNNHGCEGGLPSQAFEYILYNKGIMGEDSYPY 205
Query: 218 --FNGVCGTQLDHAVTIV--------------------------GFGTTED--------- 240
G C Q A+ V F TED
Sbjct: 206 RAMEGRCKFQPQKAIAFVKDVANITLNDEEAMVEAVALYNPVSFAFEVTEDFMQYRKGIY 265
Query: 241 -------------------------GANYWLIKNSWGNTWGDAGYMKIVRDEGLCGIGTR 275
G YW++KNSWG+ WG GY I R + +CG+
Sbjct: 266 SSTSCHKTPDKVNHAVLAVGYGEENGVPYWIVKNSWGSHWGMNGYFYIERGKNMCGLAAC 325
Query: 276 SSYPL 280
+SYP+
Sbjct: 326 ASYPI 330
>gi|297843430|ref|XP_002889596.1| hypothetical protein ARALYDRAFT_887827 [Arabidopsis lyrata subsp.
lyrata]
gi|297335438|gb|EFH65855.1| hypothetical protein ARALYDRAFT_887827 [Arabidopsis lyrata subsp.
lyrata]
Length = 343
Score = 147 bits (371), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 82/208 (39%), Positives = 120/208 (57%), Gaps = 14/208 (6%)
Query: 15 TTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKEN 74
T + I L+ S V SS +++ + EKW+ H + Y E +R I++ N
Sbjct: 11 TLVVLICFVLIASKLCSVNSSVYDPHKTLKQRFEKWLKTHSKLYGGRDEWMLRFGIYQSN 70
Query: 75 LEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT 134
++ I+ N + +KL N+F+D+TN EF+A + G +T+S + +
Sbjct: 71 VQLIDYINSL-HLPFKLTDNRFADMTNSEFKAHFLGL--------NTSSLRLHKKQRPVC 121
Query: 135 D----VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLD 190
D VP ++DWR +GAVTPI+NQ +CG CWAF+AVAA+EGI KI++GNL+ LSEQQL+D
Sbjct: 122 DPAGNVPDAVDWRTQGAVTPIRNQGKCGGCWAFSAVAAIEGINKIKTGNLVSLSEQQLID 181
Query: 191 CSTNG-NNGCLGGSREKAFAYIIQNQGI 217
C N GC GG E AF +I N G+
Sbjct: 182 CDVGTYNKGCSGGLMETAFEFIKSNGGL 209
Score = 79.0 bits (193), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 35/78 (44%), Positives = 49/78 (62%), Gaps = 5/78 (6%)
Query: 207 AFAYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR- 265
F + + + G+F CGT L+H VT+VG+G D YW++KNSWG WG+ GY+++ R
Sbjct: 266 GFIFQLYSSGVFTSYCGTNLNHGVTVVGYGVEGD-QKYWIVKNSWGTGWGEEGYIRMERG 324
Query: 266 ---DEGLCGIGTRSSYPL 280
D G CGI +SYPL
Sbjct: 325 ISEDTGKCGIAMLASYPL 342
>gi|116563690|gb|ABJ99858.1| cathepsin L [Hippoglossus hippoglossus]
Length = 336
Score = 147 bits (371), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 81/203 (39%), Positives = 123/203 (60%), Gaps = 10/203 (4%)
Query: 19 FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
+ + +L +C S V+S+ Q + E + W + H + Y E E+ R ++++NL+ I
Sbjct: 1 MLPLLVLTACLSSVLSAPVLDAQ-LNEHWDLWKSWHSKKYH-EKEEGWRRMVWEKNLQKI 58
Query: 79 EKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD 135
E N E G +++LG N F D+T++EFR + GYK+ + R T S F N MT
Sbjct: 59 ELHNLEHSMGTHSFRLGMNHFGDMTHEEFRQIMNGYKLKT--QRKFTGSLFMEPNF-MT- 114
Query: 136 VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-TN 194
P+++DWR+KG VTP+K+Q +CG CWAF+ A+EG ++G L+ LSEQ L+DCS
Sbjct: 115 APSAVDWREKGYVTPVKDQGQCGSCWAFSTTGALEGQQFRKTGKLVSLSEQNLVDCSRPE 174
Query: 195 GNNGCLGGSREKAFAYIIQNQGI 217
GN GC GG ++AF Y+ NQG+
Sbjct: 175 GNEGCGGGLMDQAFQYVTDNQGL 197
Score = 63.2 bits (152), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 30/60 (50%), Positives = 40/60 (66%), Gaps = 4/60 (6%)
Query: 225 QLDHAVTIVGFG-TTED--GANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 280
+LDH V VG+G ED G +W++KNSWG WGD GY+ + +D + CGI T +SYPL
Sbjct: 276 ELDHGVLAVGYGFEGEDKMGKKFWIVKNSWGEKWGDKGYIYMAKDRKNHCGIATAASYPL 335
>gi|214015327|gb|ACJ62280.1| cysteine protease [Zea mays subsp. parviglumis]
gi|214015329|gb|ACJ62281.1| cysteine protease [Zea mays subsp. parviglumis]
gi|214015345|gb|ACJ62289.1| cysteine protease [Zea mays subsp. parviglumis]
Length = 251
Score = 147 bits (371), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 73/188 (38%), Positives = 114/188 (60%), Gaps = 18/188 (9%)
Query: 46 IHEKWMAQHGRSYKDE-------------LEKEMRLKIFKENLEYIEKANKE---GNRTY 89
++E W ++HGR ++ ++ +RL++F++NL YI+K N E G T+
Sbjct: 1 MYEAWKSKHGRGGSNDDCDMAPGDEEQEEEDRRLRLEVFRDNLRYIDKHNAEADAGLHTF 60
Query: 90 KLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVT 149
+LG F+DLT DE+R G++ + S Y+ D+P ++DWR GAVT
Sbjct: 61 RLGLTPFADLTLDEYRGRVLGFRARGRRSGARYGSGSGYRARG-GDLPDAIDWRQLGAVT 119
Query: 150 PIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFA 209
+K+Q++CG CWAF+AVAA+EG+ I +GNL+ LSEQ+++DC ++GC GG E AF
Sbjct: 120 EVKDQQQCGGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ-DSGCDGGQMENAFR 178
Query: 210 YIIQNQGI 217
++I N GI
Sbjct: 179 FVIGNGGI 186
>gi|84660246|emb|CAI43320.1| cathepsin L [Lubomirskia baicalensis]
gi|85677150|emb|CAI46307.1| cathepsin L [Lubomirskia baicalensis]
Length = 327
Score = 147 bits (371), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 77/172 (44%), Positives = 106/172 (61%), Gaps = 6/172 (3%)
Query: 48 EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNR-TYKLGTNQFSDLTNDEFRA 106
E W +HG+ Y + E+ R I++ N +Y+++ N + + +G NQF+DL + EF
Sbjct: 23 ESWKKEHGKVYNSDREELTRHIIWQANRKYVDEHNAHAEKFGFTVGMNQFADLESSEFGR 82
Query: 107 LYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAV 166
LY GY PS + S F + + D+PTS+DWR KG VT IKNQ +CG CWAF+AV
Sbjct: 83 LYNGYN-NKPSMKKAQSKVF---STKVGDLPTSVDWRTKGFVTAIKNQGQCGSCWAFSAV 138
Query: 167 AAVEGITKIRSGNLIQLSEQQLLDCST-NGNNGCLGGSREKAFAYIIQNQGI 217
A +EG +G L+ LSEQ L+DCST GN GC GG + AF Y+I+N GI
Sbjct: 139 AGLEGQHFNATGTLVSLSEQNLVDCSTAEGNQGCNGGLMDNAFQYVIKNGGI 190
Score = 66.2 bits (160), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 30/58 (51%), Positives = 41/58 (70%), Gaps = 2/58 (3%)
Query: 224 TQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE-GLCGIGTRSSYPL 280
T LDH VT VG+ ++ G YW++KNSWG TWG AGY+ + R++ CGI T +SYP+
Sbjct: 268 TSLDHGVTAVGYDSSS-GVAYWIVKNSWGTTWGQAGYIWMSRNKNNQCGIATAASYPI 324
>gi|214015331|gb|ACJ62282.1| cysteine protease [Zea mays subsp. parviglumis]
Length = 251
Score = 147 bits (370), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 73/188 (38%), Positives = 114/188 (60%), Gaps = 18/188 (9%)
Query: 46 IHEKWMAQHGRSYKDE-------------LEKEMRLKIFKENLEYIEKANKE---GNRTY 89
++E W ++HGR ++ ++ +RL++F++NL YI+K N E G T+
Sbjct: 1 MYEAWKSKHGRGGSNDDCDMAPSDEEQEEEDRRLRLEVFRDNLRYIDKHNAEADAGLHTF 60
Query: 90 KLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVT 149
+LG F+DLT DE+R G++ + S Y+ D+P ++DWR GAVT
Sbjct: 61 RLGLTPFADLTLDEYRGRVLGFRARGRRSGARYGSGSGYRARG-GDLPDAIDWRQLGAVT 119
Query: 150 PIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFA 209
+K+Q++CG CWAF+AVAA+EG+ I +GNL+ LSEQ+++DC ++GC GG E AF
Sbjct: 120 EVKDQQQCGGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ-DSGCDGGQMENAFR 178
Query: 210 YIIQNQGI 217
++I N GI
Sbjct: 179 FVIGNGGI 186
>gi|27728675|gb|AAO18731.1| cysteine protease [Gossypium hirsutum]
Length = 389
Score = 147 bits (370), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 78/195 (40%), Positives = 120/195 (61%), Gaps = 12/195 (6%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI--EKANKEGNR-TYKLGTNQF 96
E+ V+EI ++W +H + Y+ E E R + FK NL+YI A ++ N+ + +G N+F
Sbjct: 42 EERVLEIFQQWKEKHRKVYRHAEEAEKRFENFKGNLKYILERNAKRKANKWEHHVGLNKF 101
Query: 97 SDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKE 156
+D++N+EFR Y K+ P ++ T S + + D P+SLDWR+ G VT +K+Q
Sbjct: 102 ADMSNEEFRKAYLS-KVKKPINKGITLSRNMRRKVQSCDAPSSLDWRNYGVVTAVKDQGS 160
Query: 157 CGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQG 216
CG CWAF++ A+EGI + +G+LI LSEQ+L++C T+ N GC GG + AF ++I N G
Sbjct: 161 CGSCWAFSSTGAMEGINALVTGDLISLSEQELVECDTS-NYGCEGGYMDYAFEWVINNGG 219
Query: 217 I-------FNGVCGT 224
I + GV GT
Sbjct: 220 IDSESDYPYTGVDGT 234
Score = 68.6 bits (166), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 33/80 (41%), Positives = 47/80 (58%), Gaps = 8/80 (10%)
Query: 207 AFAYIIQNQGIFNGVCGTQ---LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKI 263
A + + GI++G C +DHAV IVG+G+ ED YW++KNSWG +WG GY +
Sbjct: 277 AIDFQLYTGGIYDGSCSDDPDDIDHAVLIVGYGS-EDSEEYWIVKNSWGTSWGIDGYFYL 335
Query: 264 VRDE----GLCGIGTRSSYP 279
RD G+C + +SYP
Sbjct: 336 KRDTDLPYGVCAVNAMASYP 355
>gi|297818850|ref|XP_002877308.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297323146|gb|EFH53567.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 306
Score = 147 bits (370), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 84/232 (36%), Positives = 127/232 (54%), Gaps = 37/232 (15%)
Query: 84 EGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSH----RSTTSSTFKYQNLSMTDVPTS 139
+ + YK+ + DLT+DE++++Y G K+ S TS+ ++Y+ ++P
Sbjct: 47 DNEKNYKVLERRRDDLTDDEYQSIYLGGKVERTSLSLEWEWDTSNRYRYK--EGDNLPDE 104
Query: 140 LDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGC 199
+DWR++GAV + K CWAF+A AVEGI +I +G LI LSEQQL+DC+T N+ C
Sbjct: 105 VDWRERGAVVEQEISKNLRGCWAFSAAGAVEGINQITTGELISLSEQQLIDCATETNDRC 164
Query: 200 LGGSREKAFAYI------------------------IQNQGIFNGVC-GTQLDHAVTIVG 234
GG AF +I I+ G+F G C T ++H V +VG
Sbjct: 165 DGGDAAYAFMFIKENGGIVTNKDYPFTGDKNATCKAIEKDGVFTGPCDSTLINHNVLVVG 224
Query: 235 FGT-TEDGANYWLIKNSWGNTWGDAGYMKIVRDE-----GLCGIGTRSSYPL 280
+GT + G +YWLI+NS+G+TWG+ GY ++ R G+CG+ YPL
Sbjct: 225 YGTNSTTGQDYWLIRNSFGSTWGENGYFRLQRSNIQNSTGICGVTLTPVYPL 276
>gi|198432221|ref|XP_002130541.1| PREDICTED: similar to cathepsin L [Ciona intestinalis]
Length = 330
Score = 147 bits (370), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 106/336 (31%), Positives = 155/336 (46%), Gaps = 86/336 (25%)
Query: 22 ITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKA 81
+ +L C + V S +T + +W HG+SY E + +L I+++NL + +
Sbjct: 3 VLILALCVTYVASRDTTSLK-----WNEWKNTHGKSYASHEEMKRQL-IWEKNLRVVTQH 56
Query: 82 N---KEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPT 138
N EG TY + +F+DL NDEF +Y P+ R K PT
Sbjct: 57 NYEYDEGLHTYTMAMTKFADLENDEFNTMYLA---SMPADRKNELVCKKQTIDKFAQNPT 113
Query: 139 SLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST-NGNN 197
++DWR +G VTP+KNQ +CG CWAF+A ++EG ++ L+ LSEQQL+DCST G+
Sbjct: 114 TVDWRTQGYVTPVKNQLQCGSCWAFSATGSLEGQHFAKTKKLVSLSEQQLIDCSTKQGDL 173
Query: 198 GCLGGSREKAFAYIIQNQGI----------FNGVC---------------------GTQL 226
GC GG + AFAYI Q GI N VC TQL
Sbjct: 174 GCGGGYPDWAFAYINQVGGIESETNYPYEAKNDVCRFNVSEVAATLTGCVDITPDSETQL 233
Query: 227 DHAVTIVG---------------FGT-------------------------TEDGANYWL 246
+ AV +G +G+ ++G YW+
Sbjct: 234 EKAVGSIGPVSVLIDASHISFQLYGSGIYYEQQCSSSPASLDHGVLAVGYGADNGQEYWM 293
Query: 247 IKNSWGNTWGD-AGYMKIVRDE-GLCGIGTRSSYPL 280
+KNSWG WG GY+K+ +++ CGI T++SYP+
Sbjct: 294 VKNSWGEGWGKLGGYIKMAKNKNNNCGIATQASYPI 329
>gi|345321446|ref|XP_001521682.2| PREDICTED: cathepsin L1-like [Ornithorhynchus anatinus]
Length = 221
Score = 147 bits (370), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 83/207 (40%), Positives = 119/207 (57%), Gaps = 9/207 (4%)
Query: 16 TPMFIIITL-LVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKEN 74
T + ++TL LV A+ V + ++ E+W A H R Y+ E E R ++++N
Sbjct: 2 TSLLRLVTLSLVVAAAPTVLGLPRLDSALDRHWERWKAWHRRDYR-ENEDGWRRAVWEKN 60
Query: 75 LEYIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNL 131
L+ IE N+E G R++K+ N F D+TN+EFR + G++ S R S F N
Sbjct: 61 LKMIEMHNREYSVGKRSFKMEMNHFGDMTNEEFRQVMNGFRYDR-SAREVQGSVFLEPNF 119
Query: 132 SMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDC 191
+ P S+DWR KG VTP+KNQ +CG CWAF+ ++EG ++G L+ LSEQ L+DC
Sbjct: 120 --VEAPKSVDWRQKGYVTPVKNQGQCGSCWAFSTTGSLEGQHFRKTGKLVSLSEQNLVDC 177
Query: 192 S-TNGNNGCLGGSREKAFAYIIQNQGI 217
S GN+GC GG + AF YI N GI
Sbjct: 178 SRPQGNHGCNGGLMDYAFQYIKDNGGI 204
>gi|229596051|ref|XP_001013456.3| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|225565626|gb|EAR93211.3| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 315
Score = 147 bits (370), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 98/333 (29%), Positives = 152/333 (45%), Gaps = 82/333 (24%)
Query: 13 INTTPMFIIITL-LVSCASQVVSSRSTH----EQSVVEIHEKWMAQHGRSYKDELEKEMR 67
+N +F I L L++ A+ ++ +++ H +Q++ + + ++ + Y D + R
Sbjct: 1 MNKNILFAIAGLALLATATTILLTKTHHNTQEDQNIQALWSAFKTKYNKKYADPDFERYR 60
Query: 68 LKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFK 127
++IF ENL+ +E + T G QF D+T +EF+ Y KM +S F
Sbjct: 61 IEIFTENLKVVE------SNTKNYGITQFMDITREEFKQTYLTLKM----KNGLKASPFA 110
Query: 128 YQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQ 187
N D +DW KGAVTP+K+Q +CG CW+F+ AVEG + + L LSEQ
Sbjct: 111 KFN----DAGVEIDWTTKGAVTPVKDQGQCGSCWSFSTTGAVEGALFLSTKKLTSLSEQY 166
Query: 188 LLDCSTNGNNGCLGGSREKAFAYI------------------------------------ 211
L+DCS +GN GC GG + AF +I
Sbjct: 167 LVDCSKDGNEGCNGGLMDTAFDFISQHGIPTEAAYPYKAVDGTCKMTSGPYKISSHTDIQ 226
Query: 212 --------IQNQGIFNGV---------------CGTQLDHAVTIVGFGTTEDGANYWLIK 248
IQ Q I V CGT+LDH V +VG+ + YW +K
Sbjct: 227 DCNDLLNKIQKQPIAIAVDANNFQYYQKDIFSDCGTELDHGVLLVGYSAS---GKYWKVK 283
Query: 249 NSWGNTWGDAGYMKIVRDEGLCGIGTRSSYPLA 281
NSWG WG++G++++ CG+ +S+P A
Sbjct: 284 NSWGPNWGESGFIRLAAGN-TCGLCNMASFPNA 315
>gi|214015291|gb|ACJ62262.1| cysteine protease [Zea mays subsp. parviglumis]
Length = 251
Score = 147 bits (370), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 75/189 (39%), Positives = 116/189 (61%), Gaps = 20/189 (10%)
Query: 46 IHEKWMAQHGR----------SYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLG 92
++E W ++HGR + DE + +RL++F++NL YI+K N E G T++LG
Sbjct: 1 MYEAWKSKHGRGGSNDDDCDMAPGDEEGRRLRLEVFRDNLRYIDKHNAEADAGLHTFRLG 60
Query: 93 TNQFSDLTNDEFRALYTGYKMPSPSHRSTT----SSTFKYQNLSMTDVPTSLDWRDKGAV 148
F+DLT DE+R G++ + RS T ++ + +P ++DWR GAV
Sbjct: 61 LTPFADLTLDEYRGRVLGFR--ARGRRSGTRYGHGHGYRARPRGGDLLPDAIDWRQLGAV 118
Query: 149 TPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAF 208
T +K+Q++CG CWAF+AVAA+EG+ I +GNL+ LSEQ+++DC ++GC GG E AF
Sbjct: 119 TEVKDQQQCGGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ-DSGCDGGQMENAF 177
Query: 209 AYIIQNQGI 217
++I N GI
Sbjct: 178 RFVIGNGGI 186
>gi|302812789|ref|XP_002988081.1| hypothetical protein SELMODRAFT_183539 [Selaginella moellendorffii]
gi|300144187|gb|EFJ10873.1| hypothetical protein SELMODRAFT_183539 [Selaginella moellendorffii]
Length = 425
Score = 147 bits (370), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 71/158 (44%), Positives = 101/158 (63%), Gaps = 8/158 (5%)
Query: 65 EMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYK---MPSPSHRST 121
+ R + FKEN YIE+ N+ G +Y+LG NQFSDLT++EFR + G + + SP +
Sbjct: 32 DHRFETFKENFRYIEEHNRAGKHSYRLGLNQFSDLTSEEFRQRFLGLRPDLIDSPVLKMP 91
Query: 122 TSSTFK--YQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGN 179
S + +QN+ D+P S+DWR GAVT K+Q CG CWAFA A+EGI +I +G
Sbjct: 92 RDSDIEEGFQNV---DLPASVDWRQHGAVTAPKDQGSCGGCWAFATTGAIEGINQIVTGQ 148
Query: 180 LIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
L+ LSEQ+L+DC + GC GG E A+ +I++N G+
Sbjct: 149 LVSLSEQELIDCDKKADKGCDGGLMENAYQFIVENGGL 186
Score = 81.3 bits (199), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 36/69 (52%), Positives = 50/69 (72%), Gaps = 5/69 (7%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE----GLCG 271
G+F G CG +++H V IVG+GT EDG +YW++KNSW TWGD G++K+ R+ GLC
Sbjct: 253 GVFTGHCGEEINHGVLIVGYGT-EDGLDYWIVKNSWAATWGDGGFVKMQRNTGKRGGLCS 311
Query: 272 IGTRSSYPL 280
I T +SYP+
Sbjct: 312 INTLASYPV 320
>gi|945081|gb|AAC49361.1| P21 [Petunia x hybrida]
Length = 358
Score = 147 bits (370), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 103/303 (33%), Positives = 140/303 (46%), Gaps = 78/303 (25%)
Query: 49 KWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALY 108
++ ++G+ Y E + R IF +NLE I N +G +YKLG N+FSDLT DEFR
Sbjct: 61 RFARRYGKRYDSVEEIKQRFDIFLDNLEMINSHNDKG-LSYKLGVNEFSDLTWDEFRRDR 119
Query: 109 TGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAA 168
G + + +TT K ++ + P + DWR+ G V+P+KNQ +CG CW F+ A
Sbjct: 120 LG---AAQNCSATTKGNLKLRDAVL---PETKDWREAGIVSPVKNQGKCGSCWTFSTTGA 173
Query: 169 VEGITKIRSGNLIQLSEQQLLDCSTNGNN-GCLGGSREKAFAYIIQNQGI---------- 217
+E + G I LSEQQL+DC+ NN GC GG +AF YI N G+
Sbjct: 174 LEAAYTQKFGKGISLSEQQLVDCAGAFNNFGCNGGLPSQAFEYIKSNGGLETEEAYPYTG 233
Query: 218 FNGVCG---------------------TQLDHAVTIV-----------GF---------- 235
NG+C +L +AV +V GF
Sbjct: 234 KNGLCKFSSQNVGVKVTDSVNITLGAEDELKYAVALVRPVSVAFEVVKGFKQYKSGVYTS 293
Query: 236 ---GTT---------------EDGANYWLIKNSWGNTWGDAGYMKIVRDEGLCGIGTRSS 277
GTT E G +WLIKNSWG WGD Y K+ +CGI T +S
Sbjct: 294 TECGTTPMDVNHAVLAVGYGVEYGVPFWLIKNSWGADWGDNAYFKMEMGNDMCGIATCAS 353
Query: 278 YPL 280
YP+
Sbjct: 354 YPV 356
>gi|390368662|ref|XP_780781.2| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
Length = 333
Score = 146 bits (369), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 82/202 (40%), Positives = 123/202 (60%), Gaps = 9/202 (4%)
Query: 19 FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
++ + L+ C VVSS S E +W +HG+ Y + E+ R I+++NL+ +
Sbjct: 3 YLSVLLVAVC---VVSSLSMSFTDFDEDWNQWKNEHGKRYLSDEEEASRKLIWEKNLDIV 59
Query: 79 EKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD 135
K N + G+ TY LG NQF+DL N+EF A+ TG+++ S ++ STF N ++
Sbjct: 60 IKHNLKYDLGHFTYALGMNQFADLQNEEFVAMMTGFRVNGTS-KAAKGSTFLPSN-NVDK 117
Query: 136 VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG 195
+P ++DWR KG VTP+K+Q +CG CWAF+A ++EG ++G L+ LSEQ L+DCS
Sbjct: 118 LPKTVDWRTKGYVTPVKDQGQCGSCWAFSATGSLEGQQFKKTGKLVSLSEQNLVDCSYR- 176
Query: 196 NNGCLGGSREKAFAYIIQNQGI 217
N GC GG ++AF YII GI
Sbjct: 177 NYGCHGGFMDRAFQYIIDAGGI 198
Score = 74.7 bits (182), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 33/68 (48%), Positives = 48/68 (70%), Gaps = 3/68 (4%)
Query: 216 GIFN--GVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGI 272
G++N G T+L HAV +VG+GTT DG +YW++KNSW TWG GY+ + R+ + CGI
Sbjct: 265 GVYNEPGCSTTRLGHAVLVVGYGTTSDGTDYWIVKNSWAKTWGMNGYLWMSRNKDNQCGI 324
Query: 273 GTRSSYPL 280
+ +SYP+
Sbjct: 325 ASEASYPM 332
>gi|242074968|ref|XP_002447420.1| hypothetical protein SORBIDRAFT_06g000780 [Sorghum bicolor]
gi|241938603|gb|EES11748.1| hypothetical protein SORBIDRAFT_06g000780 [Sorghum bicolor]
Length = 381
Score = 146 bits (369), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 76/192 (39%), Positives = 119/192 (61%), Gaps = 16/192 (8%)
Query: 39 HEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSD 98
HE ++E WMA HGRSY EK R +I+++N+++IE N++ +T+ G NQF+D
Sbjct: 51 HELLMMERFHAWMAAHGRSYPTAEEKLRRFQIYRDNVKFIEAINRDTTKTFTCGENQFTD 110
Query: 99 LTNDEFRALYTGYKMPS-PSHRSTTSSTFKYQNLSM-----------TDVPTSLDWRDKG 146
LT+ EF A YT S P S++ T + +++ TD+P +DWR++
Sbjct: 111 LTHQEFLARYTMASHDSVPLDLSSSVITTRAGDITESDSGTTMQVEDTDLPEHVDWREQD 170
Query: 147 AVTPIKNQKE-CGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSRE 205
AVTP++NQ + C CW FA+VA +E KI++G+L++LSEQQ++DC+ C GG+ +
Sbjct: 171 AVTPVQNQLQGCHACWVFASVATIESANKIKNGDLLKLSEQQIVDCTA---EKCGGGTLQ 227
Query: 206 KAFAYIIQNQGI 217
+AF Y+ +N GI
Sbjct: 228 EAFKYVQKNGGI 239
Score = 57.0 bits (136), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 35/93 (37%), Positives = 50/93 (53%), Gaps = 12/93 (12%)
Query: 198 GCLGGSREKAFAYIIQNQGIFNG---VCGTQLDHAVTIVGFGTTED-GANYWLIKNSWGN 253
L + + AFAY GI++G L+HA+ IVG+G E G YW+ KNSWG
Sbjct: 290 AVLFDAHDPAFAYY--KGGIYSGGQPRTRYVLNHAMAIVGYGKNESTGQKYWIAKNSWGT 347
Query: 254 TWGDAGYMKIVRD-----EGLCGIGTRSS-YPL 280
WGD GY+ I +D +G+ G+ + YP+
Sbjct: 348 GWGDGGYVYIAKDMPDKPQGVQGLVAKYPWYPI 380
>gi|299507656|gb|ADJ21807.1| cathepsin L [Oplegnathus fasciatus]
Length = 336
Score = 146 bits (369), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 83/203 (40%), Positives = 117/203 (57%), Gaps = 10/203 (4%)
Query: 19 FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
+ + +L C S +S+ S Q + E + W + H + Y E E+ R ++++NL+ I
Sbjct: 1 MLPVAVLAVCLSAALSAPSLDPQ-LDEHWDLWKSWHTKKYH-EKEEGWRRMVWEKNLKKI 58
Query: 79 EKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD 135
E N E G TY+LG N F D+T++EFR + GYK S R S F N +
Sbjct: 59 ELHNLEHSMGEHTYRLGMNHFGDMTHEEFRQIMNGYK--RKSERKFKGSLFMEPNF--LE 114
Query: 136 VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-TN 194
P S+DWRD G VTP+K+Q +CG CWAF+ A+EG ++G L+ LSEQ L+DCS
Sbjct: 115 APRSVDWRDNGYVTPVKDQGQCGSCWAFSTTGAMEGQHFRKTGKLVSLSEQNLVDCSRPE 174
Query: 195 GNNGCLGGSREKAFAYIIQNQGI 217
GN GC GG ++AF YI NQG+
Sbjct: 175 GNEGCNGGLMDQAFQYIKDNQGL 197
Score = 65.5 bits (158), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 29/60 (48%), Positives = 39/60 (65%), Gaps = 4/60 (6%)
Query: 225 QLDHAVTIVGFG---TTEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 280
+LDH V +VG+G DG YW++KNSW WGD GY+ + +D + CGI T +SYPL
Sbjct: 276 ELDHGVLVVGYGFEGEDVDGKKYWIVKNSWSEKWGDKGYIYMAKDRKNHCGIATAASYPL 335
>gi|49387634|dbj|BAD25828.1| putative cysteine proteinase [Oryza sativa Japonica Group]
gi|49388888|dbj|BAD26098.1| putative cysteine proteinase [Oryza sativa Japonica Group]
Length = 358
Score = 146 bits (369), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 86/253 (33%), Positives = 129/253 (50%), Gaps = 37/253 (14%)
Query: 16 TPMFIIITLLVSC----ASQVVSSRSTH-------EQSVVEIHEKWMAQHGRSYKDELEK 64
+P + + LL SC A+ ++ +R+T + +++ W H RSY E
Sbjct: 5 SPPVLTLALLASCGALLATSMLPARATAGSCLDVGDMVMMDRFRAWQGAHNRSYPSAEEA 64
Query: 65 EMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKM---PSPSHRST 121
R +++ N E+I+ N G+ TY+L N+F+DLT +EF A YTGY P T
Sbjct: 65 LQRFDVYRRNAEFIDAVNLRGDLTYQLAENEFADLTEEEFLATYTGYYAGDGPVDDSVIT 124
Query: 122 T-----SSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKE-CGCCWAFAAVAAVEGITKI 175
T ++F Y+ DVP S+DWR +GAV P K+Q C CWAF A +E + I
Sbjct: 125 TGAGDVDASFSYR----VDVPASVDWRAQGAVVPPKSQTSTCSSCWAFVTAATIESLNMI 180
Query: 176 RSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIFN----------GVC--G 223
++G L+ LSEQQL+DC + + GC GS +A+ ++++N G+ G C
Sbjct: 181 KTGKLVSLSEQQLVDCDSY-DGGCNLGSYGRAYKWVVENGGLTTEADYPYTARRGPCNRA 239
Query: 224 TQLDHAVTIVGFG 236
HA I GFG
Sbjct: 240 KSAHHAAKITGFG 252
Score = 84.3 bits (207), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 37/68 (54%), Positives = 49/68 (72%), Gaps = 4/68 (5%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGT-TEDGANYWLIKNSWGNTWGDAGYMKIVRD---EGLCG 271
G++ G CGT+L HAVT+VG+GT GA YW IKNSWG +WG+ GY++I+RD GLCG
Sbjct: 287 GVYTGPCGTRLAHAVTVVGYGTDASSGAKYWTIKNSWGQSWGERGYIRILRDVGGPGLCG 346
Query: 272 IGTRSSYP 279
+ +YP
Sbjct: 347 VTLDIAYP 354
>gi|162459488|ref|NP_001105571.1| maize insect resistance1 precursor [Zea mays]
gi|5731354|gb|AAB70820.2| cysteine protease Mir1 [Zea mays]
Length = 398
Score = 146 bits (369), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 73/205 (35%), Positives = 121/205 (59%), Gaps = 21/205 (10%)
Query: 30 SQVVSSRSTHEQSVVEIHEKWMAQHGR--------------SYKDELEKEMRLKIFKENL 75
++V + ++ V ++E W ++HGR ++E ++ +RL++F++NL
Sbjct: 37 TRVPAPAERADEEVRRMYEAWKSKHGRGGSSNDDCDMAPGDDEQEEEDRRLRLEVFRDNL 96
Query: 76 EYIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
YI+ N E G T++LG F+DLT +E+R G++ + S + +
Sbjct: 97 RYIDAHNAEADAGLHTFRLGLTPFADLTLEEYRGRVLGFRARGRRSGARYGSGYSVRG-- 154
Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
D+P ++DWR GAVT +K+Q++CG CWAF+AVAA+EG+ I +GNL+ LSEQ+++DC
Sbjct: 155 -GDLPDAIDWRQLGAVTEVKDQQQCGGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCD 213
Query: 193 TNGNNGCLGGSREKAFAYIIQNQGI 217
++GC GG E AF ++I N GI
Sbjct: 214 AQ-DSGCDGGQMENAFRFVIGNGGI 237
Score = 87.4 bits (215), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 47/122 (38%), Positives = 71/122 (58%), Gaps = 18/122 (14%)
Query: 166 VAAVEGITKIRSGNLIQLSEQ---QLLDCSTNGNNGCLGGSREKAFAYIIQNQGIFNGVC 222
VA ++G+ ++ S N L E Q + + + + +AF + + GIFNG C
Sbjct: 262 VATIDGLVEVASNNETALQEAVAIQPVSVAIDASG--------RAFQHY--SSGIFNGPC 311
Query: 223 GTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMK----IVRDEGLCGIGTRSSY 278
GT LDH VT VG+G+ E G +YW++KNSW +WG+AGY++ + R G CGI +SY
Sbjct: 312 GTSLDHGVTAVGYGS-ESGKDYWIVKNSWSASWGEAGYIRMRRNVPRPTGKCGIAMDASY 370
Query: 279 PL 280
P+
Sbjct: 371 PV 372
>gi|75994630|gb|ABA33836.1| cysteine protease Mir1 [Zea mays subsp. parviglumis]
Length = 248
Score = 146 bits (369), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 73/187 (39%), Positives = 117/187 (62%), Gaps = 20/187 (10%)
Query: 46 IHEKWMAQHGRSYK-----------DELEKEMRLKIFKENLEYIEKANKE---GNRTYKL 91
++E W ++HGR + +E ++ +RL++F++NL YI+K N E G T++L
Sbjct: 1 MYEAWKSKHGRGGRSNDDCDMAPGDEEEDRRLRLEVFRDNLRYIDKHNAEADAGLHTFRL 60
Query: 92 GTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDV-PTSLDWRDKGAVTP 150
G F+DLT DE+R G++ + + ++ ++ D+ P ++DWR GAVT
Sbjct: 61 GLTPFADLTLDEYRGRVLGFR----ARGRRSGHGYRSRHPRGGDLLPDAIDWRQLGAVTE 116
Query: 151 IKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAY 210
+K+Q++CG CWAF+AVAA+EGI I +GNL+ LSEQ+++DC ++GC GG E AF +
Sbjct: 117 VKDQQQCGGCWAFSAVAAIEGINAIATGNLVSLSEQEIIDCDAQ-DSGCDGGQMENAFLF 175
Query: 211 IIQNQGI 217
+I N GI
Sbjct: 176 VIGNGGI 182
>gi|357216861|gb|AET71138.1| cysteine peptidase isoform b [Sphenophorus levis]
Length = 324
Score = 146 bits (369), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 84/229 (36%), Positives = 125/229 (54%), Gaps = 26/229 (11%)
Query: 19 FIIITLLV-SCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
FI+ +LLV + ++ ++ H QS + +HG++YK++ E+ R IF+ENL
Sbjct: 4 FILASLLVVAVSATLLKEDGAHFQS-------FKLKHGKTYKNQAEETKRFAIFRENLRK 56
Query: 78 IEKAN---KEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT 134
IE N K+G +Y G N+F+D+T EF+A+ PS +T + +Q
Sbjct: 57 IEAHNAEYKQGIHSYTQGINKFADMTRAEFKAMLATQVKTKPSIVATKT----FQLADGV 112
Query: 135 DVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN 194
VP S+DWR + VTPIK+Q +CG CWAFA V + EG + +G L + SEQQL+DC+T+
Sbjct: 113 SVPESIDWRSRNVVTPIKDQAQCGSCWAFAVVGSTEGAYALSTGKLTRFSEQQLVDCTTD 172
Query: 195 GNNGCLGGSREKAFAYIIQNQGI----------FNGVCGTQLDHAVTIV 233
N GC GG + F Y IQ G+ ++G C + VT V
Sbjct: 173 LNYGCDGGYLDDTFPY-IQTNGLELESDYPYTGYDGYCSYESSKVVTKV 220
Score = 68.9 bits (167), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 40/116 (34%), Positives = 63/116 (54%), Gaps = 16/116 (13%)
Query: 172 ITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQN-QGIFNGVCGTQ----- 225
+TK+ S + +EQ LL+ +G + A A + Q F+G+ +
Sbjct: 217 VTKVSSYVSVPANEQALLEA--------VGTAGPVAIAINADDLQFYFSGIIDDKYCDPE 268
Query: 226 -LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDEGLCGIGTRSSYPL 280
LDH V VG+ +E+G +YWLIKNSWG WG++GY + +R + +CG+ + YPL
Sbjct: 269 YLDHGVLAVGY-DSENGRDYWLIKNSWGADWGESGYFRFLRGQNICGVKEDAVYPL 323
>gi|325303202|tpg|DAA34687.1| TPA_inf: cathepsin L-like cysteine proteinase B [Amblyomma
variegatum]
Length = 337
Score = 146 bits (369), Expect = 9e-33, Method: Compositional matrix adjust.
Identities = 88/245 (35%), Positives = 134/245 (54%), Gaps = 18/245 (7%)
Query: 19 FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
F+++ L CA+ + ++ TH++ V + A HG+ Y+ E E+ RLKI+ EN I
Sbjct: 4 FVVLCFL--CAA-MTAAAITHQELVGAEWSAFKALHGKEYQSETEEYYRLKIYMENRMMI 60
Query: 79 EKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD 135
+ N++ +YKL N++ D+ + EF + G++ S S + + +
Sbjct: 61 ARHNEKYANNKVSYKLAMNEYGDMLHHEFVSTRNGFRRDYRSKPRQGSFYIEPEGIEDKH 120
Query: 136 VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN- 194
+P ++DWR KGAVTP+KNQ +CG CWAF+ ++EG +SG+++ LSEQ L+DCST
Sbjct: 121 LPKTVDWRKKGAVTPVKNQGQCGSCWAFSTTGSLEGQHFRKSGDMVSLSEQNLVDCSTAF 180
Query: 195 GNNGCLGGSREKAFAYIIQNQGI-------FNGVCGT----QLDHAVTIVGFGTTEDGAN 243
GNNGC GG + AF YI N GI +NG GT + D T GF +G
Sbjct: 181 GNNGCEGGLMDNAFKYIKANGGIDTEKSYPYNGTDGTCHFKKSDVGATDTGFVDIPEGNE 240
Query: 244 YWLIK 248
+ L K
Sbjct: 241 HLLKK 245
Score = 73.6 bits (179), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 34/75 (45%), Positives = 53/75 (70%), Gaps = 4/75 (5%)
Query: 209 AYIIQNQGIFNGV-CGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD 266
++ +QG+++ C ++ LDH V +VG+GT +D +YWL+KNSWG TWGD GY+ + R+
Sbjct: 263 SFQFYSQGVYDEPECSSENLDHGVLVVGYGTKDD-QDYWLVKNSWGTTWGDGGYIYMTRN 321
Query: 267 -EGLCGIGTRSSYPL 280
+ CGI + +SYPL
Sbjct: 322 KDNQCGIASSASYPL 336
>gi|225579644|gb|ACN93991.1| cathepsin L [Dicentrarchus labrax]
Length = 316
Score = 146 bits (369), Expect = 9e-33, Method: Compositional matrix adjust.
Identities = 83/200 (41%), Positives = 118/200 (59%), Gaps = 12/200 (6%)
Query: 23 TLLVSCASQVVSSRSTHEQSVVEIH-EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKA 81
+L C S +S+ S Q ++ H + W + H + Y E E+ R ++++NL+ IE
Sbjct: 1 AVLAVCLSAALSAPSLDPQ--LDDHWDLWKSWHTKKYH-EKEEGWRRMVWEKNLKKIELH 57
Query: 82 NKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPT 138
N E G TY+LG N F D+T++EFR L GYK+ + R + S F N + P
Sbjct: 58 NLEHSMGKHTYRLGMNHFGDMTHEEFRQLMNGYKLKAA--RKFSGSLFMEPNF--LEAPR 113
Query: 139 SLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-TNGNN 197
S+DWRD G VTP+K+Q +CG CWAF+ A+EG +SG L+ LSEQ L+DCS GN
Sbjct: 114 SVDWRDNGYVTPVKDQGQCGSCWAFSTTGALEGQHFRKSGKLVSLSEQNLVDCSRPEGNE 173
Query: 198 GCLGGSREKAFAYIIQNQGI 217
GC GG ++AF Y+ NQG+
Sbjct: 174 GCNGGLMDQAFQYVKDNQGL 193
Score = 50.8 bits (120), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 20/45 (44%), Positives = 28/45 (62%), Gaps = 3/45 (6%)
Query: 225 QLDHAVTIVGFG---TTEDGANYWLIKNSWGNTWGDAGYMKIVRD 266
+LDH V +VG+G DG YW++ NSW WGD GY+ + +D
Sbjct: 272 ELDHGVLVVGYGFEGQDVDGKKYWIVNNSWSEKWGDEGYIYLAKD 316
>gi|342305188|dbj|BAK55648.1| cathepsin L [Oplegnathus fasciatus]
Length = 336
Score = 146 bits (369), Expect = 9e-33, Method: Compositional matrix adjust.
Identities = 83/203 (40%), Positives = 117/203 (57%), Gaps = 10/203 (4%)
Query: 19 FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
+ + +L C S +S+ S Q + E + W + H + Y E E+ R ++++NL+ I
Sbjct: 1 MLPVAVLAVCLSAALSAPSLDPQ-LDEHWDLWKSWHTKKYH-EKEEGWRRMVWEKNLKKI 58
Query: 79 EKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD 135
E N E G TY+LG N F D+T++EFR + GYK S R S F N +
Sbjct: 59 ELHNLEHSMGEHTYRLGMNHFGDMTHEEFRQIMYGYK--RKSERKFKGSLFMEPNF--LE 114
Query: 136 VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-TN 194
P S+DWRD G VTP+K+Q +CG CWAF+ A+EG ++G L+ LSEQ L+DCS
Sbjct: 115 APRSVDWRDNGYVTPVKDQGQCGSCWAFSTTGAMEGQHFRKTGKLVSLSEQNLVDCSRPE 174
Query: 195 GNNGCLGGSREKAFAYIIQNQGI 217
GN GC GG ++AF YI NQG+
Sbjct: 175 GNEGCNGGLMDQAFQYIKDNQGL 197
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 29/60 (48%), Positives = 39/60 (65%), Gaps = 4/60 (6%)
Query: 225 QLDHAVTIVGFG---TTEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 280
+LDH V +VG+G DG YW++KNSW WGD GY+ + +D + CGI T +SYPL
Sbjct: 276 ELDHGVLVVGYGFEGEDVDGKKYWIVKNSWSEKWGDKGYIYMAKDRKNHCGIATAASYPL 335
>gi|32396018|gb|AAP41846.1| cysteine protease [Anthurium andraeanum]
Length = 502
Score = 146 bits (369), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 75/179 (41%), Positives = 105/179 (58%), Gaps = 7/179 (3%)
Query: 45 EIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYK----LGTNQFSDLT 100
E+ E+WM +H + Y EK R F NL ++ K N EG R +G N F+DL+
Sbjct: 49 ELFERWMEKHRKVYAHPGEKARRYANFLSNLAFVRKRNAEGRRAPSSGQGVGMNVFADLS 108
Query: 101 NDEFRALYTG--YKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECG 158
N+EFR +Y+ + + R + + ++ D P SLDWR +GAVT +KNQ +CG
Sbjct: 109 NEEFREVYSSRVLRKKAAEGRGARRRAGEGRVVAGCDAPASLDWRKRGAVTAVKNQGDCG 168
Query: 159 CCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
CWAF++ A+EGI I +G LI LSEQ+L+DC T N GC GG + AF ++I N GI
Sbjct: 169 SCWAFSSTGAMEGINAITTGELISLSEQELVDCDTT-NEGCDGGYMDYAFEWVINNGGI 226
Score = 63.9 bits (154), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 32/71 (45%), Positives = 44/71 (61%), Gaps = 8/71 (11%)
Query: 216 GIFNGVCG---TQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDEGL--- 269
GI++G C +DHAV +VG+G + G +YW++KNSWG WG GY+ I R+ GL
Sbjct: 293 GIYDGDCSGNPDDIDHAVLVVGYGQ-QGGTDYWIVKNSWGTDWGMQGYIYIRRNTGLPYG 351
Query: 270 -CGIGTRSSYP 279
C I +SYP
Sbjct: 352 VCAIDAMASYP 362
>gi|363737841|ref|XP_001232765.2| PREDICTED: pro-cathepsin H [Gallus gallus]
Length = 327
Score = 146 bits (369), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 99/300 (33%), Positives = 133/300 (44%), Gaps = 80/300 (26%)
Query: 53 QHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYK 112
QHGR Y + E E RL++F N +IE N GN ++++ NQFSD+T EF+ LY
Sbjct: 33 QHGRRY-EAGEYERRLRVFVGNKRHIEGHNA-GNSSFQMALNQFSDMTFAEFKKLYL--- 87
Query: 113 MPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGA-VTPIKNQKECGCCWAFAAVAAVEG 171
P + S T F S P ++DWR KG VTP+KNQ CG CW F+ +E
Sbjct: 88 WSEPQNCSATRGNFL---RSDGPCPEAVDWRKKGNFVTPVKNQGPCGSCWTFSTTGCLES 144
Query: 172 ITKIRSGNLIQLSEQQLLDCSTNGNN-GCLGGSREKAFAYIIQNQGIF----------NG 220
I +G L+ L+EQQL+DC+ NN GC GG +AF YI+ N+G+ NG
Sbjct: 145 AIAIATGKLLSLAEQQLVDCAQAFNNHGCSGGLPSQAFEYILYNKGLMGEDAYPYRAQNG 204
Query: 221 VCGTQLDHAVTIV--------------------------GFGTTEDGANY---------- 244
C Q D A+ V F T D +Y
Sbjct: 205 TCKFQPDKAIAFVKDVINITQYDEAGMVEAVGKHNPVSFAFEVTSDFMHYRKGVYSNPRC 264
Query: 245 ------------------------WLIKNSWGNTWGDAGYMKIVRDEGLCGIGTRSSYPL 280
W++KNSWG WG GY I R + +CG+ +SYP+
Sbjct: 265 EHTPDKVNHAVLAVGYGEEDGRPYWIVKNSWGPLWGMDGYFLIERGKNMCGLAACASYPV 324
>gi|345320664|ref|XP_001521690.2| PREDICTED: cathepsin L1-like [Ornithorhynchus anatinus]
Length = 388
Score = 146 bits (368), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 90/222 (40%), Positives = 128/222 (57%), Gaps = 15/222 (6%)
Query: 10 SFKINTTP----MFIIITLLVSCASQVVSSRSTHEQSVVEIH-EKWMAQHGRSYKDELEK 64
S +++ P M +++ LL C VS+ S ++ H E W H +SY + E+
Sbjct: 39 SLQVSPGPWGQAMKLLVCLLSLCWGLAVSAPLG--DSELDKHWELWKNWHQKSYH-KAEE 95
Query: 65 EMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRST 121
R +++ENL+ IE N E G TY+LG NQF DLTN+EF+ + + S +R
Sbjct: 96 GWRRMVWEENLKVIELHNLEQSLGLHTYQLGMNQFGDLTNEEFQQMLISERHFSEGNRIN 155
Query: 122 TSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLI 181
S+ + ++ VPTS+DWRD G VTP+KNQ CG CWAF+ A+EG +SG L+
Sbjct: 156 GSA---FLEVNYVQVPTSVDWRDHGYVTPVKNQGHCGSCWAFSTTGALEGQLFRKSGRLV 212
Query: 182 QLSEQQLLDCS-TNGNNGCLGGSREKAFAYIIQNQGIFNGVC 222
LSEQ L+DCS GN GC GG + AF YI++N+GI + C
Sbjct: 213 SLSEQNLVDCSWQQGNQGCNGGIVDFAFQYILENRGIDSEDC 254
Score = 67.4 bits (163), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 31/60 (51%), Positives = 40/60 (66%), Gaps = 4/60 (6%)
Query: 225 QLDHAVTIVGFG---TTEDGANYWLIKNSWGNTWGDAGYMKIVRDEG-LCGIGTRSSYPL 280
+L+HAV +VG+G E G YW++KNSWG WGD GY + +D G CGI T +SYPL
Sbjct: 328 RLNHAVLVVGYGYEGEDEAGKKYWIVKNSWGKQWGDHGYFYLSKDRGNHCGIATTASYPL 387
>gi|328872971|gb|EGG21338.1| cysteine proteinase 5 precursor [Dictyostelium fasciculatum]
Length = 358
Score = 146 bits (368), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 77/201 (38%), Positives = 121/201 (60%), Gaps = 9/201 (4%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
+F+I+ +L S+ S EQ + WM +H RSY E R ++K+N++Y
Sbjct: 6 IFLIVLMLA-----FASASSYSEQQYRDSFTNWMQKHSRSYASH-EFNTRYSVYKKNMDY 59
Query: 78 IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
+ + N +G+ T LG N +D+TN E++A+Y G K + + + S++ + + +P
Sbjct: 60 VNEWNSKGSETV-LGLNSLADMTNQEYQAIYLGTKTDATARLAAASASASFGKVQGA-LP 117
Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GN 196
S+DW +GAVT +KNQ +CG CW+F+A + EG +I + NL+ LSEQ L+DCS++ GN
Sbjct: 118 ASIDWVAQGAVTQVKNQGQCGSCWSFSATGSTEGAHQISTSNLVALSEQNLIDCSSSYGN 177
Query: 197 NGCLGGSREKAFAYIIQNQGI 217
+GC GG + AF YII N GI
Sbjct: 178 DGCNGGLMDNAFKYIIANGGI 198
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 28/84 (33%), Positives = 40/84 (47%), Gaps = 27/84 (32%)
Query: 224 TQLDHAVTIVGFGTTE--------------------------DGANYWLIKNSWGNTWGD 257
T LDH V +VG+GT GA +W +KNSWG WG
Sbjct: 274 TNLDHGVLVVGYGTASANGSSDSDSSAASQSSSSESSDDQATQGAQFWKVKNSWGPEWGL 333
Query: 258 AGYMKIVRD-EGLCGIGTRSSYPL 280
+GY+++ R+ + CGI T +S P+
Sbjct: 334 SGYIQMARNRDNNCGIATTASQPI 357
>gi|413953050|gb|AFW85699.1| hypothetical protein ZEAMMB73_033873 [Zea mays]
Length = 361
Score = 146 bits (368), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 73/210 (34%), Positives = 121/210 (57%), Gaps = 12/210 (5%)
Query: 20 IIITLLVSCASQVVSSRSTHEQSVVEIHEK---WMAQHGRSYKDELEKEMRLKIFKENLE 76
+ + +L +C+ + + + + + + E+ W A++ R+Y E + R ++ ENL
Sbjct: 10 LALVMLFACSLLLAGTAFSDDTIAIPLLERFKAWQAEYNRTYATPEEFQQRFMVYSENLR 69
Query: 77 YIEKANK-EGNRTYKLGTNQFSDLTNDEFRALYT---GYKMPSPSHRSTTSSTFKYQNLS 132
+I+ N+ +Y+LG NQF+DLT +EF+ Y + P+ T +S
Sbjct: 70 FIKTMNQLSTGSSYELGENQFTDLTEEEFKDTYLMKLDEQPPAAEAMPPIVGTMSTAGMS 129
Query: 133 MTD----VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQL 188
D P S+DWR KGAVTP+KNQ++CG CWAFA VA++EG+ +I++G L+ LSEQ++
Sbjct: 130 NGDNTGEAPNSVDWRTKGAVTPVKNQQQCGSCWAFATVASIEGVHQIKTGRLVSLSEQEI 189
Query: 189 LDCSTNGNN-GCLGGSREKAFAYIIQNQGI 217
+DC GN+ GC GG A ++ +N G+
Sbjct: 190 VDCDRGGNDHGCRGGYPRSAMEWVTRNGGL 219
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 35/115 (30%), Positives = 58/115 (50%), Gaps = 11/115 (9%)
Query: 174 KIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIFNGVCGTQ-LDHAVTI 232
+IR +Q + L+ + G + +AF + +G+F+G C T ++HAVT+
Sbjct: 245 RIRGYQAVQRKNEAELERAVAGRPVAVVIDASRAFQFY--KRGVFSGPCNTTTVNHAVTV 302
Query: 233 VGFGTTEDGA----NYWLIKNSWGNTWGDAGY----MKIVRDEGLCGIGTRSSYP 279
VG+G+ + YW++KNSWG WG+ GY ++ EG+C I P
Sbjct: 303 VGYGSAGSDSGGGRKYWIVKNSWGQRWGENGYVRMARRVRAREGMCAIAIEPLLP 357
>gi|413953051|gb|AFW85700.1| hypothetical protein ZEAMMB73_033873 [Zea mays]
Length = 359
Score = 146 bits (368), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 73/210 (34%), Positives = 121/210 (57%), Gaps = 12/210 (5%)
Query: 20 IIITLLVSCASQVVSSRSTHEQSVVEIHEK---WMAQHGRSYKDELEKEMRLKIFKENLE 76
+ + +L +C+ + + + + + + E+ W A++ R+Y E + R ++ ENL
Sbjct: 10 LALVMLFACSLLLAGTAFSDDTIAIPLLERFKAWQAEYNRTYATPEEFQQRFMVYSENLR 69
Query: 77 YIEKANK-EGNRTYKLGTNQFSDLTNDEFRALYT---GYKMPSPSHRSTTSSTFKYQNLS 132
+I+ N+ +Y+LG NQF+DLT +EF+ Y + P+ T +S
Sbjct: 70 FIKTMNQLSTGSSYELGENQFTDLTEEEFKDTYLMKLDEQPPAAEAMPPIVGTMSTAGMS 129
Query: 133 MTD----VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQL 188
D P S+DWR KGAVTP+KNQ++CG CWAFA VA++EG+ +I++G L+ LSEQ++
Sbjct: 130 NGDNTGEAPNSVDWRTKGAVTPVKNQQQCGSCWAFATVASIEGVHQIKTGRLVSLSEQEI 189
Query: 189 LDCSTNGNN-GCLGGSREKAFAYIIQNQGI 217
+DC GN+ GC GG A ++ +N G+
Sbjct: 190 VDCDRGGNDHGCRGGYPRSAMEWVTRNGGL 219
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 36/116 (31%), Positives = 60/116 (51%), Gaps = 11/116 (9%)
Query: 174 KIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIFNGVCGTQ-LDHAVTI 232
+IR +Q + L+ + G + +AF + +G+F+G C T ++HAVT+
Sbjct: 245 RIRGYQAVQRKNEAELERAVAGRPVAVVIDASRAFQFY--KRGVFSGPCNTTTVNHAVTV 302
Query: 233 VGFGTTEDGA----NYWLIKNSWGNTWGDAGY----MKIVRDEGLCGIGTRSSYPL 280
VG+G+ + YW++KNSWG WG+ GY ++ EG+C I YP+
Sbjct: 303 VGYGSAGSDSGGGRKYWIVKNSWGQRWGENGYVRMARRVRAREGMCAIAIEPYYPV 358
>gi|330842502|ref|XP_003293216.1| hypothetical protein DICPUDRAFT_95775 [Dictyostelium purpureum]
gi|325076482|gb|EGC30264.1| hypothetical protein DICPUDRAFT_95775 [Dictyostelium purpureum]
Length = 376
Score = 146 bits (368), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 82/198 (41%), Positives = 118/198 (59%), Gaps = 19/198 (9%)
Query: 29 ASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRT 88
A ++S+ T +Q E W +HG+ Y+++ E R IFK+N++Y+ N +G+ T
Sbjct: 17 AGYKINSKFTEQQYKTAFTE-WTIKHGKQYENQ-EFGRRYGIFKDNMDYVHDWNSKGSET 74
Query: 89 YKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDV--------PTSL 140
LG N F+DLTN E++ Y G + S HR Y ++ ++ PTS+
Sbjct: 75 V-LGLNIFADLTNLEYQKYYLGTHVNSLLHRG-------YDGRALEEIFGSDDGRNPTSV 126
Query: 141 DWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-TNGNNGC 199
DW KGAVTPIK+Q +CG CW+F+ +VEG +I++G L+ LSEQ L+DCS GN GC
Sbjct: 127 DWNKKGAVTPIKDQGQCGSCWSFSTTGSVEGAHQIKTGKLVSLSEQNLVDCSGAEGNLGC 186
Query: 200 LGGSREKAFAYIIQNQGI 217
GG + AF YIIQN+GI
Sbjct: 187 DGGLMDNAFIYIIQNKGI 204
Score = 47.8 bits (112), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 28/95 (29%), Positives = 46/95 (48%), Gaps = 37/95 (38%)
Query: 224 TQLDHAVTIVGFG---------------------------------TTEDG---ANYWLI 247
T+LDH V +VG+G +++DG + YWL+
Sbjct: 282 TELDHGVLVVGYGVAKKDENNASPNKHQIRIRHNDDFGIDEIVTDSSSDDGRKTSQYWLV 341
Query: 248 KNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPLA 281
KNSWG +WG G++++ ++ + CGI + +SYP A
Sbjct: 342 KNSWGVSWGMQGFIQMSKNRKNNCGIASCASYPTA 376
>gi|76574392|gb|ABA46966.1| cysteine protease Mir1 [Zea diploperennis]
gi|76574396|gb|ABA46968.1| cysteine protease Mir1 [Zea diploperennis]
gi|76574398|gb|ABA46969.1| cysteine protease Mir1 [Zea diploperennis]
gi|76574406|gb|ABA46973.1| cysteine protease Mir1 [Zea diploperennis]
Length = 250
Score = 146 bits (368), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 72/186 (38%), Positives = 113/186 (60%), Gaps = 16/186 (8%)
Query: 46 IHEKWMAQHGRSYK-----------DELEKEMRLKIFKENLEYIEKANKE---GNRTYKL 91
++E W ++HGR + +E ++ +RL++F++NL YI+K N E G T++L
Sbjct: 1 MYEAWKSRHGRGGRNDCDMAPGDEEEEDDRRLRLEVFRDNLRYIDKHNAEADAGLHTFRL 60
Query: 92 GTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPI 151
G F+DLT +E+R G++ Y+ D+P ++DWR GAVT +
Sbjct: 61 GLTPFADLTLEEYRGRVLGFRARGRRSGVRYGHGHGYRARG-GDLPDAIDWRQLGAVTEV 119
Query: 152 KNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYI 211
K+Q++CG CWAF+AVAA+EG+ I +GNL+ LSEQ+++DC ++GC GG E AF ++
Sbjct: 120 KDQQQCGGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ-DSGCDGGQMENAFRFV 178
Query: 212 IQNQGI 217
I N GI
Sbjct: 179 IGNGGI 184
>gi|302781881|ref|XP_002972714.1| hypothetical protein SELMODRAFT_98707 [Selaginella moellendorffii]
gi|300159315|gb|EFJ25935.1| hypothetical protein SELMODRAFT_98707 [Selaginella moellendorffii]
Length = 446
Score = 146 bits (368), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 74/177 (41%), Positives = 107/177 (60%), Gaps = 9/177 (5%)
Query: 47 HEKWMAQHGRSYKDELE-KEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFR 105
+ W A+ G+ + R + FKEN YIE+ N+ G +Y+LG NQFSDLT++EFR
Sbjct: 13 YASWCAKFGKECASSNSLGDRRFETFKENFRYIEEHNRAGKHSYRLGLNQFSDLTSEEFR 72
Query: 106 ALYTGYK---MPSPSHRSTTSSTFK--YQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCC 160
+ G + + SP + S + +QN+ D+P S+DWR GAVT K+Q CG C
Sbjct: 73 QRFLGLRPDLIDSPVLKMPRDSDIEEGFQNV---DLPASVDWRKHGAVTAPKDQGSCGGC 129
Query: 161 WAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
WAFA A+EGI +I +G L+ LSEQ+L+DC + GC GG E A+ +I++N G+
Sbjct: 130 WAFATTGAIEGINQIVTGQLMSLSEQELIDCDKKADKGCDGGLMENAYQFIVENGGL 186
Score = 82.0 bits (201), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 46/122 (37%), Positives = 70/122 (57%), Gaps = 18/122 (14%)
Query: 166 VAAVEGITKIRSGN---LIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIFNGVC 222
V A++G I G+ L++ +Q + + G + K F + G+F G C
Sbjct: 210 VVAIDGYEAIPDGDEQALLRAVAKQPVSVAIEGAS--------KDFQHYAS--GVFTGHC 259
Query: 223 GTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE----GLCGIGTRSSY 278
G +++H V IVG+GT EDG +YW++KNSW TWGD G++K+ R+ GLC I T +SY
Sbjct: 260 GEEINHGVLIVGYGT-EDGLDYWIVKNSWAATWGDGGFVKMQRNTGKRGGLCSINTLASY 318
Query: 279 PL 280
P+
Sbjct: 319 PV 320
>gi|194701748|gb|ACF84958.1| unknown [Zea mays]
gi|414589103|tpg|DAA39674.1| TPA: thiol protease SEN102 [Zea mays]
Length = 374
Score = 146 bits (368), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 85/210 (40%), Positives = 115/210 (54%), Gaps = 26/210 (12%)
Query: 29 ASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNR- 87
A + S ST + S++E ++W A + +SY E+ R ++ N+ YIE N E
Sbjct: 32 AGDMERSMSTDDSSMIERFQRWKAAYNKSYATVAEERRRFRVCARNMAYIEATNAEAEAA 91
Query: 88 --TYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFK------------------ 127
TY+LG ++DLTN EF A+YT P+P+ S
Sbjct: 92 GLTYELGETAYTDLTNQEFMAMYTA---PAPAQLPADESVITTRAGPVDAVGGAPGQLPV 148
Query: 128 YQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQ 187
Y NLS T P S+DWR GAVTP+KNQ CG CWAF+ VA VEGI +IR+G L+ LSEQ+
Sbjct: 149 YVNLS-TSAPASVDWRASGAVTPVKNQGRCGSCWAFSTVAVVEGIYQIRTGKLVSLSEQE 207
Query: 188 LLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
L+DC T ++GC GG +A +I N GI
Sbjct: 208 LVDCDTL-DDGCDGGISYRALRWIASNGGI 236
Score = 88.6 bits (218), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 38/72 (52%), Positives = 51/72 (70%), Gaps = 6/72 (8%)
Query: 215 QGIFNGVCGTQLDHAVTIVGFGTTEDGAN-YWLIKNSWGNTWGDAGYMKIVRD-----EG 268
+G++NG CGT L+H VT+VG+G G + YW++KNSWG WGD GY+++ +D EG
Sbjct: 302 KGVYNGPCGTNLNHGVTVVGYGQEAAGGDRYWIVKNSWGQGWGDDGYIRMKKDVAGKPEG 361
Query: 269 LCGIGTRSSYPL 280
LCGI R SYPL
Sbjct: 362 LCGIAIRPSYPL 373
>gi|413953665|gb|AFW86314.1| hypothetical protein ZEAMMB73_546353 [Zea mays]
Length = 233
Score = 146 bits (368), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 81/225 (36%), Positives = 109/225 (48%), Gaps = 71/225 (31%)
Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLS- 184
F+Y+N+S +PT++DWR KGAVTPIK+Q +CGCCWAF+AVAA EGI KI +G L+ L+
Sbjct: 7 FRYENVSADALPTTIDWRTKGAVTPIKDQGQCGCCWAFSAVAATEGIVKISTGKLVSLAE 66
Query: 185 ---------------EQQLLDCS------------------TNGNNGCLGGSR------- 204
E L+D + T + C GS
Sbjct: 67 QELVDCDVHDEDQGCEGGLMDDAFKFIIKNGGLTTESSYPYTAADGKCKSGSNSAATIKG 126
Query: 205 --------EKAFAYIIQNQ------------------GIFNGVCGTQLDHAVTIVGFGTT 238
E A + NQ G+ G CGT LDH + +G+G T
Sbjct: 127 YEDVPANDEAALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYGKT 186
Query: 239 EDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 279
DG YWL+KNSWG TWG+ GY+++ +D G+CG+ SYP
Sbjct: 187 SDGTKYWLMKNSWGTTWGENGYLRMEKDISDKRGMCGLAMEPSYP 231
>gi|214015269|gb|ACJ62251.1| cysteine protease [Zea mays subsp. parviglumis]
Length = 247
Score = 146 bits (368), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 72/187 (38%), Positives = 117/187 (62%), Gaps = 20/187 (10%)
Query: 46 IHEKWMAQHGRSYK-----------DELEKEMRLKIFKENLEYIEKANKE---GNRTYKL 91
++E W ++HGR + +E ++ +RL++F++NL YI+K N E G T++L
Sbjct: 1 MYEAWKSKHGRGGRSNDDCDMAPGDEEEDRRLRLEVFRDNLRYIDKHNAEADAGLHTFRL 60
Query: 92 GTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDV-PTSLDWRDKGAVTP 150
G F+DLT DE+R G++ + + ++ ++ D+ P ++DWR GAVT
Sbjct: 61 GLTPFADLTLDEYRGRVLGFR----ARGRRSGHGYRSRHPRGGDLLPDAIDWRQLGAVTE 116
Query: 151 IKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAY 210
+K+Q++CG CWAF+AVAA+EG+ I +GNL+ LSEQ+++DC ++GC GG E AF +
Sbjct: 117 VKDQQQCGGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ-DSGCDGGQMENAFRF 175
Query: 211 IIQNQGI 217
+I N GI
Sbjct: 176 VIGNGGI 182
>gi|37788267|gb|AAO64473.1| cathepsin H precursor [Fundulus heteroclitus]
Length = 345
Score = 146 bits (368), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 104/326 (31%), Positives = 145/326 (44%), Gaps = 95/326 (29%)
Query: 45 EIHEK-WMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDE 103
E H K WMAQ+ ++Y D E RL+IF EN I+K N EGN ++ +G NQFS +T +E
Sbjct: 24 EFHFKSWMAQYNKAY-DFNEYYRRLQIFTENKRRIDKHN-EGNHSFTMGLNQFSGMTFNE 81
Query: 104 FRALYTGYKMPSPSHRSTTSSTFKYQNLS----MT-------------DVPTSLDWRDKG 146
FR + M P + S T + NL+ MT P S+DWR KG
Sbjct: 82 FRK---AFLMSEPQNCSATKGNYLSSNLNQFSGMTFNEFRKAFLMSEGPQPDSIDWRKKG 138
Query: 147 -AVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNN-GCLGGSR 204
+TP+K Q CG CW F+ +E +T I + L+ LSEQQL+DC+ + NN GC GG
Sbjct: 139 NYITPVKTQGSCGSCWTFSTTGCLESVTAIATVKLVPLSEQQLVDCAQDFNNHGCNGGLP 198
Query: 205 EKAFAYIIQNQGIFN----------GVCGTQLDHAVTIV--------------------- 233
+AF YI+ N+G+ G+C + A V
Sbjct: 199 SQAFEYIMYNKGLMTEQDYPYKFVEGICSYKPSLAAAFVKEVRNITAYDEMGMVDAVGTL 258
Query: 234 -----GFGTTEDGANY----------------------------------WLIKNSWGNT 254
F T+D +Y W++KNSWG++
Sbjct: 259 NPVSFAFEVTDDFMHYREGVYTSTTCHNTTDKVNHAVLAVGYGQEKGTPYWIVKNSWGSS 318
Query: 255 WGDAGYMKIVRDEGLCGIGTRSSYPL 280
WG GY I R + +CG+ SS P+
Sbjct: 319 WGIDGYFLIERGKNMCGLAACSSSPV 344
>gi|50657027|emb|CAH04631.1| cathepsin H [Suberites domuncula]
Length = 335
Score = 145 bits (367), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 77/201 (38%), Positives = 116/201 (57%), Gaps = 7/201 (3%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
+F+ + +LV S + + ++W +HG+ Y E E + RLK+F +N+ Y
Sbjct: 6 LFLGLCVLVHVCSAFIPLVLPIPGLYEDYFKEWQEKHGKVYSTEEESQSRLKVFMKNVIY 65
Query: 78 IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
I+ NK+G+ +Y+L N+++D+T DEF+ Y + P H S T S K D P
Sbjct: 66 IDNHNKQGH-SYELEVNEYADMTLDEFKDQY----LMEPQHCSATHS-LKSDPPKYRDPP 119
Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GN 196
++DWR KGAVTP+KNQ +CG CW F+ +E +++G L+ LSEQQL+DC+ N
Sbjct: 120 KAIDWRSKGAVTPVKNQGQCGSCWTFSTTGCLESHHFLKTGQLVSLSEQQLVDCAQAFNN 179
Query: 197 NGCLGGSREKAFAYIIQNQGI 217
NGC GG +AF YI N G+
Sbjct: 180 NGCNGGLPSQAFEYIHYNGGL 200
Score = 67.4 bits (163), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 27/57 (47%), Positives = 38/57 (66%)
Query: 225 QLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDEGLCGIGTRSSYPLA 281
++HAV VG+ TTE G +YW++KNSWG +G GY I R E +CG+ +SYP+
Sbjct: 279 HVNHAVLAVGYNTTESGEDYWIVKNSWGTKFGINGYFWIARGENMCGLADCASYPIV 335
>gi|413944252|gb|AFW76901.1| hypothetical protein ZEAMMB73_101481 [Zea mays]
Length = 232
Score = 145 bits (367), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 82/228 (35%), Positives = 111/228 (48%), Gaps = 71/228 (31%)
Query: 123 SSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQ 182
S+ F+Y+N+S+ +P ++DWR GAVTPIK+Q +CGCCWAF+AVAA EGI KI +G LI
Sbjct: 3 STGFRYENVSVDAIPATIDWRTNGAVTPIKDQGQCGCCWAFSAVAATEGIVKISTGKLIS 62
Query: 183 LS----------------EQQLLDCS------------------TNGNNGCLGGSR---- 204
LS E L+D + T + C GS
Sbjct: 63 LSEQELVDCDVYGEDQGCEGGLMDDAFKFIIKNGGLTTESNYPYTAADGKCKSGSNSAAN 122
Query: 205 -----------EKAFAYIIQNQ------------------GIFNGVCGTQLDHAVTIVGF 235
E A + NQ G+ G CGT LDH + +G+
Sbjct: 123 IKGYEDVPTNDEAALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGY 182
Query: 236 GTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 279
G T DG YWL+KNSWG TWG+ GY+++ +D +G+CG+ SYP
Sbjct: 183 GKTSDGTKYWLMKNSWGTTWGENGYLRMEKDISDKKGMCGLAIEPSYP 230
>gi|348514005|ref|XP_003444531.1| PREDICTED: cathepsin L1-like [Oreochromis niloticus]
Length = 338
Score = 145 bits (367), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 77/172 (44%), Positives = 106/172 (61%), Gaps = 9/172 (5%)
Query: 50 WMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEFRA 106
W + H + Y E E+ R ++++NL+ IE N + G TY+LG N F D+TN+EFR
Sbjct: 33 WKSWHTKKYH-EKEEGWRRMVWEKNLKKIELHNLDHSMGKHTYRLGMNHFGDMTNEEFRQ 91
Query: 107 LYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAV 166
L GYK + R S F N + P SLDWRDKG VTP+K+Q +CG CWAF+A
Sbjct: 92 LMNGYK--HKAERKVKGSLFLEPNF--LEAPRSLDWRDKGYVTPVKDQGQCGSCWAFSAT 147
Query: 167 AAVEGITKIRSGNLIQLSEQQLLDCS-TNGNNGCLGGSREKAFAYIIQNQGI 217
A+EG ++G ++QLSEQ L++CS GN GC GG ++AF Y+ NQG+
Sbjct: 148 GALEGQQFRKTGKMVQLSEQNLVECSRPEGNEGCNGGLMDQAFQYVKDNQGL 199
Score = 65.5 bits (158), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 29/60 (48%), Positives = 39/60 (65%), Gaps = 4/60 (6%)
Query: 225 QLDHAVTIVGFG---TTEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 280
+LDH V +VG+G DG YW++KNSW WGD GY+ + +D + CGI T +SYPL
Sbjct: 278 ELDHGVLLVGYGFEGEDVDGKKYWIVKNSWSEKWGDKGYVYMAKDRQNHCGIATAASYPL 337
>gi|195429415|ref|XP_002062758.1| GK19626 [Drosophila willistoni]
gi|194158843|gb|EDW73744.1| GK19626 [Drosophila willistoni]
Length = 341
Score = 145 bits (367), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 78/206 (37%), Positives = 119/206 (57%), Gaps = 9/206 (4%)
Query: 19 FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
F +ITLL++ + ++ ++ + V E + +H ++Y D E+ R+KIF EN +I
Sbjct: 3 FALITLLIALVA--MTQAVSYSELVREEWNTFKLEHRKNYADSTEETFRMKIFNENKHHI 60
Query: 79 EKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFK---YQNLS 132
K N+ G +YKL N+++D+ + EFR G+ +T +F + +
Sbjct: 61 AKHNQRYATGEVSYKLALNKYADMLHHEFRETMNGFNYTLHKQLRSTDESFTGVTFISPE 120
Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
+PT++DWR KGAVT +K+Q CG CWAF++ A+EG +SG L+ LSEQ L+DCS
Sbjct: 121 HVKLPTAVDWRTKGAVTEVKDQGHCGSCWAFSSTGAIEGQHFRKSGTLVSLSEQNLVDCS 180
Query: 193 TN-GNNGCLGGSREKAFAYIIQNQGI 217
T GNNGC GG + AF Y+ N GI
Sbjct: 181 TKYGNNGCNGGLMDNAFRYVKDNGGI 206
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 44/117 (37%), Positives = 69/117 (58%), Gaps = 9/117 (7%)
Query: 167 AAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIFN--GVCGT 224
A G I GN +L++ +T G + +++F + ++G+++
Sbjct: 230 ATDRGFADIPQGNEKKLAQA----VATIGPVSVAIDASQQSFQFY--SEGVYDEPNCSAE 283
Query: 225 QLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 280
LDH V +VG+GT +DG++YWL+KNSWG TWGD G++K+ R+ E CGI + SSYPL
Sbjct: 284 NLDHGVLVVGYGTEKDGSDYWLVKNSWGTTWGDKGFIKMSRNKENQCGIASASSYPL 340
>gi|52076122|dbj|BAD46635.1| putative cysteine proteinase [Oryza sativa Japonica Group]
Length = 416
Score = 145 bits (367), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 82/186 (44%), Positives = 116/186 (62%), Gaps = 7/186 (3%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN-KEGNRTYKLGTNQFSD 98
E+S+ ++E+W A + S +D + E R ++FK N YI + N K +Y LG N+FSD
Sbjct: 19 EESMWSLYERWRAVYAPS-RDLSDMESRFEVFKANARYIHEFNQKSKGMSYVLGLNKFSD 77
Query: 99 LTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECG 158
LT +EF A YTG K+ + + + T+S+ + L + P + DWR GAVT +K+Q +CG
Sbjct: 78 LTYEEFAAKYTGVKVDASAFATATTSSPD-EELPVGVPPATWDWRLNGAVTDVKDQGQCG 136
Query: 159 CCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCL-GGSREKAFAYIIQNQGI 217
CW F+AV AVEGI I +GNL+ LSEQQ+LDCS G+ CL GG A YI++N G+
Sbjct: 137 SCWVFSAVGAVEGINAIMTGNLLTLSEQQVLDCSNTGD--CLKGGDPRAALQYIVKN-GV 193
Query: 218 FNGVCG 223
CG
Sbjct: 194 TLDQCG 199
Score = 88.2 bits (217), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 37/74 (50%), Positives = 51/74 (68%), Gaps = 4/74 (5%)
Query: 211 IIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD---- 266
I G++NG CGT ++HAVT VG+G T+D NYW+ +NSWG WG++GY+++ RD
Sbjct: 327 ITTPYGVYNGPCGTSVNHAVTTVGYGVTQDNINYWIARNSWGPRWGESGYIRMKRDIAAK 386
Query: 267 EGLCGIGTRSSYPL 280
EGLCGI YP+
Sbjct: 387 EGLCGISMYGVYPI 400
Score = 55.1 bits (131), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 32/67 (47%), Positives = 43/67 (64%), Gaps = 7/67 (10%)
Query: 215 QGIFNGVCGT-QLDHAVTIVGF--GTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----E 267
+G+F G C T L+H V +VG+ TT D YW++KNSWG WG+ GY+++ RD
Sbjct: 263 KGVFTGRCKTAPLNHGVVVVGYGVNTTPDKTKYWIVKNSWGKGWGEGGYIRMKRDVGTPG 322
Query: 268 GLCGIGT 274
GLCGI T
Sbjct: 323 GLCGITT 329
>gi|214015396|gb|ACJ62314.1| cysteine protease [Zea mays subsp. parviglumis]
Length = 246
Score = 145 bits (367), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 73/186 (39%), Positives = 115/186 (61%), Gaps = 19/186 (10%)
Query: 46 IHEKWMAQHGR----------SYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLG 92
++E W ++HGR + DE + +RL++F++NL YI+K N E G T++LG
Sbjct: 1 MYEAWKSKHGRGGSNDDDCDMAPGDEEGRRLRLEVFRDNLRYIDKHNAEADAGLHTFRLG 60
Query: 93 TNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDV-PTSLDWRDKGAVTPI 151
F+DLT DE+R G++ + + ++ + D+ P ++DWR GAVT +
Sbjct: 61 LTPFADLTLDEYRGRVLGFR----ARGRRSGHGYRARRPRGGDLLPDAIDWRQLGAVTEV 116
Query: 152 KNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYI 211
K+Q++CG CWAF+AVAA+EG+ I +GNL+ LSEQ+++DC ++GC GG E AF ++
Sbjct: 117 KDQQQCGGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ-DSGCDGGQMENAFRFV 175
Query: 212 IQNQGI 217
I N GI
Sbjct: 176 IGNGGI 181
>gi|214015361|gb|ACJ62297.1| cysteine protease [Zea mays subsp. parviglumis]
Length = 247
Score = 145 bits (367), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 71/186 (38%), Positives = 114/186 (61%), Gaps = 18/186 (9%)
Query: 46 IHEKWMAQHGRSYK-----------DELEKEMRLKIFKENLEYIEKANKE---GNRTYKL 91
++E W ++HGR + +E ++ +RL++F++NL YI+ N E G T++L
Sbjct: 1 MYEAWKSKHGRGGRSNDDCDMAPGDEEEDRRLRLEVFRDNLRYIDAHNAEADAGLHTFRL 60
Query: 92 GTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPI 151
G F+DLT +E+R G++ + S + + D+P ++DWR GAVT +
Sbjct: 61 GLTPFADLTLEEYRGRVLGFRARGRRSGARYGSGYSVRG---GDLPDAIDWRQLGAVTEV 117
Query: 152 KNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYI 211
K+Q++CG CWAF+AVAA+EG+ I +GNL+ LSEQ+++DC ++GC GG E AF ++
Sbjct: 118 KDQQQCGGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ-DSGCDGGQMEDAFRFV 176
Query: 212 IQNQGI 217
I N GI
Sbjct: 177 IGNGGI 182
>gi|214015297|gb|ACJ62265.1| cysteine protease [Zea mays subsp. parviglumis]
Length = 251
Score = 145 bits (367), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 73/188 (38%), Positives = 114/188 (60%), Gaps = 18/188 (9%)
Query: 46 IHEKWMAQHGR-------------SYKDELEKEMRLKIFKENLEYIEKANKE---GNRTY 89
++E W ++HGR + E ++ +RL++F++NL YI+K N E G T+
Sbjct: 1 MYEAWKSKHGRGGSSNDDCDMAPGDDEQEEDRRLRLEVFRDNLRYIDKHNAEADAGLHTF 60
Query: 90 KLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVT 149
+LG F+DLT +E+R G++ + S Y+ D+P ++DWR GAVT
Sbjct: 61 RLGLTPFADLTLEEYRGRVLGFRARGRRSGARYGSGSGYRARG-GDLPDAIDWRQLGAVT 119
Query: 150 PIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFA 209
+K+Q++CG CWAF+AVAA+EG+ I +GNL+ LSEQ+++DC ++GC GG E AF
Sbjct: 120 EVKDQQQCGGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ-DSGCDGGQMENAFR 178
Query: 210 YIIQNQGI 217
++I N GI
Sbjct: 179 FVIGNGGI 186
>gi|214015295|gb|ACJ62264.1| cysteine protease [Zea mays subsp. parviglumis]
Length = 247
Score = 145 bits (367), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 71/186 (38%), Positives = 114/186 (61%), Gaps = 18/186 (9%)
Query: 46 IHEKWMAQHGRSYK-----------DELEKEMRLKIFKENLEYIEKANKE---GNRTYKL 91
++E W ++HGR + +E ++ +RL++F++NL YI+ N E G T++L
Sbjct: 1 MYEAWKSKHGRGGRSNDDCDMAPGDEEEDRRLRLEVFRDNLRYIDAHNAEADAGLHTFRL 60
Query: 92 GTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPI 151
G F+DLT +E+R G++ + S + + D+P ++DWR GAVT +
Sbjct: 61 GLTPFADLTLEEYRGRVLGFRARGRRSGARYGSGYSVRG---GDLPDAIDWRQLGAVTEV 117
Query: 152 KNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYI 211
K+Q++CG CWAF+AVAA+EG+ I +GNL+ LSEQ+++DC ++GC GG E AF ++
Sbjct: 118 KDQQQCGGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ-DSGCDGGQMEDAFRFV 176
Query: 212 IQNQGI 217
I N GI
Sbjct: 177 IGNGGI 182
>gi|242088413|ref|XP_002440039.1| hypothetical protein SORBIDRAFT_09g024940 [Sorghum bicolor]
gi|241945324|gb|EES18469.1| hypothetical protein SORBIDRAFT_09g024940 [Sorghum bicolor]
Length = 463
Score = 145 bits (367), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 68/181 (37%), Positives = 110/181 (60%), Gaps = 9/181 (4%)
Query: 46 IHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNR--------TYKLGTNQFS 97
+ + W A+HG++Y E+ RL +F +N ++ N N +Y L N F+
Sbjct: 40 LFDAWCAEHGKAYATPEERAARLAVFADNAAFVAAHNARVNAAGGGGAPPSYTLALNAFA 99
Query: 98 DLTNDEFRALYTG-YKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKE 156
DLT++EFRA G + + RS + ++ + + VP +LDWR+ GAVT +K+Q
Sbjct: 100 DLTHEEFRAARLGRIAAGAAALRSPAAPVYRGLDGGLGAVPDALDWRENGAVTKVKDQGS 159
Query: 157 CGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQG 216
CG CW+F+A A+EGI KI++G+L+ LSEQ+L+DC + N+GC GG + A+ ++++N G
Sbjct: 160 CGACWSFSATGAMEGINKIKTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYKFVVKNGG 219
Query: 217 I 217
I
Sbjct: 220 I 220
Score = 79.0 bits (193), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 46/120 (38%), Positives = 68/120 (56%), Gaps = 17/120 (14%)
Query: 166 VAAVEGITKIRSGN---LIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIFNGVC 222
+ ++G + + S L+Q QQ + G+ +AF Q QGIF+G C
Sbjct: 244 IVTIDGYSDVPSNKEDLLLQAVAQQPVSVGICGS--------ARAFQLYSQ-QGIFDGPC 294
Query: 223 GTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSY 278
T LDHAV IVG+G+ E G +YW++KNSWG +WG GYM + R+ +G+CGI +S+
Sbjct: 295 PTSLDHAVLIVGYGS-EGGKDYWIVKNSWGESWGMKGYMHMHRNTGDSKGVCGINMMASF 353
>gi|158524604|gb|ABW71226.1| cysteine protease [Nicotiana tabacum]
Length = 360
Score = 145 bits (367), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 106/319 (33%), Positives = 147/319 (46%), Gaps = 82/319 (25%)
Query: 33 VSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLG 92
V +S H S V ++ ++G+ Y+ E + R ++F +NL+ I NK+G +YKLG
Sbjct: 51 VVGQSRHALSFV----RFAHRYGKRYESVEEIKQRFEVFLDNLKMIRSHNKKG-LSYKLG 105
Query: 93 TNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIK 152
N+F+DLT DEFR G + + +TT K N + P + DWR+ G V+P+K
Sbjct: 106 VNEFTDLTWDEFRRDRLG---AAQNCSATTKGNVKLTNAVL---PETKDWREDGIVSPVK 159
Query: 153 NQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNN-GCLGGSREKAFAYI 211
NQ +CG CW F+ A+E G I LSEQQL+DC+ NN GC GG +AF YI
Sbjct: 160 NQGKCGSCWTFSTTGALEAAYSQAFGKGISLSEQQLVDCAGAFNNFGCNGGLPSQAFEYI 219
Query: 212 IQNQGI----------FNGVCG---------------------TQLDHAVTIV------- 233
N G+ NG+C +L +AV +V
Sbjct: 220 KSNGGLDTEEAYPYTGKNGLCKFSSENVGVKVIDSVNITLGAEDELKYAVALVRPVSIAF 279
Query: 234 ----GF-------------GTTEDGANY---------------WLIKNSWGNTWGDAGYM 261
GF G T N+ WLIKNSWG WGD GY
Sbjct: 280 EVIKGFKQYKSGVYSSTECGNTPMDVNHAVLAVGYGVENGVPYWLIKNSWGADWGDDGYF 339
Query: 262 KIVRDEGLCGIGTRSSYPL 280
K+ + +CGI T +SYP+
Sbjct: 340 KMEMGKNMCGIATCASYPV 358
>gi|75994616|gb|ABA33829.1| cysteine protease Mir1 [Zea mays subsp. parviglumis]
Length = 248
Score = 145 bits (366), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 73/185 (39%), Positives = 113/185 (61%), Gaps = 16/185 (8%)
Query: 46 IHEKWMAQHGR----------SYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLG 92
++E W ++HGR + DE + +RL++F++NL YI+K N E G T++LG
Sbjct: 1 MYEAWKSKHGRGGSNDDDCDMAPGDEEGRRLRLEVFRDNLRYIDKHNAEADAGLHTFRLG 60
Query: 93 TNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIK 152
F+DLT DE+R G++ + + + + +P ++DWR GAVT +K
Sbjct: 61 LTPFADLTLDEYRGRVLGFRARGRRSGARYGHRARPRGGDL--LPDAIDWRQLGAVTEVK 118
Query: 153 NQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYII 212
+Q++CG CWAF+AVAA+EGI I +GNL+ LSEQ+++DC ++GC GG E AF ++I
Sbjct: 119 DQQQCGGCWAFSAVAAIEGINAIATGNLVSLSEQEIIDCDAQ-DSGCDGGQMENAFRFVI 177
Query: 213 QNQGI 217
N GI
Sbjct: 178 GNGGI 182
>gi|75994628|gb|ABA33835.1| cysteine protease Mir1 [Zea mays subsp. parviglumis]
Length = 248
Score = 145 bits (366), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 72/187 (38%), Positives = 116/187 (62%), Gaps = 20/187 (10%)
Query: 46 IHEKWMAQHGRSYK-----------DELEKEMRLKIFKENLEYIEKANKE---GNRTYKL 91
++E W ++HGR + +E ++ +RL++F++NL YI+K N E G T++L
Sbjct: 1 MYEAWKSKHGRGGRSNDDCDMAPGDEEEDRRLRLEVFRDNLRYIDKHNAEADAGLHTFRL 60
Query: 92 GTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDV-PTSLDWRDKGAVTP 150
G F+DLT DE+R G++ + + ++ + D+ P ++DWR GAVT
Sbjct: 61 GLTPFADLTLDEYRGRVLGFR----ARGRRSGHGYRARRPRGGDLLPDAIDWRQLGAVTE 116
Query: 151 IKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAY 210
+K+Q++CG CWAF+AVAA+EG+ I +GNL+ LSEQ+++DC ++GC GG E AF +
Sbjct: 117 VKDQQQCGGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ-DSGCDGGQMENAFRF 175
Query: 211 IIQNQGI 217
+I N GI
Sbjct: 176 VIGNGGI 182
>gi|410923307|ref|XP_003975123.1| PREDICTED: cathepsin L1-like [Takifugu rubripes]
Length = 336
Score = 145 bits (366), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 84/204 (41%), Positives = 117/204 (57%), Gaps = 11/204 (5%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
MF ++ L + C + +S+ S Q + E W H + Y E E+ R ++++NL+
Sbjct: 1 MFPVVVLAL-CVTAALSAPSLDPQ-LDEHWNLWKDWHSKKYH-EKEEGWRRMVWEKNLKK 57
Query: 78 IEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT 134
IE N E G TY LG N F D+T++EFR + GYK+ S R S F N
Sbjct: 58 IELHNLEHSMGKHTYSLGMNHFGDMTHEEFRQIMNGYKLKS--QRKLRGSLFMEPNF--L 113
Query: 135 DVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-T 193
+ P S+DWRDKG VTP+K+Q +CG CWAF+ A+EG ++G L+ LSEQ L+DCS
Sbjct: 114 EAPRSVDWRDKGYVTPVKDQGQCGSCWAFSTTGAMEGQHFRKTGTLVSLSEQNLVDCSRP 173
Query: 194 NGNNGCLGGSREKAFAYIIQNQGI 217
GN GC GG ++AF YI N G+
Sbjct: 174 EGNEGCNGGLMDQAFQYIKDNGGL 197
Score = 65.5 bits (158), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 29/60 (48%), Positives = 39/60 (65%), Gaps = 4/60 (6%)
Query: 225 QLDHAVTIVGFG---TTEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 280
+LDH V +VG+G DG YW++KNSW WGD GY+ + +D + CGI T +SYPL
Sbjct: 276 ELDHGVLVVGYGFEGKDVDGKKYWIVKNSWSENWGDKGYIYMAKDKKNHCGIATAASYPL 335
>gi|2224808|emb|CAB09697.1| cysteine endopeptidase EP-A [Hordeum vulgare subsp. vulgare]
gi|326502180|dbj|BAK06781.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 365
Score = 145 bits (366), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 72/178 (40%), Positives = 107/178 (60%), Gaps = 4/178 (2%)
Query: 40 EQSVVEIHEKWMAQHGRSYKD--ELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
E+S+ ++E+W + + S + +E R +FKEN Y+ + NK +R ++L N+F+
Sbjct: 34 EESLRGLYERWRSHYTVSRRGLGADAEERRFNVFKENARYVHEGNKR-DRPFRLALNKFA 92
Query: 98 DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD-VPTSLDWRDKGAVTPIKNQKE 156
D+T DEFR Y G ++ S + D +P ++DWR KGAVT IK+Q +
Sbjct: 93 DMTTDEFRRTYAGSRVRHHLSLSGGRRGDGGFRYADADNLPPAVDWRQKGAVTAIKDQGQ 152
Query: 157 CGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQN 214
CG CWAF+ + AVEGI KIR+G L+ LSEQ+L+DC N GC GG + AF +I +N
Sbjct: 153 CGSCWAFSTIVAVEGINKIRTGKLVSLSEQELMDCDNVNNQGCEGGLMDYAFQFIQKN 210
Score = 88.2 bits (217), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 36/70 (51%), Positives = 48/70 (68%), Gaps = 4/70 (5%)
Query: 214 NQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGL 269
++G+F G C T LDH V VG+G T DG YW++KNSWG WG+ GY+++ R EGL
Sbjct: 277 SEGVFTGECSTDLDHGVAAVGYGATRDGTKYWIVKNSWGEDWGEKGYIRMQRGVSQTEGL 336
Query: 270 CGIGTRSSYP 279
CGI ++SYP
Sbjct: 337 CGIAMQASYP 346
>gi|147836416|emb|CAN75313.1| hypothetical protein VITISV_033592 [Vitis vinifera]
Length = 201
Score = 145 bits (366), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 71/134 (52%), Positives = 102/134 (76%), Gaps = 8/134 (5%)
Query: 86 NRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHR-STTSSTFKYQNLSMTDVPTSLDWRD 144
+++YKL N+F+DLTN+EFRA +K +H ST +++FKY+++ T VP+++DWR
Sbjct: 2 DKSYKLSINEFADLTNEEFRASRNRFK----AHICSTEATSFKYEHV--TAVPSTVDWRK 55
Query: 145 KGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGS 203
KGAVTPIK+Q +CG CWAF+AVAA+EGIT++ +G LI LSEQ+L+DC T+G + GC GG
Sbjct: 56 KGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGEDQGCNGGL 115
Query: 204 REKAFAYIIQNQGI 217
+ AF +I QN G+
Sbjct: 116 MDDAFKFIEQNHGL 129
>gi|157644745|gb|ABV59078.1| cathepsin L [Lates calcarifer]
Length = 337
Score = 145 bits (366), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 81/204 (39%), Positives = 118/204 (57%), Gaps = 11/204 (5%)
Query: 19 FIIITLLVSCASQVVSSRSTHEQSVVEIH-EKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
+ + +L C S +S+ S Q ++ H + W + H + Y E E+ R ++++NL+
Sbjct: 1 MLPLAVLAVCLSAALSAPSLDPQ--LDDHWDLWKSWHSKKYH-EKEEGWRRMVWEKNLKK 57
Query: 78 IEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT 134
IE N E G Y+LG N F D+T++EFR + GYK + R S F N
Sbjct: 58 IELHNLEHSMGKHPYRLGMNHFGDMTHEEFRQIMNGYKQ-RKTERKFKGSLFMEPNF--L 114
Query: 135 DVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-T 193
+ P +LDWRDKG VTP+K+Q +CG CWAF+ A+EG ++G L+ LSEQ L+DCS
Sbjct: 115 EAPRALDWRDKGYVTPVKDQGQCGSCWAFSTTGALEGQQFRKTGKLVSLSEQNLVDCSRP 174
Query: 194 NGNNGCLGGSREKAFAYIIQNQGI 217
GN GC GG ++AF Y+ NQG+
Sbjct: 175 EGNEGCNGGLMDQAFQYVKDNQGL 198
Score = 65.5 bits (158), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 29/60 (48%), Positives = 39/60 (65%), Gaps = 4/60 (6%)
Query: 225 QLDHAVTIVGFG---TTEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 280
+LDH V +VG+G DG YW++KNSW WGD GY+ + +D + CGI T +SYPL
Sbjct: 277 ELDHGVLVVGYGYEGEDVDGKKYWIVKNSWSEKWGDKGYIYMAKDRKNHCGIATAASYPL 336
>gi|214015263|gb|ACJ62248.1| cysteine protease [Zea mays subsp. parviglumis]
Length = 252
Score = 145 bits (366), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 70/188 (37%), Positives = 113/188 (60%), Gaps = 17/188 (9%)
Query: 46 IHEKWMAQHGRSYKDE-------------LEKEMRLKIFKENLEYIEKANKE---GNRTY 89
++E W ++HGR ++ ++ +RL++F++NL YI+K N E G T+
Sbjct: 1 MYEAWKSKHGRGGSNDDCDMAPGDEEQEEEDRRLRLEVFRDNLRYIDKHNAEADAGLHTF 60
Query: 90 KLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVT 149
+LG F+DLT DE+R G++ + ++ + +P ++DWR GAVT
Sbjct: 61 RLGLTPFADLTLDEYRGRVLGFRARRSRSGARYGHGYRARPRGGDLLPDAIDWRQLGAVT 120
Query: 150 PIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFA 209
+K+Q++CG CWAF+AVAA+EG+ I +GNL+ LSEQ+++DC ++GC GG E AF
Sbjct: 121 EVKDQQQCGGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ-DSGCDGGQMENAFR 179
Query: 210 YIIQNQGI 217
++I N GI
Sbjct: 180 FVIGNGGI 187
>gi|75994626|gb|ABA33834.1| cysteine protease Mir1 [Zea mays subsp. parviglumis]
Length = 248
Score = 145 bits (366), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 72/187 (38%), Positives = 116/187 (62%), Gaps = 20/187 (10%)
Query: 46 IHEKWMAQHGRSY-----------KDELEKEMRLKIFKENLEYIEKANKE---GNRTYKL 91
++E W ++HGR ++E ++ +RL++F++NL YI+K N E G T++L
Sbjct: 1 MYEAWKSKHGRGGSSNDDCDIAPGEEEEDRRLRLEVFRDNLRYIDKHNAEADAGLHTFRL 60
Query: 92 GTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDV-PTSLDWRDKGAVTP 150
G F+DLT DE+R G++ + + ++ + D+ P ++DWR GAVT
Sbjct: 61 GLTPFADLTLDEYRGRVLGFR----ARGRRSGHGYRARRPRGGDLLPDAIDWRQLGAVTE 116
Query: 151 IKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAY 210
+K+Q++CG CWAF+AVAA+EG+ I +GNL+ LSEQ+++DC ++GC GG E AF +
Sbjct: 117 VKDQQQCGGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ-DSGCDGGQMEDAFRF 175
Query: 211 IIQNQGI 217
+I N GI
Sbjct: 176 VIGNGGI 182
>gi|390337642|ref|XP_780653.3| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
Length = 333
Score = 145 bits (366), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 81/202 (40%), Positives = 124/202 (61%), Gaps = 9/202 (4%)
Query: 19 FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
++ + L+ +C VVSS S E +W +HG+ Y + E+ R I+++NL+ +
Sbjct: 3 YLSVLLVAAC---VVSSLSMSFTDFDEDWNEWKNEHGKRYLSDEEEASRRLIWQKNLDIV 59
Query: 79 EKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD 135
K N + G+ TY LG NQF+DL N+EF A+ TG+++ S + ++ STF N ++ +
Sbjct: 60 IKHNLKYDLGHFTYDLGINQFTDLQNEEFVAMMTGFRV-SGTSKAAKGSTFLPPN-NVGE 117
Query: 136 VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG 195
+P ++DWR KG VTP+K+Q +CG CWAF+ +VEG +G L+ LSEQ L+DCS
Sbjct: 118 LPKTVDWRTKGYVTPVKDQGQCGSCWAFSTTGSVEGQHFKATGKLVSLSEQNLVDCSGR- 176
Query: 196 NNGCLGGSREKAFAYIIQNQGI 217
+ GC GG ++AF YII GI
Sbjct: 177 DAGCDGGFMDRAFQYIIDAGGI 198
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 34/68 (50%), Positives = 46/68 (67%), Gaps = 3/68 (4%)
Query: 216 GIFN--GVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGI 272
G++N G T LDH V VG+GT+ DG +YW++KNSW TWG GY+ + R+ + CGI
Sbjct: 265 GVYNEPGCDSTVLDHGVLAVGYGTSSDGTDYWIVKNSWAETWGMNGYVWMSRNKDNQCGI 324
Query: 273 GTRSSYPL 280
T +SYPL
Sbjct: 325 ATNASYPL 332
>gi|8347420|dbj|BAA96501.1| cysteine protease [Nicotiana tabacum]
Length = 360
Score = 145 bits (366), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 101/303 (33%), Positives = 142/303 (46%), Gaps = 78/303 (25%)
Query: 49 KWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALY 108
++ ++G+ Y+ E + R ++F +NL+ I NK+G +YKLG N+F+DLT DEFR
Sbjct: 63 RFAHRYGKRYESVEEIKQRFEVFLDNLKMIRSHNKKG-LSYKLGVNEFTDLTWDEFRRDR 121
Query: 109 TGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAA 168
G + + +TT K N+ + P + DWR+ G V+P+KNQ +CG CW F+ A
Sbjct: 122 LG---AAQNCSATTKGNLKVTNVVL---PETKDWREAGIVSPVKNQGKCGSCWTFSTTGA 175
Query: 169 VEGITKIRSGNLIQLSEQQLLDCSTNGNN-GCLGGSREKAFAYIIQNQGI---------- 217
+E G I LSEQQL+DC+ NN GC GG +AF YI N G+
Sbjct: 176 LEAAYSQAFGKGISLSEQQLVDCAGAFNNFGCNGGLPSQAFEYIKSNGGLDTEEAYPYTG 235
Query: 218 FNGVCG---------------------TQLDHAVTIV-----------GF---------- 235
NG+C +L +AV +V GF
Sbjct: 236 KNGLCKFSSENVGVKVIDSVNITLGAEDELKYAVALVRPVSIAFEVIKGFKQYKSGVYTS 295
Query: 236 ---GTTEDGANY---------------WLIKNSWGNTWGDAGYMKIVRDEGLCGIGTRSS 277
G T N+ WLIKNSWG WGD GY K+ + +CGI T +S
Sbjct: 296 TECGNTPMDVNHAVLAVGYGVENGVPYWLIKNSWGADWGDNGYFKMEMGKNMCGIATCAS 355
Query: 278 YPL 280
YP+
Sbjct: 356 YPV 358
>gi|16506815|gb|AAL23962.1|AF426248_1 truncated cathepsin H [Homo sapiens]
Length = 323
Score = 145 bits (366), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 102/331 (30%), Positives = 147/331 (44%), Gaps = 87/331 (26%)
Query: 23 TLLVSCASQVVSSRSTHEQSVVEIHEK-WMAQHGRSYKDELEKEMRLKIFKENLEYIEKA 81
TL + CA+++ + S+ + H K WM++H ++Y E E RL+ F N I A
Sbjct: 4 TLPLLCAAELCVN------SLEKFHFKSWMSKHRKTYSTE-EYHHRLQTFASNWRKIN-A 55
Query: 82 NKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLD 141
+ GN T+K+ NQFSD++ E + Y P + S T S + P S+D
Sbjct: 56 HNNGNHTFKMALNQFSDMSFAEIKHKYL---WSEPQNCSATKSNYL---RGTGPYPPSVD 109
Query: 142 WRDKGA-VTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNN-GC 199
WR KG V+P+KNQ CG CW F+ A+E I +G ++ L+EQQL+DC+ + NN GC
Sbjct: 110 WRKKGNFVSPVKNQGACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNYGC 169
Query: 200 LGGSREKAFAYIIQNQGIF----------NGVCGTQLDHAVTIV---------------- 233
GG +AF YI+ N+GI +G C Q A+ V
Sbjct: 170 QGGLPSQAFEYILYNKGIMGEDTYPYQGKDGYCKFQPGKAIGFVKDVANITIYDEEAMVE 229
Query: 234 ----------GFGTTED----------------------------------GANYWLIKN 249
F T+D G YW++KN
Sbjct: 230 AVALYNPVSFAFEVTQDFMMYRTGIYSSTSCHKTPDKVNHAVLAVGYGEKNGIPYWIVKN 289
Query: 250 SWGNTWGDAGYMKIVRDEGLCGIGTRSSYPL 280
SWG WG GY I R + +CG+ +SYP+
Sbjct: 290 SWGPQWGMNGYFLIERGKNMCGLAACASYPI 320
>gi|413943290|gb|AFW75939.1| maize insect resistance1 [Zea mays]
Length = 435
Score = 145 bits (366), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 75/205 (36%), Positives = 119/205 (58%), Gaps = 19/205 (9%)
Query: 31 QVVSSRSTHEQSVVEIHEKWMAQHGR-------------SYKDELEKEMRLKIFKENLEY 77
+V + ++ V ++E W ++HGR + E ++ +RL++F++NL Y
Sbjct: 68 RVPAPAERADEEVRRMYEAWKSKHGRGGSSNDDCDMAPGDDEQEEDRRLRLEVFRDNLRY 127
Query: 78 IEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT 134
I+K N E G T++LG F+DLT DE+R G++ + + Y+
Sbjct: 128 IDKHNAEADAGLHTFRLGLTPFADLTLDEYRGRVLGFRARARRSGARYGHGHGYRARPRG 187
Query: 135 D--VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
+P ++DWR GAVT +K+Q++CG CWAF+AVAA+EGI I +GNL+ LSEQ+++DC
Sbjct: 188 GDLLPDAIDWRQLGAVTEVKDQQQCGGCWAFSAVAAIEGINAIATGNLVSLSEQEIIDCD 247
Query: 193 TNGNNGCLGGSREKAFAYIIQNQGI 217
++GC GG E AF ++I N GI
Sbjct: 248 AQ-DSGCDGGQMENAFRFVIGNGGI 271
Score = 87.8 bits (216), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 47/122 (38%), Positives = 71/122 (58%), Gaps = 18/122 (14%)
Query: 166 VAAVEGITKIRSGNLIQLSEQ---QLLDCSTNGNNGCLGGSREKAFAYIIQNQGIFNGVC 222
VA ++G+ ++ S N L E Q + + + + +AF + + GIFNG C
Sbjct: 296 VATIDGLVEVASNNETALQEAVAIQPVSVAIDASG--------RAFQHY--SSGIFNGPC 345
Query: 223 GTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMK----IVRDEGLCGIGTRSSY 278
GT LDH VT VG+G+ E G +YW++KNSW +WG+AGY++ + R G CGI +SY
Sbjct: 346 GTSLDHGVTAVGYGS-ESGKDYWIVKNSWSASWGEAGYIRMRRNVPRPTGKCGIAMDASY 404
Query: 279 PL 280
P+
Sbjct: 405 PV 406
>gi|390337645|ref|XP_001199228.2| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
Length = 333
Score = 145 bits (366), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 81/202 (40%), Positives = 124/202 (61%), Gaps = 9/202 (4%)
Query: 19 FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
++ + L+ C VVSS S E ++W +HG+ Y + E+ R I+++NL+ +
Sbjct: 3 YLSVLLVAVC---VVSSLSMSFTDFDEDWKEWKNEHGKRYLSDEEEASRRLIWQKNLDIV 59
Query: 79 EKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD 135
+ N + G+ TY LG NQF+DL N EF A+ TG+++ S ++ STF N ++
Sbjct: 60 IRHNLKYDLGHFTYDLGMNQFADLQNKEFVAMMTGFRVNGTS-KAAKGSTFLPPN-NVGK 117
Query: 136 VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG 195
+P ++DWR KG VTP+K+Q +CG CWAF+A ++EG ++G L+ LSEQ L+DCS +
Sbjct: 118 LPKTVDWRTKGYVTPVKDQGQCGSCWAFSATGSLEGQHFKKTGKLVSLSEQNLVDCS-DK 176
Query: 196 NNGCLGGSREKAFAYIIQNQGI 217
N GC GG ++AF YII GI
Sbjct: 177 NYGCNGGLMDRAFQYIIDAGGI 198
Score = 77.8 bits (190), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 36/76 (47%), Positives = 51/76 (67%), Gaps = 3/76 (3%)
Query: 208 FAYIIQNQGIFN--GVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR 265
F++ + G++N G T LDH V VG+GTT DG +YW++KNSW TWG GY+ + R
Sbjct: 257 FSFQLYQSGVYNEPGCSSTLLDHGVLAVGYGTTIDGTDYWIVKNSWAETWGMNGYIWMSR 316
Query: 266 D-EGLCGIGTRSSYPL 280
+ + CGI T++SYPL
Sbjct: 317 NKDNQCGIATQASYPL 332
>gi|222642109|gb|EEE70241.1| hypothetical protein OsJ_30359 [Oryza sativa Japonica Group]
Length = 351
Score = 145 bits (366), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 82/186 (44%), Positives = 116/186 (62%), Gaps = 7/186 (3%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN-KEGNRTYKLGTNQFSD 98
E+S+ ++E+W A + S +D + E R ++FK N YI + N K +Y LG N+FSD
Sbjct: 19 EESMWSLYERWRAVYAPS-RDLSDMESRFEVFKANARYIHEFNQKSKGMSYVLGLNKFSD 77
Query: 99 LTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECG 158
LT +EF A YTG K+ + + + T+S+ + L + P + DWR GAVT +K+Q +CG
Sbjct: 78 LTYEEFAAKYTGVKVDASAFATATTSSPD-EELPVGVPPATWDWRLNGAVTDVKDQGQCG 136
Query: 159 CCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCL-GGSREKAFAYIIQNQGI 217
CW F+AV AVEGI I +GNL+ LSEQQ+LDCS G+ CL GG A YI++N G+
Sbjct: 137 SCWVFSAVGAVEGINAIMTGNLLTLSEQQVLDCSNTGD--CLKGGDPRAALQYIVKN-GV 193
Query: 218 FNGVCG 223
CG
Sbjct: 194 TLDQCG 199
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 33/71 (46%), Positives = 46/71 (64%), Gaps = 7/71 (9%)
Query: 215 QGIFNGVCGT-QLDHAVTIVGF--GTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----E 267
+G+F G C T L+H V +VG+ TT D YW++KNSWG WG+ GY+++ RD
Sbjct: 263 KGVFTGRCKTAPLNHGVVVVGYGVNTTPDKTKYWIVKNSWGKGWGEGGYIRMKRDVGTPG 322
Query: 268 GLCGIGTRSSY 278
GLCGI T ++Y
Sbjct: 323 GLCGITTYATY 333
>gi|94448674|emb|CAI91575.1| cathepsin L2 [Lubomirskia baicalensis]
Length = 324
Score = 145 bits (366), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 74/171 (43%), Positives = 106/171 (61%), Gaps = 8/171 (4%)
Query: 50 WMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE-GNRTYKLGTNQFSDLTNDEFRALY 108
W A+HG+SY++ E+ +R ++ N +YI++ N+ G Y L NQF DL N EF++LY
Sbjct: 25 WKAEHGKSYRNHKEEMLRHVTWQANKKYIDEHNQHAGVFGYTLKMNQFGDLENSEFKSLY 84
Query: 109 TGYKMP-SPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVA 167
GY+M +P + Q D+P S+DW KG VTP+KNQ +CG CW+F+A
Sbjct: 85 NGYRMSNAPRKGKPFVPAARVQ-----DLPASVDWSKKGWVTPVKNQGQCGSCWSFSATG 139
Query: 168 AVEGITKIRSGNLIQLSEQQLLDCS-TNGNNGCLGGSREKAFAYIIQNQGI 217
++EG +G L+ LSEQ L+DCS GN+GC GG + AF Y+I+N GI
Sbjct: 140 SMEGQHFNATGTLMSLSEQNLVDCSAAEGNHGCNGGLMDDAFEYVIKNNGI 190
Score = 67.0 bits (162), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 32/59 (54%), Positives = 44/59 (74%), Gaps = 4/59 (6%)
Query: 224 TQLDHAVTIVGFGTTEDGA-NYWLIKNSWGNTWGDAGYMKIVRDE-GLCGIGTRSSYPL 280
T LDH V VG+GT DG+ +YWL+KNSWG +WG +GY+++VR+ CGI T +SYP+
Sbjct: 267 TNLDHGVLAVGYGT--DGSKDYWLVKNSWGASWGMSGYIEMVRNHNNKCGIATSASYPV 323
>gi|125526836|gb|EAY74950.1| hypothetical protein OsI_02846 [Oryza sativa Indica Group]
Length = 359
Score = 145 bits (366), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 79/178 (44%), Positives = 104/178 (58%), Gaps = 17/178 (9%)
Query: 47 HEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRA 106
H WMA+ GR+Y D EK R ++F+ N E I+ AN+ G+ TY LG F+DLT DEFRA
Sbjct: 38 HRCWMARVGRTYADAAEKARRFEVFRANAERIDAANRAGDLTYTLGLTPFADLTADEFRA 97
Query: 107 LYTGYKMPSPSHRSTTSSTFKY--------QNLSMTDVPT---SLDWRDKGAVTPIKNQK 155
+ MP ++ + Q+L + P S DWRD GAVTP+++Q
Sbjct: 98 RHL---MPDADVDEPATARVLFEQEEKAAKQHLPPSRPPAVWGSKDWRDLGAVTPVQDQD 154
Query: 156 E--CGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYI 211
+ C CWAFAAVAA EG+ KI +GN+ LS QQ+LDC T G+N C GG +A YI
Sbjct: 155 KNNCNSCWAFAAVAATEGLIKIETGNVTPLSAQQVLDC-TGGDNTCKGGHIHEALRYI 211
Score = 78.2 bits (191), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 31/59 (52%), Positives = 41/59 (69%), Gaps = 1/59 (1%)
Query: 222 CGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDEGLCGIGTRSSYPL 280
CG + +HAV +VG+GT DG YWL+KNSWG WG+ GYM+I D CG+ +R +YP
Sbjct: 301 CGKKRNHAVAVVGYGTASDGTPYWLLKNSWGTDWGENGYMRIAVDAD-CGVSSRPAYPF 358
>gi|75994610|gb|ABA33826.1| cysteine protease Mir1 [Zea mays subsp. parviglumis]
Length = 248
Score = 145 bits (365), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 72/185 (38%), Positives = 114/185 (61%), Gaps = 16/185 (8%)
Query: 46 IHEKWMAQHGR----------SYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLG 92
++E W ++HGR + DE + +RL++F++NL YI+K N E G T++LG
Sbjct: 1 MYEAWKSKHGRGGSNDDDCDMAPGDEEGRRLRLEVFRDNLRYIDKHNAEADAGLHTFRLG 60
Query: 93 TNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIK 152
F+DLT DE+R G++ + + + + + +P ++DWR GAVT +K
Sbjct: 61 LTPFADLTLDEYRGRVLGFRARARRSGARYGHRARPRGGDL--LPDAIDWRQLGAVTEVK 118
Query: 153 NQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYII 212
+Q++CG CWAF+AVAA+EG+ I +GNL+ LSEQ+++DC ++GC GG E AF ++I
Sbjct: 119 DQQQCGGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ-DSGCDGGQMEDAFRFVI 177
Query: 213 QNQGI 217
N GI
Sbjct: 178 GNGGI 182
>gi|214015359|gb|ACJ62296.1| cysteine protease [Zea mays subsp. parviglumis]
Length = 247
Score = 145 bits (365), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 72/185 (38%), Positives = 114/185 (61%), Gaps = 16/185 (8%)
Query: 46 IHEKWMAQHGR----------SYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLG 92
++E W ++HGR + DE + +RL++F++NL YI+K N E G T++LG
Sbjct: 1 MYEAWKSKHGRGGSNDDDCDMAPGDEEGRRLRLEVFRDNLRYIDKHNAEADAGLHTFRLG 60
Query: 93 TNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIK 152
F+DLT DE+R G++ + + + + + +P ++DWR GAVT +K
Sbjct: 61 LTPFADLTLDEYRGRVLGFRARARRSGARYGHRARPRGGDL--LPDAIDWRQLGAVTEVK 118
Query: 153 NQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYII 212
+Q++CG CWAF+AVAA+EG+ I +GNL+ LSEQ+++DC ++GC GG E AF ++I
Sbjct: 119 DQQQCGGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ-DSGCDGGQMEDAFRFVI 177
Query: 213 QNQGI 217
N GI
Sbjct: 178 GNGGI 182
>gi|214015311|gb|ACJ62272.1| cysteine protease [Zea mays subsp. parviglumis]
gi|214015376|gb|ACJ62304.1| cysteine protease [Zea mays subsp. parviglumis]
Length = 247
Score = 145 bits (365), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 72/185 (38%), Positives = 113/185 (61%), Gaps = 16/185 (8%)
Query: 46 IHEKWMAQHGR----------SYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLG 92
++E W ++HGR + DE + +RL++F++NL YI+K N E G T++LG
Sbjct: 1 MYEAWKSKHGRGGSNDDDCDMAPGDEEGRRLRLEVFRDNLRYIDKHNAEADAGLHTFRLG 60
Query: 93 TNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIK 152
F+DLT DE+R G++ + + + + +P ++DWR GAVT +K
Sbjct: 61 LTPFADLTLDEYRGRVLGFRARGRRSGARYGHRARPRGGDL--LPDAIDWRQLGAVTEVK 118
Query: 153 NQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYII 212
+Q++CG CWAF+AVAA+EG+ I +GNL+ LSEQ+++DC ++GC GG E AF ++I
Sbjct: 119 DQQQCGGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ-DSGCDGGQMEDAFRFVI 177
Query: 213 QNQGI 217
N GI
Sbjct: 178 GNGGI 182
>gi|75994632|gb|ABA33837.1| cysteine protease Mir1 [Zea mays subsp. parviglumis]
Length = 248
Score = 145 bits (365), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 72/185 (38%), Positives = 113/185 (61%), Gaps = 16/185 (8%)
Query: 46 IHEKWMAQHGR----------SYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLG 92
++E W ++HGR + DE + +RL++F++NL YI+K N E G T++LG
Sbjct: 1 MYEAWKSKHGRGGSNDDDCDMAPGDEEGRRLRLEVFRDNLRYIDKHNAEADAGLHTFRLG 60
Query: 93 TNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIK 152
F+DLT DE+R G++ + + + + +P ++DWR GAVT +K
Sbjct: 61 LTPFADLTLDEYRGRVLGFRARGRRSGARYGHRARPRGGDL--LPDAIDWRQLGAVTEVK 118
Query: 153 NQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYII 212
+Q++CG CWAF+AVAA+EG+ I +GNL+ LSEQ+++DC ++GC GG E AF ++I
Sbjct: 119 DQQQCGGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ-DSGCDGGQMENAFRFVI 177
Query: 213 QNQGI 217
N GI
Sbjct: 178 GNGGI 182
>gi|356509992|ref|XP_003523725.1| PREDICTED: oryzain alpha chain-like [Glycine max]
Length = 439
Score = 145 bits (365), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 71/178 (39%), Positives = 110/178 (61%), Gaps = 9/178 (5%)
Query: 45 EIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGN-----RTYKLGTNQFSDL 99
E+ EKW +H ++Y E EK RLK+F++N ++ + N+ N +Y L N F+DL
Sbjct: 31 ELFEKWCKEHSKTYSSEEEKLYRLKVFEDNYAFVAQHNQNANNNNNNSSYTLSLNAFADL 90
Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGC 159
T+ EF+ G + + + Q+ + +P+ +DWR GAVTP+K+Q CG
Sbjct: 91 THHEFKTTRLGLPLTLLRFKRPQNQ----QSRDLLHIPSQIDWRQSGAVTPVKDQASCGA 146
Query: 160 CWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
CWAF+A A+EGI KI +G+L+ LSEQ+L+DC T+ N+GC GG + A+ ++I N+GI
Sbjct: 147 CWAFSATGAIEGINKIVTGSLVSLSEQELIDCDTSYNSGCGGGLMDFAYQFVIDNKGI 204
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 41/87 (47%), Positives = 59/87 (67%), Gaps = 9/87 (10%)
Query: 198 GCLGGSREKAFAYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGD 257
G G RE + + ++GIF G C T LDHAV IVG+G+ E+G +YW++KNSWG WG
Sbjct: 256 GICGSERE----FQLYSKGIFTGPCSTFLDHAVLIVGYGS-ENGVDYWIVKNSWGKYWGM 310
Query: 258 AGYMKIVRD----EGLCGIGTRSSYPL 280
GY+ ++R+ +G+CGI T +SYP+
Sbjct: 311 NGYIHMIRNSGNSKGICGINTLASYPV 337
>gi|326430491|gb|EGD76061.1| cathepsin [Salpingoeca sp. ATCC 50818]
Length = 381
Score = 145 bits (365), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 96/269 (35%), Positives = 129/269 (47%), Gaps = 77/269 (28%)
Query: 56 RSYKDELEKEMRLKIFKENLEYIEKANKEGNR---TYKLGTNQFSDLTNDEFRALYTGYK 112
+ Y+ E+ R IF +NL +I + N E R T+ +G NQF+DLTN+E+R LY
Sbjct: 29 KQYESPEEEARRFAIFADNLAFIARHNAEAARGLHTHTVGVNQFADLTNEEYRQLYL--- 85
Query: 113 MPSPSHRSTTSSTFKYQNLSMTDVPT--SLDWRDKGAVTPIKNQKECGCCWAFAAVAAVE 170
P P T + + D P S+DWR KGAVTPIKNQ +CG CW+F+ +VE
Sbjct: 86 RPYP-----TELLGRERQEVWLDGPNAGSVDWRQKGAVTPIKNQGQCGSCWSFSTTGSVE 140
Query: 171 GITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGI----------FN 219
G I +GNL+ LSEQQL+DCS + GN GC GG + AF YII N G+ +
Sbjct: 141 GAHAIATGNLVSLSEQQLVDCSGSFGNQGCNGGLMDNAFKYIISNGGLDTEQDYPYTARD 200
Query: 220 GVC--GTQLDHAVTIVGF------------------------------------------ 235
GVC + HAV+I G+
Sbjct: 201 GVCDKSKESKHAVSISGYKDVPQNNEDQLAAAVEKGPVSVAIEADQQSFQMYSSGVFSGP 260
Query: 236 -GTTEDG--------ANYWLIKNSWGNTW 255
GT D ++YW++KNSWG +W
Sbjct: 261 CGTNLDHGVLVVGYTSDYWIVKNSWGASW 289
>gi|214015370|gb|ACJ62301.1| cysteine protease [Zea mays subsp. parviglumis]
gi|214015384|gb|ACJ62308.1| cysteine protease [Zea mays subsp. parviglumis]
gi|214015392|gb|ACJ62312.1| cysteine protease [Zea mays subsp. parviglumis]
Length = 247
Score = 145 bits (365), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 72/185 (38%), Positives = 113/185 (61%), Gaps = 16/185 (8%)
Query: 46 IHEKWMAQHGR----------SYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLG 92
++E W ++HGR + DE + +RL++F++NL YI+K N E G T++LG
Sbjct: 1 MYEAWKSKHGRGGSNDDDCDMAPGDEEGRRLRLEVFRDNLRYIDKHNAEADAGLHTFRLG 60
Query: 93 TNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIK 152
F+DLT DE+R G++ + + + + +P ++DWR GAVT +K
Sbjct: 61 LTPFADLTLDEYRGRVLGFRARGRRSGARYGHRARPRGGDL--LPDAIDWRQLGAVTEVK 118
Query: 153 NQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYII 212
+Q++CG CWAF+AVAA+EG+ I +GNL+ LSEQ+++DC ++GC GG E AF ++I
Sbjct: 119 DQQQCGGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ-DSGCDGGQMEDAFRFVI 177
Query: 213 QNQGI 217
N GI
Sbjct: 178 GNGGI 182
>gi|167526493|ref|XP_001747580.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163774026|gb|EDQ87660.1| predicted protein [Monosiga brevicollis MX1]
Length = 330
Score = 145 bits (365), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 104/319 (32%), Positives = 149/319 (46%), Gaps = 81/319 (25%)
Query: 30 SQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEK-EMRLKIFKENLEYIEKANKEGNRT 88
SQ + R+ H V++ + HG Y +L E + NL IE A+ GN +
Sbjct: 11 SQFLPRRNLH--LVLKGPTAFRRIHGVFYSSQLGLCEPAFRCHLANLRVIE-AHNAGNSS 67
Query: 89 YKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP-TSLDWRDKGA 147
+ +G QF+DLT EF A + M + T + +T+ P +DWR K A
Sbjct: 68 FTMGITQFADLTAAEFSAYVKRFPM---------NVTRPRNEVWITEAPLQEVDWRQKNA 118
Query: 148 VTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREK 206
VT IKNQ +CG CW+F+ +VEG I +G L+ LSEQQL+DCST GN+GC GG +
Sbjct: 119 VTEIKNQGQCGSCWSFSTTGSVEGAHAIATGKLVSLSEQQLMDCSTRYGNHGCNGGLMDY 178
Query: 207 AFAYIIQNQGI----------FNGVCGTQLD--HAVTIVGF------------------- 235
AF Y+I N G+ +G C T+ + HA I GF
Sbjct: 179 AFEYVIANGGLDTEEDYPYTAEDGKCNTEKEKKHAAEIHGFRNVPKEHEDQLAAAVSIGP 238
Query: 236 ------------------------GTTEDGA--------NYWLIKNSWGNTWGDAGYMKI 263
GT+ D +YW++KNSWG +WG+ GY+++
Sbjct: 239 VSVAIEADQAGFQHYTSGVFDGKCGTSLDHGVLVVGYSDDYWIVKNSWGKSWGEEGYIRL 298
Query: 264 VR---DEGLCGIGTRSSYP 279
R +G+CGI ++SYP
Sbjct: 299 KRGVDKKGMCGITMQASYP 317
>gi|125606204|gb|EAZ45240.1| hypothetical protein OsJ_29883 [Oryza sativa Japonica Group]
Length = 350
Score = 145 bits (365), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 85/211 (40%), Positives = 124/211 (58%), Gaps = 22/211 (10%)
Query: 43 VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTND 102
+++ E+WM +HGR+Y D EK+ R ++++ N+E +E N N YKL N+F+DLTN+
Sbjct: 27 MLDRFEQWMIRHGRAYTDAGEKQRRFEVYRRNVELVETFNSMSN-GYKLADNKFADLTNE 85
Query: 103 EFRALYTGYK--MPSPSHRSTTSSTFKYQNLSMTDV-PTSLDWRDKGAVTPIKNQKEC-- 157
EFRA G++ + P +T S+ S D+ P S+DWR+KGAV I K C
Sbjct: 86 EFRAKMLGFRPHVTIPQISNTCSADIAMPGESSDDILPKSVDWRNKGAV--INRWKICVD 143
Query: 158 -GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQG 216
G CWAF+AVAA+EGI +I++G L+ LSEQ+L+DC GC GG AF +++ N G
Sbjct: 144 AGSCWAFSAVAAIEGINQIKNGELVSLSEQELVDCDDEA-VGCGGGYMSWAFEFVVGNHG 202
Query: 217 I----------FNGVC-GTQLDH-AVTIVGF 235
+ NG C +L+ AV I G+
Sbjct: 203 LTTEASYPYHAANGACQAAKLNQSAVAIAGY 233
Score = 76.3 bits (186), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 40/131 (30%), Positives = 64/131 (48%), Gaps = 25/131 (19%)
Query: 165 AVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIFNGVCGT 224
A+A +T +L + + Q + + +G + F + + G++ G C
Sbjct: 229 AIAGYRNVTPSSEPDLARAAAAQPVSVAVDGGS----------FMFQLYGSGVYTGPCTA 278
Query: 225 QLDHAVTIVGFGTTEDGAN----------YWLIKNSWGNTWGDAGYMKIVRD-----EGL 269
++H VT+VG+G +E + YW++KNSWG WGDAGY+ + RD GL
Sbjct: 279 DVNHGVTVVGYGESEPKTDGGGAAKGGEKYWIVKNSWGAEWGDAGYILMQRDVAGLASGL 338
Query: 270 CGIGTRSSYPL 280
CGI SYP+
Sbjct: 339 CGIALLPSYPV 349
>gi|297830594|ref|XP_002883179.1| hypothetical protein ARALYDRAFT_318695 [Arabidopsis lyrata subsp.
lyrata]
gi|297329019|gb|EFH59438.1| hypothetical protein ARALYDRAFT_318695 [Arabidopsis lyrata subsp.
lyrata]
Length = 308
Score = 145 bits (365), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 73/173 (42%), Positives = 108/173 (62%), Gaps = 16/173 (9%)
Query: 46 IHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFR 105
++E+W+ ++ ++Y EKE R KIFKENL++I++ N N+T+++G +F+DLTNDE +
Sbjct: 1 MYERWLVENRKNYNGLGEKERRCKIFKENLKFIDEHNSLPNQTFEVGLTRFADLTNDEPK 60
Query: 106 ALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAA 165
+ + Y+ + +P +DWR KGAV P+K+Q CG CWAF+A
Sbjct: 61 DF-------------MKADRYLYKEGDI--LPDEIDWRAKGAVVPVKDQGNCGSCWAFSA 105
Query: 166 VAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFAYIIQNQGI 217
V AVEGI +I++G LI LS+Q+L+DC N GC GG AF +II N GI
Sbjct: 106 VGAVEGINQIKTGELISLSDQELIDCDRGFVNAGCEGGVMNYAFEFIINNGGI 158
Score = 77.0 bits (188), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 34/75 (45%), Positives = 48/75 (64%), Gaps = 5/75 (6%)
Query: 209 AYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-- 266
A+ + G+F G CG LDH V +VG+GT+ G +YW+I+NSWG WG+ GY+K+ R+
Sbjct: 220 AFKLYKSGVFTGTCGIYLDHGVVVVGYGTSS-GEDYWIIRNSWGLNWGENGYVKLQRNID 278
Query: 267 --EGLCGIGTRSSYP 279
G CG+ SYP
Sbjct: 279 DSFGKCGVAMMPSYP 293
>gi|214015366|gb|ACJ62299.1| cysteine protease [Zea mays subsp. parviglumis]
Length = 247
Score = 145 bits (365), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 71/186 (38%), Positives = 113/186 (60%), Gaps = 18/186 (9%)
Query: 46 IHEKWMAQHGRSY-----------KDELEKEMRLKIFKENLEYIEKANKE---GNRTYKL 91
++E W ++HGR +E ++ +RL++F++NL YI+ N E G T++L
Sbjct: 1 MYEAWKSKHGRGGSNDDDCDMAPGDEEEDRRLRLEVFRDNLRYIDAHNAEADAGLHTFRL 60
Query: 92 GTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPI 151
G F+DLT +E+R G++ + S + + D+P ++DWR GAVT +
Sbjct: 61 GLTPFADLTLEEYRGRVLGFRARGRRSGARYGSGYSVRG---GDLPDAIDWRQLGAVTEV 117
Query: 152 KNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYI 211
K+Q++CG CWAF+AVAA+EG+ I +GNL+ LSEQ+++DC ++GC GG E AF ++
Sbjct: 118 KDQQQCGGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ-DSGCDGGQMEDAFRFV 176
Query: 212 IQNQGI 217
I N GI
Sbjct: 177 IGNGGI 182
>gi|110625773|ref|NP_081620.2| cathepsin L-like 3 precursor [Mus musculus]
gi|74208432|dbj|BAE26401.1| unnamed protein product [Mus musculus]
gi|187955662|gb|AAI47425.1| RIKEN cDNA 2310051M13 gene [Mus musculus]
gi|187957686|gb|AAI47424.1| RIKEN cDNA 2310051M13 gene [Mus musculus]
Length = 331
Score = 144 bits (364), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 86/235 (36%), Positives = 127/235 (54%), Gaps = 25/235 (10%)
Query: 16 TPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENL 75
TP+F++ TL + VVS+ H S+ + E+W +H ++Y E + R +++ N
Sbjct: 2 TPVFLLATLCLG----VVSAAPAHNPSLDAVWEEWKTKHKKTYNMNDEGQKR-AVWENNK 56
Query: 76 EYIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
+ I+ N++ G + L N F DLTN EFR L TG++ + T +Q
Sbjct: 57 KMIDLHNEDYLKGKHGFSLEMNAFGDLTNTEFRELMTGFQ-----GQKTKMMMKVFQEPL 111
Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
+ DVP S+DWRD G VTP+K+Q CG CWAF+AV ++EG ++G L+ LS Q L+DCS
Sbjct: 112 LGDVPKSVDWRDHGYVTPVKDQGSCGSCWAFSAVGSLEGQMFRKTGKLVPLSVQNLVDCS 171
Query: 193 -TNGNNGCLGGSREKAFAYIIQNQGI----------FNGVCG-TQLDHAVTIVGF 235
+ GN GC GG + AF Y+ N G+ NG C + A T+ GF
Sbjct: 172 WSQGNQGCDGGLPDLAFQYVKDNGGLDTSVSYPYEALNGTCRYNPKNSAATVTGF 226
>gi|402770499|gb|AFQ98384.1| cathepsin L, partial [Hyalomma anatolicum anatolicum]
Length = 312
Score = 144 bits (364), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 78/175 (44%), Positives = 107/175 (61%), Gaps = 9/175 (5%)
Query: 48 EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEF 104
E + H +SY+ ++E+ +R KIF EN I K N + G +YKLG NQF DL EF
Sbjct: 8 EAFKTTHKKSYQSKMEELLRYKIFTENSLLIAKHNAKYAKGLVSYKLGMNQFGDLLPHEF 67
Query: 105 RALYTGYKMPSPSHRSTTSSTF-KYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAF 163
++ GY R STF N++ + +P ++DWR KGAVTP+K+Q +CG CWAF
Sbjct: 68 AKMFNGYH----GERKGRGSTFLPPANVNDSSLPKTVDWRKKGAVTPVKDQGQCGSCWAF 123
Query: 164 AAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGI 217
+A ++EG ++SG L+ LSEQ L+DCS + GN GC GG + AF YI N GI
Sbjct: 124 SATGSLEGQHFLKSGKLVSLSEQNLIDCSGSFGNEGCGGGLMDNAFKYIKANDGI 178
Score = 67.8 bits (164), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 31/75 (41%), Positives = 49/75 (65%), Gaps = 4/75 (5%)
Query: 209 AYIIQNQGIFN--GVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD 266
++ + ++G+++ +LDH V VG+G ++G YWL+KNSW TWGD GY+ + RD
Sbjct: 238 SFQLYSEGVYDEPNCSSEELDHGVLAVGYGV-KNGKKYWLVKNSWAETWGDNGYILMSRD 296
Query: 267 -EGLCGIGTRSSYPL 280
+ CGI + +SYPL
Sbjct: 297 KDNQCGIASSASYPL 311
>gi|405958751|gb|EKC24845.1| Cathepsin L [Crassostrea gigas]
Length = 330
Score = 144 bits (364), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 77/180 (42%), Positives = 110/180 (61%), Gaps = 12/180 (6%)
Query: 45 EIHEKW---MAQHGRSYKDELEKEMRLKIFKENLEYIEKAN---KEGNRTYKLGTNQFSD 98
E+ +W + HG+ Y E E R+ I++ NL+YIEK N G+ ++ LG N++ D
Sbjct: 22 ELDSEWQLYLKAHGKQYGAEEEARRRV-IWEGNLDYIEKHNLAADRGDYSFWLGMNEYGD 80
Query: 99 LTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECG 158
+TN+EFR+ GYKM T+ + ++ D+P ++DWR KG VTPIKNQ +CG
Sbjct: 81 MTNEEFRSTMNGYKM----RNGTSRGSLYLPPSNIGDLPDTVDWRPKGYVTPIKNQGQCG 136
Query: 159 CCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-TNGNNGCLGGSREKAFAYIIQNQGI 217
CW+F+A ++EG T ++G L LSEQ L+DCS GN+GC GG + AF YI N GI
Sbjct: 137 SCWSFSATGSLEGQTFKKTGKLPSLSEQNLVDCSQKQGNHGCQGGLMDDAFQYIKDNNGI 196
Score = 67.8 bits (164), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 31/57 (54%), Positives = 41/57 (71%), Gaps = 2/57 (3%)
Query: 224 TQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE-GLCGIGTRSSYP 279
T+LDH V VG+GT E G +YWL+KNSWG +WG GY+ + R++ CGI T +SYP
Sbjct: 273 TRLDHGVLAVGYGT-ESGKDYWLVKNSWGESWGQKGYIMMSRNKRNNCGIATSASYP 328
>gi|405966498|gb|EKC31776.1| Cathepsin L [Crassostrea gigas]
Length = 330
Score = 144 bits (364), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 77/180 (42%), Positives = 110/180 (61%), Gaps = 12/180 (6%)
Query: 45 EIHEKW---MAQHGRSYKDELEKEMRLKIFKENLEYIEKAN---KEGNRTYKLGTNQFSD 98
E+ +W + HG+ Y E E R+ I++ NL+YIEK N G+ ++ LG N++ D
Sbjct: 22 ELDSEWQLYLKAHGKQYGAEEEARRRV-IWEGNLDYIEKHNLAADRGDYSFWLGMNEYGD 80
Query: 99 LTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECG 158
+TN+EFR+ GYKM T+ + ++ D+P ++DWR KG VTPIKNQ +CG
Sbjct: 81 MTNEEFRSTMNGYKM----RNGTSRGSLYLPPSNIGDLPDTVDWRPKGYVTPIKNQGQCG 136
Query: 159 CCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-TNGNNGCLGGSREKAFAYIIQNQGI 217
CW+F+A ++EG T ++G L LSEQ L+DCS GN+GC GG + AF YI N GI
Sbjct: 137 SCWSFSATGSLEGQTFKKTGKLPSLSEQNLVDCSQKQGNHGCQGGLMDDAFQYIKDNSGI 196
Score = 67.8 bits (164), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 31/57 (54%), Positives = 41/57 (71%), Gaps = 2/57 (3%)
Query: 224 TQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE-GLCGIGTRSSYP 279
T+LDH V VG+GT E G +YWL+KNSWG +WG GY+ + R++ CGI T +SYP
Sbjct: 273 TRLDHGVLAVGYGT-ESGKDYWLVKNSWGESWGQKGYIMMSRNKRNNCGIATSASYP 328
>gi|54020908|ref|NP_001005695.1| cathepsin S precursor [Xenopus (Silurana) tropicalis]
gi|49522293|gb|AAH75261.1| cathepsin S [Xenopus (Silurana) tropicalis]
Length = 333
Score = 144 bits (364), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 74/172 (43%), Positives = 104/172 (60%), Gaps = 5/172 (2%)
Query: 50 WMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEFRA 106
W H + Y+DE+E R +++NL + N E G TY+LG N +D+T++E ++
Sbjct: 30 WKNTHNKDYEDEIEDLQRRITWEKNLNLVNMHNLEYSMGMHTYELGMNHLADMTSEEIKS 89
Query: 107 LYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAV 166
TG +P S R T S+ K VP S+DWRDKG V+ +KNQ CG CWAF+AV
Sbjct: 90 KLTGLILPPQSERQATFSSQKNSTFG-GKVPDSIDWRDKGCVSDVKNQGGCGSCWAFSAV 148
Query: 167 AAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGI 217
A+EG +++G L+ LS Q L+DCS+ GN GC GG +AF Y+I N+GI
Sbjct: 149 GALEGQLMLKTGKLVSLSPQNLVDCSSKYGNKGCGGGFMTQAFQYVIDNKGI 200
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 31/82 (37%), Positives = 52/82 (63%), Gaps = 6/82 (7%)
Query: 200 LGGSREKAFAYIIQNQGIF-NGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDA 258
+ G+R F Y G++ + C +++H V VG+G +G ++WL+KNSWG +GD
Sbjct: 254 IDGTRPSFFLY---RSGVYSDPTCSHEVNHGVLAVGYGNL-NGQDFWLLKNSWGTKYGDQ 309
Query: 259 GYMKIVRDEG-LCGIGTRSSYP 279
GY++I R++G LCG+ + + YP
Sbjct: 310 GYVRIARNKGNLCGVASYTCYP 331
>gi|356549192|ref|XP_003542981.1| PREDICTED: cysteine proteinase RD21a-like [Glycine max]
Length = 517
Score = 144 bits (364), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 72/180 (40%), Positives = 113/180 (62%), Gaps = 7/180 (3%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTY--KLGTNQFS 97
E+ V+E+ ++W ++ + Y+ ++++R + FK NL+YI + N + Y LG N+F+
Sbjct: 43 EEGVIELFQRWKEENKKIYRSPDQEKLRFENFKRNLKYIAEKNSKRISPYGQSLGLNRFA 102
Query: 98 DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKEC 157
D++N+EF++ +T S R+ S ++ S D P SLDWR KG VT +K+Q C
Sbjct: 103 DMSNEEFKSKFTSKVKKPFSKRNGLSG----KDHSCEDAPYSLDWRKKGVVTAVKDQGYC 158
Query: 158 GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
GCCWAF++ A+EGI I SG+LI LSE +L+DC N+GC GG + AF +++ N GI
Sbjct: 159 GCCWAFSSTGAIEGINAIVSGDLISLSEPELVDCDRT-NDGCDGGHMDYAFEWVMHNGGI 217
Score = 63.5 bits (153), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 30/71 (42%), Positives = 46/71 (64%), Gaps = 8/71 (11%)
Query: 216 GIFNGVCGTQ---LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE----G 268
GI++G C + +DHA+ +VG+G+ D +YW++KNSWG +WG GY+ I R+ G
Sbjct: 283 GIYDGDCSSDPDDIDHAILVVGYGSEGD-EDYWIVKNSWGTSWGMEGYIYIRRNTNLKYG 341
Query: 269 LCGIGTRSSYP 279
+C I +SYP
Sbjct: 342 VCAINYMASYP 352
>gi|162460343|ref|NP_001105479.1| cysteine protease2 precursor [Zea mays]
gi|1491774|emb|CAA68192.1| cysteine protease [Zea mays]
Length = 360
Score = 144 bits (364), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 95/303 (31%), Positives = 141/303 (46%), Gaps = 76/303 (25%)
Query: 49 KWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALY 108
++ ++G+SY+ E R +IF E+L+ + N++G +Y+LG N+F+D++ +EFRA
Sbjct: 61 RFAVRYGKSYESAAEVHKRFRIFSESLQLVRSTNRKG-LSYRLGINRFADMSWEEFRAT- 118
Query: 109 TGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAA 168
++ + + S T + + +P + DWR+ G V+P+KNQ CG CW F+ A
Sbjct: 119 ---RLGAAQNCSATLTGNHRMRAAAVALPETKDWREDGIVSPVKNQGHCGSCWTFSTTGA 175
Query: 169 VEGITKIRSGNLIQLSEQQLLDCSTNGNN-GCLGGSREKAFAYIIQNQGI-------FNG 220
+E +G I LSEQQL+DC NN GC GG +AF YI N G+ + G
Sbjct: 176 LEAAYTQATGKPISLSEQQLVDCGLAFNNFGCNGGLPSQAFEYIKYNGGLDTEESYPYQG 235
Query: 221 VCGT------------------------QLDHAVTIV-----------GF---------- 235
V G +L AV +V GF
Sbjct: 236 VNGISKFKNENVGVKVLDSVNITLGAEDELKDAVGLVRPVSVAFEVITGFRLYKSGVYTS 295
Query: 236 ---GTTEDGANY---------------WLIKNSWGNTWGDAGYMKIVRDEGLCGIGTRSS 277
GTT N+ WLIKNSWG WGD GY K+ + +CG+ T +S
Sbjct: 296 DHCGTTPMDVNHAVLAVGYGVEDGVPYWLIKNSWGADWGDEGYFKMEMGKNMCGVATCAS 355
Query: 278 YPL 280
YP+
Sbjct: 356 YPI 358
>gi|326490904|dbj|BAJ90119.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 457
Score = 144 bits (364), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 79/206 (38%), Positives = 108/206 (52%), Gaps = 18/206 (8%)
Query: 48 EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANK------EGNRTYKLGTNQFSDLTN 101
E W A+HG++Y E+ RL F EN ++ N G +Y L N F+DLT+
Sbjct: 40 EAWCAEHGKAYATPGERAARLAAFAENAAFVAAHNDAVASSGPGGPSYTLALNAFADLTH 99
Query: 102 DEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCW 161
DEFRA G P S + + VP +LDWR GAVT +K+Q CG CW
Sbjct: 100 DEFRAARLGRLAVGPGPLGAPSPSDGGFEGRVGAVPDALDWRQSGAVTKVKDQGSCGACW 159
Query: 162 AFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI---- 217
+F+A A+EGI KI +G+L+ LSEQ+L+DC + N GC GG A+ ++I+N GI
Sbjct: 160 SFSATGAMEGINKITTGSLLSLSEQELIDCDRSYNTGCGGGLMTYAYKFVIKNGGIDTED 219
Query: 218 ------FNGVCGTQ--LDHAVTIVGF 235
+G C H VTI G+
Sbjct: 220 DYPFREADGTCNKNKLKKHVVTIDGY 245
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 47/121 (38%), Positives = 68/121 (56%), Gaps = 18/121 (14%)
Query: 166 VAAVEGITKIRSGN---LIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIFNGVC 222
V ++G ++ S L+Q QQ + G+ A A+ + +QGIF+G C
Sbjct: 239 VVTIDGYKEVPSSKEDLLLQAVAQQPISVGICGS----------ARAFQLYSQGIFDGPC 288
Query: 223 GTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSY 278
T LDHAV IVG+G+ E G +YW++KNSWG WG GYM + R+ G+CGI +S+
Sbjct: 289 PTSLDHAVLIVGYGS-EGGKDYWIVKNSWGERWGMKGYMHMHRNTGSSSGICGINMMASF 347
Query: 279 P 279
P
Sbjct: 348 P 348
>gi|330794859|ref|XP_003285494.1| hypothetical protein DICPUDRAFT_149375 [Dictyostelium purpureum]
gi|325084585|gb|EGC38010.1| hypothetical protein DICPUDRAFT_149375 [Dictyostelium purpureum]
Length = 421
Score = 144 bits (364), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 80/189 (42%), Positives = 111/189 (58%), Gaps = 14/189 (7%)
Query: 35 SRSTHEQSVVEIHEK-----WMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTY 89
S +T +Q E+ + WM Q+ R Y E E R IFK N++Y+++ N +G+ T
Sbjct: 13 SFATAKQQFSELQYRNAFTNWMIQNQRHYASE-EFATRYNIFKANMDYVQEWNSKGSETI 71
Query: 90 KLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVT 149
LG N F+D+TN E+RA Y G + S T + ++DWR KGAVT
Sbjct: 72 -LGLNAFADITNQEYRANYLGTPFDASSIVGTETEKI------FAAPAATVDWRTKGAVT 124
Query: 150 PIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-TNGNNGCLGGSREKAF 208
PIKNQ++CG CW+F+ + EG +I +GNL+ LSEQ L+DCS + GN+GC GG AF
Sbjct: 125 PIKNQQQCGGCWSFSTTGSTEGAHQISTGNLVSLSEQNLIDCSGSYGNDGCNGGLMTLAF 184
Query: 209 AYIIQNQGI 217
YII N+GI
Sbjct: 185 EYIINNKGI 193
Score = 47.0 bits (110), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 19/38 (50%), Positives = 26/38 (68%), Gaps = 1/38 (2%)
Query: 243 NYWLIKNSWGNTWGDAGYMKIVRDE-GLCGIGTRSSYP 279
NYW++KNSWG WG GY+ + +D CGI + +SYP
Sbjct: 382 NYWIVKNSWGTGWGMEGYILMSKDRNNNCGIASMASYP 419
>gi|217323618|gb|ACK38176.1| midgut cysteine peptidase, partial [Sphenophorus levis]
Length = 324
Score = 144 bits (364), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 77/200 (38%), Positives = 115/200 (57%), Gaps = 15/200 (7%)
Query: 19 FIIITLLV-SCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
FI+ +LLV + ++ ++ H QS + +HG++YK++ E+ R IF+ENL
Sbjct: 4 FILASLLVVAVSATLLKEDGVHFQS-------FKLKHGKTYKNQAEETKRFAIFRENLRK 56
Query: 78 IEKAN---KEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT 134
IE N K+G +Y G N+F+D+T EF+A+ PS +T + +Q
Sbjct: 57 IEAHNAEYKQGIHSYTQGINKFADMTRAEFKAMLATQVKTKPSIVATKT----FQLADGV 112
Query: 135 DVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN 194
VP S+DWR + VTPIK+Q +CG CW+FA V + EG + +G L + SEQQL+DC+T+
Sbjct: 113 SVPESIDWRSRNVVTPIKDQAQCGSCWSFAVVGSTEGAYALSTGKLTRFSEQQLVDCTTD 172
Query: 195 GNNGCLGGSREKAFAYIIQN 214
N GC GG + F YI N
Sbjct: 173 LNYGCDGGYLDDTFPYIQTN 192
Score = 68.9 bits (167), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 40/116 (34%), Positives = 63/116 (54%), Gaps = 16/116 (13%)
Query: 172 ITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQN-QGIFNGVCGTQ----- 225
+TK+ S + +EQ LL+ +G + A A + Q F+G+ +
Sbjct: 217 VTKVSSYVSVPANEQALLEA--------VGTAGPVAIAINADDLQFYFSGIIDDKYCDPE 268
Query: 226 -LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDEGLCGIGTRSSYPL 280
LDH V VG+ +E+G +YWLIKNSWG WG++GY + +R + +CG+ + YPL
Sbjct: 269 WLDHGVLAVGY-NSENGLDYWLIKNSWGADWGESGYFRFLRGQNICGVKEDAVYPL 323
>gi|214015303|gb|ACJ62268.1| cysteine protease [Zea mays subsp. parviglumis]
Length = 251
Score = 144 bits (363), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 71/188 (37%), Positives = 114/188 (60%), Gaps = 18/188 (9%)
Query: 46 IHEKWMAQHGRSYKDE-------------LEKEMRLKIFKENLEYIEKANKE---GNRTY 89
++E W ++HGR +++ ++ +RL++F++NL YI+ N E G T+
Sbjct: 1 MYEAWKSKHGRGGRNDDCDMAPGDEEQEEEDRRLRLEVFRDNLRYIDAHNAEADAGLHTF 60
Query: 90 KLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVT 149
+LG F+DLT +E+R G++ + S Y+ D+P ++DWR GAVT
Sbjct: 61 RLGLTPFADLTLEEYRGRVLGFRARGRRSGARYGSGSGYRARG-GDLPDAIDWRQLGAVT 119
Query: 150 PIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFA 209
+K+Q++CG CWAF+AVAA+EG+ I +GNL+ LSEQ+++DC ++GC GG E AF
Sbjct: 120 EVKDQQQCGGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ-DSGCDGGQMENAFR 178
Query: 210 YIIQNQGI 217
++I N GI
Sbjct: 179 FVIGNGGI 186
>gi|357126406|ref|XP_003564878.1| PREDICTED: cysteine proteinase EP-B 1-like [Brachypodium
distachyon]
Length = 377
Score = 144 bits (363), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 76/190 (40%), Positives = 111/190 (58%), Gaps = 20/190 (10%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRT--------YKL 91
E+++ E++ +W + H + EK R FK N+ +I N N T Y+L
Sbjct: 35 EEALWELYTRWQSAHRLPPQHHAEKHRRFGTFKSNVLFIHAHNTRLNDTSTNNNGPSYRL 94
Query: 92 GTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSST----FKYQNLSMTDVPTSLDWRDKGA 147
N+F D+ EFR+ + G P HR T + F Y ++ D+P ++DWR KGA
Sbjct: 95 RLNRFGDMDQAEFRSTFAG-----PLHRHTRPAQSIPGFIYD--TVKDIPQAVDWRQKGA 147
Query: 148 VTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREK 206
VT +K+Q +CG CWAF+AVA+VEG+ IR+G+L+ LSEQ+L+DC T G +NGC GG E
Sbjct: 148 VTGVKDQGKCGSCWAFSAVASVEGLNAIRTGSLVSLSEQELIDCDTGGDDNGCQGGLMES 207
Query: 207 AFAYIIQNQG 216
AF +I + G
Sbjct: 208 AFEFIAHSAG 217
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 37/77 (48%), Positives = 53/77 (68%), Gaps = 5/77 (6%)
Query: 209 AYIIQNQGIFNGVCGTQLDHAVTIVGFGTTE-DGANYWLIKNSWGNTWGDAGYMKIVRDE 267
A+ ++G+F G CG++LDH V +VG+G E DG YW++KNSWG WG+ GY+++ RD
Sbjct: 279 AFQFYSEGVFTGDCGSELDHGVAVVGYGVAEEDGKEYWIVKNSWGPGWGEHGYVRMQRDS 338
Query: 268 ----GLCGIGTRSSYPL 280
GLCGI +SYP+
Sbjct: 339 GVDGGLCGIAMEASYPV 355
>gi|217072214|gb|ACJ84467.1| unknown [Medicago truncatula]
gi|388506066|gb|AFK41099.1| unknown [Medicago truncatula]
Length = 249
Score = 144 bits (363), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 76/180 (42%), Positives = 114/180 (63%), Gaps = 11/180 (6%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
E+S+ +++E+W + H + + EK R +FK N+ ++ NK ++ YKL N+F+D+
Sbjct: 33 EKSLWDLYERWRSHHTVTRSLD-EKNNRFNVFKANVMHVHNTNKL-DKPYKLKLNKFADM 90
Query: 100 TNDEFRALYTGYKMPSPSHR-----STTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQ 154
TN EFR++Y K+ HR S + F Y+N+ VP+S+DWR GAVT +K+Q
Sbjct: 91 TNYEFRSIYADSKVNH--HRMFRGMSHDNGPFMYENVE--GVPSSIDWRKIGAVTGVKDQ 146
Query: 155 KECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQN 214
+CG CWAF+ + AVEGI +I++ L+ LSEQ+L+DC T N GC GG E AF +I QN
Sbjct: 147 GQCGSCWAFSTIVAVEGINQIKTQKLVSLSEQELVDCDTEVNQGCNGGLMECAFEFIKQN 206
>gi|194352758|emb|CAQ00107.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
Length = 457
Score = 144 bits (363), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 79/206 (38%), Positives = 108/206 (52%), Gaps = 18/206 (8%)
Query: 48 EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANK------EGNRTYKLGTNQFSDLTN 101
E W A+HG++Y E+ RL F EN ++ N G +Y L N F+DLT+
Sbjct: 40 EAWCAEHGKAYATPGERAARLAAFAENAAFVAAHNDAVASSGPGGPSYTLALNAFADLTH 99
Query: 102 DEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCW 161
DEFRA G P S + + VP +LDWR GAVT +K+Q CG CW
Sbjct: 100 DEFRAARLGRLAVGPGPLGAPSPSDGGFEGRVGAVPDALDWRQSGAVTKVKDQGSCGACW 159
Query: 162 AFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI---- 217
+F+A A+EGI KI +G+L+ LSEQ+L+DC + N GC GG A+ ++I+N GI
Sbjct: 160 SFSATGAMEGINKITTGSLLSLSEQELIDCDRSYNTGCGGGLMTYAYKFVIKNGGIDTED 219
Query: 218 ------FNGVCGTQ--LDHAVTIVGF 235
+G C H VTI G+
Sbjct: 220 DYPFREADGTCNKNKLKKHVVTIDGY 245
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 46/120 (38%), Positives = 67/120 (55%), Gaps = 18/120 (15%)
Query: 166 VAAVEGITKIRSGN---LIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIFNGVC 222
V ++G ++ S L+Q QQ + G+ A A+ + +QGIF+G C
Sbjct: 239 VVTIDGYKEVPSSKEDLLLQAVAQQPISVGICGS----------ARAFQLYSQGIFDGPC 288
Query: 223 GTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSY 278
T LDHAV IVG+G+ E G +YW++KNSWG WG GYM + R+ G+CGI +S+
Sbjct: 289 PTSLDHAVLIVGYGS-EGGKDYWIVKNSWGERWGMKGYMHMHRNTGSSSGICGINMMASF 347
>gi|146217394|gb|ABQ10739.1| cathepsin L [Penaeus monodon]
Length = 341
Score = 144 bits (363), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 77/183 (42%), Positives = 113/183 (61%), Gaps = 9/183 (4%)
Query: 43 VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANK---EGNRTYKLGTNQFSDL 99
V+E E + +H + Y E+E+ R+KIF EN I NK +G+ TYKL N++ D+
Sbjct: 25 VLEEWEAFKLEHSKKYDSEVEESFRMKIFTENKHKIANHNKGFAQGHHTYKLSMNKYGDM 84
Query: 100 TNDEFRALYTGYKMPS----PSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQK 155
+ EF + G++ ++R+ T +TF + + +P ++DWR KGAVTPIK+Q
Sbjct: 85 LHHEFVSTMNGFRGNHTGGYKNNRAYTGATFIEPDDDVQ-LPKNVDWRTKGAVTPIKDQG 143
Query: 156 ECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQN 214
+CG CWAF+A A+EG T ++G L+ LSEQ L+DCS GNNGC GG + AF Y+ +N
Sbjct: 144 QCGSCWAFSATGALEGQTFRKTGQLVSLSEQNLVDCSRKFGNNGCNGGLMDNAFEYVKEN 203
Query: 215 QGI 217
GI
Sbjct: 204 GGI 206
Score = 78.2 bits (191), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 31/56 (55%), Positives = 43/56 (76%), Gaps = 1/56 (1%)
Query: 226 LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 280
LDH V +VG+G +DG +YWL+KNSWG TWGD GY+K+ R+ + CGI + +S+PL
Sbjct: 285 LDHGVLVVGYGIDDDGTDYWLVKNSWGTTWGDQGYVKMARNRDNQCGIASSASFPL 340
>gi|297740488|emb|CBI30670.3| unnamed protein product [Vitis vinifera]
Length = 206
Score = 144 bits (363), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 79/186 (42%), Positives = 114/186 (61%), Gaps = 25/186 (13%)
Query: 101 NDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCC 160
N E +++ + P+ RS T+S F++QN+ T +P+++DWR K VT IKNQ +CG C
Sbjct: 41 NSEPLVIHSKFP-PTLGLRSETTS-FRHQNV--TRIPSTMDWRKKRTVTHIKNQLQCGGC 96
Query: 161 WAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNN-GCLGGSREKAFAYIIQNQGIFN 219
WAF+AVAA+EGI K+++ I LSEQ+L+DC G+N GC GG + AF +IIQN+G+ +
Sbjct: 97 WAFSAVAAMEGIAKLQTSKSISLSEQELVDCDIFGSNIGCEGGCMDDAFKFIIQNRGLNS 156
Query: 220 GVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR----DEGLCGIGTR 275
+ DG +WL+KNSWG WG+ GY ++ R GLCG +
Sbjct: 157 EA----------------SADGKKHWLVKNSWGTDWGENGYTRMERGVKATTGLCGFTMQ 200
Query: 276 SSYPLA 281
+SYP A
Sbjct: 201 ASYPTA 206
>gi|146216002|gb|ABQ10203.1| cysteine protease Cp5 [Actinidia deliciosa]
Length = 509
Score = 144 bits (363), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 81/220 (36%), Positives = 127/220 (57%), Gaps = 22/220 (10%)
Query: 36 RSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRT--YKLGT 93
S E+ VVE+ +KW +HG+ YK E E + + F++NL Y+ + N E + + +G
Sbjct: 40 ESIAEERVVELFKKWTEKHGKVYKHGQEVEKKFQNFRDNLRYVMEKNGERGASGGHLVGL 99
Query: 94 NQFSDLTNDEFRALYTGYKMPSPS------HRSTTSSTFKYQNLSMTDVPTSLDWRDKGA 147
N+F+D++N+EFR +Y K+ P+ R + ++ D PTSLDWR G
Sbjct: 100 NKFADMSNEEFREVYVS-KVKKPTSKRMAIERRRQGKAAAAKAVAACDGPTSLDWRKYGI 158
Query: 148 VTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKA 207
VT +K+Q +CG CWAF++ A+EGI + +G+LI LSEQ+L+DC + N+GC GG + A
Sbjct: 159 VTGVKDQGDCGSCWAFSSTGAIEGINALANGDLISLSEQELVDCDST-NDGCEGGYMDYA 217
Query: 208 FAYIIQNQGIF----------NGVCGTQLDH--AVTIVGF 235
F +++ N GI +G C T + AV+I G+
Sbjct: 218 FEWVMSNGGIDTETDYPYTGEDGTCNTTKEETKAVSIDGY 257
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 29/56 (51%), Positives = 34/56 (60%), Gaps = 5/56 (8%)
Query: 228 HAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR----DEGLCGIGTRSSYP 279
HAV +VG+G E G YW+IKNSWG WG GY I R D G+C I +SYP
Sbjct: 308 HAVLVVGYGA-ESGEEYWIIKNSWGTDWGMKGYAYIKRNTSKDYGVCAINAMASYP 362
>gi|226499884|ref|NP_001148278.1| thiol protease SEN102 precursor [Zea mays]
gi|195617112|gb|ACG30386.1| thiol protease SEN102 precursor [Zea mays]
Length = 374
Score = 144 bits (363), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 88/229 (38%), Positives = 125/229 (54%), Gaps = 29/229 (12%)
Query: 29 ASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNR- 87
A + S S + S++E ++W A + +SY E+ R +++ N+ YIE N E
Sbjct: 32 AGDTMGSMSNDDSSMIERFQRWKAAYNKSYATVAEERRRFRVYARNMAYIEATNAEAEAA 91
Query: 88 --TYKLGTNQFSDLTNDEFRALYTGYKMPS-PSHRSTTSSTFK--------------YQN 130
TY+LG ++DLTN EF A+YT + P+ S ++ Y N
Sbjct: 92 GLTYELGETAYTDLTNQEFMAMYTAPALAQLPADESVITTRAGPVDAVGGAPGQLPVYVN 151
Query: 131 LSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLD 190
LS + P S+DWR GAVTP+KNQ CG CWAF+ VA VEGI +IR+G L+ LSEQ+L+D
Sbjct: 152 LSAS-APASVDWRASGAVTPVKNQGRCGSCWAFSTVAVVEGIYQIRTGKLVSLSEQELVD 210
Query: 191 CSTNGNNGCLGGSREKAFAYIIQNQGIFNGVCGTQLDHAVTIVGFGTTE 239
C T ++GC GG +A +I N GI T+ D+ T GTT+
Sbjct: 211 CDTL-DDGCDGGISYRALRWIASNGGIT-----TEADYPYT----GTTD 249
Score = 87.8 bits (216), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 38/72 (52%), Positives = 50/72 (69%), Gaps = 6/72 (8%)
Query: 215 QGIFNGVCGTQLDHAVTIVGFGT-TEDGANYWLIKNSWGNTWGDAGYMKIVRD-----EG 268
+G++NG CGT L+H VT+VG+G G YW++KNSWG WGD GY+++ +D EG
Sbjct: 302 KGVYNGPCGTNLNHGVTVVGYGQEAAAGDRYWIVKNSWGQGWGDDGYIRMKKDVAGKPEG 361
Query: 269 LCGIGTRSSYPL 280
LCGI R SYPL
Sbjct: 362 LCGIAIRPSYPL 373
>gi|392884266|gb|AFM90965.1| cathepsin L [Callorhinchus milii]
Length = 338
Score = 144 bits (363), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 80/206 (38%), Positives = 119/206 (57%), Gaps = 18/206 (8%)
Query: 19 FIIITLLVS---CASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENL 75
F++++L ++ A + TH E+W + HG+SY ++ E+ R +++E+L
Sbjct: 5 FVVLSLCLAGGLAAPSLDPGLDTH-------WEQWKSWHGKSY-EQKEETWRRMVWEEHL 56
Query: 76 EYIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
IE N E G +++LG N F D+ N+EFR L GYK +H+ S F N
Sbjct: 57 RVIEIHNLEHSLGKHSFRLGMNHFGDMPNEEFRQLMNGYKYKQ-THKKLQGSHFLEPNF- 114
Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
+VP +DWRD+G VTP+K+Q +CG CWAF+ A+EG R+G L+ LSEQ L++CS
Sbjct: 115 -LEVPKHVDWRDEGYVTPVKDQGQCGSCWAFSTTGALEGQHFRRTGQLVSLSEQNLVECS 173
Query: 193 -TNGNNGCLGGSREKAFAYIIQNQGI 217
GN GC GG ++AF Y+ N GI
Sbjct: 174 KPEGNEGCNGGLMDQAFQYVKDNGGI 199
Score = 65.1 bits (157), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 31/68 (45%), Positives = 41/68 (60%), Gaps = 5/68 (7%)
Query: 218 FNGVCG-TQLDHAVTIVGFGTTE---DGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGI 272
F C T LDH V +VG+G + DG YW++KNSW WG GY+ + +D + CGI
Sbjct: 270 FEAECSSTDLDHGVLVVGYGVEKRDTDGKKYWIVKNSWSEKWGQNGYILMAKDKDNHCGI 329
Query: 273 GTRSSYPL 280
T +SYPL
Sbjct: 330 ATAASYPL 337
>gi|348545637|ref|XP_003460286.1| PREDICTED: cathepsin L-like [Oreochromis niloticus]
Length = 334
Score = 144 bits (363), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 102/315 (32%), Positives = 152/315 (48%), Gaps = 83/315 (26%)
Query: 44 VEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN---KEGNRTYKLGTNQFSDLT 100
+E H W + +SY E+ R +I+ N + + N +G ++Y+LG F+++
Sbjct: 24 LEFH-AWKLKFEKSYDSPSEEAHRKQIWLSNRKLVLMHNILTDQGLKSYRLGMTYFANME 82
Query: 101 NDEFRALYT----GYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKE 156
N+E++ L + G S S R +T + Q T +P ++DWRDKG VT +K+QK+
Sbjct: 83 NEEYKQLVSQGCLGSFNGSLSRRGSTFA----QLPEGTALPNTVDWRDKGYVTEVKDQKQ 138
Query: 157 CGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQ 215
CG CWAF+A A+EG ++G L+ LSEQQL+DCS+N GN+GC+GG + AF YI N+
Sbjct: 139 CGSCWAFSATGALEGQHFRKTGTLVSLSEQQLVDCSSNFGNSGCMGGWMDFAFKYIKYNR 198
Query: 216 GI----------FNGVCGTQLD---------------------HAVTIVG---------- 234
GI NG+C + D AV VG
Sbjct: 199 GIDTEEFYPYEAKNGLCRYKRDSIGATCSGYIIVKRFEEQALKEAVATVGPISVTIDASR 258
Query: 235 --FGTTEDGA--------------------------NYWLIKNSWGNTWGDAGYMKIVRD 266
F E G +YWL+KNSWG WG+ GY+++ R+
Sbjct: 259 PSFQLYESGVYYDDGCGSIFLNHAVLAVGYGTENGHDYWLVKNSWGLGWGEKGYIRMSRN 318
Query: 267 -EGLCGIGTRSSYPL 280
+ CGI + + YPL
Sbjct: 319 KKNQCGIASVARYPL 333
>gi|225709022|gb|ACO10357.1| Cathepsin L precursor [Caligus rogercresseyi]
Length = 332
Score = 144 bits (363), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 76/174 (43%), Positives = 107/174 (61%), Gaps = 9/174 (5%)
Query: 48 EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEF 104
E W HG+SY+ +E+++RLKI EN I + N E G +Y + N + DL + EF
Sbjct: 28 ESWKLTHGKSYESSIEEKLRLKIHMENSLKISRHNAEAINGKHSYYMKMNHYGDLLHHEF 87
Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFA 164
A+ GY+ + + S S +N+ + PT +DWR+ GAVTP+KNQ +CG CWAF+
Sbjct: 88 VAMVNGYEYVNKT--SLGGSFIPSKNVKL---PTHVDWREDGAVTPVKNQGQCGSCWAFS 142
Query: 165 AVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGI 217
+ ++EG T ++G LI LSEQ L+DCS GNNGC GG + AF YI N+GI
Sbjct: 143 STGSLEGQTFRKTGKLIPLSEQNLVDCSRKYGNNGCEGGLMDFAFTYIRDNKGI 196
>gi|4733887|gb|AAD02173.3| cysteine proteinase [Acanthamoeba culbertsoni]
Length = 482
Score = 144 bits (363), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 78/204 (38%), Positives = 113/204 (55%), Gaps = 8/204 (3%)
Query: 19 FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
F + LLV +VS S E+ + WM +H RSY ++ E R ++EN+++I
Sbjct: 36 FTELALLVLACLTLVSCVSLRERELQGQFNSWMRRHARSYSND-EFLERYNTWRENMDFI 94
Query: 79 EKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHR-----STTSSTFKYQNLSM 133
E+ N+ GN T+ + N+ DLT +EF LY G P+ + S+ + +
Sbjct: 95 EEFNR-GNHTFTVAMNEHGDLTPEEFARLYMGQVSPASEQELQERIAAESAMEDEHHHTR 153
Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST 193
+P + DWR KGAVTP+KNQ C CWAF A AVEG+ KI G+L+ LS+Q LLDC+
Sbjct: 154 ASIPANWDWRTKGAVTPVKNQGSCASCWAFVATGAVEGVRKIAGGSLVSLSDQMLLDCAV 213
Query: 194 -NGNNGCLGGSREKAFAYIIQNQG 216
GN GC GG+ E + ++I N
Sbjct: 214 GTGNQGCSGGNVEITYRWMISNNA 237
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 37/117 (31%), Positives = 57/117 (48%), Gaps = 10/117 (8%)
Query: 166 VAAVEGITKIRSGNLIQLSEQQLL-DCSTNGNNGCLGGSREKAFAYIIQNQGIFNGVCG- 223
V + I ++R+G SE LL + + GS+ Y ++ C
Sbjct: 262 VQGIRNIMRVRAG-----SESDLLAKAAIAPVTVAIDGSKRSFMFY--SGGYYYDPTCSS 314
Query: 224 TQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE-GLCGIGTRSSYP 279
T L+HAV +VG+GT +YW+ KN WG WGD GY+ + R++ CGI + + P
Sbjct: 315 TNLNHAVLVVGWGTDPQRGDYWIAKNEWGTAWGDDGYVYMARNKNNNCGIASLAVLP 371
>gi|320164780|gb|EFW41679.1| cathepsin L2 [Capsaspora owczarzaki ATCC 30864]
Length = 334
Score = 144 bits (363), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 73/171 (42%), Positives = 103/171 (60%), Gaps = 2/171 (1%)
Query: 48 EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRAL 107
E W G+SY D +E+ R +++ N ++ N G +Y LG N F+DLT++EF+
Sbjct: 31 EAWKRTFGKSYSDAVEEINRRAVWEANKMLVDAHNGAGIHSYTLGMNIFADLTHEEFKRF 90
Query: 108 YTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVA 167
Y G K+ RS SSTF ++ +P S+DWR G VTP+K+Q +CG CW+F+
Sbjct: 91 YLGTKVDLNRPRSNFSSTF-IPTANVGALPDSVDWRTAGIVTPVKDQGQCGSCWSFSTTG 149
Query: 168 AVEGITKIRSGNLIQLSEQQLLDCS-TNGNNGCLGGSREKAFAYIIQNQGI 217
+VEG ++G L+ LSEQ L+DCS GN GC GG + AF YII N+GI
Sbjct: 150 SVEGQHARKTGQLVSLSEQNLVDCSKAQGNQGCNGGLMDDAFQYIITNKGI 200
Score = 67.0 bits (162), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 32/75 (42%), Positives = 48/75 (64%), Gaps = 4/75 (5%)
Query: 209 AYIIQNQGIFN--GVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD 266
++ + G++N T LDH V G+GT+ +G YWL+KNSWG++WG AGY+ + R+
Sbjct: 260 SFQLYTSGVYNEKKCSSTSLDHGVLAAGYGTS-NGTPYWLVKNSWGSSWGQAGYIWMSRN 318
Query: 267 -EGLCGIGTRSSYPL 280
CGI T +SYP+
Sbjct: 319 ANNQCGIATSASYPI 333
>gi|226531284|ref|NP_001147086.1| thiol protease SEN102 precursor [Zea mays]
gi|195607128|gb|ACG25394.1| thiol protease SEN102 precursor [Zea mays]
Length = 356
Score = 144 bits (363), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 73/208 (35%), Positives = 121/208 (58%), Gaps = 12/208 (5%)
Query: 22 ITLLVSCASQVVSSRSTHEQSVVEIHEK---WMAQHGRSYKDELEKEMRLKIFKENLEYI 78
+ L+ +C+ + + + + + + E+ W A++ R+Y E + R I+ EN+ +I
Sbjct: 10 LALMFACSLLLAGTAFSDDTIAIPLLERFKAWQAEYNRTYATPEEFQQRFMIYSENVRFI 69
Query: 79 EKANK-EGNRTYKLGTNQFSDLTNDEFRALYT---GYKMPSPSHRSTTSSTFKYQNLS-- 132
+ N+ +Y+LG NQF+DLT +EF+ Y + P+ T T +S
Sbjct: 70 KTMNQLSTGSSYELGENQFTDLTEEEFKDTYLMKLDEQPPAAEAMGPTVGTMSTAGMSNG 129
Query: 133 --MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLD 190
+ P S+DWR KGAVT +K+Q++CG CWAFA VA++EG+ +I++G L+ LSEQ+++D
Sbjct: 130 NNTGEAPNSVDWRTKGAVTRVKDQQQCGSCWAFATVASIEGVHQIKTGRLVSLSEQEIVD 189
Query: 191 CSTNGN-NGCLGGSREKAFAYIIQNQGI 217
C GN NGC GGS A ++ +N G+
Sbjct: 190 CDRGGNDNGCRGGSPRSAMEWVTRNGGL 217
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 31/80 (38%), Positives = 46/80 (57%), Gaps = 8/80 (10%)
Query: 209 AYIIQNQGIFNGVC-GTQLDHAVTIVGFGTT---EDGANYWLIKNSWGNTWGDAGY---- 260
A+ G+F+G C T ++H VT+VG+G+T G YW++KNSWG WG+ GY
Sbjct: 276 AFQFYKSGVFSGPCDTTTVNHVVTVVGYGSTGSDSGGRKYWIVKNSWGQGWGENGYVRMA 335
Query: 261 MKIVRDEGLCGIGTRSSYPL 280
++ EG+C I YP+
Sbjct: 336 RRVRAREGMCAIAIEPYYPV 355
>gi|281207557|gb|EFA81740.1| hypothetical protein PPL_05734 [Polysphondylium pallidum PN500]
Length = 387
Score = 144 bits (362), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 78/205 (38%), Positives = 122/205 (59%), Gaps = 7/205 (3%)
Query: 18 MFIIITLLVSCAS---QVVSSRST-HEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKE 73
M+ + L++C V+S+ +T E+ + WM H Y + E R +FK+
Sbjct: 1 MYRLSVYLLACTVFMLAVLSANATLTERQYQDSFVSWMQTHNVKYTTQ-EFNHRYGVFKK 59
Query: 74 NLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
NL ++ + N +G+ T LG N F+DLTN E++ +Y G K+ + S + ++ + ++
Sbjct: 60 NLNFVNQWNAKGSSTV-LGMNVFADLTNAEYQRIYLGSKIDTSSMMNANAARLFDRTYNV 118
Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST 193
+ ++DWR KGAVT IKNQ++CG CW+F+ ++EG +I +GNL+ LSEQ L+DCST
Sbjct: 119 KALSPTVDWRQKGAVTHIKNQQQCGSCWSFSTTGSIEGAHEIATGNLVSLSEQNLIDCST 178
Query: 194 -NGNNGCLGGSREKAFAYIIQNQGI 217
GN GC GG AF Y+I+N GI
Sbjct: 179 AEGNQGCNGGLMTNAFEYVIKNGGI 203
Score = 47.8 bits (112), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 19/39 (48%), Positives = 27/39 (69%), Gaps = 1/39 (2%)
Query: 243 NYWLIKNSWGNTWGDAGYMKIVRDE-GLCGIGTRSSYPL 280
NYW++KNSWG WG GY+ + +D CGI + +SYP+
Sbjct: 348 NYWIVKNSWGPEWGLTGYILMSKDRNNNCGIASSASYPV 386
>gi|156371477|ref|XP_001628790.1| predicted protein [Nematostella vectensis]
gi|156215775|gb|EDO36727.1| predicted protein [Nematostella vectensis]
Length = 330
Score = 144 bits (362), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 80/201 (39%), Positives = 111/201 (55%), Gaps = 8/201 (3%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
+ + LL + AS V EQ + W H + Y E+ R I+++NL+
Sbjct: 3 LLVAACLLFAVASGFVVKFDEDEQQW----QAWKLFHTKKYTTVTEEGARKAIWRDNLKK 58
Query: 78 IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
I+K N EG+ ++ L N DLT DEFR YTG + ++ S F S VP
Sbjct: 59 IQKHNAEGH-SFTLAMNHLGDLTQDEFRYFYTGMRSHYSNYTKKQGSAFLAP--SHVQVP 115
Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GN 196
++DWR +G VTP+KNQ +CG CWAF+ ++EG ++G L+ LSEQ L+DCST GN
Sbjct: 116 DTVDWRKEGYVTPVKNQGQCGSCWAFSTTGSLEGQNFKKTGKLVSLSEQNLVDCSTAYGN 175
Query: 197 NGCLGGSREKAFAYIIQNQGI 217
NGC GG + AF YI +N GI
Sbjct: 176 NGCQGGLMDYAFKYIKENGGI 196
Score = 71.6 bits (174), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 33/75 (44%), Positives = 51/75 (68%), Gaps = 4/75 (5%)
Query: 209 AYIIQNQGIFN--GVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD 266
++ + G++N G T LDH V +VG+GT + G++YWL+KNSWG WG GY+ + R+
Sbjct: 256 SFQFYHSGVYNNAGCSSTSLDHGVLVVGYGTYQ-GSDYWLVKNSWGERWGMEGYIMMSRN 314
Query: 267 E-GLCGIGTRSSYPL 280
+ CG+ T++SYPL
Sbjct: 315 KNNQCGVATQASYPL 329
>gi|125564712|gb|EAZ10092.1| hypothetical protein OsI_32402 [Oryza sativa Indica Group]
Length = 382
Score = 144 bits (362), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 81/223 (36%), Positives = 115/223 (51%), Gaps = 30/223 (13%)
Query: 42 SVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTN 101
+++E+ ++W A++ RSY E+ RL+++ N+ YIE N Y+LG ++DLTN
Sbjct: 47 TMMEMFQRWKAEYNRSYATPEEERRRLRVYARNVRYIEATNAAAGLAYELGETAYTDLTN 106
Query: 102 DEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDV----------------PTSLDWRDK 145
DEF A+YT + S + ++T V P S+DWR
Sbjct: 107 DEFMAMYTAPPLRSAADDDDDAATTTIITTRAGPVDEHQQPEVYFNESAGAPASVDWRAS 166
Query: 146 GAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSRE 205
GAVT +K+Q CG CWAF+ VA VEGI KI+ G L+ LSEQ+L+DC T ++GC GG
Sbjct: 167 GAVTEVKDQGRCGSCWAFSTVAVVEGIQKIKKGKLVSLSEQELVDCDTL-DSGCDGGVSY 225
Query: 206 KAFAYIIQNQGI-------FNGVCGTQLD------HAVTIVGF 235
+A +I N GI + G D HA TI G
Sbjct: 226 RALEWITANGGITTRDDYPYTGAAAAACDRAKLGHHAATIAGL 268
Score = 87.0 bits (214), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 39/78 (50%), Positives = 52/78 (66%), Gaps = 12/78 (15%)
Query: 215 QGIFNGVCGTQLDHAVTIVGFGTTED-------GANYWLIKNSWGNTWGDAGYMKIVRD- 266
+G+++G CGT+L+H VT+VG+G E G YW+IKNSWG WGD GY+K+ +D
Sbjct: 304 KGVYDGPCGTRLNHGVTVVGYGQEEAPVDGSAAGDKYWIIKNSWGKNWGDQGYIKMKKDV 363
Query: 267 ----EGLCGIGTRSSYPL 280
EGLCGI R S+PL
Sbjct: 364 AGKPEGLCGIAIRPSFPL 381
>gi|255646088|gb|ACU23531.1| unknown [Glycine max]
Length = 362
Score = 144 bits (362), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 79/187 (42%), Positives = 111/187 (59%), Gaps = 19/187 (10%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDEL----EKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQ 95
E+S +++E+W RSY+ +K R +FK N+ ++ NK ++ YKL N+
Sbjct: 33 EESFWDLYERW-----RSYRTVSRSLGDKHKRFNVFKANVMHVHNTNKM-DKPYKLKLNK 86
Query: 96 FSDLTNDEFRALYTGYKMPSPSHR-----STTSSTFKYQNLSMTDVPTSLDWRDKGAVTP 150
F+D+TN EFR+ Y G K+ HR + TF Y+ + VP S DWR GAVT
Sbjct: 87 FADMTNHEFRSTYAGSKVNH--HRMFQGTPRGNGTFMYEKVG--SVPPSADWRKNGAVTG 142
Query: 151 IKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAY 210
+K+Q +CG CWAF+ V AVEGI +I++ L+ LSEQ+L+DC T N GC GG E AF +
Sbjct: 143 VKDQGQCGSCWAFSTVVAVEGINQIKTNKLVSLSEQELVDCDTKKNAGCNGGLMESAFEF 202
Query: 211 IIQNQGI 217
I Q GI
Sbjct: 203 IKQKGGI 209
Score = 90.5 bits (223), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 38/78 (48%), Positives = 52/78 (66%), Gaps = 4/78 (5%)
Query: 207 AFAYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMK---- 262
F + +G+F G C T+L+H V IVG+GTT DG NYW ++NSWG WG+ GY++
Sbjct: 267 GFDFQFYFEGVFTGDCSTELNHGVAIVGYGTTVDGTNYWTVRNSWGPEWGEQGYIRMQRS 326
Query: 263 IVRDEGLCGIGTRSSYPL 280
I + EGLCGI +SYP+
Sbjct: 327 IFKKEGLCGIAMMASYPI 344
>gi|214015388|gb|ACJ62310.1| cysteine protease [Zea mays subsp. parviglumis]
Length = 249
Score = 144 bits (362), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 71/187 (37%), Positives = 113/187 (60%), Gaps = 18/187 (9%)
Query: 46 IHEKWMAQHGR------------SYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYK 90
++E W ++HGR + E ++ +RL++F++NL YI+K N E G T++
Sbjct: 1 MYEAWKSKHGRGGSNDDCDMALGDEEQEEDRRLRLEVFRDNLRYIDKHNAEADAGLHTFR 60
Query: 91 LGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTP 150
LG F+DLT DE+R G++ + + + + +P ++DWR GAVT
Sbjct: 61 LGLTPFADLTLDEYRGRVLGFRARGRRSGARYGHRARPRGGDL--LPDAIDWRQLGAVTE 118
Query: 151 IKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAY 210
+K+Q++CG CWAF+AVAA+EG+ I +GNL+ LSEQ+++DC ++GC GG E AF +
Sbjct: 119 VKDQQQCGGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ-DSGCDGGQMENAFRF 177
Query: 211 IIQNQGI 217
+I N GI
Sbjct: 178 VIGNGGI 184
>gi|76574404|gb|ABA46972.1| cysteine protease Mir1 [Zea diploperennis]
Length = 250
Score = 144 bits (362), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 71/186 (38%), Positives = 112/186 (60%), Gaps = 16/186 (8%)
Query: 46 IHEKWMAQHGRS-----------YKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKL 91
++E W ++HGR ++E ++ +RL++F++NL YI+ N E G T++L
Sbjct: 1 MYEAWKSKHGRGGSNDDCDMAPGDEEEDDRRLRLEVFRDNLRYIDAHNAEADAGLHTFRL 60
Query: 92 GTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPI 151
G F+DLT +E+R G++ Y+ D+P ++DWR GAVT +
Sbjct: 61 GLTPFADLTLEEYRGRVLGFRARGRRSGVRYGHGHGYRARG-GDLPDAIDWRQLGAVTEV 119
Query: 152 KNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYI 211
K+Q++CG CWAF+AVAA+EG+ I +GNL+ LSEQ+++DC ++GC GG E AF ++
Sbjct: 120 KDQQQCGGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ-DSGCDGGQMENAFRFV 178
Query: 212 IQNQGI 217
I N GI
Sbjct: 179 IGNGGI 184
>gi|71482942|gb|AAZ32410.1| cysteine proteinase aleuran type [Nicotiana benthamiana]
Length = 360
Score = 144 bits (362), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 100/306 (32%), Positives = 143/306 (46%), Gaps = 78/306 (25%)
Query: 46 IHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFR 105
+ ++ ++G+ Y+ E + R ++F +NL+ I NK+G +YKLG N+F+D+T DEFR
Sbjct: 60 LFARFAHRYGKRYETVEEIKQRFEVFLDNLKMIRSHNKKG-LSYKLGVNEFTDITWDEFR 118
Query: 106 ALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAA 165
G + + +TT K N+ + P + DWR+ G V+P+KNQ +CG CW F+
Sbjct: 119 RDRLG---AAQNCSATTKGNLKLTNVVL---PETKDWREAGIVSPVKNQGKCGSCWTFST 172
Query: 166 VAAVEGITKIRSGNLIQLSEQQLLDCSTNGNN-GCLGGSREKAFAYIIQNQGI------- 217
A+E G I LSEQQL+DC+ NN GC GG +AF YI N G+
Sbjct: 173 TGALEAAYGQAFGKGISLSEQQLVDCAGAFNNFGCNGGLPSQAFEYIKSNGGLDTEEAYP 232
Query: 218 ---FNGVCG---------------------TQLDHAVTIV-----------GF------- 235
NG+C +L +AV +V GF
Sbjct: 233 YTGKNGLCKFSSENVGVKVIDSVNITLGAEDELKYAVALVRPVSIAFEVIKGFKQYKSGV 292
Query: 236 ------GTTEDGANY---------------WLIKNSWGNTWGDAGYMKIVRDEGLCGIGT 274
G T N+ WLIKNSWG WGD GY K+ + +CGI T
Sbjct: 293 YTSTECGNTPMDVNHAVLAVGYGVENGVPYWLIKNSWGADWGDNGYFKMEMGKNMCGIAT 352
Query: 275 RSSYPL 280
+SYP+
Sbjct: 353 CASYPV 358
>gi|413953046|gb|AFW85695.1| thiol protease SEN102 [Zea mays]
Length = 382
Score = 144 bits (362), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 73/208 (35%), Positives = 121/208 (58%), Gaps = 12/208 (5%)
Query: 22 ITLLVSCASQVVSSRSTHEQSVVEIHEK---WMAQHGRSYKDELEKEMRLKIFKENLEYI 78
+ L+ +C+ + + + + + + E+ W A++ R+Y E + R I+ EN+ +I
Sbjct: 36 LALMFACSLLLAGTAFSDDTIAIPLLERFKAWQAEYNRTYATPEEFQQRFMIYSENVRFI 95
Query: 79 EKANK-EGNRTYKLGTNQFSDLTNDEFRALYT---GYKMPSPSHRSTTSSTFKYQNLS-- 132
+ N+ +Y+LG NQF+DLT +EF+ Y + P+ T T +S
Sbjct: 96 KTMNQLSTGSSYELGENQFTDLTEEEFKDTYLMKLDEQPPAAEAMPPTVGTMSTAGMSNG 155
Query: 133 --MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLD 190
+ P S+DWR KGAVT +K+Q++CG CWAFA VA++EG+ +I++G L+ LSEQ+++D
Sbjct: 156 NNTGEAPNSVDWRTKGAVTRVKDQQQCGSCWAFATVASIEGVHQIKTGRLVSLSEQEIVD 215
Query: 191 CSTNGN-NGCLGGSREKAFAYIIQNQGI 217
C GN NGC GGS A ++ +N G+
Sbjct: 216 CDRGGNDNGCRGGSPRSAMEWVTRNGGL 243
Score = 50.8 bits (120), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 35/115 (30%), Positives = 56/115 (48%), Gaps = 10/115 (8%)
Query: 174 KIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIFNGVCGTQLDHAVTIV 233
+IR +Q + + L+ + G + +AF + G+F+G C T + V V
Sbjct: 269 RIRGYQAVQRNNEAELERAVAGQPVAVFVDASRAFQFY--KSGVFSGPCDTTTVNHVVTV 326
Query: 234 -GFGTT---EDGANYWLIKNSWGNTWGDAGY----MKIVRDEGLCGIGTRSSYPL 280
G+G+T G YW++KNSWG WG+ GY ++ EG+C I YP+
Sbjct: 327 VGYGSTGSDSGGRKYWIVKNSWGQGWGENGYVRMARRVRAREGMCAIAIEPYYPV 381
>gi|75994608|gb|ABA33825.1| cysteine protease Mir1 [Zea mays subsp. parviglumis]
Length = 250
Score = 144 bits (362), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 71/187 (37%), Positives = 114/187 (60%), Gaps = 18/187 (9%)
Query: 46 IHEKWMAQHGR------------SYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYK 90
++E W ++HGR ++E ++ +RL++F++NL YI+K N E G T++
Sbjct: 1 MYEAWKSKHGRRGGSNDDDCDMAPGEEEEDRRLRLEVFRDNLRYIDKHNAEVDAGLHTFR 60
Query: 91 LGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTP 150
LG F+DLT DE+R G++ + + + + +P ++DWR GAVT
Sbjct: 61 LGLTPFADLTLDEYRGRVLGFRARGRRSGARYGHRARPRGGDL--LPDAIDWRKLGAVTE 118
Query: 151 IKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAY 210
+K+Q++CG CWAF+AVAA+EG+ I +GNL+ LSEQ+++DC ++GC GG E AF +
Sbjct: 119 VKDQQQCGGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ-DSGCDGGQMEDAFRF 177
Query: 211 IIQNQGI 217
+I N GI
Sbjct: 178 VIGNGGI 184
>gi|110349475|gb|ABG73218.1| cathepsin L 2 precursor [Diaprepes abbreviatus]
Length = 348
Score = 143 bits (361), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 86/218 (39%), Positives = 117/218 (53%), Gaps = 22/218 (10%)
Query: 21 IITLLVSCASQVVSSRSTHEQSVV-EIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIE 79
+ +L+V A+ V S + Q +V E E++ +HG+ Y+ E E E R +F ENL I
Sbjct: 1 MYSLVVLLATLVAYSHAISYQVLVQEQWEQFKLEHGKVYESESENEYRQSVFMENLFQIN 60
Query: 80 KANK---EGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKY-------- 128
+ NK G +Y++ N DLT DEF +YT MP S + +
Sbjct: 61 EHNKLYEMGLSSYQMAMNHLGDLTKDEFMRIYT-VNMPQLPQSENLSDSEPWLDLPQDLQ 119
Query: 129 --------QNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNL 180
NL D+PT +DWR KGAVTP+KNQ+ CG CW+F+A A+E ++ L
Sbjct: 120 GFVTYALPTNLDEVDLPTDIDWRQKGAVTPVKNQRNCGSCWSFSATGALEAQWFKKTNKL 179
Query: 181 IQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGI 217
I LSEQQL+DCS GN+GC GG AF YI +N GI
Sbjct: 180 ISLSEQQLVDCSGRYGNHGCHGGWMHWAFGYIKENGGI 217
>gi|359806985|ref|NP_001241331.1| uncharacterized protein LOC100811719 precursor [Glycine max]
gi|255645733|gb|ACU23360.1| unknown [Glycine max]
Length = 362
Score = 143 bits (361), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 82/234 (35%), Positives = 125/234 (53%), Gaps = 20/234 (8%)
Query: 17 PMFIIITLLVSCASQVVSSRS----THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFK 72
P FI++ S +SS E+ V ++ + W +H R Y ++ EK R +IF+
Sbjct: 11 PFFIVLVSFTCSLSLAMSSNQLEQFASEEEVFQLFQAWQKEHKREYGNQEEKAKRFQIFQ 70
Query: 73 ENLEYIEKAN---KEGNRTYKLGTNQFSDLTNDEFRALYTG-YKMPSPSHRSTTSSTFKY 128
NL YI + N K ++LG N+F+D++ +EF Y +MP S+ + K
Sbjct: 71 SNLRYINEMNAKRKSPTTQHRLGLNKFADMSPEEFMKTYLKEIEMPY-SNLESRKKLQKG 129
Query: 129 QNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQL 188
+ ++P S+DWRDKGAVT +++Q +C WAF+ A+EGI KI +GNL+ LS QQ+
Sbjct: 130 DDADCDNLPHSVDWRDKGAVTEVRDQGKCQSHWAFSVTGAIEGINKIVTGNLVSLSVQQV 189
Query: 189 LDCSTNGNNGCLGGSREKAFAYIIQNQGI----------FNGVCGTQLDHAVTI 232
+DC ++GC GG AF Y+I+N GI NG C + V+I
Sbjct: 190 VDCDP-ASHGCAGGFYFNAFGYVIENGGIDTEAHYPYTAQNGTCKANANKVVSI 242
Score = 51.2 bits (121), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 24/55 (43%), Positives = 34/55 (61%), Gaps = 7/55 (12%)
Query: 232 IVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR---DE---GLCGIGTRSSYPL 280
IVG+G+ G +YW++KNSWG WG+ GY+ I R DE G+C I +P+
Sbjct: 299 IVGYGSV-GGEDYWIVKNSWGKDWGEEGYLLIKRNVSDEWPYGVCAINAAPGFPI 352
>gi|387915678|gb|AFK11448.1| cathepsin L1 [Callorhinchus milii]
Length = 336
Score = 143 bits (361), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 78/204 (38%), Positives = 119/204 (58%), Gaps = 15/204 (7%)
Query: 19 FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
+++++ L +Q+ + + E W + HGR Y + E R ++++NL+ I
Sbjct: 9 YVLLSFLALAFAQMF------DPLLSEPWLNWKSYHGREYMQK-EDSYRRAVWEKNLKTI 61
Query: 79 EKAN---KEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSS-TFKYQNLSMT 134
+ N G ++ L NQF D+T +EF L G++ P H ++T + + ++L +
Sbjct: 62 QLHNLGYSMGKHSFDLAMNQFGDMTTEEFHQLMNGFQQPDTEHLTSTKDVSTRPKSLKL- 120
Query: 135 DVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-T 193
P S+DWRDKG VTP+KNQ CG CWAF++ A+EG T ++G LI LSEQ L+DCS
Sbjct: 121 --PGSVDWRDKGYVTPVKNQGACGSCWAFSSTGALEGQTFKKTGKLIPLSEQNLVDCSQK 178
Query: 194 NGNNGCLGGSREKAFAYIIQNQGI 217
GN+GC GG ++AF YI QN GI
Sbjct: 179 QGNHGCNGGMMDRAFTYIQQNNGI 202
Score = 57.4 bits (137), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 24/56 (42%), Positives = 41/56 (73%), Gaps = 2/56 (3%)
Query: 226 LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE-GLCGIGTRSSYPL 280
++HAV +VG+ ++ G NYW++KNS+G+ WG+ GY+ + +D+ CGI + +YPL
Sbjct: 281 INHAVLVVGY-NSQGGNNYWIVKNSFGSRWGNKGYIWMPKDKNNHCGIASYPTYPL 335
>gi|66815417|ref|XP_641725.1| cysteine proteinase [Dictyostelium discoideum AX4]
gi|74844418|sp|Q94503.1|CYSP6_DICDI RecName: Full=Cysteine proteinase 6; Flags: Precursor
gi|1644500|gb|AAC47481.1| cysteine proteinase [Dictyostelium discoideum]
gi|60469754|gb|EAL67741.1| cysteine proteinase [Dictyostelium discoideum AX4]
Length = 434
Score = 143 bits (361), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 87/203 (42%), Positives = 111/203 (54%), Gaps = 10/203 (4%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
M ++ L V S + + E WM H R Y E E R IFK N++Y
Sbjct: 1 MKVLSALCVLLVSVATAKQQLSELQYRNAFTNWMIAHQRHYSSE-EFNGRFNIFKANMDY 59
Query: 78 IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
I + N +G+ T LG N F+D+TN+E+RA Y G + S T S + +
Sbjct: 60 INEWNTKGSETV-LGLNVFADITNEEYRATYLGTPFDASSLEMTPS-----EKVFGGVQA 113
Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSG--NLIQLSEQQLLDCS-TN 194
S+DWR KGAVTPIKNQ ECG CW+F+A A EG I +G +L +SEQQL+DCS +
Sbjct: 114 NSVDWRAKGAVTPIKNQGECGGCWSFSATGATEGAQYIANGDSDLTSVSEQQLIDCSGSY 173
Query: 195 GNNGCLGGSREKAFAYIIQNQGI 217
GNNGC GG AF YII N GI
Sbjct: 174 GNNGCEGGLMTLAFEYIINNGGI 196
Score = 45.8 bits (107), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 20/40 (50%), Positives = 27/40 (67%), Gaps = 1/40 (2%)
Query: 243 NYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPLA 281
NYW++KNSWG WG GY+ + +D + CGI T +S P A
Sbjct: 388 NYWIVKNSWGLDWGINGYILMSKDKDNQCGIATMASIPQA 427
>gi|114658412|ref|XP_001153217.1| PREDICTED: pro-cathepsin H isoform 6 [Pan troglodytes]
gi|397478882|ref|XP_003810764.1| PREDICTED: pro-cathepsin H [Pan paniscus]
gi|12803323|gb|AAH02479.1| Cathepsin H [Homo sapiens]
gi|60655259|gb|AAX32193.1| cathepsin H [synthetic construct]
gi|123979560|gb|ABM81609.1| cathepsin H [synthetic construct]
gi|123994193|gb|ABM84698.1| cathepsin H [synthetic construct]
gi|189054474|dbj|BAG37247.1| unnamed protein product [Homo sapiens]
gi|410254318|gb|JAA15126.1| cathepsin H [Pan troglodytes]
gi|410294916|gb|JAA26058.1| cathepsin H [Pan troglodytes]
gi|410331109|gb|JAA34501.1| cathepsin H [Pan troglodytes]
Length = 335
Score = 143 bits (361), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 98/312 (31%), Positives = 138/312 (44%), Gaps = 81/312 (25%)
Query: 42 SVVEIHEK-WMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLT 100
S+ + H K WM++H ++Y E E RL+ F N I A+ GN T+K+ NQFSD++
Sbjct: 29 SLEKFHFKSWMSKHRKTYSTE-EYHHRLQTFASNWRKIN-AHNNGNHTFKMALNQFSDMS 86
Query: 101 NDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGA-VTPIKNQKECGC 159
E + Y P + S T S + P S+DWR KG V+P+KNQ CG
Sbjct: 87 FAEIKHKYL---WSEPQNCSATKSNYL---RGTGPYPPSVDWRKKGNFVSPVKNQGACGS 140
Query: 160 CWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNN-GCLGGSREKAFAYIIQNQGIF 218
CW F+ A+E I +G ++ L+EQQL+DC+ + NN GC GG +AF YI+ N+GI
Sbjct: 141 CWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYILYNKGIM 200
Query: 219 ----------NGVCGTQLDHAVTIV--------------------------GFGTTED-- 240
+G C Q A+ V F T+D
Sbjct: 201 GEDTYPYQGKDGYCKFQPGKAIGFVKDVANITIYDEEAMVEAVALYNPVSFAFEVTQDFM 260
Query: 241 --------------------------------GANYWLIKNSWGNTWGDAGYMKIVRDEG 268
G YW++KNSWG WG GY I R +
Sbjct: 261 MYRTGIYSSTSCHKTPDKVNHAVLAVGYGEKNGIPYWIVKNSWGPQWGMNGYFLIERGKN 320
Query: 269 LCGIGTRSSYPL 280
+CG+ +SYP+
Sbjct: 321 MCGLAACASYPI 332
>gi|355763133|gb|EHH62119.1| hypothetical protein EGM_20318 [Macaca fascicularis]
Length = 331
Score = 143 bits (361), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 79/206 (38%), Positives = 122/206 (59%), Gaps = 14/206 (6%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIH-EKWMAQHGRSYKDELEKEMRLKIFKENLE 76
M +I +L+ C+S V H+ ++ H W +G+ YK++ E+ +R I+++NL+
Sbjct: 1 MKQLICVLLVCSSAVAQ---LHKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLK 57
Query: 77 YIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
++ N E G +Y LG N D+T++E +L + ++PS R+ T Y++ +
Sbjct: 58 FVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLRVPSQWQRNIT-----YKSNAN 112
Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST 193
+P S+DWR+KG VT +K Q CG CWAF+AV A+E K+++G L+ LS Q L+DCST
Sbjct: 113 QILPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCST 172
Query: 194 N--GNNGCLGGSREKAFAYIIQNQGI 217
GN GC GG +AF YII N GI
Sbjct: 173 EKYGNKGCNGGFMTRAFQYIIDNNGI 198
>gi|214015307|gb|ACJ62270.1| cysteine protease [Zea mays subsp. parviglumis]
gi|214015309|gb|ACJ62271.1| cysteine protease [Zea mays subsp. parviglumis]
gi|214015313|gb|ACJ62273.1| cysteine protease [Zea mays subsp. parviglumis]
gi|214015315|gb|ACJ62274.1| cysteine protease [Zea mays subsp. parviglumis]
gi|214015317|gb|ACJ62275.1| cysteine protease [Zea mays subsp. parviglumis]
gi|214015319|gb|ACJ62276.1| cysteine protease [Zea mays subsp. parviglumis]
gi|214015321|gb|ACJ62277.1| cysteine protease [Zea mays subsp. parviglumis]
gi|214015323|gb|ACJ62278.1| cysteine protease [Zea mays subsp. parviglumis]
Length = 249
Score = 143 bits (361), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 71/187 (37%), Positives = 113/187 (60%), Gaps = 18/187 (9%)
Query: 46 IHEKWMAQHGR------------SYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYK 90
++E W ++HGR + E ++ +RL++F++NL YI+K N E G T++
Sbjct: 1 MYEAWKSKHGRGGSNDDCDMAPGDEEQEEDRRLRLEVFRDNLRYIDKHNAEADAGLHTFR 60
Query: 91 LGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTP 150
LG F+DLT DE+R G++ + + + + +P ++DWR GAVT
Sbjct: 61 LGLTPFADLTLDEYRGRVLGFRARGRRSGARYGHRARPRGGDL--LPDAIDWRQLGAVTE 118
Query: 151 IKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAY 210
+K+Q++CG CWAF+AVAA+EG+ I +GNL+ LSEQ+++DC ++GC GG E AF +
Sbjct: 119 VKDQQQCGGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ-DSGCDGGQMENAFRF 177
Query: 211 IIQNQGI 217
+I N GI
Sbjct: 178 VIGNGGI 184
>gi|29710|emb|CAA34734.1| unnamed protein product [Homo sapiens]
Length = 335
Score = 143 bits (361), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 98/312 (31%), Positives = 138/312 (44%), Gaps = 81/312 (25%)
Query: 42 SVVEIHEK-WMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLT 100
S+ + H K WM++H ++Y E E RL+ F N I A+ GN T+K+ NQFSD++
Sbjct: 29 SLEKFHFKSWMSKHRKTYSTE-EYHHRLQTFASNWRKIN-AHNNGNHTFKMALNQFSDMS 86
Query: 101 NDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGA-VTPIKNQKECGC 159
E + Y P + S T S + P S+DWR KG V+P+KNQ CG
Sbjct: 87 FAEIKHKYL---WSEPQNCSATKSNYL---RGTGPYPPSVDWRKKGNFVSPVKNQGACGS 140
Query: 160 CWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNN-GCLGGSREKAFAYIIQNQGIF 218
CW F+ A+E I +G ++ L+EQQL+DC+ + NN GC GG +AF YI+ N+GI
Sbjct: 141 CWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNYGCQGGLPSQAFEYILYNKGIM 200
Query: 219 ----------NGVCGTQLDHAVTIV--------------------------GFGTTED-- 240
+G C Q A+ V F T+D
Sbjct: 201 GEDTYPYQGKDGYCKFQPGKAIGFVKDVANITIYDEEAMVEAVALYNPVSFAFEVTQDFM 260
Query: 241 --------------------------------GANYWLIKNSWGNTWGDAGYMKIVRDEG 268
G YW++KNSWG WG GY I R +
Sbjct: 261 MYRTGIYSSTSCHKTPDKVNHAVLAVGYGEKNGIPYWIVKNSWGPQWGMNGYFLIERGKN 320
Query: 269 LCGIGTRSSYPL 280
+CG+ +SYP+
Sbjct: 321 MCGLAACASYPI 332
>gi|214015289|gb|ACJ62261.1| cysteine protease [Zea mays subsp. parviglumis]
Length = 254
Score = 143 bits (361), Expect = 8e-32, Method: Compositional matrix adjust.
Identities = 71/190 (37%), Positives = 114/190 (60%), Gaps = 19/190 (10%)
Query: 46 IHEKWMAQHGRSYKDE-------------LEKEMRLKIFKENLEYIEKANKE---GNRTY 89
++E W ++HGR +++ ++ +RL++F++NL YI+K N E G T+
Sbjct: 1 MYEAWKSKHGRGGRNDDCDMAPGDEEQEEEDRRLRLEVFRDNLRYIDKHNAEADAGLHTF 60
Query: 90 KLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM--TDVPTSLDWRDKGA 147
+LG F+DLT DE+R G++ + + Y+ +P ++DWR GA
Sbjct: 61 RLGLTPFADLTLDEYRGRVLGFRARARRSGARYGHGHGYRARPRGGDHLPDAIDWRQLGA 120
Query: 148 VTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKA 207
VT +K+Q++CG CWAF+AVAA+EG+ I +GNL+ LSEQ+++DC ++GC GG E A
Sbjct: 121 VTEVKDQQQCGGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ-DSGCDGGQMENA 179
Query: 208 FAYIIQNQGI 217
F ++I N GI
Sbjct: 180 FRFVIGNGGI 189
>gi|214015235|gb|ACJ62234.1| cysteine protease [Zea mays subsp. parviglumis]
gi|214015271|gb|ACJ62252.1| cysteine protease [Zea mays subsp. parviglumis]
gi|214015275|gb|ACJ62254.1| cysteine protease [Zea mays subsp. parviglumis]
Length = 254
Score = 143 bits (361), Expect = 8e-32, Method: Compositional matrix adjust.
Identities = 71/190 (37%), Positives = 113/190 (59%), Gaps = 19/190 (10%)
Query: 46 IHEKWMAQHGRSYKDE-------------LEKEMRLKIFKENLEYIEKANKE---GNRTY 89
++E W ++HGR +++ ++ +RL++F++NL YI+K N E G T+
Sbjct: 1 MYEAWKSKHGRGGRNDDCDMAPGDEEQEEEDRRLRLEVFRDNLRYIDKHNAEADAGLHTF 60
Query: 90 KLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM--TDVPTSLDWRDKGA 147
+LG F+DLT DE+R G++ + Y+ +P ++DWR GA
Sbjct: 61 RLGLTPFADLTLDEYRGRVLGFRARGRRSGARYGHGHGYRARPRGGDHLPDAIDWRQLGA 120
Query: 148 VTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKA 207
VT +K+Q++CG CWAF+AVAA+EG+ I +GNL+ LSEQ+++DC ++GC GG E A
Sbjct: 121 VTEVKDQQQCGGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ-DSGCDGGQMENA 179
Query: 208 FAYIIQNQGI 217
F ++I N GI
Sbjct: 180 FRFVIGNGGI 189
>gi|269784818|ref|NP_001161481.1| cathepsin L1 precursor [Gallus gallus]
Length = 353
Score = 143 bits (361), Expect = 8e-32, Method: Compositional matrix adjust.
Identities = 88/223 (39%), Positives = 120/223 (53%), Gaps = 18/223 (8%)
Query: 2 VLIFERSGSFKINTTPMFIIITL---LVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSY 58
VL R S +N I++L L A +V +H Q W + H + Y
Sbjct: 3 VLFLARRLSRFVNMNVCLTILSLCLGLAFAAPRVDPDLDSHWQL-------WKSWHSKDY 55
Query: 59 KDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPS 115
E E+ R ++++NL+ IE N + G +YKLG NQF D+T +EFR L GYK
Sbjct: 56 H-EREESWRRVVWEKNLKMIELHNLDHSLGKHSYKLGMNQFGDMTAEEFRQLMNGYKHKK 114
Query: 116 PSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKI 175
S R S F S + P S+DWR+KG VTP+K+Q +CG CWAF+ A+EG
Sbjct: 115 -SERKYRGSQF--LEPSFLEAPRSVDWREKGYVTPVKDQGQCGSCWAFSTTGALEGQHFR 171
Query: 176 RSGNLIQLSEQQLLDCS-TNGNNGCLGGSREKAFAYIIQNQGI 217
++G L+ LSEQ L+DCS GN GC GG ++AF Y+ N GI
Sbjct: 172 KTGKLVSLSEQNLVDCSRPEGNQGCNGGLMDQAFQYVQDNGGI 214
Score = 67.4 bits (163), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 30/59 (50%), Positives = 39/59 (66%), Gaps = 4/59 (6%)
Query: 226 LDHAVTIVGFG---TTEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 280
LDH V +VG+G DG YW++KNSWG WGD GY+ + +D + CGI T +SYPL
Sbjct: 294 LDHGVLVVGYGFEGEDVDGKKYWIVKNSWGEKWGDKGYIYMAKDRKNHCGIATAASYPL 352
>gi|214015355|gb|ACJ62294.1| cysteine protease [Zea mays subsp. parviglumis]
Length = 252
Score = 143 bits (361), Expect = 8e-32, Method: Compositional matrix adjust.
Identities = 72/188 (38%), Positives = 113/188 (60%), Gaps = 17/188 (9%)
Query: 46 IHEKWMAQHGRSY-----------KDELEKEMRLKIFKENLEYIEKANKE---GNRTYKL 91
++E W ++HGR +E ++ +RL++F++NL YI+K N E G T++L
Sbjct: 1 MYEAWKSKHGRGGSSNDDCDMAPGDEEEDRRLRLEVFRDNLRYIDKHNAEADAGLHTFRL 60
Query: 92 GTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD--VPTSLDWRDKGAVT 149
G F+DLT DE+R G++ + + Y+ +P ++DWR GAVT
Sbjct: 61 GLTPFADLTLDEYRGRVLGFRARARRSGARYGHGHGYRARPRGGDLLPDAIDWRQLGAVT 120
Query: 150 PIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFA 209
+K+Q++CG CWAF+AVAA+EG+ I +GNL+ LSEQ+++DC ++GC GG E AF
Sbjct: 121 EVKDQQQCGGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ-DSGCDGGQMENAFR 179
Query: 210 YIIQNQGI 217
++I N GI
Sbjct: 180 FVIGNGGI 187
>gi|75994620|gb|ABA33831.1| cysteine protease Mir1 [Zea mays subsp. parviglumis]
Length = 255
Score = 143 bits (361), Expect = 8e-32, Method: Compositional matrix adjust.
Identities = 71/190 (37%), Positives = 113/190 (59%), Gaps = 19/190 (10%)
Query: 46 IHEKWMAQHGRSYKDE-------------LEKEMRLKIFKENLEYIEKANKE---GNRTY 89
++E W ++HGR +++ ++ +RL++F++NL YI+K N E G T+
Sbjct: 1 MYEAWKSKHGRGGRNDDCDMAPGDEEQEEEDRRLRLEVFRDNLRYIDKHNAEADAGLHTF 60
Query: 90 KLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM--TDVPTSLDWRDKGA 147
+LG F+DLT DE+R G++ + Y+ +P ++DWR GA
Sbjct: 61 RLGLTPFADLTLDEYRGRVLGFRARGRRSGARYGHGHGYRARPRGGDHLPDAIDWRQLGA 120
Query: 148 VTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKA 207
VT +K+Q++CG CWAF+AVAA+EG+ I +GNL+ LSEQ+++DC ++GC GG E A
Sbjct: 121 VTEVKDQQQCGGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ-DSGCDGGQMENA 179
Query: 208 FAYIIQNQGI 217
F ++I N GI
Sbjct: 180 FRFVIGNGGI 189
>gi|61372279|gb|AAX43816.1| cathepsin H [synthetic construct]
Length = 336
Score = 143 bits (360), Expect = 8e-32, Method: Compositional matrix adjust.
Identities = 98/312 (31%), Positives = 138/312 (44%), Gaps = 81/312 (25%)
Query: 42 SVVEIHEK-WMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLT 100
S+ + H K WM++H ++Y E E RL+ F N I A+ GN T+K+ NQFSD++
Sbjct: 29 SLEKFHFKSWMSKHRKTYSTE-EYHHRLQTFASNWRKIN-AHNNGNHTFKMALNQFSDMS 86
Query: 101 NDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGA-VTPIKNQKECGC 159
E + Y P + S T S + P S+DWR KG V+P+KNQ CG
Sbjct: 87 FAEIKHKYL---WSEPQNCSATKSNYL---RGTGPYPPSVDWRKKGNFVSPVKNQGACGS 140
Query: 160 CWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNN-GCLGGSREKAFAYIIQNQGIF 218
CW F+ A+E I +G ++ L+EQQL+DC+ + NN GC GG +AF YI+ N+GI
Sbjct: 141 CWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYILYNKGIM 200
Query: 219 ----------NGVCGTQLDHAVTIV--------------------------GFGTTED-- 240
+G C Q A+ V F T+D
Sbjct: 201 GEDTYPYQGKDGYCKFQPGKAIGFVKDVANITIYDEEAMVEAVALYNPVSFAFEVTQDFM 260
Query: 241 --------------------------------GANYWLIKNSWGNTWGDAGYMKIVRDEG 268
G YW++KNSWG WG GY I R +
Sbjct: 261 MYRTGIYSSTSCHKTPDKVNHAVLAVGYGEKNGIPYWIVKNSWGPQWGMNGYFLIERGKN 320
Query: 269 LCGIGTRSSYPL 280
+CG+ +SYP+
Sbjct: 321 MCGLAACASYPI 332
>gi|2499879|sp|Q40143.1|CYSP3_SOLLC RecName: Full=Cysteine proteinase 3; Flags: Precursor
gi|1235545|emb|CAA88629.1| pre-pro-cysteine proteinase [Solanum lycopersicum]
Length = 356
Score = 143 bits (360), Expect = 8e-32, Method: Compositional matrix adjust.
Identities = 101/305 (33%), Positives = 144/305 (47%), Gaps = 82/305 (26%)
Query: 49 KWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALY 108
++ +H + Y E + R +IF +NL+ I N++G +YKLG N+F+DLT DEFR
Sbjct: 59 RFAIRHRKRYDSVEEIKQRFEIFLDNLKMIRSHNRKG-LSYKLGINEFTDLTWDEFRK-- 115
Query: 109 TGYKMPSPSHRSTTSSTFKYQNLSMTDV--PTSLDWRDKGAVTPIKNQKECGCCWAFAAV 166
+K+ + + S T+ NL +T+V P + DWR G V+P+K Q +CG CW F+
Sbjct: 116 --HKLGASQNCSATTKG----NLKLTNVVLPETKDWRKDGIVSPVKAQGKCGSCWTFSTT 169
Query: 167 AAVEGITKIRSGNLIQLSEQQLLDCSTNGNN-GCLGGSREKAFAYIIQNQGI-------- 217
A+E G I LSEQQL+DC+ NN GC GG +AF YI N G+
Sbjct: 170 GALEAAYAQAFGKGISLSEQQLVDCAGAFNNFGCNGGLPSQAFEYIKFNGGLDTEEAYPY 229
Query: 218 --FNGVCG---------------------TQLDHAVTIV-----------GF-------- 235
NG+C +L +AV +V GF
Sbjct: 230 TGKNGICKFSQANIGVKVISSVNITLGAEYELKYAVALVRPVSVAFEVVKGFKQYKSGVY 289
Query: 236 GTTE--------------------DGANYWLIKNSWGNTWGDAGYMKIVRDEGLCGIGTR 275
+TE +G YWLIKNSWG WG+ GY K+ + +CG+ T
Sbjct: 290 ASTECGDTPMDVNHAVLAVGYGVENGTPYWLIKNSWGADWGEDGYFKMEMGKNMCGVATC 349
Query: 276 SSYPL 280
+SYP+
Sbjct: 350 ASYPI 354
>gi|60827884|gb|AAX36817.1| cathepsin H [synthetic construct]
Length = 336
Score = 143 bits (360), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 98/312 (31%), Positives = 138/312 (44%), Gaps = 81/312 (25%)
Query: 42 SVVEIHEK-WMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLT 100
S+ + H K WM++H ++Y E E RL+ F N I A+ GN T+K+ NQFSD++
Sbjct: 29 SLEKFHFKSWMSKHRKTYSTE-EYHHRLQTFASNWRKIN-AHNNGNHTFKMALNQFSDMS 86
Query: 101 NDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGA-VTPIKNQKECGC 159
E + Y P + S T S + P S+DWR KG V+P+KNQ CG
Sbjct: 87 FAEIKHKYL---WSEPQNCSATKSNYL---RGTGPYPPSVDWRKKGNFVSPVKNQGACGS 140
Query: 160 CWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNN-GCLGGSREKAFAYIIQNQGIF 218
CW F+ A+E I +G ++ L+EQQL+DC+ + NN GC GG +AF YI+ N+GI
Sbjct: 141 CWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYILYNKGIM 200
Query: 219 ----------NGVCGTQLDHAVTIV--------------------------GFGTTED-- 240
+G C Q A+ V F T+D
Sbjct: 201 GEDTYPYQGKDGYCKFQPGKAIGFVKDVANITIYDEEAMVEAVALYNPVSFAFEVTQDFM 260
Query: 241 --------------------------------GANYWLIKNSWGNTWGDAGYMKIVRDEG 268
G YW++KNSWG WG GY I R +
Sbjct: 261 MYRTGIYSSTSCHKTPDKVNHAVLAVGYGEKNGIPYWIVKNSWGPQWGMNGYFLIERGKN 320
Query: 269 LCGIGTRSSYPL 280
+CG+ +SYP+
Sbjct: 321 MCGLAACASYPI 332
>gi|23110955|ref|NP_004381.2| pro-cathepsin H preproprotein [Homo sapiens]
gi|288558851|sp|P09668.4|CATH_HUMAN RecName: Full=Pro-cathepsin H; Contains: RecName: Full=Cathepsin H
mini chain; Contains: RecName: Full=Cathepsin H;
Contains: RecName: Full=Cathepsin H heavy chain;
Contains: RecName: Full=Cathepsin H light chain; Flags:
Precursor
gi|119619549|gb|EAW99143.1| cathepsin H [Homo sapiens]
Length = 335
Score = 143 bits (360), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 98/312 (31%), Positives = 138/312 (44%), Gaps = 81/312 (25%)
Query: 42 SVVEIHEK-WMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLT 100
S+ + H K WM++H ++Y E E RL+ F N I A+ GN T+K+ NQFSD++
Sbjct: 29 SLEKFHFKSWMSKHRKTYSTE-EYHHRLQTFASNWRKIN-AHNNGNHTFKMALNQFSDMS 86
Query: 101 NDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGA-VTPIKNQKECGC 159
E + Y P + S T S + P S+DWR KG V+P+KNQ CG
Sbjct: 87 FAEIKHKYL---WSEPQNCSATKSNYL---RGTGPYPPSVDWRKKGNFVSPVKNQGACGS 140
Query: 160 CWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNN-GCLGGSREKAFAYIIQNQGIF 218
CW F+ A+E I +G ++ L+EQQL+DC+ + NN GC GG +AF YI+ N+GI
Sbjct: 141 CWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYILYNKGIM 200
Query: 219 ----------NGVCGTQLDHAVTIV--------------------------GFGTTED-- 240
+G C Q A+ V F T+D
Sbjct: 201 GEDTYPYQGKDGYCKFQPGKAIGFVKDVANITIYDEEAMVEAVALYNPVSFAFEVTQDFM 260
Query: 241 --------------------------------GANYWLIKNSWGNTWGDAGYMKIVRDEG 268
G YW++KNSWG WG GY I R +
Sbjct: 261 MYRTGIYSSTSCHKTPDKVNHAVLAVGYGEKNGIPYWIVKNSWGPQWGMNGYFLIERGKN 320
Query: 269 LCGIGTRSSYPL 280
+CG+ +SYP+
Sbjct: 321 MCGLAACASYPI 332
>gi|214015251|gb|ACJ62242.1| cysteine protease [Zea mays subsp. parviglumis]
gi|214015281|gb|ACJ62257.1| cysteine protease [Zea mays subsp. parviglumis]
gi|214015285|gb|ACJ62259.1| cysteine protease [Zea mays subsp. parviglumis]
Length = 255
Score = 143 bits (360), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 72/191 (37%), Positives = 113/191 (59%), Gaps = 20/191 (10%)
Query: 46 IHEKWMAQHGR--------------SYKDELEKEMRLKIFKENLEYIEKANKE---GNRT 88
++E W ++HGR ++E ++ +RL++F++NL YI+ N E G T
Sbjct: 1 MYEAWKSKHGRGGSSNDDCDMAPGDDEQEEEDRRLRLEVFRDNLRYIDAHNAEADAGLHT 60
Query: 89 YKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD--VPTSLDWRDKG 146
++LG F+DLT DE+R G++ + Y+ S +P ++DWR G
Sbjct: 61 FRLGLTPFADLTLDEYRGRVLGFRARGRRSGARYGHGHGYRARSRGGDLLPDAIDWRQLG 120
Query: 147 AVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREK 206
AVT +K+Q++CG CWAF+AVAA+EG+ I +GNL+ LSEQ+++DC ++GC GG E
Sbjct: 121 AVTEVKDQQQCGGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ-DSGCDGGQMEN 179
Query: 207 AFAYIIQNQGI 217
AF ++I N GI
Sbjct: 180 AFRFVIGNGGI 190
>gi|16506813|gb|AAL23961.1|AF426247_1 cathepsin H [Homo sapiens]
Length = 335
Score = 143 bits (360), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 98/312 (31%), Positives = 138/312 (44%), Gaps = 81/312 (25%)
Query: 42 SVVEIHEK-WMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLT 100
S+ + H K WM++H ++Y E E RL+ F N I A+ GN T+K+ NQFSD++
Sbjct: 29 SLEKFHFKSWMSKHRKTYSTE-EYHHRLQTFASNWRKIN-AHNNGNHTFKMALNQFSDMS 86
Query: 101 NDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGA-VTPIKNQKECGC 159
E + Y P + S T S + P S+DWR KG V+P+KNQ CG
Sbjct: 87 FAEIKHKYL---WSEPQNCSATKSNYL---RGTGPYPPSVDWRKKGNFVSPVKNQGACGS 140
Query: 160 CWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNN-GCLGGSREKAFAYIIQNQGIF 218
CW F+ A+E I +G ++ L+EQQL+DC+ + NN GC GG +AF YI+ N+GI
Sbjct: 141 CWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNYGCQGGLPSQAFEYILYNKGIM 200
Query: 219 ----------NGVCGTQLDHAVTIV--------------------------GFGTTED-- 240
+G C Q A+ V F T+D
Sbjct: 201 GEDTYPYQGKDGYCKFQPGKAIGFVKDVANITIYDEEAMVEAVALYNPVSFAFEVTQDFM 260
Query: 241 --------------------------------GANYWLIKNSWGNTWGDAGYMKIVRDEG 268
G YW++KNSWG WG GY I R +
Sbjct: 261 MYRTGIYSSTSCHKTPDKVNHAVLAVGYGEKNGIPYWIVKNSWGPQWGMNGYFLIERGKN 320
Query: 269 LCGIGTRSSYPL 280
+CG+ +SYP+
Sbjct: 321 MCGLAACASYPI 332
>gi|225718114|gb|ACO14903.1| Cathepsin L precursor [Caligus clemensi]
Length = 336
Score = 143 bits (360), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 74/174 (42%), Positives = 106/174 (60%), Gaps = 8/174 (4%)
Query: 48 EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEF 104
E W HG++Y +E+++RLKI+ EN I + N E G Y + N + DL + EF
Sbjct: 31 ESWKLMHGKTYSSSIEEKLRLKIYMENSLKISRHNSEALNGIHPYYMKMNHYGDLLHHEF 90
Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFA 164
A+ GY+ + + S + +N+ + PT +DWR++GAVTP+KNQ +CG CW+F+
Sbjct: 91 VAMVNGYQYANKT-ASLGGTYIPNKNIQL---PTHVDWREEGAVTPVKNQGQCGSCWSFS 146
Query: 165 AVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGI 217
A A+EG ++G LI LSEQ L+DCS GNNGC GG + AF YI N+GI
Sbjct: 147 ATGALEGQDFRKTGKLISLSEQNLVDCSRKFGNNGCEGGLMDFAFTYIRDNKGI 200
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 31/59 (52%), Positives = 43/59 (72%), Gaps = 2/59 (3%)
Query: 225 QLDHAVTIVGFGT-TEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPLA 281
+LDH V +VGFGT + G +YWL+KNSW WGD GY+K+ R+ E +CGI + +SYP+
Sbjct: 278 ELDHGVLVVGFGTDSVSGEDYWLVKNSWSEKWGDQGYIKMARNKENMCGIASSASYPVV 336
>gi|115438534|ref|NP_001043563.1| Os01g0613800 [Oryza sativa Japonica Group]
gi|11034574|dbj|BAB17098.1| cysteine proteinase-like [Oryza sativa Japonica Group]
gi|113533094|dbj|BAF05477.1| Os01g0613800 [Oryza sativa Japonica Group]
gi|125571165|gb|EAZ12680.1| hypothetical protein OsJ_02595 [Oryza sativa Japonica Group]
gi|215766821|dbj|BAG99049.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 359
Score = 143 bits (360), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 78/178 (43%), Positives = 103/178 (57%), Gaps = 17/178 (9%)
Query: 47 HEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRA 106
H WMA+ GR+Y D EK R ++F+ N E I+ AN+ G+ TY LG F+DLT DEFRA
Sbjct: 38 HRCWMARVGRTYADAAEKARRFEVFRANAERIDAANRAGDLTYTLGLTPFADLTADEFRA 97
Query: 107 LYTGYKMPSPSHRSTTSSTFKY--------QNLSMTDVPT---SLDWRDKGAVTPIKNQK 155
+ MP ++ + Q+L + P S DWRD GAVTP+++Q
Sbjct: 98 RHL---MPDADVDEPATARVLFEQEEKAAKQHLPPSRPPAVWGSKDWRDLGAVTPVQDQG 154
Query: 156 E--CGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYI 211
+ C CWAFA VAA EG+ KI +GN+ LS QQ+LDC T G+N C GG +A YI
Sbjct: 155 KNNCNSCWAFAVVAATEGLIKIETGNVTPLSAQQVLDC-TGGDNTCKGGHIHEALRYI 211
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 30/59 (50%), Positives = 40/59 (67%), Gaps = 1/59 (1%)
Query: 222 CGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDEGLCGIGTRSSYPL 280
CG + +HAV +VG+GT DG YWL+KNSW WG+ GYM+I D CG+ +R +YP
Sbjct: 301 CGKKRNHAVAVVGYGTASDGTPYWLLKNSWATDWGENGYMRIAVDAD-CGVSSRPAYPF 358
>gi|414887429|tpg|DAA63443.1| TPA: hypothetical protein ZEAMMB73_816727 [Zea mays]
Length = 334
Score = 143 bits (360), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 72/183 (39%), Positives = 106/183 (57%), Gaps = 15/183 (8%)
Query: 50 WMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYT 109
W A + RSY E+ R +++++N+E IE N+ +Y+LG F+DLT++EF A +T
Sbjct: 64 WQATYNRSYLTAAERLRRFEVYRQNMELIEATNRRAGLSYQLGETPFTDLTSEEFLATHT 123
Query: 110 ------------GYKMPSPSHRSTTSSTFKYQNLSMT---DVPTSLDWRDKGAVTPIKNQ 154
++ +H S + N + T DVP S+DWR KGAVTP+K+Q
Sbjct: 124 MSTRLHASEAARRHRELITTHAGPVSDGGRQWNRNYTTDLDVPESVDWRTKGAVTPVKDQ 183
Query: 155 KECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQN 214
CG CW+F VAA+EG+ KIR+G L+ LSEQ +LDCS+ N+GC G A ++ N
Sbjct: 184 GACGSCWSFVTVAAIEGLHKIRTGQLVSLSEQAVLDCSSPPNHGCNRGDPAAAIDWVSAN 243
Query: 215 QGI 217
G+
Sbjct: 244 GGL 246
>gi|214015353|gb|ACJ62293.1| cysteine protease [Zea mays subsp. parviglumis]
Length = 254
Score = 143 bits (360), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 73/190 (38%), Positives = 113/190 (59%), Gaps = 19/190 (10%)
Query: 46 IHEKWMAQHGR-------------SYKDELEKEMRLKIFKENLEYIEKANKE---GNRTY 89
++E W ++HGR + E ++ +RL++F++NL YI+K N E G T+
Sbjct: 1 MYEAWKSKHGRGGSSNDDCDMAPGDDEQEEDRRLRLEVFRDNLRYIDKHNAEADAGLHTF 60
Query: 90 KLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD--VPTSLDWRDKGA 147
+LG F+DLT DE+R G++ + + Y+ +P ++DWR GA
Sbjct: 61 RLGLTPFADLTLDEYRGRVLGFRARARRSGARYGHGHGYRARPRGGDLLPDAIDWRQLGA 120
Query: 148 VTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKA 207
VT +K+Q++CG CWAF+AVAA+EGI I +GNL+ LSEQ+++DC ++GC GG E A
Sbjct: 121 VTEVKDQQQCGGCWAFSAVAAIEGINAIATGNLVSLSEQEIIDCDAQ-DSGCDGGQMENA 179
Query: 208 FAYIIQNQGI 217
F ++I N GI
Sbjct: 180 FRFVIGNGGI 189
>gi|75060921|sp|Q5E998.1|CATL2_BOVIN RecName: Full=Cathepsin L2; Flags: Precursor
gi|59858409|gb|AAX09039.1| cathepsin L2 preproprotein [Bos taurus]
Length = 334
Score = 142 bits (359), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 102/342 (29%), Positives = 155/342 (45%), Gaps = 91/342 (26%)
Query: 17 PMFIIITLLVSCASQVVSSRSTHEQSVVEIH-EKWMAQHGRSYKDELEKEMRLKIFKENL 75
P F + L + V+S + ++ H +W A H R Y E+E R ++++N
Sbjct: 3 PSFFLTVLCLG-----VASAAPKLDPNLDAHWHQWKATHRRLYGMN-EEEWRRAVWEKNK 56
Query: 76 EYIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
+ I+ N+E G +++ N F D+TN+EFR + G++ + H+ +
Sbjct: 57 KIIDLHNQEYSEGKHGFRMAMNAFGDMTNEEFRQVMNGFQ--NQKHKKGKL----FHEPL 110
Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
+ DVP S+DW KG VTP+KNQ +CG CWAF+A A+EG ++G L+ LSEQ L+DCS
Sbjct: 111 LVDVPKSVDWTKKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCS 170
Query: 193 -------TNG-----------NNGCLGGS------------------------------- 203
NG +NGCL
Sbjct: 171 RAQGNQGCNGGLMDNAFQYIKDNGCLDSEESYPYLATDTNSCNYKPECSAANDTGFVDIP 230
Query: 204 -REKAFAYIIQNQG--------------------IFNGVCGTQ-LDHAVTIVGFG---TT 238
REKA + G ++ C ++ LDH V +VG+G T
Sbjct: 231 QREKALMKAVATVGPISVAIDAGHTSFQFYKSGIYYDPDCSSKDLDHGVLVVGYGFEGTD 290
Query: 239 EDGANYWLIKNSWGNTWGDAGYMKIVRDE-GLCGIGTRSSYP 279
+ +W++KNSWG WG GY+K+ +D+ CGI T +SYP
Sbjct: 291 SNNNKFWIVKNSWGPEWGWNGYVKMAKDQNNHCGIATAASYP 332
>gi|449530091|ref|XP_004172030.1| PREDICTED: vignain-like [Cucumis sativus]
Length = 351
Score = 142 bits (359), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 80/214 (37%), Positives = 125/214 (58%), Gaps = 13/214 (6%)
Query: 12 KINTTPMFIIITLLVSCASQVVSSRSTH-EQSVVEIHEKWMAQHGRSYKDELEKEMRLKI 70
K P+ +I L C S + + E+S+++++++W + H R ++ E R K+
Sbjct: 5 KFLIVPLVLIAFLCNICESFELERKDFESEKSLMQLYKRWSSHH-RISRNANEMHNRFKV 63
Query: 71 FKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALY----TGYKMPSPSHRSTTSST- 125
FK N +++ K N G ++ KL NQF+D+++DEFR +Y T YK T
Sbjct: 64 FKNNAKHVFKVNLMG-KSLKLKLNQFADMSDDEFRNMYSSNITYYKDLHAKKIEATGGRI 122
Query: 126 --FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQL 183
F Y++ + ++P+S+DWR KGAV IKNQ CG CWAFAAVAAVE I +I++ L+ L
Sbjct: 123 GGFMYEHAN--NIPSSIDWRKKGAVNAIKNQGRCGSCWAFAAVAAVESIHQIKTNELVSL 180
Query: 184 SEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
SE+++LDC + GC GG AF +++ N G+
Sbjct: 181 SEEEVLDCDYR-DGGCRGGFYNSAFEFMMDNDGV 213
Score = 71.6 bits (174), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 32/71 (45%), Positives = 48/71 (67%), Gaps = 7/71 (9%)
Query: 216 GIF--NGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR----DEGL 269
G+F N CG +DH V +VG+GT EDG +YW+I+N +G+ WG GYMK+ R +G+
Sbjct: 280 GMFTENDFCGFNIDHTVVVVGYGTDEDG-DYWIIRNQYGHRWGMNGYMKMQRGAHSPQGV 338
Query: 270 CGIGTRSSYPL 280
CG+ + +YP+
Sbjct: 339 CGMAMQPAYPV 349
>gi|530736|emb|CAA56915.1| cathepsin l [Nephrops norvegicus]
gi|1582621|prf||2119193B cathepsin L-related Cys protease
Length = 313
Score = 142 bits (359), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 75/174 (43%), Positives = 107/174 (61%), Gaps = 10/174 (5%)
Query: 48 EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEF 104
E + Q+GR Y D E+ R ++F++N + +E NK+ G T+K+ NQF D+TN+EF
Sbjct: 13 EHFKTQYGRKYGDAKEELYRQRVFQQNEQLVEAFNKKFENGEVTFKVAMNQFGDMTNEEF 72
Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFA 164
A+ GYK S R ++ F + M +DWR KGAVTP+K+Q +CG CWAF+
Sbjct: 73 NAVMKGYKKGS---RGEPTTVFTAEGRPMA---ADVDWRTKGAVTPVKDQGQCGSCWAFS 126
Query: 165 AVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGI 217
A ++EG +++ L+ LSEQ+L+DCST GN+GC GG AF YI N GI
Sbjct: 127 ATGSLEGQHFLKNNELVSLSEQELVDCSTEYGNDGCGGGWMTSAFDYIKDNGGI 180
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 32/57 (56%), Positives = 41/57 (71%), Gaps = 2/57 (3%)
Query: 224 TQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYP 279
T LDH V VG+GT E +YWL+KNSWG+ WGDAGY+K+ R+ + CGI + SYP
Sbjct: 256 TNLDHGVLAVGYGT-ESTEDYWLVKNSWGSGWGDAGYIKMSRNRDNNCGIASEPSYP 311
>gi|392881548|gb|AFM89606.1| cathepsin L [Callorhinchus milii]
Length = 338
Score = 142 bits (359), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 79/206 (38%), Positives = 119/206 (57%), Gaps = 18/206 (8%)
Query: 19 FIIITLLVS---CASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENL 75
F++++L ++ A + TH E+W + HG+SY ++ E+ R +++++L
Sbjct: 5 FVVLSLCLAGGLAAPSLDPGLDTH-------WEQWKSWHGKSY-EQKEETWRRMVWEKHL 56
Query: 76 EYIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
IE N E G +++LG N F D+ N+EFR L GYK +H+ S F N
Sbjct: 57 RVIEIHNLEHSLGKHSFRLGMNHFGDMPNEEFRQLMNGYKYKQ-THKKLQGSHFLEPNFQ 115
Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
+VP +DWRD+G VTP+K+Q +CG CWAF+ A+EG R+G L+ LSEQ L++CS
Sbjct: 116 --EVPKHVDWRDEGYVTPVKDQGQCGSCWAFSTTGALEGQHFRRTGQLVSLSEQNLVECS 173
Query: 193 -TNGNNGCLGGSREKAFAYIIQNQGI 217
GN GC GG ++AF Y+ N GI
Sbjct: 174 KPEGNEGCNGGLMDQAFQYVKDNGGI 199
Score = 65.1 bits (157), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 31/68 (45%), Positives = 41/68 (60%), Gaps = 5/68 (7%)
Query: 218 FNGVCG-TQLDHAVTIVGFGTTE---DGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGI 272
F C T LDH V +VG+G + DG YW++KNSW WG GY+ + +D + CGI
Sbjct: 270 FEAECSSTDLDHGVLVVGYGVEKRDTDGKKYWIVKNSWSEKWGQNGYILMAKDKDNHCGI 329
Query: 273 GTRSSYPL 280
T +SYPL
Sbjct: 330 ATAASYPL 337
>gi|395502422|ref|XP_003755580.1| PREDICTED: pro-cathepsin H [Sarcophilus harrisii]
Length = 334
Score = 142 bits (359), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 95/308 (30%), Positives = 138/308 (44%), Gaps = 80/308 (25%)
Query: 46 IHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFR 105
+ + WM Q+ + Y E RL F EN I+K N GN ++ + NQFSD++ DEF+
Sbjct: 33 LFKSWMKQNNKKYHLS-EYHHRLHTFLENKRRIDKHN-AGNHSFTMRLNQFSDMSFDEFK 90
Query: 106 ALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGA-VTPIKNQKECGCCWAFA 164
Y M P + S T + + + P S+DWR KG V+P+KNQ CG CW F+
Sbjct: 91 KTYL---MRLPQNCSATKGSHVRR---LGPYPESVDWRKKGNFVSPVKNQGGCGSCWTFS 144
Query: 165 AVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNN-GCLGGSREKAFAYIIQNQGIF----- 218
+E I +G L+ L+EQQL+DC+ + NN GC GG +AF YI+ N+GI
Sbjct: 145 TTGGLESAVAIATGKLLSLAEQQLVDCAQDFNNHGCNGGLPSQAFEYIMYNKGIMGEDTY 204
Query: 219 -----NGVCGTQLDHAVTIV--------------------------GFGTTEDGANY--- 244
+G C Q + A+ V F T+D +Y
Sbjct: 205 PYEGKDGTCKFQPNKAIAFVKDVANITAYDEEAMTEAVAHHNPVSFAFEVTDDFLSYHKG 264
Query: 245 -------------------------------WLIKNSWGNTWGDAGYMKIVRDEGLCGIG 273
W++KNSWG +WG+ GY I R + +CG+
Sbjct: 265 IYSNPKCSKSPDKVNHAVLAVGYGKENGIPYWIVKNSWGTSWGNNGYFLIERGKNMCGLA 324
Query: 274 TRSSYPLA 281
+SYP+
Sbjct: 325 DCASYPIP 332
>gi|338717354|ref|XP_001492337.3| PREDICTED: pro-cathepsin H-like [Equus caballus]
Length = 323
Score = 142 bits (359), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 99/305 (32%), Positives = 130/305 (42%), Gaps = 80/305 (26%)
Query: 48 EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRAL 107
+ WM QH + Y E E RL+ F N I A+ GN T+++G NQFS + F L
Sbjct: 24 KSWMVQHQKKYSSE-EYHHRLQTFVSNWRKIN-AHNTGNHTFRMGLNQFSAMN---FAEL 78
Query: 108 YTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGA-VTPIKNQKECGCCWAFAAV 166
Y P + S T + P S+DWR KG V+P+KNQ CG CW F+
Sbjct: 79 KHKYLWSEPQNCSATKGNYL---RGAGPYPPSVDWRKKGNFVSPVKNQGGCGSCWTFSTT 135
Query: 167 AAVEGITKIRSGNLIQLSEQQLLDCSTNGNN-GCLGGSREKAFAYIIQNQGIF------- 218
A+E I SG L+ L+EQQL+DC+ N NN GC GG +AF YI N+GI
Sbjct: 136 GALESAVAIASGKLLSLAEQQLVDCAQNFNNHGCQGGLPSQAFEYIRYNKGIMGEDTYPY 195
Query: 219 ---NGVCGTQLDHAVTIV--------------------------GFGTTED--------- 240
+G C Q + A+ V F TED
Sbjct: 196 KGQDGDCKFQPNKAIAFVKDVANITLNDEKAMVEAVALYNPVSFAFEVTEDFMMYRKGIY 255
Query: 241 -------------------------GANYWLIKNSWGNTWGDAGYMKIVRDEGLCGIGTR 275
G YW++KNSWG WG GY I R + +CG+
Sbjct: 256 SSTSCHKTPDKVNHAVLAVGYGEENGIPYWIVKNSWGPHWGMNGYFLIERGKNMCGLAAC 315
Query: 276 SSYPL 280
+SYP+
Sbjct: 316 ASYPI 320
>gi|346469447|gb|AEO34568.1| hypothetical protein [Amblyomma maculatum]
Length = 333
Score = 142 bits (359), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 74/174 (42%), Positives = 109/174 (62%), Gaps = 6/174 (3%)
Query: 48 EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN---KEGNRTYKLGTNQFSDLTNDEF 104
E + + H ++YK +E+ +R KIF EN +I K N +G +YKLG NQF+DL EF
Sbjct: 28 EAFKSTHKKTYKSNVEELLRFKIFTENSLFIAKHNVKYAKGLVSYKLGINQFADLLPHEF 87
Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFA 164
+ GY+ + R +T NL+ + +P ++DWR KGAVTP+K+Q +CG CWAF+
Sbjct: 88 VKMMNGYQGKRLAGRGST--YLPPANLNDSSLPKTVDWRKKGAVTPVKDQGQCGSCWAFS 145
Query: 165 AVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGI 217
+ ++EG +++G L+ LSEQ L+DCS+ GN GC GG + +F YI N GI
Sbjct: 146 STGSLEGQHFLKTGKLVSLSEQNLVDCSSAYGNQGCNGGLMDNSFNYIKANGGI 199
Score = 64.7 bits (156), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 31/75 (41%), Positives = 50/75 (66%), Gaps = 4/75 (5%)
Query: 209 AYIIQNQGIFNGV-CGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD 266
++ + ++G+++ C ++ LDH V VG+G ++G YWL+KNSW TWG GY+ + RD
Sbjct: 259 SFQLYSEGVYDEPNCSSESLDHGVLAVGYGV-KNGKKYWLVKNSWAETWGQDGYILMSRD 317
Query: 267 E-GLCGIGTRSSYPL 280
+ CGI + +SYPL
Sbjct: 318 KNNQCGIASSASYPL 332
>gi|118119|sp|P13277.2|CYSP1_HOMAM RecName: Full=Digestive cysteine proteinase 1; Flags: Precursor
gi|11051|emb|CAA45127.1| cysteine proteinase preproenzyme [Homarus americanus]
gi|228243|prf||1801240A Cys protease 1
Length = 322
Score = 142 bits (359), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 75/176 (42%), Positives = 106/176 (60%), Gaps = 14/176 (7%)
Query: 48 EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEF 104
E++ + GR Y D E+ RL +F +NL+YIE+ NK+ G TY L NQFSD+TN++F
Sbjct: 21 EEFKGKFGRKYVDLEEERYRLNVFLDNLQYIEEFNKKYERGEVTYNLAINQFSDMTNEKF 80
Query: 105 RALYTGYKM-PSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAF 163
A+ GYK P P+ T++ T +DWR KGAVTP+K+Q +CG CWAF
Sbjct: 81 NAVMKGYKKGPRPAAVFTSTDA--------APESTEVDWRTKGAVTPVKDQGQCGSCWAF 132
Query: 164 AAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG--NNGCLGGSREKAFAYIIQNQGI 217
+ +EG +++G L+ LSEQQL+DC+ N GC GG E+A Y+ N G+
Sbjct: 133 STTGGIEGQHFLKTGRLVSLSEQQLVDCAGGSYYNQGCNGGWVERAIMYVRDNGGV 188
Score = 66.6 bits (161), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 29/57 (50%), Positives = 42/57 (73%), Gaps = 2/57 (3%)
Query: 224 TQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE-GLCGIGTRSSYP 279
+QLDHAV VG+G+ E G ++WL+KNSW +WG++GY+K+ R+ CGI T + YP
Sbjct: 265 SQLDHAVLAVGYGS-EGGQDFWLVKNSWATSWGESGYIKMARNRNNNCGIATDACYP 320
>gi|355558399|gb|EHH15179.1| hypothetical protein EGK_01236 [Macaca mulatta]
gi|380809986|gb|AFE76868.1| cathepsin S isoform 1 preproprotein [Macaca mulatta]
gi|383416071|gb|AFH31249.1| cathepsin S isoform 1 preproprotein [Macaca mulatta]
gi|383416073|gb|AFH31250.1| cathepsin S isoform 1 preproprotein [Macaca mulatta]
gi|383416075|gb|AFH31251.1| cathepsin S isoform 1 preproprotein [Macaca mulatta]
gi|383416077|gb|AFH31252.1| cathepsin S isoform 1 preproprotein [Macaca mulatta]
gi|383416079|gb|AFH31253.1| cathepsin S isoform 1 preproprotein [Macaca mulatta]
Length = 331
Score = 142 bits (359), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 78/206 (37%), Positives = 122/206 (59%), Gaps = 14/206 (6%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIH-EKWMAQHGRSYKDELEKEMRLKIFKENLE 76
M ++ +L+ C+S V H+ ++ H W +G+ YK++ E+ +R I+++NL+
Sbjct: 1 MKQLVCVLLVCSSAVAQ---LHKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLK 57
Query: 77 YIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
++ N E G +Y LG N D+T++E +L + ++PS R+ T Y++ +
Sbjct: 58 FVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLRVPSQWQRNIT-----YKSNAN 112
Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST 193
+P S+DWR+KG VT +K Q CG CWAF+AV A+E K+++G L+ LS Q L+DCST
Sbjct: 113 QILPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCST 172
Query: 194 N--GNNGCLGGSREKAFAYIIQNQGI 217
GN GC GG +AF YII N GI
Sbjct: 173 EKYGNKGCNGGFMTRAFQYIIDNNGI 198
>gi|402856105|ref|XP_003892640.1| PREDICTED: cathepsin S isoform 1 [Papio anubis]
Length = 331
Score = 142 bits (359), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 79/206 (38%), Positives = 120/206 (58%), Gaps = 14/206 (6%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIH-EKWMAQHGRSYKDELEKEMRLKIFKENLE 76
M ++ +L+ C+S V H+ ++ H W +G+ YK++ E+ +R I+++NL+
Sbjct: 1 MKRLVCVLLVCSSAVAQ---LHKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLK 57
Query: 77 YIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
++ N E G +Y LG N D+T++E +L + ++PS R+ T + Q L
Sbjct: 58 FVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLRVPSQWQRNITYKSNPNQML-- 115
Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST 193
P S+DWR+KG VT +K Q CG CWAF+AV A+E K+++G L+ LS Q L+DCST
Sbjct: 116 ---PDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCST 172
Query: 194 N--GNNGCLGGSREKAFAYIIQNQGI 217
GN GC GG +AF YII N GI
Sbjct: 173 EKYGNKGCNGGFMTRAFQYIIDNNGI 198
>gi|260516654|gb|ACX43954.1| cysteine protease 1 [Brachiaria hybrid cultivar]
gi|260516656|gb|ACX43955.1| cysteine protease 1 [Brachiaria hybrid cultivar]
gi|260516658|gb|ACX43956.1| cysteine protease 1 [Brachiaria hybrid cultivar]
gi|260516660|gb|ACX43957.1| cysteine protease 1 [Brachiaria hybrid cultivar]
gi|260516662|gb|ACX43958.1| cysteine protease 2 [Brachiaria hybrid cultivar]
gi|260516664|gb|ACX43959.1| cysteine protease 2 [Brachiaria hybrid cultivar]
gi|260516666|gb|ACX43960.1| cysteine protease 2 [Brachiaria hybrid cultivar]
gi|260516668|gb|ACX43961.1| cysteine protease 2 [Brachiaria hybrid cultivar]
gi|260516670|gb|ACX43962.1| cysteine protease 2 [Brachiaria hybrid cultivar]
Length = 338
Score = 142 bits (359), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 98/318 (30%), Positives = 145/318 (45%), Gaps = 74/318 (23%)
Query: 30 SQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTY 89
S + S E + ++ +M Q+ ++Y E R FK N+E I N N +Y
Sbjct: 25 SALFSEEVPSEVMLQDMFTAFMKQYSKAYS-HAEFSSRFNQFKANVETIRLHNTLANASY 83
Query: 90 KLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVT 149
+G N+F+DL+ +EF+ Y GYK R S +Q + PTS+DWR AVT
Sbjct: 84 TMGLNEFADLSFEEFKGKYFGYKHV---EREFARSNNLHQEVEA--APTSIDWRTSNAVT 138
Query: 150 PIKNQKECGCCWAFAAVAAVEGITKIRSGN-LIQLSEQQLLDCSTN-GNNGCLGGSREKA 207
PIK+Q +CG CWAF+A ++EG ++ + L LSEQQL+DCST+ GN GC GG + A
Sbjct: 139 PIKDQGQCGSCWAFSATGSIEGAWVLQGKHTLTSLSEQQLVDCSTSYGNAGCNGGLMDYA 198
Query: 208 FAYIIQNQGI----------FNGVCGTQLDHAVTIVGF---------------------- 235
F YII N+GI G+C VTI G+
Sbjct: 199 FEYIIANKGICAESAYPYKGVGGLCQKSCTKVVTISGYKDVASGDEASLLNAVGTVGPVS 258
Query: 236 -------------------GTTEDGANYWLIKNSWGNT---------------WGDAGYM 261
GT ++ ++ +G T WG++GY+
Sbjct: 259 VAIEADQAGFQFYSSGVFSGTCGHNLDHGVLAVGYGTTGSQDYWIVKNSWGTSWGESGYI 318
Query: 262 KIVRDEGLCGIGTRSSYP 279
+++R++ CGI + SYP
Sbjct: 319 RMIRNKNQCGIAIQPSYP 336
>gi|330805273|ref|XP_003290609.1| hypothetical protein DICPUDRAFT_92519 [Dictyostelium purpureum]
gi|325079248|gb|EGC32857.1| hypothetical protein DICPUDRAFT_92519 [Dictyostelium purpureum]
Length = 333
Score = 142 bits (359), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 78/186 (41%), Positives = 112/186 (60%), Gaps = 10/186 (5%)
Query: 33 VSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLG 92
V S T++ S + WM +H R+Y E E R + FKEN+++I K N + + T LG
Sbjct: 23 VFSSQTYQTSFI----GWMRKHDRAYSHE-EFTDRYQAFKENMDFIHKWNSQESDTV-LG 76
Query: 93 TNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIK 152
+F+DLTN+E++ Y G K+ + + K+ + P S+DWR+KGAV+ +K
Sbjct: 77 LTKFADLTNEEYKKHYLGIKVNVKKNLNAAQKGLKFFKFTG---PDSIDWREKGAVSQVK 133
Query: 153 NQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYI 211
+Q +CG CW+F+ AVEG +I+SGN++ LSEQ L+DCS GN GC GG AF YI
Sbjct: 134 DQGQCGSCWSFSTTGAVEGAHQIKSGNMVSLSEQNLVDCSGQYGNQGCEGGLMVNAFEYI 193
Query: 212 IQNQGI 217
I N GI
Sbjct: 194 IDNGGI 199
Score = 63.9 bits (154), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 29/57 (50%), Positives = 41/57 (71%), Gaps = 2/57 (3%)
Query: 226 LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPLA 281
LDH V VG+GT E G +Y++IKNSWG TWG GY+ + R+ + CG+ T +SYP++
Sbjct: 277 LDHGVLAVGYGTLE-GKDYYIIKNSWGPTWGQDGYIFMSRNAQNQCGVATMASYPIS 332
>gi|214015231|gb|ACJ62232.1| cysteine protease [Zea mays subsp. parviglumis]
Length = 255
Score = 142 bits (359), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 72/191 (37%), Positives = 113/191 (59%), Gaps = 20/191 (10%)
Query: 46 IHEKWMAQHGR--------------SYKDELEKEMRLKIFKENLEYIEKANKE---GNRT 88
++E W ++HGR ++E ++ +RL++F++NL YI+ N E G T
Sbjct: 1 MYEAWKSKHGRGGSNNDDCDMAPGDDEQEEEDRRLRLEVFRDNLRYIDAHNAEADAGLHT 60
Query: 89 YKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD--VPTSLDWRDKG 146
++LG F+DLT DE+R G++ + Y+ S +P ++DWR G
Sbjct: 61 FRLGLTPFADLTLDEYRGHVLGFRARGRRSGARYGHGHGYRARSRGGDLLPDAIDWRQLG 120
Query: 147 AVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREK 206
AVT +K+Q++CG CWAF+AVAA+EG+ I +GNL+ LSEQ+++DC ++GC GG E
Sbjct: 121 AVTEVKDQQQCGGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ-DSGCDGGQMEN 179
Query: 207 AFAYIIQNQGI 217
AF ++I N GI
Sbjct: 180 AFRFVIGNGGI 190
>gi|75994624|gb|ABA33833.1| cysteine protease Mir1 [Zea mays subsp. parviglumis]
Length = 255
Score = 142 bits (359), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 72/190 (37%), Positives = 113/190 (59%), Gaps = 19/190 (10%)
Query: 46 IHEKWMAQHGR-------------SYKDELEKEMRLKIFKENLEYIEKANKE---GNRTY 89
++E W ++HGR + E ++ +RL++F++NL YI+K N E G T+
Sbjct: 1 MYEAWKSKHGRGGSSNDDCDMAPGDDEQEEDRRLRLEVFRDNLRYIDKHNAEADAGLHTF 60
Query: 90 KLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD--VPTSLDWRDKGA 147
+LG F+DLT DE+R G++ + + Y+ +P ++DWR GA
Sbjct: 61 RLGLTPFADLTLDEYRGRVLGFRARARRSGARYGHGHGYRARPRGGDLLPDAIDWRQLGA 120
Query: 148 VTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKA 207
VT +K+Q++CG CWAF+AVAA+EG+ I +GNL+ LSEQ+++DC ++GC GG E A
Sbjct: 121 VTEVKDQQQCGGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ-DSGCDGGQMENA 179
Query: 208 FAYIIQNQGI 217
F ++I N GI
Sbjct: 180 FLFVIGNGGI 189
>gi|426379977|ref|XP_004056662.1| PREDICTED: pro-cathepsin H [Gorilla gorilla gorilla]
Length = 335
Score = 142 bits (359), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 95/303 (31%), Positives = 133/303 (43%), Gaps = 80/303 (26%)
Query: 50 WMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYT 109
WM++H ++Y E E RL+ F N I A+ GN T+K+ NQFSD++ E + Y
Sbjct: 38 WMSKHRKTYSTE-EYHHRLQTFASNWRKIN-AHNNGNHTFKMALNQFSDMSFAEIKHKYL 95
Query: 110 GYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGA-VTPIKNQKECGCCWAFAAVAA 168
P + S T S + P S+DWR KG V+P+KNQ CG CW F+ A
Sbjct: 96 ---WSEPQNCSATKSNYL---RGTGPYPPSVDWRKKGNFVSPVKNQGACGSCWTFSTTGA 149
Query: 169 VEGITKIRSGNLIQLSEQQLLDCSTNGNN-GCLGGSREKAFAYIIQNQGIF--------- 218
+E I +G ++ L+EQQL+DC+ + NN GC GG +AF YI+ N+GI
Sbjct: 150 LESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYILYNKGIMGEDTYPYQG 209
Query: 219 -NGVCGTQLDHAVTIV--------------------------GFGTTED----------- 240
+G C Q A+ V F T+D
Sbjct: 210 KDGYCKFQPGKAIGFVKDVANITIYDEEAMVEAVALYNPVSFAFEVTQDFMMYRTGIYSS 269
Query: 241 -----------------------GANYWLIKNSWGNTWGDAGYMKIVRDEGLCGIGTRSS 277
G YW++KNSWG WG GY I R + +CG+ +S
Sbjct: 270 TSCHKTPDKVNHAVLAVGYGEKNGIPYWIVKNSWGPKWGMNGYFLIERGKNMCGLAACAS 329
Query: 278 YPL 280
YP+
Sbjct: 330 YPI 332
>gi|28192371|gb|AAK07729.1| NTCP23-like cysteine proteinase [Nicotiana tabacum]
Length = 360
Score = 142 bits (358), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 100/303 (33%), Positives = 141/303 (46%), Gaps = 78/303 (25%)
Query: 49 KWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALY 108
++ ++G+ Y+ E + R ++F +NL+ I NK+G +YKLG N+F+DLT DEFR
Sbjct: 63 RFAHRYGKRYESVEEIKQRFEVFLDNLKMIRSHNKKG-LSYKLGVNEFTDLTWDEFRRDR 121
Query: 109 TGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAA 168
G + + +TT K N+ + P + WR+ G V+P+KNQ +CG CW F+ A
Sbjct: 122 LG---AAQNCSATTKGNLKVTNVVL---PETKGWREAGIVSPVKNQGKCGSCWTFSTTGA 175
Query: 169 VEGITKIRSGNLIQLSEQQLLDCSTNGNN-GCLGGSREKAFAYIIQNQGI---------- 217
+E G I LSEQQL+DC+ NN GC GG +AF YI N G+
Sbjct: 176 LEAAYSQAFGKGISLSEQQLVDCAGAFNNFGCNGGLPSQAFEYIKSNGGLDTEEAYPYTG 235
Query: 218 FNGVCG---------------------TQLDHAVTIV-----------GF---------- 235
NG+C +L +AV +V GF
Sbjct: 236 KNGLCKFSSENVGVKVIDSVNITLGAEDELKYAVALVRPVSIAFEVIKGFKQYKSGVYTS 295
Query: 236 ---GTTEDGANY---------------WLIKNSWGNTWGDAGYMKIVRDEGLCGIGTRSS 277
G T N+ WLIKNSWG WGD GY K+ + +CGI T +S
Sbjct: 296 TECGNTPMDVNHAVLAVGYGVENGVPYWLIKNSWGADWGDNGYFKMEMGKNMCGIATCAS 355
Query: 278 YPL 280
YP+
Sbjct: 356 YPV 358
>gi|414887427|tpg|DAA63441.1| TPA: hypothetical protein ZEAMMB73_713985 [Zea mays]
Length = 355
Score = 142 bits (358), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 74/189 (39%), Positives = 109/189 (57%), Gaps = 26/189 (13%)
Query: 50 WMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYT 109
W A + RSY E+ R +++++N+E IE N+ +Y+L F+DLT++EF A +T
Sbjct: 43 WQATYNRSYLTAAERLRRFEVYRQNMELIEATNRRAELSYQLSETPFTDLTSEEFLATHT 102
Query: 110 GYKMPSPSHRSTTSSTFKYQNLSMT---------------------DVPTSLDWRDKGAV 148
M + H S + +++ L T DVP S+DWR KGAV
Sbjct: 103 ---MSTRLHASEAAR--RHRELITTHAGPVSDGGRQWNRRNYTTDLDVPESVDWRTKGAV 157
Query: 149 TPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAF 208
T +K+Q CG CW+FA VAA+EG+ KIR+G L+ LSEQ++LDCS+ NNGC GG+ A
Sbjct: 158 TTVKDQGACGGCWSFATVAAIEGLHKIRTGQLVSLSEQEVLDCSSPPNNGCHGGNPAAAI 217
Query: 209 AYIIQNQGI 217
++ N G+
Sbjct: 218 DWVSANGGL 226
Score = 63.5 bits (153), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 32/95 (33%), Positives = 51/95 (53%), Gaps = 9/95 (9%)
Query: 172 ITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQ---GIFNGVCGTQ-LD 227
+ KIR L+ + + L+ + +G + + IQ G+F+G C + L+
Sbjct: 250 VAKIRGRKLVDQNNEAALEVAVAQQPVAVGMN-----VHPIQQHYKSGVFHGPCDPEDLN 304
Query: 228 HAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMK 262
HAVT+VG+G G YW++KNSWG WG+ GY +
Sbjct: 305 HAVTMVGYGAESGGRKYWIVKNSWGEKWGEKGYFR 339
>gi|392873948|gb|AFM85806.1| cathepsin L [Callorhinchus milii]
Length = 338
Score = 142 bits (358), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 79/206 (38%), Positives = 119/206 (57%), Gaps = 18/206 (8%)
Query: 19 FIIITLLVS---CASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENL 75
F++++L ++ A + TH E+W + HG+SY ++ E+ R +++++L
Sbjct: 5 FVVLSLCLAGGLAAPSLDPGLDTH-------WEQWKSWHGKSY-EQKEETWRRMVWEKHL 56
Query: 76 EYIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
IE N E G +++LG N F D+ N+EFR L GYK +H+ S F N
Sbjct: 57 RVIEIHNLEHSLGKHSFRLGMNHFGDMPNEEFRQLMNGYKYKQ-THKKLQGSHFLEPNF- 114
Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
+VP +DWRD+G VTP+K+Q +CG CWAF+ A+EG R+G L+ LSEQ L++CS
Sbjct: 115 -LEVPKHVDWRDEGYVTPVKDQGQCGSCWAFSTTGALEGQHFRRTGQLVSLSEQNLVECS 173
Query: 193 -TNGNNGCLGGSREKAFAYIIQNQGI 217
GN GC GG ++AF Y+ N GI
Sbjct: 174 KPEGNEGCNGGLMDQAFQYVKDNGGI 199
Score = 60.5 bits (145), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 30/68 (44%), Positives = 40/68 (58%), Gaps = 5/68 (7%)
Query: 218 FNGVCG-TQLDHAVTIVGFGTTE---DGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGI 272
F C T LDH V +VG+G + DG YW++KNSW G GY+ + +D + CGI
Sbjct: 270 FEAECSSTDLDHGVLVVGYGVEKRDTDGKKYWIVKNSWSEKLGQNGYILMAKDKDNHCGI 329
Query: 273 GTRSSYPL 280
T +SYPL
Sbjct: 330 ATAASYPL 337
>gi|110349473|gb|ABG73217.1| cathepsin L 1 precursor [Diaprepes abbreviatus]
Length = 322
Score = 142 bits (358), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 77/197 (39%), Positives = 115/197 (58%), Gaps = 16/197 (8%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
+FI LLV+ ++ V+ Q+ + +HG++YK+++E+ R IFK+NL
Sbjct: 3 VFIAACLLVAVSATVLEETGVKFQA-------FKLKHGKTYKNQVEETARFNIFKDNLRA 55
Query: 78 IEKAN---KEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT 134
IE+ N ++G +YK G N+F+D+T +EFRA T P H +TT L+
Sbjct: 56 IEQHNVLYEQGLVSYKKGINRFTDMTQEEFRAFLTLSSSKKP-HFNTTEHV-----LTGL 109
Query: 135 DVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN 194
VP S+DWR KG VT +K+Q CG CWAF+ + E ++G L+ LSEQQL+DCST+
Sbjct: 110 AVPDSIDWRTKGQVTGVKDQGNCGSCWAFSVTGSTEAAYYRKAGKLVSLSEQQLVDCSTD 169
Query: 195 GNNGCLGGSREKAFAYI 211
N GC GG ++ F Y+
Sbjct: 170 INAGCNGGYLDETFTYV 186
Score = 64.7 bits (156), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 23/57 (40%), Positives = 43/57 (75%), Gaps = 1/57 (1%)
Query: 224 TQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDEGLCGIGTRSSYPL 280
++L+H V +VG+GT+ +G YW++KNSWG ++G++GY +++R + CG+ + YP+
Sbjct: 266 SELNHGVLVVGYGTS-NGKKYWIVKNSWGGSFGESGYFRLLRGKNECGVAEDTVYPI 321
>gi|148224682|ref|NP_001086670.1| cathepsin S [Xenopus laevis]
gi|50418223|gb|AAH77285.1| Ctss-prov protein [Xenopus laevis]
Length = 320
Score = 142 bits (358), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 73/172 (42%), Positives = 105/172 (61%), Gaps = 5/172 (2%)
Query: 50 WMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEFRA 106
W +H + Y+DE E +R +++NL + N E G TY+LG N +D+T++E ++
Sbjct: 17 WKNKHTKEYEDESEDLLRRITWEKNLNTVNMHNLEYSMGMHTYELGMNHLADMTSEEIKS 76
Query: 107 LYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAV 166
TG +P S R T S+ K L VP S+DWR+KG V+ +KNQ CG CWAF+AV
Sbjct: 77 KMTGLILPPHSERKATFSSQKNSTLG-GKVPDSIDWREKGCVSEVKNQGGCGSCWAFSAV 135
Query: 167 AAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGI 217
A+EG +++G ++ LS Q L+DCS+ GN GC GG +AF Y+I N GI
Sbjct: 136 GALEGQLMLKTGKIVSLSPQNLVDCSSKYGNKGCSGGFMTRAFQYVIDNNGI 187
Score = 68.6 bits (166), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 33/82 (40%), Positives = 53/82 (64%), Gaps = 6/82 (7%)
Query: 200 LGGSREKAFAYIIQNQGIF-NGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDA 258
+ G+R F Y G++ + C +++H V VG+GT +G ++WL+KNSWG +GD
Sbjct: 241 IDGTRPTFFLY---KSGVYSDPSCSQEVNHGVLAVGYGTL-NGQDFWLLKNSWGTKYGDQ 296
Query: 259 GYMKIVRD-EGLCGIGTRSSYP 279
GY++I R+ E LCG+ + +SYP
Sbjct: 297 GYVRIARNKENLCGVASYTSYP 318
>gi|440793751|gb|ELR14926.1| Cysteine proteinase 5, putative [Acanthamoeba castellanii str.
Neff]
Length = 326
Score = 142 bits (358), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 76/197 (38%), Positives = 111/197 (56%), Gaps = 10/197 (5%)
Query: 23 TLLVSCASQVVSSR-STHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKA 81
TLL C + V+S + + + WM +H +SY +E E R +++EN YIE
Sbjct: 5 TLLALCVALFVASTFAVSHDPLTGVFADWMQEHQKSYANE-EFVYRWNVWRENYLYIEAH 63
Query: 82 NKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLD 141
N + N+++ L N+F DLTN EF L+ G + + + + +P D
Sbjct: 64 NHQ-NKSFHLAMNKFGDLTNAEFNKLFKGLSITADQAKQESDIA------PAPGLPADFD 116
Query: 142 WRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCL 200
WR KGAVT +KNQ +CG CW+F+ + EG ++ G L LSEQ L+DCST+ GN+GC
Sbjct: 117 WRQKGAVTHVKNQGQCGSCWSFSTTGSTEGANFLKHGRLTSLSEQNLVDCSTSYGNHGCN 176
Query: 201 GGSREKAFAYIIQNQGI 217
GG + AF YII+N+GI
Sbjct: 177 GGLMDYAFEYIIRNKGI 193
Score = 63.2 bits (152), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 29/59 (49%), Positives = 42/59 (71%), Gaps = 2/59 (3%)
Query: 224 TQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE-GLCGIGTRSSYPLA 281
++LDH V VG+G DG +YWL+KNSWG WG +GY+++ R++ CGI T +S+P A
Sbjct: 269 SRLDHGVLAVGWGV-RDGKDYWLVKNSWGADWGLSGYIEMSRNKHNQCGIATAASHPHA 326
>gi|242055323|ref|XP_002456807.1| hypothetical protein SORBIDRAFT_03g043220 [Sorghum bicolor]
gi|241928782|gb|EES01927.1| hypothetical protein SORBIDRAFT_03g043220 [Sorghum bicolor]
Length = 369
Score = 142 bits (358), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 72/182 (39%), Positives = 109/182 (59%), Gaps = 8/182 (4%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
++++ +++E+W H EK R FKEN+ +I NK G+R Y+L N+F D+
Sbjct: 35 DEALWDLYERWQTHHHVHRHHG-EKGRRFGTFKENVRFIHAHNKRGDRPYRLSLNRFGDM 93
Query: 100 TNDEFRALYTGYKM----PSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQK 155
+EFR+ + ++ + S + F Y + TD+P S+DWR +GAVT +K+Q
Sbjct: 94 GREEFRSTFADSRINDLRRAESPAAPAVPGFMYDGV--TDLPPSVDWRKEGAVTAVKDQG 151
Query: 156 ECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQ 215
CG CWAF+ V +VEGI IR+G+L+ LSEQ+L+DC T+ NGC GG E AF +I
Sbjct: 152 HCGSCWAFSTVVSVEGINAIRTGSLVSLSEQELIDCDTD-ENGCQGGLMENAFEFIKSYG 210
Query: 216 GI 217
G+
Sbjct: 211 GV 212
Score = 89.0 bits (219), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 35/75 (46%), Positives = 53/75 (70%), Gaps = 3/75 (4%)
Query: 209 AYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR--- 265
A+ ++G+F G CGT LDH V VG+G ++DG YW++KNSWG +WG+ GY+++ R
Sbjct: 273 AFQFYSEGVFTGDCGTDLDHGVAAVGYGVSDDGTAYWIVKNSWGPSWGEGGYIRMQRGAG 332
Query: 266 DEGLCGIGTRSSYPL 280
+ GLCGI +S+P+
Sbjct: 333 NGGLCGIAMEASFPI 347
>gi|30023547|gb|AAO48766.2| cathepsin L-like cysteine proteinase [Tenebrio molitor]
Length = 337
Score = 142 bits (358), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 80/207 (38%), Positives = 115/207 (55%), Gaps = 15/207 (7%)
Query: 19 FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMA---QHGRSYKDELEKEMRLKIFKENL 75
F+I + SQ VS + E+W A H + Y+ E E+ R+KIF EN
Sbjct: 3 FLIFLAICVAGSQAVSFFDL-------VQEQWGAFKMTHNKQYQSETEERFRMKIFMENS 55
Query: 76 EYIEKANK---EGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSS-TFKYQNL 131
+ K NK +G ++KLG N+++D+ + EF + G+ RS S + +
Sbjct: 56 HTVAKHNKLYAQGLVSFKLGINKYADMLHHEFVQVLNGFNRTKSGLRSGESDDSVTFLPP 115
Query: 132 SMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDC 191
+ +P +DWRDKGAVTP+K+Q +CG CW+F+A ++EG +SG L+ LSEQ L+DC
Sbjct: 116 ANVQLPGQIDWRDKGAVTPVKDQGQCGSCWSFSATGSLEGQHFRQSGKLVSLSEQNLVDC 175
Query: 192 STN-GNNGCLGGSREKAFAYIIQNQGI 217
S GNNGC GG + AF YI N GI
Sbjct: 176 SEKFGNNGCNGGLMDNAFRYIKANGGI 202
Score = 83.6 bits (205), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 34/58 (58%), Positives = 45/58 (77%), Gaps = 1/58 (1%)
Query: 224 TQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE-GLCGIGTRSSYPL 280
+QLDH V +VG+GT +DG +YWL+KNSWG +WGD GY+K+ R+ CGI T +SYPL
Sbjct: 279 SQLDHGVLVVGYGTEDDGTDYWLVKNSWGKSWGDQGYIKMARNRNNNCGIATEASYPL 336
>gi|111226635|ref|XP_641720.2| cysteine proteinase [Dictyostelium discoideum AX4]
gi|38372247|sp|Q94504.1|CYSP7_DICDI RecName: Full=Cysteine proteinase 7; AltName: Full=Proteinase 1;
Flags: Precursor
gi|1644502|gb|AAC47482.1| cysteine proteinase [Dictyostelium discoideum]
gi|90970688|gb|EAL67742.2| cysteine proteinase [Dictyostelium discoideum AX4]
Length = 460
Score = 142 bits (358), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 82/203 (40%), Positives = 110/203 (54%), Gaps = 12/203 (5%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
M ++ L V S + + E WM H R Y E E R IFK N++Y
Sbjct: 1 MKVLSALCVLLVSVATAKQQLSEVEYRNAFTNWMIAHQRHYSSE-EFNGRYNIFKANMDY 59
Query: 78 IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
+ + N +G+ T LG N F+D++N+E+RA Y G + S T S + D
Sbjct: 60 VNEWNTKGSETV-LGLNVFADISNEEYRATYLGTPFDASSLEMTESD-------KIFDAS 111
Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSG--NLIQLSEQQLLDCS-TN 194
+DWR +GAVTPIKNQ +CG CW+F+ A EG + +G NL+ LSEQ L+DCS +
Sbjct: 112 AQVDWRTQGAVTPIKNQGQCGGCWSFSTTGATEGAQYLANGKKNLVSLSEQNLIDCSGSY 171
Query: 195 GNNGCLGGSREKAFAYIIQNQGI 217
GNNGC GG AF YII N+GI
Sbjct: 172 GNNGCEGGLMTLAFEYIINNKGI 194
Score = 44.7 bits (104), Expect = 0.051, Method: Compositional matrix adjust.
Identities = 18/40 (45%), Positives = 27/40 (67%), Gaps = 1/40 (2%)
Query: 243 NYWLIKNSWGNTWGDAGYMKIVR-DEGLCGIGTRSSYPLA 281
+YW++KNSWG +WG GY+ + + + CGI T +S P A
Sbjct: 417 DYWIVKNSWGTSWGMDGYILMTKGNNNQCGIATMASRPTA 456
>gi|387914010|gb|AFK10614.1| cathepsin L [Callorhinchus milii]
gi|392873762|gb|AFM85713.1| cathepsin L [Callorhinchus milii]
gi|392877488|gb|AFM87576.1| cathepsin L [Callorhinchus milii]
Length = 338
Score = 142 bits (358), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 79/206 (38%), Positives = 119/206 (57%), Gaps = 18/206 (8%)
Query: 19 FIIITLLVS---CASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENL 75
F++++L ++ A + TH E+W + HG+SY ++ E+ R +++++L
Sbjct: 5 FVVLSLCLAGGLAAPSLDPGLDTHW-------EQWKSWHGKSY-EQKEETWRRMVWEKHL 56
Query: 76 EYIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
IE N E G +++LG N F D+ N+EFR L GYK +H+ S F N
Sbjct: 57 RVIEIHNLEHSLGKHSFRLGMNHFGDMPNEEFRQLMNGYKYKQ-THKKLQGSHFLEPNF- 114
Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
+VP +DWRD+G VTP+K+Q +CG CWAF+ A+EG R+G L+ LSEQ L++CS
Sbjct: 115 -LEVPKHVDWRDEGYVTPVKDQGQCGSCWAFSTTGALEGQHFRRTGQLVSLSEQNLVECS 173
Query: 193 -TNGNNGCLGGSREKAFAYIIQNQGI 217
GN GC GG ++AF Y+ N GI
Sbjct: 174 KPEGNEGCNGGLMDQAFQYVKDNGGI 199
Score = 65.1 bits (157), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 31/68 (45%), Positives = 41/68 (60%), Gaps = 5/68 (7%)
Query: 218 FNGVCG-TQLDHAVTIVGFGTTE---DGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGI 272
F C T LDH V +VG+G + DG YW++KNSW WG GY+ + +D + CGI
Sbjct: 270 FEAECSSTDLDHGVLVVGYGVEKRDTDGKKYWIVKNSWSEKWGQNGYILMAKDKDNHCGI 329
Query: 273 GTRSSYPL 280
T +SYPL
Sbjct: 330 ATAASYPL 337
>gi|241602000|ref|XP_002405373.1| cathepsin-like protease, putative [Ixodes scapularis]
gi|215502535|gb|EEC12029.1| cathepsin-like protease, putative [Ixodes scapularis]
Length = 273
Score = 142 bits (358), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 77/208 (37%), Positives = 122/208 (58%), Gaps = 13/208 (6%)
Query: 15 TTPMFIIITLLVSCASQVVSSRSTHEQSVVEIH-EKWMAQHGRSYKDELEKEMRLKIFKE 73
T F+++ L+ A+ V E +++ E + A +G+SY + E++ R+ ++
Sbjct: 1 TMRRFLLVCFLIVAATAV------GENVDLDVEWETFKANYGKSYSSQAEEQFRMTVYMN 54
Query: 74 NLEYIEKANK---EGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQN 130
N + K N+ EG +Y+L N+FSDL ++EF G++ P +++T + N
Sbjct: 55 NKLKVAKHNEQYAEGKVSYQLAMNKFSDLLHEEFVRSRNGFRRIRPVKQAST--YMEPAN 112
Query: 131 LSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLD 190
+ P ++DWR KGAVTP+KNQ++CG CWAF+A ++EG +R+G L+ LSEQ L+D
Sbjct: 113 IEDVCFPQTVDWRKKGAVTPVKNQEQCGSCWAFSATGSLEGQHFLRTGKLVSLSEQNLVD 172
Query: 191 CSTN-GNNGCLGGSREKAFAYIIQNQGI 217
CS + GN GC GG + AF YI N GI
Sbjct: 173 CSDDFGNLGCSGGVMDDAFRYIKANGGI 200
>gi|354502595|ref|XP_003513369.1| PREDICTED: cathepsin L1-like [Cricetulus griseus]
Length = 330
Score = 142 bits (358), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 79/205 (38%), Positives = 120/205 (58%), Gaps = 15/205 (7%)
Query: 17 PMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLE 76
P+F + TL + VV + TH+ S+ + ++W +HG++Y + E + R +++ N +
Sbjct: 3 PIFFLATLCLG----VVPAAPTHDPSLDDEWQEWKTRHGKTYSMDEEGQKR-AVWENNRK 57
Query: 77 YIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
IE N++ G + L N F DLTN EFR L TG++ T +Q +
Sbjct: 58 MIELHNEDYTKGKHGFHLEMNAFGDLTNIEFRQLMTGFQ------SMGTKEMNVFQEPLL 111
Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS- 192
DVP S+DWR+ VTP+K+Q +C CWAF+AV ++EG ++G LI LSEQ L+DCS
Sbjct: 112 GDVPKSVDWRNLSYVTPVKDQGQCSSCWAFSAVGSLEGQIFRKTGQLISLSEQNLVDCSW 171
Query: 193 TNGNNGCLGGSREKAFAYIIQNQGI 217
+ GN GC GG E AF Y+ +N+G+
Sbjct: 172 SYGNIGCFGGLMEYAFRYVKENRGL 196
>gi|214015372|gb|ACJ62302.1| cysteine protease [Zea mays subsp. parviglumis]
Length = 250
Score = 142 bits (358), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 70/188 (37%), Positives = 113/188 (60%), Gaps = 19/188 (10%)
Query: 46 IHEKWMAQHGRSYKDE-------------LEKEMRLKIFKENLEYIEKANKE---GNRTY 89
++E W ++HGR ++ ++ +RL++F++NL YI+K N E G T+
Sbjct: 1 MYEAWKSKHGRGGSNDDCDMAPGDEEQEEEDRRLRLEVFRDNLRYIDKHNAEADAGLHTF 60
Query: 90 KLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVT 149
+LG F+DLT DE+R G++ + + + + +P ++DWR GAVT
Sbjct: 61 RLGLTPFADLTLDEYRGRVLGFRARGRRSGARYGHRARPRGGDL--LPDAIDWRQLGAVT 118
Query: 150 PIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFA 209
+K+Q++CG CWAF+AVAA+EG+ I +GNL+ LSEQ+++DC ++GC GG E AF
Sbjct: 119 EVKDQQQCGGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ-DSGCDGGQMENAFR 177
Query: 210 YIIQNQGI 217
++I N GI
Sbjct: 178 FVIGNGGI 185
>gi|195379496|ref|XP_002048514.1| GJ14012 [Drosophila virilis]
gi|194155672|gb|EDW70856.1| GJ14012 [Drosophila virilis]
Length = 327
Score = 142 bits (358), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 77/203 (37%), Positives = 124/203 (61%), Gaps = 10/203 (4%)
Query: 22 ITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKA 81
+ LL+ A + ++E + E + ++ +SY+D+ E+++R++IFK+N + I++
Sbjct: 4 VALLLIVAGVGCNRALSYEDVLASEFESFKVEYEKSYEDDGEEQLRMQIFKDNKQLIDRH 63
Query: 82 NKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKY-QNLSMTDVP 137
N+ G TY++G NQF+D+ EFR + M + S +S+ +Y + + ++P
Sbjct: 64 NERYAAGEETYEMGVNQFTDMLATEFRKI-----MLVNLNISDFTSSIEYIYSPANAEIP 118
Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNN 197
+ +DWR+KGAVTP+KNQ CG CWAF+A A+EG I++ LI LSEQ LLDCS+ NN
Sbjct: 119 SQVDWREKGAVTPVKNQGRCGSCWAFSAAGALEGQHFIQTKQLIPLSEQNLLDCSSRYNN 178
Query: 198 -GCLGGSREKAFAYIIQNQGIFN 219
GC GG A Y+ N+G+ N
Sbjct: 179 HGCGGGWPAAALMYVRDNRGMDN 201
Score = 70.5 bits (171), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 27/66 (40%), Positives = 48/66 (72%), Gaps = 2/66 (3%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDEG-LCGIGT 274
G+++ C Q +HA+ +VG+G+ + G ++WLIKNSWG WG+ GYM++ R++G LC + +
Sbjct: 263 GVYSHRCRQQANHAMLVVGYGSDQRGGDFWLIKNSWGG-WGEQGYMRLARNQGNLCHVAS 321
Query: 275 RSSYPL 280
+ +P+
Sbjct: 322 YAVFPI 327
>gi|417409876|gb|JAA51427.1| Putative cathepsin s, partial [Desmodus rotundus]
Length = 342
Score = 142 bits (357), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 78/206 (37%), Positives = 119/206 (57%), Gaps = 14/206 (6%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIH-EKWMAQHGRSYKDELEKEMRLKIFKENLE 76
M ++ +L+ C+S + H+ ++ H + W +G+ YK++ E+ +R I+++NL+
Sbjct: 12 MKWLVLVLLGCSSAMAQ---LHKDPTLDRHWDLWKKTYGKQYKEKNEEGVRRLIWEKNLK 68
Query: 77 YIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
++ N E G +Y LG N D+T++E AL + ++PS R+ T + Q L
Sbjct: 69 FVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVTALMSSLRVPSQWQRNVTYKSNPNQKL-- 126
Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST 193
P S+DWRDKG VT +K Q CG CWAF+AV A+E K+++G L+ LS Q L+DCS
Sbjct: 127 ---PDSVDWRDKGCVTDVKYQGSCGSCWAFSAVGALEAQVKLKTGKLVSLSAQNLVDCSV 183
Query: 194 N--GNNGCLGGSREKAFAYIIQNQGI 217
N GC GG +AF YII N GI
Sbjct: 184 GKYSNRGCNGGFMTEAFQYIIDNNGI 209
>gi|157093563|gb|ABV22436.1| cysteine proteinase [Oxyrrhis marina]
Length = 329
Score = 142 bits (357), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 72/177 (40%), Positives = 107/177 (60%), Gaps = 7/177 (3%)
Query: 42 SVVEIHEKW---MAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSD 98
+V +I +W A+ G SY E E+ R +F +N++ I + N +G+ TY LG NQF+D
Sbjct: 11 AVADIDAQWEEFKAKFGESYNGEEEEAERKGVFAQNVQLINEENSKGH-TYTLGVNQFAD 69
Query: 99 LTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECG 158
LT +EF Y G+K P+ + ++ + +PTS+DW +GAVTP+KNQ +CG
Sbjct: 70 LTVEEFSKTYMGFK--KPAQKYGDAAYLGRHVYNGEALPTSVDWSSQGAVTPVKNQGQCG 127
Query: 159 CCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-TNGNNGCLGGSREKAFAYIIQN 214
CW+F+ ++EG +I +G L+ LSEQQ +DC+ T GN GC GG + AF Y N
Sbjct: 128 SCWSFSTTGSLEGANEISTGKLVSLSEQQFVDCAGTYGNQGCNGGLMDSAFKYAEAN 184
>gi|226443040|ref|NP_001140018.1| Cathepsin L1 precursor [Salmo salar]
gi|221221188|gb|ACM09255.1| Cathepsin L1 precursor [Salmo salar]
Length = 338
Score = 142 bits (357), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 88/232 (37%), Positives = 121/232 (52%), Gaps = 25/232 (10%)
Query: 20 IIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIE 79
+ + +LV C S V ++ Q H W H +SY E E+ R ++++NL+ IE
Sbjct: 4 LYLAVLVLCVSAVCAAPRFDSQLEDHWH-LWKNWHSKSYH-ESEEGWRRMVWEKNLKKIE 61
Query: 80 KANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFK---YQNLSM 133
N E G +Y+LG N F D+TN+EFR GYK TT FK + +
Sbjct: 62 MHNLEHTMGKHSYRLGMNHFGDMTNEEFRQTMNGYK-------QTTERKFKGSLFMEPNY 114
Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS- 192
P ++DWR+KG VTP+K+Q CG CWAF+ A+EG ++G L+ LSEQ L+DCS
Sbjct: 115 LQAPKAVDWREKGYVTPVKDQGSCGSCWAFSTTGAMEGQQFRKTGKLVSLSEQNLVDCSR 174
Query: 193 TNGNNGCLGGSREKAFAYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANY 244
GN GC GG ++AF YI N G LD + GT ED +Y
Sbjct: 175 PEGNEGCNGGLMDQAFQYIQDNAG---------LDTEESYPYVGTDEDPCHY 217
Score = 66.6 bits (161), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 30/60 (50%), Positives = 39/60 (65%), Gaps = 4/60 (6%)
Query: 225 QLDHAVTIVGFG---TTEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 280
+LDH V +VG+G DG YW++KNSW WGD GY+ + +D + CGI T SSYPL
Sbjct: 278 ELDHGVLVVGYGFEGEDVDGKKYWIVKNSWSEKWGDKGYIYMAKDRKNHCGIATASSYPL 337
>gi|403376023|gb|EJY87990.1| Cathepsin L [Oxytricha trifallax]
Length = 343
Score = 142 bits (357), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 69/166 (41%), Positives = 107/166 (64%), Gaps = 7/166 (4%)
Query: 50 WMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYT 109
++A++G+SY + E + R + +++N+ + + N + T++LG N+F+D T +E++ L
Sbjct: 46 YLAKYGKSYGTKEEFQFRYEQYQKNMAKVAQYNGQNGNTFRLGINKFTDYTPEEYKVL-L 104
Query: 110 GYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAV 169
GYK S T + LS + P S+DWR+KGAVTP+K+Q +CG CWAF+A A+
Sbjct: 105 GYKPQS------KPMTLEASYLSEENTPASIDWREKGAVTPVKDQGQCGSCWAFSATGAL 158
Query: 170 EGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQ 215
EG +I + LI +SEQQL+DCS +GNNGC GG AF Y +N+
Sbjct: 159 EGHYQISNNKLISISEQQLVDCSHDGNNGCNGGEMYLAFDYASKNK 204
Score = 69.7 bits (169), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 35/69 (50%), Positives = 44/69 (63%), Gaps = 4/69 (5%)
Query: 216 GIFNGV-CGTQLDHAVTIVGFGTTEDGA-NYWLIKNSWGNTWGDAGYMKI--VRDEGLCG 271
GI N CGT LDH V VGFG E +Y+++KNSWG WGD G++KI V EG+CG
Sbjct: 271 GILNSKECGTNLDHGVLAVGFGHDEASKQDYFIVKNSWGQYWGDHGFIKIAAVDGEGICG 330
Query: 272 IGTRSSYPL 280
I + YP+
Sbjct: 331 IQMDAVYPI 339
>gi|223646726|gb|ACN10121.1| Cathepsin L1 precursor [Salmo salar]
gi|223672581|gb|ACN12472.1| Cathepsin L1 precursor [Salmo salar]
Length = 338
Score = 142 bits (357), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 88/232 (37%), Positives = 121/232 (52%), Gaps = 25/232 (10%)
Query: 20 IIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIE 79
+ + +LV C S V ++ Q H W H +SY E E+ R ++++NL+ IE
Sbjct: 4 LYLAVLVLCVSAVCAAPRFDSQLEDHWH-LWKNWHSKSYH-ESEEGWRRMVWEKNLKKIE 61
Query: 80 KANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFK---YQNLSM 133
N E G +Y+LG N F D+TN+EFR GYK TT FK + +
Sbjct: 62 MHNLEHTMGKHSYRLGMNHFGDMTNEEFRQTMNGYK-------QTTERKFKGSLFMEPNY 114
Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS- 192
P ++DWR+KG VTP+K+Q CG CWAF+ A+EG ++G L+ LSEQ L+DCS
Sbjct: 115 LQAPKAVDWREKGYVTPVKDQGSCGSCWAFSTTGAMEGQQFRKTGKLVSLSEQNLVDCSR 174
Query: 193 TNGNNGCLGGSREKAFAYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANY 244
GN GC GG ++AF YI N G LD + GT ED +Y
Sbjct: 175 PEGNEGCNGGLMDQAFQYIQDNAG---------LDTEESYPYVGTDEDPCHY 217
Score = 66.6 bits (161), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 30/60 (50%), Positives = 39/60 (65%), Gaps = 4/60 (6%)
Query: 225 QLDHAVTIVGFG---TTEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 280
+LDH V +VG+G DG YW++KNSW WGD GY+ + +D + CGI T SSYPL
Sbjct: 278 ELDHGVLVVGYGFEGEDVDGKKYWIVKNSWSEKWGDKGYIYMAKDRKNHCGIATASSYPL 337
>gi|195637152|gb|ACG38044.1| vignain precursor [Zea mays]
Length = 377
Score = 142 bits (357), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 72/158 (45%), Positives = 98/158 (62%), Gaps = 13/158 (8%)
Query: 68 LKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHR-------- 119
+FK N+ I + N+ + YKL N+F D+T DEFR Y G ++ HR
Sbjct: 70 FNVFKANVRLIHEFNRR-DEPYKLRLNRFGDMTADEFRRHYAGSRVAH--HRMFRGDRQG 126
Query: 120 STTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGN 179
S+ S++F Y + DVP S+DWR KGAVT +K+Q +CG CWAF+ +AAVEGI I++ N
Sbjct: 127 SSASASFMYAD--ARDVPASVDWRQKGAVTDVKDQGQCGSCWAFSTIAAVEGINAIKTKN 184
Query: 180 LIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
L LSEQQL+DC T N GC GG + AF YI ++ G+
Sbjct: 185 LTSLSEQQLVDCDTKANAGCNGGLMDYAFQYIAKHGGV 222
Score = 94.4 bits (233), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 38/71 (53%), Positives = 51/71 (71%), Gaps = 4/71 (5%)
Query: 214 NQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGL 269
++G+F+G CGT+LDH V VG+G T DG YWL+KNSWG WG+ GY+++ RD EG
Sbjct: 285 SEGVFSGRCGTELDHGVAAVGYGVTADGTKYWLVKNSWGPEWGEKGYIRMARDVAAKEGH 344
Query: 270 CGIGTRSSYPL 280
CGI +SYP+
Sbjct: 345 CGIAMEASYPV 355
>gi|293334761|ref|NP_001168296.1| uncharacterized protein LOC100382061 [Zea mays]
gi|223947281|gb|ACN27724.1| unknown [Zea mays]
Length = 322
Score = 142 bits (357), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 74/189 (39%), Positives = 109/189 (57%), Gaps = 26/189 (13%)
Query: 50 WMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYT 109
W A + RSY E+ R +++++N+E IE N+ +Y+L F+DLT++EF A +T
Sbjct: 10 WQATYNRSYLTAAERLRRFEVYRQNMELIEATNRRAELSYQLSETPFTDLTSEEFLATHT 69
Query: 110 GYKMPSPSHRSTTSSTFKYQNLSMT---------------------DVPTSLDWRDKGAV 148
M + H S + +++ L T DVP S+DWR KGAV
Sbjct: 70 ---MSTRLHASEAAR--RHRELITTHAGPVSDGGRQWNRRNYTTDLDVPESVDWRTKGAV 124
Query: 149 TPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAF 208
T +K+Q CG CW+FA VAA+EG+ KIR+G L+ LSEQ++LDCS+ NNGC GG+ A
Sbjct: 125 TTVKDQGACGGCWSFATVAAIEGLHKIRTGQLVSLSEQEVLDCSSPPNNGCHGGNPAAAI 184
Query: 209 AYIIQNQGI 217
++ N G+
Sbjct: 185 DWVSANGGL 193
Score = 63.9 bits (154), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 32/95 (33%), Positives = 51/95 (53%), Gaps = 9/95 (9%)
Query: 172 ITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQ---GIFNGVCGTQ-LD 227
+ KIR L+ + + L+ + +G + + IQ G+F+G C + L+
Sbjct: 217 VAKIRGRKLVDQNNEAALEVAVAQQPVAVGMN-----VHPIQQHYKSGVFHGPCDPEDLN 271
Query: 228 HAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMK 262
HAVT+VG+G G YW++KNSWG WG+ GY +
Sbjct: 272 HAVTMVGYGAESGGRKYWIVKNSWGEKWGEKGYFR 306
>gi|410968392|ref|XP_003990691.1| PREDICTED: cathepsin S, partial [Felis catus]
Length = 310
Score = 142 bits (357), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 79/206 (38%), Positives = 121/206 (58%), Gaps = 14/206 (6%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIH-EKWMAQHGRSYKDELEKEMRLKIFKENLE 76
M ++ +L+ CA + + H+ ++ H W +G+ YK++ E+ R I+++NL+
Sbjct: 13 MKCLVWVLLVCAYALAQA---HKDPTLDHHWNLWKKTYGKQYKEKNEEVARRLIWEKNLK 69
Query: 77 YIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
++ N E G +Y LG N D+T++E +L ++PS R+ T Y++ S
Sbjct: 70 FVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVISLMGCLRVPSQWQRNVT-----YKSNSN 124
Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST 193
+P S+DWR+KG VT +K Q CG CWAF+AV A+E K+++GNL+ LS Q L+DCST
Sbjct: 125 QKLPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGNLVSLSAQNLVDCST 184
Query: 194 N--GNNGCLGGSREKAFAYIIQNQGI 217
GN GC GG +AF YII N GI
Sbjct: 185 EKYGNKGCNGGFMTEAFQYIIDNNGI 210
>gi|214015364|gb|ACJ62298.1| cysteine protease [Zea mays subsp. parviglumis]
Length = 252
Score = 142 bits (357), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 71/188 (37%), Positives = 112/188 (59%), Gaps = 17/188 (9%)
Query: 46 IHEKWMAQHGRSY-----------KDELEKEMRLKIFKENLEYIEKANKE---GNRTYKL 91
++E W ++HGR +E ++ +RL++F++NL YI+K N E G T++L
Sbjct: 1 MYEAWKSKHGRGGSSNDDCDMAPGDEEEDRRLRLEVFRDNLRYIDKHNAEADAGLHTFRL 60
Query: 92 GTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD--VPTSLDWRDKGAVT 149
G F+DLT DE+R G++ + + Y+ +P ++DWR GAVT
Sbjct: 61 GLTPFADLTLDEYRGRVLGFRARARRSGARYGHGHGYRARPRGGDLLPDAIDWRQLGAVT 120
Query: 150 PIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFA 209
+K+Q++CG CWAF+AVA +EG+ I +GNL+ LSEQ+++DC ++GC GG E AF
Sbjct: 121 EVKDQQQCGGCWAFSAVAGIEGVNAIATGNLVSLSEQEIIDCDAQ-DSGCDGGQMENAFR 179
Query: 210 YIIQNQGI 217
++I N GI
Sbjct: 180 FVIGNGGI 187
>gi|214015325|gb|ACJ62279.1| cysteine protease [Zea mays subsp. parviglumis]
Length = 251
Score = 142 bits (357), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 70/189 (37%), Positives = 113/189 (59%), Gaps = 20/189 (10%)
Query: 46 IHEKWMAQHGRSYKDE--------------LEKEMRLKIFKENLEYIEKANKE---GNRT 88
++E W ++HGR ++ ++ +RL++F++NL YI+K N E G T
Sbjct: 1 MYEAWKSKHGRGGSNDDDCDMAPGDEEQEEEDRRLRLEVFRDNLRYIDKHNAEADAGLHT 60
Query: 89 YKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAV 148
++LG F+DLT DE+R G++ + + + + +P ++DWR GAV
Sbjct: 61 FRLGLTPFADLTLDEYRGRVLGFRARGRRSGARYGHRARPRGGDL--LPDAIDWRQLGAV 118
Query: 149 TPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAF 208
T +K+Q++CG CWAF+AVAA+EG+ I +GNL+ LSEQ+++DC ++GC GG E AF
Sbjct: 119 TEVKDQQQCGGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ-DSGCDGGQMENAF 177
Query: 209 AYIIQNQGI 217
++I N GI
Sbjct: 178 RFVIGNGGI 186
>gi|119573902|gb|EAW53517.1| cathepsin S, isoform CRA_a [Homo sapiens]
Length = 220
Score = 142 bits (357), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 77/203 (37%), Positives = 120/203 (59%), Gaps = 14/203 (6%)
Query: 21 IITLLVSCASQVVSSRSTHEQSVVEIH-EKWMAQHGRSYKDELEKEMRLKIFKENLEYIE 79
++ +L+ C+S V H+ ++ H W +G+ YK++ E+ +R I+++NL+++
Sbjct: 4 LVCVLLVCSSAVAQ---LHKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLKFVM 60
Query: 80 KANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDV 136
N E G +Y LG N D+T++E +L + ++PS R+ T Y++ +
Sbjct: 61 LHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLRVPSQWQRNIT-----YKSNPNRIL 115
Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-- 194
P S+DWR+KG VT +K Q CG CWAF+AV A+E K+++G L+ LS Q L+DCST
Sbjct: 116 PDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKY 175
Query: 195 GNNGCLGGSREKAFAYIIQNQGI 217
GN GC GG AF YII N+GI
Sbjct: 176 GNKGCNGGFMTTAFQYIIDNKGI 198
>gi|328870624|gb|EGG18997.1| cysteine proteinase [Dictyostelium fasciculatum]
Length = 521
Score = 142 bits (357), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 76/201 (37%), Positives = 119/201 (59%), Gaps = 5/201 (2%)
Query: 19 FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
F+ + L V A Q ++ T +Q + WM ++ RSY+ E R +FK+N++Y+
Sbjct: 4 FLFVCLAVFMALQAANAAFTEKQ-YRDAFTNWMIKNDRSYQ-SAEFGNRFNVFKKNMDYV 61
Query: 79 EKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDV-P 137
+ N +G+ T L F+D++N+E++ +Y G K+ + + N + V
Sbjct: 62 NEWNSKGSETV-LDLTIFADISNEEYQRIYLGTKIDATQKLIDAARITMNNNFAAAPVFN 120
Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-TNGN 196
++DWR KGAVTPIKNQ +CG CW+F+ + EG + +GNL+ LSEQ L+DCS GN
Sbjct: 121 ATVDWRQKGAVTPIKNQGQCGSCWSFSTTGSTEGAHFLSTGNLVSLSEQNLVDCSGPEGN 180
Query: 197 NGCLGGSREKAFAYIIQNQGI 217
+GC GG ++AF YII+N+GI
Sbjct: 181 DGCNGGLMDQAFTYIIKNKGI 201
Score = 43.5 bits (101), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 20/41 (48%), Positives = 28/41 (68%), Gaps = 2/41 (4%)
Query: 243 NYWLIKNSWGNTWGDA-GYMKIVRDE-GLCGIGTRSSYPLA 281
NYW++KNSWG TWG+ GY+ + ++ CGI T +S P A
Sbjct: 481 NYWIVKNSWGTTWGNLNGYIYMSKNRNNNCGIATSASIPQA 521
>gi|290462225|gb|ADD24160.1| Cathepsin L [Lepeophtheirus salmonis]
Length = 334
Score = 142 bits (357), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 80/204 (39%), Positives = 120/204 (58%), Gaps = 16/204 (7%)
Query: 19 FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
+++++++S AS V + V+ E W H + Y +E+++RLKIF EN I
Sbjct: 6 ILLLSVIISTASAV-----SFFDVVLSDWESWKLTHQKGYDSSVEEKLRLKIFMENSLRI 60
Query: 79 EKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTF-KYQNLSMT 134
+ N E G TY + N + DL + EF A+ GY ++++T TF +N+++
Sbjct: 61 SRHNAEAIQGRHTYFMKMNHYGDLLHHEFVAMVNGYIY---NNKTTLGGTFIPSKNINL- 116
Query: 135 DVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN 194
P +DWR++GAVTP+KNQ +CG CW+F+A ++EG ++G LI LSEQ L+DCS
Sbjct: 117 --PEHVDWREEGAVTPVKNQGQCGSCWSFSATGSLEGQDFRKTGKLISLSEQNLVDCSRK 174
Query: 195 -GNNGCLGGSREKAFAYIIQNQGI 217
GNNGC GG + AF YI N GI
Sbjct: 175 YGNNGCEGGLMDYAFKYIQDNNGI 198
Score = 67.4 bits (163), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 29/58 (50%), Positives = 41/58 (70%), Gaps = 2/58 (3%)
Query: 226 LDHAVTIVGFGTTE-DGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPLA 281
LDH V VG+GT E G +YWL+KNSW WG+ GY+K+ R+ + +CGI + +SYP+
Sbjct: 277 LDHGVLAVGYGTDEVTGEDYWLVKNSWSEKWGEDGYIKMARNKDNMCGIASSASYPVV 334
>gi|214015333|gb|ACJ62283.1| cysteine protease [Zea mays subsp. parviglumis]
gi|214015335|gb|ACJ62284.1| cysteine protease [Zea mays subsp. parviglumis]
gi|214015337|gb|ACJ62285.1| cysteine protease [Zea mays subsp. parviglumis]
gi|214015341|gb|ACJ62287.1| cysteine protease [Zea mays subsp. parviglumis]
Length = 251
Score = 142 bits (357), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 70/189 (37%), Positives = 113/189 (59%), Gaps = 20/189 (10%)
Query: 46 IHEKWMAQHGRSYKDE--------------LEKEMRLKIFKENLEYIEKANKE---GNRT 88
++E W ++HGR ++ ++ +RL++F++NL YI+K N E G T
Sbjct: 1 MYEAWKSKHGRGGSNDDDCDMAPGDEEQEEEDRRLRLEVFRDNLRYIDKHNAEADAGLHT 60
Query: 89 YKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAV 148
++LG F+DLT DE+R G++ + + + + +P ++DWR GAV
Sbjct: 61 FRLGLTPFADLTLDEYRGRVLGFRARGRRSGARYGHRARPRGGDL--LPDAIDWRQLGAV 118
Query: 149 TPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAF 208
T +K+Q++CG CWAF+AVAA+EG+ I +GNL+ LSEQ+++DC ++GC GG E AF
Sbjct: 119 TEVKDQQQCGGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ-DSGCDGGQMENAF 177
Query: 209 AYIIQNQGI 217
++I N GI
Sbjct: 178 RFVIGNGGI 186
>gi|218185|dbj|BAA14404.1| oryzain gamma precursor [Oryza sativa Japonica Group]
Length = 362
Score = 142 bits (357), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 92/303 (30%), Positives = 138/303 (45%), Gaps = 77/303 (25%)
Query: 49 KWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALY 108
++ +HG+ Y D E + R +IF E+LE + N+ G Y+LG N+F+D++ +EF+A
Sbjct: 64 RFAVRHGKRYGDAAEVQRRFRIFSESLELVRSTNRRG-LPYRLGINRFADMSWEEFQASR 122
Query: 109 TGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAA 168
G + + + ++ +P + DWR+ G V+P+K+Q CG CW F+ +
Sbjct: 123 LG-----AAQNCSATLAGNHRMRDAPALPETKDWREDGIVSPVKDQGHCGSCWPFSTTGS 177
Query: 169 VEGITKIRSGNLIQLSEQQLLDCSTNGNN-GCLGGSREKAFAYIIQNQGI---------- 217
+E +G + LSEQQL DC+T NN GC GG +AF YI N G+
Sbjct: 178 LEARYTQATGPPVSLSEQQLADCATRYNNFGCSGGLPSQAFEYIKYNGGLDTEEAYPYTG 237
Query: 218 FNGVCG---------------------TQLDHAVTIV-----------GF---------- 235
NG+C +L +AV +V GF
Sbjct: 238 VNGICHYKPENAGVKVLDSVNITLVAEDELKNAVGLVRPVSVAFQVINGFRMYKSGVYTS 297
Query: 236 ---GTTEDGANY---------------WLIKNSWGNTWGDAGYMKIVRDEGLCGIGTRSS 277
GT+ N+ WLIKNSWG WGD GY + + +CGI T +S
Sbjct: 298 DHCGTSPMDVNHAVLAVGYGVENGVPYWLIKNSWGADWGDNGYFTMEMGKNMCGIATCAS 357
Query: 278 YPL 280
YP+
Sbjct: 358 YPI 360
>gi|156124996|gb|ABU50816.1| Ale o 1 allergen [Aleuroglyphus ovatus]
Length = 337
Score = 142 bits (357), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 70/174 (40%), Positives = 111/174 (63%), Gaps = 8/174 (4%)
Query: 48 EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEF 104
E++ + GR Y + R IF+ NL++I + N + G+ T+ + N F+DL+N+EF
Sbjct: 34 EQFKSTFGRVYPSPEIELHRKSIFRANLQFILRHNIDYFNGDSTFSVSVNNFTDLSNEEF 93
Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFA 164
RA + GY+ + + + + + + + +P ++DW KG VTPIKNQ++CG CWAF+
Sbjct: 94 RATFNGYRRLA----AVSLADSVHADNDVEALPATVDWTTKGVVTPIKNQQQCGSCWAFS 149
Query: 165 AVAAVEGITKIRSGNLIQLSEQQLLDCS-TNGNNGCLGGSREKAFAYIIQNQGI 217
AVA++EG +++G L+ LSEQ L+DCS G+ GC GG + AF Y+IQN+GI
Sbjct: 150 AVASMEGQHALKTGKLVSLSEQNLVDCSAAEGDMGCSGGWMDYAFKYVIQNRGI 203
Score = 67.4 bits (163), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 33/68 (48%), Positives = 47/68 (69%), Gaps = 4/68 (5%)
Query: 216 GIFNGV-CGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGI 272
G++N C T+ LDH VT VG+GT +G YW +KNSWG +WG GY+ + R+ + CGI
Sbjct: 270 GVYNEPDCSTEILDHGVTAVGYGTL-NGVPYWKVKNSWGTSWGQKGYIFMSRNKQNQCGI 328
Query: 273 GTRSSYPL 280
T++SYP+
Sbjct: 329 ATKASYPV 336
>gi|344953542|gb|AEN28617.1| cathepsin L-like cysteine protease [Epinephelus coioides]
Length = 336
Score = 142 bits (357), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 80/204 (39%), Positives = 118/204 (57%), Gaps = 12/204 (5%)
Query: 19 FIIITLLVSCASQVVSSRSTHEQSVVEIH-EKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
+ + ++ C S +S+ S Q ++ H E W + H + Y E E+ R ++++NL+
Sbjct: 1 MLPLAVVALCLSAALSAPSLDPQ--LDDHWELWKSWHSKKYH-EKEEGWRRMVWEKNLKK 57
Query: 78 IEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT 134
IE N E G +Y+LG N F D+T++EFR L GYK + + S F N
Sbjct: 58 IELHNLEHSMGTHSYRLGMNHFGDMTHEEFRQLMNGYKRKAET--KARGSLFLEPNF--L 113
Query: 135 DVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-T 193
+ P S+DWRD G VTP+K+Q +CG CWAF+ A+EG ++G L+ LSEQ L+DCS
Sbjct: 114 EAPKSVDWRDNGYVTPVKDQGQCGSCWAFSTTGALEGQHFRKTGKLVSLSEQNLVDCSRP 173
Query: 194 NGNNGCLGGSREKAFAYIIQNQGI 217
GN GC GG ++AF Y+ NQG+
Sbjct: 174 EGNEGCNGGLMDQAFQYVKDNQGL 197
Score = 65.5 bits (158), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 29/60 (48%), Positives = 39/60 (65%), Gaps = 4/60 (6%)
Query: 225 QLDHAVTIVGFG---TTEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 280
+LDH V +VG+G DG YW++KNSW WGD GY+ + +D + CGI T +SYPL
Sbjct: 276 ELDHGVLVVGYGFQGEDVDGKKYWIVKNSWSEKWGDKGYIYMAKDRKNHCGIATAASYPL 335
>gi|328868405|gb|EGG16783.1| cysteine protease 4 [Dictyostelium fasciculatum]
Length = 454
Score = 141 bits (356), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 75/200 (37%), Positives = 119/200 (59%), Gaps = 9/200 (4%)
Query: 19 FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
I++ L+++ + S+ S EQ + WM + GR Y E R +FK+N++Y+
Sbjct: 6 IILVVLMLA----IASASSFSEQQYRDSFTSWMQKQGRVYSSH-EFGARYNVFKKNMDYV 60
Query: 79 EKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPT 138
++ N +G+ T LG N F+D++N+E++ +Y G K+ + + +ST + +
Sbjct: 61 QEWNSKGSETV-LGLNVFADISNEEYQRIYLGTKVDGTARLAAAASTTMDRIYEVQ--AA 117
Query: 139 SLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST-NGNN 197
++DWR +GAVT IKNQ +CG CW+F+ + EG + + NL+ LSEQ L+DCST GN
Sbjct: 118 TVDWRQQGAVTAIKNQGQCGSCWSFSTTGSTEGAHFLSTKNLVSLSEQNLIDCSTAEGNQ 177
Query: 198 GCLGGSREKAFAYIIQNQGI 217
GC GG +AF YII+N GI
Sbjct: 178 GCNGGLMTQAFTYIIKNGGI 197
Score = 48.1 bits (113), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 19/38 (50%), Positives = 28/38 (73%), Gaps = 1/38 (2%)
Query: 243 NYWLIKNSWGNTWGDAGYMKIVRDE-GLCGIGTRSSYP 279
+YW++KNSWG TWG +GY+ + R+ CGI T +S+P
Sbjct: 415 DYWIVKNSWGTTWGYSGYIFMSRNRNNNCGIATMASFP 452
>gi|156124998|gb|ABU50817.1| Ale o 1 allergen [Aleuroglyphus ovatus]
Length = 337
Score = 141 bits (356), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 70/174 (40%), Positives = 111/174 (63%), Gaps = 8/174 (4%)
Query: 48 EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEF 104
E++ + GR Y + R IF+ NL++I + N + G+ T+ + N F+DL+N+EF
Sbjct: 34 EQFKSTFGRVYPSPEIELHRKSIFRANLQFILRHNIDYFNGDSTFSVSVNNFTDLSNEEF 93
Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFA 164
RA + GY+ + + + + + + + +P ++DW KG VTPIKNQ++CG CWAF+
Sbjct: 94 RATFNGYRRLA----AVSLADSVHADNDVEALPATVDWTTKGVVTPIKNQQQCGSCWAFS 149
Query: 165 AVAAVEGITKIRSGNLIQLSEQQLLDCS-TNGNNGCLGGSREKAFAYIIQNQGI 217
AVA++EG +++G L+ LSEQ L+DCS G+ GC GG + AF Y+IQN+GI
Sbjct: 150 AVASMEGQHALKTGKLVSLSEQNLVDCSAAEGDMGCSGGWMDYAFKYVIQNRGI 203
Score = 68.6 bits (166), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 34/68 (50%), Positives = 48/68 (70%), Gaps = 4/68 (5%)
Query: 216 GIFNGV-CGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGI 272
G++N C T+ LDH VT VG+GT +GA YW +KNSWG +WG GY+ + R+ + CGI
Sbjct: 270 GVYNEPDCSTEILDHGVTAVGYGTL-NGAPYWKVKNSWGTSWGRKGYIFMSRNKQNQCGI 328
Query: 273 GTRSSYPL 280
T++SYP+
Sbjct: 329 ATKASYPV 336
>gi|237844793|ref|XP_002371694.1| cathepsin L-like thiolproteinase, putative [Toxoplasma gondii ME49]
gi|50313163|gb|AAT74529.1| toxopain-2 [Toxoplasma gondii]
gi|89242977|gb|ABD64744.1| cathepsin L [Toxoplasma gondii]
gi|95007485|emb|CAJ20707.1| toxopain-2 [Toxoplasma gondii RH]
gi|211969358|gb|EEB04554.1| cathepsin L-like thiolproteinase, putative [Toxoplasma gondii ME49]
gi|221480879|gb|EEE19300.1| cysteine protease, putative [Toxoplasma gondii GT1]
gi|221501596|gb|EEE27366.1| cysteine protease, putative [Toxoplasma gondii VEG]
Length = 422
Score = 141 bits (356), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 69/169 (40%), Positives = 99/169 (58%), Gaps = 2/169 (1%)
Query: 50 WMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYT 109
+ A + +SY E EK+ R IFK NL YI N++G +Y L N F DL+ DEFR Y
Sbjct: 120 FQAMYAKSYATEEEKQRRYAIFKNNLVYIHTHNQQGY-SYSLKMNHFGDLSRDEFRRKYL 178
Query: 110 GYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAV 169
G+K + N+ +++P +DWR +G VTP+K+Q++CG CWAF+ A+
Sbjct: 179 GFKKSRNLKSHHLGVATELLNVLPSELPAGVDWRSRGCVTPVKDQRDCGSCWAFSTTGAL 238
Query: 170 EGITKIRSGNLIQLSEQQLLDCS-TNGNNGCLGGSREKAFAYIIQNQGI 217
EG ++G L+ LSEQ+L+DCS GN C GG AF Y++ + GI
Sbjct: 239 EGAHCAKTGKLVSLSEQELMDCSRAEGNQSCSGGEMNDAFQYVLDSGGI 287
Score = 72.4 bits (176), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 31/71 (43%), Positives = 47/71 (66%), Gaps = 4/71 (5%)
Query: 214 NQGIFNGVCGTQLDHAVTIVGFGT-TEDGANYWLIKNSWGNTWGDAGYMKIVR---DEGL 269
++G+F+ CGT LDH V +VG+GT E ++W++KNSWG WG GYM + +EG
Sbjct: 351 HEGVFDASCGTDLDHGVLLVGYGTDKESKKDFWIMKNSWGTGWGRDGYMYMAMHKGEEGQ 410
Query: 270 CGIGTRSSYPL 280
CG+ +S+P+
Sbjct: 411 CGLLLDASFPV 421
>gi|332220183|ref|XP_003259237.1| PREDICTED: cathepsin S isoform 1 [Nomascus leucogenys]
Length = 331
Score = 141 bits (356), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 78/206 (37%), Positives = 121/206 (58%), Gaps = 14/206 (6%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIH-EKWMAQHGRSYKDELEKEMRLKIFKENLE 76
M ++ +L+ C+S V H+ ++ H W +G+ YK++ E+ +R I+++NL+
Sbjct: 1 MKWLVCVLLVCSSAVAQ---LHKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLK 57
Query: 77 YIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
++ N E G +Y LG N D+T++E +L + ++PS R+ T Y++
Sbjct: 58 FVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLRVPSQWQRNIT-----YKSNPN 112
Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST 193
+P S+DWR+KG VT +K Q CG CWAF+AV A+E K+++G L+ LS Q L+DCST
Sbjct: 113 QILPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCST 172
Query: 194 N--GNNGCLGGSREKAFAYIIQNQGI 217
GN GC GG AF YII N+GI
Sbjct: 173 EKYGNKGCNGGFMTTAFQYIIDNKGI 198
>gi|33112583|gb|AAP94047.1| cathepsin-L-like cysteine peptidase 03 [Tenebrio molitor]
Length = 337
Score = 141 bits (356), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 79/207 (38%), Positives = 115/207 (55%), Gaps = 15/207 (7%)
Query: 19 FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMA---QHGRSYKDELEKEMRLKIFKENL 75
F+I + SQ VS + E+W A H + Y+ + E+ R+KIF EN
Sbjct: 3 FLIFLAICVAGSQAVSFFDL-------VQEQWGAFKMTHNKQYQSDTEERFRMKIFMENS 55
Query: 76 EYIEKANK---EGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSS-TFKYQNL 131
+ K NK +G ++KLG N+++D+ + EF + G+ RS S + +
Sbjct: 56 HTVAKHNKLYAQGLVSFKLGINKYADMLHHEFVQVLNGFNRTKSGLRSGESDDSVTFLPP 115
Query: 132 SMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDC 191
+ +P +DWRDKGAVTP+K+Q +CG CW+F+A ++EG +SG L+ LSEQ L+DC
Sbjct: 116 ANVQLPGQIDWRDKGAVTPVKDQGQCGSCWSFSATGSLEGQHFRKSGKLVSLSEQNLVDC 175
Query: 192 STN-GNNGCLGGSREKAFAYIIQNQGI 217
S GNNGC GG + AF YI N GI
Sbjct: 176 SEKFGNNGCNGGLMDNAFRYIKANGGI 202
Score = 84.3 bits (207), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 34/58 (58%), Positives = 46/58 (79%), Gaps = 1/58 (1%)
Query: 224 TQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 280
+QLDH V +VG+GT +DG +YWL+KNSWG +WGD GY+K+ R+ + CGI T +SYPL
Sbjct: 279 SQLDHGVLVVGYGTEDDGTDYWLVKNSWGKSWGDQGYIKMARNRDNNCGIATEASYPL 336
>gi|195056367|ref|XP_001995082.1| GH22826 [Drosophila grimshawi]
gi|193899288|gb|EDV98154.1| GH22826 [Drosophila grimshawi]
Length = 340
Score = 141 bits (356), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 80/209 (38%), Positives = 114/209 (54%), Gaps = 17/209 (8%)
Query: 19 FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMA---QHGRSYKDELEKEMRLKIFKENL 75
+I L + +Q VS I E+W +H + Y+DE E+ RLKIF EN
Sbjct: 4 YIFALLALVAVAQAVSFADV-------IKEEWQTFKLEHRKQYQDETEERFRLKIFNENK 56
Query: 76 EYIEKANK---EGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFK---YQ 129
I K N+ G ++K+G N+++D+ + EF G+ + +TF +
Sbjct: 57 HKIAKHNQLYAAGEVSFKMGLNKYADMLHHEFHETMNGFNYTLHKQLRASDATFTGVTFI 116
Query: 130 NLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLL 189
+ +P S+DWR+KGAVT +K+Q CG CWAF++ A+EG ++G LI LSEQ L+
Sbjct: 117 SPEHVKLPQSVDWRNKGAVTGVKDQGHCGSCWAFSSTGALEGQHFRKTGTLISLSEQNLV 176
Query: 190 DCSTN-GNNGCLGGSREKAFAYIIQNQGI 217
DCST GNNGC GG + AF YI N GI
Sbjct: 177 DCSTKYGNNGCNGGLMDNAFRYIKDNGGI 205
>gi|426331346|ref|XP_004026643.1| PREDICTED: cathepsin S [Gorilla gorilla gorilla]
Length = 220
Score = 141 bits (356), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 77/203 (37%), Positives = 120/203 (59%), Gaps = 14/203 (6%)
Query: 21 IITLLVSCASQVVSSRSTHEQSVVEIH-EKWMAQHGRSYKDELEKEMRLKIFKENLEYIE 79
++ +L+ C+S V H+ ++ H W +G+ YK++ E+ +R I+++NL+++
Sbjct: 4 LVCVLLVCSSAVAQ---LHKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLKFVM 60
Query: 80 KANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDV 136
N E G +Y LG N D+T++E +L + ++PS R+ T Y++ +
Sbjct: 61 LHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLRVPSQWQRNIT-----YKSNPNRIL 115
Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-- 194
P S+DWR+KG VT +K Q CG CWAF+AV A+E K+++G L+ LS Q L+DCST
Sbjct: 116 PDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKY 175
Query: 195 GNNGCLGGSREKAFAYIIQNQGI 217
GN GC GG AF YII N+GI
Sbjct: 176 GNKGCNGGFMTTAFQYIIDNKGI 198
>gi|33112581|gb|AAP94046.1| cathepsin-L-like cysteine peptidase 02 [Tenebrio molitor]
Length = 337
Score = 141 bits (356), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 79/207 (38%), Positives = 115/207 (55%), Gaps = 15/207 (7%)
Query: 19 FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMA---QHGRSYKDELEKEMRLKIFKENL 75
F+I + SQ VS + E+W A H + Y+ + E+ R+KIF EN
Sbjct: 3 FLIFLAICVAGSQAVSFFDL-------VQEQWGAFKMTHNKQYQSDTEERFRMKIFMENS 55
Query: 76 EYIEKANK---EGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSS-TFKYQNL 131
+ K NK +G ++KLG N+++D+ + EF + G+ RS S + +
Sbjct: 56 HTVAKHNKLYAQGLVSFKLGINKYADMLHHEFVQVLNGFNRTKSGLRSGESDDSVTFLPP 115
Query: 132 SMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDC 191
+ +P +DWRDKGAVTP+K+Q +CG CW+F+A ++EG +SG L+ LSEQ L+DC
Sbjct: 116 ANVQLPGQIDWRDKGAVTPVKDQGQCGSCWSFSATGSLEGQHFRKSGKLVSLSEQNLVDC 175
Query: 192 STN-GNNGCLGGSREKAFAYIIQNQGI 217
S GNNGC GG + AF YI N GI
Sbjct: 176 SEKFGNNGCNGGLMDNAFRYIKANGGI 202
Score = 84.3 bits (207), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 34/58 (58%), Positives = 46/58 (79%), Gaps = 1/58 (1%)
Query: 224 TQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 280
+QLDH V +VG+GT +DG +YWL+KNSWG +WGD GY+K+ R+ + CGI T +SYPL
Sbjct: 279 SQLDHGVLVVGYGTEDDGTDYWLVKNSWGKSWGDQGYIKMARNRDNNCGIATEASYPL 336
>gi|114559418|ref|XP_001171268.1| PREDICTED: cathepsin S isoform 3 [Pan troglodytes]
gi|397492866|ref|XP_003817341.1| PREDICTED: cathepsin S isoform 1 [Pan paniscus]
gi|410225070|gb|JAA09754.1| cathepsin S [Pan troglodytes]
gi|410251608|gb|JAA13771.1| cathepsin S [Pan troglodytes]
gi|410328325|gb|JAA33109.1| cathepsin S [Pan troglodytes]
gi|410328327|gb|JAA33110.1| cathepsin S [Pan troglodytes]
Length = 331
Score = 141 bits (356), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 78/206 (37%), Positives = 121/206 (58%), Gaps = 14/206 (6%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIH-EKWMAQHGRSYKDELEKEMRLKIFKENLE 76
M ++ +L+ C+S V H+ ++ H W +G+ YK++ E+ +R I+++NL+
Sbjct: 1 MKRLVCVLLVCSSAVAQ---LHKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLK 57
Query: 77 YIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
++ N E G +Y LG N D+T++E +L + ++PS R+ T Y++
Sbjct: 58 FVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLRVPSQWQRNIT-----YKSNPN 112
Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST 193
+P S+DWR+KG VT +K Q CG CWAF+AV A+E K+++G L+ LS Q L+DCST
Sbjct: 113 QILPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCST 172
Query: 194 N--GNNGCLGGSREKAFAYIIQNQGI 217
GN GC GG AF YII N+GI
Sbjct: 173 EKYGNKGCNGGFMTTAFQYIIDNKGI 198
Score = 62.4 bits (150), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 27/73 (36%), Positives = 47/73 (64%), Gaps = 3/73 (4%)
Query: 209 AYIIQNQGIF-NGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE 267
++ + G++ C ++H V +VG+G +G YWL+KNSWG+ +G+ GY+++ R++
Sbjct: 258 SFFLYRSGVYYEPSCTQNVNHGVLVVGYGDL-NGKEYWLVKNSWGHNFGEEGYIRMARNK 316
Query: 268 G-LCGIGTRSSYP 279
G CGI + SYP
Sbjct: 317 GNHCGIASFPSYP 329
>gi|76574394|gb|ABA46967.1| cysteine protease Mir1 [Zea diploperennis]
Length = 256
Score = 141 bits (356), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 74/191 (38%), Positives = 116/191 (60%), Gaps = 20/191 (10%)
Query: 46 IHEKWMAQHGR------------SYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYK 90
I+E W ++HGR ++E ++ +RL++F++NL YI+ N E G T++
Sbjct: 1 IYEAWKSKHGRGGSNDDCDMAPGDEEEEDDRRLRLEVFRDNLRYIDAHNAEADAGLHTFR 60
Query: 91 LGTNQFSDLTNDEFRALYTGYKMPSPSHRSTT----SSTFKYQNLSMTDVPTSLDWRDKG 146
LG F+DLT DE+R G++ + S RS ++ + +P ++DWR G
Sbjct: 61 LGLTPFADLTLDEYRGRVLGFRARARSRRSGARYGHGHGYRARPRGGDLLPDAIDWRQLG 120
Query: 147 AVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREK 206
AVT +K+Q++CG CWAF+AVAA+EG+ I +GNL+ LSEQ+++DC ++GC GG E
Sbjct: 121 AVTEVKDQQQCGGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ-DSGCDGGQMEN 179
Query: 207 AFAYIIQNQGI 217
AF ++I N GI
Sbjct: 180 AFRFVIGNGGI 190
>gi|348505824|ref|XP_003440460.1| PREDICTED: pro-cathepsin H-like [Oreochromis niloticus]
Length = 324
Score = 141 bits (356), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 86/203 (42%), Positives = 117/203 (57%), Gaps = 16/203 (7%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEK-WMAQHGRSYKDELEKEMRLKIFKENLE 76
MF+ L A+ V S+ EQ E H K WMAQ+ + Y + E RL+IF EN +
Sbjct: 1 MFVFHVLF---AASVTSAFRLSEQD--EFHFKSWMAQYNKEYNLK-EYYQRLQIFTENKK 54
Query: 77 YIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDV 136
I+K N EGN ++ +G N+FSD+T EFR + M P + S T + N +
Sbjct: 55 RIDKHN-EGNHSFTMGLNEFSDMTFSEFRK---SFLMSEPQNCSATKGNYFSSNGLL--- 107
Query: 137 PTSLDWRDKG-AVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG 195
P S+DWR KG VTP+KNQ CG CW F+ +E +T I G L+ LSEQQL+DC+ +
Sbjct: 108 PDSIDWRKKGNYVTPVKNQGGCGSCWTFSTTGCLESVTAINKGKLVPLSEQQLVDCAQDF 167
Query: 196 NN-GCLGGSREKAFAYIIQNQGI 217
NN GC GG +AF YI+ N+G+
Sbjct: 168 NNHGCNGGLPSQAFEYIMYNKGL 190
>gi|23110962|ref|NP_004070.3| cathepsin S isoform 1 preproprotein [Homo sapiens]
gi|88984046|sp|P25774.3|CATS_HUMAN RecName: Full=Cathepsin S; Flags: Precursor
gi|60816153|gb|AAX36372.1| cathepsin S [synthetic construct]
gi|61358282|gb|AAX41541.1| cathepsin S [synthetic construct]
gi|119573903|gb|EAW53518.1| cathepsin S, isoform CRA_b [Homo sapiens]
gi|119573904|gb|EAW53519.1| cathepsin S, isoform CRA_b [Homo sapiens]
Length = 331
Score = 141 bits (356), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 78/206 (37%), Positives = 121/206 (58%), Gaps = 14/206 (6%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIH-EKWMAQHGRSYKDELEKEMRLKIFKENLE 76
M ++ +L+ C+S V H+ ++ H W +G+ YK++ E+ +R I+++NL+
Sbjct: 1 MKRLVCVLLVCSSAVAQ---LHKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLK 57
Query: 77 YIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
++ N E G +Y LG N D+T++E +L + ++PS R+ T Y++
Sbjct: 58 FVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLRVPSQWQRNIT-----YKSNPN 112
Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST 193
+P S+DWR+KG VT +K Q CG CWAF+AV A+E K+++G L+ LS Q L+DCST
Sbjct: 113 RILPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCST 172
Query: 194 N--GNNGCLGGSREKAFAYIIQNQGI 217
GN GC GG AF YII N+GI
Sbjct: 173 EKYGNKGCNGGFMTTAFQYIIDNKGI 198
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 28/78 (35%), Positives = 50/78 (64%), Gaps = 3/78 (3%)
Query: 203 SREKAFAYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMK 262
+R +F ++ ++ + C ++H V +VG+G +G YWL+KNSWG+ +G+ GY++
Sbjct: 254 ARHPSF-FLYRSGVYYEPSCTQNVNHGVLVVGYGDL-NGKEYWLVKNSWGHNFGEEGYIR 311
Query: 263 IVRDEG-LCGIGTRSSYP 279
+ R++G CGI + SYP
Sbjct: 312 MARNKGNHCGIASFPSYP 329
>gi|61368403|gb|AAX43172.1| cathepsin S [synthetic construct]
Length = 332
Score = 141 bits (356), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 78/206 (37%), Positives = 121/206 (58%), Gaps = 14/206 (6%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIH-EKWMAQHGRSYKDELEKEMRLKIFKENLE 76
M ++ +L+ C+S V H+ ++ H W +G+ YK++ E+ +R I+++NL+
Sbjct: 1 MKRLVCVLLVCSSAVAQ---LHKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLK 57
Query: 77 YIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
++ N E G +Y LG N D+T++E +L + ++PS R+ T Y++
Sbjct: 58 FVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLRVPSQWQRNIT-----YKSNPN 112
Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST 193
+P S+DWR+KG VT +K Q CG CWAF+AV A+E K+++G L+ LS Q L+DCST
Sbjct: 113 RILPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCST 172
Query: 194 N--GNNGCLGGSREKAFAYIIQNQGI 217
GN GC GG AF YII N+GI
Sbjct: 173 EKYGNKGCNGGFMTTAFQYIIDNKGI 198
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 28/78 (35%), Positives = 50/78 (64%), Gaps = 3/78 (3%)
Query: 203 SREKAFAYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMK 262
+R +F ++ ++ + C ++H V +VG+G +G YWL+KNSWG+ +G+ GY++
Sbjct: 254 ARHPSF-FLYRSGVYYEPSCTQNVNHGVLVVGYGDL-NGKEYWLVKNSWGHNFGEEGYIR 311
Query: 263 IVRDEG-LCGIGTRSSYP 279
+ R++G CGI + SYP
Sbjct: 312 MARNKGNHCGIASFPSYP 329
>gi|387015022|gb|AFJ49630.1| Cathepsin L1-like [Crotalus adamanteus]
Length = 338
Score = 141 bits (356), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 77/172 (44%), Positives = 104/172 (60%), Gaps = 8/172 (4%)
Query: 50 WMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEFRA 106
W + H + Y E E+ R I+++NL+ IE N + G +Y+LG N F D+TN+EFR
Sbjct: 31 WKSWHSKKYH-EKEEGWRRMIWEKNLKMIELHNLDHSLGKHSYRLGMNHFGDMTNEEFRQ 89
Query: 107 LYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAV 166
+ G+K S S R S F N P S+DWR+KG VTP+K+Q +CG CWAF+A
Sbjct: 90 VMNGFKQ-SRSQRKYKGSQFLEPNF--LQAPKSVDWREKGYVTPVKDQGQCGSCWAFSAT 146
Query: 167 AAVEGITKIRSGNLIQLSEQQLLDCS-TNGNNGCLGGSREKAFAYIIQNQGI 217
A+EG ++G L+ LSEQ L+DCS GN GC GG ++AF YI N GI
Sbjct: 147 GALEGQHFRKTGKLVSLSEQNLIDCSGPEGNQGCNGGLMDQAFQYIKDNNGI 198
Score = 59.7 bits (143), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 27/61 (44%), Positives = 39/61 (63%), Gaps = 5/61 (8%)
Query: 225 QLDHAVTIVGFG---TTEDGAN-YWLIKNSWGNTWGDAGYMKIVRDEG-LCGIGTRSSYP 279
+LDH V +VG+G T +D YW++KNSW WGD GY+ + +D CGI + +SYP
Sbjct: 277 ELDHGVLVVGYGYEGTDDDNKKRYWIVKNSWSEKWGDQGYIHMAKDRSNNCGIASAASYP 336
Query: 280 L 280
+
Sbjct: 337 M 337
>gi|357446993|ref|XP_003593772.1| Cysteine proteinase [Medicago truncatula]
gi|355482820|gb|AES64023.1| Cysteine proteinase [Medicago truncatula]
Length = 339
Score = 141 bits (356), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 78/183 (42%), Positives = 108/183 (59%), Gaps = 12/183 (6%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN--KEGNRTYKLGTNQFS 97
+ +EI + WM +HGR YKD E + IF NL+YI + N ++ + + LG F+
Sbjct: 11 QDKTIEIFQLWMKEHGRVYKDLDEMAKKFDIFISNLKYITETNAKRKSSNGFLLGLTNFT 70
Query: 98 DLTNDEFRALYT-GYKMPSPSHRSTTSSTFKYQN--LSMTDVPTSLDWRDKGAVTPIKNQ 154
D +++EF+ Y MP T T K + LS P+SLDWR KG V+ IK+Q
Sbjct: 71 DWSSEEFQERYLHNIDMP------TDIDTMKVNDVHLSSCSAPSSLDWRSKGVVSDIKDQ 124
Query: 155 KECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQN 214
K CG CWAF+AV A+EGI I +G LI LSEQ+LLDC + GC G KAF ++I+N
Sbjct: 125 KNCGSCWAFSAVGAIEGINAITTGKLINLSEQELLDCDP-ISGGCNSGWVNKAFDWVIRN 183
Query: 215 QGI 217
+G+
Sbjct: 184 KGV 186
Score = 55.1 bits (131), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 47/159 (29%), Positives = 76/159 (47%), Gaps = 22/159 (13%)
Query: 136 VPTSLDW--RDKGAVT----PIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLL 189
V + DW R+KG P +K G C A+ I+ I + + ++ S+Q LL
Sbjct: 173 VNKAFDWVIRNKGVALDNDYPYTAEK--GVC--KASQIPNSAISSINTYHHVEQSDQGLL 228
Query: 190 DCSTNGNNGCLGGSREKAFAYIIQNQGIFNG----VCGTQLDHAVTIVGFGTTEDGANYW 245
C+ + + F + + GI++G V +H V IVG+ + DG +YW
Sbjct: 229 -CAVAKQPVSVCLYAPQDFHHY--SSGIYDGPNCPVNSKDTNHCVLIVGYDSV-DGQDYW 284
Query: 246 LIKNSWGNTWGDAGYMKIVRDE----GLCGIGTRSSYPL 280
++KN WG +WG GYM I R+ G+C I + + P+
Sbjct: 285 IVKNQWGTSWGMEGYMHIKRNTNKKYGVCAINSWAYNPV 323
>gi|179957|gb|AAC37592.1| cathepsin S [Homo sapiens]
Length = 331
Score = 141 bits (355), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 78/206 (37%), Positives = 121/206 (58%), Gaps = 14/206 (6%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIH-EKWMAQHGRSYKDELEKEMRLKIFKENLE 76
M ++ +L+ C+S V H+ ++ H W +G+ YK++ E+ +R I+++NL+
Sbjct: 1 MKRLVCVLLVCSSAVAQ---LHKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLK 57
Query: 77 YIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
++ N E G +Y LG N D+T++E +L + ++PS R+ T Y++
Sbjct: 58 FVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLRVPSQWQRNIT-----YKSNPN 112
Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST 193
+P S+DWR+KG VT +K Q CG CWAF+AV A+E K+++G L+ LS Q L+DCST
Sbjct: 113 RILPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCST 172
Query: 194 N--GNNGCLGGSREKAFAYIIQNQGI 217
GN GC GG AF YII N+GI
Sbjct: 173 EKYGNKGCNGGFMTTAFQYIIDNKGI 198
Score = 62.4 bits (150), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 28/78 (35%), Positives = 50/78 (64%), Gaps = 3/78 (3%)
Query: 203 SREKAFAYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMK 262
+R +F ++ ++ + C ++H V +VG+G +G YWL+KNSWG+ +G+ GY++
Sbjct: 254 ARHPSF-FLYRSGVYYEPSCTQNVNHGVLVVGYGDL-NGKEYWLVKNSWGHNFGEEGYIR 311
Query: 263 IVRDEG-LCGIGTRSSYP 279
+ R++G CGI + SYP
Sbjct: 312 MARNKGNHCGIASFPSYP 329
>gi|297663703|ref|XP_002810310.1| PREDICTED: LOW QUALITY PROTEIN: cathepsin S [Pongo abelii]
Length = 330
Score = 141 bits (355), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 78/206 (37%), Positives = 121/206 (58%), Gaps = 14/206 (6%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIH-EKWMAQHGRSYKDELEKEMRLKIFKENLE 76
M ++ +L+ C+S V H+ ++ H W +G+ YK++ E+ +R I+++NL+
Sbjct: 1 MKQLVCVLLVCSSAVAQ---LHKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLK 57
Query: 77 YIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
++ N E G +Y LG N D+T++E +L + ++PS R+ T Y++
Sbjct: 58 FVMIHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLRVPSQWQRNIT-----YKSNPN 112
Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST 193
+P S+DWR+KG VT +K Q CG CWAF+AV A+E K+++G L+ LS Q L+DCST
Sbjct: 113 RILPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCST 172
Query: 194 N--GNNGCLGGSREKAFAYIIQNQGI 217
GN GC GG AF YII N+GI
Sbjct: 173 EKYGNKGCNGGFMTTAFQYIIDNKGI 198
Score = 60.5 bits (145), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 27/78 (34%), Positives = 49/78 (62%), Gaps = 3/78 (3%)
Query: 203 SREKAFAYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMK 262
+R +F ++ ++ + C ++H V +VG+G +G YWL+KNSWG +G+ GY++
Sbjct: 253 ARHPSF-FLYRSGVYYEPSCTQNVNHGVLVVGYGDL-NGKEYWLVKNSWGRNFGEEGYIR 310
Query: 263 IVRDEG-LCGIGTRSSYP 279
+ R++G CGI + S+P
Sbjct: 311 MARNKGNHCGIASFPSFP 328
>gi|254674508|dbj|BAH86062.1| cysteine protease [Haemaphysalis longicornis]
Length = 333
Score = 141 bits (355), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 75/204 (36%), Positives = 115/204 (56%), Gaps = 10/204 (4%)
Query: 19 FIIITLLVSCASQVVSSRSTHEQSVVEIH-EKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
+ + L C + +++ S Q ++ E + +QH ++Y +E+ +R KIF EN
Sbjct: 1 MLRLAFLCGCVAAAIAASS---QEILRTEWEAFKSQHNKAYSSHVEELLRFKIFTENTLL 57
Query: 78 IEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT 134
+ K N + G +YKL N+F DL EF + GY+ ++ + NL+ +
Sbjct: 58 VAKHNAKYAKGLVSYKLAMNKFGDLLPHEFAKMVNGYR--GKQNKEQRPTFIPPANLNDS 115
Query: 135 DVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN 194
+PT++DWR KGAVTP+KNQ +CG CWAF+ ++EG ++G L+ LSEQ L+DCS +
Sbjct: 116 SLPTTVDWRKKGAVTPVKNQGQCGSCWAFSTTGSLEGQHFRKTGKLVSLSEQNLVDCSDD 175
Query: 195 -GNNGCLGGSREKAFAYIIQNQGI 217
GN GC GG + F YI N GI
Sbjct: 176 FGNQGCNGGLMDNGFQYIKANGGI 199
Score = 71.2 bits (173), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 33/75 (44%), Positives = 50/75 (66%), Gaps = 4/75 (5%)
Query: 209 AYIIQNQGIFN--GVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD 266
++ + +QG+++ +QLDH V VG+G ++G YWL+KNSWG WGD GY+ + RD
Sbjct: 259 SFQLYSQGVYDEPDCSSSQLDHGVLTVGYGV-KNGKKYWLVKNSWGGDWGDNGYILMSRD 317
Query: 267 -EGLCGIGTRSSYPL 280
+ CGI + +SYPL
Sbjct: 318 KDNQCGIASSASYPL 332
>gi|164472556|gb|ABY58967.1| cathepsin L [Toxoplasma gondii]
Length = 421
Score = 141 bits (355), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 69/169 (40%), Positives = 99/169 (58%), Gaps = 2/169 (1%)
Query: 50 WMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYT 109
+ A + +SY E EK+ R IFK NL YI N++G +Y L N F DL+ DEFR Y
Sbjct: 119 FQAMYAKSYATEEEKQRRYAIFKNNLVYIHTHNQQGY-SYSLKMNHFGDLSRDEFRRKYL 177
Query: 110 GYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAV 169
G+K + N+ +++P +DWR +G VTP+K+Q++CG CWAF+ A+
Sbjct: 178 GFKKSRNLKSHHLGVATELLNVLPSELPAGVDWRSRGCVTPVKDQRDCGSCWAFSTTGAL 237
Query: 170 EGITKIRSGNLIQLSEQQLLDCS-TNGNNGCLGGSREKAFAYIIQNQGI 217
EG ++G L+ LSEQ+L+DCS GN C GG AF Y++ + GI
Sbjct: 238 EGAHCAKTGKLVSLSEQELMDCSRAEGNQSCSGGEMNDAFQYVLDSGGI 286
Score = 72.0 bits (175), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 31/75 (41%), Positives = 48/75 (64%), Gaps = 4/75 (5%)
Query: 210 YIIQNQGIFNGVCGTQLDHAVTIVGFGT-TEDGANYWLIKNSWGNTWGDAGYMKIVR--- 265
+ ++G+F+ CGT LDH V +VG+GT E ++W++KNSWG WG GYM +
Sbjct: 346 FQFYHEGVFDASCGTDLDHGVLLVGYGTDKESKKDFWIMKNSWGTGWGRDGYMYMAMHKG 405
Query: 266 DEGLCGIGTRSSYPL 280
+EG CG+ +S+P+
Sbjct: 406 EEGQCGLLLDASFPV 420
>gi|332376957|gb|AEE63618.1| unknown [Dendroctonus ponderosae]
Length = 318
Score = 141 bits (355), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 79/198 (39%), Positives = 113/198 (57%), Gaps = 18/198 (9%)
Query: 19 FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
FI+ +LL+ + + + QS + +H +SY +++E+ RL IF ENL I
Sbjct: 4 FILASLLIVAVGASLENVGSTFQS-------FKLKHSKSYSNQVEEAKRLAIFTENLRDI 56
Query: 79 EKANK---EGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD 135
E+ N G +Y NQF+DLT DEF+A T + P T +T Y +
Sbjct: 57 EEHNALYAAGLVSYNKSVNQFTDLTIDEFKAYLTLHSKP-------TLNTVPYVRTGL-Q 108
Query: 136 VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG 195
VPT+LDWR +G VT +K+Q +CG CWAF+ V + EG +G L+ LSEQQL+DC+TN
Sbjct: 109 VPTTLDWRSQGYVTGVKDQGDCGSCWAFSVVGSTEGAYYKSTGKLVSLSEQQLIDCTTNV 168
Query: 196 NNGCLGGSREKAFAYIIQ 213
N+GC GG E+ F Y+ Q
Sbjct: 169 NDGCDGGYLEETFPYVQQ 186
Score = 75.1 bits (183), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 31/55 (56%), Positives = 41/55 (74%), Gaps = 1/55 (1%)
Query: 226 LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDEGLCGIGTRSSYPL 280
L+H V +VG+GT +DG +YWLIKNSWGNTWG+ GY+K++R CGI YP+
Sbjct: 264 LNHGVLVVGYGT-QDGKDYWLIKNSWGNTWGEQGYLKLLRGTNECGIAEDDVYPI 317
>gi|75994612|gb|ABA33827.1| cysteine protease Mir1 [Zea mays subsp. parviglumis]
Length = 254
Score = 141 bits (355), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 71/189 (37%), Positives = 113/189 (59%), Gaps = 18/189 (9%)
Query: 46 IHEKWMAQHGRSYKD------------ELEKEMRLKIFKENLEYIEKANKE---GNRTYK 90
++E W ++HGR ++ E ++ +RL++F++NL YI+K N E G T++
Sbjct: 1 MYEAWKSKHGRGGRNDDYDMAPGDEEQEEDRRLRLEVFRDNLRYIDKHNAEADAGLHTFR 60
Query: 91 LGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD--VPTSLDWRDKGAV 148
LG F+DLT +E+R G++ + Y+ +P ++DWR GAV
Sbjct: 61 LGLTPFADLTLEEYRGRVLGFRARGRRSGARYGHGHGYRVRPRGGDLLPDAIDWRQLGAV 120
Query: 149 TPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAF 208
T +K+Q++CG CWAF+AVAA+EG+ I +GNL+ LSEQ+++DC ++GC GG E AF
Sbjct: 121 TEVKDQQQCGGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ-DSGCDGGQMENAF 179
Query: 209 AYIIQNQGI 217
++I N GI
Sbjct: 180 RFVIGNGGI 188
>gi|260516678|gb|ACX43965.1| cysteine protease 1 [Brachiaria hybrid cultivar]
Length = 338
Score = 141 bits (355), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 97/318 (30%), Positives = 145/318 (45%), Gaps = 74/318 (23%)
Query: 30 SQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTY 89
S + S E + ++ +M Q+ ++Y E R FK N+E I N N +Y
Sbjct: 25 SALFSEEVPSEVMLQDMFTAFMKQYSKAYS-HAEFSSRFNQFKANVETIRLHNTLANASY 83
Query: 90 KLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVT 149
+G N+F+DL+ +EF+ Y GYK R S +Q + PTS+DWR AVT
Sbjct: 84 TMGLNEFADLSFEEFKGKYFGYKHV---EREFARSNNLHQEVEA--APTSIDWRTSNAVT 138
Query: 150 PIKNQKECGCCWAFAAVAAVEGITKIRSGN-LIQLSEQQLLDCSTN-GNNGCLGGSREKA 207
PIK+Q +CG CWAF+A ++EG ++ + L LSEQQL+DCST+ G+ GC GG + A
Sbjct: 139 PIKDQGQCGSCWAFSATGSIEGAWVLQGKHTLTSLSEQQLVDCSTSYGDAGCNGGLMDYA 198
Query: 208 FAYIIQNQGI----------FNGVCGTQLDHAVTIVGF---------------------- 235
F YII N+GI G+C VTI G+
Sbjct: 199 FEYIIANKGICAESAYPYKGVGGLCQKSCTKVVTISGYKDVASGDEASLLNAVGTVGPVS 258
Query: 236 -------------------GTTEDGANYWLIKNSWGNT---------------WGDAGYM 261
GT ++ ++ +G T WG++GY+
Sbjct: 259 VAIEADQAGFQFYSSGVFSGTCGHNLDHGVLAVGYGTTGSQDYWIVKNSWGTSWGESGYI 318
Query: 262 KIVRDEGLCGIGTRSSYP 279
+++R++ CGI + SYP
Sbjct: 319 RMIRNKNQCGIAIQPSYP 336
>gi|214015374|gb|ACJ62303.1| cysteine protease [Zea mays subsp. parviglumis]
Length = 247
Score = 141 bits (355), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 71/185 (38%), Positives = 112/185 (60%), Gaps = 16/185 (8%)
Query: 46 IHEKWMAQHGR----------SYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLG 92
++E W ++HGR + DE + +RL++F++NL YI+K N E G T++LG
Sbjct: 1 MYEAWKSKHGRGGSNDDDCDMAPGDEEGRRLRLEVFRDNLRYIDKHNAEADAGLHTFRLG 60
Query: 93 TNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIK 152
F+DLT DE+R G++ + + + + +P ++DW GAVT +K
Sbjct: 61 LTPFADLTLDEYRGRVLGFRARGRRSGARYGHRARPRGGDL--LPDAIDWCQLGAVTEVK 118
Query: 153 NQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYII 212
+Q++CG CWAF+AVAA+EG+ I +GNL+ LSEQ+++DC ++GC GG E AF ++I
Sbjct: 119 DQQQCGGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ-DSGCDGGQMEDAFRFVI 177
Query: 213 QNQGI 217
N GI
Sbjct: 178 GNGGI 182
>gi|214015241|gb|ACJ62237.1| cysteine protease [Zea mays subsp. parviglumis]
gi|214015243|gb|ACJ62238.1| cysteine protease [Zea mays subsp. parviglumis]
gi|214015255|gb|ACJ62244.1| cysteine protease [Zea mays subsp. parviglumis]
gi|214015261|gb|ACJ62247.1| cysteine protease [Zea mays subsp. parviglumis]
Length = 254
Score = 141 bits (355), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 70/190 (36%), Positives = 112/190 (58%), Gaps = 19/190 (10%)
Query: 46 IHEKWMAQHGRSYKDE-------------LEKEMRLKIFKENLEYIEKANKE---GNRTY 89
++E W ++HGR +++ ++ +RL++F++NL YI+ N E G T+
Sbjct: 1 MYEAWKSKHGRGGRNDDCDMAPGDEEQEEEDRRLRLEVFRDNLRYIDAHNAEADAGLHTF 60
Query: 90 KLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD--VPTSLDWRDKGA 147
+LG F+DLT DE+R G++ + Y+ +P ++DWR GA
Sbjct: 61 RLGLTPFADLTLDEYRGRVLGFRARRSRSGARYGHGHGYRARPRGGDLLPDAIDWRQLGA 120
Query: 148 VTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKA 207
VT +K+Q++CG CWAF+AVAA+EG+ I +GNL+ LSEQ+++DC ++GC GG E A
Sbjct: 121 VTEVKDQQQCGGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ-DSGCDGGQMENA 179
Query: 208 FAYIIQNQGI 217
F ++I N GI
Sbjct: 180 FRFVIGNGGI 189
>gi|228245|prf||1801240C Cys protease 3
Length = 321
Score = 141 bits (355), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 75/174 (43%), Positives = 106/174 (60%), Gaps = 10/174 (5%)
Query: 48 EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEF 104
+ + Q+GR Y D E+ R ++F++N + IE NK+ G T+K+ NQF D+TN+EF
Sbjct: 20 DHFKTQYGRKYGDAKEELYRQRVFQQNEQLIEDFNKKFENGEVTFKVAMNQFGDMTNEEF 79
Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFA 164
A+ GYK S R + F + M +DWR K VTP+K+Q++CG CWAF+
Sbjct: 80 NAVMKGYKKGS---RGEPKAVFTAEGRPMA---RDVDWRTKALVTPVKDQEQCGSCWAFS 133
Query: 165 AVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGI 217
A A+EG +++ L+ LSEQQL+DCST+ GN+GC GG AF YI N GI
Sbjct: 134 ATGALEGQHFLKNDELVSLSEQQLVDCSTDYGNDGCGGGWMTSAFDYIKDNGGI 187
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 34/75 (45%), Positives = 49/75 (65%), Gaps = 4/75 (5%)
Query: 208 FAYIIQNQGIF--NGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR 265
F++ + G++ T LDH V VG+GT E +YWL+KNSWG++WGDAGY+K+ R
Sbjct: 246 FSFQFYSSGVYYEQNCSPTFLDHGVLAVGYGT-ESTKDYWLVKNSWGSSWGDAGYIKMSR 304
Query: 266 D-EGLCGIGTRSSYP 279
+ + CGI + SYP
Sbjct: 305 NRDNNCGIASEPSYP 319
>gi|12803615|gb|AAH02642.1| Cathepsin S [Homo sapiens]
gi|49456313|emb|CAG46477.1| CTSS [Homo sapiens]
gi|60821573|gb|AAX36579.1| cathepsin S [synthetic construct]
gi|189069420|dbj|BAG37086.1| unnamed protein product [Homo sapiens]
gi|261858586|dbj|BAI45815.1| cathepsin S [synthetic construct]
Length = 331
Score = 140 bits (354), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 78/206 (37%), Positives = 121/206 (58%), Gaps = 14/206 (6%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIH-EKWMAQHGRSYKDELEKEMRLKIFKENLE 76
M ++ +L+ C+S V H+ ++ H W +G+ YK++ E+ +R I+++NL+
Sbjct: 1 MKRLVCVLLVCSSAVAQ---LHKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLK 57
Query: 77 YIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
++ N E G +Y LG N D+T++E +L + ++PS R+ T Y++
Sbjct: 58 FVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLRVPSQWQRNIT-----YKSNPN 112
Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST 193
+P S+DWR+KG VT +K Q CG CWAF+AV A+E K+++G L+ LS Q L+DCST
Sbjct: 113 WILPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCST 172
Query: 194 N--GNNGCLGGSREKAFAYIIQNQGI 217
GN GC GG AF YII N+GI
Sbjct: 173 EKYGNKGCNGGFMTTAFQYIIDNKGI 198
Score = 62.4 bits (150), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 28/78 (35%), Positives = 50/78 (64%), Gaps = 3/78 (3%)
Query: 203 SREKAFAYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMK 262
+R +F ++ ++ + C ++H V +VG+G +G YWL+KNSWG+ +G+ GY++
Sbjct: 254 ARHPSF-FLYRSGVYYEPSCTQNVNHGVLVVGYGDL-NGKEYWLVKNSWGHNFGEEGYIR 311
Query: 263 IVRDEG-LCGIGTRSSYP 279
+ R++G CGI + SYP
Sbjct: 312 MARNKGNHCGIASFPSYP 329
>gi|2224812|emb|CAB09699.1| cysteine endopeptidase EP-A [Hordeum vulgare subsp. vulgare]
Length = 365
Score = 140 bits (354), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 71/180 (39%), Positives = 107/180 (59%), Gaps = 8/180 (4%)
Query: 40 EQSVVEIHEKWMAQHGRSYKD--ELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
E+S+ ++E+W + + S + +E R +FK+N Y+ + NK + ++L N+F+
Sbjct: 34 EESLRGLYERWRSHYTVSRRGLGADAEERRFNVFKQNARYVHEGNKR-DMPFRLALNKFA 92
Query: 98 DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD---VPTSLDWRDKGAVTPIKNQ 154
D+T DEFR Y G ++ H S + D +P ++DWR KGAVT IK+Q
Sbjct: 93 DMTTDEFRRTYAGSRVRH--HLSLSGGRRGDGGFRYGDADNLPPAVDWRQKGAVTAIKDQ 150
Query: 155 KECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQN 214
+CG CWAF+ + AVEGI KIR+G L+ LSEQ+L+DC N GC GG + AF +I +N
Sbjct: 151 GQCGSCWAFSTIVAVEGINKIRTGKLVSLSEQELMDCDNVNNQGCDGGLMDYAFQFIQKN 210
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 36/70 (51%), Positives = 48/70 (68%), Gaps = 4/70 (5%)
Query: 214 NQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGL 269
++G+F G C T LDH V VG+G T DG YW++KNSWG WG+ GY+++ R EGL
Sbjct: 277 SEGVFTGECSTDLDHGVAAVGYGATRDGTKYWIVKNSWGEDWGEKGYIRMQRGVSQTEGL 336
Query: 270 CGIGTRSSYP 279
CGI ++SYP
Sbjct: 337 CGIAMQASYP 346
>gi|330803818|ref|XP_003289899.1| hypothetical protein DICPUDRAFT_154350 [Dictyostelium purpureum]
gi|325080010|gb|EGC33584.1| hypothetical protein DICPUDRAFT_154350 [Dictyostelium purpureum]
Length = 326
Score = 140 bits (354), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 79/201 (39%), Positives = 121/201 (60%), Gaps = 16/201 (7%)
Query: 19 FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
I L+++C S ++R ++ + WM +H +SY ++ E R +F++N++ +
Sbjct: 7 LIFCFLIINCCS---AARIFSQKQYQTAFQNWMVKHQKSYTND-EFGSRYSVFQDNMDIV 62
Query: 79 EKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNL-SMTDVP 137
K N++G+ T LG N +DLTN+EF+ LY G K + T+K + L ++ +P
Sbjct: 63 AKWNQKGSNTI-LGLNVMADLTNEEFKKLYLGTK---------ANVTYKKKTLVGVSGLP 112
Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-TNGN 196
S+DWR GAVT +KNQ +CG C+AF+ +VEGI +I S L+ LSEQQ+LDCS + GN
Sbjct: 113 ASVDWRANGAVTAVKNQGQCGGCYAFSTTGSVEGIHEITSQQLVPLSEQQILDCSGSEGN 172
Query: 197 NGCLGGSREKAFAYIIQNQGI 217
NGC GG +F YII G+
Sbjct: 173 NGCDGGLMTNSFEYIIAVGGL 193
Score = 64.7 bits (156), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 28/59 (47%), Positives = 42/59 (71%), Gaps = 2/59 (3%)
Query: 224 TQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPLA 281
TQLDH V VG+G+ + G +YW++KNSWG WG+ G++ + R+ + CGI T +S+P A
Sbjct: 269 TQLDHGVLAVGYGS-QSGQDYWIVKNSWGADWGENGFILMARNKDNNCGIATMASFPTA 326
>gi|281207567|gb|EFA81750.1| cysteine protease 4 [Polysphondylium pallidum PN500]
Length = 432
Score = 140 bits (354), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 77/205 (37%), Positives = 121/205 (59%), Gaps = 7/205 (3%)
Query: 18 MFIIITLLVSCASQVVSSRSTH----EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKE 73
M+ + L++C +++ S + E+ + WM + Y D E R +FK+
Sbjct: 1 MYRLSAYLLACTVFMLAVLSANAAFTERQYQDSFVSWMQTNNVKY-DGKEFNHRYGVFKK 59
Query: 74 NLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
N++Y+++ N +G+ T LG N F+DLTN E++ +Y G K+ + + ++ +N ++
Sbjct: 60 NMDYVQQWNAKGSSTV-LGMNIFADLTNAEYQRIYLGTKIDASGLLNVAAARAFDRNFNI 118
Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST 193
+ ++DWR KGAVTPIKNQ +CG CW+F+ +VEG +I +GNL+ LSEQ L+DCS
Sbjct: 119 KALNPTVDWRAKGAVTPIKNQAQCGSCWSFSTTGSVEGAHEISTGNLVALSEQNLIDCSV 178
Query: 194 -NGNNGCLGGSREKAFAYIIQNQGI 217
GN GC GG A YII+N GI
Sbjct: 179 PEGNQGCNGGLMWAAMEYIIKNGGI 203
>gi|12621903|gb|AAB60643.2|AAB60643 cathepsin S [Homo sapiens]
Length = 267
Score = 140 bits (354), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 78/206 (37%), Positives = 121/206 (58%), Gaps = 14/206 (6%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIH-EKWMAQHGRSYKDELEKEMRLKIFKENLE 76
M ++ +L+ C+S V H+ ++ H W +G+ YK++ E+ +R I+++NL+
Sbjct: 1 MKRLVCVLLVCSSAVAQ---LHKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLK 57
Query: 77 YIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
++ N E G +Y LG N D+T++E +L + ++PS R+ T Y++
Sbjct: 58 FVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLTSSLRVPSQWQRNIT-----YKSNPN 112
Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST 193
+P S+DWR+KG VT +K Q CG CWAF+AV A+E K+++G L+ LS Q L+DCST
Sbjct: 113 RILPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVTLSAQNLVDCST 172
Query: 194 N--GNNGCLGGSREKAFAYIIQNQGI 217
GN GC GG AF YII N+GI
Sbjct: 173 EKYGNKGCNGGFMTTAFQYIIDNKGI 198
>gi|351726954|ref|NP_001236888.1| cysteine proteinase precursor [Glycine max]
gi|479060|emb|CAA83673.1| cysteine proteinase [Glycine max]
gi|300507422|gb|ADK24076.1| cysteine proteinase [Glycine max]
gi|300507425|gb|ADK24077.1| cysteine proteinase [Glycine max]
gi|1096153|prf||2111244A Cys protease
Length = 380
Score = 140 bits (354), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 102/310 (32%), Positives = 136/310 (43%), Gaps = 86/310 (27%)
Query: 50 WMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEG-NRTYKLGTNQFSDLTNDEFRALY 108
+M +GRSY E E RL IF +N+ + A + + T G QFSDLT DEF LY
Sbjct: 57 FMENYGRSYSTEEEYLRRLGIFAQNM--VRAAEHQALDPTAVHGVTQFSDLTEDEFEKLY 114
Query: 109 TGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAA 168
TG PS S ++ L + +P + DWR+KGAVT +K Q CG CWAF+ +
Sbjct: 115 TGVNGGFPS--SNNAAGGIAPPLEVDGLPENFDWREKGAVTEVKLQGRCGSCWAFSTTGS 172
Query: 169 VEGITKIRSGNLIQLSEQQLLDCST-----------NGNNGCL-----------GGSREK 206
+EG + +G L+ LSEQQLLDC NG NG L GG E+
Sbjct: 173 IEGANFLATGKLVSLSEQQLLDCDNKCDITEKTSCDNGCNGGLMTNAYNYLLESGGLEEE 232
Query: 207 A-----------------------------------FAYIIQNQGIFNGV---------- 221
+ AY+++N + GV
Sbjct: 233 SSYPYTGERGECKFDPEKIAVKITNFTNIPADENQIAAYLVKNGPLAMGVNAIFMQTYIG 292
Query: 222 -------CGTQ-LDHAVTIVGFGTTE------DGANYWLIKNSWGNTWGDAGYMKIVRDE 267
C + L+H V +VG+G YW+IKNSWG WG+ GY K+ R
Sbjct: 293 GVSCPLICSKKRLNHGVLLVGYGAKGFSILRLGNKPYWIIKNSWGEKWGEDGYYKLCRGH 352
Query: 268 GLCGIGTRSS 277
G+CGI T S
Sbjct: 353 GMCGINTMVS 362
>gi|11055|emb|CAA45129.1| cysteine proteinase preproenzyme [Homarus americanus]
Length = 320
Score = 140 bits (354), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 75/174 (43%), Positives = 106/174 (60%), Gaps = 10/174 (5%)
Query: 48 EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEF 104
+ + Q+GR Y D E+ R ++F++N + IE NK+ G T+K+ NQF D+TN+EF
Sbjct: 20 DHFKTQYGRKYGDAKEELYRQRVFQQNEQLIEDFNKKFENGEVTFKVAMNQFGDMTNEEF 79
Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFA 164
A+ GYK S R + F + M +DWR K VTP+K+Q++CG CWAF+
Sbjct: 80 NAVMKGYKKGS---RGEPKAVFTAEAGPMA---ADVDWRTKALVTPVKDQEQCGSCWAFS 133
Query: 165 AVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGI 217
A A+EG +++ L+ LSEQQL+DCST+ GN+GC GG AF YI N GI
Sbjct: 134 ATGALEGQHFLKNDELVSLSEQQLVDCSTDYGNDGCGGGWMTSAFDYIKDNGGI 187
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 34/75 (45%), Positives = 49/75 (65%), Gaps = 4/75 (5%)
Query: 208 FAYIIQNQGIF--NGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR 265
F++ + G++ T LDH V VG+GT E +YWL+KNSWG++WGDAGY+K+ R
Sbjct: 245 FSFQFYSSGVYYEQNCSPTFLDHGVLAVGYGT-ESTKDYWLVKNSWGSSWGDAGYIKMSR 303
Query: 266 D-EGLCGIGTRSSYP 279
+ + CGI + SYP
Sbjct: 304 NRDNNCGIASEPSYP 318
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.317 0.132 0.402
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,373,813,019
Number of Sequences: 23463169
Number of extensions: 180273488
Number of successful extensions: 533027
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 6700
Number of HSP's successfully gapped in prelim test: 832
Number of HSP's that attempted gapping in prelim test: 504751
Number of HSP's gapped (non-prelim): 15923
length of query: 281
length of database: 8,064,228,071
effective HSP length: 141
effective length of query: 140
effective length of database: 9,050,888,538
effective search space: 1267124395320
effective search space used: 1267124395320
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 76 (33.9 bits)