BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 018958
(348 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255563110|ref|XP_002522559.1| cysteine protease, putative [Ricinus communis]
gi|223538250|gb|EEF39859.1| cysteine protease, putative [Ricinus communis]
Length = 344
Score = 382 bits (980), Expect = e-103, Method: Compositional matrix adjust.
Identities = 185/338 (54%), Positives = 242/338 (71%), Gaps = 9/338 (2%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
+ ++ LLV+ + SRS HE S+ H+ WM Q+GR YK +EKE R KIFKEN+E+
Sbjct: 9 LVLMAMLLVTLWASQSWSRSLHEASMELRHKTWMTQYGRVYKGNVEKEKRFKIFKENVEF 68
Query: 78 IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRST-TSSTFKYQNLSMTDV 136
IE N GN+ YKLG N F+DLTN+EFRA + GY M SH+S+ + +F+Y+N+ T V
Sbjct: 69 IESFNNNGNKPYKLGINAFTDLTNEEFRASHNGYTMSMSSHQSSYRTKSFRYENV--TAV 126
Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG- 195
P SLDWR KGAVT IK+Q +CGCCWAF+AVAA+EGITK+ +G LI LSEQ+L+DC T+G
Sbjct: 127 PPSLDWRTKGAVTHIKDQGQCGCCWAFSAVAAMEGITKLSTGTLISLSEQELVDCDTSGM 186
Query: 196 NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKP-AAAKISNYEEVPSGDE 254
+ GC GG + AF +II+N G+ TE YPY+ V G+C+ + AAKI+ YE VP+ DE
Sbjct: 187 DQGCEGGLMDDAFEFIIENNGLTTEANYPYEGVDGSCNTRKAANHAAKITGYENVPAYDE 246
Query: 255 QALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLI 314
+AL KAV+ QPVS+AI A + FQ Y GIF G CGT+LDH VT+VG+GT++DG YWL+
Sbjct: 247 EALRKAVANQPVSVAIDAGESAFQHYSSGIFTGDCGTELDHGVTVVGYGTSDDGTKYWLV 306
Query: 315 KNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
KNSWG +WG+ GY+++ RD EGLCGI SYP A
Sbjct: 307 KNSWGTSWGEDGYIRMERDIDAKEGLCGIAMEPSYPTA 344
>gi|224076970|ref|XP_002305073.1| predicted protein [Populus trichocarpa]
gi|222848037|gb|EEE85584.1| predicted protein [Populus trichocarpa]
Length = 340
Score = 379 bits (973), Expect = e-102, Method: Compositional matrix adjust.
Identities = 181/342 (52%), Positives = 241/342 (70%), Gaps = 11/342 (3%)
Query: 12 KINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIF 71
K NT + +L + A+++ ++ +++ HE+WMAQHGR Y D EKE R IF
Sbjct: 5 KCNTRIFLPFLLILAAWATKIACRPLDEQEYMLKRHEEWMAQHGRVYGDMKEKEKRYLIF 64
Query: 72 KENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNL 131
KEN+E IE N +R YKLG N+F+DLTN+EFRA+Y GYK S SS+F+Y+NL
Sbjct: 65 KENIERIEAFNNGSDRGYKLGVNKFADLTNEEFRAMYHGYKRQSSK---LMSSSFRYENL 121
Query: 132 SMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDC 191
S D+PTS+DWR+ GAVTP+K+Q CGCCWAF+ VAA+EGI K+++GNLI LSEQQL+DC
Sbjct: 122 S--DIPTSMDWRNDGAVTPVKDQGTCGCCWAFSTVAAIEGIIKLQTGNLISLSEQQLVDC 179
Query: 192 STNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAA-AKISNYEEVP 250
T GN GC GG + AF YII+N G+ +ED YPYQ V GTCS+ + + A+I+ YE+VP
Sbjct: 180 -TAGNKGCQGGLMDTAFQYIIRNGGLTSEDNYPYQGVDGTCSSEKAASTEAQITGYEDVP 238
Query: 251 SGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGAN 310
+E ALL+AV+ QPVS+ + +FQ YK G+FNG CGTQ +HAVT +G+GT DG +
Sbjct: 239 QNNENALLQAVAKQPVSVGVDGGGNDFQFYKSGVFNGDCGTQQNHAVTAIGYGTDIDGTD 298
Query: 311 YWLIKNSWGNTWGDAGYMKIVR----DEGLCGIGTRSSYPLA 348
YWL+KNSWG +WG+ GYM++ R EGLCG+ +SYP A
Sbjct: 299 YWLVKNSWGTSWGENGYMRMRRGIGSSEGLCGVAMDASYPTA 340
>gi|255564908|ref|XP_002523447.1| cysteine protease, putative [Ricinus communis]
gi|223537275|gb|EEF38906.1| cysteine protease, putative [Ricinus communis]
Length = 342
Score = 372 bits (954), Expect = e-100, Method: Compositional matrix adjust.
Identities = 184/337 (54%), Positives = 238/337 (70%), Gaps = 10/337 (2%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
MF+ + ++ ASQ S RS H+ ++ E HE WMA++GR YKD EKE R +IF+ N+E+
Sbjct: 10 MFVALLVVGLWASQAWS-RSLHDAAMNERHEMWMAKYGRVYKDNSEKERRFEIFRNNVEF 68
Query: 78 IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
IE NK GNR YKL N+F+DLTN+EF+ GYK S T S+F+Y N+ T VP
Sbjct: 69 IESFNKLGNRPYKLDINEFADLTNEEFKVSKNGYKRSSGVGL-TEKSSFRYANV--TAVP 125
Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-N 196
TS+DWR GAVTPIK+Q +CGCCWAF+AVAA+EGITK+ +G LI LSEQ+L+DC T+G +
Sbjct: 126 TSMDWRQNGAVTPIKDQGQCGCCWAFSAVAAMEGITKLSTGKLISLSEQELVDCDTSGED 185
Query: 197 NGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKP-AAAKISNYEEVPSGDEQ 255
GC GG + AF +I QN G+ TE YPYQ GTC+ + AAKI+ YE+VP+ E
Sbjct: 186 QGCEGGLMDDAFEFIKQNGGLTTEANYPYQGTDGTCNTNKAGNDAAKITGYEDVPANSED 245
Query: 256 ALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIK 315
ALLKAV+ QPVS+AI A + FQ Y G+F G CGT+LDH VT VG+GT++DG YWL+K
Sbjct: 246 ALLKAVASQPVSVAIDASGSAFQFYSGGVFTGDCGTELDHGVTAVGYGTSDDGTKYWLVK 305
Query: 316 NSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
NSWG +WG+ GY+++ RD EGLCGI + SYP A
Sbjct: 306 NSWGTSWGEDGYIRMERDIEAKEGLCGIAMQPSYPTA 342
>gi|255564910|ref|XP_002523448.1| cysteine protease, putative [Ricinus communis]
gi|223537276|gb|EEF38907.1| cysteine protease, putative [Ricinus communis]
Length = 341
Score = 369 bits (948), Expect = e-100, Method: Compositional matrix adjust.
Identities = 186/337 (55%), Positives = 238/337 (70%), Gaps = 11/337 (3%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
MF+ + ++ SQ S RS H+ ++ E HE WM ++GR YKD EKE R +IF+ N+E+
Sbjct: 10 MFVALLVVGLWVSQAWS-RSLHDAAMNERHEMWMVKYGRVYKDNSEKERRFEIFRNNVEF 68
Query: 78 IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
IE NK GNR YKL N+F+DLTN+EF+A GYK S S SS F+Y N+ T VP
Sbjct: 69 IESFNKPGNRPYKLDINEFADLTNEEFKASRNGYKRSSNVGLSEKSS-FRYGNV--TAVP 125
Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-N 196
TS+DWR KGAVTPIK+Q +CGCCWAF+AVAA+EGITK+ +G LI LSEQ+L+DC T+G +
Sbjct: 126 TSMDWRQKGAVTPIKDQGQCGCCWAFSAVAAMEGITKLSTGKLISLSEQELVDCDTSGED 185
Query: 197 NGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKP-AAAKISNYEEVPSGDEQ 255
GC GG + AF +I QN G+ TE YPYQ GTC+ + AAKI+ YE+VP+ E
Sbjct: 186 QGCEGGLMDDAFEFIKQNGGLTTEANYPYQGTDGTCNTNKAGNDAAKITGYEDVPANSED 245
Query: 256 ALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIK 315
ALLKAV+ QPVS+AI A + FQ Y G+F G CGT+LDH VT VG+GT+ DG YWL+K
Sbjct: 246 ALLKAVASQPVSVAIDASGSAFQFYSGGVFTGDCGTELDHGVTAVGYGTS-DGTKYWLVK 304
Query: 316 NSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
NSWG +WG+ GY+++ RD EGLCGI +SSYP A
Sbjct: 305 NSWGTSWGEDGYIRMERDIEAKEGLCGIAMQSSYPTA 341
>gi|124484401|dbj|BAF46311.1| cysteine proteinase precursor [Ipomoea nil]
Length = 339
Score = 369 bits (948), Expect = e-99, Method: Compositional matrix adjust.
Identities = 176/341 (51%), Positives = 241/341 (70%), Gaps = 11/341 (3%)
Query: 14 NTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKE 73
N+ + I + L+ + ++ + +SR+ + + HE+WMAQ+GR YK+E+EK R IFKE
Sbjct: 4 NSLKLLIALALVFATSAYLATSRTLLDSLMAVRHEQWMAQYGRVYKNEVEKTKRYNIFKE 63
Query: 74 NLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
N+EYIE NK G + YKLG N F+DLTN EF A GY +P H ++++ F+Y+N+S
Sbjct: 64 NVEYIESFNKAGTKPYKLGINAFADLTNKEFIASRNGYILP---HECSSNTPFRYENVSA 120
Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST 193
VPT++DWR KGAVTP+K+Q +CGCCWAF+AVAA+EGITK+ +GNLI LSEQ+L+DC
Sbjct: 121 --VPTTVDWRKKGAVTPVKDQGQCGCCWAFSAVAAMEGITKLSTGNLISLSEQELVDCDV 178
Query: 194 NG-NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAK-ISNYEEVPS 251
G + GC GG + AF +II N+G+ TE YPYQ G+C ++ +A IS YE+VP+
Sbjct: 179 KGIDQGCEGGLMDDAFTFIINNKGLTTESNYPYQGTDGSCKKSKSSNSAAKISGYEDVPA 238
Query: 252 GDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANY 311
E AL KAV+ QPVS+AI A ++FQ Y G+F G CGT+LDH VT VG+G EDG+ Y
Sbjct: 239 NSESALEKAVANQPVSVAIDAGGSDFQFYSSGVFTGECGTELDHGVTAVGYGIAEDGSKY 298
Query: 312 WLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
WL+KNSWG +WG+ GY+++ +D EGLCGI +SSYP A
Sbjct: 299 WLVKNSWGTSWGEKGYIRMQKDIEAKEGLCGIAMQSSYPSA 339
>gi|225443827|ref|XP_002274223.1| PREDICTED: vignain-like [Vitis vinifera]
Length = 340
Score = 369 bits (946), Expect = 2e-99, Method: Compositional matrix adjust.
Identities = 184/338 (54%), Positives = 240/338 (71%), Gaps = 12/338 (3%)
Query: 19 FIIITLLVS--CASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLE 76
I ITLL+ ASQ +S R+ HE S+ E HE WM +GR+YKD EKE R KIFKEN+E
Sbjct: 7 IICITLLIMGVWASQALS-RTLHEVSMSERHEDWMGLYGRTYKDIAEKERRFKIFKENVE 65
Query: 77 YIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDV 136
YIE N GNR YKL N+F+D TN+EF+A GY M S RS+ ++F+Y+N++ V
Sbjct: 66 YIESVNSAGNRRYKLSINEFADQTNEEFKASRNGYNMSSRP-RSSEITSFRYENVAA--V 122
Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG- 195
P+S+DWR KGAVTPIK+Q +CGCCWAF+AVAA+EG+T++++G LI LSEQ+L+DC T+G
Sbjct: 123 PSSMDWRKKGAVTPIKDQGQCGCCWAFSAVAAMEGVTQLKTGELISLSEQELVDCDTSGE 182
Query: 196 NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAK-ISNYEEVPSGDE 254
+ GC GG + AF +II N G+ TE YPY+ V TC+ + ++A I NYE+VP+ E
Sbjct: 183 DQGCGGGLMDSAFEFIIGNGGLTTEANYPYKGVDATCNKKKAASSAAKIKNYEDVPANSE 242
Query: 255 QALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLI 314
ALLKAV+ PVS+AI A ++FQ Y G+F G CGT+LDH VT VG+G T+DG YWL+
Sbjct: 243 AALLKAVAQHPVSVAIDAGGSDFQFYSSGVFTGQCGTELDHGVTAVGYGKTDDGTKYWLV 302
Query: 315 KNSWGNTWGDAGYMKIVR----DEGLCGIGTRSSYPLA 348
KNSWG WG+ GY+ + R DEGLCGI +SYP A
Sbjct: 303 KNSWGTGWGEDGYIWMERDIGADEGLCGIAMEASYPTA 340
>gi|24285904|gb|AAL14199.1| cysteine proteinase precursor [Ipomoea batatas]
gi|56961686|gb|AAK15148.2| cysteine proteinase-like protein [Ipomoea batatas]
Length = 341
Score = 368 bits (945), Expect = 2e-99, Method: Compositional matrix adjust.
Identities = 174/323 (53%), Positives = 235/323 (72%), Gaps = 11/323 (3%)
Query: 32 VVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKL 91
+ +SR+ + +V HE+WMAQ+GR Y++E+EK R IFKEN+EYIE NK G + YKL
Sbjct: 24 LATSRTLSDSLMVVRHEQWMAQYGRVYENEVEKTKRFNIFKENVEYIESFNKAGTKPYKL 83
Query: 92 GTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPI 151
G N F+DLTN EF+A GYK+P H ++++ F+Y+N+S VPT++DWR KGAVTP+
Sbjct: 84 GINAFADLTNQEFKASRNGYKLP---HDCSSNTPFRYENVS--SVPTTVDWRTKGAVTPV 138
Query: 152 KNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFAY 210
K+Q +CGCCWAF+AVAA+EGITK+ +GNLI LSEQ+L+DC G + GC GG + AF++
Sbjct: 139 KDQGQCGCCWAFSAVAAMEGITKLSTGNLISLSEQELVDCDVKGIDQGCEGGLMDDAFSF 198
Query: 211 IIQNQGIATEDEYPYQAVPGTCSAAQKPAAAK-ISNYEEVPSGDEQALLKAVSMQPVSIA 269
II N+G+ TE YPYQ G+C ++ +A IS YE+VP+ E AL KAV+ QPVS+A
Sbjct: 199 IINNKGLTTESNYPYQGTDGSCKKSKSSNSAAKISGYEDVPANSESALEKAVANQPVSVA 258
Query: 270 IAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMK 329
I A ++FQ Y G+F G CGT+LDH VT VG+G EDG+ YWL+KNSWG +WG+ GY++
Sbjct: 259 IDAGGSDFQFYSSGVFTGECGTELDHGVTAVGYGIAEDGSKYWLVKNSWGTSWGEKGYIR 318
Query: 330 IVRD----EGLCGIGTRSSYPLA 348
+ +D EGLCGI +SSYP A
Sbjct: 319 MQKDIEAKEGLCGIAMQSSYPSA 341
>gi|409190991|gb|AFV30165.1| cysteine proteinase [Lotus japonicus]
Length = 342
Score = 368 bits (945), Expect = 2e-99, Method: Compositional matrix adjust.
Identities = 180/338 (53%), Positives = 241/338 (71%), Gaps = 15/338 (4%)
Query: 19 FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
F ++ L A QV SSR+ + S+ E HE+WMA++GR YKD EKE R IFKEN+ YI
Sbjct: 12 FALVLCLGLWAFQV-SSRTLQDASMQERHEQWMARYGRVYKDLQEKEKRFSIFKENVNYI 70
Query: 79 EKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYK--MPSPSHRSTTSSTFKYQNLSMTDV 136
E +N G++ YKLG NQF+DLTN+EF A +K M S R+TT FKY+N++
Sbjct: 71 EASNNAGDKPYKLGVNQFADLTNEEFIATRNKFKGHMSSSITRTTT---FKYENVT---A 124
Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG- 195
P+++DWR +GAVTP+KNQ CGCCWAF+AVAA EGI K+ +GNL+ LSEQ+L+DC T+G
Sbjct: 125 PSTVDWRQEGAVTPVKNQGTCGCCWAFSAVAATEGIHKLSTGNLVSLSEQELVDCDTSGA 184
Query: 196 NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDE 254
+ GC GG + AF +IIQN G+ TE +YPYQ V GTC+ ++ A I+ YE+VPS +E
Sbjct: 185 DQGCQGGLMDDAFKFIIQNGGLNTEAQYPYQGVDGTCNTNEEATHVATITGYEDVPSNNE 244
Query: 255 QALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLI 314
QAL +AV+ QP+SIAI A ++FQ+Y+ G+F G CGTQLDH V +VG+G ++DG YWL+
Sbjct: 245 QALQQAVANQPISIAIDASGSDFQNYQSGVFTGSCGTQLDHGVAVVGYGVSDDGTKYWLV 304
Query: 315 KNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
KNSWG WG+ GY+++ RD EGLCG+ + SYP A
Sbjct: 305 KNSWGADWGEEGYIRMQRDVDAPEGLCGLAMQPSYPTA 342
>gi|13491750|gb|AAK27968.1|AF242372_1 cysteine protease [Ipomoea batatas]
Length = 339
Score = 368 bits (944), Expect = 2e-99, Method: Compositional matrix adjust.
Identities = 175/323 (54%), Positives = 233/323 (72%), Gaps = 11/323 (3%)
Query: 32 VVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKL 91
+ +SR+ + +V HE+WMAQ+GR YK E EK R IFKEN+EYIE NK G + YKL
Sbjct: 22 LATSRTLSDSLMVVRHEQWMAQYGRVYKTEAEKTKRFNIFKENVEYIESFNKAGTKPYKL 81
Query: 92 GTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPI 151
G N F+DLTN EF+A GYK+P H ++++ F+Y+N+S VPT++DWR KGAVTP+
Sbjct: 82 GINAFADLTNQEFKASRNGYKLP---HDCSSNTPFRYENVS--SVPTTVDWRTKGAVTPV 136
Query: 152 KNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFAY 210
K+Q +CGCCWAF+AVAA+EGITK+ +GNLI LSEQ+L+DC G + GC GG + AF++
Sbjct: 137 KDQGQCGCCWAFSAVAAMEGITKLSTGNLISLSEQELVDCDVKGTDQGCEGGLMDDAFSF 196
Query: 211 IIQNQGIATEDEYPYQAVPGTCSAAQKPAAAK-ISNYEEVPSGDEQALLKAVSMQPVSIA 269
II N+G+ TE YPYQ G+C ++ +A IS YE+VP+ E AL KAV+ QPVS+A
Sbjct: 197 IINNKGLTTESNYPYQGTDGSCKKSKSSNSAAKISGYEDVPANSESALEKAVANQPVSVA 256
Query: 270 IAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMK 329
I A ++FQ Y G+F G CGT+LDH VT VG+G EDG+ YWL+KNSWG +WG+ GY++
Sbjct: 257 IDAGGSDFQFYSSGVFTGECGTELDHGVTAVGYGIAEDGSKYWLVKNSWGTSWGEKGYIR 316
Query: 330 IVRD----EGLCGIGTRSSYPLA 348
+ +D EGLCGI +SSYP A
Sbjct: 317 MQKDIEAKEGLCGIAMQSSYPSA 339
>gi|224076968|ref|XP_002305072.1| predicted protein [Populus trichocarpa]
gi|222848036|gb|EEE85583.1| predicted protein [Populus trichocarpa]
Length = 305
Score = 367 bits (942), Expect = 5e-99, Method: Compositional matrix adjust.
Identities = 172/309 (55%), Positives = 226/309 (73%), Gaps = 11/309 (3%)
Query: 43 VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTND 102
+++ HE+WMAQHGR Y D EKE R IFKEN+E IE N +R YKLG N+F+DLTN+
Sbjct: 1 MLKRHEEWMAQHGRVYGDMKEKEKRYLIFKENIERIEAFNNGSDRGYKLGVNKFADLTNE 60
Query: 103 EFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWA 162
EFRA+Y GYK S SS+F+Y+NLS D+PTS+DWR+ GAVTP+K+Q CGCCWA
Sbjct: 61 EFRAMYHGYKRQSSK---LMSSSFRYENLS--DIPTSMDWRNDGAVTPVKDQGTCGCCWA 115
Query: 163 FAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDE 222
F+ VAA+EGI K+++GNLI LSEQQL+DC T GN GC GG + AF YII+N G+ +ED
Sbjct: 116 FSTVAAIEGIIKLQTGNLISLSEQQLVDC-TAGNKGCQGGLMDTAFQYIIRNGGLTSEDN 174
Query: 223 YPYQAVPGTCSAAQKPAA-AKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYK 281
YPYQ V GTCS+ + + A+I+ YE+VP +E ALL+AV+ QPVS+A+ +F+ YK
Sbjct: 175 YPYQGVDGTCSSEKAASTEAQITGYEDVPQNNENALLQAVAKQPVSVAVDGGGNDFRFYK 234
Query: 282 EGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR----DEGLC 337
G+F G CGT L+H VT +G+GT DG +YWL+KNSWG +WG++GY ++ R EGLC
Sbjct: 235 SGVFEGDCGTNLNHGVTAIGYGTDSDGTDYWLVKNSWGTSWGESGYTRMQRGIGASEGLC 294
Query: 338 GIGTRSSYP 346
G+ +SYP
Sbjct: 295 GVAMDASYP 303
>gi|535454|gb|AAA50755.1| cysteine proteinase [Alnus glutinosa]
Length = 340
Score = 367 bits (941), Expect = 6e-99, Method: Compositional matrix adjust.
Identities = 178/337 (52%), Positives = 242/337 (71%), Gaps = 13/337 (3%)
Query: 19 FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
+++ L + ASQ+ ++RS + S+ E HE+WMA +GR YKD EK+ R KIF+EN+ I
Sbjct: 10 LVVMVTLGALASQLAAARSLQDASMRERHEEWMASYGRVYKDINEKQKRYKIFEENVALI 69
Query: 79 EKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHR-STTSSTFKYQNLSMTDVP 137
E +NK+ N+ YKL NQF+DLTN+EF+A +K H ST S++FKY N+S VP
Sbjct: 70 ESSNKDANKPYKLSVNQFADLTNEEFKASRNRFK----GHICSTKSTSFKYGNVSA--VP 123
Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-N 196
+++DWR KGAVTP+K+Q +CGCCWAF+AVAA EGITK+ +G LI LSEQ+L+DC T+G +
Sbjct: 124 SAMDWRMKGAVTPVKDQGQCGCCWAFSAVAATEGITKLTTGELISLSEQELVDCDTSGVD 183
Query: 197 NGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSA-AQKPAAAKISNYEEVPSGDEQ 255
GC GG + AF +I N G+A+E YPY+ V GTC+ Q AA+I+ +E+VP+ E+
Sbjct: 184 QGCEGGLMDNAFTFIQHNHGLASEANYPYKGVDGTCNTNKQAIHAAEINGFEDVPANSEE 243
Query: 256 ALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIK 315
ALL AV+ QPVS+AI A + FQ Y +G+F G CGTQLDH VT VG+GT++DG YWL+K
Sbjct: 244 ALLNAVAHQPVSVAIDAGGSGFQFYSKGVFIGACGTQLDHGVTAVGYGTSDDGTKYWLVK 303
Query: 316 NSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
NSWG WG+ GY+++ RD EGLCGI ++SYP A
Sbjct: 304 NSWGTQWGEEGYIRMQRDVDAKEGLCGIAMKASYPTA 340
>gi|144905104|dbj|BAF56427.1| cysteine proteinase [Lotus japonicus]
Length = 342
Score = 366 bits (940), Expect = 7e-99, Method: Compositional matrix adjust.
Identities = 179/338 (52%), Positives = 241/338 (71%), Gaps = 15/338 (4%)
Query: 19 FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
F ++ L A QV SSR+ + S+ E HE+WMA++G+ YKD EKE R IF+EN++YI
Sbjct: 12 FALVLCLGLWAFQV-SSRTLQDASMHERHEQWMARYGKVYKDLQEKEKRFNIFQENVKYI 70
Query: 79 EKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYK--MPSPSHRSTTSSTFKYQNLSMTDV 136
E +N GN+ YKLG NQF+DLTN EF A +K M S R+TT FKY+N++
Sbjct: 71 EASNNAGNKPYKLGVNQFTDLTNKEFIATRNKFKGHMSSSITRTTT---FKYENVT---A 124
Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG- 195
P+++DWR +GAVTP+KNQ CGCCWAF+AVAA EGI K+ +GNL+ LSEQ+L+DC T+G
Sbjct: 125 PSTVDWRQEGAVTPVKNQGTCGCCWAFSAVAATEGIHKLSTGNLVSLSEQELVDCDTSGA 184
Query: 196 NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDE 254
+ GC GG + AF +IIQN G+ TE +YPYQ V GTC+ ++ A I+ YE+VPS +E
Sbjct: 185 DQGCQGGLMDDAFKFIIQNGGLNTEAQYPYQGVDGTCNTNEEVTHVATITGYEDVPSNNE 244
Query: 255 QALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLI 314
QAL +AV+ QP+S+AI A ++FQ+Y+ G+F G CGTQLDH V +VG+G ++DG YWL+
Sbjct: 245 QALQQAVANQPISVAIDASGSDFQNYQSGVFTGSCGTQLDHGVAVVGYGVSDDGTKYWLV 304
Query: 315 KNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
KNSWG WG+ GY+++ RD EGLCGI + SYP A
Sbjct: 305 KNSWGEDWGEEGYIRMQRDVEAPEGLCGIAMQPSYPTA 342
>gi|224114698|ref|XP_002316833.1| predicted protein [Populus trichocarpa]
gi|222859898|gb|EEE97445.1| predicted protein [Populus trichocarpa]
Length = 305
Score = 366 bits (940), Expect = 8e-99, Method: Compositional matrix adjust.
Identities = 174/310 (56%), Positives = 228/310 (73%), Gaps = 10/310 (3%)
Query: 44 VEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDE 103
+E HE WMAQ+GR+YK +EKE RL IFK N+E+IE NK G + YKL N+F+DLTN+E
Sbjct: 1 MERHETWMAQYGRAYKGHVEKERRLNIFKNNVEFIESFNKVGKKPYKLSVNEFADLTNEE 60
Query: 104 FRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAF 163
F+A GYKM S S+++ F+Y+N+S VP+++DWR KGAVTPIK+Q +CGCCWAF
Sbjct: 61 FQASRNGYKM-SAHLSSSSTKPFRYENVSA--VPSTMDWRKKGAVTPIKDQGQCGCCWAF 117
Query: 164 AAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFAYIIQNQGIATEDE 222
+AVAA EGIT++ +G LI LSEQ+L+DC T+G + GC GG + AF +IIQN+G+ TE
Sbjct: 118 SAVAATEGITQLSTGKLISLSEQELVDCDTSGEDQGCNGGLMDDAFDFIIQNKGLTTEAN 177
Query: 223 YPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKE 282
YPYQ G C++ + AAAKI+ YE+VP+ E ALLKAV+ QPVS+AI A + FQ Y
Sbjct: 178 YPYQGADGACNSGK--AAAKITGYEDVPANSEAALLKAVANQPVSVAIDAGGSAFQFYSS 235
Query: 283 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCG 338
G+F G CGT LDH VT VG+G ++DG YWL+KNSWG +WG+ GY+++ RD EGLCG
Sbjct: 236 GVFTGDCGTDLDHGVTAVGYGMSDDGTKYWLVKNSWGTSWGENGYIRMERDIDAQEGLCG 295
Query: 339 IGTRSSYPLA 348
I +SYP A
Sbjct: 296 IAMEASYPTA 305
>gi|297819568|ref|XP_002877667.1| hypothetical protein ARALYDRAFT_348033 [Arabidopsis lyrata subsp.
lyrata]
gi|297323505|gb|EFH53926.1| hypothetical protein ARALYDRAFT_348033 [Arabidopsis lyrata subsp.
lyrata]
Length = 341
Score = 365 bits (936), Expect = 2e-98, Method: Compositional matrix adjust.
Identities = 181/344 (52%), Positives = 241/344 (70%), Gaps = 14/344 (4%)
Query: 15 TTPMFIIITLLVSCASQVVSSRS-THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKE 73
T+ +F ++ +++S + +SR E S +E HE+WM++ R Y D+ EK R +IFK+
Sbjct: 2 TSIIFFLLAIILSSRTSGATSRGGLFEASAIEKHEQWMSRFHRVYSDDSEKTSRFEIFKK 61
Query: 74 NLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHR-STTSS----TFKY 128
NL+++E N N+TY L N+FSDLT++EF+A YTG +P R STT S +F+Y
Sbjct: 62 NLKFVESFNMNTNKTYTLDVNEFSDLTDEEFKARYTGLVVPEGMTRMSTTDSHETVSFRY 121
Query: 129 QNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQL 188
+N+ T S+DWR++GAVT +K+Q++CGCCWAF+AVAAVEG+TKI G L+ LSEQQL
Sbjct: 122 ENVGET--GESMDWREEGAVTSVKHQQQCGCCWAFSAVAAVEGMTKIAKGELVSLSEQQL 179
Query: 189 LDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEE 248
LDCST N+GC GG KAF YI++NQGI ED YPYQ TC + AAA IS YE
Sbjct: 180 LDCSTE-NDGCDGGIMWKAFDYIVENQGITAEDNYPYQGAQQTCES-NHVAAATISGYET 237
Query: 249 VPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDG 308
VP DE+ALLKAVS QPVS+AI EF Y GIFNG CGT L+HAVTIVG+G +E+G
Sbjct: 238 VPQNDEEALLKAVSQQPVSVAIEGSGYEFIHYSGGIFNGECGTHLNHAVTIVGYGVSEEG 297
Query: 309 ANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
YWL+KNSWG +WG+ GYM+I+RD +G+CG+ + + YP+A
Sbjct: 298 IKYWLLKNSWGESWGEDGYMRIMRDVDAPQGMCGLASLAYYPVA 341
>gi|318136892|gb|ADV41672.1| cysteine protease [Nicotiana tabacum]
Length = 349
Score = 364 bits (934), Expect = 4e-98, Method: Compositional matrix adjust.
Identities = 174/338 (51%), Positives = 236/338 (69%), Gaps = 7/338 (2%)
Query: 18 MFIIITLLVSCASQVVSSRS-THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLE 76
+ + L SQV SSR +E S+ H++W+A H + YKD EKEMR KIFKEN+E
Sbjct: 12 LALFFIFLGVWRSQVASSRPINYEASMRARHDQWIAHHDKVYKDLNEKEMRFKIFKENVE 71
Query: 77 YIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDV 136
IE N ++ YKLG N+FSDLTN++FR L+TGYK P S++ ++ ++TD+
Sbjct: 72 RIEAFNAGEDKGYKLGVNKFSDLTNEKFRVLHTGYKRSHPKVMSSSKPKTHFRYANVTDI 131
Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG- 195
P ++DWR KGAVTPIK+QKECGCCWAF+AVAA EG+ ++++G LI LSEQ+L+DC G
Sbjct: 132 PPTMDWRKKGAVTPIKDQKECGCCWAFSAVAATEGLHQLKTGKLIPLSEQELVDCDVEGE 191
Query: 196 NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKP-AAAKISNYEEVPSGDE 254
+ GC GG + AF +I++N+G+ TE YPY+ G C+ + +AAKI+ YE+VP+ E
Sbjct: 192 DEGCSGGLLDTAFDFILKNKGLTTEANYPYKGEDGVCNKKKSALSAAKIAGYEDVPANSE 251
Query: 255 QALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLI 314
+ALL+AV+ QPVS+AI S +FQ Y G+F+G C T L+HAVT VG+G T DG YW+I
Sbjct: 252 KALLQAVANQPVSVAIDGSSFDFQFYSSGVFSGSCSTWLNHAVTAVGYGATTDGTKYWII 311
Query: 315 KNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
KNSWG+ WGD+GYM+I RD EGLCG+ +SYP A
Sbjct: 312 KNSWGSKWGDSGYMRIKRDVHEKEGLCGLAMDASYPTA 349
>gi|37780051|gb|AAP32198.1| cysteine protease 12 [Trifolium repens]
Length = 343
Score = 364 bits (934), Expect = 4e-98, Method: Compositional matrix adjust.
Identities = 178/325 (54%), Positives = 233/325 (71%), Gaps = 15/325 (4%)
Query: 33 VSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRT-YKL 91
V+SR+ + S+ E HE+WM +G+ YK+ E+E RL+IF ENL+YIE +N GN+ YKL
Sbjct: 25 VTSRTLQDDSIFERHEQWMTHYGKVYKNPQEREKRLRIFTENLKYIEASNNAGNKKPYKL 84
Query: 92 GTNQFSDLTNDEFRALYTGYK--MPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVT 149
G NQF+DLTN+EF A +K M S R+TT FKY+N T VP+++DWR KGAVT
Sbjct: 85 GINQFADLTNEEFIASRNKFKGHMCSSIIRTTT---FKYEN---TSVPSTVDWRKKGAVT 138
Query: 150 PIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAF 208
P+KNQ +CGCCWAF+A+AA EGI KI +G L+ LSEQ+L+DC TNG + GC GG + AF
Sbjct: 139 PVKNQGQCGCCWAFSAIAATEGIHKISTGKLVSLSEQELVDCDTNGVDQGCEGGLMDDAF 198
Query: 209 AYIIQNQGIATEDEYPYQAVPGTCSAAQ-KPAAAKISNYEEVPSGDEQALLKAVSMQPVS 267
+IIQN GI+TE YPYQ V GTC A + +AA I+ YE+VP+ +E AL KAV+ QP+S
Sbjct: 199 KFIIQNNGISTEAGYPYQGVDGTCKANEASTSAATITGYEDVPANNENALQKAVANQPIS 258
Query: 268 IAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGY 327
+AI A ++FQ YK G+F G CGT+LDH VT VG+G + DG YWL+KNSWG WG+ GY
Sbjct: 259 VAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGISNDGTKYWLVKNSWGTDWGEEGY 318
Query: 328 MKIVRD----EGLCGIGTRSSYPLA 348
+++ R EGLCGI ++SYP A
Sbjct: 319 IRMQRSIDAAEGLCGIAMQASYPTA 343
>gi|18408828|ref|NP_566920.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|12324451|gb|AAG52191.1|AC012329_18 putative cysteine proteinase; 15366-14136 [Arabidopsis thaliana]
gi|6723404|emb|CAB66413.1| cysteine protease-like protein [Arabidopsis thaliana]
gi|332645009|gb|AEE78530.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 341
Score = 363 bits (933), Expect = 4e-98, Method: Compositional matrix adjust.
Identities = 185/344 (53%), Positives = 239/344 (69%), Gaps = 14/344 (4%)
Query: 15 TTPMFIIITLLVSCASQVVSSRS-THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKE 73
T+ +F ++ +L+S + V+SR E S VE HE+WM++ R Y D+ EK R +IF
Sbjct: 2 TSIVFFLLAILLSSRTSGVTSRGGLFEASAVEKHEQWMSRFNRVYSDDSEKTSRFEIFTN 61
Query: 74 NLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHR-STTSS----TFKY 128
NL+++E N N+TY L N+FSDLT++EF+A YTG +P R STT S +F+Y
Sbjct: 62 NLKFVESINMNTNKTYTLDVNEFSDLTDEEFKARYTGLVVPEGMTRISTTDSHETVSFRY 121
Query: 129 QNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQL 188
+N+ T S+DW +GAVT +K+Q++CGCCWAF+AVAAVEG+TKI +G L+ LSEQQL
Sbjct: 122 ENVGETG--ESMDWIQEGAVTSVKHQQQCGCCWAFSAVAAVEGMTKIANGELVSLSEQQL 179
Query: 189 LDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEE 248
LDCST NNGC GG KAF YI +NQGI TED YPYQ TC + AAA IS YE
Sbjct: 180 LDCSTE-NNGCGGGIMWKAFDYIKENQGITTEDNYPYQGAQQTCES-NHLAAATISGYET 237
Query: 249 VPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDG 308
VP DE+ALLKAVS QPVS+AI EF Y GIFNG CGTQL HAVTIVG+G +E+G
Sbjct: 238 VPQNDEEALLKAVSQQPVSVAIEGSGYEFIHYSGGIFNGECGTQLTHAVTIVGYGVSEEG 297
Query: 309 ANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
YWL+KNSWG +WG+ GYM+I+RD +G+CG+ + + YP+A
Sbjct: 298 IKYWLLKNSWGESWGENGYMRIMRDVDSPQGMCGLASLAYYPVA 341
>gi|37780045|gb|AAP32195.1| cysteine protease 5 [Trifolium repens]
Length = 343
Score = 363 bits (933), Expect = 5e-98, Method: Compositional matrix adjust.
Identities = 178/325 (54%), Positives = 233/325 (71%), Gaps = 15/325 (4%)
Query: 33 VSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGN-RTYKL 91
V+SR+ + S+ E HE+WM +G+ YK+ E+E RL+IF ENL+YIE +N GN + YKL
Sbjct: 25 VTSRTLQDDSIFERHEQWMTHYGKVYKNPQEREKRLRIFTENLKYIEASNNAGNNKPYKL 84
Query: 92 GTNQFSDLTNDEFRALYTGYK--MPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVT 149
G NQF+DLTN+EF A +K M S R+TT FKY+N T VP+++DWR KGAVT
Sbjct: 85 GINQFADLTNEEFIASRNKFKGHMCSSIIRTTT---FKYEN---TSVPSTVDWRKKGAVT 138
Query: 150 PIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAF 208
P+KNQ +CGCCWAF+A+AA EGI KI +G L+ LSEQ+L+DC TNG + GC GG + AF
Sbjct: 139 PVKNQGQCGCCWAFSAIAATEGIHKISTGKLVSLSEQELVDCDTNGVDQGCEGGLMDDAF 198
Query: 209 AYIIQNQGIATEDEYPYQAVPGTCSAAQ-KPAAAKISNYEEVPSGDEQALLKAVSMQPVS 267
+IIQN GI+TE YPYQ V GTC A + +AA I+ YE+VP+ +E AL KAV+ QP+S
Sbjct: 199 KFIIQNNGISTEAGYPYQGVDGTCKANEASTSAATITGYEDVPANNENALQKAVANQPIS 258
Query: 268 IAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGY 327
+AI A ++FQ YK G+F G CGT+LDH VT VG+G + DG YWL+KNSWG WG+ GY
Sbjct: 259 VAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGISNDGTKYWLVKNSWGTDWGEEGY 318
Query: 328 MKIVRD----EGLCGIGTRSSYPLA 348
+++ R EGLCGI ++SYP A
Sbjct: 319 IRMQRSIDAAEGLCGIAMQASYPTA 343
>gi|144905108|dbj|BAF56428.1| cysteine proteinase [Lotus japonicus]
Length = 342
Score = 363 bits (933), Expect = 5e-98, Method: Compositional matrix adjust.
Identities = 178/346 (51%), Positives = 240/346 (69%), Gaps = 18/346 (5%)
Query: 11 FKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKI 70
I++ + ++ L A+ +R+ + S+ E HE+WM Q+G+ Y D EKE+R I
Sbjct: 7 LNISSLALLLVFGFLAFEAN----ARTLEDVSLKERHEQWMTQYGKVYTDSYEKELRSNI 62
Query: 71 FKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRAL--YTGYKMPSPSHRSTTSSTFKY 128
FKEN++ IE N GN+ YKLG NQF+DLTN+EF+A + G+ + ST + TFKY
Sbjct: 63 FKENVQRIEAFNNAGNKPYKLGINQFADLTNEEFKARNRFKGHMCSN----STRTPTFKY 118
Query: 129 QNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQL 188
+++S VP SLDWR KGAVTPIK+Q +CGCCWAF+AVAA EGITK+ +G LI LSEQ+L
Sbjct: 119 EDVS--SVPASLDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGITKLSTGKLISLSEQEL 176
Query: 189 LDCSTNG-NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSA-AQKPAAAKISNY 246
+DC T G + GC GG + AF +I+QN+G+ TE +YPYQ V TC+A A+ AA I +
Sbjct: 177 VDCDTKGVDQGCEGGLMDDAFKFIMQNKGLNTEAKYPYQGVDATCNANAEAKDAASIKGF 236
Query: 247 EEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTE 306
E+VP+ E ALLKAV+ QP+S+AI A +EFQ Y G+F G CGT+LDH VT VG+G ++
Sbjct: 237 EDVPANSESALLKAVANQPISVAIDASGSEFQFYSSGLFTGSCGTELDHGVTAVGYGVSD 296
Query: 307 DGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
DG YWL+KNSWG WG+ GY+++ RD EGLCGI ++SYP A
Sbjct: 297 DGTKYWLVKNSWGEQWGEEGYIRMQRDVAAEEGLCGIAMQASYPTA 342
>gi|357474579|ref|XP_003607574.1| Cysteine protease [Medicago truncatula]
gi|355508629|gb|AES89771.1| Cysteine protease [Medicago truncatula]
Length = 345
Score = 363 bits (932), Expect = 7e-98, Method: Compositional matrix adjust.
Identities = 178/324 (54%), Positives = 232/324 (71%), Gaps = 12/324 (3%)
Query: 33 VSSRSTHEQSVV-EIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGN-RTYK 90
V+SR+ + S++ E HE+WM +G+ YKD E+E RLKIFKEN+ YIE +N GN + YK
Sbjct: 26 VTSRTLQDDSIIYEKHEQWMVHYGKVYKDLQERENRLKIFKENVNYIEASNNAGNNKLYK 85
Query: 91 LGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTP 150
LG NQF+D+TN+EF A +K S T +STFKY+N S VP+++DWR KGAVTP
Sbjct: 86 LGINQFADITNEEFIASRNKFKGHMCS-SITKTSTFKYENAS---VPSTVDWRKKGAVTP 141
Query: 151 IKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFA 209
+KNQ +CGCCWAF+AVAA EGI K+ +G L+ LSEQ+L+DC T G + GC GG + AF
Sbjct: 142 VKNQGQCGCCWAFSAVAATEGIHKLSTGKLVSLSEQELVDCDTKGVDQGCEGGLMDDAFK 201
Query: 210 YIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSI 268
+IIQN G+ TE +YPYQ V GTCSA + AA I+ YE+VP+ +E AL KAV+ QP+S+
Sbjct: 202 FIIQNHGLHTEAQYPYQGVDGTCSANETSTPAATIAGYEDVPANNENALQKAVANQPISV 261
Query: 269 AIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYM 328
AI A ++FQ YK G+F G CGTQLDH VT VG+G + DG YWL+KNSWGN WG+ GY+
Sbjct: 262 AIDASGSDFQFYKSGVFTGSCGTQLDHGVTAVGYGISNDGTKYWLVKNSWGNDWGEEGYI 321
Query: 329 KIVRD----EGLCGIGTRSSYPLA 348
++ R +GLCGI +SYP A
Sbjct: 322 RMQRSVDAAQGLCGIAMMASYPTA 345
>gi|144905116|dbj|BAF56430.1| cysteine proteinase [Lotus japonicus]
Length = 341
Score = 362 bits (930), Expect = 1e-97, Method: Compositional matrix adjust.
Identities = 180/344 (52%), Positives = 243/344 (70%), Gaps = 17/344 (4%)
Query: 13 INTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFK 72
+N T + ++ L+ S ++R+ + S+ E HE+WMAQ+G+ YKD EKE+R KIFK
Sbjct: 7 LNITSLTLL--LVFGFLSFEANARTLEDASMHERHEQWMAQYGKVYKDSYEKELRSKIFK 64
Query: 73 ENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRAL--YTGYKMPSPSHRSTTSSTFKYQN 130
EN++ IE N GN++YKLG NQF+DLTN+EF+A + G+ + ST + TFKY++
Sbjct: 65 ENVQRIEAFNNAGNKSYKLGINQFADLTNEEFKARNRFKGHMCSN----STRTPTFKYEH 120
Query: 131 LSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLD 190
+ T VP SLDWR KGAVTPIK+Q +CGCCWAF+AVAA EGITK+ +G LI LSEQ+L+D
Sbjct: 121 V--TSVPASLDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGITKLSTGKLISLSEQELVD 178
Query: 191 CSTNG-NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSA-AQKPAAAKISNYEE 248
C T G + GC GG + AF +I+QN+G+ TE +YPYQ V TC+A A+ AA I +E+
Sbjct: 179 CDTKGVDQGCEGGLMDDAFKFIMQNKGLNTEAKYPYQGVDATCNANAEAKDAASIKGFED 238
Query: 249 VPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDG 308
VP+ E ALLKAV+ QP+S+AI A +EFQ Y G+F G CGT+LDH VT VG+G ++ G
Sbjct: 239 VPANSESALLKAVANQPISVAIDASGSEFQFYSSGVFTGSCGTELDHGVTAVGYG-SDGG 297
Query: 309 ANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
YWL+KNSWG WG+ GY+++ RD EGLCG ++SYP A
Sbjct: 298 TKYWLVKNSWGEQWGEQGYIRMQRDVAAEEGLCGFAMQASYPTA 341
>gi|50355621|dbj|BAD29959.1| cysteine protease [Daucus carota]
Length = 361
Score = 362 bits (929), Expect = 1e-97, Method: Compositional matrix adjust.
Identities = 175/337 (51%), Positives = 238/337 (70%), Gaps = 10/337 (2%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
M + LL + A Q +SR+ E S+ E HE+WM Q+GR YKDE EK +R +IF +N+++
Sbjct: 29 MIAALILLGAWACQA-TSRTLPEASMFERHEQWMIQYGRVYKDEAEKSVRFQIFMDNVKF 87
Query: 78 IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
IE+ NK+G ++YKL N+F+D TN+EF+A GYKM + S R + ++ F+Y+N+ T VP
Sbjct: 88 IEEFNKDGRQSYKLAVNEFADQTNEEFQASRNGYKM-AVSSRPSQTTLFRYENV--TAVP 144
Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-N 196
+S+DWR KGAVTP+K+Q +CG CWAF+ +AA EGITK+++G LI LSEQ+L+DC G +
Sbjct: 145 SSMDWRKKGAVTPVKDQGQCGSCWAFSTIAATEGITKLKTGKLISLSEQELVDCDKTGED 204
Query: 197 NGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQ 255
GC GG E F +I++N+GIA E YPY A GTC++ ++ + AAKIS YE+VP+ E
Sbjct: 205 QGCEGGYMEDGFEFIVKNKGIALEASYPYTAADGTCNSKEEASRAAKISGYEKVPANSET 264
Query: 256 ALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIK 315
ALLKAV+ QPVS++I A FQ Y G+F G CGT LDH VT VG+G T DG YWL+K
Sbjct: 265 ALLKAVANQPVSVSIDASGVAFQFYSSGVFTGECGTDLDHGVTAVGYGKTSDGTKYWLVK 324
Query: 316 NSWGNTWGDAGYMKIVR----DEGLCGIGTRSSYPLA 348
NSWG +WGD+GY+ + R GLCGI +SYP A
Sbjct: 325 NSWGASWGDSGYIMMQRGVAAKGGLCGIAMDASYPTA 361
>gi|357474573|ref|XP_003607571.1| Cysteine proteinase EP-B [Medicago truncatula]
gi|34329348|gb|AAQ63885.1| putative cysteine proteinase [Medicago truncatula]
gi|355508626|gb|AES89768.1| Cysteine proteinase EP-B [Medicago truncatula]
Length = 345
Score = 362 bits (929), Expect = 2e-97, Method: Compositional matrix adjust.
Identities = 179/324 (55%), Positives = 230/324 (70%), Gaps = 12/324 (3%)
Query: 33 VSSRSTHEQSVV-EIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGN-RTYK 90
V+SR+ + S++ E HE+WM +G+ YKD E+E RLKIFKEN+ YIE +N GN + YK
Sbjct: 26 VTSRTLQDDSIIYEKHEQWMVHYGKVYKDLQERENRLKIFKENVNYIEASNNAGNNKLYK 85
Query: 91 LGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTP 150
LG NQF+DLTN+EF A +K S T +STFKY+N S VP+++DWR KGAVTP
Sbjct: 86 LGINQFADLTNEEFIASRNKFKGHMCS-SITKTSTFKYENAS---VPSTVDWRKKGAVTP 141
Query: 151 IKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFA 209
+KNQ +CGCCWAF+AVAA EGI K+ +G L+ LSEQ+L+DC T G + GC GG + AF
Sbjct: 142 VKNQGQCGCCWAFSAVAATEGIHKLSTGKLVSLSEQELVDCDTKGVDQGCEGGLMDDAFK 201
Query: 210 YIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSI 268
+IIQN G+ TE +YPYQ V GTCSA + A I+ YE+VP+ +EQAL KAV+ QP+S+
Sbjct: 202 FIIQNHGLNTEAQYPYQGVDGTCSANKASIHAVTITGYEDVPANNEQALQKAVANQPISV 261
Query: 269 AIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYM 328
AI A ++FQ YK G+F G CGT+LDH VT VG+G DG YWL+KNSWG WG+ GY+
Sbjct: 262 AIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVGNDGTKYWLVKNSWGTDWGEEGYI 321
Query: 329 KIVRD----EGLCGIGTRSSYPLA 348
K+ R EGLCGI +SYP A
Sbjct: 322 KMQRGVDAAEGLCGIAMEASYPTA 345
>gi|37780047|gb|AAP32196.1| cysteine protease 8 [Trifolium repens]
Length = 343
Score = 360 bits (925), Expect = 4e-97, Method: Compositional matrix adjust.
Identities = 172/324 (53%), Positives = 231/324 (71%), Gaps = 13/324 (4%)
Query: 33 VSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLG 92
V+SR+ + S+ E H +WM+Q+G+ YKD E+E R KIF EN+ Y+E +N + ++YKLG
Sbjct: 25 VTSRTLQDDSMYERHGQWMSQYGKIYKDHQERETRFKIFTENVNYVEASNADDTKSYKLG 84
Query: 93 TNQFSDLTNDEFRALYTGYK--MPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTP 150
NQF+DLTN+EF A +K M S R+TT FKY+N+S +P+++DWR KGAVTP
Sbjct: 85 INQFADLTNEEFVASRNKFKGHMCSSITRTTT---FKYENVSA--IPSTVDWRKKGAVTP 139
Query: 151 IKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFA 209
+KNQ +CGCCWAF+AVAA EGI K+ +G LI LSEQ+L+DC T G + GC GG + AF
Sbjct: 140 VKNQGQCGCCWAFSAVAATEGIHKLSTGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFK 199
Query: 210 YIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSI 268
+IIQN G++TE +YPY+ V GTC+A + A I+ YE+VP+ EQAL KAV+ QP+S+
Sbjct: 200 FIIQNHGLSTEAQYPYEGVDGTCNANKASVQAVTITGYEDVPANSEQALQKAVANQPISV 259
Query: 269 AIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYM 328
AI A ++FQ YK G+F G CGT+LDH VT VG+G + DG YWL+KNSWG WG+ GY+
Sbjct: 260 AIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVSNDGTKYWLVKNSWGTDWGEEGYI 319
Query: 329 KIVRD----EGLCGIGTRSSYPLA 348
+ R EGLCGI ++SYP A
Sbjct: 320 MMQRGVEAAEGLCGIAMQASYPTA 343
>gi|357474527|ref|XP_003607548.1| Cysteine protease [Medicago truncatula]
gi|358347211|ref|XP_003637653.1| Cysteine protease [Medicago truncatula]
gi|355503588|gb|AES84791.1| Cysteine protease [Medicago truncatula]
gi|355508603|gb|AES89745.1| Cysteine protease [Medicago truncatula]
Length = 345
Score = 360 bits (924), Expect = 5e-97, Method: Compositional matrix adjust.
Identities = 179/327 (54%), Positives = 228/327 (69%), Gaps = 11/327 (3%)
Query: 29 ASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGN-R 87
A QV S + ++ E HE+WM +G+ YKD E+E RLKIFKEN+ YIE +N GN +
Sbjct: 23 AIQVTSRTLQDDSNIYEKHEQWMVHYGKVYKDLQERENRLKIFKENVNYIEASNNAGNNK 82
Query: 88 TYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGA 147
YKLG NQF+DLTN+EF A +K S T +STFKY+N S VP+++DWR KGA
Sbjct: 83 LYKLGINQFADLTNEEFIASRNKFKGHMCS-SITKTSTFKYENAS---VPSTVDWRKKGA 138
Query: 148 VTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREK 206
VTP+KNQ +CGCCWAF+AVAA EGI K+ +G L+ LSEQ+L+DC T G + GC GG +
Sbjct: 139 VTPVKNQGQCGCCWAFSAVAATEGIHKLSTGKLVSLSEQELVDCDTKGVDQGCEGGLMDD 198
Query: 207 AFAYIIQNQGIATEDEYPYQAVPGTCSAAQKP-AAAKISNYEEVPSGDEQALLKAVSMQP 265
AF +IIQN G+ TE +YPYQ V GTCSA + A I+ YE+VP+ +EQAL KAV+ QP
Sbjct: 199 AFKFIIQNHGLNTEAQYPYQGVDGTCSANKASIHAVTITGYEDVPANNEQALQKAVANQP 258
Query: 266 VSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDA 325
+S+AI A ++FQ YK G+F G CGT+LDH VT VG+G DG YWL+KNSWG WG+
Sbjct: 259 ISVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVGNDGTKYWLVKNSWGTDWGEE 318
Query: 326 GYMKIVRD----EGLCGIGTRSSYPLA 348
GY+K+ R EGLCGI +SYP A
Sbjct: 319 GYIKMQRGVDAAEGLCGIAMEASYPTA 345
>gi|225446589|ref|XP_002280263.1| PREDICTED: vignain [Vitis vinifera]
Length = 339
Score = 360 bits (923), Expect = 7e-97, Method: Compositional matrix adjust.
Identities = 178/334 (53%), Positives = 240/334 (71%), Gaps = 14/334 (4%)
Query: 21 IITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEK 80
++ +L + ASQ +SRS HE S+ E HE WMA++GR YKD EKE R KIFK+N+ IE
Sbjct: 14 LLFILAAWASQA-TSRSLHEASMYERHEDWMARYGRMYKDANEKEKRFKIFKDNVARIES 72
Query: 81 ANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSL 140
NK ++TYKL N+F+DLTN+EFR+L +K +H + ++TFKY+N+ T VP+++
Sbjct: 73 FNKAMDKTYKLSINEFADLTNEEFRSLRNRFK----AHICSEATTFKYENV--TAVPSTI 126
Query: 141 DWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGC 199
DWR KGAVTPIK+Q++CGCCWAF+AVAA EGIT+I +G LI LSEQ+L+DC T G N GC
Sbjct: 127 DWRKKGAVTPIKDQQQCGCCWAFSAVAATEGITQITTGKLISLSEQELVDCDTGGENQGC 186
Query: 200 LGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKP-AAAKISNYEEVPSGDEQALL 258
GG + AF + I+ G+A+E YPY+ GTC++ ++ AAKI YE+VP+ +E+AL
Sbjct: 187 SGGLMDDAFRF-IKIHGLASEATYPYEGDDGTCNSKKEAHPAAKIKGYEDVPANNEKALQ 245
Query: 259 KAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSW 318
KAV+ QPV++AI A EFQ Y G+F G CGT+LDH V VG+G +DG YWL+KNSW
Sbjct: 246 KAVAHQPVAVAIDAGGFEFQFYTSGVFTGQCGTELDHGVAAVGYGIGDDGMMYWLVKNSW 305
Query: 319 GNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
G WG+ GY+++ RD EGLCGI ++SYP A
Sbjct: 306 GTGWGEEGYIRMQRDVTAKEGLCGIAMQASYPTA 339
>gi|224076972|ref|XP_002305074.1| predicted protein [Populus trichocarpa]
gi|224106329|ref|XP_002333698.1| predicted protein [Populus trichocarpa]
gi|222837984|gb|EEE76349.1| predicted protein [Populus trichocarpa]
gi|222848038|gb|EEE85585.1| predicted protein [Populus trichocarpa]
Length = 307
Score = 359 bits (922), Expect = 9e-97, Method: Compositional matrix adjust.
Identities = 170/312 (54%), Positives = 224/312 (71%), Gaps = 11/312 (3%)
Query: 43 VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTND 102
+++ HE+WMAQHGR Y D EKE R IFKEN+E IE N +R YKLG N+F+DLTN+
Sbjct: 1 MLKRHEEWMAQHGRVYGDMKEKEKRYLIFKENIERIEAFNNGSDRGYKLGVNKFADLTNE 60
Query: 103 EFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWA 162
EFRA++ GYK S SS+F+++NLS +PTS+DWR GAVTP+K+Q CGCCWA
Sbjct: 61 EFRAMHHGYKRQSSK---LMSSSFRHENLSA--IPTSMDWRKAGAVTPVKDQGTCGCCWA 115
Query: 163 FAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFAYIIQNQGIATED 221
F+AVAA+EGI K+++G LI LSEQQL+DC G + GC GG + AF +I++N G+ +E
Sbjct: 116 FSAVAAIEGIIKLKTGKLISLSEQQLVDCDVKGVDQGCGGGLMDNAFQFILRNGGLTSEA 175
Query: 222 EYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSY 280
YPYQ V GTC + + + AKI+ YE+VP +E ALL+AV+ QPVS+A+ +FQ Y
Sbjct: 176 TYPYQGVDGTCKSKKTASIEAKITGYEDVPVNNENALLQAVAKQPVSVAVEGGGYDFQFY 235
Query: 281 KEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGL 336
K G+F G CGT LDHAVT +G+GT DG NYWL+KNSWG +WG++GYM++ R EGL
Sbjct: 236 KSGVFKGDCGTYLDHAVTAIGYGTNSDGTNYWLVKNSWGTSWGESGYMRMQRGIGAREGL 295
Query: 337 CGIGTRSSYPLA 348
CG+ +SYP A
Sbjct: 296 CGVAMDASYPTA 307
>gi|50355613|dbj|BAD29955.1| cysteine protease [Daucus carota]
Length = 365
Score = 359 bits (922), Expect = 1e-96, Method: Compositional matrix adjust.
Identities = 175/346 (50%), Positives = 243/346 (70%), Gaps = 15/346 (4%)
Query: 10 SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+ K TP+ ++ T+ V + + ++RS +E S+ E H++WMA++GR YK EK R
Sbjct: 4 TIKHQCTPLALLFTIGV--LASLAAARSLNEASMTETHDQWMARYGRVYKTANEKNRRST 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHR-STTSSTFKY 128
IF+ENL+YI+ NK N+ YKLG N+F+DLTN+EF +K SH +T ++ F+Y
Sbjct: 62 IFQENLKYIQTFNKANNKPYKLGVNEFADLTNEEFTTSRNKFK----SHVCATVTNVFRY 117
Query: 129 QNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQL 188
+N+ T VP ++DWR KGAVTPIKNQ +CGCCWAF+AVAA+EGIT++++G LI LSEQ+L
Sbjct: 118 ENV--TAVPATMDWRKKGAVTPIKNQGQCGCCWAFSAVAAMEGITQLKTGKLISLSEQEL 175
Query: 189 LDCSTNG-NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKP-AAAKISNY 246
+DC TNG + GC GG + AF +I QN G++TE YPY GTC+A ++ AA I+ +
Sbjct: 176 VDCDTNGEDQGCEGGLMDYAFDFIQQNHGLSTETNYPYSGTDGTCNANKEANHAATITGH 235
Query: 247 EEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTE 306
E+VP+ E ALLKAV+ QP+S+AI A ++FQ Y G+F G CGT+LDH VT VG+GT
Sbjct: 236 EDVPANSESALLKAVANQPISVAIDASGSDFQFYSSGVFTGECGTELDHGVTAVGYGTAA 295
Query: 307 DGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
DG YWL+KNSWG +WG+ GY+++ R EGLCGI ++SYP A
Sbjct: 296 DGTKYWLVKNSWGTSWGEEGYIQMQRGVAAAEGLCGIAMQASYPTA 341
>gi|312281697|dbj|BAJ33714.1| unnamed protein product [Thellungiella halophila]
Length = 347
Score = 359 bits (921), Expect = 1e-96, Method: Compositional matrix adjust.
Identities = 174/339 (51%), Positives = 230/339 (67%), Gaps = 11/339 (3%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVV--EIHEKWMAQHGRSYKDELEKEMRLKIFKENL 75
+F+I++L+ S + SR + ++ + H++WMA+HGR Y D EK R +FK N+
Sbjct: 8 IFLIVSLISSFCLSITLSRPLDDNELIMQKRHDEWMAKHGRVYADMKEKNNRYVVFKRNV 67
Query: 76 EYIEKANK-EGNRTYKLGTNQFSDLTNDEFRALYTGYKMPS--PSHRSTTSSTFKYQNLS 132
E IE+ N RT+KL NQF+DLTNDEFR++YTGYK S S T +S+F+YQN+S
Sbjct: 68 ERIERLNNVPAGRTFKLAVNQFADLTNDEFRSMYTGYKGGSVLSSQSGTKTSSFRYQNVS 127
Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
+P S+DWR KGAVTPIKNQ CGCCWAF+AVAA+EG TKI+ G LI LSEQQL+DC
Sbjct: 128 SGALPVSVDWRKKGAVTPIKNQGTCGCCWAFSAVAAIEGATKIKKGKLISLSEQQLVDCD 187
Query: 193 TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQ-KPAAAKISNYEEVPS 251
TN + GC GG + AF +I+ G+ TE YPY+ TC KP A I+ YE+VP
Sbjct: 188 TN-DFGCSGGLMDTAFEHIMATGGLTTESNYPYKGKDATCKIKNTKPTATSITGYEDVPV 246
Query: 252 GDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANY 311
DE+AL+KAV+ QPVSI I +FQ Y G+F G C T LDHAVT VG+G + +G+ Y
Sbjct: 247 NDEKALMKAVAHQPVSIGIEGGGFDFQFYGSGVFTGECTTYLDHAVTAVGYGQSSNGSKY 306
Query: 312 WLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
W+IKNSWG WG++GYM+I +D +GLCG+ ++SYP
Sbjct: 307 WIIKNSWGTKWGESGYMRIKKDVKDKKGLCGLAMKASYP 345
>gi|47524507|gb|AAT34987.1| putative cysteine protease [Gossypium hirsutum]
Length = 344
Score = 357 bits (917), Expect = 3e-96, Method: Compositional matrix adjust.
Identities = 177/343 (51%), Positives = 245/343 (71%), Gaps = 16/343 (4%)
Query: 18 MFIIITLLVS-CASQVVS--SRSTHEQSVVEIHEKWMAQHGRSYKDELE--KEMRLKIFK 72
+F+ + L++S C S ++ SR ++ + HE+WM+QHGR Y DE E K R +FK
Sbjct: 6 IFLFVALVLSFCFSIQLAGLSRPLLDEDSMR-HEEWMSQHGRVYADEQEDHKNKRFNVFK 64
Query: 73 ENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSP-SHRSTTSSTFKYQNL 131
EN+E IE+ N +T+KL NQF+DLTN+EFRA Y G+K P S + T + F+Y+N+
Sbjct: 65 ENVERIEEFND--GKTFKLAINQFADLTNEEFRASYNGFKGPMVLSSQITKPTPFRYENV 122
Query: 132 SMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDC 191
S + +P S+DWR KGAVTP+KNQ +CGCCWAF+AVAA+EGIT+I +G LI LSEQ+L+DC
Sbjct: 123 S-SALPVSVDWRKKGAVTPVKNQGQCGCCWAFSAVAAIEGITQISTGKLISLSEQELVDC 181
Query: 192 STNG-NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQ-KPAAAKISNYEEV 249
T G ++GC GG + AF +II N G+ TE YPY+ GTC+ + P A I+ YE+V
Sbjct: 182 DTKGIDHGCEGGLMDTAFEFIINNGGLTTESNYPYKGEDGTCNFNKTNPIAVSITGYEDV 241
Query: 250 PSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGA 309
P+ DEQAL+KAV+ QPVS+AI A ++FQ Y G+F G CGT+LDHAVT VG+G +EDG+
Sbjct: 242 PANDEQALMKAVAHQPVSVAIEAGGSDFQFYSSGVFTGECGTELDHAVTAVGYGESEDGS 301
Query: 310 NYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
YW++KNSWG WG++GY+++ +D +GLCGI ++SYP A
Sbjct: 302 KYWIVKNSWGTKWGESGYIEMQKDIKVKQGLCGIAMQASYPTA 344
>gi|357477459|ref|XP_003609015.1| Cysteine proteinase [Medicago truncatula]
gi|355510070|gb|AES91212.1| Cysteine proteinase [Medicago truncatula]
Length = 345
Score = 357 bits (916), Expect = 5e-96, Method: Compositional matrix adjust.
Identities = 179/337 (53%), Positives = 235/337 (69%), Gaps = 11/337 (3%)
Query: 19 FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
I L CA QV +SRS S+ E HE+WM+Q+ + YKD E+E R KIF N+ YI
Sbjct: 13 LTFIFCLGLCAIQV-TSRSLQVDSMYERHEQWMSQYSKVYKDPQEREERHKIFTANVNYI 71
Query: 79 EKANKEGN-RTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
E N + N + YKLG NQF+DLTN+EF A +K S + T+ TFKY+N+S +P
Sbjct: 72 EVFNNDANNKLYKLGINQFADLTNEEFIASRNKFKGHMCSSIAKTT-TFKYENVSA--IP 128
Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-N 196
+++DWR KGAVTP+KNQ +CGCCWAF+AVAA EGITK+ +G L+ LSEQ+L+DC T G +
Sbjct: 129 STVDWRKKGAVTPVKNQGQCGCCWAFSAVAATEGITKLSTGKLVSLSEQELVDCDTKGVD 188
Query: 197 NGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQ 255
GC GG + AF +IIQN G++TE YPYQ V GTC+A + AA I+ YE+VP+ +EQ
Sbjct: 189 QGCEGGLMDDAFKFIIQNHGLSTEAAYPYQGVDGTCNANKASIHAATITGYEDVPANNEQ 248
Query: 256 ALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIK 315
AL KAV+ QP+S+AI A ++FQ YK G+F+G CGT+LDH VT VG+G DG YWL+K
Sbjct: 249 ALQKAVANQPISVAIDASGSDFQFYKSGVFSGSCGTELDHGVTAVGYGVGNDGTKYWLVK 308
Query: 316 NSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
NSWG WG+ GY+++ R EGLCGI ++SYP A
Sbjct: 309 NSWGTDWGEEGYIRMQRGVDAAEGLCGIAMQASYPTA 345
>gi|10336513|dbj|BAB13759.1| cysteine proteinase [Astragalus sinicus]
Length = 343
Score = 357 bits (915), Expect = 5e-96, Method: Compositional matrix adjust.
Identities = 177/345 (51%), Positives = 242/345 (70%), Gaps = 15/345 (4%)
Query: 14 NTTPMFIIITLLVSCA--SQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIF 71
N +I + LL+ + V+SR+ + S+ E H++WM Q+ + Y D E E R +IF
Sbjct: 4 NKQLYYISLALLMCLGLWAVQVTSRTLQDASMYERHQQWMGQYAKIYNDHQEWEKRFQIF 63
Query: 72 KENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYK--MPSPSHRSTTSSTFKYQ 129
KEN+ YIE +NKEG R YKLG NQF DLTN+EF A +K M S R+ +T+KY+
Sbjct: 64 KENVNYIETSNKEGGRFYKLGVNQFVDLTNEEFIAPRNRFKGHMCSSIIRT---NTYKYE 120
Query: 130 NLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLL 189
N+ T VP+++DWR KGAVTP+K+Q +CGCCWAF+AVAA EGI ++ +G LI LSEQ+L+
Sbjct: 121 NV--TTVPSNVDWRQKGAVTPVKDQGQCGCCWAFSAVAATEGIHQLSTGKLISLSEQELV 178
Query: 190 DCSTNG-NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYE 247
DC T G + GC GG + AF +IIQN G+ TE +YPYQ V GTC+A + AA I++YE
Sbjct: 179 DCDTKGVDQGCEGGLMDDAFKFIIQNHGLDTEAKYPYQGVDGTCNANEASINAATITSYE 238
Query: 248 EVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTED 307
+VP+ +EQAL KAV+ QP+S+AI A ++FQ Y G+F G CGT+LDH VT VG+G ++D
Sbjct: 239 DVPTNNEQALQKAVANQPISVAIDASGSDFQFYTSGVFTGSCGTELDHGVTAVGYGVSDD 298
Query: 308 GANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
G YWL+KNSWG +WG+ GY+++ R EGLCGI ++SYP+A
Sbjct: 299 GTKYWLVKNSWGTSWGEEGYIRMQRGVDAVEGLCGIAMQASYPIA 343
>gi|84181681|gb|AAW78661.2| senescence-specific cysteine protease [Nicotiana tabacum]
Length = 349
Score = 356 bits (913), Expect = 9e-96, Method: Compositional matrix adjust.
Identities = 170/341 (49%), Positives = 238/341 (69%), Gaps = 11/341 (3%)
Query: 19 FIIITLLVSC----ASQVVSSRS-THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKE 73
++ + L C +SQV SR +E ++ H++W+ H + YKD EKE+R +IFKE
Sbjct: 9 YLCLALFFICLGLWSSQVALSRPINYEATMRARHDQWIVHHEKVYKDLNEKEVRFQIFKE 68
Query: 74 NLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
N+E IE N ++ YKLG N+FSDLTN+EFR L+TGYK P +++ ++ ++
Sbjct: 69 NVERIEAFNAGEDKGYKLGFNKFSDLTNEEFRVLHTGYKRSHPKVMTSSKGKTHFRYTNV 128
Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST 193
TD+P ++DWR KGAVTPIK+QKECGCCWAF+AVAA+EG+ ++++G LI LSEQ+L+DC
Sbjct: 129 TDIPPTMDWRKKGAVTPIKDQKECGCCWAFSAVAAMEGLHQLKTGELIPLSEQELVDCDV 188
Query: 194 NG-NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKP-AAAKISNYEEVPS 251
G + GC GG + AF +I++N+G+ TE YPY+ G C+ + +AAKI+ YE+VP+
Sbjct: 189 EGEDEGCSGGLLDTAFDFILKNKGLTTEVNYPYKGEDGVCNKKKSALSAAKITGYEDVPA 248
Query: 252 GDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANY 311
E+ALL+AV+ QPVS+AI S +FQ Y G+F+G C T L+HAVT VG+G T DG Y
Sbjct: 249 NSEKALLQAVANQPVSVAIDGSSFDFQFYSSGVFSGSCSTWLNHAVTAVGYGATTDGTKY 308
Query: 312 WLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
W+IKNSWG+ WGD+GYM+I RD EGLCG+ +SYP A
Sbjct: 309 WIIKNSWGSKWGDSGYMRIKRDVHEKEGLCGLAMDASYPTA 349
>gi|357483847|ref|XP_003612210.1| Cysteine proteinase [Medicago truncatula]
gi|355513545|gb|AES95168.1| Cysteine proteinase [Medicago truncatula]
Length = 344
Score = 355 bits (912), Expect = 2e-95, Method: Compositional matrix adjust.
Identities = 174/325 (53%), Positives = 230/325 (70%), Gaps = 14/325 (4%)
Query: 33 VSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANK-EGNRTYKL 91
V+SR+ + S+ E HE+WM +G+ YKD E+E R KIF EN++YIE N + N +YKL
Sbjct: 25 VTSRTLQDGSMHERHERWMNHYGKVYKDHQEREKRFKIFTENMKYIEAFNNGDNNESYKL 84
Query: 92 GTNQFSDLTNDEFRALYTGYK--MPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVT 149
G NQF+DLTN+EF A +K M S R+TT FKY+N+S +P+++DWR KGAVT
Sbjct: 85 GINQFADLTNEEFVASRNKFKGHMCSSIIRTTT---FKYENVSA--IPSTVDWRKKGAVT 139
Query: 150 PIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAF 208
P+KNQ +CGCCWAF+AVAA EGI K+ +G L+ LSEQ+L+DC T G + GC GG + AF
Sbjct: 140 PVKNQGQCGCCWAFSAVAATEGIHKLSTGKLVSLSEQELVDCDTKGVDQGCEGGLMDDAF 199
Query: 209 AYIIQNQGIATEDEYPYQAVPGTCSAAQKP-AAAKISNYEEVPSGDEQALLKAVSMQPVS 267
+IIQN G+ TE +YPYQ V GTC+A + A I+ YE+VP+ +EQAL KAV+ QP+S
Sbjct: 200 KFIIQNHGLNTEAQYPYQGVDGTCNANKASIQATTITGYEDVPANNEQALQKAVANQPIS 259
Query: 268 IAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGY 327
+AI A ++FQ YK G+F G CGT+LDH VT VG+G + DG YWL+KNSWG WG+ GY
Sbjct: 260 VAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVSNDGTKYWLVKNSWGTDWGEEGY 319
Query: 328 MKIVRD----EGLCGIGTRSSYPLA 348
+ + R EGLCGI ++SYP A
Sbjct: 320 IMMQRGVEAAEGLCGIAMQASYPTA 344
>gi|225446583|ref|XP_002280204.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1 [Vitis vinifera]
Length = 341
Score = 355 bits (911), Expect = 2e-95, Method: Compositional matrix adjust.
Identities = 176/336 (52%), Positives = 241/336 (71%), Gaps = 16/336 (4%)
Query: 21 IITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEK 80
++ +L + ASQ ++R+ HE S+ E HE WM Q+GR YKD EK R KIFK+N+ IE
Sbjct: 14 LLFVLAAWASQA-TARNLHEASMYERHEDWMVQYGREYKDADEKSKRYKIFKDNVARIES 72
Query: 81 ANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHR-STTSSTFKYQNLSMTDVPTS 139
NK +++YKL N+F+DLTN+EFRA +K +H ST +++FKY+N+ T VP++
Sbjct: 73 FNKAMDKSYKLSINEFADLTNEEFRASRNRFK----AHICSTEATSFKYENV--TAVPST 126
Query: 140 LDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNG 198
+DWR KGAVTPIK+Q +CG CWAF+AVAA+EGIT++ +G LI LSEQ+L+DC T+G + G
Sbjct: 127 VDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGEDQG 186
Query: 199 CLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCS--AAQKPAAAKISNYEEVPSGDEQA 256
C GG + AF +I QN G+ TE YPY GTC+ A P AAKI+ YE+VP+ +E+A
Sbjct: 187 CSGGLMDDAFKFIEQNHGLTTEANYPYAGTDGTCNRKKAAHP-AAKINGYEDVPANNEKA 245
Query: 257 LLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKN 316
L KAV+ QP+++AI A +EFQ Y G+F G CGT+LDH V+ VG+GT++DG YWL+KN
Sbjct: 246 LQKAVAHQPIAVAIDAGGSEFQFYSSGVFTGQCGTELDHGVSAVGYGTSDDGMKYWLVKN 305
Query: 317 SWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
SWG WG+ GY+++ RD EGLCGI ++SYP A
Sbjct: 306 SWGTGWGEEGYIRMQRDVTAKEGLCGIAMQASYPTA 341
>gi|224135841|ref|XP_002327317.1| predicted protein [Populus trichocarpa]
gi|222835687|gb|EEE74122.1| predicted protein [Populus trichocarpa]
Length = 342
Score = 355 bits (911), Expect = 2e-95, Method: Compositional matrix adjust.
Identities = 170/336 (50%), Positives = 227/336 (67%), Gaps = 9/336 (2%)
Query: 19 FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
F L++ + V+SR E S+ HE+WM G+ Y D EKE R +IFK+N+EYI
Sbjct: 10 FFAFILILGMWAYEVASRELQEPSMSARHEQWMETFGKVYADAAEKERRFEIFKDNVEYI 69
Query: 79 EKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPT 138
E N GN+ YKL N+F+DLTN+E + GY+ P + R ++FKY+N+ T VP
Sbjct: 70 ESFNTAGNKPYKLSVNKFADLTNEELKVARNGYRRPLQT-RPMKVTSFKYENV--TAVPA 126
Query: 139 SLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NN 197
++DWR KGAVTPIK+Q +CG CWAF+ VAA EGI ++ +G L+ LSEQ+L+DC T G +
Sbjct: 127 TMDWRKKGAVTPIKDQGQCGSCWAFSTVAATEGINQLTTGKLVSLSEQELVDCDTQGEDQ 186
Query: 198 GCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAA-AKISNYEEVPSGDEQA 256
GC GG E F +II+N GI TE YPYQA GTC++ ++ + AKI+ YE VP+ E A
Sbjct: 187 GCEGGLMEDGFEFIIKNHGITTEANYPYQAADGTCNSKKEASRIAKITGYESVPANSEAA 246
Query: 257 LLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKN 316
LLKAV+ QP+S++I A ++FQ Y G+F G CGT+LDH VT VG+G T DG YWL+KN
Sbjct: 247 LLKAVASQPISVSIDAGGSDFQFYSSGVFTGQCGTELDHGVTAVGYGETSDGTKYWLVKN 306
Query: 317 SWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
SWG +WG+ GY+++ RD EGLCGI SSYP A
Sbjct: 307 SWGTSWGEEGYIRMQRDTEAEEGLCGIAMDSSYPTA 342
>gi|118627554|emb|CAL64936.1| putative cysteine protease 8 [Trifolium pratense]
Length = 344
Score = 355 bits (911), Expect = 2e-95, Method: Compositional matrix adjust.
Identities = 172/325 (52%), Positives = 231/325 (71%), Gaps = 14/325 (4%)
Query: 33 VSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANK-EGNRTYKL 91
V+SR+ + S+ E H +WM+Q+G+ YKD E+E R KIFKEN+ YIE N + ++YKL
Sbjct: 25 VTSRTLQDDSMYERHGQWMSQYGKIYKDHQERETRFKIFKENVNYIETFNNADDTKSYKL 84
Query: 92 GTNQFSDLTNDEFRALYTGYK--MPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVT 149
G NQF+DLTN+EF A +K M S R+T+ FKY+N+S +P+++DWR KGAVT
Sbjct: 85 GINQFADLTNEEFIASRNKFKGHMCSSIMRTTS---FKYENVS--GIPSTVDWRKKGAVT 139
Query: 150 PIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAF 208
P+KNQ +CGCCWAF+AVAA EGI K+ +G LI LSEQ+L+DC T G + GC GG + AF
Sbjct: 140 PVKNQGQCGCCWAFSAVAATEGIHKLSTGKLISLSEQELVDCDTKGVDQGCEGGLMDDAF 199
Query: 209 AYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVS 267
+IIQN G++TE +YPY+ V GTC+A + A I+ YE+VP+ EQAL KAV+ QP+S
Sbjct: 200 KFIIQNHGLSTEAQYPYEGVDGTCNANKASVQAVTITGYEDVPANSEQALQKAVANQPIS 259
Query: 268 IAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGY 327
+AI A ++FQ YK G+F G CGT+LDH VT VG+G + DG YWL+KNSWG WG+ GY
Sbjct: 260 VAIDASGSDFQFYKSGVFTGACGTELDHGVTAVGYGVSNDGTKYWLVKNSWGTDWGEEGY 319
Query: 328 MKIVRD----EGLCGIGTRSSYPLA 348
+ + R EG+CGI ++SYP A
Sbjct: 320 IMMQRGIEAAEGICGIAMQASYPTA 344
>gi|225446581|ref|XP_002280246.1| PREDICTED: vignain [Vitis vinifera]
Length = 341
Score = 355 bits (910), Expect = 2e-95, Method: Compositional matrix adjust.
Identities = 176/336 (52%), Positives = 239/336 (71%), Gaps = 16/336 (4%)
Query: 21 IITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEK 80
++ +L + ASQ ++RS HE S+ E HE WM Q+GR YKD EK R KIFK+N+ IE
Sbjct: 14 LLFVLAAWASQA-TARSLHEASMYERHEDWMVQYGREYKDADEKSKRYKIFKDNVARIES 72
Query: 81 ANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHR-STTSSTFKYQNLSMTDVPTS 139
NK +++YKL N+F+DLTN+EFRA +K +H ST +++FKY+N+ T VP++
Sbjct: 73 FNKAMDKSYKLSINEFADLTNEEFRASRNRFK----AHICSTEATSFKYENV--TAVPST 126
Query: 140 LDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNG 198
+DWR KGAVTPIK+Q +CG CWAF+AVAA+EGIT++ +G LI LSEQ+L+DC T+G + G
Sbjct: 127 VDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGEDQG 186
Query: 199 CLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCS--AAQKPAAAKISNYEEVPSGDEQA 256
C GG + AF +I QN G+ TE YPY GTC+ A P AAKI+ YE+VP+ +E+A
Sbjct: 187 CSGGLMDDAFKFIEQNHGLTTEANYPYAGTDGTCNRKKAAHP-AAKINGYEDVPANNEKA 245
Query: 257 LLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKN 316
L KAV+ QP+++AI A +EFQ Y G+F G CGT+LDH V VG+GT++DG YWL+KN
Sbjct: 246 LQKAVAHQPIAVAIDASGSEFQFYSSGVFTGQCGTELDHGVAAVGYGTSDDGMKYWLVKN 305
Query: 317 SWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
SW WG+ GY+++ RD EGLCGI ++SYP A
Sbjct: 306 SWSTGWGEEGYIRMQRDVTAKEGLCGIAMQASYPTA 341
>gi|297826875|ref|XP_002881320.1| hypothetical protein ARALYDRAFT_321132 [Arabidopsis lyrata subsp.
lyrata]
gi|297327159|gb|EFH57579.1| hypothetical protein ARALYDRAFT_321132 [Arabidopsis lyrata subsp.
lyrata]
Length = 341
Score = 355 bits (910), Expect = 2e-95, Method: Compositional matrix adjust.
Identities = 171/335 (51%), Positives = 232/335 (69%), Gaps = 7/335 (2%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
+F I+ S + + + HE S +E HE+WMA+ R Y+DELEK+MR +FK+NL++
Sbjct: 10 IFTILFTTFSISQATSRTVTFHEPSSLEKHEQWMARFSRVYRDELEKQMRRDVFKKNLKF 69
Query: 78 IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
IE NK+GN++YKLG N+F+D TN+EF A++TG K S T S+ + M V
Sbjct: 70 IENFNKKGNKSYKLGVNEFADWTNEEFLAIHTGLKGLSSKVVDETISSRSWNISDMVGV- 128
Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNN 197
S DWR +GAVTP+K Q +CGCCWAF+AVAAVEG+TKI GNL+ LSEQQLLDC +
Sbjct: 129 -SKDWRAEGAVTPVKYQGQCGCCWAFSAVAAVEGVTKIAGGNLVSLSEQQLLDCDREYDR 187
Query: 198 GCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQAL 257
GC GG AF YIIQN+GIA+E++Y YQ G C ++ +P AA+IS ++ VPS +EQAL
Sbjct: 188 GCDGGIMSDAFNYIIQNRGIASENDYSYQGSDGRCRSSARP-AARISGFQTVPSNNEQAL 246
Query: 258 LKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNS 317
L+AVS QPVS+++ A F Y G+++G CGT +HAVT VG+GT++DG YWL KNS
Sbjct: 247 LEAVSRQPVSVSMDANGDGFMHYSGGVYDGPCGTSSNHAVTFVGYGTSQDGTKYWLAKNS 306
Query: 318 WGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
WG TWG+ GY++I RD +G+CG+ + YP+A
Sbjct: 307 WGETWGEKGYIRIRRDVAWPQGMCGVAQYAFYPVA 341
>gi|356542631|ref|XP_003539770.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 354 bits (909), Expect = 3e-95, Method: Compositional matrix adjust.
Identities = 175/338 (51%), Positives = 239/338 (70%), Gaps = 15/338 (4%)
Query: 20 IIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIE 79
+ + LL + ++R+ + S+ E HE+WMAQHG+ YKD EKE+R KIF++N++ IE
Sbjct: 12 LALLLLFGFWAFSANTRTLEDASMHERHEQWMAQHGKVYKDHHEKELRYKIFQQNVKGIE 71
Query: 80 KANKEGNRTYKLGTNQFSDLTNDEFRAL--YTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
N GN+++KLG NQF+DLT +EF+A+ GY S +STFKY+++ T VP
Sbjct: 72 GFNNAGNKSHKLGVNQFADLTEEEFKAINKLKGYMWSKISR----TSTFKYEHV--TKVP 125
Query: 138 TSLDWRDKGAVTPIKNQK-ECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGN 196
+LDWR KGAVTPIK+Q +CG CWAFAAVAA EGITK+ +G LI LSEQ+L+DC TNG+
Sbjct: 126 ATLDWRQKGAVTPIKSQGLKCGSCWAFAAVAATEGITKLTTGELISLSEQELIDCDTNGD 185
Query: 197 NG-CLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSA-AQKPAAAKISNYEEVPSGDE 254
NG C G ++AF +I+QN+G+ATE YPYQAV GTC+A + A I YE+VP+ +E
Sbjct: 186 NGGCKWGIIQEAFKFIVQNKGLATEASYPYQAVDGTCNAKVESKHVASIKGYEDVPANNE 245
Query: 255 QALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLI 314
ALL AV+ QPVS+ + + +F+ Y G+ +G CGT DHAVT+VG+G ++DG YWLI
Sbjct: 246 TALLNAVANQPVSVLVDSSDYDFRFYSSGVLSGSCGTTFDHAVTVVGYGVSDDGTKYWLI 305
Query: 315 KNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
KNSWG WG+ GY++I RD EG+CGI ++SYP+A
Sbjct: 306 KNSWGVYWGEQGYIRIKRDVAAKEGMCGIAMQASYPIA 343
>gi|147839728|emb|CAN70559.1| hypothetical protein VITISV_032465 [Vitis vinifera]
Length = 341
Score = 353 bits (905), Expect = 9e-95, Method: Compositional matrix adjust.
Identities = 175/336 (52%), Positives = 238/336 (70%), Gaps = 16/336 (4%)
Query: 21 IITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEK 80
++ +L + ASQ ++R HE S+ E HE WM Q+GR YKD EK R KIFK+N+ IE
Sbjct: 14 LLFVLAAWASQA-TARXLHEASMYERHEDWMVQYGREYKDADEKSKRYKIFKDNVARIES 72
Query: 81 ANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHR-STTSSTFKYQNLSMTDVPTS 139
NK +++YKL N+F+DLTN+EFRA +K +H ST +++FKY+N+ T VP++
Sbjct: 73 FNKAMDKSYKLSINEFADLTNEEFRASRNRFK----AHICSTEATSFKYENV--TAVPST 126
Query: 140 LDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNG 198
+DWR KGAVTPIK+Q +CG CWAF+AVAA+EGIT++ +G LI LSEQ+L+DC T+G + G
Sbjct: 127 VDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGEDQG 186
Query: 199 CLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCS--AAQKPAAAKISNYEEVPSGDEQA 256
C GG + AF +I QN G+ TE YPY GTC+ A P AAKI+ YE+VP+ +E+A
Sbjct: 187 CSGGLMDDAFKFIEQNHGLTTEANYPYAGTDGTCNRKKAAHP-AAKINGYEDVPANNEKA 245
Query: 257 LLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKN 316
L KAV+ QP+++AI A +EFQ Y G+F G CGT+LDH V VG+GT++DG YWL+KN
Sbjct: 246 LQKAVAHQPIAVAIDASGSEFQFYSSGVFTGQCGTELDHGVAAVGYGTSDDGMKYWLVKN 305
Query: 317 SWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
SW WG+ GY+++ RD EGLCGI ++SYP A
Sbjct: 306 SWSTGWGEEGYIRMQRDVTVKEGLCGIAMQASYPTA 341
>gi|357471211|ref|XP_003605890.1| Cysteine proteinase [Medicago truncatula]
gi|355506945|gb|AES88087.1| Cysteine proteinase [Medicago truncatula]
Length = 343
Score = 353 bits (905), Expect = 9e-95, Method: Compositional matrix adjust.
Identities = 176/326 (53%), Positives = 227/326 (69%), Gaps = 17/326 (5%)
Query: 33 VSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANK-EGNRTYKL 91
V+SR T + + E H +WM+Q+G+ YKD E+E R KIF EN+ YIE NK + N+ Y L
Sbjct: 25 VTSR-TLQDDMYERHRQWMSQYGKVYKDSQEREKRFKIFTENVNYIEAFNKGDNNKLYTL 83
Query: 92 GTNQFSDLTNDEF---RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAV 148
G NQF+DLTNDEF R + G+ S T +STFKY+N S +P+S+DWR KGAV
Sbjct: 84 GVNQFADLTNDEFTSSRNKFKGHMCSSI----TRTSTFKYENASA--IPSSVDWRKKGAV 137
Query: 149 TPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKA 207
TP+KNQ +CGCCWAF+AVAA EGI K+ +G LI LSEQ+L+DC T G + GC GG + A
Sbjct: 138 TPVKNQGQCGCCWAFSAVAATEGIHKLSTGKLISLSEQELVDCDTKGVDQGCEGGLMDDA 197
Query: 208 FAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPV 266
F +IIQN G+ TE YPYQ V GTC+A + A I+ YE+VP+ +EQAL KAV+ QP+
Sbjct: 198 FKFIIQNHGLNTEANYPYQGVDGTCNANKGSINAVTITGYEDVPTNNEQALQKAVANQPI 257
Query: 267 SIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAG 326
S+AI A ++FQ YK G+F G CGT+LDH VT VG+G + DG YWL+KNSWG WG+ G
Sbjct: 258 SVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVSNDGTKYWLVKNSWGTEWGEEG 317
Query: 327 YMKIVRD----EGLCGIGTRSSYPLA 348
Y+ + R EGLCGI ++SYP A
Sbjct: 318 YIMMQRGVDAAEGLCGIAMQASYPTA 343
>gi|356542633|ref|XP_003539771.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 341
Score = 352 bits (904), Expect = 1e-94, Method: Compositional matrix adjust.
Identities = 178/340 (52%), Positives = 232/340 (68%), Gaps = 14/340 (4%)
Query: 15 TTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKEN 74
T +F+I CA + ++R+ + + E HE+WMA HG+ YK EKE + +IF EN
Sbjct: 10 TLALFLIFAF---CAFEA-NARTLEDAPMRERHEQWMATHGKVYKHSYEKEQKYQIFMEN 65
Query: 75 LEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT 134
++ IE N G + YKLG N F+DLTN+EF+A+ +K S R+ T+ TF+Y+N+ T
Sbjct: 66 VQRIEAFNNAGXKPYKLGINHFADLTNEEFKAI-NRFKGHVCSKRTRTT-TFRYENV--T 121
Query: 135 DVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN 194
VP SLDWR KGAVTPIK+Q +CGCCWAF+AVAA EGITK+R+G LI LSEQ+L+DC T
Sbjct: 122 AVPASLDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGITKLRTGKLISLSEQELVDCDTK 181
Query: 195 G-NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSA-AQKPAAAKISNYEEVPSG 252
G + GC GG + AF +I+QN+G+ATE YPY+ GTC+A A A I YE+VP+
Sbjct: 182 GVDQGCEGGLMDDAFKFILQNKGLATEAIYPYEGFDGTCNAKADGNHAGSIKGYEDVPAN 241
Query: 253 DEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYW 312
E ALLKAV+ QPVS+AI A +FQ Y G+F G CGT LDH VT VG+G +DG YW
Sbjct: 242 SESALLKAVANQPVSVAIEASGFKFQFYSGGVFTGSCGTNLDHGVTSVGYGVGDDGTKYW 301
Query: 313 LIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
L+KNSWG WG+ GY+++ RD EGLCGI +SYP A
Sbjct: 302 LVKNSWGVKWGEKGYIRMQRDVAAKEGLCGIAMLASYPSA 341
>gi|147788834|emb|CAN64655.1| hypothetical protein VITISV_005140 [Vitis vinifera]
Length = 341
Score = 352 bits (902), Expect = 2e-94, Method: Compositional matrix adjust.
Identities = 176/340 (51%), Positives = 238/340 (70%), Gaps = 17/340 (5%)
Query: 19 FIIITLLVSCASQV--VSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLE 76
+I + LL A+ +R+ HE S+ E HE WMAQ+GR YKD EK R KIFK+N+
Sbjct: 9 YICLALLFVLAAWASHAKARNLHEASMYERHEDWMAQYGRVYKDAGEKSKRYKIFKDNVA 68
Query: 77 YIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHR-STTSSTFKYQNLSMTD 135
IE NK N++YKL N+F+DLTN+EFRA +K +H ST +++FKY+++
Sbjct: 69 RIESFNKAMNKSYKLSINEFADLTNEEFRASRNRFK----AHICSTEATSFKYEHVXA-- 122
Query: 136 VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG 195
VP+++DWR KGAVTPIK+Q +CG CWAF+AVAA+EGIT++ +G LI LSEQ+L+DC T+G
Sbjct: 123 VPSTVDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSG 182
Query: 196 -NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCS--AAQKPAAAKISNYEEVPSG 252
+ GC GG + AF +I QN G+ TE YPY GTC+ A PAA KI+ YE+VP+
Sbjct: 183 EDQGCSGGLMDDAFKFIEQNHGLTTEANYPYAGTDGTCNRKKAAHPAA-KINGYEDVPAN 241
Query: 253 DEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYW 312
+E+AL KAV+ QP+++AI A EFQ Y G+F G CGT+LDH V+ VG+GT++DG YW
Sbjct: 242 NEKALQKAVAHQPIAVAIDAGGFEFQFYSSGVFTGQCGTELDHGVSAVGYGTSDDGMKYW 301
Query: 313 LIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
L+KNSWG WG+ GY+++ RD EGLCGI ++SYP A
Sbjct: 302 LVKNSWGTGWGEEGYIRMQRDVTEKEGLCGIAMQASYPTA 341
>gi|356577811|ref|XP_003557016.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 351 bits (901), Expect = 2e-94, Method: Compositional matrix adjust.
Identities = 172/324 (53%), Positives = 227/324 (70%), Gaps = 13/324 (4%)
Query: 33 VSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLG 92
V+ RS + S+ E HE+WM ++G+ YKD E+E R +IFKEN+ YIE N N+ YKL
Sbjct: 25 VTCRSLQDASMYERHEQWMTRYGKVYKDPQEREKRFRIFKENVNYIEAFNNAANKRYKLA 84
Query: 93 TNQFSDLTNDEFRALYTGYK--MPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTP 150
NQF+DLTN+EF A +K M S R+TT FKY+N+ T VP+++DWR KGAVTP
Sbjct: 85 INQFADLTNEEFIAPRNRFKGHMCSSIIRTTT---FKYENV--TAVPSTVDWRQKGAVTP 139
Query: 151 IKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFA 209
IK+Q +CGCCWAF+AVAA EGI + SG LI LSEQ+L+DC T G + GC GG + AF
Sbjct: 140 IKDQGQCGCCWAFSAVAATEGIHALTSGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFK 199
Query: 210 YIIQNQGIATEDEYPYQAVPGTCSAAQKP-AAAKISNYEEVPSGDEQALLKAVSMQPVSI 268
++IQN G+ TE YPY+ V G C+ + AA I+ YE+VP+ +E+AL KAV+ QPVS+
Sbjct: 200 FVIQNHGLNTEANYPYKGVDGKCNVNEAANDAATITGYEDVPANNEKALQKAVANQPVSV 259
Query: 269 AIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYM 328
AI A ++FQ YK G+F G CGT+LDH VT VG+G + DG YWL+KNSWG WG+ GY+
Sbjct: 260 AIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVSNDGTEYWLVKNSWGTEWGEEGYI 319
Query: 329 KIVR----DEGLCGIGTRSSYPLA 348
++ R +EGLCGI ++SYP A
Sbjct: 320 RMQRGVNSEEGLCGIAMQASYPTA 343
>gi|18403438|ref|NP_565780.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|2342728|gb|AAB67626.1| cysteine proteinase [Arabidopsis thaliana]
gi|330253821|gb|AEC08915.1| cysteine proteinase-like protein [Arabidopsis thaliana]
Length = 345
Score = 351 bits (901), Expect = 2e-94, Method: Compositional matrix adjust.
Identities = 168/343 (48%), Positives = 238/343 (69%), Gaps = 13/343 (3%)
Query: 18 MFIIITLLVSCASQVVSSRST------HEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIF 71
+ +++T+L+ + S++T EQS+V+ HE+WMA+ R Y+DELEK MR +F
Sbjct: 4 IMVLVTVLIILFTGFRISQATSRTVIFREQSMVDKHEQWMARFSREYRDELEKNMRRDVF 63
Query: 72 KENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYK-MPSPSHRSTTSSTFKYQN 130
K+NL++IE NK+GN++YKLG N+F+D TN+EF A++TG K + S + T Q
Sbjct: 64 KKNLKFIENFNKKGNKSYKLGVNEFADWTNEEFLAIHTGLKGLTEVSPSKVVAKTISSQT 123
Query: 131 LSMTD-VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLL 189
+++D V S DWR +GAVTP+K Q +CGCCWAF+AVAAVEG+ KI GNL+ LSEQQLL
Sbjct: 124 WNVSDMVVESKDWRAEGAVTPVKYQGQCGCCWAFSAVAAVEGVAKIAGGNLVSLSEQQLL 183
Query: 190 DCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEV 249
DC + GC GG AF Y++QN+GIA+E++Y YQ G C + +P AA+IS ++ V
Sbjct: 184 DCDREYDRGCDGGIMSDAFNYVVQNRGIASENDYSYQGSDGGCRSNARP-AARISGFQTV 242
Query: 250 PSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGA 309
PS +E+ALL+AVS QPVS+++ A F Y G+++G CGT +HAVT VG+GT++DG
Sbjct: 243 PSNNERALLEAVSRQPVSVSMDATGDGFMHYSGGVYDGPCGTSSNHAVTFVGYGTSQDGT 302
Query: 310 NYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
YWL KNSWG TWG+ GY++I RD +G+CG+ + YP+A
Sbjct: 303 KYWLAKNSWGETWGEKGYIRIRRDVAWPQGMCGVAQYAFYPVA 345
>gi|356577813|ref|XP_003557017.1| PREDICTED: uncharacterized protein LOC100801364 [Glycine max]
Length = 890
Score = 351 bits (901), Expect = 3e-94, Method: Compositional matrix adjust.
Identities = 171/324 (52%), Positives = 226/324 (69%), Gaps = 13/324 (4%)
Query: 33 VSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLG 92
V+ RS + S+ E HE+WM ++G+ YKD E+E R +IFKEN+ YIE N N+ YKL
Sbjct: 572 VTCRSLQDASMYERHEQWMTRYGKVYKDPQEREKRFRIFKENVNYIEAFNNAANKRYKLA 631
Query: 93 TNQFSDLTNDEFRALYTGYK--MPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTP 150
NQF+DLTN+EF A +K M S R+TT FKY+N+ T VP+++DWR KGAVTP
Sbjct: 632 INQFADLTNEEFIAPRNRFKGHMCSSIIRTTT---FKYENV--TAVPSTVDWRQKGAVTP 686
Query: 151 IKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFA 209
IK+Q +CGCCWAF+AVAA EGI + SG LI LSEQ+L+DC T G + GC GG + AF
Sbjct: 687 IKDQGQCGCCWAFSAVAATEGIHALTSGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFK 746
Query: 210 YIIQNQGIATEDEYPYQAVPGTCSAAQKP-AAAKISNYEEVPSGDEQALLKAVSMQPVSI 268
++IQN G+ TE YPY+ V G C+A + I+ YE+VP+ +E+AL KAV+ QPVS+
Sbjct: 747 FVIQNHGLNTEANYPYKGVDGKCNANEAANDVVTITGYEDVPANNEKALQKAVANQPVSV 806
Query: 269 AIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYM 328
AI A ++FQ YK G+F G CGT+LDH VT VG+G + DG YWL+KNSWG WG+ GY+
Sbjct: 807 AIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVSNDGTEYWLVKNSWGTEWGEEGYI 866
Query: 329 KIVR----DEGLCGIGTRSSYPLA 348
++ R +EGLCGI ++SYP A
Sbjct: 867 RMQRGVDSEEGLCGIAMQASYPTA 890
>gi|356545063|ref|XP_003540965.1| PREDICTED: thiol protease SEN102-like [Glycine max]
Length = 361
Score = 351 bits (900), Expect = 3e-94, Method: Compositional matrix adjust.
Identities = 175/356 (49%), Positives = 239/356 (67%), Gaps = 13/356 (3%)
Query: 1 MVLIFERSGSFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKD 60
+L F + K + + + + L ++ + V+ RS + S+ E HE+WM ++G+ YKD
Sbjct: 11 FLLFFASTMVAKNHFCHISLAMLLCMAFLAFQVTCRSLQDASMYERHEQWMTRYGKVYKD 70
Query: 61 ELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYK--MPSPSH 118
E+E R +IFKEN+ YIE N N+ YKL NQF+DLTN+EF A +K M S
Sbjct: 71 PQEREKRFRIFKENVNYIEAFNNAANKRYKLAINQFADLTNEEFIAPRNRFKGHMCSSII 130
Query: 119 RSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSG 178
R+TT FKY+N+ T VP+++DWR KGAVTPIK+Q +CGCCWAF+AVAA EGI + SG
Sbjct: 131 RTTT---FKYENV--TAVPSTVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALTSG 185
Query: 179 NLIQLSEQQLLDCSTNG-NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQK 237
LI LSEQ+L+DC T G + GC GG + AF ++IQN G+ TE YPY+ V G C+A +
Sbjct: 186 KLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFVIQNHGLNTEANYPYKGVDGKCNANEA 245
Query: 238 P-AAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHA 296
I+ YE+VP+ +E+AL KAV+ QPVS+AI A ++FQ YK G+F G CGT+LDH
Sbjct: 246 ANDVVTITGYEDVPANNEKALQKAVANQPVSVAIDASGSDFQFYKSGVFTGSCGTELDHG 305
Query: 297 VTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR----DEGLCGIGTRSSYPLA 348
VT VG+G + DG YWL+KNSWG WG+ GY+++ R +EGLCGI ++SYP A
Sbjct: 306 VTAVGYGVSNDGTEYWLVKNSWGTEWGEEGYIRMQRGVDSEEGLCGIAMQASYPTA 361
>gi|357452075|ref|XP_003596314.1| Cysteine proteinase [Medicago truncatula]
gi|355485362|gb|AES66565.1| Cysteine proteinase [Medicago truncatula]
Length = 341
Score = 350 bits (899), Expect = 5e-94, Method: Compositional matrix adjust.
Identities = 179/340 (52%), Positives = 233/340 (68%), Gaps = 21/340 (6%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
+F+ + LL S +SR+ + E+HE+WM QHG+ YK EK+ R IFKEN+ Y
Sbjct: 14 LFLCLGLL----SFQATSRTLQNDPMYEMHEQWMVQHGKVYKAAHEKQKRFGIFKENVNY 69
Query: 78 IEKANKEGNRTYKLGTNQFSDLTNDEF---RALYTGYKMPSPSHRSTTSSTFKYQNLSMT 134
IE N GN++YKLG N F+DLTN EF R + GY + +TFKY+N+S
Sbjct: 70 IEAFNNVGNKSYKLGLNHFADLTNHEFIAARNKFNGYL------HGSIITTFKYKNVS-- 121
Query: 135 DVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN 194
DVP+++DWR +GAVTP+KNQ +CGCCWAF+AVA+ EGI K+ +GNL+ LSEQ+L+DC TN
Sbjct: 122 DVPSAVDWRQEGAVTPVKNQGQCGCCWAFSAVASTEGIHKLTTGNLVSLSEQELVDCDTN 181
Query: 195 G-NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQ-KPAAAKISNYEEVPSG 252
G + GC GG + AF +IIQN G++TE EYPYQ V GTC+ + +AA IS YE VP
Sbjct: 182 GEDQGCEGGLMDDAFEFIIQNNGLSTEAEYPYQGVDGTCNKTEVGSSAATISGYENVPVN 241
Query: 253 DEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYW 312
DEQAL KAV+ QPVS+AI A ++FQ YK G+F G CGT+LDH V +VG+G ED YW
Sbjct: 242 DEQALQKAVANQPVSVAIDASGSDFQFYKSGVFTGSCGTELDHGVAVVGYGVGEDETEYW 301
Query: 313 LIKNSWGNTWGDAGYMKIVR----DEGLCGIGTRSSYPLA 348
L+KNSWG WG+ GY+++ R EGLCGI + SYP A
Sbjct: 302 LVKNSWGTQWGEEGYIRMQRGVDASEGLCGIAMQPSYPTA 341
>gi|242068363|ref|XP_002449458.1| hypothetical protein SORBIDRAFT_05g013840 [Sorghum bicolor]
gi|241935301|gb|EES08446.1| hypothetical protein SORBIDRAFT_05g013840 [Sorghum bicolor]
Length = 350
Score = 350 bits (897), Expect = 7e-94, Method: Compositional matrix adjust.
Identities = 170/342 (49%), Positives = 238/342 (69%), Gaps = 7/342 (2%)
Query: 10 SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+F + ++T++V S+ E+++ H++WMA+HGR+YKDE EK R +
Sbjct: 12 TFTAAALMILAVMTMVVEARDLSTSTGGYGEEAMKVRHQQWMAEHGRTYKDEAEKARRFQ 71
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQ 129
+FK N ++++++N G ++Y+L N+F+D+TNDEF A+YTG K P P+ + FKY+
Sbjct: 72 VFKANADFVDRSNAAGGKSYELAINEFADMTNDEFVAMYTGLK-PVPAGPKKMAG-FKYE 129
Query: 130 NLSMTDVPT-SLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQL 188
NL+++DV ++DWR KGAVT IKNQ +CGCCWAFAAVAAVE I +I +GNL+ LSEQQ+
Sbjct: 130 NLTLSDVDQQAVDWRQKGAVTGIKNQGQCGCCWAFAAVAAVESIHQITTGNLVSLSEQQV 189
Query: 189 LDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEE 248
LDC T+GNNGC GG + AF YII N G+ATED YPY A GTC ++ +P A IS+Y++
Sbjct: 190 LDCDTDGNNGCNGGYIDNAFQYIISNGGLATEDAYPYAAAQGTCQSSVQP-AVTISSYQD 248
Query: 249 VPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNG-VCGT-QLDHAVTIVGFGTTE 306
VPSGDE AL AV+ QPV++AI A++ FQ Y G+ CGT L+HAVT VG+ T E
Sbjct: 249 VPSGDEAALAAAVANQPVAVAIDAHNN-FQFYSSGVLTADTCGTPSLNHAVTAVGYSTAE 307
Query: 307 DGANYWLIKNSWGNTWGDAGYMKIVRDEGLCGIGTRSSYPLA 348
DG YWL+KN WG WG+ GY+++ R CG+ ++SYP+A
Sbjct: 308 DGTPYWLLKNQWGQNWGEGGYLRVERGTNACGVAQQASYPVA 349
>gi|255568299|ref|XP_002525124.1| cysteine protease, putative [Ricinus communis]
gi|223535583|gb|EEF37251.1| cysteine protease, putative [Ricinus communis]
Length = 342
Score = 350 bits (897), Expect = 7e-94, Method: Compositional matrix adjust.
Identities = 171/338 (50%), Positives = 229/338 (67%), Gaps = 12/338 (3%)
Query: 19 FIIITLLVSCA--SQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLE 76
F++I L A + S+R HE ++VE HEKWMA+HG+ YKD+ EK R +IFK N+E
Sbjct: 9 FLLIALFFVLAMWADQASTRELHESTMVERHEKWMAKHGKVYKDDEEKLRRFQIFKNNVE 68
Query: 77 YIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDV 136
+IE +N GN +Y LG N+F+DLTN+EFRA + GYK P + R T FKY+N+ T +
Sbjct: 69 FIESSNAAGNNSYMLGINRFADLTNEEFRASWNGYKRPLDASRIVT--PFKYENV--TAL 124
Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG- 195
P S+DWR KGAVT IK+Q+ECG CWAF+AVAA EG+ K+R+G L+ LSEQ+L+DC G
Sbjct: 125 PYSMDWRRKGAVTSIKDQRECGSCWAFSAVAATEGVHKLRTGKLVSLSEQELVDCDVKGE 184
Query: 196 NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDE 254
+ GC GG E AF +I +N GI TE Y Y+ G C ++ + AKI+ Y+ VP E
Sbjct: 185 DKGCQGGLMEDAFKFIKRNGGITTEANYAYRGRDGKCDTKKEASHVAKITGYQVVPENSE 244
Query: 255 QALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLI 314
ALLKAV+ QPVS++I A S FQ Y+ GI+ G CG+ L+H V VG+GT+ G+ YW++
Sbjct: 245 AALLKAVAHQPVSVSIDAGSMSFQFYQSGIYAGSCGSDLNHGVAAVGYGTSSSGSKYWIV 304
Query: 315 KNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
KNSWG WG+ GY+++ RD +GLCGI SYP A
Sbjct: 305 KNSWGPEWGERGYVRMKRDITSRKGLCGIAMDCSYPTA 342
>gi|5823018|gb|AAD53011.1|AF089848_1 senescence-specific cysteine protease [Brassica napus]
Length = 346
Score = 350 bits (897), Expect = 8e-94, Method: Compositional matrix adjust.
Identities = 171/338 (50%), Positives = 226/338 (66%), Gaps = 10/338 (2%)
Query: 18 MFIIITLLVS-CASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLE 76
+F+I++L+ S C S +S E + + H++WMA+HGR+Y D EK R +FK N+E
Sbjct: 8 IFLIVSLVSSFCFSTTLSRLLDDELIMQKKHDEWMAEHGRTYADMNEKNNRYVVFKRNVE 67
Query: 77 YIEKANK-EGNRTYKLGTNQFSDLTNDEFRALYTGYK--MPSPSHRSTTSSTFKYQNLSM 133
IE+ N RT+KL NQF+DLTNDEFR +YTGYK S T S++F+YQN+
Sbjct: 68 RIERLNNVPAGRTFKLAVNQFADLTNDEFRFMYTGYKGDFVLFSQSQTKSTSFRYQNVFF 127
Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST 193
+P ++DWR KGAVTPIKNQ CGCCWAF+AVAA+EG T+I+ G LI LSEQQL+DC T
Sbjct: 128 GALPIAVDWRKKGAVTPIKNQGSCGCCWAFSAVAAIEGATQIKKGKLISLSEQQLVDCDT 187
Query: 194 NGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCS-AAQKPAAAKISNYEEVPSG 252
N + GC GG + AF +I+ G+ TE YPY+ C + KP+AA I+ YE+VP
Sbjct: 188 N-DFGCSGGLMDTAFEHIMATGGLTTESNYPYKGEDANCKIKSTKPSAASITGYEDVPVN 246
Query: 253 DEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYW 312
DE AL+KAV+ QPVS+ I +FQ Y G+F G C T LDHAVT VG+ + G+ YW
Sbjct: 247 DENALMKAVAHQPVSVGIEGGGFDFQFYSSGVFTGECTTYLDHAVTAVGYSQSSAGSKYW 306
Query: 313 LIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
+IKNSWG WG+ GYM+I +D EGLCG+ ++SYP
Sbjct: 307 IIKNSWGTKWGEGGYMRIKKDIKDKEGLCGLAMKASYP 344
>gi|356515086|ref|XP_003526232.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 349 bits (896), Expect = 9e-94, Method: Compositional matrix adjust.
Identities = 175/339 (51%), Positives = 236/339 (69%), Gaps = 17/339 (5%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
M + +T L A QV + R+ + S+ E HE+WM ++G+ YKD E+E R ++FKEN+ Y
Sbjct: 14 MLLCMTFL---AFQV-TCRTLQDASMYERHEQWMTRYGKVYKDPQEREKRFRVFKENVNY 69
Query: 78 IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYK--MPSPSHRSTTSSTFKYQNLSMTD 135
IE N N++YKLG NQF+DLTN EF A G+K M S R+TT FK++N++ T
Sbjct: 70 IEAFNNAANKSYKLGINQFADLTNKEFIAPRNGFKGHMCSSIIRTTT---FKFENVTAT- 125
Query: 136 VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG 195
P+++DWR KGAVTPIK+Q +CGCCWAF+AVAA EGI + +G LI LSEQ+L+DC T G
Sbjct: 126 -PSTVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALSAGKLISLSEQELVDCDTKG 184
Query: 196 -NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAK-ISNYEEVPSGD 253
+ GC GG + AF +IIQN G+ TE YPY+ V G C+A + A I+ YE+VP+ +
Sbjct: 185 VDQGCEGGLMDDAFKFIIQNHGLNTEANYPYKGVDGKCNANEAAKNAATITGYEDVPANN 244
Query: 254 EQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWL 313
E AL KAV+ QPVS+AI A ++FQ YK G+F G CGT+LDH VT VG+G ++DG YWL
Sbjct: 245 EMALQKAVANQPVSVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVSDDGTEYWL 304
Query: 314 IKNSWGNTWGDAGYMKIVR----DEGLCGIGTRSSYPLA 348
+KNSWG WG+ GY+++ R +EGLCGI ++SYP A
Sbjct: 305 VKNSWGTEWGEEGYIRMQRGVDSEEGLCGIAMQASYPTA 343
>gi|357167190|ref|XP_003581045.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like
[Brachypodium distachyon]
Length = 415
Score = 349 bits (896), Expect = 9e-94, Method: Compositional matrix adjust.
Identities = 166/337 (49%), Positives = 230/337 (68%), Gaps = 10/337 (2%)
Query: 19 FIIITLLVSCASQVVSSRS-THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
F+I L +CA +++R T + S+V HE+WMA++GR Y D EK RL++FK N+ +
Sbjct: 82 FLIAILACTCAVSALAARDLTDDLSMVARHEQWMAKYGRVYNDVAEKAQRLEVFKANVAF 141
Query: 78 IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
IE N GN + L NQF+D+T DEFRA +TGYK P P+++ T+ FKY N+S+ +P
Sbjct: 142 IELVNA-GNDKFSLEANQFADMTVDEFRAAHTGYK-PVPANKGRTTQ-FKYANVSLDALP 198
Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-N 196
S+DWR KGAVTPIK+Q +CGCCWAF+ VA+VEGI K+ +G LI LSEQ+L+DC +G +
Sbjct: 199 ASMDWRAKGAVTPIKDQGQCGCCWAFSTVASVEGIVKLSTGKLISLSEQELVDCDVDGMD 258
Query: 197 NGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKP-AAAKISNYEEVPSGDEQ 255
GC GG + AF +II N G+ TE YPY +C++ ++ A I YE+VPS DE
Sbjct: 259 QGCEGGLMDNAFEFIIDNGGLTTEGNYPYTGTDDSCNSNKESNDVASIKGYEDVPSNDET 318
Query: 256 ALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIK 315
+LLKAV+ QPVSIA+ F+ YK G+ +G CGT+LDH + VG+G T DG +WL+K
Sbjct: 319 SLLKAVAAQPVSIAVDGGDNLFRFYKGGVLSGACGTELDHGIAAVGYGITSDGTKFWLMK 378
Query: 316 NSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
NSWG +WG+ G++++ RD EGLCG+ + SYP A
Sbjct: 379 NSWGTSWGEKGFIRMERDIADEEGLCGLAMQPSYPTA 415
>gi|225446585|ref|XP_002280215.1| PREDICTED: vignain [Vitis vinifera]
Length = 341
Score = 349 bits (896), Expect = 9e-94, Method: Compositional matrix adjust.
Identities = 175/336 (52%), Positives = 239/336 (71%), Gaps = 16/336 (4%)
Query: 21 IITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEK 80
++ L + ASQ ++R+ E S+ E HE WMAQ+GR YKD EK R KIFK+N+ IE
Sbjct: 14 LLFFLAAWASQA-TARNLLEASMYERHEDWMAQYGRVYKDADEKSKRYKIFKDNVARIES 72
Query: 81 ANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHR-STTSSTFKYQNLSMTDVPTS 139
NK +++YKL N+F+DLTN+EFRA +K +H ST +++FKY++++ VP++
Sbjct: 73 FNKAMDKSYKLSINEFADLTNEEFRASRNRFK----AHICSTEATSFKYEHVAA--VPST 126
Query: 140 LDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNG 198
+DWR KGAVTPIK+Q +CG CWAF+AVAA+EGIT++ +G LI LSEQ+L+DC T+G + G
Sbjct: 127 VDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGEDQG 186
Query: 199 CLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCS--AAQKPAAAKISNYEEVPSGDEQA 256
C GG + AF +I QN G+ATE YPY GTC+ A PAA KI+ YE+VP+ +E+A
Sbjct: 187 CNGGLMDDAFKFIEQNHGLATEANYPYAGTDGTCNRKKAAHPAA-KINGYEDVPANNEKA 245
Query: 257 LLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKN 316
L KAV+ QP+++AI A EFQ Y G+F G CGT+LDH V VG+GT++DG YWL+KN
Sbjct: 246 LQKAVAHQPIAVAIDAGGFEFQFYSSGVFTGQCGTELDHGVAAVGYGTSDDGMKYWLVKN 305
Query: 317 SWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
SWG WG+ GY+++ RD EGLCGI ++SYP A
Sbjct: 306 SWGTGWGEVGYIRMQRDVTAKEGLCGIAMQASYPTA 341
>gi|255568297|ref|XP_002525123.1| cysteine protease, putative [Ricinus communis]
gi|223535582|gb|EEF37250.1| cysteine protease, putative [Ricinus communis]
Length = 349
Score = 349 bits (895), Expect = 1e-93, Method: Compositional matrix adjust.
Identities = 170/343 (49%), Positives = 227/343 (66%), Gaps = 14/343 (4%)
Query: 12 KINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIF 71
KI +F ++ + CA Q +SR HE + HEKWMA+HG+ YKD+ EK R +IF
Sbjct: 8 KILPIALFFVLAM---CADQA-ASRELHELEMTGRHEKWMAKHGKVYKDDKEKLRRFQIF 63
Query: 72 KENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNL 131
K N+ +IE N GN++Y LG N+F+DLTN+EFRA + GYK P + R T FKY+N+
Sbjct: 64 KSNVVFIESFNTAGNKSYMLGINKFADLTNEEFRAFWNGYKRPLGASRKIT--PFKYENV 121
Query: 132 SMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDC 191
T +P+S+DWR KGAVTPIK+Q CG CWAF+AVAA EGI K+R+G L+ LSEQ+L+DC
Sbjct: 122 --TALPSSIDWRSKGAVTPIKDQGVCGSCWAFSAVAATEGIHKLRTGKLVSLSEQELVDC 179
Query: 192 STNG-NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEV 249
G + GC GG AF +I ++ G+ +E YPYQ G C ++ + A KI+ Y+ V
Sbjct: 180 DVKGQDKGCQGGLMVDAFKFIKRHGGMTSEANYPYQGRDGKCDTKKEASRAVKITGYQAV 239
Query: 250 PSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGA 309
P E ALLKAV+ QPVS+AI A S FQ Y+ GIF G+CG ++H V VG+G + G+
Sbjct: 240 PKNSEAALLKAVANQPVSVAIDAGSLSFQFYRSGIFTGICGKDINHGVAAVGYGRSNSGS 299
Query: 310 NYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
YW++KNSWG WG+ GY+++ RD EGLCGI SYP A
Sbjct: 300 KYWIVKNSWGTEWGEKGYIRMKRDVRSKEGLCGIAMECSYPTA 342
>gi|356539398|ref|XP_003538185.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 349 bits (895), Expect = 1e-93, Method: Compositional matrix adjust.
Identities = 171/323 (52%), Positives = 224/323 (69%), Gaps = 14/323 (4%)
Query: 34 SSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGT 93
++R+ + + E HE+WMA HG+ Y EKE + + FKEN++ IE N GN+ YKLG
Sbjct: 27 NARTLEDAPMRERHEQWMAIHGKVYTHSYEKEQKYQTFKENVQRIEAFNHAGNKPYKLGI 86
Query: 94 NQFSDLTNDEFRAL--YTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPI 151
N F+DLTN+EF+A+ + G+ + T + TF+Y+N MT VP +LDWR +GAVTPI
Sbjct: 87 NHFADLTNEEFKAINRFKGH----VCSKITRTPTFRYEN--MTAVPATLDWRQEGAVTPI 140
Query: 152 KNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFAY 210
K+Q +CGCCWAF+AVAA EGITK+ +G LI LSEQ+L+DC T G + GC GG + AF +
Sbjct: 141 KDQGQCGCCWAFSAVAATEGITKLSTGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKF 200
Query: 211 IIQNQGIATEDEYPYQAVPGTCSA-AQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIA 269
I+QN+G+A E YPY+ V GTC+A A+ A I YE+VP+ E ALLKAV+ QPVS+A
Sbjct: 201 ILQNKGLAAEAIYPYEGVDGTCNAKAEGNHATSIKGYEDVPANSESALLKAVANQPVSVA 260
Query: 270 IAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMK 329
I A EFQ Y G+F G CGT LDH VT VG+G ++DG YWL+KNSWG WGD GY++
Sbjct: 261 IEASGFEFQFYSGGVFTGSCGTNLDHGVTAVGYGVSDDGTKYWLVKNSWGVKWGDKGYIR 320
Query: 330 IVRD----EGLCGIGTRSSYPLA 348
+ RD EGLCGI +SYP A
Sbjct: 321 MQRDVAAKEGLCGIAMLASYPNA 343
>gi|359485281|ref|XP_002280230.2| PREDICTED: LOW QUALITY PROTEIN: KDEL-tailed cysteine endopeptidase
CEP1 [Vitis vinifera]
Length = 341
Score = 348 bits (894), Expect = 2e-93, Method: Compositional matrix adjust.
Identities = 175/336 (52%), Positives = 237/336 (70%), Gaps = 16/336 (4%)
Query: 21 IITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEK 80
++ +L + ASQ ++R+ HE S+ E HE WMAQ+GR YKD EK R KIFK+N+ IE
Sbjct: 14 LLFVLAAWASQA-TARNLHEASMYERHEDWMAQYGRVYKDADEKSKRYKIFKDNVARIES 72
Query: 81 ANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHR-STTSSTFKYQNLSMTDVPTS 139
NK +++YKL N+F+DLTN+EF +K +H ST +++FKY+N+ T VP++
Sbjct: 73 FNKAMDKSYKLSINEFADLTNEEFGTSRNRFK----AHICSTEATSFKYENV--TAVPST 126
Query: 140 LDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNG 198
+DWR KGAVTPIK+Q +CG CWAF+AVAA+EGIT++ +G LI LSEQ+L+DC T+G + G
Sbjct: 127 IDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGEDQG 186
Query: 199 CLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCS--AAQKPAAAKISNYEEVPSGDEQA 256
C GG + AF +I QN G+ TE YPY GTC+ A P AAKI+ YE+VP+ +E+A
Sbjct: 187 CNGGLMDDAFKFIKQNHGLTTEANYPYAGTDGTCNRKKAAHP-AAKINGYEDVPANNEKA 245
Query: 257 LLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKN 316
L KAV QP+++AI A EFQ Y G+F G CGT+LDH V VG+GT++DG YWL+KN
Sbjct: 246 LQKAVVHQPIAVAIDAGGFEFQFYSSGVFTGQCGTELDHGVAAVGYGTSDDGMKYWLVKN 305
Query: 317 SWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
SWG WG+ GY+++ RD EGLCGI ++SYP A
Sbjct: 306 SWGTGWGEEGYIRMQRDVTAKEGLCGIAMQASYPTA 341
>gi|319826926|gb|ADV74756.1| cysteine protease [Lactuca sativa]
Length = 363
Score = 348 bits (893), Expect = 2e-93, Method: Compositional matrix adjust.
Identities = 165/321 (51%), Positives = 222/321 (69%), Gaps = 8/321 (2%)
Query: 33 VSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLG 92
+SR+ ++ +++ HE+WMA HGR Y DE EK++R +IFK N+ YI+ N +++Y L
Sbjct: 41 ATSRTLNDPTMIARHEQWMAHHGRIYTDENEKQLRFQIFKNNVAYIDAHNARSDQSYTLE 100
Query: 93 TNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIK 152
N+F+DLTNDEFRA GYK S S F+Y N+S VP +DWR +GAVTP+K
Sbjct: 101 VNKFADLTNDEFRASRNGYKKQPDSDSHVVSGLFRYANVSA--VPDEVDWRKEGAVTPVK 158
Query: 153 NQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFAYI 211
+Q +CGCCWAF+AVAA+EGI K+ +G L+ LSEQ+L+DC +G + GC GG E AF +I
Sbjct: 159 DQGDCGCCWAFSAVAAMEGINKLENGKLVSLSEQELVDCDIDGIDQGCEGGLMENAFQFI 218
Query: 212 IQNQGIATEDEYPYQAVPGTCSAAQKP-AAAKISNYEEVPSGDEQALLKAVSMQPVSIAI 270
+ +G+A E YPY G C+ + AAKIS +E+VP+ +E+ALL+AV+ QPVSIAI
Sbjct: 219 EKRKGLAAESVYPYTGEDGICNTKKAAIPAAKISGHEKVPANNEKALLQAVANQPVSIAI 278
Query: 271 AAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKI 330
A EFQ Y G+F G CGT+LDHA+T VG+G T DG YWL+KNSWG +WG+ GY++I
Sbjct: 279 DASGYEFQFYSGGVFTGSCGTELDHAITAVGYGATMDGTKYWLMKNSWGASWGENGYIRI 338
Query: 331 VRD----EGLCGIGTRSSYPL 347
RD EGLCGI SYP+
Sbjct: 339 KRDSLAKEGLCGIAMDPSYPV 359
>gi|18401420|ref|NP_565649.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|4314384|gb|AAD15594.1| cysteine proteinase [Arabidopsis thaliana]
gi|17381154|gb|AAL36389.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|20465849|gb|AAM20029.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|330252901|gb|AEC07995.1| cysteine proteinase-like protein [Arabidopsis thaliana]
Length = 348
Score = 348 bits (893), Expect = 2e-93, Method: Compositional matrix adjust.
Identities = 179/348 (51%), Positives = 231/348 (66%), Gaps = 17/348 (4%)
Query: 16 TPMFIIITLLVSCASQVVSSR-STHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKEN 74
+ + I+T+ +S + + +SR S E S +E HE+WMA+ R Y DE EK R IFK+N
Sbjct: 3 STIIFILTIFLSYRTSLATSRGSLFEASAIEKHEQWMARFNRVYSDETEKRNRFNIFKKN 62
Query: 75 LEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSST------FKY 128
LE+++ N TYK+ N+FSDLT++EFRA +TG +P R +T S+ F+Y
Sbjct: 63 LEFVQNFNMNNKITYKVDINEFSDLTDEEFRATHTGLVVPEAITRISTLSSGKNTVPFRY 122
Query: 129 QNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQL 188
N+S D S+DWR +GAVTP+K Q CG CWAF+AVAAVEGITKI G L+ LSEQQL
Sbjct: 123 GNVS--DNGESMDWRQEGAVTPVKYQGRCGGCWAFSAVAAVEGITKITKGELVSLSEQQL 180
Query: 189 LDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA----AAKIS 244
LDC + N GC GG KAF YII+NQGI TED YPYQ TCS++ + AA IS
Sbjct: 181 LDCDRDYNQGCRGGIMSKAFEYIIKNQGITTEDNYPYQESQQTCSSSTTLSSSFRAATIS 240
Query: 245 NYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGT 304
YE VP +E+ALL+AVS QPVS+ I F+ Y G+FNG CGT L HAVTIVG+G
Sbjct: 241 GYETVPMNNEEALLQAVSQQPVSVGIEGTGAAFRHYSGGVFNGECGTDLHHAVTIVGYGM 300
Query: 305 TEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
+E+G YW++KNSWG TWG+ GYM+I RD +G+CG+ + YPLA
Sbjct: 301 SEEGTKYWVVKNSWGETWGENGYMRIKRDVDAPQGMCGLAILAFYPLA 348
>gi|356543038|ref|XP_003539970.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 347 bits (890), Expect = 4e-93, Method: Compositional matrix adjust.
Identities = 175/324 (54%), Positives = 226/324 (69%), Gaps = 13/324 (4%)
Query: 33 VSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLG 92
V+SR+ + S+ E HE+WMA++ + YKD E+E R KIFKEN+ YIE N N+ YKLG
Sbjct: 25 VTSRTLQDASMYERHEEWMARYAKVYKDPEEREKRFKIFKENVNYIEAFNNAANKPYKLG 84
Query: 93 TNQFSDLTNDEFRALYTGYK--MPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTP 150
NQF+DLTN+EF A +K M S R+TT FKY+N+ T +P+++DWR KGAVTP
Sbjct: 85 INQFADLTNEEFIAPRNRFKGHMCSSITRTTT---FKYENV--TALPSTVDWRQKGAVTP 139
Query: 151 IKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFA 209
IK+Q +CGCCWAF+AVAA EGI + SG LI LSEQ+++DC T G + GC GG + AF
Sbjct: 140 IKDQGQCGCCWAFSAVAATEGIHALNSGKLISLSEQEVVDCDTKGEDQGCAGGFMDGAFK 199
Query: 210 YIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAK-ISNYEEVPSGDEQALLKAVSMQPVSI 268
+IIQN G+ TE YPY+AV G C+A + A I+ YE+VP +E+AL KAV+ QPVS+
Sbjct: 200 FIIQNHGLNTEANYPYKAVDGKCNANEAANHAATITGYEDVPVNNEKALQKAVANQPVSV 259
Query: 269 AIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYM 328
AI A ++FQ YK G+F G CGTQLDH VT VG+G + DG YWL+KNSWG WG+ GY+
Sbjct: 260 AIDASGSDFQFYKTGVFTGSCGTQLDHGVTAVGYGVSADGTQYWLVKNSWGTEWGEEGYI 319
Query: 329 KIVR----DEGLCGIGTRSSYPLA 348
+ R EGLCGI +SYP A
Sbjct: 320 MMQRGVKAQEGLCGIAMMASYPTA 343
>gi|102140014|gb|ABF70145.1| cysteine protease, putative [Musa acuminata]
Length = 373
Score = 347 bits (890), Expect = 4e-93, Method: Compositional matrix adjust.
Identities = 172/316 (54%), Positives = 222/316 (70%), Gaps = 11/316 (3%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
+ S+ E H +WMA+HGR+YKD EKE RL IFK N+EYIE N G R Y+L NQF+DL
Sbjct: 28 DASMAERHVEWMARHGRTYKDAAEKEQRLGIFKSNVEYIESFNA-GKRKYQLAANQFADL 86
Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGC 159
T++EF+A++TG+K PS + + F++ +LS VP S+DWR KGAVTP+K+Q CG
Sbjct: 87 THEEFKAMHTGFK-PSGTGAKKAGNGFRHGSLS--SVPDSVDWRSKGAVTPVKDQGLCGS 143
Query: 160 CWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFAYIIQNQGIA 218
CWAF VAAVEGITKI +G LI LSEQQL+DC +G + GC GG + AF +I+ N GI
Sbjct: 144 CWAFTVVAAVEGITKIVTGKLISLSEQQLVDCDVHGKDQGCQGGDMDAAFEFIVNNGGIT 203
Query: 219 TEDEYPYQAVPGTCSAAQKP-AAAKISNYEEVPSGDEQALLKAVSMQPVSIAI-AAYSTE 276
+E YPY+ V C+A A I ++E+VP+ DE+AL KAV+ QPVS+ I A S +
Sbjct: 204 SEANYPYEEVQRLCNAHNASFVVATIESHEDVPTNDEKALRKAVANQPVSVGIDAGSSLD 263
Query: 277 FQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD--- 333
FQ Y G+F+G CGT LDHAVT+VG+GTT DG YWL KNSWG TWG+ GY+++ RD
Sbjct: 264 FQLYSGGVFSGECGTDLDHAVTVVGYGTTSDGTKYWLAKNSWGETWGENGYIRMERDVAA 323
Query: 334 -EGLCGIGTRSSYPLA 348
EGLCGI ++SYP A
Sbjct: 324 KEGLCGIAMQASYPTA 339
>gi|400180377|gb|AFP73327.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 347 bits (890), Expect = 5e-93, Method: Compositional matrix adjust.
Identities = 173/345 (50%), Positives = 235/345 (68%), Gaps = 13/345 (3%)
Query: 10 SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+ K++ + I + ++S + RS E SV E HE WM++HGR YKDE+EK R
Sbjct: 2 AMKVDLMNILITLFFVISMFNTQTRGRSQPELSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N+F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
FK +LS D+P++LDWR+ GAVT +K+Q CGCCWAF+AV ++EG KI +GNL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISN 245
Q+LLDC+TN N GC GG AF +II+N GI+ E +Y YQ TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYQGEQYTCRSQEKTAAVQISS 238
Query: 246 YEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
Y+ VP G E +LL+AV+ QPVSI IAA S + Q Y G ++G C +++HAVT +G+GT
Sbjct: 239 YQVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTD 296
Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
E G YWL+KNSWG +WG+ G+MKI+RD GLC I SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|297826061|ref|XP_002880913.1| hypothetical protein ARALYDRAFT_481640 [Arabidopsis lyrata subsp.
lyrata]
gi|297326752|gb|EFH57172.1| hypothetical protein ARALYDRAFT_481640 [Arabidopsis lyrata subsp.
lyrata]
Length = 347
Score = 347 bits (890), Expect = 5e-93, Method: Compositional matrix adjust.
Identities = 180/348 (51%), Positives = 232/348 (66%), Gaps = 16/348 (4%)
Query: 15 TTPMFIIITLLVSCASQVVSSRS-THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKE 73
++ + I+T+ +S + + +SR E S +E HE+WMA+ R Y DE EK R IFK+
Sbjct: 2 SSTIIFILTIFLSYRTSLATSRGGLFEASPIEKHEQWMARFNRVYSDESEKRNRFNIFKK 61
Query: 74 NLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSST-----FKY 128
NLE+++ N N TYKL N+FSDLT++EFRA +TG +P +T S+ F+Y
Sbjct: 62 NLEFVQSFNMNKNITYKLDVNEFSDLTDEEFRATHTGLVVPEEITGISTLSSDKTVPFRY 121
Query: 129 QNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQL 188
N+S D S+DWR +GAVTP+K Q CG CWAF+AVAAVEGITKI G L+ LSEQQL
Sbjct: 122 GNVS--DTGESMDWRQEGAVTPVKYQGRCGGCWAFSAVAAVEGITKITKGELVSLSEQQL 179
Query: 189 LDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA----AAKIS 244
LDC T+ N GC GG KAF YII+NQGI TED YPYQ TCS++ + AA IS
Sbjct: 180 LDCDTDYNQGCHGGIMSKAFEYIIKNQGITTEDNYPYQESQQTCSSSTTLSSSFRAATIS 239
Query: 245 NYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGT 304
YE VP +E+ALL+AVS QPVS+ I F+ Y GIFNG CGT L HAVTIVG+G
Sbjct: 240 GYETVPMNNEEALLQAVSQQPVSVGIEGTGAGFRHYSGGIFNGECGTDLHHAVTIVGYGM 299
Query: 305 TEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
+E+G YW++KNSWG TWG+ G+M+I RD +G+CG+ + YPLA
Sbjct: 300 SEEGTKYWVVKNSWGETWGEDGFMRIKRDVDAPQGMCGLAMLAFYPLA 347
>gi|400180419|gb|AFP73348.1| cysteine protease [Solanum lycopersicoides]
Length = 343
Score = 346 bits (887), Expect = 9e-93, Method: Compositional matrix adjust.
Identities = 171/342 (50%), Positives = 236/342 (69%), Gaps = 8/342 (2%)
Query: 10 SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+ KI+ + I + ++S + ++RS + SV E HE WM++HGR YKDE+EK R
Sbjct: 2 AMKIDLMSILITLFFVISMFNSQTTARSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSST-FKY 128
IFKEN+++IE NK GN +YKLG N+F+D+T++EF +TG +PS S SST FK
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGINEFADITSEEFLTKFTGINIPSYLSPSPMSSTEFKI 121
Query: 129 QNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQL 188
+LS D+P++LDWR+ GAVT +KNQ +CGCCWAF+AV ++EG KI +GNL++ SEQ+L
Sbjct: 122 NDLSDDDMPSNLDWRESGAVTQVKNQGQCGCCWAFSAVGSLEGAYKIATGNLMEFSEQEL 181
Query: 189 LDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEE 248
LDC+TN N GC GG AF +I +N GI++E +Y YQ TC + +K AA +IS+Y+
Sbjct: 182 LDCTTN-NYGCNGGFMTNAFDFIKENGGISSESDYEYQGQQYTCRSQEKTAAVQISSYQV 240
Query: 249 VPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDG 308
VP G E +LL+AV+ QPVSI IAA S + Q Y G ++G C +++HAVT +G+GT E G
Sbjct: 241 VPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTDEKG 298
Query: 309 ANYWLIKNSWGNTWGDAGYMKIVRDE----GLCGIGTRSSYP 346
YWL+KNSWG +WG+ G+MKI+RD G C I SSYP
Sbjct: 299 QKYWLLKNSWGTSWGENGFMKIIRDSGNPGGHCDIAKMSSYP 340
>gi|224099295|ref|XP_002334495.1| predicted protein [Populus trichocarpa]
gi|222872550|gb|EEF09681.1| predicted protein [Populus trichocarpa]
Length = 342
Score = 346 bits (887), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 168/336 (50%), Positives = 220/336 (65%), Gaps = 9/336 (2%)
Query: 19 FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
F L++ + V+SR E + HE+WMA +G+ Y D EKE R KIFK N+EYI
Sbjct: 10 FFAFILILGMWAFEVASRELQESYMSARHEQWMATYGKVYVDAAEKERRFKIFKNNVEYI 69
Query: 79 EKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPT 138
E N GN+ YKL N+F+D TN++F+ GY+ P + R ++FKY+N+ T VP
Sbjct: 70 ESFNTAGNKPYKLSVNKFADQTNEKFKGARNGYRRPFQT-RPMKVTSFKYENV--TAVPA 126
Query: 139 SLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NN 197
++DWR KGAVTPIK+Q +CG CWAF+ VAA EGI ++ +G L+ LSEQ+L+DC G +
Sbjct: 127 TMDWRKKGAVTPIKDQGQCGSCWAFSTVAATEGINQLTTGKLVSLSEQELVDCDNQGEDQ 186
Query: 198 GCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTC-SAAQKPAAAKISNYEEVPSGDEQA 256
GC GG E F +II+N GI TE YPYQA GTC S Q AKI+ YE VP+ E
Sbjct: 187 GCEGGLMEDGFEFIIKNHGITTEANYPYQAADGTCNSKKQASHIAKITGYESVPANSEAE 246
Query: 257 LLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKN 316
LLK V+ QP+S++I A ++FQ Y G+F G CGT+LDH VT VG+G T DG YWL+KN
Sbjct: 247 LLKVVANQPISVSIDAGGSDFQFYSSGVFTGKCGTELDHGVTAVGYGETSDGTKYWLVKN 306
Query: 317 SWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
SW +WG+ GY+++ RD EGLCGI SSYP A
Sbjct: 307 SWXTSWGEEGYIRMQRDIDAEEGLCGIAMDSSYPTA 342
>gi|356543076|ref|XP_003539989.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 346 bits (887), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 174/324 (53%), Positives = 226/324 (69%), Gaps = 13/324 (4%)
Query: 33 VSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLG 92
V+SR+ + S+ E HE+WMA++ + YKD E+E R KIFKEN+ YIE N ++ YKLG
Sbjct: 25 VTSRTLQDASMYERHEEWMARYAKVYKDPEEREKRFKIFKENVNYIEAFNNAADKPYKLG 84
Query: 93 TNQFSDLTNDEFRALYTGYK--MPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTP 150
NQF+DLTN+EF A +K M S R+TT FKY+N+ T +P+++DWR KGAVTP
Sbjct: 85 INQFADLTNEEFIAPRNKFKGHMCSSITRTTT---FKYENV--TALPSTVDWRQKGAVTP 139
Query: 151 IKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFA 209
IK+Q +CGCCWAF+AVAA EGI + SG LI LSEQ+++DC T G + GC GG + AF
Sbjct: 140 IKDQGQCGCCWAFSAVAATEGIHALNSGKLISLSEQEVVDCDTKGEDQGCAGGFMDGAFK 199
Query: 210 YIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAK-ISNYEEVPSGDEQALLKAVSMQPVSI 268
+IIQN G+ TE YPY+AV G C+A + A I+ YE+VP +E+AL KAV+ QPVS+
Sbjct: 200 FIIQNHGLNTEANYPYKAVDGKCNANEAANHAATITGYEDVPVNNEKALQKAVANQPVSV 259
Query: 269 AIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYM 328
AI A ++FQ YK G+F G CGTQLDH VT VG+G + DG YWL+KNSWG WG+ GY+
Sbjct: 260 AIDASGSDFQFYKTGVFTGSCGTQLDHGVTAVGYGVSADGTQYWLVKNSWGTEWGEEGYI 319
Query: 329 KIVR----DEGLCGIGTRSSYPLA 348
+ R EGLCGI +SYP A
Sbjct: 320 MMQRGVKAQEGLCGIAMMASYPTA 343
>gi|400180443|gb|AFP73358.1| cysteine protease, partial [Solanum habrochaites]
Length = 345
Score = 346 bits (887), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 172/345 (49%), Positives = 235/345 (68%), Gaps = 13/345 (3%)
Query: 10 SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+ K++ + I + ++S + +RS + SV E HE WM++HGR YKDE+EK R
Sbjct: 2 AMKVDLMSILITLFFVISMFNSQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N+F+D+T++EF A +TG +P SPS +T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSEEFLAKFTGLNIPNSYLSPSPMPSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
FK +LS D+P++LDWR+ GAVT +KNQ +CGCCWAF+AV ++EG KI +GNL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKNQGQCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISN 245
Q+LLDC+TN N GC GG AF +II+N GI+ E +Y Y TC + K AA +ISN
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQYTCRSQGKTAAVQISN 238
Query: 246 YEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
Y+ VP G E +LL+AV+ QPVSI IAA S + Q Y G ++G C +++HAVT +G+GT
Sbjct: 239 YQVVPEG-ETSLLQAVTKQPVSIGIAA-SHDLQFYAGGTYDGSCANRINHAVTAIGYGTD 296
Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRDE----GLCGIGTRSSYP 346
E G YWL+KNSWG +WG+ G+MKI+RD GLC I SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPAGLCDIAKMSSYP 341
>gi|40806498|gb|AAR92154.1| putative cysteine protease 1 [Iris x hollandica]
Length = 340
Score = 345 bits (886), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 177/345 (51%), Positives = 232/345 (67%), Gaps = 17/345 (4%)
Query: 13 INTTPMFIIITLLVSC--ASQVVSSRSTHE-QSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
T + + LL+ ASQ + RS E +S++E HE+WMAQHGR YK+ EK R +
Sbjct: 4 FKTVKLLPALALLIVAIWASQGEAGRSLGENKSMLERHEQWMAQHGRVYKNAAEKAHRFE 63
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQ 129
IF+ N+E IE N E N +KLG NQF+DLTN+EF+ T PS ++T S FKY+
Sbjct: 64 IFRANVERIESFNAE-NHKFKLGVNQFADLTNEEFKTRNT----LKPSKMASTKS-FKYE 117
Query: 130 NLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLL 189
N+ T VP ++DWR KGAVTPIK+Q +CG CWAF+AVAA EGITK+ +G LI LSEQ+++
Sbjct: 118 NV--TAVPATMDWRTKGAVTPIKDQGQCGSCWAFSAVAATEGITKLSTGKLISLSEQEVV 175
Query: 190 DCS-TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYE 247
DC T+ + GC GG + AF YII+N+GI TE YPY+A GTC+ + + AA I+ YE
Sbjct: 176 DCDVTSDDQGCNGGEMDDAFEYIIKNKGITTEANYPYKAADGTCNTKKAASHAASITGYE 235
Query: 248 EVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTED 307
+V E ALLKA + QP+++AI A FQ Y G+F G CGT LDH VT+VG+G T D
Sbjct: 236 DVTVNSEAALLKAAANQPIAVAIDAGDFAFQMYSSGVFTGDCGTDLDHGVTLVGYGATSD 295
Query: 308 GANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
G YWL+KNSWG +WG+ GY+++ RD EGLCGI +SYP A
Sbjct: 296 GTKYWLVKNSWGTSWGEDGYIRMERDVDAKEGLCGIAMDASYPTA 340
>gi|110737404|dbj|BAF00646.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 345
Score = 345 bits (886), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 166/343 (48%), Positives = 236/343 (68%), Gaps = 13/343 (3%)
Query: 18 MFIIITLLVSCASQVVSSRST------HEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIF 71
+ +++T+L+ + S++T EQS+V+ HE+WMA+ R Y+DELEK MR +F
Sbjct: 4 IMVLVTVLIILFTGFRISQATSRTVIFREQSMVDKHEQWMARFSREYRDELEKNMRRDVF 63
Query: 72 KENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYK-MPSPSHRSTTSSTFKYQN 130
K+NL++IE NK+GN++YKLG N+F+D TN+EF A++TG K + S + T Q
Sbjct: 64 KKNLKFIENFNKKGNKSYKLGVNEFADWTNEEFLAIHTGLKGLTEVSPSKVVAKTISSQT 123
Query: 131 LSMTD-VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLL 189
+++D V S DWR +GAVTP+K Q +CGCCWAF+AVAAVEG+ KI GNL+ LSEQQLL
Sbjct: 124 WNVSDMVVESKDWRAEGAVTPVKYQGQCGCCWAFSAVAAVEGVAKIAGGNLVSLSEQQLL 183
Query: 190 DCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEV 249
DC + C GG AF Y++QN+GIA+E++Y YQ G C + +P AA+IS ++ V
Sbjct: 184 DCDREYDRDCDGGIMSDAFNYVVQNRGIASENDYSYQGSDGGCRSNARP-AARISGFQTV 242
Query: 250 PSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGA 309
PS +E+ALL+AVS QPVS+++ A F Y G+++G CGT +HAVT VG+GT++DG
Sbjct: 243 PSNNERALLEAVSRQPVSVSMDATGDGFMHYSGGVYDGPCGTSSNHAVTFVGYGTSQDGT 302
Query: 310 NYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
YWL KNSWG TW + GY++I RD +G+CG+ + YP+A
Sbjct: 303 KYWLAKNSWGETWEEKGYIRIRRDVAWPQGMCGVAQYAFYPVA 345
>gi|356515048|ref|XP_003526213.1| PREDICTED: vignain-like [Glycine max]
Length = 350
Score = 345 bits (885), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 171/337 (50%), Positives = 234/337 (69%), Gaps = 14/337 (4%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
+ ++ LL C SQV+S R+ HE S+ E HE+WM ++G+ YKD EK+ RL IFK+N+E+
Sbjct: 10 ILALVLLLSICTSQVMS-RNLHEASMSERHEQWMKKYGKVYKDAAEKQKRLLIFKDNVEF 68
Query: 78 IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
IE N GN+ YKL N +D TN+EF A + GYK SH T FKY N+ TD+P
Sbjct: 69 IESFNAAGNKPYKLSINHLADQTNEEFVASHNGYKYKG-SHSQT---PFKYGNV--TDIP 122
Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNN 197
T++DWR GAVT +K+Q +CG CWAF+ VAA EGI +I +G L+ LSEQ+L+DC + ++
Sbjct: 123 TAVDWRQNGAVTAVKDQGQCGSCWAFSTVAATEGIYQISTGMLMSLSEQELVDCDSV-DH 181
Query: 198 GCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQA 256
GC GG E F +II+N GI++E YPY AV GTC A+++ + AA+I YE VP+ E+A
Sbjct: 182 GCDGGLMEDGFEFIIKNGGISSEANYPYTAVDGTCDASKEASPAAQIKGYETVPANSEEA 241
Query: 257 LLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGAN-YWLIK 315
L +AV+ QPVS++I A + FQ Y G+F G CGTQLDH VT+VG+GTT+DG + YW++K
Sbjct: 242 LQQAVANQPVSVSIDAGGSGFQFYSSGVFTGQCGTQLDHGVTVVGYGTTDDGTHEYWIVK 301
Query: 316 NSWGNTWGDAGYMKIVR----DEGLCGIGTRSSYPLA 348
NSWG WG+ GY+++ R EGLCGI +SYP+
Sbjct: 302 NSWGTQWGEEGYIRMQRGIDAQEGLCGIAMDASYPMG 338
>gi|400180441|gb|AFP73357.1| cysteine protease [Solanum habrochaites]
Length = 344
Score = 345 bits (884), Expect = 3e-92, Method: Compositional matrix adjust.
Identities = 170/345 (49%), Positives = 237/345 (68%), Gaps = 13/345 (3%)
Query: 10 SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+ K++ + I + ++S + +RS + SV E HE WM++HGR YKDE+EK R
Sbjct: 2 AMKVDLMSILITLFFVISMFNSQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N+F+D+T++EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSEEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
FK ++S D+P++LDWR+ GAVT +KNQ +CGCCWAF+AV ++EG KI +GNL++ SE
Sbjct: 120 FKINDISDDDMPSNLDWRESGAVTQVKNQGQCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISN 245
Q+LLDC+TN N GC GG AF +I +N GI+ E +Y Y TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIRENGGISRESDYEYLGQQYTCRSQEKTAAVQISS 238
Query: 246 YEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
Y+ VP G E +LL+AV+ QPVSI IAA S + Q Y G ++G C +++HAVT +G+GT
Sbjct: 239 YQVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCANRINHAVTAIGYGTD 296
Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
E+G YWL+KNSWG +WG+ G+MKI+RD GLC I SSYP
Sbjct: 297 ENGQKYWLLKNSWGTSWGEKGFMKIIRDYGNPSGLCDIAKLSSYP 341
>gi|297794671|ref|XP_002865220.1| senescence-associated gene 12 [Arabidopsis lyrata subsp. lyrata]
gi|297311055|gb|EFH41479.1| senescence-associated gene 12 [Arabidopsis lyrata subsp. lyrata]
Length = 346
Score = 345 bits (884), Expect = 3e-92, Method: Compositional matrix adjust.
Identities = 166/338 (49%), Positives = 226/338 (66%), Gaps = 10/338 (2%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEI-HEKWMAQHGRSYKDELEKEMRLKIFKENLE 76
+F+ + + S + SR + +++ H +WM +HGR Y D EK R +FK N+E
Sbjct: 8 IFLFVAIFSSFYFSISLSRPLDNELIMQKRHIEWMTKHGRVYADVKEKSNRYVVFKSNVE 67
Query: 77 YIEKANK-EGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSP--SHRSTTSSTFKYQNLSM 133
IE N RT+KL NQF+DLTNDEFR++YTG+K S S T +++F+YQN+S
Sbjct: 68 RIEHLNNIPAGRTFKLAVNQFADLTNDEFRSMYTGFKGVSSLSSQSQTKTTSFRYQNVSS 127
Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST 193
+P S+DWR KGAVTPIKNQ CGCCWAF+AVAA+EG T+I+ G LI LSEQQL+DC T
Sbjct: 128 GALPISVDWRTKGAVTPIKNQGSCGCCWAFSAVAAIEGATQIKKGKLISLSEQQLVDCDT 187
Query: 194 NGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQ-KPAAAKISNYEEVPSG 252
N + GC GG + AF +I+ G+ TE YPY+ TC++ + P A I+ YE+VP
Sbjct: 188 N-DFGCEGGLMDTAFEHIMATGGLTTESNYPYKGEDATCNSKKTNPKATSITGYEDVPVN 246
Query: 253 DEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYW 312
DEQAL+KAV+ QPVS+ I +FQ Y G+F G C T LDHAVT +G+G + +G+ YW
Sbjct: 247 DEQALMKAVAHQPVSVGIEGGGFDFQFYSSGVFTGECTTYLDHAVTAIGYGQSTNGSKYW 306
Query: 313 LIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
+IKNSWG WG++GYM+I +D +GLCG+ ++SYP
Sbjct: 307 IIKNSWGTKWGESGYMRIQKDIKDKQGLCGLAMKASYP 344
>gi|224081320|ref|XP_002306369.1| predicted protein [Populus trichocarpa]
gi|222855818|gb|EEE93365.1| predicted protein [Populus trichocarpa]
Length = 340
Score = 345 bits (884), Expect = 3e-92, Method: Compositional matrix adjust.
Identities = 174/349 (49%), Positives = 236/349 (67%), Gaps = 20/349 (5%)
Query: 11 FKINTTPMFIIITLLVSCAS--QVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRL 68
++ FI + LL + ++R+ + S+ E HE+WMAQ+GR YKD+ EKE R
Sbjct: 1 MRLTKQSQFICLALLFVLGAWPSKSAARTLQDVSMYERHEQWMAQYGRVYKDDAEKETRY 60
Query: 69 KIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYK--MPSPSHRSTTSSTF 126
IFKEN+ I+ N + ++YKLG NQF+DL+N+EF+A +K M SP + F
Sbjct: 61 NIFKENVARIDAFNSQTGKSYKLGVNQFADLSNEEFKASRNRFKGHMCSPQ-----AGPF 115
Query: 127 KYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQ 186
+Y+N+S VP ++DWR KGAVTP+K+Q +CGCCWAF+AVAA+EGI ++ +G LI LSEQ
Sbjct: 116 RYENVSA--VPATMDWRKKGAVTPVKDQGQCGCCWAFSAVAAMEGINQLTTGKLISLSEQ 173
Query: 187 QLLDCSTNG-NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA--AAKI 243
+++DC T G + GC GG + AF +I QN+G+ TE YPY GTC+ QK A AAKI
Sbjct: 174 EVVDCDTKGEDQGCNGGLMDDAFKFIEQNKGLTTEANYPYTGTDGTCN-TQKEATHAAKI 232
Query: 244 SNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFG 303
+ +E+VP+ E AL+KAV+ QPVS+AI A EFQ Y GIF G CGTQLDH VT VG+G
Sbjct: 233 TGFEDVPANSEAALMKAVAKQPVSVAIDAGGFEFQFYSSGIFTGSCGTQLDHGVTAVGYG 292
Query: 304 TTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
+ DG YWL+KNSWG WG+ GY+++ +D EGLCGI ++SYP A
Sbjct: 293 IS-DGTKYWLVKNSWGAQWGEEGYIRMQKDISAKEGLCGIAMQASYPSA 340
>gi|356517348|ref|XP_003527349.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 345 bits (884), Expect = 3e-92, Method: Compositional matrix adjust.
Identities = 169/324 (52%), Positives = 224/324 (69%), Gaps = 13/324 (4%)
Query: 33 VSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLG 92
V+ R+ + S+ E HE+WM ++ + YKD E+E R KIFKEN+ YIE N N+ Y LG
Sbjct: 25 VTCRTLQDASMYERHEEWMGRYAKVYKDPQERERRFKIFKENVNYIEAFNNAANKPYTLG 84
Query: 93 TNQFSDLTNDEFRALYTGYK--MPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTP 150
NQF+DLTN+EF A +K M S R+TT FKY+N+ T +P+++DWR KGAVTP
Sbjct: 85 INQFADLTNEEFIAPRNRFKGHMCSSITRTTT---FKYENV--TAIPSTVDWRQKGAVTP 139
Query: 151 IKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFA 209
IK+Q +CGCCWAF+AVAA EGI + +G LI LSEQ+++DC T G + GC GG + AF
Sbjct: 140 IKDQGQCGCCWAFSAVAATEGIHALSAGKLISLSEQEVVDCDTKGEDQGCAGGFMDGAFK 199
Query: 210 YIIQNQGIATEDEYPYQAVPGTCSA-AQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSI 268
+IIQN G+ E YPY+AV G C+A A A I+ YE+VP +E+AL KAV+ QPVS+
Sbjct: 200 FIIQNHGLNNEPNYPYKAVDGKCNAKAAANHVATITGYEDVPVNNEKALQKAVANQPVSV 259
Query: 269 AIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYM 328
AI A ++FQ Y+ G+F G CGT+LDH VT VG+G + DG YWL+KNSWG WG+ GY+
Sbjct: 260 AIDASGSDFQFYQSGVFTGSCGTELDHGVTAVGYGVSADGTEYWLVKNSWGTEWGEEGYI 319
Query: 329 KIVR----DEGLCGIGTRSSYPLA 348
++ R +EGLCGI +SYP A
Sbjct: 320 RMQRGVKAEEGLCGIAMMASYPTA 343
>gi|356543116|ref|XP_003540009.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 337
Score = 344 bits (883), Expect = 3e-92, Method: Compositional matrix adjust.
Identities = 171/337 (50%), Positives = 231/337 (68%), Gaps = 15/337 (4%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
+ ++ LL C SQV+S R HE S+ E HE+WM ++G+ YKD EK+ RL IFK+N+E+
Sbjct: 10 ILALVLLLSICTSQVMS-RYLHEASMSERHEQWMKKYGKVYKDAAEKQKRLLIFKDNVEF 68
Query: 78 IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSST-FKYQNLSMTDV 136
IE N GN+ YKLG N +D TN+EF A + GYK H+++ S T FKY+N+ T V
Sbjct: 69 IESFNAAGNKPYKLGINHLADQTNEEFVASHNGYK-----HKASHSQTPFKYENV--TGV 121
Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGN 196
P ++DWR+ GAVT +K+Q +CG CWAF+ VAA EGI +I + L+ LSEQ+L+DC + +
Sbjct: 122 PNAVDWRENGAVTAVKDQGQCGSCWAFSTVAATEGIYQITTSMLMSLSEQELVDCDSV-D 180
Query: 197 NGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQ 255
+GC GG E F +II+N GI++E YPY AV GTC A ++ + AA+I YE VP+ E
Sbjct: 181 HGCDGGYMEGGFEFIIKNGGISSEANYPYTAVDGTCDANKEASPAAQIKGYETVPANSED 240
Query: 256 ALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIK 315
AL KAV+ QPVS+ I A + FQ Y G+F G CGTQLDH VT VG+G+T+DG YW++K
Sbjct: 241 ALQKAVANQPVSVTIDAGGSAFQFYSSGVFTGQCGTQLDHGVTAVGYGSTDDGTQYWIVK 300
Query: 316 NSWGNTWGDAGYMKIVR----DEGLCGIGTRSSYPLA 348
NSWG WG+ GY+++ R EGLCGI +SYP A
Sbjct: 301 NSWGTQWGEEGYIRMQRGTDAQEGLCGIAMDASYPTA 337
>gi|356577763|ref|XP_003556992.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 344 bits (883), Expect = 3e-92, Method: Compositional matrix adjust.
Identities = 169/324 (52%), Positives = 224/324 (69%), Gaps = 13/324 (4%)
Query: 33 VSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLG 92
V+ R+ + S+ E HE+WM ++ + YKD E+E R KIFKEN+ YIE N N+ Y LG
Sbjct: 25 VTCRTLQDASMYERHEEWMGRYAKVYKDPQERERRFKIFKENVNYIEAFNNAANKPYTLG 84
Query: 93 TNQFSDLTNDEFRALYTGYK--MPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTP 150
NQF+DLTN+EF A +K M S R+TT FKY+N+ T +P+++DWR KGAVTP
Sbjct: 85 INQFADLTNEEFIAPRNRFKGHMCSSITRTTT---FKYENV--TAIPSTVDWRQKGAVTP 139
Query: 151 IKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFA 209
IK+Q +CGCCWAF+AVAA EGI + +G LI LSEQ+++DC T G + GC GG + AF
Sbjct: 140 IKDQGQCGCCWAFSAVAATEGIHALSAGKLISLSEQEVVDCDTKGEDQGCAGGFMDGAFK 199
Query: 210 YIIQNQGIATEDEYPYQAVPGTCSA-AQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSI 268
+IIQN G+ E YPY+AV G C+A A A I+ YE+VP +E+AL KAV+ QPVS+
Sbjct: 200 FIIQNHGLNNEPNYPYKAVDGKCNAKAAANHVATITGYEDVPVNNEKALQKAVANQPVSV 259
Query: 269 AIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYM 328
AI A ++FQ Y+ G+F G CGT+LDH VT VG+G + DG YWL+KNSWG WG+ GY+
Sbjct: 260 AIDASGSDFQFYQSGVFTGSCGTELDHGVTAVGYGVSADGTEYWLVKNSWGTEWGEEGYI 319
Query: 329 KIVR----DEGLCGIGTRSSYPLA 348
++ R +EGLCGI +SYP A
Sbjct: 320 RMQRGVKAEEGLCGIAMMASYPTA 343
>gi|224121800|ref|XP_002330656.1| predicted protein [Populus trichocarpa]
gi|222872260|gb|EEF09391.1| predicted protein [Populus trichocarpa]
Length = 342
Score = 344 bits (883), Expect = 3e-92, Method: Compositional matrix adjust.
Identities = 168/336 (50%), Positives = 220/336 (65%), Gaps = 9/336 (2%)
Query: 19 FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
F L++ + V+SR E + HE+WMA +G+ Y D EKE R KIFK N+EYI
Sbjct: 10 FFAFILILGMWAFEVASRELQESYMSARHEQWMATYGKVYVDAAEKERRFKIFKNNVEYI 69
Query: 79 EKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPT 138
E N GN+ YKL N+F+D TN++F+ GY+ P + R ++FKY+N+ T VP
Sbjct: 70 ESFNTAGNKPYKLSVNKFADQTNEKFKGARNGYRRPFQT-RPMKVTSFKYENV--TAVPA 126
Query: 139 SLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NN 197
++DWR KGAVT IK+Q +CG CWAF+ VAA EGI ++ +G L+ LSEQ+L+DC G +
Sbjct: 127 TMDWRKKGAVTLIKDQGQCGSCWAFSTVAATEGINQLTTGKLVSLSEQELVDCDIQGEDQ 186
Query: 198 GCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTC-SAAQKPAAAKISNYEEVPSGDEQA 256
GC GG E F +II+N GI TE YPYQA GTC S Q AKI+ YE VP+ E
Sbjct: 187 GCEGGLMEDGFEFIIKNHGITTEANYPYQAADGTCNSKKQASHIAKITGYESVPANSEAE 246
Query: 257 LLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKN 316
LLK V+ QP+S++I A ++FQ Y G+F G CGT+LDH VT VG+G T DG YWL+KN
Sbjct: 247 LLKVVANQPISVSIDAGGSDFQFYSSGVFTGKCGTELDHGVTAVGYGETSDGTKYWLVKN 306
Query: 317 SWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
SWG +WG+ GY+++ RD EGLCGI SSYP A
Sbjct: 307 SWGTSWGEEGYIRMQRDIDTEEGLCGIAMDSSYPTA 342
>gi|242072572|ref|XP_002446222.1| hypothetical protein SORBIDRAFT_06g005410 [Sorghum bicolor]
gi|241937405|gb|EES10550.1| hypothetical protein SORBIDRAFT_06g005410 [Sorghum bicolor]
Length = 340
Score = 344 bits (882), Expect = 4e-92, Method: Compositional matrix adjust.
Identities = 163/336 (48%), Positives = 221/336 (65%), Gaps = 12/336 (3%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
+ I+ L + C + + + + ++V HE+WMAQ+ R YKD EK R ++FK N+++
Sbjct: 8 ILAILGLALFCGAALAARDLNDDSAMVARHEQWMAQYNRVYKDATEKAQRFEVFKANVKF 67
Query: 78 IEKANKEGNRTYKLGTNQFSDLTNDEFRALYT--GYKMPSPSHRSTTSSTFKYQNLSMTD 135
IE N GNR + LG NQF+DLTNDEFRA T G+K PSP T F+Y+N+S+
Sbjct: 68 IESFNAGGNRKFWLGVNQFADLTNDEFRATKTNKGFK-PSPVKVPTG---FRYENVSVDA 123
Query: 136 VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG 195
+P S+DWR KGAVTPIK+Q +CGCCWAF+AVAA EGI KI + LI LSEQ+L+DC +G
Sbjct: 124 LPASIDWRTKGAVTPIKDQGQCGCCWAFSAVAATEGIVKISTDKLISLSEQELVDCDVHG 183
Query: 196 -NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDE 254
+ GC GG + AF +II+N G+ TE YPY A G C + +AA I +E+VP+ DE
Sbjct: 184 EDQGCEGGLMDDAFKFIIKNGGLTTESSYPYTATDGKCKSGTN-SAANIKGFEDVPANDE 242
Query: 255 QALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLI 314
AL+KAV+ QPVS+A+ FQ Y G+ G CGT LDH + +G+G T DG YWL+
Sbjct: 243 AALMKAVANQPVSVAVDGGDMTFQLYSGGVMTGSCGTDLDHGIAAIGYGQTSDGTKYWLL 302
Query: 315 KNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
KNSWG TWG+ GY+++ +D G+CG+ SYP
Sbjct: 303 KNSWGTTWGENGYLRMEKDISDKRGMCGLAMEPSYP 338
>gi|400180422|gb|AFP73349.1| cysteine protease [Solanum chmielewskii]
Length = 344
Score = 344 bits (882), Expect = 4e-92, Method: Compositional matrix adjust.
Identities = 171/345 (49%), Positives = 236/345 (68%), Gaps = 13/345 (3%)
Query: 10 SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+ K++ + I + ++S + RS + SV E HE WM++HGR YKDE+EK R
Sbjct: 2 AMKVDLMNILITLFFVISIFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N+F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
FK +LS D+P++LDWR+ GAVT +K+Q +CGCCWAF+AV ++EG KI +GNL++ SE
Sbjct: 120 FKTNDLSDDDMPSNLDWRESGAVTQVKHQGQCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISN 245
Q+LLDC+TN N GC GG AF +II+N GI+ E +Y Y TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQYTCRSQEKTAAVQISS 238
Query: 246 YEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
Y+ VP G E +LL+AV+ QPVSI IAA S + Q Y G ++G C +++HAVT +G+GT
Sbjct: 239 YQVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYSGGTYDGSCADRINHAVTAIGYGTD 296
Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
E+G YWL+KNSWG +WG+ G+MKI+RD GLC I SSYP
Sbjct: 297 EEGQKYWLLKNSWGTSWGENGFMKIIRDSGDPSGLCDIAKMSSYP 341
>gi|356543124|ref|XP_003540013.1| PREDICTED: vignain-like [Glycine max]
gi|356543126|ref|XP_003540014.1| PREDICTED: vignain-like [Glycine max]
Length = 337
Score = 343 bits (881), Expect = 5e-92, Method: Compositional matrix adjust.
Identities = 171/337 (50%), Positives = 230/337 (68%), Gaps = 15/337 (4%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
+ ++ LL C SQV+S R+ HE S+ E HE+WM ++G+ YKD EK+ RL IFK+N+E+
Sbjct: 10 ILALVLLLSICTSQVMS-RNLHEASMSERHEQWMKKYGKVYKDAAEKQKRLLIFKDNVEF 68
Query: 78 IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSST-FKYQNLSMTDV 136
IE N GNR YKL N +D TN+EF A + GYK H+ + S T FKY+N+ T V
Sbjct: 69 IESFNAAGNRPYKLSINHLADQTNEEFVASHNGYK-----HKGSHSQTPFKYENV--TGV 121
Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGN 196
P ++DWR+ GAVT +K+Q +CG CWAF+ VAA EGI +I + L+ LSEQ+L+DC + +
Sbjct: 122 PNAVDWRENGAVTAVKDQGQCGSCWAFSTVAATEGIYQITTSMLMSLSEQELVDCDSV-D 180
Query: 197 NGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQ 255
+GC GG E F +II+N GI++E YPY AV GTC A ++ + AA+I YE VP+ E
Sbjct: 181 HGCDGGYMEGGFEFIIKNGGISSEANYPYTAVDGTCDANKEASPAAQIKGYETVPANSED 240
Query: 256 ALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIK 315
AL KAV+ QPVS+ I A + FQ Y G+F G CGTQLDH VT VG+G+T+DG YW++K
Sbjct: 241 ALQKAVANQPVSVTIDAGGSAFQFYSSGVFTGQCGTQLDHGVTAVGYGSTDDGTQYWIVK 300
Query: 316 NSWGNTWGDAGYMKIVR----DEGLCGIGTRSSYPLA 348
NSWG WG+ GY+++ R EGLCGI +SYP A
Sbjct: 301 NSWGTQWGEEGYIRMQRGTDAQEGLCGIAMDASYPTA 337
>gi|400180357|gb|AFP73317.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 343 bits (881), Expect = 6e-92, Method: Compositional matrix adjust.
Identities = 171/345 (49%), Positives = 235/345 (68%), Gaps = 13/345 (3%)
Query: 10 SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+ K++ + I + ++S + +RS + SV E HE WM++HGR YKDE+EK R
Sbjct: 2 AMKVDLMSILITLFFVISMFNSQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N+F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
FK +LS D+P++LDWR+ GAVT +K+Q CGCCWAF+AV ++EG KI +GNL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISN 245
Q+LLDC+TN N GC GG AF +II+N GI+ E +Y Y TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQYTCRSQEKTAAVQISS 238
Query: 246 YEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
Y+ VP G E +LL+AV+ QPVSI IAA S + Q Y G ++G C +++HAVT +G+GT
Sbjct: 239 YKVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTD 296
Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
E G YWL+KNSWG +WG+ G+MKI+RD GLC I SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGENGFMKIIRDYGNPSGLCDIAKMSSYP 341
>gi|400180426|gb|AFP73351.1| cysteine protease [Solanum corneliomuelleri]
Length = 344
Score = 343 bits (881), Expect = 6e-92, Method: Compositional matrix adjust.
Identities = 171/345 (49%), Positives = 235/345 (68%), Gaps = 13/345 (3%)
Query: 10 SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+ K++ + I + ++S + RS + SV E HE WM++HGR YKDE+EK R
Sbjct: 2 AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N+F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
FK +LS D+P++LDWR+ GAVT +K+Q CGCCWAF+AV ++EG KI +GNL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISN 245
Q+LLDC+TN N GC GG AF +II+N GI+ E +Y Y TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQYTCRSQEKTAAVQISS 238
Query: 246 YEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
Y+ VP G E +LL+AV+ QPVSI IAA S + Q Y G ++G C +++HAVT +G+GT
Sbjct: 239 YQVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTD 296
Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
E+G YWL+KNSWG +WG+ G+MKI+RD GLC I SSYP
Sbjct: 297 ENGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|357446975|ref|XP_003593763.1| Cysteine proteinase [Medicago truncatula]
gi|355482811|gb|AES64014.1| Cysteine proteinase [Medicago truncatula]
Length = 350
Score = 343 bits (880), Expect = 6e-92, Method: Compositional matrix adjust.
Identities = 174/330 (52%), Positives = 229/330 (69%), Gaps = 9/330 (2%)
Query: 22 ITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKA 81
I LL +CA +S R+ E SVVE H++WM ++ R+Y + E E R KIFKENLEYIE
Sbjct: 9 IILLWACAYPTMS-RTLTESSVVEAHQQWMMKYERTYTNSSEMEKRKKIFKENLEYIENF 67
Query: 82 NKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLD 141
N GN++YKLG N++SDLT++EF A +TG+K+ S S NL+ DVPT+ D
Sbjct: 68 NNVGNKSYKLGLNRYSDLTSEEFIASHTGFKVSDQLSDSKMRSVAIPFNLN-DDVPTNFD 126
Query: 142 WRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLG 201
WR+KG VT +KNQ++CGCCWAF AVAAVEGI KI++GNLI LSEQQL+DC ++GC G
Sbjct: 127 WREKGVVTDVKNQRQCGCCWAFTAVAAVEGIVKIKNGNLISLSEQQLVDCDRQ-SSGCGG 185
Query: 202 GSREKAFAYIIQNQGIATEDEYPYQAVP-GTCSAAQKPAAAKISNYEEVPSGDEQALLKA 260
G AF II+++GI ED+YPY+A TC Q P AA+I+ Y +VP+ DEQ LL+A
Sbjct: 186 GDFVLAFDSIIKSRGIVKEDDYPYKANDVQTCQLGQIPGAAQINGYFKVPANDEQQLLRA 245
Query: 261 VSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGN 320
V QPVS+AI+ S +F Y G++ G CG +L+HAVTI+G+G +E G YWLIKNSWG
Sbjct: 246 VLQQPVSVAIST-SYDFHHYMGGVYEGSCGPKLNHAVTIIGYGVSEAGKKYWLIKNSWGE 304
Query: 321 TWGDAGYMKIVRDE----GLCGIGTRSSYP 346
TWG+ GYMK++R+ G C I ++YP
Sbjct: 305 TWGEKGYMKVLRESSATGGQCSIAVHAAYP 334
>gi|400180435|gb|AFP73355.1| cysteine protease [Solanum pennellii]
Length = 344
Score = 343 bits (880), Expect = 7e-92, Method: Compositional matrix adjust.
Identities = 171/345 (49%), Positives = 235/345 (68%), Gaps = 13/345 (3%)
Query: 10 SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+ K++ + I + ++S + +RS + SV E HE WM++HGR YKDE+EK R
Sbjct: 2 AMKVDLMSILITLFFVISMFNTQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N+F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYVSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
FK +LS D+P++LDWR+ GAVT +KNQ +CGCCWAF+AV ++EG KI +GNL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKNQGQCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISN 245
Q+LLDC+TN N GC GG AF +I +N GI+ E +Y Y TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGQQYTCRSQEKTAAVQISS 238
Query: 246 YEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
Y+ VP G E +LL+AV+ QPVSI IAA S + Q Y G ++G C +++HAVT +G+GT
Sbjct: 239 YQVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCANRINHAVTAIGYGTD 296
Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRDE----GLCGIGTRSSYP 346
E G YWL+KNSWG +WG+ G+MKI+RD GLC I SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGEDGFMKIIRDSGNPAGLCDIAKVSSYP 341
>gi|400180355|gb|AFP73316.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 343 bits (880), Expect = 8e-92, Method: Compositional matrix adjust.
Identities = 171/345 (49%), Positives = 234/345 (67%), Gaps = 13/345 (3%)
Query: 10 SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+ K++ + I + ++S + RS + SV E HE WM++HGR YKDE+EK R
Sbjct: 2 AMKVDLMSILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N+F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
FK +LS D+P++LDWR+ GAVT +K+Q CGCCWAF+AV ++EG KI +GNL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISN 245
Q+LLDC+TN N GC GG AF +II+N GI+ E +Y Y TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCRSQEKTAAVQISS 238
Query: 246 YEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
Y+ VP G E +LL+AV+ QPVSI IAA S + Q Y G ++G C +++HAVT +G+GT
Sbjct: 239 YQVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTD 296
Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
E G YWL+KNSWG +WG+ G+MKI+RD GLC I SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|388497270|gb|AFK36701.1| unknown [Lotus japonicus]
Length = 343
Score = 343 bits (880), Expect = 8e-92, Method: Compositional matrix adjust.
Identities = 183/340 (53%), Positives = 242/340 (71%), Gaps = 14/340 (4%)
Query: 17 PMFIIITLLVSCASQVVSSRSTHEQS--VVEIHEKWMAQHGRSYKDELEKEMRLKIFKEN 74
P+ + T+L +CA +S E S V + H++WM Q+GRSY ++ E E R KIF EN
Sbjct: 6 PIIALCTMLWACAYTAMSRTLYDETSSVVAKTHQQWMLQYGRSYTNDAEMEKRFKIFMEN 65
Query: 75 LEYIEKANKE-GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
LEYIEK N GN++YKL NQFSDLTN+EF A +TG M PS S++S +L +
Sbjct: 66 LEYIEKFNNAPGNKSYKLDLNQFSDLTNEEFIASHTGL-MIDPSKPSSSSKRASPASLDL 124
Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST 193
+D PTSLDWR++GAVT +KNQ CG CWAF+AVAAVEGI KI++GNLI LSEQQL+DC++
Sbjct: 125 SDTPTSLDWREQGAVTDVKNQGNCGSCWAFSAVAAVEGIVKIKNGNLISLSEQQLVDCAS 184
Query: 194 N-GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQK-PAAAKISNYEEVPS 251
N N GC GG + AF+YI +N GIA+E++Y Y+ GTC + AA+IS YE+VP+
Sbjct: 185 NEQNQGCGGGFMDNAFSYITEN-GIASENDYQYRGGAGTCQNNEMITPAARISGYEDVPA 243
Query: 252 GDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTT-EDGAN 310
G++Q LL AVS QPVS+AIA F YKEGI++G CG+ L+H VT+VG+GT+ EDG
Sbjct: 244 GEDQLLL-AVSQQPVSVAIAV-GQSFHLYKEGIYSGPCGSSLNHGVTLVGYGTSEEDGTK 301
Query: 311 YWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
YWLIKNSWG +WG+ GYM+++R+ EG CGI ++S+P
Sbjct: 302 YWLIKNSWGESWGENGYMRLLRESGQSEGHCGIAVKASHP 341
>gi|400180428|gb|AFP73352.1| cysteine protease [Solanum corneliomuelleri]
Length = 344
Score = 343 bits (880), Expect = 8e-92, Method: Compositional matrix adjust.
Identities = 171/345 (49%), Positives = 235/345 (68%), Gaps = 13/345 (3%)
Query: 10 SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+ K++ + I + ++S + +RS + SV E HE WM++HGR YKDE+EK R
Sbjct: 2 AMKVDLMNILITLFFVISMFNTQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N+F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
FK +LS D+P++LDWR+ GAVT +K+Q CGCCWAF+AV ++EG KI +GNL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISN 245
Q+LLDC+TN N GC GG AF +II+N GI+ E +Y Y TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCRSQEKTAAVQISS 238
Query: 246 YEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
Y+ VP G E +LL+AV+ QPVSI IAA S + Q Y G ++G C +++HAVT +G+GT
Sbjct: 239 YQVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTD 296
Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
E G YWL+KNSWG +WG+ G+MKI+RD GLC I SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|400180345|gb|AFP73311.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 343 bits (880), Expect = 8e-92, Method: Compositional matrix adjust.
Identities = 171/345 (49%), Positives = 235/345 (68%), Gaps = 13/345 (3%)
Query: 10 SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+ K++ + I + ++S + RS + SV E HE WM++HGR YKDE+EK R
Sbjct: 2 AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N+F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
FK +LS D+P++LDWR+ GAVT +K+Q CGCCWAF+AV ++EG KI +GNL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISN 245
Q+LLDC+TN N GC GG AF +II+N GI+ E +Y Y TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCDGGFMTNAFDFIIENGGISRESDYEYLGQQYTCRSQEKTAAVQISS 238
Query: 246 YEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
Y+ VP G E +LL+AV+ QPVSI IAA S + Q Y G ++G C +++HAVT +G+GT
Sbjct: 239 YQVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTD 296
Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
E+G YWL+KNSWG +WG+ G+MKI+RD GLC I SSYP
Sbjct: 297 ENGQKYWLLKNSWGTSWGENGFMKIIRDYGNPAGLCDIAKMSSYP 341
>gi|400180453|gb|AFP73363.1| cysteine protease [Solanum chilense]
Length = 344
Score = 343 bits (879), Expect = 9e-92, Method: Compositional matrix adjust.
Identities = 171/345 (49%), Positives = 235/345 (68%), Gaps = 13/345 (3%)
Query: 10 SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+ K++ + I + ++S + +RS + SV E HE WM++HGR YKDE+EK R
Sbjct: 2 AMKVDLMNILITLFFVISMFNTQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N+F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPVSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
FK +LS D+P++LDWR+ GAVT +K+Q CGCCWAF+AV ++EG KI +GNL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISN 245
Q+LLDC+TN N GC GG AF +II+N GI+ E +Y Y TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCRSQEKTAAVQISS 238
Query: 246 YEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
Y+ VP G E +LL+AV+ QPVSI IAA S + Q Y G ++G C +++HAVT +G+GT
Sbjct: 239 YQVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTD 296
Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
E G YWL+KNSWG +WG+ G+MKI+RD GLC I SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|400180347|gb|AFP73312.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 342 bits (878), Expect = 1e-91, Method: Compositional matrix adjust.
Identities = 171/345 (49%), Positives = 234/345 (67%), Gaps = 13/345 (3%)
Query: 10 SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+ K++ + I + ++S + RS + SV E HE WM++HGR YKDE+EK R
Sbjct: 2 AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N+F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
FK +LS D+P++LDWR+ GAVT +K+Q CGCCWAF+AV ++EG KI +GNL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISN 245
Q+LLDC+TN N GC GG AF +II+N GI+ E +Y Y TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCDGGFMTNAFDFIIENGGISRESDYEYLGQQYTCRSQEKTAAVQISS 238
Query: 246 YEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
Y+ VP G E +LL+AV+ QPVSI IAA S + Q Y G ++G C +++HAVT +G+GT
Sbjct: 239 YQVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTD 296
Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
E G YWL+KNSWG +WG+ G+MKI+RD GLC I SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|400180365|gb|AFP73321.1| cysteine protease [Solanum peruvianum]
gi|400180395|gb|AFP73336.1| cysteine protease [Solanum peruvianum]
gi|400180405|gb|AFP73341.1| cysteine protease [Solanum peruvianum]
gi|400180409|gb|AFP73343.1| cysteine protease [Solanum peruvianum]
gi|400180411|gb|AFP73344.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 342 bits (878), Expect = 1e-91, Method: Compositional matrix adjust.
Identities = 171/345 (49%), Positives = 234/345 (67%), Gaps = 13/345 (3%)
Query: 10 SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+ K++ + I + ++S + RS + SV E HE WM++HGR YKDE+EK R
Sbjct: 2 AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N+F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
FK +LS D+P++LDWR+ GAVT +K+Q CGCCWAF+AV ++EG KI +GNL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISN 245
Q+LLDC+TN N GC GG AF +II+N GI+ E +Y Y TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQYTCRSQEKTAAVQISS 238
Query: 246 YEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
Y+ VP G E +LL+AV+ QPVSI IAA S + Q Y G ++G C +++HAVT +G+GT
Sbjct: 239 YKVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTD 296
Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
E G YWL+KNSWG +WG+ G+MKI+RD GLC I SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|400180451|gb|AFP73362.1| cysteine protease [Solanum chilense]
Length = 344
Score = 342 bits (878), Expect = 1e-91, Method: Compositional matrix adjust.
Identities = 171/345 (49%), Positives = 234/345 (67%), Gaps = 13/345 (3%)
Query: 10 SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+ K++ + I + ++S + RS E SV E HE WM++HGR YKDE+EK R
Sbjct: 2 AMKVDLMNILITLFFVISMFNTQTRGRSQPELSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N+F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
FK +LS D+P++LDWR+ GAVT +K+Q CGCCWAF+AV ++EG KI +GNL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISN 245
Q+LLDC+TN N GC GG AF +I +N GI++E +Y Y TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCDGGFMTNAFDFIKENGGISSESDYEYLGQQYTCRSQEKTAAVQISS 238
Query: 246 YEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
Y+ VP G E +LL+AV+ QPVSI IAA S + Q Y G ++G C +++HAVT +G+GT
Sbjct: 239 YQVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTD 296
Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
E G YWL+KNSWG +WG+ G+MKI+RD GLC I SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGENGFMKIIRDSGDPSGLCDIAKMSSYP 341
>gi|400180445|gb|AFP73359.1| cysteine protease, partial [Solanum chilense]
Length = 345
Score = 342 bits (877), Expect = 1e-91, Method: Compositional matrix adjust.
Identities = 171/345 (49%), Positives = 234/345 (67%), Gaps = 13/345 (3%)
Query: 10 SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+ K++ + I + ++S + RS E SV E HE WM++HGR YKDE+EK R
Sbjct: 2 AMKVDLMNILITLFFVISMFNTQTRGRSQPELSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N+F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
FK +LS D+P++LDWR+ GAVT +K+Q CGCCWAF+AV ++EG KI +GNL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISN 245
Q+LLDC+TN N GC GG AF +I +N GI++E +Y Y TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCDGGFMTNAFDFIKENGGISSESDYEYLGQQYTCRSQEKTAAVQISS 238
Query: 246 YEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
Y+ VP G E +LL+AV+ QPVSI IAA S + Q Y G ++G C +++HAVT +G+GT
Sbjct: 239 YQVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTD 296
Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
E G YWL+KNSWG +WG+ G+MKI+RD GLC I SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGENGFMKIIRDSGDPSGLCDIAKMSSYP 341
>gi|356517350|ref|XP_003527350.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
gi|356577765|ref|XP_003556993.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 343
Score = 342 bits (877), Expect = 1e-91, Method: Compositional matrix adjust.
Identities = 170/325 (52%), Positives = 227/325 (69%), Gaps = 15/325 (4%)
Query: 33 VSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLG 92
V+SR+ + S+ E HE+WMA++G+ YKD EKE R ++FKEN+ YIE N N+ YKLG
Sbjct: 25 VASRTLQDASMYERHEQWMARYGKVYKDPEEKEKRFRVFKENVNYIEAFNNAANKPYKLG 84
Query: 93 TNQFSDLTNDEF---RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVT 149
NQF+DLT++EF R + G+ S +T ++TFKY+N+++ +P S+DWR KGAVT
Sbjct: 85 INQFADLTSEEFIVPRNRFNGHTRSS----NTRTTTFKYENVTV--LPDSIDWRQKGAVT 138
Query: 150 PIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAF 208
PIKNQ CGCCWAF+A+AA EGI KI +G L+ LSEQ+++DC T G ++GC GG + AF
Sbjct: 139 PIKNQGSCGCCWAFSAIAATEGIHKISTGKLVSLSEQEVVDCDTKGTDHGCEGGYMDGAF 198
Query: 209 AYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVS 267
+IIQN GI TE YPY+ V G C+ ++ AA I+ YE+VP +E+AL KAV+ QPVS
Sbjct: 199 KFIIQNHGINTEASYPYKGVDGKCNIKEEAVHAATITGYEDVPINNEKALQKAVANQPVS 258
Query: 268 IAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGY 327
+AI A +FQ YK GIF G CGT+LDH VT VG+G +G YWL+KNSWG WG+ GY
Sbjct: 259 VAIDASGADFQFYKSGIFTGSCGTELDHGVTAVGYGENNEGTKYWLVKNSWGTEWGEEGY 318
Query: 328 MKIVRD----EGLCGIGTRSSYPLA 348
+ + R EG+CGI +SYP A
Sbjct: 319 IMMQRGVKAVEGICGIAMMASYPTA 343
>gi|400180389|gb|AFP73333.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 342 bits (877), Expect = 1e-91, Method: Compositional matrix adjust.
Identities = 171/345 (49%), Positives = 234/345 (67%), Gaps = 13/345 (3%)
Query: 10 SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+ K++ + I + ++S + RS + SV E HE WM++HGR YKDE+EK R
Sbjct: 2 AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N+F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
FK +LS D+P++LDWR+ GAVT +K+Q CGCCWAF+AV ++EG KI +GNL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISN 245
Q+LLDC+TN N GC GG AF +II+N GI+ E +Y Y TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCRSQEKTAAVQISS 238
Query: 246 YEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
Y+ VP G E +LL+AV+ QPVSI IAA S + Q Y G ++G C +++HAVT +G+GT
Sbjct: 239 YKVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTD 296
Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
E G YWL+KNSWG +WG+ G+MKI+RD GLC I SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|400180367|gb|AFP73322.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 342 bits (877), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 171/345 (49%), Positives = 234/345 (67%), Gaps = 13/345 (3%)
Query: 10 SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+ K++ + I + ++S + RS + SV E HE WM++HGR YKDE+EK R
Sbjct: 2 AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKVERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N+F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
FK +LS D+P++LDWR+ GAVT +K+Q CGCCWAF+AV ++EG KI +GNL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISN 245
Q+LLDC+TN N GC GG AF +II+N GI+ E +Y Y TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCRSQEKTAAVQISS 238
Query: 246 YEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
Y+ VP G E +LL+AV+ QPVSI IAA S + Q Y G ++G C +++HAVT +G+GT
Sbjct: 239 YKVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTD 296
Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
E G YWL+KNSWG +WG+ G+MKI+RD GLC I SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGENGFMKIIRDYGNPAGLCDIAKMSSYP 341
>gi|400180449|gb|AFP73361.1| cysteine protease [Solanum chilense]
Length = 344
Score = 342 bits (876), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 170/345 (49%), Positives = 235/345 (68%), Gaps = 13/345 (3%)
Query: 10 SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+ K++ + I + ++S + RS + SV E HE WM++HGR YKDE+EK R
Sbjct: 2 AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFK+N+++IE NK GN +YKLG N+F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKKNMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
FK +LS D+P++LDWR+ GAVT +K+Q +CGCCWAF+AV ++EG KI +G L++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGQCGCCWAFSAVGSLEGAYKIATGKLMEFSE 179
Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISN 245
Q+LLDC+TN N GC GG AF +II+N GI+ E +Y Y TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCRSQEKTAAVQISS 238
Query: 246 YEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
Y+ VP G E +LL+AV+ QPVSI IAA S + Q Y EG ++G C +++HAVT +G+GT
Sbjct: 239 YQVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAEGTYDGSCADRINHAVTAIGYGTD 296
Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
E G YWL+KNSWG +WG+ G+MKI+RD GLC I SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|310656789|gb|ADP02218.1| Peptidase_C1 domain-containing protein [Triticum aestivum]
Length = 341
Score = 342 bits (876), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 160/335 (47%), Positives = 221/335 (65%), Gaps = 9/335 (2%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
+ I+ + C+S V+S+R + ++VE HE+WMA+ R YKD EK R ++FK N+ +
Sbjct: 8 LLAIVGCICLCSSAVLSARELGDTAMVERHEQWMAKFNRVYKDGTEKAQRFEVFKANVAF 67
Query: 78 IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
IE N E NR + LG NQF+DLTNDEFRA T + R+ T FKY N+S+ +P
Sbjct: 68 IESFNAE-NRKFWLGVNQFTDLTNDEFRATKTNKGLKMSGGRAPTG--FKYSNVSIDALP 124
Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-N 196
T++DWR KG VTPIK+Q +CGCCWAF+AV A EGI K+ +G LI LSEQ+L+DC +G +
Sbjct: 125 TAVDWRTKGVVTPIKDQGQCGCCWAFSAVVATEGIVKLSTGKLISLSEQELVDCDVHGVD 184
Query: 197 NGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTC-SAAQKPAAAKISNYEEVPSGDEQ 255
GC GG + AF +II+N G+ TE YPY A G C ++ + A I YE+VP+ DE
Sbjct: 185 QGCEGGEMDDAFKFIIKNGGLTTEANYPYTAQDGQCKTSIASNSVATIKGYEDVPANDES 244
Query: 256 ALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIK 315
+L+KAV+ QPVS+A+ FQ Y G+ G CGT LDH + +G+G T DG YWL+K
Sbjct: 245 SLMKAVANQPVSVAVDGGDVIFQHYSGGVMTGSCGTDLDHGIAAIGYGMTSDGTKYWLLK 304
Query: 316 NSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
NSWG TWG++GY+++ +D G+CG+ + SYP
Sbjct: 305 NSWGTTWGESGYLRMEKDISDKSGMCGLAMQPSYP 339
>gi|400180375|gb|AFP73326.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 342 bits (876), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 170/345 (49%), Positives = 235/345 (68%), Gaps = 13/345 (3%)
Query: 10 SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+ K++ + I + ++S + +RS + SV E HE WM++HGR YKDE+EK R
Sbjct: 2 AMKVDLMSILITLFFVISMFNSQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N+F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENIKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
FK +LS D+P++LDWR+ GAVT +K+Q CGCCWAF+AV ++EG KI +GNL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISN 245
Q+LLDC+TN N GC GG AF +I +N GI++E +Y Y TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCDGGFMTNAFDFIKENGGISSESDYEYLGEQYTCRSQEKTAAVQISS 238
Query: 246 YEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
Y+ VP G E +LL+AV+ QPVSI IAA S + Q Y G ++G C +++HAVT +G+GT
Sbjct: 239 YQVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTD 296
Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRDE----GLCGIGTRSSYP 346
E G YWL+KNSWG +WG+ G+MKI+RD GLC I SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPAGLCDIAKMSSYP 341
>gi|18422605|ref|NP_568651.1| senescence-associated protein 12 [Arabidopsis thaliana]
gi|13877737|gb|AAK43946.1|AF370131_1 putative senescence-specific cysteine protease SAG12 [Arabidopsis
thaliana]
gi|9758936|dbj|BAB09317.1| senescence-specific cysteine protease [Arabidopsis thaliana]
gi|14532898|gb|AAK64131.1| putative senescence-specific cysteine protease SAG12 [Arabidopsis
thaliana]
gi|332007929|gb|AED95312.1| senescence-associated protein 12 [Arabidopsis thaliana]
Length = 346
Score = 342 bits (876), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 168/338 (49%), Positives = 224/338 (66%), Gaps = 10/338 (2%)
Query: 18 MFIIITLLVS-CASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLE 76
+F+ + + S C S +S +E + + H +WM +HGR Y D E+ R +FK N+E
Sbjct: 8 IFLFVAIFSSFCFSITLSRPLDNELIMQKRHIEWMTKHGRVYADVKEENNRYVVFKNNVE 67
Query: 77 YIEKANK-EGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSP--SHRSTTSSTFKYQNLSM 133
IE N RT+KL NQF+DLTNDEFR++YTG+K S S T S F+YQN+S
Sbjct: 68 RIEHLNSIPAGRTFKLAVNQFADLTNDEFRSMYTGFKGVSALSSQSQTKMSPFRYQNVSS 127
Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST 193
+P S+DWR KGAVTPIKNQ CGCCWAF+AVAA+EG T+I+ G LI LSEQQL+DC T
Sbjct: 128 GALPVSVDWRKKGAVTPIKNQGSCGCCWAFSAVAAIEGATQIKKGKLISLSEQQLVDCDT 187
Query: 194 NGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQ-KPAAAKISNYEEVPSG 252
N + GC GG + AF +I G+ TE YPY+ TC++ + P A I+ YE+VP
Sbjct: 188 N-DFGCEGGLMDTAFEHIKATGGLTTESNYPYKGEDATCNSKKTNPKATSITGYEDVPVN 246
Query: 253 DEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYW 312
DEQAL+KAV+ QPVS+ I +FQ Y G+F G C T LDHAVT +G+G + +G+ YW
Sbjct: 247 DEQALMKAVAHQPVSVGIEGGGFDFQFYSSGVFTGECTTYLDHAVTAIGYGESTNGSKYW 306
Query: 313 LIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
+IKNSWG WG++GYM+I +D +GLCG+ ++SYP
Sbjct: 307 IIKNSWGTKWGESGYMRIQKDVKDKQGLCGLAMKASYP 344
>gi|357160300|ref|XP_003578721.1| PREDICTED: oryzain beta chain-like [Brachypodium distachyon]
Length = 349
Score = 342 bits (876), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 161/344 (46%), Positives = 229/344 (66%), Gaps = 14/344 (4%)
Query: 17 PMFIIITLLVSC---ASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKE 73
P ++ +++ C S V+S+R + ++VE HE+WMAQHGR YKD EK R + F+
Sbjct: 4 PKVFLLAVVLGCICLCSTVLSARELGDAAMVERHEQWMAQHGRVYKDGAEKARRFEAFRN 63
Query: 74 NLEYIEKANKEGNR-TYKLGTNQFSDLTNDEFRALYT--GY--KMPSPSHRSTTSSTFKY 128
N+ +IE N GNR + LG NQF+DLTNDEFRA T G+ + + ++++ + TF+Y
Sbjct: 64 NVVFIESFNAAGNRRKFWLGVNQFTDLTNDEFRATKTNKGFIKRNAAAVNKASPTGTFRY 123
Query: 129 QNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQL 188
N+S +P ++DWR KGAVTPIKNQ +CGCCWAF+AVAA EGI ++ +G L+ LSEQ+L
Sbjct: 124 SNVSADALPAAVDWRAKGAVTPIKNQGQCGCCWAFSAVAATEGIVQLSTGKLVPLSEQEL 183
Query: 189 LDCSTNG-NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQK-PAAAKISNY 246
+DC NG ++GC GG + AF +II+N G+ +E YPY A G C A + A I Y
Sbjct: 184 VDCDANGADHGCEGGEMDDAFEFIIKNGGLTSETNYPYTAQDGQCKAKNTINSVATIKGY 243
Query: 247 EEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTE 306
E+VP+ DE +L+KAV+ QPVS+A+ FQ Y G+ +G CGT LDH + VG+G +
Sbjct: 244 EDVPANDEASLMKAVAAQPVSVAVDGGDMVFQHYAGGVLSGSCGTSLDHGIVAVGYGAAD 303
Query: 307 DGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
DG +WL+KNSWG TWG+ GY+++ +D G+CG+ + SYP
Sbjct: 304 DGTKFWLMKNSWGTTWGEDGYIRMEKDVADAGGMCGLAMQPSYP 347
>gi|400180381|gb|AFP73329.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 341 bits (875), Expect = 3e-91, Method: Compositional matrix adjust.
Identities = 171/345 (49%), Positives = 234/345 (67%), Gaps = 13/345 (3%)
Query: 10 SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+ K++ + I + ++S + RS + SV E HE WM++HGR YKDE+EK R
Sbjct: 2 AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N+F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
FK +LS D+P++LDWR+ GAVT +K+Q CGCCWAF+AV ++EG KI +GNL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISN 245
Q+LLDC+TN N GC GG AF +II+N GI+ E +Y Y TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQYTCRSQEKTAAVQISS 238
Query: 246 YEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
Y+ VP G E +LL+AV+ QPVSI IAA S + Q Y G ++G C +++HAVT +G+GT
Sbjct: 239 YKVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTD 296
Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
E G YWL+KNSWG +WG+ G+MKI+RD GLC I SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGENGFMKIIRDSGDPSGLCDITKMSSYP 341
>gi|242072398|ref|XP_002446135.1| hypothetical protein SORBIDRAFT_06g002170 [Sorghum bicolor]
gi|241937318|gb|EES10463.1| hypothetical protein SORBIDRAFT_06g002170 [Sorghum bicolor]
Length = 338
Score = 341 bits (875), Expect = 3e-91, Method: Compositional matrix adjust.
Identities = 159/339 (46%), Positives = 227/339 (66%), Gaps = 9/339 (2%)
Query: 13 INTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFK 72
+++ + I S S V+++R + ++VE HE WM ++GR YKD EK R ++FK
Sbjct: 2 VSSKAFLLAILGCASLCSSVLAARELSDAAMVERHENWMVEYGRVYKDAAEKARRFEVFK 61
Query: 73 ENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
+N+ ++E N N + LG NQF+DLT +EF+A G+K S TT FKY+NLS
Sbjct: 62 DNVAFVESFNTNKNNKFWLGINQFADLTIEEFKA-NKGFKPISAEKVPTTG--FKYENLS 118
Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
++ +PT++DWR KGAVTPIKNQ +CGCCWAF+AVAA+EGI K+ +GNLI LSEQ+L+DC
Sbjct: 119 VSALPTAVDWRTKGAVTPIKNQGQCGCCWAFSAVAAMEGIVKLSTGNLISLSEQELVDCD 178
Query: 193 TNG-NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPS 251
T+ + GC GG + AF ++I+N G+AT YPY+AV G C K +AA I +E+VP
Sbjct: 179 THSMDEGCEGGWMDSAFEFVIKNGGLATVSSYPYKAVDGKCKGGSK-SAATIKGHEDVPV 237
Query: 252 GDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANY 311
DE AL+KAV+ QPVS+A+ A F Y G+ G CGT+LDH + +G+G DG Y
Sbjct: 238 NDEAALMKAVANQPVSVAVDASDRTFMLYSGGVMTGSCGTELDHGIAAIGYGVESDGTKY 297
Query: 312 WLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
W++KNSWG TWG+ G++++ +D +G+CG+ + SYP
Sbjct: 298 WILKNSWGTTWGEKGFLRMEKDISDKQGMCGLAMKPSYP 336
>gi|413944253|gb|AFW76902.1| hypothetical protein ZEAMMB73_056195 [Zea mays]
Length = 340
Score = 341 bits (874), Expect = 3e-91, Method: Compositional matrix adjust.
Identities = 160/336 (47%), Positives = 220/336 (65%), Gaps = 12/336 (3%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
+ +++ C + + + + ++V HE+WMAQ+ R YKD EK R ++FK N+++
Sbjct: 8 ILAVLSFAFFCGAALAARDLNEDSAMVARHEQWMAQYSRVYKDAAEKARRFEVFKANVKF 67
Query: 78 IEKANKEGNRTYKLGTNQFSDLTNDEFRALYT--GYKMPSPSHRSTTSSTFKYQNLSMTD 135
IE N GNR + LG NQF+DLTNDEFR T G+K PS ST F+Y+N+S+
Sbjct: 68 IESFNTGGNRKFWLGINQFADLTNDEFRTTKTNKGFK-PSLDKVSTG---FRYENVSVDA 123
Query: 136 VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG 195
+P ++DWR GAVTPIK+Q +CGCCWAF+AVAA EGI KI +G LI LSEQ+L+DC +G
Sbjct: 124 IPATIDWRTNGAVTPIKDQGQCGCCWAFSAVAATEGIVKISTGKLISLSEQELVDCDVHG 183
Query: 196 -NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDE 254
+ GC GG + AF +II+N G+ TE YPY A G C + +AA I YE+VP+ DE
Sbjct: 184 EDQGCEGGLMDDAFKFIIKNGGLTTESNYPYTAADGKCKSGSN-SAANIKGYEDVPTNDE 242
Query: 255 QALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLI 314
AL+KAV+ QPVS+A+ FQ Y G+ G CGT LDH + +G+G T DG YWL+
Sbjct: 243 AALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYGKTSDGTKYWLM 302
Query: 315 KNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
KNSWG TWG+ GY+++ +D +G+CG+ SYP
Sbjct: 303 KNSWGTTWGENGYLRMEKDISDKKGMCGLAMEPSYP 338
>gi|400180359|gb|AFP73318.1| cysteine protease [Solanum peruvianum]
gi|400180477|gb|AFP73375.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 341 bits (874), Expect = 4e-91, Method: Compositional matrix adjust.
Identities = 170/345 (49%), Positives = 234/345 (67%), Gaps = 13/345 (3%)
Query: 10 SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+ K++ + I + ++S + +RS + SV E HE WM++HGR YKDE+EK R
Sbjct: 2 AMKVDLMSILITLFFVISMFNSQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N+F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
FK +LS D+P++LDWR+ GAVT +K+Q CGCCWAF+AV ++EG KI +GNL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISN 245
Q+LLDC+TN N GC GG AF +I +N GI+ E +Y Y TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQEKTAAVQISS 238
Query: 246 YEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
Y+ VP G E +LL+AV+ QPVSI IAA S + Q Y G ++G C +++HAVT +G+GT
Sbjct: 239 YQVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTD 296
Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
E G YWL+KNSWG +WG+ G+MKI+RD GLC I SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGENGFMKIIRDYGNPAGLCDIAKMSSYP 341
>gi|413953667|gb|AFW86316.1| hypothetical protein ZEAMMB73_635707 [Zea mays]
Length = 340
Score = 341 bits (874), Expect = 4e-91, Method: Compositional matrix adjust.
Identities = 156/334 (46%), Positives = 218/334 (65%), Gaps = 8/334 (2%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
+ I+ C + + + + + ++V HE+WMAQ+ R YKD EK R ++FK N+++
Sbjct: 8 ILAILGFAFFCGAALAARDLSDDSAMVARHEQWMAQYSRVYKDASEKARRFEVFKANVKF 67
Query: 78 IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
IE N GN + LG NQF+DLTNDEFR++ T S + + T F+Y+N+S+ +P
Sbjct: 68 IESFNAGGNNKFWLGVNQFADLTNDEFRSIKTNKGFKSSNMKIPTG--FRYENVSVDALP 125
Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-N 196
T++DWR KGAVTPIK+Q +CGCCWAF+AVAA EGI KI +G L+ L+EQ+L+DC +G +
Sbjct: 126 TTIDWRTKGAVTPIKDQGQCGCCWAFSAVAATEGIVKISTGKLVSLAEQELVDCDVHGED 185
Query: 197 NGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQA 256
GC GG + AF +II N G+ TE YPY A G C + +AA I YE+VP+ DE A
Sbjct: 186 QGCEGGLMDDAFKFIINNGGLTTESSYPYTAADGKCKSGSN-SAATIKGYEDVPANDEAA 244
Query: 257 LLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKN 316
L+KAV+ QPVS+A+ FQ Y G+ G CGT LDH + +G+G T DG YWL+KN
Sbjct: 245 LMKAVANQPVSVAVDGGDMTFQFYSSGVMTGSCGTDLDHGIAAIGYGKTSDGTKYWLMKN 304
Query: 317 SWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
SWG TWG+ GY+++ +D G+CG+ SYP
Sbjct: 305 SWGTTWGENGYLRMEKDISDKRGMCGLAMEPSYP 338
>gi|400180463|gb|AFP73368.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 340 bits (873), Expect = 4e-91, Method: Compositional matrix adjust.
Identities = 170/345 (49%), Positives = 234/345 (67%), Gaps = 13/345 (3%)
Query: 10 SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+ K++ + I + ++S + +RS + SV E HE WM++HGR YKDE+EK R
Sbjct: 2 AMKVDLMNILITLFFVISMFNTQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N+F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
FK +LS D+P++LDWR+ GAVT +K+Q CGCCWAF+AV ++EG KI +GNL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISN 245
Q+LLDC+TN N GC GG AF +I +N GI+ E +Y Y TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQEKTAAVQISS 238
Query: 246 YEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
Y+ VP G E +LL+AV+ QPVSI IAA S + Q Y G ++G C +++HAVT +G+GT
Sbjct: 239 YQVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTD 296
Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
E G YWL+KNSWG +WG+ G+MKI+RD GLC I SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGENGFMKIIRDSGDPSGLCDIAKMSSYP 341
>gi|356517358|ref|XP_003527354.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
gi|356577767|ref|XP_003556994.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 343
Score = 340 bits (873), Expect = 5e-91, Method: Compositional matrix adjust.
Identities = 168/324 (51%), Positives = 223/324 (68%), Gaps = 13/324 (4%)
Query: 33 VSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLG 92
V+ R+ + S+ E HE+WM ++ + YKD E+E R KIFKEN+ YIE N N+ Y LG
Sbjct: 25 VTCRTLQDASMYERHEEWMGRYAKVYKDPQERERRFKIFKENVNYIEAFNNAANKPYTLG 84
Query: 93 TNQFSDLTNDEFRALYTGYK--MPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTP 150
NQF+DLTN+EF A +K M S R+TT FKY+N+ T +P+++DWR KGAVTP
Sbjct: 85 INQFADLTNEEFIAPRNRFKGHMCSSITRTTT---FKYENV--TAIPSTVDWRQKGAVTP 139
Query: 151 IKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFA 209
IK+Q +CGCCWAF+AVAA EGI + +G LI LSEQ+++DC T G + GC GG + AF
Sbjct: 140 IKDQGQCGCCWAFSAVAATEGIHALSAGKLISLSEQEVVDCDTKGEDQGCAGGFMDGAFK 199
Query: 210 YIIQNQGIATEDEYPYQAVPGTCSA-AQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSI 268
+IIQN G+ E YPY+AV G C+A A A I+ YE+VP +E+AL KAV+ QPVS+
Sbjct: 200 FIIQNHGLNNEPNYPYKAVDGKCNAKAAANHVATITGYEDVPVNNEKALQKAVANQPVSV 259
Query: 269 AIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYM 328
AI A ++FQ Y+ G+F G CGT+LDH VT VG+G + DG YWL+KNSWG WG+ GY+
Sbjct: 260 AIDASGSDFQFYQSGVFTGSCGTELDHGVTAVGYGVSADGTEYWLVKNSWGTEWGEEGYI 319
Query: 329 KIVR----DEGLCGIGTRSSYPLA 348
++ R +EGL GI +SYP A
Sbjct: 320 RMQRGVKAEEGLXGIAMMASYPTA 343
>gi|20334373|gb|AAM19207.1|AF493232_1 cysteine protease [Solanum pimpinellifolium]
gi|400180424|gb|AFP73350.1| cysteine protease [Solanum pimpinellifolium]
gi|400180433|gb|AFP73354.1| cysteine protease [Solanum lycopersicum]
Length = 344
Score = 340 bits (872), Expect = 5e-91, Method: Compositional matrix adjust.
Identities = 171/345 (49%), Positives = 234/345 (67%), Gaps = 13/345 (3%)
Query: 10 SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+ K++ + I + ++S + RS + SV E HE WM++HGR YKDE+EK R
Sbjct: 2 AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N+F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
FK +LS +P++LDWR+ GAVT +K+Q CGCCWAF+AV ++EG KI +GNL++ SE
Sbjct: 120 FKINDLSDDYMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISN 245
Q+LLDC+TN N GC GG AF +II+N GI+ E +Y Y TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCNGGLMTNAFDFIIENGGISRESDYEYLGEQYTCRSREKTAAVQISS 238
Query: 246 YEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
Y+ VP G E +LL+AV+ QPVSI IAA S + Q Y G ++G C Q++HAVT +G+GT
Sbjct: 239 YKVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGNCADQINHAVTAIGYGTD 296
Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
E+G YWL+KNSWG +WG+ G+MKI+RD GLC I SSYP
Sbjct: 297 EEGQKYWLLKNSWGTSWGENGFMKIIRDSGDPSGLCDIAKMSSYP 341
>gi|242072394|ref|XP_002446133.1| hypothetical protein SORBIDRAFT_06g002160 [Sorghum bicolor]
gi|241937316|gb|EES10461.1| hypothetical protein SORBIDRAFT_06g002160 [Sorghum bicolor]
Length = 338
Score = 340 bits (872), Expect = 5e-91, Method: Compositional matrix adjust.
Identities = 161/338 (47%), Positives = 226/338 (66%), Gaps = 10/338 (2%)
Query: 15 TTPMFIIITL-LVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKE 73
T+ F++ L S S V+++R + ++VE HE WM ++GR YKD EK R + FK
Sbjct: 3 TSKAFLLAILGCASLCSSVLAARELSDAAMVERHENWMVEYGRVYKDAAEKARRFEAFKH 62
Query: 74 NLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
N+ ++E N + LG NQF+DLT +EF+A G+K S TT FKY+NLS+
Sbjct: 63 NVAFVESFNTNKKNKFWLGVNQFADLTTEEFKA-NKGFKPISAEMVPTTG--FKYENLSV 119
Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST 193
+ +PT++DWR KGAVTPIKNQ +CGCCWAF+AVAA+EGI K+ +GNLI LSEQ+L+DC T
Sbjct: 120 SALPTAVDWRTKGAVTPIKNQGQCGCCWAFSAVAAMEGIVKLSTGNLISLSEQELVDCDT 179
Query: 194 NG-NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSG 252
+ + GC GG + AF ++I+N G+ATE YPY+AV G C K +AA I +E+VP
Sbjct: 180 HSMDEGCEGGWMDSAFEFVIKNGGLATESSYPYKAVDGKCKGGSK-SAATIKGHEDVPVN 238
Query: 253 DEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYW 312
DE AL+KAV+ QPVS+A+ A F Y G+ G CGT+LDH + +G+G DG YW
Sbjct: 239 DEAALMKAVANQPVSVAVDASDRTFMLYSGGVMTGSCGTELDHGIAAIGYGVESDGTKYW 298
Query: 313 LIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
++KNSWG TWG+ G++++ +D +G+CG+ + SYP
Sbjct: 299 ILKNSWGTTWGEKGFLRMEKDISDKQGMCGLAMKPSYP 336
>gi|297851334|ref|XP_002893548.1| peptidase C1A papain family protein [Arabidopsis lyrata subsp.
lyrata]
gi|297339390|gb|EFH69807.1| peptidase C1A papain family protein [Arabidopsis lyrata subsp.
lyrata]
Length = 346
Score = 340 bits (871), Expect = 7e-91, Method: Compositional matrix adjust.
Identities = 171/342 (50%), Positives = 230/342 (67%), Gaps = 13/342 (3%)
Query: 18 MFIIITLLVSC--ASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENL 75
MF+ +T+L SQ S + HE V E H++WM + R Y DELEK+MR +FK+NL
Sbjct: 7 MFVSLTILSMSLKVSQATSRVTFHEPIVAEHHQQWMTRFSRVYSDELEKQMRFDVFKKNL 66
Query: 76 EYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYK----MPSPSHRSTTSSTFKYQNL 131
++IEK NK+G+RTYKLG N+F+D T +EF A +TG K +PS ++ + N+
Sbjct: 67 KFIEKFNKKGDRTYKLGVNEFADWTKEEFIATHTGLKGFNGIPSSEFVDEMIPSWNW-NV 125
Query: 132 SMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDC 191
S P DWR +GAVTP+K Q +CGCCWAF++VAAVEG+TKI GNL+ LSEQQLLDC
Sbjct: 126 SDVAGPEIKDWRYEGAVTPVKYQGQCGCCWAFSSVAAVEGLTKIVGGNLVSLSEQQLLDC 185
Query: 192 STNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPS 251
+NGC GG AF+YII+N+GIA+E YPYQ GTC KP+A I ++ VPS
Sbjct: 186 DRERDNGCNGGIMSDAFSYIIKNRGIASEASYPYQETEGTCRYNAKPSAW-IRGFQTVPS 244
Query: 252 GDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFN-GVCGTQLDHAVTIVGFGTTEDGAN 310
+E+ALL+AVS QPVS++I A F Y G+++ CGT ++HAVT VG+GT+ +G
Sbjct: 245 NNERALLEAVSRQPVSVSIDADGPGFMHYSGGVYDEPYCGTDVNHAVTFVGYGTSPEGIK 304
Query: 311 YWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
YWL KNSWG TWG+ GY++I RD +G+CG+ + YP+A
Sbjct: 305 YWLAKNSWGETWGENGYIRIRRDVAWPQGMCGVAQYAFYPVA 346
>gi|144905112|dbj|BAF56429.1| cysteine proteinase [Lotus japonicus]
Length = 341
Score = 340 bits (871), Expect = 7e-91, Method: Compositional matrix adjust.
Identities = 173/337 (51%), Positives = 222/337 (65%), Gaps = 13/337 (3%)
Query: 19 FIIITLLVSCASQVVSSRSTHEQ--SVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLE 76
+ + LL++ V SR HE S++E HE+WMA++ + YKD EKE R IFK+N+E
Sbjct: 11 ILALFLLLAVGISRVISRELHETETSLIERHEQWMAKYDKVYKDAAEKEKRFLIFKDNVE 70
Query: 77 YIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDV 136
+IE N GN+ YKLG N +DLT +EF+A G K TTS FKY+N+ T +
Sbjct: 71 FIESFNAAGNKPYKLGVNHLADLTIEEFKASRNGLKRSYDYEVGTTS--FKYENV--TAI 126
Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG- 195
P S+DWR KGAVTPIK+Q +CG CWAF+ VAA EGI KI +G L+ LSEQ+L+DC G
Sbjct: 127 PASVDWRKKGAVTPIKDQGQCGSCWAFSTVAATEGIHKISTGKLVSLSEQELVDCDRKGT 186
Query: 196 NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQ 255
+ GC GG E F +II+N GI TE YPY+AV G+C A P AA+I YE+VP E+
Sbjct: 187 DQGCEGGYMEDGFEFIIKNGGITTEANYPYKAVDGSCKNATAP-AAQIKGYEKVPVNSEK 245
Query: 256 ALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIK 315
ALLKAV+ QPVS++I A F Y GIF G CGT+LDH VT VG+G +G +YW++K
Sbjct: 246 ALLKAVANQPVSVSIDAADGSFMFYSSGIFTGECGTELDHGVTAVGYGRA-NGTDYWIVK 304
Query: 316 NSWGNTWGDAGYMKIVR----DEGLCGIGTRSSYPLA 348
NSWG WG+ GY+++ R EGLCGI SSYP A
Sbjct: 305 NSWGTVWGEQGYIRMQRGIAAKEGLCGIAMDSSYPTA 341
>gi|400180467|gb|AFP73370.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 340 bits (871), Expect = 7e-91, Method: Compositional matrix adjust.
Identities = 170/345 (49%), Positives = 234/345 (67%), Gaps = 13/345 (3%)
Query: 10 SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+ K++ + I + ++S + +RS + SV E HE WM++HGR YKDE+EK R
Sbjct: 2 AMKVDLMNILITLFFVISMFNTQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N+F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
FK +LS D+P++LDWR+ GAVT +K+Q CGCCWAF+AV ++EG KI +GNL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISN 245
Q+LLDC+TN N GC GG AF +I +N GI+ E +Y Y TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQEKTAAVQISS 238
Query: 246 YEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
Y+ VP G E +LL+AV+ QPVSI IAA S + Q Y G ++G C +++HAVT +G+GT
Sbjct: 239 YQVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTD 296
Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
E G YWL+KNSWG +WG+ G+MKI+RD GLC I SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGENGFMKIIRDYGNPAGLCDIAKMSSYP 341
>gi|400180393|gb|AFP73335.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 340 bits (871), Expect = 7e-91, Method: Compositional matrix adjust.
Identities = 170/345 (49%), Positives = 233/345 (67%), Gaps = 13/345 (3%)
Query: 10 SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+ K++ + I + ++S + RS + SV E HE WM++HGR YKDE+EK R
Sbjct: 2 AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKVERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N+F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
K +LS D+P++LDWR+ GAVT +K+Q CGCCWAF+AV ++EG KI +GNL++ SE
Sbjct: 120 LKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISN 245
Q+LLDC+TN N GC GG AF +II+N GI+ E +Y Y TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCRSQEKTAAVQISS 238
Query: 246 YEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
Y+ VP G E +LL+AV+ QPVSI IAA S + Q Y G ++G C +++HAVT +G+GT
Sbjct: 239 YKVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTD 296
Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
E G YWL+KNSWG +WG+ G+MKI+RD GLC I SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|400180407|gb|AFP73342.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 340 bits (871), Expect = 8e-91, Method: Compositional matrix adjust.
Identities = 170/345 (49%), Positives = 233/345 (67%), Gaps = 13/345 (3%)
Query: 10 SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+ K++ + I + ++S + RS + SV E HE WM++HGR YKDE+EK R
Sbjct: 2 AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKVERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N+F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
FK +LS D+P++LDWR+ GAVT +K+Q CGCCWAF+AV ++EG KI +GNL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISN 245
Q+LLDC+TN N GC GG AF +I +N GI+ E +Y Y TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQEKTAAVQISS 238
Query: 246 YEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
Y+ VP G E +LL+AV+ QPVSI IAA S + Q Y G ++G C +++HAVT +G+GT
Sbjct: 239 YQVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTD 296
Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
E G YWL+KNSWG +WG+ G+MKI+RD GLC I SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|195628596|gb|ACG36128.1| vignain precursor [Zea mays]
Length = 362
Score = 340 bits (871), Expect = 8e-91, Method: Compositional matrix adjust.
Identities = 163/349 (46%), Positives = 230/349 (65%), Gaps = 19/349 (5%)
Query: 15 TTPMFIIITLLVS---CASQVVS--------SRST--HEQSVVEIHEKWMAQHGRSYKDE 61
TT M ++ + ++ C + V + R+T E ++ ++KWMAQ+ R YKD+
Sbjct: 14 TTLMLLLCVIAIADCICQAAVAARVEPSTTVGRTTGGDEAMMMARYKKWMAQYRRKYKDD 73
Query: 62 LEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPS--PSHR 119
EK R ++FK N E+I+++N G + Y LGTNQF+DLT+ EF A+YTG + P+ PS
Sbjct: 74 AEKAHRFQVFKANAEFIDRSNAGGKKKYVLGTNQFADLTSKEFAAMYTGLRKPAAVPSGA 133
Query: 120 STTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGN 179
+ FKYQN + D +DWR +GAVTP+KNQ +CGCCWAF+AV A+EG+ I +GN
Sbjct: 134 KQIPAGFKYQNFTRLDDDVQVDWRQQGAVTPVKNQGQCGCCWAFSAVGAMEGLIMITTGN 193
Query: 180 LIQLSEQQLLDCS-TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKP 238
L+ LSEQQ+LDC ++GN GC GG + AF Y++ N G+ TED YPY AV GTC Q
Sbjct: 194 LVSLSEQQILDCDESDGNQGCNGGYMDNAFQYVVNNGGVTTEDAYPYSAVQGTCQNVQP- 252
Query: 239 AAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNG-VCGTQLDHAV 297
AA IS ++++PSGDE AL AV+ QPVS+ + S+ FQ Y+ GI++G CGT ++HAV
Sbjct: 253 -AATISGFQDLPSGDENALANAVANQPVSVGVDGGSSPFQFYQGGIYDGDGCGTDMNHAV 311
Query: 298 TIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDEGLCGIGTRSSYP 346
T +G+G + G YW++KNSWG WG+ G+M++ G CGI T +SYP
Sbjct: 312 TAIGYGADDQGTQYWILKNSWGTGWGENGFMQLQMGVGACGISTMASYP 360
>gi|400180455|gb|AFP73364.1| cysteine protease [Solanum peruvianum]
gi|400180459|gb|AFP73366.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 340 bits (871), Expect = 8e-91, Method: Compositional matrix adjust.
Identities = 170/345 (49%), Positives = 233/345 (67%), Gaps = 13/345 (3%)
Query: 10 SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+ K++ + I + ++S + RS + SV E HE WM++HGR YKDE+EK R
Sbjct: 2 AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N+F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
FK +LS D+P++LDWR+ GAVT +K+Q CGCCWAF+AV ++EG KI +GNL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISN 245
Q+LLDC+TN N GC GG AF +I +N GI+ E +Y Y TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQEKTAAVQISS 238
Query: 246 YEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
Y+ VP G E +LL+AV+ QPVSI IAA S + Q Y G ++G C +++HAVT +G+GT
Sbjct: 239 YQVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTD 296
Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
E G YWL+KNSWG +WG+ G+MKI+RD GLC I SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGENGFMKIIRDSGDPSGLCDIAKMSSYP 341
>gi|400180403|gb|AFP73340.1| cysteine protease [Solanum peruvianum]
gi|400180413|gb|AFP73345.1| cysteine protease [Solanum peruvianum]
gi|400180415|gb|AFP73346.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 340 bits (871), Expect = 8e-91, Method: Compositional matrix adjust.
Identities = 170/345 (49%), Positives = 233/345 (67%), Gaps = 13/345 (3%)
Query: 10 SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+ K++ + I + ++S + RS + SV E HE WM++HGR YKDE+EK R
Sbjct: 2 AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N+F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
FK +LS D+P++LDWR+ GAVT +K+Q CGCCWAF+AV ++EG KI +GNL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISN 245
Q+LLDC+TN N GC GG AF +I +N GI+ E +Y Y TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQEKTAAVQISS 238
Query: 246 YEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
Y+ VP G E +LL+AV+ QPVSI IAA S + Q Y G ++G C +++HAVT +G+GT
Sbjct: 239 YQVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTD 296
Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRDE----GLCGIGTRSSYP 346
E G YWL+KNSWG +WG+ G+MKI+RD GLC I SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPAGLCDIAKMSSYP 341
>gi|400180353|gb|AFP73315.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 339 bits (870), Expect = 9e-91, Method: Compositional matrix adjust.
Identities = 170/345 (49%), Positives = 233/345 (67%), Gaps = 13/345 (3%)
Query: 10 SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+ K++ + I + ++S + RS + SV E HE WM++HGR YKDE+EK R
Sbjct: 2 AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N+F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
F +LS D+P++LDWR+ GAVT +K+Q CGCCWAF+AV ++EG KI +GNL++ SE
Sbjct: 120 FIINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISN 245
Q+LLDC+TN N GC GG AF +II+N GI+ E +Y Y TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCRSQEKTAAVQISS 238
Query: 246 YEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
Y+ VP G E +LL+AV+ QPVSI IAA S + Q Y G ++G C +++HAVT +G+GT
Sbjct: 239 YQVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTD 296
Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRDE----GLCGIGTRSSYP 346
E G YWL+KNSWG +WG+ G+MKI+RD GLC I SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPAGLCDIAKMSSYP 341
>gi|400180351|gb|AFP73314.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 339 bits (870), Expect = 9e-91, Method: Compositional matrix adjust.
Identities = 170/345 (49%), Positives = 233/345 (67%), Gaps = 13/345 (3%)
Query: 10 SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+ K++ + I + ++S + RS + SV E HE WM++HGR YKDE+EK R
Sbjct: 2 AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N+F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
FK +LS D+P++LDWR+ GAVT +K+Q CGCCWAF+AV ++EG KI +GNL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISN 245
Q+LLDC+TN N GC GG AF +I +N GI+ E +Y Y TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQEKTAAVQISS 238
Query: 246 YEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
Y+ VP G E +LL+AV+ QPVSI IAA S + Q Y G ++G C +++HAVT +G+GT
Sbjct: 239 YQVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTD 296
Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
E G YWL+KNSWG +WG+ G+MKI+RD GLC I SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGENGFMKIIRDYGNPAGLCDIAKMSSYP 341
>gi|400180369|gb|AFP73323.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 339 bits (870), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 170/345 (49%), Positives = 233/345 (67%), Gaps = 13/345 (3%)
Query: 10 SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+ K++ + I + ++S + RS + SV E HE WM++HGR YKDE+EK R
Sbjct: 2 AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N+F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
FK +LS D+P++LDWR+ GAVT +K+Q CGCCWAF+AV ++EG KI +GNL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISN 245
Q+LLDC+TN N GC GG AF +I +N GI+ E +Y Y TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGQQYTCRSQEKTAAVQISS 238
Query: 246 YEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
Y+ VP G E +LL+AV+ QPVSI IAA S + Q Y G ++G C +++HAVT +G+GT
Sbjct: 239 YKVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTD 296
Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
E G YWL+KNSWG +WG+ G+MKI+RD GLC I SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|356517426|ref|XP_003527388.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 343
Score = 339 bits (870), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 167/324 (51%), Positives = 225/324 (69%), Gaps = 13/324 (4%)
Query: 33 VSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLG 92
V+ R+ + S+ E H +WMA++ + YKD E+E R +IFKEN+ YIE N N++YKL
Sbjct: 25 VTCRTLQDASMYERHAQWMARYAKVYKDPQEREKRFRIFKENVNYIETFNSADNKSYKLD 84
Query: 93 TNQFSDLTNDEFRALYTGYK--MPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTP 150
NQF+DLTN+EF A +K M S R+TT FKY+N+++ +P+++DWR KGAVTP
Sbjct: 85 INQFADLTNEEFIAPRNRFKGHMCSSITRTTT---FKYENVTV--IPSTVDWRQKGAVTP 139
Query: 151 IKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNN-GCLGGSREKAFA 209
IK+Q +CGCCWAF+AVAA EGI + +G LI LSEQ+++DC T G + GC GG + AF
Sbjct: 140 IKDQGQCGCCWAFSAVAATEGIHALNAGKLISLSEQEVVDCDTKGQDQGCAGGFMDGAFK 199
Query: 210 YIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAK-ISNYEEVPSGDEQALLKAVSMQPVSI 268
+IIQN G+ TE YPY+A G C+A A I+ YE+VP +E+AL KAV+ QPVS+
Sbjct: 200 FIIQNHGLNTEPNYPYKAADGKCNAKAAANHAATITGYEDVPVNNEKALQKAVANQPVSV 259
Query: 269 AIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYM 328
AI A ++FQ YK G+F G CGT+LDH VT VG+G + DG YWL+KNSWG WG+ GY+
Sbjct: 260 AIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVSADGTEYWLVKNSWGTEWGEEGYI 319
Query: 329 KIVR----DEGLCGIGTRSSYPLA 348
++ R +EGLCGI +SYP A
Sbjct: 320 RMQRGVKAEEGLCGIAMMASYPTA 343
>gi|400180383|gb|AFP73330.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 339 bits (870), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 170/345 (49%), Positives = 233/345 (67%), Gaps = 13/345 (3%)
Query: 10 SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+ K++ + I + ++S + RS + SV E HE WM++HGR YKDE+EK R
Sbjct: 2 AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N+F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
F +LS D+P++LDWR+ GAVT +K+Q CGCCWAF+AV ++EG KI +GNL++ SE
Sbjct: 120 FIINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISN 245
Q+LLDC+TN N GC GG AF +II+N GI+ E +Y Y TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQYTCRSQEKTAAVQISS 238
Query: 246 YEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
Y+ VP G E +LL+AV+ QPVSI IAA S + Q Y G ++G C +++HAVT +G+GT
Sbjct: 239 YKVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTD 296
Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
E G YWL+KNSWG +WG+ G+MKI+RD GLC I SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|413953668|gb|AFW86317.1| hypothetical protein ZEAMMB73_339067 [Zea mays]
Length = 433
Score = 339 bits (870), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 157/331 (47%), Positives = 216/331 (65%), Gaps = 8/331 (2%)
Query: 21 IITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEK 80
II C + + + + + +V HE+WMAQ+ R YKD EK R ++FK N+++IE
Sbjct: 104 IIGFAFFCGAAMAARDLSDDSVMVARHEQWMAQYSRVYKDASEKARRFEVFKANVQFIES 163
Query: 81 ANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSL 140
N GN + LG NQF+DLTNDEFR+ T + S + + T F+Y+N+S +PT++
Sbjct: 164 FNAGGNNKFWLGVNQFADLTNDEFRSTKTNKGLKSSNMKIPTG--FRYENVSADALPTTI 221
Query: 141 DWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGC 199
DWR KGAVTPIK+Q +CGCCWAF+AVAA EGI KI +G L+ L+EQ+L+DC +G + GC
Sbjct: 222 DWRTKGAVTPIKDQGQCGCCWAFSAVAATEGIVKISTGKLVSLAEQELVDCDVHGEDQGC 281
Query: 200 LGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLK 259
GG + AF +II+N G+ TE YPY A G C + +AA I YE+VP+ DE AL+K
Sbjct: 282 EGGLMDDAFKFIIKNGGLTTESSYPYTAADGKCKSGSN-SAATIKGYEDVPANDEAALMK 340
Query: 260 AVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWG 319
AV+ QPVS+A+ FQ Y G+ G CGT LDH + +G+G T DG YWL+KNSWG
Sbjct: 341 AVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYGKTSDGTKYWLMKNSWG 400
Query: 320 NTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
TWG+ GY+++ +D G+CG+ SYP
Sbjct: 401 TTWGENGYLRMEKDISDKRGMCGLAMEPSYP 431
>gi|255580659|ref|XP_002531152.1| cysteine protease, putative [Ricinus communis]
gi|223529265|gb|EEF31237.1| cysteine protease, putative [Ricinus communis]
Length = 340
Score = 339 bits (869), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 171/335 (51%), Positives = 232/335 (69%), Gaps = 15/335 (4%)
Query: 21 IITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEK 80
+I L + ASQ ++ R+ + S+ E HE+WM + R Y D EKE+R KIFKEN++ IE
Sbjct: 14 LIFFLGALASQAIA-RTLQDASIHEKHEEWMTRFKRVYSDAKEKEIRYKIFKENVQRIES 72
Query: 81 ANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHR-STTSSTFKYQNLSMTDVPTS 139
NK ++YKLG NQF+DLTN+EF+ +K H S+ + F+Y+N+ T VP+S
Sbjct: 73 FNKASEKSYKLGINQFADLTNEEFKTSRNRFK----GHMCSSQAGPFRYENI--TAVPSS 126
Query: 140 LDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNG 198
+DWR +GAVT IK+Q +CG CWAF+AVAAVEGIT++ + LI LSEQ+L+DC T G + G
Sbjct: 127 MDWRKEGAVTAIKDQGQCGSCWAFSAVAAVEGITQLATSKLISLSEQELVDCDTKGEDQG 186
Query: 199 CLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKP-AAAKISNYEEVPSGDEQAL 257
C GG + AF +I QNQG+ TE YPY+ GTC+ Q+ AAKI+ +E+VP+ +E AL
Sbjct: 187 CQGGLMDDAFKFIEQNQGLTTEANYPYEGSDGTCNTKQEANHAAKINGFEDVPANNEGAL 246
Query: 258 LKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNS 317
+KAV+ QPVS+AI A EFQ Y GIF G CGT+LDH V VG+G + +G NYWL+KNS
Sbjct: 247 MKAVAKQPVSVAIDAGGFEFQFYSSGIFTGDCGTELDHGVAAVGYGES-NGMNYWLVKNS 305
Query: 318 WGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
WG WG+ GY+++ +D EGLCGI ++SYP A
Sbjct: 306 WGTQWGEEGYIRMQKDIDAKEGLCGIAMQASYPTA 340
>gi|400180457|gb|AFP73365.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 339 bits (869), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 170/345 (49%), Positives = 233/345 (67%), Gaps = 13/345 (3%)
Query: 10 SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+ K++ + I + ++S + RS + SV E HE WM++HGR YKDE+EK R
Sbjct: 2 AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N+F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
F +LS D+P++LDWR+ GAVT +K+Q CGCCWAF+AV ++EG KI +GNL++ SE
Sbjct: 120 FIINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISN 245
Q+LLDC+TN N GC GG AF +II+N GI+ E +Y Y TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCRSQEKTAAVQISS 238
Query: 246 YEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
Y+ VP G E +LL+AV+ QPVSI IAA S + Q Y G ++G C +++HAVT +G+GT
Sbjct: 239 YKVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTD 296
Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
E G YWL+KNSWG +WG+ G+MKI+RD GLC I SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|400180373|gb|AFP73325.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 339 bits (869), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 169/345 (48%), Positives = 234/345 (67%), Gaps = 13/345 (3%)
Query: 10 SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+ K++ + I + ++S + RS + SV E HE WM++HG YKDE+EK R
Sbjct: 2 AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGHVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N+F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
FK +LS D+P++LDWR+ GAVT +K+Q +CGCCWAF+AV ++EG KI +GNL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGQCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISN 245
Q+LLDC+TN N GC GG AF +I +N GI++E +Y Y TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCDGGFMTNAFDFIKENGGISSESDYEYLGEQYTCRSQEKTAAVQISS 238
Query: 246 YEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
Y+ VP G E +LL+AV+ QPVSI IAA S + Q Y G ++G C +++HAVT +G+GT
Sbjct: 239 YQVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTD 296
Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRDE----GLCGIGTRSSYP 346
E G YWL+KNSWG +WG+ G+MKI+RD GLC I SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPAGLCDIAKMSSYP 341
>gi|400180385|gb|AFP73331.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 339 bits (869), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 170/345 (49%), Positives = 233/345 (67%), Gaps = 13/345 (3%)
Query: 10 SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+ K++ + I + ++S + RS + SV E HE WM++HGR YKDE+EK R
Sbjct: 2 AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N+F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
FK +LS D+P++LDWR+ GAVT +K+Q CGCCWAF+AV ++EG KI +GNL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISN 245
Q+LLDC+TN N GC GG AF +I +N GI+ E +Y Y TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQEKTAAVQISS 238
Query: 246 YEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
Y+ VP G E +LL+AV+ QPVSI IAA S + Q Y G ++G C +++HAVT +G+GT
Sbjct: 239 YQVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTD 296
Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
E G YWL+KNSWG +WG+ G+MKI+RD GLC I SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGENGFMKIIRDSGDPSGLCDITKMSSYP 341
>gi|242072392|ref|XP_002446132.1| hypothetical protein SORBIDRAFT_06g002150 [Sorghum bicolor]
gi|241937315|gb|EES10460.1| hypothetical protein SORBIDRAFT_06g002150 [Sorghum bicolor]
Length = 337
Score = 339 bits (869), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 158/338 (46%), Positives = 226/338 (66%), Gaps = 11/338 (3%)
Query: 15 TTPMFIIITL-LVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKE 73
T+ F++ L S S V+++R + ++VE HE WM ++GR YKD EK R + FK
Sbjct: 3 TSKAFLLAILGCASLCSSVLAARELSDAAMVERHENWMVEYGRVYKDAAEKARRFEAFKH 62
Query: 74 NLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
N+ ++E N + LG NQF+DLT +EF+A G+K P+ ++ FKY+NLS+
Sbjct: 63 NVAFVESFNTNKKNKFWLGVNQFADLTTEEFKA-NKGFK---PTAEKVPTTGFKYENLSV 118
Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST 193
+ +PT++DWR KGAVTPIKNQ +CGCCWAF+AVAA+EGI K+ +GNLI LSEQ+L+DC T
Sbjct: 119 SALPTAVDWRTKGAVTPIKNQGQCGCCWAFSAVAAMEGIVKLSTGNLISLSEQELVDCDT 178
Query: 194 NG-NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSG 252
+ + GC GG + AF ++I+N G+ATE YPY+AV G C K +AA I +E+VP
Sbjct: 179 HSMDEGCEGGWMDSAFEFVIKNGGLATESNYPYKAVDGKCKGGSK-SAATIKGHEDVPVN 237
Query: 253 DEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYW 312
+E AL+KAV+ QPVS+A+ A F Y G+ G CGT+LDH + +G+G DG YW
Sbjct: 238 NEAALMKAVANQPVSVAVDASDRTFMLYSGGVMTGSCGTELDHGIAAIGYGMESDGTKYW 297
Query: 313 LIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
++KNSWG TWG+ G++++ +D G+CG+ + SYP
Sbjct: 298 ILKNSWGTTWGEKGFLRMEKDITDKRGMCGLAMKPSYP 335
>gi|1046373|gb|AAC49135.1| SAG12 protein [Arabidopsis thaliana]
Length = 346
Score = 338 bits (868), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 167/338 (49%), Positives = 224/338 (66%), Gaps = 10/338 (2%)
Query: 18 MFIIITLLVS-CASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLE 76
+F+ + + S C S +S +E + + H +WM +HGR Y D E+ R +FK N+E
Sbjct: 8 IFLFVAIFSSFCFSITLSRPLDNELIMQKRHIEWMTKHGRVYADVKEENNRYVVFKNNVE 67
Query: 77 YIEKANK-EGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSP--SHRSTTSSTFKYQNLSM 133
IE N RT+KL NQF+DLTNDEF ++YTG+K S S T S F+YQN+S
Sbjct: 68 RIEHLNSIPAGRTFKLAVNQFADLTNDEFCSMYTGFKGVSALSSQSQTKMSPFRYQNVSS 127
Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST 193
+P S+DWR KGAVTPIKNQ CGCCWAF+AVAA+EG T+I+ G LI LSEQQL+DC T
Sbjct: 128 GALPVSVDWRKKGAVTPIKNQGSCGCCWAFSAVAAIEGATQIKKGKLISLSEQQLVDCDT 187
Query: 194 NGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQ-KPAAAKISNYEEVPSG 252
N + GC GG + AF +I G+ TE +YPY+ TC++ + P A I+ YE+VP
Sbjct: 188 N-DFGCEGGLMDTAFEHIKATGGLTTESDYPYKGEDATCNSKKTNPKATSITGYEDVPVN 246
Query: 253 DEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYW 312
DEQAL+KAV+ QPVS+ I +FQ Y G+F G C T LDHAVT +G+G + +G+ YW
Sbjct: 247 DEQALMKAVAHQPVSVGIEGGGFDFQFYSSGVFTGECTTYLDHAVTAIGYGESTNGSKYW 306
Query: 313 LIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
+IKNSWG WG++GYM+I +D +GLCG+ ++SYP
Sbjct: 307 IIKNSWGTKWGESGYMRIQKDVKDKQGLCGLAMKASYP 344
>gi|125551397|gb|EAY97106.1| hypothetical protein OsI_19029 [Oryza sativa Indica Group]
Length = 350
Score = 338 bits (868), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 164/348 (47%), Positives = 228/348 (65%), Gaps = 16/348 (4%)
Query: 15 TTPMFIIITLLVSC------ASQVVSSRSTH-EQSVVEIHEKWMAQHGRSYKDELEKEMR 67
+ P+ + I + C + V ++R + ++ HE+WMAQHGR YKD EK R
Sbjct: 5 SKPLLLAILCCIVCLYSSSGGAIVAAARELGGDAAMAARHERWMAQHGRVYKDAAEKARR 64
Query: 68 LKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYK-MPSPSHRSTTSSTF 126
L++FK N+ +IE N G Y LG NQF+DLT++EF+A T K +P++ S+ F
Sbjct: 65 LEVFKANVAFIESFNAGGKNRYWLGVNQFADLTSEEFKATMTNSKGFSTPNNGVRVSTGF 124
Query: 127 KYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQ 186
KY+N+S +P S+DWR KGAVT IK+Q +CGCCWAF+AVAA+EGI K+ +G LI LSEQ
Sbjct: 125 KYENVSADALPASVDWRTKGAVTRIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLISLSEQ 184
Query: 187 QLLDCSTNGNN-GCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTC-SAAQKPAAAKIS 244
+L+DC +GN+ GC GG + AF +I+ N G+ E YPY A G C + A AA I
Sbjct: 185 ELVDCDVDGNDQGCEGGEIDGAFQFILSNGGLTAEANYPYTAEDGRCKTTAAADVAASIR 244
Query: 245 NYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGT 304
YE+VP+ DE +L+KAV+ QPVS+A+ A ++FQ Y G+ G CGT LDH VT++G+G
Sbjct: 245 GYEDVPANDEPSLMKAVAGQPVSVAVDA--SKFQFYGGGVMAGECGTSLDHGVTVIGYGA 302
Query: 305 TEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
DG YWL+KNSWG TWG+AGY+++ +D G+CG+ + SYP A
Sbjct: 303 ASDGTKYWLVKNSWGTTWGEAGYLRMEKDIDDKRGMCGLAMQPSYPTA 350
>gi|400180379|gb|AFP73328.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 338 bits (868), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 169/345 (48%), Positives = 233/345 (67%), Gaps = 13/345 (3%)
Query: 10 SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+ K++ + I + +++ + RS + SV E HE WM++HGR YKDE+EK R
Sbjct: 2 AMKVDLMNILITLFFVITMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N+F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
FK +LS D+P++LDWR+ GAVT +K+Q CGCCWAF+AV ++EG KI +GNL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISN 245
Q+LLDC+TN N GC GG AF +I +N GI+ E +Y Y TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCDGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQEKTAAVQISS 238
Query: 246 YEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
Y+ VP G E +LL+AV+ QPVSI IAA S + Q Y G ++G C +++HAVT +G+GT
Sbjct: 239 YQVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTD 296
Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
E G YWL+KNSWG +WG+ G+MKI+RD GLC I SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|255580657|ref|XP_002531151.1| cysteine protease, putative [Ricinus communis]
gi|223529264|gb|EEF31236.1| cysteine protease, putative [Ricinus communis]
Length = 340
Score = 338 bits (868), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 172/335 (51%), Positives = 232/335 (69%), Gaps = 15/335 (4%)
Query: 21 IITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEK 80
+I LL + SQ ++ R+ + S+ E HE+WM++ GR Y D EKE+R KIFKEN++ IE
Sbjct: 14 LIFLLGALVSQAMA-RTLQDASMHEKHEEWMSRFGRVYNDGNEKEIRYKIFKENVQRIES 72
Query: 81 ANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHR-STTSSTFKYQNLSMTDVPTS 139
NK ++YKLG NQF+DLTN+EF+ +K H S+ + F+Y+NL T P+S
Sbjct: 73 FNKASGKSYKLGINQFADLTNEEFKTSRNRFK----GHMCSSQAGPFRYENL--TAAPSS 126
Query: 140 LDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNG 198
+DWR KGAVT IK+Q +CG CWAF+AVAAVEGIT++ + LI LSEQ+L+DC T G + G
Sbjct: 127 MDWRKKGAVTAIKDQGQCGSCWAFSAVAAVEGITQLATSKLISLSEQELVDCDTKGEDQG 186
Query: 199 CLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKP-AAAKISNYEEVPSGDEQAL 257
C GG + AF +I QNQG+ TE YPY+ GTC+ Q+ AAKI+ +E+VP+ +E AL
Sbjct: 187 CQGGLMDDAFKFIEQNQGLTTEANYPYEGSDGTCNTKQEANHAAKINGFEDVPANNEGAL 246
Query: 258 LKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNS 317
+KAV+ QPVS+AI A FQ Y GIF G CGT+LDH V VG+G + +G NYWL+KNS
Sbjct: 247 MKAVAKQPVSVAIDAGGFGFQFYSSGIFTGDCGTELDHGVAAVGYGES-NGMNYWLVKNS 305
Query: 318 WGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
WG WG+ GY+++ +D EGLCGI ++SYP A
Sbjct: 306 WGTQWGEEGYIRMQKDIDAKEGLCGIAMQASYPTA 340
>gi|255557851|ref|XP_002519955.1| cysteine protease, putative [Ricinus communis]
gi|223541001|gb|EEF42559.1| cysteine protease, putative [Ricinus communis]
Length = 321
Score = 338 bits (867), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 175/336 (52%), Positives = 225/336 (66%), Gaps = 35/336 (10%)
Query: 20 IIITLLV---SCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLE 76
+ I LLV + ASQ ++ + +E ++VE HE+WMA+HGR+Y+D EKE R +IFK NLE
Sbjct: 9 LAIALLVVFSTWASQAMARQLINEDALVEKHEQWMARHGRTYQDSEEKERRFQIFKSNLE 68
Query: 77 YIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDV 136
YI+ NK N+TY+LG N F+DL+++E+ A YT KMP +V
Sbjct: 69 YIDNFNKASNQTYQLGLNNFADLSHEEYVATYTARKMP-------------------VEV 109
Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGN 196
P S+DWRD GAVTPIKNQ +CGCCWAF+A AAVEGI N + LS QQLLDC ++ N
Sbjct: 110 PESIDWRDHGAVTPIKNQYQCGCCWAFSAAAAVEGIV----ANGVSLSAQQLLDCVSD-N 164
Query: 197 NGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQA 256
GC GG AF YIIQNQGIA E +YPYQ + CS+ + AAA+IS +E+V DE+A
Sbjct: 165 QGCKGGWMNNAFNYIIQNQGIALETDYPYQQMQQMCSS--RMAAAQISGFEDVTPKDEEA 222
Query: 257 LLKAVSMQPVSIAIAAYST-EFQSYKEGIFNGV-CGTQLDHAVTIVGFGTTEDGANYWLI 314
L++AV+ QPVS+ I A S F+ YKEG+F CG HAVT+VG+GT+EDG YWL
Sbjct: 223 LMRAVAKQPVSVTIDATSNPNFKLYKEGVFTAAGCGNGHSHAVTLVGYGTSEDGTKYWLA 282
Query: 315 KNSWGNTWGDAGYMKIVRDEGL----CGIGTRSSYP 346
KNSWG TWG++GYM++ RD GL CGI +SYP
Sbjct: 283 KNSWGETWGESGYMRLQRDIGLEGGPCGIALYASYP 318
>gi|400180417|gb|AFP73347.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 338 bits (867), Expect = 3e-90, Method: Compositional matrix adjust.
Identities = 168/343 (48%), Positives = 233/343 (67%), Gaps = 9/343 (2%)
Query: 10 SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+ K++ + I + ++S + RS + SV E HE WM++HGR YKDE+EK R
Sbjct: 2 AMKVDLMNILITVFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPS--PSHRSTTSSTFK 127
IFKEN+++IE NK GN +YKLG N+F+D+T+ EF A +TG +P+ S +S+ FK
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPLSSTEFK 121
Query: 128 YQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQ 187
+LS D+P++LDWR+ GAVT +K+Q CGCCWAF+AV ++EG KI +GNL++ SEQ+
Sbjct: 122 INDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQE 181
Query: 188 LLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYE 247
LLDC+TN N GC GG AF +II+N GI+ E +Y Y TC + +K AA +IS+Y+
Sbjct: 182 LLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQYTCRSQEKTAAVQISSYK 240
Query: 248 EVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTED 307
VP G E +LL+AV+ QPVSI IAA S + Q Y G ++G C +++HAVT +G+GT E
Sbjct: 241 VVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTDEK 298
Query: 308 GANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
G YWL+KNSWG +WG+ G+MKI+RD GLC I SSYP
Sbjct: 299 GQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|356554921|ref|XP_003545789.1| PREDICTED: LOW QUALITY PROTEIN: thiol protease SEN102-like [Glycine
max]
Length = 439
Score = 338 bits (866), Expect = 3e-90, Method: Compositional matrix adjust.
Identities = 172/355 (48%), Positives = 233/355 (65%), Gaps = 15/355 (4%)
Query: 4 IFERSGSFKINTTPMFIIITLLVSCASQV--VSSRSTHEQSVVEIHEKWMAQHGRSYKDE 61
IF+R + I + +L+ A V+ + + S+ E HE+WM +HG+ YKD
Sbjct: 90 IFKRDSTMVAKNHFYHISLAMLLCTAFLAFQVTCCTLQDASMYERHEQWMTRHGKVYKDP 149
Query: 62 LEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYK--MPSPSHR 119
E+E R +IF EN+ Y+E N N+ YKLG NQF DLTN EF A +K M S R
Sbjct: 150 REREKRFRIFNENVNYVEAFNNAANKPYKLGINQFXDLTNQEFIAPRNRFKGHMCSSIIR 209
Query: 120 STTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGN 179
+TT FKY+N+ T VP+++DWR GAVTP+K+Q +CGCCWAF+AVAA EGI + G
Sbjct: 210 TTT---FKYENV--TTVPSTVDWRQNGAVTPVKDQGQCGCCWAFSAVAATEGIHALSGGK 264
Query: 180 LIQLSEQQLLDCSTNG-NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKP 238
LI LSEQ+L+DC T G + GC GG + A+ +IIQN G+ TE YPY+ V G C+A +
Sbjct: 265 LISLSEQELVDCDTKGVDQGCEGGLMDDAYKFIIQNHGLNTEANYPYKGVDGKCNANEAA 324
Query: 239 AAAK-ISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAV 297
A I+ YE+VP+ +E+AL KAV+ QPVS+AI A S++FQ YK G F G CGT+LDH V
Sbjct: 325 NHAATITGYEDVPANNEKALQKAVANQPVSVAIDASSSDFQFYKSGAFTGSCGTELDHGV 384
Query: 298 TIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR----DEGLCGIGTRSSYPLA 348
T VG+G ++ G YWL+KNSWG WG+ GY+++ R +EG+CGI ++SYP A
Sbjct: 385 TAVGYGVSDHGTKYWLVKNSWGTEWGEEGYIRMQRGVDSEEGVCGIAMQASYPTA 439
>gi|414587996|tpg|DAA38567.1| TPA: hypothetical protein ZEAMMB73_390779 [Zea mays]
Length = 343
Score = 337 bits (865), Expect = 4e-90, Method: Compositional matrix adjust.
Identities = 158/344 (45%), Positives = 231/344 (67%), Gaps = 14/344 (4%)
Query: 13 INTTPMFIIITLLVSCA----SQVVSSRS-THEQSVVEIHEKWMAQHGRSYKDELEKEMR 67
+++ +++ +L CA S V+++R + + ++ E HE+WMA +GR YKD EK R
Sbjct: 2 VSSRAFLLLLAILTGCACSFPSPVLAARELSDDAAMAERHERWMAVYGRVYKDAAEKARR 61
Query: 68 LKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFK 127
++FK+NL ++E N + + LG NQF+DLT +EF+A G+K S TT FK
Sbjct: 62 FEVFKDNLAFVESFNADKKNKFWLGVNQFADLTTEEFKA-NKGFKPISAEEVPTTG--FK 118
Query: 128 YQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQ 187
Y+NLS++ +PT++DWR KGAVTPIKNQ +CGCCWAF+AVAA+EGI K+ + NL+ LSEQ+
Sbjct: 119 YENLSVSALPTAVDWRTKGAVTPIKNQGQCGCCWAFSAVAAMEGIVKLSTDNLVSLSEQE 178
Query: 188 LLDCSTNG-NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNY 246
L+DC T+ + GC GG + AF ++I+N G+ATE YPY+AV G C K +AA I +
Sbjct: 179 LVDCDTHSMDEGCEGGWMDSAFEFVIKNGGLATESSYPYKAVDGKCKGGSK-SAATIKGH 237
Query: 247 EEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTE 306
E+VP +E AL+KAV+ QPVS+A+ A F Y G+ G CGTQLDH + +G+G
Sbjct: 238 EDVPPNNEAALMKAVASQPVSVAVDASDRTFMLYSGGVMTGSCGTQLDHGIAAIGYGVES 297
Query: 307 DGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
DG YW++KNSWG TWG+ ++++ +D +G+CG+ + SYP
Sbjct: 298 DGTKYWILKNSWGTTWGEKRFLRMEKDISDKQGMCGLAMKPSYP 341
>gi|400180363|gb|AFP73320.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 337 bits (865), Expect = 4e-90, Method: Compositional matrix adjust.
Identities = 169/345 (48%), Positives = 232/345 (67%), Gaps = 13/345 (3%)
Query: 10 SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+ K++ + I + ++S + RS + SV E HE WM++HGR YKDE+EK R
Sbjct: 2 AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N+F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
K +LS D+P++LDWR+ GAVT +K+Q CGCCWAF+AV ++EG KI +GNL++ SE
Sbjct: 120 LKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISN 245
Q+LLDC+TN N GC GG AF +I +N GI+ E +Y Y TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQEKTAAVQISS 238
Query: 246 YEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
Y+ VP G E +LL+AV+ QPVSI IAA S + Q Y G ++G C +++HAVT +G+GT
Sbjct: 239 YQVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTD 296
Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
E G YWL+KNSWG +WG+ G+MKI+RD GLC I SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGENGFMKIIRDYGNPAGLCDIAKMSSYP 341
>gi|414588010|tpg|DAA38581.1| TPA: hypothetical protein ZEAMMB73_156486 [Zea mays]
Length = 347
Score = 337 bits (864), Expect = 5e-90, Method: Compositional matrix adjust.
Identities = 164/347 (47%), Positives = 223/347 (64%), Gaps = 18/347 (5%)
Query: 17 PMFIIITLL----VSCASQVVSSR---STHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
P +++ +L C++ V+++R E ++V HE+WM QHGR YKDE +K R
Sbjct: 4 PKALLLAILGCGVCLCSAAVLAARELGGDDELAMVARHEQWMVQHGRVYKDETDKAHRFL 63
Query: 70 IFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTF 126
+FK N+++IE N GNR + LG NQF+DLTNDEFRA T + T F
Sbjct: 64 VFKANVKFIESFNAAAAAGNRKFWLGVNQFADLTNDEFRATKTNKGFNPNVVKVPTG--F 121
Query: 127 KYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQ 186
+YQNLS+ +P ++DWR KGAVTPIK+Q +CGCCWAF+AVAA EGI KI +G L LSEQ
Sbjct: 122 RYQNLSIDALPQTVDWRTKGAVTPIKDQGQCGCCWAFSAVAATEGIVKISTGKLTSLSEQ 181
Query: 187 QLLDCSTNG-NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISN 245
+L+DC +G + GC GG + AF +II+N G+ TE YPY A G C + AA I
Sbjct: 182 ELVDCDVHGEDQGCNGGEMDDAFKFIIKNGGLTTESNYPYTAQDGQCKSGSN-GAATIKG 240
Query: 246 YEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
YE+VP+ DE AL+KAV+ QPVS+A+ FQ Y G+ G CGT LDH + +G+G T
Sbjct: 241 YEDVPANDEAALMKAVASQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYGKT 300
Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
DG YWL+KNSWG TWG+ G++++ +D +G+CG+ + SYP A
Sbjct: 301 SDGTKYWLMKNSWGTTWGENGFLRMEKDIADKKGMCGLAMQPSYPTA 347
>gi|400180349|gb|AFP73313.1| cysteine protease [Solanum peruvianum]
gi|400180469|gb|AFP73371.1| cysteine protease [Solanum peruvianum]
gi|400180471|gb|AFP73372.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 337 bits (864), Expect = 5e-90, Method: Compositional matrix adjust.
Identities = 169/345 (48%), Positives = 232/345 (67%), Gaps = 13/345 (3%)
Query: 10 SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+ K++ + I + ++S + RS + SV E HE WM++HGR YKDE+EK R
Sbjct: 2 AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKVERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N+F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
K +LS D+P++LDWR+ GAVT +K+Q CGCCWAF+AV ++EG KI +GNL++ SE
Sbjct: 120 LKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISN 245
Q+LLDC+TN N GC GG AF +I +N GI+ E +Y Y TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQEKTAAVQISS 238
Query: 246 YEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
Y+ VP G E +LL+AV+ QPVSI IAA S + Q Y G ++G C +++HAVT +G+GT
Sbjct: 239 YQVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTD 296
Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
E G YWL+KNSWG +WG+ G+MKI+RD GLC I SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGENGFMKIIRDYGNPAGLCDIAKMSSYP 341
>gi|400180447|gb|AFP73360.1| cysteine protease [Solanum chilense]
Length = 345
Score = 337 bits (864), Expect = 5e-90, Method: Compositional matrix adjust.
Identities = 169/345 (48%), Positives = 233/345 (67%), Gaps = 12/345 (3%)
Query: 10 SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+ K++ + I + ++S + RS + SV E HE WM++HGR YKDE+EK R
Sbjct: 2 AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N+F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFK 121
Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
K +LS D+P++LDWR+ GAVT +K+Q +CGCCWAF+AV ++EG KI +G L++ SE
Sbjct: 122 -KINDLSDDDMPSNLDWRESGAVTQVKHQGQCGCCWAFSAVGSLEGAYKIATGKLMEFSE 180
Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISN 245
Q+LLDC+TN N GC GG AF +II+N GI+ E +Y Y TC + +K AA +IS+
Sbjct: 181 QELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCRSQEKTAAVQISS 239
Query: 246 YEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
Y+ VP G E +LL+AV+ QPVSI IAA S + Q Y G ++G C +++HAVT +G+GT
Sbjct: 240 YQVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTD 297
Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
E G YWL+KNSWG +WG+ G+MKI+RD GLC I SSYP
Sbjct: 298 EKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 342
>gi|20334377|gb|AAM19209.1|AF493234_1 cysteine protease [Solanum lycopersicum]
gi|400180431|gb|AFP73353.1| cysteine protease [Solanum lycopersicum]
Length = 345
Score = 337 bits (864), Expect = 5e-90, Method: Compositional matrix adjust.
Identities = 170/345 (49%), Positives = 233/345 (67%), Gaps = 12/345 (3%)
Query: 10 SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+ K++ + I + ++S + RS + SV E HE WM++HGR YKDE+EK R
Sbjct: 2 AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N+F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFK 121
Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
K +LS +P++LDWR+ GAVT +K+Q CGCCWAF+AV ++EG KI +GNL++ SE
Sbjct: 122 -KINDLSDDYMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 180
Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISN 245
Q+LLDC+TN N GC GG AF +II+N GI+ E +Y Y TC + +K AA +IS+
Sbjct: 181 QELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQYTCRSQEKTAAVQISS 239
Query: 246 YEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
Y+ VP G E +LL+AV+ QPVSI IAA S + Q Y G ++G C +++HAVT +G+GT
Sbjct: 240 YQVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGNCADRINHAVTAIGYGTD 297
Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
E+G YWL+KNSWG +WG+ GYMKI+RD GLC I SSYP
Sbjct: 298 EEGQKYWLLKNSWGTSWGENGYMKIIRDSGDPSGLCDIAKMSSYP 342
>gi|400180371|gb|AFP73324.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 337 bits (863), Expect = 6e-90, Method: Compositional matrix adjust.
Identities = 169/345 (48%), Positives = 232/345 (67%), Gaps = 13/345 (3%)
Query: 10 SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+ K++ + I + ++S + RS + SV E HE WM++HGR YKDE+EK R
Sbjct: 2 AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N+F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
F +LS D+P++LDWR+ GAVT +K+Q CGCCWAF+AV ++EG KI +GNL++ SE
Sbjct: 120 FIINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISN 245
Q+LLDC+TN N GC GG AF +I +N GI+ E +Y Y TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQEKTAAVQISS 238
Query: 246 YEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
Y+ VP G E +LL+AV+ QPVSI IAA S + Q Y G ++G C +++HAVT +G+GT
Sbjct: 239 YQVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTD 296
Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
E G YWL+KNSWG +WG+ G+MKI+RD GLC I SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|400180399|gb|AFP73338.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 337 bits (863), Expect = 6e-90, Method: Compositional matrix adjust.
Identities = 169/345 (48%), Positives = 232/345 (67%), Gaps = 13/345 (3%)
Query: 10 SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+ K++ + I + ++S + RS + SV E HE WM++HGR YKDE+EK R
Sbjct: 2 AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N+F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
F +LS D+P++LDWR+ GAVT +K+Q CGCCWAF+AV ++EG KI +GNL++ SE
Sbjct: 120 FIINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISN 245
Q+LLDC+TN N GC GG AF +I +N GI+ E +Y Y TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQEKTAAVQISS 238
Query: 246 YEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
Y+ VP G E +LL+AV+ QPVSI IAA S + Q Y G ++G C +++HAVT +G+GT
Sbjct: 239 YQVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTD 296
Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
E G YWL+KNSWG +WG+ G+MKI+RD GLC I SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGENGFMKIIRDSGDPSGLCDIAKMSSYP 341
>gi|400180391|gb|AFP73334.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 337 bits (863), Expect = 6e-90, Method: Compositional matrix adjust.
Identities = 169/345 (48%), Positives = 233/345 (67%), Gaps = 13/345 (3%)
Query: 10 SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+ K++ + I + ++S + +RS + SV E HE WM++HGR YKDE+EK R
Sbjct: 2 AMKVDLMSILITLFFVISMFNSQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N+F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
FK +LS D+P++LDWR+ GAVT +K+Q CGCCWAF+AV ++EG KI +GNL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISN 245
Q+LLDC+TN N GC GG AF +I +N GI+ E +Y Y TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQEKTAAVQISS 238
Query: 246 YEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
Y+ VP G E +LL+AV+ QPVSI IAA S + Q G ++G C +++HAVT +G+GT
Sbjct: 239 YQVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFCAGGTYDGSCADRINHAVTAIGYGTD 296
Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
E G YWL+KNSWG +WG+ G+MKI+RD GLC I SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGENGFMKIIRDYGNPAGLCDIAKMSSYP 341
>gi|400180465|gb|AFP73369.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 337 bits (863), Expect = 7e-90, Method: Compositional matrix adjust.
Identities = 169/345 (48%), Positives = 232/345 (67%), Gaps = 13/345 (3%)
Query: 10 SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+ K++ + I + ++S + RS + SV E HE WM++HGR YKDE+EK R
Sbjct: 2 AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N+F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
FK +LS D+P++LDWR+ GAVT +K+Q CGCCWAF+AV ++E KI +GNL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEVAYKIATGNLMEFSE 179
Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISN 245
Q+LLDC+TN N GC GG AF +I +N GI+ E +Y Y TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQEKTAAVQISS 238
Query: 246 YEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
Y+ VP G E +LL+AV+ QPVSI IAA S + Q Y G ++G C +++HAVT +G+GT
Sbjct: 239 YQVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTD 296
Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRDE----GLCGIGTRSSYP 346
E G YWL+KNSWG +WG+ G+MKI+RD GLC I SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPAGLCDIAKMSSYP 341
>gi|400180461|gb|AFP73367.1| cysteine protease [Solanum peruvianum]
gi|400180473|gb|AFP73373.1| cysteine protease [Solanum peruvianum]
gi|400180475|gb|AFP73374.1| cysteine protease [Solanum peruvianum]
gi|400180479|gb|AFP73376.1| cysteine protease [Solanum peruvianum]
gi|400180481|gb|AFP73377.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 337 bits (863), Expect = 7e-90, Method: Compositional matrix adjust.
Identities = 169/345 (48%), Positives = 232/345 (67%), Gaps = 13/345 (3%)
Query: 10 SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+ K++ + I + ++S + RS + SV E HE WM++HGR YKDE+EK R
Sbjct: 2 AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N+F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
F +LS D+P++LDWR+ GAVT +K+Q CGCCWAF+AV ++EG KI +GNL++ SE
Sbjct: 120 FIINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISN 245
Q+LLDC+TN N GC GG AF +I +N GI+ E +Y Y TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQEKTAAVQISS 238
Query: 246 YEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
Y+ VP G E +LL+AV+ QPVSI IAA S + Q Y G ++G C +++HAVT +G+GT
Sbjct: 239 YQVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTD 296
Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
E G YWL+KNSWG +WG+ G+MKI+RD GLC I SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGENGFMKIIRDYGNPAGLCDIAKMSSYP 341
>gi|30690594|ref|NP_564321.2| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|28393492|gb|AAO42167.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|332192920|gb|AEE31041.1| cysteine proteinase-like protein [Arabidopsis thaliana]
Length = 355
Score = 336 bits (862), Expect = 9e-90, Method: Compositional matrix adjust.
Identities = 169/347 (48%), Positives = 233/347 (67%), Gaps = 15/347 (4%)
Query: 15 TTPMFIIITLLVSC----ASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKI 70
T+ +F++++L + SQ S + HE V E H++WM + R Y DELEK+MR +
Sbjct: 11 TSILFMLVSLTILSMNLKVSQATSRVTFHEPIVAEHHQQWMTRFSRVYSDELEKQMRFDV 70
Query: 71 FKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYK----MPSPSHRSTTSSTF 126
FK+NL++IEK NK+G+RTYKLG N+F+D T +EF A +TG K +PS ++
Sbjct: 71 FKKNLKFIEKFNKKGDRTYKLGVNEFADWTREEFIATHTGLKGVNGIPSSEFVDEMIPSW 130
Query: 127 KYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQ 186
+ N+S + DWR +GAVTP+K Q +CGCCWAF++VAAVEG+TKI NL+ LSEQ
Sbjct: 131 NW-NVSDVAGRETKDWRYEGAVTPVKYQGQCGCCWAFSSVAAVEGLTKIVGNNLVSLSEQ 189
Query: 187 QLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNY 246
QLLDC +NGC GG AF+YII+N+GIA+E YPYQA GTC KP+A I +
Sbjct: 190 QLLDCDRERDNGCNGGIMSDAFSYIIKNRGIASEASYPYQAAEGTCRYNGKPSAW-IRGF 248
Query: 247 EEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFN-GVCGTQLDHAVTIVGFGTT 305
+ VPS +E+ALL+AVS QPVS++I A F Y G+++ CGT ++HAVT VG+GT+
Sbjct: 249 QTVPSNNERALLEAVSKQPVSVSIDADGPGFMHYSGGVYDEPYCGTNVNHAVTFVGYGTS 308
Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
+G YWL KNSWG TWG+ GY++I RD +G+CG+ + YP+A
Sbjct: 309 PEGIKYWLAKNSWGETWGENGYIRIRRDVAWPQGMCGVAQYAFYPVA 355
>gi|77554625|gb|ABA97421.1| Vignain precursor, putative [Oryza sativa Japonica Group]
gi|222630746|gb|EEE62878.1| hypothetical protein OsJ_17681 [Oryza sativa Japonica Group]
Length = 350
Score = 336 bits (861), Expect = 1e-89, Method: Compositional matrix adjust.
Identities = 162/346 (46%), Positives = 226/346 (65%), Gaps = 16/346 (4%)
Query: 15 TTPMFIIITLLVSC------ASQVVSSRSTH-EQSVVEIHEKWMAQHGRSYKDELEKEMR 67
+ P+ + I + C + V ++R + ++ HE+WMAQHGR YKD EK R
Sbjct: 5 SKPLLLAILCCIVCLYSSSGGAIVAAARELGGDAAMAARHERWMAQHGRVYKDAAEKARR 64
Query: 68 LKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYK-MPSPSHRSTTSSTF 126
L++FK N+ +IE N G Y LG NQF+DLT++EF+A T K +P++ S+ F
Sbjct: 65 LEVFKANVAFIESFNAGGKNRYWLGVNQFADLTSEEFKATMTNSKGFSTPNNGVRVSTGF 124
Query: 127 KYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQ 186
KY+N+S +P S+DWR KGAVT IK+Q +CGCCWAF+AVAA+EG K+ +G LI LSEQ
Sbjct: 125 KYENVSADALPASVDWRTKGAVTRIKDQGQCGCCWAFSAVAAMEGFVKLSTGKLISLSEQ 184
Query: 187 QLLDCSTNGNN-GCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTC-SAAQKPAAAKIS 244
+L+DC +GN+ GC GG + AF +I+ N G+ E YPY A G C + A AA I
Sbjct: 185 ELVDCDVDGNDQGCEGGEIDGAFQFILSNGGLTAEANYPYTAEDGRCKTTAAADVAASIR 244
Query: 245 NYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGT 304
YE+VP+ DE +L+KAV+ QPVS+A+ A ++FQ Y G+ G CGT LDH VT++G+G
Sbjct: 245 GYEDVPANDEPSLMKAVAGQPVSVAVDA--SKFQFYGGGVMAGECGTSLDHGVTVIGYGA 302
Query: 305 TEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
DG YWL+KNSWG TWG+AGY+++ +D G+CG+ + SYP
Sbjct: 303 ASDGTKYWLVKNSWGTTWGEAGYLRMEKDIDDKRGMCGLAMQPSYP 348
>gi|400180361|gb|AFP73319.1| cysteine protease [Solanum peruvianum]
gi|400180397|gb|AFP73337.1| cysteine protease [Solanum peruvianum]
gi|400180401|gb|AFP73339.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 336 bits (861), Expect = 1e-89, Method: Compositional matrix adjust.
Identities = 169/345 (48%), Positives = 232/345 (67%), Gaps = 13/345 (3%)
Query: 10 SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+ K++ + I + ++S + RS + SV E HE WM++HGR YKDE+EK R
Sbjct: 2 AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N+F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
F +LS D+P++LDWR+ GAVT +K+Q CGCCWAF+AV ++EG KI +GNL++ SE
Sbjct: 120 FIINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISN 245
Q+LLDC+TN N GC GG AF +I +N GI+ E +Y Y TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQEKTAAVQISS 238
Query: 246 YEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
Y+ VP G E +LL+AV+ QPVSI IAA S + Q Y G ++G C +++HAVT +G+GT
Sbjct: 239 YQVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTD 296
Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
E G YWL+KNSWG +WG+ G+MKI+RD GLC I SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGENGFMKIIRDSGDPSGLCDITKMSSYP 341
>gi|125547256|gb|EAY93078.1| hypothetical protein OsI_14879 [Oryza sativa Indica Group]
Length = 339
Score = 336 bits (861), Expect = 1e-89, Method: Compositional matrix adjust.
Identities = 159/336 (47%), Positives = 220/336 (65%), Gaps = 9/336 (2%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
+F I++ L C++ + + + ++V HE+WM Q+GR YKD EK R +IFK N+ +
Sbjct: 8 LFAILSCLCLCSAVLAAREQSDHAAMVARHERWMEQYGRVYKDATEKARRFEIFKANVAF 67
Query: 78 IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
IE N GN + LG NQF+DLTN EFRA T + R T TF+Y+N+S+ +P
Sbjct: 68 IESFNA-GNHKFWLGVNQFADLTNYEFRATKTNKGFIPSTVRVPT--TFRYENVSIDTLP 124
Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-N 196
++DWR KGAVTPIK+Q +CGCCWAF+AVAA+EGI K+ +G LI LSEQ+L+DC +G +
Sbjct: 125 ATVDWRTKGAVTPIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLISLSEQELVDCDVHGED 184
Query: 197 NGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQA 256
GC GG + AF +II+N G+ TE +YPY A G C+ +AA I YEEVP+ +E A
Sbjct: 185 QGCEGGLMDDAFKFIIKNGGLTTESKYPYTAADGKCNGGSN-SAATIKGYEEVPANNEAA 243
Query: 257 LLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKN 316
L+KAV+ QPVS+A+ FQ Y G+ G CGT LDH + +G+G DG YWL+KN
Sbjct: 244 LMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIVAIGYGKDGDGTQYWLLKN 303
Query: 317 SWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
SWG TWG+ G++++ +D G+CG+ SYP A
Sbjct: 304 SWGTTWGENGFLRMEKDISDKRGMCGLAMEPSYPTA 339
>gi|255563134|ref|XP_002522571.1| cysteine protease, putative [Ricinus communis]
gi|223538262|gb|EEF39871.1| cysteine protease, putative [Ricinus communis]
Length = 343
Score = 336 bits (861), Expect = 1e-89, Method: Compositional matrix adjust.
Identities = 172/338 (50%), Positives = 228/338 (67%), Gaps = 18/338 (5%)
Query: 20 IIITLLV---SCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLE 76
++ITLL+ + SQ + + +++ E HE+WMA+HGR+Y D EKE R +IFK NL+
Sbjct: 10 LVITLLMILGTWVSQAMPRPLLNAEAIAEKHEQWMARHGRTYHDNAEKERRFQIFKNNLD 69
Query: 77 YIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPS--PSHRSTTSSTFKYQNLSMT 134
YIE NK N+TYKLG N+FSDL+ +EF Y GY+MP+ P+ +T TF +
Sbjct: 70 YIENFNKAFNKTYKLGLNKFSDLSEEEFVTTYNGYEMPTTLPTANTTVKPTFFSNYYNQD 129
Query: 135 DVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN 194
+VP S+DWR+ G VT +KNQ ECGCCWAF+AVAAVEGI +GN LS QQLLDC
Sbjct: 130 EVPESIDWRENGVVTSVKNQGECGCCWAFSAVAAVEGI----AGNGASLSAQQLLDC-VG 184
Query: 195 GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDE 254
N+GC GG+ KAF YI+QNQGI ++ +YPY+ C + AA+I+ YE V E
Sbjct: 185 DNSGCGGGTMIKAFEYIVQNQGIVSDTDYPYEQTQEMCRSGSN-VAARITGYESVIQ-SE 242
Query: 255 QALLKAVSMQPVSIAIAAYS-TEFQSYKEGIFNGV-CGTQLDHAVTIVGFGTTEDGANYW 312
+AL +AV+ QP+S+AI A S F+SY G+F+ CGT L HAVT+VG+GTTEDG YW
Sbjct: 243 EALKRAVAKQPISVAIDASSGPNFKSYISGVFSAEDCGTHLTHAVTLVGYGTTEDGTKYW 302
Query: 313 LIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
L+KNSWG WG++GYM++ RD EG CGI ++SYP
Sbjct: 303 LVKNSWGEEWGESGYMRLQRDVGAMEGPCGIAMQASYP 340
>gi|356515036|ref|XP_003526207.1| PREDICTED: thiol protease SEN102-like [Glycine max]
Length = 336
Score = 335 bits (860), Expect = 1e-89, Method: Compositional matrix adjust.
Identities = 164/321 (51%), Positives = 219/321 (68%), Gaps = 13/321 (4%)
Query: 33 VSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLG 92
V R HE S+ E HE+WM ++G+ YKD EK+ R +IFK+N+E+IE N +GN+ YKLG
Sbjct: 24 VMCRKLHETSMRERHEQWMTEYGKVYKDAAEKDKRFQIFKDNVEFIESFNADGNKPYKLG 83
Query: 93 TNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIK 152
N +DLT +EF+A G+K P H +T +TFKY+N+ T +P ++DWR KGAVTPIK
Sbjct: 84 VNHLADLTVEEFKASRNGFKRP---HEFST-TTFKYENV--TAIPAAIDWRTKGAVTPIK 137
Query: 153 NQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFAYI 211
+Q +CG CWAF+ +AA EGI +I +G L+ LSEQ+L+DC T G + GC GG E F +I
Sbjct: 138 DQGQCGSCWAFSTIAATEGIHQITTGKLVSLSEQELVDCDTKGVDQGCEGGYMEDGFEFI 197
Query: 212 IQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIA 271
I+N GI +E YPY+AV G C+ A P A+I YE+VP E AL KAV+ QPVS++I
Sbjct: 198 IKNGGITSETNYPYKAVDGKCNKATSP-VAQIKGYEKVPPNSETALQKAVANQPVSVSID 256
Query: 272 AYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIV 331
A F Y GI+NG CGT+LDH VT VG+GT +G +YW++KNSWG WG+ GY+++
Sbjct: 257 ADGAGFMFYSSGIYNGECGTELDHGVTAVGYGTA-NGTDYWIVKNSWGTQWGEKGYVRMQ 315
Query: 332 R----DEGLCGIGTRSSYPLA 348
R GLCGI SSYP +
Sbjct: 316 RGIAAKHGLCGIALDSSYPTS 336
>gi|388512155|gb|AFK44139.1| unknown [Medicago truncatula]
Length = 340
Score = 335 bits (859), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 164/322 (50%), Positives = 220/322 (68%), Gaps = 12/322 (3%)
Query: 33 VSSRSTHEQ-SVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKL 91
V SR +E S+ E HE+WM+++G+ YKD +EKE R IFK+N+E+IE N N+ YKL
Sbjct: 25 VMSRKLYESPSLQERHEQWMSEYGKLYKDAIEKEKRFMIFKDNVEFIESFNAADNKPYKL 84
Query: 92 GTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPI 151
N +DLT DEF+A GYK R +++FKY+N+ T +P ++DWR KGAVTPI
Sbjct: 85 SVNHLADLTLDEFKASRNGYK---KIDREFATTSFKYENV--TAIPEAVDWRVKGAVTPI 139
Query: 152 KNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFAY 210
K+Q +CG CWAF+ VAA+EGI +I +G LI LSEQ+L+DC T G + GC GG E F +
Sbjct: 140 KDQGQCGSCWAFSTVAAIEGINQITTGKLISLSEQELVDCDTKGEDQGCEGGLMEDGFEF 199
Query: 211 IIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAI 270
II+N GI +E YPY+A G+CSAA AKI+ YE+VP E +LLKAV+ QP+S++I
Sbjct: 200 IIKNGGITSETNYPYKAADGSCSAATTAPVAKITGYEKVPVNSEISLLKAVANQPISVSI 259
Query: 271 AAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKI 330
A + F Y GI+ G CGT+LDH VT VG+G+ +G +YW++KNSWG WG+ GY+++
Sbjct: 260 DASDSSFMFYSSGIYTGECGTELDHGVTAVGYGSA-NGTDYWIVKNSWGTVWGEKGYIRM 318
Query: 331 VR----DEGLCGIGTRSSYPLA 348
R EGLCGI SSYP A
Sbjct: 319 QRGIADKEGLCGIAMDSSYPTA 340
>gi|350535639|ref|NP_001233949.1| phytophthora-inhibited protease 1 [Solanum lycopersicum]
gi|108937128|gb|ABG23376.1| phytophthora-inhibited protease 1 [Solanum lycopersicum]
Length = 345
Score = 335 bits (859), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 168/348 (48%), Positives = 232/348 (66%), Gaps = 12/348 (3%)
Query: 8 SGSFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMR 67
S F N T + ++ ++L S +V+SR+ E S++E HE WM HGR YKD++EKE R
Sbjct: 3 SNFFLKNITVVLLLFSIL-SLYPFIVTSRNLKELSMLERHENWMVHHGRVYKDDIEKEHR 61
Query: 68 LKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFK 127
K FKEN+E+IE NK G + YKL N+++DLT +EF + G S + +T++T
Sbjct: 62 FKTFKENVEFIESFNKNGTQRYKLAVNKYADLTTEEFTTSFMGLDTSLLSQQESTATTTS 121
Query: 128 YQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQ 187
++ S+T+VP S+DWR +G+VT +K+Q CGCCWAF+A AA+EG +I + LI LSEQQ
Sbjct: 122 FKYDSVTEVPNSMDWRKRGSVTGVKDQGVCGCCWAFSAAAAIEGAYQIANNELISLSEQQ 181
Query: 188 LLDCSTNGNNGCLGGSREKAFAYIIQNQ--GIATEDEYPYQAVPGTCSAAQKPAAAKISN 245
LLDCST N GC GG A+ +++QN GI TE YPY+ C Q PAA I+
Sbjct: 182 LLDCSTQ-NKGCEGGLMTVAYDFLLQNNGGGITTETNYPYEEAQNVCKTEQ-PAAVTING 239
Query: 246 YEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
YE VPS DE +LLKAV QP+S+ IAA + EF Y GI++G C ++L+HAVT++G+GT+
Sbjct: 240 YEVVPS-DESSLLKAVVNQPISVGIAA-NDEFHMYGSGIYDGSCNSRLNHAVTVIGYGTS 297
Query: 306 -EDGANYWLIKNSWGNTWGDAGYMKIVRDEGL----CGIGTRSSYPLA 348
EDG YW++KNSWG+ WG+ GYM+I RD G+ CGI +S+P A
Sbjct: 298 EEDGTKYWIVKNSWGSDWGEEGYMRIARDVGVDGGHCGIAKVASFPTA 345
>gi|38346003|emb|CAD40112.2| OSJNBa0035O13.5 [Oryza sativa Japonica Group]
gi|125589427|gb|EAZ29777.1| hypothetical protein OsJ_13835 [Oryza sativa Japonica Group]
Length = 339
Score = 335 bits (858), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 158/336 (47%), Positives = 220/336 (65%), Gaps = 9/336 (2%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
+F I++ L C++ + + + ++V HE+WM Q+GR YKD EK R +IFK N+ +
Sbjct: 8 LFAILSCLCLCSAVLAAREQSDHAAMVARHERWMEQYGRVYKDATEKARRFEIFKANVAF 67
Query: 78 IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
IE N GN + LG NQF+DLTN EFRA T + R T TF+Y+N+S+ +P
Sbjct: 68 IESFNA-GNHKFWLGVNQFADLTNYEFRATKTNKGFIPSTVRVPT--TFRYENVSIDTLP 124
Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-N 196
++DWR KGAVTPIK+Q +CGCCWAF+AVAA+EGI K+ +G LI LSEQ+L+DC +G +
Sbjct: 125 ATVDWRTKGAVTPIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLISLSEQELVDCDVHGED 184
Query: 197 NGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQA 256
GC GG + AF +II+N G+ TE +YPY A G C+ +AA I YE+VP+ +E A
Sbjct: 185 QGCEGGLMDDAFKFIIKNGGLTTESKYPYTAADGKCNGGSN-SAATIKGYEDVPANNEAA 243
Query: 257 LLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKN 316
L+KAV+ QPVS+A+ FQ Y G+ G CGT LDH + +G+G DG YWL+KN
Sbjct: 244 LMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIVAIGYGKDGDGTQYWLLKN 303
Query: 317 SWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
SWG TWG+ G++++ +D G+CG+ SYP A
Sbjct: 304 SWGTTWGENGFLRMEKDISDKRGMCGLAMEPSYPTA 339
>gi|357458911|ref|XP_003599736.1| Cysteine proteinase [Medicago truncatula]
gi|357474719|ref|XP_003607644.1| Cysteine proteinase [Medicago truncatula]
gi|355488784|gb|AES69987.1| Cysteine proteinase [Medicago truncatula]
gi|355508699|gb|AES89841.1| Cysteine proteinase [Medicago truncatula]
Length = 340
Score = 334 bits (857), Expect = 3e-89, Method: Compositional matrix adjust.
Identities = 162/322 (50%), Positives = 219/322 (68%), Gaps = 12/322 (3%)
Query: 33 VSSRSTHEQ-SVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKL 91
V SR +E S+ E HE+WM +HG+ Y+D +EKE R IFK+N+E+IE N N+ YKL
Sbjct: 25 VMSRKLYESLSLQERHEQWMTEHGKVYEDAIEKEKRFMIFKDNVEFIESFNAADNQPYKL 84
Query: 92 GTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPI 151
N +DLT DEF+A GYK R T+++FKY+N+ T +P ++DWR KGAVTPI
Sbjct: 85 SVNHLADLTLDEFKASRNGYK---KIDREFTTTSFKYENV--TAIPAAVDWRVKGAVTPI 139
Query: 152 KNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFAY 210
K+Q +CG CWAF+ VAA EGI +I +G L+ LSEQ+L+DC T G + GC GG E F +
Sbjct: 140 KDQGQCGSCWAFSTVAATEGINQITTGKLVSLSEQELVDCDTKGEDQGCEGGLMEDGFEF 199
Query: 211 IIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAI 270
II+N GI +E YPY+A G+C+ A AKI+ YE+VP E++LLKAV+ QP+S++I
Sbjct: 200 IIKNGGITSETNYPYKAADGSCNTATTTPVAKITGYEKVPVNSEKSLLKAVANQPISVSI 259
Query: 271 AAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKI 330
A + F Y GI+ G CGT+LDH VT VG+G+ +G +YW++KNSWG WG+ GY+++
Sbjct: 260 DASDSSFMFYSSGIYTGECGTELDHGVTAVGYGSA-NGTDYWIVKNSWGTVWGEKGYIRM 318
Query: 331 VR----DEGLCGIGTRSSYPLA 348
R EGLCGI SSYP A
Sbjct: 319 QRGIAAKEGLCGIAMDSSYPTA 340
>gi|226508570|ref|NP_001141984.1| uncharacterized protein LOC100274134 precursor [Zea mays]
gi|194706676|gb|ACF87422.1| unknown [Zea mays]
gi|413920745|gb|AFW60677.1| vignain [Zea mays]
Length = 363
Score = 334 bits (857), Expect = 3e-89, Method: Compositional matrix adjust.
Identities = 158/330 (47%), Positives = 222/330 (67%), Gaps = 7/330 (2%)
Query: 22 ITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKA 81
+ V ++ V + E ++ ++KWMAQ+ R YKD+ EK R ++FK N E+I+++
Sbjct: 34 VAARVEPSTTVGRTTGGDEAMMMARYKKWMAQYRRKYKDDAEKAHRFQVFKANAEFIDRS 93
Query: 82 NKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPS--PS-HRSTTSSTFKYQNLSMTDVPT 138
N G + Y LGTNQF+DLT+ EF A+YTG + P+ PS + ++ KYQN + D
Sbjct: 94 NAGGKKKYVLGTNQFADLTSKEFAAMYTGLRKPAAVPSGAKQIPAAGSKYQNFTRLDDDV 153
Query: 139 SLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-TNGNN 197
+DWR +GAVTP+KNQ +CGCCWAF+AV A+EG+ I +GNL+ LSEQQ+LDC ++GN
Sbjct: 154 QVDWRQQGAVTPVKNQGQCGCCWAFSAVGAMEGLIMITTGNLVSLSEQQILDCDESDGNQ 213
Query: 198 GCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQAL 257
GC GG + AF Y+I N G+ TED YPY AV GTC Q AA IS ++++PSGDE AL
Sbjct: 214 GCNGGYMDNAFQYVINNGGVTTEDAYPYSAVQGTCQNVQP--AATISGFQDLPSGDENAL 271
Query: 258 LKAVSMQPVSIAIAAYSTEFQSYKEGIFNG-VCGTQLDHAVTIVGFGTTEDGANYWLIKN 316
AV+ QPVS+ + S+ FQ Y+ GI++G CGT ++HAVT +G+G + G YW++KN
Sbjct: 272 ANAVANQPVSVGVDGGSSPFQFYQGGIYDGDGCGTDMNHAVTAIGYGADDQGTQYWILKN 331
Query: 317 SWGNTWGDAGYMKIVRDEGLCGIGTRSSYP 346
SWG WG+ G+M++ G CGI T +SYP
Sbjct: 332 SWGTGWGENGFMQLQMGVGACGISTMASYP 361
>gi|413917937|gb|AFW57869.1| hypothetical protein ZEAMMB73_830006 [Zea mays]
Length = 443
Score = 333 bits (855), Expect = 5e-89, Method: Compositional matrix adjust.
Identities = 157/326 (48%), Positives = 222/326 (68%), Gaps = 12/326 (3%)
Query: 19 FIIITLLV-SCA---SQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKEN 74
F++++++ +CA S + +Q++V HE+WMA++ R Y D EK R ++FK N
Sbjct: 9 FVLLSVVAWACALSGSLAARDLADQDQAMVARHEEWMAKYDRVYSDAAEKARRFEVFKAN 68
Query: 75 LEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYK----MPSPSHRSTTSST-FKYQ 129
+ IE N GN + L N+F+DLT+DEFRA +TGY+ S RS T++T FKY
Sbjct: 69 MALIESVNA-GNHKFWLEANRFADLTDDEFRATWTGYRPKTAAASSKGRSRTATTGFKYA 127
Query: 130 NLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLL 189
N+S+ DVP S+DWR KGAVTPIKNQ ECGCCWAF+AVA++EG+ K+ +G L+ LSEQ+L+
Sbjct: 128 NVSLDDVPASVDWRTKGAVTPIKNQGECGCCWAFSAVASMEGVVKLSTGKLVSLSEQELV 187
Query: 190 DCSTNG-NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYE 247
DC NG + GC GG + AF +I+ N G+ TE YPY A GTC++ + AA I YE
Sbjct: 188 DCDVNGMDQGCEGGEMDDAFDFIVGNGGLTTESRYPYTASDGTCNSNEASGDAASIKGYE 247
Query: 248 EVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTED 307
+VP+ DE +L KAV+ QPVS+A+ + F+ YK G+ +G CGT+LDH + VG+G D
Sbjct: 248 DVPANDEASLRKAVANQPVSVAVDGGDSHFRFYKGGVLSGACGTELDHGIAAVGYGVASD 307
Query: 308 GANYWLIKNSWGNTWGDAGYMKIVRD 333
G YW++KNSWG +WG+AGY+++ RD
Sbjct: 308 GTKYWVMKNSWGTSWGEAGYIRMERD 333
>gi|400180387|gb|AFP73332.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 333 bits (855), Expect = 5e-89, Method: Compositional matrix adjust.
Identities = 168/345 (48%), Positives = 231/345 (66%), Gaps = 13/345 (3%)
Query: 10 SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+ K++ + I + ++S + RS + SV E HE WM++HGR YKDE+EK R
Sbjct: 2 AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKVERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N+F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
K +LS D+P++LDW + GAVT +K+Q CGCCWAF+AV ++EG KI +GNL++ SE
Sbjct: 120 LKINDLSDDDMPSNLDWIESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISN 245
Q+LLDC+TN N GC GG AF +I +N GI+ E +Y Y TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQEKTAAVQISS 238
Query: 246 YEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
Y+ VP G E +LL+AV+ QPVSI IAA S + Q Y G ++G C +++HAVT +G+GT
Sbjct: 239 YQVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTD 296
Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
E G YWL+KNSWG +WG+ G+MKI+RD GLC I SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGENGFMKIIRDYGNPAGLCDIAKMSSYP 341
>gi|302143416|emb|CBI21977.3| unnamed protein product [Vitis vinifera]
Length = 297
Score = 333 bits (855), Expect = 5e-89, Method: Compositional matrix adjust.
Identities = 164/304 (53%), Positives = 220/304 (72%), Gaps = 13/304 (4%)
Query: 51 MAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTG 110
MA++GR YKD EKE R KIFK+N+ IE NK ++TYKL N+F+DLTN+EFR+L
Sbjct: 1 MARYGRMYKDANEKEKRFKIFKDNVARIESFNKAMDKTYKLSINEFADLTNEEFRSLRNR 60
Query: 111 YKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVE 170
+K +H + ++TFKY+N+ T VP+++DWR KGAVTPIK+Q++CGCCWAF+AVAA E
Sbjct: 61 FK----AHICSEATTFKYENV--TAVPSTIDWRKKGAVTPIKDQQQCGCCWAFSAVAATE 114
Query: 171 GITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVP 229
GIT+I +G LI LSEQ+L+DC T G N GC GG + AF + I+ G+A+E YPY+
Sbjct: 115 GITQITTGKLISLSEQELVDCDTGGENQGCSGGLMDDAFRF-IKIHGLASEATYPYEGDD 173
Query: 230 GTCSAAQKP-AAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGV 288
GTC++ ++ AAKI YE+VP+ +E+AL KAV+ QPV++AI A EFQ Y G+F G
Sbjct: 174 GTCNSKKEAHPAAKIKGYEDVPANNEKALQKAVAHQPVAVAIDAGGFEFQFYTSGVFTGQ 233
Query: 289 CGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSS 344
CGT+LDH V VG+G +DG YWL+KNSWG WG+ GY+++ RD EGLCGI ++S
Sbjct: 234 CGTELDHGVAAVGYGIGDDGMMYWLVKNSWGTGWGEEGYIRMQRDVTAKEGLCGIAMQAS 293
Query: 345 YPLA 348
YP A
Sbjct: 294 YPTA 297
>gi|20334375|gb|AAM19208.1|AF493233_1 cysteine protease [Solanum pennellii]
Length = 337
Score = 333 bits (855), Expect = 6e-89, Method: Compositional matrix adjust.
Identities = 167/341 (48%), Positives = 231/341 (67%), Gaps = 12/341 (3%)
Query: 10 SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+ KI+ + I + ++S + RS + SV E HE WM++HGR YKDE+EK R
Sbjct: 2 AMKIDLMSILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQ 129
IFKEN+++IE NK GN +YKLG N+F+D+T+ EF A +TG +P+ S+ S +
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPN-SYLSPS----PIN 116
Query: 130 NLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLL 189
+LS D+P++LDWR+ GAVT +KNQ +CGCCWAF+AV ++EG KI +GNL++ SEQ+LL
Sbjct: 117 DLSDDDMPSNLDWRESGAVTQVKNQGQCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELL 176
Query: 190 DCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEV 249
DC+TN N GC GG AF +I +N GI+ E +Y Y TC + +K AA +IS+Y+ V
Sbjct: 177 DCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGQQYTCRSQEKTAAVQISSYQVV 235
Query: 250 PSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGA 309
P G E +LL+AV+ QPVSI IAA S + Q Y G ++G C +++HAVT +G+GT E G
Sbjct: 236 PEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCANRINHAVTAIGYGTDEKGQ 293
Query: 310 NYWLIKNSWGNTWGDAGYMKIVRDE----GLCGIGTRSSYP 346
YWL+KNSWG +WG+ G+MKI+RD GLC I SSYP
Sbjct: 294 KYWLLKNSWGTSWGEDGFMKIIRDSGNPAGLCDIAKVSSYP 334
>gi|224093956|ref|XP_002310053.1| predicted protein [Populus trichocarpa]
gi|224147016|ref|XP_002336386.1| predicted protein [Populus trichocarpa]
gi|222834869|gb|EEE73318.1| predicted protein [Populus trichocarpa]
gi|222852956|gb|EEE90503.1| predicted protein [Populus trichocarpa]
Length = 340
Score = 333 bits (855), Expect = 6e-89, Method: Compositional matrix adjust.
Identities = 165/340 (48%), Positives = 232/340 (68%), Gaps = 18/340 (5%)
Query: 19 FIIITLLVSCAS--QVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLE 76
F+ + LL + ++R+ + + E HE+WM Q+GR YKD+ E+ R IFKEN+
Sbjct: 9 FVCLALLFILGAWPSKSTARTLLDAPMYERHEQWMTQYGRVYKDDNERATRYSIFKENVA 68
Query: 77 YIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYK--MPSPSHRSTTSSTFKYQNLSMT 134
I+ N + ++YKLG NQF+DLTN+EF+A +K M SP + F+Y+N+S
Sbjct: 69 RIDAFNSQTGKSYKLGVNQFADLTNEEFKASRNRFKGHMCSPQ-----AGPFRYENVSA- 122
Query: 135 DVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN 194
VP+++DWR +GAVTP+K+Q +CGCCWAF+AVAA+EGI K+ +G LI LSEQ+++DC T
Sbjct: 123 -VPSTVDWRKEGAVTPVKDQGQCGCCWAFSAVAAMEGINKLTTGKLISLSEQEVVDCDTK 181
Query: 195 G-NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSG 252
G + GC GG + AF +I QN+G+ TE YPY+ GTC+ + AAKI+ +E+VP+
Sbjct: 182 GEDQGCNGGLMDDAFKFIEQNKGLTTEANYPYKGTDGTCNTNKAAIHAAKITGFEDVPAN 241
Query: 253 DEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYW 312
E AL+KAV+ QPVS+AI A ++FQ Y GIF G C TQLDH VT VG+G + DG+ YW
Sbjct: 242 SEAALMKAVAKQPVSVAIDAGGSDFQFYSSGIFTGSCDTQLDHGVTAVGYGVS-DGSKYW 300
Query: 313 LIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
L+KNSWG WG+ GY+++ +D EGLCGI ++SYP A
Sbjct: 301 LVKNSWGAQWGEEGYIRMQKDISAKEGLCGIAMQASYPTA 340
>gi|297740489|emb|CBI30671.3| unnamed protein product [Vitis vinifera]
Length = 320
Score = 333 bits (854), Expect = 7e-89, Method: Compositional matrix adjust.
Identities = 173/338 (51%), Positives = 226/338 (66%), Gaps = 32/338 (9%)
Query: 19 FIIITLLVS--CASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLE 76
I ITLL+ ASQ +S R+ HE S+ E HE WM +GR+YKD EKE R KIFKEN+E
Sbjct: 7 IICITLLIMGVWASQALS-RTLHEVSMSERHEDWMGLYGRTYKDIAEKERRFKIFKENVE 65
Query: 77 YIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDV 136
YIE NK F+A GY M S RS+ ++F+Y+N++ V
Sbjct: 66 YIESVNK--------------------FKASRNGYNMSSRP-RSSEITSFRYENVAA--V 102
Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG- 195
P+S+DWR KGAVTPIK+Q +CGCCWAF+AVAA+EG+T++++G LI LSEQ+L+DC T+G
Sbjct: 103 PSSMDWRKKGAVTPIKDQGQCGCCWAFSAVAAMEGVTQLKTGELISLSEQELVDCDTSGE 162
Query: 196 NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAK-ISNYEEVPSGDE 254
+ GC GG + AF +II N G+ TE YPY+ V TC+ + ++A I NYE+VP+ E
Sbjct: 163 DQGCGGGLMDSAFEFIIGNGGLTTEANYPYKGVDATCNKKKAASSAAKIKNYEDVPANSE 222
Query: 255 QALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLI 314
ALLKAV+ PVS+AI A ++FQ Y G+F G CGT+LDH VT VG+G T+DG YWL+
Sbjct: 223 AALLKAVAQHPVSVAIDAGGSDFQFYSSGVFTGQCGTELDHGVTAVGYGKTDDGTKYWLV 282
Query: 315 KNSWGNTWGDAGYMKIVR----DEGLCGIGTRSSYPLA 348
KNSWG WG+ GY+ + R DEGLCGI +SYP A
Sbjct: 283 KNSWGTGWGEDGYIWMERDIGADEGLCGIAMEASYPTA 320
>gi|400180437|gb|AFP73356.1| cysteine protease [Solanum pennellii]
Length = 337
Score = 333 bits (853), Expect = 9e-89, Method: Compositional matrix adjust.
Identities = 166/341 (48%), Positives = 231/341 (67%), Gaps = 12/341 (3%)
Query: 10 SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+ K++ + I + ++S + RS + SV E HE WM++HGR YKDE+EK R
Sbjct: 2 AMKVDLMSILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQ 129
IFKEN+++IE NK GN +YKLG N+F+D+T+ EF A +TG +P+ S+ S +
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPN-SYLSPS----PIN 116
Query: 130 NLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLL 189
+LS D+P++LDWR+ GAVT +KNQ +CGCCWAF+AV ++EG KI +GNL++ SEQ+LL
Sbjct: 117 DLSDDDMPSNLDWRESGAVTQVKNQGQCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELL 176
Query: 190 DCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEV 249
DC+TN N GC GG AF +I +N GI+ E +Y Y TC + +K AA +IS+Y+ V
Sbjct: 177 DCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGQQYTCRSQEKTAAVQISSYQVV 235
Query: 250 PSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGA 309
P G E +LL+AV+ QPVSI IAA S + Q Y G ++G C +++HAVT +G+GT E G
Sbjct: 236 PEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCANRINHAVTAIGYGTDEKGQ 293
Query: 310 NYWLIKNSWGNTWGDAGYMKIVRDE----GLCGIGTRSSYP 346
YWL+KNSWG +WG+ G+MKI+RD GLC I SSYP
Sbjct: 294 KYWLLKNSWGTSWGEDGFMKIIRDSGNPAGLCDIAKVSSYP 334
>gi|356543122|ref|XP_003540012.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 342
Score = 332 bits (852), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 167/330 (50%), Positives = 222/330 (67%), Gaps = 16/330 (4%)
Query: 28 CASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNR 87
C SQV SR H+ S+ E HE+WM ++G+ YKD E E R IF+ N+E+IE N GN+
Sbjct: 20 CTSQV-KSRKLHDASMYERHEQWMEKYGKVYKDSAEXEKRFLIFENNVEFIESFNAAGNK 78
Query: 88 TYKLGTNQFSDLTNDEFRALYTGYKMPSPSH----RSTTSSTFKYQNLSMTDVPTSLDWR 143
YKL N +D TN+EF A + GYK SH R TT + FKY+N+ TD+P ++DWR
Sbjct: 79 PYKLSINHLADQTNEEFMASHKGYK---GSHWQGLRITTQTPFKYENV--TDIPWAVDWR 133
Query: 144 DKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGS 203
KG T IK+Q +CG CWAF+AVAA EGI +I +GNL+ LSEQ+L+DC + ++GC GG
Sbjct: 134 QKGDATSIKDQGQCGICWAFSAVAATEGIYQITTGNLVSLSEQELVDCDSV-DHGCDGGL 192
Query: 204 REKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVS 262
E F +II+N GI++E YPY AV GTC ++ + A+I YE VP E+ L KAV+
Sbjct: 193 MEHGFEFIIKNGGISSEANYPYTAVNGTCDTNKEASPGAQIKGYETVPVNCEEELQKAVA 252
Query: 263 MQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTW 322
QPVS++I A + FQ Y G+F G CGTQLDH VT VG+G+T+DG YW++KNSWG W
Sbjct: 253 NQPVSVSIDAGGSAFQFYSSGVFTGQCGTQLDHGVTAVGYGSTDDGIQYWIVKNSWGTQW 312
Query: 323 GDAGYMKIVR----DEGLCGIGTRSSYPLA 348
G+ GY++++R EGLCGI +SYP A
Sbjct: 313 GEEGYIRMLRGIDAQEGLCGIAMDASYPTA 342
>gi|116309130|emb|CAH66233.1| H0825G02.10 [Oryza sativa Indica Group]
Length = 339
Score = 332 bits (852), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 157/336 (46%), Positives = 219/336 (65%), Gaps = 9/336 (2%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
+F I++ L C++ + + + ++V HE+WM Q+GR YKD EK R +IFK N+ +
Sbjct: 8 LFAILSCLCLCSAVLAAREQSDHAAMVARHERWMEQYGRVYKDATEKARRFEIFKANVAF 67
Query: 78 IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
IE N GN + L NQF+DLTN EFRA T + R T TF+Y+N+S+ +P
Sbjct: 68 IESFNA-GNHKFWLSVNQFADLTNYEFRATKTNKGFIPSTVRVPT--TFRYENVSIDTLP 124
Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-N 196
++DWR KGAVTPIK+Q +CGCCWAF+AVAA+EGI K+ +G LI LSEQ+L+DC +G +
Sbjct: 125 ATVDWRTKGAVTPIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLISLSEQELVDCDVHGED 184
Query: 197 NGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQA 256
GC GG + AF +II+N G+ TE +YPY A G C+ +AA I YE+VP+ +E A
Sbjct: 185 QGCEGGLMDDAFKFIIKNGGLTTESKYPYTAADGKCNGGSN-SAATIKGYEDVPANNEAA 243
Query: 257 LLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKN 316
L+KAV+ QPVS+A+ FQ Y G+ G CGT LDH + +G+G DG YWL+KN
Sbjct: 244 LMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIVAIGYGKDGDGTQYWLLKN 303
Query: 317 SWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
SWG TWG+ G++++ +D G+CG+ SYP A
Sbjct: 304 SWGTTWGENGFLRMEKDISDKRGMCGLAMEPSYPTA 339
>gi|9502421|gb|AAF88120.1|AC021043_13 Putative cysteine proteinase [Arabidopsis thaliana]
Length = 331
Score = 332 bits (852), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 166/328 (50%), Positives = 223/328 (67%), Gaps = 11/328 (3%)
Query: 30 SQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTY 89
SQ S + HE V E H++WM + R Y DELEK+MR +FK+NL++IEK NK+G+RTY
Sbjct: 6 SQATSRVTFHEPIVAEHHQQWMTRFSRVYSDELEKQMRFDVFKKNLKFIEKFNKKGDRTY 65
Query: 90 KLGTNQFSDLTNDEFRALYTGYK----MPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDK 145
KLG N+F+D T +EF A +TG K +PS ++ + N+S + DWR +
Sbjct: 66 KLGVNEFADWTREEFIATHTGLKGVNGIPSSEFVDEMIPSWNW-NVSDVAGRETKDWRYE 124
Query: 146 GAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSRE 205
GAVTP+K Q +CGCCWAF++VAAVEG+TKI NL+ LSEQQLLDC +NGC GG
Sbjct: 125 GAVTPVKYQGQCGCCWAFSSVAAVEGLTKIVGNNLVSLSEQQLLDCDRERDNGCNGGIMS 184
Query: 206 KAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQP 265
AF+YII+N+GIA+E YPYQA GTC KP+A I ++ VPS +E+ALL+AVS QP
Sbjct: 185 DAFSYIIKNRGIASEASYPYQAAEGTCRYNGKPSAW-IRGFQTVPSNNERALLEAVSKQP 243
Query: 266 VSIAIAAYSTEFQSYKEGIFN-GVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGD 324
VS++I A F Y G+++ CGT ++HAVT VG+GT+ +G YWL KNSWG TWG+
Sbjct: 244 VSVSIDADGPGFMHYSGGVYDEPYCGTNVNHAVTFVGYGTSPEGIKYWLAKNSWGETWGE 303
Query: 325 AGYMKIVRD----EGLCGIGTRSSYPLA 348
GY++I RD +G+CG+ + YP+A
Sbjct: 304 NGYIRIRRDVAWPQGMCGVAQYAFYPVA 331
>gi|38345008|emb|CAD40026.2| OSJNBa0052O21.11 [Oryza sativa Japonica Group]
gi|125589414|gb|EAZ29764.1| hypothetical protein OsJ_13822 [Oryza sativa Japonica Group]
Length = 339
Score = 332 bits (851), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 153/336 (45%), Positives = 219/336 (65%), Gaps = 9/336 (2%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
+F I+ L C++ + + + + ++ HE+WMAQ+GR Y+D+ EK R ++FK N+ +
Sbjct: 8 LFAILGCLCLCSAVLAARELSDDAAMAARHERWMAQYGRVYRDDAEKARRFEVFKANVAF 67
Query: 78 IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
IE N GN + LG NQF+DLTNDEFR + T + R T F+Y+N+++ +P
Sbjct: 68 IESFNA-GNHNFWLGVNQFADLTNDEFRWMKTNKGFIPSTTRVPTG--FRYENVNIDALP 124
Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-N 196
++DWR KGAVTPIK+Q +CGCCWAF+AVAA+EGI K+ +G LI LSEQ+L+DC +G +
Sbjct: 125 ATVDWRTKGAVTPIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLISLSEQELVDCDVHGED 184
Query: 197 NGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQA 256
GC GG + AF +II+N G+ TE YPY A C + + A I YE+VP+ +E A
Sbjct: 185 QGCEGGLMDDAFKFIIKNGGLTTESNYPYAAADDKCKSVSN-SVASIKGYEDVPANNEAA 243
Query: 257 LLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKN 316
L+KAV+ QPVS+A+ FQ YK G+ G CGT LDH + +G+G DG YWL+KN
Sbjct: 244 LMKAVANQPVSVAVDGGDMTFQFYKGGVMTGSCGTDLDHGIVAIGYGKASDGTKYWLLKN 303
Query: 317 SWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
SWG TWG+ G++++ +D G+CG+ SYP A
Sbjct: 304 SWGTTWGENGFLRMEKDISDKRGMCGLAMEPSYPTA 339
>gi|356515050|ref|XP_003526214.1| PREDICTED: vignain-like [Glycine max]
Length = 344
Score = 332 bits (851), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 168/339 (49%), Positives = 230/339 (67%), Gaps = 12/339 (3%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
M + L SQV+ R H+ ++ E HE WMA++G+ YKD EKE R +IFK+N+E+
Sbjct: 10 MLALFLFLAVGISQVMP-RKLHQTALRERHENWMAEYGKIYKDAAEKEKRFQIFKDNVEF 68
Query: 78 IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTS-STFKYQNLSMTDV 136
IE N GN+ YKLG N +DLT +EF+ G K +T + FKY+N+ TD+
Sbjct: 69 IESFNAAGNKPYKLGVNHLADLTLEEFKDSRNGLKRTYEFSTTTFKLNGFKYENV--TDI 126
Query: 137 PTSLDWRDKGAVTPIKNQ-KECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG 195
P ++DWR KGAVTPIK+Q +CG CWAF+ VAA EGI +I +G L+ LSEQ+L+DC +
Sbjct: 127 PEAIDWRVKGAVTPIKDQGDQCGSCWAFSTVAATEGIYQISTGMLMSLSEQELVDCDSV- 185
Query: 196 NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDE 254
++GC GG E F +II+N GI++E YPY AV GTC A+++ + AA+I YE VP+ E
Sbjct: 186 DHGCDGGLMEDGFEFIIKNGGISSEANYPYTAVDGTCDASKEASPAAQIKGYETVPANSE 245
Query: 255 QALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGAN-YWL 313
+AL +AV+ QPVS++I A + FQ Y G+F G CGTQLDH VT+VG+GTT+DG + YW+
Sbjct: 246 EALQQAVANQPVSVSIDAGGSGFQFYSSGVFTGQCGTQLDHGVTVVGYGTTDDGTHEYWI 305
Query: 314 IKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
+KNSWG WG+ GY+++ R EGLCGI +SYP A
Sbjct: 306 VKNSWGTQWGEEGYIRMQRGIDALEGLCGIAMDASYPTA 344
>gi|357474725|ref|XP_003607647.1| Cysteine proteinase [Medicago truncatula]
gi|355508702|gb|AES89844.1| Cysteine proteinase [Medicago truncatula]
Length = 340
Score = 332 bits (851), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 162/322 (50%), Positives = 219/322 (68%), Gaps = 12/322 (3%)
Query: 33 VSSRSTHEQ-SVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKL 91
V SR +E S+ E HE+WM+++G+ YKD +EKE R IFK+N+E+IE N N+ YKL
Sbjct: 25 VMSRKLYESPSLQERHEQWMSEYGKLYKDAIEKEKRFMIFKDNVEFIESFNAADNKPYKL 84
Query: 92 GTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPI 151
N +DLT DEF+A GYK R +++FKY+N+ T +P ++DWR KGAVTPI
Sbjct: 85 SVNHLADLTLDEFKASRNGYK---KIDREFATTSFKYENV--TAIPEAVDWRVKGAVTPI 139
Query: 152 KNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFAY 210
K+Q +CG CWAF+ VAA+EGI +I +G LI LSEQ+L+DC T G + GC GG E F +
Sbjct: 140 KDQGQCGSCWAFSTVAAIEGINQITTGKLISLSEQELVDCDTKGEDQGCEGGLMEDGFEF 199
Query: 211 IIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAI 270
II+N GI +E YPY+A G+C+ A AKI+ YE+VP E +LLKAV+ QP+S++I
Sbjct: 200 IIKNGGITSETNYPYKAADGSCNTATTAPVAKITGYEKVPVNSEISLLKAVANQPISVSI 259
Query: 271 AAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKI 330
A + F Y GI+ G CGT+LDH VT VG+G+ +G +YW++KNSWG WG+ GY+++
Sbjct: 260 DASDSSFMFYSSGIYTGECGTELDHGVTAVGYGSA-NGTDYWIVKNSWGTVWGEKGYIRM 318
Query: 331 VR----DEGLCGIGTRSSYPLA 348
R EGLCGI SSYP A
Sbjct: 319 QRGIADKEGLCGIAMDSSYPTA 340
>gi|224162986|ref|XP_002338508.1| predicted protein [Populus trichocarpa]
gi|222872535|gb|EEF09666.1| predicted protein [Populus trichocarpa]
Length = 306
Score = 332 bits (851), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 161/314 (51%), Positives = 221/314 (70%), Gaps = 16/314 (5%)
Query: 43 VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTND 102
+ E HE+WM Q+GR YKD+ E+ R IFKEN+ I+ N + ++YKLG NQF+DLTN+
Sbjct: 1 MYERHEQWMTQYGRVYKDDNERATRYSIFKENVARIDAFNSQTGKSYKLGVNQFADLTNE 60
Query: 103 EFRALYTGYK--MPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCC 160
EF+A +K M SP + F+Y+N+S VP+++DWR +GAVTP+K+Q +CGCC
Sbjct: 61 EFKASRNRFKGHMCSPQ-----AGPFRYENVSA--VPSTVDWRKEGAVTPVKDQGQCGCC 113
Query: 161 WAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFAYIIQNQGIAT 219
WAF+AVAA+EGI K+ +G LI LSEQ+++DC T G + GC GG + AF +I QN+G+ T
Sbjct: 114 WAFSAVAAMEGINKLTTGKLISLSEQEVVDCDTKGEDQGCNGGLMDDAFKFIEQNKGLTT 173
Query: 220 EDEYPYQAVPGTCSAAQKP-AAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQ 278
E YPY+ GTC+ + AAKI+ +E+VP+ E AL+KAV+ QPVS+AI A ++FQ
Sbjct: 174 EANYPYKGTDGTCNTKKSAIHAAKITGFEDVPANSEAALMKAVAKQPVSVAIDAGGSDFQ 233
Query: 279 SYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----E 334
Y GIF G C TQLDH VT VG+G + DG+ YWL+KNSWG WG+ GY+++ +D E
Sbjct: 234 FYSSGIFTGSCDTQLDHGVTAVGYGVS-DGSKYWLVKNSWGAQWGEEGYIRMQKDISAKE 292
Query: 335 GLCGIGTRSSYPLA 348
GLCGI ++SYP A
Sbjct: 293 GLCGIAMQASYPTA 306
>gi|357452869|ref|XP_003596711.1| Cysteine proteinase [Medicago truncatula]
gi|355485759|gb|AES66962.1| Cysteine proteinase [Medicago truncatula]
Length = 344
Score = 332 bits (850), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 171/345 (49%), Positives = 221/345 (64%), Gaps = 21/345 (6%)
Query: 14 NTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKE 73
N +F I+TL S V+SSR ++E HE+WM +HG+ YKD EKE R +IFKE
Sbjct: 11 NILTLFFILTLWTSL---VISSR------LLEKHEQWMEEHGKFYKDAAEKEQRFQIFKE 61
Query: 74 NLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALY-TGYKMPSPS---HRSTTSSTFKYQ 129
NLE+IE N G+ + L NQF D TNDEF+A Y G K P S F+Y+
Sbjct: 62 NLEFIESFNAAGDNGFNLSINQFGDQTNDEFKANYLNGKKKPLIGVGIAAIEEESVFRYE 121
Query: 130 NLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLL 189
N+ T+VP ++DWR++GAVTPIK+Q CG CWAFA VAA+EGI +I +G L+ LSEQ+L+
Sbjct: 122 NV--TEVPATMDWRERGAVTPIKHQHLCGSCWAFATVAAIEGIHQITTGRLVSLSEQELV 179
Query: 190 DC-STNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKP-AAAKISNYE 247
DC TN +GC GG E A +I++ GI +E YPY V G C+ + AKI YE
Sbjct: 180 DCVKTNTTDGCNGGYVEDACDFIVKKGGITSETNYPYTRVDGKCNVRKGTYNVAKIKGYE 239
Query: 248 EVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTED 307
VP+ +E+ALLKAV+ QP+++ IAA FQ Y GI G CG LDH VTIVG+GT++D
Sbjct: 240 HVPANNEKALLKAVANQPIAVYIAATKRAFQFYSSGILKGKCGIDLDHTVTIVGYGTSDD 299
Query: 308 GANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
G YWL+KNSWG WG+ GY+KI RD EG CGI +YP+
Sbjct: 300 GVKYWLVKNSWGTKWGEKGYIKIKRDVHAKEGSCGIAMVPTYPIV 344
>gi|116309178|emb|CAH66275.1| OSIGBa0147O06.5 [Oryza sativa Indica Group]
Length = 339
Score = 332 bits (850), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 153/336 (45%), Positives = 218/336 (64%), Gaps = 9/336 (2%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
+F I+ L C++ + + + + ++ HE+WMAQ+GR YKD+ EK R ++FK N+ +
Sbjct: 8 LFAILGCLCLCSAVLAARELSDDAAMAARHERWMAQYGRMYKDDAEKARRFEVFKANVAF 67
Query: 78 IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
IE N GN + LG NQF+DLTNDEFR+ T + R T F+Y+N+++ +P
Sbjct: 68 IESFNA-GNHKFWLGVNQFADLTNDEFRSTKTNKGFIPSTTRVPTG--FRYENVNIDALP 124
Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-N 196
++DWR KG VTPIK+Q +CGCCWAF+AVAA+EGI K+ +G LI LSEQ+L+DC +G +
Sbjct: 125 ATMDWRTKGVVTPIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLISLSEQELVDCDVHGED 184
Query: 197 NGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQA 256
GC GG + AF +II+N G+ TE YPY A C + + A I YE+VP+ +E A
Sbjct: 185 QGCEGGLMDDAFKFIIKNGGLTTESNYPYAAADDKCKSVSN-SVASIKGYEDVPANNEAA 243
Query: 257 LLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKN 316
L+KAV+ QPVS+A+ FQ YK G+ G CGT LDH + +G+G DG YWL+KN
Sbjct: 244 LMKAVANQPVSVAVDGGDMTFQFYKGGVMTGSCGTDLDHGIVAIGYGKASDGTKYWLLKN 303
Query: 317 SWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
SWG TWG+ G++++ +D G+CG+ SYP A
Sbjct: 304 SWGTTWGENGFLRMEKDISDKRGMCGLAMEPSYPTA 339
>gi|18396939|ref|NP_564320.1| Papain family cysteine protease [Arabidopsis thaliana]
gi|9502427|gb|AAF88126.1|AC021043_19 Putative cysteine proteinase [Arabidopsis thaliana]
gi|67633400|gb|AAY78625.1| peptidase C1A papain family protein [Arabidopsis thaliana]
gi|332192919|gb|AEE31040.1| Papain family cysteine protease [Arabidopsis thaliana]
Length = 346
Score = 332 bits (850), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 161/314 (51%), Positives = 222/314 (70%), Gaps = 8/314 (2%)
Query: 42 SVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTN 101
S+V+ H++WM Q R Y DE EK++RL++ ENL++IE N GN++YKLG N+F+D T
Sbjct: 34 SIVDYHQQWMIQFSRVYDDEFEKQLRLQVLTENLKFIESFNNMGNQSYKLGVNEFTDWTK 93
Query: 102 DEFRALYTGYK-MPSPSHRSTTSSTFKYQNLSMTDV-PTSLDWRDKGAVTPIKNQKECGC 159
+EF A YTG + + S + T N +++DV T+ DWR++GAVTP+K+Q ECG
Sbjct: 94 EEFLATYTGLRGVNVTSPFEVVNETKPAWNWTVSDVLGTNKDWRNEGAVTPVKSQGECGG 153
Query: 160 CWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIAT 219
CWAF+A+AAVEG+TKI GNLI LSEQQLLDC+ NNGC GG+ AF YII+++GI++
Sbjct: 154 CWAFSAIAAVEGLTKIARGNLISLSEQQLLDCTREQNNGCKGGTFVNAFNYIIKHRGISS 213
Query: 220 EDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQS 279
E+EYPYQ G C + +PA I +E VPS +E+ALL+AVS QPV++AI A F
Sbjct: 214 ENEYPYQVKEGPCRSNARPAIL-IRGFENVPSNNERALLEAVSRQPVAVAIDASEAGFVH 272
Query: 280 YKEGIFNGV-CGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----E 334
Y G++N CGT ++HAVT+VG+GT+ +G YWL KNSWG TWG+ GY++I RD +
Sbjct: 273 YSGGVYNARNCGTSVNHAVTLVGYGTSPEGMKYWLAKNSWGKTWGENGYIRIRRDVEWPQ 332
Query: 335 GLCGIGTRSSYPLA 348
G+CG+ +SYP+A
Sbjct: 333 GMCGVAQYASYPVA 346
>gi|125547236|gb|EAY93058.1| hypothetical protein OsI_14861 [Oryza sativa Indica Group]
Length = 339
Score = 331 bits (848), Expect = 4e-88, Method: Compositional matrix adjust.
Identities = 153/336 (45%), Positives = 218/336 (64%), Gaps = 9/336 (2%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
+F I+ L C++ + + + + ++ HE+WMAQ+GR Y+D+ EK R ++FK N+ +
Sbjct: 8 LFAILGCLCLCSAVLAARELSDDAAMAARHERWMAQYGRVYRDDAEKARRFEVFKANVAF 67
Query: 78 IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
IE N GN + LG NQF+DLTNDEFR T + R T F+Y+N+++ +P
Sbjct: 68 IESFNA-GNHNFWLGVNQFADLTNDEFRWTKTNKGFIPSTTRVPTG--FRYENVNIDALP 124
Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-N 196
++DWR KGAVTPIK+Q +CGCCWAF+AVAA+EGI K+ +G LI LSEQ+L+DC +G +
Sbjct: 125 ATVDWRTKGAVTPIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLISLSEQELVDCDVHGED 184
Query: 197 NGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQA 256
GC GG + AF +II+N G+ TE YPY A C + + A I YE+VP+ +E A
Sbjct: 185 QGCEGGLMDDAFKFIIKNGGLTTESNYPYAAADDKCKSVSN-SVASIKGYEDVPANNEAA 243
Query: 257 LLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKN 316
L+KAV+ QPVS+A+ FQ YK G+ G CGT LDH + +G+G DG YWL+KN
Sbjct: 244 LMKAVANQPVSVAVDGGDMTFQFYKGGVMTGSCGTDLDHGIVAIGYGKASDGTKYWLLKN 303
Query: 317 SWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
SWG TWG+ G++++ +D G+CG+ SYP A
Sbjct: 304 SWGTTWGENGFLRMEKDISDKRGMCGLAMEPSYPTA 339
>gi|21666724|gb|AAM73806.1|AF448505_1 cysteine proteinase [Brassica napus]
gi|21666726|gb|AAM73807.1|AF448506_1 cysteine proteinase [Brassica napus]
Length = 343
Score = 330 bits (847), Expect = 5e-88, Method: Compositional matrix adjust.
Identities = 159/335 (47%), Positives = 224/335 (66%), Gaps = 7/335 (2%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
+F+I++L+ S + SR E ++ + H WM +HGR Y D EK R +FK N+E
Sbjct: 8 IFLIVSLVSSFSLSTTLSRPLDEVTMQKRHAAWMTEHGRVYADANEKNNRYVVFKRNVES 67
Query: 78 IEKANK-EGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDV 136
IE+ N+ + T+KL NQF+DLTN+EFR++YTGYK S T ++F+YQ++S +
Sbjct: 68 IERLNEVQYGLTFKLAVNQFADLTNEEFRSMYTGYKGNSVLSSRTKPTSFRYQHVSSDAL 127
Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGN 196
P S+DWR KGAVTPIK+Q CG CWAF+AVAA+EG+ +I+ G LI LSEQ+L+DC TN +
Sbjct: 128 PISVDWRKKGAVTPIKDQGSCGSCWAFSAVAAIEGVAQIKKGKLISLSEQELVDCDTN-D 186
Query: 197 NGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQ-KPAAAKISNYEEVPSGDEQ 255
+GC+GG AF Y + G+ +E YPY++ GTC+ + K A I +E+VP+ DE+
Sbjct: 187 DGCMGGYMNSAFNYTMTTGGLTSESNYPYKSTDGTCNINKTKQIATSIKGFEDVPANDEK 246
Query: 256 ALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIK 315
AL+KAV+ PVSI IA T FQ Y G+F+G C T LDH V +VG+G + +G+ YW++K
Sbjct: 247 ALMKAVAHHPVSIGIAGGGTGFQFYSSGVFSGECSTHLDHGVAVVGYGKSSNGSKYWILK 306
Query: 316 NSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
NSWG WG+ GYM+I +D G CG+ +SYP
Sbjct: 307 NSWGPKWGERGYMRIKKDTKAKHGQCGLAMNASYP 341
>gi|357160599|ref|XP_003578815.1| PREDICTED: vignain-like [Brachypodium distachyon]
Length = 339
Score = 328 bits (842), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 155/341 (45%), Positives = 219/341 (64%), Gaps = 9/341 (2%)
Query: 13 INTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFK 72
I + I+ L C+S + + + S+ HE WMAQ+GR YKD EK + ++FK
Sbjct: 3 IPKASILAILGCLCFCSSVLAARELNDDLSMAARHETWMAQYGRVYKDAAEKAQKFEVFK 62
Query: 73 ENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
N +I+ N E N + LG NQF+DLTN+EF+A T S +++ S+ FKY+NL
Sbjct: 63 ANARFIDSFNAE-NHKFWLGINQFADLTNEEFKATKTNKGFIS--NKARVSTGFKYENLK 119
Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
+ +PTS+DWR KGAVTP+K+Q +CGCCWAF+AVAA EGI K+ +G L+ LSEQ+L+DC
Sbjct: 120 IEALPTSIDWRTKGAVTPVKDQGQCGCCWAFSAVAATEGIVKLSTGKLVSLSEQELVDCD 179
Query: 193 TNG-NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPS 251
+G + GC GG + AF +II N G+ E YPY A G C + K +A I +YE+VP+
Sbjct: 180 VHGEDQGCEGGLMDDAFKFIITNGGLTQESSYPYDAEDGKCKSGSK-SAGTIKSYEDVPA 238
Query: 252 GDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANY 311
+E AL+KAV+ QPVS+A+ FQ Y G+ G CGT LDH + +G+G T DG +
Sbjct: 239 NNEGALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYGVTSDGTKF 298
Query: 312 WLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
WL+KNSWG TWG+ G++++ +D +G+CG+ SYP A
Sbjct: 299 WLMKNSWGTTWGENGFLRMEKDIADKKGMCGLAMEPSYPTA 339
>gi|356521444|ref|XP_003529366.1| PREDICTED: thiol protease SEN102-like [Glycine max]
Length = 340
Score = 328 bits (841), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 166/335 (49%), Positives = 221/335 (65%), Gaps = 14/335 (4%)
Query: 19 FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
F ++ L C + SSR+ E S+ HE+WMA H R Y D EK+ R +IFKENLE+I
Sbjct: 13 FFMLFLTCICRA---SSRTLSESSIATQHEEWMAMHDRVYADSAEKDRRQQIFKENLEFI 69
Query: 79 EKANKEGNRTYKLGTNQFSDLTNDEFRALYTG--YKMPSPSHRSTTSSTFKYQNLSMTDV 136
EK N EG + Y L N F+DLTN+EF A +TG YK P+ + + + +S+ D+
Sbjct: 70 EKHNNEGKKRYNLSLNSFADLTNEEFVASHTGALYKPPTQLGSFKINHSLGFHKMSVGDI 129
Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGN 196
SLDWR +GAV IKNQ CG CWAF+AVAAVEGI +I++G L+ LSEQ L+DC++ N
Sbjct: 130 EASLDWRKRGAVNDIKNQGRCGSCWAFSAVAAVEGINQIKNGQLVSLSEQNLVDCAS--N 187
Query: 197 NGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQA 256
+GC G EKAF Y I++ G+A E+EYPY GTCS P A +I Y+ V +E+
Sbjct: 188 DGCHGQYVEKAFDY-IRDYGLANEEEYPYVETVGTCSGNSNP-AIQIRGYQSVTPQNEEQ 245
Query: 257 LLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKN 316
LL AV+ QPVS+ + A FQ Y G+F+G CGT+L+HAVTIVG+G +G YWLI+N
Sbjct: 246 LLTAVASQPVSVLLEAKGQGFQFYSGGVFSGECGTELNHAVTIVGYGEEAEG-KYWLIRN 304
Query: 317 SWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
SWG +WG+ GYMK++RD +GLCGI ++SYP
Sbjct: 305 SWGKSWGEGGYMKLMRDTGNPQGLCGINMQASYPF 339
>gi|356517308|ref|XP_003527330.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 342
Score = 328 bits (841), Expect = 3e-87, Method: Compositional matrix adjust.
Identities = 158/336 (47%), Positives = 222/336 (66%), Gaps = 8/336 (2%)
Query: 19 FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
++I+ L+++ + V SR E E HEKWMAQ+GR YKD EKE R ++FK N+ +I
Sbjct: 9 YLILFLVLAVWTSHVMSRRLSEACTSERHEKWMAQYGRVYKDAAEKEKRFQVFKNNVHFI 68
Query: 79 EKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPT 138
E N G++ + L NQF+DL ++EF+AL + + ++T ++F+Y+ S+T +P
Sbjct: 69 ESFNAAGDKPFNLSINQFADLNDEEFKALLINVQKKASWVETSTETSFRYE--SVTKIPA 126
Query: 139 SLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNG 198
++DWR +GAVTPIK+Q CG CWAF+AVAA EGI +I +G L+ LSEQ+L+DC + G
Sbjct: 127 TIDWRKRGAVTPIKDQGRCGSCWAFSAVAATEGIHQITTGKLVPLSEQELVDCVKGESEG 186
Query: 199 CLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKP-AAAKISNYEEVPSGDEQAL 257
C+GG + AF +I + GIA+E YPY+ V TC ++ A+I YE+VPS +E+AL
Sbjct: 187 CIGGYVDDAFEFIAKKGGIASETHYPYKGVNKTCKVKKETHGVAEIKGYEKVPSNNEKAL 246
Query: 258 LKAVSMQPVSIAIAAYSTEFQSYKEGIFNGV-CGTQLDHAVTIVGFGTTEDGANYWLIKN 316
LKAV+ QPVS+ I A + F+ Y GIFN CGT +HAV +VG+G DG+ YWL+KN
Sbjct: 247 LKAVANQPVSVYIDAGTHAFKYYSSGIFNARNCGTDPNHAVAVVGYGKALDGSKYWLVKN 306
Query: 317 SWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
SWG WG+ GY++I RD EGLCGI YP A
Sbjct: 307 SWGTEWGERGYIRIKRDIRAKEGLCGIAKYPYYPTA 342
>gi|356515056|ref|XP_003526217.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 342
Score = 328 bits (841), Expect = 3e-87, Method: Compositional matrix adjust.
Identities = 159/336 (47%), Positives = 222/336 (66%), Gaps = 8/336 (2%)
Query: 19 FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
++I+ L++S + V SR E E HEKWMAQ+GR YKD EKE R ++FK N+ +I
Sbjct: 9 YLILFLVLSVWTSHVMSRRLSEACTSERHEKWMAQYGRVYKDAAEKEKRFQVFKNNVHFI 68
Query: 79 EKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPT 138
E N G++ + L NQF+DL ++EF+AL + + ++T ++F+Y+ S+T +P
Sbjct: 69 ESFNAAGDKPFNLSINQFADLNDEEFKALLINVQKKASWVETSTQTSFRYE--SVTKIPA 126
Query: 139 SLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNG 198
++DWR +GAVTPIK+Q CG CWAF+AVAA EGI +I +G L+ LSEQ+L+DC + G
Sbjct: 127 TIDWRKRGAVTPIKDQGRCGSCWAFSAVAATEGIHQITTGKLVPLSEQELVDCVKGESEG 186
Query: 199 CLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKP-AAAKISNYEEVPSGDEQAL 257
C+GG + AF +I + GIA+E YPY+ V TC ++ A+I YE+VPS +E+AL
Sbjct: 187 CIGGYVDDAFEFIAKKGGIASETHYPYKGVNKTCKVKKETHGVAEIKGYEKVPSNNEKAL 246
Query: 258 LKAVSMQPVSIAIAAYSTEFQSYKEGIFN-GVCGTQLDHAVTIVGFGTTEDGANYWLIKN 316
LKAV+ QPVS+ I A + F+ Y GIFN CGT +HAV +VG+G DG+ YWL+KN
Sbjct: 247 LKAVANQPVSVYIDAGTHAFKYYSSGIFNVRNCGTDPNHAVAVVGYGKALDGSKYWLVKN 306
Query: 317 SWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
SWG WG+ GY++I RD EGLCGI YP A
Sbjct: 307 SWGTEWGERGYIRIKRDIRAKEGLCGIAKYPYYPTA 342
>gi|297851332|ref|XP_002893547.1| peptidase C1A papain family protein [Arabidopsis lyrata subsp.
lyrata]
gi|297339389|gb|EFH69806.1| peptidase C1A papain family protein [Arabidopsis lyrata subsp.
lyrata]
Length = 345
Score = 328 bits (840), Expect = 3e-87, Method: Compositional matrix adjust.
Identities = 161/326 (49%), Positives = 223/326 (68%), Gaps = 8/326 (2%)
Query: 30 SQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTY 89
S+ S + HE ++ H+KWM R Y DE EK+MRL++F ENL++IE N G+++Y
Sbjct: 21 SEATSRVALHEPTIFYYHQKWMINFSRVYDDEFEKQMRLEVFTENLKFIENFNNMGSQSY 80
Query: 90 KLGTNQFSDLTNDEFRALYTGYK-MPSPSHRSTTSSTFKYQNLSMTDV-PTSLDWRDKGA 147
KLG N+F+D T +EF A +TG + S + T N +++DV T+ DWR++GA
Sbjct: 81 KLGVNKFTDWTKEEFLATHTGLSGINVTSPFEVVNETTPAWNWTVSDVLGTTKDWRNEGA 140
Query: 148 VTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKA 207
VTP+K Q ECG CWAF+A+AAVEG+TKI GNLI LSEQQLLDC+ NNGC GG+ +A
Sbjct: 141 VTPVKYQGECGGCWAFSAIAAVEGLTKIARGNLISLSEQQLLDCAREQNNGCKGGTMIEA 200
Query: 208 FAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVS 267
F YI++N G+++E+ YPYQ G C + PA I +E VPS +E+ALL+AVS QPV+
Sbjct: 201 FNYIVKNGGVSSENAYPYQVKEGPCRSNDIPAIV-IRGFENVPSNNERALLEAVSRQPVA 259
Query: 268 IAIAAYSTEFQSYKEGIFNGV-CGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAG 326
+ I A T F Y G++N CGT ++HAVT+VG+GT+++G YWL KNSWG TWG+ G
Sbjct: 260 VDIDASETGFIHYSGGVYNARDCGTSVNHAVTLVGYGTSQEGIKYWLAKNSWGKTWGENG 319
Query: 327 YMKIVRD----EGLCGIGTRSSYPLA 348
Y++I RD +G+CG+ +SYP+A
Sbjct: 320 YIRIRRDVEWPQGMCGVAQYASYPVA 345
>gi|449448298|ref|XP_004141903.1| PREDICTED: germination-specific cysteine protease 1-like [Cucumis
sativus]
gi|449531757|ref|XP_004172852.1| PREDICTED: germination-specific cysteine protease 1-like [Cucumis
sativus]
Length = 365
Score = 327 bits (839), Expect = 4e-87, Method: Compositional matrix adjust.
Identities = 160/341 (46%), Positives = 227/341 (66%), Gaps = 13/341 (3%)
Query: 15 TTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKEN 74
TT + ++ +S ++ +S RS E V EI++ W+A+HG++Y E+E R +IFKEN
Sbjct: 5 TTSLALLSFFFLSISASALSRRSDGE--VREIYDLWLAKHGKAYNGIDEREKRFQIFKEN 62
Query: 75 LEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTF--KYQNLS 132
L++I+ N E NRTYK+G N F+DLTN+E+RALY G + P P+ R + T +Y +
Sbjct: 63 LKFIDDHNSE-NRTYKVGLNMFADLTNEEYRALYLGTRSP-PARRVMKAKTASRRYAVNN 120
Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
+ +P S+DWR +GAV P+KNQ CG CWAF+ +AAVEGI +I +G LI LSEQ+L+ C
Sbjct: 121 LDRLPESMDWRTRGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGELISLSEQELVSCD 180
Query: 193 TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPS 251
N+GC GG + AF +II N G+ TE++YPY+A G C +K A I YE+VP+
Sbjct: 181 KKYNSGCNGGLMDYAFQFIIDNGGLDTEEDYPYEAFDGQCDPTRKNAKVVSIDAYEDVPA 240
Query: 252 GDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANY 311
DE++L KAV+ QPVS+AI A Q Y+ G+F G CG+ LDH V VG+G E+G +Y
Sbjct: 241 NDEESLKKAVAHQPVSVAIEASGLALQLYQSGVFTGKCGSALDHGVVAVGYG-KENGVDY 299
Query: 312 WLIKNSWGNTWGDAGYMKIVRD-----EGLCGIGTRSSYPL 347
WL++NSWG +WG+ GY K+ R+ EG CGI ++SYP+
Sbjct: 300 WLVRNSWGTSWGEDGYFKLERNVKHITEGKCGIAMQASYPV 340
>gi|212275830|ref|NP_001130503.1| cysteine protease 1 [Zea mays]
gi|194689328|gb|ACF78748.1| unknown [Zea mays]
gi|219886279|gb|ACL53514.1| unknown [Zea mays]
gi|238010470|gb|ACR36270.1| unknown [Zea mays]
gi|413920875|gb|AFW60807.1| cysteine protease 1 [Zea mays]
Length = 354
Score = 327 bits (839), Expect = 4e-87, Method: Compositional matrix adjust.
Identities = 166/342 (48%), Positives = 232/342 (67%), Gaps = 17/342 (4%)
Query: 18 MFIIITLLVSCASQVVSSRSTH---EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKEN 74
+ I+ + ++ +SS ST E+++ H++WMA+HGR+Y+DE EK R ++FK N
Sbjct: 19 LTILAVTTMMAEARDLSSTSTGGYGEEAMKVRHQQWMAEHGRTYRDEAEKAHRFQVFKAN 78
Query: 75 LEYIEKANKEGN--RTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
++++ +N G+ ++Y+L N+F+D+TNDEF A+YTG + P P+ + FKY N++
Sbjct: 79 ADFVDASNAAGDDKKSYRLELNEFADMTNDEFMAMYTGLR-PVPAGAKKMAG-FKYGNVT 136
Query: 133 MTDVPT---SLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLL 189
++D ++DWR KGAVT IKNQ +CGCCWAFAAVAAVEGI +I +GNL+ LSEQQ+L
Sbjct: 137 LSDADDDQQTVDWRQKGAVTGIKNQGQCGCCWAFAAVAAVEGIHQITTGNLVSLSEQQVL 196
Query: 190 DCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEV 249
DC T+GNNGC GG + AF YI+ N G+ TED YPY A C + Q AA IS Y++V
Sbjct: 197 DCDTDGNNGCNGGYIDNAFQYIVGNGGLGTEDAYPYTAAQAMCQSVQPVAA--ISGYQDV 254
Query: 250 PSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGV-CGT--QLDHAVTIVGFGTTE 306
PSGDE AL AV+ QPVS+AI A++ FQ Y G+ C T L+HAVT VG+GT E
Sbjct: 255 PSGDEAALAAAVANQPVSVAIDAHN--FQLYGGGVMTAASCSTPPNLNHAVTAVGYGTAE 312
Query: 307 DGANYWLIKNSWGNTWGDAGYMKIVRDEGLCGIGTRSSYPLA 348
DG YWL+KN WG WG+ GY+++ R CG+ ++SYP+A
Sbjct: 313 DGTPYWLLKNQWGQNWGEGGYLRLERGANACGVAQQASYPVA 354
>gi|5823020|gb|AAD53012.1|AF089849_1 senescence-specific cysteine protease [Brassica napus]
Length = 344
Score = 327 bits (837), Expect = 6e-87, Method: Compositional matrix adjust.
Identities = 160/336 (47%), Positives = 224/336 (66%), Gaps = 8/336 (2%)
Query: 18 MFIIITLLVSCASQVVSSRST-HEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLE 76
+F+I++L+ S + + SR E ++ + H +WM +HGR Y D EK R +FK N+E
Sbjct: 8 IFLIVSLVSSFSLSITLSRPLLDEVAMQKRHAEWMTEHGRVYADANEKNNRYAVFKRNVE 67
Query: 77 YIEKANK-EGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD 135
IE+ N + T+KL NQF+DLTN+EFR++YTG+K S T ++F+YQN+S
Sbjct: 68 RIERLNDVQSGLTFKLAVNQFADLTNEEFRSMYTGFKGNSVLSSRTKPTSFRYQNVSSDA 127
Query: 136 VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG 195
+P S+DWR KGAVTPIK+Q CG CWAF+AVAA+EG+ +I+ G LI LSEQ+L+DC TN
Sbjct: 128 LPVSVDWRKKGAVTPIKDQGLCGSCWAFSAVAAIEGVAQIKKGKLISLSEQELVDCDTN- 186
Query: 196 NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQ-KPAAAKISNYEEVPSGDE 254
+ GC+GG + AF Y I G+ +E YPY++ GTC+ + K A I +E+VP+ DE
Sbjct: 187 DGGCMGGLMDTAFNYTITIGGLTSESNYPYKSTNGTCNFNKTKQIATSIKGFEDVPANDE 246
Query: 255 QALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLI 314
+AL+KAV+ PVSI IA FQ Y G+F+G C T LDH VT VG+G +++G YW++
Sbjct: 247 KALMKAVAHHPVSIGIAGGDIGFQFYSSGVFSGECTTHLDHGVTAVGYGRSKNGLKYWIL 306
Query: 315 KNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
KNSWG WG+ GYM+I +D G CG+ +SYP
Sbjct: 307 KNSWGPKWGERGYMRIKKDIKPKHGQCGLAMNASYP 342
>gi|310656790|gb|ADP02219.1| Peptidase_C1 domain-containing protein [Triticum aestivum]
Length = 419
Score = 327 bits (837), Expect = 6e-87, Method: Compositional matrix adjust.
Identities = 153/318 (48%), Positives = 209/318 (65%), Gaps = 5/318 (1%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
+ II + C+S V+S+R + ++VE HE+WMA+ R YKD EK R K FK N+ +
Sbjct: 8 LLAIIGSICLCSSTVLSARELGDAAMVEKHEQWMAKFNRVYKDSTEKAQRFKAFKANVAF 67
Query: 78 IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
IE N GN + LG NQF+DLTNDEFRA T + R+ T FKY N+S +P
Sbjct: 68 IESFNT-GNHKFWLGVNQFTDLTNDEFRATKTNKGLKRNGARAPTR--FKYNNVSTDALP 124
Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-N 196
++DWR KG VTPIK+Q +CGCCWAF+AVAA EGI K+ +G L+ LSEQ+L+DC +G +
Sbjct: 125 AAVDWRTKGVVTPIKDQGQCGCCWAFSAVAATEGIVKLSTGKLVSLSEQELVDCDVHGVD 184
Query: 197 NGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTC-SAAQKPAAAKISNYEEVPSGDEQ 255
GC GG + AF +II+N G+ TE YPY A G C ++ + A I YE+VP+ DE
Sbjct: 185 QGCEGGEMDNAFKFIIKNGGLTTEANYPYTAQDGQCKTSTTSNSVATIKGYEDVPANDES 244
Query: 256 ALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIK 315
+L+KAV+ QPVS+A+ FQ Y G+ G CGT LDH + +G+G T DG +WL+K
Sbjct: 245 SLMKAVANQPVSVAVDGGDVIFQHYSGGVMTGSCGTDLDHGIVAIGYGMTSDGTKFWLLK 304
Query: 316 NSWGNTWGDAGYMKIVRD 333
NSWG TWG++GY+++ +D
Sbjct: 305 NSWGTTWGESGYLRMEKD 322
>gi|357167196|ref|XP_003581047.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like
[Brachypodium distachyon]
Length = 338
Score = 327 bits (837), Expect = 7e-87, Method: Compositional matrix adjust.
Identities = 153/338 (45%), Positives = 219/338 (64%), Gaps = 10/338 (2%)
Query: 19 FIIITLLVSCASQVVSSRSTHEQS--VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLE 76
F+ ++ + A + +R + + HE+WMA++GR Y D EK RL++FK N+
Sbjct: 3 FLFALVVCTFALGALGARDLADDDWLIAARHEQWMARYGRVYSDVAEKARRLEVFKANVG 62
Query: 77 YIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDV 136
+IE N GN + L NQF+D+T DEFRA++ GYKM ++ + F+Y N+S+ D+
Sbjct: 63 FIESVNA-GNHKFWLEANQFADITKDEFRAMHKGYKMQVIGSKARATG-FRYANVSIDDL 120
Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-G 195
P S+DWR GAVTP+K+Q +CGCCWAF+ VA++EGI K+ +G LI LSEQ+L+DC
Sbjct: 121 PASVDWRANGAVTPVKDQGQCGCCWAFSTVASMEGIVKVSTGKLISLSEQELVDCDVGMQ 180
Query: 196 NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKP-AAAKISNYEEVPSGDE 254
N GC GG + AF +I+ N G+ TE +YPY GTC++ ++ AA I YE+VP+ DE
Sbjct: 181 NKGCGGGLMDNAFEFIVNNGGLDTEADYPYTGADGTCNSNKESNIAASIKGYEDVPANDE 240
Query: 255 QALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLI 314
+L KAV+ QPVSIA+ F+ YK G+ G CGT+LDH V VG+G DG YWL+
Sbjct: 241 ASLQKAVAAQPVSIAVDGGDDLFRFYKGGVLTGACGTELDHGVAAVGYGVAGDGTKYWLV 300
Query: 315 KNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
KNSWG +WG+ G++++ RD G+CG+ + SYP A
Sbjct: 301 KNSWGTSWGEDGFIRLERDVADEAGMCGLAMKPSYPTA 338
>gi|1208549|gb|AAC49455.1| Pseudotzain [Pseudotsuga menziesii]
Length = 454
Score = 325 bits (834), Expect = 1e-86, Method: Compositional matrix adjust.
Identities = 153/314 (48%), Positives = 220/314 (70%), Gaps = 8/314 (2%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
+ +++E++E W+AQH ++Y EK+ + +FK+N YI + N +GN +YKLG NQF+DL
Sbjct: 37 DDAIMELYELWLAQHKKAYNGLDEKQKKFSVFKDNFLYIHQHNNQGNPSYKLGLNQFADL 96
Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGC 159
+++EF+A Y G K+ + R + S + +YQ D+P S+DWR+KGAVT +KNQ CG
Sbjct: 97 SHEEFKAAYLGTKLDA-KKRLSRSPSPRYQYSVGEDLPESIDWREKGAVTAVKNQGSCGS 155
Query: 160 CWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIAT 219
CWAF+ VAAVEGI +I +GNL LSEQ+L+DC T+ N GC GG + AF +II N G+ +
Sbjct: 156 CWAFSTVAAVEGINQIVTGNLTSLSEQELVDCDTSYNQGCNGGLMDYAFQFIISNGGLDS 215
Query: 220 EDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQ 278
ED+YPY+A G+C A +K A I +YE+VP DE++L KA + QP+S+AI A FQ
Sbjct: 216 EDDYPYKANNGSCDAYRKNAHVVTIDDYEDVPENDEKSLKKAAANQPISVAIEASGRAFQ 275
Query: 279 SYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----- 333
Y+ G+F CGTQLDH VT+VG+G +E G +YWL+KNSWGN+WG+ G++K+ R+
Sbjct: 276 FYESGVFTSNCGTQLDHGVTLVGYG-SESGIDYWLVKNSWGNSWGEKGFIKLQRNLEGAS 334
Query: 334 EGLCGIGTRSSYPL 347
G+CGI +SYP+
Sbjct: 335 TGMCGIAMEASYPV 348
>gi|356545118|ref|XP_003540992.1| PREDICTED: thiol protease SEN102-like [Glycine max]
Length = 337
Score = 325 bits (833), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 165/335 (49%), Positives = 218/335 (65%), Gaps = 14/335 (4%)
Query: 20 IIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIE 79
I + LL++ + SR HE S+ E HE+WMA++G+ YKD EKE R IFK N+E+IE
Sbjct: 11 IALFLLLALGIPQMMSRKLHETSMRERHEQWMAEYGKVYKDAAEKEKRFLIFKHNVEFIE 70
Query: 80 KANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTS 139
N N+ YKLG N +DLT +EF+A G K P +++ FKY+N+ T +P +
Sbjct: 71 SFNAAANKPYKLGVNHLADLTVEEFKASRNGLKRP----YELSTTPFKYENV--TAIPAA 124
Query: 140 LDWRDKGAVTPIKNQKEC-GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NN 197
+DWR KGAVT IK+Q +C G CWAF+ VAA EGI +I +G L+ LSEQ+L+DC T G +
Sbjct: 125 IDWRTKGAVTSIKDQGQCAGSCWAFSTVAATEGIHQITTGKLVSLSEQELVDCDTKGVDQ 184
Query: 198 GCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQAL 257
GC GG E F +II+N GI +E YPY+AV G C+ A P A+I YE+VP E+ L
Sbjct: 185 GCEGGYMEDGFEFIIKNGGITSEANYPYKAVDGKCNKATSP-VAQIKGYEKVPPNSEKTL 243
Query: 258 LKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNS 317
KAV+ QPVS++I A F Y GI+NG CGT+LDH VT VG+G +G +YWL+KNS
Sbjct: 244 QKAVANQPVSVSIDANGEGFMFYSSGIYNGECGTELDHGVTAVGYGIA-NGTDYWLVKNS 302
Query: 318 WGNTWGDAGYMKIVR----DEGLCGIGTRSSYPLA 348
WG WG+ GY+++ R GLCGI SSYP A
Sbjct: 303 WGTQWGEKGYVRMQRGVAAKHGLCGIALDSSYPTA 337
>gi|226502454|ref|NP_001140922.1| hypothetical protein [Zea mays]
gi|223948637|gb|ACN28402.1| unknown [Zea mays]
gi|413920877|gb|AFW60809.1| hypothetical protein ZEAMMB73_830238 [Zea mays]
Length = 354
Score = 325 bits (833), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 166/342 (48%), Positives = 231/342 (67%), Gaps = 17/342 (4%)
Query: 18 MFIIITLLVSCASQVVSSRSTH---EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKEN 74
+ I+ + ++ +SS ST E+++ H++WMA+HGR+Y+DE EK R ++FK N
Sbjct: 19 LTILAVKTMMAEARDLSSTSTGGYGEEAMKVRHQQWMAEHGRTYRDEAEKAHRFQVFKAN 78
Query: 75 LEYIEKANKEGN--RTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
++++ +N G+ ++Y++ N+F+D+TNDEF A+YTG + P P+ + FKY N++
Sbjct: 79 ADFVDASNAAGDDKKSYRMELNEFADMTNDEFMAMYTGLR-PVPAGAKKMAG-FKYGNVT 136
Query: 133 MTDVPT---SLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLL 189
++D ++DWR KGAVT IKNQ +CGCCWAFAAVAAVEGI +I +GNL+ LSEQQ+L
Sbjct: 137 LSDADDNQQTVDWRQKGAVTGIKNQGQCGCCWAFAAVAAVEGIHQITTGNLVSLSEQQVL 196
Query: 190 DCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEV 249
DC T GNNGC GG + AF YI N G+ATED YPY A C + Q AA IS Y++V
Sbjct: 197 DCDTEGNNGCNGGYIDNAFQYIAGNGGLATEDAYPYTAAQAMCQSVQPVAA--ISGYQDV 254
Query: 250 PSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGV-CGT--QLDHAVTIVGFGTTE 306
PSGDE AL AV+ QPVS+AI A++ FQ Y G+ C T L+HAVT VG+GT E
Sbjct: 255 PSGDEAALAAAVANQPVSVAIDAHN--FQLYGGGVMTAASCSTPPNLNHAVTAVGYGTAE 312
Query: 307 DGANYWLIKNSWGNTWGDAGYMKIVRDEGLCGIGTRSSYPLA 348
DG YWL+KN WG WG+ GY+++ R CG+ ++SYP+A
Sbjct: 313 DGTPYWLLKNQWGQNWGEGGYLRLERGANACGVAQQASYPVA 354
>gi|356515052|ref|XP_003526215.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 339
Score = 325 bits (832), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 156/333 (46%), Positives = 220/333 (66%), Gaps = 9/333 (2%)
Query: 19 FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
++I+ L+++ + V SR E E HEKWMAQ+G+ Y D EKE R +IFK N+++I
Sbjct: 9 YLILFLILTVWTFHVMSRRLSEVCTSERHEKWMAQYGKLYTDAAEKEKRFQIFKNNVQFI 68
Query: 79 EKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPT 138
E N G++ + L NQF+DL N+EF+A + + T ++F+Y+ S+T +P
Sbjct: 69 ESFNAAGDKPFNLSINQFADLHNEEFKASLINVQKKESGVETATETSFRYE--SITKIPV 126
Query: 139 SLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNG 198
++DWR +GAVTPIK+Q CG CWAF+ VAA+EGI +I +G L+ LSEQ+L+DC + G
Sbjct: 127 TMDWRKRGAVTPIKDQGNCGSCWAFSTVAAIEGIHQITTGKLVSLSEQELVDCVKGKSEG 186
Query: 199 CLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKP-AAAKISNYEEVPSGDEQAL 257
C G +E+AF ++ +N G+A+E YPY+A TC ++ A+I YE VPS E+AL
Sbjct: 187 CNFGYKEEAFEFVAKNGGLASEISYPYKANNKTCMVKKETQGVAQIKGYENVPSNSEKAL 246
Query: 258 LKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNS 317
LKAV+ QPVS+ I A + +F Y GIF G CGT +HAVT++G+G GA YWL+KNS
Sbjct: 247 LKAVANQPVSVYIDAGALQF--YSSGIFTGKCGTAPNHAVTVIGYGKARGGAKYWLVKNS 304
Query: 318 WGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
WG WG+ GY+K+ RD EGLCGI T +SYP
Sbjct: 305 WGTKWGEKGYIKMKRDIRAKEGLCGIATNASYP 337
>gi|116786779|gb|ABK24233.1| unknown [Picea sitchensis]
Length = 463
Score = 325 bits (832), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 151/314 (48%), Positives = 219/314 (69%), Gaps = 8/314 (2%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
+ +++E++E W+AQH ++Y EK+ R +FK+N YI + N +GN +YKLG NQF+DL
Sbjct: 37 DDAIMELYELWLAQHKKAYNGLGEKQNRFSVFKDNFLYIHQHNNQGNPSYKLGLNQFADL 96
Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGC 159
+++EF+A Y G K+ + R + S + +YQ D+P S+DWR+KGAVT +K+Q CG
Sbjct: 97 SHEEFKATYLGAKLDT-KKRLSNSPSPRYQYSDGEDLPESIDWREKGAVTAVKDQGSCGS 155
Query: 160 CWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIAT 219
CWAF+ VAAVEGI +I +GNL LSEQ+L+DC T+ N GC GG + AF +II N G+ +
Sbjct: 156 CWAFSTVAAVEGINQIVTGNLTSLSEQELVDCDTSYNQGCNGGLMDYAFQFIINNGGLDS 215
Query: 220 EDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQ 278
ED+YPY+A G+C A +K A I +YE+VP DE++L KA + QP+S+AI A FQ
Sbjct: 216 EDDYPYKANDGSCDAYRKNAHVVTIDDYEDVPENDEKSLKKAAANQPISVAIEASGRAFQ 275
Query: 279 SYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----- 333
Y+ G+F CGTQLDH VT+VG+G +E G +YW++KNSWG +WG+ G++++ R+
Sbjct: 276 FYESGVFTSTCGTQLDHGVTLVGYG-SESGTDYWIVKNSWGKSWGEKGFIRLQRNIEGVS 334
Query: 334 EGLCGIGTRSSYPL 347
G+CGI +SYPL
Sbjct: 335 TGMCGIAMEASYPL 348
>gi|357160572|ref|XP_003578808.1| PREDICTED: vignain-like [Brachypodium distachyon]
Length = 339
Score = 324 bits (831), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 153/341 (44%), Positives = 216/341 (63%), Gaps = 9/341 (2%)
Query: 13 INTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFK 72
I + I+ L C S + + + S+V HE WM Q+GR YKD EK + ++FK
Sbjct: 3 IPKASLLAILGCLCLCGSVLAARELNDDLSMVARHENWMLQYGRVYKDAAEKAQKFEVFK 62
Query: 73 ENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
N E+I N GN + LG NQF+D+TN+EF+A T S R T F Y+N+S
Sbjct: 63 ANAEFINSFNA-GNHKFWLGINQFADITNEEFKATKTNKGFISNKVRVPTG--FMYENMS 119
Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
+P ++DWR KGAVTPIK+Q +CGCCWAF+AVAA+EGI K+ +G L+ LSEQ+L+DC
Sbjct: 120 FDALPATIDWRTKGAVTPIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLVSLSEQELVDCD 179
Query: 193 TNG-NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPS 251
+G + GC GG + AF +II+N G+ E YPY A G C + +AA I +YE+VP+
Sbjct: 180 VHGEDQGCEGGLMDDAFKFIIKNGGLTQESNYPYDAADGKCKSGSS-SAATIKSYEDVPA 238
Query: 252 GDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANY 311
+E AL+KAV+ QPVS+A+ FQ Y G+ G CGT LDH + +G+GTT DG +
Sbjct: 239 NNEGALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYGTTSDGTKF 298
Query: 312 WLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
W++KNSWG +WG+ G++++ +D +G+CG+ SYP A
Sbjct: 299 WIMKNSWGTSWGENGFLRMEKDIADKKGMCGLAMEPSYPTA 339
>gi|302143412|emb|CBI21973.3| unnamed protein product [Vitis vinifera]
Length = 320
Score = 324 bits (831), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 166/338 (49%), Positives = 228/338 (67%), Gaps = 37/338 (10%)
Query: 19 FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
++ +L + ASQ ++R+ HE S+ E HE WM Q+GR YKD EK R KIFK+N+ I
Sbjct: 12 LALLFVLAAWASQA-TARNLHEASMYERHEDWMVQYGREYKDADEKSKRYKIFKDNVARI 70
Query: 79 EKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHR-STTSSTFKYQNLSMTDVP 137
E NK +++YKL N+F+DLTN+EFRA +K +H ST +++FKY+N+ T VP
Sbjct: 71 ESFNKAMDKSYKLSINEFADLTNEEFRASRNRFK----AHICSTEATSFKYENV--TAVP 124
Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-N 196
+++DWR KGAVTPIK+Q +CG CWAF+AVAA+EGIT++ +G LI LSEQ+L+DC T+G +
Sbjct: 125 STVDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGED 184
Query: 197 NGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCS--AAQKPAAAKISNYEEVPSGDE 254
GC YPY GTC+ A P AAKI+ YE+VP+ +E
Sbjct: 185 QGCT---------------------NYPYAGTDGTCNRKKAAHP-AAKINGYEDVPANNE 222
Query: 255 QALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLI 314
+AL KAV+ QP+++AI A +EFQ Y G+F G CGT+LDH V+ VG+GT++DG YWL+
Sbjct: 223 KALQKAVAHQPIAVAIDAGGSEFQFYSSGVFTGQCGTELDHGVSAVGYGTSDDGMKYWLV 282
Query: 315 KNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
KNSWG WG+ GY+++ RD EGLCGI ++SYP A
Sbjct: 283 KNSWGTGWGEEGYIRMQRDVTAKEGLCGIAMQASYPTA 320
>gi|356533293|ref|XP_003535200.1| PREDICTED: LOW QUALITY PROTEIN: cysteine proteinase RD21a-like
[Glycine max]
Length = 466
Score = 324 bits (830), Expect = 5e-86, Method: Compositional matrix adjust.
Identities = 153/316 (48%), Positives = 216/316 (68%), Gaps = 9/316 (2%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
E ++E W+ +HG++Y EKE R KIFK+NL +IE+ N G+++YKLG N+F+DL
Sbjct: 41 ESHTRHVYEAWLVKHGKAYNALGEKERRFKIFKDNLRFIEEHNGAGDKSYKLGLNKFADL 100
Query: 100 TNDEFRALYTGYKMPSPSHRSTTSS--TFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKEC 157
TN+E+RA++ G + P +++ + T +Y + ++P +DWR+KGAVTPIK+Q +C
Sbjct: 101 TNEEYRAMFLGTRTRGPKNKAAVVAKKTDRYAYRAGEELPAMVDWREKGAVTPIKDQGQC 160
Query: 158 GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
G CWAF+ V AVEGI +I +GNL LSEQ+L+DC N GC GG + AF +I+QN GI
Sbjct: 161 GSCWAFSTVGAVEGINQIVTGNLTSLSEQELVDCDRGYNMGCNGGLMDYAFEFIVQNGGI 220
Query: 218 ATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTE 276
TE++YPY A TC +K A I YE+VP+ DE++L+KAV+ QPVS+AI A E
Sbjct: 221 DTEEDYPYHAKDNTCDPNRKNARVVTIDGYEDVPTNDEKSLMKAVANQPVSVAIEAGGME 280
Query: 277 FQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR---- 332
FQ Y+ G+F G CGT LDH V VG+G TE+G +YWL++NSWG+ WG+ GY+K+ R
Sbjct: 281 FQLYQSGVFTGRCGTNLDHGVVAVGYG-TENGTDYWLVRNSWGSAWGENGYIKLERNVQN 339
Query: 333 -DEGLCGIGTRSSYPL 347
+ G CGI +SYP+
Sbjct: 340 TETGKCGIAIEASYPI 355
>gi|225458701|ref|XP_002284973.1| PREDICTED: cysteine proteinase RD21a-like [Vitis vinifera]
Length = 467
Score = 324 bits (830), Expect = 5e-86, Method: Compositional matrix adjust.
Identities = 160/344 (46%), Positives = 227/344 (65%), Gaps = 17/344 (4%)
Query: 18 MFIIITLLVSCASQVVSSRSTH--------EQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+F+++ L + ++ TH ++ V+ ++E W+A+HG+SY EKE R +
Sbjct: 14 LFLLLGLASALDMSIIGYDETHGDKSSWRTDEDVMAVYEAWLAKHGKSYNALGEKERRFQ 73
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQ 129
IFK+NL +I++ N E NRTYK+G N+F+DLTN+E+R++Y G + + RS+ + +Y
Sbjct: 74 IFKDNLRFIDEHNAE-NRTYKVGLNRFADLTNEEYRSMYLGTRTAA-KRRSSNKISDRYA 131
Query: 130 NLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLL 189
+P S+DWR KGAV +K+Q CG CWAF+ +AAVEGI KI +G LI LSEQ+L+
Sbjct: 132 FRVGDSLPESVDWRKKGAVVEVKDQGSCGSCWAFSTIAAVEGINKIVTGGLISLSEQELV 191
Query: 190 DCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEE 248
DC T+ N GC GG + AF +II N GI +E++YPY+A G C +K A I YE+
Sbjct: 192 DCDTSYNEGCNGGLMDYAFEFIINNGGIDSEEDYPYKASDGRCDQYRKNAKVVTIDGYED 251
Query: 249 VPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDG 308
VP DE++L KAV+ QPVS+AI A EFQ Y+ GIF G CGT LDH VT VG+G TE+G
Sbjct: 252 VPENDEKSLEKAVANQPVSVAIEAGGREFQLYQSGIFTGRCGTALDHGVTAVGYG-TENG 310
Query: 309 ANYWLIKNSWGNTWGDAGYMKIVRD-----EGLCGIGTRSSYPL 347
+YW++KNSWG +WG+ GY+++ RD G CGI +SYP+
Sbjct: 311 VDYWIVKNSWGASWGEEGYIRMERDLATSATGKCGIAMEASYPI 354
>gi|302143411|emb|CBI21972.3| unnamed protein product [Vitis vinifera]
Length = 320
Score = 323 bits (829), Expect = 6e-86, Method: Compositional matrix adjust.
Identities = 166/338 (49%), Positives = 226/338 (66%), Gaps = 37/338 (10%)
Query: 19 FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
++ +L + ASQ ++RS HE S+ E HE WM Q+GR YKD EK R KIFK+N+ I
Sbjct: 12 LALLFVLAAWASQA-TARSLHEASMYERHEDWMVQYGREYKDADEKSKRYKIFKDNVARI 70
Query: 79 EKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHR-STTSSTFKYQNLSMTDVP 137
E NK +++YKL N+F+DLTN+EFRA +K +H ST +++FKY+N+ T VP
Sbjct: 71 ESFNKAMDKSYKLSINEFADLTNEEFRASRNRFK----AHICSTEATSFKYENV--TAVP 124
Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-N 196
+++DWR KGAVTPIK+Q +CG CWAF+AVAA+EGIT++ +G LI LSEQ+L+DC T+G +
Sbjct: 125 STVDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGED 184
Query: 197 NGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCS--AAQKPAAAKISNYEEVPSGDE 254
GC YPY GTC+ A P AAKI+ YE+VP+ +E
Sbjct: 185 QGCT---------------------NYPYAGTDGTCNRKKAAHP-AAKINGYEDVPANNE 222
Query: 255 QALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLI 314
+AL KAV+ QP+++AI A +EFQ Y G+F G CGT+LDH V VG+GT++DG YWL+
Sbjct: 223 KALQKAVAHQPIAVAIDASGSEFQFYSSGVFTGQCGTELDHGVAAVGYGTSDDGMKYWLV 282
Query: 315 KNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
KNSW WG+ GY+++ RD EGLCGI ++SYP A
Sbjct: 283 KNSWSTGWGEEGYIRMQRDVTAKEGLCGIAMQASYPTA 320
>gi|357130141|ref|XP_003566711.1| PREDICTED: xylem cysteine proteinase 1-like [Brachypodium
distachyon]
Length = 457
Score = 323 bits (829), Expect = 6e-86, Method: Compositional matrix adjust.
Identities = 155/311 (49%), Positives = 217/311 (69%), Gaps = 10/311 (3%)
Query: 43 VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTND 102
++E+ EKW+A+H ++Y EK R ++FK+NL++I+K N+E +Y LG N+F+DLT++
Sbjct: 146 IIELFEKWLAKHQKAYASFEEKLHRFEVFKDNLKHIDKVNREVT-SYWLGLNEFADLTHE 204
Query: 103 EFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWA 162
EF+A Y G P+P+ S S FKY+++S D+P S+DWR KGAVT +KNQ +CG CWA
Sbjct: 205 EFKATYLGLAPPAPARESRGS--FKYEDVSADDLPKSVDWRTKGAVTEVKNQGQCGSCWA 262
Query: 163 FAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDE 222
F+ VAAVEGI I +GNL LSEQ+L+DCS +GNNGC GG + AF+YI + G+ TE+
Sbjct: 263 FSTVAAVEGINAIVTGNLTALSEQELIDCSVDGNNGCNGGLMDYAFSYIASSGGLHTEEA 322
Query: 223 YPYQAVPGTCSAAQK--PAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSY 280
YPY G+C +K A IS YE+VP+ +EQAL+KA++ QPVS+AI A FQ Y
Sbjct: 323 YPYLMEEGSCGDGKKSESEAVTISGYEDVPAHNEQALIKALAHQPVSVAIEASGRHFQFY 382
Query: 281 KEGIFNGVCGTQLDHAVTIVGFGTTE-DGANYWLIKNSWGNTWGDAGYMKIVR----DEG 335
G+F+G CGTQLDH V VG+G+ + G +Y +++NSWG WG+ GY+++ R EG
Sbjct: 383 SGGVFDGPCGTQLDHGVAAVGYGSDKGKGHDYIIVRNSWGAKWGEKGYIRMKRGTGKGEG 442
Query: 336 LCGIGTRSSYP 346
LCGI +SYP
Sbjct: 443 LCGINKMASYP 453
>gi|171702829|dbj|BAG16370.1| cysteine protease [Brassica oleracea var. italica]
Length = 332
Score = 323 bits (829), Expect = 6e-86, Method: Compositional matrix adjust.
Identities = 156/328 (47%), Positives = 220/328 (67%), Gaps = 7/328 (2%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
+F+I++L+ S + SR E ++ + H WM +HGR Y D EK R +FK N+E
Sbjct: 2 IFLIVSLVSSFSLSTTLSRPLDEVTMQKRHAAWMTEHGRVYADANEKNNRYVVFKRNVES 61
Query: 78 IEKANK-EGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDV 136
IE+ N+ + T+KL NQF+DLTN+EFR++YTGYK S T ++F+YQ++S +
Sbjct: 62 IERLNEVQYGLTFKLAVNQFADLTNEEFRSMYTGYKGNSVLSSRTKPTSFRYQHVSSDAL 121
Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGN 196
P S+DWR KGAVTPIK+Q CG CWAF+AVAA+EG+ +I+ G LI LSEQ+L+DC TN +
Sbjct: 122 PISVDWRKKGAVTPIKDQGSCGSCWAFSAVAAIEGVAQIKKGKLISLSEQELVDCDTN-D 180
Query: 197 NGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQ-KPAAAKISNYEEVPSGDEQ 255
+GC+GG AF Y + G+ +E YPY++ GTC+ + K A I +E+VP+ DE+
Sbjct: 181 DGCMGGYMNSAFNYTMTTGGLTSESNYPYKSTDGTCNINKTKQIATSIKGFEDVPANDEK 240
Query: 256 ALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIK 315
AL+KAV+ PVSI IA T FQ Y G+F+G C T LDH V +VG+G + +G+ YW++K
Sbjct: 241 ALMKAVAHHPVSIGIAGGGTGFQFYSSGVFSGECSTHLDHGVAVVGYGKSSNGSKYWILK 300
Query: 316 NSWGNTWGDAGYMKIVRD----EGLCGI 339
NSWG WG+ GYM+I +D G CG+
Sbjct: 301 NSWGPKWGERGYMRIKKDTKAKHGQCGL 328
>gi|357113934|ref|XP_003558756.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like
[Brachypodium distachyon]
Length = 346
Score = 323 bits (828), Expect = 7e-86, Method: Compositional matrix adjust.
Identities = 152/337 (45%), Positives = 216/337 (64%), Gaps = 10/337 (2%)
Query: 18 MFIIITLLVSCASQVVSSR--STHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENL 75
+ I+ L C++ V+++R + ++ HE+WMAQ GR YKD EK RL++FK N+
Sbjct: 10 LVAIVGCLCLCSTAVLAARELGDADNAMAARHEQWMAQFGRVYKDPAEKAHRLEVFKANV 69
Query: 76 EYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD 135
+IE N E N + LG NQF+DLTNDEFRA T + R + FKY ++S+
Sbjct: 70 AFIESFNAE-NHEFWLGANQFADLTNDEFRASKTNKGIKQGGVRDAPTG-FKYSDVSIDA 127
Query: 136 VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG 195
+P S+DWR KGAVTPIKNQ +CG CWAF+AVAA EG+ K+ +G L+ LSEQ+L+DC +G
Sbjct: 128 LPASVDWRTKGAVTPIKNQGQCGSCWAFSAVAATEGVVKLSTGKLVSLSEQELVDCDVHG 187
Query: 196 -NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQK-PAAAKISNYEEVPSGD 253
+ GC+GG + AF +II+N G+ TE YPY C + + AA I YE+VP+ D
Sbjct: 188 VDQGCMGGWMDDAFKFIIKNGGLTTEANYPYTGEDDKCKSNETVNVAATIKGYEDVPAND 247
Query: 254 EQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWL 313
E AL+KAV+ QPVS+ + FQ Y G+ G CG ++DH + +G+G T +G YWL
Sbjct: 248 ESALMKAVAHQPVSVVVDGGDMTFQLYAGGVMTGSCGVEMDHGIAAIGYGATSNGTKYWL 307
Query: 314 IKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
+KNSWG TWG+ G++++ +D G+CG+ + SYP
Sbjct: 308 MKNSWGTTWGEKGFLRMAKDIPDKRGMCGLAMKPSYP 344
>gi|147790682|emb|CAN61026.1| hypothetical protein VITISV_001146 [Vitis vinifera]
Length = 469
Score = 323 bits (827), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 163/354 (46%), Positives = 231/354 (65%), Gaps = 23/354 (6%)
Query: 14 NTTPMFIIITLLVSCAS------QVVSSRSTH--------EQSVVEIHEKWMAQHGRSYK 59
+++ M + + LL+ AS ++ TH ++ V+ ++E W+A+HG+SY
Sbjct: 6 SSSSMAVFLFLLLGLASASAXDMSIIGYDETHGDKSSWRTDEDVMAVYEAWLAKHGKSYN 65
Query: 60 DELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHR 119
EKE R +IFK+NL +I++ N E NRTYK+G N+F+DLTN+E+R++Y G + + R
Sbjct: 66 ALGEKERRFQIFKDNLRFIDEHNAE-NRTYKVGLNRFADLTNEEYRSMYLGTRTAA-KRR 123
Query: 120 STTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGN 179
S+ + +Y +P S+DWR KGAV +K+Q CG CWAF+ +AAVEGI KI +G
Sbjct: 124 SSNKISDRYAFRVGDSLPESVDWRKKGAVVEVKDQGSCGSCWAFSTIAAVEGINKIVTGG 183
Query: 180 LIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA 239
LI LSEQ+L+DC T+ N GC GG + AF +II N GI +E++YPY+A G C +K A
Sbjct: 184 LISLSEQELVDCDTSYNEGCNGGLMDYAFEFIINNGGIDSEEDYPYKASDGRCDQYRKNA 243
Query: 240 -AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVT 298
I YE+VP DE++L KAV+ QPVS+AI A EFQ Y+ GIF G CGT LDH VT
Sbjct: 244 XVVTIDGYEDVPENDEKSLEKAVANQPVSVAIEAGGREFQLYQSGIFTGRCGTALDHGVT 303
Query: 299 IVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-----EGLCGIGTRSSYPL 347
VG+G TE+G +YW++KNSWG +WG+ GY+++ RD G CGI +SYP+
Sbjct: 304 AVGYG-TENGVDYWIVKNSWGASWGEEGYIRMERDLATSATGKCGIAMEASYPI 356
>gi|356515040|ref|XP_003526209.1| PREDICTED: thiol protease SEN102-like [Glycine max]
Length = 342
Score = 323 bits (827), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 166/339 (48%), Positives = 223/339 (65%), Gaps = 14/339 (4%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
M + L SQV+ R H+ ++ E HE WMA++G+ YKD EKE R +IFK+N+E+
Sbjct: 10 MLALFLFLAVGISQVMP-RKLHQTALRERHENWMAEYGKMYKDAAEKEKRFQIFKDNVEF 68
Query: 78 IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTS-STFKYQNLSMTDV 136
IE N GN+ YKLG N +DLT +EF+ G K +T + FKY+N+ TD+
Sbjct: 69 IESFNAAGNKPYKLGVNHLADLTLEEFKDSRNGLKRTYEFSTTTFKLNGFKYENV--TDI 126
Query: 137 PTSLDWRDKGAVTPIKNQ-KECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG 195
P ++DWR KGAVTPIK+Q +CG CWAF+ +AA EGI +I +GNL+ LSEQ+L+DC +
Sbjct: 127 PEAIDWRVKGAVTPIKDQGDQCGSCWAFSTIAATEGIHQISTGNLVSLSEQELVDCDSV- 185
Query: 196 NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSA--AQKPAAAKISNYEEVPSGD 253
++GC GG E F +II+N GI +E YPY+ V GTC+ A P A+I YE VPS
Sbjct: 186 DDGCEGGFMEDGFEFIIKNGGITSETNYPYKGVDGTCNTTIAASP-VAQIKGYEIVPSYS 244
Query: 254 EQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWL 313
E+AL KAV+ QPVS++I A + F Y GI+NG CGT LDH VT VG+G TE+G +YW+
Sbjct: 245 EEALQKAVANQPVSVSIHATNATFMFYSSGIYNGECGTDLDHGVTAVGYG-TENGTDYWI 303
Query: 314 IKNSWGNTWGDAGYMKIVR----DEGLCGIGTRSSYPLA 348
+KNSWG WG+ GY+++ R G+CGI SSYP A
Sbjct: 304 VKNSWGTQWGEKGYIRMHRGIAAKHGICGIALDSSYPTA 342
>gi|357160569|ref|XP_003578807.1| PREDICTED: vignain-like [Brachypodium distachyon]
Length = 339
Score = 322 bits (825), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 153/341 (44%), Positives = 214/341 (62%), Gaps = 9/341 (2%)
Query: 13 INTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFK 72
I + I+ L C+S + + + S+V HE WM Q+GR YKD EK + ++FK
Sbjct: 3 IPKASLLAILGCLCFCSSVLAARELNDDLSMVARHESWMLQYGRVYKDAAEKASKFEVFK 62
Query: 73 ENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
N +I+ N GN + LG NQF+D+TN EF+A T S R+ T F Y+N+S
Sbjct: 63 ANAGFIDSFNA-GNHKFWLGINQFADITNKEFKATKTNKGFISNKVRAPTG--FSYENVS 119
Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
+P S+DWR KGAVTP+K+Q +CGCCWAF+AVAA EGI K+ +G L+ LSEQ+L+DC
Sbjct: 120 FDALPASIDWRTKGAVTPVKDQGQCGCCWAFSAVAATEGIVKLSTGKLVSLSEQELVDCD 179
Query: 193 TNG-NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPS 251
+G + GC GG + AF +II N G+ E YPY A G C + K +A I +YE+VP+
Sbjct: 180 VHGEDQGCEGGLMDDAFKFIISNGGLTQESSYPYDAEDGKCKSGSK-SAGTIKSYEDVPA 238
Query: 252 GDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANY 311
+E AL+KAV+ QPVS+A+ FQ Y G+ G CGT LDH + +G+G T DG Y
Sbjct: 239 NNEGALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYGVTSDGTKY 298
Query: 312 WLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
WL+KNSWG +WG+ G++++ +D +G+CG+ SYP A
Sbjct: 299 WLMKNSWGTSWGENGFLRMEKDIADKKGMCGLAMEPSYPTA 339
>gi|60100207|gb|AAX13273.1| putative cysteine protease [Oryza sativa Japonica Group]
Length = 349
Score = 322 bits (825), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 156/316 (49%), Positives = 215/316 (68%), Gaps = 10/316 (3%)
Query: 42 SVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNR-TYKLGTNQFSDLT 100
++ + HE+WMA+HGR+Y D+ EK RL++F++N+ +IE N ++ + L NQF+DLT
Sbjct: 35 AMAQRHERWMAKHGRAYADDAEKARRLEVFRDNVAFIESVNAAASQHKFWLEENQFADLT 94
Query: 101 NDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCC 160
N EFRA TG + PS S + ++F+Y N+S D+P S+DWR KGAV P+K+Q +CGCC
Sbjct: 95 NAEFRATRTGLR-PSSSRGNRAPTSFRYANVSTGDLPASVDWRGKGAVNPVKDQGDCGCC 153
Query: 161 WAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFAYIIQNQGIAT 219
WAF+AVAA+EG K+ +G L+ LSEQQL+ C G + GC GG + AF +II+N G+A
Sbjct: 154 WAFSAVAAMEGAVKLATGKLVSLSEQQLVSCDVKGEDQGCEGGLMDDAFDFIIKNGGLAA 213
Query: 220 EDEYPYQAVPGTC-SAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQ 278
E +YPY A C +A AAA I YE+VP+ DE ALLKAV+ QPVS+AI FQ
Sbjct: 214 ESDYPYTASDDKCATAGAGAAAATIKGYEDVPANDEAALLKAVANQPVSVAIDGGDRHFQ 273
Query: 279 SYKEGIFNGV--CGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR---- 332
YK G+ +G C T+LDHA+T VG+G DG YWL+KNSWG +WG+ GY+++ R
Sbjct: 274 FYKGGVLSGAAGCATELDHAITAVGYGVASDGTKYWLMKNSWGTSWGEDGYVRMERGVAD 333
Query: 333 DEGLCGIGTRSSYPLA 348
EG+CG+ +SYP A
Sbjct: 334 KEGVCGLAMMASYPTA 349
>gi|357458909|ref|XP_003599735.1| Cysteine proteinase [Medicago truncatula]
gi|357474677|ref|XP_003607623.1| Cysteine proteinase [Medicago truncatula]
gi|355488783|gb|AES69986.1| Cysteine proteinase [Medicago truncatula]
gi|355508678|gb|AES89820.1| Cysteine proteinase [Medicago truncatula]
Length = 342
Score = 322 bits (825), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 164/338 (48%), Positives = 215/338 (63%), Gaps = 11/338 (3%)
Query: 17 PMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLE 76
PMF+I T + V+SSR E + HEKWM Q G+SYKD EKE R +IFK N+E
Sbjct: 10 PMFLIFTTWM--LPYVMSSR-VLEPYLSNKHEKWMTQFGKSYKDAAEKEKRFQIFKNNVE 66
Query: 77 YIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTG-YKMPSPSHRSTTSSTFKYQNLSMTD 135
+IE N GN+ + L N F+DLTN+EF+A G K+ +++F+Y N+ T
Sbjct: 67 FIELFNAVGNKPFNLSINHFADLTNEEFKASLNGNKKLHDKFDILNETTSFRYHNV--TS 124
Query: 136 VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG 195
VP S+DWR +GAVTPIKNQ CG CWAF+ VA++EGI +I +G L+ LSEQ+L+DC
Sbjct: 125 VPASMDWRKRGAVTPIKNQGSCGSCWAFSTVASIEGIHQITTGELVSLSEQELIDCVRGN 184
Query: 196 NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDE 254
++GC GG E AF +I + G+A+E YPY+ C ++ A+I YE+VPS E
Sbjct: 185 SSGCSGGYLEDAFKFIAKKGGMASETNYPYKETDEKCKFKKESKHVAEIKGYEKVPSNSE 244
Query: 255 QALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLI 314
LLKAV+ QPVS+ + A FQ Y GIF G CGT DH VTIVG+G + D YWL+
Sbjct: 245 NDLLKAVANQPVSVYVDAGDYVFQFYSGGIFTGKCGTDTDHVVTIVGYGVSLDYTEYWLV 304
Query: 315 KNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
KNSWG WG+ GYMK+ R+ +GLCGI T SYP+A
Sbjct: 305 KNSWGTGWGEKGYMKLKRNVDSKKGLCGIATNPSYPVA 342
>gi|146216004|gb|ABQ10204.1| cysteine protease Cp6 [Actinidia deliciosa]
Length = 461
Score = 322 bits (824), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 158/326 (48%), Positives = 225/326 (69%), Gaps = 8/326 (2%)
Query: 26 VSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEG 85
+S ++ SSR+ E V+ ++E W+ +HG+SY EKE R +IFK+NL +I++ N E
Sbjct: 27 MSIIGELSSSRTDDE--VMAMYESWLVKHGKSYNAIGEKEKRFQIFKDNLRFIDEHNAE- 83
Query: 86 NRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDK 145
+RTYK+G N+F+DLTNDE+R++Y G + S ST + +Y ++ +P S+DWR+K
Sbjct: 84 SRTYKVGLNRFADLTNDEYRSMYLGARTGSRRRLSTQKRSDRYVPVAGESLPDSVDWREK 143
Query: 146 GAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSRE 205
GAV +K+Q CG CWAF+ +AAVEGI +I +G+LI LSEQ+L+DC T+ N GC GG +
Sbjct: 144 GAVVGVKDQGSCGSCWAFSTIAAVEGINQIVTGDLISLSEQELVDCDTSYNEGCNGGLMD 203
Query: 206 KAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQ 264
AF +II+N GI TE++YPY A G C +K A I +YE+VP +EQAL KAV+ Q
Sbjct: 204 YAFEFIIKNGGIDTEEDYPYNARDGRCDQYRKNAKVVTIDDYEDVPVNNEQALQKAVANQ 263
Query: 265 PVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGD 324
PVS+AI A FQ Y+ G+F G CGT LDH VT VG+G TE+ +YW++KNSWG++WG+
Sbjct: 264 PVSVAIEASGMAFQFYESGVFTGNCGTALDHGVTAVGYG-TENSVDYWIVKNSWGSSWGE 322
Query: 325 AGYMKIVRDEGL---CGIGTRSSYPL 347
+GY+++ R+ G CGI SYP+
Sbjct: 323 SGYIRMERNTGATGKCGIAVEPSYPI 348
>gi|356517310|ref|XP_003527331.1| PREDICTED: vignain-like [Glycine max]
Length = 342
Score = 322 bits (824), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 156/336 (46%), Positives = 221/336 (65%), Gaps = 8/336 (2%)
Query: 19 FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
++I+ L+++ + V SR E E HEKWMAQ+GR YKD EKE R ++FK N+ +I
Sbjct: 9 YLILFLVLAVWTSHVMSRRLSEACTSERHEKWMAQYGRVYKDAAEKEKRFQVFKNNVHFI 68
Query: 79 EKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPT 138
E N G++ + L NQF+DL ++EF+AL + + ++T ++F+Y+ S+T +P
Sbjct: 69 ESFNAAGDKPFNLSINQFADLNDEEFKALLINVQKKASWVETSTETSFRYE--SVTKIPA 126
Query: 139 SLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNG 198
++D R +GAVTPIK+Q CG CWAF+AVAA EGI +I +G L+ LSEQ+L+DC + G
Sbjct: 127 TIDRRKRGAVTPIKDQGRCGSCWAFSAVAATEGIHQITTGKLVPLSEQELVDCVKGESEG 186
Query: 199 CLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKP-AAAKISNYEEVPSGDEQAL 257
C+GG + AF +I + GIA+E YPY+ V TC ++ A+I YE+VPS +E+AL
Sbjct: 187 CIGGYVDDAFEFIAKKGGIASETHYPYKGVNKTCKVKKETHGVAEIKGYEKVPSNNEKAL 246
Query: 258 LKAVSMQPVSIAIAAYSTEFQSYKEGIFNGV-CGTQLDHAVTIVGFGTTEDGANYWLIKN 316
LKAV+ QPVS+ I A + F+ Y GIFN CGT +HAV +VG+G D + YWL+KN
Sbjct: 247 LKAVANQPVSVYIDAGTHAFKYYSSGIFNARNCGTDPNHAVAVVGYGKALDDSKYWLVKN 306
Query: 317 SWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
SWG WG+ GY++I RD EGLCGI YP+A
Sbjct: 307 SWGTEWGERGYIRIKRDIRAKEGLCGIAKYPYYPIA 342
>gi|21070926|gb|AAM34401.1|AF377947_7 putative cysteine proteinase [Oryza sativa Japonica Group]
gi|31712050|gb|AAP68356.1| putative cysteine protease [Oryza sativa Japonica Group]
gi|40538988|gb|AAR87245.1| putative cysteine protease [Oryza sativa Japonica Group]
gi|108711126|gb|ABF98921.1| Papain family cysteine protease containing protein, expressed
[Oryza sativa Japonica Group]
gi|125545747|gb|EAY91886.1| hypothetical protein OsI_13535 [Oryza sativa Indica Group]
Length = 350
Score = 322 bits (824), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 159/310 (51%), Positives = 210/310 (67%), Gaps = 10/310 (3%)
Query: 47 HEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN----KEGNRTYKLGTNQFSDLTND 102
HEKWMA+HG++YKDE EK RL++F+ N + I+ N K+G ++L TN+F+DLT+D
Sbjct: 42 HEKWMAKHGKTYKDEEEKARRLEVFRANAKLIDSFNAAAEKDGGGGHRLATNRFADLTDD 101
Query: 103 EFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWA 162
EFRA TGY+ P P+ + F Y+N S+ P S+DWR GAVT +K+Q CGCCWA
Sbjct: 102 EFRAARTGYQRP-PAAVAGAGGGFLYENFSLAAAPQSMDWRAMGAVTGVKDQGSCGCCWA 160
Query: 163 FAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFAYIIQNQGIATED 221
F+AVAAVEG+ KIR+G L+ LSEQ+L+DC G + GC GG + AF YI + G+A E
Sbjct: 161 FSAVAAVEGLAKIRTGQLVSLSEQELVDCDVRGEDQGCEGGLMDTAFQYIARRGGLAAES 220
Query: 222 EYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYK 281
YPY+ V G C AA AAA I +++VPS DE AL+ AV+ QPVS+AI F+ Y
Sbjct: 221 SYPYRGVDGACRAAAGRAAASIRGFQDVPSNDEGALMAAVARQPVSVAINGAGYVFRFYD 280
Query: 282 EGIFNGV-CGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD---EGLC 337
G+ G CGT+L+HAVT VG+GT DG YWL+KNSWG +WG+ GY++I R EG C
Sbjct: 281 RGVLGGAGCGTELNHAVTAVGYGTASDGTGYWLMKNSWGASWGEGGYVRIRRGVGREGAC 340
Query: 338 GIGTRSSYPL 347
GI +SYP+
Sbjct: 341 GIAQMASYPV 350
>gi|356515038|ref|XP_003526208.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 339
Score = 322 bits (824), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 154/333 (46%), Positives = 219/333 (65%), Gaps = 9/333 (2%)
Query: 19 FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
++I+ L+++ + V SR E E HEKWMAQ+G+ Y D EKE R +IFK N+++I
Sbjct: 9 YLILFLILTVWTFHVMSRRLSEVCTSERHEKWMAQYGKLYTDAAEKEKRFQIFKNNVQFI 68
Query: 79 EKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPT 138
E N G++ + L NQF+DL N+EF+A + + T ++F+Y+ S+T +P
Sbjct: 69 ESFNAAGDKPFNLSINQFADLHNEEFKASLINVQKKESGVETATETSFRYE--SITKIPV 126
Query: 139 SLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNG 198
++DWR +GAVTPIK+Q CG CWAF+ VAA+EGI +I +G L+ LSEQ+L+DC + G
Sbjct: 127 TMDWRKRGAVTPIKDQGNCGSCWAFSIVAAIEGIHQITTGKLVSLSEQELVDCVKGKSEG 186
Query: 199 CLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKP-AAAKISNYEEVPSGDEQAL 257
C G +E+AF ++ +N G+A+E YPY+A TC ++ A+I YE VPS E+AL
Sbjct: 187 CNFGYKEEAFEFVAKNGGLASEISYPYKANNKTCMVKKETQGVAQIKGYENVPSNSEKAL 246
Query: 258 LKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNS 317
LKAV+ QPVS+ I A + +F Y GIF G CGT +HA T++G+G GA YWL+KNS
Sbjct: 247 LKAVANQPVSVYIDAGALQF--YSSGIFTGKCGTAPNHAATVIGYGKARGGAKYWLVKNS 304
Query: 318 WGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
WG WG+ GY+++ RD EGLCGI T +SYP
Sbjct: 305 WGTKWGEKGYIRMKRDIRAKEGLCGIATNASYP 337
>gi|357160591|ref|XP_003578813.1| PREDICTED: vignain-like [Brachypodium distachyon]
Length = 339
Score = 321 bits (823), Expect = 3e-85, Method: Compositional matrix adjust.
Identities = 151/341 (44%), Positives = 218/341 (63%), Gaps = 9/341 (2%)
Query: 13 INTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFK 72
I + I+ L AS + + + S+V HE WM+Q+GRSYKD EK+ + ++FK
Sbjct: 3 IPNASLLAILGCLCFFASGLAARELNDDLSMVARHESWMSQYGRSYKDAAEKDRKFEVFK 62
Query: 73 ENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
N +I+ N + N + LG NQF+D+TN+EF+ T S R++T F Y+N+S
Sbjct: 63 ANAAFIDSFNAK-NHKFWLGINQFADITNEEFKVTKTNKGFISNKVRASTG--FSYENVS 119
Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
+ +P ++DWR KGAVTP+K+Q +CGCCWAF+AVAA EGI K+ +G L+ LSEQ+L+DC
Sbjct: 120 IDALPATIDWRTKGAVTPVKDQGQCGCCWAFSAVAATEGIVKLSTGKLVSLSEQELVDCD 179
Query: 193 TNG-NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPS 251
+G + GC GG + AF +II N G+ E YPY A G C + K +A I +YE+VP+
Sbjct: 180 VHGEDQGCEGGLMDDAFKFIITNGGLTQESSYPYDAEDGKCKSGSK-SAGTIKSYEDVPA 238
Query: 252 GDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANY 311
+E AL+KAV+ QPVS+A+ FQ Y G+ G CGT LDH + +G+G T DG Y
Sbjct: 239 NNEGALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYGVTSDGTKY 298
Query: 312 WLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
WL+KNSWG +WG+ G++++ +D +G+CG+ SYP A
Sbjct: 299 WLMKNSWGTSWGENGFLRMEKDIADKKGMCGLAMEPSYPTA 339
>gi|38346007|emb|CAD40110.2| OSJNBa0035O13.9 [Oryza sativa Japonica Group]
gi|125589429|gb|EAZ29779.1| hypothetical protein OsJ_13837 [Oryza sativa Japonica Group]
Length = 314
Score = 321 bits (822), Expect = 4e-85, Method: Compositional matrix adjust.
Identities = 156/315 (49%), Positives = 214/315 (67%), Gaps = 10/315 (3%)
Query: 43 VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNR-TYKLGTNQFSDLTN 101
+ + HE+WMA+HGR+Y D+ EK RL++F++N+ +IE N ++ + L NQF+DLTN
Sbjct: 1 MAQRHERWMAKHGRAYADDAEKARRLEVFRDNVAFIESVNAAASQHKFWLEENQFADLTN 60
Query: 102 DEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCW 161
EFRA TG + PS S + ++F+Y N+S D+P S+DWR KGAV P+K+Q +CGCCW
Sbjct: 61 AEFRATRTGLR-PSSSRGNRAPTSFRYANVSTGDLPASVDWRGKGAVNPVKDQGDCGCCW 119
Query: 162 AFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFAYIIQNQGIATE 220
AF+AVAA+EG K+ +G L+ LSEQQL+ C G + GC GG + AF +II+N G+A E
Sbjct: 120 AFSAVAAMEGAVKLATGKLVSLSEQQLVSCDVKGEDQGCEGGLMDDAFDFIIKNGGLAAE 179
Query: 221 DEYPYQAVPGTC-SAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQS 279
+YPY A C +A AAA I YE+VP+ DE ALLKAV+ QPVS+AI FQ
Sbjct: 180 SDYPYTASDDKCATAGAGAAAATIKGYEDVPANDEAALLKAVANQPVSVAIDGGDRHFQF 239
Query: 280 YKEGIFNGV--CGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR----D 333
YK G+ +G C T+LDHA+T VG+G DG YWL+KNSWG +WG+ GY+++ R
Sbjct: 240 YKGGVLSGAAGCATELDHAITAVGYGVASDGTKYWLMKNSWGTSWGEDGYVRMERGVADK 299
Query: 334 EGLCGIGTRSSYPLA 348
EG+CG+ +SYP A
Sbjct: 300 EGVCGLAMMASYPTA 314
>gi|297819566|ref|XP_002877666.1| hypothetical protein ARALYDRAFT_906213 [Arabidopsis lyrata subsp.
lyrata]
gi|297323504|gb|EFH53925.1| hypothetical protein ARALYDRAFT_906213 [Arabidopsis lyrata subsp.
lyrata]
Length = 304
Score = 320 bits (820), Expect = 6e-85, Method: Compositional matrix adjust.
Identities = 164/315 (52%), Positives = 213/315 (67%), Gaps = 27/315 (8%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
E S +E HE+WM++ R Y D+ EK R +IFK+NL+++E N N TYKL N+FSDL
Sbjct: 11 EASAIEKHEQWMSRFNRVYSDDSEKTSRFEIFKKNLKFVESFNMNTNNTYKLDVNKFSDL 70
Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGC 159
T++EF+A Y G + S + +F+Y+N+S T S+DWR +GAVTP+K+Q +CGC
Sbjct: 71 TDEEFQARYMGLVPEGMTGDSQKTVSFRYENVSETG--ESMDWRLEGAVTPVKDQGQCGC 128
Query: 160 CWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNN-GCLGGSREKAFAYIIQNQGIA 218
CWAFAAVAAVEG+TKI +G L+ LSEQQL+DCST NN GC GG A+ YI +NQGI
Sbjct: 129 CWAFAAVAAVEGVTKIANGELVSLSEQQLVDCSTANNNMGCDGGLALTAYDYIKENQGIT 188
Query: 219 TEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQ 278
+E+ YPYQAV TC + PAAA IS YE VP DE+ALLKAVS
Sbjct: 189 SEENYPYQAVQQTCKSTD-PAAATISGYEAVPKDDEEALLKAVS---------------- 231
Query: 279 SYKEGIF-NGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD---- 333
+ GIF + CGT HAVTIVG+GT+E+G YWL+KNSWG +WG+ GYM+I RD
Sbjct: 232 --QHGIFEDEYCGTDSHHAVTIVGYGTSEEGIKYWLLKNSWGESWGENGYMRIKRDVDEP 289
Query: 334 EGLCGIGTRSSYPLA 348
+G+CG+ R+ YP+A
Sbjct: 290 QGMCGLAHRAYYPVA 304
>gi|218202087|gb|EEC84514.1| hypothetical protein OsI_31214 [Oryza sativa Indica Group]
Length = 348
Score = 320 bits (820), Expect = 6e-85, Method: Compositional matrix adjust.
Identities = 149/327 (45%), Positives = 212/327 (64%), Gaps = 9/327 (2%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
+F I+ L C++ + + + + ++ HE+WMAQ+GR YKD+ EK R ++FK N +
Sbjct: 8 LFAILGCLCLCSAVLAARELSDDAAMAARHERWMAQYGRMYKDDAEKARRFEVFKANAAF 67
Query: 78 IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
IE N GN + LG NQF+DLTNDEFR T + R T F+Y+N+++ +P
Sbjct: 68 IESFNA-GNHKFWLGVNQFADLTNDEFRLTKTNKGFIPSTTRVPTG--FRYENVNIDALP 124
Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-N 196
++DWR KG VTPIK+Q +CGCCWAF+AVAA+EGI K+ +G LI LSEQ+L+DC +G +
Sbjct: 125 ATMDWRTKGVVTPIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLISLSEQELVDCDVHGED 184
Query: 197 NGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQA 256
GC GG + AF +II+N G+ TE YPY A C + + A I YE+VP+ +E A
Sbjct: 185 QGCEGGLMDDAFKFIIKNGGLTTESNYPYAAADDKCKSVSN-SVASIKGYEDVPANNEAA 243
Query: 257 LLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKN 316
L+KAV+ QPVS+A+ FQ YK G+ G CGT LDH + +G+G DG YWL+KN
Sbjct: 244 LMKAVANQPVSVAVDGDDMTFQFYKGGVMIGSCGTDLDHGIVAIGYGKASDGTKYWLLKN 303
Query: 317 SWGNTWGDAGYMKIVRD----EGLCGI 339
SWG TWG+ G++++ +D G+CG+
Sbjct: 304 SWGMTWGENGFLRMEKDISDKRGMCGL 330
>gi|116787404|gb|ABK24495.1| unknown [Picea sitchensis]
gi|224286306|gb|ACN40861.1| unknown [Picea sitchensis]
Length = 452
Score = 320 bits (820), Expect = 6e-85, Method: Compositional matrix adjust.
Identities = 153/323 (47%), Positives = 222/323 (68%), Gaps = 10/323 (3%)
Query: 32 VVSSRSTHEQ-SVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYK 90
++SS+ E +++E++E W+A+H R+Y EK+ R +FK+N YI + N +GNR+YK
Sbjct: 26 IISSKDLREDDAIMELYELWLAEHKRAYNGLDEKQKRFSVFKDNFLYIHEHN-QGNRSYK 84
Query: 91 LGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTP 150
LG NQF+DL+++EF+A Y G K+ + S S +YQ D+P S+DWR+KGAVT
Sbjct: 85 LGLNQFADLSHEEFKATYLGAKLDTKKRLSRPPSR-RYQYSDGEDLPESIDWREKGAVTS 143
Query: 151 IKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAY 210
+K+Q CG CWAF+ VAAVEGI +I +G+LI LSEQ+L+DC T+ N GC GG + AF +
Sbjct: 144 VKDQGSCGSCWAFSTVAAVEGINQIVTGDLISLSEQELVDCDTSYNQGCNGGLMDYAFEF 203
Query: 211 IIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIA 269
II N G+ +E++YPY A G+C + +K A I +YE+VP DE++L KA + QP+S+A
Sbjct: 204 IINNGGLDSEEDYPYTAYDGSCDSYRKNAHVVTIDDYEDVPENDEKSLKKAAANQPISVA 263
Query: 270 IAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMK 329
I A EFQ Y G+F CGTQLDH VT+VG+G +E G +YW +KNSWG +WG+ G+++
Sbjct: 264 IEASGREFQFYDSGVFTSTCGTQLDHGVTLVGYG-SESGTDYWTVKNSWGKSWGEEGFIR 322
Query: 330 IVRD-----EGLCGIGTRSSYPL 347
+ R+ G+CGI +SYP+
Sbjct: 323 LQRNIEVASTGMCGIAMEASYPV 345
>gi|125547258|gb|EAY93080.1| hypothetical protein OsI_14881 [Oryza sativa Indica Group]
Length = 314
Score = 320 bits (820), Expect = 6e-85, Method: Compositional matrix adjust.
Identities = 156/315 (49%), Positives = 214/315 (67%), Gaps = 10/315 (3%)
Query: 43 VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNR-TYKLGTNQFSDLTN 101
+ + HE+WMA+HGR+Y D+ EK RL++F++N+ +IE N ++ + L NQF+DLTN
Sbjct: 1 MAQRHERWMAKHGRAYADDAEKVRRLEVFRDNVAFIESVNAAASQHKFWLEENQFADLTN 60
Query: 102 DEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCW 161
EFRA TG + PS S + ++F+Y N+S D+P S+DWR KGAV P+K+Q +CGCCW
Sbjct: 61 AEFRATRTGLR-PSSSRGNRAPTSFRYANVSTGDLPASVDWRGKGAVNPVKDQGDCGCCW 119
Query: 162 AFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFAYIIQNQGIATE 220
AF+AVAA+EG K+ +G L+ LSEQQL+ C G + GC GG + AF +II+N G+A E
Sbjct: 120 AFSAVAAMEGAVKLATGKLVSLSEQQLVSCDVKGEDQGCEGGLMDDAFDFIIKNGGLAAE 179
Query: 221 DEYPYQAVPGTC-SAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQS 279
+YPY A C +A AAA I YE+VP+ DE ALLKAV+ QPVS+AI FQ
Sbjct: 180 SDYPYTASDDKCATAGAGAAAATIKGYEDVPANDEAALLKAVANQPVSVAIDGGDRHFQF 239
Query: 280 YKEGIFNGV--CGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR----D 333
YK G+ +G C T+LDHA+T VG+G DG YWL+KNSWG +WG+ GY+++ R
Sbjct: 240 YKGGVLSGAAGCATELDHAITAVGYGVASDGTKYWLMKNSWGTSWGEDGYVRMERGVADK 299
Query: 334 EGLCGIGTRSSYPLA 348
EG+CG+ +SYP A
Sbjct: 300 EGVCGLAMMASYPTA 314
>gi|302143415|emb|CBI21976.3| unnamed protein product [Vitis vinifera]
Length = 322
Score = 320 bits (819), Expect = 8e-85, Method: Compositional matrix adjust.
Identities = 166/336 (49%), Positives = 226/336 (67%), Gaps = 35/336 (10%)
Query: 21 IITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEK 80
++ +L + ASQ ++R+ HE S+ E HE WMAQ+GR YKD EK R KIFK+N+ IE
Sbjct: 14 LLFVLAAWASQA-TARNLHEASMYERHEDWMAQYGRVYKDADEKSKRYKIFKDNVARIES 72
Query: 81 ANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHR-STTSSTFKYQNLSMTDVPTS 139
NK +++YKL N+F+DLTN+EF +K +H ST +++FKY+N+ T VP++
Sbjct: 73 FNKAMDKSYKLSINEFADLTNEEFGTSRNRFK----AHICSTEATSFKYENV--TAVPST 126
Query: 140 LDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNG 198
+DWR KGAVTPIK+Q +CG CWAF+AVAA+EGIT++ +G LI LSEQ+L+DC T+G + G
Sbjct: 127 IDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGEDQG 186
Query: 199 CLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCS--AAQKPAAAKISNYEEVPSGDEQA 256
C G + YPY GTC+ A P AAKI+ YE+VP+ +E+A
Sbjct: 187 CNGAN-------------------YPYAGTDGTCNRKKAAHP-AAKINGYEDVPANNEKA 226
Query: 257 LLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKN 316
L KAV QP+++AI A EFQ Y G+F G CGT+LDH V VG+GT++DG YWL+KN
Sbjct: 227 LQKAVVHQPIAVAIDAGGFEFQFYSSGVFTGQCGTELDHGVAAVGYGTSDDGMKYWLVKN 286
Query: 317 SWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
SWG WG+ GY+++ RD EGLCGI ++SYP A
Sbjct: 287 SWGTGWGEEGYIRMQRDVTAKEGLCGIAMQASYPTA 322
>gi|171702841|dbj|BAG16376.1| cysteine protease [Brassica rapa var. perviridis]
Length = 333
Score = 319 bits (818), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 157/329 (47%), Positives = 220/329 (66%), Gaps = 8/329 (2%)
Query: 18 MFIIITLLVSCASQVVSSRST-HEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLE 76
+F+I++L+ S + + SR E ++ + H +WM +HGR Y D EK R +FK N+E
Sbjct: 2 IFLIVSLVSSFSLSITLSRPLLDEVAMQKRHAEWMTEHGRVYADANEKNNRYAVFKRNVE 61
Query: 77 YIEKANK-EGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD 135
IE+ N + T+KL NQF+DLTN+EFR++YTG+K S T ++F+YQN+S
Sbjct: 62 RIERLNDVQSGLTFKLAVNQFADLTNEEFRSMYTGFKGNSVLSSRTKPTSFRYQNVSSDA 121
Query: 136 VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG 195
+P S+DWR KGAVTPIK+Q CG CWAF+AVAA+EG+ +I+ G LI LSEQ+L+DC TN
Sbjct: 122 LPVSVDWRKKGAVTPIKDQGLCGSCWAFSAVAAIEGVAQIKKGKLISLSEQELVDCDTN- 180
Query: 196 NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQ-KPAAAKISNYEEVPSGDE 254
+ GC+GG + AF Y I G+ +E YPY++ GTC+ + K A I +E+VP+ DE
Sbjct: 181 DGGCMGGLMDTAFNYTITIGGLTSESNYPYKSTNGTCNFNKTKQIATSIKGFEDVPANDE 240
Query: 255 QALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLI 314
+AL+KAV+ PVSI IA FQ Y G+F+G C T LDH VT VG+G +++G YW++
Sbjct: 241 KALMKAVAHHPVSIGIAGGDIGFQFYSSGVFSGECTTHLDHGVTAVGYGRSKNGLKYWIL 300
Query: 315 KNSWGNTWGDAGYMKIVRD----EGLCGI 339
KNSWG WG+ GYM+I +D G CG+
Sbjct: 301 KNSWGPKWGERGYMRIKKDIKPKHGQCGL 329
>gi|414588007|tpg|DAA38578.1| TPA: hypothetical protein ZEAMMB73_159244 [Zea mays]
Length = 307
Score = 319 bits (818), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 147/309 (47%), Positives = 207/309 (66%), Gaps = 9/309 (2%)
Query: 43 VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTND 102
+ E HE+WMA++ R YKD EK R ++FK+N ++E N + + LG NQF+DLT +
Sbjct: 1 MAERHERWMAEYDRVYKDAAEKARRFEVFKDNFAFVESFNADKKNKFWLGVNQFADLTTE 60
Query: 103 EFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWA 162
EF+A G+K S TT FKY+NLS++ +PT++DWR KGAVTPIKNQ +CGCCWA
Sbjct: 61 EFKA-NKGFKPISAEEVPTTG--FKYENLSVSALPTAVDWRTKGAVTPIKNQGQCGCCWA 117
Query: 163 FAAVAAVEGITKIRSGNLIQLSEQQLLDCST-NGNNGCLGGSREKAFAYIIQNQGIATED 221
F+A+AA+EGI K+ +GNL+ LSEQ+ +DC T N + GC GG + AF ++I+N G+ATE
Sbjct: 118 FSAIAAMEGIVKLSTGNLVSLSEQEPVDCDTHNMDEGCEGGWMDNAFEFVIKNGGLATES 177
Query: 222 EYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYK 281
YPY+ V G C K +AA I +E+VP +E AL+K V+ QPVS+A+ A F Y
Sbjct: 178 SYPYKVVDGKCKGGSK-SAATIKGHEDVPPNNEAALMKVVASQPVSVAVDASDRTFMLYS 236
Query: 282 EGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLC 337
G+ G CGTQLDH + +G+G D YW++KNSWG TWG+ G++++ +D G+C
Sbjct: 237 GGVMTGSCGTQLDHGIAAIGYGVESDDTKYWILKNSWGTTWGEKGFLRMEKDISDKRGMC 296
Query: 338 GIGTRSSYP 346
+ + SYP
Sbjct: 297 DLAMKPSYP 305
>gi|194352750|emb|CAQ00103.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
gi|326514262|dbj|BAJ92281.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326519402|dbj|BAJ96700.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326524351|dbj|BAK00559.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326531998|dbj|BAK01375.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 356
Score = 319 bits (817), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 153/313 (48%), Positives = 218/313 (69%), Gaps = 10/313 (3%)
Query: 41 QSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLT 100
+ +VE+ EKW+A+H ++Y EK R ++FK+NL++I+K N+E +Y LG N+F+DLT
Sbjct: 43 ERLVELFEKWLAKHQKAYASFEEKLHRFEVFKDNLKHIDKINRE-VTSYWLGLNEFADLT 101
Query: 101 NDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCC 160
+DEF+A Y G + + R +S +F+Y+++S +D+P S+DWR KGAVT +KNQ +CG C
Sbjct: 102 HDEFKAAYLG--LDAAPARRGSSRSFRYEDVSASDLPKSVDWRKKGAVTEVKNQGQCGSC 159
Query: 161 WAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATE 220
WAF+ VAAVEGI I +GNL LSEQ+L+DCS +GN+GC GG + AF+YI + G+ TE
Sbjct: 160 WAFSTVAAVEGINAIVTGNLTALSEQELIDCSVDGNSGCNGGLMDYAFSYIASSGGLHTE 219
Query: 221 DEYPYQAVPGTCSAAQKP--AAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQ 278
+ YPY G+C +K A IS YE+VP+ DEQAL+KA++ QPVS+AI A FQ
Sbjct: 220 EAYPYLMEEGSCGDGKKAESEAVTISGYEDVPANDEQALIKALAHQPVSVAIEASGRHFQ 279
Query: 279 SYKEGIFNGVCGTQLDHAVTIVGFGTTE-DGANYWLIKNSWGNTWGDAGYMKIVR----D 333
Y G+F+G CG QLDH V VG+G+ + G +Y +++NSWG WG+ GY+++ R
Sbjct: 280 FYSGGVFDGPCGAQLDHGVAAVGYGSDKGKGHDYIIVRNSWGAQWGEKGYIRMKRGTSNG 339
Query: 334 EGLCGIGTRSSYP 346
EGLCGI +SYP
Sbjct: 340 EGLCGINKMASYP 352
>gi|242086591|ref|XP_002439128.1| hypothetical protein SORBIDRAFT_09g000960 [Sorghum bicolor]
gi|241944413|gb|EES17558.1| hypothetical protein SORBIDRAFT_09g000960 [Sorghum bicolor]
Length = 371
Score = 319 bits (817), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 154/315 (48%), Positives = 216/315 (68%), Gaps = 10/315 (3%)
Query: 38 THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
H ++++ E+W+A++ ++Y EK R ++FK+NL +I++ANK+ TY LG N F+
Sbjct: 57 VHHDRLIKLFEEWVAKYRKAYASFEEKLHRFEVFKDNLHHIDEANKKVT-TYWLGLNAFA 115
Query: 98 DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKEC 157
DLT+DEF+A Y G + P + TT S F+Y ++ DVP S+DWR KGAVT +KNQ +C
Sbjct: 116 DLTHDEFKATYLGLRQPET--KKTTDSRFRYGGVADDDVPASVDWRKKGAVTDVKNQGQC 173
Query: 158 GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
G CWAF+ VAAVEGI +I +GNL LSEQ+L+DCST+GNNGC GG + AF+YI + G+
Sbjct: 174 GSCWAFSTVAAVEGINQIVTGNLTSLSEQELVDCSTDGNNGCNGGVMDNAFSYIASSGGL 233
Query: 218 ATEDEYPYQAVPGTCS--AAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYST 275
TE+ YPY G C A IS YE+VP+ DEQAL+KA++ QP+S+AI A
Sbjct: 234 RTEEAYPYLMEEGDCDDKARDGEQVVTISGYEDVPANDEQALVKALAHQPLSVAIEASGR 293
Query: 276 EFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-- 333
FQ Y G+FNG CG++LDH V VG+G+++ G +Y ++KNSWG+ WG+ GY+++ R
Sbjct: 294 HFQFYSGGVFNGPCGSELDHGVAAVGYGSSK-GQDYIIVKNSWGSHWGEKGYIRMKRGTG 352
Query: 334 --EGLCGIGTRSSYP 346
EGLCGI +SYP
Sbjct: 353 KPEGLCGINKMASYP 367
>gi|356543114|ref|XP_003540008.1| PREDICTED: LOW QUALITY PROTEIN: KDEL-tailed cysteine endopeptidase
CEP1-like [Glycine max]
Length = 343
Score = 318 bits (816), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 163/331 (49%), Positives = 222/331 (67%), Gaps = 17/331 (5%)
Query: 28 CASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNR 87
C SQV SR H+ S+ E HE+WM ++G+ YKD E + R IF+ N+E+IE N GN+
Sbjct: 20 CTSQV-KSRKLHDASMYERHEQWMEKYGKVYKDSAEMQKRFLIFENNVEFIESFNAAGNK 78
Query: 88 TYKLGTNQFSDLTNDEFRALYTGYKMPSPSH----RSTTSSTFKYQNLSMTDVPTSLDWR 143
YKL N +D TN+EF A + GYK SH R TT + FKY+N+ TD+P ++DWR
Sbjct: 79 PYKLSINHLADQTNEEFMASHKGYK---GSHWQGLRITTQTPFKYENV--TDIPWAVDWR 133
Query: 144 DKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGS 203
KG VT IK+Q +CG CWAF+AVAA EGI +I +GNL+ LSE++L+DC + ++GC GG
Sbjct: 134 QKGDVTSIKDQAQCGNCWAFSAVAATEGIYQITTGNLVSLSEKELVDCDSV-DHGCDGGL 192
Query: 204 REKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVS 262
E F +II+N GI++E YPY AV GTC ++ + A+I+ YE VP E+ L KAV+
Sbjct: 193 MEHGFEFIIKNGGISSEANYPYTAVNGTCDTNKEASPVAQITGYETVPVNCEEELQKAVA 252
Query: 263 MQ-PVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNT 321
Q +S++I A + FQ Y G+F G CGTQLDH VT VG+G+T+ G YW++KNSWG
Sbjct: 253 NQLTMSVSIDAGGSAFQFYPSGVFTGQCGTQLDHGVTAVGYGSTDYGTQYWIVKNSWGTQ 312
Query: 322 WGDAGYMKIVR----DEGLCGIGTRSSYPLA 348
WG+ GY++++R EGLCGI +SYP A
Sbjct: 313 WGEEGYIRMLRGIDAQEGLCGIAMDASYPTA 343
>gi|356515046|ref|XP_003526212.1| PREDICTED: thiol protease SEN102-like [Glycine max]
Length = 342
Score = 318 bits (815), Expect = 3e-84, Method: Compositional matrix adjust.
Identities = 165/339 (48%), Positives = 222/339 (65%), Gaps = 14/339 (4%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
M + L SQV+ R H+ ++ E HE WMA++G+ YKD EKE R +IFK+N+E+
Sbjct: 10 MLALFLFLAVGISQVMP-RKLHQTALRERHENWMAEYGKMYKDAAEKEKRFQIFKDNVEF 68
Query: 78 IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTS-STFKYQNLSMTDV 136
IE N GN+ YKLG N +DLT +EF+ G K +T + FKY+N+ TD+
Sbjct: 69 IESFNAAGNKPYKLGVNHLADLTLEEFKDSRNGLKRTYEFSTTTFKLNGFKYENV--TDI 126
Query: 137 PTSLDWRDKGAVTPIKNQ-KECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG 195
P ++DWR KGAVTPIK+Q +CG WAF+ +AA EGI +I +GNL+ LSEQ+L+DC +
Sbjct: 127 PEAIDWRVKGAVTPIKDQGDQCGRFWAFSTIAATEGIHQISTGNLVSLSEQELVDCDSV- 185
Query: 196 NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSA--AQKPAAAKISNYEEVPSGD 253
++GC GG E F +II+N GI +E YPY+ V GTC+ A P A+I YE VPS
Sbjct: 186 DDGCEGGFMEDGFEFIIKNGGITSETNYPYKGVDGTCNTTIAASP-VAQIKGYEIVPSYS 244
Query: 254 EQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWL 313
E+AL KAV+ QPVS++I A + F Y GI+NG CGT LDH VT VG+G TE+G +YW+
Sbjct: 245 EEALKKAVANQPVSVSIHATNATFMFYSSGIYNGECGTDLDHGVTAVGYG-TENGTDYWI 303
Query: 314 IKNSWGNTWGDAGYMKIVR----DEGLCGIGTRSSYPLA 348
+KNSWG WG+ GY+++ R G+CGI SSYP A
Sbjct: 304 VKNSWGTQWGEKGYIRMHRGIAAKHGICGIALDSSYPTA 342
>gi|297598407|ref|NP_001045533.2| Os01g0971400 [Oryza sativa Japonica Group]
gi|15289977|dbj|BAB63672.1| putative cysteine protease CP1 [Oryza sativa Japonica Group]
gi|125529282|gb|EAY77396.1| hypothetical protein OsI_05384 [Oryza sativa Indica Group]
gi|125573472|gb|EAZ14987.1| hypothetical protein OsJ_04922 [Oryza sativa Japonica Group]
gi|215740756|dbj|BAG97412.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215741010|dbj|BAG97505.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215765325|dbj|BAG87022.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215767338|dbj|BAG99566.1| unnamed protein product [Oryza sativa Japonica Group]
gi|255674119|dbj|BAF07447.2| Os01g0971400 [Oryza sativa Japonica Group]
Length = 365
Score = 317 bits (813), Expect = 4e-84, Method: Compositional matrix adjust.
Identities = 154/322 (47%), Positives = 218/322 (67%), Gaps = 15/322 (4%)
Query: 37 STHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQF 96
++HE+ ++E+ EK+MA++ ++Y EK R ++FK+NL +I++ NK+ Y LG N+F
Sbjct: 43 ASHER-LMELFEKFMAKYRKAYSSLEEKLRRFEVFKDNLNHIDEENKK-ITGYWLGLNEF 100
Query: 97 SDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKE 156
+DLT+DEF+A Y G + +P+ R++ F+Y+ + +P +DWR KGAVT +KNQ +
Sbjct: 101 ADLTHDEFKAAYLGLTL-TPARRNSNDQLFRYEEVEAASLPKEVDWRKKGAVTEVKNQGQ 159
Query: 157 CGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQG 216
CG CWAF+ VAAVEGI I +GNL +LSEQ+L+DC T+GNNGC GG + AF+YI N G
Sbjct: 160 CGSCWAFSTVAAVEGINAIVTGNLTRLSEQELIDCDTDGNNGCSGGLMDYAFSYIAANGG 219
Query: 217 IATEDEYPYQAVPGTC--------SAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSI 268
+ TE+ YPY GTC + AA IS YE+VP +EQALLKA++ QPVS+
Sbjct: 220 LHTEESYPYLMEEGTCRRGSTEGDDDGEAAAAVTISGYEDVPRNNEQALLKALAHQPVSV 279
Query: 269 AIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYM 328
AI A FQ Y G+F+G CGT+LDH VT VG+GT G +Y ++KNSWG+ WG+ GY+
Sbjct: 280 AIEASGRNFQFYSGGVFDGPCGTRLDHGVTAVGYGTASKGHDYIIVKNSWGSHWGEKGYI 339
Query: 329 KIVR----DEGLCGIGTRSSYP 346
++ R +GLCGI +SYP
Sbjct: 340 RMRRGTGKHDGLCGINKMASYP 361
>gi|255563136|ref|XP_002522572.1| cysteine protease, putative [Ricinus communis]
gi|223538263|gb|EEF39872.1| cysteine protease, putative [Ricinus communis]
Length = 340
Score = 317 bits (812), Expect = 5e-84, Method: Compositional matrix adjust.
Identities = 174/339 (51%), Positives = 222/339 (65%), Gaps = 15/339 (4%)
Query: 16 TPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENL 75
T + I++ +LV+ SQ + E +V E HE+WMA+HGR+Y+D+ EKE R IFK+NL
Sbjct: 7 TKLAIVLMILVTWVSQAMPRPLIDEDAVAEKHEQWMARHGRTYQDDEEKERRFHIFKKNL 66
Query: 76 EYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPS--PSHRSTTSSTFKYQNLSM 133
++IE N NRTYKLG N F+DLT++EF A YTGYKMP P+ TT +T L
Sbjct: 67 KHIENFNNAFNRTYKLGLNHFADLTDEEFLATYTGYKMPKVLPTANITTKTTQSSDVLYE 126
Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST 193
+VP S+DWR +G VTP+KNQ CGCCWAF+A AAVEGI GN + LS QQLLDC
Sbjct: 127 ANVPESIDWRTRGVVTPVKNQGRCGCCWAFSAAAAVEGII----GNGVSLSAQQLLDCVP 182
Query: 194 NGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGD 253
+ +NGC GG + AF YIIQNQG+A+ YPYQ + C + AA+IS Y +V D
Sbjct: 183 D-SNGCNGGFMDNAFRYIIQNQGLASATYYPYQLMREMCRPSNN--AARISGYVDVTPAD 239
Query: 254 EQALLKAVSMQPVSIAIAAYS-TEFQSYKEGIFNGV-CGTQLDHAVTIVGFGTTEDGANY 311
E+ L AV+ QPVS A+ A S F+ Y GIF CG+ L HA+TIVG+GT+ +G Y
Sbjct: 240 EETLKSAVARQPVSAAVDATSELNFKYYGGGIFPPQDCGSTLTHAITIVGYGTSAEGTKY 299
Query: 312 WLIKNSWGNTWGDAGYMKIVRDE----GLCGIGTRSSYP 346
WLIKNSWG WG+ GYM++ RD G CGI R+SYP
Sbjct: 300 WLIKNSWGEGWGEGGYMRLQRDVGSYGGACGIALRASYP 338
>gi|356543118|ref|XP_003540010.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 339
Score = 317 bits (811), Expect = 6e-84, Method: Compositional matrix adjust.
Identities = 163/339 (48%), Positives = 223/339 (65%), Gaps = 17/339 (5%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQS--VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENL 75
+ ++ LL C SQV+S R+ HE S + E HE+W ++G+ YKD EK+ RL IFK+N+
Sbjct: 10 ILALVLLLPICISQVMS-RNLHEASXCMSERHEQWTKKYGKVYKDAAEKQKRLLIFKDNV 68
Query: 76 EYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSST-FKYQNLSMT 134
E+IE N GN+ YKL N +D TN+EF A + GYK H+ + S T FKY+N+ T
Sbjct: 69 EFIESFNAAGNKPYKLSINHLTDQTNEEFVASHNGYK-----HKGSHSQTPFKYENI--T 121
Query: 135 DVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN 194
VP ++DWR+ GAV +K+Q +CG CWAF+ VA EGI +I + L+ LSEQ+L+DC +
Sbjct: 122 GVPNAVDWRENGAVXAMKDQGQCGNCWAFSTVATTEGIYQITTSMLMSLSEQELVDCDSV 181
Query: 195 GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGD 253
++GC GG E F +I +N GI++E YPY AV GT A ++ + AA+I YE VP+
Sbjct: 182 -DHGCDGGYMEGGFEFIXKNGGISSEANYPYTAVDGTYDANKEASPAAQIKGYETVPANS 240
Query: 254 EQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWL 313
E AL KAV+ QPVS+ I + FQ G+F G CGTQLDH VT VG+G+T+DG YW+
Sbjct: 241 EDALQKAVANQPVSVTIDVGGSAFQFNSSGVFTGQCGTQLDHGVTAVGYGSTDDGTQYWI 300
Query: 314 IKNSWGNTWGDAGYMKIVR----DEGLCGIGTRSSYPLA 348
+KNSWG WG+ GY+++ R EGLCGI +SYP A
Sbjct: 301 VKNSWGTQWGEEGYIRMQRGTDAQEGLCGIAMDASYPTA 339
>gi|363814535|ref|NP_001242660.1| uncharacterized protein LOC100807362 precursor [Glycine max]
gi|255636658|gb|ACU18666.1| unknown [Glycine max]
Length = 367
Score = 317 bits (811), Expect = 8e-84, Method: Compositional matrix adjust.
Identities = 161/344 (46%), Positives = 221/344 (64%), Gaps = 18/344 (5%)
Query: 18 MFIIITLLVSCASQVVSSRSTH--------EQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+F ++ + + ++S +H ++ V+ I+E+W+ +HG+ Y EKE R +
Sbjct: 15 LFTVLAVSSALDMSIISYDRSHADKSGWKSDEEVMSIYEEWLVKHGKVYNAVEEKEKRFQ 74
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQ 129
IFK+NL +IE+ N NRTYK+G N+FSDL+N+E+R+ Y G K+ PS R + +Y
Sbjct: 75 IFKDNLNFIEEHNAV-NRTYKVGLNRFSDLSNEEYRSKYLGTKI-DPS-RMMARPSRRYS 131
Query: 130 NLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLL 189
++P S+DWR +GAV +KNQ EC CWAF+A+AAVEGI KI +GNL LSEQ+LL
Sbjct: 132 PRVADNLPESVDWRKEGAVVRVKNQSECEGCWAFSAIAAVEGINKIVTGNLTALSEQELL 191
Query: 190 DCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEE 248
DC N GC GG + AF +II N GI TE++YP+Q G C + A A I YE
Sbjct: 192 DCDRTVNAGCSGGLVDYAFEFIINNGGIDTEEDYPFQGADGICDQYKINARAVTIDGYER 251
Query: 249 VPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDG 308
VP+ DE AL KAV+ QPVS+AI AY EFQ Y+ GIF G CGT +DH VT VG+G TE+G
Sbjct: 252 VPAYDELALKKAVANQPVSVAIEAYGKEFQLYESGIFTGTCGTSIDHGVTAVGYG-TENG 310
Query: 309 ANYWLIKNSWGNTWGDAGYMKIVRD-----EGLCGIGTRSSYPL 347
+YW++KNSWG WG+AGY+ + R+ G CGI + YP+
Sbjct: 311 IDYWIVKNSWGENWGEAGYVGMERNIAEDTAGKCGIAILTLYPI 354
>gi|356545116|ref|XP_003540991.1| PREDICTED: vignain-like [Glycine max]
Length = 342
Score = 316 bits (809), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 158/346 (45%), Positives = 220/346 (63%), Gaps = 11/346 (3%)
Query: 8 SGSFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMR 67
S S K N +F+++T+ S QV+S R + S V+ HEKWMAQ+G+ YKD EKE R
Sbjct: 3 SFSQKKNILVVFLVLTVWTS---QVMSRRLSEAYSSVK-HEKWMAQYGKVYKDAAEKEKR 58
Query: 68 LKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFK 127
+IFK N+ +IE + G++ + L NQF+DL +F+AL + + R+ T++
Sbjct: 59 FQIFKNNVHFIESFHAAGDKPFNLSINQFADL--HKFKALLINGQKKEHNVRTATATEAS 116
Query: 128 YQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQ 187
++ S+T +P+SLDWR +GAVTPIK+Q C CWAF+ VA +EG+ +I G L+ LSEQ+
Sbjct: 117 FKYDSVTRIPSSLDWRKRGAVTPIKDQGTCRSCWAFSTVATIEGLHQITKGELVSLSEQE 176
Query: 188 LLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKP-AAAKISNY 246
L+DC + GC GG E AF +I + G+A+E YPY+ V TC ++ +I Y
Sbjct: 177 LVDCVKGDSEGCYGGYVEDAFEFIAKKGGVASETHYPYKGVNKTCKVKKETHGVVQIKGY 236
Query: 247 EEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTE 306
E+VPS E+ALLKAV+ QPVS + A FQ Y GIF G CGT +DH+VT+VG+G
Sbjct: 237 EQVPSNSEKALLKAVAHQPVSAYVEAGGYAFQFYSSGIFTGKCGTDIDHSVTVVGYGKAR 296
Query: 307 DGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
G YWL+KNSWG WG+ GY+++ RD EGLCGI T + YP A
Sbjct: 297 GGNKYWLVKNSWGTEWGEKGYIRMKRDIRAKEGLCGIATGALYPTA 342
>gi|115468686|ref|NP_001057942.1| Os06g0582600 [Oryza sativa Japonica Group]
gi|55296512|dbj|BAD68726.1| putative cysteine proteinase [Oryza sativa Japonica Group]
gi|113595982|dbj|BAF19856.1| Os06g0582600 [Oryza sativa Japonica Group]
gi|215695236|dbj|BAG90427.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 357
Score = 316 bits (809), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 170/357 (47%), Positives = 224/357 (62%), Gaps = 20/357 (5%)
Query: 8 SGSFKINTTPMFIII----TLLVSCASQVVSSRSTHEQSVV-EIHEKWMAQHGRSYKDEL 62
S SF + + II+ T LV A + ++ + S + E +EKW A HGR+YKD L
Sbjct: 5 SSSFSLAAILLIIIMYCCPTGLVEAARKGPAAAGGGDDSAMRERYEKWAADHGRTYKDSL 64
Query: 63 EKEMRLKIFKENLEYIEKANKEGNR-TYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRST 121
EK R ++F+ N +I+ N G + + +L TN+F+DLTN+EF A Y G +P
Sbjct: 65 EKARRFEVFRTNALFIDSFNAAGGKKSPRLTTNKFADLTNEEF-AEYYGRPFSTPV---I 120
Query: 122 TSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLI 181
S F Y N+ +DVP +++WRD+GAVT +KNQK+C CWAF+AVAAVEGI +IRS NL+
Sbjct: 121 GGSGFMYGNVRTSDVPANINWRDRGAVTQVKNQKDCASCWAFSAVAAVEGIHQIRSHNLV 180
Query: 182 QLSEQQLLDCSTNGNN-GCLGGSREKAFAYIIQNQGIATEDEYPYQ-AVPGTCSAAQKPA 239
LS QQLLDCST NN GC G ++AF YI N GIA E +YPY+ GTC A+ KP
Sbjct: 181 ALSTQQLLDCSTGRNNHGCNRGDMDEAFRYITSNGGIAAESDYPYEDRALGTCRASGKPV 240
Query: 240 AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIF----NGVCGTQLDH 295
AA I ++ VP +E ALL AV+ QPVS+A+ Q + G+F N C T L+H
Sbjct: 241 AASIRGFQYVPPNNETALLLAVAHQPVSVALDGVGKVSQFFSSGVFGAMQNETCTTDLNH 300
Query: 296 AVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
A+T VG+GT E G YWL+KNSWG WG+ GYMKI RD GLCG+ + SYP+A
Sbjct: 301 AMTAVGYGTDEHGTKYWLMKNSWGTDWGEGGYMKIARDVASNTGLCGLAMQPSYPVA 357
>gi|242072390|ref|XP_002446131.1| hypothetical protein SORBIDRAFT_06g002140 [Sorghum bicolor]
gi|241937314|gb|EES10459.1| hypothetical protein SORBIDRAFT_06g002140 [Sorghum bicolor]
Length = 328
Score = 315 bits (808), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 151/338 (44%), Positives = 220/338 (65%), Gaps = 20/338 (5%)
Query: 15 TTPMFIIITL-LVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKE 73
T+ F++ L S S V+++R + ++VE HE WM ++GR YKD EK R ++FK+
Sbjct: 3 TSKAFLLAILGCASLCSSVLAARELSDAAMVERHENWMVEYGRVYKDAAEKARRFQVFKD 62
Query: 74 NLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
N+ ++E N N + LG NQF+DLT +EF+A G+K P+ ++ FKY+NLS+
Sbjct: 63 NVAFVESFNTNKNNKFWLGVNQFADLTTEEFKA-NKGFK---PTAEKVPTTGFKYENLSV 118
Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST 193
+ +PT++DWR KGAVTPIKNQ +C AA+EGI K+ +GNLI LSEQ+L+DC T
Sbjct: 119 SALPTAVDWRTKGAVTPIKNQGQC---------AAMEGIVKLSTGNLISLSEQELVDCDT 169
Query: 194 NG-NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSG 252
+ + GC GG + AF ++I+N G+ATE YPY+AV G C K +AA I +E+VP
Sbjct: 170 HSMDEGCEGGWMDSAFEFVIKNGGLATESNYPYKAVDGKCKGGSK-SAATIKGHEDVPVN 228
Query: 253 DEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYW 312
+E AL+KAV+ QPVS+A+ A F Y G+ G CGT+LDH + +G+G DG YW
Sbjct: 229 NEAALMKAVANQPVSVAVDASDRTFMLYSGGVMTGSCGTELDHGIAAIGYGMESDGTKYW 288
Query: 313 LIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
++KNSWG TWG+ G++++ +D G+CG+ + SYP
Sbjct: 289 ILKNSWGTTWGEKGFLRMEKDITDKRGMCGLAMKPSYP 326
>gi|356564154|ref|XP_003550321.1| PREDICTED: cysteine proteinase RD21a-like [Glycine max]
Length = 476
Score = 315 bits (807), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 159/364 (43%), Positives = 226/364 (62%), Gaps = 27/364 (7%)
Query: 9 GSFKINTTP----------MFIIITLLVSCASQVVSSRSTH---------EQSVVEIHEK 49
GS I T+P +F + + + ++S S H E+ ++ ++E+
Sbjct: 2 GSSSITTSPATMTMAAIVLLFTVFAVSSALDMSIISYDSAHADKAATLRTEEELMSMYEQ 61
Query: 50 WMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYT 109
W+ +HG+ Y EKE R +IFK+NL +I+ N +RTYKLG N+F+DLTN+E+RA Y
Sbjct: 62 WLVKHGKVYNALGEKEKRFQIFKDNLRFIDDHNSAEDRTYKLGLNRFADLTNEEYRAKYL 121
Query: 110 GYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAV 169
G K+ P+ R + + +Y +P S+DWR +GAV P+K+Q CG CWAF+A+ AV
Sbjct: 122 GTKI-DPNRRLGKTPSNRYAPRVGDKLPDSVDWRKEGAVPPVKDQGGCGSCWAFSAIGAV 180
Query: 170 EGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVP 229
EGI KI +G LI LSEQ+L+DC T N GC GG + AF +II N GI ++++YPY+ V
Sbjct: 181 EGINKIVTGELISLSEQELVDCDTGYNQGCNGGLMDYAFEFIINNGGIDSDEDYPYRGVD 240
Query: 230 GTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGV 288
G C +K A I +YE+VP+ DE AL KAV+ QPVS+AI EFQ Y G+F G
Sbjct: 241 GRCDTYRKNAKVVSIDDYEDVPAYDELALKKAVANQPVSVAIEGGGREFQLYVSGVFTGR 300
Query: 289 CGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-----EGLCGIGTRS 343
CGT LDH V VG+GT + G +YW+++NSWG++WG+ GY+++ R+ G CGI
Sbjct: 301 CGTALDHGVVAVGYGTAK-GHDYWIVRNSWGSSWGEDGYIRLERNLANSRSGKCGIAIEP 359
Query: 344 SYPL 347
SYPL
Sbjct: 360 SYPL 363
>gi|242038089|ref|XP_002466439.1| hypothetical protein SORBIDRAFT_01g007820 [Sorghum bicolor]
gi|241920293|gb|EER93437.1| hypothetical protein SORBIDRAFT_01g007820 [Sorghum bicolor]
Length = 353
Score = 315 bits (807), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 154/311 (49%), Positives = 211/311 (67%), Gaps = 6/311 (1%)
Query: 43 VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTND 102
+V HEKWMA+HGR+Y DE EK RL+IF+ N E+I+ N G +++L TN+F+DLT++
Sbjct: 43 MVSRHEKWMAEHGRTYTDEAEKARRLEIFRANAEFIDSFNDAGKHSHRLATNRFADLTDE 102
Query: 103 EFRALYTGYKMPSPSHRSTTSST-FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCW 161
EFRA TG++ + S F+Y+N S+ D S+DWR GAVT +K+Q ECGCCW
Sbjct: 103 EFRAARTGFRPRPAPAAAAGSGGRFRYENFSLADAAQSVDWRAMGAVTGVKDQGECGCCW 162
Query: 162 AFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFAYIIQNQGIATE 220
AF+AVAAVEG+ KIR+G L+ LSEQ+L+DC NG + GC GG + AF +I + G+A+E
Sbjct: 163 AFSAVAAVEGLNKIRTGRLVSLSEQELVDCDVNGEDQGCEGGLMDDAFQFIERRGGLASE 222
Query: 221 DEYPYQAVPGTCSAAQKPAAAK-ISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQS 279
YPYQ G+C ++ A A I +E+VP +E AL AV+ QPVS+AI F+
Sbjct: 223 SGYPYQGDDGSCRSSAAAARAASIRGHEDVPRNNEAALAAAVANQPVSVAINGEDYAFRF 282
Query: 280 YKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKI---VRDEGL 336
Y G+ G CGT L+HA+T VG+GT DG+ YWL+KNSWG +WG+ GY++I VR EG+
Sbjct: 283 YDSGVLGGECGTDLNHAITAVGYGTAADGSKYWLMKNSWGTSWGEGGYVRIRRGVRGEGV 342
Query: 337 CGIGTRSSYPL 347
CG+ SYP+
Sbjct: 343 CGLAKLPSYPV 353
>gi|226504984|ref|NP_001151293.1| cysteine protease 1 precursor [Zea mays]
gi|195645596|gb|ACG42266.1| cysteine protease 1 precursor [Zea mays]
Length = 340
Score = 315 bits (806), Expect = 3e-83, Method: Compositional matrix adjust.
Identities = 156/338 (46%), Positives = 218/338 (64%), Gaps = 12/338 (3%)
Query: 19 FIIITLLVS----CASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKEN 74
F++ L+V C + + + ++ HEKWMA+HGR+YKDE EK RL++F+ N
Sbjct: 6 FLLAVLVVGSAVLCTAAAPRALAAAAAAMASRHEKWMAEHGRAYKDEAEKARRLEVFRAN 65
Query: 75 LEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYK-MPSPSHRSTTSSTFKYQNLSM 133
E I+ N G +++L TN+F+DLT EFRA TG + P+PS + F+Y+N S+
Sbjct: 66 AELIDSFNAAGTHSHRLATNRFADLTVQEFRAARTGLRPRPAPS---AGAGRFRYENFSL 122
Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST 193
D S+DWR GAVT +K+Q GCCWAF+AVAAVEG+ KIR+G L+ LSEQ+L+DC
Sbjct: 123 ADAAQSVDWRAMGAVTGVKDQGASGCCWAFSAVAAVEGLNKIRTGRLVSLSEQELVDCDV 182
Query: 194 NG-NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSG 252
+G + GC GG + AF ++ + G+A+E YPYQ G C ++ AAA I +E+VP
Sbjct: 183 SGVDQGCDGGLMDNAFQFVARRGGLASESGYPYQCRDGPCRSSAAAAAASIRGHEDVPRN 242
Query: 253 DEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYW 312
+E AL AV+ QPVS+AI F+ Y G+ G CGT L+HA+T VG+GT DG YW
Sbjct: 243 NEAALAAAVAHQPVSVAINGEDMAFRFYDSGVLGGACGTDLNHAITAVGYGTAADGTRYW 302
Query: 313 LIKNSWGNTWGDAGYMKI---VRDEGLCGIGTRSSYPL 347
L+KNSWG +WG+ GY++I VR EG+CG+ SYP+
Sbjct: 303 LMKNSWGASWGEGGYVRIRRGVRGEGVCGLAKLPSYPV 340
>gi|449500145|ref|XP_004161017.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
Length = 349
Score = 315 bits (806), Expect = 3e-83, Method: Compositional matrix adjust.
Identities = 158/335 (47%), Positives = 224/335 (66%), Gaps = 14/335 (4%)
Query: 20 IIITLLVSCASQVVSSRSTHEQSV---VEIHEKWMAQHGRSYKDELEKEMRLKIFKENLE 76
+ IT ++ +V H S+ +E+ E WM++H ++Y+ EK R +IF +NL+
Sbjct: 17 LFITYAIAHDFSIVGYSPEHLASMDKTIELFESWMSKHSKTYRSIEEKLHRFEIFLDNLK 76
Query: 77 YIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDV 136
+I++ NK+ + +Y LG N+F+DL+++EF++ Y G ++ P RS S F Y ++ D+
Sbjct: 77 HIDETNKKVS-SYWLGLNEFADLSHEEFKSKYLGLRVEFPRKRS--SRGFSYGDVE--DL 131
Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGN 196
P S+DWR KGAVTP+KNQ CG CWAF+ VAAVEGI +I +GNL LSEQ+L+DC + N
Sbjct: 132 PESVDWRTKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVTGNLTSLSEQELIDCDRSFN 191
Query: 197 NGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTC-SAAQKPAAAKISNYEEVPSGDEQ 255
NGC GG + AF YI+ N G+ E++YPY G C ++ IS YE+VP+ DEQ
Sbjct: 192 NGCYGGLMDYAFQYIMSNSGLRKEEDYPYLMEEGRCIREKEQFEVVTISGYEDVPANDEQ 251
Query: 256 ALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIK 315
+LLKA+S QPVS+AI A S FQ YK GIF G CGTQ+DH VT VG+G++E G +Y ++K
Sbjct: 252 SLLKALSHQPVSVAIEASSRNFQFYKGGIFTGRCGTQMDHGVTAVGYGSSE-GTDYIIVK 310
Query: 316 NSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
NSWG WG+ GY+++ R+ EGLCGI +SYP
Sbjct: 311 NSWGPKWGENGYIRMKRNTGKPEGLCGINQMASYP 345
>gi|413933049|gb|AFW67600.1| cysteine protease 1 [Zea mays]
Length = 341
Score = 314 bits (805), Expect = 3e-83, Method: Compositional matrix adjust.
Identities = 156/338 (46%), Positives = 219/338 (64%), Gaps = 11/338 (3%)
Query: 19 FIIITLLVS----CASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKEN 74
F++ L+V C + + + ++ HEKWMA+HGR+YKDE EK RL++F+ N
Sbjct: 6 FLLAVLVVGSAVLCTAAAPRALAAAAAAMASRHEKWMAEHGRAYKDEAEKARRLEVFRAN 65
Query: 75 LEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT 134
E I+ N G +++L TN+F+DLT +EFRA TG + P P+ S + F+Y+N S+
Sbjct: 66 AELIDSFNAAGTHSHRLATNRFADLTVEEFRAARTGLR-PRPAP-SAGAGRFRYENFSLA 123
Query: 135 DVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN 194
D S+DWR GAVT +K+Q CGCCWAF+AVAAVEG+ KIR+G L+ LSEQ+L+DC +
Sbjct: 124 DAAQSVDWRAMGAVTGVKDQGACGCCWAFSAVAAVEGLNKIRTGRLVSLSEQELVDCDVS 183
Query: 195 G-NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAK-ISNYEEVPSG 252
G + GC GG + AF ++ + G+A+E YPYQ G C ++ A A I +E+VP
Sbjct: 184 GVDQGCDGGLMDNAFQFVARRGGLASESGYPYQGRDGPCRSSAAAARAASIRGHEDVPRN 243
Query: 253 DEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYW 312
+E AL AV+ QPVS+AI F+ Y G+ G CGT L+HA+T VG+GT DG YW
Sbjct: 244 NEAALAAAVANQPVSVAINGEDMAFRFYDSGVLGGACGTDLNHAITAVGYGTANDGTRYW 303
Query: 313 LIKNSWGNTWGDAGYMKI---VRDEGLCGIGTRSSYPL 347
L+KNSWG +WG+ GY++I VR EG+CG+ SYP+
Sbjct: 304 LMKNSWGASWGEGGYVRIRRGVRGEGVCGLAKLPSYPV 341
>gi|4731374|gb|AAD28477.1|AF133839_1 papain-like cysteine protease [Sandersonia aurantiaca]
Length = 357
Score = 314 bits (805), Expect = 3e-83, Method: Compositional matrix adjust.
Identities = 155/343 (45%), Positives = 218/343 (63%), Gaps = 18/343 (5%)
Query: 17 PMFIIITLLV-SCASQVVSSRSTH-EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKEN 74
P+ +++ L S S + + E S+ ++E+W + H S +D +K+ R +FKEN
Sbjct: 6 PVLLVLALAFGSTLSIPIKEKDLESEDSLWSLYERWRSHHAVS-RDLDQKQKRFNVFKEN 64
Query: 75 LEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYK------MPSPSHRSTTSSTFKY 128
+++I + NK + T+KL N+F D+TN EFRA Y G K M H S + + F Y
Sbjct: 65 VKFIHEFNKNKDVTFKLALNKFGDMTNQEFRAKYAGSKVHHHRTMKGSRHGSGSGAKFMY 124
Query: 129 QNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQL 188
+N P S+DWR++GAV +KNQ +CG CWAF+A+AAVEGI +I + L+ LSEQ+L
Sbjct: 125 ENAV---APPSIDWRERGAVAAVKNQGQCGSCWAFSAIAAVEGINQIVTKELVPLSEQEL 181
Query: 189 LDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEE 248
+DC T+ N GC GG + AF +I N GI TED YPYQA TC + A I YE+
Sbjct: 182 IDCDTDQNQGCSGGLMDYAFEFIKNNGGITTEDVYPYQAEDATCK--KNSPAVVIDGYED 239
Query: 249 VPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDG 308
VP+ DE AL+KAV+ QPV++AI A FQ Y EG+F G CGT+LDH V +VG+GTT+DG
Sbjct: 240 VPTNDEDALMKAVANQPVAVAIEASGYVFQFYSEGVFTGRCGTELDHGVAVVGYGTTQDG 299
Query: 309 ANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
YW ++NSWG WG++GY+++ R GLCGI ++SYP+
Sbjct: 300 TKYWTVRNSWGADWGESGYVRMQRGIKATHGLCGIAMQASYPI 342
>gi|356553978|ref|XP_003545327.1| PREDICTED: cysteine proteinase RD21a-like [Glycine max]
Length = 496
Score = 314 bits (804), Expect = 4e-83, Method: Compositional matrix adjust.
Identities = 154/324 (47%), Positives = 214/324 (66%), Gaps = 10/324 (3%)
Query: 30 SQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTY 89
+ +SRS E ++ ++E+W+ +HG+ Y EKE R +IFK+NL +I+ N + +RTY
Sbjct: 64 AHAATSRSDEE--LMSMYEQWLVKHGKVYNALGEKEKRFQIFKDNLRFIDDHNSQEDRTY 121
Query: 90 KLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVT 149
KLG N+F+DLTN+E+RA Y G K+ P+ R + + +Y +P S+DWR +GAV
Sbjct: 122 KLGLNRFADLTNEEYRAKYLGTKI-DPNRRLGKTPSNRYAPRVGDKLPESVDWRKEGAVP 180
Query: 150 PIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFA 209
P+K+Q CG CWAF+A+ AVEGI KI +G LI LSEQ+L+DC T N GC GG + AF
Sbjct: 181 PVKDQGGCGSCWAFSAIGAVEGINKIVTGELISLSEQELVDCDTGYNEGCNGGLMDYAFE 240
Query: 210 YIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSI 268
+II N GI +E++YPY+ V G C +K A I +YE+VP+ DE AL KAV+ QPVS+
Sbjct: 241 FIINNGGIDSEEDYPYRGVDGRCDTYRKNAKVVSIDDYEDVPAYDELALKKAVANQPVSV 300
Query: 269 AIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYM 328
AI EFQ Y G+F G CGT LDH V VG+GT +G +YW+++NSWG +WG+ GY+
Sbjct: 301 AIEGGGREFQLYVSGVFTGRCGTALDHGVVAVGYGTA-NGHDYWIVRNSWGPSWGEDGYI 359
Query: 329 KIVRD-----EGLCGIGTRSSYPL 347
++ R+ G CGI SYPL
Sbjct: 360 RLERNLANSRSGKCGIAIEPSYPL 383
>gi|224056176|ref|XP_002298740.1| predicted protein [Populus trichocarpa]
gi|222845998|gb|EEE83545.1| predicted protein [Populus trichocarpa]
Length = 455
Score = 313 bits (803), Expect = 5e-83, Method: Compositional matrix adjust.
Identities = 150/313 (47%), Positives = 207/313 (66%), Gaps = 7/313 (2%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
E I+E W+ +HGR+Y EKE R +IFK+NL++I++ N GN +YKLG N+F+DL
Sbjct: 18 EAETRRIYEMWLVKHGRAYNALGEKERRFEIFKDNLKFIDEHNSVGNPSYKLGLNKFADL 77
Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGC 159
+NDE+R++Y G +M + +Y D+P ++DWR+KGAV P+K+Q +CG
Sbjct: 78 SNDEYRSVYLGTRMDGKGRLLGGPKSERYLFKEGDDLPETVDWREKGAVAPVKDQGQCGS 137
Query: 160 CWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIAT 219
CWAF+ V AVEGI +I +GNL LSEQ+L+DC N GC GG + AF +II+N GI T
Sbjct: 138 CWAFSTVGAVEGINQIVTGNLTSLSEQELVDCDKTYNLGCNGGLMDYAFDFIIENGGIDT 197
Query: 220 EDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQ 278
E++YPY+A+ C +K A I YE+VP DE++L KAV+ QPVS+AI A FQ
Sbjct: 198 EEDYPYKAIDSMCDPNRKNARVVTIDGYEDVPQNDEKSLKKAVANQPVSVAIEAGGRGFQ 257
Query: 279 SYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----- 333
Y+ G+F G CGTQLDH V VG+G TE G +YW+++NSWG WG+ GY+++ RD
Sbjct: 258 LYQSGVFTGSCGTQLDHGVVTVGYG-TEHGVDYWIVRNSWGPAWGENGYIRMERDVASTE 316
Query: 334 EGLCGIGTRSSYP 346
G CGI +SYP
Sbjct: 317 TGKCGIAMEASYP 329
>gi|357474523|ref|XP_003607546.1| Cysteine proteinase [Medicago truncatula]
gi|358347207|ref|XP_003637651.1| Cysteine proteinase [Medicago truncatula]
gi|355503586|gb|AES84789.1| Cysteine proteinase [Medicago truncatula]
gi|355508601|gb|AES89743.1| Cysteine proteinase [Medicago truncatula]
Length = 345
Score = 313 bits (803), Expect = 5e-83, Method: Compositional matrix adjust.
Identities = 168/351 (47%), Positives = 230/351 (65%), Gaps = 27/351 (7%)
Query: 13 INTTPMFIIITLLVSCASQVVSSRS----THEQ--SVVEIHEK----WMAQHGRSYKDEL 62
I TT +FI++ L +C V++S S TH+Q S VE +K W+ +HGR YK
Sbjct: 5 ILTTTIFILLMLCNTC---VIASESECPPTHKQKSSDVEAMKKRFDGWVKRHGRKYKHND 61
Query: 63 EKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTT 122
E+E+R I++ N++YI+ N + N +Y L N+F+DLTN+EF++ Y G SH
Sbjct: 62 EREVRFGIYQANVQYIQCKNAQKN-SYNLTDNKFADLTNEEFQSTYMGLSTRLRSH---- 116
Query: 123 SSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQ 182
++ F+Y D+P S DWR +GAVT I +Q +CG CWAFAAVAAVEGI KI+SG LI
Sbjct: 117 NTGFRYDEHG--DLPESKDWRKEGAVTEIMDQGQCGGCWAFAAVAAVEGINKIKSGKLIS 174
Query: 183 LSEQQLLDCST-NGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-A 240
LSEQ+L+DC +GN GC GG E A+ +II+N G+ TE +YPY+ V GTC + A
Sbjct: 175 LSEQELIDCDVKSGNQGCQGGLMETAYTFIIENGGLTTEQDYPYEGVDGTCKMEKAAHYA 234
Query: 241 AKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIV 300
A IS YEEVP+ +E L A + QPVS+AI A FQ Y EG+F+G+CG QL+H VT+V
Sbjct: 235 ASISGYEEVPADNEAKLKAAAAHQPVSVAIDAGGYSFQFYSEGVFSGICGKQLNHGVTVV 294
Query: 301 GFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
G+G E YW++KNSWG WG++GY+++ RD EG+CGI ++SYPL
Sbjct: 295 GYG-KETINKYWIVKNSWGADWGESGYIRMKRDTLSKEGMCGIAMQASYPL 344
>gi|449454309|ref|XP_004144898.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
gi|449471311|ref|XP_004153272.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
Length = 349
Score = 313 bits (803), Expect = 6e-83, Method: Compositional matrix adjust.
Identities = 158/335 (47%), Positives = 223/335 (66%), Gaps = 14/335 (4%)
Query: 20 IIITLLVSCASQVVSSRSTHEQSV---VEIHEKWMAQHGRSYKDELEKEMRLKIFKENLE 76
+ IT + +V H S+ +E+ E WM++H ++Y+ EK R +IF +NL+
Sbjct: 17 LFITYATAHDFSIVGYSPEHLASMDKTIELFESWMSKHSKAYRSIEEKLHRFEIFLDNLK 76
Query: 77 YIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDV 136
+I++ NK+ + +Y LG N+F+DL+++EF++ Y G ++ P RS S F Y ++ D+
Sbjct: 77 HIDETNKKVS-SYWLGLNEFADLSHEEFKSKYLGLRVEFPRKRS--SRGFSYGDVE--DL 131
Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGN 196
P S+DWR KGAVTP+KNQ CG CWAF+ VAAVEGI +I +GNL LSEQ+L+DC + N
Sbjct: 132 PESVDWRTKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVTGNLTSLSEQELIDCDRSFN 191
Query: 197 NGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTC-SAAQKPAAAKISNYEEVPSGDEQ 255
NGC GG + AF YI+ N G+ E++YPY G C ++ IS YE+VP+ DEQ
Sbjct: 192 NGCYGGLMDYAFQYIMSNSGLRKEEDYPYLMEEGRCIREKEQFEVVTISGYEDVPANDEQ 251
Query: 256 ALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIK 315
+LLKA+S QPVS+AI A S FQ YK GIF G CGTQ+DH VT VG+G++E G +Y ++K
Sbjct: 252 SLLKALSHQPVSVAIEASSRNFQFYKGGIFTGRCGTQMDHGVTAVGYGSSE-GTDYIIVK 310
Query: 316 NSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
NSWG WG+ GY+++ R+ EGLCGI +SYP
Sbjct: 311 NSWGPKWGENGYIRMKRNTGKPEGLCGINQMASYP 345
>gi|242070333|ref|XP_002450443.1| hypothetical protein SORBIDRAFT_05g005530 [Sorghum bicolor]
gi|241936286|gb|EES09431.1| hypothetical protein SORBIDRAFT_05g005530 [Sorghum bicolor]
Length = 351
Score = 313 bits (803), Expect = 7e-83, Method: Compositional matrix adjust.
Identities = 166/355 (46%), Positives = 231/355 (65%), Gaps = 22/355 (6%)
Query: 10 SFKINTTPM------FIIITLLVSCASQVVSSRSTH------EQSVVEIHEKWMAQHGRS 57
S+ N P+ + + + +C V++R E+++ HEKWM +HGR+
Sbjct: 3 SYIANNKPLITAAVALLTVLAIANCIGCAVAARDLSSSTGYGEEAMTARHEKWMVEHGRT 62
Query: 58 YKDELEKEMRLKIFKENLEYIEKANKE-GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSP 116
YKDE EK R ++FK N +++ +N G + Y L N+F+D+T+DEF A YTG+K P P
Sbjct: 63 YKDEAEKARRFQVFKANAAFVDTSNAAAGGKKYHLAINRFADMTHDEFMARYTGFK-PLP 121
Query: 117 SHRSTTSSTFKYQNLSMT-DVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKI 175
+ FKY N++++ + ++DWR KGAVT +KNQ++CGCCWAF+AVAA+EG+ +I
Sbjct: 122 ATGKKMPG-FKYANVTLSSEDQQAVDWRKKGAVTDVKNQQKCGCCWAFSAVAAIEGMHQI 180
Query: 176 RSGNLIQLSEQQLLDCST-NGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSA 234
+G L+ LSEQQL+DCST NNGC GG+ E AF Y+I N GIATE YPY A+ G C
Sbjct: 181 NTGELVSLSEQQLVDCSTNGNNNGCGGGTMEDAFQYVIGNNGIATEAAYPYTAMQGMCQN 240
Query: 235 AQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNG-VCGTQL 293
Q PA A + +Y++VP DE AL AV+ QPVS+A+ A FQ YK G+ CGT L
Sbjct: 241 VQ-PAVA-VRSYQQVPRDDEDALAAAVAGQPVSVAVDA--NNFQFYKGGVMTADSCGTNL 296
Query: 294 DHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDEGLCGIGTRSSYPLA 348
+HAVT VG+GT EDG YWL+KN WG+TWG+ GY+++ R G CG+ +SYP+A
Sbjct: 297 NHAVTAVGYGTAEDGTPYWLLKNQWGSTWGEEGYLRLQRGVGACGVAKDASYPVA 351
>gi|350538043|ref|NP_001234324.1| cysteine protease TDI-65 precursor [Solanum lycopersicum]
gi|5726641|gb|AAD48496.1|AF172856_1 cysteine protease TDI-65 [Solanum lycopersicum]
gi|2828252|emb|CAA05894.1| CYP1 [Solanum lycopersicum]
Length = 466
Score = 313 bits (802), Expect = 8e-83, Method: Compositional matrix adjust.
Identities = 153/351 (43%), Positives = 223/351 (63%), Gaps = 11/351 (3%)
Query: 7 RSGSFKINTTPMFIIITLLVSCASQVVSSRSTH-----EQSVVEIHEKWMAQHGRSYKDE 61
S + I+ M I TL + ++S TH + V ++E W+ +HG+SY
Sbjct: 4 HSSTLTISILLMLIFSTLSSASDMSIISYDETHIHRRTDDEVSALYESWLIEHGKSYNAL 63
Query: 62 LEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRST 121
EK+ R +IFK+NL YI++ N N++YKLG +F+DLTN+E+R++Y G K + +
Sbjct: 64 GEKDKRFQIFKDNLRYIDEQNSVPNQSYKLGLTKFADLTNEEYRSIYLGTKSSGDRKKLS 123
Query: 122 TSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLI 181
+ + +Y +P S+DWR+KG + +K+Q CG CWAF+AVAA+E I I +GNLI
Sbjct: 124 KNKSDRYLPKVGDSLPESIDWREKGVLVGVKDQGSCGSCWAFSAVAAMESINAIVTGNLI 183
Query: 182 QLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-A 240
LSEQ+L+DC + N GC GG + AF ++I+N GI TE++YPY+ G C +K A
Sbjct: 184 SLSEQELVDCDRSYNEGCDGGLMDYAFEFVIKNGGIDTEEDYPYKERNGVCDQYRKNAKV 243
Query: 241 AKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIV 300
KI +YE+VP +E+AL KAV+ QPVSIA+ A +FQ YK GIF G CGT +DH V I
Sbjct: 244 VKIDSYEDVPVNNEKALQKAVAHQPVSIALEAGGRDFQHYKSGIFTGKCGTAVDHGVVIA 303
Query: 301 GFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
G+G TE+G +YW+++NSWG WG+ GY+++ R+ GLCG+ SYP+
Sbjct: 304 GYG-TENGMDYWIVRNSWGANWGENGYLRVQRNVASSSGLCGLAIEPSYPV 353
>gi|115461667|ref|NP_001054433.1| Os05g0108600 [Oryza sativa Japonica Group]
gi|14719319|gb|AAK73137.1|AC079022_10 putative cysteine proteinase [Oryza sativa]
gi|33151125|gb|AAP97431.1| cysteine protease CP1 [Oryza sativa]
gi|52353572|gb|AAU44138.1| cysteine proteinase CP1 [Oryza sativa Japonica Group]
gi|113577984|dbj|BAF16347.1| Os05g0108600 [Oryza sativa Japonica Group]
gi|125550541|gb|EAY96250.1| hypothetical protein OsI_18148 [Oryza sativa Indica Group]
Length = 358
Score = 313 bits (801), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 154/317 (48%), Positives = 216/317 (68%), Gaps = 11/317 (3%)
Query: 37 STHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQF 96
++H++ ++E+ EKW+A++ ++Y EK R ++FK+NL +I+ NK+ +Y LG N+F
Sbjct: 42 ASHDR-LIELFEKWVAKYRKAYASFEEKVRRFEVFKDNLNHIDDINKK-VTSYWLGLNEF 99
Query: 97 SDLTNDEFRALYTGYKMPSPSHRST---TSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKN 153
+DLT+DEF+A Y G P P+ ++ +S F+Y +S +VP +DWR K AVT +KN
Sbjct: 100 ADLTHDEFKATYLGL-TPPPTRSNSKHYSSEEFRYGKMSNGEVPKEMDWRKKNAVTEVKN 158
Query: 154 QKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQ 213
Q +CG CWAF+ VAAVEGI I +GNL LSEQ+L+DCST+GNNGC GG + AF+YI
Sbjct: 159 QGQCGSCWAFSTVAAVEGINAIVTGNLTSLSEQELIDCSTDGNNGCNGGLMDYAFSYIAS 218
Query: 214 NQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAY 273
G+ TE+ YPY G C + A IS YE+VP+ DEQAL+KA++ QPVS+AI A
Sbjct: 219 TGGLRTEEAYPYAMEEGDCDEGKGAAVVTISGYEDVPANDEQALVKALAHQPVSVAIEAS 278
Query: 274 STEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR- 332
FQ Y G+F+G CG QLDH VT VG+GT++ G +Y ++KNSWG WG+ GY+++ R
Sbjct: 279 GRHFQFYSGGVFDGPCGEQLDHGVTAVGYGTSK-GQDYIIVKNSWGPHWGEKGYIRMKRG 337
Query: 333 ---DEGLCGIGTRSSYP 346
EGLCGI +SYP
Sbjct: 338 TGKGEGLCGINKMASYP 354
>gi|224133760|ref|XP_002321654.1| predicted protein [Populus trichocarpa]
gi|222868650|gb|EEF05781.1| predicted protein [Populus trichocarpa]
Length = 362
Score = 313 bits (801), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 155/341 (45%), Positives = 216/341 (63%), Gaps = 14/341 (4%)
Query: 19 FIIITLLVSCASQVVSSRSTHE------QSVVEIHEKWMAQHGRSYKDELEKEMRLKIFK 72
F+ + L ++ + S HE +S+ +++E+W + H S + EK R +FK
Sbjct: 6 FLFVALSLALVLGITESLDFHEKDLESEESLWDLYERWRSHHTVSTSLD-EKHKRFNVFK 64
Query: 73 ENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPS-HRSTTSSTFKYQNL 131
EN+ ++ K NK G + YKL N+F+D+TN EFR++Y G K+ R TT +
Sbjct: 65 ENVMHVHKTNKMG-KPYKLKLNKFADMTNHEFRSVYAGSKVKHHRMFRGTTRGNGSFMYG 123
Query: 132 SMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDC 191
+ VPTS+DWR KGAVT +K+Q +CG CWAF+ + AVEGI I++ L+ LSEQ+L+DC
Sbjct: 124 KVEKVPTSVDWRKKGAVTAVKDQGQCGSCWAFSTIVAVEGINYIKTNELVSLSEQELVDC 183
Query: 192 STNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQK-PAAAKISNYEEVP 250
T N GC GG E AF +I + +GI TE YPY+A G C AA++ A I YE+VP
Sbjct: 184 DTTENQGCNGGLMEYAFEFIKKKRGITTESTYPYKAEDGHCDAAKENNPAVSIDGYEKVP 243
Query: 251 SGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGAN 310
DE ALLKA + QPVS+AI A ++FQ Y EG+F G CGT+LDH V +VG+GTT DG
Sbjct: 244 ENDEDALLKAAANQPVSVAIDAGGSDFQFYSEGVFIGECGTELDHGVAVVGYGTTLDGTK 303
Query: 311 YWLIKNSWGNTWGDAGYMKIVR----DEGLCGIGTRSSYPL 347
YW+++NSWG WG+ GY+++ R EGLCGI +SYP+
Sbjct: 304 YWIVRNSWGPEWGEKGYIRMQRGISDKEGLCGIAMEASYPI 344
>gi|30141027|dbj|BAC75927.1| cysteine protease-5 [Helianthus annuus]
Length = 365
Score = 312 bits (800), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 147/326 (45%), Positives = 214/326 (65%), Gaps = 14/326 (4%)
Query: 33 VSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLG 92
V+S + ++ V +E W+A+HG++Y EKE R +IF +NL++I++ N GNR+YK+G
Sbjct: 22 VTSNTRTDEEVRNTYELWLARHGKTYNALGEKESRFRIFADNLKFIDEHNLSGNRSYKVG 81
Query: 93 TNQFSDLTNDEFRALYTG-----YKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGA 147
NQF+DLTN+E+R++Y G Y+ + R S + Q M P +DWR++GA
Sbjct: 82 LNQFADLTNEEYRSMYLGTKVDPYRRIAKMQRGEISRRYAVQENEM--FPAKVDWRERGA 139
Query: 148 VTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKA 207
V+P+KNQ CG CWAF+ VA+VEGI KI +G+LI LSEQ+L+DC N+GC GGS + A
Sbjct: 140 VSPVKNQGGCGSCWAFSTVASVEGINKIVTGDLISLSEQELVDCDNKYNSGCNGGSMDYA 199
Query: 208 FAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAA-AKISNYEEVPSGDEQALLKAVSMQPV 266
F +I+ N GI +E +YPY+ V C + A I YE+VP +E+AL+KAV+ QPV
Sbjct: 200 FQFIVSNGGIDSESDYPYKGVGAVCDPVRNKAKIVSIDGYEDVPPMNEKALMKAVAHQPV 259
Query: 267 SIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAG 326
S+ I A FQ Y G+ G CGT LDH V +VG+G +E+G +YW+++NSWG WG+ G
Sbjct: 260 SVGIEASGRAFQLYTSGVLTGSCGTNLDHGVVVVGYG-SENGKDYWIVRNSWGPEWGEDG 318
Query: 327 YMKIVRDE-----GLCGIGTRSSYPL 347
Y+++ R+ G+CGI +SYP+
Sbjct: 319 YIRMERNMVDTPVGMCGITLMASYPI 344
>gi|242092702|ref|XP_002436841.1| hypothetical protein SORBIDRAFT_10g009840 [Sorghum bicolor]
gi|241915064|gb|EER88208.1| hypothetical protein SORBIDRAFT_10g009840 [Sorghum bicolor]
Length = 328
Score = 312 bits (799), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 152/336 (45%), Positives = 210/336 (62%), Gaps = 24/336 (7%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
+ I+ L C + + + + ++V HE+WM Q+ R YKD EK R ++FK N+++
Sbjct: 8 ILAILGLAFFCGAALAARDLNDDSAMVARHEQWMVQYSRVYKDTTEKARRFEVFKANVKF 67
Query: 78 IEKANKEGNRTYKLGTNQFSDLTNDEFRALYT--GYKMPSPSHRSTTSSTFKYQNLSMTD 135
IE N GNR + LG NQF+DLTNDEFRA T G+K PSP ST F+Y+N+S+
Sbjct: 68 IESFNAGGNRKFWLGVNQFADLTNDEFRATKTNKGFK-PSPVKVSTG---FRYENVSVDA 123
Query: 136 VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG 195
+P ++DWR KGAVTPIK+Q +C EGI KI +G LI LSEQ+L+DC +G
Sbjct: 124 LPATIDWRTKGAVTPIKDQGQC------------EGIVKISTGKLISLSEQELVDCDVHG 171
Query: 196 -NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDE 254
+ GC GG + AF +II+N G+ TE YPY A G C + +AA + +E+VP+ DE
Sbjct: 172 EDQGCEGGLMDDAFKFIIKNGGLTTESSYPYTAADGKCKSGSN-SAATVKGFEDVPANDE 230
Query: 255 QALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLI 314
AL+KAV+ QPVS+A+ FQ Y G+ G CGT LDH + +G+G T DG YWL+
Sbjct: 231 AALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYGQTSDGTKYWLL 290
Query: 315 KNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
KNSWG TWG+ GY+++ +D G+CG+ SYP
Sbjct: 291 KNSWGTTWGENGYLRMEKDISDKRGMCGLAMEPSYP 326
>gi|1173630|gb|AAB37233.1| cysteine proteinase [Phalaenopsis sp. SM9108]
Length = 359
Score = 312 bits (799), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 151/341 (44%), Positives = 215/341 (63%), Gaps = 12/341 (3%)
Query: 18 MFIIITLLVSCASQVVSSRSTH---EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKEN 74
+ ++ + L S A+ + E S+ ++E+W + H S +D EK+ R +FKEN
Sbjct: 6 LILVASFLASVAATAIDIADKDLETEDSLWNLYERWRSHHTVS-RDLDEKQKRFNVFKEN 64
Query: 75 LEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSP-----SHRSTTSSTFKYQ 129
YI NK + YKL N+F+DLTN EFR+ Y G ++ S R +++F YQ
Sbjct: 65 PRYIHDFNKRKDIPYKLRLNKFADLTNHEFRSTYAGSRINHHRSLRGSRRGGATNSFMYQ 124
Query: 130 NLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLL 189
+L +P S+DWR KGAVT +K+Q +CG CWAF+ VAAVEGI +I++ L+ LSEQ+L+
Sbjct: 125 SLDSRSLPASIDWRQKGAVTAVKDQGQCGSCWAFSTVAAVEGINQIKTKKLLSLSEQELI 184
Query: 190 DCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEV 249
DC T+ NNGC GG + AF +I +N GI++E EYPY A C+ +K I +E+V
Sbjct: 185 DCDTDENNGCNGGLMDYAFDFIKKNGGISSEAEYPYAAEDSYCATEKKSHVVSIDGHEDV 244
Query: 250 PSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGA 309
P+ DE +LLKAV+ QPVSIAI A +FQ Y EG+F G GT+LDH V IVG+G T+ G
Sbjct: 245 PANDEDSLLKAVANQPVSIAIEASGYDFQFYSEGVFTGRSGTELDHGVAIVGYGKTQQGT 304
Query: 310 NYWLIKNSWGNTWGDAGYMKI---VRDEGLCGIGTRSSYPL 347
YW+++NSWG WG+ GY++I + LCG+ +SYP+
Sbjct: 305 KYWIVRNSWGAEWGEKGYIRISAASDSKRLCGLAMEASYPI 345
>gi|22093636|dbj|BAC06931.1| putative cysteine proteinase [Oryza sativa Japonica Group]
gi|50510021|dbj|BAD30633.1| putative cysteine proteinase [Oryza sativa Japonica Group]
Length = 352
Score = 312 bits (799), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 156/353 (44%), Positives = 222/353 (62%), Gaps = 21/353 (5%)
Query: 8 SGSFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMR 67
S ++ + +++ +S ++V + ++ ++ H+KWMA+HGR+YKD EK R
Sbjct: 5 SSKLQVMAASLLLVVAGGLSTMAKV--TMASRAGTMEARHDKWMAEHGRTYKDAAEKARR 62
Query: 68 LKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFK 127
++FK N++ I+++N GN+ Y+L TN+F+DLT+ EF A+YTGY + + + ++T
Sbjct: 63 FRVFKANVDLIDRSNAAGNKRYRLATNRFTDLTDAEFAAMYTGYNPANTMYAAANATT-- 120
Query: 128 YQNLSMTD--VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
LS D P +DWR +GAVT +KNQ+ CGCCWAF+ VAAVEGI +I +G L+ LSE
Sbjct: 121 --RLSSEDDQQPAEVDWRQQGAVTGVKNQRSCGCCWAFSTVAAVEGIHQITTGELVSLSE 178
Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTC----SAAQKPAAA 241
QQLLDC+ NG GC GGS + AF Y+ + G+ TE Y YQ G C S++ AA
Sbjct: 179 QQLLDCADNG--GCTGGSLDNAFQYMANSGGVTTEAAYAYQGAQGACQFDASSSASGVAA 236
Query: 242 KISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNG-VCGTQLDHAVTIV 300
IS Y+ V DE +L AV+ QPVS+AI F+ Y G+F CGT+LDHAV +V
Sbjct: 237 TISGYQRVNPNDEGSLAAAVASQPVSVAIEGSGAMFRHYGSGVFTADSCGTKLDHAVAVV 296
Query: 301 GFGTTEDGA---NYWLIKNSWGNTWGDAGYMKIVRD---EGLCGIGTRSSYPL 347
G+G DG+ YW+IKNSWG TWGD GYMK+ +D +G CG+ SYP+
Sbjct: 297 GYGAEADGSGGGGYWIIKNSWGTTWGDGGYMKLEKDVGSQGACGVAMAPSYPV 349
>gi|18418684|ref|NP_567983.1| Xylem cysteine proteinase 1 [Arabidopsis thaliana]
gi|71153408|sp|O65493.1|XCP1_ARATH RecName: Full=Xylem cysteine proteinase 1; Short=AtXCP1; Flags:
Precursor
gi|6708181|gb|AAF25831.1|AF191027_1 papain-type cysteine endopeptidase XCP1 [Arabidopsis thaliana]
gi|3080415|emb|CAA18734.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|7270487|emb|CAB80252.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|26449881|dbj|BAC42063.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|28827736|gb|AAO50712.1| unknown protein [Arabidopsis thaliana]
gi|332661101|gb|AEE86501.1| Xylem cysteine proteinase 1 [Arabidopsis thaliana]
Length = 355
Score = 312 bits (799), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 148/314 (47%), Positives = 215/314 (68%), Gaps = 9/314 (2%)
Query: 38 THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
T+ ++E+ E WM++H ++YK EK R ++F+ENL +I++ N E N +Y LG N+F+
Sbjct: 42 TNTDKLLELFESWMSEHSKAYKSVEEKVHRFEVFRENLMHIDQRNNEIN-SYWLGLNEFA 100
Query: 98 DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKEC 157
DLT++EF+ Y G P S + S+ F+Y+++ TD+P S+DWR KGAV P+K+Q +C
Sbjct: 101 DLTHEEFKGRYLGLAKPQFSRKRQPSANFRYRDI--TDLPKSVDWRKKGAVAPVKDQGQC 158
Query: 158 GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
G CWAF+ VAAVEGI +I +GNL LSEQ+L+DC T N+GC GG + AF YII G+
Sbjct: 159 GSCWAFSTVAAVEGINQITTGNLSSLSEQELIDCDTTFNSGCNGGLMDYAFQYIISTGGL 218
Query: 218 ATEDEYPYQAVPGTCSAAQKPAA-AKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTE 276
ED+YPY G C ++ IS YE+VP D+++L+KA++ QPVS+AI A +
Sbjct: 219 HKEDDYPYLMEEGICQEQKEDVERVTISGYEDVPENDDESLVKALAHQPVSVAIEASGRD 278
Query: 277 FQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD--- 333
FQ YK G+FNG CGT LDH V VG+G+++ G++Y ++KNSWG WG+ G++++ R+
Sbjct: 279 FQFYKGGVFNGKCGTDLDHGVAAVGYGSSK-GSDYVIVKNSWGPRWGEKGFIRMKRNTGK 337
Query: 334 -EGLCGIGTRSSYP 346
EGLCGI +SYP
Sbjct: 338 PEGLCGINKMASYP 351
>gi|2511689|emb|CAB17074.1| cysteine proteinase precursor [Phaseolus vulgaris]
Length = 364
Score = 312 bits (799), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 154/338 (45%), Positives = 216/338 (63%), Gaps = 10/338 (2%)
Query: 17 PMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLE 76
P ++++ S A+ +S + E V++++E+W+ +H + Y EKE R ++FK+NL
Sbjct: 7 PTLLLLSFTFSHAT-AMSIINYSENEVMDMYEEWLVKHRKVYNGLDEKEKRFQVFKDNLG 65
Query: 77 YIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSST-FKYQNLSMTD 135
+I+ N + N TY LG N+F+D+TN+E+RA+Y G + + T +T +Y S
Sbjct: 66 FIQDHNAQ-NNTYTLGLNKFADITNEEYRAMYLGTRTDAKRRVMKTQNTGHRYAYNSGDQ 124
Query: 136 VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG 195
+P +DWR KGAV PIK+Q CG CWAF+ VAAVEGI I +G + LSEQ+L+DC
Sbjct: 125 LPVHVDWRLKGAVGPIKDQGNCGSCWAFSTVAAVEGINNIVTGEFVSLSEQELVDCDREY 184
Query: 196 NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDE 254
+ GC GG + AF +IIQN GI TE++YPYQ + GTC +K +I YE+VPS +E
Sbjct: 185 DEGCNGGLMDYAFQFIIQNGGIDTEEDYPYQGIDGTCDQTKKKTKVVQIDGYEDVPSNNE 244
Query: 255 QALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLI 314
AL KAVS QPVS+AI A Q Y+ G+F G CGT LDH V +VG+G TE+G +YWL+
Sbjct: 245 NALKKAVSHQPVSVAIEASGRALQLYQSGVFTGKCGTALDHGVVVVGYG-TENGVDYWLV 303
Query: 315 KNSWGNTWGDAGYMKIVRD-----EGLCGIGTRSSYPL 347
+NSWG WG+ GY K+ R+ EG CGI SYP+
Sbjct: 304 RNSWGTGWGEDGYFKMERNVRSTSEGKCGIAMDCSYPV 341
>gi|218198967|gb|EEC81394.1| hypothetical protein OsI_24614 [Oryza sativa Indica Group]
Length = 342
Score = 311 bits (798), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 157/343 (45%), Positives = 218/343 (63%), Gaps = 22/343 (6%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
+ ++ L + A ++SR+ ++ H+KWMA+HGR+YKD EK R ++FK N++
Sbjct: 6 LLVVAGGLSTMAKVTMASRAGTMEAR---HDKWMAEHGRTYKDAAEKARRFRVFKANVDL 62
Query: 78 IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD-- 135
I+++N GN+ Y+L TN+F+DLT+ EF A+YTGY + + + ++T LS D
Sbjct: 63 IDRSNAAGNKRYRLATNRFTDLTDAEFAAMYTGYNPANTMYAAANATT----RLSSEDDQ 118
Query: 136 VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG 195
P +DWR +GAVT +KNQ+ CGCCWAF+ VAAVEGI +I +G L+ LSEQQLLDC+ NG
Sbjct: 119 QPAEVDWRQQGAVTGVKNQRSCGCCWAFSTVAAVEGIHQITTGELVSLSEQQLLDCADNG 178
Query: 196 NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTC----SAAQKPAAAKISNYEEVPS 251
GC GGS + AF Y+ + G+ TE Y YQ G C S++ AA IS Y+ V
Sbjct: 179 --GCTGGSLDNAFQYMANSGGVTTEAAYAYQGAQGACQFDASSSASGVAATISGYQRVNP 236
Query: 252 GDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNG-VCGTQLDHAVTIVGFGTTEDGA- 309
DE +L AV+ QPVS+AI F+ Y G+F CGT+LDHAV +VG+G DG+
Sbjct: 237 NDEGSLAAAVASQPVSVAIEGSGAMFRHYGSGVFTADSCGTKLDHAVAVVGYGAEADGSG 296
Query: 310 --NYWLIKNSWGNTWGDAGYMKIVRD---EGLCGIGTRSSYPL 347
YW+IKNSWG TWGD GYMK+ +D +G CG+ SYP+
Sbjct: 297 GGGYWIIKNSWGTTWGDGGYMKLEKDVGSQGACGVAMAPSYPV 339
>gi|124484387|dbj|BAF46304.1| cysteine proteinase precursor [Ipomoea nil]
Length = 474
Score = 311 bits (798), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 151/313 (48%), Positives = 211/313 (67%), Gaps = 8/313 (2%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
E V E+ E W+ +HG+SY EK+ R KIF++NL+YI++ N NR+YKLG N+F+D+
Sbjct: 43 EDEVKEMFESWLVKHGKSYNAVDEKDKRFKIFRDNLKYIDEKNSLENRSYKLGLNRFADI 102
Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGC 159
TN+E+R Y G K + S S + +Y ++ +P S+DWR+KGAVT +K+Q CG
Sbjct: 103 TNEEYRTGYLGAKRDA-SRNMVKSKSDRYAPVAGDSLPDSIDWREKGAVTGVKDQGSCGS 161
Query: 160 CWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIAT 219
CWAF+ +AAVEG+ ++ +GNLI LSEQ+L+DC N GC GG AF +II+N GI +
Sbjct: 162 CWAFSTIAAVEGVNQLATGNLISLSEQELVDCDRKINQGCNGGDMGYAFQFIIKNGGIDS 221
Query: 220 EDEYPYQAVPGTCSAAQKPAA--AKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEF 277
E++YPY G C + ++ A A I YEEVP +E++L KAV+ QPVS+AI A +F
Sbjct: 222 EEDYPYTGKDGKCDSYRQNNAKVASIDGYEEVPVNNEKSLQKAVANQPVSVAIEAGGYDF 281
Query: 278 QSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD---- 333
Q Y GIF G CGT LDH V VG+G TE+G +YW++KNSWG+ WG+ GY+++ R+
Sbjct: 282 QLYSSGIFTGSCGTDLDHGVAAVGYG-TENGVDYWIVKNSWGDYWGEKGYVRMQRNVKAK 340
Query: 334 EGLCGIGTRSSYP 346
GLCGI +SYP
Sbjct: 341 TGLCGIAMEASYP 353
>gi|302816222|ref|XP_002989790.1| hypothetical protein SELMODRAFT_184826 [Selaginella moellendorffii]
gi|300142356|gb|EFJ09057.1| hypothetical protein SELMODRAFT_184826 [Selaginella moellendorffii]
Length = 358
Score = 311 bits (798), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 150/317 (47%), Positives = 213/317 (67%), Gaps = 14/317 (4%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
++S ++EKWM HGR Y EKE R +IF++N EYIE+ N++ N+TY LG N F+D+
Sbjct: 27 DRSFRALYEKWMVDHGRVYNGIGEKERRFQIFRDNAEYIEEHNRQVNQTYWLGLNNFADM 86
Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGC 159
T+DEF+ALY G K+P + T S F+Y++ T++P DWR KGAV +KNQ CG
Sbjct: 87 THDEFKALYFGTKVPLSN---TIKSGFRYKD--ATNLPLDTDWRSKGAVATVKNQGACGS 141
Query: 160 CWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIAT 219
CWAF+ VAAVEG+ +I +G L+ LSEQ+L+DC N GC GG + AF +IIQN G+ +
Sbjct: 142 CWAFSTVAAVEGVNQIVTGELVSLSEQELVDCDKQKNQGCNGGLMDSAFEFIIQNGGLDS 201
Query: 220 EDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQ 278
E +YPY+AV G+C +++ + I +E+VP+ E LLKAV+ QPVS+AI A FQ
Sbjct: 202 EADYPYKAVSGSCDESRRNSHVVTIDGFEDVPAESEADLLKAVANQPVSVAIEASGRNFQ 261
Query: 279 SYKEGIFNGVCGTQLDHAVTIVGFGT--TEDGA--NYWLIKNSWGNTWGDAGYMKIVRD- 333
Y G++ G CG +LDH V VG+GT T DG +YW+++NSWG+ WG++GY+++ R+
Sbjct: 262 LYSGGVYTGHCGYELDHGVVAVGYGTSKTPDGVATDYWIVRNSWGDAWGESGYIRLQRNV 321
Query: 334 ---EGLCGIGTRSSYPL 347
G CGI +SYP+
Sbjct: 322 ASPRGKCGIAMMASYPV 338
>gi|297802418|ref|XP_002869093.1| hypothetical protein ARALYDRAFT_491113 [Arabidopsis lyrata subsp.
lyrata]
gi|297314929|gb|EFH45352.1| hypothetical protein ARALYDRAFT_491113 [Arabidopsis lyrata subsp.
lyrata]
Length = 355
Score = 311 bits (798), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 148/314 (47%), Positives = 214/314 (68%), Gaps = 9/314 (2%)
Query: 38 THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
T + ++E+ E WM++H + YK EK R ++F+ENL +I++ N E N +Y LG N+F+
Sbjct: 42 TSTEKLLELFESWMSEHSKVYKSVEEKVHRFEVFRENLMHIDQRNNEIN-SYWLGLNEFA 100
Query: 98 DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKEC 157
DLT++EF+ Y G P S + S+ F+Y+++ TD+P S+DWR KGAV P+K+Q +C
Sbjct: 101 DLTHEEFKGRYLGLAKPQFSRKRQPSANFRYRDI--TDLPKSVDWRKKGAVAPVKDQGQC 158
Query: 158 GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
G CWAF+ VAAVEGI +I +GNL LSEQ+L+DC T N+GC GG + AF YII G+
Sbjct: 159 GSCWAFSTVAAVEGINQITTGNLSSLSEQELIDCDTTFNSGCNGGLMDYAFQYIISTGGL 218
Query: 218 ATEDEYPYQAVPGTCSAAQKPAA-AKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTE 276
ED+YPY G C ++ IS YE+VP D+++L+KA++ QPVS+AI A +
Sbjct: 219 HKEDDYPYLMEEGICQEQKEDVERVTISGYEDVPENDDESLVKALAHQPVSVAIEASGRD 278
Query: 277 FQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD--- 333
FQ YK G+FNG CGT LDH V VG+G+++ G++Y ++KNSWG WG+ G++++ R+
Sbjct: 279 FQFYKGGVFNGQCGTDLDHGVAAVGYGSSK-GSDYVIVKNSWGPRWGEKGFIRMKRNTGK 337
Query: 334 -EGLCGIGTRSSYP 346
EGLCGI +SYP
Sbjct: 338 PEGLCGINKMASYP 351
>gi|302816909|ref|XP_002990132.1| hypothetical protein SELMODRAFT_428615 [Selaginella moellendorffii]
gi|300142145|gb|EFJ08849.1| hypothetical protein SELMODRAFT_428615 [Selaginella moellendorffii]
Length = 358
Score = 311 bits (797), Expect = 3e-82, Method: Compositional matrix adjust.
Identities = 149/311 (47%), Positives = 210/311 (67%), Gaps = 14/311 (4%)
Query: 46 IHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFR 105
++EKWM HGR Y EKE R +IF++N EYIE+ N++ N+TY LG N F+D+T+DEF+
Sbjct: 33 LYEKWMVDHGRVYNGIGEKERRFQIFRDNAEYIEEHNRQVNQTYWLGLNNFADMTHDEFK 92
Query: 106 ALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAA 165
ALY G K+P + T S F+Y++ T++P DWR KGAV +KNQ CG CWAF+
Sbjct: 93 ALYFGTKVPLSN---TIKSGFRYED--ATNLPLDTDWRSKGAVATVKNQGACGSCWAFST 147
Query: 166 VAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPY 225
VAAVEG+ +I +G L+ LSEQ+L+DC N GC GG + AF +IIQN G+ +E +YPY
Sbjct: 148 VAAVEGVNQIVTGELVSLSEQELVDCDKQKNQGCNGGLMDSAFEFIIQNGGLDSEADYPY 207
Query: 226 QAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGI 284
+AV G+C +++ + I +E+VP+ E LLKAV+ QPVS+AI A FQ Y G+
Sbjct: 208 KAVSGSCDESRRNSHVVTIDGFEDVPAESEADLLKAVANQPVSVAIEASGRNFQLYSGGV 267
Query: 285 FNGVCGTQLDHAVTIVGFGT--TEDGA--NYWLIKNSWGNTWGDAGYMKIVRD----EGL 336
+ G CG +LDH V VG+GT T DG +YW+++NSWG+ WG++GY+++ R+ G
Sbjct: 268 YTGHCGYELDHGVVAVGYGTSKTPDGVATDYWIVRNSWGDAWGESGYIRLQRNVASSRGK 327
Query: 337 CGIGTRSSYPL 347
CGI +SYP+
Sbjct: 328 CGIAMMASYPV 338
>gi|255568345|ref|XP_002525147.1| cysteine protease, putative [Ricinus communis]
gi|223535606|gb|EEF37274.1| cysteine protease, putative [Ricinus communis]
Length = 347
Score = 311 bits (797), Expect = 3e-82, Method: Compositional matrix adjust.
Identities = 150/307 (48%), Positives = 210/307 (68%), Gaps = 13/307 (4%)
Query: 47 HEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRA 106
++KW+ Q+GR Y + E +R I+ N+++IE N + N ++KL N+F+DLTNDEF +
Sbjct: 46 YDKWLEQYGRKYDTKDEYLLRFGIYHSNIQFIEYINSQ-NLSFKLTDNKFADLTNDEFNS 104
Query: 107 LYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAV 166
+Y GY++ RS + + + TD+P ++DWR+ GAVTPIK+Q +CG CWAF+AV
Sbjct: 105 IYLGYQI-----RSYKRRNLSHMHENSTDLPDAVDWRENGAVTPIKDQGQCGSCWAFSAV 159
Query: 167 AAVEGITKIRSGNLIQLSEQQLLDCSTNGNN-GCLGGSREKAFAYIIQNQGIATEDEYPY 225
AAVEGI KI++GNL+ LSEQ+L+DC NG+N GC GG EKAF +I G+ TE++YPY
Sbjct: 160 AAVEGINKIKTGNLVSLSEQELVDCDVNGDNKGCNGGFMEKAFTFIKSIGGLTTENDYPY 219
Query: 226 QAVPGTCSAAQKP-AAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGI 284
+ G+C A+ A I YE VP+ +E +L AVS QPVS+AI A EFQ Y EG+
Sbjct: 220 KGTDGSCEKAKTDNHAVIIGGYETVPANNENSLKVAVSKQPVSVAIDASGYEFQLYSEGV 279
Query: 285 FNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIG 340
F+G CG QL+H VTIVG+G +G YWL+KNSWG WG++GY+++ RD +G+CGI
Sbjct: 280 FSGYCGIQLNHGVTIVGYGDN-NGQKYWLVKNSWGKGWGESGYIRMKRDSSDTKGMCGIA 338
Query: 341 TRSSYPL 347
SYP+
Sbjct: 339 MEPSYPI 345
>gi|356515044|ref|XP_003526211.1| PREDICTED: LOW QUALITY PROTEIN: thiol protease SEN102-like [Glycine
max]
Length = 337
Score = 311 bits (797), Expect = 3e-82, Method: Compositional matrix adjust.
Identities = 161/333 (48%), Positives = 216/333 (64%), Gaps = 14/333 (4%)
Query: 20 IIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIE 79
+ + LL+S V SR HE S+ E HE W+A++G+ YK EKE +IFKEN+E+IE
Sbjct: 11 LALFLLLSIEISQVMSRKLHETSLREEHENWIARYGQVYKVAAEKET-FQIFKENVEFIE 69
Query: 80 KANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTS 139
N N+ YKLG N F+DLT +EF+ G K +H + + FKY+N+ TD+P +
Sbjct: 70 SFNAAANKPYKLGVNLFADLTLEEFKDFRFGLK---KTHEFSITP-FKYENV--TDIPEA 123
Query: 140 LDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNG 198
LDWR+KGAVTPIK+Q +CG CWAF+ VAA EGI +I +GNL+ L EQ+L+ C T G + G
Sbjct: 124 LDWREKGAVTPIKDQGQCGSCWAFSTVAATEGIHQITTGNLVSLXEQELVSCDTKGVDQG 183
Query: 199 CLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAA-QKPAAAKISNYEEVPSGDEQAL 257
C GG E F +II+N GI T+ YPY+ V GTC+ A+I YE VPS E+AL
Sbjct: 184 CEGGYMEDGFEFIIKNGGITTKANYPYKGVNGTCNTTIAASTVAQIKGYETVPSYSEEAL 243
Query: 258 LKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNS 317
KAV+ QPVS++I A + F Y GI+ G CGT LDH VT VG+GTT + +YW++KNS
Sbjct: 244 QKAVANQPVSVSIDANNGHFMFYAGGIYTGECGTDLDHGVTAVGYGTTNE-TDYWIVKNS 302
Query: 318 WGNTWGDAGYMKIVR----DEGLCGIGTRSSYP 346
WG W + G++++ R GLCG+ SSYP
Sbjct: 303 WGTGWDEKGFIRMQRGITVKHGLCGVALDSSYP 335
>gi|1256830|gb|AAB68374.1| cysteine endopeptidase 1 [Phaseolus vulgaris]
gi|2959418|emb|CAA12118.1| cysteine protease [Phaseolus vulgaris]
Length = 364
Score = 311 bits (796), Expect = 4e-82, Method: Compositional matrix adjust.
Identities = 154/338 (45%), Positives = 215/338 (63%), Gaps = 10/338 (2%)
Query: 17 PMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLE 76
P ++++ S A+ +S + E V++++E+W+ +H + Y EKE R ++FK+NL
Sbjct: 7 PTLLLLSFTFSHAT-AMSIINYSENEVMDMYEEWLVKHRKVYNGLDEKEKRFQVFKDNLG 65
Query: 77 YIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSST-FKYQNLSMTD 135
+I+ N + N TY LG N+F+D+TN E+RA+Y G + + T +T +Y S
Sbjct: 66 FIQDHNAQ-NNTYTLGLNKFADITNKEYRAMYLGTRTDAKRRVMKTQNTGHRYAYNSGDQ 124
Query: 136 VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG 195
+P +DWR KGAV PIK+Q CG CWAF+ VAAVEGI I +G + LSEQ+L+DC
Sbjct: 125 LPVHVDWRLKGAVGPIKDQGNCGSCWAFSTVAAVEGINNIVTGEFVSLSEQELVDCDREY 184
Query: 196 NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDE 254
+ GC GG + AF +IIQN GI TE++YPYQ + GTC +K +I YE+VPS +E
Sbjct: 185 DEGCNGGLMDYAFQFIIQNGGIDTEEDYPYQGIDGTCDETKKKTKVVQIDGYEDVPSNNE 244
Query: 255 QALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLI 314
AL KAVS QPVS+AI A Q Y+ G+F G CGT LDH V +VG+G TE+G +YWL+
Sbjct: 245 NALKKAVSHQPVSVAIEASGRALQLYQSGVFTGKCGTALDHGVVVVGYG-TENGVDYWLV 303
Query: 315 KNSWGNTWGDAGYMKIVRD-----EGLCGIGTRSSYPL 347
+NSWG WG+ GY K+ R+ EG CGI SYP+
Sbjct: 304 RNSWGTGWGEDGYFKMERNVRSTSEGKCGIAMDCSYPV 341
>gi|194352752|emb|CAQ00104.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
Length = 351
Score = 311 bits (796), Expect = 4e-82, Method: Compositional matrix adjust.
Identities = 154/317 (48%), Positives = 220/317 (69%), Gaps = 11/317 (3%)
Query: 37 STHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQF 96
S+H++ +VE+ EKW+A+H ++Y EK R ++FK+NL+ I++ N+E +Y LG N+F
Sbjct: 35 SSHDR-LVELFEKWLAKHQKAYASFEEKLHRFEVFKDNLKLIDEINRE-VTSYWLGLNEF 92
Query: 97 SDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKE 156
+DLT+DEF+ Y G + P R ++S +F+Y+N++ D+P ++DWR KGAVT +KNQ +
Sbjct: 93 ADLTHDEFKTTYLG--LSPPPARRSSSRSFRYENVAAHDLPKAVDWRKKGAVTDVKNQGQ 150
Query: 157 CGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQG 216
CG CWAF+ VAAVEGI I +GNL LSEQ+L+DCS +GN+GC GG + AF+YI + G
Sbjct: 151 CGSCWAFSTVAAVEGINAIVTGNLTALSEQELIDCSVDGNSGCNGGMMDYAFSYIASSGG 210
Query: 217 IATEDEYPYQAVPGTCSAAQK--PAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYS 274
+ TE+ YPY G+C +K A IS YE+VP+ DEQAL+KA++ QPVS+AI A
Sbjct: 211 LHTEEAYPYLMEEGSCGDGKKSESEAVSISGYEDVPTKDEQALIKALAHQPVSVAIEASG 270
Query: 275 TEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTE-DGANYWLIKNSWGNTWGDAGYMKIVR- 332
FQ Y G+F+G CG QLDH V VG+G+ + G +Y ++KNSWG WG+ GY+++ R
Sbjct: 271 RHFQFYSGGVFDGPCGAQLDHGVAAVGYGSDKGKGHDYIIVKNSWGGKWGEKGYIRMKRG 330
Query: 333 ---DEGLCGIGTRSSYP 346
EGLCGI +SYP
Sbjct: 331 TGKSEGLCGINKMASYP 347
>gi|50355611|dbj|BAD29954.1| cysteine protease [Daucus carota]
Length = 474
Score = 310 bits (795), Expect = 4e-82, Method: Compositional matrix adjust.
Identities = 155/355 (43%), Positives = 226/355 (63%), Gaps = 14/355 (3%)
Query: 1 MVLIFERSGSFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKD 60
+VL F + +++ IIT + R TH+Q ++ ++E W+ +H ++Y
Sbjct: 16 LVLFFSLASFLMLSSASDMSIITYDETHGLNSPPLR-THDQ-LLSLYESWLVKHHKNYNA 73
Query: 61 ELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRS 120
EKE R IFK+N+ ++++ N N++YKLG N+F+DLTNDE+R+LY KM ++
Sbjct: 74 LGEKETRFGIFKDNVGFVDRHNSMRNQSYKLGLNKFADLTNDEYRSLYLSGKMMKRERKN 133
Query: 121 TTSSTFKYQNLSMTD---VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRS 177
F+ D +P S+DWRD+GAV P+K+Q +CG CWAF+ V AVEGI KI +
Sbjct: 134 EDG--FRSDRFVFEDGDHLPESVDWRDRGAVAPVKDQGQCGSCWAFSTVGAVEGINKIVT 191
Query: 178 GNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQK 237
G LI LSEQ+L+DC N GC GG + AF +I++N GI TED+YPY+ V G C +K
Sbjct: 192 GELISLSEQELVDCDNGYNQGCNGGLMDYAFEFIVKNGGIDTEDDYPYKGVDGLCDQNRK 251
Query: 238 PA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHA 296
A I+ YE+VP DE++L KAV+ QPVS+AI A FQ Y+ G+F G CGT+LDH
Sbjct: 252 NAKVVTINGYEDVPHNDEKSLKKAVAHQPVSVAIEAGGRAFQLYESGVFTGQCGTELDHG 311
Query: 297 VTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-----EGLCGIGTRSSYP 346
V VG+G +E+G +YW+++NSWG WG++GY+++ R+ G CGI ++SYP
Sbjct: 312 VVAVGYG-SENGKDYWIVRNSWGPDWGESGYIRLERNVASTSTGKCGIAMQASYP 365
>gi|374713649|gb|AEZ65082.1| cysteine protease [Carica papaya]
Length = 471
Score = 310 bits (795), Expect = 5e-82, Method: Compositional matrix adjust.
Identities = 153/353 (43%), Positives = 225/353 (63%), Gaps = 18/353 (5%)
Query: 11 FKINTTPMFIIITLLVSCASQVV----------SSRSTHEQSVVEIHEKWMAQHGRSYKD 60
F++ F+ + +S AS + S E +++++E W+ +HG++Y
Sbjct: 6 FRLCIAISFLFMVFSLSLASMSIIDYDLPADPLQSTERTEAHMMKMYEHWLVKHGKNYNA 65
Query: 61 ELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSH-R 119
EKE R +IFK+NL ++++ N RTYKLG +F+DLTN+E+RA+Y G KM R
Sbjct: 66 IGEKERRFEIFKDNLRFVDEQNSVPGRTYKLGLTKFADLTNEEYRAMYLGAKMEKKEKLR 125
Query: 120 STTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGN 179
+ S + ++ + D+P+ +DWR+KGAVT +K+Q +CG CWAF+ V +VEGI +I +G+
Sbjct: 126 TERSQRYLHKAGNDDDLPSHVDWREKGAVTEVKDQGQCGSCWAFSTVGSVEGINQIVTGD 185
Query: 180 LIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA 239
LI LSEQ+L+DC N GC GG + AF +II+N GI +E +YPY+A C + +K A
Sbjct: 186 LISLSEQELVDCDKAYNQGCNGGLMDYAFEFIIKNGGIDSEADYPYRASDNMCDSNRKNA 245
Query: 240 -AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVT 298
I YE+VP DE++L KAV+ QPVS+AI A EFQ Y+ G+F G CGT LDH V
Sbjct: 246 HVVTIDGYEDVPENDEESLKKAVANQPVSVAIEAGGREFQLYQSGVFTGRCGTNLDHGVV 305
Query: 299 IVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR-----DEGLCGIGTRSSYP 346
VG+G TE+G +YW+++NSWG WG++GY+++ R D G CGI +SYP
Sbjct: 306 AVGYG-TENGIDYWIVRNSWGPKWGESGYIRMERNVASTDTGKCGIAMEASYP 357
>gi|242092700|ref|XP_002436840.1| hypothetical protein SORBIDRAFT_10g009830 [Sorghum bicolor]
gi|241915063|gb|EER88207.1| hypothetical protein SORBIDRAFT_10g009830 [Sorghum bicolor]
Length = 328
Score = 310 bits (795), Expect = 5e-82, Method: Compositional matrix adjust.
Identities = 151/337 (44%), Positives = 210/337 (62%), Gaps = 24/337 (7%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
+ I+ L C + + + + ++V HE+WM Q+ R YKD EK R ++FK N+++
Sbjct: 8 ILAILGLAFFCGAALAARDLNDDSAMVARHEQWMVQYSRVYKDTTEKARRFEVFKANVKF 67
Query: 78 IEKANKEGNRTYKLGTNQFSDLTNDEFRALYT--GYKMPSPSHRSTTSSTFKYQNLSMTD 135
IE N GNR + LG NQF+DLTNDEFRA T G+K PSP T F+Y+N+S+
Sbjct: 68 IESFNAGGNRKFWLGVNQFADLTNDEFRATKTNKGFK-PSPVKVPTG---FRYENVSVDA 123
Query: 136 VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG 195
+P ++DWR KGAVTPIK+Q +C EGI KI +G LI LSEQ+L+DC +G
Sbjct: 124 LPATIDWRTKGAVTPIKDQGQC------------EGIVKISTGKLISLSEQELVDCDVHG 171
Query: 196 -NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDE 254
+ GC GG + AF +II+N G+ TE YPY A G C + +AA + +E+VP+ DE
Sbjct: 172 EDQGCEGGLMDDAFQFIIKNGGLTTESSYPYTAADGKCKSGSN-SAATVKGFEDVPANDE 230
Query: 255 QALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLI 314
AL+KAV+ QPVS+A+ FQ Y G+ G CGT LDH + +G+G T DG YWL+
Sbjct: 231 AALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYGQTSDGTKYWLL 290
Query: 315 KNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
KNSWG TWG+ GY+++ +D G+CG+ SYP+
Sbjct: 291 KNSWGTTWGENGYLRMEKDISDKRGMCGLAMEPSYPI 327
>gi|297852302|ref|XP_002894032.1| F2G19.31/F2G19.31 [Arabidopsis lyrata subsp. lyrata]
gi|297339874|gb|EFH70291.1| F2G19.31/F2G19.31 [Arabidopsis lyrata subsp. lyrata]
Length = 455
Score = 310 bits (795), Expect = 5e-82, Method: Compositional matrix adjust.
Identities = 156/350 (44%), Positives = 226/350 (64%), Gaps = 25/350 (7%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQS-------------VVEIHEKWMAQHGRSYKDE--L 62
M I+ +V+ AS V S ++++ V+ I+E W+ +HG++ +
Sbjct: 1 MVILFLAMVAVASAVDMSIISYDEKHGVSTTGGRSDAEVMSIYEAWLVKHGKAQNQNSLV 60
Query: 63 EKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTT 122
EK+ R +IFK+NL +I+ NK+ N +Y+LG +F+DLTNDE+R+ Y G KM R T+
Sbjct: 61 EKDRRFEIFKDNLRFIDDHNKK-NLSYRLGLTRFADLTNDEYRSKYLGAKMEKKGERRTS 119
Query: 123 SSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQ 182
+Y+ ++P S+DWR KGAV +K+Q CG CWAF+ + AVEGI +I +G+LI
Sbjct: 120 Q---RYEARVGDELPESIDWRKKGAVAEVKDQGSCGSCWAFSTIGAVEGINQIVTGDLIT 176
Query: 183 LSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AA 241
LSEQ+L+DC T+ N GC GG + AF +II+N GI T+ +YPY+ V GTC +K A
Sbjct: 177 LSEQELVDCDTSYNEGCNGGLMDYAFEFIIKNGGIDTDKDYPYKGVDGTCDQIRKNAKVV 236
Query: 242 KISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVG 301
I +YE+VP+ E++L KAV+ QPVS+AI A FQ Y GIF+G CGTQLDH V VG
Sbjct: 237 TIDSYEDVPTYSEESLKKAVAHQPVSVAIEAGGRAFQLYDSGIFDGTCGTQLDHGVVAVG 296
Query: 302 FGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
+G TE+G +YW+++NSWG +WG++GY+K+ R+ G CGI SYP+
Sbjct: 297 YG-TENGKDYWIVRNSWGKSWGESGYLKMARNIASSSGKCGIAIEPSYPI 345
>gi|358248896|ref|NP_001239703.1| uncharacterized protein LOC100799247 precursor [Glycine max]
gi|255636729|gb|ACU18700.1| unknown [Glycine max]
Length = 341
Score = 310 bits (795), Expect = 5e-82, Method: Compositional matrix adjust.
Identities = 148/311 (47%), Positives = 204/311 (65%), Gaps = 9/311 (2%)
Query: 45 EIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEF 104
E HEKWMAQ+G+ YKD EKE R ++FK N+++IE N G++ + L NQF+DL ++EF
Sbjct: 33 ERHEKWMAQYGKVYKDAAEKEKRFQVFKNNVQFIESFNAAGDKPFNLSINQFADLHDEEF 92
Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQK-ECGCCWAF 163
+AL + + + T ++F+Y+N+ T +P+++DWR +GAVTPIK+Q CG CWAF
Sbjct: 93 KALLNNVQKKASRVETATETSFRYENV--TKIPSTMDWRKRGAVTPIKDQGYTCGSCWAF 150
Query: 164 AAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEY 223
A VA VE + +I +G L+ LSEQ+L+DC + GC GG E AF +I GI +E Y
Sbjct: 151 ATVATVESLHQITTGELVSLSEQELVDCVRGDSEGCRGGYVENAFEFIANKGGITSEAYY 210
Query: 224 PYQAVPGTCSAAQKP-AAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKE 282
PY+ +C ++ A+I YE VPS E+ALLKAV+ QPVS+ I A + F+ Y
Sbjct: 211 PYKGKDRSCKVKKETHGVARIIGYESVPSNSEKALLKAVANQPVSVYIDAGAIAFKFYSS 270
Query: 283 GIFNGV-CGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLC 337
GIF CGT LDHAV +VG+G DG YWL+KNSW WG+ GYM+I RD +GLC
Sbjct: 271 GIFEARNCGTHLDHAVAVVGYGKLRDGTKYWLVKNSWSTAWGEKGYMRIKRDIRAKKGLC 330
Query: 338 GIGTRSSYPLA 348
GI + +SYP+A
Sbjct: 331 GIASNASYPIA 341
>gi|18141285|gb|AAL60580.1|AF454958_1 senescence-associated cysteine protease [Brassica oleracea]
Length = 485
Score = 310 bits (794), Expect = 7e-82, Method: Compositional matrix adjust.
Identities = 152/352 (43%), Positives = 227/352 (64%), Gaps = 25/352 (7%)
Query: 13 INTTPMFIIITLLVSCAS------------QVVSSRSTHEQSVVEIHEKWMAQHGRSYKD 60
+N+ + + +T++V ++ VSSRS E V ++E+W+ +HG++
Sbjct: 4 LNSATVILFLTMIVVSSAMDMSIISYDKNHHTVSSRSDAE--VSRLYEEWLVKHGKAQNS 61
Query: 61 ELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRS 120
EK+ R +IFK+NL +I++ N + N +Y+LG +F+DLTNDE+R++Y G ++ R
Sbjct: 62 LTEKDRRFEIFKDNLRFIDEHNGK-NLSYRLGLTKFADLTNDEYRSMYLGSRL----KRK 116
Query: 121 TTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNL 180
T S+ +Y+ +P S+DWR +GAV +K+Q CG CWAF+ + AVEGI KI +G+L
Sbjct: 117 ATKSSLRYEVRVGDAIPESVDWRKEGAVAEVKDQGSCGSCWAFSTIGAVEGINKIVTGDL 176
Query: 181 IQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA- 239
I LSEQ+L+DC T+ N GC GG + AF +II N GI TE++YPY+ V G C +K A
Sbjct: 177 ITLSEQELVDCDTSYNEGCNGGLMDYAFEFIINNGGIDTEEDYPYKGVDGRCDQTRKNAK 236
Query: 240 AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTI 299
I YE+VP+ E++L KA+S QP+S+AI FQ Y GIF+G+CGT LDH V
Sbjct: 237 VVTIDLYEDVPANSEESLKKALSHQPISVAIEGGGRAFQLYDSGIFDGICGTDLDHGVVA 296
Query: 300 VGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
VG+G TE+G +YW++KNSWG +WG++GY+++ R+ G CGI SYP+
Sbjct: 297 VGYG-TENGKDYWIVKNSWGTSWGESGYIRMERNIASSAGKCGIAVEPSYPI 347
>gi|34223513|gb|AAQ62999.1| oil palm polygalacturonase allergen PEST472 [Elaeis guineensis]
Length = 525
Score = 310 bits (793), Expect = 8e-82, Method: Compositional matrix adjust.
Identities = 151/315 (47%), Positives = 212/315 (67%), Gaps = 10/315 (3%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN---KEGNRTYKLGTNQF 96
E+ + ++E W+A+HGR+ EKE R +IFK+N+ +I+ N G+R+++LG N+F
Sbjct: 43 EEEMRLLYEGWLAKHGRADNALGEKERRFEIFKDNVRFIDAHNAAADSGHRSFRLGLNRF 102
Query: 97 SDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKE 156
+D+TN+E+R +Y G + P+ R + +Y+ + ++P S+DWRDKGAVT +K+Q
Sbjct: 103 ADMTNEEYRTVYLGTR-PASHRRRARLGSDRYRYNAGEELPESVDWRDKGAVTTVKDQGS 161
Query: 157 CGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQG 216
CG CWAF+ +AAVEGI KI +G+LI LSEQ+L+DC N GC GG + AF +II N G
Sbjct: 162 CGSCWAFSTIAAVEGINKIVTGDLISLSEQELVDCDNGQNQGCNGGLMDYAFEFIINNGG 221
Query: 217 IATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYST 275
I TE++YPY+A G C +K A I YE+VP DE+AL KAV+ QPVS+AI A
Sbjct: 222 IDTEEDYPYKARDGKCDQYRKNAKVVSIDGYEDVPVNDEKALQKAVANQPVSVAIEAGGR 281
Query: 276 EFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-- 333
EFQ Y GIF G CGT LDH V VG+G TE+G +YW+++NSWG WG++GY+++ R+
Sbjct: 282 EFQLYHSGIFTGRCGTDLDHGVVAVGYG-TENGKDYWIVRNSWGGDWGESGYIRMERNVN 340
Query: 334 --EGLCGIGTRSSYP 346
G CGI SSYP
Sbjct: 341 ASTGKCGIAMESSYP 355
>gi|414585111|tpg|DAA35682.1| TPA: cysteine proteinase Mir3 [Zea mays]
Length = 468
Score = 310 bits (793), Expect = 8e-82, Method: Compositional matrix adjust.
Identities = 152/324 (46%), Positives = 211/324 (65%), Gaps = 12/324 (3%)
Query: 32 VVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN---KEGNRT 88
+VS ++ ++ +WMA HGR+Y E+E R ++F++NL YI+ N G +
Sbjct: 31 IVSYGERSDEEARRMYAEWMAAHGRTYNAVGEEERRYQVFRDNLRYIDAHNAAADAGVHS 90
Query: 89 YKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAV 148
++LG N+F+DLTNDE+RA Y G + R + +Y D+P S+DWR KGAV
Sbjct: 91 FRLGLNRFADLTNDEYRATYLGARTRPQRERKLGA---RYHAADNEDLPESVDWRAKGAV 147
Query: 149 TPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAF 208
+K+Q CG CWAF+ +AAVEGI +I +G+LI LSEQ+L+DC T+ N GC GG + AF
Sbjct: 148 AEVKDQGSCGSCWAFSTIAAVEGINQIVTGDLISLSEQELVDCDTSYNQGCNGGLMDYAF 207
Query: 209 AYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVS 267
+II N GI TE +YPY+ G C +K A I +YE+VP+ DE++L KAV+ QPVS
Sbjct: 208 EFIINNGGIDTEKDYPYKGTDGRCDVNRKNAKVVTIDSYEDVPANDEKSLQKAVANQPVS 267
Query: 268 IAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGY 327
+AI A T FQ Y GIF G CGT LDH VT VG+G TE+G +YW++KNSWG++WG++GY
Sbjct: 268 VAIEAAGTAFQLYSSGIFTGSCGTALDHGVTAVGYG-TENGKDYWIVKNSWGSSWGESGY 326
Query: 328 MKIVRD----EGLCGIGTRSSYPL 347
+++ R+ G CGI SYPL
Sbjct: 327 VRMERNIKASSGKCGIAVEPSYPL 350
>gi|171702843|dbj|BAG16377.1| cysteine protease [Brassica rapa var. perviridis]
Length = 431
Score = 310 bits (793), Expect = 8e-82, Method: Compositional matrix adjust.
Identities = 148/322 (45%), Positives = 217/322 (67%), Gaps = 13/322 (4%)
Query: 31 QVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYK 90
VSSRS E V ++E+W+ +HG++ EK+ R +IFK+NL +I++ N + N +Y+
Sbjct: 28 HTVSSRSDVE--VSRLYEEWVVKHGKAQNSLTEKDRRFEIFKDNLRFIDEHNGK-NLSYR 84
Query: 91 LGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTP 150
LG +F+DLTNDE+R++Y G ++ R T ++ +Y+ +P S+DWR +GAV
Sbjct: 85 LGLTKFADLTNDEYRSMYLGSRL----KRKATKTSLRYEARVGDAIPESVDWRKEGAVAE 140
Query: 151 IKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAY 210
+K+Q CG CWAF+ + AVEGI KI +G+LI LSEQ+L+DC T+ N GC GG + AF +
Sbjct: 141 VKDQGSCGSCWAFSTIGAVEGINKIVTGDLISLSEQELVDCDTSYNEGCNGGLMDYAFEF 200
Query: 211 IIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIA 269
II+N GI TE++YPY+ V G C +K A I +YE+VP+ E++L KA+S QP+S+A
Sbjct: 201 IIKNGGIDTEEDYPYKGVDGRCDQTRKNAKVVTIDSYEDVPANSEESLKKALSHQPISVA 260
Query: 270 IAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMK 329
I FQ Y GIF+G+CGT LDH V VG+G TE+G +YW++KNSWG +WG++GY++
Sbjct: 261 IEGGGRAFQLYDSGIFDGICGTDLDHGVVAVGYG-TENGKDYWIVKNSWGTSWGESGYIR 319
Query: 330 IVRD----EGLCGIGTRSSYPL 347
+ R+ G CGI SYP+
Sbjct: 320 MERNIASSAGKCGIAVEPSYPI 341
>gi|146215996|gb|ABQ10200.1| cysteine protease Cp2 [Actinidia deliciosa]
Length = 376
Score = 310 bits (793), Expect = 8e-82, Method: Compositional matrix adjust.
Identities = 147/320 (45%), Positives = 219/320 (68%), Gaps = 9/320 (2%)
Query: 35 SRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTN 94
S S ++ V+ I+ +W+A+HG++Y E+E R +IFK+NL+++++ N E NR+YK+G N
Sbjct: 35 SSSRTDEEVMGIYAEWLAKHGKAYNGIGERERRFEIFKDNLKFVDEHNSE-NRSYKVGLN 93
Query: 95 QFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD-VPTSLDWRDKGAVTPIKN 153
+F+DLTN+E+R+++ G K S + S + + +D +P S+DWR+ GAV PIK+
Sbjct: 94 RFADLTNEEYRSMFLGTKTDSKRRFMKSKSASRRYAVQDSDMLPESVDWRESGAVAPIKD 153
Query: 154 QKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQ 213
Q CG CWAF+ VAAVEG+ +I +G +IQLSEQ+L+DC + GC GG + AF +II
Sbjct: 154 QGSCGSCWAFSTVAAVEGVNQIATGEMIQLSEQELVDCDRTYDAGCNGGLMDYAFEFIIN 213
Query: 214 NQGIATEDEYPYQAVPGTCSAAQK-PAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAA 272
N GI TE++YPY+ V GTC +K I++YE+VP DE AL KAV+ QPVS+AI A
Sbjct: 214 NGGIDTEEDYPYRGVDGTCDPERKNTKVVSINDYEDVPPYDEMALKKAVAHQPVSVAIEA 273
Query: 273 YSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR 332
FQ Y G+F G CG LDH V +VG+G T++GA++W+++NSWG +WG+ GY+++ R
Sbjct: 274 SGRAFQLYLSGVFTGECGRALDHGVVVVGYG-TDNGADHWIVRNSWGTSWGENGYIRMER 332
Query: 333 D-----EGLCGIGTRSSYPL 347
+ G CGI ++SYP+
Sbjct: 333 NVVDNFGGKCGIAMQASYPI 352
>gi|356559055|ref|XP_003547817.1| PREDICTED: cysteine proteinase RD21a [Glycine max]
Length = 366
Score = 310 bits (793), Expect = 9e-82, Method: Compositional matrix adjust.
Identities = 152/341 (44%), Positives = 213/341 (62%), Gaps = 9/341 (2%)
Query: 13 INTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFK 72
+ +T +F+ TL SCA + + + V+ ++E+W+ +H + Y EK+ R ++FK
Sbjct: 8 VTSTLLFLSFTL--SCAIDTSTITNYTDNEVMTMYEEWLVKHQKVYNGLREKDKRFQVFK 65
Query: 73 ENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSST-FKYQNL 131
+NL +I++ N N TYKLG NQF+D+TN+E+R +Y G K + T ST +Y
Sbjct: 66 DNLGFIQEHNNNQNNTYKLGLNQFADMTNEEYRVMYFGTKSDAKRRLMKTKSTGHRYAYS 125
Query: 132 SMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDC 191
+ +P +DWR KGAV PIK+Q CG CWAF+ VA VE I KI +G + LSEQ+L+DC
Sbjct: 126 AGDRLPVHVDWRVKGAVAPIKDQGSCGSCWAFSTVATVEAINKIVTGKFVSLSEQELVDC 185
Query: 192 STNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVP 250
N GC GG + AF +IIQN GI T+ +YPY+ G C +K A I +E+VP
Sbjct: 186 DRAYNEGCNGGLMDYAFEFIIQNGGIDTDKDYPYRGFDGICDPTKKNAKVVNIDGFEDVP 245
Query: 251 SGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGAN 310
DE AL KAV+ QPVSIAI A + Q Y+ G+F G CGT LDH V +VG+G +E+G +
Sbjct: 246 PYDENALKKAVAHQPVSIAIEASGRDLQLYQSGVFTGKCGTSLDHGVVVVGYG-SENGVD 304
Query: 311 YWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
YWL++NSWG WG+ GY K+ R+ G CGI +SYP+
Sbjct: 305 YWLVRNSWGTGWGEDGYFKMQRNVRTPTGKCGITMEASYPV 345
>gi|5777889|emb|CAB53515.1| cysteine protease [Solanum tuberosum]
Length = 466
Score = 310 bits (793), Expect = 9e-82, Method: Compositional matrix adjust.
Identities = 152/351 (43%), Positives = 221/351 (62%), Gaps = 11/351 (3%)
Query: 7 RSGSFKINTTPMFIIITLLVSCASQVVSSRSTH-----EQSVVEIHEKWMAQHGRSYKDE 61
S + I+ M I TL + ++S TH + V ++E W+ +HG+SY
Sbjct: 4 HSSTLTISLLLMLIFSTLSSASDMSIISYDETHIHHRSDDEVSALYESWLIEHGKSYNAL 63
Query: 62 LEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRST 121
EK+ R +IFK+NL+YI++ N N++YKLG +F+DLTN+E+R++Y G K + +
Sbjct: 64 GEKDKRFQIFKDNLKYIDEQNSVPNQSYKLGLTKFADLTNEEYRSIYLGTKSSGDRRKLS 123
Query: 122 TSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLI 181
+ + +Y +P S+DWRDKG + +K+Q CG CWAF+AVAA+E I I +GNLI
Sbjct: 124 KNKSDRYLPKVGDSLPESVDWRDKGVLVGVKDQGSCGSCWAFSAVAAMESINAIVTGNLI 183
Query: 182 QLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-A 240
LSEQ+L+DC + N GC GG + AF ++I N GI TE++YPY+ C +K A
Sbjct: 184 SLSEQELVDCDKSYNEGCDGGLMDYAFEFVINNGGIDTEEDYPYKERNDVCDQYRKNAKV 243
Query: 241 AKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIV 300
KI +YE+VP +E+AL KAV+ QPVSIAI A + Q YK GIF G CGT +DH V
Sbjct: 244 VKIDSYEDVPVNNEKALQKAVAHQPVSIAIEAGGRDLQHYKSGIFTGKCGTAVDHGVVAA 303
Query: 301 GFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
G+G +E+G +YW+++NSWG WG+ GY+++ R+ GLCG+ T SYP+
Sbjct: 304 GYG-SENGMDYWIVRNSWGAKWGEKGYLRVQRNVASSSGLCGLATEPSYPV 353
>gi|225428879|ref|XP_002285299.1| PREDICTED: cysteine proteinase RD21a-like [Vitis vinifera]
Length = 469
Score = 310 bits (793), Expect = 9e-82, Method: Compositional matrix adjust.
Identities = 148/314 (47%), Positives = 211/314 (67%), Gaps = 8/314 (2%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
+ V+ ++E W+ +HG+SY E+E R +IFK+NL +IE+ N NRTYK+G N+F+DL
Sbjct: 47 DAEVMAVYEAWLVKHGKSYNALGERERRFEIFKDNLRFIEEHNAV-NRTYKVGLNRFADL 105
Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGC 159
TN+E+R+ Y G + + + + +Y + D+P S+DWR+KGAV P+K+Q CG
Sbjct: 106 TNEEYRSRYLGRRDETRRGLRASRVSDRYSFRAGEDLPESVDWREKGAVVPVKDQGNCGS 165
Query: 160 CWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIAT 219
CWAF+ +AAVEGI +I +G+LI LSEQ+L+DC + N GC GG + AF +II N GI +
Sbjct: 166 CWAFSTIAAVEGINQIATGDLISLSEQELVDCDKSYNQGCNGGLMDYAFEFIINNGGIDS 225
Query: 220 EDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQ 278
E++YPY+A TC +K A I YE+VP DE++L KAV+ QPVS+AI A FQ
Sbjct: 226 EEDYPYRAADTTCDPNRKNARVVSIDGYEDVPQNDERSLKKAVANQPVSVAIEAGGRAFQ 285
Query: 279 SYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR-----D 333
Y+ G+F G CGTQLDH V VG+G TE+ +YW+++NSWG WG++GY+K+ R +
Sbjct: 286 LYQSGVFTGQCGTQLDHGVVAVGYG-TENSVDYWIVRNSWGPNWGESGYIKLERNLAGTE 344
Query: 334 EGLCGIGTRSSYPL 347
G CGI SYP+
Sbjct: 345 TGKCGIAIEPSYPI 358
>gi|116781957|gb|ABK22314.1| unknown [Picea sitchensis]
Length = 369
Score = 309 bits (792), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 154/315 (48%), Positives = 207/315 (65%), Gaps = 12/315 (3%)
Query: 40 EQSVVEIHEKWMAQHGRSYK-DELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSD 98
E+S+ +++ W QH S D E R +IFKEN++YI+ NK+ + YKLG N+F+D
Sbjct: 39 EKSLRSLYDNWALQHRSSRSLDSEEHAERFEIFKENVKYIDSVNKK-DSPYKLGLNKFAD 97
Query: 99 LTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECG 158
L+N+EF+A+Y G KM R S +F YQN +P S+DWR KGAV +KNQ CG
Sbjct: 98 LSNEEFKAIYMGTKMDLRGDREVQSGSFMYQN--SEPLPASIDWRQKGAVAAVKNQGHCG 155
Query: 159 CCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIA 218
CWAF+ VA+VEGI I +GNL+ LSEQQL+DCST N+GC GG + AF YII N GI
Sbjct: 156 SCWAFSTVASVEGINYITTGNLVSLSEQQLVDCSTE-NSGCNGGLMDTAFQYIINNGGIV 214
Query: 219 TEDEYPYQAVPGTCSAAQ---KPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYST 275
TED YPY A CS+ + + I +E+VP+ +EQAL +AV+ QPVS+AI A
Sbjct: 215 TEDNYPYTAEATECSSTKINSQTTRVVIDGFEDVPANNEQALKEAVAHQPVSVAIEASGQ 274
Query: 276 EFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-- 333
+FQ Y G+F G CGT LDH V VG+GT+ +G NYW+++NSWG WG+ GY+++ +
Sbjct: 275 DFQFYSTGVFTGKCGTALDHGVVAVGYGTSPEGINYWIVRNSWGPKWGEEGYIRMQQGIE 334
Query: 334 --EGLCGIGTRSSYP 346
EG CGI ++SYP
Sbjct: 335 AAEGKCGIAMQASYP 349
>gi|62526575|gb|AAX84673.1| cysteine protease CP1 [Manihot esculenta]
Length = 467
Score = 309 bits (792), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 156/343 (45%), Positives = 220/343 (64%), Gaps = 16/343 (4%)
Query: 18 MFIIITLLVSCASQVVSSRSTH--------EQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+F+ TL + ++S TH + V+ I+E+W+ + G+ Y E+E R +
Sbjct: 15 LFLSFTLSSASDMSIISYDQTHATKSSWRTDDEVMAIYEEWLVKQGKVYNALGEREKRFQ 74
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQ 129
+FK+NL +I++ N E NRTYKLG N F+DLTN+E+R+ Y G + +R +S +Y
Sbjct: 75 VFKDNLRFIDEHNSE-NRTYKLGLNGFADLTNEEYRSTYLGARGGMKRNRLRKTSD-RYA 132
Query: 130 NLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLL 189
+P S+DWR +GAV +K+Q CG CWAF+ +AAVEGI KI +G+LI LSEQ+L+
Sbjct: 133 PRVGESLPDSVDWRKEGAVAEVKDQGSCGSCWAFSTIAAVEGINKIVTGDLISLSEQELV 192
Query: 190 DCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEE 248
DC T+ N GC GG + AF +II N GI TE++YPY A G C +K A I +YE+
Sbjct: 193 DCDTSYNEGCNGGLMDYAFEFIINNGGIDTEEDYPYLARDGRCDTYRKNAKVVTIDDYED 252
Query: 249 VPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDG 308
VP E AL KAV+ QPVS+AI A +FQ Y GIF+G CGTQLDH V VG+G TE+G
Sbjct: 253 VPVNSETALQKAVANQPVSVAIEAGGRDFQFYASGIFSGRCGTQLDHGVAAVGYG-TENG 311
Query: 309 ANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
+YW+++NSWG +WG+ GY+++ R G+CGI +SYP+
Sbjct: 312 KDYWIVRNSWGKSWGENGYLRMARSINSPTGICGIAMEASYPI 354
>gi|171702831|dbj|BAG16371.1| cysteine protease [Brassica oleracea var. italica]
Length = 441
Score = 309 bits (792), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 149/322 (46%), Positives = 215/322 (66%), Gaps = 13/322 (4%)
Query: 31 QVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYK 90
VSSRS E V ++E+W+ +HG++ EK+ R +IFK+NL +I++ N + N +Y+
Sbjct: 28 HTVSSRSDAE--VSRLYEEWLVKHGKAQNSLTEKDRRFEIFKDNLRFIDEHNGK-NLSYR 84
Query: 91 LGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTP 150
LG +F+DLTNDE+R++Y G ++ R T S+ +Y+ +P S+DWR +GAV
Sbjct: 85 LGLTKFADLTNDEYRSMYLGSRL----KRKATKSSLRYEVRVGDAIPESVDWRKEGAVAE 140
Query: 151 IKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAY 210
+K+Q CG CWAF+ + AVEGI KI +G+LI LSEQ+L+DC T+ N GC GG + AF +
Sbjct: 141 VKDQGSCGSCWAFSTIGAVEGINKIVTGDLITLSEQELVDCDTSYNEGCNGGLMDYAFEF 200
Query: 211 IIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIA 269
II N GI TE++YPY+ V G C +K A I YE+VP+ E++L KA+S QP+S+A
Sbjct: 201 IINNGGIDTEEDYPYKGVDGRCDQTRKNAKVVTIDLYEDVPANSEESLKKALSHQPISVA 260
Query: 270 IAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMK 329
I FQ Y GIF+G+CGT LDH V VG+G TE+G +YW++KNSWG +WG++GY++
Sbjct: 261 IEGGGRAFQLYDSGIFDGICGTDLDHGVVAVGYG-TENGKDYWIVKNSWGTSWGESGYIR 319
Query: 330 IVRD----EGLCGIGTRSSYPL 347
+ R+ G CGI SYP+
Sbjct: 320 MERNIASSAGKCGIAVEPSYPI 341
>gi|224103643|ref|XP_002313136.1| predicted protein [Populus trichocarpa]
gi|222849544|gb|EEE87091.1| predicted protein [Populus trichocarpa]
Length = 477
Score = 309 bits (792), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 152/350 (43%), Positives = 219/350 (62%), Gaps = 16/350 (4%)
Query: 10 SFKINTTPMFIIITLLVSCASQVVSSRSTH-------EQSVVEIHEKWMAQHGRSYKDEL 62
SF T F+ + L + ++ H E + ++E W+ ++G++Y
Sbjct: 7 SFAFLATFYFLSVCLAID--MSIIDYNLKHGQVPERTEAETLRLYEMWLVKYGKAYNALG 64
Query: 63 EKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTT 122
EKE R +IFK+NL+++++ N GN +YKLG N+F+DL+N+E+RA Y G +M
Sbjct: 65 EKERRFEIFKDNLKFVDQHNSVGNPSYKLGLNKFADLSNEEYRAAYLGTRMDGKRRLLGG 124
Query: 123 SSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQ 182
+ +Y D+P S+DWR+KGAV P+K+Q +CG CWAF+ V AVEGI +I +GNL
Sbjct: 125 PKSARYLFKDGDDLPESVDWREKGAVAPVKDQGQCGSCWAFSTVGAVEGINQIVTGNLTS 184
Query: 183 LSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AA 241
LSEQ+L+DC N GC GG + AF +I++N GI TE++YPY+AV C +K A
Sbjct: 185 LSEQELVDCDKVYNQGCNGGLMDYAFEFIMKNGGIDTEEDYPYKAVDSMCDPNRKNARVV 244
Query: 242 KISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVG 301
I YE+VP DE++L KAV+ QPVS+AI A FQ Y+ G+F G CGTQLDH V VG
Sbjct: 245 TIDGYEDVPQNDEKSLRKAVANQPVSVAIEAGGRAFQLYQSGVFTGSCGTQLDHGVVAVG 304
Query: 302 FGTTEDGANYWLIKNSWGNTWGDAGYMKIVR-----DEGLCGIGTRSSYP 346
+G TE+G +YW+++NSWG WG+ GY+++ R + G CGI +SYP
Sbjct: 305 YG-TENGVDYWVVRNSWGPAWGENGYIRMERNVASTETGKCGIAMEASYP 353
>gi|13897890|gb|AAK48495.1|AF259983_1 putative cysteine protease [Ipomoea batatas]
Length = 462
Score = 309 bits (791), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 152/316 (48%), Positives = 209/316 (66%), Gaps = 7/316 (2%)
Query: 37 STHEQSVVEIHEKWMAQHGRSYKD-ELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQ 95
S ++ V+ ++E W+ +HG+SY EK+ R +IFK+NL YI++ N G+R+YKLG N+
Sbjct: 39 SRSDEEVMALYESWLVEHGKSYNGLGGEKDKRFEIFKDNLRYIDEQNSRGDRSYKLGLNR 98
Query: 96 FSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQK 155
F+DLTN+E+R+ Y G K + + T S +Y + +P S+DWR+KGAV +K+Q
Sbjct: 99 FADLTNEEYRSTYLGAKTDARRRIAKTKSDRRYAPKAGGSLPDSIDWREKGAVAEVKDQG 158
Query: 156 ECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQ 215
CG CWAF+ +AAVEGI +I +G LI LSEQ+L+DC T+ N GC GG + AF +II+N
Sbjct: 159 SCGSCWAFSTIAAVEGINQIVTGELISLSEQELVDCDTSYNEGCNGGLMDYAFEFIIKNG 218
Query: 216 GIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYS 274
GI TE +YPY G C +K A I YE+V DE AL +AV+ QPVS+AI A
Sbjct: 219 GIDTEADYPYTGRYGRCDQTRKNAKVVSIDGYEDVTPYDEAALKEAVAGQPVSVAIEAGG 278
Query: 275 TEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD- 333
+FQ Y GIF G CGT LDH VT VG+G TE+G +YW++KNSW +WG+ GY+++ R+
Sbjct: 279 RDFQLYSSGIFTGSCGTDLDHGVTAVGYG-TENGVDYWIVKNSWAASWGEKGYLRMQRNV 337
Query: 334 ---EGLCGIGTRSSYP 346
GLCGI SYP
Sbjct: 338 KDKNGLCGIAIEPSYP 353
>gi|148907299|gb|ABR16787.1| unknown [Picea sitchensis]
Length = 372
Score = 309 bits (791), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 156/320 (48%), Positives = 212/320 (66%), Gaps = 18/320 (5%)
Query: 40 EQSVVEIHEKWMAQHGRSYK--DELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
++S+ +++KW QH RS + D E R +IFKEN+++I+ NK+ + YKLG N+F+
Sbjct: 38 DESLRGLYDKWALQH-RSTRSLDSDEHARRFEIFKENVKHIDSVNKK-DGPYKLGLNKFA 95
Query: 98 DLTNDEFRALYTGYKMPSPS----HRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKN 153
DL+N+EF+A++ KM R S +F YQN +P S+DWR KGAVTP+KN
Sbjct: 96 DLSNEEFKAMHMTTKMEKHKSLRGDRGVESGSFMYQNSKR--LPASIDWRKKGAVTPVKN 153
Query: 154 QKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQ 213
Q +CG CWAF+ +A+VEGI I++G L+ LSEQQL+DCS N GC GG + AF YII
Sbjct: 154 QGQCGSCWAFSTIASVEGINYIKTGKLVSLSEQQLVDCSKE-NAGCNGGLMDNAFQYIID 212
Query: 214 NQGIATEDEYPYQAVPGTCSAAQ---KPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAI 270
N GI TEDEYPY A G CS + K A I +E+VP+ +E AL KAV+ QPVSIAI
Sbjct: 213 NGGIVTEDEYPYTAEAGECSTTKIESKSIATIIDGFEDVPANNEGALKKAVAHQPVSIAI 272
Query: 271 AAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKI 330
A +FQ Y G+F G CGT+LDH V +VG+G + +G NYW+++NSWG WG+ GY+++
Sbjct: 273 EASGHDFQFYSTGVFTGKCGTELDHGVVVVGYGKSPEGINYWIVRNSWGPEWGEQGYIRM 332
Query: 331 VRD----EGLCGIGTRSSYP 346
R EG CGI ++SYP
Sbjct: 333 QRGIEATEGKCGISMQASYP 352
>gi|226495425|ref|NP_001148706.1| cysteine protease 1 precursor [Zea mays]
gi|195621544|gb|ACG32602.1| cysteine protease 1 precursor [Zea mays]
Length = 463
Score = 309 bits (791), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 152/324 (46%), Positives = 210/324 (64%), Gaps = 12/324 (3%)
Query: 32 VVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN---KEGNRT 88
+VS + ++ +WMA HGR+Y E+E R ++F++NL YI+ N G +
Sbjct: 26 IVSYGERSXEEARRMYAEWMAAHGRTYNAVGEEERRYQVFRDNLRYIDAHNAAADAGVHS 85
Query: 89 YKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAV 148
++LG N+F+DLTNDE+RA Y G + R + +Y D+P S+DWR KGAV
Sbjct: 86 FRLGLNRFADLTNDEYRATYLGARTRPQRERKLGA---RYHAADNEDLPESVDWRAKGAV 142
Query: 149 TPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAF 208
+K+Q CG CWAF+ +AAVEGI +I +G+LI LSEQ+L+DC T+ N GC GG + AF
Sbjct: 143 AEVKDQGSCGSCWAFSTIAAVEGINQIVTGDLISLSEQELVDCDTSYNQGCNGGLMDYAF 202
Query: 209 AYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVS 267
+II N GI TE +YPY+ G C +K A I +YE+VP+ DE++L KAV+ QPVS
Sbjct: 203 EFIINNGGIDTEKDYPYKGTDGRCDVNRKNAKVVTIDSYEDVPANDEKSLQKAVANQPVS 262
Query: 268 IAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGY 327
+AI A T FQ Y GIF G CGT LDH VT VG+G TE+G +YW++KNSWG++WG++GY
Sbjct: 263 VAIEAAGTAFQLYSSGIFTGSCGTALDHGVTAVGYG-TENGKDYWIVKNSWGSSWGESGY 321
Query: 328 MKIVRD----EGLCGIGTRSSYPL 347
+++ R+ G CGI SYPL
Sbjct: 322 VRMERNIKASSGKCGIAVEPSYPL 345
>gi|326502440|dbj|BAJ95283.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 349
Score = 308 bits (790), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 152/317 (47%), Positives = 218/317 (68%), Gaps = 9/317 (2%)
Query: 38 THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
T + ++V HEKWMA+HGR+Y +E EK RL++F+ N + I+ N + T++L TN+F+
Sbjct: 35 TVDSAMVSRHEKWMAEHGRTYANEEEKARRLEVFRANAKLIDSFNSAEDSTHRLATNRFA 94
Query: 98 DLTNDEFRALYTGYKMP--SPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQK 155
DLT++EFRA TG + P + + + + F+Y+N S+ D S+DWR GAVT +K+Q
Sbjct: 95 DLTDEEFRAARTGLRRPPAAAAGAGSGAGGFRYENFSLADAAGSMDWRAMGAVTGVKDQG 154
Query: 156 ECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNN-GCLGGSREKAFAYIIQN 214
CGCCWAF+AVAAVEG+TKIR+G L+ LSEQQL+DC G++ GC GG + AF Y+I
Sbjct: 155 SCGCCWAFSAVAAVEGLTKIRTGRLVSLSEQQLVDCDVYGDDEGCAGGLMDNAFEYMINR 214
Query: 215 QGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYS 274
G+ TE YPY+ G+C + +AA I YE+VP+ +E AL+ AV+ QPVS+AI
Sbjct: 215 GGLTTESSYPYRGTDGSCR--RSASAASIRGYEDVPANNEAALMAAVAHQPVSVAINGGD 272
Query: 275 TEFQSYKEGIFNGV-CGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKI--- 330
+ F+ Y G+ G CGT+L+HA+T VG+GT DG YW++KNSWG +WG+ GY++I
Sbjct: 273 SVFRFYDSGVLGGSGCGTELNHAITAVGYGTASDGTKYWIMKNSWGGSWGEGGYVRIRRG 332
Query: 331 VRDEGLCGIGTRSSYPL 347
VR EG+CG+ +SYP+
Sbjct: 333 VRGEGVCGLAQLASYPV 349
>gi|356563584|ref|XP_003550041.1| PREDICTED: cysteine proteinase RD21a-like [Glycine max]
Length = 366
Score = 308 bits (790), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 153/339 (45%), Positives = 212/339 (62%), Gaps = 9/339 (2%)
Query: 15 TTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKEN 74
+T +F+ TL SCA + + + V+ ++E+W+ +H + Y EK+ R ++FK+N
Sbjct: 10 STLLFLSFTL--SCAIDTSTITNYTDNEVMTMYEEWLVKHQKVYNGLGEKDKRFQVFKDN 67
Query: 75 LEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSST-FKYQNLSM 133
L +I++ N N TYKLG N+F+D+TN+E+R +Y G K + T ST +Y +
Sbjct: 68 LGFIQEHNNNQNNTYKLGLNKFADMTNEEYRVMYFGTKSDAKRRLMKTKSTGHRYAYSAG 127
Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST 193
+P +DWR KGAV PIK+Q CG CWAF+ VA VE I KI +G + LSEQ+L+DC
Sbjct: 128 DQLPVHVDWRVKGAVAPIKDQGSCGSCWAFSTVATVEAINKIVTGKFVSLSEQELVDCDR 187
Query: 194 NGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSG 252
N GC GG + AF +IIQN GI T+ +YPY+ G C +K A A I YE+VP
Sbjct: 188 AYNQGCNGGLMDYAFEFIIQNGGIDTDKDYPYRGFDGICDPTKKNAKAVNIDGYEDVPPY 247
Query: 253 DEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYW 312
DE AL KAV+ QPVSIAI A Q Y+ G+F G CGT LDH V +VG+G +E+G +YW
Sbjct: 248 DENALKKAVARQPVSIAIEASGRALQLYQSGVFTGECGTSLDHGVVVVGYG-SENGVDYW 306
Query: 313 LIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
L++NSWG WG+ GY K+ R+ G CGI +SYP+
Sbjct: 307 LVRNSWGTGWGEDGYFKMQRNVRTPTGKCGITMEASYPV 345
>gi|37780043|gb|AAP32194.1| cysteine protease 1 [Trifolium repens]
Length = 292
Score = 308 bits (790), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 153/295 (51%), Positives = 206/295 (69%), Gaps = 14/295 (4%)
Query: 63 EKEMRLKIFKENLEYIEKANKE-GNRTYKLGTNQFSDLTNDEFRALYTGYK--MPSPSHR 119
E+E RL+IF +N+ YIE +N N+ YKL N+F+DLTN+EF A +K M S R
Sbjct: 3 EREKRLRIFNKNVNYIEASNSAVNNKLYKLSINKFADLTNEEFIASRNKFKGHMCSSIIR 62
Query: 120 STTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGN 179
+TT FKY+N S +P+++DWR KGAVTP+KNQ +CG CWAF+AVAA EGI ++ +G
Sbjct: 63 TTT---FKYENASA--IPSTVDWRKKGAVTPVKNQGQCGSCWAFSAVAATEGIHQLSTGK 117
Query: 180 LIQLSEQQLLDCSTNG-NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKP 238
L+ LSEQ+L+DC T G + GC GG + AF +IIQN G++TE +YPY+ V GTC+A +
Sbjct: 118 LVSLSEQELIDCDTKGVDQGCEGGLMDDAFKFIIQNHGLSTEVQYPYEGVDGTCNANKAS 177
Query: 239 A-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAV 297
A I+ YE+VP+ +E AL KAV+ QP+S+AI A ++FQ Y G+F G CGT+LDH V
Sbjct: 178 IHAVTITGYEDVPANNELALQKAVANQPISVAIDASGSDFQFYNSGVFTGSCGTELDHGV 237
Query: 298 TIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
T VG+G DG YWL+KNSWG WG+ GY+++ R EGLCGI ++SYP A
Sbjct: 238 TAVGYGVGNDGTKYWLVKNSWGADWGEEGYIRMQRGIAAAEGLCGIAMQASYPTA 292
>gi|297816028|ref|XP_002875897.1| hypothetical protein ARALYDRAFT_347926 [Arabidopsis lyrata subsp.
lyrata]
gi|297321735|gb|EFH52156.1| hypothetical protein ARALYDRAFT_347926 [Arabidopsis lyrata subsp.
lyrata]
Length = 361
Score = 308 bits (790), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 154/342 (45%), Positives = 217/342 (63%), Gaps = 19/342 (5%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHG--RSYKDELEKEMRLKIFKENL 75
+F ++ L +C E+ + +++++W + H RS E+E R +F+ N+
Sbjct: 9 LFSLVILETACGFDYEDKEIESEEGLSKLYDRWRSHHSVPRSLH---EREKRFNVFRHNV 65
Query: 76 EYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHR-----STTSSTFKYQN 130
++ +NK+ NR+YKL N+F+DLT EF+ YTG K+ HR S F Y +
Sbjct: 66 MHVHNSNKK-NRSYKLKLNKFADLTIHEFKNAYTGSKIKH--HRMLQGPKRGSKQFMYDH 122
Query: 131 LSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLD 190
+++ +P+S+DWR KGAVT IKNQ +CG CWAF+ VAAVEGI KI++ L+ LSEQ+L+D
Sbjct: 123 ENVSKLPSSVDWRKKGAVTEIKNQGKCGSCWAFSTVAAVEGINKIKTNKLVSLSEQELVD 182
Query: 191 CSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAA-AKISNYEEV 249
C TN N GC GG E AF +I +N GI TED YPY+ + G C A++ I +E V
Sbjct: 183 CDTNQNEGCNGGLMEIAFEFIKKNGGITTEDSYPYEGIDGKCDASKDNGVLVTIDGHENV 242
Query: 250 PSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGA 309
P DE ALLKAV+ QPVS+AI A S++FQ Y EG+F G CGT+L+H V VG+G ++ G
Sbjct: 243 PENDENALLKAVANQPVSVAIDAGSSDFQFYSEGVFTGDCGTELNHGVATVGYG-SQGGK 301
Query: 310 NYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
YW+++NSWG WG+ GY+KI R EG CGI +SYP+
Sbjct: 302 KYWIVRNSWGTEWGEGGYIKIERGIDEPEGRCGIAMEASYPI 343
>gi|226529105|ref|NP_001150196.1| cysteine protease 1 precursor [Zea mays]
gi|194701798|gb|ACF84983.1| unknown [Zea mays]
gi|194704800|gb|ACF86484.1| unknown [Zea mays]
gi|195637480|gb|ACG38208.1| cysteine protease 1 precursor [Zea mays]
gi|413919895|gb|AFW59827.1| cysteine protease 1 [Zea mays]
Length = 470
Score = 308 bits (789), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 147/318 (46%), Positives = 217/318 (68%), Gaps = 13/318 (4%)
Query: 40 EQSVVEIHEKWMAQHGRSY----KDELEKEMRLKIFKENLEYIEKAN-KEGNRTYKLGTN 94
E V +++ W+A+HGR+Y + E E++ R +F +NL +++ N + G R ++LG N
Sbjct: 50 EPEVRAMYDLWLAEHGRAYNALGEGEGERDRRFLVFWDNLRFVDAHNERAGARGFRLGMN 109
Query: 95 QFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQ 154
QF+DLTNDEFRA Y G +P+ + +++ + ++P S+DWR+KGAV P+KNQ
Sbjct: 110 QFADLTNDEFRAAYLGAMVPAARRGAVVGERYRHDG-AAEELPESVDWREKGAVAPVKNQ 168
Query: 155 KECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQ 213
+CG CWAF+AV++VE + +I +G ++ LSEQ+L++CST+ GN+GC GG + AF +II+
Sbjct: 169 GQCGSCWAFSAVSSVESVNQIVTGEMVTLSEQELVECSTDGGNSGCNGGLMDAAFDFIIK 228
Query: 214 NQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAA 272
N GI TED+YPY+AV G C +K A I +E+VP DE++L KAV+ QPVS+AI A
Sbjct: 229 NGGIDTEDDYPYRAVDGKCDMNRKNARVVSIDGFEDVPENDEKSLQKAVAHQPVSVAIEA 288
Query: 273 YSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR 332
EFQ YK G+F+G C T LDH V VG+G E+G +YW+++NSWG WG+AGY+++ R
Sbjct: 289 GGREFQLYKSGVFSGSCTTNLDHGVVAVGYG-AENGKDYWIVRNSWGPKWGEAGYIRMER 347
Query: 333 D----EGLCGIGTRSSYP 346
+ G CGI +SYP
Sbjct: 348 NVNASTGKCGIAMMASYP 365
>gi|58531896|gb|AAW78660.1| cysteine protease [Nicotiana tabacum]
Length = 361
Score = 308 bits (789), Expect = 3e-81, Method: Compositional matrix adjust.
Identities = 151/313 (48%), Positives = 207/313 (66%), Gaps = 16/313 (5%)
Query: 45 EIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEF 104
E++E+W + H S + EK+ R +FK N+ Y+ NK+ ++ YKL N+F+D+TN EF
Sbjct: 36 ELYERWRSHHTVSRSLD-EKDKRFNVFKANVHYVHNFNKK-DKPYKLKLNKFADMTNHEF 93
Query: 105 RALYTGYKMPSPSHRS-----TTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGC 159
R Y G K+ HRS + TF Y N+ DVP S+DWR KGAVTP+K+Q +CG
Sbjct: 94 RHHYAGSKIKH--HRSFLGASRANGTFMYANVE--DVPPSVDWRKKGAVTPVKDQGKCGS 149
Query: 160 CWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIAT 219
CWAF+ V AVEGI +I++ L+ LSEQ+L+DC T+ N GC GG + AF +I + GI T
Sbjct: 150 CWAFSTVVAVEGINQIKTNELVSLSEQELVDCDTSQNQGCNGGLMDMAFEFIKKKGGINT 209
Query: 220 EDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQ 278
E+ YPY A G C ++ + I YE+VP DE +LLKAV+ QPVS+AI A ++FQ
Sbjct: 210 EENYPYMAEGGECDIQKRNSPVVSIDGYEDVPPNDEDSLLKAVANQPVSVAIQASGSDFQ 269
Query: 279 SYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR----DE 334
Y EG+F G CGT+LDH V IVG+GTT DG YW+++NSWG WG+ GY+++ R +E
Sbjct: 270 FYSEGVFTGDCGTELDHGVAIVGYGTTLDGTKYWIVRNSWGPEWGEKGYIRMQREIDAEE 329
Query: 335 GLCGIGTRSSYPL 347
GLCGI + SYP+
Sbjct: 330 GLCGIAMQPSYPI 342
>gi|334185815|ref|NP_680113.3| putative cysteine proteinase [Arabidopsis thaliana]
gi|75313879|sp|Q9STL4.1|CEP2_ARATH RecName: Full=KDEL-tailed cysteine endopeptidase CEP2; Flags:
Precursor
gi|4678354|emb|CAB41164.1| cysteine endopeptidase-like protein [Arabidopsis thaliana]
gi|332644882|gb|AEE78403.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 361
Score = 308 bits (789), Expect = 3e-81, Method: Compositional matrix adjust.
Identities = 153/342 (44%), Positives = 217/342 (63%), Gaps = 19/342 (5%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHG--RSYKDELEKEMRLKIFKENL 75
+F ++ L +C E+ + ++++W + H RS E+E R +F+ N+
Sbjct: 9 LFSLVILQTACGFDYDDKEIESEEGLSTLYDRWRSHHSVPRSLN---EREKRFNVFRHNV 65
Query: 76 EYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTG-----YKMPSPSHRSTTSSTFKYQN 130
++ NK+ NR+YKL N+F+DLT +EF+ YTG ++M R + + ++N
Sbjct: 66 MHVHNTNKK-NRSYKLKLNKFADLTINEFKNAYTGSNIKHHRMLQGPKRGSKQFMYDHEN 124
Query: 131 LSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLD 190
LS +P+S+DWR KGAVT IKNQ +CG CWAF+ VAAVEGI KI++ L+ LSEQ+L+D
Sbjct: 125 LSK--LPSSVDWRKKGAVTEIKNQGKCGSCWAFSTVAAVEGINKIKTNKLVSLSEQELVD 182
Query: 191 CSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAA-AKISNYEEV 249
C T N GC GG E AF +I +N GI TED YPY+ + G C A++ I +E+V
Sbjct: 183 CDTKQNEGCNGGLMEIAFEFIKKNGGITTEDSYPYEGIDGKCDASKDNGVLVTIDGHEDV 242
Query: 250 PSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGA 309
P DE ALLKAV+ QPVS+AI A S++FQ Y EG+F G CGT+L+H V VG+G +E G
Sbjct: 243 PENDENALLKAVANQPVSVAIDAGSSDFQFYSEGVFTGSCGTELNHGVAAVGYG-SERGK 301
Query: 310 NYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
YW+++NSWG WG+ GY+KI R+ EG CGI +SYP+
Sbjct: 302 KYWIVRNSWGAEWGEGGYIKIEREIDEPEGRCGIAMEASYPI 343
>gi|2414570|emb|CAB16317.1| cysteine proteinase precursor [Nicotiana tabacum]
Length = 374
Score = 308 bits (788), Expect = 3e-81, Method: Compositional matrix adjust.
Identities = 154/318 (48%), Positives = 206/318 (64%), Gaps = 13/318 (4%)
Query: 38 THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
+ E V +E W+A+HGR+Y EKE R +IFK+NL +IE N GNRTYK+G NQF+
Sbjct: 41 SDEDQVKNRYEMWLAEHGRAYNALGEKEKRFEIFKDNLRFIEGHNNSGNRTYKVGLNQFA 100
Query: 98 DLTNDEFRALYTGYKMPSPSHRSTTSS---TFKYQNLSMTDVPTSLDWRDKGAVTPIKNQ 154
DLTN+E+R +Y G K S + R S + +Y + +P S+DWR +GAV PIKNQ
Sbjct: 101 DLTNEEYRTMYLGTK--SDARRRFVKSKNPSQRYASRPNELMPHSVDWRKRGAVAPIKNQ 158
Query: 155 KECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQN 214
CG CWAF+ VAAVEGI +I +G +I LSEQ+L+DC N+GC GG + AF +II N
Sbjct: 159 GSCGSCWAFSTVAAVEGINQIVTGEMITLSEQELVDCDRVQNSGCNGGLMDYAFEFIISN 218
Query: 215 QGIATEDEYPYQAVPGTCSAAQKP-AAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAY 273
G+ TE YPY+ V G C +K I YE+VP +E+AL KAV+ QPV +AI A
Sbjct: 219 GGMDTEKHYPYRGVEGRCDPVRKNYKVVSIDGYEDVPR-NERALQKAVAHQPVCVAIEAS 277
Query: 274 STEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD 333
FQ Y G+F G CG ++DH V +VG+G +EDG +YW+++NSWG WG+ GY+K+ R+
Sbjct: 278 GRAFQLYSSGVFTGECGEEVDHGVVVVGYG-SEDGVDYWIVRNSWGTKWGENGYVKMERN 336
Query: 334 E-----GLCGIGTRSSYP 346
G CGI T +SYP
Sbjct: 337 VKKSHLGKCGIMTEASYP 354
>gi|62320725|dbj|BAD95392.1| cysteine proteinase RD21A [Arabidopsis thaliana]
Length = 433
Score = 308 bits (788), Expect = 3e-81, Method: Compositional matrix adjust.
Identities = 150/323 (46%), Positives = 216/323 (66%), Gaps = 12/323 (3%)
Query: 32 VVSSRSTHEQSVVEIHEKWMAQHGR--SYKDELEKEMRLKIFKENLEYIEKANKEGNRTY 89
V ++ E V+ I+E W+ +HG+ S +EK+ R +IFK+NL ++++ N E N +Y
Sbjct: 35 VSTTGGRSEAEVMSIYEAWLVKHGKAQSQNSLVEKDRRFEIFKDNLRFVDEHN-EKNLSY 93
Query: 90 KLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVT 149
+LG +F+DLTNDE+R+ Y G KM R T+ +Y+ ++P S+DWR KGAV
Sbjct: 94 RLGLTRFADLTNDEYRSKYLGAKMEKKGERRTS---LRYEARVGDELPESIDWRKKGAVA 150
Query: 150 PIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFA 209
+K+Q CG CWAF+ + AVEGI +I +G+LI LSEQ+L+DC T+ N GC GG + AF
Sbjct: 151 EVKDQGGCGSCWAFSTIGAVEGINQIVTGDLITLSEQELVDCDTSYNEGCNGGLMDYAFE 210
Query: 210 YIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSI 268
+II+N GI T+ +YPY+ V GTC +K A I +YE+VP+ E++L KAV+ QP+SI
Sbjct: 211 FIIKNGGIDTDKDYPYKGVDGTCDQIRKNAKVVTIDSYEDVPTYSEESLKKAVAHQPISI 270
Query: 269 AIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYM 328
AI A FQ Y GIF+G CGTQLDH V VG+G TE+G +YW+++NSWG +WG++GY+
Sbjct: 271 AIEAGGRAFQLYDSGIFDGSCGTQLDHGVVAVGYG-TENGKDYWIVRNSWGKSWGESGYL 329
Query: 329 KIVRD----EGLCGIGTRSSYPL 347
++ R+ G CGI SYP+
Sbjct: 330 RMARNIASSSGKCGIAIEPSYPI 352
>gi|13432122|sp|P80884.2|ANAN_ANACO RecName: Full=Ananain; Flags: Precursor
gi|2623956|emb|CAA05487.1| Ananain precursor [Ananas comosus]
Length = 345
Score = 308 bits (788), Expect = 3e-81, Method: Compositional matrix adjust.
Identities = 148/333 (44%), Positives = 211/333 (63%), Gaps = 10/333 (3%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
+F+ + L V AS +S +++ E+WMA++GR YKD EK +R +IFK N+ +
Sbjct: 8 VFLFLFLCVMWASPSAASCDEPSDPMMKQFEEWMAEYGRVYKDNDEKMLRFQIFKNNVNH 67
Query: 78 IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
IE N +Y LG NQF+D+TN+EF A YTG +P R S + ++ ++ VP
Sbjct: 68 IETFNNRNGNSYTLGINQFTDMTNNEFVAQYTGLSLPLNIKREPVVS---FDDVDISSVP 124
Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNN 197
S+DWRD GAVT +KNQ CG CWAFA++A VE I KI+ GNL+ LSEQQ+LDC+ +
Sbjct: 125 QSIDWRDSGAVTSVKNQGRCGSCWAFASIATVESIYKIKRGNLVSLSEQQVLDCAV--SY 182
Query: 198 GCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQAL 257
GC GG KA+++II N+G+A+ YPY+A GTC P +A I+ Y V +E+ +
Sbjct: 183 GCKGGWINKAYSFIISNKGVASAAIYPYKAAKGTCKTNGVPNSAYITRYTYVQRNNERNM 242
Query: 258 LKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNS 317
+ AVS QP++ A+ A S FQ YK G+F G CGT+L+HA+ I+G+G G +W+++NS
Sbjct: 243 MYAVSNQPIAAALDA-SGNFQHYKRGVFTGPCGTRLNHAIVIIGYGQDSSGKKFWIVRNS 301
Query: 318 WGNTWGDAGYMKIVRDE----GLCGIGTRSSYP 346
WG WG+ GY+++ RD GLCGI YP
Sbjct: 302 WGAGWGEGGYIRLARDVSSSFGLCGIAMDPLYP 334
>gi|242081867|ref|XP_002445702.1| hypothetical protein SORBIDRAFT_07g024430 [Sorghum bicolor]
gi|241942052|gb|EES15197.1| hypothetical protein SORBIDRAFT_07g024430 [Sorghum bicolor]
Length = 372
Score = 308 bits (788), Expect = 3e-81, Method: Compositional matrix adjust.
Identities = 149/316 (47%), Positives = 203/316 (64%), Gaps = 13/316 (4%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
E+++ ++E+W +H + +D +K R +FKEN+ I N+ + YKL N+F D+
Sbjct: 40 EEALWALYERWRGRHAVA-RDLGDKARRFNVFKENVRLIHDFNQR-DEPYKLRLNRFGDM 97
Query: 100 TNDEFRALYTGYKMPSP----SHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQK 155
T DEFR Y G ++ R ++S+F Y D+PTS+DWR KGAVT +K+Q
Sbjct: 98 TADEFRRHYAGSRVAHHRMFRGDRQGSASSFMYAG--ARDLPTSVDWRQKGAVTDVKDQG 155
Query: 156 ECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQ 215
+CG CWAF+ +AAVEGI I++ NL LSEQQL+DC T GN GC GG + AF YI ++
Sbjct: 156 QCGSCWAFSTIAAVEGINAIKTKNLTSLSEQQLVDCDTKGNAGCDGGLMDYAFQYIAKHG 215
Query: 216 GIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYST 275
G+A ED YPY+A +C + PA I YE+VP+ DE AL KAV+ QPVS+AI A +
Sbjct: 216 GVAAEDAYPYKARQASCKKSPAPAVT-IDGYEDVPANDESALKKAVAHQPVSVAIEASGS 274
Query: 276 EFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-- 333
FQ Y EG+F G CGT+LDH VT VG+G DG YW++KNSWG WG+ GY+++ RD
Sbjct: 275 HFQFYSEGVFAGRCGTELDHGVTAVGYGVAADGTKYWVVKNSWGPEWGEKGYIRMARDVA 334
Query: 334 --EGLCGIGTRSSYPL 347
EG CGI +SYP+
Sbjct: 335 AKEGHCGIAMEASYPV 350
>gi|18401614|ref|NP_564497.1| cysteine proteinase RD21a [Arabidopsis thaliana]
gi|1172873|sp|P43297.1|RD21A_ARATH RecName: Full=Cysteine proteinase RD21a; Short=RD21; Flags:
Precursor
gi|12321010|gb|AAG50628.1|AC083835_13 cysteine protease, putative [Arabidopsis thaliana]
gi|435619|dbj|BAA02374.1| thiol protease [Arabidopsis thaliana]
gi|18175926|gb|AAL59952.1| putative cysteine proteinase RD21A [Arabidopsis thaliana]
gi|22136972|gb|AAM91715.1| putative cysteine proteinase RD21A [Arabidopsis thaliana]
gi|332194014|gb|AEE32135.1| cysteine proteinase RD21a [Arabidopsis thaliana]
Length = 462
Score = 307 bits (787), Expect = 4e-81, Method: Compositional matrix adjust.
Identities = 150/323 (46%), Positives = 216/323 (66%), Gaps = 12/323 (3%)
Query: 32 VVSSRSTHEQSVVEIHEKWMAQHGR--SYKDELEKEMRLKIFKENLEYIEKANKEGNRTY 89
V ++ E V+ I+E W+ +HG+ S +EK+ R +IFK+NL ++++ N E N +Y
Sbjct: 35 VSTTGGRSEAEVMSIYEAWLVKHGKAQSQNSLVEKDRRFEIFKDNLRFVDEHN-EKNLSY 93
Query: 90 KLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVT 149
+LG +F+DLTNDE+R+ Y G KM R T+ +Y+ ++P S+DWR KGAV
Sbjct: 94 RLGLTRFADLTNDEYRSKYLGAKMEKKGERRTS---LRYEARVGDELPESIDWRKKGAVA 150
Query: 150 PIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFA 209
+K+Q CG CWAF+ + AVEGI +I +G+LI LSEQ+L+DC T+ N GC GG + AF
Sbjct: 151 EVKDQGGCGSCWAFSTIGAVEGINQIVTGDLITLSEQELVDCDTSYNEGCNGGLMDYAFE 210
Query: 210 YIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSI 268
+II+N GI T+ +YPY+ V GTC +K A I +YE+VP+ E++L KAV+ QP+SI
Sbjct: 211 FIIKNGGIDTDKDYPYKGVDGTCDQIRKNAKVVTIDSYEDVPTYSEESLKKAVAHQPISI 270
Query: 269 AIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYM 328
AI A FQ Y GIF+G CGTQLDH V VG+G TE+G +YW+++NSWG +WG++GY+
Sbjct: 271 AIEAGGRAFQLYDSGIFDGSCGTQLDHGVVAVGYG-TENGKDYWIVRNSWGKSWGESGYL 329
Query: 329 KIVRD----EGLCGIGTRSSYPL 347
++ R+ G CGI SYP+
Sbjct: 330 RMARNIASSSGKCGIAIEPSYPI 352
>gi|358348957|ref|XP_003638507.1| Cysteine proteinase [Medicago truncatula]
gi|355504442|gb|AES85645.1| Cysteine proteinase [Medicago truncatula]
Length = 362
Score = 307 bits (787), Expect = 4e-81, Method: Compositional matrix adjust.
Identities = 157/349 (44%), Positives = 218/349 (62%), Gaps = 22/349 (6%)
Query: 15 TTPMFIIITLLVSCASQVVSSRSTHE------QSVVEIHEKWMAQHGRSYKDELEKEMRL 68
TT ++I L ++ V S H+ +S+ +++E+W + H S ++ EK+ R
Sbjct: 2 TTKKLLLIVLSIALVLVVSESFDFHDKDVSSDESLWDLYERWRSHHTVS-RNLNEKQKRF 60
Query: 69 KIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHR-----STTS 123
+FK N+ ++ NK ++ YKL N+F+D+TN EF+ Y G K+ HR S
Sbjct: 61 NVFKSNVMHVHNTNKM-DKPYKLKLNKFADMTNHEFKTTYAGSKVNH--HRMFRGTPRVS 117
Query: 124 STFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQL 183
TF Y+N T P S+DWR KGAVT +K+Q +CG CWAF+ V AVEGI +I++ L+ L
Sbjct: 118 GTFMYENF--TKAPASVDWRKKGAVTDVKDQGQCGSCWAFSTVVAVEGINQIKTNRLVPL 175
Query: 184 SEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAK 242
SEQ+L+DC N GC GG E AF YI Q GI TE YPY A G+C A ++ A
Sbjct: 176 SEQELIDCDNQENQGCNGGLMEYAFEYIKQKGGITTESYYPYTANDGSCDATKENVPAVS 235
Query: 243 ISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGF 302
I +E VP+ DE ALLKAV+ QPVS+AI A ++FQ Y EG+F G CG +L+H V IVG+
Sbjct: 236 IDGHETVPANDEDALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGDCGKELNHGVAIVGY 295
Query: 303 GTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
GTT DG NYW+++NSWG WG+ GY+++ R+ EGLCGI +SYP+
Sbjct: 296 GTTVDGTNYWIVRNSWGAEWGEQGYIRMKRNVSNKEGLCGIAMEASYPV 344
>gi|14517542|gb|AAK62661.1| F2G19.31/F2G19.31 [Arabidopsis thaliana]
gi|19548039|gb|AAL87383.1| F2G19.31/F2G19.31 [Arabidopsis thaliana]
Length = 462
Score = 307 bits (787), Expect = 4e-81, Method: Compositional matrix adjust.
Identities = 150/323 (46%), Positives = 216/323 (66%), Gaps = 12/323 (3%)
Query: 32 VVSSRSTHEQSVVEIHEKWMAQHGR--SYKDELEKEMRLKIFKENLEYIEKANKEGNRTY 89
V ++ E V+ I+E W+ +HG+ S +EK+ R +IFK+NL ++++ N E N +Y
Sbjct: 35 VSTTGGRSEAEVMSIYEAWLVKHGKAQSQNSLVEKDRRFEIFKDNLRFVDEHN-EKNLSY 93
Query: 90 KLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVT 149
+LG +F+DLTNDE+R+ Y G KM R T+ +Y+ ++P S+DWR KGAV
Sbjct: 94 RLGLTRFADLTNDEYRSKYLGAKMEKKGERRTS---LRYEARVGDELPESIDWRKKGAVA 150
Query: 150 PIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFA 209
+K+Q CG CWAF+ + AVEGI +I +G+LI LSEQ+L+DC T+ N GC GG + AF
Sbjct: 151 EVKDQGGCGSCWAFSTIGAVEGINQIVTGDLITLSEQELVDCDTSYNEGCNGGLMDYAFE 210
Query: 210 YIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSI 268
+II+N GI T+ +YPY+ V GTC +K A I +YE+VP+ E++L KAV+ QP+SI
Sbjct: 211 FIIKNGGIDTDKDYPYKGVDGTCDQIRKNAKVVTIDSYEDVPTYSEESLKKAVAHQPISI 270
Query: 269 AIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYM 328
AI A FQ Y GIF+G CGTQLDH V VG+G TE+G +YW+++NSWG +WG++GY+
Sbjct: 271 AIEAGGRAFQLYDSGIFDGSCGTQLDHGVVAVGYG-TENGKDYWIVRNSWGKSWGESGYL 329
Query: 329 KIVRD----EGLCGIGTRSSYPL 347
++ R+ G CGI SYP+
Sbjct: 330 RMARNIASSSGKCGIAIEPSYPI 352
>gi|46401612|dbj|BAD16614.1| cysteine proteinase [Dianthus caryophyllus]
Length = 459
Score = 307 bits (787), Expect = 4e-81, Method: Compositional matrix adjust.
Identities = 149/339 (43%), Positives = 228/339 (67%), Gaps = 11/339 (3%)
Query: 18 MFIIITLLVSCA---SQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKEN 74
+F II ++ S A S + + + + + ++E W+ +HG++Y EK++R IFK+N
Sbjct: 11 LFSIIFIVSSSALDLSIIDRAFNRPDDEIASLYETWLVKHGKNYNGLGEKQLRFNIFKDN 70
Query: 75 LEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPS-HRSTTSSTFKYQNLSM 133
L ++++ N E N ++KLG N+F+DLTN+E+R++Y G + S + RS S + +Y +
Sbjct: 71 LRFVDERNSE-NLSFKLGLNRFADLTNEEYRSVYLGTRPRSVAVARSGRSKSDRYAFRAG 129
Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST 193
+P S+DWR KGAV IK+Q CG CWAF+A+AAVEG+ +I +G+LI LSEQ+L++C T
Sbjct: 130 DTLPESVDWRKKGAVAGIKDQGSCGSCWAFSAIAAVEGVNQIVTGDLISLSEQELVECDT 189
Query: 194 NGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSG 252
+ N+GC GG + AF +II+N+GI ++++YPY G C +K A I +YE+ P
Sbjct: 190 SYNDGCDGGLMDYAFEFIIKNEGIDSDEDYPYTGRDGRCDTNRKNAKVVTIDDYEDSPVY 249
Query: 253 DEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYW 312
DE++L KAV+ QPVS+AI +FQ Y G+F G CGT LDH V +VG+G TEDG +YW
Sbjct: 250 DEKSLQKAVANQPVSVAIEGGGRDFQLYDSGVFTGKCGTALDHGVAVVGYG-TEDGLDYW 308
Query: 313 LIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
+++NSWG+TWG+ GY+++ R+ G+CGI SYP+
Sbjct: 309 IVRNSWGDTWGEGGYIRMQRNTKLPSGICGIAIEPSYPI 347
>gi|224136808|ref|XP_002326950.1| predicted protein [Populus trichocarpa]
gi|222835265|gb|EEE73700.1| predicted protein [Populus trichocarpa]
Length = 456
Score = 307 bits (787), Expect = 4e-81, Method: Compositional matrix adjust.
Identities = 154/343 (44%), Positives = 216/343 (62%), Gaps = 17/343 (4%)
Query: 18 MFIIITLLVSCASQVVSSRSTH--------EQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+F++ L + ++S TH + V+ ++E+W+ +HG++Y EKE R +
Sbjct: 5 LFLVFALSSAFDMSIISYHQTHATKSSWRTDDEVMAMYEEWLVKHGKNYNALGEKEKRFE 64
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQ 129
IFK+NL +I++ N E NRTY +G N+F+DLTN+EFR++Y G + TS +Y
Sbjct: 65 IFKDNLMFIDQHNSE-NRTYTVGLNRFADLTNEEFRSMYLGTRTGHKKRLPKTSD--RYA 121
Query: 130 NLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLL 189
+P S+DWR +GAV +K+Q CG CWAF+ +AAVEGI KI +G+LI LSEQ+L+
Sbjct: 122 PRVGDSLPDSVDWRKEGAVAEVKDQGGCGSCWAFSTIAAVEGINKIVTGDLIALSEQELV 181
Query: 190 DCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEE 248
DC T+ N GC GG + AF +II N GI TED+YPY G C +K A I +YE+
Sbjct: 182 DCDTSYNEGCNGGLMDYAFEFIINNGGIDTEDDYPYLGRDGRCDTYRKNAKVVSIDSYED 241
Query: 249 VPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDG 308
VP DE AL KAV+ QPVS+AI FQ Y G+F G CGT LDH V VG+G TE G
Sbjct: 242 VPENDETALKKAVANQPVSVAIEGGGRNFQLYNSGVFTGECGTSLDHGVAAVGYG-TEKG 300
Query: 309 ANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
+YW+++NSWG +WG++GY+++ R+ G CGI SYP+
Sbjct: 301 KDYWIVRNSWGKSWGESGYIRMERNIASPTGKCGIAIEPSYPI 343
>gi|226507950|ref|NP_001151278.1| LOC100284911 precursor [Zea mays]
gi|195645488|gb|ACG42212.1| vignain precursor [Zea mays]
Length = 376
Score = 307 bits (786), Expect = 5e-81, Method: Compositional matrix adjust.
Identities = 150/320 (46%), Positives = 205/320 (64%), Gaps = 19/320 (5%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
E+++ ++E+W +H + +D +K R +FK N+ I + N+ + YKL N+F D+
Sbjct: 42 EEALWALYERWRGRHALA-RDLGDKARRFNVFKANVRLIHEFNRR-DEPYKLRLNRFGDM 99
Query: 100 TNDEFRALYTGYKMPSPSHR--------STTSSTFKYQNLSMTDVPTSLDWRDKGAVTPI 151
T DEFR Y G ++ HR S+ S++F Y + DVP S+DWR KGAVT +
Sbjct: 100 TADEFRRHYAGSRVAH--HRMFRGDRQGSSASASFMYAD--ARDVPASVDWRQKGAVTDV 155
Query: 152 KNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYI 211
K+Q +CG CWAF+ +AAVEGI I++ NL LSEQQL+DC T N GC GG + AF YI
Sbjct: 156 KDQGQCGSCWAFSTIAAVEGINAIKTKNLTSLSEQQLVDCDTKANAGCNGGLMDYAFQYI 215
Query: 212 IQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIA 271
++ G+A ED YPY+A +C + P I YE+VP+ DE AL KAV+ QPVS+AI
Sbjct: 216 AKHGGVAAEDAYPYRARQASCKKSPAPVVT-IDGYEDVPANDESALKKAVAHQPVSVAIE 274
Query: 272 AYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIV 331
A + FQ Y EG+F+G CGT+LDH VT VG+G T DG YWL+KNSWG WG+ GY+++
Sbjct: 275 ASGSHFQFYSEGVFSGRCGTELDHGVTAVGYGVTADGTKYWLVKNSWGPEWGEKGYIRMA 334
Query: 332 RD----EGLCGIGTRSSYPL 347
RD EG CGI +SYP+
Sbjct: 335 RDVAAKEGHCGIAMEASYPV 354
>gi|148927382|gb|ABR19827.1| cysteine proteinase [Elaeis guineensis]
Length = 470
Score = 307 bits (786), Expect = 5e-81, Method: Compositional matrix adjust.
Identities = 153/319 (47%), Positives = 210/319 (65%), Gaps = 12/319 (3%)
Query: 36 RSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN---KEGNRTYKLG 92
RS E + ++E W+A+HGR+Y EKE R +IFK+N+ +I+ N G+R+++LG
Sbjct: 41 RSEEEMRI--LYEGWLAKHGRAYNALGEKERRFEIFKDNVLFIDAHNAAADAGHRSFRLG 98
Query: 93 TNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIK 152
N+F+D+TN+E+RA+Y G + P+ R + +Y+ + D+P S+DWR KGAV +K
Sbjct: 99 LNRFADMTNEEYRAVYLGTR-PAGHRRRARVGSDRYRYNAGEDLPESVDWRAKGAVAAVK 157
Query: 153 NQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYII 212
+Q CG CWAF+ VAAVEGI KI +G+LI LSEQ+L+DC N GC GG + F +II
Sbjct: 158 DQGSCGSCWAFSTVAAVEGINKIVTGDLISLSEQELVDCDNGYNQGCNGGLMDYGFEFII 217
Query: 213 QNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIA 271
N GI TE++YPY A G C +K A I YE+VP DE+AL KAV+ QPVS+AI
Sbjct: 218 NNGGIDTEEDYPYTARDGKCDQYRKNAKVVSIDGYEDVPVNDEKALQKAVANQPVSVAIE 277
Query: 272 AYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIV 331
A EFQ Y GIF G CGT LDH V VG+G TE+G +YW+++NSWG WG++GY+++
Sbjct: 278 AGGREFQLYHSGIFTGRCGTDLDHGVVAVGYG-TENGKDYWIVRNSWGGDWGESGYIRME 336
Query: 332 RD----EGLCGIGTRSSYP 346
R+ G CGI SYP
Sbjct: 337 RNVNTSTGKCGIAIEPSYP 355
>gi|2224810|emb|CAB09698.1| cysteine proteinase [Hordeum vulgare subsp. vulgare]
Length = 349
Score = 307 bits (786), Expect = 5e-81, Method: Compositional matrix adjust.
Identities = 151/317 (47%), Positives = 217/317 (68%), Gaps = 9/317 (2%)
Query: 38 THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
T + ++V HEKWMA+HGR+Y +E EK RL++F+ N + I+ N + T++L TN+F+
Sbjct: 35 TVDAAMVSRHEKWMAEHGRTYANEEEKARRLEVFRANAKLIDSFNSAEDSTHRLATNRFA 94
Query: 98 DLTNDEFRALYTGYKMP--SPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQK 155
DLT++EFRA TG + P + + + + F+Y+N S+ D S+DWR GAVT +K+Q
Sbjct: 95 DLTDEEFRAARTGLRRPPAAAAGAGSGAGGFRYENFSLADAAGSMDWRAMGAVTGVKDQG 154
Query: 156 ECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNN-GCLGGSREKAFAYIIQN 214
CGCCWAF+AVAAVEG+TKIR+G L+ LSEQQL+DC G++ GC GG + AF Y+I
Sbjct: 155 SCGCCWAFSAVAAVEGLTKIRTGRLVSLSEQQLVDCDVYGDDEGCAGGLMDNAFEYMINR 214
Query: 215 QGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYS 274
G+ TE YPY+ G+C + +AA I YE+VP+ +E AL+ AV+ QPVS+AI
Sbjct: 215 GGLTTESSYPYRGTDGSCR--RSASAASIRGYEDVPANNEAALMAAVAHQPVSVAINGGD 272
Query: 275 TEFQSYKEGIFNGV-CGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKI--- 330
+ F+ Y G+ G CGT+L+HA+T G+GT DG YW++KNSWG +WG+ GY++I
Sbjct: 273 SVFRFYDSGVLGGSGCGTELNHAITAAGYGTASDGTKYWIMKNSWGGSWGEGGYVRIRRG 332
Query: 331 VRDEGLCGIGTRSSYPL 347
VR EG+CG+ +SYP+
Sbjct: 333 VRGEGVCGLAQLASYPV 349
>gi|2463586|dbj|BAA22545.1| FB22 precursor [Ananas comosus]
Length = 340
Score = 307 bits (786), Expect = 5e-81, Method: Compositional matrix adjust.
Identities = 148/332 (44%), Positives = 211/332 (63%), Gaps = 8/332 (2%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
+F+ + L V AS +SR +++ E+WMA++GR YKD EK R +IFK N+ +
Sbjct: 8 VFLFLFLCVMWASPSAASRDEPSDPMMKRFEEWMAEYGRVYKDNDEKMRRFQIFKNNVNH 67
Query: 78 IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
IE N +Y LG N+F+D+TN+EF YTG +P R S + +++++ V
Sbjct: 68 IETFNNRNGNSYTLGINKFTDMTNNEFVTQYTGVSLPLNFKREPVVS---FDDVNISAVG 124
Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNN 197
S+DWRD GAVT +K+Q CG CWAF+A+A VEGI KI +G L+ LSEQ++LDC+ + N
Sbjct: 125 QSIDWRDYGAVTEVKDQNPCGSCWAFSAIATVEGIYKIVTGYLVSLSEQEVLDCAVS--N 182
Query: 198 GCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQAL 257
GC GG + A+ +II N G+A+E +YPYQA G C+A P +A I+ Y V S DE ++
Sbjct: 183 GCDGGFVDNAYDFIISNNGVASEADYPYQAYEGDCTANSWPNSAYITGYSYVRSNDESSM 242
Query: 258 LKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNS 317
AV QP++ AI A FQ Y G+F+G CGT L+HA+TI+G+G G YW++KNS
Sbjct: 243 KYAVWNQPIAAAIDASGDNFQYYNGGVFSGPCGTSLNHAITIIGYGQDSSGTQYWIVKNS 302
Query: 318 WGNTWGDAGYMKIVR---DEGLCGIGTRSSYP 346
WG++WG+ GY+++ R GLCGI YP
Sbjct: 303 WGSSWGERGYVRMARGVSSSGLCGIAMDPLYP 334
>gi|356543112|ref|XP_003540007.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 345
Score = 307 bits (786), Expect = 6e-81, Method: Compositional matrix adjust.
Identities = 149/324 (45%), Positives = 205/324 (63%), Gaps = 10/324 (3%)
Query: 33 VSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLG 92
+ SR E E HE WMAQ+G+ YKD EK+ R +IFK N+ +IE N G++ + L
Sbjct: 24 IMSRRLFEACTSERHENWMAQYGKVYKDAAEKKKRFQIFKNNVHFIESFNTAGDKPFNLS 83
Query: 93 TNQFSDLTNDEFRALYTGYKMPSPSHRST---TSSTFKYQNLSMTDVPTSLDWRDKGAVT 149
NQF+DL ++EF+AL T S T T ++FKY + T + ++DWR +GAVT
Sbjct: 84 INQFADLHDEEFKALLTNGNKKVRSVVGTATETETSFKYNRV--TKLLATMDWRKRGAVT 141
Query: 150 PIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFA 209
PIK+Q+ CG CWAF+AVAA+EGI +I + L+ LSEQ+L+DC + GC GG E AF
Sbjct: 142 PIKDQRRCGSCWAFSAVAAIEGIHQITTSKLVSLSEQELVDCVKGESEGCNGGYMEDAFE 201
Query: 210 YIIQNQGIATEDEYPYQAVPGTCSAAQKP-AAAKISNYEEVPSGDEQALLKAVSMQPVSI 268
++ + GIA+E YPY+ +C ++ ++I YE+VPS E+AL KAV+ QPVS+
Sbjct: 202 FVAKKGGIASESYYPYKGKDKSCKVKKETHGVSQIKGYEKVPSNSEKALQKAVAHQPVSV 261
Query: 269 AIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYM 328
+ A FQ Y GIF G CGT DHA+T+VG+G + G YWL+KNSWG WG+ GY+
Sbjct: 262 YVEAGGNAFQFYSSGIFTGKCGTNTDHAITVVGYGKSRGGTKYWLVKNSWGAGWGEKGYI 321
Query: 329 KIVRD----EGLCGIGTRSSYPLA 348
++ RD EGLCGI + YP A
Sbjct: 322 RMKRDIRAKEGLCGIAMNAFYPTA 345
>gi|255538210|ref|XP_002510170.1| cysteine protease, putative [Ricinus communis]
gi|223550871|gb|EEF52357.1| cysteine protease, putative [Ricinus communis]
Length = 469
Score = 307 bits (786), Expect = 6e-81, Method: Compositional matrix adjust.
Identities = 154/332 (46%), Positives = 222/332 (66%), Gaps = 19/332 (5%)
Query: 32 VVSSRSTH--------EQSVVEIHEKWMAQHGRSYKDEL---EKEMRLKIFKENLEYIEK 80
+VS TH + V+ I+E+W+ ++G+++ + EKE R ++FK+NL +I++
Sbjct: 28 IVSYDQTHLTKSSWRTDDEVMAIYEEWLVKNGKAHSNNNALGEKERRFQVFKDNLRFIDE 87
Query: 81 ANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSL 140
N E NR+YK+G N+F+DLTN+E+R++Y G + + +R + SS +Y +P S+
Sbjct: 88 HNSE-NRSYKVGLNRFADLTNEEYRSMYLGARSGAKRNRLSRSSN-RYLPRVGDSLPDSV 145
Query: 141 DWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCL 200
DWR +GAV +K+Q CG CWAF+ +AAVEGI KI +G+LI LSEQ+L+DC + N GC
Sbjct: 146 DWRKEGAVAEVKDQGSCGSCWAFSTIAAVEGINKIVTGDLISLSEQELVDCDRSYNEGCN 205
Query: 201 GGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLK 259
GG + AF +II N GI +E++YPY A GTC +K A I NYE+VP DE+AL K
Sbjct: 206 GGLMDYAFQFIINNGGIDSEEDYPYLARDGTCDTYRKNAKVVTIDNYEDVPVNDEKALQK 265
Query: 260 AVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWG 319
AV+ QPVS+AI A EFQ Y+ GIF G CGT LDH V VG+G TE+G +YW+++NSWG
Sbjct: 266 AVANQPVSVAIEAGGREFQFYQSGIFTGRCGTALDHGVAAVGYG-TENGKDYWIVRNSWG 324
Query: 320 NTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
+WG++GY+++ R+ G CGI SYP+
Sbjct: 325 KSWGESGYIRMERNIATATGKCGIAIEPSYPI 356
>gi|224083868|ref|XP_002307151.1| predicted protein [Populus trichocarpa]
gi|222856600|gb|EEE94147.1| predicted protein [Populus trichocarpa]
Length = 298
Score = 307 bits (786), Expect = 6e-81, Method: Compositional matrix adjust.
Identities = 155/314 (49%), Positives = 213/314 (67%), Gaps = 24/314 (7%)
Query: 43 VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTND 102
+ E HE+WMAQ+GR YKD+ EKE R IFKEN+ I+ N + ++Y LG NQF+DL+N+
Sbjct: 1 MYERHEQWMAQYGRVYKDDAEKETRYNIFKENVARIDAFNSQTGKSYNLGVNQFADLSNE 60
Query: 103 EFRALYTGYK--MPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCC 160
EF+A +K M SP + F+Y+N+S VP ++DWR KGAVTP+K+Q +C
Sbjct: 61 EFKASRNRFKGHMCSPQ-----AGPFRYENVSA--VPATMDWRKKGAVTPVKDQGQC--- 110
Query: 161 WAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFAYIIQNQGIAT 219
VAA+EGI ++ +G LI LSEQ+++DC T G + GC GG + AF +I QN+G+ T
Sbjct: 111 -----VAAMEGINQLTTGKLISLSEQEVVDCDTKGEDQGCNGGLMDDAFKFIEQNKGLTT 165
Query: 220 EDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQ 278
E YPY GTC+ ++ + AAKI+ +++VP+ E AL+KAV+ QPVS+AI A EFQ
Sbjct: 166 EANYPYTGTDGTCNTQKEVSHAAKITGFQDVPANSEAALMKAVAKQPVSVAIDAGGFEFQ 225
Query: 279 SYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----E 334
Y GIF G CGT+LDH VT VG+G + DG YWL+KNSWG WG+ GY+++ +D E
Sbjct: 226 FYSSGIFTGSCGTELDHGVTAVGYGGS-DGTKYWLVKNSWGAQWGEEGYIRMQKDISAKE 284
Query: 335 GLCGIGTRSSYPLA 348
GLCGI ++SYP A
Sbjct: 285 GLCGIAMQASYPTA 298
>gi|255567869|ref|XP_002524912.1| cysteine protease, putative [Ricinus communis]
gi|223535747|gb|EEF37409.1| cysteine protease, putative [Ricinus communis]
Length = 366
Score = 307 bits (786), Expect = 6e-81, Method: Compositional matrix adjust.
Identities = 155/349 (44%), Positives = 216/349 (61%), Gaps = 18/349 (5%)
Query: 15 TTPMFIIITLLVSCASQVVSSRSTH--------EQSVVEIHEKWMAQHGRSYKDELEKEM 66
+T +F+ TL + ++S H + V+ ++ W+A+H ++Y E+E
Sbjct: 8 STLLFLFFTLSSAWDMSILSHNHGHHHQSSWRSDNEVISMYNWWLAKHSKTYNKLGEREK 67
Query: 67 RLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSS-- 124
R +IFK NL +I++ N NRTYK+G +F+DLTN+E+RA + G K P R S
Sbjct: 68 RFEIFKNNLRFIDEHNNSKNRTYKVGLTRFADLTNEEYRAKFLGTK-SDPKRRLMKSKNP 126
Query: 125 TFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLS 184
+ +Y + +P S+DWR GAV+ IK+Q CG CWAF+ +AAVEG+ KI +G LI LS
Sbjct: 127 SQRYAFKAGDVLPESIDWRQSGAVSAIKDQGSCGSCWAFSTIAAVEGVNKIVTGELISLS 186
Query: 185 EQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQ-KPAAAKI 243
EQ+L+DC + N GC GG + AF +II N GI T+ +YPYQAV G C + K A I
Sbjct: 187 EQELVDCDRSYNAGCNGGLMDNAFQFIINNGGIDTDKDYPYQAVDGKCDTTKVKNKAVTI 246
Query: 244 SNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFG 303
+E+V + DE AL KAV+ QPVS+AI A Q Y+ G+F G CG+ LDH V IVG+G
Sbjct: 247 DGFEDVMAFDEMALQKAVAHQPVSVAIEASGMALQFYQSGVFTGECGSALDHGVVIVGYG 306
Query: 304 TTEDGANYWLIKNSWGNTWGDAGYMKIVRD-----EGLCGIGTRSSYPL 347
TEDG +YWL++NSWG WG+ GY+K+ R+ G CGI SSYP+
Sbjct: 307 -TEDGIDYWLVRNSWGRDWGENGYIKMQRNVVDTFTGKCGIAMESSYPI 354
>gi|414870137|tpg|DAA48694.1| TPA: vignain [Zea mays]
Length = 484
Score = 306 bits (785), Expect = 7e-81, Method: Compositional matrix adjust.
Identities = 149/321 (46%), Positives = 204/321 (63%), Gaps = 20/321 (6%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
E+++ ++E+W +H + +D +K R +FK N+ I + N+ + YKL N+F D+
Sbjct: 149 EEALWALYERWRGRHALA-RDLGDKARRFNVFKANVRLIHEFNRR-DEPYKLRLNRFGDM 206
Query: 100 TNDEFRALYTGYKMPSPSHR---------STTSSTFKYQNLSMTDVPTSLDWRDKGAVTP 150
T DEFR Y G ++ HR S ++S+F Y + DVP S+DWR KGAVT
Sbjct: 207 TADEFRRHYAGSRVAH--HRMFRGDRQGSSASASSFMYAD--ARDVPASVDWRQKGAVTD 262
Query: 151 IKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAY 210
+K+Q +CG CWAF+ +AAVEGI I++ NL LSEQQL+DC T N GC GG + AF Y
Sbjct: 263 VKDQGQCGSCWAFSTIAAVEGINAIKTKNLTSLSEQQLVDCDTKANAGCNGGLMDYAFQY 322
Query: 211 IIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAI 270
I ++ G+A ED YPY+A +C + P I YE+VP+ DE AL KAV+ QPVS+AI
Sbjct: 323 IAKHGGVAAEDAYPYRARQASCKKSPAPVVT-IDGYEDVPANDESALKKAVAHQPVSVAI 381
Query: 271 AAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKI 330
A + FQ Y EG+F+G CGT+LDH V VG+G T DG YWL+KNSWG WG+ GY+++
Sbjct: 382 EASGSHFQFYSEGVFSGRCGTELDHGVAAVGYGVTADGTKYWLVKNSWGPEWGEKGYIRM 441
Query: 331 VRD----EGLCGIGTRSSYPL 347
RD EG CGI +SYP+
Sbjct: 442 ARDVAAKEGHCGIAMEASYPV 462
>gi|2511693|emb|CAB17076.1| cysteine proteinase precursor [Phaseolus vulgaris]
Length = 455
Score = 306 bits (785), Expect = 8e-81, Method: Compositional matrix adjust.
Identities = 152/344 (44%), Positives = 220/344 (63%), Gaps = 17/344 (4%)
Query: 18 MFIIITLLVSCASQVVSSRSTH--------EQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+F + L + ++S + H ++ V ++E+W+ +HG+ Y EK+ R +
Sbjct: 3 LFALFALSSALDMSIISYDNAHQDKATWRTDEEVNSLYEEWLVKHGKLYNALGEKDKRFQ 62
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQ 129
IFK+NL +I++ N E NRTYKLG N+F+DLTN+E+RA Y G K+ P+ R + + +Y
Sbjct: 63 IFKDNLRFIDQQNAE-NRTYKLGLNRFADLTNEEYRARYLGTKI-DPNRRLGRTPSNRYA 120
Query: 130 NLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLL 189
+P S+DWR +GAV P+K+Q CG CWAF+A+ AVEGI KI +G+LI LSEQ+L+
Sbjct: 121 PRVGETLPDSVDWRKEGAVVPVKDQASCGSCWAFSAIGAVEGINKIVTGDLISLSEQELV 180
Query: 190 DCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEE 248
DC T N GC GG + AF +II+N GI +E++YPY+ V G C +K A I YE+
Sbjct: 181 DCDTGYNMGCNGGLMDYAFEFIIKNGGIDSEEDYPYKGVDGRCDEYRKNAKVVSIDGYED 240
Query: 249 VPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDG 308
V + DE AL KAV+ QPVS+A+ EFQ Y G+F G CGT LDH V VG+G T++G
Sbjct: 241 VNTYDELALKKAVANQPVSVAVEGGGREFQLYSSGVFTGRCGTALDHGVVAVGYG-TDNG 299
Query: 309 ANYWLIKNSWGNTWGDAGYMKIVRD-----EGLCGIGTRSSYPL 347
++W+++NSWG WG+ GY+++ R+ G CGI SYP+
Sbjct: 300 HDFWIVRNSWGADWGEEGYIRLERNLGNSRSGKCGIAIEPSYPI 343
>gi|3377950|emb|CAA08861.1| cysteine proteinase precursor, AN11 [Ananas comosus]
Length = 357
Score = 306 bits (784), Expect = 8e-81, Method: Compositional matrix adjust.
Identities = 151/335 (45%), Positives = 214/335 (63%), Gaps = 12/335 (3%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
+F+ + L V AS +SR +++ E+WMA++GR YKD EK R +IFK N+ +
Sbjct: 8 VFLFLFLCVMWASPSAASRDEPSDPMMKRFEEWMAEYGRVYKDNDEKMRRFQIFKNNVNH 67
Query: 78 IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
IE N +Y LG NQF+D+TN+EF A YTG +P R S + ++ ++ VP
Sbjct: 68 IETFNSRNGNSYTLGINQFTDMTNNEFVAQYTGVSLPLNIEREPVVS---FDDVDISAVP 124
Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNN 197
S+DWR+ GAVT +KN CG CWAFAA+A VE I KI+ G LI LSEQQ+LDC+ +
Sbjct: 125 QSIDWRNYGAVTSVKNHIPCGSCWAFAAIATVESIYKIKRGYLISLSEQQVLDCAV--SY 182
Query: 198 GCLGGSREKAFAYIIQNQGIATEDEYPYQAV--PGTCSAAQKPAAAKISNYEEVPSGDEQ 255
GC GG KA+ +II N+G+A+ YPY+A GTC P +A I+ Y V S +E+
Sbjct: 183 GCDGGWVNKAYDFIISNKGVASAAIYPYKASQGQGTCRINGVPNSAYITGYTRVQSNNER 242
Query: 256 ALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIK 315
+++ AVS QP++ +I A S +FQ YK G+F+G CGT L+HA+TI+G+G G +W+++
Sbjct: 243 SMMYAVSNQPIAASIEA-SGDFQHYKRGVFSGPCGTSLNHAITIIGYGQDSSGKKFWIVR 301
Query: 316 NSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
NSWG +WG+ GY+++ RD GLCGI R YP
Sbjct: 302 NSWGASWGERGYIRMARDVSSSSGLCGIAIRPLYP 336
>gi|242077600|ref|XP_002448736.1| hypothetical protein SORBIDRAFT_06g032320 [Sorghum bicolor]
gi|241939919|gb|EES13064.1| hypothetical protein SORBIDRAFT_06g032320 [Sorghum bicolor]
Length = 467
Score = 306 bits (784), Expect = 8e-81, Method: Compositional matrix adjust.
Identities = 148/315 (46%), Positives = 216/315 (68%), Gaps = 11/315 (3%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDEL-EKEMRLKIFKENLEYIEKAN-KEGNRTYKLGTNQFS 97
E V ++E W+ +HGR + L E + R ++F +NL +++ N + G ++LG NQF+
Sbjct: 49 EAEVRAMYELWLVEHGRRVSNVLGEHDSRFRVFWDNLRFVDAHNERAGEHGFRLGMNQFA 108
Query: 98 DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKEC 157
DLTNDEFRA Y G ++P+ RS + Y++ ++P S+DWR+KGAV P+KNQ +C
Sbjct: 109 DLTNDEFRAAYLGARIPAA--RSGNAVGEMYRHDGAEELPESVDWREKGAVAPVKNQGQC 166
Query: 158 GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQG 216
G CWAF+AV++VE I +I +G ++ LSEQ+L++CST+ GN+GC GG + AF +II+N G
Sbjct: 167 GSCWAFSAVSSVESINQIVTGEMVTLSEQELVECSTDGGNSGCNGGLMDAAFNFIIKNGG 226
Query: 217 IATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYST 275
I TED+YPY+AV G C ++ A I +E+VP DE++L KAV+ QPVS+AI A
Sbjct: 227 IDTEDDYPYKAVDGKCDINRRNAKVVSIDAFEDVPENDEKSLQKAVAHQPVSVAIEAGGR 286
Query: 276 EFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-- 333
+FQ YK G+F+G C T LDH V VG+G TE+G +YW+++NSWG WG+AGY+++ R+
Sbjct: 287 QFQLYKSGVFSGSCTTNLDHGVVAVGYG-TENGKDYWIVRNSWGPKWGEAGYIRMERNIN 345
Query: 334 --EGLCGIGTRSSYP 346
G CGI +SYP
Sbjct: 346 ATTGKCGIAMMASYP 360
>gi|162459393|ref|NP_001105993.1| cysteine protease component of protease-inhibitor complex precursor
[Zea mays]
gi|6682829|dbj|BAA88898.1| cysteine protease component of protease-inhibitor complex [Zea
mays]
Length = 465
Score = 306 bits (784), Expect = 9e-81, Method: Compositional matrix adjust.
Identities = 151/324 (46%), Positives = 211/324 (65%), Gaps = 12/324 (3%)
Query: 32 VVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN---KEGNRT 88
+VS ++ ++ +WMA HGR+Y E+E R ++F++NL YI+ N G +
Sbjct: 29 IVSYGERSDEEARRMYAEWMAAHGRTYNAVGEEERRYQVFRDNLRYIDAHNAAADAGVHS 88
Query: 89 YKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAV 148
++LG N+F+DLTNDE+RA Y G + R + +Y D+P S+DWR KGAV
Sbjct: 89 FRLGLNRFADLTNDEYRATYLGARTRPQRERKLGA---RYHAADNEDLPESVDWRAKGAV 145
Query: 149 TPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAF 208
+K+Q G CWAF+ +AAVEGI +I +G+LI LSEQ+L+DC T+ N GC GG + AF
Sbjct: 146 AEVKDQGSYGSCWAFSTIAAVEGINQIVTGDLISLSEQELVDCDTSYNQGCNGGLMDYAF 205
Query: 209 AYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVS 267
+II N GI TE +YPY+ G C +K A I +YE+VP+ DE++L KAV+ QPVS
Sbjct: 206 EFIINNGGIDTEKDYPYKGTDGRCDVNRKNAKVVTIDSYEDVPANDEKSLQKAVANQPVS 265
Query: 268 IAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGY 327
+AI A T+FQ Y GIF G CGT LDH VT VG+G TE+G +YW++KNSWG++WG++GY
Sbjct: 266 VAIEAAGTQFQLYSSGIFTGSCGTALDHGVTAVGYG-TENGKDYWIVKNSWGSSWGESGY 324
Query: 328 MKIVRD----EGLCGIGTRSSYPL 347
+++ R+ G CGI SYPL
Sbjct: 325 VRMERNIKASSGKCGIAVEPSYPL 348
>gi|2463584|dbj|BAA22544.1| FBSB precursor [Ananas comosus]
Length = 356
Score = 306 bits (784), Expect = 9e-81, Method: Compositional matrix adjust.
Identities = 152/334 (45%), Positives = 209/334 (62%), Gaps = 11/334 (3%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
+F+ + L V AS +S +++ E+WM ++GR YKD EK R +IFK N+ +
Sbjct: 8 VFLFLFLCVMWASPSAASADEPSDPMMKRFEEWMVEYGRVYKDNDEKMRRFQIFKNNVNH 67
Query: 78 IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTG-YKMPSPSHRSTTSSTFKYQNLSMTDV 136
IE N +Y LG NQF+D+TN+EF A YTG P R S + ++ ++ V
Sbjct: 68 IETFNSRNENSYTLGINQFTDMTNNEFIAQYTGGISRPLNIEREPVVS---FDDVDISAV 124
Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGN 196
P S+DWRD GAVT +KNQ CG CWAFAA+A VE I KI+ G L LSEQQ+LDC+
Sbjct: 125 PQSIDWRDYGAVTSVKNQNPCGACWAFAAIATVESIYKIKKGILEPLSEQQVLDCAKG-- 182
Query: 197 NGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQA 256
GC GG +AF +II N+G+A+ YPY+A GTC P +A I+ Y VP +E +
Sbjct: 183 YGCKGGWEFRAFEFIISNKGVASGAIYPYKAAKGTCKTNGVPNSAYITGYARVPRNNESS 242
Query: 257 LLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKN 316
++ AVS QP+++A+ A + FQ YK G+FNG CGT L+HAVT +G+G +G YW++KN
Sbjct: 243 MMYAVSKQPITVAVDA-NANFQYYKSGVFNGPCGTSLNHAVTAIGYGQDSNGKKYWIVKN 301
Query: 317 SWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
SWG WG+AGY+++ RD G+CGI S YP
Sbjct: 302 SWGARWGEAGYIRMARDVSSSSGICGIAIDSLYP 335
>gi|30141019|dbj|BAC75923.1| cysteine protease-1 [Helianthus annuus]
Length = 461
Score = 306 bits (784), Expect = 9e-81, Method: Compositional matrix adjust.
Identities = 151/350 (43%), Positives = 219/350 (62%), Gaps = 19/350 (5%)
Query: 15 TTPMFIIITLLVSCASQVVSSRSTH------------EQSVVEIHEKWMAQHGRSYKDEL 62
T F +I+++ + +++ +TH + V ++E W+ +HG++Y
Sbjct: 8 TLSFFALISIISAMDMSIINYDATHMSSSSSSAPLRTDDEVNALYESWLVKHGKTYNALG 67
Query: 63 EKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTT 122
EK+ R +IFK+NL +I++ N G+ TYKLG N+F+DLTN+E+R YTG K + +
Sbjct: 68 EKDRRFQIFKDNLRFIDEHN-SGDHTYKLGLNKFADLTNEEYRMTYTGIKTIDDKKKLSK 126
Query: 123 SSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQ 182
+ +Y S +P +DWR++GAVT +K+Q CG CWAF+ +VEG+ KI +G+LI
Sbjct: 127 MKSDRYAYRSGDSLPEYVDWREQGAVTDVKDQGSCGSCWAFSTTGSVEGVNKIVTGDLIS 186
Query: 183 LSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AA 241
+SEQ+L++C T+ N GC GG + AF +II+N GI TE++YPY G C +K A
Sbjct: 187 VSEQELVNCDTSYNQGCNGGLMDYAFEFIIKNGGIDTEEDYPYTGKDGKCDKNKKNAKVV 246
Query: 242 KISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVG 301
I +YE+VP DE +L KAVS QPV++AI A +FQ Y GIF G CGT LDH V G
Sbjct: 247 TIDSYEDVPVNDESSLKKAVSNQPVAVAIEAGGRDFQFYTSGIFTGSCGTALDHGVLAAG 306
Query: 302 FGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
+G TEDG +YWL+KNSWG WG+ GY+K+ R+ G CGI +SYP+
Sbjct: 307 YG-TEDGKDYWLVKNSWGAEWGEGGYLKMERNIADKSGKCGIAMEASYPI 355
>gi|238006338|gb|ACR34204.1| unknown [Zea mays]
Length = 465
Score = 306 bits (784), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 143/308 (46%), Positives = 208/308 (67%), Gaps = 8/308 (2%)
Query: 46 IHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN-KEGNRTYKLGTNQFSDLTNDEF 104
++E W+A+HGR+Y E++ R ++F +NL +++ N + ++LG NQF+DLTNDEF
Sbjct: 51 LYELWLAEHGRAYNALGERDRRFRVFWDNLRFVDAHNERAAEHGFRLGMNQFADLTNDEF 110
Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFA 164
RA Y G ++P+ R T ++P S+DWR+KGAV P+KNQ +CG CWAF+
Sbjct: 111 RAAYLGARIPASRRRGTAVGERYRHGGGAEELPESVDWREKGAVAPVKNQGQCGSCWAFS 170
Query: 165 AVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEY 223
AV++VE + +I +G ++ LSEQ+L++CST+ GN+GC GG + AF +II+N GI TE +Y
Sbjct: 171 AVSSVESVNQIVTGEMVTLSEQELVECSTDGGNSGCNGGLMDAAFDFIIKNGGIDTEGDY 230
Query: 224 PYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKE 282
PY+AV G C ++ A I +E+VP DE++L KAV+ QPVS+AI A EFQ YK
Sbjct: 231 PYKAVDGKCDINRENAKVVSIDGFEDVPENDEKSLQKAVAHQPVSVAIEAGGREFQLYKA 290
Query: 283 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCG 338
G+F G C T LDH V VG+G TE+G +YW+++NSWG WG+ GY+++ R+ G CG
Sbjct: 291 GVFTGTCTTNLDHGVVAVGYG-TENGKDYWIVRNSWGAKWGEDGYIRMERNVNATTGKCG 349
Query: 339 IGTRSSYP 346
I +SYP
Sbjct: 350 IAMMASYP 357
>gi|146215978|gb|ABQ10191.1| actinidin Act1c [Actinidia eriantha]
Length = 368
Score = 306 bits (784), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 158/340 (46%), Positives = 217/340 (63%), Gaps = 14/340 (4%)
Query: 13 INTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFK 72
++ + +F L++S A + ++ T+++ V ++E W+ +HG+SY E+E R +IFK
Sbjct: 8 VSMSLLFFSTLLILSLA---LDAKRTNDE-VKAMYESWLIKHGKSYNSLGERERRFEIFK 63
Query: 73 ENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
E L +I++ N + +R+YK+G NQF+DLTN+EFR+ Y G+ S + T + +Y+
Sbjct: 64 ETLRFIDEHNADTSRSYKVGLNQFADLTNEEFRSTYLGFTRGS----NKTKVSNRYEPRV 119
Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
+P +DWR +GAV IKNQ +CG CWAF+A+AAVEGI KI +GNLI LSEQ+L+DC
Sbjct: 120 GQVLPDYVDWRSEGAVVDIKNQGQCGSCWAFSAIAAVEGINKIVTGNLISLSEQELVDCG 179
Query: 193 -TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSA-AQKPAAAKISNYEEVP 250
T GC GG F +II N GI TE+ YPY A G C Q I NYE VP
Sbjct: 180 RTQSTKGCDGGYMTDGFEFIINNGGINTEENYPYTAQEGQCDLNLQNEKYVTIDNYENVP 239
Query: 251 SGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGAN 310
+E AL AV+ QPVS+A+ + FQ Y GIF G CGT DHAVTIVG+G TE G +
Sbjct: 240 YYNEWALQTAVAYQPVSVALESAGDAFQHYSSGIFTGPCGTATDHAVTIVGYG-TEGGID 298
Query: 311 YWLIKNSWGNTWGDAGYMKIVRD---EGLCGIGTRSSYPL 347
YW++KNSW TWG+ GYM+I+R+ G CGI T SYP+
Sbjct: 299 YWIVKNSWDTTWGEEGYMRILRNVGGAGTCGIATMPSYPV 338
>gi|226501480|ref|NP_001150266.1| cysteine protease 1 precursor [Zea mays]
gi|195637948|gb|ACG38442.1| cysteine protease 1 precursor [Zea mays]
Length = 462
Score = 306 bits (784), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 143/308 (46%), Positives = 209/308 (67%), Gaps = 8/308 (2%)
Query: 46 IHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN-KEGNRTYKLGTNQFSDLTNDEF 104
++E W+A+HGR+Y E++ R ++F +NL +++ N + ++LG NQF+DLTNDEF
Sbjct: 48 LYELWLAEHGRAYNALGERDRRFRVFWDNLRFVDAHNERAAEHGFRLGMNQFADLTNDEF 107
Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFA 164
RA Y G ++P+ R T ++P S+DWR+KGAV P+KNQ +CG CWAF+
Sbjct: 108 RAAYLGARIPAARRRGTAVGERYRHGGGAEELPESVDWREKGAVAPVKNQGQCGSCWAFS 167
Query: 165 AVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEY 223
AV++VE + +I +G ++ LSEQ+L++CST+ GN+GC GG + AF +II+N GI TE +Y
Sbjct: 168 AVSSVESVNQIVTGEMVTLSEQELVECSTDGGNSGCNGGLMDAAFDFIIKNGGIDTEGDY 227
Query: 224 PYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKE 282
PY+AV G C ++ A I +E+VP DE++L KAV+ QPVS+AI A EFQ YK
Sbjct: 228 PYKAVDGKCDINRENAKVVSIDGFEDVPENDEKSLQKAVAHQPVSVAIEAGGREFQLYKA 287
Query: 283 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCG 338
G+F+G C T LDH V VG+G TE+G +YW+++NSWG WG+ GY+++ R+ G CG
Sbjct: 288 GVFSGTCTTNLDHGVVAVGYG-TENGKDYWIVRNSWGAKWGEDGYIRMERNVNATTGKCG 346
Query: 339 IGTRSSYP 346
I +SYP
Sbjct: 347 IAMMASYP 354
>gi|225438807|ref|XP_002283263.1| PREDICTED: germination-specific cysteine protease 1-like isoform 1
[Vitis vinifera]
Length = 374
Score = 306 bits (783), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 146/315 (46%), Positives = 214/315 (67%), Gaps = 9/315 (2%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
E+ V+ +++ WMA+HG++Y EKE R +IFK+NL++I++ N + NRTYK+G N+F+DL
Sbjct: 39 EEEVMGMYQWWMAKHGKAYNGLGEKEKRFEIFKDNLKFIDEHNAQ-NRTYKVGLNRFADL 97
Query: 100 TNDEFRALYTGYKM-PSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECG 158
TN+E+RA+Y G + P +++ +Y + +P S+DWR+ GAV P+K+Q+ CG
Sbjct: 98 TNEEYRAIYLGTRSDPKRRFAKLKNASPRYAVMPGEVLPESVDWRETGAVNPVKDQRSCG 157
Query: 159 CCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIA 218
CWAF+ VAAVEGI +I +G LI LSEQ+L+DC T + GC GG + AF +II+N G+
Sbjct: 158 SCWAFSTVAAVEGINQIVTGELISLSEQELVDCDTEYDMGCNGGLMDYAFDFIIKNGGLD 217
Query: 219 TEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEF 277
TE +YPY G C+ + K + I YE+VP DE+AL KAV+ QPVS+A+ A
Sbjct: 218 TEKDYPYTGFDGECNLSGKSSKVVSIDGYEDVPPFDEKALQKAVAHQPVSVAVEAGGRAL 277
Query: 278 QSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD---- 333
Q Y GIF G CGT LDH + VG+G TE+G +YW+++NSWG++WG+ GY+++ R+
Sbjct: 278 QLYVSGIFTGECGTALDHGIVAVGYG-TENGTDYWIVRNSWGSSWGENGYIRMERNMADA 336
Query: 334 -EGLCGIGTRSSYPL 347
G CGI +SYP+
Sbjct: 337 FSGKCGIAMEASYPI 351
>gi|414584879|tpg|DAA35450.1| TPA: cysteine protease 1 [Zea mays]
Length = 522
Score = 306 bits (783), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 143/308 (46%), Positives = 208/308 (67%), Gaps = 8/308 (2%)
Query: 46 IHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN-KEGNRTYKLGTNQFSDLTNDEF 104
++E W+A+HGR+Y E++ R ++F +NL +++ N + ++LG NQF+DLTNDEF
Sbjct: 108 LYELWLAEHGRAYNALGERDRRFRVFWDNLRFVDAHNERAAEHGFRLGMNQFADLTNDEF 167
Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFA 164
RA Y G ++P+ R T ++P S+DWR+KGAV P+KNQ +CG CWAF+
Sbjct: 168 RAAYLGARIPASRRRGTAVGERYRHGGGAEELPESVDWREKGAVAPVKNQGQCGSCWAFS 227
Query: 165 AVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEY 223
AV++VE + +I +G ++ LSEQ+L++CST+ GN+GC GG + AF +II+N GI TE +Y
Sbjct: 228 AVSSVESVNQIVTGEMVTLSEQELVECSTDGGNSGCNGGLMDAAFDFIIKNGGIDTEGDY 287
Query: 224 PYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKE 282
PY+AV G C ++ A I +E+VP DE++L KAV+ QPVS+AI A EFQ YK
Sbjct: 288 PYKAVDGKCDINRENAKVVSIDGFEDVPENDEKSLQKAVAHQPVSVAIEAGGREFQLYKA 347
Query: 283 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCG 338
G+F G C T LDH V VG+G TE+G +YW+++NSWG WG+ GY+++ R+ G CG
Sbjct: 348 GVFTGTCTTNLDHGVVAVGYG-TENGKDYWIVRNSWGAKWGEDGYIRMERNVNATTGKCG 406
Query: 339 IGTRSSYP 346
I +SYP
Sbjct: 407 IAMMASYP 414
>gi|125533982|gb|EAY80530.1| hypothetical protein OsI_35710 [Oryza sativa Indica Group]
Length = 378
Score = 305 bits (782), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 146/330 (44%), Positives = 207/330 (62%), Gaps = 22/330 (6%)
Query: 38 THEQSVVEIHEKWMAQH--------GRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTY 89
+ E+S+ ++E+W +++ G D+ E R +F EN YI +AN+ G R +
Sbjct: 33 SSEESLRALYERWRSRYTVSRPAASGGVGNDDGEARRRFNVFVENARYIHEANRRGGRPF 92
Query: 90 KLGTNQFSDLTNDEFRALYTGYKMPSPSHRS------TTSSTFKYQNLSMTDVPTSLDWR 143
+L N+F+D+T DEFR Y G + + HRS +F+Y ++P ++DWR
Sbjct: 93 RLALNKFADMTTDEFRRTYAGSR--ARHHRSLRGGRGGEGGSFRYGGDDEDNLPPAVDWR 150
Query: 144 DKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGS 203
++GAVT IK+Q +CG CWAF+AVAAVEG+ KI++G L+ LSEQ+L+DC T N GC GG
Sbjct: 151 ERGAVTGIKDQGQCGSCWAFSAVAAVEGVNKIKTGRLVTLSEQELVDCDTGDNQGCDGGL 210
Query: 204 REKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAA-AKISNYEEVPSGDEQALLKAVS 262
+ AF +I +N GI TE YPY+A G C+ A+ + I YE+VP+ DE AL KAV+
Sbjct: 211 MDYAFQFIKRNGGITTESNYPYRAEQGRCNKAKASSHDVTIDGYEDVPANDESALQKAVA 270
Query: 263 MQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTW 322
QPV++A+ A +FQ Y EG+F G CGT LDH V VG+G T DG YW++KNSWG W
Sbjct: 271 NQPVAVAVEASGQDFQFYSEGVFTGECGTDLDHGVAAVGYGITRDGTKYWIVKNSWGEDW 330
Query: 323 GDAGYMKIVR-----DEGLCGIGTRSSYPL 347
G+ GY+++ R GLCGI +SYP+
Sbjct: 331 GERGYIRMQRGVSSDSNGLCGIAMEASYPV 360
>gi|75277440|sp|O23791.1|BROM1_ANACO RecName: Full=Fruit bromelain; AltName: Allergen=Ana c 2; Flags:
Precursor
gi|2342496|dbj|BAA21849.1| bromelain [Ananas comosus]
Length = 351
Score = 305 bits (782), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 146/333 (43%), Positives = 212/333 (63%), Gaps = 10/333 (3%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
+F+ + L AS +SR +++ E+WMA++GR YKD+ EK R +IFK N+++
Sbjct: 8 VFLFLFLCAMWASPSAASRDEPNDPMMKRFEEWMAEYGRVYKDDDEKMRRFQIFKNNVKH 67
Query: 78 IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
IE N +Y LG NQF+D+T EF A YTG +P R S + +++++ VP
Sbjct: 68 IETFNSRNENSYTLGINQFTDMTKSEFVAQYTGVSLPLNIEREPVVS---FDDVNISAVP 124
Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNN 197
S+DWRD GAV +KNQ CG CW+FAA+A VEGI KI++G L+ LSEQ++LDC+ +
Sbjct: 125 QSIDWRDYGAVNEVKNQNPCGSCWSFAAIATVEGIYKIKTGYLVSLSEQEVLDCAV--SY 182
Query: 198 GCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQAL 257
GC GG KA+ +II N G+ TE+ YPY A GTC+A P +A I+ Y V DE+++
Sbjct: 183 GCKGGWVNKAYDFIISNNGVTTEENYPYLAYQGTCNANSFPNSAYITGYSYVRRNDERSM 242
Query: 258 LKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNS 317
+ AVS QP++ I A S FQ Y G+F+G CGT L+HA+TI+G+G G YW+++NS
Sbjct: 243 MYAVSNQPIAALIDA-SENFQYYNGGVFSGPCGTSLNHAITIIGYGQDSSGTKYWIVRNS 301
Query: 318 WGNTWGDAGYMKIVR----DEGLCGIGTRSSYP 346
WG++WG+ GY+++ R G+CGI +P
Sbjct: 302 WGSSWGEGGYVRMARGVSSSSGVCGIAMAPLFP 334
>gi|363807062|ref|NP_001242584.1| uncharacterized protein LOC100804015 precursor [Glycine max]
gi|255640677|gb|ACU20623.1| unknown [Glycine max]
Length = 366
Score = 305 bits (782), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 150/338 (44%), Positives = 212/338 (62%), Gaps = 9/338 (2%)
Query: 16 TPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENL 75
T +F+ TL + + + + + +E V+ ++E+W+ +H + Y + +K+ R ++FK+NL
Sbjct: 9 TLLFLSFTLSYAIKTSTIINYTDNE--VMAMYEEWLVRHQKGYNELGKKDKRFQVFKDNL 66
Query: 76 EYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD 135
+I++ N N TYKLG N+F+D+TN+E+RA+Y G K + T ST S D
Sbjct: 67 GFIQEHNNNLNNTYKLGLNKFADMTNEEYRAMYLGTKSNAKRRLMKTKSTGHRYAFSARD 126
Query: 136 -VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN 194
+P +DWR KGAV PIK+Q CG CWAF+ VA VE I KI +G + LSEQ+L+DC
Sbjct: 127 RLPVHVDWRMKGAVAPIKDQGSCGSCWAFSTVATVEAINKIVTGKFVSLSEQELVDCDRA 186
Query: 195 GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGD 253
N GC GG + AF +IIQN GI T+ +YPY+ G C +K A I YE+VP D
Sbjct: 187 YNEGCNGGLMDYAFEFIIQNGGIDTDKDYPYRGFDGICDPTKKNAKVVNIDGYEDVPPYD 246
Query: 254 EQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWL 313
E AL KAV+ QPVS+AI A Q Y+ G+F G CGT LDH V +VG+G +E+G +YWL
Sbjct: 247 ENALKKAVAHQPVSVAIEASGRALQLYQSGVFTGKCGTSLDHGVVVVGYG-SENGVDYWL 305
Query: 314 IKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
++NSWG WG+ GY K+ R+ G CGI +SYP+
Sbjct: 306 VRNSWGTGWGEDGYFKMQRNVRTSTGKCGITMEASYPV 343
>gi|255555337|ref|XP_002518705.1| cysteine protease, putative [Ricinus communis]
gi|223542086|gb|EEF43630.1| cysteine protease, putative [Ricinus communis]
Length = 471
Score = 305 bits (782), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 148/313 (47%), Positives = 210/313 (67%), Gaps = 9/313 (2%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
+ V ++E W+ +HG++Y EKE R +IFK+NL +I++ N +R+YK+G N+F+DL
Sbjct: 44 DSQVRRMYEMWLVEHGKAYNALGEKEKRFEIFKDNLRFIDEHNSV-DRSYKVGLNRFADL 102
Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGC 159
TN+E++A++ G KM + T S +Y D+P ++DWR+KGAV P+K+Q +CG
Sbjct: 103 TNEEYKAMFLGTKMERKNRFLGTRSQ-RYLFKDGDDLPENVDWREKGAVVPVKDQGQCGS 161
Query: 160 CWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIAT 219
CWAF+ V AVEGI +I +G LI LSEQ+L+DC + N GC GG + AF +II N GI T
Sbjct: 162 CWAFSTVGAVEGINQIVTGELISLSEQELVDCDKSYNQGCNGGLMDYAFEFIINNGGIDT 221
Query: 220 EDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQ 278
E++YPY+A C +K A I YE+VP DE +L KAV+ QPVS+AI A FQ
Sbjct: 222 EEDYPYKASDNICDPNRKNAKVVTIDGYEDVPENDENSLKKAVAHQPVSVAIEAGGRAFQ 281
Query: 279 SYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----- 333
YK G+F G CGT+LDH V VG+G TE+G NYW+++NSWG+ WG++GY+++ R+
Sbjct: 282 LYKSGVFTGRCGTELDHGVVAVGYG-TENGVNYWIVRNSWGSAWGESGYIRMERNVANTK 340
Query: 334 EGLCGIGTRSSYP 346
G CGI + SYP
Sbjct: 341 TGKCGIAIQPSYP 353
>gi|224131910|ref|XP_002328138.1| predicted protein [Populus trichocarpa]
gi|222837653|gb|EEE76018.1| predicted protein [Populus trichocarpa]
Length = 349
Score = 305 bits (782), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 148/309 (47%), Positives = 210/309 (67%), Gaps = 11/309 (3%)
Query: 43 VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTND 102
+VE+ E W++ HG++Y EK R ++FKENL++I++ NKE +Y LG N+F+DL+++
Sbjct: 43 LVELFESWISGHGKAYNSLEEKLHRFEVFKENLKHIDQRNKEVT-SYWLGLNEFADLSHE 101
Query: 103 EFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWA 162
EF++ + G P R +S F Y+++ D+P S+DWR KGAVTP+KNQ CG CWA
Sbjct: 102 EFKSKFLGLYPEFP--RKKSSEDFSYRDV--VDLPKSIDWRKKGAVTPVKNQGSCGSCWA 157
Query: 163 FAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDE 222
F+ VAAVEGI +I +GNL LSEQQL+DC T+ NNGC GG + AF +I+ N G+ E++
Sbjct: 158 FSTVAAVEGINQIVAGNLTSLSEQQLIDCDTSFNNGCNGGLMDYAFEFIVNNGGLHKEED 217
Query: 223 YPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYK 281
YPY GTC ++ IS Y +VP DEQ+LLKA++ QP+S+AI A +FQ Y
Sbjct: 218 YPYLMEEGTCDEKREEMEVVTISGYHDVPRNDEQSLLKALAHQPLSVAIDASGRDFQFYS 277
Query: 282 EGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLC 337
G+F+G CGT LDH V VG+G++ G +Y ++KNSWG WG+ GY+++ R+ EGLC
Sbjct: 278 GGVFSGPCGTDLDHGVAAVGYGSSS-GIDYIIVKNSWGPKWGERGYLRMKRNTGKPEGLC 336
Query: 338 GIGTRSSYP 346
GI +SYP
Sbjct: 337 GINKMASYP 345
>gi|28192373|gb|AAK07730.1| CPR1-like cysteine proteinase [Nicotiana tabacum]
Length = 374
Score = 305 bits (781), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 153/318 (48%), Positives = 206/318 (64%), Gaps = 13/318 (4%)
Query: 38 THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
+ E V +E W+A+HGR+Y EKE R +IFK+NL +IE+ N GNRTYK+G NQF+
Sbjct: 41 SDEDQVKNRYEMWLAEHGRAYNALGEKEKRFEIFKDNLRFIEEHNNSGNRTYKVGLNQFA 100
Query: 98 DLTNDEFRALYTGYKMPSPSHRSTTSS---TFKYQNLSMTDVPTSLDWRDKGAVTPIKNQ 154
DLTN+E+R +Y G K S + R S + +Y + +P S+DWR +GAV PIKNQ
Sbjct: 101 DLTNEEYRTMYLGTK--SDARRRFVKSKNPSQRYASRPNELMPHSVDWRKRGAVAPIKNQ 158
Query: 155 KECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQN 214
CG CWAF+ VAAV GI +I +G +I LSEQ+L+DC N+GC GG + AF +II N
Sbjct: 159 GSCGSCWAFSTVAAVGGINQIVTGEMITLSEQELVDCDRVQNSGCNGGLMDYAFEFIISN 218
Query: 215 QGIATEDEYPYQAVPGTCSAAQKP-AAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAY 273
G+ TE YPY+ V G C +K I YE+VP +E+AL KAV+ QPV +AI A
Sbjct: 219 GGMDTEKHYPYRGVEGRCDPVRKNYKVVSIDGYEDVPR-NERALQKAVAHQPVCVAIEAS 277
Query: 274 STEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD 333
FQ Y G+F G CG ++DH V +VG+G +EDG +YW+++NSWG WG+ GY+K+ R+
Sbjct: 278 GRAFQLYSSGVFTGECGEEVDHGVVVVGYG-SEDGVDYWIVRNSWGTKWGENGYVKMERN 336
Query: 334 E-----GLCGIGTRSSYP 346
G CGI T +SYP
Sbjct: 337 VKKSHLGKCGIMTEASYP 354
>gi|357124027|ref|XP_003563708.1| PREDICTED: germination-specific cysteine protease 1-like
[Brachypodium distachyon]
Length = 334
Score = 305 bits (781), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 148/322 (45%), Positives = 206/322 (63%), Gaps = 13/322 (4%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE----GNRTYKLGTNQ 95
++++ E +EKWMA+ GR+YKD EK R ++FK N +I+ N G KL TN+
Sbjct: 13 DKAMRERYEKWMAEQGRTYKDSTEKARRFEVFKSNAHFIDSHNAATGPGGKSRPKLTTNK 72
Query: 96 FSDLTNDEFRALY-TGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQ 154
F+DLT DEFR +Y TG+++ T + FK+ +S++DVP S+DWR +GAVT +K+Q
Sbjct: 73 FADLTEDEFRNIYVTGHRVNYRPTSLVTDTVFKFGAVSLSDVPPSIDWRARGAVTSVKDQ 132
Query: 155 KECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQN 214
C CCWAF++ AAVEGI +I +GN + LS QQL+DCS N C G +KA+ YI ++
Sbjct: 133 HLCACCWAFSSAAAVEGIHQITTGNQVSLSVQQLVDCSNAANEKCKAGEIDKAYEYIARS 192
Query: 215 QGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYS 274
G+ + +YPY+ GTC K A A+IS ++ VP+ +E ALL AV+ QPVS+A+ S
Sbjct: 193 GGLVADQDYPYEGHSGTCRVYGKQAVARISGFQYVPARNETALLLAVAHQPVSVALDGLS 252
Query: 275 TEFQSYKEGIFNGV---CGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIV 331
Q GIF C T L+HA+TIVG+GT E G YWL+KNSWG+ WGD GY+K
Sbjct: 253 RALQHIGTGIFGSAGEPCTTNLNHAMTIVGYGTDEHGTRYWLMKNSWGSDWGDKGYVKFA 312
Query: 332 RD-----EGLCGIGTRSSYPLA 348
RD G+CG+ +SYP+A
Sbjct: 313 RDVASEINGVCGLALEASYPVA 334
>gi|109390302|gb|ABG33750.1| cysteine protease [Hevea brasiliensis]
Length = 457
Score = 305 bits (781), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 150/343 (43%), Positives = 220/343 (64%), Gaps = 16/343 (4%)
Query: 18 MFIIITLLVSCASQVVSSRSTH--------EQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+F TL + ++S +H + V+ I+E W+ +HG++Y EKE R +
Sbjct: 5 LFFASTLSSASDLSIISYDQSHGTKSSWRTDDEVMAIYEDWLVKHGKAYNSLGEKERRFE 64
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQ 129
+FK+NL +I++ N E NRTY++G N+F+DLTN+E+R++Y G + + +Y
Sbjct: 65 VFKDNLRFIDEHNSE-NRTYRVGLNRFADLTNEEYRSMYLG-ALSGIRRNKLRKISDRYT 122
Query: 130 NLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLL 189
+P S+DWR +GAV +K+Q CG CWAF+AVAAVEGI KI +G+LI LSEQ+L+
Sbjct: 123 PRVGDSLPDSVDWRKEGAVVGVKDQGSCGSCWAFSAVAAVEGINKIVTGDLISLSEQELV 182
Query: 190 DCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEE 248
DC + N GC GG + F +II N GI +E++YPY A G C +K A I +YE+
Sbjct: 183 DCDNSYNEGCNGGLMDYGFEFIINNGGIDSEEDYPYLARDGRCDTYRKNARVVSIDSYED 242
Query: 249 VPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDG 308
VP +E AL KAV+ QPVS+AI A +FQ Y G+F+G CGT LDH V VG+G TE+G
Sbjct: 243 VPVNNEAALQKAVANQPVSVAIEAGGRDFQLYSSGVFSGRCGTALDHGVVAVGYG-TENG 301
Query: 309 ANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
+YW+++NSWG +WG++GY+++ R+ G+CGI +SYP+
Sbjct: 302 QDYWIVRNSWGKSWGESGYLRMARNIRKPTGICGIAMEASYPI 344
>gi|297791625|ref|XP_002863697.1| hypothetical protein ARALYDRAFT_917391 [Arabidopsis lyrata subsp.
lyrata]
gi|297309532|gb|EFH39956.1| hypothetical protein ARALYDRAFT_917391 [Arabidopsis lyrata subsp.
lyrata]
Length = 463
Score = 305 bits (781), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 161/357 (45%), Positives = 230/357 (64%), Gaps = 22/357 (6%)
Query: 1 MVLIFERSG-SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYK 59
M+L+ G S+ I+ + II+ + VSSRS E V I+E WM +HG+
Sbjct: 9 MILLLAMIGVSYAIDMS----IISYDENHHISTVSSRSDAE--VERIYEAWMVEHGKKKM 62
Query: 60 DE----LEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPS 115
++ EK+ R +IFK+NL YI++ N + N +YKLG +F+DLTNDE+R++Y G K
Sbjct: 63 NQNGLGAEKDQRFEIFKDNLRYIDEHNTK-NLSYKLGLTRFADLTNDEYRSMYLGAK--- 118
Query: 116 PSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKI 175
P R +S +Y+ +P S+DWR +GAV +K+Q CG CWAF+ + AVEGI KI
Sbjct: 119 PVKRVLKTSD-RYEARVGDALPDSVDWRKEGAVADVKDQGSCGSCWAFSTIGAVEGINKI 177
Query: 176 RSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAA 235
+G+LI LSEQ+L+DC T+ N GC GG + AF +II+N GI TE +YPY+A G C
Sbjct: 178 VTGDLISLSEQELVDCDTSYNQGCNGGLMDYAFEFIIKNGGIDTEADYPYKAADGRCDQN 237
Query: 236 QKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLD 294
+K A I +YE+VP E +L KA++ QP+S+AI A FQ Y G+F+G+CGT+LD
Sbjct: 238 RKNAKVVTIDSYEDVPENSEASLKKALAHQPISVAIEAGGRAFQLYSSGVFDGICGTELD 297
Query: 295 HAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
H V VG+G TE+G +YW+++NSWGN WG++GY+K+ R+ G CGI +SYP+
Sbjct: 298 HGVVAVGYG-TENGKDYWIVRNSWGNRWGESGYIKMARNIAEPTGKCGIAMEASYPI 353
>gi|50355617|dbj|BAD29957.1| cysteine protease [Daucus carota]
Length = 437
Score = 305 bits (781), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 144/313 (46%), Positives = 210/313 (67%), Gaps = 6/313 (1%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
+ V+ ++ W+ +HG+SY EKE R +IFK+NL YI+ N + +R+Y+LG N+F+DL
Sbjct: 42 DDEVMTMYNSWLVKHGKSYNALGEKETRFQIFKDNLRYIDNHNADPDRSYELGLNRFADL 101
Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGC 159
TN+E+RA Y G K + + + +Y + ++P S+DWR+KGAV +K+Q CG
Sbjct: 102 TNEEYRAKYLGTKSRESRPKLSKGPSDRYAPVEGEELPDSIDWREKGAVAAVKDQGSCGS 161
Query: 160 CWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIAT 219
CWAF+A+ AVEGI +I +G LI LSEQ+L+DC + N GC GG + AF +II+N GI +
Sbjct: 162 CWAFSAIGAVEGINQITTGELITLSEQELVDCDRSYNEGCEGGLMDYAFNFIIKNGGIDS 221
Query: 220 EDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQ 278
+ +YPY GTC+ ++ A I +YE+VP DE+AL KA + QP+S+AI A +FQ
Sbjct: 222 DLDYPYTGRDGTCNQNKENAKVVTIDSYEDVPVYDEKALQKAAANQPISVAIEAGGMDFQ 281
Query: 279 SYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----E 334
Y GIF G CGT +DH V +VG+G +E+G +YW+++NSWG WG+AGY+K+ R+
Sbjct: 282 LYVSGIFTGKCGTAVDHGVVVVGYG-SEEGMDYWIVRNSWGAAWGEAGYLKMQRNVGKSS 340
Query: 335 GLCGIGTRSSYPL 347
GLCGI SYP+
Sbjct: 341 GLCGITIEPSYPV 353
>gi|162463464|ref|NP_001104879.1| cysteine proteinase Mir3 precursor [Zea mays]
gi|2425066|gb|AAB88263.1| cysteine proteinase Mir3 [Zea mays]
Length = 480
Score = 305 bits (781), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 150/324 (46%), Positives = 210/324 (64%), Gaps = 12/324 (3%)
Query: 32 VVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN---KEGNRT 88
+VS ++ ++ +WMA HGR+Y +E R ++F++NL YI+ N G +
Sbjct: 29 IVSYGERTDEEARRMYAEWMAAHGRTYNAVGAEERRYQVFRDNLRYIDAHNAAADAGVHS 88
Query: 89 YKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAV 148
++LG N+F+DLTNDE+ A Y G + R + +Y D+P S+DWR KGAV
Sbjct: 89 FRLGLNRFADLTNDEYPATYLGARTRPQRDRKLGA---RYHAADNEDLPESVDWRAKGAV 145
Query: 149 TPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAF 208
+K+Q CG CWAF+ +AAVEGI +I +G+LI LSEQ+L+DC T+ N GC GG + AF
Sbjct: 146 AEVKDQGSCGTCWAFSTIAAVEGINQIVTGDLISLSEQELVDCDTSYNQGCNGGLMDYAF 205
Query: 209 AYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVS 267
+II N GI TE +YPY+ G C +K A I +YE+VP+ DE++L KAV+ QPVS
Sbjct: 206 EFIINNGGIDTEKDYPYKGTDGRCDVNRKNAKVVTIDSYEDVPANDEKSLQKAVANQPVS 265
Query: 268 IAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGY 327
+AI A T FQ Y GIF G CGT+LDH VT VG+G TE+G +YW++KNSWG++WG++GY
Sbjct: 266 VAIEAAGTAFQLYSSGIFTGSCGTRLDHGVTAVGYG-TENGKDYWIVKNSWGSSWGESGY 324
Query: 328 MKIVRD----EGLCGIGTRSSYPL 347
+++ R+ G CGI SYPL
Sbjct: 325 VRMERNIKASSGKCGIAVEPSYPL 348
>gi|225428328|ref|XP_002279940.1| PREDICTED: cysteine proteinase-like [Vitis vinifera]
Length = 707
Score = 305 bits (781), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 147/304 (48%), Positives = 210/304 (69%), Gaps = 10/304 (3%)
Query: 48 EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRAL 107
E W+++HG+ YK EK R ++F+ENL +I++ NKE + +Y LG N+F+DL+++EF++
Sbjct: 405 ESWVSKHGKVYKSMEEKLHRFEVFRENLNHIDERNKEVS-SYWLGLNEFADLSHEEFKSK 463
Query: 108 YTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVA 167
Y G + P R S F+Y++++ D+P S+DWR KGAVT +KNQ CG CWAF+ VA
Sbjct: 464 YLGLRAEFPRSRDY-SGEFRYRDVA--DLPESVDWRKKGAVTHVKNQGACGSCWAFSTVA 520
Query: 168 AVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQA 227
AVEGI +I +GNL LSEQ+L+DC T N+GC GG + AFA+I N G+ ED+YPY
Sbjct: 521 AVEGINQIVTGNLTTLSEQELIDCDTTFNSGCNGGLMDYAFAFIASNGGLHKEDDYPYLM 580
Query: 228 VPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFN 286
GTC ++ IS YE+VP DE++LLKA++ QP+S+AI A +FQ Y G+FN
Sbjct: 581 EEGTCEEQKEDVDIVTISGYEDVPEKDEESLLKALAHQPLSVAIEASGRDFQFYSGGVFN 640
Query: 287 GVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTR 342
G CGT+LDH V VG+G+++ G +Y ++KNSWG WG+ GY+++ R+ EGLCGI
Sbjct: 641 GPCGTELDHGVAAVGYGSSK-GLDYIIVKNSWGPKWGEKGYIRMKRNTGKTEGLCGINKM 699
Query: 343 SSYP 346
+SYP
Sbjct: 700 ASYP 703
>gi|297602242|ref|NP_001052232.2| Os04g0203500 [Oryza sativa Japonica Group]
gi|255675217|dbj|BAF14146.2| Os04g0203500 [Oryza sativa Japonica Group]
Length = 336
Score = 305 bits (780), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 147/335 (43%), Positives = 210/335 (62%), Gaps = 10/335 (2%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
+F I+ L C++ + + + + ++ HE+WMAQ+GR YKD+ EK R ++FK N+ +
Sbjct: 8 LFAILGCLCLCSAVLAARELSDDAAMAARHERWMAQYGRMYKDDAEKARRFEVFKANVAF 67
Query: 78 IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
IE N GN + LG NQF+DLTNDEFR+ T + R T F+ +N+++ +P
Sbjct: 68 IESFN-AGNHKFWLGVNQFADLTNDEFRSTKTNKGFIPSTTRVPTG--FRNENVNIDALP 124
Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNN 197
++DWR KG VTPIK+Q +CGCCWAF+AVAA+EGI K+ +G LI S + L T +
Sbjct: 125 ATMDWRTKGVVTPIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLISHSLNKSL--LTVMSM 182
Query: 198 GCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQAL 257
GC GG + AF +II+N G+ TE YPY AV + + A I YE+VP+ +E AL
Sbjct: 183 GCEGGLMDDAFKFIIKNGGLTTESNYPYAAVDDKFKSVSN-SVASIKGYEDVPANNEAAL 241
Query: 258 LKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNS 317
+KAV+ QPVS+A+ FQ YK G+ G CGT LDH + +G+G DG YWL+KNS
Sbjct: 242 MKAVANQPVSVAVDGGDMTFQFYKGGVMTGSCGTDLDHGIVAIGYGKASDGTKYWLLKNS 301
Query: 318 WGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
WG TWG+ G++++ +D G+CG+ SYP A
Sbjct: 302 WGMTWGENGFLRMEKDISDKRGMCGLAMEPSYPTA 336
>gi|544129|sp|P25803.2|CYSEP_PHAVU RecName: Full=Vignain; AltName: Full=Bean endopeptidase; AltName:
Full=Cysteine proteinase EP-C1; Flags: Precursor
gi|20994|emb|CAA44816.1| endopeptidase [Phaseolus vulgaris]
Length = 362
Score = 305 bits (780), Expect = 3e-80, Method: Compositional matrix adjust.
Identities = 155/344 (45%), Positives = 211/344 (61%), Gaps = 14/344 (4%)
Query: 16 TPMFIIITLLVSCASQVVSSRSTH------EQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
T + + L S V +S H E+S+ +++E+W + H S + EK R
Sbjct: 3 TKKLLWVVLSFSLVLGVANSFDFHDKDLASEESLWDLYERWRSHHTVS-RSLGEKHKRFN 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPS-HRSTTSSTFKY 128
+FK NL ++ NK ++ YKL N+F+D+TN EFR+ Y G K+ P R T +
Sbjct: 62 VFKANLMHVHNTNKM-DKPYKLKLNKFADMTNHEFRSTYAGSKVNHPRMFRGTPHENGAF 120
Query: 129 QNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQL 188
+ VP S+DWR KGAVT +K+Q +CG CWAF+ V AVEGI +I++ L+ LSEQ+L
Sbjct: 121 MYEKVVSVPPSVDWRKKGAVTDVKDQGQCGSCWAFSTVVAVEGINQIKTNKLVALSEQEL 180
Query: 189 LDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQ-KPAAAKISNYE 247
+DC N GC GG E AF +I Q GI TE YPY+A GTC A++ A I +E
Sbjct: 181 VDCDKEENQGCNGGLMESAFEFIKQKGGITTESNYPYKAQEGTCDASKVNDLAVSIDGHE 240
Query: 248 EVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTED 307
VP+ DE ALLKAV+ QPVS+AI A ++FQ Y EG+F G C T L+H V IVG+GTT D
Sbjct: 241 NVPANDEDALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGDCSTDLNHGVAIVGYGTTVD 300
Query: 308 GANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
G NYW+++NSWG WG+ GY+++ R+ EGLCGI SYP+
Sbjct: 301 GTNYWIVRNSWGPEWGEHGYIRMQRNISKKEGLCGIAMLPSYPI 344
>gi|146216000|gb|ABQ10202.1| cysteine protease Cp4 [Actinidia deliciosa]
Length = 463
Score = 305 bits (780), Expect = 3e-80, Method: Compositional matrix adjust.
Identities = 149/339 (43%), Positives = 220/339 (64%), Gaps = 13/339 (3%)
Query: 18 MFIIITLLVSCASQVVSSRSTH-----EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFK 72
+F+ + ++S TH + + I+EKW+ HG++Y EKE R +IFK
Sbjct: 13 LFLCFAFSSALDMSIISYDQTHPPQRTDAEAMAIYEKWLTTHGKAYNAIGEKERRFEIFK 72
Query: 73 ENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
+NL ++++ N +Y++G N+F+DLTN+E+R+++ G M RS ++ + +Y +
Sbjct: 73 DNLRFVDEHNAVAG-SYRVGLNRFADLTNEEYRSMFLGGNMEM-KERSASTKSDRYAFRA 130
Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
+P S+DWR+KGAV+P+K+Q +CG CWAF+ ++AVEGI +I +G LI LSEQ+L+DC
Sbjct: 131 GDKLPGSVDWREKGAVSPVKDQGQCGSCWAFSTISAVEGINQIVTGELISLSEQELVDCD 190
Query: 193 TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPS 251
+ N GC GG + F +II N GI TE++YPY+AV GTC +K A I+ YE+VP
Sbjct: 191 KSYNMGCNGGLMDYGFQFIINNGGIDTEEDYPYRAVDGTCDQFRKNARVVSINGYEDVPE 250
Query: 252 GDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANY 311
DE +L KAV+ QPVS+AI A FQ Y+ G+F G CGT LDH V VG+G TE+G +Y
Sbjct: 251 DDENSLKKAVANQPVSVAIEAGGRAFQLYESGVFTGHCGTNLDHGVVAVGYG-TENGVDY 309
Query: 312 WLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
W ++NSWG WG+ GY+K+ R+ G CGI + +SYP
Sbjct: 310 WTVRNSWGPKWGENGYIKLERNINATSGKCGIASMASYP 348
>gi|115484973|ref|NP_001067630.1| Os11g0255300 [Oryza sativa Japonica Group]
gi|530335|emb|CAA56844.1| cysteine protease [Oryza sativa Japonica Group]
gi|5761322|dbj|BAA83472.1| cysteine endopeptidase [Oryza sativa Japonica Group]
gi|62732672|gb|AAX94791.1| cysteine proteinase (EC 3.4.22.-) - rice [Oryza sativa Japonica
Group]
gi|62732673|gb|AAX94792.1| cysteine proteinase (EC 3.4.22.-) - rice [Oryza sativa Japonica
Group]
gi|62732674|gb|AAX94793.1| cysteine proteinase (EC 3.4.22.-) - rice [Oryza sativa Japonica
Group]
gi|77549615|gb|ABA92412.1| Thiol protease SEN102 precursor, putative, expressed [Oryza sativa
Japonica Group]
gi|77549616|gb|ABA92413.1| Thiol protease SEN102 precursor, putative, expressed [Oryza sativa
Japonica Group]
gi|77549617|gb|ABA92414.1| Thiol protease SEN102 precursor, putative, expressed [Oryza sativa
Japonica Group]
gi|113644852|dbj|BAF27993.1| Os11g0255300 [Oryza sativa Japonica Group]
gi|125576789|gb|EAZ18011.1| hypothetical protein OsJ_33558 [Oryza sativa Japonica Group]
gi|215701098|dbj|BAG92522.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 378
Score = 304 bits (779), Expect = 3e-80, Method: Compositional matrix adjust.
Identities = 145/330 (43%), Positives = 207/330 (62%), Gaps = 22/330 (6%)
Query: 38 THEQSVVEIHEKWMAQH--------GRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTY 89
+ E+S+ ++E+W +++ G D+ E R +F EN YI +AN+ G R +
Sbjct: 33 SSEESLRALYERWRSRYTVSRPAASGGVGNDDGEARRRFNVFVENARYIHEANRRGGRPF 92
Query: 90 KLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSS------TFKYQNLSMTDVPTSLDWR 143
+L N+F+D+T DEFR Y G + + HRS + +F+Y ++P ++DWR
Sbjct: 93 RLALNKFADMTTDEFRRTYAGSR--ARHHRSLSGGRGGEGGSFRYGGDDEDNLPPAVDWR 150
Query: 144 DKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGS 203
++GAVT IK+Q +CG CWAF+ VAAVEG+ KI++G L+ LSEQ+L+DC T N GC GG
Sbjct: 151 ERGAVTGIKDQGQCGSCWAFSTVAAVEGVNKIKTGRLVTLSEQELVDCDTGDNQGCDGGL 210
Query: 204 REKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAA-AKISNYEEVPSGDEQALLKAVS 262
+ AF +I +N GI TE YPY+A G C+ A+ + I YE+VP+ DE AL KAV+
Sbjct: 211 MDYAFQFIKRNGGITTESNYPYRAEQGRCNKAKASSHDVTIDGYEDVPANDESALQKAVA 270
Query: 263 MQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTW 322
QPV++A+ A +FQ Y EG+F G CGT LDH V VG+G T DG YW++KNSWG W
Sbjct: 271 NQPVAVAVEASGQDFQFYSEGVFTGECGTDLDHGVAAVGYGITRDGTKYWIVKNSWGEDW 330
Query: 323 GDAGYMKIVR-----DEGLCGIGTRSSYPL 347
G+ GY+++ R GLCGI +SYP+
Sbjct: 331 GERGYIRMQRGVSSDSNGLCGIAMEASYPV 360
>gi|312282059|dbj|BAJ33895.1| unnamed protein product [Thellungiella halophila]
Length = 379
Score = 304 bits (779), Expect = 3e-80, Method: Compositional matrix adjust.
Identities = 146/308 (47%), Positives = 203/308 (65%), Gaps = 9/308 (2%)
Query: 46 IHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFR 105
I E W+ +HG+ Y EKE RL IFK+NL +I N E N Y+LG N+F+DL+ E++
Sbjct: 63 IFESWIVKHGKVYDSVAEKERRLTIFKDNLRFITNRNSE-NLGYRLGLNRFADLSLHEYK 121
Query: 106 ALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAA 165
+ G P + SS+ +Y+ + +P S+DWR++GAVT +K+Q C CWAF+
Sbjct: 122 EICHGADPKPPRNHVFMSSSDRYKTSAGDVLPKSVDWRNEGAVTEVKDQGHCRSCWAFST 181
Query: 166 VAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPY 225
V AVEG+ KI +G L+ LSEQ L++C+ NNGC GG E A+ +I+ N G+ T+++YPY
Sbjct: 182 VGAVEGLNKIVTGELVTLSEQDLINCNKE-NNGCGGGKVETAYEFIVSNGGLGTDNDYPY 240
Query: 226 QAVPGTCSAAQKP--AAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEG 283
+AV G C K I YE +P+ DE AL+KAV+ QPV+ I + S EFQ Y+ G
Sbjct: 241 KAVNGACDGRLKENIKNVMIDGYENLPANDELALMKAVAHQPVTAVIDSSSREFQLYESG 300
Query: 284 IFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGI 339
+F+G CGT L+H V +VG+G TE+G NYW+++NSWGNTWG+AGYMK+ R+ GLCGI
Sbjct: 301 VFDGRCGTNLNHGVVVVGYG-TENGRNYWIVRNSWGNTWGEAGYMKMARNIANPRGLCGI 359
Query: 340 GTRSSYPL 347
R SYPL
Sbjct: 360 AMRVSYPL 367
>gi|226496089|ref|NP_001149658.1| cysteine protease 1 precursor [Zea mays]
gi|195629242|gb|ACG36262.1| cysteine protease 1 precursor [Zea mays]
Length = 469
Score = 304 bits (779), Expect = 4e-80, Method: Compositional matrix adjust.
Identities = 149/324 (45%), Positives = 209/324 (64%), Gaps = 12/324 (3%)
Query: 32 VVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN---KEGNRT 88
+VS E+ ++ +WMA HGR+Y E+E R ++F++NL Y++ N G +
Sbjct: 31 IVSYGERSEEEARRMYAEWMAAHGRTYNAVGEEERRFEVFRDNLRYVDAHNAAADAGVHS 90
Query: 89 YKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAV 148
++LG N+F+DLTNDE+RA Y G + R +Y D+P S+DWR KGAV
Sbjct: 91 FRLGLNRFADLTNDEYRATYLGVRSRPQRERRLGD---RYLAGDNEDLPESVDWRAKGAV 147
Query: 149 TPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAF 208
IK+Q CG CWAF+ +AAVEGI +I +G++I LSEQ+L+DC T+ N GC GG + AF
Sbjct: 148 AEIKDQGSCGSCWAFSTIAAVEGINQIVTGDMISLSEQELVDCDTSYNQGCNGGLMDYAF 207
Query: 209 AYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVS 267
+II N GI TE++YPY+ G C +K A I +YE+VP+ E++L KAV+ QP+S
Sbjct: 208 EFIINNGGIDTEEDYPYKGTDGRCDVNRKNAKVVTIDSYEDVPANSEKSLQKAVANQPIS 267
Query: 268 IAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGY 327
+AI A FQ Y GIF G CGT LDH VT VG+G TE+G +YW++KNSWG++WG++GY
Sbjct: 268 VAIEAGGRAFQLYNSGIFTGTCGTALDHGVTAVGYG-TENGKDYWIVKNSWGSSWGESGY 326
Query: 328 MKIVRD----EGLCGIGTRSSYPL 347
+++ R+ G CGI SYPL
Sbjct: 327 VRMERNIKASSGKCGIAVEPSYPL 350
>gi|157093728|gb|ABV22590.1| KDEL-tailed cysteine endopeptidase [Solanum lycopersicum]
Length = 360
Score = 304 bits (779), Expect = 4e-80, Method: Compositional matrix adjust.
Identities = 150/318 (47%), Positives = 205/318 (64%), Gaps = 16/318 (5%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
E+ E++E+W + H S + EK R +FK N+ Y+ NK+ ++ YKL N+F+D+
Sbjct: 31 EEKFWELYERWRSHHTVSRSLD-EKHKRFNVFKANVHYVHNFNKK-DKPYKLKLNKFADM 88
Query: 100 TNDEFRALYTGYKMPSPSHR-----STTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQ 154
TN EFR Y G K+ HR S + TF Y N +VP S+DWR KGAVTP+K+Q
Sbjct: 89 TNHEFRQHYAGSKIKH--HRTLLGASRANGTFMYANED--NVPPSIDWRKKGAVTPVKDQ 144
Query: 155 KECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQN 214
+CG CWAF+ V AVEGI +I++ L+ LSEQ+L+DC T N GC GG + AF +I +
Sbjct: 145 GQCGSCWAFSTVVAVEGINQIKTKKLVSLSEQELVDCDTTENQGCNGGLMDPAFDFIKKR 204
Query: 215 QGIATEDEYPYQAVPGTCSAAQK-PAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAY 273
GI TE+ YPY+A C ++ I +E+VP DE ALLKAV+ QP+S+AI A
Sbjct: 205 GGITTEERYPYKAEDDKCDIQKRNTPVVSIDGHEDVPPNDEDALLKAVANQPISVAIDAS 264
Query: 274 STEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR- 332
++FQ Y EG+F G CGT+LDH V IVG+GTT DG YW++KNSWG WG+ GY+++ R
Sbjct: 265 GSQFQFYSEGVFTGECGTELDHGVAIVGYGTTVDGTKYWIVKNSWGAGWGEKGYIRMQRK 324
Query: 333 ---DEGLCGIGTRSSYPL 347
+EGLCGI + SYP+
Sbjct: 325 VDAEEGLCGIAMQPSYPI 342
>gi|182375363|gb|ACB87490.1| mucunain [Mucuna pruriens]
Length = 422
Score = 304 bits (778), Expect = 4e-80, Method: Compositional matrix adjust.
Identities = 146/311 (46%), Positives = 211/311 (67%), Gaps = 9/311 (2%)
Query: 44 VEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDE 103
+ ++E+W+ +HG++Y EK+ R IFK+NL +I+ N + NRTYKLG N+F+DLTN+E
Sbjct: 1 MSLYEQWLVKHGKAYNALGEKDKRFDIFKDNLRFIDDHNAD-NRTYKLGLNRFADLTNEE 59
Query: 104 FRALYTGYKM-PSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWA 162
+RA Y G ++ P+ T + + +Y ++P S+DWR++ AV P+K+Q CG CWA
Sbjct: 60 YRARYLGTRIDPNRRFVKTKTQSNRYAPRVGDNLPESVDWRNESAVLPVKDQGNCGSCWA 119
Query: 163 FAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDE 222
F+ + AVEGI KI +G+LI LSEQ+L+DC T+ N GC GG + A+ +II N GI +E++
Sbjct: 120 FSTIGAVEGINKIVTGDLISLSEQELVDCDTSYNQGCNGGLMDYAYEFIINNGGIDSEED 179
Query: 223 YPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYK 281
YPY+AV GTC +K A I +YE+VP+ DE AL KAV+ QPVS+AI EFQ Y
Sbjct: 180 YPYRAVDGTCDQYRKNAKVVTIDSYEDVPANDELALKKAVANQPVSVAIEGGGREFQLYV 239
Query: 282 EGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-----EGL 336
G+F G CGT LDH V VG+G+ + G +YW+++NSWG +WG+ GY+++ R+ G
Sbjct: 240 SGVFTGRCGTALDHGVVAVGYGSVK-GHDYWIVRNSWGASWGEEGYVRLERNLAKSRSGK 298
Query: 337 CGIGTRSSYPL 347
CGI SYP+
Sbjct: 299 CGIAIEPSYPI 309
>gi|413919736|gb|AFW59668.1| cysteine protease 1 [Zea mays]
Length = 469
Score = 303 bits (777), Expect = 6e-80, Method: Compositional matrix adjust.
Identities = 148/324 (45%), Positives = 209/324 (64%), Gaps = 12/324 (3%)
Query: 32 VVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN---KEGNRT 88
+VS E+ ++ +WMA HGR+Y E+E R ++F++NL Y++ N G +
Sbjct: 31 IVSYGERSEEEARRMYAEWMAAHGRTYNAVGEEERRFEVFRDNLRYVDAHNAAADAGVHS 90
Query: 89 YKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAV 148
++LG N+F+DLTNDE+RA Y G + R +Y D+P S+DWR KGAV
Sbjct: 91 FRLGLNRFADLTNDEYRATYLGVRSRPQRERRLGD---RYLAGDNEDLPESVDWRAKGAV 147
Query: 149 TPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAF 208
+K+Q CG CWAF+ +AAVEGI +I +G++I LSEQ+L+DC T+ N GC GG + AF
Sbjct: 148 AEVKDQGSCGSCWAFSTIAAVEGINQIVTGDMISLSEQELVDCDTSYNQGCNGGLMDYAF 207
Query: 209 AYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVS 267
+II N GI TE++YPY+ G C +K A I +YE+VP+ E++L KAV+ QP+S
Sbjct: 208 EFIINNGGIDTEEDYPYKGTDGRCDVNRKNAKVVTIDSYEDVPANSEKSLQKAVANQPIS 267
Query: 268 IAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGY 327
+AI A FQ Y GIF G CGT LDH VT VG+G TE+G +YW++KNSWG++WG++GY
Sbjct: 268 VAIEAGGRAFQLYNSGIFTGTCGTALDHGVTAVGYG-TENGKDYWIVKNSWGSSWGESGY 326
Query: 328 MKIVRD----EGLCGIGTRSSYPL 347
+++ R+ G CGI SYPL
Sbjct: 327 VRMERNIKASSGKCGIAVEPSYPL 350
>gi|148927394|gb|ABR19828.1| cysteine proteinase [Elaeis guineensis]
Length = 469
Score = 303 bits (777), Expect = 7e-80, Method: Compositional matrix adjust.
Identities = 148/317 (46%), Positives = 212/317 (66%), Gaps = 10/317 (3%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN---KEGNRTYKLGTNQF 96
+ V +++ W AQH RSY E E RL+IF++NL +I++ N G +++LG +F
Sbjct: 40 DDEVHRLYQAWKAQHARSYNALDEDEQRLEIFRDNLRFIDQHNAAANAGKYSFRLGLTRF 99
Query: 97 SDLTNDEFRALYTGYKMP-SPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQK 155
+DLTN+E+R+ Y G + S R++T + +Y+ S D+P S+DWRDKGAV +K+Q
Sbjct: 100 ADLTNEEYRSTYLGVRTAGSRRRRNSTVGSNRYRFRSSDDLPDSIDWRDKGAVVDVKDQG 159
Query: 156 ECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQ 215
CG CWAF+ +AAVEGI I +G+LI LSEQ+L+DC T N GC GG + AF +II N
Sbjct: 160 SCGSCWAFSTIAAVEGINHIVTGDLISLSEQELVDCDTYYNQGCNGGLMDYAFEFIISNG 219
Query: 216 GIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYS 274
GI T+++YPY G+C +K A I +YE+VP DE++L KAV+ QPVS+AI A
Sbjct: 220 GIDTDEDYPYTGRDGSCDQYRKNAHVVTIDSYEDVPINDEKSLQKAVANQPVSVAIEAGG 279
Query: 275 TEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD- 333
FQ Y+ GIF G CGT+LDH VT +G+G +E+G YW++KNSWG+ WG++GY+++ R+
Sbjct: 280 RAFQLYESGIFTGYCGTELDHGVTAIGYG-SENGKYYWIVKNSWGSDWGESGYIRMERNI 338
Query: 334 ---EGLCGIGTRSSYPL 347
G CGI +SYP+
Sbjct: 339 NSATGKCGIAMEASYPI 355
>gi|334904467|gb|AEH26024.1| cysteine peptidase [Ananas comosus]
Length = 352
Score = 303 bits (776), Expect = 8e-80, Method: Compositional matrix adjust.
Identities = 149/334 (44%), Positives = 210/334 (62%), Gaps = 11/334 (3%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
+F+ + L V AS +SR +++ E+WMA++GR YKD EK R +IFK N+ +
Sbjct: 8 VFLFLFLCVMWASPSAASRDEPSDPMMKRFEEWMAEYGRVYKDNDEKMRRFQIFKNNVNH 67
Query: 78 IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTG-YKMPSPSHRSTTSSTFKYQNLSMTDV 136
IE N +Y LG NQF+D+T EF A YTG P R S + +++++ V
Sbjct: 68 IETFNSHNGNSYTLGINQFTDMTKSEFVAQYTGGISRPLNIEREPVVS---FDDVNISAV 124
Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGN 196
P S+DWRD GAV +KNQ CG CWAFAA+A VEGI KI++G L+ LSEQ++LDC+ +
Sbjct: 125 PQSIDWRDYGAVNEVKNQNPCGSCWAFAAIATVEGIYKIKTGYLVSLSEQEVLDCAV--S 182
Query: 197 NGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQA 256
GC GG KA+ +II N G+ TE+ YPYQA GTC+A P +A I+ Y V DE++
Sbjct: 183 YGCKGGWVNKAYDFIISNNGVTTEENYPYQAYQGTCNANSFPNSAYITGYSYVRRNDERS 242
Query: 257 LLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKN 316
++ AVS QP++ I A S FQ Y G+F+G CGT L+HA+TI+G+G G YW+++N
Sbjct: 243 MMYAVSNQPIAALIDA-SENFQYYNGGVFSGPCGTSLNHAITIIGYGQDSSGTKYWIVRN 301
Query: 317 SWGNTWGDAGYMKIVR----DEGLCGIGTRSSYP 346
SWG++WG+ GY+++ R G CGI +P
Sbjct: 302 SWGSSWGEGGYVRMARGVSSSSGACGIAMSPLFP 335
>gi|118486542|gb|ABK95110.1| unknown [Populus trichocarpa]
Length = 465
Score = 303 bits (776), Expect = 8e-80, Method: Compositional matrix adjust.
Identities = 152/329 (46%), Positives = 210/329 (63%), Gaps = 17/329 (5%)
Query: 32 VVSSRSTH--------EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANK 83
++S TH + V+ ++E+W+ +HG++Y EKE R +IFK+NL +I++ N
Sbjct: 28 IISYHQTHATKSSWRTDDEVMAMYEEWLVKHGKNYNALGEKEKRFEIFKDNLMFIDQHNS 87
Query: 84 EGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWR 143
E NRTY +G N+F+DLTN+EFR++Y G + TS +Y +P S+DWR
Sbjct: 88 E-NRTYTVGLNRFADLTNEEFRSMYLGTRTGHKKRLPKTSD--RYAPRVGDSLPDSVDWR 144
Query: 144 DKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGS 203
+GAV +K+Q CG CWAF+ +AAVEGI KI +G+LI LSEQ+L+DC T+ N GC GG
Sbjct: 145 KEGAVAEVKDQGGCGSCWAFSTIAAVEGINKIVTGDLIALSEQELVDCDTSYNEGCNGGL 204
Query: 204 REKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVS 262
+ AF +II N GI TED+YPY G C +K A I +YE+VP DE AL KAV+
Sbjct: 205 MDYAFEFIINNGGIDTEDDYPYLGRDGRCDTYRKNAKVVSIDSYEDVPENDETALKKAVA 264
Query: 263 MQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTW 322
QPVS+AI FQ Y G+F G CGT LDH V VG+G TE G +YW+++NSWG +W
Sbjct: 265 NQPVSVAIEGGGRNFQLYNSGVFTGECGTSLDHGVAAVGYG-TEKGKDYWIVRNSWGKSW 323
Query: 323 GDAGYMKIVRD----EGLCGIGTRSSYPL 347
G++GY+++ R+ G CGI SYP+
Sbjct: 324 GESGYIRMERNIASPTGKCGIAIEPSYPI 352
>gi|357437715|ref|XP_003589133.1| Cysteine proteinase [Medicago truncatula]
gi|87240770|gb|ABD32628.1| Granulin; Peptidase C1A, papain [Medicago truncatula]
gi|355478181|gb|AES59384.1| Cysteine proteinase [Medicago truncatula]
Length = 474
Score = 303 bits (776), Expect = 9e-80, Method: Compositional matrix adjust.
Identities = 157/356 (44%), Positives = 230/356 (64%), Gaps = 17/356 (4%)
Query: 1 MVLIFERSGSFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKD 60
+VLI SF ++ II+ + + S R+ E V+ ++E+W+ +HG+SY
Sbjct: 14 IVLIIS---SFTVSLALDMSIISYDKTHPDKSTSKRTNKE--VLTMYEEWLVKHGKSYNG 68
Query: 61 ELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRS 120
EK+ R +IFK+NL++I++ N N TY+LG +F+DLTN+E+R+ + G K+ P+ R
Sbjct: 69 LGEKDKRFEIFKDNLKFIDEHNGL-NSTYRLGLTRFADLTNEEYRSKFLGTKI-DPNRRM 126
Query: 121 TT---SSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRS 177
S + +Y +P S+DWR +GAV +K+Q CG CWAF+A+AAVEGI KI +
Sbjct: 127 KKLGGSKSNRYAPRVGDKLPESVDWRKEGAVVGVKDQASCGSCWAFSAIAAVEGINKIVT 186
Query: 178 GNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQK 237
G+LI LSEQ+L+DC T+ N GC GG + AF +II N GI +ED+YPY+AV G C +K
Sbjct: 187 GDLISLSEQELVDCDTSYNEGCNGGLMDYAFEFIISNGGIDSEDDYPYKAVDGRCDQNRK 246
Query: 238 PA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHA 296
A I +YE+VP+ DE AL KAV+ QP+++A+ EFQ Y+ G+F G CGT LDH
Sbjct: 247 NAKVVTIDDYEDVPAYDELALQKAVANQPIAVAVEGGGREFQLYEYGVFTGRCGTALDHG 306
Query: 297 VTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-----EGLCGIGTRSSYPL 347
V VG+G TE+G +YW+++NSWG +WG+ GY+++ R+ G CGI SYP+
Sbjct: 307 VAAVGYG-TENGKDYWIVRNSWGGSWGEQGYIRLERNLASSRAGKCGIAIEPSYPI 361
>gi|357437719|ref|XP_003589135.1| Cysteine proteinase [Medicago truncatula]
gi|355478183|gb|AES59386.1| Cysteine proteinase [Medicago truncatula]
Length = 457
Score = 303 bits (775), Expect = 9e-80, Method: Compositional matrix adjust.
Identities = 157/356 (44%), Positives = 230/356 (64%), Gaps = 17/356 (4%)
Query: 1 MVLIFERSGSFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKD 60
+VLI SF ++ II+ + + S R+ E V+ ++E+W+ +HG+SY
Sbjct: 14 IVLIIS---SFTVSLALDMSIISYDKTHPDKSTSKRTNKE--VLTMYEEWLVKHGKSYNG 68
Query: 61 ELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRS 120
EK+ R +IFK+NL++I++ N N TY+LG +F+DLTN+E+R+ + G K+ P+ R
Sbjct: 69 LGEKDKRFEIFKDNLKFIDEHNGL-NSTYRLGLTRFADLTNEEYRSKFLGTKI-DPNRRM 126
Query: 121 TT---SSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRS 177
S + +Y +P S+DWR +GAV +K+Q CG CWAF+A+AAVEGI KI +
Sbjct: 127 KKLGGSKSNRYAPRVGDKLPESVDWRKEGAVVGVKDQASCGSCWAFSAIAAVEGINKIVT 186
Query: 178 GNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQK 237
G+LI LSEQ+L+DC T+ N GC GG + AF +II N GI +ED+YPY+AV G C +K
Sbjct: 187 GDLISLSEQELVDCDTSYNEGCNGGLMDYAFEFIISNGGIDSEDDYPYKAVDGRCDQNRK 246
Query: 238 PA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHA 296
A I +YE+VP+ DE AL KAV+ QP+++A+ EFQ Y+ G+F G CGT LDH
Sbjct: 247 NAKVVTIDDYEDVPAYDELALQKAVANQPIAVAVEGGGREFQLYEYGVFTGRCGTALDHG 306
Query: 297 VTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-----EGLCGIGTRSSYPL 347
V VG+G TE+G +YW+++NSWG +WG+ GY+++ R+ G CGI SYP+
Sbjct: 307 VAAVGYG-TENGKDYWIVRNSWGGSWGEQGYIRLERNLASSRAGKCGIAIEPSYPI 361
>gi|146215992|gb|ABQ10198.1| actinidin Act4b [Actinidia eriantha]
Length = 379
Score = 303 bits (775), Expect = 9e-80, Method: Compositional matrix adjust.
Identities = 148/309 (47%), Positives = 205/309 (66%), Gaps = 9/309 (2%)
Query: 43 VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTND 102
V+ + E W+ ++G+SY EKE R +IFK+NL ++++ N + NR+YK+G NQFSDLT +
Sbjct: 44 VMAMFESWLVEYGKSYNALGEKERRFEIFKDNLRFVDEHNADVNRSYKVGLNQFSDLTLE 103
Query: 103 EFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWA 162
E+ ++Y G K T+ + +Y+ +P S+DWR KGAV +KNQ CG CW
Sbjct: 104 EYSSIYLGTKF----DMRMTNVSDRYEPRVGDQLPNSIDWRKKGAVLGVKNQGNCGSCWT 159
Query: 163 FAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFAYIIQNQGIATED 221
FA +AAVE I +I +GNLI LSEQQ++DC NNGC GGSR A+ +II N GI TE
Sbjct: 160 FAPIAAVEAINQIVTGNLISLSEQQIVDCQRKSPNNGCKGGSRAGAYQFIIDNGGINTEA 219
Query: 222 EYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYK 281
YPY+A G C + I YE VP +E+AL KAVS Q VS+ IA+ S+EF++YK
Sbjct: 220 NYPYKAQDGECDEQKNQKYVTIDRYENVPRKNEKALQKAVSNQLVSVGIASNSSEFKAYK 279
Query: 282 EGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR---DEGLCG 338
GIF G CG ++DHAVTIVG+G TE G +YW+++NSWG+ WG+ GY+++ R + G C
Sbjct: 280 SGIFTGPCGAKIDHAVTIVGYG-TEGGMDYWIVRNSWGSNWGENGYVRMQRNVGNAGTCF 338
Query: 339 IGTRSSYPL 347
I T +YP+
Sbjct: 339 IATSPNYPV 347
>gi|168017893|ref|XP_001761481.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162687165|gb|EDQ73549.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 471
Score = 303 bits (775), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 143/315 (45%), Positives = 205/315 (65%), Gaps = 11/315 (3%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
+ ++++ +W+ +H R Y EK+ R +IFK+NL YI NK+ ++Y LG N+FSDL
Sbjct: 45 DDGMLDVFHQWLERHSRVYHSLSEKQRRFQIFKDNLHYIHNHNKQ-EKSYWLGLNKFSDL 103
Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGC 159
T+DEFRALY G + +H F Y+++ ++ +DWR KGAV+ +K+Q CG
Sbjct: 104 THDEFRALYLGIRPAGRAHGLRNGDRFIYEDVVAEEM---VDWRKKGAVSDVKDQGSCGS 160
Query: 160 CWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIAT 219
CWAF+A+ +VEG+ I +G LI LSEQ+L+DC N GC GG + AF +II+N GI T
Sbjct: 161 CWAFSAIGSVEGVNAIVTGELISLSEQELVDCDRGQNQGCNGGLMDYAFDFIIKNGGIDT 220
Query: 220 EDEYPYQAVPGTCSAAQKPAA--AKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEF 277
E++YPY+A G C A+K + I +Y++VP+ E +LLKAVS PVS+AI A +F
Sbjct: 221 EEDYPYKATDGQCDEARKETSKVVVIDDYQDVPTKSESSLLKAVSKNPVSVAIEAGGRDF 280
Query: 278 QSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR----- 332
Q Y+ G+F G CGT LDH V VG+GT +DG NYW++KNSWG +WG+ GY+++ R
Sbjct: 281 QHYQGGVFTGPCGTDLDHGVLAVGYGTDDDGVNYWIVKNSWGPSWGEKGYIRMERMGSNS 340
Query: 333 DEGLCGIGTRSSYPL 347
G CGI S+P+
Sbjct: 341 TSGKCGINIEPSFPI 355
>gi|109119897|dbj|BAE96008.1| cysteine proteinase [Triticum aestivum]
Length = 377
Score = 303 bits (775), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 143/321 (44%), Positives = 203/321 (63%), Gaps = 15/321 (4%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
E+++ +++E+W H R + EK R FK N+ +I NK G+R Y+L N+F D+
Sbjct: 39 EEALWDLYERWQTAH-RVPRHHAEKHRRFGTFKSNVHFIHSHNKRGDRPYRLRLNRFGDM 97
Query: 100 TNDEFRALYTGYKMPSPSHRSTTSST-----FKYQNLSMTDVPTSLDWRDKGAVTPIKNQ 154
+ EFRA + G ++ S R ++ F Y ++++D+P S+DWR KGAVT +KNQ
Sbjct: 98 SQAEFRATFAGSRV-SDRRRDGPATPPSVPGFMYAAVNVSDLPRSVDWRQKGAVTGVKNQ 156
Query: 155 KECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQN 214
+CG CWAF+ V +VEGI IR+G L+ LSEQ+L+DC T N+GC GG + AF YI +N
Sbjct: 157 GKCGSCWAFSTVVSVEGINAIRTGKLVSLSEQELIDCDTADNDGCEGGLMDNAFEYIKKN 216
Query: 215 QGIATEDEYPYQAVPGTCSAAQ----KPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAI 270
G+ TE YPY+A GTC AA+ P I +++VP+ E+AL KAV+ QPVS+ I
Sbjct: 217 GGLTTEAAYPYRAANGTCKAAKVAKSSPMVVHIDGHQDVPANSEEALAKAVANQPVSVGI 276
Query: 271 AAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKI 330
A F Y EG+F G CGT+LDH V +VG+G EDG YW +KNSWG +WG+ GY+++
Sbjct: 277 DASGKAFMFYSEGVFTGECGTELDHGVAVVGYGVAEDGKAYWTVKNSWGPSWGEKGYIRV 336
Query: 331 VRDE----GLCGIGTRSSYPL 347
+D GLCGI +SY +
Sbjct: 337 EKDSGAEGGLCGIAMEASYAV 357
>gi|50355623|dbj|BAD29960.1| cysteine protease [Daucus carota]
Length = 460
Score = 302 bits (773), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 143/326 (43%), Positives = 213/326 (65%), Gaps = 12/326 (3%)
Query: 32 VVSSRSTH-----EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGN 86
+++ TH + ++ +E W+ +HG+SY EKE R +IFK+N YI++ N +
Sbjct: 24 IITYDQTHAVGSTDDVIMAAYESWLVKHGKSYNALGEKEQRFQIFKDNFLYIDEQNAAKD 83
Query: 87 RTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKG 146
R++KLG N+F+DLTN+E+R+ YTG + S + + + +Y +L+ +P S+DWR+ G
Sbjct: 84 RSFKLGLNRFADLTNEEYRSKYTGIRTKD-SRKKVSGKSQRYASLAGESLPESVDWREHG 142
Query: 147 AVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREK 206
AV +K+Q +CG CWAF+ ++AVEGI +I +G LI LSEQ+L+DC + N GC GG +
Sbjct: 143 AVASVKDQGQCGSCWAFSTISAVEGINQIATGKLITLSEQELVDCDRSYNEGCNGGLMDD 202
Query: 207 AFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQP 265
AF +II N GI ++ +YPY G C +K A I +YE+VP DE+AL KA + QP
Sbjct: 203 AFQFIINNGGIDSDADYPYTGRDGQCDQYRKNAKVVTIDSYEDVPEYDEKALQKAAANQP 262
Query: 266 VSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDA 325
+S+AI A +FQ Y GIF G CGT LDH V +VG+G TE+G +YW+++NSWG WG+
Sbjct: 263 ISVAIEASGRDFQFYDSGIFTGKCGTDLDHGVVVVGYG-TENGKDYWIVRNSWGADWGEK 321
Query: 326 GYMKIVR----DEGLCGIGTRSSYPL 347
GY+++ R G+CGI + SYP+
Sbjct: 322 GYLRMERGISSKAGICGITSEPSYPV 347
>gi|18422289|ref|NP_568620.1| Granulin repeat cysteine protease family protein [Arabidopsis
thaliana]
gi|9757832|dbj|BAB08269.1| cysteine protease component of protease-inhibitor complex
[Arabidopsis thaliana]
gi|17065064|gb|AAL32686.1| cysteine protease component of protease-inhibitor complex
[Arabidopsis thaliana]
gi|21387153|gb|AAM47980.1| cysteine protease component of protease-inhibitor complex
[Arabidopsis thaliana]
gi|332007522|gb|AED94905.1| Granulin repeat cysteine protease family protein [Arabidopsis
thaliana]
Length = 463
Score = 302 bits (773), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 153/356 (42%), Positives = 227/356 (63%), Gaps = 27/356 (7%)
Query: 13 INTTPMFIIITLL-VSCA-----------SQVVSSRSTHEQSVVEIHEKWMAQHGRSYKD 60
+ +PM +++ ++ VS A + + S + V I+E WM +HG+ +
Sbjct: 4 LKLSPMILLLAMIGVSYAMDMSIISYDENHHITTETSRSDSEVERIYEAWMVEHGKKKMN 63
Query: 61 E----LEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSP 116
+ EK+ R +IFK+NL +I++ N + N +YKLG +F+DLTN+E+R++Y G K P
Sbjct: 64 QNGLGAEKDQRFEIFKDNLRFIDEHNTK-NLSYKLGLTRFADLTNEEYRSMYLGAK---P 119
Query: 117 SHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIR 176
+ R +S +YQ +P S+DWR +GAV +K+Q CG CWAF+ + AVEGI KI
Sbjct: 120 TKRVLKTSD-RYQARVGDALPDSVDWRKEGAVADVKDQGSCGSCWAFSTIGAVEGINKIV 178
Query: 177 SGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQ 236
+G+LI LSEQ+L+DC T+ N GC GG + AF +II+N GI TE +YPY+A G C +
Sbjct: 179 TGDLISLSEQELVDCDTSYNQGCNGGLMDYAFEFIIKNGGIDTEADYPYKAADGRCDQNR 238
Query: 237 KPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDH 295
K A I +YE+VP E +L KA++ QP+S+AI A FQ Y G+F+G+CGT+LDH
Sbjct: 239 KNAKVVTIDSYEDVPENSEASLKKALAHQPISVAIEAGGRAFQLYSSGVFDGLCGTELDH 298
Query: 296 AVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
V VG+G TE+G +YW+++NSWGN WG++GY+K+ R+ G CGI +SYP+
Sbjct: 299 GVVAVGYG-TENGKDYWIVRNSWGNRWGESGYIKMARNIEAPTGKCGIAMEASYPI 353
>gi|118124|sp|P25250.1|CYSP2_HORVU RecName: Full=Cysteine proteinase EP-B 2; Flags: Precursor
gi|1146118|gb|AAA85036.1| cysteine proteinase EPB2 precursor [Hordeum vulgare subsp. vulgare]
Length = 373
Score = 302 bits (773), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 140/317 (44%), Positives = 202/317 (63%), Gaps = 11/317 (3%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
E+++ +++E+W + H R + EK R FK N +I NK G+ Y+L N+F D+
Sbjct: 39 EEALWDLYERWQSAH-RVRRHHAEKHRRFGTFKSNAHFIHSHNKRGDHPYRLHLNRFGDM 97
Query: 100 TNDEFRALYTG-YKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECG 158
EFRA + G + +PS + + F Y L+++D+P S+DWR KGAVT +K+Q +CG
Sbjct: 98 DQAEFRATFVGDLRRDTPS-KPPSVPGFMYAALNVSDLPPSVDWRQKGAVTGVKDQGKCG 156
Query: 159 CCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIA 218
CWAF+ V +VEGI IR+G+L+ LSEQ+L+DC T N+GC GG + AF YI N G+
Sbjct: 157 SCWAFSTVVSVEGINAIRTGSLVSLSEQELIDCDTADNDGCQGGLMDNAFEYIKNNGGLI 216
Query: 219 TEDEYPYQAVPGTCSAAQ----KPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYS 274
TE YPY+A GTC+ A+ P I +++VP+ E+ L +AV+ QPVS+A+ A
Sbjct: 217 TEAAYPYRAARGTCNVARAAQNSPVVVHIDGHQDVPANSEEDLARAVANQPVSVAVEASG 276
Query: 275 TEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE 334
F Y EG+F G CGT+LDH V +VG+G EDG YW +KNSWG +WG+ GY+++ +D
Sbjct: 277 KAFMFYSEGVFTGECGTELDHGVAVVGYGVAEDGKAYWTVKNSWGPSWGEQGYIRVEKDS 336
Query: 335 ----GLCGIGTRSSYPL 347
GLCGI +SYP+
Sbjct: 337 GASGGLCGIAMEASYPV 353
>gi|357465603|ref|XP_003603086.1| Cysteine proteinase [Medicago truncatula]
gi|355492134|gb|AES73337.1| Cysteine proteinase [Medicago truncatula]
Length = 474
Score = 302 bits (773), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 158/359 (44%), Positives = 228/359 (63%), Gaps = 28/359 (7%)
Query: 14 NTTPMFIIITLLV------SCASQVVSSRSTH--------EQSVVEIHEKWMAQHGR--S 57
N +PM +I+ + + ++S TH ++ V I+E+W +HG+ +
Sbjct: 6 NRSPMLVILIVFTLFTATFALDMSIISYDKTHSDKSSRRSDKEVKNIYEEWRVKHGKLNN 65
Query: 58 YKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPS 117
D EK+ R +IFK+NL++I++ N E NRTYK+G N+F+DL+N+E+R+ Y G K+
Sbjct: 66 NIDGSEKDKRFEIFKDNLKFIDEHNAE-NRTYKVGLNRFADLSNEEYRSRYLGTKIDPIG 124
Query: 118 H---RSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITK 174
R+ T S +Y +P S+DWR +GAV +K+Q CG CWAF+ +AAVEGI K
Sbjct: 125 MMMARTKTRSN-RYAPSVGDKLPKSVDWRSQGAVVQVKDQGSCGSCWAFSTIAAVEGINK 183
Query: 175 IRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSA 234
I +G L+ LSEQ+L+DC N GC GG E AF +II N GI ++++YPY+ V G C
Sbjct: 184 IVTGELVSLSEQELVDCDRTVNAGCDGGLMEYAFEFIINNGGIDSDEDYPYRGVDGKCDQ 243
Query: 235 AQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQL 293
+K A I +YE+VP+ DE AL KAV+ QP+S+AI A EFQ Y GIF G CGT L
Sbjct: 244 YKKNARVVSIDDYEQVPAYDELALKKAVANQPISVAIEAGGREFQLYVSGIFTGKCGTAL 303
Query: 294 DHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-----EGLCGIGTRSSYPL 347
DH VT VG+G TE+G +YW+++NSWG +WG++GY+++ R+ G CGI +SSYP+
Sbjct: 304 DHGVTAVGYG-TENGVDYWIVRNSWGKSWGESGYVRMERNLAASVAGKCGIVMQSSYPI 361
>gi|2342494|dbj|BAA21848.1| bromelain [Ananas comosus]
gi|2463582|dbj|BAA22543.1| FB31 precursor [Ananas comosus]
Length = 352
Score = 301 bits (772), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 148/333 (44%), Positives = 211/333 (63%), Gaps = 9/333 (2%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
+F+ + L V AS +SR +++ E+WMA++GR YKD EK R +IFK N+ +
Sbjct: 8 VFLFLFLCVMWASPSAASRDEPSDPMMKRFEEWMAEYGRVYKDNDEKMRRFQIFKNNVNH 67
Query: 78 IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTG-YKMPSPSHRSTTSSTFKYQNLSMTDV 136
IE N +Y LG N+F+D+TN+EF A YTG P + S + +++++ V
Sbjct: 68 IETFNNRNGNSYTLGINKFTDMTNNEFVAQYTGGISRPLNIEKEPVVS---FDDVNISAV 124
Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGN 196
S+DWRD GAVT +K+Q CG CWAF+A+A VEGI KI +G L+ LSEQ++LDC+ +
Sbjct: 125 GQSIDWRDYGAVTEVKDQNPCGSCWAFSAIATVEGIYKIVTGYLVSLSEQEVLDCAVS-- 182
Query: 197 NGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQA 256
NGC GG + A+ +II N G+A+E +YPYQA G C+A P +A I+ Y V S DE +
Sbjct: 183 NGCDGGFVDNAYDFIISNNGVASEADYPYQAYQGDCAANSWPNSAYITGYSYVRSNDESS 242
Query: 257 LLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKN 316
+ AV QP++ AI A FQ Y G+F+G CGT L+HA+TI+G+G G YW++KN
Sbjct: 243 MKYAVWNQPIAAAIDASGDNFQYYNGGVFSGPCGTSLNHAITIIGYGQDSSGTQYWIVKN 302
Query: 317 SWGNTWGDAGYMKIVR---DEGLCGIGTRSSYP 346
SWG++WG+ GY+++ R GLCGI YP
Sbjct: 303 SWGSSWGERGYIRMARGVSSSGLCGIAMDPLYP 335
>gi|226533314|ref|NP_001150119.1| xylem cysteine proteinase 2 [Zea mays]
gi|195636886|gb|ACG37911.1| xylem cysteine proteinase 2 precursor [Zea mays]
gi|223946183|gb|ACN27175.1| unknown [Zea mays]
gi|413951209|gb|AFW83858.1| Xylem cysteine proteinase 2 [Zea mays]
Length = 385
Score = 301 bits (772), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 153/335 (45%), Positives = 210/335 (62%), Gaps = 27/335 (8%)
Query: 37 STHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQF 96
S+HE S+ E+ E+W+++H R+Y EK R ++FK+NL +I++ N++ + +Y LG N+F
Sbjct: 50 SSHE-SLAELFERWLSRHRRAYASLEEKLRRFQVFKDNLHHIDETNRKVS-SYWLGLNEF 107
Query: 97 SDLTNDEFRALYTGYK------MPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTP 150
+DLT+DEF+A Y G + Y+ + +P S+DWR KGAVT
Sbjct: 108 ADLTHDEFKATYLGLRSSVGDGGSGIDDDDEPEEEEGYEGVDGASLPKSVDWRSKGAVTG 167
Query: 151 IKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAY 210
+KNQ +CG CWAF+ VAAVEGI +I +GNL LSEQ+L+DC T+GNNGC GG + AF+Y
Sbjct: 168 VKNQGQCGSCWAFSTVAAVEGINQIVTGNLTALSEQELIDCDTDGNNGCNGGLMDYAFSY 227
Query: 211 IIQNQGIATEDEYPYQAVPGTC---------------SAAQKPAAAKISNYEEVPSGDEQ 255
I N G+ TE+ YPY GTC A A IS YE+VP +EQ
Sbjct: 228 IAHNGGLHTEEAYPYLMEEGTCQRSSSSEKKWPGSSEDANDDAAVVTISGYEDVPRNNEQ 287
Query: 256 ALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIK 315
ALLKA++ QPVS+AI A FQ Y G+F+G CGTQLDH V VG+GT G +Y ++K
Sbjct: 288 ALLKALAQQPVSVAIEASGRNFQFYSGGVFDGPCGTQLDHGVAAVGYGTAAKGHDYIIVK 347
Query: 316 NSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
NSWG +WG+ GY+++ R +GLCGI +SYP
Sbjct: 348 NSWGPSWGEKGYIRMRRGTGKRQGLCGINKMASYP 382
>gi|297792329|ref|XP_002864049.1| hypothetical protein ARALYDRAFT_495086 [Arabidopsis lyrata subsp.
lyrata]
gi|297309884|gb|EFH40308.1| hypothetical protein ARALYDRAFT_495086 [Arabidopsis lyrata subsp.
lyrata]
Length = 361
Score = 301 bits (772), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 154/347 (44%), Positives = 216/347 (62%), Gaps = 26/347 (7%)
Query: 19 FIIITLLVSCASQVVSSRSTHEQ------SVVEIHEKWMAQH--GRSYKDELEKEMRLKI 70
FI++ L + + S HE+ S+ E++E+W + H RS + EK R +
Sbjct: 4 FIVLALCMLMVLETTKSLDFHEKDVESEDSLWELYERWKSHHTIARSLE---EKAKRFNV 60
Query: 71 FKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHR-----STTSST 125
FK N+++I + NK+ N +YKL N+F D+T++EFR Y G + HR T+ +
Sbjct: 61 FKHNVKHIHETNKKEN-SYKLKLNKFGDMTSEEFRRTYAGSNIKH--HRMFQGERQTTKS 117
Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
F Y N+ +PTS+DWR GAVTP+KNQ +CG CWAF+ V AVEGI +IR+ L LSE
Sbjct: 118 FMYANVDT--LPTSVDWRKNGAVTPVKNQGQCGSCWAFSTVVAVEGINQIRTKKLTSLSE 175
Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKIS 244
Q+L+DC TN N GC GG + AF +I + G+ +E YPY+A TC ++ A I
Sbjct: 176 QELVDCDTNKNQGCNGGLMDLAFEFIKEKGGLTSELVYPYKASDETCDTNKENAPVVSID 235
Query: 245 NYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGT 304
+E+VP E L+KAV+ QPVS+AI A ++FQ Y EG+F G CGT+L+H V +VG+GT
Sbjct: 236 GHEDVPKNSEVDLMKAVAHQPVSVAIDAGGSDFQFYSEGVFTGRCGTELNHGVAVVGYGT 295
Query: 305 TEDGANYWLIKNSWGNTWGDAGYMKIVR----DEGLCGIGTRSSYPL 347
T DG YW++KNSWG WG+ GY+++ R EGLCGI +SYPL
Sbjct: 296 TIDGTKYWIVKNSWGEEWGEKGYIRMQRGIRHKEGLCGIAMEASYPL 342
>gi|356515080|ref|XP_003526229.1| PREDICTED: vignain-like [Glycine max]
Length = 284
Score = 301 bits (772), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 150/286 (52%), Positives = 196/286 (68%), Gaps = 15/286 (5%)
Query: 72 KENLEYIEKANKEGNRTYKLGTNQFSDLTNDEF---RALYTGYKMPSPSHRSTTSSTFKY 128
KEN+ YIE N N+ YKLG NQF+DLT++EF R + G+ S +T ++TFKY
Sbjct: 5 KENVNYIEAFNNAANKPYKLGINQFADLTSEEFIVPRNRFNGHMRFS----NTRTTTFKY 60
Query: 129 QNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQL 188
+N+++ +P S+DWR KGAVTPIKNQ CGCCWAF+A+AA EGI KI +G L+ LSEQ++
Sbjct: 61 ENVTV--LPDSIDWRQKGAVTPIKNQGSCGCCWAFSAIAATEGIHKISTGKLVSLSEQEV 118
Query: 189 LDCSTNG-NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNY 246
+DC T G ++GC GG + AF +IIQN GI TE YPY+ V G C+ ++ A I+ Y
Sbjct: 119 VDCDTKGTDHGCEGGYMDGAFKFIIQNHGINTEASYPYKGVDGKCNIKEEAVHATTITGY 178
Query: 247 EEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTE 306
E+VP +E+AL KAV+ QPVS+AI A +FQ YK GIF G CGT+LDH VT VG+G
Sbjct: 179 EDVPINNEKALQKAVANQPVSVAIDARGADFQFYKSGIFTGSCGTELDHGVTAVGYGENN 238
Query: 307 DGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
+G YWL+KNSWG WG+ GY + R EG+CGI +SYP A
Sbjct: 239 EGTKYWLVKNSWGTEWGEEGYTMMQRGVKAVEGICGIAMLASYPTA 284
>gi|118120|sp|P25249.1|CYSP1_HORVU RecName: Full=Cysteine proteinase EP-B 1; Flags: Precursor
gi|1146116|gb|AAA85035.1| cysteine proteinase EPB1 precursor [Hordeum vulgare subsp. vulgare]
Length = 371
Score = 301 bits (772), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 138/316 (43%), Positives = 198/316 (62%), Gaps = 9/316 (2%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
E+++ +++E+W + H R + EK R FK N +I NK G+ Y+L N+F D+
Sbjct: 39 EEALWDLYERWQSAH-RVRRHHAEKHRRFGTFKSNAHFIHSHNKRGDHPYRLHLNRFGDM 97
Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGC 159
EFRA + G + + F Y L+++D+P S+DWR KGAVT +K+Q +CG
Sbjct: 98 DQAEFRATFVGDLRRDTPAKPPSVPGFMYAALNVSDLPPSVDWRQKGAVTGVKDQGKCGS 157
Query: 160 CWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIAT 219
CWAF+ V +VEGI IR+G+L+ LSEQ+L+DC T N+GC GG + AF YI N G+ T
Sbjct: 158 CWAFSTVVSVEGINAIRTGSLVSLSEQELIDCDTADNDGCQGGLMDNAFEYIKNNGGLIT 217
Query: 220 EDEYPYQAVPGTCSAAQ----KPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYST 275
E YPY+A GTC+ A+ P I +++VP+ E+ L +AV+ QPVS+A+ A
Sbjct: 218 EAAYPYRAARGTCNVARAAQNSPVVVHIDGHQDVPANSEEDLARAVANQPVSVAVEASGK 277
Query: 276 EFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE- 334
F Y EG+F G CGT+LDH V +VG+G EDG YW +KNSWG +WG+ GY+++ +D
Sbjct: 278 AFMFYSEGVFTGDCGTELDHGVAVVGYGVAEDGKAYWTVKNSWGPSWGEQGYIRVEKDSG 337
Query: 335 ---GLCGIGTRSSYPL 347
GLCGI +SYP+
Sbjct: 338 ASGGLCGIAMEASYPV 353
>gi|225446523|ref|XP_002275891.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP2 [Vitis vinifera]
Length = 358
Score = 301 bits (772), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 149/307 (48%), Positives = 202/307 (65%), Gaps = 12/307 (3%)
Query: 47 HEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRA 106
+E+W+ QHGR YK+ E + I++ N+ +I N + N ++ L NQF+D+TN+E++A
Sbjct: 45 YERWLVQHGRRYKNRDEWQRHFGIYQSNVRFINYINAQ-NFSFTLTDNQFADMTNEEYKA 103
Query: 107 LYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAV 166
LY G S ++ S+FK + + +P S+DWR GAVTP++NQ ECG CWAF+ V
Sbjct: 104 LYMGLGTSETSRKN--QSSFKRERSKV--LPISVDWRKMGAVTPVRNQGECGSCWAFSTV 159
Query: 167 AAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEYPY 225
AAVEGI KIR+G L+ LSEQ+LLDC + GN GC GG AF +I QN GI T YPY
Sbjct: 160 AAVEGINKIRTGKLVSLSEQELLDCDIDSGNEGCNGGYMVNAFKFIKQNGGITTARNYPY 219
Query: 226 QAVPGTCSAAQKP-AAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGI 284
G C+ + KIS YE VP +E+ L AV+ QPVS+AI A EFQ Y +GI
Sbjct: 220 IGEQGICNKDKAANHVVKISGYETVPPNNEKILQAAVAKQPVSVAIDAGGYEFQLYSKGI 279
Query: 285 FNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR----DEGLCGIG 340
FNG CG QL+HAVT++G+G ++G YWL+KNSWG WG+AGY +++R DEG+CGI
Sbjct: 280 FNGFCGKQLNHAVTVIGYG-EDNGKKYWLVKNSWGTGWGEAGYARMIRDSRDDEGICGIA 338
Query: 341 TRSSYPL 347
+SYP+
Sbjct: 339 MEASYPI 345
>gi|326493368|dbj|BAJ85145.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 436
Score = 301 bits (772), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 147/333 (44%), Positives = 213/333 (63%), Gaps = 12/333 (3%)
Query: 22 ITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKA 81
++L + +VS E+ V ++ +WMA+HG +Y E+E R + F++NL YI++
Sbjct: 18 VSLAAAADMSIVSYGERSEEEVRRMYAEWMAEHGSTYNAIGEEERRFEAFRDNLRYIDQH 77
Query: 82 N---KEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPT 138
N G +++LG N+F+DLTN+E+R+ Y G + R ++ +YQ ++P
Sbjct: 78 NAAADAGVHSFRLGLNRFADLTNEEYRSTYLGARTKPDRERKLSA---RYQAADNDELPE 134
Query: 139 SLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNG 198
S+DWR KGAV +K+Q CG CWAF+A+AAVEGI +I +G++I LSEQ+L+DC T+ N G
Sbjct: 135 SVDWRKKGAVGAVKDQGGCGSCWAFSAIAAVEGINQIVTGDMIPLSEQELVDCDTSYNQG 194
Query: 199 CLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQAL 257
C GG + AF +II N GI +E++YPY+ C A +K A I YE+VP E++L
Sbjct: 195 CNGGLMDYAFEFIINNGGIDSEEDYPYKERDNRCDANKKNAKVVTIDGYEDVPVNSEKSL 254
Query: 258 LKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNS 317
KAV+ QP+S+AI A FQ YK GIF G CGT LDH V VG+G TE+G +YWL++NS
Sbjct: 255 QKAVANQPISVAIEAGGRAFQLYKSGIFTGTCGTALDHGVAAVGYG-TENGKDYWLVRNS 313
Query: 318 WGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
WG+ WG+ GY+++ R+ G CGI SYP
Sbjct: 314 WGSVWGEDGYIRMERNIKASSGKCGIAVEPSYP 346
>gi|255540425|ref|XP_002511277.1| cysteine protease, putative [Ricinus communis]
gi|46395620|sp|O65039.1|CYSEP_RICCO RecName: Full=Vignain; AltName: Full=Cysteine endopeptidase; Flags:
Precursor
gi|2944446|gb|AAC62396.1| cysteine endopeptidase precursor [Ricinus communis]
gi|223550392|gb|EEF51879.1| cysteine protease, putative [Ricinus communis]
Length = 360
Score = 301 bits (771), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 150/312 (48%), Positives = 203/312 (65%), Gaps = 16/312 (5%)
Query: 46 IHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFR 105
++E+W + H S + EK+ R +FK N ++ ANK ++ YKL N+F+D+TN EFR
Sbjct: 37 LYERWRSHHTVS-RSLHEKQKRFNVFKHNAMHVHNANKM-DKPYKLKLNKFADMTNHEFR 94
Query: 106 ALYTGYKMPSPSHR-----STTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCC 160
Y+G K+ HR + TF Y+ + VP S+DWR KGAVT +K+Q +CG C
Sbjct: 95 NTYSGSKVKH--HRMFRGGPRGNGTFMYEKVDT--VPASVDWRKKGAVTSVKDQGQCGSC 150
Query: 161 WAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATE 220
WAF+ + AVEGI +I++ L+ LSEQ+L+DC T+ N GC GG + AF +I Q GI TE
Sbjct: 151 WAFSTIVAVEGINQIKTNKLVSLSEQELVDCDTDQNQGCNGGLMDYAFEFIKQRGGITTE 210
Query: 221 DEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQS 279
YPY+A GTC +++ A A I +E VP DE ALLKAV+ QPVS+AI A ++FQ
Sbjct: 211 ANYPYEAYDGTCDVSKENAPAVSIDGHENVPENDENALLKAVANQPVSVAIDAGGSDFQF 270
Query: 280 YKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR----DEG 335
Y EG+F G CGT+LDH V IVG+GTT DG YW +KNSWG WG+ GY+++ R EG
Sbjct: 271 YSEGVFTGSCGTELDHGVAIVGYGTTIDGTKYWTVKNSWGPEWGEKGYIRMERGISDKEG 330
Query: 336 LCGIGTRSSYPL 347
LCGI +SYP+
Sbjct: 331 LCGIAMEASYPI 342
>gi|18141283|gb|AAL60579.1|AF454957_1 senescence-associated cysteine protease [Brassica oleracea]
Length = 460
Score = 301 bits (771), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 150/357 (42%), Positives = 224/357 (62%), Gaps = 24/357 (6%)
Query: 9 GSFKINTTPMFIIITLLVSCASQVVSSRSTH---------EQSVVEIHEKWMAQHGRSYK 59
GS K+ + ++I + + ++S H + V I+E WM +HG+ +
Sbjct: 2 GSVKVTILLLAMMIGVSYAADMSIISYDEKHHITAENERSDAEVARIYEAWMEKHGKKAQ 61
Query: 60 DE----LEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPS 115
EK+ R +IFK+NL +I++ N + N +YKLG +F+DLTN+E+R++Y G K
Sbjct: 62 SNGLVGEEKDQRFEIFKDNLRFIDEHNNK-NLSYKLGLTRFADLTNEEYRSIYLGAK--- 117
Query: 116 PSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKI 175
S + ++ +YQ +P S+DWR +GAV +K+Q CG CWAF+ + AVEGI KI
Sbjct: 118 -SKKRVLKTSDRYQPRVGDAIPDSVDWRKEGAVAAVKDQGSCGSCWAFSTIGAVEGINKI 176
Query: 176 RSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAA 235
+G+LI LSEQ+L+DC T+ N GC GG + AF +II+N GI TE++YPY+A G C
Sbjct: 177 VTGDLISLSEQELVDCDTSYNQGCNGGLMDYAFEFIIKNGGIDTEEDYPYKAADGRCDQT 236
Query: 236 QKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLD 294
+K A I YE+VP +E AL K ++ QP+S+AI A FQ Y G+F+G+CGT+LD
Sbjct: 237 RKNAKVVTIDAYEDVPENNEAALKKTLANQPISVAIEAGGRAFQLYSSGVFDGICGTELD 296
Query: 295 HAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
H V VG+G TE+G +YW+++NSWG +WG++GY+K+ R+ G CGI +SYP+
Sbjct: 297 HGVVAVGYG-TENGKDYWIVRNSWGGSWGESGYIKMARNIAEPTGKCGIAMEASYPI 352
>gi|217073894|gb|ACJ85307.1| unknown [Medicago truncatula]
gi|388507498|gb|AFK41815.1| unknown [Medicago truncatula]
Length = 362
Score = 301 bits (771), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 154/349 (44%), Positives = 216/349 (61%), Gaps = 22/349 (6%)
Query: 15 TTPMFIIITLLVSCASQVVSSRSTHE------QSVVEIHEKWMAQHGRSYKDELEKEMRL 68
TT ++I L ++ V S H+ +S+ +++E+W + H S ++ EK+ R
Sbjct: 2 TTKKLLLIVLSIALVLVVSESFDFHDKDVSSDESLWDLYERWRSHHTVS-RNLNEKQKRF 60
Query: 69 KIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHR-----STTS 123
+FK N+ ++ NK ++ YKL N+F+D+TN EF+ Y G K+ HR S
Sbjct: 61 NVFKSNVMHVHNTNKM-DKPYKLKLNKFADMTNHEFKTTYAGSKVNH--HRMFRGTPRVS 117
Query: 124 STFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQL 183
TF Y+N T P S+DWR KGAVT +K+Q +CG CWAF+ V AVEGI +I++ L+ L
Sbjct: 118 GTFMYENF--TKAPASVDWRKKGAVTDVKDQGQCGSCWAFSTVVAVEGINQIKTNRLVPL 175
Query: 184 SEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAK 242
SEQ+L+DC N GC GG E AF YI Q G+ TE YPY A G+C A ++
Sbjct: 176 SEQELIDCDNQENQGCNGGLMEYAFEYIKQKGGVTTESYYPYTANDGSCDATKENVPTVS 235
Query: 243 ISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGF 302
I +E VP+ DE ALLKAV+ QPVS+AI A ++FQ Y EG+F G CG +L+H V IVG+
Sbjct: 236 IDGHETVPANDEDALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGDCGKELNHGVAIVGY 295
Query: 303 GTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
GTT DG NYW+++NSWG WG+ G +++ R+ EGLCGI +SYP+
Sbjct: 296 GTTVDGTNYWIVRNSWGAEWGEQGCIRMKRNVSNKEGLCGIAMEASYPV 344
>gi|40806500|gb|AAR92155.1| putative cysteine protease 2 [Iris x hollandica]
Length = 359
Score = 301 bits (771), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 145/315 (46%), Positives = 204/315 (64%), Gaps = 11/315 (3%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
E+S+ ++E+W + H S +D EK R +FKEN ++I + NK+ + YKLG N+F+D+
Sbjct: 33 EESLWGLYERWRSHHTVS-RDLSEKNKRFNVFKENAKFIHEFNKK-DAPYKLGLNKFADM 90
Query: 100 TNDEFRALYTGYKMPSP-SHRSTTSST--FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKE 156
TN EFR+ Y G K+ + R T +T F Y+N+ +P S+DWR +GAV P+K+Q +
Sbjct: 91 TNQEFRSTYAGSKIHHHRTQRGTPRATGSFMYENVH--SIPASVDWRTQGAVAPVKDQGQ 148
Query: 157 CGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQG 216
CG CWAF+ +A+VEGI KI++ L+ LS QQL+DC T+ N GC GG + AF +I N G
Sbjct: 149 CGSCWAFSTIASVEGINKIKTNQLVPLSGQQLVDCDTDQNEGCNGGLMDYAFEFIKSNGG 208
Query: 217 IATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTE 276
I +E YPY A G+C++ I YE+VP+ +E AL+KAV+ Q VS+AI A
Sbjct: 209 ITSESAYPYTAEQGSCASESSAPVVTIDGYEDVPANNEAALMKAVANQVVSVAIEASGMA 268
Query: 277 FQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD--- 333
FQ Y EG+F G CG +LDH V +VG+G T DG YW+++NSWG WG+ GY+++ R
Sbjct: 269 FQFYSEGVFTGSCGNELDHGVAVVGYGATRDGTKYWIVRNSWGAEWGEKGYIRMQRGIRA 328
Query: 334 -EGLCGIGTRSSYPL 347
GLCGI SYPL
Sbjct: 329 RHGLCGIAMEPSYPL 343
>gi|302764466|ref|XP_002965654.1| hypothetical protein SELMODRAFT_230713 [Selaginella moellendorffii]
gi|300166468|gb|EFJ33074.1| hypothetical protein SELMODRAFT_230713 [Selaginella moellendorffii]
Length = 345
Score = 301 bits (771), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 143/307 (46%), Positives = 201/307 (65%), Gaps = 12/307 (3%)
Query: 45 EIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEF 104
++++KW+ +HG++Y E + R +IFKEN+ YI N N ++ LG N+F+DLTN EF
Sbjct: 36 QVYQKWIQEHGKAYNSAHEYKKRFQIFKENVNYINSHNARRNNSHSLGLNKFADLTNSEF 95
Query: 105 RALYTG-YKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAF 163
R LY G + P+P H + + D TS+DWR KG VT IK+Q +CG CWAF
Sbjct: 96 RGLYVGRLQRPAPFHEVGDIAL-------VADTATSVDWRKKGGVTEIKDQGDCGSCWAF 148
Query: 164 AAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEY 223
+AVAAVEG+T + +G L+ LSEQ+L+DC T N GC GG + AF Y+I+N GI ++ Y
Sbjct: 149 SAVAAVEGLTFLSTGTLVSLSEQELVDCDTTVNQGCDGGIMDYAFQYMIRNGGITSQSNY 208
Query: 224 PYQAVPGTCSAAQ-KPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKE 282
PY+A+ G C + K AA I+ ++ +P E+ LL+AV+ QPVS+AI A +FQ Y
Sbjct: 209 PYRALRGACDKDKVKYHAATINGFQAIPPQSEELLLRAVANQPVSVAIEAGGQDFQLYSS 268
Query: 283 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD---EGLCGI 339
G+F G CG+ LDH V IVG+GT G YWL+KNSWG+ WG++GY+++ R G+CGI
Sbjct: 269 GVFTGECGSNLDHGVAIVGYGTDAGGRQYWLVKNSWGSGWGESGYVRMERQGPGAGVCGI 328
Query: 340 GTRSSYP 346
+SYP
Sbjct: 329 NLDASYP 335
>gi|355344587|gb|AER60490.1| cysteine proteases [Gossypium hirsutum]
Length = 371
Score = 301 bits (771), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 148/333 (44%), Positives = 212/333 (63%), Gaps = 9/333 (2%)
Query: 22 ITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKA 81
I+ L + SS + V+ +++ W+ QHG++Y E+E R +IFK+NL +I++
Sbjct: 20 ISTLTLNQNHPSSSSWRSDDEVMGLYKSWVIQHGKAYNGIGEEEKRFEIFKDNLRFIDEH 79
Query: 82 NKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSS--TFKYQNLSMTDVPTS 139
N N TYKLG N+F+DLTN E+RA + G + P R S + +Y + + ++P S
Sbjct: 80 NSNNNTTYKLGLNKFADLTNQEYRAKFLGTRT-DPRRRLMKSKIPSSRYAHRAGDNLPDS 138
Query: 140 LDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGC 199
+DWRD GAV+P+K+Q CG CWAF+ +A VEGI KI SG L+ LSEQ+L+DC + + GC
Sbjct: 139 VDWRDHGAVSPVKDQGSCGSCWAFSTIATVEGINKIVSGELVSLSEQELVDCDRSYDAGC 198
Query: 200 LGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALL 258
GG + AF +I+ N GI TE +YPY C +K A I YE+VP+ +E AL
Sbjct: 199 NGGLMDYAFQFIMDNGGIDTEKDYPYLGFNNQCDPTKKNAKVVSIDGYEDVPN-NENALK 257
Query: 259 KAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSW 318
KAV+ QPVSIAI A FQ Y+ G+FNG CG LDH V VG+GT ++G +YW+++NSW
Sbjct: 258 KAVAHQPVSIAIEAGGRAFQLYESGVFNGECGLALDHGVVAVGYGTDDNGQDYWIVRNSW 317
Query: 319 GNTWGDAGYMKIVR----DEGLCGIGTRSSYPL 347
G+ WG+ GY+++ R + G CGI +SYP+
Sbjct: 318 GSNWGENGYIRMERNINANTGKCGIAMEASYPV 350
>gi|1345573|emb|CAA40073.1| endopeptidase (EP-C1) [Phaseolus vulgaris]
Length = 361
Score = 301 bits (771), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 154/344 (44%), Positives = 210/344 (61%), Gaps = 14/344 (4%)
Query: 16 TPMFIIITLLVSCASQVVSSRSTH------EQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
T + + L S V +S H E+S+ +++E+W + H S + EK R
Sbjct: 2 TKKLLWVVLSFSLVLGVANSFDFHDKDLASEESLWDLYERWRSHHTVS-RSLGEKHKRFN 60
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPS-HRSTTSSTFKY 128
+FK NL ++ NK ++ YKL N+F+D+TN EFR+ Y G K+ R T +
Sbjct: 61 VFKANLMHVHNTNKM-DKPYKLKLNKFADMTNHEFRSTYAGSKVNHHRMFRGTPHENGAF 119
Query: 129 QNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQL 188
+ VP S+DWR KGAVT +K+Q +CG CWAF+ V AVEGI +I++ L+ LSEQ+L
Sbjct: 120 MYEKVVSVPPSVDWRKKGAVTDVKDQGQCGSCWAFSTVVAVEGINQIKTNKLVALSEQEL 179
Query: 189 LDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQ-KPAAAKISNYE 247
+DC N GC GG E AF +I Q GI TE YPY+A GTC A++ A I +E
Sbjct: 180 VDCDKEENQGCNGGLMESAFEFIKQKGGITTESNYPYKAQEGTCDASKVNDLAVSIDGHE 239
Query: 248 EVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTED 307
VP+ DE ALLKAV+ QPVS+AI A ++FQ Y EG+F G C T L+H V IVG+GTT D
Sbjct: 240 NVPANDEDALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGDCSTDLNHGVAIVGYGTTVD 299
Query: 308 GANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
G NYW+++NSWG WG+ GY+++ R+ EGLCGI SYP+
Sbjct: 300 GTNYWIVRNSWGPEWGEHGYIRMQRNISKKEGLCGIAMLPSYPI 343
>gi|388517427|gb|AFK46775.1| unknown [Medicago truncatula]
Length = 362
Score = 301 bits (771), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 154/349 (44%), Positives = 216/349 (61%), Gaps = 22/349 (6%)
Query: 15 TTPMFIIITLLVSCASQVVSSRSTHE------QSVVEIHEKWMAQHGRSYKDELEKEMRL 68
TT ++I L ++ V S H+ +S+ +++E+W + H S ++ EK+ R
Sbjct: 2 TTKKLLLIVLSIALVLVVSESFDFHDKDVSSDESLWDLYERWRSHHTVS-RNLNEKQKRF 60
Query: 69 KIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHR-----STTS 123
+FK N+ ++ NK ++ YKL N+F+D+TN EF+ Y G K+ HR S
Sbjct: 61 NVFKSNVMHVHNTNKM-DKPYKLKLNKFADMTNHEFKTTYAGTKVNH--HRMFRGTPRVS 117
Query: 124 STFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQL 183
TF Y+N T P S+DWR KGAVT +K+Q +CG CWAF+ V AVEGI +I++ L+ L
Sbjct: 118 GTFMYENF--TKAPASVDWRKKGAVTDVKDQGQCGSCWAFSTVVAVEGINQIKTNRLVPL 175
Query: 184 SEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAK 242
SEQ+L+DC N GC GG E AF YI Q G+ TE YPY A G+C A ++
Sbjct: 176 SEQELIDCDNQENQGCNGGLMEYAFEYIKQKGGVTTESYYPYTANDGSCDATKENVPTVS 235
Query: 243 ISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGF 302
I +E VP+ DE ALLKAV+ QPVS+AI A ++FQ Y EG+F G CG +L+H V IVG+
Sbjct: 236 IDGHETVPANDEDALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGDCGKELNHGVAIVGY 295
Query: 303 GTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
GTT DG NYW+++NSWG WG+ G +++ R+ EGLCGI +SYP+
Sbjct: 296 GTTVDGTNYWIVRNSWGAEWGEQGCIRMKRNVSNKEGLCGIAMEASYPV 344
>gi|42567068|ref|NP_567686.2| putative cysteine proteinase [Arabidopsis thaliana]
gi|332659371|gb|AEE84771.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 356
Score = 301 bits (770), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 156/346 (45%), Positives = 228/346 (65%), Gaps = 20/346 (5%)
Query: 16 TPMFIIITLLVSCASQVVSSRST---HEQSVVE---IHEKWMAQHGRSYKDEL-EKEMRL 68
T +F++I ++S S + +T H +S E I + WM++HG++Y + L EKE R
Sbjct: 10 TILFLLIVFVLSAPSSAMDLPATSGGHNRSNEEVEFIFQMWMSKHGKTYTNALGEKERRF 69
Query: 69 KIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKY 128
+ FK+NL +I++ N + N +Y+LG +F+DLT E+R L+ G P P R+ +S +Y
Sbjct: 70 QNFKDNLRFIDQHNAK-NLSYQLGLTRFADLTVQEYRDLFPG--SPKPKQRNLKTSR-RY 125
Query: 129 QNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQL 188
L+ +P S+DWR +GAV+ IK+Q C CWAF+ VAAVEG+ KI +G LI LSEQ+L
Sbjct: 126 VPLAGDQLPESVDWRQEGAVSEIKDQGTCNSCWAFSTVAAVEGLNKIVTGELISLSEQEL 185
Query: 189 LDCSTNGNNGCLG-GSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA--AAKISN 245
+DC+ NNGC G G + AF ++I N G+ +E +YPYQ G+C+ Q + I +
Sbjct: 186 VDCNL-VNNGCYGSGLMDTAFQFLINNNGLDSEKDYPYQGTQGSCNRKQSTSNKVITIDS 244
Query: 246 YEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
YE+VP+ DE +L KAV+ QPVS+ + S EF Y+ I+NG CGT LDHA+ IVG+G +
Sbjct: 245 YEDVPANDEISLQKAVAHQPVSVGVDKKSQEFMLYRSCIYNGPCGTNLDHALVIVGYG-S 303
Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
E+G +YW+++NSWG TWGDAGY+KI R+ +GLCGI +SYP+
Sbjct: 304 ENGQDYWIVRNSWGTTWGDAGYIKIARNFEDPKGLCGIAMLASYPI 349
>gi|302143380|emb|CBI21941.3| unnamed protein product [Vitis vinifera]
Length = 354
Score = 301 bits (770), Expect = 4e-79, Method: Compositional matrix adjust.
Identities = 149/307 (48%), Positives = 202/307 (65%), Gaps = 12/307 (3%)
Query: 47 HEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRA 106
+E+W+ QHGR YK+ E + I++ N+ +I N + N ++ L NQF+D+TN+E++A
Sbjct: 41 YERWLVQHGRRYKNRDEWQRHFGIYQSNVRFINYINAQ-NFSFTLTDNQFADMTNEEYKA 99
Query: 107 LYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAV 166
LY G S ++ S+FK + + +P S+DWR GAVTP++NQ ECG CWAF+ V
Sbjct: 100 LYMGLGTSETSRKN--QSSFKRERSKV--LPISVDWRKMGAVTPVRNQGECGSCWAFSTV 155
Query: 167 AAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEYPY 225
AAVEGI KIR+G L+ LSEQ+LLDC + GN GC GG AF +I QN GI T YPY
Sbjct: 156 AAVEGINKIRTGKLVSLSEQELLDCDIDSGNEGCNGGYMVNAFKFIKQNGGITTARNYPY 215
Query: 226 QAVPGTCSAAQKP-AAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGI 284
G C+ + KIS YE VP +E+ L AV+ QPVS+AI A EFQ Y +GI
Sbjct: 216 IGEQGICNKDKAANHVVKISGYETVPPNNEKILQAAVAKQPVSVAIDAGGYEFQLYSKGI 275
Query: 285 FNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR----DEGLCGIG 340
FNG CG QL+HAVT++G+G ++G YWL+KNSWG WG+AGY +++R DEG+CGI
Sbjct: 276 FNGFCGKQLNHAVTVIGYG-EDNGKKYWLVKNSWGTGWGEAGYARMIRDSRDDEGICGIA 334
Query: 341 TRSSYPL 347
+SYP+
Sbjct: 335 MEASYPI 341
>gi|302796898|ref|XP_002980210.1| hypothetical protein SELMODRAFT_153766 [Selaginella moellendorffii]
gi|300151826|gb|EFJ18470.1| hypothetical protein SELMODRAFT_153766 [Selaginella moellendorffii]
Length = 479
Score = 301 bits (770), Expect = 4e-79, Method: Compositional matrix adjust.
Identities = 151/322 (46%), Positives = 204/322 (63%), Gaps = 16/322 (4%)
Query: 38 THEQSVVEIHEKWMAQHGRSYKDEL--------EKEMRLKIFKENLEYIEKANKEGNRTY 89
+ E+ + + + WM QHG+SY D EK R IFK+NL +I N E N+ Y
Sbjct: 48 SSEERLQALFDSWMLQHGKSYADNALSGDSQAGEKATRYGIFKDNLRFIHGEN-EKNQGY 106
Query: 90 KLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVT 149
LG N F+DLTN+EFRA G + R T+ F+Y ++ + D+P S+DWR+KGAV
Sbjct: 107 FLGLNAFADLTNEEFRAQRHGGRFDRSRER-TSHEEFRYGSVQLKDLPDSIDWREKGAVV 165
Query: 150 PIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFA 209
+K+Q CG CWAF+AVAA+EG+ K+ +G L+ LSEQ+L+DC + GC GG + AF
Sbjct: 166 GVKDQGSCGSCWAFSAVAAIEGVNKLATGELVSLSEQELVDCDKGEDEGCNGGLMDYAFG 225
Query: 210 YIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSI 268
++I+N G+ TE +YPY+ C ++ A I YE+VP DE ALLKAV+ QPVS+
Sbjct: 226 FVIKNGGLDTEADYPYKGYGTRCDRSKMNAKVVTIDGYEDVPVNDETALLKAVAHQPVSV 285
Query: 269 AIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYM 328
AI A + Q Y+ GIF G CGT LDH VT VG+G EDG YW+IKNSWG+ WG+ GY+
Sbjct: 286 AIDAGGSSMQFYRSGIFTGRCGTDLDHGVTNVGYG-KEDGKAYWIIKNSWGSNWGEKGYV 344
Query: 329 KIVRD----EGLCGIGTRSSYP 346
K+ R+ GLCGI +SYP
Sbjct: 345 KMARNTGLAAGLCGINMEASYP 366
>gi|3377948|emb|CAA08860.1| cysteine proteinase precursor, AN8 [Ananas comosus]
Length = 356
Score = 301 bits (770), Expect = 4e-79, Method: Compositional matrix adjust.
Identities = 150/334 (44%), Positives = 207/334 (61%), Gaps = 11/334 (3%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
+F+ + L V AS +S +++ E+WM ++GR YKD EK R +IFK N+ +
Sbjct: 8 VFLFLFLCVMWASPSAASADEPSDPMMKRFEEWMVEYGRVYKDNDEKMRRFQIFKNNVNH 67
Query: 78 IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTG-YKMPSPSHRSTTSSTFKYQNLSMTDV 136
IE N +Y LG NQF+D+TN+EF A YTG P R S + ++ ++ V
Sbjct: 68 IETFNSRNKDSYTLGINQFTDMTNNEFVAQYTGGISRPLNIEREPVVS---FDDVDISAV 124
Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGN 196
P S+DWRD GAVT +KNQ CG CWAFAA+A VE I KI+ G L LSEQQ+LDC+
Sbjct: 125 PQSIDWRDYGAVTSVKNQNPCGACWAFAAIATVESIYKIKKGILEPLSEQQVLDCAK--G 182
Query: 197 NGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQA 256
GC GG +AF +II N+G+A+ YPY+A GTC P +A I+ Y VP +E +
Sbjct: 183 YGCKGGWEFRAFEFIISNKGVASVAIYPYKAAKGTCKTNGVPNSAYITGYARVPRNNESS 242
Query: 257 LLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKN 316
++ AVS QP+++A+ A + Q Y G+FNG CGT L+HAVT +G+G +G YW++KN
Sbjct: 243 MMYAVSKQPITVAVDANANS-QYYNSGVFNGPCGTSLNHAVTAIGYGQDSNGKKYWIVKN 301
Query: 317 SWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
SWG WG+AGY+++ RD G+CGI S YP
Sbjct: 302 SWGARWGEAGYIRMARDVSSSSGICGIAIDSLYP 335
>gi|255539310|ref|XP_002510720.1| cysteine protease, putative [Ricinus communis]
gi|223551421|gb|EEF52907.1| cysteine protease, putative [Ricinus communis]
Length = 349
Score = 300 bits (769), Expect = 5e-79, Method: Compositional matrix adjust.
Identities = 149/314 (47%), Positives = 211/314 (67%), Gaps = 11/314 (3%)
Query: 38 THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
T ++++ E W+++ GR Y+ EK R +IFK+NL +I+ NK+ R Y LG N+F+
Sbjct: 38 TSNDKLIDLFESWISRFGRVYESAEEKLERFEIFKDNLFHIDDTNKK-VRNYWLGLNEFA 96
Query: 98 DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKEC 157
DL+++EF+ Y G K P S R+ F Y++++ +P S+DWR KGAVTP+KNQ C
Sbjct: 97 DLSHEEFKNKYLGLK-PDLSKRAQCPEEFTYKDVA---IPKSVDWRKKGAVTPVKNQGSC 152
Query: 158 GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
G CWAF+ VAAVEGI +I +GNL LSEQ+L+DC T NNGC GG + AFAYI+ N G+
Sbjct: 153 GSCWAFSTVAAVEGINQIVTGNLTSLSEQELIDCDTTYNNGCNGGLMDYAFAYIVANGGL 212
Query: 218 ATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTE 276
E++YPY GTC ++ + A IS Y +VP E++LLKA++ QP+SIAI A +
Sbjct: 213 HKEEDYPYIMEEGTCDMRKEESDAVTISGYHDVPQNSEESLLKALANQPLSIAIEASGRD 272
Query: 277 FQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD--- 333
FQ Y G+F+G CGT+LDH V VG+GT++ G +Y ++KNSWG WG+ GY+++ R
Sbjct: 273 FQFYSGGVFDGHCGTELDHGVAAVGYGTSK-GLDYIIVKNSWGPKWGEKGYIRMKRKTSK 331
Query: 334 -EGLCGIGTRSSYP 346
EG+CGI +SYP
Sbjct: 332 PEGICGIYKMASYP 345
>gi|359483514|ref|XP_003632971.1| PREDICTED: LOW QUALITY PROTEIN: oryzain beta chain-like [Vitis
vinifera]
Length = 340
Score = 300 bits (769), Expect = 5e-79, Method: Compositional matrix adjust.
Identities = 148/340 (43%), Positives = 222/340 (65%), Gaps = 11/340 (3%)
Query: 18 MFIIITLLVSCASQVVS---SRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKEN 74
+F+ +TL + S SR HE S+ E HE+WMA++ R+YKD+ E+E R +FK+N
Sbjct: 3 LFVCMTLHIYYLEHRASEATSRPLHEASMYERHEQWMARYSRNYKDDAEEERRFXMFKDN 62
Query: 75 LEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT 134
+++I+ + GN KLG N +D+T++EFRA +K+P + +++F++QN+ T
Sbjct: 63 VDFIQTFDTAGNMPNKLGVNALADMTHEEFRASGNTFKIPPNLGLRSETTSFRHQNV--T 120
Query: 135 DVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN 194
+P+++DWR K VT IKNQ +CG CWAF+AVAA+EGI K+++ I LSEQ+L+DC
Sbjct: 121 RIPSTMDWRKKRTVTHIKNQLQCGGCWAFSAVAAMEGIAKLQTSKSISLSEQELVDCDIF 180
Query: 195 GNN-GCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSG 252
G+N GC GG + AF +IIQN+G+ +E Y Y+ V G C+ ++ + AA+I++YE +P
Sbjct: 181 GSNIGCEGGCMDDAFKFIIQNRGLNSEARYLYKGVEGHCNKKKESSRAARINDYENMPEF 240
Query: 253 DEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYW 312
E+ALLK V+ QP+S+AI A + FQ Y+ GI G LD+ VT G+G + DG +W
Sbjct: 241 SEKALLKVVAHQPISVAIDAGGSAFQFYEIGIITXESGNDLDYGVTTDGYGRSADGKKHW 300
Query: 313 LIKNSWGNTWGDAGYMKIVR----DEGLCGIGTRSSYPLA 348
L+KNSWG WG+ GY ++ R GLCG ++SYP A
Sbjct: 301 LVKNSWGTDWGENGYTRMERGVKATTGLCGFTMQASYPTA 340
>gi|537437|gb|AAC35211.1| cysteine proteinase [Hemerocallis hybrid cultivar]
Length = 359
Score = 300 bits (769), Expect = 6e-79, Method: Compositional matrix adjust.
Identities = 140/312 (44%), Positives = 202/312 (64%), Gaps = 6/312 (1%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
E+S+ ++EKW A H S +D + + R +FKEN+++I + N++ + TYKL N+F D+
Sbjct: 34 EESLWSLYEKWRAHHAVS-RDLDDTDKRFNVFKENVKFIHEFNQKKDATYKLALNKFGDM 92
Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGC 159
TN EFR+ Y G K+ ++ D+PTS+DWR+KGAVT +K+Q +CG
Sbjct: 93 TNQEFRSTYAGSKIDHHMTLRGVKDAGEFSYEKFHDLPTSVDWREKGAVTGVKDQGQCGS 152
Query: 160 CWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIAT 219
CWAF+ V AVEGI +I++ L+ LSEQQL+DC T N+GC GG + AF +I N G+++
Sbjct: 153 CWAFSTVVAVEGINQIKTNELVSLSEQQLVDCDTK-NSGCNGGLMDYAFDFIKNNGGLSS 211
Query: 220 EDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQS 279
ED YPY A +C + A I Y++VP +E AL+KAV+ QPVS+AI A FQ
Sbjct: 212 EDSYPYLAEQKSCGSEANSAVVTIDGYQDVPRNNEAALMKAVANQPVSVAIEASGYAFQF 271
Query: 280 YKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR----DEG 335
Y +G+F+G CGT+LDH V VG+G +DG YW++KNSWG WG++GY+++ R G
Sbjct: 272 YSQGVFSGHCGTELDHGVAAVGYGVDDDGKKYWIVKNSWGEGWGESGYIRMERGIKDKRG 331
Query: 336 LCGIGTRSSYPL 347
CGI +SYP+
Sbjct: 332 KCGIAMEASYPI 343
>gi|255547982|ref|XP_002515048.1| cysteine protease, putative [Ricinus communis]
gi|223546099|gb|EEF47602.1| cysteine protease, putative [Ricinus communis]
Length = 359
Score = 300 bits (769), Expect = 6e-79, Method: Compositional matrix adjust.
Identities = 149/315 (47%), Positives = 215/315 (68%), Gaps = 12/315 (3%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
E+S+ ++E+W + H S + EK R +FKENL++I K N++ +R YKL N+F+D+
Sbjct: 33 EESLWNLYERWRSHHTVS-RSLTEKNQRFNVFKENLKHIHKVNQK-DRPYKLRLNKFADM 90
Query: 100 TNDEFRALYTGYKMPSPS--HRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKEC 157
TN EF Y G K+ H S + F ++N S ++P+S+DWR +GAVT +K+Q +C
Sbjct: 91 TNHEFLQHYGGSKVSHYRMFHGSRRQTGFAHENTS--NLPSSIDWRKQGAVTGVKDQGKC 148
Query: 158 GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
G CWAF++VAAVEGI KI++G LI LSEQ+L+DC++ N+GC GG E+AF++I + G+
Sbjct: 149 GSCWAFSSVAAVEGINKIKTGELISLSEQELVDCNSV-NHGCDGGLMEQAFSFIEKTGGL 207
Query: 218 ATEDEYPYQAVPGTC-SAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTE 276
TE+ YPY+A G C SA I YE VP DE AL++AV+ QPVSIAI A +
Sbjct: 208 TTENNYPYRAKDGYCDSAKMNTPMVTIDGYEMVPENDEHALMQAVANQPVSIAIDAGGQD 267
Query: 277 FQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR---- 332
FQ Y EG++ G CGT+L+H V +VG+G T+DG YW++KNSWG+ WG+ G++++ R
Sbjct: 268 FQFYSEGVYTGDCGTELNHGVALVGYGATQDGTKYWIVKNSWGSEWGENGFIRMQRENDV 327
Query: 333 DEGLCGIGTRSSYPL 347
+EGLCGI +SYP+
Sbjct: 328 EEGLCGITLEASYPI 342
>gi|168063167|ref|XP_001783545.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664932|gb|EDQ51634.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 461
Score = 300 bits (768), Expect = 6e-79, Method: Compositional matrix adjust.
Identities = 149/317 (47%), Positives = 205/317 (64%), Gaps = 15/317 (4%)
Query: 39 HEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSD 98
HE ++E W +HG++Y D + R ++K+NL YI + E NRTY LG +F+D
Sbjct: 46 HENLLLEQFAAWAHKHGKAYHDAEQCLHRFAVWKDNLAYIRHS--ETNRTYSLGLTKFAD 103
Query: 99 LTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECG 158
LTN+EFR +YTG ++ S R+ + F+Y + ++ P S+DWR GAVT +K+Q CG
Sbjct: 104 LTNEEFRRMYTGTRIDR-SRRAKRRTGFRYAD---SEAPESVDWRKNGAVTSVKDQGSCG 159
Query: 159 CCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIA 218
CWAF+AV +VEGI IR+G + LSEQ+L+DC N GC GG + AF +IIQN GI
Sbjct: 160 SCWAFSAVGSVEGINAIRNGEAVSLSEQELVDCDLEYNQGCNGGLMDYAFDFIIQNGGID 219
Query: 219 TEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEF 277
TE +YPY+ G C ++K A I YE+VP DE+AL KAV+ QPVS+AI A +F
Sbjct: 220 TEKDYPYKGFDGRCDNSKKNAHVVTIDGYEDVPENDEEALKKAVAGQPVSVAIEAGGRDF 279
Query: 278 QSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD---- 333
Q Y +G+F+G CGT LDH V VG+G TEDG +YW++KNSWG WG++GY+++ R+
Sbjct: 280 QLYAQGVFSGECGTDLDHGVLAVGYG-TEDGVDYWIVKNSWGEYWGESGYLRMKRNMKDS 338
Query: 334 ---EGLCGIGTRSSYPL 347
GLCGI SY +
Sbjct: 339 NDGPGLCGINIEPSYAV 355
>gi|146215990|gb|ABQ10197.1| actinidin Act4a [Actinidia eriantha]
Length = 385
Score = 300 bits (768), Expect = 6e-79, Method: Compositional matrix adjust.
Identities = 148/310 (47%), Positives = 205/310 (66%), Gaps = 10/310 (3%)
Query: 43 VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTND 102
V+ + E W+ ++G+SY EKE R +IFK+NL ++++ N + NR+YK+G NQFSDLT+
Sbjct: 44 VIAMFESWLVEYGKSYNALGEKERRFEIFKDNLRFVDEHNADVNRSYKVGLNQFSDLTDA 103
Query: 103 EFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWA 162
E+ ++Y G K + T+ + +Y+ +P S+DWR KGAV +KNQ CG CW
Sbjct: 104 EYSSIYLGTKF----NIRMTNVSDRYEPRVGDQLPDSVDWRKKGAVLGVKNQGNCGSCWT 159
Query: 163 FAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIATED 221
FA++AAVEGI KI +GNLI LSEQ+++DC NNGC GG+ A+ +II N GI TE
Sbjct: 160 FASIAAVEGINKIVTGNLISLSEQEIVDCQRKYPNNGCNGGTLSGAYQFIINNGGINTEA 219
Query: 222 EYPYQAVPGTCSAAQKPAA-AKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSY 280
YPY G C +K I YE VPS +E+AL KAV+ QPVS+ IA+ ST F+SY
Sbjct: 220 NYPYTGRDGVCDQNKKNKKYVTIDRYENVPSNNEKALQKAVAFQPVSVVIASNSTAFKSY 279
Query: 281 KEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD---EGLC 337
K GIFNG CG ++DH VTIVG+G TE G +YW+++NSWG WG++GY+++ R+ G C
Sbjct: 280 KSGIFNGPCGPRIDHGVTIVGYG-TEGGKDYWIVRNSWGPNWGESGYVRMQRNVGGSGKC 338
Query: 338 GIGTRSSYPL 347
I YP+
Sbjct: 339 FIARAPVYPV 348
>gi|1223922|gb|AAA92063.1| cysteinyl endopeptidase [Vigna radiata]
Length = 362
Score = 300 bits (768), Expect = 6e-79, Method: Compositional matrix adjust.
Identities = 149/316 (47%), Positives = 203/316 (64%), Gaps = 12/316 (3%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
E+S+ +++E+W + H S + EK R +FKEN+ ++ NK ++ YKL N+F+D+
Sbjct: 33 EESLWDLYERWRSHHTVS-RSLTEKHKRFNVFKENVMHVHNTNKM-DKPYKLKLNKFADM 90
Query: 100 TNDEFRALYTGYKMPSPSHRSTT---SSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKE 156
TN EFR+ Y G K+ T + TF Y+ + VP S+DWR KGAVT +K+Q +
Sbjct: 91 TNHEFRSTYAGSKVNHHKMFRGTQHGNGTFMYEKVG--SVPASVDWRKKGAVTDVKDQGQ 148
Query: 157 CGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQG 216
CG CWAF+ V AVEGI +I++ L+ LSEQ+L+DC N GC GG E AF +I Q G
Sbjct: 149 CGSCWAFSTVVAVEGINQIKTDKLVSLSEQELVDCDKEENQGCNGGLMESAFEFIKQKGG 208
Query: 217 IATEDEYPYQAVPGTCSAAQ-KPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYST 275
I TE YPY A GTC A++ A I +E VP DE ALLKAV+ QPVS+AI A +
Sbjct: 209 ITTESNYPYTAQEGTCDASKVNDLAVSIDGHENVPVNDENALLKAVANQPVSVAIDAGGS 268
Query: 276 EFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-- 333
+FQ Y EG+ G C T L+H V IVG+GTT DG NYW+++NSWG WG+ GY+++ R+
Sbjct: 269 DFQFYSEGVLTGDCNTDLNHGVAIVGYGTTVDGTNYWIVRNSWGPEWGEQGYIRMQRNIS 328
Query: 334 --EGLCGIGTRSSYPL 347
EGLCGI +SYP+
Sbjct: 329 KKEGLCGIAMMASYPI 344
>gi|3451077|emb|CAA20473.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|7269200|emb|CAB79307.1| cysteine proteinase-like protein [Arabidopsis thaliana]
Length = 355
Score = 300 bits (768), Expect = 7e-79, Method: Compositional matrix adjust.
Identities = 156/345 (45%), Positives = 227/345 (65%), Gaps = 19/345 (5%)
Query: 16 TPMFIIITLLVSCASQVVSSRST---HEQSVVE---IHEKWMAQHGRSYKDEL-EKEMRL 68
T +F++I ++S S + +T H +S E I + WM++HG++Y + L EKE R
Sbjct: 10 TILFLLIVFVLSAPSSAMDLPATSGGHNRSNEEVEFIFQMWMSKHGKTYTNALGEKERRF 69
Query: 69 KIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKY 128
+ FK+NL +I++ N + N +Y+LG +F+DLT E+R L+ G P P R+ +S +Y
Sbjct: 70 QNFKDNLRFIDQHNAK-NLSYQLGLTRFADLTVQEYRDLFPG--SPKPKQRNLKTSR-RY 125
Query: 129 QNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQL 188
L+ +P S+DWR +GAV+ IK+Q C CWAF+ VAAVEG+ KI +G LI LSEQ+L
Sbjct: 126 VPLAGDQLPESVDWRQEGAVSEIKDQGTCNSCWAFSTVAAVEGLNKIVTGELISLSEQEL 185
Query: 189 LDCSTNGNNGCLG-GSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQ-KPAAAKISNY 246
+DC+ NNGC G G + AF ++I N G+ +E +YPYQ G+C+ Q I +Y
Sbjct: 186 VDCNL-VNNGCYGSGLMDTAFQFLINNNGLDSEKDYPYQGTQGSCNRKQVHLLVITIDSY 244
Query: 247 EEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTE 306
E+VP+ DE +L KAV+ QPVS+ + S EF Y+ I+NG CGT LDHA+ IVG+G +E
Sbjct: 245 EDVPANDEISLQKAVAHQPVSVGVDKKSQEFMLYRSCIYNGPCGTNLDHALVIVGYG-SE 303
Query: 307 DGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
+G +YW+++NSWG TWGDAGY+KI R+ +GLCGI +SYP+
Sbjct: 304 NGQDYWIVRNSWGTTWGDAGYIKIARNFEDPKGLCGIAMLASYPI 348
>gi|449525012|ref|XP_004169515.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
Length = 459
Score = 300 bits (768), Expect = 8e-79, Method: Compositional matrix adjust.
Identities = 152/335 (45%), Positives = 219/335 (65%), Gaps = 12/335 (3%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKD-ELEKEMRLKIFKENLE 76
F+ I L + S ++ R+ E V+ ++++W A+HG+ + + E E R IFK+NL+
Sbjct: 14 FFLFIALSAASPSSIIPQRTDDE--VMALYDQWRAKHGKLHNNLGAEPENRFHIFKDNLK 71
Query: 77 YIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDV 136
+I++ N + N Y+LG N F+DLTN+E+R+ Y G K S S R+ TS+ +Y D+
Sbjct: 72 FIDEINAQ-NLPYRLGLNVFADLTNEEYRSRYLGGKFASGSRRNRTSN--RYLPRLGDDL 128
Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGN 196
P S+DWR KGAV P+K+Q CG CWAF+ VA+VE I +I +G+LI LSEQ+L+DC + N
Sbjct: 129 PDSIDWRAKGAVAPVKDQGSCGSCWAFSTVASVEAINQIVTGDLIALSEQELVDCDRSYN 188
Query: 197 NGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQ 255
GC GG + AF +II+N G+ TE++YPY +C +K A I +YE+VP +E+
Sbjct: 189 EGCNGGLMDYAFEFIIENGGLDTEEDYPYYGFDSSCIQYKKNAKVVAIDSYEDVPVNNEK 248
Query: 256 ALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIK 315
AL KAVS Q VS+AI FQ Y+ GIF G CGT LDH V +VG+G +E G +YW+++
Sbjct: 249 ALQKAVSKQVVSVAIEGGGRSFQLYQSGIFTGRCGTDLDHGVNVVGYG-SEGGVDYWIVR 307
Query: 316 NSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
NSWG +WG++GY+K+ R+ GLCGI SYP
Sbjct: 308 NSWGGSWGESGYVKMQRNIASPTGLCGIAMEPSYP 342
>gi|449460678|ref|XP_004148072.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Cucumis
sativus]
Length = 317
Score = 300 bits (767), Expect = 9e-79, Method: Compositional matrix adjust.
Identities = 155/316 (49%), Positives = 208/316 (65%), Gaps = 16/316 (5%)
Query: 37 STHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQF 96
S+ + + ++KWM ++GR YK E E R I++ N++YI+ N N ++ L N F
Sbjct: 9 SSCSSDIQDRYQKWMDKYGRQYKSREEWERRFTIYQANVQYIDNFNSM-NHSHTLAENNF 67
Query: 97 SDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKE 156
+DLTN+EF+A Y GYK S + F+Y N M ++PT++DWR +GAVTPIKNQ +
Sbjct: 68 ADLTNEEFKATYLGYKTVS-----IPDTCFRYGN--MVNLPTNVDWRQEGAVTPIKNQGQ 120
Query: 157 CGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-TNGNNGCLGGSREKAFAYIIQNQ 215
CG CWAF+AVAAVEGI KI++G LI LSEQ+L+DC T+GN GC GG KAF + I+
Sbjct: 121 CGSCWAFSAVAAVEGINKIKAGKLISLSEQELVDCDVTSGNQGCNGGYMYKAFEF-IKRT 179
Query: 216 GIATEDEYPYQAVPGTCS-AAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYS 274
G+ TE EYPYQ C+ +K IS YE+VP DE++L AV+ QPVS+AI A
Sbjct: 180 GLTTEIEYPYQGAESACNEQKEKYQFVSISGYEKVPVNDEKSLKAAVANQPVSVAIDAEG 239
Query: 275 TEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD- 333
FQ Y GIF+G CG QL+H V IVG+G T + A YWL+KNSWG WG++GY+++ RD
Sbjct: 240 NNFQFYSGGIFSGNCGNQLNHGVAIVGYGETSNQA-YWLVKNSWGTDWGESGYIRMKRDS 298
Query: 334 ---EGLCGIGTRSSYP 346
+G CGI +SYP
Sbjct: 299 TDRQGTCGIAMMASYP 314
>gi|449524070|ref|XP_004169046.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like, partial
[Cucumis sativus]
Length = 314
Score = 299 bits (766), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 155/316 (49%), Positives = 208/316 (65%), Gaps = 16/316 (5%)
Query: 37 STHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQF 96
S+ + + ++KWM ++GR YK E E R I++ N++YI+ N N ++ L N F
Sbjct: 9 SSCSSDIQDRYQKWMDKYGRQYKSREEWERRFTIYQANVQYIDNFNSM-NHSHTLAENNF 67
Query: 97 SDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKE 156
+DLTN+EF+A Y GYK S + F+Y N M ++PT++DWR +GAVTPIKNQ +
Sbjct: 68 ADLTNEEFKATYLGYKTVS-----IPDTCFRYGN--MVNLPTNVDWRQEGAVTPIKNQGQ 120
Query: 157 CGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-TNGNNGCLGGSREKAFAYIIQNQ 215
CG CWAF+AVAAVEGI KI++G LI LSEQ+L+DC T+GN GC GG KAF + I+
Sbjct: 121 CGSCWAFSAVAAVEGINKIKAGKLISLSEQELVDCDVTSGNQGCNGGYMYKAFEF-IKRT 179
Query: 216 GIATEDEYPYQAVPGTCS-AAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYS 274
G+ TE EYPYQ C+ +K IS YE+VP DE++L AV+ QPVS+AI A
Sbjct: 180 GLTTEIEYPYQGAESACNEQKEKYQFVSISGYEKVPVNDEKSLKAAVANQPVSVAIDAEG 239
Query: 275 TEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD- 333
FQ Y GIF+G CG QL+H V IVG+G T + A YWL+KNSWG WG++GY+++ RD
Sbjct: 240 NNFQFYSGGIFSGNCGNQLNHGVAIVGYGETSNQA-YWLVKNSWGTDWGESGYIRMKRDS 298
Query: 334 ---EGLCGIGTRSSYP 346
+G CGI +SYP
Sbjct: 299 TDKQGTCGIAMMASYP 314
>gi|146215982|gb|ABQ10193.1| actinidin Act2b [Actinidia eriantha]
Length = 378
Score = 299 bits (766), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 155/340 (45%), Positives = 208/340 (61%), Gaps = 11/340 (3%)
Query: 13 INTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFK 72
I+ + +F L++S A + +S V+ ++E W+ + G+SY EKEMR +IFK
Sbjct: 8 ISMSLLFFSTLLILSLALDIENSVQRTNDQVMAMYESWLVEQGKSYNSLDEKEMRFEIFK 67
Query: 73 ENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
ENL I+ N + NR+Y LG N+F+DLT++E+R+ Y G KM + S +Y
Sbjct: 68 ENLRIIDDHNADANRSYSLGLNRFADLTDEEYRSTYLGLKMGPKTDVSN-----EYMPKV 122
Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
+P +DWR GAV +KNQ C CWAF+AV AVEGI KI +GNLI LSEQ+L+DC
Sbjct: 123 GEALPDYVDWRTVGAVVGVKNQGLCSSCWAFSAVTAVEGINKIVTGNLISLSEQELVDCG 182
Query: 193 -TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAA-AKISNYEEVP 250
T GC G AF +II N GI TED YPY A G C+ + K I NY+ VP
Sbjct: 183 RTQRTKGCNRGLMTDAFQFIINNGGINTEDNYPYTAKDGQCNLSLKNQKYVTIDNYKNVP 242
Query: 251 SGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGAN 310
S +E AL KAV+ QPVS+ + + +F+ Y GIF G CGT +DH VTIVG+G TE G +
Sbjct: 243 SNNEMALKKAVAYQPVSVGVESEGGKFKLYTSGIFTGFCGTAVDHGVTIVGYG-TERGMD 301
Query: 311 YWLIKNSWGNTWGDAGYMKIVRD---EGLCGIGTRSSYPL 347
YW++KNSWG WG+ GY++I R+ G CGI SYP+
Sbjct: 302 YWIVKNSWGTNWGENGYIRIQRNIGGAGKCGIARMPSYPV 341
>gi|351721126|ref|NP_001237199.1| cysteine proteinase precursor [Glycine max]
gi|31559530|dbj|BAC77523.1| cysteine proteinase [Glycine max]
gi|31559532|dbj|BAC77524.1| cysteine proteinase [Glycine max]
Length = 362
Score = 299 bits (766), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 156/347 (44%), Positives = 214/347 (61%), Gaps = 26/347 (7%)
Query: 19 FIIITLLVSCASQVVSSRSTH------EQSVVEIHEKWMAQH--GRSYKDELEKEMRLKI 70
F+ + L +S V +S H E+S+ +++E+W + H RS D K R +
Sbjct: 6 FLWVVLSLSLVLGVANSFDFHDKDLESEESLWDLYERWRSHHTVSRSLGD---KHKRFNV 62
Query: 71 FKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHR-----STTSST 125
FK N+ ++ NK ++ YKL N+F+D+TN EFR+ Y G K+ HR + T
Sbjct: 63 FKANMMHVHNTNKM-DKPYKLKLNKFADMTNHEFRSTYAGSKVNH--HRMFRDMPRGNGT 119
Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
F Y+ + VP S+DWR KGAVT +K+Q CG CWAF+ V AVEGI +I++ L+ LSE
Sbjct: 120 FMYEKVG--SVPASVDWRKKGAVTDVKDQGHCGSCWAFSTVVAVEGINQIKTNKLVSLSE 177
Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQ-KPAAAKIS 244
Q+L+DC T N GC GG E AF +I Q GI TE YPY A GTC A++ A I
Sbjct: 178 QELVDCDTEENAGCNGGLMESAFQFIKQKGGITTESYYPYTAQDGTCDASKANDLAVSID 237
Query: 245 NYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGT 304
+E VP DE ALLKAV+ QPVS+AI A ++FQ Y EG+F G C T+L+H V IVG+G
Sbjct: 238 GHENVPGNDENALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGDCSTELNHGVAIVGYGA 297
Query: 305 TEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
T DG +YW+++NSWG WG+ GY+++ R+ EGLCGI +SYP+
Sbjct: 298 TVDGTSYWIVRNSWGPEWGELGYIRMQRNISKKEGLCGIAMLASYPI 344
>gi|242074728|ref|XP_002447300.1| hypothetical protein SORBIDRAFT_06g032360 [Sorghum bicolor]
gi|241938483|gb|EES11628.1| hypothetical protein SORBIDRAFT_06g032360 [Sorghum bicolor]
Length = 471
Score = 299 bits (766), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 146/317 (46%), Positives = 208/317 (65%), Gaps = 11/317 (3%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDEL-EKEMRLKIFKENLEYIEKAN-KEGNRTYKLGTNQFS 97
E V ++E+WMA+HG++ + L E + R + F +NL +++ N + G R Y+LG N+F+
Sbjct: 45 EAQVRAMYEQWMARHGKAASNALGEHDRRFRAFWDNLRFVDAHNARAGARGYRLGINRFA 104
Query: 98 DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKEC 157
DLTN EFRA Y + + +T ++ +Y++ + +P +DWR KGAV P+KNQ +C
Sbjct: 105 DLTNAEFRAAY--LSAGARNGTATAATGERYRHDGVEALPEFVDWRQKGAVAPVKNQGQC 162
Query: 158 GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFAYIIQNQG 216
G CWAF+AV AVEGI +I +G L+ LSEQ+L+DCS NG N GC GG + AFA+I+ N G
Sbjct: 163 GSCWAFSAVGAVEGINQIVTGELVTLSEQELVDCSKNGQNGGCDGGMMDDAFAFIVGNGG 222
Query: 217 IATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYST 275
I T+ +YPY A G C A++ I +E VP DE++L KAV+ QPV++AI A
Sbjct: 223 IDTDKDYPYTARDGKCDVAKRSRHVVSIDGFEGVPRNDEKSLQKAVAHQPVAVAIEAGGR 282
Query: 276 EFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGA-NYWLIKNSWGNTWGDAGYMKIVRD- 333
EFQ Y+ G+F G CGT LDH V VG+GT DG +YWL++NSWG WG+ GY+++ R+
Sbjct: 283 EFQLYQSGVFTGRCGTSLDHGVVAVGYGTEADGGRDYWLVRNSWGADWGEGGYIRMERNV 342
Query: 334 ---EGLCGIGTRSSYPL 347
G CGI +SYP+
Sbjct: 343 GARAGKCGIAMEASYPV 359
>gi|147772785|emb|CAN62838.1| hypothetical protein VITISV_003391 [Vitis vinifera]
Length = 298
Score = 299 bits (766), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 157/334 (47%), Positives = 210/334 (62%), Gaps = 55/334 (16%)
Query: 21 IITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEK 80
++ +L + ASQ +SRS HE S+ E HE WMA++GR YKD EKE R KIFK+N+
Sbjct: 14 LLFILAAWASQA-TSRSLHEASMYERHEDWMARYGRMYKDANEKEKRFKIFKDNV----- 67
Query: 81 ANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSL 140
++TFKY+N+ T VP+++
Sbjct: 68 ----------------------------------------AQATTFKYENV--TAVPSTI 85
Query: 141 DWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGC 199
DWR KGAVTPIK+Q++CG CWAF+AVAA EGIT+I +G LI LSEQ+L+DC T G N GC
Sbjct: 86 DWRKKGAVTPIKDQQQCGSCWAFSAVAATEGITQITTGKLISLSEQELVDCDTGGENQGC 145
Query: 200 LGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKP-AAAKISNYEEVPSGDEQALL 258
GG + AF +I + G+A+E YPY+ GTC++ ++ AAKI YE+VP+ +E+AL
Sbjct: 146 SGGLXDDAFRFIXIH-GLASEATYPYEGDDGTCNSKKEAHPAAKIKGYEDVPANNEKALQ 204
Query: 259 KAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSW 318
KAV+ QPV++AI A EFQ Y G+F G CGT+LDH V VG+G +DG YWL+KNSW
Sbjct: 205 KAVAHQPVAVAIDAGGFEFQFYTSGVFTGQCGTELDHGVAAVGYGIGDDGMXYWLVKNSW 264
Query: 319 GNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
G WG+ GY+++ RD EGLCGI ++SYP A
Sbjct: 265 GTGWGEEGYIRMQRDVTAKEGLCGIAMQASYPTA 298
>gi|50355615|dbj|BAD29956.1| cysteine protease [Daucus carota]
Length = 423
Score = 299 bits (766), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 144/302 (47%), Positives = 207/302 (68%), Gaps = 10/302 (3%)
Query: 51 MAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTG 110
+ +H ++Y KE R +IFK+NL +I++ NK N+++KLG N+F+DL+N+E+++++ G
Sbjct: 11 LVKHHKNYNALGAKEKRFEIFKDNLRFIDEHNKGVNQSFKLGLNKFADLSNEEYKSMFLG 70
Query: 111 YKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVE 170
+M + S FKY ++P S+DWR+KGAV P+K+Q +CG CWAF+ VAAVE
Sbjct: 71 GRMVR-DRKGFESDRFKYG--VGDELPQSVDWREKGAVAPVKDQGQCGSCWAFSTVAAVE 127
Query: 171 GITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPG 230
GI +I +G+LI LSEQ+L+DC N GC GG + AF +I++N GI TED+YPY+ V G
Sbjct: 128 GINQIATGDLISLSEQELVDCDKGFNQGCNGGFMDYAFEFIVKNGGIDTEDDYPYKGVDG 187
Query: 231 TCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVC 289
C +K A I+ +E+VP DE++L KAV+ QPVS+AI A FQ Y+ GIFNG+C
Sbjct: 188 QCDQNRKNAKVVTINGFEDVPQNDEKSLKKAVAHQPVSVAIEAGGRAFQLYESGIFNGLC 247
Query: 290 GTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR-----DEGLCGIGTRSS 344
GT LDH V VG+G TEDG +YW+++NSWG WG+ GY+++ R + G CGI + S
Sbjct: 248 GTDLDHGVVAVGYG-TEDGKDYWIVRNSWGPNWGENGYIRLERNVASTNTGKCGIAMQPS 306
Query: 345 YP 346
YP
Sbjct: 307 YP 308
>gi|297809383|ref|XP_002872575.1| hypothetical protein ARALYDRAFT_911472 [Arabidopsis lyrata subsp.
lyrata]
gi|297318412|gb|EFH48834.1| hypothetical protein ARALYDRAFT_911472 [Arabidopsis lyrata subsp.
lyrata]
Length = 371
Score = 299 bits (766), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 152/355 (42%), Positives = 220/355 (61%), Gaps = 26/355 (7%)
Query: 16 TPMFIIITLLVSCAS----QVVSSRSTHEQSVVE-------------IHEKWMAQHGRSY 58
T + ++ ++ SCA+ VVSS + H + I + WM +HG+ Y
Sbjct: 8 TLILLVAMVITSCATAMDMSVVSSNNNHHLTTSPGRLHSGFDAEASLIFDSWMVKHGKVY 67
Query: 59 KDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSH 118
EKE RL IF++NL +I N E N +Y+LG QF+DL+ E+ + G P +
Sbjct: 68 GSVAEKERRLTIFEDNLRFISNRNAE-NLSYRLGLTQFADLSLHEYGEVCHGADPRPPRN 126
Query: 119 RSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSG 178
+S+ +Y+ + +P S+DWR++GAVT +K+Q C CWAF+ V AVEG+ KI +G
Sbjct: 127 HVFMTSSDRYKTSAGDVLPKSVDWRNEGAVTEVKDQGHCRSCWAFSTVGAVEGLNKIVTG 186
Query: 179 NLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKP 238
L+ LSEQ L++C+ NNGC GG E A+ +I++N G+ T+++YPY+AV G C K
Sbjct: 187 ELVTLSEQDLINCNKE-NNGCGGGKVETAYEFIMKNGGLGTDNDYPYKAVNGVCDGRLKE 245
Query: 239 --AAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHA 296
I +E +P+ DE AL+KAV+ QPV+ I + S EFQ Y+ G+F+G CGT L+H
Sbjct: 246 NNKNVMIDGFENLPANDEFALMKAVAHQPVTAVIDSSSREFQLYESGVFDGSCGTNLNHG 305
Query: 297 VTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
V +VG+G TE+G +YWL+KNS GNTWG+AGYMK+ R+ GLCGI R+SYPL
Sbjct: 306 VVVVGYG-TENGRDYWLVKNSRGNTWGEAGYMKMARNIANPRGLCGIAMRASYPL 359
>gi|2463588|dbj|BAA22546.1| FB1035 precursor [Ananas comosus]
Length = 324
Score = 299 bits (765), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 142/303 (46%), Positives = 198/303 (65%), Gaps = 10/303 (3%)
Query: 48 EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRAL 107
E+WMA++GR YKD EK R +IFK N+++IE N +Y LG NQF+D+T EF A
Sbjct: 11 EEWMAEYGRIYKDNDEKMRRFQIFKNNVKHIETFNSRNGNSYTLGINQFTDMTKSEFVAQ 70
Query: 108 YTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVA 167
YTG +P R S + +++++ VP S+DWRD GAV +KNQ CG CWAFAA+A
Sbjct: 71 YTGVSLPLNIEREPVVS---FDDVNISAVPQSIDWRDYGAVNEVKNQNPCGSCWAFAAIA 127
Query: 168 AVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQA 227
VEGI KI++G L+ LSEQ++LDC+ + GC GG KA+ +II N G+ TE+ YPYQA
Sbjct: 128 TVEGIYKIKTGYLVSLSEQEVLDCAV--SYGCKGGWVNKAYDFIISNNGVTTEENYPYQA 185
Query: 228 VPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNG 287
GTC+A P +A I+ Y V DE++++ AVS QP++ I A S FQ Y G+F+G
Sbjct: 186 YQGTCNANSFPNSAYITGYSYVRRNDERSMMYAVSNQPIAALIDA-SENFQYYNGGVFSG 244
Query: 288 VCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR----DEGLCGIGTRS 343
CGT L+HA+TI+G+G G YW+++NSWG++WG+ GY+++ R G CGI
Sbjct: 245 PCGTSLNHAITIIGYGQDSSGTKYWIVRNSWGSSWGEGGYVRMARGVSSSSGACGIAMSP 304
Query: 344 SYP 346
+P
Sbjct: 305 LFP 307
>gi|3980198|emb|CAA46863.1| thiolprotease [Pisum sativum]
Length = 464
Score = 299 bits (765), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 148/349 (42%), Positives = 223/349 (63%), Gaps = 19/349 (5%)
Query: 16 TPMFIIITLLVSCASQ--VVSSRSTH--------EQSVVEIHEKWMAQHGRSYKDELEKE 65
T +FI +T +S A ++S TH V+ ++E+W+ +HG++Y EKE
Sbjct: 6 TILFITLTFTLSLALDMCIISYDKTHPDKSTPRTNDQVLTMYEEWLVKHGKNYNALGEKE 65
Query: 66 MRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKM-PSPSHRSTTSS 124
R +IFK+NL +I++ N + N +++LG N+F+DLTN+E+R + G ++ P+ +R S
Sbjct: 66 KRFEIFKDNLGFIDEHNSK-NLSFRLGLNRFADLTNEEYRTRFLGTRINPNRRNRKVNSQ 124
Query: 125 TFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLS 184
T +Y +P S+DWR +GAV +K+Q CG CWAF+A+AAVEG+ K+ +G+LI LS
Sbjct: 125 TNRYATRVGDKLPESVDWRKEGAVVGVKDQGSCGSCWAFSAIAAVEGVNKLATGDLISLS 184
Query: 185 EQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKI 243
EQ+L+DC T+ N GC GG + AF +II + E++YPY+A+ G C +K A I
Sbjct: 185 EQELVDCDTSYNEGCNGGLMDYAFEFIINMVALTPEEDYPYRAIDGRCDQNRKNAKVVSI 244
Query: 244 SNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFG 303
YE+VP+ DE AL KAV+ Q +++A+ EFQ Y G+F G CGT LDH V VG+G
Sbjct: 245 DQYEDVPAYDEGALKKAVANQVIAVAVEGGGREFQLYDSGVFTGRCGTALDHGVAAVGYG 304
Query: 304 TTEDGANYWLIKNSWGNTWGDAGYMKIVRD-----EGLCGIGTRSSYPL 347
TE+G +YW+++NSWG +WG+AGY+++ R+ G CGI SYP+
Sbjct: 305 -TENGKDYWIVRNSWGGSWGEAGYIRLERNLATSKSGKCGIAIEPSYPI 352
>gi|302759380|ref|XP_002963113.1| hypothetical protein SELMODRAFT_270344 [Selaginella moellendorffii]
gi|300169974|gb|EFJ36576.1| hypothetical protein SELMODRAFT_270344 [Selaginella moellendorffii]
Length = 479
Score = 299 bits (765), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 150/322 (46%), Positives = 204/322 (63%), Gaps = 16/322 (4%)
Query: 38 THEQSVVEIHEKWMAQHGRSYKDEL--------EKEMRLKIFKENLEYIEKANKEGNRTY 89
+ E+ + + + WM QHG+SY + EK R IFK+NL +I N E N+ Y
Sbjct: 48 SSEERLQALFDSWMLQHGKSYAENALSGDSQAGEKATRYGIFKDNLRFIHGEN-EKNQGY 106
Query: 90 KLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVT 149
LG N F+DLTN+EFRA G + R T+ F+Y ++ + D+P S+DWR+KGAV
Sbjct: 107 FLGLNAFADLTNEEFRAQRHGGRFDRSRER-TSYEEFRYGSVQLKDLPDSIDWREKGAVV 165
Query: 150 PIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFA 209
+K+Q CG CWAF+AVAA+EG+ K+ +G L+ LSEQ+L+DC + GC GG + AF
Sbjct: 166 GVKDQGSCGSCWAFSAVAAIEGVNKLATGELVSLSEQELVDCDKGEDEGCNGGLMDYAFG 225
Query: 210 YIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSI 268
++I+N G+ TE +YPY+ C ++ A I YE+VP DE ALLKAV+ QPVS+
Sbjct: 226 FVIKNGGLDTEADYPYKGYGTRCDRSKMNAKVVTIDGYEDVPVNDETALLKAVAHQPVSV 285
Query: 269 AIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYM 328
AI A + Q Y+ GIF G CGT LDH VT VG+G EDG YW+IKNSWG+ WG+ GY+
Sbjct: 286 AIDAGGSSMQFYRSGIFTGRCGTDLDHGVTNVGYG-KEDGKAYWIIKNSWGSNWGEKGYI 344
Query: 329 KIVRD----EGLCGIGTRSSYP 346
K+ R+ GLCGI +SYP
Sbjct: 345 KMARNTGLAAGLCGINMEASYP 366
>gi|18423124|ref|NP_568722.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|75309064|sp|Q9FGR9.1|CEP1_ARATH RecName: Full=KDEL-tailed cysteine endopeptidase CEP1; AltName:
Full=Cysteine proteinase CP56; Short=AtCP56; Flags:
Precursor
gi|9759028|dbj|BAB09397.1| cysteine endopeptidase [Arabidopsis thaliana]
gi|20258850|gb|AAM13907.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|308097832|gb|ADO14465.1| papain [Arabidopsis thaliana]
gi|332008536|gb|AED95919.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 361
Score = 299 bits (765), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 151/345 (43%), Positives = 216/345 (62%), Gaps = 22/345 (6%)
Query: 19 FIIITLLVSCASQVVSSRSTH------EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFK 72
FI++ L + + H E S+ E++E+W + H + E EK R +FK
Sbjct: 4 FIVLALCMLMVLETTKGLDFHNKDVESENSLWELYERWRSHHTVARSLE-EKAKRFNVFK 62
Query: 73 ENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTG-----YKMPSPSHRSTTSSTFK 127
N+++I + NK+ +++YKL N+F D+T++EFR Y G ++M ++T S F
Sbjct: 63 HNVKHIHETNKK-DKSYKLKLNKFGDMTSEEFRRTYAGSNIKHHRMFQGEKKATKS--FM 119
Query: 128 YQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQ 187
Y N++ +PTS+DWR GAVTP+KNQ +CG CWAF+ V AVEGI +IR+ L LSEQ+
Sbjct: 120 YANVNT--LPTSVDWRKNGAVTPVKNQGQCGSCWAFSTVVAVEGINQIRTKKLTSLSEQE 177
Query: 188 LLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNY 246
L+DC TN N GC GG + AF +I + G+ +E YPY+A TC ++ A I +
Sbjct: 178 LVDCDTNQNQGCNGGLMDLAFEFIKEKGGLTSELVYPYKASDETCDTNKENAPVVSIDGH 237
Query: 247 EEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTE 306
E+VP E L+KAV+ QPVS+AI A ++FQ Y EG+F G CGT+L+H V +VG+GTT
Sbjct: 238 EDVPKNSEDDLMKAVANQPVSVAIDAGGSDFQFYSEGVFTGRCGTELNHGVAVVGYGTTI 297
Query: 307 DGANYWLIKNSWGNTWGDAGYMKIVR----DEGLCGIGTRSSYPL 347
DG YW++KNSWG WG+ GY+++ R EGLCGI +SYPL
Sbjct: 298 DGTKYWIVKNSWGEEWGEKGYIRMQRGIRHKEGLCGIAMEASYPL 342
>gi|222629922|gb|EEE62054.1| hypothetical protein OsJ_16838 [Oryza sativa Japonica Group]
Length = 336
Score = 298 bits (764), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 148/298 (49%), Positives = 200/298 (67%), Gaps = 10/298 (3%)
Query: 56 RSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPS 115
++Y EK R ++FK+NL +I+ NK+ +Y LG N+F+DLT+DEF+A Y G P
Sbjct: 38 KAYASFEEKVRRFEVFKDNLNHIDDINKK-VTSYWLGLNEFADLTHDEFKATYLGL-TPP 95
Query: 116 PSHRST---TSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGI 172
P+ ++ +S F+Y +S +VP +DWR K AVT +KNQ +CG CWAF+ VAAVEGI
Sbjct: 96 PTRSNSKHYSSEEFRYGKMSNGEVPKEMDWRKKNAVTEVKNQGQCGSCWAFSTVAAVEGI 155
Query: 173 TKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTC 232
I +GNL LSEQ+L+DCST+GNNGC GG + AF+YI G+ TE+ YPY G C
Sbjct: 156 NAIVTGNLTSLSEQELIDCSTDGNNGCNGGLMDYAFSYIASTGGLRTEEAYPYAMEEGDC 215
Query: 233 SAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQ 292
+ A IS YE+VP+ DEQAL+KA++ QPVS+AI A FQ Y G+F+G CG Q
Sbjct: 216 DEGKGAAVVTISGYEDVPANDEQALVKALAHQPVSVAIEASGRHFQFYSGGVFDGPCGEQ 275
Query: 293 LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR----DEGLCGIGTRSSYP 346
LDH VT VG+GT++ G +Y ++KNSWG WG+ GY+++ R EGLCGI +SYP
Sbjct: 276 LDHGVTAVGYGTSK-GQDYIIVKNSWGPHWGEKGYIRMKRGTGKGEGLCGINKMASYP 332
>gi|356563155|ref|XP_003549830.1| PREDICTED: vignain-like [Glycine max]
Length = 361
Score = 298 bits (764), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 150/318 (47%), Positives = 207/318 (65%), Gaps = 17/318 (5%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
E+ + +++E+W + H S + EK R +FK N+ ++ +NK ++ YKL N+F+D+
Sbjct: 33 EEGLWDLYERWRSHHTVS-RSLDEKHNRFNVFKGNVMHVHSSNKM-DKPYKLKLNRFADM 90
Query: 100 TNDEFRALYTGYKMPSPSHR-----STTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQ 154
TN EFR++Y G K+ HR + TF YQN+ VP+S+DWR KGAVT +K+Q
Sbjct: 91 TNHEFRSIYAGSKVNH--HRMFRGTPRGNGTFMYQNVDR--VPSSVDWRKKGAVTDVKDQ 146
Query: 155 KECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQN 214
+CG CWAF+ + AVEGI +I++ L+ LSEQ+L+DC T N GC GG E AF +I Q
Sbjct: 147 GQCGSCWAFSTIVAVEGINQIKTHKLVPLSEQELVDCDTTQNQGCNGGLMESAFEFIKQ- 205
Query: 215 QGIATEDEYPYQAVPGTCSAAQ-KPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAY 273
GI T YPY+A GTC A++ A I +E VP +E ALLKAV+ QPVS+AI A
Sbjct: 206 YGITTASNYPYEAKDGTCDASKVNEPAVSIDGHENVPVNNEAALLKAVAHQPVSVAIEAG 265
Query: 274 STEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD 333
+FQ Y EG+F G CGT LDH V IVG+GTT+DG YW +KNSWG+ WG+ GY+++ R
Sbjct: 266 GIDFQFYSEGVFTGNCGTALDHGVAIVGYGTTQDGTKYWTVKNSWGSEWGEKGYIRMKRS 325
Query: 334 ----EGLCGIGTRSSYPL 347
+GLCGI +SYP+
Sbjct: 326 ISVKKGLCGIAMEASYPI 343
>gi|595986|gb|AAA79915.1| cysteine proteinase, partial [Dianthus caryophyllus]
Length = 427
Score = 298 bits (764), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 144/309 (46%), Positives = 208/309 (67%), Gaps = 12/309 (3%)
Query: 48 EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEK----ANKEGNRTYKLGTNQFSDLTNDE 103
+ W+ +H ++Y EKE R IF++NLE+I++ N G ++LG N+F+DLTNDE
Sbjct: 6 QSWLVKHRKNYNALGEKEKRFAIFRDNLEFIDQHNNNNNGGGGGEFELGLNKFADLTNDE 65
Query: 104 FRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAF 163
FR +Y G K P ++ + + +Y ++P S+DWR KGAV+ +K+Q +CG CWAF
Sbjct: 66 FRRIYFGVKRP---EKAESVKSDRYAVKEGDELPESVDWRKKGAVSHVKDQGQCGSCWAF 122
Query: 164 AAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEY 223
+A+ AVEGI KI +G+LI LSEQ+L+DC T+ N+GC GG + AF +II N GI T+ +Y
Sbjct: 123 SAIGAVEGINKIVTGDLITLSEQELVDCDTSYNSGCDGGLMDYAFRFIINNGGIDTDKDY 182
Query: 224 PYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKE 282
PY+A G+C + +K A I E+VP+ +E+AL KAV+ QPV +AI A +FQ YK
Sbjct: 183 PYKATDGSCDSNRKNAKVVTIDGLEDVPANNEKALQKAVAHQPVRLAIEAGGRDFQLYKS 242
Query: 283 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCG 338
G+F G CGT LDH V VG+GTT+DG +YW+++NSWG+ WG+ GY+++ R+ G CG
Sbjct: 243 GVFTGSCGTSLDHGVVAVGYGTTDDGKDYWIVRNSWGDDWGEDGYIRMERNTESKSGKCG 302
Query: 339 IGTRSSYPL 347
I SYP+
Sbjct: 303 IAIEPSYPV 311
>gi|359359066|gb|AEV40973.1| putative oryzain beta chain precursor [Oryza punctata]
Length = 461
Score = 298 bits (764), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 142/308 (46%), Positives = 214/308 (69%), Gaps = 13/308 (4%)
Query: 47 HEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRT--YKLGTNQFSDLTNDEF 104
++ W+A++GRSY E+E R ++F +NL++++ N + ++LG N+F+DLTNDEF
Sbjct: 49 YDLWLAENGRSYNALGERERRFRVFWDNLKFVDAHNARADEHGGFRLGMNRFADLTNDEF 108
Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFA 164
R+ + G K+ S ++ +Y++ + ++P S+DWR+KGAV P+KNQ +CG CWAF+
Sbjct: 109 RSTFLGAKVVERSR----AAGERYRHDGVEELPESVDWREKGAVAPVKNQGQCGSCWAFS 164
Query: 165 AVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFAYIIQNQGIATEDEY 223
AV+ VE I ++ +G +I LSEQ+L++CSTNG N+GC GG + AF +II+N GI TED+Y
Sbjct: 165 AVSTVESINQLVTGEMITLSEQELVECSTNGQNSGCNGGLMDDAFDFIIKNGGIDTEDDY 224
Query: 224 PYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKE 282
PY+AV G C ++ A I +E+VP DE++L KAV+ QPVS+AI A EFQ Y
Sbjct: 225 PYKAVDGKCDINRENAKVVSIDGFEDVPQNDEKSLQKAVAHQPVSVAIEAGGREFQLYHS 284
Query: 283 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCG 338
G+F+G CGT LDH V VG+G T++G +YW+++NSWG WG++GY+++ R+ G CG
Sbjct: 285 GVFSGRCGTSLDHGVVAVGYG-TDNGKDYWIVRNSWGPKWGESGYVRMERNINATTGKCG 343
Query: 339 IGTRSSYP 346
I +SYP
Sbjct: 344 IAMMASYP 351
>gi|194352754|emb|CAQ00105.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
gi|326513690|dbj|BAJ87864.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326514532|dbj|BAJ96253.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 463
Score = 298 bits (764), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 147/333 (44%), Positives = 213/333 (63%), Gaps = 12/333 (3%)
Query: 22 ITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKA 81
++L + +VS E+ V ++ +WMA+HG +Y E+E R + F++NL YI++
Sbjct: 18 VSLAAAADMSIVSYGERSEEEVRRMYAEWMAEHGSTYNAIGEEERRFEAFRDNLRYIDQH 77
Query: 82 N---KEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPT 138
N G +++LG N+F+DLTN+E+R+ Y G + R ++ +YQ ++P
Sbjct: 78 NAAADAGVHSFRLGLNRFADLTNEEYRSTYLGARTKPDRERKLSA---RYQAADNDELPE 134
Query: 139 SLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNG 198
S+DWR KGAV +K+Q CG CWAF+A+AAVEGI +I +G++I LSEQ+L+DC T+ N G
Sbjct: 135 SVDWRKKGAVGAVKDQGGCGSCWAFSAIAAVEGINQIVTGDMIPLSEQELVDCDTSYNQG 194
Query: 199 CLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQAL 257
C GG + AF +II N GI +E++YPY+ C A +K A I YE+VP E++L
Sbjct: 195 CNGGLMDYAFEFIINNGGIDSEEDYPYKERDNRCDANKKNAKVVTIDGYEDVPVNSEKSL 254
Query: 258 LKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNS 317
KAV+ QP+S+AI A FQ YK GIF G CGT LDH V VG+G TE+G +YWL++NS
Sbjct: 255 QKAVANQPISVAIEAGGRAFQLYKSGIFTGTCGTALDHGVAAVGYG-TENGKDYWLVRNS 313
Query: 318 WGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
WG+ WG+ GY+++ R+ G CGI SYP
Sbjct: 314 WGSVWGEDGYIRMERNIKASSGKCGIAVEPSYP 346
>gi|226503129|ref|NP_001149806.1| LOC100283433 precursor [Zea mays]
gi|195634783|gb|ACG36860.1| xylem cysteine proteinase 2 precursor [Zea mays]
gi|219884977|gb|ACL52863.1| unknown [Zea mays]
Length = 377
Score = 298 bits (763), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 148/317 (46%), Positives = 209/317 (65%), Gaps = 14/317 (4%)
Query: 38 THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
T +V + E+W+A++ ++Y EK R ++FK+NL +I++AN++ +Y LG N F+
Sbjct: 63 TQHDRLVRLFEEWVAKYRKAYGSFEEKLRRFEVFKDNLHHIDEANRKEVTSYWLGLNAFA 122
Query: 98 DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDV--PTSLDWRDKGAVTPIKNQK 155
DLT+DEF+A Y G +P + T+ F+Y + P S+DWR KGAVT +KNQ
Sbjct: 123 DLTHDEFKATYLGL-LP----KRTSGGRFRYGGVGDGGDEVPASVDWRKKGAVTEVKNQG 177
Query: 156 ECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQ 215
+CG CWAF+ VAAVEGI +I +GNL LSEQQL+DCST+GNNGC GG + AF++I
Sbjct: 178 QCGSCWAFSTVAAVEGINQIVTGNLTSLSEQQLVDCSTDGNNGCSGGVMDNAFSFIATGA 237
Query: 216 GIATEDEYPYQAVPGTCS--AAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAY 273
G+ +E+ YPY G C A IS YE+VP+ DEQAL+KA++ QPVS+AI A
Sbjct: 238 GLRSEEAYPYLMEEGDCDDRARDGEVLVTISGYEDVPANDEQALVKALAHQPVSVAIEAS 297
Query: 274 STEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD 333
FQ Y G+F+G CG++LDH V VG+G+++ G +Y ++KNSWG WG+ GY+++ R
Sbjct: 298 GRHFQFYSGGVFDGPCGSELDHGVAAVGYGSSK-GQDYIIVKNSWGTHWGEKGYIRMKRG 356
Query: 334 ----EGLCGIGTRSSYP 346
EGLCGI +SYP
Sbjct: 357 TGKPEGLCGINKMASYP 373
>gi|357467173|ref|XP_003603871.1| Cysteine proteinase [Medicago truncatula]
gi|355492919|gb|AES74122.1| Cysteine proteinase [Medicago truncatula]
gi|388499154|gb|AFK37643.1| unknown [Medicago truncatula]
Length = 350
Score = 298 bits (763), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 139/309 (44%), Positives = 214/309 (69%), Gaps = 10/309 (3%)
Query: 43 VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTND 102
++E+ E WM++HG+ Y+ EK +R ++FK+NL++I+ NK + Y LG N+F+DL++
Sbjct: 43 LIELFESWMSRHGKIYETIEEKLLRFEVFKDNLKHIDDRNKVVS-NYWLGLNEFADLSHQ 101
Query: 103 EFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWA 162
EF+ Y G K+ R ++ F Y+++ D+P S+DWR KGAVTP+KNQ +CG CWA
Sbjct: 102 EFKNKYLGLKVDLSQRRESSEEEFTYRDV---DLPKSVDWRKKGAVTPVKNQGQCGSCWA 158
Query: 163 FAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDE 222
F+ VAAVEGI +I +GNL LSEQ+L+DC T NNGC GG + AF++I++N G+ E++
Sbjct: 159 FSTVAAVEGINQIVTGNLTSLSEQELIDCDTTYNNGCNGGLMDYAFSFIVKNGGLHKEED 218
Query: 223 YPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYK 281
YPY TC ++ + I+ Y +VP +EQ+LLKA++ QP+S+AI A +FQ Y
Sbjct: 219 YPYIMEESTCEMKKEVSEVVTINGYHDVPQNNEQSLLKALANQPLSVAIEASGRDFQFYS 278
Query: 282 EGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLC 337
G+F+G CG++LDH V+ VG+GT++ G +Y ++KNSWG WG+ G++++ R+ EG+C
Sbjct: 279 GGVFDGHCGSELDHGVSAVGYGTSK-GLDYIIVKNSWGAKWGEKGFIRMKRNIGKSEGIC 337
Query: 338 GIGTRSSYP 346
G+ +SYP
Sbjct: 338 GLYKMASYP 346
>gi|445927|prf||1910332A Cys endopeptidase
Length = 362
Score = 298 bits (763), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 149/318 (46%), Positives = 204/318 (64%), Gaps = 16/318 (5%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
E+S+ +++E+W + H S + EK R +FK N+ ++ NK ++ YKL N+F+D+
Sbjct: 33 EESLWDLYERWRSHHTVS-RSLGEKHKRFNVFKANVMHVHNTNKM-DKPYKLKLNKFADM 90
Query: 100 TNDEFRALYTG-----YKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQ 154
TN EFR+ Y G +KM S S TF Y+ + VP S+DWR KGAVT +K+Q
Sbjct: 91 TNHEFRSTYAGSKVNHHKMFRGSQHG--SGTFMYEKVG--SVPASVDWRKKGAVTDVKDQ 146
Query: 155 KECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQN 214
+CG CWAF+ + AVEGI +I++ L+ LSEQ+L+DC N GC GG E AF +I Q
Sbjct: 147 GQCGSCWAFSTIVAVEGINQIKTNKLVSLSEQELVDCDKEENQGCNGGLMESAFEFIKQK 206
Query: 215 QGIATEDEYPYQAVPGTCSAAQ-KPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAY 273
GI TE YPY+A GTC ++ A I +E VP DE ALLKAV+ QPVS+AI A
Sbjct: 207 GGITTESNYPYKAQEGTCDESKVNDLAVSIDGHENVPVNDENALLKAVANQPVSVAIDAG 266
Query: 274 STEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD 333
++FQ Y EG+F G C T L+H V IVG+GTT DG NYW+++NSWG WG+ GY+++ R+
Sbjct: 267 GSDFQFYSEGVFTGDCNTDLNHGVAIVGYGTTVDGTNYWIVRNSWGPEWGEQGYIRMQRN 326
Query: 334 ----EGLCGIGTRSSYPL 347
EGLCGI +SYP+
Sbjct: 327 ISKKEGLCGIAMMASYPI 344
>gi|18413507|ref|NP_567377.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|30315953|sp|Q9SUS9.1|CPR4_ARATH RecName: Full=Probable cysteine proteinase At4g11320; Flags:
Precursor
gi|5596478|emb|CAB51416.1| drought-inducible cysteine proteinase RD21A precursor-like protein
[Arabidopsis thaliana]
gi|7267831|emb|CAB81233.1| drought-inducible cysteine proteinase RD21A precursor-like protein
[Arabidopsis thaliana]
gi|14334764|gb|AAK59560.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|15293257|gb|AAK93739.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|332657596|gb|AEE82996.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 371
Score = 298 bits (763), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 150/353 (42%), Positives = 220/353 (62%), Gaps = 26/353 (7%)
Query: 18 MFIIITLLVSCAS----QVVSSRSTHE--------QSVVE-----IHEKWMAQHGRSYKD 60
+F++ ++ SCA+ VVSS H Q + + + E WM +HG+ Y
Sbjct: 10 IFLLALVIASCATAMDMSVVSSNDNHHVTAGPGRRQGIFDAEATLMFESWMVKHGKVYDS 69
Query: 61 ELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRS 120
EKE RL IF++NL +I N E N +Y+LG N+F+DL+ E+ + G P +
Sbjct: 70 VAEKERRLTIFEDNLRFITNRNAE-NLSYRLGLNRFADLSLHEYGEICHGADPRPPRNHV 128
Query: 121 TTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNL 180
+S+ +Y+ +P S+DWR++GAVT +K+Q C CWAF+ V AVEG+ KI +G L
Sbjct: 129 FMTSSNRYKTSDGDVLPKSVDWRNEGAVTEVKDQGLCRSCWAFSTVGAVEGLNKIVTGEL 188
Query: 181 IQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKP-- 238
+ LSEQ L++C+ NNGC GG E A+ +I+ N G+ T+++YPY+A+ G C K
Sbjct: 189 VTLSEQDLINCNKE-NNGCGGGKVETAYEFIMNNGGLGTDNDYPYKALNGVCEGRLKEDN 247
Query: 239 AAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVT 298
I YE +P+ DE AL+KAV+ QPV+ + + S EFQ Y+ G+F+G CGT L+H V
Sbjct: 248 KNVMIDGYENLPANDEAALMKAVAHQPVTAVVDSSSREFQLYESGVFDGTCGTNLNHGVV 307
Query: 299 IVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
+VG+G TE+G +YW++KNS G+TWG+AGYMK+ R+ GLCGI R+SYPL
Sbjct: 308 VVGYG-TENGRDYWIVKNSRGDTWGEAGYMKMARNIANPRGLCGIAMRASYPL 359
>gi|413942348|gb|AFW74997.1| Xylem cysteine proteinase 2 [Zea mays]
Length = 391
Score = 298 bits (763), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 148/317 (46%), Positives = 209/317 (65%), Gaps = 14/317 (4%)
Query: 38 THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
T +V + E+W+A++ ++Y EK R ++FK+NL +I++AN++ +Y LG N F+
Sbjct: 77 TQHDRLVRLFEEWVAKYRKAYGSFEEKLRRFEVFKDNLHHIDEANRKEVTSYWLGLNAFA 136
Query: 98 DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDV--PTSLDWRDKGAVTPIKNQK 155
DLT+DEF+A Y G +P + T+ F+Y + P S+DWR KGAVT +KNQ
Sbjct: 137 DLTHDEFKATYLGL-LP----KRTSGGRFRYGGVGDGGDEVPASVDWRKKGAVTEVKNQG 191
Query: 156 ECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQ 215
+CG CWAF+ VAAVEGI +I +GNL LSEQQL+DCST+GNNGC GG + AF++I
Sbjct: 192 QCGSCWAFSTVAAVEGINQIVTGNLTSLSEQQLVDCSTDGNNGCSGGVMDNAFSFIATGA 251
Query: 216 GIATEDEYPYQAVPGTCS--AAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAY 273
G+ +E+ YPY G C A IS YE+VP+ DEQAL+KA++ QPVS+AI A
Sbjct: 252 GLRSEEAYPYLMEEGDCDDRARDGEVLVTISGYEDVPANDEQALVKALAHQPVSVAIEAS 311
Query: 274 STEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD 333
FQ Y G+F+G CG++LDH V VG+G+++ G +Y ++KNSWG WG+ GY+++ R
Sbjct: 312 GRHFQFYSGGVFDGPCGSELDHGVAAVGYGSSK-GQDYIIVKNSWGTHWGEKGYIRMKRG 370
Query: 334 ----EGLCGIGTRSSYP 346
EGLCGI +SYP
Sbjct: 371 TGKPEGLCGINKMASYP 387
>gi|359491865|ref|XP_002273243.2| PREDICTED: xylem cysteine proteinase 1-like [Vitis vinifera]
Length = 351
Score = 298 bits (762), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 146/314 (46%), Positives = 212/314 (67%), Gaps = 10/314 (3%)
Query: 38 THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
T + ++ E WM++HG+SY+ EK R ++F++NL++I++ NK+ + +Y LG N+F+
Sbjct: 39 TSMDKLTDLFESWMSKHGKSYRSFEEKLHRFEVFQDNLKHIDETNKKVS-SYWLGLNEFA 97
Query: 98 DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKEC 157
DL+++EF+ Y G K+ P R + F Y++++ D+P S+DWR KGAV +KNQ C
Sbjct: 98 DLSHEEFKRKYLGLKIELPKRRDSPEE-FSYKDVA--DLPKSVDWRKKGAVAHVKNQGAC 154
Query: 158 GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
G CWAF+ VAAVEGI +I +GNL LSEQ+L+DC NNGC GG + AFA+II N G+
Sbjct: 155 GSCWAFSTVAAVEGINQIVTGNLTALSEQELIDCDKPFNNGCNGGLMDYAFAFIISNGGL 214
Query: 218 ATEDEYPYQAVPGTCSAAQKP-AAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTE 276
E++YPY GTC ++ IS Y +VP +EQ+ LKA++ QP+S+AI A S
Sbjct: 215 RKEEDYPYVMEEGTCGEKKEELEVVTISGYHDVPEDNEQSFLKALANQPLSVAIEASSRG 274
Query: 277 FQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD--- 333
FQ Y GIFNG CGT+LDH V VG+GT++ G +Y +KNSWG+ WG+ GY+++ R+
Sbjct: 275 FQFYSGGIFNGHCGTELDHGVAAVGYGTSK-GVDYITVKNSWGSKWGEKGYIRMKRNVGK 333
Query: 334 -EGLCGIGTRSSYP 346
EG+CGI +SYP
Sbjct: 334 PEGICGIYKMASYP 347
>gi|297809385|ref|XP_002872576.1| hypothetical protein ARALYDRAFT_489965 [Arabidopsis lyrata subsp.
lyrata]
gi|297318413|gb|EFH48835.1| hypothetical protein ARALYDRAFT_489965 [Arabidopsis lyrata subsp.
lyrata]
Length = 371
Score = 298 bits (762), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 151/357 (42%), Positives = 223/357 (62%), Gaps = 28/357 (7%)
Query: 16 TPMFIIITLLV--SCAS----QVVSSRSTHE--------QSVVE-----IHEKWMAQHGR 56
+ M +++ +V SCA+ +VSS H Q V + + E WM +HG+
Sbjct: 6 SAMLVLLLAMVISSCATAMDMSIVSSNDNHHVTNGPGRRQGVFDAEATLMFESWMVKHGK 65
Query: 57 SYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSP 116
Y+ EKE RL IF++NL +I N E N +Y+LG N+F+DL+ E+ + G P
Sbjct: 66 VYESVAEKERRLTIFEDNLRFITNRNAE-NLSYRLGLNRFADLSLHEYAQICHGADPRPP 124
Query: 117 SHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIR 176
+ +S+ +Y+ +P S+DWR++GAVT +K+Q +C CWAF+ V AVEG+ KI
Sbjct: 125 RNHVFMTSSNRYKTSDGDVLPKSVDWRNEGAVTEVKDQGQCRSCWAFSTVGAVEGLNKIV 184
Query: 177 SGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQ 236
+G L+ LSEQ L++C+ NNGC GG E A+ +I+ N G+ T+++YPY+A+ G C+
Sbjct: 185 TGELVTLSEQDLINCNKE-NNGCGGGKVETAYEFIMNNGGLGTDNDYPYKALNGVCNDRL 243
Query: 237 KP--AAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLD 294
K I YE +P+ DE AL+KAV+ QPV+ + + S EFQ Y G+F+G CGT L+
Sbjct: 244 KENNKNVMIDGYENLPANDESALMKAVAHQPVTAVVDSSSREFQLYASGVFDGTCGTNLN 303
Query: 295 HAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
H V +VG+G TE+G +YW+++NS GNTWG+AGYMK+ R+ GLCGI R+SYPL
Sbjct: 304 HGVVVVGYG-TENGRDYWIVRNSRGNTWGEAGYMKMARNIANPRGLCGIAMRASYPL 359
>gi|242092704|ref|XP_002436842.1| hypothetical protein SORBIDRAFT_10g009850 [Sorghum bicolor]
gi|241915065|gb|EER88209.1| hypothetical protein SORBIDRAFT_10g009850 [Sorghum bicolor]
Length = 296
Score = 298 bits (762), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 145/311 (46%), Positives = 197/311 (63%), Gaps = 24/311 (7%)
Query: 43 VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTND 102
+V HE+WM Q+ R YKD EK R ++FK N+++IE N GNR + LG NQF+DLTND
Sbjct: 1 MVARHEQWMVQYSRVYKDATEKAQRFEVFKSNVKFIESFNAGGNRKFWLGVNQFADLTND 60
Query: 103 EFRALYT--GYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCC 160
EFRA T G+K PSP T F+Y+N+S+ +P ++DWR KGAVTPIK+Q +C
Sbjct: 61 EFRATKTNKGFK-PSPVKVPTG---FRYENISVDALPATIDWRTKGAVTPIKDQGQC--- 113
Query: 161 WAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFAYIIQNQGIAT 219
EGI KI +G LI LSEQ+L+DC +G + GC GG + AF +II+ G+ T
Sbjct: 114 ---------EGIVKISTGKLISLSEQELVDCDVHGEDQGCEGGLMDDAFKFIIKKGGLTT 164
Query: 220 EDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQS 279
E YPY A G C + + A + +E+VP+ DE +L+KAV+ QPVS+A+ FQ
Sbjct: 165 ESSYPYTAADGKCKSGSN-SVATVKGFEDVPANDEASLMKAVANQPVSVAVDGGDMTFQF 223
Query: 280 YKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EG 335
Y G+ G CGT LDH + +G+G T DG YWL+KNSWG TWG+ GY+++ +D G
Sbjct: 224 YSGGVMTGSCGTDLDHGIAAIGYGQTSDGTKYWLLKNSWGTTWGENGYLRMEKDISDKRG 283
Query: 336 LCGIGTRSSYP 346
+CG+ SYP
Sbjct: 284 MCGLAMEPSYP 294
>gi|146215980|gb|ABQ10192.1| actinidin Act2a [Actinidia deliciosa]
Length = 378
Score = 298 bits (762), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 154/340 (45%), Positives = 210/340 (61%), Gaps = 11/340 (3%)
Query: 13 INTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFK 72
I+ + +F L++S A + +S V+ ++E W+ +HG+SY EKEMR +IFK
Sbjct: 8 ISKSLLFFSTLLILSSAIDIENSVQRTNDQVMAMYESWLVEHGKSYNSLDEKEMRFEIFK 67
Query: 73 ENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
ENL I+ N + NR+Y LG N+F+DLT++E+R+ Y G K T + +Y
Sbjct: 68 ENLRIIDDHNADANRSYSLGLNRFADLTDEEYRSTYLGLK-----RGPKTDVSNQYMPKV 122
Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
+P +DWR GAV +KNQ C CWAF+AVAAVEGI KI +GNLI LSEQ+L+DC
Sbjct: 123 GDALPDYVDWRTVGAVVGVKNQGLCSSCWAFSAVAAVEGINKIVTGNLISLSEQELVDCG 182
Query: 193 -TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAA-AKISNYEEVP 250
T GC G AF +II N GI TE+ YPY A G C+ + K I +Y+ VP
Sbjct: 183 RTQITKGCNRGLMTDAFKFIINNGGINTENNYPYTAKDGQCNLSLKNQKYVTIDSYKNVP 242
Query: 251 SGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGAN 310
S +E AL KAV+ QPVS+ + + +F+ Y GIF G CGT +DH VTIVG+G TE G +
Sbjct: 243 SNNEMALKKAVAYQPVSVGVESEGGKFKLYTSGIFTGSCGTAVDHGVTIVGYG-TERGMD 301
Query: 311 YWLIKNSWGNTWGDAGYMKIVRD---EGLCGIGTRSSYPL 347
YW++KNSWG WG++GY++I R+ G CGI SYP+
Sbjct: 302 YWIVKNSWGTNWGESGYIRIQRNIGGAGKCGIAKMPSYPV 341
>gi|18402225|ref|NP_566633.1| Granulin repeat cysteine protease family protein [Arabidopsis
thaliana]
gi|11994461|dbj|BAB02463.1| cysteine proteinase [Arabidopsis thaliana]
gi|17065298|gb|AAL32803.1| cysteine proteinase [Arabidopsis thaliana]
gi|20260004|gb|AAM13349.1| cysteine proteinase [Arabidopsis thaliana]
gi|332642713|gb|AEE76234.1| Granulin repeat cysteine protease family protein [Arabidopsis
thaliana]
Length = 452
Score = 298 bits (762), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 150/345 (43%), Positives = 219/345 (63%), Gaps = 12/345 (3%)
Query: 10 SFKINTTPMFIIITLLVSCASQVVSSRST--HEQSVVEIHEKWMAQHGRSYKDELEKEMR 67
S K T + I LL+S + V++ T +E ++E+W+ ++ ++Y EKE R
Sbjct: 4 SIKSITLALLIFSVLLISLSLGSVTATETTRNEAEARRMYERWLVENRKNYNGLGEKERR 63
Query: 68 LKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFK 127
+IFK+NL+++E+ + NRTY++G +F+DLTNDEFRA+Y KM K
Sbjct: 64 FEIFKDNLKFVEEHSSIPNRTYEVGLTRFADLTNDEFRAIYLRSKM---ERTRVPVKGEK 120
Query: 128 YQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQ 187
Y +P ++DWR KGAV P+K+Q CG CWAF+A+ AVEGI +I++G LI LSEQ+
Sbjct: 121 YLYKVGDSLPDAIDWRAKGAVNPVKDQGSCGSCWAFSAIGAVEGINQIKTGELISLSEQE 180
Query: 188 LLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVP-GTCSAAQKPA-AAKISN 245
L+DC T+ N+GC GG + AF +II+N GI TE++YPY A C++ +K I
Sbjct: 181 LVDCDTSYNDGCGGGLMDYAFKFIIENGGIDTEEDYPYIATDVNVCNSDKKNTRVVTIDG 240
Query: 246 YEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
YE+VP DE++L KA++ QP+S+AI A FQ Y G+F G CGT LDH V VG+G +
Sbjct: 241 YEDVPQNDEKSLKKALANQPISVAIEAGGRAFQLYTSGVFTGTCGTSLDHGVVAVGYG-S 299
Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
E G +YW+++NSWG+ WG++GY K+ R+ G CG+ +SYP
Sbjct: 300 EGGQDYWIVRNSWGSNWGESGYFKLERNIKESSGKCGVAMMASYP 344
>gi|449438381|ref|XP_004136967.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
Length = 479
Score = 298 bits (762), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 144/313 (46%), Positives = 207/313 (66%), Gaps = 9/313 (2%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
++ V ++E W+ HG++Y EKE R +IFK+NL +I++ N+E +RTYK+G +F+DL
Sbjct: 55 DEEVAALYESWLVHHGKAYNAIGEKERRFEIFKDNLRFIDEHNRE-SRTYKVGLTRFADL 113
Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGC 159
TN+E+RA + G + S R + + + +Y D+P +DWR KGAV +K+Q +CG
Sbjct: 114 TNEEYRARFLGGRF-SRKPRLSAAKSGRYAAALGDDLPDDVDWRKKGAVATVKDQGQCGS 172
Query: 160 CWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIAT 219
CWAF++VAAVEGI +I +G LI LSEQ+L+DC + N GC GG + AF +II N GI T
Sbjct: 173 CWAFSSVAAVEGINQIVTGELIPLSEQELVDCDKSFNMGCNGGLMDYAFQFIIGNGGIDT 232
Query: 220 EDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQ 278
E++YPY+ C +K A I YE+VP DE +L KAV+ QPVS+AI A FQ
Sbjct: 233 EEDYPYKGRDAACDPNRKNAKVVTIDGYEDVPENDESSLKKAVANQPVSVAIEAGGRAFQ 292
Query: 279 SYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----- 333
Y+ G+F G CGT LDH V VG+G T++G +YW+++NSWG WG++GY+++ R+
Sbjct: 293 LYQSGVFTGRCGTDLDHGVVAVGYG-TDNGTDYWIVRNSWGKDWGESGYIRLERNVANIT 351
Query: 334 EGLCGIGTRSSYP 346
G CGI + SYP
Sbjct: 352 TGKCGIAVQPSYP 364
>gi|118158|sp|P12412.1|CYSEP_VIGMU RecName: Full=Vignain; AltName: Full=Bean endopeptidase; AltName:
Full=Cysteine proteinase; AltName:
Full=Sulfhydryl-endopeptidase; Short=SH-EP; Contains:
RecName: Full=Vignain-1; Contains: RecName:
Full=Vignain-2; Flags: Precursor
gi|22062|emb|CAA33753.1| sulfhydryl-pre-endopeptidase (AA -20 to 342) [Vigna mungo]
gi|22066|emb|CAA36181.1| sulfhydryl-endopeptidase [Vigna mungo]
Length = 362
Score = 297 bits (761), Expect = 4e-78, Method: Compositional matrix adjust.
Identities = 149/318 (46%), Positives = 203/318 (63%), Gaps = 16/318 (5%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
E+S+ +++E+W + H S + EK R +FK N+ ++ NK ++ YKL N+F+D+
Sbjct: 33 EESLWDLYERWRSHHTVS-RSLGEKHKRFNVFKANVMHVHNTNKM-DKPYKLKLNKFADM 90
Query: 100 TNDEFRALYTG-----YKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQ 154
TN EFR+ Y G +KM S S TF Y+ + VP S+DWR KGAVT +K+Q
Sbjct: 91 TNHEFRSTYAGSKVNHHKMFRGSQHG--SGTFMYEKVG--SVPASVDWRKKGAVTDVKDQ 146
Query: 155 KECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQN 214
+CG CWAF+ + AVEGI +I++ L+ LSEQ+L+DC N GC GG E AF +I Q
Sbjct: 147 GQCGSCWAFSTIVAVEGINQIKTNKLVSLSEQELVDCDKEENQGCNGGLMESAFEFIKQK 206
Query: 215 QGIATEDEYPYQAVPGTCSAAQ-KPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAY 273
GI TE YPY A GTC ++ A I +E VP DE ALLKAV+ QPVS+AI A
Sbjct: 207 GGITTESNYPYTAQEGTCDESKVNDLAVSIDGHENVPVNDENALLKAVANQPVSVAIDAG 266
Query: 274 STEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD 333
++FQ Y EG+F G C T L+H V IVG+GTT DG NYW+++NSWG WG+ GY+++ R+
Sbjct: 267 GSDFQFYSEGVFTGDCNTDLNHGVAIVGYGTTVDGTNYWIVRNSWGPEWGEQGYIRMQRN 326
Query: 334 ----EGLCGIGTRSSYPL 347
EGLCGI +SYP+
Sbjct: 327 ISKKEGLCGIAMMASYPI 344
>gi|358343350|ref|XP_003635767.1| Cysteine proteinase [Medicago truncatula]
gi|355501702|gb|AES82905.1| Cysteine proteinase [Medicago truncatula]
Length = 338
Score = 297 bits (761), Expect = 4e-78, Method: Compositional matrix adjust.
Identities = 154/345 (44%), Positives = 222/345 (64%), Gaps = 17/345 (4%)
Query: 11 FKINTTPMFIIITL--LVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRL 68
K T +I+ L + S ++ + ST+ + + +E W+ ++GR Y+D E E+R
Sbjct: 1 MKTTITLSIVILNLWIIASACPEIHTKNSTNPAVMKKRYETWLKRYGRHYRDREEWEVRF 60
Query: 69 KIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKY 128
I++ N++YIE N + N +YKL N+F+D+TN+EF++ Y GY +P R + F+Y
Sbjct: 61 DIYQSNVQYIEFYNSQ-NYSYKLIDNRFADITNEEFKSTYLGY-LP----RFRVQTEFRY 114
Query: 129 QNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQL 188
++P S+DWR KGAVT +K+Q CG CWAF+AVAAVEGI KI++ NL+ LSEQQL
Sbjct: 115 H--KHGELPKSIDWRKKGAVTHVKDQGRCGSCWAFSAVAAVEGINKIKTENLVSLSEQQL 172
Query: 189 LDCS-TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQ-KPAAAKISNY 246
+DC +GN GC GG AF YI ++ GIAT EYPY+ G C+ ++ K A IS Y
Sbjct: 173 IDCDIKSGNEGCEGGDMYIAFNYIKKHGGIATAKEYPYKGRDGNCNKSKAKNNAVTISGY 232
Query: 247 EEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTE 306
E VP+ +E+ L AV+ QPVSIA A FQ Y +GIF+G CG L+H +TIVG+G E
Sbjct: 233 ESVPARNEKMLKAAVAHQPVSIATDAGGYAFQFYSKGIFSGSCGKNLNHGMTIVGYG-EE 291
Query: 307 DGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
+G YW++KNSW N WG++GY+++ RD +G CGI ++YP+
Sbjct: 292 NGDKYWIVKNSWANDWGESGYVRMKRDTKDKDGTCGIAMDATYPV 336
>gi|357162587|ref|XP_003579458.1| PREDICTED: oryzain beta chain-like [Brachypodium distachyon]
Length = 470
Score = 297 bits (761), Expect = 4e-78, Method: Compositional matrix adjust.
Identities = 145/317 (45%), Positives = 206/317 (64%), Gaps = 11/317 (3%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDEL-EKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQ 95
E I+ W A+HG + L E+E R + F +NL +++ N G ++LG N+
Sbjct: 45 EAEARAIYGLWRAEHGSGNSNSLGEEERRFRAFWDNLRFVDAHNARAAAGEEGFRLGMNR 104
Query: 96 FSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQK 155
F+DLTNDEFRA Y G K + +Y++ + ++P ++DWR+KGAV P+KNQ
Sbjct: 105 FADLTNDEFRAAYLGVKGAGQRRSARAGVGERYRHDGVEELPEAVDWREKGAVAPVKNQG 164
Query: 156 ECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFAYIIQN 214
+CG CWAF+AV+AVE I ++ +G L+ LSEQ+L++C NG +NGC GG + AF +II N
Sbjct: 165 QCGSCWAFSAVSAVESINQLVTGELVTLSEQELVECDINGQSNGCNGGLMDDAFDFIINN 224
Query: 215 QGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAY 273
GI TED+YPY+A+ G C ++ A I +E+VP DE++L KAV+ QPVS+AI A
Sbjct: 225 GGIDTEDDYPYKALDGKCDINRRNAKVVSIDGFEDVPENDEKSLQKAVAHQPVSVAIEAG 284
Query: 274 STEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD 333
EFQ Y G+F G CGT+LDH V VG+G TE+G +YW+++NSWG WG+AGY+++ R+
Sbjct: 285 GREFQLYHSGVFTGRCGTELDHGVVAVGYG-TENGKDYWIVRNSWGPKWGEAGYLRMERN 343
Query: 334 ----EGLCGIGTRSSYP 346
G CGI SSYP
Sbjct: 344 INATTGKCGIAMMSSYP 360
>gi|351726339|ref|NP_001237379.1| cysteine proteinase precursor [Glycine max]
gi|31559526|dbj|BAC77521.1| cysteine proteinase [Glycine max]
gi|31559528|dbj|BAC77522.1| cysteine proteinase [Glycine max]
Length = 362
Score = 297 bits (761), Expect = 5e-78, Method: Compositional matrix adjust.
Identities = 151/320 (47%), Positives = 203/320 (63%), Gaps = 20/320 (6%)
Query: 40 EQSVVEIHEKWMAQH--GRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
E+S +++E+W + H RS D K R +FK N+ ++ NK ++ YKL N+F+
Sbjct: 33 EESFWDLYERWRSHHTVSRSLGD---KHKRFNVFKANVMHVHNTNKM-DKPYKLKLNKFA 88
Query: 98 DLTNDEFRALYTGYKMPSPSHR-----STTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIK 152
D+TN EFR+ Y G K+ HR + TF Y+ + VP S+DWR GAVT +K
Sbjct: 89 DMTNHEFRSTYAGSKVNH--HRMFQGTPRGNGTFMYEKVG--SVPPSVDWRKNGAVTGVK 144
Query: 153 NQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYII 212
+Q +CG CWAF+ V AVEGI +I++ L+ LSEQ+L+DC T N GC GG E AF +I
Sbjct: 145 DQGQCGSCWAFSTVVAVEGINQIKTNKLVSLSEQELVDCDTKKNAGCNGGLMESAFEFIK 204
Query: 213 QNQGIATEDEYPYQAVPGTCSAAQ-KPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIA 271
Q GI TE YPY A GTC A++ A I +E VP+ DE ALLKAV+ QPVS+AI
Sbjct: 205 QKGGITTESNYPYTAQDGTCDASKANDLAVSIDGHENVPANDENALLKAVANQPVSVAID 264
Query: 272 AYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIV 331
A ++FQ Y EG+F G C T+L+H V IVG+GTT DG NYW ++NSWG WG+ GY+++
Sbjct: 265 AGGSDFQFYSEGVFTGDCSTELNHGVAIVGYGTTVDGTNYWTVRNSWGPEWGEQGYIRMQ 324
Query: 332 RD----EGLCGIGTRSSYPL 347
R EGLCGI +SYP+
Sbjct: 325 RSISKKEGLCGIAMMASYPI 344
>gi|57282619|emb|CAE54307.1| cysteine proteinase [Gossypium hirsutum]
Length = 372
Score = 297 bits (761), Expect = 5e-78, Method: Compositional matrix adjust.
Identities = 145/315 (46%), Positives = 206/315 (65%), Gaps = 9/315 (2%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
+ V+ +++ W+ QHG++Y E+E R +IFK+NL +I++ N N TYKLG N+F+DL
Sbjct: 39 DDEVMGLYKSWVIQHGKAYNGIGEEEKRFEIFKDNLRFIDEHNSNNNTTYKLGLNKFADL 98
Query: 100 TNDEFRALYTGYKMPSPSHRSTTSS--TFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKEC 157
TN E+RA + G + P R S + +Y + + ++P S++WRD GAV+ +K+Q C
Sbjct: 99 TNQEYRAKFLGTRT-DPRRRLMKSKIPSSRYAHRAGDNLPDSVNWRDHGAVSRVKDQGSC 157
Query: 158 GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
G CWAF+A+AAVEGI KI SG LI LSEQ+L+DC + + GC GG + AF +II N GI
Sbjct: 158 GSCWAFSAIAAVEGINKIVSGELISLSEQELVDCDRSYDAGCNGGLMDYAFQFIIDNGGI 217
Query: 218 ATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTE 276
TE +YPY C +K A I YE+VP+ +E AL KAV+ QPVSIAI A
Sbjct: 218 DTEKDYPYLGFNNQCDPTKKNAKVVSIDGYEDVPN-NENALKKAVAHQPVSIAIEAGGRA 276
Query: 277 FQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR---- 332
FQ Y+ G+FNG CG LDH V VG+G+ ++G +YW+++NSWG WG+ GY+++ R
Sbjct: 277 FQLYESGVFNGECGLALDHGVVAVGYGSDDNGQDYWIVRNSWGGNWGENGYIRMERNINA 336
Query: 333 DEGLCGIGTRSSYPL 347
+ G CGI +SYP+
Sbjct: 337 NTGKCGIAMEASYPV 351
>gi|18413505|ref|NP_567376.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|30315954|sp|Q9SUT0.1|CPR3_ARATH RecName: Full=Probable cysteine proteinase At4g11310; Flags:
Precursor
gi|5596477|emb|CAB51415.1| drought-inducible cysteine proteinase RD21A precursor-like protein
[Arabidopsis thaliana]
gi|7267830|emb|CAB81232.1| drought-inducible cysteine proteinase RD21A precursor-like protein
[Arabidopsis thaliana]
gi|332657595|gb|AEE82995.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 364
Score = 297 bits (761), Expect = 5e-78, Method: Compositional matrix adjust.
Identities = 152/350 (43%), Positives = 221/350 (63%), Gaps = 21/350 (6%)
Query: 16 TPMFIIITLLV--SCASQVVSSRSTHE-----QSVVE-----IHEKWMAQHGRSYKDELE 63
+ M I++ +V SCA+ + S +++ SV + I E WM +HG+ Y E
Sbjct: 6 SAMLILLVAMVIASCATAIDMSVVSYDDNNRLHSVFDAEASLIFESWMVKHGKVYGSVAE 65
Query: 64 KEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTS 123
KE RL IF++NL +I N E N +Y+LG F+DL+ E++ + G P + +
Sbjct: 66 KERRLTIFEDNLRFINNRNAE-NLSYRLGLTGFADLSLHEYKEVCHGADPRPPRNHVFMT 124
Query: 124 STFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQL 183
S+ +Y+ + +P S+DWR++GAVT +K+Q C CWAF+ V AVEG+ KI +G L+ L
Sbjct: 125 SSDRYKTSADDVLPKSVDWRNEGAVTEVKDQGHCRSCWAFSTVGAVEGLNKIVTGELVTL 184
Query: 184 SEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKP--AAA 241
SEQ L++C+ NNGC GG E A+ +I++N G+ T+++YPY+AV G C K
Sbjct: 185 SEQDLINCNKE-NNGCGGGKLETAYEFIMKNGGLGTDNDYPYKAVNGVCDGRLKENNKNV 243
Query: 242 KISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVG 301
I YE +P+ DE AL+KAV+ QPV+ I + S EFQ Y+ G+F+G CGT L+H V +VG
Sbjct: 244 MIDGYENLPANDESALMKAVAHQPVTAVIDSSSREFQLYESGVFDGSCGTNLNHGVVVVG 303
Query: 302 FGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
+G TE+G +YWL+KNS G TWG+AGYMK+ R+ GLCGI R+SYPL
Sbjct: 304 YG-TENGRDYWLVKNSRGITWGEAGYMKMARNIANPRGLCGIAMRASYPL 352
>gi|37780041|gb|AAP32193.1| cysteine protease 14 [Trifolium repens]
Length = 351
Score = 297 bits (760), Expect = 6e-78, Method: Compositional matrix adjust.
Identities = 142/310 (45%), Positives = 214/310 (69%), Gaps = 11/310 (3%)
Query: 43 VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTND 102
++E+ E WM++HG+ Y+ EK +R ++FK+NL++I++ NK + Y LG N+F+DL++
Sbjct: 43 LIELFESWMSRHGKIYETIEEKLLRFEVFKDNLKHIDERNKIVS-NYWLGLNEFADLSHQ 101
Query: 103 EFRALYTGYKMPSPSHR-STTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCW 161
EF+ Y G K+ R S+ F Y+++ D+P S+DWR KGAVTP+KNQ +CG CW
Sbjct: 102 EFKNKYLGLKVNLSQRRESSNEEEFTYRDV---DLPKSVDWRKKGAVTPVKNQGQCGSCW 158
Query: 162 AFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATED 221
AF+ VAAVEGI +I +GNL LSEQ+L+DC T NNGC GG + AF++I+QN G+ ED
Sbjct: 159 AFSTVAAVEGINQIVTGNLTSLSEQELIDCDTTYNNGCNGGLMDYAFSFIVQNGGLHKED 218
Query: 222 EYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSY 280
+YPY TC ++ I+ Y +VP +EQ+LLKA++ QP+S+AI A S +FQ Y
Sbjct: 219 DYPYIMEESTCEMKKEETQVVTINGYHDVPQNNEQSLLKALANQPLSVAIEASSRDFQFY 278
Query: 281 KEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGL 336
G+F+G CG+ LDH V+ VG+GT+++ +Y ++KNSWG WG+ G++++ R+ EG+
Sbjct: 279 SGGVFDGHCGSDLDHGVSAVGYGTSKN-LDYIIVKNSWGAKWGEKGFIRMKRNIGKPEGI 337
Query: 337 CGIGTRSSYP 346
CG+ +SYP
Sbjct: 338 CGLYKMASYP 347
>gi|2144501|pir||TAGB actinidain (EC 3.4.22.14) precursor - kiwi fruit
gi|166317|gb|AAA32629.1| actinidin [Actinidia deliciosa]
Length = 380
Score = 296 bits (759), Expect = 7e-78, Method: Compositional matrix adjust.
Identities = 154/340 (45%), Positives = 212/340 (62%), Gaps = 10/340 (2%)
Query: 13 INTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFK 72
++ + +F L++S A + V ++E W+ ++G+SY E E R +IFK
Sbjct: 8 VSMSLLFFSTLLILSLAFNAKNLTQRTNDEVKAMYESWLIKYGKSYNSLGEWERRFEIFK 67
Query: 73 ENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
E L +I++ N + NR+YK+G NQF+DLT++EFR+ Y G+ S S+++ S+ +Y+
Sbjct: 68 ETLRFIDEHNADTNRSYKVGLNQFADLTDEEFRSTYLGFT--SGSNKTKVSN--RYEPRV 123
Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
+P+ +DWR GAV IK+Q ECG CWAF+A+A VEGI KI +G LI LSEQ+L+DC
Sbjct: 124 GQVLPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSEQELIDCG 183
Query: 193 -TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAA-QKPAAAKISNYEEVP 250
T GC GG F +II N GI TE+ YPY A G C+ Q I YE VP
Sbjct: 184 RTQNTRGCNGGYITDGFQFIINNGGINTEENYPYTAQDGECNVELQNEKYVTIDTYENVP 243
Query: 251 SGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGAN 310
+E AL AV+ QPVS+A+ A F+ Y GIF G CGT +DHAVTIVG+G TE G +
Sbjct: 244 YNNEWALQTAVTYQPVSVALDAAGDAFKQYSSGIFTGPCGTAIDHAVTIVGYG-TEGGID 302
Query: 311 YWLIKNSWGNTWGDAGYMKIVRD---EGLCGIGTRSSYPL 347
YW++KNSW TWG+ GYM+I+R+ G CGI T SYP+
Sbjct: 303 YWIVKNSWDTTWGEEGYMRILRNVGGAGTCGIATMPSYPV 342
>gi|312451836|gb|ADQ85985.1| actinidin [Actinidia chinensis]
Length = 380
Score = 296 bits (759), Expect = 7e-78, Method: Compositional matrix adjust.
Identities = 154/340 (45%), Positives = 212/340 (62%), Gaps = 10/340 (2%)
Query: 13 INTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFK 72
++ + +F L++S A + V ++E W+ ++G+SY E E R +IFK
Sbjct: 8 VSMSLLFFSTLLILSLAFNTKNLTQRTNDEVKAMYESWLIKYGKSYNSLGEWERRFEIFK 67
Query: 73 ENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
E L +I++ N + NR+YK+G NQF+DLT++EFR+ Y G+ S S+++ S+ +Y+
Sbjct: 68 ETLRFIDEHNADTNRSYKVGLNQFADLTDEEFRSTYLGFT--SGSNKTKVSN--RYEPRV 123
Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
+P+ +DWR GAV IK+Q ECG CWAF+A+A VEGI KI +G LI LSEQ+L+DC
Sbjct: 124 GQVLPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSEQELIDCG 183
Query: 193 -TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSA-AQKPAAAKISNYEEVP 250
T GC GG F +II N GI TE+ YPY A G C+ Q I YE VP
Sbjct: 184 RTQNTRGCNGGYITDGFQFIINNGGINTEENYPYTAQDGECNVDLQNEKYVTIDTYENVP 243
Query: 251 SGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGAN 310
+E AL AV+ QPVS+A+ A F+ Y GIF G CGT +DHAVTIVG+G TE G +
Sbjct: 244 YNNEWALQTAVTYQPVSVALDAAGDAFKQYSSGIFTGPCGTAIDHAVTIVGYG-TEGGID 302
Query: 311 YWLIKNSWGNTWGDAGYMKIVRD---EGLCGIGTRSSYPL 347
YW++KNSW TWG+ GYM+I+R+ G CGI T SYP+
Sbjct: 303 YWIVKNSWDTTWGEEGYMRILRNVGGAGTCGIATMPSYPV 342
>gi|20260334|gb|AAM13065.1| drought-inducible cysteine proteinase RD21A precursor-like protein
[Arabidopsis thaliana]
gi|23197782|gb|AAN15418.1| drought-inducible cysteine proteinase RD21A precursor-like protein
[Arabidopsis thaliana]
Length = 357
Score = 296 bits (759), Expect = 8e-78, Method: Compositional matrix adjust.
Identities = 149/346 (43%), Positives = 219/346 (63%), Gaps = 19/346 (5%)
Query: 18 MFIIITLLVSCASQVVSSRSTHE-----QSVVE-----IHEKWMAQHGRSYKDELEKEMR 67
+ ++ ++ SCA+ + S +++ SV + I E WM +HG+ Y EKE R
Sbjct: 3 ILLVAMVIASCATAIDMSVVSYDDNNRLHSVFDAEASLIFESWMVKHGKVYGSVAEKERR 62
Query: 68 LKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFK 127
L IF++NL +I N E N +Y+LG F+DL+ E++ + G P + +S+ +
Sbjct: 63 LTIFEDNLRFINNRNAE-NLSYRLGLTGFADLSLHEYKEVCHGADPRPPRNHVFMTSSDR 121
Query: 128 YQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQ 187
Y+ + +P S+DWR++GAVT +K+Q C CWAF+ V AVEG+ KI +G L+ LSEQ
Sbjct: 122 YKTSADDVLPKSVDWRNEGAVTEVKDQGHCRSCWAFSTVGAVEGLNKIVTGELVTLSEQD 181
Query: 188 LLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKP--AAAKISN 245
L++C+ NNGC GG E A+ +I++N G+ T+++YPY+AV G C K I
Sbjct: 182 LINCNKE-NNGCGGGKLETAYEFIMKNGGLGTDNDYPYKAVNGVCDGRLKENNKNVMIDG 240
Query: 246 YEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
YE +P+ DE AL+KAV+ QPV+ I + S EFQ Y+ G+F+G CGT L+H V +VG+G T
Sbjct: 241 YENLPANDESALMKAVAHQPVTAVIDSSSREFQLYESGVFDGSCGTNLNHGVVVVGYG-T 299
Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
E+G +YWL+KNS G TWG+AGYMK+ R+ GLCGI R+SYPL
Sbjct: 300 ENGRDYWLVKNSRGITWGEAGYMKMARNIANPRGLCGIAMRASYPL 345
>gi|172052260|gb|ACB70409.1| cysteine protease [Nicotiana tabacum]
Length = 361
Score = 296 bits (759), Expect = 8e-78, Method: Compositional matrix adjust.
Identities = 147/313 (46%), Positives = 204/313 (65%), Gaps = 16/313 (5%)
Query: 45 EIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEF 104
E++E+W + H S + EK+ R +FK N+ Y+ NK+ ++ YKL N+F+D+TN EF
Sbjct: 36 ELYERWRSHHTVS-RSLDEKDKRFNVFKANVHYVHNFNKK-DKPYKLKLNKFADMTNHEF 93
Query: 105 RALYTGYKMPSPSHR-----STTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGC 159
R Y G K+ HR S + TF Y + VP ++DWR KGAVTP+K+Q +CG
Sbjct: 94 RHHYAGSKIKH--HRTFLGASRANGTFMYAHED--SVPPTVDWRKKGAVTPVKDQGKCGS 149
Query: 160 CWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIAT 219
CWAF+ V AVEGI +I++ L+ LSEQ+L+DC T+ N GC GG + AF +I + GI T
Sbjct: 150 CWAFSTVVAVEGINQIKTNELVSLSEQELVDCDTSQNQGCNGGLMDMAFEFIKKKGGINT 209
Query: 220 EDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQ 278
E+ YPY A G C ++ + I +E+VP DE +LLKAV+ QPVS+AI A ++FQ
Sbjct: 210 EENYPYMAEGGECDIQKRNSPVVSIDGHEDVPPNDEGSLLKAVANQPVSVAIQASGSDFQ 269
Query: 279 SYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR----DE 334
Y EG+F G CGT+LDH V IVG+GTT D YW++KNSWG WG+ GY+++ R +E
Sbjct: 270 FYSEGVFTGDCGTELDHGVAIVGYGTTLDRTKYWIVKNSWGPEWGEKGYIRMQREIDAEE 329
Query: 335 GLCGIGTRSSYPL 347
GLCGI + SYP+
Sbjct: 330 GLCGIAMQPSYPI 342
>gi|30141021|dbj|BAC75924.1| cysteine protease-2 [Helianthus annuus]
Length = 362
Score = 296 bits (758), Expect = 9e-78, Method: Compositional matrix adjust.
Identities = 145/317 (45%), Positives = 206/317 (64%), Gaps = 15/317 (4%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
E ++ +++E+W + ++ ++L R +FK N+ ++ + NK ++ YKL N+F+D+
Sbjct: 33 EDNLWDMYERWRHKVATNHGEKLR---RFNVFKSNVLHVHETNKM-DKPYKLKLNKFADM 88
Query: 100 TNDEFRALYTGYKMP----SPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQK 155
TN EFR++Y G K+ S + S TF Y N+ VPTS+DWR KGAV P+K+Q
Sbjct: 89 TNHEFRSVYAGSKIHHHDRSLQGDRSGSKTFMYANVE--SVPTSVDWRKKGAVAPVKDQG 146
Query: 156 ECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQ 215
+CG CWAF+ VAAVEGI KI++ L+ LSEQ+L+DC T N GC GG + AF +I +
Sbjct: 147 QCGSCWAFSTVAAVEGINKIKTNELVSLSEQELVDCDTLENQGCNGGLMDLAFDFIKKTG 206
Query: 216 GIATEDEYPYQAVPGTC-SAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYS 274
G+ ED YPY A G C S I +E+VP DEQ+L+KAV+ QPV++AI A S
Sbjct: 207 GLTREDAYPYAAEDGKCDSNKMNSPVVSIDGHEDVPKNDEQSLMKAVANQPVAVAIDAGS 266
Query: 275 TEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR-- 332
++FQ Y EG+F G CGTQLDH V VG+GTT DG YW+++NSWG+ WG+ GY+++ R
Sbjct: 267 SDFQFYSEGVFTGKCGTQLDHGVAAVGYGTTLDGTKYWIVRNSWGSEWGEKGYIRMERGI 326
Query: 333 --DEGLCGIGTRSSYPL 347
GLCGI +SYP+
Sbjct: 327 SDKRGLCGIAMEASYPI 343
>gi|225456820|ref|XP_002278323.1| PREDICTED: vignain [Vitis vinifera]
Length = 360
Score = 296 bits (758), Expect = 9e-78, Method: Compositional matrix adjust.
Identities = 149/318 (46%), Positives = 201/318 (63%), Gaps = 16/318 (5%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
E+S+ ++E+W + H S + EK R +FKEN+ ++ + NK+ + YKL N+F+D+
Sbjct: 31 EESLWNLYERWRSHHTVS-RSLDEKHKRFNVFKENVNFVHEFNKK-DEPYKLKLNKFADM 88
Query: 100 TNDEFRALYTGYKMPSPSHRSTTSS-----TFKYQNLSMTDVPTSLDWRDKGAVTPIKNQ 154
TN EFR+ Y G K+ HR S +F Y+ + VP S+DWR KGAVTPIK+Q
Sbjct: 89 TNHEFRSTYAGSKVNH--HRMFRGSQHAAGSFMYEKVK--SVPPSVDWRKKGAVTPIKDQ 144
Query: 155 KECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQN 214
+CG CWAF+ V AVEGI I++ L+ LSEQ+L+DC T+ N GC GG AF +I +
Sbjct: 145 GQCGSCWAFSTVVAVEGINHIKTNKLVSLSEQELVDCDTSENQGCNGGLMGYAFEFIKEK 204
Query: 215 QGIATEDEYPYQAVPGTCSAAQ-KPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAY 273
GI TE YPY A GTC ++ I +E VP +E ALLKA + QP+S+AI A
Sbjct: 205 GGITTEQSYPYTAEDGTCDVSKVNSPVVSIDGHETVPPNNEDALLKAAANQPISVAIDAG 264
Query: 274 STEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR- 332
+ FQ Y EG+F G CGT LDH V IVG+GTT DG YW++KNSWG WG+ GY+++ R
Sbjct: 265 GSAFQFYSEGVFAGRCGTDLDHGVAIVGYGTTLDGTKYWIVKNSWGTDWGENGYIRMKRG 324
Query: 333 ---DEGLCGIGTRSSYPL 347
EGLCGI +SYP+
Sbjct: 325 ISAKEGLCGIAVEASYPI 342
>gi|357166359|ref|XP_003580684.1| PREDICTED: oryzain alpha chain-like [Brachypodium distachyon]
Length = 456
Score = 296 bits (758), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 146/324 (45%), Positives = 210/324 (64%), Gaps = 12/324 (3%)
Query: 32 VVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN---KEGNRT 88
+VS E+ V ++ +WMA++GR+Y E+E R ++F++NL Y+++ N G +
Sbjct: 27 IVSYGERSEEEVRRMYVEWMAENGRTYNAIGEEERRFEVFRDNLRYVDQHNAAADAGLHS 86
Query: 89 YKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAV 148
++LG N+F+DLTN+E+R Y G + R + +YQ ++P S+DWR+KGAV
Sbjct: 87 FRLGLNRFADLTNEEYRDTYLGVRTKPVRERRLSG---RYQAADNEELPESVDWREKGAV 143
Query: 149 TPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAF 208
+K+Q CG CWAF+A+AAVEGI +I +G++I LSEQ+L+DC T+ N GC GG + AF
Sbjct: 144 AKVKDQGGCGSCWAFSAIAAVEGINQIVTGDMIALSEQELVDCDTSYNQGCNGGLMDYAF 203
Query: 209 AYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVS 267
+II N GI +E++YPY+ C A +K A I YE+VP E +L KAV+ QP+S
Sbjct: 204 EFIINNGGIDSEEDYPYKERDNRCDANKKNAKVVTIDGYEDVPVNSELSLKKAVANQPIS 263
Query: 268 IAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGY 327
+AI A FQ YK GIF G CGT LDH VT VG+G +E+G +YW++KNSWG WG+ GY
Sbjct: 264 VAIEAGGRAFQLYKSGIFTGRCGTALDHGVTAVGYG-SENGKDYWIVKNSWGTVWGEDGY 322
Query: 328 MKIVRD----EGLCGIGTRSSYPL 347
+++ R+ G CGI SYPL
Sbjct: 323 VRLERNIKATSGKCGIAIEPSYPL 346
>gi|242055753|ref|XP_002457022.1| hypothetical protein SORBIDRAFT_03g047290 [Sorghum bicolor]
gi|241928997|gb|EES02142.1| hypothetical protein SORBIDRAFT_03g047290 [Sorghum bicolor]
Length = 378
Score = 296 bits (757), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 153/339 (45%), Positives = 214/339 (63%), Gaps = 31/339 (9%)
Query: 37 STHEQSVVEIHEKWMAQHGRSYKDELEKEMR-LKIFKENLEYIEKANKEGNRTYKLGTNQ 95
S+HE S+ E+ E+W+++H + LE+++R ++FK+NL +I++ N++ + +Y LG N+
Sbjct: 39 SSHE-SLAELFERWLSRHRKGAYASLEEKLRRFEVFKDNLHHIDETNRKVS-SYWLGLNE 96
Query: 96 FSDLTNDEFRALYTGYKMPSPS------HRSTTSST-------------FKYQNLSMTDV 136
F+DLT+DEF+A Y G H F+Y+ + +
Sbjct: 97 FADLTHDEFKATYLGLSPSGGGGDVVHMHHDDDDEEPEEEGSSSSSSFRFRYEGVDAARL 156
Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGN 196
P S+DWR KGAVT +KNQ +CG CWAF+ VAAVEGI +I +GNL LSEQ+L+DC T+GN
Sbjct: 157 PKSVDWRSKGAVTGVKNQGQCGSCWAFSTVAAVEGINQIVTGNLTALSEQELVDCDTDGN 216
Query: 197 NGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQA 256
NGC GG + AF+YI N G+ TE+ YPY GTCS A IS YE+VP +EQA
Sbjct: 217 NGCNGGLMDYAFSYIAHNGGLHTEEAYPYLMEEGTCSRGSSAAVVTISGYEDVPRNNEQA 276
Query: 257 LLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTT--EDG---ANY 311
LLKA++ QPVS+AI A Q Y G+F+G CGTQLDH V VG+GT ++G A+Y
Sbjct: 277 LLKALAHQPVSVAIEASGRNLQFYSGGVFDGPCGTQLDHGVAAVGYGTAGKDNGHVVADY 336
Query: 312 WLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
++KNSWG +WG+ GY+++ R +GLCGI SYP
Sbjct: 337 IIVKNSWGPSWGEKGYIRMRRGTGKRQGLCGINKMPSYP 375
>gi|146215984|gb|ABQ10194.1| actinidin Act2c [Actinidia arguta]
Length = 378
Score = 296 bits (757), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 151/340 (44%), Positives = 210/340 (61%), Gaps = 11/340 (3%)
Query: 13 INTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFK 72
I+ + +F L++S A +V+S V +++E W+ + G+SY EKEMR +IFK
Sbjct: 8 ISMSLLFFSTLLILSSALDIVNSAQRTNDQVRDMYESWLVEQGKSYNSLDEKEMRFEIFK 67
Query: 73 ENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
+NL I+ N + NR++ LG N+F+DLT++E+R+ Y G+K P + + K ++
Sbjct: 68 DNLRIIDDHNADANRSFSLGLNRFADLTDEEYRSTYLGFK-SGPKAKVSNRYVPKVGDV- 125
Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
+P +DWR GAV +KNQ C CWAF+AVAAVEGI KI +GNL+ LSEQ+L+DC
Sbjct: 126 ---LPNYVDWRTVGAVVGVKNQGLCSSCWAFSAVAAVEGINKIMTGNLLSLSEQELVDCG 182
Query: 193 -TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSA-AQKPAAAKISNYEEVP 250
T GC G AF +II N GI TED YPY A G C+ Q I +YE VP
Sbjct: 183 RTQSTRGCNRGYMTDAFQFIINNGGINTEDNYPYTAQDGQCNRYLQNQKYVTIDDYENVP 242
Query: 251 SGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGAN 310
S +E AL AV+ QPVS+ + + +F+ Y GIF CGT +DH VTIVG+G TE G +
Sbjct: 243 SNNEWALQNAVAHQPVSVGLESEGGKFKLYTSGIFTQYCGTAIDHGVTIVGYG-TERGLD 301
Query: 311 YWLIKNSWGNTWGDAGYMKIVRD---EGLCGIGTRSSYPL 347
YW++KNSWG WG+ GY++I R+ G CGI +SYP+
Sbjct: 302 YWIVKNSWGTNWGENGYIRIQRNIGGAGKCGIARMASYPV 341
>gi|193806686|sp|A5HII1.1|ACTN_ACTDE RecName: Full=Actinidain; Short=Actinidin; AltName: Full=Allergen
Act d 1; AltName: Allergen=Act d 1; Flags: Precursor
gi|146215974|gb|ABQ10189.1| actinidin Act1a [Actinidia deliciosa]
Length = 380
Score = 296 bits (757), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 154/340 (45%), Positives = 212/340 (62%), Gaps = 10/340 (2%)
Query: 13 INTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFK 72
++ + +F L++S A + V ++E W+ ++G+SY E E R +IFK
Sbjct: 8 VSMSLLFFSTLLILSLAFNAKNLTQRTNDEVKAMYESWLIKYGKSYNSLGEWERRFEIFK 67
Query: 73 ENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
E L +I++ N + NR+YK+G NQF+DLT++EFR+ Y G+ S S+++ S+ +Y+
Sbjct: 68 ETLRFIDEHNADTNRSYKVGLNQFADLTDEEFRSTYLGFT--SGSNKTKVSN--RYEPRV 123
Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
+P+ +DWR GAV IK+Q ECG CWAF+A+A VEGI KI +G LI LSEQ+L+DC
Sbjct: 124 GQVLPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSEQELIDCG 183
Query: 193 -TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSA-AQKPAAAKISNYEEVP 250
T GC GG F +II N GI TE+ YPY A G C+ Q I YE VP
Sbjct: 184 RTQNTRGCNGGYITDGFQFIINNGGINTEENYPYTAQDGECNLDLQNEKYVTIDTYENVP 243
Query: 251 SGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGAN 310
+E AL AV+ QPVS+A+ A F+ Y GIF G CGT +DHAVTIVG+G TE G +
Sbjct: 244 YNNEWALQTAVTYQPVSVALDAAGDAFKHYSSGIFTGPCGTAIDHAVTIVGYG-TEGGID 302
Query: 311 YWLIKNSWGNTWGDAGYMKIVRD---EGLCGIGTRSSYPL 347
YW++KNSW TWG+ GYM+I+R+ G CGI T SYP+
Sbjct: 303 YWIVKNSWDTTWGEEGYMRILRNVGGAGTCGIATMPSYPV 342
>gi|15984|emb|CAA34486.1| unnamed protein product [Actinidia deliciosa]
Length = 380
Score = 296 bits (757), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 154/340 (45%), Positives = 212/340 (62%), Gaps = 10/340 (2%)
Query: 13 INTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFK 72
++ + +F L++S A + V ++E W+ ++G+SY E E R +IFK
Sbjct: 8 VSMSLLFFSTLLILSLAFNAKNLTQRTNDEVKAMYESWLIKYGKSYNSLGEWERRFEIFK 67
Query: 73 ENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
E L +I++ N + NR+YK+G NQF+DLT++EFR+ Y G+ S S+++ S+ +Y+
Sbjct: 68 ETLRFIDEHNADTNRSYKVGLNQFADLTDEEFRSTYLGFT--SGSNKTKVSN--RYEPRF 123
Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
+P+ +DWR GAV IK+Q ECG CWAF+A+A VEGI KI +G LI LSEQ+L+DC
Sbjct: 124 GQVLPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSEQELIDCG 183
Query: 193 -TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSA-AQKPAAAKISNYEEVP 250
T GC GG F +II N GI TE+ YPY A G C+ Q I YE VP
Sbjct: 184 RTQNTRGCNGGYITDGFQFIINNGGINTEENYPYTAQDGECNLDLQNEKYVTIDTYENVP 243
Query: 251 SGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGAN 310
+E AL AV+ QPVS+A+ A F+ Y GIF G CGT +DHAVTIVG+G TE G +
Sbjct: 244 YNNEWALQTAVTYQPVSVALDAAGDAFKHYSSGIFTGPCGTAIDHAVTIVGYG-TEGGID 302
Query: 311 YWLIKNSWGNTWGDAGYMKIVRD---EGLCGIGTRSSYPL 347
YW++KNSW TWG+ GYM+I+R+ G CGI T SYP+
Sbjct: 303 YWIVKNSWDTTWGEEGYMRILRNVGGAGTCGIATMPSYPV 342
>gi|45738078|gb|AAS75836.1| fastuosain precursor [Bromelia fastuosa]
Length = 324
Score = 295 bits (756), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 140/308 (45%), Positives = 198/308 (64%), Gaps = 10/308 (3%)
Query: 43 VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTND 102
++E E+WMA++GR Y D EK R +IFK N+ +IE N +Y LG NQF+D+TN+
Sbjct: 6 MMERFEEWMAEYGRVYNDNAEKMRRFQIFKNNVNHIETFNNRSGNSYTLGVNQFTDMTNN 65
Query: 103 EFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWA 162
EF A YTG +P R S + ++ ++ VP S+DWRD GAVT +KNQ CG CWA
Sbjct: 66 EFLARYTGASLPLNIERDPVVS---FDDVDISAVPQSIDWRDYGAVTSVKNQGSCGSCWA 122
Query: 163 FAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDE 222
F+A+A VEGI KI++GNLI LSEQ++LDC+ + GC GG KA+ +II N G+ +
Sbjct: 123 FSAIATVEGIYKIKAGNLISLSEQEVLDCAL--SYGCDGGWVNKAYDFIISNNGVTSFAN 180
Query: 223 YPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKE 282
PY+ G C+ P A I+ Y V S +E++++ AV+ QP++ I A +FQ YK
Sbjct: 181 LPYKGYKGPCNHNDLPNKAYITGYTYVQSNNERSMMIAVANQPIAALIDA-GGDFQYYKS 239
Query: 283 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCG 338
G+F G CGT L+HA+T++G+G T G YW++KNSWG +WG+ GY+++ RD GLCG
Sbjct: 240 GVFTGSCGTSLNHAITVIGYGQTSSGTKYWIVKNSWGTSWGERGYIRMARDVSSPYGLCG 299
Query: 339 IGTRSSYP 346
I +P
Sbjct: 300 IAMAPLFP 307
>gi|359359213|gb|AEV41117.1| putative oryzain beta chain precursor [Oryza officinalis]
Length = 465
Score = 295 bits (756), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 142/307 (46%), Positives = 212/307 (69%), Gaps = 12/307 (3%)
Query: 47 HEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN-KEGNRTYKLGTNQFSDLTNDEFR 105
++ W+A++GRSY E E R ++F +NL + + N + + ++LG N+F+DLTN+EFR
Sbjct: 54 YDLWLAENGRSYNALGEHERRFRVFWDNLRFADAHNARADDHGFRLGMNRFADLTNEEFR 113
Query: 106 ALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAA 165
A + G K+ S ++ +Y++ + ++P S+DWR+KGAV P+KNQ +CG CWAF+A
Sbjct: 114 ATFLGAKVVERSR----AAGERYRHDGVEELPESVDWREKGAVAPVKNQGQCGSCWAFSA 169
Query: 166 VAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFAYIIQNQGIATEDEYP 224
V+ VE I ++ +G +I LSEQ+L++CSTNG N+GC GG + AF +II+N GI TED+YP
Sbjct: 170 VSTVESINQLVTGEMITLSEQELVECSTNGQNSGCNGGLMDDAFDFIIKNGGIDTEDDYP 229
Query: 225 YQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEG 283
Y+AV G C ++ A I +E+VP DE++L KAV+ QPVS+AI A EFQ Y G
Sbjct: 230 YKAVDGKCDINRENAKVVSIDGFEDVPQNDEKSLQKAVAHQPVSVAIEAGGREFQLYHSG 289
Query: 284 IFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGI 339
+F+G CGT LDH V VG+G T++G +YW+++NSWG WG++GY+++ R+ G CGI
Sbjct: 290 VFSGRCGTSLDHGVVAVGYG-TDNGKDYWIVRNSWGPKWGESGYVRMERNINVTTGKCGI 348
Query: 340 GTRSSYP 346
+SYP
Sbjct: 349 AMMASYP 355
>gi|222425026|dbj|BAH20463.1| cysteine protease [Spinacia oleracea]
Length = 473
Score = 295 bits (756), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 149/325 (45%), Positives = 215/325 (66%), Gaps = 14/325 (4%)
Query: 34 SSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGT 93
SSRS E V+ I+E W+ QH ++Y EKE R IFK+NLE+I++ N + ++T+K+G
Sbjct: 42 SSRSDDE--VMRIYESWLVQHRKNYNALGEKEKRFAIFKDNLEFIDQHNSDDSQTFKVGL 99
Query: 94 NQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFK-----YQNLSMTDVPTSLDWRDKGAV 148
N+F+DLTN+EFR++Y G K S S +S+ K Y ++P ++DWR GAV
Sbjct: 100 NKFADLTNEEFRSVYLGRKKSSSSSPLLSSAKSKVKSDRYLFKEGDELPEAVDWRKNGAV 159
Query: 149 TPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAF 208
+K+Q +CG CWAF+ +AAVEGI +I +G L+ LSEQ+L+DC T+ N+GC GG + A+
Sbjct: 160 AKVKDQGQCGSCWAFSTIAAVEGINQIVTGELLSLSEQELVDCDTSYNSGCDGGLMDYAY 219
Query: 209 AYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVS 267
+II N GI T+ +YPY A G C +K A I ++E+VP DE+AL KAV+ QPVS
Sbjct: 220 EFIINNGGIDTDADYPYTAKDGKCDQYRKNAKVVTIDDFEDVPENDEKALQKAVAHQPVS 279
Query: 268 IAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGY 327
+AI A + FQ Y+ G+F G CG LDH V VG+G ++DG +YW+++NSWG WG++GY
Sbjct: 280 VAIEAGGSTFQFYQSGVFTGKCGADLDHGVVAVGYG-SDDGKDYWIVRNSWGADWGESGY 338
Query: 328 MKIVRD-----EGLCGIGTRSSYPL 347
+++ R+ G CGI SYP+
Sbjct: 339 IRMERNLETVKTGKCGIAIEPSYPI 363
>gi|297830592|ref|XP_002883178.1| hypothetical protein ARALYDRAFT_479457 [Arabidopsis lyrata subsp.
lyrata]
gi|297329018|gb|EFH59437.1| hypothetical protein ARALYDRAFT_479457 [Arabidopsis lyrata subsp.
lyrata]
Length = 452
Score = 295 bits (756), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 148/340 (43%), Positives = 216/340 (63%), Gaps = 12/340 (3%)
Query: 15 TTPMFIIITLLVSCASQVVSSRST--HEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFK 72
T + I LL+S + V++ T +E ++E+W+ ++ ++Y EKE R +IF
Sbjct: 9 TLALLIFSMLLISLSLGSVTAADTTRNEAEARRMYEQWLVENRKNYNGLGEKETRFEIFT 68
Query: 73 ENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
+NL+YIE+ N N+T+++G +F+DLTNDEFRA+Y KM +Y
Sbjct: 69 DNLKYIEEHNSVPNQTFEVGLTRFADLTNDEFRAIYLRSKM---ERTRVPVKGERYLYKV 125
Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
+P +DWR KGAV P+K+Q CG CWAF+A+ AVEGI +I++G LI LSEQ+L+DC
Sbjct: 126 GDTLPDQIDWRAKGAVNPVKDQGNCGSCWAFSAIGAVEGINQIKTGELISLSEQELVDCD 185
Query: 193 TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAV-PGTCSAAQKPA-AAKISNYEEVP 250
T+ N GC GG + AF +II+N GI TE++YPY A C++ +K + I YE+VP
Sbjct: 186 TSYNGGCGGGLMDYAFKFIIENGGIDTEEDYPYTATDDNICNSDKKNSRVVTIDGYEDVP 245
Query: 251 SGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGAN 310
DE++L KA++ QP+S+AI A FQ YK G+F G CGT LDH V VG+G +E G +
Sbjct: 246 QNDEKSLKKALANQPISVAIEAGGRAFQLYKSGVFTGTCGTSLDHGVVAVGYG-SEGGQD 304
Query: 311 YWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
YW+++NSWG+ WG++GY K+ R+ G CG+ +SYP
Sbjct: 305 YWIVRNSWGSNWGESGYFKLERNIKESSGKCGVAMMASYP 344
>gi|111073715|dbj|BAF02546.1| triticain alpha [Triticum aestivum]
gi|388890585|gb|AFK80346.1| cysteine endopeptidase EP alpha [Secale cereale x Triticum durum]
Length = 461
Score = 295 bits (756), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 145/323 (44%), Positives = 209/323 (64%), Gaps = 12/323 (3%)
Query: 32 VVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN---KEGNRT 88
+VS E+ V ++ +WM++H R+Y E+E R ++F++NL YI++ N G +
Sbjct: 26 IVSYGERSEEEVRRMYAEWMSEHRRTYNAIGEEERRFEVFRDNLRYIDQHNAAADAGLHS 85
Query: 89 YKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAV 148
++LG N+F+DLTN+E+R+ Y G + R ++ +YQ ++P ++DWR KGAV
Sbjct: 86 FRLGLNRFADLTNEEYRSTYLGARTKPDRERKLSA---RYQADDNEELPETVDWRKKGAV 142
Query: 149 TPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAF 208
IK+Q CG CWAF+A+AAVEGI +I +G++I LSEQ+L+DC T+ N GC GG + AF
Sbjct: 143 AAIKDQGGCGSCWAFSAIAAVEGINQIVTGDMIPLSEQELVDCDTSYNEGCNGGLMDYAF 202
Query: 209 AYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVS 267
+II N GI +E++YPY+ C A +K A I YE+VP E++L KAV+ QP+S
Sbjct: 203 EFIINNGGIDSEEDYPYKERDNRCDANKKNAKVVTIDGYEDVPVNSEKSLQKAVANQPIS 262
Query: 268 IAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGY 327
+AI A FQ YK GIF G CGT LDH V VG+G TE+G +YWL++NSWG WG+ GY
Sbjct: 263 VAIEAGGRAFQLYKSGIFTGTCGTALDHGVAAVGYG-TENGKDYWLVRNSWGTVWGEDGY 321
Query: 328 MKIVRD----EGLCGIGTRSSYP 346
+++ R+ G CGI SYP
Sbjct: 322 IRMERNIKASSGKCGIAVEPSYP 344
>gi|18396952|ref|NP_564322.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|332192922|gb|AEE31043.1| cysteine proteinase-like protein [Arabidopsis thaliana]
Length = 334
Score = 295 bits (756), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 154/346 (44%), Positives = 219/346 (63%), Gaps = 25/346 (7%)
Query: 13 INTTPMFIIITLLVS--CASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKI 70
++ +F+ +T+L SQ + +EQS+V+ H++WM Q R YKDE EKEMRLK+
Sbjct: 2 VSVRSVFVALTILSMDLRISQARPHVTLNEQSIVDYHQQWMTQFSRVYKDESEKEMRLKV 61
Query: 71 FKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQN 130
FK+NL++IE N GN++Y LG N+F+D +EF A +TG ++ S + T +N
Sbjct: 62 FKKNLKFIENFNNMGNQSYTLGVNEFTDWKTEEFLATHTGLRVNVTSLSELFNKTKPSRN 121
Query: 131 LSMTDVPT---SLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQ 187
+M+D+ S DWRD+GAVTP+K Q C +TKI NL+ LSEQQ
Sbjct: 122 WNMSDIDMEDESKDWRDEGAVTPVKYQGAC-------------RLTKISGKNLLTLSEQQ 168
Query: 188 LLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSA-AQKPAAAKISNY 246
L+DC N GC GG E+AF YII+N G++ E EYPYQ +C A A++ +I +
Sbjct: 169 LIDCDIEKNGGCNGGEFEEAFKYIIKNGGVSLETEYPYQVKKESCRANARRAPHTQIRGF 228
Query: 247 EEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGV-CGTQLDHAVTIVGFGTT 305
+ VPS +E+ALL+AV QPVS+ I A + F YK G++ G+ CGT ++HAVTIVG+GT
Sbjct: 229 QMVPSHNERALLEAVRRQPVSVLIDARADSFGHYKGGVYAGLDCGTDVNHAVTIVGYGTM 288
Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
G NYW++KNSWG +WG+ GYM+I RD +G+CGI ++YP+
Sbjct: 289 -SGLNYWVLKNSWGESWGENGYMRIRRDVEWPQGMCGIAQVAAYPV 333
>gi|357143305|ref|XP_003572875.1| PREDICTED: xylem cysteine proteinase 1-like [Brachypodium
distachyon]
Length = 473
Score = 295 bits (756), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 148/311 (47%), Positives = 208/311 (66%), Gaps = 14/311 (4%)
Query: 43 VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTND 102
+V++ W +H + Y EK R ++FK+NL++I + N+ N +Y LG NQF+D+ ++
Sbjct: 44 LVDLFSSWSVKHSKIYVSPEEKVKRYEVFKQNLKHIVETNRR-NGSYWLGLNQFADVAHE 102
Query: 103 EFRALYTGYK--MPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCC 160
EF++ Y G K M P+ T F+Y+N ++P S+DWR KGAVTP+KNQ ECG C
Sbjct: 103 EFKSTYLGLKTGMDGPARAPTA---FRYEN--SVNLPWSVDWRKKGAVTPVKNQGECGSC 157
Query: 161 WAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATE 220
WAF+ VAAVEGI +I +G L LSEQ+L+DC T ++GC GG + AFAYI+ N GI T+
Sbjct: 158 WAFSTVAAVEGINQIATGKLESLSEQELMDCDTTFDHGCGGGFMDFAFAYIMGNLGIHTD 217
Query: 221 DEYPYQAVPGTCSAAQ-KPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQS 279
D+YPY G C Q + IS YE+VP E +LLKA++ QP+S+ IAA S +FQ
Sbjct: 218 DDYPYLMEEGYCKEKQPQSKVVTISGYEDVPENSEVSLLKALAHQPISVGIAAGSKDFQF 277
Query: 280 YKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR----DEG 335
YK G+F G CGT+LDHA+T VG+G++ DG +Y ++KNSWG +WG+ GY +I R EG
Sbjct: 278 YKRGVFEGSCGTELDHALTAVGYGSS-DGQDYIIMKNSWGKSWGEQGYFRIKRGTGKPEG 336
Query: 336 LCGIGTRSSYP 346
+C I + +SYP
Sbjct: 337 VCSIYSMASYP 347
>gi|297816030|ref|XP_002875898.1| hypothetical protein ARALYDRAFT_485194 [Arabidopsis lyrata subsp.
lyrata]
gi|297321736|gb|EFH52157.1| hypothetical protein ARALYDRAFT_485194 [Arabidopsis lyrata subsp.
lyrata]
Length = 363
Score = 295 bits (755), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 148/318 (46%), Positives = 208/318 (65%), Gaps = 17/318 (5%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
E++V +++E+W H + + E R +F+ N+ ++ + NK+ N+ YKL N+F+D+
Sbjct: 30 EENVWKLYERWRDHHSVT-RASHEALKRFNVFRHNVLHVHRTNKK-NKPYKLKVNRFADI 87
Query: 100 TNDEFRALYTG-----YKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQ 154
T+ EFR+ Y G ++M R S F Y+N+ T VP+S+DWR+KGAVT +KNQ
Sbjct: 88 THHEFRSSYAGSNVKHHRMLRGPKRG--SGGFMYENV--TRVPSSVDWREKGAVTEVKNQ 143
Query: 155 KECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQN 214
++CG CWAF+ VAAVEGI KIR+ L+ LSEQ+L+DC T N GC GG E AF +I N
Sbjct: 144 QDCGSCWAFSTVAAVEGINKIRTNKLVSLSEQELVDCDTEENQGCAGGLMEPAFEFIKNN 203
Query: 215 QGIATEDEYPYQA--VPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAA 272
GI TE+ YPY + V + + I +E VP DE+ALLKAV+ QPVS+AI A
Sbjct: 204 GGIKTEETYPYDSNDVQFCRAKSIDGETVTIDGHEHVPENDEEALLKAVAHQPVSVAIDA 263
Query: 273 YSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR 332
S++FQ Y EG+F G CGTQL+H V IVG+G T++G YW+++NSWG WG+ GY++I R
Sbjct: 264 GSSDFQLYSEGVFIGECGTQLNHGVVIVGYGETKNGTKYWIVRNSWGPEWGEGGYVRIER 323
Query: 333 ----DEGLCGIGTRSSYP 346
+EG CGI +SYP
Sbjct: 324 GISENEGRCGIAMEASYP 341
>gi|38345188|emb|CAE03344.2| OSJNBb0005B05.11 [Oryza sativa Japonica Group]
gi|125589403|gb|EAZ29753.1| hypothetical protein OsJ_13812 [Oryza sativa Japonica Group]
Length = 323
Score = 295 bits (754), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 142/336 (42%), Positives = 205/336 (61%), Gaps = 25/336 (7%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
+F I+ L C++ + + + + ++ HE+WMAQ+GR YKD+ EK R ++FK N+ +
Sbjct: 8 LFAILGCLCLCSAVLAARELSDDAAMAARHERWMAQYGRMYKDDAEKARRFEVFKANVAF 67
Query: 78 IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
IE N GN + LG NQF+DLTNDEFR+ T + R T F+ +N+++ +P
Sbjct: 68 IESFNA-GNHKFWLGVNQFADLTNDEFRSTKTNKGFIPSTTRVPTG--FRNENVNIDALP 124
Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-N 196
++DWR KG VTPIK+Q +CGCCWAF+AVAA+E +L+DC +G +
Sbjct: 125 ATMDWRTKGVVTPIKDQGQCGCCWAFSAVAAME----------------ELVDCDVHGED 168
Query: 197 NGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQA 256
GC GG + AF +II+N G+ TE YPY AV + + A I YE+VP+ +E A
Sbjct: 169 QGCEGGLMDDAFKFIIKNGGLTTESNYPYAAVDDKFKSVSN-SVASIKGYEDVPANNEAA 227
Query: 257 LLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKN 316
L+KAV+ QPVS+A+ FQ YK G+ G CGT LDH + +G+G DG YWL+KN
Sbjct: 228 LMKAVANQPVSVAVDGGDMTFQFYKGGVMTGSCGTDLDHGIVAIGYGKASDGTKYWLLKN 287
Query: 317 SWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
SWG TWG+ G++++ +D G+CG+ SYP A
Sbjct: 288 SWGMTWGENGFLRMEKDISDKRGMCGLAMEPSYPTA 323
>gi|262360187|gb|ACY38051.2| cysteine proteinase C1A [Dactylis glomerata]
Length = 365
Score = 295 bits (754), Expect = 3e-77, Method: Compositional matrix adjust.
Identities = 149/321 (46%), Positives = 203/321 (63%), Gaps = 20/321 (6%)
Query: 40 EQSVVEIHEKWMAQHG---RSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQF 96
E+S+ ++E W + H R E E R +FKEN+ YI +ANK+ +R ++L N+F
Sbjct: 33 EESLRGLYETWRSHHTVSRRGLGAEAEAR-RFNVFKENVRYIHEANKK-DRPFRLALNKF 90
Query: 97 SDLTNDEFRALYTGYKMPSPSHRS------TTSSTFKYQNLSMTDVPTSLDWRDKGAVTP 150
+D+T DEFR Y G ++ HRS +F Y + ++P ++DWR KGAVTP
Sbjct: 91 ADMTTDEFRRTYAGSRVRH--HRSLSGGRRQGGGSFMYADAE--NLPAAVDWRQKGAVTP 146
Query: 151 IKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAY 210
IK+Q +CG CWAF+ + AVEGI KIR+G L+ LSEQ+L+DC+ N+GC GG + AF +
Sbjct: 147 IKDQGQCGSCWAFSTIVAVEGINKIRTGRLVSLSEQELMDCNIGENDGCNGGLMDVAFQF 206
Query: 211 IIQNQGIATEDEYPYQAVPGTCSAAQKPAA-AKISNYEEVPSGDEQALLKAVSMQPVSIA 269
I QN GI TE YPYQ +C +++ + I YE+VP+ DE AL KAV+ QPVS+A
Sbjct: 207 IQQNGGITTEASYPYQGEQNSCDQSKENSHDVSIDGYEDVPANDESALQKAVANQPVSVA 266
Query: 270 IAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMK 329
I A +FQ Y EG+F GT LDH V VG+GTT DG YW++KNSWG WG+ GY++
Sbjct: 267 IDASGNDFQFYSEGVFTTDGGTDLDHGVAAVGYGTTRDGTKYWIVKNSWGEDWGEKGYIR 326
Query: 330 IVRD----EGLCGIGTRSSYP 346
+ R EGLCGI +SYP
Sbjct: 327 MQRGVKQAEGLCGIAMEASYP 347
>gi|312451845|gb|ADQ85986.1| actinidin [Actinidia chinensis]
Length = 380
Score = 295 bits (754), Expect = 3e-77, Method: Compositional matrix adjust.
Identities = 153/340 (45%), Positives = 211/340 (62%), Gaps = 10/340 (2%)
Query: 13 INTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFK 72
++ + +F L++S A + V ++E W+ ++G+SY E E R +IFK
Sbjct: 8 VSMSLLFFSTLLILSLAFNAKNLTQRTNDEVKAMYESWLIKYGKSYNSLGEWERRFEIFK 67
Query: 73 ENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
E L +I++ N + NR+YK+G NQF+DLT++EFR+ Y G+ S S+++ S+ +Y+
Sbjct: 68 ETLRFIDEHNADTNRSYKVGLNQFADLTDEEFRSTYLGFT--SGSNKTKVSN--RYEPRV 123
Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
+P+ +DWR GAV IK+Q ECG CWAF+A+A VEGI KI +G LI LSEQ+L+DC
Sbjct: 124 GQVLPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSEQELIDCG 183
Query: 193 -TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSA-AQKPAAAKISNYEEVP 250
T GC G F +II N GI TE+ YPY A G C+ Q I YE VP
Sbjct: 184 RTQNTRGCNGSYITDGFPFIINNGGINTEENYPYTAQDGECNVDLQNEKYVTIDTYENVP 243
Query: 251 SGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGAN 310
+E AL AV+ QPVS+A+ A F+ Y GIF G CGT +DHAVTIVG+G TE G +
Sbjct: 244 YNNEWALQTAVTYQPVSVALDAAGDAFKQYSSGIFTGPCGTAIDHAVTIVGYG-TEGGID 302
Query: 311 YWLIKNSWGNTWGDAGYMKIVRD---EGLCGIGTRSSYPL 347
YW++KNSW TWG+ GYM+I+R+ G CGI T SYP+
Sbjct: 303 YWIVKNSWDTTWGEEGYMRILRNVGGAGTCGIATMPSYPV 342
>gi|18408616|ref|NP_566901.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|75313880|sp|Q9STL5.1|CEP3_ARATH RecName: Full=KDEL-tailed cysteine endopeptidase CEP3; Flags:
Precursor
gi|4678353|emb|CAB41163.1| cysteine endopeptidase precursor-like protein [Arabidopsis
thaliana]
gi|26453052|dbj|BAC43602.1| putative cysteine endopeptidase precursor [Arabidopsis thaliana]
gi|332644885|gb|AEE78406.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 364
Score = 295 bits (754), Expect = 3e-77, Method: Compositional matrix adjust.
Identities = 148/318 (46%), Positives = 205/318 (64%), Gaps = 17/318 (5%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
E++V +++E+W H S + E R +F+ N+ ++ + NK+ N+ YKL N+F+D+
Sbjct: 31 EENVWKLYERWRGHHSVS-RASHEAIKRFNVFRHNVLHVHRTNKK-NKPYKLKINRFADI 88
Query: 100 TNDEFRALYTG-----YKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQ 154
T+ EFR+ Y G ++M R S F Y+N+ T VP+S+DWR+KGAVT +KNQ
Sbjct: 89 THHEFRSSYAGSNVKHHRMLRGPKRG--SGGFMYENV--TRVPSSVDWREKGAVTEVKNQ 144
Query: 155 KECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQN 214
++CG CWAF+ VAAVEGI KIR+ L+ LSEQ+L+DC T N GC GG E AF +I N
Sbjct: 145 QDCGSCWAFSTVAAVEGINKIRTNKLVSLSEQELVDCDTEENQGCAGGLMEPAFEFIKNN 204
Query: 215 QGIATEDEYPYQAVPGTCSAAQKPAA--AKISNYEEVPSGDEQALLKAVSMQPVSIAIAA 272
GI TE+ YPY + A I +E VP DE+ LLKAV+ QPVS+AI A
Sbjct: 205 GGIKTEETYPYDSSDVQFCRANSIGGETVTIDGHEHVPENDEEELLKAVAHQPVSVAIDA 264
Query: 273 YSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR 332
S++FQ Y EG+F G CGTQL+H V IVG+G T++G YW+++NSWG WG+ GY++I R
Sbjct: 265 GSSDFQLYSEGVFIGECGTQLNHGVVIVGYGETKNGTKYWIVRNSWGPEWGEGGYVRIER 324
Query: 333 ----DEGLCGIGTRSSYP 346
+EG CGI +SYP
Sbjct: 325 GISENEGRCGIAMEASYP 342
>gi|89274062|dbj|BAE80740.1| cysteine proteinase [Platycodon grandiflorus]
Length = 462
Score = 294 bits (753), Expect = 4e-77, Method: Compositional matrix adjust.
Identities = 149/325 (45%), Positives = 203/325 (62%), Gaps = 10/325 (3%)
Query: 29 ASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRT 88
A+ S + V+ ++E W+ +HG+SY EKE R +IFK+NL +I++ N E N +
Sbjct: 32 ATHASKSSWRTDDEVMAMYESWLVKHGKSYNALGEKEKRFQIFKDNLRFIDEHNAEENLS 91
Query: 89 YKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAV 148
YK+G N+F+DLTN+E+R+ Y G K P S +Y +P S+DWR KGAV
Sbjct: 92 YKVGLNRFADLTNEEYRSTYLGAK-SKPKLSKVKSD--RYAPRVGDSLPESVDWRAKGAV 148
Query: 149 TPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAF 208
PIK+Q CG CWAF+ V AVEGI +I +G LI LSEQ+L+DC + N GC GG + F
Sbjct: 149 APIKDQGSCGSCWAFSTVNAVEGINQIVTGELITLSEQELVDCDKSYNEGCDGGLMDYGF 208
Query: 209 AYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVS 267
+II N GI T+ +YPY C +K A I +YE+VP +E+AL KAV+ QPVS
Sbjct: 209 EFIINNGGIDTDKDYPYLGRDARCDQYRKNAKVVTIDSYEDVPVNNEEALKKAVASQPVS 268
Query: 268 IAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGY 327
+ I FQ Y GIF G CGT LDH V +VG+G TE G +YW+++NSWG++WG+AGY
Sbjct: 269 VGIEGGGRAFQFYDSGIFTGKCGTALDHGVNVVGYG-TEKGKDYWIVRNSWGSSWGEAGY 327
Query: 328 MKIVRD-----EGLCGIGTRSSYPL 347
+++ R+ G CGI SYPL
Sbjct: 328 IRMERNLAGTSVGKCGIAMEPSYPL 352
>gi|356508490|ref|XP_003522989.1| PREDICTED: xylem cysteine proteinase 2-like [Glycine max]
Length = 349
Score = 294 bits (753), Expect = 4e-77, Method: Compositional matrix adjust.
Identities = 142/309 (45%), Positives = 210/309 (67%), Gaps = 11/309 (3%)
Query: 43 VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTND 102
++E+ E WM++HG+ Y+ EK +R +IFK+NL++I++ NK + Y LG N+F+DL++
Sbjct: 43 LIELFESWMSKHGKIYQSIEEKLLRFEIFKDNLKHIDERNKVVS-NYWLGLNEFADLSHQ 101
Query: 103 EFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWA 162
EF+ Y G K+ S R + F Y+++ ++P S+DWR KGAV P+KNQ CG CWA
Sbjct: 102 EFKNKYLGLKVDY-SRRRESPEEFTYKDV---ELPKSVDWRKKGAVAPVKNQGSCGSCWA 157
Query: 163 FAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDE 222
F+ VAAVEGI +I +GNL LSEQ+L+DC NNGC GG + AF++I++N G+ E++
Sbjct: 158 FSTVAAVEGINQIVTGNLTSLSEQELIDCDRTYNNGCNGGLMDYAFSFIVENGGLHKEED 217
Query: 223 YPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYK 281
YPY GTC ++ IS Y +VP +EQ+LLKA++ QP+S+AI A +FQ Y
Sbjct: 218 YPYIMEEGTCEMTKEETEVVTISGYHDVPQNNEQSLLKALANQPLSVAIEASGRDFQFYS 277
Query: 282 EGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLC 337
G+F+G CG+ LDH V VG+GT + G +Y ++KNSWG+ WG+ GY+++ R+ EG+C
Sbjct: 278 GGVFDGHCGSDLDHGVAAVGYGTAK-GVDYIIVKNSWGSKWGEKGYIRMRRNIGKPEGIC 336
Query: 338 GIGTRSSYP 346
GI +SYP
Sbjct: 337 GIYKMASYP 345
>gi|37780039|gb|AAP32192.1| cysteine protease 14 [Trifolium repens]
Length = 351
Score = 294 bits (753), Expect = 4e-77, Method: Compositional matrix adjust.
Identities = 142/310 (45%), Positives = 212/310 (68%), Gaps = 11/310 (3%)
Query: 43 VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTND 102
++E+ E WM++HG+ Y+ EK +R ++FK+NL++I+ NK + Y LG N+F+DL++
Sbjct: 43 LIELFESWMSRHGKIYETIEEKLLRFEVFKDNLKHIDDRNKIVS-NYWLGLNEFADLSHQ 101
Query: 103 EFRALYTGYKMPSPSHR-STTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCW 161
EF+ Y G K+ R S+ F Y+++ D+P S+DWR KGAVTP+KNQ +CG CW
Sbjct: 102 EFKNKYLGLKVDLSQRRESSNEEEFTYRDV---DLPKSVDWRKKGAVTPVKNQGQCGSCW 158
Query: 162 AFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATED 221
AF+ VAAVEGI +I +GNL LSEQ+L+DC T NNGC GG + AF++I QN G+ E+
Sbjct: 159 AFSTVAAVEGINQIVTGNLTSLSEQELIDCDTTYNNGCNGGLMDYAFSFIGQNGGLHKEE 218
Query: 222 EYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSY 280
+YPY TC ++ I+ Y +VP +EQ+LLKA++ QP+S+AI A S +FQ Y
Sbjct: 219 DYPYIMEESTCEMKKEETQVVTINGYHDVPQNNEQSLLKALANQPLSVAIEASSRDFQFY 278
Query: 281 KEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGL 336
G+F+G CG+ LDH V+ VG+GT+++ +Y ++KNSWG WG+ G++++ RD EG+
Sbjct: 279 SGGVFDGHCGSDLDHGVSAVGYGTSKN-LDYIIVKNSWGAKWGEKGFIRMKRDIGKPEGI 337
Query: 337 CGIGTRSSYP 346
CG+ +SYP
Sbjct: 338 CGLYKMASYP 347
>gi|190358935|sp|P00785.4|ACTN_ACTCH RecName: Full=Actinidain; Short=Actinidin; AltName: Allergen=Act c
1; Flags: Precursor
gi|12744965|gb|AAK06862.1|AF343446_1 actinidin protease [Actinidia chinensis]
Length = 380
Score = 294 bits (752), Expect = 5e-77, Method: Compositional matrix adjust.
Identities = 153/340 (45%), Positives = 211/340 (62%), Gaps = 10/340 (2%)
Query: 13 INTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFK 72
++ + +F L++S A + V ++E W+ ++G+SY E E R +IFK
Sbjct: 8 VSMSLLFFSTLLILSLAFNAKNLTQRTNDEVKAMYESWLIKYGKSYNSLGEWERRFEIFK 67
Query: 73 ENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
E L +I++ N + NR+YK+G NQF+DLT++EFR+ Y + S S+++ S+ +Y+
Sbjct: 68 ETLRFIDEHNADTNRSYKVGLNQFADLTDEEFRSTYL--RFTSGSNKTKVSN--RYEPRV 123
Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
+P+ +DWR GAV IK+Q ECG CWAF+A+A VEGI KI +G LI LSEQ+L+DC
Sbjct: 124 GQVLPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSEQELIDCG 183
Query: 193 -TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSA-AQKPAAAKISNYEEVP 250
T GC GG F +II N GI TE+ YPY A G C+ Q I YE VP
Sbjct: 184 RTQNTRGCNGGYITDGFQFIINNGGINTEENYPYTAQDGECNVDLQNEKYVTIDTYENVP 243
Query: 251 SGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGAN 310
+E AL AV+ QPVS+A+ A F+ Y GIF G CGT +DHAVTIVG+G TE G +
Sbjct: 244 YNNEWALQTAVTYQPVSVALDAAGDAFKQYSSGIFTGPCGTAVDHAVTIVGYG-TEGGID 302
Query: 311 YWLIKNSWGNTWGDAGYMKIVRD---EGLCGIGTRSSYPL 347
YW++KNSW TWG+ GYM+I+R+ G CGI T SYP+
Sbjct: 303 YWIVKNSWDTTWGEEGYMRILRNVGGAGTCGIATMPSYPV 342
>gi|296081395|emb|CBI16828.3| unnamed protein product [Vitis vinifera]
Length = 359
Score = 294 bits (752), Expect = 5e-77, Method: Compositional matrix adjust.
Identities = 145/314 (46%), Positives = 204/314 (64%), Gaps = 8/314 (2%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
E+S+ +++E+W + H S +D EK R +FKEN +++ K N+ ++ YKL N+F+D+
Sbjct: 33 EESLWDLYERWRSYHTVS-RDLEEKNKRFNVFKENTKHVHKVNQM-DKPYKLKLNKFADM 90
Query: 100 TNDEFRALYTGYKMPSPSH-RSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECG 158
TN EFR+ Y G K+ R T + + T +P S+DWR KGAVT IK+Q +CG
Sbjct: 91 TNHEFRSSYGGSKVKHYRMLRGDRRGTGGFMHEKTTYLPPSVDWRKKGAVTGIKDQGKCG 150
Query: 159 CCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIA 218
CWAF+ V VEGI +I++ L+ LSEQQL+DC + ++GC GG E AF +I +N GI
Sbjct: 151 SCWAFSTVVGVEGINQIKTKELLSLSEQQLIDCDRSDDHGCNGGLMESAFEFIKKNGGIT 210
Query: 219 TEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEF 277
TE+ YPY+A C + A I +E VP DE+AL+KAV+ QPVS+AI A ++
Sbjct: 211 TENNYPYKAKDERCDMLKMNAPVVTIDGHESVPVNDERALMKAVAHQPVSVAIDAGGSDL 270
Query: 278 QSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD---- 333
Q Y EG+F+G CGT+LDH V IVG+GTT DG YW++KNSWG WG+ GY+++ R
Sbjct: 271 QFYSEGVFDGECGTELDHGVAIVGYGTTLDGTKYWIVKNSWGAEWGEKGYIRMARGIQAA 330
Query: 334 EGLCGIGTRSSYPL 347
EG CGI +SYP+
Sbjct: 331 EGQCGIAMEASYPV 344
>gi|359473128|ref|XP_002285397.2| PREDICTED: vignain-like [Vitis vinifera]
Length = 357
Score = 293 bits (751), Expect = 6e-77, Method: Compositional matrix adjust.
Identities = 145/314 (46%), Positives = 204/314 (64%), Gaps = 8/314 (2%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
E+S+ +++E+W + H S +D EK R +FKEN +++ K N+ ++ YKL N+F+D+
Sbjct: 31 EESLWDLYERWRSYHTVS-RDLEEKNKRFNVFKENTKHVHKVNQM-DKPYKLKLNKFADM 88
Query: 100 TNDEFRALYTGYKMPSPSH-RSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECG 158
TN EFR+ Y G K+ R T + + T +P S+DWR KGAVT IK+Q +CG
Sbjct: 89 TNHEFRSSYGGSKVKHYRMLRGDRRGTGGFMHEKTTYLPPSVDWRKKGAVTGIKDQGKCG 148
Query: 159 CCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIA 218
CWAF+ V VEGI +I++ L+ LSEQQL+DC + ++GC GG E AF +I +N GI
Sbjct: 149 SCWAFSTVVGVEGINQIKTKELLSLSEQQLIDCDRSDDHGCNGGLMESAFEFIKKNGGIT 208
Query: 219 TEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEF 277
TE+ YPY+A C + A I +E VP DE+AL+KAV+ QPVS+AI A ++
Sbjct: 209 TENNYPYKAKDERCDMLKMNAPVVTIDGHESVPVNDERALMKAVAHQPVSVAIDAGGSDL 268
Query: 278 QSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD---- 333
Q Y EG+F+G CGT+LDH V IVG+GTT DG YW++KNSWG WG+ GY+++ R
Sbjct: 269 QFYSEGVFDGECGTELDHGVAIVGYGTTLDGTKYWIVKNSWGAEWGEKGYIRMARGIQAA 328
Query: 334 EGLCGIGTRSSYPL 347
EG CGI +SYP+
Sbjct: 329 EGQCGIAMEASYPV 342
>gi|255538788|ref|XP_002510459.1| cysteine protease, putative [Ricinus communis]
gi|223551160|gb|EEF52646.1| cysteine protease, putative [Ricinus communis]
Length = 422
Score = 293 bits (751), Expect = 6e-77, Method: Compositional matrix adjust.
Identities = 147/340 (43%), Positives = 213/340 (62%), Gaps = 11/340 (3%)
Query: 13 INTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFK 72
+N +ITLL + S + + ++ E W +HG++Y + +K R KIF+
Sbjct: 1 MNFLSALFLITLLFF---NLSISSFSSSSDISKLFESWTKEHGKTYTSKEDKLYRFKIFE 57
Query: 73 ENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
EN E+++K N +GN +Y L N F+DLT+ EF+A G S S + + F +
Sbjct: 58 ENYEFVKKHNSQGNSSYTLSLNAFADLTHHEFKASRLGLSAFSTSGK-LSRRNFPLHDF- 115
Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
+ DVP S+DWR KGAV+ +K+Q CG CW+F+A A+EGI KI +G+L+ LSEQ+L+DC
Sbjct: 116 VGDVPISIDWRKKGAVSQVKDQGNCGACWSFSATGAIEGINKIVTGSLVSLSEQELVDCD 175
Query: 193 TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQ-KPAAAKISNYEEVPS 251
+ NNGC GG + A+ ++I+N GI TE++YPYQA TC+ + K I Y +VP
Sbjct: 176 RSYNNGCEGGLMDYAYQFVIENNGIDTEEDYPYQAREKTCNKEKLKRHVVTIDGYTDVPQ 235
Query: 252 GDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANY 311
+E+ LLKAV+ QPVS+ I FQ Y +GIF G C T LDHAV IVG+G +E+G +Y
Sbjct: 236 NNEKELLKAVAAQPVSVGICGSERAFQLYSKGIFTGPCSTSLDHAVLIVGYG-SENGVDY 294
Query: 312 WLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
W++KNSWG WG GYM ++R+ +GLCGI +S+P+
Sbjct: 295 WIVKNSWGTHWGINGYMYMLRNSGNSQGLCGINMLASFPV 334
>gi|32396020|gb|AAP41847.1| senescence-associated cysteine protease [Anthurium andraeanum]
Length = 460
Score = 293 bits (751), Expect = 6e-77, Method: Compositional matrix adjust.
Identities = 151/325 (46%), Positives = 208/325 (64%), Gaps = 12/325 (3%)
Query: 32 VVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANK-EGNRTYK 90
+V+ R+ E+ V ++E W+ +G++Y EKE R +IF +NL YI+ N+ E N +Y
Sbjct: 25 IVAERT--EEEVRLLYEGWLVGNGKAYNLLGEKERRFEIFWDNLRYIDDHNRAENNHSYT 82
Query: 91 LGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT--DVPTSLDWRDKGAV 148
LG +F+DLTN+E+R+ Y G K R + + ++LS D+P +DWR+KGAV
Sbjct: 83 LGLTRFADLTNEEYRSTYLGVKPGQVRPRRANRAPGRGRDLSANGDDLPQKVDWREKGAV 142
Query: 149 TPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAF 208
PIK+Q CG CWAF+ VAAVEGI +I +G+LI LSEQ+L+DC T N GC GG + AF
Sbjct: 143 APIKDQGGCGSCWAFSTVAAVEGINQIVTGDLIVLSEQELVDCDTAYNEGCNGGLMDYAF 202
Query: 209 AYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVS 267
+II N GI TE++YPY+ G C +K A I +YE+V DE AL AV+ QPVS
Sbjct: 203 QFIISNGGIDTEEDYPYKERDGLCDPNRKNAKVVSIDSYEDVLENDEHALKTAVAHQPVS 262
Query: 268 IAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGY 327
+AI FQ YK GIF+G CG LDH V VG+G TE G +YW+++NSWG +WG+AGY
Sbjct: 263 VAIEGGGRSFQLYKSGIFDGRCGIDLDHGVVAVGYG-TESGKDYWIVRNSWGKSWGEAGY 321
Query: 328 MKIVRD-----EGLCGIGTRSSYPL 347
+++ R+ G CGI SYP+
Sbjct: 322 IRMERNLPSSSSGKCGIAIEPSYPI 346
>gi|535473|emb|CAA53377.1| cysteine protease [Vicia sativa]
Length = 368
Score = 293 bits (751), Expect = 6e-77, Method: Compositional matrix adjust.
Identities = 148/337 (43%), Positives = 208/337 (61%), Gaps = 11/337 (3%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
F +IT ++ Q+ + RS E V+ ++E+W+ +H + Y EK+ R +IFK+NL +
Sbjct: 12 FFSLITFSLALDIQLPTGRSNDE--VMTMYEEWLVKHQKVYNGLREKDQRFQIFKDNLNF 69
Query: 78 IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSST-FKYQNLSMTDV 136
I++ N + N TY +G N+F+D+TN+E+R +Y G + T +Y S +
Sbjct: 70 IDEHNAQ-NYTYIVGLNKFADMTNEEYRDMYLGTRSDIKRRIMKNKITGHRYAYNSGDRL 128
Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGN 196
P +DWR KGA+T IK+Q CG CWAF+ +A VE I KI +G L+ LSEQ+L+DC N
Sbjct: 129 PVHVDWRLKGAITHIKDQGSCGSCWAFSTIATVEAINKIVTGKLVSLSEQELVDCDRAFN 188
Query: 197 NGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAA-AKISNYEEVPSGDEQ 255
GC GG + AF +II N GI T+ YPY+ G C +K A I YE+VPS +E
Sbjct: 189 EGCNGGLMDYAFEFIIGNGGIDTDQHYPYKGFEGRCDPTRKKAKIVSIDGYEDVPSNNEN 248
Query: 256 ALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIK 315
AL KAV+ QPVS+AI A Q Y+ G+F G CGT LDHAV IVG+G +E+G +YWL++
Sbjct: 249 ALKKAVAHQPVSVAIEASGRALQLYQSGVFTGKCGTSLDHAVVIVGYG-SENGLDYWLVR 307
Query: 316 NSWGNTWGDAGYMKIVRD-----EGLCGIGTRSSYPL 347
NSWG WG+ GY K+ R+ G CGI +SYP+
Sbjct: 308 NSWGTNWGEDGYFKMERNVKGTHTGKCGIAVEASYPV 344
>gi|168057475|ref|XP_001780740.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162667829|gb|EDQ54449.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 463
Score = 293 bits (751), Expect = 6e-77, Method: Compositional matrix adjust.
Identities = 139/314 (44%), Positives = 204/314 (64%), Gaps = 13/314 (4%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
+ +++++ +W+ H R Y+ EK R +IFKEN YI NK+ ++Y LG N+FSDL
Sbjct: 42 DDAILDVFHQWLETHSRVYRSLSEKHHRFQIFKENFLYIHAHNKQ-QKSYWLGLNKFSDL 100
Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGC 159
T+ EFRA Y G K P +R + F Y+++ + +DWR KGAVT +K+Q CG
Sbjct: 101 THQEFRAQYLGTK---PVNRQRKEANFMYEDV---EAEPKVDWRLKGAVTDVKDQGACGS 154
Query: 160 CWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIAT 219
CWAF+AV +VEG+ I++G L+ LSEQ+L+DC N GC GG + AF +II+N GI T
Sbjct: 155 CWAFSAVGSVEGVNAIKTGELVSLSEQELVDCDRKQNQGCNGGLMDYAFEFIIKNGGIDT 214
Query: 220 EDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQ 278
E +YPY+A G C ++ + I +Y++VP+ E AL+KA++ PVS+AI A +FQ
Sbjct: 215 EKDYPYKARDGRCDEGRRNSKVVVIDDYQDVPTQSESALMKALTKNPVSVAIEAGGRDFQ 274
Query: 279 SYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR-----D 333
Y+ G+F G CG++LDH V VG+GT +DG NYW++KNSWG WG+ GY+++ R
Sbjct: 275 HYQGGVFTGPCGSELDHGVLAVGYGTDDDGVNYWIVKNSWGPGWGEKGYIRMERFGSDST 334
Query: 334 EGLCGIGTRSSYPL 347
+G CGI +S+P+
Sbjct: 335 DGKCGINIEASFPI 348
>gi|50355619|dbj|BAD29958.1| cysteine protease [Daucus carota]
Length = 496
Score = 293 bits (751), Expect = 7e-77, Method: Compositional matrix adjust.
Identities = 142/313 (45%), Positives = 200/313 (63%), Gaps = 7/313 (2%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
+ + E W+ HG+SY E+E R +IFK NL YI++ N +R +KLG N+F+DL
Sbjct: 38 DDEATTLFESWLVTHGKSYNALGEEEKRFQIFKNNLRYIDEQNLVEDRGFKLGLNKFADL 97
Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGC 159
TN+E+R+ YTG K + ++ + +Y LS +P S+DWR+ GAV +K+Q CG
Sbjct: 98 TNEEYRSKYTGIK-SKDLRKKVSAKSGRYATLSGESLPESVDWRESGAVATVKDQGSCGS 156
Query: 160 CWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIAT 219
CWAF+ ++AVEGI +I +G LI LSEQ+L+DC + N GC GG + AF +II N GI T
Sbjct: 157 CWAFSTISAVEGINQIATGKLITLSEQELVDCDRSYNEGCNGGLMDYAFEFIINNGGIDT 216
Query: 220 EDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQ 278
+ +YPY G C +K A I +YE+VP+ DE AL KA + QP+S+AI A +FQ
Sbjct: 217 DVDYPYTGRDGKCDQYRKNAKVVTIDSYEDVPAYDELALKKAAANQPISVAIEASGRDFQ 276
Query: 279 SYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR----DE 334
Y GIF G CG LDH V +VG+G TE+G +YW+++NSWG WG+ GY+++ R
Sbjct: 277 FYDSGIFTGKCGIALDHGVVVVGYG-TENGKDYWIVRNSWGADWGENGYLRMERGISSKT 335
Query: 335 GLCGIGTRSSYPL 347
G+CGI SYP+
Sbjct: 336 GICGIAIEPSYPV 348
>gi|146215976|gb|ABQ10190.1| actinidin Act1b [Actinidia arguta]
Length = 380
Score = 293 bits (751), Expect = 7e-77, Method: Compositional matrix adjust.
Identities = 152/335 (45%), Positives = 209/335 (62%), Gaps = 10/335 (2%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
+F L++S A + + ++E W+ ++G+SY E E R +IFKE L +
Sbjct: 13 LFFSTLLVLSLAFNAKNLTKRTNDELKAMYESWLTKYGKSYNSLGEWERRFEIFKETLRF 72
Query: 78 IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
I++ N + NR+Y++G NQF+D TN+EF++ Y G+ S S++ S+ +Y+ +P
Sbjct: 73 IDEHNADTNRSYRVGLNQFADQTNEEFQSTYLGFT--SGSNKMKVSN--RYEPRVGQVLP 128
Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-TNGN 196
+DWR GAV IK+Q +CG CWAF+A+A VEGI KI +G+LI LSEQ+L+DC T
Sbjct: 129 DYVDWRSAGAVVDIKSQGQCGSCWAFSAIATVEGINKIVTGDLISLSEQELVDCGRTQNT 188
Query: 197 NGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSA-AQKPAAAKISNYEEVPSGDEQ 255
GC GGS F +II N GI TE YPY A G C+ Q A I YE VP +E
Sbjct: 189 RGCDGGSITDGFQFIINNGGINTEANYPYTAEDGQCNLDLQNEKYASIDTYENVPYNNEW 248
Query: 256 ALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIK 315
AL AV+ QPVS+A+ A FQ Y GIF G CGT +DHAVTIVG+G TE G +YW++K
Sbjct: 249 ALQTAVAYQPVSVALEAAGDAFQHYSSGIFTGPCGTAVDHAVTIVGYG-TEGGIDYWIVK 307
Query: 316 NSWGNTWGDAGYMKIVRD---EGLCGIGTRSSYPL 347
NSW TWG+ GY++I+R+ G CGI T+ SYP+
Sbjct: 308 NSWDTTWGEEGYIRILRNVGGAGTCGIATKPSYPV 342
>gi|115441717|ref|NP_001045138.1| Os01g0907600 [Oryza sativa Japonica Group]
gi|5761329|dbj|BAA83473.1| cysteine endopeptidase [Oryza sativa]
gi|20804884|dbj|BAB92565.1| cysteine endopeptidase [Oryza sativa Japonica Group]
gi|56785107|dbj|BAD82745.1| putative cysteine proteinase [Oryza sativa Japonica Group]
gi|113534669|dbj|BAF07052.1| Os01g0907600 [Oryza sativa Japonica Group]
gi|119395242|gb|ABL74582.1| cysteine endopeptidase [Oryza sativa Japonica Group]
gi|125528777|gb|EAY76891.1| hypothetical protein OsI_04850 [Oryza sativa Indica Group]
gi|125573036|gb|EAZ14551.1| hypothetical protein OsJ_04473 [Oryza sativa Japonica Group]
Length = 371
Score = 293 bits (751), Expect = 7e-77, Method: Compositional matrix adjust.
Identities = 144/330 (43%), Positives = 199/330 (60%), Gaps = 13/330 (3%)
Query: 28 CASQVVSSRSTH-EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGN 86
CA+ R ++++ +++E+W H + EK R FK+N+ YI + NK G
Sbjct: 26 CAAIPFDERDLESDEALWDLYERWQEHH-HVPRHHGEKHRRFGAFKDNVRYIHEHNKRGG 84
Query: 87 RTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSST---FKYQNLSMTDVPTSLDWR 143
R Y+L N+F D+ +EFRA + G + F Y+ + D+P ++DWR
Sbjct: 85 RGYRLRLNRFGDMGREEFRATFAGSHANDLRRDGLAAPPLPGFMYEGVR--DLPRAVDWR 142
Query: 144 DKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGS 203
KGAVT +K+Q +CG CWAF+ V +VEGI IR+G L+ LSEQ+L+DC T N+GC GG
Sbjct: 143 RKGAVTGVKDQGKCGSCWAFSTVVSVEGINAIRTGRLVSLSEQELIDCDTADNSGCQGGL 202
Query: 204 REKAFAYIIQNQGIATEDEYPYQAVPGTCSA--AQKPAAAKISNYEEVPSGDEQALLKAV 261
E AF YI + GI TE YPY+A GTC A A++ I ++ VP+ E AL KAV
Sbjct: 203 MENAFEYIKHSGGITTESAYPYRAANGTCDAVRARRAPLVVIDGHQNVPANSEAALAKAV 262
Query: 262 SMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNT 321
+ QPVS+AI A FQ Y +G+F G CGT LDH V +VG+G T DG YW++KNSWG
Sbjct: 263 ANQPVSVAIDAGDQSFQFYSDGVFAGDCGTDLDHGVAVVGYGETNDGTEYWIVKNSWGTA 322
Query: 322 WGDAGYMKIVRDE----GLCGIGTRSSYPL 347
WG+ GY+++ RD GLCGI +SYP+
Sbjct: 323 WGEGGYIRMQRDSGYDGGLCGIAMEASYPV 352
>gi|194352756|emb|CAQ00106.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
Length = 476
Score = 293 bits (750), Expect = 8e-77, Method: Compositional matrix adjust.
Identities = 142/320 (44%), Positives = 206/320 (64%), Gaps = 14/320 (4%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDEL----EKEMRLKIFKENLEYIEKANKE---GNRTYKLG 92
E +++ W+A+HG ++E R F +NL +++ N G ++L
Sbjct: 45 EAEARAVYDLWLAEHGGGSSPNANSIADRERRFSAFWDNLRFVDAHNARAAAGEEGFRLA 104
Query: 93 TNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIK 152
N+F+DLTNDEFRA Y G K + +R+ +Y++ ++P ++DWR+KGAV P+K
Sbjct: 105 MNRFADLTNDEFRAAYLGVKGAAERNRAGRVVGDRYRHDGAEELPEAVDWREKGAVAPVK 164
Query: 153 NQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFAYI 211
NQ +CG CWAF+AV+ VE I +I +G ++ LSEQ+L++C NG ++GC GG + AF +I
Sbjct: 165 NQGQCGSCWAFSAVSTVESINQIVTGEMVTLSEQELVECDINGQSSGCNGGLMDDAFEFI 224
Query: 212 IQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAI 270
I+N GI TED+YPY+AV G C +K A I +E+VP DE++L KAV+ PVS+AI
Sbjct: 225 IKNGGIDTEDDYPYKAVDGRCDVLRKNAKVVSIDGFEDVPENDEKSLQKAVAHHPVSVAI 284
Query: 271 AAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKI 330
A EFQ Y G+F+G CGTQLDH V VG+G TE+G +YW+++NSWG WG+AGY+++
Sbjct: 285 EAGGREFQLYHSGVFSGRCGTQLDHGVVAVGYG-TENGKDYWIVRNSWGPNWGEAGYLRM 343
Query: 331 VRD----EGLCGIGTRSSYP 346
R+ G CGI SSYP
Sbjct: 344 ERNINVTSGKCGIAMMSSYP 363
>gi|326507362|dbj|BAK03074.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 473
Score = 293 bits (750), Expect = 9e-77, Method: Compositional matrix adjust.
Identities = 142/320 (44%), Positives = 206/320 (64%), Gaps = 14/320 (4%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDEL----EKEMRLKIFKENLEYIEKANKE---GNRTYKLG 92
E +++ W+A+HG ++E R F +NL +++ N G ++L
Sbjct: 45 EAEARAVYDLWLAEHGGGSSPNANSIADRERRFSAFWDNLRFVDAHNARAAAGEEGFRLA 104
Query: 93 TNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIK 152
N+F+DLTNDEFRA Y G K + +R+ +Y++ ++P ++DWR+KGAV P+K
Sbjct: 105 MNRFADLTNDEFRAAYLGVKGAAERNRAGRVVGERYRHDGAEELPEAVDWREKGAVAPVK 164
Query: 153 NQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFAYI 211
NQ +CG CWAF+AV+ VE I +I +G ++ LSEQ+L++C NG ++GC GG + AF +I
Sbjct: 165 NQGQCGSCWAFSAVSTVESINQIVTGEMVTLSEQELVECDINGQSSGCNGGLMDDAFEFI 224
Query: 212 IQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAI 270
I+N GI TED+YPY+AV G C +K A I +E+VP DE++L KAV+ PVS+AI
Sbjct: 225 IKNGGIDTEDDYPYKAVDGRCDVLRKNAKVVSIDGFEDVPENDEKSLQKAVAHHPVSVAI 284
Query: 271 AAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKI 330
A EFQ Y G+F+G CGTQLDH V VG+G TE+G +YW+++NSWG WG+AGY+++
Sbjct: 285 EAGGREFQLYHSGVFSGRCGTQLDHGVVAVGYG-TENGKDYWIVRNSWGPNWGEAGYLRM 343
Query: 331 VRD----EGLCGIGTRSSYP 346
R+ G CGI SSYP
Sbjct: 344 ERNINVTSGKCGIAMMSSYP 363
>gi|204307508|gb|ACI00280.1| triticain beta 2 [Hordeum vulgare]
Length = 473
Score = 293 bits (750), Expect = 9e-77, Method: Compositional matrix adjust.
Identities = 142/320 (44%), Positives = 206/320 (64%), Gaps = 14/320 (4%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDEL----EKEMRLKIFKENLEYIEKANKE---GNRTYKLG 92
E +++ W+A+HG ++E R F +NL +++ N G ++L
Sbjct: 45 EAEARAVYDLWLAEHGGGSSPNANSIADRERRFSAFWDNLRFVDAHNARAAAGEEGFRLA 104
Query: 93 TNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIK 152
N+F+DLTNDEFRA Y G K + +R+ +Y++ ++P ++DWR+KGAV P+K
Sbjct: 105 MNRFADLTNDEFRAAYLGVKGAAERNRAGRVVGERYRHDGAEELPEAVDWREKGAVAPVK 164
Query: 153 NQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFAYI 211
NQ +CG CWAF+AV+ VE I +I +G ++ LSEQ+L++C NG ++GC GG + AF +I
Sbjct: 165 NQGQCGSCWAFSAVSTVESINQIVTGEMVTLSEQELVECDINGQSSGCNGGLMDDAFEFI 224
Query: 212 IQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAI 270
I+N GI TED+YPY+AV G C +K A I +E+VP DE++L KAV+ PVS+AI
Sbjct: 225 IKNGGIDTEDDYPYKAVDGRCDVLRKNAKVVSIDGFEDVPENDEKSLQKAVAHHPVSVAI 284
Query: 271 AAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKI 330
A EFQ Y G+F+G CGTQLDH V VG+G TE+G +YW+++NSWG WG+AGY+++
Sbjct: 285 EAGGREFQLYHSGVFSGRCGTQLDHGVVAVGYG-TENGKDYWIVRNSWGPNWGEAGYLRM 343
Query: 331 VRD----EGLCGIGTRSSYP 346
R+ G CGI SSYP
Sbjct: 344 ERNINVTSGKCGIAMMSSYP 363
>gi|242071345|ref|XP_002450949.1| hypothetical protein SORBIDRAFT_05g021550 [Sorghum bicolor]
gi|241936792|gb|EES09937.1| hypothetical protein SORBIDRAFT_05g021550 [Sorghum bicolor]
Length = 371
Score = 293 bits (749), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 145/321 (45%), Positives = 204/321 (63%), Gaps = 16/321 (4%)
Query: 40 EQSVVEIHEKW----MAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQ 95
E+S+ ++E+W M +++ +K +FKEN+ YI +ANK+G R+++L N+
Sbjct: 35 EESLRALYEQWRSHYMVSRPAGLQEQDDKARWFNVFKENVRYIHEANKKG-RSFRLALNK 93
Query: 96 FSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT-----DVPTSLDWRDKGAVTP 150
F+D+T DEFR Y + HR+ +S ++ + S ++P ++DWR +GAVT
Sbjct: 94 FADMTTDEFRRAYAAGSR-TRHHRALSSGIRRHGDGSFMYAQAGNLPLAVDWRQRGAVTG 152
Query: 151 IKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAY 210
IK+Q +CG CWAF+ +AAVEGI KIR+G L+ LSEQ+L+DC N GC GG + AF Y
Sbjct: 153 IKDQGQCGSCWAFSTIAAVEGINKIRTGKLVSLSEQELVDCDDVDNQGCNGGLMDYAFQY 212
Query: 211 IIQNQGIATEDEYPYQAVPGTCS-AAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIA 269
I +N GI TE YPY A +C+ A ++ I YE+VP+ +E AL KAV+ QPVSIA
Sbjct: 213 IKRNGGITTESNYPYLAEQRSCNKAKERSHDVTIDGYEDVPANNEDALQKAVANQPVSIA 272
Query: 270 IAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMK 329
I A +FQ Y EG+F G CGT+LDH V VG+G T DG YW++KNSWG WG+ GY++
Sbjct: 273 IEASGQDFQFYSEGVFTGSCGTELDHGVAAVGYGITRDGTKYWIVKNSWGEDWGERGYIR 332
Query: 330 IVR----DEGLCGIGTRSSYP 346
+ R +GLCGI SYP
Sbjct: 333 MQRGISDSQGLCGIAMEPSYP 353
>gi|374713651|gb|AEZ65083.1| cysteine protease [Carica papaya]
Length = 467
Score = 293 bits (749), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 150/348 (43%), Positives = 216/348 (62%), Gaps = 21/348 (6%)
Query: 18 MFIIITLLVSCASQ--VVSSRSTH--------EQSVVEIHEKWMAQHGRSYKDELEKEMR 67
+F+++ S A +VS H + V+ ++E W+ +HG++Y EKE R
Sbjct: 10 LFLLMIFTASSAVDMSIVSYDQRHADKSSWRTDDEVMAMYEAWLVKHGKAYNALGEKEKR 69
Query: 68 LKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSH--RSTTSST 125
IFK+NL +I++ N + N TY+LG N+F+DLTN+E+R++Y G K P + R + +
Sbjct: 70 FGIFKDNLRFIDEHNSQ-NLTYRLGLNRFADLTNEEYRSMYLGVK-PGATRVTRKVSRKS 127
Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
++ +P +DWR +GAV +K+Q CG CWAF+ +AAVEGI +I +G+LI LSE
Sbjct: 128 DRFAARVGDALPDFIDWRKEGAVVGVKDQGSCGSCWAFSTIAAVEGINQIVTGDLISLSE 187
Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKIS 244
Q+L+DC T+ N GC GG + AF +II N GI +E++YPY+A C +K A I
Sbjct: 188 QELVDCDTSYNEGCNGGLMDYAFEFIINNGGIDSEEDYPYRAADQKCDQYRKNANVVSID 247
Query: 245 NYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGT 304
YE+VP DE AL KAV+ QPVS+AI A FQ Y+ G+F G CGT LDH V VG+G
Sbjct: 248 GYEDVPENDEAALKKAVAKQPVSVAIEAGGRAFQLYQSGVFTGKCGTSLDHGVAAVGYG- 306
Query: 305 TEDGANYWLIKNSWGNTWGDAGYMKIVRD-----EGLCGIGTRSSYPL 347
TE+G +YW++ NSWG WG+ GY+++ R+ G CGI SYP+
Sbjct: 307 TENGQDYWIVGNSWGKNWGEDGYIRMERNLAGSSSGKCGIAIGPSYPI 354
>gi|374530932|gb|AEP83812.2| cysteine endopeptidase EP8 [Secale cereale x Triticum durum]
Length = 364
Score = 293 bits (749), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 146/317 (46%), Positives = 203/317 (64%), Gaps = 14/317 (4%)
Query: 40 EQSVVEIHEKWMAQHGRSYKD--ELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
E+++ ++E+W + + S + +E R +FKEN YI + NK+ +R ++L N+F+
Sbjct: 33 EENLRGLYERWRSHYTVSRRGLGADAEERRFNVFKENARYIHEGNKK-DRPFRLALNKFA 91
Query: 98 DLTNDEFRALYTGYKMP---SPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQ 154
D+T DEFR Y G ++ S S +F+Y + ++P ++DWR KGAVT IK+Q
Sbjct: 92 DMTTDEFRRTYAGSRVRHHLSLSGGRRGDGSFRYGDAD--NLPPAVDWRQKGAVTAIKDQ 149
Query: 155 KECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQN 214
+CG CWAF+ + AVEGI KIR+G L+ LSEQ+L+DC N GC GG + AF +I +N
Sbjct: 150 GQCGSCWAFSTIVAVEGINKIRTGKLVSLSEQELMDCDNVNNQGCDGGLMDYAFQFIHKN 209
Query: 215 QGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAY 273
GI TE YPYQ G+C A++ A A I YE+VP+ DE AL KAV+ QPVS+AI A
Sbjct: 210 -GITTESNYPYQGEQGSCDLAKEKAHAVTIDGYEDVPANDESALQKAVAGQPVSVAIDAS 268
Query: 274 STEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD 333
+FQ Y EG+F G C T LDH V VG+GTT DG YW++KNSWG WG+ GY+++ R
Sbjct: 269 GNDFQFYSEGVFTGECSTDLDHGVAAVGYGTTRDGTKYWIVKNSWGEDWGEKGYIRMQRG 328
Query: 334 ----EGLCGIGTRSSYP 346
EG CGI ++SYP
Sbjct: 329 VSQAEGQCGIAMQASYP 345
>gi|414879123|tpg|DAA56254.1| TPA: hypothetical protein ZEAMMB73_708930 [Zea mays]
Length = 368
Score = 293 bits (749), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 141/316 (44%), Positives = 197/316 (62%), Gaps = 12/316 (3%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
++++ +++E+W H R ++ EK R FKEN +I NK G+R Y+L N+F D+
Sbjct: 35 DEALWDLYERWQTHH-RVHRHHGEKGRRFGTFKENARFIHAHNKRGDRPYRLRLNRFGDM 93
Query: 100 TNDEFRALYTGYKMPSPSHRSTTSST---FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKE 156
+EFR+ + ++ T + F Y + TD+P S+DWR KGAVT +KNQ
Sbjct: 94 GREEFRSGFADSRINDLRREPTAAPAVPGFMYDD--ATDLPRSVDWRQKGAVTAVKNQGR 151
Query: 157 CGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQG 216
CG CWAF+ V AVEGI IR+G+L+ LSEQ+L+DC T+ NGC GG E AF +I + G
Sbjct: 152 CGSCWAFSTVVAVEGINAIRTGSLVSLSEQELIDCDTD-ENGCQGGLMENAFEFIKSHGG 210
Query: 217 IATEDEYPYQAVPGTCSAAQ--KPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYS 274
I TE YPY A GTC A+ + I ++ VP+G E AL KAV+ QPVS+AI A
Sbjct: 211 ITTESAYPYHASNGTCDGARARRGRVVAIDGHQAVPAGSEDALAKAVAHQPVSVAIDAGG 270
Query: 275 TEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR-- 332
Q Y EG+F G CGT LDH V VG+G ++DG YW++KNSWG +WG+ GY+++ R
Sbjct: 271 QALQFYSEGVFTGDCGTDLDHGVAAVGYGVSDDGTPYWIVKNSWGPSWGEGGYIRMQRGT 330
Query: 333 -DEGLCGIGTRSSYPL 347
+ GLCGI +S+P+
Sbjct: 331 GNGGLCGIAMEASFPI 346
>gi|224065647|ref|XP_002301901.1| predicted protein [Populus trichocarpa]
gi|222843627|gb|EEE81174.1| predicted protein [Populus trichocarpa]
Length = 336
Score = 293 bits (749), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 146/314 (46%), Positives = 209/314 (66%), Gaps = 10/314 (3%)
Query: 38 THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
T ++++ E W+++HG+ Y+ EK +R +IFK+NL +I++ NK+ Y LG N+FS
Sbjct: 24 TSGDKIIDLFESWISKHGKIYESIEEKWLRFEIFKDNLFHIDETNKK-VVNYWLGLNEFS 82
Query: 98 DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKEC 157
DL+++EF+ Y G K+ S R S F Y+++ +P S+DWR KGAVT +KNQ C
Sbjct: 83 DLSHEEFKNKYLGLKV-DMSERRECSQEFNYKDV--MSIPKSVDWRKKGAVTDVKNQGSC 139
Query: 158 GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
G CWAF+ VAAVEGI +I +GNL LSEQ+L+DC T N GC GG + AF+YII N G+
Sbjct: 140 GSCWAFSTVAAVEGINQIVTGNLTSLSEQELVDCDTTNNYGCNGGLMDYAFSYIISNGGL 199
Query: 218 ATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTE 276
E +YPY GTC ++ + IS Y +VP E++LLKA++ QP+S+AI A +
Sbjct: 200 HKEVDYPYIMEEGTCEMRKEESEVVTISGYHDVPQNSEESLLKALANQPLSVAIEASGRD 259
Query: 277 FQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD--- 333
FQ Y G+F+G CGTQLDH V VG+G+T +G +Y ++KNSWG+ WG+ GY+++ R+
Sbjct: 260 FQFYSGGVFDGHCGTQLDHGVAAVGYGST-NGLDYIIVKNSWGSKWGEKGYIRMKRNTGK 318
Query: 334 -EGLCGIGTRSSYP 346
GLCGI +SYP
Sbjct: 319 PAGLCGINKMASYP 332
>gi|302763831|ref|XP_002965337.1| hypothetical protein SELMODRAFT_230602 [Selaginella moellendorffii]
gi|300167570|gb|EFJ34175.1| hypothetical protein SELMODRAFT_230602 [Selaginella moellendorffii]
Length = 343
Score = 293 bits (749), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 151/337 (44%), Positives = 212/337 (62%), Gaps = 17/337 (5%)
Query: 21 IITLLVSCASQVVSSR--STHEQSVVEIH---EKWMAQHGRSYKDELEKEMRLKIFKENL 75
+I L+V A+ +R + + +EI E W A+HG+SY +LEK RL IF + L
Sbjct: 10 LILLVVVGATPFAIARPAALEDGRALEIKNMFEDWAAKHGKSYSSDLEKARRLMIFSDTL 69
Query: 76 EYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTG-YKMPSPSHRSTTSSTFKYQNLSMT 134
YIEK N + N T+ LG N+FSDLTN EFRA++ G +K P R +++ ++
Sbjct: 70 AYIEKHNAQPNTTFTLGLNKFSDLTNAEFRAMHVGKFKRPRYQDRLPAED----EDVDVS 125
Query: 135 DVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN 194
+PTSLDWR KGAVTPIK+Q +CG CWAF+A+A++E + + L+ LSEQQL+DC T
Sbjct: 126 SLPTSLDWRQKGAVTPIKDQGDCGSCWAFSAIASIESAHFLATKELVSLSEQQLMDCDTV 185
Query: 195 GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSA---AQKPAAAKISNYEEVPS 251
+ GC GG E AF ++++N G+ TE YPY G+C+A A A+I+ ++ V
Sbjct: 186 -DAGCDGGLMETAFKFVVKNGGVTTEASYPYTGSVGSCNANKVAIINKVAEITGFKVVTE 244
Query: 252 GDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANY 311
AL+KAVS PV+++I FQ+YK GI +G CG LDH V ++G+G TE G Y
Sbjct: 245 DSADALMKAVSKTPVTVSICGSDENFQNYKSGILSGQCGDSLDHGVLLIGYG-TEGGMPY 303
Query: 312 WLIKNSWGNTWGDAGYMKIVRD--EGLCGIGTRSSYP 346
W+IKNSWG +WG+ G+MKI R +G+CG+ SSYP
Sbjct: 304 WIIKNSWGTSWGEDGFMKIERKDGDGICGMNGDSSYP 340
>gi|156142226|gb|ABU51882.1| ervatamin-C precursor [Tabernaemontana divaricata]
Length = 365
Score = 292 bits (748), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 151/316 (47%), Positives = 208/316 (65%), Gaps = 12/316 (3%)
Query: 34 SSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGT 93
SS ++ V EI+E W+A+H + Y +E E R +IFK+NL++I++ N E N TYK+G
Sbjct: 32 SSAWRTDEEVKEIYELWLAKHDKVYSGLVEYEKRFEIFKDNLKFIDEHNSE-NHTYKMGL 90
Query: 94 NQFSDLTNDEFRALYTGYKMPSPSH-RSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIK 152
++DLTN+EF+A+Y G + + + T + + +Y + ++P +DWR KGAVTP+K
Sbjct: 91 TPYTDLTNEEFQAIYLGTRSDTIHRLKRTINISERYAYEAGDNLPEQIDWRKKGAVTPVK 150
Query: 153 NQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYII 212
NQ +CG CWAF+ V+ VE I +IR+GNLI LSEQQL+DC+ N+GC GG+ A+ YII
Sbjct: 151 NQGKCGSCWAFSTVSTVESINQIRTGNLISLSEQQLVDCNKK-NHGCKGGAFVYAYQYII 209
Query: 213 QNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAA 272
N GI TE YPY+AV G C AA+K +I Y+ VP +E AL KAV+ QP +AI A
Sbjct: 210 DNGGIDTEANYPYKAVQGPCRAAKK--VVRIDGYKGVPHCNENALKKAVASQPSVVAIDA 267
Query: 273 YSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGY--MKI 330
S +FQ YK GIF+G CGT+L+H V IVG+ +YW+++NSWG WG+ GY MK
Sbjct: 268 SSKQFQHYKSGIFSGPCGTKLNHGVVIVGYWK-----DYWIVRNSWGRYWGEQGYIRMKR 322
Query: 331 VRDEGLCGIGTRSSYP 346
V GLCGI YP
Sbjct: 323 VGGCGLCGIARLPYYP 338
>gi|160858205|dbj|BAF93840.1| triticain beta 2 [Triticum aestivum]
Length = 469
Score = 292 bits (748), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 145/320 (45%), Positives = 209/320 (65%), Gaps = 16/320 (5%)
Query: 40 EQSVVEIHEKWMAQHGR-SYKDE---LEKEMRLKIFKENLEYIEKANKE---GNRTYKLG 92
E +++ W+A+HG SY + E+E R + F +NL +++ N G ++L
Sbjct: 43 EAEARAVYDLWLAEHGGGSYPNANSIPERERRFRAFWDNLRFVDAHNARAAAGEEGFRLA 102
Query: 93 TNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIK 152
N+F+DLTNDEFRA Y G K R +Y++ ++P ++DWR+KGAV P+K
Sbjct: 103 MNRFADLTNDEFRAAYLGVK--GQRARPGRVVGERYRHDGAEELPEAVDWREKGAVAPVK 160
Query: 153 NQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFAYI 211
NQ +CG CWAF+A++ VE I +I +G ++ LSEQ+L++C TNG ++GC GG + AF +I
Sbjct: 161 NQGQCGSCWAFSAISTVESINQIVTGEMVTLSEQELVECDTNGQSSGCNGGLMDDAFEFI 220
Query: 212 IQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAI 270
I+N GI TED+YPY+A+ G C +K A I +E+VP DE++L KAV+ QPVS+AI
Sbjct: 221 IKNGGIDTEDDYPYKAIDGRCDVLRKNAKVVSIDGFEDVPENDEKSLQKAVAHQPVSVAI 280
Query: 271 AAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKI 330
A EFQ Y G+F+G CGTQLDH V VG+G TE+G +YW+++NSWG WG+AGY+++
Sbjct: 281 EAGGREFQLYHSGVFSGRCGTQLDHGVVAVGYG-TENGKDYWIVRNSWGPNWGEAGYLRM 339
Query: 331 VRD----EGLCGIGTRSSYP 346
R+ G CGI SSYP
Sbjct: 340 ERNINVTSGKCGIAMMSSYP 359
>gi|449522968|ref|XP_004168497.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
Length = 348
Score = 292 bits (748), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 143/314 (45%), Positives = 211/314 (67%), Gaps = 10/314 (3%)
Query: 38 THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
T ++E+ E+W++ HG+ Y+ EK R ++FK+NL++I++ NK+ +Y LG N+F+
Sbjct: 36 TSMDRLIELFEEWISNHGKIYETIEEKWHRFEVFKDNLKHIDETNKKVT-SYWLGVNEFA 94
Query: 98 DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKEC 157
DLT+ EF+ +Y G K+ S R + F Y+++ D+P S+DWR KGAVT +KNQ C
Sbjct: 95 DLTHQEFKNMYLGLKVESSRTRQSPEE-FTYKDV--VDLPKSVDWRKKGAVTRVKNQGSC 151
Query: 158 GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
G CWAF+ VAAVEGI KI GNL LSEQ+L+DC NNGC GG + AF++I+ + G+
Sbjct: 152 GSCWAFSTVAAVEGINKIVGGNLTSLSEQELIDCDRPYNNGCHGGLMDYAFSFIVSSGGL 211
Query: 218 ATEDEYPYQAVPGTCSAAQ-KPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTE 276
E++YPY V TC + + IS Y++VP +E +L+KA++ QP+S+AI A +
Sbjct: 212 HKEEDYPYLEVESTCDNKKGELEVVTISGYKDVPENNEASLIKALAHQPLSVAIEASGRD 271
Query: 277 FQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD--- 333
FQ Y G+F+G CGTQLDH VT VG+G+++ G +Y ++KNSWG WG+ GY+++ R+
Sbjct: 272 FQFYSGGVFDGPCGTQLDHGVTAVGYGSSK-GVDYIIVKNSWGPKWGEKGYIRMKRNTGK 330
Query: 334 -EGLCGIGTRSSYP 346
GLCGI +SYP
Sbjct: 331 PAGLCGINKMASYP 344
>gi|30141025|dbj|BAC75926.1| cysteine protease-4 [Helianthus annuus]
Length = 352
Score = 292 bits (747), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 144/349 (41%), Positives = 217/349 (62%), Gaps = 18/349 (5%)
Query: 11 FKINTTPMFIIITLLVSCASQV--------VSSRSTHEQSVVEIHEKWMAQHGRSYKDEL 62
F T +F++ +++C++ T V+ + E W+A+H + Y+
Sbjct: 5 FSSKKTSLFLVFVSVLACSALANEFSILGYAPEDLTSIHKVIHLFESWLAKHSKIYESLD 64
Query: 63 EKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTT 122
EK R +IF +NL++I+ NK+ + Y LG N+F+DLT++EF+ + G K P + +
Sbjct: 65 EKLHRFEIFMDNLKHIDDTNKKVS-NYWLGLNEFADLTHEEFKNKFLGLKGELPERKDES 123
Query: 123 SSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQ 182
F Y++ D+P S+DWR KGAV P+KNQ +CG CWAF+ VAAVEGI +I +GNL
Sbjct: 124 IEEFSYRDF--VDLPKSVDWRKKGAVAPVKNQGQCGSCWAFSTVAAVEGINQIVTGNLTM 181
Query: 183 LSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AA 241
LSEQ+L+DC T NNGC GG + AFAY++++ G+ E+EYPY GTC + +
Sbjct: 182 LSEQELIDCDTTFNNGCNGGLMDYAFAYVMRS-GLHKEEEYPYIMSEGTCDEKKDVSETV 240
Query: 242 KISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVG 301
IS Y +VP +E + LKA++ QP+S+AI A +FQ Y G+F+G CGT+LDH V VG
Sbjct: 241 TISGYHDVPRNNEDSFLKALANQPISVAIEASGRDFQFYSGGVFDGHCGTELDHGVAAVG 300
Query: 302 FGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
+GTT+ G +Y +++NSWG WG+ GY+++ R G+CG+ +SYP
Sbjct: 301 YGTTK-GLDYVIVRNSWGPKWGEKGYIRMKRKTGKPHGMCGLYMMASYP 348
>gi|1169186|sp|P43156.1|CYSP_HEMSP RecName: Full=Thiol protease SEN102; Flags: Precursor
gi|396568|emb|CAA52425.1| thiol-protease [Hemerocallis hybrid cultivar]
Length = 360
Score = 292 bits (747), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 150/349 (42%), Positives = 219/349 (62%), Gaps = 25/349 (7%)
Query: 17 PMFIIITLL----VSCASQVVSSRS--THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKI 70
P FI + L+ +S A + + E S+ ++EKW H + +D EK R +
Sbjct: 4 PKFIALALVALSFLSIAQSIPFTEKDLASEDSLWNLYEKWRTHHTVA-RDLDEKNRRFNV 62
Query: 71 FKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRS-----TTSST 125
FKEN+++I + N++ + YKL N+F D+TN EFR+ Y G K+ HRS + +
Sbjct: 63 FKENVKFIHEFNQKKDAPYKLALNKFGDMTNQEFRSKYAGSKIQH--HRSQRGIQKNTGS 120
Query: 126 FKYQNLSMTDVPT-SLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLS 184
F Y+N+ +P S+DWR KGAVT +K+Q +CG CWAF+ +A+VEGI +I++G L+ LS
Sbjct: 121 FMYENVG--SLPAASIDWRAKGAVTGVKDQGQCGSCWAFSTIASVEGINQIKTGELVSLS 178
Query: 185 EQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSA--AQKPAAAK 242
EQ+L+DC T+ N GC GG + AF +I Q GI TED YPY GTC++ P +
Sbjct: 179 EQELVDCDTSYNEGCNGGLMDYAFEFI-QKNGITTEDSYPYAEQDGTCASNLLNSPVVS- 236
Query: 243 ISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGF 302
I +++VP+ +E AL++AV+ QP+S++I A FQ Y EG+F G CGT+LDH V IVG+
Sbjct: 237 IDGHQDVPANNENALMQAVANQPISVSIEASGYGFQFYSEGVFTGRCGTELDHGVAIVGY 296
Query: 303 GTTEDGANYWLIKNSWGNTWGDAGYMKIVR----DEGLCGIGTRSSYPL 347
G T DG YW++KNSWG WG++GY+++ R G CGI +SYP+
Sbjct: 297 GATRDGTKYWIVKNSWGEEWGESGYIRMQRGISDKRGKCGIAMEASYPI 345
>gi|1174171|gb|AAB41816.1| NTH1 [Pisum sativum]
Length = 367
Score = 292 bits (747), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 148/335 (44%), Positives = 213/335 (63%), Gaps = 14/335 (4%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
+F +ITL S + + S RS E V+ ++EKW+ +H + Y EK R +IFK+NL +
Sbjct: 10 LFGLITL--SLSLDMSSGRSNKE--VMTMYEKWLVKHQKVYYGLGEKNQRFQIFKDNLIF 65
Query: 78 IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
I++ N N +Y++G N+FSD+TN E+R Y + TS + Y+ +P
Sbjct: 66 IDEHNAP-NHSYRVGLNEFSDITNKEYRDTYLSRWSNNNIKNKITSVRYAYKAGHNNKLP 124
Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNN 197
S+DWR GA+TPIKNQ CG CWAF+AVAAVE I KI +G+L+ LSEQ+L+DC N
Sbjct: 125 VSVDWR--GALTPIKNQGSCGACWAFSAVAAVEAINKIVTGSLVSLSEQELVDCDRTKNK 182
Query: 198 GCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQK-PAAAKISNYEEVPSGDEQA 256
GC GG++ A+ +I++N G+ ++ +YPY TC+ A+K I+ Y+ V E A
Sbjct: 183 GCNGGNQVNAYRFIVENGGLDSQIDYPYLGRQSTCNQAKKNTKVVSINGYKNVQRNSESA 242
Query: 257 LLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKN 316
L++AV+ QPVS+ I AY +FQ Y+ G+F G CGT LDHAV +VG+G +E+G +YWL+KN
Sbjct: 243 LMEAVANQPVSVGIEAYGKDFQLYQSGVFTGSCGTSLDHAVVVVGYG-SENGKDYWLVKN 301
Query: 317 SWGNTWGDAGYMKIVR-----DEGLCGIGTRSSYP 346
SWG WG+ GY+KI R + G CGI ++YP
Sbjct: 302 SWGTNWGERGYLKIERNLKNTNTGKCGIAMDATYP 336
>gi|356517188|ref|XP_003527271.1| PREDICTED: xylem cysteine proteinase 2-like [Glycine max]
Length = 350
Score = 292 bits (747), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 142/309 (45%), Positives = 209/309 (67%), Gaps = 11/309 (3%)
Query: 43 VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTND 102
++E+ E WM++HG+ Y++ EK +R +IFK+NL++I++ NK + Y LG N+F+DL++
Sbjct: 44 LIELFESWMSRHGKIYENIEEKLLRFEIFKDNLKHIDERNKVVS-NYWLGLNEFADLSHR 102
Query: 103 EFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWA 162
EF Y G K+ S R + F Y+++ ++P S+DWR KGAV P+KNQ CG CWA
Sbjct: 103 EFNNKYLGLKVDY-SRRRESPEEFTYKDV---ELPKSVDWRKKGAVAPVKNQGSCGSCWA 158
Query: 163 FAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDE 222
F+ VAAVEGI +I +GNL LSEQ+L+DC NNGC GG + AF++I++N G+ E++
Sbjct: 159 FSTVAAVEGINQIVTGNLTSLSEQELIDCDRTYNNGCNGGLMDYAFSFIVENGGLHKEED 218
Query: 223 YPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYK 281
YPY GTC ++ IS Y +VP +EQ+LLKA++ QP+S+AI A +FQ Y
Sbjct: 219 YPYIMEEGTCEMTKEETQVVTISGYHDVPQNNEQSLLKALANQPLSVAIEASGRDFQFYS 278
Query: 282 EGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLC 337
G+F+G CG+ LDH V VG+GT + G +Y +KNSWG+ WG+ GY+++ R+ EG+C
Sbjct: 279 GGVFDGHCGSDLDHGVAAVGYGTAK-GVDYITVKNSWGSKWGEKGYIRMRRNIGKPEGIC 337
Query: 338 GIGTRSSYP 346
GI +SYP
Sbjct: 338 GIYKMASYP 346
>gi|449455625|ref|XP_004145553.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
Length = 351
Score = 292 bits (747), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 143/314 (45%), Positives = 211/314 (67%), Gaps = 10/314 (3%)
Query: 38 THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
T ++E+ E+W++ HG+ Y+ EK R ++FK+NL++I++ NK+ +Y LG N+F+
Sbjct: 39 TSMDRLIELFEEWISNHGKIYETIEEKWHRFEVFKDNLKHIDETNKKVT-SYWLGVNEFA 97
Query: 98 DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKEC 157
DLT+ EF+ +Y G K+ S R + F Y+++ D+P S+DWR KGAVT +KNQ C
Sbjct: 98 DLTHQEFKNMYLGLKVESSRTRQSPEE-FTYKDV--VDLPKSVDWRKKGAVTRVKNQGSC 154
Query: 158 GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
G CWAF+ VAAVEGI KI GNL LSEQ+L+DC NNGC GG + AF++I+ + G+
Sbjct: 155 GSCWAFSTVAAVEGINKIVGGNLTSLSEQELIDCDRPYNNGCHGGLMDYAFSFIVSSGGL 214
Query: 218 ATEDEYPYQAVPGTCSAAQ-KPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTE 276
E++YPY V TC + + IS Y++VP +E +L+KA++ QP+S+AI A +
Sbjct: 215 HKEEDYPYLEVESTCDNKKGELEVVTISGYKDVPENNEASLIKALAHQPLSVAIEASGRD 274
Query: 277 FQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD--- 333
FQ Y G+F+G CGTQLDH VT VG+G+++ G +Y ++KNSWG WG+ GY+++ R+
Sbjct: 275 FQFYSGGVFDGPCGTQLDHGVTAVGYGSSK-GVDYIIVKNSWGPKWGEKGYIRMKRNTGK 333
Query: 334 -EGLCGIGTRSSYP 346
GLCGI +SYP
Sbjct: 334 PAGLCGINKMASYP 347
>gi|357507505|ref|XP_003624041.1| Cysteine proteinase [Medicago truncatula]
gi|355499056|gb|AES80259.1| Cysteine proteinase [Medicago truncatula]
Length = 342
Score = 291 bits (746), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 154/316 (48%), Positives = 205/316 (64%), Gaps = 25/316 (7%)
Query: 42 SVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL-- 99
S+ E E W ++G YKD E++ +IFK N+ YI+ N GN+ YKL N+F D
Sbjct: 37 SLSERFEYWKTKYGVVYKDVAEQKKHFQIFKHNVAYIDYFNAAGNKPYKLAINRFVDKPI 96
Query: 100 --TNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKEC 157
++D F + +T ++TFKY+N+ TD+P ++DWR +GAVTPIKNQ +C
Sbjct: 97 EDSDDGFER----------TTTTTPTTTFKYENV--TDIPATVDWRKRGAVTPIKNQGKC 144
Query: 158 GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGN-NGCLGGSREKAFAYIIQNQG 216
G CWAF+AVAA+EGI KI SGNL+ LSEQQL+DC +G GC G+ AF +I++N G
Sbjct: 145 GSCWAFSAVAAIEGIQKITSGNLVSLSEQQLVDCDRSGRTKGCDNGNMINAFKFILENGG 204
Query: 217 IATEDEYPYQ-AVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYST 275
IATE YPY+ V GTC +I +YEEVPS E +LLKAV+ QPVS+ I
Sbjct: 205 IATEANYPYKRVVKGTCKKVSH--KVQIKSYEEVPSNSEDSLLKAVANQPVSVGIDMRGM 262
Query: 276 EFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-- 333
F+ Y GIF G CGT+ +HA+TIVG+GT++DG YWL+KNSW WG+ GY++I RD
Sbjct: 263 -FKFYSSGIFTGECGTKPNHALTIVGYGTSKDGIKYWLVKNSWSKRWGEKGYIRIKRDID 321
Query: 334 --EGLCGIGTRSSYPL 347
EGLCGI + SYP+
Sbjct: 322 AKEGLCGIAMKPSYPI 337
>gi|111073717|dbj|BAF02547.1| triticain beta [Triticum aestivum]
Length = 472
Score = 291 bits (746), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 144/320 (45%), Positives = 208/320 (65%), Gaps = 16/320 (5%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDEL----EKEMRLKIFKENLEYIEKANKE---GNRTYKLG 92
E +++ W+A++G E+E R + F +NL +++ N G Y+LG
Sbjct: 46 EAEARAVYDLWLAENGGGSSPNANSIPERERRFRAFWDNLNFVDAHNARAAAGEEGYRLG 105
Query: 93 TNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIK 152
N+F+DLTNDEFRA Y G K + R +Y++ ++P ++DWR+KGAV P+K
Sbjct: 106 MNRFADLTNDEFRAAYLGVK--AQRARPGRMVGERYRHDGAEELPEAVDWREKGAVAPVK 163
Query: 153 NQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFAYI 211
NQ +CG CWAF+AV+ VE I +I +G ++ LSEQ+L++C TNG ++GC GG + AF +I
Sbjct: 164 NQGQCGSCWAFSAVSTVESINQIVTGEMVTLSEQELVECDTNGQSSGCNGGLMDDAFEFI 223
Query: 212 IQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAI 270
I+N GI TED+YPY+A+ G C +K A I +E+VP DE++L KAV+ QPVS+AI
Sbjct: 224 IKNGGIDTEDDYPYKAIDGRCDVLRKNAKVVSIDGFEDVPENDEKSLQKAVAHQPVSVAI 283
Query: 271 AAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKI 330
A EFQ Y G+F+G CGTQLDH V VG+G TE+G +YW+++NSWG WG++GY+++
Sbjct: 284 EAGGREFQLYHSGVFSGRCGTQLDHGVVAVGYG-TENGKDYWIVRNSWGPNWGESGYLRM 342
Query: 331 VRD----EGLCGIGTRSSYP 346
R+ G CGI SSYP
Sbjct: 343 ERNINVTSGKCGIAMMSSYP 362
>gi|356508487|ref|XP_003522988.1| PREDICTED: xylem cysteine proteinase 2-like [Glycine max]
Length = 349
Score = 291 bits (746), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 143/309 (46%), Positives = 207/309 (66%), Gaps = 11/309 (3%)
Query: 43 VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTND 102
++E+ E WM++HG+ Y+ EK R IFK+NL++I++ NK + Y LG N+F+DL++
Sbjct: 43 LIELFESWMSRHGKIYQSIEEKLHRFDIFKDNLKHIDERNKVVS-NYWLGLNEFADLSHQ 101
Query: 103 EFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWA 162
EF+ Y G K+ S R + F Y++ ++P S+DWR KGAVT +KNQ CG CWA
Sbjct: 102 EFKNKYLGLKVDY-SRRRESPEEFTYKDF---ELPKSVDWRKKGAVTQVKNQGSCGSCWA 157
Query: 163 FAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDE 222
F+ VAAVEGI +I +GNL LSEQ+L+DC NNGC GG + AF++I++N G+ E++
Sbjct: 158 FSTVAAVEGINQIVTGNLTSLSEQELIDCDRTYNNGCNGGLMDYAFSFIVENGGLHKEED 217
Query: 223 YPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYK 281
YPY GTC ++ IS Y +VP +EQ+LLKA+ QP+S+AI A +FQ Y
Sbjct: 218 YPYIMEEGTCEMTKEETEVVTISGYHDVPQNNEQSLLKALVNQPLSVAIEASGRDFQFYS 277
Query: 282 EGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLC 337
G+F+G CG+ LDH V VG+GT++ G NY ++KNSWG+ WG+ GY+++ R+ EG+C
Sbjct: 278 GGVFDGHCGSDLDHGVAAVGYGTSK-GVNYIIVKNSWGSKWGEKGYIRMRRNIGKPEGIC 336
Query: 338 GIGTRSSYP 346
GI +SYP
Sbjct: 337 GIYKMASYP 345
>gi|195637152|gb|ACG38044.1| vignain precursor [Zea mays]
Length = 377
Score = 291 bits (746), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 142/292 (48%), Positives = 187/292 (64%), Gaps = 18/292 (6%)
Query: 68 LKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHR-------- 119
+FK N+ I + N+ + YKL N+F D+T DEFR Y G ++ HR
Sbjct: 70 FNVFKANVRLIHEFNRR-DEPYKLRLNRFGDMTADEFRRHYAGSRVAH--HRMFRGDRQG 126
Query: 120 STTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGN 179
S+ S++F Y + DVP S+DWR KGAVT +K+Q +CG CWAF+ +AAVEGI I++ N
Sbjct: 127 SSASASFMYAD--ARDVPASVDWRQKGAVTDVKDQGQCGSCWAFSTIAAVEGINAIKTKN 184
Query: 180 LIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA 239
L LSEQQL+DC T N GC GG + AF YI ++ G+A ED YPY+A +C + P
Sbjct: 185 LTSLSEQQLVDCDTKANAGCNGGLMDYAFQYIAKHGGVAAEDAYPYRARQASCKKSPAPV 244
Query: 240 AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTI 299
I YE+VP+ DE AL KAV+ QPVS+AI A + FQ Y EG+F+G CGT+LDH V
Sbjct: 245 VT-IDGYEDVPANDESALKKAVAHQPVSVAIEASGSHFQFYSEGVFSGRCGTELDHGVAA 303
Query: 300 VGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
VG+G T DG YWL+KNSWG WG+ GY+++ RD EG CGI +SYP+
Sbjct: 304 VGYGVTADGTKYWLVKNSWGPEWGEKGYIRMARDVAAKEGHCGIAMEASYPV 355
>gi|37780049|gb|AAP32197.1| cysteine protease 10 [Trifolium repens]
Length = 272
Score = 291 bits (746), Expect = 3e-76, Method: Compositional matrix adjust.
Identities = 142/271 (52%), Positives = 191/271 (70%), Gaps = 13/271 (4%)
Query: 86 NRTYKLGTNQFSDLTNDEFRALYTGYK--MPSPSHRSTTSSTFKYQNLSMTDVPTSLDWR 143
N+ YKLG N+F+DLTN+EF+A +K M S R+TT FKY+N S +P+++DWR
Sbjct: 7 NKLYKLGINKFADLTNEEFKASRNKFKGHMCSSIIRTTT---FKYENASA--IPSTVDWR 61
Query: 144 DKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGG 202
KGAVTP+KNQ +CG CWAF+AVAA EGI ++ +G L+ LSEQ+L+DC T G + GC GG
Sbjct: 62 KKGAVTPVKNQGQCGSCWAFSAVAATEGIHQLSTGKLVSLSEQELIDCDTKGVDQGCEGG 121
Query: 203 SREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAV 261
+ AF +IIQN G++TE +YPY+ V GTC+ + A I+ YE+VP+ +E AL KAV
Sbjct: 122 LMDDAFKFIIQNHGLSTEVQYPYEGVDGTCNTNEASIHAVTITGYEDVPANNELALQKAV 181
Query: 262 SMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNT 321
+ QP+S+AI A ++FQ Y G+F G CGT+LDH VT VG+G DG YWL+KNSWG
Sbjct: 182 ANQPISVAIDASGSDFQFYNSGVFTGSCGTELDHGVTAVGYGVGNDGTKYWLVKNSWGAD 241
Query: 322 WGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
WG+ GY+++ R EGLCGI ++SYP A
Sbjct: 242 WGEEGYIRMQRGIDAAEGLCGIAMQASYPTA 272
>gi|224096714|ref|XP_002310708.1| predicted protein [Populus trichocarpa]
gi|222853611|gb|EEE91158.1| predicted protein [Populus trichocarpa]
Length = 356
Score = 291 bits (745), Expect = 3e-76, Method: Compositional matrix adjust.
Identities = 152/325 (46%), Positives = 207/325 (63%), Gaps = 18/325 (5%)
Query: 38 THEQS-------VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYK 90
+H+QS V+ I++ W+ +HG++Y EK R +IFK NL +I++ N + NRTYK
Sbjct: 12 SHDQSSWRSDDEVMSIYKWWLQKHGKAYNRLGEKAKRFEIFKNNLRFIDEHNSQ-NRTYK 70
Query: 91 LGTNQFSDLTNDEFRALYTGYKMPSPSHR--STTSSTFKYQNLSMTDVPTSLDWRDKGAV 148
+G +F+DLTN E+RA++ G + P R + + + +Y + +P S+DWR KGAV
Sbjct: 71 VGLTKFADLTNQEYRAMFLGTR-SDPKRRLMKSKNPSERYAYKAGDKLPESVDWRGKGAV 129
Query: 149 TPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAF 208
PIK+Q CG CWAF+ VAAVEGI +I +G LI LSEQ+L+DC N GC GG + AF
Sbjct: 130 NPIKDQGSCGSCWAFSTVAAVEGINQIVTGELISLSEQELVDCDRFYNAGCNGGLMDYAF 189
Query: 209 AYIIQNQGIATEDEYPYQAVPGTCSAAQ-KPAAAKISNYEEVPSGDEQALLKAVSMQPVS 267
+II N G+ TE +YPY TC + K A I +E+V DE+AL KAV+ QPVS
Sbjct: 190 QFIINNGGLDTEKDYPYLGNDDTCDRDKMKTKAVSIDGFEDVLPFDEKALQKAVAHQPVS 249
Query: 268 IAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGY 327
+AI A Q Y+ G+F G CGT LDH V +VG+G TE G +YWL++NSWG WG+ GY
Sbjct: 250 VAIEASGMALQFYQSGVFTGECGTALDHGVVVVGYG-TEKGLDYWLVRNSWGTEWGEHGY 308
Query: 328 MKI---VRD--EGLCGIGTRSSYPL 347
+K+ VRD G CGI SSYP+
Sbjct: 309 IKMQRNVRDTYTGRCGIAMESSYPV 333
>gi|22759715|dbj|BAC10906.1| cysteine proteinase [Zinnia elegans]
Length = 352
Score = 291 bits (744), Expect = 4e-76, Method: Compositional matrix adjust.
Identities = 144/344 (41%), Positives = 216/344 (62%), Gaps = 17/344 (4%)
Query: 15 TTPMFIIITLLVSCASQ-------VVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMR 67
T+ +F+ +++L A T V+ + E W+ +H + Y+ EK R
Sbjct: 10 TSLLFLFVSILACSALAHEFSILGYAPEDLTSIHKVIHLFESWLVKHSKFYESLDEKLHR 69
Query: 68 LKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFK 127
+IF +NL++I++ NK+ + Y LG N+F+DLT++EF+ + G+K + +S F
Sbjct: 70 FEIFMDNLKHIDETNKKVS-NYWLGLNEFADLTHEEFKHKFLGFKGELAERKDESSKEFG 128
Query: 128 YQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQ 187
Y++ D+P S+DWR KGAV P+KNQ +CG CWAF+ VAAVEGI +I +GNL LSEQ+
Sbjct: 129 YRDF--VDLPKSVDWRKKGAVAPVKNQGQCGSCWAFSTVAAVEGINQIVTGNLTMLSEQE 186
Query: 188 LLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNY 246
L+DC T NNGC GG + AFAY++++ G+ E+EYPY GTC + + IS Y
Sbjct: 187 LIDCDTTFNNGCNGGLMDYAFAYVMRS-GLHKEEEYPYIMSEGTCDEKKDVSEKVTISGY 245
Query: 247 EEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTE 306
+VP DE + LKA++ QP+S+AI A +FQ Y G+F+G CGT+LDH V VG+GTT+
Sbjct: 246 HDVPRNDEASFLKALANQPISVAIEASGRDFQFYSGGVFDGHCGTELDHGVAAVGYGTTK 305
Query: 307 DGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
G +Y +++NSWG WG+ GY+++ R G+CG+ +SYP
Sbjct: 306 -GLDYVIVRNSWGPKWGEKGYIRMKRGSGKPHGMCGLYMMASYP 348
>gi|18394919|ref|NP_564126.1| Xylem cysteine proteinase 2 [Arabidopsis thaliana]
gi|71153409|sp|Q9LM66.2|XCP2_ARATH RecName: Full=Xylem cysteine proteinase 2; Short=AtXCP2; Flags:
Precursor
gi|4836904|gb|AAD30607.1|AC007369_17 Putative cysteine proteinase [Arabidopsis thaliana]
gi|6708183|gb|AAF25832.1|AF191028_1 papain-type cysteine endopeptidase XCP2 [Arabidopsis thaliana]
gi|28466959|gb|AAO44088.1| At1g20850 [Arabidopsis thaliana]
gi|110743795|dbj|BAE99733.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|332191910|gb|AEE30031.1| Xylem cysteine proteinase 2 [Arabidopsis thaliana]
Length = 356
Score = 291 bits (744), Expect = 4e-76, Method: Compositional matrix adjust.
Identities = 140/315 (44%), Positives = 215/315 (68%), Gaps = 11/315 (3%)
Query: 38 THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
+H++ ++E+ E W++ ++Y+ EK +R ++FK+NL++I++ NK+G ++Y LG N+F+
Sbjct: 43 SHDK-LIELFENWISNFEKAYETVEEKFLRFEVFKDNLKHIDETNKKG-KSYWLGLNEFA 100
Query: 98 DLTNDEFRALYTGYKMPSPSHRSTTS-STFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKE 156
DL+++EF+ +Y G K S + F Y+++ VP S+DWR KGAV +KNQ
Sbjct: 101 DLSHEEFKKMYLGLKTDIVRRDEERSYAEFAYRDVEA--VPKSVDWRKKGAVAEVKNQGS 158
Query: 157 CGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQG 216
CG CWAF+ VAAVEGI KI +GNL LSEQ+L+DC T NNGC GG + AF YI++N G
Sbjct: 159 CGSCWAFSTVAAVEGINKIVTGNLTTLSEQELIDCDTTYNNGCNGGLMDYAFEYIVKNGG 218
Query: 217 IATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYST 275
+ E++YPY GTC + + I+ +++VP+ DE++LLKA++ QP+S+AI A
Sbjct: 219 LRKEEDYPYSMEEGTCEMQKDESETVTINGHQDVPTNDEKSLLKALAHQPLSVAIDASGR 278
Query: 276 EFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-- 333
EFQ Y G+F+G CG LDH V VG+G+++ G++Y ++KNSWG WG+ GY+++ R+
Sbjct: 279 EFQFYSGGVFDGRCGVDLDHGVAAVGYGSSK-GSDYIIVKNSWGPKWGEKGYIRLKRNTG 337
Query: 334 --EGLCGIGTRSSYP 346
EGLCGI +S+P
Sbjct: 338 KPEGLCGINKMASFP 352
>gi|388519351|gb|AFK47737.1| unknown [Medicago truncatula]
Length = 359
Score = 291 bits (744), Expect = 4e-76, Method: Compositional matrix adjust.
Identities = 149/346 (43%), Positives = 214/346 (61%), Gaps = 15/346 (4%)
Query: 10 SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
S I + F +ITL ++ + S RS E V+ ++E+W+ +H + Y EK+ R +
Sbjct: 3 SITITSLLFFSLITLSLAMDT---SMRSNEE--VMTMYEEWLVKHHKVYNGLGEKDQRFE 57
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSH--RSTTSSTFK 127
IFK+NL +I++ N + N TYK+G N+F+D TN+E+R +Y G K + + + ++ +
Sbjct: 58 IFKDNLGFIDEHNAQ-NYTYKVGLNKFADTTNEEYRNMYLGTKNDAKRNVMKIKITTGHR 116
Query: 128 YQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQ 187
Y S +P +DWR KGAV IK+Q CG CWAF+ +A VE I KI +G L+ LSEQ+
Sbjct: 117 YAFNSGDRLPVHVDWRSKGAVAHIKDQGSCGSCWAFSTIATVEAINKIVTGKLVSLSEQE 176
Query: 188 LLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNY 246
L+DC N GC GG + AF +I++N GI TE +YPY+ G C +K A I Y
Sbjct: 177 LVDCDRAFNEGCNGGLMDYAFEFIVENGGIDTEQDYPYKGFEGRCDPTRKNAKVVSIDGY 236
Query: 247 EEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTE 306
E+VP+ +E AL KAV QPVS+AI A Q Y+ G+F G CGT LDH V +VG+G E
Sbjct: 237 EDVPAYNENALKKAVFHQPVSVAIEAGGRALQLYQSGVFTGRCGTNLDHGVVVVGYG-FE 295
Query: 307 DGANYWLIKNSWGNTWGDAGYMKIVR-----DEGLCGIGTRSSYPL 347
+G +YWL++NSWG WG+ GY K+ R + G CGI ++SYP+
Sbjct: 296 NGVDYWLVRNSWGTNWGEDGYFKLERNVKKINTGKCGIAMQASYPV 341
>gi|220983358|dbj|BAH11164.1| cysteine protease [Hordeum vulgare]
Length = 462
Score = 291 bits (744), Expect = 4e-76, Method: Compositional matrix adjust.
Identities = 143/315 (45%), Positives = 204/315 (64%), Gaps = 12/315 (3%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN---KEGNRTYKLGTNQF 96
E+ V ++ +WMA+H +Y E+E R + F+ NL YI++ N G +++LG N+F
Sbjct: 35 EEEVRRMYAEWMAEHHSTYNPIGEEERRFEAFRNNLRYIDQHNAAADAGVHSFRLGLNRF 94
Query: 97 SDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKE 156
+DLTN+E+R+ Y G + R ++ +YQ ++P S+DWR KGAV +K+Q
Sbjct: 95 ADLTNEEYRSTYLGARTKPDRERKLSA---RYQAADNDELPESVDWRKKGAVGAVKDQGG 151
Query: 157 CGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQG 216
CG CWAF+A+AAVEGI +I +G++I LSEQ+L+DC T+ N GC GG + AF +II N G
Sbjct: 152 CGSCWAFSAIAAVEGINQIVTGDMIPLSEQELVDCDTSYNQGCNGGLMDYAFEFIINNGG 211
Query: 217 IATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYST 275
I +E++YPY+ C A +K A I YE+VP E++L KAV+ QP+S+AI A
Sbjct: 212 IDSEEDYPYKERDNRCDANKKNAKVVTIDGYEDVPVNSEKSLQKAVANQPISVAIEAGGR 271
Query: 276 EFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-- 333
FQ YK GIF G CGT LDH V VG+G TE+G +YWL++NSWG+ WG+ GY+++ R+
Sbjct: 272 AFQLYKSGIFTGTCGTALDHGVAAVGYG-TENGKDYWLVRNSWGSVWGENGYIRMERNIK 330
Query: 334 --EGLCGIGTRSSYP 346
G CGI SYP
Sbjct: 331 ASSGKCGIAVEPSYP 345
>gi|302790836|ref|XP_002977185.1| hypothetical protein SELMODRAFT_106228 [Selaginella moellendorffii]
gi|300155161|gb|EFJ21794.1| hypothetical protein SELMODRAFT_106228 [Selaginella moellendorffii]
Length = 299
Score = 291 bits (744), Expect = 5e-76, Method: Compositional matrix adjust.
Identities = 142/303 (46%), Positives = 199/303 (65%), Gaps = 8/303 (2%)
Query: 46 IHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFR 105
+ E W A+HG+SY + EK RL IF + L YIEK N + N T+ LG N+FSDLTN EFR
Sbjct: 1 MFEDWAAKHGKSYSSDSEKARRLMIFSDTLAYIEKHNAQPNTTFTLGLNKFSDLTNAEFR 60
Query: 106 ALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAA 165
A Y G K SP ++ + K ++ ++ +PTSLDWR +GAVTPIK+Q +CG CWAF+A
Sbjct: 61 ANYVG-KFKSPRYQDRRPA--KDVDVDVSSLPTSLDWRQEGAVTPIKDQGQCGSCWAFSA 117
Query: 166 VAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPY 225
+A++E + + L+ LSEQQL+DC T + GC GG E AF ++++N G+ TE+ YPY
Sbjct: 118 IASIESAHFLATKELVSLSEQQLIDCDTV-DQGCQGGFPEDAFKFVVENGGVTTEEAYPY 176
Query: 226 QAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIF 285
G+C+ A K +I+ Y++V AL+KAVS PV++ I FQ+Y+ GI
Sbjct: 177 TGFAGSCN-ANKNKVVEITGYKDVTKDSADALMKAVSKTPVTVGICGSDQNFQNYRSGIL 235
Query: 286 NGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD--EGLCGIGTRS 343
+G C DHAV ++G+G TE G YW+IKNSWG +WG+ G+MKI + EG+CG+ +S
Sbjct: 236 SGQCSNSRDHAVLVIGYG-TEGGMPYWIIKNSWGTSWGENGFMKIKKKDGEGMCGMNGQS 294
Query: 344 SYP 346
SYP
Sbjct: 295 SYP 297
>gi|2224808|emb|CAB09697.1| cysteine endopeptidase EP-A [Hordeum vulgare subsp. vulgare]
gi|326502180|dbj|BAK06781.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 365
Score = 291 bits (744), Expect = 5e-76, Method: Compositional matrix adjust.
Identities = 144/315 (45%), Positives = 197/315 (62%), Gaps = 10/315 (3%)
Query: 40 EQSVVEIHEKWMAQHGRSYKD--ELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
E+S+ ++E+W + + S + +E R +FKEN Y+ + NK +R ++L N+F+
Sbjct: 34 EESLRGLYERWRSHYTVSRRGLGADAEERRFNVFKENARYVHEGNKR-DRPFRLALNKFA 92
Query: 98 DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD-VPTSLDWRDKGAVTPIKNQKE 156
D+T DEFR Y G ++ S + D +P ++DWR KGAVT IK+Q +
Sbjct: 93 DMTTDEFRRTYAGSRVRHHLSLSGGRRGDGGFRYADADNLPPAVDWRQKGAVTAIKDQGQ 152
Query: 157 CGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQG 216
CG CWAF+ + AVEGI KIR+G L+ LSEQ+L+DC N GC GG + AF +I +N G
Sbjct: 153 CGSCWAFSTIVAVEGINKIRTGKLVSLSEQELMDCDNVNNQGCEGGLMDYAFQFIQKN-G 211
Query: 217 IATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYST 275
I TE YPYQ G+C A++ A A I YE+VP+ DE AL KAV+ QPVS+AI A
Sbjct: 212 ITTESNYPYQGEQGSCDQAKENAQAVTIDGYEDVPANDESALQKAVAGQPVSVAIDASGQ 271
Query: 276 EFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-- 333
+FQ Y EG+F G C T LDH V VG+G T DG YW++KNSWG WG+ GY+++ R
Sbjct: 272 DFQFYSEGVFTGECSTDLDHGVAAVGYGATRDGTKYWIVKNSWGEDWGEKGYIRMQRGVS 331
Query: 334 --EGLCGIGTRSSYP 346
EGLCGI ++SYP
Sbjct: 332 QTEGLCGIAMQASYP 346
>gi|356517184|ref|XP_003527269.1| PREDICTED: xylem cysteine proteinase 2-like [Glycine max]
Length = 350
Score = 290 bits (743), Expect = 5e-76, Method: Compositional matrix adjust.
Identities = 142/309 (45%), Positives = 209/309 (67%), Gaps = 11/309 (3%)
Query: 43 VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTND 102
++E+ E W+++HG+ Y+ EK R +IFK+NL++I++ NK + Y LG N+F+DL++
Sbjct: 44 LIELFESWISRHGKIYQSIEEKLHRFEIFKDNLKHIDERNKVVS-NYWLGLNEFADLSHQ 102
Query: 103 EFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWA 162
EF+ Y G K+ S R + F Y+++ ++P S+DWR KGAVT +KNQ CG CWA
Sbjct: 103 EFKNKYLGLKVDY-SRRRESPEEFTYKDV---ELPKSVDWRKKGAVTQVKNQGSCGSCWA 158
Query: 163 FAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDE 222
F+ VAAVEGI +I +GNL LSEQ+L+DC NNGC GG + AF++I++N G+ E++
Sbjct: 159 FSTVAAVEGINQIVTGNLTSLSEQELIDCDRTYNNGCNGGLMDYAFSFIVENDGLHKEED 218
Query: 223 YPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYK 281
YPY GTC A++ IS Y +VP +EQ+LLKA++ QP+S+AI A +FQ Y
Sbjct: 219 YPYIMEEGTCEMAKEETEVVTISGYHDVPQNNEQSLLKALANQPLSVAIEASGRDFQFYS 278
Query: 282 EGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLC 337
G+F+G CG+ LDH V VG+GT + G +Y +KNSWG+ WG+ GY+++ R+ EG+C
Sbjct: 279 GGVFDGHCGSDLDHGVAAVGYGTAK-GVDYITVKNSWGSKWGEKGYIRMRRNIGKPEGIC 337
Query: 338 GIGTRSSYP 346
GI +SYP
Sbjct: 338 GIYKMASYP 346
>gi|146215986|gb|ABQ10195.1| actinidin Act2d [Actinidia eriantha]
Length = 381
Score = 290 bits (742), Expect = 6e-76, Method: Compositional matrix adjust.
Identities = 148/340 (43%), Positives = 207/340 (60%), Gaps = 11/340 (3%)
Query: 13 INTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFK 72
I+ + +F L++S A + +S V+ ++E W+ + G+SY EKEMR +IFK
Sbjct: 10 ISMSLLFFSTLLILSSALDIKNSVQRTNDQVMAMYESWLVEQGKSYNSLDEKEMRFEIFK 69
Query: 73 ENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
ENL I+ N + NR+Y LG N+F+DLT++E+R+ Y G+K + S +Y
Sbjct: 70 ENLRIIDDHNADANRSYSLGLNRFADLTDEEYRSTYLGFKSGPKAKVSN-----RYVPKV 124
Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
+P +DWR GAV +K+Q C CWAF+AVAAVEGI KI +GNLI LSEQ+L+DC
Sbjct: 125 GVVLPNYVDWRTVGAVVGVKDQGLCSSCWAFSAVAAVEGINKIVTGNLISLSEQELVDCG 184
Query: 193 -TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAA-AKISNYEEVP 250
T GC G AF +II N GI TED YPY A G C +K I NYE++P
Sbjct: 185 RTQRTRGCNRGYMNDAFQFIIDNGGINTEDNYPYTAQDGQCDWYRKNQRYVTIDNYEQLP 244
Query: 251 SGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGAN 310
+ +E L AV+ QP+++ + + +F+ Y GI+ G CGT +DH VTIVG+G TE G +
Sbjct: 245 ANNEWVLQNAVAYQPITVGLESEGGKFKLYTSGIYTGYCGTAIDHGVTIVGYG-TERGLD 303
Query: 311 YWLIKNSWGNTWGDAGYMKIVRD---EGLCGIGTRSSYPL 347
YW++KNSWG WG+ GY++I R+ G CGI SYP+
Sbjct: 304 YWIVKNSWGTNWGENGYIRIQRNIGGAGKCGIAMVPSYPV 343
>gi|302790828|ref|XP_002977181.1| hypothetical protein SELMODRAFT_106402 [Selaginella moellendorffii]
gi|300155157|gb|EFJ21790.1| hypothetical protein SELMODRAFT_106402 [Selaginella moellendorffii]
Length = 337
Score = 290 bits (742), Expect = 7e-76, Method: Compositional matrix adjust.
Identities = 149/335 (44%), Positives = 211/335 (62%), Gaps = 15/335 (4%)
Query: 21 IITLLVSCASQVVSSR--STHEQSVVEIH---EKWMAQHGRSYKDELEKEMRLKIFKENL 75
+I L+V A+ +R + + +EI E W A+HG+SY + EK RL IF + L
Sbjct: 6 LILLVVVGATPFAIARPAALEDGRALEIKNMFEDWAAKHGKSYSSDWEKARRLMIFSDTL 65
Query: 76 EYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTG-YKMPSPSHRSTTSSTFKYQNLSMT 134
YIEK N + N T+ LG N+FSDLTN EFRA++ G +K P R +++ ++
Sbjct: 66 AYIEKHNAQPNTTFTLGLNKFSDLTNAEFRAMHVGKFKRPRYQDRLPAED----EDVDVS 121
Query: 135 DVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN 194
+PTSLDWR KGAVTPIK+Q +CG CWAF+A+A++E + + L+ LSEQQL+DC T
Sbjct: 122 SLPTSLDWRQKGAVTPIKDQGDCGSCWAFSAIASIESAHFLATKELVSLSEQQLMDCDTV 181
Query: 195 GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQ-KPAAAKISNYEEVPSGD 253
+ GC GG E AF ++++N G+ TE YPY G+C+A + K A+I+ ++ V
Sbjct: 182 -DAGCDGGLMETAFKFVVKNGGVTTEAAYPYTGSVGSCNANKAKNKVAEITGFKVVTEDS 240
Query: 254 EQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWL 313
AL+KAVS PV+++I FQ+YK GI +G C LDH V ++G+G TE G YW+
Sbjct: 241 ADALMKAVSKTPVTVSICGSDENFQNYKSGILSGKCDDSLDHGVLLIGYG-TEGGMPYWI 299
Query: 314 IKNSWGNTWGDAGYMKIVRD--EGLCGIGTRSSYP 346
IKNSWG +WG+ G+MKI R +G+CG+ SSYP
Sbjct: 300 IKNSWGTSWGEDGFMKIERKDGDGMCGMNGDSSYP 334
>gi|224133764|ref|XP_002321655.1| predicted protein [Populus trichocarpa]
gi|222868651|gb|EEF05782.1| predicted protein [Populus trichocarpa]
Length = 360
Score = 290 bits (742), Expect = 8e-76, Method: Compositional matrix adjust.
Identities = 145/317 (45%), Positives = 205/317 (64%), Gaps = 14/317 (4%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
E+S+ +++EKW + H S + EK R +F+ N+ ++ NK ++ YKL N+F+D+
Sbjct: 31 EESLWDLYEKWRSHHTVSTSLD-EKRKRFNVFRANVLHVHNTNKM-DKPYKLKLNKFADM 88
Query: 100 TNDEFRALYTGYKMPSPSH---RSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKE 156
TN EFR Y K+ + + +F Y N+ VP S+DWR KGAVTP+K+Q +
Sbjct: 89 TNHEFRTAYASSKVKHHTMFRGAPLGNGSFMYGNID--KVPASIDWRKKGAVTPVKDQGK 146
Query: 157 CGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQG 216
CG CWAF+ + AVEGI I++ LI LSEQ+L+DC+T N+GC GG + AF +I + +G
Sbjct: 147 CGSCWAFSTIVAVEGINFIKTNKLISLSEQELVDCNTGENHGCNGGLMDYAFEFITKQKG 206
Query: 217 IATEDEYPYQAVPGTCSA--AQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYS 274
I TE YPY+A G C A A +PA + I +E+V +E ALLKAV+ QPVS+AI A
Sbjct: 207 ITTEANYPYRAQDGHCDANKANQPAVS-IDGHEDVLHNNENALLKAVANQPVSVAIDAGG 265
Query: 275 TEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD- 333
++FQ Y EG+F G CG +LDH V IVG+GTT DG YW+++NSWG WG+ GY+++ R
Sbjct: 266 SDFQFYSEGVFTGECGKELDHGVAIVGYGTTVDGTKYWIVRNSWGPEWGERGYIRMQRGI 325
Query: 334 ---EGLCGIGTRSSYPL 347
GLCGI +SYP+
Sbjct: 326 SDRRGLCGIAMEASYPI 342
>gi|414591545|tpg|DAA42116.1| TPA: hypothetical protein ZEAMMB73_388689 [Zea mays]
Length = 384
Score = 290 bits (741), Expect = 8e-76, Method: Compositional matrix adjust.
Identities = 139/323 (43%), Positives = 200/323 (61%), Gaps = 18/323 (5%)
Query: 40 EQSVVEIHEKWMAQHGR----SYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQ 95
E+S+ ++E+W + + R D+ ++ R +FKEN Y+ +AN++ R ++L N+
Sbjct: 34 EESLRALYERWRSHYHRVSPRDGDDKQQQARRFNVFKENARYVHEANRKDGRPFRLALNK 93
Query: 96 FSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNL-------SMTDVPTSLDWRDKGAV 148
F+D+T DEFR Y G + + HR+ + + T++P ++DWR +GAV
Sbjct: 94 FADMTTDEFRRTYAGSR--TRHHRAQLGEARSFAHAQHGRGGSGTTNLPPAVDWRLRGAV 151
Query: 149 TPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAF 208
T +K+Q +CG CWAF+A+AAVEG+ KI +G L+ LSEQ+L+DC N GC GG + AF
Sbjct: 152 TGVKDQGQCGSCWAFSAIAAVEGVNKIMTGKLVSLSEQELVDCDDVDNQGCDGGLMDYAF 211
Query: 209 AYIIQNQGIATEDEYPYQAVPGTCS-AAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVS 267
YI +N G+ TE YPY A +C+ A ++ I YE+VP+ +E AL KAV+ QPV+
Sbjct: 212 QYIQRNGGVTTESNYPYLAEQRSCNKAKERSHDVTIDGYEDVPANNEDALQKAVASQPVA 271
Query: 268 IAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGY 327
+AI A +FQ Y EG+F G CGT LDH V VG+GTT DG YW +KNSWG WG+ GY
Sbjct: 272 VAIEASGQDFQFYSEGVFTGSCGTDLDHGVAAVGYGTTGDGTKYWTVKNSWGEDWGERGY 331
Query: 328 MKIVR----DEGLCGIGTRSSYP 346
+++ R GLCGI SYP
Sbjct: 332 IRMQRGVPDSRGLCGIAMEPSYP 354
>gi|351629615|gb|AEQ54771.1| KDDL-tailed cysteine proteinase CP4 [Coffea canephora]
Length = 359
Score = 290 bits (741), Expect = 8e-76, Method: Compositional matrix adjust.
Identities = 148/340 (43%), Positives = 217/340 (63%), Gaps = 20/340 (5%)
Query: 20 IIITLLVSCASQVVSSRS-THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
++ +LV+ S ++ R E+S+ +++E+W + H S +D EK R +FK N+ +I
Sbjct: 12 VLAVILVAAMSMEITERDLASEESLWDLYERWRSHHTVS-RDLSEKRKRFNVFKANVHHI 70
Query: 79 EKANKEGNRTYKLGTNQFSDLTNDEFRALYTG----YKMPSPSHRSTTSSTFKYQNLSMT 134
K N++ ++ YKL N F+D+TN EFR Y+ Y+M S +T K ++L
Sbjct: 71 HKVNQK-DKPYKLKLNSFADMTNHEFREFYSSKVKHYRMLHGSRANTGFMHGKTESL--- 126
Query: 135 DVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN 194
P S+DWR +GAVT +KNQ +CG CWAF+ V VEGI KI++G L+ LSEQ+L+DC T+
Sbjct: 127 --PASVDWRKQGAVTGVKNQGKCGSCWAFSTVVGVEGINKIKTGQLVSLSEQELVDCETD 184
Query: 195 GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGD 253
N GC GG E A+ +I ++ GI TE YPY+A G+C +++ A A I +E VP+ D
Sbjct: 185 -NEGCNGGLMENAYEFIKKSGGITTERLYPYKARDGSCDSSKMNAPAVTIDGHEMVPAND 243
Query: 254 EQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNG-VCGTQLDHAVTIVGFGTTEDGANYW 312
E AL+KAV+ QPVS+AI A ++ Q Y EG++ G CG +LDH V +VG+GT DG YW
Sbjct: 244 ENALMKAVANQPVSVAIDASGSDMQFYSEGVYAGDSCGNELDHGVAVVGYGTALDGTKYW 303
Query: 313 LIKNSWGNTWGDAGYMKIVR-----DEGLCGIGTRSSYPL 347
++KNSWG WG+ GY+++ R + G+CGI +SYPL
Sbjct: 304 IVKNSWGTGWGEQGYIRMQRGVDAAEGGVCGIAMEASYPL 343
>gi|121308860|dbj|BAF43527.1| cysteine proteinase [Zinnia elegans]
Length = 352
Score = 290 bits (741), Expect = 8e-76, Method: Compositional matrix adjust.
Identities = 139/311 (44%), Positives = 205/311 (65%), Gaps = 10/311 (3%)
Query: 41 QSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLT 100
V+ + E W+ +H + Y+ EK R +IF +NL++I++ NK+ + Y LG N+F+DLT
Sbjct: 43 HKVIHLFESWLVKHSKFYESLDEKLHRFEIFMDNLKHIDETNKKVS-NYWLGLNEFADLT 101
Query: 101 NDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCC 160
++EF+ + G+K + +S F Y++ D+P S+DWR KGAV P+KNQ +CG C
Sbjct: 102 HEEFKHKFLGFKGELAERKDESSKEFGYRDF--VDLPKSVDWRKKGAVAPVKNQGQCGNC 159
Query: 161 WAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATE 220
WAF+ VAAVEGI +I +GNL LSEQ+L+DC T NNGC GG + AFAY++++ G+ E
Sbjct: 160 WAFSTVAAVEGINQIVTGNLTMLSEQELIDCDTTFNNGCNGGLMDYAFAYVMRS-GLHKE 218
Query: 221 DEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQS 279
+EYPY GTC + + IS Y +VP DE + LKA++ QP+S+AI A +FQ
Sbjct: 219 EEYPYIMSEGTCDEKKDVSEKVTISGYHDVPRNDEASFLKALANQPISVAIEASGRDFQF 278
Query: 280 YKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EG 335
Y G+F+G CGT+LDH V VG+GTT+ G +Y +++NSWG WG+ GY+++ R G
Sbjct: 279 YSGGVFDGHCGTELDHGVAAVGYGTTK-GLDYVIVRNSWGPKWGEKGYIRMKRGSGKPHG 337
Query: 336 LCGIGTRSSYP 346
+CG+ +SYP
Sbjct: 338 MCGLYMMASYP 348
>gi|46395939|sp|Q94B08.2|GCP1_ARATH RecName: Full=Germination-specific cysteine protease 1; Flags:
Precursor
gi|4006883|emb|CAB16767.1| cysteine proteinase [Arabidopsis thaliana]
gi|7270637|emb|CAB80354.1| cysteine proteinase [Arabidopsis thaliana]
Length = 376
Score = 290 bits (741), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 147/322 (45%), Positives = 209/322 (64%), Gaps = 16/322 (4%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDEL----EKEMRLKIFKENLEYIEKANKEG-NRTYKLGTN 94
++ V I+ +W A+HG++ + +++ R IFK+NL +I+ N++ N TYKLG
Sbjct: 42 DEEVRSIYLQWSAEHGKTNNNNNGIINDQDKRFNIFKDNLRFIDLHNEDNKNATYKLGLT 101
Query: 95 QFSDLTNDEFRALYTGYKMPSPSHRSTTSSTF--KYQN-LSMTDVPTSLDWRDKGAVTPI 151
+F+DLTNDE+R LY G + P+ R + KY ++ +VP ++DWR KGAV PI
Sbjct: 102 KFTDLTNDEYRKLYLGART-EPARRIAKAKNVNQKYSAAVNGKEVPETVDWRQKGAVNPI 160
Query: 152 KNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYI 211
K+Q CG CWAF+ AAVEGI KI +G LI LSEQ+L+DC + N GC GG + AF +I
Sbjct: 161 KDQGTCGSCWAFSTTAAVEGINKIVTGELISLSEQELVDCDKSYNQGCNGGLMDYAFQFI 220
Query: 212 IQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAI 270
++N G+ TE +YPY+ G C++ K + I YE+VP+ DE AL KA+S QPVS+AI
Sbjct: 221 MKNGGLNTEKDYPYRGFGGKCNSFLKNSRVVSIDGYEDVPTKDETALKKAISYQPVSVAI 280
Query: 271 AAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKI 330
A FQ Y+ GIF G CGT LDHAV VG+G +E+G +YW+++NSWG WG+ GY+++
Sbjct: 281 EAGGRIFQHYQSGIFTGSCGTNLDHAVVAVGYG-SENGVDYWIVRNSWGPRWGEEGYIRM 339
Query: 331 VRD-----EGLCGIGTRSSYPL 347
R+ G CGI +SYP+
Sbjct: 340 ERNLAASKSGKCGIAVEASYPV 361
>gi|297745594|emb|CBI40759.3| unnamed protein product [Vitis vinifera]
Length = 300
Score = 289 bits (740), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 143/301 (47%), Positives = 206/301 (68%), Gaps = 10/301 (3%)
Query: 51 MAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTG 110
M++HG+SY+ EK R ++F++NL++I++ NK+ + +Y LG N+F+DL+++EF+ Y G
Sbjct: 1 MSKHGKSYRSFEEKLHRFEVFQDNLKHIDETNKKVS-SYWLGLNEFADLSHEEFKRKYLG 59
Query: 111 YKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVE 170
K+ P R + F Y++++ D+P S+DWR KGAV +KNQ CG CWAF+ VAAVE
Sbjct: 60 LKIELPKRRDSPEE-FSYKDVA--DLPKSVDWRKKGAVAHVKNQGACGSCWAFSTVAAVE 116
Query: 171 GITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPG 230
GI +I +GNL LSEQ+L+DC NNGC GG + AFA+II N G+ E++YPY G
Sbjct: 117 GINQIVTGNLTALSEQELIDCDKPFNNGCNGGLMDYAFAFIISNGGLRKEEDYPYVMEEG 176
Query: 231 TCSAAQKP-AAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVC 289
TC ++ IS Y +VP +EQ+ LKA++ QP+S+AI A S FQ Y GIFNG C
Sbjct: 177 TCGEKKEELEVVTISGYHDVPEDNEQSFLKALANQPLSVAIEASSRGFQFYSGGIFNGHC 236
Query: 290 GTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSY 345
GT+LDH V VG+GT++ G +Y +KNSWG+ WG+ GY+++ R+ EG+CGI +SY
Sbjct: 237 GTELDHGVAAVGYGTSK-GVDYITVKNSWGSKWGEKGYIRMKRNVGKPEGICGIYKMASY 295
Query: 346 P 346
P
Sbjct: 296 P 296
>gi|255646088|gb|ACU23531.1| unknown [Glycine max]
Length = 362
Score = 289 bits (740), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 150/322 (46%), Positives = 201/322 (62%), Gaps = 24/322 (7%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDEL----EKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQ 95
E+S +++E+W RSY+ +K R +FK N+ ++ NK ++ YKL N+
Sbjct: 33 EESFWDLYERW-----RSYRTVSRSLGDKHKRFNVFKANVMHVHNTNKM-DKPYKLKLNK 86
Query: 96 FSDLTNDEFRALYTGYKMPSPSHR-----STTSSTFKYQNLSMTDVPTSLDWRDKGAVTP 150
F+D+TN EFR+ Y G K+ HR + TF Y+ + VP S DWR GAVT
Sbjct: 87 FADMTNHEFRSTYAGSKVNH--HRMFQGTPRGNGTFMYEKVG--SVPPSADWRKNGAVTG 142
Query: 151 IKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAY 210
+K+Q +CG CWAF+ V AVEGI +I++ L+ LSEQ+L+DC T N GC GG E AF +
Sbjct: 143 VKDQGQCGSCWAFSTVVAVEGINQIKTNKLVSLSEQELVDCDTKKNAGCNGGLMESAFEF 202
Query: 211 IIQNQGIATEDEYPYQAVPGTCSAAQ-KPAAAKISNYEEVPSGDEQALLKAVSMQPVSIA 269
I Q GI TE YPY A GTC A++ A I +E VP+ DE ALLKAV+ QPVS+A
Sbjct: 203 IKQKGGITTESNYPYTAQDGTCDASKANDLAVSIDGHENVPANDENALLKAVANQPVSVA 262
Query: 270 IAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMK 329
I A +FQ Y EG+F G C T+L+H V IVG+GTT DG NYW ++NSWG WG+ GY++
Sbjct: 263 IDAGGFDFQFYFEGVFTGDCSTELNHGVAIVGYGTTVDGTNYWTVRNSWGPEWGEQGYIR 322
Query: 330 ----IVRDEGLCGIGTRSSYPL 347
I + EGLCGI +SYP+
Sbjct: 323 MQRSIFKKEGLCGIAMMASYPI 344
>gi|317106666|dbj|BAJ53169.1| JHL18I08.3 [Jatropha curcas]
Length = 441
Score = 289 bits (740), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 138/309 (44%), Positives = 197/309 (63%), Gaps = 7/309 (2%)
Query: 43 VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTND 102
+ + E W QHG++Y + EK RLK+F++N +++ + N +GN +Y L N F+DLT+
Sbjct: 26 IAHLFETWCQQHGKTYASQEEKLFRLKVFQDNYDFVTEHNSQGNSSYTLSLNAFADLTHH 85
Query: 103 EFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWA 162
EF+A G + + + S + + + DVP S+DWR GAVT +K+Q CG CW+
Sbjct: 86 EFKASRLGLSSAASASLNVDRSNRQIPDF-VADVPASVDWRKNGAVTQVKDQGNCGACWS 144
Query: 163 FAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDE 222
F+A A+EGI KI +G+L+ LSEQ+L+DC + NNGC GG + AF ++I N GI TE++
Sbjct: 145 FSATGAIEGINKIVTGSLVSLSEQELVDCDKSYNNGCEGGIMDYAFQFVIDNHGIDTEED 204
Query: 223 YPYQAVPGTCSAAQ-KPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYK 281
YPYQ +C+ + K I Y +VP +E+ LLKAV+ QPVS+ I FQ Y
Sbjct: 205 YPYQGRDRSCNKEKLKRHVVTIDGYVDVPQNNEKELLKAVANQPVSVGICGSERAFQLYS 264
Query: 282 EGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLC 337
+GIF G C T LDHAV IVG+G +E+G +YW++KNSWG+ WG GYM + R+ GLC
Sbjct: 265 KGIFTGPCSTSLDHAVLIVGYG-SENGVDYWIVKNSWGSYWGMDGYMHMQRNSGSSRGLC 323
Query: 338 GIGTRSSYP 346
GI +SYP
Sbjct: 324 GINMLASYP 332
>gi|357115272|ref|XP_003559414.1| PREDICTED: thiol protease SEN102-like [Brachypodium distachyon]
Length = 360
Score = 289 bits (739), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 147/319 (46%), Positives = 205/319 (64%), Gaps = 19/319 (5%)
Query: 47 HEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGN-------RTYKLGTNQFSDL 99
HE WMA+HGR+Y D EK RL+IF+ N E I+ N + + +++L TN+F+DL
Sbjct: 43 HESWMAEHGRTYADAEEKARRLEIFRANAERIDSFNSKADAAAGESVDSHRLATNRFADL 102
Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM-TDVPTSLDWRDKGAVTPIKNQKECG 158
T++EFRA TG + P+ F+Y+N S+ D S+DWR GAVT +K+Q CG
Sbjct: 103 TDEEFRAARTGLRRPAAVA-GAVGGGFRYENFSLQADAAGSMDWRAMGAVTGVKDQGSCG 161
Query: 159 CCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNN-GCLGGSREKAFAYIIQNQGI 217
CCWAF+AVAA+EG+TKIR+G L+ LSEQQL+DC G++ GC GG + AF YI + G+
Sbjct: 162 CCWAFSAVAAMEGLTKIRTGRLVSLSEQQLVDCDVYGDDQGCEGGLMDNAFQYISRQGGL 221
Query: 218 ATEDEYPYQAVP-GTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTE 276
A+E YPY G+C + + AA I +E+VP+ +E AL+ AV+ QPVS+AI
Sbjct: 222 ASESAYPYSGEDGGSCRSGRAQPAASIRGHEDVPANNEGALMAAVAHQPVSVAINGGDYV 281
Query: 277 FQSYKE----GIFNGVC-GTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKI- 330
F+ Y NG C T+LDHA+T VG+G DG YWL+KNSWG+ WG++GY++I
Sbjct: 282 FRFYDRGVLGAGGNGGCESTELDHAITAVGYGMAGDGTGYWLMKNSWGSGWGESGYVRIR 341
Query: 331 --VRDEGLCGIGTRSSYPL 347
R EG+CG+ +SYP+
Sbjct: 342 RGSRGEGVCGLAKLASYPV 360
>gi|413951606|gb|AFW84255.1| hypothetical protein ZEAMMB73_933931 [Zea mays]
Length = 379
Score = 289 bits (739), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 144/355 (40%), Positives = 208/355 (58%), Gaps = 23/355 (6%)
Query: 12 KINTTPMFIIITLLVSCASQVVSSRSTHEQSVV------EIHEKWMAQHGRSYKDELEKE 65
+++ T + + + + S A ++ + E+ + +++E+W H R ++ EK
Sbjct: 3 QVSKTLLLVALVFVSSAAVELCRAIDFDERDLASDEALWDLYERWQTHH-RVHRHHGEKG 61
Query: 66 MRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKM------PSPSHR 119
R FKEN+ +I NK G+R Y+L N+F D+ +EFR+ + ++ SP+ R
Sbjct: 62 RRFGTFKENVRFIHAHNKRGDRPYRLRLNRFGDMGREEFRSTFADSRINDLRRQDSPAAR 121
Query: 120 STTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGN 179
+ F Y S D P S+DWR +GAVT +K+Q CG CWAF+ V AVEGI IR+G+
Sbjct: 122 AGAVPGFMYD--SAADPPRSVDWRQEGAVTGVKDQGHCGSCWAFSTVVAVEGINAIRTGS 179
Query: 180 LIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQ--- 236
L LSEQ+L+DC T+ NGC GG E AF +I GI TE YPY+A GTC +
Sbjct: 180 LASLSEQELIDCDTD-ENGCQGGLMENAFEFIKSFGGITTEAAYPYRASNGTCDGDRARR 238
Query: 237 -KPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDH 295
I ++ VP+G E AL KAV+ QPVS+A+ A FQ Y EG+F G CGT LDH
Sbjct: 239 GGGVVVVIDGHQMVPAGSEDALAKAVAHQPVSVAVDAGGQAFQFYSEGVFTGDCGTDLDH 298
Query: 296 AVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR---DEGLCGIGTRSSYPL 347
V VG+G +DG YW++KNSWG +WG+ GY+++ R + GLCGI +S+P+
Sbjct: 299 GVAAVGYGVGDDGTPYWIVKNSWGTSWGEGGYIRMQRGAGNGGLCGIAMEASFPI 353
>gi|186516984|ref|NP_195406.2| cysteine proteinase1 [Arabidopsis thaliana]
gi|15290508|gb|AAK92229.1| cysteine proteinase [Arabidopsis thaliana]
gi|332661313|gb|AEE86713.1| cysteine proteinase1 [Arabidopsis thaliana]
Length = 376
Score = 288 bits (738), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 147/322 (45%), Positives = 208/322 (64%), Gaps = 16/322 (4%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDEL----EKEMRLKIFKENLEYIEKANKEG-NRTYKLGTN 94
++ V I+ +W A+HG++ + +++ R IFK+NL +I+ N+ N TYKLG
Sbjct: 42 DEEVRSIYLQWSAEHGKTNNNNNGIINDQDKRFNIFKDNLRFIDLHNENNKNATYKLGLT 101
Query: 95 QFSDLTNDEFRALYTGYKMPSPSHRSTTSSTF--KYQN-LSMTDVPTSLDWRDKGAVTPI 151
+F+DLTNDE+R LY G + P+ R + KY ++ +VP ++DWR KGAV PI
Sbjct: 102 KFTDLTNDEYRKLYLGART-EPARRIAKAKNVNQKYSAAVNGKEVPETVDWRQKGAVNPI 160
Query: 152 KNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYI 211
K+Q CG CWAF+ AAVEGI KI +G LI LSEQ+L+DC + N GC GG + AF +I
Sbjct: 161 KDQGTCGSCWAFSTTAAVEGINKIVTGELISLSEQELVDCDKSYNQGCNGGLMDYAFQFI 220
Query: 212 IQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAI 270
++N G+ TE +YPY+ G C++ K + I YE+VP+ DE AL KA+S QPVS+AI
Sbjct: 221 MKNGGLNTEKDYPYRGFGGKCNSFLKNSRVVSIDGYEDVPTKDETALKKAISYQPVSVAI 280
Query: 271 AAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKI 330
A FQ Y+ GIF G CGT LDHAV VG+G +E+G +YW+++NSWG WG+ GY+++
Sbjct: 281 EAGGRIFQHYQSGIFTGSCGTNLDHAVVAVGYG-SENGVDYWIVRNSWGPRWGEEGYIRM 339
Query: 331 VRD-----EGLCGIGTRSSYPL 347
R+ G CGI +SYP+
Sbjct: 340 ERNLAASKSGKCGIAVEASYPV 361
>gi|413951605|gb|AFW84254.1| hypothetical protein ZEAMMB73_933931 [Zea mays]
Length = 423
Score = 288 bits (738), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 144/355 (40%), Positives = 208/355 (58%), Gaps = 23/355 (6%)
Query: 12 KINTTPMFIIITLLVSCASQVVSSRSTHEQSVV------EIHEKWMAQHGRSYKDELEKE 65
+++ T + + + + S A ++ + E+ + +++E+W H R ++ EK
Sbjct: 47 QVSKTLLLVALVFVSSAAVELCRAIDFDERDLASDEALWDLYERWQTHH-RVHRHHGEKG 105
Query: 66 MRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKM------PSPSHR 119
R FKEN+ +I NK G+R Y+L N+F D+ +EFR+ + ++ SP+ R
Sbjct: 106 RRFGTFKENVRFIHAHNKRGDRPYRLRLNRFGDMGREEFRSTFADSRINDLRRQDSPAAR 165
Query: 120 STTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGN 179
+ F Y S D P S+DWR +GAVT +K+Q CG CWAF+ V AVEGI IR+G+
Sbjct: 166 AGAVPGFMYD--SAADPPRSVDWRQEGAVTGVKDQGHCGSCWAFSTVVAVEGINAIRTGS 223
Query: 180 LIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQ--- 236
L LSEQ+L+DC T+ NGC GG E AF +I GI TE YPY+A GTC +
Sbjct: 224 LASLSEQELIDCDTD-ENGCQGGLMENAFEFIKSFGGITTEAAYPYRASNGTCDGDRARR 282
Query: 237 -KPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDH 295
I ++ VP+G E AL KAV+ QPVS+A+ A FQ Y EG+F G CGT LDH
Sbjct: 283 GGGVVVVIDGHQMVPAGSEDALAKAVAHQPVSVAVDAGGQAFQFYSEGVFTGDCGTDLDH 342
Query: 296 AVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR---DEGLCGIGTRSSYPL 347
V VG+G +DG YW++KNSWG +WG+ GY+++ R + GLCGI +S+P+
Sbjct: 343 GVAAVGYGVGDDGTPYWIVKNSWGTSWGEGGYIRMQRGAGNGGLCGIAMEASFPI 397
>gi|297799636|ref|XP_002867702.1| hypothetical protein ARALYDRAFT_329301 [Arabidopsis lyrata subsp.
lyrata]
gi|297313538|gb|EFH43961.1| hypothetical protein ARALYDRAFT_329301 [Arabidopsis lyrata subsp.
lyrata]
Length = 357
Score = 288 bits (738), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 149/332 (44%), Positives = 218/332 (65%), Gaps = 17/332 (5%)
Query: 27 SCASQVVSSRSTHEQSVVE---IHEKWMAQHGRSYKDEL-EKEMRLKIFKENLEYIEKAN 82
S A + ++ H +S E I + WM++HG++Y + L EKE R + FK+NL +I++ N
Sbjct: 25 SSAIDLPATSGGHNRSNEEVGFIFQMWMSKHGKTYTNALGEKERRFQNFKDNLRFIDQHN 84
Query: 83 KEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDW 142
+ N +Y+LG +F+DLT E+R L+ G P P R+ S +Y L +P S+DW
Sbjct: 85 AK-NLSYQLGLTRFADLTVQEYRDLFPG--SPKPKQRNLRISR-RYVPLDGDQLPESVDW 140
Query: 143 RDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLG- 201
R++GAV+ IK+Q C CWAF+ VAAVEGI KI +G L+ LSEQ+L+DC+ NNGC G
Sbjct: 141 RNEGAVSAIKDQGTCNSCWAFSTVAAVEGINKIVTGELVSLSEQELVDCNLV-NNGCYGS 199
Query: 202 GSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAA--KISNYEEVPSGDEQALLK 259
G+ + AF ++I N G+ ++ +YPYQ G C+ + + I +YE+VP+ DE +L K
Sbjct: 200 GTMDAAFQFLINNGGLDSDTDYPYQGSQGYCNRKESTSNKIITIDSYEDVPANDEISLQK 259
Query: 260 AVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWG 319
AV+ QPVS+ + S EF Y+ GI+NG CGT LDHA+ IVG+G +E+G +YW+++NSWG
Sbjct: 260 AVAHQPVSVGVDKKSQEFMLYRSGIYNGPCGTDLDHALVIVGYG-SENGQDYWIVRNSWG 318
Query: 320 NTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
TWGDAGY K+ R+ G+CGI +SYP+
Sbjct: 319 TTWGDAGYAKMARNFEYPSGVCGIAMLASYPV 350
>gi|413938554|gb|AFW73105.1| hypothetical protein ZEAMMB73_931917 [Zea mays]
Length = 361
Score = 288 bits (738), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 148/353 (41%), Positives = 218/353 (61%), Gaps = 24/353 (6%)
Query: 17 PMFIIITLLVSCASQVVSSRSTHEQSVV-----------EIHEKWMAQHGRSYKDELEKE 65
P + ++ A S+ + SVV + W +HG+ Y EK
Sbjct: 3 PKLAVAVFVLFLAFAACSANHHRDPSVVGYSQEDLALPSSLFRSWSVKHGKLYASPTEKL 62
Query: 66 MRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSP---SHRSTT 122
R +IFK+NL +I + N++ N +Y LG NQF+D+ ++EF+A Y G K P + ++ T
Sbjct: 63 ERYEIFKQNLMHIAETNRK-NGSYWLGLNQFADVAHEEFKASYLGLKRALPRAGAPQTRT 121
Query: 123 SSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQ 182
+ F+Y + +P S+DWR KGAVTP+KNQ +CG CWAF++VAAVEGI +I +G L+
Sbjct: 122 PTAFRYAAAAAGSLPWSVDWRYKGAVTPVKNQGKCGSCWAFSSVAAVEGINQIVTGKLVS 181
Query: 183 LSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAA- 241
LSEQ+L+DC T ++GC GG+ + AFAY++ +QGI ED+YPY G C Q
Sbjct: 182 LSEQELVDCDTTLDHGCEGGTMDLAFAYMMGSQGIHAEDDYPYLMEEGYCKEKQPCVLGI 241
Query: 242 ---KISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVT 298
++ +E+VP E +LLKA++ QPVS+ IAA S +FQ Y+ G+F+G C +LDHA+T
Sbjct: 242 TEQDLTGFEDVPENSEISLLKALAHQPVSVGIAAGSRDFQFYRGGVFDGACSVELDHALT 301
Query: 299 IVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIV----RDEGLCGIGTRSSYPL 347
VG+G++ G NY +KNSWG WG+ GY++I + EG+CGI T +SYP+
Sbjct: 302 AVGYGSSY-GQNYITMKNSWGKNWGEQGYVRIKMGTGKPEGVCGIYTMASYPV 353
>gi|413953666|gb|AFW86315.1| hypothetical protein ZEAMMB73_539008 [Zea mays]
Length = 314
Score = 288 bits (737), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 139/334 (41%), Positives = 199/334 (59%), Gaps = 34/334 (10%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
+ I+ C + + + + + ++V HE+WMAQ+ R YKD EK R K
Sbjct: 8 ILAILGFAFFCGAALAARDLSDDSAMVARHEQWMAQYSRVYKDASEKARRFK-------- 59
Query: 78 IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
F+DLTN EFR++ T S + + T F+Y+N+S +P
Sbjct: 60 ------------------FADLTNHEFRSVKTNKGFKSSNMKILTG--FRYENVSADALP 99
Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-N 196
T++DWR KG VTPIK+Q +CGCC AF+AVAA EGI KI +G L+ L++Q+L+DC +G +
Sbjct: 100 TTIDWRTKGVVTPIKDQGQCGCCSAFSAVAATEGIVKISTGKLVSLADQELVDCDVHGED 159
Query: 197 NGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQA 256
GC GG + AF +II+N G+ TE YPY A G C++ +AA I YE+VP+ DE A
Sbjct: 160 QGCEGGLMDDAFKFIIKNGGLTTESSYPYTAADGKCNSGSN-SAATIKGYEDVPANDEAA 218
Query: 257 LLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKN 316
L+KA++ QPVS+A+ F+ Y G+ G CGT LDH + +G+G T DG YWL+KN
Sbjct: 219 LMKAMANQPVSVAVDGGDMTFRFYSGGVMTGSCGTDLDHGIAAIGYGKTSDGTKYWLMKN 278
Query: 317 SWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
SWG TWG+ GY+++ +D G+CG+ SYP
Sbjct: 279 SWGTTWGENGYLRMEKDISDKRGMCGLAMEPSYP 312
>gi|115448287|ref|NP_001047923.1| Os02g0715000 [Oryza sativa Japonica Group]
gi|42408029|dbj|BAD09165.1| putative cysteine proteinase [Oryza sativa Japonica Group]
gi|113537454|dbj|BAF09837.1| Os02g0715000 [Oryza sativa Japonica Group]
gi|215737450|dbj|BAG96580.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215765786|dbj|BAG87483.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222623551|gb|EEE57683.1| hypothetical protein OsJ_08138 [Oryza sativa Japonica Group]
Length = 366
Score = 288 bits (737), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 156/351 (44%), Positives = 216/351 (61%), Gaps = 31/351 (8%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEK--------------WMAQHGRSYKDELE 63
M ++ V+C++ + S H+ SVV ++ W +H + Y E
Sbjct: 16 MLFLLLGFVACSA----TASHHDPSVVGYSQEDLALPNKLVGLFTSWSVKHSKIYASPKE 71
Query: 64 KEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTT- 122
K R +IFK NL +I + N+ N +Y LG N F+D+ ++EF+A Y G K P + R
Sbjct: 72 KVKRYEIFKRNLRHIVETNRR-NGSYWLGLNHFADIAHEEFKASYLGLK-PGLARRDAQP 129
Query: 123 --SSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNL 180
S+TF+Y N ++P ++DWR KGAVTP+KNQ ECG CWAF+ VAAVEGI +I +G L
Sbjct: 130 HGSTTFRYAN--AVNLPWAVDWRKKGAVTPVKNQGECGSCWAFSTVAAVEGINQIVTGKL 187
Query: 181 IQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAA 240
+ LSEQ+L+DC N+GC GG + AFAYI+ NQGI TE++YPY G C Q +
Sbjct: 188 VSLSEQELMDCDNTFNHGCRGGLMDFAFAYIMGNQGIYTEEDYPYLMEEGYCREKQPHSK 247
Query: 241 A-KISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTI 299
I+ YE+VP+ E +LLKA++ QPVS+ IAA S +FQ YK GIF+G CG Q DHA+T
Sbjct: 248 VITITGYEDVPANSETSLLKALAHQPVSVGIAAGSRDFQFYKGGIFDGECGIQPDHALTA 307
Query: 300 VGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR----DEGLCGIGTRSSYP 346
VG+G+ G +Y ++KNSWG WG+ GY +I R EG+C I +SYP
Sbjct: 308 VGYGSYY-GQDYIIMKNSWGKNWGEQGYFRIRRGTGKPEGVCDIYKIASYP 357
>gi|255646767|gb|ACU23856.1| unknown [Glycine max]
Length = 350
Score = 288 bits (737), Expect = 3e-75, Method: Compositional matrix adjust.
Identities = 140/309 (45%), Positives = 208/309 (67%), Gaps = 11/309 (3%)
Query: 43 VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTND 102
++E+ E WM++HG+ Y++ EK +R +IFK+NL++I++ NK + Y LG ++F+DL++
Sbjct: 44 LIELFESWMSRHGKIYENIEEKLLRFEIFKDNLKHIDERNKVVS-NYWLGLSEFADLSHR 102
Query: 103 EFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWA 162
EF Y G K+ S R + F Y+++ ++P S+DWR KGAV P+KNQ CG CWA
Sbjct: 103 EFNNKYLGLKVDY-SRRRESPEEFTYKDV---ELPKSVDWRKKGAVAPVKNQGSCGSCWA 158
Query: 163 FAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDE 222
F+ VAAVEGI +I +GNL LSEQ+L+DC NNGC GG + AF++I++N G+ E++
Sbjct: 159 FSTVAAVEGINQIVTGNLTSLSEQELIDCDRTYNNGCNGGLMDYAFSFIVENGGLHKEED 218
Query: 223 YPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYK 281
YPY G C ++ IS Y +VP +EQ+LLKA++ QP+S+AI A +FQ Y
Sbjct: 219 YPYIMEEGACEMTKEETQVVTISGYHDVPQNNEQSLLKALANQPLSVAIEASGRDFQFYS 278
Query: 282 EGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLC 337
G+F+G CG+ LDH V VG+GT + G +Y +KNSWG+ WG+ GY+++ R+ EG+C
Sbjct: 279 GGVFDGHCGSDLDHGVAAVGYGTAK-GVDYITVKNSWGSKWGEKGYIRMRRNIGKPEGIC 337
Query: 338 GIGTRSSYP 346
GI +SYP
Sbjct: 338 GIYKMASYP 346
>gi|297802228|ref|XP_002868998.1| cysteine proteinase [Arabidopsis lyrata subsp. lyrata]
gi|297314834|gb|EFH45257.1| cysteine proteinase [Arabidopsis lyrata subsp. lyrata]
Length = 375
Score = 288 bits (737), Expect = 3e-75, Method: Compositional matrix adjust.
Identities = 146/322 (45%), Positives = 206/322 (63%), Gaps = 16/322 (4%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDEL----EKEMRLKIFKENLEYIEKAN-KEGNRTYKLGTN 94
++ V I+ +W A HG++ + +++ R IFK+NL +I+ N K N TYKLG
Sbjct: 42 DEEVRSIYLQWSADHGKTNNNNNGIINDQDKRFNIFKDNLRFIDLHNEKNKNATYKLGLT 101
Query: 95 QFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD---VPTSLDWRDKGAVTPI 151
+F+DLTN+E+R+LY G + P R + + + D VP ++DWR KGAV PI
Sbjct: 102 KFTDLTNEEYRSLYLGART-EPVRRIAKAKNVNQKYSAAVDGKEVPETVDWRLKGAVNPI 160
Query: 152 KNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYI 211
K+Q CG CWAF+ AAVEGI KI +G LI LSEQ+L+DC + N GC GG + AF +I
Sbjct: 161 KDQGTCGSCWAFSTAAAVEGINKIVTGELISLSEQELVDCDNSYNQGCNGGLMDYAFQFI 220
Query: 212 IQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAI 270
++N G+ TE +YPY+ G C++ K A I YE+VP+ DE AL +A+S+QPVS+AI
Sbjct: 221 MKNGGLKTEKDYPYRGFGGKCNSFLKNAKVVSIDGYEDVPTKDETALKRAISLQPVSVAI 280
Query: 271 AAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKI 330
A FQ Y+ GIF G CGT LDHAV VG+G +E+G +YW+++NSWG WG+ GY+++
Sbjct: 281 EAGGRIFQHYQTGIFTGNCGTNLDHAVVAVGYG-SENGVDYWIVRNSWGPRWGEEGYIRM 339
Query: 331 VRD-----EGLCGIGTRSSYPL 347
R+ G CGI +SYP+
Sbjct: 340 ERNLASSKSGKCGIAVEASYPV 361
>gi|224085750|ref|XP_002307688.1| predicted protein [Populus trichocarpa]
gi|222857137|gb|EEE94684.1| predicted protein [Populus trichocarpa]
Length = 436
Score = 288 bits (737), Expect = 3e-75, Method: Compositional matrix adjust.
Identities = 137/312 (43%), Positives = 200/312 (64%), Gaps = 18/312 (5%)
Query: 45 EIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEF 104
++ E W +HG+SY + E+ RLK+F++N +++ K N +GN +Y L N F+DLT+ EF
Sbjct: 27 QLFETWCKEHGKSYTSQEERSHRLKVFEDNYDFVTKHNSKGNSSYSLALNAFADLTHHEF 86
Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLSMT----DVPTSLDWRDKGAVTPIKNQKECGCC 160
+ G S ++NL +T D+P S+DWR+KG VT +K+Q CG C
Sbjct: 87 KTSRLGL--------SAAPLNLAHRNLEITGVVGDIPASIDWRNKGVVTNVKDQGSCGAC 138
Query: 161 WAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATE 220
W+F+A A+EGI KI +G+L+ LSEQ+L++C + N+GC GG + AF ++I N GI TE
Sbjct: 139 WSFSATGAIEGINKIVTGSLVSLSEQELIECDKSYNDGCGGGLMDYAFQFVINNHGIDTE 198
Query: 221 DEYPYQAVPGTCSAAQ-KPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQS 279
++YPY+A GTC+ + K I Y +VP +E+ LL+AV+ QPVS+ I FQ
Sbjct: 199 EDYPYRARDGTCNKDRMKRRVVTIDKYVDVPENNEKQLLQAVAAQPVSVGICGSERAFQM 258
Query: 280 YKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EG 335
Y +GIF G C T LDHAV IVG+G +E+G +YW++KNSWG WG GYM + R+ +G
Sbjct: 259 YSKGIFTGPCSTSLDHAVLIVGYG-SENGVDYWIVKNSWGTGWGMRGYMHMQRNSGNSQG 317
Query: 336 LCGIGTRSSYPL 347
+CGI +SYP+
Sbjct: 318 VCGINMLASYPV 329
>gi|357156854|ref|XP_003577598.1| PREDICTED: thiol protease SEN102-like [Brachypodium distachyon]
Length = 368
Score = 288 bits (736), Expect = 4e-75, Method: Compositional matrix adjust.
Identities = 142/326 (43%), Positives = 206/326 (63%), Gaps = 21/326 (6%)
Query: 40 EQSVVEIHEKWMAQH-------GRSYKDEL---EKEMRLKIFKENLEYIEKANKEGNRTY 89
E+S+ ++E+W +++ G + +L + R +FKEN++YI +ANK+ +R +
Sbjct: 31 EESLRGLYERWRSRYTVSPSTPGSGLRGKLADHDPARRFNVFKENVKYIHEANKK-DRPF 89
Query: 90 KLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD---VPTSLDWRDKG 146
+L N+F+D+T DE R Y G ++ HR+ + N + +D +P ++DWR+KG
Sbjct: 90 RLALNKFADMTTDELRHSYAGSRVRH--HRALSGGRRAQGNFTYSDAENLPPAVDWREKG 147
Query: 147 AVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREK 206
AVT IK+Q +CG CWAF+ +AAVE I KIR+G L+ LSEQ+L+DC + GC GG +
Sbjct: 148 AVTGIKDQGQCGSCWAFSTIAAVESINKIRTGKLVSLSEQELMDCDNVNDQGCDGGLMDY 207
Query: 207 AFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAA-AKISNYEEVPSGDEQALLKAVSMQP 265
AF +I +N G+ +E YPYQ TC A++ I YE+VP+ DE AL KAV+ QP
Sbjct: 208 AFQFIQKNGGVTSEANYPYQGQQNTCDQAKENTHDVAIDGYEDVPANDESALQKAVAYQP 267
Query: 266 VSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDA 325
VS+AI A +FQ Y EG+F G C T LDH V VG+GT DG YW++KNSWG WG+
Sbjct: 268 VSVAIEASGQDFQFYSEGVFTGQCTTDLDHGVAAVGYGTARDGTKYWIVKNSWGLDWGEK 327
Query: 326 GYMKIVRD----EGLCGIGTRSSYPL 347
GY+++ R EGLCGI ++SYP+
Sbjct: 328 GYIRMQRGVSQAEGLCGIAMQASYPI 353
>gi|357507617|ref|XP_003624097.1| Cysteine protease [Medicago truncatula]
gi|355499112|gb|AES80315.1| Cysteine protease [Medicago truncatula]
Length = 340
Score = 288 bits (736), Expect = 4e-75, Method: Compositional matrix adjust.
Identities = 152/338 (44%), Positives = 216/338 (63%), Gaps = 19/338 (5%)
Query: 16 TPMFIIITLLVSCASQVVSSRSTHEQSVV--EIHEKWMAQHGRSYKDELEKEMRLKIFKE 73
T + I+I + V S + ++QS+ E ++ W ++ YKD+ E+E ++IFK
Sbjct: 9 TLINILIVIWVMFPS---NQNQENDQSLTLSERYKHWKIKYRVIYKDDAEEEKHIQIFKH 65
Query: 74 NLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
N+ YI+ N GN++YKL N+F+DL + + K+ TTSS FKY+N+
Sbjct: 66 NVAYIDSFNAAGNKSYKLTINRFADLPTEPSDDGFKKRKL-----EPTTSSLFKYKNI-- 118
Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLD-CS 192
TD+P ++DWR +GAVTP+KNQ+ECG CWAF+AV A+EGI +I SGNL+ LSEQ+L+D
Sbjct: 119 TDIPAAVDWRKRGAVTPVKNQRECGSCWAFSAVGALEGIQQITSGNLVSLSEQELVDRVR 178
Query: 193 TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSG 252
+N NGC GG AF ++++N GIATE YPY+ V G ++ + +I +YE+VP
Sbjct: 179 SNWTNGCNGGYLIDAFEFVLENGGIATEASYPYRGVKGN-NSKKVSRQVQIKSYEQVPRN 237
Query: 253 DEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYW 312
E +LLK V+ QPVS+ I S + Y GIF G CGT+ +HAV IVG+GT+ DG YW
Sbjct: 238 SEDSLLKVVANQPVSVGIDI-SGMIRFYSSGIFTGECGTKPNHAVIIVGYGTSNDGTKYW 296
Query: 313 LIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
L+KNSWG WG+ Y+++ RD EGLCGI +SYP
Sbjct: 297 LVKNSWGIRWGEKRYIRMKRDIDAKEGLCGIPMDASYP 334
>gi|222629675|gb|EEE61807.1| hypothetical protein OsJ_16426 [Oryza sativa Japonica Group]
Length = 459
Score = 287 bits (735), Expect = 4e-75, Method: Compositional matrix adjust.
Identities = 146/324 (45%), Positives = 202/324 (62%), Gaps = 12/324 (3%)
Query: 32 VVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN---KEGNRT 88
+VS E+ ++ +W A+HG+SY E+E R F++NL YI++ N G +
Sbjct: 26 IVSYGERSEEEARRLYAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDEHNAAADAGVHS 85
Query: 89 YKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAV 148
++LG N+F+DLTN+E+R Y G + R + N ++ P S+DWR KGAV
Sbjct: 86 FRLGLNRFADLTNEEYRDTYLGLRNKPRRERKVSDRYLAADNEAL---PESVDWRTKGAV 142
Query: 149 TPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAF 208
IK+Q CG CWAF+A+AAVEGI +I +G+LI LSEQ+L+DC T+ N GC GG + AF
Sbjct: 143 AEIKDQGGCGSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDTSYNEGCNGGLMDYAF 202
Query: 209 AYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVS 267
+II N GI TED+YPY+ C +K A I +YE+V E +L KAV+ QPVS
Sbjct: 203 DFIINNGGIDTEDDYPYKGKDERCDVNRKNAKVVTIDSYEDVTPNSETSLQKAVANQPVS 262
Query: 268 IAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGY 327
+AI A FQ Y GIF G CGT LDH V VG+G TE+G +YW+++NSWG +WG++GY
Sbjct: 263 VAIEAGGRAFQLYSSGIFTGKCGTALDHGVAAVGYG-TENGKDYWIVRNSWGKSWGESGY 321
Query: 328 MKIVRD----EGLCGIGTRSSYPL 347
+++ R+ G CGI SYPL
Sbjct: 322 VRMERNIKASSGKCGIAVEPSYPL 345
>gi|109939734|sp|P25776.2|ORYA_ORYSJ RecName: Full=Oryzain alpha chain; Flags: Precursor
gi|78192122|gb|ABB30151.1| oryzain alpha [Oryza sativa Japonica Group]
Length = 458
Score = 287 bits (735), Expect = 5e-75, Method: Compositional matrix adjust.
Identities = 146/324 (45%), Positives = 202/324 (62%), Gaps = 12/324 (3%)
Query: 32 VVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN---KEGNRT 88
+VS E+ ++ +W A+HG+SY E+E R F++NL YI++ N G +
Sbjct: 25 IVSYGERSEEEARRLYAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDEHNAAADAGVHS 84
Query: 89 YKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAV 148
++LG N+F+DLTN+E+R Y G + R + N ++ P S+DWR KGAV
Sbjct: 85 FRLGLNRFADLTNEEYRDTYLGLRNKPRRERKVSDRYLAADNEAL---PESVDWRTKGAV 141
Query: 149 TPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAF 208
IK+Q CG CWAF+A+AAVEGI +I +G+LI LSEQ+L+DC T+ N GC GG + AF
Sbjct: 142 AEIKDQGGCGSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDTSYNEGCNGGLMDYAF 201
Query: 209 AYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVS 267
+II N GI TED+YPY+ C +K A I +YE+V E +L KAV+ QPVS
Sbjct: 202 DFIINNGGIDTEDDYPYKGKDERCDVNRKNAKVVTIDSYEDVTPNSETSLQKAVANQPVS 261
Query: 268 IAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGY 327
+AI A FQ Y GIF G CGT LDH V VG+G TE+G +YW+++NSWG +WG++GY
Sbjct: 262 VAIEAGGRAFQLYSSGIFTGKCGTALDHGVAAVGYG-TENGKDYWIVRNSWGKSWGESGY 320
Query: 328 MKIVRD----EGLCGIGTRSSYPL 347
+++ R+ G CGI SYPL
Sbjct: 321 VRMERNIKASSGKCGIAVEPSYPL 344
>gi|125540888|gb|EAY87283.1| hypothetical protein OsI_08685 [Oryza sativa Indica Group]
Length = 357
Score = 287 bits (735), Expect = 5e-75, Method: Compositional matrix adjust.
Identities = 156/351 (44%), Positives = 215/351 (61%), Gaps = 31/351 (8%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEK--------------WMAQHGRSYKDELE 63
M ++ V+C++ + S H+ SVV ++ W +H + Y E
Sbjct: 7 MLFLLLGFVACSA----TASHHDPSVVGYSQEDLALPNKLVGLFTSWSVKHSKIYASPKE 62
Query: 64 KEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTT- 122
K R +IFK NL +I + N+ N +Y LG N F+D+ ++EF+A Y G K P + R
Sbjct: 63 KVKRYEIFKRNLRHIVETNRR-NGSYWLGLNHFADIAHEEFKASYLGLK-PGLARRDAQP 120
Query: 123 --SSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNL 180
S+TF+Y N ++P ++DWR KGAVTP+KNQ ECG CWAF+ VAAVEGI +I +G L
Sbjct: 121 HGSTTFRYAN--AVNLPWAVDWRKKGAVTPVKNQGECGSCWAFSTVAAVEGINQIVTGKL 178
Query: 181 IQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAA 240
+ LSEQ+L+DC N+GC GG + AFAYI+ NQGI TE++YPY G C Q +
Sbjct: 179 VSLSEQELMDCDNTFNHGCRGGLMDFAFAYIMGNQGIYTEEDYPYLMEEGYCREKQPHSK 238
Query: 241 A-KISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTI 299
I+ YE+VP E +LLKA++ QPVS+ IAA S +FQ YK GIF+G CG Q DHA+T
Sbjct: 239 VITITGYEDVPENSETSLLKALAHQPVSVGIAAGSRDFQFYKGGIFDGECGIQPDHALTA 298
Query: 300 VGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR----DEGLCGIGTRSSYP 346
VG+G+ G +Y ++KNSWG WG+ GY +I R EG+C I +SYP
Sbjct: 299 VGYGSYY-GQDYIIMKNSWGKNWGEQGYFRIRRGTGKPEGVCDIYKIASYP 348
>gi|297603535|ref|NP_001054211.2| Os04g0670200 [Oryza sativa Japonica Group]
gi|109939735|sp|P25777.2|ORYB_ORYSJ RecName: Full=Oryzain beta chain; Flags: Precursor
gi|32488398|emb|CAE02823.1| OSJNBa0043A12.28 [Oryza sativa Japonica Group]
gi|90399163|emb|CAJ86092.1| H0818H01.14 [Oryza sativa Indica Group]
gi|125550169|gb|EAY95991.1| hypothetical protein OsI_17862 [Oryza sativa Indica Group]
gi|215766596|dbj|BAG98700.1| unnamed protein product [Oryza sativa Japonica Group]
gi|255675868|dbj|BAF16125.2| Os04g0670200 [Oryza sativa Japonica Group]
Length = 466
Score = 287 bits (735), Expect = 5e-75, Method: Compositional matrix adjust.
Identities = 141/310 (45%), Positives = 212/310 (68%), Gaps = 15/310 (4%)
Query: 47 HEKWMAQHGRSYKDEL--EKEMRLKIFKENLEYIEKANKEGNRT--YKLGTNQFSDLTND 102
++ W+A++G + L E E R +F +NL++++ N + ++LG N+F+DLTN+
Sbjct: 52 YDLWLAENGGGSPNALGGEHERRFLVFWDNLKFVDAHNARADERGGFRLGMNRFADLTNE 111
Query: 103 EFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWA 162
EFRA + G K+ + RS + +Y++ + ++P S+DWR+KGAV P+KNQ +CG CWA
Sbjct: 112 EFRATFLGAKV---AERSRAAGE-RYRHDGVEELPESVDWREKGAVAPVKNQGQCGSCWA 167
Query: 163 FAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFAYIIQNQGIATED 221
F+AV+ VE I ++ +G +I LSEQ+L++CSTNG N+GC GG + AF +II+N GI TED
Sbjct: 168 FSAVSTVESINQLVTGEMITLSEQELVECSTNGQNSGCNGGLMDDAFDFIIKNGGIDTED 227
Query: 222 EYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSY 280
+YPY+AV G C ++ A I +E+VP DE++L KAV+ QPVS+AI A EFQ Y
Sbjct: 228 DYPYKAVDGKCDINRENAKVVSIDGFEDVPQNDEKSLQKAVAHQPVSVAIEAGGREFQLY 287
Query: 281 KEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGL 336
G+F+G CGT LDH V VG+G T++G +YW+++NSWG WG++GY+++ R+ G
Sbjct: 288 HSGVFSGRCGTSLDHGVVAVGYG-TDNGKDYWIVRNSWGPKWGESGYVRMERNINVTTGK 346
Query: 337 CGIGTRSSYP 346
CGI +SYP
Sbjct: 347 CGIAMMASYP 356
>gi|356515062|ref|XP_003526220.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 337
Score = 287 bits (734), Expect = 5e-75, Method: Compositional matrix adjust.
Identities = 142/308 (46%), Positives = 195/308 (63%), Gaps = 8/308 (2%)
Query: 47 HEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRA 106
HEKWMAQHG+ YKD EKE L+IF+ N+E+IE + G++++ L TNQF+DL ++EF+A
Sbjct: 32 HEKWMAQHGKVYKDAAEKERCLQIFENNMEFIESFDVCGDKSFNLSTNQFADLHDEEFKA 91
Query: 107 LYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFA-A 165
L T S +TT + F+Y N+ T +P S+DWR +G VTPIK+Q +C CWAF+
Sbjct: 92 LLTNGHKKEHSLWTTTETLFRYDNV--TKIPASMDWRKRGVVTPIKDQGKCLSCWAFSLC 149
Query: 166 VAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPY 225
VA +EG+ +I + L+ LSEQ+L+D + GC G E AF +I + I +E YPY
Sbjct: 150 VATIEGLHQIITSELVPLSEQELVDFVKGESEGCYGDYVEDAFKFITKKGRIESETHYPY 209
Query: 226 QAVPGTCSAAQKP-AAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGI 284
+ V TC ++ A+I Y++VPS E ALLKAV+ Q VS+++ A + FQ Y GI
Sbjct: 210 KGVNNTCKVKKETHGVAQIKGYKKVPSKSENALLKAVANQLVSVSVEARDSAFQFYSSGI 269
Query: 285 FNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIG 340
F G CGT DH V + +G + DG YWL KNSWG WG+ GY++I D EGLCGI
Sbjct: 270 FTGKCGTDTDHRVALASYGESGDGTKYWLAKNSWGTEWGEKGYIRIKXDIPAKEGLCGIA 329
Query: 341 TRSSYPLA 348
YP+A
Sbjct: 330 KYPYYPIA 337
>gi|110737959|dbj|BAF00916.1| cysteine proteinase [Arabidopsis thaliana]
Length = 376
Score = 287 bits (734), Expect = 6e-75, Method: Compositional matrix adjust.
Identities = 146/322 (45%), Positives = 207/322 (64%), Gaps = 16/322 (4%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDEL----EKEMRLKIFKENLEYIEKANKEG-NRTYKLGTN 94
++ V I+ +W A+HG++ + +++ R IFK+NL +I+ N+ N TYKLG
Sbjct: 42 DEEVRSIYLQWSAEHGKTNNNNNGIINDQDKRFNIFKDNLRFIDLHNENNKNATYKLGLT 101
Query: 95 QFSDLTNDEFRALYTGYKMPSPSHRSTTSSTF--KYQN-LSMTDVPTSLDWRDKGAVTPI 151
+F+DLTNDE+R LY G + P+ R + KY ++ +VP ++DWR KGAV PI
Sbjct: 102 KFTDLTNDEYRKLYLGART-EPARRIAKAKNVNQKYSAAVNGKEVPETVDWRQKGAVNPI 160
Query: 152 KNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYI 211
K+Q CG CWAF+ AAVEGI KI +G LI LSEQ+L+DC + N GC GG + AF +I
Sbjct: 161 KDQGTCGSCWAFSTTAAVEGINKIVTGELISLSEQELVDCDKSYNQGCNGGLMDYAFQFI 220
Query: 212 IQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAI 270
++N G+ TE +YPY+ G C++ K + I YE+VP+ DE AL KA+S QPV +AI
Sbjct: 221 MKNGGLNTEKDYPYRGFGGKCNSFLKNSRVVSIDGYEDVPTKDETALKKAISYQPVRVAI 280
Query: 271 AAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKI 330
A FQ Y+ GIF G CGT LDHAV VG+G +E+G +YW+++NSWG WG+ GY+++
Sbjct: 281 EAGGRIFQHYQSGIFTGSCGTNLDHAVVAVGYG-SENGVDYWIVRNSWGPRWGEEGYIRM 339
Query: 331 VRD-----EGLCGIGTRSSYPL 347
R+ G CGI +SYP+
Sbjct: 340 ERNLAASKSGKCGIAVEASYPV 361
>gi|226506492|ref|NP_001140873.1| uncharacterized protein LOC100272949 precursor [Zea mays]
gi|194701540|gb|ACF84854.1| unknown [Zea mays]
Length = 379
Score = 287 bits (734), Expect = 6e-75, Method: Compositional matrix adjust.
Identities = 144/355 (40%), Positives = 207/355 (58%), Gaps = 23/355 (6%)
Query: 12 KINTTPMFIIITLLVSCASQVVSSRSTHEQSVV------EIHEKWMAQHGRSYKDELEKE 65
+++ T + + + + S A ++ + E+ + +++E+W H R ++ EK
Sbjct: 3 QVSKTLLLVALVFVSSAAVELCRAIDFDERDLASDEALWDLYERWQTHH-RVHRHHGEKG 61
Query: 66 MRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKM------PSPSHR 119
R FKEN+ +I NK G+R Y+L N+F D+ +EFR+ + ++ SP+ R
Sbjct: 62 RRFGTFKENVRFIHAHNKRGDRPYRLRLNRFGDMGREEFRSTFADSRINDLRRQDSPAAR 121
Query: 120 STTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGN 179
+ F Y S D P S+DWR +GAVT +K Q CG CWAF+ V AVEGI IR+G+
Sbjct: 122 AGAVPGFMYD--SAADPPRSVDWRQEGAVTGVKVQGHCGSCWAFSTVVAVEGINAIRTGS 179
Query: 180 LIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQ--- 236
L LSEQ+L+DC T+ NGC GG E AF +I GI TE YPY+A GTC +
Sbjct: 180 LASLSEQELIDCDTD-ENGCQGGLMENAFEFIKSFGGITTEAAYPYRASNGTCDGDRARR 238
Query: 237 -KPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDH 295
I ++ VP+G E AL KAV+ QPVS+A+ A FQ Y EG+F G CGT LDH
Sbjct: 239 GGGVVVVIDGHQMVPAGSEDALAKAVAHQPVSVAVDAGGQAFQFYSEGVFTGDCGTDLDH 298
Query: 296 AVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR---DEGLCGIGTRSSYPL 347
V VG+G +DG YW++KNSWG +WG+ GY+++ R + GLCGI +S+P+
Sbjct: 299 GVAAVGYGVGDDGTPYWIVKNSWGTSWGEGGYIRMQRGAGNGGLCGIAMEASFPI 353
>gi|146215988|gb|ABQ10196.1| actinidin Act3a [Actinidia eriantha]
Length = 380
Score = 287 bits (734), Expect = 6e-75, Method: Compositional matrix adjust.
Identities = 150/341 (43%), Positives = 217/341 (63%), Gaps = 12/341 (3%)
Query: 13 INTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFK 72
I+ + +F L+ S A S V+ ++E W+ ++G+SY E+EMR++IFK
Sbjct: 8 ISMSLLFFSTFLIFSFAIDAKISPLRTNDEVMALYESWLVKYGKSYNSLGEREMRIEIFK 67
Query: 73 ENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
ENL +I++ N + NR+Y +G NQF+DLT++E+R+ Y G+K S +S S+ + Q
Sbjct: 68 ENLRFIDEHNADPNRSYTVGLNQFADLTDEEYRSTYLGFK---SSLKSKVSNRYMPQVGE 124
Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
+ +P +DWR GAV +KNQ C CWAFA +A VE I +I +G+LI LSEQ+L+DC+
Sbjct: 125 V--LPDYVDWRTTGAVVDVKNQGLCSSCWAFATIATVESINQIITGDLISLSEQELVDCN 182
Query: 193 -TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAA-AKISNYEEVP 250
T N GC GG + A+ +II N GI TE+ YPY C +K I +YE+VP
Sbjct: 183 RTPINEGCKGGFMDDAYEFIINNGGINTEENYPYIGQDDQCDEPKKNQNYVTIDSYEQVP 242
Query: 251 SGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFN-GVCGTQLDHAVTIVGFGTTEDGA 309
DE A+ +AV+ QPVS+AI AY F+ Y+ GIF G CGT L+HAVTI+G+G TE+G
Sbjct: 243 PNDELAMKRAVAYQPVSVAIDAYCLGFRFYQSGIFTGGSCGTTLNHAVTIIGYG-TENGI 301
Query: 310 NYWLIKNSWGNTWGDAGYMKIVRD---EGLCGIGTRSSYPL 347
+YW++KNS+G WG++GY K+ R+ EG CGI + YP+
Sbjct: 302 DYWIVKNSYGTQWGESGYGKVQRNVGGEGRCGIASYPFYPV 342
>gi|242066206|ref|XP_002454392.1| hypothetical protein SORBIDRAFT_04g029960 [Sorghum bicolor]
gi|241934223|gb|EES07368.1| hypothetical protein SORBIDRAFT_04g029960 [Sorghum bicolor]
Length = 356
Score = 287 bits (734), Expect = 6e-75, Method: Compositional matrix adjust.
Identities = 147/312 (47%), Positives = 210/312 (67%), Gaps = 11/312 (3%)
Query: 43 VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTND 102
+V + + W +H + Y EK R IFK+NL +I + N++ N +Y LG NQF+D+T++
Sbjct: 41 LVNLFKSWSVKHRKIYVSPKEKLKRYGIFKQNLMHIAETNRK-NGSYWLGLNQFADITHE 99
Query: 103 EFRALYTGYK--MPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCC 160
EF+A + G K + ++ T +TF+Y + ++P S+DWR KGAVTP+KNQ +CG C
Sbjct: 100 EFKANHLGLKQGLSRMGAQTRTPTTFRYA--AAANLPWSVDWRYKGAVTPVKNQGKCGSC 157
Query: 161 WAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATE 220
WAF++VAAVEGI +I +G L+ LSEQ+L+DC T ++GC GG + AFAYI+ +QGI E
Sbjct: 158 WAFSSVAAVEGINQIVTGKLVSLSEQELMDCDTMLDHGCEGGLMDFAFAYIMGSQGIHAE 217
Query: 221 DEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQS 279
D+YPY G C Q A I+ YE+VP E +LLKA++ QPVS+ IAA S +FQ
Sbjct: 218 DDYPYLMEEGYCKEKQPYANVVTITGYEDVPENSEISLLKALAHQPVSVGIAAGSRDFQF 277
Query: 280 YKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIV----RDEG 335
YK G+F+G C +LDHA+T VG+G++ G NY +KNSWG WG+ GY++I + EG
Sbjct: 278 YKGGVFDGSCSDELDHALTAVGYGSSY-GQNYITMKNSWGKNWGEQGYVRIKMGTGKPEG 336
Query: 336 LCGIGTRSSYPL 347
+CGI T +SYP+
Sbjct: 337 VCGIYTMASYPV 348
>gi|357446979|ref|XP_003593765.1| Cysteine proteinase [Medicago truncatula]
gi|355482813|gb|AES64016.1| Cysteine proteinase [Medicago truncatula]
Length = 364
Score = 287 bits (734), Expect = 6e-75, Method: Compositional matrix adjust.
Identities = 155/292 (53%), Positives = 197/292 (67%), Gaps = 11/292 (3%)
Query: 61 ELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRS 120
ELEK R +IFK NLEYIE N GN++YKLG NQ+SDLT+DEF A +TG K+ S
Sbjct: 78 ELEK--RKRIFKNNLEYIENFNNAGNKSYKLGLNQYSDLTSDEFLASHTGLKVSKQLSSS 135
Query: 121 TTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNL 180
S NL+ DVPT+ DWR +GAVT +K+Q CGCCWAF+ VAAVEG KI +G L
Sbjct: 136 KMRSAAVPFNLN-DDVPTNFDWRQQGAVTDVKDQGSCGCCWAFSVVAAVEGAVKINTGEL 194
Query: 181 IQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSA-AQKPA 239
I LSEQQL+DC N+GC GG+ + AF YIIQ +GI +E +YPYQ TC Q
Sbjct: 195 ISLSEQQLVDCDER-NSGCHGGNMDSAFKYIIQ-KGIVSEADYPYQEGSQTCQLNDQMKF 252
Query: 240 AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTI 299
A+I+N+ +VP+ DEQ LL+AV+ QPVS+ I EFQ Y +++G CG ++HAVT
Sbjct: 253 EAQITNFIDVPANDEQQLLQAVAQQPVSVGIEV-GDEFQHYMGDVYSGTCGQSMNHAVTA 311
Query: 300 VGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE----GLCGIGTRSSYPL 347
VG+G +EDG YWLIKNSWG WG+ GYMK++R+ G CGI +SYP+
Sbjct: 312 VGYGVSEDGTKYWLIKNSWGKGWGEEGYMKLLRESGEPGGQCGIAAHASYPI 363
>gi|224102377|ref|XP_002312656.1| predicted protein [Populus trichocarpa]
gi|222852476|gb|EEE90023.1| predicted protein [Populus trichocarpa]
Length = 358
Score = 286 bits (733), Expect = 8e-75, Method: Compositional matrix adjust.
Identities = 146/317 (46%), Positives = 202/317 (63%), Gaps = 16/317 (5%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
E+ + +++E+W + H S + EK+ R +FKENL++I K N + +R YKL N F+D+
Sbjct: 33 EERLRDLYERWRSHHTVS-RSLAEKQERFNVFKENLKHIHKVNHK-DRPYKLKLNSFADM 90
Query: 100 TNDEFRALYTGYKMPS----PSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQK 155
TN EF Y G K+ R T S + + +P+S+DWR GAVT IK+Q
Sbjct: 91 TNHEFLQHYGGSKVSHYRVLRGQRQGTGSMHE----DTSKLPSSVDWRKNGAVTGIKDQG 146
Query: 156 ECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQ 215
+CG CWAF+ VAAVEGI KI++G LI LSEQ+L+DC ++ N+GC GG E AF +I Q
Sbjct: 147 KCGSCWAFSTVAAVEGINKIKTGELISLSEQELVDCDSD-NHGCNGGLMEDAFNFIKQIG 205
Query: 216 GIATEDEYPYQAVPGTC-SAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYS 274
G+ +E+ YPY+A C S I YE VP DE AL+KAV+ QPV+IA+ A
Sbjct: 206 GLTSENTYPYRAKEEPCDSNKMNSPVVNIDGYEMVPENDENALMKAVANQPVAIAMDAGG 265
Query: 275 TEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR-- 332
+ Q Y E IF G CGT+L+H V +VG+GTT+DG YW++KNSWG WG+ GY+++ R
Sbjct: 266 KDLQFYSEAIFTGDCGTELNHGVALVGYGTTQDGTKYWIVKNSWGTDWGEKGYIRMQRGI 325
Query: 333 --DEGLCGIGTRSSYPL 347
+EGLCGI +SYP+
Sbjct: 326 DAEEGLCGITMEASYPV 342
>gi|115477767|ref|NP_001062479.1| Os08g0556900 [Oryza sativa Japonica Group]
gi|42407937|dbj|BAD09076.1| putative cysteine proteinase [Oryza sativa Japonica Group]
gi|113624448|dbj|BAF24393.1| Os08g0556900 [Oryza sativa Japonica Group]
gi|125562525|gb|EAZ07973.1| hypothetical protein OsI_30231 [Oryza sativa Indica Group]
gi|215701458|dbj|BAG92882.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 385
Score = 286 bits (733), Expect = 8e-75, Method: Compositional matrix adjust.
Identities = 147/320 (45%), Positives = 201/320 (62%), Gaps = 19/320 (5%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
E+++ E++E+W QH R +D EK R +FK+N+ I + N+ + YKL N+F D+
Sbjct: 41 EEALWELYERWRGQH-RVARDLGEKARRFNVFKDNVRLIHEFNRR-DEPYKLRLNRFGDM 98
Query: 100 TNDEFRALYTGYKMPSPSH------RSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKN 153
T DEFR Y ++ SH R S F Y D+P ++DWR+KGAV +K+
Sbjct: 99 TADEFRRAYASSRV---SHHRMFRGRGERRSGFMYAG--ARDLPAAVDWREKGAVGAVKD 153
Query: 154 QKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYII 212
Q +CG CWAF+ +AAVEGI IR+ NL LSEQQL+DC T GN GC GG + AF YI
Sbjct: 154 QGQCGSCWAFSTIAAVEGINAIRTSNLTALSEQQLVDCDTKTGNAGCDGGLMDNAFQYIA 213
Query: 213 QNQGIATEDEYPYQAVPGTCSAAQKPAA-AKISNYEEVPSGDEQALLKAVSMQPVSIAIA 271
++ G+A YPY+A +C ++ + I YE+VP+ E AL KAV+ QPVS+AI
Sbjct: 214 KHGGVAASSAYPYRARQSSCKSSAASSPAVTIDGYEDVPANSESALKKAVANQPVSVAIE 273
Query: 272 AYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIV 331
A + FQ Y EG+F G CGT+LDH V VG+GTT DG YW+++NSWG WG+ GY+++
Sbjct: 274 AGGSHFQFYSEGVFAGKCGTELDHGVAAVGYGTTVDGTKYWIVRNSWGADWGEKGYIRMK 333
Query: 332 RD----EGLCGIGTRSSYPL 347
RD EGLCGI +SYP+
Sbjct: 334 RDVSAKEGLCGIAMEASYPI 353
>gi|302763837|ref|XP_002965340.1| hypothetical protein SELMODRAFT_143126 [Selaginella moellendorffii]
gi|302790566|ref|XP_002977050.1| hypothetical protein SELMODRAFT_232903 [Selaginella moellendorffii]
gi|300155026|gb|EFJ21659.1| hypothetical protein SELMODRAFT_232903 [Selaginella moellendorffii]
gi|300167573|gb|EFJ34178.1| hypothetical protein SELMODRAFT_143126 [Selaginella moellendorffii]
Length = 300
Score = 286 bits (732), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 140/303 (46%), Positives = 198/303 (65%), Gaps = 8/303 (2%)
Query: 46 IHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFR 105
+ E W A+HG+SY + EK RL IF + L YIEK N N T+ LG N+FSDLTN EFR
Sbjct: 1 MFEGWAAKHGKSYSSDWEKARRLMIFSDTLAYIEKHNALPNTTFTLGLNKFSDLTNAEFR 60
Query: 106 ALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAA 165
A Y G K P ++ + K ++ ++ +PTSLDWR +GAVTPIK+Q +CG CWAF+A
Sbjct: 61 ANYVG-KFKPPRYQDRRPA--KDVDVDVSSLPTSLDWRQEGAVTPIKDQGQCGSCWAFSA 117
Query: 166 VAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPY 225
+A++E + + L+ LSEQQL+DC T + GC GG E AF ++++N G+ TE+ YPY
Sbjct: 118 IASIESAHFLATKELVSLSEQQLIDCDTV-DQGCQGGFPEDAFKFVVENGGVTTEEAYPY 176
Query: 226 QAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIF 285
G+C+ A K +I+ Y++V AL+KAVS PV++ I FQ+Y+ GI
Sbjct: 177 TGFAGSCN-ANKNKVVEITGYKDVTKDSADALMKAVSKTPVTVGICGSDQNFQNYRSGIL 235
Query: 286 NGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD--EGLCGIGTRS 343
+G C DHAV ++G+G TE G YW+IKNSWG +WG+ G+M+I ++ EG+CG+ +S
Sbjct: 236 SGHCSNSRDHAVLVIGYG-TEGGMPYWIIKNSWGTSWGEDGFMRIKKEDGEGMCGMNGQS 294
Query: 344 SYP 346
SYP
Sbjct: 295 SYP 297
>gi|30141023|dbj|BAC75925.1| cysteine protease-3 [Helianthus annuus]
Length = 348
Score = 286 bits (732), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 140/317 (44%), Positives = 203/317 (64%), Gaps = 17/317 (5%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
++S+ +++E+W +QH S + EK+ R +FK N+ +I + N+ G + YKL N+F+D+
Sbjct: 33 DKSLWDLYERWGSQHMVSRAPD-EKKKRFNVFKYNVNHINRVNQLG-KPYKLKLNEFADM 90
Query: 100 TNDEFRALYTG----YKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQK 155
TN EF+A + ++M R T + + TD P S+DWR GAV PIKNQ
Sbjct: 91 TNHEFKAGFDSKILHFRMLKGKRRQTP-----FTHAKTTDPPPSIDWRTNGAVNPIKNQG 145
Query: 156 ECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQ 215
CG CWAF+ + VEGI KI++ L+ LSEQ+L+DC T+ GC GG E + +I +
Sbjct: 146 RCGSCWAFSTIVGVEGINKIKTNQLVSLSEQELVDCETDCE-GCNGGLMENGYEFIKETG 204
Query: 216 GIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYS 274
G+ TE YPY A G C +++ + KI +E VP+ DE A+L+AV+ QPVSIAI A
Sbjct: 205 GVTTEQIYPYFARNGRCDISKRNSPVVKIDGFENVPANDESAMLRAVANQPVSIAIDAGG 264
Query: 275 TEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR-- 332
FQ Y +G+FNG CGT+L+H V IVG+GTT+DG NYW+++NSWG WG+ GY+++ R
Sbjct: 265 LNFQFYSQGVFNGACGTELNHGVAIVGYGTTQDGTNYWIVRNSWGTGWGEQGYVRMQRGV 324
Query: 333 --DEGLCGIGTRSSYPL 347
EGLCG+ +SYP+
Sbjct: 325 NVPEGLCGLAMDASYPI 341
>gi|218195711|gb|EEC78138.1| hypothetical protein OsI_17694 [Oryza sativa Indica Group]
Length = 458
Score = 286 bits (732), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 145/324 (44%), Positives = 202/324 (62%), Gaps = 12/324 (3%)
Query: 32 VVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN---KEGNRT 88
+VS E+ ++ +W A+HG++Y E+E R F++NL YI++ N G +
Sbjct: 25 IVSYGERSEEEARRLYAEWKAEHGKNYNAVGEEERRYAAFRDNLRYIDEHNAAADAGVHS 84
Query: 89 YKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAV 148
++LG N+F+DLTN+E+R Y G + R + N ++ P S+DWR KGAV
Sbjct: 85 FRLGLNRFADLTNEEYRDTYLGLRNKPRRERKVSDRYLAADNEAL---PESVDWRTKGAV 141
Query: 149 TPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAF 208
IK+Q CG CWAF+A+AAVEGI +I +G+LI LSEQ+L+DC T+ N GC GG + AF
Sbjct: 142 AEIKDQGGCGSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDTSYNEGCNGGLMDYAF 201
Query: 209 AYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVS 267
+II N GI TED+YPY+ C +K A I +YE+V E +L KAV+ QPVS
Sbjct: 202 DFIINNGGIDTEDDYPYKGKDERCDVNRKNAKVVTIDSYEDVTPNSETSLQKAVANQPVS 261
Query: 268 IAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGY 327
+AI A FQ Y GIF G CGT LDH V VG+G TE+G +YW+++NSWG +WG++GY
Sbjct: 262 VAIEAGGRAFQLYSSGIFTGKCGTALDHGVAAVGYG-TENGKDYWIVRNSWGKSWGESGY 320
Query: 328 MKIVRD----EGLCGIGTRSSYPL 347
+++ R+ G CGI SYPL
Sbjct: 321 VRMERNIKASSGKCGIAVEPSYPL 344
>gi|194352762|emb|CAQ00109.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
gi|326517250|dbj|BAJ99991.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 367
Score = 286 bits (732), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 149/315 (47%), Positives = 199/315 (63%), Gaps = 9/315 (2%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
E S+ ++E+W QH + +D EK R +F+EN+ I + N+ G+ YKL N+F D+
Sbjct: 40 EDSLWALYERWREQHTVA-RDLGEKARRFNVFRENVRLIHEFNR-GDAPYKLRLNRFGDM 97
Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFKYQN---LSMTDVPTSLDWRDKGAVTPIKNQKE 156
T DEFR Y ++ S + + S+ DVP S+DWR KGAVT +K+Q +
Sbjct: 98 TADEFRRAYASSRVSHHRMFSLKEGGGGFMHGSAASVRDVPPSVDWRQKGAVTAVKDQGQ 157
Query: 157 CGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQG 216
CG CWAF+ +AAVEGI IRS NL LSEQQL+DC T N GC GG + AF YI ++ G
Sbjct: 158 CGSCWAFSTIAAVEGINAIRSKNLTSLSEQQLVDCDTKSNAGCNGGLMDYAFQYIAKHGG 217
Query: 217 IATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTE 276
+A ED YPY+A + + A I YE+VP+ DE AL KAV+ QPV++AI A +
Sbjct: 218 VAAEDAYPYKARQASSCNKKPSAVVTIDGYEDVPANDETALKKAVAAQPVAVAIEASGSH 277
Query: 277 FQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD--- 333
FQ Y EG+F G CGT+LDH V VG+GTT DG YW++KNSWG WG+ GY+++ RD
Sbjct: 278 FQFYSEGVFAGKCGTELDHGVAAVGYGTTVDGTKYWIVKNSWGPEWGEKGYIRMKRDVKD 337
Query: 334 -EGLCGIGTRSSYPL 347
EGLCGI +SYP+
Sbjct: 338 KEGLCGIAMEASYPV 352
>gi|2224812|emb|CAB09699.1| cysteine endopeptidase EP-A [Hordeum vulgare subsp. vulgare]
Length = 365
Score = 286 bits (732), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 143/317 (45%), Positives = 197/317 (62%), Gaps = 14/317 (4%)
Query: 40 EQSVVEIHEKWMAQHGRSYKD--ELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
E+S+ ++E+W + + S + +E R +FK+N Y+ + NK + ++L N+F+
Sbjct: 34 EESLRGLYERWRSHYTVSRRGLGADAEERRFNVFKQNARYVHEGNKR-DMPFRLALNKFA 92
Query: 98 DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD---VPTSLDWRDKGAVTPIKNQ 154
D+T DEFR Y G ++ H S + D +P ++DWR KGAVT IK+Q
Sbjct: 93 DMTTDEFRRTYAGSRVRH--HLSLSGGRRGDGGFRYGDADNLPPAVDWRQKGAVTAIKDQ 150
Query: 155 KECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQN 214
+CG CWAF+ + AVEGI KIR+G L+ LSEQ+L+DC N GC GG + AF +I +N
Sbjct: 151 GQCGSCWAFSTIVAVEGINKIRTGKLVSLSEQELMDCDNVNNQGCDGGLMDYAFQFIQKN 210
Query: 215 QGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAY 273
GI TE YPYQ G+C A++ A A I YE+VP+ DE AL KAV+ QPVS+AI A
Sbjct: 211 -GITTESNYPYQGEQGSCDQAKENAQAVTIDGYEDVPANDESALQKAVAGQPVSVAIDAS 269
Query: 274 STEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD 333
+FQ Y EG+F G C T LDH V VG+G T DG YW++KNSWG WG+ GY+++ R
Sbjct: 270 GQDFQFYSEGVFTGECSTDLDHGVAAVGYGATRDGTKYWIVKNSWGEDWGEKGYIRMQRG 329
Query: 334 ----EGLCGIGTRSSYP 346
EGLCGI ++SYP
Sbjct: 330 VSQTEGLCGIAMQASYP 346
>gi|218183|dbj|BAA14403.1| oryzain beta precursor [Oryza sativa Japonica Group]
Length = 471
Score = 286 bits (731), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 141/310 (45%), Positives = 211/310 (68%), Gaps = 15/310 (4%)
Query: 47 HEKWMAQHGRSYKDEL--EKEMRLKIFKENLEYIEKANKEGNRT--YKLGTNQFSDLTND 102
++ W+A++G + L E E R +F +NL++++ N + ++LG N+F+DLTN+
Sbjct: 51 YDLWLAENGGGSPNALGGEHERRFLVFWDNLKFVDAHNARADEGGGFRLGMNRFADLTNE 110
Query: 103 EFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWA 162
EFRA + G K+ + RS + +Y++ + ++P S+DWR+KGAV P+KNQ +CG CWA
Sbjct: 111 EFRATFLGAKV---AERSRAAGE-RYRHDGVEELPESVDWREKGAVAPVKNQGQCGSCWA 166
Query: 163 FAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFAYIIQNQGIATED 221
F+AV+ VE I ++ +G +I LSEQ+L++CSTNG N+GC GG AF +II+N GI TED
Sbjct: 167 FSAVSTVESINQLVTGEMITLSEQELVECSTNGQNSGCNGGLMADAFDFIIKNGGIDTED 226
Query: 222 EYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSY 280
+YPY+AV G C ++ A I +E+VP DE++L KAV+ QPVS+AI A EFQ Y
Sbjct: 227 DYPYKAVDGKCDINRENAKVVSIDGFEDVPQNDEKSLQKAVAHQPVSVAIEAGGREFQLY 286
Query: 281 KEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGL 336
G+F+G CGT LDH V VG+G T++G +YW+++NSWG WG++GY+++ R+ G
Sbjct: 287 HSGVFSGRCGTSLDHGVVAVGYG-TDNGKDYWIVRNSWGPKWGESGYVRMERNINVTTGK 345
Query: 337 CGIGTRSSYP 346
CGI +SYP
Sbjct: 346 CGIAMMASYP 355
>gi|242032709|ref|XP_002463749.1| hypothetical protein SORBIDRAFT_01g005350 [Sorghum bicolor]
gi|241917603|gb|EER90747.1| hypothetical protein SORBIDRAFT_01g005350 [Sorghum bicolor]
Length = 381
Score = 285 bits (730), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 145/324 (44%), Positives = 201/324 (62%), Gaps = 22/324 (6%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNR-TYKLGTNQFSD 98
++++ +++E+W H R ++ EK R FKEN+ +I NK G+R +Y+L N+F D
Sbjct: 39 DEALWDLYERWQTHH-RVHRHHGEKGRRFGTFKENVRFIHAHNKRGDRPSYRLRLNRFGD 97
Query: 99 LTNDEFRALYTG--------YKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTP 150
+ +EFR+ + Y+ SP+ +T F Y + TDVP S+DWR GAVT
Sbjct: 98 MGPEEFRSTFADSRINDLRRYRESSPA--ATAVPGFMYDD--ATDVPRSVDWRQHGAVTA 153
Query: 151 IKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAY 210
+KNQ CG CWAF+ V AVEGI IR+G+L+ LSEQ+L+DC T NGC GG E AF +
Sbjct: 154 VKNQGRCGSCWAFSTVVAVEGINAIRTGSLVSLSEQELVDCDT-AENGCQGGLMENAFDF 212
Query: 211 IIQNQGIATEDEYPYQAVPGTCS---AAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVS 267
I GI TE YPY+A GTC A + I ++ VP+G E AL KAV+ QPVS
Sbjct: 213 IKSYGGITTESAYPYRASNGTCDGMRARRGRVHVSIDGHQMVPTGSEDALAKAVARQPVS 272
Query: 268 IAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTE-DGANYWLIKNSWGNTWGDAG 326
+AI A FQ Y EG+F G CGT LDH V +VG+G ++ DG YW++KNSWG +WG+ G
Sbjct: 273 VAIDAGGQAFQFYSEGVFTGDCGTDLDHGVAVVGYGVSDVDGTPYWIVKNSWGPSWGEGG 332
Query: 327 YMKIVR---DEGLCGIGTRSSYPL 347
Y+++ R + GLCGI +S+P+
Sbjct: 333 YIRMQRGAGNGGLCGIAMEASFPI 356
>gi|168006315|ref|XP_001755855.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162693174|gb|EDQ79528.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 454
Score = 285 bits (730), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 144/317 (45%), Positives = 203/317 (64%), Gaps = 14/317 (4%)
Query: 39 HEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSD 98
+E+ + E W +HG+ Y E R ++K+NLEYI++ + E NR+Y LG +F+D
Sbjct: 38 NERLLSEQFGAWAHKHGKVYSSLEEHAHRYMVWKDNLEYIQR-HSEKNRSYWLGLTKFAD 96
Query: 99 LTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECG 158
+TNDEFR YTG ++ S RS + F+Y + ++ P S+DWR KGAVT +K+Q CG
Sbjct: 97 ITNDEFRRQYTGTRIDR-SKRSKRKTGFRYAD---SEAPESVDWRKKGAVTTVKDQGSCG 152
Query: 159 CCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIA 218
CWAF+A+ +VEGI IR+G + LSEQ+L+DC N GC GG + AF +I++N GI
Sbjct: 153 SCWAFSAIGSVEGINAIRTGEAVSLSEQELVDCDLEYNQGCNGGLMDYAFDFILENGGID 212
Query: 219 TEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEF 277
TE++YPY+ + G C +K A I YE+VP DE+AL KAV+ QPVS+AI A +F
Sbjct: 213 TENDYPYKGLDGRCDNNKKNAHVVTIDGYEDVPENDEEALKKAVAGQPVSVAIEAGGRDF 272
Query: 278 QSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD---- 333
Q Y G+F G CGT LDH V VG+G +E +YW++KNSWG WG++GY+++ R+
Sbjct: 273 QLYSGGVFTGECGTDLDHGVLAVGYG-SEGSLDYWIVKNSWGEYWGESGYLRMQRNIKDS 331
Query: 334 ---EGLCGIGTRSSYPL 347
GLCGI SY +
Sbjct: 332 NHQFGLCGINIEPSYAV 348
>gi|224083362|ref|XP_002306996.1| predicted protein [Populus trichocarpa]
gi|222856445|gb|EEE93992.1| predicted protein [Populus trichocarpa]
Length = 336
Score = 285 bits (730), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 142/314 (45%), Positives = 208/314 (66%), Gaps = 10/314 (3%)
Query: 38 THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
T ++++ E W+++H + Y+ EK R +IFK+NL +I++ NK+ Y LG N+F+
Sbjct: 24 TSRDRIIDLFESWISKHQKIYESIEEKWHRFEIFKDNLFHIDETNKK-VVNYWLGLNEFA 82
Query: 98 DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKEC 157
DL+++EF+ Y G + S+R S F Y+++S +P S+DWR KGAVT +KNQ C
Sbjct: 83 DLSHEEFKNKYLGLNV-DLSNRRECSEEFTYKDVS--SIPKSVDWRKKGAVTDVKNQGSC 139
Query: 158 GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
G CWAF+ VAAVEGI +I +GNL LSEQ+L+DC T NNGC GG + AFAYII N G+
Sbjct: 140 GSCWAFSTVAAVEGINQIVTGNLTSLSEQELVDCDTTYNNGCNGGLMDYAFAYIISNGGL 199
Query: 218 ATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTE 276
E++YPY GTC + + IS Y +VP E++LLKA++ QP+S+AI A +
Sbjct: 200 HKEEDYPYIMEEGTCEMRKAESEVVTISGYHDVPQNSEESLLKALANQPLSVAIDASGRD 259
Query: 277 FQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD--- 333
FQ Y G+F+G CGT+LDH V VG+G+ + G ++ ++KNSWG+ WG+ G++++ R+
Sbjct: 260 FQFYSGGVFDGHCGTELDHGVAAVGYGSAK-GLDFIVVKNSWGSKWGEKGFIRMKRNTGK 318
Query: 334 -EGLCGIGTRSSYP 346
GLCGI +SYP
Sbjct: 319 PAGLCGINKMASYP 332
>gi|2351107|dbj|BAA21929.1| bromelain [Ananas comosus]
Length = 312
Score = 285 bits (730), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 138/299 (46%), Positives = 194/299 (64%), Gaps = 7/299 (2%)
Query: 51 MAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTG 110
MA++GR YKD EK R +IFK N+ +IE N +Y LG N+F+D+TN+EF A YTG
Sbjct: 1 MAEYGRVYKDNDEKMRRFQIFKNNVNHIETFNNRNGNSYTLGINKFTDMTNNEFVAQYTG 60
Query: 111 YKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVE 170
+ P + + +++++ V S+DWRD GAVT +K+Q CG CWAF+A+A VE
Sbjct: 61 -GISRPLNIEK-EPVVSFDDVNISAVGQSIDWRDYGAVTEVKDQNPCGSCWAFSAIATVE 118
Query: 171 GITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPG 230
GI KI +G L+ LSEQ++LDC+ + NGC GG + A+ +II N G+A+E +YPYQA G
Sbjct: 119 GIYKIVTGYLVSLSEQEVLDCAVS--NGCDGGFVDNAYDFIISNNGVASEADYPYQAYQG 176
Query: 231 TCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCG 290
C+A P +A I+ Y V S DE ++ AV QP++ AI A FQ Y G+F+G CG
Sbjct: 177 DCAANSWPNSAYITGYSYVRSNDESSMKYAVWNQPIAAAIDASGDNFQYYNGGVFSGPCG 236
Query: 291 TQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR---DEGLCGIGTRSSYP 346
T L+HA+TI+G+G G YW++KNSWG++WG+ GY+++ R GLCGI YP
Sbjct: 237 TSLNHAITIIGYGQDSSGTQYWIVKNSWGSSWGERGYIRMARGVSSSGLCGIAMDPLYP 295
>gi|18141281|gb|AAL60578.1|AF454956_1 senescence-associated cysteine protease [Brassica oleracea]
Length = 445
Score = 285 bits (730), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 142/321 (44%), Positives = 211/321 (65%), Gaps = 12/321 (3%)
Query: 33 VSSRSTHEQ-SVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKL 91
V++++ H V++ E+W+ ++ ++Y EK+ R +IF +NL+++++ N N++Y+L
Sbjct: 22 VTAKADHRNPEEVKMFERWLVENHKNYNGLGEKDKRFEIFMDNLKFVQEHNSVPNQSYEL 81
Query: 92 GTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPI 151
G +F+DLTN+EFRA+Y KM R + S N+ +P +DWR KGAV P+
Sbjct: 82 GLTRFADLTNEEFRAIYLRSKMERT--RDSVKSERYLHNVG-DKLPDEVDWRAKGAVVPV 138
Query: 152 KNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYI 211
K+Q CG CWAF+A+ AVEGI +I++G L+ LSEQ+L+DC T+ NNGC GG + AF +I
Sbjct: 139 KDQGSCGSCWAFSAIGAVEGINQIKTGELVSLSEQELVDCDTSYNNGCGGGLMDYAFQFI 198
Query: 212 IQNQGIATEDEYPYQAVPGT-CSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIA 269
I N GI TE++YPY A C+ +K I YE+VP +E +L KA++ QP+S+A
Sbjct: 199 ISNGGIDTEEDYPYTATDDNICNTDKKNTRVVTIDGYEDVPE-NENSLKKALANQPISVA 257
Query: 270 IAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMK 329
I A FQ YK G+F G CGT LDH V VG+GT+E G +YW+I+NSWG+ WG++GY+K
Sbjct: 258 IEAGGRGFQLYKSGVFTGTCGTALDHGVVAVGYGTSE-GQDYWIIRNSWGSNWGESGYIK 316
Query: 330 IVRD----EGLCGIGTRSSYP 346
+ R+ G CG+ +SYP
Sbjct: 317 LQRNIKDSSGKCGVAMMASYP 337
>gi|302790570|ref|XP_002977052.1| hypothetical protein SELMODRAFT_268054 [Selaginella moellendorffii]
gi|300155028|gb|EFJ21661.1| hypothetical protein SELMODRAFT_268054 [Selaginella moellendorffii]
Length = 300
Score = 285 bits (730), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 140/303 (46%), Positives = 197/303 (65%), Gaps = 8/303 (2%)
Query: 46 IHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFR 105
+ E W A+HG+SY + EK RL IF + L YIEK N N T+ LG N+FSDLTN EFR
Sbjct: 1 MFEGWAAKHGKSYSSDWEKARRLMIFSDTLAYIEKHNALPNTTFTLGLNKFSDLTNAEFR 60
Query: 106 ALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAA 165
A Y G K P ++ + K ++ ++ +PTSLDWR +GAVTPIK+Q +CG CWAF+A
Sbjct: 61 ANYVG-KFKPPRYQDRRPA--KDVDVDVSSLPTSLDWRQEGAVTPIKDQGQCGSCWAFSA 117
Query: 166 VAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPY 225
+A++E + + L+ LSEQQL+DC T + GC GG E AF ++++N G+ TE+ YPY
Sbjct: 118 IASIESAHFLATKELVSLSEQQLIDCDTV-DQGCQGGFPEDAFKFVVENGGVTTEEAYPY 176
Query: 226 QAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIF 285
G+C+ A K +I+ Y++V AL+KAVS PV++ I FQ+Y+ GI
Sbjct: 177 TGFAGSCN-ANKNKVVEITGYKDVTKDSADALMKAVSKTPVTVGICGSDQNFQNYRSGIL 235
Query: 286 NGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD--EGLCGIGTRS 343
+G C DHAV ++G+G TE G YW+IKNSWG +WG+ G+M+I + EG+CG+ +S
Sbjct: 236 SGHCSNSRDHAVLVIGYG-TEGGMPYWIIKNSWGTSWGEDGFMRIKKKDGEGMCGMNGQS 294
Query: 344 SYP 346
SYP
Sbjct: 295 SYP 297
>gi|4100157|gb|AAD10337.1| cysteine proteinase precursor [Hordeum vulgare]
Length = 365
Score = 285 bits (729), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 143/317 (45%), Positives = 196/317 (61%), Gaps = 14/317 (4%)
Query: 40 EQSVVEIHEKWMAQHGRSYKD--ELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
E+S+ ++E+W + + S + E R +FK+N Y+ + NK + ++L N+F+
Sbjct: 34 EESLRGLYERWRSHYTVSRRGLGADAGERRFNVFKQNARYVHEGNKR-DMPFRLALNKFA 92
Query: 98 DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD---VPTSLDWRDKGAVTPIKNQ 154
D+T DEFR Y G ++ H S + D +P ++DWR KGAVT IK+Q
Sbjct: 93 DMTTDEFRRTYAGSRVRH--HLSLSGGRRGDGGFRYGDADNLPPAVDWRQKGAVTAIKDQ 150
Query: 155 KECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQN 214
+CG CWAF+ + AVEGI KIR+G L+ LSEQ+L+DC N GC GG + AF +I +N
Sbjct: 151 GQCGSCWAFSTIVAVEGINKIRTGKLVSLSEQELMDCDNVNNQGCDGGLMDYAFQFIQKN 210
Query: 215 QGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAY 273
GI TE YPYQ G+C A++ A A I YE+VP+ DE AL KAV+ QPVS+AI A
Sbjct: 211 -GITTESNYPYQGEQGSCDQAKENAQAVTIDGYEDVPANDESALQKAVAGQPVSVAIDAS 269
Query: 274 STEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD 333
+FQ Y EG+F G C T LDH V VG+G T DG YW++KNSWG WG+ GY+++ R
Sbjct: 270 GQDFQFYSEGVFTGECSTDLDHGVAAVGYGATRDGTKYWIVKNSWGEDWGEKGYIRMQRG 329
Query: 334 ----EGLCGIGTRSSYP 346
EGLCGI ++SYP
Sbjct: 330 VSQTEGLCGIAMQASYP 346
>gi|600111|emb|CAA84378.1| cysteine proteinase [Vicia sativa]
Length = 359
Score = 285 bits (728), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 149/317 (47%), Positives = 203/317 (64%), Gaps = 16/317 (5%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
E+S+ ++E+W + H + ++ EK R +FK N+ ++ NK ++ YKL N+F D+
Sbjct: 33 EKSLWNLYERWRSHHTVT-RNLDEKHNRFNVFKANVMHVHNTNKL-DKPYKLKLNKFGDM 90
Query: 100 TNDEFRALYTGYKMPSPSHR-----STTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQ 154
TN EFR +Y K+ HR S + TF Y+N DVP+S+DWR+KGAVT +K+Q
Sbjct: 91 TNYEFRRIYADSKISH--HRMFRGMSHENGTFMYEN--AVDVPSSIDWRNKGAVTGVKDQ 146
Query: 155 KECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQN 214
+CG CWAF+ +AAVEGI +I++ L+ LSEQQL+DC T N GC GG E AF +I QN
Sbjct: 147 GQCGSCWAFSTIAAVEGINQIKTQKLVSLSEQQLVDCDTEENEGCNGGLMEYAFEFIKQN 206
Query: 215 QGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYS 274
GI TE YPY A GTC ++ A I +E VP +E ALLKA + QPVS+AI A
Sbjct: 207 -GITTESNYPYAAKDGTCDVEKEDKAVSIDGHENVPINNEAALLKAAAKQPVSVAIDAGG 265
Query: 275 TEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR-- 332
FQ Y EG+F G C T L+H V IVG+G T+D YW++KNSWG+ WG+ GY+++ R
Sbjct: 266 YNFQFYSEGVFTGHCDTDLNHGVAIVGYGVTQDRTKYWIMKNSWGSEWGEQGYIRMQRGI 325
Query: 333 --DEGLCGIGTRSSYPL 347
EGLCGI +SYP+
Sbjct: 326 SSREGLCGIAMEASYPI 342
>gi|38345906|emb|CAE04498.2| OSJNBb0059K02.8 [Oryza sativa Japonica Group]
Length = 458
Score = 285 bits (728), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 145/324 (44%), Positives = 202/324 (62%), Gaps = 12/324 (3%)
Query: 32 VVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN---KEGNRT 88
+VS E+ ++ +W A+HG+SY E+E R F++NL YI++ N G +
Sbjct: 25 IVSYGERSEEEARRLYAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDEHNAAADAGVHS 84
Query: 89 YKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAV 148
++LG N+F+DLTN+E+R Y G + R + N ++ P S+DWR KGAV
Sbjct: 85 FRLGLNRFADLTNEEYRDTYLGLRNKPRRERKVSDRYLAADNEAL---PESVDWRTKGAV 141
Query: 149 TPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAF 208
IK+Q+ G CWAF+A+AAVEGI +I +G+LI LSEQ+L+DC T+ N GC GG + AF
Sbjct: 142 AEIKDQEVAGSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDTSYNEGCNGGLMDYAF 201
Query: 209 AYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVS 267
+II N GI TED+YPY+ C +K A I +YE+V E +L KAV+ QPVS
Sbjct: 202 DFIINNGGIDTEDDYPYKGKDERCDVNRKNAKVVTIDSYEDVTPNSETSLQKAVANQPVS 261
Query: 268 IAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGY 327
+AI A FQ Y GIF G CGT LDH V VG+G TE+G +YW+++NSWG +WG++GY
Sbjct: 262 VAIEAGGRAFQLYSSGIFTGKCGTALDHGVAAVGYG-TENGKDYWIVRNSWGKSWGESGY 320
Query: 328 MKIVRD----EGLCGIGTRSSYPL 347
+++ R+ G CGI SYPL
Sbjct: 321 VRMERNIKASSGKCGIAVEPSYPL 344
>gi|113120269|gb|ABI30274.1| VS-B, partial [Vasconcellea stipulata]
Length = 341
Score = 285 bits (728), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 144/296 (48%), Positives = 190/296 (64%), Gaps = 10/296 (3%)
Query: 41 QSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLT 100
+S + + E WM +H + YK EK R + FK+NL YI++ NK+ N +Y LG N+F+DLT
Sbjct: 42 ESSIRLFESWMLKHDKVYKTIDEKIYRFETFKDNLMYIDETNKK-NNSYWLGLNEFADLT 100
Query: 101 NDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCC 160
+DEF+ Y G +P S S ++ N + D P S+DWR KGAVTP+KNQ CG C
Sbjct: 101 HDEFKEKYVG-SIPEDSMIIEQSDDVEFPNKHVVDYPESIDWRQKGAVTPVKNQNPCGSC 159
Query: 161 WAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATE 220
WAF+ VA VEGI KI +GNLI LSEQ+LLDC ++GC GG + + Y++ N G+ TE
Sbjct: 160 WAFSTVATVEGINKIVTGNLISLSEQELLDCDRR-SHGCKGGYQTTSLKYVVDN-GVHTE 217
Query: 221 DEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQS 279
EYPY+ G C A K I+ Y+ VPS DE +L+K +S+QPVS+ + + FQ
Sbjct: 218 KEYPYEKKQGNCRAKNKKGLKVYINGYKRVPSNDEISLIKTISIQPVSVLVESKGRPFQF 277
Query: 280 YKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDEG 335
YK G+F G CGT+LDHAVT VG+ G +Y LIKNSWG WGD GY+KI R G
Sbjct: 278 YKGGVFGGPCGTKLDHAVTAVGY-----GKDYILIKNSWGPKWGDKGYIKIKRASG 328
>gi|118127|sp|P25251.1|CYSP4_BRANA RecName: Full=Cysteine proteinase COT44; Flags: Precursor
Length = 328
Score = 285 bits (728), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 144/317 (45%), Positives = 204/317 (64%), Gaps = 15/317 (4%)
Query: 44 VEIHEKWMAQHGRSYKDEL----EKEMRLKIFKENLEYIEKANKEG-NRTYKLGTNQFSD 98
+ I+ +W +HG+S + +++ R IFK+NL +I+ N+ N TYKLG F++
Sbjct: 1 MSIYLRWSLEHGKSNSNSNGIINQQDERFNIFKDNLRFIDLHNENNKNATYKLGLTIFAN 60
Query: 99 LTNDEFRALYTGYKMPSPSHRSTTSST--FKYQN-LSMTDVPTSLDWRDKGAVTPIKNQK 155
LTNDE+R+LY G + P R T + KY +++ +VP ++DWR KGAV IK+Q
Sbjct: 61 LTNDEYRSLYLGART-EPVRRITKAKNVNMKYSAAVNVDEVPVTVDWRQKGAVNAIKDQG 119
Query: 156 ECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQ 215
CG CWAF+ AAVEGI KI +G L+ LSEQ+L+DC + N GC GG + AF +I++N
Sbjct: 120 TCGSCWAFSTAAAVEGINKIVTGELVSLSEQELVDCDKSYNQGCNGGLMDYAFQFIMKNG 179
Query: 216 GIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYS 274
G+ TE +YPY G C++ K + I YE+VPS DE AL +AVS QPVS+AI A
Sbjct: 180 GLNTEKDYPYHGTNGKCNSLLKNSRVVTIDGYEDVPSKDETALKRAVSYQPVSVAIDAGG 239
Query: 275 TEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD- 333
FQ Y+ GIF G CGT +DHAV VG+G +E+G +YW+++NSWG WG+ GY+++ R+
Sbjct: 240 RAFQHYQSGIFTGKCGTNMDHAVVAVGYG-SENGVDYWIVRNSWGTRWGEDGYIRMERNV 298
Query: 334 ---EGLCGIGTRSSYPL 347
G CGI +SYP+
Sbjct: 299 ASKSGKCGIAIEASYPV 315
>gi|341850671|gb|AEK97329.1| chromoplast senescence-associated protein 12 [Brassica rapa var.
parachinensis]
Length = 260
Score = 285 bits (728), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 139/259 (53%), Positives = 181/259 (69%), Gaps = 8/259 (3%)
Query: 95 QFSDLTNDEFRALYTGYKMPS--PSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIK 152
QF+++TNDEFR++YTGYK S S T S++F+YQN+S +P ++DWR KGAVTPIK
Sbjct: 1 QFAEITNDEFRSMYTGYKGDSVLSSQSQTKSTSFRYQNVSSGALPIAVDWRKKGAVTPIK 60
Query: 153 NQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYII 212
NQ CGCCWAF+AVAA+EG T+I+ G LI LSEQQL+DC TN + GC GG + AF +I+
Sbjct: 61 NQGSCGCCWAFSAVAAIEGATQIKKGKLISLSEQQLVDCDTN-DFGCSGGLIDTAFEHIM 119
Query: 213 QNQGIATEDEYPYQAVPGTCS-AAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIA 271
G+ TE YPY+ TC + P+AA I+ YE+VP DE AL+KAV+ QPVS+ I
Sbjct: 120 ATGGLTTESNYPYKGEDATCKIKSTXPSAASITGYEDVPVNDENALMKAVAHQPVSVGIE 179
Query: 272 AYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIV 331
+FQ Y G+F G C T LDHAVT VG+ + G+ YW+IKNSWG WG+ GYM+I
Sbjct: 180 GGGFDFQFYSSGVFTGECTTYLDHAVTAVGYSQSSAGSKYWIIKNSWGTKWGEGGYMRIK 239
Query: 332 RD----EGLCGIGTRSSYP 346
+D EGLCG+ ++SYP
Sbjct: 240 KDIKDKEGLCGLAMKASYP 258
>gi|218181|dbj|BAA14402.1| oryzain alpha precursor [Oryza sativa Japonica Group]
Length = 458
Score = 284 bits (727), Expect = 4e-74, Method: Compositional matrix adjust.
Identities = 145/324 (44%), Positives = 200/324 (61%), Gaps = 12/324 (3%)
Query: 32 VVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN---KEGNRT 88
+VS E+ ++ +W A+HG+SY E+E R F++NL YI++ N G +
Sbjct: 25 IVSYGERSEEEARRLYAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDEHNAAADAGVHS 84
Query: 89 YKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAV 148
++LG N+F+DLTN+E+R Y G + R + N ++ P S+DWR KGAV
Sbjct: 85 FRLGLNRFADLTNEEYRDTYLGLRNKPRRERKVSDRYLAADNEAL---PESVDWRTKGAV 141
Query: 149 TPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAF 208
IK+Q CG CWAF+A+AAVE I +I +G+LI LSEQ+L+DC T+ N GC GG + AF
Sbjct: 142 AEIKDQGGCGSCWAFSAIAAVEDINQIVTGDLISLSEQELVDCDTSYNEGCNGGLMDYAF 201
Query: 209 AYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVS 267
+II N GI TED+YPY+ C +K A I +YE+V E +L KAV QPVS
Sbjct: 202 DFIINNGGIDTEDDYPYKGKDERCDVNRKNAKVVTIDSYEDVTPNSETSLQKAVRNQPVS 261
Query: 268 IAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGY 327
+AI A FQ Y GIF G CGT LDH V VG+G TE+G +YW+++NSWG +WG++GY
Sbjct: 262 VAIEAGGRAFQLYSSGIFTGKCGTALDHGVAAVGYG-TENGKDYWIVRNSWGKSWGESGY 320
Query: 328 MKIVRD----EGLCGIGTRSSYPL 347
+++ R+ G CGI SYPL
Sbjct: 321 VRMERNIKASSGKCGIAVEPSYPL 344
>gi|168058022|ref|XP_001781010.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162667567|gb|EDQ54194.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 457
Score = 284 bits (726), Expect = 5e-74, Method: Compositional matrix adjust.
Identities = 145/309 (46%), Positives = 198/309 (64%), Gaps = 17/309 (5%)
Query: 50 WMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYT 109
W +HG+ Y E+ R ++K+NLEYI++ + E N +Y LG +F+DLTN+EFR YT
Sbjct: 48 WAHKHGKVYSAAEERAHRFLVWKDNLEYIQR-HSEKNLSYWLGLTKFADLTNEEFRRQYT 106
Query: 110 GYKMPSPSH----RSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAA 165
G ++ R+ T S F+Y N ++ P S+DWR+KGAVT +K+Q CG CWAF+A
Sbjct: 107 GTRIDRSRRLKKGRNATGS-FRYAN---SEAPKSIDWREKGAVTSVKDQGSCGSCWAFSA 162
Query: 166 VAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPY 225
V +VEGI IR+G+ I LS Q+L+DC N GC GG + AF ++IQN GI TE +YPY
Sbjct: 163 VGSVEGINAIRTGDAISLSVQELVDCDKKYNQGCNGGLMDYAFDFVIQNGGIDTEKDYPY 222
Query: 226 QAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGI 284
Q G C + A I +YE+VP DE+AL KAV+ QPVS+AI A +FQ Y G+
Sbjct: 223 QGYDGRCDVNKMNARVVTIDSYEDVPENDEEALKKAVAGQPVSVAIEAGGRDFQLYSGGV 282
Query: 285 FNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD------EGLCG 338
F G CGT LDH V VG+G +E G +YW++KNSWG WG++GY+++ R+ GLCG
Sbjct: 283 FTGRCGTDLDHGVLAVGYG-SEKGLDYWIVKNSWGEYWGESGYLRMQRNLKDDNGYGLCG 341
Query: 339 IGTRSSYPL 347
I SY +
Sbjct: 342 INIEPSYAV 350
>gi|3688528|emb|CAA06243.1| pre-pro-TPE4A protein [Pisum sativum]
Length = 360
Score = 284 bits (726), Expect = 5e-74, Method: Compositional matrix adjust.
Identities = 148/318 (46%), Positives = 206/318 (64%), Gaps = 17/318 (5%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
E+S+ +++E+W + H + + EK R +FK N+ ++ NK ++ YKL N+F+D+
Sbjct: 33 EKSLWDLYERWRSHHTVT-RSLDEKHNRFNVFKANVMHVHNTNKL-DKPYKLKLNKFADM 90
Query: 100 TNDEFRALYTGYKMPSPSHR-----STTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQ 154
TN EFR +Y K+ HR S + TF Y+N+ +VP+S+DWR KGAVT +K+Q
Sbjct: 91 TNYEFRRIYADSKVSH--HRMFRGMSNENGTFMYENVK--NVPSSIDWRKKGAVTDVKDQ 146
Query: 155 KECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQN 214
+CG CWAF+ + AVEGI +I++ L+ LSEQ+L+DC T GN GC GG E AF +I QN
Sbjct: 147 GQCGSCWAFSTIVAVEGINQIKTQKLVSLSEQELVDCDTGGNEGCNGGLMEYAFEFIKQN 206
Query: 215 QGIATEDEYPYQAVPGTCSAAQKPAA-AKISNYEEVPSGDEQALLKAVSMQPVSIAIAAY 273
GI TE YPY A GTC ++ A I YE VP +E ALLKA + QPVS+AI A
Sbjct: 207 -GITTESNYPYAAKDGTCDLKKEDKAEVSIDGYENVPINNEAALLKAAAKQPVSVAIDAG 265
Query: 274 STEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR- 332
FQ Y EG+F+G CGT L+H V +VG+G T+D YW++KNSWG+ WG+ GY+++ R
Sbjct: 266 GYNFQFYSEGVFSGHCGTDLNHGVAVVGYGVTQDRTKYWIVKNSWGSEWGEQGYIRMQRG 325
Query: 333 ---DEGLCGIGTRSSYPL 347
EGLCGI +SYP+
Sbjct: 326 ISHKEGLCGIAMEASYPI 343
>gi|357129125|ref|XP_003566217.1| PREDICTED: thiol protease SEN102-like [Brachypodium distachyon]
Length = 380
Score = 283 bits (725), Expect = 7e-74, Method: Compositional matrix adjust.
Identities = 150/322 (46%), Positives = 207/322 (64%), Gaps = 16/322 (4%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
E+S+ ++E+W A+H S +D EK R +F+EN + + N + YKL N+F+DL
Sbjct: 42 EESLWALYERWRARHTVS-RDLAEKSRRFNVFRENARLVHEFNLRRDAPYKLRLNRFADL 100
Query: 100 TNDEFRALY-----TGYKMPSPSHRSTTSSTFKYQNLSMTD---VPTSLDWRDKGAVTPI 151
T+DEFR Y + ++M P + + + S T +PTS+DWR+KGAVT +
Sbjct: 101 TSDEFRRSYASSRVSHHRMFKP-RAANNNDDDDDKGSSFTHGGALPTSVDWREKGAVTGV 159
Query: 152 KNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYI 211
K+Q +CG CWAF+ +AAVEGI IR+ NL LSEQQL+DC T N GC GG + AF+YI
Sbjct: 160 KDQGQCGSCWAFSTIAAVEGINAIRTNNLTSLSEQQLVDCDTKTNAGCDGGLMDDAFSYI 219
Query: 212 IQNQGIATEDEYPYQAVPGTCSAAQKPAAAKIS--NYEEVPSGDEQALLKAVSMQPVSIA 269
++ G+A E YPY+A + ++K AAA +S YE+VP DE AL KAV+ QPV++A
Sbjct: 220 AKHGGVAAEKSYPYRARQSSSCNSKKAAAAVVSIDGYEDVPRNDETALKKAVAAQPVAVA 279
Query: 270 IAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMK 329
I A + FQ Y EG+F G CGT+LDH V VG+G T DG YW++KNSWG WG+ GY++
Sbjct: 280 IEAGGSHFQFYSEGVFAGKCGTELDHGVAAVGYGVTVDGTKYWIVKNSWGEEWGEKGYIR 339
Query: 330 IVRD----EGLCGIGTRSSYPL 347
+ RD EGLCGI +SYP+
Sbjct: 340 MKRDVADKEGLCGIAMEASYPV 361
>gi|255032|gb|AAB23155.1| COT44=cysteine proteinase homolog [Brassica napus, seedling, rapid
cycling base population CrGC5, Peptide, 328 aa]
Length = 328
Score = 283 bits (725), Expect = 7e-74, Method: Compositional matrix adjust.
Identities = 144/317 (45%), Positives = 203/317 (64%), Gaps = 15/317 (4%)
Query: 44 VEIHEKWMAQHGRSYKDEL----EKEMRLKIFKENLEYIEKANKEG-NRTYKLGTNQFSD 98
+ I+ +W +HG+S + +++ R IFK+NL +I+ N+ N TYKLG F++
Sbjct: 1 MSIYLRWSLEHGKSNSNSNGIINQQDERFNIFKDNLRFIDLHNENNKNATYKLGLTIFAN 60
Query: 99 LTNDEFRALYTGYKMPSPSHRSTTSST--FKYQN-LSMTDVPTSLDWRDKGAVTPIKNQK 155
LTNDE+R+LY G + P R T + KY ++ +VP ++DWR KGAV IK+Q
Sbjct: 61 LTNDEYRSLYLGART-EPVRRITKAKNVNMKYSAAVNDVEVPVTVDWRQKGAVNAIKDQG 119
Query: 156 ECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQ 215
CG CWAF+ AAVEGI KI +G L+ LSEQ+L+DC + N GC GG + AF +I++N
Sbjct: 120 TCGSCWAFSTAAAVEGINKIVTGELVSLSEQELVDCDKSYNQGCNGGLMDYAFQFIMKNG 179
Query: 216 GIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYS 274
G+ TE +YPY G C++ K + I YE+VPS DE AL +AVS QPVS+AI A
Sbjct: 180 GLNTEKDYPYHGTNGKCNSLLKNSRVVTIDGYEDVPSKDETALKRAVSYQPVSVAIDAGG 239
Query: 275 TEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD- 333
FQ Y+ GIF G CGT +DHAV VG+G +E+G +YW+++NSWG WG+ GY+++ R+
Sbjct: 240 RAFQHYQSGIFTGKCGTNMDHAVVAVGYG-SENGVDYWIVRNSWGTRWGEDGYIRMERNV 298
Query: 334 ---EGLCGIGTRSSYPL 347
G CGI +SYP+
Sbjct: 299 ASKSGKCGIAIEASYPV 315
>gi|297845064|ref|XP_002890413.1| hypothetical protein ARALYDRAFT_472321 [Arabidopsis lyrata subsp.
lyrata]
gi|297336255|gb|EFH66672.1| hypothetical protein ARALYDRAFT_472321 [Arabidopsis lyrata subsp.
lyrata]
Length = 357
Score = 283 bits (724), Expect = 7e-74, Method: Compositional matrix adjust.
Identities = 144/355 (40%), Positives = 224/355 (63%), Gaps = 25/355 (7%)
Query: 13 INTTPMFIIITLLVSCASQVVSSRSTHEQSVV--------------EIHEKWMAQHGRSY 58
+++ + L +S A+ +S ++H+ S+V E+ E W++ ++Y
Sbjct: 3 LSSPSRILCFPLALSAATLSLSVAASHDYSIVGYSPEDLESHDKLIELFENWISNFEKAY 62
Query: 59 KDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSH 118
+ EK +R ++FK+NL++I++ NK+ ++Y LG N+F+DL+++EF+ +Y G K
Sbjct: 63 ETVEEKLLRFEVFKDNLKHIDETNKK-VKSYWLGLNEFADLSHEEFKKMYLGLKTDIVRR 121
Query: 119 RSTTS-STFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRS 177
S + F Y+++ VP S+DWR KGAV +KNQ CG CWAF+ VAAVEGI KI +
Sbjct: 122 DEERSYAEFAYRDVEA--VPKSVDWRKKGAVAEVKNQGSCGSCWAFSTVAAVEGINKIVT 179
Query: 178 GNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQK 237
GNL LSEQ+L+DC T NNGC GG + AF YI++N G+ E++YPY GTC +
Sbjct: 180 GNLTTLSEQELIDCDTTYNNGCNGGLMDYAFEYIVKNGGLRKEEDYPYSMEEGTCEMQKD 239
Query: 238 PA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKE-GIFNGVCGTQLDH 295
+ I +++VP+ DE++LLKA++ QP+S+AI A EFQ Y +F+G CG LDH
Sbjct: 240 ESETVTIDGHQDVPTNDEKSLLKALAHQPLSVAIDASGREFQFYSGVSVFDGRCGVDLDH 299
Query: 296 AVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
V VG+G+++ G++Y ++KNSWG WG+ GY+++ R+ EGLCGI +S+P
Sbjct: 300 GVAAVGYGSSK-GSDYIIVKNSWGPKWGEKGYIRLKRNTGKPEGLCGINKMASFP 353
>gi|359359118|gb|AEV41024.1| putative oryzain beta chain precursor [Oryza minuta]
Length = 493
Score = 283 bits (724), Expect = 9e-74, Method: Compositional matrix adjust.
Identities = 143/340 (42%), Positives = 213/340 (62%), Gaps = 45/340 (13%)
Query: 47 HEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRT--YKLGTNQFSDLTNDEF 104
++ W+A++GRSY E+E R ++F +NL++++ N + ++LG N+F+DLTNDEF
Sbjct: 49 YDLWLAENGRSYNALGERERRFRVFWDNLKFVDAHNARADEHGGFRLGMNRFADLTNDEF 108
Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKEC------- 157
RA + G K S ++ +Y++ + ++P S+DWR+KGAV P+KNQ +C
Sbjct: 109 RATFLGAKFVERSR----AAGERYRHDGVEELPESVDWREKGAVAPVKNQGQCVDRIIVW 164
Query: 158 -------------------------GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
G CWAF+AV+ VE I ++ +G +I LSEQ+L++CS
Sbjct: 165 NSMVRIYVVDAGCMLENPLMGLTVQGSCWAFSAVSTVESINQLVTGEMITLSEQELVECS 224
Query: 193 TNG-NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVP 250
TNG N+GC GG + AF +II+N GI TED+YPY+AV G C ++ A I +E+VP
Sbjct: 225 TNGQNSGCNGGLMDDAFDFIIKNGGIDTEDDYPYKAVDGKCDINRENAKVVSIDGFEDVP 284
Query: 251 SGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGAN 310
DE++L KAV+ QPVS+AI A EFQ Y G+F+G CGT LDH V VG+G T++G +
Sbjct: 285 QNDEKSLQKAVAHQPVSVAIEAGGREFQLYHSGVFSGRCGTSLDHGVVAVGYG-TDNGKD 343
Query: 311 YWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
YW+++NSWG WG++GY+++ R+ G CGI +SYP
Sbjct: 344 YWIVRNSWGPKWGESGYVRMERNINATTGKCGIAMMASYP 383
>gi|449447027|ref|XP_004141271.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
Length = 458
Score = 283 bits (723), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 149/337 (44%), Positives = 216/337 (64%), Gaps = 17/337 (5%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKD-ELEKEMRLKIFKENLE 76
F+ I L + S ++ R+ E V+ ++++W A+HG+ + + E E R IFK+NL+
Sbjct: 14 FFLFIALSAASPSSIIPQRTDDE--VMALYDQWRAKHGKLHNNLGAEPENRFHIFKDNLK 71
Query: 77 YIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDV 136
+I++ N + N Y+LG N F+DLTN+E+R+ Y G K S S R+ TS+ +Y D+
Sbjct: 72 FIDEINAQ-NLPYRLGLNVFADLTNEEYRSRYLGGKFASGSRRNRTSN--RYLPRLGDDL 128
Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGN 196
P S+DWR KGAV P+K+Q CG CWAF+ VA+VE I +I +G+LI LSEQ+L+DC + N
Sbjct: 129 PDSIDWRAKGAVAPVKDQGSCGSCWAFSTVASVEAINQIVTGDLIALSEQELVDCDRSYN 188
Query: 197 NGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQA 256
GC GG + AF +II+N G+ TE++YPY +C +K A I YE+VP +E+A
Sbjct: 189 EGCNGGLMDYAFEFIIENGGLDTEEDYPYYGFDSSCIQYKKNA---IDGYEDVPVNNEKA 245
Query: 257 LLKA---VSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWL 313
L KA + VS+AI FQ Y+ GIF G CGT LDH V +VG+G +E G +YW+
Sbjct: 246 LQKAVSKQVVSVVSVAIEGGGRSFQLYQSGIFTGRCGTDLDHGVNVVGYG-SEGGVDYWI 304
Query: 314 IKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
++NSWG +WG++GY+K+ R+ GLCGI SYP
Sbjct: 305 VRNSWGGSWGESGYVKMQRNIASPTGLCGIAMEPSYP 341
>gi|414589857|tpg|DAA40428.1| TPA: Vignain [Zea mays]
Length = 377
Score = 282 bits (722), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 146/344 (42%), Positives = 208/344 (60%), Gaps = 30/344 (8%)
Query: 28 CASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNR 87
C + +V+ ++E E+WM +HGR Y D EK+ RL++++ N+E +E N GN
Sbjct: 39 CGAALVA----RADPMLERFEQWMGRHGRLYADAGEKQRRLEVYRRNVELVETFNSMGN- 93
Query: 88 TYKLGTNQFSDLTNDEFRALYTGYKMPSP---SHRSTTSSTFKYQNLSM------TDVPT 138
Y+L N+F+DLTN+EFRA G+ P + ST ST + +D+P
Sbjct: 94 GYRLADNKFADLTNEEFRAKMLGFGRPRSGGGAGHSTAPSTVACIGSGLMGRQGYSDLPK 153
Query: 139 SLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNG 198
S+DWR+KGAV P+K+Q +CG CWAF+AVAA+EGI +I++G L+ LSEQ+L+DC T G
Sbjct: 154 SVDWREKGAVAPVKSQGDCGSCWAFSAVAAIEGINQIKNGKLVSLSEQELVDCDTKA-IG 212
Query: 199 CLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQ-KPAAAKISNYEEVPSGDEQAL 257
C GG AF ++++N+G+ TE YPYQ + G C + K +A IS Y V E L
Sbjct: 213 CAGGYMSWAFEFVMKNRGLTTERNYPYQGLNGACQTPKLKESAVSISGYMNVTPSSEPDL 272
Query: 258 LKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTED---------- 307
L+A + QPVS+A+ A S +Q Y G+F G C +L+H VT+VG+G T+
Sbjct: 273 LRAAAAQPVSVAVDAGSFVWQLYGGGVFTGPCTAELNHGVTVVGYGETQGDTDGDGSGVP 332
Query: 308 GANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
G YW++KNSWG WGDAGY+ + R+ GLCGI SYP+
Sbjct: 333 GKKYWIVKNSWGPEWGDAGYILMQREASVASGLCGIAMLPSYPV 376
>gi|14600257|gb|AAK71314.1|AF388175_1 papain-like cysteine peptidase XBCP3 [Arabidopsis thaliana]
Length = 437
Score = 282 bits (722), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 139/311 (44%), Positives = 202/311 (64%), Gaps = 11/311 (3%)
Query: 43 VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTND 102
+ E+ + W +HG++Y E E++ R++IFK+N +++ + N N TY L N F+DLT+
Sbjct: 28 ISELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADLTHH 87
Query: 103 EFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT-DVPTSLDWRDKGAVTPIKNQKECGCCW 161
EF+A G + +PS + K Q+L + VP S+DWR KGAVT +K+Q CG CW
Sbjct: 88 EFKASRLGLSVSAPSVIMAS----KGQSLGGSVKVPDSVDWRKKGAVTNVKDQGSCGACW 143
Query: 162 AFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATED 221
+F+A A+EGI +I +G+LI LSEQ+L+DC + N GC GG + AF ++I+N GI TE
Sbjct: 144 SFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNHGIDTEK 203
Query: 222 EYPYQAVPGTCSAAQ-KPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSY 280
+YPYQ GTC + K I +Y V S DE+AL++AV+ QPVS+ I FQ Y
Sbjct: 204 DYPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALMEAVAAQPVSVGICGSERAFQLY 263
Query: 281 KEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGL 336
GIF+G C T LDHAV IVG+G +++G +YW++KNSWG +WG G+M + R+ +G+
Sbjct: 264 SRGIFSGPCSTSLDHAVLIVGYG-SQNGVDYWIVKNSWGKSWGMDGFMHMQRNTENSDGV 322
Query: 337 CGIGTRSSYPL 347
CGI +SYP+
Sbjct: 323 CGINMLASYPI 333
>gi|18391078|ref|NP_563855.1| xylem bark cysteine peptidase 3 [Arabidopsis thaliana]
gi|110741821|dbj|BAE98853.1| papain-like cysteine peptidase XBCP3 [Arabidopsis thaliana]
gi|111074448|gb|ABH04597.1| At1g09850 [Arabidopsis thaliana]
gi|332190386|gb|AEE28507.1| xylem bark cysteine peptidase 3 [Arabidopsis thaliana]
Length = 437
Score = 282 bits (722), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 139/311 (44%), Positives = 202/311 (64%), Gaps = 11/311 (3%)
Query: 43 VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTND 102
+ E+ + W +HG++Y E E++ R++IFK+N +++ + N N TY L N F+DLT+
Sbjct: 28 ISELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADLTHH 87
Query: 103 EFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT-DVPTSLDWRDKGAVTPIKNQKECGCCW 161
EF+A G + +PS + K Q+L + VP S+DWR KGAVT +K+Q CG CW
Sbjct: 88 EFKASRLGLSVSAPSVIMAS----KGQSLGGSVKVPDSVDWRKKGAVTNVKDQGSCGACW 143
Query: 162 AFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATED 221
+F+A A+EGI +I +G+LI LSEQ+L+DC + N GC GG + AF ++I+N GI TE
Sbjct: 144 SFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNHGIDTEK 203
Query: 222 EYPYQAVPGTCSAAQ-KPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSY 280
+YPYQ GTC + K I +Y V S DE+AL++AV+ QPVS+ I FQ Y
Sbjct: 204 DYPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALMEAVAAQPVSVGICGSERAFQLY 263
Query: 281 KEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGL 336
GIF+G C T LDHAV IVG+G +++G +YW++KNSWG +WG G+M + R+ +G+
Sbjct: 264 SSGIFSGPCSTSLDHAVLIVGYG-SQNGVDYWIVKNSWGKSWGMDGFMHMQRNTENSDGV 322
Query: 337 CGIGTRSSYPL 347
CGI +SYP+
Sbjct: 323 CGINMLASYPI 333
>gi|226507844|ref|NP_001148894.1| LOC100282514 precursor [Zea mays]
gi|194703250|gb|ACF85709.1| unknown [Zea mays]
gi|195622994|gb|ACG33327.1| vignain precursor [Zea mays]
Length = 356
Score = 282 bits (722), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 146/344 (42%), Positives = 208/344 (60%), Gaps = 30/344 (8%)
Query: 28 CASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNR 87
C + +V+ ++E E+WM +HGR Y D EK+ RL++++ N+E +E N GN
Sbjct: 18 CGAALVA----RADPMLERFEQWMGRHGRLYADAGEKQRRLEVYRRNVELVETFNSMGN- 72
Query: 88 TYKLGTNQFSDLTNDEFRALYTGYKMPSP---SHRSTTSSTFKYQNLSM------TDVPT 138
Y+L N+F+DLTN+EFRA G+ P + ST ST + +D+P
Sbjct: 73 GYRLADNKFADLTNEEFRAKMLGFGRPRSGGGAGHSTAPSTVACIGSGLMGRQGYSDLPK 132
Query: 139 SLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNG 198
S+DWR+KGAV P+K+Q +CG CWAF+AVAA+EGI +I++G L+ LSEQ+L+DC T G
Sbjct: 133 SVDWREKGAVAPVKSQGDCGSCWAFSAVAAIEGINQIKNGKLVSLSEQELVDCDTKA-IG 191
Query: 199 CLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQ-KPAAAKISNYEEVPSGDEQAL 257
C GG AF ++++N+G+ TE YPYQ + G C + K +A IS Y V E L
Sbjct: 192 CAGGYMSWAFEFVMKNRGLTTERNYPYQGLNGACQTPKLKESAVSISGYMNVTPSSEPDL 251
Query: 258 LKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTED---------- 307
L+A + QPVS+A+ A S +Q Y G+F G C +L+H VT+VG+G T+
Sbjct: 252 LRAAAAQPVSVAVDAGSFVWQLYGGGVFTGPCTAELNHGVTVVGYGETQGDTDGDGSGVP 311
Query: 308 GANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
G YW++KNSWG WGDAGY+ + R+ GLCGI SYP+
Sbjct: 312 GKKYWIVKNSWGPEWGDAGYILMQREASVASGLCGIAMLPSYPV 355
>gi|359359168|gb|AEV41073.1| putative cysteine protease [Oryza minuta]
Length = 499
Score = 282 bits (721), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 146/322 (45%), Positives = 208/322 (64%), Gaps = 18/322 (5%)
Query: 40 EQSVVEIHEKWMAQH---GRSYKDEL-EKEMRLKIFKENLEYIEKANKEGNRT--YKLGT 93
E +++ W+A+H G S+ + E E R ++F +NL++++ N + ++LG
Sbjct: 58 EAEARAVYDLWVARHRHGGGSHNGLVGEYERRFRVFWDNLKFVDAHNARADEHGGFRLGM 117
Query: 94 NQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVT-PIK 152
N+F+DLTNDEFRA Y G P+ R + Y++ + +P S+DWRDKGAV P+K
Sbjct: 118 NRFADLTNDEFRAAYLG-TTPAGRGRHVGEA---YRHDGVEALPDSVDWRDKGAVVAPVK 173
Query: 153 NQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFAYI 211
NQ +CG CWAF+AVAAVEGI KI +G L+ LSEQ+L++C+ NG N+GC GG + AFA+I
Sbjct: 174 NQGQCGSCWAFSAVAAVEGINKIVTGELVSLSEQELVECARNGANSGCNGGMMDDAFAFI 233
Query: 212 IQNQGIATEDEYPYQAVPGTCSAAQKP-AAAKISNYEEVPSGDEQALLKAVSMQPVSIAI 270
+N G+ TE++YPY A+ G C+ A+K I +E+VP DE +L KAV+ QPVS+AI
Sbjct: 234 ARNGGLDTEEDYPYTAMDGKCNLAKKSRKVVSIDGFEDVPENDELSLQKAVAHQPVSVAI 293
Query: 271 AAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGT-TEDGANYWLIKNSWGNTWGDAGYMK 329
A EFQ Y G+F G CGT LDH V VG+GT G +YW ++NSWG WG+ GY++
Sbjct: 294 DAGGREFQLYDSGVFTGRCGTSLDHGVVAVGYGTDAATGTDYWTVRNSWGPDWGENGYIR 353
Query: 330 IVRD----EGLCGIGTRSSYPL 347
+ R+ G CGI +SYP+
Sbjct: 354 MERNVTARTGKCGIAMMASYPI 375
>gi|297744465|emb|CBI37727.3| unnamed protein product [Vitis vinifera]
Length = 331
Score = 282 bits (721), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 140/304 (46%), Positives = 198/304 (65%), Gaps = 31/304 (10%)
Query: 48 EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRAL 107
E W+++HG+ YK EK R ++F+ENL +I++ NKE + +Y LG N+F+DL+++EF++
Sbjct: 50 ESWVSKHGKVYKSMEEKLHRFEVFRENLNHIDERNKEVS-SYWLGLNEFADLSHEEFKSK 108
Query: 108 YTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVA 167
+ D+P S+DWR KGAVT +KNQ CG CWAF+ VA
Sbjct: 109 ------------------------DVADLPESVDWRKKGAVTHVKNQGACGSCWAFSTVA 144
Query: 168 AVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQA 227
AVEGI +I +GNL LSEQ+L+DC T N+GC GG + AFA+I N G+ ED+YPY
Sbjct: 145 AVEGINQIVTGNLTTLSEQELIDCDTTFNSGCNGGLMDYAFAFIASNGGLHKEDDYPYLM 204
Query: 228 VPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFN 286
GTC ++ IS YE+VP DE++LLKA++ QP+S+AI A +FQ Y G+FN
Sbjct: 205 EEGTCEEQKEDVDIVTISGYEDVPEKDEESLLKALAHQPLSVAIEASGRDFQFYSGGVFN 264
Query: 287 GVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTR 342
G CGT+LDH V VG+G+++ G +Y ++KNSWG WG+ GY+++ R+ EGLCGI
Sbjct: 265 GPCGTELDHGVAAVGYGSSK-GLDYIIVKNSWGPKWGEKGYIRMKRNTGKTEGLCGINKM 323
Query: 343 SSYP 346
+SYP
Sbjct: 324 ASYP 327
>gi|357126406|ref|XP_003564878.1| PREDICTED: cysteine proteinase EP-B 1-like [Brachypodium
distachyon]
Length = 377
Score = 282 bits (721), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 144/328 (43%), Positives = 205/328 (62%), Gaps = 27/328 (8%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRT--------YKL 91
E+++ E++ +W + H + EK R FK N+ +I N N T Y+L
Sbjct: 35 EEALWELYTRWQSAHRLPPQHHAEKHRRFGTFKSNVLFIHAHNTRLNDTSTNNNGPSYRL 94
Query: 92 GTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSST----FKYQNLSMTDVPTSLDWRDKGA 147
N+F D+ EFR+ + G P HR T + F Y ++ D+P ++DWR KGA
Sbjct: 95 RLNRFGDMDQAEFRSTFAG-----PLHRHTRPAQSIPGFIYD--TVKDIPQAVDWRQKGA 147
Query: 148 VTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREK 206
VT +K+Q +CG CWAF+AVA+VEG+ IR+G+L+ LSEQ+L+DC T G +NGC GG E
Sbjct: 148 VTGVKDQGKCGSCWAFSAVASVEGLNAIRTGSLVSLSEQELIDCDTGGDDNGCQGGLMES 207
Query: 207 AFAYIIQNQ-GIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQ 264
AF +I + G+ATE YPY A GTC+A + + + +I ++ VP+G+E+AL KAV+ Q
Sbjct: 208 AFEFIAHSAGGLATEAAYPYHASNGTCNANRGSSVSVRIDGHQSVPAGNEEALAKAVAHQ 267
Query: 265 PVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTT-EDGANYWLIKNSWGNTWG 323
PVS+AI A FQ Y EG+F G CG++LDH V +VG+G EDG YW++KNSWG WG
Sbjct: 268 PVSVAIDAGGQAFQFYSEGVFTGDCGSELDHGVAVVGYGVAEEDGKEYWIVKNSWGPGWG 327
Query: 324 DAGYMKIVRDE----GLCGIGTRSSYPL 347
+ GY+++ RD GLCGI +SYP+
Sbjct: 328 EHGYVRMQRDSGVDGGLCGIAMEASYPV 355
>gi|302763109|ref|XP_002964976.1| hypothetical protein SELMODRAFT_83176 [Selaginella moellendorffii]
gi|302763113|ref|XP_002964978.1| hypothetical protein SELMODRAFT_83554 [Selaginella moellendorffii]
gi|300167209|gb|EFJ33814.1| hypothetical protein SELMODRAFT_83176 [Selaginella moellendorffii]
gi|300167211|gb|EFJ33816.1| hypothetical protein SELMODRAFT_83554 [Selaginella moellendorffii]
Length = 300
Score = 281 bits (720), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 137/303 (45%), Positives = 196/303 (64%), Gaps = 8/303 (2%)
Query: 46 IHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFR 105
+ E W A+H +SY + EK RL +F + L YIEK N + N T+ LG N+FSDLTN EFR
Sbjct: 1 MFEDWAAKHDKSYSSDWEKARRLMVFSDTLAYIEKHNAQPNTTFTLGLNKFSDLTNAEFR 60
Query: 106 ALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAA 165
A Y G K P ++ + K ++ ++ +PTSLDWR +GAVTPIK+Q +CG CWAF+A
Sbjct: 61 ANYVG-KFKPPRYQDRRPA--KDVDVDVSSLPTSLDWRQEGAVTPIKDQGQCGSCWAFSA 117
Query: 166 VAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPY 225
+A++E + + L+ LSEQQL+DC T + GC GG + AF ++++N G+ TE+ YPY
Sbjct: 118 IASIESAHFLATKELVSLSEQQLIDCDTV-DQGCQGGFPDDAFKFVVENGGVTTEEAYPY 176
Query: 226 QAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIF 285
G+C+ K +I+ Y++V AL+KAVS PV++ I FQ+Y+ GI
Sbjct: 177 TGFAGSCN-TNKNKVVEITGYKDVTKDSADALMKAVSKTPVTVGICGSDQNFQNYRSGIL 235
Query: 286 NGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD--EGLCGIGTRS 343
+G C DHAV ++G+G TE G YW+IKNSWG +WG+ G+MKI + EG+CG+ +S
Sbjct: 236 SGQCCNSRDHAVLVIGYG-TEGGMPYWIIKNSWGTSWGEDGFMKIKKKDGEGMCGMNGQS 294
Query: 344 SYP 346
SYP
Sbjct: 295 SYP 297
>gi|359359120|gb|AEV41026.1| putative cysteine protease [Oryza minuta]
Length = 464
Score = 281 bits (720), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 146/322 (45%), Positives = 208/322 (64%), Gaps = 18/322 (5%)
Query: 40 EQSVVEIHEKWMAQH---GRSYKDEL-EKEMRLKIFKENLEYIEKANKEGNRT--YKLGT 93
E +++ W+A+H G S+ + E E R ++F +NL++++ N + ++LG
Sbjct: 59 EAEARAVYDLWVARHRHGGGSHNGFVGEYERRFRVFWDNLKFVDAHNAHADEHGGFRLGM 118
Query: 94 NQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAV-TPIK 152
N+F+DLTNDEFRA Y G +P+ R Y++ + +P S+DWRDKGAV +P+K
Sbjct: 119 NRFADLTNDEFRAAYLG---TTPAGRGRHVGEM-YRHDGVEALPDSVDWRDKGAVVSPVK 174
Query: 153 NQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYI 211
NQ +CG CWAF+AVAAVEGI KI +G L+ LSEQ+L++C+ N GN+GC GG + AFA+I
Sbjct: 175 NQGQCGSCWAFSAVAAVEGINKIVTGELVSLSEQELVECARNRGNSGCNGGIMDDAFAFI 234
Query: 212 IQNQGIATEDEYPYQAVPGTCSAAQKP-AAAKISNYEEVPSGDEQALLKAVSMQPVSIAI 270
+N G+ TE++YPY A+ G C A+K I +E+VP DE +L KAV+ QPVS+AI
Sbjct: 235 TRNGGLDTEEDYPYTAMDGKCDLAKKSRKVVSIDGFEDVPENDELSLQKAVAHQPVSVAI 294
Query: 271 AAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGT-TEDGANYWLIKNSWGNTWGDAGYMK 329
A EFQ Y G+F G CGT LDH V VG+GT G +YW ++NSWG WG+ GY++
Sbjct: 295 DAGGREFQLYDSGVFTGRCGTSLDHGVVAVGYGTDAATGTDYWTVRNSWGPDWGENGYIR 354
Query: 330 IVRD----EGLCGIGTRSSYPL 347
+ R+ G CGI +SYP+
Sbjct: 355 MERNVTARTGKCGIAMMASYPI 376
>gi|359359215|gb|AEV41119.1| putative cysteine protease [Oryza officinalis]
Length = 499
Score = 281 bits (720), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 146/322 (45%), Positives = 208/322 (64%), Gaps = 18/322 (5%)
Query: 40 EQSVVEIHEKWMAQH---GRSYKDEL-EKEMRLKIFKENLEYIEKANKEGNRT--YKLGT 93
E +++ W+A+H G S+ + E E R ++F +NL++++ N + ++LG
Sbjct: 58 EAEARAVYDLWVARHRHGGDSHNGLVGEYERRFRVFWDNLKFVDAHNARADEHGGFRLGM 117
Query: 94 NQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVT-PIK 152
N+F+DLTNDEFRA Y G P+ R + Y++ + +P S+DWRDKGAV P+K
Sbjct: 118 NRFADLTNDEFRAAYLG-TTPAGRGRHVGEA---YRHDGVEVLPDSVDWRDKGAVVAPVK 173
Query: 153 NQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFAYI 211
NQ +CG CWAF+AVAAVEGI KI +G L+ LSEQ+L++C+ NG N+GC GG + AFA+I
Sbjct: 174 NQGQCGSCWAFSAVAAVEGINKIVTGELVSLSEQELVECARNGANSGCNGGMMDDAFAFI 233
Query: 212 IQNQGIATEDEYPYQAVPGTCSAAQKP-AAAKISNYEEVPSGDEQALLKAVSMQPVSIAI 270
+N G+ TE++YPY A+ G C+ A+K I +E+VP DE +L KAV+ QPVS+AI
Sbjct: 234 ARNGGLDTEEDYPYTAMDGKCNLAKKSRKVVSIDGFEDVPENDELSLQKAVAHQPVSVAI 293
Query: 271 AAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGT-TEDGANYWLIKNSWGNTWGDAGYMK 329
A EFQ Y G+F G CGT LDH V VG+GT G +YW ++NSWG WG+ GY++
Sbjct: 294 DAGGREFQLYDSGVFTGRCGTSLDHGVVAVGYGTDAATGTDYWTVRNSWGPDWGENGYIR 353
Query: 330 IVRD----EGLCGIGTRSSYPL 347
+ R+ G CGI +SYP+
Sbjct: 354 MERNVTARTGKCGIAMMASYPI 375
>gi|333069454|gb|AEF13978.1| chymopapain [Carica papaya]
Length = 352
Score = 281 bits (719), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 143/318 (44%), Positives = 201/318 (63%), Gaps = 16/318 (5%)
Query: 38 THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
T + ++++ + WM +H + Y+ EK R +IF++NL YI++ NK+ N +Y LG N F+
Sbjct: 39 TSIERLIQLFDSWMLKHNKIYESIDEKIYRFEIFRDNLMYIDETNKK-NNSYWLGLNGFA 97
Query: 98 DLTNDEFRALYTGY---KMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQ 154
DL+NDEF+ Y G H T+K+ +T+ P S+DWR KGAVTP+KNQ
Sbjct: 98 DLSNDEFKKKYVGSVAEDFTGLEHFDNEDFTYKH----VTNYPQSIDWRAKGAVTPVKNQ 153
Query: 155 KECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQN 214
CG CWAF+ +A VEG+ KI +GNL++LSEQ+L+DC N ++GC GG + + Y+ N
Sbjct: 154 GSCGSCWAFSTIATVEGVNKIVTGNLLELSEQELVDCDKN-SHGCKGGYQTTSLQYVADN 212
Query: 215 QGIATEDEYPYQAVPGTCSAAQKPAA-AKISNYEEVPSGDEQALLKAVSMQPVSIAIAAY 273
G+ T YPYQA C A KP KI+ Y+ VPS E + L A++ QP+S+ + A
Sbjct: 213 -GVHTSKVYPYQAKAMQCRATDKPGPKVKITGYKRVPSNCETSFLGALANQPLSVLVEAG 271
Query: 274 STEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR- 332
FQ YK G+F+G CGT+LDHAVT VG+GT+ DG NY +IKNSWG WG+ GYM++ R
Sbjct: 272 GKPFQLYKSGVFDGPCGTKLDHAVTAVGYGTS-DGKNYIIIKNSWGPNWGEKGYMRLKRQ 330
Query: 333 ---DEGLCGIGTRSSYPL 347
+G CG+ S YP
Sbjct: 331 SGNSQGTCGVYKSSYYPF 348
>gi|302781881|ref|XP_002972714.1| hypothetical protein SELMODRAFT_98707 [Selaginella moellendorffii]
gi|300159315|gb|EFJ25935.1| hypothetical protein SELMODRAFT_98707 [Selaginella moellendorffii]
Length = 446
Score = 281 bits (718), Expect = 4e-73, Method: Compositional matrix adjust.
Identities = 137/293 (46%), Positives = 189/293 (64%), Gaps = 14/293 (4%)
Query: 65 EMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYK---MPSPSHRST 121
+ R + FKEN YIE+ N+ G +Y+LG NQFSDLT++EFR + G + + SP +
Sbjct: 32 DRRFETFKENFRYIEEHNRAGKHSYRLGLNQFSDLTSEEFRQRFLGLRPDLIDSPVLKMP 91
Query: 122 TSSTFK--YQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGN 179
S + +QN+ D+P S+DWR GAVT K+Q CG CWAFA A+EGI +I +G
Sbjct: 92 RDSDIEEGFQNV---DLPASVDWRKHGAVTAPKDQGSCGGCWAFATTGAIEGINQIVTGQ 148
Query: 180 LIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQ-KP 238
L+ LSEQ+L+DC + GC GG E A+ +I++N G+ TE +YPY A C+ +
Sbjct: 149 LMSLSEQELIDCDKKADKGCDGGLMENAYQFIVENGGLDTETDYPYHASESHCNMKKLNS 208
Query: 239 AAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVT 298
I YE +P GDEQALL+AV+ QPVS+AI S +FQ Y G+F G CG +++H V
Sbjct: 209 RVVAIDGYEAIPDGDEQALLRAVAKQPVSVAIEGASKDFQHYASGVFTGHCGEEINHGVL 268
Query: 299 IVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE----GLCGIGTRSSYPL 347
IVG+G TEDG +YW++KNSW TWGD G++K+ R+ GLC I T +SYP+
Sbjct: 269 IVGYG-TEDGLDYWIVKNSWAATWGDGGFVKMQRNTGKRGGLCSINTLASYPV 320
>gi|2507252|sp|P14080.2|PAPA2_CARPA RecName: Full=Chymopapain; AltName: Full=Papaya proteinase II;
Short=PPII; Flags: Precursor
gi|1332461|emb|CAA66378.1| chymopapain [Carica papaya]
Length = 352
Score = 280 bits (717), Expect = 5e-73, Method: Compositional matrix adjust.
Identities = 143/318 (44%), Positives = 201/318 (63%), Gaps = 16/318 (5%)
Query: 38 THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
T + ++++ + WM +H + Y+ EK R +IF++NL YI++ NK+ N +Y LG N F+
Sbjct: 39 TSIERLIQLFDSWMLKHNKIYESIDEKIYRFEIFRDNLMYIDETNKK-NNSYWLGLNGFA 97
Query: 98 DLTNDEFRALYTGY---KMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQ 154
DL+NDEF+ Y G+ H T+K+ +T+ P S+DWR KGAVTP+KNQ
Sbjct: 98 DLSNDEFKKKYVGFVAEDFTGLEHFDNEDFTYKH----VTNYPQSIDWRAKGAVTPVKNQ 153
Query: 155 KECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQN 214
CG CWAF+ +A VEGI KI +GNL++LSEQ+L+DC + + GC GG + + Y + N
Sbjct: 154 GACGSCWAFSTIATVEGINKIVTGNLLELSEQELVDCDKH-SYGCKGGYQTTSLQY-VAN 211
Query: 215 QGIATEDEYPYQAVPGTCSAAQKPAA-AKISNYEEVPSGDEQALLKAVSMQPVSIAIAAY 273
G+ T YPYQA C A KP KI+ Y+ VPS E + L A++ QP+S+ + A
Sbjct: 212 NGVHTSKVYPYQAKQYKCRATDKPGPKVKITGYKRVPSNCETSFLGALANQPLSVLVEAG 271
Query: 274 STEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR- 332
FQ YK G+F+G CGT+LDHAVT VG+GT+ DG NY +IKNSWG WG+ GYM++ R
Sbjct: 272 GKPFQLYKSGVFDGPCGTKLDHAVTAVGYGTS-DGKNYIIIKNSWGPNWGEKGYMRLKRQ 330
Query: 333 ---DEGLCGIGTRSSYPL 347
+G CG+ S YP
Sbjct: 331 SGNSQGTCGVYKSSYYPF 348
>gi|242094002|ref|XP_002437491.1| hypothetical protein SORBIDRAFT_10g028010 [Sorghum bicolor]
gi|241915714|gb|EER88858.1| hypothetical protein SORBIDRAFT_10g028010 [Sorghum bicolor]
Length = 397
Score = 280 bits (717), Expect = 6e-73, Method: Compositional matrix adjust.
Identities = 146/334 (43%), Positives = 207/334 (61%), Gaps = 28/334 (8%)
Query: 40 EQSVVEIHEKWMAQHGRSYKD----ELEKEMRLKIFKENLEYIEKANKE---GNRTYKLG 92
++ V ++E W ++HGR + E +RL++F++NL YI+ N E G T++LG
Sbjct: 47 DEEVRRMYEAWKSKHGRPRGNCDMAGDEDRLRLEVFRDNLRYIDAHNAEADAGLHTFRLG 106
Query: 93 TNQFSDLTNDEFRALYTGYKMP---SPSHRSTTSSTFKYQNLSMT----------DVPTS 139
F+DLT +E+R G++ PS R+ S S D+P +
Sbjct: 107 LTPFADLTLEEYRGRALGFRARHRGGPSARAAASRVGSGGTRSHHRRPRPRPRCGDLPDA 166
Query: 140 LDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGC 199
+DWR GAVT +KNQ++CG CWAF+AVAA+EGI I +GNL+ LSEQ+++DC T ++GC
Sbjct: 167 IDWRQLGAVTDVKNQEQCGGCWAFSAVAAIEGINAIVTGNLVSLSEQEIIDCDTQ-DSGC 225
Query: 200 LGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSA--AQKPAAAKISNYEEVPSGDEQAL 257
GG E AF ++I N GI +E +YP+ A GTC A A A I + EV S +E AL
Sbjct: 226 NGGQMENAFQFVIDNGGIDSEADYPFIATDGTCDANKANDEKVAAIDGFVEVASNNETAL 285
Query: 258 LKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNS 317
+AV++QPVS+AI A FQ Y GIFNG CGT LDH VT+VG+G +E+G YW++KNS
Sbjct: 286 QEAVAIQPVSVAIDAGGRAFQHYSSGIFNGPCGTNLDHGVTVVGYG-SENGKAYWIVKNS 344
Query: 318 WGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
W ++WG+AGY++I R+ G CGI +SYP+
Sbjct: 345 WSDSWGEAGYIRIRRNVFLPVGKCGIAMDASYPV 378
>gi|449469929|ref|XP_004152671.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
gi|449529596|ref|XP_004171784.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
Length = 431
Score = 280 bits (716), Expect = 7e-73, Method: Compositional matrix adjust.
Identities = 140/309 (45%), Positives = 197/309 (63%), Gaps = 10/309 (3%)
Query: 42 SVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTN 101
+V E+ E W +HG+SY EK RL +F +N E++ N N +Y L N ++DLT+
Sbjct: 24 NVSELFEIWCTEHGKSYSSAEEKLYRLGVFADNYEFVTHHNNLDNSSYTLSLNSYADLTH 83
Query: 102 DEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCW 161
EF+ G+ SP+ R+ + +L DVP SLDWR KGAVT +K+Q CG CW
Sbjct: 84 HEFKVSRLGF---SPALRNFRPVLPQEPSLP-RDVPDSLDWRKKGAVTAVKDQGSCGACW 139
Query: 162 AFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATED 221
+F+A A+EGI +I +G+LI LSEQ+L+DC + N+GC GG + A+ ++I N GI TE+
Sbjct: 140 SFSATGAMEGINQIMTGSLISLSEQELIDCDRSYNSGCGGGLMDYAYQFVISNHGIDTEN 199
Query: 222 EYPYQAVPGTCSAAQ-KPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSY 280
+YPYQA G+C + + I Y ++PS DE LL+AV+ QPVS+ I FQ Y
Sbjct: 200 DYPYQARDGSCRKDKLQRNVVTIDGYADIPSNDEGKLLQAVAAQPVSVGICGSERAFQLY 259
Query: 281 KEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGL 336
+GIF+G C T LDHAV IVG+G +E+G +YW++KNSWG +WG GYM + R+ EG+
Sbjct: 260 SKGIFSGPCSTSLDHAVLIVGYG-SENGVDYWIVKNSWGKSWGMDGYMHMQRNSGNSEGV 318
Query: 337 CGIGTRSSY 345
CGI +SY
Sbjct: 319 CGINKLASY 327
>gi|30685308|ref|NP_566634.2| putative cysteine proteinase [Arabidopsis thaliana]
gi|30315949|sp|Q9LT77.1|CPR1_ARATH RecName: Full=Probable cysteine proteinase At3g19400; Flags:
Precursor
gi|11994462|dbj|BAB02464.1| cysteine proteinase [Arabidopsis thaliana]
gi|332642715|gb|AEE76236.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 362
Score = 280 bits (715), Expect = 9e-73, Method: Compositional matrix adjust.
Identities = 142/316 (44%), Positives = 202/316 (63%), Gaps = 12/316 (3%)
Query: 39 HEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSD 98
+E V ++E+W+ ++ ++Y EKE R KIFK+NL+++++ N +RT+++G +F+D
Sbjct: 36 NETEVRLMYEQWLVENRKNYNGLGEKERRFKIFKDNLKFVDEHNSVPDRTFEVGLTRFAD 95
Query: 99 LTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECG 158
LTN+EFRA+Y KM + S + + Y+ + +P +DWR GAV +K+Q CG
Sbjct: 96 LTNEEFRAIYLRKKMER-TKDSVKTERYLYKEGDV--LPDEVDWRANGAVVSVKDQGNCG 152
Query: 159 CCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFAYIIQNQGI 217
CWAF+AV AVEGI +I +G LI LSEQ+L+DC N GC GG AF +I++N GI
Sbjct: 153 SCWAFSAVGAVEGINQITTGELISLSEQELVDCDRGFVNAGCDGGIMNYAFEFIMKNGGI 212
Query: 218 ATEDEYPYQAVP-GTCSAAQ--KPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYS 274
T+ +YPY A G C+A + I YE+VP DE++L KAV+ QPVS+AI A S
Sbjct: 213 ETDQDYPYNANDLGLCNADKNNNTRVVTIDGYEDVPRDDEKSLKKAVAHQPVSVAIEASS 272
Query: 275 TEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD- 333
FQ YK G+ G CG LDH V +VG+G+T G +YW+I+NSWG WGD+GY+K+ R+
Sbjct: 273 QAFQLYKSGVMTGTCGISLDHGVVVVGYGSTS-GEDYWIIRNSWGLNWGDSGYVKLQRNI 331
Query: 334 ---EGLCGIGTRSSYP 346
G CGI SYP
Sbjct: 332 DDPFGKCGIAMMPSYP 347
>gi|4469153|emb|CAB38314.1| chymopapain isoform II [Carica papaya]
Length = 352
Score = 280 bits (715), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 143/318 (44%), Positives = 200/318 (62%), Gaps = 16/318 (5%)
Query: 38 THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
T + ++++ + WM +H + Y+ EK R +IF++NL YI++ NK+ N +Y LG N F+
Sbjct: 39 TSIERLIQLFDSWMLKHNKIYESIDEKIYRFEIFRDNLMYIDETNKK-NNSYWLGLNGFA 97
Query: 98 DLTNDEFRALYTGY---KMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQ 154
DL+NDEF+ Y G+ H T+K+ +T+ P S+DWR KGAVTP+KNQ
Sbjct: 98 DLSNDEFKKKYVGFVAEDFTGLEHFDNEDFTYKH----VTNYPQSIDWRAKGAVTPVKNQ 153
Query: 155 KECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQN 214
CG CWAF+ +A VEGI KI +GNL++LSEQ+L+DC + + GC GG + + Y + N
Sbjct: 154 GACGSCWAFSTIATVEGINKIVTGNLLELSEQELVDCDKH-SYGCKGGYQTTSLQY-VAN 211
Query: 215 QGIATEDEYPYQAVPGTCSAAQKPAA-AKISNYEEVPSGDEQALLKAVSMQPVSIAIAAY 273
G+ T YPYQA C A KP KI+ Y+ VPS E + L A++ QP+S + A
Sbjct: 212 NGVHTSKVYPYQAKQYKCRATDKPGPKVKITGYKRVPSNCETSFLGALANQPLSFLVEAG 271
Query: 274 STEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR- 332
FQ YK G+F+G CGT+LDHAVT VG+GT+ DG NY +IKNSWG WG+ GYM++ R
Sbjct: 272 GKPFQLYKSGVFDGPCGTKLDHAVTAVGYGTS-DGKNYIIIKNSWGPNWGEKGYMRLKRQ 330
Query: 333 ---DEGLCGIGTRSSYPL 347
+G CG+ S YP
Sbjct: 331 SGNSQGTCGVYKSSYYPF 348
>gi|356517368|ref|XP_003527359.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 332
Score = 280 bits (715), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 155/339 (45%), Positives = 215/339 (63%), Gaps = 27/339 (7%)
Query: 19 FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
F ++ + A QV + R+ + S+ E HE+ M ++G+ YKD ++ FKEN+ YI
Sbjct: 12 FAMLLCMAFLAFQV-TCRTLQDASMXERHEQRMTRYGKVYKDPPKRX-----FKENVNYI 65
Query: 79 EKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPT 138
E N N+ YK G NQF+ R + G+ M S R TT FK++N++ T P+
Sbjct: 66 EACNNAANKPYKRGINQFAP------RNRFKGH-MCSSIIRITT---FKFENVTAT--PS 113
Query: 139 SLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NN 197
++D R KGAVTPIK+Q +CGCCWAF+AVAA EGI + +G LI LSEQ+L+DC T G +
Sbjct: 114 TVDCRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALSAGKLISLSEQELVDCDTKGVDX 173
Query: 198 GCLGGSREKAFAYIIQNQGIATEDEYP-YQAVPGTCSAAQKPAAAK--ISNYEEVPSGDE 254
GC GG + AF +IIQN G+ + P Y V G C+A + A I+ YE+VP+ +E
Sbjct: 174 GCEGGLMDDAFKFIIQNHGLKHXSQLPLYMGVDGKCNANEAAKNAATIITGYEDVPANNE 233
Query: 255 QALL-KAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWL 313
+A L KAV+ PVS AI A ++FQ YK G+F G CGT+LDH VT VG+G ++DG YWL
Sbjct: 234 KAHLQKAVANNPVSEAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVSDDGTEYWL 293
Query: 314 IKNSWGNTWGDAGYMKIVR----DEGLCGIGTRSSYPLA 348
+KNSWG WG+ GY+++ R +E LCGI ++SYP A
Sbjct: 294 VKNSWGTEWGEEGYIRMQRGVDSEEALCGIAVQASYPSA 332
>gi|26452046|dbj|BAC43113.1| putative cysteine proteinase RD21A precursor [Arabidopsis thaliana]
Length = 362
Score = 279 bits (714), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 142/316 (44%), Positives = 202/316 (63%), Gaps = 12/316 (3%)
Query: 39 HEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSD 98
+E V ++E+W+ ++ ++Y EKE R KIFK+NL+++++ N +RT+++G +F+D
Sbjct: 36 NETEVRLMYEQWLVENRKNYNGLGEKERRFKIFKDNLKFVDEHNSVPDRTFEVGLTRFAD 95
Query: 99 LTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECG 158
LTN+EFRA+Y KM + S + + Y+ + +P +DWR GAV +K+Q CG
Sbjct: 96 LTNEEFRAIYLRKKMER-NKDSVKTERYLYKEGDV--LPDEVDWRANGAVVSVKDQGNCG 152
Query: 159 CCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFAYIIQNQGI 217
CWAF+AV AVEGI +I +G LI LSEQ+L+DC N GC GG AF +I++N GI
Sbjct: 153 SCWAFSAVGAVEGINQITTGELISLSEQELVDCDRGFVNAGCDGGIMNYAFEFIMKNGGI 212
Query: 218 ATEDEYPYQAVP-GTCSAAQ--KPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYS 274
T+ +YPY A G C+A + I YE+VP DE++L KAV+ QPVS+AI A S
Sbjct: 213 ETDQDYPYNANDLGLCNADKNNNTRVVTIDGYEDVPRDDEKSLKKAVAHQPVSVAIEASS 272
Query: 275 TEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD- 333
FQ YK G+ G CG LDH V +VG+G+T G +YW+I+NSWG WGD+GY+K+ R+
Sbjct: 273 QAFQLYKSGVMTGTCGISLDHGVVVVGYGSTS-GEDYWIIRNSWGLNWGDSGYVKLQRNI 331
Query: 334 ---EGLCGIGTRSSYP 346
G CGI SYP
Sbjct: 332 DDPFGKCGIAMMPSYP 347
>gi|90399361|emb|CAJ86180.1| H0212B02.7 [Oryza sativa Indica Group]
Length = 470
Score = 279 bits (713), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 146/336 (43%), Positives = 202/336 (60%), Gaps = 24/336 (7%)
Query: 32 VVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN---KEGNRT 88
+VS E+ ++ +W A+HG++Y E+E R F++NL YI++ N G +
Sbjct: 25 IVSYGERSEEEARRLYAEWKAEHGKNYNAVGEEERRYAAFRDNLRYIDEHNAAADAGVHS 84
Query: 89 YKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAV 148
++LG N+F+DLTN+E+R Y G + R + N ++ P S+DWR KGAV
Sbjct: 85 FRLGLNRFADLTNEEYRDTYLGLRNKPRRERKVSDRYLAADNEAL---PESVDWRTKGAV 141
Query: 149 TPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAF 208
IK+Q CG CWAF+A+AAVEGI +I +G+LI LSEQ+L+DC T+ N GC GG + AF
Sbjct: 142 AEIKDQGGCGSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDTSYNEGCNGGLMDYAF 201
Query: 209 AYIIQNQGIATEDEYPYQAVPGTCSAA------------QKPA-AAKISNYEEVPSGDEQ 255
+II N GI TED+YPY+ C QK A I +YE+V E
Sbjct: 202 DFIINNGGIDTEDDYPYKGKDERCDVNRVSFVFFAPLVFQKNAKVVTIDSYEDVTPNSET 261
Query: 256 ALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIK 315
+L KAV+ QPVS+AI A FQ Y GIF G CGT LDH V VG+G TE+G +YW+++
Sbjct: 262 SLQKAVANQPVSVAIEAGGRAFQLYSSGIFTGKCGTALDHGVAAVGYG-TENGKDYWIVR 320
Query: 316 NSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
NSWG +WG++GY+++ R+ G CGI SYPL
Sbjct: 321 NSWGKSWGESGYVRMERNIKASSGKCGIAVEPSYPL 356
>gi|242055323|ref|XP_002456807.1| hypothetical protein SORBIDRAFT_03g043220 [Sorghum bicolor]
gi|241928782|gb|EES01927.1| hypothetical protein SORBIDRAFT_03g043220 [Sorghum bicolor]
Length = 369
Score = 278 bits (711), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 136/317 (42%), Positives = 198/317 (62%), Gaps = 13/317 (4%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
++++ +++E+W H EK R FKEN+ +I NK G+R Y+L N+F D+
Sbjct: 35 DEALWDLYERWQTHHHVHRHHG-EKGRRFGTFKENVRFIHAHNKRGDRPYRLSLNRFGDM 93
Query: 100 TNDEFRALYTGYKM----PSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQK 155
+EFR+ + ++ + S + F Y + TD+P S+DWR +GAVT +K+Q
Sbjct: 94 GREEFRSTFADSRINDLRRAESPAAPAVPGFMYDGV--TDLPPSVDWRKEGAVTAVKDQG 151
Query: 156 ECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQ 215
CG CWAF+ V +VEGI IR+G+L+ LSEQ+L+DC T+ NGC GG E AF +I
Sbjct: 152 HCGSCWAFSTVVSVEGINAIRTGSLVSLSEQELIDCDTD-ENGCQGGLMENAFEFIKSYG 210
Query: 216 GIATEDEYPYQAVPGTCSA--AQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAY 273
G+ TE YPY+A GTC + +++ I ++ VP+G E AL KAV+ QPVS+AI A
Sbjct: 211 GVTTESAYPYRASNGTCDSVRSRRGQIVSIDGHQMVPTGSEDALAKAVANQPVSVAIDAG 270
Query: 274 STEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR- 332
FQ Y EG+F G CGT LDH V VG+G ++DG YW++KNSWG +WG+ GY+++ R
Sbjct: 271 GQAFQFYSEGVFTGDCGTDLDHGVAAVGYGVSDDGTAYWIVKNSWGPSWGEGGYIRMQRG 330
Query: 333 --DEGLCGIGTRSSYPL 347
+ GLCGI +S+P+
Sbjct: 331 AGNGGLCGIAMEASFPI 347
>gi|357130490|ref|XP_003566881.1| PREDICTED: actinidain-like [Brachypodium distachyon]
Length = 350
Score = 278 bits (710), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 140/330 (42%), Positives = 197/330 (59%), Gaps = 7/330 (2%)
Query: 21 IITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEK 80
++ L+ + V+++ +V HE+WMA+ GR Y D EK R +F N Y++
Sbjct: 14 LLVLVATAVFHAVAAQGEAGLTVAARHEQWMAKFGRVYTDANEKARRQAVFGANARYVDA 73
Query: 81 ANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSL 140
N+ GNRTY LG N+FSDLT++EF + GY+ P + + L+ ++P S
Sbjct: 74 VNRAGNRTYTLGLNEFSDLTDNEFAKTHLGYREFRPETANISKGVDPGYGLA-GNIPKSF 132
Query: 141 DWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCL 200
DWR KGAVT +K+Q CGCCWAFAAVAA EG+ KI G LI +SEQQ+LDC+T GNN C
Sbjct: 133 DWRTKGAVTEVKSQGGCGCCWAFAAVAATEGLVKIAKGTLISMSEQQVLDCTT-GNNTCK 191
Query: 201 GGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKP-AAAKISNYEEVP-SGDEQALL 258
GG A +Y+ + G+ TE++Y Y A G C P A + + E +P G+E L
Sbjct: 192 GGYMNDALSYVFASGGLQTEEDYEYNAEKGACRRDVTPNPATSVGHAEYMPLDGNEFLLQ 251
Query: 259 KAVSMQPVSIAIAAYSTEFQSYKEGIFNG--VCGTQLDHAVTIVGFGTTEDGAN-YWLIK 315
K V+ QPV +A+ AY T+F++Y G+F G CG LDH T+VG+G + G YWL+K
Sbjct: 252 KLVARQPVVVAVEAYGTDFKNYGGGVFTGSPSCGQNLDHFFTVVGYGFADGGKQMYWLVK 311
Query: 316 NSWGNTWGDAGYMKIVRDEGLCGIGTRSSY 345
N WG +WG++GYM+I R G ++Y
Sbjct: 312 NQWGTSWGESGYMRIARGSSARNCGMTNNY 341
>gi|2160175|gb|AAB60738.1| Strong similarity to Dianthus cysteine proteinase (gb|U17135)
[Arabidopsis thaliana]
Length = 416
Score = 278 bits (710), Expect = 4e-72, Method: Compositional matrix adjust.
Identities = 139/318 (43%), Positives = 204/318 (64%), Gaps = 18/318 (5%)
Query: 43 VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTND 102
+ E+ + W +HG++Y E E++ R++IFK+N +++ + N N TY L N F+DLT+
Sbjct: 26 ISELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADLTHH 85
Query: 103 EFRALYTGYKMPSPSHRSTTSSTFKYQNLS-MTDVPTSLDWRDKGAVTPIKNQKECGCCW 161
EF+A G + +PS + K Q+L VP S+DWR KGAVT +K+Q CG CW
Sbjct: 86 EFKASRLGLSVSAPSVIMAS----KGQSLGGSVKVPDSVDWRKKGAVTNVKDQGSCGACW 141
Query: 162 AFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATED 221
+F+A A+EGI +I +G+LI LSEQ+L+DC + N GC GG + AF ++I+N GI TE
Sbjct: 142 SFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNHGIDTEK 201
Query: 222 EYPYQAVPGTCSAAQ-KPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAA-------Y 273
+YPYQ GTC + K I +Y V S DE+AL++AV+ QPVS+ I Y
Sbjct: 202 DYPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALMEAVAAQPVSVGICGSERAFQLY 261
Query: 274 STEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD 333
S++F +GIF+G C T LDHAV IVG+G +++G +YW++KNSWG +WG G+M + R+
Sbjct: 262 SSKFYLLMQGIFSGPCSTSLDHAVLIVGYG-SQNGVDYWIVKNSWGKSWGMDGFMHMQRN 320
Query: 334 ----EGLCGIGTRSSYPL 347
+G+CGI +SYP+
Sbjct: 321 TENSDGVCGINMLASYPI 338
>gi|302812789|ref|XP_002988081.1| hypothetical protein SELMODRAFT_183539 [Selaginella moellendorffii]
gi|300144187|gb|EFJ10873.1| hypothetical protein SELMODRAFT_183539 [Selaginella moellendorffii]
Length = 425
Score = 278 bits (710), Expect = 4e-72, Method: Compositional matrix adjust.
Identities = 139/309 (44%), Positives = 193/309 (62%), Gaps = 15/309 (4%)
Query: 50 WMAQHGRSYKDELE-KEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALY 108
W A+ G+ + R + FKEN YIE+ N+ G +Y+LG NQFSDLT++EFR +
Sbjct: 16 WCAKFGKECASSNSLGDHRFETFKENFRYIEEHNRAGKHSYRLGLNQFSDLTSEEFRQRF 75
Query: 109 TGYK---MPSPSHRSTTSSTFK--YQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAF 163
G + + SP + S + +QN+ D+P S+DWR GAVT K+Q CG CWAF
Sbjct: 76 LGLRPDLIDSPVLKMPRDSDIEEGFQNV---DLPASVDWRQHGAVTAPKDQGSCGGCWAF 132
Query: 164 AAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEY 223
A A+EGI +I +G L+ LSEQ+L+DC + GC GG E A+ +I++N G+ TE +Y
Sbjct: 133 ATTGAIEGINQIVTGQLVSLSEQELIDCDKKADKGCDGGLMENAYQFIVENGGLDTETDY 192
Query: 224 PYQAVPGTCSAAQ-KPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKE 282
PY A C+ + I Y+ +P GDEQALL AV+ QPVS+AI S +FQ Y
Sbjct: 193 PYHASESHCNMKKLNSRVVAIDGYKAIPEGDEQALLLAVAKQPVSVAIEGASKDFQHYAS 252
Query: 283 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE----GLCG 338
G+F G CG +++H V IVG+G TEDG +YW++KNSW TWGD G++K+ R+ GLC
Sbjct: 253 GVFTGHCGEEINHGVLIVGYG-TEDGLDYWIVKNSWAATWGDGGFVKMQRNTGKRGGLCS 311
Query: 339 IGTRSSYPL 347
I T +SYP+
Sbjct: 312 INTLASYPV 320
>gi|57118009|gb|AAW34136.1| cysteine protease gp3a [Zingiber officinale]
Length = 475
Score = 277 bits (709), Expect = 4e-72, Method: Compositional matrix adjust.
Identities = 153/344 (44%), Positives = 214/344 (62%), Gaps = 16/344 (4%)
Query: 13 INTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFK 72
++ P I+TL A + RS E + I+++W +H + D+ + RL++FK
Sbjct: 21 VSVVPPLDILTLSKQ-AWAAPAGRSDEEVRI--IYQEWRVKHRPAENDQYVGDYRLEVFK 77
Query: 73 ENLEYIEKANKEGNR---TYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQ 129
ENL ++++ N +R Y+LG N+F+DLTN+E+RA + + S RST+
Sbjct: 78 ENLRFVDEHNAAADRGEHAYRLGMNRFADLTNEEYRARFL--RDLSRLGRSTSGEISNQY 135
Query: 130 NLSMTDV-PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQL 188
L DV P S+DWR+KGAV +KNQ CG CWAFAA+AAVEGI +I +G+LI LSEQQL
Sbjct: 136 RLREGDVLPDSIDWREKGAVVAVKNQGRCGSCWAFAAIAAVEGINQIVTGDLISLSEQQL 195
Query: 189 LDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYE 247
+DCST N GC GG +AF YII N G+ +E+ YPY GTC+ ++ A I +Y
Sbjct: 196 VDCSTR-NYGCEGGWPYRAFQYIINNGGVNSEEHYPYTGTNGTCNTTKENAHVVSIDSYR 254
Query: 248 EVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTED 307
VPS DE++L KA + QP+S+ I A FQ Y GIF G C T L+H VT+VG+G TE+
Sbjct: 255 NVPSNDEKSLQKAAANQPISVGIDASGRNFQLYHSGIFTGSCNTSLNHGVTVVGYG-TEN 313
Query: 308 GANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
G +YW++KNSWG WG++GY+ + R+ G CGI SYP+
Sbjct: 314 GNDYWIVKNSWGENWGNSGYILMERNIAESSGKCGIAISPSYPI 357
>gi|1514953|dbj|BAA11170.1| cysteine proteinase [Oryza sativa (japonica cultivar-group)]
Length = 368
Score = 277 bits (708), Expect = 6e-72, Method: Compositional matrix adjust.
Identities = 140/329 (42%), Positives = 194/329 (58%), Gaps = 14/329 (4%)
Query: 28 CASQVVSSRSTH-EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGN 86
CA+ R ++++ +++E+W H + EK R FK+N+ YI + NK
Sbjct: 26 CAAIPFDERDLESDEALWDLYERWQEHH-HVPRHHGEKHRRFGAFKDNVRYIHEHNKRAP 84
Query: 87 RTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSST---FKYQNLSMTDVPTSLDWR 143
L N+F D+ +EFRA + G + F Y+ + D+P ++DWR
Sbjct: 85 GYAPL--NRFGDMGREEFRATFAGSHANDLRRDGLAAPPLPGFMYEGVR--DLPRAVDWR 140
Query: 144 DKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGS 203
KGAVT +K+Q +CG CWAF+ V +VEGI IR+G L+ LSEQ+L+DC T N+GC GG
Sbjct: 141 RKGAVTGVKDQGKCGSCWAFSTVVSVEGINAIRTGRLVSLSEQELIDCDTADNSGCQGGL 200
Query: 204 REKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQ-KPAAAKISNYEEVPSGDEQALLKAVS 262
E AF YI + GI TE YPY+A GTC A + + I ++ VP+ E AL KAV+
Sbjct: 201 MENAFEYIKHSGGITTESAYPYRAANGTCDAVRARGGLVVIDGHQNVPANSEAALAKAVA 260
Query: 263 MQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTW 322
QPVS+AI A FQ Y +G+F G CGT LDH V +VG+G T DG YW++KNSWG W
Sbjct: 261 NQPVSVAIDAGDQSFQFYSDGVFAGDCGTDLDHGVAVVGYGETNDGTEYWIVKNSWGTAW 320
Query: 323 GDAGYMKIVRDE----GLCGIGTRSSYPL 347
G+ GY+++ RD GLCGI +SYP+
Sbjct: 321 GEGGYIRMQRDSGYDGGLCGIAMEASYPV 349
>gi|359359166|gb|AEV41071.1| putative oryzain beta chain precursor [Oryza minuta]
Length = 464
Score = 276 bits (707), Expect = 8e-72, Method: Compositional matrix adjust.
Identities = 140/307 (45%), Positives = 209/307 (68%), Gaps = 12/307 (3%)
Query: 47 HEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN-KEGNRTYKLGTNQFSDLTNDEFR 105
++ W+A++GRSY E E R ++F +NL + + N + + ++LG N+F+DLTN+EFR
Sbjct: 53 YDLWLAENGRSYNALGEHERRFRVFWDNLRFADAHNARADDHGFRLGMNRFADLTNEEFR 112
Query: 106 ALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAA 165
A + G K+ S ++ +Y++ + ++P S+DWR+KGAV P+KNQ +CG CWAF+A
Sbjct: 113 ATFLGAKVVERSR----AAGERYRHDGVEELPESVDWREKGAVAPVKNQGQCGSCWAFSA 168
Query: 166 VAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGS-REKAFAYIIQNQGIATEDEYP 224
V+ VE I ++ +G +I LSEQ+L++CSTNG NG G + AF +II+N GI TED+YP
Sbjct: 169 VSTVESINQLVTGEMITLSEQELVECSTNGQNGGCNGGLMDDAFDFIIKNGGIDTEDDYP 228
Query: 225 YQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEG 283
Y+AV G C ++ A I +E+VP DE++L KAV+ QPVS+AI A EFQ Y G
Sbjct: 229 YKAVDGKCDINRENAKVVSIDGFEDVPQNDEKSLQKAVAHQPVSVAIEAGGREFQLYHSG 288
Query: 284 IFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGI 339
+F+G CGT LDH V VG+G T++G +YW+++NSWG WG++GY+++ R+ G CGI
Sbjct: 289 VFSGRCGTSLDHGVVAVGYG-TDNGKDYWIVRNSWGPKWGESGYVRMERNINVTTGKCGI 347
Query: 340 GTRSSYP 346
+SYP
Sbjct: 348 AMMASYP 354
>gi|4426617|gb|AAD20453.1| cysteine endopeptidase precursor [Oryza sativa]
Length = 368
Score = 276 bits (707), Expect = 8e-72, Method: Compositional matrix adjust.
Identities = 140/329 (42%), Positives = 194/329 (58%), Gaps = 14/329 (4%)
Query: 28 CASQVVSSRSTH-EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGN 86
CA+ R ++++ +++E+W H + EK R FK+N+ YI + NK
Sbjct: 26 CAAIPFDERDLESDEALWDLYERWQEHH-HVPRHHGEKHRRFGAFKDNVRYIHEHNKRAP 84
Query: 87 RTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSST---FKYQNLSMTDVPTSLDWR 143
L N+F D+ +EFRA + G + F Y+ + D+P ++DWR
Sbjct: 85 GYPPL--NRFGDMGREEFRATFAGSHANDLRRDGLAAPPLPGFMYEGVR--DLPRAVDWR 140
Query: 144 DKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGS 203
KGAVT +K+Q +CG CWAF+ V +VEGI IR+G L+ LSEQ+L+DC T N+GC GG
Sbjct: 141 RKGAVTGVKDQGKCGSCWAFSTVVSVEGINAIRTGRLVSLSEQELIDCDTADNSGCQGGL 200
Query: 204 REKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQ-KPAAAKISNYEEVPSGDEQALLKAVS 262
E AF YI + GI TE YPY+A GTC A + + I ++ VP+ E AL KAV+
Sbjct: 201 MENAFEYIKHSGGITTESAYPYRAANGTCDAVRARGGLVVIDGHQNVPANSEAALAKAVA 260
Query: 263 MQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTW 322
QPVS+AI A FQ Y +G+F G CGT LDH V +VG+G T DG YW++KNSWG W
Sbjct: 261 NQPVSVAIDAGDQSFQFYSDGVFAGDCGTDLDHGVAVVGYGETNDGTEYWIVKNSWGTAW 320
Query: 323 GDAGYMKIVRDE----GLCGIGTRSSYPL 347
G+ GY+++ RD GLCGI +SYP+
Sbjct: 321 GEGGYIRMQRDSGYDGGLCGIAMEASYPV 349
>gi|255635584|gb|ACU18142.1| unknown [Glycine max]
Length = 345
Score = 276 bits (707), Expect = 9e-72, Method: Compositional matrix adjust.
Identities = 136/309 (44%), Positives = 204/309 (66%), Gaps = 10/309 (3%)
Query: 43 VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTND 102
++E+ E WM++HG+ Y+ EK +R +IFK+NL++I++ NK + Y LG N+F+DL++
Sbjct: 43 LIELFESWMSKHGKIYQSIEEKLLRFEIFKDNLKHIDERNKVVS-NYWLGLNEFADLSHQ 101
Query: 103 EFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWA 162
EF+ Y G K+ S R + F Y+++ ++P S+DWR KGAV P+KNQ CG CWA
Sbjct: 102 EFKNKYLGLKVDY-SRRRESPEEFTYKDV---ELPKSVDWRKKGAVAPVKNQGSCGSCWA 157
Query: 163 FAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDE 222
F+ VAAVEGI +I +GNL LSEQ+L+DC +NGC GG + AF++I++N G+ E++
Sbjct: 158 FSTVAAVEGINQIVTGNLTSLSEQELIDCDRTYSNGCNGGLMDYAFSFIVENGGLHKEED 217
Query: 223 YPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYK 281
YPY GTC ++ IS Y +VP +EQ+LLKA++ Q +S+AI A +FQ Y
Sbjct: 218 YPYIMEEGTCEMTKEETEVVTISGYHDVPQNNEQSLLKALANQSLSVAIEASGRDFQFYS 277
Query: 282 EGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKI---VRDEGLCG 338
G+F+G CG+ LDH V VG+GT + G +Y ++KNSWG+ WG+ GY+++ + G
Sbjct: 278 GGVFDGHCGSDLDHGVAAVGYGTAK-GVDYIIVKNSWGSKWGEKGYIRMRGTLETRGNLR 336
Query: 339 IGTRSSYPL 347
+SYPL
Sbjct: 337 YLQMASYPL 345
>gi|57118011|gb|AAW34137.1| cysteine protease gp3b [Zingiber officinale]
Length = 466
Score = 276 bits (706), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 146/317 (46%), Positives = 205/317 (64%), Gaps = 13/317 (4%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNR---TYKLGTNQF 96
++ V I+++W A+H + D+ + RL++FKENL ++++ N +R Y+LG N+F
Sbjct: 36 DEEVRIIYQEWRAKHRPAENDQYVGDYRLEVFKENLRFVDEHNAAADRGEHAYRLGMNRF 95
Query: 97 SDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDV-PTSLDWRDKGAVTPIKNQK 155
+DLTN+E+RA + + S RST+ L DV P S+DWR+KGAV +K+Q
Sbjct: 96 ADLTNEEYRARFL--RDLSRLGRSTSGEISNQYRLREGDVLPDSIDWREKGAVVAVKSQG 153
Query: 156 ECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQ 215
CG CWAFAA+A VEGI +I +G+LI LSEQQL+DCST N+GC GG +AF YII N
Sbjct: 154 RCGSCWAFAAIATVEGINQIVTGDLISLSEQQLVDCSTR-NHGCEGGWPYRAFQYIINNG 212
Query: 216 GIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYS 274
G+ +E+ YPY GTC+ + A I +Y VPS DE++L KAV+ QP+S+ I A
Sbjct: 213 GVNSEEHYPYTGTNGTCNTTKGNAHVVSIDSYRNVPSNDEKSLQKAVANQPISVGINASG 272
Query: 275 TEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD- 333
FQ Y GIF G C T L+H VT+VG+GT +G +YW++KNSWG +WGD+GY+ + R+
Sbjct: 273 RNFQLYHSGIFTGSCNTSLNHGVTVVGYGTV-NGNDYWIVKNSWGESWGDSGYILMERNI 331
Query: 334 ---EGLCGIGTRSSYPL 347
G CGI SYP+
Sbjct: 332 AESSGKCGIAISPSYPI 348
>gi|4469155|emb|CAB38315.1| chymopapain isoform III [Carica papaya]
Length = 361
Score = 276 bits (705), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 142/318 (44%), Positives = 199/318 (62%), Gaps = 16/318 (5%)
Query: 38 THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
T + ++++ + WM +H + Y+ EK R +IF++NL YI++ NK+ N +Y LG N F+
Sbjct: 39 TSIERLIQLFDSWMLKHNKIYESIDEKIYRFEIFRDNLMYIDETNKK-NNSYWLGLNGFA 97
Query: 98 DLTNDEFRALYTGY---KMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQ 154
DL+NDEF+ Y G+ H T+K+ +T+ P S+DWR KGAVTP+KNQ
Sbjct: 98 DLSNDEFKKKYVGFVAEDFTGLEHFDNEDFTYKH----VTNYPQSIDWRAKGAVTPVKNQ 153
Query: 155 KECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQN 214
CG CWAF+ +A VEGI KI +GNL++LSEQ+L+DC + + GC GG + + Y + N
Sbjct: 154 GACGSCWAFSTIATVEGINKIVTGNLLELSEQELVDCDKH-SYGCKGGYQTTSLQY-VAN 211
Query: 215 QGIATEDEYPYQAVPGTCSAAQKPAA-AKISNYEEVPSGDEQALLKAVSMQPVSIAIAAY 273
G+ T YP QA C A KP KI+ Y+ VPS E + L A++ QP+S + A
Sbjct: 212 NGVHTSKVYPCQAKQYKCRATDKPGPKVKITGYKRVPSNCETSFLGALANQPLSFLVEAG 271
Query: 274 STEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR- 332
FQ YK G+F+G CGT+LDHAVT VG+GT+ DG NY +IKNSWG WG+ GYM++ R
Sbjct: 272 GKPFQLYKSGVFDGPCGTKLDHAVTAVGYGTS-DGKNYIIIKNSWGPNWGEKGYMRLKRQ 330
Query: 333 ---DEGLCGIGTRSSYPL 347
+G CG+ S YP
Sbjct: 331 SGNSQGTCGVYKSSYYPF 348
>gi|224081756|ref|XP_002306486.1| predicted protein [Populus trichocarpa]
gi|222855935|gb|EEE93482.1| predicted protein [Populus trichocarpa]
Length = 352
Score = 276 bits (705), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 139/311 (44%), Positives = 197/311 (63%), Gaps = 9/311 (2%)
Query: 44 VEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDE 103
+ +++ W+A+HG++Y E+ R +IFK NL +I++ N + N TYK+G +F+DLTN+E
Sbjct: 1 MSMYKWWLAKHGKAYNGLGEEAERFEIFKNNLRFIDEHNSQ-NHTYKVGLTKFADLTNEE 59
Query: 104 FRALYTGYKMPSPSH-RSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWA 162
+RA++ G + + + S + +Y + +P S+DWR KGAV PIK+Q CG CWA
Sbjct: 60 YRAMFLGTRSDAKRRLMKSKSPSERYAFKAGDKLPESVDWRAKGAVNPIKDQGSCGSCWA 119
Query: 163 FAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDE 222
F+ VAAVEGI +I +G LI LSEQ+L+DC N GC GG + AF +II N G+ TE +
Sbjct: 120 FSTVAAVEGINQIVTGELISLSEQELVDCDRTYNAGCNGGLMDYAFQFIINNGGLDTEKD 179
Query: 223 YPYQA-VPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYK 281
YPY K A I +E+V DE+AL KAV+ QPVS+AI A Q Y+
Sbjct: 180 YPYVGDDDKCDKDKMKTKAVSIDGFEDVLPYDEKALQKAVAHQPVSVAIEASGMALQFYQ 239
Query: 282 EGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-----EGL 336
G+F G CGT LDH V +VG+ +E+G +YWL++NSWG WG+ GY+K+ R+ G
Sbjct: 240 SGVFTGECGTALDHGVVVVGY-ASENGLDYWLVRNSWGTEWGEHGYIKMQRNVGDTYTGR 298
Query: 337 CGIGTRSSYPL 347
CGI SSYP+
Sbjct: 299 CGIAMESSYPV 309
>gi|944916|gb|AAA74430.1| cysteine proteinase [Mesembryanthemum crystallinum]
Length = 367
Score = 275 bits (704), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 147/318 (46%), Positives = 207/318 (65%), Gaps = 22/318 (6%)
Query: 40 EQSVVEIHEKWMAQH--GRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
++++ +++E+W + + RS+ EK+ R +FKEN++YI + NK ++ YKL NQF
Sbjct: 37 DETLWDLYERWRSVYTSARSFG---EKQNRFHVFKENVKYINEVNKM-DKPYKLRLNQFG 92
Query: 98 DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKEC 157
DLT EF Y K+ + S F Y+N+ +VP S+DWR KGAVTP+KNQ C
Sbjct: 93 DLTPSEFARTYANSKIIEGTRNE--SGGFMYENV---EVPRSIDWRVKGAVTPVKNQGRC 147
Query: 158 GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
G CWAF+A AAVEGI +I +G LI LSEQQL+DC T N+GC GG+ +AF YI Q GI
Sbjct: 148 GGCWAFSAAAAVEGINQITTGQLISLSEQQLIDCDTQ-NSGCRGGTMGRAFEYIKQRGGI 206
Query: 218 ATEDEYPYQAVPGTC--SAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAY-- 273
+E YPY+A G C + Q+P + I Y + E A+LK ++ QPVS+A+ A
Sbjct: 207 TSEANYPYKAQAGMCKNNLIQRPTVS-IDGYYNIRR-SEDAVLKILAHQPVSVAVDATTW 264
Query: 274 -STEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR 332
S ++ Y +G+F G CGT+L+H VT VG+GTT DG +YW+IKNSWG TWG+ GYM+++R
Sbjct: 265 SSLDWMFYFQGVFTGPCGTKLNHGVTAVGYGTTNDGYDYWIIKNSWGETWGERGYMRMLR 324
Query: 333 D---EGLCGIGTRSSYPL 347
GLCGI ++S+P+
Sbjct: 325 GVSPYGLCGIAMQASFPI 342
>gi|356509992|ref|XP_003523725.1| PREDICTED: oryzain alpha chain-like [Glycine max]
Length = 439
Score = 275 bits (704), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 138/313 (44%), Positives = 202/313 (64%), Gaps = 16/313 (5%)
Query: 45 EIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNR-----TYKLGTNQFSDL 99
E+ EKW +H ++Y E EK RLK+F++N ++ + N+ N +Y L N F+DL
Sbjct: 31 ELFEKWCKEHSKTYSSEEEKLYRLKVFEDNYAFVAQHNQNANNNNNNSSYTLSLNAFADL 90
Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGC 159
T+ EF+ G + + + Q+ + +P+ +DWR GAVTP+K+Q CG
Sbjct: 91 THHEFKTTRLGLPLTLLRFKRPQNQ----QSRDLLHIPSQIDWRQSGAVTPVKDQASCGA 146
Query: 160 CWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIAT 219
CWAF+A A+EGI KI +G+L+ LSEQ+L+DC T+ N+GC GG + A+ ++I N+GI T
Sbjct: 147 CWAFSATGAIEGINKIVTGSLVSLSEQELIDCDTSYNSGCGGGLMDFAYQFVIDNKGIDT 206
Query: 220 EDEYPYQAVPGTCSAAQ-KPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQ 278
ED+YPYQA +CS + K A I +Y +VP +E+ +LKAV+ QPVS+ I EFQ
Sbjct: 207 EDDYPYQARQRSCSKDKLKRRAVTIEDYVDVPPSEEE-ILKAVASQPVSVGICGSEREFQ 265
Query: 279 SYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----E 334
Y +GIF G C T LDHAV IVG+G +E+G +YW++KNSWG WG GY+ ++R+ +
Sbjct: 266 LYSKGIFTGPCSTFLDHAVLIVGYG-SENGVDYWIVKNSWGKYWGMNGYIHMIRNSGNSK 324
Query: 335 GLCGIGTRSSYPL 347
G+CGI T +SYP+
Sbjct: 325 GICGINTLASYPV 337
>gi|90265242|emb|CAH67695.1| H0624F09.3 [Oryza sativa Indica Group]
Length = 494
Score = 275 bits (703), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 139/295 (47%), Positives = 194/295 (65%), Gaps = 14/295 (4%)
Query: 63 EKEMRLKIFKENLEYIEKANKEGNRT--YKLGTNQFSDLTNDEFRALYTGYKMPSPSHRS 120
E E R ++F +NL++++ N + ++LG N+F+DLTN EFRA Y G P+ R
Sbjct: 84 EHERRFRVFWDNLKFVDAHNARADERGGFRLGMNRFADLTNGEFRATYLG-TTPAGRGRR 142
Query: 121 TTSSTFKYQNLSMTDVPTSLDWRDKGAVT-PIKNQKECGCCWAFAAVAAVEGITKIRSGN 179
+ Y++ + +P S+DWRDKGAV P+KNQ +CG CWAF+AVAAVEGI KI +G
Sbjct: 143 VGEA---YRHDGVEALPDSVDWRDKGAVVAPVKNQGQCGSCWAFSAVAAVEGINKIVTGE 199
Query: 180 LIQLSEQQLLDCSTNG-NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKP 238
L+ LSEQ+L++C+ NG N+GC GG + AFA+I +N G+ TE++YPY A+ G C+ A++
Sbjct: 200 LVSLSEQELVECARNGQNSGCNGGIMDDAFAFIARNGGLDTEEDYPYTAMDGKCNLAKRS 259
Query: 239 -AAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAV 297
I +E+VP DE +L KAV+ QPVS+AI A EFQ Y G+F G CGT LDH V
Sbjct: 260 RKVVSIDGFEDVPENDELSLQKAVAHQPVSVAIDAGGREFQLYDSGVFTGRCGTNLDHGV 319
Query: 298 TIVGFGT-TEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
VG+GT GA YW ++NSWG WG+ GY+++ R+ G CGI +SYP+
Sbjct: 320 VAVGYGTDAATGAAYWTVRNSWGPDWGENGYIRMERNVTARTGKCGIAMMASYPI 374
>gi|641905|gb|AAC49406.1| cysteine proteinase [Zinnia violacea]
Length = 342
Score = 275 bits (703), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 131/296 (44%), Positives = 197/296 (66%), Gaps = 6/296 (2%)
Query: 41 QSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLT 100
V+ + E + +H + Y+ EK R +IF +NL++I++ NK+ + Y LG N+F+DLT
Sbjct: 43 HKVIHLFESSLVKHSKIYESFDEKLHRFEIFMDNLKHIDETNKKVS-NYWLGLNEFADLT 101
Query: 101 NDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCC 160
++EF+ + G+K + + F+Y++ D+P S+DWR KGAV+P+KNQ +CG C
Sbjct: 102 HEEFKNKFLGFKGELAERKDESIEQFRYRDF--VDLPKSVDWRKKGAVSPVKNQGQCGSC 159
Query: 161 WAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATE 220
WAF+ VAAVEGI +I +GNL LSEQ+L+DC T NNGC GG + AFAY+ +N G+ E
Sbjct: 160 WAFSTVAAVEGINQIVTGNLTVLSEQELIDCDTTFNNGCNGGLMDYAFAYVTRN-GLHKE 218
Query: 221 DEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQS 279
+EYPY GTC + + IS Y +VP +E + LKA++ QP+S+AI A +FQ
Sbjct: 219 EEYPYIMSEGTCDEKRDASEKVTISGYHDVPRNNEDSFLKALANQPISVAIEASGRDFQF 278
Query: 280 YKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDEG 335
Y G+F+G CGT+LDH V VG+GT++ G +Y +++NSWG WG+ GY+++ R+ G
Sbjct: 279 YSGGVFDGHCGTELDHGVAAVGYGTSK-GLDYVIVRNSWGPKWGEKGYIRMKRNTG 333
>gi|386648112|gb|AFJ15103.1| mexicain-like cystein protease, partial [Jacaratia mexicana]
Length = 348
Score = 275 bits (703), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 145/315 (46%), Positives = 198/315 (62%), Gaps = 17/315 (5%)
Query: 38 THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
T + ++ + E WM +H R Y + EK R +IFK+NL YI++ NK+ N +Y LG N+F
Sbjct: 39 TSTERLIRLFESWMLKHDRVYNNIEEKIHRFEIFKDNLMYIDETNKK-NNSYWLGLNEFV 97
Query: 98 DLTNDEFRALYTG-YKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKE 156
DLT+DEF+ Y G + + F Y+++ D P S+DWRDKGAVTP+K
Sbjct: 98 DLTHDEFKEKYVGSIGEDFVTIEQSNDEEFPYKHV--VDYPESIDWRDKGAVTPVK-PNP 154
Query: 157 CGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQG 216
CG CWAF+ VA VEGI KI +G LI LSEQ+LLDC ++GC GG + + Y++ N G
Sbjct: 155 CGSCWAFSTVATVEGINKIVTGKLISLSEQELLDCDRR-SHGCKGGYQTTSLQYVVDN-G 212
Query: 217 IATEDEYPYQAVPGTCSAAQKPAA-AKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYST 275
+ TE EYPY+ G C A +K +I+ Y+ VP+ DE +L++A++ QPVS+ + +
Sbjct: 213 VHTEKEYPYEKKQGKCRAKEKKGTKVQITGYKRVPANDEISLIQAIANQPVSVLLESKGR 272
Query: 276 EFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR--- 332
FQ YK GIFNG CGT+LDHAVT +G+G T Y LIKNSWG WG+ GY+KI R
Sbjct: 273 AFQLYKGGIFNGPCGTKLDHAVTAIGYGKT-----YILIKNSWGPNWGEKGYLKIKRASG 327
Query: 333 -DEGLCGIGTRSSYP 346
EG CG+ S +P
Sbjct: 328 KSEGTCGVYKSSYFP 342
>gi|326494040|dbj|BAJ85482.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 355
Score = 275 bits (703), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 139/312 (44%), Positives = 188/312 (60%), Gaps = 14/312 (4%)
Query: 47 HEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRA 106
HE+WMA++GR Y D EK R ++F N +I+ N+ GNRTY LG N FSDLTN+EF
Sbjct: 41 HERWMAKYGRVYADAAEKLRRQEVFAANARHIDAVNRAGNRTYTLGLNHFSDLTNEEFAQ 100
Query: 107 LYTGYK-MPSPS----HRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCW 161
+ GY+ P P S+ ++ + + P S+DWR +GAVTP+K+Q CG CW
Sbjct: 101 THLGYRHQPGPGGLRPEDSSPAAAVNVTDAQLQSTPDSVDWRARGAVTPVKHQGHCGSCW 160
Query: 162 AFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATED 221
AFAAVAA EG+ +I +GNLI +SEQQ+LDC T G + C G A YI + G+ TE
Sbjct: 161 AFAAVAATEGLVQIATGNLISMSEQQVLDC-TGGTSSCKSGYVNAALTYITASGGLQTEA 219
Query: 222 EYPYQAVPGTC---SAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQ 278
Y Y A G C A+ AAA + + +GDE AL V+ QPV++A+ A +F
Sbjct: 220 AYAYSAEQGACRSGGASPNSAAAVGVHRSAMLNGDEGALQVLVAGQPVAVAVEA-EPDFH 278
Query: 279 SYKEGIFNG--VCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDEGL 336
YK G++ G CG +L HAVT+VG+G DG YW++KN WG WG+ GYM++ R G
Sbjct: 279 HYKSGVYVGSPSCGQKLHHAVTVVGYGADGDGQGYWVVKNQWGAGWGEVGYMRLTRGNGG 338
Query: 337 --CGIGTRSSYP 346
CG+ T + YP
Sbjct: 339 NNCGMATHAYYP 350
>gi|115461226|ref|NP_001054213.1| Os04g0670500 [Oryza sativa Japonica Group]
gi|62510688|sp|Q7XR52.2|CYSP1_ORYSJ RecName: Full=Cysteine protease 1; AltName: Full=OsCP1; Flags:
Precursor
gi|38345300|emb|CAE02828.2| OSJNBa0043A12.33 [Oryza sativa Japonica Group]
gi|113565784|dbj|BAF16127.1| Os04g0670500 [Oryza sativa Japonica Group]
gi|215741575|dbj|BAG98070.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 490
Score = 275 bits (703), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 139/295 (47%), Positives = 194/295 (65%), Gaps = 14/295 (4%)
Query: 63 EKEMRLKIFKENLEYIEKANKEGNRT--YKLGTNQFSDLTNDEFRALYTGYKMPSPSHRS 120
E E R ++F +NL++++ N + ++LG N+F+DLTN EFRA Y G P+ R
Sbjct: 84 EHERRFRVFWDNLKFVDAHNARADERGGFRLGMNRFADLTNGEFRATYLG-TTPAGRGRR 142
Query: 121 TTSSTFKYQNLSMTDVPTSLDWRDKGAVT-PIKNQKECGCCWAFAAVAAVEGITKIRSGN 179
+ Y++ + +P S+DWRDKGAV P+KNQ +CG CWAF+AVAAVEGI KI +G
Sbjct: 143 VGEA---YRHDGVEALPDSVDWRDKGAVVAPVKNQGQCGSCWAFSAVAAVEGINKIVTGE 199
Query: 180 LIQLSEQQLLDCSTNG-NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKP 238
L+ LSEQ+L++C+ NG N+GC GG + AFA+I +N G+ TE++YPY A+ G C+ A++
Sbjct: 200 LVSLSEQELVECARNGQNSGCNGGIMDDAFAFIARNGGLDTEEDYPYTAMDGKCNLAKRS 259
Query: 239 -AAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAV 297
I +E+VP DE +L KAV+ QPVS+AI A EFQ Y G+F G CGT LDH V
Sbjct: 260 RKVVSIDGFEDVPENDELSLQKAVAHQPVSVAIDAGGREFQLYDSGVFTGRCGTNLDHGV 319
Query: 298 TIVGFGT-TEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
VG+GT GA YW ++NSWG WG+ GY+++ R+ G CGI +SYP+
Sbjct: 320 VAVGYGTDAATGAAYWTVRNSWGPDWGENGYIRMERNVTARTGKCGIAMMASYPI 374
>gi|57118007|gb|AAW34135.1| cysteine protease gp2b [Zingiber officinale]
Length = 379
Score = 275 bits (702), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 144/332 (43%), Positives = 210/332 (63%), Gaps = 14/332 (4%)
Query: 24 LLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANK 83
L +S V RS E V ++ +W A++ + K E RL++FKENL++++K N
Sbjct: 30 LTLSKQGGAVPVRSDEE--VRMLYLEWRAKNHPAEKYLDLNEYRLEVFKENLQFVDKHNA 87
Query: 84 EGNR---TYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSS-TFKYQNLSMTDVPTS 139
+R T++LG N+F+DLTN+E+R + + S RS + + +Y+ D+P S
Sbjct: 88 AADRGEHTFRLGMNRFADLTNEEYRTRF--LRDFSRLRRSASGKISSRYRLREGDDLPDS 145
Query: 140 LDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGC 199
+DWR+KGAV P+KNQ CG CWAF+ VAAVEGI +I +G+LI LSEQQL+DC+T N+GC
Sbjct: 146 IDWREKGAVVPVKNQGGCGSCWAFSTVAAVEGINQIVTGDLISLSEQQLVDCTT-ANHGC 204
Query: 200 LGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLK 259
GG AF +I+ N GI +E+ YPY+ G C++ I +YE VPS +EQ+L K
Sbjct: 205 RGGWMNPAFQFIVNNGGINSEETYPYRGQNGICNSTVNAPVVSIDSYENVPSHNEQSLQK 264
Query: 260 AVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWG 319
AV+ QPVS+ + A +FQ Y+ GIF G C +HA+T+VG+GT D +Y +KNSWG
Sbjct: 265 AVANQPVSVTMDAAGRDFQLYRSGIFTGSCNISANHALTVVGYGTEND-KDYRTVKNSWG 323
Query: 320 NTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
WG++GY+++ R+ G CGI +SYP+
Sbjct: 324 KNWGESGYIRVERNIGNPNGKCGITRFASYPV 355
>gi|302142276|emb|CBI19479.3| unnamed protein product [Vitis vinifera]
Length = 388
Score = 274 bits (700), Expect = 5e-71, Method: Compositional matrix adjust.
Identities = 142/308 (46%), Positives = 193/308 (62%), Gaps = 41/308 (13%)
Query: 46 IHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFR 105
++E W+A+HG+SY EKE R +IFK+NL +I++ N E NRTYK+ +++++ D
Sbjct: 3 VYEAWLAKHGKSYNALGEKERRFQIFKDNLRFIDEHNAE-NRTYKI-SDRYAFRVGDS-- 58
Query: 106 ALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAA 165
+P S+DWR KGAV +K+Q CG CWAF+
Sbjct: 59 ------------------------------LPESVDWRKKGAVVEVKDQGSCGSCWAFST 88
Query: 166 VAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPY 225
+AAVEGI KI +G LI LSEQ+L+DC T+ N GC GG + AF +II N GI +E++YPY
Sbjct: 89 IAAVEGINKIVTGGLISLSEQELVDCDTSYNEGCNGGLMDYAFEFIINNGGIDSEEDYPY 148
Query: 226 QAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGI 284
+A G C +K A I YE+VP DE++L KAV+ QPVS+AI A EFQ Y+ GI
Sbjct: 149 KASDGRCDQYRKNAKVVTIDGYEDVPENDEKSLEKAVANQPVSVAIEAGGREFQLYQSGI 208
Query: 285 FNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-----EGLCGI 339
F G CGT LDH VT VG+G TE+G +YW++KNSWG +WG+ GY+++ RD G CGI
Sbjct: 209 FTGRCGTALDHGVTAVGYG-TENGVDYWIVKNSWGASWGEEGYIRMERDLATSATGKCGI 267
Query: 340 GTRSSYPL 347
+SYP+
Sbjct: 268 AMEASYPI 275
>gi|242094000|ref|XP_002437490.1| hypothetical protein SORBIDRAFT_10g028000 [Sorghum bicolor]
gi|241915713|gb|EER88857.1| hypothetical protein SORBIDRAFT_10g028000 [Sorghum bicolor]
Length = 372
Score = 274 bits (700), Expect = 6e-71, Method: Compositional matrix adjust.
Identities = 135/316 (42%), Positives = 198/316 (62%), Gaps = 12/316 (3%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQF 96
+ V ++E W ++HG + + +RL++F++NL YI+ N E G T++LG F
Sbjct: 45 DDEVRRMYEAWKSEHGHGHGSD--DRLRLEVFRDNLRYIDAHNAEADAGLHTFRLGLTPF 102
Query: 97 SDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKE 156
+DLT +E+R G++ S + D+P ++DWR+ GAVT +KNQ++
Sbjct: 103 ADLTLEEYRGRALGFRARRGGASRVGSGSSYRPRPRGGDLPDAIDWRELGAVTGVKNQEQ 162
Query: 157 CGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQG 216
CG CWAF+AVAA+EGI +I +GNL+ LSEQ+++DC T + GC GG + AF ++I N G
Sbjct: 163 CGGCWAFSAVAAIEGINEIVTGNLVSLSEQEIIDCDTQ-DGGCNGGEMQNAFQFVINNGG 221
Query: 217 IATEDEYPYQAVPGTCSAAQ-KPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYST 275
I TE +YPY C A + I + V + +E AL +AV+ QPVS+AI A
Sbjct: 222 IDTEADYPYLGTDAACDANRVNERVVTIDGFVSVATENETALQEAVANQPVSVAIDASGR 281
Query: 276 EFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-- 333
+FQ Y GIFNG CGTQLDH VT VG+G +E+G +YW++KNSW ++WG+AGY++I R+
Sbjct: 282 KFQHYTSGIFNGPCGTQLDHGVTAVGYG-SENGKDYWIVKNSWSSSWGEAGYIRIRRNVA 340
Query: 334 --EGLCGIGTRSSYPL 347
G CGI +SYP+
Sbjct: 341 AATGKCGIAMDASYPV 356
>gi|384253406|gb|EIE26881.1| hypothetical protein COCSUDRAFT_21961 [Coccomyxa subellipsoidea
C-169]
Length = 481
Score = 273 bits (698), Expect = 8e-71, Method: Compositional matrix adjust.
Identities = 136/306 (44%), Positives = 196/306 (64%), Gaps = 13/306 (4%)
Query: 50 WMAQHGRSYKDELEK-EMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALY 108
W+ ++YKD +E+ E + ++ +NLE++ N E + T+KLG F+DLT+DE+R
Sbjct: 51 WVEHLQKAYKDNVEEYERKFSVWLDNLEFVHSHN-EKDSTFKLGLTNFADLTHDEYRQHA 109
Query: 109 TGYKMPSPSHRSTTSSTFKYQNLSMTD--VPTSLDWRDKGAVTPIKNQKECGCCWAFAAV 166
GY+ P + T T K D P S+DWR KGAVT +KNQ++CG CWAF+
Sbjct: 110 LGYR---PELKGTGLGTGKSTGFQYADYEAPPSIDWRKKGAVTDVKNQQQCGSCWAFSTT 166
Query: 167 AAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQ 226
+VEG I SG L+ LSEQ+L+DC ++GC GG + AF++II+N GI TE +Y Y+
Sbjct: 167 GSVEGANAIYSGELVSLSEQELVDCDVTQDHGCHGGLMDFAFSFIIRNGGIDTEKDYKYK 226
Query: 227 AVPGTCS-AAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIF 285
A G C+ A +K I +YE+VP DE AL KA + QP+S+AI A EFQ Y G+F
Sbjct: 227 AQDGVCNIAKEKRHVVTIDSYEDVPPNDESALKKAAANQPISVAIEADQREFQLYAGGVF 286
Query: 286 NGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR----DEGLCGIGT 341
+ CGT LDH V +VG+G +++G +YW++KNSWG+ WGD+GY+++ R G CGI
Sbjct: 287 DAPCGTALDHGVLVVGYG-SDNGTDYWIVKNSWGDFWGDSGYIRLARGISNSAGQCGIAM 345
Query: 342 RSSYPL 347
++SYP+
Sbjct: 346 QASYPI 351
>gi|297843784|ref|XP_002889773.1| hypothetical protein ARALYDRAFT_471096 [Arabidopsis lyrata subsp.
lyrata]
gi|297335615|gb|EFH66032.1| hypothetical protein ARALYDRAFT_471096 [Arabidopsis lyrata subsp.
lyrata]
Length = 439
Score = 273 bits (698), Expect = 8e-71, Method: Compositional matrix adjust.
Identities = 139/311 (44%), Positives = 198/311 (63%), Gaps = 13/311 (4%)
Query: 45 EIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEF 104
E+ + W +HG++Y E E++ R++IFK+N +++ + N N TY L N F+DLT+ EF
Sbjct: 30 ELFDDWCQRHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADLTHHEF 89
Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLS-MTDVPTSLDWRDKGAVTPIKNQKECGCCWAF 163
+A G + + S + K Q+L VP S+DWR KGAVT +K+Q CG CW+F
Sbjct: 90 KASRLGLSVSASSLIMAS----KGQSLGGNAKVPDSVDWRKKGAVTNVKDQGSCGACWSF 145
Query: 164 AAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEY 223
+A A+EGI +I +G+LI LSEQ+L+DC + N GC GG + AF ++I+N GI TE +Y
Sbjct: 146 SATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNHGIDTEKDY 205
Query: 224 PYQAVPGTCSAAQ-KPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKE 282
PYQ GTC + K I +Y V S DE+AL +AV+ QPVS+ I FQ Y
Sbjct: 206 PYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALREAVAAQPVSVGICGSERAFQLYSR 265
Query: 283 --GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGL 336
GIF+G C T LDHAV IVG+G +++G +YW++KNSWG +WG G+M + R+ EG+
Sbjct: 266 VSGIFSGPCSTSLDHAVLIVGYG-SQNGVDYWIVKNSWGKSWGMDGFMHMQRNTGNSEGI 324
Query: 337 CGIGTRSSYPL 347
CGI +SYP+
Sbjct: 325 CGINMLASYPI 335
>gi|357154164|ref|XP_003576692.1| PREDICTED: vignain-like [Brachypodium distachyon]
Length = 427
Score = 273 bits (698), Expect = 8e-71, Method: Compositional matrix adjust.
Identities = 136/309 (44%), Positives = 188/309 (60%), Gaps = 11/309 (3%)
Query: 48 EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRAL 107
E+WM +HGR+Y + EK+ R +++KENL IE+ N G Y L N+F+DLTN+EFRA
Sbjct: 120 EQWMGKHGRAYANGGEKQRRFEVYKENLALIEEFNS-GGHGYTLTDNKFADLTNEEFRAK 178
Query: 108 YTGYKMPSPSHRSTTSSTFKYQNL----SMTDVPTSLDWRDKGAVTPIKNQKECGCCWAF 163
G P R L + TD+P +DWR KGAV +KNQ CG CWAF
Sbjct: 179 MLGGLGADPDRRRRARHASNALELPGNDNSTDLPKDVDWRKKGAVVEVKNQGSCGSCWAF 238
Query: 164 AAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEY 223
+AVAA+EG+ +I++G L+ LSEQ+L+DC GC GG AF +++ N G+ TE Y
Sbjct: 239 SAVAAMEGLNQIKNGKLVSLSEQELVDCDAEAV-GCAGGFMSWAFEFVMANHGLTTEASY 297
Query: 224 PYQAVPGTCSAAQ-KPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKE 282
PY+ + G C A+ ++ I+ Y V E LLK ++QPVS+A+ A FQ Y
Sbjct: 298 PYKGINGACQTAKLNESSVSITGYVNVTVNSEAELLKVAAVQPVSVAVDAGGFLFQLYAG 357
Query: 283 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE----GLCG 338
G+F+G C Q++H VT+VG+G T+ YW++KNSWG WG+AGYM + RD GLCG
Sbjct: 358 GVFSGPCTAQINHGVTVVGYGETDKAEKYWIVKNSWGPEWGEAGYMLMQRDAGVPTGLCG 417
Query: 339 IGTRSSYPL 347
I +SYP+
Sbjct: 418 IAMLASYPV 426
>gi|386648114|gb|AFJ15104.1| mexicain-like cystein protease, partial [Jacaratia mexicana]
Length = 323
Score = 273 bits (698), Expect = 9e-71, Method: Compositional matrix adjust.
Identities = 143/312 (45%), Positives = 199/312 (63%), Gaps = 16/312 (5%)
Query: 41 QSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLT 100
+ +V + E W ++ + YK+ EK R +IFK+NL YI++ NK+ N +Y LG N+F+DLT
Sbjct: 16 ERLVRLFESWTLENDKIYKNIDEKIYRFEIFKDNLMYIDETNKK-NSSYWLGLNEFADLT 74
Query: 101 NDEFRALYTG-YKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGC 159
+DEF+A Y G S + F Y+++ D P S+DWR KGAVTP+KNQ CG
Sbjct: 75 HDEFKAKYVGSLGEDSTIIEQSDDEEFPYKHV--VDYPESIDWRQKGAVTPVKNQNPCGS 132
Query: 160 CWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIAT 219
CWAF+ VA VEGI KI +G LI LSEQ+LLDC ++GC GG + + Y+ N G+ T
Sbjct: 133 CWAFSTVATVEGINKIVTGKLISLSEQELLDCDRR-SHGCKGGYQTTSLQYVADN-GVHT 190
Query: 220 EDEYPYQAVPGTCSAAQKPAA-AKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQ 278
E EYPY+ G C A K + KI+ Y+ VP+ +E +L++A++ QPVS+ + + FQ
Sbjct: 191 EKEYPYEKKQGKCRAKDKKGSKVKITGYKRVPANNEVSLIQAIANQPVSVVVESKGRAFQ 250
Query: 279 SYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR----DE 334
YK GIF G CGT++DHAVT VG+ G NY LIKNSWG WG+ GY++I R +
Sbjct: 251 FYKGGIFEGPCGTKVDHAVTAVGY-----GKNYILIKNSWGPKWGEKGYIRIKRASGKSK 305
Query: 335 GLCGIGTRSSYP 346
G CG+ + S +P
Sbjct: 306 GTCGVYSSSYFP 317
>gi|357160095|ref|XP_003578656.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP2-like
[Brachypodium distachyon]
Length = 377
Score = 273 bits (697), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 147/334 (44%), Positives = 196/334 (58%), Gaps = 31/334 (9%)
Query: 41 QSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE--GNRTYKLGTNQFSD 98
Q++ ++W A+HGR+Y E+ RL+++ N+ YIE AN + TY+LG ++D
Sbjct: 47 QTMAPRFQRWKAEHGRAYATRDEELRRLRVYARNVRYIEAANGDPAAGLTYQLGETAYTD 106
Query: 99 LTNDEFRALYTGYKMPSP---SHRSTTSSTFK---------------YQNLSMTDVPTSL 140
LT DEF A+YT PSP +H + Y N+S P S+
Sbjct: 107 LTADEFTAMYTS---PSPVLSAHDDEAAGAMMITTRAGAVDAGGQQVYFNVSTAGAPASV 163
Query: 141 DWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCL 200
DWR KGAVT +KNQ CG CWAF+ VA VEGI +IR+GNLI LSEQ+L+DC T + GC
Sbjct: 164 DWRAKGAVTEVKNQGRCGSCWAFSTVAVVEGIHQIRTGNLISLSEQELVDCDTL-DYGCD 222
Query: 201 GGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLK 259
GG A +I N GIATE +YPY G C A + P AA IS + V + E +L
Sbjct: 223 GGVSYHALEWIASNGGIATEADYPYTGKDGACVANKLPLHAAAISGFARVATRSEPSLAN 282
Query: 260 AVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIV-GFGTTEDGANYWLIKNSW 318
AV+ QPV+++I A FQ Y +G++NG CGT+L+H VT+V DG YW++KNSW
Sbjct: 283 AVAAQPVAVSIEAGGANFQHYVKGVYNGPCGTRLNHGVTVVGYGEEEGDGEKYWIVKNSW 342
Query: 319 GNTWGDAGYMKIVRD-----EGLCGIGTRSSYPL 347
G WGD GY ++ +D EGLCGI R S+PL
Sbjct: 343 GKKWGDGGYFRMKKDVAGKPEGLCGIAIRPSFPL 376
>gi|226499806|ref|NP_001151335.1| cysteine protease 1 [Zea mays]
gi|195645896|gb|ACG42416.1| cysteine protease 1 precursor [Zea mays]
Length = 258
Score = 273 bits (697), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 140/261 (53%), Positives = 183/261 (70%), Gaps = 12/261 (4%)
Query: 94 NQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPT---SLDWRDKGAVTP 150
N+F+D+TNDEF A+YTG + P P+ + FKY N++++D ++DWR KGAVT
Sbjct: 4 NEFADMTNDEFMAMYTGLR-PVPAGAKKMAG-FKYGNVTLSDADDDQQTVDWRQKGAVTG 61
Query: 151 IKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAY 210
IK+Q++CGCCWAFAAVAAVEGI +I +GNL+ LSEQQ+LDC T+GNNGC GG + AF Y
Sbjct: 62 IKDQRQCGCCWAFAAVAAVEGIHQITTGNLVSLSEQQVLDCDTDGNNGCNGGYIDNAFQY 121
Query: 211 IIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAI 270
I+ N G+ATED YPY A C + Q AA IS Y++VPSGDE AL AV+ QPVS+AI
Sbjct: 122 IVGNGGLATEDAYPYTAAQAMCQSVQPVAA--ISGYQDVPSGDEAALAAAVANQPVSVAI 179
Query: 271 AAYSTEFQSYKEGIFNGV-CGT--QLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGY 327
A++ FQ Y G+ C T L+HAVT VG+GT EDG YWL+KN WG WG+ GY
Sbjct: 180 DAHN--FQLYGGGVMTAASCSTPPNLNHAVTAVGYGTAEDGTPYWLLKNQWGQNWGEGGY 237
Query: 328 MKIVRDEGLCGIGTRSSYPLA 348
+++ R CG+ ++SYP+A
Sbjct: 238 LRLERGANACGVAQQASYPVA 258
>gi|24653514|ref|NP_523735.2| cysteine proteinase-1, isoform C [Drosophila melanogaster]
gi|118572624|sp|Q95029.2|CATL_DROME RecName: Full=Cathepsin L; AltName: Full=Cysteine proteinase 1;
Contains: RecName: Full=Cathepsin L heavy chain;
Contains: RecName: Full=Cathepsin L light chain; Flags:
Precursor
gi|21627209|gb|AAM68565.1| cysteine proteinase-1, isoform C [Drosophila melanogaster]
Length = 371
Score = 273 bits (697), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 141/316 (44%), Positives = 195/316 (61%), Gaps = 14/316 (4%)
Query: 46 IHEKWMA---QHGRSYKDELEKEMRLKIFKENLEYIEKANK---EGNRTYKLGTNQFSDL 99
+ E+W +H ++Y+DE E+ RLKIF EN I K N+ EG ++KL N+++DL
Sbjct: 55 VMEEWHTFKLEHRKNYQDETEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADL 114
Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFK---YQNLSMTDVPTSLDWRDKGAVTPIKNQKE 156
+ EFR L G+ +FK + + + +P S+DWR KGAVT +K+Q
Sbjct: 115 LHHEFRQLMNGFNYTLHKQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQGH 174
Query: 157 CGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQ 215
CG CWAF++ A+EG +SG L+ LSEQ L+DCST GNNGC GG + AF YI N
Sbjct: 175 CGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNG 234
Query: 216 GIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYS 274
GI TE YPY+A+ +C + A + ++P GDE+ + +AV ++ PVS+AI A
Sbjct: 235 GIDTEKSYPYEAIDDSCHFNKGTVGATDRGFTDIPQGDEKKMAEAVATVGPVSVAIDASH 294
Query: 275 TEFQSYKEGIFN-GVCGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR 332
FQ Y EG++N C Q LDH V +VGFGT E G +YWL+KNSWG TWGD G++K++R
Sbjct: 295 ESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESGEDYWLVKNSWGTTWGDKGFIKMLR 354
Query: 333 D-EGLCGIGTRSSYPL 347
+ E CGI + SSYPL
Sbjct: 355 NKENQCGIASASSYPL 370
>gi|357166364|ref|XP_003580686.1| PREDICTED: oryzain alpha chain-like [Brachypodium distachyon]
Length = 360
Score = 273 bits (697), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 142/332 (42%), Positives = 213/332 (64%), Gaps = 18/332 (5%)
Query: 25 LVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN-- 82
+ S + Q+ S E+ ++ +W AQHG +E +E R + F++NL YI++ N
Sbjct: 26 IASSSGQIRS-----EEETRRMYAEWTAQHGSPITNE--EEGRYEAFRDNLRYIDEHNAA 78
Query: 83 -KEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLD 141
G +++LG N+F+ LTN+E+RA Y G ++ S + + +Y+ +P S+D
Sbjct: 79 ADAGIHSFRLGLNRFAGLTNEEYRAAYLGLRLRSGAVGDLRKPSARYEAADGEALPESVD 138
Query: 142 WRDKGAVTPIKNQ-KECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCL 200
WR+KGAV +K+Q + CG WAF+A+AAVE I +I +G LI LSEQ+L+DC T+ N GC
Sbjct: 139 WREKGAVGKVKDQGRSCGSAWAFSAIAAVESINQIVTGELISLSEQELMDCDTSYNAGCD 198
Query: 201 GGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQK-PAAAKISNYEEVPSGDEQALLK 259
GG + AF +II N GI T+++YPY+A +C A ++ A I +YE++ +E++L K
Sbjct: 199 GGLMDDAFEFIISNGGIDTDEDYPYKARNDSCDANKRNRKAVTIDDYEDLRM-NEKSLQK 257
Query: 260 AVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWG 319
AVS QPVS+AI A +FQ YK GIF G CGT LDHA TIVG+G +E+G +YW++K S+G
Sbjct: 258 AVSNQPVSVAIEAGGRDFQLYKSGIFTGTCGTDLDHATTIVGYG-SENGTDYWIVKESYG 316
Query: 320 NTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
+WG++GY ++ R+ G CGI SYP+
Sbjct: 317 TSWGESGYARMERNIKETSGKCGIAMLPSYPV 348
>gi|242049716|ref|XP_002462602.1| hypothetical protein SORBIDRAFT_02g028840 [Sorghum bicolor]
gi|241925979|gb|EER99123.1| hypothetical protein SORBIDRAFT_02g028840 [Sorghum bicolor]
Length = 384
Score = 273 bits (697), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 147/361 (40%), Positives = 204/361 (56%), Gaps = 61/361 (16%)
Query: 43 VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTND 102
++E E+WM +HGR Y D EK+ RL++++ N+ +E N N Y+L N+F+DLTN+
Sbjct: 28 MLERFEQWMGRHGRLYADAGEKQRRLEVYRRNVALVETFNSMSNGGYRLADNKFADLTNE 87
Query: 103 EFRALYTGYKMPSPSHRSTTSSTF-------------KYQNLSMTDVPTSLDWRDKGAVT 149
EFRA G+ P P R+T +T +Y + ++P S+DWR+KGAV
Sbjct: 88 EFRAKMLGFGRPPPHGRATGHTTTPGTVACIGSGLGRRYSD----ELPKSVDWREKGAVA 143
Query: 150 PIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFA 209
P+KNQ ECG CWAF+AVAA+EGI +I++G L+ LSEQ+L+DC T GC GG AF
Sbjct: 144 PVKNQGECGSCWAFSAVAAIEGINQIKNGKLVSLSEQELVDCDTKA-IGCAGGYMSWAFE 202
Query: 210 YIIQNQGIATEDEYPYQ-----------AVPGTCS--------------AAQKP----AA 240
+++ N G+ TE YPYQ A+P C+ A Q P +A
Sbjct: 203 FVMNNSGLTTERNYPYQGTYAHGNRKTHALPFDCTKGSSTCDSRAGMNGACQTPKLKESA 262
Query: 241 AKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIV 300
IS Y V + E LL+A + QPVS+A+ A S +Q Y G+F G C L+H VT+V
Sbjct: 263 VSISGYVNVTASSEPDLLRAAAAQPVSVAVDAGSFVWQLYGGGVFTGPCTADLNHGVTVV 322
Query: 301 GFGTTE----------DGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
G+G T+ G YW++KNSWG WGDAGY+ + R+ GLCGI SYP
Sbjct: 323 GYGETQRDTDGDGTGVPGQKYWIVKNSWGPEWGDAGYILMQREASVASGLCGIALLPSYP 382
Query: 347 L 347
+
Sbjct: 383 V 383
>gi|255522980|gb|ACU12382.1| RE21773p [Drosophila melanogaster]
Length = 375
Score = 273 bits (697), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 141/316 (44%), Positives = 195/316 (61%), Gaps = 14/316 (4%)
Query: 46 IHEKWMA---QHGRSYKDELEKEMRLKIFKENLEYIEKANK---EGNRTYKLGTNQFSDL 99
+ E+W +H ++Y+DE E+ RLKIF EN I K N+ EG ++KL N+++DL
Sbjct: 59 VMEEWHTFKLEHRKNYQDETEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADL 118
Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFK---YQNLSMTDVPTSLDWRDKGAVTPIKNQKE 156
+ EFR L G+ +FK + + + +P S+DWR KGAVT +K+Q
Sbjct: 119 LHHEFRQLMNGFNYTLHKQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQGH 178
Query: 157 CGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQ 215
CG CWAF++ A+EG +SG L+ LSEQ L+DCST GNNGC GG + AF YI N
Sbjct: 179 CGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNG 238
Query: 216 GIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYS 274
GI TE YPY+A+ +C + A + ++P GDE+ + +AV ++ PVS+AI A
Sbjct: 239 GIDTEKSYPYEAIDDSCHFNKGTVGATDRGFTDIPQGDEKKMAEAVATVGPVSVAIDASH 298
Query: 275 TEFQSYKEGIFN-GVCGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR 332
FQ Y EG++N C Q LDH V +VGFGT E G +YWL+KNSWG TWGD G++K++R
Sbjct: 299 ESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESGEDYWLVKNSWGTTWGDKGFIKMLR 358
Query: 333 D-EGLCGIGTRSSYPL 347
+ E CGI + SSYPL
Sbjct: 359 NKENQCGIASASSYPL 374
>gi|255078398|ref|XP_002502779.1| cysteine endopeptidase [Micromonas sp. RCC299]
gi|226518045|gb|ACO64037.1| cysteine endopeptidase [Micromonas sp. RCC299]
Length = 414
Score = 273 bits (697), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 138/318 (43%), Positives = 201/318 (63%), Gaps = 17/318 (5%)
Query: 42 SVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSD 98
S+ ++ +W +HG++Y E EKE+RLKIF +N E+++K N E G T+ +G N +D
Sbjct: 63 SLSDLFHEWTQKHGKTYDSEEEKELRLKIFADNHEFVQKHNAEYENGEHTHFVGLNHLAD 122
Query: 99 LTNDEFRALYTGYKMPSPSHRS-TTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKEC 157
LT DEF+ + GY + R+ +ST++Y +++ P +DW GAVTP+KNQK+C
Sbjct: 123 LTKDEFKKML-GYNAALRASRAPVDASTWEYADVT---PPEEIDWVASGAVTPVKNQKQC 178
Query: 158 GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
G CWAF+ AVEG+ I++G LI LSE++L+ CSTNGN GC GG + F +I+ N+GI
Sbjct: 179 GSCWAFSTTGAVEGVNAIKTGKLISLSEEELISCSTNGNMGCNGGLMDNGFEWIVNNRGI 238
Query: 218 ATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTE 276
TED + Y A C ++ A I +++VPS DE +L+KAVS QPVS+AI A
Sbjct: 239 DTEDGWEYVAKEEKCGFFRRHHRAVAIDGFKDVPSNDEDSLMKAVSQQPVSVAIEADHQS 298
Query: 277 FQSYKEGIFNGV-CGTQLDHAVTIVGFGT---TEDGANYWLIKNSWGNTWGDAGYMKIVR 332
FQ Y G+++ CGT+LDH V +VG+G + ++W IKNSWG WG+ GY++I +
Sbjct: 299 FQLYAGGVYSAKDCGTELDHGVLLVGYGVDPKSTKHKHFWKIKNSWGPAWGEDGYIRIAK 358
Query: 333 D----EGLCGIGTRSSYP 346
EG CG+ + SYP
Sbjct: 359 GGSGVEGQCGVAMQPSYP 376
>gi|24653516|ref|NP_725347.1| cysteine proteinase-1, isoform A [Drosophila melanogaster]
gi|24653518|ref|NP_725348.1| cysteine proteinase-1, isoform B [Drosophila melanogaster]
gi|1658527|gb|AAB18345.1| cysteine proteinase 1 [Drosophila melanogaster]
gi|2305221|gb|AAB65749.1| cysteine proteinase-1 [Drosophila melanogaster]
gi|7303249|gb|AAF58311.1| cysteine proteinase-1, isoform A [Drosophila melanogaster]
gi|21627210|gb|AAM68566.1| cysteine proteinase-1, isoform B [Drosophila melanogaster]
gi|54650754|gb|AAV36956.1| LP06554p [Drosophila melanogaster]
gi|220951982|gb|ACL88534.1| Cp1-PA [synthetic construct]
Length = 341
Score = 272 bits (696), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 141/316 (44%), Positives = 195/316 (61%), Gaps = 14/316 (4%)
Query: 46 IHEKWMA---QHGRSYKDELEKEMRLKIFKENLEYIEKANK---EGNRTYKLGTNQFSDL 99
+ E+W +H ++Y+DE E+ RLKIF EN I K N+ EG ++KL N+++DL
Sbjct: 25 VMEEWHTFKLEHRKNYQDETEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADL 84
Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFK---YQNLSMTDVPTSLDWRDKGAVTPIKNQKE 156
+ EFR L G+ +FK + + + +P S+DWR KGAVT +K+Q
Sbjct: 85 LHHEFRQLMNGFNYTLHKQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQGH 144
Query: 157 CGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQ 215
CG CWAF++ A+EG +SG L+ LSEQ L+DCST GNNGC GG + AF YI N
Sbjct: 145 CGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNG 204
Query: 216 GIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYS 274
GI TE YPY+A+ +C + A + ++P GDE+ + +AV ++ PVS+AI A
Sbjct: 205 GIDTEKSYPYEAIDDSCHFNKGTVGATDRGFTDIPQGDEKKMAEAVATVGPVSVAIDASH 264
Query: 275 TEFQSYKEGIFN-GVCGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR 332
FQ Y EG++N C Q LDH V +VGFGT E G +YWL+KNSWG TWGD G++K++R
Sbjct: 265 ESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESGEDYWLVKNSWGTTWGDKGFIKMLR 324
Query: 333 D-EGLCGIGTRSSYPL 347
+ E CGI + SSYPL
Sbjct: 325 NKENQCGIASASSYPL 340
>gi|162459488|ref|NP_001105571.1| maize insect resistance1 precursor [Zea mays]
gi|5731354|gb|AAB70820.2| cysteine protease Mir1 [Zea mays]
Length = 398
Score = 272 bits (696), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 137/331 (41%), Positives = 205/331 (61%), Gaps = 28/331 (8%)
Query: 40 EQSVVEIHEKWMAQHGR--------------SYKDELEKEMRLKIFKENLEYIEKANKE- 84
++ V ++E W ++HGR ++E ++ +RL++F++NL YI+ N E
Sbjct: 47 DEEVRRMYEAWKSKHGRGGSSNDDCDMAPGDDEQEEEDRRLRLEVFRDNLRYIDAHNAEA 106
Query: 85 --GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDW 142
G T++LG F+DLT +E+R G++ + S + + D+P ++DW
Sbjct: 107 DAGLHTFRLGLTPFADLTLEEYRGRVLGFRARGRRSGARYGSGYSVRG---GDLPDAIDW 163
Query: 143 RDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGG 202
R GAVT +K+Q++CG CWAF+AVAA+EG+ I +GNL+ LSEQ+++DC ++GC GG
Sbjct: 164 RQLGAVTEVKDQQQCGGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ-DSGCDGG 222
Query: 203 SREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKP--AAAKISNYEEVPSGDEQALLKA 260
E AF ++I N GI TE +YP+ GTC A+++ A I EV S +E AL +A
Sbjct: 223 QMENAFRFVIGNGGIDTEADYPFIGTDGTCDASKEKNEKVATIDGLVEVASNNETALQEA 282
Query: 261 VSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGN 320
V++QPVS+AI A FQ Y GIFNG CGT LDH VT VG+G +E G +YW++KNSW
Sbjct: 283 VAIQPVSVAIDASGRAFQHYSSGIFNGPCGTSLDHGVTAVGYG-SESGKDYWIVKNSWSA 341
Query: 321 TWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
+WG+AGY+++ R+ G CGI +SYP+
Sbjct: 342 SWGEAGYIRMRRNVPRPTGKCGIAMDASYPV 372
>gi|195484843|ref|XP_002090843.1| GE12574 [Drosophila yakuba]
gi|194176944|gb|EDW90555.1| GE12574 [Drosophila yakuba]
Length = 341
Score = 271 bits (694), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 140/316 (44%), Positives = 196/316 (62%), Gaps = 14/316 (4%)
Query: 46 IHEKWMA---QHGRSYKDELEKEMRLKIFKENLEYIEKANK---EGNRTYKLGTNQFSDL 99
+ E+W +H ++Y+D+ E+ RLKIF EN I K N+ EG ++KL N+++DL
Sbjct: 25 VMEEWHTFKLEHRKNYQDDTEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADL 84
Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFK---YQNLSMTDVPTSLDWRDKGAVTPIKNQKE 156
+ EFR L G+ T +FK + + + +P S+DWR KGAVT +K+Q
Sbjct: 85 LHHEFRQLMNGFNYTLHKQLRATDDSFKGVTFISPAHVTLPKSVDWRSKGAVTAVKDQGH 144
Query: 157 CGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQ 215
CG CWAF++ A+EG +SG L+ LSEQ L+DCST GNNGC GG + AF YI N
Sbjct: 145 CGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNG 204
Query: 216 GIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYS 274
GI TE YPY+A+ +C + A + ++P GDE+ + +AV ++ PVS+AI A
Sbjct: 205 GIDTEKSYPYEAIDDSCHFNKGTIGATDRGFTDIPQGDEKKMAEAVATVGPVSVAIDASH 264
Query: 275 TEFQSYKEGIFN-GVCGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR 332
FQ Y EG++N C Q LDH V +VGFGT E G +YWL+KNSWG TWGD G++K++R
Sbjct: 265 ESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESGDDYWLVKNSWGTTWGDKGFIKMLR 324
Query: 333 D-EGLCGIGTRSSYPL 347
+ + CGI + SSYPL
Sbjct: 325 NKDNQCGIASASSYPL 340
>gi|57118005|gb|AAW34134.1| cysteine protease gp2a [Zingiber officinale]
Length = 381
Score = 271 bits (694), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 141/333 (42%), Positives = 208/333 (62%), Gaps = 14/333 (4%)
Query: 23 TLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN 82
L +S V RS E V ++ +W ++ + K E RL++FKENL+++++ N
Sbjct: 31 VLTLSKQGGAVPVRSDEE--VRMLYLEWRVKNHPAEKYLDLNEYRLEVFKENLQFVDEHN 88
Query: 83 KEGNR---TYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSS-TFKYQNLSMTDVPT 138
+R T+ LG N+F+DLTN+E+R + + S RS + + +Y+ D+P
Sbjct: 89 AAADRGEHTFLLGMNRFADLTNEEYRTRF--LRDFSRLRRSASGKISSRYRLREGDDLPD 146
Query: 139 SLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNG 198
S+DWR+ GAV P+KNQ CG CWAF+ VAAVEGI +I +G+LI LSEQQL+DC+T N+G
Sbjct: 147 SIDWRENGAVVPVKNQGGCGSCWAFSTVAAVEGINQIVTGDLISLSEQQLVDCTT-ANHG 205
Query: 199 CLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALL 258
C GG AF +I+ N GI +E+ YPY+ G C++ I +YE VPS +EQ+L
Sbjct: 206 CRGGWMNPAFQFIVNNGGINSEETYPYRGQNGICNSTVNAPVVSIDSYENVPSHNEQSLQ 265
Query: 259 KAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSW 318
KAV+ QPVS+ + A +FQ Y+ GIF G C +HA+T+VG+GT D ++W++KNSW
Sbjct: 266 KAVANQPVSVTMDAAGRDFQLYRSGIFTGSCNISANHALTVVGYGTEND-KDFWIVKNSW 324
Query: 319 GNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
G WG++GY++ R+ G CGI +SYP+
Sbjct: 325 GKNWGESGYIRAERNIENPNGKCGITRFASYPV 357
>gi|413943290|gb|AFW75939.1| maize insect resistance1 [Zea mays]
Length = 435
Score = 271 bits (693), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 139/332 (41%), Positives = 204/332 (61%), Gaps = 26/332 (7%)
Query: 40 EQSVVEIHEKWMAQHGR-------------SYKDELEKEMRLKIFKENLEYIEKANKE-- 84
++ V ++E W ++HGR + E ++ +RL++F++NL YI+K N E
Sbjct: 77 DEEVRRMYEAWKSKHGRGGSSNDDCDMAPGDDEQEEDRRLRLEVFRDNLRYIDKHNAEAD 136
Query: 85 -GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD--VPTSLD 141
G T++LG F+DLT DE+R G++ + + Y+ +P ++D
Sbjct: 137 AGLHTFRLGLTPFADLTLDEYRGRVLGFRARARRSGARYGHGHGYRARPRGGDLLPDAID 196
Query: 142 WRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLG 201
WR GAVT +K+Q++CG CWAF+AVAA+EGI I +GNL+ LSEQ+++DC ++GC G
Sbjct: 197 WRQLGAVTEVKDQQQCGGCWAFSAVAAIEGINAIATGNLVSLSEQEIIDCDAQ-DSGCDG 255
Query: 202 GSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQK--PAAAKISNYEEVPSGDEQALLK 259
G E AF ++I N GI TE +YP+ GTC A+++ A I EV S +E AL +
Sbjct: 256 GQMENAFRFVIGNGGIDTEADYPFIGTDGTCDASKENNEKVATIDGLVEVASNNETALQE 315
Query: 260 AVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWG 319
AV++QPVS+AI A FQ Y GIFNG CGT LDH VT VG+G +E G +YW++KNSW
Sbjct: 316 AVAIQPVSVAIDASGRAFQHYSSGIFNGPCGTSLDHGVTAVGYG-SESGKDYWIVKNSWS 374
Query: 320 NTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
+WG+AGY+++ R+ G CGI +SYP+
Sbjct: 375 ASWGEAGYIRMRRNVPRPTGKCGIAMDASYPV 406
>gi|223946391|gb|ACN27279.1| unknown [Zea mays]
Length = 279
Score = 271 bits (692), Expect = 5e-70, Method: Compositional matrix adjust.
Identities = 131/262 (50%), Positives = 170/262 (64%), Gaps = 18/262 (6%)
Query: 99 LTNDEFRALYTGYKMPSPSHR---------STTSSTFKYQNLSMTDVPTSLDWRDKGAVT 149
+T DEFR Y G ++ HR S ++S+F Y + DVP S+DWR KGAVT
Sbjct: 1 MTADEFRRHYAGSRVAH--HRMFRGDRQGSSASASSFMYAD--ARDVPASVDWRQKGAVT 56
Query: 150 PIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFA 209
+K+Q +CG CWAF+ +AAVEGI I++ NL LSEQQL+DC T N GC GG + AF
Sbjct: 57 DVKDQGQCGSCWAFSTIAAVEGINAIKTKNLTSLSEQQLVDCDTKANAGCNGGLMDYAFQ 116
Query: 210 YIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIA 269
YI ++ G+A ED YPY+A +C + P I YE+VP+ DE AL KAV+ QPVS+A
Sbjct: 117 YIAKHGGVAAEDAYPYRARQASCKKSPAPVVT-IDGYEDVPANDESALKKAVAHQPVSVA 175
Query: 270 IAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMK 329
I A + FQ Y EG+F+G CGT+LDH V VG+G T DG YWL+KNSWG WG+ GY++
Sbjct: 176 IEASGSHFQFYSEGVFSGRCGTELDHGVAAVGYGVTADGTKYWLVKNSWGPEWGEKGYIR 235
Query: 330 IVRD----EGLCGIGTRSSYPL 347
+ RD EG CGI +SYP+
Sbjct: 236 MARDVAAKEGHCGIAMEASYPV 257
>gi|296090463|emb|CBI40282.3| unnamed protein product [Vitis vinifera]
Length = 386
Score = 270 bits (691), Expect = 6e-70, Method: Compositional matrix adjust.
Identities = 140/308 (45%), Positives = 193/308 (62%), Gaps = 41/308 (13%)
Query: 46 IHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFR 105
++E W+ +HG+SY E+E R +IFK+NL +IE+ N NRTYK+G +++S FR
Sbjct: 3 VYEAWLVKHGKSYNALGERERRFEIFKDNLRFIEEHNAV-NRTYKVG-DRYS------FR 54
Query: 106 ALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAA 165
A D+P S+DWR+KGAV P+K+Q CG CWAF+
Sbjct: 55 A--------------------------GEDLPESVDWREKGAVVPVKDQGNCGSCWAFST 88
Query: 166 VAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPY 225
+AAVEGI +I +G+LI LSEQ+L+DC + N GC GG + AF +II N GI +E++YPY
Sbjct: 89 IAAVEGINQIATGDLISLSEQELVDCDKSYNQGCNGGLMDYAFEFIINNGGIDSEEDYPY 148
Query: 226 QAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGI 284
+A TC +K A I YE+VP DE++L KAV+ QPVS+AI A FQ Y+ G+
Sbjct: 149 RAADTTCDPNRKNARVVSIDGYEDVPQNDERSLKKAVANQPVSVAIEAGGRAFQLYQSGV 208
Query: 285 FNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR-----DEGLCGI 339
F G CGTQLDH V VG+G TE+ +YW+++NSWG WG++GY+K+ R + G CGI
Sbjct: 209 FTGQCGTQLDHGVVAVGYG-TENSVDYWIVRNSWGPNWGESGYIKLERNLAGTETGKCGI 267
Query: 340 GTRSSYPL 347
SYP+
Sbjct: 268 AIEPSYPI 275
>gi|195583187|ref|XP_002081405.1| GD10995 [Drosophila simulans]
gi|194193414|gb|EDX06990.1| GD10995 [Drosophila simulans]
Length = 341
Score = 270 bits (691), Expect = 6e-70, Method: Compositional matrix adjust.
Identities = 140/316 (44%), Positives = 195/316 (61%), Gaps = 14/316 (4%)
Query: 46 IHEKWMA---QHGRSYKDELEKEMRLKIFKENLEYIEKANK---EGNRTYKLGTNQFSDL 99
+ E+W +H ++Y+D+ E+ RLKIF EN I K N+ EG ++KL N+++DL
Sbjct: 25 VMEEWHTFKLEHRKNYQDDTEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADL 84
Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFK---YQNLSMTDVPTSLDWRDKGAVTPIKNQKE 156
+ EFR L G+ +FK + + + +P S+DWR KGAVT +K+Q
Sbjct: 85 LHHEFRQLMNGFNYTLHKQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQGH 144
Query: 157 CGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQ 215
CG CWAF++ A+EG +SG L+ LSEQ L+DCST GNNGC GG + AF YI N
Sbjct: 145 CGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNG 204
Query: 216 GIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYS 274
GI TE YPY+A+ +C + A + ++P GDE+ + +AV ++ PVS+AI A
Sbjct: 205 GIDTEKSYPYEAIDDSCHFNKGTIGATDRGFTDIPQGDEKKMAEAVATVGPVSVAIDASH 264
Query: 275 TEFQSYKEGIFN-GVCGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR 332
FQ Y EG++N C Q LDH V +VGFGT E G +YWL+KNSWG TWGD G++K++R
Sbjct: 265 ESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESGDDYWLVKNSWGTTWGDKGFIKMLR 324
Query: 333 D-EGLCGIGTRSSYPL 347
+ E CGI + SSYPL
Sbjct: 325 NKENQCGIASASSYPL 340
>gi|558563|emb|CAA57538.1| cysteine proteinase [Cicer arietinum]
Length = 325
Score = 270 bits (689), Expect = 9e-70, Method: Compositional matrix adjust.
Identities = 135/308 (43%), Positives = 186/308 (60%), Gaps = 9/308 (2%)
Query: 46 IHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFR 105
++EKW+ +H + Y EK+ R +IFK+NL +I++ N + N +YK+G N+F+D+ N+E+R
Sbjct: 3 MYEKWLVKHQKMYNGLGEKDTRFQIFKDNLRFIDEHNAQ-NYSYKVGLNKFADINNEEYR 61
Query: 106 ALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAA 165
+Y G K + T T + V +DWR KGAVT IK+Q CG CWAF+
Sbjct: 62 DMYLGTKSDAKRRVMKTKITGHRITYNSVIVTVKVDWRLKGAVTHIKDQGSCGSCWAFST 121
Query: 166 VAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPY 225
+A VE I KI +G + LSEQ+L+DC N GC GG + AF +II+N GI T+ +YPY
Sbjct: 122 IATVEAINKIVTGKFVSLSEQELVDCDRAFNEGCNGGLMDYAFEFIIRNGGIDTDQDYPY 181
Query: 226 QAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGI 284
C +K A I YE+VPS AL KAV+ QPVS+AIA Q Y+ G+
Sbjct: 182 NGFERKCDPTKKNAKVVSIDGYEDVPSY-MNALKKAVAHQPVSVAIAGLGRALQLYQSGV 240
Query: 285 FNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE-----GLCGI 339
F G CGT LDH V +VG+G +E+G +YWL++NSWG WG+ GY KI CGI
Sbjct: 241 FTGKCGTDLDHGVVVVGYG-SENGVDYWLVRNSWGTNWGEDGYFKIASRNVKSLYRKCGI 299
Query: 340 GTRSSYPL 347
+SYP+
Sbjct: 300 AMEASYPV 307
>gi|225458143|ref|XP_002280937.1| PREDICTED: cysteine proteinase RD21a [Vitis vinifera]
gi|302142569|emb|CBI19772.3| unnamed protein product [Vitis vinifera]
Length = 436
Score = 270 bits (689), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 135/320 (42%), Positives = 197/320 (61%), Gaps = 13/320 (4%)
Query: 33 VSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLG 92
V S + S ++ E W Q+G++Y E EK RLK+F+EN ++ + N N +Y L
Sbjct: 15 VHSSVSEASSTADLFEAWCEQYGKTYSSEEEKASRLKVFEENHAFVTQHNSMANASYTLA 74
Query: 93 TNQFSDLTNDEFRALYTGYKMPSPSH-RSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPI 151
N F+DLT+ EF+A G+ SP +S S Q L VP ++DWR GAVT +
Sbjct: 75 LNAFADLTHHEFKASRLGF---SPGRAQSIRSVGTPVQEL---HVPPAVDWRKSGAVTGV 128
Query: 152 KNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYI 211
K+Q CG CW+F+ A+EGI KI +G+L+ LSEQ+L+DC + N+GC GG + A+ ++
Sbjct: 129 KDQGNCGGCWSFSTTGAIEGINKIVTGSLVSLSEQELVDCDRSYNSGCEGGLMDYAYQFV 188
Query: 212 IQNQGIATEDEYPYQAVPGTCSAAQ-KPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAI 270
I+NQGI +E +YPY + C+ + K I Y ++P DE+ LL+ V+ QPVS+ I
Sbjct: 189 IKNQGIDSEADYPYVGMDKPCNKEKLKKHIVTIDGYTDIPPNDEKQLLQVVAKQPVSVGI 248
Query: 271 AAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKI 330
FQ Y +G++ G C + LDHAV IVG+G TEDG ++W++KNSWG WG GY+ +
Sbjct: 249 CGSEKTFQLYSKGVYTGPCSSTLDHAVLIVGYG-TEDGVDFWIVKNSWGEHWGMRGYIHM 307
Query: 331 VRD----EGLCGIGTRSSYP 346
+R+ EG+CGI +SYP
Sbjct: 308 LRNNGTAEGICGINMLASYP 327
>gi|218202389|gb|EEC84816.1| hypothetical protein OsI_31898 [Oryza sativa Indica Group]
Length = 350
Score = 270 bits (689), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 138/329 (41%), Positives = 195/329 (59%), Gaps = 21/329 (6%)
Query: 38 THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
T +++ E+WM +HGR+Y D EK+ R ++++ N+E +E N N YKL N+F+
Sbjct: 23 TRADLMLDRFEQWMIRHGRAYTDSGEKQRRFEVYRRNVELVETFNSMSN-GYKLADNKFA 81
Query: 98 DLTNDEFRALYTGYK--MPSPSHRSTTSSTFKYQNLSMTDV-PTSLDWRDKGAVTPIKNQ 154
DLTN+EFRA G++ + P +T S+ S D+ P S+DWR KGAV +KNQ
Sbjct: 82 DLTNEEFRAKMLGFRPHVTIPQISNTCSADIAMPGESSDDILPKSVDWRKKGAVVEVKNQ 141
Query: 155 KECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQN 214
+CG CWAF+AVAA+EGI +I++G L+ LSEQ+L+DC GC GG AF +++ N
Sbjct: 142 GDCGSCWAFSAVAAIEGINQIKNGELVSLSEQELVDCDDEA-VGCGGGYMSWAFEFVVGN 200
Query: 215 QGIATEDEYPYQAVPGTCSAAQ-KPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAY 273
G+ TE YPY A G C AA+ +A I+ Y V E L +A + QPVS+A+
Sbjct: 201 HGLTTEASYPYHAANGACQAAKLNQSAVAIAGYRNVTPSSEPDLARAAAAQPVSVAVDGG 260
Query: 274 STEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGAN----------YWLIKNSWGNTWG 323
S FQ Y G++ G C ++H VT+VG+G +E + YW++KNSWG WG
Sbjct: 261 SFMFQLYGSGVYTGPCTADVNHGVTVVGYGESEPKTDGGGAAKGGEKYWIVKNSWGAEWG 320
Query: 324 DAGYMKIVRD-----EGLCGIGTRSSYPL 347
DAGY+ + RD GLCGI SYP+
Sbjct: 321 DAGYILMQRDVAGLASGLCGIALLPSYPV 349
>gi|195334204|ref|XP_002033774.1| GM21500 [Drosophila sechellia]
gi|194125744|gb|EDW47787.1| GM21500 [Drosophila sechellia]
Length = 341
Score = 270 bits (689), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 139/316 (43%), Positives = 195/316 (61%), Gaps = 14/316 (4%)
Query: 46 IHEKWMA---QHGRSYKDELEKEMRLKIFKENLEYIEKANK---EGNRTYKLGTNQFSDL 99
+ E+W +H ++Y+D+ E+ RLKIF EN I K N+ EG ++KL N+++DL
Sbjct: 25 VMEEWHTFKLEHRKNYQDDTEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADL 84
Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFK---YQNLSMTDVPTSLDWRDKGAVTPIKNQKE 156
+ EFR L G+ +FK + + + +P S+DWR KGAVT +K+Q
Sbjct: 85 LHHEFRQLMNGFNYTLHKQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQGH 144
Query: 157 CGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQ 215
CG CWAF++ A+EG +SG L+ LSEQ L+DCST GNNGC GG + AF YI N
Sbjct: 145 CGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNG 204
Query: 216 GIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYS 274
GI TE YPY+A+ +C + A + ++P GDE+ + +AV ++ PV++AI A
Sbjct: 205 GIDTEKSYPYEAIDDSCHFNKGTIGATDRGFTDIPQGDEKKMAEAVATVGPVAVAIDASH 264
Query: 275 TEFQSYKEGIFN-GVCGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR 332
FQ Y EG++N C Q LDH V +VGFGT E G +YWL+KNSWG TWGD G++K++R
Sbjct: 265 ESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESGEDYWLVKNSWGTTWGDKGFIKMLR 324
Query: 333 D-EGLCGIGTRSSYPL 347
+ E CGI + SSYPL
Sbjct: 325 NKENQCGIASASSYPL 340
>gi|326503122|dbj|BAJ99186.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326512552|dbj|BAJ99631.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 389
Score = 269 bits (688), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 148/357 (41%), Positives = 203/357 (56%), Gaps = 37/357 (10%)
Query: 24 LLVSCASQVVSSRSTHEQSVVEIHEK--------WMAQHGRSYKDELEKEMRLKIFKENL 75
+L C+S+ +++ S H ++ H WM RSY EK R K+++ N+
Sbjct: 29 MLAGCSSESLTTSSEHSDIGIDKHHDLMMARFHVWMTVQNRSYPTSSEKAHRFKVYRSNM 88
Query: 76 EYIEKANKEGNR---TYKLGTNQFSDLTNDEFRALYTGYKMPSPSHR----------STT 122
YIE N E TY+LG F+DLT++EF +LYTG K+P HR +T
Sbjct: 89 RYIEALNAEATTSGFTYELGEGPFTDLTDEEFISLYTG-KIPDDDHREDGVHDEQIITTH 147
Query: 123 SSTFK-------YQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKI 175
+ + Y N S P +DWR +GAVTP+K+Q +CG CWAF VA +EGI KI
Sbjct: 148 AGSVNGAEGVTVYANFS-AGAPIRMDWRKRGAVTPVKDQGKCGSCWAFPTVATIEGIHKI 206
Query: 176 RSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAA 235
+ G L+ LSEQQL+DC + GC GG AF +IIQN GI T Y Y+A G C
Sbjct: 207 KRGRLVSLSEQQLVDCDFL-DGGCNGGWPRNAFQWIIQNGGITTTSSYTYKAAEGQCKGN 265
Query: 236 QKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGT-QLD 294
+KP AAKI+ Y +V S E +++ V+ QP++ +I + +FQ YK GI+NG C T +L+
Sbjct: 266 RKP-AAKITGYRKVKSNSEVSMVNIVANQPIAASIVVHGGQFQHYKGGIYNGPCATSKLN 324
Query: 295 HAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE----GLCGIGTRSSYPL 347
H +TIVG+G GA YW++KNSWG WG+ GYM + R G CGI R +PL
Sbjct: 325 HVITIVGYGQQAYGAKYWIVKNSWGAAWGNKGYMLMKRGTKNPLGQCGIAVRPIFPL 381
>gi|255635645|gb|ACU18172.1| unknown [Glycine max]
Length = 355
Score = 269 bits (688), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 135/339 (39%), Positives = 203/339 (59%), Gaps = 12/339 (3%)
Query: 18 MFIIITLLVSCASQVVSSRSTH--------EQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+F++ + + ++S + H + V+ + E+W+ +H + Y EKE R +
Sbjct: 8 LFMVFAVSSALDMSIISHDNAHADRATRRTDDEVMSMFEEWLVKHDKVYNALGEKEKRFQ 67
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQ 129
IFK NL +I++ N NRTYKLG N F+DLTN E+RA+Y P T +Y
Sbjct: 68 IFKNNLRFIDERNSL-NRTYKLGLNVFADLTNAEYRAMYLRTWDDGPRLDLDTPPRNRYV 126
Query: 130 NLSMTDVPTSLDWRDKGAVTPIKNQ-KECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQL 188
+P S+DWR +GAVTP+KNQ C CWAF AV AVE + KI++G+LI LSEQ++
Sbjct: 127 PRVGDTIPKSVDWRKEGAVTPVKNQGATCNSCWAFTAVGAVESLVKIKTGDLISLSEQEV 186
Query: 189 LDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEE 248
+DC+T+ + GC GG + + YI +N GI+ E +YPY+ G C + +K A I +
Sbjct: 187 VDCTTSSSRGCGGGDIQHGYIYIRKN-GISLEKDYPYRGDEGKCDSNKKNAIVTIDGHGW 245
Query: 249 VPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDG 308
VP+ E+AL + ++ QPV++ I A EFQ Y G+F G CGT+L+HA+ +VG+G +DG
Sbjct: 246 VPTQLEEALKQGIANQPVAVPIPADDYEFQYYTSGVFKGKCGTELNHALLLVGYGAEKDG 305
Query: 309 ANYWLIKNSWGNTWGDAGYMKIVRDEGLCGIGTRSSYPL 347
+YW+ KNS+ + WG+ GY++I R C G YP+
Sbjct: 306 -DYWIAKNSYSDKWGENGYIRIQRKLSTCKFGNGGYYPI 343
>gi|194883222|ref|XP_001975702.1| GG20414 [Drosophila erecta]
gi|190658889|gb|EDV56102.1| GG20414 [Drosophila erecta]
Length = 341
Score = 269 bits (688), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 138/316 (43%), Positives = 197/316 (62%), Gaps = 14/316 (4%)
Query: 46 IHEKWMA---QHGRSYKDELEKEMRLKIFKENLEYIEKANK---EGNRTYKLGTNQFSDL 99
+ E+W +H ++Y+D+ E+ RLKIF EN I K N+ EG ++KL N+++DL
Sbjct: 25 VMEEWHTFKLEHRKNYQDDTEERFRLKIFNENKHKIAKHNQRYAEGKVSFKLAVNKYADL 84
Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFK---YQNLSMTDVPTSLDWRDKGAVTPIKNQKE 156
+ EFR L G+ +T +FK + + + +P S+DWR KGAVT +K+Q
Sbjct: 85 LHHEFRQLMNGFNYTLHKQLRSTDDSFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQGH 144
Query: 157 CGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQ 215
CG CWAF++ A+EG +SG L+ LSEQ L+DCST GNNGC GG + AF YI N
Sbjct: 145 CGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNG 204
Query: 216 GIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYS 274
GI TE YPY+A+ +C + A + ++P GDE+ + +AV ++ PV++AI A
Sbjct: 205 GIDTEKSYPYEAIDDSCHFNKGAIGATDRGFTDIPQGDEKKMAEAVATVGPVAVAIDASH 264
Query: 275 TEFQSYKEGIFN-GVCGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR 332
FQ Y EG++N C Q LDH V +VG+GT E G +YWL+KNSWG TWGD G++K++R
Sbjct: 265 ESFQFYSEGVYNEPQCDAQNLDHGVLVVGYGTDESGDDYWLVKNSWGTTWGDKGFIKMLR 324
Query: 333 D-EGLCGIGTRSSYPL 347
+ + CGI + SSYPL
Sbjct: 325 NKDNQCGIASASSYPL 340
>gi|66735056|gb|AAY53767.1| cysteine protease [Saprolegnia parasitica]
Length = 523
Score = 269 bits (687), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 131/304 (43%), Positives = 189/304 (62%), Gaps = 8/304 (2%)
Query: 50 WMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYT 109
WM + + LE R ++F N + IE NK+ + ++ +G N++S LT DEF+ L T
Sbjct: 31 WMKKFAVKL-NPLEWVHRFEVFILNDQRIEAHNKDASSSFTMGHNEYSHLTFDEFKKLRT 89
Query: 110 GYKMPSPSH-RSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAA 168
G ++ SPS+ +S ++MTDVP +DW ++G VTP+KNQ CG CWAF+ A
Sbjct: 90 GLRV-SPSYIQSRAKYALMAPAVNMTDVPNEMDWVEQGGVTPVKNQGMCGSCWAFSTTGA 148
Query: 169 VEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAV 228
+EG + S L+ +SEQ+L+DC NG+ GC GG + AF ++ ++G+ E++YPY A
Sbjct: 149 IEGAAFVSSKQLVSVSEQELVDCDHNGDMGCNGGLMDNAFKWVKTHKGLCKEEDYPYHAK 208
Query: 229 PGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGV 288
GTC+ + K++ + +VP+ DEQAL AV+ QPVS+AI A EFQ YK G+F+
Sbjct: 209 EGTCALKKCKPVTKVTAFHDVPANDEQALKAAVAKQPVSVAIEADQPEFQFYKSGVFDKS 268
Query: 289 CGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR----DEGLCGIGTRSS 344
CGT+LDH V +VG+G E G YW +KNSWG WGD GY+K+ R + G CG+ S
Sbjct: 269 CGTKLDHGVLVVGYG-EEGGKKYWKVKNSWGADWGDKGYIKLAREFGPETGQCGVAMVPS 327
Query: 345 YPLA 348
YP A
Sbjct: 328 YPTA 331
>gi|357130486|ref|XP_003566879.1| PREDICTED: fruit bromelain-like [Brachypodium distachyon]
Length = 354
Score = 269 bits (687), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 141/320 (44%), Positives = 190/320 (59%), Gaps = 17/320 (5%)
Query: 42 SVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTN 101
+V HE+WMA+ GR+YKD EK R ++F N +++ N+ GNRTY LG N FSDLT+
Sbjct: 33 TVASRHERWMARFGRAYKDADEKARRQEVFGANARHVDAVNRSGNRTYTLGLNHFSDLTD 92
Query: 102 DEFRALYTGYK--MPSPSH--RSTTSSTFKYQNLS--MTDVPTSLDWRDKGAVTPIKNQK 155
EF + GY+ P P R K L+ DVP S+DWR +GAVT IKNQ+
Sbjct: 93 HEFLQQHLGYRHHQPGPGGLLRPEDQDMSKATALADYGQDVPDSVDWRAQGAVTEIKNQR 152
Query: 156 ECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQ 215
CG CWAFAAVAA EG+ KI +GNLI +SEQQ+LDC T G N C GG A Y+ +
Sbjct: 153 SCGSCWAFAAVAATEGLVKIATGNLISMSEQQVLDC-TGGGNTCDGGDINAALRYVAASG 211
Query: 216 GIATEDEYPYQAVPGTC---SAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAA 272
G+ E Y Y A G C S A A+ + + + GDE AL + QPV++A+ A
Sbjct: 212 GLQPEAAYAYAAQKGACRGASPANSAASVGGARFARL-GGDEGALRGLAAGQPVAVALEA 270
Query: 273 YSTEFQSYKEGIFNG--VCGTQLDHAVTIVGFGTTED-GANYWLIKNSWGNTWGDAGYMK 329
+F+ YK G++ G CG +L+H VT+VG+G +D G YW++KN WG WG+ GYM+
Sbjct: 271 SEPDFRHYKSGVYAGSASCGRRLNHGVTVVGYGAEDDSGDEYWVVKNQWGTLWGEKGYMR 330
Query: 330 IVRDE---GLCGIGTRSSYP 346
+ R + CGI + + YP
Sbjct: 331 VARGDVAGANCGIASYAYYP 350
>gi|115479933|ref|NP_001063560.1| Os09g0497500 [Oryza sativa Japonica Group]
gi|113631793|dbj|BAF25474.1| Os09g0497500 [Oryza sativa Japonica Group]
gi|215704298|dbj|BAG93138.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 349
Score = 269 bits (687), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 137/324 (42%), Positives = 194/324 (59%), Gaps = 21/324 (6%)
Query: 43 VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTND 102
+++ E+WM +HGR+Y D EK+ R ++++ N+E +E N N YKL N+F+DLTN+
Sbjct: 27 MLDRFEQWMIRHGRAYTDAGEKQRRFEVYRRNVELVETFNSMSN-GYKLADNKFADLTNE 85
Query: 103 EFRALYTGYK--MPSPSHRSTTSSTFKYQNLSMTDV-PTSLDWRDKGAVTPIKNQKECGC 159
EFRA G++ + P +T S+ S D+ P S+DWR KGAV +KNQ +CG
Sbjct: 86 EFRAKMLGFRPHVTIPQISNTCSADIAMPGESSDDILPKSVDWRKKGAVVEVKNQGDCGS 145
Query: 160 CWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIAT 219
CWAF+AVAA+EGI +I++G L+ LSEQ+L+DC GC GG AF +++ N G+ T
Sbjct: 146 CWAFSAVAAIEGINQIKNGELVSLSEQELVDCDDEA-VGCGGGYMSWAFEFVVGNHGLTT 204
Query: 220 EDEYPYQAVPGTCSAAQ-KPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQ 278
E YPY A G C AA+ +A I+ Y V E L +A + QPVS+A+ S FQ
Sbjct: 205 EASYPYHAANGACQAAKLNQSAVAIAGYRNVTPSSEPDLARAAAAQPVSVAVDGGSFMFQ 264
Query: 279 SYKEGIFNGVCGTQLDHAVTIVGFGTTEDGAN----------YWLIKNSWGNTWGDAGYM 328
Y G++ G C ++H VT+VG+G +E + YW++KNSWG WGDAGY+
Sbjct: 265 LYGSGVYTGPCTADVNHGVTVVGYGESEPKTDGGGAAKGGEKYWIVKNSWGAEWGDAGYI 324
Query: 329 KIVRD-----EGLCGIGTRSSYPL 347
+ RD GLCGI SYP+
Sbjct: 325 LMQRDVAGLASGLCGIALLPSYPV 348
>gi|167521499|ref|XP_001745088.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163776702|gb|EDQ90321.1| predicted protein [Monosiga brevicollis MX1]
Length = 294
Score = 269 bits (687), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 144/304 (47%), Positives = 195/304 (64%), Gaps = 21/304 (6%)
Query: 52 AQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEFRALY 108
+ + +SY+ E + RL F+ NLE+I K N E G +Y +G N+F+DLT DEF ALY
Sbjct: 3 SDYSKSYESEAVEAKRLAAFEANLEFINKHNAEHAQGLHSYTVGVNEFADLTIDEFMALY 62
Query: 109 TGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAA 168
+PS +R+ +T Y + D S+DWR KGAVTPIKNQ +CG CW+F+ +
Sbjct: 63 ----VPSKFNRTMPYNTV-YLPATSED---SVDWRTKGAVTPIKNQGQCGSCWSFSTTGS 114
Query: 169 VEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQA 227
EG I +GNL+ LSEQQL+DCS + GN GC GG + AF YII N+G+ TE++YPY A
Sbjct: 115 TEGAHAIATGNLVSLSEQQLVDCSGSFGNQGCNGGLMDDAFKYIISNKGLDTEEDYPYTA 174
Query: 228 VPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFN 286
GTC+ ++ AA IS+Y +VP +E L AV+ PVS+AI A + FQ YK G+F+
Sbjct: 175 QDGTCNKEKEAKHAATISSYSDVPKNNEDQLAAAVAKGPVSVAIEADQSGFQLYKSGVFD 234
Query: 287 GVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD---EGLCGIGTRS 343
G CGT LDH V +VG+ T+D YW++KNSWG TWG GY+ + R G+CGI +
Sbjct: 235 GNCGTNLDHGVLVVGY--TDD---YWIVKNSWGTTWGVEGYINMKRGVSASGICGIAMQP 289
Query: 344 SYPL 347
SYP+
Sbjct: 290 SYPI 293
>gi|356543010|ref|XP_003539956.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 306
Score = 269 bits (687), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 144/309 (46%), Positives = 196/309 (63%), Gaps = 19/309 (6%)
Query: 48 EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRAL 107
E+W+ Q+ R YKD+ E E+R I++ NLEYIE N + +Y L N+F+DLTN+EF +
Sbjct: 6 ERWLKQNDRXYKDKEEWEVRFGIYQANLEYIECKNSQ-EXSYNLTDNKFADLTNEEFVSP 64
Query: 108 YTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVA 167
Y G+ R + F Y D+P S DWR +GAV+ IK+Q CG CWAF+AVA
Sbjct: 65 YLGFGT-----RFLPHTGFMYH--EHEDLPESKDWRKEGAVSDIKDQGNCGSCWAFSAVA 117
Query: 168 AVEGITKIRSGNLIQLSEQQLLDCST-NGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQ 226
AVEGI KI+SG L+ LSEQ+ DC +GN GC GG + AFA+I +N G+ T +YPY+
Sbjct: 118 AVEGINKIKSGKLVSLSEQEFRDCDVEDGNQGCEGGLMDTAFAFIKKNGGLTTSKDYPYE 177
Query: 227 AVPGTCSAAQK-PAAAKISNYEEVPSGDEQALLKAVSM--QPVSIAIAAYSTEFQSYKEG 283
V GTC+ + AA IS + +VP+ DE L + Q S+AI A FQ Y +G
Sbjct: 178 GVDGTCNKEKALHHAANISGHVKVPANDEAMLKAKAAAANQXESVAIDAGGHAFQLYLKG 237
Query: 284 IFNGVCGTQLDHAVTIVGFGT-TEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCG 338
+F+G+CG QL+H VTIVG+G T D YW++KNSWG WG++GY+++ RD G CG
Sbjct: 238 VFSGICGKQLNHGVTIVGYGKGTSD--KYWIVKNSWGADWGESGYIRMKRDAFDKAGTCG 295
Query: 339 IGTRSSYPL 347
I ++SYPL
Sbjct: 296 IAMQASYPL 304
>gi|312381833|gb|EFR27483.1| hypothetical protein AND_05794 [Anopheles darlingi]
Length = 344
Score = 269 bits (687), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 141/321 (43%), Positives = 196/321 (61%), Gaps = 19/321 (5%)
Query: 46 IHEKWMA---QHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDL 99
+ E+W A QH + Y E E+ +R+KI+ +N I K N+ G ++L N+++DL
Sbjct: 23 VKEEWNAFKLQHRKKYDSESEERIRMKIYVQNKHKIAKHNQRYDLGQEKFRLRVNKYADL 82
Query: 100 TNDEF--------RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPI 151
++EF R+ G K+ T + + DVPT++DWR+KGAVTP+
Sbjct: 83 LHEEFVHTLNGFNRSAAAGSKLLGREQLMTIEEPITWIEPANVDVPTTIDWREKGAVTPV 142
Query: 152 KNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAY 210
K+Q CG CW+F+A A+EG ++G L+ LSEQ L+DCST GNNGC GG + AF Y
Sbjct: 143 KDQGHCGSCWSFSATGALEGQHFRKTGKLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFQY 202
Query: 211 IIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIA 269
+ N+GI TE YPY+A+ C K A + ++P GDE+AL KA+ ++ PVS+A
Sbjct: 203 VKDNKGIDTEKAYPYEAIDDECHYNPKAIGATDKGFVDIPQGDEKALKKALATVGPVSVA 262
Query: 270 IAAYSTEFQSYKEGI-FNGVCGT-QLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGY 327
I A FQ Y EG+ + C + QLDH V VG+GTTEDG +YWL+KNSWG TWGD GY
Sbjct: 263 IDASHESFQFYSEGVYYEPQCDSEQLDHGVLAVGYGTTEDGEDYWLVKNSWGTTWGDQGY 322
Query: 328 MKIVRD-EGLCGIGTRSSYPL 347
+K+ R+ E CGI T +SYPL
Sbjct: 323 VKMARNRENHCGIATTASYPL 343
>gi|326490904|dbj|BAJ90119.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 457
Score = 268 bits (686), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 135/310 (43%), Positives = 185/310 (59%), Gaps = 12/310 (3%)
Query: 48 EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANK------EGNRTYKLGTNQFSDLTN 101
E W A+HG++Y E+ RL F EN ++ N G +Y L N F+DLT+
Sbjct: 40 EAWCAEHGKAYATPGERAARLAAFAENAAFVAAHNDAVASSGPGGPSYTLALNAFADLTH 99
Query: 102 DEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCW 161
DEFRA G P S + + VP +LDWR GAVT +K+Q CG CW
Sbjct: 100 DEFRAARLGRLAVGPGPLGAPSPSDGGFEGRVGAVPDALDWRQSGAVTKVKDQGSCGACW 159
Query: 162 AFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATED 221
+F+A A+EGI KI +G+L+ LSEQ+L+DC + N GC GG A+ ++I+N GI TED
Sbjct: 160 SFSATGAMEGINKITTGSLLSLSEQELIDCDRSYNTGCGGGLMTYAYKFVIKNGGIDTED 219
Query: 222 EYPYQAVPGTCSAAQ-KPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSY 280
+YP++ GTC+ + K I Y+EVPS E LL+AV+ QP+S+ I + FQ Y
Sbjct: 220 DYPFREADGTCNKNKLKKHVVTIDGYKEVPSSKEDLLLQAVAQQPISVGICGSARAFQLY 279
Query: 281 KEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGL 336
+GIF+G C T LDHAV IVG+G +E G +YW++KNSWG WG GYM + R+ G+
Sbjct: 280 SQGIFDGPCPTSLDHAVLIVGYG-SEGGKDYWIVKNSWGERWGMKGYMHMHRNTGSSSGI 338
Query: 337 CGIGTRSSYP 346
CGI +S+P
Sbjct: 339 CGINMMASFP 348
>gi|297845822|ref|XP_002890792.1| hypothetical protein ARALYDRAFT_473117 [Arabidopsis lyrata subsp.
lyrata]
gi|297336634|gb|EFH67051.1| hypothetical protein ARALYDRAFT_473117 [Arabidopsis lyrata subsp.
lyrata]
Length = 322
Score = 268 bits (684), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 141/327 (43%), Positives = 205/327 (62%), Gaps = 35/327 (10%)
Query: 30 SQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTY 89
SQ + +EQS+V+ H++WM Q R Y+DE EKEMRL++FK+NL++IE N GN++Y
Sbjct: 21 SQARPHVTLNEQSIVDYHQQWMTQFSRVYQDESEKEMRLQVFKKNLKFIENFNNMGNQSY 80
Query: 90 KLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPT---SLDWRDKG 146
+G N+F+D T +EF A +TG ++ + + T +N +++D+ S DWRD+G
Sbjct: 81 TVGVNEFTDWTIEEFLATHTGLRVNVTTLSELFNETMPSRNWNISDIDIDDESKDWRDEG 140
Query: 147 AVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREK 206
AV P+K Q C G+TKI NL+ LSEQQL+DC T N GC GG E+
Sbjct: 141 AVIPVKVQGAC-------------GLTKISGKNLLTLSEQQLIDCDTEKNTGCDGGGIEE 187
Query: 207 AFAYIIQNQGIATEDEYPYQAVPGTCSA-AQKPAAAKISNYEEVPSGDEQALLKAVSMQP 265
AF YII+N G++ E EYPYQ G+C A A+ +I +E VPS +E+ALL+AV QP
Sbjct: 188 AFKYIIKNGGVSLETEYPYQVKKGSCRANARSATQTQIRGFEMVPSHNERALLEAVRRQP 247
Query: 266 VSIAIAAYSTEFQSYKEGIFNGV-CGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGD 324
VS+ I A + F++YK G++ G+ CGT ++HAVT VG+GT +I+ +WG+
Sbjct: 248 VSVLIDARADSFKTYKGGVYAGLDCGTDVNHAVTFVGYGT--------MIQ-----SWGE 294
Query: 325 AGYMKIVRD----EGLCGIGTRSSYPL 347
GYM+I RD +G+CGI ++YP+
Sbjct: 295 NGYMRIRRDVEWPQGMCGIAQVAAYPI 321
>gi|255546708|ref|XP_002514413.1| cysteine protease, putative [Ricinus communis]
gi|223546510|gb|EEF48009.1| cysteine protease, putative [Ricinus communis]
Length = 324
Score = 268 bits (684), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 143/347 (41%), Positives = 210/347 (60%), Gaps = 44/347 (12%)
Query: 13 INTTPMFIIITLLVSCAS-----QVVSSRSTHEQS---VVEIHEKWMAQHGRSYKDELEK 64
+++ +F I T LV C+ +V H S + E+ E WM++HG++Y+ EK
Sbjct: 5 VSSIFLFTIFTSLVICSVVAHDFSIVGYSPEHLTSMHKLTELFESWMSKHGKTYESIEEK 64
Query: 65 EMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSS 124
RL++FK+NL +I++ N++ TY L N+F+DL+++EF+ S
Sbjct: 65 LHRLEVFKDNLMHIDRRNRDVT-TYWLALNEFADLSHEEFK-----------------SK 106
Query: 125 TFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLS 184
+ + L +KGAV P+KNQ CG CWAF+ VAAVEGI +I +GNL LS
Sbjct: 107 LAQIRRL------------EKGAVAPVKNQGSCGSCWAFSTVAAVEGINQIVTGNLTSLS 154
Query: 185 EQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKI 243
EQ+L+DC T+ N+GC GG + AF YI+ N G+ E++YPY GTC ++ I
Sbjct: 155 EQELIDCDTSFNSGCNGGLMDYAFDYIVNNGGLHKEEDYPYLMEEGTCDEKREEMEVVTI 214
Query: 244 SNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFG 303
S Y +VP +E++LLKA++ QP+SIAI A +FQ Y G+FNG CGT LDH V VG+G
Sbjct: 215 SGYHDVPENNEESLLKALAHQPLSIAIEASGRDFQFYGRGVFNGPCGTDLDHGVAAVGYG 274
Query: 304 TTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
+++ G +Y ++KNSWG WG+ GY+++ R+ EGLCGI +SYP
Sbjct: 275 SSK-GLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 320
>gi|326497561|dbj|BAK05870.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 340
Score = 268 bits (684), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 145/343 (42%), Positives = 202/343 (58%), Gaps = 21/343 (6%)
Query: 14 NTTPMFIIITLLVSCASQVVSSRS-----THEQSVVEIHEKWMAQHGRSYKDELEKEMRL 68
T P+ I+ LL + S + +++ +W A H RSY E+ R
Sbjct: 7 GTRPVIPILVLLTGGLFAAFPAASGGRVDAGDMLMMDRFRQWQATHNRSYLSAEERLRRF 66
Query: 69 KIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKY 128
++++ N+EYI+ N+ G TY+LG NQF+DLT +EF A Y G H + +T
Sbjct: 67 EVYRTNVEYIDATNRRGGLTYELGENQFADLTGEEFLARYAG------GHTGSAITTAAE 120
Query: 129 QNLSM-TDVPTSLDWRDKGAVTPIKNQ-KECGCCWAFAAVAAVEGITKIRSGNLIQLSEQ 186
+ S+ D P S+DWR KGAVTP+KNQ +C CWAF+AVA +E + I++G L+ LSEQ
Sbjct: 121 ADGSLEADPPASVDWRAKGAVTPVKNQGSQCYSCWAFSAVATMESLYFIKTGKLVALSEQ 180
Query: 187 QLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNY 246
QL+DC + GC G +AF +I++N GI T +YPY+AV G CSAA KPA I+ +
Sbjct: 181 QLVDCDKY-DGGCNKGYYHRAFQWIMENGGITTAAQYPYKAVRGACSAA-KPAV-TITGH 237
Query: 247 EEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTE 306
V +E AL AV+ QP+ +AI Q YK G+F+ CG Q+ HAV VG+G
Sbjct: 238 LAVAK-NELALQSAVARQPIGVAIEV-PISMQFYKSGVFSAACGIQMSHAVVTVGYGADA 295
Query: 307 DGANYWLIKNSWGNTWGDAGYMKIVRD---EGLCGIGTRSSYP 346
G YWL+KNSWG TWG+AGY+++ RD GLCGI ++YP
Sbjct: 296 SGLKYWLVKNSWGQTWGEAGYIRMRRDVGGGGLCGIALDTAYP 338
>gi|53791858|dbj|BAD53944.1| putative cysteine protease [Oryza sativa Japonica Group]
Length = 335
Score = 267 bits (683), Expect = 5e-69, Method: Compositional matrix adjust.
Identities = 143/343 (41%), Positives = 197/343 (57%), Gaps = 22/343 (6%)
Query: 15 TTPMFIIITLLV--SCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFK 72
T+ + ++ TL+ + A+ + + + +++ E+WMA+ G++YK EKE R IF+
Sbjct: 2 TSIVLLVCTLMALQAMAASAYYNNGSDDGVTMQMFEEWMAKFGKTYKCHGEKEHRFGIFR 61
Query: 73 ENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
+N+ +I + +G NQF+DLTNDEF A YTG K P P + +
Sbjct: 62 DNVHFIRGYKPQVTYDSAVGINQFADLTNDEFVATYTGAKPPHPKEAP--------RPVD 113
Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
P +DWR +GAVT +K+Q CG CWAFAAVAA+EG+TKIR+G L LSEQ+L+DC
Sbjct: 114 PIWTPCCIDWRFRGAVTGVKDQGACGSCWAFAAVAAIEGLTKIRTGQLTPLSEQELVDCD 173
Query: 193 TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQK--PAAAKISNYEEVP 250
TN +NGC GG ++AF + GI E +Y Y+ G C AA I Y VP
Sbjct: 174 TN-SNGCGGGHTDRAFELVASKGGITAESDYRYEGFQGKCRVDDMLFNHAASIGGYRAVP 232
Query: 251 SGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGAN 310
DE+ L AV+ QPV++ I A FQ YK G+F G CG +HAVT+VG+ +DGA+
Sbjct: 233 PNDERQLATAVARQPVTVYIDASGPAFQFYKSGVFPGPCGASSNHAVTLVGY--CQDGAS 290
Query: 311 ---YWLIKNSWGNTWGDAGYM----KIVRDEGLCGIGTRSSYP 346
YWL KNSWG TWG GY+ IV+ G CG+ YP
Sbjct: 291 GKKYWLAKNSWGKTWGQQGYILLEKDIVQPHGTCGLAVSPFYP 333
>gi|154183745|gb|ABS70713.1| cathepsin L-like cysteine proteinase [Dermacentor variabilis]
Length = 333
Score = 267 bits (683), Expect = 5e-69, Method: Compositional matrix adjust.
Identities = 143/337 (42%), Positives = 209/337 (62%), Gaps = 13/337 (3%)
Query: 19 FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
+ I+LL CA VV++ ++ + + E + A H +SY+ +E+ +R KIF EN +
Sbjct: 1 MLRISLL--CAFVVVTTAASSHEILRTQWEAFKATHKKSYQSNMEELLRFKIFSENSLLV 58
Query: 79 EKANKEGNR---TYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD 135
+ N++ R +YKLG NQF DL EF ++ GY+ + R +T N++ +
Sbjct: 59 ARHNEKYARGLVSYKLGMNQFGDLLPHEFARMFNGYRGARTAGRGST--FLPPANVNYSS 116
Query: 136 VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-TN 194
+P S+DWR+KGAVTP+KNQ +CG CWAF+ ++EG +++G L+ LSEQ L+DCS T
Sbjct: 117 LPQSMDWREKGAVTPVKNQGQCGSCWAFSTTGSLEGQHFLKTGVLVSLSEQNLVDCSETF 176
Query: 195 GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDE 254
GN+GC GG + AF YI N GI TE YPY+A G C ++ A + + ++ G E
Sbjct: 177 GNHGCEGGLMDNAFQYIKANGGIDTEKSYPYEAEDGECRFKKQNVGATDTGFVDIEQGSE 236
Query: 255 QALLKAV-SMQPVSIAIAAYSTEFQSYKEGIFNGV-CGT-QLDHAVTIVGFGTTEDGANY 311
L KAV ++ PVS+AI A + FQ Y EG+++ C + QLDH V +VG+G EDG Y
Sbjct: 237 DDLKKAVATVGPVSVAIDASHSSFQLYSEGVYDETECSSEQLDHGVLVVGYG-VEDGKKY 295
Query: 312 WLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
WL+KNSW +WGD GY+K+ RD + CGI + +SYPL
Sbjct: 296 WLVKNSWAESWGDNGYIKMSRDKDNQCGIASAASYPL 332
>gi|72008176|ref|XP_780713.1| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
Length = 335
Score = 267 bits (682), Expect = 7e-69, Method: Compositional matrix adjust.
Identities = 144/337 (42%), Positives = 209/337 (62%), Gaps = 13/337 (3%)
Query: 19 FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
++ + L+ +C VVSS S E +W +HG+ Y + E+ R I+++NL+ +
Sbjct: 3 YLSVLLVAAC---VVSSLSMSFTDFDEDWNQWKNEHGKRYLSDEEEASRKLIWEKNLDIV 59
Query: 79 EKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD 135
K N + G+ TY LG NQF+DL N+EF A+ TG+++ S ++ STF N ++ +
Sbjct: 60 IKHNLKYDLGHFTYALGMNQFADLKNEEFVAMMTGFRVNGTS-KAAKGSTFLPSN-NIGE 117
Query: 136 VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-TN 194
+P ++DWR KG VTP+K+Q +CG CWAF+ ++EG +G L+ LSEQ L+DCS
Sbjct: 118 LPKTVDWRTKGYVTPVKDQGQCGSCWAFSTTGSLEGQHFKATGKLVSLSEQNLVDCSGKE 177
Query: 195 GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDE 254
GN GC GG ++AF YII+ GI TE+ YPY+AV G C + A ++ Y +V S E
Sbjct: 178 GNEGCDGGLMDQAFQYIIKAGGIDTEESYPYKAVDGECHFKKANIGATVTGYTDVTSDSE 237
Query: 255 QALLKAVS-MQPVSIAIAAYSTEFQSYKEGIFN--GVCGTQLDHAVTIVGFGTTEDGANY 311
AL KAV+ + P+S+AI A FQ YK G++N T LDH V VG+GTT DG +Y
Sbjct: 238 TALQKAVAHIGPISVAIDASHMSFQLYKSGVYNEPDCSSTLLDHGVLAVGYGTTSDGTDY 297
Query: 312 WLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
W++KNSW TWG GY+ + R+ + CGI T++SYPL
Sbjct: 298 WIVKNSWAETWGMNGYLWMSRNKDNQCGIATQASYPL 334
>gi|356542171|ref|XP_003539543.1| PREDICTED: LOW QUALITY PROTEIN: KDEL-tailed cysteine endopeptidase
CEP2-like [Glycine max]
Length = 342
Score = 267 bits (682), Expect = 7e-69, Method: Compositional matrix adjust.
Identities = 144/346 (41%), Positives = 206/346 (59%), Gaps = 24/346 (6%)
Query: 15 TTPMFIIITLLVSCASQVVSS--------RSTHEQSVVEIHEKWMAQHGRSYKDELEKEM 66
T + II LLV C + +S S+ + + +E W+ ++G+ Y+++ E E
Sbjct: 4 TITLVAIINLLVLCNLWITASACPAKHNDNSSDSEVMRMRYESWLKKYGQKYRNKDEWEF 63
Query: 67 RLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTF 126
R +I++ N+++IE N + N +YKL N+F DLTN+EFR +Y Y+ RS + F
Sbjct: 64 RFEIYRANVQFIEVYNSQ-NYSYKLMDNKFVDLTNEEFRRMYLVYQP-----RSHLQTRF 117
Query: 127 KYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQ 186
YQ D+P +DWR +GAVT IK+Q CG CW+F+AVA VE I KI++G L+ LSEQ
Sbjct: 118 MYQ--KHGDLPKRIDWRTRGAVTXIKDQGHCGSCWSFSAVATVEDINKIKTGKLVSLSEQ 175
Query: 187 QLLDCST-NGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQ-KPAAAKIS 244
QL+DC NGN GC GG E F +I + G+ T+ YPYQ G + A+ + A I
Sbjct: 176 QLIDCDNRNGNEGCNGGHME-TFTFITKRGGLTTDKNYPYQGSDGDXNKAKVRNHAVAIC 234
Query: 245 NYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGT 304
YE +P+ +E L AV+ QP S+A A FQ Y +G F+G CG L+H +TIVG+G
Sbjct: 235 GYENLPAHNENMLKAAVAHQPASVATDAGGYAFQLYSKGTFSGSCGKDLNHRMTIVGYG- 293
Query: 305 TEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
E+G YWL+KNSW N G +GY+++ RD +G CG +SYP
Sbjct: 294 EENGEKYWLVKNSWANDXGVSGYIRMKRDPKDKDGTCGTAMEASYP 339
>gi|357130488|ref|XP_003566880.1| PREDICTED: fruit bromelain-like [Brachypodium distachyon]
Length = 356
Score = 267 bits (682), Expect = 7e-69, Method: Compositional matrix adjust.
Identities = 139/315 (44%), Positives = 189/315 (60%), Gaps = 16/315 (5%)
Query: 47 HEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRA 106
HE+WMA+ GR Y D EK R ++F N Y++ N+ GNRTY LG N+FSDLT+DEF
Sbjct: 39 HEEWMAKFGRVYTDAQEKARRQEVFGANARYVDAVNRAGNRTYTLGLNKFSDLTDDEFVQ 98
Query: 107 LYTGYK-MPSPSHRSTTSSTFKYQNLS--MTDVPTSLDWRDKGAVTPIKNQKECGCCWAF 163
+ GY+ R + K L D+P S+DWR +GAVT +KNQ CGCCWAF
Sbjct: 99 THLGYRGHQQGGLRPEEENVSKVAALGYGQADMPESVDWRAQGAVTGVKNQGSCGCCWAF 158
Query: 164 AAVAAVEGITKIRSGNLIQLSEQQLLDCSTN----GN-NGCLGGSREKAFAYIIQNQGIA 218
AAVAA EG+ KI +GNLI +SEQQ+LDC+ GN N C GG + A Y+ ++G+
Sbjct: 159 AAVAATEGLVKIATGNLISMSEQQVLDCTGQSPGMGNTNTCDGGHIDDALRYVAASRGLQ 218
Query: 219 TEDEYPYQAVPGTCSAAQKP-AAAKISNYEEVP-SGDEQALLKAVSMQPVSIAIAAYSTE 276
E Y Y + G C + P +AA + V GDE L V+ QP+++++ A S +
Sbjct: 219 PEAAYAYTGLQGACQSGFTPNSAASFGEPQTVTLQGDEGRLQGLVAGQPIAVSVEA-SDD 277
Query: 277 FQSYKEGIFNG---VCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD 333
F+ Y G+F CG +L+HAVT+VG+G+ + G YWL+KN WG +WG+ GYM+I R
Sbjct: 278 FRHYMSGVFTAGTSSCGQRLNHAVTVVGYGSADGGQEYWLVKNQWGTSWGEGGYMRIARG 337
Query: 334 EGL--CGIGTRSSYP 346
G CGI + YP
Sbjct: 338 NGAPNCGISAYAYYP 352
>gi|356517384|ref|XP_003527367.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 332
Score = 266 bits (681), Expect = 7e-69, Method: Compositional matrix adjust.
Identities = 151/340 (44%), Positives = 212/340 (62%), Gaps = 28/340 (8%)
Query: 20 IIITLLVSCASQV--VSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
I +L+S A V+ R+ + S+ E H + M ++ + KD + +FKEN+ Y
Sbjct: 10 IAFAMLLSMAFLAFQVTCRTLQDASMYESHGQRMTRYSKVDKDPPDX-----VFKENVNY 64
Query: 78 IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
IE N ++ YK NQF+ + + G+ M S R TT FK++N++ T P
Sbjct: 65 IEACNNAADKPYKRDINQFAP------KKRFKGH-MCSSIIRITT---FKFENVTAT--P 112
Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLS-EQQLLDCSTNG- 195
+++D R K AVTPIK+Q +CGC WA +AVAA EGI + +G LI LS EQ+L+DC T G
Sbjct: 113 STVDCRQKVAVTPIKDQGQCGCFWALSAVAATEGIHALXAGKLILLSSEQELVDCDTKGV 172
Query: 196 NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSA--AQKPAAAKISNYEEVPSGD 253
+ C GG + AF +IIQN G+ TE YPY+ V G C+A A K AA I+ YE+VP+ +
Sbjct: 173 DQDCQGGLMDDAFKFIIQNHGLNTEANYPYKGVDGKCNAYEADKNAATIITGYEDVPANN 232
Query: 254 EQA-LLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYW 312
E+A L KAV+ PVS+AI A ++FQ YK G+F G CGT+LDH VT VG+G ++DG YW
Sbjct: 233 EKAHLQKAVANNPVSVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVSDDGTEYW 292
Query: 313 LIKNSWGNTWGDAGYMKIVR----DEGLCGIGTRSSYPLA 348
L+KNS G WG+ GY+++ R +E LCGI ++SYP A
Sbjct: 293 LVKNSRGTEWGEEGYIRMQRGVDSEEALCGIAVQASYPSA 332
>gi|194701748|gb|ACF84958.1| unknown [Zea mays]
gi|414589103|tpg|DAA39674.1| TPA: thiol protease SEN102 [Zea mays]
Length = 374
Score = 266 bits (681), Expect = 8e-69, Method: Compositional matrix adjust.
Identities = 146/347 (42%), Positives = 199/347 (57%), Gaps = 33/347 (9%)
Query: 29 ASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNR- 87
A + S ST + S++E ++W A + +SY E+ R ++ N+ YIE N E
Sbjct: 32 AGDMERSMSTDDSSMIERFQRWKAAYNKSYATVAEERRRFRVCARNMAYIEATNAEAEAA 91
Query: 88 --TYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFK------------------ 127
TY+LG ++DLTN EF A+YT P+P+ S
Sbjct: 92 GLTYELGETAYTDLTNQEFMAMYTA---PAPAQLPADESVITTRAGPVDAVGGAPGQLPV 148
Query: 128 YQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQ 187
Y NLS T P S+DWR GAVTP+KNQ CG CWAF+ VA VEGI +IR+G L+ LSEQ+
Sbjct: 149 YVNLS-TSAPASVDWRASGAVTPVKNQGRCGSCWAFSTVAVVEGIYQIRTGKLVSLSEQE 207
Query: 188 LLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQ-KPAAAKISNY 246
L+DC T ++GC GG +A +I N GI TE +YPY C+ A+ A I+
Sbjct: 208 LVDCDTL-DDGCDGGISYRALRWIASNGGITTETDYPYTGTTDACNRAKLSHNAVSIAGL 266
Query: 247 EEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTE 306
V + E +L AV+ QPV+++I A FQ YK+G++NG CGT L+H VT+VG+G
Sbjct: 267 RRVATRSEASLANAVAGQPVAVSIEAGGDNFQHYKKGVYNGPCGTNLNHGVTVVGYGQEA 326
Query: 307 DGAN-YWLIKNSWGNTWGDAGYMKIVRD-----EGLCGIGTRSSYPL 347
G + YW++KNSWG WGD GY+++ +D EGLCGI R SYPL
Sbjct: 327 AGGDRYWIVKNSWGQGWGDDGYIRMKKDVAGKPEGLCGIAIRPSYPL 373
>gi|390337642|ref|XP_780653.3| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
Length = 333
Score = 266 bits (681), Expect = 9e-69, Method: Compositional matrix adjust.
Identities = 144/336 (42%), Positives = 208/336 (61%), Gaps = 13/336 (3%)
Query: 19 FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
++ + L+ +C VVSS S E +W +HG+ Y + E+ R I+++NL+ +
Sbjct: 3 YLSVLLVAAC---VVSSLSMSFTDFDEDWNEWKNEHGKRYLSDEEEASRRLIWQKNLDIV 59
Query: 79 EKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD 135
K N + G+ TY LG NQF+DL N+EF A+ TG+++ S ++ STF N ++ +
Sbjct: 60 IKHNLKYDLGHFTYDLGINQFTDLQNEEFVAMMTGFRVSGTS-KAAKGSTFLPPN-NVGE 117
Query: 136 VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG 195
+P ++DWR KG VTP+K+Q +CG CWAF+ +VEG +G L+ LSEQ L+DCS
Sbjct: 118 LPKTVDWRTKGYVTPVKDQGQCGSCWAFSTTGSVEGQHFKATGKLVSLSEQNLVDCSGR- 176
Query: 196 NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQ 255
+ GC GG ++AF YII GI TE YPY+AV G C + A ++ Y +V SG E+
Sbjct: 177 DAGCDGGFMDRAFQYIIDAGGIDTEASYPYKAVDGKCHFKKANVGATVTGYTDVTSGSEK 236
Query: 256 ALLKAVS-MQPVSIAIAAYSTEFQSYKEGIFN--GVCGTQLDHAVTIVGFGTTEDGANYW 312
AL KAV+ + P+S+AI A FQ YK G++N G T LDH V VG+GT+ DG +YW
Sbjct: 237 ALQKAVAHVGPISVAIDASHMSFQHYKSGVYNEPGCDSTVLDHGVLAVGYGTSSDGTDYW 296
Query: 313 LIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
++KNSW TWG GY+ + R+ + CGI T +SYPL
Sbjct: 297 IVKNSWAETWGMNGYVWMSRNKDNQCGIATNASYPL 332
>gi|218202077|gb|EEC84504.1| hypothetical protein OsI_31195 [Oryza sativa Indica Group]
Length = 362
Score = 266 bits (680), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 144/361 (39%), Positives = 210/361 (58%), Gaps = 31/361 (8%)
Query: 11 FKINTTPMFIIITLLVSC----ASQVVSSRSTH-------EQSVVEIHEKWMAQHGRSYK 59
F +P + + LL SC A+ ++ +R+T + +++ W H RSY
Sbjct: 4 FMACASPPVLTLALLASCGALLATSMLPARATAGSCLDVGDMVMMDRFRAWQGAHNRSYP 63
Query: 60 DELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKM---PSP 116
E R +++ N E+I+ N G+ TY+L N+F+DLT +EF A YTGY P
Sbjct: 64 SAEEALQRFDVYRRNAEFIDAVNLRGDLTYRLAENEFADLTEEEFLATYTGYYAGDGPVD 123
Query: 117 SHRSTT-----SSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKE-CGCCWAFAAVAAVE 170
TT ++F Y+ DVP S+DWR +GAV P K+Q C CWAF A +E
Sbjct: 124 DSVITTGAGDVDASFSYR----VDVPASVDWRAQGAVVPPKSQTSTCSSCWAFVTAATIE 179
Query: 171 GITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPG 230
+ I++G L+ LSEQQL+DC + + GC GS +A+ ++++N G+ TE +YPY A G
Sbjct: 180 SLNMIKTGKLVSLSEQQLVDCDSY-DGGCNLGSYGRAYKWVVENGGLTTEADYPYTARRG 238
Query: 231 TCSAAQKP-AAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVC 289
C+ A+ AAKI+ + +VP +E AL AV+ QPV++AI + Q YK G++ G C
Sbjct: 239 PCNRAKSAHHAAKITGFGKVPPRNEAALQAAVARQPVAVAIEV-GSGMQFYKGGVYTGPC 297
Query: 290 GTQLDHAVTIVGFGT-TEDGANYWLIKNSWGNTWGDAGYMKIVRD---EGLCGIGTRSSY 345
GT+L HAVT+VG+GT GA YW IKNSWG +WG+ GY++I+RD GLCG+ +Y
Sbjct: 298 GTRLAHAVTVVGYGTDASSGAKYWTIKNSWGQSWGERGYIRILRDVGGPGLCGVTLDIAY 357
Query: 346 P 346
P
Sbjct: 358 P 358
>gi|356517398|ref|XP_003527374.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 333
Score = 266 bits (680), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 152/341 (44%), Positives = 212/341 (62%), Gaps = 30/341 (8%)
Query: 19 FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
F ++ + A QV + R+ + S+ E HE+ M ++ + YKD E F N+ YI
Sbjct: 12 FAMLLCMAFLAFQV-TCRTLQDASMYERHEQRMTRYSKVYKDPPES------FXGNVNYI 64
Query: 79 EKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPT 138
E N ++ YK G NQF R + G+ M S R TT FK++N++ T P+
Sbjct: 65 EACNNAADKPYKXGINQFPP------RNRFKGH-MCSSIIRITT---FKFENVTAT--PS 112
Query: 139 SLDWRDKGAVTP--IKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLS-EQQLLDCSTNG 195
++D R KGAVTP +K+Q +CGC WA +AVAA EGI + +G LI LS E +L+DC T G
Sbjct: 113 TVDCRQKGAVTPYTVKDQGQCGCFWALSAVAATEGIHALXAGKLILLSXEPELVDCDTKG 172
Query: 196 -NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSA--AQKPAAAKISNYEEVPSG 252
+ GC GG + AF +IIQN G+ TE YPY+ V G C+A A K AA I+ Y++VP+
Sbjct: 173 VDQGCEGGLTDDAFKFIIQNHGLNTEANYPYKGVDGKCNANEADKNAATIITGYDDVPAN 232
Query: 253 DEQALL-KAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANY 311
+E+A L KAV+ PVS+AI A ++FQ YK G+F G CGT+LDH VT VG+G ++DG Y
Sbjct: 233 NEKAHLQKAVANNPVSVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVSDDGTEY 292
Query: 312 WLIKNSWGNTWGDAGYMKIVR----DEGLCGIGTRSSYPLA 348
WL+KNS G WG+ GY+++ R +E LCGI ++SYP A
Sbjct: 293 WLVKNSRGPEWGEEGYIRMQRGVDSEEALCGIAVQASYPSA 333
>gi|297818854|ref|XP_002877310.1| hypothetical protein ARALYDRAFT_484828 [Arabidopsis lyrata subsp.
lyrata]
gi|297323148|gb|EFH53569.1| hypothetical protein ARALYDRAFT_484828 [Arabidopsis lyrata subsp.
lyrata]
Length = 376
Score = 266 bits (679), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 144/331 (43%), Positives = 201/331 (60%), Gaps = 23/331 (6%)
Query: 32 VVSSRSTH--EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTY 89
VV++ +H E V I+E+W+ +HG++Y EKE R KIFK+NL++IE+ N + NR+Y
Sbjct: 24 VVTATESHRNEAEVRTIYERWLVEHGKNYNGLGEKERRFKIFKDNLKHIEEHNSDPNRSY 83
Query: 90 KLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVT 149
G NQFSDLT DEF+A Y G K+ +S + +YQ +P +DWR++GAV
Sbjct: 84 DRGLNQFSDLTVDEFQASYLGGKI---EKKSLSDVAERYQYKEGDILPDEVDWRERGAVV 140
Query: 150 P-IKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNN-GCLGGSREKA 207
P +K Q +CG CWAFAA AVEGI +I +G L+ LSEQ+L+DC +N GC GG A
Sbjct: 141 PRVKRQGDCGSCWAFAATGAVEGINQITTGELLSLSEQELIDCDRGKDNFGCAGGGAVWA 200
Query: 208 FAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAK------ISNYEEVPSGDEQALLKAV 261
F +I +N GI T+++Y Y G +AA K K I+ +E VP DE +L KAV
Sbjct: 201 FEFIKENGGIVTDEDYGYT---GDDTAACKAIEMKTTRVVTINGHEVVPVNDEMSLKKAV 257
Query: 262 SMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQL-DHAVTIVGFGTTEDGANYWLIKNSWGN 320
S QP+S+ I+A YK G++ G C DH V IVG+GT+ D +YWLI+NSWG
Sbjct: 258 SYQPISVMISA--ANMSDYKSGVYKGPCSNLWGDHNVLIVGYGTSSDEGDYWLIRNSWGP 315
Query: 321 TWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
WG+ GY+++ R+ G C + YP+
Sbjct: 316 GWGEGGYLRLQRNFNEPTGKCAVAVAPVYPI 346
>gi|194757786|ref|XP_001961143.1| GF13722 [Drosophila ananassae]
gi|190622441|gb|EDV37965.1| GF13722 [Drosophila ananassae]
Length = 417
Score = 266 bits (679), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 137/306 (44%), Positives = 188/306 (61%), Gaps = 11/306 (3%)
Query: 53 QHGRSYKDELEKEMRLKIFKENLEYIEKANK---EGNRTYKLGTNQFSDLTNDEFRALYT 109
+H ++Y DE E+ RLKIF EN I K N+ G +YKL N+++D+ + EFR L
Sbjct: 111 EHRKNYLDETEERFRLKIFNENKHKIAKHNQLWASGKVSYKLAVNKYADMLHHEFRQLMN 170
Query: 110 GYKMPSPSHRSTTSSTFK---YQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAV 166
G+ +FK + + +P S+DWRDKGAVT +K+Q CG CWAF++
Sbjct: 171 GFNYTLHKELRAADESFKGVTFISPEHVTLPKSVDWRDKGAVTGVKDQGHCGSCWAFSST 230
Query: 167 AAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEYPY 225
A+EG +SG L+ LSEQ L+DCST GNNGC GG + AF YI N GI TE YPY
Sbjct: 231 GALEGQHYRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGGIDTEKSYPY 290
Query: 226 QAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGI 284
+A+ +C + A + ++P G+E+ L +AV ++ PVS+AI A FQ Y EG+
Sbjct: 291 EALDDSCHFNKGTIGATDRGFVDIPQGNEKKLAEAVATIGPVSVAIDASHESFQFYSEGV 350
Query: 285 F-NGVCGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGT 341
+ C Q LDH V +VGFGT E G +YWL+KNSWG TWGD G++K++R+ + CGI +
Sbjct: 351 YVEPACDAQNLDHGVLVVGFGTDESGQDYWLVKNSWGTTWGDKGFIKMLRNKDNQCGIAS 410
Query: 342 RSSYPL 347
SSYPL
Sbjct: 411 ASSYPL 416
>gi|125525815|gb|EAY73929.1| hypothetical protein OsI_01813 [Oryza sativa Indica Group]
Length = 336
Score = 266 bits (679), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 140/340 (41%), Positives = 195/340 (57%), Gaps = 22/340 (6%)
Query: 18 MFIIITLLV--SCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENL 75
+ ++ TL+ + A+ + + + +++ E+WMA+ G++YK EKE R IF++N+
Sbjct: 6 LLVVCTLMALQAMAASAYYNNGSDDGVTMQMFEEWMAKFGKTYKCHGEKEHRFGIFRDNV 65
Query: 76 EYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD 135
+I + +G NQF+DLTNDEF A YTG K P P + +
Sbjct: 66 HFIRGYKPQVTYDSAVGINQFADLTNDEFVATYTGAKPPHPKEAP--------RPVDPIW 117
Query: 136 VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG 195
P +DWR +GAVT +K+Q CG CWAFAAVAA+EG+TKIR+G L LSEQ+L+DC TN
Sbjct: 118 TPCCIDWRFRGAVTGVKDQGACGSCWAFAAVAAIEGLTKIRTGQLTPLSEQELVDCDTN- 176
Query: 196 NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQK--PAAAKISNYEEVPSGD 253
+NGC GG ++AF + GI E +Y Y+ G C AA I Y VP D
Sbjct: 177 SNGCGGGHTDRAFELVASKGGITAESDYRYEGFQGKCRVDDMLFNHAASIGGYRAVPPND 236
Query: 254 EQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGAN--- 310
E+ L AV+ QPV++ I A FQ YK G+F G CG +HAVT+VG+ +DGA+
Sbjct: 237 ERQLATAVARQPVTVYIDASGPAFQFYKSGVFPGPCGASSNHAVTLVGY--CQDGASGKK 294
Query: 311 YWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
YW+ KNSWG TWG GY+ + +D G CG+ YP
Sbjct: 295 YWVAKNSWGKTWGQQGYILLEKDVLQPHGTCGLAVSPFYP 334
>gi|115478933|ref|NP_001063060.1| Os09g0381400 [Oryza sativa Japonica Group]
gi|113631293|dbj|BAF24974.1| Os09g0381400 [Oryza sativa Japonica Group]
gi|215678649|dbj|BAG92304.1| unnamed protein product [Oryza sativa Japonica Group]
gi|218202075|gb|EEC84502.1| hypothetical protein OsI_31193 [Oryza sativa Indica Group]
Length = 362
Score = 266 bits (679), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 144/361 (39%), Positives = 210/361 (58%), Gaps = 31/361 (8%)
Query: 11 FKINTTPMFIIITLLVSC----ASQVVSSRSTH-------EQSVVEIHEKWMAQHGRSYK 59
F +P + + LL SC A+ ++ +R+T + +++ W H RSY
Sbjct: 4 FMACASPPVLTLALLASCGALLATSMLPARATAGSCLDVGDMVMMDRFRAWQGAHNRSYP 63
Query: 60 DELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKM---PSP 116
E R +++ N E+I+ N G+ TY+L N+F+DLT +EF A YTGY P
Sbjct: 64 SAEEALQRFDVYRRNAEFIDAVNLRGDLTYQLAENEFADLTEEEFLATYTGYYAGDGPVD 123
Query: 117 SHRSTT-----SSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKE-CGCCWAFAAVAAVE 170
TT ++F Y+ DVP S+DWR +GAV P K+Q C CWAF A +E
Sbjct: 124 DSVITTGAGDVDASFSYR----VDVPASVDWRAQGAVVPPKSQTSTCSSCWAFVTAATIE 179
Query: 171 GITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPG 230
+ I++G L+ LSEQQL+DC + + GC GS +A+ ++++N G+ TE +YPY A G
Sbjct: 180 SLNMIKTGKLVSLSEQQLVDCDSY-DGGCNLGSYGRAYKWVVENGGLTTEADYPYTARRG 238
Query: 231 TCSAAQKP-AAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVC 289
C+ A+ AAKI+ + +VP +E AL AV+ QPV++AI + Q YK G++ G C
Sbjct: 239 PCNRAKSAHHAAKITGFGKVPPRNEAALQAAVARQPVAVAIEV-GSGMQFYKGGVYTGPC 297
Query: 290 GTQLDHAVTIVGFGT-TEDGANYWLIKNSWGNTWGDAGYMKIVRD---EGLCGIGTRSSY 345
GT+L HAVT+VG+GT GA YW IKNSWG +WG+ GY++I+RD GLCG+ +Y
Sbjct: 298 GTRLAHAVTVVGYGTDASSGAKYWTIKNSWGQSWGERGYIRILRDVGGPGLCGVTLDIAY 357
Query: 346 P 346
P
Sbjct: 358 P 358
>gi|301116794|ref|XP_002906125.1| cysteine protease family C01A, putative [Phytophthora infestans
T30-4]
gi|262107474|gb|EEY65526.1| cysteine protease family C01A, putative [Phytophthora infestans
T30-4]
Length = 535
Score = 266 bits (679), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 141/305 (46%), Positives = 187/305 (61%), Gaps = 11/305 (3%)
Query: 50 WMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRT-YKLGTNQFSDLTNDEFRALY 108
WM H S+ D LE RL+ + N YI + N E T KL N+FS ++ +EF+
Sbjct: 32 WMKTHSVSFSDALEFAKRLENYIANDMYIMEHNLENAWTGVKLDHNEFSSMSFEEFKFKM 91
Query: 109 TGYKMPSPSHRSTTSSTFKYQNL-SMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVA 167
TGY MP +S + NL S VP S+DW+DKG VTP+KNQ CG CWAF+
Sbjct: 92 TGYVMPEGYLEQRLAS--RVDNLWSDVQVPDSVDWQDKGGVTPVKNQGMCGSCWAFSTTG 149
Query: 168 AVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQA 227
AVEG + SG L+ LSEQ+L+DC NG+ GC GG + AFA+I N GI +ED+Y Y+A
Sbjct: 150 AVEGAAFVSSGKLVSLSEQELVDCDHNGDMGCNGGLMDHAFAWIEDNGGICSEDDYEYKA 209
Query: 228 VPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNG 287
C +K KIS +++V DE AL AV+ QPVS+AI A FQ YK G+FN
Sbjct: 210 KAQVCRDCEK--VVKISGFQDVNPQDEHALKVAVAQQPVSVAIEADQKAFQFYKSGVFNL 267
Query: 288 VCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE----GLCGIGTRS 343
CGT+LDH V VG+G +E+G +W +KNSWG++WG+ GY+++ R+E G CGI +
Sbjct: 268 TCGTRLDHGVLAVGYG-SENGQKFWKVKNSWGSSWGEKGYIRLAREENGPAGQCGIASVP 326
Query: 344 SYPLA 348
SYP A
Sbjct: 327 SYPFA 331
>gi|66270077|gb|AAY43368.1| cysteine protease [Phytophthora infestans]
Length = 510
Score = 265 bits (678), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 141/305 (46%), Positives = 187/305 (61%), Gaps = 11/305 (3%)
Query: 50 WMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRT-YKLGTNQFSDLTNDEFRALY 108
WM H S+ D LE RL+ + N YI + N E T KL N+FS ++ +EF+
Sbjct: 32 WMKTHSVSFSDALEFAKRLENYIANDMYIMEHNLENAWTGVKLDHNEFSSMSFEEFKFKM 91
Query: 109 TGYKMPSPSHRSTTSSTFKYQNL-SMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVA 167
TGY MP +S + NL S VP S+DW+DKG VTP+KNQ CG CWAF+
Sbjct: 92 TGYVMPEGYLEQRLAS--RVDNLWSDVQVPDSVDWQDKGGVTPVKNQGMCGSCWAFSTTG 149
Query: 168 AVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQA 227
AVEG + SG L+ LSEQ+L+DC NG+ GC GG + AFA+I N GI +ED+Y Y+A
Sbjct: 150 AVEGAAFVSSGKLVSLSEQELVDCDHNGDMGCNGGLMDHAFAWIEDNGGICSEDDYEYKA 209
Query: 228 VPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNG 287
C +K KIS +++V DE AL AV+ QPVS+AI A FQ YK G+FN
Sbjct: 210 KAQVCRDCEK--VVKISGFQDVNPQDEHALKVAVAQQPVSVAIEADQKAFQFYKSGVFNL 267
Query: 288 VCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE----GLCGIGTRS 343
CGT+LDH V VG+G +E+G +W +KNSWG++WG+ GY+++ R+E G CGI +
Sbjct: 268 TCGTRLDHGVLAVGYG-SENGQKFWKVKNSWGSSWGEKGYIRLAREENGPAGQCGIASVP 326
Query: 344 SYPLA 348
SYP A
Sbjct: 327 SYPFA 331
>gi|194352758|emb|CAQ00107.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
Length = 457
Score = 265 bits (678), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 134/309 (43%), Positives = 184/309 (59%), Gaps = 12/309 (3%)
Query: 48 EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANK------EGNRTYKLGTNQFSDLTN 101
E W A+HG++Y E+ RL F EN ++ N G +Y L N F+DLT+
Sbjct: 40 EAWCAEHGKAYATPGERAARLAAFAENAAFVAAHNDAVASSGPGGPSYTLALNAFADLTH 99
Query: 102 DEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCW 161
DEFRA G P S + + VP +LDWR GAVT +K+Q CG CW
Sbjct: 100 DEFRAARLGRLAVGPGPLGAPSPSDGGFEGRVGAVPDALDWRQSGAVTKVKDQGSCGACW 159
Query: 162 AFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATED 221
+F+A A+EGI KI +G+L+ LSEQ+L+DC + N GC GG A+ ++I+N GI TED
Sbjct: 160 SFSATGAMEGINKITTGSLLSLSEQELIDCDRSYNTGCGGGLMTYAYKFVIKNGGIDTED 219
Query: 222 EYPYQAVPGTCSAAQ-KPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSY 280
+YP++ GTC+ + K I Y+EVPS E LL+AV+ QP+S+ I + FQ Y
Sbjct: 220 DYPFREADGTCNKNKLKKHVVTIDGYKEVPSSKEDLLLQAVAQQPISVGICGSARAFQLY 279
Query: 281 KEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGL 336
+GIF+G C T LDHAV IVG+G +E G +YW++KNSWG WG GYM + R+ G+
Sbjct: 280 SQGIFDGPCPTSLDHAVLIVGYG-SEGGKDYWIVKNSWGERWGMKGYMHMHRNTGSSSGI 338
Query: 337 CGIGTRSSY 345
CGI +S+
Sbjct: 339 CGINMMASF 347
>gi|194352760|emb|CAQ00108.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
gi|326510977|dbj|BAJ91836.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326523875|dbj|BAJ96948.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326528631|dbj|BAJ97337.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 368
Score = 265 bits (678), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 137/327 (41%), Positives = 195/327 (59%), Gaps = 29/327 (8%)
Query: 48 EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNR--TYKLGTNQFSDLTNDEFR 105
+W A+H R+Y E+ RL+++ N+ YIE N + TY+LG ++DLT+DEF
Sbjct: 43 RRWKAEHSRTYATPEEERHRLRVYARNMRYIEATNGDAGAGLTYELGETAYTDLTSDEFT 102
Query: 106 ALYTGYKMPS-------PSHRSTTSSTFK-----------YQNLSMTDVPTSLDWRDKGA 147
A+YT P P TT + Y N S P S+DWR++GA
Sbjct: 103 AMYTSRAPPLSDDDDDLPMTMITTRAGPVAAAGGGGWLQVYVNES-AGAPASVDWRERGA 161
Query: 148 VTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKA 207
VT +KNQ +CG CWAF+ VA +EGI +I++G L LSEQ+L+DC ++GC GG +A
Sbjct: 162 VTAVKNQGQCGSCWAFSTVAVIEGIHQIKTGKLASLSEQELVDCD-KLDHGCNGGVSYRA 220
Query: 208 FAYIIQNQGIATEDEYPYQAVPGTCSAAQ-KPAAAKISNYEEVPSGDEQALLKAVSMQPV 266
+I N GI ++D+YPY A TC + AA IS ++ V + E +L AV+MQPV
Sbjct: 221 LQWITSNGGITSQDDYPYTAKDDTCDTKKLSHHAASISGFQRVATRSELSLTNAVAMQPV 280
Query: 267 SIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTE-DGANYWLIKNSWGNTWGDA 325
+++I A FQ Y+ G++NG CGT+L+H VT+VG+G E G +YW++KNSWG WGD
Sbjct: 281 AVSIEAGGANFQHYRNGVYNGPCGTRLNHGVTVVGYGEDEVTGESYWIVKNSWGEKWGDN 340
Query: 326 GYMK-----IVRDEGLCGIGTRSSYPL 347
GY++ I + EG+CGI R S+PL
Sbjct: 341 GYLRMKKGIIDKPEGICGIAIRPSFPL 367
>gi|15290195|dbj|BAB63884.1| putative cysteine protease [Oryza sativa Japonica Group]
gi|125525813|gb|EAY73927.1| hypothetical protein OsI_01811 [Oryza sativa Indica Group]
Length = 342
Score = 265 bits (678), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 139/340 (40%), Positives = 195/340 (57%), Gaps = 22/340 (6%)
Query: 18 MFIIITLLV--SCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENL 75
+ ++ TL+ + + + + + +++ E+WMA+ G++YK EKE R IF++N+
Sbjct: 12 LLVVCTLMALQAMGADAYYNNGSDDGVTMQMFEEWMAKFGKTYKCHGEKEHRFGIFRDNV 71
Query: 76 EYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD 135
+I + +G NQF+DLTNDEF A YTG K P P + +
Sbjct: 72 HFIRGYKPQVTYDSAVGINQFADLTNDEFVATYTGAKPPHPKEAP--------RPVDPIW 123
Query: 136 VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG 195
P +DWR +GAVT +K+Q CG CWAFAAVAA+EG+TKIR+G L LSEQ+L+DC TN
Sbjct: 124 TPCCIDWRFRGAVTGVKDQGACGSCWAFAAVAAIEGLTKIRTGQLTPLSEQELVDCDTN- 182
Query: 196 NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQK--PAAAKISNYEEVPSGD 253
+NGC GG ++AF + GI E +Y Y+ G C AA+I Y VP D
Sbjct: 183 SNGCGGGHTDRAFELVASKGGITAESDYRYEGFQGKCRVDDMLFNHAARIGGYRAVPPND 242
Query: 254 EQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGAN--- 310
E+ L AV+ QPV++ I A FQ YK G+F G CG +HAVT+VG+ +DGA+
Sbjct: 243 ERQLATAVARQPVTVYIDASGPAFQFYKSGVFPGPCGASSNHAVTLVGY--CQDGASGKK 300
Query: 311 YWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
YW+ KNSWG TWG GY+ + +D G CG+ YP
Sbjct: 301 YWVAKNSWGKTWGQQGYILLEKDVLQPHGTCGLAVSPFYP 340
>gi|159479072|ref|XP_001697622.1| cysteine endopeptidase [Chlamydomonas reinhardtii]
gi|158274232|gb|EDP00016.1| cysteine endopeptidase [Chlamydomonas reinhardtii]
Length = 469
Score = 265 bits (676), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 133/309 (43%), Positives = 193/309 (62%), Gaps = 10/309 (3%)
Query: 48 EKWMAQHGRSY-KDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRA 106
++W H RSY D E E R K++ ENLEY+ N ++ L N +DL+ E+++
Sbjct: 14 KEWAQTHSRSYVNDVAEFENRFKVWLENLEYVLAYNAR-TTSHWLTLNHLADLSTPEYKS 72
Query: 107 LYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAV 166
G+ + R+ + F+Y+++ +P ++DWR K AV +KNQ +CG CWAFA
Sbjct: 73 KLLGFDNQARVARNKLKTGFRYEDVDAEALPPAIDWRKKNAVAEVKNQGQCGSCWAFATT 132
Query: 167 AAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQ 226
+VEGI I +G+L+ LSEQ+L+DC T + GC GG + A+A+II+N+GI TE++YPY
Sbjct: 133 GSVEGINAIVTGSLVSLSEQELVDCDTEQDKGCSGGLMDYAYAWIIKNKGINTEEDYPYT 192
Query: 227 AVPGTCSAAQ-KPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIF 285
A+ G C A+ K I +YE+VP DE AL KA + QPV++AI A + FQ Y G++
Sbjct: 193 AMDGQCDVAKMKRRVVTIDSYEDVPENDEVALKKAAAHQPVAVAIEADAKSFQLYGGGVY 252
Query: 286 NG-VCGTQLDHAVTIVGFG--TTEDGANYWLIKNSWGNTWGDAGYMKI----VRDEGLCG 338
+ CGT L+H V +VG+G T G+NYW++KNSWG WGDAGY+++ EGLCG
Sbjct: 253 DDPTCGTSLNHGVLVVGYGKDVTGSGSNYWIVKNSWGAEWGDAGYIRLKMGSTDAEGLCG 312
Query: 339 IGTRSSYPL 347
I SYP+
Sbjct: 313 IAMAPSYPV 321
>gi|16304178|gb|AAL16954.1|AF426414_1 cathepsin L-like cysteine protease precursor [Delia radicum]
Length = 337
Score = 265 bits (676), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 139/313 (44%), Positives = 192/313 (61%), Gaps = 12/313 (3%)
Query: 46 IHEKWMA---QHGRSYKDELEKEMRLKIFKENLEYIEKANK---EGNRTYKLGTNQFSDL 99
I E+W +H +++ E+E+ R+KIF EN I K N+ +G ++KLG N++SD+
Sbjct: 23 IKEEWQTFKMEHRKNFLSEVEERFRMKIFNENRHKIAKHNQLYAQGKVSFKLGLNKYSDM 82
Query: 100 TNDEFRALYTGYKMP-SPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECG 158
EF+ GY R+ S Y + +P S+DWR GAVT +K+Q CG
Sbjct: 83 LYHEFKETMNGYNHTMRKVLRAQGFSGIIYIPPANVQIPKSVDWRQHGAVTAVKDQGHCG 142
Query: 159 CCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGI 217
CWAF++ AA+EG ++G L+ LSEQ L+DCST GNNGC GG + AF YI N GI
Sbjct: 143 SCWAFSSTAALEGQHFRKAGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGGI 202
Query: 218 ATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTE 276
TE YPY+ + +C + A + + ++P GDE+AL+KAV +M PVS+AI A
Sbjct: 203 DTEKSYPYEGIDDSCHFTKSGVGATDTGFVDIPQGDEEALMKAVATMGPVSVAIDASHES 262
Query: 277 FQSYKEGIFNGV-CGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD- 333
FQ Y EG++N C Q LDH V +VG+GT + G +YWL+KNSWG TWGD GY+K+ R+
Sbjct: 263 FQLYSEGVYNEPECDAQNLDHGVLVVGYGTDKTGLDYWLVKNSWGTTWGDQGYIKMARNQ 322
Query: 334 EGLCGIGTRSSYP 346
+ CGI T SSYP
Sbjct: 323 DNQCGIATASSYP 335
>gi|359359068|gb|AEV40975.1| putative cysteine protease [Oryza punctata]
Length = 464
Score = 265 bits (676), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 144/322 (44%), Positives = 207/322 (64%), Gaps = 18/322 (5%)
Query: 40 EQSVVEIHEKWMAQH---GRSYKDEL-EKEMRLKIFKENLEYIE--KANKEGNRTYKLGT 93
E +++ W+A+H G S+ + E E R ++F +NL++++ A+ +G+ ++LG
Sbjct: 59 EAEARAVYDLWVARHRHGGGSHNGFVGEYERRFRVFWDNLKFVDAHNAHADGHGGFRLGM 118
Query: 94 NQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAV-TPIK 152
N+F+DLTNDEFRA Y G +P+ R Y++ + +P S+DWRDKGAV +P+K
Sbjct: 119 NRFADLTNDEFRAAYLG---TTPAGRGRHVGEM-YRHDGVEALPDSVDWRDKGAVVSPVK 174
Query: 153 NQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGS-REKAFAYI 211
NQ +CG CWAF+AVAAVEGI KI +G L+ LSEQ+L++C+ NG N G + AFA+I
Sbjct: 175 NQGQCGSCWAFSAVAAVEGINKIVTGELVSLSEQELVECARNGGNSGCNGGIMDDAFAFI 234
Query: 212 IQNQGIATEDEYPYQAVPGTCSAAQKP-AAAKISNYEEVPSGDEQALLKAVSMQPVSIAI 270
+N G+ TE++YPY A+ G C A+K I +E+VP DE +L KAV+ QPVS+AI
Sbjct: 235 TRNGGLDTEEDYPYTAMDGKCDLAKKSRKVVSIDGFEDVPENDELSLQKAVAHQPVSVAI 294
Query: 271 AAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGT-TEDGANYWLIKNSWGNTWGDAGYMK 329
A EFQ Y G+F G CGT LDH V VG+GT G +YW ++NSWG WG+ GY++
Sbjct: 295 DAGGREFQLYDSGVFTGRCGTSLDHGVVAVGYGTDAATGTDYWTVRNSWGPDWGENGYIR 354
Query: 330 IVRD----EGLCGIGTRSSYPL 347
+ R+ G CGI +SYP+
Sbjct: 355 MERNVTARTGKCGIAMMASYPI 376
>gi|125570286|gb|EAZ11801.1| hypothetical protein OsJ_01675 [Oryza sativa Japonica Group]
Length = 319
Score = 265 bits (676), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 140/327 (42%), Positives = 188/327 (57%), Gaps = 20/327 (6%)
Query: 29 ASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRT 88
A+ + + + +++ E+WMA+ G++YK EKE R IF++N+ +I +
Sbjct: 2 AASAYYNNGSDDGVTMQMFEEWMAKFGKTYKCHGEKEHRFGIFRDNVHFIRGYKPQVTYD 61
Query: 89 YKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAV 148
+G NQF+DLTNDEF A YTG K P P + + P +DWR +GAV
Sbjct: 62 SAVGINQFADLTNDEFVATYTGAKPPHPKEAP--------RPVDPIWTPCCIDWRFRGAV 113
Query: 149 TPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAF 208
T +K+Q CG CWAFAAVAA+EG+TKIR+G L LSEQ+L+DC TN +NGC GG ++AF
Sbjct: 114 TGVKDQGACGSCWAFAAVAAIEGLTKIRTGQLTPLSEQELVDCDTN-SNGCGGGHTDRAF 172
Query: 209 AYIIQNQGIATEDEYPYQAVPGTCSAAQK--PAAAKISNYEEVPSGDEQALLKAVSMQPV 266
+ GI E +Y Y+ G C AA I Y VP DE+ L AV+ QPV
Sbjct: 173 ELVASKGGITAESDYRYEGFQGKCRVDDMLFNHAASIGGYRAVPPNDERQLATAVARQPV 232
Query: 267 SIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGAN---YWLIKNSWGNTWG 323
++ I A FQ YK G+F G CG +HAVT+VG+ +DGA+ YWL KNSWG TWG
Sbjct: 233 TVYIDASGPAFQFYKSGVFPGPCGASSNHAVTLVGY--CQDGASGKKYWLAKNSWGKTWG 290
Query: 324 DAGYM----KIVRDEGLCGIGTRSSYP 346
GY+ IV+ G CG+ YP
Sbjct: 291 QQGYILLEKDIVQPHGTCGLAVSPFYP 317
>gi|49387634|dbj|BAD25828.1| putative cysteine proteinase [Oryza sativa Japonica Group]
gi|49388888|dbj|BAD26098.1| putative cysteine proteinase [Oryza sativa Japonica Group]
Length = 358
Score = 264 bits (675), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 143/356 (40%), Positives = 209/356 (58%), Gaps = 31/356 (8%)
Query: 16 TPMFIIITLLVSC----ASQVVSSRSTH-------EQSVVEIHEKWMAQHGRSYKDELEK 64
+P + + LL SC A+ ++ +R+T + +++ W H RSY E
Sbjct: 5 SPPVLTLALLASCGALLATSMLPARATAGSCLDVGDMVMMDRFRAWQGAHNRSYPSAEEA 64
Query: 65 EMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKM---PSPSHRST 121
R +++ N E+I+ N G+ TY+L N+F+DLT +EF A YTGY P T
Sbjct: 65 LQRFDVYRRNAEFIDAVNLRGDLTYQLAENEFADLTEEEFLATYTGYYAGDGPVDDSVIT 124
Query: 122 T-----SSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKE-CGCCWAFAAVAAVEGITKI 175
T ++F Y+ DVP S+DWR +GAV P K+Q C CWAF A +E + I
Sbjct: 125 TGAGDVDASFSYR----VDVPASVDWRAQGAVVPPKSQTSTCSSCWAFVTAATIESLNMI 180
Query: 176 RSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAA 235
++G L+ LSEQQL+DC + + GC GS +A+ ++++N G+ TE +YPY A G C+ A
Sbjct: 181 KTGKLVSLSEQQLVDCDSY-DGGCNLGSYGRAYKWVVENGGLTTEADYPYTARRGPCNRA 239
Query: 236 QKP-AAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLD 294
+ AAKI+ + +VP +E AL AV+ QPV++AI + Q YK G++ G CGT+L
Sbjct: 240 KSAHHAAKITGFGKVPPRNEAALQAAVARQPVAVAIEV-GSGMQFYKGGVYTGPCGTRLA 298
Query: 295 HAVTIVGFGT-TEDGANYWLIKNSWGNTWGDAGYMKIVRD---EGLCGIGTRSSYP 346
HAVT+VG+GT GA YW IKNSWG +WG+ GY++I+RD GLCG+ +YP
Sbjct: 299 HAVTVVGYGTDASSGAKYWTIKNSWGQSWGERGYIRILRDVGGPGLCGVTLDIAYP 354
>gi|195429415|ref|XP_002062758.1| GK19626 [Drosophila willistoni]
gi|194158843|gb|EDW73744.1| GK19626 [Drosophila willistoni]
Length = 341
Score = 264 bits (675), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 135/340 (39%), Positives = 203/340 (59%), Gaps = 13/340 (3%)
Query: 19 FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
F +ITLL++ + ++ ++ + V E + +H ++Y D E+ R+KIF EN +I
Sbjct: 3 FALITLLIALVA--MTQAVSYSELVREEWNTFKLEHRKNYADSTEETFRMKIFNENKHHI 60
Query: 79 EKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFK---YQNLS 132
K N+ G +YKL N+++D+ + EFR G+ +T +F + +
Sbjct: 61 AKHNQRYATGEVSYKLALNKYADMLHHEFRETMNGFNYTLHKQLRSTDESFTGVTFISPE 120
Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
+PT++DWR KGAVT +K+Q CG CWAF++ A+EG +SG L+ LSEQ L+DCS
Sbjct: 121 HVKLPTAVDWRTKGAVTEVKDQGHCGSCWAFSSTGAIEGQHFRKSGTLVSLSEQNLVDCS 180
Query: 193 TN-GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPS 251
T GNNGC GG + AF Y+ N GI TE Y Y+ + +C + A + ++P
Sbjct: 181 TKYGNNGCNGGLMDNAFRYVKDNGGIDTEKSYAYEGIDDSCHFDKNSIGATDRGFADIPQ 240
Query: 252 GDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGIFN--GVCGTQLDHAVTIVGFGTTEDG 308
G+E+ L +AV ++ PVS+AI A FQ Y EG+++ LDH V +VG+GT +DG
Sbjct: 241 GNEKKLAQAVATIGPVSVAIDASQQSFQFYSEGVYDEPNCSAENLDHGVLVVGYGTEKDG 300
Query: 309 ANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
++YWL+KNSWG TWGD G++K+ R+ E CGI + SSYPL
Sbjct: 301 SDYWLVKNSWGTTWGDKGFIKMSRNKENQCGIASASSYPL 340
>gi|194352764|emb|CAQ00110.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
Length = 406
Score = 264 bits (675), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 147/347 (42%), Positives = 200/347 (57%), Gaps = 41/347 (11%)
Query: 39 HEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEG---NRTYKLGTNQ 95
H+ +++ WM H RSY EK R ++++ N+ +IE N E TY+LG
Sbjct: 55 HQDLMMDRFHVWMTVHNRSYSTAGEKARRFEVYRSNMRFIEAVNAEAATSGLTYELGEGP 114
Query: 96 FSDLTNDEFRALYTGYKMP-------------------SPSHRSTTSSTFKYQNLSMTDV 136
F+DLTN+EF LYTG + S T Y N S +
Sbjct: 115 FTDLTNEEFMELYTGQILEDDQSEDGDDDEQIITTHAGSIDGLGTHKGATVYANFSAS-A 173
Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGN 196
PTS+DWR +G VTP+KNQK+CG CWAF VA +EGI KI+ G L+ LSEQQL+DC +
Sbjct: 174 PTSIDWRKRGVVTPVKNQKQCGSCWAFPTVATIEGIHKIKRGTLVSLSEQQLIDCDYL-D 232
Query: 197 NGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQA 256
NGC GG +AF +I +N GI + Y Y+AV G C +KP AAKI + +V S E +
Sbjct: 233 NGCKGGLVTRAFQWIKKNGGITSTSSYKYKAVRGRCLRNRKP-AAKIVGFRKVKSNSEVS 291
Query: 257 LLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCG-TQLDHAVTIVGFGTTED-------- 307
L+ AV+ QPV+++I+++S+ F YK GI+NG C T+L+HAVT+VG+G +
Sbjct: 292 LMNAVANQPVAVSISSHSSHFHHYKGGIYNGPCSTTKLNHAVTVVGYGQQQQNGADSVHA 351
Query: 308 ---GANYWLIKNSWGNTWGDAGYMKIVR----DEGLCGIGTRSSYPL 347
GA YW++KNSWG TWGD GY+ + R G CGI TR +PL
Sbjct: 352 SAPGAKYWIVKNSWGTTWGDKGYILMKRGTKHSSGQCGIATRPVFPL 398
>gi|302845628|ref|XP_002954352.1| hypothetical protein VOLCADRAFT_76255 [Volvox carteri f.
nagariensis]
gi|300260282|gb|EFJ44502.1| hypothetical protein VOLCADRAFT_76255 [Volvox carteri f.
nagariensis]
Length = 489
Score = 264 bits (675), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 143/348 (41%), Positives = 206/348 (59%), Gaps = 22/348 (6%)
Query: 19 FIIITLLVSCASQVVSS-----RSTHEQSVVEIH-------EKWMAQHGRSYKDEL-EKE 65
F+I LLV+ + V ++ R HE+ +++ ++WM Q+ ++Y +++ E E
Sbjct: 5 FLIAALLVAASGGVGAAPELQLREQHEKLLLDAKANPMAAFQQWMMQYTKAYANDIKELE 64
Query: 66 MRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFR-ALYTGYKMPSPSHRSTTSS 124
R ++ ENL YI N ++ L N F+DLT DEFR L +K S+R SS
Sbjct: 65 TRFSVWLENLNYILAYNAR-TTSHWLHLNAFADLTTDEFRNRLGYDFKARQASNR-LQSS 122
Query: 125 TFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLS 184
F Y N+ +PT +DWR KGAVT +KNQ +CG CWAFA +VEGI I +G L LS
Sbjct: 123 PFIYDNVDANQLPTEIDWRKKGAVTEVKNQGQCGSCWAFATTGSVEGINAIVTGELASLS 182
Query: 185 EQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQK-PAAAKI 243
EQ+L+DC T+ + GC GG + A+ +II+N G+ TED+YPY A G C AA+K I
Sbjct: 183 EQELVDCDTDEDRGCSGGLMDYAYQWIIKNGGLDTEDDYPYTAEDGVCVAAKKNRRVVTI 242
Query: 244 SNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNG-VCGTQLDHAVTIVGF 302
Y ++P DE AL KA + QP+++AI A + FQ Y G+++ CGT L+H V +VG+
Sbjct: 243 DGYVDIPENDEVALKKAAAHQPIAVAIEADAKSFQLYGGGVYDDPTCGTSLNHGVLVVGY 302
Query: 303 GTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
G NYW++KNSWG WGD GY+++ +G+CGI S+P
Sbjct: 303 GKDPHFGNYWIVKNSWGPEWGDNGYIRLRMGAEDVQGMCGIAMAPSFP 350
>gi|326520387|dbj|BAK07452.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 349
Score = 264 bits (675), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 144/346 (41%), Positives = 201/346 (58%), Gaps = 18/346 (5%)
Query: 14 NTTPMFIIITLLVSCASQVVSSRS-----THEQSVVEIHEKWMAQHGRSYKDELEKEMRL 68
T P+ I+ LL + S + +++ +W A H RSY E+ R
Sbjct: 7 GTRPVIPILVLLTGGLFAAFPAASGGRVDAGDMLMMDRFRQWQATHNRSYLSAEERLRRF 66
Query: 69 KIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALY----TGYKMPSPSHRSTTSS 124
++++ N+EYI+ N+ G TY+LG NQF+DLT +EF A Y TG + + + S
Sbjct: 67 EVYRTNVEYIDATNRRGGLTYELGENQFADLTGEEFLARYAGGHTGSAITTAAEADGLWS 126
Query: 125 TFKYQNLSMTDVPTSLDWRDKGAVTPIKNQ-KECGCCWAFAAVAAVEGITKIRSGNLIQL 183
+ D P S+DWR KGAVTP+KNQ +C CWAF+AVA +E + I++G L+ L
Sbjct: 127 SGGSDGSLEADPPASVDWRAKGAVTPVKNQGSQCYSCWAFSAVATMESLYFIKTGKLVAL 186
Query: 184 SEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKI 243
SEQQL+DC + GC G +AF +I++N GI T +YPY+AV G CSAA KPA I
Sbjct: 187 SEQQLVDCDKY-DGGCNKGYYHRAFQWIMENGGITTAAQYPYKAVRGACSAA-KPAV-TI 243
Query: 244 SNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFG 303
+ + V +E AL AV+ QP+ +AI Q YK G+F+ CG Q+ HAV VG+G
Sbjct: 244 TGHLAVAK-NELALQSAVARQPIGVAIEV-PISMQFYKSGVFSAACGIQMSHAVVTVGYG 301
Query: 304 TTEDGANYWLIKNSWGNTWGDAGYMKIVRD---EGLCGIGTRSSYP 346
G YWL+KNSWG TWG+AGY+++ RD GLCGI ++YP
Sbjct: 302 ADASGLKYWLVKNSWGQTWGEAGYIRMRRDVGGGGLCGIALDTAYP 347
>gi|307110445|gb|EFN58681.1| hypothetical protein CHLNCDRAFT_56822 [Chlorella variabilis]
Length = 466
Score = 264 bits (675), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 138/347 (39%), Positives = 201/347 (57%), Gaps = 21/347 (6%)
Query: 18 MFIIITLLVSCASQVVSSRSTHE----------QSVVEIHEKWMAQHGRSYKDELEKEMR 67
M + LLV+C+ V++ E +S E + W+ R+Y E E R
Sbjct: 1 MRLSCVLLVACSCLAVAAGFPFENHRLFIQQAVESPREAFDFWVQTLKRAYASAEEYERR 60
Query: 68 LKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFK 127
++ +NL ++ + N G+ ++ L ++DL+ DE+R+ GY R ++ F
Sbjct: 61 FDVWLDNLRFVHEYNA-GHTSHWLSMGVYADLSQDEYRSKALGYNADLHEERPLRAAPFL 119
Query: 128 YQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQ 187
Y+ T P +DW KGAVTP+KNQ CG CWAF+ AVEG + I +G L LSEQ
Sbjct: 120 YEG---TVPPKEVDWVAKGAVTPVKNQLLCGSCWAFSTTGAVEGASAIATGKLASLSEQM 176
Query: 188 LLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQ-KPAAAKISNY 246
L+DC +NGC GG + AF +I++N GI TED+YPY A G C + + I +Y
Sbjct: 177 LVDCDRERDNGCHGGLMDFAFEFIMKNGGIDTEDDYPYTAEEGMCQDNKMRRHVVTIDDY 236
Query: 247 EEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTE 306
++VP DE AL+KAV+ QPVS+AI A FQ Y G+F+ CGT LDH V +VG+GT
Sbjct: 237 QDVPPNDEHALMKAVANQPVSVAIEADQRAFQLYGGGVFDAECGTALDHGVLVVGYGTAS 296
Query: 307 DGAN---YWLIKNSWGNTWGDAGYMKIVR---DEGLCGIGTRSSYPL 347
+G + YWL+KNSWG WGD GY++++R +EG CG+ ++S+P+
Sbjct: 297 NGTHHLPYWLVKNSWGAEWGDKGYIRLLRNLGEEGQCGVAMQASFPI 343
>gi|390368662|ref|XP_780781.2| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
Length = 333
Score = 264 bits (674), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 142/336 (42%), Positives = 209/336 (62%), Gaps = 13/336 (3%)
Query: 19 FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
++ + L+ C VVSS S E +W +HG+ Y + E+ R I+++NL+ +
Sbjct: 3 YLSVLLVAVC---VVSSLSMSFTDFDEDWNQWKNEHGKRYLSDEEEASRKLIWEKNLDIV 59
Query: 79 EKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD 135
K N + G+ TY LG NQF+DL N+EF A+ TG+++ S ++ STF N ++
Sbjct: 60 IKHNLKYDLGHFTYALGMNQFADLQNEEFVAMMTGFRVNGTS-KAAKGSTFLPSN-NVDK 117
Query: 136 VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG 195
+P ++DWR KG VTP+K+Q +CG CWAF+A ++EG ++G L+ LSEQ L+DCS
Sbjct: 118 LPKTVDWRTKGYVTPVKDQGQCGSCWAFSATGSLEGQQFKKTGKLVSLSEQNLVDCSYR- 176
Query: 196 NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQ 255
N GC GG ++AF YII GI TE Y Y+AV G C + A ++ Y +V SG E+
Sbjct: 177 NYGCHGGFMDRAFQYIIDAGGIDTEATYSYRAVDGNCHFKKANVGATVTGYTDVTSGSEK 236
Query: 256 ALLKAVS-MQPVSIAIAAYSTEFQSYKEGIFN--GVCGTQLDHAVTIVGFGTTEDGANYW 312
AL KAV+ + P+S+AI A F+ YK G++N G T+L HAV +VG+GTT DG +YW
Sbjct: 237 ALQKAVAHIGPISVAIDASHKFFKFYKSGVYNEPGCSTTRLGHAVLVVGYGTTSDGTDYW 296
Query: 313 LIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
++KNSW TWG GY+ + R+ + CGI + +SYP+
Sbjct: 297 IVKNSWAKTWGMNGYLWMSRNKDNQCGIASEASYPM 332
>gi|194320502|gb|ACF48469.1| cathepsin L [Triatoma brasiliensis]
Length = 330
Score = 264 bits (674), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 135/311 (43%), Positives = 198/311 (63%), Gaps = 17/311 (5%)
Query: 48 EKWM---AQHGRSYKDELEKEMRLKIFKENLEYIEKAN---KEGNRTYKLGTNQFSDLTN 101
E+W A HG++YK++ E+ R+KIF +N + IE N ++G +YK+ N F DL
Sbjct: 25 EEWHVFKAMHGKTYKNQFEEMFRMKIFMDNKKKIEAHNAKYEQGEVSYKMMMNHFGDLMV 84
Query: 102 DEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCW 161
EF+AL G+KM SP + F S +++P ++DWR KGAVTP+K+Q +CG CW
Sbjct: 85 HEFKALMNGFKM-SPDTKRNGELYFP----SNSNLPKTVDWRQKGAVTPVKDQGQCGSCW 139
Query: 162 AFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIATE 220
+F+A ++EG +++G L+ LSEQ L+DCST+ GNNGC GG ++AF Y+ N+GI TE
Sbjct: 140 SFSATGSLEGQVFLKTGKLVSLSEQNLVDCSTSYGNNGCEGGLMDQAFQYVSDNKGIDTE 199
Query: 221 DEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQS 279
YPY+A TC + + ++P+GDE+AL A+ ++ P+S+AI A FQ
Sbjct: 200 ASYPYEARENTCRFKKNKVGGTDKGHVDIPAGDEKALQNALATVGPISVAIDANHGSFQF 259
Query: 280 YKEGIFN--GVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE-GL 336
Y +G++N LDH V VG+G TE+G +YWL+KNSWG +WG+ GY+KI R+
Sbjct: 260 YSKGVYNEPNCSSYDLDHGVLAVGYG-TENGQDYWLVKNSWGPSWGENGYIKIARNHSNH 318
Query: 337 CGIGTRSSYPL 347
CGI + +SYPL
Sbjct: 319 CGIASMASYPL 329
>gi|390337645|ref|XP_001199228.2| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
Length = 333
Score = 264 bits (674), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 143/337 (42%), Positives = 209/337 (62%), Gaps = 13/337 (3%)
Query: 19 FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
++ + L+ C VVSS S E ++W +HG+ Y + E+ R I+++NL+ +
Sbjct: 3 YLSVLLVAVC---VVSSLSMSFTDFDEDWKEWKNEHGKRYLSDEEEASRRLIWQKNLDIV 59
Query: 79 EKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD 135
+ N + G+ TY LG NQF+DL N EF A+ TG+++ S ++ STF N ++
Sbjct: 60 IRHNLKYDLGHFTYDLGMNQFADLQNKEFVAMMTGFRVNGTS-KAAKGSTFLPPN-NVGK 117
Query: 136 VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG 195
+P ++DWR KG VTP+K+Q +CG CWAF+A ++EG ++G L+ LSEQ L+DCS +
Sbjct: 118 LPKTVDWRTKGYVTPVKDQGQCGSCWAFSATGSLEGQHFKKTGKLVSLSEQNLVDCS-DK 176
Query: 196 NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQ 255
N GC GG ++AF YII GI TE+ YPY A+ G C A ++ Y +V SG E+
Sbjct: 177 NYGCNGGLMDRAFQYIIDAGGIDTEESYPYIAMDGNCHFKTANVGATVTGYTDVTSGSEK 236
Query: 256 ALLKAVS-MQPVSIAIAAYSTEFQSYKEGIFN--GVCGTQLDHAVTIVGFGTTEDGANYW 312
AL KAV+ + P+S+AI A FQ Y+ G++N G T LDH V VG+GTT DG +YW
Sbjct: 237 ALQKAVAHIGPISVAIDASHFSFQLYQSGVYNEPGCSSTLLDHGVLAVGYGTTIDGTDYW 296
Query: 313 LIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPLA 348
++KNSW TWG GY+ + R+ + CGI T++SYPL
Sbjct: 297 IVKNSWAETWGMNGYIWMSRNKDNQCGIATQASYPLV 333
>gi|226499884|ref|NP_001148278.1| thiol protease SEN102 precursor [Zea mays]
gi|195617112|gb|ACG30386.1| thiol protease SEN102 precursor [Zea mays]
Length = 374
Score = 264 bits (674), Expect = 6e-68, Method: Compositional matrix adjust.
Identities = 143/344 (41%), Positives = 199/344 (57%), Gaps = 27/344 (7%)
Query: 29 ASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNR- 87
A + S S + S++E ++W A + +SY E+ R +++ N+ YIE N E
Sbjct: 32 AGDTMGSMSNDDSSMIERFQRWKAAYNKSYATVAEERRRFRVYARNMAYIEATNAEAEAA 91
Query: 88 --TYKLGTNQFSDLTNDEFRALYTGYKMPS-PSHRSTTSSTFK--------------YQN 130
TY+LG ++DLTN EF A+YT + P+ S ++ Y N
Sbjct: 92 GLTYELGETAYTDLTNQEFMAMYTAPALAQLPADESVITTRAGPVDAVGGAPGQLPVYVN 151
Query: 131 LSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLD 190
LS + P S+DWR GAVTP+KNQ CG CWAF+ VA VEGI +IR+G L+ LSEQ+L+D
Sbjct: 152 LSAS-APASVDWRASGAVTPVKNQGRCGSCWAFSTVAVVEGIYQIRTGKLVSLSEQELVD 210
Query: 191 CSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQ-KPAAAKISNYEEV 249
C T ++GC GG +A +I N GI TE +YPY C+ A+ A I+ V
Sbjct: 211 CDTL-DDGCDGGISYRALRWIASNGGITTEADYPYTGTTDACNRAKLSHNAVSIAGLRRV 269
Query: 250 PSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGT-TEDG 308
+ E +L AV+ QPV+++I A FQ YK+G++NG CGT L+H VT+VG+G G
Sbjct: 270 ATRSEASLANAVAGQPVAVSIEAGGDNFQHYKKGVYNGPCGTNLNHGVTVVGYGQEAAAG 329
Query: 309 ANYWLIKNSWGNTWGDAGYMKIVRD-----EGLCGIGTRSSYPL 347
YW++KNSWG WGD GY+++ +D EGLCGI R SYPL
Sbjct: 330 DRYWIVKNSWGQGWGDDGYIRMKKDVAGKPEGLCGIAIRPSYPL 373
>gi|242048430|ref|XP_002461961.1| hypothetical protein SORBIDRAFT_02g011230 [Sorghum bicolor]
gi|241925338|gb|EER98482.1| hypothetical protein SORBIDRAFT_02g011230 [Sorghum bicolor]
Length = 380
Score = 263 bits (673), Expect = 8e-68, Method: Compositional matrix adjust.
Identities = 145/346 (41%), Positives = 195/346 (56%), Gaps = 35/346 (10%)
Query: 33 VSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNR---TY 89
+ S + ++E ++W A + +SY E R ++ N+ YIE N E TY
Sbjct: 38 MGSSTDDNSPMIERFQRWKAAYNKSYATVAEDRRRFLVYARNMAYIEATNAEAEAAGLTY 97
Query: 90 KLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFK---------------------Y 128
+LG ++DLTN EF A+YT PSP+ Y
Sbjct: 98 ELGETAYTDLTNQEFMAMYTA--APSPAQLPADEDEDDAAEAVITTRAGPVDAVGQLPVY 155
Query: 129 QNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQL 188
NLS T P S+DWR GAVTP+KNQ CG CWAF+ VA VEGI +IR+G L+ LSEQ+L
Sbjct: 156 VNLS-TAAPASVDWRASGAVTPVKNQGRCGSCWAFSTVAVVEGIYQIRTGKLVSLSEQEL 214
Query: 189 LDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQ-KPAAAKISNYE 247
+DC T + GC GG +A +I N G+ TE++YPY C+ A+ AA I+
Sbjct: 215 VDCDTL-DAGCDGGISYRALRWITSNGGLTTEEDYPYTGTTDACNRAKLAHNAASIAGLR 273
Query: 248 EVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGT-TE 306
V + E +L AV+ QPV+++I A FQ YK G++NG CGT L+H VT+VG+G E
Sbjct: 274 RVATRSEASLANAVAGQPVAVSIEAGGDNFQHYKRGVYNGPCGTSLNHGVTVVGYGQEEE 333
Query: 307 DGANYWLIKNSWGNTWGDAGYMKIVRD-----EGLCGIGTRSSYPL 347
DG YW+IKNSWG +WGD GY+K+ +D EGLCGI R S+PL
Sbjct: 334 DGDKYWIIKNSWGASWGDGGYIKMRKDVAGKPEGLCGIAIRPSFPL 379
>gi|297830594|ref|XP_002883179.1| hypothetical protein ARALYDRAFT_318695 [Arabidopsis lyrata subsp.
lyrata]
gi|297329019|gb|EFH59438.1| hypothetical protein ARALYDRAFT_318695 [Arabidopsis lyrata subsp.
lyrata]
Length = 308
Score = 263 bits (672), Expect = 9e-68, Method: Compositional matrix adjust.
Identities = 135/309 (43%), Positives = 192/309 (62%), Gaps = 24/309 (7%)
Query: 46 IHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFR 105
++E+W+ ++ ++Y EKE R KIFKENL++I++ N N+T+++G +F+DLTNDE +
Sbjct: 1 MYERWLVENRKNYNGLGEKERRCKIFKENLKFIDEHNSLPNQTFEVGLTRFADLTNDEPK 60
Query: 106 ALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAA 165
+ + Y+ + +P +DWR KGAV P+K+Q CG CWAF+A
Sbjct: 61 DF-------------MKADRYLYKEGDI--LPDEIDWRAKGAVVPVKDQGNCGSCWAFSA 105
Query: 166 VAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFAYIIQNQGIATEDEYP 224
V AVEGI +I++G LI LS+Q+L+DC N GC GG AF +II N GI ++ +YP
Sbjct: 106 VGAVEGINQIKTGELISLSDQELIDCDRGFVNAGCEGGVMNYAFEFIINNGGIESDQDYP 165
Query: 225 YQAVP-GTCSAAQK--PAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYK 281
Y A G C+A +K KI YE V DE++L KAV+ QPV +AI A S F+ YK
Sbjct: 166 YTATDLGVCNADKKNNTRVVKIDGYEYVAQNDEKSLKKAVAHQPVGVAIEASSQAFKLYK 225
Query: 282 EGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLC 337
G+F G CG LDH V +VG+GT+ G +YW+I+NSWG WG+ GY+K+ R+ G C
Sbjct: 226 SGVFTGTCGIYLDHGVVVVGYGTSS-GEDYWIIRNSWGLNWGENGYVKLQRNIDDSFGKC 284
Query: 338 GIGTRSSYP 346
G+ SYP
Sbjct: 285 GVAMMPSYP 293
>gi|195056367|ref|XP_001995082.1| GH22826 [Drosophila grimshawi]
gi|193899288|gb|EDV98154.1| GH22826 [Drosophila grimshawi]
Length = 340
Score = 263 bits (672), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 141/343 (41%), Positives = 199/343 (58%), Gaps = 21/343 (6%)
Query: 19 FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMA---QHGRSYKDELEKEMRLKIFKENL 75
+I L + +Q VS I E+W +H + Y+DE E+ RLKIF EN
Sbjct: 4 YIFALLALVAVAQAVSFADV-------IKEEWQTFKLEHRKQYQDETEERFRLKIFNENK 56
Query: 76 EYIEKANK---EGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFK---YQ 129
I K N+ G ++K+G N+++D+ + EF G+ + +TF +
Sbjct: 57 HKIAKHNQLYAAGEVSFKMGLNKYADMLHHEFHETMNGFNYTLHKQLRASDATFTGVTFI 116
Query: 130 NLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLL 189
+ +P S+DWR+KGAVT +K+Q CG CWAF++ A+EG ++G LI LSEQ L+
Sbjct: 117 SPEHVKLPQSVDWRNKGAVTGVKDQGHCGSCWAFSSTGALEGQHFRKTGTLISLSEQNLV 176
Query: 190 DCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEE 248
DCST GNNGC GG + AF YI N GI TE YPY+ + +C + A + +
Sbjct: 177 DCSTKYGNNGCNGGLMDNAFRYIKDNGGIDTEKSYPYEGIDDSCHFNKGTIGATDRGFTD 236
Query: 249 VPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGIFN-GVCGTQ-LDHAVTIVGFGTT 305
+P GDE+ L +AV ++ PVS+AI A FQ Y G+++ C Q LDH V +VG+GT
Sbjct: 237 IPQGDEKKLAQAVATIGPVSVAIDASHESFQFYSTGVYDEPQCDPQNLDHGVLVVGYGTD 296
Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVR-DEGLCGIGTRSSYPL 347
E+G +YWL+KNSWG TWGD G++K+ R D+ CGI T SSYPL
Sbjct: 297 ENGKDYWLVKNSWGTTWGDKGFIKMARNDDNQCGIATASSYPL 339
>gi|195124431|ref|XP_002006696.1| GI21205 [Drosophila mojavensis]
gi|193911764|gb|EDW10631.1| GI21205 [Drosophila mojavensis]
Length = 339
Score = 263 bits (671), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 140/318 (44%), Positives = 195/318 (61%), Gaps = 18/318 (5%)
Query: 46 IHEKWMA---QHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDL 99
I E+W +H ++Y+DE E+ RLKIF EN I K N+ G T+K+ N+++D+
Sbjct: 23 IKEEWHTFKLEHRKTYQDETEERFRLKIFNENKHKIAKHNQRYATGEVTFKMAVNKYADM 82
Query: 100 TNDEFRAL-----YTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQ 154
+ EFR YT +K S S T TF + + +P S+DWR+KGAVT +K+Q
Sbjct: 83 LHHEFRETMNGFNYTLHKELRASDPSFTGITF--ISPAHVKLPKSVDWREKGAVTAVKDQ 140
Query: 155 KECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQ 213
CG CWAF++ A+EG ++G L+ LSEQ L+DCS GNNGC GG + AF YI
Sbjct: 141 GHCGSCWAFSSTGALEGQHFRKTGTLVSLSEQNLVDCSAKYGNNGCNGGLMDNAFRYIKD 200
Query: 214 NQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAA 272
N GI TE YPY+ + +C + A + ++P G+E+ + +AV ++ PVS+AI A
Sbjct: 201 NGGIDTEKSYPYEGIDDSCHFNKDSVGATDRGFADIPQGNEKKMAEAVATIGPVSVAIDA 260
Query: 273 YSTEFQSYKEGIFNGV-CGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKI 330
FQ Y EGI+N C +Q LDH V +VG+GT E G +YWL+KNSWG TWGD G++K+
Sbjct: 261 SHESFQFYSEGIYNEPECNSQNLDHGVLVVGYGTDESGKDYWLVKNSWGTTWGDKGFIKM 320
Query: 331 VRDE-GLCGIGTRSSYPL 347
R+E CGI + SSYPL
Sbjct: 321 ARNEDNQCGIASASSYPL 338
>gi|357133074|ref|XP_003568153.1| PREDICTED: cysteine proteinase RD21a-like [Brachypodium distachyon]
Length = 565
Score = 263 bits (671), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 133/314 (42%), Positives = 191/314 (60%), Gaps = 15/314 (4%)
Query: 46 IHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNR--------TYKLGTNQFS 97
+ E W A+HG++Y E+ RL F +N ++ N G +Y L N F+
Sbjct: 41 LFEAWCAEHGKAYASPGERAARLAAFADNAAFVAAHNAGGGGAGGSNAAPSYTLALNAFA 100
Query: 98 DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKEC 157
DLT+ EFRA G ++ R+ S ++ + VP +LDWR GAVT +K+Q C
Sbjct: 101 DLTHAEFRAARLG-RLAVGGARAPPSEGGFAGSVGVGAVPEALDWRQSGAVTKVKDQGSC 159
Query: 158 GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
G CW+F+A A+EGI KI++G+LI LSEQ+L+DC + N GC GG + A+ ++I+N GI
Sbjct: 160 GACWSFSATGAIEGINKIKTGSLISLSEQELIDCDRSYNAGCGGGLMDYAYRFVIKNGGI 219
Query: 218 ATEDEYPYQAVPGTCSAAQ-KPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTE 276
TED+YPY+ GTC+ + K I Y +VP+ E +LL+AV+ QP+S+ I +
Sbjct: 220 DTEDDYPYREADGTCNKNKLKRHVVTIDGYSDVPANKEDSLLQAVAQQPISVGICGSARA 279
Query: 277 FQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD--- 333
FQ Y +GIF+G C T LDHAV IVG+G +E G +YW++KNSWG WG GYM + R+
Sbjct: 280 FQLYSQGIFDGPCPTSLDHAVLIVGYG-SEGGKDYWIVKNSWGERWGMKGYMHMHRNTGS 338
Query: 334 -EGLCGIGTRSSYP 346
G+CGI +S+P
Sbjct: 339 SSGICGINMMASFP 352
>gi|242088413|ref|XP_002440039.1| hypothetical protein SORBIDRAFT_09g024940 [Sorghum bicolor]
gi|241945324|gb|EES18469.1| hypothetical protein SORBIDRAFT_09g024940 [Sorghum bicolor]
Length = 463
Score = 263 bits (671), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 130/315 (41%), Positives = 197/315 (62%), Gaps = 16/315 (5%)
Query: 46 IHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNR--------TYKLGTNQFS 97
+ + W A+HG++Y E+ RL +F +N ++ N N +Y L N F+
Sbjct: 40 LFDAWCAEHGKAYATPEERAARLAVFADNAAFVAAHNARVNAAGGGGAPPSYTLALNAFA 99
Query: 98 DLTNDEFRALYTG-YKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKE 156
DLT++EFRA G + + RS + ++ + + VP +LDWR+ GAVT +K+Q
Sbjct: 100 DLTHEEFRAARLGRIAAGAAALRSPAAPVYRGLDGGLGAVPDALDWRENGAVTKVKDQGS 159
Query: 157 CGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQG 216
CG CW+F+A A+EGI KI++G+L+ LSEQ+L+DC + N+GC GG + A+ ++++N G
Sbjct: 160 CGACWSFSATGAMEGINKIKTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYKFVVKNGG 219
Query: 217 IATEDEYPYQAVPGTCSAAQ-KPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYST 275
I TE++YPY+ GTC+ + K I Y +VPS E LL+AV+ QPVS+ I +
Sbjct: 220 IDTEEDYPYREADGTCNKNKLKKRIVTIDGYSDVPSNKEDLLLQAVAQQPVSVGICGSAR 279
Query: 276 EFQSY-KEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD- 333
FQ Y ++GIF+G C T LDHAV IVG+G +E G +YW++KNSWG +WG GYM + R+
Sbjct: 280 AFQLYSQQGIFDGPCPTSLDHAVLIVGYG-SEGGKDYWIVKNSWGESWGMKGYMHMHRNT 338
Query: 334 ---EGLCGIGTRSSY 345
+G+CGI +S+
Sbjct: 339 GDSKGVCGINMMASF 353
>gi|320169652|gb|EFW46551.1| cathepsin L2 [Capsaspora owczarzaki ATCC 30864]
Length = 325
Score = 262 bits (670), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 136/305 (44%), Positives = 186/305 (60%), Gaps = 9/305 (2%)
Query: 49 KWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALY 108
+W A H R Y E+ +R +I+ NLE I + N G +Y LG N+F DL + EF A Y
Sbjct: 23 EWKALHNRQYASAQEEALRQEIYLSNLELINEHNAAGRHSYTLGMNEFGDLAHHEFAAKY 82
Query: 109 TGYKMPS-PSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVA 167
G + + +S SST+ + M +P S+DWR G VTP+KNQ +CG CW+F+
Sbjct: 83 LGVRFNGVNATKSFASSTYLPR---MVSLPDSVDWRTAGIVTPVKNQGQCGSCWSFSTTG 139
Query: 168 AVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQ 226
+VEG ++G L+ LSEQ L+DCS+ GN GC GG + AF YII+N GI TE YPY
Sbjct: 140 SVEGQHARKTGTLVSLSEQNLVDCSSQEGNEGCNGGLMDDAFEYIIKNGGIDTEASYPYT 199
Query: 227 AVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGIF 285
A GTC A +++Y+++ +G E L AV ++ PVS+AI A FQ Y G++
Sbjct: 200 ATTGTCKFNAANIGATVASYQDIITGSESDLQNAVATVGPVSVAIDASHINFQFYFTGVY 259
Query: 286 N--GVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTR 342
N TQLDH V VG+GT+ +G +YWL+KNSWG TWG AGY+ + R+ + CGI T
Sbjct: 260 NEKKCSTTQLDHGVLAVGYGTSTEGKDYWLVKNSWGATWGKAGYIWMSRNADNQCGIATS 319
Query: 343 SSYPL 347
+SYPL
Sbjct: 320 ASYPL 324
>gi|449450419|ref|XP_004142960.1| PREDICTED: vignain-like [Cucumis sativus]
Length = 345
Score = 262 bits (670), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 143/319 (44%), Positives = 206/319 (64%), Gaps = 20/319 (6%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
E+S+ +++E+W H S ++ EK R +FKEN+ ++ N+ ++ YKL N+F+D+
Sbjct: 34 EESLWQLYERWGKHHTIS-RNLKEKHKRFSVFKENVNHVFTVNQM-DKPYKLKLNKFADM 91
Query: 100 TNDEFRALYTGYKMPSPSH------RSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKN 153
+N EF Y + SH R + F Y+ TD+P+S+DWR++GAV +K
Sbjct: 92 SNYEFVNFYA---RSNISHYRKLHERRRGAGGFMYE--QDTDLPSSVDWRERGAVNAVKE 146
Query: 154 QKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQ 213
Q CG CWAF++VAAVEGI KI++ L+ LSEQ+LLDC+ N GC GG E AF +I +
Sbjct: 147 QGRCGSCWAFSSVAAVEGINKIKTNQLLSLSEQELLDCNYR-NKGCNGGFMEIAFDFIKR 205
Query: 214 NQGIATEDEYPYQAVPGTCSAAQ-KPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAA 272
N GIATE+ YPY G C +++ KI YE VP +E AL++AV+ QPVS+AI A
Sbjct: 206 NGGIATENSYPYHGSRGLCRSSRISSPIVKIDGYESVPE-NEDALMQAVANQPVSVAIDA 264
Query: 273 YSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR 332
+FQ Y +G+F+G CGT+L+H V +G+GTTEDG +YWL++NSWG WG+ GY+++ R
Sbjct: 265 AGRDFQFYSQGVFDGYCGTELNHGVVAIGYGTTEDGTDYWLVRNSWGVGWGEDGYVRMKR 324
Query: 333 D----EGLCGIGTRSSYPL 347
EGLCGI +SYP+
Sbjct: 325 GVEQAEGLCGIAMEASYPI 343
>gi|194719810|emb|CAR31335.1| pro-asclepain f [Gomphocarpus fruticosus subsp. fruticosus]
Length = 340
Score = 262 bits (670), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 145/344 (42%), Positives = 209/344 (60%), Gaps = 22/344 (6%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
+ I+ LL A +S+ + V+ ++E+W+ +H + Y EK R +IFK+NL Y
Sbjct: 5 VLILSFLLFVSAITCISTNWRSDDEVIALYEEWLVKHQKLYSSLGEKIKRFEIFKDNLRY 64
Query: 78 IEKAN---KEGNRTYKLGTNQFSDLTNDEFRALYTGYKM-------PSPSHRSTTSSTFK 127
I++ N K + + LG NQF+DLT DEF ++Y G + +P+H K
Sbjct: 65 IDQQNHYNKVNHMNFTLGLNQFADLTLDEFSSIYLGTSVDYEQIISSNPNHDDVEEDILK 124
Query: 128 YQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQ 187
+ ++P S+DWR+KG V PI+NQ +CG CW F+AVA++E + I+ G++I LSEQ+
Sbjct: 125 E---DVVELPDSVDWREKGVVFPIRNQGKCGSCWTFSAVASIETLNGIKKGHMIALSEQE 181
Query: 188 LLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYE 247
LLDC T + GC GG AFAY+ +N GI +E++YPY G C QK KIS Y+
Sbjct: 182 LLDCETI-SQGCKGGHYNNAFAYVAKN-GITSEEKYPYIFRQGQC--YQKEKVVKISGYK 237
Query: 248 EVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTED 307
VP + L AV+ Q VS+A+ S +FQ Y GIF+G CG LDHAV IVG+G ++
Sbjct: 238 RVPRNNGGQLQSAVAQQVVSVAVKCESKDFQFYDRGIFSGACGPILDHAVNIVGYG-SKG 296
Query: 308 GANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
GANYW+++NSWG WG+ GYM+I ++ EG CGI + SYP+
Sbjct: 297 GANYWIMRNSWGTNWGENGYMRIQKNSKHYEGHCGIAMQPSYPV 340
>gi|125525812|gb|EAY73926.1| hypothetical protein OsI_01810 [Oryza sativa Indica Group]
Length = 319
Score = 262 bits (669), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 138/327 (42%), Positives = 188/327 (57%), Gaps = 20/327 (6%)
Query: 29 ASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRT 88
A+ + + + +++ E+WMA+ G++YK EKE R IF++N+ +I +
Sbjct: 2 AASAYYNNGSDDGVTMQMFEEWMAKFGKTYKCHGEKEHRFGIFRDNVHFIRGYKPQVTYD 61
Query: 89 YKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAV 148
+G NQF+DLTNDEF A YTG K P P + + P +DWR +GAV
Sbjct: 62 SAVGINQFADLTNDEFVATYTGAKPPHPKEAP--------RPVDPIWTPCCIDWRFRGAV 113
Query: 149 TPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAF 208
T +K+Q CG CWAFAAVAA+EG+TKIR+G L LSEQ+L+DC TN +NGC GG ++AF
Sbjct: 114 TGVKDQGACGSCWAFAAVAAIEGLTKIRTGQLTPLSEQELVDCDTN-SNGCGGGHTDRAF 172
Query: 209 AYIIQNQGIATEDEYPYQAVPGTCSAAQK--PAAAKISNYEEVPSGDEQALLKAVSMQPV 266
+ GI E +Y Y+ G C AA I Y VP DE+ L AV+ QPV
Sbjct: 173 ELVASKGGITAESDYRYEGFQGKCRVDDMLFNHAASIGGYRAVPPNDERQLATAVARQPV 232
Query: 267 SIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGAN---YWLIKNSWGNTWG 323
++ I A FQ YK G+F G CG +HAVT+VG+ +DGA+ YW+ KNSWG TWG
Sbjct: 233 TVYIDASGPAFQFYKSGVFPGPCGASSNHAVTLVGY--CQDGASGKKYWVAKNSWGKTWG 290
Query: 324 DAGYMKIVRD----EGLCGIGTRSSYP 346
GY+ + +D G CG+ YP
Sbjct: 291 QQGYILLEKDVLQPHGTCGLAVSPFYP 317
>gi|391343119|ref|XP_003745860.1| PREDICTED: cathepsin L-like [Metaseiulus occidentalis]
Length = 385
Score = 262 bits (669), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 136/306 (44%), Positives = 191/306 (62%), Gaps = 11/306 (3%)
Query: 48 EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRAL 107
E + A+H + Y+ E+ MR IF+EN ++IE N + + LG N F DLTN E+R
Sbjct: 82 ENFKAEHNKKYESFPEELMRRLIFEENHQFIEDHNSKKEFDFYLGMNHFGDLTNKEYRER 141
Query: 108 YTGYKMP--SPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAA 165
Y GY+ P +PS S S + + DVP +DWRD+G VTP+KNQ +CG CWAF+A
Sbjct: 142 YLGYRRPENTPSKASYIFSRAE----KIEDVPDQIDWRDQGFVTPVKNQGQCGSCWAFSA 197
Query: 166 VAAVEGITKIRSGNLIQLSEQQLLDCST-NGNNGCLGGSREKAFAYIIQNQGIATEDEYP 224
V ++EG +G L+ LSEQ L+DCST GN+GC GG ++AF Y+ N GI TED YP
Sbjct: 198 VGSLEGQHFKSTGKLVSLSEQNLVDCSTPEGNSGCNGGWMDQAFEYVKDNHGIDTEDSYP 257
Query: 225 YQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQ-PVSIAIAAYSTEFQSYKEG 283
Y G+C K A + + +V GDE+AL +AV + PVS+AI A S FQ Y+ G
Sbjct: 258 YVGTDGSCHFKNKSIGATLKGFMDVKEGDEEALRQAVGVAGPVSVAIDASSMLFQFYRGG 317
Query: 284 IFN-GVCGT-QLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDEG-LCGIG 340
++N C T +LDH V +VG+G G ++W++KNSWG WG GY+++ R++G CGI
Sbjct: 318 VYNVPWCSTSELDHGVLVVGYGKQFQGKDFWMVKNSWGVGWGIYGYIEMSRNKGNQCGIA 377
Query: 341 TRSSYP 346
+++S P
Sbjct: 378 SKASIP 383
>gi|170041165|ref|XP_001848344.1| cathepsin l [Culex quinquefasciatus]
gi|167864709|gb|EDS28092.1| cathepsin l [Culex quinquefasciatus]
Length = 340
Score = 262 bits (669), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 137/317 (43%), Positives = 195/317 (61%), Gaps = 15/317 (4%)
Query: 46 IHEKWMA---QHGRSYKDELEKEMRLKIFKENLEYIEKANK---EGNRTYKLGTNQFSDL 99
+ E+W A QH + Y E E+ +RLKI+ +N I K N+ +G ++L N+++DL
Sbjct: 23 VKEEWNAYKLQHRKKYDSETEERLRLKIYVQNKHKIAKHNQRFEQGQEKFRLRVNKYTDL 82
Query: 100 TNDEFRALYTGYKMPS---PSHRSTT-SSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQK 155
++EF G+ + P + Y + +VP ++DWR+KGAVTP+K+Q
Sbjct: 83 LHEEFVQTLNGFNRTNAKKPMLKGVKIDEPVTYIEPANVEVPKTVDWREKGAVTPVKDQG 142
Query: 156 ECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQN 214
CG CW+F+A A+EG ++G L+ LSEQ L+DCST GNNGC GG + AF YI N
Sbjct: 143 HCGSCWSFSATGALEGQHFRKTGKLVSLSEQNLVDCSTKYGNNGCNGGMMDFAFQYIKDN 202
Query: 215 QGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQ-PVSIAIAAY 273
GI TE YPY+A+ TC K A + ++P GDE+AL+KA++ PVS+AI A
Sbjct: 203 GGIDTEKAYPYEAIDDTCHYNPKAVGATDKGFVDIPQGDEKALMKAIATAGPVSVAIDAS 262
Query: 274 STEFQSYKEGI-FNGVCGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIV 331
FQ Y EG+ + C ++ LDH V VG+GT+E+G +YWL+KNSWG TWGD GY+K+
Sbjct: 263 HESFQFYSEGVYYEPQCDSENLDHGVLAVGYGTSEEGEDYWLVKNSWGTTWGDQGYVKMA 322
Query: 332 RD-EGLCGIGTRSSYPL 347
R+ + CGI T +SYPL
Sbjct: 323 RNRDNHCGIATAASYPL 339
>gi|357114837|ref|XP_003559200.1| PREDICTED: fruit bromelain-like [Brachypodium distachyon]
Length = 371
Score = 261 bits (667), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 145/358 (40%), Positives = 203/358 (56%), Gaps = 43/358 (12%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHE--------KWMAQHGRSYKDELEKEMRLK 69
+F+ +T L A +++ + H VVE+ + +W A H R+Y D E+ R +
Sbjct: 27 LFVFLTALPPAA--IMTPAAGH---VVELDDMLMLDRFVRWQAAHNRTYGDAEERLRRFQ 81
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQ 129
+++ N+EYIE N+ G TY+LG NQF+DLT++EF ++Y S
Sbjct: 82 VYRANIEYIEATNRRGGLTYELGENQFADLTSEEFLSMYA-------SSYDAGDRADDEA 134
Query: 130 NLSMTDV---------------PTSLDWRDKGAVTPIKNQ-KECGCCWAFAAVAAVEGIT 173
L TDV P S DWR KGAVTP KNQ C CWAF VA +EG+T
Sbjct: 135 ALITTDVAGDGAWSDGDLEALPPPSWDWRAKGAVTPPKNQGPTCSSCWAFVTVATIEGLT 194
Query: 174 KIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCS 233
I++G LI LSEQQL+DC + GC GS + F ++++N G+ TE EYPY A G C+
Sbjct: 195 FIKTGKLISLSEQQLVDCDMY-DGGCNTGSYSRGFRWVLENGGLTTEAEYPYTAARGPCN 253
Query: 234 AAQKP-AAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQ 292
A+ AAKI+ +P +E + KAV+ QPV +AI + Q YK G+++G CGT
Sbjct: 254 RAKSAHHAAKITGQGRIPPQNELVMQKAVAGQPVGVAIEV-GSGMQFYKTGVYSGPCGTN 312
Query: 293 LDHAVTIVGFGT-TEDGANYWLIKNSWGNTWGDAGYMKIVRD---EGLCGIGTRSSYP 346
L HAVT+VG+G GA YW++KNSWG WG+ G++++ RD GLCGI +YP
Sbjct: 313 LAHAVTVVGYGVDPASGAKYWIVKNSWGQAWGERGFIRMRRDVGGPGLCGIALDVAYP 370
>gi|242072388|ref|XP_002446130.1| hypothetical protein SORBIDRAFT_06g002130 [Sorghum bicolor]
gi|241937313|gb|EES10458.1| hypothetical protein SORBIDRAFT_06g002130 [Sorghum bicolor]
Length = 276
Score = 261 bits (667), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 127/280 (45%), Positives = 179/280 (63%), Gaps = 29/280 (10%)
Query: 72 KENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNL 131
++N+ ++E N N + LG NQF+DLT +EF+A G+K S TT FKY+NL
Sbjct: 19 RDNVAFVESFNANKNNKFWLGVNQFADLTTEEFKA-NKGFKPTSAEKVPTTG--FKYENL 75
Query: 132 SMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDC 191
S++ +PT++DWR KGAVTPIKNQ +CGCCWAF+AVAA+EGI K+ +GNLI LS+Q+L+DC
Sbjct: 76 SVSALPTAVDWRTKGAVTPIKNQGQCGCCWAFSAVAAMEGIVKLSTGNLISLSKQELVDC 135
Query: 192 STNG-NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVP 250
T+ + GC E + PY+AV G C K +AA I +E+VP
Sbjct: 136 DTHSMDEGC--------------------EVQLPYKAVDGKCKGGSK-SAATIKGHEDVP 174
Query: 251 SGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGAN 310
+E AL+KAV+ QPVS+A+ A F Y G+ G CGT+LDH + +G+G DG
Sbjct: 175 VNNEAALMKAVANQPVSVAVDASDRTFMLYSGGVMTGSCGTELDHGIAAIGYGMESDGTK 234
Query: 311 YWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
YW++KNSWG TWG+ G++++ +D G+CG+ + SYP
Sbjct: 235 YWILKNSWGTTWGEKGFLRMEKDITDKRGMCGLAMKPSYP 274
>gi|147769019|emb|CAN62459.1| hypothetical protein VITISV_015168 [Vitis vinifera]
Length = 246
Score = 261 bits (666), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 134/271 (49%), Positives = 183/271 (67%), Gaps = 34/271 (12%)
Query: 86 NRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHR-STTSSTFKYQNLSMTDVPTSLDWRD 144
+++YKL N+F+DLTN+EF +K +H ST +++FKY+N+ T VP++ DWR
Sbjct: 2 DKSYKLSINEFADLTNEEFGTSRNRFK----AHICSTEATSFKYENV--TAVPSTXDWRK 55
Query: 145 KGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGS 203
KGAVTPIK+Q +CG CWAF+AVAA+EGIT++ +G LI LSEQ+L+DC T+G + GC G +
Sbjct: 56 KGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGEDQGCXGAN 115
Query: 204 REKAFAYIIQNQGIATEDEYPYQAVPGTCS--AAQKPAAAKISNYEEVPSGDEQALLKAV 261
YPY GTC+ A PAA KI+ YE+VP+ +E+AL KAV
Sbjct: 116 -------------------YPYAGTDGTCNRKKAAHPAA-KINGYEDVPANNEKALQKAV 155
Query: 262 SMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNT 321
+ QP+++AI A EFQ Y G+F G CGT+LDH V VG+GT++DG YWL+KNSWG
Sbjct: 156 AHQPIAVAIDAGGXEFQFYSSGVFTGQCGTELDHGVXAVGYGTSDDGMKYWLVKNSWGTG 215
Query: 322 WGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
WG+ GY+++ RD EGLCGI ++SYP A
Sbjct: 216 WGEEGYIRMQRDVTAKEGLCGIAMQASYPTA 246
>gi|226505708|ref|NP_001141813.1| uncharacterized protein LOC100273952 precursor [Zea mays]
gi|194706024|gb|ACF87096.1| unknown [Zea mays]
gi|413945958|gb|AFW78607.1| hypothetical protein ZEAMMB73_489507 [Zea mays]
Length = 460
Score = 261 bits (666), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 132/317 (41%), Positives = 192/317 (60%), Gaps = 20/317 (6%)
Query: 48 EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNR-------------TYKLGTN 94
+ W A+HG++Y E+ RL +F +N ++ N +Y L N
Sbjct: 37 DAWCAEHGKAYATPEERAARLAVFADNAAFVAAHNARAGANAAGGGGGGAAPPSYTLALN 96
Query: 95 QFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQ 154
F+DLT++EFRA G P + RS + + + VP +LDWR GAVT +K+Q
Sbjct: 97 AFADLTHEEFRAARLGRIAPGAALRSRAAPVY-WGLGGGAAVPDALDWRKSGAVTKVKDQ 155
Query: 155 KECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQN 214
CG CW+F+A A+EGI KI++G+L+ LSEQ+L+DC + N+GC GG + A+ ++I+N
Sbjct: 156 GSCGACWSFSATGAMEGINKIKTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYKFVIKN 215
Query: 215 QGIATEDEYPYQAVPGTCSAAQ-KPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAY 273
GI TE++YPY+ GTC+ + K I Y +VPS E LL+AV+ QPVS+ I
Sbjct: 216 GGIDTEEDYPYREADGTCNKNKLKKRVVTIDGYTDVPSNKEDLLLQAVAQQPVSVGICGS 275
Query: 274 STEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD 333
+ FQ Y +GIF+G C T LDHAV IVG+G +E G +YW++KNSWG +WG GYM + R+
Sbjct: 276 ARAFQLYYQGIFDGPCPTSLDHAVLIVGYG-SEGGKDYWIVKNSWGESWGMKGYMHMHRN 334
Query: 334 ----EGLCGIGTRSSYP 346
+G+CGI +S+P
Sbjct: 335 TGDSKGVCGINMMASFP 351
>gi|157132324|ref|XP_001655999.1| cathepsin l [Aedes aegypti]
gi|108881694|gb|EAT45919.1| AAEL002833-PA [Aedes aegypti]
Length = 339
Score = 261 bits (666), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 146/350 (41%), Positives = 210/350 (60%), Gaps = 32/350 (9%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMA---QHGRSYKDELEKEMRLKIFKEN 74
M I+I L+ A+ ++ S +E + E+W A QH ++Y E E+ +RLKI+ +N
Sbjct: 1 MKILILLMAFVAA--ANAVSLYEL----VKEEWNAFKLQHRKNYDSETEERIRLKIYVQN 54
Query: 75 LEYIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFK---- 127
I K N+ G Y+L N+++DL ++EF G+ +R+ + + K
Sbjct: 55 KHKIAKHNQRFDLGQEKYRLRVNKYADLLHEEFVQTVNGF------NRTDSKKSLKGVRI 108
Query: 128 -----YQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQ 182
+ + +VPT++DWR KGAVTP+K+Q CG CW+F+A A+EG ++G L+
Sbjct: 109 EEPVTFIEPANVEVPTTVDWRKKGAVTPVKDQGHCGSCWSFSATGALEGQHFRKTGKLVS 168
Query: 183 LSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAA 241
LSEQ L+DCS GNNGC GG + AF YI N GI TE YPY+A+ TC K A
Sbjct: 169 LSEQNLVDCSGKYGNNGCNGGMMDYAFQYIKDNGGIDTEKSYPYEAIDDTCHFNPKAVGA 228
Query: 242 KISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGI-FNGVCGTQ-LDHAVT 298
Y ++P GDE+AL KA+ ++ PVSIAI A FQ Y EG+ + C ++ LDH V
Sbjct: 229 TDKGYVDIPQGDEEALKKALATVGPVSIAIDASHESFQFYSEGVYYEPQCDSENLDHGVL 288
Query: 299 IVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
VG+GT+E+G +YWL+KNSWG TWGD GY+K+ R+ + CG+ T +SYPL
Sbjct: 289 AVGYGTSEEGEDYWLVKNSWGTTWGDQGYVKMARNRDNHCGVATCASYPL 338
>gi|22653679|sp|Q26636.1|CATL_SARPE RecName: Full=Cathepsin L; Contains: RecName: Full=Cathepsin L
heavy chain; Contains: RecName: Full=Cathepsin L light
chain; Flags: Precursor
gi|505140|dbj|BAA03970.1| cathepsin L precursor [Sarcophaga peregrina]
Length = 339
Score = 261 bits (666), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 138/314 (43%), Positives = 191/314 (60%), Gaps = 13/314 (4%)
Query: 46 IHEKWMA---QHGRSYKDELEKEMRLKIFKENLEYIEKANK---EGNRTYKLGTNQFSDL 99
I E+W QH ++Y +E+E+ R+KIF EN I K N+ +G +YKLG N+++D+
Sbjct: 24 IKEEWHTYKLQHRKNYANEVEERFRMKIFNENRHKIAKHNQLFAQGKVSYKLGLNKYADM 83
Query: 100 TNDEFRALYTGYK--MPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKEC 157
+ EF+ GY + T Y + VP S+DWR+ GAVT +K+Q C
Sbjct: 84 LHHEFKETMNGYNHTLRQLMRERTGLVGATYIPPAHVTVPKSVDWREHGAVTGVKDQGHC 143
Query: 158 GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQG 216
G CWAF++ A+EG ++G L+ LSEQ L+DCST GNNGC GG + AF YI N G
Sbjct: 144 GSCWAFSSTGALEGQHFRKAGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGG 203
Query: 217 IATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYST 275
I TE YPY+ + +C + A + + ++P GDE+ + KAV +M PVS+AI A
Sbjct: 204 IDTEKSYPYEGIDDSCHFNKATIGATDTGFVDIPEGDEEKMKKAVATMGPVSVAIDASHE 263
Query: 276 EFQSYKEGIFNGV-CGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD 333
FQ Y EG++N C Q LDH V +VG+GT E G +YWL+KNSWG TWG+ GY+K+ R+
Sbjct: 264 SFQLYSEGVYNEPECDEQNLDHGVLVVGYGTDESGMDYWLVKNSWGTTWGEQGYIKMARN 323
Query: 334 E-GLCGIGTRSSYP 346
+ CGI T SSYP
Sbjct: 324 QNNQCGIATASSYP 337
>gi|443698586|gb|ELT98517.1| hypothetical protein CAPTEDRAFT_128252 [Capitella teleta]
Length = 324
Score = 261 bits (666), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 141/332 (42%), Positives = 207/332 (62%), Gaps = 17/332 (5%)
Query: 24 LLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANK 83
+L C + ++S ++++ E+ + H ++Y E E +MR I++ +L I + N
Sbjct: 1 MLACCIAATLASPLVFDEALDEMWTLFKTTHSKTYATEAE-DMRRFIWERHLNMINQHNI 59
Query: 84 E---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSL 140
E G T+ LG N++ DLT E+ A+ +GYKM + S SS + +NL VP ++
Sbjct: 60 EADLGKHTFSLGMNEYGDLTQHEYAAM-SGYKM---AKSSVGSSFLEPENLQ---VPKTV 112
Query: 141 DWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGC 199
DWR+KG VTP+KNQ +CG CWAF++ ++EG ++G L +SEQ L+DCS + GN GC
Sbjct: 113 DWREKGYVTPVKNQGQCGSCWAFSSTGSLEGQVFRKTGRLPSISEQNLVDCSRDEGNMGC 172
Query: 200 LGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLK 259
GG + AF YI +N GI +E YPY+AV G C + + S + ++P GDE AL
Sbjct: 173 SGGLMDNAFTYIKKNMGIDSEKSYPYEAVDGECRYKKSDSVTTDSGFVDIPHGDETALRT 232
Query: 260 AV-SMQPVSIAIAAYSTEFQSYKEGIFN--GVCGTQLDHAVTIVGFGTTEDGANYWLIKN 316
AV S+ PVS+AI A T FQ YK G++ TQLDH V +VG+G E+G +YWL+KN
Sbjct: 233 AVASVGPVSVAIDASHTSFQFYKTGVYTEANCSSTQLDHGVLVVGYG-VENGQDYWLVKN 291
Query: 317 SWGNTWGDAGYMKIVRDEG-LCGIGTRSSYPL 347
SWG +WG+AGY+K+ R+ G CGI +++SYPL
Sbjct: 292 SWGASWGEAGYIKLARNHGNQCGIASQASYPL 323
>gi|125526835|gb|EAY74949.1| hypothetical protein OsI_02845 [Oryza sativa Indica Group]
Length = 360
Score = 261 bits (666), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 143/321 (44%), Positives = 186/321 (57%), Gaps = 16/321 (4%)
Query: 41 QSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEG-NRTYKLGTNQFSDL 99
S+ HE+WMA+ GR+Y D EK R+++F N E ++ AN+ G +RTY LG NQFSDL
Sbjct: 37 HSMAARHERWMARFGRAYADAAEKARRMEVFAANAERVDAANRAGGDRTYTLGLNQFSDL 96
Query: 100 TNDEFRALYTGYKM--PSPSHRS--TTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQK 155
T+DEF + GY P PSHR + TDVP S+DWR +GAVT +KNQ+
Sbjct: 97 TDDEFARTHLGYSWAPPPPSHRHGHRAENGTAAAAADDTDVPDSVDWRARGAVTEVKNQR 156
Query: 156 ECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQ 215
CG CWAFAAVAA EG+ ++ +GNL+ LSEQQ+LDC T G N C GG A YI +
Sbjct: 157 SCGSCWAFAAVAATEGLVQLATGNLVSLSEQQVLDC-TGGANTCSGGDVSAALRYIAASG 215
Query: 216 GIATEDEYPYQAVPGTCS----AAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIA 271
G+ TE Y Y G C AA AAA GDE AL + QPV + +
Sbjct: 216 GLQTEAAYAYGGQQGACRAGGFAAPNSAAAVGGARWARLYGDEGALQALAAGQPVVVVVE 275
Query: 272 AYSTEFQSYKEGIFNG--VCGTQLDHAVTIV-GFGTTEDGANYWLIKNSWGNTWGDAGYM 328
A +F+ Y+ G++ G CG +L+HAVT+V + G YWL+KN WG WG+ GYM
Sbjct: 276 ASEPDFRHYRSGVYAGSAACGRRLNHAVTVVGYGAAADGGGEYWLVKNQWGTWWGEGGYM 335
Query: 329 KIVRD---EGLCGIGTRSSYP 346
++ R G CGI T + YP
Sbjct: 336 RVARGGAAGGNCGIATYAFYP 356
>gi|115438530|ref|NP_001043562.1| Os01g0613500 [Oryza sativa Japonica Group]
gi|11034572|dbj|BAB17096.1| cysteine proteinase-like [Oryza sativa Japonica Group]
gi|113533093|dbj|BAF05476.1| Os01g0613500 [Oryza sativa Japonica Group]
gi|215697766|dbj|BAG91959.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 360
Score = 261 bits (666), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 143/321 (44%), Positives = 186/321 (57%), Gaps = 16/321 (4%)
Query: 41 QSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEG-NRTYKLGTNQFSDL 99
S+ HE+WMA+ GR+Y D EK R+++F N E ++ AN+ G +RTY LG NQFSDL
Sbjct: 37 HSMAARHERWMARFGRAYADAAEKARRMEVFAANAERVDAANRAGGDRTYTLGLNQFSDL 96
Query: 100 TNDEFRALYTGYKM--PSPSHRS--TTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQK 155
T+DEF + GY P PSHR + TDVP S+DWR +GAVT +KNQ+
Sbjct: 97 TDDEFAQTHLGYSWAPPPPSHRHGHRAENGTAAAAADDTDVPDSVDWRARGAVTEVKNQR 156
Query: 156 ECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQ 215
CG CWAFAAVAA EG+ ++ +GNL+ LSEQQ+LDC T G N C GG A YI +
Sbjct: 157 SCGSCWAFAAVAATEGLVQLATGNLVSLSEQQVLDC-TGGANTCSGGDVSAALRYIAASG 215
Query: 216 GIATEDEYPYQAVPGTCS----AAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIA 271
G+ TE Y Y G C AA AAA GDE AL + QPV + +
Sbjct: 216 GLQTEAAYAYGGQQGACRAGGFAAPNSAAAVGGARWARLYGDEGALQALAAGQPVVVVVE 275
Query: 272 AYSTEFQSYKEGIFNG--VCGTQLDHAVTIV-GFGTTEDGANYWLIKNSWGNTWGDAGYM 328
A +F+ Y+ G++ G CG +L+HAVT+V + G YWL+KN WG WG+ GYM
Sbjct: 276 ASEPDFRHYRSGVYAGSAACGRRLNHAVTVVGYGAAADGGGEYWLVKNQWGTWWGEGGYM 335
Query: 329 KIVRD---EGLCGIGTRSSYP 346
++ R G CGI T + YP
Sbjct: 336 RVARGGAAGGNCGIATYAFYP 356
>gi|21593501|gb|AAM65468.1| cysteine proteinase [Arabidopsis thaliana]
Length = 376
Score = 261 bits (666), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 142/351 (40%), Positives = 205/351 (58%), Gaps = 21/351 (5%)
Query: 10 SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
SF+ ++ + +S + +E V+ ++E+W+ ++G++Y EKE R K
Sbjct: 4 SFRTLALLTLSVLLISISLGVVTATESQRNEGGVLTMYEQWLVENGKNYNGLGEKERRFK 63
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQ 129
IFK+NL+ IE+ N + NR+Y+ G N+FSDLT DEF+A Y G KM +S + +YQ
Sbjct: 64 IFKDNLKRIEEHNSDPNRSYERGLNKFSDLTADEFQASYLGGKM---EKKSLSDVAERYQ 120
Query: 130 NLSMTDVPTSLDWRDKGAVTP-IKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQL 188
+P +DWR++GAV P +K Q ECG CWAFAA AVEGI +I +G L+ LSEQ+L
Sbjct: 121 YKEGDVLPDEVDWRERGAVVPRVKRQGECGSCWAFAATGAVEGINQITTGELVSLSEQEL 180
Query: 189 LDCST-NGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAK----- 242
+DC N N GC GG AF +I +N GI +++ Y Y G +AA K K
Sbjct: 181 IDCDRGNDNFGCAGGGAVWAFEFIKENGGIVSDEVYGYT---GEDTAACKAIEMKTTRVV 237
Query: 243 -ISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQL-DHAVTIV 300
I+ +E VP DE +L KAV+ QP+S+ I+A YK G++ G C DH V IV
Sbjct: 238 TINGHEVVPVNDEMSLKKAVAYQPISVMISA--ANMSDYKSGVYKGACSNLWGDHNVLIV 295
Query: 301 GFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
G+GT+ D +YWLI+NSWG WG+ GY+++ R+ G C + YP+
Sbjct: 296 GYGTSSDEGDYWLIRNSWGPEWGEGGYLRLQRNFHEPTGKCAVAVAPVYPI 346
>gi|18407678|ref|NP_566867.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|30315950|sp|Q9LXW3.1|CPR2_ARATH RecName: Full=Probable cysteine proteinase At3g43960; Flags:
Precursor
gi|7594557|emb|CAB88124.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|26452289|dbj|BAC43231.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|332644328|gb|AEE77849.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 376
Score = 260 bits (665), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 142/351 (40%), Positives = 205/351 (58%), Gaps = 21/351 (5%)
Query: 10 SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
SF+ ++ + +S + +E V+ ++E+W+ ++G++Y EKE R K
Sbjct: 4 SFRTLALLTLSVLLISISLGVVTATESQRNEGEVLTMYEQWLVENGKNYNGLGEKERRFK 63
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQ 129
IFK+NL+ IE+ N + NR+Y+ G N+FSDLT DEF+A Y G KM +S + +YQ
Sbjct: 64 IFKDNLKRIEEHNSDPNRSYERGLNKFSDLTADEFQASYLGGKM---EKKSLSDVAERYQ 120
Query: 130 NLSMTDVPTSLDWRDKGAVTP-IKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQL 188
+P +DWR++GAV P +K Q ECG CWAFAA AVEGI +I +G L+ LSEQ+L
Sbjct: 121 YKEGDVLPDEVDWRERGAVVPRVKRQGECGSCWAFAATGAVEGINQITTGELVSLSEQEL 180
Query: 189 LDCST-NGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAK----- 242
+DC N N GC GG AF +I +N GI +++ Y Y G +AA K K
Sbjct: 181 IDCDRGNDNFGCAGGGAVWAFEFIKENGGIVSDEVYGYT---GEDTAACKAIEMKTTRVV 237
Query: 243 -ISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQL-DHAVTIV 300
I+ +E VP DE +L KAV+ QP+S+ I+A YK G++ G C DH V IV
Sbjct: 238 TINGHEVVPVNDEMSLKKAVAYQPISVMISA--ANMSDYKSGVYKGACSNLWGDHNVLIV 295
Query: 301 GFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
G+GT+ D +YWLI+NSWG WG+ GY+++ R+ G C + YP+
Sbjct: 296 GYGTSSDEGDYWLIRNSWGPEWGEGGYLRLQRNFHEPTGKCAVAVAPVYPI 346
>gi|125604306|gb|EAZ43631.1| hypothetical protein OsJ_28254 [Oryza sativa Japonica Group]
Length = 369
Score = 260 bits (665), Expect = 6e-67, Method: Compositional matrix adjust.
Identities = 138/314 (43%), Positives = 188/314 (59%), Gaps = 23/314 (7%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
E+++ E++E+W QH R +D EK R +FK+N+ I + N+ + YKL N+F D+
Sbjct: 41 EEALWELYERWRGQH-RVARDLGEKARRFNVFKDNVRLIHEFNRR-DEPYKLRLNRFGDM 98
Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGC 159
T DE Y ++ HR K Q L GAV +K+Q +CG
Sbjct: 99 TADESAGAYASSRVSH--HRMFRGRGEKAQRL-------------HGAVGAVKDQGQCGS 143
Query: 160 CWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIA 218
CWAF+ +AAVEGI IR+ NL LSEQQL+DC T GN GC GG + AF YI ++ G+A
Sbjct: 144 CWAFSTIAAVEGINAIRTSNLTALSEQQLVDCDTKTGNAGCDGGLMDNAFQYIAKHGGVA 203
Query: 219 TEDEYPYQAVPGTCSAAQKPAA-AKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEF 277
YPY+A +C ++ + I YE+VP+ E AL KAV+ QPVS+AI A + F
Sbjct: 204 ASSAYPYRARQSSCKSSAASSPAVTIDGYEDVPANSESALKKAVANQPVSVAIEAGGSHF 263
Query: 278 QSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD---- 333
Q Y EG+F G CGT+LDH V VG+GTT DG YW+++NSWG WG+ GY+++ RD
Sbjct: 264 QFYSEGVFAGKCGTELDHGVAAVGYGTTVDGTKYWIVRNSWGADWGEKGYIRMKRDVSAK 323
Query: 334 EGLCGIGTRSSYPL 347
EGLCGI +SYP+
Sbjct: 324 EGLCGIAMEASYPI 337
>gi|440793751|gb|ELR14926.1| Cysteine proteinase 5, putative [Acanthamoeba castellanii str.
Neff]
Length = 326
Score = 260 bits (665), Expect = 7e-67, Method: Compositional matrix adjust.
Identities = 136/331 (41%), Positives = 198/331 (59%), Gaps = 14/331 (4%)
Query: 23 TLLVSCASQVVSSR-STHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKA 81
TLL C + V+S + + + WM +H +SY +E E R +++EN YIE
Sbjct: 5 TLLALCVALFVASTFAVSHDPLTGVFADWMQEHQKSYANE-EFVYRWNVWRENYLYIEAH 63
Query: 82 NKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLD 141
N + N+++ L N+F DLTN EF L+ G + + + + +P D
Sbjct: 64 NHQ-NKSFHLAMNKFGDLTNAEFNKLFKGLSITADQAKQESDIA------PAPGLPADFD 116
Query: 142 WRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCL 200
WR KGAVT +KNQ +CG CW+F+ + EG ++ G L LSEQ L+DCST+ GN+GC
Sbjct: 117 WRQKGAVTHVKNQGQCGSCWSFSTTGSTEGANFLKHGRLTSLSEQNLVDCSTSYGNHGCN 176
Query: 201 GGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKA 260
GG + AF YII+N+GI TE+ YPY A GTC ++ + ++ +Y VPSG+E ALL A
Sbjct: 177 GGLMDYAFEYIIRNKGIDTEESYPYHASQGTCRYNKQHSGGELVSYTNVPSGNEGALLNA 236
Query: 261 VSMQPVSIAIAAYSTEFQSYKEGIFN--GVCGTQLDHAVTIVGFGTTEDGANYWLIKNSW 318
V+ QP S+AI A + FQ YK G+++ ++LDH V VG+G DG +YWL+KNSW
Sbjct: 237 VATQPTSVAIDASHSSFQFYKGGVYDEPACSSSRLDHGVLAVGWG-VRDGKDYWLVKNSW 295
Query: 319 GNTWGDAGYMKIVRDE-GLCGIGTRSSYPLA 348
G WG +GY+++ R++ CGI T +S+P A
Sbjct: 296 GADWGLSGYIEMSRNKHNQCGIATAASHPHA 326
>gi|238816977|gb|ACR56863.1| cathepsin L-like cysteine proteinase [Delia coarctata]
Length = 338
Score = 260 bits (664), Expect = 7e-67, Method: Compositional matrix adjust.
Identities = 130/314 (41%), Positives = 192/314 (61%), Gaps = 13/314 (4%)
Query: 46 IHEKWMA---QHGRSYKDELEKEMRLKIFKENLEYIEKANK---EGNRTYKLGTNQFSDL 99
I E+W +H ++Y E+E+ R+KIF EN I K N+ +G ++KLG N+++D+
Sbjct: 23 IKEEWQTFKMEHRKNYLSEVEERFRMKIFNENRHKIAKHNQLYAQGKVSFKLGLNKYADM 82
Query: 100 TNDEFRALYTGYK--MPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKEC 157
+ EF+ GY M + Y + + VP ++DWR GAVT +K+Q C
Sbjct: 83 LHHEFKETMNGYNHTMRKELRAQEGFNGITYISPANVQVPKAVDWRQHGAVTSVKDQGHC 142
Query: 158 GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQG 216
G CW+F++ ++EG ++G L+ LSEQ L+DCST GNNGC GG + AF YI N G
Sbjct: 143 GSCWSFSSTGSLEGQHFRKAGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGG 202
Query: 217 IATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYST 275
+ TE YPY+ + +C + A + + ++P GDE+A++KAV +M PV++AI A +
Sbjct: 203 VDTEKSYPYEGIDDSCHFNKATVGATDTGFVDIPQGDEEAMMKAVATMGPVAVAIDASNE 262
Query: 276 EFQSYKEGIFN--GVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD 333
FQ Y EG++N LDH V +VG+GT +DG +YWL+KNSWG TWGD GY+K+ R+
Sbjct: 263 SFQLYSEGVYNDPNCSSDNLDHGVLVVGYGTDKDGQDYWLVKNSWGTTWGDQGYIKMARN 322
Query: 334 -EGLCGIGTRSSYP 346
+ CGI T SS+P
Sbjct: 323 QDNQCGIATASSFP 336
>gi|156399477|ref|XP_001638528.1| predicted protein [Nematostella vectensis]
gi|156225649|gb|EDO46465.1| predicted protein [Nematostella vectensis]
Length = 325
Score = 260 bits (664), Expect = 8e-67, Method: Compositional matrix adjust.
Identities = 144/334 (43%), Positives = 196/334 (58%), Gaps = 22/334 (6%)
Query: 19 FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
++ L+ C S++ R H W HG++Y E E+++R I+ +NLE +
Sbjct: 8 LLVAVLIAQCFSELSQDRQWH---------AWKDFHGKTYTGE-EEDLRRAIWNDNLEIV 57
Query: 79 EKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPT 138
+K N E N +YKL N F+DLT EF+ + GY+ S ST STF LS +P
Sbjct: 58 KKHNAE-NHSYKLDMNHFADLTVTEFKQRFMGYRAAS---NSTGGSTF--LPLSNVQLPA 111
Query: 139 SLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNN 197
+DWRDKG VT +KNQ +CG CWAF++ ++EG ++G L+ LSEQ L+DCS GNN
Sbjct: 112 EVDWRDKGFVTAVKNQGQCGSCWAFSSTGSLEGQHFRKTGKLVSLSEQNLVDCSKKYGNN 171
Query: 198 GCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQAL 257
GC GG + AF YI N GI TE YPY A G C A ++ Y +V G E L
Sbjct: 172 GCEGGLMDYAFKYIKNNDGIDTEQSYPYTARDGQCHFKPGSVGATVTGYTDVQRGSEGDL 231
Query: 258 LKAV-SMQPVSIAIAAYSTEFQSYKEGIFN--GVCGTQLDHAVTIVGFGTTEDGANYWLI 314
AV ++ P+S+AI A + FQ YK G+++ TQLDH V VG+G EDG +YWL+
Sbjct: 232 QSAVATVGPISVAIDAGHSSFQLYKTGVYSEPDCSSTQLDHGVLAVGYG-AEDGKDYWLV 290
Query: 315 KNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
KNSWG WG GY+K+ R+ + CGI T++SYPL
Sbjct: 291 KNSWGEGWGMNGYIKMSRNKDNQCGIATQASYPL 324
>gi|149617838|ref|XP_001521715.1| PREDICTED: cathepsin L1-like [Ornithorhynchus anatinus]
Length = 338
Score = 260 bits (664), Expect = 8e-67, Method: Compositional matrix adjust.
Identities = 152/343 (44%), Positives = 209/343 (60%), Gaps = 19/343 (5%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEK-WMAQHGRSYKDELEKEMRLKIFKENLE 76
M +++ L+ C VS+ S ++ H K W H +SY E E+ R +++ENL+
Sbjct: 1 MNLLVCLVSLCWGLAVSAPLG--DSELDRHWKLWKNWHQKSYH-EAEEGWRRTVWEENLK 57
Query: 77 YIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
I+ N E G TY+LG NQF DLTN+EF+ + TG + S +R S+ + +
Sbjct: 58 AIQLHNLEQSLGLHTYRLGMNQFGDLTNEEFQEILTGERHFSKGNRINGSA---FLEANF 114
Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS- 192
VPTS+DWRD G VTP+KNQ CG CWAF+ A+EG +SG LI LSEQ L+DCS
Sbjct: 115 VQVPTSVDWRDHGYVTPVKNQGHCGSCWAFSTTGALEGQLFRKSGRLISLSEQNLVDCSW 174
Query: 193 TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVP-GTCSAAQKPAAAKISNYEEVPS 251
GN GC GG + AF YI+QNQGI +ED YPY A C+ + A A ++ + ++P
Sbjct: 175 QQGNQGCHGGIVDLAFQYILQNQGIDSEDCYPYTAKDTAQCTFKPECATAPVTGFVDIPP 234
Query: 252 GDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGIF-NGVCGTQ-LDHAVTIVGFG---TT 305
E+AL+KAV ++ PVS+ I A ST F+ Y+ GIF + C ++ LDHAV +VG+G
Sbjct: 235 HSEEALMKAVATVGPVSVGIDASSTSFRFYQSGIFYDPKCSSESLDHAVLVVGYGYERED 294
Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRDEG-LCGIGTRSSYPL 347
E G YW++KNSWG WGD GY+ + +D G CGI T +SYPL
Sbjct: 295 EAGKKYWIVKNSWGKHWGDRGYVYMSKDRGNHCGIATVASYPL 337
>gi|91992508|gb|ABE72970.1| cathepsin L [Aedes aegypti]
Length = 339
Score = 260 bits (664), Expect = 9e-67, Method: Compositional matrix adjust.
Identities = 139/323 (43%), Positives = 197/323 (60%), Gaps = 26/323 (8%)
Query: 46 IHEKWMA---QHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDL 99
+ E+W A QH ++Y E E+ +RLKI+ +N I K N+ G Y+L N+++DL
Sbjct: 23 VKEEWNAFKLQHRKNYDSETEERIRLKIYVQNKHKIAKHNQRFDLGQEKYRLRVNKYADL 82
Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFK---------YQNLSMTDVPTSLDWRDKGAVTP 150
++EF G+ +R+ + + K + + +VPT++DWR KGAVTP
Sbjct: 83 LHEEFVQTVNGF------NRTDSKKSLKGVRIEEPVTFIEPANVEVPTTVDWRKKGAVTP 136
Query: 151 IKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFA 209
+K+Q CG CW+F+A A+EG ++G L+ LSEQ L+DCS GNNGC GG + AF
Sbjct: 137 VKDQGHCGSCWSFSATGALEGQHFRKTGKLVSLSEQNLVDCSGKYGNNGCNGGMMDYAFQ 196
Query: 210 YIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSI 268
YI N GI TE YPY+A+ TC K A Y ++P GDE+AL KA+ ++ PVSI
Sbjct: 197 YIKDNGGIDTEKSYPYEAIDDTCHFNPKAVGATDKGYVDIPQGDEEALKKALATVGPVSI 256
Query: 269 AIAAYSTEFQSYKEGI-FNGVCGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAG 326
AI A FQ Y EG+ + C ++ LDH V VG+GT+E+G +YWL+KNSWG TWGD G
Sbjct: 257 AIDASHESFQFYSEGVYYEPQCDSENLDHGVLAVGYGTSEEGEDYWLVKNSWGTTWGDQG 316
Query: 327 YMKIVRD-EGLCGIGTRSSYPLA 348
Y+K+ R+ + CG+ T +SYPL
Sbjct: 317 YVKMARNHDNHCGVATCASYPLV 339
>gi|158300877|ref|XP_001689282.1| AGAP011828-PA [Anopheles gambiae str. PEST]
gi|157013372|gb|EDO63348.1| AGAP011828-PA [Anopheles gambiae str. PEST]
Length = 344
Score = 260 bits (664), Expect = 9e-67, Method: Compositional matrix adjust.
Identities = 143/348 (41%), Positives = 202/348 (58%), Gaps = 27/348 (7%)
Query: 19 FIIITLLVSCASQVVSSRSTHEQSVVE-IHEKWMA---QHGRSYKDELEKEMRLKIFKEN 74
F+I+ L A+ +S + E + E+W A QH + Y E E+ +R+KI+ +N
Sbjct: 4 FLILILGFVAAANAIS--------IFELVKEEWTAFKLQHRKKYDSETEERIRMKIYVQN 55
Query: 75 LEYIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNL 131
I K N+ G ++L N+++DL ++EF G+ K
Sbjct: 56 KHKIAKHNQRYDLGQEKFRLRVNKYADLLHEEFVHTLNGFNRSVSGKGQLLRGELKPIEE 115
Query: 132 SMT-------DVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLS 184
+T DVPT++DWR KGAVT +K+Q CG CW+F+A A+EG ++G L+ LS
Sbjct: 116 PVTWIEPANVDVPTAMDWRTKGAVTQVKDQGHCGSCWSFSATGALEGQHFRKTGKLVSLS 175
Query: 185 EQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKI 243
EQ L+DCS GNNGC GG + AF YI N+GI TE YPY+A+ C K A
Sbjct: 176 EQNLVDCSQKYGNNGCNGGMMDFAFQYIKDNKGIDTEKSYPYEAIDDECHYNPKAVGATD 235
Query: 244 SNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGI-FNGVCGT-QLDHAVTIV 300
+ ++P G+E+AL+KA+ ++ PVS+AI A FQ Y EG+ + C + QLDH V V
Sbjct: 236 KGFVDIPQGNEKALMKALATVGPVSVAIDASHESFQFYSEGVYYEPQCDSEQLDHGVLAV 295
Query: 301 GFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
G+GTTEDG +YWL+KNSWG TWGD GY+K+ R+ + CGI T +SYPL
Sbjct: 296 GYGTTEDGEDYWLVKNSWGTTWGDQGYVKMARNRDNHCGIATTASYPL 343
>gi|18390634|ref|NP_563764.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|8844131|gb|AAF80223.1|AC025290_12 Contains similarity to a cysteine endopeptidase 1 from Phaseolus
vulgaris gb|U52970 and is a member of the papain
cysteine protease family PF|00112 [Arabidopsis thaliana]
gi|332189848|gb|AEE27969.1| cysteine proteinase-like protein [Arabidopsis thaliana]
Length = 343
Score = 259 bits (663), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 141/343 (41%), Positives = 199/343 (58%), Gaps = 21/343 (6%)
Query: 15 TTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKEN 74
T + I L+ S V SS +++ + EKW+ H + Y E +R I++ N
Sbjct: 11 TLAVLICFVLIASKLCSVDSSVYDPHKTLKQRFEKWLKTHSKLYGGRDEWMLRFGIYQSN 70
Query: 75 LEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT 134
++ I+ N + +KL N+F+D+TN EF+A + G +T+S + +
Sbjct: 71 VQLIDYINSL-HLPFKLTDNRFADMTNSEFKAHFLGL--------NTSSLRLHKKQRPVC 121
Query: 135 D----VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLD 190
D VP ++DWR +GAVTPI+NQ +CG CWAF+AVAA+EGI KI++GNL+ LSEQQL+D
Sbjct: 122 DPAGNVPDAVDWRTQGAVTPIRNQGKCGGCWAFSAVAAIEGINKIKTGNLVSLSEQQLID 181
Query: 191 CSTNG-NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQ-KPAAAKISNYEE 248
C N GC GG E AF +I N G+ATE +YPY + GTC + K I Y++
Sbjct: 182 CDVGTYNKGCSGGLMETAFEFIKTNGGLATETDYPYTGIEGTCDQEKSKNKVVTIQGYQK 241
Query: 249 VPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDG 308
V +E +L A + QPVS+ I A FQ Y G+F CGT L+H VT+VG+G D
Sbjct: 242 VAQ-NEASLQIAAAQQPVSVGIDAGGFIFQLYSSGVFTNYCGTNLNHGVTVVGYGVEGD- 299
Query: 309 ANYWLIKNSWGNTWGDAGYMKIVR----DEGLCGIGTRSSYPL 347
YW++KNSWG WG+ GY+++ R D G CGI +SYPL
Sbjct: 300 QKYWIVKNSWGTGWGEEGYIRMERGVSEDTGKCGIAMMASYPL 342
>gi|388890776|gb|AFK80364.1| cysteine proteinase 3, partial [Acanthamoeba castellanii]
Length = 329
Score = 259 bits (663), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 140/335 (41%), Positives = 195/335 (58%), Gaps = 10/335 (2%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
M I L++ A V S+ +T + + +WM + +SY +E E R +++EN +
Sbjct: 1 MRAITILVLLAAICVASTLATTHDPLTGVFAEWMRDNSKSYSNE-EFVFRWNVWRENQQL 59
Query: 78 IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
IE+ N+ N+T L N+F DLTN EF L+ G H + ++ + + +
Sbjct: 60 IEEHNRS-NKTSFLAMNKFGDLTNAEFNKLFKGLAFDYSFHANKAAAE---KAVPAPGLS 115
Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-TNGN 196
DWR KGAVT +KNQ +CG CW+F+ + EG +++G L LSEQ L+DCS + GN
Sbjct: 116 ADFDWRQKGAVTHVKNQGQCGSCWSFSTTGSTEGANFLKTGRLTSLSEQNLIDCSGSYGN 175
Query: 197 NGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQA 256
NGC GG + AF YII N+GI TE YPYQ TC + +++Y +V SGDE A
Sbjct: 176 NGCNGGLMDYAFEYIINNKGIDTEASYPYQTAQYTCQYNPANSGGSLTSYTDVSSGDENA 235
Query: 257 LLKAVSMQPVSIAIAAYSTEFQSYKEGIF--NGVCGTQLDHAVTIVGFGTTEDGANYWLI 314
LL AV+ +P S+AI A FQ Y G++ + TQLDH V VG+G TEDG +YWL+
Sbjct: 236 LLNAVATEPTSVAIDASHNSFQFYSGGVYYESACSSTQLDHGVLAVGWG-TEDGQDYWLV 294
Query: 315 KNSWGNTWGDAGYMKIVRDE-GLCGIGTRSSYPLA 348
KNSWG WG AGY+K+ R+ CGI T +SYP A
Sbjct: 295 KNSWGADWGLAGYIKMARNRSNNCGIATSASYPTA 329
>gi|402770501|gb|AFQ98385.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 259 bits (663), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 141/336 (41%), Positives = 205/336 (61%), Gaps = 14/336 (4%)
Query: 21 IITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEK 80
++ L V CA V+ ++ ++ + E + H ++Y+ +E+ +R KIF EN I K
Sbjct: 1 MLRLSVLCAIAAVTVAASSQEILRTQWEAFKTTHKKTYQSHMEELLRFKIFTENSLIIAK 60
Query: 81 ANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTF-KYQNLSMTDV 136
N + G +YKLG NQF DL EF ++ G+ R T STF N++ + +
Sbjct: 61 HNAKYAKGLVSYKLGMNQFGDLLAHEFARIFNGHH----GTRKTGGSTFLPPANVNDSSL 116
Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-G 195
P ++DWR KGAVTP+K+Q +CG CWAF+A ++EG +++G L+ LSEQ L+DCS + G
Sbjct: 117 PKAVDWRKKGAVTPVKDQGQCGSCWAFSATGSLEGQHFLKNGELVSLSEQNLVDCSQSFG 176
Query: 196 NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQ 255
NNGC GG E AF YI N GI TE YPY+AV G C ++ A + Y E+ +G E
Sbjct: 177 NNGCEGGLMEDAFKYIKANDGIDTEKSYPYEAVDGECRFKKEDVGATDTGYVEIKAGSED 236
Query: 256 ALLKAV-SMQPVSIAIAAYSTEFQSYKEGIFNGV-CGTQ-LDHAVTIVGFGTTEDGANYW 312
L KAV ++ P+S+AI A + FQ Y EG+++ C ++ LDH V +VG+G + G YW
Sbjct: 237 DLKKAVATVGPISVAIDASHSSFQLYSEGVYDEPECSSEDLDHGVLVVGYG-VKGGKKYW 295
Query: 313 LIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
L+KNSW +WGD GY+ + RD CGI +++SYPL
Sbjct: 296 LVKNSWAESWGDQGYILMSRDNNNQCGIASQASYPL 331
>gi|449530091|ref|XP_004172030.1| PREDICTED: vignain-like [Cucumis sativus]
Length = 351
Score = 259 bits (662), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 138/351 (39%), Positives = 210/351 (59%), Gaps = 21/351 (5%)
Query: 12 KINTTPMFIIITLLVSCASQVVSSRSTH-EQSVVEIHEKWMAQHGRSYKDELEKEMRLKI 70
K P+ +I L C S + + E+S+++++++W + H R ++ E R K+
Sbjct: 5 KFLIVPLVLIAFLCNICESFELERKDFESEKSLMQLYKRWSSHH-RISRNANEMHNRFKV 63
Query: 71 FKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALY----TGYKMPSPSHRSTTSST- 125
FK N +++ K N G ++ KL NQF+D+++DEFR +Y T YK T
Sbjct: 64 FKNNAKHVFKVNLMG-KSLKLKLNQFADMSDDEFRNMYSSNITYYKDLHAKKIEATGGRI 122
Query: 126 --FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQL 183
F Y++ + ++P+S+DWR KGAV IKNQ CG CWAFAAVAAVE I +I++ L+ L
Sbjct: 123 GGFMYEHAN--NIPSSIDWRKKGAVNAIKNQGRCGSCWAFAAVAAVESIHQIKTNELVSL 180
Query: 184 SEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTC-SAAQKPAAAK 242
SE+++LDC + GC GG AF +++ N G+ ED YPY G C + +
Sbjct: 181 SEEEVLDCDYR-DGGCRGGFYNSAFEFMMDNDGVTIEDNYPYYEGNGYCRRRGGRNKRVR 239
Query: 243 ISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIF--NGVCGTQLDHAVTIV 300
I YE VP +E AL+KAV+ QPV++AIA+ ++F+ Y G+F N CG +DH V +V
Sbjct: 240 IDGYENVPRNNEYALMKAVAHQPVAVAIASGGSDFKFYGGGMFTENDFCGFNIDHTVVVV 299
Query: 301 GFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
G+GT EDG +YW+I+N +G+ WG GYMK+ R +G+CG+ + +YP+
Sbjct: 300 GYGTDEDG-DYWIIRNQYGHRWGMNGYMKMQRGAHSPQGVCGMAMQPAYPV 349
>gi|195153545|ref|XP_002017686.1| GL17172 [Drosophila persimilis]
gi|194113482|gb|EDW35525.1| GL17172 [Drosophila persimilis]
Length = 341
Score = 259 bits (662), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 141/346 (40%), Positives = 202/346 (58%), Gaps = 22/346 (6%)
Query: 16 TPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMA---QHGRSYKDELEKEMRLKIFK 72
T + + + LV+ A Q VS I E+W +H ++Y+DE E+ RLKIF
Sbjct: 3 TALILPLLALVAVA-QAVSYAEV-------IQEEWHTFKLEHRKNYQDETEERFRLKIFN 54
Query: 73 ENLEYIEKANK---EGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFK-- 127
EN I K N+ G ++K+ N+++D+ + EF + G+ +FK
Sbjct: 55 ENKHKIAKHNQLWATGAVSFKMAVNKYADMLHHEFYSTMNGFNYTLHKQLRNADESFKGV 114
Query: 128 -YQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQ 186
+ + +P +DWR KGAVT +K+Q CG CWAF++ A+EG +SG L+ LSEQ
Sbjct: 115 TFISPEHVTLPKQVDWRTKGAVTDVKDQGHCGSCWAFSSTGALEGQHYRKSGVLVSLSEQ 174
Query: 187 QLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISN 245
L+DCST GNNGC GG + AF YI N GI TE YPY+A+ +C + A
Sbjct: 175 NLVDCSTKYGNNGCNGGLMDNAFRYIKDNGGIDTEKSYPYEAIDDSCHFNKGSIGATDRG 234
Query: 246 YEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGIFN-GVCGTQ-LDHAVTIVGF 302
+ ++P G+E+ + +AV ++ PV++AI A FQ Y EG++N C Q LDH V +VGF
Sbjct: 235 FVDIPQGNEKKMAEAVATIGPVAVAIDASHESFQFYSEGVYNEPACDAQNLDHGVLVVGF 294
Query: 303 GTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
GT E G +YWL+KNSWG TWGD G++K++R+ E CGI + SSYPL
Sbjct: 295 GTDESGEDYWLVKNSWGTTWGDKGFIKMLRNKENQCGIASASSYPL 340
>gi|238007404|gb|ACR34737.1| unknown [Zea mays]
gi|413943289|gb|AFW75938.1| cysteine proteinase Mir2 [Zea mays]
Length = 484
Score = 259 bits (662), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 138/327 (42%), Positives = 199/327 (60%), Gaps = 21/327 (6%)
Query: 40 EQSVVEIHEKWMAQH----------GRSYKDELEKEMRLKIFKENLEYIEKANKE---GN 86
++ V ++E+W ++H G E + RL++F+ NL YI+ N E G
Sbjct: 46 DEEVRRLYEEWRSEHDAGPRRGATGGSLGPGEDDDARRLEVFRYNLRYIDAHNAEADAGL 105
Query: 87 RTYKLGTNQFSDLTNDEFRA-LYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDK 145
++LG +F+DLT +E+RA L G + + + S +Y L+ +P ++DWR++
Sbjct: 106 HGFRLGLTRFADLTLEEYRARLLLGSRGRNGTAVGVVGSR-RYLPLAGEQLPDAVDWRER 164
Query: 146 GAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSRE 205
GAV +K+Q +CG CWAF+AVAAVEGI KI +G+LI LSEQ+L+DC + GC GG +
Sbjct: 165 GAVAEVKDQGQCGACWAFSAVAAVEGINKIVTGSLISLSEQELIDCDKFQDQGCDGGLMD 224
Query: 206 KAFAYIIQNQGIATEDEYPYQAVPGTCSAAQK-PAAAKISNYEEVPSGDEQALLKAVSMQ 264
AF ++I+N GI TE +YP+ GTC K I ++E VP E+AL KAV+ Q
Sbjct: 225 NAFVFMIKNGGIDTEADYPFTGHDGTCDLKLKNTRVVSIDSFERVPINYERALQKAVAHQ 284
Query: 265 PVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGD 324
PVS +I A FQ Y GIF+G CGT LDH VT+VG+G +E G +YW++KNSWG WG+
Sbjct: 285 PVSASIEASRRAFQLYSSGIFDGRCGTYLDHGVTVVGYG-SEGGKDYWIVKNSWGTQWGE 343
Query: 325 AGYMKIVRD----EGLCGIGTRSSYPL 347
AGY+++ R+ G CGI YP+
Sbjct: 344 AGYVRMARNVRVRAGKCGIAMEPLYPV 370
>gi|402770505|gb|AFQ98387.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 259 bits (662), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 141/336 (41%), Positives = 206/336 (61%), Gaps = 14/336 (4%)
Query: 21 IITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEK 80
++ L V CA V+ ++ ++ + E + H ++Y+ +E+ +R KIF EN I K
Sbjct: 1 MLRLSVLCAIVAVTVAASSQEILRTQWEAFKTTHKKTYQSHMEELLRFKIFTENSLIIAK 60
Query: 81 ANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTF-KYQNLSMTDV 136
N + G +YKLG NQF DL EF ++ G++ R T STF N++ + +
Sbjct: 61 HNAKYAKGLVSYKLGMNQFGDLLAHEFARIFNGHR----GTRKTGGSTFLPPANVNDSSL 116
Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-G 195
P ++DWR KGAVTP+K+Q +CG CWAF+A ++EG +++G L+ LSEQ L+DCS + G
Sbjct: 117 PKAVDWRKKGAVTPVKDQGQCGSCWAFSATGSLEGQHFLKNGELVSLSEQNLVDCSQSFG 176
Query: 196 NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQ 255
NNGC GG E AF YI N GI TE YPY+AV G C ++ A + Y E+ +G E
Sbjct: 177 NNGCEGGLMEDAFKYIKANDGIDTEKSYPYEAVDGECRFKKEDVGATDTGYVEIKAGSEV 236
Query: 256 ALLKAV-SMQPVSIAIAAYSTEFQSYKEGIFNGV-CGTQ-LDHAVTIVGFGTTEDGANYW 312
L KAV ++ P+S+AI A + FQ Y EG+++ C ++ LDH V +VG+G + G YW
Sbjct: 237 DLKKAVATVGPISVAIDASHSSFQLYSEGVYDEPECSSEDLDHGVLVVGYG-VKGGKKYW 295
Query: 313 LIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
L+KNSW +WGD GY+ + RD CGI +++SYPL
Sbjct: 296 LVKNSWAESWGDQGYILMSRDNNNQCGIASQASYPL 331
>gi|125811033|ref|XP_001361727.1| GA25021 [Drosophila pseudoobscura pseudoobscura]
gi|54636904|gb|EAL26307.1| GA25021 [Drosophila pseudoobscura pseudoobscura]
Length = 341
Score = 259 bits (662), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 141/346 (40%), Positives = 202/346 (58%), Gaps = 22/346 (6%)
Query: 16 TPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMA---QHGRSYKDELEKEMRLKIFK 72
T + + + LV+ A Q VS I E+W +H ++Y+DE E+ RLKIF
Sbjct: 3 TALILPLLALVAVA-QAVSYAEV-------IQEEWHTFKLEHRKNYQDETEERFRLKIFN 54
Query: 73 ENLEYIEKANK---EGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFK-- 127
EN I K N+ G ++K+ N+++D+ + EF + G+ +FK
Sbjct: 55 ENKHKIAKHNQLWATGAVSFKMAVNKYADMLHHEFYSTMNGFNYTLHKQLRNADESFKGV 114
Query: 128 -YQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQ 186
+ + +P +DWR KGAVT +K+Q CG CWAF++ A+EG +SG L+ LSEQ
Sbjct: 115 TFISPEHVTLPKQVDWRTKGAVTDVKDQGHCGSCWAFSSTGALEGQHYRKSGVLVSLSEQ 174
Query: 187 QLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISN 245
L+DCST GNNGC GG + AF YI N GI TE YPY+A+ +C + A
Sbjct: 175 NLVDCSTKYGNNGCNGGLMDNAFRYIKDNGGIDTEKSYPYEAIDDSCHFNKGTIGATDRG 234
Query: 246 YEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGIFN-GVCGTQ-LDHAVTIVGF 302
+ ++P G+E+ + +AV ++ PV++AI A FQ Y EG++N C Q LDH V +VGF
Sbjct: 235 FVDIPQGNEKKMAEAVATIGPVAVAIDASHESFQFYSEGVYNEPACDAQNLDHGVLVVGF 294
Query: 303 GTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
GT E G +YWL+KNSWG TWGD G++K++R+ E CGI + SSYPL
Sbjct: 295 GTDESGQDYWLVKNSWGTTWGDKGFIKMLRNKENQCGIASASSYPL 340
>gi|402770503|gb|AFQ98386.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 259 bits (662), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 141/336 (41%), Positives = 205/336 (61%), Gaps = 14/336 (4%)
Query: 21 IITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEK 80
++ L V CA V+ ++ ++ + E + H ++Y+ +E+ +R KIF EN I K
Sbjct: 1 MLRLSVLCAIVAVTVAASSQEILRTQWEAFKTTHKKTYQSHMEELLRFKIFTENSLIIAK 60
Query: 81 ANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTF-KYQNLSMTDV 136
N + G +YKLG NQF DL EF ++ G+ R T STF N++ + +
Sbjct: 61 HNAKYAKGLVSYKLGMNQFGDLLAHEFARIFNGHH----GTRKTGGSTFLPPANVNDSSL 116
Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-G 195
P +DWR KGAVTP+K+Q +CG CWAF+A ++EG +++G L+ LSEQ L+DCS + G
Sbjct: 117 PKVVDWRKKGAVTPVKDQGQCGSCWAFSATGSLEGRHFLKNGELVSLSEQNLVDCSQSFG 176
Query: 196 NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQ 255
NNGC GG E AF YI +N GI TE YPY+AV G C ++ A + Y E+ +G E
Sbjct: 177 NNGCEGGLMEDAFKYIKENDGIDTEKSYPYEAVDGECRFKKEDVGATDTGYVEIKAGSED 236
Query: 256 ALLKAV-SMQPVSIAIAAYSTEFQSYKEGIFNGV-CGTQ-LDHAVTIVGFGTTEDGANYW 312
L KAV ++ P+S+AI A + FQ Y EG+++ C ++ LDH V +VG+G + G YW
Sbjct: 237 DLKKAVATVGPISVAIDASHSSFQLYSEGVYDEPECSSEDLDHGVLVVGYG-VKGGKKYW 295
Query: 313 LIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
L+KNSW +WGD GY+ + RD CGI +++SYPL
Sbjct: 296 LVKNSWAESWGDQGYILMSRDNNNQCGIASQASYPL 331
>gi|129614|sp|P00784.1|PAPA1_CARPA RecName: Full=Papain; AltName: Full=Papaya proteinase I; Short=PPI;
AltName: Allergen=Car p 1; Flags: Precursor
gi|167391|gb|AAB02650.1| papain precursor [Carica papaya]
gi|387885|gb|AAA72774.1| papain [synthetic construct]
gi|225437|prf||1303270A papain
Length = 345
Score = 259 bits (662), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 144/337 (42%), Positives = 206/337 (61%), Gaps = 17/337 (5%)
Query: 18 MFIIITLLVSCASQVVSSRS--THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENL 75
+F+ + L S V S++ T + ++++ E WM +H + YK+ EK R +IFK+NL
Sbjct: 17 LFVYMGLSFGDFSIVGYSQNDLTSTERLIQLFESWMLKHNKIYKNIDEKIYRFEIFKDNL 76
Query: 76 EYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD 135
+YI++ NK+ N +Y LG N F+D++NDEF+ YTG + ++ +T S + N +
Sbjct: 77 KYIDETNKK-NNSYWLGLNVFADMSNDEFKEKYTG--SIAGNYTTTELSYEEVLNDGDVN 133
Query: 136 VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG 195
+P +DWR KGAVTP+KNQ CG CWAF+AV +EGI KIR+GNL + SEQ+LLDC
Sbjct: 134 IPEYVDWRQKGAVTPVKNQGSCGSCWAFSAVVTIEGIIKIRTGNLNEYSEQELLDCDRR- 192
Query: 196 NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQK-PAAAKISNYEEVPSGDE 254
+ GC GG A + Q GI + YPY+ V C + +K P AAK +V +E
Sbjct: 193 SYGCNGGYPWSALQLVAQ-YGIHYRNTYPYEGVQRYCRSREKGPYAAKTDGVRQVQPYNE 251
Query: 255 QALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLI 314
ALL +++ QPVS+ + A +FQ Y+ GIF G CG ++DHAV VG+ G NY LI
Sbjct: 252 GALLYSIANQPVSVVLEAAGKDFQLYRGGIFVGPCGNKVDHAVAAVGY-----GPNYILI 306
Query: 315 KNSWGNTWGDAGYMKIVR----DEGLCGIGTRSSYPL 347
KNSWG WG+ GY++I R G+CG+ T S YP+
Sbjct: 307 KNSWGTGWGENGYIRIKRGTGNSYGVCGLYTSSFYPV 343
>gi|156398078|ref|XP_001638016.1| predicted protein [Nematostella vectensis]
gi|156225133|gb|EDO45953.1| predicted protein [Nematostella vectensis]
Length = 326
Score = 259 bits (661), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 139/335 (41%), Positives = 205/335 (61%), Gaps = 17/335 (5%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
+F+++ +LV+ +S+ S R + V W + HG+SY D E+ R+ I+++NLE
Sbjct: 3 VFLVLCVLVA-SSRGWSVRFGQDSEWV----AWKSYHGKSYSDVHEERTRMAIWQQNLEK 57
Query: 78 IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
I++ N E + +YK+ N DLT DEFR Y G + H ST Y S +P
Sbjct: 58 IKRHNAE-DHSYKMAMNHLGDLTEDEFRYFYLGVR---AHHNSTKRGWATYMPPSNVKIP 113
Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-TNGN 196
+S+DW KG VT +KNQ +CG CWAF+ +VEG ++G+L+ LSEQ L+DCS + GN
Sbjct: 114 SSVDWSQKGYVTGVKNQGQCGSCWAFSTTGSVEGQHFRKTGSLVSLSEQNLIDCSGSYGN 173
Query: 197 NGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQA 256
NGC GG + AF YI N GI TE YPY G+C + A+++ Y+++P G EQA
Sbjct: 174 NGCQGGLMDNAFRYIESNGGIDTESSYPYLGQQGSCHFSSSHVGARVTGYQDIPQGSEQA 233
Query: 257 LLKAV-SMQPVSIAIAAYSTEFQSYKEGIF-NGVC-GTQLDHAVTIVGFGTTEDGANYWL 313
L AV ++ PVS+A+ A +++Q Y G++ N C TQLDH V ++G+G +G +YWL
Sbjct: 234 LQSAVATVGPVSVAVDA--SQWQFYSSGVYDNPYCSSTQLDHGVLVIGYGNY-NGQDYWL 290
Query: 314 IKNSWGNTWGDAGYMKIVRDE-GLCGIGTRSSYPL 347
+KNSWG +WG GY+ + R++ CGI + +SYPL
Sbjct: 291 VKNSWGYSWGVEGYIMMSRNKNNQCGIASSASYPL 325
>gi|55740406|gb|AAV63979.1| cathepsin L1 precursor [Artemia parthenogenetica]
Length = 338
Score = 259 bits (661), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 137/335 (40%), Positives = 207/335 (61%), Gaps = 11/335 (3%)
Query: 21 IITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEK 80
+I LL + Q+ ++ S E H + A H + Y +LE+++R+KI+ EN + K
Sbjct: 6 LIFLLAAVLVQLSAALSLTNLLADEWH-LFKATHKKEYPSQLEEKLRMKIYLENKHKVAK 64
Query: 81 AN---KEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
N ++G ++Y++ N+F DL + EFR++ GY+ + S STF + + +VP
Sbjct: 65 HNILYEKGEKSYQVAMNKFGDLLHHEFRSIMNGYQHKK-QNSSRAESTFTFMEPANVEVP 123
Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GN 196
S+DWR+KGA+TP+K+Q +CG CWAF++ A+EG T ++G L+ LSEQ L+DCS GN
Sbjct: 124 ESVDWREKGAITPVKDQGQCGSCWAFSSTGALEGQTFRKTGKLVSLSEQNLIDCSGKYGN 183
Query: 197 NGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQA 256
GC GG ++AF YI N+GI TE+ YPY+A G C + A + ++PSG+E
Sbjct: 184 EGCNGGLMDQAFQYIKDNKGIDTENTYPYEAEDGVCRYNPRNRGAVDRGFVDIPSGEEDK 243
Query: 257 LLKAV-SMQPVSIAIAAYSTEFQSYKEG-IFNGVCGT-QLDHAVTIVGFGTTEDGANYWL 313
L AV ++ PVS+AI A FQ Y +G + C + LDH V +VG+G +++G +YWL
Sbjct: 244 LKAAVATVGPVSVAIDASHESFQFYSKGXYYEPSCDSDDLDHGVLVVGYG-SDNGEDYWL 302
Query: 314 IKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
+KNSW WGD GY+KI R+ + CG+ T +SYPL
Sbjct: 303 VKNSWSEHWGDEGYIKIARNRKNHCGVATAASYPL 337
>gi|7381610|gb|AAF61565.1|AF227957_1 cathepsin L-like proteinase precursor [Rhipicephalus microplus]
Length = 332
Score = 258 bits (660), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 142/336 (42%), Positives = 204/336 (60%), Gaps = 14/336 (4%)
Query: 21 IITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEK 80
++ L V CA V+ ++ ++ + E + H +SY+ +E+ +R KIF EN I K
Sbjct: 1 MLRLSVLCAIVAVTVAASSQEILRTQWEAFKTTHKKSYQSHMEELLRFKIFTENSLIIAK 60
Query: 81 ANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTF-KYQNLSMTDV 136
N + G +YKLG NQF DL EF ++ G+ R T STF N++ + +
Sbjct: 61 HNAKYAKGLVSYKLGMNQFGDLLAHEFARIFNGHH----GTRKTGGSTFLPPANVNDSSL 116
Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-G 195
P +DWR KGAVTP+K+Q +CG CWAF+A ++EG +++G L+ LSEQ L+DCS + G
Sbjct: 117 PKVVDWRKKGAVTPVKDQGQCGSCWAFSATGSLEGQHFLKNGELVSLSEQNLVDCSQSFG 176
Query: 196 NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQ 255
NNGC GG E AF YI N GI TE YPY+AV G C ++ A + Y E+ +G E
Sbjct: 177 NNGCEGGLMEDAFKYIKANDGIDTEKSYPYEAVDGECRFKKEDVGATDTGYVEIKAGSEV 236
Query: 256 ALLKAV-SMQPVSIAIAAYSTEFQSYKEGIFNGV-CGTQ-LDHAVTIVGFGTTEDGANYW 312
L KAV ++ P+S+AI A + FQ Y EG+++ C ++ LDH V +VG+G + G YW
Sbjct: 237 DLKKAVATVGPISVAIDASHSSFQLYSEGVYDEPECSSEDLDHGVLVVGYG-VKGGKKYW 295
Query: 313 LIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
L+KNSW +WGD GY+ + RD CGI +++SYPL
Sbjct: 296 LVKNSWAESWGDQGYILMSRDNNNQCGIASQASYPL 331
>gi|413944252|gb|AFW76901.1| hypothetical protein ZEAMMB73_101481 [Zea mays]
Length = 232
Score = 258 bits (659), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 117/229 (51%), Positives = 157/229 (68%), Gaps = 6/229 (2%)
Query: 123 SSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQ 182
S+ F+Y+N+S+ +P ++DWR GAVTPIK+Q +CGCCWAF+AVAA EGI KI +G LI
Sbjct: 3 STGFRYENVSVDAIPATIDWRTNGAVTPIKDQGQCGCCWAFSAVAATEGIVKISTGKLIS 62
Query: 183 LSEQQLLDCSTNG-NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAA 241
LSEQ+L+DC G + GC GG + AF +II+N G+ TE YPY A G C + +AA
Sbjct: 63 LSEQELVDCDVYGEDQGCEGGLMDDAFKFIIKNGGLTTESNYPYTAADGKCKSGSN-SAA 121
Query: 242 KISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVG 301
I YE+VP+ DE AL+KAV+ QPVS+A+ FQ Y G+ G CGT LDH + +G
Sbjct: 122 NIKGYEDVPTNDEAALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIG 181
Query: 302 FGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
+G T DG YWL+KNSWG TWG+ GY+++ +D +G+CG+ SYP
Sbjct: 182 YGKTSDGTKYWLMKNSWGTTWGENGYLRMEKDISDKKGMCGLAIEPSYP 230
>gi|222641485|gb|EEE69617.1| hypothetical protein OsJ_29194 [Oryza sativa Japonica Group]
Length = 360
Score = 258 bits (659), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 139/347 (40%), Positives = 203/347 (58%), Gaps = 28/347 (8%)
Query: 11 FKINTTPMFIIITLLVSC----ASQVVSSRSTH-------EQSVVEIHEKWMAQHGRSYK 59
F +P + + LL SC A+ ++ +R+T + +++ W H RSY
Sbjct: 4 FMACASPPVLTLALLASCGALLATSMLPARATAGSCLDVGDMVMMDRFRAWQGAHNRSYP 63
Query: 60 DELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKM---PSP 116
E R +++ N E+I+ N G+ TY+L N+F+DLT +EF A YTGY P
Sbjct: 64 SAEEALQRFDVYRRNAEFIDAVNLRGDLTYQLAENEFADLTEEEFLATYTGYYAGDGPVD 123
Query: 117 SHRSTT-----SSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKE-CGCCWAFAAVAAVE 170
TT ++F Y+ DVP S+DWR +GAV P K+Q C CWAF A +E
Sbjct: 124 DSVITTGAGDVDASFSYR----VDVPASVDWRAQGAVVPPKSQTSTCSSCWAFVTAATIE 179
Query: 171 GITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPG 230
+ I++G L+ LSEQQL+DC + + GC GS +A+ ++++N G+ TE +YPY A G
Sbjct: 180 SLNMIKTGKLVSLSEQQLVDCDSY-DGGCNLGSYGRAYKWVVENGGLTTEADYPYTARRG 238
Query: 231 TCSAAQKP-AAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVC 289
C+ A+ AAKI+ + +VP +E AL AV+ QPV++AI + Q YK G++ G C
Sbjct: 239 PCNRAKSAHHAAKITGFGKVPPRNEAALQAAVARQPVAVAIEV-GSGMQFYKGGVYTGPC 297
Query: 290 GTQLDHAVTIVGFGT-TEDGANYWLIKNSWGNTWGDAGYMKIVRDEG 335
GT+L HAVT+VG+GT GA YW IKNSWG +WG+ GY++I+RD G
Sbjct: 298 GTRLAHAVTVVGYGTDASSGAKYWTIKNSWGQSWGERGYIRILRDVG 344
>gi|242040563|ref|XP_002467676.1| hypothetical protein SORBIDRAFT_01g032090 [Sorghum bicolor]
gi|241921530|gb|EER94674.1| hypothetical protein SORBIDRAFT_01g032090 [Sorghum bicolor]
Length = 358
Score = 258 bits (659), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 129/304 (42%), Positives = 183/304 (60%), Gaps = 10/304 (3%)
Query: 49 KWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALY 108
+W A + RSY E++ R ++++ N+E+IE N+ GN TY LG NQF+DLT +EF LY
Sbjct: 59 RWQATYNRSYPTAEERQRRFQVYRRNMEHIEATNRAGNLTYTLGENQFADLTEEEFLDLY 118
Query: 109 TGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQ-KECGCCWAFAAVA 167
T MP + + S+ D PTS+DWR +GAVTPIKNQ C CWAF A
Sbjct: 119 TMKGMPPVRRDAGKKQQANFS--SVVDAPTSVDWRSRGAVTPIKNQGPSCSSCWAFVTAA 176
Query: 168 AVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQA 227
+E IT+IR+G L+ LSEQ+L+DC + GC G + ++IQN G+ TE YPYQA
Sbjct: 177 TIESITQIRTGKLVSLSEQELIDCDPY-DGGCNLGYFVNGYKWVIQNGGLTTEANYPYQA 235
Query: 228 VPGTCSAAQK-PAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFN 286
C+ ++ AA+ISNY ++P G+ Q + S +F Y G+++
Sbjct: 236 RRYQCNRSKAGQRAARISNYRQLPQGEAQLQQAVAQQPVAAAIEMGGSLQF--YSGGVWS 293
Query: 287 GVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKI---VRDEGLCGIGTRS 343
G CGT+++HA+T+VG+G G YWL+KNSWG TWG+ GY+++ VR GLCGI
Sbjct: 294 GQCGTRMNHAITVVGYGADSSGVKYWLVKNSWGQTWGERGYLRMRKDVRQGGLCGIALDL 353
Query: 344 SYPL 347
+YP+
Sbjct: 354 AYPI 357
>gi|402770507|gb|AFQ98388.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 258 bits (659), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 139/335 (41%), Positives = 204/335 (60%), Gaps = 12/335 (3%)
Query: 21 IITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEK 80
++ L V CA V+ ++ ++ + E + H ++Y+ +E+ +R KIF EN I K
Sbjct: 1 MLRLSVLCAIVAVTVAASSQEILRTQWEAFKTTHKKTYQSHMEELLRFKIFTENSLIIAK 60
Query: 81 ANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
N + G +YKLG NQF DL EF ++ GY S +S S+ N++ + +P
Sbjct: 61 HNAKYAKGLVSYKLGMNQFGDLLAHEFARIFNGYH---GSRKSGGSTFLPPANVNDSSLP 117
Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GN 196
++DWR KGAVTP+K+Q +CG CWAF+ ++EG +++G L+ LSEQ L+DCS + GN
Sbjct: 118 KAVDWRKKGAVTPVKDQGQCGSCWAFSTTGSLEGQHFLKNGELVSLSEQNLVDCSQSFGN 177
Query: 197 NGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQA 256
NGC GG E AF YI N GI TE YPY+AV G C ++ A + Y E+ +G E
Sbjct: 178 NGCEGGLMEDAFKYIKANDGIDTEKSYPYEAVDGECRFKKEDVGATDTGYVEIKAGCEDD 237
Query: 257 LLKAV-SMQPVSIAIAAYSTEFQSYKEGIFNGV-CGTQ-LDHAVTIVGFGTTEDGANYWL 313
L KAV ++ P+S+AI A + FQ Y EG+++ C ++ LDH V +VG+G + G YWL
Sbjct: 238 LKKAVATVGPISVAIDASHSSFQLYSEGVYDEPECSSEDLDHGVLVVGYG-VKGGKKYWL 296
Query: 314 IKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
+KNSW +WGD GY+ + RD CGI +++SYPL
Sbjct: 297 VKNSWAESWGDQGYILMSRDNNNQCGIASQASYPL 331
>gi|242093994|ref|XP_002437487.1| hypothetical protein SORBIDRAFT_10g027980 [Sorghum bicolor]
gi|241915710|gb|EER88854.1| hypothetical protein SORBIDRAFT_10g027980 [Sorghum bicolor]
Length = 341
Score = 257 bits (657), Expect = 5e-66, Method: Compositional matrix adjust.
Identities = 137/317 (43%), Positives = 198/317 (62%), Gaps = 32/317 (10%)
Query: 40 EQSVVEIHEKWMAQHGRSYKD-ELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQ 95
++ V ++++ W ++HGR + +RLK+F++NL YI+ N E G T++LG
Sbjct: 44 DEEVRQLYKTWKSEHGRPRDGISVADGLRLKVFRDNLRYIDAHNAEADAGLHTFRLGLTP 103
Query: 96 FSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQK 155
F+DLT +EFRA G+ + S R + +Y + D+P ++DWR +GAVT +KNQ
Sbjct: 104 FTDLTLEEFRAHALGF-LNSTLPRVASD---RYLPRAGDDLPDAVDWRQQGAVTGVKNQL 159
Query: 156 ECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQ 215
+CG CWAF+AVAA+EGI KI + NLI LSEQ+L+DC T + GC GG +KAF ++I N
Sbjct: 160 DCGGCWAFSAVAAMEGINKIVTNNLISLSEQELIDCDTE-DYGCQGGEMQKAFQFVIDNG 218
Query: 216 GIATEDEYPYQAVPGTCSA-AQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYS 274
GI TE +YP+ GTC A +K I +YE VP+ DE+AL KAV+ QP
Sbjct: 219 GIDTEADYPFIGTNGTCDAIREKRKVVSIDSYENVPTNDEEALQKAVANQP--------- 269
Query: 275 TEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD- 333
GIFNG CG LDH VT VG+G +++G ++W++KNSWG WG++GY+++ R+
Sbjct: 270 --------GIFNGPCGFILDHGVTAVGYG-SDNGEDFWIVKNSWGAEWGESGYIRMKRNV 320
Query: 334 ---EGLCGIGTRSSYPL 347
G CGI +SYP+
Sbjct: 321 LLPMGKCGIAMYASYPV 337
>gi|297843430|ref|XP_002889596.1| hypothetical protein ARALYDRAFT_887827 [Arabidopsis lyrata subsp.
lyrata]
gi|297335438|gb|EFH65855.1| hypothetical protein ARALYDRAFT_887827 [Arabidopsis lyrata subsp.
lyrata]
Length = 343
Score = 257 bits (657), Expect = 5e-66, Method: Compositional matrix adjust.
Identities = 140/343 (40%), Positives = 198/343 (57%), Gaps = 21/343 (6%)
Query: 15 TTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKEN 74
T + I L+ S V SS +++ + EKW+ H + Y E +R I++ N
Sbjct: 11 TLVVLICFVLIASKLCSVNSSVYDPHKTLKQRFEKWLKTHSKLYGGRDEWMLRFGIYQSN 70
Query: 75 LEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT 134
++ I+ N + +KL N+F+D+TN EF+A + G +T+S + +
Sbjct: 71 VQLIDYINSL-HLPFKLTDNRFADMTNSEFKAHFLGL--------NTSSLRLHKKQRPVC 121
Query: 135 D----VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLD 190
D VP ++DWR +GAVTPI+NQ +CG CWAF+AVAA+EGI KI++GNL+ LSEQQL+D
Sbjct: 122 DPAGNVPDAVDWRTQGAVTPIRNQGKCGGCWAFSAVAAIEGINKIKTGNLVSLSEQQLID 181
Query: 191 CSTNG-NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQ-KPAAAKISNYEE 248
C N GC GG E AF +I N G+ TE +YPY + GTC + K I Y++
Sbjct: 182 CDVGTYNKGCSGGLMETAFEFIKSNGGLTTETDYPYTGIEGTCDQEKAKNKVVTIQGYQK 241
Query: 249 VPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDG 308
V +E +L A + QPVS+ I A FQ Y G+F CGT L+H VT+VG+G D
Sbjct: 242 VAQ-NEASLQIAAAQQPVSVGIDAGGFIFQLYSSGVFTSYCGTNLNHGVTVVGYGVEGD- 299
Query: 309 ANYWLIKNSWGNTWGDAGYMKIVR----DEGLCGIGTRSSYPL 347
YW++KNSWG WG+ GY+++ R D G CGI +SYPL
Sbjct: 300 QKYWIVKNSWGTGWGEEGYIRMERGISEDTGKCGIAMLASYPL 342
>gi|320169658|gb|EFW46557.1| cathepsin L2 [Capsaspora owczarzaki ATCC 30864]
Length = 324
Score = 257 bits (657), Expect = 5e-66, Method: Compositional matrix adjust.
Identities = 135/306 (44%), Positives = 190/306 (62%), Gaps = 11/306 (3%)
Query: 48 EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRAL 107
+ W A HG SY E+ R I++ NL++IEK N EG+ +YKL N+F+DLT EF A
Sbjct: 23 DSWKATHGVSYATVGEETARRGIYRANLDFIEKHNSEGH-SYKLAVNKFADLTYPEFAAK 81
Query: 108 YTGYKMPSP-SHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAV 166
Y G + + + +S +ST+ + M +P S+DWR G VTPIK+Q +CG CW+F+
Sbjct: 82 YLGLRFDATNATKSFAASTYLPR---MVSLPDSVDWRTAGIVTPIKDQGQCGSCWSFSTT 138
Query: 167 AAVEGITKIRSGNLIQLSEQQLLDCST-NGNNGCLGGSREKAFAYIIQNQGIATEDEYPY 225
+VEG ++G L+ LSEQ L+DCS+ GN GC GG ++AF YII N GI TE YPY
Sbjct: 139 GSVEGQHARKTGQLVSLSEQNLVDCSSAQGNAGCNGGLMDQAFQYIISNNGIDTESSYPY 198
Query: 226 QAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGI 284
A GTC A +++Y+++ SG E L AV ++ P+S+AI A FQ Y G+
Sbjct: 199 TAQDGTCQFNSANVGATVASYQDIASGSESDLQNAVATVGPISVAIDASQPSFQFYSSGV 258
Query: 285 FN--GVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGT 341
+N +QLDH V VG+GT+ ++YWL+KNSWG +WG +GY+ + R+ CGI T
Sbjct: 259 YNEPACSSSQLDHGVLAVGYGTS-GSSDYWLVKNSWGTSWGQSGYIWMTRNSNNQCGIAT 317
Query: 342 RSSYPL 347
+SYPL
Sbjct: 318 AASYPL 323
>gi|449500383|ref|XP_004161083.1| PREDICTED: vignain-like [Cucumis sativus]
Length = 345
Score = 257 bits (657), Expect = 5e-66, Method: Compositional matrix adjust.
Identities = 142/319 (44%), Positives = 205/319 (64%), Gaps = 20/319 (6%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
E+S+ +++E+W H S ++ EK R +FKEN+ ++ N+ ++ YKL N+F+D+
Sbjct: 34 EESLWQLYERWGKHHTIS-RNLKEKHKRFSVFKENVNHVFTVNQM-DKPYKLKLNKFADM 91
Query: 100 TNDEFRALYTGYKMPSPSH------RSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKN 153
+N EF Y + SH R + F Y+ TD+P+S+D R++GAV +K
Sbjct: 92 SNYEFVNFYA---RSNISHYRKLHERRRGAGGFMYE--QDTDLPSSVDGRERGAVNAVKE 146
Query: 154 QKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQ 213
Q CG CWAF++VAAVEGI KI++ L+ LSEQ+LLDC+ N GC GG E AF +I +
Sbjct: 147 QGRCGSCWAFSSVAAVEGINKIKTNQLLSLSEQELLDCNYR-NKGCNGGFMEIAFDFIKR 205
Query: 214 NQGIATEDEYPYQAVPGTCSAAQ-KPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAA 272
N GIATE+ YPY G C +++ KI YE VP +E AL++AV+ QPVS+AI A
Sbjct: 206 NGGIATENSYPYHGSRGLCRSSRISSPIVKIDGYESVPE-NEDALMQAVANQPVSVAIDA 264
Query: 273 YSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR 332
+FQ Y +G+F+G CGT+L+H V +G+GTTEDG +YWL++NSWG WG+ GY+++ R
Sbjct: 265 AGRDFQFYSQGVFDGYCGTELNHGVVAIGYGTTEDGTDYWLVRNSWGVGWGEDGYVRMKR 324
Query: 333 D----EGLCGIGTRSSYPL 347
EGLCGI +SYP+
Sbjct: 325 GVEQAEGLCGIAMEASYPI 343
>gi|343978787|gb|AEM76722.1| cathepsin L-like proteinase [Triatoma brasiliensis]
Length = 330
Score = 257 bits (657), Expect = 5e-66, Method: Compositional matrix adjust.
Identities = 139/333 (41%), Positives = 199/333 (59%), Gaps = 14/333 (4%)
Query: 23 TLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN 82
LLV+ A VS + E E + HG++YK++ E+ R KIF N + IE N
Sbjct: 3 VLLVAVAVIAVSCANRFYNINPEEWETFKVVHGKNYKNQFEEMFRRKIFMNNKKRIEAHN 62
Query: 83 ---KEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTS 139
++G +YK+ N F DL + E +AL G+KM +P+ + F S +P S
Sbjct: 63 AKYEQGEVSYKMKMNHFGDLMSHEIKALMNGFKM-TPNTKREGKIYFP----SNDKLPKS 117
Query: 140 LDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNG 198
+DWR KGAVTP+K+Q +CG CW+F+A ++EG ++ G L+ LSEQ L+DCS GNNG
Sbjct: 118 VDWRQKGAVTPVKDQGQCGSCWSFSATGSLEGQIFLKKGKLVSLSEQNLMDCSKEYGNNG 177
Query: 199 CLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALL 258
C GG +KAF Y+ N+GI TE YPY+A C + Y ++P GDE+AL
Sbjct: 178 CEGGLMDKAFQYVSDNKGIDTESSYPYEARDYACRFKKDKVGGTDKGYVDIPEGDEKALQ 237
Query: 259 KAV-SMQPVSIAIAAYSTEFQSYKEGIFN-GVCGT-QLDHAVTIVGFGTTEDGANYWLIK 315
A+ ++ P+S+AI A F Y EG++N C + LDH V VG+G TE+G +YWL+K
Sbjct: 238 NALATVGPISVAIDASHESFHFYSEGVYNEPYCSSYDLDHGVLAVGYG-TENGQDYWLVK 296
Query: 316 NSWGNTWGDAGYMKIVRDE-GLCGIGTRSSYPL 347
NSWG +WG++GY+KI R+ CGI + +SYP+
Sbjct: 297 NSWGPSWGESGYIKIARNHSNHCGIASMASYPI 329
>gi|402770515|gb|AFQ98392.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 257 bits (656), Expect = 7e-66, Method: Compositional matrix adjust.
Identities = 138/335 (41%), Positives = 204/335 (60%), Gaps = 12/335 (3%)
Query: 21 IITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEK 80
++ L V CA V+ ++ ++ + E + H ++Y+ +E+ +R KIF EN I K
Sbjct: 1 MLRLSVLCAIAAVTVAASSQEILRTQWEAFKTTHKKTYQSHMEELLRFKIFTENSLIIAK 60
Query: 81 ANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
N + G +YKLG NQF DL EF ++ G+ + ++ SS N++ + +P
Sbjct: 61 HNAKYAKGLVSYKLGMNQFGDLLAHEFARIFNGHH---GTRKTGGSSFLPPANVNDSSLP 117
Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GN 196
+DWR KGAVTP+K+Q +CG CWAF+A ++EG +++G L+ LSEQ L+DCS + GN
Sbjct: 118 KVVDWRKKGAVTPVKDQGQCGSCWAFSATGSLEGQHFLKNGELVSLSEQNLVDCSQSFGN 177
Query: 197 NGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQA 256
NGC GG E AF YI N GI TE YPY+AV G C ++ A + Y E+ +G E
Sbjct: 178 NGCEGGLMEDAFKYIKANDGIDTEKSYPYEAVDGECRFKKEDVGATDTGYVEIKAGSEVD 237
Query: 257 LLKAV-SMQPVSIAIAAYSTEFQSYKEGIFNGV-CGTQ-LDHAVTIVGFGTTEDGANYWL 313
L KAV ++ P+S+AI A + FQ Y EG+++ C ++ LDH V +VG+G + G YWL
Sbjct: 238 LKKAVATVGPISVAIDASHSSFQLYSEGVYDEPECSSEDLDHGVLVVGYG-VKGGKKYWL 296
Query: 314 IKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
+KNSW +WGD GY+ + RD CGI +++SYPL
Sbjct: 297 VKNSWAESWGDQGYILMSRDNNNQCGIASQASYPL 331
>gi|195381187|ref|XP_002049336.1| GJ20806 [Drosophila virilis]
gi|194144133|gb|EDW60529.1| GJ20806 [Drosophila virilis]
Length = 339
Score = 257 bits (656), Expect = 7e-66, Method: Compositional matrix adjust.
Identities = 142/345 (41%), Positives = 197/345 (57%), Gaps = 22/345 (6%)
Query: 18 MFIIITLLVSCA-SQVVSSRSTHEQSVVEIHEKWMA---QHGRSYKDELEKEMRLKIFKE 73
M I+ LL A +Q VS I E+W +H ++Y DE E+ RLKIF E
Sbjct: 1 MRILFALLALVAVAQAVSYADV-------IKEEWQTFKLEHRKNYVDETEERFRLKIFNE 53
Query: 74 NLEYIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTF---K 127
N I K N+ G ++K+ N+++D+ + EF G+ + +F
Sbjct: 54 NKHKIAKHNQRYASGEVSFKMAVNKYADMLHHEFHTTMNGFNYTLHKQLRASDPSFVGVT 113
Query: 128 YQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQ 187
+ + +P S+DWR KGAVT +K+Q CG CWAF++ A+EG ++G LI LSEQ
Sbjct: 114 FISPEHVKIPKSVDWRSKGAVTEVKDQGHCGSCWAFSSTGALEGQHFRKAGTLISLSEQN 173
Query: 188 LLDCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNY 246
L+DCST GNNGC GG + AF YI N GI TE YPY+ + +C + A
Sbjct: 174 LVDCSTKYGNNGCNGGLMDNAFRYIKDNGGIDTEKSYPYEGIDDSCHFNKATIGATDRGS 233
Query: 247 EEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGIFN-GVCGTQ-LDHAVTIVGFG 303
++P GDE+ + +AV ++ PVS+AI A FQ Y EGI+N C Q LDH V +VG+G
Sbjct: 234 VDIPQGDEKKMAEAVATIGPVSVAIDASHESFQFYSEGIYNEPQCDPQNLDHGVLVVGYG 293
Query: 304 TTEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
T E G +YWL+KNSWG TWGD G++K+ R+ + CGI + SSYPL
Sbjct: 294 TDESGQDYWLVKNSWGTTWGDKGFIKMARNADNQCGIASASSYPL 338
>gi|146217394|gb|ABQ10739.1| cathepsin L [Penaeus monodon]
Length = 341
Score = 257 bits (656), Expect = 7e-66, Method: Compositional matrix adjust.
Identities = 136/317 (42%), Positives = 195/317 (61%), Gaps = 13/317 (4%)
Query: 43 VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANK---EGNRTYKLGTNQFSDL 99
V+E E + +H + Y E+E+ R+KIF EN I NK +G+ TYKL N++ D+
Sbjct: 25 VLEEWEAFKLEHSKKYDSEVEESFRMKIFTENKHKIANHNKGFAQGHHTYKLSMNKYGDM 84
Query: 100 TNDEFRALYTGYKMPS----PSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQK 155
+ EF + G++ ++R+ T +TF + + +P ++DWR KGAVTPIK+Q
Sbjct: 85 LHHEFVSTMNGFRGNHTGGYKNNRAYTGATFIEPDDDVQ-LPKNVDWRTKGAVTPIKDQG 143
Query: 156 ECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQN 214
+CG CWAF+A A+EG T ++G L+ LSEQ L+DCS GNNGC GG + AF Y+ +N
Sbjct: 144 QCGSCWAFSATGALEGQTFRKTGQLVSLSEQNLVDCSRKFGNNGCNGGLMDNAFEYVKEN 203
Query: 215 QGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAY 273
GI TE+ YPY A C + A A+ + +V G E AL KAV ++ PVS+AI A
Sbjct: 204 GGIDTEESYPYDAEDEKCHYNPRAAGAEDKGFVDVREGSEHALKKAVATVGPVSVAIDAS 263
Query: 274 STEFQSYKEGIF-NGVCGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIV 331
FQ Y G++ C + LDH V +VG+G +DG +YWL+KNSWG TWGD GY+K+
Sbjct: 264 HESFQFYSHGVYIEPECSPEMLDHGVLVVGYGIDDDGTDYWLVKNSWGTTWGDQGYVKMA 323
Query: 332 RD-EGLCGIGTRSSYPL 347
R+ + CGI + +S+PL
Sbjct: 324 RNRDNQCGIASSASFPL 340
>gi|402770517|gb|AFQ98393.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 257 bits (656), Expect = 7e-66, Method: Compositional matrix adjust.
Identities = 138/335 (41%), Positives = 204/335 (60%), Gaps = 12/335 (3%)
Query: 21 IITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEK 80
++ L V CA V+ ++ ++ + E + H ++Y+ +E+ +R KIF EN I K
Sbjct: 1 MLRLSVLCAIAAVTVAASSQEILRTQWEAFKTTHKKTYQSHMEELLRFKIFTENSLIIAK 60
Query: 81 ANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
N + G +YKLG NQF DL EF ++ G+ + ++ SS N++ + +P
Sbjct: 61 HNAKYAKGLVSYKLGMNQFGDLLAHEFARIFNGHH---GTRKTGGSSFLPPANVNDSSLP 117
Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GN 196
+DWR KGAVTP+K+Q +CG CWAF+A ++EG +++G L+ LSEQ L+DCS + GN
Sbjct: 118 KVVDWRKKGAVTPVKDQGQCGSCWAFSATGSLEGQHFLKNGELVSLSEQNLVDCSQSFGN 177
Query: 197 NGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQA 256
NGC GG E AF YI N GI TE YPY+AV G C ++ A + Y E+ +G E
Sbjct: 178 NGCEGGLMEDAFKYIKANDGIDTEKSYPYKAVDGECRFKKEDVGATDTGYVEIKAGSEVD 237
Query: 257 LLKAV-SMQPVSIAIAAYSTEFQSYKEGIFNGV-CGTQ-LDHAVTIVGFGTTEDGANYWL 313
L KAV ++ P+S+AI A + FQ Y EG+++ C ++ LDH V +VG+G + G YWL
Sbjct: 238 LKKAVATVGPISVAIDASHSSFQLYSEGVYDEPECSSEDLDHGVLVVGYG-VKGGKKYWL 296
Query: 314 IKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
+KNSW +WGD GY+ + RD CGI +++SYPL
Sbjct: 297 VKNSWAESWGDQGYILMSRDNNNQCGIASQASYPL 331
>gi|402770509|gb|AFQ98389.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 256 bits (655), Expect = 8e-66, Method: Compositional matrix adjust.
Identities = 139/336 (41%), Positives = 205/336 (61%), Gaps = 14/336 (4%)
Query: 21 IITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEK 80
++ L V CA V+ ++ ++ + E + H ++Y+ +E+ +R KIF E+ I +
Sbjct: 1 MLRLSVLCAIAAVTVAASSQEILRTQWEAFKTTHKKTYQSHMEELLRFKIFTESSLIIAR 60
Query: 81 ANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTF-KYQNLSMTDV 136
N + G +YKLG NQF DL EF ++ G+ R T STF N++ + +
Sbjct: 61 HNAKYAKGLVSYKLGMNQFGDLLAHEFARIFNGHH----GTRKTGGSTFLPPANVNDSSL 116
Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-G 195
P ++DWR KGAVTP+K+Q +CG CWAF+A ++EG +++G L+ LSEQ L+DCS + G
Sbjct: 117 PKAVDWRKKGAVTPVKDQGQCGSCWAFSATGSLEGQHFLKNGELVSLSEQNLVDCSQSFG 176
Query: 196 NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQ 255
NNGC GG E AF YI N GI TE YPY+AV G C ++ A + Y E+ +G E
Sbjct: 177 NNGCEGGLMEDAFKYIKANDGIDTEKSYPYEAVDGECRFKKEDVGATDTGYVEIKAGSED 236
Query: 256 ALLKAV-SMQPVSIAIAAYSTEFQSYKEGIFNGV-CGTQ-LDHAVTIVGFGTTEDGANYW 312
L KAV ++ P+S+AI A + FQ Y EG+++ C ++ LDH V +VG+G + G YW
Sbjct: 237 DLKKAVATVGPISVAIDASHSSFQLYSEGVYDEPECSSEDLDHGVLVVGYG-VKGGKKYW 295
Query: 313 LIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
L+KNSW +WGD GY+ + RD CGI +++SYPL
Sbjct: 296 LVKNSWAESWGDQGYILMSRDNNNQCGIASQASYPL 331
>gi|91092014|ref|XP_970644.1| PREDICTED: similar to cathepsin-L-like cysteine peptidase 02
[Tribolium castaneum]
gi|270001249|gb|EEZ97696.1| cathepsin L precursor [Tribolium castaneum]
Length = 337
Score = 256 bits (655), Expect = 8e-66, Method: Compositional matrix adjust.
Identities = 138/342 (40%), Positives = 197/342 (57%), Gaps = 19/342 (5%)
Query: 19 FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMA---QHGRSYKDELEKEMRLKIFKENL 75
F++ L SQ VS + E+W A H + Y+ E E+ R+KIF EN
Sbjct: 3 FLVFVALCVVGSQAVSFFDL-------VQEQWGAFKVTHKKQYESETEERFRMKIFMENA 55
Query: 76 EYIEKANK---EGNRTYKLGTNQFSDLTNDEFRALYTGY-KMPSPSHRSTTSSTFKYQNL 131
+ K NK +G ++KLG N++SD+ N EF GY + +P + +
Sbjct: 56 HKVAKHNKLYAQGLVSFKLGVNKYSDMLNHEFVHTLNGYNRSKTPLRSGELDESITFIPP 115
Query: 132 SMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDC 191
+ ++P +DWR GAVTP+K+Q +CG CW+F+ ++EG +S L+ LSEQ L+DC
Sbjct: 116 ANVELPKQIDWRKLGAVTPVKDQGQCGSCWSFSTTGSLEGQHFRKSKKLVSLSEQNLIDC 175
Query: 192 STN-GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVP 250
S GNNGC GG + AF YI N GI TE YPY+A C + A + ++
Sbjct: 176 SEKYGNNGCNGGLMDNAFRYIKDNGGIDTEQSYPYKAEDEKCHYKPRNKGATDRGFVDIE 235
Query: 251 SGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGIF-NGVCGT-QLDHAVTIVGFGTTED 307
SGDE+ L AV ++ P+S+AI A FQ Y EG++ C + QLDH V +VG+GT ED
Sbjct: 236 SGDEEKLKAAVATVGPISVAIDASHPTFQQYSEGVYYEPECSSEQLDHGVLVVGYGTDED 295
Query: 308 GANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPLA 348
G +YWL+KNSWG++WGD GY+K+ R+ + CGI T++SYPL
Sbjct: 296 GNDYWLVKNSWGDSWGDQGYIKMARNRDNNCGIATQASYPLV 337
>gi|55740402|gb|AAV63977.1| cathepsin L precursor [Artemia franciscana]
Length = 338
Score = 256 bits (655), Expect = 9e-66, Method: Compositional matrix adjust.
Identities = 137/335 (40%), Positives = 206/335 (61%), Gaps = 11/335 (3%)
Query: 21 IITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEK 80
+I LL + Q+ ++ S E H + A H + Y +LE++ R+KI+ EN + K
Sbjct: 6 LIFLLGAVLVQLSAALSLTNLLADEWH-LFKATHKKEYPSQLEEKFRMKIYLENKHKVAK 64
Query: 81 AN---KEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
N ++G ++Y++ N+F DL + EFR++ GY+ + S STF + + +VP
Sbjct: 65 HNILYEKGEKSYQVAMNKFGDLLHHEFRSIMNGYQHKK-QNSSRAESTFTFMEPANVEVP 123
Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GN 196
S+DWR+KGA+TP+K+Q +CG CWAF++ A+EG T ++G LI LSEQ L+DCS GN
Sbjct: 124 ESVDWREKGAITPVKDQGQCGSCWAFSSTGALEGQTFRKTGKLISLSEQNLIDCSGKYGN 183
Query: 197 NGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQA 256
GC GG ++AF YI N+GI TE+ YPY+A C + A + ++PSG+E
Sbjct: 184 EGCNGGLMDQAFQYIKDNKGIDTENTYPYEAEDDVCRYNPRNRGAVDRGFVDIPSGEEDK 243
Query: 257 LLKAV-SMQPVSIAIAAYSTEFQSYKEGI-FNGVCGT-QLDHAVTIVGFGTTEDGANYWL 313
L AV ++ PVS+AI A FQ Y +G+ + C + LDH V +VG+G +++G +YWL
Sbjct: 244 LKAAVATVGPVSVAIDASHESFQFYSKGVYYEPSCDSDDLDHGVLVVGYG-SDNGKDYWL 302
Query: 314 IKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
+KNSW WGD GY+KI R+ + CG+ T +SYPL
Sbjct: 303 VKNSWSEHWGDEGYIKIARNRKNHCGVATAASYPL 337
>gi|402770511|gb|AFQ98390.1| cathepsin L [Rhipicephalus microplus]
gi|402770513|gb|AFQ98391.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 256 bits (655), Expect = 9e-66, Method: Compositional matrix adjust.
Identities = 138/335 (41%), Positives = 204/335 (60%), Gaps = 12/335 (3%)
Query: 21 IITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEK 80
++ L V CA V+ ++ ++ + E + H ++Y+ +E+ +R KIF EN I K
Sbjct: 1 MLRLSVLCAIVAVTVAASSQEILRTQWEAFKTTHKKTYQSHMEELLRFKIFTENSLIIAK 60
Query: 81 ANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
N + G +YKLG NQF DL EF ++ G+ + ++ SS N++ + +P
Sbjct: 61 HNAKYAKGLVSYKLGMNQFGDLLAHEFARIFNGHH---GTRKTGGSSFLPPANVNDSSLP 117
Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GN 196
+DWR KGAVTP+K+Q +CG CWAF+A ++EG +++G L+ LSEQ L+DCS + GN
Sbjct: 118 KVVDWRKKGAVTPVKDQGQCGSCWAFSATGSLEGQHFLKNGELVSLSEQNLVDCSQSFGN 177
Query: 197 NGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQA 256
NGC GG E AF YI N GI TE YPY+AV G C ++ A + Y E+ +G E
Sbjct: 178 NGCEGGLMEDAFKYIKANDGIDTEKSYPYEAVDGECRFKKEDVGATDTGYVEIKAGSEVD 237
Query: 257 LLKAV-SMQPVSIAIAAYSTEFQSYKEGIFNGV-CGTQ-LDHAVTIVGFGTTEDGANYWL 313
L KAV ++ P+S+AI A + FQ Y EG+++ C ++ LDH V +VG+G + G YWL
Sbjct: 238 LKKAVATVGPISVAIDASHSSFQLYSEGVYDEPECSSEDLDHGVLVVGYG-VKGGKKYWL 296
Query: 314 IKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
+KNSW +WGD GY+ + RD CGI +++SYPL
Sbjct: 297 VKNSWAESWGDQGYILMSRDNNNQCGIASQASYPL 331
>gi|224116884|ref|XP_002317418.1| predicted protein [Populus trichocarpa]
gi|222860483|gb|EEE98030.1| predicted protein [Populus trichocarpa]
Length = 503
Score = 256 bits (655), Expect = 9e-66, Method: Compositional matrix adjust.
Identities = 138/333 (41%), Positives = 203/333 (60%), Gaps = 18/333 (5%)
Query: 24 LLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI-EKAN 82
++V+ S++VS E+S++EI ++W +H + Y+ E E R + FK NL+YI EKA
Sbjct: 32 IVVNDFSELVS-----EESIIEIFQQWRDRHQKVYEHAAESEKRYRNFKRNLKYIIEKAG 86
Query: 83 KE-GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLD 141
K+ + +G N+F+DL+N+EF+ LY + + +T+ ++ +NL D P+SLD
Sbjct: 87 KKTAALGHSVGLNKFADLSNEEFKELYLSKVKKPINIKRSTARDWRQRNLQTCDAPSSLD 146
Query: 142 WRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLG 201
WR KG VT +K+Q +CG CW+F+ A+EGI I +G+LI LSEQ+L+DC T N GC G
Sbjct: 147 WRKKGVVTAVKDQGDCGSCWSFSTTGAIEGINAIVTGDLISLSEQELVDCDTT-NYGCEG 205
Query: 202 GSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKP-AAAKISNYEEVPSGDEQALLKA 260
G + AF ++I N GI TE YPY V GTC+ ++ I Y +V D ALL A
Sbjct: 206 GYMDYAFEWVINNGGIDTEANYPYTGVDGTCNTTKEEIKVVSIDGYTDVDETD-SALLCA 264
Query: 261 VSMQPVSIAIAAYSTEFQSYKEGIFNGVCG---TQLDHAVTIVGFGTTEDGANYWLIKNS 317
QP+S+ + + +FQ Y GI++G C +DHAV IVG+G +E+G +YW++KNS
Sbjct: 265 TVQQPISVGMDGSALDFQLYTGGIYDGDCSDDPNDIDHAVLIVGYG-SENGEDYWIVKNS 323
Query: 318 WGNTWGDAGYMKIVRDE----GLCGIGTRSSYP 346
WG WG GY I R+ G+C I +SYP
Sbjct: 324 WGTEWGMEGYFYIKRNTDLPYGVCAINAEASYP 356
>gi|391338876|ref|XP_003743781.1| PREDICTED: cathepsin L-like isoform 4 [Metaseiulus occidentalis]
Length = 336
Score = 256 bits (654), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 141/338 (41%), Positives = 208/338 (61%), Gaps = 19/338 (5%)
Query: 19 FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
F+I+ +LV AS + T EQ + + H + Y+ + R KIF +N I
Sbjct: 8 FLILAVLVGAASAAL----TLEQLFDAEWQNFKVHHNKKYEGSTVEAFRKKIFLQNTHLI 63
Query: 79 EKAN---KEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTF-KYQNLSMT 134
+ N +G TYKL NQF D+ + EF + G S+R+ ST+ + +++S+
Sbjct: 64 ARHNIKHAKGETTYKLKMNQFGDMLHHEFVSTMNGLLR---SNRTYFGSTWIEPESVSL- 119
Query: 135 DVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN 194
P S+DWR+KGAVTP+KNQ CG CW+F+ A+EG ++G L+ LSEQ L+DCST+
Sbjct: 120 --PKSVDWREKGAVTPVKNQGHCGSCWSFSTTGALEGQLFRKTGELVSLSEQNLIDCSTS 177
Query: 195 -GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGD 253
GNNGC GG + AF YI +N GI TE+ YPY+ G C ++ +A + + + ++PSG+
Sbjct: 178 YGNNGCGGGLMDNAFTYIKENHGIDTEESYPYEGKQGKCRYHKEDSAGRDTGFVDIPSGN 237
Query: 254 EQALLKAV-SMQPVSIAIAAYSTEFQSYKEGIFNGV-CGTQ-LDHAVTIVGFGTTEDGAN 310
E+AL KA+ ++ PVS+AI A FQ Y EG++N C + LDH V VG+GTT+DG +
Sbjct: 238 ERALAKALATIGPVSVAIDASHESFQFYHEGVYNPPDCDSHSLDHGVLAVGYGTTDDGQD 297
Query: 311 YWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
Y++IKNSWG WG GY+ + R+ + CG+ T++SYPL
Sbjct: 298 YYIIKNSWGERWGQEGYVLMARNSKNECGVATQASYPL 335
>gi|413953665|gb|AFW86314.1| hypothetical protein ZEAMMB73_546353 [Zea mays]
Length = 233
Score = 256 bits (654), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 115/226 (50%), Positives = 155/226 (68%), Gaps = 6/226 (2%)
Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
F+Y+N+S +PT++DWR KGAVTPIK+Q +CGCCWAF+AVAA EGI KI +G L+ L+E
Sbjct: 7 FRYENVSADALPTTIDWRTKGAVTPIKDQGQCGCCWAFSAVAATEGIVKISTGKLVSLAE 66
Query: 186 QQLLDCSTNG-NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKIS 244
Q+L+DC + + GC GG + AF +II+N G+ TE YPY A G C + +AA I
Sbjct: 67 QELVDCDVHDEDQGCEGGLMDDAFKFIIKNGGLTTESSYPYTAADGKCKSGSN-SAATIK 125
Query: 245 NYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGT 304
YE+VP+ DE AL+KAV+ QPVS+A+ FQ Y G+ G CGT LDH + +G+G
Sbjct: 126 GYEDVPANDEAALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYGK 185
Query: 305 TEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
T DG YWL+KNSWG TWG+ GY+++ +D G+CG+ SYP
Sbjct: 186 TSDGTKYWLMKNSWGTTWGENGYLRMEKDISDKRGMCGLAMEPSYP 231
>gi|330805277|ref|XP_003290611.1| hypothetical protein DICPUDRAFT_81345 [Dictyostelium purpureum]
gi|325079250|gb|EGC32859.1| hypothetical protein DICPUDRAFT_81345 [Dictyostelium purpureum]
Length = 330
Score = 256 bits (654), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 137/333 (41%), Positives = 197/333 (59%), Gaps = 21/333 (6%)
Query: 22 ITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKA 81
+ V AS V S T++ S + WM +H RSY E + + FK+N+++I
Sbjct: 12 FSFNVCFASNSVYSAQTYQTSFL----GWMKKHDRSYHHH-EFNNKYQAFKDNMDFIHNW 66
Query: 82 NKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDV--PTS 139
N N LG QF+DLTN+E+R +Y G K+ + N +M P S
Sbjct: 67 NTNKNSKTVLGLTQFADLTNEEYRKIYLGTKVNVAPEK---------HNFNMIHFTGPDS 117
Query: 140 LDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNG 198
+DWR KGAV+ +K+Q +CG CW+F+ +VEG +I++GN++ LSEQ L+DCS GNNG
Sbjct: 118 IDWRTKGAVSHVKDQGQCGSCWSFSTTGSVEGAHQIKTGNMVTLSEQNLVDCSGKFGNNG 177
Query: 199 CLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALL 258
C GG AF +I+ G+ATED YPY AV G C + A IS Y+E+ G E L
Sbjct: 178 CDGGLMVNAFKFIMSQGGVATEDSYPYNAVQGKCKFTKSMVGANISGYKEITQGSELELQ 237
Query: 259 KAVSMQPVSIAIAAYSTEFQSYKEGIFNGV-CGT-QLDHAVTIVGFGTTEDGANYWLIKN 316
A++ QPVSIAI A FQ YK G+++ C + QLDH V VG+G TE+G +Y+++KN
Sbjct: 238 AALTKQPVSIAIDASQQSFQLYKSGVYDEPECSSYQLDHGVLAVGYG-TENGKDYYIVKN 296
Query: 317 SWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPLA 348
SW ++WG GY+ + R+ + CG+ T +SYP++
Sbjct: 297 SWADSWGQDGYIFMSRNAKNQCGVATMASYPIS 329
>gi|125564712|gb|EAZ10092.1| hypothetical protein OsI_32402 [Oryza sativa Indica Group]
Length = 382
Score = 256 bits (654), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 135/336 (40%), Positives = 191/336 (56%), Gaps = 31/336 (9%)
Query: 42 SVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTN 101
+++E+ ++W A++ RSY E+ RL+++ N+ YIE N Y+LG ++DLTN
Sbjct: 47 TMMEMFQRWKAEYNRSYATPEEERRRLRVYARNVRYIEATNAAAGLAYELGETAYTDLTN 106
Query: 102 DEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDV----------------PTSLDWRDK 145
DEF A+YT + S + ++T V P S+DWR
Sbjct: 107 DEFMAMYTAPPLRSAADDDDDAATTTIITTRAGPVDEHQQPEVYFNESAGAPASVDWRAS 166
Query: 146 GAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSRE 205
GAVT +K+Q CG CWAF+ VA VEGI KI+ G L+ LSEQ+L+DC T ++GC GG
Sbjct: 167 GAVTEVKDQGRCGSCWAFSTVAVVEGIQKIKKGKLVSLSEQELVDCDTL-DSGCDGGVSY 225
Query: 206 KAFAYIIQNQGIATEDEYPYQA-VPGTCSAAQ-KPAAAKISNYEEVPSGDEQALLKAVSM 263
+A +I N GI T D+YPY C A+ AA I+ V + E +L A +
Sbjct: 226 RALEWITANGGITTRDDYPYTGAAAAACDRAKLGHHAATIAGLRRVATRSEASLQNAAAA 285
Query: 264 QPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTE-------DGANYWLIKN 316
QPV+++I A FQ Y++G+++G CGT+L+H VT+VG+G E G YW+IKN
Sbjct: 286 QPVAVSIEAGGDNFQHYRKGVYDGPCGTRLNHGVTVVGYGQEEAPVDGSAAGDKYWIIKN 345
Query: 317 SWGNTWGDAGYMKIVRD-----EGLCGIGTRSSYPL 347
SWG WGD GY+K+ +D EGLCGI R S+PL
Sbjct: 346 SWGKNWGDQGYIKMKKDVAGKPEGLCGIAIRPSFPL 381
>gi|330805273|ref|XP_003290609.1| hypothetical protein DICPUDRAFT_92519 [Dictyostelium purpureum]
gi|325079248|gb|EGC32857.1| hypothetical protein DICPUDRAFT_92519 [Dictyostelium purpureum]
Length = 333
Score = 256 bits (654), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 137/320 (42%), Positives = 196/320 (61%), Gaps = 14/320 (4%)
Query: 33 VSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLG 92
V S T++ S + WM +H R+Y E E R + FKEN+++I K N + + T LG
Sbjct: 23 VFSSQTYQTSFI----GWMRKHDRAYSHE-EFTDRYQAFKENMDFIHKWNSQESDTV-LG 76
Query: 93 TNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIK 152
+F+DLTN+E++ Y G K+ + + K+ + P S+DWR+KGAV+ +K
Sbjct: 77 LTKFADLTNEEYKKHYLGIKVNVKKNLNAAQKGLKFFKFTG---PDSIDWREKGAVSQVK 133
Query: 153 NQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYI 211
+Q +CG CW+F+ AVEG +I+SGN++ LSEQ L+DCS GN GC GG AF YI
Sbjct: 134 DQGQCGSCWSFSTTGAVEGAHQIKSGNMVSLSEQNLVDCSGQYGNQGCEGGLMVNAFEYI 193
Query: 212 IQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIA 271
I N GIATE YPY A G C + A I Y+E+P G+E +L A++ QPVS+AI
Sbjct: 194 IDNGGIATESSYPYTAAQGRCKFTKSMNGANIIGYKEIPQGEEDSLTAALAKQPVSVAID 253
Query: 272 AYSTEFQSYKEGIFN-GVCGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMK 329
A FQ Y G+++ C ++ LDH V VG+GT E G +Y++IKNSWG TWG GY+
Sbjct: 254 ASHMSFQLYSSGVYDEPACSSEALDHGVLAVGYGTLE-GKDYYIIKNSWGPTWGQDGYIF 312
Query: 330 IVRD-EGLCGIGTRSSYPLA 348
+ R+ + CG+ T +SYP++
Sbjct: 313 MSRNAQNQCGVATMASYPIS 332
>gi|30023547|gb|AAO48766.2| cathepsin L-like cysteine proteinase [Tenebrio molitor]
Length = 337
Score = 256 bits (654), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 140/341 (41%), Positives = 195/341 (57%), Gaps = 19/341 (5%)
Query: 19 FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMA---QHGRSYKDELEKEMRLKIFKENL 75
F+I + SQ VS + E+W A H + Y+ E E+ R+KIF EN
Sbjct: 3 FLIFLAICVAGSQAVSFFDL-------VQEQWGAFKMTHNKQYQSETEERFRMKIFMENS 55
Query: 76 EYIEKANK---EGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSS-TFKYQNL 131
+ K NK +G ++KLG N+++D+ + EF + G+ RS S + +
Sbjct: 56 HTVAKHNKLYAQGLVSFKLGINKYADMLHHEFVQVLNGFNRTKSGLRSGESDDSVTFLPP 115
Query: 132 SMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDC 191
+ +P +DWRDKGAVTP+K+Q +CG CW+F+A ++EG +SG L+ LSEQ L+DC
Sbjct: 116 ANVQLPGQIDWRDKGAVTPVKDQGQCGSCWSFSATGSLEGQHFRQSGKLVSLSEQNLVDC 175
Query: 192 STN-GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVP 250
S GNNGC GG + AF YI N GI TE YPY+A C K A Y ++
Sbjct: 176 SEKFGNNGCNGGLMDNAFRYIKANGGIDTEQAYPYKAEDEKCHYKPKNKGATDRGYVDIE 235
Query: 251 SGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGIF--NGVCGTQLDHAVTIVGFGTTED 307
SG+E L AV ++ PVS+AI A FQ Y G++ +QLDH V +VG+GT +D
Sbjct: 236 SGNEDKLQSAVATVGPVSVAIDASHQSFQLYSGGVYYEPDCSASQLDHGVLVVGYGTEDD 295
Query: 308 GANYWLIKNSWGNTWGDAGYMKIVRDE-GLCGIGTRSSYPL 347
G +YWL+KNSWG +WGD GY+K+ R+ CGI T +SYPL
Sbjct: 296 GTDYWLVKNSWGKSWGDQGYIKMARNRNNNCGIATEASYPL 336
>gi|449683741|ref|XP_002155462.2| PREDICTED: cathepsin L-like [Hydra magnipapillata]
Length = 324
Score = 256 bits (654), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 138/329 (41%), Positives = 201/329 (61%), Gaps = 18/329 (5%)
Query: 24 LLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANK 83
LL+ + R T + S + +W H ++Y + E+ +R I+K+N I + N
Sbjct: 8 LLLGVTLAYIIERPTEDDSWI----RWKMAHNKAYSHDGEETVRYTIWKDNERRIREHNL 63
Query: 84 EGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWR 143
+G + L NQF D+TN+EF+ + GY SH+ + STF N + P S+DWR
Sbjct: 64 QGG-DFLLEMNQFGDMTNNEFKD-FNGY----LSHKHVSGSTFLTPNSFV--APDSVDWR 115
Query: 144 DKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGG 202
++G VTP+K+Q +CG CWAF+ ++EG ++G L+ LSEQ L+DCST GNNGC GG
Sbjct: 116 NEGYVTPVKDQGQCGSCWAFSTTGSLEGQNFKKTGKLVSLSEQNLVDCSTAYGNNGCNGG 175
Query: 203 SREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV- 261
+ AF YI +N GI +E YPY A G C+ + AA + + ++PSGDE L +AV
Sbjct: 176 LMDNAFTYIKENNGIDSEASYPYTAKDGKCAFTKPNVAATDTGFVDIPSGDENKLKEAVA 235
Query: 262 SMQPVSIAIAAYSTEFQSYKEGIFN--GVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWG 319
S+ P+S+AI A FQ Y++G++N T+LDH V +VG+G TE G +YWL+KNSW
Sbjct: 236 SVGPISVAIDASHFSFQFYRKGVYNERKCSSTELDHGVLVVGYG-TESGKDYWLVKNSWN 294
Query: 320 NTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
+WGD GY+K+ R+ + CGI T +SYPL
Sbjct: 295 TSWGDKGYIKMSRNAKNQCGIATNASYPL 323
>gi|33112583|gb|AAP94047.1| cathepsin-L-like cysteine peptidase 03 [Tenebrio molitor]
Length = 337
Score = 256 bits (654), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 139/341 (40%), Positives = 196/341 (57%), Gaps = 19/341 (5%)
Query: 19 FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMA---QHGRSYKDELEKEMRLKIFKENL 75
F+I + SQ VS + E+W A H + Y+ + E+ R+KIF EN
Sbjct: 3 FLIFLAICVAGSQAVSFFDL-------VQEQWGAFKMTHNKQYQSDTEERFRMKIFMENS 55
Query: 76 EYIEKANK---EGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSS-TFKYQNL 131
+ K NK +G ++KLG N+++D+ + EF + G+ RS S + +
Sbjct: 56 HTVAKHNKLYAQGLVSFKLGINKYADMLHHEFVQVLNGFNRTKSGLRSGESDDSVTFLPP 115
Query: 132 SMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDC 191
+ +P +DWRDKGAVTP+K+Q +CG CW+F+A ++EG +SG L+ LSEQ L+DC
Sbjct: 116 ANVQLPGQIDWRDKGAVTPVKDQGQCGSCWSFSATGSLEGQHFRKSGKLVSLSEQNLVDC 175
Query: 192 STN-GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVP 250
S GNNGC GG + AF YI N GI TE YPY+A C K A Y ++
Sbjct: 176 SEKFGNNGCNGGLMDNAFRYIKANGGIDTEQAYPYKAEDEKCHYKPKNKGATDRGYVDIE 235
Query: 251 SGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGIF--NGVCGTQLDHAVTIVGFGTTED 307
SG+E L AV ++ PVS+AI A FQ Y G++ +QLDH V +VG+GT +D
Sbjct: 236 SGNEDKLQSAVATVGPVSVAIDASHQSFQLYSGGVYYEPDCSASQLDHGVLVVGYGTEDD 295
Query: 308 GANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
G +YWL+KNSWG +WGD GY+K+ R+ + CGI T +SYPL
Sbjct: 296 GTDYWLVKNSWGKSWGDQGYIKMARNRDNNCGIATEASYPL 336
>gi|391338870|ref|XP_003743778.1| PREDICTED: cathepsin L-like isoform 1 [Metaseiulus occidentalis]
gi|391338872|ref|XP_003743779.1| PREDICTED: cathepsin L-like isoform 2 [Metaseiulus occidentalis]
gi|391338874|ref|XP_003743780.1| PREDICTED: cathepsin L-like isoform 3 [Metaseiulus occidentalis]
Length = 331
Score = 256 bits (653), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 141/338 (41%), Positives = 208/338 (61%), Gaps = 19/338 (5%)
Query: 19 FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
F+I+ +LV AS + T EQ + + H + Y+ + R KIF +N I
Sbjct: 3 FLILAVLVGAASAAL----TLEQLFDAEWQNFKVHHNKKYEGSTVEAFRKKIFLQNTHLI 58
Query: 79 EKAN---KEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTF-KYQNLSMT 134
+ N +G TYKL NQF D+ + EF + G S+R+ ST+ + +++S+
Sbjct: 59 ARHNIKHAKGETTYKLKMNQFGDMLHHEFVSTMNGLLR---SNRTYFGSTWIEPESVSL- 114
Query: 135 DVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN 194
P S+DWR+KGAVTP+KNQ CG CW+F+ A+EG ++G L+ LSEQ L+DCST+
Sbjct: 115 --PKSVDWREKGAVTPVKNQGHCGSCWSFSTTGALEGQLFRKTGELVSLSEQNLIDCSTS 172
Query: 195 -GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGD 253
GNNGC GG + AF YI +N GI TE+ YPY+ G C ++ +A + + + ++PSG+
Sbjct: 173 YGNNGCGGGLMDNAFTYIKENHGIDTEESYPYEGKQGKCRYHKEDSAGRDTGFVDIPSGN 232
Query: 254 EQALLKAV-SMQPVSIAIAAYSTEFQSYKEGIFNGV-CGTQ-LDHAVTIVGFGTTEDGAN 310
E+AL KA+ ++ PVS+AI A FQ Y EG++N C + LDH V VG+GTT+DG +
Sbjct: 233 ERALAKALATIGPVSVAIDASHESFQFYHEGVYNPPDCDSHSLDHGVLAVGYGTTDDGQD 292
Query: 311 YWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
Y++IKNSWG WG GY+ + R+ + CG+ T++SYPL
Sbjct: 293 YYIIKNSWGERWGQEGYVLMARNSKNECGVATQASYPL 330
>gi|297733654|emb|CBI14901.3| unnamed protein product [Vitis vinifera]
Length = 273
Score = 256 bits (653), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 127/259 (49%), Positives = 164/259 (63%), Gaps = 14/259 (5%)
Query: 99 LTNDEFRALYTGYKMPSPSHR-----STTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKN 153
+TN EFR+ Y G K+ HR + +F Y+ + VP S+DWR KGAVTPIK+
Sbjct: 1 MTNHEFRSTYAGSKVNH--HRMFRGSQHAAGSFMYEKVK--SVPPSVDWRKKGAVTPIKD 56
Query: 154 QKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQ 213
Q +CG CWAF+ V AVEGI I++ L+ LSEQ+L+DC T+ N GC GG AF +I +
Sbjct: 57 QGQCGSCWAFSTVVAVEGINHIKTNKLVSLSEQELVDCDTSENQGCNGGLMGYAFEFIKE 116
Query: 214 NQGIATEDEYPYQAVPGTCSAAQ-KPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAA 272
GI TE YPY A GTC ++ I +E VP +E ALLKA + QP+S+AI A
Sbjct: 117 KGGITTEQSYPYTAEDGTCDVSKVNSPVVSIDGHETVPPNNEDALLKAAANQPISVAIDA 176
Query: 273 YSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR 332
+ FQ Y EG+F G CGT LDH V IVG+GTT DG YW++KNSWG WG+ GY+++ R
Sbjct: 177 GGSAFQFYSEGVFAGRCGTDLDHGVAIVGYGTTLDGTKYWIVKNSWGTDWGENGYIRMKR 236
Query: 333 ----DEGLCGIGTRSSYPL 347
EGLCGI +SYP+
Sbjct: 237 GISAKEGLCGIAVEASYPI 255
>gi|324983200|gb|ADY68475.1| stem bromelain [Ananas comosus]
Length = 291
Score = 256 bits (653), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 128/287 (44%), Positives = 183/287 (63%), Gaps = 6/287 (2%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
+F+ + L V AS +SR +++ E+WMA++GR YKD EK R +IFK N+ +
Sbjct: 8 VFLFLFLCVMWASPSAASRDEPSDPMMKRFEEWMAEYGRVYKDNDEKMRRFQIFKNNVNH 67
Query: 78 IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTG-YKMPSPSHRSTTSSTFKYQNLSMTDV 136
IE N +Y LG N+F+D+TN+EF A YTG P + S + +++++ V
Sbjct: 68 IETFNNRNGNSYTLGINKFTDMTNNEFVAQYTGGISRPLNIEKEPVVS---FDDVNISAV 124
Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGN 196
S+DWRD GAVT +K+Q CG CWAF+A+A VEGI KI +G L+ LSEQ++LDC+ +
Sbjct: 125 GQSIDWRDYGAVTEVKDQNPCGSCWAFSAIATVEGIYKIVTGYLVSLSEQEVLDCAVS-- 182
Query: 197 NGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQA 256
NGC GG + A+ +II N G+A+E +YPYQA G C+A P +A I+ Y V S DE +
Sbjct: 183 NGCDGGFVDNAYDFIISNNGVASEADYPYQAYQGDCAANSWPNSAYITGYSYVRSNDESS 242
Query: 257 LLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFG 303
+ AV QP++ AI A FQ Y G+F+G CGT L+HA+TI+G+G
Sbjct: 243 MKYAVWNQPIAAAIDASGDNFQYYNGGVFSGPCGTSLNHAITIIGYG 289
>gi|326430490|gb|EGD76060.1| cysteine proteinase [Salpingoeca sp. ATCC 50818]
Length = 448
Score = 255 bits (652), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 134/313 (42%), Positives = 187/313 (59%), Gaps = 23/313 (7%)
Query: 46 IHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNR---TYKLGTNQFSDLTND 102
+ + + + + Y+ E+ R +F +N+++I + N E R T+ + NQF+DLTN+
Sbjct: 29 LFDAFKTKFNKVYESAEEEARRFSVFSQNIDFINRHNAEAARGVHTHTVDVNQFADLTNE 88
Query: 103 EFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPT--SLDWRDKGAVTPIKNQKECGCC 160
E+R LY P P T + + D P S+DWR KGAVTPIKNQ +CG C
Sbjct: 89 EYRQLYL---RPYP-----TELLGRERQEVWLDGPNAGSVDWRQKGAVTPIKNQGQCGSC 140
Query: 161 WAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIAT 219
W+F+ +VEG I +GNL+ LSEQQL+DCS + GN GC GG + AF YII N G+ T
Sbjct: 141 WSFSTTGSVEGAHAIATGNLVSLSEQQLVDCSGSFGNQGCNGGLMDNAFKYIISNGGLDT 200
Query: 220 EDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQ 278
E +YPY A G C +++ A IS Y++VP +E L AV PVS+AI A FQ
Sbjct: 201 EQDYPYTARDGVCDKSKESKHAVSISGYKDVPQNNEDQLAAAVEKGPVSVAIEADQQSFQ 260
Query: 279 SYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR---DEG 335
Y G+F+G CGT LDH V +VG+ + +YW++KNSWG +WGD GY+ + R G
Sbjct: 261 MYSSGVFSGPCGTNLDHGVLVVGYTS-----DYWIVKNSWGASWGDQGYIMMKRGVSSAG 315
Query: 336 LCGIGTRSSYPLA 348
+CGI + SYP+A
Sbjct: 316 ICGIAMQPSYPIA 328
>gi|33348834|gb|AAQ16117.1| cathepsin L-like cysteine proteinase A [Rhipicephalus
haemaphysaloides haemaphysaloides]
Length = 332
Score = 255 bits (652), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 137/336 (40%), Positives = 204/336 (60%), Gaps = 14/336 (4%)
Query: 21 IITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEK 80
++ L + CA V+ + + + E + H +SY+ +E+ +R KIF EN I K
Sbjct: 1 MLRLSLLCAIVAVTVAANSHEILRTQWEAFKTTHKKSYESHMEELLRFKIFTENSLIIAK 60
Query: 81 ANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTF-KYQNLSMTDV 136
N + G +YKLG NQF DL EF ++ GY+ R++ STF N++ + +
Sbjct: 61 HNAKYAKGLVSYKLGMNQFGDLLAHEFAKIFNGYR----GQRTSRGSTFMPPANVNDSSL 116
Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-G 195
P+++DWR KGAVTP+K+Q +CG CWAF+A ++EG ++ G L+ LSEQ L+DCS + G
Sbjct: 117 PSTVDWRKKGAVTPVKDQGQCGSCWAFSATGSLEGQHFLKDGELVSLSEQNLVDCSQSFG 176
Query: 196 NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQ 255
NNGC GG + AF YI N GI E+ YPY+A+ C ++ A + + ++ G E
Sbjct: 177 NNGCEGGLMDNAFKYIKANDGIDAEESYPYEAMDDKCRFKKEDVGATDTGFVDIEGGSED 236
Query: 256 ALLKAV-SMQPVSIAIAAYSTEFQSYKEGIFNGV-CGT-QLDHAVTIVGFGTTEDGANYW 312
L KAV ++ P+S+AI A + FQ Y EG+++ C + +LDH V VG+G +DG YW
Sbjct: 237 DLKKAVATVGPISVAIDAGHSSFQLYSEGVYDEPECSSEELDHGVLAVGYG-VKDGKKYW 295
Query: 313 LIKNSWGNTWGDAGYMKIVRDE-GLCGIGTRSSYPL 347
L+KNSWG +WGD GY+ + RD+ CGI + +SYPL
Sbjct: 296 LVKNSWGGSWGDNGYILMSRDKNNQCGIASAASYPL 331
>gi|33112581|gb|AAP94046.1| cathepsin-L-like cysteine peptidase 02 [Tenebrio molitor]
Length = 337
Score = 255 bits (651), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 140/341 (41%), Positives = 197/341 (57%), Gaps = 19/341 (5%)
Query: 19 FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMA---QHGRSYKDELEKEMRLKIFKENL 75
F+I + SQ VS + E+W A H + Y+ + E+ R+KIF EN
Sbjct: 3 FLIFLAICVAGSQAVSFFDL-------VQEQWGAFKMTHNKQYQSDTEERFRMKIFMENS 55
Query: 76 EYIEKANK---EGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSS-TFKYQNL 131
+ K NK +G ++KLG N+++D+ + EF + G+ RS S + +
Sbjct: 56 HTVAKHNKLYAQGLVSFKLGINKYADMLHHEFVQVLNGFNRTKSGLRSGESDDSVTFLPP 115
Query: 132 SMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDC 191
+ +P +DWRDKGAVTP+K+Q +CG CW+F+A ++EG +SG L+ LSEQ L+DC
Sbjct: 116 ANVQLPGQIDWRDKGAVTPVKDQGQCGSCWSFSATGSLEGQHFRKSGKLVSLSEQNLVDC 175
Query: 192 STN-GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVP 250
S GNNGC GG + AF YI N GI TE YPY+A C K A Y ++
Sbjct: 176 SEKFGNNGCNGGLMDNAFRYIKANGGIDTEQAYPYKAEDEKCHYKPKNKGATDRGYVDIE 235
Query: 251 SGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGI-FNGVCG-TQLDHAVTIVGFGTTED 307
SG+E L AV ++ PVS+AI A FQ Y G+ + C +QLDH V +VG+GT +D
Sbjct: 236 SGNEDKLQSAVATVGPVSVAIDASHQSFQLYSGGVYYEPECSPSQLDHGVLVVGYGTEDD 295
Query: 308 GANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
G +YWL+KNSWG +WGD GY+K+ R+ + CGI T +SYPL
Sbjct: 296 GTDYWLVKNSWGKSWGDQGYIKMARNRDNNCGIATEASYPL 336
>gi|348687948|gb|EGZ27762.1| papain-like cysteine protease C1 [Phytophthora sojae]
Length = 533
Score = 255 bits (651), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 134/305 (43%), Positives = 185/305 (60%), Gaps = 11/305 (3%)
Query: 50 WMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRT-YKLGTNQFSDLTNDEFRALY 108
WM+ HG ++ D LE RL+ + N YI + N E T KLG N FS ++ DEF+
Sbjct: 31 WMSAHGVTFSDALEFARRLENYIANDMYILEHNAENAWTGVKLGHNAFSHMSFDEFKFKM 90
Query: 109 TGYKMPSPSHRSTTSSTFKYQNL-SMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVA 167
TG +P +S + L S +VP+++DW DKG VTP+KNQ CG CWAF+
Sbjct: 91 TGLVLPEGYLEQRLAS--RVDGLWSDVEVPSAVDWVDKGGVTPVKNQGMCGSCWAFSTTG 148
Query: 168 AVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQA 227
AVEG T + SG L+ LSEQ+L+DC NG+ GC GG + AF +I + GI +ED+Y Y+A
Sbjct: 149 AVEGATFVSSGKLLSLSEQELVDCDHNGDMGCNGGLMDHAFQWIEDHGGICSEDDYEYKA 208
Query: 228 VPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNG 287
C + K++ +++V DE AL AV+ QPVS+AI A FQ YK G+FN
Sbjct: 209 KAQVCRKCD--SVVKVTGFQDVNPQDEHALKVAVAQQPVSVAIEADQKAFQFYKSGVFNL 266
Query: 288 VCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE----GLCGIGTRS 343
CGT+LDH V VG+G ++G +W +KNSWG +WG+ GY+++ R+E G CGI +
Sbjct: 267 TCGTRLDHGVLAVGYG-NDNGQKFWKVKNSWGASWGEQGYIRLAREENGPAGQCGIASVP 325
Query: 344 SYPLA 348
SYP A
Sbjct: 326 SYPFA 330
>gi|442754503|gb|JAA69411.1| Putative cathepsin l-like cysteine proteinase b [Ixodes ricinus]
Length = 335
Score = 255 bits (651), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 131/304 (43%), Positives = 189/304 (62%), Gaps = 9/304 (2%)
Query: 52 AQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEFRALY 108
A+HG+SY E E+ RLKI+ EN I K N++ G Y + N+F D+ + EF +
Sbjct: 32 AKHGKSYVSETEEVFRLKIYMENRHKIAKHNEKYARGEVPYSMAMNEFGDMLHHEFVSTR 91
Query: 109 TGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAA 168
G+K S+ + +N+ +P ++DWR KGAVTP+KNQ +CG CWAF+A +
Sbjct: 92 NGFKRNYKDQPREGSTYLEPENIEDFSLPKTVDWRTKGAVTPVKNQGQCGSCWAFSATGS 151
Query: 169 VEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQA 227
+EG +SG+++ LSEQ L+DCST+ GNNGC GG + AF YI N+GI TE YPY
Sbjct: 152 LEGQHFRKSGSMVSLSEQNLVDCSTDFGNNGCEGGLMDNAFKYIRANKGIDTEKSYPYNG 211
Query: 228 VPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGIFN 286
GTC + A S + ++ G E L KAV ++ P+S+AI A FQ Y +G+++
Sbjct: 212 TDGTCHFKKSTVGATDSGFVDIKEGSETQLKKAVATVGPISVAIDASHESFQFYSDGVYD 271
Query: 287 GV-CGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRS 343
C ++ LDH V +VG+GT +G +YWL+KNSWG TWGD GY+++ R+ + CGI + +
Sbjct: 272 EPECDSESLDHGVLVVGYGTL-NGTDYWLVKNSWGTTWGDEGYIRMSRNKKNQCGIASSA 330
Query: 344 SYPL 347
SYPL
Sbjct: 331 SYPL 334
>gi|260516654|gb|ACX43954.1| cysteine protease 1 [Brachiaria hybrid cultivar]
gi|260516656|gb|ACX43955.1| cysteine protease 1 [Brachiaria hybrid cultivar]
gi|260516658|gb|ACX43956.1| cysteine protease 1 [Brachiaria hybrid cultivar]
gi|260516660|gb|ACX43957.1| cysteine protease 1 [Brachiaria hybrid cultivar]
gi|260516662|gb|ACX43958.1| cysteine protease 2 [Brachiaria hybrid cultivar]
gi|260516664|gb|ACX43959.1| cysteine protease 2 [Brachiaria hybrid cultivar]
gi|260516666|gb|ACX43960.1| cysteine protease 2 [Brachiaria hybrid cultivar]
gi|260516668|gb|ACX43961.1| cysteine protease 2 [Brachiaria hybrid cultivar]
gi|260516670|gb|ACX43962.1| cysteine protease 2 [Brachiaria hybrid cultivar]
Length = 338
Score = 255 bits (651), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 137/320 (42%), Positives = 192/320 (60%), Gaps = 11/320 (3%)
Query: 30 SQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTY 89
S + S E + ++ +M Q+ ++Y E R FK N+E I N N +Y
Sbjct: 25 SALFSEEVPSEVMLQDMFTAFMKQYSKAYS-HAEFSSRFNQFKANVETIRLHNTLANASY 83
Query: 90 KLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVT 149
+G N+F+DL+ +EF+ Y GYK R S +Q + PTS+DWR AVT
Sbjct: 84 TMGLNEFADLSFEEFKGKYFGYKHV---EREFARSNNLHQEVEA--APTSIDWRTSNAVT 138
Query: 150 PIKNQKECGCCWAFAAVAAVEGITKIRSGN-LIQLSEQQLLDCSTN-GNNGCLGGSREKA 207
PIK+Q +CG CWAF+A ++EG ++ + L LSEQQL+DCST+ GN GC GG + A
Sbjct: 139 PIKDQGQCGSCWAFSATGSIEGAWVLQGKHTLTSLSEQQLVDCSTSYGNAGCNGGLMDYA 198
Query: 208 FAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPV 266
F YII N+GI E YPY+ V G C + IS Y++V SGDE +LL AV ++ PV
Sbjct: 199 FEYIIANKGICAESAYPYKGVGGLCQKSCTKVVT-ISGYKDVASGDEASLLNAVGTVGPV 257
Query: 267 SIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAG 326
S+AI A FQ Y G+F+G CG LDH V VG+GTT +YW++KNSWG +WG++G
Sbjct: 258 SVAIEADQAGFQFYSSGVFSGTCGHNLDHGVLAVGYGTT-GSQDYWIVKNSWGTSWGESG 316
Query: 327 YMKIVRDEGLCGIGTRSSYP 346
Y++++R++ CGI + SYP
Sbjct: 317 YIRMIRNKNQCGIAIQPSYP 336
>gi|357167707|ref|XP_003581294.1| PREDICTED: actinidain-like [Brachypodium distachyon]
Length = 358
Score = 254 bits (650), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 134/317 (42%), Positives = 183/317 (57%), Gaps = 22/317 (6%)
Query: 47 HEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRA 106
HE+WMA+ GRSY D EK R ++F N +++ N+ GNRTY LG NQFSDLT+ EF
Sbjct: 42 HERWMARFGRSYTDAGEKARRQEVFGANARHVDAVNRAGNRTYTLGLNQFSDLTDHEFLQ 101
Query: 107 LYTGYK-------MPSPSHRSTTSST-FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECG 158
+ GY + P +T Y D+P S+DWR KGAVT IKNQ+ CG
Sbjct: 102 QHLGYGRHHGQRGLLLPEEEVMPKATALGYGQ----DMPYSVDWRAKGAVTEIKNQRSCG 157
Query: 159 CCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIA 218
CWAFAAVAA EG+ KI +GNLI +SEQQ+LDC T + C G A Y++ + G+
Sbjct: 158 SCWAFAAVAATEGLVKIATGNLISMSEQQVLDC-TGDRSSCDSGYISDALRYVVTSGGLQ 216
Query: 219 TEDEYPYQAVPGTCSA---AQKPAAAKISN-YEEVPSGDEQALLKAVSMQPVSIAIAAYS 274
E Y Y G C + A+ +AA + + +GDE AL + QPV++ + A
Sbjct: 217 REAAYAYTGQKGACGSRRFARPNSAASVGGVHMATLNGDEGALQGLAARQPVAVIVEASE 276
Query: 275 TEFQSYKEGIFNG--VCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR 332
+F+ Y G++ G CG +L+HA+T+VG+GT YWL+KN WG WG+ GYM++ R
Sbjct: 277 PDFRHYSSGVYAGSASCGRELNHALTVVGYGTENGAGEYWLVKNQWGTWWGENGYMRVAR 336
Query: 333 DEGL---CGIGTRSSYP 346
G CGI + + YP
Sbjct: 337 RNGAGANCGIASVAFYP 353
>gi|22661|emb|CAA49504.1| papaya proteinase omega [Carica papaya]
Length = 367
Score = 254 bits (650), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 136/315 (43%), Positives = 195/315 (61%), Gaps = 12/315 (3%)
Query: 38 THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
T + ++++ WM H + Y++ EK R +IFK+NL YI++ NK+ N +Y+LG N+F+
Sbjct: 39 TSTERLIQLFNSWMLNHNKFYENVDEKLYRFEIFKDNLNYIDETNKK-NNSYRLGLNEFA 97
Query: 98 DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKEC 157
DL+NDEF Y G + + +S ++ N + ++P ++DWR KGAVTP+++Q C
Sbjct: 98 DLSNDEFNEKYVGSLIDATIEQSYDE---EFINEDIVNLPENVDWRKKGAVTPVRHQGSC 154
Query: 158 GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
G CWAF+AVA VEGI KIR+G L++LSEQ+L+DC ++GC GG A Y+ +N GI
Sbjct: 155 GSCWAFSAVATVEGINKIRTGKLVELSEQELVDCERR-SHGCKGGYPPYALEYVAKN-GI 212
Query: 218 ATEDEYPYQAVPGTCSAAQKPAA-AKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTE 276
+YPY+A GTC A Q K S V +E LL A++ QPVS+ + +
Sbjct: 213 HLRSKYPYKAKQGTCRAKQVGGPIVKTSGVGRVQPNNEGNLLNAIAKQPVSVVVESKGRP 272
Query: 277 FQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR---- 332
FQ YK GIF G CGT++DHAVT VG+G + LIKNSWG WG+ GY++I R
Sbjct: 273 FQLYKGGIFEGPCGTKVDHAVTAVGYGKSGGKGYI-LIKNSWGTAWGEKGYIRIKRAPGN 331
Query: 333 DEGLCGIGTRSSYPL 347
G+CG+ S YP+
Sbjct: 332 SPGVCGLYKSSYYPI 346
>gi|189525868|ref|XP_001341714.2| PREDICTED: cathepsin L1-like isoform 1 [Danio rerio]
Length = 336
Score = 254 bits (650), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 136/340 (40%), Positives = 206/340 (60%), Gaps = 17/340 (5%)
Query: 20 IIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIE 79
++ LLV+ + V + S+ + + + W +QHG+SY +++E R+ I++ENL IE
Sbjct: 1 MMFALLVTLSISAVFAASSIDIQLDDHWNSWKSQHGKSYHEDVEVGRRM-IWEENLRKIE 59
Query: 80 KANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDV 136
+ N E GN T+K+G NQF D+TN+EFR GYK + TS + S
Sbjct: 60 QHNFEYSYGNHTFKMGMNQFGDMTNEEFRQAMNGYKHDP----NQTSQGPLFMEPSFFAA 115
Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-TNG 195
P +DWR +G VTP+K+QK+CG CW+F++ A+EG ++G LI +SEQ L+DCS G
Sbjct: 116 PQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQG 175
Query: 196 NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPG-TCSAAQKPAAAKISNYEEVPSGDE 254
N GC GG ++AF Y+ +N+G+ +E YPY A C + AKI+ + ++PSG+E
Sbjct: 176 NQGCNGGLMDQAFQYVKENKGLDSEQSYPYLARDDLPCRYDPRFNVAKITGFVDIPSGNE 235
Query: 255 QALLKAV-SMQPVSIAIAAYSTEFQSYKEGIF--NGVCGTQLDHAVTIVGF---GTTEDG 308
AL+ AV ++ PVS+AI A Q Y+ GI+ ++LDHAV +VG+ G G
Sbjct: 236 LALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACSSSRLDHAVLVVGYGYQGADVAG 295
Query: 309 ANYWLIKNSWGNTWGDAGYMKIVRDE-GLCGIGTRSSYPL 347
YW++KNSW + WGD GY+ + +D+ CG+ T++SYPL
Sbjct: 296 NRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGVATKASYPL 335
>gi|5081735|gb|AAD39513.1|AF147207_1 cathepsin L-like protease precursor [Artemia franciscana]
Length = 338
Score = 254 bits (650), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 138/335 (41%), Positives = 204/335 (60%), Gaps = 11/335 (3%)
Query: 21 IITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEK 80
+I LL + Q+ ++ S E H + A H + Y +LE++ R+KI+ EN + K
Sbjct: 6 LIFLLGAVLVQLSAALSLTNLLADEWH-LFKATHKKEYPSQLEEKFRMKIYLENKHKVAK 64
Query: 81 AN---KEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
N ++G ++Y++ N+F DL + EFR++ GY+ + S STF + + +VP
Sbjct: 65 HNILYEKGEKSYQVAMNKFGDLLHHEFRSIMNGYQHKK-QNSSRAESTFTFMEPANVEVP 123
Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GN 196
S+DWR KGA+TP+K+Q +CG CWAF++ A+EG T ++G LI LSEQ L+DCS GN
Sbjct: 124 ESVDWRVKGAITPVKDQGQCGSCWAFSSTGALEGQTFRKTGKLISLSEQNLIDCSGKYGN 183
Query: 197 NGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQA 256
GC GG ++AF YI N+GI TE+ YPY+A C + A + +PSG+E
Sbjct: 184 EGCNGGLMDQAFQYIKDNKGIDTENTYPYEAEDNVCRYNPRNRGAIDRGFVHIPSGEEDK 243
Query: 257 LLKAV-SMQPVSIAIAAYSTEFQSYKEGI-FNGVCGT-QLDHAVTIVGFGTTEDGANYWL 313
L AV ++ PVS+AI A FQ Y +G+ + C + LDH V +VG+G +++G +YWL
Sbjct: 244 LKAAVATVGPVSVAIDASHESFQFYSKGVYYEPSCDSDDLDHGVLVVGYG-SDNGKDYWL 302
Query: 314 IKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
+KNSW WGD GY+KI R+ + CGI T +SYPL
Sbjct: 303 VKNSWSEHWGDEGYIKIARNRKNHCGIATAASYPL 337
>gi|156739275|ref|NP_001096585.1| cathepsin L1-like precursor [Danio rerio]
gi|156230123|gb|AAI52285.1| MGC174857 protein [Danio rerio]
Length = 335
Score = 254 bits (650), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 142/341 (41%), Positives = 207/341 (60%), Gaps = 20/341 (5%)
Query: 20 IIITLLVS-CASQVVSSRSTHEQSVVEIH-EKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
++ LLV+ C S V ++ S Q ++ H W +QHG+SY ++LE R+ I++ENL
Sbjct: 1 MMFALLVTLCISAVFTAPSIDIQ--LDDHWNSWKSQHGKSYHEDLEVGRRM-IWEENLRK 57
Query: 78 IEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT 134
IE+ N E GN T+K+G NQF D+TN+EFR GYK + TS + S
Sbjct: 58 IEQHNFEYSYGNHTFKMGMNQFGDMTNEEFRQAMNGYKHDP----NRTSQGPLFMEPSFF 113
Query: 135 DVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-T 193
P +DWR +G VTP+K+QK+CG CW+F++ A+EG ++G LI +SEQ L+DCS
Sbjct: 114 AAPQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRP 173
Query: 194 NGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPG-TCSAAQKPAAAKISNYEEVPSG 252
GN GC GG ++AF Y+ +N+G+ +E YPY A C + AKI+ + ++P G
Sbjct: 174 QGNQGCNGGIMDQAFQYVKENKGLDSEQSYPYLARDDLPCRYDPRFNVAKITGFVDIPRG 233
Query: 253 DEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGI-FNGVCGTQLDHAVTIVGF---GTTED 307
+E AL+ AV ++ PVS+AI A Q Y+ GI + C ++LDHAV +VG+ G
Sbjct: 234 NELALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACTSRLDHAVLVVGYGYQGADVA 293
Query: 308 GANYWLIKNSWGNTWGDAGYMKIVRDE-GLCGIGTRSSYPL 347
G YW++KNSW + WGD GY+ + +D+ CGI T +SYPL
Sbjct: 294 GNRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGIATMASYPL 334
>gi|242093944|ref|XP_002437462.1| hypothetical protein SORBIDRAFT_10g027570 [Sorghum bicolor]
gi|241915685|gb|EER88829.1| hypothetical protein SORBIDRAFT_10g027570 [Sorghum bicolor]
Length = 366
Score = 254 bits (649), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 135/329 (41%), Positives = 193/329 (58%), Gaps = 26/329 (7%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
E+S+ ++E+W A + + +D EK R +FKEN I + N +GN TY LG N+FSD+
Sbjct: 41 EESLWALYERWCAHYNMA-RDHGEKTRRFDLFKENARRIYEHNHQGNATYTLGLNRFSDM 99
Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD----------------VPTSLDWR 143
T++EF G + +P + + D P ++DWR
Sbjct: 100 TDEEFNRSPYGGCLTAPRMSDDEIEELHHHHHQQEDDGSFNLTHGSGGGKLGAPPAVDWR 159
Query: 144 DKGAVTPIKNQ-KECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGG 202
+ AVT +K+Q CG CWAF+A+AAVEGI IR+ NL+ LSEQQL+DC N+GC GG
Sbjct: 160 GR-AVTRVKDQGPTCGSCWAFSAIAAVEGINAIRTRNLVPLSEQQLVDCDKL-NHGCNGG 217
Query: 203 SREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVS 262
AF+++++N+G+ E YPY G C P I Y+ VP D AL+ AV+
Sbjct: 218 LMTTAFSFVVRNRGVVPEGAYPYMGREGRCKHVMAPPVT-IYGYQRVPRFDANALMNAVA 276
Query: 263 MQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTW 322
QPVS+AI A S EF+ Y+ G+FNG CG +L HA T VG+G + G +W++KNSWG W
Sbjct: 277 AQPVSVAIEASSFEFRHYQGGVFNGNCGGRLGHAATAVGYG-ADAGGPFWIVKNSWGPGW 335
Query: 323 GDAGYMKIVRD----EGLCGIGTRSSYPL 347
G+ GY++I R+ +G+CGI T +SYP+
Sbjct: 336 GEGGYVRISRNTPVRQGVCGILTENSYPV 364
>gi|156739281|ref|NP_001096588.1| cathepsin L1-like precursor [Danio rerio]
gi|166158351|ref|NP_001107526.1| uncharacterized protein LOC100135391 precursor [Xenopus (Silurana)
tropicalis]
gi|326672305|ref|XP_003199634.1| PREDICTED: cathepsin L1-like [Danio rerio]
gi|156230096|gb|AAI52237.1| MGC174155 protein [Danio rerio]
gi|163916362|gb|AAI57707.1| LOC100135391 protein [Xenopus (Silurana) tropicalis]
Length = 335
Score = 254 bits (649), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 137/339 (40%), Positives = 204/339 (60%), Gaps = 16/339 (4%)
Query: 20 IIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIE 79
++ LLV+ V + S+ + + + W +QHG+SY +++E R+ I++ENL IE
Sbjct: 1 MMFALLVTLCISAVFAASSIDIQLDDHWNSWKSQHGKSYHEDVEVGRRM-IWEENLRKIE 59
Query: 80 KANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDV 136
+ N E GN T+K+G NQF D+TN+EFR GYK + TS + S
Sbjct: 60 QHNFEYSYGNHTFKMGMNQFGDMTNEEFRQAMNGYKHDP----NRTSQGPLFMEPSFFAA 115
Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-TNG 195
P +DWR +G VTP+K+QK+CG CW+F++ A+EG ++G LI +SEQ L+DCS G
Sbjct: 116 PQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQG 175
Query: 196 NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPG-TCSAAQKPAAAKISNYEEVPSGDE 254
N GC GG ++AF Y+ +N+G+ +E YPY A C + AKI+ + ++P G+E
Sbjct: 176 NQGCNGGIMDQAFQYVKENKGLDSEQSYPYLARDDLPCRYDPRFNVAKITGFVDIPRGNE 235
Query: 255 QALLKAV-SMQPVSIAIAAYSTEFQSYKEGI-FNGVCGTQLDHAVTIVGF---GTTEDGA 309
AL+ AV ++ PVS+AI A Q Y+ GI + C ++LDHAV +VG+ G G
Sbjct: 236 LALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACTSRLDHAVLVVGYGYQGADVAGN 295
Query: 310 NYWLIKNSWGNTWGDAGYMKIVRDE-GLCGIGTRSSYPL 347
YW++KNSW + WGD GY+ + +D+ CGI T +SYPL
Sbjct: 296 RYWIVKNSWSDKWGDKGYIYMAKDKNNHCGIATMASYPL 334
>gi|1483570|emb|CAA68066.1| cathepsin l [Litopenaeus vannamei]
Length = 328
Score = 254 bits (649), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 133/313 (42%), Positives = 188/313 (60%), Gaps = 16/313 (5%)
Query: 46 IHEKWM---AQHGRSYKDELEKEMRLKIFKENLEYIEKAN---KEGNRTYKLGTNQFSDL 99
+ ++W A+HGR Y E+ RL +F++N ++I+ N + G T+ L NQF D+
Sbjct: 20 LRQQWRDFKAEHGRRYASVQEERYRLSVFEQNQQFIDDHNARFENGEVTFTLQMNQFGDM 79
Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGC 159
T++EF A G+ + PS R T +P +DWR KGAVTP+K+QK+CG
Sbjct: 80 TSEEFTATMNGF-LNVPSRRPTAILRADPDET----LPKEVDWRTKGAVTPVKDQKQCGS 134
Query: 160 CWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIA 218
CWAF+ ++EG ++ G L+ LSEQ L+DCS GN GC+GG ++AF YI N+GI
Sbjct: 135 CWAFSTTGSLEGQHFLKDGKLVSLSEQNLVDCSDKFGNMGCMGGLMDQAFRYIKANKGID 194
Query: 219 TEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEF 277
TED YPY+A G C A + Y +V G E AL KAV ++ P+S+AI A F
Sbjct: 195 TEDSYPYEAQDGKCRFDASNVGATDTGYVDVEHGSESALKKAVATIGPISVAIDASQPSF 254
Query: 278 QSYKEGIF--NGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-E 334
Q Y +G++ G T LDH V VG+G TE G YWL+KNSW +WG+ GY+++ RD +
Sbjct: 255 QFYHDGVYYEEGCSSTMLDHGVLAVGYGETEKGEAYWLVKNSWNTSWGNKGYIQMSRDKK 314
Query: 335 GLCGIGTRSSYPL 347
CGI +++SYPL
Sbjct: 315 NNCGIASQASYPL 327
>gi|326672302|ref|XP_003199633.1| PREDICTED: cathepsin L1-like [Danio rerio]
gi|157423549|gb|AAI53506.1| Im:6910535 [Danio rerio]
Length = 335
Score = 254 bits (649), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 141/341 (41%), Positives = 207/341 (60%), Gaps = 20/341 (5%)
Query: 20 IIITLLVS-CASQVVSSRSTHEQSVVEIH-EKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
++ LLV+ C S V ++ S Q ++ H W +QHG+SY +++E R+ I++ENL
Sbjct: 1 MMFALLVTLCISAVFTAPSIDIQ--LDDHWNSWKSQHGKSYHEDVEVGRRM-IWEENLRK 57
Query: 78 IEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT 134
IE+ N E GN T+K+G NQF D+TN+EFR GYK + TS + S
Sbjct: 58 IEQHNFEYSYGNHTFKMGMNQFGDMTNEEFRQAMNGYKQDP----NRTSKGALFMEPSFF 113
Query: 135 DVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-T 193
P +DWR +G VTP+K+QK+CG CW+F++ A+EG ++G LI +SEQ L+DCS
Sbjct: 114 AAPQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRP 173
Query: 194 NGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPG-TCSAAQKPAAAKISNYEEVPSG 252
GN GC GG ++AF Y+ +N+G+ +E YPY A C + AKI+ + ++P G
Sbjct: 174 QGNQGCNGGIMDQAFQYVKENKGLDSEQSYPYLARDDLPCRYDPRFNVAKITGFVDIPKG 233
Query: 253 DEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGI-FNGVCGTQLDHAVTIVGF---GTTED 307
+E AL+ AV ++ PVS+AI A Q Y+ GI + C ++LDHAV +VG+ G
Sbjct: 234 NELALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACTSRLDHAVLVVGYGYQGADVA 293
Query: 308 GANYWLIKNSWGNTWGDAGYMKIVRDE-GLCGIGTRSSYPL 347
G YW++KNSW + WGD GY+ + +D+ CGI T +SYPL
Sbjct: 294 GNRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGIATMASYPL 334
>gi|356545112|ref|XP_003540989.1| PREDICTED: LOW QUALITY PROTEIN: KDEL-tailed cysteine endopeptidase
CEP1-like [Glycine max]
Length = 400
Score = 254 bits (648), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 130/305 (42%), Positives = 189/305 (61%), Gaps = 4/305 (1%)
Query: 26 VSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEG 85
V+C Q S+S E E HEKWMAQ+G+ Y+D E E R +IFK N+++IE N G
Sbjct: 95 VTCGRQC-RSKSRLEACTSERHEKWMAQYGKVYEDAAEMEKRFQIFKNNVQFIESFNVAG 153
Query: 86 NRTYKLGTNQFSDLTNDEFRALY-TGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRD 144
++ + + NQF DL ++EF+AL G + S +T ++F+Y ++ +T++P ++D R
Sbjct: 154 DKPFNIRINQFPDLHDEEFKALLINGQRKVSGVETATEETSFRYGSV-VTNIPATMDGRK 212
Query: 145 KGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSR 204
KG VTPIK+Q G CWA +AVAA+EGI +I + L+ LS+Q+L+D + GC+GG
Sbjct: 213 KGVVTPIKDQGIIGSCWALSAVAAIEGIHQITTSKLMFLSKQKLVDSVKGESEGCIGGYV 272
Query: 205 EKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQ 264
E AF +I++ GI +E YPY+ V + + A I YE+VPS +++ALLK V+ Q
Sbjct: 273 EDAFEFIVKKGGILSETHYPYKGVNXCKVEKETHSVAHIKGYEKVPSNNKKALLKVVANQ 332
Query: 265 PVSIAIAAYSTEFQSYKEGIFNGV-CGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWG 323
PVS+ I + F+ Y IFN CG+ +H V +VG+G DGA YW +KNSWG WG
Sbjct: 333 PVSVYIDVGAHAFKYYSSEIFNARNCGSDPNHVVAVVGYGKALDGAKYWPVKNSWGTEWG 392
Query: 324 DAGYM 328
YM
Sbjct: 393 GKWYM 397
>gi|325303202|tpg|DAA34687.1| TPA_inf: cathepsin L-like cysteine proteinase B [Amblyomma
variegatum]
Length = 337
Score = 254 bits (648), Expect = 6e-65, Method: Compositional matrix adjust.
Identities = 134/337 (39%), Positives = 203/337 (60%), Gaps = 12/337 (3%)
Query: 19 FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
F+++ L CA+ + ++ TH++ V + A HG+ Y+ E E+ RLKI+ EN I
Sbjct: 4 FVVLCFL--CAA-MTAAAITHQELVGAEWSAFKALHGKEYQSETEEYYRLKIYMENRMMI 60
Query: 79 EKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD 135
+ N++ +YKL N++ D+ + EF + G++ S S + + +
Sbjct: 61 ARHNEKYANNKVSYKLAMNEYGDMLHHEFVSTRNGFRRDYRSKPRQGSFYIEPEGIEDKH 120
Query: 136 VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN- 194
+P ++DWR KGAVTP+KNQ +CG CWAF+ ++EG +SG+++ LSEQ L+DCST
Sbjct: 121 LPKTVDWRKKGAVTPVKNQGQCGSCWAFSTTGSLEGQHFRKSGDMVSLSEQNLVDCSTAF 180
Query: 195 GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDE 254
GNNGC GG + AF YI N GI TE YPY GTC + A + + ++P G+E
Sbjct: 181 GNNGCEGGLMDNAFKYIKANGGIDTEKSYPYNGTDGTCHFKKSDVGATDTGFVDIPEGNE 240
Query: 255 QALLKAV-SMQPVSIAIAAYSTEFQSYKEGIFNGV-CGTQ-LDHAVTIVGFGTTEDGANY 311
L KAV ++ P+S+AI A FQ Y +G+++ C ++ LDH V +VG+GT +D +Y
Sbjct: 241 HLLKKAVATVGPISVAIDASHQSFQFYSQGVYDEPECSSENLDHGVLVVGYGTKDD-QDY 299
Query: 312 WLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
WL+KNSWG TWGD GY+ + R+ + CGI + +SYPL
Sbjct: 300 WLVKNSWGTTWGDGGYIYMTRNKDNQCGIASSASYPL 336
>gi|125606204|gb|EAZ45240.1| hypothetical protein OsJ_29883 [Oryza sativa Japonica Group]
Length = 350
Score = 254 bits (648), Expect = 6e-65, Method: Compositional matrix adjust.
Identities = 136/327 (41%), Positives = 192/327 (58%), Gaps = 26/327 (7%)
Query: 43 VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTND 102
+++ E+WM +HGR+Y D EK+ R ++++ N+E +E N N YKL N+F+DLTN+
Sbjct: 27 MLDRFEQWMIRHGRAYTDAGEKQRRFEVYRRNVELVETFNSMSN-GYKLADNKFADLTNE 85
Query: 103 EFRALYTGYK--MPSPSHRSTTSSTFKYQNLSMTDV-PTSLDWRDKGAVTPIKNQKEC-- 157
EFRA G++ + P +T S+ S D+ P S+DWR+KGAV I K C
Sbjct: 86 EFRAKMLGFRPHVTIPQISNTCSADIAMPGESSDDILPKSVDWRNKGAV--INRWKICVD 143
Query: 158 -GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQG 216
G CWAF+AVAA+EGI +I++G L+ LSEQ+L+DC GC GG AF +++ N G
Sbjct: 144 AGSCWAFSAVAAIEGINQIKNGELVSLSEQELVDCDDEAV-GCGGGYMSWAFEFVVGNHG 202
Query: 217 IATEDEYPYQAVPGTCSAAQ-KPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYST 275
+ TE YPY A G C AA+ +A I+ Y V E L +A + QPVS+A+ S
Sbjct: 203 LTTEASYPYHAANGACQAAKLNQSAVAIAGYRNVTPSSEPDLARAAAAQPVSVAVDGGSF 262
Query: 276 EFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGAN----------YWLIKNSWGNTWGDA 325
FQ Y G++ G C ++H VT+VG+G +E + YW++KNSWG WGDA
Sbjct: 263 MFQLYGSGVYTGPCTADVNHGVTVVGYGESEPKTDGGGAAKGGEKYWIVKNSWGAEWGDA 322
Query: 326 GYMKIVRD-----EGLCGIGTRSSYPL 347
GY+ + RD GLCGI SYP+
Sbjct: 323 GYILMQRDVAGLASGLCGIALLPSYPV 349
>gi|345320664|ref|XP_001521690.2| PREDICTED: cathepsin L1-like [Ornithorhynchus anatinus]
Length = 388
Score = 253 bits (647), Expect = 6e-65, Method: Compositional matrix adjust.
Identities = 150/355 (42%), Positives = 213/355 (60%), Gaps = 23/355 (6%)
Query: 10 SFKINTTP----MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKE 65
S +++ P M +++ LL C VS+ + + + E W H +SY + E+
Sbjct: 39 SLQVSPGPWGQAMKLLVCLLSLCWGLAVSA-PLGDSELDKHWELWKNWHQKSYH-KAEEG 96
Query: 66 MRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTT 122
R +++ENL+ IE N E G TY+LG NQF DLTN+EF+ + + S +R
Sbjct: 97 WRRMVWEENLKVIELHNLEQSLGLHTYQLGMNQFGDLTNEEFQQMLISERHFSEGNRING 156
Query: 123 SSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQ 182
S+ + ++ VPTS+DWRD G VTP+KNQ CG CWAF+ A+EG +SG L+
Sbjct: 157 SA---FLEVNYVQVPTSVDWRDHGYVTPVKNQGHCGSCWAFSTTGALEGQLFRKSGRLVS 213
Query: 183 LSEQQLLDCS-TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKP--A 239
LSEQ L+DCS GN GC GG + AF YI++N+GI +ED YPY A T A KP A
Sbjct: 214 LSEQNLVDCSWQQGNQGCNGGIVDFAFQYILENRGIDSEDCYPYTA-KDTAQCAFKPECA 272
Query: 240 AAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGIF-NGVCGTQ-LDHA 296
A+++ + ++P E+AL+KAV ++ PVS+AI A+ T F+ Y+ GIF C ++ L+HA
Sbjct: 273 TARVTGFVDIPPHSEEALMKAVATVGPVSVAIDAHPTSFRFYQSGIFYEPKCSSERLNHA 332
Query: 297 VTIVGF---GTTEDGANYWLIKNSWGNTWGDAGYMKIVRDEG-LCGIGTRSSYPL 347
V +VG+ G E G YW++KNSWG WGD GY + +D G CGI T +SYPL
Sbjct: 333 VLVVGYGYEGEDEAGKKYWIVKNSWGKQWGDHGYFYLSKDRGNHCGIATTASYPL 387
>gi|115464789|ref|NP_001055994.1| Os05g0508300 [Oryza sativa Japonica Group]
gi|48475189|gb|AAT44258.1| hypothetical protein [Oryza sativa Japonica Group]
gi|113579545|dbj|BAF17908.1| Os05g0508300 [Oryza sativa Japonica Group]
Length = 450
Score = 253 bits (647), Expect = 7e-65, Method: Compositional matrix adjust.
Identities = 129/303 (42%), Positives = 188/303 (62%), Gaps = 7/303 (2%)
Query: 48 EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRAL 107
E W A+HGRSY E+ RL F +N ++ A+ +Y L N F+DLT+DEFRA
Sbjct: 39 EAWCAEHGRSYATPGERAARLAAFADNAAFV-AAHNGAPASYALALNAFADLTHDEFRAA 97
Query: 108 YTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVA 167
G + + + + + VP ++DWR GAVT +K+Q CG CW+F+A
Sbjct: 98 RLGRLAAAGGPGRDGGAPYLGVDGGVGAVPDAVDWRQSGAVTKVKDQGSCGACWSFSATG 157
Query: 168 AVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQA 227
A+EGI KI++G+LI LSEQ+L+DC + N+GC GG + A+ ++++N GI TE +YPY+
Sbjct: 158 AMEGINKIKTGSLISLSEQELIDCDRSYNSGCGGGLMDYAYKFVVKNGGIDTEADYPYRE 217
Query: 228 VPGTCSAAQ-KPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFN 286
GTC+ + K I Y++VP+ +E LL+AV+ QPVS+ I + FQ Y +GIF+
Sbjct: 218 TDGTCNKNKLKRRVVTIDGYKDVPANNEDMLLQAVAQQPVSVGICGSARAFQLYSKGIFD 277
Query: 287 GVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTR 342
G C T LDHA+ IVG+G +E G +YW++KNSWG +WG GYM + R+ G+CGI
Sbjct: 278 GPCPTSLDHAILIVGYG-SEGGKDYWIVKNSWGESWGMKGYMYMHRNTGNSNGVCGINQM 336
Query: 343 SSY 345
S+
Sbjct: 337 PSF 339
>gi|151573016|gb|ABS17683.1| cathepsin L-1 [Artemia persimilis]
Length = 334
Score = 253 bits (647), Expect = 8e-65, Method: Compositional matrix adjust.
Identities = 135/335 (40%), Positives = 205/335 (61%), Gaps = 11/335 (3%)
Query: 21 IITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEK 80
+I LL + Q+ ++ S E H + A H + Y +LE++ R+KI+ EN + K
Sbjct: 2 LIFLLGAVFVQLSAALSLTNLLADEWH-LFKATHKKEYPSQLEEKFRMKIYLENKHKVAK 60
Query: 81 AN---KEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
N ++G ++Y++ N+F DL + EFR++ GY+ + S STF + + +VP
Sbjct: 61 HNILFEKGEKSYQVAMNKFGDLLHHEFRSIMNGYQHKK-QNSSRAESTFTFMEPANVEVP 119
Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GN 196
S+DWR+KGA+TP+K+Q +CG CWAF++ A+EG T ++G L+ L EQ L+DCS GN
Sbjct: 120 ESVDWREKGAITPVKDQGQCGPCWAFSSTGALEGQTFRKTGKLVSLREQNLIDCSGKYGN 179
Query: 197 NGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQA 256
GC GG ++AF YI N+GI TE+ YPY+A C + A + ++PSG+E
Sbjct: 180 EGCNGGLMDQAFQYIKDNKGIDTENTYPYEAEDDVCRYNPRNRGAVDRGFVDIPSGEEDK 239
Query: 257 LLKAV-SMQPVSIAIAAYSTEFQSYKEGI-FNGVCGT-QLDHAVTIVGFGTTEDGANYWL 313
L AV ++ PVS+AI A FQ Y +G+ + C + LDH V +VG+G +++G +YWL
Sbjct: 240 LKAAVATVGPVSVAIDASHESFQFYSKGVYYEPSCDSDDLDHGVLVVGYG-SDNGKDYWL 298
Query: 314 IKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
+KNSW WGD GY+KI R+ + CG+ T +SYPL
Sbjct: 299 VKNSWSEHWGDQGYIKIARNRKNHCGVATAASYPL 333
>gi|156739289|ref|NP_001096592.1| uncharacterized protein LOC569326 precursor [Danio rerio]
gi|156230119|gb|AAI52283.1| Im:6910535 protein [Danio rerio]
Length = 335
Score = 253 bits (647), Expect = 8e-65, Method: Compositional matrix adjust.
Identities = 141/342 (41%), Positives = 206/342 (60%), Gaps = 21/342 (6%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIH-EKWMAQHGRSYKDELEKEMRLKIFKENLE 76
MF ++ L C S V ++ S Q ++ H W +QHG+SY +++E R+ I++ENL
Sbjct: 2 MFALLITL--CISAVFTAPSIDIQ--LDDHWNSWKSQHGKSYHEDVEVGRRM-IWEENLR 56
Query: 77 YIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
IE+ N E GN T+K+G NQF D+TN+EFR GYK + TS + S
Sbjct: 57 KIEQHNFEYSLGNHTFKMGMNQFGDMTNEEFRQAMNGYKQDP----NRTSKGALFMEPSF 112
Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS- 192
P +DWR +G VTP+K+QK+CG CW+F++ A+EG ++G LI +SEQ L+DCS
Sbjct: 113 FAAPQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSR 172
Query: 193 TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPG-TCSAAQKPAAAKISNYEEVPS 251
GN GC GG ++AF Y+ +N+G+ +E YPY A C + AKI+ + ++P
Sbjct: 173 PQGNQGCNGGIMDQAFQYVKENKGLDSEQSYPYLARDDLPCRYDPRFNVAKITGFVDIPR 232
Query: 252 GDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGI-FNGVCGTQLDHAVTIVGF---GTTE 306
G+E AL+ AV ++ PVS+AI A Q Y+ GI + C ++LDHAV +VG+ G
Sbjct: 233 GNELALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACTSRLDHAVLVVGYGYQGADV 292
Query: 307 DGANYWLIKNSWGNTWGDAGYMKIVRDE-GLCGIGTRSSYPL 347
G YW++KNSW + WGD GY+ + +D+ CGI T +SYPL
Sbjct: 293 AGNRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGIATMASYPL 334
>gi|293342579|ref|XP_001065885.2| PREDICTED: cathepsin L1 [Rattus norvegicus]
gi|293354415|ref|XP_225137.5| PREDICTED: cathepsin L1 [Rattus norvegicus]
gi|149039747|gb|EDL93863.1| rCG24278 [Rattus norvegicus]
Length = 330
Score = 253 bits (647), Expect = 8e-65, Method: Compositional matrix adjust.
Identities = 130/338 (38%), Positives = 199/338 (58%), Gaps = 18/338 (5%)
Query: 16 TPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENL 75
TP+F++ TL + ++S+ TH+ S + E+W +HG++Y E + R +++ N+
Sbjct: 2 TPIFLLATLCLG----MISAAPTHDPSFDTVWEEWKTKHGKTYNTNEEGQKR-AVWENNM 56
Query: 76 EYIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
+ I N++ G + L N F DLTN EFR L TG++ P + F
Sbjct: 57 KMINLHNEDYLKGKHGFSLEMNAFGDLTNTEFRELMTGFQSMGPKETTIFREPF------ 110
Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
+ D+P SLDWR+ G VTP+KNQ +CG CWAF+AV ++EG ++G L+ LSEQ L+DCS
Sbjct: 111 LGDIPKSLDWREHGYVTPVKNQGQCGSCWAFSAVGSLEGQIFKKTGKLVSLSEQNLVDCS 170
Query: 193 -TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPS 251
+ GN GC GG E AF Y+ +N+G+ T + Y Y+A G C K +AA ++ + +VP
Sbjct: 171 WSYGNLGCNGGLMEFAFQYVKENRGLDTGESYAYEAQDGLCRYNPKYSAANVTGFVKVPL 230
Query: 252 GDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIF--NGVCGTQLDHAVTIVGFGTTEDGA 309
++ + S+ PVS+ I ++ F+ Y G++ T++DHAV +VG+G DG
Sbjct: 231 SEDDLMSAVASVGPVSVGIDSHHQSFRFYSGGMYYEPDCSSTEMDHAVLVVGYGEESDGG 290
Query: 310 NYWLIKNSWGNTWGDAGYMKIVRDE-GLCGIGTRSSYP 346
YWL+KNSWG WG GY+K+ +D+ CGI T + YP
Sbjct: 291 KYWLVKNSWGEDWGMDGYIKMAKDQNNNCGIATYAIYP 328
>gi|33348836|gb|AAQ16118.1| cathepsin L-like cysteine proteinase B [Rhipicephalus
haemaphysaloides haemaphysaloides]
Length = 335
Score = 253 bits (646), Expect = 8e-65, Method: Compositional matrix adjust.
Identities = 140/334 (41%), Positives = 200/334 (59%), Gaps = 15/334 (4%)
Query: 25 LVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE 84
+V C V ++ TH++ V + A HG+ Y + E+ RLKI+ EN I + N++
Sbjct: 5 IVLCCLFVTAAAITHQELVGAEWSAFKALHGKDYASDTEEYYRLKIYMENRLKIARHNEK 64
Query: 85 GNRT---YKLGTNQFSDLTNDEFRALYTGYKM---PSPSHRSTTSSTFKYQNLSMTDVPT 138
++ YKL N+F DL + EF + G+K SP S +++L + P
Sbjct: 65 YAKSQVSYKLAMNEFGDLLHHEFVSTRNGFKRNYRDSPREGSFFVEPEGFEDLQL---PK 121
Query: 139 SLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNN 197
++DWR KGAVTP+KNQ +CG CWAF+ ++EG ++ L+ LSEQ L+DCS + GNN
Sbjct: 122 TVDWRKKGAVTPVKNQGQCGSCWAFSTTGSLEGPHFRKTRKLVSLSEQNLVDCSRSFGNN 181
Query: 198 GCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQAL 257
GC GG + AF YI N+GI TE YPY A G C + A + + ++P GDE L
Sbjct: 182 GCEGGLMDNAFKYIKSNKGIDTEWSYPYNATDGVCHFNRSDVGATDTGFVDIPEGDENKL 241
Query: 258 LKAV-SMQPVSIAIAAYSTEFQSYKEGIFNGV-CGT-QLDHAVTIVGFGTTEDGANYWLI 314
KAV ++ PVS+AI A FQ Y EG+++ C + QLDH V +VG+G T+DG +YWL+
Sbjct: 242 KKAVAAVGPVSVAIDASHESFQFYSEGVYDEPECSSEQLDHGVLVVGYG-TKDGQDYWLV 300
Query: 315 KNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
KNSWG TWGD GY+ + R+ + CGI + +SYPL
Sbjct: 301 KNSWGTTWGDEGYIYMTRNKDNQCGIASSASYPL 334
>gi|189525870|ref|XP_001923796.1| PREDICTED: cathepsin L1 [Danio rerio]
Length = 335
Score = 253 bits (646), Expect = 9e-65, Method: Compositional matrix adjust.
Identities = 136/339 (40%), Positives = 203/339 (59%), Gaps = 16/339 (4%)
Query: 20 IIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIE 79
++ LLV+ V + + + + + W +QHG+SY +++E R+ I++ENL IE
Sbjct: 1 MMFALLVTLYISAVFAAPSIDIQLDDHWNSWKSQHGKSYHEDVEVGRRM-IWEENLRKIE 59
Query: 80 KANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDV 136
+ N E GN T+K+G NQF D+TN+EFR GYK + TS +
Sbjct: 60 QHNFEYSLGNHTFKMGMNQFGDMTNEEFRQAMNGYKHDP----NRTSQGPLFMEPKFFAA 115
Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-TNG 195
P +DWR +G VTP+K+QK+CG CW+F++ A+EG ++G LI +SEQ L+DCS +G
Sbjct: 116 PQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPHG 175
Query: 196 NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPG-TCSAAQKPAAAKISNYEEVPSGDE 254
N GC GG ++AF Y+ +N+G+ +E YPY A C + AKI+ + ++P G+E
Sbjct: 176 NQGCNGGLMDQAFQYVKENKGLDSEQSYPYLARDDLPCRYDPRFNVAKITGFVDIPKGNE 235
Query: 255 QALLKAV-SMQPVSIAIAAYSTEFQSYKEGI-FNGVCGTQLDHAVTIVGF---GTTEDGA 309
AL+ AV ++ PVS+AI A Q Y+ GI + C +QLDHAV +VG+ G G
Sbjct: 236 LALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACTSQLDHAVLVVGYGYQGADVAGN 295
Query: 310 NYWLIKNSWGNTWGDAGYMKIVRDE-GLCGIGTRSSYPL 347
YW++KNSW + WGD GY+ + +D+ CGI T +SYPL
Sbjct: 296 RYWIVKNSWSDKWGDKGYIYMAKDKNNHCGIATMASYPL 334
>gi|260516678|gb|ACX43965.1| cysteine protease 1 [Brachiaria hybrid cultivar]
Length = 338
Score = 253 bits (646), Expect = 9e-65, Method: Compositional matrix adjust.
Identities = 136/320 (42%), Positives = 192/320 (60%), Gaps = 11/320 (3%)
Query: 30 SQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTY 89
S + S E + ++ +M Q+ ++Y E R FK N+E I N N +Y
Sbjct: 25 SALFSEEVPSEVMLQDMFTAFMKQYSKAYS-HAEFSSRFNQFKANVETIRLHNTLANASY 83
Query: 90 KLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVT 149
+G N+F+DL+ +EF+ Y GYK R S +Q + PTS+DWR AVT
Sbjct: 84 TMGLNEFADLSFEEFKGKYFGYKHV---EREFARSNNLHQEVEA--APTSIDWRTSNAVT 138
Query: 150 PIKNQKECGCCWAFAAVAAVEGITKIRSGN-LIQLSEQQLLDCSTN-GNNGCLGGSREKA 207
PIK+Q +CG CWAF+A ++EG ++ + L LSEQQL+DCST+ G+ GC GG + A
Sbjct: 139 PIKDQGQCGSCWAFSATGSIEGAWVLQGKHTLTSLSEQQLVDCSTSYGDAGCNGGLMDYA 198
Query: 208 FAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPV 266
F YII N+GI E YPY+ V G C + IS Y++V SGDE +LL AV ++ PV
Sbjct: 199 FEYIIANKGICAESAYPYKGVGGLCQKSCTKVVT-ISGYKDVASGDEASLLNAVGTVGPV 257
Query: 267 SIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAG 326
S+AI A FQ Y G+F+G CG LDH V VG+GTT +YW++KNSWG +WG++G
Sbjct: 258 SVAIEADQAGFQFYSSGVFSGTCGHNLDHGVLAVGYGTT-GSQDYWIVKNSWGTSWGESG 316
Query: 327 YMKIVRDEGLCGIGTRSSYP 346
Y++++R++ CGI + SYP
Sbjct: 317 YIRMIRNKNQCGIAIQPSYP 336
>gi|226509942|ref|NP_001146834.1| cysteine protease precursor [Zea mays]
gi|159506725|gb|ABW97700.1| cysteine protease [Zea mays]
gi|414867308|tpg|DAA45865.1| TPA: cysteine protease [Zea mays]
Length = 352
Score = 253 bits (646), Expect = 9e-65, Method: Compositional matrix adjust.
Identities = 137/310 (44%), Positives = 187/310 (60%), Gaps = 22/310 (7%)
Query: 50 WMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYT 109
W A + RSY E++ R ++++ N+E+IE N+ GN TY LG NQF+DLT +EF LYT
Sbjct: 52 WQATYNRSYPTAEERQRRFQVYRRNIEHIEATNRAGNLTYTLGENQFADLTEEEFLDLYT 111
Query: 110 GYKMP----SPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQ-KECGCCWAFA 164
MP + R+ SS+ + D PTS+DWR KGAVTPIKNQ C CWAF
Sbjct: 112 MKGMPVRRDAGKKRANVSSS-----AAAVDAPTSVDWRSKGAVTPIKNQGPSCSSCWAFV 166
Query: 165 AVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYP 224
A +E ITKI +G L+ LSEQ+L+DC + GC G + ++IQN G+ TE YP
Sbjct: 167 TAATIESITKITTGKLVSLSEQELIDCDPY-DGGCNLGYFVNGYRWVIQNGGLTTEANYP 225
Query: 225 YQAVPGTCS---AAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYK 281
YQA CS AAQ AA IS+Y ++P+G+ Q L+ Q A Q Y
Sbjct: 226 YQARRYACSRSRAAQH--AATISDYVQLPAGEGQ--LQQAVAQQPVAAAIEMGGSLQFYS 281
Query: 282 EGIFNGVCGTQLDHAVTIVGFGT-TEDGANYWLIKNSWGNTWGDAGYMKIVRD---EGLC 337
G+F+G CGT+++HA+T+VG+G + G YWL+KNSWG +WG+ GY+++ RD GLC
Sbjct: 282 GGVFSGQCGTRMNHAITVVGYGADSSSGLKYWLVKNSWGQSWGERGYLRMRRDVGRGGLC 341
Query: 338 GIGTRSSYPL 347
GI +YP+
Sbjct: 342 GIALDLAYPV 351
>gi|237844793|ref|XP_002371694.1| cathepsin L-like thiolproteinase, putative [Toxoplasma gondii ME49]
gi|50313163|gb|AAT74529.1| toxopain-2 [Toxoplasma gondii]
gi|89242977|gb|ABD64744.1| cathepsin L [Toxoplasma gondii]
gi|95007485|emb|CAJ20707.1| toxopain-2 [Toxoplasma gondii RH]
gi|211969358|gb|EEB04554.1| cathepsin L-like thiolproteinase, putative [Toxoplasma gondii ME49]
gi|221480879|gb|EEE19300.1| cysteine protease, putative [Toxoplasma gondii GT1]
gi|221501596|gb|EEE27366.1| cysteine protease, putative [Toxoplasma gondii VEG]
Length = 422
Score = 253 bits (646), Expect = 9e-65, Method: Compositional matrix adjust.
Identities = 126/301 (41%), Positives = 176/301 (58%), Gaps = 6/301 (1%)
Query: 52 AQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGY 111
A + +SY E EK+ R IFK NL YI N++G +Y L N F DL+ DEFR Y G+
Sbjct: 122 AMYAKSYATEEEKQRRYAIFKNNLVYIHTHNQQGY-SYSLKMNHFGDLSRDEFRRKYLGF 180
Query: 112 KMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEG 171
K + N+ +++P +DWR +G VTP+K+Q++CG CWAF+ A+EG
Sbjct: 181 KKSRNLKSHHLGVATELLNVLPSELPAGVDWRSRGCVTPVKDQRDCGSCWAFSTTGALEG 240
Query: 172 ITKIRSGNLIQLSEQQLLDCS-TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPG 230
++G L+ LSEQ+L+DCS GN C GG AF Y++ + GI +ED YPY A
Sbjct: 241 AHCAKTGKLVSLSEQELMDCSRAEGNQSCSGGEMNDAFQYVLDSGGICSEDAYPYLARDE 300
Query: 231 TCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCG 290
C A KI +++VP E A+ A++ PVSIAI A FQ Y EG+F+ CG
Sbjct: 301 ECRAQSCEKVVKILGFKDVPRRSEAAMKAALAKSPVSIAIEADQMPFQFYHEGVFDASCG 360
Query: 291 TQLDHAVTIVGFGT-TEDGANYWLIKNSWGNTWGDAGYMKIVR---DEGLCGIGTRSSYP 346
T LDH V +VG+GT E ++W++KNSWG WG GYM + +EG CG+ +S+P
Sbjct: 361 TDLDHGVLLVGYGTDKESKKDFWIMKNSWGTGWGRDGYMYMAMHKGEEGQCGLLLDASFP 420
Query: 347 L 347
+
Sbjct: 421 V 421
>gi|255544115|ref|XP_002513120.1| cysteine protease, putative [Ricinus communis]
gi|223548131|gb|EEF49623.1| cysteine protease, putative [Ricinus communis]
Length = 362
Score = 253 bits (646), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 134/254 (52%), Positives = 176/254 (69%), Gaps = 12/254 (4%)
Query: 35 SRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTN 94
+R+ E S+ E HE+WMA + R YKD EK+MR KIFKEN++ I+ N E +++YKL N
Sbjct: 27 ARTLQEASMYERHEQWMASYARVYKDANEKQMRYKIFKENVQRIDSFNSESDKSYKLAVN 86
Query: 95 QFSDLTNDEFRALYTGYKMPSPSHR-STTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKN 153
QF+DLTN+EF++L G+K H S + F+Y+N+ T VP S+DWR KGAVT IK
Sbjct: 87 QFADLTNEEFKSLRNGFK----GHMCSAQAGHFRYENV--TAVPASIDWRKKGAVTQIKE 140
Query: 154 QKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFAYII 212
Q +CG CWAF+AVAAVEGIT+I++G LI LSEQ+L+DC TN + GC GG + AF +I
Sbjct: 141 QGQCGSCWAFSAVAAVEGITEIKTGKLISLSEQELVDCDTNSEDQGCQGGLMDDAFKFIE 200
Query: 213 QNQGIATEDEYPYQAVPGTCSAAQ--KPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAI 270
Q+ G+A+E YPY A TC + KP +AKI+ YE+VP+ DE AL AV+ QPVS+AI
Sbjct: 201 QH-GLASEATYPYDAADSTCKTKEEAKP-SAKITGYEDVPANDEAALKNAVANQPVSVAI 258
Query: 271 AAYSTEFQSYKEGI 284
A EFQ Y GI
Sbjct: 259 DAGGFEFQFYSSGI 272
>gi|125552927|gb|EAY98636.1| hypothetical protein OsI_20560 [Oryza sativa Indica Group]
Length = 449
Score = 253 bits (646), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 130/303 (42%), Positives = 187/303 (61%), Gaps = 8/303 (2%)
Query: 48 EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRAL 107
E W A+HGRSY E+ RL F +N ++ A+ +Y L N F+DLT+DEFRA
Sbjct: 39 EAWCAEHGRSYATPGERAARLAAFADNAAFV-AAHNGAPASYALALNAFADLTHDEFRAA 97
Query: 108 YTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVA 167
G + R + + VP ++DWR GAVT +K+Q CG CW+F+A
Sbjct: 98 RLGRLAAAGPGRDGGAPYLGVDG-GVGAVPDAVDWRQSGAVTKVKDQGSCGACWSFSATG 156
Query: 168 AVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQA 227
A+EGI KI++G+LI LSEQ+L+DC + N+GC GG + A+ ++++N GI TE +YPY+
Sbjct: 157 AMEGINKIKTGSLISLSEQELIDCDRSYNSGCGGGLMDYAYKFVVKNGGIDTEADYPYRE 216
Query: 228 VPGTCSAAQ-KPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFN 286
GTC+ + K I Y++VP+ +E LL+AV+ QPVS+ I + FQ Y +GIF+
Sbjct: 217 TDGTCNKNKLKRRVVTIDGYKDVPANNEDMLLQAVAQQPVSVGICGSARAFQLYSKGIFD 276
Query: 287 GVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTR 342
G C T LDHA+ IVG+G +E G +YW++KNSWG +WG GYM + R+ G+CGI
Sbjct: 277 GPCPTSLDHAILIVGYG-SEGGKDYWIVKNSWGESWGMKGYMYMHRNTGNSNGVCGINQM 335
Query: 343 SSY 345
S+
Sbjct: 336 PSF 338
>gi|225709022|gb|ACO10357.1| Cathepsin L precursor [Caligus rogercresseyi]
Length = 332
Score = 253 bits (645), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 135/310 (43%), Positives = 188/310 (60%), Gaps = 14/310 (4%)
Query: 48 EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEF 104
E W HG+SY+ +E+++RLKI EN I + N E G +Y + N + DL + EF
Sbjct: 28 ESWKLTHGKSYESSIEEKLRLKIHMENSLKISRHNAEAINGKHSYYMKMNHYGDLLHHEF 87
Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFA 164
A+ GY+ + + S S +N+ + PT +DWR+ GAVTP+KNQ +CG CWAF+
Sbjct: 88 VAMVNGYEYVNKT--SLGGSFIPSKNVKL---PTHVDWREDGAVTPVKNQGQCGSCWAFS 142
Query: 165 AVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEY 223
+ ++EG T ++G LI LSEQ L+DCS GNNGC GG + AF YI N+GI TE Y
Sbjct: 143 STGSLEGQTFRKTGKLIPLSEQNLVDCSRKYGNNGCEGGLMDFAFTYIRDNKGIDTEGSY 202
Query: 224 PYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKE 282
PY+ V G C + + +V G E+ LLKAV S+ PVS+AI A FQ Y
Sbjct: 203 PYEGVGGRCHYDPSKKGSSDIGFVDVKKGSEEELLKAVASVGPVSVAIDASHMSFQFYSH 262
Query: 283 GI-FNGVCGTQ-LDHAVTIVGFGTTED-GANYWLIKNSWGNTWGDAGYMKIVRD-EGLCG 338
G+ F C + LDH V +VG+GT E+ G +YWL+KNSW WGD GY+K+ R+ + +CG
Sbjct: 263 GVYFESKCSPENLDHGVLVVGYGTDENSGEDYWLVKNSWSENWGDQGYIKMARNKKNMCG 322
Query: 339 IGTRSSYPLA 348
I + +SYP+
Sbjct: 323 IASSASYPVV 332
>gi|27728675|gb|AAO18731.1| cysteine protease [Gossypium hirsutum]
Length = 389
Score = 253 bits (645), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 131/318 (41%), Positives = 195/318 (61%), Gaps = 15/318 (4%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI--EKANKEGNR-TYKLGTNQF 96
E+ V+EI ++W +H + Y+ E E R + FK NL+YI A ++ N+ + +G N+F
Sbjct: 42 EERVLEIFQQWKEKHRKVYRHAEEAEKRFENFKGNLKYILERNAKRKANKWEHHVGLNKF 101
Query: 97 SDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKE 156
+D++N+EFR Y K+ P ++ T S + + D P+SLDWR+ G VT +K+Q
Sbjct: 102 ADMSNEEFRKAYLS-KVKKPINKGITLSRNMRRKVQSCDAPSSLDWRNYGVVTAVKDQGS 160
Query: 157 CGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQG 216
CG CWAF++ A+EGI + +G+LI LSEQ+L++C T+ N GC GG + AF ++I N G
Sbjct: 161 CGSCWAFSSTGAMEGINALVTGDLISLSEQELVECDTS-NYGCEGGYMDYAFEWVINNGG 219
Query: 217 IATEDEYPYQAVPGTC-SAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYST 275
I +E +YPY V GTC + ++ I Y++V D ALL AV+ QPVS+ I +
Sbjct: 220 IDSESDYPYTGVDGTCNTTKEETKVVSIDGYQDVEQSD-SALLCAVAQQPVSVGIDGSAI 278
Query: 276 EFQSYKEGIFNGVCG---TQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR 332
+FQ Y GI++G C +DHAV IVG+G +ED YW++KNSWG +WG GY + R
Sbjct: 279 DFQLYTGGIYDGSCSDDPDDIDHAVLIVGYG-SEDSEEYWIVKNSWGTSWGIDGYFYLKR 337
Query: 333 DE----GLCGIGTRSSYP 346
D G+C + +SYP
Sbjct: 338 DTDLPYGVCAVNAMASYP 355
>gi|222636309|gb|EEE66441.1| hypothetical protein OsJ_22818 [Oryza sativa Japonica Group]
Length = 318
Score = 253 bits (645), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 136/343 (39%), Positives = 193/343 (56%), Gaps = 46/343 (13%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
+ ++ L + A ++SR+ ++ H+KWMA+HGR+YKD EK R ++FK N++
Sbjct: 6 LLVVAGGLSTMAKVTMASRAGTMEAR---HDKWMAEHGRTYKDAAEKARRFRVFKANVDL 62
Query: 78 IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD-- 135
I+++N GN+ Y+L TN+F+DLT+ EF A+YTGY + + + ++T LS D
Sbjct: 63 IDRSNAAGNKRYRLATNRFTDLTDAEFAAMYTGYNPANTMYAAANATT----RLSSEDDQ 118
Query: 136 VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG 195
P +DWR +GAVT +KNQ+ CGCCWAF+ VAAVEGI +I +G L+ L+
Sbjct: 119 QPAEVDWRQQGAVTGVKNQRSCGCCWAFSTVAAVEGIHQITTGELVSLTWPTAAASP--- 175
Query: 196 NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTC----SAAQKPAAAKISNYEEVPS 251
Y YQ G C S++ AA IS Y+ V
Sbjct: 176 -----------------------PRRAYAYQGAQGACQFDASSSASGVAATISGYQRVNP 212
Query: 252 GDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNG-VCGTQLDHAVTIVGFGTTEDGA- 309
DE +L AV+ QPVS+AI F+ Y G+F CGT+LDHAV +VG+G DG+
Sbjct: 213 NDEGSLAAAVASQPVSVAIEGSGAMFRHYGSGVFTADSCGTKLDHAVAVVGYGAEADGSG 272
Query: 310 --NYWLIKNSWGNTWGDAGYMKIVRD---EGLCGIGTRSSYPL 347
YW+IKNSWG TWGD GYMK+ +D +G CG+ SYP+
Sbjct: 273 GGGYWIIKNSWGTTWGDGGYMKLEKDVGSQGACGVAMAPSYPV 315
>gi|164472556|gb|ABY58967.1| cathepsin L [Toxoplasma gondii]
Length = 421
Score = 253 bits (645), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 126/301 (41%), Positives = 176/301 (58%), Gaps = 6/301 (1%)
Query: 52 AQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGY 111
A + +SY E EK+ R IFK NL YI N++G +Y L N F DL+ DEFR Y G+
Sbjct: 121 AMYAKSYATEEEKQRRYAIFKNNLVYIHTHNQQGY-SYSLKMNHFGDLSRDEFRRKYLGF 179
Query: 112 KMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEG 171
K + N+ +++P +DWR +G VTP+K+Q++CG CWAF+ A+EG
Sbjct: 180 KKSRNLKSHHLGVATELLNVLPSELPAGVDWRSRGCVTPVKDQRDCGSCWAFSTTGALEG 239
Query: 172 ITKIRSGNLIQLSEQQLLDCS-TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPG 230
++G L+ LSEQ+L+DCS GN C GG AF Y++ + GI +ED YPY A
Sbjct: 240 AHCAKTGKLVSLSEQELMDCSRAEGNQSCSGGEMNDAFQYVLDSGGICSEDAYPYLARDE 299
Query: 231 TCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCG 290
C A KI +++VP E A+ A++ PVSIAI A FQ Y EG+F+ CG
Sbjct: 300 ECRAQSCEKVVKILGFKDVPRRSEAAMKAALAKSPVSIAIEADQMPFQFYHEGVFDASCG 359
Query: 291 TQLDHAVTIVGFGT-TEDGANYWLIKNSWGNTWGDAGYMKIVR---DEGLCGIGTRSSYP 346
T LDH V +VG+GT E ++W++KNSWG WG GYM + +EG CG+ +S+P
Sbjct: 360 TDLDHGVLLVGYGTDKESKKDFWIMKNSWGTGWGRDGYMYMAMHKGEEGQCGLLLDASFP 419
Query: 347 L 347
+
Sbjct: 420 V 420
>gi|224116880|ref|XP_002317417.1| predicted protein [Populus trichocarpa]
gi|118488173|gb|ABK95906.1| unknown [Populus trichocarpa]
gi|222860482|gb|EEE98029.1| predicted protein [Populus trichocarpa]
Length = 498
Score = 252 bits (644), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 131/317 (41%), Positives = 190/317 (59%), Gaps = 16/317 (5%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN--KEGNRTYKLGTNQFS 97
E+ + E+ + W +H + YK E E R+ FK NL+YI + N ++ +K+G N+F+
Sbjct: 43 EEGITEVFKLWKEKHQKVYKHAEEAERRIGNFKRNLKYIIEKNGKRKSGLEHKVGLNKFA 102
Query: 98 DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKEC 157
DL+N+EFR +Y K+ P T K+++L D P+SLDWR+KG VT +K+Q +C
Sbjct: 103 DLSNEEFREMYLS-KVKKPI---TIEEKRKHRHLQTCDAPSSLDWRNKGVVTAVKDQGDC 158
Query: 158 GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
G CW+F+ A+E I I +G+LI LSEQ+L+DC T N GC GG + AF ++I N GI
Sbjct: 159 GSCWSFSTTGAIEAINAIVTGDLISLSEQELVDCDTTNNYGCEGGDMDSAFQWVIGNGGI 218
Query: 218 ATEDEYPYQAVPGTC-SAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTE 276
TE +YPY V GTC +A ++ I Y +V D ALL A QP+S+ + + +
Sbjct: 219 DTEADYPYTGVDGTCNTAKEEKKVVSIEGYVDVDPSD-SALLCATVQQPISVGMDGSALD 277
Query: 277 FQSYKEGIFNGVCG---TQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD 333
FQ Y GI++G C +DHA+ IVG+G+ D +YW++KNSWG WG GY I R+
Sbjct: 278 FQLYTGGIYDGDCSGDPNDIDHAILIVGYGSEND-EDYWIVKNSWGTEWGMEGYFYIRRN 336
Query: 334 E----GLCGIGTRSSYP 346
G+C I +SYP
Sbjct: 337 TSKPYGVCAINADASYP 353
>gi|8886940|gb|AAF80626.1|AC069251_19 F2D10.37 [Arabidopsis thaliana]
Length = 315
Score = 252 bits (643), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 122/276 (44%), Positives = 188/276 (68%), Gaps = 7/276 (2%)
Query: 38 THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
+H++ ++E+ E W++ ++Y+ EK +R ++FK+NL++I++ NK+G ++Y LG N+F+
Sbjct: 43 SHDK-LIELFENWISNFEKAYETVEEKFLRFEVFKDNLKHIDETNKKG-KSYWLGLNEFA 100
Query: 98 DLTNDEFRALYTGYKMPSPSHRSTTS-STFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKE 156
DL+++EF+ +Y G K S + F Y+++ VP S+DWR KGAV +KNQ
Sbjct: 101 DLSHEEFKKMYLGLKTDIVRRDEERSYAEFAYRDVEA--VPKSVDWRKKGAVAEVKNQGS 158
Query: 157 CGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQG 216
CG CWAF+ VAAVEGI KI +GNL LSEQ+L+DC T NNGC GG + AF YI++N G
Sbjct: 159 CGSCWAFSTVAAVEGINKIVTGNLTTLSEQELIDCDTTYNNGCNGGLMDYAFEYIVKNGG 218
Query: 217 IATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYST 275
+ E++YPY GTC + + I+ +++VP+ DE++LLKA++ QP+S+AI A
Sbjct: 219 LRKEEDYPYSMEEGTCEMQKDESETVTINGHQDVPTNDEKSLLKALAHQPLSVAIDASGR 278
Query: 276 EFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANY 311
EFQ Y G+F+G CG LDH V VG+G+++ G++Y
Sbjct: 279 EFQFYSGGVFDGRCGVDLDHGVAAVGYGSSK-GSDY 313
>gi|330434686|gb|AEC22811.1| cathepsin L [Macrobrachium nipponense]
Length = 342
Score = 252 bits (643), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 135/317 (42%), Positives = 192/317 (60%), Gaps = 12/317 (3%)
Query: 43 VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANK---EGNRTYKLGTNQFSDL 99
V+E E + +H + Y+ + E+ R+KIF EN + I NK G++TYKLG N++ D+
Sbjct: 25 VMEEWESFKFEHSKKYESDTEETFRMKIFAENKQKIAAHNKLYHTGSKTYKLGMNKYGDM 84
Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFKYQNL--SMTDV--PTSLDWRDKGAVTPIKNQK 155
+ EF + G++ + + F+ + DV P S+DWR+KGAVT +K+Q
Sbjct: 85 LHHEFVNMMNGFRANTSGAGYKANRGFQGAHFVEPPEDVVMPKSVDWREKGAVTEVKDQG 144
Query: 156 ECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQN 214
CG CWAF+A A+EG ++G+L+ LSEQ L+DCS+ GNNGC GG + AF YI N
Sbjct: 145 SCGSCWAFSATGALEGQHYRQTGDLVSLSEQNLVDCSSKFGNNGCNGGLMDNAFQYIKVN 204
Query: 215 QGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAY 273
GI TE YPY+A C A A + +V G+E AL KA+ ++ PVS+AI A
Sbjct: 205 GGIDTEKSYPYEAEDEPCRYNPANAGADDRGFVDVREGNENALKKAIATIGPVSVAIDAS 264
Query: 274 STEFQSYKEGIFN--GVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIV 331
FQ Y+ G+++ LDH V VG+GTTEDG +YWL+KNSW +WGD GY+KI
Sbjct: 265 QDSFQFYQHGVYSDPDCSAENLDHGVLAVGYGTTEDGQDYWLVKNSWSKSWGDQGYIKIA 324
Query: 332 RDE-GLCGIGTRSSYPL 347
R++ +CGI + +SYPL
Sbjct: 325 RNQNNMCGIASAASYPL 341
>gi|346466067|gb|AEO32878.1| hypothetical protein [Amblyomma maculatum]
Length = 358
Score = 252 bits (643), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 137/338 (40%), Positives = 198/338 (58%), Gaps = 14/338 (4%)
Query: 19 FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
F+++ L A+ + TH++ V + A HG+ Y E E+ RLKI+ EN I
Sbjct: 27 FVVLGCLFVTAAAI-----THQELVGAEWSAFKALHGKEYHSETEEYYRLKIYMENRLKI 81
Query: 79 EKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD 135
+ N++ +YKL N+F DL + EF + G+K S S + + +
Sbjct: 82 ARHNEKYANNKASYKLAMNEFGDLLHHEFVSTRNGFKRNYRSTPREGSFYIEPEGIEDKH 141
Query: 136 VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN- 194
+P ++DWR KGAVTP+KNQ +CG CWAF+ ++EG ++G ++ LSEQ L+DCS
Sbjct: 142 LPKTVDWRKKGAVTPVKNQGQCGSCWAFSTTGSLEGQHFRKTGRMVSLSEQNLVDCSGKF 201
Query: 195 GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDE 254
GNNGC GG + AF YI N GI TE YPY G C + A + + ++P G+E
Sbjct: 202 GNNGCEGGLMDNAFKYIKANGGIDTELSYPYNGTDGICHFEKSDVGATDTGFVDIPEGNE 261
Query: 255 QALLKAV-SMQPVSIAIAAYSTEFQSYKEGIFNGV-CGTQ-LDHAVTIVGFGTTEDGANY 311
Q L KAV ++ PVS+AI A FQ Y +G+++ C ++ LDH V +VG+G T+DG +Y
Sbjct: 262 QLLKKAVATVGPVSVAIDASHESFQFYSQGVYDEPECSSESLDHGVLVVGYG-TKDGQDY 320
Query: 312 WLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPLA 348
WL+KNSWG TWGD GY+ + R+ E CGI + +SYPL
Sbjct: 321 WLVKNSWGTTWGDDGYIYMTRNKENQCGIASSASYPLV 358
>gi|21483184|gb|AAF86584.1| cathepsin L cysteine protease [Haemonchus contortus]
Length = 355
Score = 252 bits (643), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 136/327 (41%), Positives = 198/327 (60%), Gaps = 13/327 (3%)
Query: 33 VSSRSTHEQSVVEIHEKW---MAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GN 86
V + + Q + E KW G+SY+ E E + ++ F +N+ +IE+ NKE G
Sbjct: 31 VHRQKSLRQKIDEAFNKWDDYKETFGKSYEPEEENDY-MEAFVKNVIHIEEHNKEHRLGR 89
Query: 87 RTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKG 146
+T+++G N+ +DL ++R L GY+M S S+ K+ +P S+DWR++G
Sbjct: 90 KTFEMGLNEIADLPFSQYRKL-NGYRMRRQFGDSMQSNGTKFLVPFNVQIPESVDWREEG 148
Query: 147 AVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSRE 205
VTP+KNQ CG CWAF++ A+EG +G L+ LSEQ L+DCST GN+GC GG +
Sbjct: 149 LVTPVKNQGMCGSCWAFSSTGALEGQHARATGKLVSLSEQNLVDCSTKYGNHGCNGGLMD 208
Query: 206 KAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQ- 264
AF YI +N G+ TED YPY C + A + ++P GDE+AL KAV+ Q
Sbjct: 209 LAFEYIKENHGVDTEDSYPYVGRETKCHFKRNTVGADDKGFVDLPEGDEEALKKAVATQG 268
Query: 265 PVSIAIAAYSTEFQSYKEGI-FNGVCGT-QLDHAVTIVGFGTTEDGANYWLIKNSWGNTW 322
P+SIAI A FQ YK+G+ F+ C + +LDH V +VG+GT + +YWL+KNSWG TW
Sbjct: 269 PISIAIDAGHRSFQLYKKGVYFDEECSSEELDHGVLLVGYGTDPEAGDYWLVKNSWGPTW 328
Query: 323 GDAGYMKIVRDE-GLCGIGTRSSYPLA 348
G+ GY++I R+ CG+ T++SYPL
Sbjct: 329 GEKGYIRIARNRNNHCGVATKASYPLV 355
>gi|34559455|gb|AAQ75437.1| cathepsin L-like protease [Helicoverpa armigera]
gi|338855117|gb|AEJ31938.1| cathepsin L-like protease [Helicoverpa assulta]
Length = 341
Score = 252 bits (643), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 134/320 (41%), Positives = 190/320 (59%), Gaps = 20/320 (6%)
Query: 46 IHEKWMA---QHGRSYKDELEKEMRLKIFKENLEYIEKANK---EGNRTYKLGTNQFSDL 99
+ E+W A +H + Y E+E + R+KI+ EN I K N+ +G +YKL N+++D+
Sbjct: 23 VREEWSAFKLEHSKRYDSEVEDKFRMKIYLENKHRIAKHNQRFEQGAVSYKLRPNKYADM 82
Query: 100 TNDEFRALYTGY----KMPSPSH---RSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIK 152
+ EF + G+ K P H R + +TF + P +DWR KGAVT +K
Sbjct: 83 LSHEFVHVMNGFNKTLKHPKAVHGKGRESRPATFIAP--AHVTYPDHVDWRKKGAVTEVK 140
Query: 153 NQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYI 211
+Q +CG CWAF+ A+EG ++G L+ LSEQ L+DCS GNNGC GG + AF YI
Sbjct: 141 DQGKCGSCWAFSTTGALEGQHFRKTGYLVSLSEQNLIDCSAAYGNNGCNGGLMDNAFKYI 200
Query: 212 IQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAI 270
N GI TE YPY+ V C K + A + ++P GDE+ L++AV ++ PVS+AI
Sbjct: 201 KDNGGIDTEKAYPYEGVDDKCRYNAKNSGADDVGFVDIPQGDEEKLMQAVATVGPVSVAI 260
Query: 271 AAYSTEFQSYKEGIF--NGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYM 328
A FQ Y +G++ T LDH V +VG+GT E G +YWL+KNSWG TWGD GY+
Sbjct: 261 DASQESFQFYSDGVYYDENCSSTDLDHGVMVVGYGTDEQGGDYWLVKNSWGRTWGDLGYI 320
Query: 329 KIVRDE-GLCGIGTRSSYPL 347
K+ R++ CGI + +SYPL
Sbjct: 321 KMARNKNNHCGIASSASYPL 340
>gi|326672297|ref|XP_003199631.1| PREDICTED: cathepsin L1-like [Danio rerio]
Length = 336
Score = 252 bits (643), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 136/340 (40%), Positives = 204/340 (60%), Gaps = 17/340 (5%)
Query: 20 IIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIE 79
++ LLV+ V + S+ + + + W +QHG+SY +++E R+ I++ENL IE
Sbjct: 1 MMFALLVTLCISAVFAASSIDIQLDDHWNSWKSQHGKSYHEDVEVGRRM-IWEENLRKIE 59
Query: 80 KANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDV 136
+ N E GN T+K+G NQF D+TN+EFR GYK + TS + S
Sbjct: 60 QHNFEYSYGNHTFKMGMNQFGDMTNEEFRHAMNGYKHDP----NQTSQGPLFMEPSFFAA 115
Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-TNG 195
P +DWR +G VTP+K+QK+CG CW+F++ A+EG ++G LI +SEQ L+DCS +G
Sbjct: 116 PQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPHG 175
Query: 196 NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPG-TCSAAQKPAAAKISNYEEVPSGDE 254
N GC GG ++AF Y+ +N+G+ +E YPY A C + AKI+ + ++P G+E
Sbjct: 176 NQGCNGGLMDQAFQYVKENKGLDSEQSYPYLARDDLPCRYDPRFNVAKITGFVDIPKGNE 235
Query: 255 QALLKAV-SMQPVSIAIAAYSTEFQSYKEGIF--NGVCGTQLDHAVTIVGF---GTTEDG 308
AL+ AV ++ PVS+AI A Q Y+ GI+ ++LDHAV +VG+ G G
Sbjct: 236 LALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACSSSRLDHAVLVVGYGYQGADVAG 295
Query: 309 ANYWLIKNSWGNTWGDAGYMKIVRDE-GLCGIGTRSSYPL 347
YW++KNSW + WGD GY+ + +D+ CGI T +SYPL
Sbjct: 296 NRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGIATMASYPL 335
>gi|151573014|gb|ABS17682.1| cathepsin L-1 [Artemia salina]
Length = 334
Score = 252 bits (643), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 134/335 (40%), Positives = 204/335 (60%), Gaps = 11/335 (3%)
Query: 21 IITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEK 80
+I LL + Q+ ++ S E H + A H + Y +LE++ R+KI+ EN + K
Sbjct: 2 LIFLLGAVLVQLSAALSLTNLLADEWH-LFKATHKKEYPSQLEEKFRMKIYLENKHKVAK 60
Query: 81 AN---KEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
N ++G ++Y + N+F DL + EFR++ GY+ + S STF + + VP
Sbjct: 61 HNILYEKGEKSYHVAMNKFGDLLHHEFRSIMNGYQHKK-QNSSRAESTFTFMEPANVTVP 119
Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GN 196
S+DWR+KGA+TP+K+Q +CG CWAF++ A+EG T ++G L+ LSEQ L+DCS GN
Sbjct: 120 ESVDWREKGAITPVKDQGQCGSCWAFSSTGALEGQTFRKTGKLVSLSEQNLIDCSGKYGN 179
Query: 197 NGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQA 256
GC GG ++AF YI N+GI TE+ YPY+A C + A + ++PSG+E
Sbjct: 180 EGCNGGLMDQAFQYIKDNKGIDTENTYPYEAEDDVCRYNPRNRGAVDRGFVDIPSGEEDK 239
Query: 257 LLKAV-SMQPVSIAIAAYSTEFQSYKEGI-FNGVCGT-QLDHAVTIVGFGTTEDGANYWL 313
L AV ++ PVS+AI A FQ Y +G+ + C + LDH V +VG+G +++G +YWL
Sbjct: 240 LKAAVATVGPVSVAIDASHESFQFYSKGVYYEPSCDSDDLDHGVLVVGYG-SDNGKDYWL 298
Query: 314 IKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
+KNSW WGD GY+K+ R+ + CG+ + +SYPL
Sbjct: 299 VKNSWSEHWGDEGYIKMARNRKNHCGVASAASYPL 333
>gi|115715524|ref|XP_780580.2| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
Length = 334
Score = 252 bits (643), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 141/337 (41%), Positives = 203/337 (60%), Gaps = 14/337 (4%)
Query: 19 FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
++ + L+ +C VVSS S E +W +HG+ Y + E+ R I+++NL+ +
Sbjct: 3 YLSVLLVAAC---VVSSLSMSFIDFDEDWNQWKNEHGKRYLSDEEEASRRLIWQKNLDIV 59
Query: 79 EKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD 135
K N + G+ TY LG NQF+DL N+EF +L G++ S ++T STF + ++ D
Sbjct: 60 IKHNLKYDLGHFTYDLGMNQFADLKNEEFVSLMNGFR--GNSSKATRGSTFLPPS-NVFD 116
Query: 136 VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-TN 194
+PT +DWR KG VTP+KNQ +CG CWAF+A ++EG ++G L+ LSEQ L+DCS
Sbjct: 117 MPTMVDWRTKGYVTPVKNQLQCGSCWAFSATGSLEGQHFKKTGKLVSLSEQNLVDCSGKE 176
Query: 195 GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDE 254
GN GC GG ++AF YI+ GI TE YPY A+ G C + A + Y +V +G E
Sbjct: 177 GNMGCEGGLMDQAFQYILDVGGIDTEMSYPYTAMDGQCHFNKANIGATDTGYTDVTTGSE 236
Query: 255 QALLKAV-SMQPVSIAIAAYSTEFQSYKEGIFN--GVCGTQLDHAVTIVGFGTTEDGANY 311
AL AV S+ P+S+AI A FQ YK G++N T LDH V VG+GT+ DG +Y
Sbjct: 237 SALQMAVASVGPISVAIDASHQSFQLYKSGVYNEPACSSTLLDHGVLAVGYGTSSDGTDY 296
Query: 312 WLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
+ +SWG WG GY+ + R+ + CGI T++SYPL
Sbjct: 297 FFFFHSWGAAWGMNGYLWMSRNKDNQCGIATKASYPL 333
>gi|404312774|pdb|3TNX|A Chain A, Structure Of The Precursor Of A Thermostable Variant Of
Papain At 2.6 Angstroem Resolution
gi|404312775|pdb|3TNX|C Chain C, Structure Of The Precursor Of A Thermostable Variant Of
Papain At 2.6 Angstroem Resolution
gi|428698029|pdb|3USV|A Chain A, Structure Of The Precursor Of A Thermostable Variant Of
Papain At 3.8 A Resolution From A Crystal Soaked At Ph 4
gi|428698030|pdb|3USV|C Chain C, Structure Of The Precursor Of A Thermostable Variant Of
Papain At 3.8 A Resolution From A Crystal Soaked At Ph 4
Length = 363
Score = 252 bits (643), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 136/315 (43%), Positives = 195/315 (61%), Gaps = 15/315 (4%)
Query: 38 THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
T + ++++ E WM +H + YK+ EK R +IFK+NL+YI++ NK+ N +Y LG N F+
Sbjct: 57 TSTERLIQLFESWMLKHNKIYKNIDEKIYRFEIFKDNLKYIDETNKK-NNSYWLGLNVFA 115
Query: 98 DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKEC 157
D++NDEF+ YTG + ++ +T S + N ++P +DWR KGAVTP+KNQ C
Sbjct: 116 DMSNDEFKEKYTG--SIAGNYTTTELSYEEVLNDGDVNIPEYVDWRQKGAVTPVKNQGSC 173
Query: 158 GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
G WAF+AV+ +E I KIR+GNL + SEQ+LLDC + GC GG A + Q GI
Sbjct: 174 GSAWAFSAVSTIESIIKIRTGNLNEYSEQELLDCDRR-SYGCNGGYPWSALQLVAQ-YGI 231
Query: 218 ATEDEYPYQAVPGTCSAAQK-PAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTE 276
+ YPY+ V C + +K P AAK +V +E ALL +++ QPVS+ + A +
Sbjct: 232 HYRNTYPYEGVQRYCRSREKGPYAAKTDGVRQVQPYNEGALLYSIANQPVSVVLEAAGKD 291
Query: 277 FQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR---- 332
FQ Y+ GIF G CG ++DHAV VG+ G NY LI+NSWG WG+ GY++I R
Sbjct: 292 FQLYRGGIFVGPCGNKVDHAVAAVGY-----GPNYILIRNSWGTGWGENGYIRIKRGTGN 346
Query: 333 DEGLCGIGTRSSYPL 347
G+CG+ T S YP+
Sbjct: 347 SYGVCGLYTSSFYPV 361
>gi|47169030|pdb|1S4V|A Chain A, The 2.0 A Crystal Structure Of The Kdel-Tailed Cysteine
Endopeptidase Functioning In Programmed Cell Death Of
Ricinus Communis Endosperm
gi|47169031|pdb|1S4V|B Chain B, The 2.0 A Crystal Structure Of The Kdel-Tailed Cysteine
Endopeptidase Functioning In Programmed Cell Death Of
Ricinus Communis Endosperm
Length = 229
Score = 251 bits (642), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 118/217 (54%), Positives = 152/217 (70%), Gaps = 5/217 (2%)
Query: 136 VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG 195
VP S+DWR KGAVT +K+Q +CG CWAF+ + AVEGI +I++ L+ LSEQ+L+DC T+
Sbjct: 2 VPASVDWRKKGAVTSVKDQGQCGSCWAFSTIVAVEGINQIKTNKLVSLSEQELVDCDTDQ 61
Query: 196 NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDE 254
N GC GG + AF +I Q GI TE YPY+A GTC +++ A A I +E VP DE
Sbjct: 62 NQGCNGGLMDYAFEFIKQRGGITTEANYPYEAYDGTCDVSKENAPAVSIDGHENVPENDE 121
Query: 255 QALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLI 314
ALLKAV+ QPVS+AI A ++FQ Y EG+F G CGT+LDH V IVG+GTT DG YW +
Sbjct: 122 NALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGSCGTELDHGVAIVGYGTTIDGTKYWTV 181
Query: 315 KNSWGNTWGDAGYMKIVR----DEGLCGIGTRSSYPL 347
KNSWG WG+ GY+++ R EGLCGI +SYP+
Sbjct: 182 KNSWGPEWGEKGYIRMERGISDKEGLCGIAMEASYPI 218
>gi|401397136|ref|XP_003879989.1| cathepsin L, related [Neospora caninum Liverpool]
gi|325114397|emb|CBZ49954.1| cathepsin L, related [Neospora caninum Liverpool]
Length = 415
Score = 251 bits (642), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 125/286 (43%), Positives = 175/286 (61%), Gaps = 4/286 (1%)
Query: 52 AQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGY 111
A +G+SY E E + R IFK NL YI N++G +Y L N F DL+ +EFR Y GY
Sbjct: 124 ATYGKSYATEEETQKRYAIFKNNLAYIHTHNQQG-YSYSLKMNHFGDLSREEFRRKYLGY 182
Query: 112 KMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEG 171
+ + +S +DVP+++DWR+KG VTP+K+Q++CG CWAF+A A+EG
Sbjct: 183 NKSRNLKSNNLGVATELLKVSPSDVPSAVDWREKGCVTPVKDQRDCGSCWAFSATGALEG 242
Query: 172 ITKIRSGNLIQLSEQQLLDCS-TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPG 230
++G L+ LSEQ+L+DCS GN GC GG AF Y++ + G+ +E+ YPY A G
Sbjct: 243 AHCAKTGELLSLSEQELVDCSLAEGNQGCSGGEMNDAFQYVVDSGGLCSEEGYPYLARDG 302
Query: 231 TCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCG 290
C A K IS +++VP E A+ A++ PVSIAI A FQ Y EG+F+ CG
Sbjct: 303 ECKRACKKVVT-ISGFKDVPRKSETAMKAALAHSPVSIAIEADQLPFQFYHEGVFDASCG 361
Query: 291 TQLDHAVTIVGFGT-TEDGANYWLIKNSWGNTWGDAGYMKIVRDEG 335
T LDH V +VG+GT E ++W++KNSWG+ WG GYM + +G
Sbjct: 362 TDLDHGVLLVGYGTDKETKKDFWIMKNSWGSGWGRDGYMYMAMHKG 407
>gi|81542|pir||S02728 actinidain (EC 3.4.22.14) precursor (clone pAC.1) - kiwi fruit
(fragment)
gi|15957|emb|CAA31435.1| actinidin precursor [Actinidia chinensis]
gi|166319|gb|AAA32630.1| actinidin precursor [Actinidia deliciosa]
gi|226542|prf||1601514A actinidin
Length = 302
Score = 251 bits (642), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 132/278 (47%), Positives = 174/278 (62%), Gaps = 10/278 (3%)
Query: 75 LEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT 134
L +I++ N + NR+YK+G NQF+DLT +EFR+ Y G+ S + T + +Y+
Sbjct: 1 LRFIDEHNADTNRSYKVGLNQFADLTGEEFRSTYLGFTGGS----NKTKVSNRYEPRVSQ 56
Query: 135 DVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDC-ST 193
+P+ +DWR GAV IK+Q ECG CWAF+A+A VEGI KI +G LI LSEQ+L+ C T
Sbjct: 57 VLPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSEQELIGCGGT 116
Query: 194 NGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSA-AQKPAAAKISNYEEVPSG 252
GC GG F +II N GI T + YPY A G C+ Q I Y VP
Sbjct: 117 QNTRGCNGGYITDGFQFIINNGGINTGENYPYTAQDGECNLDLQNEKYVTIDTYGNVPYN 176
Query: 253 DEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYW 312
+E AL AV+ QPVS+A+ A F+ Y GIF G CGT +DHAVTIVG+G TE G +YW
Sbjct: 177 NEWALQTAVTYQPVSVALDAAGDAFKHYSSGIFTGPCGTAIDHAVTIVGYG-TEGGIDYW 235
Query: 313 LIKNSWGNTWGDAGYMKIVRD---EGLCGIGTRSSYPL 347
+++NSW TWG+ GYM+I+R+ G CGI T SYP+
Sbjct: 236 IVENSWDTTWGEEGYMRILRNVGGAGTCGIATMPSYPV 273
>gi|157787177|ref|NP_001099150.1| cathepsin L1-like precursor [Danio rerio]
gi|157422879|gb|AAI53505.1| MGC174152 protein [Danio rerio]
Length = 336
Score = 251 bits (641), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 136/340 (40%), Positives = 203/340 (59%), Gaps = 17/340 (5%)
Query: 20 IIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIE 79
++ LLV+ V + S+ + + + W +QHG+SY +++E R+ I++ENL IE
Sbjct: 1 MMFALLVTLCISAVFAASSIDIQLDDHWNSWKSQHGKSYHEDVEVGRRM-IWEENLRKIE 59
Query: 80 KANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDV 136
+ N E GN T+K+G NQF D+TN+EFR GYK + TS + S
Sbjct: 60 QHNFEYSYGNHTFKMGMNQFGDMTNEEFRHAMNGYKHDP----NQTSQGPLFMEPSFFAA 115
Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-TNG 195
P +DWR +G VTP+K+QK+CG CW+F++ A+EG ++G LI +SEQ L+DCS G
Sbjct: 116 PQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQG 175
Query: 196 NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPG-TCSAAQKPAAAKISNYEEVPSGDE 254
N GC GG ++AF Y+ +N+G+ +E YPY A C + AKI+ + ++P G+E
Sbjct: 176 NQGCNGGLMDQAFQYVKENKGLDSEQSYPYLARDDLPCRYDPRFNVAKITGFVDIPRGNE 235
Query: 255 QALLKAV-SMQPVSIAIAAYSTEFQSYKEGIF--NGVCGTQLDHAVTIVGF---GTTEDG 308
AL+ AV ++ PVS+AI A Q Y+ GI+ ++LDHAV +VG+ G G
Sbjct: 236 LALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACSSSRLDHAVLVVGYGYQGADVAG 295
Query: 309 ANYWLIKNSWGNTWGDAGYMKIVRDE-GLCGIGTRSSYPL 347
YW++KNSW + WGD GY+ + +D+ CGI T +SYPL
Sbjct: 296 NRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGIATMASYPL 335
>gi|356514419|ref|XP_003525903.1| PREDICTED: LOW QUALITY PROTEIN: cysteine proteinase RD21a-like
[Glycine max]
Length = 343
Score = 251 bits (641), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 144/357 (40%), Positives = 204/357 (57%), Gaps = 46/357 (12%)
Query: 9 GSFKINTTPMFIIITLLVSCAS----QVVSSRSTH--------EQSVVEIHEKWMAQHGR 56
G+ + + +FI+ +++ +S ++S +H ++ V+ I+E+ +A+HG+
Sbjct: 2 GTNRSSKATIFILFFTVLAVSSALDLSIISYDRSHADKSGWRSDEEVMSIYEEXLAKHGK 61
Query: 57 SYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSP 116
Y E E R +I KENL+++E+ N GNRTYK+G N+F+D + M P
Sbjct: 62 VYNAIDEMEERFQISKENLKFVEQHNA-GNRTYKVGLNRFADRSR----------MMTRP 110
Query: 117 SHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIR 176
S R Y ++ S+DWR +GAV +K Q EC C F +AAVEGI KI
Sbjct: 111 SSR--------YAPRVSDNLSESVDWRKEGAVVRVKTQSECESCRTFTVIAAVEGINKIV 162
Query: 177 SGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQ 236
+GNL LS DC N GC GG + A +II N GI TE++YP+Q G C +
Sbjct: 163 TGNLTALS-----DCDRTVNAGCSGGLADYALEFIINNGGIDTEEDYPFQGAVGICDQYK 217
Query: 237 KPAAAKISNYEEVPSGDEQALLKAVSMQPVSIA-IAAYSTEFQSYKEGIFNGVCGTQLDH 295
A + YE VP+ DE AL KAV+ QPVS+A I AY EFQ Y+ GIF G CGT +DH
Sbjct: 218 INA---VDGYERVPAYDELALKKAVANQPVSVAYIEAYGKEFQLYESGIFTGKCGTSIDH 274
Query: 296 AVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-----EGLCGIGTRSSYPL 347
VT VG+G TE+G +YW++KNSWG WG+AGY+++ R+ G CGI + YP+
Sbjct: 275 GVTAVGYG-TENGIDYWIVKNSWGENWGEAGYVRMERNTAEDTAGKCGIAILTLYPI 330
>gi|6650705|gb|AAF21977.1|AF115280_1 thiolproteinase SmTP1 [Sarcocystis muris]
Length = 394
Score = 251 bits (641), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 129/305 (42%), Positives = 182/305 (59%), Gaps = 16/305 (5%)
Query: 54 HGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKM 113
H + Y E E+ R IFK NL YI N +G +Y L N+F DLT +EFR Y GYK
Sbjct: 96 HNKFYATEEERLKRYAIFKNNLTYIHNHNMQG-YSYVLKMNKFGDLTLEEFRQRYLGYKK 154
Query: 114 PS----PSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAV 169
P P TT +++ D+PT +DWR +G VT +K+Q +CG CWAF+A A+
Sbjct: 155 PDLRTPPREVDTT-----LESVEDNDIPTHVDWRQRGCVTSVKDQGDCGSCWAFSATGAM 209
Query: 170 EGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAV 228
EG+ ++G L+ LS+QQL+DCS GN GC GG E+AF Y+++N GI + + YPY
Sbjct: 210 EGVYCAKTGKLVNLSQQQLVDCSRFLGNQGCDGGRMEEAFEYVVENGGICSGENYPYMRK 269
Query: 229 PGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQ-PVSIAIAAYSTEFQSYKEGIFNG 287
G C ++Q + A I+ Y VP E+++ A++++ PVS+AI A FQ Y +GIF+
Sbjct: 270 DGVCKSSQCTSVATITGYRSVPRRSEKSMKTALALRSPVSVAIQANQAAFQFYYDGIFDA 329
Query: 288 VCGTQLDHAVTIVGFGTTEDG-ANYWLIKNSWGNTWGDAGYMKIVRDE---GLCGIGTRS 343
CGT LDH V +VG+ G +YW++KNSWG WG GYM + + G CG+
Sbjct: 330 PCGTNLDHGVLLVGYSAETAGQGDYWIMKNSWGAAWGKGGYMLMAMHKGPAGQCGVLLDG 389
Query: 344 SYPLA 348
S+P+A
Sbjct: 390 SFPVA 394
>gi|164420679|ref|NP_001037464.2| fibroinase precursor [Bombyx mori]
gi|40556818|gb|AAR87763.1| fibroinase precursor [Bombyx mori]
Length = 341
Score = 251 bits (641), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 137/341 (40%), Positives = 200/341 (58%), Gaps = 20/341 (5%)
Query: 24 LLVSCASQVVSSRSTHEQSVVEIHEKWMA---QHGRSYKDELEKEMRLKIFKENLEYIEK 80
+L+ CA VS+ Q + E+W A QH +Y+ E+E R+KI+ E+ I K
Sbjct: 5 VLLLCAVAAVSAV----QFFDLVKEEWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAK 60
Query: 81 ANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRST-----TSSTFKYQNLS 132
N++ G +YKLG N++ D+ + EF G+ + +++ + K+ + +
Sbjct: 61 HNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPA 120
Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
+P +DWR GAVT IK+Q +CG CW+F+ A+EG +SG L+ LSEQ L+DCS
Sbjct: 121 NVKLPEQVDWRKHGAVTDIKDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCS 180
Query: 193 TN-GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPS 251
GNNGC GG + AF YI N GI TE YPY+ V C K A+ + ++P
Sbjct: 181 EQYGNNGCNGGLMDNAFKYIKDNGGIDTEQTYPYEGVDDKCRYNPKNTGAEDVGFVDIPE 240
Query: 252 GDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGIFN--GVCGTQLDHAVTIVGFGTTEDG 308
GDEQ L++AV ++ PVS+AI A T FQ Y G++N T LDH V +VG+GT E G
Sbjct: 241 GDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQG 300
Query: 309 ANYWLIKNSWGNTWGDAGYMKIVRDE-GLCGIGTRSSYPLA 348
+YWL+KNSWG +WG+ GY+K++R++ CGI + +SYPL
Sbjct: 301 VDYWLVKNSWGRSWGELGYIKMIRNKNNRCGIASSASYPLV 341
>gi|293342577|ref|XP_001065834.2| PREDICTED: cathepsin L1 [Rattus norvegicus]
gi|293354413|ref|XP_573976.3| PREDICTED: cathepsin L1 [Rattus norvegicus]
gi|149039745|gb|EDL93861.1| rCG24317, isoform CRA_a [Rattus norvegicus]
Length = 330
Score = 251 bits (641), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 135/341 (39%), Positives = 198/341 (58%), Gaps = 20/341 (5%)
Query: 16 TPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENL 75
TP+F++ TL + ++S+ TH+ S + E+W +HG++Y E + R +++ N+
Sbjct: 2 TPIFLLATLCLG----MISAAPTHDPSFDTVWEEWKTKHGKTYNTNEEGQKR-AVWENNM 56
Query: 76 EYIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
+ I N++ G + L N F DLTN EFR L TG++ F
Sbjct: 57 KMINLHNEDYLKGKHGFSLEMNAFGDLTNTEFRELMTGFQGQKTKMMKVFPEPF------ 110
Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
+ DVP ++DWR G VTP+KNQ CG CWAF+AV ++EG ++G L+ LSEQ L+DCS
Sbjct: 111 LGDVPKTVDWRKHGYVTPVKNQGPCGSCWAFSAVGSLEGQVFRKTGKLVPLSEQNLVDCS 170
Query: 193 -TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPS 251
++GN GC GG + AF Y+ N G+ T YPY+A+ GTC K +AAK+ + +P
Sbjct: 171 WSHGNKGCDGGLPDFAFQYVKDNGGLDTSVSYPYEALNGTCRYNPKYSAAKVVGFMSIPP 230
Query: 252 GDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGIF--NGVCGTQLDHAVTIVGFGTTEDG 308
E AL+KAV ++ P+S+ I FQ YK G++ T L+HAV +VG+G DG
Sbjct: 231 -SENALMKAVATVGPISVGIDIKHKSFQFYKGGMYYEPDCSSTNLNHAVLVVGYGEESDG 289
Query: 309 ANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPLA 348
YWL+KNSWG WG GY+K+ +D CGI + +SYP+
Sbjct: 290 RKYWLVKNSWGRDWGMDGYIKMAKDWNNNCGIASDASYPIV 330
>gi|157093563|gb|ABV22436.1| cysteine proteinase [Oxyrrhis marina]
Length = 329
Score = 251 bits (641), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 132/315 (41%), Positives = 189/315 (60%), Gaps = 15/315 (4%)
Query: 42 SVVEIHEKW---MAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSD 98
+V +I +W A+ G SY E E+ R +F +N++ I + N +G+ TY LG NQF+D
Sbjct: 11 AVADIDAQWEEFKAKFGESYNGEEEEAERKGVFAQNVQLINEENSKGH-TYTLGVNQFAD 69
Query: 99 LTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECG 158
LT +EF Y G+K P+ + ++ + +PTS+DW +GAVTP+KNQ +CG
Sbjct: 70 LTVEEFSKTYMGFK--KPAQKYGDAAYLGRHVYNGEALPTSVDWSSQGAVTPVKNQGQCG 127
Query: 159 CCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-TNGNNGCLGGSREKAFAYIIQNQGI 217
CW+F+ ++EG +I +G L+ LSEQQ +DC+ T GN GC GG + AF Y N +
Sbjct: 128 SCWSFSTTGSLEGANEISTGKLVSLSEQQFVDCAGTYGNQGCNGGLMDSAFKYAEAN-AL 186
Query: 218 ATEDEYPYQAVPGTCSAAQ---KPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYS 274
TE YPY+ G+C A+ A +S Y++V S EQ ++ AV+ QPVSIAI A
Sbjct: 187 CTEQSYPYKGTDGSCQASSCSTGLAKGSVSGYKDVSSDSEQDMMSAVAQQPVSIAIEADK 246
Query: 275 TEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE 334
+ FQ Y G+ G CG LDH V VG+GT G +YW +KNSWG+TWG +GY+ + R +
Sbjct: 247 SVFQLYSGGVLTGACGASLDHGVLAVGYGTL-SGTDYWKVKNSWGSTWGMSGYVLLQRGK 305
Query: 335 ---GLCGIGTRSSYP 346
G CG+ + SYP
Sbjct: 306 GGSGECGLLSEPSYP 320
>gi|354549232|gb|AER27707.1| putative cysteine protease [Phytophthora sp. SH-2011]
Length = 533
Score = 251 bits (640), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 133/305 (43%), Positives = 182/305 (59%), Gaps = 11/305 (3%)
Query: 50 WMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRT-YKLGTNQFSDLTNDEFRALY 108
WM HG ++ D LE RL+ + N YI + N E T LG N FS ++ DEF+
Sbjct: 31 WMGAHGVTFSDALEFARRLENYIVNDMYIMEHNAENAWTGVTLGHNAFSHMSFDEFKFKM 90
Query: 109 TGYKMPSPSHRSTTSSTFKYQNL-SMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVA 167
TG +P +S + L S +VP+++DW DKG VTP+KNQ CG CWAF+
Sbjct: 91 TGLVLPEGYLEQRLAS--RVDGLWSDVEVPSAVDWVDKGGVTPVKNQGMCGSCWAFSTTG 148
Query: 168 AVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQA 227
AVEG T + SG L LSEQ+L+DC NG+ GC GG + AF +I + GI +ED+Y Y+A
Sbjct: 149 AVEGATFVSSGKLPSLSEQELVDCDHNGDMGCNGGLMDHAFQWIEDHGGICSEDDYEYKA 208
Query: 228 VPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNG 287
C + K++ +++V DE AL AV+ QPVS+AI A FQ YK G+FN
Sbjct: 209 KAQVCRECD--SVVKVTGFQDVNPQDEHALKVAVAQQPVSVAIEADQKAFQFYKSGVFNL 266
Query: 288 VCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE----GLCGIGTRS 343
CGT+LDH V VG+G ++G +W +KNSWG +WG+ GY+++ R+E G CGI +
Sbjct: 267 TCGTRLDHGVLAVGYG-NDNGHKFWKVKNSWGASWGEQGYIRLAREENGPAGQCGIASVP 325
Query: 344 SYPLA 348
SYP A
Sbjct: 326 SYPFA 330
>gi|146152090|gb|ABQ08058.1| cathepsin L [Misgurnus mizolepis]
Length = 337
Score = 251 bits (640), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 144/343 (41%), Positives = 200/343 (58%), Gaps = 18/343 (5%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
M+ + L C S V ++ S +Q + + E+W HG++Y E E+ R I+++NL
Sbjct: 1 MWTYLALFTLCLSGVFAAPSLDKQ-LDDHWEQWKTWHGKNYH-EKEEGWRRMIWEKNLRK 58
Query: 78 IEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT 134
I+ N E G TY+LG N F D+ ++EFR + GYK + R S F N
Sbjct: 59 IQFHNLEHSMGIHTYRLGMNHFGDMNHEEFRQVMNGYK--HKTERKFKGSLFMEPNF--L 114
Query: 135 DVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-T 193
+VP+ LDWR+KG VTP+K+Q ECG CWAF+ A+EG + G L+ LSEQ L+DCS
Sbjct: 115 EVPSKLDWREKGYVTPVKDQGECGSCWAFSTTGAMEGQMFRKQGKLVSLSEQNLVDCSRP 174
Query: 194 NGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGT-CSAAQKPAAAKISNYEEVPSG 252
GN GC GG ++AF YI N G+ +E+ YPY C K AA + + ++PSG
Sbjct: 175 EGNEGCNGGLMDQAFQYIKDNNGLDSEEAYPYLGTDDQPCHYDPKYNAANDTGFVDIPSG 234
Query: 253 DEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGI-FNGVCGT-QLDHAVTIVGF---GTTE 306
E AL+KAV S+ PVS+AI A FQ Y+ GI F C + +LDH V +VG+ G
Sbjct: 235 KEHALMKAVASVGPVSVAIDAGHESFQFYQSGIYFEKECSSEELDHGVLVVGYGFEGEDV 294
Query: 307 DGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPLA 348
DG YW++KNSW +WGD GY+ + +D + CGI T +SYPL
Sbjct: 295 DGKKYWIVKNSWSESWGDKGYIYMAKDRKNHCGIATAASYPLV 337
>gi|156124998|gb|ABU50817.1| Ale o 1 allergen [Aleuroglyphus ovatus]
Length = 337
Score = 251 bits (640), Expect = 5e-64, Method: Compositional matrix adjust.
Identities = 130/308 (42%), Positives = 196/308 (63%), Gaps = 13/308 (4%)
Query: 48 EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEF 104
E++ + GR Y + R IF+ NL++I + N + G+ T+ + N F+DL+N+EF
Sbjct: 34 EQFKSTFGRVYPSPEIELHRKSIFRANLQFILRHNIDYFNGDSTFSVSVNNFTDLSNEEF 93
Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFA 164
RA + GY+ + + + + + + + +P ++DW KG VTPIKNQ++CG CWAF+
Sbjct: 94 RATFNGYRRLA----AVSLADSVHADNDVEALPATVDWTTKGVVTPIKNQQQCGSCWAFS 149
Query: 165 AVAAVEGITKIRSGNLIQLSEQQLLDCS-TNGNNGCLGGSREKAFAYIIQNQGIATEDEY 223
AVA++EG +++G L+ LSEQ L+DCS G+ GC GG + AF Y+IQN+GI TE Y
Sbjct: 150 AVASMEGQHALKTGKLVSLSEQNLVDCSAAEGDMGCSGGWMDYAFKYVIQNRGIDTEASY 209
Query: 224 PYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKE 282
PY+A+ +C + A I ++ +V +GDE AL AV S+ P+S+AI A FQ Y
Sbjct: 210 PYKAIDESCEFKRNSVGATIHSFVDVKTGDESALQNAVASIGPISVAIDAAQPSFQFYSS 269
Query: 283 GIFNGV-CGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGI 339
G++N C T+ LDH VT VG+GT +GA YW +KNSWG +WG GY+ + R+ + CGI
Sbjct: 270 GVYNEPDCSTEILDHGVTAVGYGTL-NGAPYWKVKNSWGTSWGRKGYIFMSRNKQNQCGI 328
Query: 340 GTRSSYPL 347
T++SYP+
Sbjct: 329 ATKASYPV 336
>gi|157311713|ref|NP_001098585.1| uncharacterized protein LOC564979 precursor [Danio rerio]
gi|156230121|gb|AAI52284.1| Wu:fa26c03 protein [Danio rerio]
Length = 336
Score = 251 bits (640), Expect = 5e-64, Method: Compositional matrix adjust.
Identities = 135/340 (39%), Positives = 203/340 (59%), Gaps = 17/340 (5%)
Query: 20 IIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIE 79
++ LLV+ V + S+ + + + W +QHG+SY +++E R+ I++ENL IE
Sbjct: 1 MMFALLVTLCISAVFAASSIDIQLDDHWNSWKSQHGKSYHEDVEVGRRM-IWEENLRKIE 59
Query: 80 KANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDV 136
+ N E GN T+K+G NQF D+TN+EFR GYK + TS + S
Sbjct: 60 QHNFEYSYGNHTFKMGMNQFGDMTNEEFRQAMNGYKHDP----NRTSQGPLFMEPSFFAA 115
Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-TNG 195
P +DWR +G VTP+K+QK+CG CW+F++ A+EG ++G LI +SEQ L+DCS G
Sbjct: 116 PQQVDWRQRGFVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQG 175
Query: 196 NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPG-TCSAAQKPAAAKISNYEEVPSGDE 254
N GC GG ++AF Y+ +N+G+ +E YPY A C + AKI+ + ++P G+E
Sbjct: 176 NQGCNGGLMDQAFQYVKENKGLDSEQSYPYLARDDLPCRYDPRFNVAKITGFVDIPRGNE 235
Query: 255 QALLKAV-SMQPVSIAIAAYSTEFQSYKEGIF--NGVCGTQLDHAVTIVGF---GTTEDG 308
AL+ AV ++ PVS+AI A Q Y+ GI+ ++LDHAV +VG+ G G
Sbjct: 236 LALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACSSSRLDHAVLVVGYGYQGADVAG 295
Query: 309 ANYWLIKNSWGNTWGDAGYMKIVRDE-GLCGIGTRSSYPL 347
YW++KNSW + WGD GY+ + +D+ CG+ T +SYPL
Sbjct: 296 NRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGVATSASYPL 335
>gi|1709574|sp|P10056.2|PAPA3_CARPA RecName: Full=Caricain; AltName: Full=Papaya peptidase A; AltName:
Full=Papaya proteinase III; Short=PPIII; AltName:
Full=Papaya proteinase omega; Flags: Precursor
gi|18098|emb|CAA46862.1| proteinase omega [Carica papaya]
Length = 348
Score = 251 bits (640), Expect = 5e-64, Method: Compositional matrix adjust.
Identities = 136/314 (43%), Positives = 192/314 (61%), Gaps = 12/314 (3%)
Query: 38 THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
T + ++++ WM H + Y++ EK R +IFK+NL YI++ NK+ N +Y LG N+F+
Sbjct: 39 TSTERLIQLFNSWMLNHNKFYENVDEKLYRFEIFKDNLNYIDETNKK-NNSYWLGLNEFA 97
Query: 98 DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKEC 157
DL+NDEF Y G + + +S ++ N ++P ++DWR KGAVTP+++Q C
Sbjct: 98 DLSNDEFNEKYVGSLIDATIEQSYDE---EFINEDTVNLPENVDWRKKGAVTPVRHQGSC 154
Query: 158 GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
G CWAF+AVA VEGI KIR+G L++LSEQ+L+DC ++GC GG A Y+ +N GI
Sbjct: 155 GSCWAFSAVATVEGINKIRTGKLVELSEQELVDCERR-SHGCKGGYPPYALEYVAKN-GI 212
Query: 218 ATEDEYPYQAVPGTCSAAQKPAA-AKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTE 276
+YPY+A GTC A Q K S V +E LL A++ QPVS+ + +
Sbjct: 213 HLRSKYPYKAKQGTCRAKQVGGPIVKTSGVGRVQPNNEGNLLNAIAKQPVSVVVESKGRP 272
Query: 277 FQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR---- 332
FQ YK GIF G CGT++DHAVT VG+G + LIKNSWG WG+ GY++I R
Sbjct: 273 FQLYKGGIFEGPCGTKVDHAVTAVGYGKSGGKGYI-LIKNSWGTAWGEKGYIRIKRAPGN 331
Query: 333 DEGLCGIGTRSSYP 346
G+CG+ S YP
Sbjct: 332 SPGVCGLYKSSYYP 345
>gi|402770499|gb|AFQ98384.1| cathepsin L, partial [Hyalomma anatolicum anatolicum]
Length = 312
Score = 250 bits (639), Expect = 6e-64, Method: Compositional matrix adjust.
Identities = 133/309 (43%), Positives = 189/309 (61%), Gaps = 14/309 (4%)
Query: 48 EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEF 104
E + H +SY+ ++E+ +R KIF EN I K N + G +YKLG NQF DL EF
Sbjct: 8 EAFKTTHKKSYQSKMEELLRYKIFTENSLLIAKHNAKYAKGLVSYKLGMNQFGDLLPHEF 67
Query: 105 RALYTGYKMPSPSHRSTTSSTF-KYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAF 163
++ GY R STF N++ + +P ++DWR KGAVTP+K+Q +CG CWAF
Sbjct: 68 AKMFNGYH----GERKGRGSTFLPPANVNDSSLPKTVDWRKKGAVTPVKDQGQCGSCWAF 123
Query: 164 AAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDE 222
+A ++EG ++SG L+ LSEQ L+DCS + GN GC GG + AF YI N GI TE+
Sbjct: 124 SATGSLEGQHFLKSGKLVSLSEQNLIDCSGSFGNEGCGGGLMDNAFKYIKANDGIDTEES 183
Query: 223 YPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYK 281
YPY+A+ G C ++ A + + ++ G E L KAV ++ P+S+AI A + FQ Y
Sbjct: 184 YPYEAMDGDCRFKKEDVGATDTGFVDIQQGSEDDLQKAVATVGPISVAIDASHSSFQLYS 243
Query: 282 EGIFN--GVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCG 338
EG+++ +LDH V VG+G ++G YWL+KNSW TWGD GY+ + RD + CG
Sbjct: 244 EGVYDEPNCSSEELDHGVLAVGYG-VKNGKKYWLVKNSWAETWGDNGYILMSRDKDNQCG 302
Query: 339 IGTRSSYPL 347
I + +SYPL
Sbjct: 303 IASSASYPL 311
>gi|238481789|gb|ACR43934.1| cathepsin L-like cysteine proteinase [Haliotis diversicolor
supertexta]
Length = 347
Score = 250 bits (639), Expect = 6e-64, Method: Compositional matrix adjust.
Identities = 140/303 (46%), Positives = 189/303 (62%), Gaps = 12/303 (3%)
Query: 53 QHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYT 109
QHGR Y+ E+E R +IFK+NL+YIE+ NK+ G ++Y LG NQF+D+ N+EFR +Y
Sbjct: 48 QHGRLYEKHEEEEERFEIFKQNLQYIEEHNKKFSLGQKSYYLGINQFADMKNEEFR-MYN 106
Query: 110 GYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAV 169
G + R S + P +DWR KG VT +KNQ +CG CW+F+ ++
Sbjct: 107 GLRRDYNYSREVQCSN--HLTPEYLVAPDEVDWRKKGYVTAVKNQGQCGSCWSFSTTGSL 164
Query: 170 EGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAV 228
EG +SG L+ LSEQQL+DCS GN GC GG ++AF YII N GI TE+EYPY A
Sbjct: 165 EGQHFHKSGKLVSLSEQQLVDCSGKFGNEGCNGGLMDQAFEYIITNGGIETEEEYPYDAR 224
Query: 229 PGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVS-MQPVSIAIAAYSTEFQSYKEGIFN- 286
C + AA S +V SGDE L +V+ + PVSIAI A FQ Y G+++
Sbjct: 225 QERCHFKKSEVAATASGCVDVKSGDETDLKNSVAEVGPVSIAIDASHQSFQLYSGGVYDE 284
Query: 287 -GVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSS 344
T+LDH V +VG+G T+DG +YWL+KNSWG TWG GY+K+ R+ + CG+ T++S
Sbjct: 285 PKCSSTELDHGVLVVGYG-TDDGQDYWLVKNSWGTTWGLEGYVKMSRNQDNQCGVATQAS 343
Query: 345 YPL 347
YPL
Sbjct: 344 YPL 346
>gi|21489677|gb|AAM55195.1|AF412313_1 cathepsin L cysteine protease [Haemonchus contortus]
gi|21483192|gb|AAL14224.1| cathepsin L [Haemonchus contortus]
Length = 354
Score = 250 bits (639), Expect = 7e-64, Method: Compositional matrix adjust.
Identities = 135/326 (41%), Positives = 198/326 (60%), Gaps = 13/326 (3%)
Query: 33 VSSRSTHEQSVVEIHEKW---MAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GN 86
V + + Q + E KW G+SY+ + E + ++ F +N+ +IE+ NKE G
Sbjct: 30 VHRQKSLRQKIDEAFNKWDDYKETFGKSYEPDEENDY-MEAFVKNVIHIEEHNKEHRLGR 88
Query: 87 RTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKG 146
+T+++G N+ +DL ++R L GY+M S S+ K+ +P S+DWR++G
Sbjct: 89 KTFEMGLNEIADLPFSQYRKL-NGYRMRRQFGDSLQSNGTKFLVPFNVQIPESVDWREEG 147
Query: 147 AVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSRE 205
VTP+KNQ CG CWAF++ A+EG +G L+ LSEQ L+DCST GN+GC GG +
Sbjct: 148 LVTPVKNQGMCGSCWAFSSTGALEGQHARATGKLVSLSEQNLVDCSTKYGNHGCNGGLMD 207
Query: 206 KAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQ- 264
AF YI +N G+ TED YPY C + A + ++P GDE+AL KAV+ Q
Sbjct: 208 LAFEYIKENHGVDTEDSYPYVGRETKCHFKRNAVGADDKGFVDLPEGDEEALKKAVATQG 267
Query: 265 PVSIAIAAYSTEFQSYKEGI-FNGVCGT-QLDHAVTIVGFGTTEDGANYWLIKNSWGNTW 322
P+SIAI A FQ YK+G+ F+ C + +LDH V +VG+GT + +YWL+KNSWG TW
Sbjct: 268 PISIAIDAGHRSFQLYKKGVYFDEECSSEELDHGVLLVGYGTDPEAGDYWLVKNSWGPTW 327
Query: 323 GDAGYMKIVRDE-GLCGIGTRSSYPL 347
G+ GY++I R+ CG+ T++SYPL
Sbjct: 328 GEKGYIRIARNRNNHCGVATKASYPL 353
>gi|395535909|ref|XP_003769963.1| PREDICTED: cathepsin S [Sarcophilus harrisii]
Length = 347
Score = 250 bits (638), Expect = 8e-64, Method: Compositional matrix adjust.
Identities = 139/353 (39%), Positives = 206/353 (58%), Gaps = 17/353 (4%)
Query: 3 LIFERSGSFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDEL 62
+IF+ S S N M ++I + ++CAS R H+ + E W +G+ Y+++
Sbjct: 1 MIFQDSKSSPANLLRMKVVIWMFLACASTTAYLR--HDPMLDNHWELWKKTYGKQYEEQN 58
Query: 63 EKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHR 119
++ R I+++NL+++ N E G +Y L N SD+T++E +L + ++P+ R
Sbjct: 59 QEVTRRLIWEKNLKFVTLHNLEHSMGLHSYDLSMNHLSDMTSEEVASLMSSLRIPNQWSR 118
Query: 120 STTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGN 179
+TT Y+ S +P S+DWRDKG VT +K Q CG CWAF+AV A+E K+++G
Sbjct: 119 NTT-----YRLNSNQKLPDSVDWRDKGCVTEVKYQGTCGSCWAFSAVGALEAQLKLKTGK 173
Query: 180 LIQLSEQQLLDCSTN---GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQ 236
L+ LS Q L+DCSTN N+GC GG +AF YII N GI ++ YPY+A G C
Sbjct: 174 LVSLSAQNLVDCSTNEKYENHGCNGGCMTEAFQYIIDNNGIDSDASYPYKAKDGKCQYNP 233
Query: 237 KPAAAKISNYEEVPSGDEQALLKAVSMQ-PVSIAIAAYSTEFQSYKEGI-FNGVCGTQLD 294
AA S Y E+P G E AL +AV+ + PVS+ I A F YK G+ ++ C ++
Sbjct: 234 ANRAATCSRYTELPYGSEDALKEAVANKGPVSVGIDASLPSFFLYKSGVYYDPSCTQNVN 293
Query: 295 HAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDEG-LCGIGTRSSYP 346
H V + G+G DG +YWL+KNSWG ++GD GY++I R+ G CGI SYP
Sbjct: 294 HGVLVTGYGNL-DGKDYWLVKNSWGLSFGDKGYIRIARNRGNHCGIANFPSYP 345
>gi|357627452|gb|EHJ77132.1| cathepsin L-like protease [Danaus plexippus]
Length = 341
Score = 249 bits (637), Expect = 9e-64, Method: Compositional matrix adjust.
Identities = 132/337 (39%), Positives = 198/337 (58%), Gaps = 13/337 (3%)
Query: 24 LLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANK 83
LLV CA + + V E + +H + Y E E++ R+KI+ EN + K N+
Sbjct: 4 LLVLCAVVAAGTAVSFFDLVREEWNTFKLEHKKQYDSETEEKFRMKIYAENKHKVAKHNQ 63
Query: 84 ---EGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD----- 135
+G +Y+L TN++SD+ + EF G+ ++ + + +
Sbjct: 64 RYQKGLVSYRLKTNKYSDMLHHEFVNTMNGFNKTVKHNKGLYAKGNDIRGATFVSPANVA 123
Query: 136 VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN- 194
P ++DWR GAVTP+K+Q +CG CW+F+ A+EG +SG L+ LSEQ L+DCS+
Sbjct: 124 APPTVDWRQHGAVTPVKDQGKCGSCWSFSTTGALEGQHFRKSGFLVSLSEQNLIDCSSAY 183
Query: 195 GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDE 254
GNNGC GG + AF YI N GI TE YPY+AV C K + A+ + ++P+GDE
Sbjct: 184 GNNGCNGGLMDNAFKYIKDNDGIDTEKTYPYEAVDDKCRYNPKNSGAEDVGFVDIPAGDE 243
Query: 255 QALLKAV-SMQPVSIAIAAYSTEFQSYKEGI-FNGVCGTQ-LDHAVTIVGFGTTEDGANY 311
L+ A+ ++ PVS+AI A FQ Y +G+ ++ C ++ LDH V +VG+GT EDG +Y
Sbjct: 244 HKLMLALATVGPVSVAIDASQESFQLYSDGVYYDENCSSENLDHGVLVVGYGTDEDGGDY 303
Query: 312 WLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
WL+KNSWG +WGD GY+K+ R+ + CGI + +SYPL
Sbjct: 304 WLVKNSWGPSWGDEGYIKMARNRDNHCGIASSASYPL 340
>gi|119433808|gb|ABL74967.1| cysteine protease [Acanthamoeba castellanii]
Length = 330
Score = 249 bits (637), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 132/321 (41%), Positives = 186/321 (57%), Gaps = 9/321 (2%)
Query: 32 VVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKL 91
V S+ + + + WM H +SY +E E R +++EN +I++ N++ N +Y L
Sbjct: 15 VASTLAYKHDPLTGVFADWMRTHTKSYSNE-EFVFRWNVWRENYNFIQEENRK-NNSYYL 72
Query: 92 GTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPI 151
N+F DLTN EF +Y G +H + +P + DWR KGAVT +
Sbjct: 73 TMNKFGDLTNAEFNKVYKGLAFDYSAH--ILKAKAATPAAPAPGLPANFDWRQKGAVTHV 130
Query: 152 KNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-TNGNNGCLGGSREKAFAY 210
KNQ +CG CW+F+ + EG ++ G L+ LSEQ L+DCS + GNNGC GG + AF Y
Sbjct: 131 KNQGQCGSCWSFSTTGSTEGANFLKRGTLVSLSEQNLIDCSGSYGNNGCNGGLMDYAFEY 190
Query: 211 IIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAI 270
II N+GI TE YPY+ C + +++Y +V SGDE ALL AV+++P S+AI
Sbjct: 191 IINNKGIDTEASYPYETAQYNCRYNPANSGGSLTSYTDVSSGDENALLNAVAIEPTSVAI 250
Query: 271 AAYSTEFQSYKEGIF--NGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYM 328
A FQ Y G++ + TQLDH V VG+G TE+G +YWL+KNSWG WG GY+
Sbjct: 251 DASHNSFQFYSGGVYYESSCSSTQLDHGVLAVGWG-TENGQDYWLVKNSWGADWGLQGYI 309
Query: 329 KIVRD-EGLCGIGTRSSYPLA 348
K+ R+ CGI T +SYP A
Sbjct: 310 KMARNRHNNCGIATAASYPTA 330
>gi|162138968|ref|NP_001104662.1| uncharacterized protein LOC567623 precursor [Danio rerio]
gi|158254065|gb|AAI54241.1| Zgc:174153 protein [Danio rerio]
Length = 336
Score = 249 bits (637), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 140/343 (40%), Positives = 205/343 (59%), Gaps = 22/343 (6%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIH-EKWMAQHGRSYKDELEKEMRLKIFKENLE 76
MF +I L C S V ++ S Q ++ H W +QHG+SY +++E R+ I++ENL
Sbjct: 2 MFALIITL--CISAVFTAPSIDIQ--LDDHWNSWKSQHGKSYHEDVEVGRRM-IWEENLR 56
Query: 77 YIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
IE+ N E GN T+K+G NQF D+TN+EFR GYK + TS + S
Sbjct: 57 KIEQHNFEYSYGNHTFKMGMNQFGDMTNEEFRQAMNGYKHDP----NRTSQGPLFMEPSF 112
Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS- 192
P +DWR +G VTP+K+QK+CG CW+F++ A+EG ++G LI +SEQ L+DCS
Sbjct: 113 FAAPQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSR 172
Query: 193 TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPG-TCSAAQKPAAAKISNYEEVPS 251
GN GC GG + AF Y+ +N+G+ +E YPY A C + AK + + ++PS
Sbjct: 173 PQGNQGCNGGLMDLAFQYVKENKGLDSEQSYPYLARDDLPCRYDPRFNVAKSTGFVDIPS 232
Query: 252 GDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGIF--NGVCGTQLDHAVTIVGF---GTT 305
G+E AL+ AV ++ PVS+AI A Q Y+ GI+ ++LDHAV +VG+ G
Sbjct: 233 GNEPALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACSSSRLDHAVLVVGYGYQGAD 292
Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRDE-GLCGIGTRSSYPL 347
G YW++KNSW + WGD GY+ + +D+ CG+ T++SYPL
Sbjct: 293 VAGNRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGVATKASYPL 335
>gi|18858809|ref|NP_571273.1| cathepsin L, 1 b precursor [Danio rerio]
gi|1752664|emb|CAA69623.1| cathepsin L [Danio rerio]
Length = 336
Score = 249 bits (637), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 134/340 (39%), Positives = 203/340 (59%), Gaps = 17/340 (5%)
Query: 20 IIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIE 79
++ LLV+ V + + + + + W +QHG+SY +++E R+ I++ENL IE
Sbjct: 1 MMFALLVTLYISAVFAAPSIDIQLDDHWNSWKSQHGKSYHEDVEVGRRM-IWEENLRKIE 59
Query: 80 KANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDV 136
+ N E GN T+K+G NQF D+TN+EFR GY + TS + S
Sbjct: 60 QHNFEYSYGNHTFKMGMNQFGDMTNEEFRQAMNGYTHDP----NQTSQGPLFMEPSFFAA 115
Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-TNG 195
P +DWR +G VTP+K+QK+CG CW+F++ A+EG ++G LI +SEQ L+DCS G
Sbjct: 116 PQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQG 175
Query: 196 NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPG-TCSAAQKPAAAKISNYEEVPSGDE 254
N GC GG ++AF Y+ +N+G+ +E YPY A C + AKI+ + ++PSG+E
Sbjct: 176 NQGCNGGLMDQAFQYVKENKGLDSEQSYPYLARDDLPCRYDPRFNVAKITGFVDIPSGNE 235
Query: 255 QALLKAV-SMQPVSIAIAAYSTEFQSYKEGIF--NGVCGTQLDHAVTIVGF---GTTEDG 308
AL+ AV ++ PVS+AI A Q Y+ GI+ ++LDHAV +VG+ G G
Sbjct: 236 LALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACSSSRLDHAVLVVGYGYQGADVAG 295
Query: 309 ANYWLIKNSWGNTWGDAGYMKIVRDE-GLCGIGTRSSYPL 347
YW++KNSW + WGD GY+ + +D+ CG+ T++SYPL
Sbjct: 296 NRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGVATKASYPL 335
>gi|356545079|ref|XP_003540973.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 330
Score = 249 bits (636), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 131/296 (44%), Positives = 182/296 (61%), Gaps = 12/296 (4%)
Query: 29 ASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRT 88
ASQV + R+ + S+ E HE+WM+++G+ YKD E+E R +IFKEN+ YIE +N +
Sbjct: 5 ASQV-TCRTLQDASMYERHEEWMSRYGKVYKDPREREKRFRIFKENMNYIETSNNVAIKP 63
Query: 89 YKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAV 148
KL NQF+DL N+EF A +K + TF + P KGAV
Sbjct: 64 XKLVINQFADLNNEEFIAPRNIFKGMILCRFLSRKHTFPF--------PYVFLGHKKGAV 115
Query: 149 TPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKA 207
TP+K+Q CG CWAF VA+ EGI + +G LI LSEQ+L+DC T G + GC G + A
Sbjct: 116 TPVKDQGHCGFCWAFYDVASTEGILALTAGKLISLSEQELVDCDTKGVDQGCECGLMDDA 175
Query: 208 FAYIIQNQGIATEDEYPYQAVPGTCSAAQKP-AAAKISNYEEVPSGDEQALLKAVSMQPV 266
F +IIQN G+ + YPY+ V G C+A ++ AA I+ E+VP+ +E+AL K V+ QPV
Sbjct: 176 FKFIIQNHGVXDAN-YPYKGVDGKCNANEEANPAATITGXEDVPANNEKALQKVVANQPV 234
Query: 267 SIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTW 322
+AI A ++FQ YK G+F G C T+L+H VT +G+G + DG YWL+KNS W
Sbjct: 235 FVAIDACDSDFQFYKSGVFTGSCETELNHGVTTMGYGVSHDGTQYWLVKNSXETEW 290
>gi|254746340|emb|CAX16635.1| putative C1A cysteine protease precursor [Manduca sexta]
Length = 342
Score = 249 bits (636), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 132/319 (41%), Positives = 186/319 (58%), Gaps = 16/319 (5%)
Query: 46 IHEKWMA---QHGRSYKDELEKEMRLKIFKENLEYIEKANK---EGNRTYKLGTNQFSDL 99
+ E+W+A QH + Y E+E R+KI+ EN I K N+ +G +YKLG N+++D+
Sbjct: 24 VKEEWVAFKMQHDKKYDSEVEDRFRMKIYAENKHKIAKHNQLYEQGLVSYKLGPNKYTDM 83
Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM-----TDVPTSLDWRDKGAVTPIKNQ 154
+ EF GY + ++ + + P +DW KGAVT +K+Q
Sbjct: 84 LHHEFIQAMNGYNRTAKHNKGLYGKKHDVRGATFIPPAHVKYPDHVDWTKKGAVTEVKDQ 143
Query: 155 KECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDC-STNGNNGCLGGSREKAFAYIIQ 213
+CG CWAF+ A+EG +SG L+ LSEQ L+DC ST GNNGC GG + AF YI
Sbjct: 144 GKCGSCWAFSTTGALEGQHFRKSGYLVSLSEQNLIDCSSTYGNNGCNGGLMDNAFKYIKD 203
Query: 214 NQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAA 272
N GI TE YPY+ V C K + A+ + ++PSGDE+ L++AV ++ PVS+AI A
Sbjct: 204 NGGIDTEKTYPYEGVDDKCRYNPKNSGAEDVGFVDIPSGDEEKLMQAVATVGPVSVAIDA 263
Query: 273 YSTEFQSYKEGIFNGV--CGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKI 330
FQ Y G++ T LDH V +VG+GT E G +YWL+KNSW TWG+ GY+K+
Sbjct: 264 SQNSFQFYSGGVYYDTECSSTDLDHGVLVVGYGTDEAGGDYWLVKNSWSRTWGELGYIKM 323
Query: 331 VRD-EGLCGIGTRSSYPLA 348
R+ + CGI T +SYPL
Sbjct: 324 ARNRDNHCGIATDASYPLV 342
>gi|161408097|dbj|BAF94152.1| cathepsin L-like cysteine protease 2 [Plautia stali]
Length = 334
Score = 249 bits (636), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 131/303 (43%), Positives = 185/303 (61%), Gaps = 10/303 (3%)
Query: 54 HGRSYKDELEKEMRLKIFKENLEYIEKAN---KEGNRTYKLGTNQFSDLTNDEFRALYTG 110
H + Y +ELE+ R KIF EN + IEK N K+G ++KL N +D+ E+ +Y G
Sbjct: 34 HRKEYDNELEESYRKKIFLENKKRIEKHNSRYKQGKVSFKLKLNHLADMLIHEYSDVYLG 93
Query: 111 YKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVE 170
+ S ++ + S + + + + +DWR KGAVTP+KNQ CG CWAF+ A+E
Sbjct: 94 FNKSSKANNNKLQS-YTFIPPAHVTLNKEVDWRTKGAVTPVKNQGHCGSCWAFSTTGALE 152
Query: 171 GITKIRSGNLIQLSEQQLLDCS-TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVP 229
G ++G L+ LSEQ L+DCS + GNNGC GG + AF YI +N GI TE YPY+
Sbjct: 153 GQNFRKTGKLVSLSEQNLVDCSGSYGNNGCEGGLMDNAFQYIKENHGIDTEKSYPYEGED 212
Query: 230 GTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGI-FNG 287
TC + A S + ++ GDE+AL++AV ++ P+S+AI A FQ Y EG+ +
Sbjct: 213 ETCRFRKTSIGATDSGFVDITQGDEEALMQAVATIGPISVAIDASHQSFQFYSEGVYYEP 272
Query: 288 VCGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSY 345
C ++ LDH V +VG+G ED YWL+KNSWG WGD GY+K+ RD + CGI T++SY
Sbjct: 273 ECSSENLDHGVLVVGYG-VEDNQKYWLVKNSWGTQWGDGGYIKMARDQDNNCGIATQASY 331
Query: 346 PLA 348
PL
Sbjct: 332 PLV 334
>gi|262410743|gb|ACY66807.1| cathepsin L [Aphis gossypii]
Length = 341
Score = 249 bits (636), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 141/345 (40%), Positives = 199/345 (57%), Gaps = 18/345 (5%)
Query: 18 MFIIITL-LVSCASQVVSSRSTHEQSVVEIHEKW---MAQHGRSYKDELEKEMRLKIFKE 73
M ++I L LV A VSS + +E I E+W AQ + Y+D E+ R K++ +
Sbjct: 1 MKVVIVLGLVVFAISSVSSINLNEV----IEEEWSLFKAQFKKIYEDVKEEAFRKKVYLD 56
Query: 74 NLEYIEKANK---EGNRTYKLGTNQFSDLTNDEFRALYTGYK--MPSPSHRSTTSSTFKY 128
N I + NK G TY L N F DL E++ + G+K + T +
Sbjct: 57 NKLKIARHNKLYETGEETYALEMNHFGDLMQHEYKKMMNGFKPSLAGGDKNFTDDDAVTF 116
Query: 129 QNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQL 188
VP ++DWR KG VTP+KNQ +CG CW+F+A ++EG ++G L+ LSEQ L
Sbjct: 117 LKSENVVVPKAIDWRKKGYVTPVKNQGQCGSCWSFSATGSLEGQHFRKTGVLVSLSEQNL 176
Query: 189 LDCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYE 247
+DCS GNNGC GG + AF YI N+G+ TE YPY+A C + + A +
Sbjct: 177 IDCSRKYGNNGCEGGLMDLAFKYIKSNKGLDTEKSYPYEAEDDKCRYNPENSGATDKGFV 236
Query: 248 EVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGIF-NGVC-GTQLDHAVTIVGFGT 304
++P GDE AL+ A+ ++ PVSIAI A S +FQ YK+G+F N C T+LDH V VG+GT
Sbjct: 237 DIPEGDEDALMHALATVGPVSIAIDASSEKFQFYKKGVFYNPRCSSTELDHGVLAVGYGT 296
Query: 305 TEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPLA 348
G +YW++KNSWG TWGD GY+ + R+ + CG+ + +SYPL
Sbjct: 297 DHKGGDYWIVKNSWGKTWGDQGYIMMARNKKNNCGVASSASYPLV 341
>gi|340371596|ref|XP_003384331.1| PREDICTED: digestive cysteine proteinase 2-like [Amphimedon
queenslandica]
Length = 327
Score = 249 bits (636), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 128/333 (38%), Positives = 196/333 (58%), Gaps = 17/333 (5%)
Query: 19 FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
F+ + LL+ S V+ E W ++G++Y+ E MR KI+ +N +Y+
Sbjct: 9 FVAVLLLIGLVSAAVND--------AEEWRLWKGKYGKTYRSIYEDNMRQKIWLQNRDYV 60
Query: 79 EKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPT 138
+ N + +++L N+F+DLT +EF ++Y GY ++ ++Y + +P
Sbjct: 61 NEHNSM-DSSFQLEVNEFADLTAEEFSSIYNGYGKGRNRENHENTTIYRYTGGA---IPD 116
Query: 139 SLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNG 198
S+DWR KG VTP+KNQK+CG CWAF+ ++EG ++G L+ LSEQ L+DC ++G
Sbjct: 117 SVDWRTKGLVTPVKNQKQCGSCWAFSTTGSLEGAHAKKTGKLVSLSEQNLVDCDKK-DHG 175
Query: 199 CLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALL 258
C GG AF YI +N+GI TE+ YPY+A G C + A + + + + D +AL
Sbjct: 176 CQGGLMTTAFKYIEENKGIDTEESYPYKAKNGRCEFKKDDIGATVERHVSILTTDCEALK 235
Query: 259 KAVS-MQPVSIAIAAYSTEFQSYKEGIFN-GVCGT-QLDHAVTIVGFGTTEDGANYWLIK 315
KAV+ + P+S+A+ A + FQ YK GI++ +C + +LDH V +VG+G EDG YWL+K
Sbjct: 236 KAVAEIGPISVAMDASHSSFQLYKSGIYDPKICSSRKLDHGVLVVGYG-KEDGEEYWLVK 294
Query: 316 NSWGNTWGDAGYMKIVRDEGLCGIGTRSSYPLA 348
NSWG WG GY KI + LCGI T + YP+
Sbjct: 295 NSWGKNWGMEGYFKIASKKNLCGICTSACYPVV 327
>gi|156124996|gb|ABU50816.1| Ale o 1 allergen [Aleuroglyphus ovatus]
Length = 337
Score = 249 bits (635), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 129/308 (41%), Positives = 195/308 (63%), Gaps = 13/308 (4%)
Query: 48 EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEF 104
E++ + GR Y + R IF+ NL++I + N + G+ T+ + N F+DL+N+EF
Sbjct: 34 EQFKSTFGRVYPSPEIELHRKSIFRANLQFILRHNIDYFNGDSTFSVSVNNFTDLSNEEF 93
Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFA 164
RA + GY+ + + + + + + + +P ++DW KG VTPIKNQ++CG CWAF+
Sbjct: 94 RATFNGYRRLA----AVSLADSVHADNDVEALPATVDWTTKGVVTPIKNQQQCGSCWAFS 149
Query: 165 AVAAVEGITKIRSGNLIQLSEQQLLDCS-TNGNNGCLGGSREKAFAYIIQNQGIATEDEY 223
AVA++EG +++G L+ LSEQ L+DCS G+ GC GG + AF Y+IQN+GI TE Y
Sbjct: 150 AVASMEGQHALKTGKLVSLSEQNLVDCSAAEGDMGCSGGWMDYAFKYVIQNRGIDTEASY 209
Query: 224 PYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKE 282
PY+A+ +C + A I ++ +V +GDE AL AV S+ P+S+AI A FQ Y
Sbjct: 210 PYKAIDESCEFKRNSIGATIHSFVDVKTGDESALQNAVASIGPISVAIDASQPSFQFYSS 269
Query: 283 GIFNGV-CGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGI 339
G++N C T+ LDH VT VG+GT +G YW +KNSWG +WG GY+ + R+ + CGI
Sbjct: 270 GVYNEPDCSTEILDHGVTAVGYGTL-NGVPYWKVKNSWGTSWGQKGYIFMSRNKQNQCGI 328
Query: 340 GTRSSYPL 347
T++SYP+
Sbjct: 329 ATKASYPV 336
>gi|427797099|gb|JAA64001.1| Putative cathepsin l cathepsin l, partial [Rhipicephalus
pulchellus]
Length = 331
Score = 249 bits (635), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 134/318 (42%), Positives = 191/318 (60%), Gaps = 9/318 (2%)
Query: 38 THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRT---YKLGTN 94
THE+ V + A HG+ Y+ + E+ RLKI+ EN I + N++ ++ YKL N
Sbjct: 14 THEELVGAEWSAFKALHGKEYESDTEEYYRLKIYMENRLKIARHNEKYAKSQVSYKLAMN 73
Query: 95 QFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQ 154
+F D+ + EF + G+K S + + L +P ++DWR KGAVTP+KNQ
Sbjct: 74 EFGDMLHHEFVSTRNGFKRNYRDTPREGSFFVEPEGLEDFHLPKTVDWRKKGAVTPVKNQ 133
Query: 155 KECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQ 213
+CG CW+F+ ++EG + L+ LSEQ L+DCS + GNNGC GG + AF YI
Sbjct: 134 GQCGSCWSFSTTGSLEGQHFRKLHKLVSLSEQNLIDCSRSFGNNGCEGGLMDYAFKYIKA 193
Query: 214 NQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAA 272
N+GI TE YPY A G C + A + + ++P GDE L KAV ++ PVS+AI A
Sbjct: 194 NKGIDTEQSYPYNATDGVCHFNKSAVGATDTGFVDIPEGDENKLKKAVATVGPVSVAIDA 253
Query: 273 YSTEFQSYKEGIFNGV-CGT-QLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKI 330
FQ Y EG+++ C + QLDH V +VG+G T+DG +YWL+KNSWG TWGD GY+ +
Sbjct: 254 SHESFQFYSEGVYDEPECDSEQLDHGVLVVGYG-TKDGQDYWLVKNSWGTTWGDGGYIYM 312
Query: 331 VRD-EGLCGIGTRSSYPL 347
R+ + CGI + +SYPL
Sbjct: 313 SRNKDNQCGIASAASYPL 330
>gi|224079085|ref|XP_002305743.1| predicted protein [Populus trichocarpa]
gi|222848707|gb|EEE86254.1| predicted protein [Populus trichocarpa]
Length = 494
Score = 249 bits (635), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 134/317 (42%), Positives = 195/317 (61%), Gaps = 14/317 (4%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI-EKANKEGNRTYKLGTNQFSD 98
++S++EI ++W +H ++YK E E R FK NL+YI EK KE +++G N+F+D
Sbjct: 36 DESIIEIFQQWRDRHQKAYKHAEEAEKRFGNFKRNLKYIIEKTGKETTLRHRVGLNKFAD 95
Query: 99 LTNDEFRALYTGYKMPSPSHRSTTSSTFK-YQNLSMTDVPTSLDWRDKGAVTPIKNQKEC 157
L+N+EF+ LY K+ P +++ + + +NL D P+SLDWR KG VT +K+Q +C
Sbjct: 96 LSNEEFKQLYLS-KVKKPINKTRIDAEDRSRRNLQSCDAPSSLDWRKKGVVTAVKDQGDC 154
Query: 158 GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
G CW+F+ A+EGI I + +LI LSEQ+L+DC T N GC GG + AF ++I N GI
Sbjct: 155 GSCWSFSTTGAIEGINAIVTSDLISLSEQELVDCDTT-NYGCEGGYMDYAFEWVINNGGI 213
Query: 218 ATEDEYPYQAVPGTCSAAQKP-AAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTE 276
TE YPY V GTC+ A++ I Y++V D ALL A + QP+S+ I + +
Sbjct: 214 DTEANYPYTGVDGTCNTAKEEIKVVSIDGYKDVDETD-SALLCAAAQQPISVGIDGSAID 272
Query: 277 FQSYKEGIF---NGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD 333
FQ Y GI+ +DHAV IVG+G +E+G +YW++KNSWG +WG GY I R+
Sbjct: 273 FQLYTGGIYDGDCSDDPDDIDHAVLIVGYG-SENGEDYWIVKNSWGTSWGIEGYFYIKRN 331
Query: 334 E----GLCGIGTRSSYP 346
G+C I +SYP
Sbjct: 332 TDLPYGVCAINAMASYP 348
>gi|110625773|ref|NP_081620.2| cathepsin L-like 3 precursor [Mus musculus]
gi|74208432|dbj|BAE26401.1| unnamed protein product [Mus musculus]
gi|187955662|gb|AAI47425.1| RIKEN cDNA 2310051M13 gene [Mus musculus]
gi|187957686|gb|AAI47424.1| RIKEN cDNA 2310051M13 gene [Mus musculus]
Length = 331
Score = 249 bits (635), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 138/341 (40%), Positives = 199/341 (58%), Gaps = 19/341 (5%)
Query: 16 TPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENL 75
TP+F++ TL + VVS+ H S+ + E+W +H ++Y E + R +++ N
Sbjct: 2 TPVFLLATLCLG----VVSAAPAHNPSLDAVWEEWKTKHKKTYNMNDEGQKR-AVWENNK 56
Query: 76 EYIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
+ I+ N++ G + L N F DLTN EFR L TG++ + T +Q
Sbjct: 57 KMIDLHNEDYLKGKHGFSLEMNAFGDLTNTEFRELMTGFQ-----GQKTKMMMKVFQEPL 111
Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
+ DVP S+DWRD G VTP+K+Q CG CWAF+AV ++EG ++G L+ LS Q L+DCS
Sbjct: 112 LGDVPKSVDWRDHGYVTPVKDQGSCGSCWAFSAVGSLEGQMFRKTGKLVPLSVQNLVDCS 171
Query: 193 -TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPS 251
+ GN GC GG + AF Y+ N G+ T YPY+A+ GTC K +AA ++ + V S
Sbjct: 172 WSQGNQGCDGGLPDLAFQYVKDNGGLDTSVSYPYEALNGTCRYNPKNSAATVTGFVNVQS 231
Query: 252 GDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGIF--NGVCGTQLDHAVTIVGFGTTEDG 308
E AL+KAV ++ P+S+ I FQ YKEG++ T LDHAV +VG+G DG
Sbjct: 232 -SEDALMKAVATVGPISVGIDTKHKSFQFYKEGMYYEPDCSSTVLDHAVLVVGYGEESDG 290
Query: 309 ANYWLIKNSWGNTWGDAGYMKIVRDE-GLCGIGTRSSYPLA 348
YWL+KNSWG WG GY+K+ +D CGI + +SYP+
Sbjct: 291 RKYWLVKNSWGRDWGMNGYIKMAKDRNNNCGIASDASYPVV 331
>gi|156371477|ref|XP_001628790.1| predicted protein [Nematostella vectensis]
gi|156215775|gb|EDO36727.1| predicted protein [Nematostella vectensis]
Length = 330
Score = 248 bits (634), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 136/335 (40%), Positives = 193/335 (57%), Gaps = 13/335 (3%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
+ + LL + AS V EQ + W H + Y E+ R I+++NL+
Sbjct: 3 LLVAACLLFAVASGFVVKFDEDEQQW----QAWKLFHTKKYTTVTEEGARKAIWRDNLKK 58
Query: 78 IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
I+K N EG+ ++ L N DLT DEFR YTG + ++ S F S VP
Sbjct: 59 IQKHNAEGH-SFTLAMNHLGDLTQDEFRYFYTGMRSHYSNYTKKQGSAFLAP--SHVQVP 115
Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GN 196
++DWR +G VTP+KNQ +CG CWAF+ ++EG ++G L+ LSEQ L+DCST GN
Sbjct: 116 DTVDWRKEGYVTPVKNQGQCGSCWAFSTTGSLEGQNFKKTGKLVSLSEQNLVDCSTAYGN 175
Query: 197 NGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQA 256
NGC GG + AF YI +N GI TE+ YPY+A C + A + + +V GDE+A
Sbjct: 176 NGCQGGLMDYAFKYIKENGGIDTEESYPYEARNDRCRFQKSNIGAVDTGFVDVTHGDEEA 235
Query: 257 LLKAV-SMQPVSIAIAAYSTEFQSYKEGIFN--GVCGTQLDHAVTIVGFGTTEDGANYWL 313
L A ++ P+S+AI A FQ Y G++N G T LDH V +VG+GT + G++YWL
Sbjct: 236 LKTAAGTVGPISVAIDAGHMSFQFYHSGVYNNAGCSSTSLDHGVLVVGYGTYQ-GSDYWL 294
Query: 314 IKNSWGNTWGDAGYMKIVRDE-GLCGIGTRSSYPL 347
+KNSWG WG GY+ + R++ CG+ T++SYPL
Sbjct: 295 VKNSWGERWGMEGYIMMSRNKNNQCGVATQASYPL 329
>gi|209693435|ref|NP_001129410.1| cathepsin L precursor [Acyrthosiphon pisum]
gi|251823771|ref|NP_001156569.1| cathepsin L precursor [Acyrthosiphon pisum]
Length = 341
Score = 248 bits (634), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 138/342 (40%), Positives = 197/342 (57%), Gaps = 17/342 (4%)
Query: 20 IIITLLVSCASQVVSSRSTHEQSVVEIHEKW---MAQHGRSYKDELEKEMRLKIFKENLE 76
+I+ LV+ A VSS + +E I E+W Q + Y+D E+ R K++ +N
Sbjct: 4 VIVLGLVAFAISTVSSINLNEV----IEEEWSLFKIQFKKLYEDIKEETFRKKVYLDNKL 59
Query: 77 YIEKANK---EGNRTYKLGTNQFSDLTNDEFRALYTGYK--MPSPSHRSTTSSTFKYQNL 131
I + NK G TY L N F DL E+ + G+K + T +
Sbjct: 60 KIARHNKLYESGEETYALEMNHFGDLMQHEYTKMMNGFKPSLAGGDRNFTNDEAVTFLKS 119
Query: 132 SMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDC 191
+P S+DWR KG VTP+KNQ +CG CW+F+A ++EG ++G L+ LSEQ L+DC
Sbjct: 120 ENVVIPKSVDWRKKGYVTPVKNQGQCGSCWSFSATGSLEGQHFRKTGVLVSLSEQNLIDC 179
Query: 192 STN-GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVP 250
S GNNGC GG + AF YI N+G+ TE YPY+A C + + A + ++P
Sbjct: 180 SRKYGNNGCEGGLMDLAFKYIKSNKGLDTEKSYPYEAEDDKCRYNPENSGATDKGFVDIP 239
Query: 251 SGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGIF-NGVC-GTQLDHAVTIVGFGTTED 307
GDE AL+ A+ ++ PVSIAI A S +FQ YK+G+F N C T+LDH V VGFG+ +
Sbjct: 240 EGDEDALMHALATVGPVSIAIDASSEKFQFYKKGVFYNPRCSSTELDHGVLAVGFGSDKK 299
Query: 308 GANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPLA 348
G +YW++KNSWG TWGD GY+ + R+ + CG+ + +SYPL
Sbjct: 300 GGDYWIVKNSWGKTWGDEGYIMMARNKKNNCGVASSASYPLV 341
>gi|330803820|ref|XP_003289900.1| hypothetical protein DICPUDRAFT_80649 [Dictyostelium purpureum]
gi|325080011|gb|EGC33585.1| hypothetical protein DICPUDRAFT_80649 [Dictyostelium purpureum]
Length = 328
Score = 248 bits (634), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 139/339 (41%), Positives = 200/339 (58%), Gaps = 26/339 (7%)
Query: 19 FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
+ L+V+C S ++R ++ + WM +H +SY ++ E R IF++N++++
Sbjct: 7 LVFCFLIVNCIS---AARVFSQKQYQTAFQNWMVKHQKSYTND-EFGSRYTIFQDNMDFV 62
Query: 79 EKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNL--SMTDV 136
K N++G+ T LG N +DLTN E++ +Y G T +T K NL +TDV
Sbjct: 63 TKWNQKGSDTI-LGLNSMADLTNQEYQRIYLG-----------TKTTVKKPNLIIGVTDV 110
Query: 137 ---PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS- 192
P S+DWR GAVT +KNQ +CG C++F+ +VEGI +I S L+ LSEQQ+LDCS
Sbjct: 111 SKAPASVDWRANGAVTAVKNQGQCGGCYSFSTTGSVEGIHEITSKQLVSLSEQQILDCSG 170
Query: 193 TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSG 252
+ GNNGC GG +F YII G+ TE YPY+ V G C + A I+ Y+ V SG
Sbjct: 171 SEGNNGCDGGLMTNSFEYIIAVGGLDTEASYPYEGVVGKCKFNKANIGATITGYKNVKSG 230
Query: 253 DEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFN--GVCGTQLDHAVTIVGFGTTEDGAN 310
E L AV+ QPVS+AI A FQ Y G++ TQLDH V VG+G ++ G +
Sbjct: 231 SESDLQTAVAAQPVSVAIDASQNSFQLYSSGVYYEPACSSTQLDHGVLAVGYG-SQSGQD 289
Query: 311 YWLIKNSWGNTWGDAGYMKIVRDE-GLCGIGTRSSYPLA 348
YW++KNSWG WG+ G++ + R++ CGI T +SYP A
Sbjct: 290 YWIVKNSWGADWGEKGFILMARNKHNNCGIATMASYPTA 328
>gi|392884266|gb|AFM90965.1| cathepsin L [Callorhinchus milii]
Length = 338
Score = 248 bits (634), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 135/312 (43%), Positives = 190/312 (60%), Gaps = 16/312 (5%)
Query: 48 EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEF 104
E+W + HG+SY ++ E+ R +++E+L IE N E G +++LG N F D+ N+EF
Sbjct: 30 EQWKSWHGKSY-EQKEETWRRMVWEEHLRVIEIHNLEHSLGKHSFRLGMNHFGDMPNEEF 88
Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFA 164
R L GYK +H+ S F N +VP +DWRD+G VTP+K+Q +CG CWAF+
Sbjct: 89 RQLMNGYKYKQ-THKKLQGSHFLEPNF--LEVPKHVDWRDEGYVTPVKDQGQCGSCWAFS 145
Query: 165 AVAAVEGITKIRSGNLIQLSEQQLLDCST-NGNNGCLGGSREKAFAYIIQNQGIATEDEY 223
A+EG R+G L+ LSEQ L++CS GN GC GG ++AF Y+ N GI +ED Y
Sbjct: 146 TTGALEGQHFRRTGQLVSLSEQNLVECSKPEGNEGCNGGLMDQAFQYVKDNGGIDSEDSY 205
Query: 224 PYQAVPGT-CSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYK 281
PY T C + AA + + ++PSG E+AL+KA+ ++ PVS+AI A T FQ Y+
Sbjct: 206 PYVGTDDTPCHYNPQYNAANDTGFVDIPSGKERALMKAIAAVGPVSVAIDAGHTSFQFYQ 265
Query: 282 EGI-FNGVC-GTQLDHAVTIVGFGTTE---DGANYWLIKNSWGNTWGDAGYMKIVRD-EG 335
GI F C T LDH V +VG+G + DG YW++KNSW WG GY+ + +D +
Sbjct: 266 SGIYFEAECSSTDLDHGVLVVGYGVEKRDTDGKKYWIVKNSWSEKWGQNGYILMAKDKDN 325
Query: 336 LCGIGTRSSYPL 347
CGI T +SYPL
Sbjct: 326 HCGIATAASYPL 337
>gi|81294188|gb|AAI08032.1| Cathepsin L, 1 b [Danio rerio]
Length = 336
Score = 248 bits (633), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 134/340 (39%), Positives = 202/340 (59%), Gaps = 17/340 (5%)
Query: 20 IIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIE 79
++ LLV+ V + + + + + W +QHG+SY +++E R+ I++ENL IE
Sbjct: 1 MMFALLVTLYISAVFAAPSIDIQLDDHWNSWKSQHGKSYHEDVEVGRRM-IWEENLRKIE 59
Query: 80 KANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDV 136
+ N E GN T+K+G NQF D+TN+EFR GY + TS + S
Sbjct: 60 QHNFEYSYGNHTFKMGMNQFGDMTNEEFRQAMNGYTHDP----NQTSQGPLFMEPSFFAA 115
Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-TNG 195
P +DWR +G VTP+K+QK+CG CW+F++ A+EG ++G LI +SEQ L+DCS G
Sbjct: 116 PQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQG 175
Query: 196 NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPG-TCSAAQKPAAAKISNYEEVPSGDE 254
N GC GG + AF Y+ +N+G+ +E YPY A C + AKI+ + ++PSG+E
Sbjct: 176 NQGCNGGLMDLAFQYVKENKGLDSEQSYPYLARDDLPCRYDPRFNVAKITGFVDIPSGNE 235
Query: 255 QALLKAV-SMQPVSIAIAAYSTEFQSYKEGIF--NGVCGTQLDHAVTIVGF---GTTEDG 308
AL+ AV ++ PVS+AI A Q Y+ GI+ ++LDHAV +VG+ G G
Sbjct: 236 LALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACSSSRLDHAVLVVGYGYQGADVAG 295
Query: 309 ANYWLIKNSWGNTWGDAGYMKIVRDE-GLCGIGTRSSYPL 347
YW++KNSW + WGD GY+ + +D+ CG+ T++SYPL
Sbjct: 296 NRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGVATKASYPL 335
>gi|728637|emb|CAA59441.1| cathepsin l [Litopenaeus vannamei]
Length = 326
Score = 248 bits (633), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 127/313 (40%), Positives = 191/313 (61%), Gaps = 17/313 (5%)
Query: 46 IHEKWM---AQHGRSYKDELEKEMRLKIFKENLEYIEKAN---KEGNRTYKLGTNQFSDL 99
+ ++W A+HGR Y E+ RL +F++N ++I+ N + G T+ L NQF D+
Sbjct: 19 LRQQWQNFKAEHGRRYASVQEERYRLSVFEQNQQFIDDHNARFENGEVTFTLQMNQFGDM 78
Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGC 159
T++E A G+ + +P+ R ++ K + ++ P +DWR KGAVTP+K+QK+CG
Sbjct: 79 TSEEIVATMNGF-LGAPTRRP--AAVLKADDETL---PEKVDWRTKGAVTPVKDQKQCGS 132
Query: 160 CWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIA 218
CWAF+ ++EG ++ G L+ LSEQ L+DCS GN GC+GG ++AF YI N+GI
Sbjct: 133 CWAFSTTGSLEGQHFLKDGKLVSLSEQNLVDCSDKFGNMGCMGGLMDQAFRYIKANKGID 192
Query: 219 TEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEF 277
TED YPY+A G C A + Y +V G E AL KAV ++ P+S+ I A + F
Sbjct: 193 TEDSYPYEAQDGKCRFDASNVGATDTGYVDVEHGSESALKKAVATIGPISVGIDASQSTF 252
Query: 278 QSYKEGIFNG--VCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE- 334
Y G+++ T LDH V VG+G+ E+G ++WL+KNSW +WGD GY+K+ R+
Sbjct: 253 HFYHTGVYHDDHCSSTMLDHGVLAVGYGSDENGGDFWLVKNSWNTSWGDKGYIKMSRNRN 312
Query: 335 GLCGIGTRSSYPL 347
CGI +++SYPL
Sbjct: 313 NNCGIASQASYPL 325
>gi|1709576|sp|P05994.3|PAPA4_CARPA RecName: Full=Papaya proteinase 4; AltName: Full=Glycyl
endopeptidase; AltName: Full=Papaya peptidase B;
AltName: Full=Papaya proteinase IV; Short=PPIV; Flags:
Precursor
gi|953176|emb|CAA54974.1| proteinase IV [Carica papaya]
Length = 348
Score = 248 bits (633), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 137/315 (43%), Positives = 194/315 (61%), Gaps = 12/315 (3%)
Query: 38 THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
T + ++++ WM +H ++YK+ EK R +IFK+NL+YI++ NK N Y LG N+FS
Sbjct: 39 TSTERLIQLFNSWMLKHNKNYKNVDEKLYRFEIFKDNLKYIDERNKMIN-GYWLGLNEFS 97
Query: 98 DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKEC 157
DL+NDEF+ Y G P + ++ N + D+P S+DWR KGAVTP+K+Q C
Sbjct: 98 DLSNDEFKEKYVG---SLPEDYTNQPYDEEFVNEDIVDLPESVDWRAKGAVTPVKHQGYC 154
Query: 158 GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
CWAF+ VA VEGI KI++GNL++LSEQ+L+DC + GC G + + Y+ QN GI
Sbjct: 155 ESCWAFSTVATVEGINKIKTGNLVELSEQELVDCDKQ-SYGCNRGYQSTSLQYVAQN-GI 212
Query: 218 ATEDEYPYQAVPGTCSAAQKPAA-AKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTE 276
+YPY A TC A Q K + V S +E +LL A++ QPVS+ + + +
Sbjct: 213 HLRAKYPYIAKQQTCRANQVGGPKVKTNGVGRVQSNNEGSLLNAIAHQPVSVVVESAGRD 272
Query: 277 FQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR---- 332
FQ+YK GIF G CGT++DHAVT VG+G + LIKNSWG WG+ GY++I R
Sbjct: 273 FQNYKGGIFEGSCGTKVDHAVTAVGYGKSGGKGYI-LIKNSWGPGWGENGYIRIRRASGN 331
Query: 333 DEGLCGIGTRSSYPL 347
G+CG+ S YP+
Sbjct: 332 SPGVCGVYRSSYYPI 346
>gi|269954686|ref|NP_954599.2| uncharacterized protein LOC218275 precursor [Mus musculus]
Length = 330
Score = 248 bits (633), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 138/339 (40%), Positives = 200/339 (58%), Gaps = 20/339 (5%)
Query: 16 TPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENL 75
TP+F++ TL + VVS+ H+ S+ + E+W +H ++Y E + R +++ N+
Sbjct: 2 TPVFLLATLCLG----VVSAAPAHDPSLDAVWEEWKTKHRKTYNMNEEAQKR-AVWENNM 56
Query: 76 EYIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
+ I N++ G + L N F DLTN EFR L TG++ S H+ T +Q
Sbjct: 57 KMIGLHNEDYLKGKHGFNLEMNAFGDLTNTEFRELMTGFQ--SMGHKEMTI----FQEPL 110
Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
+ DVP S+DWRD G VTP+K+Q CG CWAF+AV ++EG ++G L+ LSEQ L+DCS
Sbjct: 111 LGDVPKSVDWRDHGYVTPVKDQGHCGSCWAFSAVGSLEGQIFRKTGKLVPLSEQNLMDCS 170
Query: 193 -TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPS 251
+ GN GC GG E AF Y+ +N+G+ T + Y Y+A G C K +A I+ + +VP
Sbjct: 171 WSYGNVGCNGGLMELAFQYVKENRGLDTRESYAYEAWDGPCRYDPKYSAVNITGFVKVPL 230
Query: 252 GDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGIF--NGVCGTQLDHAVTIVGFGTTEDG 308
E AL+ AV S+ PVS+ I + F+ Y+ G + T LDHAV +VG+G DG
Sbjct: 231 -SEDALMNAVASVGPVSVGIDTHHHSFRFYRGGTYYEPDCSSTNLDHAVLVVGYGEESDG 289
Query: 309 ANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYP 346
YWL+KNSWG WG GY+K+ +D + CGI T + YP
Sbjct: 290 RKYWLVKNSWGEDWGMDGYIKMAKDRDNNCGIATYAIYP 328
>gi|74211558|dbj|BAE26509.1| unnamed protein product [Mus musculus]
Length = 338
Score = 248 bits (633), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 138/339 (40%), Positives = 200/339 (58%), Gaps = 20/339 (5%)
Query: 16 TPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENL 75
TP+F++ TL + VVS+ H+ S+ + E+W +H ++Y E + R +++ N+
Sbjct: 10 TPVFLLATLCLG----VVSAAPAHDPSLDAVWEEWKTKHRKTYNMNEEAQKR-AVWENNM 64
Query: 76 EYIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
+ I N++ G + L N F DLTN EFR L TG++ S H+ T +Q
Sbjct: 65 KMIGLHNEDYLKGKHGFNLEMNAFGDLTNTEFRELMTGFQ--SMGHKEMTI----FQEPL 118
Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
+ DVP S+DWRD G VTP+K+Q CG CWAF+AV ++EG ++G L+ LSEQ L+DCS
Sbjct: 119 LGDVPKSVDWRDHGYVTPVKDQGHCGSCWAFSAVGSLEGQIFRKTGKLVPLSEQNLMDCS 178
Query: 193 -TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPS 251
+ GN GC GG E AF Y+ +N+G+ T + Y Y+A G C K +A I+ + +VP
Sbjct: 179 WSYGNVGCNGGLMELAFQYVKENRGLDTRESYAYEAWDGPCRYDPKYSAVNITGFVKVPL 238
Query: 252 GDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGIF--NGVCGTQLDHAVTIVGFGTTEDG 308
E AL+ AV S+ PVS+ I + F+ Y+ G + T LDHAV +VG+G DG
Sbjct: 239 -SEDALMNAVASVGPVSVGIDTHHHSFRFYRGGTYYEPDCSSTNLDHAVLVVGYGEESDG 297
Query: 309 ANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYP 346
YWL+KNSWG WG GY+K+ +D + CGI T + YP
Sbjct: 298 RKYWLVKNSWGEDWGMDGYIKMAKDRDNNCGIATYAIYP 336
>gi|186701255|gb|ACC91281.1| putative cysteine proteinase [Capsella rubella]
Length = 324
Score = 248 bits (633), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 140/342 (40%), Positives = 200/342 (58%), Gaps = 42/342 (12%)
Query: 15 TTPMFIIITLLVSCASQ--VVSSRSTHEQSVVEIHEKWMAQHGRSYKDEL-EKEMRLKIF 71
T + II L S A V S + V I + WM++HG++Y + L +KE R + F
Sbjct: 11 TLSLLIIFLLPPSSAMDLSVTSGGLRSNEEVGFIFQTWMSKHGKTYTNALGDKEQRFQNF 70
Query: 72 KENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNL 131
K+NL +I++ N + N +Y+LG QF+DLT E++ L++G + + T +Y L
Sbjct: 71 KDNLRFIDQHNAK-NLSYRLGLTQFADLTVQEYQDLFSGRPI---QKQKALRVTHRYVPL 126
Query: 132 SMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDC 191
+ +P S+DWR KGAV+ IK+Q C VE I KI +G LI LSEQ+L+DC
Sbjct: 127 AEDQLPQSVDWRQKGAVSEIKDQGRC----------TVESINKIVTGELISLSEQELVDC 176
Query: 192 STNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA--AAKISNYEEV 249
S + N+GC GG + AF ++I N G+ + +YPYQAV G C+ Q + KI YE+V
Sbjct: 177 SID-NHGCNGGLMDSAFQFLINNNGLEYQSDYPYQAVQGYCNHNQNTSKKVIKIDGYEDV 235
Query: 250 PSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGA 309
P+ +E +L KAV+ QP GI+ G CGT LDHAV IVG+GT E+G
Sbjct: 236 PANNENSLQKAVAHQP-----------------GIYTGPCGTDLDHAVVIVGYGT-ENGQ 277
Query: 310 NYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
+YW+++NSWG WG+AGY KI R+ G+CGI +SYP+
Sbjct: 278 DYWIVRNSWGTVWGEAGYAKIARNFENPTGVCGIAMVASYPI 319
>gi|148709355|gb|EDL41301.1| cDNA sequence BC051665 [Mus musculus]
Length = 349
Score = 248 bits (633), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 138/339 (40%), Positives = 200/339 (58%), Gaps = 20/339 (5%)
Query: 16 TPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENL 75
TP+F++ TL + VVS+ H+ S+ + E+W +H ++Y E + R +++ N+
Sbjct: 21 TPVFLLATLCLG----VVSAAPAHDPSLDAVWEEWKTKHRKTYNMNEEAQKR-AVWENNM 75
Query: 76 EYIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
+ I N++ G + L N F DLTN EFR L TG++ S H+ T +Q
Sbjct: 76 KMIGLHNEDYLKGKHGFNLEMNAFGDLTNTEFRELMTGFQ--SMGHKEMTI----FQEPL 129
Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
+ DVP S+DWRD G VTP+K+Q CG CWAF+AV ++EG ++G L+ LSEQ L+DCS
Sbjct: 130 LGDVPKSVDWRDHGYVTPVKDQGHCGSCWAFSAVGSLEGQIFRKTGKLVPLSEQNLMDCS 189
Query: 193 -TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPS 251
+ GN GC GG E AF Y+ +N+G+ T + Y Y+A G C K +A I+ + +VP
Sbjct: 190 WSYGNVGCNGGLMELAFQYVKENRGLDTRESYAYEAWDGPCRYDPKYSAVNITGFVKVPL 249
Query: 252 GDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGIF--NGVCGTQLDHAVTIVGFGTTEDG 308
E AL+ AV S+ PVS+ I + F+ Y+ G + T LDHAV +VG+G DG
Sbjct: 250 -SEDALMNAVASVGPVSVGIDTHHHSFRFYRGGTYYEPDCSSTNLDHAVLVVGYGEESDG 308
Query: 309 ANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYP 346
YWL+KNSWG WG GY+K+ +D + CGI T + YP
Sbjct: 309 RKYWLVKNSWGEDWGMDGYIKMAKDRDNNCGIATYAIYP 347
>gi|30388235|gb|AAH51665.1| CDNA sequence BC051665 [Mus musculus]
Length = 330
Score = 248 bits (633), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 138/339 (40%), Positives = 200/339 (58%), Gaps = 20/339 (5%)
Query: 16 TPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENL 75
TP+F++ TL + VVS+ H+ S+ + E+W +H ++Y E + R +++ N+
Sbjct: 2 TPVFLLATLCLG----VVSAAPAHDPSLDAVWEEWKTKHRKTYSMNEEAQKR-AVWENNM 56
Query: 76 EYIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
+ I N++ G + L N F DLTN EFR L TG++ S H+ T +Q
Sbjct: 57 KMIGLHNEDYLKGKHGFNLEMNAFGDLTNTEFRELMTGFQ--SMGHKEMTI----FQEPL 110
Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
+ DVP S+DWRD G VTP+K+Q CG CWAF+AV ++EG ++G L+ LSEQ L+DCS
Sbjct: 111 LGDVPKSVDWRDHGYVTPVKDQGHCGSCWAFSAVGSLEGQIFRKTGKLVPLSEQNLMDCS 170
Query: 193 -TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPS 251
+ GN GC GG E AF Y+ +N+G+ T + Y Y+A G C K +A I+ + +VP
Sbjct: 171 WSYGNVGCNGGLMELAFQYVKENRGLDTRESYAYEAWDGPCRYDPKYSAVNITGFVKVPL 230
Query: 252 GDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGIF--NGVCGTQLDHAVTIVGFGTTEDG 308
E AL+ AV S+ PVS+ I + F+ Y+ G + T LDHAV +VG+G DG
Sbjct: 231 -SEDALMNAVASVGPVSVGIDTHHHSFRFYRGGTYYEPDCSSTNLDHAVLVVGYGEESDG 289
Query: 309 ANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYP 346
YWL+KNSWG WG GY+K+ +D + CGI T + YP
Sbjct: 290 RKYWLVKNSWGEDWGMDGYIKMAKDRDNNCGIATYAIYP 328
>gi|94480716|emb|CAI91577.1| cathepsin L [Aphrocallistes vastus]
Length = 329
Score = 248 bits (633), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 138/333 (41%), Positives = 201/333 (60%), Gaps = 10/333 (3%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
M ++ LL A V + E+ E W ++ RSY L++E+R KI+ N+ Y
Sbjct: 1 MKLVFLLLGLFAGACVCLQCETEEVQDFAWEGWKLKYNRSYG--LDEELRKKIWANNMLY 58
Query: 78 IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
+++ N EG+ +YKL NQF+DLTN E+R +Y GY + R F+ + + D+P
Sbjct: 59 VKEFNAEGH-SYKLAANQFADLTNLEYRQIYLGYDNEARLSRKREGKVFQ-RKMKDEDLP 116
Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GN 196
T++DWR KG VTP+KNQ +CG CW+F+A ++EG I+SG L+ SEQ+L+DCST+ GN
Sbjct: 117 TTVDWRSKGVVTPVKNQGQCGSCWSFSATGSLEGQYAIKSGKLVSFSEQELVDCSTSLGN 176
Query: 197 NGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQA 256
+GC GG + AF Y N E +Y Y A G C + K S++ ++PS + A
Sbjct: 177 HGCQGGLMDYAFKYWETNLA-EKESDYTYTAKNGKCKYNAQLGVTKDSSFTDIPSENCDA 235
Query: 257 LLKAVSMQ-PVSIAIAAYSTEFQSYKEGIFNG-VCG-TQLDHAVTIVGFGTTEDGANYWL 313
L +AV+ + P+++A+ A T FQ Y GI+ +C T+LDH V +VG+G T++G +YWL
Sbjct: 236 LKEAVANKGPIAVAMDASHTSFQMYHSGIYTPFLCSKTKLDHGVLVVGYG-TDNGVDYWL 294
Query: 314 IKNSWGNTWGDAGYMKIVRDEGLCGIGTRSSYP 346
IKNSWG WG GY KI CGI T++SYP
Sbjct: 295 IKNSWGMAWGMDGYFKIEMKSDKCGICTQASYP 327
>gi|162463334|ref|NP_001104878.1| maize insect resistance2 precursor [Zea mays]
gi|2425064|gb|AAB88262.1| cysteine proteinase Mir2 [Zea mays]
Length = 493
Score = 248 bits (632), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 129/290 (44%), Positives = 184/290 (63%), Gaps = 11/290 (3%)
Query: 67 RLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEFRA-LYTGYKMPSPSHRSTT 122
RL++F++NL YI+ N E G ++LG +F+DLT +E+RA L G + + +
Sbjct: 92 RLEVFRDNLRYIDAHNAEADAGLHGFRLGLTRFADLTLEEYRARLLLGSRGRNGTAVGVV 151
Query: 123 SSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQ 182
+Y L+ +P ++DWR++GAV +K+Q +CG CWAF+AVAAVEGI KI +G+LI
Sbjct: 152 GRR-RYLPLAGEQLPDAVDWRERGAVAEVKDQGQCGGCWAFSAVAAVEGINKIVTGSLIS 210
Query: 183 LSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQK-PAAA 241
LSEQ+L+DC + GC GG + AF ++I+N GI TE +YP+ GTC K
Sbjct: 211 LSEQELIDCDKFQDQGCDGGLMDNAFVFMIKNGGIDTEADYPFTGHDGTCDLKLKNTRVV 270
Query: 242 KISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVG 301
I ++E VP E+AL KAV+ QPVS +I A FQ Y GIF+G CGT LDH VT+VG
Sbjct: 271 SIDSFERVPINYERALQKAVAHQPVSASIEASRRAFQLYSSGIFDGRCGTYLDHGVTVVG 330
Query: 302 FGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDEGL----CGIGTRSSYPL 347
+G +E G +YW++KNSWG WG+AGY+++ R+ + GI YP+
Sbjct: 331 YG-SEGGKDYWIVKNSWGTQWGEAGYVRMARNVRVRPPSAGIAMEPLYPV 379
>gi|126681066|gb|ABO26562.1| cathepsin L-like cysteine protease [Ixodes ricinus]
Length = 335
Score = 248 bits (632), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 128/304 (42%), Positives = 186/304 (61%), Gaps = 9/304 (2%)
Query: 52 AQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEFRALY 108
A+HG+SY E E+ RLKI+ EN I K N++ G Y + N+F D+ + EF +
Sbjct: 32 AKHGKSYVSETEEVFRLKIYMENRHKIAKHNEKYARGEVPYSMAMNEFGDMLHHEFVSTR 91
Query: 109 TGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAA 168
G+K S+ + +N+ +P ++DWR KGAVTP+KNQ +CG CWAF+A +
Sbjct: 92 NGFKRNYKDQPREGSTYLEPENIEDFSLPKTVDWRTKGAVTPVKNQGQCGSCWAFSATGS 151
Query: 169 VEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQA 227
+EG +SG+++ LSEQ L+ CST+ GNNGC GG + AF YI N+GI TE YPY
Sbjct: 152 LEGQHFRKSGSMVSLSEQNLVGCSTDFGNNGCEGGLMDDAFKYIRANKGIDTEKSYPYNG 211
Query: 228 VPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGIFN 286
GTC + A S + ++ G E L KAV ++ P+S+AI A FQ Y +G+++
Sbjct: 212 TDGTCHFKKSTVGATDSGFVDIKEGSETQLKKAVATVGPISVAIDASHESFQFYSDGVYD 271
Query: 287 GV-CGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRS 343
C ++ LDH V +VG+GT +G +YW +KNSWG TWGD GY+++ R+ + CGI + +
Sbjct: 272 EPECDSESLDHGVLVVGYGTL-NGTDYWFVKNSWGTTWGDEGYIRMSRNKKNQCGIASSA 330
Query: 344 SYPL 347
S PL
Sbjct: 331 SIPL 334
>gi|156397875|ref|XP_001637915.1| predicted protein [Nematostella vectensis]
gi|156225031|gb|EDO45852.1| predicted protein [Nematostella vectensis]
Length = 331
Score = 248 bits (632), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 131/304 (43%), Positives = 186/304 (61%), Gaps = 11/304 (3%)
Query: 50 WMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYT 109
W + HG+ Y ++ E+ MR I++ NL+ I N EG ++KL N D+T+ E
Sbjct: 32 WKSFHGKEYPNKNEETMRNFIWQNNLKKIVTHN-EGKHSFKLAMNHLGDMTSLEISQTLL 90
Query: 110 GYKMPSPSHRSTTSSTF-KYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAA 168
G K+ + +TF N+ + D S+DWR KG VTP+KNQ +CG CWAF+ A
Sbjct: 91 GLKLKKHAESQPKGATFLPPANVKVVD---SIDWRSKGYVTPVKNQGQCGSCWAFSTTGA 147
Query: 169 VEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQA 227
+EG ++G L+ LSEQ L+DCS GNNGC GG + AF YI +N GI TE YPY A
Sbjct: 148 LEGQHFRKTGKLVSLSEQNLVDCSGKYGNNGCEGGLMDNAFQYIKENGGIDTEKSYPYLA 207
Query: 228 VPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGIFN 286
G C + AK + + ++P+GDE AL +A+ S+ P+SIAI A + F Y +G+++
Sbjct: 208 KDGVCHYNKSAIGAKDTGFVDIPTGDENALQQALASVGPISIAIDASQSTFHFYHQGVYD 267
Query: 287 --GVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR-DEGLCGIGTRS 343
T+LDH V VG+G T+DG +YWL+KNSWG +WG+ GY+KI R D CG+ +++
Sbjct: 268 DPDCSSTRLDHGVLAVGYG-TDDGKDYWLVKNSWGPSWGEEGYIKIARNDHDKCGVASKA 326
Query: 344 SYPL 347
SYPL
Sbjct: 327 SYPL 330
>gi|320164780|gb|EFW41679.1| cathepsin L2 [Capsaspora owczarzaki ATCC 30864]
Length = 334
Score = 248 bits (632), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 130/305 (42%), Positives = 182/305 (59%), Gaps = 7/305 (2%)
Query: 48 EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRAL 107
E W G+SY D +E+ R +++ N ++ N G +Y LG N F+DLT++EF+
Sbjct: 31 EAWKRTFGKSYSDAVEEINRRAVWEANKMLVDAHNGAGIHSYTLGMNIFADLTHEEFKRF 90
Query: 108 YTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVA 167
Y G K+ RS SSTF ++ +P S+DWR G VTP+K+Q +CG CW+F+
Sbjct: 91 YLGTKVDLNRPRSNFSSTF-IPTANVGALPDSVDWRTAGIVTPVKDQGQCGSCWSFSTTG 149
Query: 168 AVEGITKIRSGNLIQLSEQQLLDCS-TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQ 226
+VEG ++G L+ LSEQ L+DCS GN GC GG + AF YII N+GI TE YPY
Sbjct: 150 SVEGQHARKTGQLVSLSEQNLVDCSKAQGNQGCNGGLMDDAFQYIITNKGIDTEASYPYT 209
Query: 227 AVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGIF 285
A GTC A +S+++++ G E L AV ++ PVS+AI A FQ Y G++
Sbjct: 210 AKDGTCKFNAANVGATLSSFQDITRGSESDLQNAVATVGPVSVAIDASKNSFQLYTSGVY 269
Query: 286 N--GVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTR 342
N T LDH V G+GT+ +G YWL+KNSWG++WG AGY+ + R+ CGI T
Sbjct: 270 NEKKCSSTSLDHGVLAAGYGTS-NGTPYWLVKNSWGSSWGQAGYIWMSRNANNQCGIATS 328
Query: 343 SSYPL 347
+SYP+
Sbjct: 329 ASYPI 333
>gi|94448674|emb|CAI91575.1| cathepsin L2 [Lubomirskia baicalensis]
Length = 324
Score = 247 bits (631), Expect = 5e-63, Method: Compositional matrix adjust.
Identities = 135/306 (44%), Positives = 188/306 (61%), Gaps = 15/306 (4%)
Query: 50 WMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE-GNRTYKLGTNQFSDLTNDEFRALY 108
W A+HG+SY++ E+ +R ++ N +YI++ N+ G Y L NQF DL N EF++LY
Sbjct: 25 WKAEHGKSYRNHKEEMLRHVTWQANKKYIDEHNQHAGVFGYTLKMNQFGDLENSEFKSLY 84
Query: 109 TGYKMP-SPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVA 167
GY+M +P + Q D+P S+DW KG VTP+KNQ +CG CW+F+A
Sbjct: 85 NGYRMSNAPRKGKPFVPAARVQ-----DLPASVDWSKKGWVTPVKNQGQCGSCWSFSATG 139
Query: 168 AVEGITKIRSGNLIQLSEQQLLDCS-TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQ 226
++EG +G L+ LSEQ L+DCS GN+GC GG + AF Y+I+N GI TE YPY+
Sbjct: 140 SMEGQHFNATGTLMSLSEQNLVDCSAAEGNHGCNGGLMDDAFEYVIKNNGIDTEASYPYR 199
Query: 227 AVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGIF 285
AV TC A IS Y +V E L AV ++ PVS+AI A FQ Y G++
Sbjct: 200 AVDSTCKFNTADVGATISGYVDVTKDSESDLQVAVATIGPVSVAIDASHISFQFYSSGVY 259
Query: 286 NG-VC-GTQLDHAVTIVGFGTTEDGA-NYWLIKNSWGNTWGDAGYMKIVRDE-GLCGIGT 341
+ +C T LDH V VG+GT DG+ +YWL+KNSWG +WG +GY+++VR+ CGI T
Sbjct: 260 DPLICSSTNLDHGVLAVGYGT--DGSKDYWLVKNSWGASWGMSGYIEMVRNHNNKCGIAT 317
Query: 342 RSSYPL 347
+SYP+
Sbjct: 318 SASYPV 323
>gi|444519959|gb|ELV12909.1| Cathepsin L1 [Tupaia chinensis]
Length = 333
Score = 247 bits (631), Expect = 5e-63, Method: Compositional matrix adjust.
Identities = 135/341 (39%), Positives = 209/341 (61%), Gaps = 24/341 (7%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
+F+ I L + S+ TH+QS+ E +W A+HG+ Y E+ +R ++++NL+
Sbjct: 5 LFLTILCL-----GIASAAPTHDQSLDEQWNQWTAEHGKVYSTG-EESLRRAVWEKNLKM 58
Query: 78 IEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT 134
IE+ N E G T+ +G N F D+TN++FR + TG++ + + F Q
Sbjct: 59 IEQHNLEYSQGKHTFTMGMNAFGDMTNEDFRQMMTGFQ----NQKYNKGEVF--QPPQPL 112
Query: 135 DVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-T 193
+VP S+DWR+KG VTP+KNQ CG CWAF+A A+EG ++G L+ LSEQ L+DCS
Sbjct: 113 EVPESVDWREKGYVTPVKNQHRCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSQP 172
Query: 194 NGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGD 253
N+GC GG KAF Y+ N G+ +E+ YPY+ + TC + +AA ++ ++ +P+ +
Sbjct: 173 QHNSGCKGGLVIKAFQYVKDNGGLDSEESYPYEEMESTCRYSPGNSAATVTGFKHIPA-E 231
Query: 254 EQALLKAV-SMQPVSIAIAAYSTEFQSYKEGIFNGV-CGTQ-LDHAVTIVGFGTTEDGAN 310
E+AL KAV S+ P+S+AI A+ FQ Y GI + C + L+HAV +VG+G ++G+N
Sbjct: 232 EKALEKAVASVGPISVAIDAHHHSFQFYTGGILHEPNCSPKWLNHAVLVVGYGVMQEGSN 291
Query: 311 ---YWLIKNSWGNTWGDAGYMKIVRDE-GLCGIGTRSSYPL 347
YWL+KNSWG WG GY+ + +D+ CGI + + YP+
Sbjct: 292 NNTYWLVKNSWGERWGVGGYIMMAKDKNNHCGIASDALYPI 332
>gi|443724292|gb|ELU12369.1| hypothetical protein CAPTEDRAFT_165495 [Capitella teleta]
Length = 351
Score = 247 bits (631), Expect = 6e-63, Method: Compositional matrix adjust.
Identities = 131/306 (42%), Positives = 190/306 (62%), Gaps = 16/306 (5%)
Query: 54 HGRSYKDELEKEMRLKIFKENLEYIEKAN---KEGNRTYKLGTNQFSDLTNDEFRALYTG 110
H R+Y E E+ R ++F+ NL+ IE N +G +Y++G NQF+D+ EF ++ G
Sbjct: 51 HERNYG-ETEEMQRKEVFRNNLKKIEMHNYLHSQGKSSYRMGINQFADMEVKEFASVVNG 109
Query: 111 YKMPSPSHRSTTSSTFKYQNLSM---TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVA 167
++M ++R+ +S +P +DWR +G VTPIK+Q CG CW+F+
Sbjct: 110 FRM---NNRTKVRDHLHSHYISPAIPVSLPAEVDWRKEGYVTPIKDQGHCGSCWSFSTTG 166
Query: 168 AVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQ 226
A+EG ++G L+ LSEQ L+DCST+ GNNGC GG + AF YI N G TED YPY+
Sbjct: 167 ALEGQHFRKTGKLVSLSEQNLIDCSTSYGNNGCNGGVMDYAFQYIKDNDGDDTEDSYPYE 226
Query: 227 AVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSM-QPVSIAIAAYSTEFQSYKEGIF 285
A G C ++ A + Y ++P GDE+ + +AV+M PVS+AI A T FQ Y+ G++
Sbjct: 227 AADGPCRFKKEYVGATDTGYTDLPKGDEEKMKEAVAMVGPVSVAIDASHTSFQMYQSGVY 286
Query: 286 NGV-CGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE-GLCGIGTR 342
+ V C + LDH V +VG+G TE G +YWL+KNSWG WGD GY+K+ R++ CGI +
Sbjct: 287 DEVECDPEGLDHGVLVVGYG-TELGQDYWLVKNSWGTKWGDEGYIKMSRNKNNQCGISSM 345
Query: 343 SSYPLA 348
+SYPL
Sbjct: 346 ASYPLV 351
>gi|443708542|gb|ELU03619.1| hypothetical protein CAPTEDRAFT_17807 [Capitella teleta]
Length = 350
Score = 247 bits (630), Expect = 6e-63, Method: Compositional matrix adjust.
Identities = 131/305 (42%), Positives = 189/305 (61%), Gaps = 16/305 (5%)
Query: 54 HGRSYKDELEKEMRLKIFKENLEYIEKAN---KEGNRTYKLGTNQFSDLTNDEFRALYTG 110
H R+Y E E+ R ++F+ NL+ I+ N ++G Y++G NQF+D+ +EF ++ G
Sbjct: 50 HERTYG-ETEESQRKEVFRNNLKKIQAHNHLHEQGKSPYRMGINQFADMEANEFASIMNG 108
Query: 111 YKMPSPSHRSTTSSTFKYQNLSM---TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVA 167
++M ++R+ +S VP +DWR +G VTP+KNQ +CG CWAF+
Sbjct: 109 FRM---NNRTEVRDHLHANYISPAIPVSVPAEVDWRKEGYVTPVKNQGQCGSCWAFSTTG 165
Query: 168 AVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQ 226
++EG ++G L+ LSEQ L+DCST+ GN GC GG + AF YI N G TE YPY+
Sbjct: 166 SLEGQHFRKTGKLVSLSEQNLVDCSTSYGNEGCNGGIVDYAFQYIKDNDGDDTEACYPYE 225
Query: 227 AVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSM-QPVSIAIAAYSTEFQSYKEGIF 285
AV GTC A + Y ++P GDE + +AV++ PVS+AI A + FQ Y+ GI+
Sbjct: 226 AVDGTCRFKSVCVGATCTGYTDLPKGDEAKMKEAVALVGPVSVAIDASHSSFQMYQSGIY 285
Query: 286 --NGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTR 342
QLDHAV +VG+G TE G +YWL+KNSWG TWGD GY+K+ R+ + CGI ++
Sbjct: 286 VEQECSPKQLDHAVLVVGYG-TEQGQDYWLVKNSWGTTWGDEGYIKMARNMDNQCGIASQ 344
Query: 343 SSYPL 347
+SYPL
Sbjct: 345 ASYPL 349
>gi|29165304|gb|AAO65603.1| cathepsin L precursor [Hydra vulgaris]
Length = 324
Score = 247 bits (630), Expect = 6e-63, Method: Compositional matrix adjust.
Identities = 131/305 (42%), Positives = 188/305 (61%), Gaps = 14/305 (4%)
Query: 49 KWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALY 108
+W H + Y + E+ +R I+K+N I + N +G + L NQF D+TN EF+A +
Sbjct: 29 QWKMYHNKVYSHDGEETVRYTIWKDNERRIREHNLKGG-DFILKMNQFGDMTNSEFKA-F 86
Query: 109 TGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAA 168
GY SH+ STF N + P ++DWR++G VTP+K+Q +CG CWAF+ +
Sbjct: 87 NGY----LSHKHVNGSTFLTPNNFV--APDTVDWRNEGYVTPVKDQGQCGSCWAFSTTGS 140
Query: 169 VEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQA 227
+EG ++G L+ LSEQ L+DCST GNNGC GG + AF YI +N+GI +E YPY A
Sbjct: 141 LEGQHFKKTGKLVSLSEQNLVDCSTAYGNNGCDGGLMDNAFTYIKENKGIDSEASYPYTA 200
Query: 228 VPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGIFN 286
G C + AA + + ++P G+E L +AV S+ P+S+AI A FQ Y G++N
Sbjct: 201 EDGKCVFKKSSVAATDTGFVDIPEGNENKLKEAVASVGPISVAIDASHESFQFYSSGVYN 260
Query: 287 --GVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRS 343
T+LDH V +VG+G TE G +YWL+KNSW +WGD GY+K+ R+ + CGI T++
Sbjct: 261 EPSCSSTELDHGVLVVGYG-TESGKDYWLVKNSWNTSWGDKGYIKMRRNAKNQCGIATKA 319
Query: 344 SYPLA 348
SYPL
Sbjct: 320 SYPLV 324
>gi|391332597|ref|XP_003740719.1| PREDICTED: cathepsin L-like [Metaseiulus occidentalis]
Length = 330
Score = 247 bits (630), Expect = 6e-63, Method: Compositional matrix adjust.
Identities = 139/337 (41%), Positives = 200/337 (59%), Gaps = 19/337 (5%)
Query: 20 IIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKE--MRLKIFKENLEY 77
+I++L V+C VS + E E + QH ++Y L+K+ R IF+ N++
Sbjct: 1 MILSLTVACIFVGVSPAAVDAHD--EHWELFKRQHNKTY---LQKQDVGRRAIFEANIKK 55
Query: 78 IEKAN---KEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT 134
I N G +Y+LG N F+D+T DEF Y G + + R S ++++
Sbjct: 56 INAHNLLYDLGRSSYRLGLNGFADMTPDEFEK-YRGTRFEANEARV---SKLQHRDNRSM 111
Query: 135 DVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-T 193
VP ++DWR +G VTP+KNQ CG CWAF+ A+EG RSG+L+ LSEQ L+DCS
Sbjct: 112 HVPDTVDWRTEGYVTPVKNQGVCGSCWAFSTTGALEGQHFRRSGDLVSLSEQMLVDCSAV 171
Query: 194 NGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGD 253
GN GC GG + AF +I G+ TE YPY GTC + AK++ + +VPS D
Sbjct: 172 YGNAGCNGGLMDNAFRFIKDAGGLETEKSYPYTGKDGTCHFDARGIGAKLTGFVDVPSRD 231
Query: 254 EQALLKAVS-MQPVSIAIAAYSTEFQSYKEGIFNGVC--GTQLDHAVTIVGFGTTEDGAN 310
E+AL +A + PVS+AI A FQ YK+G+++ + T LDH V +VG+GTT DG +
Sbjct: 232 EEALKEAAGVVGPVSVAIDASGQNFQFYKDGVYDEITCSSTSLDHGVLVVGYGTTRDGKD 291
Query: 311 YWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYP 346
YWL+KNSWG++WG +GY+++ R+ E CGI T +SYP
Sbjct: 292 YWLVKNSWGSSWGQSGYIQMSRNKENQCGIATMASYP 328
>gi|391336140|ref|XP_003742440.1| PREDICTED: cathepsin L-like [Metaseiulus occidentalis]
Length = 330
Score = 247 bits (630), Expect = 6e-63, Method: Compositional matrix adjust.
Identities = 130/300 (43%), Positives = 189/300 (63%), Gaps = 13/300 (4%)
Query: 54 HGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRT--YKLGTNQFSDLTNDEFRALYTGY 111
H +SY+D E+ +R IF++NL IE+ N+ + LG N+F+D+TN EF + G
Sbjct: 35 HLKSYRDGQEELIRRFIFEDNLHTIEEFNRVNASLAGFTLGVNEFADMTNTEFSNMLLGL 94
Query: 112 KMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEG 171
R+ + +++ + D+P +DW KG VT +KNQ +CG CWAF+ ++EG
Sbjct: 95 -----GGRNKIAGDSVFESSHVQDLPAEVDWTQKGYVTEVKNQGQCGSCWAFSTTGSLEG 149
Query: 172 ITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPG 230
++G L+ LSEQ L+DCST+ GN GC GG ++AF YI +N GI TE YPY G
Sbjct: 150 QVFKKTGKLVSLSEQNLVDCSTSEGNQGCNGGLMDQAFTYIKKNGGIDTEAAYPYTGSDG 209
Query: 231 TCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGIFNG-- 287
TC + A +S + +V SGDE AL +AV ++ P+S+AI A S FQ Y+ G++N
Sbjct: 210 TCRFLENKVGATVSGFVDVKSGDENALKEAVATVGPISVAIDASSIFFQFYRGGVYNPWF 269
Query: 288 VCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYP 346
T+LDH V +VG+G TE G +YWL+KNSWG++WG GY+K+VR+ + CGI T++SYP
Sbjct: 270 CSSTELDHGVLVVGYG-TEGGKDYWLVKNSWGSSWGLKGYIKMVRNKKNRCGIATQASYP 328
>gi|449673497|ref|XP_002169904.2| PREDICTED: cathepsin L-like [Hydra magnipapillata]
Length = 325
Score = 247 bits (630), Expect = 6e-63, Method: Compositional matrix adjust.
Identities = 133/334 (39%), Positives = 195/334 (58%), Gaps = 18/334 (5%)
Query: 19 FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
I ++L+ C ++ V W H ++Y E E+ +R I+K+N+ I
Sbjct: 4 LIFVSLITLCFGYIIEKPIRESSWYV-----WKMAHNKAYSHESEENVRYAIWKDNMNRI 58
Query: 79 EKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPT 138
+ N + ++ L N F D+TN EFRA G + H+ STF S T P
Sbjct: 59 TEYNSK-SKNVILRMNHFGDMTNTEFRAKMNGLLL----HKHQNGSTFLVP--SHTAAPD 111
Query: 139 SLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNN 197
++DWR +G VTP+KNQ +CG CWAF++ A+EG ++G L+ LSEQ L+DCST+ GNN
Sbjct: 112 AVDWRSEGYVTPVKNQGQCGSCWAFSSTGALEGQHFKKTGRLVSLSEQNLVDCSTDYGNN 171
Query: 198 GCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQAL 257
GC GG + AF+YI N GI TE YPY+ GTC ++ A + + ++P GDE AL
Sbjct: 172 GCNGGLMDNAFSYIKANGGIDTETGYPYEGQDGTCRYSKSSIGADDTGFVDIPEGDEDAL 231
Query: 258 LKAV-SMQPVSIAIAAYSTEFQSYKEGIFN--GVCGTQLDHAVTIVGFGTTEDGANYWLI 314
+AV ++ PVS+AI A FQ Y G+++ + LDH V +VG+G T++G +YWL+
Sbjct: 232 KQAVATVGPVSVAIDASHMSFQFYHSGVYDEPQCSPSALDHGVLVVGYG-TDNGKDYWLV 290
Query: 315 KNSWGNTWGDAGYMKIVR-DEGLCGIGTRSSYPL 347
KNSWG WG GY+ + R ++ CGI +++SYPL
Sbjct: 291 KNSWGTGWGTEGYIYMSRNNQNQCGIASKASYPL 324
>gi|326531188|dbj|BAK04945.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 360
Score = 247 bits (630), Expect = 6e-63, Method: Compositional matrix adjust.
Identities = 137/350 (39%), Positives = 201/350 (57%), Gaps = 22/350 (6%)
Query: 17 PMFIIITLLVSCASQVVSSRSTHEQSVVEIHE--KWMAQHGRSYKDELEKEMRLKIFKEN 74
P+ T+L++ A+ S R ++ + W A H +SY+ E+ R +++++N
Sbjct: 10 PVITASTILLAWAAAAASGRGVDVGDMLMMDRFLMWQATHNQSYRSAEERLRRFQVYRDN 69
Query: 75 LEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTS----------- 123
+EYIE N+ G+ TY+LG NQF+DLT +EF A +T Y S
Sbjct: 70 VEYIETTNRRGDLTYQLGENQFADLTREEFIARFTSYNGDDDRTGDDDSVITTAAVGGGD 129
Query: 124 -STFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCC-WAFAAVAAVEGITKIRSGNLI 181
+ ++ P S+DWR KGAV P K+Q WAF AVA +E + I++G L+
Sbjct: 130 PDLWSSGGDDVSLDPPSVDWRAKGAVVPPKSQSSSCSSSWAFVAVATIESLHAIKTGKLV 189
Query: 182 QLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKP-AA 240
LSEQQL+DC + GC G+ +AF ++IQN G+ TE EYPY A GTC++A+
Sbjct: 190 ALSEQQLVDCDQY-DGGCNRGTFRRAFHWVIQNGGLTTEAEYPYTAAQGTCNSAKSDHHV 248
Query: 241 AKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIV 300
A IS + VP +E A+ AV+ QPV+ AI ++ Q YK G+++G CG +L+HAVT+V
Sbjct: 249 AAISGHASVPGSNELAMKHAVATQPVAAAI-ELGSDMQFYKSGVYSGPCGARLEHAVTVV 307
Query: 301 GFGTTED-GANYWLIKNSWGNTWGDAGYMKIVRD---EGLCGIGTRSSYP 346
G+G E G YW++KNSWG TWG+ GY+++ R GLCGI +YP
Sbjct: 308 GYGADESTGDKYWIVKNSWGQTWGERGYIRMQRKILGPGLCGIMLDVAYP 357
>gi|21425246|emb|CAD33266.1| cathepsin L [Aphis gossypii]
Length = 341
Score = 247 bits (630), Expect = 7e-63, Method: Compositional matrix adjust.
Identities = 138/342 (40%), Positives = 196/342 (57%), Gaps = 17/342 (4%)
Query: 20 IIITLLVSCASQVVSSRSTHEQSVVEIHEKW---MAQHGRSYKDELEKEMRLKIFKENLE 76
+I+ LV+ A VSS + +E I E+W Q + Y+D E+ R K++ +N
Sbjct: 4 VIVLGLVAFAISTVSSINLNEV----IEEEWSLFKIQFKKLYEDIKEETFRKKVYLDNKL 59
Query: 77 YIEKANK---EGNRTYKLGTNQFSDLTNDEFRALYTGYK--MPSPSHRSTTSSTFKYQNL 131
I NK G TY L N F DL E+ + G+K + T +
Sbjct: 60 KIAGHNKLYESGEETYALEMNHFGDLMQHEYTKMMNGFKPSLAGGDRNFTNDEAVTFLKS 119
Query: 132 SMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDC 191
+P S+DWR KG VTP+KNQ +CG CW+F+A ++EG ++G L+ LSEQ L+DC
Sbjct: 120 ENVVIPKSVDWRKKGYVTPVKNQGQCGSCWSFSATGSLEGQHFRKTGVLVSLSEQNLIDC 179
Query: 192 STN-GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVP 250
S GNNGC GG + AF YI N+G+ TE YPY+A C + + A + ++P
Sbjct: 180 SRKYGNNGCEGGLMDLAFKYIKSNKGLDTEKSYPYEAEDDKCRYNPENSGATDKGFVDIP 239
Query: 251 SGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGIF-NGVC-GTQLDHAVTIVGFGTTED 307
GDE AL+ A+ ++ PVSIAI A S +FQ YK+G+F N C T+LDH V VGFG+ +
Sbjct: 240 EGDEDALMHALATVGPVSIAIDASSEKFQFYKKGVFYNPRCSSTELDHGVLAVGFGSDKK 299
Query: 308 GANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPLA 348
G +YW++KNSWG TWGD GY+ + R+ + CG+ + +SYPL
Sbjct: 300 GGDYWIVKNSWGKTWGDEGYIMMARNKKNNCGVASSASYPLV 341
>gi|392881548|gb|AFM89606.1| cathepsin L [Callorhinchus milii]
Length = 338
Score = 247 bits (630), Expect = 7e-63, Method: Compositional matrix adjust.
Identities = 134/312 (42%), Positives = 190/312 (60%), Gaps = 16/312 (5%)
Query: 48 EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEF 104
E+W + HG+SY ++ E+ R +++++L IE N E G +++LG N F D+ N+EF
Sbjct: 30 EQWKSWHGKSY-EQKEETWRRMVWEKHLRVIEIHNLEHSLGKHSFRLGMNHFGDMPNEEF 88
Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFA 164
R L GYK +H+ S F N +VP +DWRD+G VTP+K+Q +CG CWAF+
Sbjct: 89 RQLMNGYKYKQ-THKKLQGSHFLEPNFQ--EVPKHVDWRDEGYVTPVKDQGQCGSCWAFS 145
Query: 165 AVAAVEGITKIRSGNLIQLSEQQLLDCST-NGNNGCLGGSREKAFAYIIQNQGIATEDEY 223
A+EG R+G L+ LSEQ L++CS GN GC GG ++AF Y+ N GI +ED Y
Sbjct: 146 TTGALEGQHFRRTGQLVSLSEQNLVECSKPEGNEGCNGGLMDQAFQYVKDNGGIDSEDSY 205
Query: 224 PYQAVPGT-CSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYK 281
PY T C + AA + + ++PSG E+AL+KA+ ++ PVS+AI A T FQ Y+
Sbjct: 206 PYVGTDDTPCHYNPQYNAANDTGFVDIPSGKERALMKAIAAVGPVSVAIDAGHTSFQFYQ 265
Query: 282 EGI-FNGVC-GTQLDHAVTIVGFGTTE---DGANYWLIKNSWGNTWGDAGYMKIVRD-EG 335
GI F C T LDH V +VG+G + DG YW++KNSW WG GY+ + +D +
Sbjct: 266 SGIYFEAECSSTDLDHGVLVVGYGVEKRDTDGKKYWIVKNSWSEKWGQNGYILMAKDKDN 325
Query: 336 LCGIGTRSSYPL 347
CGI T +SYPL
Sbjct: 326 HCGIATAASYPL 337
>gi|2098464|pdb|1PCI|A Chain A, Procaricain
gi|2098465|pdb|1PCI|B Chain B, Procaricain
gi|2098466|pdb|1PCI|C Chain C, Procaricain
Length = 322
Score = 247 bits (630), Expect = 7e-63, Method: Compositional matrix adjust.
Identities = 135/314 (42%), Positives = 192/314 (61%), Gaps = 12/314 (3%)
Query: 38 THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
T + ++++ WM H + Y++ EK R +IFK+NL YI++ NK+ N +Y LG N+F+
Sbjct: 13 TSTERLIQLFNSWMLNHNKFYENVDEKLYRFEIFKDNLNYIDETNKK-NNSYWLGLNEFA 71
Query: 98 DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKEC 157
DL+NDEF Y G + + +S ++ N + ++P ++DWR KGAVTP+++Q C
Sbjct: 72 DLSNDEFNEKYVGSLIDATIEQSYDE---EFINEDIVNLPENVDWRKKGAVTPVRHQGSC 128
Query: 158 GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
G CWAF+AVA VEGI KIR+G L++LSEQ+L+DC ++GC GG A Y+ +N GI
Sbjct: 129 GSCWAFSAVATVEGINKIRTGKLVELSEQELVDCERR-SHGCKGGYPPYALEYVAKN-GI 186
Query: 218 ATEDEYPYQAVPGTCSAAQKPAA-AKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTE 276
+YPY+A GTC A Q K S V +E LL A++ QPVS+ + +
Sbjct: 187 HLRSKYPYKAKQGTCRAKQVGGPIVKTSGVGRVQPNNEGNLLNAIAKQPVSVVVESKGRP 246
Query: 277 FQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR---- 332
FQ YK GIF G CGT++D AVT VG+G + LIKNSWG WG+ GY++I R
Sbjct: 247 FQLYKGGIFEGPCGTKVDGAVTAVGYGKSGGKGYI-LIKNSWGTAWGEKGYIRIKRAPGN 305
Query: 333 DEGLCGIGTRSSYP 346
G+CG+ S YP
Sbjct: 306 SPGVCGLYKSSYYP 319
>gi|387914010|gb|AFK10614.1| cathepsin L [Callorhinchus milii]
gi|392873762|gb|AFM85713.1| cathepsin L [Callorhinchus milii]
gi|392877488|gb|AFM87576.1| cathepsin L [Callorhinchus milii]
Length = 338
Score = 247 bits (630), Expect = 7e-63, Method: Compositional matrix adjust.
Identities = 134/312 (42%), Positives = 190/312 (60%), Gaps = 16/312 (5%)
Query: 48 EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEF 104
E+W + HG+SY ++ E+ R +++++L IE N E G +++LG N F D+ N+EF
Sbjct: 30 EQWKSWHGKSY-EQKEETWRRMVWEKHLRVIEIHNLEHSLGKHSFRLGMNHFGDMPNEEF 88
Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFA 164
R L GYK +H+ S F N +VP +DWRD+G VTP+K+Q +CG CWAF+
Sbjct: 89 RQLMNGYKYKQ-THKKLQGSHFLEPNF--LEVPKHVDWRDEGYVTPVKDQGQCGSCWAFS 145
Query: 165 AVAAVEGITKIRSGNLIQLSEQQLLDCST-NGNNGCLGGSREKAFAYIIQNQGIATEDEY 223
A+EG R+G L+ LSEQ L++CS GN GC GG ++AF Y+ N GI +ED Y
Sbjct: 146 TTGALEGQHFRRTGQLVSLSEQNLVECSKPEGNEGCNGGLMDQAFQYVKDNGGIDSEDSY 205
Query: 224 PYQAVPGT-CSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYK 281
PY T C + AA + + ++PSG E+AL+KA+ ++ PVS+AI A T FQ Y+
Sbjct: 206 PYVGTDDTPCHYNPQYNAANDTGFVDIPSGKERALMKAIAAVGPVSVAIDAGHTSFQFYQ 265
Query: 282 EGI-FNGVC-GTQLDHAVTIVGFGTTE---DGANYWLIKNSWGNTWGDAGYMKIVRD-EG 335
GI F C T LDH V +VG+G + DG YW++KNSW WG GY+ + +D +
Sbjct: 266 SGIYFEAECSSTDLDHGVLVVGYGVEKRDTDGKKYWIVKNSWSEKWGQNGYILMAKDKDN 325
Query: 336 LCGIGTRSSYPL 347
CGI T +SYPL
Sbjct: 326 HCGIATAASYPL 337
>gi|225718114|gb|ACO14903.1| Cathepsin L precursor [Caligus clemensi]
Length = 336
Score = 246 bits (629), Expect = 8e-63, Method: Compositional matrix adjust.
Identities = 129/310 (41%), Positives = 186/310 (60%), Gaps = 13/310 (4%)
Query: 48 EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEF 104
E W HG++Y +E+++RLKI+ EN I + N E G Y + N + DL + EF
Sbjct: 31 ESWKLMHGKTYSSSIEEKLRLKIYMENSLKISRHNSEALNGIHPYYMKMNHYGDLLHHEF 90
Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFA 164
A+ GY+ + + S + +N+ + PT +DWR++GAVTP+KNQ +CG CW+F+
Sbjct: 91 VAMVNGYQYANKT-ASLGGTYIPNKNIQL---PTHVDWREEGAVTPVKNQGQCGSCWSFS 146
Query: 165 AVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEY 223
A A+EG ++G LI LSEQ L+DCS GNNGC GG + AF YI N+GI TE Y
Sbjct: 147 ATGALEGQDFRKTGKLISLSEQNLVDCSRKFGNNGCEGGLMDFAFTYIRDNKGIDTEASY 206
Query: 224 PYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVS-MQPVSIAIAAYSTEFQSYKE 282
PY+ + G C K + ++ G E+ L KAV+ + P+S+AI A FQ Y
Sbjct: 207 PYEGIDGHCHYNPKNKGGSDIGFVDIKKGSEKDLKKAVAGVGPISVAIDASHMSFQFYSH 266
Query: 283 GIF-NGVCGT-QLDHAVTIVGFGT-TEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCG 338
G++ C + +LDH V +VGFGT + G +YWL+KNSW WGD GY+K+ R+ E +CG
Sbjct: 267 GVYVESKCSSEELDHGVLVVGFGTDSVSGEDYWLVKNSWSEKWGDQGYIKMARNKENMCG 326
Query: 339 IGTRSSYPLA 348
I + +SYP+
Sbjct: 327 IASSASYPVV 336
>gi|299507656|gb|ADJ21807.1| cathepsin L [Oplegnathus fasciatus]
Length = 336
Score = 246 bits (629), Expect = 9e-63, Method: Compositional matrix adjust.
Identities = 141/341 (41%), Positives = 199/341 (58%), Gaps = 18/341 (5%)
Query: 19 FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
+ + +L C S +S+ S Q + E + W + H + Y E E+ R ++++NL+ I
Sbjct: 1 MLPVAVLAVCLSAALSAPSLDPQ-LDEHWDLWKSWHTKKYH-EKEEGWRRMVWEKNLKKI 58
Query: 79 EKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD 135
E N E G TY+LG N F D+T++EFR + GYK S R S F N +
Sbjct: 59 ELHNLEHSMGEHTYRLGMNHFGDMTHEEFRQIMNGYK--RKSERKFKGSLFMEPNF--LE 114
Query: 136 VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-TN 194
P S+DWRD G VTP+K+Q +CG CWAF+ A+EG ++G L+ LSEQ L+DCS
Sbjct: 115 APRSVDWRDNGYVTPVKDQGQCGSCWAFSTTGAMEGQHFRKTGKLVSLSEQNLVDCSRPE 174
Query: 195 GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPG-TCSAAQKPAAAKISNYEEVPSGD 253
GN GC GG ++AF YI NQG+ +ED YPY C K +A + + ++PSG
Sbjct: 175 GNEGCNGGLMDQAFQYIKDNQGLDSEDSYPYLGTDDQPCHYDPKYNSANDTGFIDIPSGK 234
Query: 254 EQALLKAV-SMQPVSIAIAAYSTEFQSYKEGI-FNGVCGT-QLDHAVTIVGF---GTTED 307
E+AL+KAV ++ PVS+AI A FQ Y+ GI + C + +LDH V +VG+ G D
Sbjct: 235 ERALMKAVAAVGPVSVAIDAGHESFQFYQSGIYYEKECSSEELDHGVLVVGYGFEGEDVD 294
Query: 308 GANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
G YW++KNSW WGD GY+ + +D + CGI T +SYPL
Sbjct: 295 GKKYWIVKNSWSEKWGDKGYIYMAKDRKNHCGIATAASYPL 335
>gi|410923307|ref|XP_003975123.1| PREDICTED: cathepsin L1-like [Takifugu rubripes]
Length = 336
Score = 246 bits (629), Expect = 9e-63, Method: Compositional matrix adjust.
Identities = 143/343 (41%), Positives = 199/343 (58%), Gaps = 19/343 (5%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
MF ++ L + C + +S+ S Q + E W H + Y E E+ R ++++NL+
Sbjct: 1 MFPVVVLAL-CVTAALSAPSLDPQ-LDEHWNLWKDWHSKKYH-EKEEGWRRMVWEKNLKK 57
Query: 78 IEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT 134
IE N E G TY LG N F D+T++EFR + GYK+ S R S F N
Sbjct: 58 IELHNLEHSMGKHTYSLGMNHFGDMTHEEFRQIMNGYKLKS--QRKLRGSLFMEPNF--L 113
Query: 135 DVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-T 193
+ P S+DWRDKG VTP+K+Q +CG CWAF+ A+EG ++G L+ LSEQ L+DCS
Sbjct: 114 EAPRSVDWRDKGYVTPVKDQGQCGSCWAFSTTGAMEGQHFRKTGTLVSLSEQNLVDCSRP 173
Query: 194 NGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVP-GTCSAAQKPAAAKISNYEEVPSG 252
GN GC GG ++AF YI N G+ +E+ YPY G C +A + + +VPSG
Sbjct: 174 EGNEGCNGGLMDQAFQYIKDNGGLDSEESYPYLGTDEGPCHYDPSYNSANDTGFVDVPSG 233
Query: 253 DEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGI-FNGVCGT-QLDHAVTIVGF---GTTE 306
E+AL+KAV S+ PVS+AI A FQ Y GI ++ C + +LDH V +VG+ G
Sbjct: 234 SERALMKAVASVGPVSVAIDAGHESFQFYHSGIYYDKECSSEELDHGVLVVGYGFEGKDV 293
Query: 307 DGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPLA 348
DG YW++KNSW WGD GY+ + +D + CGI T +SYPL
Sbjct: 294 DGKKYWIVKNSWSENWGDKGYIYMAKDKKNHCGIATAASYPLV 336
>gi|330805275|ref|XP_003290610.1| hypothetical protein DICPUDRAFT_98747 [Dictyostelium purpureum]
gi|325079249|gb|EGC32858.1| hypothetical protein DICPUDRAFT_98747 [Dictyostelium purpureum]
Length = 334
Score = 246 bits (628), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 135/336 (40%), Positives = 194/336 (57%), Gaps = 17/336 (5%)
Query: 18 MFIIITLLV----SCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKE 73
+F+I++L++ CA+ + S T++ S + WM +H ++Y E + + FK+
Sbjct: 5 VFLIVSLVILSINVCAATNLFSAQTYQTSFL----GWMKKHNKAYHHH-EFNDKYQTFKD 59
Query: 74 NLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
N+++I N + + T LG N+F+DLTN+E++ Y G M + N
Sbjct: 60 NMDFIHNWNSKESDTV-LGLNRFADLTNEEYKKTYLG--MSINVNLRANQVPMNGLNFER 116
Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST 193
P+S+DWR GAV +K+Q CG CWAFA AVEG +I++GN++ SEQ L+DCS
Sbjct: 117 FTGPSSIDWRQNGAVAYVKDQGHCGSCWAFATTGAVEGAHQIKTGNMVTFSEQHLVDCSG 176
Query: 194 N-GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSG 252
GNNGC GG AF YII N GIATE+ YPY A C IS Y++VP G
Sbjct: 177 RYGNNGCDGGLMTSAFKYIIDNDGIATEEAYPYTATQNRCVYNTTMLGTAISGYKDVPRG 236
Query: 253 DEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFN-GVCGT-QLDHAVTIVGFGTTEDGAN 310
E AL A+S QPV++AI A FQ YK G++ C + +L+H V VG+GT E G +
Sbjct: 237 SESALTAAISKQPVAVAIDASPITFQLYKSGVYQEATCSSYRLNHGVLAVGYGTLE-GKD 295
Query: 311 YWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSY 345
Y+++KNSW TWG+ GY+ + R+ CGI T +SY
Sbjct: 296 YYIVKNSWAETWGNQGYILMARNANNHCGIATMASY 331
>gi|346469447|gb|AEO34568.1| hypothetical protein [Amblyomma maculatum]
Length = 333
Score = 246 bits (628), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 131/308 (42%), Positives = 191/308 (62%), Gaps = 11/308 (3%)
Query: 48 EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN---KEGNRTYKLGTNQFSDLTNDEF 104
E + + H ++YK +E+ +R KIF EN +I K N +G +YKLG NQF+DL EF
Sbjct: 28 EAFKSTHKKTYKSNVEELLRFKIFTENSLFIAKHNVKYAKGLVSYKLGINQFADLLPHEF 87
Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFA 164
+ GY+ + R +T NL+ + +P ++DWR KGAVTP+K+Q +CG CWAF+
Sbjct: 88 VKMMNGYQGKRLAGRGST--YLPPANLNDSSLPKTVDWRKKGAVTPVKDQGQCGSCWAFS 145
Query: 165 AVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEY 223
+ ++EG +++G L+ LSEQ L+DCS+ GN GC GG + +F YI N GI TED Y
Sbjct: 146 STGSLEGQHFLKTGKLVSLSEQNLVDCSSAYGNQGCNGGLMDNSFNYIKANGGIDTEDSY 205
Query: 224 PYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKE 282
PY+A G C ++ A + + ++ G E+ L KAV ++ PVS+AI A FQ Y E
Sbjct: 206 PYEAEDGDCRYKKEDVGATDTGFVDIKEGSEKDLQKAVATVGPVSVAIDASQQSFQLYSE 265
Query: 283 GIFNGV-CGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE-GLCGI 339
G+++ C ++ LDH V VG+G ++G YWL+KNSW TWG GY+ + RD+ CGI
Sbjct: 266 GVYDEPNCSSESLDHGVLAVGYG-VKNGKKYWLVKNSWAETWGQDGYILMSRDKNNQCGI 324
Query: 340 GTRSSYPL 347
+ +SYPL
Sbjct: 325 ASSASYPL 332
>gi|221090861|ref|XP_002167224.1| PREDICTED: cathepsin L-like [Hydra magnipapillata]
Length = 324
Score = 246 bits (628), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 135/330 (40%), Positives = 196/330 (59%), Gaps = 18/330 (5%)
Query: 24 LLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANK 83
LL+ R ++S ++ W H + Y + E+ +R I+K+N I + N
Sbjct: 8 LLLGVTLAYTIERPVKDESWIQ----WKMYHNKVYSHDGEETVRYTIWKDNERRIREHNL 63
Query: 84 EGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWR 143
+G + L NQF D+TN EF+A + GY SH+ STF N + P ++DWR
Sbjct: 64 KGG-DFLLKMNQFGDMTNSEFKA-FNGYL----SHKHVNGSTFLTPNNFV--APDTVDWR 115
Query: 144 DKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGG 202
++G VTP+K+Q +CG CWAF+ ++EG ++G L+ LSEQ L+DCST GNNGC GG
Sbjct: 116 NEGYVTPVKDQGQCGSCWAFSTTGSLEGQHFKKTGKLVSLSEQNLVDCSTAYGNNGCNGG 175
Query: 203 SREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV- 261
+ AF YI +N+GI +E YPY A G C + AA + + ++P G+E L +AV
Sbjct: 176 LMDNAFTYIKENKGIDSEASYPYTAEDGKCVFKKPSVAATDTGFVDLPEGNENKLKEAVA 235
Query: 262 SMQPVSIAIAAYSTEFQSYKEGIFN--GVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWG 319
S+ P+S+AI A FQ Y G++N T+LDH V +VG+G TE G +YWL+KNSW
Sbjct: 236 SVGPISVAIDASHESFQFYSSGVYNEPSCSSTELDHGVLVVGYG-TESGKDYWLVKNSWN 294
Query: 320 NTWGDAGYMKIVRD-EGLCGIGTRSSYPLA 348
+WGD GY+K+ R+ + CGI T++SYPL
Sbjct: 295 TSWGDKGYIKMRRNAKNQCGIATKASYPLV 324
>gi|21953244|emb|CAD42716.1| putative cathepsin L [Myzus persicae]
Length = 341
Score = 246 bits (628), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 138/342 (40%), Positives = 197/342 (57%), Gaps = 17/342 (4%)
Query: 20 IIITLLVSCASQVVSSRSTHEQSVVEIHEKW---MAQHGRSYKDELEKEMRLKIFKENLE 76
+I+ LV+ A VSS + +E I E+W Q + Y+D E+ R K++ +N
Sbjct: 4 VIVLGLVAFAISSVSSINLNEV----IEEEWSLFKMQFKKLYEDIKEETFRKKVYLDNKL 59
Query: 77 YIEKANK---EGNRTYKLGTNQFSDLTNDEFRALYTGYK--MPSPSHRSTTSSTFKYQNL 131
I + NK G TY L N F DL E+ + G+K + T +
Sbjct: 60 KIARHNKLYESGEETYALEMNHFGDLMQHEYSKMMNGFKPSLAGGDSNFTNDEGVTFLKS 119
Query: 132 SMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDC 191
+P S+DWR KG VTP+KNQ +CG CW+F+A ++EG ++G L+ LSEQ L+DC
Sbjct: 120 ENVVIPKSIDWRKKGYVTPVKNQGQCGSCWSFSATGSLEGQHFRKTGVLVSLSEQNLIDC 179
Query: 192 STN-GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVP 250
S GNNGC GG + AF YI N+G+ TE YPY+A C + A + + ++P
Sbjct: 180 SRKYGNNGCEGGLMDLAFKYIKSNKGLDTEKSYPYEAEDDKCRYNPDNSGATDNGFVDIP 239
Query: 251 SGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGIF-NGVC-GTQLDHAVTIVGFGTTED 307
GDE+AL+ A+ ++ PVSIAI A S +FQ YK+G+F N C T+LDH V VGF T +
Sbjct: 240 EGDEEALMHALATVGPVSIAIDASSEKFQFYKKGVFYNPRCSSTELDHGVLAVGFRTDKK 299
Query: 308 GANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPLA 348
G +YW++KNSWG TWGD GY+ + R+ + CG+ + +SYPL
Sbjct: 300 GGDYWIVKNSWGKTWGDEGYIMMARNKKNNCGVASSASYPLV 341
>gi|330803818|ref|XP_003289899.1| hypothetical protein DICPUDRAFT_154350 [Dictyostelium purpureum]
gi|325080010|gb|EGC33584.1| hypothetical protein DICPUDRAFT_154350 [Dictyostelium purpureum]
Length = 326
Score = 246 bits (628), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 136/335 (40%), Positives = 200/335 (59%), Gaps = 20/335 (5%)
Query: 19 FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
I L+++C S ++R ++ + WM +H +SY ++ E R +F++N++ +
Sbjct: 7 LIFCFLIINCCS---AARIFSQKQYQTAFQNWMVKHQKSYTND-EFGSRYSVFQDNMDIV 62
Query: 79 EKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNL-SMTDVP 137
K N++G+ T LG N +DLTN+EF+ LY G K + T+K + L ++ +P
Sbjct: 63 AKWNQKGSNTI-LGLNVMADLTNEEFKKLYLGTK---------ANVTYKKKTLVGVSGLP 112
Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-TNGN 196
S+DWR GAVT +KNQ +CG C+AF+ +VEGI +I S L+ LSEQQ+LDCS + GN
Sbjct: 113 ASVDWRANGAVTAVKNQGQCGGCYAFSTTGSVEGIHEITSQQLVPLSEQQILDCSGSEGN 172
Query: 197 NGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQA 256
NGC GG +F YII G+ TE YPY G C +K A I+ Y+ V SG E
Sbjct: 173 NGCDGGLMTNSFEYIIAVGGLDTEASYPYTGEVGKCKFNKKNIGATITGYKNVESGSESD 232
Query: 257 LLKAVSMQPVSIAIAAYSTEFQSYKEGIF--NGVCGTQLDHAVTIVGFGTTEDGANYWLI 314
L AV+ QPVS+AI A + FQ Y G++ TQLDH V VG+G ++ G +YW++
Sbjct: 233 LQTAVAAQPVSVAIDASQSSFQLYASGVYYEPECSSTQLDHGVLAVGYG-SQSGQDYWIV 291
Query: 315 KNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPLA 348
KNSWG WG+ G++ + R+ + CGI T +S+P A
Sbjct: 292 KNSWGADWGENGFILMARNKDNNCGIATMASFPTA 326
>gi|52630917|gb|AAU84922.1| putative cathepsin L [Toxoptera citricida]
Length = 341
Score = 246 bits (628), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 140/345 (40%), Positives = 197/345 (57%), Gaps = 18/345 (5%)
Query: 18 MFIIITL-LVSCASQVVSSRSTHEQSVVEIHEKW---MAQHGRSYKDELEKEMRLKIFKE 73
M ++I L LV A VSS + +E I E+W Q + Y+D E+ R K++ +
Sbjct: 1 MKVVIVLGLVVFAISSVSSINLNEI----IEEEWDLFKVQFKKIYEDVKEEAFRKKVYLD 56
Query: 74 NLEYIEKANK---EGNRTYKLGTNQFSDLTNDEFRALYTGYK--MPSPSHRSTTSSTFKY 128
N I + NK G TY L N F DL E+ + G+K + T +
Sbjct: 57 NKLKIARHNKLYETGEETYALEMNHFGDLMQHEYTKMMNGFKPSLAGGDKNFTDDDAVTF 116
Query: 129 QNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQL 188
+P S+DWR KG VTP+KNQ +CG CW+F+A ++EG ++G L+ LSEQ L
Sbjct: 117 LKSENVVIPKSIDWRKKGYVTPVKNQGQCGSCWSFSATGSLEGQHFRKTGVLVSLSEQNL 176
Query: 189 LDCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYE 247
+DCS GNNGC GG + AF YI N+G+ TE YPY+A C + + A +
Sbjct: 177 IDCSRKYGNNGCEGGLMDLAFKYIKSNKGLDTEKSYPYEAEDDKCRYNPENSGATDKGFV 236
Query: 248 EVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGIF-NGVC-GTQLDHAVTIVGFGT 304
++P GDE AL+ A+ ++ PVSIAI A S +FQ YK+G+F N C T+LDH V VG+GT
Sbjct: 237 DIPEGDEDALVHALATVGPVSIAIDASSEKFQFYKKGVFYNPRCSSTELDHGVLAVGYGT 296
Query: 305 TEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPLA 348
G +YW++KNSWG TWGD GY+ + R+ + CG+ + +SYPL
Sbjct: 297 DHKGGDYWIVKNSWGKTWGDQGYIMMARNKKNNCGVASSASYPLV 341
>gi|957281|gb|AAB33990.1| cysteine proteinase [Bombyx mori]
Length = 344
Score = 246 bits (628), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 138/344 (40%), Positives = 199/344 (57%), Gaps = 23/344 (6%)
Query: 24 LLVSCASQVVSSRSTHEQSVVEIHEKWMA---QHGRSYKDELEKEMRLKIFKENLEYIEK 80
+L+ CA VS+ Q + E+W A QH +YK E+E R+KI+ E+ I K
Sbjct: 5 VLLLCAVAAVSAV----QFFDLVKEEWSAFKLQHRLNYKSEVEDNFRMKIYAEHKHIIAK 60
Query: 81 ANKE---GNRTYKLGTN---QFSDLTNDEFRALYTGYKMPSPSHRST-----TSSTFKYQ 129
N++ G +YKLG N + D+ + EF G+ + +++ + K+
Sbjct: 61 HNQKYEMGLVSYKLGMNSWWEHGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFI 120
Query: 130 NLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLL 189
+ + +P +DWR GAVT IK+Q +CG CW+F+ A+EG +SG L+ LSEQ L+
Sbjct: 121 SPANVKLPEQVDWRKHGAVTDIKDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLI 180
Query: 190 DCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEE 248
DCS GNNGC GG + AF YI N GI TE YPY+ V C K A+ + +
Sbjct: 181 DCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTEQAYPYEGVDDKCRYNPKNTGAEDVGFVD 240
Query: 249 VPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGIFN--GVCGTQLDHAVTIVGFGTT 305
+P GDEQ L++AV ++ PVS+AI A T FQ Y G++N T LDH V +VG+GT
Sbjct: 241 IPEGDEQKLMEAVATVGPVSVAIDASHTHFQLYSSGVYNEEECSSTDLDHGVLVVGYGTD 300
Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRDE-GLCGIGTRSSYPLA 348
E G +YWL+KNSWG +WG+ GY+K++R++ CGI + +SYPL
Sbjct: 301 EQGVDYWLVKNSWGRSWGELGYIKMIRNKNNRCGIASSASYPLV 344
>gi|332375975|gb|AEE63128.1| unknown [Dendroctonus ponderosae]
Length = 338
Score = 246 bits (628), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 136/344 (39%), Positives = 199/344 (57%), Gaps = 22/344 (6%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMA---QHGRSYKDELEKEMRLKIFKEN 74
+ +++T+ V+C Q VS + E+W + QH + Y+ E E+ R+KIF +N
Sbjct: 4 LVLLVTIAVAC--QAVSFSEL-------VQEQWNSFKVQHKKQYESETEERFRMKIFMDN 54
Query: 75 LEYIEKANK---EGNRTYKLGTNQFSDLTNDEFRALYTGY-KMPSPSHRSTTSSTFKYQN 130
+ K NK +G YKL N++ DL + EF L G+ + + R + +
Sbjct: 55 SHKVAKHNKLFEQGLYPYKLAMNKYGDLLHHEFVGLLNGFNRTKTYLKRGELQDSITFIE 114
Query: 131 LSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLD 190
+ D+P ++DWR +GAVTP+K+Q CG CW+F+A A+EG ++ L+ LSEQ L+D
Sbjct: 115 PAHVDIPDTVDWRQEGAVTPVKDQGHCGSCWSFSATGALEGQHFRQTKKLVSLSEQNLVD 174
Query: 191 CSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEV 249
CS+ GNNGC GG + AF YI N GI TE YPY + K A + ++
Sbjct: 175 CSSRFGNNGCNGGLMDNAFRYIKNNGGIDTEAAYPYMGEDEKFRYSAKNRGATDKGFVDI 234
Query: 250 PSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGIF-NGVC-GTQLDHAVTIVGFGTTE 306
PSGDE L AV ++ P+SIAI A FQ Y G++ + C T+LDH V +VG+GT E
Sbjct: 235 PSGDEDKLKAAVATVGPISIAIDASHESFQLYSNGVYSDPTCSSTELDHGVLVVGYGTDE 294
Query: 307 D-GANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPLA 348
G +YWL+KNSWG+TWG GY+K+ R+ + CG+ T++SYPL
Sbjct: 295 KTGMDYWLVKNSWGDTWGLDGYIKMARNQDNQCGVATQASYPLV 338
>gi|41688064|dbj|BAD08618.1| cathepsin L preproprotein [Cyprinus carpio]
Length = 337
Score = 246 bits (628), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 141/343 (41%), Positives = 199/343 (58%), Gaps = 20/343 (5%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIH-EKWMAQHGRSYKDELEKEMRLKIFKENLE 76
M + + C S V ++ + +Q ++ H E+W HG+ Y E E+ R ++++NL+
Sbjct: 1 MRVFLAAFALCLSAVFAAPTLDKQ--LDNHWEQWKNWHGKKYH-EKEEGWRRMVWEKNLQ 57
Query: 77 YIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
IE N E G TY+LG N+F D+T++EFR + GYK R S F N
Sbjct: 58 KIELHNLEHSMGTHTYRLGMNRFGDMTHEEFRQVMNGYK--HKKERRFRGSLFMEPNF-- 113
Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST 193
+VP SLDWR+KG VTP+K+Q ECG CWAF+ A+EG ++G L+ LSEQ L+DCS
Sbjct: 114 LEVPNSLDWREKGYVTPVKDQGECGSCWAFSTTGAMEGQMFRKTGKLVSLSEQNLVDCSR 173
Query: 194 -NGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGT-CSAAQKPAAAKISNYEEVPS 251
GN GC GG ++AF YI G+ +E+ YPY C K +AA + + ++PS
Sbjct: 174 PEGNEGCNGGLMDQAFQYIKDQNGLDSEESYPYVGTDDQPCHYDPKYSAANDTGFVDIPS 233
Query: 252 GDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGI-FNGVCGT-QLDHAVTIVGF---GTT 305
G E AL+KA+ ++ PVS+AI A FQ Y+ GI + C + +LDH V VG+ G
Sbjct: 234 GKEHALMKAIAAVGPVSVAIDAGHESFQFYQSGIYYEKECSSEELDHGVLAVGYGFEGED 293
Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
DG YW++KNSW WGD GY+ + +D CGI T +SYPL
Sbjct: 294 VDGKKYWIVKNSWSENWGDKGYVYMAKDRHNHCGIATAASYPL 336
>gi|342305188|dbj|BAK55648.1| cathepsin L [Oplegnathus fasciatus]
Length = 336
Score = 246 bits (627), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 141/341 (41%), Positives = 199/341 (58%), Gaps = 18/341 (5%)
Query: 19 FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
+ + +L C S +S+ S Q + E + W + H + Y E E+ R ++++NL+ I
Sbjct: 1 MLPVAVLAVCLSAALSAPSLDPQ-LDEHWDLWKSWHTKKYH-EKEEGWRRMVWEKNLKKI 58
Query: 79 EKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD 135
E N E G TY+LG N F D+T++EFR + GYK S R S F N +
Sbjct: 59 ELHNLEHSMGEHTYRLGMNHFGDMTHEEFRQIMYGYK--RKSERKFKGSLFMEPNF--LE 114
Query: 136 VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-TN 194
P S+DWRD G VTP+K+Q +CG CWAF+ A+EG ++G L+ LSEQ L+DCS
Sbjct: 115 APRSVDWRDNGYVTPVKDQGQCGSCWAFSTTGAMEGQHFRKTGKLVSLSEQNLVDCSRPE 174
Query: 195 GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPG-TCSAAQKPAAAKISNYEEVPSGD 253
GN GC GG ++AF YI NQG+ +ED YPY C K +A + + ++PSG
Sbjct: 175 GNEGCNGGLMDQAFQYIKDNQGLDSEDSYPYLGTDDQPCHYDPKYNSANDTGFIDIPSGK 234
Query: 254 EQALLKAV-SMQPVSIAIAAYSTEFQSYKEGI-FNGVCGT-QLDHAVTIVGF---GTTED 307
E+AL+KAV ++ PVS+AI A FQ Y+ GI + C + +LDH V +VG+ G D
Sbjct: 235 ERALMKAVAAVGPVSVAIDAGHESFQFYQSGIYYEKECSSEELDHGVLVVGYGFEGEDVD 294
Query: 308 GANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
G YW++KNSW WGD GY+ + +D + CGI T +SYPL
Sbjct: 295 GKKYWIVKNSWSEKWGDKGYIYMAKDRKNHCGIATAASYPL 335
>gi|390347681|ref|XP_801784.2| PREDICTED: cathepsin L1-like isoform 2 [Strongylocentrotus
purpuratus]
Length = 336
Score = 246 bits (627), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 134/338 (39%), Positives = 204/338 (60%), Gaps = 15/338 (4%)
Query: 19 FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
F++ LV+CA+ + + + +W H +SY +++ + R +++EN++ I
Sbjct: 6 FLVAIGLVACATAAFVKPTNPDLDSRWL--EWKIAHTKSYTNDMHELERRLVWEENVKMI 63
Query: 79 EKANKEGN---RTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD 135
N + + + ++LG N++ D+ E R+ GYK S + STF S
Sbjct: 64 NMHNLDHSLHKKGFRLGMNEYGDMRLHEVRSTMNGYK--SSNVTKVQGSTF--LTPSNIQ 119
Query: 136 VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-TN 194
VP ++DWR KG VTP+KNQ +CG CWAF+ ++EG T ++ L+ LSEQ L+DCS T
Sbjct: 120 VPDTVDWRTKGYVTPVKNQGQCGSCWAFSTTGSLEGQTFKKTSKLVSLSEQNLVDCSRTE 179
Query: 195 GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDE 254
GN GC GG ++ F Y+I N GI +ED YPY A TC +A+++ + +V SGDE
Sbjct: 180 GNMGCEGGLMDQGFQYVIDNHGIDSEDCYPYDAEDETCHYKASCDSAEVTGFTDVTSGDE 239
Query: 255 QALLKAV-SMQPVSIAIAAYSTEFQSYKEGIFN--GVCGTQLDHAVTIVGFGTTEDGANY 311
QAL++AV S+ PVS+AI A FQ Y+ G+++ ++LDH V +VG+G T+ G +Y
Sbjct: 240 QALMEAVASVGPVSVAIDASHQSFQLYESGVYDEPECSSSELDHGVLVVGYG-TDGGKDY 298
Query: 312 WLIKNSWGNTWGDAGYMKIVRDE-GLCGIGTRSSYPLA 348
WL+KNSWG TWG +GY+K+ R++ CGI T +SYPL
Sbjct: 299 WLVKNSWGETWGLSGYIKMSRNKSNQCGIATSASYPLV 336
>gi|302763127|ref|XP_002964985.1| hypothetical protein SELMODRAFT_406652 [Selaginella moellendorffii]
gi|300167218|gb|EFJ33823.1| hypothetical protein SELMODRAFT_406652 [Selaginella moellendorffii]
Length = 320
Score = 246 bits (627), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 129/326 (39%), Positives = 188/326 (57%), Gaps = 29/326 (8%)
Query: 14 NTTPMFIIITLLVSCASQVVSSRSTHEQS----VVEIHEKWMAQHGRSYKDELEKEMRLK 69
N + +I+ ++V A ++ + E + + E W A+HG+SY + EK R+
Sbjct: 4 NMIALILILLVVVGAAPFAIARPAALEDDRALEIKNMFEDWAAKHGKSYSSDWEKARRMT 63
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQ 129
IF + L YIEK N N T+ LG N+FSDLTN EFRA Y G K P ++ + K
Sbjct: 64 IFSDTLAYIEKHNALPNTTFTLGLNKFSDLTNAEFRANYVG-KFKPPRYQDRRPA--KDV 120
Query: 130 NLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLL 189
++ ++ +PTSLDWR +GAVTPIK+Q +CG CWAF+A+A++E + + L+ LSEQQL+
Sbjct: 121 DVDVSSLPTSLDWRQEGAVTPIKDQGQCGSCWAFSAIASIESAHFLATNQLVSLSEQQLI 180
Query: 190 DCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEV 249
DC T + GC E+ YPY + G+C+ A K A+I+ + V
Sbjct: 181 DCDTV-DEGC-------------------QEEAYPYTGLAGSCN-ANKNKVAEITGFNVV 219
Query: 250 PSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGA 309
AL+KAVS PV++ I FQ+Y+ GI +G C DH V ++G+G TE G
Sbjct: 220 TKDKADALMKAVSKTPVTVGICGSDQNFQNYRSGILSGQCCNSRDHVVLVIGYG-TEGGM 278
Query: 310 NYWLIKNSWGNTWGDAGYMKIVRDEG 335
YW+IKNSWG +WG+ G+MKI + +G
Sbjct: 279 PYWIIKNSWGTSWGEDGFMKIEKKDG 304
>gi|224460525|gb|ACN43674.1| cathepsin L [Paralichthys olivaceus]
Length = 334
Score = 246 bits (627), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 132/311 (42%), Positives = 184/311 (59%), Gaps = 19/311 (6%)
Query: 50 WMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN---KEGNRTYKLGTNQFSDLTNDEFRA 106
W + GRSY E++ R++I+ N E + N +G+ TY+LG ++DL ++EF+
Sbjct: 29 WKLKFGRSYNSSSEEDKRMQIWLRNREIVMAHNAMADQGHSTYRLGMTFYADLEHEEFKQ 88
Query: 107 LYTG-----YKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCW 161
G + P S+ ++ NL P ++DWR G VTP+KNQ CG CW
Sbjct: 89 TVFGVCLGSFNASKPRGGSSFLKMHRFYNL-----PQTIDWRQWGFVTPVKNQGSCGSCW 143
Query: 162 AFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIATE 220
+F++ A+EG ++G L+ LSEQ+L+DCS N GN GC GG + AF YI+ GI TE
Sbjct: 144 SFSSTGALEGQNFRKTGRLVSLSEQELVDCSGNYGNYGCNGGWMDNAFRYIVNKGGIHTE 203
Query: 221 DEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQS 279
D YPY+ G C A A + Y ++PSG+E AL +AV + PVS+AI A FQ
Sbjct: 204 DSYPYEGQVGQCRANYGEIGATCTGYYDIPSGNEHALKEAVATFGPVSVAIHASDQSFQL 263
Query: 280 YKEGIFNG--VCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE-GL 336
Y G++N GT LDHAV IVG+G TE G +YWL+KNSWG WGD GY+K+ R+
Sbjct: 264 YHSGVYNNPYCSGTALDHAVLIVGYG-TEYGQDYWLVKNSWGPAWGDQGYIKMSRNRYNQ 322
Query: 337 CGIGTRSSYPL 347
CGI + +S+PL
Sbjct: 323 CGIASAASFPL 333
>gi|391328505|ref|XP_003738729.1| PREDICTED: cathepsin L-like [Metaseiulus occidentalis]
Length = 323
Score = 246 bits (627), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 131/301 (43%), Positives = 189/301 (62%), Gaps = 13/301 (4%)
Query: 54 HGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTG 110
HG+SY + E+ R ++F +++ I N G TY++G N+F+D+T++EFR + G
Sbjct: 26 HGKSYGHD-EEHFRRQLFYKSVAKINAHNLRHDLGLTTYRMGLNKFTDMTSEEFRN-FKG 83
Query: 111 YKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVE 170
K + ++ + T + L +PT +DWR+KG VTP+KNQ +CG CWAF+ ++E
Sbjct: 84 LKFDAT--KTKRNGTRFQKELLGEALPTQVDWREKGYVTPVKNQGQCGSCWAFSTTGSLE 141
Query: 171 GITKIRSGNLIQLSEQQLLDCS-TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVP 229
G +G L+ LSEQ L+DCS GNNGC GG + F YI QN GI TE+ YPY
Sbjct: 142 GQHFKATGKLVSLSEQNLVDCSRVEGNNGCNGGLMDNGFTYIQQNGGIDTEESYPYTGKD 201
Query: 230 GTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGIFN-- 286
G C+ + A++ + +VP DE AL AV S+ PVS+AI A + FQ YKEG+++
Sbjct: 202 GDCAFNENSVGARVKGFVDVPQRDEAALQAAVASVGPVSVAIDASNDSFQYYKEGVYDEP 261
Query: 287 GVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSY 345
+QLDH V +VG+G TE+G +YWL+KNSWG TWG GY+K++R+ E CGI + +SY
Sbjct: 262 SCSFSQLDHGVLVVGYG-TENGVDYWLVKNSWGPTWGQDGYIKMMRNKENQCGIASMASY 320
Query: 346 P 346
P
Sbjct: 321 P 321
>gi|269784818|ref|NP_001161481.1| cathepsin L1 precursor [Gallus gallus]
Length = 353
Score = 246 bits (627), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 147/361 (40%), Positives = 204/361 (56%), Gaps = 26/361 (7%)
Query: 2 VLIFERSGSFKINTTPMFIIITL---LVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSY 58
VL R S +N I++L L A +V +H Q W + H + Y
Sbjct: 3 VLFLARRLSRFVNMNVCLTILSLCLGLAFAAPRVDPDLDSHWQL-------WKSWHSKDY 55
Query: 59 KDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPS 115
E E+ R ++++NL+ IE N + G +YKLG NQF D+T +EFR L GYK
Sbjct: 56 H-EREESWRRVVWEKNLKMIELHNLDHSLGKHSYKLGMNQFGDMTAEEFRQLMNGYKHKK 114
Query: 116 PSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKI 175
S R S F S + P S+DWR+KG VTP+K+Q +CG CWAF+ A+EG
Sbjct: 115 -SERKYRGSQF--LEPSFLEAPRSVDWREKGYVTPVKDQGQCGSCWAFSTTGALEGQHFR 171
Query: 176 RSGNLIQLSEQQLLDCST-NGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPG-TCS 233
++G L+ LSEQ L+DCS GN GC GG ++AF Y+ N GI +E+ YPY A C
Sbjct: 172 KTGKLVSLSEQNLVDCSRPEGNQGCNGGLMDQAFQYVQDNGGIDSEESYPYTAKDDEDCR 231
Query: 234 AAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGI-FNGVCGT 291
+ AA + + ++P G E+AL+KAV S+ PVS+AI A + FQ Y+ GI + C +
Sbjct: 232 YKAEYNAANDTGFVDIPQGHERALMKAVASVGPVSVAIDAGHSSFQFYQSGIYYEPDCSS 291
Query: 292 Q-LDHAVTIVGF---GTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYP 346
+ LDH V +VG+ G DG YW++KNSWG WGD GY+ + +D + CGI T +SYP
Sbjct: 292 EDLDHGVLVVGYGFEGEDVDGKKYWIVKNSWGEKWGDKGYIYMAKDRKNHCGIATAASYP 351
Query: 347 L 347
L
Sbjct: 352 L 352
>gi|112490572|pdb|2FO5|A Chain A, Crystal Structure Of Recombinant Barley Cysteine
Endoprotease B Isoform 2 (Ep-B2) In Complex With
Leupeptin
gi|112490573|pdb|2FO5|B Chain B, Crystal Structure Of Recombinant Barley Cysteine
Endoprotease B Isoform 2 (Ep-B2) In Complex With
Leupeptin
gi|112490574|pdb|2FO5|C Chain C, Crystal Structure Of Recombinant Barley Cysteine
Endoprotease B Isoform 2 (Ep-B2) In Complex With
Leupeptin
gi|112490575|pdb|2FO5|D Chain D, Crystal Structure Of Recombinant Barley Cysteine
Endoprotease B Isoform 2 (Ep-B2) In Complex With
Leupeptin
Length = 262
Score = 246 bits (627), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 110/223 (49%), Positives = 152/223 (68%), Gaps = 8/223 (3%)
Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
++D+P S+DWR KGAVT +K+Q +CG CWAF+ V +VEGI IR+G+L+ LSEQ+L+DC
Sbjct: 1 VSDLPPSVDWRQKGAVTGVKDQGKCGSCWAFSTVVSVEGINAIRTGSLVSLSEQELIDCD 60
Query: 193 TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQ----KPAAAKISNYEE 248
T N+GC GG + AF YI N G+ TE YPY+A GTC+ A+ P I +++
Sbjct: 61 TADNDGCQGGLMDNAFEYIKNNGGLITEAAYPYRAARGTCNVARAAQNSPVVVHIDGHQD 120
Query: 249 VPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDG 308
VP+ E+ L +AV+ QPVS+A+ A F Y EG+F G CGT+LDH V +VG+G EDG
Sbjct: 121 VPANSEEDLARAVANQPVSVAVEASGKAFMFYSEGVFTGECGTELDHGVAVVGYGVAEDG 180
Query: 309 ANYWLIKNSWGNTWGDAGYMKIVRDE----GLCGIGTRSSYPL 347
YW +KNSWG +WG+ GY+++ +D GLCGI +SYP+
Sbjct: 181 KAYWTVKNSWGPSWGEQGYIRVEKDSGASGGLCGIAMEASYPV 223
>gi|158268253|gb|ABW25046.1| cathepsin L-like protease [Strongylus vulgaris]
Length = 354
Score = 246 bits (627), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 141/356 (39%), Positives = 206/356 (57%), Gaps = 27/356 (7%)
Query: 18 MFIIITLLVSCASQVVS----SRSTH----------EQSVVEIHEKW---MAQHGRSYKD 60
MF +++L++ CAS S SR H Q + E + W G+SY
Sbjct: 1 MFRLLSLVLLCASVFASIDSGSRHDHTIRLHRVKSLRQKIDEAFKLWDDYKESFGKSYNK 60
Query: 61 ELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPS 117
+ E + ++ F +N+ +I++ N+E G +T+++G N +DL ++R L GY+
Sbjct: 61 DEENDY-MEAFVKNVIHIDEHNQEHRLGRKTFEMGLNSIADLPFSQYRKL-NGYRHRRNF 118
Query: 118 HRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRS 177
S S+ K+ ++P S+DWRDKG VT +KNQ CG CWAF+A A+EG S
Sbjct: 119 GDSMQSNGTKWLAPFNVEIPDSVDWRDKGLVTDVKNQGMCGSCWAFSATGALEGQHARAS 178
Query: 178 GNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQ 236
G ++ LSEQ L+DCST GN+GC GG + AF YI N GI TE+ YPY C +
Sbjct: 179 GKMVSLSEQNLVDCSTKYGNHGCNGGLMDLAFEYIKDNHGIDTEESYPYVGRETKCHFKK 238
Query: 237 KPAAAKISNYEEVPSGDEQALLKAVSMQ-PVSIAIAAYSTEFQSYKEGI-FNGVCGT-QL 293
K A+ + ++P GDE+AL AV+ Q P+SIAI A FQ YK+G+ ++ C + +L
Sbjct: 239 KDIGAEDKGFVDLPEGDEEALKVAVATQGPISIAIDAGHRTFQLYKKGVYYDEECSSEEL 298
Query: 294 DHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE-GLCGIGTRSSYPLA 348
DH V +VG+GT + +YWLIKNSWG WG+ GY++I R+ CG+ T++SYPL
Sbjct: 299 DHGVLLVGYGTDPEAGDYWLIKNSWGPGWGEKGYIRIARNRSNHCGVATKASYPLV 354
>gi|38147395|gb|AAR12010.1| cathepsin L-like proteinase [Triatoma infestans]
Length = 328
Score = 246 bits (627), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 132/311 (42%), Positives = 190/311 (61%), Gaps = 18/311 (5%)
Query: 48 EKWMA---QHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTN 101
E+W+A Q G+SYK+ E+ R+ ++KEN I++ NK G +YKL N F DL
Sbjct: 24 EEWLAFKAQFGKSYKNSFEELFRMNVYKENQRKIDEHNKRYENGEVSYKLKMNHFGDLMQ 83
Query: 102 DEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCW 161
EF+AL K+ + + + F+ + +P +DWR KGAVTP+K+ +CG CW
Sbjct: 84 HEFKALN---KLKRSAKQQNSGEVFR---ATGGKLPAKVDWRQKGAVTPVKDPGQCGSCW 137
Query: 162 AFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIATE 220
AF++ ++ G +++ L+ LSEQQL+DCS N GN+GC GG +AF YI N GI TE
Sbjct: 138 AFSSTGSLGGQLFLKNKKLVSLSEQQLVDCSGNYGNDGCDGGIMVQAFQYIKGNGGIDTE 197
Query: 221 DEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVS-MQPVSIAIAAYSTEFQS 279
YPY+A C K A Y ++ GDE AL +AV+ + P+S+AI A + FQ
Sbjct: 198 GSYPYEAEDDKCRYKTKSVAGTDKGYVDIAQGDENALKEAVAEIGPISVAIDAGNLSFQF 257
Query: 280 YKEGIFNG--VCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE-GL 336
Y EGI++ T+LDH V +VG+G TE+G +YWL+KNSWG +WG+ GY+KI R+
Sbjct: 258 YSEGIYDEPFCSNTELDHGVLVVGYG-TENGQDYWLVKNSWGPSWGENGYIKIARNHNNH 316
Query: 337 CGIGTRSSYPL 347
CGI + +SYP+
Sbjct: 317 CGIASMASYPI 327
>gi|340368362|ref|XP_003382721.1| PREDICTED: cathepsin L1-like [Amphimedon queenslandica]
Length = 329
Score = 245 bits (626), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 129/306 (42%), Positives = 186/306 (60%), Gaps = 12/306 (3%)
Query: 48 EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRT-YKLGTNQFSDLTNDEFRA 106
E W A +G+SY E++ R ++EN I+ N + ++ Y L N F DLT+ EF +
Sbjct: 28 ELWKATYGKSYLTLEEEKYRRDTWEENSLLIKTHNTDSDKHGYTLEMNSFGDLTSAEFSS 87
Query: 107 LYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAV 166
LY GY+ + S SS+ + +P+SLDWRDK VT +KNQ +CG CWAF+
Sbjct: 88 LYNGYRQNLETSGSVFSSSLR------NAMPSSLDWRDKKVVTDVKNQGKCGSCWAFSTT 141
Query: 167 AAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEYPY 225
++EG+ +++G+L+ LSEQQL+DCS GNNGC GG+ AF YI G TE+ YPY
Sbjct: 142 GSLEGLHALKTGHLVSLSEQQLMDCSVKYGNNGCDGGNMRSAFQYIKDAGGDDTEESYPY 201
Query: 226 QAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGI 284
A +C K A Y +PSGDE +L+ A+ + P+S+A+ A FQ YK+GI
Sbjct: 202 TAKNESCRFDPKKVGATDEGYVRIPSGDEVSLMHALYEVGPISVAMDAGLKTFQFYKKGI 261
Query: 285 FNG-VC-GTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDEG-LCGIGT 341
++ +C T L+H VT++G+G + DG+ YWL+KNSWG WG GY + R G +CG+ T
Sbjct: 262 YSDYLCSNTHLNHGVTLIGYGESSDGSPYWLVKNSWGKDWGIDGYFMLARYVGNMCGVAT 321
Query: 342 RSSYPL 347
+SYP+
Sbjct: 322 DASYPI 327
>gi|2765358|emb|CAA74241.1| cathepsin L [Litopenaeus vannamei]
Length = 325
Score = 245 bits (626), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 126/316 (39%), Positives = 191/316 (60%), Gaps = 17/316 (5%)
Query: 43 VVEIHEKWM---AQHGRSYKDELEKEMRLKIFKENLEYIEKAN---KEGNRTYKLGTNQF 96
+ + ++W A+HGR Y E+ RL +F++N ++I+ N + G T+ L NQF
Sbjct: 15 IPSLRQQWQNFKAEHGRRYASVQEERYRLSVFEQNQQFIDDHNARFENGEVTFTLQMNQF 74
Query: 97 SDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKE 156
D+T++E A G+ + +P+ R ++ K + ++ P +DWR KGAVTP+K+QK+
Sbjct: 75 GDMTSEEIVATMNGF-LGAPTRRP--AAVLKADDETL---PEKVDWRTKGAVTPVKDQKQ 128
Query: 157 CGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNN-GCLGGSREKAFAYIIQNQ 215
CG CWAF+ ++EG ++ G L+ LSEQ L+DCS N GC+GG ++AF YI N+
Sbjct: 129 CGSCWAFSTTGSLEGQHFLKDGKLVSLSEQNLVDCSDKFRNMGCMGGLMDQAFRYIKANK 188
Query: 216 GIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYS 274
GI TED YPY+A G C A + Y +V G E AL KAV ++ P+S+ I A
Sbjct: 189 GIDTEDSYPYEAQDGKCRFDASNVGATDTGYVDVEHGSESALKKAVATIGPISVGIDASQ 248
Query: 275 TEFQSYKEGIFNG--VCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR 332
+ F Y G+++ T LDH V VG+G+ E+G ++WL+KNSW +WGD GY+K+ R
Sbjct: 249 STFHFYHTGVYHDDHCSSTMLDHGVLAVGYGSDENGGDFWLVKNSWNTSWGDKGYIKMSR 308
Query: 333 DE-GLCGIGTRSSYPL 347
+ CGI +++SYPL
Sbjct: 309 NRNNNCGIASQASYPL 324
>gi|303283194|ref|XP_003060888.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226457239|gb|EEH54538.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 422
Score = 245 bits (626), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 130/314 (41%), Positives = 194/314 (61%), Gaps = 17/314 (5%)
Query: 48 EKWMAQHGRSYKDELEKEMRLKIFKENLEYIE---KANKEGNRTYKLGTNQFSDLTNDEF 104
++W+A HG++Y E+ RL IF +N E++ +A+ G +++ L N +DLT +EF
Sbjct: 71 DRWLATHGKAYACPKERAKRLAIFADNAEFVRVHNEAHAAGKKSHWLRLNHLADLTREEF 130
Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDV--PTSLDWRDKGAVTPIKNQKECGCCWA 162
+ + GY S ++S N DV P ++DW +GAVTP+KNQ +CG CWA
Sbjct: 131 KHML-GYDA-SKKRVESSSPPVDAANWEYADVTPPETMDWVSRGAVTPVKNQGQCGSCWA 188
Query: 163 FAAVAAVEGITKIRSGNLIQLSEQQLLDCST-NGNNGCLGGSREKAFAYIIQNQGIATED 221
F+ V AVEG+ +++G+LI LSEQ+L+ C+ GNNGC GG + F +I++N+G+ E+
Sbjct: 189 FSTVGAVEGVVAVKTGDLISLSEQELVSCAKIGGNNGCKGGLMDNGFEWIVENRGVDDEE 248
Query: 222 EYPYQAVPGTCS--AAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQS 279
++ Y A C+ ++ AA I +++VP DE AL KAVS QPV++AI A EFQ
Sbjct: 249 DWGYLAKDRRCNWFKKRRAKAASIDGFKDVPRNDEDALKKAVSQQPVAVAIEADHREFQL 308
Query: 280 YKEGIFNGVCGTQLDHAVTIVGFGTTEDGA---NYWLIKNSWGNTWGDAGYMKIVR---- 332
Y G+F+G CGT LDH V +VG+G + A +YW +KNSWG WG+ GY++I R
Sbjct: 309 YSGGVFDGECGTNLDHGVLVVGYGYDGESAGHKHYWTVKNSWGAKWGEEGYIRIARGGMG 368
Query: 333 DEGLCGIGTRSSYP 346
G CG+ ++SYP
Sbjct: 369 PAGQCGVAMQASYP 382
>gi|158268255|gb|ABW25047.1| cathepsin L-like protease [Strongylus vulgaris]
Length = 354
Score = 245 bits (626), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 141/355 (39%), Positives = 206/355 (58%), Gaps = 27/355 (7%)
Query: 18 MFIIITLLVSCASQVVS----SRSTH----------EQSVVEIHEKW---MAQHGRSYKD 60
MF +++L++ CAS S SR H Q + E + W G+SY
Sbjct: 1 MFRLLSLVLLCASVFASIDSGSRRDHTIRLHRVKSLRQKIDEAFKLWDDYKEAFGKSYNK 60
Query: 61 ELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPS 117
+ E + ++ F +N+ +I++ N+E G +T+++G N +DL ++R L GY+
Sbjct: 61 DEENDY-MEAFVKNVIHIDEHNQEHRLGRKTFEMGLNSIADLPFSQYRKL-NGYRHRRNF 118
Query: 118 HRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRS 177
S S+ K+ ++P S+DWRDKG VT +KNQ CG CWAF+A A+EG S
Sbjct: 119 GDSMQSNGTKWLAPFNVEIPDSVDWRDKGLVTDVKNQGMCGSCWAFSATGALEGQHARAS 178
Query: 178 GNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQ 236
G ++ LSEQ L+DCST GN+GC GG + AF YI N GI TE+ YPY C +
Sbjct: 179 GKMVSLSEQNLVDCSTKYGNHGCNGGLMDLAFEYIKDNHGIDTEESYPYVGRETKCHFKK 238
Query: 237 KPAAAKISNYEEVPSGDEQALLKAVSMQ-PVSIAIAAYSTEFQSYKEGI-FNGVCGT-QL 293
K A+ + ++P GDE+AL AV+ Q P+SIAI A FQ YK+G+ ++ C + +L
Sbjct: 239 KDIGAEDKGFVDLPEGDEEALKVAVATQGPISIAIDAGHRTFQLYKKGVYYDEECSSEEL 298
Query: 294 DHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE-GLCGIGTRSSYPL 347
DH V +VG+GT + +YWLIKNSWG WG+ GY++I R+ CG+ T++SYPL
Sbjct: 299 DHGVLLVGYGTDPEAGDYWLIKNSWGPGWGEKGYIRIARNRSNHCGVATKASYPL 353
>gi|110349475|gb|ABG73218.1| cathepsin L 2 precursor [Diaprepes abbreviatus]
Length = 348
Score = 245 bits (625), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 142/349 (40%), Positives = 194/349 (55%), Gaps = 26/349 (7%)
Query: 21 IITLLVSCASQVVSSRSTHEQSVV-EIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIE 79
+ +L+V A+ V S + Q +V E E++ +HG+ Y+ E E E R +F ENL I
Sbjct: 1 MYSLVVLLATLVAYSHAISYQVLVQEQWEQFKLEHGKVYESESENEYRQSVFMENLFQIN 60
Query: 80 KANK---EGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKY-------- 128
+ NK G +Y++ N DLT DEF +YT MP S + +
Sbjct: 61 EHNKLYEMGLSSYQMAMNHLGDLTKDEFMRIYT-VNMPQLPQSENLSDSEPWLDLPQDLQ 119
Query: 129 --------QNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNL 180
NL D+PT +DWR KGAVTP+KNQ+ CG CW+F+A A+E ++ L
Sbjct: 120 GFVTYALPTNLDEVDLPTDIDWRQKGAVTPVKNQRNCGSCWSFSATGALEAQWFKKTNKL 179
Query: 181 IQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA 239
I LSEQQL+DCS GN+GC GG AF YI +N GI TE YPY A G C+
Sbjct: 180 ISLSEQQLVDCSGRYGNHGCHGGWMHWAFGYIKENGGIDTEQSYPYTAKDGRCAYKPGNK 239
Query: 240 AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFN-GVCGTQLDHAVT 298
AA +S VP G+ Q K S+ P+SIA A S +FQ Y G+++ CG L+HA+
Sbjct: 240 AATVSQVIMVPRGENQLAAKVSSVGPISIA-AEVSHKFQFYHSGVYDEPQCGHSLNHAML 298
Query: 299 IVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE-GLCGIGTRSSYP 346
VG+G+ G N+WL+KNSWG WGD GY+++ +D+ CGI +SYP
Sbjct: 299 AVGYGSM-GGKNFWLVKNSWGTGWGDQGYIRMAKDKNNQCGIALMASYP 346
>gi|389608655|dbj|BAM17937.1| cathepsin L [Papilio xuthus]
Length = 341
Score = 245 bits (625), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 131/344 (38%), Positives = 198/344 (57%), Gaps = 23/344 (6%)
Query: 20 IIITLLVSCASQVVSSRSTHEQSVVEIHEKWMA---QHGRSYKDELEKEMRLKIFKENLE 76
++I L V A+ VS + E+W A +H + Y E+E + R+KI+ EN
Sbjct: 4 LVILLCVVAAASAVSFFDL-------VKEEWNAFKMEHQKQYDSEVEDKFRMKIYAENKH 56
Query: 77 YIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
I K N++ G +++L N++ D+ + EF G+ + + + + + +
Sbjct: 57 NIAKHNQKYARGEVSFRLKQNKYGDMLHHEFVHTMNGFNKTTKNSKGLFGKSAGERGATF 116
Query: 134 -----TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQL 188
+P +DWR GAVT +K+Q +CG CW+F++ A+EG R+ L+ LSEQ L
Sbjct: 117 ITPANVHLPDHVDWRKHGAVTEVKDQGKCGSCWSFSSTGALEGQHYRRTNILVSLSEQNL 176
Query: 189 LDCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYE 247
+DCS GNNGC GG + AF YI N+GI TE YPY+ + C K A + +
Sbjct: 177 IDCSAAYGNNGCNGGLMDNAFKYIKDNRGIDTEKSYPYEGIDDKCRYNPKNTGADDNGFV 236
Query: 248 EVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGI-FNGVCGTQ-LDHAVTIVGFGT 304
++PSGDE L+ AV ++ PVS+AI A + FQ Y +G+ F+ C + LDH V +VG+GT
Sbjct: 237 DIPSGDEGKLMAAVATVGPVSVAIDASQSSFQFYSDGVYFDENCSSSSLDHGVLVVGYGT 296
Query: 305 TEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
E+G +YWL+KNSWG +WGD GY+K+ R+ + CGI T +SYPL
Sbjct: 297 DENGGDYWLVKNSWGRSWGDLGYIKMARNRDNHCGIATAASYPL 340
>gi|417409876|gb|JAA51427.1| Putative cathepsin s, partial [Desmodus rotundus]
Length = 342
Score = 244 bits (624), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 133/338 (39%), Positives = 200/338 (59%), Gaps = 18/338 (5%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIH-EKWMAQHGRSYKDELEKEMRLKIFKENLE 76
M ++ +L+ C+S + H+ ++ H + W +G+ YK++ E+ +R I+++NL+
Sbjct: 12 MKWLVLVLLGCSSAMAQ---LHKDPTLDRHWDLWKKTYGKQYKEKNEEGVRRLIWEKNLK 68
Query: 77 YIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
++ N E G +Y LG N D+T++E AL + ++PS R+ T + Q L
Sbjct: 69 FVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVTALMSSLRVPSQWQRNVTYKSNPNQKL-- 126
Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST 193
P S+DWRDKG VT +K Q CG CWAF+AV A+E K+++G L+ LS Q L+DCS
Sbjct: 127 ---PDSVDWRDKGCVTDVKYQGSCGSCWAFSAVGALEAQVKLKTGKLVSLSAQNLVDCSV 183
Query: 194 N--GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPS 251
N GC GG +AF YII N GI +E YPY+A+ G C K AA S Y E+P
Sbjct: 184 GKYSNRGCNGGFMTEAFQYIIDNNGIESEASYPYKAMDGKCQYDSKYRAATCSRYTELPE 243
Query: 252 GDEQALLKAVSMQ-PVSIAIAAYSTEFQSYKEGI-FNGVCGTQLDHAVTIVGFGTTEDGA 309
E AL +AV+ + PVS+AI A F Y+ G+ ++ C ++H V +VG+G +G
Sbjct: 244 DSEDALKEAVANKGPVSVAIDASHPSFFLYRSGVYYDPACTLHVNHGVLVVGYGNL-NGK 302
Query: 310 NYWLIKNSWGNTWGDAGYMKIVRDEG-LCGIGTRSSYP 346
+YWL+KNSWG +GD GY+++ R+ G CGI + +SYP
Sbjct: 303 DYWLVKNSWGLHFGDQGYIRMARNSGNHCGIASYASYP 340
>gi|244539471|dbj|BAH82657.1| cysteine protease [Lotus japonicus]
Length = 286
Score = 244 bits (624), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 119/252 (47%), Positives = 174/252 (69%), Gaps = 6/252 (2%)
Query: 43 VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTND 102
++E+ E WM++HG+ Y+ EK +R +IFK+NL++I++ NK + Y LG N+F+DL++
Sbjct: 4 LIELFESWMSRHGKIYESIEEKLLRFEIFKDNLKHIDETNKVVS-NYWLGLNEFADLSHH 62
Query: 103 EFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWA 162
EF+ Y G K+ + R + S F Y+++ D+P S+DWR KGAVT IKNQ CG CWA
Sbjct: 63 EFKKQYLGLKVDFSTRRES-SEEFTYRDV---DLPKSVDWRKKGAVTNIKNQGSCGSCWA 118
Query: 163 FAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDE 222
F+ VAAVEGI +I +GNL LSEQ+L+DC N+GC GG + AF++I++N G+ ED+
Sbjct: 119 FSTVAAVEGINQIVTGNLTSLSEQELIDCDRTYNSGCNGGLMDYAFSFIVENGGLHKEDD 178
Query: 223 YPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYK 281
YPY GTC +++ + IS Y +VP +EQ+LLKA++ QP+S+AI A +FQ Y
Sbjct: 179 YPYIMEEGTCEMSKEESQVVTISGYHDVPQNNEQSLLKALANQPLSVAIEASGRDFQFYS 238
Query: 282 EGIFNGVCGTQL 293
G+F+G CGTQL
Sbjct: 239 GGVFDGHCGTQL 250
>gi|327289213|ref|XP_003229319.1| PREDICTED: cathepsin S-like [Anolis carolinensis]
Length = 333
Score = 244 bits (624), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 132/317 (41%), Positives = 198/317 (62%), Gaps = 14/317 (4%)
Query: 40 EQSVVEIH-EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQ 95
+ S+++ H E W ++ + Y+++ E+ +R I+++NL ++ N E G +Y+LG N
Sbjct: 21 KDSMLDGHWELWKKKYNKEYQNKEEEGVRRVIWEKNLRFVMLHNLEQSLGLHSYELGMNH 80
Query: 96 FSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQK 155
D+T++E AL TG K+P R++T Y P ++DWR+KG VT +KNQ
Sbjct: 81 LGDMTSEEVTALMTGLKIPVSQSRNST----LYWARQGASAPDTVDWREKGCVTNVKNQG 136
Query: 156 ECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQN 214
CG CWAF+AV A+E K+++GNL+ LS Q L+DCS+ GN+GC GG AF Y+I N
Sbjct: 137 SCGSCWAFSAVGALECQLKLKTGNLVSLSPQNLVDCSSAFGNHGCNGGYISAAFQYVIYN 196
Query: 215 QGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVS-MQPVSIAIAAY 273
GI +E YPY GTC + AA S Y ++PSG+E AL AV+ PVS+AI A
Sbjct: 197 NGIDSEASYPYTGQSGTCRYNLQGRAATCSRYVDLPSGNEAALKDAVANFGPVSVAIDAS 256
Query: 274 STEFQSYKEGIFNGVCGT--QLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIV 331
F +++G+++ T ++H V +VG+G TEDG +YWL+KNSWG ++GD GY+KI
Sbjct: 257 RPSFFLFRKGVYDDPSCTSAHINHGVLVVGYG-TEDGIDYWLVKNSWGVSFGDQGYIKIA 315
Query: 332 RD-EGLCGIGTRSSYPL 347
R+ + CGI ++ +YPL
Sbjct: 316 RNHDNRCGIASQCTYPL 332
>gi|6630974|gb|AAF19631.1|AF194427_1 cysteine proteinase precursor [Myxine glutinosa]
Length = 324
Score = 244 bits (624), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 131/308 (42%), Positives = 187/308 (60%), Gaps = 12/308 (3%)
Query: 48 EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN---KEGNRTYKLGTNQFSDLTNDEF 104
E W ++G+SY E+ +R ++++ NL+ +++ N +G Y+LG N ++DL N+EF
Sbjct: 20 ESWKGKYGKSYLGRGEEVLRKRVWESNLQIVQQHNVLADQGQANYRLGMNTYADLYNEEF 79
Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFA 164
AL + +S+T TFK L +P+S+DWR++G VTP+K+Q +CG CW F+
Sbjct: 80 MALKGSGGLLQAKDKSSTQ-TFK--PLVGVTLPSSVDWRNQGYVTPVKDQGQCGSCWTFS 136
Query: 165 AVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEY 223
A ++EG ++GNL+ LSEQQL+DC+ GN GC GG E A+ YI G+ E Y
Sbjct: 137 ATGSLEGQHFAKTGNLLSLSEQQLVDCAGRYGNYGCNGGLMESAYDYIKGVGGVELESAY 196
Query: 224 PYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKE 282
PY A G C + A Y +P GDEQAL++AV ++ PV+++I A FQ Y+
Sbjct: 197 PYTARDGRCKFDRSKVVATCKGYVVIPVGDEQALMQAVGTIGPVAVSIDASGYSFQLYES 256
Query: 283 GI--FNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE-GLCGI 339
G+ F T LDH V VG+G TE G NYWL+KNSWG WGD GY+K+ +D+ CGI
Sbjct: 257 GVYDFRRCSSTNLDHGVLAVGYG-TEGGQNYWLVKNSWGPGWGDQGYIKMSKDKNNQCGI 315
Query: 340 GTRSSYPL 347
T S YPL
Sbjct: 316 ATDSCYPL 323
>gi|7523482|dbj|BAA94210.1| putative cysteine protease [Oryza sativa Japonica Group]
gi|10800060|dbj|BAB16480.1| putative cysteine protease [Oryza sativa Japonica Group]
Length = 349
Score = 244 bits (624), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 133/320 (41%), Positives = 182/320 (56%), Gaps = 29/320 (9%)
Query: 45 EIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEF 104
++ E+WMA+ G+ Y EKE R +F++N+ +I L NQF+DLTNDEF
Sbjct: 39 QMFEEWMAKFGKKYPCHGEKEYRFGVFRDNVRFIRSYRPPAGYNSALRVNQFADLTNDEF 98
Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFA 164
+ +TG K P P + + +P +DWR KGAVT +K+Q CG CWAFA
Sbjct: 99 VSTHTGAKPPCPKDAP--------RGVDPIWLPCCIDWRYKGAVTDVKDQGACGSCWAFA 150
Query: 165 AVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYP 224
AVAA+EG+T+IR+G L LSEQ+L+DC T G++GC GG ++AF + GI E Y
Sbjct: 151 AVAAIEGLTQIRTGKLTPLSEQELVDCDT-GSSGCAGGHTDRAFELVAAKGGITAESGYR 209
Query: 225 YQAVPGTCSA--AQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKE 282
Y+ G C A A AA+I + VP GDE+ L AV+ QPV+ I A FQ Y
Sbjct: 210 YEGYRGKCRADDALFNHAARIGGHRAVPPGDERQLATAVARQPVTAYIDASGPAFQFYGS 269
Query: 283 GIFNGVCGTQL---------DHAVTIVGFGTTEDGAN---YWLIKNSWGNTWGDAGYMKI 330
G+F G CG+ +HAVT+VG+ +DGA+ YW+ KNSWG TWG+ GY+ +
Sbjct: 270 GVFPGPCGSGSGAAAAAPTTNHAVTLVGY--CQDGASGKKYWVAKNSWGKTWGEKGYILL 327
Query: 331 VRD----EGLCGIGTRSSYP 346
+D G CG+ YP
Sbjct: 328 EKDVASPHGTCGVAVSPFYP 347
>gi|307111936|gb|EFN60170.1| hypothetical protein CHLNCDRAFT_59551 [Chlorella variabilis]
Length = 364
Score = 244 bits (623), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 134/350 (38%), Positives = 194/350 (55%), Gaps = 29/350 (8%)
Query: 23 TLLVSCASQVVSSRSTHE----------QSVVEIHEKWM----AQHGRSYKDELE-KEMR 67
LLV+C+ V++ E +S E + W+ R+Y E E R
Sbjct: 12 VLLVACSCLAVAAGFRFENHRLFIQQAIESPREAFDFWVHTVKPPSNRAYASSAEVYERR 71
Query: 68 LKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFK 127
I+ +NL + + N + ++ L ++DL+ DE+R+ GY R ++ F
Sbjct: 72 FNIWLDNLRFAHEYNAR-HTSHWLSMGVYADLSQDEYRSKALGYNAHLHKKRPLRAAPFL 130
Query: 128 YQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQ 187
Y+ T P +DW GAVTP+K+Q CG CWAF+ AVEG I +G L+ LSEQ
Sbjct: 131 YKG---TVPPEEVDWVAGGAVTPVKDQLLCGSCWAFSTTGAVEGANAIATGKLVSLSEQM 187
Query: 188 LLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQ-KPAAAKISNY 246
L+DC + GC GG + AF +I+ N GI TED+YPY+A G C + + I Y
Sbjct: 188 LVDCDREYDTGCRGGFMDSAFDFIVNNGGIDTEDDYPYRAEDGICQDNRTRRHVVTIDGY 247
Query: 247 EEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTE 306
++VP DE AL+KAV+ QPVS+AI A FQ Y G+F+ CGT LDHAV +VG+GT
Sbjct: 248 QDVPPNDENALMKAVAHQPVSVAIEADQLAFQLYGGGVFDAECGTALDHAVLVVGYGTAS 307
Query: 307 DGAN---YWLIKNSWGNTWGDAGYMKIVRD------EGLCGIGTRSSYPL 347
+G + YWL+KNSWG WG+ GY++++R+ EG CG+ +S+P+
Sbjct: 308 NGTHNLPYWLVKNSWGAEWGEKGYIRLLRNLGKDAPEGQCGLAMYASFPI 357
>gi|116794072|gb|ABK26996.1| unknown [Picea sitchensis]
Length = 367
Score = 244 bits (623), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 137/347 (39%), Positives = 195/347 (56%), Gaps = 25/347 (7%)
Query: 22 ITLLVSCASQVVSSRSTHE----QSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
I LVS A V S + +V + ++W+ +HG+ Y EK RL+IF+ NL+Y
Sbjct: 14 IICLVSAAKAVQHSYEVGDINSGNGLVRLFDRWLGRHGKLYGSHEEKARRLQIFRTNLQY 73
Query: 78 IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGY--KMPSPSHRSTTSSTFKYQNLSMT- 134
I NK N +++LG N+F+DLTN+EF+ Y G K R+ L T
Sbjct: 74 IHAHNKNSNSSFRLGLNKFADLTNEEFKTRYFGKNSKQWRDRRRTELEGAELRPVLKQTV 133
Query: 135 -------DVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQ 187
+ +SLDWR KGAVT +K+Q +CG CWAF+ A+EG+ I +G L+ LSEQ+
Sbjct: 134 GSQSSSCSIASSLDWRKKGAVTGVKDQAQCGSCWAFSTTGAIEGVNFISTGKLVSLSEQE 193
Query: 188 LLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAA-AKISNY 246
L+ C N GC GG + AF ++IQN GI TE +Y Y V TC+ ++ I Y
Sbjct: 194 LVACDAT-NYGCEGGDMDYAFTWVIQNGGIDTEKDYSYTGVDSTCNTNKEAKKIVSIDGY 252
Query: 247 EEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCG---TQLDHAVTIVGFG 303
+V S D+ ALL A QPVS+ I + +FQ Y GI++G C +DHAV +VG+
Sbjct: 253 TDV-SPDDSALLCAAGSQPVSVGIDGSAIDFQLYTGGIYDGDCSGNPDDIDHAVLVVGY- 310
Query: 304 TTEDGANYWLIKNSWGNTWGDAGYMKIVRDE----GLCGIGTRSSYP 346
+ ++G +YW++KNSWG WG GY I+R+ G+C I +SYP
Sbjct: 311 SAKNGKDYWIVKNSWGTDWGLEGYFYILRNTELPYGVCAINAMASYP 357
>gi|151176971|gb|ABR88030.1| digestive cysteine protease [Dermestes frischii]
Length = 339
Score = 244 bits (623), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 132/343 (38%), Positives = 198/343 (57%), Gaps = 21/343 (6%)
Query: 19 FIIITLLVSCASQVVSSRSTHEQSVVEIHEKW---MAQHGRSYKDELEKEMRLKIFKENL 75
F ++ L+ +Q VS + E+W QH + YK + E++ R+KIF EN
Sbjct: 3 FFVLALVFIVGAQAVSFFDL-------VQEQWGTFKLQHKKQYKSDTEEKFRMKIFMENS 55
Query: 76 EYIEKANK---EGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNL- 131
+ K NK G +YKL N+++D+ + EF G+ + TS +
Sbjct: 56 HKVAKXNKLYEMGLVSYKLKINKYADMLHHEFVHTVNGFNRTKNTPLLGTSEDEQGATFI 115
Query: 132 --SMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLL 189
+ P ++DWR+ GAVT +K+Q CG CW+F+A A+EG ++ L+ LSEQ L+
Sbjct: 116 APANVKFPENVDWREHGAVTXVKDQGHCGSCWSFSATGALEGQHFRKTNKLVSLSEQNLV 175
Query: 190 DCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEE 248
DCST GN+GC GG + AF Y+ N GI TE YPY A C K + A + +
Sbjct: 176 DCSTKFGNDGCNGGLMDNAFKYVKYNHGIDTEASYPYHADDEKCHYNPKTSGATDRGFVD 235
Query: 249 VPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGI-FNGVCGT-QLDHAVTIVGFGTT 305
+P+GDE+ L+ AV ++ PVS+AI A FQ Y EG+ ++ C + +LDH V +VG+GT
Sbjct: 236 IPTGDEEKLMAAVATVGPVSVAIDASHESFQLYSEGVYYDPECSSEELDHGVLVVGYGTD 295
Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
E+G +YW++KNSWG +WG+ GY+K+ R+ + CGI T++SYPL
Sbjct: 296 ENGQDYWIVKNSWGESWGEQGYIKMARNRDNNCGIATQASYPL 338
>gi|356552228|ref|XP_003544471.1| PREDICTED: LOW QUALITY PROTEIN: cysteine proteinase RD21a-like
[Glycine max]
Length = 351
Score = 244 bits (623), Expect = 5e-62, Method: Compositional matrix adjust.
Identities = 131/339 (38%), Positives = 197/339 (58%), Gaps = 16/339 (4%)
Query: 18 MFIIITLLVSCASQVVSSRSTH--------EQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+F++ + + ++S + H + V+ + E+W+ +H + Y EKE R +
Sbjct: 8 LFMVFAVSSALDMSIISHDNAHADRATRRTDDEVMSMFEEWLVKHDKVYNALGEKEKRFQ 67
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQ 129
IFK NL +I++ N NRTYKLG N F+DLTN E+RA+Y P T Y
Sbjct: 68 IFKNNLRFIDERNSL-NRTYKLGLNVFADLTNAEYRAMYLRTWDDGPRLDLDTPPRNHYV 126
Query: 130 NLSMTDVPTSLDWRDKGAVTPIKNQ-KECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQL 188
+P S+DWR +GAVTP+KNQ C CWAF AV AVE + KI++G+LI LSEQ++
Sbjct: 127 PRVGDTIPKSVDWRKEGAVTPVKNQGATCNSCWAFTAVGAVESLVKIKTGDLISLSEQEV 186
Query: 189 LDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEE 248
+DC+T+ + GC GG + + YI +N GI+ E +YPY+ G C + +K A I +
Sbjct: 187 VDCTTSSSRGCGGGDIQHGYIYIRKN-GISLEKDYPYRGDEGKCDSNKKNAIVTIDGHGW 245
Query: 249 VPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDG 308
VP+ E+AL +A+ A Y +F +G+F G CGT+L+HA+ +VG+GT +DG
Sbjct: 246 VPTQLEEALNRALF---CYCAYFLYVDKF-FLCQGVFKGKCGTELNHALLLVGYGTEKDG 301
Query: 309 ANYWLIKNSWGNTWGDAGYMKIVRDEGLCGIGTRSSYPL 347
+YW+ KNS+ + WG+ GY++I R C G YP+
Sbjct: 302 -DYWIAKNSYSDKWGENGYIRIQRKLSTCKFGNGGYYPI 339
>gi|116563690|gb|ABJ99858.1| cathepsin L [Hippoglossus hippoglossus]
Length = 336
Score = 244 bits (623), Expect = 5e-62, Method: Compositional matrix adjust.
Identities = 141/341 (41%), Positives = 204/341 (59%), Gaps = 18/341 (5%)
Query: 19 FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
+ + +L +C S V+S+ Q + E + W + H + Y E E+ R ++++NL+ I
Sbjct: 1 MLPLLVLTACLSSVLSAPVLDAQ-LNEHWDLWKSWHSKKYH-EKEEGWRRMVWEKNLQKI 58
Query: 79 EKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD 135
E N E G +++LG N F D+T++EFR + GYK+ + R T S F N MT
Sbjct: 59 ELHNLEHSMGTHSFRLGMNHFGDMTHEEFRQIMNGYKLKT--QRKFTGSLFMEPNF-MT- 114
Query: 136 VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-TN 194
P+++DWR+KG VTP+K+Q +CG CWAF+ A+EG ++G L+ LSEQ L+DCS
Sbjct: 115 APSAVDWREKGYVTPVKDQGQCGSCWAFSTTGALEGQQFRKTGKLVSLSEQNLVDCSRPE 174
Query: 195 GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPG-TCSAAQKPAAAKISNYEEVPSGD 253
GN GC GG ++AF Y+ NQG+ +ED YPY C +A + + +VPSG
Sbjct: 175 GNEGCGGGLMDQAFQYVTDNQGLDSEDSYPYTGTDDQPCHYDPLYNSANDTGFVDVPSGK 234
Query: 254 EQALLKAV-SMQPVSIAIAAYSTEFQSYKEGI-FNGVCGT-QLDHAVTIVGFG-TTED-- 307
E AL+KAV S+ PVS+AI A FQ Y+ GI + C + +LDH V VG+G ED
Sbjct: 235 EHALMKAVASVGPVSVAIDAGHESFQFYQSGIYYEKECSSEELDHGVLAVGYGFEGEDKM 294
Query: 308 GANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
G +W++KNSWG WGD GY+ + +D + CGI T +SYPL
Sbjct: 295 GKKFWIVKNSWGEKWGDKGYIYMAKDRKNHCGIATAASYPL 335
>gi|157644745|gb|ABV59078.1| cathepsin L [Lates calcarifer]
Length = 337
Score = 244 bits (622), Expect = 5e-62, Method: Compositional matrix adjust.
Identities = 137/341 (40%), Positives = 196/341 (57%), Gaps = 17/341 (4%)
Query: 19 FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
+ + +L C S +S+ S Q + + + W + H + Y E E+ R ++++NL+ I
Sbjct: 1 MLPLAVLAVCLSAALSAPSLDPQ-LDDHWDLWKSWHSKKYH-EKEEGWRRMVWEKNLKKI 58
Query: 79 EKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD 135
E N E G Y+LG N F D+T++EFR + GYK + R S F N +
Sbjct: 59 ELHNLEHSMGKHPYRLGMNHFGDMTHEEFRQIMNGYKQ-RKTERKFKGSLFMEPNF--LE 115
Query: 136 VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-TN 194
P +LDWRDKG VTP+K+Q +CG CWAF+ A+EG ++G L+ LSEQ L+DCS
Sbjct: 116 APRALDWRDKGYVTPVKDQGQCGSCWAFSTTGALEGQQFRKTGKLVSLSEQNLVDCSRPE 175
Query: 195 GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPG-TCSAAQKPAAAKISNYEEVPSGD 253
GN GC GG ++AF Y+ NQG+ +ED YPY C +A + + +VPSG
Sbjct: 176 GNEGCNGGLMDQAFQYVKDNQGLDSEDSYPYLGTDDQPCHYDPNYNSANDTGFVDVPSGK 235
Query: 254 EQALLKAV-SMQPVSIAIAAYSTEFQSYKEGIF--NGVCGTQLDHAVTIVGF---GTTED 307
E+AL+KAV ++ PVS+AI A FQ Y+ GI+ +LDH V +VG+ G D
Sbjct: 236 ERALMKAVAAVGPVSVAIDAGHESFQFYQSGIYYEKDCSSEELDHGVLVVGYGYEGEDVD 295
Query: 308 GANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
G YW++KNSW WGD GY+ + +D + CGI T +SYPL
Sbjct: 296 GKKYWIVKNSWSEKWGDKGYIYMAKDRKNHCGIATAASYPL 336
>gi|357216861|gb|AET71138.1| cysteine peptidase isoform b [Sphenophorus levis]
Length = 324
Score = 244 bits (622), Expect = 6e-62, Method: Compositional matrix adjust.
Identities = 136/336 (40%), Positives = 197/336 (58%), Gaps = 23/336 (6%)
Query: 19 FIIITLLV-SCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
FI+ +LLV + ++ ++ H QS + +HG++YK++ E+ R IF+ENL
Sbjct: 4 FILASLLVVAVSATLLKEDGAHFQS-------FKLKHGKTYKNQAEETKRFAIFRENLRK 56
Query: 78 IEKAN---KEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT 134
IE N K+G +Y G N+F+D+T EF+A+ PS +T + +Q
Sbjct: 57 IEAHNAEYKQGIHSYTQGINKFADMTRAEFKAMLATQVKTKPSIVATKT----FQLADGV 112
Query: 135 DVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN 194
VP S+DWR + VTPIK+Q +CG CWAFA V + EG + +G L + SEQQL+DC+T+
Sbjct: 113 SVPESIDWRSRNVVTPIKDQAQCGSCWAFAVVGSTEGAYALSTGKLTRFSEQQLVDCTTD 172
Query: 195 GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDE 254
N GC GG + F Y IQ G+ E +YPY G CS K+S+Y VP+ +E
Sbjct: 173 LNYGCDGGYLDDTFPY-IQTNGLELESDYPYTGYDGYCSYESSKVVTKVSSYVSVPA-NE 230
Query: 255 QALLKAV-SMQPVSIAIAAYSTEFQSYKEGIFNG-VCGTQ-LDHAVTIVGFGTTEDGANY 311
QALL+AV + PV+IAI A + Q Y GI + C + LDH V VG+ +E+G +Y
Sbjct: 231 QALLEAVGTAGPVAIAINA--DDLQFYFSGIIDDKYCDPEYLDHGVLAVGY-DSENGRDY 287
Query: 312 WLIKNSWGNTWGDAGYMKIVRDEGLCGIGTRSSYPL 347
WLIKNSWG WG++GY + +R + +CG+ + YPL
Sbjct: 288 WLIKNSWGADWGESGYFRFLRGQNICGVKEDAVYPL 323
>gi|217323618|gb|ACK38176.1| midgut cysteine peptidase, partial [Sphenophorus levis]
Length = 324
Score = 243 bits (621), Expect = 7e-62, Method: Compositional matrix adjust.
Identities = 135/336 (40%), Positives = 198/336 (58%), Gaps = 23/336 (6%)
Query: 19 FIIITLLV-SCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
FI+ +LLV + ++ ++ H QS + +HG++YK++ E+ R IF+ENL
Sbjct: 4 FILASLLVVAVSATLLKEDGVHFQS-------FKLKHGKTYKNQAEETKRFAIFRENLRK 56
Query: 78 IEKAN---KEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT 134
IE N K+G +Y G N+F+D+T EF+A+ PS +T + +Q
Sbjct: 57 IEAHNAEYKQGIHSYTQGINKFADMTRAEFKAMLATQVKTKPSIVATKT----FQLADGV 112
Query: 135 DVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN 194
VP S+DWR + VTPIK+Q +CG CW+FA V + EG + +G L + SEQQL+DC+T+
Sbjct: 113 SVPESIDWRSRNVVTPIKDQAQCGSCWSFAVVGSTEGAYALSTGKLTRFSEQQLVDCTTD 172
Query: 195 GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDE 254
N GC GG + F Y IQ G+ E +YPY G+CS K+S+Y VP+ +E
Sbjct: 173 LNYGCDGGYLDDTFPY-IQTNGLELESDYPYTGYDGSCSYDSSKVVTKVSSYVSVPA-NE 230
Query: 255 QALLKAV-SMQPVSIAIAAYSTEFQSYKEGIFNG-VCGTQ-LDHAVTIVGFGTTEDGANY 311
QALL+AV + PV+IAI A + Q Y GI + C + LDH V VG+ +E+G +Y
Sbjct: 231 QALLEAVGTAGPVAIAINA--DDLQFYFSGIIDDKYCDPEWLDHGVLAVGY-NSENGLDY 287
Query: 312 WLIKNSWGNTWGDAGYMKIVRDEGLCGIGTRSSYPL 347
WLIKNSWG WG++GY + +R + +CG+ + YPL
Sbjct: 288 WLIKNSWGADWGESGYFRFLRGQNICGVKEDAVYPL 323
>gi|218187750|gb|EEC70177.1| hypothetical protein OsI_00904 [Oryza sativa Indica Group]
gi|222617983|gb|EEE54115.1| hypothetical protein OsJ_00884 [Oryza sativa Japonica Group]
Length = 327
Score = 243 bits (621), Expect = 7e-62, Method: Compositional matrix adjust.
Identities = 133/320 (41%), Positives = 182/320 (56%), Gaps = 29/320 (9%)
Query: 45 EIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEF 104
++ E+WMA+ G+ Y EKE R +F++N+ +I L NQF+DLTNDEF
Sbjct: 17 QMFEEWMAKFGKKYPCHGEKEYRFGVFRDNVRFIRSYRPPAGYNSALRVNQFADLTNDEF 76
Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFA 164
+ +TG K P P + + +P +DWR KGAVT +K+Q CG CWAFA
Sbjct: 77 VSTHTGAKPPCPKDAP--------RGVDPIWLPCCIDWRYKGAVTDVKDQGACGSCWAFA 128
Query: 165 AVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYP 224
AVAA+EG+T+IR+G L LSEQ+L+DC T G++GC GG ++AF + GI E Y
Sbjct: 129 AVAAIEGLTQIRTGKLTPLSEQELVDCDT-GSSGCAGGHTDRAFELVAAKGGITAESGYR 187
Query: 225 YQAVPGTCSA--AQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKE 282
Y+ G C A A AA+I + VP GDE+ L AV+ QPV+ I A FQ Y
Sbjct: 188 YEGYRGKCRADDALFNHAARIGGHRAVPPGDERQLATAVARQPVTAYIDASGPAFQFYGS 247
Query: 283 GIFNGVCGTQ---------LDHAVTIVGFGTTEDGAN---YWLIKNSWGNTWGDAGYMKI 330
G+F G CG+ +HAVT+VG+ +DGA+ YW+ KNSWG TWG+ GY+ +
Sbjct: 248 GVFPGPCGSGSGAAAAAPTTNHAVTLVGY--CQDGASGKKYWVAKNSWGKTWGEKGYILL 305
Query: 331 VRD----EGLCGIGTRSSYP 346
+D G CG+ YP
Sbjct: 306 EKDVASPHGTCGVAVSPFYP 325
>gi|324512246|gb|ADY45078.1| Cathepsin L [Ascaris suum]
Length = 388
Score = 243 bits (620), Expect = 9e-62, Method: Compositional matrix adjust.
Identities = 132/314 (42%), Positives = 188/314 (59%), Gaps = 17/314 (5%)
Query: 47 HEKWM---AQHGRSYKDELEKEMRLKIFKENLEYIEKAN---KEGNRTYKLGTNQFSDLT 100
+E+W QHG++Y+DE + + F NLE I K N + G ++++GTN +DL
Sbjct: 80 YEQWRLFKEQHGKNYEDEETENDHMLAFLSNLEEIRKHNARYQRGESSFEMGTNHITDLP 139
Query: 101 NDEFRALYTGYK-MPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGC 159
+E+R L GYK SHR+ T + +VP DWRD G VT +KNQ CG
Sbjct: 140 FEEYRKL-NGYKPRYDDSHRNGTKFLVPFN----INVPGHWDWRDHGYVTEVKNQGMCGS 194
Query: 160 CWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIA 218
CWAF+A A+EG K + G+L+ LSEQ L+DCS GNNGC GG + AF YI N G+
Sbjct: 195 CWAFSATGALEGQHKRKIGSLVSLSEQNLVDCSRKYGNNGCNGGLMDYAFEYIKDNHGVD 254
Query: 219 TEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQ-PVSIAIAAYSTEF 277
TE YPY+ C +K A+ Y ++P GDE+ L AV+ Q P+S+AI A F
Sbjct: 255 TEASYPYKGKEMKCHFNKKTVGAEDEGYVDLPEGDEEKLKIAVATQGPISVAIDAGHPSF 314
Query: 278 QSYKEGI-FNGVCGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-E 334
Q Y++G+ + C ++ LDH V +VG+GT E +YW++KNSWG WG+ GY++I R+ +
Sbjct: 315 QMYRKGVYYEPQCSSESLDHGVLVVGYGTDEIDGDYWIVKNSWGPGWGEKGYVRIARNRD 374
Query: 335 GLCGIGTRSSYPLA 348
CGI +++SYP+
Sbjct: 375 NHCGIASKASYPIV 388
>gi|413947586|gb|AFW80235.1| hypothetical protein ZEAMMB73_542371 [Zea mays]
Length = 264
Score = 243 bits (620), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 117/254 (46%), Positives = 173/254 (68%), Gaps = 10/254 (3%)
Query: 13 INTTPMFIIITLLVSCA----SQVVSSRS-THEQSVVEIHEKWMAQHGRSYKDELEKEMR 67
+++ +++ +L+ C S V+++R + + ++ E HE+WMA++GR YKD +K R
Sbjct: 2 VSSKAFLLLLAVLIGCVCSFPSPVLAARELSDDAAMAERHERWMAEYGRVYKDAADKARR 61
Query: 68 LKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFK 127
++FK+N ++E N + + LG NQF+DLT + F+A G+K S TT FK
Sbjct: 62 FEVFKDNFAFVESFNADKKNKFWLGVNQFADLTTEAFKA-NKGFKPISAEKAPTTG--FK 118
Query: 128 YQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQ 187
Y+NLS++ +PT++DWR KGAVTPIKNQ +CGCCWAF+AVAAVEGI K+ +GNL+ LSEQ+
Sbjct: 119 YENLSISALPTAVDWRTKGAVTPIKNQGQCGCCWAFSAVAAVEGIVKLSTGNLVSLSEQE 178
Query: 188 LLDCSTNG-NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNY 246
L+DC T+ + GC GG + AF ++I+N G+ATE YPY+AV G C K +AA I +
Sbjct: 179 LVDCDTHSMDEGCEGGWMDSAFEFVIKNGGLATESSYPYKAVDGKCKGGSK-SAATIKGH 237
Query: 247 EEVPSGDEQALLKA 260
E+VP +E AL+KA
Sbjct: 238 EDVPPNNEAALMKA 251
>gi|18308182|gb|AAL67857.1|AF462309_1 cysteine proteinase [Acanthamoeba healyi]
Length = 330
Score = 243 bits (620), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 134/327 (40%), Positives = 189/327 (57%), Gaps = 13/327 (3%)
Query: 28 CASQVVSSRSTHEQSVVEIHEKWMAQHGRS-YKDELEKEMRLKIFKENLEYIEKANKEGN 86
C V S+ + + + KWM ++ +S Y+ E I++ N+ E+ N++ N
Sbjct: 11 CGLFVASTLAATHDPLTGVFAKWMRENTKSNYRFVYSNEEF--IYRWNVWRDEEHNRQ-N 67
Query: 87 RTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKG 146
++Y L NQF DLTN EF L+ G H ++ T +P+ DWR KG
Sbjct: 68 KSYFLAMNQFGDLTNAEFNRLFKGLAFDYSKHAKIHTAA---PEAPATGIPSEFDWRQKG 124
Query: 147 AVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSRE 205
AVT +KNQ +CG CW+F+ + EG +++G L+ LSEQ L+DCS + GNNGC GG +
Sbjct: 125 AVTHVKNQGQCGSCWSFSTTGSTEGANFLKTGRLVSLSEQNLIDCSVSYGNNGCNGGLMD 184
Query: 206 KAFAYIIQNQGIATEDEYPYQ-AVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQ 264
AF YII N+GI TE YPYQ A P TC ++ Y +V SGDE ALL A +
Sbjct: 185 YAFEYIINNRGIDTEASYPYQTAGPLTCQYNAANKGGSLTGYTDVTSGDENALLNAAVKE 244
Query: 265 PVSIAIAAYSTEFQSYKEGIF--NGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTW 322
PVS+AI A FQ Y G++ + TQLDH V +VG+G +E+G ++W +KNSWG +W
Sbjct: 245 PVSVAIDASHNSFQFYSGGVYYESACSSTQLDHGVLVVGWG-SENGQDFWWVKNSWGASW 303
Query: 323 GDAGYMKIVRDE-GLCGIGTRSSYPLA 348
G GY+K+ R++ CGI T +SYP A
Sbjct: 304 GLNGYIKMSRNQNNNCGIATAASYPTA 330
>gi|2804262|dbj|BAA24442.1| cysteine proteinase [Sitophilus zeamais]
Length = 338
Score = 243 bits (620), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 130/339 (38%), Positives = 203/339 (59%), Gaps = 15/339 (4%)
Query: 22 ITLLVSCASQVVSSRSTHEQSVVEIHEKWMA---QHGRSYKDELEKEMRLKIFKENLEYI 78
+ L + A+ V+S ++ +V+ E+W + QH ++Y E E+ R+KIF EN +
Sbjct: 1 MKLFLILAAVVISCQAVSFYDLVQ--EQWSSFKMQHSKNYDSETEERFRMKIFMENAHKV 58
Query: 79 EKANK---EGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPS--HRSTTSSTFKYQNLSM 133
K NK +G +KLG N+++D+ + EF + G+ + S + ++ + +
Sbjct: 59 AKHNKLFSQGFVKFKLGLNKYADMLHHEFVSTLNGFNKTKNNILKGSDLNDAVRFISPAN 118
Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST 193
+P ++DWRDKGAVT +K+Q CG CW+F+A ++EG ++G L+ LSEQ L+DCS
Sbjct: 119 VKLPDTVDWRDKGAVTEVKDQGHCGSCWSFSATGSLEGQHFRKTGKLVSLSEQNLVDCSG 178
Query: 194 N-GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSG 252
GNNGC GG + AF YI N GI TE YPY A C + + A + ++
Sbjct: 179 RYGNNGCNGGLMDNAFRYIKDNGGIDTEKSYPYLAEDEKCHYKAQNSGATDKGFVDIEEA 238
Query: 253 DEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGIFNGV-CGTQ-LDHAVTIVGFGTTEDGA 309
+E L AV ++ PVSIAI A FQ Y +G+++ C +Q LDH V +VG+GT++DG
Sbjct: 239 NEDDLKAAVATVGPVSIAIDASHETFQLYSDGVYSDPECSSQELDHGVLVVGYGTSDDGQ 298
Query: 310 NYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
+YWL+KNSWG +WG GY+K+ R+ + +CG+ +++SYPL
Sbjct: 299 DYWLVKNSWGPSWGLNGYIKMARNQDNMCGVASQASYPL 337
>gi|254674508|dbj|BAH86062.1| cysteine protease [Haemaphysalis longicornis]
Length = 333
Score = 243 bits (619), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 130/338 (38%), Positives = 194/338 (57%), Gaps = 15/338 (4%)
Query: 19 FIIITLLVSCASQVVSSRSTHEQSVVEIH-EKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
+ + L C + +++ S Q ++ E + +QH ++Y +E+ +R KIF EN
Sbjct: 1 MLRLAFLCGCVAAAIAASS---QEILRTEWEAFKSQHNKAYSSHVEELLRFKIFTENTLL 57
Query: 78 IEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT 134
+ K N + G +YKL N+F DL EF + GY+ ++ + NL+ +
Sbjct: 58 VAKHNAKYAKGLVSYKLAMNKFGDLLPHEFAKMVNGYR--GKQNKEQRPTFIPPANLNDS 115
Query: 135 DVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN 194
+PT++DWR KGAVTP+KNQ +CG CWAF+ ++EG ++G L+ LSEQ L+DCS +
Sbjct: 116 SLPTTVDWRKKGAVTPVKNQGQCGSCWAFSTTGSLEGQHFRKTGKLVSLSEQNLVDCSDD 175
Query: 195 -GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGD 253
GN GC GG + F YI N GI TE+ +PY A G C + A + + ++ G
Sbjct: 176 FGNQGCNGGLMDNGFQYIKANGGIDTEESHPYTAQDGDCKFKKADVGATDAGFVDIQQGS 235
Query: 254 EQALLKAV-SMQPVSIAIAAYSTEFQSYKEGIFN--GVCGTQLDHAVTIVGFGTTEDGAN 310
E L KAV ++ PVS+AI A FQ Y +G+++ +QLDH V VG+G ++G
Sbjct: 236 EDDLKKAVATVGPVSVAIDASHGSFQLYSQGVYDEPDCSSSQLDHGVLTVGYG-VKNGKK 294
Query: 311 YWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
YWL+KNSWG WGD GY+ + RD + CGI + +SYPL
Sbjct: 295 YWLVKNSWGGDWGDNGYILMSRDKDNQCGIASSASYPL 332
>gi|354502595|ref|XP_003513369.1| PREDICTED: cathepsin L1-like [Cricetulus griseus]
Length = 330
Score = 242 bits (618), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 132/338 (39%), Positives = 200/338 (59%), Gaps = 20/338 (5%)
Query: 17 PMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLE 76
P+F + TL + VV + TH+ S+ + ++W +HG++Y + E + R +++ N +
Sbjct: 3 PIFFLATLCLG----VVPAAPTHDPSLDDEWQEWKTRHGKTYSMDEEGQKR-AVWENNRK 57
Query: 77 YIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
IE N++ G + L N F DLTN EFR L TG++ T +Q +
Sbjct: 58 MIELHNEDYTKGKHGFHLEMNAFGDLTNIEFRQLMTGFQ------SMGTKEMNVFQEPLL 111
Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS- 192
DVP S+DWR+ VTP+K+Q +C CWAF+AV ++EG ++G LI LSEQ L+DCS
Sbjct: 112 GDVPKSVDWRNLSYVTPVKDQGQCSSCWAFSAVGSLEGQIFRKTGQLISLSEQNLVDCSW 171
Query: 193 TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSG 252
+ GN GC GG E AF Y+ +N+G+ T YPY+A G C K +AA ++++ ++P
Sbjct: 172 SYGNIGCFGGLMEYAFRYVKENRGLDTRVSYPYEARNGPCRYDPKNSAANVTDFVKIPI- 230
Query: 253 DEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGIF--NGVCGTQLDHAVTIVGFGTTEDGA 309
E AL+KAV ++ P+S+ + ++ F+ YK G++ + LDHAV +VG+G DG
Sbjct: 231 SEDALMKAVATVGPISVGVDSHHHSFRFYKGGMYYEPHCSSSNLDHAVLVVGYGEESDGN 290
Query: 310 NYWLIKNSWGNTWGDAGYMKIVRDE-GLCGIGTRSSYP 346
YW++KNSWG WG GY+K+ RD CGI T + YP
Sbjct: 291 KYWMVKNSWGQGWGMNGYIKMARDRNNNCGIATYAIYP 328
>gi|227018328|gb|ACP18830.1| cysteine proteinase 1 [Chrysomela tremula]
Length = 323
Score = 242 bits (618), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 137/334 (41%), Positives = 206/334 (61%), Gaps = 21/334 (6%)
Query: 22 ITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKA 81
+ L+++ A+ VV+ + +Q E+ + HG++YK E+++R IF++ L I
Sbjct: 1 MKLIIAFAAFVVAINAASDQ---ELWADFKKAHGKTYKSLREEKLRFNIFQDTLREIAAH 57
Query: 82 N---KEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPT 138
N + G TY L NQFSD+T++EFRA+ PS + NL++ P
Sbjct: 58 NAKYESGESTYYLAINQFSDITDEEFRAMLMKNVESRPSLED-----MEIANLTVGAAPE 112
Query: 139 SLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST-NGNN 197
S+DWR +GAV PI+NQ++CG CWAF+AVAAVEG I+SG+ LS QQL+DCST GN+
Sbjct: 113 SIDWRTEGAVLPIRNQEDCGSCWAFSAVAAVEGQAAIKSGSKTPLSVQQLVDCSTEGGNS 172
Query: 198 GCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQAL 257
GC GG AF YI N G+ ++ +YPY +C A + + K++ Y++V S E +L
Sbjct: 173 GCNGGLMNGAFDYIKAN-GLESDAKYPYTGTDDSCKADKSSSLVKLTGYKKVAS-SEASL 230
Query: 258 LKAV-SMQPVSIAIAAYSTEFQSYKEGIFNGV--CGTQLDHAVTIVGFGTTEDGANYWLI 314
+AV ++ P+S+A+ Y+ ++SY GIFN + G LDH VT VG+G T++G YW +
Sbjct: 231 KEAVGTVGPISVAV--YADLWRSYGGGIFNNILCLGFGLDHGVTAVGYG-TDNGKKYWPV 287
Query: 315 KNSWGNTWGDAGYMKIVRDE-GLCGIGTRSSYPL 347
KNSWG +WG+ GY+++ RD CGI ++SYP+
Sbjct: 288 KNSWGESWGEEGYIRMARDTLHNCGINQQASYPI 321
>gi|357122137|ref|XP_003562772.1| PREDICTED: fruit bromelain-like [Brachypodium distachyon]
Length = 358
Score = 242 bits (618), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 132/315 (41%), Positives = 183/315 (58%), Gaps = 21/315 (6%)
Query: 52 AQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGY 111
A + R+Y E+ R ++++ N++YIE N+ G+ TY+LG NQF+DLT EFRA+YT
Sbjct: 45 ATYNRTYASPEERLRRFEVYRRNVDYIEAMNRRGDLTYELGENQFADLTVQEFRAMYTMP 104
Query: 112 ----KMPSPSHRSTTSSTFK----------YQNLSMTDVPTSLDWRDKGAVTPIKNQKEC 157
P R +T Y + PTS+DWR KGAVTP+K+Q C
Sbjct: 105 ARVDSRPDAWRRRQMITTLAGPVTEDGGSYYSDAWEEAGPTSVDWRSKGAVTPVKDQGGC 164
Query: 158 GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
GCCWAFA VA +EG+ KI++G L+ LSEQ+L+DC + G E A ++ N G+
Sbjct: 165 GCCWAFATVATIEGLHKIKTGQLVSLSEQELVDCDDADDGCGGGLP-EIAMEWVAHNGGL 223
Query: 218 ATEDEYPYQAVPGTCSAAQKP-AAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTE 276
TE YPY G C + AAKI+ + V + E L +AV+ QPV++AI A +
Sbjct: 224 TTEANYPYTGKAGKCDRGKASNHAAKIAAAQMVRANSEAELERAVARQPVAVAINAPDS- 282
Query: 277 FQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR---- 332
YK G+++G C + DHAVT+VG+G G YW+IKNSW TWG+ GY ++ R
Sbjct: 283 LMFYKSGVYSGPCTAEFDHAVTVVGYGADNKGHKYWIIKNSWAETWGEKGYGRMQRGVAA 342
Query: 333 DEGLCGIGTRSSYPL 347
EGLCGI T +SYP+
Sbjct: 343 KEGLCGIATHASYPV 357
>gi|326501772|dbj|BAK02675.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 333
Score = 242 bits (617), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 134/343 (39%), Positives = 197/343 (57%), Gaps = 19/343 (5%)
Query: 13 INTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFK 72
++ + ++ L SC + + H + W + + Y D E+ +R ++
Sbjct: 1 MHAISVLAVLALAFSCTLAFDAKLNQHWKL-------WKEANNKRYSDA-EEHVRRATWE 52
Query: 73 ENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQ 129
NL+ +++ N + G TY LG N+++D+T EF + GY R+ TF +
Sbjct: 53 GNLQKVQEHNLQADLGVHTYWLGMNKYADMTVTEFVKVMNGYNATMRGQRTQDRHTFSFN 112
Query: 130 NLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLL 189
S +P ++DWRDKG VT +K+Q +CG CWAF+ A+EG ++G L+ LSEQ L+
Sbjct: 113 --SKIALPDTVDWRDKGYVTDVKDQGQCGSCWAFSTTGALEGQHFKQTGKLVSLSEQNLV 170
Query: 190 DCS-TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEE 248
DCS GN GC GG ++AF YI +N GI TED YPY+AV C A + + +
Sbjct: 171 DCSGKQGNMGCNGGLMDQAFEYIKENNGIDTEDSYPYEAVDNQCRFKAANVGATDTGFTD 230
Query: 249 VPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGIFNG-VCG-TQLDHAVTIVGFGTT 305
+ S DE AL +AV ++ P+S+AI A T FQ YK G++N C T+LDH V VG+G T
Sbjct: 231 ITSKDESALQQAVATVGPISVAIDAGHTSFQLYKHGVYNEPFCSQTRLDHGVLAVGYG-T 289
Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRDE-GLCGIGTRSSYPL 347
+ G +YWL+KNSWG WGD GY+K+ R++ CGI T +SYPL
Sbjct: 290 DSGKDYWLVKNSWGEGWGDKGYIKMTRNKRNQCGIATAASYPL 332
>gi|113120265|gb|ABI30272.1| VXH-A, partial [Vasconcellea x heilbornii]
Length = 318
Score = 242 bits (617), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 131/287 (45%), Positives = 175/287 (60%), Gaps = 18/287 (6%)
Query: 38 THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
T + ++ + + WM ++ + YKD EK R +IFK+NL+YI++ NK+ N TY LG F+
Sbjct: 39 TSTEKLINLFDSWMVEYDKVYKDIDEKIYRFEIFKDNLKYIDETNKK-NNTYWLGLTSFT 97
Query: 98 DLTNDEFRALYTGYKMPSPSHRSTTSST----FKYQNLSMTDVPTSLDWRDKGAVTPIKN 153
DLTNDEF+ Y G P + STT + F Y ++ ++P S+DWR KGAVTP++N
Sbjct: 98 DLTNDEFKEKYVG---SIPENWSTTEESNDKEFIYDDV--VNIPASIDWRQKGAVTPVRN 152
Query: 154 QKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQ 213
Q CG CW F++VAAVEGI KI +G L+ LSEQ+LLDC + GC GG A Y +
Sbjct: 153 QGSCGSCWTFSSVAAVEGINKIVTGQLVSLSEQELLDCERR-SYGCRGGFPPYALQY-VA 210
Query: 214 NQGIATEDEYPYQAVPGTCSAAQ-KPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAA 272
N GI YPY+ V C AAQ K K V +EQAL++ +++QPVSI + A
Sbjct: 211 NSGIHLRQYYPYEGVQRQCRAAQAKGPKVKTDGVGRVQRNNEQALIQRIAIQPVSIVVEA 270
Query: 273 YSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWG 319
FQ+Y+ GIF G CGT +DHAV VG+G Y LIKNSWG
Sbjct: 271 KGRAFQNYRGGIFAGPCGTSIDHAVAAVGYGN-----GYILIKNSWG 312
>gi|392873948|gb|AFM85806.1| cathepsin L [Callorhinchus milii]
Length = 338
Score = 242 bits (617), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 133/312 (42%), Positives = 189/312 (60%), Gaps = 16/312 (5%)
Query: 48 EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEF 104
E+W + HG+SY ++ E+ R +++++L IE N E G +++LG N F D+ N+EF
Sbjct: 30 EQWKSWHGKSY-EQKEETWRRMVWEKHLRVIEIHNLEHSLGKHSFRLGMNHFGDMPNEEF 88
Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFA 164
R L GYK +H+ S F N +VP +DWRD+G VTP+K+Q +CG CWAF+
Sbjct: 89 RQLMNGYKYKQ-THKKLQGSHFLEPNF--LEVPKHVDWRDEGYVTPVKDQGQCGSCWAFS 145
Query: 165 AVAAVEGITKIRSGNLIQLSEQQLLDCST-NGNNGCLGGSREKAFAYIIQNQGIATEDEY 223
A+EG R+G L+ LSEQ L++CS GN GC GG ++AF Y+ N GI +ED Y
Sbjct: 146 TTGALEGQHFRRTGQLVSLSEQNLVECSKPEGNEGCNGGLMDQAFQYVKDNGGIDSEDSY 205
Query: 224 PYQAVPGT-CSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYK 281
PY T C + AA + + ++PSG E+AL+KA+ ++ PVS+AI A T FQ Y+
Sbjct: 206 PYVGTDDTPCHYNPQYNAANDTGFVDIPSGKERALMKAIAAVGPVSVAIDAGHTSFQFYQ 265
Query: 282 EGI-FNGVC-GTQLDHAVTIVGFGTTE---DGANYWLIKNSWGNTWGDAGYMKIVRD-EG 335
GI F C T LDH V +VG+G + DG YW++KNSW G GY+ + +D +
Sbjct: 266 SGIYFEAECSSTDLDHGVLVVGYGVEKRDTDGKKYWIVKNSWSEKLGQNGYILMAKDKDN 325
Query: 336 LCGIGTRSSYPL 347
CGI T +SYPL
Sbjct: 326 HCGIATAASYPL 337
>gi|395856029|ref|XP_003800445.1| PREDICTED: cathepsin S [Otolemur garnettii]
Length = 331
Score = 242 bits (617), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 136/336 (40%), Positives = 196/336 (58%), Gaps = 19/336 (5%)
Query: 20 IIITLLVSCASQVVSSRSTHEQSVVEIH-EKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
++ TLLV C++ H ++ H W +G+ Y ++ E+ R I+++NL+++
Sbjct: 4 LVWTLLVCCSAMA----QLHRDPALDHHWHLWKKTYGKQYTEKNEETERRLIWEKNLKFV 59
Query: 79 EKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD 135
N E G +Y LG N D+T++E +L T K+P S R+ T + Q L
Sbjct: 60 MLHNLEHSMGMHSYDLGMNHLGDMTSEEVVSLMTCLKVPRQSQRNVTYKSSPNQKL---- 115
Query: 136 VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG 195
P SLDWR+KG VT +K Q CG CWAF+AV A+E K+ +G L+ LS Q L+DCST
Sbjct: 116 -PDSLDWREKGCVTEVKYQGSCGSCWAFSAVGALEAQLKLTTGKLVSLSAQNLVDCSTEK 174
Query: 196 --NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGD 253
N GC GG +AF YII N GI +E YPY+A+ C K AA S Y E+P G
Sbjct: 175 YRNEGCHGGFMTEAFQYIIDNNGIDSEASYPYKAMDEKCQYDSKNRAATCSKYTELPFGS 234
Query: 254 EQALLKAVSMQ-PVSIAIAAYSTEFQSYKEGI-FNGVCGTQLDHAVTIVGFGTTEDGANY 311
E+AL +AV+ + PVS+AI A + F Y+ G+ + C ++H V +VG+G +G +Y
Sbjct: 235 EEALKEAVASKGPVSVAIDASHSSFFLYRSGVYYEPACTQVVNHGVLVVGYGNL-NGNDY 293
Query: 312 WLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYP 346
WL+KNSWG +GD GY+++ R+ E CGI + SSYP
Sbjct: 294 WLVKNSWGLYFGDKGYIRMARNRENHCGIASYSSYP 329
>gi|305434756|gb|ADM53740.1| cathepsin L1 precursor [Lepeophtheirus salmonis]
Length = 325
Score = 242 bits (617), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 133/307 (43%), Positives = 190/307 (61%), Gaps = 15/307 (4%)
Query: 50 WMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEFRA 106
W HG++Y +E+R+KIF+EN I+K N E G TY L NQ+ DL EF
Sbjct: 24 WTKLHGKTYTSFEIEELRVKIFEENRIKIQKHNAEAQNGLHTYSLEMNQYGDLLQSEFLQ 83
Query: 107 LYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAV 166
YTG S S +T + VP+ ++W GAVT +K+QK+CG CWAF+
Sbjct: 84 GYTGLAKGSYSGDNTVILD------NSAPVPSYVNWTKNGAVTAVKDQKDCGSCWAFSTT 137
Query: 167 AAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEYPY 225
+VEG I++ L+ SEQQL+DCS++ N GC GG + AF Y+I N+GIATED YPY
Sbjct: 138 GSVEGQYFIKNKKLLSFSEQQLVDCSSDFRNEGCNGGWMDNAFKYLIANKGIATEDTYPY 197
Query: 226 QAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVS-MQPVSIAIAAYSTEFQSYKEGI 284
A G C + AA +IS++++V G E L AV+ + P+S+AI A S +FQ YK+G+
Sbjct: 198 TATDGVCVYNKTMAAGRISSFKDVKHGSEDQLKLAVAQIGPISVAIDASSGDFQFYKKGV 257
Query: 285 F-NGVCGTQ-LDHAVTIVGFGTTED-GANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIG 340
+ + C ++ LDH V VG+GT + G +YWL+KNSW +WGD GY+K+ R+ + +CGI
Sbjct: 258 YVDEECSSKYLDHGVLAVGYGTDKGTGLDYWLVKNSWSASWGDQGYIKMARNHKNMCGIA 317
Query: 341 TRSSYPL 347
+ +SYP+
Sbjct: 318 SLASYPV 324
>gi|332376957|gb|AEE63618.1| unknown [Dendroctonus ponderosae]
Length = 318
Score = 242 bits (617), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 138/335 (41%), Positives = 195/335 (58%), Gaps = 27/335 (8%)
Query: 19 FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
FI+ +LL+ + + + QS + +H +SY +++E+ RL IF ENL I
Sbjct: 4 FILASLLIVAVGASLENVGSTFQS-------FKLKHSKSYSNQVEEAKRLAIFTENLRDI 56
Query: 79 EKANK---EGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD 135
E+ N G +Y NQF+DLT DEF+A T + P T +T Y +
Sbjct: 57 EEHNALYAAGLVSYNKSVNQFTDLTIDEFKAYLTLHSKP-------TLNTVPYVRTGL-Q 108
Query: 136 VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG 195
VPT+LDWR +G VT +K+Q +CG CWAF+ V + EG +G L+ LSEQQL+DC+TN
Sbjct: 109 VPTTLDWRSQGYVTGVKDQGDCGSCWAFSVVGSTEGAYYKSTGKLVSLSEQQLIDCTTNV 168
Query: 196 NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQ 255
N+GC GG E+ F Y +Q G+ +E YPY G C ++ K+S Y V G E
Sbjct: 169 NDGCDGGYLEETFPY-VQQTGLVSESSYPYTGRDGNCRISESDVVTKVSKY--VLLGGEA 225
Query: 256 ALLKAV-SMQPVSIAIAAYSTEFQSYKEGIF-NGVCGT-QLDHAVTIVGFGTTEDGANYW 312
LL+AV S+ PVS+A+ A T SY G++ + +C L+H V +VG+G T+DG +YW
Sbjct: 226 DLLEAVGSVGPVSVAMDA--TYIYSYASGVYESSLCSLYSLNHGVLVVGYG-TQDGKDYW 282
Query: 313 LIKNSWGNTWGDAGYMKIVRDEGLCGIGTRSSYPL 347
LIKNSWGNTWG+ GY+K++R CGI YP+
Sbjct: 283 LIKNSWGNTWGEQGYLKLLRGTNECGIAEDDVYPI 317
>gi|405966498|gb|EKC31776.1| Cathepsin L [Crassostrea gigas]
Length = 330
Score = 242 bits (617), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 133/313 (42%), Positives = 188/313 (60%), Gaps = 17/313 (5%)
Query: 45 EIHEKW---MAQHGRSYKDELEKEMRLKIFKENLEYIEKAN---KEGNRTYKLGTNQFSD 98
E+ +W + HG+ Y E E R+ I++ NL+YIEK N G+ ++ LG N++ D
Sbjct: 22 ELDSEWQLYLKAHGKQYGAEEEARRRV-IWEGNLDYIEKHNLAADRGDYSFWLGMNEYGD 80
Query: 99 LTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECG 158
+TN+EFR+ GYKM T+ + ++ D+P ++DWR KG VTPIKNQ +CG
Sbjct: 81 MTNEEFRSTMNGYKM----RNGTSRGSLYLPPSNIGDLPDTVDWRPKGYVTPIKNQGQCG 136
Query: 159 CCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-TNGNNGCLGGSREKAFAYIIQNQGI 217
CW+F+A ++EG T ++G L LSEQ L+DCS GN+GC GG + AF YI N GI
Sbjct: 137 SCWSFSATGSLEGQTFKKTGKLPSLSEQNLVDCSQKQGNHGCQGGLMDDAFQYIKDNSGI 196
Query: 218 ATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTE 276
TE YPY+A G C A S + ++ S E L AV ++ P+S+AI A
Sbjct: 197 DTESSYPYEAKNGKCRFNAANVGATDSGFTDIKSKSESDLQSAVATVGPISVAIDASHMS 256
Query: 277 FQSYKEGIFNG-VCG-TQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE 334
FQ Y+ G+++ C T+LDH V VG+G TE G +YWL+KNSWG +WG GY+ + R++
Sbjct: 257 FQLYRSGVYHEFFCSETRLDHGVLAVGYG-TESGKDYWLVKNSWGESWGQKGYIMMSRNK 315
Query: 335 -GLCGIGTRSSYP 346
CGI T +SYP
Sbjct: 316 RNNCGIATSASYP 328
>gi|389610697|dbj|BAM18960.1| cathepsin L [Papilio polytes]
Length = 341
Score = 242 bits (617), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 131/344 (38%), Positives = 198/344 (57%), Gaps = 23/344 (6%)
Query: 20 IIITLLVSCASQVVSSRSTHEQSVVEIHEKWMA---QHGRSYKDELEKEMRLKIFKENLE 76
+++ + V A+ VS + E+W A +H + Y E+E + R+KI+ EN
Sbjct: 4 LVVLMCVVAAASAVSFFDL-------VKEEWNAFKMEHQKQYDSEVEDKFRMKIYAENKH 56
Query: 77 YIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
I K N++ G +++ N++ D+ + EF G+ + + + + + +
Sbjct: 57 KIAKHNQKFARGQVPFRVKQNKYGDMLHHEFVHTMNGFNKTTKNGKGLFGKSAGERGATF 116
Query: 134 -----TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQL 188
VP +DWR GAVT +K+Q +CG CW+F+A A+EG ++ L+ LSEQ L
Sbjct: 117 IPPANVRVPDHVDWRKHGAVTEVKDQGKCGSCWSFSATGALEGQHYRQTNILVSLSEQNL 176
Query: 189 LDCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYE 247
+DCST GNNGC GG + AF YI N+GI TE YPY+AV C + + A +
Sbjct: 177 IDCSTAYGNNGCNGGLMDNAFKYIKDNKGIDTEKSYPYEAVDDKCRYNPRNSGADDVGFI 236
Query: 248 EVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGI-FNGVCG-TQLDHAVTIVGFGT 304
++PSGDE L+ AV ++ PVS+AI A FQ Y +G+ F+ C T LDH V +VG+GT
Sbjct: 237 DIPSGDEGKLMAAVATVGPVSVAIDASQETFQFYSDGVYFDENCSSTSLDHGVLVVGYGT 296
Query: 305 TEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
E+G +YWL+KNSWG +WGD GY+K+ R+ + CGI T +S+PL
Sbjct: 297 DENGGDYWLVKNSWGRSWGDLGYIKMARNRDNHCGIATAASFPL 340
>gi|163658591|gb|ABY28387.1| cathepsin L [Gnathostoma spinigerum]
Length = 398
Score = 241 bits (616), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 123/310 (39%), Positives = 190/310 (61%), Gaps = 12/310 (3%)
Query: 48 EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNR---TYKLGTNQFSDLTNDEF 104
E + +HG+++ D + + F +NLEYI++ N++ R T+++G N +DL DE+
Sbjct: 92 EDFKLEHGKAFDDVENEYDHIFAFTKNLEYIKQHNEKFQRGEVTFEMGVNHLTDLPFDEY 151
Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFA 164
+ L G++ + R STF + +P ++DWR+ VT +K+Q +CG CWAF+
Sbjct: 152 KKL-NGFRKNNDDSRPRNGSTFLRPHF--VQIPDTVDWRNSSYVTVVKDQGQCGSCWAFS 208
Query: 165 AVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEY 223
A A+EG ++ L+ LSEQ L+DCS GNNGC GG + AF YI N GI TE+ Y
Sbjct: 209 ATGALEGQHMRKTHQLVSLSEQNLVDCSRKYGNNGCNGGLMDNAFEYIKDNHGIDTEESY 268
Query: 224 PYQAVPG-TCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYK 281
PY+ V G C +K A+ Y ++P GDE+AL AV ++ P+S+AI A FQ+Y+
Sbjct: 269 PYKGVEGKKCHFRRKFVGAEDYGYTDLPEGDEEALKVAVATIGPISVAIDAGHISFQNYR 328
Query: 282 EGIF--NGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE-GLCG 338
+GI+ N LDH V +VG+GT E+ +YW++KNSWG WG+ GY+++ R++ CG
Sbjct: 329 KGIYTENECSPEDLDHGVLVVGYGTDENAGDYWIVKNSWGTRWGEHGYIRMARNKRNQCG 388
Query: 339 IGTRSSYPLA 348
I +++SYP+
Sbjct: 389 IASKASYPIV 398
>gi|405958751|gb|EKC24845.1| Cathepsin L [Crassostrea gigas]
Length = 330
Score = 241 bits (616), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 133/313 (42%), Positives = 188/313 (60%), Gaps = 17/313 (5%)
Query: 45 EIHEKW---MAQHGRSYKDELEKEMRLKIFKENLEYIEKAN---KEGNRTYKLGTNQFSD 98
E+ +W + HG+ Y E E R+ I++ NL+YIEK N G+ ++ LG N++ D
Sbjct: 22 ELDSEWQLYLKAHGKQYGAEEEARRRV-IWEGNLDYIEKHNLAADRGDYSFWLGMNEYGD 80
Query: 99 LTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECG 158
+TN+EFR+ GYKM T+ + ++ D+P ++DWR KG VTPIKNQ +CG
Sbjct: 81 MTNEEFRSTMNGYKM----RNGTSRGSLYLPPSNIGDLPDTVDWRPKGYVTPIKNQGQCG 136
Query: 159 CCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-TNGNNGCLGGSREKAFAYIIQNQGI 217
CW+F+A ++EG T ++G L LSEQ L+DCS GN+GC GG + AF YI N GI
Sbjct: 137 SCWSFSATGSLEGQTFKKTGKLPSLSEQNLVDCSQKQGNHGCQGGLMDDAFQYIKDNNGI 196
Query: 218 ATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTE 276
TE YPY+A G C A S + ++ S E L AV ++ P+++AI A
Sbjct: 197 DTESSYPYEAKNGKCRFNAANVGATDSGFTDIKSKSESDLQSAVATVGPIAVAIDASHMS 256
Query: 277 FQSYKEGIFNG-VCG-TQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE 334
FQ YK G+++ C T+LDH V VG+G TE G +YWL+KNSWG +WG GY+ + R++
Sbjct: 257 FQLYKSGVYHEFFCSETRLDHGVLAVGYG-TESGKDYWLVKNSWGESWGQKGYIMMSRNK 315
Query: 335 -GLCGIGTRSSYP 346
CGI T +SYP
Sbjct: 316 RNNCGIATSASYP 328
>gi|432114311|gb|ELK36239.1| Cathepsin S [Myotis davidii]
Length = 340
Score = 241 bits (616), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 134/338 (39%), Positives = 200/338 (59%), Gaps = 18/338 (5%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIH-EKWMAQHGRSYKDELEKEMRLKIFKENLE 76
M ++ +L+ C+S + H+ ++ H + W +G+ Y +E E+ R I+++NL+
Sbjct: 10 MKWLLLVLLGCSSAMAQ---LHKDPTLDHHWDLWKKTYGKQYTEENEEVTRRFIWEKNLK 66
Query: 77 YIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
Y+ N E G +Y LG N +D+T++E L + ++PS R+ T + Q L
Sbjct: 67 YVMLHNLEHSMGMHSYDLGMNHLADMTSEEVMLLMSSLRVPSQWQRNVTFKSNPNQKL-- 124
Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST 193
P S+DWRDKG VT +K Q CG CWAF+AV A+E K+++G L+ LS Q L+DCST
Sbjct: 125 ---PDSMDWRDKGCVTEVKYQGSCGSCWAFSAVGALEAQLKLKTGKLVSLSVQNLVDCST 181
Query: 194 N--GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPS 251
N GC GG +AF YII N GI +E YPY+A+ G C K AA S Y E+P
Sbjct: 182 GKYSNKGCNGGFMTEAFQYIIDNNGIDSEASYPYKAMDGKCQYDVKNRAATCSKYVELPF 241
Query: 252 GDEQALLKAVSMQ-PVSIAIAAYSTEFQSYKEGI-FNGVCGTQLDHAVTIVGFGTTEDGA 309
G+E+AL +AV+ + PVS+AI A F Y+ G+ ++ C ++H V VG+G +G
Sbjct: 242 GNEEALKEAVANKGPVSVAIDASHPSFFLYRSGVYYDKACTLNVNHGVLAVGYGNY-NGK 300
Query: 310 NYWLIKNSWGNTWGDAGYMKIVRDEG-LCGIGTRSSYP 346
+YWL+KNSWG +G+ GY+++ R+ G CGI + SYP
Sbjct: 301 DYWLVKNSWGLHFGEQGYIRMARNSGNHCGIASYPSYP 338
>gi|328776427|ref|XP_625135.3| PREDICTED: cathepsin L-like [Apis mellifera]
Length = 351
Score = 241 bits (616), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 133/328 (40%), Positives = 196/328 (59%), Gaps = 20/328 (6%)
Query: 38 THEQSVVE-IHEKWMA---QHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYK 90
TH S E ++++WM +H + YK ++E+ R+KIF +N I K N +YK
Sbjct: 21 THAVSFFELVNQEWMTFKMEHKKVYKSDVEERFRMKIFMDNKHKIAKHNSNYEMKKVSYK 80
Query: 91 LGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSSTF-KYQNLSMTDVPTSLDWRDK 145
L N++ D+ + EF + G+ S R ++F + N+ + P +DWR +
Sbjct: 81 LKMNKYGDMLHHEFVNILNGFNKSINTQLRSERLPVGASFIEPANVVL---PKKVDWRKE 137
Query: 146 GAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSR 204
GAVTP+K+Q CG CW+F+A A+EG R+G L+ LSEQ L+DCS GNNGC GG
Sbjct: 138 GAVTPVKDQGHCGSCWSFSATGALEGQHFRRTGVLVSLSEQNLIDCSGKYGNNGCNGGLM 197
Query: 205 EKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SM 263
++AF YI N+G+ TE YPY+A C + A Y ++P+GDE+ L AV ++
Sbjct: 198 DQAFQYIKDNKGLDTEASYPYEAENDKCRYNPANSGAIDVGYIDIPTGDEKLLKAAVATI 257
Query: 264 QPVSIAIAAYSTEFQSYKEGI-FNGVCGT-QLDHAVTIVGFGTTEDGANYWLIKNSWGNT 321
PVS+AI A FQ Y EG+ + C + +LDH V ++G+GT E+G +YWL+KNSWG T
Sbjct: 258 GPVSVAIDASHQSFQFYSEGVYYEPECSSEELDHGVLVIGYGTNENGQDYWLVKNSWGET 317
Query: 322 WGDAGYMKIVRDE-GLCGIGTRSSYPLA 348
WG+ GY+K+ R++ CGI + +SYPL
Sbjct: 318 WGNNGYIKMARNKLNHCGIASSASYPLV 345
>gi|383849553|ref|XP_003700409.1| PREDICTED: cathepsin L-like [Megachile rotundata]
Length = 343
Score = 241 bits (616), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 132/349 (37%), Positives = 207/349 (59%), Gaps = 33/349 (9%)
Query: 20 IIITLLVSCASQVVSSRSTHEQSVVEIHEKWM---AQHGRSYKDELEKEMRLKIFKENLE 76
I++ ++++CA+ V + S E ++++W+ +H + YK E E+ +R+KI+ +N
Sbjct: 4 ILLLIVITCAA--VQAISFFEL----VNQEWINFKMEHKKCYKHEAEERLRMKIYMKNKL 57
Query: 77 YIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
I + N + TY+L N++ D+ N EF+ + GY T + T + + L +
Sbjct: 58 QIAQHNCDYELKKVTYRLKINKYGDMLNHEFKNMLNGYN-------RTINHTLRNERLPV 110
Query: 134 ---------TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLS 184
++P +DWR GAVT +K+Q CG CWAF+A ++EG R+G L+ LS
Sbjct: 111 GAAFIEPCNVELPKMVDWRKCGAVTEVKDQGHCGSCWAFSATGSLEGQHFRRTGVLVSLS 170
Query: 185 EQQLLDCS-TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKI 243
EQ L+DCS + GNNGC GG ++AF+YI N+G+ TE YPY+ C ++ + A
Sbjct: 171 EQNLIDCSGSYGNNGCNGGLMDQAFSYIKDNKGLDTEKTYPYEGEDDKCRYDKRSSGASD 230
Query: 244 SNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGI-FNGVC-GTQLDHAVTIV 300
+ ++P GDEQ L AV ++ PVS+AI A FQ Y +GI F C T LDH V +V
Sbjct: 231 VGFVDIPVGDEQKLKAAVATVGPVSVAIDASHQSFQFYSDGIYFEPECSSTNLDHGVLVV 290
Query: 301 GFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPLA 348
G+GT E+G +YW++KNSWG +WG+ GY+K+ R+ + CGI + +SYP+
Sbjct: 291 GYGTDEEGRDYWIVKNSWGESWGEKGYIKMARNIDNHCGIASSASYPIV 339
>gi|37786769|gb|AAO64471.1| cathepsin L precursor [Fundulus heteroclitus]
Length = 337
Score = 241 bits (616), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 136/341 (39%), Positives = 198/341 (58%), Gaps = 17/341 (4%)
Query: 19 FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
+ + +L C S V S + + + + W + H ++Y E RL ++++NL+ I
Sbjct: 1 MLPVAVLTLCLSSAVLSAPSLDPQLDQHWNLWKSWHSKNYHQREEGWRRL-VWEKNLKKI 59
Query: 79 EKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD 135
E N E G +Y+LG N F D+T++EF+ + GYK + R S F N +
Sbjct: 60 ELHNLEHSMGKHSYRLGMNHFGDMTHEEFKQIMNGYK--HKAERKFKGSLFLEPNF--LE 115
Query: 136 VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-TN 194
P S+DWR+KG VTP+K+Q ECG CWAF+ A+EG R+G L+ LS Q L++CS
Sbjct: 116 APRSVDWREKGYVTPVKDQGECGSCWAFSTTGALEGQEFTRTGKLVSLSGQNLVECSRPE 175
Query: 195 GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPG-TCSAAQKPAAAKISNYEEVPSGD 253
GN GC GG ++AF Y+ NQG+ +ED YPY C K +AA + + ++PSG+
Sbjct: 176 GNEGCNGGLMDQAFQYVKDNQGLDSEDSYPYLGTDDQPCHYDPKFSAANDTGFVDIPSGN 235
Query: 254 EQALLKAV-SMQPVSIAIAAYSTEFQSYKEGI-FNGVCGT-QLDHAVTIVGF---GTTED 307
E+AL+KAV S+ PVS+AI A FQ Y+ GI + C + +LDH V VG+ G D
Sbjct: 236 ERALMKAVASVGPVSVAIDAGHESFQFYQSGIYYEKECSSEELDHGVLAVGYGFQGEDVD 295
Query: 308 GANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
G +W++KNSW WGD GY+ + +D + CGI T +SYPL
Sbjct: 296 GKKFWIVKNSWSENWGDKGYIYMAKDRKNHCGIATAASYPL 336
>gi|3850787|emb|CAA05360.1| cathepsin S [Mus musculus]
Length = 330
Score = 241 bits (616), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 131/320 (40%), Positives = 194/320 (60%), Gaps = 16/320 (5%)
Query: 37 STHEQSVVEIH-EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLG 92
+T E+ ++ H + W H + YKD+ E+E+R I+++NL++I N E G TY++G
Sbjct: 15 ATAERPTLDHHWDLWKKTHEKEYKDKNEEEVRRLIWEKNLKFIMIHNLEYSMGMHTYQVG 74
Query: 93 TNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIK 152
N D+TN+E ++P S ++ T +++ S +P ++DWR+KG VT +K
Sbjct: 75 MNDMGDMTNEEILCRMGALRIPRQSPKTVT-----FRSYSNRTLPDTVDWREKGCVTEVK 129
Query: 153 NQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN---GNNGCLGGSREKAFA 209
Q CG CWAF+AV A+EG K+++G LI LS Q L+DCS GN GC GG +AF
Sbjct: 130 YQGSCGACWAFSAVGALEGQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEAFQ 189
Query: 210 YIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQ-PVSI 268
YII N GI + YPY+A+ C K AA S Y ++P GDE AL +AV+ + PVS+
Sbjct: 190 YIIDNGGIEADASYPYKAMDEKCHYNSKNRAATCSRYIQLPFGDEDALKEAVATKGPVSV 249
Query: 269 AIAAYSTEFQSYKEGIFNG-VCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGY 327
I A + F YK G+++ C ++H V +VG+GT DG +YWL+KNSWG +GD GY
Sbjct: 250 GIDASHSSFFFYKSGVYDDPSCTGNVNHGVLVVGYGTL-DGKDYWLVKNSWGLNFGDQGY 308
Query: 328 MKIVR-DEGLCGIGTRSSYP 346
+++ R ++ CGI + SYP
Sbjct: 309 IRMARNNKNHCGIASYCSYP 328
>gi|312306194|gb|ADQ73946.1| cathepsin L [Paralithodes camtschaticus]
Length = 324
Score = 241 bits (616), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 129/308 (41%), Positives = 185/308 (60%), Gaps = 15/308 (4%)
Query: 48 EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEF 104
++ Q+GR Y E+ R ++ +N+E+IE N++ G TY L NQF D+TN+E
Sbjct: 23 HQFKVQYGRQYATAQEERYRSSVYDQNMEFIEAHNEQYTNGEVTYMLAINQFGDMTNEEI 82
Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFA 164
A+ G +P+ R + L P +DWR KGAVTP+K+QK CG CWAF+
Sbjct: 83 NAVMNGL-LPASESRGVAVLGGRDDTL-----PAEVDWRTKGAVTPVKDQKACGSCWAFS 136
Query: 165 AVAAVEGITKIRSGNLIQLSEQQLLDCST-NGNNGCLGGSREKAFAYIIQNQGIATEDEY 223
A ++EG ++ G L+ LSEQ L+DCST G++GC GG + AF YI N GI TE Y
Sbjct: 137 ATGSLEGQHFLKDGKLVSLSEQNLVDCSTKQGDHGCGGGLMDFAFTYIKDNGGIDTEASY 196
Query: 224 PYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKE 282
PY+A G C + A ++ Y +V E AL KAV ++ P+S+AI A + F Y +
Sbjct: 197 PYEATDGKCQYNPANSGATVTGYVDVEHDSEDALQKAVATIGPISVAIDASRSTFHFYHK 256
Query: 283 GI-FNGVC-GTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE-GLCGI 339
G+ ++ C T LDH V VG+G T+DG +YWL+KNSW TWG+ G++++ R+ CGI
Sbjct: 257 GVYYDKECSSTSLDHGVLAVGYG-TQDGTDYWLVKNSWNITWGNHGFIEMSRNRNNNCGI 315
Query: 340 GTRSSYPL 347
T++SYPL
Sbjct: 316 ATQASYPL 323
>gi|66810271|ref|XP_638859.1| cysteine proteinase 3 [Dictyostelium discoideum AX4]
gi|166201983|sp|Q23894.2|CYSP3_DICDI RecName: Full=Cysteine proteinase 3; AltName: Full=Cysteine
proteinase II; Flags: Precursor
gi|60467526|gb|EAL65548.1| cysteine proteinase 3 [Dictyostelium discoideum AX4]
Length = 337
Score = 241 bits (616), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 132/344 (38%), Positives = 210/344 (61%), Gaps = 13/344 (3%)
Query: 11 FKINTTPMFIIITLLVSCASQ-VVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+++ T +F +I L +S S V S ++ S ++ WM + ++Y + E R +
Sbjct: 1 MRLSITLIFTLIVLSISFISAGNVFSHKQYQDSFID----WMRSNNKAYTHK-EFMPRYE 55
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQ 129
FK+N++Y+ N +G++T LG NQ +DL+N+E+R Y G + + +
Sbjct: 56 EFKKNMDYVHNWNSKGSKTV-LGLNQHADLSNEEYRLNYLGTRAHIKLNGYHKRNLGLRL 114
Query: 130 NLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLL 189
N P ++DWR+K AVTP+K+Q +CG C++F+ +VEG+T I++G L+ LSEQ +L
Sbjct: 115 NRPQFKQPLNVDWREKDAVTPVKDQGQCGSCYSFSTTGSVEGVTAIKTGKLVSLSEQNIL 174
Query: 190 DCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQA-VPGTCSAAQKPAAAKISNYE 247
DCS++ GN GC GG AF YII+N G+ +E++YPY+ V C + AAKI++Y+
Sbjct: 175 DCSSSFGNEGCNGGLMTNAFEYIIKNNGLNSEEQYPYEMKVNDECKFQEGSVAAKITSYK 234
Query: 248 EVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGI-FNGVCGTQ-LDHAVTIVGFGTT 305
E+ +GDE L A+ + PVS+AI A FQ Y G+ + C ++ LDH V VG G T
Sbjct: 235 EIEAGDENDLQNALLLNPVSVAIDASHNSFQLYTAGVYYEPACSSEDLDHGVLAVGMG-T 293
Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPLA 348
++G +Y+++KNSWG +WG GY+ + R+ + CGI T +SYP+A
Sbjct: 294 DNGEDYYIVKNSWGPSWGLNGYIHMARNKDNNCGISTMASYPIA 337
>gi|113120271|gb|ABI30275.1| VS-A [Vasconcellea stipulata]
Length = 318
Score = 241 bits (615), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 131/287 (45%), Positives = 174/287 (60%), Gaps = 18/287 (6%)
Query: 38 THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
T + ++ + + WM ++ + YKD EK R +IFK+NL+YI++ NK+ N TY LG F+
Sbjct: 39 TSTEKLINLFDSWMVEYDKVYKDIDEKIYRFEIFKDNLKYIDETNKK-NNTYWLGLTSFT 97
Query: 98 DLTNDEFRALYTGYKMPSPSHRSTT----SSTFKYQNLSMTDVPTSLDWRDKGAVTPIKN 153
DLTNDEF+ Y G P + STT F Y ++ ++P S+DWR KGAVTP++N
Sbjct: 98 DLTNDEFKEKYVG---SIPENWSTTEEPNDKEFIYDDV--VNIPASIDWRQKGAVTPVRN 152
Query: 154 QKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQ 213
Q CG CW F++VAAVEGI KI +G L+ LSEQ+LLDC + GC GG A Y +
Sbjct: 153 QGSCGSCWTFSSVAAVEGINKIVTGQLVSLSEQELLDCERR-SYGCRGGFPPYALQY-VA 210
Query: 214 NQGIATEDEYPYQAVPGTCSAAQ-KPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAA 272
N GI YPY+ V C AAQ K K V +EQAL++ +++QPVSI + A
Sbjct: 211 NSGIHLRQYYPYEGVQRQCRAAQAKGPKVKTDGVGRVQRNNEQALIQRIAIQPVSIVVEA 270
Query: 273 YSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWG 319
FQ+Y+ GIF G CGT +DHAV VG+G Y LIKNSWG
Sbjct: 271 KGRAFQNYRGGIFAGPCGTSIDHAVAAVGYGN-----GYILIKNSWG 312
>gi|52076128|dbj|BAD46641.1| putative cysteine proteinase [Oryza sativa Japonica Group]
gi|52076135|dbj|BAD46648.1| putative cysteine proteinase [Oryza sativa Japonica Group]
Length = 374
Score = 241 bits (615), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 139/330 (42%), Positives = 192/330 (58%), Gaps = 27/330 (8%)
Query: 40 EQSVVEIHEKWMAQHG---RSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQF 96
E+S+ ++++W +G S +D +K R ++FK+N YI N++ +YKLG N+F
Sbjct: 36 EESMWSLYQRWRHVYGAASSSPRDLADKGSRFEVFKKNARYIHDFNRKKGMSYKLGLNKF 95
Query: 97 SDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKE 156
+DLT +EF A YTG P P + D P + DWR+ GAVT +K+Q
Sbjct: 96 ADLTLEEFTAKYTGAN-PGPITGLKNGTGSPPLAAVAGDAPPAWDWREHGAVTRVKDQGP 154
Query: 157 CGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQG 216
CG CWAF+ V AVEGI I +GNL+ LSEQQ+LDCS G+ C GG AF Y + N G
Sbjct: 155 CGSCWAFSVVEAVEGINAIMTGNLLTLSEQQVLDCSGAGD--CSGGYTSYAFDYAVSN-G 211
Query: 217 IATED------------EYP-YQAVPGTCS-AAQKPAAAKISNYEEVPSGDEQALLKAVS 262
I + YP Y+AV C K KI +Y V DE+AL +AV
Sbjct: 212 ITLDQCFSPPTTGENYFYYPAYEAVQEPCRFDPNKAPIVKIDSYSFVDPNDEEALKQAVY 271
Query: 263 MQ-PVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNT 321
Q PVS+ I A S EF Y+ G+F+G CGT+L+HAV +VG+ TEDG YW++KNSWG
Sbjct: 272 SQGPVSVLIEA-SYEFMIYQGGVFSGPCGTELNHAVLVVGYDETEDGTPYWIVKNSWGAG 330
Query: 322 WGDAGYMKIVRD----EGLCGIGTRSSYPL 347
WG++GY++++R+ EG+CGI YP+
Sbjct: 331 WGESGYIRMIRNIPAPEGICGIAMYPIYPI 360
>gi|307192137|gb|EFN75465.1| Cathepsin L [Harpegnathos saltator]
Length = 339
Score = 241 bits (615), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 131/338 (38%), Positives = 196/338 (57%), Gaps = 14/338 (4%)
Query: 20 IIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIE 79
I + L + A+Q +S + V E + H ++Y ++E+ R+KIF EN I
Sbjct: 5 IFLLLGILAAAQAISFFNL----VTEEWNTFKVTHRKAYDSKIEESFRMKIFMENWHKIA 60
Query: 80 KANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTF--KYQNLSMT 134
N++ +YKLG N++ D+ + EF G+ + ++ +
Sbjct: 61 LHNQKYELNEVSYKLGMNKYGDMLHHEFINTLNGFNKSVSAQLRAQRRPIGSRFIEPANV 120
Query: 135 DVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN 194
++P+S+DWR GAVTPIK+Q CG CW+F+A A+EG +G L+ LSEQ L+DCS
Sbjct: 121 EIPSSVDWRTHGAVTPIKDQGHCGSCWSFSATGALEGQHYRITGKLVSLSEQNLIDCSGR 180
Query: 195 -GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGD 253
GNNGC GG ++AF YI N G+ TE YPY+A C + A S Y ++P G+
Sbjct: 181 YGNNGCNGGLMDQAFQYIKDNHGLDTEISYPYEAENDKCRYNPRNNGATDSGYVDIPEGN 240
Query: 254 EQALLKAV-SMQPVSIAIAAYSTEFQSYKEGI-FNGVCGTQ-LDHAVTIVGFGTTEDGAN 310
E+ L AV ++ PVS+AI A + FQ Y+EG+ + C ++ LDH V +VG+GT ++ +
Sbjct: 241 EKKLKAAVATIGPVSVAIDASAESFQFYREGVYYEPRCSSENLDHGVLVVGYGTDDNDQD 300
Query: 311 YWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
YWL+KNSWG TWGD GY+K+ R+ + CGI + +SYPL
Sbjct: 301 YWLVKNSWGVTWGDEGYIKMARNKDNHCGIASSASYPL 338
>gi|261289783|ref|XP_002611753.1| hypothetical protein BRAFLDRAFT_236364 [Branchiostoma floridae]
gi|229297125|gb|EEN67763.1| hypothetical protein BRAFLDRAFT_236364 [Branchiostoma floridae]
Length = 307
Score = 241 bits (615), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 129/294 (43%), Positives = 180/294 (61%), Gaps = 11/294 (3%)
Query: 63 EKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEFRA-LYTGYKMPSPSH 118
E+ R++IF+ N + I N E G TY LG NQF+ +TNDEF A + G + +
Sbjct: 15 EESRRMEIFENNTKLINLHNNEADLGMHTYWLGHNQFAHMTNDEFVANVIGGCLLDRNAS 74
Query: 119 RSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSG 178
+ST +Y + ++ ++P ++DWR KG VTP+KNQ++CG CWAF+ ++EG T ++G
Sbjct: 75 KSTADRVHQYDS-NLVELPDTVDWRTKGYVTPVKNQEQCGSCWAFSTTGSLEGQTFKKTG 133
Query: 179 NLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQK 237
L+ LSEQ L+DCS GN GC GG + AF YI N GI TED YPY+A G C
Sbjct: 134 KLVSLSEQNLVDCSGEFGNQGCNGGLMDDAFKYIKANGGIDTEDSYPYEARDGKCRFKPA 193
Query: 238 PAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGIF--NGVCGTQLD 294
A ++ Y ++ GDE AL +AV ++ P+S+AI A FQ Y G++ T+LD
Sbjct: 194 DVGATVTGYTDISEGDEGALTQAVATVGPISVAIDASHHTFQMYSHGVYYEPQCSSTELD 253
Query: 295 HAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE-GLCGIGTRSSYPL 347
H V VG+G TE G +YWL+KNSWG WG GY+ + R++ CGI T +SYPL
Sbjct: 254 HGVLAVGYG-TEGGKDYWLVKNSWGEVWGQNGYIMMSRNKNNQCGIATSASYPL 306
>gi|340381055|ref|XP_003389037.1| PREDICTED: cathepsin L1-like [Amphimedon queenslandica]
Length = 329
Score = 241 bits (615), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 135/338 (39%), Positives = 194/338 (57%), Gaps = 24/338 (7%)
Query: 22 ITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKEN----LEY 77
+ LL++ A+ +V + + + E+ W +G+ Y E E+ R I++ N LE+
Sbjct: 1 MKLLIAVAALIVCATAFEYTAEWEL---WKRTNGKDYSSEKEELYRQTIWEANKKIVLEH 57
Query: 78 IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
A+K G + L N F+DL + EF A+Y GY+ + +T +Y + +P
Sbjct: 58 NANADKWG---WTLEMNAFADLESSEFAAMYNGYRRSARKSNAT-----RYHVPTGNALP 109
Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GN 196
++DWR KGAVTP+KNQK+CG CWAF+ ++EG T ++ G L LSEQQL+DCS GN
Sbjct: 110 DTVDWRTKGAVTPVKNQKQCGSCWAFSTTGSLEGQTFLKKGTLPSLSEQQLVDCSDKYGN 169
Query: 197 NGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQA 256
+GC GG + AF YI N GI +E YPY+A G C Q AA + Y+++P D
Sbjct: 170 HGCQGGLMDNAFKYIEANGGIDSEASYPYEAKNGKCRFQQSAVAATCTGYKDIPHDDIDG 229
Query: 257 LLKAVS-MQPVSIAIAAYSTEFQSYKEGIFNGVC--GTQLDHAVTIVGFGTTEDG----- 308
L AV+ + P+S+A+ A + FQ Y G+++ + T+LDH V VG+GT G
Sbjct: 230 LQDAVANVGPISVAMDASHSSFQLYAAGVYDPLLCSSTRLDHGVLAVGYGTEPSGLFHEE 289
Query: 309 ANYWLIKNSWGNTWGDAGYMKIVRDEGLCGIGTRSSYP 346
YWL+KNSWG WG GY KIVR + CGI T +SYP
Sbjct: 290 KPYWLVKNSWGPDWGQQGYFKIVRKDNKCGIATDASYP 327
>gi|440906716|gb|ELR56945.1| Cathepsin S, partial [Bos grunniens mutus]
Length = 342
Score = 241 bits (614), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 135/338 (39%), Positives = 200/338 (59%), Gaps = 18/338 (5%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIH-EKWMAQHGRSYKDELEKEMRLKIFKENLE 76
M ++ L+ C+S + H ++ H + W +G+ YK++ E+ R I+++NL+
Sbjct: 12 MNWLVWALLLCSSAMAQ---VHRDPTLDHHWDLWKKTYGKQYKEKNEEVARRLIWEKNLK 68
Query: 77 YIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
+ N E G +Y+LG N D+T++E +L + ++PS R+ T + Q L
Sbjct: 69 TVTLHNLEHSMGMHSYELGMNHLGDMTSEEVISLMSSLRVPSQWPRNVTYKSDPNQKL-- 126
Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST 193
P S+DWR+KG VT +K Q CG CWAF+AV A+E K+++G L+ LS Q L+DCST
Sbjct: 127 ---PDSMDWREKGCVTEVKYQGACGSCWAFSAVGALEAQVKLKTGKLVSLSAQNLVDCST 183
Query: 194 N--GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPS 251
GN GC GG +AF YII N GI +E YPY+A+ G C K AA S Y E+P
Sbjct: 184 AKYGNKGCNGGFMTEAFQYIIDNNGIDSEASYPYKAMDGKCQYDVKNRAATCSRYIELPF 243
Query: 252 GDEQALLKAVSMQ-PVSIAIAAYSTEFQSYKEGI-FNGVCGTQLDHAVTIVGFGTTEDGA 309
G E+AL +AV+ + PVS+ I A + F YK G+ ++ C ++H V +VG+G DG
Sbjct: 244 GSEEALKEAVANKGPVSVGIDASHSSFFLYKTGVYYDPSCTQNVNHGVLVVGYGNL-DGK 302
Query: 310 NYWLIKNSWGNTWGDAGYMKIVRDEG-LCGIGTRSSYP 346
+YWL+KNSWG +GD GY+++ R+ G CGI + SYP
Sbjct: 303 DYWLVKNSWGLHFGDQGYIRMARNSGNHCGIASYPSYP 340
>gi|428186189|gb|EKX55040.1| hypothetical protein GUITHDRAFT_63227 [Guillardia theta CCMP2712]
Length = 344
Score = 241 bits (614), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 131/299 (43%), Positives = 182/299 (60%), Gaps = 16/299 (5%)
Query: 63 EKEMRLKIFKENLEYIEKANKEGNR---TYKLGTNQFSDLTNDEFRALYTGYKMPSPSHR 119
E ++F++NL+ I K N+E N+ +Y++G N F+ LT +EF A Y GY + +
Sbjct: 47 ESTRAFEVFQKNLDMIMKHNEEYNQGLQSYEMGLNGFAHLTFEEFSAQYLGYG-GAEVEQ 105
Query: 120 STTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGN 179
T K++ S +++P S+DWR+KGAV +KNQ CG CWAF+AVAA+EG + SG
Sbjct: 106 PKTRRAGKHERKSRSEIPASVDWREKGAVAEVKNQGACGSCWAFSAVAALEGAHFLNSGE 165
Query: 180 LIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIA--TEDEYPYQAVPGTCSAAQ 236
LI LSEQQL+DCS GN+GC GG + AF Y + N G +E +YPY+ + G C +
Sbjct: 166 LISLSEQQLVDCSKKFGNHGCAGGYMDNAFEYWMNNTGHGDDSEKDYPYKGMDGKCKFSA 225
Query: 237 KPAAAKISNYEEVPSGDEQALLKAVS-MQPVSIAIAAYSTEFQSYKEGIFNGVCGT---Q 292
A IS Y +V G+E LL AV+ + PVS+AI A Q Y G+FNGV GT
Sbjct: 226 DGVRATISGYNDVKQGNETDLLDAVANVGPVSVAIHA-GAALQFYLRGVFNGVAGTCFGP 284
Query: 293 LDHAVTIVGFGTTE----DGANYWLIKNSWGNTWGDAGYMKIVRDEGLCGIGTRSSYPL 347
L+H VT VG+GT +YW+IKNSWG WG+ G+++ R + LCG+ +SYPL
Sbjct: 285 LNHGVTAVGYGTASLRFGRKMDYWIIKNSWGMGWGEKGFVRFARGKNLCGVANGASYPL 343
>gi|42573181|ref|NP_974687.1| Xylem cysteine proteinase 1 [Arabidopsis thaliana]
gi|332661102|gb|AEE86502.1| Xylem cysteine proteinase 1 [Arabidopsis thaliana]
Length = 288
Score = 241 bits (614), Expect = 5e-61, Method: Compositional matrix adjust.
Identities = 116/250 (46%), Positives = 169/250 (67%), Gaps = 5/250 (2%)
Query: 38 THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
T+ ++E+ E WM++H ++YK EK R ++F+ENL +I++ N E N +Y LG N+F+
Sbjct: 42 TNTDKLLELFESWMSEHSKAYKSVEEKVHRFEVFRENLMHIDQRNNEIN-SYWLGLNEFA 100
Query: 98 DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKEC 157
DLT++EF+ Y G P S + S+ F+Y+++ TD+P S+DWR KGAV P+K+Q +C
Sbjct: 101 DLTHEEFKGRYLGLAKPQFSRKRQPSANFRYRDI--TDLPKSVDWRKKGAVAPVKDQGQC 158
Query: 158 GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
G CWAF+ VAAVEGI +I +GNL LSEQ+L+DC T N+GC GG + AF YII G+
Sbjct: 159 GSCWAFSTVAAVEGINQITTGNLSSLSEQELIDCDTTFNSGCNGGLMDYAFQYIISTGGL 218
Query: 218 ATEDEYPYQAVPGTCSAAQKPAA-AKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTE 276
ED+YPY G C ++ IS YE+VP D+++L+KA++ QPVS+AI A +
Sbjct: 219 HKEDDYPYLMEEGICQEQKEDVERVTISGYEDVPENDDESLVKALAHQPVSVAIEASGRD 278
Query: 277 FQSYKEGIFN 286
FQ YK G++N
Sbjct: 279 FQFYK-GVYN 287
>gi|354473025|ref|XP_003498737.1| PREDICTED: cathepsin S-like [Cricetulus griseus]
Length = 341
Score = 241 bits (614), Expect = 5e-61, Method: Compositional matrix adjust.
Identities = 130/306 (42%), Positives = 185/306 (60%), Gaps = 15/306 (4%)
Query: 50 WMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGN---RTYKLGTNQFSDLTNDEFRA 106
W HG+ YK++ E+E R I+++NL+ + N E + +Y LG N D+T++E
Sbjct: 40 WKKFHGKQYKEKNEEEARRLIWEKNLKLVMLHNLEYSLEMHSYSLGMNHMGDMTSEEVLG 99
Query: 107 LYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAV 166
++PS HR++T + Q L P S+DWR+KG VT +K Q CG CWAF+AV
Sbjct: 100 QMRPLRVPSQRHRNSTYKSNPNQKL-----PDSMDWREKGCVTEVKYQGSCGSCWAFSAV 154
Query: 167 AAVEGITKIRSGNLIQLSEQQLLDCSTN---GNNGCLGGSREKAFAYIIQNQGIATEDEY 223
A+E K+++G L+ LS Q L+DCST GN GC GG +AF YII N GI ++ Y
Sbjct: 155 GALEAQLKLKTGKLVSLSAQNLVDCSTEEKYGNKGCDGGFMTRAFQYIIDNGGIDSDASY 214
Query: 224 PYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQ-PVSIAIAAYSTEFQSYKE 282
PY+AV C K AA S Y E+PSGDE+AL +AV+ + PVS+ I A F YK
Sbjct: 215 PYKAVAEKCHYDSKSRAATCSRYMELPSGDEEALKEAVANKGPVSVGIDASHPSFFLYKS 274
Query: 283 GIFN-GVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR-DEGLCGIG 340
G+++ C ++H V +VG+G DG +YWL+KNSWG +GD GY+++ R ++ CGI
Sbjct: 275 GVYDEPSCTENVNHGVLVVGYGNL-DGKDYWLVKNSWGLHFGDQGYIRMARNNKNQCGIA 333
Query: 341 TRSSYP 346
+ SYP
Sbjct: 334 SYGSYP 339
>gi|261289785|ref|XP_002611754.1| hypothetical protein BRAFLDRAFT_284341 [Branchiostoma floridae]
gi|229297126|gb|EEN67764.1| hypothetical protein BRAFLDRAFT_284341 [Branchiostoma floridae]
Length = 327
Score = 241 bits (614), Expect = 5e-61, Method: Compositional matrix adjust.
Identities = 132/310 (42%), Positives = 191/310 (61%), Gaps = 12/310 (3%)
Query: 48 EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEF 104
E + HG+ YK E+ +R IF++N + I++ N+E G R+Y +G NQF DL + E+
Sbjct: 21 EAFKLTHGKQYKSPDEENVRRAIFRDNNQMIKEHNQEAAMGRRSYFMGMNQFGDLAHSEY 80
Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFA 164
L G + P + ST S +++ V ++DWR KGAVTPIK+Q CG CWAF+
Sbjct: 81 LELVVGPGLL-PLNLSTPSENV-FESTPGLQVDDTVDWRQKGAVTPIKDQGHCGSCWAFS 138
Query: 165 AVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEY 223
++EG +++G L+ LSEQ LLDCS GN GC GG ++AF YI N GI TE+ Y
Sbjct: 139 TTGSLEGQHFMKTGKLVSLSEQNLLDCSRRFGNKGCEGGLMDQAFRYIKSNGGIDTEECY 198
Query: 224 PYQAVP-GTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYK 281
PY A C + A +S+Y ++ + DE AL++AV ++ PVS+AI A + YK
Sbjct: 199 PYMAKDEKVCDYKTSCSGATLSSYTDIKAMDEMALMQAVGTVGPVSVAIDASHKSLRFYK 258
Query: 282 EGIFN--GVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE-GLCG 338
GI++ T+LDH V VG+G+ DG +YWL+KNSWG+ WGD GY+K+ R++ CG
Sbjct: 259 SGIYDEPECSRTKLDHGVLAVGYGSM-DGMDYWLVKNSWGSAWGDMGYVKMTRNKNNQCG 317
Query: 339 IGTRSSYPLA 348
I T++SYP+
Sbjct: 318 IATKASYPVV 327
>gi|403300975|ref|XP_003941187.1| PREDICTED: cathepsin L1-like isoform 1 [Saimiri boliviensis
boliviensis]
gi|403300977|ref|XP_003941188.1| PREDICTED: cathepsin L1-like isoform 2 [Saimiri boliviensis
boliviensis]
gi|403300979|ref|XP_003941189.1| PREDICTED: cathepsin L1-like isoform 3 [Saimiri boliviensis
boliviensis]
Length = 333
Score = 240 bits (613), Expect = 6e-61, Method: Compositional matrix adjust.
Identities = 132/342 (38%), Positives = 199/342 (58%), Gaps = 23/342 (6%)
Query: 16 TPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENL 75
P I+ + AS ++ + E + KW A H R Y E+E R ++++N+
Sbjct: 2 NPTLILAAFCLGLASAALTFNHSLEAQWI----KWKAMHNRLYGKN-EEEWRRAVWEKNM 56
Query: 76 EYIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
+ IE N E G ++ + N F D+TN+EFR + G++ P + +Q
Sbjct: 57 KTIELHNHEYNQGKHSFTMAMNTFGDMTNEEFRQVMNGFQNRKPRNGKV------FQEPL 110
Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
+ + P S+DWR+KG VTP+KNQ +CG CWAF+A A+EG ++G L+ LSEQ L+DCS
Sbjct: 111 LHEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCS 170
Query: 193 -TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPS 251
GN GC GG + AF Y+ +N G+ +E+ YPY+A +C K + A + + ++P
Sbjct: 171 GPQGNQGCNGGLMDYAFQYVQENGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIPK 230
Query: 252 GDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGI-FNGVCGTQ-LDHAVTIVGFG---TT 305
E+AL+KAV ++ P+S+AI A FQ YKEGI F C ++ +DH V +VG+G T
Sbjct: 231 -LEKALMKAVATVGPISVAIDAGHESFQFYKEGIYFEPECSSEDMDHGVLVVGYGFERTG 289
Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYP 346
D + YWL+KNSWG WG GY+K+ +D + CGI + +SYP
Sbjct: 290 SDNSKYWLVKNSWGEEWGMDGYIKMAKDRKNHCGIASAASYP 331
>gi|380014284|ref|XP_003691169.1| PREDICTED: cathepsin L-like [Apis florea]
Length = 345
Score = 240 bits (613), Expect = 6e-61, Method: Compositional matrix adjust.
Identities = 134/348 (38%), Positives = 205/348 (58%), Gaps = 26/348 (7%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVE-IHEKWMA---QHGRSYKDELEKEMRLKIFKE 73
M + + L ++ + V H S E ++++WM +H ++YK ++E+ R+KIF +
Sbjct: 1 MKLFLILFITIFATV------HAVSFFELVNQEWMTFKMEHKKAYKSDVEERFRMKIFMD 54
Query: 74 NLEYIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSSTF 126
N I K N +YKL N++ D+ + EF + G+ S R ++F
Sbjct: 55 NKHKIAKHNSNYEMKKVSYKLKMNKYGDMLHHEFVNILNGFNKSINTQLRSERMPIGASF 114
Query: 127 -KYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
+ N+++ P +DWR +GAVTP+K+Q CG CW+F+A A+EG R+G L+ LSE
Sbjct: 115 IEPANVAL---PKKVDWRKEGAVTPVKDQGHCGSCWSFSATGALEGQHFRRTGVLVSLSE 171
Query: 186 QQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKIS 244
Q L+DCS GNNGC GG ++AF YI N+G+ TE YPY+A C + A
Sbjct: 172 QNLIDCSGKYGNNGCNGGLMDQAFQYIKDNKGLDTEASYPYEAENDKCRYNPANSGAIDV 231
Query: 245 NYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGI-FNGVCGT-QLDHAVTIVG 301
Y ++P+G+E+ L AV ++ PVS+AI A FQ Y EG+ + C + +LDH V ++G
Sbjct: 232 GYIDIPTGNEKLLKAAVATIGPVSVAIDASHQSFQFYSEGVYYEPECSSEELDHGVLVIG 291
Query: 302 FGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE-GLCGIGTRSSYPLA 348
+GT E+G +YWL+KNSWG TWG+ GY+K+ R++ CGI + +SYPL
Sbjct: 292 YGTNENGEDYWLVKNSWGETWGNNGYIKMARNKLNHCGIASSASYPLV 339
>gi|357153071|ref|XP_003576329.1| PREDICTED: fruit bromelain-like [Brachypodium distachyon]
Length = 398
Score = 240 bits (613), Expect = 6e-61, Method: Compositional matrix adjust.
Identities = 141/331 (42%), Positives = 185/331 (55%), Gaps = 34/331 (10%)
Query: 48 EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEG---NRTYKLGTNQFSDLTNDEF 104
+ WMA GRSY E R +++K N+ YIE N E T++LG F+DLT++EF
Sbjct: 63 QGWMAAQGRSYWTAEETARRFEVYKSNVRYIEAVNAEAATTGLTFELGEGPFTDLTHEEF 122
Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDV----------------------PTSLDW 142
ALY G MP P + + + T V P S DW
Sbjct: 123 SALYNG-SMPPPEEEEGDDIQEEDEQVIATVVDGVDVNVAVHTNLSAGGPRPWPPRSRDW 181
Query: 143 RDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGG 202
R GAVTPIK+Q CG CWAF VA +EG KI GNL+ LSEQQL+DC N+GC GG
Sbjct: 182 RKHGAVTPIKDQGRCGSCWAFPTVATIEGKHKIVRGNLVSLSEQQLIDCDYT-NSGCKGG 240
Query: 203 SREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVS 262
+A+ +I + G+ T YPY+ G C ++ AAA+I+ + V S E AL+ AV+
Sbjct: 241 FVIRAYRWIRKIGGLTTSSAYPYKGARGKCMKRRR-AAARIAGWRSVRSRSEVALVNAVA 299
Query: 263 MQPVSIAIAAYSTEFQSYKEGIFNGVCGT-QLDHAVTIVGFGTTED-GANYWLIKNSWGN 320
QPV++ I+A FQ YK+GI NG C T +L+HAVT+VG+G D GA YW++KNSWG
Sbjct: 300 GQPVAVYISASGKNFQHYKKGILNGPCDTARLNHAVTVVGYGRQADTGAKYWIVKNSWGT 359
Query: 321 TWGDAGYMKIVR----DEGLCGIGTRSSYPL 347
TWG GY+ + R G CGI T +PL
Sbjct: 360 TWGQEGYILMKRGTRNPRGQCGIATSPVFPL 390
>gi|291383488|ref|XP_002708302.1| PREDICTED: cathepsin L1 [Oryctolagus cuniculus]
Length = 344
Score = 240 bits (613), Expect = 6e-61, Method: Compositional matrix adjust.
Identities = 133/341 (39%), Positives = 209/341 (61%), Gaps = 20/341 (5%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
M + + L V C S ++S+ T + S+ +W+A H R Y E+E R ++++N++
Sbjct: 1 MHLPLFLAVLC-SGMISAAPTPDHSLDTRWRQWLAAHKRRYGVR-EEEWRRAVWEKNMQM 58
Query: 78 IEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT 134
IEK N+E G + + N + D+TN+EFR + G++ + +H+ ++ N +
Sbjct: 59 IEKHNREYSQGKHGFTMAMNAYGDMTNEEFRLMMNGFE--NQNHKRGE----EFHNSLLF 112
Query: 135 DVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-T 193
+P LDWR++G VTP+KNQ+ CG WAF+A A+EG ++G L+ LSEQ L+DCS
Sbjct: 113 KIPAFLDWRERGYVTPVKNQELCGSSWAFSATGALEGQMFRKTGRLVSLSEQNLVDCSWP 172
Query: 194 NGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGD 253
GN GC GG + AF Y+ N+G+ +E+ YPY+ G+C + +AA ++ + +V S D
Sbjct: 173 QGNQGCSGGLMDYAFQYVKDNRGLDSEESYPYEQRKGSCKYNPRFSAANVTGFVDV-SKD 231
Query: 254 EQALLKAV-SMQPVSIAIAAYSTEFQSYKEGI-FNGVCGTQ-LDHAVTIVGFGTTEDGA- 309
E+AL++AV ++ PVS+ IA F Y+ GI ++ C ++ ++HAV +VG+G E G+
Sbjct: 232 EKALMEAVATVGPVSVGIATTPESFLFYEGGIYYDPKCSSENVNHAVLVVGYGFEEVGSK 291
Query: 310 --NYWLIKNSWGNTWGDAGYMKIVRDE-GLCGIGTRSSYPL 347
YWLIKNSWG WG GYMK+ +D+ CGI T +SYPL
Sbjct: 292 NNKYWLIKNSWGKDWGMGGYMKMAKDQNNHCGIATAASYPL 332
>gi|159792912|gb|ABW98676.1| cathepsin L [Apostichopus japonicus]
Length = 332
Score = 240 bits (613), Expect = 7e-61, Method: Compositional matrix adjust.
Identities = 128/307 (41%), Positives = 182/307 (59%), Gaps = 13/307 (4%)
Query: 48 EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEF 104
E W H + Y E E++ R KI+++NL+ + K N E G +Y LG N+++DL +EF
Sbjct: 29 EAWKQTHSKQYTKE-EEDNRRKIWEDNLQKVSKHNTEHSLGLHSYTLGMNKYADLRGEEF 87
Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFA 164
+ G K + R K+ + + P S+DWRD+G VTP+K+Q +CG CWAF+
Sbjct: 88 VQMMNGLKFDASRERQG----IKFLSYAKFQAPDSVDWRDEGYVTPVKDQGQCGSCWAFS 143
Query: 165 AVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEY 223
++EG +G L LSEQ L+DCS + GNNGC GG + AF YI N GI TED+Y
Sbjct: 144 TTGSLEGQHFRSTGVLTSLSEQNLVDCSISYGNNGCEGGLMDYAFQYIKDNLGIDTEDKY 203
Query: 224 PYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQ-PVSIAIAAYSTEFQSYKE 282
PY+A TC + A S Y +V SGDE AL +A + P+S+AI A FQ Y+
Sbjct: 204 PYEAEDDTCRFSPDNVGATDSGYVDVDSGDEDALKEACAANGPISVAIDASHESFQLYES 263
Query: 283 GIFN--GVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGI 339
G+++ +LDH V +VG+GT G +YW++KNSWG +WG GY+ + R+ + CGI
Sbjct: 264 GVYDEESCSSIELDHGVLVVGYGTDSVGGDYWIVKNSWGLSWGQEGYIWMSRNKDNQCGI 323
Query: 340 GTRSSYP 346
T +SYP
Sbjct: 324 ATSASYP 330
>gi|75812934|ref|NP_001028787.1| cathepsin S precursor [Bos taurus]
gi|115503669|sp|P25326.2|CATS_BOVIN RecName: Full=Cathepsin S; Flags: Precursor
gi|74353837|gb|AAI02246.1| Cathepsin S [Bos taurus]
gi|296489535|tpg|DAA31648.1| TPA: cathepsin S precursor [Bos taurus]
Length = 331
Score = 240 bits (613), Expect = 7e-61, Method: Compositional matrix adjust.
Identities = 135/338 (39%), Positives = 199/338 (58%), Gaps = 18/338 (5%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIH-EKWMAQHGRSYKDELEKEMRLKIFKENLE 76
M ++ L+ C+S + H ++ H + W +G+ YK++ E+ R I+++NL+
Sbjct: 1 MNWLVWALLLCSSAMAH---VHRDPTLDHHWDLWKKTYGKQYKEKNEEVARRLIWEKNLK 57
Query: 77 YIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
+ N E G +Y+LG N D+T++E +L + ++PS R+ T + Q L
Sbjct: 58 TVTLHNLEHSMGMHSYELGMNHLGDMTSEEVISLMSSLRVPSQWPRNVTYKSDPNQKL-- 115
Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST 193
P S+DWR+KG VT +K Q CG CWAF+AV A+E K+++G L+ LS Q L+DCST
Sbjct: 116 ---PDSMDWREKGCVTEVKYQGACGSCWAFSAVGALEAQVKLKTGKLVSLSAQNLVDCST 172
Query: 194 N--GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPS 251
GN GC GG +AF YII N GI +E YPY+A+ G C K AA S Y E+P
Sbjct: 173 AKYGNKGCNGGFMTEAFQYIIDNNGIDSEASYPYKAMDGKCQYDVKNRAATCSRYIELPF 232
Query: 252 GDEQALLKAVSMQ-PVSIAIAAYSTEFQSYKEGI-FNGVCGTQLDHAVTIVGFGTTEDGA 309
G E+AL +AV+ + PVS+ I A + F YK G+ ++ C ++H V +VG+G DG
Sbjct: 233 GSEEALKEAVANKGPVSVGIDASHSSFFLYKTGVYYDPSCTQNVNHGVLVVGYGNL-DGK 291
Query: 310 NYWLIKNSWGNTWGDAGYMKIVRDEG-LCGIGTRSSYP 346
+YWL+KNSWG +GD GY+++ R+ G CGI SYP
Sbjct: 292 DYWLVKNSWGLHFGDQGYIRMARNSGNHCGIANYPSYP 329
>gi|326514800|dbj|BAJ99761.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 291
Score = 240 bits (613), Expect = 7e-61, Method: Compositional matrix adjust.
Identities = 118/219 (53%), Positives = 152/219 (69%), Gaps = 4/219 (1%)
Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
+ DVP+S+DWR KGAVT +K+Q +CG CWAF+ +AAVEGI IR+ NL LSEQQL+DC
Sbjct: 58 VRDVPSSVDWRQKGAVTAVKDQGQCGSCWAFSTIAAVEGINAIRTKNLTSLSEQQLVDCD 117
Query: 193 TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSG 252
T N GC GG + AF YI ++ G+A ED YPY+A + + A I YE+VP+
Sbjct: 118 TKSNAGCNGGLMDYAFQYIAKHGGVAAEDAYPYKARQASSCNKKPSAVVTIDGYEDVPAN 177
Query: 253 DEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYW 312
DE AL KAV+ QPV++AI A + FQ Y EG+F G CGT+LDH V VG+GTT DG YW
Sbjct: 178 DETALKKAVAAQPVAVAIEASGSHFQFYSEGVFAGKCGTELDHGVAAVGYGTTVDGTKYW 237
Query: 313 LIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
++KNSWG WG+ GY+++ RD EGLCGI +SYP+
Sbjct: 238 IVKNSWGPEWGEKGYIRMKRDVEDKEGLCGIAMEASYPV 276
>gi|62510452|sp|Q8HY81.1|CATS_CANFA RecName: Full=Cathepsin S; Flags: Precursor
gi|27497538|gb|AAO13009.1| cathepsin S preproprotein [Canis lupus familiaris]
Length = 331
Score = 240 bits (613), Expect = 7e-61, Method: Compositional matrix adjust.
Identities = 135/338 (39%), Positives = 195/338 (57%), Gaps = 18/338 (5%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIH-EKWMAQHGRSYKDELEKEMRLKIFKENLE 76
M ++ LL C+ V H+ ++ H W + + YK+E E+ R I+++NL+
Sbjct: 1 MKWLVGLLPLCSYAVAQ---VHKDPTLDHHWNLWKKTYSKQYKEENEEVARRLIWEKNLK 57
Query: 77 YIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
++ N E G +Y LG N D+T +E +L ++PS R+ T Y++ S
Sbjct: 58 FVMLHNLEHSMGMHSYDLGMNHLGDMTGEEVISLMGSLRVPSQWQRNVT-----YRSNSN 112
Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST 193
+P S+DWR+KG VT +K Q CG CWAF+AV A+E K+++G L+ LS Q L+DCST
Sbjct: 113 QKLPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCST 172
Query: 194 N--GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPS 251
GN GC GG AF YII N GI +E YPY+A+ G C K AA S Y E+P
Sbjct: 173 EKYGNKGCNGGFMTTAFQYIIDNNGIDSEASYPYKAMNGKCRYDSKKRAATCSKYTELPF 232
Query: 252 GDEQALLKAVSMQ-PVSIAIAAYSTEFQSYKEGI-FNGVCGTQLDHAVTIVGFGTTEDGA 309
G E AL +AV+ + PVS+AI A F Y+ G+ + C ++H V +VG+G +G
Sbjct: 233 GSEDALKEAVANKGPVSVAIDASHYSFFLYRSGVYYEPSCTQNVNHGVLVVGYGNL-NGK 291
Query: 310 NYWLIKNSWGNTWGDAGYMKIVRDEG-LCGIGTRSSYP 346
+YWL+KNSWG +GD GY+++ R+ G CGI + SYP
Sbjct: 292 DYWLVKNSWGLNFGDQGYIRMARNSGNHCGIASYPSYP 329
>gi|157278115|ref|NP_001098156.1| cathepsin L precursor [Oryzias latipes]
gi|50251128|dbj|BAD27581.1| cathepsin L [Oryzias latipes]
Length = 336
Score = 240 bits (613), Expect = 7e-61, Method: Compositional matrix adjust.
Identities = 131/310 (42%), Positives = 186/310 (60%), Gaps = 17/310 (5%)
Query: 50 WMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEFRA 106
W H ++Y ++ E RL ++++NL IE N E G +Y+LG N F D+T++EFR
Sbjct: 31 WKGWHSKNYHEKEEGWRRL-VWEKNLRKIELHNLEHSMGKHSYRLGMNHFGDMTHEEFRQ 89
Query: 107 LYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAV 166
+ GYK R + S F N + P ++DWRDKG VTP+K+Q +CG CWAF+
Sbjct: 90 IMNGYK--RREQRKYSGSLFMEPNF--LEAPRAVDWRDKGYVTPVKDQGQCGSCWAFSTT 145
Query: 167 AAVEGITKIRSGNLIQLSEQQLLDCS-TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPY 225
A+EG ++G L+ LSEQ L+DCS GN GC GG ++AF Y+ NQG+ +ED YPY
Sbjct: 146 GALEGQQFRKTGKLVSLSEQNLVDCSRPEGNEGCNGGLMDQAFQYVKDNQGLDSEDFYPY 205
Query: 226 QAVPGT-CSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEG 283
+ C + +A + + ++PSG E+AL+KAV S+ PVS+AI A FQ Y+ G
Sbjct: 206 KGTDDQPCQYNAQYSAVNDTGFVDIPSGKERALMKAVASVGPVSVAIDAGHESFQFYQSG 265
Query: 284 I-FNGVCGT-QLDHAVTIVGF---GTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLC 337
I F C + +LDH V +VG+ G DG YW++KNSW WGD G++ + +D C
Sbjct: 266 IYFEKECSSDELDHGVLVVGYGFEGEDVDGKKYWIVKNSWSEKWGDKGFIYMAKDRHNHC 325
Query: 338 GIGTRSSYPL 347
GI T +SYPL
Sbjct: 326 GIATAASYPL 335
>gi|355681664|gb|AER96818.1| cathepsin S [Mustela putorius furo]
Length = 338
Score = 240 bits (613), Expect = 7e-61, Method: Compositional matrix adjust.
Identities = 129/305 (42%), Positives = 188/305 (61%), Gaps = 14/305 (4%)
Query: 50 WMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEFRA 106
W +GR Y+++ E+ R I+++NL+ + N E G +Y LG N +D+T++E +
Sbjct: 39 WKKTYGRQYQEKNEEVARRLIWEKNLKSVMLHNLEYSMGMHSYDLGMNHLADMTSEEVSS 98
Query: 107 LYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAV 166
L + ++PS + T Y++ S +P S+DWR+KG VT +K Q CG CWAF+AV
Sbjct: 99 LMSSLRVPSQWQANVT-----YKSNSNQKLPDSVDWREKGCVTEVKYQGACGACWAFSAV 153
Query: 167 AAVEGITKIRSGNLIQLSEQQLLDCSTN--GNNGCLGGSREKAFAYIIQNQGIATEDEYP 224
A+E K+++GNL+ LS Q L+DCST GN GC GG KAF YII N GI +E YP
Sbjct: 154 GALEAQLKLKTGNLVSLSAQNLVDCSTERYGNKGCNGGFMTKAFQYIIDNNGIDSEVSYP 213
Query: 225 YQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQ-PVSIAIAAYSTEFQSYKEG 283
Y+A+ G C K AA S Y E+P G E AL +AV+ + PVS+AI A + F YK G
Sbjct: 214 YKAMDGNCRYDSKHRAATCSKYTELPFGSEDALKEAVANKGPVSVAIDAKHSSFFLYKSG 273
Query: 284 I-FNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDEG-LCGIGT 341
+ ++ C ++H V +VG+G +G +YWL+KNSWG +G+ GY+++ R+ G CGI +
Sbjct: 274 VYYDPSCTQNVNHGVLVVGYGNL-NGRDYWLVKNSWGLNFGEQGYIRMARNSGNHCGIAS 332
Query: 342 RSSYP 346
SYP
Sbjct: 333 YPSYP 337
>gi|225706086|gb|ACO08889.1| Cathepsin S precursor [Osmerus mordax]
Length = 333
Score = 240 bits (613), Expect = 7e-61, Method: Compositional matrix adjust.
Identities = 126/312 (40%), Positives = 188/312 (60%), Gaps = 13/312 (4%)
Query: 44 VEIH-EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDL 99
+++H + W QHG++YK E+E+ R ++++ NL+ I N E G TY LG N D+
Sbjct: 26 LDLHWQMWKKQHGKNYKTEVEELGRREVWERNLQLISLHNLEASMGMHTYDLGMNHMGDM 85
Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGC 159
T +E + K+P+ R ++ + S T VP ++DWR KG VT +KNQ CG
Sbjct: 86 TEEEILQSFASLKVPADLKREPSA----FVASSGTPVPDTVDWRQKGYVTQVKNQGSCGS 141
Query: 160 CWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIA 218
CWAF++V A+EG +G L+ LS Q L+DCS+ GN GC GG +AF Y+I N+GI
Sbjct: 142 CWAFSSVGALEGQLMRTTGKLLDLSPQNLVDCSSKYGNKGCNGGFMSEAFQYVIDNKGID 201
Query: 219 TEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSM-QPVSIAIAAYSTEF 277
++ YPYQ V GTC +A + Y +P GDE L +AV+M P+S+AI A F
Sbjct: 202 SDTSYPYQGVQGTCHYNPSYRSANCTRYSFLPEGDETTLKQAVAMIGPISVAIDATRPSF 261
Query: 278 QSYKEGIFNGV-CGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE-G 335
++ G++N + C +++HAV +VG+GT DG +YWL+KNSWG +G+ GY+++ R+
Sbjct: 262 ILWRSGVYNDLTCTQKINHAVLVVGYGTL-DGQDYWLVKNSWGTRFGENGYIRMSRNRNN 320
Query: 336 LCGIGTRSSYPL 347
CGI YP+
Sbjct: 321 QCGIALYGCYPI 332
>gi|50657027|emb|CAH04631.1| cathepsin H [Suberites domuncula]
Length = 335
Score = 240 bits (612), Expect = 7e-61, Method: Compositional matrix adjust.
Identities = 131/337 (38%), Positives = 194/337 (57%), Gaps = 13/337 (3%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
+F+ + +LV S + + ++W +HG+ Y E E + RLK+F +N+ Y
Sbjct: 6 LFLGLCVLVHVCSAFIPLVLPIPGLYEDYFKEWQEKHGKVYSTEEESQSRLKVFMKNVIY 65
Query: 78 IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
I+ NK+G+ +Y+L N+++D+T DEF+ Y + P H S T S K D P
Sbjct: 66 IDNHNKQGH-SYELEVNEYADMTLDEFKDQY----LMEPQHCSATHS-LKSDPPKYRDPP 119
Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GN 196
++DWR KGAVTP+KNQ +CG CW F+ +E +++G L+ LSEQQL+DC+ N
Sbjct: 120 KAIDWRSKGAVTPVKNQGQCGSCWTFSTTGCLESHHFLKTGQLVSLSEQQLVDCAQAFNN 179
Query: 197 NGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQA 256
NGC GG +AF YI N G+ +E+ YPY+A C +A +SN + S DE
Sbjct: 180 NGCNGGLPSQAFEYIHYNGGLDSEESYPYRAHDEKCHFVPSEVSATVSNVVNITSKDEMQ 239
Query: 257 LLKAV-SMQPVSIAIAAYSTEFQSYKEGIFNGV-CGT---QLDHAVTIVGFGTTEDGANY 311
L AV ++ PVSIA S +F+ YK+G++ C T ++HAV VG+ TTE G +Y
Sbjct: 240 LYNAVGTVGPVSIAYDV-SADFRFYKKGVYKSKECKTDPEHVNHAVLAVGYNTTESGEDY 298
Query: 312 WLIKNSWGNTWGDAGYMKIVRDEGLCGIGTRSSYPLA 348
W++KNSWG +G GY I R E +CG+ +SYP+
Sbjct: 299 WIVKNSWGTKFGINGYFWIARGENMCGLADCASYPIV 335
>gi|6630972|gb|AAF19630.1|AF194426_1 cysteine proteinase precursor [Myxine glutinosa]
Length = 324
Score = 240 bits (612), Expect = 7e-61, Method: Compositional matrix adjust.
Identities = 127/308 (41%), Positives = 191/308 (62%), Gaps = 12/308 (3%)
Query: 48 EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN---KEGNRTYKLGTNQFSDLTNDEF 104
E W ++G+SY E+ +R ++++ NL+ +++ N +G Y+LG N ++DL N+EF
Sbjct: 20 ESWKGKYGKSYLGRGEEVLRKRVWESNLQIVQQHNVLADQGQANYRLGMNTYADLYNEEF 79
Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFA 164
AL + +S+T TFK L +P+S+DWR++G VTP+K+Q +CG CW+F+
Sbjct: 80 MALKGSSGILQAKDQSSTQ-TFK--PLVGVTLPSSVDWRNQGYVTPVKDQGQCGSCWSFS 136
Query: 165 AVAAVEGITKIRSGNLIQLSEQQLLDCS-TNGNNGCLGGSREKAFAYIIQNQGIATEDEY 223
A ++EG ++G L+ LSEQQL+DCS + GN GC GG E A+ YI G+ E Y
Sbjct: 137 ATGSLEGQHFAKTGTLVSLSEQQLVDCSWSYGNYGCSGGLMESAYDYIRDAGGVQLESAY 196
Query: 224 PYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKE 282
PY A G C Q A A + + +PSGDEQ+L++AV ++ PV++AI A +FQ Y+
Sbjct: 197 PYTAQNGRCHFDQSKAVATCTGHVAIPSGDEQSLMQAVGTVGPVAVAIDASGYDFQLYES 256
Query: 283 GIFN--GVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDEG-LCGI 339
G+++ + LDH V G+G TE G +YWL+KNSWG WG GY+K+ R++ CGI
Sbjct: 257 GVYDRSRCSSSSLDHGVLAAGYG-TEGGNDYWLVKNSWGPGWGAQGYIKMSRNKSNQCGI 315
Query: 340 GTRSSYPL 347
T + YPL
Sbjct: 316 ATMACYPL 323
>gi|348565223|ref|XP_003468403.1| PREDICTED: cathepsin L1-like [Cavia porcellus]
Length = 333
Score = 240 bits (612), Expect = 8e-61, Method: Compositional matrix adjust.
Identities = 135/342 (39%), Positives = 197/342 (57%), Gaps = 21/342 (6%)
Query: 16 TPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENL 75
TP F++ L + +VS+ +Q++ ++W A HGR Y E+ R ++++NL
Sbjct: 2 TPSFVLAALCLG----IVSALPKLDQTLDAQWDQWKAAHGRLYGLN-EEGWRRAVWEKNL 56
Query: 76 EYIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
IE N E G ++ LG N F D+TN+EFR + G++ H+ + YQ
Sbjct: 57 RMIELHNGEYSQGRHSFTLGMNHFGDMTNEEFRQVMNGFQ-----HQKHKTGKM-YQEPL 110
Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
+ +P S+DWR+KG VT +KNQ +CG CWAF+A ++EG ++GNL+ LSEQ L+DCS
Sbjct: 111 LLQLPKSVDWREKGYVTEVKNQGQCGSCWAFSATGSLEGQMFHKTGNLVSLSEQNLVDCS 170
Query: 193 -TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPS 251
GN GC GG + AF Y+ N+G+ E YPY G C + +AA + + +VP
Sbjct: 171 RPQGNQGCNGGLMDFAFQYVKDNKGLEAEKSYPYVGKDGECKYKPELSAANDTGFVDVPQ 230
Query: 252 GDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIF--NGVCGTQLDHAVTIVGFGT--TED 307
++ ++ P+S+AI A FQ YKEGI+ G L+H V +VG+GT +E
Sbjct: 231 REKVVQKALATVGPLSVAIDAGLQSFQFYKEGIYYDPGCSSRDLNHGVLLVGYGTDASET 290
Query: 308 G-ANYWLIKNSWGNTWGDAGYMKIVRDE-GLCGIGTRSSYPL 347
G +YWLIKNSWG TWG GY+KI R+ CG+ T +SYPL
Sbjct: 291 GKGDYWLIKNSWGTTWGADGYVKIARNRNNHCGVATAASYPL 332
>gi|340727787|ref|XP_003402217.1| PREDICTED: cathepsin L-like [Bombus terrestris]
Length = 343
Score = 240 bits (612), Expect = 8e-61, Method: Compositional matrix adjust.
Identities = 133/345 (38%), Positives = 199/345 (57%), Gaps = 24/345 (6%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMA---QHGRSYKDELEKEMRLKIFKEN 74
+F+ + + V +Q +S ++++W +H + YK+++E+ R+KIF +N
Sbjct: 3 LFLFLIVAVLATAQAISFFEL-------VNQEWTTFKMEHNKVYKNDVEERFRMKIFMDN 55
Query: 75 LEYIEKANKEGNR-----TYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQ 129
I K N GN +YKL N++ D+ + EF G+ + +
Sbjct: 56 KHKIAKHN--GNYEMKKVSYKLKMNKYGDMLHHEFVNTLNGFNKSINTQLRSERLPIAAS 113
Query: 130 NLSMTDV--PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQ 187
+ +V P ++DWR+ GAVTP+K+Q CG CW+F+A A+EG R+G LI LSEQ
Sbjct: 114 FIEPANVVLPKTVDWREHGAVTPVKDQGHCGSCWSFSATGALEGQHFRRTGILIPLSEQN 173
Query: 188 LLDCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNY 246
L+DCS GNNGC GG ++AF YI N+G+ TE YPY+A C + A+ Y
Sbjct: 174 LIDCSGKYGNNGCNGGLMDQAFQYIKDNKGLDTEVTYPYEAENDKCRYNAANSGARDVGY 233
Query: 247 EEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGI-FNGVCGTQ-LDHAVTIVGFG 303
++P G+E+ L AV ++ PVS+AI A FQ Y EG+ + C ++ LDH V VG+G
Sbjct: 234 VDIPQGNEKKLKAAVATIGPVSVAIDASHQSFQFYSEGVYYEPECSSENLDHGVLAVGYG 293
Query: 304 TTEDGANYWLIKNSWGNTWGDAGYMKIVRDE-GLCGIGTRSSYPL 347
T E+G +YWL+KNSWG TWGD GY+K+ R++ CGI + +SYPL
Sbjct: 294 TDENGQDYWLVKNSWGETWGDNGYIKMARNKLNHCGIASTASYPL 338
>gi|306992173|gb|ADN19567.1| cathepsin L-like proteinase [Spodoptera frugiperda]
Length = 344
Score = 240 bits (612), Expect = 8e-61, Method: Compositional matrix adjust.
Identities = 128/323 (39%), Positives = 187/323 (57%), Gaps = 23/323 (7%)
Query: 46 IHEKWMA---QHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNR---TYKLGTNQFSDL 99
+ E+W A +H + Y E+E + R+KI+ EN I K N+ + +YKL N+++D+
Sbjct: 23 VREEWNAFKMEHSKQYDSEVEDKFRMKIYVENKHRIAKHNQRFEQRLVSYKLKPNKYADM 82
Query: 100 TNDEFRALYTGY----------KMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVT 149
+ EF G+ K R ++TF + P +DWR KGAVT
Sbjct: 83 LHHEFVHTMNGFNKTAKHGGRNKAVHSKGRDGRAATFIAP--AHVSYPDHVDWRKKGAVT 140
Query: 150 PIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAF 208
+K+Q +CG CWAF+ A+EG ++G L+ LSEQ L+DCS GNNGC GG + AF
Sbjct: 141 DVKDQGKCGSCWAFSTTGALEGQHFRKTGYLVSLSEQNLVDCSAAYGNNGCNGGLMDNAF 200
Query: 209 AYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVS 267
YI N GI TE YPY+AV C K + A + ++P GDE+ L++AV ++ P+S
Sbjct: 201 KYIKDNGGIDTEKSYPYEAVDDKCRYNPKNSGADDVGFVDIPQGDEEKLMQAVATVGPIS 260
Query: 268 IAIAAYSTEFQSYKEGIF--NGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDA 325
+AI A FQ Y +G++ T LDH V +VG+GT E+G +YWL+KNSWG +WG+
Sbjct: 261 VAIDASQETFQFYSKGVYYDENCSSTDLDHGVMVVGYGTEEEGGDYWLVKNSWGRSWGEL 320
Query: 326 GYMKIVRDE-GLCGIGTRSSYPL 347
GY+K+ ++ CGI + +SYPL
Sbjct: 321 GYIKMAHNKNNHCGIASSASYPL 343
>gi|288548566|gb|ADC52431.1| cathepsin L2 cysteine protease [Pinctada fucata]
Length = 330
Score = 240 bits (612), Expect = 8e-61, Method: Compositional matrix adjust.
Identities = 127/302 (42%), Positives = 189/302 (62%), Gaps = 14/302 (4%)
Query: 53 QHGRSYKDELEKEMRLKIFKENLEYIEKAN---KEGNRTYKLGTNQFSDLTNDEFRALYT 109
Q+ + Y++E E RL +++ NL++I N G T+ +G N++ D+TN+EF
Sbjct: 33 QYNKLYQNEEEARRRL-VWESNLDFITLHNLAADRGEHTFWVGMNEYGDMTNEEFTKTMN 91
Query: 110 GYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAV 169
GY+M ++++ + F N +M D+P ++DWR KG VTPIKNQ +CG CW+F+A ++
Sbjct: 92 GYRM---RNKTSNAPVFMPPN-NMGDLPDTVDWRPKGYVTPIKNQGQCGSCWSFSATGSL 147
Query: 170 EGITKIRSGNLIQLSEQQLLDCS-TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAV 228
EG T ++G L+ LSEQ L+DCS GN+GC GG + AF YI N GI TE YPY+A
Sbjct: 148 EGQTFKKTGKLVSLSEQNLVDCSKKQGNHGCEGGLMDDAFTYIKANNGIDTEASYPYKAR 207
Query: 229 PGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGIFNG 287
G C A + + ++ + DE+AL +AV ++ P+S+AI A FQ Y+ G+++
Sbjct: 208 DGKCEFKSADVGATDTGFVDIKTKDEEALKQAVATVGPISVAIDASHMSFQLYRTGVYHD 267
Query: 288 -VCG-TQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSS 344
C T+LDH V VG+G TED +YWL+KNSWG +WG GY+++ R+ CGI T +S
Sbjct: 268 WFCSQTKLDHGVLAVGYG-TEDSKDYWLVKNSWGESWGQKGYIQMSRNRRNNCGIATSAS 326
Query: 345 YP 346
YP
Sbjct: 327 YP 328
>gi|110349473|gb|ABG73217.1| cathepsin L 1 precursor [Diaprepes abbreviatus]
Length = 322
Score = 240 bits (612), Expect = 8e-61, Method: Compositional matrix adjust.
Identities = 129/336 (38%), Positives = 204/336 (60%), Gaps = 23/336 (6%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
+FI LLV+ ++ V+ Q+ + +HG++YK+++E+ R IFK+NL
Sbjct: 3 VFIAACLLVAVSATVLEETGVKFQA-------FKLKHGKTYKNQVEETARFNIFKDNLRA 55
Query: 78 IEKAN---KEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT 134
IE+ N ++G +YK G N+F+D+T +EFRA T P H +TT L+
Sbjct: 56 IEQHNVLYEQGLVSYKKGINRFTDMTQEEFRAFLTLSSSKKP-HFNTTEHV-----LTGL 109
Query: 135 DVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN 194
VP S+DWR KG VT +K+Q CG CWAF+ + E ++G L+ LSEQQL+DCST+
Sbjct: 110 AVPDSIDWRTKGQVTGVKDQGNCGSCWAFSVTGSTEAAYYRKAGKLVSLSEQQLVDCSTD 169
Query: 195 GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDE 254
N GC GG ++ F Y ++++G+ E YPY+ G+C + K+S ++ + S DE
Sbjct: 170 INAGCNGGYLDETFTY-VKSKGLEAESTYPYKGTDGSCKYSASKVVTKVSGHKSLKSEDE 228
Query: 255 QALLKAV-SMQPVSIAIAAYSTEFQSYKEGIF-NGVCG-TQLDHAVTIVGFGTTEDGANY 311
ALL AV ++ PVS+AI A T SY+ GI+ + C ++L+H V +VG+GT+ +G Y
Sbjct: 229 NALLDAVGNVGPVSVAIDA--TYLSSYESGIYEDDWCSPSELNHGVLVVGYGTS-NGKKY 285
Query: 312 WLIKNSWGNTWGDAGYMKIVRDEGLCGIGTRSSYPL 347
W++KNSWG ++G++GY +++R + CG+ + YP+
Sbjct: 286 WIVKNSWGGSFGESGYFRLLRGKNECGVAEDTVYPI 321
>gi|354622947|ref|NP_001002938.2| cathepsin S precursor [Canis lupus familiaris]
Length = 339
Score = 240 bits (612), Expect = 9e-61, Method: Compositional matrix adjust.
Identities = 138/348 (39%), Positives = 199/348 (57%), Gaps = 24/348 (6%)
Query: 8 SGSFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIH-EKWMAQHGRSYKDELEKEM 66
+GSF M ++ LL C+ V H+ ++ H W + + YK+E E+
Sbjct: 5 AGSF------MKWLVGLLPLCSYAVAQ---VHKDPTLDHHWNLWKKTYSKQYKEENEEVA 55
Query: 67 RLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTS 123
R I+++NL+++ N E G +Y LG N D+T +E +L ++PS R+ T
Sbjct: 56 RRLIWEKNLKFVMLHNLEHSMGMHSYDLGMNHLGDMTGEEVISLMGSLRVPSQWQRNVT- 114
Query: 124 STFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQL 183
Y++ S +P S+DWR+KG VT +K Q CG CWAF+AV A+E K+++G L+ L
Sbjct: 115 ----YRSNSNQKLPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSL 170
Query: 184 SEQQLLDCSTN--GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAA 241
S Q L+DCST GN GC GG AF YII N GI +E YPY+A+ G C K AA
Sbjct: 171 SAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNNGIDSEASYPYKAMNGKCRYDSKKRAA 230
Query: 242 KISNYEEVPSGDEQALLKAVSMQ-PVSIAIAAYSTEFQSYKEGI-FNGVCGTQLDHAVTI 299
S Y E+P G E AL +AV+ + PVS+AI A F Y+ G+ + C ++H V +
Sbjct: 231 TCSKYTELPFGSEDALKEAVANKGPVSVAIDASHYSFFLYRSGVYYEPSCTQNVNHGVLV 290
Query: 300 VGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDEG-LCGIGTRSSYP 346
VG+G +G +YWL+KNSWG +GD GY+++ R+ G CGI + SYP
Sbjct: 291 VGYGNL-NGKDYWLVKNSWGLNFGDQGYIRMARNSGNHCGIASYPSYP 337
>gi|32394728|gb|AAM96000.1| cathepsin L precursor [Metapenaeus ensis]
Length = 322
Score = 240 bits (612), Expect = 9e-61, Method: Compositional matrix adjust.
Identities = 122/308 (39%), Positives = 184/308 (59%), Gaps = 14/308 (4%)
Query: 48 EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN---KEGNRTYKLGTNQFSDLTNDEF 104
+ + Q+GR Y E R +F++N ++IE N + G T+ L NQF D+T++EF
Sbjct: 20 QDFKVQYGRHYGTAREDLYRQSVFEQNQQFIEDHNAKFENGEVTFTLKMNQFGDMTSEEF 79
Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFA 164
A G+ + P+ + L P +DWR KGAVTP+K+QK+CG CWAF+
Sbjct: 80 AATMNGF-LNVPTRHPVAILEADDETL-----PKHVDWRTKGAVTPVKDQKQCGSCWAFS 133
Query: 165 AVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEY 223
++EG ++ G L+ LSEQ L+DCS GN GC GG ++AF YI +N+GI TE+ Y
Sbjct: 134 TTGSLEGQHFLKDGKLVSLSEQNLVDCSGKFGNMGCCGGLMDQAFKYIKENKGIDTEESY 193
Query: 224 PYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVS-MQPVSIAIAAYSTEFQSYKE 282
PY+A G C A + + ++ G+E +L+KAV+ + P+S+AI A FQ Y +
Sbjct: 194 PYEAQDGKCRFDSSNVGATDTGFVDIAHGEENSLMKAVANIGPISVAIDASHPSFQFYHQ 253
Query: 283 GIF--NGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGI 339
G++ T LDH V +G+G T+DG YWL+KNSW +WGD G++++ R+ + CGI
Sbjct: 254 GVYYEKECSSTMLDHGVLAIGYGETDDGKEYWLVKNSWNTSWGDKGFIQMSRNKKNNCGI 313
Query: 340 GTRSSYPL 347
+++SYPL
Sbjct: 314 ASQASYPL 321
>gi|350412176|ref|XP_003489564.1| PREDICTED: cathepsin L-like [Bombus impatiens]
Length = 343
Score = 240 bits (612), Expect = 9e-61, Method: Compositional matrix adjust.
Identities = 132/346 (38%), Positives = 200/346 (57%), Gaps = 24/346 (6%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMA---QHGRSYKDELEKEMRLKIFKEN 74
+F+++ + + +Q +S ++++W +H + YK+++E+ R+KIF +N
Sbjct: 3 LFLLLIVAILATAQAISFFEL-------VNQEWTTFKMEHNKVYKNDIEERFRMKIFMDN 55
Query: 75 LEYIEKANKEGNR-----TYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQ 129
I K N GN +YKL N++ D+ + EF G+ + +
Sbjct: 56 KHKIAKHN--GNYEMKKVSYKLKMNKYGDMLHHEFVNTLNGFNKSINTQLRSERLPIGAS 113
Query: 130 NLSMTDV--PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQ 187
+ +V P ++DWR+ GAVTP+K+Q CG CW+F+A A+EG R+G LI LSEQ
Sbjct: 114 FIEPANVVLPKTVDWREHGAVTPVKDQGHCGSCWSFSATGALEGQHFRRTGILIPLSEQN 173
Query: 188 LLDCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNY 246
L+DCS GNNGC GG ++AF YI N+G+ TE YPY+A C + A+ Y
Sbjct: 174 LIDCSGKYGNNGCNGGLMDQAFQYIKDNKGLDTEVTYPYEAENDKCRYNAANSGARDVGY 233
Query: 247 EEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGI-FNGVCGTQ-LDHAVTIVGFG 303
++P G+E+ L AV ++ PVS+AI A FQ Y EG+ + C ++ LDH V VG+G
Sbjct: 234 VDIPQGNEKKLKAAVATIGPVSVAIDASHQSFQFYSEGVYYEPECSSENLDHGVLAVGYG 293
Query: 304 TTEDGANYWLIKNSWGNTWGDAGYMKIVRDE-GLCGIGTRSSYPLA 348
T E+G +YWL+KNSWG TWGD GY+K+ R++ CGI + +SYPL
Sbjct: 294 TDENGQDYWLVKNSWGETWGDNGYIKMARNKLNHCGIASTASYPLV 339
>gi|32394730|gb|AAM96001.1| cathepsin L precursor [Metapenaeus ensis]
Length = 306
Score = 239 bits (611), Expect = 9e-61, Method: Compositional matrix adjust.
Identities = 123/310 (39%), Positives = 183/310 (59%), Gaps = 18/310 (5%)
Query: 48 EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN---KEGNRTYKLGTNQFSDLTNDEF 104
+ + Q+GR Y E R +F++N ++IE N + G T+ L NQF D+T++EF
Sbjct: 4 QDFKVQYGRHYGTAREDLYRQSVFEQNQQFIEDHNAKFENGEVTFTLKMNQFGDMTSEEF 63
Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLSMTD--VPTSLDWRDKGAVTPIKNQKECGCCWA 162
A G+ H L D +P +DWR KGAVTP+K+QK+CG CWA
Sbjct: 64 AATMNGFLNVPTRHPVAI--------LEADDETLPKHVDWRTKGAVTPVKDQKQCGSCWA 115
Query: 163 FAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIATED 221
F+ ++EG ++ G L+ LSEQ L+DCS GN GC GG ++AF YI +N+GI TE+
Sbjct: 116 FSTTGSLEGQHFLKDGKLVSLSEQNLVDCSGKFGNMGCCGGLMDQAFKYIKENKGIDTEE 175
Query: 222 EYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVS-MQPVSIAIAAYSTEFQSY 280
YPY+A G C A + + ++ G+E +L+KAV+ + P+S+AI A FQ Y
Sbjct: 176 SYPYEAQDGKCRFDSSNVGATDTGFVDIAHGEENSLMKAVANIGPISVAIDASHPSFQFY 235
Query: 281 KEGIF--NGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLC 337
+G++ T LDH V +G+G T+DG YWL+KNSW +WGD G++++ R+ + C
Sbjct: 236 HQGVYYEKECSSTMLDHGVLAIGYGETDDGKEYWLVKNSWNTSWGDKGFIQMSRNKKNNC 295
Query: 338 GIGTRSSYPL 347
GI +++SYPL
Sbjct: 296 GIASQASYPL 305
>gi|301767946|ref|XP_002919405.1| PREDICTED: cathepsin S-like [Ailuropoda melanoleuca]
Length = 340
Score = 239 bits (611), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 128/305 (41%), Positives = 187/305 (61%), Gaps = 14/305 (4%)
Query: 50 WMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEFRA 106
W +G+ YK++ E+ R I+++NL+++ N E G +Y LG N D+T++E +
Sbjct: 40 WKKTYGKQYKEKNEEVARRLIWEKNLKFVTLHNLEHSMGMHSYDLGMNHLGDMTSEEVIS 99
Query: 107 LYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAV 166
L + ++PS R+ T Y++ S +P S+DWR+KG VT +K Q CG CWAF+AV
Sbjct: 100 LMSSLRVPSQWPRNVT-----YKSNSNQKLPDSVDWREKGCVTKVKYQGACGACWAFSAV 154
Query: 167 AAVEGITKIRSGNLIQLSEQQLLDCSTN--GNNGCLGGSREKAFAYIIQNQGIATEDEYP 224
A+E K+++G L+ LS Q L+DCST GN GC GG +AF YII N GI +E YP
Sbjct: 155 GALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTEAFQYIIDNNGIDSEASYP 214
Query: 225 YQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQ-PVSIAIAAYSTEFQSYKEG 283
Y+A G C K AA S Y E+PSG E L +AV+ + PVS+AI A + F Y+ G
Sbjct: 215 YKATDGKCRYDSKNRAATCSKYTELPSGSEDDLKEAVANKGPVSVAIDARHSSFFLYRSG 274
Query: 284 I-FNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDEG-LCGIGT 341
+ ++ C ++H V +VG+G +G +YWL+KNSWG +GD GY+++ R+ G CGI +
Sbjct: 275 VYYDPSCTQNVNHGVLVVGYGNL-NGKDYWLVKNSWGLNFGDQGYIRMARNSGNHCGIAS 333
Query: 342 RSSYP 346
SYP
Sbjct: 334 YPSYP 338
>gi|281352890|gb|EFB28474.1| hypothetical protein PANDA_008012 [Ailuropoda melanoleuca]
Length = 328
Score = 239 bits (611), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 128/305 (41%), Positives = 187/305 (61%), Gaps = 14/305 (4%)
Query: 50 WMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEFRA 106
W +G+ YK++ E+ R I+++NL+++ N E G +Y LG N D+T++E +
Sbjct: 28 WKKTYGKQYKEKNEEVARRLIWEKNLKFVTLHNLEHSMGMHSYDLGMNHLGDMTSEEVIS 87
Query: 107 LYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAV 166
L + ++PS R+ T Y++ S +P S+DWR+KG VT +K Q CG CWAF+AV
Sbjct: 88 LMSSLRVPSQWPRNVT-----YKSNSNQKLPDSVDWREKGCVTKVKYQGACGACWAFSAV 142
Query: 167 AAVEGITKIRSGNLIQLSEQQLLDCSTN--GNNGCLGGSREKAFAYIIQNQGIATEDEYP 224
A+E K+++G L+ LS Q L+DCST GN GC GG +AF YII N GI +E YP
Sbjct: 143 GALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTEAFQYIIDNNGIDSEASYP 202
Query: 225 YQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQ-PVSIAIAAYSTEFQSYKEG 283
Y+A G C K AA S Y E+PSG E L +AV+ + PVS+AI A + F Y+ G
Sbjct: 203 YKATDGKCRYDSKNRAATCSKYTELPSGSEDDLKEAVANKGPVSVAIDARHSSFFLYRSG 262
Query: 284 I-FNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDEG-LCGIGT 341
+ ++ C ++H V +VG+G +G +YWL+KNSWG +GD GY+++ R+ G CGI +
Sbjct: 263 VYYDPSCTQNVNHGVLVVGYGNL-NGKDYWLVKNSWGLNFGDQGYIRMARNSGNHCGIAS 321
Query: 342 RSSYP 346
SYP
Sbjct: 322 YPSYP 326
>gi|290462225|gb|ADD24160.1| Cathepsin L [Lepeophtheirus salmonis]
Length = 334
Score = 239 bits (611), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 132/340 (38%), Positives = 199/340 (58%), Gaps = 21/340 (6%)
Query: 19 FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
+++++++S AS V + V+ E W H + Y +E+++RLKIF EN I
Sbjct: 6 ILLLSVIISTASAV-----SFFDVVLSDWESWKLTHQKGYDSSVEEKLRLKIFMENSLRI 60
Query: 79 EKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTF-KYQNLSMT 134
+ N E G TY + N + DL + EF A+ GY ++++T TF +N+++
Sbjct: 61 SRHNAEAIQGRHTYFMKMNHYGDLLHHEFVAMVNGYIY---NNKTTLGGTFIPSKNINL- 116
Query: 135 DVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN 194
P +DWR++GAVTP+KNQ +CG CW+F+A ++EG ++G LI LSEQ L+DCS
Sbjct: 117 --PEHVDWREEGAVTPVKNQGQCGSCWSFSATGSLEGQDFRKTGKLISLSEQNLVDCSRK 174
Query: 195 -GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGD 253
GNNGC GG + AF YI N GI TE YPY+ + G C K + ++ G
Sbjct: 175 YGNNGCEGGLMDYAFKYIQDNNGIDTEASYPYEGIDGHCHYDPKNKGGSDIGFVDIKKGS 234
Query: 254 EQALLKAV-SMQPVSIAIAAYSTEFQSYKEGIFN-GVCGTQ-LDHAVTIVGFGTTE-DGA 309
E+ L KA+ ++ P+S+AI A FQ Y G+++ C + LDH V VG+GT E G
Sbjct: 235 EKDLQKALATVGPISVAIDASHMSFQFYSHGVYSEKKCSPENLDHGVLAVGYGTDEVTGE 294
Query: 310 NYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPLA 348
+YWL+KNSW WG+ GY+K+ R+ + +CGI + +SYP+
Sbjct: 295 DYWLVKNSWSEKWGEDGYIKMARNKDNMCGIASSASYPVV 334
>gi|12805315|gb|AAH02125.1| Ctss protein [Mus musculus]
Length = 340
Score = 239 bits (610), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 128/306 (41%), Positives = 186/306 (60%), Gaps = 15/306 (4%)
Query: 50 WMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEFRA 106
W H + YKD+ E+E+R I+++NL++I N E G TY++G N D+TN+E
Sbjct: 39 WKKTHEKEYKDKNEEEVRRLIWEKNLKFIMIHNLEYSMGMHTYQVGMNDMGDMTNEEILC 98
Query: 107 LYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAV 166
++P S ++ T +++ S +P ++DWR+KG VT +K Q CG CWAF+AV
Sbjct: 99 RMGALRIPRQSPKTVT-----FRSYSNRTLPDTVDWREKGCVTEVKYQGSCGACWAFSAV 153
Query: 167 AAVEGITKIRSGNLIQLSEQQLLDCSTN---GNNGCLGGSREKAFAYIIQNQGIATEDEY 223
A+EG K+++G LI LS Q L+DCS GN GC GG +AF YII N GI + Y
Sbjct: 154 GALEGQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEAFQYIIDNGGIEADASY 213
Query: 224 PYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQ-PVSIAIAAYSTEFQSYKE 282
PY+A+ C K AA S Y ++P GDE AL +AV+ + PVS+ I A + F YK
Sbjct: 214 PYKAMDEKCHYNSKNRAATCSRYIQLPFGDEDALKEAVATKGPVSVGIDASHSSFFFYKS 273
Query: 283 GIFNG-VCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR-DEGLCGIG 340
G+++ C ++H V +VG+GT DG +YWL+KNSWG +GD GY+++ R ++ CGI
Sbjct: 274 GVYDDPSCTGNVNHGVLVVGYGTL-DGKDYWLVKNSWGLNFGDQGYIRMARNNKNHCGIA 332
Query: 341 TRSSYP 346
+ SYP
Sbjct: 333 SDCSYP 338
>gi|391340505|ref|XP_003744580.1| PREDICTED: digestive cysteine proteinase 1-like [Metaseiulus
occidentalis]
Length = 469
Score = 239 bits (610), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 131/308 (42%), Positives = 188/308 (61%), Gaps = 16/308 (5%)
Query: 48 EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE--GNRTYKLGTNQFSDLTNDEFR 105
E + G++Y+ + E +R IF+ NL +IEK N E +R Y LG QF+D++ EFR
Sbjct: 167 EHFKEHFGKTYEGD-EHALRQGIFQRNLAHIEKFNAEKAASRGYTLGITQFADMSTAEFR 225
Query: 106 ALYTGYKMPSPSHRSTTSSTFKYQNLSMTD---VPTSLDWRDKGAVTPIKNQKECGCCWA 162
Y G +M + ST + K Q + D +P ++DWRDKGAV+P+K+Q +CG CWA
Sbjct: 226 QTYLGLRMNA----STIAKLRKLQREVVADDRDLPEAVDWRDKGAVSPVKDQGQCGSCWA 281
Query: 163 FAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDE 222
F+ A+EG +++G L+ LSEQQ++DCS + GC GG A Y+ N G+ E
Sbjct: 282 FSTSGAIEGQHFLKNGELLSLSEQQMVDCSWL-DFGCNGGQPMLAMEYVRFNGGLELETA 340
Query: 223 YPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVS-MQPVSIAIAAYSTEFQSYK 281
YPY+ V G+C + +K AAAKI+ + E AL KAV+ + P+S+ + A +FQ YK
Sbjct: 341 YPYKGVGGSCHSDKKSAAAKITGFWMAGFYSESALQKAVAKVGPISVGMDASGEDFQHYK 400
Query: 282 EGIFN--GVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDEG-LCG 338
GI+N LDHAV VG+GT++DG +YWL+KNSW +WG+ GY K+ R++G CG
Sbjct: 401 SGIYNPESCSSIGLDHAVLAVGYGTSDDG-DYWLVKNSWNTSWGEKGYFKLPRNKGNKCG 459
Query: 339 IGTRSSYP 346
I T YP
Sbjct: 460 IATTPIYP 467
>gi|291383517|ref|XP_002708299.1| PREDICTED: cathepsin L1 [Oryctolagus cuniculus]
Length = 333
Score = 239 bits (610), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 132/334 (39%), Positives = 199/334 (59%), Gaps = 19/334 (5%)
Query: 24 LLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANK 83
LL + + S+ +Q++ +W A H R Y E+ R ++++N+ IE N
Sbjct: 6 LLAAVCWGIASAIPKFDQNLDTQWYQWKATHKRLYGLN-EEGWRRAVWEKNMRMIELHNG 64
Query: 84 E---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSL 140
E G + +G N + D+TN+EFR + G++ + H+ +++ + P S+
Sbjct: 65 EYSQGKHGFTMGMNAYGDMTNEEFRQVMNGFQ--NQKHKKGK----MFRDPLLLQYPKSV 118
Query: 141 DWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-TNGNNGC 199
DWR+KG VTP+KNQ +CG CWAF+A A+EG ++G LI LSEQ L+DCS GN GC
Sbjct: 119 DWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFQKTGKLISLSEQNLVDCSHPQGNQGC 178
Query: 200 LGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLK 259
GG + AF Y+ N G+ +E+ YPY+ + GTC + + A + + ++P G E+ALL+
Sbjct: 179 NGGLMDYAFQYVKDNSGLDSEESYPYEGMDGTCKYKPECSVANDTGFVDIP-GHEKALLR 237
Query: 260 AV-SMQPVSIAIAAYSTEFQSYKEGI-FNGVCGTQ-LDHAVTIVGF---GTTEDGANYWL 313
AV ++ P+S AI A FQ YK GI ++ C ++ LDH + +VG+ GT + YWL
Sbjct: 238 AVATVGPISAAIDAGHMSFQFYKSGIYYDPDCSSKDLDHGILVVGYGFEGTNSNATKYWL 297
Query: 314 IKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYP 346
+KNSWG TWGD GY+KI+RD + CGI T +SYP
Sbjct: 298 VKNSWGTTWGDEGYVKIIRDKDNHCGIATAASYP 331
>gi|350583407|ref|XP_003481511.1| PREDICTED: cathepsin S [Sus scrofa]
Length = 331
Score = 239 bits (610), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 134/338 (39%), Positives = 198/338 (58%), Gaps = 18/338 (5%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIH-EKWMAQHGRSYKDELEKEMRLKIFKENLE 76
M ++ +L+ C+S + H ++ H + W +G+ YK++ E+ R I+++NL+
Sbjct: 1 MKCLVWVLLLCSSAMAQ---LHRDPTLDRHWDLWKKTYGKQYKEKNEEVARRLIWEKNLK 57
Query: 77 YIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
+ N E G +Y LG N D+T++E +L + ++PS R+ T + Q L
Sbjct: 58 TVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVISLMSCVRVPSQWPRNVTYKSNPNQKL-- 115
Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST 193
P S+DWR+KG VT +K Q CG CWAF+AV A+E K+++G L+ LS Q L+DCST
Sbjct: 116 ---PDSMDWREKGCVTEVKYQGSCGSCWAFSAVGALEAQVKMKTGRLVSLSAQNLVDCST 172
Query: 194 NG--NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPS 251
N GC GG +AF YII N GI +E YPY+AV G C K AA S Y E+P
Sbjct: 173 EKYRNKGCNGGFMTEAFQYIIDNNGIDSEASYPYKAVDGKCKYDSKNRAATCSRYTELPF 232
Query: 252 GDEQALLKAVSMQ-PVSIAIAAYSTEFQSYKEGI-FNGVCGTQLDHAVTIVGFGTTEDGA 309
DE AL +AV+ + PVS+AI A + F Y+ G+ ++ C ++H V +VG+G +G
Sbjct: 233 ADEYALKEAVANKGPVSVAIDAKHSSFFFYRSGVYYDPSCTQNVNHGVLVVGYGNL-NGK 291
Query: 310 NYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYP 346
+YWL+KNSWG +GD GY+++ R+ E CGI SYP
Sbjct: 292 DYWLVKNSWGLNFGDGGYIRMARNSENHCGIANYPSYP 329
>gi|74178074|dbj|BAE29827.1| unnamed protein product [Mus musculus]
gi|74178231|dbj|BAE29900.1| unnamed protein product [Mus musculus]
gi|74220784|dbj|BAE31361.1| unnamed protein product [Mus musculus]
Length = 326
Score = 239 bits (610), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 128/306 (41%), Positives = 185/306 (60%), Gaps = 15/306 (4%)
Query: 50 WMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEFRA 106
W H + YKD+ E+E+R I+++NL++I N E G TY++G N D+TN+E
Sbjct: 25 WKKTHEKEYKDKNEEEVRRLIWEKNLKFIMIHNLEYSMGMHTYQVGMNDMGDMTNEEILC 84
Query: 107 LYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAV 166
++P S ++ T +++ S +P ++DWR+KG VT +K Q CG CWAF+AV
Sbjct: 85 RMGALRIPRQSPKTVT-----FRSYSNRTLPDTVDWREKGCVTEVKYQGSCGACWAFSAV 139
Query: 167 AAVEGITKIRSGNLIQLSEQQLLDCSTN---GNNGCLGGSREKAFAYIIQNQGIATEDEY 223
A+EG K+++G LI LS Q L+DCS GN GC GG +AF YII N GI + Y
Sbjct: 140 GALEGQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEAFQYIIDNGGIEADASY 199
Query: 224 PYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQ-PVSIAIAAYSTEFQSYKE 282
PY+A C K AA S Y ++P GDE AL +AV+ + PVS+ I A + F YK
Sbjct: 200 PYKATDEKCHYNSKNRAATCSRYIQLPFGDEDALKEAVATKGPVSVGIDASHSSFFFYKS 259
Query: 283 GIFNG-VCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR-DEGLCGIG 340
G+++ C ++H V +VG+GT DG +YWL+KNSWG +GD GY+++ R ++ CGI
Sbjct: 260 GVYDDPSCTGNVNHGVLVVGYGTL-DGKDYWLVKNSWGLNFGDQGYIRMARNNKNHCGIA 318
Query: 341 TRSSYP 346
+ SYP
Sbjct: 319 SYCSYP 324
>gi|443685370|gb|ELT89004.1| hypothetical protein CAPTEDRAFT_95613, partial [Capitella teleta]
Length = 295
Score = 239 bits (609), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 127/299 (42%), Positives = 181/299 (60%), Gaps = 15/299 (5%)
Query: 61 ELEKEMRLKIFKENLEYIEKAN---KEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPS 117
E E+ R ++F+ N++ I+ N ++G + +G NQFSD+ EF + G++M +
Sbjct: 1 ETEENQRKEVFRNNIKKIQMHNYLHEQGKSPFTMGINQFSDMDEKEFSTIMNGFRM---N 57
Query: 118 HRSTTSSTFKYQNLSM---TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITK 174
+R+ +S VP +DWR KG VTP+KNQ +CG CWAF+A+ A+EG
Sbjct: 58 NRTKVRDHLHSHYISPAIPVSVPAEVDWRKKGYVTPVKNQGQCGSCWAFSAIGALEGQHF 117
Query: 175 IRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCS 233
++G L+ LSEQ L+DCS + GNNGC GG + AF YI N G TE YPY+AV G C
Sbjct: 118 RKTGKLVSLSEQNLVDCSKSYGNNGCNGGVMDYAFKYIKDNDGDDTEACYPYEAVDGMCR 177
Query: 234 AAQKPAAAKISNYEEVPSGDEQALLKAVSM-QPVSIAIAAYSTEFQSYKEGIF-NGVCGT 291
++ A Y ++P G+E + +AV++ PVS+AI A + F SYK G++ C
Sbjct: 178 FKRECVGATCRGYTDLPWGNEVKMKEAVALVGPVSVAIDASHSSFMSYKGGVYVEKECSP 237
Query: 292 -QLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPLA 348
QLDH V +VG+G TE G +YWL+KNSWG TWGD GY+K+ R+ CGI + + YPL
Sbjct: 238 YQLDHGVLVVGYG-TEQGLDYWLVKNSWGTTWGDQGYIKMARNMHNHCGIASMACYPLV 295
>gi|4731372|gb|AAD28476.1|AF133838_1 papain-like cysteine protease [Sandersonia aurantiaca]
Length = 370
Score = 239 bits (609), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 116/252 (46%), Positives = 164/252 (65%), Gaps = 6/252 (2%)
Query: 101 NDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCC 160
N R T + + R+ ++ +Y+ + +P S+DWR+KGAV PIK+Q CG C
Sbjct: 6 NSRPRRRTTYFGVRGAGRRTPGLASDRYRYRAGDALPDSVDWREKGAVVPIKDQGGCGSC 65
Query: 161 WAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATE 220
WAF+ +A+VEGI KI +G+LI LSEQ+L+DC N+GC GG + AF +II N GI TE
Sbjct: 66 WAFSTIASVEGINKIVTGDLISLSEQELVDCDKTYNDGCNGGLMDYAFQFIIDNGGIDTE 125
Query: 221 DEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQS 279
+YPY G C + +K A I++YE+VP DEQAL KA + QP+++AI FQ
Sbjct: 126 KDYPYTEQDGRCDSYRKNAKVVSINSYEDVPVNDEQALKKAAASQPIAVAIDGGGRSFQL 185
Query: 280 YKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EG 335
Y GIF G CGT LDH VT+VG+G +E G +YW+++NSWG +WG+ GY+++ R+ G
Sbjct: 186 YNSGIFTGKCGTSLDHGVTVVGYG-SESGKDYWIVRNSWGESWGEKGYIRMARNIDSPSG 244
Query: 336 LCGIGTRSSYPL 347
+CGI +SYP+
Sbjct: 245 ICGIAMEASYPI 256
>gi|348514005|ref|XP_003444531.1| PREDICTED: cathepsin L1-like [Oreochromis niloticus]
Length = 338
Score = 239 bits (609), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 131/311 (42%), Positives = 185/311 (59%), Gaps = 17/311 (5%)
Query: 50 WMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEFRA 106
W + H + Y E E+ R ++++NL+ IE N + G TY+LG N F D+TN+EFR
Sbjct: 33 WKSWHTKKYH-EKEEGWRRMVWEKNLKKIELHNLDHSMGKHTYRLGMNHFGDMTNEEFRQ 91
Query: 107 LYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAV 166
L GYK + R S F N + P SLDWRDKG VTP+K+Q +CG CWAF+A
Sbjct: 92 LMNGYK--HKAERKVKGSLFLEPNF--LEAPRSLDWRDKGYVTPVKDQGQCGSCWAFSAT 147
Query: 167 AAVEGITKIRSGNLIQLSEQQLLDCST-NGNNGCLGGSREKAFAYIIQNQGIATEDEYPY 225
A+EG ++G ++QLSEQ L++CS GN GC GG ++AF Y+ NQG+ +E+ YPY
Sbjct: 148 GALEGQQFRKTGKMVQLSEQNLVECSRPEGNEGCNGGLMDQAFQYVKDNQGLDSEESYPY 207
Query: 226 QAVPG-TCSAAQKPAAAKISNYEEVPSGDEQALLKAVS-MQPVSIAIAAYSTEFQSYKEG 283
C + A + + ++ SG E AL+KAV+ + P+S+AI A FQ Y+ G
Sbjct: 208 LGTDDQKCHYDPRYNAVNDTGFVDIKSGSEHALMKAVTAVGPISVAIDAGHESFQFYQSG 267
Query: 284 I-FNGVCGT-QLDHAVTIVGF---GTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLC 337
I + C + +LDH V +VG+ G DG YW++KNSW WGD GY+ + +D + C
Sbjct: 268 IYYEPECSSEELDHGVLLVGYGFEGEDVDGKKYWIVKNSWSEKWGDKGYVYMAKDRQNHC 327
Query: 338 GIGTRSSYPLA 348
GI T +SYPL
Sbjct: 328 GIATAASYPLV 338
>gi|221117518|ref|XP_002157675.1| PREDICTED: cathepsin L-like [Hydra magnipapillata]
Length = 340
Score = 239 bits (609), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 142/340 (41%), Positives = 202/340 (59%), Gaps = 15/340 (4%)
Query: 20 IIITLLVSCASQVVSSRSTHEQSVVEIHE--KWMAQHGRSYKDELEKEMRLKIFKENLEY 77
I+I +LV S + S + Q+ ++ E ++ + G+ Y +E+ + +K N E
Sbjct: 5 ILIGILVQSYSFELQSFLNNSQTPMKDPEWRRFKIKFGKFYSSNIEETSKYLNWKINNEK 64
Query: 78 IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTG-YKMPSPSHRSTTSSTFKYQNLSMTDV 136
I+ N E NR +K+G NQFSDLT++EF +Y G +K+P T STF S ++
Sbjct: 65 IKNHNSE-NRFFKIGMNQFSDLTHEEFIKIYGGCFKLPKSFINITKGSTF--LPPSNVNI 121
Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-G 195
P +DWR KG V P+KNQ +CG CWAF+ A+EG T ++G L LSEQ L+DC+ + G
Sbjct: 122 PDEVDWRTKGYVNPVKNQGQCGSCWAFSTTGALEGQTFRKTGVLPDLSEQNLVDCTQSYG 181
Query: 196 NNGCLGGSREKAFAYIIQNQGIATEDEYPYQA-VPGTCSAAQKPAAAKISNYEEVPSGDE 254
N C GG + AF YI N+GI +E YPY A G C Q+ A + + ++ SGDE
Sbjct: 182 NEACNGGWMDNAFKYISDNKGIDSEAGYPYYAKALGYCYYNQQFNVASDTGFVDIASGDE 241
Query: 255 QALLKAV-SMQPVSIAIAAYSTEFQSYKEGI-FNGVCGT---QLDHAVTIVGFGTTEDGA 309
AL AV ++ P+S+AI A F Y+ G+ + CG LDHAV +VG+G TEDG
Sbjct: 242 DALKVAVATVGPISVAIDATKDSFMRYQSGVYYEPTCGNGLENLDHAVLVVGYG-TEDGR 300
Query: 310 NYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPLA 348
++WL+KNSW TWGD GY+K+ R+ CGI T++SYPL
Sbjct: 301 DFWLVKNSWDITWGDQGYIKMSRNMSNQCGIATKASYPLV 340
>gi|392306967|ref|NP_067256.3| cathepsin S isoform 2 preproprotein [Mus musculus]
gi|26390492|dbj|BAC25906.1| unnamed protein product [Mus musculus]
gi|148706872|gb|EDL38819.1| cathepsin S [Mus musculus]
Length = 342
Score = 239 bits (609), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 128/306 (41%), Positives = 185/306 (60%), Gaps = 15/306 (4%)
Query: 50 WMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEFRA 106
W H + YKD+ E+E+R I+++NL++I N E G TY++G N D+TN+E
Sbjct: 41 WKKTHEKEYKDKNEEEVRRLIWEKNLKFIMIHNLEYSMGMHTYQVGMNDMGDMTNEEILC 100
Query: 107 LYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAV 166
++P S ++ T +++ S +P ++DWR+KG VT +K Q CG CWAF+AV
Sbjct: 101 RMGALRIPRQSPKTVT-----FRSYSNRTLPDTVDWREKGCVTEVKYQGSCGACWAFSAV 155
Query: 167 AAVEGITKIRSGNLIQLSEQQLLDCSTN---GNNGCLGGSREKAFAYIIQNQGIATEDEY 223
A+EG K+++G LI LS Q L+DCS GN GC GG +AF YII N GI + Y
Sbjct: 156 GALEGQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEAFQYIIDNGGIEADASY 215
Query: 224 PYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQ-PVSIAIAAYSTEFQSYKE 282
PY+A C K AA S Y ++P GDE AL +AV+ + PVS+ I A + F YK
Sbjct: 216 PYKATDEKCHYNSKNRAATCSRYIQLPFGDEDALKEAVATKGPVSVGIDASHSSFFFYKS 275
Query: 283 GIFNG-VCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR-DEGLCGIG 340
G+++ C ++H V +VG+GT DG +YWL+KNSWG +GD GY+++ R ++ CGI
Sbjct: 276 GVYDDPSCTGNVNHGVLVVGYGTL-DGKDYWLVKNSWGLNFGDQGYIRMARNNKNHCGIA 334
Query: 341 TRSSYP 346
+ SYP
Sbjct: 335 SYCSYP 340
>gi|390608645|ref|NP_001254624.1| cathepsin S isoform 1 preproprotein [Mus musculus]
gi|74214026|dbj|BAE29430.1| unnamed protein product [Mus musculus]
Length = 343
Score = 239 bits (609), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 128/306 (41%), Positives = 185/306 (60%), Gaps = 15/306 (4%)
Query: 50 WMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEFRA 106
W H + YKD+ E+E+R I+++NL++I N E G TY++G N D+TN+E
Sbjct: 42 WKKTHEKEYKDKNEEEVRRLIWEKNLKFIMIHNLEYSMGMHTYQVGMNDMGDMTNEEILC 101
Query: 107 LYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAV 166
++P S ++ T +++ S +P ++DWR+KG VT +K Q CG CWAF+AV
Sbjct: 102 RMGALRIPRQSPKTVT-----FRSYSNRTLPDTVDWREKGCVTEVKYQGSCGACWAFSAV 156
Query: 167 AAVEGITKIRSGNLIQLSEQQLLDCSTN---GNNGCLGGSREKAFAYIIQNQGIATEDEY 223
A+EG K+++G LI LS Q L+DCS GN GC GG +AF YII N GI + Y
Sbjct: 157 GALEGQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEAFQYIIDNGGIEADASY 216
Query: 224 PYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQ-PVSIAIAAYSTEFQSYKE 282
PY+A C K AA S Y ++P GDE AL +AV+ + PVS+ I A + F YK
Sbjct: 217 PYKATDEKCHYNSKNRAATCSRYIQLPFGDEDALKEAVATKGPVSVGIDASHSSFFFYKS 276
Query: 283 GIFNG-VCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR-DEGLCGIG 340
G+++ C ++H V +VG+GT DG +YWL+KNSWG +GD GY+++ R ++ CGI
Sbjct: 277 GVYDDPSCTGNVNHGVLVVGYGTL-DGKDYWLVKNSWGLNFGDQGYIRMARNNKNHCGIA 335
Query: 341 TRSSYP 346
+ SYP
Sbjct: 336 SYCSYP 341
>gi|198427748|ref|XP_002130282.1| PREDICTED: similar to predicted protein [Ciona intestinalis]
Length = 340
Score = 239 bits (609), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 131/343 (38%), Positives = 199/343 (58%), Gaps = 19/343 (5%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
+ + + L C S + Q V + + W + Y+ E+E ++ + N
Sbjct: 5 VLLAVVLFAGCCSAM-----QLNQQHVSLFQTWKNLWKKVYQTVEEEEQKMATWFNNWNK 59
Query: 78 IEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNL--- 131
I + N + ++Y+L N++ DLT++EF ++ GY+ R +T + Y NL
Sbjct: 60 ISEHNMQYSLKQKSYRLEMNEYGDLTSEEFSSMMNGYRNDIRLKRKSTGGS-TYLNLLSF 118
Query: 132 -SMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLD 190
S +PT +DWR G VTP+KNQ +CG CW+F+A ++EG K ++G L+ LSEQ L+D
Sbjct: 119 GSQIQLPTLVDWRKHGLVTPVKNQGQCGSCWSFSATGSLEGQHKKKTGKLVSLSEQNLID 178
Query: 191 CST-NGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEV 249
CST GN+GC GG ++AF YI GI TE YPY+A TC + A + + ++
Sbjct: 179 CSTPEGNDGCNGGLMDQAFKYIKIQGGIDTEAYYPYEAKDDTCRFNITDSGATDTGFVDI 238
Query: 250 PSGDEQALLK-AVSMQPVSIAIAAYSTEFQSYKEGIFN--GVCGTQLDHAVTIVGFGTTE 306
SGDE+ L + A ++ P+S+AI A T FQ Y G+++ T LDH V +VG+G TE
Sbjct: 239 KSGDEEMLKEAAATVGPISVAIDASHTSFQFYSNGVYSETACSSTMLDHGVLVVGYG-TE 297
Query: 307 DGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPLA 348
+G +YWL+KNSWG WG+AGY+K+ R+ + CGI T++SYPL
Sbjct: 298 NGKDYWLVKNSWGEGWGEAGYIKMSRNADNQCGIATQASYPLV 340
>gi|47086859|ref|NP_997749.1| cathepsin L, 1 a precursor [Danio rerio]
gi|42542930|gb|AAH66490.1| Cathepsin L1, a [Danio rerio]
Length = 337
Score = 239 bits (609), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 136/342 (39%), Positives = 195/342 (57%), Gaps = 18/342 (5%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
M + + C S V ++ T +Q + + ++W H + Y E+ R I+++NL+
Sbjct: 1 MRVFLAAFTLCLSAVFAA-PTLDQQLNDHWDQWKKWHSKKYH-ATEEGWRRVIWEKNLKK 58
Query: 78 IEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT 134
IE N E G TY+LG N F D+T++EFR + G+K R S F N
Sbjct: 59 IEMHNLEHSMGIHTYRLGMNHFGDMTHEEFRQVMNGFKHKKD--RRFRGSLFMEPNF--I 114
Query: 135 DVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST- 193
+VP LDWR+KG VTP+K+Q ECG CWAF+ A+EG ++G L+ LSEQ L+DCS
Sbjct: 115 EVPNKLDWREKGYVTPVKDQGECGSCWAFSTTGALEGQMFRKTGKLVSLSEQNLVDCSRP 174
Query: 194 NGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGT-CSAAQKPAAAKISNYEEVPSG 252
GN GC GG ++AF Y+ G+ +E+ YPY C K +AA + + ++PSG
Sbjct: 175 EGNEGCNGGLMDQAFQYVKDQNGLDSEESYPYLGTDDQPCHFDPKNSAANDTGFVDIPSG 234
Query: 253 DEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGI-FNGVCGT-QLDHAVTIVGF---GTTE 306
E+AL+KA+ ++ PVS+AI A FQ Y+ GI + C + +LDH V VG+ G
Sbjct: 235 KERALMKAIAAVGPVSVAIDAGHESFQFYQSGIYYEKECSSEELDHGVLAVGYGFEGEDV 294
Query: 307 DGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
DG YW++KNSW WGD GY+ + +D CGI T +SYPL
Sbjct: 295 DGKKYWIVKNSWSENWGDKGYIYMAKDRHNHCGIATAASYPL 336
>gi|226443040|ref|NP_001140018.1| Cathepsin L1 precursor [Salmo salar]
gi|221221188|gb|ACM09255.1| Cathepsin L1 precursor [Salmo salar]
Length = 338
Score = 239 bits (609), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 138/343 (40%), Positives = 194/343 (56%), Gaps = 24/343 (6%)
Query: 20 IIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIE 79
+ + +LV C S V ++ Q H W H +SY E E+ R ++++NL+ IE
Sbjct: 4 LYLAVLVLCVSAVCAAPRFDSQLEDHWH-LWKNWHSKSYH-ESEEGWRRMVWEKNLKKIE 61
Query: 80 KANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFK---YQNLSM 133
N E G +Y+LG N F D+TN+EFR GYK TT FK + +
Sbjct: 62 MHNLEHTMGKHSYRLGMNHFGDMTNEEFRQTMNGYK-------QTTERKFKGSLFMEPNY 114
Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS- 192
P ++DWR+KG VTP+K+Q CG CWAF+ A+EG ++G L+ LSEQ L+DCS
Sbjct: 115 LQAPKAVDWREKGYVTPVKDQGSCGSCWAFSTTGAMEGQQFRKTGKLVSLSEQNLVDCSR 174
Query: 193 TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAV-PGTCSAAQKPAAAKISNYEEVPS 251
GN GC GG ++AF YI N G+ TE+ YPY C + + A + + ++PS
Sbjct: 175 PEGNEGCNGGLMDQAFQYIQDNAGLDTEESYPYVGTDEDPCHYKPEFSGANETGFVDIPS 234
Query: 252 GDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGI-FNGVCGT-QLDHAVTIVGF---GTT 305
G E A++KAV ++ PVS+AI A FQ Y+ GI + C + +LDH V +VG+ G
Sbjct: 235 GKEHAMMKAVAAVGPVSVAIDAGHESFQFYESGIYYEKECSSEELDHGVLVVGYGFEGED 294
Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
DG YW++KNSW WGD GY+ + +D + CGI T SSYPL
Sbjct: 295 VDGKKYWIVKNSWSEKWGDKGYIYMAKDRKNHCGIATASSYPL 337
>gi|348546019|ref|XP_003460476.1| PREDICTED: cathepsin L-like [Oreochromis niloticus]
gi|348546143|ref|XP_003460538.1| PREDICTED: cathepsin L-like [Oreochromis niloticus]
Length = 334
Score = 238 bits (608), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 133/313 (42%), Positives = 189/313 (60%), Gaps = 12/313 (3%)
Query: 44 VEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN---KEGNRTYKLGTNQFSDLT 100
+E H W + RSY E+ R +I+ N +++ N +G ++Y+LG F+D+
Sbjct: 24 LEFH-AWKLKFERSYHSPSEEAHRRQIWLNNRKFVLVHNILADQGLKSYRLGMTYFADME 82
Query: 101 NDEF-RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGC 159
N+E+ R + G + STF ++ TD+P ++DWRDKG VT +K+QK+CG
Sbjct: 83 NEEYKRVISQGCLHSFNASLPRRGSTF-FRLPEGTDLPDAVDWRDKGYVTDVKDQKQCGS 141
Query: 160 CWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIA 218
CWAF+A ++EG ++G L+ LSEQQL+DCS + GN GC+GG + AF YI N GI
Sbjct: 142 CWAFSATGSLEGQHFRKTGTLVSLSEQQLVDCSGDYGNMGCMGGLMDYAFQYIQANGGID 201
Query: 219 TEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEF 277
TE+ YPY+A G C A + Y EV GDE AL +AV ++ P+S+ I A F
Sbjct: 202 TEESYPYEAENGKCRYNPDNIGATSTGYTEVSQGDEDALKEAVATIGPISVGIDASQMSF 261
Query: 278 QSYKEGIFN--GVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE- 334
Q Y+ G++N +LDH V VG+G TEDG +YWL+KNSWG WGD GY+K+ R++
Sbjct: 262 QFYESGVYNEPDCSSLELDHGVLAVGYG-TEDGNDYWLVKNSWGLEWGDKGYIKMSRNKS 320
Query: 335 GLCGIGTRSSYPL 347
CGI T +SYPL
Sbjct: 321 NQCGIATAASYPL 333
>gi|341940310|sp|O70370.2|CATS_MOUSE RecName: Full=Cathepsin S; Flags: Precursor
Length = 340
Score = 238 bits (608), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 128/306 (41%), Positives = 185/306 (60%), Gaps = 15/306 (4%)
Query: 50 WMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEFRA 106
W H + YKD+ E+E+R I+++NL++I N E G TY++G N D+TN+E
Sbjct: 39 WKKTHEKEYKDKNEEEVRRLIWEKNLKFIMIHNLEYSMGMHTYQVGMNDMGDMTNEEILC 98
Query: 107 LYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAV 166
++P S ++ T +++ S +P ++DWR+KG VT +K Q CG CWAF+AV
Sbjct: 99 RMGALRIPRQSPKTVT-----FRSYSNRTLPDTVDWREKGCVTEVKYQGSCGACWAFSAV 153
Query: 167 AAVEGITKIRSGNLIQLSEQQLLDCSTN---GNNGCLGGSREKAFAYIIQNQGIATEDEY 223
A+EG K+++G LI LS Q L+DCS GN GC GG +AF YII N GI + Y
Sbjct: 154 GALEGQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEAFQYIIDNGGIEADASY 213
Query: 224 PYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQ-PVSIAIAAYSTEFQSYKE 282
PY+A C K AA S Y ++P GDE AL +AV+ + PVS+ I A + F YK
Sbjct: 214 PYKATDEKCHYNSKNRAATCSRYIQLPFGDEDALKEAVATKGPVSVGIDASHSSFFFYKS 273
Query: 283 GIFNG-VCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR-DEGLCGIG 340
G+++ C ++H V +VG+GT DG +YWL+KNSWG +GD GY+++ R ++ CGI
Sbjct: 274 GVYDDPSCTGNVNHGVLVVGYGTL-DGKDYWLVKNSWGLNFGDQGYIRMARNNKNHCGIA 332
Query: 341 TRSSYP 346
+ SYP
Sbjct: 333 SYCSYP 338
>gi|296189340|ref|XP_002742739.1| PREDICTED: cathepsin L1 [Callithrix jacchus]
Length = 333
Score = 238 bits (608), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 133/343 (38%), Positives = 199/343 (58%), Gaps = 25/343 (7%)
Query: 16 TPMFIIITLLVSCASQVVS-SRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKEN 74
P I+ + AS ++ RS Q + KW A H R Y E+E R ++++N
Sbjct: 2 NPTLILTAFCLGLASSALTFDRSLEAQWI-----KWKAMHNRLYGMN-EEEWRRAVWEKN 55
Query: 75 LEYIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNL 131
++ IE N E G ++ + N F D+TN+EFR + G++ P + +Q
Sbjct: 56 MKMIELHNHEYNQGKHSFTMAMNAFGDMTNEEFRQVMNGFQNRKPRNGKV------FQEP 109
Query: 132 SMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDC 191
+ P S+DWR+KG VTP+KNQ +CG CWAF+A A+EG ++G L+ LSEQ L+DC
Sbjct: 110 LFHEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDC 169
Query: 192 S-TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVP 250
S GN GC GG + AF Y+ +N G+ +E+ YPY+A +C + + A + + ++P
Sbjct: 170 SGPQGNQGCDGGLMDYAFQYVQENGGLDSEESYPYEATEESCKYNPEYSVANDTGFVDIP 229
Query: 251 SGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGI-FNGVCGTQ-LDHAVTIVGFG---T 304
E+AL+KAV ++ P+S+AI A FQ YKEGI F C ++ +DH V +VG+G T
Sbjct: 230 K-LEKALMKAVATVGPISVAIDAGHESFQFYKEGIYFEPECSSEDMDHGVLVVGYGFERT 288
Query: 305 TEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYP 346
D + YWL+KNSWG WG GY+K+ +D + CGI + +SYP
Sbjct: 289 GSDNSKYWLVKNSWGEKWGMDGYIKMAKDRKNHCGIASAASYP 331
>gi|334332718|ref|XP_001367502.2| PREDICTED: cathepsin L1-like [Monodelphis domestica]
Length = 333
Score = 238 bits (608), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 124/307 (40%), Positives = 193/307 (62%), Gaps = 13/307 (4%)
Query: 48 EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEF 104
+W AQHG+SY+ E +R +++NL+ IE+ N+E G +++L N+F D++ +EF
Sbjct: 30 HQWKAQHGKSYEAN-EDSLRRATWEKNLKMIERHNQEYSAGKHSFQLRMNKFGDMSTEEF 88
Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFA 164
+ + GYK + S R T S ++ L+ +P S+DWR+KG VTP+K Q +CG CW+F+
Sbjct: 89 KQVMNGYK-SNGSQRRTKGSLYRESLLAQ--LPESVDWREKGYVTPVKEQGDCGACWSFS 145
Query: 165 AVAAVEGITKIRSGNLIQLSEQQLLDCST-NGNNGCLGGSREKAFAYIIQNQGIATEDEY 223
AV A+EG ++G L+ LS Q L+DC+ GNNGC GG + AF Y+ N GI TE+ Y
Sbjct: 146 AVGAIEGQWFRKTGKLVSLSIQNLIDCTIPEGNNGCDGGFMDNAFQYVQDNGGIDTEECY 205
Query: 224 PYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKE 282
PY A C + + A I+ + ++PS DE+AL++AV ++ P+S+ I + + F+ Y+
Sbjct: 206 PYVAQDTECKYKPECSGANITGFVDIPSMDERALMEAVATVGPISVGIDSANPSFKFYQS 265
Query: 283 GIF--NGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGI 339
G++ +QLDH V +VG+G+ YW++KNSWG WGD GY+ + +D + CGI
Sbjct: 266 GVYYEPDCSSSQLDHGVLVVGYGSIGKD-EYWIVKNSWGEAWGDNGYILMAKDKDNHCGI 324
Query: 340 GTRSSYP 346
T +SYP
Sbjct: 325 ATEASYP 331
>gi|118119|sp|P13277.2|CYSP1_HOMAM RecName: Full=Digestive cysteine proteinase 1; Flags: Precursor
gi|11051|emb|CAA45127.1| cysteine proteinase preproenzyme [Homarus americanus]
gi|228243|prf||1801240A Cys protease 1
Length = 322
Score = 238 bits (608), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 129/309 (41%), Positives = 181/309 (58%), Gaps = 19/309 (6%)
Query: 48 EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEF 104
E++ + GR Y D E+ RL +F +NL+YIE+ NK+ G TY L NQFSD+TN++F
Sbjct: 21 EEFKGKFGRKYVDLEEERYRLNVFLDNLQYIEEFNKKYERGEVTYNLAINQFSDMTNEKF 80
Query: 105 RALYTGYKM-PSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAF 163
A+ GYK P P+ T++ T +DWR KGAVTP+K+Q +CG CWAF
Sbjct: 81 NAVMKGYKKGPRPAAVFTSTDA--------APESTEVDWRTKGAVTPVKDQGQCGSCWAF 132
Query: 164 AAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG--NNGCLGGSREKAFAYIIQNQGIATED 221
+ +EG +++G L+ LSEQQL+DC+ N GC GG E+A Y+ N G+ TE
Sbjct: 133 STTGGIEGQHFLKTGRLVSLSEQQLVDCAGGSYYNQGCNGGWVERAIMYVRDNGGVDTES 192
Query: 222 EYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSY 280
YPY+A TC A + Y + G E AL A + P+S+AI A FQSY
Sbjct: 193 SYPYEARDNTCRFNSNTIGATCTGYVGIAQGSESALKTATRDIGPISVAIDASHRSFQSY 252
Query: 281 KEGIF--NGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE-GLC 337
G++ +QLDHAV VG+G +E G ++WL+KNSW +WG++GY+K+ R+ C
Sbjct: 253 YTGVYYEPSCSSSQLDHAVLAVGYG-SEGGQDFWLVKNSWATSWGESGYIKMARNRNNNC 311
Query: 338 GIGTRSSYP 346
GI T + YP
Sbjct: 312 GIATDACYP 320
>gi|449275508|gb|EMC84350.1| Cathepsin L1, partial [Columba livia]
Length = 319
Score = 238 bits (607), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 133/310 (42%), Positives = 186/310 (60%), Gaps = 16/310 (5%)
Query: 50 WMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEFRA 106
W + H + Y E E+ R ++++NL+ IE N + G +YKLG NQF D+T +EFR
Sbjct: 13 WKSWHNKDYH-EREESWRRVVWEKNLKMIELHNLDHTLGKHSYKLGMNQFGDMTTEEFRQ 71
Query: 107 LYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAV 166
L GY S R S F S + P S+DWR+KG VTP+K+Q +CG CWAF+
Sbjct: 72 LMNGYAHKK-SERKYRGSQF--LEPSFLEAPRSVDWREKGYVTPVKDQGQCGSCWAFSTT 128
Query: 167 AAVEGITKIRSGNLIQLSEQQLLDCST-NGNNGCLGGSREKAFAYIIQNQGIATEDEYPY 225
A+EG ++G L+ LSEQ L+DCS GN GC GG ++AF Y+ N GI +E+ YPY
Sbjct: 129 GALEGQHFRKTGKLVSLSEQNLVDCSRPEGNQGCNGGLMDQAFQYVQDNGGIDSEESYPY 188
Query: 226 QAVPG-TCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEG 283
A C + AA + + ++P G E+AL+KAV ++ PVS+AI A + FQ Y+ G
Sbjct: 189 TAKDDEDCRYKAEYNAANDTGFVDIPQGHERALMKAVAAVGPVSVAIDAGHSSFQFYQSG 248
Query: 284 I-FNGVCGTQ-LDHAVTIVGF---GTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLC 337
I + C ++ LDH V +VG+ G DG YW++KNSWG WGD GY+ + +D + C
Sbjct: 249 IYYEPDCSSEDLDHGVLVVGYGFEGEDVDGKKYWIVKNSWGEKWGDKGYIYMAKDRKNHC 308
Query: 338 GIGTRSSYPL 347
GI T +SYPL
Sbjct: 309 GIATAASYPL 318
>gi|185135439|ref|NP_001117777.1| procathepsin L precursor [Oncorhynchus mykiss]
gi|14582899|gb|AAK69706.1|AF358668_1 procathepsin L [Oncorhynchus mykiss]
Length = 338
Score = 238 bits (607), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 138/343 (40%), Positives = 194/343 (56%), Gaps = 24/343 (6%)
Query: 20 IIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIE 79
+ + +LV C S V ++ Q H W H + Y E E+ R ++++NL+ IE
Sbjct: 4 LYLAVLVLCVSAVCAAPRFDSQLEDHWH-LWKNWHSKHYH-ESEEGWRRMVWEKNLKKIE 61
Query: 80 KANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFK---YQNLSM 133
N E G +Y+LG N F D+TN+EFR GYK TT FK + +
Sbjct: 62 IHNLEHTMGKHSYRLGMNHFGDMTNEEFRQTMNGYK-------QTTERKFKGSLFMEPNY 114
Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS- 192
P ++DWR+KG VTP+K+Q CG CWAF+ A+EG ++G L+ LSEQ L+DCS
Sbjct: 115 LQAPKAVDWREKGYVTPVKDQGSCGSCWAFSTTGAMEGQQFRKTGKLVSLSEQNLVDCSR 174
Query: 193 TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAV-PGTCSAAQKPAAAKISNYEEVPS 251
GN GC GG ++AF YI N G+ TE+ YPY C + +AA + + ++PS
Sbjct: 175 PEGNEGCNGGLMDQAFQYIQDNAGLDTEESYPYVGTDEDPCHYKPEFSAANETGFVDIPS 234
Query: 252 GDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGI-FNGVCGT-QLDHAVTIVGF---GTT 305
G E A++KAV ++ PVS+AI A FQ Y+ GI + C + +LDH V +VG+ G
Sbjct: 235 GKEHAMMKAVAAVGPVSVAIDAGHESFQFYESGIYYEKECSSEELDHGVLVVGYGFEGED 294
Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
DG YW++KNSW WGD GY+ + +D + CGI T SSYPL
Sbjct: 295 VDGKKYWIVKNSWSEKWGDKGYIYMAKDRKNHCGIATASSYPL 337
>gi|118429523|gb|ABK91809.1| cathepsin L-like proteinase precursor [Clonorchis sinensis]
Length = 373
Score = 238 bits (607), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 142/342 (41%), Positives = 196/342 (57%), Gaps = 31/342 (9%)
Query: 29 ASQVVSSRSTHEQSVV---------EIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIE 79
AS + S S H Q V+ I + +M + R+Y D E E R KIF N I
Sbjct: 39 ASPLTSLDSMHMQDVIGVDWNFTLSSIWKHFMTTYKRNYIDPSEHERRFKIFANNFVRIS 98
Query: 80 KANK---EGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDV 136
K N +G +Y +G N+FSD T++E + L ++ + R + KY ++
Sbjct: 99 KHNVRFIQGQVSYTMGINEFSDKTDEELKRLRC-FRGSLNASRDGS----KYITIAAPP- 152
Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-G 195
P+ +DWR+KGAVTP+KNQ CG CWAF+A A+EG + +GNL+ LSEQQL+DCS+ G
Sbjct: 153 PSEIDWRNKGAVTPVKNQGNCGSCWAFSATGAIEGQNFLATGNLVSLSEQQLVDCSSEYG 212
Query: 196 NNGCLGGSREKAFAYIIQNQGIATEDEYPY------QAVPGTCSAAQKPAAAKISNYEEV 249
NN C GG + AF Y+ + GI TE YPY A P TC K A +++ Y ++
Sbjct: 213 NNACNGGLMDNAFKYVKDSNGIDTEASYPYVSGETGDANP-TCRFNLKEAVVRVTGYIDL 271
Query: 250 PSGDEQALLKAVSMQ-PVSIAIAAYSTEFQSYKEGIF-NGVCGT-QLDHAVTIVGFGTTE 306
P G L +AV P+S+AI A F SYK G++ + C + LDH V +VG+G E
Sbjct: 272 PRGQVSELKQAVGHYGPISVAINAGLPSFMSYKSGVYSDDQCSSDDLDHGVLLVGYG-EE 330
Query: 307 DGANYWLIKNSWGNTWGDAGYMKIVRDE-GLCGIGTRSSYPL 347
+G YWLIKNSWG WG+ GY+KI+RD LCG+ + +SYPL
Sbjct: 331 NGIPYWLIKNSWGPHWGENGYVKILRDHNNLCGVASMASYPL 372
>gi|28194647|gb|AAO33585.1|AF479267_1 cathepsin L [Mesocricetus auratus]
Length = 333
Score = 238 bits (607), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 131/310 (42%), Positives = 187/310 (60%), Gaps = 19/310 (6%)
Query: 48 EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN---KEGNRTYKLGTNQFSDLTNDEF 104
KW + H R Y D E+E R ++++N++ IE N EG + + N F D+TN+EF
Sbjct: 30 HKWKSTHRRLY-DTNEEEWRRAVWEKNMKMIELHNGEYSEGKHGFTMEMNAFGDMTNEEF 88
Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFA 164
R L GYK HR +Q M +P S+DWR+KG VTP+KNQ +CG CWAF+
Sbjct: 89 RQLVNGYK--HQKHRKGKL----FQEPLMLQLPKSVDWREKGCVTPVKNQGQCGSCWAFS 142
Query: 165 AVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEY 223
A A+EG +++G L+ LSEQ L+DCS GN GC GG + AF Y++ N+G+ +E+ Y
Sbjct: 143 ACGALEGQMCLKTGVLVSLSEQNLVDCSRGEGNQGCNGGLMDFAFQYVLNNKGLDSEESY 202
Query: 224 PYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKE 282
PY+A GTC + AAA + Y ++P E+AL+KAV ++ P+++AI A FQ Y
Sbjct: 203 PYEAKDGTCKYKPEFAAANDTGYVDIPQ-LEKALMKAVATVGPIAVAIDASHPSFQFYSS 261
Query: 283 GI-FNGVCGTQ-LDHAVTIVGF---GTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE-GL 336
GI F C ++ LDH V ++G+ GT + YW++KNSWG WG G+ I +D+
Sbjct: 262 GIYFEPNCSSKDLDHGVLVIGYGFEGTDSNKKKYWIVKNSWGTGWGMGGFFHIAKDKNNH 321
Query: 337 CGIGTRSSYP 346
CGI T +SYP
Sbjct: 322 CGIATAASYP 331
>gi|410519429|gb|AFV73398.1| cathepsin L [Haliotis discus hannai]
Length = 326
Score = 238 bits (607), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 126/302 (41%), Positives = 185/302 (61%), Gaps = 13/302 (4%)
Query: 53 QHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNR---TYKLGTNQFSDLTNDEFRALYT 109
+H + YKD E+ R +F + +EYI++ N E +R ++++G N+++D+ N+EF +
Sbjct: 28 RHNKQYKDNQEEAYRKGVFMKAVEYIQQHNLEADRGVHSFRVGINEYADMPNEEFVRVMN 87
Query: 110 GYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAV 169
GYKM R + N+ D+P ++DWR KG VT +KNQ +CG CWAF++ ++
Sbjct: 88 GYKMQE--QRPKAPTYMPPSNVG--DLPATVDWRTKGYVTEVKNQGQCGSCWAFSSTGSL 143
Query: 170 EGITKIRSGNLIQLSEQQLLDCST-NGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAV 228
EG T + LI LSEQ L+DCST GN GC GG ++AF YI N GI TE YPY+A
Sbjct: 144 EGQTFKKYNKLISLSEQNLVDCSTEQGNMGCGGGLMDQAFTYIKVNDGIDTETSYPYEAA 203
Query: 229 PGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGIFNG 287
G C + A + Y ++ S E L AV ++ P+++AI A FQ YK G+++
Sbjct: 204 SGKCRFNKANVGANDTGYTDIKSKSESDLQSAVATVGPIAVAIDASHMSFQLYKSGVYHY 263
Query: 288 V-CG-TQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSS 344
+ C T+LDH V VG+G T+ G +YWL+KNSWG TWG GY+ + R+ + CGI T++S
Sbjct: 264 IFCSQTRLDHGVLAVGYG-TDSGKDYWLVKNSWGATWGQQGYIMMSRNRDNNCGIATQAS 322
Query: 345 YP 346
YP
Sbjct: 323 YP 324
>gi|225706370|gb|ACO09031.1| Cathepsin L precursor [Osmerus mordax]
Length = 337
Score = 238 bits (607), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 134/342 (39%), Positives = 197/342 (57%), Gaps = 18/342 (5%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
M + + +LV C +++ Q E + W + H ++Y+ E E+ R ++++NL+
Sbjct: 1 MTLYLVVLVLCTGAALAAPRFDAQ-FDEHWDLWKSWHSKNYQHEKEEGWRRMVWEKNLKK 59
Query: 78 IEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT 134
IE N E G +Y LG N F D+TN+EFR + GYK+ R S F N
Sbjct: 60 IEMHNLEHSLGKHSYSLGMNHFGDMTNEEFRQVMNGYKL---QQRKFKGSLFLEPN--NM 114
Query: 135 DVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST- 193
+ P +DWR++G VTP+K+Q +CG CWAF+ A+EG ++ L+ LSEQ L+DCS
Sbjct: 115 EAPKQVDWREEGYVTPVKDQGQCGSCWAFSTTGAMEGQMFRKTQKLVSLSEQNLVDCSRP 174
Query: 194 NGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGT-CSAAQKPAAAKISNYEEVPSG 252
GN GC GG ++AF YI N G+ +E+ YPY C+ + +AA + + ++PSG
Sbjct: 175 EGNEGCNGGLMDQAFQYIQDNSGLDSEEAYPYLGTDDQPCNYKAEFSAANDTGFMDIPSG 234
Query: 253 DEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGI-FNGVCGT-QLDHAVTIVGF---GTTE 306
E AL+KA+ S+ PVS+AI A FQ Y+ GI + C + +LDH V VG+ G
Sbjct: 235 KEHALMKAIASVGPVSVAIDAGHESFQFYQSGIYYEKECSSEELDHGVLAVGYGFEGEDV 294
Query: 307 DGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
DG YW++KNSW WGD GY+ + +D + CGI T +SYPL
Sbjct: 295 DGKKYWIVKNSWSEKWGDKGYILMAKDRKNHCGIATAASYPL 336
>gi|449513868|ref|XP_002191976.2| PREDICTED: cathepsin L1-like [Taeniopygia guttata]
Length = 443
Score = 238 bits (607), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 133/311 (42%), Positives = 187/311 (60%), Gaps = 16/311 (5%)
Query: 50 WMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEFRA 106
W + H + Y E E+ R ++++NL+ IE N + G +YKLG NQF D+T +EFR
Sbjct: 137 WKSWHRKDYH-EREEGWRRVVWEKNLKMIEIHNLDHALGKHSYKLGMNQFGDMTTEEFRQ 195
Query: 107 LYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAV 166
L GY + S R S F N + P S+DWR+KG VTP+K+Q +CG CWAF+
Sbjct: 196 LMNGY-VHKKSERKYRGSQFLEPNF--LEAPRSVDWREKGYVTPVKDQGQCGSCWAFSTT 252
Query: 167 AAVEGITKIRSGNLIQLSEQQLLDCST-NGNNGCLGGSREKAFAYIIQNQGIATEDEYPY 225
A+EG ++G L+ LSEQ L+DCS GN GC GG ++AF Y+ N GI +E+ YPY
Sbjct: 253 GALEGQHFRKTGKLVSLSEQNLVDCSRPEGNQGCNGGLMDQAFQYVQDNGGIDSEESYPY 312
Query: 226 QAVPG-TCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEG 283
A C + AA + + ++P G E+AL+KAV ++ PVS+AI A + FQ Y+ G
Sbjct: 313 TAKDDEDCRYKAEYNAANDTGFVDIPQGHERALMKAVAAVGPVSVAIDAGHSSFQFYQSG 372
Query: 284 I-FNGVCGTQ-LDHAVTIVGF---GTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLC 337
I + C ++ LDH V +VG+ G DG YW++KNSWG WGD GY+ + +D + C
Sbjct: 373 IYYEPDCSSEDLDHGVLVVGYGFEGEDVDGKKYWIVKNSWGEKWGDKGYIYMAKDRKNHC 432
Query: 338 GIGTRSSYPLA 348
GI T +SYPL
Sbjct: 433 GIATAASYPLV 443
>gi|72005575|ref|XP_783218.1| PREDICTED: cathepsin L2-like isoform 2 [Strongylocentrotus
purpuratus]
gi|390337647|ref|XP_003724610.1| PREDICTED: cathepsin L2-like isoform 1 [Strongylocentrotus
purpuratus]
Length = 334
Score = 238 bits (607), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 138/338 (40%), Positives = 195/338 (57%), Gaps = 17/338 (5%)
Query: 19 FIIITLLVSCA-SQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
FII+ L V+ A + + SR E+ ++W+ HG+ Y E+ R I+++NL
Sbjct: 4 FIIVLLSVAGALATRLPSRDFDEE-----WKEWVDYHGKEYSAMGEEMERRMIWEDNLRI 58
Query: 78 IEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT 134
I K N E G TY+LG N+F D+TN EF A T KM S+ + L +
Sbjct: 59 ITKHNLEHSQGKTTYRLGMNEFGDMTNAEFVATRTMKKMSGVPKVGQGSTFLPSEFLQL- 117
Query: 135 DVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-T 193
P S+DWR +G VTP+K+Q +CG CWAF+ V A+EG +++G L+ LSEQ L+DCS
Sbjct: 118 --PDSVDWRTEGYVTPVKDQGQCGSCWAFSTVGALEGQHFVKTGTLVSLSEQNLVDCSQA 175
Query: 194 NGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGD 253
GN+GC GG A YI N GI TE YPY+ V +C A I+ + EV +
Sbjct: 176 EGNDGCNGGWPAWADEYIKSNGGIDTEVGYPYEGVDDSCHYRTSDVGATITGFAEVEADS 235
Query: 254 EQALLKAVS-MQPVSIAIAAYSTEFQSYKEGIFN--GVCGTQLDHAVTIVGFGTTEDGAN 310
E+AL KA++ + P+S+ I A FQ Y+ G+++ T LDH VT VG+ +T DG
Sbjct: 236 EKALEKALAQVGPISVCIDATQPSFQLYESGVYDEPDCSSTALDHCVTAVGYDSTADGDK 295
Query: 311 YWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
Y+++KNSWG TWG GY+ + RD + CGI T ++YPL
Sbjct: 296 YYIVKNSWGTTWGQEGYIWMSRDKQKQCGIATNATYPL 333
>gi|334324655|ref|XP_001370975.2| PREDICTED: cathepsin S-like isoform 1 [Monodelphis domestica]
Length = 331
Score = 238 bits (607), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 128/317 (40%), Positives = 191/317 (60%), Gaps = 15/317 (4%)
Query: 39 HEQSVVEIH-EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTN 94
H +++ H + W HG+ YK + E+ R I+++NL+Y+ N E G +Y L N
Sbjct: 19 HRDPMLDGHWDLWKKTHGKQYKGQNEEIARRLIWEKNLKYVTLHNLEHSMGLHSYDLSMN 78
Query: 95 QFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQ 154
D+T++E +L + ++P+ +R+TT Y+ S +P S+DWR+KG VT +K Q
Sbjct: 79 HLGDMTSEEVISLMSSLRIPNQWNRNTT-----YRLSSNQKLPDSVDWREKGCVTEVKYQ 133
Query: 155 KECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN--GNNGCLGGSREKAFAYII 212
CG CWAF+AV A+E K+++G L+ LS Q L+DCST+ N+GC GG AF Y+I
Sbjct: 134 GSCGSCWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTDKYDNHGCNGGFMTSAFQYVI 193
Query: 213 QNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQ-PVSIAIA 271
N GI ++ YPY+A G C AA S Y E+P G E+AL +AV+ + PVS+ I
Sbjct: 194 DNNGIDSDVSYPYKATDGKCQYNPASRAATCSKYTELPYGSEEALKEAVANKGPVSVGID 253
Query: 272 AYSTEFQSYKEGI-FNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKI 330
A + F YK G+ ++ C +++H V ++G+G DG +YWL+KNSWG +GD GY++I
Sbjct: 254 AKTPSFFLYKSGVYYDPSCTQKVNHGVLVIGYGNL-DGQDYWLVKNSWGLHFGDKGYVRI 312
Query: 331 VRDEG-LCGIGTRSSYP 346
R+ G CGI SYP
Sbjct: 313 ARNRGNHCGIANFPSYP 329
>gi|410898132|ref|XP_003962552.1| PREDICTED: cathepsin L-like [Takifugu rubripes]
Length = 335
Score = 238 bits (607), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 129/312 (41%), Positives = 195/312 (62%), Gaps = 10/312 (3%)
Query: 44 VEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN---KEGNRTYKLGTNQFSDLT 100
+E H W + GRSY+ E+ R++I+ N + + N +G ++Y+LG QF+D+
Sbjct: 25 MEFH-AWKLKFGRSYRTPSEEVQRMQIWLNNRKLVLVHNILADQGIKSYRLGMTQFADMD 83
Query: 101 NDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCC 160
N+E+++L + + + + + + ++ T +PT++DWRDKG VT +K+QK+CG C
Sbjct: 84 NEEYKSLISLGCLRAFNTSAPRRGSAFFRLAEGTHLPTTVDWRDKGYVTGVKDQKQCGSC 143
Query: 161 WAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIAT 219
WAF+A ++EG ++G L+ LSEQQL+DCS + GN GC GG + AF YI +N GI T
Sbjct: 144 WAFSATGSLEGQNFRKTGKLVSLSEQQLVDCSGDYGNMGCNGGLMDYAFKYIQENGGIDT 203
Query: 220 EDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQ 278
E YPY+A G C + AK + Y +V GDE AL +AV ++ PVS+ I A + FQ
Sbjct: 204 EKSYPYEAEDGQCRFKPENVGAKCTGYVDVTVGDEDALKEAVATIGPVSVGIDASHSSFQ 263
Query: 279 SYKEGIFNGV-CGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-EG 335
Y G+++ C +Q LDH V VG+G T++G +YWL+KNSWG WG GY+ + R+ +
Sbjct: 264 LYDSGVYDEQDCSSQDLDHGVLAVGYG-TDNGQDYWLVKNSWGLGWGQEGYIMMSRNKDN 322
Query: 336 LCGIGTRSSYPL 347
CGI T +SYPL
Sbjct: 323 QCGIATAASYPL 334
>gi|194352766|emb|CAQ00111.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
Length = 384
Score = 238 bits (607), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 134/331 (40%), Positives = 185/331 (55%), Gaps = 35/331 (10%)
Query: 50 WMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYT 109
WMA HGRSY EK R ++++ N+E+IE AN++ +Y LG F+DLT+DEF A+Y+
Sbjct: 55 WMAAHGRSYPTVEEKLRRFEVYRSNMEFIEAANRDSRMSYSLGETPFTDLTHDEFMAMYS 114
Query: 110 GYKMPS-------------PSHRSTTS--STFKYQNLSMTDV-PTSLDWRDKGAVTPIKN 153
S P H T + + NL++T V P S+DWR KG VTP KN
Sbjct: 115 SNDDSSEWEEATVITTRAGPVHEGTAAVEEPPRRTNLNVTAVLPPSVDWRAKGVVTPAKN 174
Query: 154 Q-KECGCCWAFAAVAAVEGITKIRSGNLIQ-LSEQQLLDCSTNGNNGCLGGSREKAFAYI 211
Q C CWAF +VA +E I +G LSEQQL+DCST ++GC G + AF ++
Sbjct: 175 QGATCFSCWAFTSVATMESAQAISTGGSPPVLSEQQLVDCSTL-HHGCGRGWMDDAFKWV 233
Query: 212 IQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEV-PSGDEQALLKAVSMQPVSIAI 270
I N GI TE YPY G C KP A ++ +Y++V P G+E L +AV+ QPV+++
Sbjct: 234 IMNGGITTEAAYPYTGKAGNCQTG-KPVAVRLRSYKKVTPPGNEAGLKEAVAQQPVAVSF 292
Query: 271 AAYSTEFQSYKEGIFN-----------GVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWG 319
FQ Y G++N G C T +HA+ +VG+GT DG YW+ KNSW
Sbjct: 293 DYSDPCFQHYIGGVYNAGCSRSGVYIKGACKTAQNHAMALVGYGTKPDGTKYWIGKNSWT 352
Query: 320 NTWGDAGYMKIVRDE---GLCGIGTRSSYPL 347
WGD G++ ++RD GLCG+ YP+
Sbjct: 353 AKWGDKGFIYLLRDSPPLGLCGLAKLPVYPI 383
>gi|325185016|emb|CCA19507.1| cysteine protease family C01A putative [Albugo laibachii Nc14]
Length = 492
Score = 238 bits (606), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 124/305 (40%), Positives = 171/305 (56%), Gaps = 32/305 (10%)
Query: 50 WMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYT 109
W+ H ++ D E RL+ + N YI N + ++KLG N FS LTN+EFR +
Sbjct: 36 WLKTHHLTFSDAFEYAKRLETYIANDIYILTHNLQ-ESSFKLGHNAFSHLTNEEFRQRFN 94
Query: 110 GYKMPSP--SHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVA 167
G+K + R S+ N D+P S+DW +KGAVT +KNQ CG CWAF+
Sbjct: 95 GFKASDDYLTKRLAQSNVASSTNFQYIDLPESVDWVEKGAVTGVKNQGMCGSCWAFSTTG 154
Query: 168 AVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQA 227
A+EG T I SG L+ LSEQ+L+DC NG++GC GG + AF++I ++ GI +E++Y Y
Sbjct: 155 AIEGATFISSGKLVSLSEQELVDCDHNGDHGCNGGLMDHAFSWISEHDGICSEEDYAYIH 214
Query: 228 VPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNG 287
C + KP + PV++AI A FQ Y+ G++N
Sbjct: 215 SQSLCRSC-KPV-----------------------VSPVAVAIDAGDRSFQFYQSGVYNK 250
Query: 288 VCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE----GLCGIGTRS 343
CGTQLDH V VG+G EDG YW +KNSWGN+WG+ GY+++ RD+ G CGI
Sbjct: 251 TCGTQLDHGVLTVGYG-VEDGQKYWKVKNSWGNSWGEKGYIRLSRDQNGRSGQCGIAMVP 309
Query: 344 SYPLA 348
SYP A
Sbjct: 310 SYPTA 314
>gi|223646726|gb|ACN10121.1| Cathepsin L1 precursor [Salmo salar]
gi|223672581|gb|ACN12472.1| Cathepsin L1 precursor [Salmo salar]
Length = 338
Score = 238 bits (606), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 138/343 (40%), Positives = 194/343 (56%), Gaps = 24/343 (6%)
Query: 20 IIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIE 79
+ + +LV C S V ++ Q H W H +SY E E+ R ++++NL+ IE
Sbjct: 4 LYLAVLVLCVSAVCAAPRFDSQLEDHWH-LWKNWHSKSYH-ESEEGWRRMVWEKNLKKIE 61
Query: 80 KANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFK---YQNLSM 133
N E G +Y+LG N F D+TN+EFR GYK TT FK + +
Sbjct: 62 MHNLEHTMGKHSYRLGMNHFGDMTNEEFRQTMNGYK-------QTTERKFKGSLFMEPNY 114
Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS- 192
P ++DWR+KG VTP+K+Q CG CWAF+ A+EG ++G L+ LSEQ L+DCS
Sbjct: 115 LQAPKAVDWREKGYVTPVKDQGSCGSCWAFSTTGAMEGQQFRKTGKLVSLSEQNLVDCSR 174
Query: 193 TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAV-PGTCSAAQKPAAAKISNYEEVPS 251
GN GC GG ++AF YI N G+ TE+ YPY C + + A + + ++PS
Sbjct: 175 PEGNEGCNGGLMDQAFQYIQDNAGLDTEESYPYVGTDEDPCHYKPEFSGANETGFVDIPS 234
Query: 252 GDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGI-FNGVCGT-QLDHAVTIVGF---GTT 305
G E A++KAV ++ PVS+AI A FQ Y+ GI + C + +LDH V +VG+ G
Sbjct: 235 GKEHAMMKAVAAVGPVSVAIDAGHESFQFYEFGIYYEKECSSEELDHGVLVVGYGFEGED 294
Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
DG YW++KNSW WGD GY+ + +D + CGI T SSYPL
Sbjct: 295 VDGKKYWIVKNSWSEKWGDKGYIYMAKDRKNHCGIATASSYPL 337
>gi|219884655|gb|ACL52702.1| unknown [Zea mays]
gi|413916718|gb|AFW56650.1| thiol protease SEN102 [Zea mays]
Length = 349
Score = 238 bits (606), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 135/326 (41%), Positives = 194/326 (59%), Gaps = 24/326 (7%)
Query: 43 VVEIH-------EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQ 95
+V IH + W A++ R+Y E + R ++ EN+++IE N+ G+ +Y+LG NQ
Sbjct: 26 IVPIHIPLLDRFQAWQAEYNRTYATPEEFQQRFMVYSENVKFIETMNQPGS-SYELGENQ 84
Query: 96 FSDLTNDEFRALYTGYKM----PSPSHRSTTSSTFKYQNLS----MTDVPTSLDWRDKGA 147
F+DLT +EF+ Y K+ SP + T T S + P S+DWR KGA
Sbjct: 85 FADLTEEEFKDTYL-MKLDNVASSPEAMALTVDTMNRAGTSGGSNTNEAPNSVDWRTKGA 143
Query: 148 VTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-TNGNNGCLGGSREK 206
VTP+K+Q+ CG CWAFAAVA++EG+ KI++G L+ LSEQ+++DC N+GC GG
Sbjct: 144 VTPVKSQQHCGSCWAFAAVASIEGVHKIKTGRLVSLSEQEIVDCDRGGNNHGCHGGHSSS 203
Query: 207 AFAYIIQNQGIATEDEYPYQAVPGTC-SAAQKPAAAKISNYEEVPSGDEQALLKAVSMQP 265
A ++ +N G+ TE +YPY G C S AAKI + V +E AL AV+ +P
Sbjct: 204 AMEWVTRNGGLTTESDYPYVGRQGQCMSDKLGHHAAKIRGRQAVQGKNEGALQHAVAGRP 263
Query: 266 VSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDA 325
V+++I A S FQ YK GIF+G C T +HAVT+VG+G G YW++KNSWG WG+
Sbjct: 264 VAVSINA-SRAFQFYKRGIFSGPCNTTRNHAVTVVGYGANASGHKYWIVKNSWGERWGEK 322
Query: 326 GYMKIVRD----EGLCGIGTRSSYPL 347
GY+++ R EG+CGI Y +
Sbjct: 323 GYVRMQRGVRAREGVCGIAIAPFYAV 348
>gi|431896621|gb|ELK06033.1| Cathepsin S [Pteropus alecto]
Length = 331
Score = 238 bits (606), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 132/337 (39%), Positives = 197/337 (58%), Gaps = 16/337 (4%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
M + +L+ C++ V ++ + ++ + W + + Y++++E+ R I+++NL++
Sbjct: 1 MKWLACVLLGCSAAV--AQLQRDPTLDRHWDLWKKTYSKHYREKIEEVARRLIWEKNLKF 58
Query: 78 IEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT 134
+ N E G +Y LG N D+T++E +L +PS R+ T + Q L
Sbjct: 59 VMLHNLEHSMGMHSYDLGMNHLGDMTSEEVISLMGSLTVPSQWQRNVTYKSNPNQKL--- 115
Query: 135 DVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN 194
P SLDWRDKG VT +K Q CG CWAF+AV A+E K+++G L+ LS Q L+DCST
Sbjct: 116 --PDSLDWRDKGCVTEVKYQGSCGSCWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTE 173
Query: 195 --GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSG 252
N GC GG AF YII N GI +E YPY+A G C K AA S Y E+P G
Sbjct: 174 KYSNKGCNGGFMTSAFQYIIDNNGIDSEASYPYKAQDGKCQYDSKFRAATCSKYTELPFG 233
Query: 253 DEQALLKAVSMQ-PVSIAIAAYSTEFQSYKEGI-FNGVCGTQLDHAVTIVGFGTTEDGAN 310
E+AL +AV+ + PVS+AI A F Y+ G+ ++ C +++H V +VG+G DG +
Sbjct: 234 SEEALKEAVANKGPVSVAIDASHPSFFLYRSGVYYDQSCTLKVNHGVLVVGYGNL-DGKD 292
Query: 311 YWLIKNSWGNTWGDAGYMKIVRDEG-LCGIGTRSSYP 346
YWL+KNSWG +GD GY+++ R+ G CGI + SYP
Sbjct: 293 YWLVKNSWGLNFGDKGYIRMARNSGNHCGIASYPSYP 329
>gi|405971603|gb|EKC36430.1| Cathepsin L [Crassostrea gigas]
Length = 360
Score = 238 bits (606), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 137/303 (45%), Positives = 186/303 (61%), Gaps = 15/303 (4%)
Query: 54 HGRSYKDELEKE-MRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYT 109
H ++Y D LE+E R +IF+EN++ IE+ NK G ++Y LG NQFSDL ++EF Y
Sbjct: 63 HDKTY-DALEEESRRFEIFRENVQKIEEHNKLYHLGKKSYYLGVNQFSDLKHEEF-VKYN 120
Query: 110 GYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAV 169
G K S SS NL P S+DWR KG VT +KNQ +CG CW+F+ ++
Sbjct: 121 GLKKTSLK-DGGCSSYLAANNLVE---PDSVDWRKKGYVTDVKNQGQCGSCWSFSTTGSL 176
Query: 170 EGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAV 228
EG +SG L+ LSE QL+DCS + GN GC GG + AF YI G+ +E++YPY+
Sbjct: 177 EGQHFRKSGKLVSLSESQLVDCSQSFGNEGCNGGLMDNAFKYIKSVGGLESEEDYPYKPK 236
Query: 229 PGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVS-MQPVSIAIAAYSTEFQSYKEGIFN- 286
GTC AA + +V SG E AL KAVS + PVS+AI A + FQSY G+++
Sbjct: 237 QGTCKFDDTKVAATDTGCVDVESGSESALKKAVSEVGPVSVAIDASHSSFQSYAGGVYDE 296
Query: 287 -GVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSS 344
QLDH V VG+GT + G +YW++KNSWG WG+ GY+K+ R+ + CGI T++S
Sbjct: 297 PECSSEQLDHGVLCVGYGTDDQGQDYWIVKNSWGAEWGEDGYVKMSRNKKNQCGIATQAS 356
Query: 345 YPL 347
YPL
Sbjct: 357 YPL 359
>gi|413919735|gb|AFW59667.1| hypothetical protein ZEAMMB73_680472 [Zea mays]
Length = 344
Score = 238 bits (606), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 116/266 (43%), Positives = 164/266 (61%), Gaps = 7/266 (2%)
Query: 32 VVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN---KEGNRT 88
+VS E+ ++ +WMA HGR+Y E+E R ++F++NL Y++ N G +
Sbjct: 31 IVSYGERSEEEARRMYAEWMAAHGRTYNAVGEEERRFEVFRDNLRYVDAHNAAADAGVHS 90
Query: 89 YKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAV 148
++LG N+F+DLTNDE+RA Y G + R N D+P S+DWR KGAV
Sbjct: 91 FRLGLNRFADLTNDEYRATYLGVRSRPQRERRLGDRYLAGDN---EDLPESVDWRAKGAV 147
Query: 149 TPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAF 208
+K+Q CG CWAF+ +AAVEGI +I +G++I LSEQ+L+DC T+ N GC GG + AF
Sbjct: 148 AEVKDQGSCGSCWAFSTIAAVEGINQIVTGDMISLSEQELVDCDTSYNQGCNGGLMDYAF 207
Query: 209 AYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVS 267
+II N GI TE++YPY+ G C +K A I +YE+VP+ E++L KAV+ QP+S
Sbjct: 208 EFIINNGGIDTEEDYPYKGTDGRCDVNRKNAKVVTIDSYEDVPANSEKSLQKAVANQPIS 267
Query: 268 IAIAAYSTEFQSYKEGIFNGVCGTQL 293
+AI A FQ Y GIF G CG +
Sbjct: 268 VAIEAGGRAFQLYNSGIFTGTCGNSV 293
>gi|395514298|ref|XP_003761356.1| PREDICTED: cathepsin L1-like [Sarcophilus harrisii]
Length = 365
Score = 238 bits (606), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 134/360 (37%), Positives = 200/360 (55%), Gaps = 41/360 (11%)
Query: 25 LVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE 84
LVS +V++ ++++ +W AQH R Y + ++ R I+++NL IE N E
Sbjct: 7 LVSLCLGLVAAIPKLDRTLDAQWYQWKAQHRRDYGEN--EDWRRAIWEKNLRSIEMHNLE 64
Query: 85 ---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFK-------------- 127
G ++++ N+F D+TN+EFR + G+ R T F+
Sbjct: 65 YSAGKHSFQMEMNKFGDMTNEEFRQVMNGFSTHR-VQRRTKGRLFREPLLVQIPKSVDWR 123
Query: 128 ----------------YQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEG 171
++ + +P S+DWRDKG VTP+KNQ +CG CWAF+A ++EG
Sbjct: 124 DKGYVTPVKNQLVRRLFREPLLVQIPKSVDWRDKGYVTPVKNQGQCGSCWAFSATGSLEG 183
Query: 172 ITKIRSGNLIQLSEQQLLDCST-NGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPG 230
++G L+ LSEQ L+DCST GN+GC GG + AF Y+ +N GI TE+ YPY A
Sbjct: 184 QWFRKTGKLVSLSEQNLVDCSTAQGNSGCQGGLMDNAFEYVKENGGIDTEESYPYIAADD 243
Query: 231 TCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGI-FNGV 288
TC + + A I+ Y ++PS E+AL KAV ++ P+S+AI A + FQ Y+ G+ +
Sbjct: 244 TCQYKPQYSGANITGYVDIPSRMEKALEKAVATVGPISVAIDAGHSSFQFYRSGVYYEPE 303
Query: 289 CGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE-GLCGIGTRSSYP 346
C ++ LDH V VG+G YW++KNSWG WGD+GY+ + RD CGI T +SYP
Sbjct: 304 CSSEDLDHGVLAVGYGVQGKNGKYWIVKNSWGEEWGDSGYILMARDRNNHCGIATAASYP 363
>gi|118424553|gb|ABK90824.1| cathepsin L-like cysteine proteinase [Spodoptera exigua]
Length = 344
Score = 238 bits (606), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 125/311 (40%), Positives = 183/311 (58%), Gaps = 16/311 (5%)
Query: 53 QHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNR---TYKLGTNQFSDLTNDEFRALYT 109
+H + Y E+E + R+KI+ EN I K N+ + +YKL N+++D+ + EF
Sbjct: 33 EHSKQYDSEVEDKFRMKIYVENKHRITKHNQRFEQRLVSYKLKPNKYADMLHHEFVHTMN 92
Query: 110 GY-KMPSPSHRSTTSSTFKYQNLSMTDV-------PTSLDWRDKGAVTPIKNQKECGCCW 161
G+ K R+ + + T + P +DWR KGAVT +K+Q +CG CW
Sbjct: 93 GFNKTAKHGGRNKNVHGKGHDGRAATFIAPAHVSYPDHVDWRKKGAVTDVKDQGKCGSCW 152
Query: 162 AFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIATE 220
AF+ A+EG ++G L+ LSEQ L+DCS GNNGC GG + AF YI N GI TE
Sbjct: 153 AFSTTGALEGQHFRKTGYLVSLSEQNLIDCSAAYGNNGCNGGLMDNAFKYIKDNGGIDTE 212
Query: 221 DEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQS 279
YPY+AV C K + A + ++P GDE+ L++AV ++ P+S+AI A FQ
Sbjct: 213 KSYPYEAVDDKCRYNPKESGADDVGFVDIPQGDEEKLMQAVATVGPISVAIDASQETFQF 272
Query: 280 YKEGIF--NGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE-GL 336
Y +G++ T LDH V +VG+GT EDG++ WL+KNSWG +WG+ GY+K+ R++
Sbjct: 273 YSKGVYYDENCSSTDLDHGVMVVGYGTEEDGSDDWLVKNSWGRSWGELGYIKMARNKNNH 332
Query: 337 CGIGTRSSYPL 347
CGI + +SYPL
Sbjct: 333 CGIASSASYPL 343
>gi|309380130|gb|ADO65978.1| cathepsin L [Eriocheir sinensis]
gi|309380134|gb|ADO65980.1| cathepsin L [Eriocheir sinensis]
Length = 325
Score = 238 bits (606), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 130/313 (41%), Positives = 192/313 (61%), Gaps = 24/313 (7%)
Query: 48 EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEF 104
+++ A++G+ Y+ E R ++++N E+I N++ G ++ L NQF D+T +E
Sbjct: 23 QQFKARYGKQYRSTKEDSYRQSVYEQNQEFINSHNEQYENGLVSFTLAMNQFGDMTTEEI 82
Query: 105 RALYTGY-----KMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGC 159
A G+ K+P R T YQ L + ++P ++DWRDKGAVTP+K+QK CG
Sbjct: 83 NAAMNGFLSAGKKVP----RGTM-----YQPL-VDELPDTVDWRDKGAVTPVKDQKACGS 132
Query: 160 CWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIA 218
CWAF+A ++EG + +G L+ LSEQ L+DCS GN GC GG + AF YI N GI
Sbjct: 133 CWAFSATGSLEGQHFLSTGKLVSLSEQNLVDCSDKYGNFGCGGGLMDNAFRYIKDNNGID 192
Query: 219 TEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQ-PVSIAIAAYSTEF 277
TE+ YPY+A G C A +S+Y ++ G E L KAV+ + PVS+AI A ++ F
Sbjct: 193 TEESYPYEAKNGPCRFNSDNVGATLSSYVDIQHGSEDDLQKAVAEKGPVSVAIDASTSTF 252
Query: 278 QSYKEGI-FNGVCGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE- 334
Y GI ++ C + LDH V VG+G T+D ++YWL+KNSW TWGD+GY+K+ R+
Sbjct: 253 HFYSRGIYYDEKCSSSFLDHGVLAVGYG-TDDSSDYWLVKNSWNETWGDSGYIKMSRNRN 311
Query: 335 GLCGIGTRSSYPL 347
CGI +++SYP+
Sbjct: 312 NNCGIASQASYPV 324
>gi|344953542|gb|AEN28617.1| cathepsin L-like cysteine protease [Epinephelus coioides]
Length = 336
Score = 237 bits (605), Expect = 5e-60, Method: Compositional matrix adjust.
Identities = 135/341 (39%), Positives = 197/341 (57%), Gaps = 18/341 (5%)
Query: 19 FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
+ + ++ C S +S+ S Q + + E W + H + Y E E+ R ++++NL+ I
Sbjct: 1 MLPLAVVALCLSAALSAPSLDPQ-LDDHWELWKSWHSKKYH-EKEEGWRRMVWEKNLKKI 58
Query: 79 EKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD 135
E N E G +Y+LG N F D+T++EFR L GYK + + S F N +
Sbjct: 59 ELHNLEHSMGTHSYRLGMNHFGDMTHEEFRQLMNGYKRKAET--KARGSLFLEPNF--LE 114
Query: 136 VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-TN 194
P S+DWRD G VTP+K+Q +CG CWAF+ A+EG ++G L+ LSEQ L+DCS
Sbjct: 115 APKSVDWRDNGYVTPVKDQGQCGSCWAFSTTGALEGQHFRKTGKLVSLSEQNLVDCSRPE 174
Query: 195 GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPG-TCSAAQKPAAAKISNYEEVPSGD 253
GN GC GG ++AF Y+ NQG+ +ED YPY C + + + ++PSG
Sbjct: 175 GNEGCNGGLMDQAFQYVKDNQGLDSEDSYPYLGTDDQPCHYDPTYNSVNDTGFVDIPSGK 234
Query: 254 EQALLKAV-SMQPVSIAIAAYSTEFQSYKEGI-FNGVCGT-QLDHAVTIVGF---GTTED 307
E+AL+KAV ++ PVS+AI A FQ Y+ GI + C + +LDH V +VG+ G D
Sbjct: 235 ERALMKAVAAVGPVSVAIDAGHESFQFYQSGIYYEKECSSEELDHGVLVVGYGFQGEDVD 294
Query: 308 GANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
G YW++KNSW WGD GY+ + +D + CGI T +SYPL
Sbjct: 295 GKKYWIVKNSWSEKWGDKGYIYMAKDRKNHCGIATAASYPL 335
>gi|148224022|ref|NP_001087489.1| cathepsin L2 precursor [Xenopus laevis]
gi|51258284|gb|AAH80004.1| MGC81823 protein [Xenopus laevis]
Length = 335
Score = 237 bits (605), Expect = 5e-60, Method: Compositional matrix adjust.
Identities = 138/342 (40%), Positives = 200/342 (58%), Gaps = 20/342 (5%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
M + + C + V ++ +T + ++ + W H +SY + E+ R ++++NL
Sbjct: 1 MALYLVAAALCLTTVFAAPTT-DPALDDHWHLWKNWHKKSYLPK-EEGWRRVLWEKNLRT 58
Query: 78 IEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT 134
IE N + G +Y+LG NQF D+TN+EFR L GYK + + STF N
Sbjct: 59 IEFHNLDHSLGKHSYRLGMNQFGDMTNEEFRQLMNGYK----NQKMIKGSTFLAPN--NF 112
Query: 135 DVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-T 193
+ P ++DWR+KG VTP+K+Q +CG CWAF+ A+EG ++G LI LSEQ L+DCS
Sbjct: 113 EAPKTVDWREKGYVTPVKDQGQCGSCWAFSTTGALEGQHYRKAGKLISLSEQNLVDCSRA 172
Query: 194 NGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGT-CSAAQKPAAAKISNYEEVPSG 252
GN GC GG ++AF Y+ N GI +ED YPY A C +A + + +VPSG
Sbjct: 173 QGNQGCNGGLMDQAFQYVKDNGGIDSEDSYPYTAKDDQECHYDPNYNSANDTGFVDVPSG 232
Query: 253 DEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGI-FNGVCGTQ-LDHAVTIVGF---GTTE 306
E+ L+KAV S+ PVS+A+ A FQ Y+ GI ++ C ++ LDH V +VG+ G
Sbjct: 233 SEKDLMKAVASVGPVSVAVDAGHKSFQFYQSGIYYDPECSSEDLDHGVLVVGYGFEGEDV 292
Query: 307 DGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
DG YW++KNSW WG+ GY+KI +D CGI T +SYPL
Sbjct: 293 DGKRYWIVKNSWSEKWGNNGYIKIAKDRHNHCGIATAASYPL 334
>gi|17062058|gb|AAL34984.1|AF320565_1 cathepsine L-like cysteine protease [Rhodnius prolixus]
Length = 316
Score = 237 bits (605), Expect = 5e-60, Method: Compositional matrix adjust.
Identities = 124/312 (39%), Positives = 189/312 (60%), Gaps = 17/312 (5%)
Query: 48 EKWMA---QHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTN 101
++W+A HG++Y+++ E+ R+K+F +N + I++ N + G +YK+ N DL
Sbjct: 11 QEWLAFKAMHGKNYRNQFEEIFRMKVFIDNKKKIDEHNAKYELGEASYKMKMNHLGDLMV 70
Query: 102 DEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCW 161
EF+AL G+K + R+ +NL P S+DWR +GAVTP+K+Q CG CW
Sbjct: 71 HEFKALMNGFKKTPNAERNGKIYVPSNENL-----PKSVDWRQRGAVTPVKDQGHCGSCW 125
Query: 162 AFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-TNGNNGCLGGSREKAFAYIIQNQGIATE 220
+F+A ++EG +++G L+ LSEQ L+DCS T GN+GC GG +AF Y+ N+GI TE
Sbjct: 126 SFSATGSLEGQLFLKTGRLVSLSEQNLVDCSKTYGNSGCEGGLMNQAFQYVRDNKGIDTE 185
Query: 221 DEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQS 279
YPY+A C + Y ++ E+ L AV ++ P+S+ I A FQ
Sbjct: 186 ASYPYEARENNCRFKEDKVGGTDKGYVDILEASEKDLQSAVATVGPISVRIDASHESFQF 245
Query: 280 YKEGIFN-GVCG-TQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGL 336
Y EG++ C +QLDH V VG+G TE+G +YWL+KNSWG +WG++GY+KI R+ +
Sbjct: 246 YSEGVYKEQYCSPSQLDHGVLTVGYG-TENGQDYWLVKNSWGPSWGESGYIKIARNHKNH 304
Query: 337 CGIGTRSSYPLA 348
CGI + +SYP+
Sbjct: 305 CGIASMASYPVV 316
>gi|149751225|ref|XP_001490531.1| PREDICTED: cathepsin S-like [Equus caballus]
Length = 332
Score = 237 bits (605), Expect = 5e-60, Method: Compositional matrix adjust.
Identities = 129/317 (40%), Positives = 187/317 (58%), Gaps = 15/317 (4%)
Query: 39 HEQSVVEIH-EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTN 94
H ++ H + W +G+ YK++ E+ R I++ NL+++ N E G +Y LG N
Sbjct: 20 HRDPTLDNHWDLWKKTYGKQYKEKNEEVARRLIWERNLKFVMLHNLEHSMGMHSYDLGMN 79
Query: 95 QFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQ 154
D+T++E +L + ++PS R+ T Y++ +P SLDWR+KG VT +K Q
Sbjct: 80 HLGDMTSEEVTSLMSSLRVPSQWQRNVT-----YKSNPNEKLPDSLDWREKGCVTEVKYQ 134
Query: 155 KECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN--GNNGCLGGSREKAFAYII 212
CG CWAF+AV A+E K+++GNL+ LS Q L+DCST N GC GG AF YII
Sbjct: 135 GSCGACWAFSAVGALEAQLKLKTGNLVSLSAQNLVDCSTEKYSNKGCNGGFMTAAFQYII 194
Query: 213 QNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQ-PVSIAIA 271
N GI ++ YPY+A+ G C K AA S Y E+P G E L +AV+ + PVS+AI
Sbjct: 195 DNNGIDSDASYPYKAMDGKCRYDSKNRAATCSKYTELPFGSEDDLKEAVANKGPVSVAID 254
Query: 272 AYSTEFQSYKEGI-FNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKI 330
A F YK G+ ++ C ++H V +VG+G +G +YWL+KNSWG +GD GY+++
Sbjct: 255 ASHPSFFLYKSGVYYDPSCTQNVNHGVLVVGYGNL-NGKDYWLVKNSWGINFGDKGYIRM 313
Query: 331 VRDEG-LCGIGTRSSYP 346
R+ G CGI SYP
Sbjct: 314 ARNSGNHCGIANYCSYP 330
>gi|356549192|ref|XP_003542981.1| PREDICTED: cysteine proteinase RD21a-like [Glycine max]
Length = 517
Score = 237 bits (605), Expect = 6e-60, Method: Compositional matrix adjust.
Identities = 126/317 (39%), Positives = 190/317 (59%), Gaps = 17/317 (5%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTY--KLGTNQFS 97
E+ V+E+ ++W ++ + Y+ ++++R + FK NL+YI + N + Y LG N+F+
Sbjct: 43 EEGVIELFQRWKEENKKIYRSPDQEKLRFENFKRNLKYIAEKNSKRISPYGQSLGLNRFA 102
Query: 98 DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKEC 157
D++N+EF++ +T S R+ S ++ S D P SLDWR KG VT +K+Q C
Sbjct: 103 DMSNEEFKSKFTSKVKKPFSKRNGLSG----KDHSCEDAPYSLDWRKKGVVTAVKDQGYC 158
Query: 158 GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
GCCWAF++ A+EGI I SG+LI LSE +L+DC N+GC GG + AF +++ N GI
Sbjct: 159 GCCWAFSSTGAIEGINAIVSGDLISLSEPELVDCDRT-NDGCDGGHMDYAFEWVMHNGGI 217
Query: 218 ATEDEYPYQAVPGTCSAAQKPAAA-KISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTE 276
TE YPY GTC+ A++ I Y V D ++LL A QP+S I S +
Sbjct: 218 DTETNYPYSGADGTCNVAKEETKVIGIDGYYNVEQSD-RSLLCATVKQPISAGIDGSSWD 276
Query: 277 FQSYKEGIFNGVCGT---QLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD 333
FQ Y GI++G C + +DHA+ +VG+G+ D +YW++KNSWG +WG GY+ I R+
Sbjct: 277 FQLYIGGIYDGDCSSDPDDIDHAILVVGYGSEGD-EDYWIVKNSWGTSWGMEGYIYIRRN 335
Query: 334 E----GLCGIGTRSSYP 346
G+C I +SYP
Sbjct: 336 TNLKYGVCAINYMASYP 352
>gi|242074968|ref|XP_002447420.1| hypothetical protein SORBIDRAFT_06g000780 [Sorghum bicolor]
gi|241938603|gb|EES11748.1| hypothetical protein SORBIDRAFT_06g000780 [Sorghum bicolor]
Length = 381
Score = 237 bits (605), Expect = 6e-60, Method: Compositional matrix adjust.
Identities = 127/313 (40%), Positives = 188/313 (60%), Gaps = 21/313 (6%)
Query: 39 HEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSD 98
HE ++E WMA HGRSY EK R +I+++N+++IE N++ +T+ G NQF+D
Sbjct: 51 HELLMMERFHAWMAAHGRSYPTAEEKLRRFQIYRDNVKFIEAINRDTTKTFTCGENQFTD 110
Query: 99 LTNDEFRALYTGYKMPS-PSHRSTTSSTFKYQNLSM-----------TDVPTSLDWRDKG 146
LT+ EF A YT S P S++ T + +++ TD+P +DWR++
Sbjct: 111 LTHQEFLARYTMASHDSVPLDLSSSVITTRAGDITESDSGTTMQVEDTDLPEHVDWREQD 170
Query: 147 AVTPIKNQKE-CGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSRE 205
AVTP++NQ + C CW FA+VA +E KI++G+L++LSEQQ++DC+ C GG+ +
Sbjct: 171 AVTPVQNQLQGCHACWVFASVATIESANKIKNGDLLKLSEQQIVDCTA---EKCGGGTLQ 227
Query: 206 KAFAYIIQNQGIATEDEY-PYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQ 264
+AF Y+ +N GIATE+EY Y A G+C A A +I Y+ +P +E AL + V Q
Sbjct: 228 EAFKYVQKNGGIATEEEYGAYTAKAGSCHAGNVRKAVRIQTYDFLPRENETALAEKVVQQ 287
Query: 265 PVSIAIAAYSTEFQSYKEGIFNG---VCGTQLDHAVTIVGFGTTED-GANYWLIKNSWGN 320
PV++ A+ F YK GI++G L+HA+ IVG+G E G YW+ KNSWG
Sbjct: 288 PVAVLFDAHDPAFAYYKGGIYSGGQPRTRYVLNHAMAIVGYGKNESTGQKYWIAKNSWGT 347
Query: 321 TWGDAGYMKIVRD 333
WGD GY+ I +D
Sbjct: 348 GWGDGGYVYIAKD 360
>gi|342675481|gb|AEL31666.1| cathepsin L [Cynoglossus semilaevis]
Length = 336
Score = 237 bits (604), Expect = 6e-60, Method: Compositional matrix adjust.
Identities = 136/341 (39%), Positives = 195/341 (57%), Gaps = 18/341 (5%)
Query: 19 FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
+ + LL S V+S+ S + + + E W H + Y E E+ R I+++NL I
Sbjct: 1 MLPLALLALGVSAVLSAPSL-DARLSDHWELWKNWHSKKYH-EKEEGWRRMIWEKNLNKI 58
Query: 79 EKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD 135
E N E G +Y+LG N F D+T++EFR + GY+ + R S F N +
Sbjct: 59 ELHNLEHSMGKHSYRLGMNHFGDMTHEEFRQIMNGYQ--RKTERKAIGSLFMEPNFMVA- 115
Query: 136 VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-TN 194
P+++DWR+KG VTP+K+Q +CG CWAF+ A+ZG + G L+ LSEQ L+DCS
Sbjct: 116 -PSAVDWREKGYVTPVKDQGQCGSCWAFSTTGALZGQNFRKMGKLVSLSEQNLVDCSRPE 174
Query: 195 GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPG-TCSAAQKPAAAKISNYEEVPSGD 253
GN GC GG ++AF Y+ NQG+ +ED YPY C K + + + ++PSG
Sbjct: 175 GNEGCGGGLMDQAFQYVKDNQGLDSEDSYPYLGTDDQPCHYDPKYNSVNDTGFVDIPSGK 234
Query: 254 EQALLKAV-SMQPVSIAIAAYSTEFQSYKEGI-FNGVCGT-QLDHAVTIVGF---GTTED 307
E AL+KAV S+ PVS+AI A FQ Y+ GI + C + +LDH V VG+ G D
Sbjct: 235 EHALMKAVASVGPVSVAIDAGHESFQFYQSGIYYEKECSSEELDHGVLAVGYGFEGEDVD 294
Query: 308 GANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
G YW++KNSW WGD GY+ + +D + CGI T +SYPL
Sbjct: 295 GKKYWIVKNSWSEKWGDKGYIYMAKDRKNHCGIATAASYPL 335
>gi|530734|emb|CAA56914.1| cathepsin l [Nephrops norvegicus]
gi|1582620|prf||2119193A cathepsin L-related Cys protease
Length = 324
Score = 237 bits (604), Expect = 8e-60, Method: Compositional matrix adjust.
Identities = 133/311 (42%), Positives = 181/311 (58%), Gaps = 21/311 (6%)
Query: 48 EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEF 104
E++ + GR Y D E+ RL +F +NL+YIE+ NK+ G TY L NQFSDLTNDEF
Sbjct: 21 EEFKGKFGRKYVDLEEERYRLNVFLDNLQYIEEFNKKYESGEVTYNLAINQFSDLTNDEF 80
Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP---TSLDWRDKGAVTPIKNQKECGCCW 161
++ GYK S R + F + TD T +DWR KG VT +K+Q +CG CW
Sbjct: 81 NSMMKGYKT---SLRPKPVAVF-----TSTDAAPETTEVDWRTKGCVTHVKDQGQCGSCW 132
Query: 162 AFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN--GNNGCLGGSREKAFAYIIQNQGIAT 219
AF+A ++EG ++ G L+ L+EQQL+DC+ N GC GG +AF YI N GI T
Sbjct: 133 AFSATGSLEGQHFLKYGELVSLAEQQLVDCAGGIYYNQGCNGGWVNQAFKYIKANGGIDT 192
Query: 220 EDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQA-LLKAVSMQPVSIAIAAYSTEFQ 278
E YPY+A TC AA S + + G E + + + P+S+AI A FQ
Sbjct: 193 ESSYPYEARDNTCRFNSNSVAATCSGFVSIAQGSESPEVRRTTNTGPISVAIDAAHRSFQ 252
Query: 279 SYKEGIF--NGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE-G 335
SY G++ +QLDHAV VG+G +E G ++WL+KNSWG +WG AGY+ + R+
Sbjct: 253 SYSSGVYYEPSCSSSQLDHAVLAVGYG-SEGGQDFWLVKNSWGTSWGSAGYINMARNRNN 311
Query: 336 LCGIGTRSSYP 346
CGI T +SYP
Sbjct: 312 NCGIATDASYP 322
>gi|298705581|emb|CBJ28832.1| Cathepsin L-like proteinase [Ectocarpus siliculosus]
Length = 553
Score = 236 bits (603), Expect = 9e-60, Method: Compositional matrix adjust.
Identities = 124/335 (37%), Positives = 191/335 (57%), Gaps = 24/335 (7%)
Query: 36 RSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGN-RTYKLGTN 94
R+ + V WMAQHG ++ + E + RLKIF EN + I+ N + T+ L N
Sbjct: 32 RTAEDAKVANRFRAWMAQHGVTFGTKGEFDRRLKIFAENSDLIDTHNTANDGSTFTLSHN 91
Query: 95 QFSDLTNDEFRALYTGYKM----PSPSHRSTTSSTF--------KYQNLSMTDVPTSLDW 142
+FS L+ DEF+ + GYK P P+ ++ + L+ +++P +DW
Sbjct: 92 EFSHLSWDEFKETHFGYKRSSDKPKPARQTPERRPMEKVAGGRRRLVELTGSEIPDEVDW 151
Query: 143 RDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGG 202
+GAVTP++NQ CG CWAF+ + A+EG + + +LI+ SE+QL+DC + GC GG
Sbjct: 152 VREGAVTPVQNQGMCGSCWAFSTIGAMEGAYYLATDDLIKFSEEQLVDCDKV-DKGCFGG 210
Query: 203 SREKAFAYIIQNQGIATEDEYPYQAV-PG--TCSAAQKPA-AAKISNYEEVPSGDEQALL 258
E+AF +I +N G+ EDEYPY + P TC+ P +++ + +V + DE +
Sbjct: 211 DMEQAFDWIKENGGVCPEDEYPYVGLWPPFKTCATTCTPVEGSQVKEWAQVKATDEALMT 270
Query: 259 KAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSW 318
++ P++IAI A FQ Y +G++ CG +LDH V VG+GT EDG +YW +KNSW
Sbjct: 271 ALATVGPIAIAIEADQMAFQFYSDGVYTAPCGDKLDHGVLAVGYGTWEDGTDYWKVKNSW 330
Query: 319 GNTWGDAGYMKIVR-----DE-GLCGIGTRSSYPL 347
G++WG GY+ + R DE G CG+ + YP+
Sbjct: 331 GDSWGQGGYILLERADSEEDEGGQCGLLIEAIYPI 365
>gi|354507493|ref|XP_003515790.1| PREDICTED: cathepsin L1-like [Cricetulus griseus]
gi|344259154|gb|EGW15258.1| Cathepsin L1 [Cricetulus griseus]
Length = 333
Score = 236 bits (603), Expect = 9e-60, Method: Compositional matrix adjust.
Identities = 129/310 (41%), Positives = 186/310 (60%), Gaps = 19/310 (6%)
Query: 48 EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN---KEGNRTYKLGTNQFSDLTNDEF 104
KW + + R Y E+E R ++++N++ IE N EG Y + N F D+TN+EF
Sbjct: 30 HKWKSTYRRLYGTN-EEEWRRAVWEKNMKMIELHNGEYSEGKHGYTMEMNAFGDMTNEEF 88
Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFA 164
R L GYK HR +Q M +P S+DWR+KG VTP+KNQ +CG CWAF+
Sbjct: 89 RQLVNGYK--HQKHRKGKV----FQEPLMLQLPKSVDWREKGCVTPVKNQGQCGSCWAFS 142
Query: 165 AVAAVEGITKIRSGNLIQLSEQQLLDCS-TNGNNGCLGGSREKAFAYIIQNQGIATEDEY 223
A A+EG +++G L+ LSEQ L+DCS GN GC GG + AF Y++ N+G+ +E+ Y
Sbjct: 143 ACGALEGQMCLKTGVLVSLSEQNLVDCSQAEGNQGCNGGLMDFAFQYVLNNKGLDSEESY 202
Query: 224 PYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKE 282
PY+A GTC + AAA + Y ++P E+AL+KAV ++ P++IAI A FQ Y
Sbjct: 203 PYEAKDGTCKYKPEFAAANDTGYVDIPQ-LEKALMKAVATVGPIAIAIDASHPSFQFYSS 261
Query: 283 GIF--NGVCGTQLDHAVTIVGF---GTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE-GL 336
GI+ +LDH V +VG+ GT + YW++KNSWG++WG G+ I +D+
Sbjct: 262 GIYYEPNCSSKELDHGVLVVGYGFEGTDSNKKKYWIVKNSWGSSWGMGGFFHIAKDKNNH 321
Query: 337 CGIGTRSSYP 346
CG+ T +SYP
Sbjct: 322 CGVATAASYP 331
>gi|355753449|gb|EHH57495.1| Cathepsin L1 [Macaca fascicularis]
Length = 333
Score = 236 bits (603), Expect = 9e-60, Method: Compositional matrix adjust.
Identities = 131/341 (38%), Positives = 195/341 (57%), Gaps = 23/341 (6%)
Query: 17 PMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLE 76
P FI+ + AS + T S+ KW A H R Y E+ R ++++N++
Sbjct: 3 PTFILAAFCLGIASATL----TFNHSLEAQWTKWKAMHNRLYGMN-EEGWRRAVWEKNMK 57
Query: 77 YIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
IE N+E G ++ + N F D+T++EFR + G++ P +Q L
Sbjct: 58 MIELHNQEYSQGKHSFTMAMNTFGDMTSEEFRQVMNGFQNRKPRKGKV------FQELLF 111
Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS- 192
+ P S+DWR+KG VTP+KNQ +CG CWAF+A A+EG ++G L+ LSEQ L+DCS
Sbjct: 112 YEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSW 171
Query: 193 TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSG 252
GN GC GG + AF Y+ N G+ +E+ YPY+A +C + + A + + ++P
Sbjct: 172 PQGNEGCNGGLMDYAFQYVADNGGLDSEESYPYEATEESCKYNPEYSVANDTGFVDIPK- 230
Query: 253 DEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGI-FNGVCGTQ-LDHAVTIVGFG---TTE 306
E+AL+KAV ++ P+S+AI A F YKEGI F C ++ +DH V +VG+G T
Sbjct: 231 QEKALMKAVATVGPISVAIDAGHESFMFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTES 290
Query: 307 DGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYP 346
D + YWL+KNSWG WG GY+K+ +D CGI + +SYP
Sbjct: 291 DNSKYWLVKNSWGEEWGMGGYIKMAKDRRNHCGIASAASYP 331
>gi|351705687|gb|EHB08606.1| Cathepsin S [Heterocephalus glaber]
Length = 331
Score = 236 bits (602), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 128/328 (39%), Positives = 195/328 (59%), Gaps = 15/328 (4%)
Query: 28 CASQVVSSRSTHEQSVVEIH-EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE-- 84
C + ++ + +++ H W +G+ Y+++ E+++R I+++NL+++ N E
Sbjct: 8 CVTCSLAGAQLQQDPMLDYHWHLWKKTYGKHYQEKNEEQVRRLIWEKNLKFVMLHNLEHS 67
Query: 85 -GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWR 143
G +Y LG N D+T++E R+L + ++P R+ T + Q L P S+DWR
Sbjct: 68 MGMHSYDLGMNHLGDMTSEEVRSLMSSLRVPRQWLRNVTYKSDPNQKL-----PDSVDWR 122
Query: 144 DKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG--NNGCLG 201
+KG VT +K Q CG CWAF+AV A+EG K+++G L+ LS Q L+DCST N GC G
Sbjct: 123 EKGCVTEVKYQGACGSCWAFSAVGALEGQLKLKTGKLVSLSAQNLVDCSTEKYRNKGCSG 182
Query: 202 GSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV 261
G +AF Y+I N GI +E YPY+A C K AA S Y E+P G E+AL +AV
Sbjct: 183 GFMTEAFQYVIDNNGIDSETSYPYKATDEKCHYDSKNRAATCSRYTELPYGSEEALKEAV 242
Query: 262 SMQ-PVSIAIAAYSTEFQSYKEGIFNG-VCGTQLDHAVTIVGFGTTEDGANYWLIKNSWG 319
+ + PVS+A+ A F YK G+++ C + H V VG+G +G +YWL+KNSWG
Sbjct: 243 ANKGPVSVAVDASRPSFFLYKNGVYDDPSCTQNVTHGVLAVGYGNL-NGKDYWLVKNSWG 301
Query: 320 NTWGDAGYMKIVRDEG-LCGIGTRSSYP 346
+GD GY+++ R++G CGI + SSYP
Sbjct: 302 LYFGDQGYIRMARNKGNHCGIASYSSYP 329
>gi|2746723|gb|AAB94925.1| cathepsin S precursor [Mus musculus]
Length = 340
Score = 236 bits (602), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 127/306 (41%), Positives = 185/306 (60%), Gaps = 15/306 (4%)
Query: 50 WMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEFRA 106
W H + YKD+ E+E+R I+++NL++I N E G TY++G N D+TN+E
Sbjct: 39 WKKTHEKEYKDKNEEEVRRLIWEKNLKFIMIHNLEYSMGMHTYQVGMNDMGDMTNEEISC 98
Query: 107 LYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAV 166
++ S ++ T +++ S +P ++DWR+KG VT +K Q CG CWAF+AV
Sbjct: 99 RMGALRISRQSPKTVT-----FRSYSNRTLPDTVDWREKGCVTEVKYQGSCGACWAFSAV 153
Query: 167 AAVEGITKIRSGNLIQLSEQQLLDCSTN---GNNGCLGGSREKAFAYIIQNQGIATEDEY 223
A+EG K+++G LI LS Q L+DCS GN GC GG +AF YII N GI + Y
Sbjct: 154 GALEGQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEAFQYIIDNGGIEADASY 213
Query: 224 PYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQ-PVSIAIAAYSTEFQSYKE 282
PY+A+ C K AA S Y ++P GDE AL +AV+ + PVS+ I A + F YK
Sbjct: 214 PYKAMDEKCHYNSKNRAATCSRYIQLPFGDEDALKEAVATKGPVSVGIDASHSSFFFYKS 273
Query: 283 GIFNG-VCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR-DEGLCGIG 340
G+++ C ++H V +VG+GT DG +YWL+KNSWG +GD GY+++ R ++ CGI
Sbjct: 274 GVYDDPSCTGNVNHGVLVVGYGTL-DGKDYWLVKNSWGLNFGDQGYIRMARNNKNHCGIA 332
Query: 341 TRSSYP 346
+ SYP
Sbjct: 333 SYCSYP 338
>gi|384941728|gb|AFI34469.1| cathepsin L2 preproprotein [Macaca mulatta]
Length = 334
Score = 236 bits (602), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 133/340 (39%), Positives = 196/340 (57%), Gaps = 19/340 (5%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
M + + L C + S+ +Q++ +W A H R Y E+ R ++++N++
Sbjct: 1 MNLSLVLAAFCLG-IASAVPKFDQNLDTKWYQWKATHRRLYGAS-EEGWRRAVWEKNMKM 58
Query: 78 IEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT 134
IE N E G + + N F D+TN+EFR + ++ + + F+
Sbjct: 59 IELHNGEYSQGKHGFAMAMNAFGDMTNEEFRQVMGCFR----NQKLRKGKLFR--EPLFL 112
Query: 135 DVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST- 193
D+P S+DWR KG VTP+KNQK+CG CWAF+A A+EG ++G L+ LSEQ L+DCS
Sbjct: 113 DLPKSVDWRKKGYVTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRP 172
Query: 194 NGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGD 253
GN GC GG AF Y+ +N G+ +E+ YPY A+ G C + + A + +E VP+G
Sbjct: 173 QGNQGCNGGFMNSAFRYVKENGGLDSEESYPYVAMDGICKYRSENSVANDTGFEVVPAGK 232
Query: 254 EQALLKAV-SMQPVSIAIAAYSTEFQSYKEGI-FNGVCGTQ-LDHAVTIVGF---GTTED 307
E+AL+KAV ++ P+S+A+ A + FQ YK GI F C ++ LDH V +VG+ G D
Sbjct: 233 EKALMKAVATVGPISVAMDAGHSSFQFYKSGIYFEPDCSSKNLDHGVLVVGYGFEGANSD 292
Query: 308 GANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYP 346
YWL+KNSWG WG GY+KI +D + CGI T +SYP
Sbjct: 293 NNKYWLVKNSWGPEWGSNGYVKIAKDKDNHCGIATAASYP 332
>gi|355567966|gb|EHH24307.1| Cathepsin L2 [Macaca mulatta]
gi|355753494|gb|EHH57540.1| Cathepsin L2 [Macaca fascicularis]
gi|380790509|gb|AFE67130.1| cathepsin L2 preproprotein [Macaca mulatta]
Length = 334
Score = 236 bits (602), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 133/340 (39%), Positives = 196/340 (57%), Gaps = 19/340 (5%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
M + + L C + S+ +Q++ +W A H R Y E+ R ++++N++
Sbjct: 1 MNLSLVLAAFCLG-IASAVPKFDQNLDTKWYQWKATHRRLYGAS-EEGWRRAVWEKNMKM 58
Query: 78 IEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT 134
IE N E G + + N F D+TN+EFR + ++ + + F+
Sbjct: 59 IELHNGEYSQGKHGFAMAMNAFGDMTNEEFRQVMGCFR----NQKLRKGKLFR--EPLFL 112
Query: 135 DVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-T 193
D+P S+DWR KG VTP+KNQK+CG CWAF+A A+EG ++G L+ LSEQ L+DCS
Sbjct: 113 DLPKSVDWRKKGYVTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSHP 172
Query: 194 NGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGD 253
GN GC GG AF Y+ +N G+ +E+ YPY A+ G C + + A + +E VP+G
Sbjct: 173 QGNQGCNGGFMNSAFRYVKENGGLDSEESYPYVAMDGICKYRPENSVANDTGFEVVPAGK 232
Query: 254 EQALLKAV-SMQPVSIAIAAYSTEFQSYKEGI-FNGVCGTQ-LDHAVTIVGF---GTTED 307
E+AL+KAV ++ P+S+A+ A + FQ YK GI F C ++ LDH V +VG+ G D
Sbjct: 233 EKALMKAVATVGPISVAMDAGHSSFQFYKSGIYFEPDCSSKNLDHGVLVVGYGFEGANSD 292
Query: 308 GANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYP 346
YWL+KNSWG WG GY+KI +D + CGI T +SYP
Sbjct: 293 NNKYWLVKNSWGPEWGSNGYVKIAKDKDNHCGIATAASYP 332
>gi|84660246|emb|CAI43320.1| cathepsin L [Lubomirskia baicalensis]
gi|85677150|emb|CAI46307.1| cathepsin L [Lubomirskia baicalensis]
Length = 327
Score = 236 bits (602), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 134/307 (43%), Positives = 186/307 (60%), Gaps = 12/307 (3%)
Query: 48 EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNR-TYKLGTNQFSDLTNDEFRA 106
E W +HG+ Y + E+ R I++ N +Y+++ N + + +G NQF+DL + EF
Sbjct: 23 ESWKKEHGKVYNSDREELTRHIIWQANRKYVDEHNAHAEKFGFTVGMNQFADLESSEFGR 82
Query: 107 LYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAV 166
LY GY PS + S F + + D+PTS+DWR KG VT IKNQ +CG CWAF+AV
Sbjct: 83 LYNGYN-NKPSMKKAQSKVF---STKVGDLPTSVDWRTKGFVTAIKNQGQCGSCWAFSAV 138
Query: 167 AAVEGITKIRSGNLIQLSEQQLLDCST-NGNNGCLGGSREKAFAYIIQNQGIATEDEYPY 225
A +EG +G L+ LSEQ L+DCST GN GC GG + AF Y+I+N GI TE YPY
Sbjct: 139 AGLEGQHFNATGTLVSLSEQNLVDCSTAEGNQGCNGGLMDNAFQYVIKNGGIDTEASYPY 198
Query: 226 QAVPGTCSAAQKPAAAKISNYEEV-PSGDEQALLKAVSMQ-PVSIAIAAYSTEFQSYKEG 283
+AV C + S + ++ P E AL AV++ P+S+AI A T FQ YK G
Sbjct: 199 KAVDQKCKFNAANVGSTCSGFSDILPHKSEAALQVAVAVVGPISVAIDASHTSFQLYKSG 258
Query: 284 IFN-GVCG-TQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE-GLCGIG 340
+++ C T LDH VT VG+ ++ G YW++KNSWG TWG AGY+ + R++ CGI
Sbjct: 259 VYSESACSQTSLDHGVTAVGYDSSS-GVAYWIVKNSWGTTWGQAGYIWMSRNKNNQCGIA 317
Query: 341 TRSSYPL 347
T +SYP+
Sbjct: 318 TAASYPI 324
>gi|395819351|ref|XP_003783057.1| PREDICTED: cathepsin L1-like [Otolemur garnettii]
Length = 333
Score = 236 bits (602), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 132/340 (38%), Positives = 202/340 (59%), Gaps = 20/340 (5%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
M +++ L C + S+ S + S+ +W A+H + Y E+ R ++++N++
Sbjct: 1 MNLLLILAAFCVG-ITSATSMFDGSLNAHWYRWKAKHRKLYGMR-EEGWRRAVWEKNMKM 58
Query: 78 IEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT 134
IE N+E G + + N F D+TN+EFR + G++ + H+ +Q S
Sbjct: 59 IEVHNQEYSQGKHGFTMAMNAFGDMTNEEFRQVMNGFR--NQKHKKGKV----FQEPSFL 112
Query: 135 DVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST- 193
+VP S+DWR+KG VTP+KNQ +CG CWAF+A A+EG ++G LI LSEQ L+DCS
Sbjct: 113 EVPKSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLISLSEQNLVDCSRP 172
Query: 194 NGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGD 253
GN GC GG + AF YI +N G+ +E+ YPY A+ +C + + A + + ++P +
Sbjct: 173 QGNEGCDGGLMDYAFQYIKENGGLDSEESYPYDAMDESCKYRPEYSVANDTGFVDIPK-E 231
Query: 254 EQALLKAV-SMQPVSIAIAAYSTEFQSYKEGI-FNGVCGTQ-LDHAVTIVGFG---TTED 307
E+AL+KAV ++ P+S+AI A FQ YKEG+ F C + +DH V +VG+G T D
Sbjct: 232 EKALMKAVATVGPISVAIDAGHESFQFYKEGVYFEPECSSDNVDHGVLVVGYGYEETESD 291
Query: 308 GANYWLIKNSWGNTWGDAGYMKIVRDE-GLCGIGTRSSYP 346
+WL+KNSWG WG GY+K+ +D+ CGI T +SYP
Sbjct: 292 NNKFWLVKNSWGEEWGLGGYIKMTKDQKNHCGIATAASYP 331
>gi|334332714|ref|XP_001367224.2| PREDICTED: cathepsin L1-like [Monodelphis domestica]
Length = 335
Score = 236 bits (602), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 126/309 (40%), Positives = 192/309 (62%), Gaps = 13/309 (4%)
Query: 48 EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEF 104
+W AQHG+SY E R +++NL+ IE+ N+E G +++L N+F D++ +EF
Sbjct: 30 HQWKAQHGKSYAAN-EDSWRRATWEKNLKMIERHNQEYSAGKHSFQLRMNKFGDMSTEEF 88
Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFA 164
+ + GYK R+ S Y+ + +P S+DWR+KG VTP+K Q+ C CWAF+
Sbjct: 89 KQVMNGYKSNGSQKRTKGS---LYRESLLAQLPESVDWREKGYVTPVKEQRGCYSCWAFS 145
Query: 165 AVAAVEGITKIRSGNLIQLSEQQLLDCST-NGNNGCLGGSREKAFAYIIQNQGIATEDEY 223
A A+EG ++G L+ LS Q L+DCS GNNGC GG AF Y+ N GI TE+ Y
Sbjct: 146 AAGAIEGQWFRKTGKLVSLSVQNLVDCSIPEGNNGCDGGLMGNAFQYVQDNGGIDTEECY 205
Query: 224 PYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVS-MQPVSIAIAAYSTEFQSYKE 282
PY A C + + A ++ + ++PS DE+AL+KAV+ + P+S+AI A + F+ Y+
Sbjct: 206 PYVAQDNECKYQPECSGANVTGFVKIPSTDERALMKAVANVGPISVAIDAGNPSFKFYQS 265
Query: 283 GI-FNGVC-GTQLDHAVTIVGFGTT-EDGANYWLIKNSWGNTWGDAGYMKIVRDE-GLCG 338
G+ ++ C +QL+H V +VG+G+ ++G YW++KNSWG WGD GY+ + +DE CG
Sbjct: 266 GVYYDPQCSSSQLNHGVLVVGYGSEGKNGRKYWIVKNSWGENWGDNGYVLMAKDEDNHCG 325
Query: 339 IGTRSSYPL 347
I T +SYP+
Sbjct: 326 IITDASYPI 334
>gi|71897043|ref|NP_001026516.1| cathepsin S precursor [Gallus gallus]
gi|53126701|emb|CAG30977.1| hypothetical protein RCJMB04_1f23 [Gallus gallus]
Length = 328
Score = 236 bits (602), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 130/332 (39%), Positives = 195/332 (58%), Gaps = 18/332 (5%)
Query: 25 LVSCASQVVSSRST--HEQSVVEIH-EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKA 81
L+ C + +V+ + H ++ H + W HG+ Y+ + E+ R +++NL +
Sbjct: 3 LLRCMAVLVTLVAVMGHPDPTLDQHWQLWKKAHGKEYRHQAEEGQRRATWEKNLRLVMLH 62
Query: 82 NKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPT 138
N E G +Y+LG N D+T+++ AL TG ++P H T ST++ + P
Sbjct: 63 NLEHSLGLHSYQLGMNHMGDMTSEDVAALLTGLRVPY-GHNQT--STYRRRG----GAPD 115
Query: 139 SLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNN 197
++DWR+KG VT +KNQ CG CWAF+AV A+E K+++G L+ LS Q L+DCS GN
Sbjct: 116 AMDWREKGCVTEVKNQGACGACWAFSAVGALEAQVKLKTGKLVSLSAQNLVDCSMMYGNK 175
Query: 198 GCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQAL 257
GC GG +AF YII N GI +E+ YPY A GTC AA S Y E+P DE AL
Sbjct: 176 GCGGGFMTRAFQYIIDNNGIDSEESYPYMAQNGTCQYNVSTRAATCSKYVELPYADEAAL 235
Query: 258 LKAVS-MQPVSIAIAAYSTEFQSYKEGIFNGV-CGTQLDHAVTIVGFGTTEDGANYWLIK 315
AV+ + PVS+AI A F Y+ G+++ C +++H V +VG+GT + ++WL+K
Sbjct: 236 KDAVANVGPVSVAIDATQPTFFLYRSGVYDDPRCTQEVNHGVLVVGYGTLNE-KDFWLVK 294
Query: 316 NSWGNTWGDAGYMKIVRDEG-LCGIGTRSSYP 346
NSWG +GD GY+++ R+ CGI + +SYP
Sbjct: 295 NSWGERFGDGGYIRMSRNHANHCGIASYASYP 326
>gi|402898110|ref|XP_003912074.1| PREDICTED: cathepsin L2 [Papio anubis]
Length = 334
Score = 236 bits (602), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 133/340 (39%), Positives = 196/340 (57%), Gaps = 19/340 (5%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
M + + L C + S+ +Q++ +W A H R Y E+ R ++++N++
Sbjct: 1 MNLSLVLAAFCLG-IASAVPKFDQNLDTKWYQWKATHRRLYGAS-EEGWRRAVWEKNMKM 58
Query: 78 IEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT 134
IE N E G + + N F D+TN+EFR + ++ + + F+
Sbjct: 59 IELHNGEYSQGKHGFTMAMNAFGDMTNEEFRQVMGCFR----NQKLRKGKLFR--EPLFL 112
Query: 135 DVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-T 193
D+P S+DWR KG VTP+KNQK+CG CWAF+A A+EG ++G L+ LSEQ L+DCS
Sbjct: 113 DLPKSVDWRKKGYVTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRP 172
Query: 194 NGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGD 253
GN GC GG AF Y+ +N G+ +E+ YPY A+ G C + + A + +E VP+G
Sbjct: 173 QGNQGCNGGFMNSAFRYVKENGGLDSEESYPYVAMDGICKYRPENSVANDTGFEVVPAGK 232
Query: 254 EQALLKAV-SMQPVSIAIAAYSTEFQSYKEGI-FNGVCGTQ-LDHAVTIVGF---GTTED 307
E+AL+KAV ++ P+S+A+ A + FQ YK GI F C ++ LDH V +VG+ G D
Sbjct: 233 EKALMKAVATVGPISVAMDAGHSSFQFYKSGIYFEPDCSSKNLDHGVLVVGYGFEGANSD 292
Query: 308 GANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYP 346
YWL+KNSWG WG GY+KI +D + CGI T +SYP
Sbjct: 293 NNKYWLVKNSWGPEWGSNGYVKIAKDKDNHCGIATAASYP 332
>gi|194741252|ref|XP_001953103.1| GF17600 [Drosophila ananassae]
gi|190626162|gb|EDV41686.1| GF17600 [Drosophila ananassae]
Length = 333
Score = 236 bits (602), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 136/340 (40%), Positives = 203/340 (59%), Gaps = 22/340 (6%)
Query: 19 FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMA---QHGRSYKDELEKEMRLKIFKENL 75
I+ LL++ A V + Q ++E E+WMA ++ + Y+DE E+++R KIF N
Sbjct: 6 LILFMLLLAIAHAV-----PYAQDILE--EEWMAFKLEYNKVYQDETEEQLRFKIFNYNK 58
Query: 76 EYIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
I + N + G ++ L N+F+DL + EF+ L G KM SPS + SSTF ++
Sbjct: 59 LLIARHNLKWAAGKVSFNLAVNKFADLLDHEFQDLMLG-KM-SPSGSNFGSSTF-LPPVN 115
Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
+T +P ++DWR G VTP+K+Q CG CWAF+ ++EG ++G LI LSEQ L+DCS
Sbjct: 116 LT-LPDAVDWRKYGFVTPVKDQGSCGSCWAFSTTGSLEGQHFRKTGQLISLSEQNLIDCS 174
Query: 193 TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSG 252
GNNGC G+ E AF YI N+GI TE YPY+A C + A + + ++ G
Sbjct: 175 P-GNNGCKNGAVEYAFRYIQSNKGIDTEISYPYEAAQNQCRFRRDTIGATSTGFVKLNPG 233
Query: 253 DEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGIFNG-VCG-TQLDHAVTIVGFGTTEDGA 309
DE L +AV ++ P+S+ I + F+ Y +G++N C +L HAV +VG+GT + G
Sbjct: 234 DEMELAQAVATVGPISVLINSSLDSFKFYHDGVYNDPSCNPNKLTHAVLVVGYGTDDRGG 293
Query: 310 NYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPLA 348
++WL+KNSW WG+ GY+KI R+ LCGI + + YPL
Sbjct: 294 DFWLVKNSWSTHWGEQGYVKIKRNANNLCGIASNALYPLV 333
>gi|229893789|gb|ACQ90252.1| cathepsin L [Pinctada fucata]
Length = 362
Score = 236 bits (602), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 134/315 (42%), Positives = 189/315 (60%), Gaps = 22/315 (6%)
Query: 47 HEKWM---AQHGRSYKDELEKEM-RLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDL 99
HE W G+ Y D +E+E+ R IF++ LE IE+ N++ G ++Y +G NQFSD+
Sbjct: 51 HETWKEFKTLFGKVY-DTVEEEIKRFDIFRDTLERIEEHNRKYHMGQKSYYMGVNQFSDM 109
Query: 100 TNDEF---RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKE 156
++DE+ L G + S + + S + +DWRDKG VTP+KNQ +
Sbjct: 110 SHDEYLRHNGLRRGNRKYSKGEGCDSYTK------SGKQLDDKVDWRDKGYVTPVKNQGQ 163
Query: 157 CGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-TNGNNGCLGGSREKAFAYIIQNQ 215
CG CW+F+ ++EG ++G LI LSEQQL+DCS T GN GC GG + AF YI
Sbjct: 164 CGSCWSFSTTGSLEGQHFRQTGKLISLSEQQLVDCSGTFGNEGCNGGLMDNAFEYIKSIG 223
Query: 216 GIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYS 274
G+ ED+YPY A G C + A + +V SGDE AL A+ S+ P+S+AI A
Sbjct: 224 GLEGEDDYPYTAKQGKCHLKKSLFKANDTGCTDVESGDEDALKDALASVGPISVAIDASH 283
Query: 275 TEFQSYKEGIFN-GVCGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR 332
FQSY G+++ C +Q LDH V VG+GT E+G +YWL+KNSWG WG+ GY+K+ R
Sbjct: 284 ASFQSYDGGVYDEEECSSQNLDHGVLTVGYGTEENGGDYWLVKNSWGEMWGEEGYIKMSR 343
Query: 333 D-EGLCGIGTRSSYP 346
+ + CGI T++SYP
Sbjct: 344 NKDNQCGIATQASYP 358
>gi|530736|emb|CAA56915.1| cathepsin l [Nephrops norvegicus]
gi|1582621|prf||2119193B cathepsin L-related Cys protease
Length = 313
Score = 236 bits (601), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 132/307 (42%), Positives = 184/307 (59%), Gaps = 16/307 (5%)
Query: 48 EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEF 104
E + Q+GR Y D E+ R ++F++N + +E NK+ G T+K+ NQF D+TN+EF
Sbjct: 13 EHFKTQYGRKYGDAKEELYRQRVFQQNEQLVEAFNKKFENGEVTFKVAMNQFGDMTNEEF 72
Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFA 164
A+ GYK S R ++ F + M +DWR KGAVTP+K+Q +CG CWAF+
Sbjct: 73 NAVMKGYKKGS---RGEPTTVFTAEGRPMA---ADVDWRTKGAVTPVKDQGQCGSCWAFS 126
Query: 165 AVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEY 223
A ++EG +++ L+ LSEQ+L+DCST GN+GC GG AF YI N GI TE Y
Sbjct: 127 ATGSLEGQHFLKNNELVSLSEQELVDCSTEYGNDGCGGGWMTSAFDYIKDNGGIDTESSY 186
Query: 224 PYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVS-MQPVSIAIAAYSTEFQSYKE 282
PY+A +C A + + EV E+AL +AVS + P+S+AI A FQ Y
Sbjct: 187 PYEAQDRSCRFDANSIGATCTGFVEVQH-TEEALHEAVSDIGPISVAIDASHFSFQFYSS 245
Query: 283 GIF--NGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGI 339
G++ T LDH V VG+G TE +YWL+KNSWG+ WGDAGY+K+ R+ + CGI
Sbjct: 246 GVYYEKKCSPTNLDHGVLAVGYG-TESTEDYWLVKNSWGSGWGDAGYIKMSRNRDNNCGI 304
Query: 340 GTRSSYP 346
+ SYP
Sbjct: 305 ASEPSYP 311
>gi|32396018|gb|AAP41846.1| cysteine protease [Anthurium andraeanum]
Length = 502
Score = 236 bits (601), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 131/317 (41%), Positives = 181/317 (57%), Gaps = 18/317 (5%)
Query: 45 EIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRT----YKLGTNQFSDLT 100
E+ E+WM +H + Y EK R F NL ++ K N EG R +G N F+DL+
Sbjct: 49 ELFERWMEKHRKVYAHPGEKARRYANFLSNLAFVRKRNAEGRRAPSSGQGVGMNVFADLS 108
Query: 101 NDEFRALYTG--YKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECG 158
N+EFR +Y+ + + R + + ++ D P SLDWR +GAVT +KNQ +CG
Sbjct: 109 NEEFREVYSSRVLRKKAAEGRGARRRAGEGRVVAGCDAPASLDWRKRGAVTAVKNQGDCG 168
Query: 159 CCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIA 218
CWAF++ A+EGI I +G LI LSEQ+L+DC T N GC GG + AF ++I N GI
Sbjct: 169 SCWAFSSTGAMEGINAITTGELISLSEQELVDCDTT-NEGCDGGYMDYAFEWVINNGGID 227
Query: 219 TEDEYPY--QAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTE 276
+E YPY QA + ++ I YE+V + E ALL A QPVS+ I S +
Sbjct: 228 SEANYPYTGQADSVCNTTKEEIKVVSIDGYEDVAT-SESALLCAAVQQPVSVGIDGSSLD 286
Query: 277 FQSYKEGIFNGVCG---TQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD 333
FQ Y GI++G C +DHAV +VG+G + G +YW++KNSWG WG GY+ I R+
Sbjct: 287 FQLYAGGIYDGDCSGNPDDIDHAVLVVGYG-QQGGTDYWIVKNSWGTDWGMQGYIYIRRN 345
Query: 334 EGL----CGIGTRSSYP 346
GL C I +SYP
Sbjct: 346 TGLPYGVCAIDAMASYP 362
>gi|2961621|gb|AAC05781.1| cathepsin S [Mus musculus]
Length = 340
Score = 236 bits (601), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 127/306 (41%), Positives = 184/306 (60%), Gaps = 15/306 (4%)
Query: 50 WMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEFRA 106
W H + YKD+ E+E+R I+++NL++I N E G TY++G N D+TN+E
Sbjct: 39 WKKTHEKEYKDKNEEEVRRLIWEKNLKFIMIHNLEYSMGMHTYQVGMNDMGDMTNEEISC 98
Query: 107 LYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAV 166
++ S ++ T +++ S +P ++DWR+KG VT +K Q CG CWAF+AV
Sbjct: 99 RMGALRISRQSPKTVT-----FRSYSNRTLPDTVDWREKGCVTEVKYQGSCGACWAFSAV 153
Query: 167 AAVEGITKIRSGNLIQLSEQQLLDCSTN---GNNGCLGGSREKAFAYIIQNQGIATEDEY 223
A+EG K+++G LI LS Q L+DCS GN GC GG +AF YII N GI + Y
Sbjct: 154 GALEGQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEAFQYIIDNGGIEADASY 213
Query: 224 PYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQ-PVSIAIAAYSTEFQSYKE 282
PY+A C K AA S Y ++P GDE AL +AV+ + PVS+ I A + F YK
Sbjct: 214 PYKATDEKCHYNSKNRAATCSRYIQLPFGDEDALKEAVATKGPVSVGIDASHSSFFFYKS 273
Query: 283 GIFNG-VCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR-DEGLCGIG 340
G+++ C ++H V +VG+GT DG +YWL+KNSWG +GD GY+++ R ++ CGI
Sbjct: 274 GVYDDPSCTGNVNHGVLVVGYGTL-DGKDYWLVKNSWGLNFGDQGYIRMARNNKNHCGIA 332
Query: 341 TRSSYP 346
+ SYP
Sbjct: 333 SYCSYP 338
>gi|405966499|gb|EKC31777.1| Cathepsin L [Crassostrea gigas]
Length = 331
Score = 236 bits (601), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 126/301 (41%), Positives = 184/301 (61%), Gaps = 14/301 (4%)
Query: 54 HGRSYKDELEKEMRLKIFKENLEYIEKAN---KEGNRTYKLGTNQFSDLTNDEFRALYTG 110
H ++Y + E++MR I+++N+ YI+K N G TY LG N+++D+T EFRA+ G
Sbjct: 35 HKKTYSQD-EEQMRRLIWEDNVNYIQKHNLAADRGEHTYWLGQNEYADMTIFEFRAIMNG 93
Query: 111 YKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVE 170
YKM + + T ++ D+P S+DWR +G VT IKNQ CG CW+F+A ++E
Sbjct: 94 YKMSA----NRTKGDLYMSPSNIGDLPDSVDWRKEGYVTDIKNQGHCGSCWSFSATGSLE 149
Query: 171 GITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVP 229
G S L+ LSEQ L+DCS GN+GC GG + AF YI N+GI TE+ YPY A
Sbjct: 150 GQHFKASKKLVSLSEQNLVDCSKKEGNHGCQGGLMDNAFRYIESNKGIDTEESYPYTAKN 209
Query: 230 GTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGIFN-- 286
G C + A + Y ++P E L +AV ++ P+S+ I A FQ Y+EG+++
Sbjct: 210 GFCHFKAENVGATDTGYVDIPHMQEDKLQEAVATVGPISVGIDAGHKSFQLYREGVYSEP 269
Query: 287 GVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE-GLCGIGTRSSY 345
++LDH V VG+G TE G +YWL+KNSWG +WG GY+ + R++ +CGI T++SY
Sbjct: 270 ACSSSKLDHGVLAVGYG-TESGDDYWLVKNSWGTSWGMQGYVMMARNKHNMCGIATQASY 328
Query: 346 P 346
P
Sbjct: 329 P 329
>gi|226821421|gb|ACO82386.1| cathepsin L-like protein [Lutjanus argentimaculatus]
Length = 301
Score = 235 bits (600), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 128/299 (42%), Positives = 178/299 (59%), Gaps = 16/299 (5%)
Query: 61 ELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPS 117
E E+ R ++++NL+ IE N E G +Y+LG N F D+T++EFR + GYK
Sbjct: 6 EKEEGWRRMVWEKNLKKIEMHNLEHSMGTHSYRLGMNHFGDMTHEEFRQIMNGYK--RKP 63
Query: 118 HRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRS 177
R T S F N + P ++DWRD G VTP+K+Q +CG CWAF+ A+EG ++
Sbjct: 64 QRKFTGSLFMEPNF--LEAPRAVDWRDNGYVTPVKDQGQCGSCWAFSTTGALEGQHFRKT 121
Query: 178 GNLIQLSEQQLLDCS-TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPG-TCSAA 235
G L+ LSEQ L+DCS GN GC GG ++AF YI NQG+ +ED YPY C
Sbjct: 122 GKLVSLSEQNLVDCSRPEGNEGCNGGLMDQAFQYIKDNQGLDSEDSYPYLGTDDQPCHYD 181
Query: 236 QKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGIF--NGVCGTQ 292
K +A + + ++PSG E+AL+KAV ++ PVS+AI A FQ Y+ GI+ +
Sbjct: 182 PKYNSANDTGFVDIPSGKERALMKAVAAVGPVSVAIDAGHESFQFYQSGIYYEKDCSSEE 241
Query: 293 LDHAVTIVGF---GTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
LDH V +VG+ G DG YW++KNSW WGD GY+ + +D + CGI T +SYPL
Sbjct: 242 LDHGVLVVGYGFEGEDVDGKKYWIVKNSWSEKWGDKGYIYMAKDRKNHCGIATAASYPL 300
>gi|242020003|ref|XP_002430447.1| Cathepsin L precursor, putative [Pediculus humanus corporis]
gi|212515585|gb|EEB17709.1| Cathepsin L precursor, putative [Pediculus humanus corporis]
Length = 345
Score = 235 bits (600), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 130/321 (40%), Positives = 192/321 (59%), Gaps = 17/321 (5%)
Query: 43 VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN---KEGNRTYKLGTNQFSDL 99
V+E + + A+H ++Y +++E++ R+KIF +N + I K N + G YKLG N++SD+
Sbjct: 23 VMEEWQLFKAEHKKNYNNDVEEKFRMKIFMDNKQKITKHNTKYQRGEVGYKLGLNKYSDM 82
Query: 100 TNDEFRALYTGYK--MPSPSHRSTTSSTFKYQNLSM----TDVPTSLDWRDKGAVTPIKN 153
+ EF + G+ + P RS T + + +P +DW GAVTP+K+
Sbjct: 83 LHHEFINTFNGFNKSIIPPHLRSNNGKTHLKGSFFIPPANVKLPKHVDWVKLGAVTPVKD 142
Query: 154 QKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST-NGNNGCLGGSREKAFAYII 212
Q CG CWAF+A A+EG+ ++ L+ LSEQ L+DCST GNNGC GG ++AF Y+
Sbjct: 143 QGHCGSCWAFSATGALEGLHFRKTKVLVSLSEQNLIDCSTEEGNNGCNGGLMDQAFQYVR 202
Query: 213 QNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIA 271
N GI TE YPY+ C + + A + Y +VP GDE AL AV ++ PVS+AI
Sbjct: 203 INGGIDTERSYPYEGNNDVCRYEPENSGAIDTGYTDVPLGDEDALKSAVATVGPVSVAID 262
Query: 272 AYSTEFQSYKEGI-FNGVCGTQ---LDHAVTIVGFGTTEDG-ANYWLIKNSWGNTWGDAG 326
A FQ Y G+ F C + LDH V +VG+GT E+ +YWL+KNSWG++WG+ G
Sbjct: 263 ASQESFQLYSSGVYFEPNCKNEPESLDHGVLVVGYGTDEETQQDYWLVKNSWGDSWGENG 322
Query: 327 YMKIVRD-EGLCGIGTRSSYP 346
Y+K+ R+ + CGI T+ S+P
Sbjct: 323 YIKMARNADNQCGIATQPSFP 343
>gi|94421564|gb|ABF18889.1| cathepsin-L [Lygus lineolaris]
Length = 314
Score = 235 bits (600), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 122/294 (41%), Positives = 181/294 (61%), Gaps = 14/294 (4%)
Query: 48 EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN---KEGNRTYKLGTNQFSDLTNDEF 104
E + A++G++Y+ + R I+ E + + N ++G +YKLG N F+D+ N EF
Sbjct: 28 ESYKAKYGKTYESNENEAARRTIYFMAKEKVMEHNARFEQGLVSYKLGLNSFADMHNGEF 87
Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFA 164
R + GY+ +P + S + ++T +P S+DWR KGAVTPIKNQ +CG CWAF+
Sbjct: 88 RKMMNGYRRGTPRN-----SVVVHVESNIT-LPASVDWRTKGAVTPIKNQGQCGSCWAFS 141
Query: 165 AVAAVEGITKIRSGNLIQLSEQQLLDCS-TNGNNGCLGGSREKAFAYIIQNQGIATEDEY 223
++EG ++ G L+ LSEQ+L+DCS GN+GC GG + AF YI +N GI TE Y
Sbjct: 142 TTGSLEGQHALKKGKLVSLSEQELVDCSAAEGNDGCDGGLMDDAFTYIKKNNGIDTEQSY 201
Query: 224 PYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKA-VSMQPVSIAIAAYSTEFQSYKE 282
PY GTCS + AA ++ + +V SG E L A ++ P+S+AI A S +FQ Y+
Sbjct: 202 PYTGEDGTCSFKKSDVAATVTGFVDVTSGSESGLQDASATIGPISVAIDASSWDFQLYES 261
Query: 283 GIFN--GVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE 334
G+++ T+LDH V +VG+G T+DG YWL+KNSWG WG GY+++ R +
Sbjct: 262 GVYDVSDCSTTELDHGVLVVGYG-TDDGTAYWLVKNSWGTDWGHHGYIQMSRKQ 314
>gi|414591039|tpg|DAA41610.1| TPA: hypothetical protein ZEAMMB73_356414 [Zea mays]
Length = 376
Score = 235 bits (600), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 136/330 (41%), Positives = 189/330 (57%), Gaps = 28/330 (8%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
E+S+ ++E+W + H S +D EK+ R + FK N +I + NK + YKLG N+F+DL
Sbjct: 38 EESMWSLYERWRSVHTVS-RDLREKQSRFEAFKANARHIGEFNKRKDVPYKLGLNKFADL 96
Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFKYQN---------LSMTDVPTSLDWRDKGAVTP 150
T +EF + YTG K+ + +S + + S+ D P + DWRD GAVT
Sbjct: 97 TQEEFVSKYTGAKVVDSEAAARLASGVRVSSSDESPPQLAASVGDAPDAWDWRDHGAVTA 156
Query: 151 IKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAY 210
+K+Q +CG CWAF+AV AVE + I +GNL+ LSEQQ+LDCS G+ GG A Y
Sbjct: 157 VKDQGQCGSCWAFSAVGAVESVNAIVTGNLLTLSEQQMLDCSGAGDC-TYGGYTYYAMLY 215
Query: 211 IIQNQGIATED--EYPY-------QAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV 261
I N G+ + + PY Q +P A+KP KI + + + DE AL +AV
Sbjct: 216 AISN-GLTLDQCGKTPYYQRYDAQQHLPCRFD-AKKPPVVKIDSMYVMNNADEAALKRAV 273
Query: 262 SMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNT 321
QPVS+ I A + Y EG+F G CGT L+HAV +VG+G T DG YW++KNSWG
Sbjct: 274 YKQPVSVLIDAGGIGY--YSEGVFTGPCGTSLNHAVLLVGYGATADGTKYWIVKNSWGAD 331
Query: 322 WGDAGYMKIVRD----EGLCGIGTRSSYPL 347
WG+ GY ++ RD GLCGI YP+
Sbjct: 332 WGEKGYFRLKRDVGTQGGLCGITMYPIYPI 361
>gi|332260024|ref|XP_003279085.1| PREDICTED: cathepsin L1 isoform 3 [Nomascus leucogenys]
gi|441593306|ref|XP_004087072.1| PREDICTED: cathepsin L1 [Nomascus leucogenys]
gi|441593309|ref|XP_004087073.1| PREDICTED: cathepsin L1 [Nomascus leucogenys]
Length = 333
Score = 235 bits (600), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 130/342 (38%), Positives = 195/342 (57%), Gaps = 23/342 (6%)
Query: 16 TPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENL 75
P I+ + AS + T + S+ KW A H R Y E+ R ++++N+
Sbjct: 2 NPTLILAAFCLGIASATL----TFDHSLEAQWTKWKAMHNRLYGMN-EEGWRRAVWEKNM 56
Query: 76 EYIEKAN---KEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
+ IE+ N +EG ++ + N F D+T++EFR + G++ P +Q
Sbjct: 57 KMIEQHNQEYREGKHSFTMAMNAFGDMTSEEFRQVMNGFQNRKPRKGKV------FQEPL 110
Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
+ P S+DWR+KG VTP+KNQ +CG CWAF+A A+EG ++G L+ LSEQ L+DCS
Sbjct: 111 FYEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCS 170
Query: 193 -TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPS 251
GN GC GG + AF Y+ N G+ +E+ YPY+A +C K + A + + ++P
Sbjct: 171 GPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIPK 230
Query: 252 GDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGI-FNGVCGTQ-LDHAVTIVGFG---TT 305
E+AL+KAV ++ P+S+A+ A FQ YKEGI F C ++ +DH V +VG+G T
Sbjct: 231 -QEKALMKAVATVGPISVAVDAGHQSFQFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTE 289
Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYP 346
D YWL+KNSWG WG GY+K+ +D CGI + +SYP
Sbjct: 290 SDNNKYWLVKNSWGEEWGMGGYIKMAKDRRNHCGIASAASYP 331
>gi|61368403|gb|AAX43172.1| cathepsin S [synthetic construct]
Length = 332
Score = 235 bits (600), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 128/338 (37%), Positives = 199/338 (58%), Gaps = 18/338 (5%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIH-EKWMAQHGRSYKDELEKEMRLKIFKENLE 76
M ++ +L+ C+S V H+ ++ H W +G+ YK++ E+ +R I+++NL+
Sbjct: 1 MKRLVCVLLVCSSAVAQ---LHKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLK 57
Query: 77 YIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
++ N E G +Y LG N D+T++E +L + ++PS R+ T Y++
Sbjct: 58 FVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLRVPSQWQRNIT-----YKSNPN 112
Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST 193
+P S+DWR+KG VT +K Q CG CWAF+AV A+E K+++G L+ LS Q L+DCST
Sbjct: 113 RILPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCST 172
Query: 194 N--GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPS 251
GN GC GG AF YII N+GI ++ YPY+A+ C K AA S Y E+P
Sbjct: 173 EKYGNKGCNGGFMTTAFQYIIDNKGIDSDASYPYKAMDQKCQYDSKYRAATCSKYTELPY 232
Query: 252 GDEQALLKAVSMQ-PVSIAIAAYSTEFQSYKEGI-FNGVCGTQLDHAVTIVGFGTTEDGA 309
G E L +AV+ + PVS+ + A F Y+ G+ + C ++H V +VG+G +G
Sbjct: 233 GREDVLKEAVANKGPVSVGVDARHPSFFLYRSGVYYEPSCTQNVNHGVLVVGYGDL-NGK 291
Query: 310 NYWLIKNSWGNTWGDAGYMKIVRDEG-LCGIGTRSSYP 346
YWL+KNSWG+ +G+ GY+++ R++G CGI + SYP
Sbjct: 292 EYWLVKNSWGHNFGEEGYIRMARNKGNHCGIASFPSYP 329
>gi|167526493|ref|XP_001747580.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163774026|gb|EDQ87660.1| predicted protein [Monosiga brevicollis MX1]
Length = 330
Score = 235 bits (600), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 133/324 (41%), Positives = 186/324 (57%), Gaps = 24/324 (7%)
Query: 30 SQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEK-EMRLKIFKENLEYIEKANKEGNRT 88
SQ + R+ H V++ + HG Y +L E + NL IE A+ GN +
Sbjct: 11 SQFLPRRNLH--LVLKGPTAFRRIHGVFYSSQLGLCEPAFRCHLANLRVIE-AHNAGNSS 67
Query: 89 YKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTS-LDWRDKGA 147
+ +G QF+DLT EF A + M + T + +T+ P +DWR K A
Sbjct: 68 FTMGITQFADLTAAEFSAYVKRFPM---------NVTRPRNEVWITEAPLQEVDWRQKNA 118
Query: 148 VTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREK 206
VT IKNQ +CG CW+F+ +VEG I +G L+ LSEQQL+DCST GN+GC GG +
Sbjct: 119 VTEIKNQGQCGSCWSFSTTGSVEGAHAIATGKLVSLSEQQLMDCSTRYGNHGCNGGLMDY 178
Query: 207 AFAYIIQNQGIATEDEYPYQAVPGTCSA-AQKPAAAKISNYEEVPSGDEQALLKAVSMQP 265
AF Y+I N G+ TE++YPY A G C+ +K AA+I + VP E L AVS+ P
Sbjct: 179 AFEYVIANGGLDTEEDYPYTAEDGKCNTEKEKKHAAEIHGFRNVPKEHEDQLAAAVSIGP 238
Query: 266 VSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDA 325
VS+AI A FQ Y G+F+G CGT LDH V +VG+ +YW++KNSWG +WG+
Sbjct: 239 VSVAIEADQAGFQHYTSGVFDGKCGTSLDHGVLVVGYSD-----DYWIVKNSWGKSWGEE 293
Query: 326 GYMKIVR---DEGLCGIGTRSSYP 346
GY+++ R +G+CGI ++SYP
Sbjct: 294 GYIRLKRGVDKKGMCGITMQASYP 317
>gi|426216524|ref|XP_004002512.1| PREDICTED: cathepsin S isoform 1 [Ovis aries]
Length = 331
Score = 235 bits (600), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 133/331 (40%), Positives = 196/331 (59%), Gaps = 18/331 (5%)
Query: 25 LVSCASQVVSSRSTHEQSVVEIH-EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANK 83
L+ C+S + H ++ H + W +G+ Y+++ E+ R I+++NL+ + N
Sbjct: 8 LLLCSSAMAQ---VHRDPTLDHHWDLWKKTYGKQYEEKNEEVARRLIWEKNLKTVMLHNL 64
Query: 84 E---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSL 140
E G +Y+LG N D+T++E + + ++PS R+ T + Q L P SL
Sbjct: 65 EHSMGMHSYELGMNHLGDMTSEEVISSMSSLRVPSQWPRNVTYKSSPNQKL-----PDSL 119
Query: 141 DWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST--NGNNG 198
DWR+KG VT +K Q CG CWAF+AV A+E K+++G L+ LS Q L+DCST GN G
Sbjct: 120 DWREKGCVTEVKYQGACGSCWAFSAVGALEAQVKLKTGKLVSLSAQNLVDCSTVKYGNKG 179
Query: 199 CLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALL 258
C GG +AF YII N GI +E YPY+A+ G C K AA S Y E+P G E+AL
Sbjct: 180 CNGGFMTEAFQYIIDNNGIDSEASYPYKAMDGRCQYDVKNRAATCSRYIELPFGSEEALK 239
Query: 259 KAVSMQ-PVSIAIAAYSTEFQSYKEGI-FNGVCGTQLDHAVTIVGFGTTEDGANYWLIKN 316
+AV+ + PVS+ I A T F YK G+ ++ C ++H V +VG+G+ +G +YWL+KN
Sbjct: 240 EAVANKGPVSVGIDAKQTSFFLYKTGVYYDPSCTQNVNHGVLVVGYGSL-NGKDYWLVKN 298
Query: 317 SWGNTWGDAGYMKIVRDEG-LCGIGTRSSYP 346
SWG +GD GY+++ R+ G CGI SYP
Sbjct: 299 SWGLNFGDQGYIRMARNSGNHCGIANFPSYP 329
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.315 0.130 0.387
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,388,770,322
Number of Sequences: 23463169
Number of extensions: 222062658
Number of successful extensions: 646414
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 6581
Number of HSP's successfully gapped in prelim test: 1124
Number of HSP's that attempted gapping in prelim test: 614430
Number of HSP's gapped (non-prelim): 9535
length of query: 348
length of database: 8,064,228,071
effective HSP length: 143
effective length of query: 205
effective length of database: 9,003,962,200
effective search space: 1845812251000
effective search space used: 1845812251000
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 77 (34.3 bits)