BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 018968
(348 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255563110|ref|XP_002522559.1| cysteine protease, putative [Ricinus communis]
gi|223538250|gb|EEF39859.1| cysteine protease, putative [Ricinus communis]
Length = 344
Score = 389 bits (999), Expect = e-105, Method: Compositional matrix adjust.
Identities = 189/338 (55%), Positives = 245/338 (72%), Gaps = 9/338 (2%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEY 77
+ ++ +LLV+ + SRS HE S+ H+ WM Q+GR YK +EKE RFKIFKEN+E+
Sbjct: 9 LVLMAMLLVTLWASQSWSRSLHEASMELRHKTWMTQYGRVYKGNVEKEKRFKIFKENVEF 68
Query: 78 IEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRST-TSSTFKYQNLSMTDV 136
IE N GN+ YKLG N F+DLTN+EFRA + GY M SH+S+ + +F+Y+N+ T V
Sbjct: 69 IESFNNNGNKPYKLGINAFTDLTNEEFRASHNGYTMSMSSHQSSYRTKSFRYENV--TAV 126
Query: 137 PTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNG- 195
P SLDWR K AVT IKDQ +CGCCWAFSAVAA+EGITK+S LI LSEQ+LVDC T+G
Sbjct: 127 PPSLDWRTKGAVTHIKDQGQCGCCWAFSAVAAMEGITKLSTGTLISLSEQELVDCDTSGM 186
Query: 196 NNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKA-AAAKISNYEEVPSGDE 254
+ GC GG M+ AFE+II+N G+ TE YPY+ V G+C+ + A AAKI+ YE VP+ DE
Sbjct: 187 DQGCEGGLMDDAFEFIIENNGLTTEANYPYEGVDGSCNTRKAANHAAKITGYENVPAYDE 246
Query: 255 QALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLI 314
+AL KAV+ QPVS+ I A + F+ Y GIF G CGT+LDH VT+VG+GT++DG YWL+
Sbjct: 247 EALRKAVANQPVSVAIDAGESAFQHYSSGIFTGDCGTELDHGVTVVGYGTSDDGTKYWLV 306
Query: 315 KNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYPLA 348
KNSWG +WG+ GY+++ RD EGLCGI + SYP A
Sbjct: 307 KNSWGTSWGEDGYIRMERDIDAKEGLCGIAMEPSYPTA 344
>gi|224076970|ref|XP_002305073.1| predicted protein [Populus trichocarpa]
gi|222848037|gb|EEE85584.1| predicted protein [Populus trichocarpa]
Length = 340
Score = 386 bits (991), Expect = e-104, Method: Compositional matrix adjust.
Identities = 184/342 (53%), Positives = 243/342 (71%), Gaps = 11/342 (3%)
Query: 12 KINTIPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIF 71
K NT +++L + A+++ ++ +++ HE+WMAQHGR Y D EKE R+ IF
Sbjct: 5 KCNTRIFLPFLLILAAWATKIACRPLDEQEYMLKRHEEWMAQHGRVYGDMKEKEKRYLIF 64
Query: 72 KENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNL 131
KEN+E IE N +R YKLG N+F+DLTN+EFRA+Y GYK S SS+F+Y+NL
Sbjct: 65 KENIERIEAFNNGSDRGYKLGVNKFADLTNEEFRAMYHGYKRQSSK---LMSSSFRYENL 121
Query: 132 SMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDC 191
S D+PTS+DWR+ AVTP+KDQ CGCCWAFS VAA+EGI K+ NLI LSEQQLVDC
Sbjct: 122 S--DIPTSMDWRNDGAVTPVKDQGTCGCCWAFSTVAAIEGIIKLQTGNLISLSEQQLVDC 179
Query: 192 STNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAA-AKISNYEEVP 250
T GN GC GG M+ AF+YII+N G+ +ED YPYQ V GTCS+ + A+ A+I+ YE+VP
Sbjct: 180 -TAGNKGCQGGLMDTAFQYIIRNGGLTSEDNYPYQGVDGTCSSEKAASTEAQITGYEDVP 238
Query: 251 SGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGAN 310
+E ALL+AV+ QPVS+G+ +F+ YK G+FNG CGTQ +HAVT +G+GT DG +
Sbjct: 239 QNNENALLQAVAKQPVSVGVDGGGNDFQFYKSGVFNGDCGTQQNHAVTAIGYGTDIDGTD 298
Query: 311 YWLIKNSWGDTWGDAGYMKILR----DEGLCGIGTQSSYPLA 348
YWL+KNSWG +WG+ GYM++ R EGLCG+ +SYP A
Sbjct: 299 YWLVKNSWGTSWGENGYMRMRRGIGSSEGLCGVAMDASYPTA 340
>gi|255564908|ref|XP_002523447.1| cysteine protease, putative [Ricinus communis]
gi|223537275|gb|EEF38906.1| cysteine protease, putative [Ricinus communis]
Length = 342
Score = 378 bits (970), Expect = e-102, Method: Compositional matrix adjust.
Identities = 188/337 (55%), Positives = 238/337 (70%), Gaps = 10/337 (2%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEY 77
MF+ ++++ ASQ S RS H+ ++ E HE WMA++GR YKD EKE RF+IF+ N+E+
Sbjct: 10 MFVALLVVGLWASQAWS-RSLHDAAMNERHEMWMAKYGRVYKDNSEKERRFEIFRNNVEF 68
Query: 78 IEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
IE NK GNR YKL N F+DLTN+EF+ GYK S T S+F+Y N+ T VP
Sbjct: 69 IESFNKLGNRPYKLDINEFADLTNEEFKVSKNGYKRSSGVGL-TEKSSFRYANV--TAVP 125
Query: 138 TSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNG-N 196
TS+DWR AVTPIKDQ +CGCCWAFSAVAA+EGITK+S LI LSEQ+LVDC T+G +
Sbjct: 126 TSMDWRQNGAVTPIKDQGQCGCCWAFSAVAAMEGITKLSTGKLISLSEQELVDCDTSGED 185
Query: 197 NGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAA-AAKISNYEEVPSGDEQ 255
GC GG M+ AFE+I QN G+ TE YPYQ GTC+ + AAKI+ YE+VP+ E
Sbjct: 186 QGCEGGLMDDAFEFIKQNGGLTTEANYPYQGTDGTCNTNKAGNDAAKITGYEDVPANSED 245
Query: 256 ALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIK 315
ALLKAV+ QPVS+ I A + F+ Y G+F G CGT+LDH VT VG+GT++DG YWL+K
Sbjct: 246 ALLKAVASQPVSVAIDASGSAFQFYSGGVFTGDCGTELDHGVTAVGYGTSDDGTKYWLVK 305
Query: 316 NSWGDTWGDAGYMKILRD----EGLCGIGTQSSYPLA 348
NSWG +WG+ GY+++ RD EGLCGI Q SYP A
Sbjct: 306 NSWGTSWGEDGYIRMERDIEAKEGLCGIAMQPSYPTA 342
>gi|255564910|ref|XP_002523448.1| cysteine protease, putative [Ricinus communis]
gi|223537276|gb|EEF38907.1| cysteine protease, putative [Ricinus communis]
Length = 341
Score = 375 bits (964), Expect = e-101, Method: Compositional matrix adjust.
Identities = 190/337 (56%), Positives = 238/337 (70%), Gaps = 11/337 (3%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEY 77
MF+ ++++ SQ S RS H+ ++ E HE WM ++GR YKD EKE RF+IF+ N+E+
Sbjct: 10 MFVALLVVGLWVSQAWS-RSLHDAAMNERHEMWMVKYGRVYKDNSEKERRFEIFRNNVEF 68
Query: 78 IEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
IE NK GNR YKL N F+DLTN+EF+A GYK S S SS F+Y N+ T VP
Sbjct: 69 IESFNKPGNRPYKLDINEFADLTNEEFKASRNGYKRSSNVGLSEKSS-FRYGNV--TAVP 125
Query: 138 TSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNG-N 196
TS+DWR K AVTPIKDQ +CGCCWAFSAVAA+EGITK+S LI LSEQ+LVDC T+G +
Sbjct: 126 TSMDWRQKGAVTPIKDQGQCGCCWAFSAVAAMEGITKLSTGKLISLSEQELVDCDTSGED 185
Query: 197 NGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAA-AAKISNYEEVPSGDEQ 255
GC GG M+ AFE+I QN G+ TE YPYQ GTC+ + AAKI+ YE+VP+ E
Sbjct: 186 QGCEGGLMDDAFEFIKQNGGLTTEANYPYQGTDGTCNTNKAGNDAAKITGYEDVPANSED 245
Query: 256 ALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIK 315
ALLKAV+ QPVS+ I A + F+ Y G+F G CGT+LDH VT VG+GT+ DG YWL+K
Sbjct: 246 ALLKAVASQPVSVAIDASGSAFQFYSGGVFTGDCGTELDHGVTAVGYGTS-DGTKYWLVK 304
Query: 316 NSWGDTWGDAGYMKILRD----EGLCGIGTQSSYPLA 348
NSWG +WG+ GY+++ RD EGLCGI QSSYP A
Sbjct: 305 NSWGTSWGEDGYIRMERDIEAKEGLCGIAMQSSYPTA 341
>gi|18408828|ref|NP_566920.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|12324451|gb|AAG52191.1|AC012329_18 putative cysteine proteinase; 15366-14136 [Arabidopsis thaliana]
gi|6723404|emb|CAB66413.1| cysteine protease-like protein [Arabidopsis thaliana]
gi|332645009|gb|AEE78530.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 341
Score = 375 bits (962), Expect = e-101, Method: Compositional matrix adjust.
Identities = 189/345 (54%), Positives = 238/345 (68%), Gaps = 13/345 (3%)
Query: 13 INTIPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFK 72
+ +I F++ ILL S S V S E S VE HE+WM++ R Y D+ EK RF+IF
Sbjct: 1 MTSIVFFLLAILLSSRTSGVTSRGGLFEASAVEKHEQWMSRFNRVYSDDSEKTSRFEIFT 60
Query: 73 ENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHR-STTSS----TFK 127
NL+++E N N+TY L N FSDLT++EF+A YTG +P R STT S +F+
Sbjct: 61 NNLKFVESINMNTNKTYTLDVNEFSDLTDEEFKARYTGLVVPEGMTRISTTDSHETVSFR 120
Query: 128 YQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQ 187
Y+N+ T S+DW + AVT +K QQ+CGCCWAFSAVAAVEG+TKI+ L+ LSEQQ
Sbjct: 121 YENVGETG--ESMDWIQEGAVTSVKHQQQCGCCWAFSAVAAVEGMTKIANGELVSLSEQQ 178
Query: 188 LVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYE 247
L+DCST NNGCGGG M KAF+YI +NQGI TED YPYQ Q TC + AAA IS YE
Sbjct: 179 LLDCSTE-NNGCGGGIMWKAFDYIKENQGITTEDNYPYQGAQQTCES-NHLAAATISGYE 236
Query: 248 EVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTED 307
VP DE+ALLKAVS QPVS+ I EF Y GIFNG CGTQL HAVTIVG+G +E+
Sbjct: 237 TVPQNDEEALLKAVSQQPVSVAIEGSGYEFIHYSGGIFNGECGTQLTHAVTIVGYGVSEE 296
Query: 308 GANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYPLA 348
G YWL+KNSWG++WG+ GYM+I+RD +G+CG+ + + YP+A
Sbjct: 297 GIKYWLLKNSWGESWGENGYMRIMRDVDSPQGMCGLASLAYYPVA 341
>gi|124484401|dbj|BAF46311.1| cysteine proteinase precursor [Ipomoea nil]
Length = 339
Score = 373 bits (958), Expect = e-101, Method: Compositional matrix adjust.
Identities = 178/341 (52%), Positives = 242/341 (70%), Gaps = 11/341 (3%)
Query: 14 NTIPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKE 73
N++ + I + L+ + ++ + +SR+ + + HE+WMAQ+GR YK+E+EK R+ IFKE
Sbjct: 4 NSLKLLIALALVFATSAYLATSRTLLDSLMAVRHEQWMAQYGRVYKNEVEKTKRYNIFKE 63
Query: 74 NLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
N+EYIE NK G + YKLG N F+DLTN EF A GY +P H ++++ F+Y+N+S
Sbjct: 64 NVEYIESFNKAGTKPYKLGINAFADLTNKEFIASRNGYILP---HECSSNTPFRYENVSA 120
Query: 134 TDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCST 193
VPT++DWR K AVTP+KDQ +CGCCWAFSAVAA+EGITK+S NLI LSEQ+LVDC
Sbjct: 121 --VPTTVDWRKKGAVTPVKDQGQCGCCWAFSAVAAMEGITKLSTGNLISLSEQELVDCDV 178
Query: 194 NG-NNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAK-ISNYEEVPS 251
G + GC GG M+ AF +II N+G+ TE YPYQ G+C ++ + +A IS YE+VP+
Sbjct: 179 KGIDQGCEGGLMDDAFTFIINNKGLTTESNYPYQGTDGSCKKSKSSNSAAKISGYEDVPA 238
Query: 252 GDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANY 311
E AL KAV+ QPVS+ I A ++F+ Y G+F G CGT+LDH VT VG+G EDG+ Y
Sbjct: 239 NSESALEKAVANQPVSVAIDAGGSDFQFYSSGVFTGECGTELDHGVTAVGYGIAEDGSKY 298
Query: 312 WLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYPLA 348
WL+KNSWG +WG+ GY+++ +D EGLCGI QSSYP A
Sbjct: 299 WLVKNSWGTSWGEKGYIRMQKDIEAKEGLCGIAMQSSYPSA 339
>gi|297819568|ref|XP_002877667.1| hypothetical protein ARALYDRAFT_348033 [Arabidopsis lyrata subsp. lyrata]
gi|297323505|gb|EFH53926.1| hypothetical protein ARALYDRAFT_348033 [Arabidopsis lyrata subsp. lyrata]
Length = 341
Score = 372 bits (956), Expect = e-100, Method: Compositional matrix adjust.
Identities = 184/345 (53%), Positives = 240/345 (69%), Gaps = 13/345 (3%)
Query: 13 INTIPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFK 72
+ +I F++ I+L S S S E S +E HE+WM++ R Y D+ EK RF+IFK
Sbjct: 1 MTSIIFFLLAIILSSRTSGATSRGGLFEASAIEKHEQWMSRFHRVYSDDSEKTSRFEIFK 60
Query: 73 ENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHR-STTSS----TFK 127
+NL+++E N N+TY L N FSDLT++EF+A YTG +P R STT S +F+
Sbjct: 61 KNLKFVESFNMNTNKTYTLDVNEFSDLTDEEFKARYTGLVVPEGMTRMSTTDSHETVSFR 120
Query: 128 YQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQ 187
Y+N+ T S+DWR++ AVT +K QQ+CGCCWAFSAVAAVEG+TKI+ L+ LSEQQ
Sbjct: 121 YENVGETG--ESMDWREEGAVTSVKHQQQCGCCWAFSAVAAVEGMTKIAKGELVSLSEQQ 178
Query: 188 LVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYE 247
L+DCST N+GC GG M KAF+YI++NQGI ED YPYQ Q TC + AAA IS YE
Sbjct: 179 LLDCSTE-NDGCDGGIMWKAFDYIVENQGITAEDNYPYQGAQQTCES-NHVAAATISGYE 236
Query: 248 EVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTED 307
VP DE+ALLKAVS QPVS+ I EF Y GIFNG CGT L+HAVTIVG+G +E+
Sbjct: 237 TVPQNDEEALLKAVSQQPVSVAIEGSGYEFIHYSGGIFNGECGTHLNHAVTIVGYGVSEE 296
Query: 308 GANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYPLA 348
G YWL+KNSWG++WG+ GYM+I+RD +G+CG+ + + YP+A
Sbjct: 297 GIKYWLLKNSWGESWGEDGYMRIMRDVDAPQGMCGLASLAYYPVA 341
>gi|225443827|ref|XP_002274223.1| PREDICTED: vignain-like [Vitis vinifera]
Length = 340
Score = 372 bits (955), Expect = e-100, Method: Compositional matrix adjust.
Identities = 184/335 (54%), Positives = 239/335 (71%), Gaps = 10/335 (2%)
Query: 20 IIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIE 79
I ++++ ASQ +S R+ HE S+ E HE WM +GR+YKD EKE RFKIFKEN+EYIE
Sbjct: 10 ITLLIMGVWASQALS-RTLHEVSMSERHEDWMGLYGRTYKDIAEKERRFKIFKENVEYIE 68
Query: 80 KANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTS 139
N GNR YKL N F+D TN+EF+A GY M S RS+ ++F+Y+N++ VP+S
Sbjct: 69 SVNSAGNRRYKLSINEFADQTNEEFKASRNGYNMSSRP-RSSEITSFRYENVAA--VPSS 125
Query: 140 LDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNG-NNG 198
+DWR K AVTPIKDQ +CGCCWAFSAVAA+EG+T++ LI LSEQ+LVDC T+G + G
Sbjct: 126 MDWRKKGAVTPIKDQGQCGCCWAFSAVAAMEGVTQLKTGELISLSEQELVDCDTSGEDQG 185
Query: 199 CGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAK-ISNYEEVPSGDEQAL 257
CGGG M+ AFE+II N G+ TE YPY+ V TC+ + A++A I NYE+VP+ E AL
Sbjct: 186 CGGGLMDSAFEFIIGNGGLTTEANYPYKGVDATCNKKKAASSAAKIKNYEDVPANSEAAL 245
Query: 258 LKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNS 317
LKAV+ PVS+ I A ++F+ Y G+F G CGT+LDH VT VG+G T+DG YWL+KNS
Sbjct: 246 LKAVAQHPVSVAIDAGGSDFQFYSSGVFTGQCGTELDHGVTAVGYGKTDDGTKYWLVKNS 305
Query: 318 WGDTWGDAGYMKILR----DEGLCGIGTQSSYPLA 348
WG WG+ GY+ + R DEGLCGI ++SYP A
Sbjct: 306 WGTGWGEDGYIWMERDIGADEGLCGIAMEASYPTA 340
>gi|24285904|gb|AAL14199.1| cysteine proteinase precursor [Ipomoea batatas]
gi|56961686|gb|AAK15148.2| cysteine proteinase-like protein [Ipomoea batatas]
Length = 341
Score = 371 bits (953), Expect = e-100, Method: Compositional matrix adjust.
Identities = 177/323 (54%), Positives = 234/323 (72%), Gaps = 11/323 (3%)
Query: 32 VVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKL 91
+ +SR+ + +V HE+WMAQ+GR Y++E+EK RF IFKEN+EYIE NK G + YKL
Sbjct: 24 LATSRTLSDSLMVVRHEQWMAQYGRVYENEVEKTKRFNIFKENVEYIESFNKAGTKPYKL 83
Query: 92 GTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPI 151
G N F+DLTN EF+A GYK+P H ++++ F+Y+N+S VPT++DWR K AVTP+
Sbjct: 84 GINAFADLTNQEFKASRNGYKLP---HDCSSNTPFRYENVS--SVPTTVDWRTKGAVTPV 138
Query: 152 KDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNG-NNGCGGGTMEKAFEY 210
KDQ +CGCCWAFSAVAA+EGITK+S NLI LSEQ+LVDC G + GC GG M+ AF +
Sbjct: 139 KDQGQCGCCWAFSAVAAMEGITKLSTGNLISLSEQELVDCDVKGIDQGCEGGLMDDAFSF 198
Query: 211 IIQNQGIATEDEYPYQAVQGTCSAAQKAAAAK-ISNYEEVPSGDEQALLKAVSMQPVSIG 269
II N+G+ TE YPYQ G+C ++ + +A IS YE+VP+ E AL KAV+ QPVS+
Sbjct: 199 IINNKGLTTESNYPYQGTDGSCKKSKSSNSAAKISGYEDVPANSESALEKAVANQPVSVA 258
Query: 270 IAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMK 329
I A ++F+ Y G+F G CGT+LDH VT VG+G EDG+ YWL+KNSWG +WG+ GY++
Sbjct: 259 IDAGGSDFQFYSSGVFTGECGTELDHGVTAVGYGIAEDGSKYWLVKNSWGTSWGEKGYIR 318
Query: 330 ILRD----EGLCGIGTQSSYPLA 348
+ +D EGLCGI QSSYP A
Sbjct: 319 MQKDIEAKEGLCGIAMQSSYPSA 341
>gi|13491750|gb|AAK27968.1|AF242372_1 cysteine protease [Ipomoea batatas]
Length = 339
Score = 371 bits (952), Expect = e-100, Method: Compositional matrix adjust.
Identities = 178/323 (55%), Positives = 232/323 (71%), Gaps = 11/323 (3%)
Query: 32 VVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKL 91
+ +SR+ + +V HE+WMAQ+GR YK E EK RF IFKEN+EYIE NK G + YKL
Sbjct: 22 LATSRTLSDSLMVVRHEQWMAQYGRVYKTEAEKTKRFNIFKENVEYIESFNKAGTKPYKL 81
Query: 92 GTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPI 151
G N F+DLTN EF+A GYK+P H ++++ F+Y+N+S VPT++DWR K AVTP+
Sbjct: 82 GINAFADLTNQEFKASRNGYKLP---HDCSSNTPFRYENVS--SVPTTVDWRTKGAVTPV 136
Query: 152 KDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNN-GCGGGTMEKAFEY 210
KDQ +CGCCWAFSAVAA+EGITK+S NLI LSEQ+LVDC G + GC GG M+ AF +
Sbjct: 137 KDQGQCGCCWAFSAVAAMEGITKLSTGNLISLSEQELVDCDVKGTDQGCEGGLMDDAFSF 196
Query: 211 IIQNQGIATEDEYPYQAVQGTCSAAQKAAAAK-ISNYEEVPSGDEQALLKAVSMQPVSIG 269
II N+G+ TE YPYQ G+C ++ + +A IS YE+VP+ E AL KAV+ QPVS+
Sbjct: 197 IINNKGLTTESNYPYQGTDGSCKKSKSSNSAAKISGYEDVPANSESALEKAVANQPVSVA 256
Query: 270 IAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMK 329
I A ++F+ Y G+F G CGT+LDH VT VG+G EDG+ YWL+KNSWG +WG+ GY++
Sbjct: 257 IDAGGSDFQFYSSGVFTGECGTELDHGVTAVGYGIAEDGSKYWLVKNSWGTSWGEKGYIR 316
Query: 330 ILRD----EGLCGIGTQSSYPLA 348
+ +D EGLCGI QSSYP A
Sbjct: 317 MQKDIEAKEGLCGIAMQSSYPSA 339
>gi|224076968|ref|XP_002305072.1| predicted protein [Populus trichocarpa]
gi|222848036|gb|EEE85583.1| predicted protein [Populus trichocarpa]
Length = 305
Score = 371 bits (952), Expect = e-100, Method: Compositional matrix adjust.
Identities = 174/309 (56%), Positives = 225/309 (72%), Gaps = 11/309 (3%)
Query: 43 VVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTND 102
+++ HE+WMAQHGR Y D EKE R+ IFKEN+E IE N +R YKLG N+F+DLTN+
Sbjct: 1 MLKRHEEWMAQHGRVYGDMKEKEKRYLIFKENIERIEAFNNGSDRGYKLGVNKFADLTNE 60
Query: 103 EFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWA 162
EFRA+Y GYK S SS+F+Y+NLS D+PTS+DWR+ AVTP+KDQ CGCCWA
Sbjct: 61 EFRAMYHGYKRQSSK---LMSSSFRYENLS--DIPTSMDWRNDGAVTPVKDQGTCGCCWA 115
Query: 163 FSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDE 222
FS VAA+EGI K+ NLI LSEQQLVDC T GN GC GG M+ AF+YII+N G+ +ED
Sbjct: 116 FSTVAAIEGIIKLQTGNLISLSEQQLVDC-TAGNKGCQGGLMDTAFQYIIRNGGLTSEDN 174
Query: 223 YPYQAVQGTCSAAQKAAA-AKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYK 281
YPYQ V GTCS+ + A+ A+I+ YE+VP +E ALL+AV+ QPVS+ + +F+ YK
Sbjct: 175 YPYQGVDGTCSSEKAASTEAQITGYEDVPQNNENALLQAVAKQPVSVAVDGGGNDFRFYK 234
Query: 282 EGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILR----DEGLC 337
G+F G CGT L+H VT +G+GT DG +YWL+KNSWG +WG++GY ++ R EGLC
Sbjct: 235 SGVFEGDCGTNLNHGVTAIGYGTDSDGTDYWLVKNSWGTSWGESGYTRMQRGIGASEGLC 294
Query: 338 GIGTQSSYP 346
G+ +SYP
Sbjct: 295 GVAMDASYP 303
>gi|409190991|gb|AFV30165.1| cysteine proteinase [Lotus japonicus]
Length = 342
Score = 369 bits (948), Expect = e-99, Method: Compositional matrix adjust.
Identities = 181/338 (53%), Positives = 243/338 (71%), Gaps = 15/338 (4%)
Query: 19 FIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYI 78
F +++ L A QV SSR+ + S+ E HE+WMA++GR YKD EKE RF IFKEN+ YI
Sbjct: 12 FALVLCLGLWAFQV-SSRTLQDASMQERHEQWMARYGRVYKDLQEKEKRFSIFKENVNYI 70
Query: 79 EKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYK--MPSPSHRSTTSSTFKYQNLSMTDV 136
E +N G++ YKLG N+F+DLTN+EF A +K M S R+TT FKY+N++
Sbjct: 71 EASNNAGDKPYKLGVNQFADLTNEEFIATRNKFKGHMSSSITRTTT---FKYENVT---A 124
Query: 137 PTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNG- 195
P+++DWR + AVTP+K+Q CGCCWAFSAVAA EGI K+S NL+ LSEQ+LVDC T+G
Sbjct: 125 PSTVDWRQEGAVTPVKNQGTCGCCWAFSAVAATEGIHKLSTGNLVSLSEQELVDCDTSGA 184
Query: 196 NNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAA-AAKISNYEEVPSGDE 254
+ GC GG M+ AF++IIQN G+ TE +YPYQ V GTC+ ++A A I+ YE+VPS +E
Sbjct: 185 DQGCQGGLMDDAFKFIIQNGGLNTEAQYPYQGVDGTCNTNEEATHVATITGYEDVPSNNE 244
Query: 255 QALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLI 314
QAL +AV+ QP+SI I A ++F++Y+ G+F G CGTQLDH V +VG+G ++DG YWL+
Sbjct: 245 QALQQAVANQPISIAIDASGSDFQNYQSGVFTGSCGTQLDHGVAVVGYGVSDDGTKYWLV 304
Query: 315 KNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYPLA 348
KNSWG WG+ GY+++ RD EGLCG+ Q SYP A
Sbjct: 305 KNSWGADWGEEGYIRMQRDVDAPEGLCGLAMQPSYPTA 342
>gi|318136892|gb|ADV41672.1| cysteine protease [Nicotiana tabacum]
Length = 349
Score = 367 bits (943), Expect = 3e-99, Method: Compositional matrix adjust.
Identities = 174/338 (51%), Positives = 235/338 (69%), Gaps = 7/338 (2%)
Query: 18 MFIIIILLVSCASQVVSSRS-THEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLE 76
+ + I L SQV SSR +E S+ H++W+A H + YKD EKEMRFKIFKEN+E
Sbjct: 12 LALFFIFLGVWRSQVASSRPINYEASMRARHDQWIAHHDKVYKDLNEKEMRFKIFKENVE 71
Query: 77 YIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDV 136
IE N ++ YKLG N+FSDLTN++FR L+TGYK P S++ ++ ++TD+
Sbjct: 72 RIEAFNAGEDKGYKLGVNKFSDLTNEKFRVLHTGYKRSHPKVMSSSKPKTHFRYANVTDI 131
Query: 137 PTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNG- 195
P ++DWR K AVTPIKDQ+ECGCCWAFSAVAA EG+ ++ LI LSEQ+LVDC G
Sbjct: 132 PPTMDWRKKGAVTPIKDQKECGCCWAFSAVAATEGLHQLKTGKLIPLSEQELVDCDVEGE 191
Query: 196 NNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKA-AAAKISNYEEVPSGDE 254
+ GC GG ++ AF++I++N+G+ TE YPY+ G C+ + A +AAKI+ YE+VP+ E
Sbjct: 192 DEGCSGGLLDTAFDFILKNKGLTTEANYPYKGEDGVCNKKKSALSAAKIAGYEDVPANSE 251
Query: 255 QALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLI 314
+ALL+AV+ QPVS+ I + +F+ Y G+F+G C T L+HAVT VG+G T DG YW+I
Sbjct: 252 KALLQAVANQPVSVAIDGSSFDFQFYSSGVFSGSCSTWLNHAVTAVGYGATTDGTKYWII 311
Query: 315 KNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYPLA 348
KNSWG WGD+GYM+I RD EGLCG+ +SYP A
Sbjct: 312 KNSWGSKWGDSGYMRIKRDVHEKEGLCGLAMDASYPTA 349
>gi|144905104|dbj|BAF56427.1| cysteine proteinase [Lotus japonicus]
Length = 342
Score = 367 bits (943), Expect = 4e-99, Method: Compositional matrix adjust.
Identities = 179/338 (52%), Positives = 243/338 (71%), Gaps = 15/338 (4%)
Query: 19 FIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYI 78
F +++ L A QV SSR+ + S+ E HE+WMA++G+ YKD EKE RF IF+EN++YI
Sbjct: 12 FALVLCLGLWAFQV-SSRTLQDASMHERHEQWMARYGKVYKDLQEKEKRFNIFQENVKYI 70
Query: 79 EKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYK--MPSPSHRSTTSSTFKYQNLSMTDV 136
E +N GN+ YKLG N+F+DLTN EF A +K M S R+TT FKY+N++
Sbjct: 71 EASNNAGNKPYKLGVNQFTDLTNKEFIATRNKFKGHMSSSITRTTT---FKYENVT---A 124
Query: 137 PTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNG- 195
P+++DWR + AVTP+K+Q CGCCWAFSAVAA EGI K+S NL+ LSEQ+LVDC T+G
Sbjct: 125 PSTVDWRQEGAVTPVKNQGTCGCCWAFSAVAATEGIHKLSTGNLVSLSEQELVDCDTSGA 184
Query: 196 NNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAA-AAKISNYEEVPSGDE 254
+ GC GG M+ AF++IIQN G+ TE +YPYQ V GTC+ ++ A I+ YE+VPS +E
Sbjct: 185 DQGCQGGLMDDAFKFIIQNGGLNTEAQYPYQGVDGTCNTNEEVTHVATITGYEDVPSNNE 244
Query: 255 QALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLI 314
QAL +AV+ QP+S+ I A ++F++Y+ G+F G CGTQLDH V +VG+G ++DG YWL+
Sbjct: 245 QALQQAVANQPISVAIDASGSDFQNYQSGVFTGSCGTQLDHGVAVVGYGVSDDGTKYWLV 304
Query: 315 KNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYPLA 348
KNSWG+ WG+ GY+++ RD EGLCGI Q SYP A
Sbjct: 305 KNSWGEDWGEEGYIRMQRDVEAPEGLCGIAMQPSYPTA 342
>gi|50355621|dbj|BAD29959.1| cysteine protease [Daucus carota]
Length = 361
Score = 367 bits (942), Expect = 5e-99, Method: Compositional matrix adjust.
Identities = 180/337 (53%), Positives = 237/337 (70%), Gaps = 10/337 (2%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEY 77
M +ILL + A Q +SR+ E S+ E HE+WM Q+GR YKDE EK +RF+IF +N+++
Sbjct: 29 MIAALILLGAWACQA-TSRTLPEASMFERHEQWMIQYGRVYKDEAEKSVRFQIFMDNVKF 87
Query: 78 IEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
IE+ NK+G ++YKL N F+D TN+EF+A GYKM + S R + ++ F+Y+N+ T VP
Sbjct: 88 IEEFNKDGRQSYKLAVNEFADQTNEEFQASRNGYKM-AVSSRPSQTTLFRYENV--TAVP 144
Query: 138 TSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNG-N 196
+S+DWR K AVTP+KDQ +CG CWAFS +AA EGITK+ LI LSEQ+LVDC G +
Sbjct: 145 SSMDWRKKGAVTPVKDQGQCGSCWAFSTIAATEGITKLKTGKLISLSEQELVDCDKTGED 204
Query: 197 NGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAA-AAKISNYEEVPSGDEQ 255
GC GG ME FE+I++N+GIA E YPY A GTC++ ++A+ AAKIS YE+VP+ E
Sbjct: 205 QGCEGGYMEDGFEFIVKNKGIALEASYPYTAADGTCNSKEEASRAAKISGYEKVPANSET 264
Query: 256 ALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIK 315
ALLKAV+ QPVS+ I A F+ Y G+F G CGT LDH VT VG+G T DG YWL+K
Sbjct: 265 ALLKAVANQPVSVSIDASGVAFQFYSSGVFTGECGTDLDHGVTAVGYGKTSDGTKYWLVK 324
Query: 316 NSWGDTWGDAGYMKILR----DEGLCGIGTQSSYPLA 348
NSWG +WGD+GY+ + R GLCGI +SYP A
Sbjct: 325 NSWGASWGDSGYIMMQRGVAAKGGLCGIAMDASYPTA 361
>gi|535454|gb|AAA50755.1| cysteine proteinase [Alnus glutinosa]
Length = 340
Score = 367 bits (941), Expect = 6e-99, Method: Compositional matrix adjust.
Identities = 177/337 (52%), Positives = 244/337 (72%), Gaps = 13/337 (3%)
Query: 19 FIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYI 78
++++ L + ASQ+ ++RS + S+ E HE+WMA +GR YKD EK+ R+KIF+EN+ I
Sbjct: 10 LVVMVTLGALASQLAAARSLQDASMRERHEEWMASYGRVYKDINEKQKRYKIFEENVALI 69
Query: 79 EKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHR-STTSSTFKYQNLSMTDVP 137
E +NK+ N+ YKL N+F+DLTN+EF+A +K H ST S++FKY N+S VP
Sbjct: 70 ESSNKDANKPYKLSVNQFADLTNEEFKASRNRFK----GHICSTKSTSFKYGNVSA--VP 123
Query: 138 TSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNG-N 196
+++DWR K AVTP+KDQ +CGCCWAFSAVAA EGITK++ LI LSEQ+LVDC T+G +
Sbjct: 124 SAMDWRMKGAVTPVKDQGQCGCCWAFSAVAATEGITKLTTGELISLSEQELVDCDTSGVD 183
Query: 197 NGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAA-AAKISNYEEVPSGDEQ 255
GC GG M+ AF +I N G+A+E YPY+ V GTC+ ++A AA+I+ +E+VP+ E+
Sbjct: 184 QGCEGGLMDNAFTFIQHNHGLASEANYPYKGVDGTCNTNKQAIHAAEINGFEDVPANSEE 243
Query: 256 ALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIK 315
ALL AV+ QPVS+ I A + F+ Y +G+F G CGTQLDH VT VG+GT++DG YWL+K
Sbjct: 244 ALLNAVAHQPVSVAIDAGGSGFQFYSKGVFIGACGTQLDHGVTAVGYGTSDDGTKYWLVK 303
Query: 316 NSWGDTWGDAGYMKILRD----EGLCGIGTQSSYPLA 348
NSWG WG+ GY+++ RD EGLCGI ++SYP A
Sbjct: 304 NSWGTQWGEEGYIRMQRDVDAKEGLCGIAMKASYPTA 340
>gi|225446589|ref|XP_002280263.1| PREDICTED: vignain [Vitis vinifera]
Length = 339
Score = 367 bits (941), Expect = 6e-99, Method: Compositional matrix adjust.
Identities = 182/334 (54%), Positives = 239/334 (71%), Gaps = 14/334 (4%)
Query: 21 IIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEK 80
++ +L + ASQ +SRS HE S+ E HE WMA++GR YKD EKE RFKIFK+N+ IE
Sbjct: 14 LLFILAAWASQA-TSRSLHEASMYERHEDWMARYGRMYKDANEKEKRFKIFKDNVARIES 72
Query: 81 ANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSL 140
NK ++TYKL N F+DLTN+EFR+L +K +H + ++TFKY+N+ T VP+++
Sbjct: 73 FNKAMDKTYKLSINEFADLTNEEFRSLRNRFK----AHICSEATTFKYENV--TAVPSTI 126
Query: 141 DWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNG-NNGC 199
DWR K AVTPIKDQQ+CGCCWAFSAVAA EGIT+I+ LI LSEQ+LVDC T G N GC
Sbjct: 127 DWRKKGAVTPIKDQQQCGCCWAFSAVAATEGITQITTGKLISLSEQELVDCDTGGENQGC 186
Query: 200 GGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKA-AAAKISNYEEVPSGDEQALL 258
GG M+ AF + I+ G+A+E YPY+ GTC++ ++A AAKI YE+VP+ +E+AL
Sbjct: 187 SGGLMDDAFRF-IKIHGLASEATYPYEGDDGTCNSKKEAHPAAKIKGYEDVPANNEKALQ 245
Query: 259 KAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSW 318
KAV+ QPV++ I A EF+ Y G+F G CGT+LDH V VG+G +DG YWL+KNSW
Sbjct: 246 KAVAHQPVAVAIDAGGFEFQFYTSGVFTGQCGTELDHGVAAVGYGIGDDGMMYWLVKNSW 305
Query: 319 GDTWGDAGYMKILRD----EGLCGIGTQSSYPLA 348
G WG+ GY+++ RD EGLCGI Q+SYP A
Sbjct: 306 GTGWGEEGYIRMQRDVTAKEGLCGIAMQASYPTA 339
>gi|224114698|ref|XP_002316833.1| predicted protein [Populus trichocarpa]
gi|222859898|gb|EEE97445.1| predicted protein [Populus trichocarpa]
Length = 305
Score = 366 bits (939), Expect = 1e-98, Method: Compositional matrix adjust.
Identities = 174/310 (56%), Positives = 226/310 (72%), Gaps = 10/310 (3%)
Query: 44 VEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDE 103
+E HE WMAQ+GR+YK +EKE R IFK N+E+IE NK G + YKL N F+DLTN+E
Sbjct: 1 MERHETWMAQYGRAYKGHVEKERRLNIFKNNVEFIESFNKVGKKPYKLSVNEFADLTNEE 60
Query: 104 FRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAF 163
F+A GYKM S S+++ F+Y+N+S VP+++DWR K AVTPIKDQ +CGCCWAF
Sbjct: 61 FQASRNGYKM-SAHLSSSSTKPFRYENVSA--VPSTMDWRKKGAVTPIKDQGQCGCCWAF 117
Query: 164 SAVAAVEGITKISGANLIQLSEQQLVDCSTNG-NNGCGGGTMEKAFEYIIQNQGIATEDE 222
SAVAA EGIT++S LI LSEQ+LVDC T+G + GC GG M+ AF++IIQN+G+ TE
Sbjct: 118 SAVAATEGITQLSTGKLISLSEQELVDCDTSGEDQGCNGGLMDDAFDFIIQNKGLTTEAN 177
Query: 223 YPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKE 282
YPYQ G C++ + AAAKI+ YE+VP+ E ALLKAV+ QPVS+ I A + F+ Y
Sbjct: 178 YPYQGADGACNSGK--AAAKITGYEDVPANSEAALLKAVANQPVSVAIDAGGSAFQFYSS 235
Query: 283 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD----EGLCG 338
G+F G CGT LDH VT VG+G ++DG YWL+KNSWG +WG+ GY+++ RD EGLCG
Sbjct: 236 GVFTGDCGTDLDHGVTAVGYGMSDDGTKYWLVKNSWGTSWGENGYIRMERDIDAQEGLCG 295
Query: 339 IGTQSSYPLA 348
I ++SYP A
Sbjct: 296 IAMEASYPTA 305
>gi|144905108|dbj|BAF56428.1| cysteine proteinase [Lotus japonicus]
Length = 342
Score = 365 bits (936), Expect = 2e-98, Method: Compositional matrix adjust.
Identities = 179/346 (51%), Positives = 241/346 (69%), Gaps = 18/346 (5%)
Query: 11 FKINTIPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKI 70
I+++ + ++ L A+ +R+ + S+ E HE+WM Q+G+ Y D EKE+R I
Sbjct: 7 LNISSLALLLVFGFLAFEAN----ARTLEDVSLKERHEQWMTQYGKVYTDSYEKELRSNI 62
Query: 71 FKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRAL--YTGYKMPSPSHRSTTSSTFKY 128
FKEN++ IE N GN+ YKLG N+F+DLTN+EF+A + G+ + ST + TFKY
Sbjct: 63 FKENVQRIEAFNNAGNKPYKLGINQFADLTNEEFKARNRFKGHMCSN----STRTPTFKY 118
Query: 129 QNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQL 188
+++S VP SLDWR K AVTPIKDQ +CGCCWAFSAVAA EGITK+S LI LSEQ+L
Sbjct: 119 EDVS--SVPASLDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGITKLSTGKLISLSEQEL 176
Query: 189 VDCSTNG-NNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSA-AQKAAAAKISNY 246
VDC T G + GC GG M+ AF++I+QN+G+ TE +YPYQ V TC+A A+ AA I +
Sbjct: 177 VDCDTKGVDQGCEGGLMDDAFKFIMQNKGLNTEAKYPYQGVDATCNANAEAKDAASIKGF 236
Query: 247 EEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTE 306
E+VP+ E ALLKAV+ QP+S+ I A +EF+ Y G+F G CGT+LDH VT VG+G ++
Sbjct: 237 EDVPANSESALLKAVANQPISVAIDASGSEFQFYSSGLFTGSCGTELDHGVTAVGYGVSD 296
Query: 307 DGANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYPLA 348
DG YWL+KNSWG+ WG+ GY+++ RD EGLCGI Q+SYP A
Sbjct: 297 DGTKYWLVKNSWGEQWGEEGYIRMQRDVAAEEGLCGIAMQASYPTA 342
>gi|144905116|dbj|BAF56430.1| cysteine proteinase [Lotus japonicus]
Length = 341
Score = 365 bits (936), Expect = 2e-98, Method: Compositional matrix adjust.
Identities = 179/337 (53%), Positives = 240/337 (71%), Gaps = 15/337 (4%)
Query: 20 IIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIE 79
+ ++L+ S ++R+ + S+ E HE+WMAQ+G+ YKD EKE+R KIFKEN++ IE
Sbjct: 12 LTLLLVFGFLSFEANARTLEDASMHERHEQWMAQYGKVYKDSYEKELRSKIFKENVQRIE 71
Query: 80 KANKEGNRTYKLGTNRFSDLTNDEFRAL--YTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
N GN++YKLG N+F+DLTN+EF+A + G+ + ST + TFKY+++ T VP
Sbjct: 72 AFNNAGNKSYKLGINQFADLTNEEFKARNRFKGHMCSN----STRTPTFKYEHV--TSVP 125
Query: 138 TSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNG-N 196
SLDWR K AVTPIKDQ +CGCCWAFSAVAA EGITK+S LI LSEQ+LVDC T G +
Sbjct: 126 ASLDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGITKLSTGKLISLSEQELVDCDTKGVD 185
Query: 197 NGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSA-AQKAAAAKISNYEEVPSGDEQ 255
GC GG M+ AF++I+QN+G+ TE +YPYQ V TC+A A+ AA I +E+VP+ E
Sbjct: 186 QGCEGGLMDDAFKFIMQNKGLNTEAKYPYQGVDATCNANAEAKDAASIKGFEDVPANSES 245
Query: 256 ALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIK 315
ALLKAV+ QP+S+ I A +EF+ Y G+F G CGT+LDH VT VG+G ++ G YWL+K
Sbjct: 246 ALLKAVANQPISVAIDASGSEFQFYSSGVFTGSCGTELDHGVTAVGYG-SDGGTKYWLVK 304
Query: 316 NSWGDTWGDAGYMKILRD----EGLCGIGTQSSYPLA 348
NSWG+ WG+ GY+++ RD EGLCG Q+SYP A
Sbjct: 305 NSWGEQWGEQGYIRMQRDVAAEEGLCGFAMQASYPTA 341
>gi|224076972|ref|XP_002305074.1| predicted protein [Populus trichocarpa]
gi|224106329|ref|XP_002333698.1| predicted protein [Populus trichocarpa]
gi|222837984|gb|EEE76349.1| predicted protein [Populus trichocarpa]
gi|222848038|gb|EEE85585.1| predicted protein [Populus trichocarpa]
Length = 307
Score = 364 bits (935), Expect = 3e-98, Method: Compositional matrix adjust.
Identities = 172/312 (55%), Positives = 224/312 (71%), Gaps = 11/312 (3%)
Query: 43 VVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTND 102
+++ HE+WMAQHGR Y D EKE R+ IFKEN+E IE N +R YKLG N+F+DLTN+
Sbjct: 1 MLKRHEEWMAQHGRVYGDMKEKEKRYLIFKENIERIEAFNNGSDRGYKLGVNKFADLTNE 60
Query: 103 EFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWA 162
EFRA++ GYK S SS+F+++NLS +PTS+DWR AVTP+KDQ CGCCWA
Sbjct: 61 EFRAMHHGYKRQSSK---LMSSSFRHENLSA--IPTSMDWRKAGAVTPVKDQGTCGCCWA 115
Query: 163 FSAVAAVEGITKISGANLIQLSEQQLVDCSTNG-NNGCGGGTMEKAFEYIIQNQGIATED 221
FSAVAA+EGI K+ LI LSEQQLVDC G + GCGGG M+ AF++I++N G+ +E
Sbjct: 116 FSAVAAIEGIIKLKTGKLISLSEQQLVDCDVKGVDQGCGGGLMDNAFQFILRNGGLTSEA 175
Query: 222 EYPYQAVQGTCSAAQKAA-AAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSY 280
YPYQ V GTC + + A+ AKI+ YE+VP +E ALL+AV+ QPVS+ + +F+ Y
Sbjct: 176 TYPYQGVDGTCKSKKTASIEAKITGYEDVPVNNENALLQAVAKQPVSVAVEGGGYDFQFY 235
Query: 281 KEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD----EGL 336
K G+F G CGT LDHAVT +G+GT DG NYWL+KNSWG +WG++GYM++ R EGL
Sbjct: 236 KSGVFKGDCGTYLDHAVTAIGYGTNSDGTNYWLVKNSWGTSWGESGYMRMQRGIGAREGL 295
Query: 337 CGIGTQSSYPLA 348
CG+ +SYP A
Sbjct: 296 CGVAMDASYPTA 307
>gi|224135841|ref|XP_002327317.1| predicted protein [Populus trichocarpa]
gi|222835687|gb|EEE74122.1| predicted protein [Populus trichocarpa]
Length = 342
Score = 363 bits (933), Expect = 4e-98, Method: Compositional matrix adjust.
Identities = 175/336 (52%), Positives = 229/336 (68%), Gaps = 9/336 (2%)
Query: 19 FIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYI 78
F IL++ + V+SR E S+ HE+WM G+ Y D EKE RF+IFK+N+EYI
Sbjct: 10 FFAFILILGMWAYEVASRELQEPSMSARHEQWMETFGKVYADAAEKERRFEIFKDNVEYI 69
Query: 79 EKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPT 138
E N GN+ YKL N+F+DLTN+E + GY+ P + R ++FKY+N+ T VP
Sbjct: 70 ESFNTAGNKPYKLSVNKFADLTNEELKVARNGYRRPLQT-RPMKVTSFKYENV--TAVPA 126
Query: 139 SLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNG-NN 197
++DWR K AVTPIKDQ +CG CWAFS VAA EGI +++ L+ LSEQ+LVDC T G +
Sbjct: 127 TMDWRKKGAVTPIKDQGQCGSCWAFSTVAATEGINQLTTGKLVSLSEQELVDCDTQGEDQ 186
Query: 198 GCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAA-AKISNYEEVPSGDEQA 256
GC GG ME FE+II+N GI TE YPYQA GTC++ ++A+ AKI+ YE VP+ E A
Sbjct: 187 GCEGGLMEDGFEFIIKNHGITTEANYPYQAADGTCNSKKEASRIAKITGYESVPANSEAA 246
Query: 257 LLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKN 316
LLKAV+ QP+S+ I A ++F+ Y G+F G CGT+LDH VT VG+G T DG YWL+KN
Sbjct: 247 LLKAVASQPISVSIDAGGSDFQFYSSGVFTGQCGTELDHGVTAVGYGETSDGTKYWLVKN 306
Query: 317 SWGDTWGDAGYMKILRD----EGLCGIGTQSSYPLA 348
SWG +WG+ GY+++ RD EGLCGI SSYP A
Sbjct: 307 SWGTSWGEEGYIRMQRDTEAEEGLCGIAMDSSYPTA 342
>gi|225446583|ref|XP_002280204.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1 [Vitis vinifera]
Length = 341
Score = 361 bits (926), Expect = 3e-97, Method: Compositional matrix adjust.
Identities = 177/337 (52%), Positives = 240/337 (71%), Gaps = 14/337 (4%)
Query: 19 FIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYI 78
++ +L + ASQ ++R+ HE S+ E HE WM Q+GR YKD EK R+KIFK+N+ I
Sbjct: 12 LALLFVLAAWASQA-TARNLHEASMYERHEDWMVQYGREYKDADEKSKRYKIFKDNVARI 70
Query: 79 EKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHR-STTSSTFKYQNLSMTDVP 137
E NK +++YKL N F+DLTN+EFRA +K +H ST +++FKY+N+ T VP
Sbjct: 71 ESFNKAMDKSYKLSINEFADLTNEEFRASRNRFK----AHICSTEATSFKYENV--TAVP 124
Query: 138 TSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNG-N 196
+++DWR K AVTPIKDQ +CG CWAFSAVAA+EGIT++S LI LSEQ+LVDC T+G +
Sbjct: 125 STVDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGED 184
Query: 197 NGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKA-AAAKISNYEEVPSGDEQ 255
GC GG M+ AF++I QN G+ TE YPY GTC+ + A AAKI+ YE+VP+ +E+
Sbjct: 185 QGCSGGLMDDAFKFIEQNHGLTTEANYPYAGTDGTCNRKKAAHPAAKINGYEDVPANNEK 244
Query: 256 ALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIK 315
AL KAV+ QP+++ I A +EF+ Y G+F G CGT+LDH V+ VG+GT++DG YWL+K
Sbjct: 245 ALQKAVAHQPIAVAIDAGGSEFQFYSSGVFTGQCGTELDHGVSAVGYGTSDDGMKYWLVK 304
Query: 316 NSWGDTWGDAGYMKILRD----EGLCGIGTQSSYPLA 348
NSWG WG+ GY+++ RD EGLCGI Q+SYP A
Sbjct: 305 NSWGTGWGEEGYIRMQRDVTAKEGLCGIAMQASYPTA 341
>gi|10336513|dbj|BAB13759.1| cysteine proteinase [Astragalus sinicus]
Length = 343
Score = 360 bits (924), Expect = 5e-97, Method: Compositional matrix adjust.
Identities = 178/342 (52%), Positives = 243/342 (71%), Gaps = 16/342 (4%)
Query: 18 MFIIIILLVSCA---SQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKEN 74
++ I + L+ C + V+SR+ + S+ E H++WM Q+ + Y D E E RF+IFKEN
Sbjct: 7 LYYISLALLMCLGLWAVQVTSRTLQDASMYERHQQWMGQYAKIYNDHQEWEKRFQIFKEN 66
Query: 75 LEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYK--MPSPSHRSTTSSTFKYQNLS 132
+ YIE +NKEG R YKLG N+F DLTN+EF A +K M S R+ +T+KY+N+
Sbjct: 67 VNYIETSNKEGGRFYKLGVNQFVDLTNEEFIAPRNRFKGHMCSSIIRT---NTYKYENV- 122
Query: 133 MTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCS 192
T VP+++DWR K AVTP+KDQ +CGCCWAFSAVAA EGI ++S LI LSEQ+LVDC
Sbjct: 123 -TTVPSNVDWRQKGAVTPVKDQGQCGCCWAFSAVAATEGIHQLSTGKLISLSEQELVDCD 181
Query: 193 TNG-NNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAA-AAKISNYEEVP 250
T G + GC GG M+ AF++IIQN G+ TE +YPYQ V GTC+A + + AA I++YE+VP
Sbjct: 182 TKGVDQGCEGGLMDDAFKFIIQNHGLDTEAKYPYQGVDGTCNANEASINAATITSYEDVP 241
Query: 251 SGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGAN 310
+ +EQAL KAV+ QP+S+ I A ++F+ Y G+F G CGT+LDH VT VG+G ++DG
Sbjct: 242 TNNEQALQKAVANQPISVAIDASGSDFQFYTSGVFTGSCGTELDHGVTAVGYGVSDDGTK 301
Query: 311 YWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYPLA 348
YWL+KNSWG +WG+ GY+++ R EGLCGI Q+SYP+A
Sbjct: 302 YWLVKNSWGTSWGEEGYIRMQRGVDAVEGLCGIAMQASYPIA 343
>gi|225446581|ref|XP_002280246.1| PREDICTED: vignain [Vitis vinifera]
Length = 341
Score = 360 bits (924), Expect = 5e-97, Method: Compositional matrix adjust.
Identities = 177/337 (52%), Positives = 238/337 (70%), Gaps = 14/337 (4%)
Query: 19 FIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYI 78
++ +L + ASQ ++RS HE S+ E HE WM Q+GR YKD EK R+KIFK+N+ I
Sbjct: 12 LALLFVLAAWASQA-TARSLHEASMYERHEDWMVQYGREYKDADEKSKRYKIFKDNVARI 70
Query: 79 EKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHR-STTSSTFKYQNLSMTDVP 137
E NK +++YKL N F+DLTN+EFRA +K +H ST +++FKY+N+ T VP
Sbjct: 71 ESFNKAMDKSYKLSINEFADLTNEEFRASRNRFK----AHICSTEATSFKYENV--TAVP 124
Query: 138 TSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNG-N 196
+++DWR K AVTPIKDQ +CG CWAFSAVAA+EGIT++S LI LSEQ+LVDC T+G +
Sbjct: 125 STVDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGED 184
Query: 197 NGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKA-AAAKISNYEEVPSGDEQ 255
GC GG M+ AF++I QN G+ TE YPY GTC+ + A AAKI+ YE+VP+ +E+
Sbjct: 185 QGCSGGLMDDAFKFIEQNHGLTTEANYPYAGTDGTCNRKKAAHPAAKINGYEDVPANNEK 244
Query: 256 ALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIK 315
AL KAV+ QP+++ I A +EF+ Y G+F G CGT+LDH V VG+GT++DG YWL+K
Sbjct: 245 ALQKAVAHQPIAVAIDASGSEFQFYSSGVFTGQCGTELDHGVAAVGYGTSDDGMKYWLVK 304
Query: 316 NSWGDTWGDAGYMKILRD----EGLCGIGTQSSYPLA 348
NSW WG+ GY+++ RD EGLCGI Q+SYP A
Sbjct: 305 NSWSTGWGEEGYIRMQRDVTAKEGLCGIAMQASYPTA 341
>gi|37780045|gb|AAP32195.1| cysteine protease 5 [Trifolium repens]
Length = 343
Score = 360 bits (923), Expect = 8e-97, Method: Compositional matrix adjust.
Identities = 179/343 (52%), Positives = 240/343 (69%), Gaps = 19/343 (5%)
Query: 15 TIPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKEN 74
++ +F + LL + V+SR+ + S+ E HE+WM +G+ YK+ E+E R +IF EN
Sbjct: 11 SLALFFCLGLL----AIQVTSRTLQDDSIFERHEQWMTHYGKVYKNPQEREKRLRIFTEN 66
Query: 75 LEYIEKANKEGN-RTYKLGTNRFSDLTNDEFRALYTGYK--MPSPSHRSTTSSTFKYQNL 131
L+YIE +N GN + YKLG N+F+DLTN+EF A +K M S R+TT FKY+N
Sbjct: 67 LKYIEASNNAGNNKPYKLGINQFADLTNEEFIASRNKFKGHMCSSIIRTTT---FKYEN- 122
Query: 132 SMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDC 191
T VP+++DWR K AVTP+K+Q +CGCCWAFSA+AA EGI KIS L+ LSEQ+LVDC
Sbjct: 123 --TSVPSTVDWRKKGAVTPVKNQGQCGCCWAFSAIAATEGIHKISTGKLVSLSEQELVDC 180
Query: 192 STNG-NNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKA-AAAKISNYEEV 249
TNG + GC GG M+ AF++IIQN GI+TE YPYQ V GTC A + + +AA I+ YE+V
Sbjct: 181 DTNGVDQGCEGGLMDDAFKFIIQNNGISTEAGYPYQGVDGTCKANEASTSAATITGYEDV 240
Query: 250 PSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGA 309
P+ +E AL KAV+ QP+S+ I A ++F+ YK G+F G CGT+LDH VT VG+G + DG
Sbjct: 241 PANNENALQKAVANQPISVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGISNDGT 300
Query: 310 NYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYPLA 348
YWL+KNSWG WG+ GY+++ R EGLCGI Q+SYP A
Sbjct: 301 KYWLVKNSWGTDWGEEGYIRMQRSIDAAEGLCGIAMQASYPTA 343
>gi|37780051|gb|AAP32198.1| cysteine protease 12 [Trifolium repens]
Length = 343
Score = 360 bits (923), Expect = 8e-97, Method: Compositional matrix adjust.
Identities = 179/343 (52%), Positives = 240/343 (69%), Gaps = 19/343 (5%)
Query: 15 TIPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKEN 74
++ +F + LL + V+SR+ + S+ E HE+WM +G+ YK+ E+E R +IF EN
Sbjct: 11 SLALFFCLGLL----AIQVTSRTLQDDSIFERHEQWMTHYGKVYKNPQEREKRLRIFTEN 66
Query: 75 LEYIEKANKEGNRT-YKLGTNRFSDLTNDEFRALYTGYK--MPSPSHRSTTSSTFKYQNL 131
L+YIE +N GN+ YKLG N+F+DLTN+EF A +K M S R+TT FKY+N
Sbjct: 67 LKYIEASNNAGNKKPYKLGINQFADLTNEEFIASRNKFKGHMCSSIIRTTT---FKYEN- 122
Query: 132 SMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDC 191
T VP+++DWR K AVTP+K+Q +CGCCWAFSA+AA EGI KIS L+ LSEQ+LVDC
Sbjct: 123 --TSVPSTVDWRKKGAVTPVKNQGQCGCCWAFSAIAATEGIHKISTGKLVSLSEQELVDC 180
Query: 192 STNG-NNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKA-AAAKISNYEEV 249
TNG + GC GG M+ AF++IIQN GI+TE YPYQ V GTC A + + +AA I+ YE+V
Sbjct: 181 DTNGVDQGCEGGLMDDAFKFIIQNNGISTEAGYPYQGVDGTCKANEASTSAATITGYEDV 240
Query: 250 PSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGA 309
P+ +E AL KAV+ QP+S+ I A ++F+ YK G+F G CGT+LDH VT VG+G + DG
Sbjct: 241 PANNENALQKAVANQPISVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGISNDGT 300
Query: 310 NYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYPLA 348
YWL+KNSWG WG+ GY+++ R EGLCGI Q+SYP A
Sbjct: 301 KYWLVKNSWGTDWGEEGYIRMQRSIDAAEGLCGIAMQASYPTA 343
>gi|37780047|gb|AAP32196.1| cysteine protease 8 [Trifolium repens]
Length = 343
Score = 359 bits (922), Expect = 1e-96, Method: Compositional matrix adjust.
Identities = 176/342 (51%), Positives = 239/342 (69%), Gaps = 16/342 (4%)
Query: 18 MFIIIILLVSCASQV---VSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKEN 74
++ I + LV C V+SR+ + S+ E H +WM+Q+G+ YKD E+E RFKIF EN
Sbjct: 7 VYHISLALVFCLGLFAIQVTSRTLQDDSMYERHGQWMSQYGKIYKDHQERETRFKIFTEN 66
Query: 75 LEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYK--MPSPSHRSTTSSTFKYQNLS 132
+ Y+E +N + ++YKLG N+F+DLTN+EF A +K M S R+TT FKY+N+S
Sbjct: 67 VNYVEASNADDTKSYKLGINQFADLTNEEFVASRNKFKGHMCSSITRTTT---FKYENVS 123
Query: 133 MTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCS 192
+P+++DWR K AVTP+K+Q +CGCCWAFSAVAA EGI K+S LI LSEQ+LVDC
Sbjct: 124 A--IPSTVDWRKKGAVTPVKNQGQCGCCWAFSAVAATEGIHKLSTGKLISLSEQELVDCD 181
Query: 193 TNG-NNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAA-AAKISNYEEVP 250
T G + GC GG M+ AF++IIQN G++TE +YPY+ V GTC+A + + A I+ YE+VP
Sbjct: 182 TKGVDQGCEGGLMDDAFKFIIQNHGLSTEAQYPYEGVDGTCNANKASVQAVTITGYEDVP 241
Query: 251 SGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGAN 310
+ EQAL KAV+ QP+S+ I A ++F+ YK G+F G CGT+LDH VT VG+G + DG
Sbjct: 242 ANSEQALQKAVANQPISVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVSNDGTK 301
Query: 311 YWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYPLA 348
YWL+KNSWG WG+ GY+ + R EGLCGI Q+SYP A
Sbjct: 302 YWLVKNSWGTDWGEEGYIMMQRGVEAAEGLCGIAMQASYPTA 343
>gi|147839728|emb|CAN70559.1| hypothetical protein VITISV_032465 [Vitis vinifera]
Length = 341
Score = 358 bits (919), Expect = 2e-96, Method: Compositional matrix adjust.
Identities = 176/337 (52%), Positives = 237/337 (70%), Gaps = 14/337 (4%)
Query: 19 FIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYI 78
++ +L + ASQ ++R HE S+ E HE WM Q+GR YKD EK R+KIFK+N+ I
Sbjct: 12 LALLFVLAAWASQA-TARXLHEASMYERHEDWMVQYGREYKDADEKSKRYKIFKDNVARI 70
Query: 79 EKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHR-STTSSTFKYQNLSMTDVP 137
E NK +++YKL N F+DLTN+EFRA +K +H ST +++FKY+N+ T VP
Sbjct: 71 ESFNKAMDKSYKLSINEFADLTNEEFRASRNRFK----AHICSTEATSFKYENV--TAVP 124
Query: 138 TSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNG-N 196
+++DWR K AVTPIKDQ +CG CWAFSAVAA+EGIT++S LI LSEQ+LVDC T+G +
Sbjct: 125 STVDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGED 184
Query: 197 NGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKA-AAAKISNYEEVPSGDEQ 255
GC GG M+ AF++I QN G+ TE YPY GTC+ + A AAKI+ YE+VP+ +E+
Sbjct: 185 QGCSGGLMDDAFKFIEQNHGLTTEANYPYAGTDGTCNRKKAAHPAAKINGYEDVPANNEK 244
Query: 256 ALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIK 315
AL KAV+ QP+++ I A +EF+ Y G+F G CGT+LDH V VG+GT++DG YWL+K
Sbjct: 245 ALQKAVAHQPIAVAIDASGSEFQFYSSGVFTGQCGTELDHGVAAVGYGTSDDGMKYWLVK 304
Query: 316 NSWGDTWGDAGYMKILRD----EGLCGIGTQSSYPLA 348
NSW WG+ GY+++ RD EGLCGI Q+SYP A
Sbjct: 305 NSWSTGWGEEGYIRMQRDVTVKEGLCGIAMQASYPTA 341
>gi|84181681|gb|AAW78661.2| senescence-specific cysteine protease [Nicotiana tabacum]
Length = 349
Score = 358 bits (919), Expect = 2e-96, Method: Compositional matrix adjust.
Identities = 171/340 (50%), Positives = 237/340 (69%), Gaps = 9/340 (2%)
Query: 16 IPMFIIIILLVSCASQVVSSRS-THEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKEN 74
+ +F I + L S SQV SR +E ++ H++W+ H + YKD EKE+RF+IFKEN
Sbjct: 12 LALFFICLGLWS--SQVALSRPINYEATMRARHDQWIVHHEKVYKDLNEKEVRFQIFKEN 69
Query: 75 LEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT 134
+E IE N ++ YKLG N+FSDLTN+EFR L+TGYK P +++ ++ ++T
Sbjct: 70 VERIEAFNAGEDKGYKLGFNKFSDLTNEEFRVLHTGYKRSHPKVMTSSKGKTHFRYTNVT 129
Query: 135 DVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTN 194
D+P ++DWR K AVTPIKDQ+ECGCCWAFSAVAA+EG+ ++ LI LSEQ+LVDC
Sbjct: 130 DIPPTMDWRKKGAVTPIKDQKECGCCWAFSAVAAMEGLHQLKTGELIPLSEQELVDCDVE 189
Query: 195 G-NNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKA-AAAKISNYEEVPSG 252
G + GC GG ++ AF++I++N+G+ TE YPY+ G C+ + A +AAKI+ YE+VP+
Sbjct: 190 GEDEGCSGGLLDTAFDFILKNKGLTTEVNYPYKGEDGVCNKKKSALSAAKITGYEDVPAN 249
Query: 253 DEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYW 312
E+ALL+AV+ QPVS+ I + +F+ Y G+F+G C T L+HAVT VG+G T DG YW
Sbjct: 250 SEKALLQAVANQPVSVAIDGSSFDFQFYSSGVFSGSCSTWLNHAVTAVGYGATTDGTKYW 309
Query: 313 LIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYPLA 348
+IKNSWG WGD+GYM+I RD EGLCG+ +SYP A
Sbjct: 310 IIKNSWGSKWGDSGYMRIKRDVHEKEGLCGLAMDASYPTA 349
>gi|147788834|emb|CAN64655.1| hypothetical protein VITISV_005140 [Vitis vinifera]
Length = 341
Score = 357 bits (915), Expect = 6e-96, Method: Compositional matrix adjust.
Identities = 176/335 (52%), Positives = 237/335 (70%), Gaps = 14/335 (4%)
Query: 21 IIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEK 80
++ +L + AS +R+ HE S+ E HE WMAQ+GR YKD EK R+KIFK+N+ IE
Sbjct: 14 LLFVLAAWASHA-KARNLHEASMYERHEDWMAQYGRVYKDAGEKSKRYKIFKDNVARIES 72
Query: 81 ANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHR-STTSSTFKYQNLSMTDVPTS 139
NK N++YKL N F+DLTN+EFRA +K +H ST +++FKY+++ VP++
Sbjct: 73 FNKAMNKSYKLSINEFADLTNEEFRASRNRFK----AHICSTEATSFKYEHVXA--VPST 126
Query: 140 LDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNG-NNG 198
+DWR K AVTPIKDQ +CG CWAFSAVAA+EGIT++S LI LSEQ+LVDC T+G + G
Sbjct: 127 VDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGEDQG 186
Query: 199 CGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKA-AAAKISNYEEVPSGDEQAL 257
C GG M+ AF++I QN G+ TE YPY GTC+ + A AAKI+ YE+VP+ +E+AL
Sbjct: 187 CSGGLMDDAFKFIEQNHGLTTEANYPYAGTDGTCNRKKAAHPAAKINGYEDVPANNEKAL 246
Query: 258 LKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNS 317
KAV+ QP+++ I A EF+ Y G+F G CGT+LDH V+ VG+GT++DG YWL+KNS
Sbjct: 247 QKAVAHQPIAVAIDAGGFEFQFYSSGVFTGQCGTELDHGVSAVGYGTSDDGMKYWLVKNS 306
Query: 318 WGDTWGDAGYMKILRD----EGLCGIGTQSSYPLA 348
WG WG+ GY+++ RD EGLCGI Q+SYP A
Sbjct: 307 WGTGWGEEGYIRMQRDVTEKEGLCGIAMQASYPTA 341
>gi|18401420|ref|NP_565649.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|4314384|gb|AAD15594.1| cysteine proteinase [Arabidopsis thaliana]
gi|17381154|gb|AAL36389.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|20465849|gb|AAM20029.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|330252901|gb|AEC07995.1| cysteine proteinase-like protein [Arabidopsis thaliana]
Length = 348
Score = 356 bits (913), Expect = 1e-95, Method: Compositional matrix adjust.
Identities = 183/345 (53%), Positives = 230/345 (66%), Gaps = 16/345 (4%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEY 77
+FI+ I L S S S E S +E HE+WMA+ R Y DE EK RF IFK+NLE+
Sbjct: 6 IFILTIFLSYRTSLATSRGSLFEASAIEKHEQWMARFNRVYSDETEKRNRFNIFKKNLEF 65
Query: 78 IEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSST------FKYQNL 131
++ N TYK+ N FSDLT++EFRA +TG +P R +T S+ F+Y N+
Sbjct: 66 VQNFNMNNKITYKVDINEFSDLTDEEFRATHTGLVVPEAITRISTLSSGKNTVPFRYGNV 125
Query: 132 SMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDC 191
S D S+DWR + AVTP+K Q CG CWAFSAVAAVEGITKI+ L+ LSEQQL+DC
Sbjct: 126 S--DNGESMDWRQEGAVTPVKYQGRCGGCWAFSAVAAVEGITKITKGELVSLSEQQLLDC 183
Query: 192 STNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAA----AAKISNYE 247
+ N GC GG M KAFEYII+NQGI TED YPYQ Q TCS++ + AA IS YE
Sbjct: 184 DRDYNQGCRGGIMSKAFEYIIKNQGITTEDNYPYQESQQTCSSSTTLSSSFRAATISGYE 243
Query: 248 EVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTED 307
VP +E+ALL+AVS QPVS+GI F+ Y G+FNG CGT L HAVTIVG+G +E+
Sbjct: 244 TVPMNNEEALLQAVSQQPVSVGIEGTGAAFRHYSGGVFNGECGTDLHHAVTIVGYGMSEE 303
Query: 308 GANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYPLA 348
G YW++KNSWG+TWG+ GYM+I RD +G+CG+ + YPLA
Sbjct: 304 GTKYWVVKNSWGETWGENGYMRIKRDVDAPQGMCGLAILAFYPLA 348
>gi|357474573|ref|XP_003607571.1| Cysteine proteinase EP-B [Medicago truncatula]
gi|34329348|gb|AAQ63885.1| putative cysteine proteinase [Medicago truncatula]
gi|355508626|gb|AES89768.1| Cysteine proteinase EP-B [Medicago truncatula]
Length = 345
Score = 355 bits (912), Expect = 1e-95, Method: Compositional matrix adjust.
Identities = 176/324 (54%), Positives = 230/324 (70%), Gaps = 12/324 (3%)
Query: 33 VSSRSTHEQSVV-EMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGN-RTYK 90
V+SR+ + S++ E HE+WM +G+ YKD E+E R KIFKEN+ YIE +N GN + YK
Sbjct: 26 VTSRTLQDDSIIYEKHEQWMVHYGKVYKDLQERENRLKIFKENVNYIEASNNAGNNKLYK 85
Query: 91 LGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTP 150
LG N+F+DLTN+EF A +K S T +STFKY+N S VP+++DWR K AVTP
Sbjct: 86 LGINQFADLTNEEFIASRNKFKGHMCS-SITKTSTFKYENAS---VPSTVDWRKKGAVTP 141
Query: 151 IKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNG-NNGCGGGTMEKAFE 209
+K+Q +CGCCWAFSAVAA EGI K+S L+ LSEQ+LVDC T G + GC GG M+ AF+
Sbjct: 142 VKNQGQCGCCWAFSAVAATEGIHKLSTGKLVSLSEQELVDCDTKGVDQGCEGGLMDDAFK 201
Query: 210 YIIQNQGIATEDEYPYQAVQGTCSAAQKAA-AAKISNYEEVPSGDEQALLKAVSMQPVSI 268
+IIQN G+ TE +YPYQ V GTCSA + + A I+ YE+VP+ +EQAL KAV+ QP+S+
Sbjct: 202 FIIQNHGLNTEAQYPYQGVDGTCSANKASIHAVTITGYEDVPANNEQALQKAVANQPISV 261
Query: 269 GIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYM 328
I A ++F+ YK G+F G CGT+LDH VT VG+G DG YWL+KNSWG WG+ GY+
Sbjct: 262 AIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVGNDGTKYWLVKNSWGTDWGEEGYI 321
Query: 329 KILRD----EGLCGIGTQSSYPLA 348
K+ R EGLCGI ++SYP A
Sbjct: 322 KMQRGVDAAEGLCGIAMEASYPTA 345
>gi|357483847|ref|XP_003612210.1| Cysteine proteinase [Medicago truncatula]
gi|355513545|gb|AES95168.1| Cysteine proteinase [Medicago truncatula]
Length = 344
Score = 355 bits (912), Expect = 2e-95, Method: Compositional matrix adjust.
Identities = 178/343 (51%), Positives = 239/343 (69%), Gaps = 17/343 (4%)
Query: 18 MFIIIILLVSCA---SQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKEN 74
++ I + LV C + V+SR+ + S+ E HE+WM +G+ YKD E+E RFKIF EN
Sbjct: 7 LYHISLALVFCLGLWAIQVTSRTLQDGSMHERHERWMNHYGKVYKDHQEREKRFKIFTEN 66
Query: 75 LEYIEKANK-EGNRTYKLGTNRFSDLTNDEFRALYTGYK--MPSPSHRSTTSSTFKYQNL 131
++YIE N + N +YKLG N+F+DLTN+EF A +K M S R+TT FKY+N+
Sbjct: 67 MKYIEAFNNGDNNESYKLGINQFADLTNEEFVASRNKFKGHMCSSIIRTTT---FKYENV 123
Query: 132 SMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDC 191
S +P+++DWR K AVTP+K+Q +CGCCWAFSAVAA EGI K+S L+ LSEQ+LVDC
Sbjct: 124 SA--IPSTVDWRKKGAVTPVKNQGQCGCCWAFSAVAATEGIHKLSTGKLVSLSEQELVDC 181
Query: 192 STNG-NNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKA-AAAKISNYEEV 249
T G + GC GG M+ AF++IIQN G+ TE +YPYQ V GTC+A + + A I+ YE+V
Sbjct: 182 DTKGVDQGCEGGLMDDAFKFIIQNHGLNTEAQYPYQGVDGTCNANKASIQATTITGYEDV 241
Query: 250 PSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGA 309
P+ +EQAL KAV+ QP+S+ I A ++F+ YK G+F G CGT+LDH VT VG+G + DG
Sbjct: 242 PANNEQALQKAVANQPISVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVSNDGT 301
Query: 310 NYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYPLA 348
YWL+KNSWG WG+ GY+ + R EGLCGI Q+SYP A
Sbjct: 302 KYWLVKNSWGTDWGEEGYIMMQRGVEAAEGLCGIAMQASYPTA 344
>gi|356577813|ref|XP_003557017.1| PREDICTED: uncharacterized protein LOC100801364 [Glycine max]
Length = 890
Score = 355 bits (911), Expect = 2e-95, Method: Compositional matrix adjust.
Identities = 173/337 (51%), Positives = 234/337 (69%), Gaps = 13/337 (3%)
Query: 20 IIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIE 79
+ ++L ++ + V+ RS + S+ E HE+WM ++G+ YKD E+E RF+IFKEN+ YIE
Sbjct: 559 LAMLLCMAFLAFQVTCRSLQDASMYERHEQWMTRYGKVYKDPQEREKRFRIFKENVNYIE 618
Query: 80 KANKEGNRTYKLGTNRFSDLTNDEFRALYTGYK--MPSPSHRSTTSSTFKYQNLSMTDVP 137
N N+ YKL N+F+DLTN+EF A +K M S R+TT FKY+N+ T VP
Sbjct: 619 AFNNAANKRYKLAINQFADLTNEEFIAPRNRFKGHMCSSIIRTTT---FKYENV--TAVP 673
Query: 138 TSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNG-N 196
+++DWR K AVTPIKDQ +CGCCWAFSAVAA EGI ++ LI LSEQ+LVDC T G +
Sbjct: 674 STVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALTSGKLISLSEQELVDCDTKGVD 733
Query: 197 NGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAA-AAKISNYEEVPSGDEQ 255
GC GG M+ AF+++IQN G+ TE YPY+ V G C+A + A I+ YE+VP+ +E+
Sbjct: 734 QGCEGGLMDDAFKFVIQNHGLNTEANYPYKGVDGKCNANEAANDVVTITGYEDVPANNEK 793
Query: 256 ALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIK 315
AL KAV+ QPVS+ I A ++F+ YK G+F G CGT+LDH VT VG+G + DG YWL+K
Sbjct: 794 ALQKAVANQPVSVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVSNDGTEYWLVK 853
Query: 316 NSWGDTWGDAGYMKILR----DEGLCGIGTQSSYPLA 348
NSWG WG+ GY+++ R +EGLCGI Q+SYP A
Sbjct: 854 NSWGTEWGEEGYIRMQRGVDSEEGLCGIAMQASYPTA 890
>gi|225446585|ref|XP_002280215.1| PREDICTED: vignain [Vitis vinifera]
Length = 341
Score = 355 bits (911), Expect = 2e-95, Method: Compositional matrix adjust.
Identities = 176/335 (52%), Positives = 238/335 (71%), Gaps = 14/335 (4%)
Query: 21 IIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEK 80
++ L + ASQ ++R+ E S+ E HE WMAQ+GR YKD EK R+KIFK+N+ IE
Sbjct: 14 LLFFLAAWASQA-TARNLLEASMYERHEDWMAQYGRVYKDADEKSKRYKIFKDNVARIES 72
Query: 81 ANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHR-STTSSTFKYQNLSMTDVPTS 139
NK +++YKL N F+DLTN+EFRA +K +H ST +++FKY++++ VP++
Sbjct: 73 FNKAMDKSYKLSINEFADLTNEEFRASRNRFK----AHICSTEATSFKYEHVAA--VPST 126
Query: 140 LDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNG-NNG 198
+DWR K AVTPIKDQ +CG CWAFSAVAA+EGIT++S LI LSEQ+LVDC T+G + G
Sbjct: 127 VDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGEDQG 186
Query: 199 CGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKA-AAAKISNYEEVPSGDEQAL 257
C GG M+ AF++I QN G+ATE YPY GTC+ + A AAKI+ YE+VP+ +E+AL
Sbjct: 187 CNGGLMDDAFKFIEQNHGLATEANYPYAGTDGTCNRKKAAHPAAKINGYEDVPANNEKAL 246
Query: 258 LKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNS 317
KAV+ QP+++ I A EF+ Y G+F G CGT+LDH V VG+GT++DG YWL+KNS
Sbjct: 247 QKAVAHQPIAVAIDAGGFEFQFYSSGVFTGQCGTELDHGVAAVGYGTSDDGMKYWLVKNS 306
Query: 318 WGDTWGDAGYMKILRD----EGLCGIGTQSSYPLA 348
WG WG+ GY+++ RD EGLCGI Q+SYP A
Sbjct: 307 WGTGWGEVGYIRMQRDVTAKEGLCGIAMQASYPTA 341
>gi|357477459|ref|XP_003609015.1| Cysteine proteinase [Medicago truncatula]
gi|355510070|gb|AES91212.1| Cysteine proteinase [Medicago truncatula]
Length = 345
Score = 355 bits (911), Expect = 2e-95, Method: Compositional matrix adjust.
Identities = 178/337 (52%), Positives = 235/337 (69%), Gaps = 11/337 (3%)
Query: 19 FIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYI 78
I L CA QV +SRS S+ E HE+WM+Q+ + YKD E+E R KIF N+ YI
Sbjct: 13 LTFIFCLGLCAIQV-TSRSLQVDSMYERHEQWMSQYSKVYKDPQEREERHKIFTANVNYI 71
Query: 79 EKANKEGN-RTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
E N + N + YKLG N+F+DLTN+EF A +K S + T+ TFKY+N+S +P
Sbjct: 72 EVFNNDANNKLYKLGINQFADLTNEEFIASRNKFKGHMCSSIAKTT-TFKYENVSA--IP 128
Query: 138 TSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNG-N 196
+++DWR K AVTP+K+Q +CGCCWAFSAVAA EGITK+S L+ LSEQ+LVDC T G +
Sbjct: 129 STVDWRKKGAVTPVKNQGQCGCCWAFSAVAATEGITKLSTGKLVSLSEQELVDCDTKGVD 188
Query: 197 NGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAA-AAKISNYEEVPSGDEQ 255
GC GG M+ AF++IIQN G++TE YPYQ V GTC+A + + AA I+ YE+VP+ +EQ
Sbjct: 189 QGCEGGLMDDAFKFIIQNHGLSTEAAYPYQGVDGTCNANKASIHAATITGYEDVPANNEQ 248
Query: 256 ALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIK 315
AL KAV+ QP+S+ I A ++F+ YK G+F+G CGT+LDH VT VG+G DG YWL+K
Sbjct: 249 ALQKAVANQPISVAIDASGSDFQFYKSGVFSGSCGTELDHGVTAVGYGVGNDGTKYWLVK 308
Query: 316 NSWGDTWGDAGYMKILRD----EGLCGIGTQSSYPLA 348
NSWG WG+ GY+++ R EGLCGI Q+SYP A
Sbjct: 309 NSWGTDWGEEGYIRMQRGVDAAEGLCGIAMQASYPTA 345
>gi|357474579|ref|XP_003607574.1| Cysteine protease [Medicago truncatula]
gi|355508629|gb|AES89771.1| Cysteine protease [Medicago truncatula]
Length = 345
Score = 355 bits (911), Expect = 2e-95, Method: Compositional matrix adjust.
Identities = 174/324 (53%), Positives = 231/324 (71%), Gaps = 12/324 (3%)
Query: 33 VSSRSTHEQSVV-EMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGN-RTYK 90
V+SR+ + S++ E HE+WM +G+ YKD E+E R KIFKEN+ YIE +N GN + YK
Sbjct: 26 VTSRTLQDDSIIYEKHEQWMVHYGKVYKDLQERENRLKIFKENVNYIEASNNAGNNKLYK 85
Query: 91 LGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTP 150
LG N+F+D+TN+EF A +K S T +STFKY+N S VP+++DWR K AVTP
Sbjct: 86 LGINQFADITNEEFIASRNKFKGHMCS-SITKTSTFKYENAS---VPSTVDWRKKGAVTP 141
Query: 151 IKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNG-NNGCGGGTMEKAFE 209
+K+Q +CGCCWAFSAVAA EGI K+S L+ LSEQ+LVDC T G + GC GG M+ AF+
Sbjct: 142 VKNQGQCGCCWAFSAVAATEGIHKLSTGKLVSLSEQELVDCDTKGVDQGCEGGLMDDAFK 201
Query: 210 YIIQNQGIATEDEYPYQAVQGTCSAAQKAA-AAKISNYEEVPSGDEQALLKAVSMQPVSI 268
+IIQN G+ TE +YPYQ V GTCSA + + AA I+ YE+VP+ +E AL KAV+ QP+S+
Sbjct: 202 FIIQNHGLHTEAQYPYQGVDGTCSANETSTPAATIAGYEDVPANNENALQKAVANQPISV 261
Query: 269 GIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYM 328
I A ++F+ YK G+F G CGTQLDH VT VG+G + DG YWL+KNSWG+ WG+ GY+
Sbjct: 262 AIDASGSDFQFYKSGVFTGSCGTQLDHGVTAVGYGISNDGTKYWLVKNSWGNDWGEEGYI 321
Query: 329 KILRD----EGLCGIGTQSSYPLA 348
++ R +GLCGI +SYP A
Sbjct: 322 RMQRSVDAAQGLCGIAMMASYPTA 345
>gi|356577811|ref|XP_003557016.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 355 bits (911), Expect = 2e-95, Method: Compositional matrix adjust.
Identities = 174/337 (51%), Positives = 235/337 (69%), Gaps = 13/337 (3%)
Query: 20 IIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIE 79
+ ++L ++ + V+ RS + S+ E HE+WM ++G+ YKD E+E RF+IFKEN+ YIE
Sbjct: 12 LAMLLCMAFLAFQVTCRSLQDASMYERHEQWMTRYGKVYKDPQEREKRFRIFKENVNYIE 71
Query: 80 KANKEGNRTYKLGTNRFSDLTNDEFRALYTGYK--MPSPSHRSTTSSTFKYQNLSMTDVP 137
N N+ YKL N+F+DLTN+EF A +K M S R+TT FKY+N+ T VP
Sbjct: 72 AFNNAANKRYKLAINQFADLTNEEFIAPRNRFKGHMCSSIIRTTT---FKYENV--TAVP 126
Query: 138 TSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNG-N 196
+++DWR K AVTPIKDQ +CGCCWAFSAVAA EGI ++ LI LSEQ+LVDC T G +
Sbjct: 127 STVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALTSGKLISLSEQELVDCDTKGVD 186
Query: 197 NGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAA-AAKISNYEEVPSGDEQ 255
GC GG M+ AF+++IQN G+ TE YPY+ V G C+ + A AA I+ YE+VP+ +E+
Sbjct: 187 QGCEGGLMDDAFKFVIQNHGLNTEANYPYKGVDGKCNVNEAANDAATITGYEDVPANNEK 246
Query: 256 ALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIK 315
AL KAV+ QPVS+ I A ++F+ YK G+F G CGT+LDH VT VG+G + DG YWL+K
Sbjct: 247 ALQKAVANQPVSVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVSNDGTEYWLVK 306
Query: 316 NSWGDTWGDAGYMKILR----DEGLCGIGTQSSYPLA 348
NSWG WG+ GY+++ R +EGLCGI Q+SYP A
Sbjct: 307 NSWGTEWGEEGYIRMQRGVNSEEGLCGIAMQASYPTA 343
>gi|356545063|ref|XP_003540965.1| PREDICTED: thiol protease SEN102-like [Glycine max]
Length = 361
Score = 355 bits (910), Expect = 2e-95, Method: Compositional matrix adjust.
Identities = 176/356 (49%), Positives = 241/356 (67%), Gaps = 13/356 (3%)
Query: 1 MVLIFERSGSFKINTIPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKD 60
+L F + K + + + ++L ++ + V+ RS + S+ E HE+WM ++G+ YKD
Sbjct: 11 FLLFFASTMVAKNHFCHISLAMLLCMAFLAFQVTCRSLQDASMYERHEQWMTRYGKVYKD 70
Query: 61 ELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYK--MPSPSH 118
E+E RF+IFKEN+ YIE N N+ YKL N+F+DLTN+EF A +K M S
Sbjct: 71 PQEREKRFRIFKENVNYIEAFNNAANKRYKLAINQFADLTNEEFIAPRNRFKGHMCSSII 130
Query: 119 RSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGA 178
R+TT FKY+N+ T VP+++DWR K AVTPIKDQ +CGCCWAFSAVAA EGI ++
Sbjct: 131 RTTT---FKYENV--TAVPSTVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALTSG 185
Query: 179 NLIQLSEQQLVDCSTNG-NNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQK 237
LI LSEQ+LVDC T G + GC GG M+ AF+++IQN G+ TE YPY+ V G C+A +
Sbjct: 186 KLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFVIQNHGLNTEANYPYKGVDGKCNANEA 245
Query: 238 AA-AAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHA 296
A I+ YE+VP+ +E+AL KAV+ QPVS+ I A ++F+ YK G+F G CGT+LDH
Sbjct: 246 ANDVVTITGYEDVPANNEKALQKAVANQPVSVAIDASGSDFQFYKSGVFTGSCGTELDHG 305
Query: 297 VTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILR----DEGLCGIGTQSSYPLA 348
VT VG+G + DG YWL+KNSWG WG+ GY+++ R +EGLCGI Q+SYP A
Sbjct: 306 VTAVGYGVSNDGTEYWLVKNSWGTEWGEEGYIRMQRGVDSEEGLCGIAMQASYPTA 361
>gi|312281697|dbj|BAJ33714.1| unnamed protein product [Thellungiella halophila]
Length = 347
Score = 355 bits (910), Expect = 2e-95, Method: Compositional matrix adjust.
Identities = 174/341 (51%), Positives = 230/341 (67%), Gaps = 11/341 (3%)
Query: 16 IPMFIIIILLVSCASQVVSSRSTHEQSVV--EMHEKWMAQHGRSYKDELEKEMRFKIFKE 73
I +F+I+ L+ S + SR + ++ + H++WMA+HGR Y D EK R+ +FK
Sbjct: 6 IQIFLIVSLISSFCLSITLSRPLDDNELIMQKRHDEWMAKHGRVYADMKEKNNRYVVFKR 65
Query: 74 NLEYIEKANK-EGNRTYKLGTNRFSDLTNDEFRALYTGYKMPS--PSHRSTTSSTFKYQN 130
N+E IE+ N RT+KL N+F+DLTNDEFR++YTGYK S S T +S+F+YQN
Sbjct: 66 NVERIERLNNVPAGRTFKLAVNQFADLTNDEFRSMYTGYKGGSVLSSQSGTKTSSFRYQN 125
Query: 131 LSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVD 190
+S +P S+DWR K AVTPIK+Q CGCCWAFSAVAA+EG TKI LI LSEQQLVD
Sbjct: 126 VSSGALPVSVDWRKKGAVTPIKNQGTCGCCWAFSAVAAIEGATKIKKGKLISLSEQQLVD 185
Query: 191 CSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQ-KAAAAKISNYEEV 249
C TN + GC GG M+ AFE+I+ G+ TE YPY+ TC K A I+ YE+V
Sbjct: 186 CDTN-DFGCSGGLMDTAFEHIMATGGLTTESNYPYKGKDATCKIKNTKPTATSITGYEDV 244
Query: 250 PSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGA 309
P DE+AL+KAV+ QPVSIGI +F+ Y G+F G C T LDHAVT VG+G + +G+
Sbjct: 245 PVNDEKALMKAVAHQPVSIGIEGGGFDFQFYGSGVFTGECTTYLDHAVTAVGYGQSSNGS 304
Query: 310 NYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYP 346
YW+IKNSWG WG++GYM+I +D +GLCG+ ++SYP
Sbjct: 305 KYWIIKNSWGTKWGESGYMRIKKDVKDKKGLCGLAMKASYP 345
>gi|357167190|ref|XP_003581045.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Brachypodium distachyon]
Length = 415
Score = 355 bits (910), Expect = 2e-95, Method: Compositional matrix adjust.
Identities = 169/337 (50%), Positives = 230/337 (68%), Gaps = 10/337 (2%)
Query: 19 FIIIILLVSCASQVVSSRS-THEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEY 77
F+I IL +CA +++R T + S+V HE+WMA++GR Y D EK R ++FK N+ +
Sbjct: 82 FLIAILACTCAVSALAARDLTDDLSMVARHEQWMAKYGRVYNDVAEKAQRLEVFKANVAF 141
Query: 78 IEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
IE N GN + L N+F+D+T DEFRA +TGYK P P+++ T+ FKY N+S+ +P
Sbjct: 142 IELVNA-GNDKFSLEANQFADMTVDEFRAAHTGYK-PVPANKGRTTQ-FKYANVSLDALP 198
Query: 138 TSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNG-N 196
S+DWR K AVTPIKDQ +CGCCWAFS VA+VEGI K+S LI LSEQ+LVDC +G +
Sbjct: 199 ASMDWRAKGAVTPIKDQGQCGCCWAFSTVASVEGIVKLSTGKLISLSEQELVDCDVDGMD 258
Query: 197 NGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKA-AAAKISNYEEVPSGDEQ 255
GC GG M+ AFE+II N G+ TE YPY +C++ +++ A I YE+VPS DE
Sbjct: 259 QGCEGGLMDNAFEFIIDNGGLTTEGNYPYTGTDDSCNSNKESNDVASIKGYEDVPSNDET 318
Query: 256 ALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIK 315
+LLKAV+ QPVSI + F+ YK G+ +G CGT+LDH + VG+G T DG +WL+K
Sbjct: 319 SLLKAVAAQPVSIAVDGGDNLFRFYKGGVLSGACGTELDHGIAAVGYGITSDGTKFWLMK 378
Query: 316 NSWGDTWGDAGYMKILRD----EGLCGIGTQSSYPLA 348
NSWG +WG+ G++++ RD EGLCG+ Q SYP A
Sbjct: 379 NSWGTSWGEKGFIRMERDIADEEGLCGLAMQPSYPTA 415
>gi|297826061|ref|XP_002880913.1| hypothetical protein ARALYDRAFT_481640 [Arabidopsis lyrata subsp. lyrata]
gi|297326752|gb|EFH57172.1| hypothetical protein ARALYDRAFT_481640 [Arabidopsis lyrata subsp. lyrata]
Length = 347
Score = 355 bits (910), Expect = 2e-95, Method: Compositional matrix adjust.
Identities = 186/344 (54%), Positives = 231/344 (67%), Gaps = 15/344 (4%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEY 77
+FI+ I L S S E S +E HE+WMA+ R Y DE EK RF IFK+NLE+
Sbjct: 6 IFILTIFLSYRTSLATSRGGLFEASPIEKHEQWMARFNRVYSDESEKRNRFNIFKKNLEF 65
Query: 78 IEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSP-SHRSTTSS----TFKYQNLS 132
++ N N TYKL N FSDLT++EFRA +TG +P + ST SS F+Y N+S
Sbjct: 66 VQSFNMNKNITYKLDVNEFSDLTDEEFRATHTGLVVPEEITGISTLSSDKTVPFRYGNVS 125
Query: 133 MTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCS 192
D S+DWR + AVTP+K Q CG CWAFSAVAAVEGITKI+ L+ LSEQQL+DC
Sbjct: 126 --DTGESMDWRQEGAVTPVKYQGRCGGCWAFSAVAAVEGITKITKGELVSLSEQQLLDCD 183
Query: 193 TNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAA----AAKISNYEE 248
T+ N GC GG M KAFEYII+NQGI TED YPYQ Q TCS++ + AA IS YE
Sbjct: 184 TDYNQGCHGGIMSKAFEYIIKNQGITTEDNYPYQESQQTCSSSTTLSSSFRAATISGYET 243
Query: 249 VPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDG 308
VP +E+ALL+AVS QPVS+GI F+ Y GIFNG CGT L HAVTIVG+G +E+G
Sbjct: 244 VPMNNEEALLQAVSQQPVSVGIEGTGAGFRHYSGGIFNGECGTDLHHAVTIVGYGMSEEG 303
Query: 309 ANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYPLA 348
YW++KNSWG+TWG+ G+M+I RD +G+CG+ + YPLA
Sbjct: 304 TKYWVVKNSWGETWGEDGFMRIKRDVDAPQGMCGLAMLAFYPLA 347
>gi|47524507|gb|AAT34987.1| putative cysteine protease [Gossypium hirsutum]
Length = 344
Score = 354 bits (909), Expect = 3e-95, Method: Compositional matrix adjust.
Identities = 176/343 (51%), Positives = 242/343 (70%), Gaps = 14/343 (4%)
Query: 16 IPMFIIIILLVSCASQVVS-SRSTHEQSVVEMHEKWMAQHGRSYKDELE--KEMRFKIFK 72
I +F+ ++L + Q+ SR ++ + HE+WM+QHGR Y DE E K RF +FK
Sbjct: 6 IFLFVALVLSFCFSIQLAGLSRPLLDEDSMR-HEEWMSQHGRVYADEQEDHKNKRFNVFK 64
Query: 73 ENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSP-SHRSTTSSTFKYQNL 131
EN+E IE+ N +T+KL N+F+DLTN+EFRA Y G+K P S + T + F+Y+N+
Sbjct: 65 ENVERIEEFND--GKTFKLAINQFADLTNEEFRASYNGFKGPMVLSSQITKPTPFRYENV 122
Query: 132 SMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDC 191
S + +P S+DWR K AVTP+K+Q +CGCCWAFSAVAA+EGIT+IS LI LSEQ+LVDC
Sbjct: 123 S-SALPVSVDWRKKGAVTPVKNQGQCGCCWAFSAVAAIEGITQISTGKLISLSEQELVDC 181
Query: 192 STNG-NNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQ-KAAAAKISNYEEV 249
T G ++GC GG M+ AFE+II N G+ TE YPY+ GTC+ + A I+ YE+V
Sbjct: 182 DTKGIDHGCEGGLMDTAFEFIINNGGLTTESNYPYKGEDGTCNFNKTNPIAVSITGYEDV 241
Query: 250 PSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGA 309
P+ DEQAL+KAV+ QPVS+ I A ++F+ Y G+F G CGT+LDHAVT VG+G +EDG+
Sbjct: 242 PANDEQALMKAVAHQPVSVAIEAGGSDFQFYSSGVFTGECGTELDHAVTAVGYGESEDGS 301
Query: 310 NYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYPLA 348
YW++KNSWG WG++GY+++ +D +GLCGI Q+SYP A
Sbjct: 302 KYWIVKNSWGTKWGESGYIEMQKDIKVKQGLCGIAMQASYPTA 344
>gi|297826875|ref|XP_002881320.1| hypothetical protein ARALYDRAFT_321132 [Arabidopsis lyrata subsp.
lyrata]
gi|297327159|gb|EFH57579.1| hypothetical protein ARALYDRAFT_321132 [Arabidopsis lyrata subsp.
lyrata]
Length = 341
Score = 354 bits (909), Expect = 3e-95, Method: Compositional matrix adjust.
Identities = 170/337 (50%), Positives = 232/337 (68%), Gaps = 7/337 (2%)
Query: 16 IPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENL 75
+ +F I+ S + + + HE S +E HE+WMA+ R Y+DELEK+MR +FK+NL
Sbjct: 8 VTIFTILFTTFSISQATSRTVTFHEPSSLEKHEQWMARFSRVYRDELEKQMRRDVFKKNL 67
Query: 76 EYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD 135
++IE NK+GN++YKLG N F+D TN+EF A++TG K S T S+ + M
Sbjct: 68 KFIENFNKKGNKSYKLGVNEFADWTNEEFLAIHTGLKGLSSKVVDETISSRSWNISDMVG 127
Query: 136 VPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNG 195
V S DWR + AVTP+K Q +CGCCWAFSAVAAVEG+TKI+G NL+ LSEQQL+DC
Sbjct: 128 V--SKDWRAEGAVTPVKYQGQCGCCWAFSAVAAVEGVTKIAGGNLVSLSEQQLLDCDREY 185
Query: 196 NNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQ 255
+ GC GG M AF YIIQN+GIA+E++Y YQ G C ++ + AA+IS ++ VPS +EQ
Sbjct: 186 DRGCDGGIMSDAFNYIIQNRGIASENDYSYQGSDGRCRSSAR-PAARISGFQTVPSNNEQ 244
Query: 256 ALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIK 315
ALL+AVS QPVS+ + A F Y G+++G CGT +HAVT VG+GT++DG YWL K
Sbjct: 245 ALLEAVSRQPVSVSMDANGDGFMHYSGGVYDGPCGTSSNHAVTFVGYGTSQDGTKYWLAK 304
Query: 316 NSWGDTWGDAGYMKILRD----EGLCGIGTQSSYPLA 348
NSWG+TWG+ GY++I RD +G+CG+ + YP+A
Sbjct: 305 NSWGETWGEKGYIRIRRDVAWPQGMCGVAQYAFYPVA 341
>gi|359485281|ref|XP_002280230.2| PREDICTED: LOW QUALITY PROTEIN: KDEL-tailed cysteine endopeptidase
CEP1 [Vitis vinifera]
Length = 341
Score = 354 bits (909), Expect = 3e-95, Method: Compositional matrix adjust.
Identities = 176/335 (52%), Positives = 236/335 (70%), Gaps = 14/335 (4%)
Query: 21 IIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEK 80
++ +L + ASQ ++R+ HE S+ E HE WMAQ+GR YKD EK R+KIFK+N+ IE
Sbjct: 14 LLFVLAAWASQA-TARNLHEASMYERHEDWMAQYGRVYKDADEKSKRYKIFKDNVARIES 72
Query: 81 ANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHR-STTSSTFKYQNLSMTDVPTS 139
NK +++YKL N F+DLTN+EF +K +H ST +++FKY+N+ T VP++
Sbjct: 73 FNKAMDKSYKLSINEFADLTNEEFGTSRNRFK----AHICSTEATSFKYENV--TAVPST 126
Query: 140 LDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNG-NNG 198
+DWR K AVTPIKDQ +CG CWAFSAVAA+EGIT++S LI LSEQ+LVDC T+G + G
Sbjct: 127 IDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGEDQG 186
Query: 199 CGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKA-AAAKISNYEEVPSGDEQAL 257
C GG M+ AF++I QN G+ TE YPY GTC+ + A AAKI+ YE+VP+ +E+AL
Sbjct: 187 CNGGLMDDAFKFIKQNHGLTTEANYPYAGTDGTCNRKKAAHPAAKINGYEDVPANNEKAL 246
Query: 258 LKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNS 317
KAV QP+++ I A EF+ Y G+F G CGT+LDH V VG+GT++DG YWL+KNS
Sbjct: 247 QKAVVHQPIAVAIDAGGFEFQFYSSGVFTGQCGTELDHGVAAVGYGTSDDGMKYWLVKNS 306
Query: 318 WGDTWGDAGYMKILRD----EGLCGIGTQSSYPLA 348
WG WG+ GY+++ RD EGLCGI Q+SYP A
Sbjct: 307 WGTGWGEEGYIRMQRDVTAKEGLCGIAMQASYPTA 341
>gi|224099295|ref|XP_002334495.1| predicted protein [Populus trichocarpa]
gi|222872550|gb|EEF09681.1| predicted protein [Populus trichocarpa]
Length = 342
Score = 354 bits (909), Expect = 3e-95, Method: Compositional matrix adjust.
Identities = 171/336 (50%), Positives = 225/336 (66%), Gaps = 9/336 (2%)
Query: 19 FIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYI 78
F IL++ + V+SR E + HE+WMA +G+ Y D EKE RFKIFK N+EYI
Sbjct: 10 FFAFILILGMWAFEVASRELQESYMSARHEQWMATYGKVYVDAAEKERRFKIFKNNVEYI 69
Query: 79 EKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPT 138
E N GN+ YKL N+F+D TN++F+ GY+ P + R ++FKY+N+ T VP
Sbjct: 70 ESFNTAGNKPYKLSVNKFADQTNEKFKGARNGYRRPFQT-RPMKVTSFKYENV--TAVPA 126
Query: 139 SLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNG-NN 197
++DWR K AVTPIKDQ +CG CWAFS VAA EGI +++ L+ LSEQ+LVDC G +
Sbjct: 127 TMDWRKKGAVTPIKDQGQCGSCWAFSTVAATEGINQLTTGKLVSLSEQELVDCDNQGEDQ 186
Query: 198 GCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAA-AAKISNYEEVPSGDEQA 256
GC GG ME FE+II+N GI TE YPYQA GTC++ ++A+ AKI+ YE VP+ E
Sbjct: 187 GCEGGLMEDGFEFIIKNHGITTEANYPYQAADGTCNSKKQASHIAKITGYESVPANSEAE 246
Query: 257 LLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKN 316
LLK V+ QP+S+ I A ++F+ Y G+F G CGT+LDH VT VG+G T DG YWL+KN
Sbjct: 247 LLKVVANQPISVSIDAGGSDFQFYSSGVFTGKCGTELDHGVTAVGYGETSDGTKYWLVKN 306
Query: 317 SWGDTWGDAGYMKILRD----EGLCGIGTQSSYPLA 348
SW +WG+ GY+++ RD EGLCGI SSYP A
Sbjct: 307 SWXTSWGEEGYIRMQRDIDAEEGLCGIAMDSSYPTA 342
>gi|357474527|ref|XP_003607548.1| Cysteine protease [Medicago truncatula]
gi|358347211|ref|XP_003637653.1| Cysteine protease [Medicago truncatula]
gi|355503588|gb|AES84791.1| Cysteine protease [Medicago truncatula]
gi|355508603|gb|AES89745.1| Cysteine protease [Medicago truncatula]
Length = 345
Score = 353 bits (907), Expect = 5e-95, Method: Compositional matrix adjust.
Identities = 176/327 (53%), Positives = 228/327 (69%), Gaps = 11/327 (3%)
Query: 29 ASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGN-R 87
A QV S + ++ E HE+WM +G+ YKD E+E R KIFKEN+ YIE +N GN +
Sbjct: 23 AIQVTSRTLQDDSNIYEKHEQWMVHYGKVYKDLQERENRLKIFKENVNYIEASNNAGNNK 82
Query: 88 TYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKA 147
YKLG N+F+DLTN+EF A +K S T +STFKY+N S VP+++DWR K A
Sbjct: 83 LYKLGINQFADLTNEEFIASRNKFKGHMCS-SITKTSTFKYENAS---VPSTVDWRKKGA 138
Query: 148 VTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNG-NNGCGGGTMEK 206
VTP+K+Q +CGCCWAFSAVAA EGI K+S L+ LSEQ+LVDC T G + GC GG M+
Sbjct: 139 VTPVKNQGQCGCCWAFSAVAATEGIHKLSTGKLVSLSEQELVDCDTKGVDQGCEGGLMDD 198
Query: 207 AFEYIIQNQGIATEDEYPYQAVQGTCSAAQKA-AAAKISNYEEVPSGDEQALLKAVSMQP 265
AF++IIQN G+ TE +YPYQ V GTCSA + + A I+ YE+VP+ +EQAL KAV+ QP
Sbjct: 199 AFKFIIQNHGLNTEAQYPYQGVDGTCSANKASIHAVTITGYEDVPANNEQALQKAVANQP 258
Query: 266 VSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDA 325
+S+ I A ++F+ YK G+F G CGT+LDH VT VG+G DG YWL+KNSWG WG+
Sbjct: 259 ISVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVGNDGTKYWLVKNSWGTDWGEE 318
Query: 326 GYMKILRD----EGLCGIGTQSSYPLA 348
GY+K+ R EGLCGI ++SYP A
Sbjct: 319 GYIKMQRGVDAAEGLCGIAMEASYPTA 345
>gi|357452075|ref|XP_003596314.1| Cysteine proteinase [Medicago truncatula]
gi|355485362|gb|AES66565.1| Cysteine proteinase [Medicago truncatula]
Length = 341
Score = 353 bits (907), Expect = 5e-95, Method: Compositional matrix adjust.
Identities = 181/346 (52%), Positives = 236/346 (68%), Gaps = 19/346 (5%)
Query: 12 KINTIPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIF 71
+++ IP + L + S +SR+ + EMHE+WM QHG+ YK EK+ RF IF
Sbjct: 6 QLHYIP--FALFLCLGLLSFQATSRTLQNDPMYEMHEQWMVQHGKVYKAAHEKQKRFGIF 63
Query: 72 KENLEYIEKANKEGNRTYKLGTNRFSDLTNDEF---RALYTGYKMPSPSHRSTTSSTFKY 128
KEN+ YIE N GN++YKLG N F+DLTN EF R + GY + +TFKY
Sbjct: 64 KENVNYIEAFNNVGNKSYKLGLNHFADLTNHEFIAARNKFNGYL------HGSIITTFKY 117
Query: 129 QNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQL 188
+N+S DVP+++DWR + AVTP+K+Q +CGCCWAFSAVA+ EGI K++ NL+ LSEQ+L
Sbjct: 118 KNVS--DVPSAVDWRQEGAVTPVKNQGQCGCCWAFSAVASTEGIHKLTTGNLVSLSEQEL 175
Query: 189 VDCSTNG-NNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQ-KAAAAKISNY 246
VDC TNG + GC GG M+ AFE+IIQN G++TE EYPYQ V GTC+ + ++AA IS Y
Sbjct: 176 VDCDTNGEDQGCEGGLMDDAFEFIIQNNGLSTEAEYPYQGVDGTCNKTEVGSSAATISGY 235
Query: 247 EEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTE 306
E VP DEQAL KAV+ QPVS+ I A ++F+ YK G+F G CGT+LDH V +VG+G E
Sbjct: 236 ENVPVNDEQALQKAVANQPVSVAIDASGSDFQFYKSGVFTGSCGTELDHGVAVVGYGVGE 295
Query: 307 DGANYWLIKNSWGDTWGDAGYMKILR----DEGLCGIGTQSSYPLA 348
D YWL+KNSWG WG+ GY+++ R EGLCGI Q SYP A
Sbjct: 296 DETEYWLVKNSWGTQWGEEGYIRMQRGVDASEGLCGIAMQPSYPTA 341
>gi|118627554|emb|CAL64936.1| putative cysteine protease 8 [Trifolium pratense]
Length = 344
Score = 353 bits (907), Expect = 6e-95, Method: Compositional matrix adjust.
Identities = 175/343 (51%), Positives = 239/343 (69%), Gaps = 17/343 (4%)
Query: 18 MFIIIILLVSCASQV---VSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKEN 74
++ I + L+ C V+SR+ + S+ E H +WM+Q+G+ YKD E+E RFKIFKEN
Sbjct: 7 LYHISLALLFCLGLFAIQVTSRTLQDDSMYERHGQWMSQYGKIYKDHQERETRFKIFKEN 66
Query: 75 LEYIEKANK-EGNRTYKLGTNRFSDLTNDEFRALYTGYK--MPSPSHRSTTSSTFKYQNL 131
+ YIE N + ++YKLG N+F+DLTN+EF A +K M S R+T+ FKY+N+
Sbjct: 67 VNYIETFNNADDTKSYKLGINQFADLTNEEFIASRNKFKGHMCSSIMRTTS---FKYENV 123
Query: 132 SMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDC 191
S +P+++DWR K AVTP+K+Q +CGCCWAFSAVAA EGI K+S LI LSEQ+LVDC
Sbjct: 124 S--GIPSTVDWRKKGAVTPVKNQGQCGCCWAFSAVAATEGIHKLSTGKLISLSEQELVDC 181
Query: 192 STNG-NNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAA-AAKISNYEEV 249
T G + GC GG M+ AF++IIQN G++TE +YPY+ V GTC+A + + A I+ YE+V
Sbjct: 182 DTKGVDQGCEGGLMDDAFKFIIQNHGLSTEAQYPYEGVDGTCNANKASVQAVTITGYEDV 241
Query: 250 PSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGA 309
P+ EQAL KAV+ QP+S+ I A ++F+ YK G+F G CGT+LDH VT VG+G + DG
Sbjct: 242 PANSEQALQKAVANQPISVAIDASGSDFQFYKSGVFTGACGTELDHGVTAVGYGVSNDGT 301
Query: 310 NYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYPLA 348
YWL+KNSWG WG+ GY+ + R EG+CGI Q+SYP A
Sbjct: 302 KYWLVKNSWGTDWGEEGYIMMQRGIEAAEGICGIAMQASYPTA 344
>gi|319826926|gb|ADV74756.1| cysteine protease [Lactuca sativa]
Length = 363
Score = 353 bits (906), Expect = 6e-95, Method: Compositional matrix adjust.
Identities = 167/321 (52%), Positives = 222/321 (69%), Gaps = 8/321 (2%)
Query: 33 VSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLG 92
+SR+ ++ +++ HE+WMA HGR Y DE EK++RF+IFK N+ YI+ N +++Y L
Sbjct: 41 ATSRTLNDPTMIARHEQWMAHHGRIYTDENEKQLRFQIFKNNVAYIDAHNARSDQSYTLE 100
Query: 93 TNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIK 152
N+F+DLTNDEFRA GYK S S F+Y N+S VP +DWR + AVTP+K
Sbjct: 101 VNKFADLTNDEFRASRNGYKKQPDSDSHVVSGLFRYANVSA--VPDEVDWRKEGAVTPVK 158
Query: 153 DQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNG-NNGCGGGTMEKAFEYI 211
DQ +CGCCWAFSAVAA+EGI K+ L+ LSEQ+LVDC +G + GC GG ME AF++I
Sbjct: 159 DQGDCGCCWAFSAVAAMEGINKLENGKLVSLSEQELVDCDIDGIDQGCEGGLMENAFQFI 218
Query: 212 IQNQGIATEDEYPYQAVQGTCSAAQKA-AAAKISNYEEVPSGDEQALLKAVSMQPVSIGI 270
+ +G+A E YPY G C+ + A AAKIS +E+VP+ +E+ALL+AV+ QPVSI I
Sbjct: 219 EKRKGLAAESVYPYTGEDGICNTKKAAIPAAKISGHEKVPANNEKALLQAVANQPVSIAI 278
Query: 271 AAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKI 330
A EF+ Y G+F G CGT+LDHA+T VG+G T DG YWL+KNSWG +WG+ GY++I
Sbjct: 279 DASGYEFQFYSGGVFTGSCGTELDHAITAVGYGATMDGTKYWLMKNSWGASWGENGYIRI 338
Query: 331 LRD----EGLCGIGTQSSYPL 347
RD EGLCGI SYP+
Sbjct: 339 KRDSLAKEGLCGIAMDPSYPV 359
>gi|255568299|ref|XP_002525124.1| cysteine protease, putative [Ricinus communis]
gi|223535583|gb|EEF37251.1| cysteine protease, putative [Ricinus communis]
Length = 342
Score = 353 bits (906), Expect = 6e-95, Method: Compositional matrix adjust.
Identities = 170/337 (50%), Positives = 228/337 (67%), Gaps = 10/337 (2%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEY 77
+ I + +++ + S+R HE ++VE HEKWMA+HG+ YKD+ EK RF+IFK N+E+
Sbjct: 10 LLIALFFVLAMWADQASTRELHESTMVERHEKWMAKHGKVYKDDEEKLRRFQIFKNNVEF 69
Query: 78 IEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
IE +N GN +Y LG NRF+DLTN+EFRA + GYK P + R T FKY+N+ T +P
Sbjct: 70 IESSNAAGNNSYMLGINRFADLTNEEFRASWNGYKRPLDASRIVT--PFKYENV--TALP 125
Query: 138 TSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNG-N 196
S+DWR K AVT IKDQ+ECG CWAFSAVAA EG+ K+ L+ LSEQ+LVDC G +
Sbjct: 126 YSMDWRRKGAVTSIKDQRECGSCWAFSAVAATEGVHKLRTGKLVSLSEQELVDCDVKGED 185
Query: 197 NGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAA-AAKISNYEEVPSGDEQ 255
GC GG ME AF++I +N GI TE Y Y+ G C ++A+ AKI+ Y+ VP E
Sbjct: 186 KGCQGGLMEDAFKFIKRNGGITTEANYAYRGRDGKCDTKKEASHVAKITGYQVVPENSEA 245
Query: 256 ALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIK 315
ALLKAV+ QPVS+ I A + F+ Y+ GI+ G CG+ L+H V VG+GT+ G+ YW++K
Sbjct: 246 ALLKAVAHQPVSVSIDAGSMSFQFYQSGIYAGSCGSDLNHGVAAVGYGTSSSGSKYWIVK 305
Query: 316 NSWGDTWGDAGYMKILRD----EGLCGIGTQSSYPLA 348
NSWG WG+ GY+++ RD +GLCGI SYP A
Sbjct: 306 NSWGPEWGERGYVRMKRDITSRKGLCGIAMDCSYPTA 342
>gi|50355613|dbj|BAD29955.1| cysteine protease [Daucus carota]
Length = 365
Score = 353 bits (906), Expect = 6e-95, Method: Compositional matrix adjust.
Identities = 170/336 (50%), Positives = 234/336 (69%), Gaps = 13/336 (3%)
Query: 20 IIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIE 79
+ ++ + + + ++RS +E S+ E H++WMA++GR YK EK R IF+ENL+YI+
Sbjct: 12 LALLFTIGVLASLAAARSLNEASMTETHDQWMARYGRVYKTANEKNRRSTIFQENLKYIQ 71
Query: 80 KANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHR-STTSSTFKYQNLSMTDVPT 138
NK N+ YKLG N F+DLTN+EF +K SH +T ++ F+Y+N+ T VP
Sbjct: 72 TFNKANNKPYKLGVNEFADLTNEEFTTSRNKFK----SHVCATVTNVFRYENV--TAVPA 125
Query: 139 SLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNG-NN 197
++DWR K AVTPIK+Q +CGCCWAFSAVAA+EGIT++ LI LSEQ+LVDC TNG +
Sbjct: 126 TMDWRKKGAVTPIKNQGQCGCCWAFSAVAAMEGITQLKTGKLISLSEQELVDCDTNGEDQ 185
Query: 198 GCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKA-AAAKISNYEEVPSGDEQA 256
GC GG M+ AF++I QN G++TE YPY GTC+A ++A AA I+ +E+VP+ E A
Sbjct: 186 GCEGGLMDYAFDFIQQNHGLSTETNYPYSGTDGTCNANKEANHAATITGHEDVPANSESA 245
Query: 257 LLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKN 316
LLKAV+ QP+S+ I A ++F+ Y G+F G CGT+LDH VT VG+GT DG YWL+KN
Sbjct: 246 LLKAVANQPISVAIDASGSDFQFYSSGVFTGECGTELDHGVTAVGYGTAADGTKYWLVKN 305
Query: 317 SWGDTWGDAGYMKILRD----EGLCGIGTQSSYPLA 348
SWG +WG+ GY+++ R EGLCGI Q+SYP A
Sbjct: 306 SWGTSWGEEGYIQMQRGVAAAEGLCGIAMQASYPTA 341
>gi|356542631|ref|XP_003539770.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 353 bits (905), Expect = 8e-95, Method: Compositional matrix adjust.
Identities = 172/338 (50%), Positives = 240/338 (71%), Gaps = 15/338 (4%)
Query: 20 IIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIE 79
+ ++LL + ++R+ + S+ E HE+WMAQHG+ YKD EKE+R+KIF++N++ IE
Sbjct: 12 LALLLLFGFWAFSANTRTLEDASMHERHEQWMAQHGKVYKDHHEKELRYKIFQQNVKGIE 71
Query: 80 KANKEGNRTYKLGTNRFSDLTNDEFRAL--YTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
N GN+++KLG N+F+DLT +EF+A+ GY S +STFKY+++ T VP
Sbjct: 72 GFNNAGNKSHKLGVNQFADLTEEEFKAINKLKGYMWSKISR----TSTFKYEHV--TKVP 125
Query: 138 TSLDWRDKKAVTPIKDQ-QECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGN 196
+LDWR K AVTPIK Q +CG CWAF+AVAA EGITK++ LI LSEQ+L+DC TNG+
Sbjct: 126 ATLDWRQKGAVTPIKSQGLKCGSCWAFAAVAATEGITKLTTGELISLSEQELIDCDTNGD 185
Query: 197 NG-CGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSA-AQKAAAAKISNYEEVPSGDE 254
NG C G +++AF++I+QN+G+ATE YPYQAV GTC+A + A I YE+VP+ +E
Sbjct: 186 NGGCKWGIIQEAFKFIVQNKGLATEASYPYQAVDGTCNAKVESKHVASIKGYEDVPANNE 245
Query: 255 QALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLI 314
ALL AV+ QPVS+ + + +F+ Y G+ +G CGT DHAVT+VG+G ++DG YWLI
Sbjct: 246 TALLNAVANQPVSVLVDSSDYDFRFYSSGVLSGSCGTTFDHAVTVVGYGVSDDGTKYWLI 305
Query: 315 KNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYPLA 348
KNSWG WG+ GY++I RD EG+CGI Q+SYP+A
Sbjct: 306 KNSWGVYWGEQGYIRIKRDVAAKEGMCGIAMQASYPIA 343
>gi|40806498|gb|AAR92154.1| putative cysteine protease 1 [Iris x hollandica]
Length = 340
Score = 353 bits (905), Expect = 8e-95, Method: Compositional matrix adjust.
Identities = 181/347 (52%), Positives = 238/347 (68%), Gaps = 17/347 (4%)
Query: 10 SFK-INTIPMFIIIILLVSCASQVVSSRSTHE-QSVVEMHEKWMAQHGRSYKDELEKEMR 67
+FK + +P ++I+ + ASQ + RS E +S++E HE+WMAQHGR YK+ EK R
Sbjct: 3 AFKTVKLLPALALLIVAI-WASQGEAGRSLGENKSMLERHEQWMAQHGRVYKNAAEKAHR 61
Query: 68 FKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFK 127
F+IF+ N+E IE N E N +KLG N+F+DLTN+EF+ T PS ++T S FK
Sbjct: 62 FEIFRANVERIESFNAE-NHKFKLGVNQFADLTNEEFKTRNT----LKPSKMASTKS-FK 115
Query: 128 YQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQ 187
Y+N+ T VP ++DWR K AVTPIKDQ +CG CWAFSAVAA EGITK+S LI LSEQ+
Sbjct: 116 YENV--TAVPATMDWRTKGAVTPIKDQGQCGSCWAFSAVAATEGITKLSTGKLISLSEQE 173
Query: 188 LVDCS-TNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAA-AAKISN 245
+VDC T+ + GC GG M+ AFEYII+N+GI TE YPY+A GTC+ + A+ AA I+
Sbjct: 174 VVDCDVTSDDQGCNGGEMDDAFEYIIKNKGITTEANYPYKAADGTCNTKKAASHAASITG 233
Query: 246 YEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
YE+V E ALLKA + QP+++ I A F+ Y G+F G CGT LDH VT+VG+G T
Sbjct: 234 YEDVTVNSEAALLKAAANQPIAVAIDAGDFAFQMYSSGVFTGDCGTDLDHGVTLVGYGAT 293
Query: 306 EDGANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYPLA 348
DG YWL+KNSWG +WG+ GY+++ RD EGLCGI +SYP A
Sbjct: 294 SDGTKYWLVKNSWGTSWGEDGYIRMERDVDAKEGLCGIAMDASYPTA 340
>gi|224121800|ref|XP_002330656.1| predicted protein [Populus trichocarpa]
gi|222872260|gb|EEF09391.1| predicted protein [Populus trichocarpa]
Length = 342
Score = 353 bits (905), Expect = 9e-95, Method: Compositional matrix adjust.
Identities = 171/336 (50%), Positives = 225/336 (66%), Gaps = 9/336 (2%)
Query: 19 FIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYI 78
F IL++ + V+SR E + HE+WMA +G+ Y D EKE RFKIFK N+EYI
Sbjct: 10 FFAFILILGMWAFEVASRELQESYMSARHEQWMATYGKVYVDAAEKERRFKIFKNNVEYI 69
Query: 79 EKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPT 138
E N GN+ YKL N+F+D TN++F+ GY+ P + R ++FKY+N+ T VP
Sbjct: 70 ESFNTAGNKPYKLSVNKFADQTNEKFKGARNGYRRPFQT-RPMKVTSFKYENV--TAVPA 126
Query: 139 SLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNG-NN 197
++DWR K AVT IKDQ +CG CWAFS VAA EGI +++ L+ LSEQ+LVDC G +
Sbjct: 127 TMDWRKKGAVTLIKDQGQCGSCWAFSTVAATEGINQLTTGKLVSLSEQELVDCDIQGEDQ 186
Query: 198 GCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAA-AAKISNYEEVPSGDEQA 256
GC GG ME FE+II+N GI TE YPYQA GTC++ ++A+ AKI+ YE VP+ E
Sbjct: 187 GCEGGLMEDGFEFIIKNHGITTEANYPYQAADGTCNSKKQASHIAKITGYESVPANSEAE 246
Query: 257 LLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKN 316
LLK V+ QP+S+ I A ++F+ Y G+F G CGT+LDH VT VG+G T DG YWL+KN
Sbjct: 247 LLKVVANQPISVSIDAGGSDFQFYSSGVFTGKCGTELDHGVTAVGYGETSDGTKYWLVKN 306
Query: 317 SWGDTWGDAGYMKILRD----EGLCGIGTQSSYPLA 348
SWG +WG+ GY+++ RD EGLCGI SSYP A
Sbjct: 307 SWGTSWGEEGYIRMQRDIDTEEGLCGIAMDSSYPTA 342
>gi|356515086|ref|XP_003526232.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 352 bits (904), Expect = 1e-94, Method: Compositional matrix adjust.
Identities = 174/337 (51%), Positives = 237/337 (70%), Gaps = 13/337 (3%)
Query: 20 IIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIE 79
+ ++L ++ + V+ R+ + S+ E HE+WM ++G+ YKD E+E RF++FKEN+ YIE
Sbjct: 12 LAMLLCMTFLAFQVTCRTLQDASMYERHEQWMTRYGKVYKDPQEREKRFRVFKENVNYIE 71
Query: 80 KANKEGNRTYKLGTNRFSDLTNDEFRALYTGYK--MPSPSHRSTTSSTFKYQNLSMTDVP 137
N N++YKLG N+F+DLTN EF A G+K M S R+TT FK++N++ T P
Sbjct: 72 AFNNAANKSYKLGINQFADLTNKEFIAPRNGFKGHMCSSIIRTTT---FKFENVTAT--P 126
Query: 138 TSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNG-N 196
+++DWR K AVTPIKDQ +CGCCWAFSAVAA EGI +S LI LSEQ+LVDC T G +
Sbjct: 127 STVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALSAGKLISLSEQELVDCDTKGVD 186
Query: 197 NGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAK-ISNYEEVPSGDEQ 255
GC GG M+ AF++IIQN G+ TE YPY+ V G C+A + A A I+ YE+VP+ +E
Sbjct: 187 QGCEGGLMDDAFKFIIQNHGLNTEANYPYKGVDGKCNANEAAKNAATITGYEDVPANNEM 246
Query: 256 ALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIK 315
AL KAV+ QPVS+ I A ++F+ YK G+F G CGT+LDH VT VG+G ++DG YWL+K
Sbjct: 247 ALQKAVANQPVSVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVSDDGTEYWLVK 306
Query: 316 NSWGDTWGDAGYMKILR----DEGLCGIGTQSSYPLA 348
NSWG WG+ GY+++ R +EGLCGI Q+SYP A
Sbjct: 307 NSWGTEWGEEGYIRMQRGVDSEEGLCGIAMQASYPTA 343
>gi|356539398|ref|XP_003538185.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 352 bits (904), Expect = 1e-94, Method: Compositional matrix adjust.
Identities = 179/349 (51%), Positives = 235/349 (67%), Gaps = 17/349 (4%)
Query: 10 SFKINTIPMFIIIILLVS--CASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMR 67
+FK F + + LV CA + ++R+ + + E HE+WMA HG+ Y EKE +
Sbjct: 2 AFKKVLFQYFTLALCLVFAFCAFEG-NARTLEDAPMRERHEQWMAIHGKVYTHSYEKEQK 60
Query: 68 FKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRAL--YTGYKMPSPSHRSTTSST 125
++ FKEN++ IE N GN+ YKLG N F+DLTN+EF+A+ + G+ + T + T
Sbjct: 61 YQTFKENVQRIEAFNHAGNKPYKLGINHFADLTNEEFKAINRFKGH----VCSKITRTPT 116
Query: 126 FKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSE 185
F+Y+N MT VP +LDWR + AVTPIKDQ +CGCCWAFSAVAA EGITK+S LI LSE
Sbjct: 117 FRYEN--MTAVPATLDWRQEGAVTPIKDQGQCGCCWAFSAVAATEGITKLSTGKLISLSE 174
Query: 186 QQLVDCSTNG-NNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSA-AQKAAAAKI 243
Q+LVDC T G + GC GG M+ AF++I+QN+G+A E YPY+ V GTC+A A+ A I
Sbjct: 175 QELVDCDTKGVDQGCEGGLMDDAFKFILQNKGLAAEAIYPYEGVDGTCNAKAEGNHATSI 234
Query: 244 SNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFG 303
YE+VP+ E ALLKAV+ QPVS+ I A EF+ Y G+F G CGT LDH VT VG+G
Sbjct: 235 KGYEDVPANSESALLKAVANQPVSVAIEASGFEFQFYSGGVFTGSCGTNLDHGVTAVGYG 294
Query: 304 TTEDGANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYPLA 348
++DG YWL+KNSWG WGD GY+++ RD EGLCGI +SYP A
Sbjct: 295 VSDDGTKYWLVKNSWGVKWGDKGYIRMQRDVAAKEGLCGIAMLASYPNA 343
>gi|102140014|gb|ABF70145.1| cysteine protease, putative [Musa acuminata]
Length = 373
Score = 351 bits (901), Expect = 2e-94, Method: Compositional matrix adjust.
Identities = 176/338 (52%), Positives = 235/338 (69%), Gaps = 14/338 (4%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEY 77
M ++ + L +C+ ++ + S+ E H +WMA+HGR+YKD EKE R IFK N+EY
Sbjct: 9 MALLALGLGACSP---AAAELGDASMAERHVEWMARHGRTYKDAAEKEQRLGIFKSNVEY 65
Query: 78 IEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
IE N G R Y+L N+F+DLT++EF+A++TG+K PS + + F++ +LS VP
Sbjct: 66 IESFNA-GKRKYQLAANQFADLTHEEFKAMHTGFK-PSGTGAKKAGNGFRHGSLS--SVP 121
Query: 138 TSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNG-N 196
S+DWR K AVTP+KDQ CG CWAF+ VAAVEGITKI LI LSEQQLVDC +G +
Sbjct: 122 DSVDWRSKGAVTPVKDQGLCGSCWAFTVVAAVEGITKIVTGKLISLSEQQLVDCDVHGKD 181
Query: 197 NGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKA-AAAKISNYEEVPSGDEQ 255
GC GG M+ AFE+I+ N GI +E YPY+ VQ C+A + A I ++E+VP+ DE+
Sbjct: 182 QGCQGGDMDAAFEFIVNNGGITSEANYPYEEVQRLCNAHNASFVVATIESHEDVPTNDEK 241
Query: 256 ALLKAVSMQPVSIGI-AAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLI 314
AL KAV+ QPVS+GI A + +F+ Y G+F+G CGT LDHAVT+VG+GTT DG YWL
Sbjct: 242 ALRKAVANQPVSVGIDAGSSLDFQLYSGGVFSGECGTDLDHAVTVVGYGTTSDGTKYWLA 301
Query: 315 KNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYPLA 348
KNSWG+TWG+ GY+++ RD EGLCGI Q+SYP A
Sbjct: 302 KNSWGETWGENGYIRMERDVAAKEGLCGIAMQASYPTA 339
>gi|357471211|ref|XP_003605890.1| Cysteine proteinase [Medicago truncatula]
gi|355506945|gb|AES88087.1| Cysteine proteinase [Medicago truncatula]
Length = 343
Score = 351 bits (901), Expect = 3e-94, Method: Compositional matrix adjust.
Identities = 176/326 (53%), Positives = 228/326 (69%), Gaps = 17/326 (5%)
Query: 33 VSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANK-EGNRTYKL 91
V+SR T + + E H +WM+Q+G+ YKD E+E RFKIF EN+ YIE NK + N+ Y L
Sbjct: 25 VTSR-TLQDDMYERHRQWMSQYGKVYKDSQEREKRFKIFTENVNYIEAFNKGDNNKLYTL 83
Query: 92 GTNRFSDLTNDEF---RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAV 148
G N+F+DLTNDEF R + G+ S T +STFKY+N S +P+S+DWR K AV
Sbjct: 84 GVNQFADLTNDEFTSSRNKFKGHMCSSI----TRTSTFKYENASA--IPSSVDWRKKGAV 137
Query: 149 TPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNG-NNGCGGGTMEKA 207
TP+K+Q +CGCCWAFSAVAA EGI K+S LI LSEQ+LVDC T G + GC GG M+ A
Sbjct: 138 TPVKNQGQCGCCWAFSAVAATEGIHKLSTGKLISLSEQELVDCDTKGVDQGCEGGLMDDA 197
Query: 208 FEYIIQNQGIATEDEYPYQAVQGTCSAAQKAA-AAKISNYEEVPSGDEQALLKAVSMQPV 266
F++IIQN G+ TE YPYQ V GTC+A + + A I+ YE+VP+ +EQAL KAV+ QP+
Sbjct: 198 FKFIIQNHGLNTEANYPYQGVDGTCNANKGSINAVTITGYEDVPTNNEQALQKAVANQPI 257
Query: 267 SIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAG 326
S+ I A ++F+ YK G+F G CGT+LDH VT VG+G + DG YWL+KNSWG WG+ G
Sbjct: 258 SVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVSNDGTKYWLVKNSWGTEWGEEG 317
Query: 327 YMKILRD----EGLCGIGTQSSYPLA 348
Y+ + R EGLCGI Q+SYP A
Sbjct: 318 YIMMQRGVDAAEGLCGIAMQASYPTA 343
>gi|356542633|ref|XP_003539771.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 341
Score = 351 bits (900), Expect = 3e-94, Method: Compositional matrix adjust.
Identities = 177/340 (52%), Positives = 231/340 (67%), Gaps = 14/340 (4%)
Query: 15 TIPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKEN 74
T+ +F+I CA + ++R+ + + E HE+WMA HG+ YK EKE +++IF EN
Sbjct: 10 TLALFLIFAF---CAFEA-NARTLEDAPMRERHEQWMATHGKVYKHSYEKEQKYQIFMEN 65
Query: 75 LEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT 134
++ IE N G + YKLG N F+DLTN+EF+A+ +K S R+ T+ TF+Y+N+ T
Sbjct: 66 VQRIEAFNNAGXKPYKLGINHFADLTNEEFKAI-NRFKGHVCSKRTRTT-TFRYENV--T 121
Query: 135 DVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTN 194
VP SLDWR K AVTPIKDQ +CGCCWAFSAVAA EGITK+ LI LSEQ+LVDC T
Sbjct: 122 AVPASLDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGITKLRTGKLISLSEQELVDCDTK 181
Query: 195 G-NNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSA-AQKAAAAKISNYEEVPSG 252
G + GC GG M+ AF++I+QN+G+ATE YPY+ GTC+A A A I YE+VP+
Sbjct: 182 GVDQGCEGGLMDDAFKFILQNKGLATEAIYPYEGFDGTCNAKADGNHAGSIKGYEDVPAN 241
Query: 253 DEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYW 312
E ALLKAV+ QPVS+ I A +F+ Y G+F G CGT LDH VT VG+G +DG YW
Sbjct: 242 SESALLKAVANQPVSVAIEASGFKFQFYSGGVFTGSCGTNLDHGVTSVGYGVGDDGTKYW 301
Query: 313 LIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYPLA 348
L+KNSWG WG+ GY+++ RD EGLCGI +SYP A
Sbjct: 302 LVKNSWGVKWGEKGYIRMQRDVAAKEGLCGIAMLASYPSA 341
>gi|400180377|gb|AFP73327.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 351 bits (900), Expect = 3e-94, Method: Compositional matrix adjust.
Identities = 173/345 (50%), Positives = 237/345 (68%), Gaps = 13/345 (3%)
Query: 10 SFKINTIPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFK 69
+ K++ + + I + ++S + RS E SV E HE WM++HGR YKDE+EK RF
Sbjct: 2 AMKVDLMNILITLFFVISMFNTQTRGRSQPELSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSE 185
FK +LS D+P++LDWR+ AVT +K Q CGCCWAFSAV ++EG KI+ NL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISN 245
Q+L+DC+TN N GC GG M AF++II+N GI+ E +Y YQ Q TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYQGEQYTCRSQEKTAAVQISS 238
Query: 246 YEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
Y+ VP G E +LL+AV+ QPVSIGIAA + + + Y G ++G C +++HAVT +G+GT
Sbjct: 239 YQVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTD 296
Query: 306 EDGANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYP 346
E G YWL+KNSWG +WG+ G+MKI+RD GLC I SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|255568297|ref|XP_002525123.1| cysteine protease, putative [Ricinus communis]
gi|223535582|gb|EEF37250.1| cysteine protease, putative [Ricinus communis]
Length = 349
Score = 351 bits (900), Expect = 3e-94, Method: Compositional matrix adjust.
Identities = 171/343 (49%), Positives = 228/343 (66%), Gaps = 14/343 (4%)
Query: 12 KINTIPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIF 71
KI I +F ++ + CA Q +SR HE + HEKWMA+HG+ YKD+ EK RF+IF
Sbjct: 8 KILPIALFFVLAM---CADQA-ASRELHELEMTGRHEKWMAKHGKVYKDDKEKLRRFQIF 63
Query: 72 KENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNL 131
K N+ +IE N GN++Y LG N+F+DLTN+EFRA + GYK P + R T FKY+N+
Sbjct: 64 KSNVVFIESFNTAGNKSYMLGINKFADLTNEEFRAFWNGYKRPLGASRKIT--PFKYENV 121
Query: 132 SMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDC 191
T +P+S+DWR K AVTPIKDQ CG CWAFSAVAA EGI K+ L+ LSEQ+LVDC
Sbjct: 122 --TALPSSIDWRSKGAVTPIKDQGVCGSCWAFSAVAATEGIHKLRTGKLVSLSEQELVDC 179
Query: 192 STNG-NNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAA-AAKISNYEEV 249
G + GC GG M AF++I ++ G+ +E YPYQ G C ++A+ A KI+ Y+ V
Sbjct: 180 DVKGQDKGCQGGLMVDAFKFIKRHGGMTSEANYPYQGRDGKCDTKKEASRAVKITGYQAV 239
Query: 250 PSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGA 309
P E ALLKAV+ QPVS+ I A + F+ Y+ GIF G+CG ++H V VG+G + G+
Sbjct: 240 PKNSEAALLKAVANQPVSVAIDAGSLSFQFYRSGIFTGICGKDINHGVAAVGYGRSNSGS 299
Query: 310 NYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYPLA 348
YW++KNSWG WG+ GY+++ RD EGLCGI + SYP A
Sbjct: 300 KYWIVKNSWGTEWGEKGYIRMKRDVRSKEGLCGIAMECSYPTA 342
>gi|18403438|ref|NP_565780.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|2342728|gb|AAB67626.1| cysteine proteinase [Arabidopsis thaliana]
gi|330253821|gb|AEC08915.1| cysteine proteinase-like protein [Arabidopsis thaliana]
Length = 345
Score = 350 bits (899), Expect = 4e-94, Method: Compositional matrix adjust.
Identities = 168/337 (49%), Positives = 233/337 (69%), Gaps = 9/337 (2%)
Query: 20 IIIILLVSCASQVVSSRST--HEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEY 77
++IIL +SR+ EQS+V+ HE+WMA+ R Y+DELEK MR +FK+NL++
Sbjct: 10 VLIILFTGFRISQATSRTVIFREQSMVDKHEQWMARFSREYRDELEKNMRRDVFKKNLKF 69
Query: 78 IEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYK-MPSPSHRSTTSSTFKYQNLSMTD- 135
IE NK+GN++YKLG N F+D TN+EF A++TG K + S + T Q +++D
Sbjct: 70 IENFNKKGNKSYKLGVNEFADWTNEEFLAIHTGLKGLTEVSPSKVVAKTISSQTWNVSDM 129
Query: 136 VPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNG 195
V S DWR + AVTP+K Q +CGCCWAFSAVAAVEG+ KI+G NL+ LSEQQL+DC
Sbjct: 130 VVESKDWRAEGAVTPVKYQGQCGCCWAFSAVAAVEGVAKIAGGNLVSLSEQQLLDCDREY 189
Query: 196 NNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQ 255
+ GC GG M AF Y++QN+GIA+E++Y YQ G C + + AA+IS ++ VPS +E+
Sbjct: 190 DRGCDGGIMSDAFNYVVQNRGIASENDYSYQGSDGGCRSNAR-PAARISGFQTVPSNNER 248
Query: 256 ALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIK 315
ALL+AVS QPVS+ + A F Y G+++G CGT +HAVT VG+GT++DG YWL K
Sbjct: 249 ALLEAVSRQPVSVSMDATGDGFMHYSGGVYDGPCGTSSNHAVTFVGYGTSQDGTKYWLAK 308
Query: 316 NSWGDTWGDAGYMKILRD----EGLCGIGTQSSYPLA 348
NSWG+TWG+ GY++I RD +G+CG+ + YP+A
Sbjct: 309 NSWGETWGEKGYIRIRRDVAWPQGMCGVAQYAFYPVA 345
>gi|356543116|ref|XP_003540009.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 337
Score = 350 bits (898), Expect = 5e-94, Method: Compositional matrix adjust.
Identities = 174/337 (51%), Positives = 234/337 (69%), Gaps = 15/337 (4%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEY 77
+ +++LL C SQV+S R HE S+ E HE+WM ++G+ YKD EK+ R IFK+N+E+
Sbjct: 10 ILALVLLLSICTSQVMS-RYLHEASMSERHEQWMKKYGKVYKDAAEKQKRLLIFKDNVEF 68
Query: 78 IEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSST-FKYQNLSMTDV 136
IE N GN+ YKLG N +D TN+EF A + GYK H+++ S T FKY+N+ T V
Sbjct: 69 IESFNAAGNKPYKLGINHLADQTNEEFVASHNGYK-----HKASHSQTPFKYENV--TGV 121
Query: 137 PTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGN 196
P ++DWR+ AVT +KDQ +CG CWAFS VAA EGI +I+ + L+ LSEQ+LVDC + +
Sbjct: 122 PNAVDWRENGAVTAVKDQGQCGSCWAFSTVAATEGIYQITTSMLMSLSEQELVDCDSV-D 180
Query: 197 NGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAA-AAKISNYEEVPSGDEQ 255
+GC GG ME FE+II+N GI++E YPY AV GTC A ++A+ AA+I YE VP+ E
Sbjct: 181 HGCDGGYMEGGFEFIIKNGGISSEANYPYTAVDGTCDANKEASPAAQIKGYETVPANSED 240
Query: 256 ALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIK 315
AL KAV+ QPVS+ I A + F+ Y G+F G CGTQLDH VT VG+G+T+DG YW++K
Sbjct: 241 ALQKAVANQPVSVTIDAGGSAFQFYSSGVFTGQCGTQLDHGVTAVGYGSTDDGTQYWIVK 300
Query: 316 NSWGDTWGDAGYMKILR----DEGLCGIGTQSSYPLA 348
NSWG WG+ GY+++ R EGLCGI +SYP A
Sbjct: 301 NSWGTQWGEEGYIRMQRGTDAQEGLCGIAMDASYPTA 337
>gi|356543124|ref|XP_003540013.1| PREDICTED: vignain-like [Glycine max]
gi|356543126|ref|XP_003540014.1| PREDICTED: vignain-like [Glycine max]
Length = 337
Score = 350 bits (897), Expect = 7e-94, Method: Compositional matrix adjust.
Identities = 174/337 (51%), Positives = 233/337 (69%), Gaps = 15/337 (4%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEY 77
+ +++LL C SQV+S R+ HE S+ E HE+WM ++G+ YKD EK+ R IFK+N+E+
Sbjct: 10 ILALVLLLSICTSQVMS-RNLHEASMSERHEQWMKKYGKVYKDAAEKQKRLLIFKDNVEF 68
Query: 78 IEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSST-FKYQNLSMTDV 136
IE N GNR YKL N +D TN+EF A + GYK H+ + S T FKY+N+ T V
Sbjct: 69 IESFNAAGNRPYKLSINHLADQTNEEFVASHNGYK-----HKGSHSQTPFKYENV--TGV 121
Query: 137 PTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGN 196
P ++DWR+ AVT +KDQ +CG CWAFS VAA EGI +I+ + L+ LSEQ+LVDC + +
Sbjct: 122 PNAVDWRENGAVTAVKDQGQCGSCWAFSTVAATEGIYQITTSMLMSLSEQELVDCDSV-D 180
Query: 197 NGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAA-AAKISNYEEVPSGDEQ 255
+GC GG ME FE+II+N GI++E YPY AV GTC A ++A+ AA+I YE VP+ E
Sbjct: 181 HGCDGGYMEGGFEFIIKNGGISSEANYPYTAVDGTCDANKEASPAAQIKGYETVPANSED 240
Query: 256 ALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIK 315
AL KAV+ QPVS+ I A + F+ Y G+F G CGTQLDH VT VG+G+T+DG YW++K
Sbjct: 241 ALQKAVANQPVSVTIDAGGSAFQFYSSGVFTGQCGTQLDHGVTAVGYGSTDDGTQYWIVK 300
Query: 316 NSWGDTWGDAGYMKILR----DEGLCGIGTQSSYPLA 348
NSWG WG+ GY+++ R EGLCGI +SYP A
Sbjct: 301 NSWGTQWGEEGYIRMQRGTDAQEGLCGIAMDASYPTA 337
>gi|356515048|ref|XP_003526213.1| PREDICTED: vignain-like [Glycine max]
Length = 350
Score = 349 bits (896), Expect = 1e-93, Method: Compositional matrix adjust.
Identities = 174/337 (51%), Positives = 234/337 (69%), Gaps = 14/337 (4%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEY 77
+ +++LL C SQV+S R+ HE S+ E HE+WM ++G+ YKD EK+ R IFK+N+E+
Sbjct: 10 ILALVLLLSICTSQVMS-RNLHEASMSERHEQWMKKYGKVYKDAAEKQKRLLIFKDNVEF 68
Query: 78 IEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
IE N GN+ YKL N +D TN+EF A + GYK SH T FKY N+ TD+P
Sbjct: 69 IESFNAAGNKPYKLSINHLADQTNEEFVASHNGYKYKG-SHSQT---PFKYGNV--TDIP 122
Query: 138 TSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNN 197
T++DWR AVT +KDQ +CG CWAFS VAA EGI +IS L+ LSEQ+LVDC + ++
Sbjct: 123 TAVDWRQNGAVTAVKDQGQCGSCWAFSTVAATEGIYQISTGMLMSLSEQELVDCDSV-DH 181
Query: 198 GCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAA-AAKISNYEEVPSGDEQA 256
GC GG ME FE+II+N GI++E YPY AV GTC A+++A+ AA+I YE VP+ E+A
Sbjct: 182 GCDGGLMEDGFEFIIKNGGISSEANYPYTAVDGTCDASKEASPAAQIKGYETVPANSEEA 241
Query: 257 LLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGAN-YWLIK 315
L +AV+ QPVS+ I A + F+ Y G+F G CGTQLDH VT+VG+GTT+DG + YW++K
Sbjct: 242 LQQAVANQPVSVSIDAGGSGFQFYSSGVFTGQCGTQLDHGVTVVGYGTTDDGTHEYWIVK 301
Query: 316 NSWGDTWGDAGYMKILR----DEGLCGIGTQSSYPLA 348
NSWG WG+ GY+++ R EGLCGI +SYP+
Sbjct: 302 NSWGTQWGEEGYIRMQRGIDAQEGLCGIAMDASYPMG 338
>gi|400180443|gb|AFP73358.1| cysteine protease, partial [Solanum habrochaites]
Length = 345
Score = 349 bits (896), Expect = 1e-93, Method: Compositional matrix adjust.
Identities = 171/345 (49%), Positives = 238/345 (68%), Gaps = 13/345 (3%)
Query: 10 SFKINTIPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFK 69
+ K++ + + I + ++S + +RS + SV E HE WM++HGR YKDE+EK RF
Sbjct: 2 AMKVDLMSILITLFFVISMFNSQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N F+D+T++EF A +TG +P SPS +T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSEEFLAKFTGLNIPNSYLSPSPMPSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSE 185
FK +LS D+P++LDWR+ AVT +K+Q +CGCCWAFSAV ++EG KI+ NL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKNQGQCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISN 245
Q+L+DC+TN N GC GG M AF++II+N GI+ E +Y Y Q TC + K AA +ISN
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQYTCRSQGKTAAVQISN 238
Query: 246 YEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
Y+ VP G E +LL+AV+ QPVSIGIAA + + + Y G ++G C +++HAVT +G+GT
Sbjct: 239 YQVVPEG-ETSLLQAVTKQPVSIGIAA-SHDLQFYAGGTYDGSCANRINHAVTAIGYGTD 296
Query: 306 EDGANYWLIKNSWGDTWGDAGYMKILRDE----GLCGIGTQSSYP 346
E G YWL+KNSWG +WG+ G+MKI+RD GLC I SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPAGLCDIAKMSSYP 341
>gi|400180419|gb|AFP73348.1| cysteine protease [Solanum lycopersicoides]
Length = 343
Score = 349 bits (895), Expect = 1e-93, Method: Compositional matrix adjust.
Identities = 170/342 (49%), Positives = 239/342 (69%), Gaps = 8/342 (2%)
Query: 10 SFKINTIPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFK 69
+ KI+ + + I + ++S + ++RS + SV E HE WM++HGR YKDE+EK RF
Sbjct: 2 AMKIDLMSILITLFFVISMFNSQTTARSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSST-FKY 128
IFKEN+++IE NK GN +YKLG N F+D+T++EF +TG +PS S SST FK
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGINEFADITSEEFLTKFTGINIPSYLSPSPMSSTEFKI 121
Query: 129 QNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQL 188
+LS D+P++LDWR+ AVT +K+Q +CGCCWAFSAV ++EG KI+ NL++ SEQ+L
Sbjct: 122 NDLSDDDMPSNLDWRESGAVTQVKNQGQCGCCWAFSAVGSLEGAYKIATGNLMEFSEQEL 181
Query: 189 VDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEE 248
+DC+TN N GC GG M AF++I +N GI++E +Y YQ Q TC + +K AA +IS+Y+
Sbjct: 182 LDCTTN-NYGCNGGFMTNAFDFIKENGGISSESDYEYQGQQYTCRSQEKTAAVQISSYQV 240
Query: 249 VPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDG 308
VP G E +LL+AV+ QPVSIGIAA + + + Y G ++G C +++HAVT +G+GT E G
Sbjct: 241 VPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTDEKG 298
Query: 309 ANYWLIKNSWGDTWGDAGYMKILRDE----GLCGIGTQSSYP 346
YWL+KNSWG +WG+ G+MKI+RD G C I SSYP
Sbjct: 299 QKYWLLKNSWGTSWGENGFMKIIRDSGNPGGHCDIAKMSSYP 340
>gi|356517348|ref|XP_003527349.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 348 bits (894), Expect = 2e-93, Method: Compositional matrix adjust.
Identities = 176/344 (51%), Positives = 235/344 (68%), Gaps = 20/344 (5%)
Query: 18 MFIIIILLVSCASQV---VSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKEN 74
+ I + L+ C+ + V+ R+ + S+ E HE+WM ++ + YKD E+E RFKIFKEN
Sbjct: 7 FYQISLALLFCSGFLAFQVTCRTLQDASMYERHEEWMGRYAKVYKDPQERERRFKIFKEN 66
Query: 75 LEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYK--MPSPSHRSTTSSTFKYQNLS 132
+ YIE N N+ Y LG N+F+DLTN+EF A +K M S R+TT FKY+N+
Sbjct: 67 VNYIEAFNNAANKPYTLGINQFADLTNEEFIAPRNRFKGHMCSSITRTTT---FKYENV- 122
Query: 133 MTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCS 192
T +P+++DWR K AVTPIKDQ +CGCCWAFSAVAA EGI +S LI LSEQ++VDC
Sbjct: 123 -TAIPSTVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALSAGKLISLSEQEVVDCD 181
Query: 193 TNG-NNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAA---AKISNYEE 248
T G + GC GG M+ AF++IIQN G+ E YPY+AV G C+A KAAA A I+ YE+
Sbjct: 182 TKGEDQGCAGGFMDGAFKFIIQNHGLNNEPNYPYKAVDGKCNA--KAAANHVATITGYED 239
Query: 249 VPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDG 308
VP +E+AL KAV+ QPVS+ I A ++F+ Y+ G+F G CGT+LDH VT VG+G + DG
Sbjct: 240 VPVNNEKALQKAVANQPVSVAIDASGSDFQFYQSGVFTGSCGTELDHGVTAVGYGVSADG 299
Query: 309 ANYWLIKNSWGDTWGDAGYMKILR----DEGLCGIGTQSSYPLA 348
YWL+KNSWG WG+ GY+++ R +EGLCGI +SYP A
Sbjct: 300 TEYWLVKNSWGTEWGEEGYIRMQRGVKAEEGLCGIAMMASYPTA 343
>gi|356577763|ref|XP_003556992.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 348 bits (894), Expect = 2e-93, Method: Compositional matrix adjust.
Identities = 176/344 (51%), Positives = 235/344 (68%), Gaps = 20/344 (5%)
Query: 18 MFIIIILLVSCASQV---VSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKEN 74
+ I + L+ C+ + V+ R+ + S+ E HE+WM ++ + YKD E+E RFKIFKEN
Sbjct: 7 FYQISLALLFCSGFLTFQVTCRTLQDASMYERHEEWMGRYAKVYKDPQERERRFKIFKEN 66
Query: 75 LEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYK--MPSPSHRSTTSSTFKYQNLS 132
+ YIE N N+ Y LG N+F+DLTN+EF A +K M S R+TT FKY+N+
Sbjct: 67 VNYIEAFNNAANKPYTLGINQFADLTNEEFIAPRNRFKGHMCSSITRTTT---FKYENV- 122
Query: 133 MTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCS 192
T +P+++DWR K AVTPIKDQ +CGCCWAFSAVAA EGI +S LI LSEQ++VDC
Sbjct: 123 -TAIPSTVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALSAGKLISLSEQEVVDCD 181
Query: 193 TNG-NNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAA---AKISNYEE 248
T G + GC GG M+ AF++IIQN G+ E YPY+AV G C+A KAAA A I+ YE+
Sbjct: 182 TKGEDQGCAGGFMDGAFKFIIQNHGLNNEPNYPYKAVDGKCNA--KAAANHVATITGYED 239
Query: 249 VPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDG 308
VP +E+AL KAV+ QPVS+ I A ++F+ Y+ G+F G CGT+LDH VT VG+G + DG
Sbjct: 240 VPVNNEKALQKAVANQPVSVAIDASGSDFQFYQSGVFTGSCGTELDHGVTAVGYGVSADG 299
Query: 309 ANYWLIKNSWGDTWGDAGYMKILR----DEGLCGIGTQSSYPLA 348
YWL+KNSWG WG+ GY+++ R +EGLCGI +SYP A
Sbjct: 300 TEYWLVKNSWGTEWGEEGYIRMQRGVKAEEGLCGIAMMASYPTA 343
>gi|400180422|gb|AFP73349.1| cysteine protease [Solanum chmielewskii]
Length = 344
Score = 348 bits (892), Expect = 3e-93, Method: Compositional matrix adjust.
Identities = 171/345 (49%), Positives = 238/345 (68%), Gaps = 13/345 (3%)
Query: 10 SFKINTIPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFK 69
+ K++ + + I + ++S + RS + SV E HE WM++HGR YKDE+EK RF
Sbjct: 2 AMKVDLMNILITLFFVISIFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSE 185
FK +LS D+P++LDWR+ AVT +K Q +CGCCWAFSAV ++EG KI+ NL++ SE
Sbjct: 120 FKTNDLSDDDMPSNLDWRESGAVTQVKHQGQCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISN 245
Q+L+DC+TN N GC GG M AF++II+N GI+ E +Y Y Q TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQYTCRSQEKTAAVQISS 238
Query: 246 YEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
Y+ VP G E +LL+AV+ QPVSIGIAA + + + Y G ++G C +++HAVT +G+GT
Sbjct: 239 YQVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYSGGTYDGSCADRINHAVTAIGYGTD 296
Query: 306 EDGANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYP 346
E+G YWL+KNSWG +WG+ G+MKI+RD GLC I SSYP
Sbjct: 297 EEGQKYWLLKNSWGTSWGENGFMKIIRDSGDPSGLCDIAKMSSYP 341
>gi|400180426|gb|AFP73351.1| cysteine protease [Solanum corneliomuelleri]
Length = 344
Score = 347 bits (891), Expect = 3e-93, Method: Compositional matrix adjust.
Identities = 171/345 (49%), Positives = 237/345 (68%), Gaps = 13/345 (3%)
Query: 10 SFKINTIPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFK 69
+ K++ + + I + ++S + RS + SV E HE WM++HGR YKDE+EK RF
Sbjct: 2 AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSE 185
FK +LS D+P++LDWR+ AVT +K Q CGCCWAFSAV ++EG KI+ NL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISN 245
Q+L+DC+TN N GC GG M AF++II+N GI+ E +Y Y Q TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQYTCRSQEKTAAVQISS 238
Query: 246 YEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
Y+ VP G E +LL+AV+ QPVSIGIAA + + + Y G ++G C +++HAVT +G+GT
Sbjct: 239 YQVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTD 296
Query: 306 EDGANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYP 346
E+G YWL+KNSWG +WG+ G+MKI+RD GLC I SSYP
Sbjct: 297 ENGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|400180428|gb|AFP73352.1| cysteine protease [Solanum corneliomuelleri]
Length = 344
Score = 347 bits (891), Expect = 4e-93, Method: Compositional matrix adjust.
Identities = 171/345 (49%), Positives = 237/345 (68%), Gaps = 13/345 (3%)
Query: 10 SFKINTIPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFK 69
+ K++ + + I + ++S + +RS + SV E HE WM++HGR YKDE+EK RF
Sbjct: 2 AMKVDLMNILITLFFVISMFNTQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSE 185
FK +LS D+P++LDWR+ AVT +K Q CGCCWAFSAV ++EG KI+ NL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISN 245
Q+L+DC+TN N GC GG M AF++II+N GI+ E +Y Y Q TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCRSQEKTAAVQISS 238
Query: 246 YEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
Y+ VP G E +LL+AV+ QPVSIGIAA + + + Y G ++G C +++HAVT +G+GT
Sbjct: 239 YQVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTD 296
Query: 306 EDGANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYP 346
E G YWL+KNSWG +WG+ G+MKI+RD GLC I SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|400180357|gb|AFP73317.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 347 bits (891), Expect = 4e-93, Method: Compositional matrix adjust.
Identities = 171/345 (49%), Positives = 237/345 (68%), Gaps = 13/345 (3%)
Query: 10 SFKINTIPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFK 69
+ K++ + + I + ++S + +RS + SV E HE WM++HGR YKDE+EK RF
Sbjct: 2 AMKVDLMSILITLFFVISMFNSQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSE 185
FK +LS D+P++LDWR+ AVT +K Q CGCCWAFSAV ++EG KI+ NL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISN 245
Q+L+DC+TN N GC GG M AF++II+N GI+ E +Y Y Q TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQYTCRSQEKTAAVQISS 238
Query: 246 YEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
Y+ VP G E +LL+AV+ QPVSIGIAA + + + Y G ++G C +++HAVT +G+GT
Sbjct: 239 YKVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTD 296
Query: 306 EDGANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYP 346
E G YWL+KNSWG +WG+ G+MKI+RD GLC I SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGENGFMKIIRDYGNPSGLCDIAKMSSYP 341
>gi|400180441|gb|AFP73357.1| cysteine protease [Solanum habrochaites]
Length = 344
Score = 347 bits (890), Expect = 5e-93, Method: Compositional matrix adjust.
Identities = 169/345 (48%), Positives = 240/345 (69%), Gaps = 13/345 (3%)
Query: 10 SFKINTIPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFK 69
+ K++ + + I + ++S + +RS + SV E HE WM++HGR YKDE+EK RF
Sbjct: 2 AMKVDLMSILITLFFVISMFNSQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N F+D+T++EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSEEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSE 185
FK ++S D+P++LDWR+ AVT +K+Q +CGCCWAFSAV ++EG KI+ NL++ SE
Sbjct: 120 FKINDISDDDMPSNLDWRESGAVTQVKNQGQCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISN 245
Q+L+DC+TN N GC GG M AF++I +N GI+ E +Y Y Q TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIRENGGISRESDYEYLGQQYTCRSQEKTAAVQISS 238
Query: 246 YEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
Y+ VP G E +LL+AV+ QPVSIGIAA + + + Y G ++G C +++HAVT +G+GT
Sbjct: 239 YQVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCANRINHAVTAIGYGTD 296
Query: 306 EDGANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYP 346
E+G YWL+KNSWG +WG+ G+MKI+RD GLC I SSYP
Sbjct: 297 ENGQKYWLLKNSWGTSWGEKGFMKIIRDYGNPSGLCDIAKLSSYP 341
>gi|400180355|gb|AFP73316.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 347 bits (890), Expect = 5e-93, Method: Compositional matrix adjust.
Identities = 171/345 (49%), Positives = 236/345 (68%), Gaps = 13/345 (3%)
Query: 10 SFKINTIPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFK 69
+ K++ + + I + ++S + RS + SV E HE WM++HGR YKDE+EK RF
Sbjct: 2 AMKVDLMSILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSE 185
FK +LS D+P++LDWR+ AVT +K Q CGCCWAFSAV ++EG KI+ NL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISN 245
Q+L+DC+TN N GC GG M AF++II+N GI+ E +Y Y Q TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCRSQEKTAAVQISS 238
Query: 246 YEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
Y+ VP G E +LL+AV+ QPVSIGIAA + + + Y G ++G C +++HAVT +G+GT
Sbjct: 239 YQVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTD 296
Query: 306 EDGANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYP 346
E G YWL+KNSWG +WG+ G+MKI+RD GLC I SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|400180453|gb|AFP73363.1| cysteine protease [Solanum chilense]
Length = 344
Score = 347 bits (889), Expect = 6e-93, Method: Compositional matrix adjust.
Identities = 171/345 (49%), Positives = 237/345 (68%), Gaps = 13/345 (3%)
Query: 10 SFKINTIPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFK 69
+ K++ + + I + ++S + +RS + SV E HE WM++HGR YKDE+EK RF
Sbjct: 2 AMKVDLMNILITLFFVISMFNTQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPVSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSE 185
FK +LS D+P++LDWR+ AVT +K Q CGCCWAFSAV ++EG KI+ NL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISN 245
Q+L+DC+TN N GC GG M AF++II+N GI+ E +Y Y Q TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCRSQEKTAAVQISS 238
Query: 246 YEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
Y+ VP G E +LL+AV+ QPVSIGIAA + + + Y G ++G C +++HAVT +G+GT
Sbjct: 239 YQVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTD 296
Query: 306 EDGANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYP 346
E G YWL+KNSWG +WG+ G+MKI+RD GLC I SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|400180345|gb|AFP73311.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 347 bits (889), Expect = 6e-93, Method: Compositional matrix adjust.
Identities = 171/345 (49%), Positives = 237/345 (68%), Gaps = 13/345 (3%)
Query: 10 SFKINTIPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFK 69
+ K++ + + I + ++S + RS + SV E HE WM++HGR YKDE+EK RF
Sbjct: 2 AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSE 185
FK +LS D+P++LDWR+ AVT +K Q CGCCWAFSAV ++EG KI+ NL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISN 245
Q+L+DC+TN N GC GG M AF++II+N GI+ E +Y Y Q TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCDGGFMTNAFDFIIENGGISRESDYEYLGQQYTCRSQEKTAAVQISS 238
Query: 246 YEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
Y+ VP G E +LL+AV+ QPVSIGIAA + + + Y G ++G C +++HAVT +G+GT
Sbjct: 239 YQVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTD 296
Query: 306 EDGANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYP 346
E+G YWL+KNSWG +WG+ G+MKI+RD GLC I SSYP
Sbjct: 297 ENGQKYWLLKNSWGTSWGENGFMKIIRDYGNPAGLCDIAKMSSYP 341
>gi|242072572|ref|XP_002446222.1| hypothetical protein SORBIDRAFT_06g005410 [Sorghum bicolor]
gi|241937405|gb|EES10550.1| hypothetical protein SORBIDRAFT_06g005410 [Sorghum bicolor]
Length = 340
Score = 347 bits (889), Expect = 7e-93, Method: Compositional matrix adjust.
Identities = 165/336 (49%), Positives = 223/336 (66%), Gaps = 12/336 (3%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEY 77
+ I+ L + C + + + + ++V HE+WMAQ+ R YKD EK RF++FK N+++
Sbjct: 8 ILAILGLALFCGAALAARDLNDDSAMVARHEQWMAQYNRVYKDATEKAQRFEVFKANVKF 67
Query: 78 IEKANKEGNRTYKLGTNRFSDLTNDEFRALYT--GYKMPSPSHRSTTSSTFKYQNLSMTD 135
IE N GNR + LG N+F+DLTNDEFRA T G+K PSP T F+Y+N+S+
Sbjct: 68 IESFNAGGNRKFWLGVNQFADLTNDEFRATKTNKGFK-PSPVKVPTG---FRYENVSVDA 123
Query: 136 VPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNG 195
+P S+DWR K AVTPIKDQ +CGCCWAFSAVAA EGI KIS LI LSEQ+LVDC +G
Sbjct: 124 LPASIDWRTKGAVTPIKDQGQCGCCWAFSAVAATEGIVKISTDKLISLSEQELVDCDVHG 183
Query: 196 -NNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDE 254
+ GC GG M+ AF++II+N G+ TE YPY A G C + +AA I +E+VP+ DE
Sbjct: 184 EDQGCEGGLMDDAFKFIIKNGGLTTESSYPYTATDGKCKSGTNSAA-NIKGFEDVPANDE 242
Query: 255 QALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLI 314
AL+KAV+ QPVS+ + F+ Y G+ G CGT LDH + +G+G T DG YWL+
Sbjct: 243 AALMKAVANQPVSVAVDGGDMTFQLYSGGVMTGSCGTDLDHGIAAIGYGQTSDGTKYWLL 302
Query: 315 KNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYP 346
KNSWG TWG+ GY+++ +D G+CG+ + SYP
Sbjct: 303 KNSWGTTWGENGYLRMEKDISDKRGMCGLAMEPSYP 338
>gi|224081320|ref|XP_002306369.1| predicted protein [Populus trichocarpa]
gi|222855818|gb|EEE93365.1| predicted protein [Populus trichocarpa]
Length = 340
Score = 347 bits (889), Expect = 7e-93, Method: Compositional matrix adjust.
Identities = 172/348 (49%), Positives = 236/348 (67%), Gaps = 18/348 (5%)
Query: 11 FKINTIPMFIIIILLVSCAS--QVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRF 68
++ FI + LL + ++R+ + S+ E HE+WMAQ+GR YKD+ EKE R+
Sbjct: 1 MRLTKQSQFICLALLFVLGAWPSKSAARTLQDVSMYERHEQWMAQYGRVYKDDAEKETRY 60
Query: 69 KIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYK--MPSPSHRSTTSSTF 126
IFKEN+ I+ N + ++YKLG N+F+DL+N+EF+A +K M SP + F
Sbjct: 61 NIFKENVARIDAFNSQTGKSYKLGVNQFADLSNEEFKASRNRFKGHMCSPQ-----AGPF 115
Query: 127 KYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQ 186
+Y+N+S VP ++DWR K AVTP+KDQ +CGCCWAFSAVAA+EGI +++ LI LSEQ
Sbjct: 116 RYENVSA--VPATMDWRKKGAVTPVKDQGQCGCCWAFSAVAAMEGINQLTTGKLISLSEQ 173
Query: 187 QLVDCSTNG-NNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAA-AAKIS 244
++VDC T G + GC GG M+ AF++I QN+G+ TE YPY GTC+ ++A AAKI+
Sbjct: 174 EVVDCDTKGEDQGCNGGLMDDAFKFIEQNKGLTTEANYPYTGTDGTCNTQKEATHAAKIT 233
Query: 245 NYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGT 304
+E+VP+ E AL+KAV+ QPVS+ I A EF+ Y GIF G CGTQLDH VT VG+G
Sbjct: 234 GFEDVPANSEAALMKAVAKQPVSVAIDAGGFEFQFYSSGIFTGSCGTQLDHGVTAVGYGI 293
Query: 305 TEDGANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYPLA 348
+ DG YWL+KNSWG WG+ GY+++ +D EGLCGI Q+SYP A
Sbjct: 294 S-DGTKYWLVKNSWGAQWGEEGYIRMQKDISAKEGLCGIAMQASYPSA 340
>gi|400180347|gb|AFP73312.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 346 bits (888), Expect = 8e-93, Method: Compositional matrix adjust.
Identities = 171/345 (49%), Positives = 236/345 (68%), Gaps = 13/345 (3%)
Query: 10 SFKINTIPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFK 69
+ K++ + + I + ++S + RS + SV E HE WM++HGR YKDE+EK RF
Sbjct: 2 AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSE 185
FK +LS D+P++LDWR+ AVT +K Q CGCCWAFSAV ++EG KI+ NL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISN 245
Q+L+DC+TN N GC GG M AF++II+N GI+ E +Y Y Q TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCDGGFMTNAFDFIIENGGISRESDYEYLGQQYTCRSQEKTAAVQISS 238
Query: 246 YEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
Y+ VP G E +LL+AV+ QPVSIGIAA + + + Y G ++G C +++HAVT +G+GT
Sbjct: 239 YQVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTD 296
Query: 306 EDGANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYP 346
E G YWL+KNSWG +WG+ G+MKI+RD GLC I SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|400180389|gb|AFP73333.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 346 bits (888), Expect = 9e-93, Method: Compositional matrix adjust.
Identities = 171/345 (49%), Positives = 236/345 (68%), Gaps = 13/345 (3%)
Query: 10 SFKINTIPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFK 69
+ K++ + + I + ++S + RS + SV E HE WM++HGR YKDE+EK RF
Sbjct: 2 AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSE 185
FK +LS D+P++LDWR+ AVT +K Q CGCCWAFSAV ++EG KI+ NL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISN 245
Q+L+DC+TN N GC GG M AF++II+N GI+ E +Y Y Q TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCRSQEKTAAVQISS 238
Query: 246 YEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
Y+ VP G E +LL+AV+ QPVSIGIAA + + + Y G ++G C +++HAVT +G+GT
Sbjct: 239 YKVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTD 296
Query: 306 EDGANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYP 346
E G YWL+KNSWG +WG+ G+MKI+RD GLC I SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|356543038|ref|XP_003539970.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 346 bits (888), Expect = 9e-93, Method: Compositional matrix adjust.
Identities = 175/324 (54%), Positives = 227/324 (70%), Gaps = 13/324 (4%)
Query: 33 VSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLG 92
V+SR+ + S+ E HE+WMA++ + YKD E+E RFKIFKEN+ YIE N N+ YKLG
Sbjct: 25 VTSRTLQDASMYERHEEWMARYAKVYKDPEEREKRFKIFKENVNYIEAFNNAANKPYKLG 84
Query: 93 TNRFSDLTNDEFRALYTGYK--MPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTP 150
N+F+DLTN+EF A +K M S R+TT FKY+N+ T +P+++DWR K AVTP
Sbjct: 85 INQFADLTNEEFIAPRNRFKGHMCSSITRTTT---FKYENV--TALPSTVDWRQKGAVTP 139
Query: 151 IKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNG-NNGCGGGTMEKAFE 209
IKDQ +CGCCWAFSAVAA EGI ++ LI LSEQ++VDC T G + GC GG M+ AF+
Sbjct: 140 IKDQGQCGCCWAFSAVAATEGIHALNSGKLISLSEQEVVDCDTKGEDQGCAGGFMDGAFK 199
Query: 210 YIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAK-ISNYEEVPSGDEQALLKAVSMQPVSI 268
+IIQN G+ TE YPY+AV G C+A + A A I+ YE+VP +E+AL KAV+ QPVS+
Sbjct: 200 FIIQNHGLNTEANYPYKAVDGKCNANEAANHAATITGYEDVPVNNEKALQKAVANQPVSV 259
Query: 269 GIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYM 328
I A ++F+ YK G+F G CGTQLDH VT VG+G + DG YWL+KNSWG WG+ GY+
Sbjct: 260 AIDASGSDFQFYKTGVFTGSCGTQLDHGVTAVGYGVSADGTQYWLVKNSWGTEWGEEGYI 319
Query: 329 KILR----DEGLCGIGTQSSYPLA 348
+ R EGLCGI +SYP A
Sbjct: 320 MMQRGVKAQEGLCGIAMMASYPTA 343
>gi|400180365|gb|AFP73321.1| cysteine protease [Solanum peruvianum]
gi|400180395|gb|AFP73336.1| cysteine protease [Solanum peruvianum]
gi|400180405|gb|AFP73341.1| cysteine protease [Solanum peruvianum]
gi|400180409|gb|AFP73343.1| cysteine protease [Solanum peruvianum]
gi|400180411|gb|AFP73344.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 346 bits (888), Expect = 9e-93, Method: Compositional matrix adjust.
Identities = 171/345 (49%), Positives = 236/345 (68%), Gaps = 13/345 (3%)
Query: 10 SFKINTIPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFK 69
+ K++ + + I + ++S + RS + SV E HE WM++HGR YKDE+EK RF
Sbjct: 2 AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSE 185
FK +LS D+P++LDWR+ AVT +K Q CGCCWAFSAV ++EG KI+ NL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISN 245
Q+L+DC+TN N GC GG M AF++II+N GI+ E +Y Y Q TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQYTCRSQEKTAAVQISS 238
Query: 246 YEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
Y+ VP G E +LL+AV+ QPVSIGIAA + + + Y G ++G C +++HAVT +G+GT
Sbjct: 239 YKVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTD 296
Query: 306 EDGANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYP 346
E G YWL+KNSWG +WG+ G+MKI+RD GLC I SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|255557851|ref|XP_002519955.1| cysteine protease, putative [Ricinus communis]
gi|223541001|gb|EEF42559.1| cysteine protease, putative [Ricinus communis]
Length = 321
Score = 346 bits (887), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 173/332 (52%), Positives = 228/332 (68%), Gaps = 32/332 (9%)
Query: 21 IIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEK 80
++++ + ASQ ++ + +E ++VE HE+WMA+HGR+Y+D EKE RF+IFK NLEYI+
Sbjct: 13 LLVVFSTWASQAMARQLINEDALVEKHEQWMARHGRTYQDSEEKERRFQIFKSNLEYIDN 72
Query: 81 ANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSL 140
NK N+TY+LG N F+DL+++E+ A YT KMP +VP S+
Sbjct: 73 FNKASNQTYQLGLNNFADLSHEEYVATYTARKMP-------------------VEVPESI 113
Query: 141 DWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCG 200
DWRD AVTPIK+Q +CGCCWAFSA AAVEGI AN + LS QQL+DC ++ N GC
Sbjct: 114 DWRDHGAVTPIKNQYQCGCCWAFSAAAAVEGIV----ANGVSLSAQQLLDCVSD-NQGCK 168
Query: 201 GGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKA 260
GG M AF YIIQNQGIA E +YPYQ +Q CS+ + AAA+IS +E+V DE+AL++A
Sbjct: 169 GGWMNNAFNYIIQNQGIALETDYPYQQMQQMCSS--RMAAAQISGFEDVTPKDEEALMRA 226
Query: 261 VSMQPVSIGIAAYTT-EFKSYKEGIFNGV-CGTQLDHAVTIVGFGTTEDGANYWLIKNSW 318
V+ QPVS+ I A + FK YKEG+F CG HAVT+VG+GT+EDG YWL KNSW
Sbjct: 227 VAKQPVSVTIDATSNPNFKLYKEGVFTAAGCGNGHSHAVTLVGYGTSEDGTKYWLAKNSW 286
Query: 319 GDTWGDAGYMKILRDEGL----CGIGTQSSYP 346
G+TWG++GYM++ RD GL CGI +SYP
Sbjct: 287 GETWGESGYMRLQRDIGLEGGPCGIALYASYP 318
>gi|400180367|gb|AFP73322.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 346 bits (887), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 171/345 (49%), Positives = 236/345 (68%), Gaps = 13/345 (3%)
Query: 10 SFKINTIPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFK 69
+ K++ + + I + ++S + RS + SV E HE WM++HGR YKDE+EK RF
Sbjct: 2 AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKVERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSE 185
FK +LS D+P++LDWR+ AVT +K Q CGCCWAFSAV ++EG KI+ NL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISN 245
Q+L+DC+TN N GC GG M AF++II+N GI+ E +Y Y Q TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCRSQEKTAAVQISS 238
Query: 246 YEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
Y+ VP G E +LL+AV+ QPVSIGIAA + + + Y G ++G C +++HAVT +G+GT
Sbjct: 239 YKVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTD 296
Query: 306 EDGANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYP 346
E G YWL+KNSWG +WG+ G+MKI+RD GLC I SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGENGFMKIIRDYGNPAGLCDIAKMSSYP 341
>gi|400180451|gb|AFP73362.1| cysteine protease [Solanum chilense]
Length = 344
Score = 346 bits (887), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 171/345 (49%), Positives = 236/345 (68%), Gaps = 13/345 (3%)
Query: 10 SFKINTIPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFK 69
+ K++ + + I + ++S + RS E SV E HE WM++HGR YKDE+EK RF
Sbjct: 2 AMKVDLMNILITLFFVISMFNTQTRGRSQPELSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSE 185
FK +LS D+P++LDWR+ AVT +K Q CGCCWAFSAV ++EG KI+ NL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISN 245
Q+L+DC+TN N GC GG M AF++I +N GI++E +Y Y Q TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCDGGFMTNAFDFIKENGGISSESDYEYLGQQYTCRSQEKTAAVQISS 238
Query: 246 YEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
Y+ VP G E +LL+AV+ QPVSIGIAA + + + Y G ++G C +++HAVT +G+GT
Sbjct: 239 YQVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTD 296
Query: 306 EDGANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYP 346
E G YWL+KNSWG +WG+ G+MKI+RD GLC I SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGENGFMKIIRDSGDPSGLCDIAKMSSYP 341
>gi|400180445|gb|AFP73359.1| cysteine protease, partial [Solanum chilense]
Length = 345
Score = 346 bits (887), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 171/345 (49%), Positives = 236/345 (68%), Gaps = 13/345 (3%)
Query: 10 SFKINTIPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFK 69
+ K++ + + I + ++S + RS E SV E HE WM++HGR YKDE+EK RF
Sbjct: 2 AMKVDLMNILITLFFVISMFNTQTRGRSQPELSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSE 185
FK +LS D+P++LDWR+ AVT +K Q CGCCWAFSAV ++EG KI+ NL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISN 245
Q+L+DC+TN N GC GG M AF++I +N GI++E +Y Y Q TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCDGGFMTNAFDFIKENGGISSESDYEYLGQQYTCRSQEKTAAVQISS 238
Query: 246 YEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
Y+ VP G E +LL+AV+ QPVSIGIAA + + + Y G ++G C +++HAVT +G+GT
Sbjct: 239 YQVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTD 296
Query: 306 EDGANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYP 346
E G YWL+KNSWG +WG+ G+MKI+RD GLC I SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGENGFMKIIRDSGDPSGLCDIAKMSSYP 341
>gi|400180435|gb|AFP73355.1| cysteine protease [Solanum pennellii]
Length = 344
Score = 346 bits (887), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 170/345 (49%), Positives = 238/345 (68%), Gaps = 13/345 (3%)
Query: 10 SFKINTIPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFK 69
+ K++ + + I + ++S + +RS + SV E HE WM++HGR YKDE+EK RF
Sbjct: 2 AMKVDLMSILITLFFVISMFNTQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYVSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSE 185
FK +LS D+P++LDWR+ AVT +K+Q +CGCCWAFSAV ++EG KI+ NL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKNQGQCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISN 245
Q+L+DC+TN N GC GG M AF++I +N GI+ E +Y Y Q TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGQQYTCRSQEKTAAVQISS 238
Query: 246 YEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
Y+ VP G E +LL+AV+ QPVSIGIAA + + + Y G ++G C +++HAVT +G+GT
Sbjct: 239 YQVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCANRINHAVTAIGYGTD 296
Query: 306 EDGANYWLIKNSWGDTWGDAGYMKILRDE----GLCGIGTQSSYP 346
E G YWL+KNSWG +WG+ G+MKI+RD GLC I SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGEDGFMKIIRDSGNPAGLCDIAKVSSYP 341
>gi|400180449|gb|AFP73361.1| cysteine protease [Solanum chilense]
Length = 344
Score = 345 bits (886), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 170/345 (49%), Positives = 237/345 (68%), Gaps = 13/345 (3%)
Query: 10 SFKINTIPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFK 69
+ K++ + + I + ++S + RS + SV E HE WM++HGR YKDE+EK RF
Sbjct: 2 AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFK+N+++IE NK GN +YKLG N F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKKNMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSE 185
FK +LS D+P++LDWR+ AVT +K Q +CGCCWAFSAV ++EG KI+ L++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGQCGCCWAFSAVGSLEGAYKIATGKLMEFSE 179
Query: 186 QQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISN 245
Q+L+DC+TN N GC GG M AF++II+N GI+ E +Y Y Q TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCRSQEKTAAVQISS 238
Query: 246 YEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
Y+ VP G E +LL+AV+ QPVSIGIAA + + + Y EG ++G C +++HAVT +G+GT
Sbjct: 239 YQVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAEGTYDGSCADRINHAVTAIGYGTD 296
Query: 306 EDGANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYP 346
E G YWL+KNSWG +WG+ G+MKI+RD GLC I SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|5823018|gb|AAD53011.1|AF089848_1 senescence-specific cysteine protease [Brassica napus]
Length = 346
Score = 345 bits (886), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 171/343 (49%), Positives = 227/343 (66%), Gaps = 10/343 (2%)
Query: 13 INTIPMFIIIILLVS-CASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIF 71
+ I +F+I+ L+ S C S +S E + + H++WMA+HGR+Y D EK R+ +F
Sbjct: 3 LEHIKIFLIVSLVSSFCFSTTLSRLLDDELIMQKKHDEWMAEHGRTYADMNEKNNRYVVF 62
Query: 72 KENLEYIEKANK-EGNRTYKLGTNRFSDLTNDEFRALYTGYK--MPSPSHRSTTSSTFKY 128
K N+E IE+ N RT+KL N+F+DLTNDEFR +YTGYK S T S++F+Y
Sbjct: 63 KRNVERIERLNNVPAGRTFKLAVNQFADLTNDEFRFMYTGYKGDFVLFSQSQTKSTSFRY 122
Query: 129 QNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQL 188
QN+ +P ++DWR K AVTPIK+Q CGCCWAFSAVAA+EG T+I LI LSEQQL
Sbjct: 123 QNVFFGALPIAVDWRKKGAVTPIKNQGSCGCCWAFSAVAAIEGATQIKKGKLISLSEQQL 182
Query: 189 VDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCS-AAQKAAAAKISNYE 247
VDC TN + GC GG M+ AFE+I+ G+ TE YPY+ C + K +AA I+ YE
Sbjct: 183 VDCDTN-DFGCSGGLMDTAFEHIMATGGLTTESNYPYKGEDANCKIKSTKPSAASITGYE 241
Query: 248 EVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTED 307
+VP DE AL+KAV+ QPVS+GI +F+ Y G+F G C T LDHAVT VG+ +
Sbjct: 242 DVPVNDENALMKAVAHQPVSVGIEGGGFDFQFYSSGVFTGECTTYLDHAVTAVGYSQSSA 301
Query: 308 GANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYP 346
G+ YW+IKNSWG WG+ GYM+I +D EGLCG+ ++SYP
Sbjct: 302 GSKYWIIKNSWGTKWGEGGYMRIKKDIKDKEGLCGLAMKASYP 344
>gi|400180381|gb|AFP73329.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 345 bits (886), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 171/345 (49%), Positives = 236/345 (68%), Gaps = 13/345 (3%)
Query: 10 SFKINTIPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFK 69
+ K++ + + I + ++S + RS + SV E HE WM++HGR YKDE+EK RF
Sbjct: 2 AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSE 185
FK +LS D+P++LDWR+ AVT +K Q CGCCWAFSAV ++EG KI+ NL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISN 245
Q+L+DC+TN N GC GG M AF++II+N GI+ E +Y Y Q TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQYTCRSQEKTAAVQISS 238
Query: 246 YEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
Y+ VP G E +LL+AV+ QPVSIGIAA + + + Y G ++G C +++HAVT +G+GT
Sbjct: 239 YKVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTD 296
Query: 306 EDGANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYP 346
E G YWL+KNSWG +WG+ G+MKI+RD GLC I SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGENGFMKIIRDSGDPSGLCDITKMSSYP 341
>gi|400180375|gb|AFP73326.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 345 bits (886), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 170/345 (49%), Positives = 237/345 (68%), Gaps = 13/345 (3%)
Query: 10 SFKINTIPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFK 69
+ K++ + + I + ++S + +RS + SV E HE WM++HGR YKDE+EK RF
Sbjct: 2 AMKVDLMSILITLFFVISMFNSQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENIKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSE 185
FK +LS D+P++LDWR+ AVT +K Q CGCCWAFSAV ++EG KI+ NL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISN 245
Q+L+DC+TN N GC GG M AF++I +N GI++E +Y Y Q TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCDGGFMTNAFDFIKENGGISSESDYEYLGEQYTCRSQEKTAAVQISS 238
Query: 246 YEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
Y+ VP G E +LL+AV+ QPVSIGIAA + + + Y G ++G C +++HAVT +G+GT
Sbjct: 239 YQVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTD 296
Query: 306 EDGANYWLIKNSWGDTWGDAGYMKILRDE----GLCGIGTQSSYP 346
E G YWL+KNSWG +WG+ G+MKI+RD GLC I SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPAGLCDIAKMSSYP 341
>gi|255563134|ref|XP_002522571.1| cysteine protease, putative [Ricinus communis]
gi|223538262|gb|EEF39871.1| cysteine protease, putative [Ricinus communis]
Length = 343
Score = 345 bits (885), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 172/337 (51%), Positives = 229/337 (67%), Gaps = 15/337 (4%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEY 77
+ ++++L + SQ + + +++ E HE+WMA+HGR+Y D EKE RF+IFK NL+Y
Sbjct: 11 VITLLMILGTWVSQAMPRPLLNAEAIAEKHEQWMARHGRTYHDNAEKERRFQIFKNNLDY 70
Query: 78 IEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPS--PSHRSTTSSTFKYQNLSMTD 135
IE NK N+TYKLG N+FSDL+ +EF Y GY+MP+ P+ +T TF + +
Sbjct: 71 IENFNKAFNKTYKLGLNKFSDLSEEEFVTTYNGYEMPTTLPTANTTVKPTFFSNYYNQDE 130
Query: 136 VPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNG 195
VP S+DWR+ VT +K+Q ECGCCWAFSAVAAVEGI N LS QQL+DC
Sbjct: 131 VPESIDWRENGVVTSVKNQGECGCCWAFSAVAAVEGI----AGNGASLSAQQLLDC-VGD 185
Query: 196 NNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQ 255
N+GCGGGTM KAFEYI+QNQGI ++ +YPY+ Q C + AA+I+ YE V E+
Sbjct: 186 NSGCGGGTMIKAFEYIVQNQGIVSDTDYPYEQTQEMCRSGSN-VAARITGYESVIQ-SEE 243
Query: 256 ALLKAVSMQPVSIGIAAYT-TEFKSYKEGIFNGV-CGTQLDHAVTIVGFGTTEDGANYWL 313
AL +AV+ QP+S+ I A + FKSY G+F+ CGT L HAVT+VG+GTTEDG YWL
Sbjct: 244 ALKRAVAKQPISVAIDASSGPNFKSYISGVFSAEDCGTHLTHAVTLVGYGTTEDGTKYWL 303
Query: 314 IKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYP 346
+KNSWG+ WG++GYM++ RD EG CGI Q+SYP
Sbjct: 304 VKNSWGEEWGESGYMRLQRDVGAMEGPCGIAMQASYP 340
>gi|144905112|dbj|BAF56429.1| cysteine proteinase [Lotus japonicus]
Length = 341
Score = 345 bits (885), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 178/337 (52%), Positives = 222/337 (65%), Gaps = 13/337 (3%)
Query: 19 FIIIILLVSCASQVVSSRSTHEQ--SVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLE 76
+ + LL++ V SR HE S++E HE+WMA++ + YKD EKE RF IFK+N+E
Sbjct: 11 ILALFLLLAVGISRVISRELHETETSLIERHEQWMAKYDKVYKDAAEKEKRFLIFKDNVE 70
Query: 77 YIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDV 136
+IE N GN+ YKLG N +DLT +EF+A G K TTS FKY+N+ T +
Sbjct: 71 FIESFNAAGNKPYKLGVNHLADLTIEEFKASRNGLKRSYDYEVGTTS--FKYENV--TAI 126
Query: 137 PTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNG- 195
P S+DWR K AVTPIKDQ +CG CWAFS VAA EGI KIS L+ LSEQ+LVDC G
Sbjct: 127 PASVDWRKKGAVTPIKDQGQCGSCWAFSTVAATEGIHKISTGKLVSLSEQELVDCDRKGT 186
Query: 196 NNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQ 255
+ GC GG ME FE+II+N GI TE YPY+AV G+C A A AA+I YE+VP E+
Sbjct: 187 DQGCEGGYMEDGFEFIIKNGGITTEANYPYKAVDGSCKNAT-APAAQIKGYEKVPVNSEK 245
Query: 256 ALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIK 315
ALLKAV+ QPVS+ I A F Y GIF G CGT+LDH VT VG+G +G +YW++K
Sbjct: 246 ALLKAVANQPVSVSIDAADGSFMFYSSGIFTGECGTELDHGVTAVGYGRA-NGTDYWIVK 304
Query: 316 NSWGDTWGDAGYMKILR----DEGLCGIGTQSSYPLA 348
NSWG WG+ GY+++ R EGLCGI SSYP A
Sbjct: 305 NSWGTVWGEQGYIRMQRGIAAKEGLCGIAMDSSYPTA 341
>gi|400180359|gb|AFP73318.1| cysteine protease [Solanum peruvianum]
gi|400180477|gb|AFP73375.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 345 bits (884), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 170/345 (49%), Positives = 236/345 (68%), Gaps = 13/345 (3%)
Query: 10 SFKINTIPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFK 69
+ K++ + + I + ++S + +RS + SV E HE WM++HGR YKDE+EK RF
Sbjct: 2 AMKVDLMSILITLFFVISMFNSQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSE 185
FK +LS D+P++LDWR+ AVT +K Q CGCCWAFSAV ++EG KI+ NL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISN 245
Q+L+DC+TN N GC GG M AF++I +N GI+ E +Y Y Q TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQEKTAAVQISS 238
Query: 246 YEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
Y+ VP G E +LL+AV+ QPVSIGIAA + + + Y G ++G C +++HAVT +G+GT
Sbjct: 239 YQVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTD 296
Query: 306 EDGANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYP 346
E G YWL+KNSWG +WG+ G+MKI+RD GLC I SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGENGFMKIIRDYGNPAGLCDIAKMSSYP 341
>gi|110737404|dbj|BAF00646.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 345
Score = 345 bits (884), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 166/337 (49%), Positives = 231/337 (68%), Gaps = 9/337 (2%)
Query: 20 IIIILLVSCASQVVSSRST--HEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEY 77
++IIL +SR+ EQS+V+ HE+WMA+ R Y+DELEK MR +FK+NL++
Sbjct: 10 VLIILFTGFRISQATSRTVIFREQSMVDKHEQWMARFSREYRDELEKNMRRDVFKKNLKF 69
Query: 78 IEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYK-MPSPSHRSTTSSTFKYQNLSMTD- 135
IE NK+GN++YKLG N F+D TN+EF A++TG K + S + T Q +++D
Sbjct: 70 IENFNKKGNKSYKLGVNEFADWTNEEFLAIHTGLKGLTEVSPSKVVAKTISSQTWNVSDM 129
Query: 136 VPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNG 195
V S DWR + AVTP+K Q +CGCCWAFSAVAAVEG+ KI+G NL+ LSEQQL+DC
Sbjct: 130 VVESKDWRAEGAVTPVKYQGQCGCCWAFSAVAAVEGVAKIAGGNLVSLSEQQLLDCDREY 189
Query: 196 NNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQ 255
+ C GG M AF Y++QN+GIA+E++Y YQ G C + + AA+IS ++ VPS +E+
Sbjct: 190 DRDCDGGIMSDAFNYVVQNRGIASENDYSYQGSDGGCRSNAR-PAARISGFQTVPSNNER 248
Query: 256 ALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIK 315
ALL+AVS QPVS+ + A F Y G+++G CGT +HAVT VG+GT++DG YWL K
Sbjct: 249 ALLEAVSRQPVSVSMDATGDGFMHYSGGVYDGPCGTSSNHAVTFVGYGTSQDGTKYWLAK 308
Query: 316 NSWGDTWGDAGYMKILRD----EGLCGIGTQSSYPLA 348
NSWG+TW + GY++I RD +G+CG+ + YP+A
Sbjct: 309 NSWGETWEEKGYIRIRRDVAWPQGMCGVAQYAFYPVA 345
>gi|20334373|gb|AAM19207.1|AF493232_1 cysteine protease [Solanum pimpinellifolium]
gi|400180424|gb|AFP73350.1| cysteine protease [Solanum pimpinellifolium]
gi|400180433|gb|AFP73354.1| cysteine protease [Solanum lycopersicum]
Length = 344
Score = 345 bits (884), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 171/345 (49%), Positives = 236/345 (68%), Gaps = 13/345 (3%)
Query: 10 SFKINTIPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFK 69
+ K++ + + I + ++S + RS + SV E HE WM++HGR YKDE+EK RF
Sbjct: 2 AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSE 185
FK +LS +P++LDWR+ AVT +K Q CGCCWAFSAV ++EG KI+ NL++ SE
Sbjct: 120 FKINDLSDDYMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISN 245
Q+L+DC+TN N GC GG M AF++II+N GI+ E +Y Y Q TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCNGGLMTNAFDFIIENGGISRESDYEYLGEQYTCRSREKTAAVQISS 238
Query: 246 YEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
Y+ VP G E +LL+AV+ QPVSIGIAA + + + Y G ++G C Q++HAVT +G+GT
Sbjct: 239 YKVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGNCADQINHAVTAIGYGTD 296
Query: 306 EDGANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYP 346
E+G YWL+KNSWG +WG+ G+MKI+RD GLC I SSYP
Sbjct: 297 EEGQKYWLLKNSWGTSWGENGFMKIIRDSGDPSGLCDIAKMSSYP 341
>gi|356543076|ref|XP_003539989.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 345 bits (884), Expect = 3e-92, Method: Compositional matrix adjust.
Identities = 174/324 (53%), Positives = 227/324 (70%), Gaps = 13/324 (4%)
Query: 33 VSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLG 92
V+SR+ + S+ E HE+WMA++ + YKD E+E RFKIFKEN+ YIE N ++ YKLG
Sbjct: 25 VTSRTLQDASMYERHEEWMARYAKVYKDPEEREKRFKIFKENVNYIEAFNNAADKPYKLG 84
Query: 93 TNRFSDLTNDEFRALYTGYK--MPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTP 150
N+F+DLTN+EF A +K M S R+TT FKY+N+ T +P+++DWR K AVTP
Sbjct: 85 INQFADLTNEEFIAPRNKFKGHMCSSITRTTT---FKYENV--TALPSTVDWRQKGAVTP 139
Query: 151 IKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNG-NNGCGGGTMEKAFE 209
IKDQ +CGCCWAFSAVAA EGI ++ LI LSEQ++VDC T G + GC GG M+ AF+
Sbjct: 140 IKDQGQCGCCWAFSAVAATEGIHALNSGKLISLSEQEVVDCDTKGEDQGCAGGFMDGAFK 199
Query: 210 YIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAK-ISNYEEVPSGDEQALLKAVSMQPVSI 268
+IIQN G+ TE YPY+AV G C+A + A A I+ YE+VP +E+AL KAV+ QPVS+
Sbjct: 200 FIIQNHGLNTEANYPYKAVDGKCNANEAANHAATITGYEDVPVNNEKALQKAVANQPVSV 259
Query: 269 GIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYM 328
I A ++F+ YK G+F G CGTQLDH VT VG+G + DG YWL+KNSWG WG+ GY+
Sbjct: 260 AIDASGSDFQFYKTGVFTGSCGTQLDHGVTAVGYGVSADGTQYWLVKNSWGTEWGEEGYI 319
Query: 329 KILR----DEGLCGIGTQSSYPLA 348
+ R EGLCGI +SYP A
Sbjct: 320 MMQRGVKAQEGLCGIAMMASYPTA 343
>gi|356517358|ref|XP_003527354.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
gi|356577767|ref|XP_003556994.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 343
Score = 344 bits (883), Expect = 3e-92, Method: Compositional matrix adjust.
Identities = 175/344 (50%), Positives = 234/344 (68%), Gaps = 20/344 (5%)
Query: 18 MFIIIILLVSCASQV---VSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKEN 74
+ I + L+ C+ + V+ R+ + S+ E HE+WM ++ + YKD E+E RFKIFKEN
Sbjct: 7 FYQISLALLFCSGFLAFQVTCRTLQDASMYERHEEWMGRYAKVYKDPQERERRFKIFKEN 66
Query: 75 LEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYK--MPSPSHRSTTSSTFKYQNLS 132
+ YIE N N+ Y LG N+F+DLTN+EF A +K M S R+TT FKY+N+
Sbjct: 67 VNYIEAFNNAANKPYTLGINQFADLTNEEFIAPRNRFKGHMCSSITRTTT---FKYENV- 122
Query: 133 MTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCS 192
T +P+++DWR K AVTPIKDQ +CGCCWAFSAVAA EGI +S LI LSEQ++VDC
Sbjct: 123 -TAIPSTVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALSAGKLISLSEQEVVDCD 181
Query: 193 TNG-NNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAA---AKISNYEE 248
T G + GC GG M+ AF++IIQN G+ E YPY+AV G C+A KAAA A I+ YE+
Sbjct: 182 TKGEDQGCAGGFMDGAFKFIIQNHGLNNEPNYPYKAVDGKCNA--KAAANHVATITGYED 239
Query: 249 VPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDG 308
VP +E+AL KAV+ QPVS+ I A ++F+ Y+ G+F G CGT+LDH VT VG+G + DG
Sbjct: 240 VPVNNEKALQKAVANQPVSVAIDASGSDFQFYQSGVFTGSCGTELDHGVTAVGYGVSADG 299
Query: 309 ANYWLIKNSWGDTWGDAGYMKILR----DEGLCGIGTQSSYPLA 348
YWL+KNSWG WG+ GY+++ R +EGL GI +SYP A
Sbjct: 300 TEYWLVKNSWGTEWGEEGYIRMQRGVKAEEGLXGIAMMASYPTA 343
>gi|400180463|gb|AFP73368.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 344 bits (883), Expect = 3e-92, Method: Compositional matrix adjust.
Identities = 170/345 (49%), Positives = 236/345 (68%), Gaps = 13/345 (3%)
Query: 10 SFKINTIPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFK 69
+ K++ + + I + ++S + +RS + SV E HE WM++HGR YKDE+EK RF
Sbjct: 2 AMKVDLMNILITLFFVISMFNTQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSE 185
FK +LS D+P++LDWR+ AVT +K Q CGCCWAFSAV ++EG KI+ NL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISN 245
Q+L+DC+TN N GC GG M AF++I +N GI+ E +Y Y Q TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQEKTAAVQISS 238
Query: 246 YEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
Y+ VP G E +LL+AV+ QPVSIGIAA + + + Y G ++G C +++HAVT +G+GT
Sbjct: 239 YQVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTD 296
Query: 306 EDGANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYP 346
E G YWL+KNSWG +WG+ G+MKI+RD GLC I SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGENGFMKIIRDSGDPSGLCDIAKMSSYP 341
>gi|255580659|ref|XP_002531152.1| cysteine protease, putative [Ricinus communis]
gi|223529265|gb|EEF31237.1| cysteine protease, putative [Ricinus communis]
Length = 340
Score = 344 bits (882), Expect = 4e-92, Method: Compositional matrix adjust.
Identities = 173/335 (51%), Positives = 235/335 (70%), Gaps = 15/335 (4%)
Query: 21 IIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEK 80
+I L + ASQ ++ R+ + S+ E HE+WM + R Y D EKE+R+KIFKEN++ IE
Sbjct: 14 LIFFLGALASQAIA-RTLQDASIHEKHEEWMTRFKRVYSDAKEKEIRYKIFKENVQRIES 72
Query: 81 ANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHR-STTSSTFKYQNLSMTDVPTS 139
NK ++YKLG N+F+DLTN+EF+ +K H S+ + F+Y+N+ T VP+S
Sbjct: 73 FNKASEKSYKLGINQFADLTNEEFKTSRNRFK----GHMCSSQAGPFRYENI--TAVPSS 126
Query: 140 LDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNG-NNG 198
+DWR + AVT IKDQ +CG CWAFSAVAAVEGIT+++ + LI LSEQ+LVDC T G + G
Sbjct: 127 MDWRKEGAVTAIKDQGQCGSCWAFSAVAAVEGITQLATSKLISLSEQELVDCDTKGEDQG 186
Query: 199 CGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKA-AAAKISNYEEVPSGDEQAL 257
C GG M+ AF++I QNQG+ TE YPY+ GTC+ Q+A AAKI+ +E+VP+ +E AL
Sbjct: 187 CQGGLMDDAFKFIEQNQGLTTEANYPYEGSDGTCNTKQEANHAAKINGFEDVPANNEGAL 246
Query: 258 LKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNS 317
+KAV+ QPVS+ I A EF+ Y GIF G CGT+LDH V VG+G + +G NYWL+KNS
Sbjct: 247 MKAVAKQPVSVAIDAGGFEFQFYSSGIFTGDCGTELDHGVAAVGYGES-NGMNYWLVKNS 305
Query: 318 WGDTWGDAGYMKILRD----EGLCGIGTQSSYPLA 348
WG WG+ GY+++ +D EGLCGI Q+SYP A
Sbjct: 306 WGTQWGEEGYIRMQKDIDAKEGLCGIAMQASYPTA 340
>gi|400180467|gb|AFP73370.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 344 bits (882), Expect = 4e-92, Method: Compositional matrix adjust.
Identities = 170/345 (49%), Positives = 236/345 (68%), Gaps = 13/345 (3%)
Query: 10 SFKINTIPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFK 69
+ K++ + + I + ++S + +RS + SV E HE WM++HGR YKDE+EK RF
Sbjct: 2 AMKVDLMNILITLFFVISMFNTQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSE 185
FK +LS D+P++LDWR+ AVT +K Q CGCCWAFSAV ++EG KI+ NL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISN 245
Q+L+DC+TN N GC GG M AF++I +N GI+ E +Y Y Q TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQEKTAAVQISS 238
Query: 246 YEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
Y+ VP G E +LL+AV+ QPVSIGIAA + + + Y G ++G C +++HAVT +G+GT
Sbjct: 239 YQVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTD 296
Query: 306 EDGANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYP 346
E G YWL+KNSWG +WG+ G+MKI+RD GLC I SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGENGFMKIIRDYGNPAGLCDIAKMSSYP 341
>gi|400180393|gb|AFP73335.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 344 bits (882), Expect = 4e-92, Method: Compositional matrix adjust.
Identities = 170/345 (49%), Positives = 235/345 (68%), Gaps = 13/345 (3%)
Query: 10 SFKINTIPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFK 69
+ K++ + + I + ++S + RS + SV E HE WM++HGR YKDE+EK RF
Sbjct: 2 AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKVERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSE 185
K +LS D+P++LDWR+ AVT +K Q CGCCWAFSAV ++EG KI+ NL++ SE
Sbjct: 120 LKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISN 245
Q+L+DC+TN N GC GG M AF++II+N GI+ E +Y Y Q TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCRSQEKTAAVQISS 238
Query: 246 YEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
Y+ VP G E +LL+AV+ QPVSIGIAA + + + Y G ++G C +++HAVT +G+GT
Sbjct: 239 YKVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTD 296
Query: 306 EDGANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYP 346
E G YWL+KNSWG +WG+ G+MKI+RD GLC I SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|400180353|gb|AFP73315.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 344 bits (882), Expect = 4e-92, Method: Compositional matrix adjust.
Identities = 170/345 (49%), Positives = 235/345 (68%), Gaps = 13/345 (3%)
Query: 10 SFKINTIPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFK 69
+ K++ + + I + ++S + RS + SV E HE WM++HGR YKDE+EK RF
Sbjct: 2 AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSE 185
F +LS D+P++LDWR+ AVT +K Q CGCCWAFSAV ++EG KI+ NL++ SE
Sbjct: 120 FIINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISN 245
Q+L+DC+TN N GC GG M AF++II+N GI+ E +Y Y Q TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCRSQEKTAAVQISS 238
Query: 246 YEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
Y+ VP G E +LL+AV+ QPVSIGIAA + + + Y G ++G C +++HAVT +G+GT
Sbjct: 239 YQVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTD 296
Query: 306 EDGANYWLIKNSWGDTWGDAGYMKILRDE----GLCGIGTQSSYP 346
E G YWL+KNSWG +WG+ G+MKI+RD GLC I SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPAGLCDIAKMSSYP 341
>gi|400180403|gb|AFP73340.1| cysteine protease [Solanum peruvianum]
gi|400180413|gb|AFP73345.1| cysteine protease [Solanum peruvianum]
gi|400180415|gb|AFP73346.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 344 bits (882), Expect = 4e-92, Method: Compositional matrix adjust.
Identities = 170/345 (49%), Positives = 235/345 (68%), Gaps = 13/345 (3%)
Query: 10 SFKINTIPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFK 69
+ K++ + + I + ++S + RS + SV E HE WM++HGR YKDE+EK RF
Sbjct: 2 AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSE 185
FK +LS D+P++LDWR+ AVT +K Q CGCCWAFSAV ++EG KI+ NL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISN 245
Q+L+DC+TN N GC GG M AF++I +N GI+ E +Y Y Q TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQEKTAAVQISS 238
Query: 246 YEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
Y+ VP G E +LL+AV+ QPVSIGIAA + + + Y G ++G C +++HAVT +G+GT
Sbjct: 239 YQVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTD 296
Query: 306 EDGANYWLIKNSWGDTWGDAGYMKILRDE----GLCGIGTQSSYP 346
E G YWL+KNSWG +WG+ G+MKI+RD GLC I SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPAGLCDIAKMSSYP 341
>gi|350535639|ref|NP_001233949.1| phytophthora-inhibited protease 1 [Solanum lycopersicum]
gi|108937128|gb|ABG23376.1| phytophthora-inhibited protease 1 [Solanum lycopersicum]
Length = 345
Score = 344 bits (882), Expect = 5e-92, Method: Compositional matrix adjust.
Identities = 168/348 (48%), Positives = 234/348 (67%), Gaps = 11/348 (3%)
Query: 8 SGSFKINTIPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMR 67
+ +F + I + +++ ++S +V+SR+ E S++E HE WM HGR YKD++EKE R
Sbjct: 2 ASNFFLKNITVVLLLFSILSLYPFIVTSRNLKELSMLERHENWMVHHGRVYKDDIEKEHR 61
Query: 68 FKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFK 127
FK FKEN+E+IE NK G + YKL N+++DLT +EF + G S + +T++T
Sbjct: 62 FKTFKENVEFIESFNKNGTQRYKLAVNKYADLTTEEFTTSFMGLDTSLLSQQESTATTTS 121
Query: 128 YQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQ 187
++ S+T+VP S+DWR + +VT +KDQ CGCCWAFSA AA+EG +I+ LI LSEQQ
Sbjct: 122 FKYDSVTEVPNSMDWRKRGSVTGVKDQGVCGCCWAFSAAAAIEGAYQIANNELISLSEQQ 181
Query: 188 LVDCSTNGNNGCGGGTMEKAFEYIIQNQ--GIATEDEYPYQAVQGTCSAAQKAAAAKISN 245
L+DCST N GC GG M A+++++QN GI TE YPY+ Q C Q AA I+
Sbjct: 182 LLDCSTQ-NKGCEGGLMTVAYDFLLQNNGGGITTETNYPYEEAQNVCKTEQPAAVT-ING 239
Query: 246 YEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
YE VPS DE +LLKAV QP+S+GIAA EF Y GI++G C ++L+HAVT++G+GT+
Sbjct: 240 YEVVPS-DESSLLKAVVNQPISVGIAA-NDEFHMYGSGIYDGSCNSRLNHAVTVIGYGTS 297
Query: 306 -EDGANYWLIKNSWGDTWGDAGYMKILRDEGL----CGIGTQSSYPLA 348
EDG YW++KNSWG WG+ GYM+I RD G+ CGI +S+P A
Sbjct: 298 EEDGTKYWIVKNSWGSDWGEEGYMRIARDVGVDGGHCGIAKVASFPTA 345
>gi|400180407|gb|AFP73342.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 344 bits (882), Expect = 5e-92, Method: Compositional matrix adjust.
Identities = 170/345 (49%), Positives = 235/345 (68%), Gaps = 13/345 (3%)
Query: 10 SFKINTIPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFK 69
+ K++ + + I + ++S + RS + SV E HE WM++HGR YKDE+EK RF
Sbjct: 2 AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKVERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSE 185
FK +LS D+P++LDWR+ AVT +K Q CGCCWAFSAV ++EG KI+ NL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISN 245
Q+L+DC+TN N GC GG M AF++I +N GI+ E +Y Y Q TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQEKTAAVQISS 238
Query: 246 YEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
Y+ VP G E +LL+AV+ QPVSIGIAA + + + Y G ++G C +++HAVT +G+GT
Sbjct: 239 YQVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTD 296
Query: 306 EDGANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYP 346
E G YWL+KNSWG +WG+ G+MKI+RD GLC I SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|400180455|gb|AFP73364.1| cysteine protease [Solanum peruvianum]
gi|400180459|gb|AFP73366.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 343 bits (881), Expect = 5e-92, Method: Compositional matrix adjust.
Identities = 170/345 (49%), Positives = 235/345 (68%), Gaps = 13/345 (3%)
Query: 10 SFKINTIPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFK 69
+ K++ + + I + ++S + RS + SV E HE WM++HGR YKDE+EK RF
Sbjct: 2 AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSE 185
FK +LS D+P++LDWR+ AVT +K Q CGCCWAFSAV ++EG KI+ NL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISN 245
Q+L+DC+TN N GC GG M AF++I +N GI+ E +Y Y Q TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQEKTAAVQISS 238
Query: 246 YEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
Y+ VP G E +LL+AV+ QPVSIGIAA + + + Y G ++G C +++HAVT +G+GT
Sbjct: 239 YQVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTD 296
Query: 306 EDGANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYP 346
E G YWL+KNSWG +WG+ G+MKI+RD GLC I SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGENGFMKIIRDSGDPSGLCDIAKMSSYP 341
>gi|400180383|gb|AFP73330.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 343 bits (881), Expect = 6e-92, Method: Compositional matrix adjust.
Identities = 170/345 (49%), Positives = 235/345 (68%), Gaps = 13/345 (3%)
Query: 10 SFKINTIPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFK 69
+ K++ + + I + ++S + RS + SV E HE WM++HGR YKDE+EK RF
Sbjct: 2 AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSE 185
F +LS D+P++LDWR+ AVT +K Q CGCCWAFSAV ++EG KI+ NL++ SE
Sbjct: 120 FIINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISN 245
Q+L+DC+TN N GC GG M AF++II+N GI+ E +Y Y Q TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQYTCRSQEKTAAVQISS 238
Query: 246 YEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
Y+ VP G E +LL+AV+ QPVSIGIAA + + + Y G ++G C +++HAVT +G+GT
Sbjct: 239 YKVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTD 296
Query: 306 EDGANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYP 346
E G YWL+KNSWG +WG+ G+MKI+RD GLC I SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|400180351|gb|AFP73314.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 343 bits (881), Expect = 6e-92, Method: Compositional matrix adjust.
Identities = 170/345 (49%), Positives = 235/345 (68%), Gaps = 13/345 (3%)
Query: 10 SFKINTIPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFK 69
+ K++ + + I + ++S + RS + SV E HE WM++HGR YKDE+EK RF
Sbjct: 2 AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSE 185
FK +LS D+P++LDWR+ AVT +K Q CGCCWAFSAV ++EG KI+ NL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISN 245
Q+L+DC+TN N GC GG M AF++I +N GI+ E +Y Y Q TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQEKTAAVQISS 238
Query: 246 YEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
Y+ VP G E +LL+AV+ QPVSIGIAA + + + Y G ++G C +++HAVT +G+GT
Sbjct: 239 YQVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTD 296
Query: 306 EDGANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYP 346
E G YWL+KNSWG +WG+ G+MKI+RD GLC I SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGENGFMKIIRDYGNPAGLCDIAKMSSYP 341
>gi|400180457|gb|AFP73365.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 343 bits (880), Expect = 6e-92, Method: Compositional matrix adjust.
Identities = 170/345 (49%), Positives = 235/345 (68%), Gaps = 13/345 (3%)
Query: 10 SFKINTIPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFK 69
+ K++ + + I + ++S + RS + SV E HE WM++HGR YKDE+EK RF
Sbjct: 2 AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSE 185
F +LS D+P++LDWR+ AVT +K Q CGCCWAFSAV ++EG KI+ NL++ SE
Sbjct: 120 FIINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISN 245
Q+L+DC+TN N GC GG M AF++II+N GI+ E +Y Y Q TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCRSQEKTAAVQISS 238
Query: 246 YEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
Y+ VP G E +LL+AV+ QPVSIGIAA + + + Y G ++G C +++HAVT +G+GT
Sbjct: 239 YKVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTD 296
Query: 306 EDGANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYP 346
E G YWL+KNSWG +WG+ G+MKI+RD GLC I SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|255580657|ref|XP_002531151.1| cysteine protease, putative [Ricinus communis]
gi|223529264|gb|EEF31236.1| cysteine protease, putative [Ricinus communis]
Length = 340
Score = 343 bits (880), Expect = 6e-92, Method: Compositional matrix adjust.
Identities = 174/335 (51%), Positives = 235/335 (70%), Gaps = 15/335 (4%)
Query: 21 IIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEK 80
+I LL + SQ ++ R+ + S+ E HE+WM++ GR Y D EKE+R+KIFKEN++ IE
Sbjct: 14 LIFLLGALVSQAMA-RTLQDASMHEKHEEWMSRFGRVYNDGNEKEIRYKIFKENVQRIES 72
Query: 81 ANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHR-STTSSTFKYQNLSMTDVPTS 139
NK ++YKLG N+F+DLTN+EF+ +K H S+ + F+Y+NL T P+S
Sbjct: 73 FNKASGKSYKLGINQFADLTNEEFKTSRNRFK----GHMCSSQAGPFRYENL--TAAPSS 126
Query: 140 LDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNG-NNG 198
+DWR K AVT IKDQ +CG CWAFSAVAAVEGIT+++ + LI LSEQ+LVDC T G + G
Sbjct: 127 MDWRKKGAVTAIKDQGQCGSCWAFSAVAAVEGITQLATSKLISLSEQELVDCDTKGEDQG 186
Query: 199 CGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKA-AAAKISNYEEVPSGDEQAL 257
C GG M+ AF++I QNQG+ TE YPY+ GTC+ Q+A AAKI+ +E+VP+ +E AL
Sbjct: 187 CQGGLMDDAFKFIEQNQGLTTEANYPYEGSDGTCNTKQEANHAAKINGFEDVPANNEGAL 246
Query: 258 LKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNS 317
+KAV+ QPVS+ I A F+ Y GIF G CGT+LDH V VG+G + +G NYWL+KNS
Sbjct: 247 MKAVAKQPVSVAIDAGGFGFQFYSSGIFTGDCGTELDHGVAAVGYGES-NGMNYWLVKNS 305
Query: 318 WGDTWGDAGYMKILRD----EGLCGIGTQSSYPLA 348
WG WG+ GY+++ +D EGLCGI Q+SYP A
Sbjct: 306 WGTQWGEEGYIRMQKDIDAKEGLCGIAMQASYPTA 340
>gi|400180369|gb|AFP73323.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 343 bits (880), Expect = 7e-92, Method: Compositional matrix adjust.
Identities = 170/345 (49%), Positives = 235/345 (68%), Gaps = 13/345 (3%)
Query: 10 SFKINTIPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFK 69
+ K++ + + I + ++S + RS + SV E HE WM++HGR YKDE+EK RF
Sbjct: 2 AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSE 185
FK +LS D+P++LDWR+ AVT +K Q CGCCWAFSAV ++EG KI+ NL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISN 245
Q+L+DC+TN N GC GG M AF++I +N GI+ E +Y Y Q TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGQQYTCRSQEKTAAVQISS 238
Query: 246 YEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
Y+ VP G E +LL+AV+ QPVSIGIAA + + + Y G ++G C +++HAVT +G+GT
Sbjct: 239 YKVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTD 296
Query: 306 EDGANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYP 346
E G YWL+KNSWG +WG+ G+MKI+RD GLC I SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|357160300|ref|XP_003578721.1| PREDICTED: oryzain beta chain-like [Brachypodium distachyon]
Length = 349
Score = 343 bits (879), Expect = 8e-92, Method: Compositional matrix adjust.
Identities = 163/345 (47%), Positives = 230/345 (66%), Gaps = 14/345 (4%)
Query: 16 IPMFIIIILLVSC---ASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFK 72
IP ++ +++ C S V+S+R + ++VE HE+WMAQHGR YKD EK RF+ F+
Sbjct: 3 IPKVFLLAVVLGCICLCSTVLSARELGDAAMVERHEQWMAQHGRVYKDGAEKARRFEAFR 62
Query: 73 ENLEYIEKANKEGNR-TYKLGTNRFSDLTNDEFRALYT--GY--KMPSPSHRSTTSSTFK 127
N+ +IE N GNR + LG N+F+DLTNDEFRA T G+ + + ++++ + TF+
Sbjct: 63 NNVVFIESFNAAGNRRKFWLGVNQFTDLTNDEFRATKTNKGFIKRNAAAVNKASPTGTFR 122
Query: 128 YQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQ 187
Y N+S +P ++DWR K AVTPIK+Q +CGCCWAFSAVAA EGI ++S L+ LSEQ+
Sbjct: 123 YSNVSADALPAAVDWRAKGAVTPIKNQGQCGCCWAFSAVAATEGIVQLSTGKLVPLSEQE 182
Query: 188 LVDCSTNG-NNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKA-AAAKISN 245
LVDC NG ++GC GG M+ AFE+II+N G+ +E YPY A G C A + A I
Sbjct: 183 LVDCDANGADHGCEGGEMDDAFEFIIKNGGLTSETNYPYTAQDGQCKAKNTINSVATIKG 242
Query: 246 YEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
YE+VP+ DE +L+KAV+ QPVS+ + F+ Y G+ +G CGT LDH + VG+G
Sbjct: 243 YEDVPANDEASLMKAVAAQPVSVAVDGGDMVFQHYAGGVLSGSCGTSLDHGIVAVGYGAA 302
Query: 306 EDGANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYP 346
+DG +WL+KNSWG TWG+ GY+++ +D G+CG+ Q SYP
Sbjct: 303 DDGTKFWLMKNSWGTTWGEDGYIRMEKDVADAGGMCGLAMQPSYP 347
>gi|195628596|gb|ACG36128.1| vignain precursor [Zea mays]
Length = 362
Score = 343 bits (879), Expect = 8e-92, Method: Compositional matrix adjust.
Identities = 160/350 (45%), Positives = 232/350 (66%), Gaps = 18/350 (5%)
Query: 13 INTIPMFIIIILLVSCASQVVSS----------RST--HEQSVVEMHEKWMAQHGRSYKD 60
+ T+ + + +I + C Q + R+T E ++ ++KWMAQ+ R YKD
Sbjct: 13 MTTLMLLLCVIAIADCICQAAVAARVEPSTTVGRTTGGDEAMMMARYKKWMAQYRRKYKD 72
Query: 61 ELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPS--PSH 118
+ EK RF++FK N E+I+++N G + Y LGTN+F+DLT+ EF A+YTG + P+ PS
Sbjct: 73 DAEKAHRFQVFKANAEFIDRSNAGGKKKYVLGTNQFADLTSKEFAAMYTGLRKPAAVPSG 132
Query: 119 RSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGA 178
+ FKYQN + D +DWR + AVTP+K+Q +CGCCWAFSAV A+EG+ I+
Sbjct: 133 AKQIPAGFKYQNFTRLDDDVQVDWRQQGAVTPVKNQGQCGCCWAFSAVGAMEGLIMITTG 192
Query: 179 NLIQLSEQQLVDCS-TNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQK 237
NL+ LSEQQ++DC ++GN GC GG M+ AF+Y++ N G+ TED YPY AVQGTC Q
Sbjct: 193 NLVSLSEQQILDCDESDGNQGCNGGYMDNAFQYVVNNGGVTTEDAYPYSAVQGTCQNVQP 252
Query: 238 AAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNG-VCGTQLDHA 296
AA IS ++++PSGDE AL AV+ QPVS+G+ ++ F+ Y+ GI++G CGT ++HA
Sbjct: 253 AAT--ISGFQDLPSGDENALANAVANQPVSVGVDGGSSPFQFYQGGIYDGDGCGTDMNHA 310
Query: 297 VTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRDEGLCGIGTQSSYP 346
VT +G+G + G YW++KNSWG WG+ G+M++ G CGI T +SYP
Sbjct: 311 VTAIGYGADDQGTQYWILKNSWGTGWGENGFMQLQMGVGACGISTMASYP 360
>gi|400180385|gb|AFP73331.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 343 bits (879), Expect = 9e-92, Method: Compositional matrix adjust.
Identities = 170/345 (49%), Positives = 235/345 (68%), Gaps = 13/345 (3%)
Query: 10 SFKINTIPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFK 69
+ K++ + + I + ++S + RS + SV E HE WM++HGR YKDE+EK RF
Sbjct: 2 AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSE 185
FK +LS D+P++LDWR+ AVT +K Q CGCCWAFSAV ++EG KI+ NL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISN 245
Q+L+DC+TN N GC GG M AF++I +N GI+ E +Y Y Q TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQEKTAAVQISS 238
Query: 246 YEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
Y+ VP G E +LL+AV+ QPVSIGIAA + + + Y G ++G C +++HAVT +G+GT
Sbjct: 239 YQVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTD 296
Query: 306 EDGANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYP 346
E G YWL+KNSWG +WG+ G+MKI+RD GLC I SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGENGFMKIIRDSGDPSGLCDITKMSSYP 341
>gi|357446975|ref|XP_003593763.1| Cysteine proteinase [Medicago truncatula]
gi|355482811|gb|AES64014.1| Cysteine proteinase [Medicago truncatula]
Length = 350
Score = 343 bits (879), Expect = 9e-92, Method: Compositional matrix adjust.
Identities = 175/331 (52%), Positives = 230/331 (69%), Gaps = 11/331 (3%)
Query: 22 IILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKA 81
IILL +CA +S R+ E SVVE H++WM ++ R+Y + E E R KIFKENLEYIE
Sbjct: 9 IILLWACAYPTMS-RTLTESSVVEAHQQWMMKYERTYTNSSEMEKRKKIFKENLEYIENF 67
Query: 82 NKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLD 141
N GN++YKLG NR+SDLT++EF A +TG+K+ S S NL+ DVPT+ D
Sbjct: 68 NNVGNKSYKLGLNRYSDLTSEEFIASHTGFKVSDQLSDSKMRSVAIPFNLN-DDVPTNFD 126
Query: 142 WRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGG 201
WR+K VT +K+Q++CGCCWAF+AVAAVEGI KI NLI LSEQQLVDC ++GCGG
Sbjct: 127 WREKGVVTDVKNQRQCGCCWAFTAVAAVEGIVKIKNGNLISLSEQQLVDCDRQ-SSGCGG 185
Query: 202 GTMEKAFEYIIQNQGIATEDEYPYQA--VQGTCSAAQKAAAAKISNYEEVPSGDEQALLK 259
G AF+ II+++GI ED+YPY+A VQ TC Q AA+I+ Y +VP+ DEQ LL+
Sbjct: 186 GDFVLAFDSIIKSRGIVKEDDYPYKANDVQ-TCQLGQIPGAAQINGYFKVPANDEQQLLR 244
Query: 260 AVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWG 319
AV QPVS+ I+ + +F Y G++ G CG +L+HAVTI+G+G +E G YWLIKNSWG
Sbjct: 245 AVLQQPVSVAIST-SYDFHHYMGGVYEGSCGPKLNHAVTIIGYGVSEAGKKYWLIKNSWG 303
Query: 320 DTWGDAGYMKILRDE----GLCGIGTQSSYP 346
+TWG+ GYMK+LR+ G C I ++YP
Sbjct: 304 ETWGEKGYMKVLRESSATGGQCSIAVHAAYP 334
>gi|356554921|ref|XP_003545789.1| PREDICTED: LOW QUALITY PROTEIN: thiol protease SEN102-like [Glycine
max]
Length = 439
Score = 343 bits (879), Expect = 1e-91, Method: Compositional matrix adjust.
Identities = 174/355 (49%), Positives = 239/355 (67%), Gaps = 15/355 (4%)
Query: 4 IFERSGSF--KINTIPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDE 61
IF+R + K + + + ++L + + V+ + + S+ E HE+WM +HG+ YKD
Sbjct: 90 IFKRDSTMVAKNHFYHISLAMLLCTAFLAFQVTCCTLQDASMYERHEQWMTRHGKVYKDP 149
Query: 62 LEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYK--MPSPSHR 119
E+E RF+IF EN+ Y+E N N+ YKLG N+F DLTN EF A +K M S R
Sbjct: 150 REREKRFRIFNENVNYVEAFNNAANKPYKLGINQFXDLTNQEFIAPRNRFKGHMCSSIIR 209
Query: 120 STTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGAN 179
+TT FKY+N+ T VP+++DWR AVTP+KDQ +CGCCWAFSAVAA EGI +SG
Sbjct: 210 TTT---FKYENV--TTVPSTVDWRQNGAVTPVKDQGQCGCCWAFSAVAATEGIHALSGGK 264
Query: 180 LIQLSEQQLVDCSTNG-NNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKA 238
LI LSEQ+LVDC T G + GC GG M+ A+++IIQN G+ TE YPY+ V G C+A + A
Sbjct: 265 LISLSEQELVDCDTKGVDQGCEGGLMDDAYKFIIQNHGLNTEANYPYKGVDGKCNANEAA 324
Query: 239 AAAK-ISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAV 297
A I+ YE+VP+ +E+AL KAV+ QPVS+ I A +++F+ YK G F G CGT+LDH V
Sbjct: 325 NHAATITGYEDVPANNEKALQKAVANQPVSVAIDASSSDFQFYKSGAFTGSCGTELDHGV 384
Query: 298 TIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILR----DEGLCGIGTQSSYPLA 348
T VG+G ++ G YWL+KNSWG WG+ GY+++ R +EG+CGI Q+SYP A
Sbjct: 385 TAVGYGVSDHGTKYWLVKNSWGTEWGEEGYIRMQRGVDSEEGVCGIAMQASYPTA 439
>gi|388512155|gb|AFK44139.1| unknown [Medicago truncatula]
Length = 340
Score = 342 bits (878), Expect = 1e-91, Method: Compositional matrix adjust.
Identities = 169/322 (52%), Positives = 221/322 (68%), Gaps = 12/322 (3%)
Query: 33 VSSRSTHEQ-SVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKL 91
V SR +E S+ E HE+WM+++G+ YKD +EKE RF IFK+N+E+IE N N+ YKL
Sbjct: 25 VMSRKLYESPSLQERHEQWMSEYGKLYKDAIEKEKRFMIFKDNVEFIESFNAADNKPYKL 84
Query: 92 GTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPI 151
N +DLT DEF+A GYK R +++FKY+N+ T +P ++DWR K AVTPI
Sbjct: 85 SVNHLADLTLDEFKASRNGYK---KIDREFATTSFKYENV--TAIPEAVDWRVKGAVTPI 139
Query: 152 KDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNG-NNGCGGGTMEKAFEY 210
KDQ +CG CWAFS VAA+EGI +I+ LI LSEQ+LVDC T G + GC GG ME FE+
Sbjct: 140 KDQGQCGSCWAFSTVAAIEGINQITTGKLISLSEQELVDCDTKGEDQGCEGGLMEDGFEF 199
Query: 211 IIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGI 270
II+N GI +E YPY+A G+CSAA A AKI+ YE+VP E +LLKAV+ QP+S+ I
Sbjct: 200 IIKNGGITSETNYPYKAADGSCSAATTAPVAKITGYEKVPVNSEISLLKAVANQPISVSI 259
Query: 271 AAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKI 330
A + F Y GI+ G CGT+LDH VT VG+G+ +G +YW++KNSWG WG+ GY+++
Sbjct: 260 DASDSSFMFYSSGIYTGECGTELDHGVTAVGYGSA-NGTDYWIVKNSWGTVWGEKGYIRM 318
Query: 331 LR----DEGLCGIGTQSSYPLA 348
R EGLCGI SSYP A
Sbjct: 319 QRGIADKEGLCGIAMDSSYPTA 340
>gi|400180373|gb|AFP73325.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 342 bits (878), Expect = 1e-91, Method: Compositional matrix adjust.
Identities = 169/345 (48%), Positives = 236/345 (68%), Gaps = 13/345 (3%)
Query: 10 SFKINTIPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFK 69
+ K++ + + I + ++S + RS + SV E HE WM++HG YKDE+EK RF
Sbjct: 2 AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGHVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSE 185
FK +LS D+P++LDWR+ AVT +K Q +CGCCWAFSAV ++EG KI+ NL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGQCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISN 245
Q+L+DC+TN N GC GG M AF++I +N GI++E +Y Y Q TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCDGGFMTNAFDFIKENGGISSESDYEYLGEQYTCRSQEKTAAVQISS 238
Query: 246 YEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
Y+ VP G E +LL+AV+ QPVSIGIAA + + + Y G ++G C +++HAVT +G+GT
Sbjct: 239 YQVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTD 296
Query: 306 EDGANYWLIKNSWGDTWGDAGYMKILRDE----GLCGIGTQSSYP 346
E G YWL+KNSWG +WG+ G+MKI+RD GLC I SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPAGLCDIAKMSSYP 341
>gi|242068363|ref|XP_002449458.1| hypothetical protein SORBIDRAFT_05g013840 [Sorghum bicolor]
gi|241935301|gb|EES08446.1| hypothetical protein SORBIDRAFT_05g013840 [Sorghum bicolor]
Length = 350
Score = 342 bits (878), Expect = 1e-91, Method: Compositional matrix adjust.
Identities = 164/342 (47%), Positives = 236/342 (69%), Gaps = 7/342 (2%)
Query: 10 SFKINTIPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFK 69
+F + + ++ ++V S+ E+++ H++WMA+HGR+YKDE EK RF+
Sbjct: 12 TFTAAALMILAVMTMVVEARDLSTSTGGYGEEAMKVRHQQWMAEHGRTYKDEAEKARRFQ 71
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQ 129
+FK N ++++++N G ++Y+L N F+D+TNDEF A+YTG K P P+ + FKY+
Sbjct: 72 VFKANADFVDRSNAAGGKSYELAINEFADMTNDEFVAMYTGLK-PVPAGPKKMAG-FKYE 129
Query: 130 NLSMTDVPT-SLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQL 188
NL+++DV ++DWR K AVT IK+Q +CGCCWAF+AVAAVE I +I+ NL+ LSEQQ+
Sbjct: 130 NLTLSDVDQQAVDWRQKGAVTGIKNQGQCGCCWAFAAVAAVESIHQITTGNLVSLSEQQV 189
Query: 189 VDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEE 248
+DC T+GNNGC GG ++ AF+YII N G+ATED YPY A QGTC ++ + A IS+Y++
Sbjct: 190 LDCDTDGNNGCNGGYIDNAFQYIISNGGLATEDAYPYAAAQGTCQSSVQ-PAVTISSYQD 248
Query: 249 VPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNG-VCGT-QLDHAVTIVGFGTTE 306
VPSGDE AL AV+ QPV++ I A+ F+ Y G+ CGT L+HAVT VG+ T E
Sbjct: 249 VPSGDEAALAAAVANQPVAVAIDAHNN-FQFYSSGVLTADTCGTPSLNHAVTAVGYSTAE 307
Query: 307 DGANYWLIKNSWGDTWGDAGYMKILRDEGLCGIGTQSSYPLA 348
DG YWL+KN WG WG+ GY+++ R CG+ Q+SYP+A
Sbjct: 308 DGTPYWLLKNQWGQNWGEGGYLRVERGTNACGVAQQASYPVA 349
>gi|400180379|gb|AFP73328.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 342 bits (877), Expect = 1e-91, Method: Compositional matrix adjust.
Identities = 169/345 (48%), Positives = 235/345 (68%), Gaps = 13/345 (3%)
Query: 10 SFKINTIPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFK 69
+ K++ + + I + +++ + RS + SV E HE WM++HGR YKDE+EK RF
Sbjct: 2 AMKVDLMNILITLFFVITMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSE 185
FK +LS D+P++LDWR+ AVT +K Q CGCCWAFSAV ++EG KI+ NL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISN 245
Q+L+DC+TN N GC GG M AF++I +N GI+ E +Y Y Q TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCDGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQEKTAAVQISS 238
Query: 246 YEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
Y+ VP G E +LL+AV+ QPVSIGIAA + + + Y G ++G C +++HAVT +G+GT
Sbjct: 239 YQVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTD 296
Query: 306 EDGANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYP 346
E G YWL+KNSWG +WG+ G+MKI+RD GLC I SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|400180417|gb|AFP73347.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 342 bits (877), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 168/343 (48%), Positives = 235/343 (68%), Gaps = 9/343 (2%)
Query: 10 SFKINTIPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFK 69
+ K++ + + I + ++S + RS + SV E HE WM++HGR YKDE+EK RF
Sbjct: 2 AMKVDLMNILITVFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPS--PSHRSTTSSTFK 127
IFKEN+++IE NK GN +YKLG N F+D+T+ EF A +TG +P+ S +S+ FK
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPLSSTEFK 121
Query: 128 YQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQ 187
+LS D+P++LDWR+ AVT +K Q CGCCWAFSAV ++EG KI+ NL++ SEQ+
Sbjct: 122 INDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQE 181
Query: 188 LVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYE 247
L+DC+TN N GC GG M AF++II+N GI+ E +Y Y Q TC + +K AA +IS+Y+
Sbjct: 182 LLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQYTCRSQEKTAAVQISSYK 240
Query: 248 EVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTED 307
VP G E +LL+AV+ QPVSIGIAA + + + Y G ++G C +++HAVT +G+GT E
Sbjct: 241 VVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTDEK 298
Query: 308 GANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYP 346
G YWL+KNSWG +WG+ G+MKI+RD GLC I SSYP
Sbjct: 299 GQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|310656789|gb|ADP02218.1| Peptidase_C1 domain-containing protein [Triticum aestivum]
Length = 341
Score = 342 bits (877), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 162/335 (48%), Positives = 221/335 (65%), Gaps = 9/335 (2%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEY 77
+ I+ + C+S V+S+R + ++VE HE+WMA+ R YKD EK RF++FK N+ +
Sbjct: 8 LLAIVGCICLCSSAVLSARELGDTAMVERHEQWMAKFNRVYKDGTEKAQRFEVFKANVAF 67
Query: 78 IEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
IE N E NR + LG N+F+DLTNDEFRA T + R+ T FKY N+S+ +P
Sbjct: 68 IESFNAE-NRKFWLGVNQFTDLTNDEFRATKTNKGLKMSGGRAPTG--FKYSNVSIDALP 124
Query: 138 TSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNG-N 196
T++DWR K VTPIKDQ +CGCCWAFSAV A EGI K+S LI LSEQ+LVDC +G +
Sbjct: 125 TAVDWRTKGVVTPIKDQGQCGCCWAFSAVVATEGIVKLSTGKLISLSEQELVDCDVHGVD 184
Query: 197 NGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTC-SAAQKAAAAKISNYEEVPSGDEQ 255
GC GG M+ AF++II+N G+ TE YPY A G C ++ + A I YE+VP+ DE
Sbjct: 185 QGCEGGEMDDAFKFIIKNGGLTTEANYPYTAQDGQCKTSIASNSVATIKGYEDVPANDES 244
Query: 256 ALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIK 315
+L+KAV+ QPVS+ + F+ Y G+ G CGT LDH + +G+G T DG YWL+K
Sbjct: 245 SLMKAVANQPVSVAVDGGDVIFQHYSGGVMTGSCGTDLDHGIAAIGYGMTSDGTKYWLLK 304
Query: 316 NSWGDTWGDAGYMKILRD----EGLCGIGTQSSYP 346
NSWG TWG++GY+++ +D G+CG+ Q SYP
Sbjct: 305 NSWGTTWGESGYLRMEKDISDKSGMCGLAMQPSYP 339
>gi|413953667|gb|AFW86316.1| hypothetical protein ZEAMMB73_635707 [Zea mays]
Length = 340
Score = 341 bits (875), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 160/339 (47%), Positives = 223/339 (65%), Gaps = 10/339 (2%)
Query: 15 TIPMFIIIILLVS--CASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFK 72
T+ I+ IL + C + + + + + ++V HE+WMAQ+ R YKD EK RF++FK
Sbjct: 3 TLKASILAILGFAFFCGAALAARDLSDDSAMVARHEQWMAQYSRVYKDASEKARRFEVFK 62
Query: 73 ENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
N+++IE N GN + LG N+F+DLTNDEFR++ T S + + T F+Y+N+S
Sbjct: 63 ANVKFIESFNAGGNNKFWLGVNQFADLTNDEFRSIKTNKGFKSSNMKIPTG--FRYENVS 120
Query: 133 MTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCS 192
+ +PT++DWR K AVTPIKDQ +CGCCWAFSAVAA EGI KIS L+ L+EQ+LVDC
Sbjct: 121 VDALPTTIDWRTKGAVTPIKDQGQCGCCWAFSAVAATEGIVKISTGKLVSLAEQELVDCD 180
Query: 193 TNG-NNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPS 251
+G + GC GG M+ AF++II N G+ TE YPY A G C + +AA I YE+VP+
Sbjct: 181 VHGEDQGCEGGLMDDAFKFIINNGGLTTESSYPYTAADGKCKSGSNSAAT-IKGYEDVPA 239
Query: 252 GDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANY 311
DE AL+KAV+ QPVS+ + F+ Y G+ G CGT LDH + +G+G T DG Y
Sbjct: 240 NDEAALMKAVANQPVSVAVDGGDMTFQFYSSGVMTGSCGTDLDHGIAAIGYGKTSDGTKY 299
Query: 312 WLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYP 346
WL+KNSWG TWG+ GY+++ +D G+CG+ + SYP
Sbjct: 300 WLMKNSWGTTWGENGYLRMEKDISDKRGMCGLAMEPSYP 338
>gi|400180363|gb|AFP73320.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 341 bits (875), Expect = 3e-91, Method: Compositional matrix adjust.
Identities = 169/345 (48%), Positives = 234/345 (67%), Gaps = 13/345 (3%)
Query: 10 SFKINTIPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFK 69
+ K++ + + I + ++S + RS + SV E HE WM++HGR YKDE+EK RF
Sbjct: 2 AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSE 185
K +LS D+P++LDWR+ AVT +K Q CGCCWAFSAV ++EG KI+ NL++ SE
Sbjct: 120 LKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISN 245
Q+L+DC+TN N GC GG M AF++I +N GI+ E +Y Y Q TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQEKTAAVQISS 238
Query: 246 YEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
Y+ VP G E +LL+AV+ QPVSIGIAA + + + Y G ++G C +++HAVT +G+GT
Sbjct: 239 YQVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTD 296
Query: 306 EDGANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYP 346
E G YWL+KNSWG +WG+ G+MKI+RD GLC I SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGENGFMKIIRDYGNPAGLCDIAKMSSYP 341
>gi|242072398|ref|XP_002446135.1| hypothetical protein SORBIDRAFT_06g002170 [Sorghum bicolor]
gi|241937318|gb|EES10463.1| hypothetical protein SORBIDRAFT_06g002170 [Sorghum bicolor]
Length = 338
Score = 341 bits (875), Expect = 3e-91, Method: Compositional matrix adjust.
Identities = 162/334 (48%), Positives = 227/334 (67%), Gaps = 10/334 (2%)
Query: 19 FIIIIL-LVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEY 77
F++ IL S S V+++R + ++VE HE WM ++GR YKD EK RF++FK+N+ +
Sbjct: 7 FLLAILGCASLCSSVLAARELSDAAMVERHENWMVEYGRVYKDAAEKARRFEVFKDNVAF 66
Query: 78 IEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
+E N N + LG N+F+DLT +EF+A G+K S TT FKY+NLS++ +P
Sbjct: 67 VESFNTNKNNKFWLGINQFADLTIEEFKA-NKGFKPISAEKVPTTG--FKYENLSVSALP 123
Query: 138 TSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNG-N 196
T++DWR K AVTPIK+Q +CGCCWAFSAVAA+EGI K+S NLI LSEQ+LVDC T+ +
Sbjct: 124 TAVDWRTKGAVTPIKNQGQCGCCWAFSAVAAMEGIVKLSTGNLISLSEQELVDCDTHSMD 183
Query: 197 NGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQA 256
GC GG M+ AFE++I+N G+AT YPY+AV G C K+AA I +E+VP DE A
Sbjct: 184 EGCEGGWMDSAFEFVIKNGGLATVSSYPYKAVDGKCKGGSKSAAT-IKGHEDVPVNDEAA 242
Query: 257 LLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKN 316
L+KAV+ QPVS+ + A F Y G+ G CGT+LDH + +G+G DG YW++KN
Sbjct: 243 LMKAVANQPVSVAVDASDRTFMLYSGGVMTGSCGTELDHGIAAIGYGVESDGTKYWILKN 302
Query: 317 SWGDTWGDAGYMKILRD----EGLCGIGTQSSYP 346
SWG TWG+ G++++ +D +G+CG+ + SYP
Sbjct: 303 SWGTTWGEKGFLRMEKDISDKQGMCGLAMKPSYP 336
>gi|400180349|gb|AFP73313.1| cysteine protease [Solanum peruvianum]
gi|400180469|gb|AFP73371.1| cysteine protease [Solanum peruvianum]
gi|400180471|gb|AFP73372.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 341 bits (875), Expect = 3e-91, Method: Compositional matrix adjust.
Identities = 169/345 (48%), Positives = 234/345 (67%), Gaps = 13/345 (3%)
Query: 10 SFKINTIPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFK 69
+ K++ + + I + ++S + RS + SV E HE WM++HGR YKDE+EK RF
Sbjct: 2 AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKVERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSE 185
K +LS D+P++LDWR+ AVT +K Q CGCCWAFSAV ++EG KI+ NL++ SE
Sbjct: 120 LKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISN 245
Q+L+DC+TN N GC GG M AF++I +N GI+ E +Y Y Q TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQEKTAAVQISS 238
Query: 246 YEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
Y+ VP G E +LL+AV+ QPVSIGIAA + + + Y G ++G C +++HAVT +G+GT
Sbjct: 239 YQVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTD 296
Query: 306 EDGANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYP 346
E G YWL+KNSWG +WG+ G+MKI+RD GLC I SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGENGFMKIIRDYGNPAGLCDIAKMSSYP 341
>gi|400180447|gb|AFP73360.1| cysteine protease [Solanum chilense]
Length = 345
Score = 341 bits (875), Expect = 3e-91, Method: Compositional matrix adjust.
Identities = 169/345 (48%), Positives = 235/345 (68%), Gaps = 12/345 (3%)
Query: 10 SFKINTIPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFK 69
+ K++ + + I + ++S + RS + SV E HE WM++HGR YKDE+EK RF
Sbjct: 2 AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFK 121
Query: 126 FKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSE 185
K +LS D+P++LDWR+ AVT +K Q +CGCCWAFSAV ++EG KI+ L++ SE
Sbjct: 122 -KINDLSDDDMPSNLDWRESGAVTQVKHQGQCGCCWAFSAVGSLEGAYKIATGKLMEFSE 180
Query: 186 QQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISN 245
Q+L+DC+TN N GC GG M AF++II+N GI+ E +Y Y Q TC + +K AA +IS+
Sbjct: 181 QELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCRSQEKTAAVQISS 239
Query: 246 YEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
Y+ VP G E +LL+AV+ QPVSIGIAA + + + Y G ++G C +++HAVT +G+GT
Sbjct: 240 YQVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTD 297
Query: 306 EDGANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYP 346
E G YWL+KNSWG +WG+ G+MKI+RD GLC I SSYP
Sbjct: 298 EKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 342
>gi|20334377|gb|AAM19209.1|AF493234_1 cysteine protease [Solanum lycopersicum]
gi|400180431|gb|AFP73353.1| cysteine protease [Solanum lycopersicum]
Length = 345
Score = 341 bits (874), Expect = 3e-91, Method: Compositional matrix adjust.
Identities = 170/345 (49%), Positives = 235/345 (68%), Gaps = 12/345 (3%)
Query: 10 SFKINTIPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFK 69
+ K++ + + I + ++S + RS + SV E HE WM++HGR YKDE+EK RF
Sbjct: 2 AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFK 121
Query: 126 FKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSE 185
K +LS +P++LDWR+ AVT +K Q CGCCWAFSAV ++EG KI+ NL++ SE
Sbjct: 122 -KINDLSDDYMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 180
Query: 186 QQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISN 245
Q+L+DC+TN N GC GG M AF++II+N GI+ E +Y Y Q TC + +K AA +IS+
Sbjct: 181 QELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQYTCRSQEKTAAVQISS 239
Query: 246 YEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
Y+ VP G E +LL+AV+ QPVSIGIAA + + + Y G ++G C +++HAVT +G+GT
Sbjct: 240 YQVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGNCADRINHAVTAIGYGTD 297
Query: 306 EDGANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYP 346
E+G YWL+KNSWG +WG+ GYMKI+RD GLC I SSYP
Sbjct: 298 EEGQKYWLLKNSWGTSWGENGYMKIIRDSGDPSGLCDIAKMSSYP 342
>gi|400180371|gb|AFP73324.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 341 bits (874), Expect = 3e-91, Method: Compositional matrix adjust.
Identities = 169/345 (48%), Positives = 234/345 (67%), Gaps = 13/345 (3%)
Query: 10 SFKINTIPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFK 69
+ K++ + + I + ++S + RS + SV E HE WM++HGR YKDE+EK RF
Sbjct: 2 AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSE 185
F +LS D+P++LDWR+ AVT +K Q CGCCWAFSAV ++EG KI+ NL++ SE
Sbjct: 120 FIINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISN 245
Q+L+DC+TN N GC GG M AF++I +N GI+ E +Y Y Q TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQEKTAAVQISS 238
Query: 246 YEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
Y+ VP G E +LL+AV+ QPVSIGIAA + + + Y G ++G C +++HAVT +G+GT
Sbjct: 239 YQVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTD 296
Query: 306 EDGANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYP 346
E G YWL+KNSWG +WG+ G+MKI+RD GLC I SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|400180399|gb|AFP73338.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 341 bits (874), Expect = 3e-91, Method: Compositional matrix adjust.
Identities = 169/345 (48%), Positives = 234/345 (67%), Gaps = 13/345 (3%)
Query: 10 SFKINTIPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFK 69
+ K++ + + I + ++S + RS + SV E HE WM++HGR YKDE+EK RF
Sbjct: 2 AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSE 185
F +LS D+P++LDWR+ AVT +K Q CGCCWAFSAV ++EG KI+ NL++ SE
Sbjct: 120 FIINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISN 245
Q+L+DC+TN N GC GG M AF++I +N GI+ E +Y Y Q TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQEKTAAVQISS 238
Query: 246 YEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
Y+ VP G E +LL+AV+ QPVSIGIAA + + + Y G ++G C +++HAVT +G+GT
Sbjct: 239 YQVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTD 296
Query: 306 EDGANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYP 346
E G YWL+KNSWG +WG+ G+MKI+RD GLC I SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGENGFMKIIRDSGDPSGLCDIAKMSSYP 341
>gi|400180391|gb|AFP73334.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 341 bits (874), Expect = 3e-91, Method: Compositional matrix adjust.
Identities = 169/345 (48%), Positives = 235/345 (68%), Gaps = 13/345 (3%)
Query: 10 SFKINTIPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFK 69
+ K++ + + I + ++S + +RS + SV E HE WM++HGR YKDE+EK RF
Sbjct: 2 AMKVDLMSILITLFFVISMFNSQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSE 185
FK +LS D+P++LDWR+ AVT +K Q CGCCWAFSAV ++EG KI+ NL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISN 245
Q+L+DC+TN N GC GG M AF++I +N GI+ E +Y Y Q TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQEKTAAVQISS 238
Query: 246 YEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
Y+ VP G E +LL+AV+ QPVSIGIAA + + + G ++G C +++HAVT +G+GT
Sbjct: 239 YQVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFCAGGTYDGSCADRINHAVTAIGYGTD 296
Query: 306 EDGANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYP 346
E G YWL+KNSWG +WG+ G+MKI+RD GLC I SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGENGFMKIIRDYGNPAGLCDIAKMSSYP 341
>gi|356515050|ref|XP_003526214.1| PREDICTED: vignain-like [Glycine max]
Length = 344
Score = 341 bits (874), Expect = 4e-91, Method: Compositional matrix adjust.
Identities = 173/339 (51%), Positives = 232/339 (68%), Gaps = 12/339 (3%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEY 77
M + + L SQV+ R H+ ++ E HE WMA++G+ YKD EKE RF+IFK+N+E+
Sbjct: 10 MLALFLFLAVGISQVMP-RKLHQTALRERHENWMAEYGKIYKDAAEKEKRFQIFKDNVEF 68
Query: 78 IEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTS-STFKYQNLSMTDV 136
IE N GN+ YKLG N +DLT +EF+ G K +T + FKY+N+ TD+
Sbjct: 69 IESFNAAGNKPYKLGVNHLADLTLEEFKDSRNGLKRTYEFSTTTFKLNGFKYENV--TDI 126
Query: 137 PTSLDWRDKKAVTPIKDQ-QECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNG 195
P ++DWR K AVTPIKDQ +CG CWAFS VAA EGI +IS L+ LSEQ+LVDC +
Sbjct: 127 PEAIDWRVKGAVTPIKDQGDQCGSCWAFSTVAATEGIYQISTGMLMSLSEQELVDCDSV- 185
Query: 196 NNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAA-AAKISNYEEVPSGDE 254
++GC GG ME FE+II+N GI++E YPY AV GTC A+++A+ AA+I YE VP+ E
Sbjct: 186 DHGCDGGLMEDGFEFIIKNGGISSEANYPYTAVDGTCDASKEASPAAQIKGYETVPANSE 245
Query: 255 QALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGAN-YWL 313
+AL +AV+ QPVS+ I A + F+ Y G+F G CGTQLDH VT+VG+GTT+DG + YW+
Sbjct: 246 EALQQAVANQPVSVSIDAGGSGFQFYSSGVFTGQCGTQLDHGVTVVGYGTTDDGTHEYWI 305
Query: 314 IKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYPLA 348
+KNSWG WG+ GY+++ R EGLCGI +SYP A
Sbjct: 306 VKNSWGTQWGEEGYIRMQRGIDALEGLCGIAMDASYPTA 344
>gi|413944253|gb|AFW76902.1| hypothetical protein ZEAMMB73_056195 [Zea mays]
Length = 340
Score = 340 bits (873), Expect = 4e-91, Method: Compositional matrix adjust.
Identities = 161/336 (47%), Positives = 220/336 (65%), Gaps = 12/336 (3%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEY 77
+ ++ C + + + + ++V HE+WMAQ+ R YKD EK RF++FK N+++
Sbjct: 8 ILAVLSFAFFCGAALAARDLNEDSAMVARHEQWMAQYSRVYKDAAEKARRFEVFKANVKF 67
Query: 78 IEKANKEGNRTYKLGTNRFSDLTNDEFRALYT--GYKMPSPSHRSTTSSTFKYQNLSMTD 135
IE N GNR + LG N+F+DLTNDEFR T G+K PS ST F+Y+N+S+
Sbjct: 68 IESFNTGGNRKFWLGINQFADLTNDEFRTTKTNKGFK-PSLDKVSTG---FRYENVSVDA 123
Query: 136 VPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNG 195
+P ++DWR AVTPIKDQ +CGCCWAFSAVAA EGI KIS LI LSEQ+LVDC +G
Sbjct: 124 IPATIDWRTNGAVTPIKDQGQCGCCWAFSAVAATEGIVKISTGKLISLSEQELVDCDVHG 183
Query: 196 -NNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDE 254
+ GC GG M+ AF++II+N G+ TE YPY A G C + +AA I YE+VP+ DE
Sbjct: 184 EDQGCEGGLMDDAFKFIIKNGGLTTESNYPYTAADGKCKSGSNSAA-NIKGYEDVPTNDE 242
Query: 255 QALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLI 314
AL+KAV+ QPVS+ + F+ Y G+ G CGT LDH + +G+G T DG YWL+
Sbjct: 243 AALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYGKTSDGTKYWLM 302
Query: 315 KNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYP 346
KNSWG TWG+ GY+++ +D +G+CG+ + SYP
Sbjct: 303 KNSWGTTWGENGYLRMEKDISDKKGMCGLAMEPSYP 338
>gi|400180461|gb|AFP73367.1| cysteine protease [Solanum peruvianum]
gi|400180473|gb|AFP73373.1| cysteine protease [Solanum peruvianum]
gi|400180475|gb|AFP73374.1| cysteine protease [Solanum peruvianum]
gi|400180479|gb|AFP73376.1| cysteine protease [Solanum peruvianum]
gi|400180481|gb|AFP73377.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 340 bits (873), Expect = 4e-91, Method: Compositional matrix adjust.
Identities = 169/345 (48%), Positives = 234/345 (67%), Gaps = 13/345 (3%)
Query: 10 SFKINTIPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFK 69
+ K++ + + I + ++S + RS + SV E HE WM++HGR YKDE+EK RF
Sbjct: 2 AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSE 185
F +LS D+P++LDWR+ AVT +K Q CGCCWAFSAV ++EG KI+ NL++ SE
Sbjct: 120 FIINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISN 245
Q+L+DC+TN N GC GG M AF++I +N GI+ E +Y Y Q TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQEKTAAVQISS 238
Query: 246 YEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
Y+ VP G E +LL+AV+ QPVSIGIAA + + + Y G ++G C +++HAVT +G+GT
Sbjct: 239 YQVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTD 296
Query: 306 EDGANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYP 346
E G YWL+KNSWG +WG+ G+MKI+RD GLC I SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGENGFMKIIRDYGNPAGLCDIAKMSSYP 341
>gi|414588010|tpg|DAA38581.1| TPA: hypothetical protein ZEAMMB73_156486 [Zea mays]
Length = 347
Score = 340 bits (873), Expect = 4e-91, Method: Compositional matrix adjust.
Identities = 167/348 (47%), Positives = 224/348 (64%), Gaps = 18/348 (5%)
Query: 16 IPMFIIIILL----VSCASQVVSSR---STHEQSVVEMHEKWMAQHGRSYKDELEKEMRF 68
IP +++ +L C++ V+++R E ++V HE+WM QHGR YKDE +K RF
Sbjct: 3 IPKALLLAILGCGVCLCSAAVLAARELGGDDELAMVARHEQWMVQHGRVYKDETDKAHRF 62
Query: 69 KIFKENLEYIEKANKE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSST 125
+FK N+++IE N GNR + LG N+F+DLTNDEFRA T + T
Sbjct: 63 LVFKANVKFIESFNAAAAAGNRKFWLGVNQFADLTNDEFRATKTNKGFNPNVVKVPTG-- 120
Query: 126 FKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSE 185
F+YQNLS+ +P ++DWR K AVTPIKDQ +CGCCWAFSAVAA EGI KIS L LSE
Sbjct: 121 FRYQNLSIDALPQTVDWRTKGAVTPIKDQGQCGCCWAFSAVAATEGIVKISTGKLTSLSE 180
Query: 186 QQLVDCSTNG-NNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKIS 244
Q+LVDC +G + GC GG M+ AF++II+N G+ TE YPY A G C + AA I
Sbjct: 181 QELVDCDVHGEDQGCNGGEMDDAFKFIIKNGGLTTESNYPYTAQDGQCKSGSNGAAT-IK 239
Query: 245 NYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGT 304
YE+VP+ DE AL+KAV+ QPVS+ + F+ Y G+ G CGT LDH + +G+G
Sbjct: 240 GYEDVPANDEAALMKAVASQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYGK 299
Query: 305 TEDGANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYPLA 348
T DG YWL+KNSWG TWG+ G++++ +D +G+CG+ Q SYP A
Sbjct: 300 TSDGTKYWLMKNSWGTTWGENGFLRMEKDIADKKGMCGLAMQPSYPTA 347
>gi|242072394|ref|XP_002446133.1| hypothetical protein SORBIDRAFT_06g002160 [Sorghum bicolor]
gi|241937316|gb|EES10461.1| hypothetical protein SORBIDRAFT_06g002160 [Sorghum bicolor]
Length = 338
Score = 340 bits (873), Expect = 4e-91, Method: Compositional matrix adjust.
Identities = 163/338 (48%), Positives = 226/338 (66%), Gaps = 10/338 (2%)
Query: 15 TIPMFIIIIL-LVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKE 73
T F++ IL S S V+++R + ++VE HE WM ++GR YKD EK RF+ FK
Sbjct: 3 TSKAFLLAILGCASLCSSVLAARELSDAAMVERHENWMVEYGRVYKDAAEKARRFEAFKH 62
Query: 74 NLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
N+ ++E N + LG N+F+DLT +EF+A G+K S TT FKY+NLS+
Sbjct: 63 NVAFVESFNTNKKNKFWLGVNQFADLTTEEFKA-NKGFKPISAEMVPTTG--FKYENLSV 119
Query: 134 TDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCST 193
+ +PT++DWR K AVTPIK+Q +CGCCWAFSAVAA+EGI K+S NLI LSEQ+LVDC T
Sbjct: 120 SALPTAVDWRTKGAVTPIKNQGQCGCCWAFSAVAAMEGIVKLSTGNLISLSEQELVDCDT 179
Query: 194 NG-NNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSG 252
+ + GC GG M+ AFE++I+N G+ATE YPY+AV G C K+AA I +E+VP
Sbjct: 180 HSMDEGCEGGWMDSAFEFVIKNGGLATESSYPYKAVDGKCKGGSKSAAT-IKGHEDVPVN 238
Query: 253 DEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYW 312
DE AL+KAV+ QPVS+ + A F Y G+ G CGT+LDH + +G+G DG YW
Sbjct: 239 DEAALMKAVANQPVSVAVDASDRTFMLYSGGVMTGSCGTELDHGIAAIGYGVESDGTKYW 298
Query: 313 LIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYP 346
++KNSWG TWG+ G++++ +D +G+CG+ + SYP
Sbjct: 299 ILKNSWGTTWGEKGFLRMEKDISDKQGMCGLAMKPSYP 336
>gi|356517350|ref|XP_003527350.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
gi|356577765|ref|XP_003556993.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 343
Score = 340 bits (873), Expect = 4e-91, Method: Compositional matrix adjust.
Identities = 170/325 (52%), Positives = 228/325 (70%), Gaps = 15/325 (4%)
Query: 33 VSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLG 92
V+SR+ + S+ E HE+WMA++G+ YKD EKE RF++FKEN+ YIE N N+ YKLG
Sbjct: 25 VASRTLQDASMYERHEQWMARYGKVYKDPEEKEKRFRVFKENVNYIEAFNNAANKPYKLG 84
Query: 93 TNRFSDLTNDEF---RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVT 149
N+F+DLT++EF R + G+ S +T ++TFKY+N+++ +P S+DWR K AVT
Sbjct: 85 INQFADLTSEEFIVPRNRFNGHTRSS----NTRTTTFKYENVTV--LPDSIDWRQKGAVT 138
Query: 150 PIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNG-NNGCGGGTMEKAF 208
PIK+Q CGCCWAFSA+AA EGI KIS L+ LSEQ++VDC T G ++GC GG M+ AF
Sbjct: 139 PIKNQGSCGCCWAFSAIAATEGIHKISTGKLVSLSEQEVVDCDTKGTDHGCEGGYMDGAF 198
Query: 209 EYIIQNQGIATEDEYPYQAVQGTCSAAQKAA-AAKISNYEEVPSGDEQALLKAVSMQPVS 267
++IIQN GI TE YPY+ V G C+ ++A AA I+ YE+VP +E+AL KAV+ QPVS
Sbjct: 199 KFIIQNHGINTEASYPYKGVDGKCNIKEEAVHAATITGYEDVPINNEKALQKAVANQPVS 258
Query: 268 IGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGY 327
+ I A +F+ YK GIF G CGT+LDH VT VG+G +G YWL+KNSWG WG+ GY
Sbjct: 259 VAIDASGADFQFYKSGIFTGSCGTELDHGVTAVGYGENNEGTKYWLVKNSWGTEWGEEGY 318
Query: 328 MKILRD----EGLCGIGTQSSYPLA 348
+ + R EG+CGI +SYP A
Sbjct: 319 IMMQRGVKAVEGICGIAMMASYPTA 343
>gi|356517426|ref|XP_003527388.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 343
Score = 340 bits (873), Expect = 5e-91, Method: Compositional matrix adjust.
Identities = 171/342 (50%), Positives = 234/342 (68%), Gaps = 16/342 (4%)
Query: 18 MFIIIILLVSCASQV---VSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKEN 74
++ I + L+ C + V+ R+ + S+ E H +WMA++ + YKD E+E RF+IFKEN
Sbjct: 7 LYHISLALLFCMGFLAFQVTCRTLQDASMYERHAQWMARYAKVYKDPQEREKRFRIFKEN 66
Query: 75 LEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYK--MPSPSHRSTTSSTFKYQNLS 132
+ YIE N N++YKL N+F+DLTN+EF A +K M S R+TT FKY+N++
Sbjct: 67 VNYIETFNSADNKSYKLDINQFADLTNEEFIAPRNRFKGHMCSSITRTTT---FKYENVT 123
Query: 133 MTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCS 192
+ +P+++DWR K AVTPIKDQ +CGCCWAFSAVAA EGI ++ LI LSEQ++VDC
Sbjct: 124 V--IPSTVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALNAGKLISLSEQEVVDCD 181
Query: 193 TNGNN-GCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAK-ISNYEEVP 250
T G + GC GG M+ AF++IIQN G+ TE YPY+A G C+A A A I+ YE+VP
Sbjct: 182 TKGQDQGCAGGFMDGAFKFIIQNHGLNTEPNYPYKAADGKCNAKAAANHAATITGYEDVP 241
Query: 251 SGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGAN 310
+E+AL KAV+ QPVS+ I A ++F+ YK G+F G CGT+LDH VT VG+G + DG
Sbjct: 242 VNNEKALQKAVANQPVSVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVSADGTE 301
Query: 311 YWLIKNSWGDTWGDAGYMKILR----DEGLCGIGTQSSYPLA 348
YWL+KNSWG WG+ GY+++ R +EGLCGI +SYP A
Sbjct: 302 YWLVKNSWGTEWGEEGYIRMQRGVKAEEGLCGIAMMASYPTA 343
>gi|400180465|gb|AFP73369.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 340 bits (873), Expect = 5e-91, Method: Compositional matrix adjust.
Identities = 169/345 (48%), Positives = 234/345 (67%), Gaps = 13/345 (3%)
Query: 10 SFKINTIPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFK 69
+ K++ + + I + ++S + RS + SV E HE WM++HGR YKDE+EK RF
Sbjct: 2 AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSE 185
FK +LS D+P++LDWR+ AVT +K Q CGCCWAFSAV ++E KI+ NL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEVAYKIATGNLMEFSE 179
Query: 186 QQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISN 245
Q+L+DC+TN N GC GG M AF++I +N GI+ E +Y Y Q TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQEKTAAVQISS 238
Query: 246 YEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
Y+ VP G E +LL+AV+ QPVSIGIAA + + + Y G ++G C +++HAVT +G+GT
Sbjct: 239 YQVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTD 296
Query: 306 EDGANYWLIKNSWGDTWGDAGYMKILRDE----GLCGIGTQSSYP 346
E G YWL+KNSWG +WG+ G+MKI+RD GLC I SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPAGLCDIAKMSSYP 341
>gi|400180361|gb|AFP73319.1| cysteine protease [Solanum peruvianum]
gi|400180397|gb|AFP73337.1| cysteine protease [Solanum peruvianum]
gi|400180401|gb|AFP73339.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 340 bits (872), Expect = 6e-91, Method: Compositional matrix adjust.
Identities = 169/345 (48%), Positives = 234/345 (67%), Gaps = 13/345 (3%)
Query: 10 SFKINTIPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFK 69
+ K++ + + I + ++S + RS + SV E HE WM++HGR YKDE+EK RF
Sbjct: 2 AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSE 185
F +LS D+P++LDWR+ AVT +K Q CGCCWAFSAV ++EG KI+ NL++ SE
Sbjct: 120 FIINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISN 245
Q+L+DC+TN N GC GG M AF++I +N GI+ E +Y Y Q TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQEKTAAVQISS 238
Query: 246 YEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
Y+ VP G E +LL+AV+ QPVSIGIAA + + + Y G ++G C +++HAVT +G+GT
Sbjct: 239 YQVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTD 296
Query: 306 EDGANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYP 346
E G YWL+KNSWG +WG+ G+MKI+RD GLC I SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGENGFMKIIRDSGDPSGLCDITKMSSYP 341
>gi|356543122|ref|XP_003540012.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 342
Score = 340 bits (872), Expect = 6e-91, Method: Compositional matrix adjust.
Identities = 172/330 (52%), Positives = 223/330 (67%), Gaps = 16/330 (4%)
Query: 28 CASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNR 87
C SQV SR H+ S+ E HE+WM ++G+ YKD E E RF IF+ N+E+IE N GN+
Sbjct: 20 CTSQV-KSRKLHDASMYERHEQWMEKYGKVYKDSAEXEKRFLIFENNVEFIESFNAAGNK 78
Query: 88 TYKLGTNRFSDLTNDEFRALYTGYKMPSPSH----RSTTSSTFKYQNLSMTDVPTSLDWR 143
YKL N +D TN+EF A + GYK SH R TT + FKY+N+ TD+P ++DWR
Sbjct: 79 PYKLSINHLADQTNEEFMASHKGYK---GSHWQGLRITTQTPFKYENV--TDIPWAVDWR 133
Query: 144 DKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGT 203
K T IKDQ +CG CWAFSAVAA EGI +I+ NL+ LSEQ+LVDC + ++GC GG
Sbjct: 134 QKGDATSIKDQGQCGICWAFSAVAATEGIYQITTGNLVSLSEQELVDCDSV-DHGCDGGL 192
Query: 204 MEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAA-AAKISNYEEVPSGDEQALLKAVS 262
ME FE+II+N GI++E YPY AV GTC ++A+ A+I YE VP E+ L KAV+
Sbjct: 193 MEHGFEFIIKNGGISSEANYPYTAVNGTCDTNKEASPGAQIKGYETVPVNCEEELQKAVA 252
Query: 263 MQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTW 322
QPVS+ I A + F+ Y G+F G CGTQLDH VT VG+G+T+DG YW++KNSWG W
Sbjct: 253 NQPVSVSIDAGGSAFQFYSSGVFTGQCGTQLDHGVTAVGYGSTDDGIQYWIVKNSWGTQW 312
Query: 323 GDAGYMKILR----DEGLCGIGTQSSYPLA 348
G+ GY+++LR EGLCGI +SYP A
Sbjct: 313 GEEGYIRMLRGIDAQEGLCGIAMDASYPTA 342
>gi|357458911|ref|XP_003599736.1| Cysteine proteinase [Medicago truncatula]
gi|357474719|ref|XP_003607644.1| Cysteine proteinase [Medicago truncatula]
gi|355488784|gb|AES69987.1| Cysteine proteinase [Medicago truncatula]
gi|355508699|gb|AES89841.1| Cysteine proteinase [Medicago truncatula]
Length = 340
Score = 340 bits (872), Expect = 6e-91, Method: Compositional matrix adjust.
Identities = 166/322 (51%), Positives = 219/322 (68%), Gaps = 12/322 (3%)
Query: 33 VSSRSTHEQ-SVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKL 91
V SR +E S+ E HE+WM +HG+ Y+D +EKE RF IFK+N+E+IE N N+ YKL
Sbjct: 25 VMSRKLYESLSLQERHEQWMTEHGKVYEDAIEKEKRFMIFKDNVEFIESFNAADNQPYKL 84
Query: 92 GTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPI 151
N +DLT DEF+A GYK R T+++FKY+N+ T +P ++DWR K AVTPI
Sbjct: 85 SVNHLADLTLDEFKASRNGYK---KIDREFTTTSFKYENV--TAIPAAVDWRVKGAVTPI 139
Query: 152 KDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNG-NNGCGGGTMEKAFEY 210
KDQ +CG CWAFS VAA EGI +I+ L+ LSEQ+LVDC T G + GC GG ME FE+
Sbjct: 140 KDQGQCGSCWAFSTVAATEGINQITTGKLVSLSEQELVDCDTKGEDQGCEGGLMEDGFEF 199
Query: 211 IIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGI 270
II+N GI +E YPY+A G+C+ A AKI+ YE+VP E++LLKAV+ QP+S+ I
Sbjct: 200 IIKNGGITSETNYPYKAADGSCNTATTTPVAKITGYEKVPVNSEKSLLKAVANQPISVSI 259
Query: 271 AAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKI 330
A + F Y GI+ G CGT+LDH VT VG+G+ +G +YW++KNSWG WG+ GY+++
Sbjct: 260 DASDSSFMFYSSGIYTGECGTELDHGVTAVGYGSA-NGTDYWIVKNSWGTVWGEKGYIRM 318
Query: 331 LR----DEGLCGIGTQSSYPLA 348
R EGLCGI SSYP A
Sbjct: 319 QRGIAAKEGLCGIAMDSSYPTA 340
>gi|388497270|gb|AFK36701.1| unknown [Lotus japonicus]
Length = 343
Score = 340 bits (871), Expect = 7e-91, Method: Compositional matrix adjust.
Identities = 183/340 (53%), Positives = 240/340 (70%), Gaps = 14/340 (4%)
Query: 17 PMFIIIILLVSCASQVVSSRSTHEQS--VVEMHEKWMAQHGRSYKDELEKEMRFKIFKEN 74
P+ + +L +CA +S E S V + H++WM Q+GRSY ++ E E RFKIF EN
Sbjct: 6 PIIALCTMLWACAYTAMSRTLYDETSSVVAKTHQQWMLQYGRSYTNDAEMEKRFKIFMEN 65
Query: 75 LEYIEKANK-EGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
LEYIEK N GN++YKL N+FSDLTN+EF A +TG M PS S++S +L +
Sbjct: 66 LEYIEKFNNAPGNKSYKLDLNQFSDLTNEEFIASHTGL-MIDPSKPSSSSKRASPASLDL 124
Query: 134 TDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCST 193
+D PTSLDWR++ AVT +K+Q CG CWAFSAVAAVEGI KI NLI LSEQQLVDC++
Sbjct: 125 SDTPTSLDWREQGAVTDVKNQGNCGSCWAFSAVAAVEGIVKIKNGNLISLSEQQLVDCAS 184
Query: 194 N-GNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQK-AAAAKISNYEEVPS 251
N N GCGGG M+ AF YI +N GIA+E++Y Y+ GTC + AA+IS YE+VP+
Sbjct: 185 NEQNQGCGGGFMDNAFSYITEN-GIASENDYQYRGGAGTCQNNEMITPAARISGYEDVPA 243
Query: 252 GDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTT-EDGAN 310
G++Q LL AVS QPVS+ IA + F YKEGI++G CG+ L+H VT+VG+GT+ EDG
Sbjct: 244 GEDQLLL-AVSQQPVSVAIAVGQS-FHLYKEGIYSGPCGSSLNHGVTLVGYGTSEEDGTK 301
Query: 311 YWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYP 346
YWLIKNSWG++WG+ GYM++LR+ EG CGI ++S+P
Sbjct: 302 YWLIKNSWGESWGENGYMRLLRESGQSEGHCGIAVKASHP 341
>gi|297794671|ref|XP_002865220.1| senescence-associated gene 12 [Arabidopsis lyrata subsp. lyrata]
gi|297311055|gb|EFH41479.1| senescence-associated gene 12 [Arabidopsis lyrata subsp. lyrata]
Length = 346
Score = 340 bits (871), Expect = 7e-91, Method: Compositional matrix adjust.
Identities = 165/340 (48%), Positives = 227/340 (66%), Gaps = 10/340 (2%)
Query: 16 IPMFIIIILLVSCASQVVSSRSTHEQSVVE-MHEKWMAQHGRSYKDELEKEMRFKIFKEN 74
+ +F+ + + S + SR + +++ H +WM +HGR Y D EK R+ +FK N
Sbjct: 6 MQIFLFVAIFSSFYFSISLSRPLDNELIMQKRHIEWMTKHGRVYADVKEKSNRYVVFKSN 65
Query: 75 LEYIEKANK-EGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSP--SHRSTTSSTFKYQNL 131
+E IE N RT+KL N+F+DLTNDEFR++YTG+K S S T +++F+YQN+
Sbjct: 66 VERIEHLNNIPAGRTFKLAVNQFADLTNDEFRSMYTGFKGVSSLSSQSQTKTTSFRYQNV 125
Query: 132 SMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDC 191
S +P S+DWR K AVTPIK+Q CGCCWAFSAVAA+EG T+I LI LSEQQLVDC
Sbjct: 126 SSGALPISVDWRTKGAVTPIKNQGSCGCCWAFSAVAAIEGATQIKKGKLISLSEQQLVDC 185
Query: 192 STNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQ-KAAAAKISNYEEVP 250
TN + GC GG M+ AFE+I+ G+ TE YPY+ TC++ + A I+ YE+VP
Sbjct: 186 DTN-DFGCEGGLMDTAFEHIMATGGLTTESNYPYKGEDATCNSKKTNPKATSITGYEDVP 244
Query: 251 SGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGAN 310
DEQAL+KAV+ QPVS+GI +F+ Y G+F G C T LDHAVT +G+G + +G+
Sbjct: 245 VNDEQALMKAVAHQPVSVGIEGGGFDFQFYSSGVFTGECTTYLDHAVTAIGYGQSTNGSK 304
Query: 311 YWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYP 346
YW+IKNSWG WG++GYM+I +D +GLCG+ ++SYP
Sbjct: 305 YWIIKNSWGTKWGESGYMRIQKDIKDKQGLCGLAMKASYP 344
>gi|413953668|gb|AFW86317.1| hypothetical protein ZEAMMB73_339067 [Zea mays]
Length = 433
Score = 340 bits (871), Expect = 7e-91, Method: Compositional matrix adjust.
Identities = 158/331 (47%), Positives = 217/331 (65%), Gaps = 8/331 (2%)
Query: 21 IIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEK 80
II C + + + + + +V HE+WMAQ+ R YKD EK RF++FK N+++IE
Sbjct: 104 IIGFAFFCGAAMAARDLSDDSVMVARHEQWMAQYSRVYKDASEKARRFEVFKANVQFIES 163
Query: 81 ANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSL 140
N GN + LG N+F+DLTNDEFR+ T + S + + T F+Y+N+S +PT++
Sbjct: 164 FNAGGNNKFWLGVNQFADLTNDEFRSTKTNKGLKSSNMKIPTG--FRYENVSADALPTTI 221
Query: 141 DWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNG-NNGC 199
DWR K AVTPIKDQ +CGCCWAFSAVAA EGI KIS L+ L+EQ+LVDC +G + GC
Sbjct: 222 DWRTKGAVTPIKDQGQCGCCWAFSAVAATEGIVKISTGKLVSLAEQELVDCDVHGEDQGC 281
Query: 200 GGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLK 259
GG M+ AF++II+N G+ TE YPY A G C + +AA I YE+VP+ DE AL+K
Sbjct: 282 EGGLMDDAFKFIIKNGGLTTESSYPYTAADGKCKSGSNSAAT-IKGYEDVPANDEAALMK 340
Query: 260 AVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWG 319
AV+ QPVS+ + F+ Y G+ G CGT LDH + +G+G T DG YWL+KNSWG
Sbjct: 341 AVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYGKTSDGTKYWLMKNSWG 400
Query: 320 DTWGDAGYMKILRD----EGLCGIGTQSSYP 346
TWG+ GY+++ +D G+CG+ + SYP
Sbjct: 401 TTWGENGYLRMEKDISDKRGMCGLAMEPSYP 431
>gi|357474725|ref|XP_003607647.1| Cysteine proteinase [Medicago truncatula]
gi|355508702|gb|AES89844.1| Cysteine proteinase [Medicago truncatula]
Length = 340
Score = 339 bits (870), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 167/322 (51%), Positives = 220/322 (68%), Gaps = 12/322 (3%)
Query: 33 VSSRSTHEQ-SVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKL 91
V SR +E S+ E HE+WM+++G+ YKD +EKE RF IFK+N+E+IE N N+ YKL
Sbjct: 25 VMSRKLYESPSLQERHEQWMSEYGKLYKDAIEKEKRFMIFKDNVEFIESFNAADNKPYKL 84
Query: 92 GTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPI 151
N +DLT DEF+A GYK R +++FKY+N+ T +P ++DWR K AVTPI
Sbjct: 85 SVNHLADLTLDEFKASRNGYK---KIDREFATTSFKYENV--TAIPEAVDWRVKGAVTPI 139
Query: 152 KDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNG-NNGCGGGTMEKAFEY 210
KDQ +CG CWAFS VAA+EGI +I+ LI LSEQ+LVDC T G + GC GG ME FE+
Sbjct: 140 KDQGQCGSCWAFSTVAAIEGINQITTGKLISLSEQELVDCDTKGEDQGCEGGLMEDGFEF 199
Query: 211 IIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGI 270
II+N GI +E YPY+A G+C+ A A AKI+ YE+VP E +LLKAV+ QP+S+ I
Sbjct: 200 IIKNGGITSETNYPYKAADGSCNTATTAPVAKITGYEKVPVNSEISLLKAVANQPISVSI 259
Query: 271 AAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKI 330
A + F Y GI+ G CGT+LDH VT VG+G+ +G +YW++KNSWG WG+ GY+++
Sbjct: 260 DASDSSFMFYSSGIYTGECGTELDHGVTAVGYGSA-NGTDYWIVKNSWGTVWGEKGYIRM 318
Query: 331 LR----DEGLCGIGTQSSYPLA 348
R EGLCGI SSYP A
Sbjct: 319 QRGIADKEGLCGIAMDSSYPTA 340
>gi|242072392|ref|XP_002446132.1| hypothetical protein SORBIDRAFT_06g002150 [Sorghum bicolor]
gi|241937315|gb|EES10460.1| hypothetical protein SORBIDRAFT_06g002150 [Sorghum bicolor]
Length = 337
Score = 339 bits (870), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 160/338 (47%), Positives = 226/338 (66%), Gaps = 11/338 (3%)
Query: 15 TIPMFIIIIL-LVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKE 73
T F++ IL S S V+++R + ++VE HE WM ++GR YKD EK RF+ FK
Sbjct: 3 TSKAFLLAILGCASLCSSVLAARELSDAAMVERHENWMVEYGRVYKDAAEKARRFEAFKH 62
Query: 74 NLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
N+ ++E N + LG N+F+DLT +EF+A G+K P+ ++ FKY+NLS+
Sbjct: 63 NVAFVESFNTNKKNKFWLGVNQFADLTTEEFKA-NKGFK---PTAEKVPTTGFKYENLSV 118
Query: 134 TDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCST 193
+ +PT++DWR K AVTPIK+Q +CGCCWAFSAVAA+EGI K+S NLI LSEQ+LVDC T
Sbjct: 119 SALPTAVDWRTKGAVTPIKNQGQCGCCWAFSAVAAMEGIVKLSTGNLISLSEQELVDCDT 178
Query: 194 NG-NNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSG 252
+ + GC GG M+ AFE++I+N G+ATE YPY+AV G C K+AA I +E+VP
Sbjct: 179 HSMDEGCEGGWMDSAFEFVIKNGGLATESNYPYKAVDGKCKGGSKSAAT-IKGHEDVPVN 237
Query: 253 DEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYW 312
+E AL+KAV+ QPVS+ + A F Y G+ G CGT+LDH + +G+G DG YW
Sbjct: 238 NEAALMKAVANQPVSVAVDASDRTFMLYSGGVMTGSCGTELDHGIAAIGYGMESDGTKYW 297
Query: 313 LIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYP 346
++KNSWG TWG+ G++++ +D G+CG+ + SYP
Sbjct: 298 ILKNSWGTTWGEKGFLRMEKDITDKRGMCGLAMKPSYP 335
>gi|302143416|emb|CBI21977.3| unnamed protein product [Vitis vinifera]
Length = 297
Score = 339 bits (869), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 168/304 (55%), Positives = 219/304 (72%), Gaps = 13/304 (4%)
Query: 51 MAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTG 110
MA++GR YKD EKE RFKIFK+N+ IE NK ++TYKL N F+DLTN+EFR+L
Sbjct: 1 MARYGRMYKDANEKEKRFKIFKDNVARIESFNKAMDKTYKLSINEFADLTNEEFRSLRNR 60
Query: 111 YKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVE 170
+K +H + ++TFKY+N+ T VP+++DWR K AVTPIKDQQ+CGCCWAFSAVAA E
Sbjct: 61 FK----AHICSEATTFKYENV--TAVPSTIDWRKKGAVTPIKDQQQCGCCWAFSAVAATE 114
Query: 171 GITKISGANLIQLSEQQLVDCSTNG-NNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQ 229
GIT+I+ LI LSEQ+LVDC T G N GC GG M+ AF + I+ G+A+E YPY+
Sbjct: 115 GITQITTGKLISLSEQELVDCDTGGENQGCSGGLMDDAFRF-IKIHGLASEATYPYEGDD 173
Query: 230 GTCSAAQKA-AAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGV 288
GTC++ ++A AAKI YE+VP+ +E+AL KAV+ QPV++ I A EF+ Y G+F G
Sbjct: 174 GTCNSKKEAHPAAKIKGYEDVPANNEKALQKAVAHQPVAVAIDAGGFEFQFYTSGVFTGQ 233
Query: 289 CGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSS 344
CGT+LDH V VG+G +DG YWL+KNSWG WG+ GY+++ RD EGLCGI Q+S
Sbjct: 234 CGTELDHGVAAVGYGIGDDGMMYWLVKNSWGTGWGEEGYIRMQRDVTAKEGLCGIAMQAS 293
Query: 345 YPLA 348
YP A
Sbjct: 294 YPTA 297
>gi|297851334|ref|XP_002893548.1| peptidase C1A papain family protein [Arabidopsis lyrata subsp.
lyrata]
gi|297339390|gb|EFH69807.1| peptidase C1A papain family protein [Arabidopsis lyrata subsp.
lyrata]
Length = 346
Score = 338 bits (868), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 170/342 (49%), Positives = 228/342 (66%), Gaps = 13/342 (3%)
Query: 18 MFIIIILLVSC--ASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENL 75
MF+ + +L SQ S + HE V E H++WM + R Y DELEK+MRF +FK+NL
Sbjct: 7 MFVSLTILSMSLKVSQATSRVTFHEPIVAEHHQQWMTRFSRVYSDELEKQMRFDVFKKNL 66
Query: 76 EYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYK----MPSPSHRSTTSSTFKYQNL 131
++IEK NK+G+RTYKLG N F+D T +EF A +TG K +PS ++ + N+
Sbjct: 67 KFIEKFNKKGDRTYKLGVNEFADWTKEEFIATHTGLKGFNGIPSSEFVDEMIPSWNW-NV 125
Query: 132 SMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDC 191
S P DWR + AVTP+K Q +CGCCWAFS+VAAVEG+TKI G NL+ LSEQQL+DC
Sbjct: 126 SDVAGPEIKDWRYEGAVTPVKYQGQCGCCWAFSSVAAVEGLTKIVGGNLVSLSEQQLLDC 185
Query: 192 STNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPS 251
+NGC GG M AF YII+N+GIA+E YPYQ +GTC K +A I ++ VPS
Sbjct: 186 DRERDNGCNGGIMSDAFSYIIKNRGIASEASYPYQETEGTCRYNAKPSAW-IRGFQTVPS 244
Query: 252 GDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFN-GVCGTQLDHAVTIVGFGTTEDGAN 310
+E+ALL+AVS QPVS+ I A F Y G+++ CGT ++HAVT VG+GT+ +G
Sbjct: 245 NNERALLEAVSRQPVSVSIDADGPGFMHYSGGVYDEPYCGTDVNHAVTFVGYGTSPEGIK 304
Query: 311 YWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYPLA 348
YWL KNSWG+TWG+ GY++I RD +G+CG+ + YP+A
Sbjct: 305 YWLAKNSWGETWGENGYIRIRRDVAWPQGMCGVAQYAFYPVA 346
>gi|414587996|tpg|DAA38567.1| TPA: hypothetical protein ZEAMMB73_390779 [Zea mays]
Length = 343
Score = 338 bits (867), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 160/344 (46%), Positives = 232/344 (67%), Gaps = 14/344 (4%)
Query: 13 INTIPMFIIIILLVSCA----SQVVSSRS-THEQSVVEMHEKWMAQHGRSYKDELEKEMR 67
+++ +++ +L CA S V+++R + + ++ E HE+WMA +GR YKD EK R
Sbjct: 2 VSSRAFLLLLAILTGCACSFPSPVLAARELSDDAAMAERHERWMAVYGRVYKDAAEKARR 61
Query: 68 FKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFK 127
F++FK+NL ++E N + + LG N+F+DLT +EF+A G+K S TT FK
Sbjct: 62 FEVFKDNLAFVESFNADKKNKFWLGVNQFADLTTEEFKA-NKGFKPISAEEVPTTG--FK 118
Query: 128 YQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQ 187
Y+NLS++ +PT++DWR K AVTPIK+Q +CGCCWAFSAVAA+EGI K+S NL+ LSEQ+
Sbjct: 119 YENLSVSALPTAVDWRTKGAVTPIKNQGQCGCCWAFSAVAAMEGIVKLSTDNLVSLSEQE 178
Query: 188 LVDCSTNG-NNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNY 246
LVDC T+ + GC GG M+ AFE++I+N G+ATE YPY+AV G C K+AA I +
Sbjct: 179 LVDCDTHSMDEGCEGGWMDSAFEFVIKNGGLATESSYPYKAVDGKCKGGSKSAAT-IKGH 237
Query: 247 EEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTE 306
E+VP +E AL+KAV+ QPVS+ + A F Y G+ G CGTQLDH + +G+G
Sbjct: 238 EDVPPNNEAALMKAVASQPVSVAVDASDRTFMLYSGGVMTGSCGTQLDHGIAAIGYGVES 297
Query: 307 DGANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYP 346
DG YW++KNSWG TWG+ ++++ +D +G+CG+ + SYP
Sbjct: 298 DGTKYWILKNSWGTTWGEKRFLRMEKDISDKQGMCGLAMKPSYP 341
>gi|356515036|ref|XP_003526207.1| PREDICTED: thiol protease SEN102-like [Glycine max]
Length = 336
Score = 338 bits (866), Expect = 3e-90, Method: Compositional matrix adjust.
Identities = 167/321 (52%), Positives = 219/321 (68%), Gaps = 13/321 (4%)
Query: 33 VSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLG 92
V R HE S+ E HE+WM ++G+ YKD EK+ RF+IFK+N+E+IE N +GN+ YKLG
Sbjct: 24 VMCRKLHETSMRERHEQWMTEYGKVYKDAAEKDKRFQIFKDNVEFIESFNADGNKPYKLG 83
Query: 93 TNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIK 152
N +DLT +EF+A G+K P H +T +TFKY+N+ T +P ++DWR K AVTPIK
Sbjct: 84 VNHLADLTVEEFKASRNGFKRP---HEFST-TTFKYENV--TAIPAAIDWRTKGAVTPIK 137
Query: 153 DQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNG-NNGCGGGTMEKAFEYI 211
DQ +CG CWAFS +AA EGI +I+ L+ LSEQ+LVDC T G + GC GG ME FE+I
Sbjct: 138 DQGQCGSCWAFSTIAATEGIHQITTGKLVSLSEQELVDCDTKGVDQGCEGGYMEDGFEFI 197
Query: 212 IQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIA 271
I+N GI +E YPY+AV G C+ A + A+I YE+VP E AL KAV+ QPVS+ I
Sbjct: 198 IKNGGITSETNYPYKAVDGKCNKAT-SPVAQIKGYEKVPPNSETALQKAVANQPVSVSID 256
Query: 272 AYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKIL 331
A F Y GI+NG CGT+LDH VT VG+GT +G +YW++KNSWG WG+ GY+++
Sbjct: 257 ADGAGFMFYSSGIYNGECGTELDHGVTAVGYGTA-NGTDYWIVKNSWGTQWGEKGYVRMQ 315
Query: 332 R----DEGLCGIGTQSSYPLA 348
R GLCGI SSYP +
Sbjct: 316 RGIAAKHGLCGIALDSSYPTS 336
>gi|224093956|ref|XP_002310053.1| predicted protein [Populus trichocarpa]
gi|224147016|ref|XP_002336386.1| predicted protein [Populus trichocarpa]
gi|222834869|gb|EEE73318.1| predicted protein [Populus trichocarpa]
gi|222852956|gb|EEE90503.1| predicted protein [Populus trichocarpa]
Length = 340
Score = 338 bits (866), Expect = 3e-90, Method: Compositional matrix adjust.
Identities = 163/337 (48%), Positives = 232/337 (68%), Gaps = 16/337 (4%)
Query: 20 IIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIE 79
+ ++ ++ ++R+ + + E HE+WM Q+GR YKD+ E+ R+ IFKEN+ I+
Sbjct: 12 LALLFILGAWPSKSTARTLLDAPMYERHEQWMTQYGRVYKDDNERATRYSIFKENVARID 71
Query: 80 KANKEGNRTYKLGTNRFSDLTNDEFRALYTGYK--MPSPSHRSTTSSTFKYQNLSMTDVP 137
N + ++YKLG N+F+DLTN+EF+A +K M SP + F+Y+N+S VP
Sbjct: 72 AFNSQTGKSYKLGVNQFADLTNEEFKASRNRFKGHMCSPQ-----AGPFRYENVSA--VP 124
Query: 138 TSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNG-N 196
+++DWR + AVTP+KDQ +CGCCWAFSAVAA+EGI K++ LI LSEQ++VDC T G +
Sbjct: 125 STVDWRKEGAVTPVKDQGQCGCCWAFSAVAAMEGINKLTTGKLISLSEQEVVDCDTKGED 184
Query: 197 NGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAA-AAKISNYEEVPSGDEQ 255
GC GG M+ AF++I QN+G+ TE YPY+ GTC+ + A AAKI+ +E+VP+ E
Sbjct: 185 QGCNGGLMDDAFKFIEQNKGLTTEANYPYKGTDGTCNTNKAAIHAAKITGFEDVPANSEA 244
Query: 256 ALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIK 315
AL+KAV+ QPVS+ I A ++F+ Y GIF G C TQLDH VT VG+G + DG+ YWL+K
Sbjct: 245 ALMKAVAKQPVSVAIDAGGSDFQFYSSGIFTGSCDTQLDHGVTAVGYGVS-DGSKYWLVK 303
Query: 316 NSWGDTWGDAGYMKILRD----EGLCGIGTQSSYPLA 348
NSWG WG+ GY+++ +D EGLCGI Q+SYP A
Sbjct: 304 NSWGAQWGEEGYIRMQKDISAKEGLCGIAMQASYPTA 340
>gi|400180387|gb|AFP73332.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 337 bits (865), Expect = 3e-90, Method: Compositional matrix adjust.
Identities = 168/345 (48%), Positives = 233/345 (67%), Gaps = 13/345 (3%)
Query: 10 SFKINTIPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFK 69
+ K++ + + I + ++S + RS + SV E HE WM++HGR YKDE+EK RF
Sbjct: 2 AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKVERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
IFKEN+++IE NK GN +YKLG N F+D+T+ EF A +TG +P SPS S+T
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119
Query: 126 FKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSE 185
K +LS D+P++LDW + AVT +K Q CGCCWAFSAV ++EG KI+ NL++ SE
Sbjct: 120 LKINDLSDDDMPSNLDWIESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179
Query: 186 QQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISN 245
Q+L+DC+TN N GC GG M AF++I +N GI+ E +Y Y Q TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQEKTAAVQISS 238
Query: 246 YEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
Y+ VP G E +LL+AV+ QPVSIGIAA + + + Y G ++G C +++HAVT +G+GT
Sbjct: 239 YQVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTD 296
Query: 306 EDGANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYP 346
E G YWL+KNSWG +WG+ G+MKI+RD GLC I SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGENGFMKIIRDYGNPAGLCDIAKMSSYP 341
>gi|18422605|ref|NP_568651.1| senescence-associated protein 12 [Arabidopsis thaliana]
gi|13877737|gb|AAK43946.1|AF370131_1 putative senescence-specific cysteine protease SAG12 [Arabidopsis
thaliana]
gi|9758936|dbj|BAB09317.1| senescence-specific cysteine protease [Arabidopsis thaliana]
gi|14532898|gb|AAK64131.1| putative senescence-specific cysteine protease SAG12 [Arabidopsis
thaliana]
gi|332007929|gb|AED95312.1| senescence-associated protein 12 [Arabidopsis thaliana]
Length = 346
Score = 337 bits (865), Expect = 3e-90, Method: Compositional matrix adjust.
Identities = 167/343 (48%), Positives = 226/343 (65%), Gaps = 10/343 (2%)
Query: 13 INTIPMFIIIILLVS-CASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIF 71
+ + +F+ + + S C S +S +E + + H +WM +HGR Y D E+ R+ +F
Sbjct: 3 LKHMQIFLFVAIFSSFCFSITLSRPLDNELIMQKRHIEWMTKHGRVYADVKEENNRYVVF 62
Query: 72 KENLEYIEKANK-EGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSP--SHRSTTSSTFKY 128
K N+E IE N RT+KL N+F+DLTNDEFR++YTG+K S S T S F+Y
Sbjct: 63 KNNVERIEHLNSIPAGRTFKLAVNQFADLTNDEFRSMYTGFKGVSALSSQSQTKMSPFRY 122
Query: 129 QNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQL 188
QN+S +P S+DWR K AVTPIK+Q CGCCWAFSAVAA+EG T+I LI LSEQQL
Sbjct: 123 QNVSSGALPVSVDWRKKGAVTPIKNQGSCGCCWAFSAVAAIEGATQIKKGKLISLSEQQL 182
Query: 189 VDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQ-KAAAAKISNYE 247
VDC TN + GC GG M+ AFE+I G+ TE YPY+ TC++ + A I+ YE
Sbjct: 183 VDCDTN-DFGCEGGLMDTAFEHIKATGGLTTESNYPYKGEDATCNSKKTNPKATSITGYE 241
Query: 248 EVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTED 307
+VP DEQAL+KAV+ QPVS+GI +F+ Y G+F G C T LDHAVT +G+G + +
Sbjct: 242 DVPVNDEQALMKAVAHQPVSVGIEGGGFDFQFYSSGVFTGECTTYLDHAVTAIGYGESTN 301
Query: 308 GANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYP 346
G+ YW+IKNSWG WG++GYM+I +D +GLCG+ ++SYP
Sbjct: 302 GSKYWIIKNSWGTKWGESGYMRIQKDVKDKQGLCGLAMKASYP 344
>gi|30690594|ref|NP_564321.2| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|28393492|gb|AAO42167.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|332192920|gb|AEE31041.1| cysteine proteinase-like protein [Arabidopsis thaliana]
Length = 355
Score = 337 bits (865), Expect = 4e-90, Method: Compositional matrix adjust.
Identities = 170/349 (48%), Positives = 233/349 (66%), Gaps = 14/349 (4%)
Query: 12 KINTIPMFIIIILLVSC---ASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRF 68
K+ +I ++ + ++S SQ S + HE V E H++WM + R Y DELEK+MRF
Sbjct: 9 KMTSILFMLVSLTILSMNLKVSQATSRVTFHEPIVAEHHQQWMTRFSRVYSDELEKQMRF 68
Query: 69 KIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYK----MPSPSHRSTTSS 124
+FK+NL++IEK NK+G+RTYKLG N F+D T +EF A +TG K +PS
Sbjct: 69 DVFKKNLKFIEKFNKKGDRTYKLGVNEFADWTREEFIATHTGLKGVNGIPSSEFVDEMIP 128
Query: 125 TFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLS 184
++ + N+S + DWR + AVTP+K Q +CGCCWAFS+VAAVEG+TKI G NL+ LS
Sbjct: 129 SWNW-NVSDVAGRETKDWRYEGAVTPVKYQGQCGCCWAFSSVAAVEGLTKIVGNNLVSLS 187
Query: 185 EQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKIS 244
EQQL+DC +NGC GG M AF YII+N+GIA+E YPYQA +GTC K +A I
Sbjct: 188 EQQLLDCDRERDNGCNGGIMSDAFSYIIKNRGIASEASYPYQAAEGTCRYNGKPSAW-IR 246
Query: 245 NYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFN-GVCGTQLDHAVTIVGFG 303
++ VPS +E+ALL+AVS QPVS+ I A F Y G+++ CGT ++HAVT VG+G
Sbjct: 247 GFQTVPSNNERALLEAVSKQPVSVSIDADGPGFMHYSGGVYDEPYCGTNVNHAVTFVGYG 306
Query: 304 TTEDGANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYPLA 348
T+ +G YWL KNSWG+TWG+ GY++I RD +G+CG+ + YP+A
Sbjct: 307 TSPEGIKYWLAKNSWGETWGENGYIRIRRDVAWPQGMCGVAQYAFYPVA 355
>gi|125551397|gb|EAY97106.1| hypothetical protein OsI_19029 [Oryza sativa Indica Group]
Length = 350
Score = 337 bits (864), Expect = 5e-90, Method: Compositional matrix adjust.
Identities = 163/346 (47%), Positives = 226/346 (65%), Gaps = 16/346 (4%)
Query: 17 PMFIIIILLVSC------ASQVVSSRSTH-EQSVVEMHEKWMAQHGRSYKDELEKEMRFK 69
P+ + I+ + C + V ++R + ++ HE+WMAQHGR YKD EK R +
Sbjct: 7 PLLLAILCCIVCLYSSSGGAIVAAARELGGDAAMAARHERWMAQHGRVYKDAAEKARRLE 66
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYK-MPSPSHRSTTSSTFKY 128
+FK N+ +IE N G Y LG N+F+DLT++EF+A T K +P++ S+ FKY
Sbjct: 67 VFKANVAFIESFNAGGKNRYWLGVNQFADLTSEEFKATMTNSKGFSTPNNGVRVSTGFKY 126
Query: 129 QNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQL 188
+N+S +P S+DWR K AVT IKDQ +CGCCWAFSAVAA+EGI K+S LI LSEQ+L
Sbjct: 127 ENVSADALPASVDWRTKGAVTRIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLISLSEQEL 186
Query: 189 VDCSTNGNN-GCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTC-SAAQKAAAAKISNY 246
VDC +GN+ GC GG ++ AF++I+ N G+ E YPY A G C + A AA I Y
Sbjct: 187 VDCDVDGNDQGCEGGEIDGAFQFILSNGGLTAEANYPYTAEDGRCKTTAAADVAASIRGY 246
Query: 247 EEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTE 306
E+VP+ DE +L+KAV+ QPVS+ + A ++F+ Y G+ G CGT LDH VT++G+G
Sbjct: 247 EDVPANDEPSLMKAVAGQPVSVAVDA--SKFQFYGGGVMAGECGTSLDHGVTVIGYGAAS 304
Query: 307 DGANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYPLA 348
DG YWL+KNSWG TWG+AGY+++ +D G+CG+ Q SYP A
Sbjct: 305 DGTKYWLVKNSWGTTWGEAGYLRMEKDIDDKRGMCGLAMQPSYPTA 350
>gi|297740489|emb|CBI30671.3| unnamed protein product [Vitis vinifera]
Length = 320
Score = 337 bits (863), Expect = 6e-90, Method: Compositional matrix adjust.
Identities = 173/335 (51%), Positives = 226/335 (67%), Gaps = 30/335 (8%)
Query: 20 IIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIE 79
I ++++ ASQ +S R+ HE S+ E HE WM +GR+YKD EKE RFKIFKEN+EYIE
Sbjct: 10 ITLLIMGVWASQALS-RTLHEVSMSERHEDWMGLYGRTYKDIAEKERRFKIFKENVEYIE 68
Query: 80 KANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTS 139
NK F+A GY M S RS+ ++F+Y+N++ VP+S
Sbjct: 69 SVNK--------------------FKASRNGYNMSSRP-RSSEITSFRYENVAA--VPSS 105
Query: 140 LDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNG-NNG 198
+DWR K AVTPIKDQ +CGCCWAFSAVAA+EG+T++ LI LSEQ+LVDC T+G + G
Sbjct: 106 MDWRKKGAVTPIKDQGQCGCCWAFSAVAAMEGVTQLKTGELISLSEQELVDCDTSGEDQG 165
Query: 199 CGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAK-ISNYEEVPSGDEQAL 257
CGGG M+ AFE+II N G+ TE YPY+ V TC+ + A++A I NYE+VP+ E AL
Sbjct: 166 CGGGLMDSAFEFIIGNGGLTTEANYPYKGVDATCNKKKAASSAAKIKNYEDVPANSEAAL 225
Query: 258 LKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNS 317
LKAV+ PVS+ I A ++F+ Y G+F G CGT+LDH VT VG+G T+DG YWL+KNS
Sbjct: 226 LKAVAQHPVSVAIDAGGSDFQFYSSGVFTGQCGTELDHGVTAVGYGKTDDGTKYWLVKNS 285
Query: 318 WGDTWGDAGYMKILR----DEGLCGIGTQSSYPLA 348
WG WG+ GY+ + R DEGLCGI ++SYP A
Sbjct: 286 WGTGWGEDGYIWMERDIGADEGLCGIAMEASYPTA 320
>gi|125547256|gb|EAY93078.1| hypothetical protein OsI_14879 [Oryza sativa Indica Group]
Length = 339
Score = 336 bits (861), Expect = 1e-89, Method: Compositional matrix adjust.
Identities = 160/336 (47%), Positives = 220/336 (65%), Gaps = 9/336 (2%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEY 77
+F I+ L C++ + + + ++V HE+WM Q+GR YKD EK RF+IFK N+ +
Sbjct: 8 LFAILSCLCLCSAVLAAREQSDHAAMVARHERWMEQYGRVYKDATEKARRFEIFKANVAF 67
Query: 78 IEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
IE N GN + LG N+F+DLTN EFRA T + R T TF+Y+N+S+ +P
Sbjct: 68 IESFNA-GNHKFWLGVNQFADLTNYEFRATKTNKGFIPSTVRVPT--TFRYENVSIDTLP 124
Query: 138 TSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNG-N 196
++DWR K AVTPIKDQ +CGCCWAFSAVAA+EGI K+S LI LSEQ+LVDC +G +
Sbjct: 125 ATVDWRTKGAVTPIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLISLSEQELVDCDVHGED 184
Query: 197 NGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQA 256
GC GG M+ AF++II+N G+ TE +YPY A G C+ +AA I YEEVP+ +E A
Sbjct: 185 QGCEGGLMDDAFKFIIKNGGLTTESKYPYTAADGKCNGGSNSAAT-IKGYEEVPANNEAA 243
Query: 257 LLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKN 316
L+KAV+ QPVS+ + F+ Y G+ G CGT LDH + +G+G DG YWL+KN
Sbjct: 244 LMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIVAIGYGKDGDGTQYWLLKN 303
Query: 317 SWGDTWGDAGYMKILRD----EGLCGIGTQSSYPLA 348
SWG TWG+ G++++ +D G+CG+ + SYP A
Sbjct: 304 SWGTTWGENGFLRMEKDISDKRGMCGLAMEPSYPTA 339
>gi|20334375|gb|AAM19208.1|AF493233_1 cysteine protease [Solanum pennellii]
Length = 337
Score = 336 bits (861), Expect = 1e-89, Method: Compositional matrix adjust.
Identities = 166/341 (48%), Positives = 234/341 (68%), Gaps = 12/341 (3%)
Query: 10 SFKINTIPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFK 69
+ KI+ + + I + ++S + RS + SV E HE WM++HGR YKDE+EK RF
Sbjct: 2 AMKIDLMSILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQ 129
IFKEN+++IE NK GN +YKLG N F+D+T+ EF A +TG +P+ S+ S +
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPN-SYLSPS----PIN 116
Query: 130 NLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLV 189
+LS D+P++LDWR+ AVT +K+Q +CGCCWAFSAV ++EG KI+ NL++ SEQ+L+
Sbjct: 117 DLSDDDMPSNLDWRESGAVTQVKNQGQCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELL 176
Query: 190 DCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEV 249
DC+TN N GC GG M AF++I +N GI+ E +Y Y Q TC + +K AA +IS+Y+ V
Sbjct: 177 DCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGQQYTCRSQEKTAAVQISSYQVV 235
Query: 250 PSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGA 309
P G E +LL+AV+ QPVSIGIAA + + + Y G ++G C +++HAVT +G+GT E G
Sbjct: 236 PEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCANRINHAVTAIGYGTDEKGQ 293
Query: 310 NYWLIKNSWGDTWGDAGYMKILRDE----GLCGIGTQSSYP 346
YWL+KNSWG +WG+ G+MKI+RD GLC I SSYP
Sbjct: 294 KYWLLKNSWGTSWGEDGFMKIIRDSGNPAGLCDIAKVSSYP 334
>gi|400180437|gb|AFP73356.1| cysteine protease [Solanum pennellii]
Length = 337
Score = 335 bits (860), Expect = 1e-89, Method: Compositional matrix adjust.
Identities = 165/341 (48%), Positives = 234/341 (68%), Gaps = 12/341 (3%)
Query: 10 SFKINTIPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFK 69
+ K++ + + I + ++S + RS + SV E HE WM++HGR YKDE+EK RF
Sbjct: 2 AMKVDLMSILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQ 129
IFKEN+++IE NK GN +YKLG N F+D+T+ EF A +TG +P+ S+ S +
Sbjct: 62 IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPN-SYLSPS----PIN 116
Query: 130 NLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLV 189
+LS D+P++LDWR+ AVT +K+Q +CGCCWAFSAV ++EG KI+ NL++ SEQ+L+
Sbjct: 117 DLSDDDMPSNLDWRESGAVTQVKNQGQCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELL 176
Query: 190 DCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEV 249
DC+TN N GC GG M AF++I +N GI+ E +Y Y Q TC + +K AA +IS+Y+ V
Sbjct: 177 DCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGQQYTCRSQEKTAAVQISSYQVV 235
Query: 250 PSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGA 309
P G E +LL+AV+ QPVSIGIAA + + + Y G ++G C +++HAVT +G+GT E G
Sbjct: 236 PEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCANRINHAVTAIGYGTDEKGQ 293
Query: 310 NYWLIKNSWGDTWGDAGYMKILRDE----GLCGIGTQSSYP 346
YWL+KNSWG +WG+ G+MKI+RD GLC I SSYP
Sbjct: 294 KYWLLKNSWGTSWGEDGFMKIIRDSGNPAGLCDIAKVSSYP 334
>gi|413917937|gb|AFW57869.1| hypothetical protein ZEAMMB73_830006 [Zea mays]
Length = 443
Score = 335 bits (860), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 159/326 (48%), Positives = 222/326 (68%), Gaps = 12/326 (3%)
Query: 19 FIII-ILLVSCA---SQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKEN 74
F+++ ++ +CA S + +Q++V HE+WMA++ R Y D EK RF++FK N
Sbjct: 9 FVLLSVVAWACALSGSLAARDLADQDQAMVARHEEWMAKYDRVYSDAAEKARRFEVFKAN 68
Query: 75 LEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYK----MPSPSHRSTTSST-FKYQ 129
+ IE N GN + L NRF+DLT+DEFRA +TGY+ S RS T++T FKY
Sbjct: 69 MALIESVNA-GNHKFWLEANRFADLTDDEFRATWTGYRPKTAAASSKGRSRTATTGFKYA 127
Query: 130 NLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLV 189
N+S+ DVP S+DWR K AVTPIK+Q ECGCCWAFSAVA++EG+ K+S L+ LSEQ+LV
Sbjct: 128 NVSLDDVPASVDWRTKGAVTPIKNQGECGCCWAFSAVASMEGVVKLSTGKLVSLSEQELV 187
Query: 190 DCSTNG-NNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAA-AAKISNYE 247
DC NG + GC GG M+ AF++I+ N G+ TE YPY A GTC++ + + AA I YE
Sbjct: 188 DCDVNGMDQGCEGGEMDDAFDFIVGNGGLTTESRYPYTASDGTCNSNEASGDAASIKGYE 247
Query: 248 EVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTED 307
+VP+ DE +L KAV+ QPVS+ + + F+ YK G+ +G CGT+LDH + VG+G D
Sbjct: 248 DVPANDEASLRKAVANQPVSVAVDGGDSHFRFYKGGVLSGACGTELDHGIAAVGYGVASD 307
Query: 308 GANYWLIKNSWGDTWGDAGYMKILRD 333
G YW++KNSWG +WG+AGY+++ RD
Sbjct: 308 GTKYWVMKNSWGTSWGEAGYIRMERD 333
>gi|224162986|ref|XP_002338508.1| predicted protein [Populus trichocarpa]
gi|222872535|gb|EEF09666.1| predicted protein [Populus trichocarpa]
Length = 306
Score = 335 bits (858), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 162/314 (51%), Positives = 222/314 (70%), Gaps = 16/314 (5%)
Query: 43 VVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTND 102
+ E HE+WM Q+GR YKD+ E+ R+ IFKEN+ I+ N + ++YKLG N+F+DLTN+
Sbjct: 1 MYERHEQWMTQYGRVYKDDNERATRYSIFKENVARIDAFNSQTGKSYKLGVNQFADLTNE 60
Query: 103 EFRALYTGYK--MPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCC 160
EF+A +K M SP + F+Y+N+S VP+++DWR + AVTP+KDQ +CGCC
Sbjct: 61 EFKASRNRFKGHMCSPQ-----AGPFRYENVSA--VPSTVDWRKEGAVTPVKDQGQCGCC 113
Query: 161 WAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNG-NNGCGGGTMEKAFEYIIQNQGIAT 219
WAFSAVAA+EGI K++ LI LSEQ++VDC T G + GC GG M+ AF++I QN+G+ T
Sbjct: 114 WAFSAVAAMEGINKLTTGKLISLSEQEVVDCDTKGEDQGCNGGLMDDAFKFIEQNKGLTT 173
Query: 220 EDEYPYQAVQGTCSAAQKA-AAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFK 278
E YPY+ GTC+ + A AAKI+ +E+VP+ E AL+KAV+ QPVS+ I A ++F+
Sbjct: 174 EANYPYKGTDGTCNTKKSAIHAAKITGFEDVPANSEAALMKAVAKQPVSVAIDAGGSDFQ 233
Query: 279 SYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD----E 334
Y GIF G C TQLDH VT VG+G + DG+ YWL+KNSWG WG+ GY+++ +D E
Sbjct: 234 FYSSGIFTGSCDTQLDHGVTAVGYGVS-DGSKYWLVKNSWGAQWGEEGYIRMQKDISAKE 292
Query: 335 GLCGIGTQSSYPLA 348
GLCGI Q+SYP A
Sbjct: 293 GLCGIAMQASYPTA 306
>gi|9502421|gb|AAF88120.1|AC021043_13 Putative cysteine proteinase [Arabidopsis thaliana]
Length = 331
Score = 335 bits (858), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 167/328 (50%), Positives = 223/328 (67%), Gaps = 11/328 (3%)
Query: 30 SQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTY 89
SQ S + HE V E H++WM + R Y DELEK+MRF +FK+NL++IEK NK+G+RTY
Sbjct: 6 SQATSRVTFHEPIVAEHHQQWMTRFSRVYSDELEKQMRFDVFKKNLKFIEKFNKKGDRTY 65
Query: 90 KLGTNRFSDLTNDEFRALYTGYK----MPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDK 145
KLG N F+D T +EF A +TG K +PS ++ + N+S + DWR +
Sbjct: 66 KLGVNEFADWTREEFIATHTGLKGVNGIPSSEFVDEMIPSWNW-NVSDVAGRETKDWRYE 124
Query: 146 KAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTME 205
AVTP+K Q +CGCCWAFS+VAAVEG+TKI G NL+ LSEQQL+DC +NGC GG M
Sbjct: 125 GAVTPVKYQGQCGCCWAFSSVAAVEGLTKIVGNNLVSLSEQQLLDCDRERDNGCNGGIMS 184
Query: 206 KAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAVSMQP 265
AF YII+N+GIA+E YPYQA +GTC K +A I ++ VPS +E+ALL+AVS QP
Sbjct: 185 DAFSYIIKNRGIASEASYPYQAAEGTCRYNGKPSAW-IRGFQTVPSNNERALLEAVSKQP 243
Query: 266 VSIGIAAYTTEFKSYKEGIFN-GVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGD 324
VS+ I A F Y G+++ CGT ++HAVT VG+GT+ +G YWL KNSWG+TWG+
Sbjct: 244 VSVSIDADGPGFMHYSGGVYDEPYCGTNVNHAVTFVGYGTSPEGIKYWLAKNSWGETWGE 303
Query: 325 AGYMKILRD----EGLCGIGTQSSYPLA 348
GY++I RD +G+CG+ + YP+A
Sbjct: 304 NGYIRIRRDVAWPQGMCGVAQYAFYPVA 331
>gi|38346003|emb|CAD40112.2| OSJNBa0035O13.5 [Oryza sativa Japonica Group]
gi|125589427|gb|EAZ29777.1| hypothetical protein OsJ_13835 [Oryza sativa Japonica Group]
Length = 339
Score = 334 bits (857), Expect = 3e-89, Method: Compositional matrix adjust.
Identities = 159/336 (47%), Positives = 220/336 (65%), Gaps = 9/336 (2%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEY 77
+F I+ L C++ + + + ++V HE+WM Q+GR YKD EK RF+IFK N+ +
Sbjct: 8 LFAILSCLCLCSAVLAAREQSDHAAMVARHERWMEQYGRVYKDATEKARRFEIFKANVAF 67
Query: 78 IEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
IE N GN + LG N+F+DLTN EFRA T + R T TF+Y+N+S+ +P
Sbjct: 68 IESFNA-GNHKFWLGVNQFADLTNYEFRATKTNKGFIPSTVRVPT--TFRYENVSIDTLP 124
Query: 138 TSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNG-N 196
++DWR K AVTPIKDQ +CGCCWAFSAVAA+EGI K+S LI LSEQ+LVDC +G +
Sbjct: 125 ATVDWRTKGAVTPIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLISLSEQELVDCDVHGED 184
Query: 197 NGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQA 256
GC GG M+ AF++II+N G+ TE +YPY A G C+ +AA I YE+VP+ +E A
Sbjct: 185 QGCEGGLMDDAFKFIIKNGGLTTESKYPYTAADGKCNGGSNSAAT-IKGYEDVPANNEAA 243
Query: 257 LLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKN 316
L+KAV+ QPVS+ + F+ Y G+ G CGT LDH + +G+G DG YWL+KN
Sbjct: 244 LMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIVAIGYGKDGDGTQYWLLKN 303
Query: 317 SWGDTWGDAGYMKILRD----EGLCGIGTQSSYPLA 348
SWG TWG+ G++++ +D G+CG+ + SYP A
Sbjct: 304 SWGTTWGENGFLRMEKDISDKRGMCGLAMEPSYPTA 339
>gi|1046373|gb|AAC49135.1| SAG12 protein [Arabidopsis thaliana]
Length = 346
Score = 334 bits (857), Expect = 3e-89, Method: Compositional matrix adjust.
Identities = 166/343 (48%), Positives = 226/343 (65%), Gaps = 10/343 (2%)
Query: 13 INTIPMFIIIILLVS-CASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIF 71
+ + +F+ + + S C S +S +E + + H +WM +HGR Y D E+ R+ +F
Sbjct: 3 LKHMQIFLFVAIFSSFCFSITLSRPLDNELIMQKRHIEWMTKHGRVYADVKEENNRYVVF 62
Query: 72 KENLEYIEKANK-EGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSP--SHRSTTSSTFKY 128
K N+E IE N RT+KL N+F+DLTNDEF ++YTG+K S S T S F+Y
Sbjct: 63 KNNVERIEHLNSIPAGRTFKLAVNQFADLTNDEFCSMYTGFKGVSALSSQSQTKMSPFRY 122
Query: 129 QNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQL 188
QN+S +P S+DWR K AVTPIK+Q CGCCWAFSAVAA+EG T+I LI LSEQQL
Sbjct: 123 QNVSSGALPVSVDWRKKGAVTPIKNQGSCGCCWAFSAVAAIEGATQIKKGKLISLSEQQL 182
Query: 189 VDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQ-KAAAAKISNYE 247
VDC TN + GC GG M+ AFE+I G+ TE +YPY+ TC++ + A I+ YE
Sbjct: 183 VDCDTN-DFGCEGGLMDTAFEHIKATGGLTTESDYPYKGEDATCNSKKTNPKATSITGYE 241
Query: 248 EVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTED 307
+VP DEQAL+KAV+ QPVS+GI +F+ Y G+F G C T LDHAVT +G+G + +
Sbjct: 242 DVPVNDEQALMKAVAHQPVSVGIEGGGFDFQFYSSGVFTGECTTYLDHAVTAIGYGESTN 301
Query: 308 GANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYP 346
G+ YW+IKNSWG WG++GYM+I +D +GLCG+ ++SYP
Sbjct: 302 GSKYWIIKNSWGTKWGESGYMRIQKDVKDKQGLCGLAMKASYP 344
>gi|77554625|gb|ABA97421.1| Vignain precursor, putative [Oryza sativa Japonica Group]
gi|222630746|gb|EEE62878.1| hypothetical protein OsJ_17681 [Oryza sativa Japonica Group]
Length = 350
Score = 334 bits (857), Expect = 4e-89, Method: Compositional matrix adjust.
Identities = 161/344 (46%), Positives = 224/344 (65%), Gaps = 16/344 (4%)
Query: 17 PMFIIIILLVSC------ASQVVSSRSTH-EQSVVEMHEKWMAQHGRSYKDELEKEMRFK 69
P+ + I+ + C + V ++R + ++ HE+WMAQHGR YKD EK R +
Sbjct: 7 PLLLAILCCIVCLYSSSGGAIVAAARELGGDAAMAARHERWMAQHGRVYKDAAEKARRLE 66
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYK-MPSPSHRSTTSSTFKY 128
+FK N+ +IE N G Y LG N+F+DLT++EF+A T K +P++ S+ FKY
Sbjct: 67 VFKANVAFIESFNAGGKNRYWLGVNQFADLTSEEFKATMTNSKGFSTPNNGVRVSTGFKY 126
Query: 129 QNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQL 188
+N+S +P S+DWR K AVT IKDQ +CGCCWAFSAVAA+EG K+S LI LSEQ+L
Sbjct: 127 ENVSADALPASVDWRTKGAVTRIKDQGQCGCCWAFSAVAAMEGFVKLSTGKLISLSEQEL 186
Query: 189 VDCSTNGNN-GCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTC-SAAQKAAAAKISNY 246
VDC +GN+ GC GG ++ AF++I+ N G+ E YPY A G C + A AA I Y
Sbjct: 187 VDCDVDGNDQGCEGGEIDGAFQFILSNGGLTAEANYPYTAEDGRCKTTAAADVAASIRGY 246
Query: 247 EEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTE 306
E+VP+ DE +L+KAV+ QPVS+ + A ++F+ Y G+ G CGT LDH VT++G+G
Sbjct: 247 EDVPANDEPSLMKAVAGQPVSVAVDA--SKFQFYGGGVMAGECGTSLDHGVTVIGYGAAS 304
Query: 307 DGANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYP 346
DG YWL+KNSWG TWG+AGY+++ +D G+CG+ Q SYP
Sbjct: 305 DGTKYWLVKNSWGTTWGEAGYLRMEKDIDDKRGMCGLAMQPSYP 348
>gi|226508570|ref|NP_001141984.1| uncharacterized protein LOC100274134 precursor [Zea mays]
gi|194706676|gb|ACF87422.1| unknown [Zea mays]
gi|413920745|gb|AFW60677.1| vignain [Zea mays]
Length = 363
Score = 334 bits (856), Expect = 4e-89, Method: Compositional matrix adjust.
Identities = 156/330 (47%), Positives = 225/330 (68%), Gaps = 7/330 (2%)
Query: 22 IILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKA 81
+ V ++ V + E ++ ++KWMAQ+ R YKD+ EK RF++FK N E+I+++
Sbjct: 34 VAARVEPSTTVGRTTGGDEAMMMARYKKWMAQYRRKYKDDAEKAHRFQVFKANAEFIDRS 93
Query: 82 NKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPS--PS-HRSTTSSTFKYQNLSMTDVPT 138
N G + Y LGTN+F+DLT+ EF A+YTG + P+ PS + ++ KYQN + D
Sbjct: 94 NAGGKKKYVLGTNQFADLTSKEFAAMYTGLRKPAAVPSGAKQIPAAGSKYQNFTRLDDDV 153
Query: 139 SLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCS-TNGNN 197
+DWR + AVTP+K+Q +CGCCWAFSAV A+EG+ I+ NL+ LSEQQ++DC ++GN
Sbjct: 154 QVDWRQQGAVTPVKNQGQCGCCWAFSAVGAMEGLIMITTGNLVSLSEQQILDCDESDGNQ 213
Query: 198 GCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQAL 257
GC GG M+ AF+Y+I N G+ TED YPY AVQGTC Q AA IS ++++PSGDE AL
Sbjct: 214 GCNGGYMDNAFQYVINNGGVTTEDAYPYSAVQGTCQNVQPAAT--ISGFQDLPSGDENAL 271
Query: 258 LKAVSMQPVSIGIAAYTTEFKSYKEGIFNG-VCGTQLDHAVTIVGFGTTEDGANYWLIKN 316
AV+ QPVS+G+ ++ F+ Y+ GI++G CGT ++HAVT +G+G + G YW++KN
Sbjct: 272 ANAVANQPVSVGVDGGSSPFQFYQGGIYDGDGCGTDMNHAVTAIGYGADDQGTQYWILKN 331
Query: 317 SWGDTWGDAGYMKILRDEGLCGIGTQSSYP 346
SWG WG+ G+M++ G CGI T +SYP
Sbjct: 332 SWGTGWGENGFMQLQMGVGACGISTMASYP 361
>gi|356517308|ref|XP_003527330.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 342
Score = 333 bits (854), Expect = 6e-89, Method: Compositional matrix adjust.
Identities = 162/336 (48%), Positives = 222/336 (66%), Gaps = 8/336 (2%)
Query: 19 FIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYI 78
++I+ L+++ + V SR E E HEKWMAQ+GR YKD EKE RF++FK N+ +I
Sbjct: 9 YLILFLVLAVWTSHVMSRRLSEACTSERHEKWMAQYGRVYKDAAEKEKRFQVFKNNVHFI 68
Query: 79 EKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPT 138
E N G++ + L N+F+DL ++EF+AL + + ++T ++F+Y+ S+T +P
Sbjct: 69 ESFNAAGDKPFNLSINQFADLNDEEFKALLINVQKKASWVETSTETSFRYE--SVTKIPA 126
Query: 139 SLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNG 198
++DWR + AVTPIKDQ CG CWAFSAVAA EGI +I+ L+ LSEQ+LVDC + G
Sbjct: 127 TIDWRKRGAVTPIKDQGRCGSCWAFSAVAATEGIHQITTGKLVPLSEQELVDCVKGESEG 186
Query: 199 CGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKA-AAAKISNYEEVPSGDEQAL 257
C GG ++ AFE+I + GIA+E YPY+ V TC ++ A+I YE+VPS +E+AL
Sbjct: 187 CIGGYVDDAFEFIAKKGGIASETHYPYKGVNKTCKVKKETHGVAEIKGYEKVPSNNEKAL 246
Query: 258 LKAVSMQPVSIGIAAYTTEFKSYKEGIFNGV-CGTQLDHAVTIVGFGTTEDGANYWLIKN 316
LKAV+ QPVS+ I A T FK Y GIFN CGT +HAV +VG+G DG+ YWL+KN
Sbjct: 247 LKAVANQPVSVYIDAGTHAFKYYSSGIFNARNCGTDPNHAVAVVGYGKALDGSKYWLVKN 306
Query: 317 SWGDTWGDAGYMKILRD----EGLCGIGTQSSYPLA 348
SWG WG+ GY++I RD EGLCGI YP A
Sbjct: 307 SWGTEWGERGYIRIKRDIRAKEGLCGIAKYPYYPTA 342
>gi|356515056|ref|XP_003526217.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 342
Score = 333 bits (854), Expect = 7e-89, Method: Compositional matrix adjust.
Identities = 163/336 (48%), Positives = 222/336 (66%), Gaps = 8/336 (2%)
Query: 19 FIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYI 78
++I+ L++S + V SR E E HEKWMAQ+GR YKD EKE RF++FK N+ +I
Sbjct: 9 YLILFLVLSVWTSHVMSRRLSEACTSERHEKWMAQYGRVYKDAAEKEKRFQVFKNNVHFI 68
Query: 79 EKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPT 138
E N G++ + L N+F+DL ++EF+AL + + ++T ++F+Y+ S+T +P
Sbjct: 69 ESFNAAGDKPFNLSINQFADLNDEEFKALLINVQKKASWVETSTQTSFRYE--SVTKIPA 126
Query: 139 SLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNG 198
++DWR + AVTPIKDQ CG CWAFSAVAA EGI +I+ L+ LSEQ+LVDC + G
Sbjct: 127 TIDWRKRGAVTPIKDQGRCGSCWAFSAVAATEGIHQITTGKLVPLSEQELVDCVKGESEG 186
Query: 199 CGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKA-AAAKISNYEEVPSGDEQAL 257
C GG ++ AFE+I + GIA+E YPY+ V TC ++ A+I YE+VPS +E+AL
Sbjct: 187 CIGGYVDDAFEFIAKKGGIASETHYPYKGVNKTCKVKKETHGVAEIKGYEKVPSNNEKAL 246
Query: 258 LKAVSMQPVSIGIAAYTTEFKSYKEGIFN-GVCGTQLDHAVTIVGFGTTEDGANYWLIKN 316
LKAV+ QPVS+ I A T FK Y GIFN CGT +HAV +VG+G DG+ YWL+KN
Sbjct: 247 LKAVANQPVSVYIDAGTHAFKYYSSGIFNVRNCGTDPNHAVAVVGYGKALDGSKYWLVKN 306
Query: 317 SWGDTWGDAGYMKILRD----EGLCGIGTQSSYPLA 348
SWG WG+ GY++I RD EGLCGI YP A
Sbjct: 307 SWGTEWGERGYIRIKRDIRAKEGLCGIAKYPYYPTA 342
>gi|38345008|emb|CAD40026.2| OSJNBa0052O21.11 [Oryza sativa Japonica Group]
gi|125589414|gb|EAZ29764.1| hypothetical protein OsJ_13822 [Oryza sativa Japonica Group]
Length = 339
Score = 333 bits (854), Expect = 7e-89, Method: Compositional matrix adjust.
Identities = 154/336 (45%), Positives = 221/336 (65%), Gaps = 9/336 (2%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEY 77
+F I+ L C++ + + + + ++ HE+WMAQ+GR Y+D+ EK RF++FK N+ +
Sbjct: 8 LFAILGCLCLCSAVLAARELSDDAAMAARHERWMAQYGRVYRDDAEKARRFEVFKANVAF 67
Query: 78 IEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
IE N GN + LG N+F+DLTNDEFR + T + R T F+Y+N+++ +P
Sbjct: 68 IESFNA-GNHNFWLGVNQFADLTNDEFRWMKTNKGFIPSTTRVPTG--FRYENVNIDALP 124
Query: 138 TSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNG-N 196
++DWR K AVTPIKDQ +CGCCWAFSAVAA+EGI K+S LI LSEQ+LVDC +G +
Sbjct: 125 ATVDWRTKGAVTPIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLISLSEQELVDCDVHGED 184
Query: 197 NGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQA 256
GC GG M+ AF++II+N G+ TE YPY A C + + A+ I YE+VP+ +E A
Sbjct: 185 QGCEGGLMDDAFKFIIKNGGLTTESNYPYAAADDKCKSVSNSVAS-IKGYEDVPANNEAA 243
Query: 257 LLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKN 316
L+KAV+ QPVS+ + F+ YK G+ G CGT LDH + +G+G DG YWL+KN
Sbjct: 244 LMKAVANQPVSVAVDGGDMTFQFYKGGVMTGSCGTDLDHGIVAIGYGKASDGTKYWLLKN 303
Query: 317 SWGDTWGDAGYMKILRD----EGLCGIGTQSSYPLA 348
SWG TWG+ G++++ +D G+CG+ + SYP A
Sbjct: 304 SWGTTWGENGFLRMEKDISDKRGMCGLAMEPSYPTA 339
>gi|116309178|emb|CAH66275.1| OSIGBa0147O06.5 [Oryza sativa Indica Group]
Length = 339
Score = 333 bits (853), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 154/336 (45%), Positives = 220/336 (65%), Gaps = 9/336 (2%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEY 77
+F I+ L C++ + + + + ++ HE+WMAQ+GR YKD+ EK RF++FK N+ +
Sbjct: 8 LFAILGCLCLCSAVLAARELSDDAAMAARHERWMAQYGRMYKDDAEKARRFEVFKANVAF 67
Query: 78 IEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
IE N GN + LG N+F+DLTNDEFR+ T + R T F+Y+N+++ +P
Sbjct: 68 IESFNA-GNHKFWLGVNQFADLTNDEFRSTKTNKGFIPSTTRVPTG--FRYENVNIDALP 124
Query: 138 TSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNG-N 196
++DWR K VTPIKDQ +CGCCWAFSAVAA+EGI K+S LI LSEQ+LVDC +G +
Sbjct: 125 ATMDWRTKGVVTPIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLISLSEQELVDCDVHGED 184
Query: 197 NGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQA 256
GC GG M+ AF++II+N G+ TE YPY A C + + A+ I YE+VP+ +E A
Sbjct: 185 QGCEGGLMDDAFKFIIKNGGLTTESNYPYAAADDKCKSVSNSVAS-IKGYEDVPANNEAA 243
Query: 257 LLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKN 316
L+KAV+ QPVS+ + F+ YK G+ G CGT LDH + +G+G DG YWL+KN
Sbjct: 244 LMKAVANQPVSVAVDGGDMTFQFYKGGVMTGSCGTDLDHGIVAIGYGKASDGTKYWLLKN 303
Query: 317 SWGDTWGDAGYMKILRD----EGLCGIGTQSSYPLA 348
SWG TWG+ G++++ +D G+CG+ + SYP A
Sbjct: 304 SWGTTWGENGFLRMEKDISDKRGMCGLAMEPSYPTA 339
>gi|116309130|emb|CAH66233.1| H0825G02.10 [Oryza sativa Indica Group]
Length = 339
Score = 332 bits (851), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 158/336 (47%), Positives = 219/336 (65%), Gaps = 9/336 (2%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEY 77
+F I+ L C++ + + + ++V HE+WM Q+GR YKD EK RF+IFK N+ +
Sbjct: 8 LFAILSCLCLCSAVLAAREQSDHAAMVARHERWMEQYGRVYKDATEKARRFEIFKANVAF 67
Query: 78 IEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
IE N GN + L N+F+DLTN EFRA T + R T TF+Y+N+S+ +P
Sbjct: 68 IESFNA-GNHKFWLSVNQFADLTNYEFRATKTNKGFIPSTVRVPT--TFRYENVSIDTLP 124
Query: 138 TSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNG-N 196
++DWR K AVTPIKDQ +CGCCWAFSAVAA+EGI K+S LI LSEQ+LVDC +G +
Sbjct: 125 ATVDWRTKGAVTPIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLISLSEQELVDCDVHGED 184
Query: 197 NGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQA 256
GC GG M+ AF++II+N G+ TE +YPY A G C+ +AA I YE+VP+ +E A
Sbjct: 185 QGCEGGLMDDAFKFIIKNGGLTTESKYPYTAADGKCNGGSNSAAT-IKGYEDVPANNEAA 243
Query: 257 LLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKN 316
L+KAV+ QPVS+ + F+ Y G+ G CGT LDH + +G+G DG YWL+KN
Sbjct: 244 LMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIVAIGYGKDGDGTQYWLLKN 303
Query: 317 SWGDTWGDAGYMKILRD----EGLCGIGTQSSYPLA 348
SWG TWG+ G++++ +D G+CG+ + SYP A
Sbjct: 304 SWGTTWGENGFLRMEKDISDKRGMCGLAMEPSYPTA 339
>gi|357167196|ref|XP_003581047.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like
[Brachypodium distachyon]
Length = 338
Score = 332 bits (850), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 155/338 (45%), Positives = 220/338 (65%), Gaps = 10/338 (2%)
Query: 19 FIIIILLVSCASQVVSSRSTHEQS--VVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLE 76
F+ +++ + A + +R + + HE+WMA++GR Y D EK R ++FK N+
Sbjct: 3 FLFALVVCTFALGALGARDLADDDWLIAARHEQWMARYGRVYSDVAEKARRLEVFKANVG 62
Query: 77 YIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDV 136
+IE N GN + L N+F+D+T DEFRA++ GYKM ++ + F+Y N+S+ D+
Sbjct: 63 FIESVNA-GNHKFWLEANQFADITKDEFRAMHKGYKMQVIGSKARATG-FRYANVSIDDL 120
Query: 137 PTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTN-G 195
P S+DWR AVTP+KDQ +CGCCWAFS VA++EGI K+S LI LSEQ+LVDC
Sbjct: 121 PASVDWRANGAVTPVKDQGQCGCCWAFSTVASMEGIVKVSTGKLISLSEQELVDCDVGMQ 180
Query: 196 NNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKA-AAAKISNYEEVPSGDE 254
N GCGGG M+ AFE+I+ N G+ TE +YPY GTC++ +++ AA I YE+VP+ DE
Sbjct: 181 NKGCGGGLMDNAFEFIVNNGGLDTEADYPYTGADGTCNSNKESNIAASIKGYEDVPANDE 240
Query: 255 QALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLI 314
+L KAV+ QPVSI + F+ YK G+ G CGT+LDH V VG+G DG YWL+
Sbjct: 241 ASLQKAVAAQPVSIAVDGGDDLFRFYKGGVLTGACGTELDHGVAAVGYGVAGDGTKYWLV 300
Query: 315 KNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYPLA 348
KNSWG +WG+ G++++ RD G+CG+ + SYP A
Sbjct: 301 KNSWGTSWGEDGFIRLERDVADEAGMCGLAMKPSYPTA 338
>gi|125547236|gb|EAY93058.1| hypothetical protein OsI_14861 [Oryza sativa Indica Group]
Length = 339
Score = 332 bits (850), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 154/336 (45%), Positives = 220/336 (65%), Gaps = 9/336 (2%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEY 77
+F I+ L C++ + + + + ++ HE+WMAQ+GR Y+D+ EK RF++FK N+ +
Sbjct: 8 LFAILGCLCLCSAVLAARELSDDAAMAARHERWMAQYGRVYRDDAEKARRFEVFKANVAF 67
Query: 78 IEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
IE N GN + LG N+F+DLTNDEFR T + R T F+Y+N+++ +P
Sbjct: 68 IESFNA-GNHNFWLGVNQFADLTNDEFRWTKTNKGFIPSTTRVPTG--FRYENVNIDALP 124
Query: 138 TSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNG-N 196
++DWR K AVTPIKDQ +CGCCWAFSAVAA+EGI K+S LI LSEQ+LVDC +G +
Sbjct: 125 ATVDWRTKGAVTPIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLISLSEQELVDCDVHGED 184
Query: 197 NGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQA 256
GC GG M+ AF++II+N G+ TE YPY A C + + A+ I YE+VP+ +E A
Sbjct: 185 QGCEGGLMDDAFKFIIKNGGLTTESNYPYAAADDKCKSVSNSVAS-IKGYEDVPANNEAA 243
Query: 257 LLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKN 316
L+KAV+ QPVS+ + F+ YK G+ G CGT LDH + +G+G DG YWL+KN
Sbjct: 244 LMKAVANQPVSVAVDGGDMTFQFYKGGVMTGSCGTDLDHGIVAIGYGKASDGTKYWLLKN 303
Query: 317 SWGDTWGDAGYMKILRD----EGLCGIGTQSSYPLA 348
SWG TWG+ G++++ +D G+CG+ + SYP A
Sbjct: 304 SWGTTWGENGFLRMEKDISDKRGMCGLAMEPSYPTA 339
>gi|21666724|gb|AAM73806.1|AF448505_1 cysteine proteinase [Brassica napus]
gi|21666726|gb|AAM73807.1|AF448506_1 cysteine proteinase [Brassica napus]
Length = 343
Score = 332 bits (850), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 161/340 (47%), Positives = 224/340 (65%), Gaps = 7/340 (2%)
Query: 13 INTIPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFK 72
+ I +F+I+ L+ S + SR E ++ + H WM +HGR Y D EK R+ +FK
Sbjct: 3 LTQIQIFLIVSLVSSFSLSTTLSRPLDEVTMQKRHAAWMTEHGRVYADANEKNNRYVVFK 62
Query: 73 ENLEYIEKANK-EGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNL 131
N+E IE+ N+ + T+KL N+F+DLTN+EFR++YTGYK S T ++F+YQ++
Sbjct: 63 RNVESIERLNEVQYGLTFKLAVNQFADLTNEEFRSMYTGYKGNSVLSSRTKPTSFRYQHV 122
Query: 132 SMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDC 191
S +P S+DWR K AVTPIKDQ CG CWAFSAVAA+EG+ +I LI LSEQ+LVDC
Sbjct: 123 SSDALPISVDWRKKGAVTPIKDQGSCGSCWAFSAVAAIEGVAQIKKGKLISLSEQELVDC 182
Query: 192 STNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQ-KAAAAKISNYEEVP 250
TN ++GC GG M AF Y + G+ +E YPY++ GTC+ + K A I +E+VP
Sbjct: 183 DTN-DDGCMGGYMNSAFNYTMTTGGLTSESNYPYKSTDGTCNINKTKQIATSIKGFEDVP 241
Query: 251 SGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGAN 310
+ DE+AL+KAV+ PVSIGIA T F+ Y G+F+G C T LDH V +VG+G + +G+
Sbjct: 242 ANDEKALMKAVAHHPVSIGIAGGGTGFQFYSSGVFSGECSTHLDHGVAVVGYGKSSNGSK 301
Query: 311 YWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYP 346
YW++KNSWG WG+ GYM+I +D G CG+ +SYP
Sbjct: 302 YWILKNSWGPKWGERGYMRIKKDTKAKHGQCGLAMNASYP 341
>gi|356515040|ref|XP_003526209.1| PREDICTED: thiol protease SEN102-like [Glycine max]
Length = 342
Score = 331 bits (849), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 170/338 (50%), Positives = 223/338 (65%), Gaps = 12/338 (3%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEY 77
M + + L SQV+ R H+ ++ E HE WMA++G+ YKD EKE RF+IFK+N+E+
Sbjct: 10 MLALFLFLAVGISQVMP-RKLHQTALRERHENWMAEYGKMYKDAAEKEKRFQIFKDNVEF 68
Query: 78 IEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTS-STFKYQNLSMTDV 136
IE N GN+ YKLG N +DLT +EF+ G K +T + FKY+N+ TD+
Sbjct: 69 IESFNAAGNKPYKLGVNHLADLTLEEFKDSRNGLKRTYEFSTTTFKLNGFKYENV--TDI 126
Query: 137 PTSLDWRDKKAVTPIKDQ-QECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNG 195
P ++DWR K AVTPIKDQ +CG CWAFS +AA EGI +IS NL+ LSEQ+LVDC +
Sbjct: 127 PEAIDWRVKGAVTPIKDQGDQCGSCWAFSTIAATEGIHQISTGNLVSLSEQELVDCDSV- 185
Query: 196 NNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAA-AAKISNYEEVPSGDE 254
++GC GG ME FE+II+N GI +E YPY+ V GTC+ A+ A+I YE VPS E
Sbjct: 186 DDGCEGGFMEDGFEFIIKNGGITSETNYPYKGVDGTCNTTIAASPVAQIKGYEIVPSYSE 245
Query: 255 QALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLI 314
+AL KAV+ QPVS+ I A F Y GI+NG CGT LDH VT VG+G TE+G +YW++
Sbjct: 246 EALQKAVANQPVSVSIHATNATFMFYSSGIYNGECGTDLDHGVTAVGYG-TENGTDYWIV 304
Query: 315 KNSWGDTWGDAGYMKILR----DEGLCGIGTQSSYPLA 348
KNSWG WG+ GY+++ R G+CGI SSYP A
Sbjct: 305 KNSWGTQWGEKGYIRMHRGIAAKHGICGIALDSSYPTA 342
>gi|356545118|ref|XP_003540992.1| PREDICTED: thiol protease SEN102-like [Glycine max]
Length = 337
Score = 329 bits (844), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 168/335 (50%), Positives = 218/335 (65%), Gaps = 14/335 (4%)
Query: 20 IIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIE 79
I + LL++ + SR HE S+ E HE+WMA++G+ YKD EKE RF IFK N+E+IE
Sbjct: 11 IALFLLLALGIPQMMSRKLHETSMRERHEQWMAEYGKVYKDAAEKEKRFLIFKHNVEFIE 70
Query: 80 KANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTS 139
N N+ YKLG N +DLT +EF+A G K P +++ FKY+N+ T +P +
Sbjct: 71 SFNAAANKPYKLGVNHLADLTVEEFKASRNGLKRP----YELSTTPFKYENV--TAIPAA 124
Query: 140 LDWRDKKAVTPIKDQQEC-GCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNG-NN 197
+DWR K AVT IKDQ +C G CWAFS VAA EGI +I+ L+ LSEQ+LVDC T G +
Sbjct: 125 IDWRTKGAVTSIKDQGQCAGSCWAFSTVAATEGIHQITTGKLVSLSEQELVDCDTKGVDQ 184
Query: 198 GCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQAL 257
GC GG ME FE+II+N GI +E YPY+AV G C+ A + A+I YE+VP E+ L
Sbjct: 185 GCEGGYMEDGFEFIIKNGGITSEANYPYKAVDGKCNKAT-SPVAQIKGYEKVPPNSEKTL 243
Query: 258 LKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNS 317
KAV+ QPVS+ I A F Y GI+NG CGT+LDH VT VG+G +G +YWL+KNS
Sbjct: 244 QKAVANQPVSVSIDANGEGFMFYSSGIYNGECGTELDHGVTAVGYGIA-NGTDYWLVKNS 302
Query: 318 WGDTWGDAGYMKILR----DEGLCGIGTQSSYPLA 348
WG WG+ GY+++ R GLCGI SSYP A
Sbjct: 303 WGTQWGEKGYVRMQRGVAAKHGLCGIALDSSYPTA 337
>gi|146216004|gb|ABQ10204.1| cysteine protease Cp6 [Actinidia deliciosa]
Length = 461
Score = 329 bits (843), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 161/321 (50%), Positives = 222/321 (69%), Gaps = 8/321 (2%)
Query: 31 QVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYK 90
++ SSR+ E V+ M+E W+ +HG+SY EKE RF+IFK+NL +I++ N E +RTYK
Sbjct: 32 ELSSSRTDDE--VMAMYESWLVKHGKSYNAIGEKEKRFQIFKDNLRFIDEHNAE-SRTYK 88
Query: 91 LGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTP 150
+G NRF+DLTNDE+R++Y G + S ST + +Y ++ +P S+DWR+K AV
Sbjct: 89 VGLNRFADLTNDEYRSMYLGARTGSRRRLSTQKRSDRYVPVAGESLPDSVDWREKGAVVG 148
Query: 151 IKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEY 210
+KDQ CG CWAFS +AAVEGI +I +LI LSEQ+LVDC T+ N GC GG M+ AFE+
Sbjct: 149 VKDQGSCGSCWAFSTIAAVEGINQIVTGDLISLSEQELVDCDTSYNEGCNGGLMDYAFEF 208
Query: 211 IIQNQGIATEDEYPYQAVQGTCSAAQK-AAAAKISNYEEVPSGDEQALLKAVSMQPVSIG 269
II+N GI TE++YPY A G C +K A I +YE+VP +EQAL KAV+ QPVS+
Sbjct: 209 IIKNGGIDTEEDYPYNARDGRCDQYRKNAKVVTIDDYEDVPVNNEQALQKAVANQPVSVA 268
Query: 270 IAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMK 329
I A F+ Y+ G+F G CGT LDH VT VG+G TE+ +YW++KNSWG +WG++GY++
Sbjct: 269 IEASGMAFQFYESGVFTGNCGTALDHGVTAVGYG-TENSVDYWIVKNSWGSSWGESGYIR 327
Query: 330 ILRDEGL---CGIGTQSSYPL 347
+ R+ G CGI + SYP+
Sbjct: 328 MERNTGATGKCGIAVEPSYPI 348
>gi|147790682|emb|CAN61026.1| hypothetical protein VITISV_001146 [Vitis vinifera]
Length = 469
Score = 329 bits (843), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 165/352 (46%), Positives = 233/352 (66%), Gaps = 21/352 (5%)
Query: 14 NTIPMFIIIILLVSCAS----QVVSSRSTH--------EQSVVEMHEKWMAQHGRSYKDE 61
+++ +F+ ++L ++ AS ++ TH ++ V+ ++E W+A+HG+SY
Sbjct: 8 SSMAVFLFLLLGLASASAXDMSIIGYDETHGDKSSWRTDEDVMAVYEAWLAKHGKSYNAL 67
Query: 62 LEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRST 121
EKE RF+IFK+NL +I++ N E NRTYK+G NRF+DLTN+E+R++Y G + + RS+
Sbjct: 68 GEKERRFQIFKDNLRFIDEHNAE-NRTYKVGLNRFADLTNEEYRSMYLGTRTAA-KRRSS 125
Query: 122 TSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLI 181
+ +Y +P S+DWR K AV +KDQ CG CWAFS +AAVEGI KI LI
Sbjct: 126 NKISDRYAFRVGDSLPESVDWRKKGAVVEVKDQGSCGSCWAFSTIAAVEGINKIVTGGLI 185
Query: 182 QLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQK-AAA 240
LSEQ+LVDC T+ N GC GG M+ AFE+II N GI +E++YPY+A G C +K A
Sbjct: 186 SLSEQELVDCDTSYNEGCNGGLMDYAFEFIINNGGIDSEEDYPYKASDGRCDQYRKNAXV 245
Query: 241 AKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIV 300
I YE+VP DE++L KAV+ QPVS+ I A EF+ Y+ GIF G CGT LDH VT V
Sbjct: 246 VTIDGYEDVPENDEKSLEKAVANQPVSVAIEAGGREFQLYQSGIFTGRCGTALDHGVTAV 305
Query: 301 GFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD-----EGLCGIGTQSSYPL 347
G+G TE+G +YW++KNSWG +WG+ GY+++ RD G CGI ++SYP+
Sbjct: 306 GYG-TENGVDYWIVKNSWGASWGEEGYIRMERDLATSATGKCGIAMEASYPI 356
>gi|5823020|gb|AAD53012.1|AF089849_1 senescence-specific cysteine protease [Brassica napus]
Length = 344
Score = 328 bits (842), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 162/341 (47%), Positives = 224/341 (65%), Gaps = 8/341 (2%)
Query: 13 INTIPMFIIIILLVSCASQVVSSRST-HEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIF 71
+ I +F+I+ L+ S + + SR E ++ + H +WM +HGR Y D EK R+ +F
Sbjct: 3 LTQIQIFLIVSLVSSFSLSITLSRPLLDEVAMQKRHAEWMTEHGRVYADANEKNNRYAVF 62
Query: 72 KENLEYIEKANK-EGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQN 130
K N+E IE+ N + T+KL N+F+DLTN+EFR++YTG+K S T ++F+YQN
Sbjct: 63 KRNVERIERLNDVQSGLTFKLAVNQFADLTNEEFRSMYTGFKGNSVLSSRTKPTSFRYQN 122
Query: 131 LSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVD 190
+S +P S+DWR K AVTPIKDQ CG CWAFSAVAA+EG+ +I LI LSEQ+LVD
Sbjct: 123 VSSDALPVSVDWRKKGAVTPIKDQGLCGSCWAFSAVAAIEGVAQIKKGKLISLSEQELVD 182
Query: 191 CSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQ-KAAAAKISNYEEV 249
C TN + GC GG M+ AF Y I G+ +E YPY++ GTC+ + K A I +E+V
Sbjct: 183 CDTN-DGGCMGGLMDTAFNYTITIGGLTSESNYPYKSTNGTCNFNKTKQIATSIKGFEDV 241
Query: 250 PSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGA 309
P+ DE+AL+KAV+ PVSIGIA F+ Y G+F+G C T LDH VT VG+G +++G
Sbjct: 242 PANDEKALMKAVAHHPVSIGIAGGDIGFQFYSSGVFSGECTTHLDHGVTAVGYGRSKNGL 301
Query: 310 NYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYP 346
YW++KNSWG WG+ GYM+I +D G CG+ +SYP
Sbjct: 302 KYWILKNSWGPKWGERGYMRIKKDIKPKHGQCGLAMNASYP 342
>gi|225458701|ref|XP_002284973.1| PREDICTED: cysteine proteinase RD21a-like [Vitis vinifera]
Length = 467
Score = 328 bits (842), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 166/348 (47%), Positives = 228/348 (65%), Gaps = 21/348 (6%)
Query: 18 MFIIIILLVSCAS----QVVSSRSTH--------EQSVVEMHEKWMAQHGRSYKDELEKE 65
M + + LL+ AS ++ TH ++ V+ ++E W+A+HG+SY EKE
Sbjct: 10 MAVFLFLLLGLASALDMSIIGYDETHGDKSSWRTDEDVMAVYEAWLAKHGKSYNALGEKE 69
Query: 66 MRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSST 125
RF+IFK+NL +I++ N E NRTYK+G NRF+DLTN+E+R++Y G + + RS+ +
Sbjct: 70 RRFQIFKDNLRFIDEHNAE-NRTYKVGLNRFADLTNEEYRSMYLGTRTAA-KRRSSNKIS 127
Query: 126 FKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSE 185
+Y +P S+DWR K AV +KDQ CG CWAFS +AAVEGI KI LI LSE
Sbjct: 128 DRYAFRVGDSLPESVDWRKKGAVVEVKDQGSCGSCWAFSTIAAVEGINKIVTGGLISLSE 187
Query: 186 QQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQK-AAAAKIS 244
Q+LVDC T+ N GC GG M+ AFE+II N GI +E++YPY+A G C +K A I
Sbjct: 188 QELVDCDTSYNEGCNGGLMDYAFEFIINNGGIDSEEDYPYKASDGRCDQYRKNAKVVTID 247
Query: 245 NYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGT 304
YE+VP DE++L KAV+ QPVS+ I A EF+ Y+ GIF G CGT LDH VT VG+G
Sbjct: 248 GYEDVPENDEKSLEKAVANQPVSVAIEAGGREFQLYQSGIFTGRCGTALDHGVTAVGYG- 306
Query: 305 TEDGANYWLIKNSWGDTWGDAGYMKILRD-----EGLCGIGTQSSYPL 347
TE+G +YW++KNSWG +WG+ GY+++ RD G CGI ++SYP+
Sbjct: 307 TENGVDYWIVKNSWGASWGEEGYIRMERDLATSATGKCGIAMEASYPI 354
>gi|357160599|ref|XP_003578815.1| PREDICTED: vignain-like [Brachypodium distachyon]
Length = 339
Score = 328 bits (841), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 156/341 (45%), Positives = 220/341 (64%), Gaps = 9/341 (2%)
Query: 13 INTIPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFK 72
I + I+ L C+S + + + S+ HE WMAQ+GR YKD EK +F++FK
Sbjct: 3 IPKASILAILGCLCFCSSVLAARELNDDLSMAARHETWMAQYGRVYKDAAEKAQKFEVFK 62
Query: 73 ENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
N +I+ N E N + LG N+F+DLTN+EF+A T S +++ S+ FKY+NL
Sbjct: 63 ANARFIDSFNAE-NHKFWLGINQFADLTNEEFKATKTNKGFIS--NKARVSTGFKYENLK 119
Query: 133 MTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCS 192
+ +PTS+DWR K AVTP+KDQ +CGCCWAFSAVAA EGI K+S L+ LSEQ+LVDC
Sbjct: 120 IEALPTSIDWRTKGAVTPVKDQGQCGCCWAFSAVAATEGIVKLSTGKLVSLSEQELVDCD 179
Query: 193 TNG-NNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPS 251
+G + GC GG M+ AF++II N G+ E YPY A G C + K+A I +YE+VP+
Sbjct: 180 VHGEDQGCEGGLMDDAFKFIITNGGLTQESSYPYDAEDGKCKSGSKSAGT-IKSYEDVPA 238
Query: 252 GDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANY 311
+E AL+KAV+ QPVS+ + F+ Y G+ G CGT LDH + +G+G T DG +
Sbjct: 239 NNEGALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYGVTSDGTKF 298
Query: 312 WLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYPLA 348
WL+KNSWG TWG+ G++++ +D +G+CG+ + SYP A
Sbjct: 299 WLMKNSWGTTWGENGFLRMEKDIADKKGMCGLAMEPSYPTA 339
>gi|356533293|ref|XP_003535200.1| PREDICTED: LOW QUALITY PROTEIN: cysteine proteinase RD21a-like
[Glycine max]
Length = 466
Score = 327 bits (839), Expect = 3e-87, Method: Compositional matrix adjust.
Identities = 155/316 (49%), Positives = 215/316 (68%), Gaps = 9/316 (2%)
Query: 40 EQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDL 99
E ++E W+ +HG++Y EKE RFKIFK+NL +IE+ N G+++YKLG N+F+DL
Sbjct: 41 ESHTRHVYEAWLVKHGKAYNALGEKERRFKIFKDNLRFIEEHNGAGDKSYKLGLNKFADL 100
Query: 100 TNDEFRALYTGYKMPSPSHRSTTSS--TFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQEC 157
TN+E+RA++ G + P +++ + T +Y + ++P +DWR+K AVTPIKDQ +C
Sbjct: 101 TNEEYRAMFLGTRTRGPKNKAAVVAKKTDRYAYRAGEELPAMVDWREKGAVTPIKDQGQC 160
Query: 158 GCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGI 217
G CWAFS V AVEGI +I NL LSEQ+LVDC N GC GG M+ AFE+I+QN GI
Sbjct: 161 GSCWAFSTVGAVEGINQIVTGNLTSLSEQELVDCDRGYNMGCNGGLMDYAFEFIVQNGGI 220
Query: 218 ATEDEYPYQAVQGTCSAAQK-AAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTE 276
TE++YPY A TC +K A I YE+VP+ DE++L+KAV+ QPVS+ I A E
Sbjct: 221 DTEEDYPYHAKDNTCDPNRKNARVVTIDGYEDVPTNDEKSLMKAVANQPVSVAIEAGGME 280
Query: 277 FKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD--- 333
F+ Y+ G+F G CGT LDH V VG+G TE+G +YWL++NSWG WG+ GY+K+ R+
Sbjct: 281 FQLYQSGVFTGRCGTNLDHGVVAVGYG-TENGTDYWLVRNSWGSAWGENGYIKLERNVQN 339
Query: 334 --EGLCGIGTQSSYPL 347
G CGI ++SYP+
Sbjct: 340 TETGKCGIAIEASYPI 355
>gi|357452869|ref|XP_003596711.1| Cysteine proteinase [Medicago truncatula]
gi|355485759|gb|AES66962.1| Cysteine proteinase [Medicago truncatula]
Length = 344
Score = 327 bits (839), Expect = 4e-87, Method: Compositional matrix adjust.
Identities = 167/345 (48%), Positives = 221/345 (64%), Gaps = 21/345 (6%)
Query: 14 NTIPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKE 73
N + +F I+ L S V+SSR ++E HE+WM +HG+ YKD EKE RF+IFKE
Sbjct: 11 NILTLFFILTLWTSL---VISSR------LLEKHEQWMEEHGKFYKDAAEKEQRFQIFKE 61
Query: 74 NLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALY-TGYKMPSPS---HRSTTSSTFKYQ 129
NLE+IE N G+ + L N+F D TNDEF+A Y G K P S F+Y+
Sbjct: 62 NLEFIESFNAAGDNGFNLSINQFGDQTNDEFKANYLNGKKKPLIGVGIAAIEEESVFRYE 121
Query: 130 NLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLV 189
N+ T+VP ++DWR++ AVTPIK Q CG CWAF+ VAA+EGI +I+ L+ LSEQ+LV
Sbjct: 122 NV--TEVPATMDWRERGAVTPIKHQHLCGSCWAFATVAAIEGIHQITTGRLVSLSEQELV 179
Query: 190 DC-STNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKA-AAAKISNYE 247
DC TN +GC GG +E A ++I++ GI +E YPY V G C+ + AKI YE
Sbjct: 180 DCVKTNTTDGCNGGYVEDACDFIVKKGGITSETNYPYTRVDGKCNVRKGTYNVAKIKGYE 239
Query: 248 EVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTED 307
VP+ +E+ALLKAV+ QP+++ IAA F+ Y GI G CG LDH VTIVG+GT++D
Sbjct: 240 HVPANNEKALLKAVANQPIAVYIAATKRAFQFYSSGILKGKCGIDLDHTVTIVGYGTSDD 299
Query: 308 GANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYPLA 348
G YWL+KNSWG WG+ GY+KI RD EG CGI +YP+
Sbjct: 300 GVKYWLVKNSWGTKWGEKGYIKIKRDVHAKEGSCGIAMVPTYPIV 344
>gi|356515052|ref|XP_003526215.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 339
Score = 327 bits (839), Expect = 4e-87, Method: Compositional matrix adjust.
Identities = 158/333 (47%), Positives = 218/333 (65%), Gaps = 9/333 (2%)
Query: 19 FIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYI 78
++I+ L+++ + V SR E E HEKWMAQ+G+ Y D EKE RF+IFK N+++I
Sbjct: 9 YLILFLILTVWTFHVMSRRLSEVCTSERHEKWMAQYGKLYTDAAEKEKRFQIFKNNVQFI 68
Query: 79 EKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPT 138
E N G++ + L N+F+DL N+EF+A + + T ++F+Y+ S+T +P
Sbjct: 69 ESFNAAGDKPFNLSINQFADLHNEEFKASLINVQKKESGVETATETSFRYE--SITKIPV 126
Query: 139 SLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNG 198
++DWR + AVTPIKDQ CG CWAFS VAA+EGI +I+ L+ LSEQ+LVDC + G
Sbjct: 127 TMDWRKRGAVTPIKDQGNCGSCWAFSTVAAIEGIHQITTGKLVSLSEQELVDCVKGKSEG 186
Query: 199 CGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKA-AAAKISNYEEVPSGDEQAL 257
C G E+AFE++ +N G+A+E YPY+A TC ++ A+I YE VPS E+AL
Sbjct: 187 CNFGYKEEAFEFVAKNGGLASEISYPYKANNKTCMVKKETQGVAQIKGYENVPSNSEKAL 246
Query: 258 LKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNS 317
LKAV+ QPVS+ I A +F Y GIF G CGT +HAVT++G+G GA YWL+KNS
Sbjct: 247 LKAVANQPVSVYIDAGALQF--YSSGIFTGKCGTAPNHAVTVIGYGKARGGAKYWLVKNS 304
Query: 318 WGDTWGDAGYMKILRD----EGLCGIGTQSSYP 346
WG WG+ GY+K+ RD EGLCGI T +SYP
Sbjct: 305 WGTKWGEKGYIKMKRDIRAKEGLCGIATNASYP 337
>gi|356515046|ref|XP_003526212.1| PREDICTED: thiol protease SEN102-like [Glycine max]
Length = 342
Score = 327 bits (838), Expect = 5e-87, Method: Compositional matrix adjust.
Identities = 169/338 (50%), Positives = 222/338 (65%), Gaps = 12/338 (3%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEY 77
M + + L SQV+ R H+ ++ E HE WMA++G+ YKD EKE RF+IFK+N+E+
Sbjct: 10 MLALFLFLAVGISQVMP-RKLHQTALRERHENWMAEYGKMYKDAAEKEKRFQIFKDNVEF 68
Query: 78 IEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTS-STFKYQNLSMTDV 136
IE N GN+ YKLG N +DLT +EF+ G K +T + FKY+N+ TD+
Sbjct: 69 IESFNAAGNKPYKLGVNHLADLTLEEFKDSRNGLKRTYEFSTTTFKLNGFKYENV--TDI 126
Query: 137 PTSLDWRDKKAVTPIKDQ-QECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNG 195
P ++DWR K AVTPIKDQ +CG WAFS +AA EGI +IS NL+ LSEQ+LVDC +
Sbjct: 127 PEAIDWRVKGAVTPIKDQGDQCGRFWAFSTIAATEGIHQISTGNLVSLSEQELVDCDSV- 185
Query: 196 NNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAA-AAKISNYEEVPSGDE 254
++GC GG ME FE+II+N GI +E YPY+ V GTC+ A+ A+I YE VPS E
Sbjct: 186 DDGCEGGFMEDGFEFIIKNGGITSETNYPYKGVDGTCNTTIAASPVAQIKGYEIVPSYSE 245
Query: 255 QALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLI 314
+AL KAV+ QPVS+ I A F Y GI+NG CGT LDH VT VG+G TE+G +YW++
Sbjct: 246 EALKKAVANQPVSVSIHATNATFMFYSSGIYNGECGTDLDHGVTAVGYG-TENGTDYWIV 304
Query: 315 KNSWGDTWGDAGYMKILR----DEGLCGIGTQSSYPLA 348
KNSWG WG+ GY+++ R G+CGI SSYP A
Sbjct: 305 KNSWGTQWGEKGYIRMHRGIAAKHGICGIALDSSYPTA 342
>gi|356517310|ref|XP_003527331.1| PREDICTED: vignain-like [Glycine max]
Length = 342
Score = 327 bits (838), Expect = 6e-87, Method: Compositional matrix adjust.
Identities = 160/336 (47%), Positives = 221/336 (65%), Gaps = 8/336 (2%)
Query: 19 FIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYI 78
++I+ L+++ + V SR E E HEKWMAQ+GR YKD EKE RF++FK N+ +I
Sbjct: 9 YLILFLVLAVWTSHVMSRRLSEACTSERHEKWMAQYGRVYKDAAEKEKRFQVFKNNVHFI 68
Query: 79 EKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPT 138
E N G++ + L N+F+DL ++EF+AL + + ++T ++F+Y+ S+T +P
Sbjct: 69 ESFNAAGDKPFNLSINQFADLNDEEFKALLINVQKKASWVETSTETSFRYE--SVTKIPA 126
Query: 139 SLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNG 198
++D R + AVTPIKDQ CG CWAFSAVAA EGI +I+ L+ LSEQ+LVDC + G
Sbjct: 127 TIDRRKRGAVTPIKDQGRCGSCWAFSAVAATEGIHQITTGKLVPLSEQELVDCVKGESEG 186
Query: 199 CGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKA-AAAKISNYEEVPSGDEQAL 257
C GG ++ AFE+I + GIA+E YPY+ V TC ++ A+I YE+VPS +E+AL
Sbjct: 187 CIGGYVDDAFEFIAKKGGIASETHYPYKGVNKTCKVKKETHGVAEIKGYEKVPSNNEKAL 246
Query: 258 LKAVSMQPVSIGIAAYTTEFKSYKEGIFNGV-CGTQLDHAVTIVGFGTTEDGANYWLIKN 316
LKAV+ QPVS+ I A T FK Y GIFN CGT +HAV +VG+G D + YWL+KN
Sbjct: 247 LKAVANQPVSVYIDAGTHAFKYYSSGIFNARNCGTDPNHAVAVVGYGKALDDSKYWLVKN 306
Query: 317 SWGDTWGDAGYMKILRD----EGLCGIGTQSSYPLA 348
SWG WG+ GY++I RD EGLCGI YP+A
Sbjct: 307 SWGTEWGERGYIRIKRDIRAKEGLCGIAKYPYYPIA 342
>gi|310656790|gb|ADP02219.1| Peptidase_C1 domain-containing protein [Triticum aestivum]
Length = 419
Score = 327 bits (837), Expect = 6e-87, Method: Compositional matrix adjust.
Identities = 155/323 (47%), Positives = 213/323 (65%), Gaps = 8/323 (2%)
Query: 16 IPMFIIIILLVS---CASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFK 72
IP +++ ++ S C+S V+S+R + ++VE HE+WMA+ R YKD EK RFK FK
Sbjct: 3 IPKALLLAIIGSICLCSSTVLSARELGDAAMVEKHEQWMAKFNRVYKDSTEKAQRFKAFK 62
Query: 73 ENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
N+ +IE N GN + LG N+F+DLTNDEFRA T + R+ T FKY N+S
Sbjct: 63 ANVAFIESFN-TGNHKFWLGVNQFTDLTNDEFRATKTNKGLKRNGARAPTR--FKYNNVS 119
Query: 133 MTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCS 192
+P ++DWR K VTPIKDQ +CGCCWAFSAVAA EGI K+S L+ LSEQ+LVDC
Sbjct: 120 TDALPAAVDWRTKGVVTPIKDQGQCGCCWAFSAVAATEGIVKLSTGKLVSLSEQELVDCD 179
Query: 193 TNG-NNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKA-AAAKISNYEEVP 250
+G + GC GG M+ AF++II+N G+ TE YPY A G C + + + A I YE+VP
Sbjct: 180 VHGVDQGCEGGEMDNAFKFIIKNGGLTTEANYPYTAQDGQCKTSTTSNSVATIKGYEDVP 239
Query: 251 SGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGAN 310
+ DE +L+KAV+ QPVS+ + F+ Y G+ G CGT LDH + +G+G T DG
Sbjct: 240 ANDESSLMKAVANQPVSVAVDGGDVIFQHYSGGVMTGSCGTDLDHGIVAIGYGMTSDGTK 299
Query: 311 YWLIKNSWGDTWGDAGYMKILRD 333
+WL+KNSWG TWG++GY+++ +D
Sbjct: 300 FWLLKNSWGTTWGESGYLRMEKD 322
>gi|212275830|ref|NP_001130503.1| cysteine protease 1 [Zea mays]
gi|194689328|gb|ACF78748.1| unknown [Zea mays]
gi|219886279|gb|ACL53514.1| unknown [Zea mays]
gi|238010470|gb|ACR36270.1| unknown [Zea mays]
gi|413920875|gb|AFW60807.1| cysteine protease 1 [Zea mays]
Length = 354
Score = 326 bits (836), Expect = 8e-87, Method: Compositional matrix adjust.
Identities = 162/344 (47%), Positives = 233/344 (67%), Gaps = 17/344 (4%)
Query: 16 IPMFIIIILLVSCASQVVSSRSTH---EQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFK 72
+ + I+ + + ++ +SS ST E+++ H++WMA+HGR+Y+DE EK RF++FK
Sbjct: 17 VALTILAVTTMMAEARDLSSTSTGGYGEEAMKVRHQQWMAEHGRTYRDEAEKAHRFQVFK 76
Query: 73 ENLEYIEKANKEGN--RTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQN 130
N ++++ +N G+ ++Y+L N F+D+TNDEF A+YTG + P P+ + FKY N
Sbjct: 77 ANADFVDASNAAGDDKKSYRLELNEFADMTNDEFMAMYTGLR-PVPAGAKKMAG-FKYGN 134
Query: 131 LSMTDVPT---SLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQ 187
++++D ++DWR K AVT IK+Q +CGCCWAF+AVAAVEGI +I+ NL+ LSEQQ
Sbjct: 135 VTLSDADDDQQTVDWRQKGAVTGIKNQGQCGCCWAFAAVAAVEGIHQITTGNLVSLSEQQ 194
Query: 188 LVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYE 247
++DC T+GNNGC GG ++ AF+YI+ N G+ TED YPY A Q C + Q AA IS Y+
Sbjct: 195 VLDCDTDGNNGCNGGYIDNAFQYIVGNGGLGTEDAYPYTAAQAMCQSVQPVAA--ISGYQ 252
Query: 248 EVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGV-CGT--QLDHAVTIVGFGT 304
+VPSGDE AL AV+ QPVS+ I A+ F+ Y G+ C T L+HAVT VG+GT
Sbjct: 253 DVPSGDEAALAAAVANQPVSVAIDAHN--FQLYGGGVMTAASCSTPPNLNHAVTAVGYGT 310
Query: 305 TEDGANYWLIKNSWGDTWGDAGYMKILRDEGLCGIGTQSSYPLA 348
EDG YWL+KN WG WG+ GY+++ R CG+ Q+SYP+A
Sbjct: 311 AEDGTPYWLLKNQWGQNWGEGGYLRLERGANACGVAQQASYPVA 354
>gi|356543114|ref|XP_003540008.1| PREDICTED: LOW QUALITY PROTEIN: KDEL-tailed cysteine endopeptidase
CEP1-like [Glycine max]
Length = 343
Score = 326 bits (836), Expect = 8e-87, Method: Compositional matrix adjust.
Identities = 168/331 (50%), Positives = 223/331 (67%), Gaps = 17/331 (5%)
Query: 28 CASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNR 87
C SQV SR H+ S+ E HE+WM ++G+ YKD E + RF IF+ N+E+IE N GN+
Sbjct: 20 CTSQV-KSRKLHDASMYERHEQWMEKYGKVYKDSAEMQKRFLIFENNVEFIESFNAAGNK 78
Query: 88 TYKLGTNRFSDLTNDEFRALYTGYKMPSPSH----RSTTSSTFKYQNLSMTDVPTSLDWR 143
YKL N +D TN+EF A + GYK SH R TT + FKY+N+ TD+P ++DWR
Sbjct: 79 PYKLSINHLADQTNEEFMASHKGYK---GSHWQGLRITTQTPFKYENV--TDIPWAVDWR 133
Query: 144 DKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGT 203
K VT IKDQ +CG CWAFSAVAA EGI +I+ NL+ LSE++LVDC + ++GC GG
Sbjct: 134 QKGDVTSIKDQAQCGNCWAFSAVAATEGIYQITTGNLVSLSEKELVDCDSV-DHGCDGGL 192
Query: 204 MEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAA-AAKISNYEEVPSGDEQALLKAVS 262
ME FE+II+N GI++E YPY AV GTC ++A+ A+I+ YE VP E+ L KAV+
Sbjct: 193 MEHGFEFIIKNGGISSEANYPYTAVNGTCDTNKEASPVAQITGYETVPVNCEEELQKAVA 252
Query: 263 MQ-PVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDT 321
Q +S+ I A + F+ Y G+F G CGTQLDH VT VG+G+T+ G YW++KNSWG
Sbjct: 253 NQLTMSVSIDAGGSAFQFYPSGVFTGQCGTQLDHGVTAVGYGSTDYGTQYWIVKNSWGTQ 312
Query: 322 WGDAGYMKILR----DEGLCGIGTQSSYPLA 348
WG+ GY+++LR EGLCGI +SYP A
Sbjct: 313 WGEEGYIRMLRGIDAQEGLCGIAMDASYPTA 343
>gi|116786779|gb|ABK24233.1| unknown [Picea sitchensis]
Length = 463
Score = 326 bits (835), Expect = 1e-86, Method: Compositional matrix adjust.
Identities = 151/314 (48%), Positives = 219/314 (69%), Gaps = 8/314 (2%)
Query: 40 EQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDL 99
+ +++E++E W+AQH ++Y EK+ RF +FK+N YI + N +GN +YKLG N+F+DL
Sbjct: 37 DDAIMELYELWLAQHKKAYNGLGEKQNRFSVFKDNFLYIHQHNNQGNPSYKLGLNQFADL 96
Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGC 159
+++EF+A Y G K+ + R + S + +YQ D+P S+DWR+K AVT +KDQ CG
Sbjct: 97 SHEEFKATYLGAKLDT-KKRLSNSPSPRYQYSDGEDLPESIDWREKGAVTAVKDQGSCGS 155
Query: 160 CWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIAT 219
CWAFS VAAVEGI +I NL LSEQ+LVDC T+ N GC GG M+ AF++II N G+ +
Sbjct: 156 CWAFSTVAAVEGINQIVTGNLTSLSEQELVDCDTSYNQGCNGGLMDYAFQFIINNGGLDS 215
Query: 220 EDEYPYQAVQGTCSAAQK-AAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFK 278
ED+YPY+A G+C A +K A I +YE+VP DE++L KA + QP+S+ I A F+
Sbjct: 216 EDDYPYKANDGSCDAYRKNAHVVTIDDYEDVPENDEKSLKKAAANQPISVAIEASGRAFQ 275
Query: 279 SYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD----- 333
Y+ G+F CGTQLDH VT+VG+G +E G +YW++KNSWG +WG+ G++++ R+
Sbjct: 276 FYESGVFTSTCGTQLDHGVTLVGYG-SESGTDYWIVKNSWGKSWGEKGFIRLQRNIEGVS 334
Query: 334 EGLCGIGTQSSYPL 347
G+CGI ++SYPL
Sbjct: 335 TGMCGIAMEASYPL 348
>gi|449448298|ref|XP_004141903.1| PREDICTED: germination-specific cysteine protease 1-like [Cucumis
sativus]
gi|449531757|ref|XP_004172852.1| PREDICTED: germination-specific cysteine protease 1-like [Cucumis
sativus]
Length = 365
Score = 325 bits (834), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 158/341 (46%), Positives = 225/341 (65%), Gaps = 13/341 (3%)
Query: 15 TIPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKEN 74
T + ++ +S ++ +S RS E V E+++ W+A+HG++Y E+E RF+IFKEN
Sbjct: 5 TTSLALLSFFFLSISASALSRRSDGE--VREIYDLWLAKHGKAYNGIDEREKRFQIFKEN 62
Query: 75 LEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTF--KYQNLS 132
L++I+ N E NRTYK+G N F+DLTN+E+RALY G + P P+ R + T +Y +
Sbjct: 63 LKFIDDHNSE-NRTYKVGLNMFADLTNEEYRALYLGTRSP-PARRVMKAKTASRRYAVNN 120
Query: 133 MTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCS 192
+ +P S+DWR + AV P+K+Q CG CWAFS +AAVEGI +I LI LSEQ+LV C
Sbjct: 121 LDRLPESMDWRTRGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGELISLSEQELVSCD 180
Query: 193 TNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQK-AAAAKISNYEEVPS 251
N+GC GG M+ AF++II N G+ TE++YPY+A G C +K A I YE+VP+
Sbjct: 181 KKYNSGCNGGLMDYAFQFIIDNGGLDTEEDYPYEAFDGQCDPTRKNAKVVSIDAYEDVPA 240
Query: 252 GDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANY 311
DE++L KAV+ QPVS+ I A + Y+ G+F G CG+ LDH V VG+G E+G +Y
Sbjct: 241 NDEESLKKAVAHQPVSVAIEASGLALQLYQSGVFTGKCGSALDHGVVAVGYG-KENGVDY 299
Query: 312 WLIKNSWGDTWGDAGYMKILRD-----EGLCGIGTQSSYPL 347
WL++NSWG +WG+ GY K+ R+ EG CGI Q+SYP+
Sbjct: 300 WLVRNSWGTSWGEDGYFKLERNVKHITEGKCGIAMQASYPV 340
>gi|302143412|emb|CBI21973.3| unnamed protein product [Vitis vinifera]
Length = 320
Score = 325 bits (833), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 166/336 (49%), Positives = 226/336 (67%), Gaps = 33/336 (9%)
Query: 19 FIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYI 78
++ +L + ASQ ++R+ HE S+ E HE WM Q+GR YKD EK R+KIFK+N+ I
Sbjct: 12 LALLFVLAAWASQA-TARNLHEASMYERHEDWMVQYGREYKDADEKSKRYKIFKDNVARI 70
Query: 79 EKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHR-STTSSTFKYQNLSMTDVP 137
E NK +++YKL N F+DLTN+EFRA +K +H ST +++FKY+N+ T VP
Sbjct: 71 ESFNKAMDKSYKLSINEFADLTNEEFRASRNRFK----AHICSTEATSFKYENV--TAVP 124
Query: 138 TSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNN 197
+++DWR K AVTPIKDQ +CG CWAFSAVAA+EGIT++S LI LSEQ+LVDC T+G
Sbjct: 125 STVDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSG-- 182
Query: 198 GCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKA-AAAKISNYEEVPSGDEQA 256
++QG YPY GTC+ + A AAKI+ YE+VP+ +E+A
Sbjct: 183 ---------------EDQGCTN---YPYAGTDGTCNRKKAAHPAAKINGYEDVPANNEKA 224
Query: 257 LLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKN 316
L KAV+ QP+++ I A +EF+ Y G+F G CGT+LDH V+ VG+GT++DG YWL+KN
Sbjct: 225 LQKAVAHQPIAVAIDAGGSEFQFYSSGVFTGQCGTELDHGVSAVGYGTSDDGMKYWLVKN 284
Query: 317 SWGDTWGDAGYMKILRD----EGLCGIGTQSSYPLA 348
SWG WG+ GY+++ RD EGLCGI Q+SYP A
Sbjct: 285 SWGTGWGEEGYIRMQRDVTAKEGLCGIAMQASYPTA 320
>gi|297851332|ref|XP_002893547.1| peptidase C1A papain family protein [Arabidopsis lyrata subsp.
lyrata]
gi|297339389|gb|EFH69806.1| peptidase C1A papain family protein [Arabidopsis lyrata subsp.
lyrata]
Length = 345
Score = 325 bits (833), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 159/326 (48%), Positives = 222/326 (68%), Gaps = 8/326 (2%)
Query: 30 SQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTY 89
S+ S + HE ++ H+KWM R Y DE EK+MR ++F ENL++IE N G+++Y
Sbjct: 21 SEATSRVALHEPTIFYYHQKWMINFSRVYDDEFEKQMRLEVFTENLKFIENFNNMGSQSY 80
Query: 90 KLGTNRFSDLTNDEFRALYTGYK-MPSPSHRSTTSSTFKYQNLSMTDV-PTSLDWRDKKA 147
KLG N+F+D T +EF A +TG + S + T N +++DV T+ DWR++ A
Sbjct: 81 KLGVNKFTDWTKEEFLATHTGLSGINVTSPFEVVNETTPAWNWTVSDVLGTTKDWRNEGA 140
Query: 148 VTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKA 207
VTP+K Q ECG CWAFSA+AAVEG+TKI+ NLI LSEQQL+DC+ NNGC GGTM +A
Sbjct: 141 VTPVKYQGECGGCWAFSAIAAVEGLTKIARGNLISLSEQQLLDCAREQNNGCKGGTMIEA 200
Query: 208 FEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAVSMQPVS 267
F YI++N G+++E+ YPYQ +G C + A I +E VPS +E+ALL+AVS QPV+
Sbjct: 201 FNYIVKNGGVSSENAYPYQVKEGPCR-SNDIPAIVIRGFENVPSNNERALLEAVSRQPVA 259
Query: 268 IGIAAYTTEFKSYKEGIFNGV-CGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAG 326
+ I A T F Y G++N CGT ++HAVT+VG+GT+++G YWL KNSWG TWG+ G
Sbjct: 260 VDIDASETGFIHYSGGVYNARDCGTSVNHAVTLVGYGTSQEGIKYWLAKNSWGKTWGENG 319
Query: 327 YMKILRD----EGLCGIGTQSSYPLA 348
Y++I RD +G+CG+ +SYP+A
Sbjct: 320 YIRIRRDVEWPQGMCGVAQYASYPVA 345
>gi|18396939|ref|NP_564320.1| Papain family cysteine protease [Arabidopsis thaliana]
gi|9502427|gb|AAF88126.1|AC021043_19 Putative cysteine proteinase [Arabidopsis thaliana]
gi|67633400|gb|AAY78625.1| peptidase C1A papain family protein [Arabidopsis thaliana]
gi|332192919|gb|AEE31040.1| Papain family cysteine protease [Arabidopsis thaliana]
Length = 346
Score = 325 bits (833), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 160/342 (46%), Positives = 227/342 (66%), Gaps = 13/342 (3%)
Query: 19 FIIIILLVSCASQVVSSRSTH-----EQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKE 73
F+ ++L + +S ++ S+V+ H++WM Q R Y DE EK++R ++ E
Sbjct: 6 FVCVVLTIFFMDLKISEATSRVALYKPSSIVDYHQQWMIQFSRVYDDEFEKQLRLQVLTE 65
Query: 74 NLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYK-MPSPSHRSTTSSTFKYQNLS 132
NL++IE N GN++YKLG N F+D T +EF A YTG + + S + T N +
Sbjct: 66 NLKFIESFNNMGNQSYKLGVNEFTDWTKEEFLATYTGLRGVNVTSPFEVVNETKPAWNWT 125
Query: 133 MTDV-PTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDC 191
++DV T+ DWR++ AVTP+K Q ECG CWAFSA+AAVEG+TKI+ NLI LSEQQL+DC
Sbjct: 126 VSDVLGTNKDWRNEGAVTPVKSQGECGGCWAFSAIAAVEGLTKIARGNLISLSEQQLLDC 185
Query: 192 STNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPS 251
+ NNGC GGT AF YII+++GI++E+EYPYQ +G C + + A I +E VPS
Sbjct: 186 TREQNNGCKGGTFVNAFNYIIKHRGISSENEYPYQVKEGPCRSNARPAIL-IRGFENVPS 244
Query: 252 GDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGV-CGTQLDHAVTIVGFGTTEDGAN 310
+E+ALL+AVS QPV++ I A F Y G++N CGT ++HAVT+VG+GT+ +G
Sbjct: 245 NNERALLEAVSRQPVAVAIDASEAGFVHYSGGVYNARNCGTSVNHAVTLVGYGTSPEGMK 304
Query: 311 YWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYPLA 348
YWL KNSWG TWG+ GY++I RD +G+CG+ +SYP+A
Sbjct: 305 YWLAKNSWGKTWGENGYIRIRRDVEWPQGMCGVAQYASYPVA 346
>gi|302143411|emb|CBI21972.3| unnamed protein product [Vitis vinifera]
Length = 320
Score = 324 bits (831), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 166/336 (49%), Positives = 224/336 (66%), Gaps = 33/336 (9%)
Query: 19 FIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYI 78
++ +L + ASQ ++RS HE S+ E HE WM Q+GR YKD EK R+KIFK+N+ I
Sbjct: 12 LALLFVLAAWASQA-TARSLHEASMYERHEDWMVQYGREYKDADEKSKRYKIFKDNVARI 70
Query: 79 EKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHR-STTSSTFKYQNLSMTDVP 137
E NK +++YKL N F+DLTN+EFRA +K +H ST +++FKY+N+ T VP
Sbjct: 71 ESFNKAMDKSYKLSINEFADLTNEEFRASRNRFK----AHICSTEATSFKYENV--TAVP 124
Query: 138 TSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNN 197
+++DWR K AVTPIKDQ +CG CWAFSAVAA+EGIT++S LI LSEQ+LVDC T+G
Sbjct: 125 STVDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSG-- 182
Query: 198 GCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKA-AAAKISNYEEVPSGDEQA 256
++QG YPY GTC+ + A AAKI+ YE+VP+ +E+A
Sbjct: 183 ---------------EDQGCTN---YPYAGTDGTCNRKKAAHPAAKINGYEDVPANNEKA 224
Query: 257 LLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKN 316
L KAV+ QP+++ I A +EF+ Y G+F G CGT+LDH V VG+GT++DG YWL+KN
Sbjct: 225 LQKAVAHQPIAVAIDASGSEFQFYSSGVFTGQCGTELDHGVAAVGYGTSDDGMKYWLVKN 284
Query: 317 SWGDTWGDAGYMKILRD----EGLCGIGTQSSYPLA 348
SW WG+ GY+++ RD EGLCGI Q+SYP A
Sbjct: 285 SWSTGWGEEGYIRMQRDVTAKEGLCGIAMQASYPTA 320
>gi|226502454|ref|NP_001140922.1| hypothetical protein [Zea mays]
gi|223948637|gb|ACN28402.1| unknown [Zea mays]
gi|413920877|gb|AFW60809.1| hypothetical protein ZEAMMB73_830238 [Zea mays]
Length = 354
Score = 324 bits (831), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 162/344 (47%), Positives = 232/344 (67%), Gaps = 17/344 (4%)
Query: 16 IPMFIIIILLVSCASQVVSSRSTH---EQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFK 72
+ + I+ + + ++ +SS ST E+++ H++WMA+HGR+Y+DE EK RF++FK
Sbjct: 17 VALTILAVKTMMAEARDLSSTSTGGYGEEAMKVRHQQWMAEHGRTYRDEAEKAHRFQVFK 76
Query: 73 ENLEYIEKANKEGN--RTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQN 130
N ++++ +N G+ ++Y++ N F+D+TNDEF A+YTG + P P+ + FKY N
Sbjct: 77 ANADFVDASNAAGDDKKSYRMELNEFADMTNDEFMAMYTGLR-PVPAGAKKMAG-FKYGN 134
Query: 131 LSMTDVPT---SLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQ 187
++++D ++DWR K AVT IK+Q +CGCCWAF+AVAAVEGI +I+ NL+ LSEQQ
Sbjct: 135 VTLSDADDNQQTVDWRQKGAVTGIKNQGQCGCCWAFAAVAAVEGIHQITTGNLVSLSEQQ 194
Query: 188 LVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYE 247
++DC T GNNGC GG ++ AF+YI N G+ATED YPY A Q C + Q AA IS Y+
Sbjct: 195 VLDCDTEGNNGCNGGYIDNAFQYIAGNGGLATEDAYPYTAAQAMCQSVQPVAA--ISGYQ 252
Query: 248 EVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGV-CGT--QLDHAVTIVGFGT 304
+VPSGDE AL AV+ QPVS+ I A+ F+ Y G+ C T L+HAVT VG+GT
Sbjct: 253 DVPSGDEAALAAAVANQPVSVAIDAHN--FQLYGGGVMTAASCSTPPNLNHAVTAVGYGT 310
Query: 305 TEDGANYWLIKNSWGDTWGDAGYMKILRDEGLCGIGTQSSYPLA 348
EDG YWL+KN WG WG+ GY+++ R CG+ Q+SYP+A
Sbjct: 311 AEDGTPYWLLKNQWGQNWGEGGYLRLERGANACGVAQQASYPVA 354
>gi|357160572|ref|XP_003578808.1| PREDICTED: vignain-like [Brachypodium distachyon]
Length = 339
Score = 324 bits (831), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 154/341 (45%), Positives = 218/341 (63%), Gaps = 9/341 (2%)
Query: 13 INTIPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFK 72
I + I+ L C S + + + S+V HE WM Q+GR YKD EK +F++FK
Sbjct: 3 IPKASLLAILGCLCLCGSVLAARELNDDLSMVARHENWMLQYGRVYKDAAEKAQKFEVFK 62
Query: 73 ENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
N E+I N GN + LG N+F+D+TN+EF+A T S R T F Y+N+S
Sbjct: 63 ANAEFINSFNA-GNHKFWLGINQFADITNEEFKATKTNKGFISNKVRVPTG--FMYENMS 119
Query: 133 MTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCS 192
+P ++DWR K AVTPIKDQ +CGCCWAFSAVAA+EGI K+S L+ LSEQ+LVDC
Sbjct: 120 FDALPATIDWRTKGAVTPIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLVSLSEQELVDCD 179
Query: 193 TNG-NNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPS 251
+G + GC GG M+ AF++II+N G+ E YPY A G C + ++AA I +YE+VP+
Sbjct: 180 VHGEDQGCEGGLMDDAFKFIIKNGGLTQESNYPYDAADGKCKSGS-SSAATIKSYEDVPA 238
Query: 252 GDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANY 311
+E AL+KAV+ QPVS+ + F+ Y G+ G CGT LDH + +G+GTT DG +
Sbjct: 239 NNEGALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYGTTSDGTKF 298
Query: 312 WLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYPLA 348
W++KNSWG +WG+ G++++ +D +G+CG+ + SYP A
Sbjct: 299 WIMKNSWGTSWGENGFLRMEKDIADKKGMCGLAMEPSYPTA 339
>gi|356515038|ref|XP_003526208.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 339
Score = 324 bits (831), Expect = 4e-86, Method: Compositional matrix adjust.
Identities = 156/333 (46%), Positives = 217/333 (65%), Gaps = 9/333 (2%)
Query: 19 FIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYI 78
++I+ L+++ + V SR E E HEKWMAQ+G+ Y D EKE RF+IFK N+++I
Sbjct: 9 YLILFLILTVWTFHVMSRRLSEVCTSERHEKWMAQYGKLYTDAAEKEKRFQIFKNNVQFI 68
Query: 79 EKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPT 138
E N G++ + L N+F+DL N+EF+A + + T ++F+Y+ S+T +P
Sbjct: 69 ESFNAAGDKPFNLSINQFADLHNEEFKASLINVQKKESGVETATETSFRYE--SITKIPV 126
Query: 139 SLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNG 198
++DWR + AVTPIKDQ CG CWAFS VAA+EGI +I+ L+ LSEQ+LVDC + G
Sbjct: 127 TMDWRKRGAVTPIKDQGNCGSCWAFSIVAAIEGIHQITTGKLVSLSEQELVDCVKGKSEG 186
Query: 199 CGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKA-AAAKISNYEEVPSGDEQAL 257
C G E+AFE++ +N G+A+E YPY+A TC ++ A+I YE VPS E+AL
Sbjct: 187 CNFGYKEEAFEFVAKNGGLASEISYPYKANNKTCMVKKETQGVAQIKGYENVPSNSEKAL 246
Query: 258 LKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNS 317
LKAV+ QPVS+ I A +F Y GIF G CGT +HA T++G+G GA YWL+KNS
Sbjct: 247 LKAVANQPVSVYIDAGALQF--YSSGIFTGKCGTAPNHAATVIGYGKARGGAKYWLVKNS 304
Query: 318 WGDTWGDAGYMKILRD----EGLCGIGTQSSYP 346
WG WG+ GY+++ RD EGLCGI T +SYP
Sbjct: 305 WGTKWGEKGYIRMKRDIRAKEGLCGIATNASYP 337
>gi|356543118|ref|XP_003540010.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 339
Score = 323 bits (828), Expect = 7e-86, Method: Compositional matrix adjust.
Identities = 166/339 (48%), Positives = 226/339 (66%), Gaps = 17/339 (5%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQS--VVEMHEKWMAQHGRSYKDELEKEMRFKIFKENL 75
+ +++LL C SQV+S R+ HE S + E HE+W ++G+ YKD EK+ R IFK+N+
Sbjct: 10 ILALVLLLPICISQVMS-RNLHEASXCMSERHEQWTKKYGKVYKDAAEKQKRLLIFKDNV 68
Query: 76 EYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSST-FKYQNLSMT 134
E+IE N GN+ YKL N +D TN+EF A + GYK H+ + S T FKY+N+ T
Sbjct: 69 EFIESFNAAGNKPYKLSINHLTDQTNEEFVASHNGYK-----HKGSHSQTPFKYENI--T 121
Query: 135 DVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTN 194
VP ++DWR+ AV +KDQ +CG CWAFS VA EGI +I+ + L+ LSEQ+LVDC +
Sbjct: 122 GVPNAVDWRENGAVXAMKDQGQCGNCWAFSTVATTEGIYQITTSMLMSLSEQELVDCDSV 181
Query: 195 GNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAA-AAKISNYEEVPSGD 253
++GC GG ME FE+I +N GI++E YPY AV GT A ++A+ AA+I YE VP+
Sbjct: 182 -DHGCDGGYMEGGFEFIXKNGGISSEANYPYTAVDGTYDANKEASPAAQIKGYETVPANS 240
Query: 254 EQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWL 313
E AL KAV+ QPVS+ I + F+ G+F G CGTQLDH VT VG+G+T+DG YW+
Sbjct: 241 EDALQKAVANQPVSVTIDVGGSAFQFNSSGVFTGQCGTQLDHGVTAVGYGSTDDGTQYWI 300
Query: 314 IKNSWGDTWGDAGYMKILR----DEGLCGIGTQSSYPLA 348
+KNSWG WG+ GY+++ R EGLCGI +SYP A
Sbjct: 301 VKNSWGTQWGEEGYIRMQRGTDAQEGLCGIAMDASYPTA 339
>gi|171702829|dbj|BAG16370.1| cysteine protease [Brassica oleracea var. italica]
Length = 332
Score = 323 bits (827), Expect = 9e-86, Method: Compositional matrix adjust.
Identities = 157/328 (47%), Positives = 218/328 (66%), Gaps = 7/328 (2%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEY 77
+F+I+ L+ S + SR E ++ + H WM +HGR Y D EK R+ +FK N+E
Sbjct: 2 IFLIVSLVSSFSLSTTLSRPLDEVTMQKRHAAWMTEHGRVYADANEKNNRYVVFKRNVES 61
Query: 78 IEKANK-EGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDV 136
IE+ N+ + T+KL N+F+DLTN+EFR++YTGYK S T ++F+YQ++S +
Sbjct: 62 IERLNEVQYGLTFKLAVNQFADLTNEEFRSMYTGYKGNSVLSSRTKPTSFRYQHVSSDAL 121
Query: 137 PTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGN 196
P S+DWR K AVTPIKDQ CG CWAFSAVAA+EG+ +I LI LSEQ+LVDC TN +
Sbjct: 122 PISVDWRKKGAVTPIKDQGSCGSCWAFSAVAAIEGVAQIKKGKLISLSEQELVDCDTN-D 180
Query: 197 NGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQ-KAAAAKISNYEEVPSGDEQ 255
+GC GG M AF Y + G+ +E YPY++ GTC+ + K A I +E+VP+ DE+
Sbjct: 181 DGCMGGYMNSAFNYTMTTGGLTSESNYPYKSTDGTCNINKTKQIATSIKGFEDVPANDEK 240
Query: 256 ALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIK 315
AL+KAV+ PVSIGIA T F+ Y G+F+G C T LDH V +VG+G + +G+ YW++K
Sbjct: 241 ALMKAVAHHPVSIGIAGGGTGFQFYSSGVFSGECSTHLDHGVAVVGYGKSSNGSKYWILK 300
Query: 316 NSWGDTWGDAGYMKILRD----EGLCGI 339
NSWG WG+ GYM+I +D G CG+
Sbjct: 301 NSWGPKWGERGYMRIKKDTKAKHGQCGL 328
>gi|356553978|ref|XP_003545327.1| PREDICTED: cysteine proteinase RD21a-like [Glycine max]
Length = 496
Score = 323 bits (827), Expect = 9e-86, Method: Compositional matrix adjust.
Identities = 158/324 (48%), Positives = 214/324 (66%), Gaps = 10/324 (3%)
Query: 30 SQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTY 89
+ +SRS E ++ M+E+W+ +HG+ Y EKE RF+IFK+NL +I+ N + +RTY
Sbjct: 64 AHAATSRSDEE--LMSMYEQWLVKHGKVYNALGEKEKRFQIFKDNLRFIDDHNSQEDRTY 121
Query: 90 KLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVT 149
KLG NRF+DLTN+E+RA Y G K+ P+ R + + +Y +P S+DWR + AV
Sbjct: 122 KLGLNRFADLTNEEYRAKYLGTKI-DPNRRLGKTPSNRYAPRVGDKLPESVDWRKEGAVP 180
Query: 150 PIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFE 209
P+KDQ CG CWAFSA+ AVEGI KI LI LSEQ+LVDC T N GC GG M+ AFE
Sbjct: 181 PVKDQGGCGSCWAFSAIGAVEGINKIVTGELISLSEQELVDCDTGYNEGCNGGLMDYAFE 240
Query: 210 YIIQNQGIATEDEYPYQAVQGTCSAAQK-AAAAKISNYEEVPSGDEQALLKAVSMQPVSI 268
+II N GI +E++YPY+ V G C +K A I +YE+VP+ DE AL KAV+ QPVS+
Sbjct: 241 FIINNGGIDSEEDYPYRGVDGRCDTYRKNAKVVSIDDYEDVPAYDELALKKAVANQPVSV 300
Query: 269 GIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYM 328
I EF+ Y G+F G CGT LDH V VG+GT +G +YW+++NSWG +WG+ GY+
Sbjct: 301 AIEGGGREFQLYVSGVFTGRCGTALDHGVVAVGYGTA-NGHDYWIVRNSWGPSWGEDGYI 359
Query: 329 KILRD-----EGLCGIGTQSSYPL 347
++ R+ G CGI + SYPL
Sbjct: 360 RLERNLANSRSGKCGIAIEPSYPL 383
>gi|116787404|gb|ABK24495.1| unknown [Picea sitchensis]
gi|224286306|gb|ACN40861.1| unknown [Picea sitchensis]
Length = 452
Score = 322 bits (826), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 154/323 (47%), Positives = 222/323 (68%), Gaps = 10/323 (3%)
Query: 32 VVSSRSTHEQ-SVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYK 90
++SS+ E +++E++E W+A+H R+Y EK+ RF +FK+N YI + N +GNR+YK
Sbjct: 26 IISSKDLREDDAIMELYELWLAEHKRAYNGLDEKQKRFSVFKDNFLYIHEHN-QGNRSYK 84
Query: 91 LGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTP 150
LG N+F+DL+++EF+A Y G K+ + S S +YQ D+P S+DWR+K AVT
Sbjct: 85 LGLNQFADLSHEEFKATYLGAKLDTKKRLSRPPSR-RYQYSDGEDLPESIDWREKGAVTS 143
Query: 151 IKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEY 210
+KDQ CG CWAFS VAAVEGI +I +LI LSEQ+LVDC T+ N GC GG M+ AFE+
Sbjct: 144 VKDQGSCGSCWAFSTVAAVEGINQIVTGDLISLSEQELVDCDTSYNQGCNGGLMDYAFEF 203
Query: 211 IIQNQGIATEDEYPYQAVQGTCSAAQK-AAAAKISNYEEVPSGDEQALLKAVSMQPVSIG 269
II N G+ +E++YPY A G+C + +K A I +YE+VP DE++L KA + QP+S+
Sbjct: 204 IINNGGLDSEEDYPYTAYDGSCDSYRKNAHVVTIDDYEDVPENDEKSLKKAAANQPISVA 263
Query: 270 IAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMK 329
I A EF+ Y G+F CGTQLDH VT+VG+G +E G +YW +KNSWG +WG+ G+++
Sbjct: 264 IEASGREFQFYDSGVFTSTCGTQLDHGVTLVGYG-SESGTDYWTVKNSWGKSWGEEGFIR 322
Query: 330 ILRD-----EGLCGIGTQSSYPL 347
+ R+ G+CGI ++SYP+
Sbjct: 323 LQRNIEVASTGMCGIAMEASYPV 345
>gi|1208549|gb|AAC49455.1| Pseudotzain [Pseudotsuga menziesii]
Length = 454
Score = 322 bits (825), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 150/314 (47%), Positives = 220/314 (70%), Gaps = 8/314 (2%)
Query: 40 EQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDL 99
+ +++E++E W+AQH ++Y EK+ +F +FK+N YI + N +GN +YKLG N+F+DL
Sbjct: 37 DDAIMELYELWLAQHKKAYNGLDEKQKKFSVFKDNFLYIHQHNNQGNPSYKLGLNQFADL 96
Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGC 159
+++EF+A Y G K+ + R + S + +YQ D+P S+DWR+K AVT +K+Q CG
Sbjct: 97 SHEEFKAAYLGTKLDA-KKRLSRSPSPRYQYSVGEDLPESIDWREKGAVTAVKNQGSCGS 155
Query: 160 CWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIAT 219
CWAFS VAAVEGI +I NL LSEQ+LVDC T+ N GC GG M+ AF++II N G+ +
Sbjct: 156 CWAFSTVAAVEGINQIVTGNLTSLSEQELVDCDTSYNQGCNGGLMDYAFQFIISNGGLDS 215
Query: 220 EDEYPYQAVQGTCSAAQK-AAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFK 278
ED+YPY+A G+C A +K A I +YE+VP DE++L KA + QP+S+ I A F+
Sbjct: 216 EDDYPYKANNGSCDAYRKNAHVVTIDDYEDVPENDEKSLKKAAANQPISVAIEASGRAFQ 275
Query: 279 SYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD----- 333
Y+ G+F CGTQLDH VT+VG+G +E G +YWL+KNSWG++WG+ G++K+ R+
Sbjct: 276 FYESGVFTSNCGTQLDHGVTLVGYG-SESGIDYWLVKNSWGNSWGEKGFIKLQRNLEGAS 334
Query: 334 EGLCGIGTQSSYPL 347
G+CGI ++SYP+
Sbjct: 335 TGMCGIAMEASYPV 348
>gi|356521444|ref|XP_003529366.1| PREDICTED: thiol protease SEN102-like [Glycine max]
Length = 340
Score = 322 bits (825), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 164/339 (48%), Positives = 220/339 (64%), Gaps = 14/339 (4%)
Query: 15 TIPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKEN 74
++ F ++ L C + SSR+ E S+ HE+WMA H R Y D EK+ R +IFKEN
Sbjct: 9 SVGTFFMLFLTCICRA---SSRTLSESSIATQHEEWMAMHDRVYADSAEKDRRQQIFKEN 65
Query: 75 LEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTG--YKMPSPSHRSTTSSTFKYQNLS 132
LE+IEK N EG + Y L N F+DLTN+EF A +TG YK P+ + + + +S
Sbjct: 66 LEFIEKHNNEGKKRYNLSLNSFADLTNEEFVASHTGALYKPPTQLGSFKINHSLGFHKMS 125
Query: 133 MTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCS 192
+ D+ SLDWR + AV IK+Q CG CWAFSAVAAVEGI +I L+ LSEQ LVDC+
Sbjct: 126 VGDIEASLDWRKRGAVNDIKNQGRCGSCWAFSAVAAVEGINQIKNGQLVSLSEQNLVDCA 185
Query: 193 TNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSG 252
+ N+GC G +EKAF+Y I++ G+A E+EYPY GTCS A +I Y+ V
Sbjct: 186 S--NDGCHGQYVEKAFDY-IRDYGLANEEEYPYVETVGTCS-GNSNPAIQIRGYQSVTPQ 241
Query: 253 DEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYW 312
+E+ LL AV+ QPVS+ + A F+ Y G+F+G CGT+L+HAVTIVG+G +G YW
Sbjct: 242 NEEQLLTAVASQPVSVLLEAKGQGFQFYSGGVFSGECGTELNHAVTIVGYGEEAEG-KYW 300
Query: 313 LIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYPL 347
LI+NSWG +WG+ GYMK++RD +GLCGI Q+SYP
Sbjct: 301 LIRNSWGKSWGEGGYMKLMRDTGNPQGLCGINMQASYPF 339
>gi|21070926|gb|AAM34401.1|AF377947_7 putative cysteine proteinase [Oryza sativa Japonica Group]
gi|31712050|gb|AAP68356.1| putative cysteine protease [Oryza sativa Japonica Group]
gi|40538988|gb|AAR87245.1| putative cysteine protease [Oryza sativa Japonica Group]
gi|108711126|gb|ABF98921.1| Papain family cysteine protease containing protein, expressed
[Oryza sativa Japonica Group]
gi|125545747|gb|EAY91886.1| hypothetical protein OsI_13535 [Oryza sativa Indica Group]
Length = 350
Score = 322 bits (825), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 163/350 (46%), Positives = 224/350 (64%), Gaps = 18/350 (5%)
Query: 15 TIPMFIIIILLVSCASQV--------VSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEM 66
+ F++ +L++S A+ + ++ + + ++ HEKWMA+HG++YKDE EK
Sbjct: 2 ALSTFVLAVLVMSGAAALGRELAGDGAAAAAAADVAMASRHEKWMAKHGKTYKDEEEKAR 61
Query: 67 RFKIFKENLEYIEKAN----KEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTT 122
R ++F+ N + I+ N K+G ++L TNRF+DLT+DEFRA TGY+ P P+ +
Sbjct: 62 RLEVFRANAKLIDSFNAAAEKDGGGGHRLATNRFADLTDDEFRAARTGYQRP-PAAVAGA 120
Query: 123 SSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQ 182
F Y+N S+ P S+DWR AVT +KDQ CGCCWAFSAVAAVEG+ KI L+
Sbjct: 121 GGGFLYENFSLAAAPQSMDWRAMGAVTGVKDQGSCGCCWAFSAVAAVEGLAKIRTGQLVS 180
Query: 183 LSEQQLVDCSTNG-NNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAA 241
LSEQ+LVDC G + GC GG M+ AF+YI + G+A E YPY+ V G C AA AAA
Sbjct: 181 LSEQELVDCDVRGEDQGCEGGLMDTAFQYIARRGGLAAESSYPYRGVDGACRAAAGRAAA 240
Query: 242 KISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGV-CGTQLDHAVTIV 300
I +++VPS DE AL+ AV+ QPVS+ I F+ Y G+ G CGT+L+HAVT V
Sbjct: 241 SIRGFQDVPSNDEGALMAAVARQPVSVAINGAGYVFRFYDRGVLGGAGCGTELNHAVTAV 300
Query: 301 GFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD---EGLCGIGTQSSYPL 347
G+GT DG YWL+KNSWG +WG+ GY++I R EG CGI +SYP+
Sbjct: 301 GYGTASDGTGYWLMKNSWGASWGEGGYVRIRRGVGREGACGIAQMASYPV 350
>gi|357160569|ref|XP_003578807.1| PREDICTED: vignain-like [Brachypodium distachyon]
Length = 339
Score = 322 bits (824), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 154/341 (45%), Positives = 215/341 (63%), Gaps = 9/341 (2%)
Query: 13 INTIPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFK 72
I + I+ L C+S + + + S+V HE WM Q+GR YKD EK +F++FK
Sbjct: 3 IPKASLLAILGCLCFCSSVLAARELNDDLSMVARHESWMLQYGRVYKDAAEKASKFEVFK 62
Query: 73 ENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
N +I+ N GN + LG N+F+D+TN EF+A T S R+ T F Y+N+S
Sbjct: 63 ANAGFIDSFNA-GNHKFWLGINQFADITNKEFKATKTNKGFISNKVRAPTG--FSYENVS 119
Query: 133 MTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCS 192
+P S+DWR K AVTP+KDQ +CGCCWAFSAVAA EGI K+S L+ LSEQ+LVDC
Sbjct: 120 FDALPASIDWRTKGAVTPVKDQGQCGCCWAFSAVAATEGIVKLSTGKLVSLSEQELVDCD 179
Query: 193 TNG-NNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPS 251
+G + GC GG M+ AF++II N G+ E YPY A G C + K+A I +YE+VP+
Sbjct: 180 VHGEDQGCEGGLMDDAFKFIISNGGLTQESSYPYDAEDGKCKSGSKSAGT-IKSYEDVPA 238
Query: 252 GDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANY 311
+E AL+KAV+ QPVS+ + F+ Y G+ G CGT LDH + +G+G T DG Y
Sbjct: 239 NNEGALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYGVTSDGTKY 298
Query: 312 WLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYPLA 348
WL+KNSWG +WG+ G++++ +D +G+CG+ + SYP A
Sbjct: 299 WLMKNSWGTSWGENGFLRMEKDIADKKGMCGLAMEPSYPTA 339
>gi|356564154|ref|XP_003550321.1| PREDICTED: cysteine proteinase RD21a-like [Glycine max]
Length = 476
Score = 322 bits (824), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 168/364 (46%), Positives = 227/364 (62%), Gaps = 27/364 (7%)
Query: 9 GSFKINTIP----MFIIIILL----VSCA--SQVVSSRSTH---------EQSVVEMHEK 49
GS I T P M I++L VS A ++S S H E+ ++ M+E+
Sbjct: 2 GSSSITTSPATMTMAAIVLLFTVFAVSSALDMSIISYDSAHADKAATLRTEEELMSMYEQ 61
Query: 50 WMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYT 109
W+ +HG+ Y EKE RF+IFK+NL +I+ N +RTYKLG NRF+DLTN+E+RA Y
Sbjct: 62 WLVKHGKVYNALGEKEKRFQIFKDNLRFIDDHNSAEDRTYKLGLNRFADLTNEEYRAKYL 121
Query: 110 GYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAV 169
G K+ P+ R + + +Y +P S+DWR + AV P+KDQ CG CWAFSA+ AV
Sbjct: 122 GTKI-DPNRRLGKTPSNRYAPRVGDKLPDSVDWRKEGAVPPVKDQGGCGSCWAFSAIGAV 180
Query: 170 EGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQ 229
EGI KI LI LSEQ+LVDC T N GC GG M+ AFE+II N GI ++++YPY+ V
Sbjct: 181 EGINKIVTGELISLSEQELVDCDTGYNQGCNGGLMDYAFEFIINNGGIDSDEDYPYRGVD 240
Query: 230 GTCSAAQK-AAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGV 288
G C +K A I +YE+VP+ DE AL KAV+ QPVS+ I EF+ Y G+F G
Sbjct: 241 GRCDTYRKNAKVVSIDDYEDVPAYDELALKKAVANQPVSVAIEGGGREFQLYVSGVFTGR 300
Query: 289 CGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD-----EGLCGIGTQS 343
CGT LDH V VG+GT + G +YW+++NSWG +WG+ GY+++ R+ G CGI +
Sbjct: 301 CGTALDHGVVAVGYGTAK-GHDYWIVRNSWGSSWGEDGYIRLERNLANSRSGKCGIAIEP 359
Query: 344 SYPL 347
SYPL
Sbjct: 360 SYPL 363
>gi|302143415|emb|CBI21976.3| unnamed protein product [Vitis vinifera]
Length = 322
Score = 321 bits (823), Expect = 3e-85, Method: Compositional matrix adjust.
Identities = 166/337 (49%), Positives = 222/337 (65%), Gaps = 33/337 (9%)
Query: 19 FIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYI 78
++ +L + ASQ ++R+ HE S+ E HE WMAQ+GR YKD EK R+KIFK+N+ I
Sbjct: 12 LALLFVLAAWASQA-TARNLHEASMYERHEDWMAQYGRVYKDADEKSKRYKIFKDNVARI 70
Query: 79 EKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHR-STTSSTFKYQNLSMTDVP 137
E NK +++YKL N F+DLTN+EF +K +H ST +++FKY+N+ T VP
Sbjct: 71 ESFNKAMDKSYKLSINEFADLTNEEFGTSRNRFK----AHICSTEATSFKYENV--TAVP 124
Query: 138 TSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNG-N 196
+++DWR K AVTPIKDQ +CG CWAFSAVAA+EGIT++S LI LSEQ+LVDC T+G +
Sbjct: 125 STIDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGED 184
Query: 197 NGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKA-AAAKISNYEEVPSGDEQ 255
GC G YPY GTC+ + A AAKI+ YE+VP+ +E+
Sbjct: 185 QGCNGAN-------------------YPYAGTDGTCNRKKAAHPAAKINGYEDVPANNEK 225
Query: 256 ALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIK 315
AL KAV QP+++ I A EF+ Y G+F G CGT+LDH V VG+GT++DG YWL+K
Sbjct: 226 ALQKAVVHQPIAVAIDAGGFEFQFYSSGVFTGQCGTELDHGVAAVGYGTSDDGMKYWLVK 285
Query: 316 NSWGDTWGDAGYMKILRD----EGLCGIGTQSSYPLA 348
NSWG WG+ GY+++ RD EGLCGI Q+SYP A
Sbjct: 286 NSWGTGWGEEGYIRMQRDVTAKEGLCGIAMQASYPTA 322
>gi|242070333|ref|XP_002450443.1| hypothetical protein SORBIDRAFT_05g005530 [Sorghum bicolor]
gi|241936286|gb|EES09431.1| hypothetical protein SORBIDRAFT_05g005530 [Sorghum bicolor]
Length = 351
Score = 321 bits (823), Expect = 3e-85, Method: Compositional matrix adjust.
Identities = 169/355 (47%), Positives = 232/355 (65%), Gaps = 22/355 (6%)
Query: 10 SFKINTIPM------FIIIILLVSCASQVVSSRSTH------EQSVVEMHEKWMAQHGRS 57
S+ N P+ + ++ + +C V++R E+++ HEKWM +HGR+
Sbjct: 3 SYIANNKPLITAAVALLTVLAIANCIGCAVAARDLSSSTGYGEEAMTARHEKWMVEHGRT 62
Query: 58 YKDELEKEMRFKIFKENLEYIEKANKE-GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSP 116
YKDE EK RF++FK N +++ +N G + Y L NRF+D+T+DEF A YTG+K P P
Sbjct: 63 YKDEAEKARRFQVFKANAAFVDTSNAAAGGKKYHLAINRFADMTHDEFMARYTGFK-PLP 121
Query: 117 SHRSTTSSTFKYQNLSMT-DVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKI 175
+ FKY N++++ + ++DWR K AVT +K+QQ+CGCCWAFSAVAA+EG+ +I
Sbjct: 122 ATGKKMPG-FKYANVTLSSEDQQAVDWRKKGAVTDVKNQQKCGCCWAFSAVAAIEGMHQI 180
Query: 176 SGANLIQLSEQQLVDCST-NGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSA 234
+ L+ LSEQQLVDCST NNGCGGGTME AF+Y+I N GIATE YPY A+QG C
Sbjct: 181 NTGELVSLSEQQLVDCSTNGNNNGCGGGTMEDAFQYVIGNNGIATEAAYPYTAMQGMCQN 240
Query: 235 AQKAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNG-VCGTQL 293
Q A A + +Y++VP DE AL AV+ QPVS+ + A F+ YK G+ CGT L
Sbjct: 241 VQPAVA--VRSYQQVPRDDEDALAAAVAGQPVSVAVDA--NNFQFYKGGVMTADSCGTNL 296
Query: 294 DHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRDEGLCGIGTQSSYPLA 348
+HAVT VG+GT EDG YWL+KN WG TWG+ GY+++ R G CG+ +SYP+A
Sbjct: 297 NHAVTAVGYGTAEDGTPYWLLKNQWGSTWGEEGYLRLQRGVGACGVAKDASYPVA 351
>gi|357160591|ref|XP_003578813.1| PREDICTED: vignain-like [Brachypodium distachyon]
Length = 339
Score = 321 bits (823), Expect = 3e-85, Method: Compositional matrix adjust.
Identities = 152/341 (44%), Positives = 219/341 (64%), Gaps = 9/341 (2%)
Query: 13 INTIPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFK 72
I + I+ L AS + + + S+V HE WM+Q+GRSYKD EK+ +F++FK
Sbjct: 3 IPNASLLAILGCLCFFASGLAARELNDDLSMVARHESWMSQYGRSYKDAAEKDRKFEVFK 62
Query: 73 ENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
N +I+ N + N + LG N+F+D+TN+EF+ T S R++T F Y+N+S
Sbjct: 63 ANAAFIDSFNAK-NHKFWLGINQFADITNEEFKVTKTNKGFISNKVRASTG--FSYENVS 119
Query: 133 MTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCS 192
+ +P ++DWR K AVTP+KDQ +CGCCWAFSAVAA EGI K+S L+ LSEQ+LVDC
Sbjct: 120 IDALPATIDWRTKGAVTPVKDQGQCGCCWAFSAVAATEGIVKLSTGKLVSLSEQELVDCD 179
Query: 193 TNG-NNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPS 251
+G + GC GG M+ AF++II N G+ E YPY A G C + K+A I +YE+VP+
Sbjct: 180 VHGEDQGCEGGLMDDAFKFIITNGGLTQESSYPYDAEDGKCKSGSKSAGT-IKSYEDVPA 238
Query: 252 GDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANY 311
+E AL+KAV+ QPVS+ + F+ Y G+ G CGT LDH + +G+G T DG Y
Sbjct: 239 NNEGALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYGVTSDGTKY 298
Query: 312 WLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYPLA 348
WL+KNSWG +WG+ G++++ +D +G+CG+ + SYP A
Sbjct: 299 WLMKNSWGTSWGENGFLRMEKDIADKKGMCGLAMEPSYPTA 339
>gi|124484387|dbj|BAF46304.1| cysteine proteinase precursor [Ipomoea nil]
Length = 474
Score = 321 bits (822), Expect = 4e-85, Method: Compositional matrix adjust.
Identities = 155/313 (49%), Positives = 212/313 (67%), Gaps = 8/313 (2%)
Query: 40 EQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDL 99
E V EM E W+ +HG+SY EK+ RFKIF++NL+YI++ N NR+YKLG NRF+D+
Sbjct: 43 EDEVKEMFESWLVKHGKSYNAVDEKDKRFKIFRDNLKYIDEKNSLENRSYKLGLNRFADI 102
Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGC 159
TN+E+R Y G K + S S + +Y ++ +P S+DWR+K AVT +KDQ CG
Sbjct: 103 TNEEYRTGYLGAKRDA-SRNMVKSKSDRYAPVAGDSLPDSIDWREKGAVTGVKDQGSCGS 161
Query: 160 CWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIAT 219
CWAFS +AAVEG+ +++ NLI LSEQ+LVDC N GC GG M AF++II+N GI +
Sbjct: 162 CWAFSTIAAVEGVNQLATGNLISLSEQELVDCDRKINQGCNGGDMGYAFQFIIKNGGIDS 221
Query: 220 EDEYPYQAVQGTCSAAQK--AAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEF 277
E++YPY G C + ++ A A I YEEVP +E++L KAV+ QPVS+ I A +F
Sbjct: 222 EEDYPYTGKDGKCDSYRQNNAKVASIDGYEEVPVNNEKSLQKAVANQPVSVAIEAGGYDF 281
Query: 278 KSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD---- 333
+ Y GIF G CGT LDH V VG+G TE+G +YW++KNSWGD WG+ GY+++ R+
Sbjct: 282 QLYSSGIFTGSCGTDLDHGVAAVGYG-TENGVDYWIVKNSWGDYWGEKGYVRMQRNVKAK 340
Query: 334 EGLCGIGTQSSYP 346
GLCGI ++SYP
Sbjct: 341 TGLCGIAMEASYP 353
>gi|297819566|ref|XP_002877666.1| hypothetical protein ARALYDRAFT_906213 [Arabidopsis lyrata subsp.
lyrata]
gi|297323504|gb|EFH53925.1| hypothetical protein ARALYDRAFT_906213 [Arabidopsis lyrata subsp.
lyrata]
Length = 304
Score = 321 bits (822), Expect = 4e-85, Method: Compositional matrix adjust.
Identities = 163/315 (51%), Positives = 214/315 (67%), Gaps = 27/315 (8%)
Query: 40 EQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDL 99
E S +E HE+WM++ R Y D+ EK RF+IFK+NL+++E N N TYKL N+FSDL
Sbjct: 11 EASAIEKHEQWMSRFNRVYSDDSEKTSRFEIFKKNLKFVESFNMNTNNTYKLDVNKFSDL 70
Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGC 159
T++EF+A Y G + S + +F+Y+N+S T S+DWR + AVTP+KDQ +CGC
Sbjct: 71 TDEEFQARYMGLVPEGMTGDSQKTVSFRYENVSETG--ESMDWRLEGAVTPVKDQGQCGC 128
Query: 160 CWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNN-GCGGGTMEKAFEYIIQNQGIA 218
CWAF+AVAAVEG+TKI+ L+ LSEQQLVDCST NN GC GG A++YI +NQGI
Sbjct: 129 CWAFAAVAAVEGVTKIANGELVSLSEQQLVDCSTANNNMGCDGGLALTAYDYIKENQGIT 188
Query: 219 TEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFK 278
+E+ YPYQAVQ TC + AAA IS YE VP DE+ALLKAVS
Sbjct: 189 SEENYPYQAVQQTCKSTDPAAAT-ISGYEAVPKDDEEALLKAVS---------------- 231
Query: 279 SYKEGIF-NGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD---- 333
+ GIF + CGT HAVTIVG+GT+E+G YWL+KNSWG++WG+ GYM+I RD
Sbjct: 232 --QHGIFEDEYCGTDSHHAVTIVGYGTSEEGIKYWLLKNSWGESWGENGYMRIKRDVDEP 289
Query: 334 EGLCGIGTQSSYPLA 348
+G+CG+ ++ YP+A
Sbjct: 290 QGMCGLAHRAYYPVA 304
>gi|218202087|gb|EEC84514.1| hypothetical protein OsI_31214 [Oryza sativa Indica Group]
Length = 348
Score = 320 bits (821), Expect = 5e-85, Method: Compositional matrix adjust.
Identities = 150/330 (45%), Positives = 214/330 (64%), Gaps = 9/330 (2%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEY 77
+F I+ L C++ + + + + ++ HE+WMAQ+GR YKD+ EK RF++FK N +
Sbjct: 8 LFAILGCLCLCSAVLAARELSDDAAMAARHERWMAQYGRMYKDDAEKARRFEVFKANAAF 67
Query: 78 IEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
IE N GN + LG N+F+DLTNDEFR T + R T F+Y+N+++ +P
Sbjct: 68 IESFNA-GNHKFWLGVNQFADLTNDEFRLTKTNKGFIPSTTRVPTG--FRYENVNIDALP 124
Query: 138 TSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNG-N 196
++DWR K VTPIKDQ +CGCCWAFSAVAA+EGI K+S LI LSEQ+LVDC +G +
Sbjct: 125 ATMDWRTKGVVTPIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLISLSEQELVDCDVHGED 184
Query: 197 NGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQA 256
GC GG M+ AF++II+N G+ TE YPY A C + + A+ I YE+VP+ +E A
Sbjct: 185 QGCEGGLMDDAFKFIIKNGGLTTESNYPYAAADDKCKSVSNSVAS-IKGYEDVPANNEAA 243
Query: 257 LLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKN 316
L+KAV+ QPVS+ + F+ YK G+ G CGT LDH + +G+G DG YWL+KN
Sbjct: 244 LMKAVANQPVSVAVDGDDMTFQFYKGGVMIGSCGTDLDHGIVAIGYGKASDGTKYWLLKN 303
Query: 317 SWGDTWGDAGYMKILRD----EGLCGIGTQ 342
SWG TWG+ G++++ +D G+CG+ +
Sbjct: 304 SWGMTWGENGFLRMEKDISDKRGMCGLAME 333
>gi|357130141|ref|XP_003566711.1| PREDICTED: xylem cysteine proteinase 1-like [Brachypodium
distachyon]
Length = 457
Score = 320 bits (821), Expect = 5e-85, Method: Compositional matrix adjust.
Identities = 153/311 (49%), Positives = 215/311 (69%), Gaps = 10/311 (3%)
Query: 43 VVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTND 102
++E+ EKW+A+H ++Y EK RF++FK+NL++I+K N+E +Y LG N F+DLT++
Sbjct: 146 IIELFEKWLAKHQKAYASFEEKLHRFEVFKDNLKHIDKVNREVT-SYWLGLNEFADLTHE 204
Query: 103 EFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWA 162
EF+A Y G P+P+ S S FKY+++S D+P S+DWR K AVT +K+Q +CG CWA
Sbjct: 205 EFKATYLGLAPPAPARESRGS--FKYEDVSADDLPKSVDWRTKGAVTEVKNQGQCGSCWA 262
Query: 163 FSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDE 222
FS VAAVEGI I NL LSEQ+L+DCS +GNNGC GG M+ AF YI + G+ TE+
Sbjct: 263 FSTVAAVEGINAIVTGNLTALSEQELIDCSVDGNNGCNGGLMDYAFSYIASSGGLHTEEA 322
Query: 223 YPYQAVQGTCSAAQK--AAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSY 280
YPY +G+C +K + A IS YE+VP+ +EQAL+KA++ QPVS+ I A F+ Y
Sbjct: 323 YPYLMEEGSCGDGKKSESEAVTISGYEDVPAHNEQALIKALAHQPVSVAIEASGRHFQFY 382
Query: 281 KEGIFNGVCGTQLDHAVTIVGFGTTE-DGANYWLIKNSWGDTWGDAGYMKILR----DEG 335
G+F+G CGTQLDH V VG+G+ + G +Y +++NSWG WG+ GY+++ R EG
Sbjct: 383 SGGVFDGPCGTQLDHGVAAVGYGSDKGKGHDYIIVRNSWGAKWGEKGYIRMKRGTGKGEG 442
Query: 336 LCGIGTQSSYP 346
LCGI +SYP
Sbjct: 443 LCGINKMASYP 453
>gi|171702841|dbj|BAG16376.1| cysteine protease [Brassica rapa var. perviridis]
Length = 333
Score = 320 bits (820), Expect = 7e-85, Method: Compositional matrix adjust.
Identities = 158/329 (48%), Positives = 218/329 (66%), Gaps = 8/329 (2%)
Query: 18 MFIIIILLVSCASQVVSSRST-HEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLE 76
+F+I+ L+ S + + SR E ++ + H +WM +HGR Y D EK R+ +FK N+E
Sbjct: 2 IFLIVSLVSSFSLSITLSRPLLDEVAMQKRHAEWMTEHGRVYADANEKNNRYAVFKRNVE 61
Query: 77 YIEKANK-EGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD 135
IE+ N + T+KL N+F+DLTN+EFR++YTG+K S T ++F+YQN+S
Sbjct: 62 RIERLNDVQSGLTFKLAVNQFADLTNEEFRSMYTGFKGNSVLSSRTKPTSFRYQNVSSDA 121
Query: 136 VPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNG 195
+P S+DWR K AVTPIKDQ CG CWAFSAVAA+EG+ +I LI LSEQ+LVDC TN
Sbjct: 122 LPVSVDWRKKGAVTPIKDQGLCGSCWAFSAVAAIEGVAQIKKGKLISLSEQELVDCDTN- 180
Query: 196 NNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQ-KAAAAKISNYEEVPSGDE 254
+ GC GG M+ AF Y I G+ +E YPY++ GTC+ + K A I +E+VP+ DE
Sbjct: 181 DGGCMGGLMDTAFNYTITIGGLTSESNYPYKSTNGTCNFNKTKQIATSIKGFEDVPANDE 240
Query: 255 QALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLI 314
+AL+KAV+ PVSIGIA F+ Y G+F+G C T LDH VT VG+G +++G YW++
Sbjct: 241 KALMKAVAHHPVSIGIAGGDIGFQFYSSGVFSGECTTHLDHGVTAVGYGRSKNGLKYWIL 300
Query: 315 KNSWGDTWGDAGYMKILRD----EGLCGI 339
KNSWG WG+ GYM+I +D G CG+
Sbjct: 301 KNSWGPKWGERGYMRIKKDIKPKHGQCGL 329
>gi|356515044|ref|XP_003526211.1| PREDICTED: LOW QUALITY PROTEIN: thiol protease SEN102-like [Glycine
max]
Length = 337
Score = 320 bits (819), Expect = 8e-85, Method: Compositional matrix adjust.
Identities = 166/333 (49%), Positives = 217/333 (65%), Gaps = 14/333 (4%)
Query: 20 IIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIE 79
+ + LL+S V SR HE S+ E HE W+A++G+ YK EKE F+IFKEN+E+IE
Sbjct: 11 LALFLLLSIEISQVMSRKLHETSLREEHENWIARYGQVYKVAAEKET-FQIFKENVEFIE 69
Query: 80 KANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTS 139
N N+ YKLG N F+DLT +EF+ G K +H + + FKY+N+ TD+P +
Sbjct: 70 SFNAAANKPYKLGVNLFADLTLEEFKDFRFGLK---KTHEFSITP-FKYENV--TDIPEA 123
Query: 140 LDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNG-NNG 198
LDWR+K AVTPIKDQ +CG CWAFS VAA EGI +I+ NL+ L EQ+LV C T G + G
Sbjct: 124 LDWREKGAVTPIKDQGQCGSCWAFSTVAATEGIHQITTGNLVSLXEQELVSCDTKGVDQG 183
Query: 199 CGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAA-AAKISNYEEVPSGDEQAL 257
C GG ME FE+II+N GI T+ YPY+ V GTC+ A+ A+I YE VPS E+AL
Sbjct: 184 CEGGYMEDGFEFIIKNGGITTKANYPYKGVNGTCNTTIAASTVAQIKGYETVPSYSEEAL 243
Query: 258 LKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNS 317
KAV+ QPVS+ I A F Y GI+ G CGT LDH VT VG+GTT + +YW++KNS
Sbjct: 244 QKAVANQPVSVSIDANNGHFMFYAGGIYTGECGTDLDHGVTAVGYGTTNE-TDYWIVKNS 302
Query: 318 WGDTWGDAGYMKILR----DEGLCGIGTQSSYP 346
WG W + G++++ R GLCG+ SSYP
Sbjct: 303 WGTGWDEKGFIRMQRGITVKHGLCGVALDSSYP 335
>gi|60100207|gb|AAX13273.1| putative cysteine protease [Oryza sativa Japonica Group]
Length = 349
Score = 320 bits (819), Expect = 9e-85, Method: Compositional matrix adjust.
Identities = 155/316 (49%), Positives = 214/316 (67%), Gaps = 10/316 (3%)
Query: 42 SVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNR-TYKLGTNRFSDLT 100
++ + HE+WMA+HGR+Y D+ EK R ++F++N+ +IE N ++ + L N+F+DLT
Sbjct: 35 AMAQRHERWMAKHGRAYADDAEKARRLEVFRDNVAFIESVNAAASQHKFWLEENQFADLT 94
Query: 101 NDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCC 160
N EFRA TG + PS S + ++F+Y N+S D+P S+DWR K AV P+KDQ +CGCC
Sbjct: 95 NAEFRATRTGLR-PSSSRGNRAPTSFRYANVSTGDLPASVDWRGKGAVNPVKDQGDCGCC 153
Query: 161 WAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNG-NNGCGGGTMEKAFEYIIQNQGIAT 219
WAFSAVAA+EG K++ L+ LSEQQLV C G + GC GG M+ AF++II+N G+A
Sbjct: 154 WAFSAVAAMEGAVKLATGKLVSLSEQQLVSCDVKGEDQGCEGGLMDDAFDFIIKNGGLAA 213
Query: 220 EDEYPYQAVQGTC-SAAQKAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFK 278
E +YPY A C +A AAAA I YE+VP+ DE ALLKAV+ QPVS+ I F+
Sbjct: 214 ESDYPYTASDDKCATAGAGAAAATIKGYEDVPANDEAALLKAVANQPVSVAIDGGDRHFQ 273
Query: 279 SYKEGIFNGV--CGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILR---- 332
YK G+ +G C T+LDHA+T VG+G DG YWL+KNSWG +WG+ GY+++ R
Sbjct: 274 FYKGGVLSGAAGCATELDHAITAVGYGVASDGTKYWLMKNSWGTSWGEDGYVRMERGVAD 333
Query: 333 DEGLCGIGTQSSYPLA 348
EG+CG+ +SYP A
Sbjct: 334 KEGVCGLAMMASYPTA 349
>gi|357458909|ref|XP_003599735.1| Cysteine proteinase [Medicago truncatula]
gi|357474677|ref|XP_003607623.1| Cysteine proteinase [Medicago truncatula]
gi|355488783|gb|AES69986.1| Cysteine proteinase [Medicago truncatula]
gi|355508678|gb|AES89820.1| Cysteine proteinase [Medicago truncatula]
Length = 342
Score = 320 bits (819), Expect = 9e-85, Method: Compositional matrix adjust.
Identities = 162/339 (47%), Positives = 217/339 (64%), Gaps = 11/339 (3%)
Query: 16 IPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENL 75
IPMF+I + V+SSR E + HEKWM Q G+SYKD EKE RF+IFK N+
Sbjct: 9 IPMFLIFTTWM--LPYVMSSR-VLEPYLSNKHEKWMTQFGKSYKDAAEKEKRFQIFKNNV 65
Query: 76 EYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTG-YKMPSPSHRSTTSSTFKYQNLSMT 134
E+IE N GN+ + L N F+DLTN+EF+A G K+ +++F+Y N+ T
Sbjct: 66 EFIELFNAVGNKPFNLSINHFADLTNEEFKASLNGNKKLHDKFDILNETTSFRYHNV--T 123
Query: 135 DVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTN 194
VP S+DWR + AVTPIK+Q CG CWAFS VA++EGI +I+ L+ LSEQ+L+DC
Sbjct: 124 SVPASMDWRKRGAVTPIKNQGSCGSCWAFSTVASIEGIHQITTGELVSLSEQELIDCVRG 183
Query: 195 GNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAA-AAKISNYEEVPSGD 253
++GC GG +E AF++I + G+A+E YPY+ C +++ A+I YE+VPS
Sbjct: 184 NSSGCSGGYLEDAFKFIAKKGGMASETNYPYKETDEKCKFKKESKHVAEIKGYEKVPSNS 243
Query: 254 EQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWL 313
E LLKAV+ QPVS+ + A F+ Y GIF G CGT DH VTIVG+G + D YWL
Sbjct: 244 ENDLLKAVANQPVSVYVDAGDYVFQFYSGGIFTGKCGTDTDHVVTIVGYGVSLDYTEYWL 303
Query: 314 IKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYPLA 348
+KNSWG WG+ GYMK+ R+ +GLCGI T SYP+A
Sbjct: 304 VKNSWGTGWGEKGYMKLKRNVDSKKGLCGIATNPSYPVA 342
>gi|414588007|tpg|DAA38578.1| TPA: hypothetical protein ZEAMMB73_159244 [Zea mays]
Length = 307
Score = 318 bits (816), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 148/309 (47%), Positives = 207/309 (66%), Gaps = 9/309 (2%)
Query: 43 VVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTND 102
+ E HE+WMA++ R YKD EK RF++FK+N ++E N + + LG N+F+DLT +
Sbjct: 1 MAERHERWMAEYDRVYKDAAEKARRFEVFKDNFAFVESFNADKKNKFWLGVNQFADLTTE 60
Query: 103 EFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWA 162
EF+A G+K S TT FKY+NLS++ +PT++DWR K AVTPIK+Q +CGCCWA
Sbjct: 61 EFKA-NKGFKPISAEEVPTTG--FKYENLSVSALPTAVDWRTKGAVTPIKNQGQCGCCWA 117
Query: 163 FSAVAAVEGITKISGANLIQLSEQQLVDCST-NGNNGCGGGTMEKAFEYIIQNQGIATED 221
FSA+AA+EGI K+S NL+ LSEQ+ VDC T N + GC GG M+ AFE++I+N G+ATE
Sbjct: 118 FSAIAAMEGIVKLSTGNLVSLSEQEPVDCDTHNMDEGCEGGWMDNAFEFVIKNGGLATES 177
Query: 222 EYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYK 281
YPY+ V G C K+AA I +E+VP +E AL+K V+ QPVS+ + A F Y
Sbjct: 178 SYPYKVVDGKCKGGSKSAAT-IKGHEDVPPNNEAALMKVVASQPVSVAVDASDRTFMLYS 236
Query: 282 EGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD----EGLC 337
G+ G CGTQLDH + +G+G D YW++KNSWG TWG+ G++++ +D G+C
Sbjct: 237 GGVMTGSCGTQLDHGIAAIGYGVESDDTKYWILKNSWGTTWGEKGFLRMEKDISDKRGMC 296
Query: 338 GIGTQSSYP 346
+ + SYP
Sbjct: 297 DLAMKPSYP 305
>gi|4731374|gb|AAD28477.1|AF133839_1 papain-like cysteine protease [Sandersonia aurantiaca]
Length = 357
Score = 318 bits (815), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 152/318 (47%), Positives = 209/318 (65%), Gaps = 16/318 (5%)
Query: 40 EQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDL 99
E S+ ++E+W + H S +D +K+ RF +FKEN+++I + NK + T+KL N+F D+
Sbjct: 31 EDSLWSLYERWRSHHAVS-RDLDQKQKRFNVFKENVKFIHEFNKNKDVTFKLALNKFGDM 89
Query: 100 TNDEFRALYTGYK------MPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKD 153
TN EFRA Y G K M H S + + F Y+N P S+DWR++ AV +K+
Sbjct: 90 TNQEFRAKYAGSKVHHHRTMKGSRHGSGSGAKFMYENAV---APPSIDWRERGAVAAVKN 146
Query: 154 QQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQ 213
Q +CG CWAFSA+AAVEGI +I L+ LSEQ+L+DC T+ N GC GG M+ AFE+I
Sbjct: 147 QGQCGSCWAFSAIAAVEGINQIVTKELVPLSEQELIDCDTDQNQGCSGGLMDYAFEFIKN 206
Query: 214 NQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAY 273
N GI TED YPYQA TC + + A I YE+VP+ DE AL+KAV+ QPV++ I A
Sbjct: 207 NGGITTEDVYPYQAEDATCK--KNSPAVVIDGYEDVPTNDEDALMKAVANQPVAVAIEAS 264
Query: 274 TTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILR- 332
F+ Y EG+F G CGT+LDH V +VG+GTT+DG YW ++NSWG WG++GY+++ R
Sbjct: 265 GYVFQFYSEGVFTGRCGTELDHGVAVVGYGTTQDGTKYWTVRNSWGADWGESGYVRMQRG 324
Query: 333 ---DEGLCGIGTQSSYPL 347
GLCGI Q+SYP+
Sbjct: 325 IKATHGLCGIAMQASYPI 342
>gi|38346007|emb|CAD40110.2| OSJNBa0035O13.9 [Oryza sativa Japonica Group]
gi|125589429|gb|EAZ29779.1| hypothetical protein OsJ_13837 [Oryza sativa Japonica Group]
Length = 314
Score = 318 bits (815), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 155/315 (49%), Positives = 213/315 (67%), Gaps = 10/315 (3%)
Query: 43 VVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNR-TYKLGTNRFSDLTN 101
+ + HE+WMA+HGR+Y D+ EK R ++F++N+ +IE N ++ + L N+F+DLTN
Sbjct: 1 MAQRHERWMAKHGRAYADDAEKARRLEVFRDNVAFIESVNAAASQHKFWLEENQFADLTN 60
Query: 102 DEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCW 161
EFRA TG + PS S + ++F+Y N+S D+P S+DWR K AV P+KDQ +CGCCW
Sbjct: 61 AEFRATRTGLR-PSSSRGNRAPTSFRYANVSTGDLPASVDWRGKGAVNPVKDQGDCGCCW 119
Query: 162 AFSAVAAVEGITKISGANLIQLSEQQLVDCSTNG-NNGCGGGTMEKAFEYIIQNQGIATE 220
AFSAVAA+EG K++ L+ LSEQQLV C G + GC GG M+ AF++II+N G+A E
Sbjct: 120 AFSAVAAMEGAVKLATGKLVSLSEQQLVSCDVKGEDQGCEGGLMDDAFDFIIKNGGLAAE 179
Query: 221 DEYPYQAVQGTC-SAAQKAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKS 279
+YPY A C +A AAAA I YE+VP+ DE ALLKAV+ QPVS+ I F+
Sbjct: 180 SDYPYTASDDKCATAGAGAAAATIKGYEDVPANDEAALLKAVANQPVSVAIDGGDRHFQF 239
Query: 280 YKEGIFNGV--CGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILR----D 333
YK G+ +G C T+LDHA+T VG+G DG YWL+KNSWG +WG+ GY+++ R
Sbjct: 240 YKGGVLSGAAGCATELDHAITAVGYGVASDGTKYWLMKNSWGTSWGEDGYVRMERGVADK 299
Query: 334 EGLCGIGTQSSYPLA 348
EG+CG+ +SYP A
Sbjct: 300 EGVCGLAMMASYPTA 314
>gi|125547258|gb|EAY93080.1| hypothetical protein OsI_14881 [Oryza sativa Indica Group]
Length = 314
Score = 318 bits (814), Expect = 3e-84, Method: Compositional matrix adjust.
Identities = 155/315 (49%), Positives = 213/315 (67%), Gaps = 10/315 (3%)
Query: 43 VVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNR-TYKLGTNRFSDLTN 101
+ + HE+WMA+HGR+Y D+ EK R ++F++N+ +IE N ++ + L N+F+DLTN
Sbjct: 1 MAQRHERWMAKHGRAYADDAEKVRRLEVFRDNVAFIESVNAAASQHKFWLEENQFADLTN 60
Query: 102 DEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCW 161
EFRA TG + PS S + ++F+Y N+S D+P S+DWR K AV P+KDQ +CGCCW
Sbjct: 61 AEFRATRTGLR-PSSSRGNRAPTSFRYANVSTGDLPASVDWRGKGAVNPVKDQGDCGCCW 119
Query: 162 AFSAVAAVEGITKISGANLIQLSEQQLVDCSTNG-NNGCGGGTMEKAFEYIIQNQGIATE 220
AFSAVAA+EG K++ L+ LSEQQLV C G + GC GG M+ AF++II+N G+A E
Sbjct: 120 AFSAVAAMEGAVKLATGKLVSLSEQQLVSCDVKGEDQGCEGGLMDDAFDFIIKNGGLAAE 179
Query: 221 DEYPYQAVQGTC-SAAQKAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKS 279
+YPY A C +A AAAA I YE+VP+ DE ALLKAV+ QPVS+ I F+
Sbjct: 180 SDYPYTASDDKCATAGAGAAAATIKGYEDVPANDEAALLKAVANQPVSVAIDGGDRHFQF 239
Query: 280 YKEGIFNGV--CGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILR----D 333
YK G+ +G C T+LDHA+T VG+G DG YWL+KNSWG +WG+ GY+++ R
Sbjct: 240 YKGGVLSGAAGCATELDHAITAVGYGVASDGTKYWLMKNSWGTSWGEDGYVRMERGVADK 299
Query: 334 EGLCGIGTQSSYPLA 348
EG+CG+ +SYP A
Sbjct: 300 EGVCGLAMMASYPTA 314
>gi|46401612|dbj|BAD16614.1| cysteine proteinase [Dianthus caryophyllus]
Length = 459
Score = 318 bits (814), Expect = 3e-84, Method: Compositional matrix adjust.
Identities = 154/344 (44%), Positives = 231/344 (67%), Gaps = 12/344 (3%)
Query: 14 NTIPMFIIIILLVSCASQVVS----SRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFK 69
+TI + II +VS ++ +S + + + + ++E W+ +HG++Y EK++RF
Sbjct: 6 STIFLLFSIIFIVSSSALDLSIIDRAFNRPDDEIASLYETWLVKHGKNYNGLGEKQLRFN 65
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPS-HRSTTSSTFKY 128
IFK+NL ++++ N E N ++KLG NRF+DLTN+E+R++Y G + S + RS S + +Y
Sbjct: 66 IFKDNLRFVDERNSE-NLSFKLGLNRFADLTNEEYRSVYLGTRPRSVAVARSGRSKSDRY 124
Query: 129 QNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQL 188
+ +P S+DWR K AV IKDQ CG CWAFSA+AAVEG+ +I +LI LSEQ+L
Sbjct: 125 AFRAGDTLPESVDWRKKGAVAGIKDQGSCGSCWAFSAIAAVEGVNQIVTGDLISLSEQEL 184
Query: 189 VDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQK-AAAAKISNYE 247
V+C T+ N+GC GG M+ AFE+II+N+GI ++++YPY G C +K A I +YE
Sbjct: 185 VECDTSYNDGCDGGLMDYAFEFIIKNEGIDSDEDYPYTGRDGRCDTNRKNAKVVTIDDYE 244
Query: 248 EVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTED 307
+ P DE++L KAV+ QPVS+ I +F+ Y G+F G CGT LDH V +VG+G TED
Sbjct: 245 DSPVYDEKSLQKAVANQPVSVAIEGGGRDFQLYDSGVFTGKCGTALDHGVAVVGYG-TED 303
Query: 308 GANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYPL 347
G +YW+++NSWGDTWG+ GY+++ R+ G+CGI + SYP+
Sbjct: 304 GLDYWIVRNSWGDTWGEGGYIRMQRNTKLPSGICGIAIEPSYPI 347
>gi|350538043|ref|NP_001234324.1| cysteine protease TDI-65 precursor [Solanum lycopersicum]
gi|5726641|gb|AAD48496.1|AF172856_1 cysteine protease TDI-65 [Solanum lycopersicum]
gi|2828252|emb|CAA05894.1| CYP1 [Solanum lycopersicum]
Length = 466
Score = 317 bits (813), Expect = 4e-84, Method: Compositional matrix adjust.
Identities = 154/351 (43%), Positives = 223/351 (63%), Gaps = 11/351 (3%)
Query: 7 RSGSFKINTIPMFIIIILLVSCASQVVSSRSTH-----EQSVVEMHEKWMAQHGRSYKDE 61
S + I+ + M I L + ++S TH + V ++E W+ +HG+SY
Sbjct: 4 HSSTLTISILLMLIFSTLSSASDMSIISYDETHIHRRTDDEVSALYESWLIEHGKSYNAL 63
Query: 62 LEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRST 121
EK+ RF+IFK+NL YI++ N N++YKLG +F+DLTN+E+R++Y G K + +
Sbjct: 64 GEKDKRFQIFKDNLRYIDEQNSVPNQSYKLGLTKFADLTNEEYRSIYLGTKSSGDRKKLS 123
Query: 122 TSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLI 181
+ + +Y +P S+DWR+K + +KDQ CG CWAFSAVAA+E I I NLI
Sbjct: 124 KNKSDRYLPKVGDSLPESIDWREKGVLVGVKDQGSCGSCWAFSAVAAMESINAIVTGNLI 183
Query: 182 QLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQK-AAA 240
LSEQ+LVDC + N GC GG M+ AFE++I+N GI TE++YPY+ G C +K A
Sbjct: 184 SLSEQELVDCDRSYNEGCDGGLMDYAFEFVIKNGGIDTEEDYPYKERNGVCDQYRKNAKV 243
Query: 241 AKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIV 300
KI +YE+VP +E+AL KAV+ QPVSI + A +F+ YK GIF G CGT +DH V I
Sbjct: 244 VKIDSYEDVPVNNEKALQKAVAHQPVSIALEAGGRDFQHYKSGIFTGKCGTAVDHGVVIA 303
Query: 301 GFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYPL 347
G+G TE+G +YW+++NSWG WG+ GY+++ R+ GLCG+ + SYP+
Sbjct: 304 GYG-TENGMDYWIVRNSWGANWGENGYLRVQRNVASSSGLCGLAIEPSYPV 353
>gi|374713649|gb|AEZ65082.1| cysteine protease [Carica papaya]
Length = 471
Score = 317 bits (813), Expect = 4e-84, Method: Compositional matrix adjust.
Identities = 156/353 (44%), Positives = 226/353 (64%), Gaps = 18/353 (5%)
Query: 11 FKINTIPMFIIIILLVSCASQVV----------SSRSTHEQSVVEMHEKWMAQHGRSYKD 60
F++ F+ ++ +S AS + S E +++M+E W+ +HG++Y
Sbjct: 6 FRLCIAISFLFMVFSLSLASMSIIDYDLPADPLQSTERTEAHMMKMYEHWLVKHGKNYNA 65
Query: 61 ELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSH-R 119
EKE RF+IFK+NL ++++ N RTYKLG +F+DLTN+E+RA+Y G KM R
Sbjct: 66 IGEKERRFEIFKDNLRFVDEQNSVPGRTYKLGLTKFADLTNEEYRAMYLGAKMEKKEKLR 125
Query: 120 STTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGAN 179
+ S + ++ + D+P+ +DWR+K AVT +KDQ +CG CWAFS V +VEGI +I +
Sbjct: 126 TERSQRYLHKAGNDDDLPSHVDWREKGAVTEVKDQGQCGSCWAFSTVGSVEGINQIVTGD 185
Query: 180 LIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQK-A 238
LI LSEQ+LVDC N GC GG M+ AFE+II+N GI +E +YPY+A C + +K A
Sbjct: 186 LISLSEQELVDCDKAYNQGCNGGLMDYAFEFIIKNGGIDSEADYPYRASDNMCDSNRKNA 245
Query: 239 AAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVT 298
I YE+VP DE++L KAV+ QPVS+ I A EF+ Y+ G+F G CGT LDH V
Sbjct: 246 HVVTIDGYEDVPENDEESLKKAVANQPVSVAIEAGGREFQLYQSGVFTGRCGTNLDHGVV 305
Query: 299 IVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILR-----DEGLCGIGTQSSYP 346
VG+G TE+G +YW+++NSWG WG++GY+++ R D G CGI ++SYP
Sbjct: 306 AVGYG-TENGIDYWIVRNSWGPKWGESGYIRMERNVASTDTGKCGIAMEASYP 357
>gi|224133760|ref|XP_002321654.1| predicted protein [Populus trichocarpa]
gi|222868650|gb|EEF05781.1| predicted protein [Populus trichocarpa]
Length = 362
Score = 317 bits (813), Expect = 5e-84, Method: Compositional matrix adjust.
Identities = 158/341 (46%), Positives = 216/341 (63%), Gaps = 14/341 (4%)
Query: 19 FIIIILLVSCASQVVSSRSTHE------QSVVEMHEKWMAQHGRSYKDELEKEMRFKIFK 72
F+ + L ++ + S HE +S+ +++E+W + H S + EK RF +FK
Sbjct: 6 FLFVALSLALVLGITESLDFHEKDLESEESLWDLYERWRSHHTVSTSLD-EKHKRFNVFK 64
Query: 73 ENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPS-HRSTTSSTFKYQNL 131
EN+ ++ K NK G + YKL N+F+D+TN EFR++Y G K+ R TT +
Sbjct: 65 ENVMHVHKTNKMG-KPYKLKLNKFADMTNHEFRSVYAGSKVKHHRMFRGTTRGNGSFMYG 123
Query: 132 SMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDC 191
+ VPTS+DWR K AVT +KDQ +CG CWAFS + AVEGI I L+ LSEQ+LVDC
Sbjct: 124 KVEKVPTSVDWRKKGAVTAVKDQGQCGSCWAFSTIVAVEGINYIKTNELVSLSEQELVDC 183
Query: 192 STNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQK-AAAAKISNYEEVP 250
T N GC GG ME AFE+I + +GI TE YPY+A G C AA++ A I YE+VP
Sbjct: 184 DTTENQGCNGGLMEYAFEFIKKKRGITTESTYPYKAEDGHCDAAKENNPAVSIDGYEKVP 243
Query: 251 SGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGAN 310
DE ALLKA + QPVS+ I A ++F+ Y EG+F G CGT+LDH V +VG+GTT DG
Sbjct: 244 ENDEDALLKAAANQPVSVAIDAGGSDFQFYSEGVFIGECGTELDHGVAVVGYGTTLDGTK 303
Query: 311 YWLIKNSWGDTWGDAGYMKILR----DEGLCGIGTQSSYPL 347
YW+++NSWG WG+ GY+++ R EGLCGI ++SYP+
Sbjct: 304 YWIVRNSWGPEWGEKGYIRMQRGISDKEGLCGIAMEASYPI 344
>gi|194352750|emb|CAQ00103.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
gi|326514262|dbj|BAJ92281.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326519402|dbj|BAJ96700.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326524351|dbj|BAK00559.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326531998|dbj|BAK01375.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 356
Score = 317 bits (812), Expect = 5e-84, Method: Compositional matrix adjust.
Identities = 152/313 (48%), Positives = 216/313 (69%), Gaps = 10/313 (3%)
Query: 41 QSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLT 100
+ +VE+ EKW+A+H ++Y EK RF++FK+NL++I+K N+E +Y LG N F+DLT
Sbjct: 43 ERLVELFEKWLAKHQKAYASFEEKLHRFEVFKDNLKHIDKINREVT-SYWLGLNEFADLT 101
Query: 101 NDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCC 160
+DEF+A Y G + + R +S +F+Y+++S +D+P S+DWR K AVT +K+Q +CG C
Sbjct: 102 HDEFKAAYLG--LDAAPARRGSSRSFRYEDVSASDLPKSVDWRKKGAVTEVKNQGQCGSC 159
Query: 161 WAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATE 220
WAFS VAAVEGI I NL LSEQ+L+DCS +GN+GC GG M+ AF YI + G+ TE
Sbjct: 160 WAFSTVAAVEGINAIVTGNLTALSEQELIDCSVDGNSGCNGGLMDYAFSYIASSGGLHTE 219
Query: 221 DEYPYQAVQGTCSAAQKA--AAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFK 278
+ YPY +G+C +KA A IS YE+VP+ DEQAL+KA++ QPVS+ I A F+
Sbjct: 220 EAYPYLMEEGSCGDGKKAESEAVTISGYEDVPANDEQALIKALAHQPVSVAIEASGRHFQ 279
Query: 279 SYKEGIFNGVCGTQLDHAVTIVGFGTTE-DGANYWLIKNSWGDTWGDAGYMKILR----D 333
Y G+F+G CG QLDH V VG+G+ + G +Y +++NSWG WG+ GY+++ R
Sbjct: 280 FYSGGVFDGPCGAQLDHGVAAVGYGSDKGKGHDYIIVRNSWGAQWGEKGYIRMKRGTSNG 339
Query: 334 EGLCGIGTQSSYP 346
EGLCGI +SYP
Sbjct: 340 EGLCGINKMASYP 352
>gi|414585111|tpg|DAA35682.1| TPA: cysteine proteinase Mir3 [Zea mays]
Length = 468
Score = 317 bits (812), Expect = 6e-84, Method: Compositional matrix adjust.
Identities = 155/324 (47%), Positives = 210/324 (64%), Gaps = 12/324 (3%)
Query: 32 VVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKAN---KEGNRT 88
+VS ++ M+ +WMA HGR+Y E+E R+++F++NL YI+ N G +
Sbjct: 31 IVSYGERSDEEARRMYAEWMAAHGRTYNAVGEEERRYQVFRDNLRYIDAHNAAADAGVHS 90
Query: 89 YKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAV 148
++LG NRF+DLTNDE+RA Y G + R + +Y D+P S+DWR K AV
Sbjct: 91 FRLGLNRFADLTNDEYRATYLGARTRPQRERKLGA---RYHAADNEDLPESVDWRAKGAV 147
Query: 149 TPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAF 208
+KDQ CG CWAFS +AAVEGI +I +LI LSEQ+LVDC T+ N GC GG M+ AF
Sbjct: 148 AEVKDQGSCGSCWAFSTIAAVEGINQIVTGDLISLSEQELVDCDTSYNQGCNGGLMDYAF 207
Query: 209 EYIIQNQGIATEDEYPYQAVQGTCSAAQK-AAAAKISNYEEVPSGDEQALLKAVSMQPVS 267
E+II N GI TE +YPY+ G C +K A I +YE+VP+ DE++L KAV+ QPVS
Sbjct: 208 EFIINNGGIDTEKDYPYKGTDGRCDVNRKNAKVVTIDSYEDVPANDEKSLQKAVANQPVS 267
Query: 268 IGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGY 327
+ I A T F+ Y GIF G CGT LDH VT VG+G TE+G +YW++KNSWG +WG++GY
Sbjct: 268 VAIEAAGTAFQLYSSGIFTGSCGTALDHGVTAVGYG-TENGKDYWIVKNSWGSSWGESGY 326
Query: 328 MKILRD----EGLCGIGTQSSYPL 347
+++ R+ G CGI + SYPL
Sbjct: 327 VRMERNIKASSGKCGIAVEPSYPL 350
>gi|225428879|ref|XP_002285299.1| PREDICTED: cysteine proteinase RD21a-like [Vitis vinifera]
Length = 469
Score = 317 bits (812), Expect = 6e-84, Method: Compositional matrix adjust.
Identities = 151/314 (48%), Positives = 212/314 (67%), Gaps = 8/314 (2%)
Query: 40 EQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDL 99
+ V+ ++E W+ +HG+SY E+E RF+IFK+NL +IE+ N NRTYK+G NRF+DL
Sbjct: 47 DAEVMAVYEAWLVKHGKSYNALGERERRFEIFKDNLRFIEEHNAV-NRTYKVGLNRFADL 105
Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGC 159
TN+E+R+ Y G + + + + +Y + D+P S+DWR+K AV P+KDQ CG
Sbjct: 106 TNEEYRSRYLGRRDETRRGLRASRVSDRYSFRAGEDLPESVDWREKGAVVPVKDQGNCGS 165
Query: 160 CWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIAT 219
CWAFS +AAVEGI +I+ +LI LSEQ+LVDC + N GC GG M+ AFE+II N GI +
Sbjct: 166 CWAFSTIAAVEGINQIATGDLISLSEQELVDCDKSYNQGCNGGLMDYAFEFIINNGGIDS 225
Query: 220 EDEYPYQAVQGTCSAAQK-AAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFK 278
E++YPY+A TC +K A I YE+VP DE++L KAV+ QPVS+ I A F+
Sbjct: 226 EEDYPYRAADTTCDPNRKNARVVSIDGYEDVPQNDERSLKKAVANQPVSVAIEAGGRAFQ 285
Query: 279 SYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILR-----D 333
Y+ G+F G CGTQLDH V VG+G TE+ +YW+++NSWG WG++GY+K+ R +
Sbjct: 286 LYQSGVFTGQCGTQLDHGVVAVGYG-TENSVDYWIVRNSWGPNWGESGYIKLERNLAGTE 344
Query: 334 EGLCGIGTQSSYPL 347
G CGI + SYP+
Sbjct: 345 TGKCGIAIEPSYPI 358
>gi|356545116|ref|XP_003540991.1| PREDICTED: vignain-like [Glycine max]
Length = 342
Score = 317 bits (812), Expect = 6e-84, Method: Compositional matrix adjust.
Identities = 154/336 (45%), Positives = 217/336 (64%), Gaps = 8/336 (2%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEY 77
+ ++ ++L SQV+S R + S V+ HEKWMAQ+G+ YKD EKE RF+IFK N+ +
Sbjct: 10 ILVVFLVLTVWTSQVMSRRLSEAYSSVK-HEKWMAQYGKVYKDAAEKEKRFQIFKNNVHF 68
Query: 78 IEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
IE + G++ + L N+F+DL +F+AL + + R+ T++ ++ S+T +P
Sbjct: 69 IESFHAAGDKPFNLSINQFADL--HKFKALLINGQKKEHNVRTATATEASFKYDSVTRIP 126
Query: 138 TSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNN 197
+SLDWR + AVTPIKDQ C CWAFS VA +EG+ +I+ L+ LSEQ+LVDC +
Sbjct: 127 SSLDWRKRGAVTPIKDQGTCRSCWAFSTVATIEGLHQITKGELVSLSEQELVDCVKGDSE 186
Query: 198 GCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKA-AAAKISNYEEVPSGDEQA 256
GC GG +E AFE+I + G+A+E YPY+ V TC ++ +I YE+VPS E+A
Sbjct: 187 GCYGGYVEDAFEFIAKKGGVASETHYPYKGVNKTCKVKKETHGVVQIKGYEQVPSNSEKA 246
Query: 257 LLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKN 316
LLKAV+ QPVS + A F+ Y GIF G CGT +DH+VT+VG+G G YWL+KN
Sbjct: 247 LLKAVAHQPVSAYVEAGGYAFQFYSSGIFTGKCGTDIDHSVTVVGYGKARGGNKYWLVKN 306
Query: 317 SWGDTWGDAGYMKILRD----EGLCGIGTQSSYPLA 348
SWG WG+ GY+++ RD EGLCGI T + YP A
Sbjct: 307 SWGTEWGEKGYIRMKRDIRAKEGLCGIATGALYPTA 342
>gi|224056176|ref|XP_002298740.1| predicted protein [Populus trichocarpa]
gi|222845998|gb|EEE83545.1| predicted protein [Populus trichocarpa]
Length = 455
Score = 317 bits (811), Expect = 7e-84, Method: Compositional matrix adjust.
Identities = 150/313 (47%), Positives = 207/313 (66%), Gaps = 7/313 (2%)
Query: 40 EQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDL 99
E ++E W+ +HGR+Y EKE RF+IFK+NL++I++ N GN +YKLG N+F+DL
Sbjct: 18 EAETRRIYEMWLVKHGRAYNALGEKERRFEIFKDNLKFIDEHNSVGNPSYKLGLNKFADL 77
Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGC 159
+NDE+R++Y G +M + +Y D+P ++DWR+K AV P+KDQ +CG
Sbjct: 78 SNDEYRSVYLGTRMDGKGRLLGGPKSERYLFKEGDDLPETVDWREKGAVAPVKDQGQCGS 137
Query: 160 CWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIAT 219
CWAFS V AVEGI +I NL LSEQ+LVDC N GC GG M+ AF++II+N GI T
Sbjct: 138 CWAFSTVGAVEGINQIVTGNLTSLSEQELVDCDKTYNLGCNGGLMDYAFDFIIENGGIDT 197
Query: 220 EDEYPYQAVQGTCSAAQK-AAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFK 278
E++YPY+A+ C +K A I YE+VP DE++L KAV+ QPVS+ I A F+
Sbjct: 198 EEDYPYKAIDSMCDPNRKNARVVTIDGYEDVPQNDEKSLKKAVANQPVSVAIEAGGRGFQ 257
Query: 279 SYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD----- 333
Y+ G+F G CGTQLDH V VG+G TE G +YW+++NSWG WG+ GY+++ RD
Sbjct: 258 LYQSGVFTGSCGTQLDHGVVTVGYG-TEHGVDYWIVRNSWGPAWGENGYIRMERDVASTE 316
Query: 334 EGLCGIGTQSSYP 346
G CGI ++SYP
Sbjct: 317 TGKCGIAMEASYP 329
>gi|363814535|ref|NP_001242660.1| uncharacterized protein LOC100807362 precursor [Glycine max]
gi|255636658|gb|ACU18666.1| unknown [Glycine max]
Length = 367
Score = 317 bits (811), Expect = 7e-84, Method: Compositional matrix adjust.
Identities = 163/349 (46%), Positives = 224/349 (64%), Gaps = 20/349 (5%)
Query: 15 TIPMFIIIILLVSCA--SQVVSSRSTH--------EQSVVEMHEKWMAQHGRSYKDELEK 64
TI + + +L VS A ++S +H ++ V+ ++E+W+ +HG+ Y EK
Sbjct: 10 TILIVLFTVLAVSSALDMSIISYDRSHADKSGWKSDEEVMSIYEEWLVKHGKVYNAVEEK 69
Query: 65 EMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSS 124
E RF+IFK+NL +IE+ N NRTYK+G NRFSDL+N+E+R+ Y G K+ PS R
Sbjct: 70 EKRFQIFKDNLNFIEEHNAV-NRTYKVGLNRFSDLSNEEYRSKYLGTKI-DPS-RMMARP 126
Query: 125 TFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLS 184
+ +Y ++P S+DWR + AV +K+Q EC CWAFSA+AAVEGI KI NL LS
Sbjct: 127 SRRYSPRVADNLPESVDWRKEGAVVRVKNQSECEGCWAFSAIAAVEGINKIVTGNLTALS 186
Query: 185 EQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQ-KAAAAKI 243
EQ+L+DC N GC GG ++ AFE+II N GI TE++YP+Q G C + A A I
Sbjct: 187 EQELLDCDRTVNAGCSGGLVDYAFEFIINNGGIDTEEDYPFQGADGICDQYKINARAVTI 246
Query: 244 SNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFG 303
YE VP+ DE AL KAV+ QPVS+ I AY EF+ Y+ GIF G CGT +DH VT VG+G
Sbjct: 247 DGYERVPAYDELALKKAVANQPVSVAIEAYGKEFQLYESGIFTGTCGTSIDHGVTAVGYG 306
Query: 304 TTEDGANYWLIKNSWGDTWGDAGYMKILRD-----EGLCGIGTQSSYPL 347
TE+G +YW++KNSWG+ WG+AGY+ + R+ G CGI + YP+
Sbjct: 307 -TENGIDYWIVKNSWGENWGEAGYVGMERNIAEDTAGKCGIAILTLYPI 354
>gi|413933049|gb|AFW67600.1| cysteine protease 1 [Zea mays]
Length = 341
Score = 316 bits (810), Expect = 9e-84, Method: Compositional matrix adjust.
Identities = 156/338 (46%), Positives = 217/338 (64%), Gaps = 11/338 (3%)
Query: 19 FIIIILLVS----CASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKEN 74
F++ +L+V C + + + ++ HEKWMA+HGR+YKDE EK R ++F+ N
Sbjct: 6 FLLAVLVVGSAVLCTAAAPRALAAAAAAMASRHEKWMAEHGRAYKDEAEKARRLEVFRAN 65
Query: 75 LEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT 134
E I+ N G +++L TNRF+DLT +EFRA TG + P P+ S + F+Y+N S+
Sbjct: 66 AELIDSFNAAGTHSHRLATNRFADLTVEEFRAARTGLR-PRPAP-SAGAGRFRYENFSLA 123
Query: 135 DVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTN 194
D S+DWR AVT +KDQ CGCCWAFSAVAAVEG+ KI L+ LSEQ+LVDC +
Sbjct: 124 DAAQSVDWRAMGAVTGVKDQGACGCCWAFSAVAAVEGLNKIRTGRLVSLSEQELVDCDVS 183
Query: 195 G-NNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAK-ISNYEEVPSG 252
G + GC GG M+ AF+++ + G+A+E YPYQ G C ++ AA A I +E+VP
Sbjct: 184 GVDQGCDGGLMDNAFQFVARRGGLASESGYPYQGRDGPCRSSAAAARAASIRGHEDVPRN 243
Query: 253 DEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYW 312
+E AL AV+ QPVS+ I F+ Y G+ G CGT L+HA+T VG+GT DG YW
Sbjct: 244 NEAALAAAVANQPVSVAINGEDMAFRFYDSGVLGGACGTDLNHAITAVGYGTANDGTRYW 303
Query: 313 LIKNSWGDTWGDAGYMKI---LRDEGLCGIGTQSSYPL 347
L+KNSWG +WG+ GY++I +R EG+CG+ SYP+
Sbjct: 304 LMKNSWGASWGEGGYVRIRRGVRGEGVCGLAKLPSYPV 341
>gi|226504984|ref|NP_001151293.1| cysteine protease 1 precursor [Zea mays]
gi|195645596|gb|ACG42266.1| cysteine protease 1 precursor [Zea mays]
Length = 340
Score = 316 bits (810), Expect = 9e-84, Method: Compositional matrix adjust.
Identities = 156/337 (46%), Positives = 216/337 (64%), Gaps = 10/337 (2%)
Query: 19 FIIIILLVS----CASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKEN 74
F++ +L+V C + + + ++ HEKWMA+HGR+YKDE EK R ++F+ N
Sbjct: 6 FLLAVLVVGSAVLCTAAAPRALAAAAAAMASRHEKWMAEHGRAYKDEAEKARRLEVFRAN 65
Query: 75 LEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT 134
E I+ N G +++L TNRF+DLT EFRA TG + P P+ S + F+Y+N S+
Sbjct: 66 AELIDSFNAAGTHSHRLATNRFADLTVQEFRAARTGLR-PRPAP-SAGAGRFRYENFSLA 123
Query: 135 DVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTN 194
D S+DWR AVT +KDQ GCCWAFSAVAAVEG+ KI L+ LSEQ+LVDC +
Sbjct: 124 DAAQSVDWRAMGAVTGVKDQGASGCCWAFSAVAAVEGLNKIRTGRLVSLSEQELVDCDVS 183
Query: 195 G-NNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGD 253
G + GC GG M+ AF+++ + G+A+E YPYQ G C ++ AAAA I +E+VP +
Sbjct: 184 GVDQGCDGGLMDNAFQFVARRGGLASESGYPYQCRDGPCRSSAAAAAASIRGHEDVPRNN 243
Query: 254 EQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWL 313
E AL AV+ QPVS+ I F+ Y G+ G CGT L+HA+T VG+GT DG YWL
Sbjct: 244 EAALAAAVAHQPVSVAINGEDMAFRFYDSGVLGGACGTDLNHAITAVGYGTAADGTRYWL 303
Query: 314 IKNSWGDTWGDAGYMKI---LRDEGLCGIGTQSSYPL 347
+KNSWG +WG+ GY++I +R EG+CG+ SYP+
Sbjct: 304 MKNSWGASWGEGGYVRIRRGVRGEGVCGLAKLPSYPV 340
>gi|242086591|ref|XP_002439128.1| hypothetical protein SORBIDRAFT_09g000960 [Sorghum bicolor]
gi|241944413|gb|EES17558.1| hypothetical protein SORBIDRAFT_09g000960 [Sorghum bicolor]
Length = 371
Score = 316 bits (810), Expect = 9e-84, Method: Compositional matrix adjust.
Identities = 153/315 (48%), Positives = 213/315 (67%), Gaps = 10/315 (3%)
Query: 38 THEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFS 97
H ++++ E+W+A++ ++Y EK RF++FK+NL +I++ANK+ TY LG N F+
Sbjct: 57 VHHDRLIKLFEEWVAKYRKAYASFEEKLHRFEVFKDNLHHIDEANKKVT-TYWLGLNAFA 115
Query: 98 DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQEC 157
DLT+DEF+A Y G + P + TT S F+Y ++ DVP S+DWR K AVT +K+Q +C
Sbjct: 116 DLTHDEFKATYLGLRQPET--KKTTDSRFRYGGVADDDVPASVDWRKKGAVTDVKNQGQC 173
Query: 158 GCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGI 217
G CWAFS VAAVEGI +I NL LSEQ+LVDCST+GNNGC GG M+ AF YI + G+
Sbjct: 174 GSCWAFSTVAAVEGINQIVTGNLTSLSEQELVDCSTDGNNGCNGGVMDNAFSYIASSGGL 233
Query: 218 ATEDEYPYQAVQGTCS--AAQKAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTT 275
TE+ YPY +G C A IS YE+VP+ DEQAL+KA++ QP+S+ I A
Sbjct: 234 RTEEAYPYLMEEGDCDDKARDGEQVVTISGYEDVPANDEQALVKALAHQPLSVAIEASGR 293
Query: 276 EFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILR--- 332
F+ Y G+FNG CG++LDH V VG+G+++ G +Y ++KNSWG WG+ GY+++ R
Sbjct: 294 HFQFYSGGVFNGPCGSELDHGVAAVGYGSSK-GQDYIIVKNSWGSHWGEKGYIRMKRGTG 352
Query: 333 -DEGLCGIGTQSSYP 346
EGLCGI +SYP
Sbjct: 353 KPEGLCGINKMASYP 367
>gi|13897890|gb|AAK48495.1|AF259983_1 putative cysteine protease [Ipomoea batatas]
Length = 462
Score = 316 bits (810), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 155/316 (49%), Positives = 209/316 (66%), Gaps = 7/316 (2%)
Query: 37 STHEQSVVEMHEKWMAQHGRSYKD-ELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNR 95
S ++ V+ ++E W+ +HG+SY EK+ RF+IFK+NL YI++ N G+R+YKLG NR
Sbjct: 39 SRSDEEVMALYESWLVEHGKSYNGLGGEKDKRFEIFKDNLRYIDEQNSRGDRSYKLGLNR 98
Query: 96 FSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQ 155
F+DLTN+E+R+ Y G K + + T S +Y + +P S+DWR+K AV +KDQ
Sbjct: 99 FADLTNEEYRSTYLGAKTDARRRIAKTKSDRRYAPKAGGSLPDSIDWREKGAVAEVKDQG 158
Query: 156 ECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQ 215
CG CWAFS +AAVEGI +I LI LSEQ+LVDC T+ N GC GG M+ AFE+II+N
Sbjct: 159 SCGSCWAFSTIAAVEGINQIVTGELISLSEQELVDCDTSYNEGCNGGLMDYAFEFIIKNG 218
Query: 216 GIATEDEYPYQAVQGTCSAAQK-AAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYT 274
GI TE +YPY G C +K A I YE+V DE AL +AV+ QPVS+ I A
Sbjct: 219 GIDTEADYPYTGRYGRCDQTRKNAKVVSIDGYEDVTPYDEAALKEAVAGQPVSVAIEAGG 278
Query: 275 TEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD- 333
+F+ Y GIF G CGT LDH VT VG+G TE+G +YW++KNSW +WG+ GY+++ R+
Sbjct: 279 RDFQLYSSGIFTGSCGTDLDHGVTAVGYG-TENGVDYWIVKNSWAASWGEKGYLRMQRNV 337
Query: 334 ---EGLCGIGTQSSYP 346
GLCGI + SYP
Sbjct: 338 KDKNGLCGIAIEPSYP 353
>gi|226495425|ref|NP_001148706.1| cysteine protease 1 precursor [Zea mays]
gi|195621544|gb|ACG32602.1| cysteine protease 1 precursor [Zea mays]
Length = 463
Score = 316 bits (810), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 155/324 (47%), Positives = 209/324 (64%), Gaps = 12/324 (3%)
Query: 32 VVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKAN---KEGNRT 88
+VS + M+ +WMA HGR+Y E+E R+++F++NL YI+ N G +
Sbjct: 26 IVSYGERSXEEARRMYAEWMAAHGRTYNAVGEEERRYQVFRDNLRYIDAHNAAADAGVHS 85
Query: 89 YKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAV 148
++LG NRF+DLTNDE+RA Y G + R + +Y D+P S+DWR K AV
Sbjct: 86 FRLGLNRFADLTNDEYRATYLGARTRPQRERKLGA---RYHAADNEDLPESVDWRAKGAV 142
Query: 149 TPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAF 208
+KDQ CG CWAFS +AAVEGI +I +LI LSEQ+LVDC T+ N GC GG M+ AF
Sbjct: 143 AEVKDQGSCGSCWAFSTIAAVEGINQIVTGDLISLSEQELVDCDTSYNQGCNGGLMDYAF 202
Query: 209 EYIIQNQGIATEDEYPYQAVQGTCSAAQK-AAAAKISNYEEVPSGDEQALLKAVSMQPVS 267
E+II N GI TE +YPY+ G C +K A I +YE+VP+ DE++L KAV+ QPVS
Sbjct: 203 EFIINNGGIDTEKDYPYKGTDGRCDVNRKNAKVVTIDSYEDVPANDEKSLQKAVANQPVS 262
Query: 268 IGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGY 327
+ I A T F+ Y GIF G CGT LDH VT VG+G TE+G +YW++KNSWG +WG++GY
Sbjct: 263 VAIEAAGTAFQLYSSGIFTGSCGTALDHGVTAVGYG-TENGKDYWIVKNSWGSSWGESGY 321
Query: 328 MKILRD----EGLCGIGTQSSYPL 347
+++ R+ G CGI + SYPL
Sbjct: 322 VRMERNIKASSGKCGIAVEPSYPL 345
>gi|255563136|ref|XP_002522572.1| cysteine protease, putative [Ricinus communis]
gi|223538263|gb|EEF39872.1| cysteine protease, putative [Ricinus communis]
Length = 340
Score = 316 bits (810), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 172/338 (50%), Positives = 225/338 (66%), Gaps = 17/338 (5%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEY 77
+ I++++LV+ SQ + E +V E HE+WMA+HGR+Y+D+ EKE RF IFK+NL++
Sbjct: 9 LAIVLMILVTWVSQAMPRPLIDEDAVAEKHEQWMARHGRTYQDDEEKERRFHIFKKNLKH 68
Query: 78 IEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPS--PSHRSTTSSTFKYQNLSMTD 135
IE N NRTYKLG N F+DLT++EF A YTGYKMP P+ TT +T L +
Sbjct: 69 IENFNNAFNRTYKLGLNHFADLTDEEFLATYTGYKMPKVLPTANITTKTTQSSDVLYEAN 128
Query: 136 VPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNG 195
VP S+DWR + VTP+K+Q CGCCWAFSA AAVEGI N + LS QQL+DC +
Sbjct: 129 VPESIDWRTRGVVTPVKNQGRCGCCWAFSAAAAVEGII----GNGVSLSAQQLLDCVPD- 183
Query: 196 NNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQ 255
+NGC GG M+ AF YIIQNQG+A+ YPYQ ++ C + AA+IS Y +V DE+
Sbjct: 184 SNGCNGGFMDNAFRYIIQNQGLASATYYPYQLMREMCRPSNN--AARISGYVDVTPADEE 241
Query: 256 ALLKAVSMQPVSIGIAAYTTE--FKSYKEGIFNGV-CGTQLDHAVTIVGFGTTEDGANYW 312
L AV+ QPVS + A T+E FK Y GIF CG+ L HA+TIVG+GT+ +G YW
Sbjct: 242 TLKSAVARQPVSAAVDA-TSELNFKYYGGGIFPPQDCGSTLTHAITIVGYGTSAEGTKYW 300
Query: 313 LIKNSWGDTWGDAGYMKILRDE----GLCGIGTQSSYP 346
LIKNSWG+ WG+ GYM++ RD G CGI ++SYP
Sbjct: 301 LIKNSWGEGWGEGGYMRLQRDVGSYGGACGIALRASYP 338
>gi|146215996|gb|ABQ10200.1| cysteine protease Cp2 [Actinidia deliciosa]
Length = 376
Score = 316 bits (809), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 150/320 (46%), Positives = 219/320 (68%), Gaps = 9/320 (2%)
Query: 35 SRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTN 94
S S ++ V+ ++ +W+A+HG++Y E+E RF+IFK+NL+++++ N E NR+YK+G N
Sbjct: 35 SSSRTDEEVMGIYAEWLAKHGKAYNGIGERERRFEIFKDNLKFVDEHNSE-NRSYKVGLN 93
Query: 95 RFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD-VPTSLDWRDKKAVTPIKD 153
RF+DLTN+E+R+++ G K S + S + + +D +P S+DWR+ AV PIKD
Sbjct: 94 RFADLTNEEYRSMFLGTKTDSKRRFMKSKSASRRYAVQDSDMLPESVDWRESGAVAPIKD 153
Query: 154 QQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQ 213
Q CG CWAFS VAAVEG+ +I+ +IQLSEQ+LVDC + GC GG M+ AFE+II
Sbjct: 154 QGSCGSCWAFSTVAAVEGVNQIATGEMIQLSEQELVDCDRTYDAGCNGGLMDYAFEFIIN 213
Query: 214 NQGIATEDEYPYQAVQGTCSAAQK-AAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAA 272
N GI TE++YPY+ V GTC +K I++YE+VP DE AL KAV+ QPVS+ I A
Sbjct: 214 NGGIDTEEDYPYRGVDGTCDPERKNTKVVSINDYEDVPPYDEMALKKAVAHQPVSVAIEA 273
Query: 273 YTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILR 332
F+ Y G+F G CG LDH V +VG+G T++GA++W+++NSWG +WG+ GY+++ R
Sbjct: 274 SGRAFQLYLSGVFTGECGRALDHGVVVVGYG-TDNGADHWIVRNSWGTSWGENGYIRMER 332
Query: 333 D-----EGLCGIGTQSSYPL 347
+ G CGI Q+SYP+
Sbjct: 333 NVVDNFGGKCGIAMQASYPI 352
>gi|50355611|dbj|BAD29954.1| cysteine protease [Daucus carota]
Length = 474
Score = 315 bits (808), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 151/318 (47%), Positives = 212/318 (66%), Gaps = 13/318 (4%)
Query: 38 THEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFS 97
TH+Q ++ ++E W+ +H ++Y EKE RF IFK+N+ ++++ N N++YKLG N+F+
Sbjct: 52 THDQ-LLSLYESWLVKHHKNYNALGEKETRFGIFKDNVGFVDRHNSMRNQSYKLGLNKFA 110
Query: 98 DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD---VPTSLDWRDKKAVTPIKDQ 154
DLTNDE+R+LY KM ++ F+ D +P S+DWRD+ AV P+KDQ
Sbjct: 111 DLTNDEYRSLYLSGKMMKRERKNEDG--FRSDRFVFEDGDHLPESVDWRDRGAVAPVKDQ 168
Query: 155 QECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQN 214
+CG CWAFS V AVEGI KI LI LSEQ+LVDC N GC GG M+ AFE+I++N
Sbjct: 169 GQCGSCWAFSTVGAVEGINKIVTGELISLSEQELVDCDNGYNQGCNGGLMDYAFEFIVKN 228
Query: 215 QGIATEDEYPYQAVQGTCSAAQK-AAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAY 273
GI TED+YPY+ V G C +K A I+ YE+VP DE++L KAV+ QPVS+ I A
Sbjct: 229 GGIDTEDDYPYKGVDGLCDQNRKNAKVVTINGYEDVPHNDEKSLKKAVAHQPVSVAIEAG 288
Query: 274 TTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD 333
F+ Y+ G+F G CGT+LDH V VG+G +E+G +YW+++NSWG WG++GY+++ R+
Sbjct: 289 GRAFQLYESGVFTGQCGTELDHGVVAVGYG-SENGKDYWIVRNSWGPDWGESGYIRLERN 347
Query: 334 -----EGLCGIGTQSSYP 346
G CGI Q+SYP
Sbjct: 348 VASTSTGKCGIAMQASYP 365
>gi|2511689|emb|CAB17074.1| cysteine proteinase precursor [Phaseolus vulgaris]
Length = 364
Score = 315 bits (808), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 157/339 (46%), Positives = 215/339 (63%), Gaps = 10/339 (2%)
Query: 16 IPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENL 75
IP +++ S A+ +S + E V++M+E+W+ +H + Y EKE RF++FK+NL
Sbjct: 6 IPTLLLLSFTFSHAT-AMSIINYSENEVMDMYEEWLVKHRKVYNGLDEKEKRFQVFKDNL 64
Query: 76 EYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSST-FKYQNLSMT 134
+I+ N + N TY LG N+F+D+TN+E+RA+Y G + + T +T +Y S
Sbjct: 65 GFIQDHNAQ-NNTYTLGLNKFADITNEEYRAMYLGTRTDAKRRVMKTQNTGHRYAYNSGD 123
Query: 135 DVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTN 194
+P +DWR K AV PIKDQ CG CWAFS VAAVEGI I + LSEQ+LVDC
Sbjct: 124 QLPVHVDWRLKGAVGPIKDQGNCGSCWAFSTVAAVEGINNIVTGEFVSLSEQELVDCDRE 183
Query: 195 GNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCS-AAQKAAAAKISNYEEVPSGD 253
+ GC GG M+ AF++IIQN GI TE++YPYQ + GTC +K +I YE+VPS +
Sbjct: 184 YDEGCNGGLMDYAFQFIIQNGGIDTEEDYPYQGIDGTCDQTKKKTKVVQIDGYEDVPSNN 243
Query: 254 EQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWL 313
E AL KAVS QPVS+ I A + Y+ G+F G CGT LDH V +VG+G TE+G +YWL
Sbjct: 244 ENALKKAVSHQPVSVAIEASGRALQLYQSGVFTGKCGTALDHGVVVVGYG-TENGVDYWL 302
Query: 314 IKNSWGDTWGDAGYMKILRD-----EGLCGIGTQSSYPL 347
++NSWG WG+ GY K+ R+ EG CGI SYP+
Sbjct: 303 VRNSWGTGWGEDGYFKMERNVRSTSEGKCGIAMDCSYPV 341
>gi|34223513|gb|AAQ62999.1| oil palm polygalacturonase allergen PEST472 [Elaeis guineensis]
Length = 525
Score = 315 bits (808), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 154/315 (48%), Positives = 212/315 (67%), Gaps = 10/315 (3%)
Query: 40 EQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKAN---KEGNRTYKLGTNRF 96
E+ + ++E W+A+HGR+ EKE RF+IFK+N+ +I+ N G+R+++LG NRF
Sbjct: 43 EEEMRLLYEGWLAKHGRADNALGEKERRFEIFKDNVRFIDAHNAAADSGHRSFRLGLNRF 102
Query: 97 SDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQE 156
+D+TN+E+R +Y G + P+ R + +Y+ + ++P S+DWRDK AVT +KDQ
Sbjct: 103 ADMTNEEYRTVYLGTR-PASHRRRARLGSDRYRYNAGEELPESVDWRDKGAVTTVKDQGS 161
Query: 157 CGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQG 216
CG CWAFS +AAVEGI KI +LI LSEQ+LVDC N GC GG M+ AFE+II N G
Sbjct: 162 CGSCWAFSTIAAVEGINKIVTGDLISLSEQELVDCDNGQNQGCNGGLMDYAFEFIINNGG 221
Query: 217 IATEDEYPYQAVQGTCSAAQK-AAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTT 275
I TE++YPY+A G C +K A I YE+VP DE+AL KAV+ QPVS+ I A
Sbjct: 222 IDTEEDYPYKARDGKCDQYRKNAKVVSIDGYEDVPVNDEKALQKAVANQPVSVAIEAGGR 281
Query: 276 EFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD-- 333
EF+ Y GIF G CGT LDH V VG+G TE+G +YW+++NSWG WG++GY+++ R+
Sbjct: 282 EFQLYHSGIFTGRCGTDLDHGVVAVGYG-TENGKDYWIVRNSWGGDWGESGYIRMERNVN 340
Query: 334 --EGLCGIGTQSSYP 346
G CGI +SSYP
Sbjct: 341 ASTGKCGIAMESSYP 355
>gi|1173630|gb|AAB37233.1| cysteine proteinase [Phalaenopsis sp. SM9108]
Length = 359
Score = 315 bits (808), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 152/341 (44%), Positives = 215/341 (63%), Gaps = 12/341 (3%)
Query: 18 MFIIIILLVSCASQVVSSRSTH---EQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKEN 74
+ ++ L S A+ + E S+ ++E+W + H S +D EK+ RF +FKEN
Sbjct: 6 LILVASFLASVAATAIDIADKDLETEDSLWNLYERWRSHHTVS-RDLDEKQKRFNVFKEN 64
Query: 75 LEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSP-----SHRSTTSSTFKYQ 129
YI NK + YKL N+F+DLTN EFR+ Y G ++ S R +++F YQ
Sbjct: 65 PRYIHDFNKRKDIPYKLRLNKFADLTNHEFRSTYAGSRINHHRSLRGSRRGGATNSFMYQ 124
Query: 130 NLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLV 189
+L +P S+DWR K AVT +KDQ +CG CWAFS VAAVEGI +I L+ LSEQ+L+
Sbjct: 125 SLDSRSLPASIDWRQKGAVTAVKDQGQCGSCWAFSTVAAVEGINQIKTKKLLSLSEQELI 184
Query: 190 DCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEV 249
DC T+ NNGC GG M+ AF++I +N GI++E EYPY A C+ +K+ I +E+V
Sbjct: 185 DCDTDENNGCNGGLMDYAFDFIKKNGGISSEAEYPYAAEDSYCATEKKSHVVSIDGHEDV 244
Query: 250 PSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGA 309
P+ DE +LLKAV+ QPVSI I A +F+ Y EG+F G GT+LDH V IVG+G T+ G
Sbjct: 245 PANDEDSLLKAVANQPVSIAIEASGYDFQFYSEGVFTGRSGTELDHGVAIVGYGKTQQGT 304
Query: 310 NYWLIKNSWGDTWGDAGYMKI---LRDEGLCGIGTQSSYPL 347
YW+++NSWG WG+ GY++I + LCG+ ++SYP+
Sbjct: 305 KYWIVRNSWGAEWGEKGYIRISAASDSKRLCGLAMEASYPI 345
>gi|297852302|ref|XP_002894032.1| F2G19.31/F2G19.31 [Arabidopsis lyrata subsp. lyrata]
gi|297339874|gb|EFH70291.1| F2G19.31/F2G19.31 [Arabidopsis lyrata subsp. lyrata]
Length = 455
Score = 315 bits (808), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 159/350 (45%), Positives = 226/350 (64%), Gaps = 25/350 (7%)
Query: 18 MFIIIILLVSCASQV----VSSRSTH---------EQSVVEMHEKWMAQHGRSYKDE--L 62
M I+ + +V+ AS V +S H + V+ ++E W+ +HG++ +
Sbjct: 1 MVILFLAMVAVASAVDMSIISYDEKHGVSTTGGRSDAEVMSIYEAWLVKHGKAQNQNSLV 60
Query: 63 EKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTT 122
EK+ RF+IFK+NL +I+ NK+ N +Y+LG RF+DLTNDE+R+ Y G KM R T+
Sbjct: 61 EKDRRFEIFKDNLRFIDDHNKK-NLSYRLGLTRFADLTNDEYRSKYLGAKMEKKGERRTS 119
Query: 123 SSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQ 182
+Y+ ++P S+DWR K AV +KDQ CG CWAFS + AVEGI +I +LI
Sbjct: 120 Q---RYEARVGDELPESIDWRKKGAVAEVKDQGSCGSCWAFSTIGAVEGINQIVTGDLIT 176
Query: 183 LSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQK-AAAA 241
LSEQ+LVDC T+ N GC GG M+ AFE+II+N GI T+ +YPY+ V GTC +K A
Sbjct: 177 LSEQELVDCDTSYNEGCNGGLMDYAFEFIIKNGGIDTDKDYPYKGVDGTCDQIRKNAKVV 236
Query: 242 KISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVG 301
I +YE+VP+ E++L KAV+ QPVS+ I A F+ Y GIF+G CGTQLDH V VG
Sbjct: 237 TIDSYEDVPTYSEESLKKAVAHQPVSVAIEAGGRAFQLYDSGIFDGTCGTQLDHGVVAVG 296
Query: 302 FGTTEDGANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYPL 347
+G TE+G +YW+++NSWG +WG++GY+K+ R+ G CGI + SYP+
Sbjct: 297 YG-TENGKDYWIVRNSWGKSWGESGYLKMARNIASSSGKCGIAIEPSYPI 345
>gi|297598407|ref|NP_001045533.2| Os01g0971400 [Oryza sativa Japonica Group]
gi|15289977|dbj|BAB63672.1| putative cysteine protease CP1 [Oryza sativa Japonica Group]
gi|125529282|gb|EAY77396.1| hypothetical protein OsI_05384 [Oryza sativa Indica Group]
gi|125573472|gb|EAZ14987.1| hypothetical protein OsJ_04922 [Oryza sativa Japonica Group]
gi|215740756|dbj|BAG97412.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215741010|dbj|BAG97505.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215765325|dbj|BAG87022.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215767338|dbj|BAG99566.1| unnamed protein product [Oryza sativa Japonica Group]
gi|255674119|dbj|BAF07447.2| Os01g0971400 [Oryza sativa Japonica Group]
Length = 365
Score = 315 bits (808), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 153/322 (47%), Positives = 215/322 (66%), Gaps = 15/322 (4%)
Query: 37 STHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRF 96
++HE+ ++E+ EK+MA++ ++Y EK RF++FK+NL +I++ NK+ Y LG N F
Sbjct: 43 ASHER-LMELFEKFMAKYRKAYSSLEEKLRRFEVFKDNLNHIDEENKK-ITGYWLGLNEF 100
Query: 97 SDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQE 156
+DLT+DEF+A Y G + +P+ R++ F+Y+ + +P +DWR K AVT +K+Q +
Sbjct: 101 ADLTHDEFKAAYLGLTL-TPARRNSNDQLFRYEEVEAASLPKEVDWRKKGAVTEVKNQGQ 159
Query: 157 CGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQG 216
CG CWAFS VAAVEGI I NL +LSEQ+L+DC T+GNNGC GG M+ AF YI N G
Sbjct: 160 CGSCWAFSTVAAVEGINAIVTGNLTRLSEQELIDCDTDGNNGCSGGLMDYAFSYIAANGG 219
Query: 217 IATEDEYPYQAVQGTCSA--------AQKAAAAKISNYEEVPSGDEQALLKAVSMQPVSI 268
+ TE+ YPY +GTC + AAA IS YE+VP +EQALLKA++ QPVS+
Sbjct: 220 LHTEESYPYLMEEGTCRRGSTEGDDDGEAAAAVTISGYEDVPRNNEQALLKALAHQPVSV 279
Query: 269 GIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYM 328
I A F+ Y G+F+G CGT+LDH VT VG+GT G +Y ++KNSWG WG+ GY+
Sbjct: 280 AIEASGRNFQFYSGGVFDGPCGTRLDHGVTAVGYGTASKGHDYIIVKNSWGSHWGEKGYI 339
Query: 329 KILR----DEGLCGIGTQSSYP 346
++ R +GLCGI +SYP
Sbjct: 340 RMRRGTGKHDGLCGINKMASYP 361
>gi|242038089|ref|XP_002466439.1| hypothetical protein SORBIDRAFT_01g007820 [Sorghum bicolor]
gi|241920293|gb|EER93437.1| hypothetical protein SORBIDRAFT_01g007820 [Sorghum bicolor]
Length = 353
Score = 315 bits (807), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 158/352 (44%), Positives = 222/352 (63%), Gaps = 19/352 (5%)
Query: 15 TIPMFIIIILLVSCASQVVSSRS-------------THEQSVVEMHEKWMAQHGRSYKDE 61
++ F++ +L+V+ + R+ ++V HEKWMA+HGR+Y DE
Sbjct: 2 SVSRFVLTVLVVASVCTAAAPRALAVRELAGEEESAAVAAAMVSRHEKWMAEHGRTYTDE 61
Query: 62 LEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRST 121
EK R +IF+ N E+I+ N G +++L TNRF+DLT++EFRA TG++ +
Sbjct: 62 AEKARRLEIFRANAEFIDSFNDAGKHSHRLATNRFADLTDEEFRAARTGFRPRPAPAAAA 121
Query: 122 TSST-FKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANL 180
S F+Y+N S+ D S+DWR AVT +KDQ ECGCCWAFSAVAAVEG+ KI L
Sbjct: 122 GSGGRFRYENFSLADAAQSVDWRAMGAVTGVKDQGECGCCWAFSAVAAVEGLNKIRTGRL 181
Query: 181 IQLSEQQLVDCSTNG-NNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAA 239
+ LSEQ+LVDC NG + GC GG M+ AF++I + G+A+E YPYQ G+C ++ AA
Sbjct: 182 VSLSEQELVDCDVNGEDQGCEGGLMDDAFQFIERRGGLASESGYPYQGDDGSCRSSAAAA 241
Query: 240 AAK-ISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVT 298
A I +E+VP +E AL AV+ QPVS+ I F+ Y G+ G CGT L+HA+T
Sbjct: 242 RAASIRGHEDVPRNNEAALAAAVANQPVSVAINGEDYAFRFYDSGVLGGECGTDLNHAIT 301
Query: 299 IVGFGTTEDGANYWLIKNSWGDTWGDAGYMKI---LRDEGLCGIGTQSSYPL 347
VG+GT DG+ YWL+KNSWG +WG+ GY++I +R EG+CG+ SYP+
Sbjct: 302 AVGYGTAADGSKYWLMKNSWGTSWGEGGYVRIRRGVRGEGVCGLAKLPSYPV 353
>gi|58531896|gb|AAW78660.1| cysteine protease [Nicotiana tabacum]
Length = 361
Score = 315 bits (807), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 155/313 (49%), Positives = 206/313 (65%), Gaps = 16/313 (5%)
Query: 45 EMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEF 104
E++E+W + H S + EK+ RF +FK N+ Y+ NK+ ++ YKL N+F+D+TN EF
Sbjct: 36 ELYERWRSHHTVSRSLD-EKDKRFNVFKANVHYVHNFNKK-DKPYKLKLNKFADMTNHEF 93
Query: 105 RALYTGYKMPSPSHRS-----TTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGC 159
R Y G K+ HRS + TF Y N+ DVP S+DWR K AVTP+KDQ +CG
Sbjct: 94 RHHYAGSKIKH--HRSFLGASRANGTFMYANVE--DVPPSVDWRKKGAVTPVKDQGKCGS 149
Query: 160 CWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIAT 219
CWAFS V AVEGI +I L+ LSEQ+LVDC T+ N GC GG M+ AFE+I + GI T
Sbjct: 150 CWAFSTVVAVEGINQIKTNELVSLSEQELVDCDTSQNQGCNGGLMDMAFEFIKKKGGINT 209
Query: 220 EDEYPYQAVQGTCSAAQK-AAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFK 278
E+ YPY A G C ++ + I YE+VP DE +LLKAV+ QPVS+ I A ++F+
Sbjct: 210 EENYPYMAEGGECDIQKRNSPVVSIDGYEDVPPNDEDSLLKAVANQPVSVAIQASGSDFQ 269
Query: 279 SYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILR----DE 334
Y EG+F G CGT+LDH V IVG+GTT DG YW+++NSWG WG+ GY+++ R +E
Sbjct: 270 FYSEGVFTGDCGTELDHGVAIVGYGTTLDGTKYWIVRNSWGPEWGEKGYIRMQREIDAEE 329
Query: 335 GLCGIGTQSSYPL 347
GLCGI Q SYP+
Sbjct: 330 GLCGIAMQPSYPI 342
>gi|171702843|dbj|BAG16377.1| cysteine protease [Brassica rapa var. perviridis]
Length = 431
Score = 315 bits (807), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 154/348 (44%), Positives = 227/348 (65%), Gaps = 23/348 (6%)
Query: 15 TIPMFIIIILLVSCAS----------QVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEK 64
T+ +F+ +I++ S VSSRS E V ++E+W+ +HG++ EK
Sbjct: 2 TVILFLAMIVVSSAMDMSIISYDKNHHTVSSRSDVE--VSRLYEEWVVKHGKAQNSLTEK 59
Query: 65 EMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSS 124
+ RF+IFK+NL +I++ N + N +Y+LG +F+DLTNDE+R++Y G ++ R T +
Sbjct: 60 DRRFEIFKDNLRFIDEHNGK-NLSYRLGLTKFADLTNDEYRSMYLGSRL----KRKATKT 114
Query: 125 TFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLS 184
+ +Y+ +P S+DWR + AV +KDQ CG CWAFS + AVEGI KI +LI LS
Sbjct: 115 SLRYEARVGDAIPESVDWRKEGAVAEVKDQGSCGSCWAFSTIGAVEGINKIVTGDLISLS 174
Query: 185 EQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQK-AAAAKI 243
EQ+LVDC T+ N GC GG M+ AFE+II+N GI TE++YPY+ V G C +K A I
Sbjct: 175 EQELVDCDTSYNEGCNGGLMDYAFEFIIKNGGIDTEEDYPYKGVDGRCDQTRKNAKVVTI 234
Query: 244 SNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFG 303
+YE+VP+ E++L KA+S QP+S+ I F+ Y GIF+G+CGT LDH V VG+G
Sbjct: 235 DSYEDVPANSEESLKKALSHQPISVAIEGGGRAFQLYDSGIFDGICGTDLDHGVVAVGYG 294
Query: 304 TTEDGANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYPL 347
TE+G +YW++KNSWG +WG++GY+++ R+ G CGI + SYP+
Sbjct: 295 -TENGKDYWIVKNSWGTSWGESGYIRMERNIASSAGKCGIAVEPSYPI 341
>gi|224136808|ref|XP_002326950.1| predicted protein [Populus trichocarpa]
gi|222835265|gb|EEE73700.1| predicted protein [Populus trichocarpa]
Length = 456
Score = 315 bits (807), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 158/343 (46%), Positives = 216/343 (62%), Gaps = 17/343 (4%)
Query: 18 MFIIIILLVSCASQVVSSRSTH--------EQSVVEMHEKWMAQHGRSYKDELEKEMRFK 69
+F++ L + ++S TH + V+ M+E+W+ +HG++Y EKE RF+
Sbjct: 5 LFLVFALSSAFDMSIISYHQTHATKSSWRTDDEVMAMYEEWLVKHGKNYNALGEKEKRFE 64
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQ 129
IFK+NL +I++ N E NRTY +G NRF+DLTN+EFR++Y G + TS +Y
Sbjct: 65 IFKDNLMFIDQHNSE-NRTYTVGLNRFADLTNEEFRSMYLGTRTGHKKRLPKTSD--RYA 121
Query: 130 NLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLV 189
+P S+DWR + AV +KDQ CG CWAFS +AAVEGI KI +LI LSEQ+LV
Sbjct: 122 PRVGDSLPDSVDWRKEGAVAEVKDQGGCGSCWAFSTIAAVEGINKIVTGDLIALSEQELV 181
Query: 190 DCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQK-AAAAKISNYEE 248
DC T+ N GC GG M+ AFE+II N GI TED+YPY G C +K A I +YE+
Sbjct: 182 DCDTSYNEGCNGGLMDYAFEFIINNGGIDTEDDYPYLGRDGRCDTYRKNAKVVSIDSYED 241
Query: 249 VPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDG 308
VP DE AL KAV+ QPVS+ I F+ Y G+F G CGT LDH V VG+G TE G
Sbjct: 242 VPENDETALKKAVANQPVSVAIEGGGRNFQLYNSGVFTGECGTSLDHGVAAVGYG-TEKG 300
Query: 309 ANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYPL 347
+YW+++NSWG +WG++GY+++ R+ G CGI + SYP+
Sbjct: 301 KDYWIVRNSWGKSWGESGYIRMERNIASPTGKCGIAIEPSYPI 343
>gi|1256830|gb|AAB68374.1| cysteine endopeptidase 1 [Phaseolus vulgaris]
gi|2959418|emb|CAA12118.1| cysteine protease [Phaseolus vulgaris]
Length = 364
Score = 315 bits (806), Expect = 3e-83, Method: Compositional matrix adjust.
Identities = 157/339 (46%), Positives = 214/339 (63%), Gaps = 10/339 (2%)
Query: 16 IPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENL 75
IP +++ S A+ +S + E V++M+E+W+ +H + Y EKE RF++FK+NL
Sbjct: 6 IPTLLLLSFTFSHAT-AMSIINYSENEVMDMYEEWLVKHRKVYNGLDEKEKRFQVFKDNL 64
Query: 76 EYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSST-FKYQNLSMT 134
+I+ N + N TY LG N+F+D+TN E+RA+Y G + + T +T +Y S
Sbjct: 65 GFIQDHNAQ-NNTYTLGLNKFADITNKEYRAMYLGTRTDAKRRVMKTQNTGHRYAYNSGD 123
Query: 135 DVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTN 194
+P +DWR K AV PIKDQ CG CWAFS VAAVEGI I + LSEQ+LVDC
Sbjct: 124 QLPVHVDWRLKGAVGPIKDQGNCGSCWAFSTVAAVEGINNIVTGEFVSLSEQELVDCDRE 183
Query: 195 GNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCS-AAQKAAAAKISNYEEVPSGD 253
+ GC GG M+ AF++IIQN GI TE++YPYQ + GTC +K +I YE+VPS +
Sbjct: 184 YDEGCNGGLMDYAFQFIIQNGGIDTEEDYPYQGIDGTCDETKKKTKVVQIDGYEDVPSNN 243
Query: 254 EQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWL 313
E AL KAVS QPVS+ I A + Y+ G+F G CGT LDH V +VG+G TE+G +YWL
Sbjct: 244 ENALKKAVSHQPVSVAIEASGRALQLYQSGVFTGKCGTALDHGVVVVGYG-TENGVDYWL 302
Query: 314 IKNSWGDTWGDAGYMKILRD-----EGLCGIGTQSSYPL 347
++NSWG WG+ GY K+ R+ EG CGI SYP+
Sbjct: 303 VRNSWGTGWGEDGYFKMERNVRSTSEGKCGIAMDCSYPV 341
>gi|18418684|ref|NP_567983.1| Xylem cysteine proteinase 1 [Arabidopsis thaliana]
gi|71153408|sp|O65493.1|XCP1_ARATH RecName: Full=Xylem cysteine proteinase 1; Short=AtXCP1; Flags:
Precursor
gi|6708181|gb|AAF25831.1|AF191027_1 papain-type cysteine endopeptidase XCP1 [Arabidopsis thaliana]
gi|3080415|emb|CAA18734.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|7270487|emb|CAB80252.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|26449881|dbj|BAC42063.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|28827736|gb|AAO50712.1| unknown protein [Arabidopsis thaliana]
gi|332661101|gb|AEE86501.1| Xylem cysteine proteinase 1 [Arabidopsis thaliana]
Length = 355
Score = 315 bits (806), Expect = 3e-83, Method: Compositional matrix adjust.
Identities = 148/314 (47%), Positives = 215/314 (68%), Gaps = 9/314 (2%)
Query: 38 THEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFS 97
T+ ++E+ E WM++H ++YK EK RF++F+ENL +I++ N E N +Y LG N F+
Sbjct: 42 TNTDKLLELFESWMSEHSKAYKSVEEKVHRFEVFRENLMHIDQRNNEIN-SYWLGLNEFA 100
Query: 98 DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQEC 157
DLT++EF+ Y G P S + S+ F+Y+++ TD+P S+DWR K AV P+KDQ +C
Sbjct: 101 DLTHEEFKGRYLGLAKPQFSRKRQPSANFRYRDI--TDLPKSVDWRKKGAVAPVKDQGQC 158
Query: 158 GCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGI 217
G CWAFS VAAVEGI +I+ NL LSEQ+L+DC T N+GC GG M+ AF+YII G+
Sbjct: 159 GSCWAFSTVAAVEGINQITTGNLSSLSEQELIDCDTTFNSGCNGGLMDYAFQYIISTGGL 218
Query: 218 ATEDEYPYQAVQGTCSAAQK-AAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTE 276
ED+YPY +G C ++ IS YE+VP D+++L+KA++ QPVS+ I A +
Sbjct: 219 HKEDDYPYLMEEGICQEQKEDVERVTISGYEDVPENDDESLVKALAHQPVSVAIEASGRD 278
Query: 277 FKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD--- 333
F+ YK G+FNG CGT LDH V VG+G+++ G++Y ++KNSWG WG+ G++++ R+
Sbjct: 279 FQFYKGGVFNGKCGTDLDHGVAAVGYGSSK-GSDYVIVKNSWGPRWGEKGFIRMKRNTGK 337
Query: 334 -EGLCGIGTQSSYP 346
EGLCGI +SYP
Sbjct: 338 PEGLCGINKMASYP 351
>gi|18141285|gb|AAL60580.1|AF454958_1 senescence-associated cysteine protease [Brassica oleracea]
Length = 485
Score = 314 bits (805), Expect = 3e-83, Method: Compositional matrix adjust.
Identities = 155/348 (44%), Positives = 225/348 (64%), Gaps = 23/348 (6%)
Query: 15 TIPMFIIIILLVSCAS----------QVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEK 64
T+ +F+ +I++ S VSSRS E V ++E+W+ +HG++ EK
Sbjct: 8 TVILFLTMIVVSSAMDMSIISYDKNHHTVSSRSDAE--VSRLYEEWLVKHGKAQNSLTEK 65
Query: 65 EMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSS 124
+ RF+IFK+NL +I++ N + N +Y+LG +F+DLTNDE+R++Y G ++ R T S
Sbjct: 66 DRRFEIFKDNLRFIDEHNGK-NLSYRLGLTKFADLTNDEYRSMYLGSRL----KRKATKS 120
Query: 125 TFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLS 184
+ +Y+ +P S+DWR + AV +KDQ CG CWAFS + AVEGI KI +LI LS
Sbjct: 121 SLRYEVRVGDAIPESVDWRKEGAVAEVKDQGSCGSCWAFSTIGAVEGINKIVTGDLITLS 180
Query: 185 EQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQK-AAAAKI 243
EQ+LVDC T+ N GC GG M+ AFE+II N GI TE++YPY+ V G C +K A I
Sbjct: 181 EQELVDCDTSYNEGCNGGLMDYAFEFIINNGGIDTEEDYPYKGVDGRCDQTRKNAKVVTI 240
Query: 244 SNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFG 303
YE+VP+ E++L KA+S QP+S+ I F+ Y GIF+G+CGT LDH V VG+G
Sbjct: 241 DLYEDVPANSEESLKKALSHQPISVAIEGGGRAFQLYDSGIFDGICGTDLDHGVVAVGYG 300
Query: 304 TTEDGANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYPL 347
TE+G +YW++KNSWG +WG++GY+++ R+ G CGI + SYP+
Sbjct: 301 -TENGKDYWIVKNSWGTSWGESGYIRMERNIASSAGKCGIAVEPSYPI 347
>gi|242072390|ref|XP_002446131.1| hypothetical protein SORBIDRAFT_06g002140 [Sorghum bicolor]
gi|241937314|gb|EES10459.1| hypothetical protein SORBIDRAFT_06g002140 [Sorghum bicolor]
Length = 328
Score = 314 bits (805), Expect = 3e-83, Method: Compositional matrix adjust.
Identities = 152/338 (44%), Positives = 220/338 (65%), Gaps = 20/338 (5%)
Query: 15 TIPMFIIIIL-LVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKE 73
T F++ IL S S V+++R + ++VE HE WM ++GR YKD EK RF++FK+
Sbjct: 3 TSKAFLLAILGCASLCSSVLAARELSDAAMVERHENWMVEYGRVYKDAAEKARRFQVFKD 62
Query: 74 NLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
N+ ++E N N + LG N+F+DLT +EF+A G+K P+ ++ FKY+NLS+
Sbjct: 63 NVAFVESFNTNKNNKFWLGVNQFADLTTEEFKA-NKGFK---PTAEKVPTTGFKYENLSV 118
Query: 134 TDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCST 193
+ +PT++DWR K AVTPIK+Q +C AA+EGI K+S NLI LSEQ+LVDC T
Sbjct: 119 SALPTAVDWRTKGAVTPIKNQGQC---------AAMEGIVKLSTGNLISLSEQELVDCDT 169
Query: 194 NG-NNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSG 252
+ + GC GG M+ AFE++I+N G+ATE YPY+AV G C K+AA I +E+VP
Sbjct: 170 HSMDEGCEGGWMDSAFEFVIKNGGLATESNYPYKAVDGKCKGGSKSAAT-IKGHEDVPVN 228
Query: 253 DEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYW 312
+E AL+KAV+ QPVS+ + A F Y G+ G CGT+LDH + +G+G DG YW
Sbjct: 229 NEAALMKAVANQPVSVAVDASDRTFMLYSGGVMTGSCGTELDHGIAAIGYGMESDGTKYW 288
Query: 313 LIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYP 346
++KNSWG TWG+ G++++ +D G+CG+ + SYP
Sbjct: 289 ILKNSWGTTWGEKGFLRMEKDITDKRGMCGLAMKPSYP 326
>gi|414870137|tpg|DAA48694.1| TPA: vignain [Zea mays]
Length = 484
Score = 314 bits (805), Expect = 3e-83, Method: Compositional matrix adjust.
Identities = 153/321 (47%), Positives = 205/321 (63%), Gaps = 20/321 (6%)
Query: 40 EQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDL 99
E+++ ++E+W +H + +D +K RF +FK N+ I + N+ + YKL NRF D+
Sbjct: 149 EEALWALYERWRGRHALA-RDLGDKARRFNVFKANVRLIHEFNRR-DEPYKLRLNRFGDM 206
Query: 100 TNDEFRALYTGYKMPSPSHR---------STTSSTFKYQNLSMTDVPTSLDWRDKKAVTP 150
T DEFR Y G ++ HR S ++S+F Y + DVP S+DWR K AVT
Sbjct: 207 TADEFRRHYAGSRVAH--HRMFRGDRQGSSASASSFMYAD--ARDVPASVDWRQKGAVTD 262
Query: 151 IKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEY 210
+KDQ +CG CWAFS +AAVEGI I NL LSEQQLVDC T N GC GG M+ AF+Y
Sbjct: 263 VKDQGQCGSCWAFSTIAAVEGINAIKTKNLTSLSEQQLVDCDTKANAGCNGGLMDYAFQY 322
Query: 211 IIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGI 270
I ++ G+A ED YPY+A Q +C + A I YE+VP+ DE AL KAV+ QPVS+ I
Sbjct: 323 IAKHGGVAAEDAYPYRARQASCKKS-PAPVVTIDGYEDVPANDESALKKAVAHQPVSVAI 381
Query: 271 AAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKI 330
A + F+ Y EG+F+G CGT+LDH V VG+G T DG YWL+KNSWG WG+ GY+++
Sbjct: 382 EASGSHFQFYSEGVFSGRCGTELDHGVAAVGYGVTADGTKYWLVKNSWGPEWGEKGYIRM 441
Query: 331 LRD----EGLCGIGTQSSYPL 347
RD EG CGI ++SYP+
Sbjct: 442 ARDVAAKEGHCGIAMEASYPV 462
>gi|115461667|ref|NP_001054433.1| Os05g0108600 [Oryza sativa Japonica Group]
gi|14719319|gb|AAK73137.1|AC079022_10 putative cysteine proteinase [Oryza sativa]
gi|33151125|gb|AAP97431.1| cysteine protease CP1 [Oryza sativa]
gi|52353572|gb|AAU44138.1| cysteine proteinase CP1 [Oryza sativa Japonica Group]
gi|113577984|dbj|BAF16347.1| Os05g0108600 [Oryza sativa Japonica Group]
gi|125550541|gb|EAY96250.1| hypothetical protein OsI_18148 [Oryza sativa Indica Group]
Length = 358
Score = 314 bits (805), Expect = 3e-83, Method: Compositional matrix adjust.
Identities = 154/317 (48%), Positives = 215/317 (67%), Gaps = 11/317 (3%)
Query: 37 STHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRF 96
++H++ ++E+ EKW+A++ ++Y EK RF++FK+NL +I+ NK+ +Y LG N F
Sbjct: 42 ASHDR-LIELFEKWVAKYRKAYASFEEKVRRFEVFKDNLNHIDDINKK-VTSYWLGLNEF 99
Query: 97 SDLTNDEFRALYTGYKMPSPSHRST---TSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKD 153
+DLT+DEF+A Y G P P+ ++ +S F+Y +S +VP +DWR K AVT +K+
Sbjct: 100 ADLTHDEFKATYLGL-TPPPTRSNSKHYSSEEFRYGKMSNGEVPKEMDWRKKNAVTEVKN 158
Query: 154 QQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQ 213
Q +CG CWAFS VAAVEGI I NL LSEQ+L+DCST+GNNGC GG M+ AF YI
Sbjct: 159 QGQCGSCWAFSTVAAVEGINAIVTGNLTSLSEQELIDCSTDGNNGCNGGLMDYAFSYIAS 218
Query: 214 NQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAY 273
G+ TE+ YPY +G C + AA IS YE+VP+ DEQAL+KA++ QPVS+ I A
Sbjct: 219 TGGLRTEEAYPYAMEEGDCDEGKGAAVVTISGYEDVPANDEQALVKALAHQPVSVAIEAS 278
Query: 274 TTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILR- 332
F+ Y G+F+G CG QLDH VT VG+GT++ G +Y ++KNSWG WG+ GY+++ R
Sbjct: 279 GRHFQFYSGGVFDGPCGEQLDHGVTAVGYGTSK-GQDYIIVKNSWGPHWGEKGYIRMKRG 337
Query: 333 ---DEGLCGIGTQSSYP 346
EGLCGI +SYP
Sbjct: 338 TGKGEGLCGINKMASYP 354
>gi|171702831|dbj|BAG16371.1| cysteine protease [Brassica oleracea var. italica]
Length = 441
Score = 314 bits (805), Expect = 3e-83, Method: Compositional matrix adjust.
Identities = 155/348 (44%), Positives = 225/348 (64%), Gaps = 23/348 (6%)
Query: 15 TIPMFIIIILLVSCAS----------QVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEK 64
T+ +F+ +I++ S VSSRS E V ++E+W+ +HG++ EK
Sbjct: 2 TVILFLTMIVVSSAMDMSIISYDKNHHTVSSRSDAE--VSRLYEEWLVKHGKAQNSLTEK 59
Query: 65 EMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSS 124
+ RF+IFK+NL +I++ N + N +Y+LG +F+DLTNDE+R++Y G ++ R T S
Sbjct: 60 DRRFEIFKDNLRFIDEHNGK-NLSYRLGLTKFADLTNDEYRSMYLGSRL----KRKATKS 114
Query: 125 TFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLS 184
+ +Y+ +P S+DWR + AV +KDQ CG CWAFS + AVEGI KI +LI LS
Sbjct: 115 SLRYEVRVGDAIPESVDWRKEGAVAEVKDQGSCGSCWAFSTIGAVEGINKIVTGDLITLS 174
Query: 185 EQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQK-AAAAKI 243
EQ+LVDC T+ N GC GG M+ AFE+II N GI TE++YPY+ V G C +K A I
Sbjct: 175 EQELVDCDTSYNEGCNGGLMDYAFEFIINNGGIDTEEDYPYKGVDGRCDQTRKNAKVVTI 234
Query: 244 SNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFG 303
YE+VP+ E++L KA+S QP+S+ I F+ Y GIF+G+CGT LDH V VG+G
Sbjct: 235 DLYEDVPANSEESLKKALSHQPISVAIEGGGRAFQLYDSGIFDGICGTDLDHGVVAVGYG 294
Query: 304 TTEDGANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYPL 347
TE+G +YW++KNSWG +WG++GY+++ R+ G CGI + SYP+
Sbjct: 295 -TENGKDYWIVKNSWGTSWGESGYIRMERNIASSAGKCGIAVEPSYPI 341
>gi|226507950|ref|NP_001151278.1| LOC100284911 precursor [Zea mays]
gi|195645488|gb|ACG42212.1| vignain precursor [Zea mays]
Length = 376
Score = 314 bits (805), Expect = 3e-83, Method: Compositional matrix adjust.
Identities = 154/320 (48%), Positives = 206/320 (64%), Gaps = 19/320 (5%)
Query: 40 EQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDL 99
E+++ ++E+W +H + +D +K RF +FK N+ I + N+ + YKL NRF D+
Sbjct: 42 EEALWALYERWRGRHALA-RDLGDKARRFNVFKANVRLIHEFNRR-DEPYKLRLNRFGDM 99
Query: 100 TNDEFRALYTGYKMPSPSHR--------STTSSTFKYQNLSMTDVPTSLDWRDKKAVTPI 151
T DEFR Y G ++ HR S+ S++F Y + DVP S+DWR K AVT +
Sbjct: 100 TADEFRRHYAGSRVAH--HRMFRGDRQGSSASASFMYAD--ARDVPASVDWRQKGAVTDV 155
Query: 152 KDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYI 211
KDQ +CG CWAFS +AAVEGI I NL LSEQQLVDC T N GC GG M+ AF+YI
Sbjct: 156 KDQGQCGSCWAFSTIAAVEGINAIKTKNLTSLSEQQLVDCDTKANAGCNGGLMDYAFQYI 215
Query: 212 IQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIA 271
++ G+A ED YPY+A Q +C + A I YE+VP+ DE AL KAV+ QPVS+ I
Sbjct: 216 AKHGGVAAEDAYPYRARQASCKKS-PAPVVTIDGYEDVPANDESALKKAVAHQPVSVAIE 274
Query: 272 AYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKIL 331
A + F+ Y EG+F+G CGT+LDH VT VG+G T DG YWL+KNSWG WG+ GY+++
Sbjct: 275 ASGSHFQFYSEGVFSGRCGTELDHGVTAVGYGVTADGTKYWLVKNSWGPEWGEKGYIRMA 334
Query: 332 RD----EGLCGIGTQSSYPL 347
RD EG CGI ++SYP+
Sbjct: 335 RDVAAKEGHCGIAMEASYPV 354
>gi|182375363|gb|ACB87490.1| mucunain [Mucuna pruriens]
Length = 422
Score = 314 bits (805), Expect = 3e-83, Method: Compositional matrix adjust.
Identities = 150/311 (48%), Positives = 212/311 (68%), Gaps = 9/311 (2%)
Query: 44 VEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDE 103
+ ++E+W+ +HG++Y EK+ RF IFK+NL +I+ N + NRTYKLG NRF+DLTN+E
Sbjct: 1 MSLYEQWLVKHGKAYNALGEKDKRFDIFKDNLRFIDDHNAD-NRTYKLGLNRFADLTNEE 59
Query: 104 FRALYTGYKM-PSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWA 162
+RA Y G ++ P+ T + + +Y ++P S+DWR++ AV P+KDQ CG CWA
Sbjct: 60 YRARYLGTRIDPNRRFVKTKTQSNRYAPRVGDNLPESVDWRNESAVLPVKDQGNCGSCWA 119
Query: 163 FSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDE 222
FS + AVEGI KI +LI LSEQ+LVDC T+ N GC GG M+ A+E+II N GI +E++
Sbjct: 120 FSTIGAVEGINKIVTGDLISLSEQELVDCDTSYNQGCNGGLMDYAYEFIINNGGIDSEED 179
Query: 223 YPYQAVQGTCSAAQK-AAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYK 281
YPY+AV GTC +K A I +YE+VP+ DE AL KAV+ QPVS+ I EF+ Y
Sbjct: 180 YPYRAVDGTCDQYRKNAKVVTIDSYEDVPANDELALKKAVANQPVSVAIEGGGREFQLYV 239
Query: 282 EGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD-----EGL 336
G+F G CGT LDH V VG+G+ + G +YW+++NSWG +WG+ GY+++ R+ G
Sbjct: 240 SGVFTGRCGTALDHGVVAVGYGSVK-GHDYWIVRNSWGASWGEEGYVRLERNLAKSRSGK 298
Query: 337 CGIGTQSSYPL 347
CGI + SYP+
Sbjct: 299 CGIAIEPSYPI 309
>gi|242081867|ref|XP_002445702.1| hypothetical protein SORBIDRAFT_07g024430 [Sorghum bicolor]
gi|241942052|gb|EES15197.1| hypothetical protein SORBIDRAFT_07g024430 [Sorghum bicolor]
Length = 372
Score = 314 bits (805), Expect = 4e-83, Method: Compositional matrix adjust.
Identities = 153/316 (48%), Positives = 204/316 (64%), Gaps = 13/316 (4%)
Query: 40 EQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDL 99
E+++ ++E+W +H + +D +K RF +FKEN+ I N+ + YKL NRF D+
Sbjct: 40 EEALWALYERWRGRHAVA-RDLGDKARRFNVFKENVRLIHDFNQR-DEPYKLRLNRFGDM 97
Query: 100 TNDEFRALYTGYKMPSP----SHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQ 155
T DEFR Y G ++ R ++S+F Y D+PTS+DWR K AVT +KDQ
Sbjct: 98 TADEFRRHYAGSRVAHHRMFRGDRQGSASSFMYAG--ARDLPTSVDWRQKGAVTDVKDQG 155
Query: 156 ECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQ 215
+CG CWAFS +AAVEGI I NL LSEQQLVDC T GN GC GG M+ AF+YI ++
Sbjct: 156 QCGSCWAFSTIAAVEGINAIKTKNLTSLSEQQLVDCDTKGNAGCDGGLMDYAFQYIAKHG 215
Query: 216 GIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTT 275
G+A ED YPY+A Q +C + A A I YE+VP+ DE AL KAV+ QPVS+ I A +
Sbjct: 216 GVAAEDAYPYKARQASCKKS-PAPAVTIDGYEDVPANDESALKKAVAHQPVSVAIEASGS 274
Query: 276 EFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD-- 333
F+ Y EG+F G CGT+LDH VT VG+G DG YW++KNSWG WG+ GY+++ RD
Sbjct: 275 HFQFYSEGVFAGRCGTELDHGVTAVGYGVAADGTKYWVVKNSWGPEWGEKGYIRMARDVA 334
Query: 334 --EGLCGIGTQSSYPL 347
EG CGI ++SYP+
Sbjct: 335 AKEGHCGIAMEASYPV 350
>gi|224103643|ref|XP_002313136.1| predicted protein [Populus trichocarpa]
gi|222849544|gb|EEE87091.1| predicted protein [Populus trichocarpa]
Length = 477
Score = 314 bits (805), Expect = 4e-83, Method: Compositional matrix adjust.
Identities = 148/313 (47%), Positives = 208/313 (66%), Gaps = 7/313 (2%)
Query: 40 EQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDL 99
E + ++E W+ ++G++Y EKE RF+IFK+NL+++++ N GN +YKLG N+F+DL
Sbjct: 42 EAETLRLYEMWLVKYGKAYNALGEKERRFEIFKDNLKFVDQHNSVGNPSYKLGLNKFADL 101
Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGC 159
+N+E+RA Y G +M + +Y D+P S+DWR+K AV P+KDQ +CG
Sbjct: 102 SNEEYRAAYLGTRMDGKRRLLGGPKSARYLFKDGDDLPESVDWREKGAVAPVKDQGQCGS 161
Query: 160 CWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIAT 219
CWAFS V AVEGI +I NL LSEQ+LVDC N GC GG M+ AFE+I++N GI T
Sbjct: 162 CWAFSTVGAVEGINQIVTGNLTSLSEQELVDCDKVYNQGCNGGLMDYAFEFIMKNGGIDT 221
Query: 220 EDEYPYQAVQGTCSAAQK-AAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFK 278
E++YPY+AV C +K A I YE+VP DE++L KAV+ QPVS+ I A F+
Sbjct: 222 EEDYPYKAVDSMCDPNRKNARVVTIDGYEDVPQNDEKSLRKAVANQPVSVAIEAGGRAFQ 281
Query: 279 SYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILR-----D 333
Y+ G+F G CGTQLDH V VG+G TE+G +YW+++NSWG WG+ GY+++ R +
Sbjct: 282 LYQSGVFTGSCGTQLDHGVVAVGYG-TENGVDYWVVRNSWGPAWGENGYIRMERNVASTE 340
Query: 334 EGLCGIGTQSSYP 346
G CGI ++SYP
Sbjct: 341 TGKCGIAMEASYP 353
>gi|5777889|emb|CAB53515.1| cysteine protease [Solanum tuberosum]
Length = 466
Score = 314 bits (804), Expect = 4e-83, Method: Compositional matrix adjust.
Identities = 153/351 (43%), Positives = 221/351 (62%), Gaps = 11/351 (3%)
Query: 7 RSGSFKINTIPMFIIIILLVSCASQVVSSRSTH-----EQSVVEMHEKWMAQHGRSYKDE 61
S + I+ + M I L + ++S TH + V ++E W+ +HG+SY
Sbjct: 4 HSSTLTISLLLMLIFSTLSSASDMSIISYDETHIHHRSDDEVSALYESWLIEHGKSYNAL 63
Query: 62 LEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRST 121
EK+ RF+IFK+NL+YI++ N N++YKLG +F+DLTN+E+R++Y G K + +
Sbjct: 64 GEKDKRFQIFKDNLKYIDEQNSVPNQSYKLGLTKFADLTNEEYRSIYLGTKSSGDRRKLS 123
Query: 122 TSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLI 181
+ + +Y +P S+DWRDK + +KDQ CG CWAFSAVAA+E I I NLI
Sbjct: 124 KNKSDRYLPKVGDSLPESVDWRDKGVLVGVKDQGSCGSCWAFSAVAAMESINAIVTGNLI 183
Query: 182 QLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQK-AAA 240
LSEQ+LVDC + N GC GG M+ AFE++I N GI TE++YPY+ C +K A
Sbjct: 184 SLSEQELVDCDKSYNEGCDGGLMDYAFEFVINNGGIDTEEDYPYKERNDVCDQYRKNAKV 243
Query: 241 AKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIV 300
KI +YE+VP +E+AL KAV+ QPVSI I A + + YK GIF G CGT +DH V
Sbjct: 244 VKIDSYEDVPVNNEKALQKAVAHQPVSIAIEAGGRDLQHYKSGIFTGKCGTAVDHGVVAA 303
Query: 301 GFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYPL 347
G+G +E+G +YW+++NSWG WG+ GY+++ R+ GLCG+ T+ SYP+
Sbjct: 304 GYG-SENGMDYWIVRNSWGAKWGEKGYLRVQRNVASSSGLCGLATEPSYPV 353
>gi|255555337|ref|XP_002518705.1| cysteine protease, putative [Ricinus communis]
gi|223542086|gb|EEF43630.1| cysteine protease, putative [Ricinus communis]
Length = 471
Score = 314 bits (804), Expect = 5e-83, Method: Compositional matrix adjust.
Identities = 152/313 (48%), Positives = 209/313 (66%), Gaps = 9/313 (2%)
Query: 40 EQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDL 99
+ V M+E W+ +HG++Y EKE RF+IFK+NL +I++ N +R+YK+G NRF+DL
Sbjct: 44 DSQVRRMYEMWLVEHGKAYNALGEKEKRFEIFKDNLRFIDEHNSV-DRSYKVGLNRFADL 102
Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGC 159
TN+E++A++ G KM +R + + +Y D+P ++DWR+K AV P+KDQ +CG
Sbjct: 103 TNEEYKAMFLGTKMER-KNRFLGTRSQRYLFKDGDDLPENVDWREKGAVVPVKDQGQCGS 161
Query: 160 CWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIAT 219
CWAFS V AVEGI +I LI LSEQ+LVDC + N GC GG M+ AFE+II N GI T
Sbjct: 162 CWAFSTVGAVEGINQIVTGELISLSEQELVDCDKSYNQGCNGGLMDYAFEFIINNGGIDT 221
Query: 220 EDEYPYQAVQGTCSAAQK-AAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFK 278
E++YPY+A C +K A I YE+VP DE +L KAV+ QPVS+ I A F+
Sbjct: 222 EEDYPYKASDNICDPNRKNAKVVTIDGYEDVPENDENSLKKAVAHQPVSVAIEAGGRAFQ 281
Query: 279 SYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD----- 333
YK G+F G CGT+LDH V VG+G TE+G NYW+++NSWG WG++GY+++ R+
Sbjct: 282 LYKSGVFTGRCGTELDHGVVAVGYG-TENGVNYWIVRNSWGSAWGESGYIRMERNVANTK 340
Query: 334 EGLCGIGTQSSYP 346
G CGI Q SYP
Sbjct: 341 TGKCGIAIQPSYP 353
>gi|358348957|ref|XP_003638507.1| Cysteine proteinase [Medicago truncatula]
gi|355504442|gb|AES85645.1| Cysteine proteinase [Medicago truncatula]
Length = 362
Score = 314 bits (804), Expect = 5e-83, Method: Compositional matrix adjust.
Identities = 158/346 (45%), Positives = 220/346 (63%), Gaps = 16/346 (4%)
Query: 12 KINTIPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIF 71
K+ I + I ++L+VS + + ++S+ +++E+W + H S ++ EK+ RF +F
Sbjct: 5 KLLLIVLSIALVLVVSESFDFHDKDVSSDESLWDLYERWRSHHTVS-RNLNEKQKRFNVF 63
Query: 72 KENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHR-----STTSSTF 126
K N+ ++ NK ++ YKL N+F+D+TN EF+ Y G K+ HR S TF
Sbjct: 64 KSNVMHVHNTNKM-DKPYKLKLNKFADMTNHEFKTTYAGSKVNH--HRMFRGTPRVSGTF 120
Query: 127 KYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQ 186
Y+N T P S+DWR K AVT +KDQ +CG CWAFS V AVEGI +I L+ LSEQ
Sbjct: 121 MYENF--TKAPASVDWRKKGAVTDVKDQGQCGSCWAFSTVVAVEGINQIKTNRLVPLSEQ 178
Query: 187 QLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQK-AAAAKISN 245
+L+DC N GC GG ME AFEYI Q GI TE YPY A G+C A ++ A I
Sbjct: 179 ELIDCDNQENQGCNGGLMEYAFEYIKQKGGITTESYYPYTANDGSCDATKENVPAVSIDG 238
Query: 246 YEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
+E VP+ DE ALLKAV+ QPVS+ I A ++F+ Y EG+F G CG +L+H V IVG+GTT
Sbjct: 239 HETVPANDEDALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGDCGKELNHGVAIVGYGTT 298
Query: 306 EDGANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYPL 347
DG NYW+++NSWG WG+ GY+++ R+ EGLCGI ++SYP+
Sbjct: 299 VDGTNYWIVRNSWGAEWGEQGYIRMKRNVSNKEGLCGIAMEASYPV 344
>gi|297802418|ref|XP_002869093.1| hypothetical protein ARALYDRAFT_491113 [Arabidopsis lyrata subsp.
lyrata]
gi|297314929|gb|EFH45352.1| hypothetical protein ARALYDRAFT_491113 [Arabidopsis lyrata subsp.
lyrata]
Length = 355
Score = 314 bits (804), Expect = 5e-83, Method: Compositional matrix adjust.
Identities = 148/314 (47%), Positives = 214/314 (68%), Gaps = 9/314 (2%)
Query: 38 THEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFS 97
T + ++E+ E WM++H + YK EK RF++F+ENL +I++ N E N +Y LG N F+
Sbjct: 42 TSTEKLLELFESWMSEHSKVYKSVEEKVHRFEVFRENLMHIDQRNNEIN-SYWLGLNEFA 100
Query: 98 DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQEC 157
DLT++EF+ Y G P S + S+ F+Y+++ TD+P S+DWR K AV P+KDQ +C
Sbjct: 101 DLTHEEFKGRYLGLAKPQFSRKRQPSANFRYRDI--TDLPKSVDWRKKGAVAPVKDQGQC 158
Query: 158 GCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGI 217
G CWAFS VAAVEGI +I+ NL LSEQ+L+DC T N+GC GG M+ AF+YII G+
Sbjct: 159 GSCWAFSTVAAVEGINQITTGNLSSLSEQELIDCDTTFNSGCNGGLMDYAFQYIISTGGL 218
Query: 218 ATEDEYPYQAVQGTCSAAQK-AAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTE 276
ED+YPY +G C ++ IS YE+VP D+++L+KA++ QPVS+ I A +
Sbjct: 219 HKEDDYPYLMEEGICQEQKEDVERVTISGYEDVPENDDESLVKALAHQPVSVAIEASGRD 278
Query: 277 FKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD--- 333
F+ YK G+FNG CGT LDH V VG+G+++ G++Y ++KNSWG WG+ G++++ R+
Sbjct: 279 FQFYKGGVFNGQCGTDLDHGVAAVGYGSSK-GSDYVIVKNSWGPRWGEKGFIRMKRNTGK 337
Query: 334 -EGLCGIGTQSSYP 346
EGLCGI +SYP
Sbjct: 338 PEGLCGINKMASYP 351
>gi|302816222|ref|XP_002989790.1| hypothetical protein SELMODRAFT_184826 [Selaginella moellendorffii]
gi|300142356|gb|EFJ09057.1| hypothetical protein SELMODRAFT_184826 [Selaginella moellendorffii]
Length = 358
Score = 314 bits (804), Expect = 5e-83, Method: Compositional matrix adjust.
Identities = 151/317 (47%), Positives = 212/317 (66%), Gaps = 14/317 (4%)
Query: 40 EQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDL 99
++S ++EKWM HGR Y EKE RF+IF++N EYIE+ N++ N+TY LG N F+D+
Sbjct: 27 DRSFRALYEKWMVDHGRVYNGIGEKERRFQIFRDNAEYIEEHNRQVNQTYWLGLNNFADM 86
Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGC 159
T+DEF+ALY G K+P + T S F+Y++ T++P DWR K AV +K+Q CG
Sbjct: 87 THDEFKALYFGTKVPLSN---TIKSGFRYKD--ATNLPLDTDWRSKGAVATVKNQGACGS 141
Query: 160 CWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIAT 219
CWAFS VAAVEG+ +I L+ LSEQ+LVDC N GC GG M+ AFE+IIQN G+ +
Sbjct: 142 CWAFSTVAAVEGVNQIVTGELVSLSEQELVDCDKQKNQGCNGGLMDSAFEFIIQNGGLDS 201
Query: 220 EDEYPYQAVQGTCSAAQK-AAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFK 278
E +YPY+AV G+C +++ + I +E+VP+ E LLKAV+ QPVS+ I A F+
Sbjct: 202 EADYPYKAVSGSCDESRRNSHVVTIDGFEDVPAESEADLLKAVANQPVSVAIEASGRNFQ 261
Query: 279 SYKEGIFNGVCGTQLDHAVTIVGFGT--TEDGA--NYWLIKNSWGDTWGDAGYMKILRD- 333
Y G++ G CG +LDH V VG+GT T DG +YW+++NSWGD WG++GY+++ R+
Sbjct: 262 LYSGGVYTGHCGYELDHGVVAVGYGTSKTPDGVATDYWIVRNSWGDAWGESGYIRLQRNV 321
Query: 334 ---EGLCGIGTQSSYPL 347
G CGI +SYP+
Sbjct: 322 ASPRGKCGIAMMASYPV 338
>gi|30141027|dbj|BAC75927.1| cysteine protease-5 [Helianthus annuus]
Length = 365
Score = 313 bits (803), Expect = 5e-83, Method: Compositional matrix adjust.
Identities = 147/326 (45%), Positives = 216/326 (66%), Gaps = 14/326 (4%)
Query: 33 VSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLG 92
V+S + ++ V +E W+A+HG++Y EKE RF+IF +NL++I++ N GNR+YK+G
Sbjct: 22 VTSNTRTDEEVRNTYELWLARHGKTYNALGEKESRFRIFADNLKFIDEHNLSGNRSYKVG 81
Query: 93 TNRFSDLTNDEFRALYTG-----YKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKA 147
N+F+DLTN+E+R++Y G Y+ + R S + Q M P +DWR++ A
Sbjct: 82 LNQFADLTNEEYRSMYLGTKVDPYRRIAKMQRGEISRRYAVQENEM--FPAKVDWRERGA 139
Query: 148 VTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKA 207
V+P+K+Q CG CWAFS VA+VEGI KI +LI LSEQ+LVDC N+GC GG+M+ A
Sbjct: 140 VSPVKNQGGCGSCWAFSTVASVEGINKIVTGDLISLSEQELVDCDNKYNSGCNGGSMDYA 199
Query: 208 FEYIIQNQGIATEDEYPYQAVQGTCSAAQ-KAAAAKISNYEEVPSGDEQALLKAVSMQPV 266
F++I+ N GI +E +YPY+ V C + KA I YE+VP +E+AL+KAV+ QPV
Sbjct: 200 FQFIVSNGGIDSESDYPYKGVGAVCDPVRNKAKIVSIDGYEDVPPMNEKALMKAVAHQPV 259
Query: 267 SIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAG 326
S+GI A F+ Y G+ G CGT LDH V +VG+G +E+G +YW+++NSWG WG+ G
Sbjct: 260 SVGIEASGRAFQLYTSGVLTGSCGTNLDHGVVVVGYG-SENGKDYWIVRNSWGPEWGEDG 318
Query: 327 YMKILRDE-----GLCGIGTQSSYPL 347
Y+++ R+ G+CGI +SYP+
Sbjct: 319 YIRMERNMVDTPVGMCGITLMASYPI 344
>gi|356563584|ref|XP_003550041.1| PREDICTED: cysteine proteinase RD21a-like [Glycine max]
Length = 366
Score = 313 bits (803), Expect = 5e-83, Method: Compositional matrix adjust.
Identities = 153/338 (45%), Positives = 209/338 (61%), Gaps = 7/338 (2%)
Query: 16 IPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENL 75
I + + +SCA + + + V+ M+E+W+ +H + Y EK+ RF++FK+NL
Sbjct: 9 ISTLLFLSFTLSCAIDTSTITNYTDNEVMTMYEEWLVKHQKVYNGLGEKDKRFQVFKDNL 68
Query: 76 EYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSST-FKYQNLSMT 134
+I++ N N TYKLG N+F+D+TN+E+R +Y G K + T ST +Y +
Sbjct: 69 GFIQEHNNNQNNTYKLGLNKFADMTNEEYRVMYFGTKSDAKRRLMKTKSTGHRYAYSAGD 128
Query: 135 DVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTN 194
+P +DWR K AV PIKDQ CG CWAFS VA VE I KI + LSEQ+LVDC
Sbjct: 129 QLPVHVDWRVKGAVAPIKDQGSCGSCWAFSTVATVEAINKIVTGKFVSLSEQELVDCDRA 188
Query: 195 GNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQK-AAAAKISNYEEVPSGD 253
N GC GG M+ AFE+IIQN GI T+ +YPY+ G C +K A A I YE+VP D
Sbjct: 189 YNQGCNGGLMDYAFEFIIQNGGIDTDKDYPYRGFDGICDPTKKNAKAVNIDGYEDVPPYD 248
Query: 254 EQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWL 313
E AL KAV+ QPVSI I A + Y+ G+F G CGT LDH V +VG+G +E+G +YWL
Sbjct: 249 ENALKKAVARQPVSIAIEASGRALQLYQSGVFTGECGTSLDHGVVVVGYG-SENGVDYWL 307
Query: 314 IKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYPL 347
++NSWG WG+ GY K+ R+ G CGI ++SYP+
Sbjct: 308 VRNSWGTGWGEDGYFKMQRNVRTPTGKCGITMEASYPV 345
>gi|225438807|ref|XP_002283263.1| PREDICTED: germination-specific cysteine protease 1-like isoform 1
[Vitis vinifera]
Length = 374
Score = 313 bits (802), Expect = 7e-83, Method: Compositional matrix adjust.
Identities = 152/335 (45%), Positives = 224/335 (66%), Gaps = 9/335 (2%)
Query: 20 IIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIE 79
++ + V+ ++ +SS E+ V+ M++ WMA+HG++Y EKE RF+IFK+NL++I+
Sbjct: 19 LLFLFFVASSAADLSSSWRSEEEVMGMYQWWMAKHGKAYNGLGEKEKRFEIFKDNLKFID 78
Query: 80 KANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKM-PSPSHRSTTSSTFKYQNLSMTDVPT 138
+ N + NRTYK+G NRF+DLTN+E+RA+Y G + P +++ +Y + +P
Sbjct: 79 EHNAQ-NRTYKVGLNRFADLTNEEYRAIYLGTRSDPKRRFAKLKNASPRYAVMPGEVLPE 137
Query: 139 SLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNG 198
S+DWR+ AV P+KDQ+ CG CWAFS VAAVEGI +I LI LSEQ+LVDC T + G
Sbjct: 138 SVDWRETGAVNPVKDQRSCGSCWAFSTVAAVEGINQIVTGELISLSEQELVDCDTEYDMG 197
Query: 199 CGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAA-AAKISNYEEVPSGDEQAL 257
C GG M+ AF++II+N G+ TE +YPY G C+ + K++ I YE+VP DE+AL
Sbjct: 198 CNGGLMDYAFDFIIKNGGLDTEKDYPYTGFDGECNLSGKSSKVVSIDGYEDVPPFDEKAL 257
Query: 258 LKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNS 317
KAV+ QPVS+ + A + Y GIF G CGT LDH + VG+G TE+G +YW+++NS
Sbjct: 258 QKAVAHQPVSVAVEAGGRALQLYVSGIFTGECGTALDHGIVAVGYG-TENGTDYWIVRNS 316
Query: 318 WGDTWGDAGYMKILRD-----EGLCGIGTQSSYPL 347
WG +WG+ GY+++ R+ G CGI ++SYP+
Sbjct: 317 WGSSWGENGYIRMERNMADAFSGKCGIAMEASYPI 351
>gi|162459393|ref|NP_001105993.1| cysteine protease component of protease-inhibitor complex precursor
[Zea mays]
gi|6682829|dbj|BAA88898.1| cysteine protease component of protease-inhibitor complex [Zea
mays]
Length = 465
Score = 313 bits (802), Expect = 8e-83, Method: Compositional matrix adjust.
Identities = 154/324 (47%), Positives = 210/324 (64%), Gaps = 12/324 (3%)
Query: 32 VVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKAN---KEGNRT 88
+VS ++ M+ +WMA HGR+Y E+E R+++F++NL YI+ N G +
Sbjct: 29 IVSYGERSDEEARRMYAEWMAAHGRTYNAVGEEERRYQVFRDNLRYIDAHNAAADAGVHS 88
Query: 89 YKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAV 148
++LG NRF+DLTNDE+RA Y G + R + +Y D+P S+DWR K AV
Sbjct: 89 FRLGLNRFADLTNDEYRATYLGARTRPQRERKLGA---RYHAADNEDLPESVDWRAKGAV 145
Query: 149 TPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAF 208
+KDQ G CWAFS +AAVEGI +I +LI LSEQ+LVDC T+ N GC GG M+ AF
Sbjct: 146 AEVKDQGSYGSCWAFSTIAAVEGINQIVTGDLISLSEQELVDCDTSYNQGCNGGLMDYAF 205
Query: 209 EYIIQNQGIATEDEYPYQAVQGTCSAAQK-AAAAKISNYEEVPSGDEQALLKAVSMQPVS 267
E+II N GI TE +YPY+ G C +K A I +YE+VP+ DE++L KAV+ QPVS
Sbjct: 206 EFIINNGGIDTEKDYPYKGTDGRCDVNRKNAKVVTIDSYEDVPANDEKSLQKAVANQPVS 265
Query: 268 IGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGY 327
+ I A T+F+ Y GIF G CGT LDH VT VG+G TE+G +YW++KNSWG +WG++GY
Sbjct: 266 VAIEAAGTQFQLYSSGIFTGSCGTALDHGVTAVGYG-TENGKDYWIVKNSWGSSWGESGY 324
Query: 328 MKILRD----EGLCGIGTQSSYPL 347
+++ R+ G CGI + SYPL
Sbjct: 325 VRMERNIKASSGKCGIAVEPSYPL 348
>gi|148927382|gb|ABR19827.1| cysteine proteinase [Elaeis guineensis]
Length = 470
Score = 313 bits (802), Expect = 8e-83, Method: Compositional matrix adjust.
Identities = 156/319 (48%), Positives = 210/319 (65%), Gaps = 12/319 (3%)
Query: 36 RSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKAN---KEGNRTYKLG 92
RS E + ++E W+A+HGR+Y EKE RF+IFK+N+ +I+ N G+R+++LG
Sbjct: 41 RSEEEMRI--LYEGWLAKHGRAYNALGEKERRFEIFKDNVLFIDAHNAAADAGHRSFRLG 98
Query: 93 TNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIK 152
NRF+D+TN+E+RA+Y G + P+ R + +Y+ + D+P S+DWR K AV +K
Sbjct: 99 LNRFADMTNEEYRAVYLGTR-PAGHRRRARVGSDRYRYNAGEDLPESVDWRAKGAVAAVK 157
Query: 153 DQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYII 212
DQ CG CWAFS VAAVEGI KI +LI LSEQ+LVDC N GC GG M+ FE+II
Sbjct: 158 DQGSCGSCWAFSTVAAVEGINKIVTGDLISLSEQELVDCDNGYNQGCNGGLMDYGFEFII 217
Query: 213 QNQGIATEDEYPYQAVQGTCSAAQK-AAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIA 271
N GI TE++YPY A G C +K A I YE+VP DE+AL KAV+ QPVS+ I
Sbjct: 218 NNGGIDTEEDYPYTARDGKCDQYRKNAKVVSIDGYEDVPVNDEKALQKAVANQPVSVAIE 277
Query: 272 AYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKIL 331
A EF+ Y GIF G CGT LDH V VG+G TE+G +YW+++NSWG WG++GY+++
Sbjct: 278 AGGREFQLYHSGIFTGRCGTDLDHGVVAVGYG-TENGKDYWIVRNSWGGDWGESGYIRME 336
Query: 332 RD----EGLCGIGTQSSYP 346
R+ G CGI + SYP
Sbjct: 337 RNVNTSTGKCGIAIEPSYP 355
>gi|357113934|ref|XP_003558756.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like
[Brachypodium distachyon]
Length = 346
Score = 313 bits (801), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 150/337 (44%), Positives = 214/337 (63%), Gaps = 10/337 (2%)
Query: 18 MFIIIILLVSCASQVVSSR--STHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENL 75
+ I+ L C++ V+++R + ++ HE+WMAQ GR YKD EK R ++FK N+
Sbjct: 10 LVAIVGCLCLCSTAVLAARELGDADNAMAARHEQWMAQFGRVYKDPAEKAHRLEVFKANV 69
Query: 76 EYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD 135
+IE N E N + LG N+F+DLTNDEFRA T + R + FKY ++S+
Sbjct: 70 AFIESFNAE-NHEFWLGANQFADLTNDEFRASKTNKGIKQGGVRDAPTG-FKYSDVSIDA 127
Query: 136 VPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNG 195
+P S+DWR K AVTPIK+Q +CG CWAFSAVAA EG+ K+S L+ LSEQ+LVDC +G
Sbjct: 128 LPASVDWRTKGAVTPIKNQGQCGSCWAFSAVAATEGVVKLSTGKLVSLSEQELVDCDVHG 187
Query: 196 -NNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKA-AAAKISNYEEVPSGD 253
+ GC GG M+ AF++II+N G+ TE YPY C + + AA I YE+VP+ D
Sbjct: 188 VDQGCMGGWMDDAFKFIIKNGGLTTEANYPYTGEDDKCKSNETVNVAATIKGYEDVPAND 247
Query: 254 EQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWL 313
E AL+KAV+ QPVS+ + F+ Y G+ G CG ++DH + +G+G T +G YWL
Sbjct: 248 ESALMKAVAHQPVSVVVDGGDMTFQLYAGGVMTGSCGVEMDHGIAAIGYGATSNGTKYWL 307
Query: 314 IKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYP 346
+KNSWG TWG+ G++++ +D G+CG+ + SYP
Sbjct: 308 MKNSWGTTWGEKGFLRMAKDIPDKRGMCGLAMKPSYP 344
>gi|356543112|ref|XP_003540007.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 345
Score = 313 bits (801), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 152/324 (46%), Positives = 208/324 (64%), Gaps = 10/324 (3%)
Query: 33 VSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLG 92
+ SR E E HE WMAQ+G+ YKD EK+ RF+IFK N+ +IE N G++ + L
Sbjct: 24 IMSRRLFEACTSERHENWMAQYGKVYKDAAEKKKRFQIFKNNVHFIESFNTAGDKPFNLS 83
Query: 93 TNRFSDLTNDEFRALYTGYKMPSPSHRST---TSSTFKYQNLSMTDVPTSLDWRDKKAVT 149
N+F+DL ++EF+AL T S T T ++FKY + T + ++DWR + AVT
Sbjct: 84 INQFADLHDEEFKALLTNGNKKVRSVVGTATETETSFKYNRV--TKLLATMDWRKRGAVT 141
Query: 150 PIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFE 209
PIKDQ+ CG CWAFSAVAA+EGI +I+ + L+ LSEQ+LVDC + GC GG ME AFE
Sbjct: 142 PIKDQRRCGSCWAFSAVAAIEGIHQITTSKLVSLSEQELVDCVKGESEGCNGGYMEDAFE 201
Query: 210 YIIQNQGIATEDEYPYQAVQGTCSAAQKA-AAAKISNYEEVPSGDEQALLKAVSMQPVSI 268
++ + GIA+E YPY+ +C ++ ++I YE+VPS E+AL KAV+ QPVS+
Sbjct: 202 FVAKKGGIASESYYPYKGKDKSCKVKKETHGVSQIKGYEKVPSNSEKALQKAVAHQPVSV 261
Query: 269 GIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYM 328
+ A F+ Y GIF G CGT DHA+T+VG+G + G YWL+KNSWG WG+ GY+
Sbjct: 262 YVEAGGNAFQFYSSGIFTGKCGTNTDHAITVVGYGKSRGGTKYWLVKNSWGAGWGEKGYI 321
Query: 329 KILRD----EGLCGIGTQSSYPLA 348
++ RD EGLCGI + YP A
Sbjct: 322 RMKRDIRAKEGLCGIAMNAFYPTA 345
>gi|358248896|ref|NP_001239703.1| uncharacterized protein LOC100799247 precursor [Glycine max]
gi|255636729|gb|ACU18700.1| unknown [Glycine max]
Length = 341
Score = 313 bits (801), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 149/311 (47%), Positives = 204/311 (65%), Gaps = 9/311 (2%)
Query: 45 EMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEF 104
E HEKWMAQ+G+ YKD EKE RF++FK N+++IE N G++ + L N+F+DL ++EF
Sbjct: 33 ERHEKWMAQYGKVYKDAAEKEKRFQVFKNNVQFIESFNAAGDKPFNLSINQFADLHDEEF 92
Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQ-QECGCCWAF 163
+AL + + + T ++F+Y+N+ T +P+++DWR + AVTPIKDQ CG CWAF
Sbjct: 93 KALLNNVQKKASRVETATETSFRYENV--TKIPSTMDWRKRGAVTPIKDQGYTCGSCWAF 150
Query: 164 SAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEY 223
+ VA VE + +I+ L+ LSEQ+LVDC + GC GG +E AFE+I GI +E Y
Sbjct: 151 ATVATVESLHQITTGELVSLSEQELVDCVRGDSEGCRGGYVENAFEFIANKGGITSEAYY 210
Query: 224 PYQAVQGTCSAAQKA-AAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKE 282
PY+ +C ++ A+I YE VPS E+ALLKAV+ QPVS+ I A FK Y
Sbjct: 211 PYKGKDRSCKVKKETHGVARIIGYESVPSNSEKALLKAVANQPVSVYIDAGAIAFKFYSS 270
Query: 283 GIFNGV-CGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD----EGLC 337
GIF CGT LDHAV +VG+G DG YWL+KNSW WG+ GYM+I RD +GLC
Sbjct: 271 GIFEARNCGTHLDHAVAVVGYGKLRDGTKYWLVKNSWSTAWGEKGYMRIKRDIRAKKGLC 330
Query: 338 GIGTQSSYPLA 348
GI + +SYP+A
Sbjct: 331 GIASNASYPIA 341
>gi|302816909|ref|XP_002990132.1| hypothetical protein SELMODRAFT_428615 [Selaginella moellendorffii]
gi|300142145|gb|EFJ08849.1| hypothetical protein SELMODRAFT_428615 [Selaginella moellendorffii]
Length = 358
Score = 313 bits (801), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 150/311 (48%), Positives = 209/311 (67%), Gaps = 14/311 (4%)
Query: 46 MHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFR 105
++EKWM HGR Y EKE RF+IF++N EYIE+ N++ N+TY LG N F+D+T+DEF+
Sbjct: 33 LYEKWMVDHGRVYNGIGEKERRFQIFRDNAEYIEEHNRQVNQTYWLGLNNFADMTHDEFK 92
Query: 106 ALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSA 165
ALY G K+P + T S F+Y++ T++P DWR K AV +K+Q CG CWAFS
Sbjct: 93 ALYFGTKVPLSN---TIKSGFRYED--ATNLPLDTDWRSKGAVATVKNQGACGSCWAFST 147
Query: 166 VAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPY 225
VAAVEG+ +I L+ LSEQ+LVDC N GC GG M+ AFE+IIQN G+ +E +YPY
Sbjct: 148 VAAVEGVNQIVTGELVSLSEQELVDCDKQKNQGCNGGLMDSAFEFIIQNGGLDSEADYPY 207
Query: 226 QAVQGTCSAAQK-AAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGI 284
+AV G+C +++ + I +E+VP+ E LLKAV+ QPVS+ I A F+ Y G+
Sbjct: 208 KAVSGSCDESRRNSHVVTIDGFEDVPAESEADLLKAVANQPVSVAIEASGRNFQLYSGGV 267
Query: 285 FNGVCGTQLDHAVTIVGFGT--TEDGA--NYWLIKNSWGDTWGDAGYMKILRD----EGL 336
+ G CG +LDH V VG+GT T DG +YW+++NSWGD WG++GY+++ R+ G
Sbjct: 268 YTGHCGYELDHGVVAVGYGTSKTPDGVATDYWIVRNSWGDAWGESGYIRLQRNVASSRGK 327
Query: 337 CGIGTQSSYPL 347
CGI +SYP+
Sbjct: 328 CGIAMMASYPV 338
>gi|2511693|emb|CAB17076.1| cysteine proteinase precursor [Phaseolus vulgaris]
Length = 455
Score = 312 bits (800), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 155/344 (45%), Positives = 220/344 (63%), Gaps = 17/344 (4%)
Query: 18 MFIIIILLVSCASQVVSSRSTH--------EQSVVEMHEKWMAQHGRSYKDELEKEMRFK 69
+F + L + ++S + H ++ V ++E+W+ +HG+ Y EK+ RF+
Sbjct: 3 LFALFALSSALDMSIISYDNAHQDKATWRTDEEVNSLYEEWLVKHGKLYNALGEKDKRFQ 62
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQ 129
IFK+NL +I++ N E NRTYKLG NRF+DLTN+E+RA Y G K+ P+ R + + +Y
Sbjct: 63 IFKDNLRFIDQQNAE-NRTYKLGLNRFADLTNEEYRARYLGTKI-DPNRRLGRTPSNRYA 120
Query: 130 NLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLV 189
+P S+DWR + AV P+KDQ CG CWAFSA+ AVEGI KI +LI LSEQ+LV
Sbjct: 121 PRVGETLPDSVDWRKEGAVVPVKDQASCGSCWAFSAIGAVEGINKIVTGDLISLSEQELV 180
Query: 190 DCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQK-AAAAKISNYEE 248
DC T N GC GG M+ AFE+II+N GI +E++YPY+ V G C +K A I YE+
Sbjct: 181 DCDTGYNMGCNGGLMDYAFEFIIKNGGIDSEEDYPYKGVDGRCDEYRKNAKVVSIDGYED 240
Query: 249 VPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDG 308
V + DE AL KAV+ QPVS+ + EF+ Y G+F G CGT LDH V VG+G T++G
Sbjct: 241 VNTYDELALKKAVANQPVSVAVEGGGREFQLYSSGVFTGRCGTALDHGVVAVGYG-TDNG 299
Query: 309 ANYWLIKNSWGDTWGDAGYMKILRD-----EGLCGIGTQSSYPL 347
++W+++NSWG WG+ GY+++ R+ G CGI + SYP+
Sbjct: 300 HDFWIVRNSWGADWGEEGYIRLERNLGNSRSGKCGIAIEPSYPI 343
>gi|162463464|ref|NP_001104879.1| cysteine proteinase Mir3 precursor [Zea mays]
gi|2425066|gb|AAB88263.1| cysteine proteinase Mir3 [Zea mays]
Length = 480
Score = 312 bits (800), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 153/324 (47%), Positives = 209/324 (64%), Gaps = 12/324 (3%)
Query: 32 VVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKAN---KEGNRT 88
+VS ++ M+ +WMA HGR+Y +E R+++F++NL YI+ N G +
Sbjct: 29 IVSYGERTDEEARRMYAEWMAAHGRTYNAVGAEERRYQVFRDNLRYIDAHNAAADAGVHS 88
Query: 89 YKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAV 148
++LG NRF+DLTNDE+ A Y G + R + +Y D+P S+DWR K AV
Sbjct: 89 FRLGLNRFADLTNDEYPATYLGARTRPQRDRKLGA---RYHAADNEDLPESVDWRAKGAV 145
Query: 149 TPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAF 208
+KDQ CG CWAFS +AAVEGI +I +LI LSEQ+LVDC T+ N GC GG M+ AF
Sbjct: 146 AEVKDQGSCGTCWAFSTIAAVEGINQIVTGDLISLSEQELVDCDTSYNQGCNGGLMDYAF 205
Query: 209 EYIIQNQGIATEDEYPYQAVQGTCSAAQK-AAAAKISNYEEVPSGDEQALLKAVSMQPVS 267
E+II N GI TE +YPY+ G C +K A I +YE+VP+ DE++L KAV+ QPVS
Sbjct: 206 EFIINNGGIDTEKDYPYKGTDGRCDVNRKNAKVVTIDSYEDVPANDEKSLQKAVANQPVS 265
Query: 268 IGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGY 327
+ I A T F+ Y GIF G CGT+LDH VT VG+G TE+G +YW++KNSWG +WG++GY
Sbjct: 266 VAIEAAGTAFQLYSSGIFTGSCGTRLDHGVTAVGYG-TENGKDYWIVKNSWGSSWGESGY 324
Query: 328 MKILRD----EGLCGIGTQSSYPL 347
+++ R+ G CGI + SYPL
Sbjct: 325 VRMERNIKASSGKCGIAVEPSYPL 348
>gi|22093636|dbj|BAC06931.1| putative cysteine proteinase [Oryza sativa Japonica Group]
gi|50510021|dbj|BAD30633.1| putative cysteine proteinase [Oryza sativa Japonica Group]
Length = 352
Score = 312 bits (800), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 153/351 (43%), Positives = 222/351 (63%), Gaps = 19/351 (5%)
Query: 10 SFKINTIPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFK 69
S K+ + +++++ ++ + ++ ++ H+KWMA+HGR+YKD EK RF+
Sbjct: 5 SSKLQVMAASLLLVVAGGLSTMAKVTMASRAGTMEARHDKWMAEHGRTYKDAAEKARRFR 64
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQ 129
+FK N++ I+++N GN+ Y+L TNRF+DLT+ EF A+YTGY + + + ++T
Sbjct: 65 VFKANVDLIDRSNAAGNKRYRLATNRFTDLTDAEFAAMYTGYNPANTMYAAANATT---- 120
Query: 130 NLSMTD--VPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQ 187
LS D P +DWR + AVT +K+Q+ CGCCWAFS VAAVEGI +I+ L+ LSEQQ
Sbjct: 121 RLSSEDDQQPAEVDWRQQGAVTGVKNQRSCGCCWAFSTVAAVEGIHQITTGELVSLSEQQ 180
Query: 188 LVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTC----SAAQKAAAAKI 243
L+DC+ NG GC GG+++ AF+Y+ + G+ TE Y YQ QG C S++ AA I
Sbjct: 181 LLDCADNG--GCTGGSLDNAFQYMANSGGVTTEAAYAYQGAQGACQFDASSSASGVAATI 238
Query: 244 SNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNG-VCGTQLDHAVTIVGF 302
S Y+ V DE +L AV+ QPVS+ I F+ Y G+F CGT+LDHAV +VG+
Sbjct: 239 SGYQRVNPNDEGSLAAAVASQPVSVAIEGSGAMFRHYGSGVFTADSCGTKLDHAVAVVGY 298
Query: 303 GTTEDGA---NYWLIKNSWGDTWGDAGYMKILRD---EGLCGIGTQSSYPL 347
G DG+ YW+IKNSWG TWGD GYMK+ +D +G CG+ SYP+
Sbjct: 299 GAEADGSGGGGYWIIKNSWGTTWGDGGYMKLEKDVGSQGACGVAMAPSYPV 349
>gi|62320725|dbj|BAD95392.1| cysteine proteinase RD21A [Arabidopsis thaliana]
Length = 433
Score = 312 bits (800), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 158/350 (45%), Positives = 226/350 (64%), Gaps = 25/350 (7%)
Query: 18 MFIIIILLVSCASQV----VSSRSTH---------EQSVVEMHEKWMAQHGR--SYKDEL 62
M I+ + +V+ +S V +S H E V+ ++E W+ +HG+ S +
Sbjct: 8 MAILFLAMVAVSSAVDMSIISYDEKHGVSTTGGRSEAEVMSIYEAWLVKHGKAQSQNSLV 67
Query: 63 EKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTT 122
EK+ RF+IFK+NL ++++ N E N +Y+LG RF+DLTNDE+R+ Y G KM R T+
Sbjct: 68 EKDRRFEIFKDNLRFVDEHN-EKNLSYRLGLTRFADLTNDEYRSKYLGAKMEKKGERRTS 126
Query: 123 SSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQ 182
+Y+ ++P S+DWR K AV +KDQ CG CWAFS + AVEGI +I +LI
Sbjct: 127 ---LRYEARVGDELPESIDWRKKGAVAEVKDQGGCGSCWAFSTIGAVEGINQIVTGDLIT 183
Query: 183 LSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQK-AAAA 241
LSEQ+LVDC T+ N GC GG M+ AFE+II+N GI T+ +YPY+ V GTC +K A
Sbjct: 184 LSEQELVDCDTSYNEGCNGGLMDYAFEFIIKNGGIDTDKDYPYKGVDGTCDQIRKNAKVV 243
Query: 242 KISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVG 301
I +YE+VP+ E++L KAV+ QP+SI I A F+ Y GIF+G CGTQLDH V VG
Sbjct: 244 TIDSYEDVPTYSEESLKKAVAHQPISIAIEAGGRAFQLYDSGIFDGSCGTQLDHGVVAVG 303
Query: 302 FGTTEDGANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYPL 347
+G TE+G +YW+++NSWG +WG++GY+++ R+ G CGI + SYP+
Sbjct: 304 YG-TENGKDYWIVRNSWGKSWGESGYLRMARNIASSSGKCGIAIEPSYPI 352
>gi|334185815|ref|NP_680113.3| putative cysteine proteinase [Arabidopsis thaliana]
gi|75313879|sp|Q9STL4.1|CEP2_ARATH RecName: Full=KDEL-tailed cysteine endopeptidase CEP2; Flags:
Precursor
gi|4678354|emb|CAB41164.1| cysteine endopeptidase-like protein [Arabidopsis thaliana]
gi|332644882|gb|AEE78403.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 361
Score = 312 bits (800), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 156/348 (44%), Positives = 221/348 (63%), Gaps = 19/348 (5%)
Query: 12 KINTIPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHG--RSYKDELEKEMRFK 69
K+ I +F ++IL +C E+ + ++++W + H RS E+E RF
Sbjct: 3 KLLLIFLFSLVILQTACGFDYDDKEIESEEGLSTLYDRWRSHHSVPRSLN---EREKRFN 59
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTG-----YKMPSPSHRSTTSS 124
+F+ N+ ++ NK+ NR+YKL N+F+DLT +EF+ YTG ++M R +
Sbjct: 60 VFRHNVMHVHNTNKK-NRSYKLKLNKFADLTINEFKNAYTGSNIKHHRMLQGPKRGSKQF 118
Query: 125 TFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLS 184
+ ++NLS +P+S+DWR K AVT IK+Q +CG CWAFS VAAVEGI KI L+ LS
Sbjct: 119 MYDHENLSK--LPSSVDWRKKGAVTEIKNQGKCGSCWAFSTVAAVEGINKIKTNKLVSLS 176
Query: 185 EQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQ-KAAAAKI 243
EQ+LVDC T N GC GG ME AFE+I +N GI TED YPY+ + G C A++ I
Sbjct: 177 EQELVDCDTKQNEGCNGGLMEIAFEFIKKNGGITTEDSYPYEGIDGKCDASKDNGVLVTI 236
Query: 244 SNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFG 303
+E+VP DE ALLKAV+ QPVS+ I A +++F+ Y EG+F G CGT+L+H V VG+G
Sbjct: 237 DGHEDVPENDENALLKAVANQPVSVAIDAGSSDFQFYSEGVFTGSCGTELNHGVAAVGYG 296
Query: 304 TTEDGANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYPL 347
+E G YW+++NSWG WG+ GY+KI R+ EG CGI ++SYP+
Sbjct: 297 -SERGKKYWIVRNSWGAEWGEGGYIKIEREIDEPEGRCGIAMEASYPI 343
>gi|14517542|gb|AAK62661.1| F2G19.31/F2G19.31 [Arabidopsis thaliana]
gi|19548039|gb|AAL87383.1| F2G19.31/F2G19.31 [Arabidopsis thaliana]
Length = 462
Score = 312 bits (800), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 158/350 (45%), Positives = 226/350 (64%), Gaps = 25/350 (7%)
Query: 18 MFIIIILLVSCASQV----VSSRSTH---------EQSVVEMHEKWMAQHGR--SYKDEL 62
M I+ + +V+ +S V +S H E V+ ++E W+ +HG+ S +
Sbjct: 8 MAILFLAMVTVSSAVDMSIISYDEKHGVSTTGGRSEAEVMSIYEAWLVKHGKAQSQNSLV 67
Query: 63 EKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTT 122
EK+ RF+IFK+NL ++++ N E N +Y+LG RF+DLTNDE+R+ Y G KM R T+
Sbjct: 68 EKDRRFEIFKDNLRFVDEHN-EKNLSYRLGLTRFADLTNDEYRSKYLGAKMEKKGERRTS 126
Query: 123 SSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQ 182
+Y+ ++P S+DWR K AV +KDQ CG CWAFS + AVEGI +I +LI
Sbjct: 127 ---LRYEARVGDELPESIDWRKKGAVAEVKDQGGCGSCWAFSTIGAVEGINQIVTGDLIT 183
Query: 183 LSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQK-AAAA 241
LSEQ+LVDC T+ N GC GG M+ AFE+II+N GI T+ +YPY+ V GTC +K A
Sbjct: 184 LSEQELVDCDTSYNEGCNGGLMDYAFEFIIKNGGIDTDKDYPYKGVDGTCDQIRKNAKVV 243
Query: 242 KISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVG 301
I +YE+VP+ E++L KAV+ QP+SI I A F+ Y GIF+G CGTQLDH V VG
Sbjct: 244 TIDSYEDVPTYSEESLKKAVAHQPISIAIEAGGRAFQLYDSGIFDGSCGTQLDHGVVAVG 303
Query: 302 FGTTEDGANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYPL 347
+G TE+G +YW+++NSWG +WG++GY+++ R+ G CGI + SYP+
Sbjct: 304 YG-TENGKDYWIVRNSWGKSWGESGYLRMARNIASSSGKCGIAIEPSYPI 352
>gi|18401614|ref|NP_564497.1| cysteine proteinase RD21a [Arabidopsis thaliana]
gi|1172873|sp|P43297.1|RD21A_ARATH RecName: Full=Cysteine proteinase RD21a; Short=RD21; Flags:
Precursor
gi|12321010|gb|AAG50628.1|AC083835_13 cysteine protease, putative [Arabidopsis thaliana]
gi|435619|dbj|BAA02374.1| thiol protease [Arabidopsis thaliana]
gi|18175926|gb|AAL59952.1| putative cysteine proteinase RD21A [Arabidopsis thaliana]
gi|22136972|gb|AAM91715.1| putative cysteine proteinase RD21A [Arabidopsis thaliana]
gi|332194014|gb|AEE32135.1| cysteine proteinase RD21a [Arabidopsis thaliana]
Length = 462
Score = 312 bits (800), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 158/350 (45%), Positives = 226/350 (64%), Gaps = 25/350 (7%)
Query: 18 MFIIIILLVSCASQV----VSSRSTH---------EQSVVEMHEKWMAQHGR--SYKDEL 62
M I+ + +V+ +S V +S H E V+ ++E W+ +HG+ S +
Sbjct: 8 MAILFLAMVAVSSAVDMSIISYDEKHGVSTTGGRSEAEVMSIYEAWLVKHGKAQSQNSLV 67
Query: 63 EKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTT 122
EK+ RF+IFK+NL ++++ N E N +Y+LG RF+DLTNDE+R+ Y G KM R T+
Sbjct: 68 EKDRRFEIFKDNLRFVDEHN-EKNLSYRLGLTRFADLTNDEYRSKYLGAKMEKKGERRTS 126
Query: 123 SSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQ 182
+Y+ ++P S+DWR K AV +KDQ CG CWAFS + AVEGI +I +LI
Sbjct: 127 ---LRYEARVGDELPESIDWRKKGAVAEVKDQGGCGSCWAFSTIGAVEGINQIVTGDLIT 183
Query: 183 LSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQK-AAAA 241
LSEQ+LVDC T+ N GC GG M+ AFE+II+N GI T+ +YPY+ V GTC +K A
Sbjct: 184 LSEQELVDCDTSYNEGCNGGLMDYAFEFIIKNGGIDTDKDYPYKGVDGTCDQIRKNAKVV 243
Query: 242 KISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVG 301
I +YE+VP+ E++L KAV+ QP+SI I A F+ Y GIF+G CGTQLDH V VG
Sbjct: 244 TIDSYEDVPTYSEESLKKAVAHQPISIAIEAGGRAFQLYDSGIFDGSCGTQLDHGVVAVG 303
Query: 302 FGTTEDGANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYPL 347
+G TE+G +YW+++NSWG +WG++GY+++ R+ G CGI + SYP+
Sbjct: 304 YG-TENGKDYWIVRNSWGKSWGESGYLRMARNIASSSGKCGIAIEPSYPI 352
>gi|357474523|ref|XP_003607546.1| Cysteine proteinase [Medicago truncatula]
gi|358347207|ref|XP_003637651.1| Cysteine proteinase [Medicago truncatula]
gi|355503586|gb|AES84789.1| Cysteine proteinase [Medicago truncatula]
gi|355508601|gb|AES89743.1| Cysteine proteinase [Medicago truncatula]
Length = 345
Score = 312 bits (800), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 167/350 (47%), Positives = 227/350 (64%), Gaps = 24/350 (6%)
Query: 14 NTIPMFIIIILLVSCASQVVSSRS----THEQ--SVVEMHEK----WMAQHGRSYKDELE 63
+TI I ILL+ C + V++S S TH+Q S VE +K W+ +HGR YK E
Sbjct: 3 STILTTTIFILLMLCNTCVIASESECPPTHKQKSSDVEAMKKRFDGWVKRHGRKYKHNDE 62
Query: 64 KEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTS 123
+E+RF I++ N++YI+ N + N +Y L N+F+DLTN+EF++ Y G SH +
Sbjct: 63 REVRFGIYQANVQYIQCKNAQKN-SYNLTDNKFADLTNEEFQSTYMGLSTRLRSH----N 117
Query: 124 STFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQL 183
+ F+Y D+P S DWR + AVT I DQ +CG CWAF+AVAAVEGI KI LI L
Sbjct: 118 TGFRYDEHG--DLPESKDWRKEGAVTEIMDQGQCGGCWAFAAVAAVEGINKIKSGKLISL 175
Query: 184 SEQQLVDCST-NGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAA-AA 241
SEQ+L+DC +GN GC GG ME A+ +II+N G+ TE +YPY+ V GTC + A AA
Sbjct: 176 SEQELIDCDVKSGNQGCQGGLMETAYTFIIENGGLTTEQDYPYEGVDGTCKMEKAAHYAA 235
Query: 242 KISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVG 301
IS YEEVP+ +E L A + QPVS+ I A F+ Y EG+F+G+CG QL+H VT+VG
Sbjct: 236 SISGYEEVPADNEAKLKAAAAHQPVSVAIDAGGYSFQFYSEGVFSGICGKQLNHGVTVVG 295
Query: 302 FGTTEDGANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYPL 347
+G E YW++KNSWG WG++GY+++ RD EG+CGI Q+SYPL
Sbjct: 296 YG-KETINKYWIVKNSWGADWGESGYIRMKRDTLSKEGMCGIAMQASYPL 344
>gi|226496089|ref|NP_001149658.1| cysteine protease 1 precursor [Zea mays]
gi|195629242|gb|ACG36262.1| cysteine protease 1 precursor [Zea mays]
Length = 469
Score = 312 bits (799), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 153/324 (47%), Positives = 208/324 (64%), Gaps = 12/324 (3%)
Query: 32 VVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKAN---KEGNRT 88
+VS E+ M+ +WMA HGR+Y E+E RF++F++NL Y++ N G +
Sbjct: 31 IVSYGERSEEEARRMYAEWMAAHGRTYNAVGEEERRFEVFRDNLRYVDAHNAAADAGVHS 90
Query: 89 YKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAV 148
++LG NRF+DLTNDE+RA Y G + R +Y D+P S+DWR K AV
Sbjct: 91 FRLGLNRFADLTNDEYRATYLGVRSRPQRERRLGD---RYLAGDNEDLPESVDWRAKGAV 147
Query: 149 TPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAF 208
IKDQ CG CWAFS +AAVEGI +I ++I LSEQ+LVDC T+ N GC GG M+ AF
Sbjct: 148 AEIKDQGSCGSCWAFSTIAAVEGINQIVTGDMISLSEQELVDCDTSYNQGCNGGLMDYAF 207
Query: 209 EYIIQNQGIATEDEYPYQAVQGTCSAAQK-AAAAKISNYEEVPSGDEQALLKAVSMQPVS 267
E+II N GI TE++YPY+ G C +K A I +YE+VP+ E++L KAV+ QP+S
Sbjct: 208 EFIINNGGIDTEEDYPYKGTDGRCDVNRKNAKVVTIDSYEDVPANSEKSLQKAVANQPIS 267
Query: 268 IGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGY 327
+ I A F+ Y GIF G CGT LDH VT VG+G TE+G +YW++KNSWG +WG++GY
Sbjct: 268 VAIEAGGRAFQLYNSGIFTGTCGTALDHGVTAVGYG-TENGKDYWIVKNSWGSSWGESGY 326
Query: 328 MKILRD----EGLCGIGTQSSYPL 347
+++ R+ G CGI + SYPL
Sbjct: 327 VRMERNIKASSGKCGIAVEPSYPL 350
>gi|242092702|ref|XP_002436841.1| hypothetical protein SORBIDRAFT_10g009840 [Sorghum bicolor]
gi|241915064|gb|EER88208.1| hypothetical protein SORBIDRAFT_10g009840 [Sorghum bicolor]
Length = 328
Score = 312 bits (799), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 155/341 (45%), Positives = 215/341 (63%), Gaps = 26/341 (7%)
Query: 15 TIPMFIIIILLVS--CASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFK 72
TI I+ IL ++ C + + + + ++V HE+WM Q+ R YKD EK RF++FK
Sbjct: 3 TIKASILAILGLAFFCGAALAARDLNDDSAMVARHEQWMVQYSRVYKDTTEKARRFEVFK 62
Query: 73 ENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYT--GYKMPSPSHRSTTSSTFKYQN 130
N+++IE N GNR + LG N+F+DLTNDEFRA T G+K PSP ST F+Y+N
Sbjct: 63 ANVKFIESFNAGGNRKFWLGVNQFADLTNDEFRATKTNKGFK-PSPVKVSTG---FRYEN 118
Query: 131 LSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVD 190
+S+ +P ++DWR K AVTPIKDQ +C EGI KIS LI LSEQ+LVD
Sbjct: 119 VSVDALPATIDWRTKGAVTPIKDQGQC------------EGIVKISTGKLISLSEQELVD 166
Query: 191 CSTNG-NNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEV 249
C +G + GC GG M+ AF++II+N G+ TE YPY A G C + +AA + +E+V
Sbjct: 167 CDVHGEDQGCEGGLMDDAFKFIIKNGGLTTESSYPYTAADGKCKSGSNSAAT-VKGFEDV 225
Query: 250 PSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGA 309
P+ DE AL+KAV+ QPVS+ + F+ Y G+ G CGT LDH + +G+G T DG
Sbjct: 226 PANDEAALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYGQTSDGT 285
Query: 310 NYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYP 346
YWL+KNSWG TWG+ GY+++ +D G+CG+ + SYP
Sbjct: 286 KYWLLKNSWGTTWGENGYLRMEKDISDKRGMCGLAMEPSYP 326
>gi|297816028|ref|XP_002875897.1| hypothetical protein ARALYDRAFT_347926 [Arabidopsis lyrata subsp.
lyrata]
gi|297321735|gb|EFH52156.1| hypothetical protein ARALYDRAFT_347926 [Arabidopsis lyrata subsp.
lyrata]
Length = 361
Score = 312 bits (799), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 156/344 (45%), Positives = 219/344 (63%), Gaps = 19/344 (5%)
Query: 16 IPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHG--RSYKDELEKEMRFKIFKE 73
I +F ++IL +C E+ + +++++W + H RS E+E RF +F+
Sbjct: 7 IFLFSLVILETACGFDYEDKEIESEEGLSKLYDRWRSHHSVPRSLH---EREKRFNVFRH 63
Query: 74 NLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHR-----STTSSTFKY 128
N+ ++ +NK+ NR+YKL N+F+DLT EF+ YTG K+ HR S F Y
Sbjct: 64 NVMHVHNSNKK-NRSYKLKLNKFADLTIHEFKNAYTGSKIKH--HRMLQGPKRGSKQFMY 120
Query: 129 QNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQL 188
+ +++ +P+S+DWR K AVT IK+Q +CG CWAFS VAAVEGI KI L+ LSEQ+L
Sbjct: 121 DHENVSKLPSSVDWRKKGAVTEIKNQGKCGSCWAFSTVAAVEGINKIKTNKLVSLSEQEL 180
Query: 189 VDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQ-KAAAAKISNYE 247
VDC TN N GC GG ME AFE+I +N GI TED YPY+ + G C A++ I +E
Sbjct: 181 VDCDTNQNEGCNGGLMEIAFEFIKKNGGITTEDSYPYEGIDGKCDASKDNGVLVTIDGHE 240
Query: 248 EVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTED 307
VP DE ALLKAV+ QPVS+ I A +++F+ Y EG+F G CGT+L+H V VG+G ++
Sbjct: 241 NVPENDENALLKAVANQPVSVAIDAGSSDFQFYSEGVFTGDCGTELNHGVATVGYG-SQG 299
Query: 308 GANYWLIKNSWGDTWGDAGYMKILR----DEGLCGIGTQSSYPL 347
G YW+++NSWG WG+ GY+KI R EG CGI ++SYP+
Sbjct: 300 GKKYWIVRNSWGTEWGEGGYIKIERGIDEPEGRCGIAMEASYPI 343
>gi|62526575|gb|AAX84673.1| cysteine protease CP1 [Manihot esculenta]
Length = 467
Score = 312 bits (799), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 158/347 (45%), Positives = 223/347 (64%), Gaps = 20/347 (5%)
Query: 18 MFIIIILLVSCAS----QVVSSRSTH--------EQSVVEMHEKWMAQHGRSYKDELEKE 65
MF+++ L + +S ++S TH + V+ ++E+W+ + G+ Y E+E
Sbjct: 11 MFVLLFLSFTLSSASDMSIISYDQTHATKSSWRTDDEVMAIYEEWLVKQGKVYNALGERE 70
Query: 66 MRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSST 125
RF++FK+NL +I++ N E NRTYKLG N F+DLTN+E+R+ Y G + +R +S
Sbjct: 71 KRFQVFKDNLRFIDEHNSE-NRTYKLGLNGFADLTNEEYRSTYLGARGGMKRNRLRKTSD 129
Query: 126 FKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSE 185
+Y +P S+DWR + AV +KDQ CG CWAFS +AAVEGI KI +LI LSE
Sbjct: 130 -RYAPRVGESLPDSVDWRKEGAVAEVKDQGSCGSCWAFSTIAAVEGINKIVTGDLISLSE 188
Query: 186 QQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQK-AAAAKIS 244
Q+LVDC T+ N GC GG M+ AFE+II N GI TE++YPY A G C +K A I
Sbjct: 189 QELVDCDTSYNEGCNGGLMDYAFEFIINNGGIDTEEDYPYLARDGRCDTYRKNAKVVTID 248
Query: 245 NYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGT 304
+YE+VP E AL KAV+ QPVS+ I A +F+ Y GIF+G CGTQLDH V VG+G
Sbjct: 249 DYEDVPVNSETALQKAVANQPVSVAIEAGGRDFQFYASGIFSGRCGTQLDHGVAAVGYG- 307
Query: 305 TEDGANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYPL 347
TE+G +YW+++NSWG +WG+ GY+++ R G+CGI ++SYP+
Sbjct: 308 TENGKDYWIVRNSWGKSWGENGYLRMARSINSPTGICGIAMEASYPI 354
>gi|125533982|gb|EAY80530.1| hypothetical protein OsI_35710 [Oryza sativa Indica Group]
Length = 378
Score = 311 bits (798), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 148/330 (44%), Positives = 209/330 (63%), Gaps = 22/330 (6%)
Query: 38 THEQSVVEMHEKWMAQH--------GRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTY 89
+ E+S+ ++E+W +++ G D+ E RF +F EN YI +AN+ G R +
Sbjct: 33 SSEESLRALYERWRSRYTVSRPAASGGVGNDDGEARRRFNVFVENARYIHEANRRGGRPF 92
Query: 90 KLGTNRFSDLTNDEFRALYTGYKMPSPSHRS------TTSSTFKYQNLSMTDVPTSLDWR 143
+L N+F+D+T DEFR Y G + + HRS +F+Y ++P ++DWR
Sbjct: 93 RLALNKFADMTTDEFRRTYAGSR--ARHHRSLRGGRGGEGGSFRYGGDDEDNLPPAVDWR 150
Query: 144 DKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGT 203
++ AVT IKDQ +CG CWAFSAVAAVEG+ KI L+ LSEQ+LVDC T N GC GG
Sbjct: 151 ERGAVTGIKDQGQCGSCWAFSAVAAVEGVNKIKTGRLVTLSEQELVDCDTGDNQGCDGGL 210
Query: 204 MEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAA-AKISNYEEVPSGDEQALLKAVS 262
M+ AF++I +N GI TE YPY+A QG C+ A+ ++ I YE+VP+ DE AL KAV+
Sbjct: 211 MDYAFQFIKRNGGITTESNYPYRAEQGRCNKAKASSHDVTIDGYEDVPANDESALQKAVA 270
Query: 263 MQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTW 322
QPV++ + A +F+ Y EG+F G CGT LDH V VG+G T DG YW++KNSWG+ W
Sbjct: 271 NQPVAVAVEASGQDFQFYSEGVFTGECGTDLDHGVAAVGYGITRDGTKYWIVKNSWGEDW 330
Query: 323 GDAGYMKILR-----DEGLCGIGTQSSYPL 347
G+ GY+++ R GLCGI ++SYP+
Sbjct: 331 GERGYIRMQRGVSSDSNGLCGIAMEASYPV 360
>gi|255568345|ref|XP_002525147.1| cysteine protease, putative [Ricinus communis]
gi|223535606|gb|EEF37274.1| cysteine protease, putative [Ricinus communis]
Length = 347
Score = 311 bits (797), Expect = 3e-82, Method: Compositional matrix adjust.
Identities = 151/307 (49%), Positives = 208/307 (67%), Gaps = 13/307 (4%)
Query: 47 HEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRA 106
++KW+ Q+GR Y + E +RF I+ N+++IE N + N ++KL N+F+DLTNDEF +
Sbjct: 46 YDKWLEQYGRKYDTKDEYLLRFGIYHSNIQFIEYINSQ-NLSFKLTDNKFADLTNDEFNS 104
Query: 107 LYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAV 166
+Y GY++ RS + + + TD+P ++DWR+ AVTPIKDQ +CG CWAFSAV
Sbjct: 105 IYLGYQI-----RSYKRRNLSHMHENSTDLPDAVDWRENGAVTPIKDQGQCGSCWAFSAV 159
Query: 167 AAVEGITKISGANLIQLSEQQLVDCSTNGNN-GCGGGTMEKAFEYIIQNQGIATEDEYPY 225
AAVEGI KI NL+ LSEQ+LVDC NG+N GC GG MEKAF +I G+ TE++YPY
Sbjct: 160 AAVEGINKIKTGNLVSLSEQELVDCDVNGDNKGCNGGFMEKAFTFIKSIGGLTTENDYPY 219
Query: 226 QAVQGTCSAAQ-KAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGI 284
+ G+C A+ A I YE VP+ +E +L AVS QPVS+ I A EF+ Y EG+
Sbjct: 220 KGTDGSCEKAKTDNHAVIIGGYETVPANNENSLKVAVSKQPVSVAIDASGYEFQLYSEGV 279
Query: 285 FNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIG 340
F+G CG QL+H VTIVG+G +G YWL+KNSWG WG++GY+++ RD +G+CGI
Sbjct: 280 FSGYCGIQLNHGVTIVGYGDN-NGQKYWLVKNSWGKGWGESGYIRMKRDSSDTKGMCGIA 338
Query: 341 TQSSYPL 347
+ SYP+
Sbjct: 339 MEPSYPI 345
>gi|413919736|gb|AFW59668.1| cysteine protease 1 [Zea mays]
Length = 469
Score = 311 bits (797), Expect = 3e-82, Method: Compositional matrix adjust.
Identities = 152/324 (46%), Positives = 208/324 (64%), Gaps = 12/324 (3%)
Query: 32 VVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKAN---KEGNRT 88
+VS E+ M+ +WMA HGR+Y E+E RF++F++NL Y++ N G +
Sbjct: 31 IVSYGERSEEEARRMYAEWMAAHGRTYNAVGEEERRFEVFRDNLRYVDAHNAAADAGVHS 90
Query: 89 YKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAV 148
++LG NRF+DLTNDE+RA Y G + R +Y D+P S+DWR K AV
Sbjct: 91 FRLGLNRFADLTNDEYRATYLGVRSRPQRERRLGD---RYLAGDNEDLPESVDWRAKGAV 147
Query: 149 TPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAF 208
+KDQ CG CWAFS +AAVEGI +I ++I LSEQ+LVDC T+ N GC GG M+ AF
Sbjct: 148 AEVKDQGSCGSCWAFSTIAAVEGINQIVTGDMISLSEQELVDCDTSYNQGCNGGLMDYAF 207
Query: 209 EYIIQNQGIATEDEYPYQAVQGTCSAAQK-AAAAKISNYEEVPSGDEQALLKAVSMQPVS 267
E+II N GI TE++YPY+ G C +K A I +YE+VP+ E++L KAV+ QP+S
Sbjct: 208 EFIINNGGIDTEEDYPYKGTDGRCDVNRKNAKVVTIDSYEDVPANSEKSLQKAVANQPIS 267
Query: 268 IGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGY 327
+ I A F+ Y GIF G CGT LDH VT VG+G TE+G +YW++KNSWG +WG++GY
Sbjct: 268 VAIEAGGRAFQLYNSGIFTGTCGTALDHGVTAVGYG-TENGKDYWIVKNSWGSSWGESGY 326
Query: 328 MKILRD----EGLCGIGTQSSYPL 347
+++ R+ G CGI + SYPL
Sbjct: 327 VRMERNIKASSGKCGIAVEPSYPL 350
>gi|40806500|gb|AAR92155.1| putative cysteine protease 2 [Iris x hollandica]
Length = 359
Score = 311 bits (796), Expect = 3e-82, Method: Compositional matrix adjust.
Identities = 150/315 (47%), Positives = 206/315 (65%), Gaps = 11/315 (3%)
Query: 40 EQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDL 99
E+S+ ++E+W + H S +D EK RF +FKEN ++I + NK+ + YKLG N+F+D+
Sbjct: 33 EESLWGLYERWRSHHTVS-RDLSEKNKRFNVFKENAKFIHEFNKK-DAPYKLGLNKFADM 90
Query: 100 TNDEFRALYTGYKMPSP-SHRSTTSST--FKYQNLSMTDVPTSLDWRDKKAVTPIKDQQE 156
TN EFR+ Y G K+ + R T +T F Y+N+ +P S+DWR + AV P+KDQ +
Sbjct: 91 TNQEFRSTYAGSKIHHHRTQRGTPRATGSFMYENVH--SIPASVDWRTQGAVAPVKDQGQ 148
Query: 157 CGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQG 216
CG CWAFS +A+VEGI KI L+ LS QQLVDC T+ N GC GG M+ AFE+I N G
Sbjct: 149 CGSCWAFSTIASVEGINKIKTNQLVPLSGQQLVDCDTDQNEGCNGGLMDYAFEFIKSNGG 208
Query: 217 IATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTE 276
I +E YPY A QG+C++ A I YE+VP+ +E AL+KAV+ Q VS+ I A
Sbjct: 209 ITSESAYPYTAEQGSCASESSAPVVTIDGYEDVPANNEAALMKAVANQVVSVAIEASGMA 268
Query: 277 FKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILR---- 332
F+ Y EG+F G CG +LDH V +VG+G T DG YW+++NSWG WG+ GY+++ R
Sbjct: 269 FQFYSEGVFTGSCGNELDHGVAVVGYGATRDGTKYWIVRNSWGAEWGEKGYIRMQRGIRA 328
Query: 333 DEGLCGIGTQSSYPL 347
GLCGI + SYPL
Sbjct: 329 RHGLCGIAMEPSYPL 343
>gi|242092700|ref|XP_002436840.1| hypothetical protein SORBIDRAFT_10g009830 [Sorghum bicolor]
gi|241915063|gb|EER88207.1| hypothetical protein SORBIDRAFT_10g009830 [Sorghum bicolor]
Length = 328
Score = 311 bits (796), Expect = 4e-82, Method: Compositional matrix adjust.
Identities = 154/342 (45%), Positives = 215/342 (62%), Gaps = 26/342 (7%)
Query: 15 TIPMFIIIILLVS--CASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFK 72
TI I+ IL ++ C + + + + ++V HE+WM Q+ R YKD EK RF++FK
Sbjct: 3 TIKASILAILGLAFFCGAALAARDLNDDSAMVARHEQWMVQYSRVYKDTTEKARRFEVFK 62
Query: 73 ENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYT--GYKMPSPSHRSTTSSTFKYQN 130
N+++IE N GNR + LG N+F+DLTNDEFRA T G+K PSP T F+Y+N
Sbjct: 63 ANVKFIESFNAGGNRKFWLGVNQFADLTNDEFRATKTNKGFK-PSPVKVPTG---FRYEN 118
Query: 131 LSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVD 190
+S+ +P ++DWR K AVTPIKDQ +C EGI KIS LI LSEQ+LVD
Sbjct: 119 VSVDALPATIDWRTKGAVTPIKDQGQC------------EGIVKISTGKLISLSEQELVD 166
Query: 191 CSTNG-NNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEV 249
C +G + GC GG M+ AF++II+N G+ TE YPY A G C + +AA + +E+V
Sbjct: 167 CDVHGEDQGCEGGLMDDAFQFIIKNGGLTTESSYPYTAADGKCKSGSNSAAT-VKGFEDV 225
Query: 250 PSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGA 309
P+ DE AL+KAV+ QPVS+ + F+ Y G+ G CGT LDH + +G+G T DG
Sbjct: 226 PANDEAALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYGQTSDGT 285
Query: 310 NYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYPL 347
YWL+KNSWG TWG+ GY+++ +D G+CG+ + SYP+
Sbjct: 286 KYWLLKNSWGTTWGENGYLRMEKDISDKRGMCGLAMEPSYPI 327
>gi|157093728|gb|ABV22590.1| KDEL-tailed cysteine endopeptidase [Solanum lycopersicum]
Length = 360
Score = 311 bits (796), Expect = 4e-82, Method: Compositional matrix adjust.
Identities = 153/318 (48%), Positives = 204/318 (64%), Gaps = 16/318 (5%)
Query: 40 EQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDL 99
E+ E++E+W + H S + EK RF +FK N+ Y+ NK+ ++ YKL N+F+D+
Sbjct: 31 EEKFWELYERWRSHHTVSRSLD-EKHKRFNVFKANVHYVHNFNKK-DKPYKLKLNKFADM 88
Query: 100 TNDEFRALYTGYKMPSPSHR-----STTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQ 154
TN EFR Y G K+ HR S + TF Y N +VP S+DWR K AVTP+KDQ
Sbjct: 89 TNHEFRQHYAGSKIKH--HRTLLGASRANGTFMYANED--NVPPSIDWRKKGAVTPVKDQ 144
Query: 155 QECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQN 214
+CG CWAFS V AVEGI +I L+ LSEQ+LVDC T N GC GG M+ AF++I +
Sbjct: 145 GQCGSCWAFSTVVAVEGINQIKTKKLVSLSEQELVDCDTTENQGCNGGLMDPAFDFIKKR 204
Query: 215 QGIATEDEYPYQAVQGTCSAAQK-AAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAY 273
GI TE+ YPY+A C ++ I +E+VP DE ALLKAV+ QP+S+ I A
Sbjct: 205 GGITTEERYPYKAEDDKCDIQKRNTPVVSIDGHEDVPPNDEDALLKAVANQPISVAIDAS 264
Query: 274 TTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILR- 332
++F+ Y EG+F G CGT+LDH V IVG+GTT DG YW++KNSWG WG+ GY+++ R
Sbjct: 265 GSQFQFYSEGVFTGECGTELDHGVAIVGYGTTVDGTKYWIVKNSWGAGWGEKGYIRMQRK 324
Query: 333 ---DEGLCGIGTQSSYPL 347
+EGLCGI Q SYP+
Sbjct: 325 VDAEEGLCGIAMQPSYPI 342
>gi|116781957|gb|ABK22314.1| unknown [Picea sitchensis]
Length = 369
Score = 311 bits (796), Expect = 4e-82, Method: Compositional matrix adjust.
Identities = 154/315 (48%), Positives = 207/315 (65%), Gaps = 12/315 (3%)
Query: 40 EQSVVEMHEKWMAQHGRSYK-DELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSD 98
E+S+ +++ W QH S D E RF+IFKEN++YI+ NK+ + YKLG N+F+D
Sbjct: 39 EKSLRSLYDNWALQHRSSRSLDSEEHAERFEIFKENVKYIDSVNKK-DSPYKLGLNKFAD 97
Query: 99 LTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECG 158
L+N+EF+A+Y G KM R S +F YQN +P S+DWR K AV +K+Q CG
Sbjct: 98 LSNEEFKAIYMGTKMDLRGDREVQSGSFMYQN--SEPLPASIDWRQKGAVAAVKNQGHCG 155
Query: 159 CCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIA 218
CWAFS VA+VEGI I+ NL+ LSEQQLVDCST N+GC GG M+ AF+YII N GI
Sbjct: 156 SCWAFSTVASVEGINYITTGNLVSLSEQQLVDCSTE-NSGCNGGLMDTAFQYIINNGGIV 214
Query: 219 TEDEYPYQAVQGTCSAAQ---KAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTT 275
TED YPY A CS+ + + I +E+VP+ +EQAL +AV+ QPVS+ I A
Sbjct: 215 TEDNYPYTAEATECSSTKINSQTTRVVIDGFEDVPANNEQALKEAVAHQPVSVAIEASGQ 274
Query: 276 EFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD-- 333
+F+ Y G+F G CGT LDH V VG+GT+ +G NYW+++NSWG WG+ GY+++ +
Sbjct: 275 DFQFYSTGVFTGKCGTALDHGVVAVGYGTSPEGINYWIVRNSWGPKWGEEGYIRMQQGIE 334
Query: 334 --EGLCGIGTQSSYP 346
EG CGI Q+SYP
Sbjct: 335 AAEGKCGIAMQASYP 349
>gi|118486542|gb|ABK95110.1| unknown [Populus trichocarpa]
Length = 465
Score = 311 bits (796), Expect = 4e-82, Method: Compositional matrix adjust.
Identities = 156/329 (47%), Positives = 210/329 (63%), Gaps = 17/329 (5%)
Query: 32 VVSSRSTH--------EQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANK 83
++S TH + V+ M+E+W+ +HG++Y EKE RF+IFK+NL +I++ N
Sbjct: 28 IISYHQTHATKSSWRTDDEVMAMYEEWLVKHGKNYNALGEKEKRFEIFKDNLMFIDQHNS 87
Query: 84 EGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWR 143
E NRTY +G NRF+DLTN+EFR++Y G + TS +Y +P S+DWR
Sbjct: 88 E-NRTYTVGLNRFADLTNEEFRSMYLGTRTGHKKRLPKTSD--RYAPRVGDSLPDSVDWR 144
Query: 144 DKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGT 203
+ AV +KDQ CG CWAFS +AAVEGI KI +LI LSEQ+LVDC T+ N GC GG
Sbjct: 145 KEGAVAEVKDQGGCGSCWAFSTIAAVEGINKIVTGDLIALSEQELVDCDTSYNEGCNGGL 204
Query: 204 MEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQK-AAAAKISNYEEVPSGDEQALLKAVS 262
M+ AFE+II N GI TED+YPY G C +K A I +YE+VP DE AL KAV+
Sbjct: 205 MDYAFEFIINNGGIDTEDDYPYLGRDGRCDTYRKNAKVVSIDSYEDVPENDETALKKAVA 264
Query: 263 MQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTW 322
QPVS+ I F+ Y G+F G CGT LDH V VG+G TE G +YW+++NSWG +W
Sbjct: 265 NQPVSVAIEGGGRNFQLYNSGVFTGECGTSLDHGVAAVGYG-TEKGKDYWIVRNSWGKSW 323
Query: 323 GDAGYMKILRD----EGLCGIGTQSSYPL 347
G++GY+++ R+ G CGI + SYP+
Sbjct: 324 GESGYIRMERNIASPTGKCGIAIEPSYPI 352
>gi|42567068|ref|NP_567686.2| putative cysteine proteinase [Arabidopsis thaliana]
gi|332659371|gb|AEE84771.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 356
Score = 311 bits (796), Expect = 4e-82, Method: Compositional matrix adjust.
Identities = 159/344 (46%), Positives = 230/344 (66%), Gaps = 20/344 (5%)
Query: 18 MFIIIILLVSCASQVVSSRST---HEQSVVEMH---EKWMAQHGRSYKDEL-EKEMRFKI 70
+F++I+ ++S S + +T H +S E+ + WM++HG++Y + L EKE RF+
Sbjct: 12 LFLLIVFVLSAPSSAMDLPATSGGHNRSNEEVEFIFQMWMSKHGKTYTNALGEKERRFQN 71
Query: 71 FKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQN 130
FK+NL +I++ N + N +Y+LG RF+DLT E+R L+ G P P R+ +S +Y
Sbjct: 72 FKDNLRFIDQHNAK-NLSYQLGLTRFADLTVQEYRDLFPG--SPKPKQRNLKTSR-RYVP 127
Query: 131 LSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVD 190
L+ +P S+DWR + AV+ IKDQ C CWAFS VAAVEG+ KI LI LSEQ+LVD
Sbjct: 128 LAGDQLPESVDWRQEGAVSEIKDQGTCNSCWAFSTVAAVEGLNKIVTGELISLSEQELVD 187
Query: 191 CSTNGNNGC-GGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAA--AAKISNYE 247
C+ NNGC G G M+ AF+++I N G+ +E +YPYQ QG+C+ Q + I +YE
Sbjct: 188 CNL-VNNGCYGSGLMDTAFQFLINNNGLDSEKDYPYQGTQGSCNRKQSTSNKVITIDSYE 246
Query: 248 EVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTED 307
+VP+ DE +L KAV+ QPVS+G+ + EF Y+ I+NG CGT LDHA+ IVG+G +E+
Sbjct: 247 DVPANDEISLQKAVAHQPVSVGVDKKSQEFMLYRSCIYNGPCGTNLDHALVIVGYG-SEN 305
Query: 308 GANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYPL 347
G +YW+++NSWG TWGDAGY+KI R+ +GLCGI +SYP+
Sbjct: 306 GQDYWIVRNSWGTTWGDAGYIKIARNFEDPKGLCGIAMLASYPI 349
>gi|255538210|ref|XP_002510170.1| cysteine protease, putative [Ricinus communis]
gi|223550871|gb|EEF52357.1| cysteine protease, putative [Ricinus communis]
Length = 469
Score = 311 bits (796), Expect = 4e-82, Method: Compositional matrix adjust.
Identities = 155/332 (46%), Positives = 222/332 (66%), Gaps = 19/332 (5%)
Query: 32 VVSSRSTH--------EQSVVEMHEKWMAQHGRSYKDEL---EKEMRFKIFKENLEYIEK 80
+VS TH + V+ ++E+W+ ++G+++ + EKE RF++FK+NL +I++
Sbjct: 28 IVSYDQTHLTKSSWRTDDEVMAIYEEWLVKNGKAHSNNNALGEKERRFQVFKDNLRFIDE 87
Query: 81 ANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSL 140
N E NR+YK+G NRF+DLTN+E+R++Y G + + +R + SS +Y +P S+
Sbjct: 88 HNSE-NRSYKVGLNRFADLTNEEYRSMYLGARSGAKRNRLSRSSN-RYLPRVGDSLPDSV 145
Query: 141 DWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCG 200
DWR + AV +KDQ CG CWAFS +AAVEGI KI +LI LSEQ+LVDC + N GC
Sbjct: 146 DWRKEGAVAEVKDQGSCGSCWAFSTIAAVEGINKIVTGDLISLSEQELVDCDRSYNEGCN 205
Query: 201 GGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQK-AAAAKISNYEEVPSGDEQALLK 259
GG M+ AF++II N GI +E++YPY A GTC +K A I NYE+VP DE+AL K
Sbjct: 206 GGLMDYAFQFIINNGGIDSEEDYPYLARDGTCDTYRKNAKVVTIDNYEDVPVNDEKALQK 265
Query: 260 AVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWG 319
AV+ QPVS+ I A EF+ Y+ GIF G CGT LDH V VG+G TE+G +YW+++NSWG
Sbjct: 266 AVANQPVSVAIEAGGREFQFYQSGIFTGRCGTALDHGVAAVGYG-TENGKDYWIVRNSWG 324
Query: 320 DTWGDAGYMKILRD----EGLCGIGTQSSYPL 347
+WG++GY+++ R+ G CGI + SYP+
Sbjct: 325 KSWGESGYIRMERNIATATGKCGIAIEPSYPI 356
>gi|544129|sp|P25803.2|CYSEP_PHAVU RecName: Full=Vignain; AltName: Full=Bean endopeptidase; AltName:
Full=Cysteine proteinase EP-C1; Flags: Precursor
gi|20994|emb|CAA44816.1| endopeptidase [Phaseolus vulgaris]
Length = 362
Score = 311 bits (796), Expect = 4e-82, Method: Compositional matrix adjust.
Identities = 157/341 (46%), Positives = 211/341 (61%), Gaps = 14/341 (4%)
Query: 19 FIIIILLVSCASQVVSSRSTH------EQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFK 72
+ ++L S V +S H E+S+ +++E+W + H S + EK RF +FK
Sbjct: 6 LLWVVLSFSLVLGVANSFDFHDKDLASEESLWDLYERWRSHHTVS-RSLGEKHKRFNVFK 64
Query: 73 ENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPS-HRSTTSSTFKYQNL 131
NL ++ NK ++ YKL N+F+D+TN EFR+ Y G K+ P R T +
Sbjct: 65 ANLMHVHNTNKM-DKPYKLKLNKFADMTNHEFRSTYAGSKVNHPRMFRGTPHENGAFMYE 123
Query: 132 SMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDC 191
+ VP S+DWR K AVT +KDQ +CG CWAFS V AVEGI +I L+ LSEQ+LVDC
Sbjct: 124 KVVSVPPSVDWRKKGAVTDVKDQGQCGSCWAFSTVVAVEGINQIKTNKLVALSEQELVDC 183
Query: 192 STNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQ-KAAAAKISNYEEVP 250
N GC GG ME AFE+I Q GI TE YPY+A +GTC A++ A I +E VP
Sbjct: 184 DKEENQGCNGGLMESAFEFIKQKGGITTESNYPYKAQEGTCDASKVNDLAVSIDGHENVP 243
Query: 251 SGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGAN 310
+ DE ALLKAV+ QPVS+ I A ++F+ Y EG+F G C T L+H V IVG+GTT DG N
Sbjct: 244 ANDEDALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGDCSTDLNHGVAIVGYGTTVDGTN 303
Query: 311 YWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYPL 347
YW+++NSWG WG+ GY+++ R+ EGLCGI SYP+
Sbjct: 304 YWIVRNSWGPEWGEHGYIRMQRNISKKEGLCGIAMLPSYPI 344
>gi|356559055|ref|XP_003547817.1| PREDICTED: cysteine proteinase RD21a [Glycine max]
Length = 366
Score = 310 bits (795), Expect = 5e-82, Method: Compositional matrix adjust.
Identities = 150/335 (44%), Positives = 208/335 (62%), Gaps = 7/335 (2%)
Query: 19 FIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYI 78
+ + +SCA + + + V+ M+E+W+ +H + Y EK+ RF++FK+NL +I
Sbjct: 12 LLFLSFTLSCAIDTSTITNYTDNEVMTMYEEWLVKHQKVYNGLREKDKRFQVFKDNLGFI 71
Query: 79 EKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSST-FKYQNLSMTDVP 137
++ N N TYKLG N+F+D+TN+E+R +Y G K + T ST +Y + +P
Sbjct: 72 QEHNNNQNNTYKLGLNQFADMTNEEYRVMYFGTKSDAKRRLMKTKSTGHRYAYSAGDRLP 131
Query: 138 TSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNN 197
+DWR K AV PIKDQ CG CWAFS VA VE I KI + LSEQ+LVDC N
Sbjct: 132 VHVDWRVKGAVAPIKDQGSCGSCWAFSTVATVEAINKIVTGKFVSLSEQELVDCDRAYNE 191
Query: 198 GCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQK-AAAAKISNYEEVPSGDEQA 256
GC GG M+ AFE+IIQN GI T+ +YPY+ G C +K A I +E+VP DE A
Sbjct: 192 GCNGGLMDYAFEFIIQNGGIDTDKDYPYRGFDGICDPTKKNAKVVNIDGFEDVPPYDENA 251
Query: 257 LLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKN 316
L KAV+ QPVSI I A + + Y+ G+F G CGT LDH V +VG+G +E+G +YWL++N
Sbjct: 252 LKKAVAHQPVSIAIEASGRDLQLYQSGVFTGKCGTSLDHGVVVVGYG-SENGVDYWLVRN 310
Query: 317 SWGDTWGDAGYMKILRD----EGLCGIGTQSSYPL 347
SWG WG+ GY K+ R+ G CGI ++SYP+
Sbjct: 311 SWGTGWGEDGYFKMQRNVRTPTGKCGITMEASYPV 345
>gi|50355617|dbj|BAD29957.1| cysteine protease [Daucus carota]
Length = 437
Score = 310 bits (795), Expect = 5e-82, Method: Compositional matrix adjust.
Identities = 147/313 (46%), Positives = 209/313 (66%), Gaps = 6/313 (1%)
Query: 40 EQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDL 99
+ V+ M+ W+ +HG+SY EKE RF+IFK+NL YI+ N + +R+Y+LG NRF+DL
Sbjct: 42 DDEVMTMYNSWLVKHGKSYNALGEKETRFQIFKDNLRYIDNHNADPDRSYELGLNRFADL 101
Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGC 159
TN+E+RA Y G K + + + +Y + ++P S+DWR+K AV +KDQ CG
Sbjct: 102 TNEEYRAKYLGTKSRESRPKLSKGPSDRYAPVEGEELPDSIDWREKGAVAAVKDQGSCGS 161
Query: 160 CWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIAT 219
CWAFSA+ AVEGI +I+ LI LSEQ+LVDC + N GC GG M+ AF +II+N GI +
Sbjct: 162 CWAFSAIGAVEGINQITTGELITLSEQELVDCDRSYNEGCEGGLMDYAFNFIIKNGGIDS 221
Query: 220 EDEYPYQAVQGTCSA-AQKAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFK 278
+ +YPY GTC+ + A I +YE+VP DE+AL KA + QP+S+ I A +F+
Sbjct: 222 DLDYPYTGRDGTCNQNKENAKVVTIDSYEDVPVYDEKALQKAAANQPISVAIEAGGMDFQ 281
Query: 279 SYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD----E 334
Y GIF G CGT +DH V +VG+G +E+G +YW+++NSWG WG+AGY+K+ R+
Sbjct: 282 LYVSGIFTGKCGTAVDHGVVVVGYG-SEEGMDYWIVRNSWGAAWGEAGYLKMQRNVGKSS 340
Query: 335 GLCGIGTQSSYPL 347
GLCGI + SYP+
Sbjct: 341 GLCGITIEPSYPV 353
>gi|326502440|dbj|BAJ95283.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 349
Score = 310 bits (795), Expect = 5e-82, Method: Compositional matrix adjust.
Identities = 153/317 (48%), Positives = 215/317 (67%), Gaps = 9/317 (2%)
Query: 38 THEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFS 97
T + ++V HEKWMA+HGR+Y +E EK R ++F+ N + I+ N + T++L TNRF+
Sbjct: 35 TVDSAMVSRHEKWMAEHGRTYANEEEKARRLEVFRANAKLIDSFNSAEDSTHRLATNRFA 94
Query: 98 DLTNDEFRALYTGYKMP--SPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQ 155
DLT++EFRA TG + P + + + + F+Y+N S+ D S+DWR AVT +KDQ
Sbjct: 95 DLTDEEFRAARTGLRRPPAAAAGAGSGAGGFRYENFSLADAAGSMDWRAMGAVTGVKDQG 154
Query: 156 ECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNN-GCGGGTMEKAFEYIIQN 214
CGCCWAFSAVAAVEG+TKI L+ LSEQQLVDC G++ GC GG M+ AFEY+I
Sbjct: 155 SCGCCWAFSAVAAVEGLTKIRTGRLVSLSEQQLVDCDVYGDDEGCAGGLMDNAFEYMINR 214
Query: 215 QGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYT 274
G+ TE YPY+ G+C + A+AA I YE+VP+ +E AL+ AV+ QPVS+ I
Sbjct: 215 GGLTTESSYPYRGTDGSCR--RSASAASIRGYEDVPANNEAALMAAVAHQPVSVAINGGD 272
Query: 275 TEFKSYKEGIFNGV-CGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKI--- 330
+ F+ Y G+ G CGT+L+HA+T VG+GT DG YW++KNSWG +WG+ GY++I
Sbjct: 273 SVFRFYDSGVLGGSGCGTELNHAITAVGYGTASDGTKYWIMKNSWGGSWGEGGYVRIRRG 332
Query: 331 LRDEGLCGIGTQSSYPL 347
+R EG+CG+ +SYP+
Sbjct: 333 VRGEGVCGLAQLASYPV 349
>gi|357437715|ref|XP_003589133.1| Cysteine proteinase [Medicago truncatula]
gi|87240770|gb|ABD32628.1| Granulin; Peptidase C1A, papain [Medicago truncatula]
gi|355478181|gb|AES59384.1| Cysteine proteinase [Medicago truncatula]
Length = 474
Score = 310 bits (795), Expect = 5e-82, Method: Compositional matrix adjust.
Identities = 158/355 (44%), Positives = 229/355 (64%), Gaps = 28/355 (7%)
Query: 18 MFIIIILLVSCAS-------QVVSSRSTH---------EQSVVEMHEKWMAQHGRSYKDE 61
M ++I+L++S + ++S TH + V+ M+E+W+ +HG+SY
Sbjct: 10 MKLMIVLIISSFTVSLALDMSIISYDKTHPDKSTSKRTNKEVLTMYEEWLVKHGKSYNGL 69
Query: 62 LEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRST 121
EK+ RF+IFK+NL++I++ N N TY+LG RF+DLTN+E+R+ + G K+ P+ R
Sbjct: 70 GEKDKRFEIFKDNLKFIDEHNGL-NSTYRLGLTRFADLTNEEYRSKFLGTKI-DPNRRMK 127
Query: 122 T---SSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGA 178
S + +Y +P S+DWR + AV +KDQ CG CWAFSA+AAVEGI KI
Sbjct: 128 KLGGSKSNRYAPRVGDKLPESVDWRKEGAVVGVKDQASCGSCWAFSAIAAVEGINKIVTG 187
Query: 179 NLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQK- 237
+LI LSEQ+LVDC T+ N GC GG M+ AFE+II N GI +ED+YPY+AV G C +K
Sbjct: 188 DLISLSEQELVDCDTSYNEGCNGGLMDYAFEFIISNGGIDSEDDYPYKAVDGRCDQNRKN 247
Query: 238 AAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAV 297
A I +YE+VP+ DE AL KAV+ QP+++ + EF+ Y+ G+F G CGT LDH V
Sbjct: 248 AKVVTIDDYEDVPAYDELALQKAVANQPIAVAVEGGGREFQLYEYGVFTGRCGTALDHGV 307
Query: 298 TIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD-----EGLCGIGTQSSYPL 347
VG+G TE+G +YW+++NSWG +WG+ GY+++ R+ G CGI + SYP+
Sbjct: 308 AAVGYG-TENGKDYWIVRNSWGGSWGEQGYIRLERNLASSRAGKCGIAIEPSYPI 361
>gi|357437719|ref|XP_003589135.1| Cysteine proteinase [Medicago truncatula]
gi|355478183|gb|AES59386.1| Cysteine proteinase [Medicago truncatula]
Length = 457
Score = 310 bits (795), Expect = 5e-82, Method: Compositional matrix adjust.
Identities = 158/355 (44%), Positives = 229/355 (64%), Gaps = 28/355 (7%)
Query: 18 MFIIIILLVSCAS-------QVVSSRSTH---------EQSVVEMHEKWMAQHGRSYKDE 61
M ++I+L++S + ++S TH + V+ M+E+W+ +HG+SY
Sbjct: 10 MKLMIVLIISSFTVSLALDMSIISYDKTHPDKSTSKRTNKEVLTMYEEWLVKHGKSYNGL 69
Query: 62 LEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRST 121
EK+ RF+IFK+NL++I++ N N TY+LG RF+DLTN+E+R+ + G K+ P+ R
Sbjct: 70 GEKDKRFEIFKDNLKFIDEHNGL-NSTYRLGLTRFADLTNEEYRSKFLGTKI-DPNRRMK 127
Query: 122 T---SSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGA 178
S + +Y +P S+DWR + AV +KDQ CG CWAFSA+AAVEGI KI
Sbjct: 128 KLGGSKSNRYAPRVGDKLPESVDWRKEGAVVGVKDQASCGSCWAFSAIAAVEGINKIVTG 187
Query: 179 NLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQK- 237
+LI LSEQ+LVDC T+ N GC GG M+ AFE+II N GI +ED+YPY+AV G C +K
Sbjct: 188 DLISLSEQELVDCDTSYNEGCNGGLMDYAFEFIISNGGIDSEDDYPYKAVDGRCDQNRKN 247
Query: 238 AAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAV 297
A I +YE+VP+ DE AL KAV+ QP+++ + EF+ Y+ G+F G CGT LDH V
Sbjct: 248 AKVVTIDDYEDVPAYDELALQKAVANQPIAVAVEGGGREFQLYEYGVFTGRCGTALDHGV 307
Query: 298 TIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD-----EGLCGIGTQSSYPL 347
VG+G TE+G +YW+++NSWG +WG+ GY+++ R+ G CGI + SYP+
Sbjct: 308 AAVGYG-TENGKDYWIVRNSWGGSWGEQGYIRLERNLASSRAGKCGIAIEPSYPI 361
>gi|115484973|ref|NP_001067630.1| Os11g0255300 [Oryza sativa Japonica Group]
gi|530335|emb|CAA56844.1| cysteine protease [Oryza sativa Japonica Group]
gi|5761322|dbj|BAA83472.1| cysteine endopeptidase [Oryza sativa Japonica Group]
gi|62732672|gb|AAX94791.1| cysteine proteinase (EC 3.4.22.-) - rice [Oryza sativa Japonica
Group]
gi|62732673|gb|AAX94792.1| cysteine proteinase (EC 3.4.22.-) - rice [Oryza sativa Japonica
Group]
gi|62732674|gb|AAX94793.1| cysteine proteinase (EC 3.4.22.-) - rice [Oryza sativa Japonica
Group]
gi|77549615|gb|ABA92412.1| Thiol protease SEN102 precursor, putative, expressed [Oryza sativa
Japonica Group]
gi|77549616|gb|ABA92413.1| Thiol protease SEN102 precursor, putative, expressed [Oryza sativa
Japonica Group]
gi|77549617|gb|ABA92414.1| Thiol protease SEN102 precursor, putative, expressed [Oryza sativa
Japonica Group]
gi|113644852|dbj|BAF27993.1| Os11g0255300 [Oryza sativa Japonica Group]
gi|125576789|gb|EAZ18011.1| hypothetical protein OsJ_33558 [Oryza sativa Japonica Group]
gi|215701098|dbj|BAG92522.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 378
Score = 310 bits (794), Expect = 6e-82, Method: Compositional matrix adjust.
Identities = 147/330 (44%), Positives = 209/330 (63%), Gaps = 22/330 (6%)
Query: 38 THEQSVVEMHEKWMAQH--------GRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTY 89
+ E+S+ ++E+W +++ G D+ E RF +F EN YI +AN+ G R +
Sbjct: 33 SSEESLRALYERWRSRYTVSRPAASGGVGNDDGEARRRFNVFVENARYIHEANRRGGRPF 92
Query: 90 KLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSS------TFKYQNLSMTDVPTSLDWR 143
+L N+F+D+T DEFR Y G + + HRS + +F+Y ++P ++DWR
Sbjct: 93 RLALNKFADMTTDEFRRTYAGSR--ARHHRSLSGGRGGEGGSFRYGGDDEDNLPPAVDWR 150
Query: 144 DKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGT 203
++ AVT IKDQ +CG CWAFS VAAVEG+ KI L+ LSEQ+LVDC T N GC GG
Sbjct: 151 ERGAVTGIKDQGQCGSCWAFSTVAAVEGVNKIKTGRLVTLSEQELVDCDTGDNQGCDGGL 210
Query: 204 MEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAA-AKISNYEEVPSGDEQALLKAVS 262
M+ AF++I +N GI TE YPY+A QG C+ A+ ++ I YE+VP+ DE AL KAV+
Sbjct: 211 MDYAFQFIKRNGGITTESNYPYRAEQGRCNKAKASSHDVTIDGYEDVPANDESALQKAVA 270
Query: 263 MQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTW 322
QPV++ + A +F+ Y EG+F G CGT LDH V VG+G T DG YW++KNSWG+ W
Sbjct: 271 NQPVAVAVEASGQDFQFYSEGVFTGECGTDLDHGVAAVGYGITRDGTKYWIVKNSWGEDW 330
Query: 323 GDAGYMKILR-----DEGLCGIGTQSSYPL 347
G+ GY+++ R GLCGI ++SYP+
Sbjct: 331 GERGYIRMQRGVSSDSNGLCGIAMEASYPV 360
>gi|218198967|gb|EEC81394.1| hypothetical protein OsI_24614 [Oryza sativa Indica Group]
Length = 342
Score = 310 bits (794), Expect = 6e-82, Method: Compositional matrix adjust.
Identities = 155/343 (45%), Positives = 219/343 (63%), Gaps = 22/343 (6%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEY 77
+ ++ L + A ++SR+ ++ H+KWMA+HGR+YKD EK RF++FK N++
Sbjct: 6 LLVVAGGLSTMAKVTMASRAGTMEA---RHDKWMAEHGRTYKDAAEKARRFRVFKANVDL 62
Query: 78 IEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD-- 135
I+++N GN+ Y+L TNRF+DLT+ EF A+YTGY + + + ++T LS D
Sbjct: 63 IDRSNAAGNKRYRLATNRFTDLTDAEFAAMYTGYNPANTMYAAANATT----RLSSEDDQ 118
Query: 136 VPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNG 195
P +DWR + AVT +K+Q+ CGCCWAFS VAAVEGI +I+ L+ LSEQQL+DC+ NG
Sbjct: 119 QPAEVDWRQQGAVTGVKNQRSCGCCWAFSTVAAVEGIHQITTGELVSLSEQQLLDCADNG 178
Query: 196 NNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTC----SAAQKAAAAKISNYEEVPS 251
GC GG+++ AF+Y+ + G+ TE Y YQ QG C S++ AA IS Y+ V
Sbjct: 179 --GCTGGSLDNAFQYMANSGGVTTEAAYAYQGAQGACQFDASSSASGVAATISGYQRVNP 236
Query: 252 GDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNG-VCGTQLDHAVTIVGFGTTEDGA- 309
DE +L AV+ QPVS+ I F+ Y G+F CGT+LDHAV +VG+G DG+
Sbjct: 237 NDEGSLAAAVASQPVSVAIEGSGAMFRHYGSGVFTADSCGTKLDHAVAVVGYGAEADGSG 296
Query: 310 --NYWLIKNSWGDTWGDAGYMKILRD---EGLCGIGTQSSYPL 347
YW+IKNSWG TWGD GYMK+ +D +G CG+ SYP+
Sbjct: 297 GGGYWIIKNSWGTTWGDGGYMKLEKDVGSQGACGVAMAPSYPV 339
>gi|3451077|emb|CAA20473.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|7269200|emb|CAB79307.1| cysteine proteinase-like protein [Arabidopsis thaliana]
Length = 355
Score = 310 bits (794), Expect = 7e-82, Method: Compositional matrix adjust.
Identities = 159/343 (46%), Positives = 229/343 (66%), Gaps = 19/343 (5%)
Query: 18 MFIIIILLVSCASQVVSSRST---HEQSVVEMH---EKWMAQHGRSYKDEL-EKEMRFKI 70
+F++I+ ++S S + +T H +S E+ + WM++HG++Y + L EKE RF+
Sbjct: 12 LFLLIVFVLSAPSSAMDLPATSGGHNRSNEEVEFIFQMWMSKHGKTYTNALGEKERRFQN 71
Query: 71 FKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQN 130
FK+NL +I++ N + N +Y+LG RF+DLT E+R L+ G P P R+ +S +Y
Sbjct: 72 FKDNLRFIDQHNAK-NLSYQLGLTRFADLTVQEYRDLFPG--SPKPKQRNLKTSR-RYVP 127
Query: 131 LSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVD 190
L+ +P S+DWR + AV+ IKDQ C CWAFS VAAVEG+ KI LI LSEQ+LVD
Sbjct: 128 LAGDQLPESVDWRQEGAVSEIKDQGTCNSCWAFSTVAAVEGLNKIVTGELISLSEQELVD 187
Query: 191 CSTNGNNGC-GGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQ-KAAAAKISNYEE 248
C+ NNGC G G M+ AF+++I N G+ +E +YPYQ QG+C+ Q I +YE+
Sbjct: 188 CNL-VNNGCYGSGLMDTAFQFLINNNGLDSEKDYPYQGTQGSCNRKQVHLLVITIDSYED 246
Query: 249 VPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDG 308
VP+ DE +L KAV+ QPVS+G+ + EF Y+ I+NG CGT LDHA+ IVG+G +E+G
Sbjct: 247 VPANDEISLQKAVAHQPVSVGVDKKSQEFMLYRSCIYNGPCGTNLDHALVIVGYG-SENG 305
Query: 309 ANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYPL 347
+YW+++NSWG TWGDAGY+KI R+ +GLCGI +SYP+
Sbjct: 306 QDYWIVRNSWGTTWGDAGYIKIARNFEDPKGLCGIAMLASYPI 348
>gi|30141019|dbj|BAC75923.1| cysteine protease-1 [Helianthus annuus]
Length = 461
Score = 310 bits (793), Expect = 9e-82, Method: Compositional matrix adjust.
Identities = 153/350 (43%), Positives = 219/350 (62%), Gaps = 19/350 (5%)
Query: 15 TIPMFIIIILLVSCASQVVSSRSTH------------EQSVVEMHEKWMAQHGRSYKDEL 62
T+ F +I ++ + +++ +TH + V ++E W+ +HG++Y
Sbjct: 8 TLSFFALISIISAMDMSIINYDATHMSSSSSSAPLRTDDEVNALYESWLVKHGKTYNALG 67
Query: 63 EKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTT 122
EK+ RF+IFK+NL +I++ N G+ TYKLG N+F+DLTN+E+R YTG K + +
Sbjct: 68 EKDRRFQIFKDNLRFIDEHN-SGDHTYKLGLNKFADLTNEEYRMTYTGIKTIDDKKKLSK 126
Query: 123 SSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQ 182
+ +Y S +P +DWR++ AVT +KDQ CG CWAFS +VEG+ KI +LI
Sbjct: 127 MKSDRYAYRSGDSLPEYVDWREQGAVTDVKDQGSCGSCWAFSTTGSVEGVNKIVTGDLIS 186
Query: 183 LSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQK-AAAA 241
+SEQ+LV+C T+ N GC GG M+ AFE+II+N GI TE++YPY G C +K A
Sbjct: 187 VSEQELVNCDTSYNQGCNGGLMDYAFEFIIKNGGIDTEEDYPYTGKDGKCDKNKKNAKVV 246
Query: 242 KISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVG 301
I +YE+VP DE +L KAVS QPV++ I A +F+ Y GIF G CGT LDH V G
Sbjct: 247 TIDSYEDVPVNDESSLKKAVSNQPVAVAIEAGGRDFQFYTSGIFTGSCGTALDHGVLAAG 306
Query: 302 FGTTEDGANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYPL 347
+G TEDG +YWL+KNSWG WG+ GY+K+ R+ G CGI ++SYP+
Sbjct: 307 YG-TEDGKDYWLVKNSWGAEWGEGGYLKMERNIADKSGKCGIAMEASYPI 355
>gi|109390302|gb|ABG33750.1| cysteine protease [Hevea brasiliensis]
Length = 457
Score = 310 bits (793), Expect = 9e-82, Method: Compositional matrix adjust.
Identities = 152/347 (43%), Positives = 223/347 (64%), Gaps = 20/347 (5%)
Query: 18 MFIIIILLVSCAS----QVVSSRSTH--------EQSVVEMHEKWMAQHGRSYKDELEKE 65
MF+++ + +S ++S +H + V+ ++E W+ +HG++Y EKE
Sbjct: 1 MFMLLFFASTLSSASDLSIISYDQSHGTKSSWRTDDEVMAIYEDWLVKHGKAYNSLGEKE 60
Query: 66 MRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSST 125
RF++FK+NL +I++ N E NRTY++G NRF+DLTN+E+R++Y G + +
Sbjct: 61 RRFEVFKDNLRFIDEHNSE-NRTYRVGLNRFADLTNEEYRSMYLG-ALSGIRRNKLRKIS 118
Query: 126 FKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSE 185
+Y +P S+DWR + AV +KDQ CG CWAFSAVAAVEGI KI +LI LSE
Sbjct: 119 DRYTPRVGDSLPDSVDWRKEGAVVGVKDQGSCGSCWAFSAVAAVEGINKIVTGDLISLSE 178
Query: 186 QQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQK-AAAAKIS 244
Q+LVDC + N GC GG M+ FE+II N GI +E++YPY A G C +K A I
Sbjct: 179 QELVDCDNSYNEGCNGGLMDYGFEFIINNGGIDSEEDYPYLARDGRCDTYRKNARVVSID 238
Query: 245 NYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGT 304
+YE+VP +E AL KAV+ QPVS+ I A +F+ Y G+F+G CGT LDH V VG+G
Sbjct: 239 SYEDVPVNNEAALQKAVANQPVSVAIEAGGRDFQLYSSGVFSGRCGTALDHGVVAVGYG- 297
Query: 305 TEDGANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYPL 347
TE+G +YW+++NSWG +WG++GY+++ R+ G+CGI ++SYP+
Sbjct: 298 TENGQDYWIVRNSWGKSWGESGYLRMARNIRKPTGICGIAMEASYPI 344
>gi|146215982|gb|ABQ10193.1| actinidin Act2b [Actinidia eriantha]
Length = 378
Score = 309 bits (792), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 159/335 (47%), Positives = 207/335 (61%), Gaps = 11/335 (3%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEY 77
+F +L++S A + +S V+ M+E W+ + G+SY EKEMRF+IFKENL
Sbjct: 13 LFFSTLLILSLALDIENSVQRTNDQVMAMYESWLVEQGKSYNSLDEKEMRFEIFKENLRI 72
Query: 78 IEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
I+ N + NR+Y LG NRF+DLT++E+R+ Y G KM + S +Y +P
Sbjct: 73 IDDHNADANRSYSLGLNRFADLTDEEYRSTYLGLKMGPKTDVSN-----EYMPKVGEALP 127
Query: 138 TSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCS-TNGN 196
+DWR AV +K+Q C CWAFSAV AVEGI KI NLI LSEQ+LVDC T
Sbjct: 128 DYVDWRTVGAVVGVKNQGLCSSCWAFSAVTAVEGINKIVTGNLISLSEQELVDCGRTQRT 187
Query: 197 NGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAA-AKISNYEEVPSGDEQ 255
GC G M AF++II N GI TED YPY A G C+ + K I NY+ VPS +E
Sbjct: 188 KGCNRGLMTDAFQFIINNGGINTEDNYPYTAKDGQCNLSLKNQKYVTIDNYKNVPSNNEM 247
Query: 256 ALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIK 315
AL KAV+ QPVS+G+ + +FK Y GIF G CGT +DH VTIVG+G TE G +YW++K
Sbjct: 248 ALKKAVAYQPVSVGVESEGGKFKLYTSGIFTGFCGTAVDHGVTIVGYG-TERGMDYWIVK 306
Query: 316 NSWGDTWGDAGYMKILRD---EGLCGIGTQSSYPL 347
NSWG WG+ GY++I R+ G CGI SYP+
Sbjct: 307 NSWGTNWGENGYIRIQRNIGGAGKCGIARMPSYPV 341
>gi|115468686|ref|NP_001057942.1| Os06g0582600 [Oryza sativa Japonica Group]
gi|55296512|dbj|BAD68726.1| putative cysteine proteinase [Oryza sativa Japonica Group]
gi|113595982|dbj|BAF19856.1| Os06g0582600 [Oryza sativa Japonica Group]
gi|215695236|dbj|BAG90427.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 357
Score = 309 bits (792), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 163/360 (45%), Positives = 223/360 (61%), Gaps = 26/360 (7%)
Query: 8 SGSFKINTIPMFIIIILLVSCASQVV--------SSRSTHEQSVVEMHEKWMAQHGRSYK 59
S SF + I ++II++ C + +V ++ + ++ E +EKW A HGR+YK
Sbjct: 5 SSSFSLAAI---LLIIIMYCCPTGLVEAARKGPAAAGGGDDSAMRERYEKWAADHGRTYK 61
Query: 60 DELEKEMRFKIFKENLEYIEKANKEGNR-TYKLGTNRFSDLTNDEFRALYTGYKMPSPSH 118
D LEK RF++F+ N +I+ N G + + +L TN+F+DLTN+EF A Y G +P
Sbjct: 62 DSLEKARRFEVFRTNALFIDSFNAAGGKKSPRLTTNKFADLTNEEF-AEYYGRPFSTPV- 119
Query: 119 RSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGA 178
S F Y N+ +DVP +++WRD+ AVT +K+Q++C CWAFSAVAAVEGI +I
Sbjct: 120 --IGGSGFMYGNVRTSDVPANINWRDRGAVTQVKNQKDCASCWAFSAVAAVEGIHQIRSH 177
Query: 179 NLIQLSEQQLVDCSTNGNN-GCGGGTMEKAFEYIIQNQGIATEDEYPYQ-AVQGTCSAAQ 236
NL+ LS QQL+DCST NN GC G M++AF YI N GIA E +YPY+ GTC A+
Sbjct: 178 NLVALSTQQLLDCSTGRNNHGCNRGDMDEAFRYITSNGGIAAESDYPYEDRALGTCRASG 237
Query: 237 KAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIF----NGVCGTQ 292
K AA I ++ VP +E ALL AV+ QPVS+ + + + G+F N C T
Sbjct: 238 KPVAASIRGFQYVPPNNETALLLAVAHQPVSVALDGVGKVSQFFSSGVFGAMQNETCTTD 297
Query: 293 LDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYPLA 348
L+HA+T VG+GT E G YWL+KNSWG WG+ GYMKI RD GLCG+ Q SYP+A
Sbjct: 298 LNHAMTAVGYGTDEHGTKYWLMKNSWGTDWGEGGYMKIARDVASNTGLCGLAMQPSYPVA 357
>gi|449500145|ref|XP_004161017.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
Length = 349
Score = 309 bits (791), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 150/308 (48%), Positives = 213/308 (69%), Gaps = 11/308 (3%)
Query: 44 VEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDE 103
+E+ E WM++H ++Y+ EK RF+IF +NL++I++ NK+ + +Y LG N F+DL+++E
Sbjct: 44 IELFESWMSKHSKTYRSIEEKLHRFEIFLDNLKHIDETNKKVS-SYWLGLNEFADLSHEE 102
Query: 104 FRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAF 163
F++ Y G ++ P RS S F Y ++ D+P S+DWR K AVTP+K+Q CG CWAF
Sbjct: 103 FKSKYLGLRVEFPRKRS--SRGFSYGDVE--DLPESVDWRTKGAVTPVKNQGSCGSCWAF 158
Query: 164 SAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEY 223
S VAAVEGI +I NL LSEQ+L+DC + NNGC GG M+ AF+YI+ N G+ E++Y
Sbjct: 159 STVAAVEGINQIVTGNLTSLSEQELIDCDRSFNNGCYGGLMDYAFQYIMSNSGLRKEEDY 218
Query: 224 PYQAVQGTC-SAAQKAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKE 282
PY +G C ++ IS YE+VP+ DEQ+LLKA+S QPVS+ I A + F+ YK
Sbjct: 219 PYLMEEGRCIREKEQFEVVTISGYEDVPANDEQSLLKALSHQPVSVAIEASSRNFQFYKG 278
Query: 283 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD----EGLCG 338
GIF G CGTQ+DH VT VG+G++E G +Y ++KNSWG WG+ GY+++ R+ EGLCG
Sbjct: 279 GIFTGRCGTQMDHGVTAVGYGSSE-GTDYIIVKNSWGPKWGENGYIRMKRNTGKPEGLCG 337
Query: 339 IGTQSSYP 346
I +SYP
Sbjct: 338 INQMASYP 345
>gi|326493368|dbj|BAJ85145.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 436
Score = 309 bits (791), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 151/333 (45%), Positives = 211/333 (63%), Gaps = 12/333 (3%)
Query: 22 IILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKA 81
+ L + +VS E+ V M+ +WMA+HG +Y E+E RF+ F++NL YI++
Sbjct: 18 VSLAAAADMSIVSYGERSEEEVRRMYAEWMAEHGSTYNAIGEEERRFEAFRDNLRYIDQH 77
Query: 82 N---KEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPT 138
N G +++LG NRF+DLTN+E+R+ Y G + R ++ +YQ ++P
Sbjct: 78 NAAADAGVHSFRLGLNRFADLTNEEYRSTYLGARTKPDRERKLSA---RYQAADNDELPE 134
Query: 139 SLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNG 198
S+DWR K AV +KDQ CG CWAFSA+AAVEGI +I ++I LSEQ+LVDC T+ N G
Sbjct: 135 SVDWRKKGAVGAVKDQGGCGSCWAFSAIAAVEGINQIVTGDMIPLSEQELVDCDTSYNQG 194
Query: 199 CGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQK-AAAAKISNYEEVPSGDEQAL 257
C GG M+ AFE+II N GI +E++YPY+ C A +K A I YE+VP E++L
Sbjct: 195 CNGGLMDYAFEFIINNGGIDSEEDYPYKERDNRCDANKKNAKVVTIDGYEDVPVNSEKSL 254
Query: 258 LKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNS 317
KAV+ QP+S+ I A F+ YK GIF G CGT LDH V VG+G TE+G +YWL++NS
Sbjct: 255 QKAVANQPISVAIEAGGRAFQLYKSGIFTGTCGTALDHGVAAVGYG-TENGKDYWLVRNS 313
Query: 318 WGDTWGDAGYMKILRD----EGLCGIGTQSSYP 346
WG WG+ GY+++ R+ G CGI + SYP
Sbjct: 314 WGSVWGEDGYIRMERNIKASSGKCGIAVEPSYP 346
>gi|449454309|ref|XP_004144898.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
gi|449471311|ref|XP_004153272.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
Length = 349
Score = 309 bits (791), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 150/308 (48%), Positives = 213/308 (69%), Gaps = 11/308 (3%)
Query: 44 VEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDE 103
+E+ E WM++H ++Y+ EK RF+IF +NL++I++ NK+ + +Y LG N F+DL+++E
Sbjct: 44 IELFESWMSKHSKAYRSIEEKLHRFEIFLDNLKHIDETNKKVS-SYWLGLNEFADLSHEE 102
Query: 104 FRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAF 163
F++ Y G ++ P RS S F Y ++ D+P S+DWR K AVTP+K+Q CG CWAF
Sbjct: 103 FKSKYLGLRVEFPRKRS--SRGFSYGDVE--DLPESVDWRTKGAVTPVKNQGSCGSCWAF 158
Query: 164 SAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEY 223
S VAAVEGI +I NL LSEQ+L+DC + NNGC GG M+ AF+YI+ N G+ E++Y
Sbjct: 159 STVAAVEGINQIVTGNLTSLSEQELIDCDRSFNNGCYGGLMDYAFQYIMSNSGLRKEEDY 218
Query: 224 PYQAVQGTC-SAAQKAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKE 282
PY +G C ++ IS YE+VP+ DEQ+LLKA+S QPVS+ I A + F+ YK
Sbjct: 219 PYLMEEGRCIREKEQFEVVTISGYEDVPANDEQSLLKALSHQPVSVAIEASSRNFQFYKG 278
Query: 283 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD----EGLCG 338
GIF G CGTQ+DH VT VG+G++E G +Y ++KNSWG WG+ GY+++ R+ EGLCG
Sbjct: 279 GIFTGRCGTQMDHGVTAVGYGSSE-GTDYIIVKNSWGPKWGENGYIRMKRNTGKPEGLCG 337
Query: 339 IGTQSSYP 346
I +SYP
Sbjct: 338 INQMASYP 345
>gi|255567869|ref|XP_002524912.1| cysteine protease, putative [Ricinus communis]
gi|223535747|gb|EEF37409.1| cysteine protease, putative [Ricinus communis]
Length = 366
Score = 309 bits (791), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 155/346 (44%), Positives = 214/346 (61%), Gaps = 18/346 (5%)
Query: 18 MFIIIILLVSCASQVVSSRSTH--------EQSVVEMHEKWMAQHGRSYKDELEKEMRFK 69
+F+ L + ++S H + V+ M+ W+A+H ++Y E+E RF+
Sbjct: 11 LFLFFTLSSAWDMSILSHNHGHHHQSSWRSDNEVISMYNWWLAKHSKTYNKLGEREKRFE 70
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHR--STTSSTFK 127
IFK NL +I++ N NRTYK+G RF+DLTN+E+RA + G K P R + + + +
Sbjct: 71 IFKNNLRFIDEHNNSKNRTYKVGLTRFADLTNEEYRAKFLGTK-SDPKRRLMKSKNPSQR 129
Query: 128 YQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQ 187
Y + +P S+DWR AV+ IKDQ CG CWAFS +AAVEG+ KI LI LSEQ+
Sbjct: 130 YAFKAGDVLPESIDWRQSGAVSAIKDQGSCGSCWAFSTIAAVEGVNKIVTGELISLSEQE 189
Query: 188 LVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQ-KAAAAKISNY 246
LVDC + N GC GG M+ AF++II N GI T+ +YPYQAV G C + K A I +
Sbjct: 190 LVDCDRSYNAGCNGGLMDNAFQFIINNGGIDTDKDYPYQAVDGKCDTTKVKNKAVTIDGF 249
Query: 247 EEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTE 306
E+V + DE AL KAV+ QPVS+ I A + Y+ G+F G CG+ LDH V IVG+G TE
Sbjct: 250 EDVMAFDEMALQKAVAHQPVSVAIEASGMALQFYQSGVFTGECGSALDHGVVIVGYG-TE 308
Query: 307 DGANYWLIKNSWGDTWGDAGYMKILRD-----EGLCGIGTQSSYPL 347
DG +YWL++NSWG WG+ GY+K+ R+ G CGI +SSYP+
Sbjct: 309 DGIDYWLVRNSWGRDWGENGYIKMQRNVVDTFTGKCGIAMESSYPI 354
>gi|2224810|emb|CAB09698.1| cysteine proteinase [Hordeum vulgare subsp. vulgare]
Length = 349
Score = 309 bits (791), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 152/317 (47%), Positives = 214/317 (67%), Gaps = 9/317 (2%)
Query: 38 THEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFS 97
T + ++V HEKWMA+HGR+Y +E EK R ++F+ N + I+ N + T++L TNRF+
Sbjct: 35 TVDAAMVSRHEKWMAEHGRTYANEEEKARRLEVFRANAKLIDSFNSAEDSTHRLATNRFA 94
Query: 98 DLTNDEFRALYTGYKMP--SPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQ 155
DLT++EFRA TG + P + + + + F+Y+N S+ D S+DWR AVT +KDQ
Sbjct: 95 DLTDEEFRAARTGLRRPPAAAAGAGSGAGGFRYENFSLADAAGSMDWRAMGAVTGVKDQG 154
Query: 156 ECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNN-GCGGGTMEKAFEYIIQN 214
CGCCWAFSAVAAVEG+TKI L+ LSEQQLVDC G++ GC GG M+ AFEY+I
Sbjct: 155 SCGCCWAFSAVAAVEGLTKIRTGRLVSLSEQQLVDCDVYGDDEGCAGGLMDNAFEYMINR 214
Query: 215 QGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYT 274
G+ TE YPY+ G+C + A+AA I YE+VP+ +E AL+ AV+ QPVS+ I
Sbjct: 215 GGLTTESSYPYRGTDGSCR--RSASAASIRGYEDVPANNEAALMAAVAHQPVSVAINGGD 272
Query: 275 TEFKSYKEGIFNGV-CGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKI--- 330
+ F+ Y G+ G CGT+L+HA+T G+GT DG YW++KNSWG +WG+ GY++I
Sbjct: 273 SVFRFYDSGVLGGSGCGTELNHAITAAGYGTASDGTKYWIMKNSWGGSWGEGGYVRIRRG 332
Query: 331 LRDEGLCGIGTQSSYPL 347
+R EG+CG+ +SYP+
Sbjct: 333 VRGEGVCGLAQLASYPV 349
>gi|148907299|gb|ABR16787.1| unknown [Picea sitchensis]
Length = 372
Score = 308 bits (790), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 156/320 (48%), Positives = 211/320 (65%), Gaps = 18/320 (5%)
Query: 40 EQSVVEMHEKWMAQHGRSYK--DELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFS 97
++S+ +++KW QH RS + D E RF+IFKEN+++I+ NK+ + YKLG N+F+
Sbjct: 38 DESLRGLYDKWALQH-RSTRSLDSDEHARRFEIFKENVKHIDSVNKK-DGPYKLGLNKFA 95
Query: 98 DLTNDEFRALYTGYKMPSPS----HRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKD 153
DL+N+EF+A++ KM R S +F YQN +P S+DWR K AVTP+K+
Sbjct: 96 DLSNEEFKAMHMTTKMEKHKSLRGDRGVESGSFMYQNSKR--LPASIDWRKKGAVTPVKN 153
Query: 154 QQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQ 213
Q +CG CWAFS +A+VEGI I L+ LSEQQLVDCS N GC GG M+ AF+YII
Sbjct: 154 QGQCGSCWAFSTIASVEGINYIKTGKLVSLSEQQLVDCSKE-NAGCNGGLMDNAFQYIID 212
Query: 214 NQGIATEDEYPYQAVQGTCSAAQ---KAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGI 270
N GI TEDEYPY A G CS + K+ A I +E+VP+ +E AL KAV+ QPVSI I
Sbjct: 213 NGGIVTEDEYPYTAEAGECSTTKIESKSIATIIDGFEDVPANNEGALKKAVAHQPVSIAI 272
Query: 271 AAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKI 330
A +F+ Y G+F G CGT+LDH V +VG+G + +G NYW+++NSWG WG+ GY+++
Sbjct: 273 EASGHDFQFYSTGVFTGKCGTELDHGVVVVGYGKSPEGINYWIVRNSWGPEWGEQGYIRM 332
Query: 331 LR----DEGLCGIGTQSSYP 346
R EG CGI Q+SYP
Sbjct: 333 QRGIEATEGKCGISMQASYP 352
>gi|363807062|ref|NP_001242584.1| uncharacterized protein LOC100804015 precursor [Glycine max]
gi|255640677|gb|ACU20623.1| unknown [Glycine max]
Length = 366
Score = 308 bits (789), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 151/341 (44%), Positives = 210/341 (61%), Gaps = 7/341 (2%)
Query: 13 INTIPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFK 72
+ I + + +S A + + + + V+ M+E+W+ +H + Y + +K+ RF++FK
Sbjct: 4 MTMIYTLLFLSFTLSYAIKTSTIINYTDNEVMAMYEEWLVRHQKGYNELGKKDKRFQVFK 63
Query: 73 ENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
+NL +I++ N N TYKLG N+F+D+TN+E+RA+Y G K + T ST S
Sbjct: 64 DNLGFIQEHNNNLNNTYKLGLNKFADMTNEEYRAMYLGTKSNAKRRLMKTKSTGHRYAFS 123
Query: 133 MTD-VPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDC 191
D +P +DWR K AV PIKDQ CG CWAFS VA VE I KI + LSEQ+LVDC
Sbjct: 124 ARDRLPVHVDWRMKGAVAPIKDQGSCGSCWAFSTVATVEAINKIVTGKFVSLSEQELVDC 183
Query: 192 STNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQK-AAAAKISNYEEVP 250
N GC GG M+ AFE+IIQN GI T+ +YPY+ G C +K A I YE+VP
Sbjct: 184 DRAYNEGCNGGLMDYAFEFIIQNGGIDTDKDYPYRGFDGICDPTKKNAKVVNIDGYEDVP 243
Query: 251 SGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGAN 310
DE AL KAV+ QPVS+ I A + Y+ G+F G CGT LDH V +VG+G +E+G +
Sbjct: 244 PYDENALKKAVAHQPVSVAIEASGRALQLYQSGVFTGKCGTSLDHGVVVVGYG-SENGVD 302
Query: 311 YWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYPL 347
YWL++NSWG WG+ GY K+ R+ G CGI ++SYP+
Sbjct: 303 YWLVRNSWGTGWGEDGYFKMQRNVRTSTGKCGITMEASYPV 343
>gi|357124027|ref|XP_003563708.1| PREDICTED: germination-specific cysteine protease 1-like
[Brachypodium distachyon]
Length = 334
Score = 308 bits (789), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 148/322 (45%), Positives = 206/322 (63%), Gaps = 13/322 (4%)
Query: 40 EQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKE----GNRTYKLGTNR 95
++++ E +EKWMA+ GR+YKD EK RF++FK N +I+ N G KL TN+
Sbjct: 13 DKAMRERYEKWMAEQGRTYKDSTEKARRFEVFKSNAHFIDSHNAATGPGGKSRPKLTTNK 72
Query: 96 FSDLTNDEFRALY-TGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQ 154
F+DLT DEFR +Y TG+++ T + FK+ +S++DVP S+DWR + AVT +KDQ
Sbjct: 73 FADLTEDEFRNIYVTGHRVNYRPTSLVTDTVFKFGAVSLSDVPPSIDWRARGAVTSVKDQ 132
Query: 155 QECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQN 214
C CCWAFS+ AAVEGI +I+ N + LS QQLVDCS N C G ++KA+EYI ++
Sbjct: 133 HLCACCWAFSSAAAVEGIHQITTGNQVSLSVQQLVDCSNAANEKCKAGEIDKAYEYIARS 192
Query: 215 QGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYT 274
G+ + +YPY+ GTC K A A+IS ++ VP+ +E ALL AV+ QPVS+ + +
Sbjct: 193 GGLVADQDYPYEGHSGTCRVYGKQAVARISGFQYVPARNETALLLAVAHQPVSVALDGLS 252
Query: 275 TEFKSYKEGIFNGV---CGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKIL 331
+ GIF C T L+HA+TIVG+GT E G YWL+KNSWG WGD GY+K
Sbjct: 253 RALQHIGTGIFGSAGEPCTTNLNHAMTIVGYGTDEHGTRYWLMKNSWGSDWGDKGYVKFA 312
Query: 332 RD-----EGLCGIGTQSSYPLA 348
RD G+CG+ ++SYP+A
Sbjct: 313 RDVASEINGVCGLALEASYPVA 334
>gi|226529105|ref|NP_001150196.1| cysteine protease 1 precursor [Zea mays]
gi|194701798|gb|ACF84983.1| unknown [Zea mays]
gi|194704800|gb|ACF86484.1| unknown [Zea mays]
gi|195637480|gb|ACG38208.1| cysteine protease 1 precursor [Zea mays]
gi|413919895|gb|AFW59827.1| cysteine protease 1 [Zea mays]
Length = 470
Score = 308 bits (789), Expect = 3e-81, Method: Compositional matrix adjust.
Identities = 146/318 (45%), Positives = 216/318 (67%), Gaps = 13/318 (4%)
Query: 40 EQSVVEMHEKWMAQHGRSY----KDELEKEMRFKIFKENLEYIEKAN-KEGNRTYKLGTN 94
E V M++ W+A+HGR+Y + E E++ RF +F +NL +++ N + G R ++LG N
Sbjct: 50 EPEVRAMYDLWLAEHGRAYNALGEGEGERDRRFLVFWDNLRFVDAHNERAGARGFRLGMN 109
Query: 95 RFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQ 154
+F+DLTNDEFRA Y G +P+ + +++ + ++P S+DWR+K AV P+K+Q
Sbjct: 110 QFADLTNDEFRAAYLGAMVPAARRGAVVGERYRHDG-AAEELPESVDWREKGAVAPVKNQ 168
Query: 155 QECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTN-GNNGCGGGTMEKAFEYIIQ 213
+CG CWAFSAV++VE + +I ++ LSEQ+LV+CST+ GN+GC GG M+ AF++II+
Sbjct: 169 GQCGSCWAFSAVSSVESVNQIVTGEMVTLSEQELVECSTDGGNSGCNGGLMDAAFDFIIK 228
Query: 214 NQGIATEDEYPYQAVQGTCSAAQK-AAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAA 272
N GI TED+YPY+AV G C +K A I +E+VP DE++L KAV+ QPVS+ I A
Sbjct: 229 NGGIDTEDDYPYRAVDGKCDMNRKNARVVSIDGFEDVPENDEKSLQKAVAHQPVSVAIEA 288
Query: 273 YTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILR 332
EF+ YK G+F+G C T LDH V VG+G E+G +YW+++NSWG WG+AGY+++ R
Sbjct: 289 GGREFQLYKSGVFSGSCTTNLDHGVVAVGYG-AENGKDYWIVRNSWGPKWGEAGYIRMER 347
Query: 333 D----EGLCGIGTQSSYP 346
+ G CGI +SYP
Sbjct: 348 NVNASTGKCGIAMMASYP 365
>gi|146215978|gb|ABQ10191.1| actinidin Act1c [Actinidia eriantha]
Length = 368
Score = 308 bits (789), Expect = 3e-81, Method: Compositional matrix adjust.
Identities = 159/335 (47%), Positives = 215/335 (64%), Gaps = 14/335 (4%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEY 77
+F +L++S A + ++ T+++ V M+E W+ +HG+SY E+E RF+IFKE L +
Sbjct: 13 LFFSTLLILSLA---LDAKRTNDE-VKAMYESWLIKHGKSYNSLGERERRFEIFKETLRF 68
Query: 78 IEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
I++ N + +R+YK+G N+F+DLTN+EFR+ Y G+ S + T + +Y+ +P
Sbjct: 69 IDEHNADTSRSYKVGLNQFADLTNEEFRSTYLGFTRGS----NKTKVSNRYEPRVGQVLP 124
Query: 138 TSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCS-TNGN 196
+DWR + AV IK+Q +CG CWAFSA+AAVEGI KI NLI LSEQ+LVDC T
Sbjct: 125 DYVDWRSEGAVVDIKNQGQCGSCWAFSAIAAVEGINKIVTGNLISLSEQELVDCGRTQST 184
Query: 197 NGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSA-AQKAAAAKISNYEEVPSGDEQ 255
GC GG M FE+II N GI TE+ YPY A +G C Q I NYE VP +E
Sbjct: 185 KGCDGGYMTDGFEFIINNGGINTEENYPYTAQEGQCDLNLQNEKYVTIDNYENVPYYNEW 244
Query: 256 ALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIK 315
AL AV+ QPVS+ + + F+ Y GIF G CGT DHAVTIVG+G TE G +YW++K
Sbjct: 245 ALQTAVAYQPVSVALESAGDAFQHYSSGIFTGPCGTATDHAVTIVGYG-TEGGIDYWIVK 303
Query: 316 NSWGDTWGDAGYMKILRD---EGLCGIGTQSSYPL 347
NSW TWG+ GYM+ILR+ G CGI T SYP+
Sbjct: 304 NSWDTTWGEEGYMRILRNVGGAGTCGIATMPSYPV 338
>gi|357465603|ref|XP_003603086.1| Cysteine proteinase [Medicago truncatula]
gi|355492134|gb|AES73337.1| Cysteine proteinase [Medicago truncatula]
Length = 474
Score = 308 bits (788), Expect = 3e-81, Method: Compositional matrix adjust.
Identities = 162/359 (45%), Positives = 227/359 (63%), Gaps = 28/359 (7%)
Query: 14 NTIPMFIIIILLV------SCASQVVSSRSTH--------EQSVVEMHEKWMAQHGR--S 57
N PM +I+I+ + ++S TH ++ V ++E+W +HG+ +
Sbjct: 6 NRSPMLVILIVFTLFTATFALDMSIISYDKTHSDKSSRRSDKEVKNIYEEWRVKHGKLNN 65
Query: 58 YKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPS 117
D EK+ RF+IFK+NL++I++ N E NRTYK+G NRF+DL+N+E+R+ Y G K+
Sbjct: 66 NIDGSEKDKRFEIFKDNLKFIDEHNAE-NRTYKVGLNRFADLSNEEYRSRYLGTKIDPIG 124
Query: 118 H---RSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITK 174
R+ T S +Y +P S+DWR + AV +KDQ CG CWAFS +AAVEGI K
Sbjct: 125 MMMARTKTRSN-RYAPSVGDKLPKSVDWRSQGAVVQVKDQGSCGSCWAFSTIAAVEGINK 183
Query: 175 ISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSA 234
I L+ LSEQ+LVDC N GC GG ME AFE+II N GI ++++YPY+ V G C
Sbjct: 184 IVTGELVSLSEQELVDCDRTVNAGCDGGLMEYAFEFIINNGGIDSDEDYPYRGVDGKCDQ 243
Query: 235 AQK-AAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQL 293
+K A I +YE+VP+ DE AL KAV+ QP+S+ I A EF+ Y GIF G CGT L
Sbjct: 244 YKKNARVVSIDDYEQVPAYDELALKKAVANQPISVAIEAGGREFQLYVSGIFTGKCGTAL 303
Query: 294 DHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD-----EGLCGIGTQSSYPL 347
DH VT VG+G TE+G +YW+++NSWG +WG++GY+++ R+ G CGI QSSYP+
Sbjct: 304 DHGVTAVGYG-TENGVDYWIVRNSWGKSWGESGYVRMERNLAASVAGKCGIVMQSSYPI 361
>gi|388517427|gb|AFK46775.1| unknown [Medicago truncatula]
Length = 362
Score = 308 bits (788), Expect = 3e-81, Method: Compositional matrix adjust.
Identities = 155/346 (44%), Positives = 218/346 (63%), Gaps = 16/346 (4%)
Query: 12 KINTIPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIF 71
K+ I + I ++L+VS + + ++S+ +++E+W + H S ++ EK+ RF +F
Sbjct: 5 KLLLIVLSIALVLVVSESFDFHDKDVSSDESLWDLYERWRSHHTVS-RNLNEKQKRFNVF 63
Query: 72 KENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHR-----STTSSTF 126
K N+ ++ NK ++ YKL N+F+D+TN EF+ Y G K+ HR S TF
Sbjct: 64 KSNVMHVHNTNKM-DKPYKLKLNKFADMTNHEFKTTYAGTKVNH--HRMFRGTPRVSGTF 120
Query: 127 KYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQ 186
Y+N T P S+DWR K AVT +KDQ +CG CWAFS V AVEGI +I L+ LSEQ
Sbjct: 121 MYENF--TKAPASVDWRKKGAVTDVKDQGQCGSCWAFSTVVAVEGINQIKTNRLVPLSEQ 178
Query: 187 QLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQK-AAAAKISN 245
+L+DC N GC GG ME AFEYI Q G+ TE YPY A G+C A ++ I
Sbjct: 179 ELIDCDNQENQGCNGGLMEYAFEYIKQKGGVTTESYYPYTANDGSCDATKENVPTVSIDG 238
Query: 246 YEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
+E VP+ DE ALLKAV+ QPVS+ I A ++F+ Y EG+F G CG +L+H V IVG+GTT
Sbjct: 239 HETVPANDEDALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGDCGKELNHGVAIVGYGTT 298
Query: 306 EDGANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYPL 347
DG NYW+++NSWG WG+ G +++ R+ EGLCGI ++SYP+
Sbjct: 299 VDGTNYWIVRNSWGAEWGEQGCIRMKRNVSNKEGLCGIAMEASYPV 344
>gi|297791625|ref|XP_002863697.1| hypothetical protein ARALYDRAFT_917391 [Arabidopsis lyrata subsp.
lyrata]
gi|297309532|gb|EFH39956.1| hypothetical protein ARALYDRAFT_917391 [Arabidopsis lyrata subsp.
lyrata]
Length = 463
Score = 308 bits (788), Expect = 3e-81, Method: Compositional matrix adjust.
Identities = 155/324 (47%), Positives = 216/324 (66%), Gaps = 17/324 (5%)
Query: 33 VSSRSTHEQSVVEMHEKWMAQHGRSYKDE----LEKEMRFKIFKENLEYIEKANKEGNRT 88
VSSRS E V ++E WM +HG+ ++ EK+ RF+IFK+NL YI++ N + N +
Sbjct: 38 VSSRSDAE--VERIYEAWMVEHGKKKMNQNGLGAEKDQRFEIFKDNLRYIDEHNTK-NLS 94
Query: 89 YKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAV 148
YKLG RF+DLTNDE+R++Y G K P R +S +Y+ +P S+DWR + AV
Sbjct: 95 YKLGLTRFADLTNDEYRSMYLGAK---PVKRVLKTSD-RYEARVGDALPDSVDWRKEGAV 150
Query: 149 TPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAF 208
+KDQ CG CWAFS + AVEGI KI +LI LSEQ+LVDC T+ N GC GG M+ AF
Sbjct: 151 ADVKDQGSCGSCWAFSTIGAVEGINKIVTGDLISLSEQELVDCDTSYNQGCNGGLMDYAF 210
Query: 209 EYIIQNQGIATEDEYPYQAVQGTCSAAQK-AAAAKISNYEEVPSGDEQALLKAVSMQPVS 267
E+II+N GI TE +YPY+A G C +K A I +YE+VP E +L KA++ QP+S
Sbjct: 211 EFIIKNGGIDTEADYPYKAADGRCDQNRKNAKVVTIDSYEDVPENSEASLKKALAHQPIS 270
Query: 268 IGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGY 327
+ I A F+ Y G+F+G+CGT+LDH V VG+G TE+G +YW+++NSWG+ WG++GY
Sbjct: 271 VAIEAGGRAFQLYSSGVFDGICGTELDHGVVAVGYG-TENGKDYWIVRNSWGNRWGESGY 329
Query: 328 MKILRD----EGLCGIGTQSSYPL 347
+K+ R+ G CGI ++SYP+
Sbjct: 330 IKMARNIAEPTGKCGIAMEASYPI 353
>gi|2414570|emb|CAB16317.1| cysteine proteinase precursor [Nicotiana tabacum]
Length = 374
Score = 308 bits (788), Expect = 3e-81, Method: Compositional matrix adjust.
Identities = 153/318 (48%), Positives = 207/318 (65%), Gaps = 13/318 (4%)
Query: 38 THEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFS 97
+ E V +E W+A+HGR+Y EKE RF+IFK+NL +IE N GNRTYK+G N+F+
Sbjct: 41 SDEDQVKNRYEMWLAEHGRAYNALGEKEKRFEIFKDNLRFIEGHNNSGNRTYKVGLNQFA 100
Query: 98 DLTNDEFRALYTGYKMPSPSHRSTTSS---TFKYQNLSMTDVPTSLDWRDKKAVTPIKDQ 154
DLTN+E+R +Y G K S + R S + +Y + +P S+DWR + AV PIK+Q
Sbjct: 101 DLTNEEYRTMYLGTK--SDARRRFVKSKNPSQRYASRPNELMPHSVDWRKRGAVAPIKNQ 158
Query: 155 QECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQN 214
CG CWAFS VAAVEGI +I +I LSEQ+LVDC N+GC GG M+ AFE+II N
Sbjct: 159 GSCGSCWAFSTVAAVEGINQIVTGEMITLSEQELVDCDRVQNSGCNGGLMDYAFEFIISN 218
Query: 215 QGIATEDEYPYQAVQGTCSAAQKA-AAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAY 273
G+ TE YPY+ V+G C +K I YE+VP +E+AL KAV+ QPV + I A
Sbjct: 219 GGMDTEKHYPYRGVEGRCDPVRKNYKVVSIDGYEDVPR-NERALQKAVAHQPVCVAIEAS 277
Query: 274 TTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD 333
F+ Y G+F G CG ++DH V +VG+G +EDG +YW+++NSWG WG+ GY+K+ R+
Sbjct: 278 GRAFQLYSSGVFTGECGEEVDHGVVVVGYG-SEDGVDYWIVRNSWGTKWGENGYVKMERN 336
Query: 334 E-----GLCGIGTQSSYP 346
G CGI T++SYP
Sbjct: 337 VKKSHLGKCGIMTEASYP 354
>gi|224083868|ref|XP_002307151.1| predicted protein [Populus trichocarpa]
gi|222856600|gb|EEE94147.1| predicted protein [Populus trichocarpa]
Length = 298
Score = 308 bits (788), Expect = 3e-81, Method: Compositional matrix adjust.
Identities = 154/314 (49%), Positives = 213/314 (67%), Gaps = 24/314 (7%)
Query: 43 VVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTND 102
+ E HE+WMAQ+GR YKD+ EKE R+ IFKEN+ I+ N + ++Y LG N+F+DL+N+
Sbjct: 1 MYERHEQWMAQYGRVYKDDAEKETRYNIFKENVARIDAFNSQTGKSYNLGVNQFADLSNE 60
Query: 103 EFRALYTGYK--MPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCC 160
EF+A +K M SP + F+Y+N+S VP ++DWR K AVTP+KDQ +C
Sbjct: 61 EFKASRNRFKGHMCSPQ-----AGPFRYENVSA--VPATMDWRKKGAVTPVKDQGQC--- 110
Query: 161 WAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNG-NNGCGGGTMEKAFEYIIQNQGIAT 219
VAA+EGI +++ LI LSEQ++VDC T G + GC GG M+ AF++I QN+G+ T
Sbjct: 111 -----VAAMEGINQLTTGKLISLSEQEVVDCDTKGEDQGCNGGLMDDAFKFIEQNKGLTT 165
Query: 220 EDEYPYQAVQGTCSAAQKAA-AAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFK 278
E YPY GTC+ ++ + AAKI+ +++VP+ E AL+KAV+ QPVS+ I A EF+
Sbjct: 166 EANYPYTGTDGTCNTQKEVSHAAKITGFQDVPANSEAALMKAVAKQPVSVAIDAGGFEFQ 225
Query: 279 SYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD----E 334
Y GIF G CGT+LDH VT VG+G + DG YWL+KNSWG WG+ GY+++ +D E
Sbjct: 226 FYSSGIFTGSCGTELDHGVTAVGYGGS-DGTKYWLVKNSWGAQWGEEGYIRMQKDISAKE 284
Query: 335 GLCGIGTQSSYPLA 348
GLCGI Q+SYP A
Sbjct: 285 GLCGIAMQASYPTA 298
>gi|217073894|gb|ACJ85307.1| unknown [Medicago truncatula]
gi|388507498|gb|AFK41815.1| unknown [Medicago truncatula]
Length = 362
Score = 308 bits (788), Expect = 3e-81, Method: Compositional matrix adjust.
Identities = 155/346 (44%), Positives = 218/346 (63%), Gaps = 16/346 (4%)
Query: 12 KINTIPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIF 71
K+ I + I ++L+VS + + ++S+ +++E+W + H S ++ EK+ RF +F
Sbjct: 5 KLLLIVLSIALVLVVSESFDFHDKDVSSDESLWDLYERWRSHHTVS-RNLNEKQKRFNVF 63
Query: 72 KENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHR-----STTSSTF 126
K N+ ++ NK ++ YKL N+F+D+TN EF+ Y G K+ HR S TF
Sbjct: 64 KSNVMHVHNTNKM-DKPYKLKLNKFADMTNHEFKTTYAGSKVNH--HRMFRGTPRVSGTF 120
Query: 127 KYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQ 186
Y+N T P S+DWR K AVT +KDQ +CG CWAFS V AVEGI +I L+ LSEQ
Sbjct: 121 MYENF--TKAPASVDWRKKGAVTDVKDQGQCGSCWAFSTVVAVEGINQIKTNRLVPLSEQ 178
Query: 187 QLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQK-AAAAKISN 245
+L+DC N GC GG ME AFEYI Q G+ TE YPY A G+C A ++ I
Sbjct: 179 ELIDCDNQENQGCNGGLMEYAFEYIKQKGGVTTESYYPYTANDGSCDATKENVPTVSIDG 238
Query: 246 YEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
+E VP+ DE ALLKAV+ QPVS+ I A ++F+ Y EG+F G CG +L+H V IVG+GTT
Sbjct: 239 HETVPANDEDALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGDCGKELNHGVAIVGYGTT 298
Query: 306 EDGANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYPL 347
DG NYW+++NSWG WG+ G +++ R+ EGLCGI ++SYP+
Sbjct: 299 VDGTNYWIVRNSWGAEWGEQGCIRMKRNVSNKEGLCGIAMEASYPV 344
>gi|146215980|gb|ABQ10192.1| actinidin Act2a [Actinidia deliciosa]
Length = 378
Score = 308 bits (788), Expect = 4e-81, Method: Compositional matrix adjust.
Identities = 158/335 (47%), Positives = 209/335 (62%), Gaps = 11/335 (3%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEY 77
+F +L++S A + +S V+ M+E W+ +HG+SY EKEMRF+IFKENL
Sbjct: 13 LFFSTLLILSSAIDIENSVQRTNDQVMAMYESWLVEHGKSYNSLDEKEMRFEIFKENLRI 72
Query: 78 IEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
I+ N + NR+Y LG NRF+DLT++E+R+ Y G K T + +Y +P
Sbjct: 73 IDDHNADANRSYSLGLNRFADLTDEEYRSTYLGLK-----RGPKTDVSNQYMPKVGDALP 127
Query: 138 TSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCS-TNGN 196
+DWR AV +K+Q C CWAFSAVAAVEGI KI NLI LSEQ+LVDC T
Sbjct: 128 DYVDWRTVGAVVGVKNQGLCSSCWAFSAVAAVEGINKIVTGNLISLSEQELVDCGRTQIT 187
Query: 197 NGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAA-AKISNYEEVPSGDEQ 255
GC G M AF++II N GI TE+ YPY A G C+ + K I +Y+ VPS +E
Sbjct: 188 KGCNRGLMTDAFKFIINNGGINTENNYPYTAKDGQCNLSLKNQKYVTIDSYKNVPSNNEM 247
Query: 256 ALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIK 315
AL KAV+ QPVS+G+ + +FK Y GIF G CGT +DH VTIVG+G TE G +YW++K
Sbjct: 248 ALKKAVAYQPVSVGVESEGGKFKLYTSGIFTGSCGTAVDHGVTIVGYG-TERGMDYWIVK 306
Query: 316 NSWGDTWGDAGYMKILRD---EGLCGIGTQSSYPL 347
NSWG WG++GY++I R+ G CGI SYP+
Sbjct: 307 NSWGTNWGESGYIRIQRNIGGAGKCGIAKMPSYPV 341
>gi|18402225|ref|NP_566633.1| Granulin repeat cysteine protease family protein [Arabidopsis
thaliana]
gi|11994461|dbj|BAB02463.1| cysteine proteinase [Arabidopsis thaliana]
gi|17065298|gb|AAL32803.1| cysteine proteinase [Arabidopsis thaliana]
gi|20260004|gb|AAM13349.1| cysteine proteinase [Arabidopsis thaliana]
gi|332642713|gb|AEE76234.1| Granulin repeat cysteine protease family protein [Arabidopsis
thaliana]
Length = 452
Score = 307 bits (787), Expect = 4e-81, Method: Compositional matrix adjust.
Identities = 154/345 (44%), Positives = 219/345 (63%), Gaps = 12/345 (3%)
Query: 10 SFKINTIPMFIIIILLVSCASQVVSSRST--HEQSVVEMHEKWMAQHGRSYKDELEKEMR 67
S K T+ + I +LL+S + V++ T +E M+E+W+ ++ ++Y EKE R
Sbjct: 4 SIKSITLALLIFSVLLISLSLGSVTATETTRNEAEARRMYERWLVENRKNYNGLGEKERR 63
Query: 68 FKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFK 127
F+IFK+NL+++E+ + NRTY++G RF+DLTNDEFRA+Y KM K
Sbjct: 64 FEIFKDNLKFVEEHSSIPNRTYEVGLTRFADLTNDEFRAIYLRSKM---ERTRVPVKGEK 120
Query: 128 YQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQ 187
Y +P ++DWR K AV P+KDQ CG CWAFSA+ AVEGI +I LI LSEQ+
Sbjct: 121 YLYKVGDSLPDAIDWRAKGAVNPVKDQGSCGSCWAFSAIGAVEGINQIKTGELISLSEQE 180
Query: 188 LVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQ-GTCSAAQK-AAAAKISN 245
LVDC T+ N+GCGGG M+ AF++II+N GI TE++YPY A C++ +K I
Sbjct: 181 LVDCDTSYNDGCGGGLMDYAFKFIIENGGIDTEEDYPYIATDVNVCNSDKKNTRVVTIDG 240
Query: 246 YEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
YE+VP DE++L KA++ QP+S+ I A F+ Y G+F G CGT LDH V VG+G +
Sbjct: 241 YEDVPQNDEKSLKKALANQPISVAIEAGGRAFQLYTSGVFTGTCGTSLDHGVVAVGYG-S 299
Query: 306 EDGANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYP 346
E G +YW+++NSWG WG++GY K+ R+ G CG+ +SYP
Sbjct: 300 EGGQDYWIVRNSWGSNWGESGYFKLERNIKESSGKCGVAMMASYP 344
>gi|146216000|gb|ABQ10202.1| cysteine protease Cp4 [Actinidia deliciosa]
Length = 463
Score = 307 bits (787), Expect = 4e-81, Method: Compositional matrix adjust.
Identities = 152/349 (43%), Positives = 224/349 (64%), Gaps = 15/349 (4%)
Query: 10 SFKINTIPMFIIIILLVSCA--SQVVSSRSTH-----EQSVVEMHEKWMAQHGRSYKDEL 62
S + +++ + + S A ++S TH + + ++EKW+ HG++Y
Sbjct: 3 SVRASSVACLLFLCFAFSSALDMSIISYDQTHPPQRTDAEAMAIYEKWLTTHGKAYNAIG 62
Query: 63 EKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTT 122
EKE RF+IFK+NL ++++ N +Y++G NRF+DLTN+E+R+++ G M RS +
Sbjct: 63 EKERRFEIFKDNLRFVDEHNAVAG-SYRVGLNRFADLTNEEYRSMFLGGNMEM-KERSAS 120
Query: 123 SSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQ 182
+ + +Y + +P S+DWR+K AV+P+KDQ +CG CWAFS ++AVEGI +I LI
Sbjct: 121 TKSDRYAFRAGDKLPGSVDWREKGAVSPVKDQGQCGSCWAFSTISAVEGINQIVTGELIS 180
Query: 183 LSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQK-AAAA 241
LSEQ+LVDC + N GC GG M+ F++II N GI TE++YPY+AV GTC +K A
Sbjct: 181 LSEQELVDCDKSYNMGCNGGLMDYGFQFIINNGGIDTEEDYPYRAVDGTCDQFRKNARVV 240
Query: 242 KISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVG 301
I+ YE+VP DE +L KAV+ QPVS+ I A F+ Y+ G+F G CGT LDH V VG
Sbjct: 241 SINGYEDVPEDDENSLKKAVANQPVSVAIEAGGRAFQLYESGVFTGHCGTNLDHGVVAVG 300
Query: 302 FGTTEDGANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYP 346
+G TE+G +YW ++NSWG WG+ GY+K+ R+ G CGI + +SYP
Sbjct: 301 YG-TENGVDYWTVRNSWGPKWGENGYIKLERNINATSGKCGIASMASYP 348
>gi|356563155|ref|XP_003549830.1| PREDICTED: vignain-like [Glycine max]
Length = 361
Score = 307 bits (787), Expect = 4e-81, Method: Compositional matrix adjust.
Identities = 157/346 (45%), Positives = 217/346 (62%), Gaps = 17/346 (4%)
Query: 12 KINTIPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIF 71
K+ + + ++L V+ + + E+ + +++E+W + H S + EK RF +F
Sbjct: 5 KVFFVALSFALVLRVAESFEFNEKDLESEEGLWDLYERWRSHHTVS-RSLDEKHNRFNVF 63
Query: 72 KENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHR-----STTSSTF 126
K N+ ++ +NK ++ YKL NRF+D+TN EFR++Y G K+ HR + TF
Sbjct: 64 KGNVMHVHSSNKM-DKPYKLKLNRFADMTNHEFRSIYAGSKVNH--HRMFRGTPRGNGTF 120
Query: 127 KYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQ 186
YQN+ VP+S+DWR K AVT +KDQ +CG CWAFS + AVEGI +I L+ LSEQ
Sbjct: 121 MYQNVDR--VPSSVDWRKKGAVTDVKDQGQCGSCWAFSTIVAVEGINQIKTHKLVPLSEQ 178
Query: 187 QLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQ-KAAAAKISN 245
+LVDC T N GC GG ME AFE+I Q GI T YPY+A GTC A++ A I
Sbjct: 179 ELVDCDTTQNQGCNGGLMESAFEFIKQ-YGITTASNYPYEAKDGTCDASKVNEPAVSIDG 237
Query: 246 YEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
+E VP +E ALLKAV+ QPVS+ I A +F+ Y EG+F G CGT LDH V IVG+GTT
Sbjct: 238 HENVPVNNEAALLKAVAHQPVSVAIEAGGIDFQFYSEGVFTGNCGTALDHGVAIVGYGTT 297
Query: 306 EDGANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYPL 347
+DG YW +KNSWG WG+ GY+++ R +GLCGI ++SYP+
Sbjct: 298 QDGTKYWTVKNSWGSEWGEKGYIRMKRSISVKKGLCGIAMEASYPI 343
>gi|537437|gb|AAC35211.1| cysteine proteinase [Hemerocallis hybrid cultivar]
Length = 359
Score = 307 bits (786), Expect = 6e-81, Method: Compositional matrix adjust.
Identities = 143/312 (45%), Positives = 205/312 (65%), Gaps = 6/312 (1%)
Query: 40 EQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDL 99
E+S+ ++EKW A H S +D + + RF +FKEN+++I + N++ + TYKL N+F D+
Sbjct: 34 EESLWSLYEKWRAHHAVS-RDLDDTDKRFNVFKENVKFIHEFNQKKDATYKLALNKFGDM 92
Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGC 159
TN EFR+ Y G K+ ++ D+PTS+DWR+K AVT +KDQ +CG
Sbjct: 93 TNQEFRSTYAGSKIDHHMTLRGVKDAGEFSYEKFHDLPTSVDWREKGAVTGVKDQGQCGS 152
Query: 160 CWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIAT 219
CWAFS V AVEGI +I L+ LSEQQLVDC T N+GC GG M+ AF++I N G+++
Sbjct: 153 CWAFSTVVAVEGINQIKTNELVSLSEQQLVDCDTK-NSGCNGGLMDYAFDFIKNNGGLSS 211
Query: 220 EDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKS 279
ED YPY A Q +C + +A I Y++VP +E AL+KAV+ QPVS+ I A F+
Sbjct: 212 EDSYPYLAEQKSCGSEANSAVVTIDGYQDVPRNNEAALMKAVANQPVSVAIEASGYAFQF 271
Query: 280 YKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILR----DEG 335
Y +G+F+G CGT+LDH V VG+G +DG YW++KNSWG+ WG++GY+++ R G
Sbjct: 272 YSQGVFSGHCGTELDHGVAAVGYGVDDDGKKYWIVKNSWGEGWGESGYIRMERGIKDKRG 331
Query: 336 LCGIGTQSSYPL 347
CGI ++SYP+
Sbjct: 332 KCGIAMEASYPI 343
>gi|50355623|dbj|BAD29960.1| cysteine protease [Daucus carota]
Length = 460
Score = 307 bits (786), Expect = 6e-81, Method: Compositional matrix adjust.
Identities = 145/326 (44%), Positives = 214/326 (65%), Gaps = 12/326 (3%)
Query: 32 VVSSRSTH-----EQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGN 86
+++ TH + ++ +E W+ +HG+SY EKE RF+IFK+N YI++ N +
Sbjct: 24 IITYDQTHAVGSTDDVIMAAYESWLVKHGKSYNALGEKEQRFQIFKDNFLYIDEQNAAKD 83
Query: 87 RTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKK 146
R++KLG NRF+DLTN+E+R+ YTG + S + + + +Y +L+ +P S+DWR+
Sbjct: 84 RSFKLGLNRFADLTNEEYRSKYTGIRTKD-SRKKVSGKSQRYASLAGESLPESVDWREHG 142
Query: 147 AVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEK 206
AV +KDQ +CG CWAFS ++AVEGI +I+ LI LSEQ+LVDC + N GC GG M+
Sbjct: 143 AVASVKDQGQCGSCWAFSTISAVEGINQIATGKLITLSEQELVDCDRSYNEGCNGGLMDD 202
Query: 207 AFEYIIQNQGIATEDEYPYQAVQGTCSAAQK-AAAAKISNYEEVPSGDEQALLKAVSMQP 265
AF++II N GI ++ +YPY G C +K A I +YE+VP DE+AL KA + QP
Sbjct: 203 AFQFIINNGGIDSDADYPYTGRDGQCDQYRKNAKVVTIDSYEDVPEYDEKALQKAAANQP 262
Query: 266 VSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDA 325
+S+ I A +F+ Y GIF G CGT LDH V +VG+G TE+G +YW+++NSWG WG+
Sbjct: 263 ISVAIEASGRDFQFYDSGIFTGKCGTDLDHGVVVVGYG-TENGKDYWIVRNSWGADWGEK 321
Query: 326 GYMKILR----DEGLCGIGTQSSYPL 347
GY+++ R G+CGI ++ SYP+
Sbjct: 322 GYLRMERGISSKAGICGITSEPSYPV 347
>gi|1345573|emb|CAA40073.1| endopeptidase (EP-C1) [Phaseolus vulgaris]
Length = 361
Score = 306 bits (785), Expect = 6e-81, Method: Compositional matrix adjust.
Identities = 156/341 (45%), Positives = 210/341 (61%), Gaps = 14/341 (4%)
Query: 19 FIIIILLVSCASQVVSSRSTH------EQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFK 72
+ ++L S V +S H E+S+ +++E+W + H S + EK RF +FK
Sbjct: 5 LLWVVLSFSLVLGVANSFDFHDKDLASEESLWDLYERWRSHHTVS-RSLGEKHKRFNVFK 63
Query: 73 ENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPS-HRSTTSSTFKYQNL 131
NL ++ NK ++ YKL N+F+D+TN EFR+ Y G K+ R T +
Sbjct: 64 ANLMHVHNTNKM-DKPYKLKLNKFADMTNHEFRSTYAGSKVNHHRMFRGTPHENGAFMYE 122
Query: 132 SMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDC 191
+ VP S+DWR K AVT +KDQ +CG CWAFS V AVEGI +I L+ LSEQ+LVDC
Sbjct: 123 KVVSVPPSVDWRKKGAVTDVKDQGQCGSCWAFSTVVAVEGINQIKTNKLVALSEQELVDC 182
Query: 192 STNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQ-KAAAAKISNYEEVP 250
N GC GG ME AFE+I Q GI TE YPY+A +GTC A++ A I +E VP
Sbjct: 183 DKEENQGCNGGLMESAFEFIKQKGGITTESNYPYKAQEGTCDASKVNDLAVSIDGHENVP 242
Query: 251 SGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGAN 310
+ DE ALLKAV+ QPVS+ I A ++F+ Y EG+F G C T L+H V IVG+GTT DG N
Sbjct: 243 ANDEDALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGDCSTDLNHGVAIVGYGTTVDGTN 302
Query: 311 YWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYPL 347
YW+++NSWG WG+ GY+++ R+ EGLCGI SYP+
Sbjct: 303 YWIVRNSWGPEWGEHGYIRMQRNISKKEGLCGIAMLPSYPI 343
>gi|255540425|ref|XP_002511277.1| cysteine protease, putative [Ricinus communis]
gi|46395620|sp|O65039.1|CYSEP_RICCO RecName: Full=Vignain; AltName: Full=Cysteine endopeptidase; Flags:
Precursor
gi|2944446|gb|AAC62396.1| cysteine endopeptidase precursor [Ricinus communis]
gi|223550392|gb|EEF51879.1| cysteine protease, putative [Ricinus communis]
Length = 360
Score = 306 bits (785), Expect = 7e-81, Method: Compositional matrix adjust.
Identities = 153/312 (49%), Positives = 203/312 (65%), Gaps = 16/312 (5%)
Query: 46 MHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFR 105
++E+W + H S + EK+ RF +FK N ++ ANK ++ YKL N+F+D+TN EFR
Sbjct: 37 LYERWRSHHTVS-RSLHEKQKRFNVFKHNAMHVHNANKM-DKPYKLKLNKFADMTNHEFR 94
Query: 106 ALYTGYKMPSPSHR-----STTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCC 160
Y+G K+ HR + TF Y+ + VP S+DWR K AVT +KDQ +CG C
Sbjct: 95 NTYSGSKVKH--HRMFRGGPRGNGTFMYEKVDT--VPASVDWRKKGAVTSVKDQGQCGSC 150
Query: 161 WAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATE 220
WAFS + AVEGI +I L+ LSEQ+LVDC T+ N GC GG M+ AFE+I Q GI TE
Sbjct: 151 WAFSTIVAVEGINQIKTNKLVSLSEQELVDCDTDQNQGCNGGLMDYAFEFIKQRGGITTE 210
Query: 221 DEYPYQAVQGTCSAAQK-AAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKS 279
YPY+A GTC +++ A A I +E VP DE ALLKAV+ QPVS+ I A ++F+
Sbjct: 211 ANYPYEAYDGTCDVSKENAPAVSIDGHENVPENDENALLKAVANQPVSVAIDAGGSDFQF 270
Query: 280 YKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILR----DEG 335
Y EG+F G CGT+LDH V IVG+GTT DG YW +KNSWG WG+ GY+++ R EG
Sbjct: 271 YSEGVFTGSCGTELDHGVAIVGYGTTIDGTKYWTVKNSWGPEWGEKGYIRMERGISDKEG 330
Query: 336 LCGIGTQSSYPL 347
LCGI ++SYP+
Sbjct: 331 LCGIAMEASYPI 342
>gi|194352752|emb|CAQ00104.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
Length = 351
Score = 306 bits (784), Expect = 8e-81, Method: Compositional matrix adjust.
Identities = 152/317 (47%), Positives = 218/317 (68%), Gaps = 11/317 (3%)
Query: 37 STHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRF 96
S+H++ +VE+ EKW+A+H ++Y EK RF++FK+NL+ I++ N+E +Y LG N F
Sbjct: 35 SSHDR-LVELFEKWLAKHQKAYASFEEKLHRFEVFKDNLKLIDEINRE-VTSYWLGLNEF 92
Query: 97 SDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQE 156
+DLT+DEF+ Y G + P R ++S +F+Y+N++ D+P ++DWR K AVT +K+Q +
Sbjct: 93 ADLTHDEFKTTYLG--LSPPPARRSSSRSFRYENVAAHDLPKAVDWRKKGAVTDVKNQGQ 150
Query: 157 CGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQG 216
CG CWAFS VAAVEGI I NL LSEQ+L+DCS +GN+GC GG M+ AF YI + G
Sbjct: 151 CGSCWAFSTVAAVEGINAIVTGNLTALSEQELIDCSVDGNSGCNGGMMDYAFSYIASSGG 210
Query: 217 IATEDEYPYQAVQGTCSAAQK--AAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYT 274
+ TE+ YPY +G+C +K + A IS YE+VP+ DEQAL+KA++ QPVS+ I A
Sbjct: 211 LHTEEAYPYLMEEGSCGDGKKSESEAVSISGYEDVPTKDEQALIKALAHQPVSVAIEASG 270
Query: 275 TEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTE-DGANYWLIKNSWGDTWGDAGYMKILR- 332
F+ Y G+F+G CG QLDH V VG+G+ + G +Y ++KNSWG WG+ GY+++ R
Sbjct: 271 RHFQFYSGGVFDGPCGAQLDHGVAAVGYGSDKGKGHDYIIVKNSWGGKWGEKGYIRMKRG 330
Query: 333 ---DEGLCGIGTQSSYP 346
EGLCGI +SYP
Sbjct: 331 TGKSEGLCGINKMASYP 347
>gi|89274062|dbj|BAE80740.1| cysteine proteinase [Platycodon grandiflorus]
Length = 462
Score = 306 bits (784), Expect = 9e-81, Method: Compositional matrix adjust.
Identities = 156/344 (45%), Positives = 212/344 (61%), Gaps = 18/344 (5%)
Query: 18 MFIIIILLVSCASQVVSSRSTH--------EQSVVEMHEKWMAQHGRSYKDELEKEMRFK 69
+F + + + +++ +TH + V+ M+E W+ +HG+SY EKE RF+
Sbjct: 13 LFALFVASSALDMSIINYDATHASKSSWRTDDEVMAMYESWLVKHGKSYNALGEKEKRFQ 72
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQ 129
IFK+NL +I++ N E N +YK+G NRF+DLTN+E+R+ Y G K P S +Y
Sbjct: 73 IFKDNLRFIDEHNAEENLSYKVGLNRFADLTNEEYRSTYLGAK-SKPKLSKVKSD--RYA 129
Query: 130 NLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLV 189
+P S+DWR K AV PIKDQ CG CWAFS V AVEGI +I LI LSEQ+LV
Sbjct: 130 PRVGDSLPESVDWRAKGAVAPIKDQGSCGSCWAFSTVNAVEGINQIVTGELITLSEQELV 189
Query: 190 DCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQK-AAAAKISNYEE 248
DC + N GC GG M+ FE+II N GI T+ +YPY C +K A I +YE+
Sbjct: 190 DCDKSYNEGCDGGLMDYGFEFIINNGGIDTDKDYPYLGRDARCDQYRKNAKVVTIDSYED 249
Query: 249 VPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDG 308
VP +E+AL KAV+ QPVS+GI F+ Y GIF G CGT LDH V +VG+G TE G
Sbjct: 250 VPVNNEEALKKAVASQPVSVGIEGGGRAFQFYDSGIFTGKCGTALDHGVNVVGYG-TEKG 308
Query: 309 ANYWLIKNSWGDTWGDAGYMKILRD-----EGLCGIGTQSSYPL 347
+YW+++NSWG +WG+AGY+++ R+ G CGI + SYPL
Sbjct: 309 KDYWIVRNSWGSSWGEAGYIRMERNLAGTSVGKCGIAMEPSYPL 352
>gi|359483514|ref|XP_003632971.1| PREDICTED: LOW QUALITY PROTEIN: oryzain beta chain-like [Vitis
vinifera]
Length = 340
Score = 306 bits (784), Expect = 9e-81, Method: Compositional matrix adjust.
Identities = 151/339 (44%), Positives = 227/339 (66%), Gaps = 9/339 (2%)
Query: 16 IPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENL 75
+ M + I L AS+ +SR HE S+ E HE+WMA++ R+YKD+ E+E RF +FK+N+
Sbjct: 5 VCMTLHIYYLEHRASEA-TSRPLHEASMYERHEQWMARYSRNYKDDAEEERRFXMFKDNV 63
Query: 76 EYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD 135
++I+ + GN KLG N +D+T++EFRA +K+P + +++F++QN+ T
Sbjct: 64 DFIQTFDTAGNMPNKLGVNALADMTHEEFRASGNTFKIPPNLGLRSETTSFRHQNV--TR 121
Query: 136 VPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNG 195
+P+++DWR K+ VT IK+Q +CG CWAFSAVAA+EGI K+ + I LSEQ+LVDC G
Sbjct: 122 IPSTMDWRKKRTVTHIKNQLQCGGCWAFSAVAAMEGIAKLQTSKSISLSEQELVDCDIFG 181
Query: 196 NN-GCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAA-AAKISNYEEVPSGD 253
+N GC GG M+ AF++IIQN+G+ +E Y Y+ V+G C+ ++++ AA+I++YE +P
Sbjct: 182 SNIGCEGGCMDDAFKFIIQNRGLNSEARYLYKGVEGHCNKKKESSRAARINDYENMPEFS 241
Query: 254 EQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWL 313
E+ALLK V+ QP+S+ I A + F+ Y+ GI G LD+ VT G+G + DG +WL
Sbjct: 242 EKALLKVVAHQPISVAIDAGGSAFQFYEIGIITXESGNDLDYGVTTDGYGRSADGKKHWL 301
Query: 314 IKNSWGDTWGDAGYMKILR----DEGLCGIGTQSSYPLA 348
+KNSWG WG+ GY ++ R GLCG Q+SYP A
Sbjct: 302 VKNSWGTDWGENGYTRMERGVKATTGLCGFTMQASYPTA 340
>gi|224131910|ref|XP_002328138.1| predicted protein [Populus trichocarpa]
gi|222837653|gb|EEE76018.1| predicted protein [Populus trichocarpa]
Length = 349
Score = 306 bits (784), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 147/309 (47%), Positives = 209/309 (67%), Gaps = 11/309 (3%)
Query: 43 VVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTND 102
+VE+ E W++ HG++Y EK RF++FKENL++I++ NKE +Y LG N F+DL+++
Sbjct: 43 LVELFESWISGHGKAYNSLEEKLHRFEVFKENLKHIDQRNKEVT-SYWLGLNEFADLSHE 101
Query: 103 EFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWA 162
EF++ + G P R +S F Y+++ D+P S+DWR K AVTP+K+Q CG CWA
Sbjct: 102 EFKSKFLGLYPEFP--RKKSSEDFSYRDV--VDLPKSIDWRKKGAVTPVKNQGSCGSCWA 157
Query: 163 FSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDE 222
FS VAAVEGI +I NL LSEQQL+DC T+ NNGC GG M+ AFE+I+ N G+ E++
Sbjct: 158 FSTVAAVEGINQIVAGNLTSLSEQQLIDCDTSFNNGCNGGLMDYAFEFIVNNGGLHKEED 217
Query: 223 YPYQAVQGTCSAA-QKAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYK 281
YPY +GTC ++ IS Y +VP DEQ+LLKA++ QP+S+ I A +F+ Y
Sbjct: 218 YPYLMEEGTCDEKREEMEVVTISGYHDVPRNDEQSLLKALAHQPLSVAIDASGRDFQFYS 277
Query: 282 EGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD----EGLC 337
G+F+G CGT LDH V VG+G++ G +Y ++KNSWG WG+ GY+++ R+ EGLC
Sbjct: 278 GGVFSGPCGTDLDHGVAAVGYGSSS-GIDYIIVKNSWGPKWGERGYLRMKRNTGKPEGLC 336
Query: 338 GIGTQSSYP 346
GI +SYP
Sbjct: 337 GINKMASYP 345
>gi|302764466|ref|XP_002965654.1| hypothetical protein SELMODRAFT_230713 [Selaginella moellendorffii]
gi|300166468|gb|EFJ33074.1| hypothetical protein SELMODRAFT_230713 [Selaginella moellendorffii]
Length = 345
Score = 306 bits (783), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 145/307 (47%), Positives = 201/307 (65%), Gaps = 12/307 (3%)
Query: 45 EMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEF 104
++++KW+ +HG++Y E + RF+IFKEN+ YI N N ++ LG N+F+DLTN EF
Sbjct: 36 QVYQKWIQEHGKAYNSAHEYKKRFQIFKENVNYINSHNARRNNSHSLGLNKFADLTNSEF 95
Query: 105 RALYTG-YKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAF 163
R LY G + P+P H + + D TS+DWR K VT IKDQ +CG CWAF
Sbjct: 96 RGLYVGRLQRPAPFHEVGDIAL-------VADTATSVDWRKKGGVTEIKDQGDCGSCWAF 148
Query: 164 SAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEY 223
SAVAAVEG+T +S L+ LSEQ+LVDC T N GC GG M+ AF+Y+I+N GI ++ Y
Sbjct: 149 SAVAAVEGLTFLSTGTLVSLSEQELVDCDTTVNQGCDGGIMDYAFQYMIRNGGITSQSNY 208
Query: 224 PYQAVQGTCSAAQ-KAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKE 282
PY+A++G C + K AA I+ ++ +P E+ LL+AV+ QPVS+ I A +F+ Y
Sbjct: 209 PYRALRGACDKDKVKYHAATINGFQAIPPQSEELLLRAVANQPVSVAIEAGGQDFQLYSS 268
Query: 283 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD---EGLCGI 339
G+F G CG+ LDH V IVG+GT G YWL+KNSWG WG++GY+++ R G+CGI
Sbjct: 269 GVFTGECGSNLDHGVAIVGYGTDAGGRQYWLVKNSWGSGWGESGYVRMERQGPGAGVCGI 328
Query: 340 GTQSSYP 346
+SYP
Sbjct: 329 NLDASYP 335
>gi|109119897|dbj|BAE96008.1| cysteine proteinase [Triticum aestivum]
Length = 377
Score = 305 bits (782), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 145/321 (45%), Positives = 206/321 (64%), Gaps = 15/321 (4%)
Query: 40 EQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDL 99
E+++ +++E+W H R + EK RF FK N+ +I NK G+R Y+L NRF D+
Sbjct: 39 EEALWDLYERWQTAH-RVPRHHAEKHRRFGTFKSNVHFIHSHNKRGDRPYRLRLNRFGDM 97
Query: 100 TNDEFRALYTGYKMPSPSHRSTTSST-----FKYQNLSMTDVPTSLDWRDKKAVTPIKDQ 154
+ EFRA + G ++ S R ++ F Y ++++D+P S+DWR K AVT +K+Q
Sbjct: 98 SQAEFRATFAGSRV-SDRRRDGPATPPSVPGFMYAAVNVSDLPRSVDWRQKGAVTGVKNQ 156
Query: 155 QECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQN 214
+CG CWAFS V +VEGI I L+ LSEQ+L+DC T N+GC GG M+ AFEYI +N
Sbjct: 157 GKCGSCWAFSTVVSVEGINAIRTGKLVSLSEQELIDCDTADNDGCEGGLMDNAFEYIKKN 216
Query: 215 QGIATEDEYPYQAVQGTCSAAQKAAAA----KISNYEEVPSGDEQALLKAVSMQPVSIGI 270
G+ TE YPY+A GTC AA+ A ++ I +++VP+ E+AL KAV+ QPVS+GI
Sbjct: 217 GGLTTEAAYPYRAANGTCKAAKVAKSSPMVVHIDGHQDVPANSEEALAKAVANQPVSVGI 276
Query: 271 AAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKI 330
A F Y EG+F G CGT+LDH V +VG+G EDG YW +KNSWG +WG+ GY+++
Sbjct: 277 DASGKAFMFYSEGVFTGECGTELDHGVAVVGYGVAEDGKAYWTVKNSWGPSWGEKGYIRV 336
Query: 331 LRDE----GLCGIGTQSSYPL 347
+D GLCGI ++SY +
Sbjct: 337 EKDSGAEGGLCGIAMEASYAV 357
>gi|242077600|ref|XP_002448736.1| hypothetical protein SORBIDRAFT_06g032320 [Sorghum bicolor]
gi|241939919|gb|EES13064.1| hypothetical protein SORBIDRAFT_06g032320 [Sorghum bicolor]
Length = 467
Score = 305 bits (782), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 147/315 (46%), Positives = 214/315 (67%), Gaps = 11/315 (3%)
Query: 40 EQSVVEMHEKWMAQHGRSYKDEL-EKEMRFKIFKENLEYIEKAN-KEGNRTYKLGTNRFS 97
E V M+E W+ +HGR + L E + RF++F +NL +++ N + G ++LG N+F+
Sbjct: 49 EAEVRAMYELWLVEHGRRVSNVLGEHDSRFRVFWDNLRFVDAHNERAGEHGFRLGMNQFA 108
Query: 98 DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQEC 157
DLTNDEFRA Y G ++P+ RS + Y++ ++P S+DWR+K AV P+K+Q +C
Sbjct: 109 DLTNDEFRAAYLGARIPAA--RSGNAVGEMYRHDGAEELPESVDWREKGAVAPVKNQGQC 166
Query: 158 GCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTN-GNNGCGGGTMEKAFEYIIQNQG 216
G CWAFSAV++VE I +I ++ LSEQ+LV+CST+ GN+GC GG M+ AF +II+N G
Sbjct: 167 GSCWAFSAVSSVESINQIVTGEMVTLSEQELVECSTDGGNSGCNGGLMDAAFNFIIKNGG 226
Query: 217 IATEDEYPYQAVQGTCSAAQK-AAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTT 275
I TED+YPY+AV G C ++ A I +E+VP DE++L KAV+ QPVS+ I A
Sbjct: 227 IDTEDDYPYKAVDGKCDINRRNAKVVSIDAFEDVPENDEKSLQKAVAHQPVSVAIEAGGR 286
Query: 276 EFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD-- 333
+F+ YK G+F+G C T LDH V VG+G TE+G +YW+++NSWG WG+AGY+++ R+
Sbjct: 287 QFQLYKSGVFSGSCTTNLDHGVVAVGYG-TENGKDYWIVRNSWGPKWGEAGYIRMERNIN 345
Query: 334 --EGLCGIGTQSSYP 346
G CGI +SYP
Sbjct: 346 ATTGKCGIAMMASYP 360
>gi|18141283|gb|AAL60579.1|AF454957_1 senescence-associated cysteine protease [Brassica oleracea]
Length = 460
Score = 305 bits (782), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 152/357 (42%), Positives = 225/357 (63%), Gaps = 24/357 (6%)
Query: 9 GSFKINTIPMFIIIILLVSCASQVVSSRSTH---------EQSVVEMHEKWMAQHGRSYK 59
GS K+ + + ++I + + ++S H + V ++E WM +HG+ +
Sbjct: 2 GSVKVTILLLAMMIGVSYAADMSIISYDEKHHITAENERSDAEVARIYEAWMEKHGKKAQ 61
Query: 60 DE----LEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPS 115
EK+ RF+IFK+NL +I++ N + N +YKLG RF+DLTN+E+R++Y G K
Sbjct: 62 SNGLVGEEKDQRFEIFKDNLRFIDEHNNK-NLSYKLGLTRFADLTNEEYRSIYLGAK--- 117
Query: 116 PSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKI 175
S + ++ +YQ +P S+DWR + AV +KDQ CG CWAFS + AVEGI KI
Sbjct: 118 -SKKRVLKTSDRYQPRVGDAIPDSVDWRKEGAVAAVKDQGSCGSCWAFSTIGAVEGINKI 176
Query: 176 SGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAA 235
+LI LSEQ+LVDC T+ N GC GG M+ AFE+II+N GI TE++YPY+A G C
Sbjct: 177 VTGDLISLSEQELVDCDTSYNQGCNGGLMDYAFEFIIKNGGIDTEEDYPYKAADGRCDQT 236
Query: 236 QK-AAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLD 294
+K A I YE+VP +E AL K ++ QP+S+ I A F+ Y G+F+G+CGT+LD
Sbjct: 237 RKNAKVVTIDAYEDVPENNEAALKKTLANQPISVAIEAGGRAFQLYSSGVFDGICGTELD 296
Query: 295 HAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYPL 347
H V VG+G TE+G +YW+++NSWG +WG++GY+K+ R+ G CGI ++SYP+
Sbjct: 297 HGVVAVGYG-TENGKDYWIVRNSWGGSWGESGYIKMARNIAEPTGKCGIAMEASYPI 352
>gi|1223922|gb|AAA92063.1| cysteinyl endopeptidase [Vigna radiata]
Length = 362
Score = 305 bits (782), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 152/316 (48%), Positives = 203/316 (64%), Gaps = 12/316 (3%)
Query: 40 EQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDL 99
E+S+ +++E+W + H S + EK RF +FKEN+ ++ NK ++ YKL N+F+D+
Sbjct: 33 EESLWDLYERWRSHHTVS-RSLTEKHKRFNVFKENVMHVHNTNKM-DKPYKLKLNKFADM 90
Query: 100 TNDEFRALYTGYKMPSPSHRSTT---SSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQE 156
TN EFR+ Y G K+ T + TF Y+ + VP S+DWR K AVT +KDQ +
Sbjct: 91 TNHEFRSTYAGSKVNHHKMFRGTQHGNGTFMYEKVG--SVPASVDWRKKGAVTDVKDQGQ 148
Query: 157 CGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQG 216
CG CWAFS V AVEGI +I L+ LSEQ+LVDC N GC GG ME AFE+I Q G
Sbjct: 149 CGSCWAFSTVVAVEGINQIKTDKLVSLSEQELVDCDKEENQGCNGGLMESAFEFIKQKGG 208
Query: 217 IATEDEYPYQAVQGTCSAAQ-KAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTT 275
I TE YPY A +GTC A++ A I +E VP DE ALLKAV+ QPVS+ I A +
Sbjct: 209 ITTESNYPYTAQEGTCDASKVNDLAVSIDGHENVPVNDENALLKAVANQPVSVAIDAGGS 268
Query: 276 EFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD-- 333
+F+ Y EG+ G C T L+H V IVG+GTT DG NYW+++NSWG WG+ GY+++ R+
Sbjct: 269 DFQFYSEGVLTGDCNTDLNHGVAIVGYGTTVDGTNYWIVRNSWGPEWGEQGYIRMQRNIS 328
Query: 334 --EGLCGIGTQSSYPL 347
EGLCGI +SYP+
Sbjct: 329 KKEGLCGIAMMASYPI 344
>gi|37780043|gb|AAP32194.1| cysteine protease 1 [Trifolium repens]
Length = 292
Score = 305 bits (782), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 151/295 (51%), Positives = 205/295 (69%), Gaps = 14/295 (4%)
Query: 63 EKEMRFKIFKENLEYIEKANKE-GNRTYKLGTNRFSDLTNDEFRALYTGYK--MPSPSHR 119
E+E R +IF +N+ YIE +N N+ YKL N+F+DLTN+EF A +K M S R
Sbjct: 3 EREKRLRIFNKNVNYIEASNSAVNNKLYKLSINKFADLTNEEFIASRNKFKGHMCSSIIR 62
Query: 120 STTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGAN 179
+TT FKY+N S +P+++DWR K AVTP+K+Q +CG CWAFSAVAA EGI ++S
Sbjct: 63 TTT---FKYENASA--IPSTVDWRKKGAVTPVKNQGQCGSCWAFSAVAATEGIHQLSTGK 117
Query: 180 LIQLSEQQLVDCSTNG-NNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKA 238
L+ LSEQ+L+DC T G + GC GG M+ AF++IIQN G++TE +YPY+ V GTC+A + +
Sbjct: 118 LVSLSEQELIDCDTKGVDQGCEGGLMDDAFKFIIQNHGLSTEVQYPYEGVDGTCNANKAS 177
Query: 239 A-AAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAV 297
A I+ YE+VP+ +E AL KAV+ QP+S+ I A ++F+ Y G+F G CGT+LDH V
Sbjct: 178 IHAVTITGYEDVPANNELALQKAVANQPISVAIDASGSDFQFYNSGVFTGSCGTELDHGV 237
Query: 298 TIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYPLA 348
T VG+G DG YWL+KNSWG WG+ GY+++ R EGLCGI Q+SYP A
Sbjct: 238 TAVGYGVGNDGTKYWLVKNSWGADWGEEGYIRMQRGIAAAEGLCGIAMQASYPTA 292
>gi|28192373|gb|AAK07730.1| CPR1-like cysteine proteinase [Nicotiana tabacum]
Length = 374
Score = 305 bits (782), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 152/318 (47%), Positives = 207/318 (65%), Gaps = 13/318 (4%)
Query: 38 THEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFS 97
+ E V +E W+A+HGR+Y EKE RF+IFK+NL +IE+ N GNRTYK+G N+F+
Sbjct: 41 SDEDQVKNRYEMWLAEHGRAYNALGEKEKRFEIFKDNLRFIEEHNNSGNRTYKVGLNQFA 100
Query: 98 DLTNDEFRALYTGYKMPSPSHRSTTSS---TFKYQNLSMTDVPTSLDWRDKKAVTPIKDQ 154
DLTN+E+R +Y G K S + R S + +Y + +P S+DWR + AV PIK+Q
Sbjct: 101 DLTNEEYRTMYLGTK--SDARRRFVKSKNPSQRYASRPNELMPHSVDWRKRGAVAPIKNQ 158
Query: 155 QECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQN 214
CG CWAFS VAAV GI +I +I LSEQ+LVDC N+GC GG M+ AFE+II N
Sbjct: 159 GSCGSCWAFSTVAAVGGINQIVTGEMITLSEQELVDCDRVQNSGCNGGLMDYAFEFIISN 218
Query: 215 QGIATEDEYPYQAVQGTCSAAQKA-AAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAY 273
G+ TE YPY+ V+G C +K I YE+VP +E+AL KAV+ QPV + I A
Sbjct: 219 GGMDTEKHYPYRGVEGRCDPVRKNYKVVSIDGYEDVPR-NERALQKAVAHQPVCVAIEAS 277
Query: 274 TTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD 333
F+ Y G+F G CG ++DH V +VG+G +EDG +YW+++NSWG WG+ GY+K+ R+
Sbjct: 278 GRAFQLYSSGVFTGECGEEVDHGVVVVGYG-SEDGVDYWIVRNSWGTKWGENGYVKMERN 336
Query: 334 E-----GLCGIGTQSSYP 346
G CGI T++SYP
Sbjct: 337 VKKSHLGKCGIMTEASYP 354
>gi|50355615|dbj|BAD29956.1| cysteine protease [Daucus carota]
Length = 423
Score = 305 bits (782), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 147/302 (48%), Positives = 207/302 (68%), Gaps = 10/302 (3%)
Query: 51 MAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTG 110
+ +H ++Y KE RF+IFK+NL +I++ NK N+++KLG N+F+DL+N+E+++++ G
Sbjct: 11 LVKHHKNYNALGAKEKRFEIFKDNLRFIDEHNKGVNQSFKLGLNKFADLSNEEYKSMFLG 70
Query: 111 YKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVE 170
+M + S FKY ++P S+DWR+K AV P+KDQ +CG CWAFS VAAVE
Sbjct: 71 GRMVR-DRKGFESDRFKYG--VGDELPQSVDWREKGAVAPVKDQGQCGSCWAFSTVAAVE 127
Query: 171 GITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQG 230
GI +I+ +LI LSEQ+LVDC N GC GG M+ AFE+I++N GI TED+YPY+ V G
Sbjct: 128 GINQIATGDLISLSEQELVDCDKGFNQGCNGGFMDYAFEFIVKNGGIDTEDDYPYKGVDG 187
Query: 231 TCSAAQK-AAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVC 289
C +K A I+ +E+VP DE++L KAV+ QPVS+ I A F+ Y+ GIFNG+C
Sbjct: 188 QCDQNRKNAKVVTINGFEDVPQNDEKSLKKAVAHQPVSVAIEAGGRAFQLYESGIFNGLC 247
Query: 290 GTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILR-----DEGLCGIGTQSS 344
GT LDH V VG+G TEDG +YW+++NSWG WG+ GY+++ R + G CGI Q S
Sbjct: 248 GTDLDHGVVAVGYG-TEDGKDYWIVRNSWGPNWGENGYIRLERNVASTNTGKCGIAMQPS 306
Query: 345 YP 346
YP
Sbjct: 307 YP 308
>gi|118124|sp|P25250.1|CYSP2_HORVU RecName: Full=Cysteine proteinase EP-B 2; Flags: Precursor
gi|1146118|gb|AAA85036.1| cysteine proteinase EPB2 precursor [Hordeum vulgare subsp. vulgare]
Length = 373
Score = 305 bits (781), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 142/317 (44%), Positives = 202/317 (63%), Gaps = 11/317 (3%)
Query: 40 EQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDL 99
E+++ +++E+W + H R + EK RF FK N +I NK G+ Y+L NRF D+
Sbjct: 39 EEALWDLYERWQSAH-RVRRHHAEKHRRFGTFKSNAHFIHSHNKRGDHPYRLHLNRFGDM 97
Query: 100 TNDEFRALYTG-YKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECG 158
EFRA + G + +PS + + F Y L+++D+P S+DWR K AVT +KDQ +CG
Sbjct: 98 DQAEFRATFVGDLRRDTPS-KPPSVPGFMYAALNVSDLPPSVDWRQKGAVTGVKDQGKCG 156
Query: 159 CCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIA 218
CWAFS V +VEGI I +L+ LSEQ+L+DC T N+GC GG M+ AFEYI N G+
Sbjct: 157 SCWAFSTVVSVEGINAIRTGSLVSLSEQELIDCDTADNDGCQGGLMDNAFEYIKNNGGLI 216
Query: 219 TEDEYPYQAVQGTCSAAQKA----AAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYT 274
TE YPY+A +GTC+ A+ A I +++VP+ E+ L +AV+ QPVS+ + A
Sbjct: 217 TEAAYPYRAARGTCNVARAAQNSPVVVHIDGHQDVPANSEEDLARAVANQPVSVAVEASG 276
Query: 275 TEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRDE 334
F Y EG+F G CGT+LDH V +VG+G EDG YW +KNSWG +WG+ GY+++ +D
Sbjct: 277 KAFMFYSEGVFTGECGTELDHGVAVVGYGVAEDGKAYWTVKNSWGPSWGEQGYIRVEKDS 336
Query: 335 ----GLCGIGTQSSYPL 347
GLCGI ++SYP+
Sbjct: 337 GASGGLCGIAMEASYPV 353
>gi|148927394|gb|ABR19828.1| cysteine proteinase [Elaeis guineensis]
Length = 469
Score = 305 bits (781), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 149/317 (47%), Positives = 209/317 (65%), Gaps = 10/317 (3%)
Query: 40 EQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKAN---KEGNRTYKLGTNRF 96
+ V +++ W AQH RSY E E R +IF++NL +I++ N G +++LG RF
Sbjct: 40 DDEVHRLYQAWKAQHARSYNALDEDEQRLEIFRDNLRFIDQHNAAANAGKYSFRLGLTRF 99
Query: 97 SDLTNDEFRALYTGYKMP-SPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQ 155
+DLTN+E+R+ Y G + S R++T + +Y+ S D+P S+DWRDK AV +KDQ
Sbjct: 100 ADLTNEEYRSTYLGVRTAGSRRRRNSTVGSNRYRFRSSDDLPDSIDWRDKGAVVDVKDQG 159
Query: 156 ECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQ 215
CG CWAFS +AAVEGI I +LI LSEQ+LVDC T N GC GG M+ AFE+II N
Sbjct: 160 SCGSCWAFSTIAAVEGINHIVTGDLISLSEQELVDCDTYYNQGCNGGLMDYAFEFIISNG 219
Query: 216 GIATEDEYPYQAVQGTCSAAQK-AAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYT 274
GI T+++YPY G+C +K A I +YE+VP DE++L KAV+ QPVS+ I A
Sbjct: 220 GIDTDEDYPYTGRDGSCDQYRKNAHVVTIDSYEDVPINDEKSLQKAVANQPVSVAIEAGG 279
Query: 275 TEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD- 333
F+ Y+ GIF G CGT+LDH VT +G+G +E+G YW++KNSWG WG++GY+++ R+
Sbjct: 280 RAFQLYESGIFTGYCGTELDHGVTAIGYG-SENGKYYWIVKNSWGSDWGESGYIRMERNI 338
Query: 334 ---EGLCGIGTQSSYPL 347
G CGI ++SYP+
Sbjct: 339 NSATGKCGIAMEASYPI 355
>gi|449525012|ref|XP_004169515.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
Length = 459
Score = 305 bits (781), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 156/343 (45%), Positives = 222/343 (64%), Gaps = 12/343 (3%)
Query: 10 SFKINTIPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKD-ELEKEMRF 68
S I + F+ I L + S ++ R+ E V+ ++++W A+HG+ + + E E RF
Sbjct: 6 SSPIMALLFFLFIALSAASPSSIIPQRTDDE--VMALYDQWRAKHGKLHNNLGAEPENRF 63
Query: 69 KIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKY 128
IFK+NL++I++ N + N Y+LG N F+DLTN+E+R+ Y G K S S R+ TS+ +Y
Sbjct: 64 HIFKDNLKFIDEINAQ-NLPYRLGLNVFADLTNEEYRSRYLGGKFASGSRRNRTSN--RY 120
Query: 129 QNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQL 188
D+P S+DWR K AV P+KDQ CG CWAFS VA+VE I +I +LI LSEQ+L
Sbjct: 121 LPRLGDDLPDSIDWRAKGAVAPVKDQGSCGSCWAFSTVASVEAINQIVTGDLIALSEQEL 180
Query: 189 VDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQK-AAAAKISNYE 247
VDC + N GC GG M+ AFE+II+N G+ TE++YPY +C +K A I +YE
Sbjct: 181 VDCDRSYNEGCNGGLMDYAFEFIIENGGLDTEEDYPYYGFDSSCIQYKKNAKVVAIDSYE 240
Query: 248 EVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTED 307
+VP +E+AL KAVS Q VS+ I F+ Y+ GIF G CGT LDH V +VG+G +E
Sbjct: 241 DVPVNNEKALQKAVSKQVVSVAIEGGGRSFQLYQSGIFTGRCGTDLDHGVNVVGYG-SEG 299
Query: 308 GANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYP 346
G +YW+++NSWG +WG++GY+K+ R+ GLCGI + SYP
Sbjct: 300 GVDYWIVRNSWGGSWGESGYVKMQRNIASPTGLCGIAMEPSYP 342
>gi|168017893|ref|XP_001761481.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162687165|gb|EDQ73549.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 471
Score = 305 bits (781), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 144/315 (45%), Positives = 205/315 (65%), Gaps = 11/315 (3%)
Query: 40 EQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDL 99
+ ++++ +W+ +H R Y EK+ RF+IFK+NL YI NK+ ++Y LG N+FSDL
Sbjct: 45 DDGMLDVFHQWLERHSRVYHSLSEKQRRFQIFKDNLHYIHNHNKQ-EKSYWLGLNKFSDL 103
Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGC 159
T+DEFRALY G + +H F Y+++ ++ +DWR K AV+ +KDQ CG
Sbjct: 104 THDEFRALYLGIRPAGRAHGLRNGDRFIYEDVVAEEM---VDWRKKGAVSDVKDQGSCGS 160
Query: 160 CWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIAT 219
CWAFSA+ +VEG+ I LI LSEQ+LVDC N GC GG M+ AF++II+N GI T
Sbjct: 161 CWAFSAIGSVEGVNAIVTGELISLSEQELVDCDRGQNQGCNGGLMDYAFDFIIKNGGIDT 220
Query: 220 EDEYPYQAVQGTCSAAQK--AAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEF 277
E++YPY+A G C A+K + I +Y++VP+ E +LLKAVS PVS+ I A +F
Sbjct: 221 EEDYPYKATDGQCDEARKETSKVVVIDDYQDVPTKSESSLLKAVSKNPVSVAIEAGGRDF 280
Query: 278 KSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILR----- 332
+ Y+ G+F G CGT LDH V VG+GT +DG NYW++KNSWG +WG+ GY+++ R
Sbjct: 281 QHYQGGVFTGPCGTDLDHGVLAVGYGTDDDGVNYWIVKNSWGPSWGEKGYIRMERMGSNS 340
Query: 333 DEGLCGIGTQSSYPL 347
G CGI + S+P+
Sbjct: 341 TSGKCGINIEPSFPI 355
>gi|194352754|emb|CAQ00105.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
gi|326513690|dbj|BAJ87864.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326514532|dbj|BAJ96253.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 463
Score = 305 bits (780), Expect = 3e-80, Method: Compositional matrix adjust.
Identities = 151/333 (45%), Positives = 211/333 (63%), Gaps = 12/333 (3%)
Query: 22 IILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKA 81
+ L + +VS E+ V M+ +WMA+HG +Y E+E RF+ F++NL YI++
Sbjct: 18 VSLAAAADMSIVSYGERSEEEVRRMYAEWMAEHGSTYNAIGEEERRFEAFRDNLRYIDQH 77
Query: 82 N---KEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPT 138
N G +++LG NRF+DLTN+E+R+ Y G + R ++ +YQ ++P
Sbjct: 78 NAAADAGVHSFRLGLNRFADLTNEEYRSTYLGARTKPDRERKLSA---RYQAADNDELPE 134
Query: 139 SLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNG 198
S+DWR K AV +KDQ CG CWAFSA+AAVEGI +I ++I LSEQ+LVDC T+ N G
Sbjct: 135 SVDWRKKGAVGAVKDQGGCGSCWAFSAIAAVEGINQIVTGDMIPLSEQELVDCDTSYNQG 194
Query: 199 CGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQK-AAAAKISNYEEVPSGDEQAL 257
C GG M+ AFE+II N GI +E++YPY+ C A +K A I YE+VP E++L
Sbjct: 195 CNGGLMDYAFEFIINNGGIDSEEDYPYKERDNRCDANKKNAKVVTIDGYEDVPVNSEKSL 254
Query: 258 LKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNS 317
KAV+ QP+S+ I A F+ YK GIF G CGT LDH V VG+G TE+G +YWL++NS
Sbjct: 255 QKAVANQPISVAIEAGGRAFQLYKSGIFTGTCGTALDHGVAAVGYG-TENGKDYWLVRNS 313
Query: 318 WGDTWGDAGYMKILRD----EGLCGIGTQSSYP 346
WG WG+ GY+++ R+ G CGI + SYP
Sbjct: 314 WGSVWGEDGYIRMERNIKASSGKCGIAVEPSYP 346
>gi|146215984|gb|ABQ10194.1| actinidin Act2c [Actinidia arguta]
Length = 378
Score = 305 bits (780), Expect = 3e-80, Method: Compositional matrix adjust.
Identities = 155/335 (46%), Positives = 209/335 (62%), Gaps = 11/335 (3%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEY 77
+F +L++S A +V+S V +M+E W+ + G+SY EKEMRF+IFK+NL
Sbjct: 13 LFFSTLLILSSALDIVNSAQRTNDQVRDMYESWLVEQGKSYNSLDEKEMRFEIFKDNLRI 72
Query: 78 IEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
I+ N + NR++ LG NRF+DLT++E+R+ Y G+K P + + K ++ +P
Sbjct: 73 IDDHNADANRSFSLGLNRFADLTDEEYRSTYLGFK-SGPKAKVSNRYVPKVGDV----LP 127
Query: 138 TSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCS-TNGN 196
+DWR AV +K+Q C CWAFSAVAAVEGI KI NL+ LSEQ+LVDC T
Sbjct: 128 NYVDWRTVGAVVGVKNQGLCSSCWAFSAVAAVEGINKIMTGNLLSLSEQELVDCGRTQST 187
Query: 197 NGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSA-AQKAAAAKISNYEEVPSGDEQ 255
GC G M AF++II N GI TED YPY A G C+ Q I +YE VPS +E
Sbjct: 188 RGCNRGYMTDAFQFIINNGGINTEDNYPYTAQDGQCNRYLQNQKYVTIDDYENVPSNNEW 247
Query: 256 ALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIK 315
AL AV+ QPVS+G+ + +FK Y GIF CGT +DH VTIVG+G TE G +YW++K
Sbjct: 248 ALQNAVAHQPVSVGLESEGGKFKLYTSGIFTQYCGTAIDHGVTIVGYG-TERGLDYWIVK 306
Query: 316 NSWGDTWGDAGYMKILRD---EGLCGIGTQSSYPL 347
NSWG WG+ GY++I R+ G CGI +SYP+
Sbjct: 307 NSWGTNWGENGYIRIQRNIGGAGKCGIARMASYPV 341
>gi|2463586|dbj|BAA22545.1| FB22 precursor [Ananas comosus]
Length = 340
Score = 305 bits (780), Expect = 3e-80, Method: Compositional matrix adjust.
Identities = 145/332 (43%), Positives = 209/332 (62%), Gaps = 8/332 (2%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEY 77
+F+ + L V AS +SR +++ E+WMA++GR YKD EK RF+IFK N+ +
Sbjct: 8 VFLFLFLCVMWASPSAASRDEPSDPMMKRFEEWMAEYGRVYKDNDEKMRRFQIFKNNVNH 67
Query: 78 IEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
IE N +Y LG N+F+D+TN+EF YTG +P R S + +++++ V
Sbjct: 68 IETFNNRNGNSYTLGINKFTDMTNNEFVTQYTGVSLPLNFKREPVVS---FDDVNISAVG 124
Query: 138 TSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNN 197
S+DWRD AVT +KDQ CG CWAFSA+A VEGI KI L+ LSEQ+++DC+ + N
Sbjct: 125 QSIDWRDYGAVTEVKDQNPCGSCWAFSAIATVEGIYKIVTGYLVSLSEQEVLDCAVS--N 182
Query: 198 GCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQAL 257
GC GG ++ A+++II N G+A+E +YPYQA +G C+A +A I+ Y V S DE ++
Sbjct: 183 GCDGGFVDNAYDFIISNNGVASEADYPYQAYEGDCTANSWPNSAYITGYSYVRSNDESSM 242
Query: 258 LKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNS 317
AV QP++ I A F+ Y G+F+G CGT L+HA+TI+G+G G YW++KNS
Sbjct: 243 KYAVWNQPIAAAIDASGDNFQYYNGGVFSGPCGTSLNHAITIIGYGQDSSGTQYWIVKNS 302
Query: 318 WGDTWGDAGYMKILR---DEGLCGIGTQSSYP 346
WG +WG+ GY+++ R GLCGI YP
Sbjct: 303 WGSSWGERGYVRMARGVSSSGLCGIAMDPLYP 334
>gi|3980198|emb|CAA46863.1| thiolprotease [Pisum sativum]
Length = 464
Score = 305 bits (780), Expect = 3e-80, Method: Compositional matrix adjust.
Identities = 150/347 (43%), Positives = 222/347 (63%), Gaps = 19/347 (5%)
Query: 18 MFIIIILLVSCASQ--VVSSRSTH--------EQSVVEMHEKWMAQHGRSYKDELEKEMR 67
+FI + +S A ++S TH V+ M+E+W+ +HG++Y EKE R
Sbjct: 8 LFITLTFTLSLALDMCIISYDKTHPDKSTPRTNDQVLTMYEEWLVKHGKNYNALGEKEKR 67
Query: 68 FKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKM-PSPSHRSTTSSTF 126
F+IFK+NL +I++ N + N +++LG NRF+DLTN+E+R + G ++ P+ +R S T
Sbjct: 68 FEIFKDNLGFIDEHNSK-NLSFRLGLNRFADLTNEEYRTRFLGTRINPNRRNRKVNSQTN 126
Query: 127 KYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQ 186
+Y +P S+DWR + AV +KDQ CG CWAFSA+AAVEG+ K++ +LI LSEQ
Sbjct: 127 RYATRVGDKLPESVDWRKEGAVVGVKDQGSCGSCWAFSAIAAVEGVNKLATGDLISLSEQ 186
Query: 187 QLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQK-AAAAKISN 245
+LVDC T+ N GC GG M+ AFE+II + E++YPY+A+ G C +K A I
Sbjct: 187 ELVDCDTSYNEGCNGGLMDYAFEFIINMVALTPEEDYPYRAIDGRCDQNRKNAKVVSIDQ 246
Query: 246 YEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
YE+VP+ DE AL KAV+ Q +++ + EF+ Y G+F G CGT LDH V VG+G T
Sbjct: 247 YEDVPAYDEGALKKAVANQVIAVAVEGGGREFQLYDSGVFTGRCGTALDHGVAAVGYG-T 305
Query: 306 EDGANYWLIKNSWGDTWGDAGYMKILRD-----EGLCGIGTQSSYPL 347
E+G +YW+++NSWG +WG+AGY+++ R+ G CGI + SYP+
Sbjct: 306 ENGKDYWIVRNSWGGSWGEAGYIRLERNLATSKSGKCGIAIEPSYPI 352
>gi|414584879|tpg|DAA35450.1| TPA: cysteine protease 1 [Zea mays]
Length = 522
Score = 305 bits (780), Expect = 3e-80, Method: Compositional matrix adjust.
Identities = 142/314 (45%), Positives = 207/314 (65%), Gaps = 8/314 (2%)
Query: 40 EQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKAN-KEGNRTYKLGTNRFSD 98
E ++E W+A+HGR+Y E++ RF++F +NL +++ N + ++LG N+F+D
Sbjct: 102 EPEARTLYELWLAEHGRAYNALGERDRRFRVFWDNLRFVDAHNERAAEHGFRLGMNQFAD 161
Query: 99 LTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECG 158
LTNDEFRA Y G ++P+ R T ++P S+DWR+K AV P+K+Q +CG
Sbjct: 162 LTNDEFRAAYLGARIPASRRRGTAVGERYRHGGGAEELPESVDWREKGAVAPVKNQGQCG 221
Query: 159 CCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTN-GNNGCGGGTMEKAFEYIIQNQGI 217
CWAFSAV++VE + +I ++ LSEQ+LV+CST+ GN+GC GG M+ AF++II+N GI
Sbjct: 222 SCWAFSAVSSVESVNQIVTGEMVTLSEQELVECSTDGGNSGCNGGLMDAAFDFIIKNGGI 281
Query: 218 ATEDEYPYQAVQGTCSA-AQKAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTE 276
TE +YPY+AV G C + A I +E+VP DE++L KAV+ QPVS+ I A E
Sbjct: 282 DTEGDYPYKAVDGKCDINRENAKVVSIDGFEDVPENDEKSLQKAVAHQPVSVAIEAGGRE 341
Query: 277 FKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD--- 333
F+ YK G+F G C T LDH V VG+G TE+G +YW+++NSWG WG+ GY+++ R+
Sbjct: 342 FQLYKAGVFTGTCTTNLDHGVVAVGYG-TENGKDYWIVRNSWGAKWGEDGYIRMERNVNA 400
Query: 334 -EGLCGIGTQSSYP 346
G CGI +SYP
Sbjct: 401 TTGKCGIAMMASYP 414
>gi|297830592|ref|XP_002883178.1| hypothetical protein ARALYDRAFT_479457 [Arabidopsis lyrata subsp.
lyrata]
gi|297329018|gb|EFH59437.1| hypothetical protein ARALYDRAFT_479457 [Arabidopsis lyrata subsp.
lyrata]
Length = 452
Score = 304 bits (779), Expect = 3e-80, Method: Compositional matrix adjust.
Identities = 152/340 (44%), Positives = 216/340 (63%), Gaps = 12/340 (3%)
Query: 15 TIPMFIIIILLVSCASQVVSSRST--HEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFK 72
T+ + I +LL+S + V++ T +E M+E+W+ ++ ++Y EKE RF+IF
Sbjct: 9 TLALLIFSMLLISLSLGSVTAADTTRNEAEARRMYEQWLVENRKNYNGLGEKETRFEIFT 68
Query: 73 ENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
+NL+YIE+ N N+T+++G RF+DLTNDEFRA+Y KM +Y
Sbjct: 69 DNLKYIEEHNSVPNQTFEVGLTRFADLTNDEFRAIYLRSKM---ERTRVPVKGERYLYKV 125
Query: 133 MTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCS 192
+P +DWR K AV P+KDQ CG CWAFSA+ AVEGI +I LI LSEQ+LVDC
Sbjct: 126 GDTLPDQIDWRAKGAVNPVKDQGNCGSCWAFSAIGAVEGINQIKTGELISLSEQELVDCD 185
Query: 193 TNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAV-QGTCSAAQK-AAAAKISNYEEVP 250
T+ N GCGGG M+ AF++II+N GI TE++YPY A C++ +K + I YE+VP
Sbjct: 186 TSYNGGCGGGLMDYAFKFIIENGGIDTEEDYPYTATDDNICNSDKKNSRVVTIDGYEDVP 245
Query: 251 SGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGAN 310
DE++L KA++ QP+S+ I A F+ YK G+F G CGT LDH V VG+G +E G +
Sbjct: 246 QNDEKSLKKALANQPISVAIEAGGRAFQLYKSGVFTGTCGTSLDHGVVAVGYG-SEGGQD 304
Query: 311 YWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYP 346
YW+++NSWG WG++GY K+ R+ G CG+ +SYP
Sbjct: 305 YWIVRNSWGSNWGESGYFKLERNIKESSGKCGVAMMASYP 344
>gi|226501480|ref|NP_001150266.1| cysteine protease 1 precursor [Zea mays]
gi|195637948|gb|ACG38442.1| cysteine protease 1 precursor [Zea mays]
Length = 462
Score = 304 bits (779), Expect = 3e-80, Method: Compositional matrix adjust.
Identities = 142/314 (45%), Positives = 208/314 (66%), Gaps = 8/314 (2%)
Query: 40 EQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKAN-KEGNRTYKLGTNRFSD 98
E ++E W+A+HGR+Y E++ RF++F +NL +++ N + ++LG N+F+D
Sbjct: 42 EPEARTLYELWLAEHGRAYNALGERDRRFRVFWDNLRFVDAHNERAAEHGFRLGMNQFAD 101
Query: 99 LTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECG 158
LTNDEFRA Y G ++P+ R T ++P S+DWR+K AV P+K+Q +CG
Sbjct: 102 LTNDEFRAAYLGARIPAARRRGTAVGERYRHGGGAEELPESVDWREKGAVAPVKNQGQCG 161
Query: 159 CCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTN-GNNGCGGGTMEKAFEYIIQNQGI 217
CWAFSAV++VE + +I ++ LSEQ+LV+CST+ GN+GC GG M+ AF++II+N GI
Sbjct: 162 SCWAFSAVSSVESVNQIVTGEMVTLSEQELVECSTDGGNSGCNGGLMDAAFDFIIKNGGI 221
Query: 218 ATEDEYPYQAVQGTCSA-AQKAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTE 276
TE +YPY+AV G C + A I +E+VP DE++L KAV+ QPVS+ I A E
Sbjct: 222 DTEGDYPYKAVDGKCDINRENAKVVSIDGFEDVPENDEKSLQKAVAHQPVSVAIEAGGRE 281
Query: 277 FKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD--- 333
F+ YK G+F+G C T LDH V VG+G TE+G +YW+++NSWG WG+ GY+++ R+
Sbjct: 282 FQLYKAGVFSGTCTTNLDHGVVAVGYG-TENGKDYWIVRNSWGAKWGEDGYIRMERNVNA 340
Query: 334 -EGLCGIGTQSSYP 346
G CGI +SYP
Sbjct: 341 TTGKCGIAMMASYP 354
>gi|238006338|gb|ACR34204.1| unknown [Zea mays]
Length = 465
Score = 304 bits (779), Expect = 4e-80, Method: Compositional matrix adjust.
Identities = 142/314 (45%), Positives = 207/314 (65%), Gaps = 8/314 (2%)
Query: 40 EQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKAN-KEGNRTYKLGTNRFSD 98
E ++E W+A+HGR+Y E++ RF++F +NL +++ N + ++LG N+F+D
Sbjct: 45 EPEARTLYELWLAEHGRAYNALGERDRRFRVFWDNLRFVDAHNERAAEHGFRLGMNQFAD 104
Query: 99 LTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECG 158
LTNDEFRA Y G ++P+ R T ++P S+DWR+K AV P+K+Q +CG
Sbjct: 105 LTNDEFRAAYLGARIPASRRRGTAVGERYRHGGGAEELPESVDWREKGAVAPVKNQGQCG 164
Query: 159 CCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTN-GNNGCGGGTMEKAFEYIIQNQGI 217
CWAFSAV++VE + +I ++ LSEQ+LV+CST+ GN+GC GG M+ AF++II+N GI
Sbjct: 165 SCWAFSAVSSVESVNQIVTGEMVTLSEQELVECSTDGGNSGCNGGLMDAAFDFIIKNGGI 224
Query: 218 ATEDEYPYQAVQGTCSA-AQKAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTE 276
TE +YPY+AV G C + A I +E+VP DE++L KAV+ QPVS+ I A E
Sbjct: 225 DTEGDYPYKAVDGKCDINRENAKVVSIDGFEDVPENDEKSLQKAVAHQPVSVAIEAGGRE 284
Query: 277 FKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD--- 333
F+ YK G+F G C T LDH V VG+G TE+G +YW+++NSWG WG+ GY+++ R+
Sbjct: 285 FQLYKAGVFTGTCTTNLDHGVVAVGYG-TENGKDYWIVRNSWGAKWGEDGYIRMERNVNA 343
Query: 334 -EGLCGIGTQSSYP 346
G CGI +SYP
Sbjct: 344 TTGKCGIAMMASYP 357
>gi|118120|sp|P25249.1|CYSP1_HORVU RecName: Full=Cysteine proteinase EP-B 1; Flags: Precursor
gi|1146116|gb|AAA85035.1| cysteine proteinase EPB1 precursor [Hordeum vulgare subsp. vulgare]
Length = 371
Score = 304 bits (779), Expect = 4e-80, Method: Compositional matrix adjust.
Identities = 140/316 (44%), Positives = 198/316 (62%), Gaps = 9/316 (2%)
Query: 40 EQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDL 99
E+++ +++E+W + H R + EK RF FK N +I NK G+ Y+L NRF D+
Sbjct: 39 EEALWDLYERWQSAH-RVRRHHAEKHRRFGTFKSNAHFIHSHNKRGDHPYRLHLNRFGDM 97
Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGC 159
EFRA + G + + F Y L+++D+P S+DWR K AVT +KDQ +CG
Sbjct: 98 DQAEFRATFVGDLRRDTPAKPPSVPGFMYAALNVSDLPPSVDWRQKGAVTGVKDQGKCGS 157
Query: 160 CWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIAT 219
CWAFS V +VEGI I +L+ LSEQ+L+DC T N+GC GG M+ AFEYI N G+ T
Sbjct: 158 CWAFSTVVSVEGINAIRTGSLVSLSEQELIDCDTADNDGCQGGLMDNAFEYIKNNGGLIT 217
Query: 220 EDEYPYQAVQGTCSAAQKA----AAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTT 275
E YPY+A +GTC+ A+ A I +++VP+ E+ L +AV+ QPVS+ + A
Sbjct: 218 EAAYPYRAARGTCNVARAAQNSPVVVHIDGHQDVPANSEEDLARAVANQPVSVAVEASGK 277
Query: 276 EFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRDE- 334
F Y EG+F G CGT+LDH V +VG+G EDG YW +KNSWG +WG+ GY+++ +D
Sbjct: 278 AFMFYSEGVFTGDCGTELDHGVAVVGYGVAEDGKAYWTVKNSWGPSWGEQGYIRVEKDSG 337
Query: 335 ---GLCGIGTQSSYPL 347
GLCGI ++SYP+
Sbjct: 338 ASGGLCGIAMEASYPV 353
>gi|147772785|emb|CAN62838.1| hypothetical protein VITISV_003391 [Vitis vinifera]
Length = 298
Score = 304 bits (778), Expect = 4e-80, Method: Compositional matrix adjust.
Identities = 160/334 (47%), Positives = 209/334 (62%), Gaps = 55/334 (16%)
Query: 21 IIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEK 80
++ +L + ASQ +SRS HE S+ E HE WMA++GR YKD EKE RFKIFK+N+
Sbjct: 14 LLFILAAWASQA-TSRSLHEASMYERHEDWMARYGRMYKDANEKEKRFKIFKDNV----- 67
Query: 81 ANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSL 140
++TFKY+N+ T VP+++
Sbjct: 68 ----------------------------------------AQATTFKYENV--TAVPSTI 85
Query: 141 DWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNG-NNGC 199
DWR K AVTPIKDQQ+CG CWAFSAVAA EGIT+I+ LI LSEQ+LVDC T G N GC
Sbjct: 86 DWRKKGAVTPIKDQQQCGSCWAFSAVAATEGITQITTGKLISLSEQELVDCDTGGENQGC 145
Query: 200 GGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKA-AAAKISNYEEVPSGDEQALL 258
GG + AF +I + G+A+E YPY+ GTC++ ++A AAKI YE+VP+ +E+AL
Sbjct: 146 SGGLXDDAFRFIXIH-GLASEATYPYEGDDGTCNSKKEAHPAAKIKGYEDVPANNEKALQ 204
Query: 259 KAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSW 318
KAV+ QPV++ I A EF+ Y G+F G CGT+LDH V VG+G +DG YWL+KNSW
Sbjct: 205 KAVAHQPVAVAIDAGGFEFQFYTSGVFTGQCGTELDHGVAAVGYGIGDDGMXYWLVKNSW 264
Query: 319 GDTWGDAGYMKILRD----EGLCGIGTQSSYPLA 348
G WG+ GY+++ RD EGLCGI Q+SYP A
Sbjct: 265 GTGWGEEGYIRMQRDVTAKEGLCGIAMQASYPTA 298
>gi|18422289|ref|NP_568620.1| Granulin repeat cysteine protease family protein [Arabidopsis
thaliana]
gi|9757832|dbj|BAB08269.1| cysteine protease component of protease-inhibitor complex
[Arabidopsis thaliana]
gi|17065064|gb|AAL32686.1| cysteine protease component of protease-inhibitor complex
[Arabidopsis thaliana]
gi|21387153|gb|AAM47980.1| cysteine protease component of protease-inhibitor complex
[Arabidopsis thaliana]
gi|332007522|gb|AED94905.1| Granulin repeat cysteine protease family protein [Arabidopsis
thaliana]
Length = 463
Score = 304 bits (778), Expect = 5e-80, Method: Compositional matrix adjust.
Identities = 149/326 (45%), Positives = 215/326 (65%), Gaps = 15/326 (4%)
Query: 31 QVVSSRSTHEQSVVEMHEKWMAQHGRSYKDE----LEKEMRFKIFKENLEYIEKANKEGN 86
+ + S + V ++E WM +HG+ ++ EK+ RF+IFK+NL +I++ N + N
Sbjct: 34 HITTETSRSDSEVERIYEAWMVEHGKKKMNQNGLGAEKDQRFEIFKDNLRFIDEHNTK-N 92
Query: 87 RTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKK 146
+YKLG RF+DLTN+E+R++Y G K P+ R +S +YQ +P S+DWR +
Sbjct: 93 LSYKLGLTRFADLTNEEYRSMYLGAK---PTKRVLKTSD-RYQARVGDALPDSVDWRKEG 148
Query: 147 AVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEK 206
AV +KDQ CG CWAFS + AVEGI KI +LI LSEQ+LVDC T+ N GC GG M+
Sbjct: 149 AVADVKDQGSCGSCWAFSTIGAVEGINKIVTGDLISLSEQELVDCDTSYNQGCNGGLMDY 208
Query: 207 AFEYIIQNQGIATEDEYPYQAVQGTCSAAQK-AAAAKISNYEEVPSGDEQALLKAVSMQP 265
AFE+II+N GI TE +YPY+A G C +K A I +YE+VP E +L KA++ QP
Sbjct: 209 AFEFIIKNGGIDTEADYPYKAADGRCDQNRKNAKVVTIDSYEDVPENSEASLKKALAHQP 268
Query: 266 VSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDA 325
+S+ I A F+ Y G+F+G+CGT+LDH V VG+G TE+G +YW+++NSWG+ WG++
Sbjct: 269 ISVAIEAGGRAFQLYSSGVFDGLCGTELDHGVVAVGYG-TENGKDYWIVRNSWGNRWGES 327
Query: 326 GYMKILRD----EGLCGIGTQSSYPL 347
GY+K+ R+ G CGI ++SYP+
Sbjct: 328 GYIKMARNIEAPTGKCGIAMEASYPI 353
>gi|297602242|ref|NP_001052232.2| Os04g0203500 [Oryza sativa Japonica Group]
gi|255675217|dbj|BAF14146.2| Os04g0203500 [Oryza sativa Japonica Group]
Length = 336
Score = 304 bits (778), Expect = 5e-80, Method: Compositional matrix adjust.
Identities = 146/335 (43%), Positives = 212/335 (63%), Gaps = 10/335 (2%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEY 77
+F I+ L C++ + + + + ++ HE+WMAQ+GR YKD+ EK RF++FK N+ +
Sbjct: 8 LFAILGCLCLCSAVLAARELSDDAAMAARHERWMAQYGRMYKDDAEKARRFEVFKANVAF 67
Query: 78 IEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
IE N GN + LG N+F+DLTNDEFR+ T + R T F+ +N+++ +P
Sbjct: 68 IESFN-AGNHKFWLGVNQFADLTNDEFRSTKTNKGFIPSTTRVPTG--FRNENVNIDALP 124
Query: 138 TSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNN 197
++DWR K VTPIKDQ +CGCCWAFSAVAA+EGI K+S LI S + + T +
Sbjct: 125 ATMDWRTKGVVTPIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLISHSLNKSL--LTVMSM 182
Query: 198 GCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQAL 257
GC GG M+ AF++II+N G+ TE YPY AV + + A+ I YE+VP+ +E AL
Sbjct: 183 GCEGGLMDDAFKFIIKNGGLTTESNYPYAAVDDKFKSVSNSVAS-IKGYEDVPANNEAAL 241
Query: 258 LKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNS 317
+KAV+ QPVS+ + F+ YK G+ G CGT LDH + +G+G DG YWL+KNS
Sbjct: 242 MKAVANQPVSVAVDGGDMTFQFYKGGVMTGSCGTDLDHGIVAIGYGKASDGTKYWLLKNS 301
Query: 318 WGDTWGDAGYMKILRD----EGLCGIGTQSSYPLA 348
WG TWG+ G++++ +D G+CG+ + SYP A
Sbjct: 302 WGMTWGENGFLRMEKDISDKRGMCGLAMEPSYPTA 336
>gi|357166359|ref|XP_003580684.1| PREDICTED: oryzain alpha chain-like [Brachypodium distachyon]
Length = 456
Score = 303 bits (777), Expect = 5e-80, Method: Compositional matrix adjust.
Identities = 150/324 (46%), Positives = 210/324 (64%), Gaps = 12/324 (3%)
Query: 32 VVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKAN---KEGNRT 88
+VS E+ V M+ +WMA++GR+Y E+E RF++F++NL Y+++ N G +
Sbjct: 27 IVSYGERSEEEVRRMYVEWMAENGRTYNAIGEEERRFEVFRDNLRYVDQHNAAADAGLHS 86
Query: 89 YKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAV 148
++LG NRF+DLTN+E+R Y G + R + +YQ ++P S+DWR+K AV
Sbjct: 87 FRLGLNRFADLTNEEYRDTYLGVRTKPVRERRLSG---RYQAADNEELPESVDWREKGAV 143
Query: 149 TPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAF 208
+KDQ CG CWAFSA+AAVEGI +I ++I LSEQ+LVDC T+ N GC GG M+ AF
Sbjct: 144 AKVKDQGGCGSCWAFSAIAAVEGINQIVTGDMIALSEQELVDCDTSYNQGCNGGLMDYAF 203
Query: 209 EYIIQNQGIATEDEYPYQAVQGTCSAAQK-AAAAKISNYEEVPSGDEQALLKAVSMQPVS 267
E+II N GI +E++YPY+ C A +K A I YE+VP E +L KAV+ QP+S
Sbjct: 204 EFIINNGGIDSEEDYPYKERDNRCDANKKNAKVVTIDGYEDVPVNSELSLKKAVANQPIS 263
Query: 268 IGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGY 327
+ I A F+ YK GIF G CGT LDH VT VG+G +E+G +YW++KNSWG WG+ GY
Sbjct: 264 VAIEAGGRAFQLYKSGIFTGRCGTALDHGVTAVGYG-SENGKDYWIVKNSWGTVWGEDGY 322
Query: 328 MKILRD----EGLCGIGTQSSYPL 347
+++ R+ G CGI + SYPL
Sbjct: 323 VRLERNIKATSGKCGIAIEPSYPL 346
>gi|449438381|ref|XP_004136967.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
Length = 479
Score = 303 bits (777), Expect = 5e-80, Method: Compositional matrix adjust.
Identities = 147/313 (46%), Positives = 206/313 (65%), Gaps = 9/313 (2%)
Query: 40 EQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDL 99
++ V ++E W+ HG++Y EKE RF+IFK+NL +I++ N+E +RTYK+G RF+DL
Sbjct: 55 DEEVAALYESWLVHHGKAYNAIGEKERRFEIFKDNLRFIDEHNRE-SRTYKVGLTRFADL 113
Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGC 159
TN+E+RA + G + S R + + + +Y D+P +DWR K AV +KDQ +CG
Sbjct: 114 TNEEYRARFLGGRF-SRKPRLSAAKSGRYAAALGDDLPDDVDWRKKGAVATVKDQGQCGS 172
Query: 160 CWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIAT 219
CWAFS+VAAVEGI +I LI LSEQ+LVDC + N GC GG M+ AF++II N GI T
Sbjct: 173 CWAFSSVAAVEGINQIVTGELIPLSEQELVDCDKSFNMGCNGGLMDYAFQFIIGNGGIDT 232
Query: 220 EDEYPYQAVQGTCSAAQK-AAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFK 278
E++YPY+ C +K A I YE+VP DE +L KAV+ QPVS+ I A F+
Sbjct: 233 EEDYPYKGRDAACDPNRKNAKVVTIDGYEDVPENDESSLKKAVANQPVSVAIEAGGRAFQ 292
Query: 279 SYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD----- 333
Y+ G+F G CGT LDH V VG+G T++G +YW+++NSWG WG++GY+++ R+
Sbjct: 293 LYQSGVFTGRCGTDLDHGVVAVGYG-TDNGTDYWIVRNSWGKDWGESGYIRLERNVANIT 351
Query: 334 EGLCGIGTQSSYP 346
G CGI Q SYP
Sbjct: 352 TGKCGIAVQPSYP 364
>gi|445927|prf||1910332A Cys endopeptidase
Length = 362
Score = 303 bits (777), Expect = 6e-80, Method: Compositional matrix adjust.
Identities = 152/318 (47%), Positives = 204/318 (64%), Gaps = 16/318 (5%)
Query: 40 EQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDL 99
E+S+ +++E+W + H S + EK RF +FK N+ ++ NK ++ YKL N+F+D+
Sbjct: 33 EESLWDLYERWRSHHTVS-RSLGEKHKRFNVFKANVMHVHNTNKM-DKPYKLKLNKFADM 90
Query: 100 TNDEFRALYTG-----YKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQ 154
TN EFR+ Y G +KM S S TF Y+ + VP S+DWR K AVT +KDQ
Sbjct: 91 TNHEFRSTYAGSKVNHHKMFRGSQHG--SGTFMYEKVG--SVPASVDWRKKGAVTDVKDQ 146
Query: 155 QECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQN 214
+CG CWAFS + AVEGI +I L+ LSEQ+LVDC N GC GG ME AFE+I Q
Sbjct: 147 GQCGSCWAFSTIVAVEGINQIKTNKLVSLSEQELVDCDKEENQGCNGGLMESAFEFIKQK 206
Query: 215 QGIATEDEYPYQAVQGTCSAAQ-KAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAY 273
GI TE YPY+A +GTC ++ A I +E VP DE ALLKAV+ QPVS+ I A
Sbjct: 207 GGITTESNYPYKAQEGTCDESKVNDLAVSIDGHENVPVNDENALLKAVANQPVSVAIDAG 266
Query: 274 TTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD 333
++F+ Y EG+F G C T L+H V IVG+GTT DG NYW+++NSWG WG+ GY+++ R+
Sbjct: 267 GSDFQFYSEGVFTGDCNTDLNHGVAIVGYGTTVDGTNYWIVRNSWGPEWGEQGYIRMQRN 326
Query: 334 ----EGLCGIGTQSSYPL 347
EGLCGI +SYP+
Sbjct: 327 ISKKEGLCGIAMMASYPI 344
>gi|351721126|ref|NP_001237199.1| cysteine proteinase precursor [Glycine max]
gi|31559530|dbj|BAC77523.1| cysteine proteinase [Glycine max]
gi|31559532|dbj|BAC77524.1| cysteine proteinase [Glycine max]
Length = 362
Score = 303 bits (777), Expect = 6e-80, Method: Compositional matrix adjust.
Identities = 158/347 (45%), Positives = 214/347 (61%), Gaps = 26/347 (7%)
Query: 19 FIIIILLVSCASQVVSSRSTH------EQSVVEMHEKWMAQH--GRSYKDELEKEMRFKI 70
F+ ++L +S V +S H E+S+ +++E+W + H RS D K RF +
Sbjct: 6 FLWVVLSLSLVLGVANSFDFHDKDLESEESLWDLYERWRSHHTVSRSLGD---KHKRFNV 62
Query: 71 FKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHR-----STTSST 125
FK N+ ++ NK ++ YKL N+F+D+TN EFR+ Y G K+ HR + T
Sbjct: 63 FKANMMHVHNTNKM-DKPYKLKLNKFADMTNHEFRSTYAGSKVNH--HRMFRDMPRGNGT 119
Query: 126 FKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSE 185
F Y+ + VP S+DWR K AVT +KDQ CG CWAFS V AVEGI +I L+ LSE
Sbjct: 120 FMYEKVG--SVPASVDWRKKGAVTDVKDQGHCGSCWAFSTVVAVEGINQIKTNKLVSLSE 177
Query: 186 QQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQ-KAAAAKIS 244
Q+LVDC T N GC GG ME AF++I Q GI TE YPY A GTC A++ A I
Sbjct: 178 QELVDCDTEENAGCNGGLMESAFQFIKQKGGITTESYYPYTAQDGTCDASKANDLAVSID 237
Query: 245 NYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGT 304
+E VP DE ALLKAV+ QPVS+ I A ++F+ Y EG+F G C T+L+H V IVG+G
Sbjct: 238 GHENVPGNDENALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGDCSTELNHGVAIVGYGA 297
Query: 305 TEDGANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYPL 347
T DG +YW+++NSWG WG+ GY+++ R+ EGLCGI +SYP+
Sbjct: 298 TVDGTSYWIVRNSWGPEWGELGYIRMQRNISKKEGLCGIAMLASYPI 344
>gi|355344587|gb|AER60490.1| cysteine proteases [Gossypium hirsutum]
Length = 371
Score = 303 bits (777), Expect = 6e-80, Method: Compositional matrix adjust.
Identities = 146/321 (45%), Positives = 207/321 (64%), Gaps = 9/321 (2%)
Query: 34 SSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGT 93
SS + V+ +++ W+ QHG++Y E+E RF+IFK+NL +I++ N N TYKLG
Sbjct: 32 SSSWRSDDEVMGLYKSWVIQHGKAYNGIGEEEKRFEIFKDNLRFIDEHNSNNNTTYKLGL 91
Query: 94 NRFSDLTNDEFRALYTGYKMPSPSHRSTTSS--TFKYQNLSMTDVPTSLDWRDKKAVTPI 151
N+F+DLTN E+RA + G + P R S + +Y + + ++P S+DWRD AV+P+
Sbjct: 92 NKFADLTNQEYRAKFLGTRT-DPRRRLMKSKIPSSRYAHRAGDNLPDSVDWRDHGAVSPV 150
Query: 152 KDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYI 211
KDQ CG CWAFS +A VEGI KI L+ LSEQ+LVDC + + GC GG M+ AF++I
Sbjct: 151 KDQGSCGSCWAFSTIATVEGINKIVSGELVSLSEQELVDCDRSYDAGCNGGLMDYAFQFI 210
Query: 212 IQNQGIATEDEYPYQAVQGTCSAAQK-AAAAKISNYEEVPSGDEQALLKAVSMQPVSIGI 270
+ N GI TE +YPY C +K A I YE+VP+ +E AL KAV+ QPVSI I
Sbjct: 211 MDNGGIDTEKDYPYLGFNNQCDPTKKNAKVVSIDGYEDVPN-NENALKKAVAHQPVSIAI 269
Query: 271 AAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKI 330
A F+ Y+ G+FNG CG LDH V VG+GT ++G +YW+++NSWG WG+ GY+++
Sbjct: 270 EAGGRAFQLYESGVFNGECGLALDHGVVAVGYGTDDNGQDYWIVRNSWGSNWGENGYIRM 329
Query: 331 LR----DEGLCGIGTQSSYPL 347
R + G CGI ++SYP+
Sbjct: 330 ERNINANTGKCGIAMEASYPV 350
>gi|111073715|dbj|BAF02546.1| triticain alpha [Triticum aestivum]
gi|388890585|gb|AFK80346.1| cysteine endopeptidase EP alpha [Secale cereale x Triticum durum]
Length = 461
Score = 303 bits (776), Expect = 7e-80, Method: Compositional matrix adjust.
Identities = 149/323 (46%), Positives = 209/323 (64%), Gaps = 12/323 (3%)
Query: 32 VVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKAN---KEGNRT 88
+VS E+ V M+ +WM++H R+Y E+E RF++F++NL YI++ N G +
Sbjct: 26 IVSYGERSEEEVRRMYAEWMSEHRRTYNAIGEEERRFEVFRDNLRYIDQHNAAADAGLHS 85
Query: 89 YKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAV 148
++LG NRF+DLTN+E+R+ Y G + R ++ +YQ ++P ++DWR K AV
Sbjct: 86 FRLGLNRFADLTNEEYRSTYLGARTKPDRERKLSA---RYQADDNEELPETVDWRKKGAV 142
Query: 149 TPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAF 208
IKDQ CG CWAFSA+AAVEGI +I ++I LSEQ+LVDC T+ N GC GG M+ AF
Sbjct: 143 AAIKDQGGCGSCWAFSAIAAVEGINQIVTGDMIPLSEQELVDCDTSYNEGCNGGLMDYAF 202
Query: 209 EYIIQNQGIATEDEYPYQAVQGTCSAAQK-AAAAKISNYEEVPSGDEQALLKAVSMQPVS 267
E+II N GI +E++YPY+ C A +K A I YE+VP E++L KAV+ QP+S
Sbjct: 203 EFIINNGGIDSEEDYPYKERDNRCDANKKNAKVVTIDGYEDVPVNSEKSLQKAVANQPIS 262
Query: 268 IGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGY 327
+ I A F+ YK GIF G CGT LDH V VG+G TE+G +YWL++NSWG WG+ GY
Sbjct: 263 VAIEAGGRAFQLYKSGIFTGTCGTALDHGVAAVGYG-TENGKDYWLVRNSWGTVWGEDGY 321
Query: 328 MKILRD----EGLCGIGTQSSYP 346
+++ R+ G CGI + SYP
Sbjct: 322 IRMERNIKASSGKCGIAVEPSYP 344
>gi|172052260|gb|ACB70409.1| cysteine protease [Nicotiana tabacum]
Length = 361
Score = 303 bits (776), Expect = 8e-80, Method: Compositional matrix adjust.
Identities = 151/313 (48%), Positives = 203/313 (64%), Gaps = 16/313 (5%)
Query: 45 EMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEF 104
E++E+W + H S + EK+ RF +FK N+ Y+ NK+ ++ YKL N+F+D+TN EF
Sbjct: 36 ELYERWRSHHTVS-RSLDEKDKRFNVFKANVHYVHNFNKK-DKPYKLKLNKFADMTNHEF 93
Query: 105 RALYTGYKMPSPSHR-----STTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGC 159
R Y G K+ HR S + TF Y + VP ++DWR K AVTP+KDQ +CG
Sbjct: 94 RHHYAGSKIKH--HRTFLGASRANGTFMYAHED--SVPPTVDWRKKGAVTPVKDQGKCGS 149
Query: 160 CWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIAT 219
CWAFS V AVEGI +I L+ LSEQ+LVDC T+ N GC GG M+ AFE+I + GI T
Sbjct: 150 CWAFSTVVAVEGINQIKTNELVSLSEQELVDCDTSQNQGCNGGLMDMAFEFIKKKGGINT 209
Query: 220 EDEYPYQAVQGTCSAAQK-AAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFK 278
E+ YPY A G C ++ + I +E+VP DE +LLKAV+ QPVS+ I A ++F+
Sbjct: 210 EENYPYMAEGGECDIQKRNSPVVSIDGHEDVPPNDEGSLLKAVANQPVSVAIQASGSDFQ 269
Query: 279 SYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILR----DE 334
Y EG+F G CGT+LDH V IVG+GTT D YW++KNSWG WG+ GY+++ R +E
Sbjct: 270 FYSEGVFTGDCGTELDHGVAIVGYGTTLDRTKYWIVKNSWGPEWGEKGYIRMQREIDAEE 329
Query: 335 GLCGIGTQSSYPL 347
GLCGI Q SYP+
Sbjct: 330 GLCGIAMQPSYPI 342
>gi|168063167|ref|XP_001783545.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664932|gb|EDQ51634.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 461
Score = 303 bits (776), Expect = 9e-80, Method: Compositional matrix adjust.
Identities = 149/317 (47%), Positives = 205/317 (64%), Gaps = 15/317 (4%)
Query: 39 HEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSD 98
HE ++E W +HG++Y D + RF ++K+NL YI + E NRTY LG +F+D
Sbjct: 46 HENLLLEQFAAWAHKHGKAYHDAEQCLHRFAVWKDNLAYIRHS--ETNRTYSLGLTKFAD 103
Query: 99 LTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECG 158
LTN+EFR +YTG ++ S R+ + F+Y + ++ P S+DWR AVT +KDQ CG
Sbjct: 104 LTNEEFRRMYTGTRIDR-SRRAKRRTGFRYAD---SEAPESVDWRKNGAVTSVKDQGSCG 159
Query: 159 CCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIA 218
CWAFSAV +VEGI I + LSEQ+LVDC N GC GG M+ AF++IIQN GI
Sbjct: 160 SCWAFSAVGSVEGINAIRNGEAVSLSEQELVDCDLEYNQGCNGGLMDYAFDFIIQNGGID 219
Query: 219 TEDEYPYQAVQGTCSAAQK-AAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEF 277
TE +YPY+ G C ++K A I YE+VP DE+AL KAV+ QPVS+ I A +F
Sbjct: 220 TEKDYPYKGFDGRCDNSKKNAHVVTIDGYEDVPENDEEALKKAVAGQPVSVAIEAGGRDF 279
Query: 278 KSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD---- 333
+ Y +G+F+G CGT LDH V VG+G TEDG +YW++KNSWG+ WG++GY+++ R+
Sbjct: 280 QLYAQGVFSGECGTDLDHGVLAVGYG-TEDGVDYWIVKNSWGEYWGESGYLRMKRNMKDS 338
Query: 334 ---EGLCGIGTQSSYPL 347
GLCGI + SY +
Sbjct: 339 NDGPGLCGINIEPSYAV 355
>gi|297792329|ref|XP_002864049.1| hypothetical protein ARALYDRAFT_495086 [Arabidopsis lyrata subsp.
lyrata]
gi|297309884|gb|EFH40308.1| hypothetical protein ARALYDRAFT_495086 [Arabidopsis lyrata subsp.
lyrata]
Length = 361
Score = 303 bits (775), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 154/350 (44%), Positives = 218/350 (62%), Gaps = 26/350 (7%)
Query: 16 IPMFIIIILLVSCASQVVSSRSTHEQ------SVVEMHEKWMAQH--GRSYKDELEKEMR 67
+ FI++ L + + S HE+ S+ E++E+W + H RS + EK R
Sbjct: 1 MKRFIVLALCMLMVLETTKSLDFHEKDVESEDSLWELYERWKSHHTIARSLE---EKAKR 57
Query: 68 FKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHR-----STT 122
F +FK N+++I + NK+ N +YKL N+F D+T++EFR Y G + HR T
Sbjct: 58 FNVFKHNVKHIHETNKKEN-SYKLKLNKFGDMTSEEFRRTYAGSNIKH--HRMFQGERQT 114
Query: 123 SSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQ 182
+ +F Y N+ +PTS+DWR AVTP+K+Q +CG CWAFS V AVEGI +I L
Sbjct: 115 TKSFMYANVDT--LPTSVDWRKNGAVTPVKNQGQCGSCWAFSTVVAVEGINQIRTKKLTS 172
Query: 183 LSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTC-SAAQKAAAA 241
LSEQ+LVDC TN N GC GG M+ AFE+I + G+ +E YPY+A TC + + A
Sbjct: 173 LSEQELVDCDTNKNQGCNGGLMDLAFEFIKEKGGLTSELVYPYKASDETCDTNKENAPVV 232
Query: 242 KISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVG 301
I +E+VP E L+KAV+ QPVS+ I A ++F+ Y EG+F G CGT+L+H V +VG
Sbjct: 233 SIDGHEDVPKNSEVDLMKAVAHQPVSVAIDAGGSDFQFYSEGVFTGRCGTELNHGVAVVG 292
Query: 302 FGTTEDGANYWLIKNSWGDTWGDAGYMKILR----DEGLCGIGTQSSYPL 347
+GTT DG YW++KNSWG+ WG+ GY+++ R EGLCGI ++SYPL
Sbjct: 293 YGTTIDGTKYWIVKNSWGEEWGEKGYIRMQRGIRHKEGLCGIAMEASYPL 342
>gi|146215986|gb|ABQ10195.1| actinidin Act2d [Actinidia eriantha]
Length = 381
Score = 303 bits (775), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 154/335 (45%), Positives = 206/335 (61%), Gaps = 11/335 (3%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEY 77
+F +L++S A + +S V+ M+E W+ + G+SY EKEMRF+IFKENL
Sbjct: 15 LFFSTLLILSSALDIKNSVQRTNDQVMAMYESWLVEQGKSYNSLDEKEMRFEIFKENLRI 74
Query: 78 IEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
I+ N + NR+Y LG NRF+DLT++E+R+ Y G+K + S +Y +P
Sbjct: 75 IDDHNADANRSYSLGLNRFADLTDEEYRSTYLGFKSGPKAKVSN-----RYVPKVGVVLP 129
Query: 138 TSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCS-TNGN 196
+DWR AV +KDQ C CWAFSAVAAVEGI KI NLI LSEQ+LVDC T
Sbjct: 130 NYVDWRTVGAVVGVKDQGLCSSCWAFSAVAAVEGINKIVTGNLISLSEQELVDCGRTQRT 189
Query: 197 NGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAA-AKISNYEEVPSGDEQ 255
GC G M AF++II N GI TED YPY A G C +K I NYE++P+ +E
Sbjct: 190 RGCNRGYMNDAFQFIIDNGGINTEDNYPYTAQDGQCDWYRKNQRYVTIDNYEQLPANNEW 249
Query: 256 ALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIK 315
L AV+ QP+++G+ + +FK Y GI+ G CGT +DH VTIVG+G TE G +YW++K
Sbjct: 250 VLQNAVAYQPITVGLESEGGKFKLYTSGIYTGYCGTAIDHGVTIVGYG-TERGLDYWIVK 308
Query: 316 NSWGDTWGDAGYMKILRD---EGLCGIGTQSSYPL 347
NSWG WG+ GY++I R+ G CGI SYP+
Sbjct: 309 NSWGTNWGENGYIRIQRNIGGAGKCGIAMVPSYPV 343
>gi|118158|sp|P12412.1|CYSEP_VIGMU RecName: Full=Vignain; AltName: Full=Bean endopeptidase; AltName:
Full=Cysteine proteinase; AltName:
Full=Sulfhydryl-endopeptidase; Short=SH-EP; Contains:
RecName: Full=Vignain-1; Contains: RecName:
Full=Vignain-2; Flags: Precursor
gi|22062|emb|CAA33753.1| sulfhydryl-pre-endopeptidase (AA -20 to 342) [Vigna mungo]
gi|22066|emb|CAA36181.1| sulfhydryl-endopeptidase [Vigna mungo]
Length = 362
Score = 302 bits (774), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 152/318 (47%), Positives = 203/318 (63%), Gaps = 16/318 (5%)
Query: 40 EQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDL 99
E+S+ +++E+W + H S + EK RF +FK N+ ++ NK ++ YKL N+F+D+
Sbjct: 33 EESLWDLYERWRSHHTVS-RSLGEKHKRFNVFKANVMHVHNTNKM-DKPYKLKLNKFADM 90
Query: 100 TNDEFRALYTG-----YKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQ 154
TN EFR+ Y G +KM S S TF Y+ + VP S+DWR K AVT +KDQ
Sbjct: 91 TNHEFRSTYAGSKVNHHKMFRGSQHG--SGTFMYEKVG--SVPASVDWRKKGAVTDVKDQ 146
Query: 155 QECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQN 214
+CG CWAFS + AVEGI +I L+ LSEQ+LVDC N GC GG ME AFE+I Q
Sbjct: 147 GQCGSCWAFSTIVAVEGINQIKTNKLVSLSEQELVDCDKEENQGCNGGLMESAFEFIKQK 206
Query: 215 QGIATEDEYPYQAVQGTCSAAQ-KAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAY 273
GI TE YPY A +GTC ++ A I +E VP DE ALLKAV+ QPVS+ I A
Sbjct: 207 GGITTESNYPYTAQEGTCDESKVNDLAVSIDGHENVPVNDENALLKAVANQPVSVAIDAG 266
Query: 274 TTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD 333
++F+ Y EG+F G C T L+H V IVG+GTT DG NYW+++NSWG WG+ GY+++ R+
Sbjct: 267 GSDFQFYSEGVFTGDCNTDLNHGVAIVGYGTTVDGTNYWIVRNSWGPEWGEQGYIRMQRN 326
Query: 334 ----EGLCGIGTQSSYPL 347
EGLCGI +SYP+
Sbjct: 327 ISKKEGLCGIAMMASYPI 344
>gi|595986|gb|AAA79915.1| cysteine proteinase, partial [Dianthus caryophyllus]
Length = 427
Score = 302 bits (774), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 146/309 (47%), Positives = 207/309 (66%), Gaps = 12/309 (3%)
Query: 48 EKWMAQHGRSYKDELEKEMRFKIFKENLEYIEK----ANKEGNRTYKLGTNRFSDLTNDE 103
+ W+ +H ++Y EKE RF IF++NLE+I++ N G ++LG N+F+DLTNDE
Sbjct: 6 QSWLVKHRKNYNALGEKEKRFAIFRDNLEFIDQHNNNNNGGGGGEFELGLNKFADLTNDE 65
Query: 104 FRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAF 163
FR +Y G K P ++ + + +Y ++P S+DWR K AV+ +KDQ +CG CWAF
Sbjct: 66 FRRIYFGVKRP---EKAESVKSDRYAVKEGDELPESVDWRKKGAVSHVKDQGQCGSCWAF 122
Query: 164 SAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEY 223
SA+ AVEGI KI +LI LSEQ+LVDC T+ N+GC GG M+ AF +II N GI T+ +Y
Sbjct: 123 SAIGAVEGINKIVTGDLITLSEQELVDCDTSYNSGCDGGLMDYAFRFIINNGGIDTDKDY 182
Query: 224 PYQAVQGTCSAAQK-AAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKE 282
PY+A G+C + +K A I E+VP+ +E+AL KAV+ QPV + I A +F+ YK
Sbjct: 183 PYKATDGSCDSNRKNAKVVTIDGLEDVPANNEKALQKAVAHQPVRLAIEAGGRDFQLYKS 242
Query: 283 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD----EGLCG 338
G+F G CGT LDH V VG+GTT+DG +YW+++NSWGD WG+ GY+++ R+ G CG
Sbjct: 243 GVFTGSCGTSLDHGVVAVGYGTTDDGKDYWIVRNSWGDDWGEDGYIRMERNTESKSGKCG 302
Query: 339 IGTQSSYPL 347
I + SYP+
Sbjct: 303 IAIEPSYPV 311
>gi|374713651|gb|AEZ65083.1| cysteine protease [Carica papaya]
Length = 467
Score = 302 bits (774), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 155/352 (44%), Positives = 219/352 (62%), Gaps = 21/352 (5%)
Query: 14 NTIPMFIIIILLVSCASQ--VVSSRSTH--------EQSVVEMHEKWMAQHGRSYKDELE 63
+++ +F+++I S A +VS H + V+ M+E W+ +HG++Y E
Sbjct: 6 SSLSLFLLMIFTASSAVDMSIVSYDQRHADKSSWRTDDEVMAMYEAWLVKHGKAYNALGE 65
Query: 64 KEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSH--RST 121
KE RF IFK+NL +I++ N + N TY+LG NRF+DLTN+E+R++Y G K P + R
Sbjct: 66 KEKRFGIFKDNLRFIDEHNSQ-NLTYRLGLNRFADLTNEEYRSMYLGVK-PGATRVTRKV 123
Query: 122 TSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLI 181
+ + ++ +P +DWR + AV +KDQ CG CWAFS +AAVEGI +I +LI
Sbjct: 124 SRKSDRFAARVGDALPDFIDWRKEGAVVGVKDQGSCGSCWAFSTIAAVEGINQIVTGDLI 183
Query: 182 QLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQK-AAA 240
LSEQ+LVDC T+ N GC GG M+ AFE+II N GI +E++YPY+A C +K A
Sbjct: 184 SLSEQELVDCDTSYNEGCNGGLMDYAFEFIINNGGIDSEEDYPYRAADQKCDQYRKNANV 243
Query: 241 AKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIV 300
I YE+VP DE AL KAV+ QPVS+ I A F+ Y+ G+F G CGT LDH V V
Sbjct: 244 VSIDGYEDVPENDEAALKKAVAKQPVSVAIEAGGRAFQLYQSGVFTGKCGTSLDHGVAAV 303
Query: 301 GFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD-----EGLCGIGTQSSYPL 347
G+G TE+G +YW++ NSWG WG+ GY+++ R+ G CGI SYP+
Sbjct: 304 GYG-TENGQDYWIVGNSWGKNWGEDGYIRMERNLAGSSSGKCGIAIGPSYPI 354
>gi|18413507|ref|NP_567377.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|30315953|sp|Q9SUS9.1|CPR4_ARATH RecName: Full=Probable cysteine proteinase At4g11320; Flags:
Precursor
gi|5596478|emb|CAB51416.1| drought-inducible cysteine proteinase RD21A precursor-like protein
[Arabidopsis thaliana]
gi|7267831|emb|CAB81233.1| drought-inducible cysteine proteinase RD21A precursor-like protein
[Arabidopsis thaliana]
gi|14334764|gb|AAK59560.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|15293257|gb|AAK93739.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|332657596|gb|AEE82996.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 371
Score = 302 bits (774), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 151/353 (42%), Positives = 220/353 (62%), Gaps = 26/353 (7%)
Query: 18 MFIIIILLVSCAS----QVVSSRSTHE--------QSVVE-----MHEKWMAQHGRSYKD 60
+F++ +++ SCA+ VVSS H Q + + M E WM +HG+ Y
Sbjct: 10 IFLLALVIASCATAMDMSVVSSNDNHHVTAGPGRRQGIFDAEATLMFESWMVKHGKVYDS 69
Query: 61 ELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRS 120
EKE R IF++NL +I N E N +Y+LG NRF+DL+ E+ + G P +
Sbjct: 70 VAEKERRLTIFEDNLRFITNRNAE-NLSYRLGLNRFADLSLHEYGEICHGADPRPPRNHV 128
Query: 121 TTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANL 180
+S+ +Y+ +P S+DWR++ AVT +KDQ C CWAFS V AVEG+ KI L
Sbjct: 129 FMTSSNRYKTSDGDVLPKSVDWRNEGAVTEVKDQGLCRSCWAFSTVGAVEGLNKIVTGEL 188
Query: 181 IQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKA-- 238
+ LSEQ L++C+ NNGCGGG +E A+E+I+ N G+ T+++YPY+A+ G C K
Sbjct: 189 VTLSEQDLINCNKE-NNGCGGGKVETAYEFIMNNGGLGTDNDYPYKALNGVCEGRLKEDN 247
Query: 239 AAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVT 298
I YE +P+ DE AL+KAV+ QPV+ + + + EF+ Y+ G+F+G CGT L+H V
Sbjct: 248 KNVMIDGYENLPANDEAALMKAVAHQPVTAVVDSSSREFQLYESGVFDGTCGTNLNHGVV 307
Query: 299 IVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYPL 347
+VG+G TE+G +YW++KNS GDTWG+AGYMK+ R+ GLCGI ++SYPL
Sbjct: 308 VVGYG-TENGRDYWIVKNSRGDTWGEAGYMKMARNIANPRGLCGIAMRASYPL 359
>gi|351726339|ref|NP_001237379.1| cysteine proteinase precursor [Glycine max]
gi|31559526|dbj|BAC77521.1| cysteine proteinase [Glycine max]
gi|31559528|dbj|BAC77522.1| cysteine proteinase [Glycine max]
Length = 362
Score = 302 bits (773), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 154/320 (48%), Positives = 202/320 (63%), Gaps = 20/320 (6%)
Query: 40 EQSVVEMHEKWMAQH--GRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFS 97
E+S +++E+W + H RS D K RF +FK N+ ++ NK ++ YKL N+F+
Sbjct: 33 EESFWDLYERWRSHHTVSRSLGD---KHKRFNVFKANVMHVHNTNKM-DKPYKLKLNKFA 88
Query: 98 DLTNDEFRALYTGYKMPSPSHR-----STTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIK 152
D+TN EFR+ Y G K+ HR + TF Y+ + VP S+DWR AVT +K
Sbjct: 89 DMTNHEFRSTYAGSKVNH--HRMFQGTPRGNGTFMYEKVG--SVPPSVDWRKNGAVTGVK 144
Query: 153 DQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYII 212
DQ +CG CWAFS V AVEGI +I L+ LSEQ+LVDC T N GC GG ME AFE+I
Sbjct: 145 DQGQCGSCWAFSTVVAVEGINQIKTNKLVSLSEQELVDCDTKKNAGCNGGLMESAFEFIK 204
Query: 213 QNQGIATEDEYPYQAVQGTCSAAQ-KAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIA 271
Q GI TE YPY A GTC A++ A I +E VP+ DE ALLKAV+ QPVS+ I
Sbjct: 205 QKGGITTESNYPYTAQDGTCDASKANDLAVSIDGHENVPANDENALLKAVANQPVSVAID 264
Query: 272 AYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKIL 331
A ++F+ Y EG+F G C T+L+H V IVG+GTT DG NYW ++NSWG WG+ GY+++
Sbjct: 265 AGGSDFQFYSEGVFTGDCSTELNHGVAIVGYGTTVDGTNYWTVRNSWGPEWGEQGYIRMQ 324
Query: 332 RD----EGLCGIGTQSSYPL 347
R EGLCGI +SYP+
Sbjct: 325 RSISKKEGLCGIAMMASYPI 344
>gi|225456820|ref|XP_002278323.1| PREDICTED: vignain [Vitis vinifera]
Length = 360
Score = 302 bits (773), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 152/318 (47%), Positives = 202/318 (63%), Gaps = 16/318 (5%)
Query: 40 EQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDL 99
E+S+ ++E+W + H S + EK RF +FKEN+ ++ + NK+ + YKL N+F+D+
Sbjct: 31 EESLWNLYERWRSHHTVS-RSLDEKHKRFNVFKENVNFVHEFNKK-DEPYKLKLNKFADM 88
Query: 100 TNDEFRALYTGYKMPSPSHRSTTSS-----TFKYQNLSMTDVPTSLDWRDKKAVTPIKDQ 154
TN EFR+ Y G K+ HR S +F Y+ + VP S+DWR K AVTPIKDQ
Sbjct: 89 TNHEFRSTYAGSKVNH--HRMFRGSQHAAGSFMYEKVK--SVPPSVDWRKKGAVTPIKDQ 144
Query: 155 QECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQN 214
+CG CWAFS V AVEGI I L+ LSEQ+LVDC T+ N GC GG M AFE+I +
Sbjct: 145 GQCGSCWAFSTVVAVEGINHIKTNKLVSLSEQELVDCDTSENQGCNGGLMGYAFEFIKEK 204
Query: 215 QGIATEDEYPYQAVQGTCSAAQ-KAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAY 273
GI TE YPY A GTC ++ + I +E VP +E ALLKA + QP+S+ I A
Sbjct: 205 GGITTEQSYPYTAEDGTCDVSKVNSPVVSIDGHETVPPNNEDALLKAAANQPISVAIDAG 264
Query: 274 TTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILR- 332
+ F+ Y EG+F G CGT LDH V IVG+GTT DG YW++KNSWG WG+ GY+++ R
Sbjct: 265 GSAFQFYSEGVFAGRCGTDLDHGVAIVGYGTTLDGTKYWIVKNSWGTDWGENGYIRMKRG 324
Query: 333 ---DEGLCGIGTQSSYPL 347
EGLCGI ++SYP+
Sbjct: 325 ISAKEGLCGIAVEASYPI 342
>gi|18408616|ref|NP_566901.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|75313880|sp|Q9STL5.1|CEP3_ARATH RecName: Full=KDEL-tailed cysteine endopeptidase CEP3; Flags:
Precursor
gi|4678353|emb|CAB41163.1| cysteine endopeptidase precursor-like protein [Arabidopsis
thaliana]
gi|26453052|dbj|BAC43602.1| putative cysteine endopeptidase precursor [Arabidopsis thaliana]
gi|332644885|gb|AEE78406.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 364
Score = 302 bits (773), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 156/348 (44%), Positives = 220/348 (63%), Gaps = 25/348 (7%)
Query: 18 MFIIIILLVSCASQVVSSRSTH--------EQSVVEMHEKWMAQHGRSYKDELEKEMRFK 69
M + I+L+S S + +S+ E++V +++E+W H S + E RF
Sbjct: 1 MKLFFIVLISFLSLLQASKGFDFDEKELETEENVWKLYERWRGHHSVS-RASHEAIKRFN 59
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTG-----YKMPSPSHRSTTSS 124
+F+ N+ ++ + NK+ N+ YKL NRF+D+T+ EFR+ Y G ++M R S
Sbjct: 60 VFRHNVLHVHRTNKK-NKPYKLKINRFADITHHEFRSSYAGSNVKHHRMLRGPKRG--SG 116
Query: 125 TFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLS 184
F Y+N+ T VP+S+DWR+K AVT +K+QQ+CG CWAFS VAAVEGI KI L+ LS
Sbjct: 117 GFMYENV--TRVPSSVDWREKGAVTEVKNQQDCGSCWAFSTVAAVEGINKIRTNKLVSLS 174
Query: 185 EQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQA--VQGTCSAAQKAAAAK 242
EQ+LVDC T N GC GG ME AFE+I N GI TE+ YPY + VQ + +
Sbjct: 175 EQELVDCDTEENQGCAGGLMEPAFEFIKNNGGIKTEETYPYDSSDVQFCRANSIGGETVT 234
Query: 243 ISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGF 302
I +E VP DE+ LLKAV+ QPVS+ I A +++F+ Y EG+F G CGTQL+H V IVG+
Sbjct: 235 IDGHEHVPENDEEELLKAVAHQPVSVAIDAGSSDFQLYSEGVFIGECGTQLNHGVVIVGY 294
Query: 303 GTTEDGANYWLIKNSWGDTWGDAGYMKILR----DEGLCGIGTQSSYP 346
G T++G YW+++NSWG WG+ GY++I R +EG CGI ++SYP
Sbjct: 295 GETKNGTKYWIVRNSWGPEWGEGGYVRIERGISENEGRCGIAMEASYP 342
>gi|30141021|dbj|BAC75924.1| cysteine protease-2 [Helianthus annuus]
Length = 362
Score = 301 bits (772), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 147/317 (46%), Positives = 206/317 (64%), Gaps = 15/317 (4%)
Query: 40 EQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDL 99
E ++ +M+E+W + ++ ++L RF +FK N+ ++ + NK ++ YKL N+F+D+
Sbjct: 33 EDNLWDMYERWRHKVATNHGEKLR---RFNVFKSNVLHVHETNKM-DKPYKLKLNKFADM 88
Query: 100 TNDEFRALYTGYKMP----SPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQ 155
TN EFR++Y G K+ S + S TF Y N+ VPTS+DWR K AV P+KDQ
Sbjct: 89 TNHEFRSVYAGSKIHHHDRSLQGDRSGSKTFMYANVE--SVPTSVDWRKKGAVAPVKDQG 146
Query: 156 ECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQ 215
+CG CWAFS VAAVEGI KI L+ LSEQ+LVDC T N GC GG M+ AF++I +
Sbjct: 147 QCGSCWAFSTVAAVEGINKIKTNELVSLSEQELVDCDTLENQGCNGGLMDLAFDFIKKTG 206
Query: 216 GIATEDEYPYQAVQGTC-SAAQKAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYT 274
G+ ED YPY A G C S + I +E+VP DEQ+L+KAV+ QPV++ I A +
Sbjct: 207 GLTREDAYPYAAEDGKCDSNKMNSPVVSIDGHEDVPKNDEQSLMKAVANQPVAVAIDAGS 266
Query: 275 TEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILR-- 332
++F+ Y EG+F G CGTQLDH V VG+GTT DG YW+++NSWG WG+ GY+++ R
Sbjct: 267 SDFQFYSEGVFTGKCGTQLDHGVAAVGYGTTLDGTKYWIVRNSWGSEWGEKGYIRMERGI 326
Query: 333 --DEGLCGIGTQSSYPL 347
GLCGI ++SYP+
Sbjct: 327 SDKRGLCGIAMEASYPI 343
>gi|146215990|gb|ABQ10197.1| actinidin Act4a [Actinidia eriantha]
Length = 385
Score = 301 bits (771), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 147/310 (47%), Positives = 205/310 (66%), Gaps = 10/310 (3%)
Query: 43 VVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTND 102
V+ M E W+ ++G+SY EKE RF+IFK+NL ++++ N + NR+YK+G N+FSDLT+
Sbjct: 44 VIAMFESWLVEYGKSYNALGEKERRFEIFKDNLRFVDEHNADVNRSYKVGLNQFSDLTDA 103
Query: 103 EFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWA 162
E+ ++Y G K + T+ + +Y+ +P S+DWR K AV +K+Q CG CW
Sbjct: 104 EYSSIYLGTKF----NIRMTNVSDRYEPRVGDQLPDSVDWRKKGAVLGVKNQGNCGSCWT 159
Query: 163 FSAVAAVEGITKISGANLIQLSEQQLVDCSTN-GNNGCGGGTMEKAFEYIIQNQGIATED 221
F+++AAVEGI KI NLI LSEQ++VDC NNGC GGT+ A+++II N GI TE
Sbjct: 160 FASIAAVEGINKIVTGNLISLSEQEIVDCQRKYPNNGCNGGTLSGAYQFIINNGGINTEA 219
Query: 222 EYPYQAVQGTCSAAQKAAA-AKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSY 280
YPY G C +K I YE VPS +E+AL KAV+ QPVS+ IA+ +T FKSY
Sbjct: 220 NYPYTGRDGVCDQNKKNKKYVTIDRYENVPSNNEKALQKAVAFQPVSVVIASNSTAFKSY 279
Query: 281 KEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD---EGLC 337
K GIFNG CG ++DH VTIVG+G TE G +YW+++NSWG WG++GY+++ R+ G C
Sbjct: 280 KSGIFNGPCGPRIDHGVTIVGYG-TEGGKDYWIVRNSWGPNWGESGYVRMQRNVGGSGKC 338
Query: 338 GIGTQSSYPL 347
I YP+
Sbjct: 339 FIARAPVYPV 348
>gi|358343350|ref|XP_003635767.1| Cysteine proteinase [Medicago truncatula]
gi|355501702|gb|AES82905.1| Cysteine proteinase [Medicago truncatula]
Length = 338
Score = 301 bits (771), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 155/342 (45%), Positives = 222/342 (64%), Gaps = 18/342 (5%)
Query: 15 TIPMFIIII---LLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIF 71
TI + I+I+ ++ S ++ + ST+ + + +E W+ ++GR Y+D E E+RF I+
Sbjct: 4 TITLSIVILNLWIIASACPEIHTKNSTNPAVMKKRYETWLKRYGRHYRDREEWEVRFDIY 63
Query: 72 KENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNL 131
+ N++YIE N + N +YKL NRF+D+TN+EF++ Y GY +P R + F+Y
Sbjct: 64 QSNVQYIEFYNSQ-NYSYKLIDNRFADITNEEFKSTYLGY-LP----RFRVQTEFRYH-- 115
Query: 132 SMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDC 191
++P S+DWR K AVT +KDQ CG CWAFSAVAAVEGI KI NL+ LSEQQL+DC
Sbjct: 116 KHGELPKSIDWRKKGAVTHVKDQGRCGSCWAFSAVAAVEGINKIKTENLVSLSEQQLIDC 175
Query: 192 S-TNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQ-KAAAAKISNYEEV 249
+GN GC GG M AF YI ++ GIAT EYPY+ G C+ ++ K A IS YE V
Sbjct: 176 DIKSGNEGCEGGDMYIAFNYIKKHGGIATAKEYPYKGRDGNCNKSKAKNNAVTISGYESV 235
Query: 250 PSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGA 309
P+ +E+ L AV+ QPVSI A F+ Y +GIF+G CG L+H +TIVG+G E+G
Sbjct: 236 PARNEKMLKAAVAHQPVSIATDAGGYAFQFYSKGIFSGSCGKNLNHGMTIVGYG-EENGD 294
Query: 310 NYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYPL 347
YW++KNSW + WG++GY+++ RD +G CGI ++YP+
Sbjct: 295 KYWIVKNSWANDWGESGYVRMKRDTKDKDGTCGIAMDATYPV 336
>gi|312282059|dbj|BAJ33895.1| unnamed protein product [Thellungiella halophila]
Length = 379
Score = 301 bits (771), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 143/306 (46%), Positives = 201/306 (65%), Gaps = 9/306 (2%)
Query: 48 EKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRAL 107
E W+ +HG+ Y EKE R IFK+NL +I N E N Y+LG NRF+DL+ E++ +
Sbjct: 65 ESWIVKHGKVYDSVAEKERRLTIFKDNLRFITNRNSE-NLGYRLGLNRFADLSLHEYKEI 123
Query: 108 YTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVA 167
G P + SS+ +Y+ + +P S+DWR++ AVT +KDQ C CWAFS V
Sbjct: 124 CHGADPKPPRNHVFMSSSDRYKTSAGDVLPKSVDWRNEGAVTEVKDQGHCRSCWAFSTVG 183
Query: 168 AVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQA 227
AVEG+ KI L+ LSEQ L++C+ NNGCGGG +E A+E+I+ N G+ T+++YPY+A
Sbjct: 184 AVEGLNKIVTGELVTLSEQDLINCNKE-NNGCGGGKVETAYEFIVSNGGLGTDNDYPYKA 242
Query: 228 VQGTCSAAQKAAAAK--ISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIF 285
V G C K I YE +P+ DE AL+KAV+ QPV+ I + + EF+ Y+ G+F
Sbjct: 243 VNGACDGRLKENIKNVMIDGYENLPANDELALMKAVAHQPVTAVIDSSSREFQLYESGVF 302
Query: 286 NGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGT 341
+G CGT L+H V +VG+G TE+G NYW+++NSWG+TWG+AGYMK+ R+ GLCGI
Sbjct: 303 DGRCGTNLNHGVVVVGYG-TENGRNYWIVRNSWGNTWGEAGYMKMARNIANPRGLCGIAM 361
Query: 342 QSSYPL 347
+ SYPL
Sbjct: 362 RVSYPL 367
>gi|297816030|ref|XP_002875898.1| hypothetical protein ARALYDRAFT_485194 [Arabidopsis lyrata subsp.
lyrata]
gi|297321736|gb|EFH52157.1| hypothetical protein ARALYDRAFT_485194 [Arabidopsis lyrata subsp.
lyrata]
Length = 363
Score = 301 bits (770), Expect = 4e-79, Method: Compositional matrix adjust.
Identities = 150/318 (47%), Positives = 209/318 (65%), Gaps = 17/318 (5%)
Query: 40 EQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDL 99
E++V +++E+W H + + E RF +F+ N+ ++ + NK+ N+ YKL NRF+D+
Sbjct: 30 EENVWKLYERWRDHHSVT-RASHEALKRFNVFRHNVLHVHRTNKK-NKPYKLKVNRFADI 87
Query: 100 TNDEFRALYTG-----YKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQ 154
T+ EFR+ Y G ++M R S F Y+N+ T VP+S+DWR+K AVT +K+Q
Sbjct: 88 THHEFRSSYAGSNVKHHRMLRGPKRG--SGGFMYENV--TRVPSSVDWREKGAVTEVKNQ 143
Query: 155 QECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQN 214
Q+CG CWAFS VAAVEGI KI L+ LSEQ+LVDC T N GC GG ME AFE+I N
Sbjct: 144 QDCGSCWAFSTVAAVEGINKIRTNKLVSLSEQELVDCDTEENQGCAGGLMEPAFEFIKNN 203
Query: 215 QGIATEDEYPYQA--VQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAA 272
GI TE+ YPY + VQ + + I +E VP DE+ALLKAV+ QPVS+ I A
Sbjct: 204 GGIKTEETYPYDSNDVQFCRAKSIDGETVTIDGHEHVPENDEEALLKAVAHQPVSVAIDA 263
Query: 273 YTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILR 332
+++F+ Y EG+F G CGTQL+H V IVG+G T++G YW+++NSWG WG+ GY++I R
Sbjct: 264 GSSDFQLYSEGVFIGECGTQLNHGVVIVGYGETKNGTKYWIVRNSWGPEWGEGGYVRIER 323
Query: 333 ----DEGLCGIGTQSSYP 346
+EG CGI ++SYP
Sbjct: 324 GISENEGRCGIAMEASYP 341
>gi|57282619|emb|CAE54307.1| cysteine proteinase [Gossypium hirsutum]
Length = 372
Score = 301 bits (770), Expect = 4e-79, Method: Compositional matrix adjust.
Identities = 145/315 (46%), Positives = 206/315 (65%), Gaps = 9/315 (2%)
Query: 40 EQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDL 99
+ V+ +++ W+ QHG++Y E+E RF+IFK+NL +I++ N N TYKLG N+F+DL
Sbjct: 39 DDEVMGLYKSWVIQHGKAYNGIGEEEKRFEIFKDNLRFIDEHNSNNNTTYKLGLNKFADL 98
Query: 100 TNDEFRALYTGYKMPSPSHRSTTSS--TFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQEC 157
TN E+RA + G + P R S + +Y + + ++P S++WRD AV+ +KDQ C
Sbjct: 99 TNQEYRAKFLGTRT-DPRRRLMKSKIPSSRYAHRAGDNLPDSVNWRDHGAVSRVKDQGSC 157
Query: 158 GCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGI 217
G CWAFSA+AAVEGI KI LI LSEQ+LVDC + + GC GG M+ AF++II N GI
Sbjct: 158 GSCWAFSAIAAVEGINKIVSGELISLSEQELVDCDRSYDAGCNGGLMDYAFQFIIDNGGI 217
Query: 218 ATEDEYPYQAVQGTCSAAQK-AAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTE 276
TE +YPY C +K A I YE+VP+ +E AL KAV+ QPVSI I A
Sbjct: 218 DTEKDYPYLGFNNQCDPTKKNAKVVSIDGYEDVPN-NENALKKAVAHQPVSIAIEAGGRA 276
Query: 277 FKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILR---- 332
F+ Y+ G+FNG CG LDH V VG+G+ ++G +YW+++NSWG WG+ GY+++ R
Sbjct: 277 FQLYESGVFNGECGLALDHGVVAVGYGSDDNGQDYWIVRNSWGGNWGENGYIRMERNINA 336
Query: 333 DEGLCGIGTQSSYPL 347
+ G CGI ++SYP+
Sbjct: 337 NTGKCGIAMEASYPV 351
>gi|2342494|dbj|BAA21848.1| bromelain [Ananas comosus]
gi|2463582|dbj|BAA22543.1| FB31 precursor [Ananas comosus]
Length = 352
Score = 301 bits (770), Expect = 4e-79, Method: Compositional matrix adjust.
Identities = 146/333 (43%), Positives = 209/333 (62%), Gaps = 9/333 (2%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEY 77
+F+ + L V AS +SR +++ E+WMA++GR YKD EK RF+IFK N+ +
Sbjct: 8 VFLFLFLCVMWASPSAASRDEPSDPMMKRFEEWMAEYGRVYKDNDEKMRRFQIFKNNVNH 67
Query: 78 IEKANKEGNRTYKLGTNRFSDLTNDEFRALYTG-YKMPSPSHRSTTSSTFKYQNLSMTDV 136
IE N +Y LG N+F+D+TN+EF A YTG P + S + +++++ V
Sbjct: 68 IETFNNRNGNSYTLGINKFTDMTNNEFVAQYTGGISRPLNIEKEPVVS---FDDVNISAV 124
Query: 137 PTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGN 196
S+DWRD AVT +KDQ CG CWAFSA+A VEGI KI L+ LSEQ+++DC+ +
Sbjct: 125 GQSIDWRDYGAVTEVKDQNPCGSCWAFSAIATVEGIYKIVTGYLVSLSEQEVLDCAVS-- 182
Query: 197 NGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQA 256
NGC GG ++ A+++II N G+A+E +YPYQA QG C+A +A I+ Y V S DE +
Sbjct: 183 NGCDGGFVDNAYDFIISNNGVASEADYPYQAYQGDCAANSWPNSAYITGYSYVRSNDESS 242
Query: 257 LLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKN 316
+ AV QP++ I A F+ Y G+F+G CGT L+HA+TI+G+G G YW++KN
Sbjct: 243 MKYAVWNQPIAAAIDASGDNFQYYNGGVFSGPCGTSLNHAITIIGYGQDSSGTQYWIVKN 302
Query: 317 SWGDTWGDAGYMKILR---DEGLCGIGTQSSYP 346
SWG +WG+ GY+++ R GLCGI YP
Sbjct: 303 SWGSSWGERGYIRMARGVSSSGLCGIAMDPLYP 335
>gi|226533314|ref|NP_001150119.1| xylem cysteine proteinase 2 [Zea mays]
gi|195636886|gb|ACG37911.1| xylem cysteine proteinase 2 precursor [Zea mays]
gi|223946183|gb|ACN27175.1| unknown [Zea mays]
gi|413951209|gb|AFW83858.1| Xylem cysteine proteinase 2 [Zea mays]
Length = 385
Score = 301 bits (770), Expect = 4e-79, Method: Compositional matrix adjust.
Identities = 152/335 (45%), Positives = 208/335 (62%), Gaps = 27/335 (8%)
Query: 37 STHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRF 96
S+HE S+ E+ E+W+++H R+Y EK RF++FK+NL +I++ N++ + +Y LG N F
Sbjct: 50 SSHE-SLAELFERWLSRHRRAYASLEEKLRRFQVFKDNLHHIDETNRKVS-SYWLGLNEF 107
Query: 97 SDLTNDEFRALYTGYK------MPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTP 150
+DLT+DEF+A Y G + Y+ + +P S+DWR K AVT
Sbjct: 108 ADLTHDEFKATYLGLRSSVGDGGSGIDDDDEPEEEEGYEGVDGASLPKSVDWRSKGAVTG 167
Query: 151 IKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEY 210
+K+Q +CG CWAFS VAAVEGI +I NL LSEQ+L+DC T+GNNGC GG M+ AF Y
Sbjct: 168 VKNQGQCGSCWAFSTVAAVEGINQIVTGNLTALSEQELIDCDTDGNNGCNGGLMDYAFSY 227
Query: 211 IIQNQGIATEDEYPYQAVQGTC---------------SAAQKAAAAKISNYEEVPSGDEQ 255
I N G+ TE+ YPY +GTC A AA IS YE+VP +EQ
Sbjct: 228 IAHNGGLHTEEAYPYLMEEGTCQRSSSSEKKWPGSSEDANDDAAVVTISGYEDVPRNNEQ 287
Query: 256 ALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIK 315
ALLKA++ QPVS+ I A F+ Y G+F+G CGTQLDH V VG+GT G +Y ++K
Sbjct: 288 ALLKALAQQPVSVAIEASGRNFQFYSGGVFDGPCGTQLDHGVAAVGYGTAAKGHDYIIVK 347
Query: 316 NSWGDTWGDAGYMKILRD----EGLCGIGTQSSYP 346
NSWG +WG+ GY+++ R +GLCGI +SYP
Sbjct: 348 NSWGPSWGEKGYIRMRRGTGKRQGLCGINKMASYP 382
>gi|255547982|ref|XP_002515048.1| cysteine protease, putative [Ricinus communis]
gi|223546099|gb|EEF47602.1| cysteine protease, putative [Ricinus communis]
Length = 359
Score = 300 bits (769), Expect = 5e-79, Method: Compositional matrix adjust.
Identities = 150/315 (47%), Positives = 211/315 (66%), Gaps = 12/315 (3%)
Query: 40 EQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDL 99
E+S+ ++E+W + H S + EK RF +FKENL++I K N++ +R YKL N+F+D+
Sbjct: 33 EESLWNLYERWRSHHTVS-RSLTEKNQRFNVFKENLKHIHKVNQK-DRPYKLRLNKFADM 90
Query: 100 TNDEFRALYTGYKMPSPS--HRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQEC 157
TN EF Y G K+ H S + F ++N S ++P+S+DWR + AVT +KDQ +C
Sbjct: 91 TNHEFLQHYGGSKVSHYRMFHGSRRQTGFAHENTS--NLPSSIDWRKQGAVTGVKDQGKC 148
Query: 158 GCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGI 217
G CWAFS+VAAVEGI KI LI LSEQ+LVDC++ N+GC GG ME+AF +I + G+
Sbjct: 149 GSCWAFSSVAAVEGINKIKTGELISLSEQELVDCNSV-NHGCDGGLMEQAFSFIEKTGGL 207
Query: 218 ATEDEYPYQAVQGTC-SAAQKAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTE 276
TE+ YPY+A G C SA I YE VP DE AL++AV+ QPVSI I A +
Sbjct: 208 TTENNYPYRAKDGYCDSAKMNTPMVTIDGYEMVPENDEHALMQAVANQPVSIAIDAGGQD 267
Query: 277 FKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILR---- 332
F+ Y EG++ G CGT+L+H V +VG+G T+DG YW++KNSWG WG+ G++++ R
Sbjct: 268 FQFYSEGVYTGDCGTELNHGVALVGYGATQDGTKYWIVKNSWGSEWGENGFIRMQRENDV 327
Query: 333 DEGLCGIGTQSSYPL 347
+EGLCGI ++SYP+
Sbjct: 328 EEGLCGITLEASYPI 342
>gi|302796898|ref|XP_002980210.1| hypothetical protein SELMODRAFT_153766 [Selaginella moellendorffii]
gi|300151826|gb|EFJ18470.1| hypothetical protein SELMODRAFT_153766 [Selaginella moellendorffii]
Length = 479
Score = 300 bits (769), Expect = 5e-79, Method: Compositional matrix adjust.
Identities = 151/322 (46%), Positives = 203/322 (63%), Gaps = 16/322 (4%)
Query: 38 THEQSVVEMHEKWMAQHGRSYKDEL--------EKEMRFKIFKENLEYIEKANKEGNRTY 89
+ E+ + + + WM QHG+SY D EK R+ IFK+NL +I N E N+ Y
Sbjct: 48 SSEERLQALFDSWMLQHGKSYADNALSGDSQAGEKATRYGIFKDNLRFIHGEN-EKNQGY 106
Query: 90 KLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVT 149
LG N F+DLTN+EFRA G + R T+ F+Y ++ + D+P S+DWR+K AV
Sbjct: 107 FLGLNAFADLTNEEFRAQRHGGRFDRSRER-TSHEEFRYGSVQLKDLPDSIDWREKGAVV 165
Query: 150 PIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFE 209
+KDQ CG CWAFSAVAA+EG+ K++ L+ LSEQ+LVDC + GC GG M+ AF
Sbjct: 166 GVKDQGSCGSCWAFSAVAAIEGVNKLATGELVSLSEQELVDCDKGEDEGCNGGLMDYAFG 225
Query: 210 YIIQNQGIATEDEYPYQAVQGTCSAAQ-KAAAAKISNYEEVPSGDEQALLKAVSMQPVSI 268
++I+N G+ TE +YPY+ C ++ A I YE+VP DE ALLKAV+ QPVS+
Sbjct: 226 FVIKNGGLDTEADYPYKGYGTRCDRSKMNAKVVTIDGYEDVPVNDETALLKAVAHQPVSV 285
Query: 269 GIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYM 328
I A + + Y+ GIF G CGT LDH VT VG+G EDG YW+IKNSWG WG+ GY+
Sbjct: 286 AIDAGGSSMQFYRSGIFTGRCGTDLDHGVTNVGYG-KEDGKAYWIIKNSWGSNWGEKGYV 344
Query: 329 KILRD----EGLCGIGTQSSYP 346
K+ R+ GLCGI ++SYP
Sbjct: 345 KMARNTGLAAGLCGINMEASYP 366
>gi|255538788|ref|XP_002510459.1| cysteine protease, putative [Ricinus communis]
gi|223551160|gb|EEF52646.1| cysteine protease, putative [Ricinus communis]
Length = 422
Score = 300 bits (769), Expect = 5e-79, Method: Compositional matrix adjust.
Identities = 149/335 (44%), Positives = 215/335 (64%), Gaps = 12/335 (3%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEY 77
+F+I +L + + S + + ++ E W +HG++Y + +K RFKIF+EN E+
Sbjct: 7 LFLITLLFFN----LSISSFSSSSDISKLFESWTKEHGKTYTSKEDKLYRFKIFEENYEF 62
Query: 78 IEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
++K N +GN +Y L N F+DLT+ EF+A G S S + + F + + DVP
Sbjct: 63 VKKHNSQGNSSYTLSLNAFADLTHHEFKASRLGLSAFSTSGK-LSRRNFPLHDF-VGDVP 120
Query: 138 TSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNN 197
S+DWR K AV+ +KDQ CG CW+FSA A+EGI KI +L+ LSEQ+LVDC + NN
Sbjct: 121 ISIDWRKKGAVSQVKDQGNCGACWSFSATGAIEGINKIVTGSLVSLSEQELVDCDRSYNN 180
Query: 198 GCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQ-KAAAAKISNYEEVPSGDEQA 256
GC GG M+ A++++I+N GI TE++YPYQA + TC+ + K I Y +VP +E+
Sbjct: 181 GCEGGLMDYAYQFVIENNGIDTEEDYPYQAREKTCNKEKLKRHVVTIDGYTDVPQNNEKE 240
Query: 257 LLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKN 316
LLKAV+ QPVS+GI F+ Y +GIF G C T LDHAV IVG+G +E+G +YW++KN
Sbjct: 241 LLKAVAAQPVSVGICGSERAFQLYSKGIFTGPCSTSLDHAVLIVGYG-SENGVDYWIVKN 299
Query: 317 SWGDTWGDAGYMKILRD----EGLCGIGTQSSYPL 347
SWG WG GYM +LR+ +GLCGI +S+P+
Sbjct: 300 SWGTHWGINGYMYMLRNSGNSQGLCGINMLASFPV 334
>gi|225428328|ref|XP_002279940.1| PREDICTED: cysteine proteinase-like [Vitis vinifera]
Length = 707
Score = 300 bits (768), Expect = 7e-79, Method: Compositional matrix adjust.
Identities = 144/309 (46%), Positives = 209/309 (67%), Gaps = 10/309 (3%)
Query: 43 VVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTND 102
++ E W+++HG+ YK EK RF++F+ENL +I++ NKE + +Y LG N F+DL+++
Sbjct: 400 LIARFESWVSKHGKVYKSMEEKLHRFEVFRENLNHIDERNKEVS-SYWLGLNEFADLSHE 458
Query: 103 EFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWA 162
EF++ Y G + P R S F+Y++++ D+P S+DWR K AVT +K+Q CG CWA
Sbjct: 459 EFKSKYLGLRAEFPRSRDY-SGEFRYRDVA--DLPESVDWRKKGAVTHVKNQGACGSCWA 515
Query: 163 FSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDE 222
FS VAAVEGI +I NL LSEQ+L+DC T N+GC GG M+ AF +I N G+ ED+
Sbjct: 516 FSTVAAVEGINQIVTGNLTTLSEQELIDCDTTFNSGCNGGLMDYAFAFIASNGGLHKEDD 575
Query: 223 YPYQAVQGTCSAAQK-AAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYK 281
YPY +GTC ++ IS YE+VP DE++LLKA++ QP+S+ I A +F+ Y
Sbjct: 576 YPYLMEEGTCEEQKEDVDIVTISGYEDVPEKDEESLLKALAHQPLSVAIEASGRDFQFYS 635
Query: 282 EGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD----EGLC 337
G+FNG CGT+LDH V VG+G+++ G +Y ++KNSWG WG+ GY+++ R+ EGLC
Sbjct: 636 GGVFNGPCGTELDHGVAAVGYGSSK-GLDYIIVKNSWGPKWGEKGYIRMKRNTGKTEGLC 694
Query: 338 GIGTQSSYP 346
GI +SYP
Sbjct: 695 GINKMASYP 703
>gi|359359066|gb|AEV40973.1| putative oryzain beta chain precursor [Oryza punctata]
Length = 461
Score = 300 bits (768), Expect = 7e-79, Method: Compositional matrix adjust.
Identities = 142/308 (46%), Positives = 212/308 (68%), Gaps = 13/308 (4%)
Query: 47 HEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRT--YKLGTNRFSDLTNDEF 104
++ W+A++GRSY E+E RF++F +NL++++ N + ++LG NRF+DLTNDEF
Sbjct: 49 YDLWLAENGRSYNALGERERRFRVFWDNLKFVDAHNARADEHGGFRLGMNRFADLTNDEF 108
Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFS 164
R+ + G K+ S ++ +Y++ + ++P S+DWR+K AV P+K+Q +CG CWAFS
Sbjct: 109 RSTFLGAKVVERSR----AAGERYRHDGVEELPESVDWREKGAVAPVKNQGQCGSCWAFS 164
Query: 165 AVAAVEGITKISGANLIQLSEQQLVDCSTNG-NNGCGGGTMEKAFEYIIQNQGIATEDEY 223
AV+ VE I ++ +I LSEQ+LV+CSTNG N+GC GG M+ AF++II+N GI TED+Y
Sbjct: 165 AVSTVESINQLVTGEMITLSEQELVECSTNGQNSGCNGGLMDDAFDFIIKNGGIDTEDDY 224
Query: 224 PYQAVQGTCSA-AQKAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKE 282
PY+AV G C + A I +E+VP DE++L KAV+ QPVS+ I A EF+ Y
Sbjct: 225 PYKAVDGKCDINRENAKVVSIDGFEDVPQNDEKSLQKAVAHQPVSVAIEAGGREFQLYHS 284
Query: 283 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD----EGLCG 338
G+F+G CGT LDH V VG+G T++G +YW+++NSWG WG++GY+++ R+ G CG
Sbjct: 285 GVFSGRCGTSLDHGVVAVGYG-TDNGKDYWIVRNSWGPKWGESGYVRMERNINATTGKCG 343
Query: 339 IGTQSSYP 346
I +SYP
Sbjct: 344 IAMMASYP 351
>gi|18423124|ref|NP_568722.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|75309064|sp|Q9FGR9.1|CEP1_ARATH RecName: Full=KDEL-tailed cysteine endopeptidase CEP1; AltName:
Full=Cysteine proteinase CP56; Short=AtCP56; Flags:
Precursor
gi|9759028|dbj|BAB09397.1| cysteine endopeptidase [Arabidopsis thaliana]
gi|20258850|gb|AAM13907.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|308097832|gb|ADO14465.1| papain [Arabidopsis thaliana]
gi|332008536|gb|AED95919.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 361
Score = 300 bits (768), Expect = 7e-79, Method: Compositional matrix adjust.
Identities = 151/348 (43%), Positives = 218/348 (62%), Gaps = 22/348 (6%)
Query: 16 IPMFIIIILLVSCASQVVSSRSTH------EQSVVEMHEKWMAQHGRSYKDELEKEMRFK 69
+ FI++ L + + H E S+ E++E+W + H + E EK RF
Sbjct: 1 MKRFIVLALCMLMVLETTKGLDFHNKDVESENSLWELYERWRSHHTVARSLE-EKAKRFN 59
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTG-----YKMPSPSHRSTTSS 124
+FK N+++I + NK+ +++YKL N+F D+T++EFR Y G ++M ++T S
Sbjct: 60 VFKHNVKHIHETNKK-DKSYKLKLNKFGDMTSEEFRRTYAGSNIKHHRMFQGEKKATKS- 117
Query: 125 TFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLS 184
F Y N++ +PTS+DWR AVTP+K+Q +CG CWAFS V AVEGI +I L LS
Sbjct: 118 -FMYANVNT--LPTSVDWRKNGAVTPVKNQGQCGSCWAFSTVVAVEGINQIRTKKLTSLS 174
Query: 185 EQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTC-SAAQKAAAAKI 243
EQ+LVDC TN N GC GG M+ AFE+I + G+ +E YPY+A TC + + A I
Sbjct: 175 EQELVDCDTNQNQGCNGGLMDLAFEFIKEKGGLTSELVYPYKASDETCDTNKENAPVVSI 234
Query: 244 SNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFG 303
+E+VP E L+KAV+ QPVS+ I A ++F+ Y EG+F G CGT+L+H V +VG+G
Sbjct: 235 DGHEDVPKNSEDDLMKAVANQPVSVAIDAGGSDFQFYSEGVFTGRCGTELNHGVAVVGYG 294
Query: 304 TTEDGANYWLIKNSWGDTWGDAGYMKILR----DEGLCGIGTQSSYPL 347
TT DG YW++KNSWG+ WG+ GY+++ R EGLCGI ++SYPL
Sbjct: 295 TTIDGTKYWIVKNSWGEEWGEKGYIRMQRGIRHKEGLCGIAMEASYPL 342
>gi|359473128|ref|XP_002285397.2| PREDICTED: vignain-like [Vitis vinifera]
Length = 357
Score = 300 bits (767), Expect = 8e-79, Method: Compositional matrix adjust.
Identities = 148/342 (43%), Positives = 215/342 (62%), Gaps = 8/342 (2%)
Query: 12 KINTIPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIF 71
K+ + + ++++ ++ + E+S+ +++E+W + H S +D EK RF +F
Sbjct: 3 KVILVALSLVLVFGLAESFDFDEKDLASEESLWDLYERWRSYHTVS-RDLEEKNKRFNVF 61
Query: 72 KENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSH-RSTTSSTFKYQN 130
KEN +++ K N+ ++ YKL N+F+D+TN EFR+ Y G K+ R T + +
Sbjct: 62 KENTKHVHKVNQM-DKPYKLKLNKFADMTNHEFRSSYGGSKVKHYRMLRGDRRGTGGFMH 120
Query: 131 LSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVD 190
T +P S+DWR K AVT IKDQ +CG CWAFS V VEGI +I L+ LSEQQL+D
Sbjct: 121 EKTTYLPPSVDWRKKGAVTGIKDQGKCGSCWAFSTVVGVEGINQIKTKELLSLSEQQLID 180
Query: 191 CSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQ-KAAAAKISNYEEV 249
C + ++GC GG ME AFE+I +N GI TE+ YPY+A C + A I +E V
Sbjct: 181 CDRSDDHGCNGGLMESAFEFIKKNGGITTENNYPYKAKDERCDMLKMNAPVVTIDGHESV 240
Query: 250 PSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGA 309
P DE+AL+KAV+ QPVS+ I A ++ + Y EG+F+G CGT+LDH V IVG+GTT DG
Sbjct: 241 PVNDERALMKAVAHQPVSVAIDAGGSDLQFYSEGVFDGECGTELDHGVAIVGYGTTLDGT 300
Query: 310 NYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYPL 347
YW++KNSWG WG+ GY+++ R EG CGI ++SYP+
Sbjct: 301 KYWIVKNSWGAEWGEKGYIRMARGIQAAEGQCGIAMEASYPV 342
>gi|222629922|gb|EEE62054.1| hypothetical protein OsJ_16838 [Oryza sativa Japonica Group]
Length = 336
Score = 300 bits (767), Expect = 8e-79, Method: Compositional matrix adjust.
Identities = 148/298 (49%), Positives = 199/298 (66%), Gaps = 10/298 (3%)
Query: 56 RSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPS 115
++Y EK RF++FK+NL +I+ NK+ +Y LG N F+DLT+DEF+A Y G P
Sbjct: 38 KAYASFEEKVRRFEVFKDNLNHIDDINKK-VTSYWLGLNEFADLTHDEFKATYLGL-TPP 95
Query: 116 PSHRST---TSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGI 172
P+ ++ +S F+Y +S +VP +DWR K AVT +K+Q +CG CWAFS VAAVEGI
Sbjct: 96 PTRSNSKHYSSEEFRYGKMSNGEVPKEMDWRKKNAVTEVKNQGQCGSCWAFSTVAAVEGI 155
Query: 173 TKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTC 232
I NL LSEQ+L+DCST+GNNGC GG M+ AF YI G+ TE+ YPY +G C
Sbjct: 156 NAIVTGNLTSLSEQELIDCSTDGNNGCNGGLMDYAFSYIASTGGLRTEEAYPYAMEEGDC 215
Query: 233 SAAQKAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQ 292
+ AA IS YE+VP+ DEQAL+KA++ QPVS+ I A F+ Y G+F+G CG Q
Sbjct: 216 DEGKGAAVVTISGYEDVPANDEQALVKALAHQPVSVAIEASGRHFQFYSGGVFDGPCGEQ 275
Query: 293 LDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILR----DEGLCGIGTQSSYP 346
LDH VT VG+GT++ G +Y ++KNSWG WG+ GY+++ R EGLCGI +SYP
Sbjct: 276 LDHGVTAVGYGTSK-GQDYIIVKNSWGPHWGEKGYIRMKRGTGKGEGLCGINKMASYP 332
>gi|449460678|ref|XP_004148072.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Cucumis
sativus]
Length = 317
Score = 300 bits (767), Expect = 9e-79, Method: Compositional matrix adjust.
Identities = 155/316 (49%), Positives = 207/316 (65%), Gaps = 16/316 (5%)
Query: 37 STHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRF 96
S+ + + ++KWM ++GR YK E E RF I++ N++YI+ N N ++ L N F
Sbjct: 9 SSCSSDIQDRYQKWMDKYGRQYKSREEWERRFTIYQANVQYIDNFNSM-NHSHTLAENNF 67
Query: 97 SDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQE 156
+DLTN+EF+A Y GYK S + F+Y N M ++PT++DWR + AVTPIK+Q +
Sbjct: 68 ADLTNEEFKATYLGYKTVS-----IPDTCFRYGN--MVNLPTNVDWRQEGAVTPIKNQGQ 120
Query: 157 CGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCS-TNGNNGCGGGTMEKAFEYIIQNQ 215
CG CWAFSAVAAVEGI KI LI LSEQ+LVDC T+GN GC GG M KAFE+ I+
Sbjct: 121 CGSCWAFSAVAAVEGINKIKAGKLISLSEQELVDCDVTSGNQGCNGGYMYKAFEF-IKRT 179
Query: 216 GIATEDEYPYQAVQGTCS-AAQKAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYT 274
G+ TE EYPYQ + C+ +K IS YE+VP DE++L AV+ QPVS+ I A
Sbjct: 180 GLTTEIEYPYQGAESACNEQKEKYQFVSISGYEKVPVNDEKSLKAAVANQPVSVAIDAEG 239
Query: 275 TEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD- 333
F+ Y GIF+G CG QL+H V IVG+G T + A YWL+KNSWG WG++GY+++ RD
Sbjct: 240 NNFQFYSGGIFSGNCGNQLNHGVAIVGYGETSNQA-YWLVKNSWGTDWGESGYIRMKRDS 298
Query: 334 ---EGLCGIGTQSSYP 346
+G CGI +SYP
Sbjct: 299 TDRQGTCGIAMMASYP 314
>gi|312451836|gb|ADQ85985.1| actinidin [Actinidia chinensis]
Length = 380
Score = 300 bits (767), Expect = 9e-79, Method: Compositional matrix adjust.
Identities = 155/335 (46%), Positives = 208/335 (62%), Gaps = 10/335 (2%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEY 77
+F +L++S A + V M+E W+ ++G+SY E E RF+IFKE L +
Sbjct: 13 LFFSTLLILSLAFNTKNLTQRTNDEVKAMYESWLIKYGKSYNSLGEWERRFEIFKETLRF 72
Query: 78 IEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
I++ N + NR+YK+G N+F+DLT++EFR+ Y G+ S S+++ S+ +Y+ +P
Sbjct: 73 IDEHNADTNRSYKVGLNQFADLTDEEFRSTYLGFT--SGSNKTKVSN--RYEPRVGQVLP 128
Query: 138 TSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCS-TNGN 196
+ +DWR AV IK Q ECG CWAFSA+A VEGI KI LI LSEQ+L+DC T
Sbjct: 129 SYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSEQELIDCGRTQNT 188
Query: 197 NGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSA-AQKAAAAKISNYEEVPSGDEQ 255
GC GG + F++II N GI TE+ YPY A G C+ Q I YE VP +E
Sbjct: 189 RGCNGGYITDGFQFIINNGGINTEENYPYTAQDGECNVDLQNEKYVTIDTYENVPYNNEW 248
Query: 256 ALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIK 315
AL AV+ QPVS+ + A FK Y GIF G CGT +DHAVTIVG+G TE G +YW++K
Sbjct: 249 ALQTAVTYQPVSVALDAAGDAFKQYSSGIFTGPCGTAIDHAVTIVGYG-TEGGIDYWIVK 307
Query: 316 NSWGDTWGDAGYMKILRD---EGLCGIGTQSSYPL 347
NSW TWG+ GYM+ILR+ G CGI T SYP+
Sbjct: 308 NSWDTTWGEEGYMRILRNVGGAGTCGIATMPSYPV 342
>gi|296081395|emb|CBI16828.3| unnamed protein product [Vitis vinifera]
Length = 359
Score = 300 bits (767), Expect = 9e-79, Method: Compositional matrix adjust.
Identities = 148/342 (43%), Positives = 215/342 (62%), Gaps = 8/342 (2%)
Query: 12 KINTIPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIF 71
K+ + + ++++ ++ + E+S+ +++E+W + H S +D EK RF +F
Sbjct: 5 KVILVALSLVLVFGLAESFDFDEKDLASEESLWDLYERWRSYHTVS-RDLEEKNKRFNVF 63
Query: 72 KENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSH-RSTTSSTFKYQN 130
KEN +++ K N+ ++ YKL N+F+D+TN EFR+ Y G K+ R T + +
Sbjct: 64 KENTKHVHKVNQM-DKPYKLKLNKFADMTNHEFRSSYGGSKVKHYRMLRGDRRGTGGFMH 122
Query: 131 LSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVD 190
T +P S+DWR K AVT IKDQ +CG CWAFS V VEGI +I L+ LSEQQL+D
Sbjct: 123 EKTTYLPPSVDWRKKGAVTGIKDQGKCGSCWAFSTVVGVEGINQIKTKELLSLSEQQLID 182
Query: 191 CSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQ-KAAAAKISNYEEV 249
C + ++GC GG ME AFE+I +N GI TE+ YPY+A C + A I +E V
Sbjct: 183 CDRSDDHGCNGGLMESAFEFIKKNGGITTENNYPYKAKDERCDMLKMNAPVVTIDGHESV 242
Query: 250 PSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGA 309
P DE+AL+KAV+ QPVS+ I A ++ + Y EG+F+G CGT+LDH V IVG+GTT DG
Sbjct: 243 PVNDERALMKAVAHQPVSVAIDAGGSDLQFYSEGVFDGECGTELDHGVAIVGYGTTLDGT 302
Query: 310 NYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYPL 347
YW++KNSWG WG+ GY+++ R EG CGI ++SYP+
Sbjct: 303 KYWIVKNSWGAEWGEKGYIRMARGIQAAEGQCGIAMEASYPV 344
>gi|242074728|ref|XP_002447300.1| hypothetical protein SORBIDRAFT_06g032360 [Sorghum bicolor]
gi|241938483|gb|EES11628.1| hypothetical protein SORBIDRAFT_06g032360 [Sorghum bicolor]
Length = 471
Score = 299 bits (766), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 146/317 (46%), Positives = 207/317 (65%), Gaps = 11/317 (3%)
Query: 40 EQSVVEMHEKWMAQHGRSYKDEL-EKEMRFKIFKENLEYIEKAN-KEGNRTYKLGTNRFS 97
E V M+E+WMA+HG++ + L E + RF+ F +NL +++ N + G R Y+LG NRF+
Sbjct: 45 EAQVRAMYEQWMARHGKAASNALGEHDRRFRAFWDNLRFVDAHNARAGARGYRLGINRFA 104
Query: 98 DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQEC 157
DLTN EFRA Y + + +T ++ +Y++ + +P +DWR K AV P+K+Q +C
Sbjct: 105 DLTNAEFRAAY--LSAGARNGTATAATGERYRHDGVEALPEFVDWRQKGAVAPVKNQGQC 162
Query: 158 GCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNG-NNGCGGGTMEKAFEYIIQNQG 216
G CWAFSAV AVEGI +I L+ LSEQ+LVDCS NG N GC GG M+ AF +I+ N G
Sbjct: 163 GSCWAFSAVGAVEGINQIVTGELVTLSEQELVDCSKNGQNGGCDGGMMDDAFAFIVGNGG 222
Query: 217 IATEDEYPYQAVQGTCSAAQKAA-AAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTT 275
I T+ +YPY A G C A+++ I +E VP DE++L KAV+ QPV++ I A
Sbjct: 223 IDTDKDYPYTARDGKCDVAKRSRHVVSIDGFEGVPRNDEKSLQKAVAHQPVAVAIEAGGR 282
Query: 276 EFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGA-NYWLIKNSWGDTWGDAGYMKILRD- 333
EF+ Y+ G+F G CGT LDH V VG+GT DG +YWL++NSWG WG+ GY+++ R+
Sbjct: 283 EFQLYQSGVFTGRCGTSLDHGVVAVGYGTEADGGRDYWLVRNSWGADWGEGGYIRMERNV 342
Query: 334 ---EGLCGIGTQSSYPL 347
G CGI ++SYP+
Sbjct: 343 GARAGKCGIAMEASYPV 359
>gi|2144501|pir||TAGB actinidain (EC 3.4.22.14) precursor - kiwi fruit
gi|166317|gb|AAA32629.1| actinidin [Actinidia deliciosa]
Length = 380
Score = 299 bits (766), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 155/335 (46%), Positives = 208/335 (62%), Gaps = 10/335 (2%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEY 77
+F +L++S A + V M+E W+ ++G+SY E E RF+IFKE L +
Sbjct: 13 LFFSTLLILSLAFNAKNLTQRTNDEVKAMYESWLIKYGKSYNSLGEWERRFEIFKETLRF 72
Query: 78 IEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
I++ N + NR+YK+G N+F+DLT++EFR+ Y G+ S S+++ S+ +Y+ +P
Sbjct: 73 IDEHNADTNRSYKVGLNQFADLTDEEFRSTYLGFT--SGSNKTKVSN--RYEPRVGQVLP 128
Query: 138 TSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCS-TNGN 196
+ +DWR AV IK Q ECG CWAFSA+A VEGI KI LI LSEQ+L+DC T
Sbjct: 129 SYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSEQELIDCGRTQNT 188
Query: 197 NGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSA-AQKAAAAKISNYEEVPSGDEQ 255
GC GG + F++II N GI TE+ YPY A G C+ Q I YE VP +E
Sbjct: 189 RGCNGGYITDGFQFIINNGGINTEENYPYTAQDGECNVELQNEKYVTIDTYENVPYNNEW 248
Query: 256 ALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIK 315
AL AV+ QPVS+ + A FK Y GIF G CGT +DHAVTIVG+G TE G +YW++K
Sbjct: 249 ALQTAVTYQPVSVALDAAGDAFKQYSSGIFTGPCGTAIDHAVTIVGYG-TEGGIDYWIVK 307
Query: 316 NSWGDTWGDAGYMKILRD---EGLCGIGTQSSYPL 347
NSW TWG+ GYM+ILR+ G CGI T SYP+
Sbjct: 308 NSWDTTWGEEGYMRILRNVGGAGTCGIATMPSYPV 342
>gi|220983358|dbj|BAH11164.1| cysteine protease [Hordeum vulgare]
Length = 462
Score = 299 bits (766), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 147/315 (46%), Positives = 203/315 (64%), Gaps = 12/315 (3%)
Query: 40 EQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKAN---KEGNRTYKLGTNRF 96
E+ V M+ +WMA+H +Y E+E RF+ F+ NL YI++ N G +++LG NRF
Sbjct: 35 EEEVRRMYAEWMAEHHSTYNPIGEEERRFEAFRNNLRYIDQHNAAADAGVHSFRLGLNRF 94
Query: 97 SDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQE 156
+DLTN+E+R+ Y G + R ++ +YQ ++P S+DWR K AV +KDQ
Sbjct: 95 ADLTNEEYRSTYLGARTKPDRERKLSA---RYQAADNDELPESVDWRKKGAVGAVKDQGG 151
Query: 157 CGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQG 216
CG CWAFSA+AAVEGI +I ++I LSEQ+LVDC T+ N GC GG M+ AFE+II N G
Sbjct: 152 CGSCWAFSAIAAVEGINQIVTGDMIPLSEQELVDCDTSYNQGCNGGLMDYAFEFIINNGG 211
Query: 217 IATEDEYPYQAVQGTCSAAQK-AAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTT 275
I +E++YPY+ C A +K A I YE+VP E++L KAV+ QP+S+ I A
Sbjct: 212 IDSEEDYPYKERDNRCDANKKNAKVVTIDGYEDVPVNSEKSLQKAVANQPISVAIEAGGR 271
Query: 276 EFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD-- 333
F+ YK GIF G CGT LDH V VG+G TE+G +YWL++NSWG WG+ GY+++ R+
Sbjct: 272 AFQLYKSGIFTGTCGTALDHGVAAVGYG-TENGKDYWLVRNSWGSVWGENGYIRMERNIK 330
Query: 334 --EGLCGIGTQSSYP 346
G CGI + SYP
Sbjct: 331 ASSGKCGIAVEPSYP 345
>gi|449524070|ref|XP_004169046.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like, partial
[Cucumis sativus]
Length = 314
Score = 299 bits (766), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 155/316 (49%), Positives = 207/316 (65%), Gaps = 16/316 (5%)
Query: 37 STHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRF 96
S+ + + ++KWM ++GR YK E E RF I++ N++YI+ N N ++ L N F
Sbjct: 9 SSCSSDIQDRYQKWMDKYGRQYKSREEWERRFTIYQANVQYIDNFNSM-NHSHTLAENNF 67
Query: 97 SDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQE 156
+DLTN+EF+A Y GYK S + F+Y N M ++PT++DWR + AVTPIK+Q +
Sbjct: 68 ADLTNEEFKATYLGYKTVS-----IPDTCFRYGN--MVNLPTNVDWRQEGAVTPIKNQGQ 120
Query: 157 CGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCS-TNGNNGCGGGTMEKAFEYIIQNQ 215
CG CWAFSAVAAVEGI KI LI LSEQ+LVDC T+GN GC GG M KAFE+ I+
Sbjct: 121 CGSCWAFSAVAAVEGINKIKAGKLISLSEQELVDCDVTSGNQGCNGGYMYKAFEF-IKRT 179
Query: 216 GIATEDEYPYQAVQGTCS-AAQKAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYT 274
G+ TE EYPYQ + C+ +K IS YE+VP DE++L AV+ QPVS+ I A
Sbjct: 180 GLTTEIEYPYQGAESACNEQKEKYQFVSISGYEKVPVNDEKSLKAAVANQPVSVAIDAEG 239
Query: 275 TEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD- 333
F+ Y GIF+G CG QL+H V IVG+G T + A YWL+KNSWG WG++GY+++ RD
Sbjct: 240 NNFQFYSGGIFSGNCGNQLNHGVAIVGYGETSNQA-YWLVKNSWGTDWGESGYIRMKRDS 298
Query: 334 ---EGLCGIGTQSSYP 346
+G CGI +SYP
Sbjct: 299 TDKQGTCGIAMMASYP 314
>gi|297809385|ref|XP_002872576.1| hypothetical protein ARALYDRAFT_489965 [Arabidopsis lyrata subsp.
lyrata]
gi|297318413|gb|EFH48835.1| hypothetical protein ARALYDRAFT_489965 [Arabidopsis lyrata subsp.
lyrata]
Length = 371
Score = 299 bits (766), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 150/355 (42%), Positives = 222/355 (62%), Gaps = 28/355 (7%)
Query: 18 MFIIIILLV--SCAS----QVVSSRSTHE--------QSVVE-----MHEKWMAQHGRSY 58
M ++++ +V SCA+ +VSS H Q V + M E WM +HG+ Y
Sbjct: 8 MLVLLLAMVISSCATAMDMSIVSSNDNHHVTNGPGRRQGVFDAEATLMFESWMVKHGKVY 67
Query: 59 KDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSH 118
+ EKE R IF++NL +I N E N +Y+LG NRF+DL+ E+ + G P +
Sbjct: 68 ESVAEKERRLTIFEDNLRFITNRNAE-NLSYRLGLNRFADLSLHEYAQICHGADPRPPRN 126
Query: 119 RSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGA 178
+S+ +Y+ +P S+DWR++ AVT +KDQ +C CWAFS V AVEG+ KI
Sbjct: 127 HVFMTSSNRYKTSDGDVLPKSVDWRNEGAVTEVKDQGQCRSCWAFSTVGAVEGLNKIVTG 186
Query: 179 NLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKA 238
L+ LSEQ L++C+ NNGCGGG +E A+E+I+ N G+ T+++YPY+A+ G C+ K
Sbjct: 187 ELVTLSEQDLINCNKE-NNGCGGGKVETAYEFIMNNGGLGTDNDYPYKALNGVCNDRLKE 245
Query: 239 --AAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHA 296
I YE +P+ DE AL+KAV+ QPV+ + + + EF+ Y G+F+G CGT L+H
Sbjct: 246 NNKNVMIDGYENLPANDESALMKAVAHQPVTAVVDSSSREFQLYASGVFDGTCGTNLNHG 305
Query: 297 VTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYPL 347
V +VG+G TE+G +YW+++NS G+TWG+AGYMK+ R+ GLCGI ++SYPL
Sbjct: 306 VVVVGYG-TENGRDYWIVRNSRGNTWGEAGYMKMARNIANPRGLCGIAMRASYPL 359
>gi|32396020|gb|AAP41847.1| senescence-associated cysteine protease [Anthurium andraeanum]
Length = 460
Score = 299 bits (766), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 153/325 (47%), Positives = 208/325 (64%), Gaps = 12/325 (3%)
Query: 32 VVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANK-EGNRTYK 90
+V+ R+ E+ V ++E W+ +G++Y EKE RF+IF +NL YI+ N+ E N +Y
Sbjct: 25 IVAERT--EEEVRLLYEGWLVGNGKAYNLLGEKERRFEIFWDNLRYIDDHNRAENNHSYT 82
Query: 91 LGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT--DVPTSLDWRDKKAV 148
LG RF+DLTN+E+R+ Y G K R + + ++LS D+P +DWR+K AV
Sbjct: 83 LGLTRFADLTNEEYRSTYLGVKPGQVRPRRANRAPGRGRDLSANGDDLPQKVDWREKGAV 142
Query: 149 TPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAF 208
PIKDQ CG CWAFS VAAVEGI +I +LI LSEQ+LVDC T N GC GG M+ AF
Sbjct: 143 APIKDQGGCGSCWAFSTVAAVEGINQIVTGDLIVLSEQELVDCDTAYNEGCNGGLMDYAF 202
Query: 209 EYIIQNQGIATEDEYPYQAVQGTCSAAQK-AAAAKISNYEEVPSGDEQALLKAVSMQPVS 267
++II N GI TE++YPY+ G C +K A I +YE+V DE AL AV+ QPVS
Sbjct: 203 QFIISNGGIDTEEDYPYKERDGLCDPNRKNAKVVSIDSYEDVLENDEHALKTAVAHQPVS 262
Query: 268 IGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGY 327
+ I F+ YK GIF+G CG LDH V VG+G TE G +YW+++NSWG +WG+AGY
Sbjct: 263 VAIEGGGRSFQLYKSGIFDGRCGIDLDHGVVAVGYG-TESGKDYWIVRNSWGKSWGEAGY 321
Query: 328 MKILRD-----EGLCGIGTQSSYPL 347
+++ R+ G CGI + SYP+
Sbjct: 322 IRMERNLPSSSSGKCGIAIEPSYPI 346
>gi|18396952|ref|NP_564322.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|332192922|gb|AEE31043.1| cysteine proteinase-like protein [Arabidopsis thaliana]
Length = 334
Score = 299 bits (765), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 153/344 (44%), Positives = 220/344 (63%), Gaps = 23/344 (6%)
Query: 13 INTIPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFK 72
+ ++ + + I+ + SQ + +EQS+V+ H++WM Q R YKDE EKEMR K+FK
Sbjct: 4 VRSVFVALTILSMDLRISQARPHVTLNEQSIVDYHQQWMTQFSRVYKDESEKEMRLKVFK 63
Query: 73 ENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
+NL++IE N GN++Y LG N F+D +EF A +TG ++ S + T +N +
Sbjct: 64 KNLKFIENFNNMGNQSYTLGVNEFTDWKTEEFLATHTGLRVNVTSLSELFNKTKPSRNWN 123
Query: 133 MTDVPT---SLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLV 189
M+D+ S DWRD+ AVTP+K Q C +TKISG NL+ LSEQQL+
Sbjct: 124 MSDIDMEDESKDWRDEGAVTPVKYQGAC-------------RLTKISGKNLLTLSEQQLI 170
Query: 190 DCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSA-AQKAAAAKISNYEE 248
DC N GC GG E+AF+YII+N G++ E EYPYQ + +C A A++A +I ++
Sbjct: 171 DCDIEKNGGCNGGEFEEAFKYIIKNGGVSLETEYPYQVKKESCRANARRAPHTQIRGFQM 230
Query: 249 VPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGV-CGTQLDHAVTIVGFGTTED 307
VPS +E+ALL+AV QPVS+ I A F YK G++ G+ CGT ++HAVTIVG+GT
Sbjct: 231 VPSHNERALLEAVRRQPVSVLIDARADSFGHYKGGVYAGLDCGTDVNHAVTIVGYGTM-S 289
Query: 308 GANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYPL 347
G NYW++KNSWG++WG+ GYM+I RD +G+CGI ++YP+
Sbjct: 290 GLNYWVLKNSWGESWGENGYMRIRRDVEWPQGMCGIAQVAAYPV 333
>gi|193806686|sp|A5HII1.1|ACTN_ACTDE RecName: Full=Actinidain; Short=Actinidin; AltName: Full=Allergen
Act d 1; AltName: Allergen=Act d 1; Flags: Precursor
gi|146215974|gb|ABQ10189.1| actinidin Act1a [Actinidia deliciosa]
Length = 380
Score = 299 bits (765), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 155/335 (46%), Positives = 208/335 (62%), Gaps = 10/335 (2%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEY 77
+F +L++S A + V M+E W+ ++G+SY E E RF+IFKE L +
Sbjct: 13 LFFSTLLILSLAFNAKNLTQRTNDEVKAMYESWLIKYGKSYNSLGEWERRFEIFKETLRF 72
Query: 78 IEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
I++ N + NR+YK+G N+F+DLT++EFR+ Y G+ S S+++ S+ +Y+ +P
Sbjct: 73 IDEHNADTNRSYKVGLNQFADLTDEEFRSTYLGFT--SGSNKTKVSN--RYEPRVGQVLP 128
Query: 138 TSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCS-TNGN 196
+ +DWR AV IK Q ECG CWAFSA+A VEGI KI LI LSEQ+L+DC T
Sbjct: 129 SYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSEQELIDCGRTQNT 188
Query: 197 NGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSA-AQKAAAAKISNYEEVPSGDEQ 255
GC GG + F++II N GI TE+ YPY A G C+ Q I YE VP +E
Sbjct: 189 RGCNGGYITDGFQFIINNGGINTEENYPYTAQDGECNLDLQNEKYVTIDTYENVPYNNEW 248
Query: 256 ALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIK 315
AL AV+ QPVS+ + A FK Y GIF G CGT +DHAVTIVG+G TE G +YW++K
Sbjct: 249 ALQTAVTYQPVSVALDAAGDAFKHYSSGIFTGPCGTAIDHAVTIVGYG-TEGGIDYWIVK 307
Query: 316 NSWGDTWGDAGYMKILRD---EGLCGIGTQSSYPL 347
NSW TWG+ GYM+ILR+ G CGI T SYP+
Sbjct: 308 NSWDTTWGEEGYMRILRNVGGAGTCGIATMPSYPV 342
>gi|195637152|gb|ACG38044.1| vignain precursor [Zea mays]
Length = 377
Score = 298 bits (764), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 146/292 (50%), Positives = 188/292 (64%), Gaps = 18/292 (6%)
Query: 68 FKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHR-------- 119
F +FK N+ I + N+ + YKL NRF D+T DEFR Y G ++ HR
Sbjct: 70 FNVFKANVRLIHEFNRR-DEPYKLRLNRFGDMTADEFRRHYAGSRVAH--HRMFRGDRQG 126
Query: 120 STTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGAN 179
S+ S++F Y + DVP S+DWR K AVT +KDQ +CG CWAFS +AAVEGI I N
Sbjct: 127 SSASASFMYAD--ARDVPASVDWRQKGAVTDVKDQGQCGSCWAFSTIAAVEGINAIKTKN 184
Query: 180 LIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAA 239
L LSEQQLVDC T N GC GG M+ AF+YI ++ G+A ED YPY+A Q +C + A
Sbjct: 185 LTSLSEQQLVDCDTKANAGCNGGLMDYAFQYIAKHGGVAAEDAYPYRARQASCKKS-PAP 243
Query: 240 AAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTI 299
I YE+VP+ DE AL KAV+ QPVS+ I A + F+ Y EG+F+G CGT+LDH V
Sbjct: 244 VVTIDGYEDVPANDESALKKAVAHQPVSVAIEASGSHFQFYSEGVFSGRCGTELDHGVAA 303
Query: 300 VGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYPL 347
VG+G T DG YWL+KNSWG WG+ GY+++ RD EG CGI ++SYP+
Sbjct: 304 VGYGVTADGTKYWLVKNSWGPEWGEKGYIRMARDVAAKEGHCGIAMEASYPV 355
>gi|357507505|ref|XP_003624041.1| Cysteine proteinase [Medicago truncatula]
gi|355499056|gb|AES80259.1| Cysteine proteinase [Medicago truncatula]
Length = 342
Score = 298 bits (764), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 157/316 (49%), Positives = 208/316 (65%), Gaps = 25/316 (7%)
Query: 42 SVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDL-- 99
S+ E E W ++G YKD E++ F+IFK N+ YI+ N GN+ YKL NRF D
Sbjct: 37 SLSERFEYWKTKYGVVYKDVAEQKKHFQIFKHNVAYIDYFNAAGNKPYKLAINRFVDKPI 96
Query: 100 --TNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQEC 157
++D F + +T ++TFKY+N+ TD+P ++DWR + AVTPIK+Q +C
Sbjct: 97 EDSDDGFER----------TTTTTPTTTFKYENV--TDIPATVDWRKRGAVTPIKNQGKC 144
Query: 158 GCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGN-NGCGGGTMEKAFEYIIQNQG 216
G CWAFSAVAA+EGI KI+ NL+ LSEQQLVDC +G GC G M AF++I++N G
Sbjct: 145 GSCWAFSAVAAIEGIQKITSGNLVSLSEQQLVDCDRSGRTKGCDNGNMINAFKFILENGG 204
Query: 217 IATEDEYPY-QAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTT 275
IATE YPY + V+GTC + +I +YEEVPS E +LLKAV+ QPVS+GI
Sbjct: 205 IATEANYPYKRVVKGTCKKV--SHKVQIKSYEEVPSNSEDSLLKAVANQPVSVGIDMRGM 262
Query: 276 EFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD-- 333
FK Y GIF G CGT+ +HA+TIVG+GT++DG YWL+KNSW WG+ GY++I RD
Sbjct: 263 -FKFYSSGIFTGECGTKPNHALTIVGYGTSKDGIKYWLVKNSWSKRWGEKGYIRIKRDID 321
Query: 334 --EGLCGIGTQSSYPL 347
EGLCGI + SYP+
Sbjct: 322 AKEGLCGIAMKPSYPI 337
>gi|15984|emb|CAA34486.1| unnamed protein product [Actinidia deliciosa]
Length = 380
Score = 298 bits (764), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 155/335 (46%), Positives = 208/335 (62%), Gaps = 10/335 (2%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEY 77
+F +L++S A + V M+E W+ ++G+SY E E RF+IFKE L +
Sbjct: 13 LFFSTLLILSLAFNAKNLTQRTNDEVKAMYESWLIKYGKSYNSLGEWERRFEIFKETLRF 72
Query: 78 IEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
I++ N + NR+YK+G N+F+DLT++EFR+ Y G+ S S+++ S+ +Y+ +P
Sbjct: 73 IDEHNADTNRSYKVGLNQFADLTDEEFRSTYLGFT--SGSNKTKVSN--RYEPRFGQVLP 128
Query: 138 TSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCS-TNGN 196
+ +DWR AV IK Q ECG CWAFSA+A VEGI KI LI LSEQ+L+DC T
Sbjct: 129 SYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSEQELIDCGRTQNT 188
Query: 197 NGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSA-AQKAAAAKISNYEEVPSGDEQ 255
GC GG + F++II N GI TE+ YPY A G C+ Q I YE VP +E
Sbjct: 189 RGCNGGYITDGFQFIINNGGINTEENYPYTAQDGECNLDLQNEKYVTIDTYENVPYNNEW 248
Query: 256 ALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIK 315
AL AV+ QPVS+ + A FK Y GIF G CGT +DHAVTIVG+G TE G +YW++K
Sbjct: 249 ALQTAVTYQPVSVALDAAGDAFKHYSSGIFTGPCGTAIDHAVTIVGYG-TEGGIDYWIVK 307
Query: 316 NSWGDTWGDAGYMKILRD---EGLCGIGTQSSYPL 347
NSW TWG+ GYM+ILR+ G CGI T SYP+
Sbjct: 308 NSWDTTWGEEGYMRILRNVGGAGTCGIATMPSYPV 342
>gi|50355619|dbj|BAD29958.1| cysteine protease [Daucus carota]
Length = 496
Score = 298 bits (764), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 144/307 (46%), Positives = 200/307 (65%), Gaps = 7/307 (2%)
Query: 46 MHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFR 105
+ E W+ HG+SY E+E RF+IFK NL YI++ N +R +KLG N+F+DLTN+E+R
Sbjct: 44 LFESWLVTHGKSYNALGEEEKRFQIFKNNLRYIDEQNLVEDRGFKLGLNKFADLTNEEYR 103
Query: 106 ALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSA 165
+ YTG K + ++ + +Y LS +P S+DWR+ AV +KDQ CG CWAFS
Sbjct: 104 SKYTGIK-SKDLRKKVSAKSGRYATLSGESLPESVDWRESGAVATVKDQGSCGSCWAFST 162
Query: 166 VAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPY 225
++AVEGI +I+ LI LSEQ+LVDC + N GC GG M+ AFE+II N GI T+ +YPY
Sbjct: 163 ISAVEGINQIATGKLITLSEQELVDCDRSYNEGCNGGLMDYAFEFIINNGGIDTDVDYPY 222
Query: 226 QAVQGTCSAAQK-AAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGI 284
G C +K A I +YE+VP+ DE AL KA + QP+S+ I A +F+ Y GI
Sbjct: 223 TGRDGKCDQYRKNAKVVTIDSYEDVPAYDELALKKAAANQPISVAIEASGRDFQFYDSGI 282
Query: 285 FNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILR----DEGLCGIG 340
F G CG LDH V +VG+G TE+G +YW+++NSWG WG+ GY+++ R G+CGI
Sbjct: 283 FTGKCGIALDHGVVVVGYG-TENGKDYWIVRNSWGADWGENGYLRMERGISSKTGICGIA 341
Query: 341 TQSSYPL 347
+ SYP+
Sbjct: 342 IEPSYPV 348
>gi|535473|emb|CAA53377.1| cysteine protease [Vicia sativa]
Length = 368
Score = 298 bits (764), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 151/345 (43%), Positives = 212/345 (61%), Gaps = 14/345 (4%)
Query: 13 INTIPMFIIIILL---VSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFK 69
+ +P F+ L+ ++ Q+ + RS E V+ M+E+W+ +H + Y EK+ RF+
Sbjct: 4 MTILPFFLFFSLITFSLALDIQLPTGRSNDE--VMTMYEEWLVKHQKVYNGLREKDQRFQ 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSST-FKY 128
IFK+NL +I++ N + N TY +G N+F+D+TN+E+R +Y G + T +Y
Sbjct: 62 IFKDNLNFIDEHNAQ-NYTYIVGLNKFADMTNEEYRDMYLGTRSDIKRRIMKNKITGHRY 120
Query: 129 QNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQL 188
S +P +DWR K A+T IKDQ CG CWAFS +A VE I KI L+ LSEQ+L
Sbjct: 121 AYNSGDRLPVHVDWRLKGAITHIKDQGSCGSCWAFSTIATVEAINKIVTGKLVSLSEQEL 180
Query: 189 VDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAA-AKISNYE 247
VDC N GC GG M+ AFE+II N GI T+ YPY+ +G C +K A I YE
Sbjct: 181 VDCDRAFNEGCNGGLMDYAFEFIIGNGGIDTDQHYPYKGFEGRCDPTRKKAKIVSIDGYE 240
Query: 248 EVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTED 307
+VPS +E AL KAV+ QPVS+ I A + Y+ G+F G CGT LDHAV IVG+G +E+
Sbjct: 241 DVPSNNENALKKAVAHQPVSVAIEASGRALQLYQSGVFTGKCGTSLDHAVVIVGYG-SEN 299
Query: 308 GANYWLIKNSWGDTWGDAGYMKILRD-----EGLCGIGTQSSYPL 347
G +YWL++NSWG WG+ GY K+ R+ G CGI ++SYP+
Sbjct: 300 GLDYWLVRNSWGTNWGEDGYFKMERNVKGTHTGKCGIAVEASYPV 344
>gi|357162587|ref|XP_003579458.1| PREDICTED: oryzain beta chain-like [Brachypodium distachyon]
Length = 470
Score = 298 bits (764), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 143/307 (46%), Positives = 202/307 (65%), Gaps = 11/307 (3%)
Query: 50 WMAQHGRSYKDEL-EKEMRFKIFKENLEYIEKANKE---GNRTYKLGTNRFSDLTNDEFR 105
W A+HG + L E+E RF+ F +NL +++ N G ++LG NRF+DLTNDEFR
Sbjct: 55 WRAEHGSGNSNSLGEEERRFRAFWDNLRFVDAHNARAAAGEEGFRLGMNRFADLTNDEFR 114
Query: 106 ALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSA 165
A Y G K + +Y++ + ++P ++DWR+K AV P+K+Q +CG CWAFSA
Sbjct: 115 AAYLGVKGAGQRRSARAGVGERYRHDGVEELPEAVDWREKGAVAPVKNQGQCGSCWAFSA 174
Query: 166 VAAVEGITKISGANLIQLSEQQLVDCSTNG-NNGCGGGTMEKAFEYIIQNQGIATEDEYP 224
V+AVE I ++ L+ LSEQ+LV+C NG +NGC GG M+ AF++II N GI TED+YP
Sbjct: 175 VSAVESINQLVTGELVTLSEQELVECDINGQSNGCNGGLMDDAFDFIINNGGIDTEDDYP 234
Query: 225 YQAVQGTCSAAQK-AAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEG 283
Y+A+ G C ++ A I +E+VP DE++L KAV+ QPVS+ I A EF+ Y G
Sbjct: 235 YKALDGKCDINRRNAKVVSIDGFEDVPENDEKSLQKAVAHQPVSVAIEAGGREFQLYHSG 294
Query: 284 IFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD----EGLCGI 339
+F G CGT+LDH V VG+G TE+G +YW+++NSWG WG+AGY+++ R+ G CGI
Sbjct: 295 VFTGRCGTELDHGVVAVGYG-TENGKDYWIVRNSWGPKWGEAGYLRMERNINATTGKCGI 353
Query: 340 GTQSSYP 346
SSYP
Sbjct: 354 AMMSSYP 360
>gi|302763831|ref|XP_002965337.1| hypothetical protein SELMODRAFT_230602 [Selaginella moellendorffii]
gi|300167570|gb|EFJ34175.1| hypothetical protein SELMODRAFT_230602 [Selaginella moellendorffii]
Length = 343
Score = 298 bits (764), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 150/343 (43%), Positives = 213/343 (62%), Gaps = 16/343 (4%)
Query: 14 NTIPMFIIIILLVSCASQVVSSRSTHEQS----VVEMHEKWMAQHGRSYKDELEKEMRFK 69
N I +I++++V ++ + E + M E W A+HG+SY +LEK R
Sbjct: 4 NMIASTLILLVVVGATPFAIARPAALEDGRALEIKNMFEDWAAKHGKSYSSDLEKARRLM 63
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTG-YKMPSPSHRSTTSSTFKY 128
IF + L YIEK N + N T+ LG N+FSDLTN EFRA++ G +K P R
Sbjct: 64 IFSDTLAYIEKHNAQPNTTFTLGLNKFSDLTNAEFRAMHVGKFKRPRYQDRLPAED---- 119
Query: 129 QNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQL 188
+++ ++ +PTSLDWR K AVTPIKDQ +CG CWAFSA+A++E ++ L+ LSEQQL
Sbjct: 120 EDVDVSSLPTSLDWRQKGAVTPIKDQGDCGSCWAFSAIASIESAHFLATKELVSLSEQQL 179
Query: 189 VDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKA---AAAKISN 245
+DC T + GC GG ME AF+++++N G+ TE YPY G+C+A + A A+I+
Sbjct: 180 MDCDTV-DAGCDGGLMETAFKFVVKNGGVTTEASYPYTGSVGSCNANKVAIINKVAEITG 238
Query: 246 YEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
++ V AL+KAVS PV++ I F++YK GI +G CG LDH V ++G+G T
Sbjct: 239 FKVVTEDSADALMKAVSKTPVTVSICGSDENFQNYKSGILSGQCGDSLDHGVLLIGYG-T 297
Query: 306 EDGANYWLIKNSWGDTWGDAGYMKILRD--EGLCGIGTQSSYP 346
E G YW+IKNSWG +WG+ G+MKI R +G+CG+ SSYP
Sbjct: 298 EGGMPYWIIKNSWGTSWGEDGFMKIERKDGDGICGMNGDSSYP 340
>gi|146215992|gb|ABQ10198.1| actinidin Act4b [Actinidia eriantha]
Length = 379
Score = 298 bits (763), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 145/309 (46%), Positives = 203/309 (65%), Gaps = 9/309 (2%)
Query: 43 VVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTND 102
V+ M E W+ ++G+SY EKE RF+IFK+NL ++++ N + NR+YK+G N+FSDLT +
Sbjct: 44 VMAMFESWLVEYGKSYNALGEKERRFEIFKDNLRFVDEHNADVNRSYKVGLNQFSDLTLE 103
Query: 103 EFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWA 162
E+ ++Y G K T+ + +Y+ +P S+DWR K AV +K+Q CG CW
Sbjct: 104 EYSSIYLGTKF----DMRMTNVSDRYEPRVGDQLPNSIDWRKKGAVLGVKNQGNCGSCWT 159
Query: 163 FSAVAAVEGITKISGANLIQLSEQQLVDCSTNG-NNGCGGGTMEKAFEYIIQNQGIATED 221
F+ +AAVE I +I NLI LSEQQ+VDC NNGC GG+ A+++II N GI TE
Sbjct: 160 FAPIAAVEAINQIVTGNLISLSEQQIVDCQRKSPNNGCKGGSRAGAYQFIIDNGGINTEA 219
Query: 222 EYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYK 281
YPY+A G C + I YE VP +E+AL KAVS Q VS+GIA+ ++EFK+YK
Sbjct: 220 NYPYKAQDGECDEQKNQKYVTIDRYENVPRKNEKALQKAVSNQLVSVGIASNSSEFKAYK 279
Query: 282 EGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILR---DEGLCG 338
GIF G CG ++DHAVTIVG+G TE G +YW+++NSWG WG+ GY+++ R + G C
Sbjct: 280 SGIFTGPCGAKIDHAVTIVGYG-TEGGMDYWIVRNSWGSNWGENGYVRMQRNVGNAGTCF 338
Query: 339 IGTQSSYPL 347
I T +YP+
Sbjct: 339 IATSPNYPV 347
>gi|302759380|ref|XP_002963113.1| hypothetical protein SELMODRAFT_270344 [Selaginella moellendorffii]
gi|300169974|gb|EFJ36576.1| hypothetical protein SELMODRAFT_270344 [Selaginella moellendorffii]
Length = 479
Score = 298 bits (763), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 150/322 (46%), Positives = 203/322 (63%), Gaps = 16/322 (4%)
Query: 38 THEQSVVEMHEKWMAQHGRSYKDEL--------EKEMRFKIFKENLEYIEKANKEGNRTY 89
+ E+ + + + WM QHG+SY + EK R+ IFK+NL +I N E N+ Y
Sbjct: 48 SSEERLQALFDSWMLQHGKSYAENALSGDSQAGEKATRYGIFKDNLRFIHGEN-EKNQGY 106
Query: 90 KLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVT 149
LG N F+DLTN+EFRA G + R T+ F+Y ++ + D+P S+DWR+K AV
Sbjct: 107 FLGLNAFADLTNEEFRAQRHGGRFDRSRER-TSYEEFRYGSVQLKDLPDSIDWREKGAVV 165
Query: 150 PIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFE 209
+KDQ CG CWAFSAVAA+EG+ K++ L+ LSEQ+LVDC + GC GG M+ AF
Sbjct: 166 GVKDQGSCGSCWAFSAVAAIEGVNKLATGELVSLSEQELVDCDKGEDEGCNGGLMDYAFG 225
Query: 210 YIIQNQGIATEDEYPYQAVQGTCSAAQ-KAAAAKISNYEEVPSGDEQALLKAVSMQPVSI 268
++I+N G+ TE +YPY+ C ++ A I YE+VP DE ALLKAV+ QPVS+
Sbjct: 226 FVIKNGGLDTEADYPYKGYGTRCDRSKMNAKVVTIDGYEDVPVNDETALLKAVAHQPVSV 285
Query: 269 GIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYM 328
I A + + Y+ GIF G CGT LDH VT VG+G EDG YW+IKNSWG WG+ GY+
Sbjct: 286 AIDAGGSSMQFYRSGIFTGRCGTDLDHGVTNVGYG-KEDGKAYWIIKNSWGSNWGEKGYI 344
Query: 329 KILRD----EGLCGIGTQSSYP 346
K+ R+ GLCGI ++SYP
Sbjct: 345 KMARNTGLAAGLCGINMEASYP 366
>gi|297799636|ref|XP_002867702.1| hypothetical protein ARALYDRAFT_329301 [Arabidopsis lyrata subsp.
lyrata]
gi|297313538|gb|EFH43961.1| hypothetical protein ARALYDRAFT_329301 [Arabidopsis lyrata subsp.
lyrata]
Length = 357
Score = 298 bits (763), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 154/332 (46%), Positives = 220/332 (66%), Gaps = 17/332 (5%)
Query: 27 SCASQVVSSRSTHEQSVVE---MHEKWMAQHGRSYKDEL-EKEMRFKIFKENLEYIEKAN 82
S A + ++ H +S E + + WM++HG++Y + L EKE RF+ FK+NL +I++ N
Sbjct: 25 SSAIDLPATSGGHNRSNEEVGFIFQMWMSKHGKTYTNALGEKERRFQNFKDNLRFIDQHN 84
Query: 83 KEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDW 142
+ N +Y+LG RF+DLT E+R L+ G P P R+ S +Y L +P S+DW
Sbjct: 85 AK-NLSYQLGLTRFADLTVQEYRDLFPG--SPKPKQRNLRISR-RYVPLDGDQLPESVDW 140
Query: 143 RDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGC-GG 201
R++ AV+ IKDQ C CWAFS VAAVEGI KI L+ LSEQ+LVDC+ NNGC G
Sbjct: 141 RNEGAVSAIKDQGTCNSCWAFSTVAAVEGINKIVTGELVSLSEQELVDCNLV-NNGCYGS 199
Query: 202 GTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAA--KISNYEEVPSGDEQALLK 259
GTM+ AF+++I N G+ ++ +YPYQ QG C+ + + I +YE+VP+ DE +L K
Sbjct: 200 GTMDAAFQFLINNGGLDSDTDYPYQGSQGYCNRKESTSNKIITIDSYEDVPANDEISLQK 259
Query: 260 AVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWG 319
AV+ QPVS+G+ + EF Y+ GI+NG CGT LDHA+ IVG+G +E+G +YW+++NSWG
Sbjct: 260 AVAHQPVSVGVDKKSQEFMLYRSGIYNGPCGTDLDHALVIVGYG-SENGQDYWIVRNSWG 318
Query: 320 DTWGDAGYMKILRD----EGLCGIGTQSSYPL 347
TWGDAGY K+ R+ G+CGI +SYP+
Sbjct: 319 TTWGDAGYAKMARNFEYPSGVCGIAMLASYPV 350
>gi|38345188|emb|CAE03344.2| OSJNBb0005B05.11 [Oryza sativa Japonica Group]
gi|125589403|gb|EAZ29753.1| hypothetical protein OsJ_13812 [Oryza sativa Japonica Group]
Length = 323
Score = 298 bits (762), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 143/336 (42%), Positives = 208/336 (61%), Gaps = 25/336 (7%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEY 77
+F I+ L C++ + + + + ++ HE+WMAQ+GR YKD+ EK RF++FK N+ +
Sbjct: 8 LFAILGCLCLCSAVLAARELSDDAAMAARHERWMAQYGRMYKDDAEKARRFEVFKANVAF 67
Query: 78 IEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
IE N GN + LG N+F+DLTNDEFR+ T + R T F+ +N+++ +P
Sbjct: 68 IESFNA-GNHKFWLGVNQFADLTNDEFRSTKTNKGFIPSTTRVPTG--FRNENVNIDALP 124
Query: 138 TSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNG-N 196
++DWR K VTPIKDQ +CGCCWAFSAVAA+E +LVDC +G +
Sbjct: 125 ATMDWRTKGVVTPIKDQGQCGCCWAFSAVAAME----------------ELVDCDVHGED 168
Query: 197 NGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQA 256
GC GG M+ AF++II+N G+ TE YPY AV + + A+ I YE+VP+ +E A
Sbjct: 169 QGCEGGLMDDAFKFIIKNGGLTTESNYPYAAVDDKFKSVSNSVAS-IKGYEDVPANNEAA 227
Query: 257 LLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKN 316
L+KAV+ QPVS+ + F+ YK G+ G CGT LDH + +G+G DG YWL+KN
Sbjct: 228 LMKAVANQPVSVAVDGGDMTFQFYKGGVMTGSCGTDLDHGIVAIGYGKASDGTKYWLLKN 287
Query: 317 SWGDTWGDAGYMKILRD----EGLCGIGTQSSYPLA 348
SWG TWG+ G++++ +D G+CG+ + SYP A
Sbjct: 288 SWGMTWGENGFLRMEKDISDKRGMCGLAMEPSYPTA 323
>gi|222425026|dbj|BAH20463.1| cysteine protease [Spinacia oleracea]
Length = 473
Score = 298 bits (762), Expect = 4e-78, Method: Compositional matrix adjust.
Identities = 147/328 (44%), Positives = 215/328 (65%), Gaps = 12/328 (3%)
Query: 31 QVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYK 90
++ S S + V+ ++E W+ QH ++Y EKE RF IFK+NLE+I++ N + ++T+K
Sbjct: 37 NLLPSSSRSDDEVMRIYESWLVQHRKNYNALGEKEKRFAIFKDNLEFIDQHNSDDSQTFK 96
Query: 91 LGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFK-----YQNLSMTDVPTSLDWRDK 145
+G N+F+DLTN+EFR++Y G K S S +S+ K Y ++P ++DWR
Sbjct: 97 VGLNKFADLTNEEFRSVYLGRKKSSSSSPLLSSAKSKVKSDRYLFKEGDELPEAVDWRKN 156
Query: 146 KAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTME 205
AV +KDQ +CG CWAFS +AAVEGI +I L+ LSEQ+LVDC T+ N+GC GG M+
Sbjct: 157 GAVAKVKDQGQCGSCWAFSTIAAVEGINQIVTGELLSLSEQELVDCDTSYNSGCDGGLMD 216
Query: 206 KAFEYIIQNQGIATEDEYPYQAVQGTCSAAQK-AAAAKISNYEEVPSGDEQALLKAVSMQ 264
A+E+II N GI T+ +YPY A G C +K A I ++E+VP DE+AL KAV+ Q
Sbjct: 217 YAYEFIINNGGIDTDADYPYTAKDGKCDQYRKNAKVVTIDDFEDVPENDEKALQKAVAHQ 276
Query: 265 PVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGD 324
PVS+ I A + F+ Y+ G+F G CG LDH V VG+G ++DG +YW+++NSWG WG+
Sbjct: 277 PVSVAIEAGGSTFQFYQSGVFTGKCGADLDHGVVAVGYG-SDDGKDYWIVRNSWGADWGE 335
Query: 325 AGYMKILRD-----EGLCGIGTQSSYPL 347
+GY+++ R+ G CGI + SYP+
Sbjct: 336 SGYIRMERNLETVKTGKCGIAIEPSYPI 363
>gi|1169186|sp|P43156.1|CYSP_HEMSP RecName: Full=Thiol protease SEN102; Flags: Precursor
gi|396568|emb|CAA52425.1| thiol-protease [Hemerocallis hybrid cultivar]
Length = 360
Score = 298 bits (762), Expect = 4e-78, Method: Compositional matrix adjust.
Identities = 153/348 (43%), Positives = 217/348 (62%), Gaps = 23/348 (6%)
Query: 17 PMFIIIILL----VSCASQVVSSRS--THEQSVVEMHEKWMAQHGRSYKDELEKEMRFKI 70
P FI + L+ +S A + + E S+ ++EKW H + +D EK RF +
Sbjct: 4 PKFIALALVALSFLSIAQSIPFTEKDLASEDSLWNLYEKWRTHHTVA-RDLDEKNRRFNV 62
Query: 71 FKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRS-----TTSST 125
FKEN+++I + N++ + YKL N+F D+TN EFR+ Y G K+ HRS + +
Sbjct: 63 FKENVKFIHEFNQKKDAPYKLALNKFGDMTNQEFRSKYAGSKIQH--HRSQRGIQKNTGS 120
Query: 126 FKYQNLSMTDVPT-SLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLS 184
F Y+N+ +P S+DWR K AVT +KDQ +CG CWAFS +A+VEGI +I L+ LS
Sbjct: 121 FMYENVG--SLPAASIDWRAKGAVTGVKDQGQCGSCWAFSTIASVEGINQIKTGELVSLS 178
Query: 185 EQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTC-SAAQKAAAAKI 243
EQ+LVDC T+ N GC GG M+ AFE+I Q GI TED YPY GTC S + I
Sbjct: 179 EQELVDCDTSYNEGCNGGLMDYAFEFI-QKNGITTEDSYPYAEQDGTCASNLLNSPVVSI 237
Query: 244 SNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFG 303
+++VP+ +E AL++AV+ QP+S+ I A F+ Y EG+F G CGT+LDH V IVG+G
Sbjct: 238 DGHQDVPANNENALMQAVANQPISVSIEASGYGFQFYSEGVFTGRCGTELDHGVAIVGYG 297
Query: 304 TTEDGANYWLIKNSWGDTWGDAGYMKILR----DEGLCGIGTQSSYPL 347
T DG YW++KNSWG+ WG++GY+++ R G CGI ++SYP+
Sbjct: 298 ATRDGTKYWIVKNSWGEEWGESGYIRMQRGISDKRGKCGIAMEASYPI 345
>gi|242092704|ref|XP_002436842.1| hypothetical protein SORBIDRAFT_10g009850 [Sorghum bicolor]
gi|241915065|gb|EER88209.1| hypothetical protein SORBIDRAFT_10g009850 [Sorghum bicolor]
Length = 296
Score = 298 bits (762), Expect = 4e-78, Method: Compositional matrix adjust.
Identities = 145/311 (46%), Positives = 198/311 (63%), Gaps = 24/311 (7%)
Query: 43 VVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTND 102
+V HE+WM Q+ R YKD EK RF++FK N+++IE N GNR + LG N+F+DLTND
Sbjct: 1 MVARHEQWMVQYSRVYKDATEKAQRFEVFKSNVKFIESFNAGGNRKFWLGVNQFADLTND 60
Query: 103 EFRALYT--GYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCC 160
EFRA T G+K PSP T F+Y+N+S+ +P ++DWR K AVTPIKDQ +C
Sbjct: 61 EFRATKTNKGFK-PSPVKVPTG---FRYENISVDALPATIDWRTKGAVTPIKDQGQC--- 113
Query: 161 WAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNG-NNGCGGGTMEKAFEYIIQNQGIAT 219
EGI KIS LI LSEQ+LVDC +G + GC GG M+ AF++II+ G+ T
Sbjct: 114 ---------EGIVKISTGKLISLSEQELVDCDVHGEDQGCEGGLMDDAFKFIIKKGGLTT 164
Query: 220 EDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKS 279
E YPY A G C + + A + +E+VP+ DE +L+KAV+ QPVS+ + F+
Sbjct: 165 ESSYPYTAADGKCKSGSNSVAT-VKGFEDVPANDEASLMKAVANQPVSVAVDGGDMTFQF 223
Query: 280 YKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD----EG 335
Y G+ G CGT LDH + +G+G T DG YWL+KNSWG TWG+ GY+++ +D G
Sbjct: 224 YSGGVMTGSCGTDLDHGIAAIGYGQTSDGTKYWLLKNSWGTTWGENGYLRMEKDISDKRG 283
Query: 336 LCGIGTQSSYP 346
+CG+ + SYP
Sbjct: 284 MCGLAMEPSYP 294
>gi|388519351|gb|AFK47737.1| unknown [Medicago truncatula]
Length = 359
Score = 297 bits (761), Expect = 4e-78, Method: Compositional matrix adjust.
Identities = 150/341 (43%), Positives = 209/341 (61%), Gaps = 12/341 (3%)
Query: 15 TIPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKEN 74
TI + L+ + S RS E V+ M+E+W+ +H + Y EK+ RF+IFK+N
Sbjct: 5 TITSLLFFSLITLSLAMDTSMRSNEE--VMTMYEEWLVKHHKVYNGLGEKDQRFEIFKDN 62
Query: 75 LEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSH--RSTTSSTFKYQNLS 132
L +I++ N + N TYK+G N+F+D TN+E+R +Y G K + + + ++ +Y S
Sbjct: 63 LGFIDEHNAQ-NYTYKVGLNKFADTTNEEYRNMYLGTKNDAKRNVMKIKITTGHRYAFNS 121
Query: 133 MTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCS 192
+P +DWR K AV IKDQ CG CWAFS +A VE I KI L+ LSEQ+LVDC
Sbjct: 122 GDRLPVHVDWRSKGAVAHIKDQGSCGSCWAFSTIATVEAINKIVTGKLVSLSEQELVDCD 181
Query: 193 TNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQK-AAAAKISNYEEVPS 251
N GC GG M+ AFE+I++N GI TE +YPY+ +G C +K A I YE+VP+
Sbjct: 182 RAFNEGCNGGLMDYAFEFIVENGGIDTEQDYPYKGFEGRCDPTRKNAKVVSIDGYEDVPA 241
Query: 252 GDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANY 311
+E AL KAV QPVS+ I A + Y+ G+F G CGT LDH V +VG+G E+G +Y
Sbjct: 242 YNENALKKAVFHQPVSVAIEAGGRALQLYQSGVFTGRCGTNLDHGVVVVGYG-FENGVDY 300
Query: 312 WLIKNSWGDTWGDAGYMKILR-----DEGLCGIGTQSSYPL 347
WL++NSWG WG+ GY K+ R + G CGI Q+SYP+
Sbjct: 301 WLVRNSWGTNWGEDGYFKLERNVKKINTGKCGIAMQASYPV 341
>gi|359359213|gb|AEV41117.1| putative oryzain beta chain precursor [Oryza officinalis]
Length = 465
Score = 297 bits (761), Expect = 4e-78, Method: Compositional matrix adjust.
Identities = 142/307 (46%), Positives = 210/307 (68%), Gaps = 12/307 (3%)
Query: 47 HEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKAN-KEGNRTYKLGTNRFSDLTNDEFR 105
++ W+A++GRSY E E RF++F +NL + + N + + ++LG NRF+DLTN+EFR
Sbjct: 54 YDLWLAENGRSYNALGEHERRFRVFWDNLRFADAHNARADDHGFRLGMNRFADLTNEEFR 113
Query: 106 ALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSA 165
A + G K+ S ++ +Y++ + ++P S+DWR+K AV P+K+Q +CG CWAFSA
Sbjct: 114 ATFLGAKVVERSR----AAGERYRHDGVEELPESVDWREKGAVAPVKNQGQCGSCWAFSA 169
Query: 166 VAAVEGITKISGANLIQLSEQQLVDCSTNG-NNGCGGGTMEKAFEYIIQNQGIATEDEYP 224
V+ VE I ++ +I LSEQ+LV+CSTNG N+GC GG M+ AF++II+N GI TED+YP
Sbjct: 170 VSTVESINQLVTGEMITLSEQELVECSTNGQNSGCNGGLMDDAFDFIIKNGGIDTEDDYP 229
Query: 225 YQAVQGTCSA-AQKAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEG 283
Y+AV G C + A I +E+VP DE++L KAV+ QPVS+ I A EF+ Y G
Sbjct: 230 YKAVDGKCDINRENAKVVSIDGFEDVPQNDEKSLQKAVAHQPVSVAIEAGGREFQLYHSG 289
Query: 284 IFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD----EGLCGI 339
+F+G CGT LDH V VG+G T++G +YW+++NSWG WG++GY+++ R+ G CGI
Sbjct: 290 VFSGRCGTSLDHGVVAVGYG-TDNGKDYWIVRNSWGPKWGESGYVRMERNINVTTGKCGI 348
Query: 340 GTQSSYP 346
+SYP
Sbjct: 349 AMMASYP 355
>gi|115441717|ref|NP_001045138.1| Os01g0907600 [Oryza sativa Japonica Group]
gi|5761329|dbj|BAA83473.1| cysteine endopeptidase [Oryza sativa]
gi|20804884|dbj|BAB92565.1| cysteine endopeptidase [Oryza sativa Japonica Group]
gi|56785107|dbj|BAD82745.1| putative cysteine proteinase [Oryza sativa Japonica Group]
gi|113534669|dbj|BAF07052.1| Os01g0907600 [Oryza sativa Japonica Group]
gi|119395242|gb|ABL74582.1| cysteine endopeptidase [Oryza sativa Japonica Group]
gi|125528777|gb|EAY76891.1| hypothetical protein OsI_04850 [Oryza sativa Indica Group]
gi|125573036|gb|EAZ14551.1| hypothetical protein OsJ_04473 [Oryza sativa Japonica Group]
Length = 371
Score = 297 bits (761), Expect = 4e-78, Method: Compositional matrix adjust.
Identities = 146/330 (44%), Positives = 199/330 (60%), Gaps = 13/330 (3%)
Query: 28 CASQVVSSRSTH-EQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGN 86
CA+ R ++++ +++E+W H + EK RF FK+N+ YI + NK G
Sbjct: 26 CAAIPFDERDLESDEALWDLYERWQEHH-HVPRHHGEKHRRFGAFKDNVRYIHEHNKRGG 84
Query: 87 RTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSST---FKYQNLSMTDVPTSLDWR 143
R Y+L NRF D+ +EFRA + G + F Y+ + D+P ++DWR
Sbjct: 85 RGYRLRLNRFGDMGREEFRATFAGSHANDLRRDGLAAPPLPGFMYEGVR--DLPRAVDWR 142
Query: 144 DKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGT 203
K AVT +KDQ +CG CWAFS V +VEGI I L+ LSEQ+L+DC T N+GC GG
Sbjct: 143 RKGAVTGVKDQGKCGSCWAFSTVVSVEGINAIRTGRLVSLSEQELIDCDTADNSGCQGGL 202
Query: 204 MEKAFEYIIQNQGIATEDEYPYQAVQGTCSA--AQKAAAAKISNYEEVPSGDEQALLKAV 261
ME AFEYI + GI TE YPY+A GTC A A++A I ++ VP+ E AL KAV
Sbjct: 203 MENAFEYIKHSGGITTESAYPYRAANGTCDAVRARRAPLVVIDGHQNVPANSEAALAKAV 262
Query: 262 SMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDT 321
+ QPVS+ I A F+ Y +G+F G CGT LDH V +VG+G T DG YW++KNSWG
Sbjct: 263 ANQPVSVAIDAGDQSFQFYSDGVFAGDCGTDLDHGVAVVGYGETNDGTEYWIVKNSWGTA 322
Query: 322 WGDAGYMKILRDE----GLCGIGTQSSYPL 347
WG+ GY+++ RD GLCGI ++SYP+
Sbjct: 323 WGEGGYIRMQRDSGYDGGLCGIAMEASYPV 352
>gi|168057475|ref|XP_001780740.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162667829|gb|EDQ54449.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 463
Score = 297 bits (761), Expect = 5e-78, Method: Compositional matrix adjust.
Identities = 144/344 (41%), Positives = 214/344 (62%), Gaps = 24/344 (6%)
Query: 21 IIILLVSCASQVVSSRST-----------HEQSVVEMHEKWMAQHGRSYKDELEKEMRFK 69
+++L+++ Q + R+ + +++++ +W+ H R Y+ EK RF+
Sbjct: 12 LVLLVIAIGQQADAGRANAIVDYEGNQLHSDDAILDVFHQWLETHSRVYRSLSEKHHRFQ 71
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQ 129
IFKEN YI NK+ ++Y LG N+FSDLT+ EFRA Y G K P +R + F Y+
Sbjct: 72 IFKENFLYIHAHNKQ-QKSYWLGLNKFSDLTHQEFRAQYLGTK---PVNRQRKEANFMYE 127
Query: 130 NLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLV 189
++ + +DWR K AVT +KDQ CG CWAFSAV +VEG+ I L+ LSEQ+LV
Sbjct: 128 DV---EAEPKVDWRLKGAVTDVKDQGACGSCWAFSAVGSVEGVNAIKTGELVSLSEQELV 184
Query: 190 DCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQK-AAAAKISNYEE 248
DC N GC GG M+ AFE+II+N GI TE +YPY+A G C ++ + I +Y++
Sbjct: 185 DCDRKQNQGCNGGLMDYAFEFIIKNGGIDTEKDYPYKARDGRCDEGRRNSKVVVIDDYQD 244
Query: 249 VPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDG 308
VP+ E AL+KA++ PVS+ I A +F+ Y+ G+F G CG++LDH V VG+GT +DG
Sbjct: 245 VPTQSESALMKALTKNPVSVAIEAGGRDFQHYQGGVFTGPCGSELDHGVLAVGYGTDDDG 304
Query: 309 ANYWLIKNSWGDTWGDAGYMKILR-----DEGLCGIGTQSSYPL 347
NYW++KNSWG WG+ GY+++ R +G CGI ++S+P+
Sbjct: 305 VNYWIVKNSWGPGWGEKGYIRMERFGSDSTDGKCGINIEASFPI 348
>gi|190358935|sp|P00785.4|ACTN_ACTCH RecName: Full=Actinidain; Short=Actinidin; AltName: Allergen=Act c
1; Flags: Precursor
gi|12744965|gb|AAK06862.1|AF343446_1 actinidin protease [Actinidia chinensis]
Length = 380
Score = 297 bits (761), Expect = 5e-78, Method: Compositional matrix adjust.
Identities = 154/335 (45%), Positives = 207/335 (61%), Gaps = 10/335 (2%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEY 77
+F +L++S A + V M+E W+ ++G+SY E E RF+IFKE L +
Sbjct: 13 LFFSTLLILSLAFNAKNLTQRTNDEVKAMYESWLIKYGKSYNSLGEWERRFEIFKETLRF 72
Query: 78 IEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
I++ N + NR+YK+G N+F+DLT++EFR+ Y + S S+++ S+ +Y+ +P
Sbjct: 73 IDEHNADTNRSYKVGLNQFADLTDEEFRSTYL--RFTSGSNKTKVSN--RYEPRVGQVLP 128
Query: 138 TSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCS-TNGN 196
+ +DWR AV IK Q ECG CWAFSA+A VEGI KI LI LSEQ+L+DC T
Sbjct: 129 SYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSEQELIDCGRTQNT 188
Query: 197 NGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSA-AQKAAAAKISNYEEVPSGDEQ 255
GC GG + F++II N GI TE+ YPY A G C+ Q I YE VP +E
Sbjct: 189 RGCNGGYITDGFQFIINNGGINTEENYPYTAQDGECNVDLQNEKYVTIDTYENVPYNNEW 248
Query: 256 ALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIK 315
AL AV+ QPVS+ + A FK Y GIF G CGT +DHAVTIVG+G TE G +YW++K
Sbjct: 249 ALQTAVTYQPVSVALDAAGDAFKQYSSGIFTGPCGTAVDHAVTIVGYG-TEGGIDYWIVK 307
Query: 316 NSWGDTWGDAGYMKILRD---EGLCGIGTQSSYPL 347
NSW TWG+ GYM+ILR+ G CGI T SYP+
Sbjct: 308 NSWDTTWGEEGYMRILRNVGGAGTCGIATMPSYPV 342
>gi|9502426|gb|AAF88125.1|AC021043_18 Putative cysteine proteinase [Arabidopsis thaliana]
Length = 365
Score = 297 bits (760), Expect = 5e-78, Method: Compositional matrix adjust.
Identities = 156/362 (43%), Positives = 226/362 (62%), Gaps = 28/362 (7%)
Query: 13 INTIPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFK 72
+ ++ + + I+ + SQ + +EQS+V+ H++WM Q R YKDE EKEMR K+FK
Sbjct: 4 VRSVFVALTILSMDLRISQARPHVTLNEQSIVDYHQQWMTQFSRVYKDESEKEMRLKVFK 63
Query: 73 ENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
+NL++IE N GN++Y LG N F+D +EF A +TG ++ S + T +N +
Sbjct: 64 KNLKFIENFNNMGNQSYTLGVNEFTDWKTEEFLATHTGLRVNVTSLSELFNKTKPSRNWN 123
Query: 133 MTDVPT---SLDWRDKKAVTPIKDQQECG------------CCWAFSAVAAV------EG 171
M+D+ S DWRD+ AVTP+K Q C ++ + V EG
Sbjct: 124 MSDIDMEDESKDWRDEGAVTPVKYQGACPEFPTKQIRRNSLVGKQYTKLLGVLSDWGDEG 183
Query: 172 ITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGT 231
+TKISG NL+ LSEQQL+DC N GC GG E+AF+YII+N G++ E EYPYQ + +
Sbjct: 184 LTKISGKNLLTLSEQQLIDCDIEKNGGCNGGEFEEAFKYIIKNGGVSLETEYPYQVKKES 243
Query: 232 CSA-AQKAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGV-C 289
C A A++A +I ++ VPS +E+ALL+AV QPVS+ I A F YK G++ G+ C
Sbjct: 244 CRANARRAPHTQIRGFQMVPSHNERALLEAVRRQPVSVLIDARADSFGHYKGGVYAGLDC 303
Query: 290 GTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSY 345
GT ++HAVTIVG+GT G NYW++KNSWG++WG+ GYM+I RD +G+CGI ++Y
Sbjct: 304 GTDVNHAVTIVGYGTM-SGLNYWVLKNSWGESWGENGYMRIRRDVEWPQGMCGIAQVAAY 362
Query: 346 PL 347
P+
Sbjct: 363 PV 364
>gi|242071345|ref|XP_002450949.1| hypothetical protein SORBIDRAFT_05g021550 [Sorghum bicolor]
gi|241936792|gb|EES09937.1| hypothetical protein SORBIDRAFT_05g021550 [Sorghum bicolor]
Length = 371
Score = 297 bits (760), Expect = 6e-78, Method: Compositional matrix adjust.
Identities = 146/321 (45%), Positives = 206/321 (64%), Gaps = 16/321 (4%)
Query: 40 EQSVVEMHEKW----MAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNR 95
E+S+ ++E+W M +++ +K F +FKEN+ YI +ANK+G R+++L N+
Sbjct: 35 EESLRALYEQWRSHYMVSRPAGLQEQDDKARWFNVFKENVRYIHEANKKG-RSFRLALNK 93
Query: 96 FSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT-----DVPTSLDWRDKKAVTP 150
F+D+T DEFR Y + HR+ +S ++ + S ++P ++DWR + AVT
Sbjct: 94 FADMTTDEFRRAYAAGSR-TRHHRALSSGIRRHGDGSFMYAQAGNLPLAVDWRQRGAVTG 152
Query: 151 IKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEY 210
IKDQ +CG CWAFS +AAVEGI KI L+ LSEQ+LVDC N GC GG M+ AF+Y
Sbjct: 153 IKDQGQCGSCWAFSTIAAVEGINKIRTGKLVSLSEQELVDCDDVDNQGCNGGLMDYAFQY 212
Query: 211 IIQNQGIATEDEYPYQAVQGTCS-AAQKAAAAKISNYEEVPSGDEQALLKAVSMQPVSIG 269
I +N GI TE YPY A Q +C+ A +++ I YE+VP+ +E AL KAV+ QPVSI
Sbjct: 213 IKRNGGITTESNYPYLAEQRSCNKAKERSHDVTIDGYEDVPANNEDALQKAVANQPVSIA 272
Query: 270 IAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMK 329
I A +F+ Y EG+F G CGT+LDH V VG+G T DG YW++KNSWG+ WG+ GY++
Sbjct: 273 IEASGQDFQFYSEGVFTGSCGTELDHGVAAVGYGITRDGTKYWIVKNSWGEDWGERGYIR 332
Query: 330 ILR----DEGLCGIGTQSSYP 346
+ R +GLCGI + SYP
Sbjct: 333 MQRGISDSQGLCGIAMEPSYP 353
>gi|356515080|ref|XP_003526229.1| PREDICTED: vignain-like [Glycine max]
Length = 284
Score = 297 bits (760), Expect = 6e-78, Method: Compositional matrix adjust.
Identities = 149/286 (52%), Positives = 196/286 (68%), Gaps = 15/286 (5%)
Query: 72 KENLEYIEKANKEGNRTYKLGTNRFSDLTNDEF---RALYTGYKMPSPSHRSTTSSTFKY 128
KEN+ YIE N N+ YKLG N+F+DLT++EF R + G+ S +T ++TFKY
Sbjct: 5 KENVNYIEAFNNAANKPYKLGINQFADLTSEEFIVPRNRFNGHMRFS----NTRTTTFKY 60
Query: 129 QNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQL 188
+N+++ +P S+DWR K AVTPIK+Q CGCCWAFSA+AA EGI KIS L+ LSEQ++
Sbjct: 61 ENVTV--LPDSIDWRQKGAVTPIKNQGSCGCCWAFSAIAATEGIHKISTGKLVSLSEQEV 118
Query: 189 VDCSTNG-NNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAA-AAKISNY 246
VDC T G ++GC GG M+ AF++IIQN GI TE YPY+ V G C+ ++A A I+ Y
Sbjct: 119 VDCDTKGTDHGCEGGYMDGAFKFIIQNHGINTEASYPYKGVDGKCNIKEEAVHATTITGY 178
Query: 247 EEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTE 306
E+VP +E+AL KAV+ QPVS+ I A +F+ YK GIF G CGT+LDH VT VG+G
Sbjct: 179 EDVPINNEKALQKAVANQPVSVAIDARGADFQFYKSGIFTGSCGTELDHGVTAVGYGENN 238
Query: 307 DGANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYPLA 348
+G YWL+KNSWG WG+ GY + R EG+CGI +SYP A
Sbjct: 239 EGTKYWLVKNSWGTEWGEEGYTMMQRGVKAVEGICGIAMLASYPTA 284
>gi|374530932|gb|AEP83812.2| cysteine endopeptidase EP8 [Secale cereale x Triticum durum]
Length = 364
Score = 296 bits (759), Expect = 7e-78, Method: Compositional matrix adjust.
Identities = 148/317 (46%), Positives = 203/317 (64%), Gaps = 14/317 (4%)
Query: 40 EQSVVEMHEKWMAQHGRSYKD--ELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFS 97
E+++ ++E+W + + S + +E RF +FKEN YI + NK+ +R ++L N+F+
Sbjct: 33 EENLRGLYERWRSHYTVSRRGLGADAEERRFNVFKENARYIHEGNKK-DRPFRLALNKFA 91
Query: 98 DLTNDEFRALYTGYKMP---SPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQ 154
D+T DEFR Y G ++ S S +F+Y + ++P ++DWR K AVT IKDQ
Sbjct: 92 DMTTDEFRRTYAGSRVRHHLSLSGGRRGDGSFRYGDAD--NLPPAVDWRQKGAVTAIKDQ 149
Query: 155 QECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQN 214
+CG CWAFS + AVEGI KI L+ LSEQ+L+DC N GC GG M+ AF++I +N
Sbjct: 150 GQCGSCWAFSTIVAVEGINKIRTGKLVSLSEQELMDCDNVNNQGCDGGLMDYAFQFIHKN 209
Query: 215 QGIATEDEYPYQAVQGTCS-AAQKAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAY 273
GI TE YPYQ QG+C A +KA A I YE+VP+ DE AL KAV+ QPVS+ I A
Sbjct: 210 -GITTESNYPYQGEQGSCDLAKEKAHAVTIDGYEDVPANDESALQKAVAGQPVSVAIDAS 268
Query: 274 TTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD 333
+F+ Y EG+F G C T LDH V VG+GTT DG YW++KNSWG+ WG+ GY+++ R
Sbjct: 269 GNDFQFYSEGVFTGECSTDLDHGVAAVGYGTTRDGTKYWIVKNSWGEDWGEKGYIRMQRG 328
Query: 334 ----EGLCGIGTQSSYP 346
EG CGI Q+SYP
Sbjct: 329 VSQAEGQCGIAMQASYP 345
>gi|312451845|gb|ADQ85986.1| actinidin [Actinidia chinensis]
Length = 380
Score = 296 bits (759), Expect = 8e-78, Method: Compositional matrix adjust.
Identities = 154/335 (45%), Positives = 206/335 (61%), Gaps = 10/335 (2%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEY 77
+F +L++S A + V M+E W+ ++G+SY E E RF+IFKE L +
Sbjct: 13 LFFSTLLILSLAFNAKNLTQRTNDEVKAMYESWLIKYGKSYNSLGEWERRFEIFKETLRF 72
Query: 78 IEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
I++ N + NR+YK+G N+F+DLT++EFR+ Y G+ S S+++ S+ +Y+ +P
Sbjct: 73 IDEHNADTNRSYKVGLNQFADLTDEEFRSTYLGFT--SGSNKTKVSN--RYEPRVGQVLP 128
Query: 138 TSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCS-TNGN 196
+ +DWR AV IK Q ECG CWAFSA+A VEGI KI LI LSEQ+L+DC T
Sbjct: 129 SYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSEQELIDCGRTQNT 188
Query: 197 NGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSA-AQKAAAAKISNYEEVPSGDEQ 255
GC G + F +II N GI TE+ YPY A G C+ Q I YE VP +E
Sbjct: 189 RGCNGSYITDGFPFIINNGGINTEENYPYTAQDGECNVDLQNEKYVTIDTYENVPYNNEW 248
Query: 256 ALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIK 315
AL AV+ QPVS+ + A FK Y GIF G CGT +DHAVTIVG+G TE G +YW++K
Sbjct: 249 ALQTAVTYQPVSVALDAAGDAFKQYSSGIFTGPCGTAIDHAVTIVGYG-TEGGIDYWIVK 307
Query: 316 NSWGDTWGDAGYMKILRD---EGLCGIGTQSSYPL 347
NSW TWG+ GYM+ILR+ G CGI T SYP+
Sbjct: 308 NSWDTTWGEEGYMRILRNVGGAGTCGIATMPSYPV 342
>gi|262360187|gb|ACY38051.2| cysteine proteinase C1A [Dactylis glomerata]
Length = 365
Score = 296 bits (758), Expect = 9e-78, Method: Compositional matrix adjust.
Identities = 149/321 (46%), Positives = 204/321 (63%), Gaps = 20/321 (6%)
Query: 40 EQSVVEMHEKWMAQHG---RSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRF 96
E+S+ ++E W + H R E E RF +FKEN+ YI +ANK+ +R ++L N+F
Sbjct: 33 EESLRGLYETWRSHHTVSRRGLGAEAEAR-RFNVFKENVRYIHEANKK-DRPFRLALNKF 90
Query: 97 SDLTNDEFRALYTGYKMPSPSHRS------TTSSTFKYQNLSMTDVPTSLDWRDKKAVTP 150
+D+T DEFR Y G ++ HRS +F Y + ++P ++DWR K AVTP
Sbjct: 91 ADMTTDEFRRTYAGSRVRH--HRSLSGGRRQGGGSFMYADAE--NLPAAVDWRQKGAVTP 146
Query: 151 IKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEY 210
IKDQ +CG CWAFS + AVEGI KI L+ LSEQ+L+DC+ N+GC GG M+ AF++
Sbjct: 147 IKDQGQCGSCWAFSTIVAVEGINKIRTGRLVSLSEQELMDCNIGENDGCNGGLMDVAFQF 206
Query: 211 IIQNQGIATEDEYPYQAVQGTCSAAQK-AAAAKISNYEEVPSGDEQALLKAVSMQPVSIG 269
I QN GI TE YPYQ Q +C +++ + I YE+VP+ DE AL KAV+ QPVS+
Sbjct: 207 IQQNGGITTEASYPYQGEQNSCDQSKENSHDVSIDGYEDVPANDESALQKAVANQPVSVA 266
Query: 270 IAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMK 329
I A +F+ Y EG+F GT LDH V VG+GTT DG YW++KNSWG+ WG+ GY++
Sbjct: 267 IDASGNDFQFYSEGVFTTDGGTDLDHGVAAVGYGTTRDGTKYWIVKNSWGEDWGEKGYIR 326
Query: 330 ILRD----EGLCGIGTQSSYP 346
+ R EGLCGI ++SYP
Sbjct: 327 MQRGVKQAEGLCGIAMEASYP 347
>gi|225446523|ref|XP_002275891.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP2 [Vitis vinifera]
Length = 358
Score = 296 bits (758), Expect = 9e-78, Method: Compositional matrix adjust.
Identities = 154/341 (45%), Positives = 214/341 (62%), Gaps = 19/341 (5%)
Query: 19 FIIIILLVSCASQVVSSRSTHEQSVVEM------HEKWMAQHGRSYKDELEKEMRFKIFK 72
F ++I+ S S HE EM +E+W+ QHGR YK+ E + F I++
Sbjct: 12 FALLIMWTVGVSWSAFSEE-HEPMESEMSDMEKRYERWLVQHGRRYKNRDEWQRHFGIYQ 70
Query: 73 ENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
N+ +I N + N ++ L N+F+D+TN+E++ALY G S ++ S+FK +
Sbjct: 71 SNVRFINYINAQ-NFSFTLTDNQFADMTNEEYKALYMGLGTSETSRKN--QSSFKRERSK 127
Query: 133 MTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCS 192
+ +P S+DWR AVTP+++Q ECG CWAFS VAAVEGI KI L+ LSEQ+L+DC
Sbjct: 128 V--LPISVDWRKMGAVTPVRNQGECGSCWAFSTVAAVEGINKIRTGKLVSLSEQELLDCD 185
Query: 193 TN-GNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKA-AAAKISNYEEVP 250
+ GN GC GG M AF++I QN GI T YPY QG C+ + A KIS YE VP
Sbjct: 186 IDSGNEGCNGGYMVNAFKFIKQNGGITTARNYPYIGEQGICNKDKAANHVVKISGYETVP 245
Query: 251 SGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGAN 310
+E+ L AV+ QPVS+ I A EF+ Y +GIFNG CG QL+HAVT++G+G ++G
Sbjct: 246 PNNEKILQAAVAKQPVSVAIDAGGYEFQLYSKGIFNGFCGKQLNHAVTVIGYG-EDNGKK 304
Query: 311 YWLIKNSWGDTWGDAGYMKILR----DEGLCGIGTQSSYPL 347
YWL+KNSWG WG+AGY +++R DEG+CGI ++SYP+
Sbjct: 305 YWLVKNSWGTGWGEAGYARMIRDSRDDEGICGIAMEASYPI 345
>gi|414591545|tpg|DAA42116.1| TPA: hypothetical protein ZEAMMB73_388689 [Zea mays]
Length = 384
Score = 296 bits (758), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 141/323 (43%), Positives = 203/323 (62%), Gaps = 18/323 (5%)
Query: 40 EQSVVEMHEKWMAQHGR----SYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNR 95
E+S+ ++E+W + + R D+ ++ RF +FKEN Y+ +AN++ R ++L N+
Sbjct: 34 EESLRALYERWRSHYHRVSPRDGDDKQQQARRFNVFKENARYVHEANRKDGRPFRLALNK 93
Query: 96 FSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNL-------SMTDVPTSLDWRDKKAV 148
F+D+T DEFR Y G + + HR+ + + T++P ++DWR + AV
Sbjct: 94 FADMTTDEFRRTYAGSR--TRHHRAQLGEARSFAHAQHGRGGSGTTNLPPAVDWRLRGAV 151
Query: 149 TPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAF 208
T +KDQ +CG CWAFSA+AAVEG+ KI L+ LSEQ+LVDC N GC GG M+ AF
Sbjct: 152 TGVKDQGQCGSCWAFSAIAAVEGVNKIMTGKLVSLSEQELVDCDDVDNQGCDGGLMDYAF 211
Query: 209 EYIIQNQGIATEDEYPYQAVQGTCS-AAQKAAAAKISNYEEVPSGDEQALLKAVSMQPVS 267
+YI +N G+ TE YPY A Q +C+ A +++ I YE+VP+ +E AL KAV+ QPV+
Sbjct: 212 QYIQRNGGVTTESNYPYLAEQRSCNKAKERSHDVTIDGYEDVPANNEDALQKAVASQPVA 271
Query: 268 IGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGY 327
+ I A +F+ Y EG+F G CGT LDH V VG+GTT DG YW +KNSWG+ WG+ GY
Sbjct: 272 VAIEASGQDFQFYSEGVFTGSCGTDLDHGVAAVGYGTTGDGTKYWTVKNSWGEDWGERGY 331
Query: 328 MKILR----DEGLCGIGTQSSYP 346
+++ R GLCGI + SYP
Sbjct: 332 IRMQRGVPDSRGLCGIAMEPSYP 354
>gi|242055753|ref|XP_002457022.1| hypothetical protein SORBIDRAFT_03g047290 [Sorghum bicolor]
gi|241928997|gb|EES02142.1| hypothetical protein SORBIDRAFT_03g047290 [Sorghum bicolor]
Length = 378
Score = 296 bits (757), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 153/339 (45%), Positives = 212/339 (62%), Gaps = 31/339 (9%)
Query: 37 STHEQSVVEMHEKWMAQHGRSYKDELEKEMR-FKIFKENLEYIEKANKEGNRTYKLGTNR 95
S+HE S+ E+ E+W+++H + LE+++R F++FK+NL +I++ N++ + +Y LG N
Sbjct: 39 SSHE-SLAELFERWLSRHRKGAYASLEEKLRRFEVFKDNLHHIDETNRKVS-SYWLGLNE 96
Query: 96 FSDLTNDEFRALYTGYKMPSPS------HRSTTSST-------------FKYQNLSMTDV 136
F+DLT+DEF+A Y G H F+Y+ + +
Sbjct: 97 FADLTHDEFKATYLGLSPSGGGGDVVHMHHDDDDEEPEEEGSSSSSSFRFRYEGVDAARL 156
Query: 137 PTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGN 196
P S+DWR K AVT +K+Q +CG CWAFS VAAVEGI +I NL LSEQ+LVDC T+GN
Sbjct: 157 PKSVDWRSKGAVTGVKNQGQCGSCWAFSTVAAVEGINQIVTGNLTALSEQELVDCDTDGN 216
Query: 197 NGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQA 256
NGC GG M+ AF YI N G+ TE+ YPY +GTCS AA IS YE+VP +EQA
Sbjct: 217 NGCNGGLMDYAFSYIAHNGGLHTEEAYPYLMEEGTCSRGSSAAVVTISGYEDVPRNNEQA 276
Query: 257 LLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTT--EDG---ANY 311
LLKA++ QPVS+ I A + Y G+F+G CGTQLDH V VG+GT ++G A+Y
Sbjct: 277 LLKALAHQPVSVAIEASGRNLQFYSGGVFDGPCGTQLDHGVAAVGYGTAGKDNGHVVADY 336
Query: 312 WLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYP 346
++KNSWG +WG+ GY+++ R +GLCGI SYP
Sbjct: 337 IIVKNSWGPSWGEKGYIRMRRGTGKRQGLCGINKMPSYP 375
>gi|255539310|ref|XP_002510720.1| cysteine protease, putative [Ricinus communis]
gi|223551421|gb|EEF52907.1| cysteine protease, putative [Ricinus communis]
Length = 349
Score = 296 bits (757), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 146/314 (46%), Positives = 208/314 (66%), Gaps = 11/314 (3%)
Query: 38 THEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFS 97
T ++++ E W+++ GR Y+ EK RF+IFK+NL +I+ NK+ R Y LG N F+
Sbjct: 38 TSNDKLIDLFESWISRFGRVYESAEEKLERFEIFKDNLFHIDDTNKK-VRNYWLGLNEFA 96
Query: 98 DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQEC 157
DL+++EF+ Y G K P S R+ F Y++++ +P S+DWR K AVTP+K+Q C
Sbjct: 97 DLSHEEFKNKYLGLK-PDLSKRAQCPEEFTYKDVA---IPKSVDWRKKGAVTPVKNQGSC 152
Query: 158 GCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGI 217
G CWAFS VAAVEGI +I NL LSEQ+L+DC T NNGC GG M+ AF YI+ N G+
Sbjct: 153 GSCWAFSTVAAVEGINQIVTGNLTSLSEQELIDCDTTYNNGCNGGLMDYAFAYIVANGGL 212
Query: 218 ATEDEYPYQAVQGTCSA-AQKAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTE 276
E++YPY +GTC +++ A IS Y +VP E++LLKA++ QP+SI I A +
Sbjct: 213 HKEEDYPYIMEEGTCDMRKEESDAVTISGYHDVPQNSEESLLKALANQPLSIAIEASGRD 272
Query: 277 FKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD--- 333
F+ Y G+F+G CGT+LDH V VG+GT++ G +Y ++KNSWG WG+ GY+++ R
Sbjct: 273 FQFYSGGVFDGHCGTELDHGVAAVGYGTSK-GLDYIIVKNSWGPKWGEKGYIRMKRKTSK 331
Query: 334 -EGLCGIGTQSSYP 346
EG+CGI +SYP
Sbjct: 332 PEGICGIYKMASYP 345
>gi|302143380|emb|CBI21941.3| unnamed protein product [Vitis vinifera]
Length = 354
Score = 296 bits (757), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 146/307 (47%), Positives = 203/307 (66%), Gaps = 12/307 (3%)
Query: 47 HEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRA 106
+E+W+ QHGR YK+ E + F I++ N+ +I N + N ++ L N+F+D+TN+E++A
Sbjct: 41 YERWLVQHGRRYKNRDEWQRHFGIYQSNVRFINYINAQ-NFSFTLTDNQFADMTNEEYKA 99
Query: 107 LYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAV 166
LY G S ++ S+FK + + +P S+DWR AVTP+++Q ECG CWAFS V
Sbjct: 100 LYMGLGTSETSRKN--QSSFKRERSKV--LPISVDWRKMGAVTPVRNQGECGSCWAFSTV 155
Query: 167 AAVEGITKISGANLIQLSEQQLVDCSTN-GNNGCGGGTMEKAFEYIIQNQGIATEDEYPY 225
AAVEGI KI L+ LSEQ+L+DC + GN GC GG M AF++I QN GI T YPY
Sbjct: 156 AAVEGINKIRTGKLVSLSEQELLDCDIDSGNEGCNGGYMVNAFKFIKQNGGITTARNYPY 215
Query: 226 QAVQGTCSAAQKA-AAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGI 284
QG C+ + A KIS YE VP +E+ L AV+ QPVS+ I A EF+ Y +GI
Sbjct: 216 IGEQGICNKDKAANHVVKISGYETVPPNNEKILQAAVAKQPVSVAIDAGGYEFQLYSKGI 275
Query: 285 FNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILR----DEGLCGIG 340
FNG CG QL+HAVT++G+G ++G YWL+KNSWG WG+AGY +++R DEG+CGI
Sbjct: 276 FNGFCGKQLNHAVTVIGYG-EDNGKKYWLVKNSWGTGWGEAGYARMIRDSRDDEGICGIA 334
Query: 341 TQSSYPL 347
++SYP+
Sbjct: 335 MEASYPI 341
>gi|226503129|ref|NP_001149806.1| LOC100283433 precursor [Zea mays]
gi|195634783|gb|ACG36860.1| xylem cysteine proteinase 2 precursor [Zea mays]
gi|219884977|gb|ACL52863.1| unknown [Zea mays]
Length = 377
Score = 295 bits (756), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 147/317 (46%), Positives = 207/317 (65%), Gaps = 14/317 (4%)
Query: 38 THEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFS 97
T +V + E+W+A++ ++Y EK RF++FK+NL +I++AN++ +Y LG N F+
Sbjct: 63 TQHDRLVRLFEEWVAKYRKAYGSFEEKLRRFEVFKDNLHHIDEANRKEVTSYWLGLNAFA 122
Query: 98 DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDV--PTSLDWRDKKAVTPIKDQQ 155
DLT+DEF+A Y G +P + T+ F+Y + P S+DWR K AVT +K+Q
Sbjct: 123 DLTHDEFKATYLGL-LP----KRTSGGRFRYGGVGDGGDEVPASVDWRKKGAVTEVKNQG 177
Query: 156 ECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQ 215
+CG CWAFS VAAVEGI +I NL LSEQQLVDCST+GNNGC GG M+ AF +I
Sbjct: 178 QCGSCWAFSTVAAVEGINQIVTGNLTSLSEQQLVDCSTDGNNGCSGGVMDNAFSFIATGA 237
Query: 216 GIATEDEYPYQAVQGTCS--AAQKAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAY 273
G+ +E+ YPY +G C A IS YE+VP+ DEQAL+KA++ QPVS+ I A
Sbjct: 238 GLRSEEAYPYLMEEGDCDDRARDGEVLVTISGYEDVPANDEQALVKALAHQPVSVAIEAS 297
Query: 274 TTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILR- 332
F+ Y G+F+G CG++LDH V VG+G+++ G +Y ++KNSWG WG+ GY+++ R
Sbjct: 298 GRHFQFYSGGVFDGPCGSELDHGVAAVGYGSSK-GQDYIIVKNSWGTHWGEKGYIRMKRG 356
Query: 333 ---DEGLCGIGTQSSYP 346
EGLCGI +SYP
Sbjct: 357 TGKPEGLCGINKMASYP 373
>gi|3377950|emb|CAA08861.1| cysteine proteinase precursor, AN11 [Ananas comosus]
Length = 357
Score = 295 bits (756), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 143/335 (42%), Positives = 213/335 (63%), Gaps = 12/335 (3%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEY 77
+F+ + L V AS +SR +++ E+WMA++GR YKD EK RF+IFK N+ +
Sbjct: 8 VFLFLFLCVMWASPSAASRDEPSDPMMKRFEEWMAEYGRVYKDNDEKMRRFQIFKNNVNH 67
Query: 78 IEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
IE N +Y LG N+F+D+TN+EF A YTG +P R S + ++ ++ VP
Sbjct: 68 IETFNSRNGNSYTLGINQFTDMTNNEFVAQYTGVSLPLNIEREPVVS---FDDVDISAVP 124
Query: 138 TSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNN 197
S+DWR+ AVT +K+ CG CWAF+A+A VE I KI LI LSEQQ++DC+ +
Sbjct: 125 QSIDWRNYGAVTSVKNHIPCGSCWAFAAIATVESIYKIKRGYLISLSEQQVLDCAV--SY 182
Query: 198 GCGGGTMEKAFEYIIQNQGIATEDEYPYQAV--QGTCSAAQKAAAAKISNYEEVPSGDEQ 255
GC GG + KA+++II N+G+A+ YPY+A QGTC +A I+ Y V S +E+
Sbjct: 183 GCDGGWVNKAYDFIISNKGVASAAIYPYKASQGQGTCRINGVPNSAYITGYTRVQSNNER 242
Query: 256 ALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIK 315
+++ AVS QP++ I A + +F+ YK G+F+G CGT L+HA+TI+G+G G +W+++
Sbjct: 243 SMMYAVSNQPIAASIEA-SGDFQHYKRGVFSGPCGTSLNHAITIIGYGQDSSGKKFWIVR 301
Query: 316 NSWGDTWGDAGYMKILRD----EGLCGIGTQSSYP 346
NSWG +WG+ GY+++ RD GLCGI + YP
Sbjct: 302 NSWGASWGERGYIRMARDVSSSSGLCGIAIRPLYP 336
>gi|194352756|emb|CAQ00106.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
Length = 476
Score = 295 bits (756), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 143/320 (44%), Positives = 205/320 (64%), Gaps = 14/320 (4%)
Query: 40 EQSVVEMHEKWMAQHGRSYKDEL----EKEMRFKIFKENLEYIEKANKE---GNRTYKLG 92
E +++ W+A+HG ++E RF F +NL +++ N G ++L
Sbjct: 45 EAEARAVYDLWLAEHGGGSSPNANSIADRERRFSAFWDNLRFVDAHNARAAAGEEGFRLA 104
Query: 93 TNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIK 152
NRF+DLTNDEFRA Y G K + +R+ +Y++ ++P ++DWR+K AV P+K
Sbjct: 105 MNRFADLTNDEFRAAYLGVKGAAERNRAGRVVGDRYRHDGAEELPEAVDWREKGAVAPVK 164
Query: 153 DQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNG-NNGCGGGTMEKAFEYI 211
+Q +CG CWAFSAV+ VE I +I ++ LSEQ+LV+C NG ++GC GG M+ AFE+I
Sbjct: 165 NQGQCGSCWAFSAVSTVESINQIVTGEMVTLSEQELVECDINGQSSGCNGGLMDDAFEFI 224
Query: 212 IQNQGIATEDEYPYQAVQGTCSAAQK-AAAAKISNYEEVPSGDEQALLKAVSMQPVSIGI 270
I+N GI TED+YPY+AV G C +K A I +E+VP DE++L KAV+ PVS+ I
Sbjct: 225 IKNGGIDTEDDYPYKAVDGRCDVLRKNAKVVSIDGFEDVPENDEKSLQKAVAHHPVSVAI 284
Query: 271 AAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKI 330
A EF+ Y G+F+G CGTQLDH V VG+G TE+G +YW+++NSWG WG+AGY+++
Sbjct: 285 EAGGREFQLYHSGVFSGRCGTQLDHGVVAVGYG-TENGKDYWIVRNSWGPNWGEAGYLRM 343
Query: 331 LRD----EGLCGIGTQSSYP 346
R+ G CGI SSYP
Sbjct: 344 ERNINVTSGKCGIAMMSSYP 363
>gi|413942348|gb|AFW74997.1| Xylem cysteine proteinase 2 [Zea mays]
Length = 391
Score = 295 bits (756), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 147/317 (46%), Positives = 207/317 (65%), Gaps = 14/317 (4%)
Query: 38 THEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFS 97
T +V + E+W+A++ ++Y EK RF++FK+NL +I++AN++ +Y LG N F+
Sbjct: 77 TQHDRLVRLFEEWVAKYRKAYGSFEEKLRRFEVFKDNLHHIDEANRKEVTSYWLGLNAFA 136
Query: 98 DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDV--PTSLDWRDKKAVTPIKDQQ 155
DLT+DEF+A Y G +P + T+ F+Y + P S+DWR K AVT +K+Q
Sbjct: 137 DLTHDEFKATYLGL-LP----KRTSGGRFRYGGVGDGGDEVPASVDWRKKGAVTEVKNQG 191
Query: 156 ECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQ 215
+CG CWAFS VAAVEGI +I NL LSEQQLVDCST+GNNGC GG M+ AF +I
Sbjct: 192 QCGSCWAFSTVAAVEGINQIVTGNLTSLSEQQLVDCSTDGNNGCSGGVMDNAFSFIATGA 251
Query: 216 GIATEDEYPYQAVQGTCS--AAQKAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAY 273
G+ +E+ YPY +G C A IS YE+VP+ DEQAL+KA++ QPVS+ I A
Sbjct: 252 GLRSEEAYPYLMEEGDCDDRARDGEVLVTISGYEDVPANDEQALVKALAHQPVSVAIEAS 311
Query: 274 TTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILR- 332
F+ Y G+F+G CG++LDH V VG+G+++ G +Y ++KNSWG WG+ GY+++ R
Sbjct: 312 GRHFQFYSGGVFDGPCGSELDHGVAAVGYGSSK-GQDYIIVKNSWGTHWGEKGYIRMKRG 370
Query: 333 ---DEGLCGIGTQSSYP 346
EGLCGI +SYP
Sbjct: 371 TGKPEGLCGINKMASYP 387
>gi|224133764|ref|XP_002321655.1| predicted protein [Populus trichocarpa]
gi|222868651|gb|EEF05782.1| predicted protein [Populus trichocarpa]
Length = 360
Score = 295 bits (756), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 148/344 (43%), Positives = 213/344 (61%), Gaps = 12/344 (3%)
Query: 12 KINTIPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIF 71
K+ + +++ ++L + + E+S+ +++EKW + H S + EK RF +F
Sbjct: 3 KLLFVALYLALVLGFTESFDFHEKDLESEESLWDLYEKWRSHHTVSTSLD-EKRKRFNVF 61
Query: 72 KENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSH---RSTTSSTFKY 128
+ N+ ++ NK ++ YKL N+F+D+TN EFR Y K+ + + +F Y
Sbjct: 62 RANVLHVHNTNKM-DKPYKLKLNKFADMTNHEFRTAYASSKVKHHTMFRGAPLGNGSFMY 120
Query: 129 QNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQL 188
N+ VP S+DWR K AVTP+KDQ +CG CWAFS + AVEGI I LI LSEQ+L
Sbjct: 121 GNID--KVPASIDWRKKGAVTPVKDQGKCGSCWAFSTIVAVEGINFIKTNKLISLSEQEL 178
Query: 189 VDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQ-KAAAAKISNYE 247
VDC+T N+GC GG M+ AFE+I + +GI TE YPY+A G C A + A I +E
Sbjct: 179 VDCNTGENHGCNGGLMDYAFEFITKQKGITTEANYPYRAQDGHCDANKANQPAVSIDGHE 238
Query: 248 EVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTED 307
+V +E ALLKAV+ QPVS+ I A ++F+ Y EG+F G CG +LDH V IVG+GTT D
Sbjct: 239 DVLHNNENALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGECGKELDHGVAIVGYGTTVD 298
Query: 308 GANYWLIKNSWGDTWGDAGYMKILR----DEGLCGIGTQSSYPL 347
G YW+++NSWG WG+ GY+++ R GLCGI ++SYP+
Sbjct: 299 GTKYWIVRNSWGPEWGERGYIRMQRGISDRRGLCGIAMEASYPI 342
>gi|326507362|dbj|BAK03074.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 473
Score = 295 bits (755), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 143/320 (44%), Positives = 205/320 (64%), Gaps = 14/320 (4%)
Query: 40 EQSVVEMHEKWMAQHGRSYKDEL----EKEMRFKIFKENLEYIEKANKE---GNRTYKLG 92
E +++ W+A+HG ++E RF F +NL +++ N G ++L
Sbjct: 45 EAEARAVYDLWLAEHGGGSSPNANSIADRERRFSAFWDNLRFVDAHNARAAAGEEGFRLA 104
Query: 93 TNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIK 152
NRF+DLTNDEFRA Y G K + +R+ +Y++ ++P ++DWR+K AV P+K
Sbjct: 105 MNRFADLTNDEFRAAYLGVKGAAERNRAGRVVGERYRHDGAEELPEAVDWREKGAVAPVK 164
Query: 153 DQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNG-NNGCGGGTMEKAFEYI 211
+Q +CG CWAFSAV+ VE I +I ++ LSEQ+LV+C NG ++GC GG M+ AFE+I
Sbjct: 165 NQGQCGSCWAFSAVSTVESINQIVTGEMVTLSEQELVECDINGQSSGCNGGLMDDAFEFI 224
Query: 212 IQNQGIATEDEYPYQAVQGTCSAAQK-AAAAKISNYEEVPSGDEQALLKAVSMQPVSIGI 270
I+N GI TED+YPY+AV G C +K A I +E+VP DE++L KAV+ PVS+ I
Sbjct: 225 IKNGGIDTEDDYPYKAVDGRCDVLRKNAKVVSIDGFEDVPENDEKSLQKAVAHHPVSVAI 284
Query: 271 AAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKI 330
A EF+ Y G+F+G CGTQLDH V VG+G TE+G +YW+++NSWG WG+AGY+++
Sbjct: 285 EAGGREFQLYHSGVFSGRCGTQLDHGVVAVGYG-TENGKDYWIVRNSWGPNWGEAGYLRM 343
Query: 331 LRD----EGLCGIGTQSSYP 346
R+ G CGI SSYP
Sbjct: 344 ERNINVTSGKCGIAMMSSYP 363
>gi|204307508|gb|ACI00280.1| triticain beta 2 [Hordeum vulgare]
Length = 473
Score = 295 bits (755), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 143/320 (44%), Positives = 205/320 (64%), Gaps = 14/320 (4%)
Query: 40 EQSVVEMHEKWMAQHGRSYKDEL----EKEMRFKIFKENLEYIEKANKE---GNRTYKLG 92
E +++ W+A+HG ++E RF F +NL +++ N G ++L
Sbjct: 45 EAEARAVYDLWLAEHGGGSSPNANSIADRERRFSAFWDNLRFVDAHNARAAAGEEGFRLA 104
Query: 93 TNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIK 152
NRF+DLTNDEFRA Y G K + +R+ +Y++ ++P ++DWR+K AV P+K
Sbjct: 105 MNRFADLTNDEFRAAYLGVKGAAERNRAGRVVGERYRHDGAEELPEAVDWREKGAVAPVK 164
Query: 153 DQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNG-NNGCGGGTMEKAFEYI 211
+Q +CG CWAFSAV+ VE I +I ++ LSEQ+LV+C NG ++GC GG M+ AFE+I
Sbjct: 165 NQGQCGSCWAFSAVSTVESINQIVTGEMVTLSEQELVECDINGQSSGCNGGLMDDAFEFI 224
Query: 212 IQNQGIATEDEYPYQAVQGTCSAAQK-AAAAKISNYEEVPSGDEQALLKAVSMQPVSIGI 270
I+N GI TED+YPY+AV G C +K A I +E+VP DE++L KAV+ PVS+ I
Sbjct: 225 IKNGGIDTEDDYPYKAVDGRCDVLRKNAKVVSIDGFEDVPENDEKSLQKAVAHHPVSVAI 284
Query: 271 AAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKI 330
A EF+ Y G+F+G CGTQLDH V VG+G TE+G +YW+++NSWG WG+AGY+++
Sbjct: 285 EAGGREFQLYHSGVFSGRCGTQLDHGVVAVGYG-TENGKDYWIVRNSWGPNWGEAGYLRM 343
Query: 331 LRD----EGLCGIGTQSSYP 346
R+ G CGI SSYP
Sbjct: 344 ERNINVTSGKCGIAMMSSYP 363
>gi|357507617|ref|XP_003624097.1| Cysteine protease [Medicago truncatula]
gi|355499112|gb|AES80315.1| Cysteine protease [Medicago truncatula]
Length = 340
Score = 295 bits (755), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 149/337 (44%), Positives = 219/337 (64%), Gaps = 16/337 (4%)
Query: 15 TIPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKEN 74
T+ +I+I ++ ++Q + + ++ E ++ W ++ YKD+ E+E +IFK N
Sbjct: 9 TLINILIVIWVMFPSNQ--NQENDQSLTLSERYKHWKIKYRVIYKDDAEEEKHIQIFKHN 66
Query: 75 LEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT 134
+ YI+ N GN++YKL NRF+DL + + K+ TTSS FKY+N+ T
Sbjct: 67 VAYIDSFNAAGNKSYKLTINRFADLPTEPSDDGFKKRKL-----EPTTSSLFKYKNI--T 119
Query: 135 DVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVD-CST 193
D+P ++DWR + AVTP+K+Q+ECG CWAFSAV A+EGI +I+ NL+ LSEQ+LVD +
Sbjct: 120 DIPAAVDWRKRGAVTPVKNQRECGSCWAFSAVGALEGIQQITSGNLVSLSEQELVDRVRS 179
Query: 194 NGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGD 253
N NGC GG + AFE++++N GIATE YPY+ V+G ++ + + +I +YE+VP
Sbjct: 180 NWTNGCNGGYLIDAFEFVLENGGIATEASYPYRGVKGN-NSKKVSRQVQIKSYEQVPRNS 238
Query: 254 EQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWL 313
E +LLK V+ QPVS+GI + + Y GIF G CGT+ +HAV IVG+GT+ DG YWL
Sbjct: 239 EDSLLKVVANQPVSVGIDI-SGMIRFYSSGIFTGECGTKPNHAVIIVGYGTSNDGTKYWL 297
Query: 314 IKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYP 346
+KNSWG WG+ Y+++ RD EGLCGI +SYP
Sbjct: 298 VKNSWGIRWGEKRYIRMKRDIDAKEGLCGIPMDASYP 334
>gi|160858205|dbj|BAF93840.1| triticain beta 2 [Triticum aestivum]
Length = 469
Score = 295 bits (755), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 146/320 (45%), Positives = 208/320 (65%), Gaps = 16/320 (5%)
Query: 40 EQSVVEMHEKWMAQHGR-SYKDE---LEKEMRFKIFKENLEYIEKANKE---GNRTYKLG 92
E +++ W+A+HG SY + E+E RF+ F +NL +++ N G ++L
Sbjct: 43 EAEARAVYDLWLAEHGGGSYPNANSIPERERRFRAFWDNLRFVDAHNARAAAGEEGFRLA 102
Query: 93 TNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIK 152
NRF+DLTNDEFRA Y G K R +Y++ ++P ++DWR+K AV P+K
Sbjct: 103 MNRFADLTNDEFRAAYLGVK--GQRARPGRVVGERYRHDGAEELPEAVDWREKGAVAPVK 160
Query: 153 DQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNG-NNGCGGGTMEKAFEYI 211
+Q +CG CWAFSA++ VE I +I ++ LSEQ+LV+C TNG ++GC GG M+ AFE+I
Sbjct: 161 NQGQCGSCWAFSAISTVESINQIVTGEMVTLSEQELVECDTNGQSSGCNGGLMDDAFEFI 220
Query: 212 IQNQGIATEDEYPYQAVQGTCSAAQK-AAAAKISNYEEVPSGDEQALLKAVSMQPVSIGI 270
I+N GI TED+YPY+A+ G C +K A I +E+VP DE++L KAV+ QPVS+ I
Sbjct: 221 IKNGGIDTEDDYPYKAIDGRCDVLRKNAKVVSIDGFEDVPENDEKSLQKAVAHQPVSVAI 280
Query: 271 AAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKI 330
A EF+ Y G+F+G CGTQLDH V VG+G TE+G +YW+++NSWG WG+AGY+++
Sbjct: 281 EAGGREFQLYHSGVFSGRCGTQLDHGVVAVGYG-TENGKDYWIVRNSWGPNWGEAGYLRM 339
Query: 331 LRD----EGLCGIGTQSSYP 346
R+ G CGI SSYP
Sbjct: 340 ERNINVTSGKCGIAMMSSYP 359
>gi|13432122|sp|P80884.2|ANAN_ANACO RecName: Full=Ananain; Flags: Precursor
gi|2623956|emb|CAA05487.1| Ananain precursor [Ananas comosus]
Length = 345
Score = 295 bits (755), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 139/333 (41%), Positives = 208/333 (62%), Gaps = 10/333 (3%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEY 77
+F+ + L V AS +S +++ E+WMA++GR YKD EK +RF+IFK N+ +
Sbjct: 8 VFLFLFLCVMWASPSAASCDEPSDPMMKQFEEWMAEYGRVYKDNDEKMLRFQIFKNNVNH 67
Query: 78 IEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
IE N +Y LG N+F+D+TN+EF A YTG +P R S + ++ ++ VP
Sbjct: 68 IETFNNRNGNSYTLGINQFTDMTNNEFVAQYTGLSLPLNIKREPVVS---FDDVDISSVP 124
Query: 138 TSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNN 197
S+DWRD AVT +K+Q CG CWAF+++A VE I KI NL+ LSEQQ++DC+ +
Sbjct: 125 QSIDWRDSGAVTSVKNQGRCGSCWAFASIATVESIYKIKRGNLVSLSEQQVLDCAV--SY 182
Query: 198 GCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQAL 257
GC GG + KA+ +II N+G+A+ YPY+A +GTC +A I+ Y V +E+ +
Sbjct: 183 GCKGGWINKAYSFIISNKGVASAAIYPYKAAKGTCKTNGVPNSAYITRYTYVQRNNERNM 242
Query: 258 LKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNS 317
+ AVS QP++ + A + F+ YK G+F G CGT+L+HA+ I+G+G G +W+++NS
Sbjct: 243 MYAVSNQPIAAALDA-SGNFQHYKRGVFTGPCGTRLNHAIVIIGYGQDSSGKKFWIVRNS 301
Query: 318 WGDTWGDAGYMKILRDE----GLCGIGTQSSYP 346
WG WG+ GY+++ RD GLCGI YP
Sbjct: 302 WGAGWGEGGYIRLARDVSSSFGLCGIAMDPLYP 334
>gi|255646088|gb|ACU23531.1| unknown [Glycine max]
Length = 362
Score = 295 bits (755), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 153/320 (47%), Positives = 199/320 (62%), Gaps = 20/320 (6%)
Query: 40 EQSVVEMHEKWMAQH--GRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFS 97
E+S +++E+W + RS D K RF +FK N+ ++ NK ++ YKL N+F+
Sbjct: 33 EESFWDLYERWRSYRTVSRSLGD---KHKRFNVFKANVMHVHNTNKM-DKPYKLKLNKFA 88
Query: 98 DLTNDEFRALYTGYKMPSPSHR-----STTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIK 152
D+TN EFR+ Y G K+ HR + TF Y+ + VP S DWR AVT +K
Sbjct: 89 DMTNHEFRSTYAGSKVNH--HRMFQGTPRGNGTFMYEKVG--SVPPSADWRKNGAVTGVK 144
Query: 153 DQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYII 212
DQ +CG CWAFS V AVEGI +I L+ LSEQ+LVDC T N GC GG ME AFE+I
Sbjct: 145 DQGQCGSCWAFSTVVAVEGINQIKTNKLVSLSEQELVDCDTKKNAGCNGGLMESAFEFIK 204
Query: 213 QNQGIATEDEYPYQAVQGTCSAAQ-KAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIA 271
Q GI TE YPY A GTC A++ A I +E VP+ DE ALLKAV+ QPVS+ I
Sbjct: 205 QKGGITTESNYPYTAQDGTCDASKANDLAVSIDGHENVPANDENALLKAVANQPVSVAID 264
Query: 272 AYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMK-- 329
A +F+ Y EG+F G C T+L+H V IVG+GTT DG NYW ++NSWG WG+ GY++
Sbjct: 265 AGGFDFQFYFEGVFTGDCSTELNHGVAIVGYGTTVDGTNYWTVRNSWGPEWGEQGYIRMQ 324
Query: 330 --ILRDEGLCGIGTQSSYPL 347
I + EGLCGI +SYP+
Sbjct: 325 RSIFKKEGLCGIAMMASYPI 344
>gi|75277440|sp|O23791.1|BROM1_ANACO RecName: Full=Fruit bromelain; AltName: Allergen=Ana c 2; Flags:
Precursor
gi|2342496|dbj|BAA21849.1| bromelain [Ananas comosus]
Length = 351
Score = 295 bits (755), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 139/333 (41%), Positives = 210/333 (63%), Gaps = 10/333 (3%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEY 77
+F+ + L AS +SR +++ E+WMA++GR YKD+ EK RF+IFK N+++
Sbjct: 8 VFLFLFLCAMWASPSAASRDEPNDPMMKRFEEWMAEYGRVYKDDDEKMRRFQIFKNNVKH 67
Query: 78 IEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
IE N +Y LG N+F+D+T EF A YTG +P R S + +++++ VP
Sbjct: 68 IETFNSRNENSYTLGINQFTDMTKSEFVAQYTGVSLPLNIEREPVVS---FDDVNISAVP 124
Query: 138 TSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNN 197
S+DWRD AV +K+Q CG CW+F+A+A VEGI KI L+ LSEQ+++DC+ +
Sbjct: 125 QSIDWRDYGAVNEVKNQNPCGSCWSFAAIATVEGIYKIKTGYLVSLSEQEVLDCAV--SY 182
Query: 198 GCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQAL 257
GC GG + KA+++II N G+ TE+ YPY A QGTC+A +A I+ Y V DE+++
Sbjct: 183 GCKGGWVNKAYDFIISNNGVTTEENYPYLAYQGTCNANSFPNSAYITGYSYVRRNDERSM 242
Query: 258 LKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNS 317
+ AVS QP++ I A + F+ Y G+F+G CGT L+HA+TI+G+G G YW+++NS
Sbjct: 243 MYAVSNQPIAALIDA-SENFQYYNGGVFSGPCGTSLNHAITIIGYGQDSSGTKYWIVRNS 301
Query: 318 WGDTWGDAGYMKILR----DEGLCGIGTQSSYP 346
WG +WG+ GY+++ R G+CGI +P
Sbjct: 302 WGSSWGEGGYVRMARGVSSSSGVCGIAMAPLFP 334
>gi|18413505|ref|NP_567376.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|30315954|sp|Q9SUT0.1|CPR3_ARATH RecName: Full=Probable cysteine proteinase At4g11310; Flags:
Precursor
gi|5596477|emb|CAB51415.1| drought-inducible cysteine proteinase RD21A precursor-like protein
[Arabidopsis thaliana]
gi|7267830|emb|CAB81232.1| drought-inducible cysteine proteinase RD21A precursor-like protein
[Arabidopsis thaliana]
gi|332657595|gb|AEE82995.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 364
Score = 295 bits (754), Expect = 3e-77, Method: Compositional matrix adjust.
Identities = 149/348 (42%), Positives = 220/348 (63%), Gaps = 21/348 (6%)
Query: 18 MFIIIILLV--SCASQVVSSRSTHE-----QSVVE-----MHEKWMAQHGRSYKDELEKE 65
M I+++ +V SCA+ + S +++ SV + + E WM +HG+ Y EKE
Sbjct: 8 MLILLVAMVIASCATAIDMSVVSYDDNNRLHSVFDAEASLIFESWMVKHGKVYGSVAEKE 67
Query: 66 MRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSST 125
R IF++NL +I N E N +Y+LG F+DL+ E++ + G P + +S+
Sbjct: 68 RRLTIFEDNLRFINNRNAE-NLSYRLGLTGFADLSLHEYKEVCHGADPRPPRNHVFMTSS 126
Query: 126 FKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSE 185
+Y+ + +P S+DWR++ AVT +KDQ C CWAFS V AVEG+ KI L+ LSE
Sbjct: 127 DRYKTSADDVLPKSVDWRNEGAVTEVKDQGHCRSCWAFSTVGAVEGLNKIVTGELVTLSE 186
Query: 186 QQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKA--AAAKI 243
Q L++C+ NNGCGGG +E A+E+I++N G+ T+++YPY+AV G C K I
Sbjct: 187 QDLINCNKE-NNGCGGGKLETAYEFIMKNGGLGTDNDYPYKAVNGVCDGRLKENNKNVMI 245
Query: 244 SNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFG 303
YE +P+ DE AL+KAV+ QPV+ I + + EF+ Y+ G+F+G CGT L+H V +VG+G
Sbjct: 246 DGYENLPANDESALMKAVAHQPVTAVIDSSSREFQLYESGVFDGSCGTNLNHGVVVVGYG 305
Query: 304 TTEDGANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYPL 347
TE+G +YWL+KNS G TWG+AGYMK+ R+ GLCGI ++SYPL
Sbjct: 306 -TENGRDYWLVKNSRGITWGEAGYMKMARNIANPRGLCGIAMRASYPL 352
>gi|2463584|dbj|BAA22544.1| FBSB precursor [Ananas comosus]
Length = 356
Score = 295 bits (754), Expect = 3e-77, Method: Compositional matrix adjust.
Identities = 145/334 (43%), Positives = 206/334 (61%), Gaps = 11/334 (3%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEY 77
+F+ + L V AS +S +++ E+WM ++GR YKD EK RF+IFK N+ +
Sbjct: 8 VFLFLFLCVMWASPSAASADEPSDPMMKRFEEWMVEYGRVYKDNDEKMRRFQIFKNNVNH 67
Query: 78 IEKANKEGNRTYKLGTNRFSDLTNDEFRALYTG-YKMPSPSHRSTTSSTFKYQNLSMTDV 136
IE N +Y LG N+F+D+TN+EF A YTG P R S + ++ ++ V
Sbjct: 68 IETFNSRNENSYTLGINQFTDMTNNEFIAQYTGGISRPLNIEREPVVS---FDDVDISAV 124
Query: 137 PTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGN 196
P S+DWRD AVT +K+Q CG CWAF+A+A VE I KI L LSEQQ++DC+
Sbjct: 125 PQSIDWRDYGAVTSVKNQNPCGACWAFAAIATVESIYKIKKGILEPLSEQQVLDCAKG-- 182
Query: 197 NGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQA 256
GC GG +AFE+II N+G+A+ YPY+A +GTC +A I+ Y VP +E +
Sbjct: 183 YGCKGGWEFRAFEFIISNKGVASGAIYPYKAAKGTCKTNGVPNSAYITGYARVPRNNESS 242
Query: 257 LLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKN 316
++ AVS QP+++ + A F+ YK G+FNG CGT L+HAVT +G+G +G YW++KN
Sbjct: 243 MMYAVSKQPITVAVDA-NANFQYYKSGVFNGPCGTSLNHAVTAIGYGQDSNGKKYWIVKN 301
Query: 317 SWGDTWGDAGYMKILRD----EGLCGIGTQSSYP 346
SWG WG+AGY+++ RD G+CGI S YP
Sbjct: 302 SWGARWGEAGYIRMARDVSSSSGICGIAIDSLYP 335
>gi|194352762|emb|CAQ00109.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
gi|326517250|dbj|BAJ99991.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 367
Score = 295 bits (754), Expect = 3e-77, Method: Compositional matrix adjust.
Identities = 151/315 (47%), Positives = 201/315 (63%), Gaps = 9/315 (2%)
Query: 40 EQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDL 99
E S+ ++E+W QH + +D EK RF +F+EN+ I + N+ G+ YKL NRF D+
Sbjct: 40 EDSLWALYERWREQHTVA-RDLGEKARRFNVFRENVRLIHEFNR-GDAPYKLRLNRFGDM 97
Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFKYQN---LSMTDVPTSLDWRDKKAVTPIKDQQE 156
T DEFR Y ++ S + + S+ DVP S+DWR K AVT +KDQ +
Sbjct: 98 TADEFRRAYASSRVSHHRMFSLKEGGGGFMHGSAASVRDVPPSVDWRQKGAVTAVKDQGQ 157
Query: 157 CGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQG 216
CG CWAFS +AAVEGI I NL LSEQQLVDC T N GC GG M+ AF+YI ++ G
Sbjct: 158 CGSCWAFSTIAAVEGINAIRSKNLTSLSEQQLVDCDTKSNAGCNGGLMDYAFQYIAKHGG 217
Query: 217 IATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTE 276
+A ED YPY+A Q + + +A I YE+VP+ DE AL KAV+ QPV++ I A +
Sbjct: 218 VAAEDAYPYKARQASSCNKKPSAVVTIDGYEDVPANDETALKKAVAAQPVAVAIEASGSH 277
Query: 277 FKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD--- 333
F+ Y EG+F G CGT+LDH V VG+GTT DG YW++KNSWG WG+ GY+++ RD
Sbjct: 278 FQFYSEGVFAGKCGTELDHGVAAVGYGTTVDGTKYWIVKNSWGPEWGEKGYIRMKRDVKD 337
Query: 334 -EGLCGIGTQSSYPL 347
EGLCGI ++SYP+
Sbjct: 338 KEGLCGIAMEASYPV 352
>gi|357467173|ref|XP_003603871.1| Cysteine proteinase [Medicago truncatula]
gi|355492919|gb|AES74122.1| Cysteine proteinase [Medicago truncatula]
gi|388499154|gb|AFK37643.1| unknown [Medicago truncatula]
Length = 350
Score = 295 bits (754), Expect = 3e-77, Method: Compositional matrix adjust.
Identities = 137/309 (44%), Positives = 211/309 (68%), Gaps = 10/309 (3%)
Query: 43 VVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTND 102
++E+ E WM++HG+ Y+ EK +RF++FK+NL++I+ NK + Y LG N F+DL++
Sbjct: 43 LIELFESWMSRHGKIYETIEEKLLRFEVFKDNLKHIDDRNKVVS-NYWLGLNEFADLSHQ 101
Query: 103 EFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWA 162
EF+ Y G K+ R ++ F Y+++ D+P S+DWR K AVTP+K+Q +CG CWA
Sbjct: 102 EFKNKYLGLKVDLSQRRESSEEEFTYRDV---DLPKSVDWRKKGAVTPVKNQGQCGSCWA 158
Query: 163 FSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDE 222
FS VAAVEGI +I NL LSEQ+L+DC T NNGC GG M+ AF +I++N G+ E++
Sbjct: 159 FSTVAAVEGINQIVTGNLTSLSEQELIDCDTTYNNGCNGGLMDYAFSFIVKNGGLHKEED 218
Query: 223 YPYQAVQGTCSAAQKAA-AAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYK 281
YPY + TC ++ + I+ Y +VP +EQ+LLKA++ QP+S+ I A +F+ Y
Sbjct: 219 YPYIMEESTCEMKKEVSEVVTINGYHDVPQNNEQSLLKALANQPLSVAIEASGRDFQFYS 278
Query: 282 EGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD----EGLC 337
G+F+G CG++LDH V+ VG+GT++ G +Y ++KNSWG WG+ G++++ R+ EG+C
Sbjct: 279 GGVFDGHCGSELDHGVSAVGYGTSK-GLDYIIVKNSWGAKWGEKGFIRMKRNIGKSEGIC 337
Query: 338 GIGTQSSYP 346
G+ +SYP
Sbjct: 338 GLYKMASYP 346
>gi|357143305|ref|XP_003572875.1| PREDICTED: xylem cysteine proteinase 1-like [Brachypodium
distachyon]
Length = 473
Score = 294 bits (753), Expect = 4e-77, Method: Compositional matrix adjust.
Identities = 145/311 (46%), Positives = 211/311 (67%), Gaps = 14/311 (4%)
Query: 43 VVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTND 102
+V++ W +H + Y EK R+++FK+NL++I + N+ N +Y LG N+F+D+ ++
Sbjct: 44 LVDLFSSWSVKHSKIYVSPEEKVKRYEVFKQNLKHIVETNRR-NGSYWLGLNQFADVAHE 102
Query: 103 EFRALYTGYK--MPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCC 160
EF++ Y G K M P+ T F+Y+N ++P S+DWR K AVTP+K+Q ECG C
Sbjct: 103 EFKSTYLGLKTGMDGPARAPTA---FRYEN--SVNLPWSVDWRKKGAVTPVKNQGECGSC 157
Query: 161 WAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATE 220
WAFS VAAVEGI +I+ L LSEQ+L+DC T ++GCGGG M+ AF YI+ N GI T+
Sbjct: 158 WAFSTVAAVEGINQIATGKLESLSEQELMDCDTTFDHGCGGGFMDFAFAYIMGNLGIHTD 217
Query: 221 DEYPYQAVQGTCSAAQ-KAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKS 279
D+YPY +G C Q ++ IS YE+VP E +LLKA++ QP+S+GIAA + +F+
Sbjct: 218 DDYPYLMEEGYCKEKQPQSKVVTISGYEDVPENSEVSLLKALAHQPISVGIAAGSKDFQF 277
Query: 280 YKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILR----DEG 335
YK G+F G CGT+LDHA+T VG+G++ DG +Y ++KNSWG +WG+ GY +I R EG
Sbjct: 278 YKRGVFEGSCGTELDHALTAVGYGSS-DGQDYIIMKNSWGKSWGEQGYFRIKRGTGKPEG 336
Query: 336 LCGIGTQSSYP 346
+C I + +SYP
Sbjct: 337 VCSIYSMASYP 347
>gi|111073717|dbj|BAF02547.1| triticain beta [Triticum aestivum]
Length = 472
Score = 294 bits (753), Expect = 4e-77, Method: Compositional matrix adjust.
Identities = 145/320 (45%), Positives = 207/320 (64%), Gaps = 16/320 (5%)
Query: 40 EQSVVEMHEKWMAQHGRSYKDEL----EKEMRFKIFKENLEYIEKANKE---GNRTYKLG 92
E +++ W+A++G E+E RF+ F +NL +++ N G Y+LG
Sbjct: 46 EAEARAVYDLWLAENGGGSSPNANSIPERERRFRAFWDNLNFVDAHNARAAAGEEGYRLG 105
Query: 93 TNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIK 152
NRF+DLTNDEFRA Y G K + R +Y++ ++P ++DWR+K AV P+K
Sbjct: 106 MNRFADLTNDEFRAAYLGVK--AQRARPGRMVGERYRHDGAEELPEAVDWREKGAVAPVK 163
Query: 153 DQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNG-NNGCGGGTMEKAFEYI 211
+Q +CG CWAFSAV+ VE I +I ++ LSEQ+LV+C TNG ++GC GG M+ AFE+I
Sbjct: 164 NQGQCGSCWAFSAVSTVESINQIVTGEMVTLSEQELVECDTNGQSSGCNGGLMDDAFEFI 223
Query: 212 IQNQGIATEDEYPYQAVQGTCSAAQK-AAAAKISNYEEVPSGDEQALLKAVSMQPVSIGI 270
I+N GI TED+YPY+A+ G C +K A I +E+VP DE++L KAV+ QPVS+ I
Sbjct: 224 IKNGGIDTEDDYPYKAIDGRCDVLRKNAKVVSIDGFEDVPENDEKSLQKAVAHQPVSVAI 283
Query: 271 AAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKI 330
A EF+ Y G+F+G CGTQLDH V VG+G TE+G +YW+++NSWG WG++GY+++
Sbjct: 284 EAGGREFQLYHSGVFSGRCGTQLDHGVVAVGYG-TENGKDYWIVRNSWGPNWGESGYLRM 342
Query: 331 LRD----EGLCGIGTQSSYP 346
R+ G CGI SSYP
Sbjct: 343 ERNINVTSGKCGIAMMSSYP 362
>gi|115477767|ref|NP_001062479.1| Os08g0556900 [Oryza sativa Japonica Group]
gi|42407937|dbj|BAD09076.1| putative cysteine proteinase [Oryza sativa Japonica Group]
gi|113624448|dbj|BAF24393.1| Os08g0556900 [Oryza sativa Japonica Group]
gi|125562525|gb|EAZ07973.1| hypothetical protein OsI_30231 [Oryza sativa Indica Group]
gi|215701458|dbj|BAG92882.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 385
Score = 294 bits (752), Expect = 4e-77, Method: Compositional matrix adjust.
Identities = 150/320 (46%), Positives = 204/320 (63%), Gaps = 19/320 (5%)
Query: 40 EQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDL 99
E+++ E++E+W QH R +D EK RF +FK+N+ I + N+ + YKL NRF D+
Sbjct: 41 EEALWELYERWRGQH-RVARDLGEKARRFNVFKDNVRLIHEFNRR-DEPYKLRLNRFGDM 98
Query: 100 TNDEFRALYTGYKMPSPSH------RSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKD 153
T DEFR Y ++ SH R S F Y D+P ++DWR+K AV +KD
Sbjct: 99 TADEFRRAYASSRV---SHHRMFRGRGERRSGFMY--AGARDLPAAVDWREKGAVGAVKD 153
Query: 154 QQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTN-GNNGCGGGTMEKAFEYII 212
Q +CG CWAFS +AAVEGI I +NL LSEQQLVDC T GN GC GG M+ AF+YI
Sbjct: 154 QGQCGSCWAFSTIAAVEGINAIRTSNLTALSEQQLVDCDTKTGNAGCDGGLMDNAFQYIA 213
Query: 213 QNQGIATEDEYPYQAVQGTCSAAQKAAA-AKISNYEEVPSGDEQALLKAVSMQPVSIGIA 271
++ G+A YPY+A Q +C ++ ++ I YE+VP+ E AL KAV+ QPVS+ I
Sbjct: 214 KHGGVAASSAYPYRARQSSCKSSAASSPAVTIDGYEDVPANSESALKKAVANQPVSVAIE 273
Query: 272 AYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKIL 331
A + F+ Y EG+F G CGT+LDH V VG+GTT DG YW+++NSWG WG+ GY+++
Sbjct: 274 AGGSHFQFYSEGVFAGKCGTELDHGVAAVGYGTTVDGTKYWIVRNSWGADWGEKGYIRMK 333
Query: 332 RD----EGLCGIGTQSSYPL 347
RD EGLCGI ++SYP+
Sbjct: 334 RDVSAKEGLCGIAMEASYPI 353
>gi|20260334|gb|AAM13065.1| drought-inducible cysteine proteinase RD21A precursor-like protein
[Arabidopsis thaliana]
gi|23197782|gb|AAN15418.1| drought-inducible cysteine proteinase RD21A precursor-like protein
[Arabidopsis thaliana]
Length = 357
Score = 294 bits (752), Expect = 4e-77, Method: Compositional matrix adjust.
Identities = 149/348 (42%), Positives = 220/348 (63%), Gaps = 21/348 (6%)
Query: 18 MFIIIILLV--SCASQVVSSRSTHE-----QSVVE-----MHEKWMAQHGRSYKDELEKE 65
M I+++ +V SCA+ + S +++ SV + + E WM +HG+ Y EKE
Sbjct: 1 MLILLVAMVIASCATAIDMSVVSYDDNNRLHSVFDAEASLIFESWMVKHGKVYGSVAEKE 60
Query: 66 MRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSST 125
R IF++NL +I N E N +Y+LG F+DL+ E++ + G P + +S+
Sbjct: 61 RRLTIFEDNLRFINNRNAE-NLSYRLGLTGFADLSLHEYKEVCHGADPRPPRNHVFMTSS 119
Query: 126 FKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSE 185
+Y+ + +P S+DWR++ AVT +KDQ C CWAFS V AVEG+ KI L+ LSE
Sbjct: 120 DRYKTSADDVLPKSVDWRNEGAVTEVKDQGHCRSCWAFSTVGAVEGLNKIVTGELVTLSE 179
Query: 186 QQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKA--AAAKI 243
Q L++C+ NNGCGGG +E A+E+I++N G+ T+++YPY+AV G C K I
Sbjct: 180 QDLINCNKE-NNGCGGGKLETAYEFIMKNGGLGTDNDYPYKAVNGVCDGRLKENNKNVMI 238
Query: 244 SNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFG 303
YE +P+ DE AL+KAV+ QPV+ I + + EF+ Y+ G+F+G CGT L+H V +VG+G
Sbjct: 239 DGYENLPANDESALMKAVAHQPVTAVIDSSSREFQLYESGVFDGSCGTNLNHGVVVVGYG 298
Query: 304 TTEDGANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYPL 347
TE+G +YWL+KNS G TWG+AGYMK+ R+ GLCGI ++SYPL
Sbjct: 299 -TENGRDYWLVKNSRGITWGEAGYMKMARNIANPRGLCGIAMRASYPL 345
>gi|297809383|ref|XP_002872575.1| hypothetical protein ARALYDRAFT_911472 [Arabidopsis lyrata subsp.
lyrata]
gi|297318412|gb|EFH48834.1| hypothetical protein ARALYDRAFT_911472 [Arabidopsis lyrata subsp.
lyrata]
Length = 371
Score = 293 bits (751), Expect = 6e-77, Method: Compositional matrix adjust.
Identities = 152/362 (41%), Positives = 225/362 (62%), Gaps = 27/362 (7%)
Query: 9 GSFKINTIPMFIIIILLVSCAS----QVVSSRSTHE--QSVVEMH-----------EKWM 51
GS K T+ + ++ +++ SCA+ VVSS + H S +H + WM
Sbjct: 2 GSAKSATL-ILLVAMVITSCATAMDMSVVSSNNNHHLTTSPGRLHSGFDAEASLIFDSWM 60
Query: 52 AQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGY 111
+HG+ Y EKE R IF++NL +I N E N +Y+LG +F+DL+ E+ + G
Sbjct: 61 VKHGKVYGSVAEKERRLTIFEDNLRFISNRNAE-NLSYRLGLTQFADLSLHEYGEVCHGA 119
Query: 112 KMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEG 171
P + +S+ +Y+ + +P S+DWR++ AVT +KDQ C CWAFS V AVEG
Sbjct: 120 DPRPPRNHVFMTSSDRYKTSAGDVLPKSVDWRNEGAVTEVKDQGHCRSCWAFSTVGAVEG 179
Query: 172 ITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGT 231
+ KI L+ LSEQ L++C+ NNGCGGG +E A+E+I++N G+ T+++YPY+AV G
Sbjct: 180 LNKIVTGELVTLSEQDLINCNKE-NNGCGGGKVETAYEFIMKNGGLGTDNDYPYKAVNGV 238
Query: 232 CSAAQKA--AAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVC 289
C K I +E +P+ DE AL+KAV+ QPV+ I + + EF+ Y+ G+F+G C
Sbjct: 239 CDGRLKENNKNVMIDGFENLPANDEFALMKAVAHQPVTAVIDSSSREFQLYESGVFDGSC 298
Query: 290 GTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSY 345
GT L+H V +VG+G TE+G +YWL+KNS G+TWG+AGYMK+ R+ GLCGI ++SY
Sbjct: 299 GTNLNHGVVVVGYG-TENGRDYWLVKNSRGNTWGEAGYMKMARNIANPRGLCGIAMRASY 357
Query: 346 PL 347
PL
Sbjct: 358 PL 359
>gi|1174171|gb|AAB41816.1| NTH1 [Pisum sativum]
Length = 367
Score = 293 bits (750), Expect = 7e-77, Method: Compositional matrix adjust.
Identities = 146/335 (43%), Positives = 209/335 (62%), Gaps = 12/335 (3%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEY 77
+ + ++ +S + + S RS E V+ M+EKW+ +H + Y EK RF+IFK+NL +
Sbjct: 8 LILFGLITLSLSLDMSSGRSNKE--VMTMYEKWLVKHQKVYYGLGEKNQRFQIFKDNLIF 65
Query: 78 IEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
I++ N N +Y++G N FSD+TN E+R Y + TS + Y+ +P
Sbjct: 66 IDEHNAP-NHSYRVGLNEFSDITNKEYRDTYLSRWSNNNIKNKITSVRYAYKAGHNNKLP 124
Query: 138 TSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNN 197
S+DWR A+TPIK+Q CG CWAFSAVAAVE I KI +L+ LSEQ+LVDC N
Sbjct: 125 VSVDWRG--ALTPIKNQGSCGACWAFSAVAAVEAINKIVTGSLVSLSEQELVDCDRTKNK 182
Query: 198 GCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQK-AAAAKISNYEEVPSGDEQA 256
GC GG A+ +I++N G+ ++ +YPY Q TC+ A+K I+ Y+ V E A
Sbjct: 183 GCNGGNQVNAYRFIVENGGLDSQIDYPYLGRQSTCNQAKKNTKVVSINGYKNVQRNSESA 242
Query: 257 LLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKN 316
L++AV+ QPVS+GI AY +F+ Y+ G+F G CGT LDHAV +VG+G +E+G +YWL+KN
Sbjct: 243 LMEAVANQPVSVGIEAYGKDFQLYQSGVFTGSCGTSLDHAVVVVGYG-SENGKDYWLVKN 301
Query: 317 SWGDTWGDAGYMKILR-----DEGLCGIGTQSSYP 346
SWG WG+ GY+KI R + G CGI ++YP
Sbjct: 302 SWGTNWGERGYLKIERNLKNTNTGKCGIAMDATYP 336
>gi|302790836|ref|XP_002977185.1| hypothetical protein SELMODRAFT_106228 [Selaginella moellendorffii]
gi|300155161|gb|EFJ21794.1| hypothetical protein SELMODRAFT_106228 [Selaginella moellendorffii]
Length = 299
Score = 293 bits (749), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 144/303 (47%), Positives = 199/303 (65%), Gaps = 8/303 (2%)
Query: 46 MHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFR 105
M E W A+HG+SY + EK R IF + L YIEK N + N T+ LG N+FSDLTN EFR
Sbjct: 1 MFEDWAAKHGKSYSSDSEKARRLMIFSDTLAYIEKHNAQPNTTFTLGLNKFSDLTNAEFR 60
Query: 106 ALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSA 165
A Y G K SP ++ + K ++ ++ +PTSLDWR + AVTPIKDQ +CG CWAFSA
Sbjct: 61 ANYVG-KFKSPRYQDRRPA--KDVDVDVSSLPTSLDWRQEGAVTPIKDQGQCGSCWAFSA 117
Query: 166 VAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPY 225
+A++E ++ L+ LSEQQL+DC T + GC GG E AF+++++N G+ TE+ YPY
Sbjct: 118 IASIESAHFLATKELVSLSEQQLIDCDTV-DQGCQGGFPEDAFKFVVENGGVTTEEAYPY 176
Query: 226 QAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIF 285
G+C+ A K +I+ Y++V AL+KAVS PV++GI F++Y+ GI
Sbjct: 177 TGFAGSCN-ANKNKVVEITGYKDVTKDSADALMKAVSKTPVTVGICGSDQNFQNYRSGIL 235
Query: 286 NGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD--EGLCGIGTQS 343
+G C DHAV ++G+G TE G YW+IKNSWG +WG+ G+MKI + EG+CG+ QS
Sbjct: 236 SGQCSNSRDHAVLVIGYG-TEGGMPYWIIKNSWGTSWGENGFMKIKKKDGEGMCGMNGQS 294
Query: 344 SYP 346
SYP
Sbjct: 295 SYP 297
>gi|413953666|gb|AFW86315.1| hypothetical protein ZEAMMB73_539008 [Zea mays]
Length = 314
Score = 293 bits (749), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 145/341 (42%), Positives = 205/341 (60%), Gaps = 36/341 (10%)
Query: 13 INTIPMFIIIILLVS--CASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKI 70
+ T+ I+ IL + C + + + + + ++V HE+WMAQ+ R YKD EK RFK
Sbjct: 1 MATLKASILAILGFAFFCGAALAARDLSDDSAMVARHEQWMAQYSRVYKDASEKARRFK- 59
Query: 71 FKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQN 130
F+DLTN EFR++ T S + + T F+Y+N
Sbjct: 60 -------------------------FADLTNHEFRSVKTNKGFKSSNMKILTG--FRYEN 92
Query: 131 LSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVD 190
+S +PT++DWR K VTPIKDQ +CGCC AFSAVAA EGI KIS L+ L++Q+LVD
Sbjct: 93 VSADALPTTIDWRTKGVVTPIKDQGQCGCCSAFSAVAATEGIVKISTGKLVSLADQELVD 152
Query: 191 CSTNG-NNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEV 249
C +G + GC GG M+ AF++II+N G+ TE YPY A G C++ +AA I YE+V
Sbjct: 153 CDVHGEDQGCEGGLMDDAFKFIIKNGGLTTESSYPYTAADGKCNSGSNSAAT-IKGYEDV 211
Query: 250 PSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGA 309
P+ DE AL+KA++ QPVS+ + F+ Y G+ G CGT LDH + +G+G T DG
Sbjct: 212 PANDEAALMKAMANQPVSVAVDGGDMTFRFYSGGVMTGSCGTDLDHGIAAIGYGKTSDGT 271
Query: 310 NYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYP 346
YWL+KNSWG TWG+ GY+++ +D G+CG+ + SYP
Sbjct: 272 KYWLMKNSWGTTWGENGYLRMEKDISDKRGMCGLAMEPSYP 312
>gi|37780041|gb|AAP32193.1| cysteine protease 14 [Trifolium repens]
Length = 351
Score = 293 bits (749), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 139/310 (44%), Positives = 211/310 (68%), Gaps = 11/310 (3%)
Query: 43 VVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTND 102
++E+ E WM++HG+ Y+ EK +RF++FK+NL++I++ NK + Y LG N F+DL++
Sbjct: 43 LIELFESWMSRHGKIYETIEEKLLRFEVFKDNLKHIDERNKIVS-NYWLGLNEFADLSHQ 101
Query: 103 EFRALYTGYKMPSPSHR-STTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCW 161
EF+ Y G K+ R S+ F Y+++ D+P S+DWR K AVTP+K+Q +CG CW
Sbjct: 102 EFKNKYLGLKVNLSQRRESSNEEEFTYRDV---DLPKSVDWRKKGAVTPVKNQGQCGSCW 158
Query: 162 AFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATED 221
AFS VAAVEGI +I NL LSEQ+L+DC T NNGC GG M+ AF +I+QN G+ ED
Sbjct: 159 AFSTVAAVEGINQIVTGNLTSLSEQELIDCDTTYNNGCNGGLMDYAFSFIVQNGGLHKED 218
Query: 222 EYPYQAVQGTCS-AAQKAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSY 280
+YPY + TC ++ I+ Y +VP +EQ+LLKA++ QP+S+ I A + +F+ Y
Sbjct: 219 DYPYIMEESTCEMKKEETQVVTINGYHDVPQNNEQSLLKALANQPLSVAIEASSRDFQFY 278
Query: 281 KEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD----EGL 336
G+F+G CG+ LDH V+ VG+GT+++ +Y ++KNSWG WG+ G++++ R+ EG+
Sbjct: 279 SGGVFDGHCGSDLDHGVSAVGYGTSKN-LDYIIVKNSWGAKWGEKGFIRMKRNIGKPEGI 337
Query: 337 CGIGTQSSYP 346
CG+ +SYP
Sbjct: 338 CGLYKMASYP 347
>gi|224096714|ref|XP_002310708.1| predicted protein [Populus trichocarpa]
gi|222853611|gb|EEE91158.1| predicted protein [Populus trichocarpa]
Length = 356
Score = 293 bits (749), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 150/325 (46%), Positives = 206/325 (63%), Gaps = 18/325 (5%)
Query: 38 THEQS-------VVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYK 90
+H+QS V+ +++ W+ +HG++Y EK RF+IFK NL +I++ N + NRTYK
Sbjct: 12 SHDQSSWRSDDEVMSIYKWWLQKHGKAYNRLGEKAKRFEIFKNNLRFIDEHNSQ-NRTYK 70
Query: 91 LGTNRFSDLTNDEFRALYTGYKMPSPSHR--STTSSTFKYQNLSMTDVPTSLDWRDKKAV 148
+G +F+DLTN E+RA++ G + P R + + + +Y + +P S+DWR K AV
Sbjct: 71 VGLTKFADLTNQEYRAMFLGTR-SDPKRRLMKSKNPSERYAYKAGDKLPESVDWRGKGAV 129
Query: 149 TPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAF 208
PIKDQ CG CWAFS VAAVEGI +I LI LSEQ+LVDC N GC GG M+ AF
Sbjct: 130 NPIKDQGSCGSCWAFSTVAAVEGINQIVTGELISLSEQELVDCDRFYNAGCNGGLMDYAF 189
Query: 209 EYIIQNQGIATEDEYPYQAVQGTCSAAQ-KAAAAKISNYEEVPSGDEQALLKAVSMQPVS 267
++II N G+ TE +YPY TC + K A I +E+V DE+AL KAV+ QPVS
Sbjct: 190 QFIINNGGLDTEKDYPYLGNDDTCDRDKMKTKAVSIDGFEDVLPFDEKALQKAVAHQPVS 249
Query: 268 IGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGY 327
+ I A + Y+ G+F G CGT LDH V +VG+G TE G +YWL++NSWG WG+ GY
Sbjct: 250 VAIEASGMALQFYQSGVFTGECGTALDHGVVVVGYG-TEKGLDYWLVRNSWGTEWGEHGY 308
Query: 328 MKILRD-----EGLCGIGTQSSYPL 347
+K+ R+ G CGI +SSYP+
Sbjct: 309 IKMQRNVRDTYTGRCGIAMESSYPV 333
>gi|146215976|gb|ABQ10190.1| actinidin Act1b [Actinidia arguta]
Length = 380
Score = 293 bits (749), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 151/335 (45%), Positives = 208/335 (62%), Gaps = 10/335 (2%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEY 77
+F +L++S A + + M+E W+ ++G+SY E E RF+IFKE L +
Sbjct: 13 LFFSTLLVLSLAFNAKNLTKRTNDELKAMYESWLTKYGKSYNSLGEWERRFEIFKETLRF 72
Query: 78 IEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
I++ N + NR+Y++G N+F+D TN+EF++ Y G+ S S++ S+ +Y+ +P
Sbjct: 73 IDEHNADTNRSYRVGLNQFADQTNEEFQSTYLGFT--SGSNKMKVSN--RYEPRVGQVLP 128
Query: 138 TSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCS-TNGN 196
+DWR AV IK Q +CG CWAFSA+A VEGI KI +LI LSEQ+LVDC T
Sbjct: 129 DYVDWRSAGAVVDIKSQGQCGSCWAFSAIATVEGINKIVTGDLISLSEQELVDCGRTQNT 188
Query: 197 NGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSA-AQKAAAAKISNYEEVPSGDEQ 255
GC GG++ F++II N GI TE YPY A G C+ Q A I YE VP +E
Sbjct: 189 RGCDGGSITDGFQFIINNGGINTEANYPYTAEDGQCNLDLQNEKYASIDTYENVPYNNEW 248
Query: 256 ALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIK 315
AL AV+ QPVS+ + A F+ Y GIF G CGT +DHAVTIVG+G TE G +YW++K
Sbjct: 249 ALQTAVAYQPVSVALEAAGDAFQHYSSGIFTGPCGTAVDHAVTIVGYG-TEGGIDYWIVK 307
Query: 316 NSWGDTWGDAGYMKILRD---EGLCGIGTQSSYPL 347
NSW TWG+ GY++ILR+ G CGI T+ SYP+
Sbjct: 308 NSWDTTWGEEGYIRILRNVGGAGTCGIATKPSYPV 342
>gi|414879123|tpg|DAA56254.1| TPA: hypothetical protein ZEAMMB73_708930 [Zea mays]
Length = 368
Score = 293 bits (749), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 140/316 (44%), Positives = 196/316 (62%), Gaps = 12/316 (3%)
Query: 40 EQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDL 99
++++ +++E+W H R ++ EK RF FKEN +I NK G+R Y+L NRF D+
Sbjct: 35 DEALWDLYERWQTHH-RVHRHHGEKGRRFGTFKENARFIHAHNKRGDRPYRLRLNRFGDM 93
Query: 100 TNDEFRALYTGYKMPSPSHRSTTSST---FKYQNLSMTDVPTSLDWRDKKAVTPIKDQQE 156
+EFR+ + ++ T + F Y + TD+P S+DWR K AVT +K+Q
Sbjct: 94 GREEFRSGFADSRINDLRREPTAAPAVPGFMYDD--ATDLPRSVDWRQKGAVTAVKNQGR 151
Query: 157 CGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQG 216
CG CWAFS V AVEGI I +L+ LSEQ+L+DC T+ NGC GG ME AFE+I + G
Sbjct: 152 CGSCWAFSTVVAVEGINAIRTGSLVSLSEQELIDCDTD-ENGCQGGLMENAFEFIKSHGG 210
Query: 217 IATEDEYPYQAVQGTCSAAQ--KAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYT 274
I TE YPY A GTC A+ + I ++ VP+G E AL KAV+ QPVS+ I A
Sbjct: 211 ITTESAYPYHASNGTCDGARARRGRVVAIDGHQAVPAGSEDALAKAVAHQPVSVAIDAGG 270
Query: 275 TEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILR-- 332
+ Y EG+F G CGT LDH V VG+G ++DG YW++KNSWG +WG+ GY+++ R
Sbjct: 271 QALQFYSEGVFTGDCGTDLDHGVAAVGYGVSDDGTPYWIVKNSWGPSWGEGGYIRMQRGT 330
Query: 333 -DEGLCGIGTQSSYPL 347
+ GLCGI ++S+P+
Sbjct: 331 GNGGLCGIAMEASFPI 346
>gi|222629675|gb|EEE61807.1| hypothetical protein OsJ_16426 [Oryza sativa Japonica Group]
Length = 459
Score = 293 bits (749), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 147/324 (45%), Positives = 202/324 (62%), Gaps = 12/324 (3%)
Query: 32 VVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKAN---KEGNRT 88
+VS E+ ++ +W A+HG+SY E+E R+ F++NL YI++ N G +
Sbjct: 26 IVSYGERSEEEARRLYAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDEHNAAADAGVHS 85
Query: 89 YKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAV 148
++LG NRF+DLTN+E+R Y G + R + N ++ P S+DWR K AV
Sbjct: 86 FRLGLNRFADLTNEEYRDTYLGLRNKPRRERKVSDRYLAADNEAL---PESVDWRTKGAV 142
Query: 149 TPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAF 208
IKDQ CG CWAFSA+AAVEGI +I +LI LSEQ+LVDC T+ N GC GG M+ AF
Sbjct: 143 AEIKDQGGCGSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDTSYNEGCNGGLMDYAF 202
Query: 209 EYIIQNQGIATEDEYPYQAVQGTCSAAQK-AAAAKISNYEEVPSGDEQALLKAVSMQPVS 267
++II N GI TED+YPY+ C +K A I +YE+V E +L KAV+ QPVS
Sbjct: 203 DFIINNGGIDTEDDYPYKGKDERCDVNRKNAKVVTIDSYEDVTPNSETSLQKAVANQPVS 262
Query: 268 IGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGY 327
+ I A F+ Y GIF G CGT LDH V VG+G TE+G +YW+++NSWG +WG++GY
Sbjct: 263 VAIEAGGRAFQLYSSGIFTGKCGTALDHGVAAVGYG-TENGKDYWIVRNSWGKSWGESGY 321
Query: 328 MKILRD----EGLCGIGTQSSYPL 347
+++ R+ G CGI + SYPL
Sbjct: 322 VRMERNIKASSGKCGIAVEPSYPL 345
>gi|334904467|gb|AEH26024.1| cysteine peptidase [Ananas comosus]
Length = 352
Score = 292 bits (748), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 142/334 (42%), Positives = 208/334 (62%), Gaps = 11/334 (3%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEY 77
+F+ + L V AS +SR +++ E+WMA++GR YKD EK RF+IFK N+ +
Sbjct: 8 VFLFLFLCVMWASPSAASRDEPSDPMMKRFEEWMAEYGRVYKDNDEKMRRFQIFKNNVNH 67
Query: 78 IEKANKEGNRTYKLGTNRFSDLTNDEFRALYT-GYKMPSPSHRSTTSSTFKYQNLSMTDV 136
IE N +Y LG N+F+D+T EF A YT G P R S + +++++ V
Sbjct: 68 IETFNSHNGNSYTLGINQFTDMTKSEFVAQYTGGISRPLNIEREPVVS---FDDVNISAV 124
Query: 137 PTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGN 196
P S+DWRD AV +K+Q CG CWAF+A+A VEGI KI L+ LSEQ+++DC+ +
Sbjct: 125 PQSIDWRDYGAVNEVKNQNPCGSCWAFAAIATVEGIYKIKTGYLVSLSEQEVLDCAV--S 182
Query: 197 NGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQA 256
GC GG + KA+++II N G+ TE+ YPYQA QGTC+A +A I+ Y V DE++
Sbjct: 183 YGCKGGWVNKAYDFIISNNGVTTEENYPYQAYQGTCNANSFPNSAYITGYSYVRRNDERS 242
Query: 257 LLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKN 316
++ AVS QP++ I A + F+ Y G+F+G CGT L+HA+TI+G+G G YW+++N
Sbjct: 243 MMYAVSNQPIAALIDA-SENFQYYNGGVFSGPCGTSLNHAITIIGYGQDSSGTKYWIVRN 301
Query: 317 SWGDTWGDAGYMKILR----DEGLCGIGTQSSYP 346
SWG +WG+ GY+++ R G CGI +P
Sbjct: 302 SWGSSWGEGGYVRMARGVSSSSGACGIAMSPLFP 335
>gi|18141281|gb|AAL60578.1|AF454956_1 senescence-associated cysteine protease [Brassica oleracea]
Length = 445
Score = 292 bits (748), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 146/321 (45%), Positives = 209/321 (65%), Gaps = 12/321 (3%)
Query: 33 VSSRSTHEQSV-VEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKL 91
V++++ H V+M E+W+ ++ ++Y EK+ RF+IF +NL+++++ N N++Y+L
Sbjct: 22 VTAKADHRNPEEVKMFERWLVENHKNYNGLGEKDKRFEIFMDNLKFVQEHNSVPNQSYEL 81
Query: 92 GTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPI 151
G RF+DLTN+EFRA+Y KM R + S N+ +P +DWR K AV P+
Sbjct: 82 GLTRFADLTNEEFRAIYLRSKMERT--RDSVKSERYLHNVG-DKLPDEVDWRAKGAVVPV 138
Query: 152 KDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYI 211
KDQ CG CWAFSA+ AVEGI +I L+ LSEQ+LVDC T+ NNGCGGG M+ AF++I
Sbjct: 139 KDQGSCGSCWAFSAIGAVEGINQIKTGELVSLSEQELVDCDTSYNNGCGGGLMDYAFQFI 198
Query: 212 IQNQGIATEDEYPYQAV-QGTCSAAQK-AAAAKISNYEEVPSGDEQALLKAVSMQPVSIG 269
I N GI TE++YPY A C+ +K I YE+VP +E +L KA++ QP+S+
Sbjct: 199 ISNGGIDTEEDYPYTATDDNICNTDKKNTRVVTIDGYEDVPE-NENSLKKALANQPISVA 257
Query: 270 IAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMK 329
I A F+ YK G+F G CGT LDH V VG+GT+E G +YW+I+NSWG WG++GY+K
Sbjct: 258 IEAGGRGFQLYKSGVFTGTCGTALDHGVVAVGYGTSE-GQDYWIIRNSWGSNWGESGYIK 316
Query: 330 ILRD----EGLCGIGTQSSYP 346
+ R+ G CG+ +SYP
Sbjct: 317 LQRNIKDSSGKCGVAMMASYP 337
>gi|109939734|sp|P25776.2|ORYA_ORYSJ RecName: Full=Oryzain alpha chain; Flags: Precursor
gi|78192122|gb|ABB30151.1| oryzain alpha [Oryza sativa Japonica Group]
Length = 458
Score = 292 bits (748), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 147/324 (45%), Positives = 202/324 (62%), Gaps = 12/324 (3%)
Query: 32 VVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKAN---KEGNRT 88
+VS E+ ++ +W A+HG+SY E+E R+ F++NL YI++ N G +
Sbjct: 25 IVSYGERSEEEARRLYAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDEHNAAADAGVHS 84
Query: 89 YKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAV 148
++LG NRF+DLTN+E+R Y G + R + N ++ P S+DWR K AV
Sbjct: 85 FRLGLNRFADLTNEEYRDTYLGLRNKPRRERKVSDRYLAADNEAL---PESVDWRTKGAV 141
Query: 149 TPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAF 208
IKDQ CG CWAFSA+AAVEGI +I +LI LSEQ+LVDC T+ N GC GG M+ AF
Sbjct: 142 AEIKDQGGCGSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDTSYNEGCNGGLMDYAF 201
Query: 209 EYIIQNQGIATEDEYPYQAVQGTCSAAQK-AAAAKISNYEEVPSGDEQALLKAVSMQPVS 267
++II N GI TED+YPY+ C +K A I +YE+V E +L KAV+ QPVS
Sbjct: 202 DFIINNGGIDTEDDYPYKGKDERCDVNRKNAKVVTIDSYEDVTPNSETSLQKAVANQPVS 261
Query: 268 IGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGY 327
+ I A F+ Y GIF G CGT LDH V VG+G TE+G +YW+++NSWG +WG++GY
Sbjct: 262 VAIEAGGRAFQLYSSGIFTGKCGTALDHGVAAVGYG-TENGKDYWIVRNSWGKSWGESGY 320
Query: 328 MKILRD----EGLCGIGTQSSYPL 347
+++ R+ G CGI + SYPL
Sbjct: 321 VRMERNIKASSGKCGIAVEPSYPL 344
>gi|302790828|ref|XP_002977181.1| hypothetical protein SELMODRAFT_106402 [Selaginella moellendorffii]
gi|300155157|gb|EFJ21790.1| hypothetical protein SELMODRAFT_106402 [Selaginella moellendorffii]
Length = 337
Score = 292 bits (748), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 148/335 (44%), Positives = 209/335 (62%), Gaps = 15/335 (4%)
Query: 21 IIILLVSCASQVVSSRSTHEQS-----VVEMHEKWMAQHGRSYKDELEKEMRFKIFKENL 75
+I+L+V A+ +R + + M E W A+HG+SY + EK R IF + L
Sbjct: 6 LILLVVVGATPFAIARPAALEDGRALEIKNMFEDWAAKHGKSYSSDWEKARRLMIFSDTL 65
Query: 76 EYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTG-YKMPSPSHRSTTSSTFKYQNLSMT 134
YIEK N + N T+ LG N+FSDLTN EFRA++ G +K P R +++ ++
Sbjct: 66 AYIEKHNAQPNTTFTLGLNKFSDLTNAEFRAMHVGKFKRPRYQDRLPAED----EDVDVS 121
Query: 135 DVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTN 194
+PTSLDWR K AVTPIKDQ +CG CWAFSA+A++E ++ L+ LSEQQL+DC T
Sbjct: 122 SLPTSLDWRQKGAVTPIKDQGDCGSCWAFSAIASIESAHFLATKELVSLSEQQLMDCDTV 181
Query: 195 GNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQ-KAAAAKISNYEEVPSGD 253
+ GC GG ME AF+++++N G+ TE YPY G+C+A + K A+I+ ++ V
Sbjct: 182 -DAGCDGGLMETAFKFVVKNGGVTTEAAYPYTGSVGSCNANKAKNKVAEITGFKVVTEDS 240
Query: 254 EQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWL 313
AL+KAVS PV++ I F++YK GI +G C LDH V ++G+G TE G YW+
Sbjct: 241 ADALMKAVSKTPVTVSICGSDENFQNYKSGILSGKCDDSLDHGVLLIGYG-TEGGMPYWI 299
Query: 314 IKNSWGDTWGDAGYMKILRD--EGLCGIGTQSSYP 346
IKNSWG +WG+ G+MKI R +G+CG+ SSYP
Sbjct: 300 IKNSWGTSWGEDGFMKIERKDGDGMCGMNGDSSYP 334
>gi|2224808|emb|CAB09697.1| cysteine endopeptidase EP-A [Hordeum vulgare subsp. vulgare]
gi|326502180|dbj|BAK06781.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 365
Score = 292 bits (748), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 145/315 (46%), Positives = 197/315 (62%), Gaps = 10/315 (3%)
Query: 40 EQSVVEMHEKWMAQHGRSYKD--ELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFS 97
E+S+ ++E+W + + S + +E RF +FKEN Y+ + NK +R ++L N+F+
Sbjct: 34 EESLRGLYERWRSHYTVSRRGLGADAEERRFNVFKENARYVHEGNKR-DRPFRLALNKFA 92
Query: 98 DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD-VPTSLDWRDKKAVTPIKDQQE 156
D+T DEFR Y G ++ S + D +P ++DWR K AVT IKDQ +
Sbjct: 93 DMTTDEFRRTYAGSRVRHHLSLSGGRRGDGGFRYADADNLPPAVDWRQKGAVTAIKDQGQ 152
Query: 157 CGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQG 216
CG CWAFS + AVEGI KI L+ LSEQ+L+DC N GC GG M+ AF++I +N G
Sbjct: 153 CGSCWAFSTIVAVEGINKIRTGKLVSLSEQELMDCDNVNNQGCEGGLMDYAFQFIQKN-G 211
Query: 217 IATEDEYPYQAVQGTCSAAQK-AAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTT 275
I TE YPYQ QG+C A++ A A I YE+VP+ DE AL KAV+ QPVS+ I A
Sbjct: 212 ITTESNYPYQGEQGSCDQAKENAQAVTIDGYEDVPANDESALQKAVAGQPVSVAIDASGQ 271
Query: 276 EFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD-- 333
+F+ Y EG+F G C T LDH V VG+G T DG YW++KNSWG+ WG+ GY+++ R
Sbjct: 272 DFQFYSEGVFTGECSTDLDHGVAAVGYGATRDGTKYWIVKNSWGEDWGEKGYIRMQRGVS 331
Query: 334 --EGLCGIGTQSSYP 346
EGLCGI Q+SYP
Sbjct: 332 QTEGLCGIAMQASYP 346
>gi|449522968|ref|XP_004168497.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
Length = 348
Score = 292 bits (747), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 142/314 (45%), Positives = 210/314 (66%), Gaps = 10/314 (3%)
Query: 38 THEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFS 97
T ++E+ E+W++ HG+ Y+ EK RF++FK+NL++I++ NK+ +Y LG N F+
Sbjct: 36 TSMDRLIELFEEWISNHGKIYETIEEKWHRFEVFKDNLKHIDETNKKVT-SYWLGVNEFA 94
Query: 98 DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQEC 157
DLT+ EF+ +Y G K+ S R + F Y+++ D+P S+DWR K AVT +K+Q C
Sbjct: 95 DLTHQEFKNMYLGLKVESSRTRQSPEE-FTYKDV--VDLPKSVDWRKKGAVTRVKNQGSC 151
Query: 158 GCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGI 217
G CWAFS VAAVEGI KI G NL LSEQ+L+DC NNGC GG M+ AF +I+ + G+
Sbjct: 152 GSCWAFSTVAAVEGINKIVGGNLTSLSEQELIDCDRPYNNGCHGGLMDYAFSFIVSSGGL 211
Query: 218 ATEDEYPYQAVQGTCSAAQ-KAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTE 276
E++YPY V+ TC + + IS Y++VP +E +L+KA++ QP+S+ I A +
Sbjct: 212 HKEEDYPYLEVESTCDNKKGELEVVTISGYKDVPENNEASLIKALAHQPLSVAIEASGRD 271
Query: 277 FKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD--- 333
F+ Y G+F+G CGTQLDH VT VG+G+++ G +Y ++KNSWG WG+ GY+++ R+
Sbjct: 272 FQFYSGGVFDGPCGTQLDHGVTAVGYGSSK-GVDYIIVKNSWGPKWGEKGYIRMKRNTGK 330
Query: 334 -EGLCGIGTQSSYP 346
GLCGI +SYP
Sbjct: 331 PAGLCGINKMASYP 344
>gi|359491865|ref|XP_002273243.2| PREDICTED: xylem cysteine proteinase 1-like [Vitis vinifera]
Length = 351
Score = 292 bits (747), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 142/314 (45%), Positives = 208/314 (66%), Gaps = 10/314 (3%)
Query: 38 THEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFS 97
T + ++ E WM++HG+SY+ EK RF++F++NL++I++ NK+ + +Y LG N F+
Sbjct: 39 TSMDKLTDLFESWMSKHGKSYRSFEEKLHRFEVFQDNLKHIDETNKKVS-SYWLGLNEFA 97
Query: 98 DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQEC 157
DL+++EF+ Y G K+ P R + F Y++++ D+P S+DWR K AV +K+Q C
Sbjct: 98 DLSHEEFKRKYLGLKIELPKRRDSPEE-FSYKDVA--DLPKSVDWRKKGAVAHVKNQGAC 154
Query: 158 GCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGI 217
G CWAFS VAAVEGI +I NL LSEQ+L+DC NNGC GG M+ AF +II N G+
Sbjct: 155 GSCWAFSTVAAVEGINQIVTGNLTALSEQELIDCDKPFNNGCNGGLMDYAFAFIISNGGL 214
Query: 218 ATEDEYPYQAVQGTC-SAAQKAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTE 276
E++YPY +GTC ++ IS Y +VP +EQ+ LKA++ QP+S+ I A +
Sbjct: 215 RKEEDYPYVMEEGTCGEKKEELEVVTISGYHDVPEDNEQSFLKALANQPLSVAIEASSRG 274
Query: 277 FKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD--- 333
F+ Y GIFNG CGT+LDH V VG+GT++ G +Y +KNSWG WG+ GY+++ R+
Sbjct: 275 FQFYSGGIFNGHCGTELDHGVAAVGYGTSK-GVDYITVKNSWGSKWGEKGYIRMKRNVGK 333
Query: 334 -EGLCGIGTQSSYP 346
EG+CGI +SYP
Sbjct: 334 PEGICGIYKMASYP 347
>gi|224085750|ref|XP_002307688.1| predicted protein [Populus trichocarpa]
gi|222857137|gb|EEE94684.1| predicted protein [Populus trichocarpa]
Length = 436
Score = 292 bits (747), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 138/312 (44%), Positives = 200/312 (64%), Gaps = 18/312 (5%)
Query: 45 EMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEF 104
++ E W +HG+SY + E+ R K+F++N +++ K N +GN +Y L N F+DLT+ EF
Sbjct: 27 QLFETWCKEHGKSYTSQEERSHRLKVFEDNYDFVTKHNSKGNSSYSLALNAFADLTHHEF 86
Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLSMT----DVPTSLDWRDKKAVTPIKDQQECGCC 160
+ G S ++NL +T D+P S+DWR+K VT +KDQ CG C
Sbjct: 87 KTSRLGL--------SAAPLNLAHRNLEITGVVGDIPASIDWRNKGVVTNVKDQGSCGAC 138
Query: 161 WAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATE 220
W+FSA A+EGI KI +L+ LSEQ+L++C + N+GCGGG M+ AF+++I N GI TE
Sbjct: 139 WSFSATGAIEGINKIVTGSLVSLSEQELIECDKSYNDGCGGGLMDYAFQFVINNHGIDTE 198
Query: 221 DEYPYQAVQGTCSAAQ-KAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKS 279
++YPY+A GTC+ + K I Y +VP +E+ LL+AV+ QPVS+GI F+
Sbjct: 199 EDYPYRARDGTCNKDRMKRRVVTIDKYVDVPENNEKQLLQAVAAQPVSVGICGSERAFQM 258
Query: 280 YKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD----EG 335
Y +GIF G C T LDHAV IVG+G +E+G +YW++KNSWG WG GYM + R+ +G
Sbjct: 259 YSKGIFTGPCSTSLDHAVLIVGYG-SENGVDYWIVKNSWGTGWGMRGYMHMQRNSGNSQG 317
Query: 336 LCGIGTQSSYPL 347
+CGI +SYP+
Sbjct: 318 VCGINMLASYPV 329
>gi|357129125|ref|XP_003566217.1| PREDICTED: thiol protease SEN102-like [Brachypodium distachyon]
Length = 380
Score = 292 bits (747), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 154/322 (47%), Positives = 208/322 (64%), Gaps = 16/322 (4%)
Query: 40 EQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDL 99
E+S+ ++E+W A+H S +D EK RF +F+EN + + N + YKL NRF+DL
Sbjct: 42 EESLWALYERWRARHTVS-RDLAEKSRRFNVFRENARLVHEFNLRRDAPYKLRLNRFADL 100
Query: 100 TNDEFRALY-----TGYKMPSPSHRSTTSSTFKYQNLSMTD---VPTSLDWRDKKAVTPI 151
T+DEFR Y + ++M P + + + S T +PTS+DWR+K AVT +
Sbjct: 101 TSDEFRRSYASSRVSHHRMFKP-RAANNNDDDDDKGSSFTHGGALPTSVDWREKGAVTGV 159
Query: 152 KDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYI 211
KDQ +CG CWAFS +AAVEGI I NL LSEQQLVDC T N GC GG M+ AF YI
Sbjct: 160 KDQGQCGSCWAFSTIAAVEGINAIRTNNLTSLSEQQLVDCDTKTNAGCDGGLMDDAFSYI 219
Query: 212 IQNQGIATEDEYPYQAVQGTCSAAQKAAAAKIS--NYEEVPSGDEQALLKAVSMQPVSIG 269
++ G+A E YPY+A Q + ++KAAAA +S YE+VP DE AL KAV+ QPV++
Sbjct: 220 AKHGGVAAEKSYPYRARQSSSCNSKKAAAAVVSIDGYEDVPRNDETALKKAVAAQPVAVA 279
Query: 270 IAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMK 329
I A + F+ Y EG+F G CGT+LDH V VG+G T DG YW++KNSWG+ WG+ GY++
Sbjct: 280 IEAGGSHFQFYSEGVFAGKCGTELDHGVAAVGYGVTVDGTKYWIVKNSWGEEWGEKGYIR 339
Query: 330 ILRD----EGLCGIGTQSSYPL 347
+ RD EGLCGI ++SYP+
Sbjct: 340 MKRDVADKEGLCGIAMEASYPV 361
>gi|46395939|sp|Q94B08.2|GCP1_ARATH RecName: Full=Germination-specific cysteine protease 1; Flags:
Precursor
gi|4006883|emb|CAB16767.1| cysteine proteinase [Arabidopsis thaliana]
gi|7270637|emb|CAB80354.1| cysteine proteinase [Arabidopsis thaliana]
Length = 376
Score = 292 bits (747), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 147/322 (45%), Positives = 209/322 (64%), Gaps = 16/322 (4%)
Query: 40 EQSVVEMHEKWMAQHGRSYKDEL----EKEMRFKIFKENLEYIEKANKEG-NRTYKLGTN 94
++ V ++ +W A+HG++ + +++ RF IFK+NL +I+ N++ N TYKLG
Sbjct: 42 DEEVRSIYLQWSAEHGKTNNNNNGIINDQDKRFNIFKDNLRFIDLHNEDNKNATYKLGLT 101
Query: 95 RFSDLTNDEFRALYTGYKMPSPSHRSTTSSTF--KYQN-LSMTDVPTSLDWRDKKAVTPI 151
+F+DLTNDE+R LY G + P+ R + KY ++ +VP ++DWR K AV PI
Sbjct: 102 KFTDLTNDEYRKLYLGART-EPARRIAKAKNVNQKYSAAVNGKEVPETVDWRQKGAVNPI 160
Query: 152 KDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYI 211
KDQ CG CWAFS AAVEGI KI LI LSEQ+LVDC + N GC GG M+ AF++I
Sbjct: 161 KDQGTCGSCWAFSTTAAVEGINKIVTGELISLSEQELVDCDKSYNQGCNGGLMDYAFQFI 220
Query: 212 IQNQGIATEDEYPYQAVQGTCSAAQK-AAAAKISNYEEVPSGDEQALLKAVSMQPVSIGI 270
++N G+ TE +YPY+ G C++ K + I YE+VP+ DE AL KA+S QPVS+ I
Sbjct: 221 MKNGGLNTEKDYPYRGFGGKCNSFLKNSRVVSIDGYEDVPTKDETALKKAISYQPVSVAI 280
Query: 271 AAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKI 330
A F+ Y+ GIF G CGT LDHAV VG+G +E+G +YW+++NSWG WG+ GY+++
Sbjct: 281 EAGGRIFQHYQSGIFTGSCGTNLDHAVVAVGYG-SENGVDYWIVRNSWGPRWGEEGYIRM 339
Query: 331 LRD-----EGLCGIGTQSSYPL 347
R+ G CGI ++SYP+
Sbjct: 340 ERNLAASKSGKCGIAVEASYPV 361
>gi|449455625|ref|XP_004145553.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
Length = 351
Score = 291 bits (746), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 142/314 (45%), Positives = 210/314 (66%), Gaps = 10/314 (3%)
Query: 38 THEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFS 97
T ++E+ E+W++ HG+ Y+ EK RF++FK+NL++I++ NK+ +Y LG N F+
Sbjct: 39 TSMDRLIELFEEWISNHGKIYETIEEKWHRFEVFKDNLKHIDETNKKVT-SYWLGVNEFA 97
Query: 98 DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQEC 157
DLT+ EF+ +Y G K+ S R + F Y+++ D+P S+DWR K AVT +K+Q C
Sbjct: 98 DLTHQEFKNMYLGLKVESSRTRQSPEE-FTYKDV--VDLPKSVDWRKKGAVTRVKNQGSC 154
Query: 158 GCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGI 217
G CWAFS VAAVEGI KI G NL LSEQ+L+DC NNGC GG M+ AF +I+ + G+
Sbjct: 155 GSCWAFSTVAAVEGINKIVGGNLTSLSEQELIDCDRPYNNGCHGGLMDYAFSFIVSSGGL 214
Query: 218 ATEDEYPYQAVQGTCSAAQ-KAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTE 276
E++YPY V+ TC + + IS Y++VP +E +L+KA++ QP+S+ I A +
Sbjct: 215 HKEEDYPYLEVESTCDNKKGELEVVTISGYKDVPENNEASLIKALAHQPLSVAIEASGRD 274
Query: 277 FKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD--- 333
F+ Y G+F+G CGTQLDH VT VG+G+++ G +Y ++KNSWG WG+ GY+++ R+
Sbjct: 275 FQFYSGGVFDGPCGTQLDHGVTAVGYGSSK-GVDYIIVKNSWGPKWGEKGYIRMKRNTGK 333
Query: 334 -EGLCGIGTQSSYP 346
GLCGI +SYP
Sbjct: 334 PAGLCGINKMASYP 347
>gi|218195711|gb|EEC78138.1| hypothetical protein OsI_17694 [Oryza sativa Indica Group]
Length = 458
Score = 291 bits (745), Expect = 3e-76, Method: Compositional matrix adjust.
Identities = 146/324 (45%), Positives = 202/324 (62%), Gaps = 12/324 (3%)
Query: 32 VVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKAN---KEGNRT 88
+VS E+ ++ +W A+HG++Y E+E R+ F++NL YI++ N G +
Sbjct: 25 IVSYGERSEEEARRLYAEWKAEHGKNYNAVGEEERRYAAFRDNLRYIDEHNAAADAGVHS 84
Query: 89 YKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAV 148
++LG NRF+DLTN+E+R Y G + R + N ++ P S+DWR K AV
Sbjct: 85 FRLGLNRFADLTNEEYRDTYLGLRNKPRRERKVSDRYLAADNEAL---PESVDWRTKGAV 141
Query: 149 TPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAF 208
IKDQ CG CWAFSA+AAVEGI +I +LI LSEQ+LVDC T+ N GC GG M+ AF
Sbjct: 142 AEIKDQGGCGSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDTSYNEGCNGGLMDYAF 201
Query: 209 EYIIQNQGIATEDEYPYQAVQGTCSAAQK-AAAAKISNYEEVPSGDEQALLKAVSMQPVS 267
++II N GI TED+YPY+ C +K A I +YE+V E +L KAV+ QPVS
Sbjct: 202 DFIINNGGIDTEDDYPYKGKDERCDVNRKNAKVVTIDSYEDVTPNSETSLQKAVANQPVS 261
Query: 268 IGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGY 327
+ I A F+ Y GIF G CGT LDH V VG+G TE+G +YW+++NSWG +WG++GY
Sbjct: 262 VAIEAGGRAFQLYSSGIFTGKCGTALDHGVAAVGYG-TENGKDYWIVRNSWGKSWGESGY 320
Query: 328 MKILRD----EGLCGIGTQSSYPL 347
+++ R+ G CGI + SYPL
Sbjct: 321 VRMERNIKASSGKCGIAVEPSYPL 344
>gi|186516984|ref|NP_195406.2| cysteine proteinase1 [Arabidopsis thaliana]
gi|15290508|gb|AAK92229.1| cysteine proteinase [Arabidopsis thaliana]
gi|332661313|gb|AEE86713.1| cysteine proteinase1 [Arabidopsis thaliana]
Length = 376
Score = 291 bits (744), Expect = 4e-76, Method: Compositional matrix adjust.
Identities = 147/322 (45%), Positives = 208/322 (64%), Gaps = 16/322 (4%)
Query: 40 EQSVVEMHEKWMAQHGRSYKDEL----EKEMRFKIFKENLEYIEKANKEG-NRTYKLGTN 94
++ V ++ +W A+HG++ + +++ RF IFK+NL +I+ N+ N TYKLG
Sbjct: 42 DEEVRSIYLQWSAEHGKTNNNNNGIINDQDKRFNIFKDNLRFIDLHNENNKNATYKLGLT 101
Query: 95 RFSDLTNDEFRALYTGYKMPSPSHRSTTSSTF--KYQN-LSMTDVPTSLDWRDKKAVTPI 151
+F+DLTNDE+R LY G + P+ R + KY ++ +VP ++DWR K AV PI
Sbjct: 102 KFTDLTNDEYRKLYLGART-EPARRIAKAKNVNQKYSAAVNGKEVPETVDWRQKGAVNPI 160
Query: 152 KDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYI 211
KDQ CG CWAFS AAVEGI KI LI LSEQ+LVDC + N GC GG M+ AF++I
Sbjct: 161 KDQGTCGSCWAFSTTAAVEGINKIVTGELISLSEQELVDCDKSYNQGCNGGLMDYAFQFI 220
Query: 212 IQNQGIATEDEYPYQAVQGTCSAAQK-AAAAKISNYEEVPSGDEQALLKAVSMQPVSIGI 270
++N G+ TE +YPY+ G C++ K + I YE+VP+ DE AL KA+S QPVS+ I
Sbjct: 221 MKNGGLNTEKDYPYRGFGGKCNSFLKNSRVVSIDGYEDVPTKDETALKKAISYQPVSVAI 280
Query: 271 AAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKI 330
A F+ Y+ GIF G CGT LDHAV VG+G +E+G +YW+++NSWG WG+ GY+++
Sbjct: 281 EAGGRIFQHYQSGIFTGSCGTNLDHAVVAVGYG-SENGVDYWIVRNSWGPRWGEEGYIRM 339
Query: 331 LRD-----EGLCGIGTQSSYPL 347
R+ G CGI ++SYP+
Sbjct: 340 ERNLAASKSGKCGIAVEASYPV 361
>gi|351629615|gb|AEQ54771.1| KDDL-tailed cysteine proteinase CP4 [Coffea canephora]
Length = 359
Score = 291 bits (744), Expect = 4e-76, Method: Compositional matrix adjust.
Identities = 149/340 (43%), Positives = 216/340 (63%), Gaps = 20/340 (5%)
Query: 20 IIIILLVSCASQVVSSRS-THEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYI 78
++ ++LV+ S ++ R E+S+ +++E+W + H S +D EK RF +FK N+ +I
Sbjct: 12 VLAVILVAAMSMEITERDLASEESLWDLYERWRSHHTVS-RDLSEKRKRFNVFKANVHHI 70
Query: 79 EKANKEGNRTYKLGTNRFSDLTNDEFRALYTG----YKMPSPSHRSTTSSTFKYQNLSMT 134
K N++ ++ YKL N F+D+TN EFR Y+ Y+M S +T K ++L
Sbjct: 71 HKVNQK-DKPYKLKLNSFADMTNHEFREFYSSKVKHYRMLHGSRANTGFMHGKTESL--- 126
Query: 135 DVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTN 194
P S+DWR + AVT +K+Q +CG CWAFS V VEGI KI L+ LSEQ+LVDC T+
Sbjct: 127 --PASVDWRKQGAVTGVKNQGKCGSCWAFSTVVGVEGINKIKTGQLVSLSEQELVDCETD 184
Query: 195 GNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTC-SAAQKAAAAKISNYEEVPSGD 253
N GC GG ME A+E+I ++ GI TE YPY+A G+C S+ A A I +E VP+ D
Sbjct: 185 -NEGCNGGLMENAYEFIKKSGGITTERLYPYKARDGSCDSSKMNAPAVTIDGHEMVPAND 243
Query: 254 EQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNG-VCGTQLDHAVTIVGFGTTEDGANYW 312
E AL+KAV+ QPVS+ I A ++ + Y EG++ G CG +LDH V +VG+GT DG YW
Sbjct: 244 ENALMKAVANQPVSVAIDASGSDMQFYSEGVYAGDSCGNELDHGVAVVGYGTALDGTKYW 303
Query: 313 LIKNSWGDTWGDAGYMKILR-----DEGLCGIGTQSSYPL 347
++KNSWG WG+ GY+++ R + G+CGI ++SYPL
Sbjct: 304 IVKNSWGTGWGEQGYIRMQRGVDAAEGGVCGIAMEASYPL 343
>gi|600111|emb|CAA84378.1| cysteine proteinase [Vicia sativa]
Length = 359
Score = 291 bits (744), Expect = 4e-76, Method: Compositional matrix adjust.
Identities = 156/345 (45%), Positives = 211/345 (61%), Gaps = 16/345 (4%)
Query: 12 KINTIPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIF 71
K+ I + + +I V+ E+S+ ++E+W + H + ++ EK RF +F
Sbjct: 5 KLLFISLSLALIFTVANTFDFNEHDLESEKSLWNLYERWRSHHTVT-RNLDEKHNRFNVF 63
Query: 72 KENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHR-----STTSSTF 126
K N+ ++ NK ++ YKL N+F D+TN EFR +Y K+ HR S + TF
Sbjct: 64 KANVMHVHNTNKL-DKPYKLKLNKFGDMTNYEFRRIYADSKISH--HRMFRGMSHENGTF 120
Query: 127 KYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQ 186
Y+N DVP+S+DWR+K AVT +KDQ +CG CWAFS +AAVEGI +I L+ LSEQ
Sbjct: 121 MYEN--AVDVPSSIDWRNKGAVTGVKDQGQCGSCWAFSTIAAVEGINQIKTQKLVSLSEQ 178
Query: 187 QLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNY 246
QLVDC T N GC GG ME AFE+I QN GI TE YPY A GTC ++ A I +
Sbjct: 179 QLVDCDTEENEGCNGGLMEYAFEFIKQN-GITTESNYPYAAKDGTCDVEKEDKAVSIDGH 237
Query: 247 EEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTE 306
E VP +E ALLKA + QPVS+ I A F+ Y EG+F G C T L+H V IVG+G T+
Sbjct: 238 ENVPINNEAALLKAAAKQPVSVAIDAGGYNFQFYSEGVFTGHCDTDLNHGVAIVGYGVTQ 297
Query: 307 DGANYWLIKNSWGDTWGDAGYMKILR----DEGLCGIGTQSSYPL 347
D YW++KNSWG WG+ GY+++ R EGLCGI ++SYP+
Sbjct: 298 DRTKYWIMKNSWGSEWGEQGYIRMQRGISSREGLCGIAMEASYPI 342
>gi|317106666|dbj|BAJ53169.1| JHL18I08.3 [Jatropha curcas]
Length = 441
Score = 291 bits (744), Expect = 4e-76, Method: Compositional matrix adjust.
Identities = 139/309 (44%), Positives = 195/309 (63%), Gaps = 7/309 (2%)
Query: 43 VVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTND 102
+ + E W QHG++Y + EK R K+F++N +++ + N +GN +Y L N F+DLT+
Sbjct: 26 IAHLFETWCQQHGKTYASQEEKLFRLKVFQDNYDFVTEHNSQGNSSYTLSLNAFADLTHH 85
Query: 103 EFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWA 162
EF+A G + + + S + + + DVP S+DWR AVT +KDQ CG CW+
Sbjct: 86 EFKASRLGLSSAASASLNVDRSNRQIPDF-VADVPASVDWRKNGAVTQVKDQGNCGACWS 144
Query: 163 FSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDE 222
FSA A+EGI KI +L+ LSEQ+LVDC + NNGC GG M+ AF+++I N GI TE++
Sbjct: 145 FSATGAIEGINKIVTGSLVSLSEQELVDCDKSYNNGCEGGIMDYAFQFVIDNHGIDTEED 204
Query: 223 YPYQAVQGTCSAAQ-KAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYK 281
YPYQ +C+ + K I Y +VP +E+ LLKAV+ QPVS+GI F+ Y
Sbjct: 205 YPYQGRDRSCNKEKLKRHVVTIDGYVDVPQNNEKELLKAVANQPVSVGICGSERAFQLYS 264
Query: 282 EGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD----EGLC 337
+GIF G C T LDHAV IVG+G +E+G +YW++KNSWG WG GYM + R+ GLC
Sbjct: 265 KGIFTGPCSTSLDHAVLIVGYG-SENGVDYWIVKNSWGSYWGMDGYMHMQRNSGSSRGLC 323
Query: 338 GIGTQSSYP 346
GI +SYP
Sbjct: 324 GINMLASYP 332
>gi|356508490|ref|XP_003522989.1| PREDICTED: xylem cysteine proteinase 2-like [Glycine max]
Length = 349
Score = 291 bits (744), Expect = 4e-76, Method: Compositional matrix adjust.
Identities = 140/309 (45%), Positives = 206/309 (66%), Gaps = 11/309 (3%)
Query: 43 VVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTND 102
++E+ E WM++HG+ Y+ EK +RF+IFK+NL++I++ NK + Y LG N F+DL++
Sbjct: 43 LIELFESWMSKHGKIYQSIEEKLLRFEIFKDNLKHIDERNKVVS-NYWLGLNEFADLSHQ 101
Query: 103 EFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWA 162
EF+ Y G K+ S R + F Y+++ ++P S+DWR K AV P+K+Q CG CWA
Sbjct: 102 EFKNKYLGLKVDY-SRRRESPEEFTYKDV---ELPKSVDWRKKGAVAPVKNQGSCGSCWA 157
Query: 163 FSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDE 222
FS VAAVEGI +I NL LSEQ+L+DC NNGC GG M+ AF +I++N G+ E++
Sbjct: 158 FSTVAAVEGINQIVTGNLTSLSEQELIDCDRTYNNGCNGGLMDYAFSFIVENGGLHKEED 217
Query: 223 YPYQAVQGTCS-AAQKAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYK 281
YPY +GTC ++ IS Y +VP +EQ+LLKA++ QP+S+ I A +F+ Y
Sbjct: 218 YPYIMEEGTCEMTKEETEVVTISGYHDVPQNNEQSLLKALANQPLSVAIEASGRDFQFYS 277
Query: 282 EGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD----EGLC 337
G+F+G CG+ LDH V VG+GT + G +Y ++KNSWG WG+ GY+++ R+ EG+C
Sbjct: 278 GGVFDGHCGSDLDHGVAAVGYGTAK-GVDYIIVKNSWGSKWGEKGYIRMRRNIGKPEGIC 336
Query: 338 GIGTQSSYP 346
GI +SYP
Sbjct: 337 GIYKMASYP 345
>gi|297802228|ref|XP_002868998.1| cysteine proteinase [Arabidopsis lyrata subsp. lyrata]
gi|297314834|gb|EFH45257.1| cysteine proteinase [Arabidopsis lyrata subsp. lyrata]
Length = 375
Score = 291 bits (744), Expect = 4e-76, Method: Compositional matrix adjust.
Identities = 146/322 (45%), Positives = 206/322 (63%), Gaps = 16/322 (4%)
Query: 40 EQSVVEMHEKWMAQHGRSYKDEL----EKEMRFKIFKENLEYIEKAN-KEGNRTYKLGTN 94
++ V ++ +W A HG++ + +++ RF IFK+NL +I+ N K N TYKLG
Sbjct: 42 DEEVRSIYLQWSADHGKTNNNNNGIINDQDKRFNIFKDNLRFIDLHNEKNKNATYKLGLT 101
Query: 95 RFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD---VPTSLDWRDKKAVTPI 151
+F+DLTN+E+R+LY G + P R + + + D VP ++DWR K AV PI
Sbjct: 102 KFTDLTNEEYRSLYLGART-EPVRRIAKAKNVNQKYSAAVDGKEVPETVDWRLKGAVNPI 160
Query: 152 KDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYI 211
KDQ CG CWAFS AAVEGI KI LI LSEQ+LVDC + N GC GG M+ AF++I
Sbjct: 161 KDQGTCGSCWAFSTAAAVEGINKIVTGELISLSEQELVDCDNSYNQGCNGGLMDYAFQFI 220
Query: 212 IQNQGIATEDEYPYQAVQGTCSAAQK-AAAAKISNYEEVPSGDEQALLKAVSMQPVSIGI 270
++N G+ TE +YPY+ G C++ K A I YE+VP+ DE AL +A+S+QPVS+ I
Sbjct: 221 MKNGGLKTEKDYPYRGFGGKCNSFLKNAKVVSIDGYEDVPTKDETALKRAISLQPVSVAI 280
Query: 271 AAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKI 330
A F+ Y+ GIF G CGT LDHAV VG+G +E+G +YW+++NSWG WG+ GY+++
Sbjct: 281 EAGGRIFQHYQTGIFTGNCGTNLDHAVVAVGYG-SENGVDYWIVRNSWGPRWGEEGYIRM 339
Query: 331 LRD-----EGLCGIGTQSSYPL 347
R+ G CGI ++SYP+
Sbjct: 340 ERNLASSKSGKCGIAVEASYPV 361
>gi|18394919|ref|NP_564126.1| Xylem cysteine proteinase 2 [Arabidopsis thaliana]
gi|71153409|sp|Q9LM66.2|XCP2_ARATH RecName: Full=Xylem cysteine proteinase 2; Short=AtXCP2; Flags:
Precursor
gi|4836904|gb|AAD30607.1|AC007369_17 Putative cysteine proteinase [Arabidopsis thaliana]
gi|6708183|gb|AAF25832.1|AF191028_1 papain-type cysteine endopeptidase XCP2 [Arabidopsis thaliana]
gi|28466959|gb|AAO44088.1| At1g20850 [Arabidopsis thaliana]
gi|110743795|dbj|BAE99733.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|332191910|gb|AEE30031.1| Xylem cysteine proteinase 2 [Arabidopsis thaliana]
Length = 356
Score = 290 bits (742), Expect = 7e-76, Method: Compositional matrix adjust.
Identities = 139/315 (44%), Positives = 215/315 (68%), Gaps = 11/315 (3%)
Query: 38 THEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFS 97
+H++ ++E+ E W++ ++Y+ EK +RF++FK+NL++I++ NK+G ++Y LG N F+
Sbjct: 43 SHDK-LIELFENWISNFEKAYETVEEKFLRFEVFKDNLKHIDETNKKG-KSYWLGLNEFA 100
Query: 98 DLTNDEFRALYTGYKMPSPSHRSTTS-STFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQE 156
DL+++EF+ +Y G K S + F Y+++ VP S+DWR K AV +K+Q
Sbjct: 101 DLSHEEFKKMYLGLKTDIVRRDEERSYAEFAYRDVEA--VPKSVDWRKKGAVAEVKNQGS 158
Query: 157 CGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQG 216
CG CWAFS VAAVEGI KI NL LSEQ+L+DC T NNGC GG M+ AFEYI++N G
Sbjct: 159 CGSCWAFSTVAAVEGINKIVTGNLTTLSEQELIDCDTTYNNGCNGGLMDYAFEYIVKNGG 218
Query: 217 IATEDEYPYQAVQGTCSAAQ-KAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTT 275
+ E++YPY +GTC + ++ I+ +++VP+ DE++LLKA++ QP+S+ I A
Sbjct: 219 LRKEEDYPYSMEEGTCEMQKDESETVTINGHQDVPTNDEKSLLKALAHQPLSVAIDASGR 278
Query: 276 EFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD-- 333
EF+ Y G+F+G CG LDH V VG+G+++ G++Y ++KNSWG WG+ GY+++ R+
Sbjct: 279 EFQFYSGGVFDGRCGVDLDHGVAAVGYGSSK-GSDYIIVKNSWGPKWGEKGYIRLKRNTG 337
Query: 334 --EGLCGIGTQSSYP 346
EGLCGI +S+P
Sbjct: 338 KPEGLCGINKMASFP 352
>gi|38345906|emb|CAE04498.2| OSJNBb0059K02.8 [Oryza sativa Japonica Group]
Length = 458
Score = 290 bits (742), Expect = 7e-76, Method: Compositional matrix adjust.
Identities = 146/324 (45%), Positives = 202/324 (62%), Gaps = 12/324 (3%)
Query: 32 VVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKAN---KEGNRT 88
+VS E+ ++ +W A+HG+SY E+E R+ F++NL YI++ N G +
Sbjct: 25 IVSYGERSEEEARRLYAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDEHNAAADAGVHS 84
Query: 89 YKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAV 148
++LG NRF+DLTN+E+R Y G + R + N ++ P S+DWR K AV
Sbjct: 85 FRLGLNRFADLTNEEYRDTYLGLRNKPRRERKVSDRYLAADNEAL---PESVDWRTKGAV 141
Query: 149 TPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAF 208
IKDQ+ G CWAFSA+AAVEGI +I +LI LSEQ+LVDC T+ N GC GG M+ AF
Sbjct: 142 AEIKDQEVAGSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDTSYNEGCNGGLMDYAF 201
Query: 209 EYIIQNQGIATEDEYPYQAVQGTCSAAQK-AAAAKISNYEEVPSGDEQALLKAVSMQPVS 267
++II N GI TED+YPY+ C +K A I +YE+V E +L KAV+ QPVS
Sbjct: 202 DFIINNGGIDTEDDYPYKGKDERCDVNRKNAKVVTIDSYEDVTPNSETSLQKAVANQPVS 261
Query: 268 IGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGY 327
+ I A F+ Y GIF G CGT LDH V VG+G TE+G +YW+++NSWG +WG++GY
Sbjct: 262 VAIEAGGRAFQLYSSGIFTGKCGTALDHGVAAVGYG-TENGKDYWIVRNSWGKSWGESGY 320
Query: 328 MKILRD----EGLCGIGTQSSYPL 347
+++ R+ G CGI + SYPL
Sbjct: 321 VRMERNIKASSGKCGIAVEPSYPL 344
>gi|413951606|gb|AFW84255.1| hypothetical protein ZEAMMB73_933931 [Zea mays]
Length = 379
Score = 290 bits (742), Expect = 8e-76, Method: Compositional matrix adjust.
Identities = 144/355 (40%), Positives = 207/355 (58%), Gaps = 23/355 (6%)
Query: 12 KINTIPMFIIIILLVSCASQVVSSRSTHEQSVV------EMHEKWMAQHGRSYKDELEKE 65
+++ + + ++ + S A ++ + E+ + +++E+W H R ++ EK
Sbjct: 3 QVSKTLLLVALVFVSSAAVELCRAIDFDERDLASDEALWDLYERWQTHH-RVHRHHGEKG 61
Query: 66 MRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKM------PSPSHR 119
RF FKEN+ +I NK G+R Y+L NRF D+ +EFR+ + ++ SP+ R
Sbjct: 62 RRFGTFKENVRFIHAHNKRGDRPYRLRLNRFGDMGREEFRSTFADSRINDLRRQDSPAAR 121
Query: 120 STTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGAN 179
+ F Y S D P S+DWR + AVT +KDQ CG CWAFS V AVEGI I +
Sbjct: 122 AGAVPGFMYD--SAADPPRSVDWRQEGAVTGVKDQGHCGSCWAFSTVVAVEGINAIRTGS 179
Query: 180 LIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQ--- 236
L LSEQ+L+DC T+ NGC GG ME AFE+I GI TE YPY+A GTC +
Sbjct: 180 LASLSEQELIDCDTD-ENGCQGGLMENAFEFIKSFGGITTEAAYPYRASNGTCDGDRARR 238
Query: 237 -KAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDH 295
I ++ VP+G E AL KAV+ QPVS+ + A F+ Y EG+F G CGT LDH
Sbjct: 239 GGGVVVVIDGHQMVPAGSEDALAKAVAHQPVSVAVDAGGQAFQFYSEGVFTGDCGTDLDH 298
Query: 296 AVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILR---DEGLCGIGTQSSYPL 347
V VG+G +DG YW++KNSWG +WG+ GY+++ R + GLCGI ++S+P+
Sbjct: 299 GVAAVGYGVGDDGTPYWIVKNSWGTSWGEGGYIRMQRGAGNGGLCGIAMEASFPI 353
>gi|357156854|ref|XP_003577598.1| PREDICTED: thiol protease SEN102-like [Brachypodium distachyon]
Length = 368
Score = 290 bits (741), Expect = 8e-76, Method: Compositional matrix adjust.
Identities = 143/326 (43%), Positives = 205/326 (62%), Gaps = 21/326 (6%)
Query: 40 EQSVVEMHEKWMAQH-------GRSYKDEL---EKEMRFKIFKENLEYIEKANKEGNRTY 89
E+S+ ++E+W +++ G + +L + RF +FKEN++YI +ANK+ +R +
Sbjct: 31 EESLRGLYERWRSRYTVSPSTPGSGLRGKLADHDPARRFNVFKENVKYIHEANKK-DRPF 89
Query: 90 KLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD---VPTSLDWRDKK 146
+L N+F+D+T DE R Y G ++ HR+ + N + +D +P ++DWR+K
Sbjct: 90 RLALNKFADMTTDELRHSYAGSRVRH--HRALSGGRRAQGNFTYSDAENLPPAVDWREKG 147
Query: 147 AVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEK 206
AVT IKDQ +CG CWAFS +AAVE I KI L+ LSEQ+L+DC + GC GG M+
Sbjct: 148 AVTGIKDQGQCGSCWAFSTIAAVESINKIRTGKLVSLSEQELMDCDNVNDQGCDGGLMDY 207
Query: 207 AFEYIIQNQGIATEDEYPYQAVQGTCSAAQK-AAAAKISNYEEVPSGDEQALLKAVSMQP 265
AF++I +N G+ +E YPYQ Q TC A++ I YE+VP+ DE AL KAV+ QP
Sbjct: 208 AFQFIQKNGGVTSEANYPYQGQQNTCDQAKENTHDVAIDGYEDVPANDESALQKAVAYQP 267
Query: 266 VSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDA 325
VS+ I A +F+ Y EG+F G C T LDH V VG+GT DG YW++KNSWG WG+
Sbjct: 268 VSVAIEASGQDFQFYSEGVFTGQCTTDLDHGVAAVGYGTARDGTKYWIVKNSWGLDWGEK 327
Query: 326 GYMKILRD----EGLCGIGTQSSYPL 347
GY+++ R EGLCGI Q+SYP+
Sbjct: 328 GYIRMQRGVSQAEGLCGIAMQASYPI 353
>gi|413951605|gb|AFW84254.1| hypothetical protein ZEAMMB73_933931 [Zea mays]
Length = 423
Score = 290 bits (741), Expect = 9e-76, Method: Compositional matrix adjust.
Identities = 144/355 (40%), Positives = 207/355 (58%), Gaps = 23/355 (6%)
Query: 12 KINTIPMFIIIILLVSCASQVVSSRSTHEQSVV------EMHEKWMAQHGRSYKDELEKE 65
+++ + + ++ + S A ++ + E+ + +++E+W H R ++ EK
Sbjct: 47 QVSKTLLLVALVFVSSAAVELCRAIDFDERDLASDEALWDLYERWQTHH-RVHRHHGEKG 105
Query: 66 MRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKM------PSPSHR 119
RF FKEN+ +I NK G+R Y+L NRF D+ +EFR+ + ++ SP+ R
Sbjct: 106 RRFGTFKENVRFIHAHNKRGDRPYRLRLNRFGDMGREEFRSTFADSRINDLRRQDSPAAR 165
Query: 120 STTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGAN 179
+ F Y S D P S+DWR + AVT +KDQ CG CWAFS V AVEGI I +
Sbjct: 166 AGAVPGFMYD--SAADPPRSVDWRQEGAVTGVKDQGHCGSCWAFSTVVAVEGINAIRTGS 223
Query: 180 LIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQ--- 236
L LSEQ+L+DC T+ NGC GG ME AFE+I GI TE YPY+A GTC +
Sbjct: 224 LASLSEQELIDCDTD-ENGCQGGLMENAFEFIKSFGGITTEAAYPYRASNGTCDGDRARR 282
Query: 237 -KAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDH 295
I ++ VP+G E AL KAV+ QPVS+ + A F+ Y EG+F G CGT LDH
Sbjct: 283 GGGVVVVIDGHQMVPAGSEDALAKAVAHQPVSVAVDAGGQAFQFYSEGVFTGDCGTDLDH 342
Query: 296 AVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILR---DEGLCGIGTQSSYPL 347
V VG+G +DG YW++KNSWG +WG+ GY+++ R + GLCGI ++S+P+
Sbjct: 343 GVAAVGYGVGDDGTPYWIVKNSWGTSWGEGGYIRMQRGAGNGGLCGIAMEASFPI 397
>gi|37780039|gb|AAP32192.1| cysteine protease 14 [Trifolium repens]
Length = 351
Score = 290 bits (741), Expect = 9e-76, Method: Compositional matrix adjust.
Identities = 139/310 (44%), Positives = 209/310 (67%), Gaps = 11/310 (3%)
Query: 43 VVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTND 102
++E+ E WM++HG+ Y+ EK +RF++FK+NL++I+ NK + Y LG N F+DL++
Sbjct: 43 LIELFESWMSRHGKIYETIEEKLLRFEVFKDNLKHIDDRNKIVS-NYWLGLNEFADLSHQ 101
Query: 103 EFRALYTGYKMPSPSHR-STTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCW 161
EF+ Y G K+ R S+ F Y+++ D+P S+DWR K AVTP+K+Q +CG CW
Sbjct: 102 EFKNKYLGLKVDLSQRRESSNEEEFTYRDV---DLPKSVDWRKKGAVTPVKNQGQCGSCW 158
Query: 162 AFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATED 221
AFS VAAVEGI +I NL LSEQ+L+DC T NNGC GG M+ AF +I QN G+ E+
Sbjct: 159 AFSTVAAVEGINQIVTGNLTSLSEQELIDCDTTYNNGCNGGLMDYAFSFIGQNGGLHKEE 218
Query: 222 EYPYQAVQGTCS-AAQKAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSY 280
+YPY + TC ++ I+ Y +VP +EQ+LLKA++ QP+S+ I A + +F+ Y
Sbjct: 219 DYPYIMEESTCEMKKEETQVVTINGYHDVPQNNEQSLLKALANQPLSVAIEASSRDFQFY 278
Query: 281 KEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD----EGL 336
G+F+G CG+ LDH V+ VG+GT+++ +Y ++KNSWG WG+ G++++ RD EG+
Sbjct: 279 SGGVFDGHCGSDLDHGVSAVGYGTSKN-LDYIIVKNSWGAKWGEKGFIRMKRDIGKPEGI 337
Query: 337 CGIGTQSSYP 346
CG+ +SYP
Sbjct: 338 CGLYKMASYP 347
>gi|302763837|ref|XP_002965340.1| hypothetical protein SELMODRAFT_143126 [Selaginella moellendorffii]
gi|302790566|ref|XP_002977050.1| hypothetical protein SELMODRAFT_232903 [Selaginella moellendorffii]
gi|300155026|gb|EFJ21659.1| hypothetical protein SELMODRAFT_232903 [Selaginella moellendorffii]
gi|300167573|gb|EFJ34178.1| hypothetical protein SELMODRAFT_143126 [Selaginella moellendorffii]
Length = 300
Score = 289 bits (740), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 142/303 (46%), Positives = 198/303 (65%), Gaps = 8/303 (2%)
Query: 46 MHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFR 105
M E W A+HG+SY + EK R IF + L YIEK N N T+ LG N+FSDLTN EFR
Sbjct: 1 MFEGWAAKHGKSYSSDWEKARRLMIFSDTLAYIEKHNALPNTTFTLGLNKFSDLTNAEFR 60
Query: 106 ALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSA 165
A Y G K P ++ + K ++ ++ +PTSLDWR + AVTPIKDQ +CG CWAFSA
Sbjct: 61 ANYVG-KFKPPRYQDRRPA--KDVDVDVSSLPTSLDWRQEGAVTPIKDQGQCGSCWAFSA 117
Query: 166 VAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPY 225
+A++E ++ L+ LSEQQL+DC T + GC GG E AF+++++N G+ TE+ YPY
Sbjct: 118 IASIESAHFLATKELVSLSEQQLIDCDTV-DQGCQGGFPEDAFKFVVENGGVTTEEAYPY 176
Query: 226 QAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIF 285
G+C+ A K +I+ Y++V AL+KAVS PV++GI F++Y+ GI
Sbjct: 177 TGFAGSCN-ANKNKVVEITGYKDVTKDSADALMKAVSKTPVTVGICGSDQNFQNYRSGIL 235
Query: 286 NGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD--EGLCGIGTQS 343
+G C DHAV ++G+G TE G YW+IKNSWG +WG+ G+M+I ++ EG+CG+ QS
Sbjct: 236 SGHCSNSRDHAVLVIGYG-TEGGMPYWIIKNSWGTSWGEDGFMRIKKEDGEGMCGMNGQS 294
Query: 344 SYP 346
SYP
Sbjct: 295 SYP 297
>gi|30141023|dbj|BAC75925.1| cysteine protease-3 [Helianthus annuus]
Length = 348
Score = 289 bits (740), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 145/336 (43%), Positives = 209/336 (62%), Gaps = 18/336 (5%)
Query: 21 IIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEK 80
+ I +V+C +T ++S+ +++E+W +QH S + EK+ RF +FK N+ +I +
Sbjct: 15 LFIGVVNCIDFTEKDLAT-DKSLWDLYERWGSQHMVSRAPD-EKKKRFNVFKYNVNHINR 72
Query: 81 ANKEGNRTYKLGTNRFSDLTNDEFRALYTG----YKMPSPSHRSTTSSTFKYQNLSMTDV 136
N+ G + YKL N F+D+TN EF+A + ++M R T + + TD
Sbjct: 73 VNQLG-KPYKLKLNEFADMTNHEFKAGFDSKILHFRMLKGKRRQTP-----FTHAKTTDP 126
Query: 137 PTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGN 196
P S+DWR AV PIK+Q CG CWAFS + VEGI KI L+ LSEQ+LVDC T+
Sbjct: 127 PPSIDWRTNGAVNPIKNQGRCGSCWAFSTIVGVEGINKIKTNQLVSLSEQELVDCETDCE 186
Query: 197 NGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQK-AAAAKISNYEEVPSGDEQ 255
GC GG ME +E+I + G+ TE YPY A G C +++ + KI +E VP+ DE
Sbjct: 187 -GCNGGLMENGYEFIKETGGVTTEQIYPYFARNGRCDISKRNSPVVKIDGFENVPANDES 245
Query: 256 ALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIK 315
A+L+AV+ QPVSI I A F+ Y +G+FNG CGT+L+H V IVG+GTT+DG NYW+++
Sbjct: 246 AMLRAVANQPVSIAIDAGGLNFQFYSQGVFNGACGTELNHGVAIVGYGTTQDGTNYWIVR 305
Query: 316 NSWGDTWGDAGYMKILR----DEGLCGIGTQSSYPL 347
NSWG WG+ GY+++ R EGLCG+ +SYP+
Sbjct: 306 NSWGTGWGEQGYVRMQRGVNVPEGLCGLAMDASYPI 341
>gi|110737959|dbj|BAF00916.1| cysteine proteinase [Arabidopsis thaliana]
Length = 376
Score = 289 bits (740), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 146/322 (45%), Positives = 207/322 (64%), Gaps = 16/322 (4%)
Query: 40 EQSVVEMHEKWMAQHGRSYKDEL----EKEMRFKIFKENLEYIEKANKEG-NRTYKLGTN 94
++ V ++ +W A+HG++ + +++ RF IFK+NL +I+ N+ N TYKLG
Sbjct: 42 DEEVRSIYLQWSAEHGKTNNNNNGIINDQDKRFNIFKDNLRFIDLHNENNKNATYKLGLT 101
Query: 95 RFSDLTNDEFRALYTGYKMPSPSHRSTTSSTF--KYQN-LSMTDVPTSLDWRDKKAVTPI 151
+F+DLTNDE+R LY G + P+ R + KY ++ +VP ++DWR K AV PI
Sbjct: 102 KFTDLTNDEYRKLYLGART-EPARRIAKAKNVNQKYSAAVNGKEVPETVDWRQKGAVNPI 160
Query: 152 KDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYI 211
KDQ CG CWAFS AAVEGI KI LI LSEQ+LVDC + N GC GG M+ AF++I
Sbjct: 161 KDQGTCGSCWAFSTTAAVEGINKIVTGELISLSEQELVDCDKSYNQGCNGGLMDYAFQFI 220
Query: 212 IQNQGIATEDEYPYQAVQGTCSAAQK-AAAAKISNYEEVPSGDEQALLKAVSMQPVSIGI 270
++N G+ TE +YPY+ G C++ K + I YE+VP+ DE AL KA+S QPV + I
Sbjct: 221 MKNGGLNTEKDYPYRGFGGKCNSFLKNSRVVSIDGYEDVPTKDETALKKAISYQPVRVAI 280
Query: 271 AAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKI 330
A F+ Y+ GIF G CGT LDHAV VG+G +E+G +YW+++NSWG WG+ GY+++
Sbjct: 281 EAGGRIFQHYQSGIFTGSCGTNLDHAVVAVGYG-SENGVDYWIVRNSWGPRWGEEGYIRM 339
Query: 331 LRD-----EGLCGIGTQSSYPL 347
R+ G CGI ++SYP+
Sbjct: 340 ERNLAASKSGKCGIAVEASYPV 361
>gi|297603535|ref|NP_001054211.2| Os04g0670200 [Oryza sativa Japonica Group]
gi|109939735|sp|P25777.2|ORYB_ORYSJ RecName: Full=Oryzain beta chain; Flags: Precursor
gi|32488398|emb|CAE02823.1| OSJNBa0043A12.28 [Oryza sativa Japonica Group]
gi|90399163|emb|CAJ86092.1| H0818H01.14 [Oryza sativa Indica Group]
gi|125550169|gb|EAY95991.1| hypothetical protein OsI_17862 [Oryza sativa Indica Group]
gi|215766596|dbj|BAG98700.1| unnamed protein product [Oryza sativa Japonica Group]
gi|255675868|dbj|BAF16125.2| Os04g0670200 [Oryza sativa Japonica Group]
Length = 466
Score = 289 bits (740), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 141/310 (45%), Positives = 210/310 (67%), Gaps = 15/310 (4%)
Query: 47 HEKWMAQHGRSYKDEL--EKEMRFKIFKENLEYIEKANKEGNRT--YKLGTNRFSDLTND 102
++ W+A++G + L E E RF +F +NL++++ N + ++LG NRF+DLTN+
Sbjct: 52 YDLWLAENGGGSPNALGGEHERRFLVFWDNLKFVDAHNARADERGGFRLGMNRFADLTNE 111
Query: 103 EFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWA 162
EFRA + G K+ + RS + +Y++ + ++P S+DWR+K AV P+K+Q +CG CWA
Sbjct: 112 EFRATFLGAKV---AERSRAAGE-RYRHDGVEELPESVDWREKGAVAPVKNQGQCGSCWA 167
Query: 163 FSAVAAVEGITKISGANLIQLSEQQLVDCSTNG-NNGCGGGTMEKAFEYIIQNQGIATED 221
FSAV+ VE I ++ +I LSEQ+LV+CSTNG N+GC GG M+ AF++II+N GI TED
Sbjct: 168 FSAVSTVESINQLVTGEMITLSEQELVECSTNGQNSGCNGGLMDDAFDFIIKNGGIDTED 227
Query: 222 EYPYQAVQGTCSA-AQKAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSY 280
+YPY+AV G C + A I +E+VP DE++L KAV+ QPVS+ I A EF+ Y
Sbjct: 228 DYPYKAVDGKCDINRENAKVVSIDGFEDVPQNDEKSLQKAVAHQPVSVAIEAGGREFQLY 287
Query: 281 KEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD----EGL 336
G+F+G CGT LDH V VG+G T++G +YW+++NSWG WG++GY+++ R+ G
Sbjct: 288 HSGVFSGRCGTSLDHGVVAVGYG-TDNGKDYWIVRNSWGPKWGESGYVRMERNINVTTGK 346
Query: 337 CGIGTQSSYP 346
CGI +SYP
Sbjct: 347 CGIAMMASYP 356
>gi|224065647|ref|XP_002301901.1| predicted protein [Populus trichocarpa]
gi|222843627|gb|EEE81174.1| predicted protein [Populus trichocarpa]
Length = 336
Score = 289 bits (740), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 145/314 (46%), Positives = 205/314 (65%), Gaps = 10/314 (3%)
Query: 38 THEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFS 97
T ++++ E W+++HG+ Y+ EK +RF+IFK+NL +I++ NK+ Y LG N FS
Sbjct: 24 TSGDKIIDLFESWISKHGKIYESIEEKWLRFEIFKDNLFHIDETNKK-VVNYWLGLNEFS 82
Query: 98 DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQEC 157
DL+++EF+ Y G K+ S R S F Y+++ +P S+DWR K AVT +K+Q C
Sbjct: 83 DLSHEEFKNKYLGLKV-DMSERRECSQEFNYKDV--MSIPKSVDWRKKGAVTDVKNQGSC 139
Query: 158 GCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGI 217
G CWAFS VAAVEGI +I NL LSEQ+LVDC T N GC GG M+ AF YII N G+
Sbjct: 140 GSCWAFSTVAAVEGINQIVTGNLTSLSEQELVDCDTTNNYGCNGGLMDYAFSYIISNGGL 199
Query: 218 ATEDEYPYQAVQGTCSA-AQKAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTE 276
E +YPY +GTC +++ IS Y +VP E++LLKA++ QP+S+ I A +
Sbjct: 200 HKEVDYPYIMEEGTCEMRKEESEVVTISGYHDVPQNSEESLLKALANQPLSVAIEASGRD 259
Query: 277 FKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD--- 333
F+ Y G+F+G CGTQLDH V VG+G+T +G +Y ++KNSWG WG+ GY+++ R+
Sbjct: 260 FQFYSGGVFDGHCGTQLDHGVAAVGYGST-NGLDYIIVKNSWGSKWGEKGYIRMKRNTGK 318
Query: 334 -EGLCGIGTQSSYP 346
GLCGI +SYP
Sbjct: 319 PAGLCGINKMASYP 332
>gi|37780049|gb|AAP32197.1| cysteine protease 10 [Trifolium repens]
Length = 272
Score = 289 bits (740), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 141/271 (52%), Positives = 191/271 (70%), Gaps = 13/271 (4%)
Query: 86 NRTYKLGTNRFSDLTNDEFRALYTGYK--MPSPSHRSTTSSTFKYQNLSMTDVPTSLDWR 143
N+ YKLG N+F+DLTN+EF+A +K M S R+TT FKY+N S +P+++DWR
Sbjct: 7 NKLYKLGINKFADLTNEEFKASRNKFKGHMCSSIIRTTT---FKYENASA--IPSTVDWR 61
Query: 144 DKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNG-NNGCGGG 202
K AVTP+K+Q +CG CWAFSAVAA EGI ++S L+ LSEQ+L+DC T G + GC GG
Sbjct: 62 KKGAVTPVKNQGQCGSCWAFSAVAATEGIHQLSTGKLVSLSEQELIDCDTKGVDQGCEGG 121
Query: 203 TMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAA-AAKISNYEEVPSGDEQALLKAV 261
M+ AF++IIQN G++TE +YPY+ V GTC+ + + A I+ YE+VP+ +E AL KAV
Sbjct: 122 LMDDAFKFIIQNHGLSTEVQYPYEGVDGTCNTNEASIHAVTITGYEDVPANNELALQKAV 181
Query: 262 SMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDT 321
+ QP+S+ I A ++F+ Y G+F G CGT+LDH VT VG+G DG YWL+KNSWG
Sbjct: 182 ANQPISVAIDASGSDFQFYNSGVFTGSCGTELDHGVTAVGYGVGNDGTKYWLVKNSWGAD 241
Query: 322 WGDAGYMKILRD----EGLCGIGTQSSYPLA 348
WG+ GY+++ R EGLCGI Q+SYP A
Sbjct: 242 WGEEGYIRMQRGIDAAEGLCGIAMQASYPTA 272
>gi|3377948|emb|CAA08860.1| cysteine proteinase precursor, AN8 [Ananas comosus]
Length = 356
Score = 289 bits (739), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 143/334 (42%), Positives = 204/334 (61%), Gaps = 11/334 (3%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEY 77
+F+ + L V AS +S +++ E+WM ++GR YKD EK RF+IFK N+ +
Sbjct: 8 VFLFLFLCVMWASPSAASADEPSDPMMKRFEEWMVEYGRVYKDNDEKMRRFQIFKNNVNH 67
Query: 78 IEKANKEGNRTYKLGTNRFSDLTNDEFRALYT-GYKMPSPSHRSTTSSTFKYQNLSMTDV 136
IE N +Y LG N+F+D+TN+EF A YT G P R S + ++ ++ V
Sbjct: 68 IETFNSRNKDSYTLGINQFTDMTNNEFVAQYTGGISRPLNIEREPVVS---FDDVDISAV 124
Query: 137 PTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGN 196
P S+DWRD AVT +K+Q CG CWAF+A+A VE I KI L LSEQQ++DC+
Sbjct: 125 PQSIDWRDYGAVTSVKNQNPCGACWAFAAIATVESIYKIKKGILEPLSEQQVLDCAK--G 182
Query: 197 NGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQA 256
GC GG +AFE+II N+G+A+ YPY+A +GTC +A I+ Y VP +E +
Sbjct: 183 YGCKGGWEFRAFEFIISNKGVASVAIYPYKAAKGTCKTNGVPNSAYITGYARVPRNNESS 242
Query: 257 LLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKN 316
++ AVS QP+++ + A + Y G+FNG CGT L+HAVT +G+G +G YW++KN
Sbjct: 243 MMYAVSKQPITVAVDANANS-QYYNSGVFNGPCGTSLNHAVTAIGYGQDSNGKKYWIVKN 301
Query: 317 SWGDTWGDAGYMKILRD----EGLCGIGTQSSYP 346
SWG WG+AGY+++ RD G+CGI S YP
Sbjct: 302 SWGARWGEAGYIRMARDVSSSSGICGIAIDSLYP 335
>gi|218181|dbj|BAA14402.1| oryzain alpha precursor [Oryza sativa Japonica Group]
Length = 458
Score = 289 bits (739), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 146/324 (45%), Positives = 200/324 (61%), Gaps = 12/324 (3%)
Query: 32 VVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKAN---KEGNRT 88
+VS E+ ++ +W A+HG+SY E+E R+ F++NL YI++ N G +
Sbjct: 25 IVSYGERSEEEARRLYAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDEHNAAADAGVHS 84
Query: 89 YKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAV 148
++LG NRF+DLTN+E+R Y G + R + N ++ P S+DWR K AV
Sbjct: 85 FRLGLNRFADLTNEEYRDTYLGLRNKPRRERKVSDRYLAADNEAL---PESVDWRTKGAV 141
Query: 149 TPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAF 208
IKDQ CG CWAFSA+AAVE I +I +LI LSEQ+LVDC T+ N GC GG M+ AF
Sbjct: 142 AEIKDQGGCGSCWAFSAIAAVEDINQIVTGDLISLSEQELVDCDTSYNEGCNGGLMDYAF 201
Query: 209 EYIIQNQGIATEDEYPYQAVQGTCSAAQK-AAAAKISNYEEVPSGDEQALLKAVSMQPVS 267
++II N GI TED+YPY+ C +K A I +YE+V E +L KAV QPVS
Sbjct: 202 DFIINNGGIDTEDDYPYKGKDERCDVNRKNAKVVTIDSYEDVTPNSETSLQKAVRNQPVS 261
Query: 268 IGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGY 327
+ I A F+ Y GIF G CGT LDH V VG+G TE+G +YW+++NSWG +WG++GY
Sbjct: 262 VAIEAGGRAFQLYSSGIFTGKCGTALDHGVAAVGYG-TENGKDYWIVRNSWGKSWGESGY 320
Query: 328 MKILRD----EGLCGIGTQSSYPL 347
+++ R+ G CGI + SYPL
Sbjct: 321 VRMERNIKASSGKCGIAVEPSYPL 344
>gi|356517188|ref|XP_003527271.1| PREDICTED: xylem cysteine proteinase 2-like [Glycine max]
Length = 350
Score = 289 bits (739), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 140/309 (45%), Positives = 205/309 (66%), Gaps = 11/309 (3%)
Query: 43 VVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTND 102
++E+ E WM++HG+ Y++ EK +RF+IFK+NL++I++ NK + Y LG N F+DL++
Sbjct: 44 LIELFESWMSRHGKIYENIEEKLLRFEIFKDNLKHIDERNKVVS-NYWLGLNEFADLSHR 102
Query: 103 EFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWA 162
EF Y G K+ S R + F Y+++ ++P S+DWR K AV P+K+Q CG CWA
Sbjct: 103 EFNNKYLGLKVDY-SRRRESPEEFTYKDV---ELPKSVDWRKKGAVAPVKNQGSCGSCWA 158
Query: 163 FSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDE 222
FS VAAVEGI +I NL LSEQ+L+DC NNGC GG M+ AF +I++N G+ E++
Sbjct: 159 FSTVAAVEGINQIVTGNLTSLSEQELIDCDRTYNNGCNGGLMDYAFSFIVENGGLHKEED 218
Query: 223 YPYQAVQGTCS-AAQKAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYK 281
YPY +GTC ++ IS Y +VP +EQ+LLKA++ QP+S+ I A +F+ Y
Sbjct: 219 YPYIMEEGTCEMTKEETQVVTISGYHDVPQNNEQSLLKALANQPLSVAIEASGRDFQFYS 278
Query: 282 EGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD----EGLC 337
G+F+G CG+ LDH V VG+GT + G +Y +KNSWG WG+ GY+++ R+ EG+C
Sbjct: 279 GGVFDGHCGSDLDHGVAAVGYGTAK-GVDYITVKNSWGSKWGEKGYIRMRRNIGKPEGIC 337
Query: 338 GIGTQSSYP 346
GI +SYP
Sbjct: 338 GIYKMASYP 346
>gi|3688528|emb|CAA06243.1| pre-pro-TPE4A protein [Pisum sativum]
Length = 360
Score = 289 bits (739), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 151/318 (47%), Positives = 205/318 (64%), Gaps = 17/318 (5%)
Query: 40 EQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDL 99
E+S+ +++E+W + H + + EK RF +FK N+ ++ NK ++ YKL N+F+D+
Sbjct: 33 EKSLWDLYERWRSHHTVT-RSLDEKHNRFNVFKANVMHVHNTNKL-DKPYKLKLNKFADM 90
Query: 100 TNDEFRALYTGYKMPSPSHR-----STTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQ 154
TN EFR +Y K+ HR S + TF Y+N+ +VP+S+DWR K AVT +KDQ
Sbjct: 91 TNYEFRRIYADSKVSH--HRMFRGMSNENGTFMYENVK--NVPSSIDWRKKGAVTDVKDQ 146
Query: 155 QECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQN 214
+CG CWAFS + AVEGI +I L+ LSEQ+LVDC T GN GC GG ME AFE+I QN
Sbjct: 147 GQCGSCWAFSTIVAVEGINQIKTQKLVSLSEQELVDCDTGGNEGCNGGLMEYAFEFIKQN 206
Query: 215 QGIATEDEYPYQAVQGTCSAAQK-AAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAY 273
GI TE YPY A GTC ++ A I YE VP +E ALLKA + QPVS+ I A
Sbjct: 207 -GITTESNYPYAAKDGTCDLKKEDKAEVSIDGYENVPINNEAALLKAAAKQPVSVAIDAG 265
Query: 274 TTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILR- 332
F+ Y EG+F+G CGT L+H V +VG+G T+D YW++KNSWG WG+ GY+++ R
Sbjct: 266 GYNFQFYSEGVFSGHCGTDLNHGVAVVGYGVTQDRTKYWIVKNSWGSEWGEQGYIRMQRG 325
Query: 333 ---DEGLCGIGTQSSYPL 347
EGLCGI ++SYP+
Sbjct: 326 ISHKEGLCGIAMEASYPI 343
>gi|224102377|ref|XP_002312656.1| predicted protein [Populus trichocarpa]
gi|222852476|gb|EEE90023.1| predicted protein [Populus trichocarpa]
Length = 358
Score = 289 bits (739), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 147/337 (43%), Positives = 209/337 (62%), Gaps = 16/337 (4%)
Query: 20 IIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIE 79
++++ ++ + E+ + +++E+W + H S + EK+ RF +FKENL++I
Sbjct: 13 VVLVFRLADSFDYTEEDLASEERLRDLYERWRSHHTVS-RSLAEKQERFNVFKENLKHIH 71
Query: 80 KANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPS----PSHRSTTSSTFKYQNLSMTD 135
K N + +R YKL N F+D+TN EF Y G K+ R T S + +
Sbjct: 72 KVNHK-DRPYKLKLNSFADMTNHEFLQHYGGSKVSHYRVLRGQRQGTGSMHE----DTSK 126
Query: 136 VPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNG 195
+P+S+DWR AVT IKDQ +CG CWAFS VAAVEGI KI LI LSEQ+LVDC ++
Sbjct: 127 LPSSVDWRKNGAVTGIKDQGKCGSCWAFSTVAAVEGINKIKTGELISLSEQELVDCDSD- 185
Query: 196 NNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTC-SAAQKAAAAKISNYEEVPSGDE 254
N+GC GG ME AF +I Q G+ +E+ YPY+A + C S + I YE VP DE
Sbjct: 186 NHGCNGGLMEDAFNFIKQIGGLTSENTYPYRAKEEPCDSNKMNSPVVNIDGYEMVPENDE 245
Query: 255 QALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLI 314
AL+KAV+ QPV+I + A + + Y E IF G CGT+L+H V +VG+GTT+DG YW++
Sbjct: 246 NALMKAVANQPVAIAMDAGGKDLQFYSEAIFTGDCGTELNHGVALVGYGTTQDGTKYWIV 305
Query: 315 KNSWGDTWGDAGYMKILR----DEGLCGIGTQSSYPL 347
KNSWG WG+ GY+++ R +EGLCGI ++SYP+
Sbjct: 306 KNSWGTDWGEKGYIRMQRGIDAEEGLCGITMEASYPV 342
>gi|302790570|ref|XP_002977052.1| hypothetical protein SELMODRAFT_268054 [Selaginella moellendorffii]
gi|300155028|gb|EFJ21661.1| hypothetical protein SELMODRAFT_268054 [Selaginella moellendorffii]
Length = 300
Score = 288 bits (738), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 143/304 (47%), Positives = 196/304 (64%), Gaps = 10/304 (3%)
Query: 46 MHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFR 105
M E W A+HG+SY + EK R IF + L YIEK N N T+ LG N+FSDLTN EFR
Sbjct: 1 MFEGWAAKHGKSYSSDWEKARRLMIFSDTLAYIEKHNALPNTTFTLGLNKFSDLTNAEFR 60
Query: 106 ALYTG-YKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFS 164
A Y G +K P R K ++ ++ +PTSLDWR + AVTPIKDQ +CG CWAFS
Sbjct: 61 ANYVGKFKPPRYQDRRPA----KDVDVDVSSLPTSLDWRQEGAVTPIKDQGQCGSCWAFS 116
Query: 165 AVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYP 224
A+A++E ++ L+ LSEQQL+DC T + GC GG E AF+++++N G+ TE+ YP
Sbjct: 117 AIASIESAHFLATKELVSLSEQQLIDCDTV-DQGCQGGFPEDAFKFVVENGGVTTEEAYP 175
Query: 225 YQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGI 284
Y G+C+ A K +I+ Y++V AL+KAVS PV++GI F++Y+ GI
Sbjct: 176 YTGFAGSCN-ANKNKVVEITGYKDVTKDSADALMKAVSKTPVTVGICGSDQNFQNYRSGI 234
Query: 285 FNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD--EGLCGIGTQ 342
+G C DHAV ++G+G TE G YW+IKNSWG +WG+ G+M+I + EG+CG+ Q
Sbjct: 235 LSGHCSNSRDHAVLVIGYG-TEGGMPYWIIKNSWGTSWGEDGFMRIKKKDGEGMCGMNGQ 293
Query: 343 SSYP 346
SSYP
Sbjct: 294 SSYP 297
>gi|413938554|gb|AFW73105.1| hypothetical protein ZEAMMB73_931917 [Zea mays]
Length = 361
Score = 288 bits (738), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 144/349 (41%), Positives = 219/349 (62%), Gaps = 19/349 (5%)
Query: 16 IPMFIIIILLVSCASQ------VVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFK 69
+ +F++ + +C++ VV + W +HG+ Y EK R++
Sbjct: 7 VAVFVLFLAFAACSANHHRDPSVVGYSQEDLALPSSLFRSWSVKHGKLYASPTEKLERYE 66
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSP---SHRSTTSSTF 126
IFK+NL +I + N++ N +Y LG N+F+D+ ++EF+A Y G K P + ++ T + F
Sbjct: 67 IFKQNLMHIAETNRK-NGSYWLGLNQFADVAHEEFKASYLGLKRALPRAGAPQTRTPTAF 125
Query: 127 KYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQ 186
+Y + +P S+DWR K AVTP+K+Q +CG CWAFS+VAAVEGI +I L+ LSEQ
Sbjct: 126 RYAAAAAGSLPWSVDWRYKGAVTPVKNQGKCGSCWAFSSVAAVEGINQIVTGKLVSLSEQ 185
Query: 187 QLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAA----K 242
+LVDC T ++GC GGTM+ AF Y++ +QGI ED+YPY +G C Q
Sbjct: 186 ELVDCDTTLDHGCEGGTMDLAFAYMMGSQGIHAEDDYPYLMEEGYCKEKQPCVLGITEQD 245
Query: 243 ISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGF 302
++ +E+VP E +LLKA++ QPVS+GIAA + +F+ Y+ G+F+G C +LDHA+T VG+
Sbjct: 246 LTGFEDVPENSEISLLKALAHQPVSVGIAAGSRDFQFYRGGVFDGACSVELDHALTAVGY 305
Query: 303 GTTEDGANYWLIKNSWGDTWGDAGYMKIL----RDEGLCGIGTQSSYPL 347
G++ G NY +KNSWG WG+ GY++I + EG+CGI T +SYP+
Sbjct: 306 GSSY-GQNYITMKNSWGKNWGEQGYVRIKMGTGKPEGVCGIYTMASYPV 353
>gi|2224812|emb|CAB09699.1| cysteine endopeptidase EP-A [Hordeum vulgare subsp. vulgare]
Length = 365
Score = 288 bits (738), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 144/317 (45%), Positives = 197/317 (62%), Gaps = 14/317 (4%)
Query: 40 EQSVVEMHEKWMAQHGRSYKD--ELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFS 97
E+S+ ++E+W + + S + +E RF +FK+N Y+ + NK + ++L N+F+
Sbjct: 34 EESLRGLYERWRSHYTVSRRGLGADAEERRFNVFKQNARYVHEGNKR-DMPFRLALNKFA 92
Query: 98 DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD---VPTSLDWRDKKAVTPIKDQ 154
D+T DEFR Y G ++ H S + D +P ++DWR K AVT IKDQ
Sbjct: 93 DMTTDEFRRTYAGSRVRH--HLSLSGGRRGDGGFRYGDADNLPPAVDWRQKGAVTAIKDQ 150
Query: 155 QECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQN 214
+CG CWAFS + AVEGI KI L+ LSEQ+L+DC N GC GG M+ AF++I +N
Sbjct: 151 GQCGSCWAFSTIVAVEGINKIRTGKLVSLSEQELMDCDNVNNQGCDGGLMDYAFQFIQKN 210
Query: 215 QGIATEDEYPYQAVQGTCSAAQK-AAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAY 273
GI TE YPYQ QG+C A++ A A I YE+VP+ DE AL KAV+ QPVS+ I A
Sbjct: 211 -GITTESNYPYQGEQGSCDQAKENAQAVTIDGYEDVPANDESALQKAVAGQPVSVAIDAS 269
Query: 274 TTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD 333
+F+ Y EG+F G C T LDH V VG+G T DG YW++KNSWG+ WG+ GY+++ R
Sbjct: 270 GQDFQFYSEGVFTGECSTDLDHGVAAVGYGATRDGTKYWIVKNSWGEDWGEKGYIRMQRG 329
Query: 334 ----EGLCGIGTQSSYP 346
EGLCGI Q+SYP
Sbjct: 330 VSQTEGLCGIAMQASYP 346
>gi|356508487|ref|XP_003522988.1| PREDICTED: xylem cysteine proteinase 2-like [Glycine max]
Length = 349
Score = 288 bits (737), Expect = 3e-75, Method: Compositional matrix adjust.
Identities = 141/309 (45%), Positives = 203/309 (65%), Gaps = 11/309 (3%)
Query: 43 VVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTND 102
++E+ E WM++HG+ Y+ EK RF IFK+NL++I++ NK + Y LG N F+DL++
Sbjct: 43 LIELFESWMSRHGKIYQSIEEKLHRFDIFKDNLKHIDERNKVVS-NYWLGLNEFADLSHQ 101
Query: 103 EFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWA 162
EF+ Y G K+ S R + F Y++ ++P S+DWR K AVT +K+Q CG CWA
Sbjct: 102 EFKNKYLGLKVDY-SRRRESPEEFTYKDF---ELPKSVDWRKKGAVTQVKNQGSCGSCWA 157
Query: 163 FSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDE 222
FS VAAVEGI +I NL LSEQ+L+DC NNGC GG M+ AF +I++N G+ E++
Sbjct: 158 FSTVAAVEGINQIVTGNLTSLSEQELIDCDRTYNNGCNGGLMDYAFSFIVENGGLHKEED 217
Query: 223 YPYQAVQGTCS-AAQKAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYK 281
YPY +GTC ++ IS Y +VP +EQ+LLKA+ QP+S+ I A +F+ Y
Sbjct: 218 YPYIMEEGTCEMTKEETEVVTISGYHDVPQNNEQSLLKALVNQPLSVAIEASGRDFQFYS 277
Query: 282 EGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD----EGLC 337
G+F+G CG+ LDH V VG+GT++ G NY ++KNSWG WG+ GY+++ R+ EG+C
Sbjct: 278 GGVFDGHCGSDLDHGVAAVGYGTSK-GVNYIIVKNSWGSKWGEKGYIRMRRNIGKPEGIC 336
Query: 338 GIGTQSSYP 346
GI +SYP
Sbjct: 337 GIYKMASYP 345
>gi|449447027|ref|XP_004141271.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
Length = 458
Score = 288 bits (736), Expect = 3e-75, Method: Compositional matrix adjust.
Identities = 153/345 (44%), Positives = 219/345 (63%), Gaps = 17/345 (4%)
Query: 10 SFKINTIPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKD-ELEKEMRF 68
S I + F+ I L + S ++ R+ E V+ ++++W A+HG+ + + E E RF
Sbjct: 6 SSPIMALLFFLFIALSAASPSSIIPQRTDDE--VMALYDQWRAKHGKLHNNLGAEPENRF 63
Query: 69 KIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKY 128
IFK+NL++I++ N + N Y+LG N F+DLTN+E+R+ Y G K S S R+ TS+ +Y
Sbjct: 64 HIFKDNLKFIDEINAQ-NLPYRLGLNVFADLTNEEYRSRYLGGKFASGSRRNRTSN--RY 120
Query: 129 QNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQL 188
D+P S+DWR K AV P+KDQ CG CWAFS VA+VE I +I +LI LSEQ+L
Sbjct: 121 LPRLGDDLPDSIDWRAKGAVAPVKDQGSCGSCWAFSTVASVEAINQIVTGDLIALSEQEL 180
Query: 189 VDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEE 248
VDC + N GC GG M+ AFE+II+N G+ TE++YPY +C +K A I YE+
Sbjct: 181 VDCDRSYNEGCNGGLMDYAFEFIIENGGLDTEEDYPYYGFDSSCIQYKKNA---IDGYED 237
Query: 249 VPSGDEQALLKA---VSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
VP +E+AL KA + VS+ I F+ Y+ GIF G CGT LDH V +VG+G +
Sbjct: 238 VPVNNEKALQKAVSKQVVSVVSVAIEGGGRSFQLYQSGIFTGRCGTDLDHGVNVVGYG-S 296
Query: 306 EDGANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYP 346
E G +YW+++NSWG +WG++GY+K+ R+ GLCGI + SYP
Sbjct: 297 EGGVDYWIVRNSWGGSWGESGYVKMQRNIASPTGLCGIAMEPSYP 341
>gi|357115272|ref|XP_003559414.1| PREDICTED: thiol protease SEN102-like [Brachypodium distachyon]
Length = 360
Score = 288 bits (736), Expect = 3e-75, Method: Compositional matrix adjust.
Identities = 147/324 (45%), Positives = 202/324 (62%), Gaps = 19/324 (5%)
Query: 42 SVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNR-------TYKLGTN 94
++ HE WMA+HGR+Y D EK R +IF+ N E I+ N + + +++L TN
Sbjct: 38 AMASRHESWMAEHGRTYADAEEKARRLEIFRANAERIDSFNSKADAAAGESVDSHRLATN 97
Query: 95 RFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM-TDVPTSLDWRDKKAVTPIKD 153
RF+DLT++EFRA TG + P+ F+Y+N S+ D S+DWR AVT +KD
Sbjct: 98 RFADLTDEEFRAARTGLRRPAAVA-GAVGGGFRYENFSLQADAAGSMDWRAMGAVTGVKD 156
Query: 154 QQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNN-GCGGGTMEKAFEYII 212
Q CGCCWAFSAVAA+EG+TKI L+ LSEQQLVDC G++ GC GG M+ AF+YI
Sbjct: 157 QGSCGCCWAFSAVAAMEGLTKIRTGRLVSLSEQQLVDCDVYGDDQGCEGGLMDNAFQYIS 216
Query: 213 QNQGIATEDEYPYQAVQ-GTCSAAQKAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIA 271
+ G+A+E YPY G+C + + AA I +E+VP+ +E AL+ AV+ QPVS+ I
Sbjct: 217 RQGGLASESAYPYSGEDGGSCRSGRAQPAASIRGHEDVPANNEGALMAAVAHQPVSVAIN 276
Query: 272 AYTTEFKSYKE----GIFNGVC-GTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAG 326
F+ Y NG C T+LDHA+T VG+G DG YWL+KNSWG WG++G
Sbjct: 277 GGDYVFRFYDRGVLGAGGNGGCESTELDHAITAVGYGMAGDGTGYWLMKNSWGSGWGESG 336
Query: 327 YMKIL---RDEGLCGIGTQSSYPL 347
Y++I R EG+CG+ +SYP+
Sbjct: 337 YVRIRRGSRGEGVCGLAKLASYPV 360
>gi|4100157|gb|AAD10337.1| cysteine proteinase precursor [Hordeum vulgare]
Length = 365
Score = 287 bits (735), Expect = 4e-75, Method: Compositional matrix adjust.
Identities = 144/317 (45%), Positives = 196/317 (61%), Gaps = 14/317 (4%)
Query: 40 EQSVVEMHEKWMAQHGRSYKD--ELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFS 97
E+S+ ++E+W + + S + E RF +FK+N Y+ + NK + ++L N+F+
Sbjct: 34 EESLRGLYERWRSHYTVSRRGLGADAGERRFNVFKQNARYVHEGNKR-DMPFRLALNKFA 92
Query: 98 DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD---VPTSLDWRDKKAVTPIKDQ 154
D+T DEFR Y G ++ H S + D +P ++DWR K AVT IKDQ
Sbjct: 93 DMTTDEFRRTYAGSRVRH--HLSLSGGRRGDGGFRYGDADNLPPAVDWRQKGAVTAIKDQ 150
Query: 155 QECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQN 214
+CG CWAFS + AVEGI KI L+ LSEQ+L+DC N GC GG M+ AF++I +N
Sbjct: 151 GQCGSCWAFSTIVAVEGINKIRTGKLVSLSEQELMDCDNVNNQGCDGGLMDYAFQFIQKN 210
Query: 215 QGIATEDEYPYQAVQGTCSAAQK-AAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAY 273
GI TE YPYQ QG+C A++ A A I YE+VP+ DE AL KAV+ QPVS+ I A
Sbjct: 211 -GITTESNYPYQGEQGSCDQAKENAQAVTIDGYEDVPANDESALQKAVAGQPVSVAIDAS 269
Query: 274 TTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD 333
+F+ Y EG+F G C T LDH V VG+G T DG YW++KNSWG+ WG+ GY+++ R
Sbjct: 270 GQDFQFYSEGVFTGECSTDLDHGVAAVGYGATRDGTKYWIVKNSWGEDWGEKGYIRMQRG 329
Query: 334 ----EGLCGIGTQSSYP 346
EGLCGI Q+SYP
Sbjct: 330 VSQTEGLCGIAMQASYP 346
>gi|2463588|dbj|BAA22546.1| FB1035 precursor [Ananas comosus]
Length = 324
Score = 287 bits (735), Expect = 4e-75, Method: Compositional matrix adjust.
Identities = 135/308 (43%), Positives = 199/308 (64%), Gaps = 10/308 (3%)
Query: 43 VVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTND 102
+++ E+WMA++GR YKD EK RF+IFK N+++IE N +Y LG N+F+D+T
Sbjct: 6 MMKRFEEWMAEYGRIYKDNDEKMRRFQIFKNNVKHIETFNSRNGNSYTLGINQFTDMTKS 65
Query: 103 EFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWA 162
EF A YTG +P R S + +++++ VP S+DWRD AV +K+Q CG CWA
Sbjct: 66 EFVAQYTGVSLPLNIEREPVVS---FDDVNISAVPQSIDWRDYGAVNEVKNQNPCGSCWA 122
Query: 163 FSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDE 222
F+A+A VEGI KI L+ LSEQ+++DC+ + GC GG + KA+++II N G+ TE+
Sbjct: 123 FAAIATVEGIYKIKTGYLVSLSEQEVLDCAVS--YGCKGGWVNKAYDFIISNNGVTTEEN 180
Query: 223 YPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKE 282
YPYQA QGTC+A +A I+ Y V DE++++ AVS QP++ I A + F+ Y
Sbjct: 181 YPYQAYQGTCNANSFPNSAYITGYSYVRRNDERSMMYAVSNQPIAALIDA-SENFQYYNG 239
Query: 283 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILR----DEGLCG 338
G+F+G CGT L+HA+TI+G+G G YW+++NSWG +WG+ GY+++ R G CG
Sbjct: 240 GVFSGPCGTSLNHAITIIGYGQDSSGTKYWIVRNSWGSSWGEGGYVRMARGVSSSSGACG 299
Query: 339 IGTQSSYP 346
I +P
Sbjct: 300 IAMSPLFP 307
>gi|218183|dbj|BAA14403.1| oryzain beta precursor [Oryza sativa Japonica Group]
Length = 471
Score = 287 bits (735), Expect = 4e-75, Method: Compositional matrix adjust.
Identities = 141/310 (45%), Positives = 209/310 (67%), Gaps = 15/310 (4%)
Query: 47 HEKWMAQHGRSYKDEL--EKEMRFKIFKENLEYIEKANKEGNRT--YKLGTNRFSDLTND 102
++ W+A++G + L E E RF +F +NL++++ N + ++LG NRF+DLTN+
Sbjct: 51 YDLWLAENGGGSPNALGGEHERRFLVFWDNLKFVDAHNARADEGGGFRLGMNRFADLTNE 110
Query: 103 EFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWA 162
EFRA + G K+ + RS + +Y++ + ++P S+DWR+K AV P+K+Q +CG CWA
Sbjct: 111 EFRATFLGAKV---AERSRAAGE-RYRHDGVEELPESVDWREKGAVAPVKNQGQCGSCWA 166
Query: 163 FSAVAAVEGITKISGANLIQLSEQQLVDCSTNG-NNGCGGGTMEKAFEYIIQNQGIATED 221
FSAV+ VE I ++ +I LSEQ+LV+CSTNG N+GC GG M AF++II+N GI TED
Sbjct: 167 FSAVSTVESINQLVTGEMITLSEQELVECSTNGQNSGCNGGLMADAFDFIIKNGGIDTED 226
Query: 222 EYPYQAVQGTCSA-AQKAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSY 280
+YPY+AV G C + A I +E+VP DE++L KAV+ QPVS+ I A EF+ Y
Sbjct: 227 DYPYKAVDGKCDINRENAKVVSIDGFEDVPQNDEKSLQKAVAHQPVSVAIEAGGREFQLY 286
Query: 281 KEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD----EGL 336
G+F+G CGT LDH V VG+G T++G +YW+++NSWG WG++GY+++ R+ G
Sbjct: 287 HSGVFSGRCGTSLDHGVVAVGYG-TDNGKDYWIVRNSWGPKWGESGYVRMERNINVTTGK 345
Query: 337 CGIGTQSSYP 346
CGI +SYP
Sbjct: 346 CGIAMMASYP 355
>gi|357446979|ref|XP_003593765.1| Cysteine proteinase [Medicago truncatula]
gi|355482813|gb|AES64016.1| Cysteine proteinase [Medicago truncatula]
Length = 364
Score = 287 bits (735), Expect = 5e-75, Method: Compositional matrix adjust.
Identities = 157/292 (53%), Positives = 197/292 (67%), Gaps = 11/292 (3%)
Query: 61 ELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRS 120
ELEK R +IFK NLEYIE N GN++YKLG N++SDLT+DEF A +TG K+ S
Sbjct: 78 ELEK--RKRIFKNNLEYIENFNNAGNKSYKLGLNQYSDLTSDEFLASHTGLKVSKQLSSS 135
Query: 121 TTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANL 180
S NL+ DVPT+ DWR + AVT +KDQ CGCCWAFS VAAVEG KI+ L
Sbjct: 136 KMRSAAVPFNLN-DDVPTNFDWRQQGAVTDVKDQGSCGCCWAFSVVAAVEGAVKINTGEL 194
Query: 181 IQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAA-QKAA 239
I LSEQQLVDC N+GC GG M+ AF+YIIQ +GI +E +YPYQ TC Q
Sbjct: 195 ISLSEQQLVDCDER-NSGCHGGNMDSAFKYIIQ-KGIVSEADYPYQEGSQTCQLNDQMKF 252
Query: 240 AAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTI 299
A+I+N+ +VP+ DEQ LL+AV+ QPVS+GI EF+ Y +++G CG ++HAVT
Sbjct: 253 EAQITNFIDVPANDEQQLLQAVAQQPVSVGIEV-GDEFQHYMGDVYSGTCGQSMNHAVTA 311
Query: 300 VGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRDE----GLCGIGTQSSYPL 347
VG+G +EDG YWLIKNSWG WG+ GYMK+LR+ G CGI +SYP+
Sbjct: 312 VGYGVSEDGTKYWLIKNSWGKGWGEEGYMKLLRESGEPGGQCGIAAHASYPI 363
>gi|30685308|ref|NP_566634.2| putative cysteine proteinase [Arabidopsis thaliana]
gi|30315949|sp|Q9LT77.1|CPR1_ARATH RecName: Full=Probable cysteine proteinase At3g19400; Flags:
Precursor
gi|11994462|dbj|BAB02464.1| cysteine proteinase [Arabidopsis thaliana]
gi|332642715|gb|AEE76236.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 362
Score = 287 bits (735), Expect = 5e-75, Method: Compositional matrix adjust.
Identities = 145/316 (45%), Positives = 202/316 (63%), Gaps = 12/316 (3%)
Query: 39 HEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSD 98
+E V M+E+W+ ++ ++Y EKE RFKIFK+NL+++++ N +RT+++G RF+D
Sbjct: 36 NETEVRLMYEQWLVENRKNYNGLGEKERRFKIFKDNLKFVDEHNSVPDRTFEVGLTRFAD 95
Query: 99 LTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECG 158
LTN+EFRA+Y KM + S + + Y+ + +P +DWR AV +KDQ CG
Sbjct: 96 LTNEEFRAIYLRKKMER-TKDSVKTERYLYKEGDV--LPDEVDWRANGAVVSVKDQGNCG 152
Query: 159 CCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNG-NNGCGGGTMEKAFEYIIQNQGI 217
CWAFSAV AVEGI +I+ LI LSEQ+LVDC N GC GG M AFE+I++N GI
Sbjct: 153 SCWAFSAVGAVEGINQITTGELISLSEQELVDCDRGFVNAGCDGGIMNYAFEFIMKNGGI 212
Query: 218 ATEDEYPYQAVQ-GTCSAAQ--KAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYT 274
T+ +YPY A G C+A + I YE+VP DE++L KAV+ QPVS+ I A +
Sbjct: 213 ETDQDYPYNANDLGLCNADKNNNTRVVTIDGYEDVPRDDEKSLKKAVAHQPVSVAIEASS 272
Query: 275 TEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD- 333
F+ YK G+ G CG LDH V +VG+G+T G +YW+I+NSWG WGD+GY+K+ R+
Sbjct: 273 QAFQLYKSGVMTGTCGISLDHGVVVVGYGSTS-GEDYWIIRNSWGLNWGDSGYVKLQRNI 331
Query: 334 ---EGLCGIGTQSSYP 346
G CGI SYP
Sbjct: 332 DDPFGKCGIAMMPSYP 347
>gi|30141025|dbj|BAC75926.1| cysteine protease-4 [Helianthus annuus]
Length = 352
Score = 287 bits (734), Expect = 5e-75, Method: Compositional matrix adjust.
Identities = 140/349 (40%), Positives = 214/349 (61%), Gaps = 18/349 (5%)
Query: 11 FKINTIPMFIIIILLVSCASQV--------VSSRSTHEQSVVEMHEKWMAQHGRSYKDEL 62
F +F++ + +++C++ T V+ + E W+A+H + Y+
Sbjct: 5 FSSKKTSLFLVFVSVLACSALANEFSILGYAPEDLTSIHKVIHLFESWLAKHSKIYESLD 64
Query: 63 EKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTT 122
EK RF+IF +NL++I+ NK+ + Y LG N F+DLT++EF+ + G K P + +
Sbjct: 65 EKLHRFEIFMDNLKHIDDTNKKVS-NYWLGLNEFADLTHEEFKNKFLGLKGELPERKDES 123
Query: 123 SSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQ 182
F Y++ D+P S+DWR K AV P+K+Q +CG CWAFS VAAVEGI +I NL
Sbjct: 124 IEEFSYRDF--VDLPKSVDWRKKGAVAPVKNQGQCGSCWAFSTVAAVEGINQIVTGNLTM 181
Query: 183 LSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAA-AA 241
LSEQ+L+DC T NNGC GG M+ AF Y++++ G+ E+EYPY +GTC + +
Sbjct: 182 LSEQELIDCDTTFNNGCNGGLMDYAFAYVMRS-GLHKEEEYPYIMSEGTCDEKKDVSETV 240
Query: 242 KISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVG 301
IS Y +VP +E + LKA++ QP+S+ I A +F+ Y G+F+G CGT+LDH V VG
Sbjct: 241 TISGYHDVPRNNEDSFLKALANQPISVAIEASGRDFQFYSGGVFDGHCGTELDHGVAAVG 300
Query: 302 FGTTEDGANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYP 346
+GTT+ G +Y +++NSWG WG+ GY+++ R G+CG+ +SYP
Sbjct: 301 YGTTK-GLDYVIVRNSWGPKWGEKGYIRMKRKTGKPHGMCGLYMMASYP 348
>gi|26452046|dbj|BAC43113.1| putative cysteine proteinase RD21A precursor [Arabidopsis thaliana]
Length = 362
Score = 287 bits (734), Expect = 5e-75, Method: Compositional matrix adjust.
Identities = 145/316 (45%), Positives = 202/316 (63%), Gaps = 12/316 (3%)
Query: 39 HEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSD 98
+E V M+E+W+ ++ ++Y EKE RFKIFK+NL+++++ N +RT+++G RF+D
Sbjct: 36 NETEVRLMYEQWLVENRKNYNGLGEKERRFKIFKDNLKFVDEHNSVPDRTFEVGLTRFAD 95
Query: 99 LTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECG 158
LTN+EFRA+Y KM + S + + Y+ + +P +DWR AV +KDQ CG
Sbjct: 96 LTNEEFRAIYLRKKMER-NKDSVKTERYLYKEGDV--LPDEVDWRANGAVVSVKDQGNCG 152
Query: 159 CCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNG-NNGCGGGTMEKAFEYIIQNQGI 217
CWAFSAV AVEGI +I+ LI LSEQ+LVDC N GC GG M AFE+I++N GI
Sbjct: 153 SCWAFSAVGAVEGINQITTGELISLSEQELVDCDRGFVNAGCDGGIMNYAFEFIMKNGGI 212
Query: 218 ATEDEYPYQAVQ-GTCSAAQ--KAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYT 274
T+ +YPY A G C+A + I YE+VP DE++L KAV+ QPVS+ I A +
Sbjct: 213 ETDQDYPYNANDLGLCNADKNNNTRVVTIDGYEDVPRDDEKSLKKAVAHQPVSVAIEASS 272
Query: 275 TEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD- 333
F+ YK G+ G CG LDH V +VG+G+T G +YW+I+NSWG WGD+GY+K+ R+
Sbjct: 273 QAFQLYKSGVMTGTCGISLDHGVVVVGYGSTS-GEDYWIIRNSWGLNWGDSGYVKLQRNI 331
Query: 334 ---EGLCGIGTQSSYP 346
G CGI SYP
Sbjct: 332 DDPFGKCGIAMMPSYP 347
>gi|156142226|gb|ABU51882.1| ervatamin-C precursor [Tabernaemontana divaricata]
Length = 365
Score = 287 bits (734), Expect = 6e-75, Method: Compositional matrix adjust.
Identities = 154/345 (44%), Positives = 215/345 (62%), Gaps = 26/345 (7%)
Query: 18 MFIIIILLVSCASQVVSSRSTHE------------QSVVEMHEKWMAQHGRSYKDELEKE 65
+FII ILL + ST E + V E++E W+A+H + Y +E E
Sbjct: 4 LFIISILLFLASFSYAMDISTIEYKYDKSSAWRTDEEVKEIYELWLAKHDKVYSGLVEYE 63
Query: 66 MRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHR--STTS 123
RF+IFK+NL++I++ N E N TYK+G ++DLTN+EF+A+Y G + + HR T +
Sbjct: 64 KRFEIFKDNLKFIDEHNSE-NHTYKMGLTPYTDLTNEEFQAIYLGTRSDT-IHRLKRTIN 121
Query: 124 STFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQL 183
+ +Y + ++P +DWR K AVTP+K+Q +CG CWAFS V+ VE I +I NLI L
Sbjct: 122 ISERYAYEAGDNLPEQIDWRKKGAVTPVKNQGKCGSCWAFSTVSTVESINQIRTGNLISL 181
Query: 184 SEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKI 243
SEQQLVDC+ N+GC GG A++YII N GI TE YPY+AVQG C AA+K +I
Sbjct: 182 SEQQLVDCNKK-NHGCKGGAFVYAYQYIIDNGGIDTEANYPYKAVQGPCRAAKK--VVRI 238
Query: 244 SNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFG 303
Y+ VP +E AL KAV+ QP + I A + +F+ YK GIF+G CGT+L+H V IVG+
Sbjct: 239 DGYKGVPHCNENALKKAVASQPSVVAIDASSKQFQHYKSGIFSGPCGTKLNHGVVIVGYW 298
Query: 304 TTEDGANYWLIKNSWGDTWGDAGYMKILR--DEGLCGIGTQSSYP 346
+YW+++NSWG WG+ GY+++ R GLCGI YP
Sbjct: 299 K-----DYWIVRNSWGRYWGEQGYIRMKRVGGCGLCGIARLPYYP 338
>gi|168006315|ref|XP_001755855.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162693174|gb|EDQ79528.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 454
Score = 287 bits (734), Expect = 7e-75, Method: Compositional matrix adjust.
Identities = 143/317 (45%), Positives = 203/317 (64%), Gaps = 14/317 (4%)
Query: 39 HEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSD 98
+E+ + E W +HG+ Y E R+ ++K+NLEYI++ + E NR+Y LG +F+D
Sbjct: 38 NERLLSEQFGAWAHKHGKVYSSLEEHAHRYMVWKDNLEYIQR-HSEKNRSYWLGLTKFAD 96
Query: 99 LTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECG 158
+TNDEFR YTG ++ S RS + F+Y + ++ P S+DWR K AVT +KDQ CG
Sbjct: 97 ITNDEFRRQYTGTRIDR-SKRSKRKTGFRYAD---SEAPESVDWRKKGAVTTVKDQGSCG 152
Query: 159 CCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIA 218
CWAFSA+ +VEGI I + LSEQ+LVDC N GC GG M+ AF++I++N GI
Sbjct: 153 SCWAFSAIGSVEGINAIRTGEAVSLSEQELVDCDLEYNQGCNGGLMDYAFDFILENGGID 212
Query: 219 TEDEYPYQAVQGTCSAAQK-AAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEF 277
TE++YPY+ + G C +K A I YE+VP DE+AL KAV+ QPVS+ I A +F
Sbjct: 213 TENDYPYKGLDGRCDNNKKNAHVVTIDGYEDVPENDEEALKKAVAGQPVSVAIEAGGRDF 272
Query: 278 KSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD---- 333
+ Y G+F G CGT LDH V VG+G +E +YW++KNSWG+ WG++GY+++ R+
Sbjct: 273 QLYSGGVFTGECGTDLDHGVLAVGYG-SEGSLDYWIVKNSWGEYWGESGYLRMQRNIKDS 331
Query: 334 ---EGLCGIGTQSSYPL 347
GLCGI + SY +
Sbjct: 332 NHQFGLCGINIEPSYAV 348
>gi|356517184|ref|XP_003527269.1| PREDICTED: xylem cysteine proteinase 2-like [Glycine max]
Length = 350
Score = 286 bits (733), Expect = 7e-75, Method: Compositional matrix adjust.
Identities = 140/309 (45%), Positives = 205/309 (66%), Gaps = 11/309 (3%)
Query: 43 VVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTND 102
++E+ E W+++HG+ Y+ EK RF+IFK+NL++I++ NK + Y LG N F+DL++
Sbjct: 44 LIELFESWISRHGKIYQSIEEKLHRFEIFKDNLKHIDERNKVVS-NYWLGLNEFADLSHQ 102
Query: 103 EFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWA 162
EF+ Y G K+ S R + F Y+++ ++P S+DWR K AVT +K+Q CG CWA
Sbjct: 103 EFKNKYLGLKVDY-SRRRESPEEFTYKDV---ELPKSVDWRKKGAVTQVKNQGSCGSCWA 158
Query: 163 FSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDE 222
FS VAAVEGI +I NL LSEQ+L+DC NNGC GG M+ AF +I++N G+ E++
Sbjct: 159 FSTVAAVEGINQIVTGNLTSLSEQELIDCDRTYNNGCNGGLMDYAFSFIVENDGLHKEED 218
Query: 223 YPYQAVQGTCS-AAQKAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYK 281
YPY +GTC A ++ IS Y +VP +EQ+LLKA++ QP+S+ I A +F+ Y
Sbjct: 219 YPYIMEEGTCEMAKEETEVVTISGYHDVPQNNEQSLLKALANQPLSVAIEASGRDFQFYS 278
Query: 282 EGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD----EGLC 337
G+F+G CG+ LDH V VG+GT + G +Y +KNSWG WG+ GY+++ R+ EG+C
Sbjct: 279 GGVFDGHCGSDLDHGVAAVGYGTAK-GVDYITVKNSWGSKWGEKGYIRMRRNIGKPEGIC 337
Query: 338 GIGTQSSYP 346
GI +SYP
Sbjct: 338 GIYKMASYP 346
>gi|118127|sp|P25251.1|CYSP4_BRANA RecName: Full=Cysteine proteinase COT44; Flags: Precursor
Length = 328
Score = 286 bits (733), Expect = 7e-75, Method: Compositional matrix adjust.
Identities = 144/317 (45%), Positives = 204/317 (64%), Gaps = 15/317 (4%)
Query: 44 VEMHEKWMAQHGRSYKDEL----EKEMRFKIFKENLEYIEKANKEG-NRTYKLGTNRFSD 98
+ ++ +W +HG+S + +++ RF IFK+NL +I+ N+ N TYKLG F++
Sbjct: 1 MSIYLRWSLEHGKSNSNSNGIINQQDERFNIFKDNLRFIDLHNENNKNATYKLGLTIFAN 60
Query: 99 LTNDEFRALYTGYKMPSPSHRSTTSST--FKYQN-LSMTDVPTSLDWRDKKAVTPIKDQQ 155
LTNDE+R+LY G + P R T + KY +++ +VP ++DWR K AV IKDQ
Sbjct: 61 LTNDEYRSLYLGART-EPVRRITKAKNVNMKYSAAVNVDEVPVTVDWRQKGAVNAIKDQG 119
Query: 156 ECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQ 215
CG CWAFS AAVEGI KI L+ LSEQ+LVDC + N GC GG M+ AF++I++N
Sbjct: 120 TCGSCWAFSTAAAVEGINKIVTGELVSLSEQELVDCDKSYNQGCNGGLMDYAFQFIMKNG 179
Query: 216 GIATEDEYPYQAVQGTCSAAQK-AAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYT 274
G+ TE +YPY G C++ K + I YE+VPS DE AL +AVS QPVS+ I A
Sbjct: 180 GLNTEKDYPYHGTNGKCNSLLKNSRVVTIDGYEDVPSKDETALKRAVSYQPVSVAIDAGG 239
Query: 275 TEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD- 333
F+ Y+ GIF G CGT +DHAV VG+G +E+G +YW+++NSWG WG+ GY+++ R+
Sbjct: 240 RAFQHYQSGIFTGKCGTNMDHAVVAVGYG-SENGVDYWIVRNSWGTRWGEDGYIRMERNV 298
Query: 334 ---EGLCGIGTQSSYPL 347
G CGI ++SYP+
Sbjct: 299 ASKSGKCGIAIEASYPV 315
>gi|22759715|dbj|BAC10906.1| cysteine proteinase [Zinnia elegans]
Length = 352
Score = 286 bits (733), Expect = 8e-75, Method: Compositional matrix adjust.
Identities = 145/356 (40%), Positives = 218/356 (61%), Gaps = 18/356 (5%)
Query: 1 MVLIFERSGSFKINTIPMFIIIILLVSCASQV-----VSSRSTHEQSVVEMHEKWMAQHG 55
M IF S K + + +F+ I+ + A + T V+ + E W+ +H
Sbjct: 1 MAFIFS---SKKTSLLFLFVSILACSALAHEFSILGYAPEDLTSIHKVIHLFESWLVKHS 57
Query: 56 RSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPS 115
+ Y+ EK RF+IF +NL++I++ NK+ + Y LG N F+DLT++EF+ + G+K
Sbjct: 58 KFYESLDEKLHRFEIFMDNLKHIDETNKKVS-NYWLGLNEFADLTHEEFKHKFLGFKGEL 116
Query: 116 PSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKI 175
+ +S F Y++ D+P S+DWR K AV P+K+Q +CG CWAFS VAAVEGI +I
Sbjct: 117 AERKDESSKEFGYRDF--VDLPKSVDWRKKGAVAPVKNQGQCGSCWAFSTVAAVEGINQI 174
Query: 176 SGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAA 235
NL LSEQ+L+DC T NNGC GG M+ AF Y++++ G+ E+EYPY +GTC
Sbjct: 175 VTGNLTMLSEQELIDCDTTFNNGCNGGLMDYAFAYVMRS-GLHKEEEYPYIMSEGTCDEK 233
Query: 236 QKAA-AAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLD 294
+ + IS Y +VP DE + LKA++ QP+S+ I A +F+ Y G+F+G CGT+LD
Sbjct: 234 KDVSEKVTISGYHDVPRNDEASFLKALANQPISVAIEASGRDFQFYSGGVFDGHCGTELD 293
Query: 295 HAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILR----DEGLCGIGTQSSYP 346
H V VG+GTT+ G +Y +++NSWG WG+ GY+++ R G+CG+ +SYP
Sbjct: 294 HGVAAVGYGTTK-GLDYVIVRNSWGPKWGEKGYIRMKRGSGKPHGMCGLYMMASYP 348
>gi|226506492|ref|NP_001140873.1| uncharacterized protein LOC100272949 precursor [Zea mays]
gi|194701540|gb|ACF84854.1| unknown [Zea mays]
Length = 379
Score = 286 bits (732), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 143/355 (40%), Positives = 206/355 (58%), Gaps = 23/355 (6%)
Query: 12 KINTIPMFIIIILLVSCASQVVSSRSTHEQSVV------EMHEKWMAQHGRSYKDELEKE 65
+++ + + ++ + S A ++ + E+ + +++E+W H R ++ EK
Sbjct: 3 QVSKTLLLVALVFVSSAAVELCRAIDFDERDLASDEALWDLYERWQTHH-RVHRHHGEKG 61
Query: 66 MRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKM------PSPSHR 119
RF FKEN+ +I NK G+R Y+L NRF D+ +EFR+ + ++ SP+ R
Sbjct: 62 RRFGTFKENVRFIHAHNKRGDRPYRLRLNRFGDMGREEFRSTFADSRINDLRRQDSPAAR 121
Query: 120 STTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGAN 179
+ F Y S D P S+DWR + AVT +K Q CG CWAFS V AVEGI I +
Sbjct: 122 AGAVPGFMYD--SAADPPRSVDWRQEGAVTGVKVQGHCGSCWAFSTVVAVEGINAIRTGS 179
Query: 180 LIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQ--- 236
L LSEQ+L+DC T+ NGC GG ME AFE+I GI TE YPY+A GTC +
Sbjct: 180 LASLSEQELIDCDTD-ENGCQGGLMENAFEFIKSFGGITTEAAYPYRASNGTCDGDRARR 238
Query: 237 -KAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDH 295
I ++ VP+G E AL KAV+ QPVS+ + A F+ Y EG+F G CGT LDH
Sbjct: 239 GGGVVVVIDGHQMVPAGSEDALAKAVAHQPVSVAVDAGGQAFQFYSEGVFTGDCGTDLDH 298
Query: 296 AVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILR---DEGLCGIGTQSSYPL 347
V VG+G +DG YW++KNSWG +WG+ GY+++ R + GLCGI ++S+P+
Sbjct: 299 GVAAVGYGVGDDGTPYWIVKNSWGTSWGEGGYIRMQRGAGNGGLCGIAMEASFPI 353
>gi|168058022|ref|XP_001781010.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162667567|gb|EDQ54194.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 457
Score = 286 bits (732), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 147/319 (46%), Positives = 203/319 (63%), Gaps = 17/319 (5%)
Query: 40 EQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDL 99
+Q + W +HG+ Y E+ RF ++K+NLEYI++ + E N +Y LG +F+DL
Sbjct: 38 DQLLAGQFAAWAHKHGKVYSAAEERAHRFLVWKDNLEYIQR-HSEKNLSYWLGLTKFADL 96
Query: 100 TNDEFRALYTGYKMPSPSH----RSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQ 155
TN+EFR YTG ++ R+ T S F+Y N ++ P S+DWR+K AVT +KDQ
Sbjct: 97 TNEEFRRQYTGTRIDRSRRLKKGRNATGS-FRYAN---SEAPKSIDWREKGAVTSVKDQG 152
Query: 156 ECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQ 215
CG CWAFSAV +VEGI I + I LS Q+LVDC N GC GG M+ AF+++IQN
Sbjct: 153 SCGSCWAFSAVGSVEGINAIRTGDAISLSVQELVDCDKKYNQGCNGGLMDYAFDFVIQNG 212
Query: 216 GIATEDEYPYQAVQGTCSAAQ-KAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYT 274
GI TE +YPYQ G C + A I +YE+VP DE+AL KAV+ QPVS+ I A
Sbjct: 213 GIDTEKDYPYQGYDGRCDVNKMNARVVTIDSYEDVPENDEEALKKAVAGQPVSVAIEAGG 272
Query: 275 TEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKI---L 331
+F+ Y G+F G CGT LDH V VG+G +E G +YW++KNSWG+ WG++GY+++ L
Sbjct: 273 RDFQLYSGGVFTGRCGTDLDHGVLAVGYG-SEKGLDYWIVKNSWGEYWGESGYLRMQRNL 331
Query: 332 RDE---GLCGIGTQSSYPL 347
+D+ GLCGI + SY +
Sbjct: 332 KDDNGYGLCGINIEPSYAV 350
>gi|115448287|ref|NP_001047923.1| Os02g0715000 [Oryza sativa Japonica Group]
gi|42408029|dbj|BAD09165.1| putative cysteine proteinase [Oryza sativa Japonica Group]
gi|113537454|dbj|BAF09837.1| Os02g0715000 [Oryza sativa Japonica Group]
gi|215737450|dbj|BAG96580.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215765786|dbj|BAG87483.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222623551|gb|EEE57683.1| hypothetical protein OsJ_08138 [Oryza sativa Japonica Group]
Length = 366
Score = 286 bits (732), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 153/353 (43%), Positives = 218/353 (61%), Gaps = 31/353 (8%)
Query: 16 IPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEK--------------WMAQHGRSYKDE 61
+ M +++ V+C++ + S H+ SVV ++ W +H + Y
Sbjct: 14 LSMLFLLLGFVACSA----TASHHDPSVVGYSQEDLALPNKLVGLFTSWSVKHSKIYASP 69
Query: 62 LEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRST 121
EK R++IFK NL +I + N+ N +Y LG N F+D+ ++EF+A Y G K P + R
Sbjct: 70 KEKVKRYEIFKRNLRHIVETNRR-NGSYWLGLNHFADIAHEEFKASYLGLK-PGLARRDA 127
Query: 122 T---SSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGA 178
S+TF+Y N ++P ++DWR K AVTP+K+Q ECG CWAFS VAAVEGI +I
Sbjct: 128 QPHGSTTFRYAN--AVNLPWAVDWRKKGAVTPVKNQGECGSCWAFSTVAAVEGINQIVTG 185
Query: 179 NLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQ-K 237
L+ LSEQ+L+DC N+GC GG M+ AF YI+ NQGI TE++YPY +G C Q
Sbjct: 186 KLVSLSEQELMDCDNTFNHGCRGGLMDFAFAYIMGNQGIYTEEDYPYLMEEGYCREKQPH 245
Query: 238 AAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAV 297
+ I+ YE+VP+ E +LLKA++ QPVS+GIAA + +F+ YK GIF+G CG Q DHA+
Sbjct: 246 SKVITITGYEDVPANSETSLLKALAHQPVSVGIAAGSRDFQFYKGGIFDGECGIQPDHAL 305
Query: 298 TIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILR----DEGLCGIGTQSSYP 346
T VG+G+ G +Y ++KNSWG WG+ GY +I R EG+C I +SYP
Sbjct: 306 TAVGYGSYY-GQDYIIMKNSWGKNWGEQGYFRIRRGTGKPEGVCDIYKIASYP 357
>gi|45738078|gb|AAS75836.1| fastuosain precursor [Bromelia fastuosa]
Length = 324
Score = 286 bits (731), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 135/308 (43%), Positives = 197/308 (63%), Gaps = 10/308 (3%)
Query: 43 VVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTND 102
++E E+WMA++GR Y D EK RF+IFK N+ +IE N +Y LG N+F+D+TN+
Sbjct: 6 MMERFEEWMAEYGRVYNDNAEKMRRFQIFKNNVNHIETFNNRSGNSYTLGVNQFTDMTNN 65
Query: 103 EFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWA 162
EF A YTG +P R S + ++ ++ VP S+DWRD AVT +K+Q CG CWA
Sbjct: 66 EFLARYTGASLPLNIERDPVVS---FDDVDISAVPQSIDWRDYGAVTSVKNQGSCGSCWA 122
Query: 163 FSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDE 222
FSA+A VEGI KI NLI LSEQ+++DC+ + GC GG + KA+++II N G+ +
Sbjct: 123 FSAIATVEGIYKIKAGNLISLSEQEVLDCALS--YGCDGGWVNKAYDFIISNNGVTSFAN 180
Query: 223 YPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKE 282
PY+ +G C+ A I+ Y V S +E++++ AV+ QP++ I A +F+ YK
Sbjct: 181 LPYKGYKGPCNHNDLPNKAYITGYTYVQSNNERSMMIAVANQPIAALIDA-GGDFQYYKS 239
Query: 283 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD----EGLCG 338
G+F G CGT L+HA+T++G+G T G YW++KNSWG +WG+ GY+++ RD GLCG
Sbjct: 240 GVFTGSCGTSLNHAITVIGYGQTSSGTKYWIVKNSWGTSWGERGYIRMARDVSSPYGLCG 299
Query: 339 IGTQSSYP 346
I +P
Sbjct: 300 IAMAPLFP 307
>gi|255032|gb|AAB23155.1| COT44=cysteine proteinase homolog [Brassica napus, seedling, rapid
cycling base population CrGC5, Peptide, 328 aa]
Length = 328
Score = 285 bits (730), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 144/317 (45%), Positives = 203/317 (64%), Gaps = 15/317 (4%)
Query: 44 VEMHEKWMAQHGRSYKDEL----EKEMRFKIFKENLEYIEKANKEG-NRTYKLGTNRFSD 98
+ ++ +W +HG+S + +++ RF IFK+NL +I+ N+ N TYKLG F++
Sbjct: 1 MSIYLRWSLEHGKSNSNSNGIINQQDERFNIFKDNLRFIDLHNENNKNATYKLGLTIFAN 60
Query: 99 LTNDEFRALYTGYKMPSPSHRSTTSST--FKYQN-LSMTDVPTSLDWRDKKAVTPIKDQQ 155
LTNDE+R+LY G + P R T + KY ++ +VP ++DWR K AV IKDQ
Sbjct: 61 LTNDEYRSLYLGART-EPVRRITKAKNVNMKYSAAVNDVEVPVTVDWRQKGAVNAIKDQG 119
Query: 156 ECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQ 215
CG CWAFS AAVEGI KI L+ LSEQ+LVDC + N GC GG M+ AF++I++N
Sbjct: 120 TCGSCWAFSTAAAVEGINKIVTGELVSLSEQELVDCDKSYNQGCNGGLMDYAFQFIMKNG 179
Query: 216 GIATEDEYPYQAVQGTCSAAQK-AAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYT 274
G+ TE +YPY G C++ K + I YE+VPS DE AL +AVS QPVS+ I A
Sbjct: 180 GLNTEKDYPYHGTNGKCNSLLKNSRVVTIDGYEDVPSKDETALKRAVSYQPVSVAIDAGG 239
Query: 275 TEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD- 333
F+ Y+ GIF G CGT +DHAV VG+G +E+G +YW+++NSWG WG+ GY+++ R+
Sbjct: 240 RAFQHYQSGIFTGKCGTNMDHAVVAVGYG-SENGVDYWIVRNSWGTRWGEDGYIRMERNV 298
Query: 334 ---EGLCGIGTQSSYPL 347
G CGI ++SYP+
Sbjct: 299 ASKSGKCGIAIEASYPV 315
>gi|125540888|gb|EAY87283.1| hypothetical protein OsI_08685 [Oryza sativa Indica Group]
Length = 357
Score = 285 bits (730), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 153/353 (43%), Positives = 217/353 (61%), Gaps = 31/353 (8%)
Query: 16 IPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEK--------------WMAQHGRSYKDE 61
+ M +++ V+C++ + S H+ SVV ++ W +H + Y
Sbjct: 5 LSMLFLLLGFVACSA----TASHHDPSVVGYSQEDLALPNKLVGLFTSWSVKHSKIYASP 60
Query: 62 LEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRST 121
EK R++IFK NL +I + N+ N +Y LG N F+D+ ++EF+A Y G K P + R
Sbjct: 61 KEKVKRYEIFKRNLRHIVETNRR-NGSYWLGLNHFADIAHEEFKASYLGLK-PGLARRDA 118
Query: 122 T---SSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGA 178
S+TF+Y N ++P ++DWR K AVTP+K+Q ECG CWAFS VAAVEGI +I
Sbjct: 119 QPHGSTTFRYAN--AVNLPWAVDWRKKGAVTPVKNQGECGSCWAFSTVAAVEGINQIVTG 176
Query: 179 NLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQ-K 237
L+ LSEQ+L+DC N+GC GG M+ AF YI+ NQGI TE++YPY +G C Q
Sbjct: 177 KLVSLSEQELMDCDNTFNHGCRGGLMDFAFAYIMGNQGIYTEEDYPYLMEEGYCREKQPH 236
Query: 238 AAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAV 297
+ I+ YE+VP E +LLKA++ QPVS+GIAA + +F+ YK GIF+G CG Q DHA+
Sbjct: 237 SKVITITGYEDVPENSETSLLKALAHQPVSVGIAAGSRDFQFYKGGIFDGECGIQPDHAL 296
Query: 298 TIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILR----DEGLCGIGTQSSYP 346
T VG+G+ G +Y ++KNSWG WG+ GY +I R EG+C I +SYP
Sbjct: 297 TAVGYGSYY-GQDYIIMKNSWGKNWGEQGYFRIRRGTGKPEGVCDIYKIASYP 348
>gi|357126406|ref|XP_003564878.1| PREDICTED: cysteine proteinase EP-B 1-like [Brachypodium
distachyon]
Length = 377
Score = 285 bits (729), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 145/328 (44%), Positives = 205/328 (62%), Gaps = 27/328 (8%)
Query: 40 EQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRT--------YKL 91
E+++ E++ +W + H + EK RF FK N+ +I N N T Y+L
Sbjct: 35 EEALWELYTRWQSAHRLPPQHHAEKHRRFGTFKSNVLFIHAHNTRLNDTSTNNNGPSYRL 94
Query: 92 GTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSST----FKYQNLSMTDVPTSLDWRDKKA 147
NRF D+ EFR+ + G P HR T + F Y ++ D+P ++DWR K A
Sbjct: 95 RLNRFGDMDQAEFRSTFAG-----PLHRHTRPAQSIPGFIYD--TVKDIPQAVDWRQKGA 147
Query: 148 VTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNG-NNGCGGGTMEK 206
VT +KDQ +CG CWAFSAVA+VEG+ I +L+ LSEQ+L+DC T G +NGC GG ME
Sbjct: 148 VTGVKDQGKCGSCWAFSAVASVEGLNAIRTGSLVSLSEQELIDCDTGGDDNGCQGGLMES 207
Query: 207 AFEYIIQNQ-GIATEDEYPYQAVQGTCSAAQKAA-AAKISNYEEVPSGDEQALLKAVSMQ 264
AFE+I + G+ATE YPY A GTC+A + ++ + +I ++ VP+G+E+AL KAV+ Q
Sbjct: 208 AFEFIAHSAGGLATEAAYPYHASNGTCNANRGSSVSVRIDGHQSVPAGNEEALAKAVAHQ 267
Query: 265 PVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTT-EDGANYWLIKNSWGDTWG 323
PVS+ I A F+ Y EG+F G CG++LDH V +VG+G EDG YW++KNSWG WG
Sbjct: 268 PVSVAIDAGGQAFQFYSEGVFTGDCGSELDHGVAVVGYGVAEEDGKEYWIVKNSWGPGWG 327
Query: 324 DAGYMKILRDE----GLCGIGTQSSYPL 347
+ GY+++ RD GLCGI ++SYP+
Sbjct: 328 EHGYVRMQRDSGVDGGLCGIAMEASYPV 355
>gi|121308860|dbj|BAF43527.1| cysteine proteinase [Zinnia elegans]
Length = 352
Score = 285 bits (729), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 145/356 (40%), Positives = 217/356 (60%), Gaps = 18/356 (5%)
Query: 1 MVLIFERSGSFKINTIPMFIIIILLVSCASQV-----VSSRSTHEQSVVEMHEKWMAQHG 55
M IF S K + + +F+ I+ A + T V+ + E W+ +H
Sbjct: 1 MAFIFS---SKKTSLLFLFVSILACSPLAHEFSILGYAPEDLTSIHKVIHLFESWLVKHS 57
Query: 56 RSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPS 115
+ Y+ EK RF+IF +NL++I++ NK+ + Y LG N F+DLT++EF+ + G+K
Sbjct: 58 KFYESLDEKLHRFEIFMDNLKHIDETNKKVS-NYWLGLNEFADLTHEEFKHKFLGFKGEL 116
Query: 116 PSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKI 175
+ +S F Y++ D+P S+DWR K AV P+K+Q +CG CWAFS VAAVEGI +I
Sbjct: 117 AERKDESSKEFGYRDF--VDLPKSVDWRKKGAVAPVKNQGQCGNCWAFSTVAAVEGINQI 174
Query: 176 SGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAA 235
NL LSEQ+L+DC T NNGC GG M+ AF Y++++ G+ E+EYPY +GTC
Sbjct: 175 VTGNLTMLSEQELIDCDTTFNNGCNGGLMDYAFAYVMRS-GLHKEEEYPYIMSEGTCDEK 233
Query: 236 QKAA-AAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLD 294
+ + IS Y +VP DE + LKA++ QP+S+ I A +F+ Y G+F+G CGT+LD
Sbjct: 234 KDVSEKVTISGYHDVPRNDEASFLKALANQPISVAIEASGRDFQFYSGGVFDGHCGTELD 293
Query: 295 HAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILR----DEGLCGIGTQSSYP 346
H V VG+GTT+ G +Y +++NSWG WG+ GY+++ R G+CG+ +SYP
Sbjct: 294 HGVAAVGYGTTK-GLDYVIVRNSWGPKWGEKGYIRMKRGSGKPHGMCGLYMMASYP 348
>gi|90399361|emb|CAJ86180.1| H0212B02.7 [Oryza sativa Indica Group]
Length = 470
Score = 285 bits (729), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 145/336 (43%), Positives = 201/336 (59%), Gaps = 24/336 (7%)
Query: 32 VVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKAN---KEGNRT 88
+VS E+ ++ +W A+HG++Y E+E R+ F++NL YI++ N G +
Sbjct: 25 IVSYGERSEEEARRLYAEWKAEHGKNYNAVGEEERRYAAFRDNLRYIDEHNAAADAGVHS 84
Query: 89 YKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAV 148
++LG NRF+DLTN+E+R Y G + R + N ++ P S+DWR K AV
Sbjct: 85 FRLGLNRFADLTNEEYRDTYLGLRNKPRRERKVSDRYLAADNEAL---PESVDWRTKGAV 141
Query: 149 TPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAF 208
IKDQ CG CWAFSA+AAVEGI +I +LI LSEQ+LVDC T+ N GC GG M+ AF
Sbjct: 142 AEIKDQGGCGSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDTSYNEGCNGGLMDYAF 201
Query: 209 EYIIQNQGIATEDEYPYQAVQGTCSA-------------AQKAAAAKISNYEEVPSGDEQ 255
++II N GI TED+YPY+ C + A I +YE+V E
Sbjct: 202 DFIINNGGIDTEDDYPYKGKDERCDVNRVSFVFFAPLVFQKNAKVVTIDSYEDVTPNSET 261
Query: 256 ALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIK 315
+L KAV+ QPVS+ I A F+ Y GIF G CGT LDH V VG+G TE+G +YW+++
Sbjct: 262 SLQKAVANQPVSVAIEAGGRAFQLYSSGIFTGKCGTALDHGVAAVGYG-TENGKDYWIVR 320
Query: 316 NSWGDTWGDAGYMKILRD----EGLCGIGTQSSYPL 347
NSWG +WG++GY+++ R+ G CGI + SYPL
Sbjct: 321 NSWGKSWGESGYVRMERNIKASSGKCGIAVEPSYPL 356
>gi|255646767|gb|ACU23856.1| unknown [Glycine max]
Length = 350
Score = 285 bits (728), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 138/309 (44%), Positives = 204/309 (66%), Gaps = 11/309 (3%)
Query: 43 VVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTND 102
++E+ E WM++HG+ Y++ EK +RF+IFK+NL++I++ NK + Y LG + F+DL++
Sbjct: 44 LIELFESWMSRHGKIYENIEEKLLRFEIFKDNLKHIDERNKVVS-NYWLGLSEFADLSHR 102
Query: 103 EFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWA 162
EF Y G K+ S R + F Y+++ ++P S+DWR K AV P+K+Q CG CWA
Sbjct: 103 EFNNKYLGLKVDY-SRRRESPEEFTYKDV---ELPKSVDWRKKGAVAPVKNQGSCGSCWA 158
Query: 163 FSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDE 222
FS VAAVEGI +I NL LSEQ+L+DC NNGC GG M+ AF +I++N G+ E++
Sbjct: 159 FSTVAAVEGINQIVTGNLTSLSEQELIDCDRTYNNGCNGGLMDYAFSFIVENGGLHKEED 218
Query: 223 YPYQAVQGTCS-AAQKAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYK 281
YPY +G C ++ IS Y +VP +EQ+LLKA++ QP+S+ I A +F+ Y
Sbjct: 219 YPYIMEEGACEMTKEETQVVTISGYHDVPQNNEQSLLKALANQPLSVAIEASGRDFQFYS 278
Query: 282 EGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD----EGLC 337
G+F+G CG+ LDH V VG+GT + G +Y +KNSWG WG+ GY+++ R+ EG+C
Sbjct: 279 GGVFDGHCGSDLDHGVAAVGYGTAK-GVDYITVKNSWGSKWGEKGYIRMRRNIGKPEGIC 337
Query: 338 GIGTQSSYP 346
GI +SYP
Sbjct: 338 GIYKMASYP 346
>gi|359359118|gb|AEV41024.1| putative oryzain beta chain precursor [Oryza minuta]
Length = 493
Score = 285 bits (728), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 143/340 (42%), Positives = 212/340 (62%), Gaps = 45/340 (13%)
Query: 47 HEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRT--YKLGTNRFSDLTNDEF 104
++ W+A++GRSY E+E RF++F +NL++++ N + ++LG NRF+DLTNDEF
Sbjct: 49 YDLWLAENGRSYNALGERERRFRVFWDNLKFVDAHNARADEHGGFRLGMNRFADLTNDEF 108
Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQEC------- 157
RA + G K S ++ +Y++ + ++P S+DWR+K AV P+K+Q +C
Sbjct: 109 RATFLGAKFVERSR----AAGERYRHDGVEELPESVDWREKGAVAPVKNQGQCVDRIIVW 164
Query: 158 -------------------------GCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCS 192
G CWAFSAV+ VE I ++ +I LSEQ+LV+CS
Sbjct: 165 NSMVRIYVVDAGCMLENPLMGLTVQGSCWAFSAVSTVESINQLVTGEMITLSEQELVECS 224
Query: 193 TNG-NNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQK-AAAAKISNYEEVP 250
TNG N+GC GG M+ AF++II+N GI TED+YPY+AV G C ++ A I +E+VP
Sbjct: 225 TNGQNSGCNGGLMDDAFDFIIKNGGIDTEDDYPYKAVDGKCDINRENAKVVSIDGFEDVP 284
Query: 251 SGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGAN 310
DE++L KAV+ QPVS+ I A EF+ Y G+F+G CGT LDH V VG+G T++G +
Sbjct: 285 QNDEKSLQKAVAHQPVSVAIEAGGREFQLYHSGVFSGRCGTSLDHGVVAVGYG-TDNGKD 343
Query: 311 YWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYP 346
YW+++NSWG WG++GY+++ R+ G CGI +SYP
Sbjct: 344 YWIVRNSWGPKWGESGYVRMERNINATTGKCGIAMMASYP 383
>gi|14600257|gb|AAK71314.1|AF388175_1 papain-like cysteine peptidase XBCP3 [Arabidopsis thaliana]
Length = 437
Score = 285 bits (728), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 141/311 (45%), Positives = 201/311 (64%), Gaps = 11/311 (3%)
Query: 43 VVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTND 102
+ E+ + W +HG++Y E E++ R +IFK+N +++ + N N TY L N F+DLT+
Sbjct: 28 ISELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADLTHH 87
Query: 103 EFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT-DVPTSLDWRDKKAVTPIKDQQECGCCW 161
EF+A G + +PS + K Q+L + VP S+DWR K AVT +KDQ CG CW
Sbjct: 88 EFKASRLGLSVSAPSVIMAS----KGQSLGGSVKVPDSVDWRKKGAVTNVKDQGSCGACW 143
Query: 162 AFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATED 221
+FSA A+EGI +I +LI LSEQ+L+DC + N GC GG M+ AFE++I+N GI TE
Sbjct: 144 SFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNHGIDTEK 203
Query: 222 EYPYQAVQGTCSAAQ-KAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSY 280
+YPYQ GTC + K I +Y V S DE+AL++AV+ QPVS+GI F+ Y
Sbjct: 204 DYPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALMEAVAAQPVSVGICGSERAFQLY 263
Query: 281 KEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD----EGL 336
GIF+G C T LDHAV IVG+G +++G +YW++KNSWG +WG G+M + R+ +G+
Sbjct: 264 SRGIFSGPCSTSLDHAVLIVGYG-SQNGVDYWIVKNSWGKSWGMDGFMHMQRNTENSDGV 322
Query: 337 CGIGTQSSYPL 347
CGI +SYP+
Sbjct: 323 CGINMLASYPI 333
>gi|18391078|ref|NP_563855.1| xylem bark cysteine peptidase 3 [Arabidopsis thaliana]
gi|110741821|dbj|BAE98853.1| papain-like cysteine peptidase XBCP3 [Arabidopsis thaliana]
gi|111074448|gb|ABH04597.1| At1g09850 [Arabidopsis thaliana]
gi|332190386|gb|AEE28507.1| xylem bark cysteine peptidase 3 [Arabidopsis thaliana]
Length = 437
Score = 285 bits (728), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 141/311 (45%), Positives = 201/311 (64%), Gaps = 11/311 (3%)
Query: 43 VVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTND 102
+ E+ + W +HG++Y E E++ R +IFK+N +++ + N N TY L N F+DLT+
Sbjct: 28 ISELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADLTHH 87
Query: 103 EFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT-DVPTSLDWRDKKAVTPIKDQQECGCCW 161
EF+A G + +PS + K Q+L + VP S+DWR K AVT +KDQ CG CW
Sbjct: 88 EFKASRLGLSVSAPSVIMAS----KGQSLGGSVKVPDSVDWRKKGAVTNVKDQGSCGACW 143
Query: 162 AFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATED 221
+FSA A+EGI +I +LI LSEQ+L+DC + N GC GG M+ AFE++I+N GI TE
Sbjct: 144 SFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNHGIDTEK 203
Query: 222 EYPYQAVQGTCSAAQ-KAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSY 280
+YPYQ GTC + K I +Y V S DE+AL++AV+ QPVS+GI F+ Y
Sbjct: 204 DYPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALMEAVAAQPVSVGICGSERAFQLY 263
Query: 281 KEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD----EGL 336
GIF+G C T LDHAV IVG+G +++G +YW++KNSWG +WG G+M + R+ +G+
Sbjct: 264 SSGIFSGPCSTSLDHAVLIVGYG-SQNGVDYWIVKNSWGKSWGMDGFMHMQRNTENSDGV 322
Query: 337 CGIGTQSSYPL 347
CGI +SYP+
Sbjct: 323 CGINMLASYPI 333
>gi|449469929|ref|XP_004152671.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
gi|449529596|ref|XP_004171784.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
Length = 431
Score = 284 bits (727), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 141/309 (45%), Positives = 197/309 (63%), Gaps = 10/309 (3%)
Query: 42 SVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTN 101
+V E+ E W +HG+SY EK R +F +N E++ N N +Y L N ++DLT+
Sbjct: 24 NVSELFEIWCTEHGKSYSSAEEKLYRLGVFADNYEFVTHHNNLDNSSYTLSLNSYADLTH 83
Query: 102 DEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCW 161
EF+ G+ SP+ R+ + +L DVP SLDWR K AVT +KDQ CG CW
Sbjct: 84 HEFKVSRLGF---SPALRNFRPVLPQEPSLPR-DVPDSLDWRKKGAVTAVKDQGSCGACW 139
Query: 162 AFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATED 221
+FSA A+EGI +I +LI LSEQ+L+DC + N+GCGGG M+ A++++I N GI TE+
Sbjct: 140 SFSATGAMEGINQIMTGSLISLSEQELIDCDRSYNSGCGGGLMDYAYQFVISNHGIDTEN 199
Query: 222 EYPYQAVQGTCSAAQ-KAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSY 280
+YPYQA G+C + + I Y ++PS DE LL+AV+ QPVS+GI F+ Y
Sbjct: 200 DYPYQARDGSCRKDKLQRNVVTIDGYADIPSNDEGKLLQAVAAQPVSVGICGSERAFQLY 259
Query: 281 KEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD----EGL 336
+GIF+G C T LDHAV IVG+G +E+G +YW++KNSWG +WG GYM + R+ EG+
Sbjct: 260 SKGIFSGPCSTSLDHAVLIVGYG-SENGVDYWIVKNSWGKSWGMDGYMHMQRNSGNSEGV 318
Query: 337 CGIGTQSSY 345
CGI +SY
Sbjct: 319 CGINKLASY 327
>gi|242066206|ref|XP_002454392.1| hypothetical protein SORBIDRAFT_04g029960 [Sorghum bicolor]
gi|241934223|gb|EES07368.1| hypothetical protein SORBIDRAFT_04g029960 [Sorghum bicolor]
Length = 356
Score = 284 bits (727), Expect = 4e-74, Method: Compositional matrix adjust.
Identities = 147/348 (42%), Positives = 222/348 (63%), Gaps = 20/348 (5%)
Query: 16 IPMFIIIILLVSCASQVVSSRSTHEQS---------VVEMHEKWMAQHGRSYKDELEKEM 66
+P+ ++ + +C++ S S +V + + W +H + Y EK
Sbjct: 5 LPVLVLFLAFAACSASHHRDPSVVGYSQEDLALPNRLVNLFKSWSVKHRKIYVSPKEKLK 64
Query: 67 RFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYK--MPSPSHRSTTSS 124
R+ IFK+NL +I + N++ N +Y LG N+F+D+T++EF+A + G K + ++ T +
Sbjct: 65 RYGIFKQNLMHIAETNRK-NGSYWLGLNQFADITHEEFKANHLGLKQGLSRMGAQTRTPT 123
Query: 125 TFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLS 184
TF+Y + ++P S+DWR K AVTP+K+Q +CG CWAFS+VAAVEGI +I L+ LS
Sbjct: 124 TFRYA--AAANLPWSVDWRYKGAVTPVKNQGKCGSCWAFSSVAAVEGINQIVTGKLVSLS 181
Query: 185 EQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQK-AAAAKI 243
EQ+L+DC T ++GC GG M+ AF YI+ +QGI ED+YPY +G C Q A I
Sbjct: 182 EQELMDCDTMLDHGCEGGLMDFAFAYIMGSQGIHAEDDYPYLMEEGYCKEKQPYANVVTI 241
Query: 244 SNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFG 303
+ YE+VP E +LLKA++ QPVS+GIAA + +F+ YK G+F+G C +LDHA+T VG+G
Sbjct: 242 TGYEDVPENSEISLLKALAHQPVSVGIAAGSRDFQFYKGGVFDGSCSDELDHALTAVGYG 301
Query: 304 TTEDGANYWLIKNSWGDTWGDAGYMKIL----RDEGLCGIGTQSSYPL 347
++ G NY +KNSWG WG+ GY++I + EG+CGI T +SYP+
Sbjct: 302 SSY-GQNYITMKNSWGKNWGEQGYVRIKMGTGKPEGVCGIYTMASYPV 348
>gi|302763109|ref|XP_002964976.1| hypothetical protein SELMODRAFT_83176 [Selaginella moellendorffii]
gi|302763113|ref|XP_002964978.1| hypothetical protein SELMODRAFT_83554 [Selaginella moellendorffii]
gi|300167209|gb|EFJ33814.1| hypothetical protein SELMODRAFT_83176 [Selaginella moellendorffii]
gi|300167211|gb|EFJ33816.1| hypothetical protein SELMODRAFT_83554 [Selaginella moellendorffii]
Length = 300
Score = 284 bits (727), Expect = 4e-74, Method: Compositional matrix adjust.
Identities = 139/303 (45%), Positives = 196/303 (64%), Gaps = 8/303 (2%)
Query: 46 MHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFR 105
M E W A+H +SY + EK R +F + L YIEK N + N T+ LG N+FSDLTN EFR
Sbjct: 1 MFEDWAAKHDKSYSSDWEKARRLMVFSDTLAYIEKHNAQPNTTFTLGLNKFSDLTNAEFR 60
Query: 106 ALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSA 165
A Y G K P ++ + K ++ ++ +PTSLDWR + AVTPIKDQ +CG CWAFSA
Sbjct: 61 ANYVG-KFKPPRYQDRRPA--KDVDVDVSSLPTSLDWRQEGAVTPIKDQGQCGSCWAFSA 117
Query: 166 VAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPY 225
+A++E ++ L+ LSEQQL+DC T + GC GG + AF+++++N G+ TE+ YPY
Sbjct: 118 IASIESAHFLATKELVSLSEQQLIDCDTV-DQGCQGGFPDDAFKFVVENGGVTTEEAYPY 176
Query: 226 QAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIF 285
G+C+ K +I+ Y++V AL+KAVS PV++GI F++Y+ GI
Sbjct: 177 TGFAGSCN-TNKNKVVEITGYKDVTKDSADALMKAVSKTPVTVGICGSDQNFQNYRSGIL 235
Query: 286 NGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD--EGLCGIGTQS 343
+G C DHAV ++G+G TE G YW+IKNSWG +WG+ G+MKI + EG+CG+ QS
Sbjct: 236 SGQCCNSRDHAVLVIGYG-TEGGMPYWIIKNSWGTSWGEDGFMKIKKKDGEGMCGMNGQS 294
Query: 344 SYP 346
SYP
Sbjct: 295 SYP 297
>gi|242032709|ref|XP_002463749.1| hypothetical protein SORBIDRAFT_01g005350 [Sorghum bicolor]
gi|241917603|gb|EER90747.1| hypothetical protein SORBIDRAFT_01g005350 [Sorghum bicolor]
Length = 381
Score = 284 bits (726), Expect = 5e-74, Method: Compositional matrix adjust.
Identities = 144/324 (44%), Positives = 200/324 (61%), Gaps = 22/324 (6%)
Query: 40 EQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNR-TYKLGTNRFSD 98
++++ +++E+W H R ++ EK RF FKEN+ +I NK G+R +Y+L NRF D
Sbjct: 39 DEALWDLYERWQTHH-RVHRHHGEKGRRFGTFKENVRFIHAHNKRGDRPSYRLRLNRFGD 97
Query: 99 LTNDEFRALYTG--------YKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTP 150
+ +EFR+ + Y+ SP+ +T F Y + TDVP S+DWR AVT
Sbjct: 98 MGPEEFRSTFADSRINDLRRYRESSPA--ATAVPGFMYDD--ATDVPRSVDWRQHGAVTA 153
Query: 151 IKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEY 210
+K+Q CG CWAFS V AVEGI I +L+ LSEQ+LVDC T NGC GG ME AF++
Sbjct: 154 VKNQGRCGSCWAFSTVVAVEGINAIRTGSLVSLSEQELVDCDT-AENGCQGGLMENAFDF 212
Query: 211 IIQNQGIATEDEYPYQAVQGTCS---AAQKAAAAKISNYEEVPSGDEQALLKAVSMQPVS 267
I GI TE YPY+A GTC A + I ++ VP+G E AL KAV+ QPVS
Sbjct: 213 IKSYGGITTESAYPYRASNGTCDGMRARRGRVHVSIDGHQMVPTGSEDALAKAVARQPVS 272
Query: 268 IGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTE-DGANYWLIKNSWGDTWGDAG 326
+ I A F+ Y EG+F G CGT LDH V +VG+G ++ DG YW++KNSWG +WG+ G
Sbjct: 273 VAIDAGGQAFQFYSEGVFTGDCGTDLDHGVAVVGYGVSDVDGTPYWIVKNSWGPSWGEGG 332
Query: 327 YMKILR---DEGLCGIGTQSSYPL 347
Y+++ R + GLCGI ++S+P+
Sbjct: 333 YIRMQRGAGNGGLCGIAMEASFPI 356
>gi|356515062|ref|XP_003526220.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 337
Score = 283 bits (725), Expect = 7e-74, Method: Compositional matrix adjust.
Identities = 148/339 (43%), Positives = 206/339 (60%), Gaps = 13/339 (3%)
Query: 16 IPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENL 75
I F++ + V A + S +H S HEKWMAQHG+ YKD EKE +IF+ N+
Sbjct: 6 ILKFLVAFIEVD-ACSLSESCCSHSLS----HEKWMAQHGKVYKDAAEKERCLQIFENNM 60
Query: 76 EYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD 135
E+IE + G++++ L TN+F+DL ++EF+AL T S +TT + F+Y N+ T
Sbjct: 61 EFIESFDVCGDKSFNLSTNQFADLHDEEFKALLTNGHKKEHSLWTTTETLFRYDNV--TK 118
Query: 136 VPTSLDWRDKKAVTPIKDQQECGCCWAFS-AVAAVEGITKISGANLIQLSEQQLVDCSTN 194
+P S+DWR + VTPIKDQ +C CWAFS VA +EG+ +I + L+ LSEQ+LVD
Sbjct: 119 IPASMDWRKRGVVTPIKDQGKCLSCWAFSLCVATIEGLHQIITSELVPLSEQELVDFVKG 178
Query: 195 GNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKA-AAAKISNYEEVPSGD 253
+ GC G +E AF++I + I +E YPY+ V TC ++ A+I Y++VPS
Sbjct: 179 ESEGCYGDYVEDAFKFITKKGRIESETHYPYKGVNNTCKVKKETHGVAQIKGYKKVPSKS 238
Query: 254 EQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWL 313
E ALLKAV+ Q VS+ + A + F+ Y GIF G CGT DH V + +G + DG YWL
Sbjct: 239 ENALLKAVANQLVSVSVEARDSAFQFYSSGIFTGKCGTDTDHRVALASYGESGDGTKYWL 298
Query: 314 IKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYPLA 348
KNSWG WG+ GY++I D EGLCGI YP+A
Sbjct: 299 AKNSWGTEWGEKGYIRIKXDIPAKEGLCGIAKYPYYPIA 337
>gi|297745594|emb|CBI40759.3| unnamed protein product [Vitis vinifera]
Length = 300
Score = 283 bits (724), Expect = 9e-74, Method: Compositional matrix adjust.
Identities = 139/301 (46%), Positives = 202/301 (67%), Gaps = 10/301 (3%)
Query: 51 MAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTG 110
M++HG+SY+ EK RF++F++NL++I++ NK+ + +Y LG N F+DL+++EF+ Y G
Sbjct: 1 MSKHGKSYRSFEEKLHRFEVFQDNLKHIDETNKKVS-SYWLGLNEFADLSHEEFKRKYLG 59
Query: 111 YKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVE 170
K+ P R + F Y++++ D+P S+DWR K AV +K+Q CG CWAFS VAAVE
Sbjct: 60 LKIELPKRRDSPEE-FSYKDVA--DLPKSVDWRKKGAVAHVKNQGACGSCWAFSTVAAVE 116
Query: 171 GITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQG 230
GI +I NL LSEQ+L+DC NNGC GG M+ AF +II N G+ E++YPY +G
Sbjct: 117 GINQIVTGNLTALSEQELIDCDKPFNNGCNGGLMDYAFAFIISNGGLRKEEDYPYVMEEG 176
Query: 231 TCS-AAQKAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVC 289
TC ++ IS Y +VP +EQ+ LKA++ QP+S+ I A + F+ Y GIFNG C
Sbjct: 177 TCGEKKEELEVVTISGYHDVPEDNEQSFLKALANQPLSVAIEASSRGFQFYSGGIFNGHC 236
Query: 290 GTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSY 345
GT+LDH V VG+GT++ G +Y +KNSWG WG+ GY+++ R+ EG+CGI +SY
Sbjct: 237 GTELDHGVAAVGYGTSK-GVDYITVKNSWGSKWGEKGYIRMKRNVGKPEGICGIYKMASY 295
Query: 346 P 346
P
Sbjct: 296 P 296
>gi|2351107|dbj|BAA21929.1| bromelain [Ananas comosus]
Length = 312
Score = 283 bits (723), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 136/299 (45%), Positives = 192/299 (64%), Gaps = 7/299 (2%)
Query: 51 MAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTG 110
MA++GR YKD EK RF+IFK N+ +IE N +Y LG N+F+D+TN+EF A YTG
Sbjct: 1 MAEYGRVYKDNDEKMRRFQIFKNNVNHIETFNNRNGNSYTLGINKFTDMTNNEFVAQYTG 60
Query: 111 YKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVE 170
+ P + + +++++ V S+DWRD AVT +KDQ CG CWAFSA+A VE
Sbjct: 61 -GISRPLNIEK-EPVVSFDDVNISAVGQSIDWRDYGAVTEVKDQNPCGSCWAFSAIATVE 118
Query: 171 GITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQG 230
GI KI L+ LSEQ+++DC+ + NGC GG ++ A+++II N G+A+E +YPYQA QG
Sbjct: 119 GIYKIVTGYLVSLSEQEVLDCAVS--NGCDGGFVDNAYDFIISNNGVASEADYPYQAYQG 176
Query: 231 TCSAAQKAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCG 290
C+A +A I+ Y V S DE ++ AV QP++ I A F+ Y G+F+G CG
Sbjct: 177 DCAANSWPNSAYITGYSYVRSNDESSMKYAVWNQPIAAAIDASGDNFQYYNGGVFSGPCG 236
Query: 291 TQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILR---DEGLCGIGTQSSYP 346
T L+HA+TI+G+G G YW++KNSWG +WG+ GY+++ R GLCGI YP
Sbjct: 237 TSLNHAITIIGYGQDSSGTQYWIVKNSWGSSWGERGYIRMARGVSSSGLCGIAMDPLYP 295
>gi|356517368|ref|XP_003527359.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 332
Score = 283 bits (723), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 157/339 (46%), Positives = 216/339 (63%), Gaps = 27/339 (7%)
Query: 19 FIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYI 78
F +++ + A QV + R+ + S+ E HE+ M ++G+ YKD ++ FKEN+ YI
Sbjct: 12 FAMLLCMAFLAFQV-TCRTLQDASMXERHEQRMTRYGKVYKDPPKRX-----FKENVNYI 65
Query: 79 EKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPT 138
E N N+ YK G N+F+ R + G+ M S R TT FK++N++ T P+
Sbjct: 66 EACNNAANKPYKRGINQFAP------RNRFKGH-MCSSIIRITT---FKFENVTAT--PS 113
Query: 139 SLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNG-NN 197
++D R K AVTPIKDQ +CGCCWAFSAVAA EGI +S LI LSEQ+LVDC T G +
Sbjct: 114 TVDCRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALSAGKLISLSEQELVDCDTKGVDX 173
Query: 198 GCGGGTMEKAFEYIIQNQGIATEDEYP-YQAVQGTCSAAQKAAAAK--ISNYEEVPSGDE 254
GC GG M+ AF++IIQN G+ + P Y V G C+A + A A I+ YE+VP+ +E
Sbjct: 174 GCEGGLMDDAFKFIIQNHGLKHXSQLPLYMGVDGKCNANEAAKNAATIITGYEDVPANNE 233
Query: 255 QALL-KAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWL 313
+A L KAV+ PVS I A ++F+ YK G+F G CGT+LDH VT VG+G ++DG YWL
Sbjct: 234 KAHLQKAVANNPVSEAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVSDDGTEYWL 293
Query: 314 IKNSWGDTWGDAGYMKILR----DEGLCGIGTQSSYPLA 348
+KNSWG WG+ GY+++ R +E LCGI Q+SYP A
Sbjct: 294 VKNSWGTEWGEEGYIRMQRGVDSEEALCGIAVQASYPSA 332
>gi|297845064|ref|XP_002890413.1| hypothetical protein ARALYDRAFT_472321 [Arabidopsis lyrata subsp.
lyrata]
gi|297336255|gb|EFH66672.1| hypothetical protein ARALYDRAFT_472321 [Arabidopsis lyrata subsp.
lyrata]
Length = 357
Score = 282 bits (721), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 143/344 (41%), Positives = 220/344 (63%), Gaps = 25/344 (7%)
Query: 24 LLVSCASQVVSSRSTHEQSVV--------------EMHEKWMAQHGRSYKDELEKEMRFK 69
L +S A+ +S ++H+ S+V E+ E W++ ++Y+ EK +RF+
Sbjct: 14 LALSAATLSLSVAASHDYSIVGYSPEDLESHDKLIELFENWISNFEKAYETVEEKLLRFE 73
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTS-STFKY 128
+FK+NL++I++ NK+ ++Y LG N F+DL+++EF+ +Y G K S + F Y
Sbjct: 74 VFKDNLKHIDETNKK-VKSYWLGLNEFADLSHEEFKKMYLGLKTDIVRRDEERSYAEFAY 132
Query: 129 QNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQL 188
+++ VP S+DWR K AV +K+Q CG CWAFS VAAVEGI KI NL LSEQ+L
Sbjct: 133 RDVEA--VPKSVDWRKKGAVAEVKNQGSCGSCWAFSTVAAVEGINKIVTGNLTTLSEQEL 190
Query: 189 VDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQ-KAAAAKISNYE 247
+DC T NNGC GG M+ AFEYI++N G+ E++YPY +GTC + ++ I ++
Sbjct: 191 IDCDTTYNNGCNGGLMDYAFEYIVKNGGLRKEEDYPYSMEEGTCEMQKDESETVTIDGHQ 250
Query: 248 EVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKE-GIFNGVCGTQLDHAVTIVGFGTTE 306
+VP+ DE++LLKA++ QP+S+ I A EF+ Y +F+G CG LDH V VG+G+++
Sbjct: 251 DVPTNDEKSLLKALAHQPLSVAIDASGREFQFYSGVSVFDGRCGVDLDHGVAAVGYGSSK 310
Query: 307 DGANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYP 346
G++Y ++KNSWG WG+ GY+++ R+ EGLCGI +S+P
Sbjct: 311 -GSDYIIVKNSWGPKWGEKGYIRLKRNTGKPEGLCGINKMASFP 353
>gi|146215988|gb|ABQ10196.1| actinidin Act3a [Actinidia eriantha]
Length = 380
Score = 281 bits (720), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 146/336 (43%), Positives = 211/336 (62%), Gaps = 12/336 (3%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEY 77
+F L+ S A S V+ ++E W+ ++G+SY E+EMR +IFKENL +
Sbjct: 13 LFFSTFLIFSFAIDAKISPLRTNDEVMALYESWLVKYGKSYNSLGEREMRIEIFKENLRF 72
Query: 78 IEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
I++ N + NR+Y +G N+F+DLT++E+R+ Y G+K S +S S+ + Q + +P
Sbjct: 73 IDEHNADPNRSYTVGLNQFADLTDEEYRSTYLGFKS---SLKSKVSNRYMPQVGEV--LP 127
Query: 138 TSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCS-TNGN 196
+DWR AV +K+Q C CWAF+ +A VE I +I +LI LSEQ+LVDC+ T N
Sbjct: 128 DYVDWRTTGAVVDVKNQGLCSSCWAFATIATVESINQIITGDLISLSEQELVDCNRTPIN 187
Query: 197 NGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAA-AKISNYEEVPSGDEQ 255
GC GG M+ A+E+II N GI TE+ YPY C +K I +YE+VP DE
Sbjct: 188 EGCKGGFMDDAYEFIINNGGINTEENYPYIGQDDQCDEPKKNQNYVTIDSYEQVPPNDEL 247
Query: 256 ALLKAVSMQPVSIGIAAYTTEFKSYKEGIFN-GVCGTQLDHAVTIVGFGTTEDGANYWLI 314
A+ +AV+ QPVS+ I AY F+ Y+ GIF G CGT L+HAVTI+G+G TE+G +YW++
Sbjct: 248 AMKRAVAYQPVSVAIDAYCLGFRFYQSGIFTGGSCGTTLNHAVTIIGYG-TENGIDYWIV 306
Query: 315 KNSWGDTWGDAGYMKILRD---EGLCGIGTQSSYPL 347
KNS+G WG++GY K+ R+ EG CGI + YP+
Sbjct: 307 KNSYGTQWGESGYGKVQRNVGGEGRCGIASYPFYPV 342
>gi|356509992|ref|XP_003523725.1| PREDICTED: oryzain alpha chain-like [Glycine max]
Length = 439
Score = 281 bits (720), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 140/313 (44%), Positives = 203/313 (64%), Gaps = 16/313 (5%)
Query: 45 EMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNR-----TYKLGTNRFSDL 99
E+ EKW +H ++Y E EK R K+F++N ++ + N+ N +Y L N F+DL
Sbjct: 31 ELFEKWCKEHSKTYSSEEEKLYRLKVFEDNYAFVAQHNQNANNNNNNSSYTLSLNAFADL 90
Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGC 159
T+ EF+ G + + + Q+ + +P+ +DWR AVTP+KDQ CG
Sbjct: 91 THHEFKTTRLGLPLTLLRFKRPQNQ----QSRDLLHIPSQIDWRQSGAVTPVKDQASCGA 146
Query: 160 CWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIAT 219
CWAFSA A+EGI KI +L+ LSEQ+L+DC T+ N+GCGGG M+ A++++I N+GI T
Sbjct: 147 CWAFSATGAIEGINKIVTGSLVSLSEQELIDCDTSYNSGCGGGLMDFAYQFVIDNKGIDT 206
Query: 220 EDEYPYQAVQGTCSAAQ-KAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFK 278
ED+YPYQA Q +CS + K A I +Y +VP +E+ +LKAV+ QPVS+GI EF+
Sbjct: 207 EDDYPYQARQRSCSKDKLKRRAVTIEDYVDVPPSEEE-ILKAVASQPVSVGICGSEREFQ 265
Query: 279 SYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD----E 334
Y +GIF G C T LDHAV IVG+G +E+G +YW++KNSWG WG GY+ ++R+ +
Sbjct: 266 LYSKGIFTGPCSTFLDHAVLIVGYG-SENGVDYWIVKNSWGKYWGMNGYIHMIRNSGNSK 324
Query: 335 GLCGIGTQSSYPL 347
G+CGI T +SYP+
Sbjct: 325 GICGINTLASYPV 337
>gi|242055323|ref|XP_002456807.1| hypothetical protein SORBIDRAFT_03g043220 [Sorghum bicolor]
gi|241928782|gb|EES01927.1| hypothetical protein SORBIDRAFT_03g043220 [Sorghum bicolor]
Length = 369
Score = 281 bits (719), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 137/317 (43%), Positives = 197/317 (62%), Gaps = 13/317 (4%)
Query: 40 EQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDL 99
++++ +++E+W H EK RF FKEN+ +I NK G+R Y+L NRF D+
Sbjct: 35 DEALWDLYERWQTHHHVHRHHG-EKGRRFGTFKENVRFIHAHNKRGDRPYRLSLNRFGDM 93
Query: 100 TNDEFRALYTGYKM----PSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQ 155
+EFR+ + ++ + S + F Y + TD+P S+DWR + AVT +KDQ
Sbjct: 94 GREEFRSTFADSRINDLRRAESPAAPAVPGFMYDGV--TDLPPSVDWRKEGAVTAVKDQG 151
Query: 156 ECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQ 215
CG CWAFS V +VEGI I +L+ LSEQ+L+DC T+ NGC GG ME AFE+I
Sbjct: 152 HCGSCWAFSTVVSVEGINAIRTGSLVSLSEQELIDCDTD-ENGCQGGLMENAFEFIKSYG 210
Query: 216 GIATEDEYPYQAVQGTCSA--AQKAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAY 273
G+ TE YPY+A GTC + +++ I ++ VP+G E AL KAV+ QPVS+ I A
Sbjct: 211 GVTTESAYPYRASNGTCDSVRSRRGQIVSIDGHQMVPTGSEDALAKAVANQPVSVAIDAG 270
Query: 274 TTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILR- 332
F+ Y EG+F G CGT LDH V VG+G ++DG YW++KNSWG +WG+ GY+++ R
Sbjct: 271 GQAFQFYSEGVFTGDCGTDLDHGVAAVGYGVSDDGTAYWIVKNSWGPSWGEGGYIRMQRG 330
Query: 333 --DEGLCGIGTQSSYPL 347
+ GLCGI ++S+P+
Sbjct: 331 AGNGGLCGIAMEASFPI 347
>gi|226507844|ref|NP_001148894.1| LOC100282514 precursor [Zea mays]
gi|194703250|gb|ACF85709.1| unknown [Zea mays]
gi|195622994|gb|ACG33327.1| vignain precursor [Zea mays]
Length = 356
Score = 281 bits (719), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 144/344 (41%), Positives = 203/344 (59%), Gaps = 30/344 (8%)
Query: 28 CASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNR 87
C + +V+ ++E E+WM +HGR Y D EK+ R ++++ N+E +E N GN
Sbjct: 18 CGAALVA----RADPMLERFEQWMGRHGRLYADAGEKQRRLEVYRRNVELVETFNSMGN- 72
Query: 88 TYKLGTNRFSDLTNDEFRALYTGYKMPSP---SHRSTTSSTFKYQNLSM------TDVPT 138
Y+L N+F+DLTN+EFRA G+ P + ST ST + +D+P
Sbjct: 73 GYRLADNKFADLTNEEFRAKMLGFGRPRSGGGAGHSTAPSTVACIGSGLMGRQGYSDLPK 132
Query: 139 SLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNG 198
S+DWR+K AV P+K Q +CG CWAFSAVAA+EGI +I L+ LSEQ+LVDC T G
Sbjct: 133 SVDWREKGAVAPVKSQGDCGSCWAFSAVAAIEGINQIKNGKLVSLSEQELVDCDTKA-IG 191
Query: 199 CGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQ-KAAAAKISNYEEVPSGDEQAL 257
C GG M AFE++++N+G+ TE YPYQ + G C + K +A IS Y V E L
Sbjct: 192 CAGGYMSWAFEFVMKNRGLTTERNYPYQGLNGACQTPKLKESAVSISGYMNVTPSSEPDL 251
Query: 258 LKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTE----------D 307
L+A + QPVS+ + A + ++ Y G+F G C +L+H VT+VG+G T+
Sbjct: 252 LRAAAAQPVSVAVDAGSFVWQLYGGGVFTGPCTAELNHGVTVVGYGETQGDTDGDGSGVP 311
Query: 308 GANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYPL 347
G YW++KNSWG WGDAGY+ + R+ GLCGI SYP+
Sbjct: 312 GKKYWIVKNSWGPEWGDAGYILMQREASVASGLCGIAMLPSYPV 355
>gi|414589857|tpg|DAA40428.1| TPA: Vignain [Zea mays]
Length = 377
Score = 281 bits (719), Expect = 4e-73, Method: Compositional matrix adjust.
Identities = 144/344 (41%), Positives = 203/344 (59%), Gaps = 30/344 (8%)
Query: 28 CASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNR 87
C + +V+ ++E E+WM +HGR Y D EK+ R ++++ N+E +E N GN
Sbjct: 39 CGAALVA----RADPMLERFEQWMGRHGRLYADAGEKQRRLEVYRRNVELVETFNSMGN- 93
Query: 88 TYKLGTNRFSDLTNDEFRALYTGYKMPSP---SHRSTTSSTFKYQNLSM------TDVPT 138
Y+L N+F+DLTN+EFRA G+ P + ST ST + +D+P
Sbjct: 94 GYRLADNKFADLTNEEFRAKMLGFGRPRSGGGAGHSTAPSTVACIGSGLMGRQGYSDLPK 153
Query: 139 SLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNG 198
S+DWR+K AV P+K Q +CG CWAFSAVAA+EGI +I L+ LSEQ+LVDC T G
Sbjct: 154 SVDWREKGAVAPVKSQGDCGSCWAFSAVAAIEGINQIKNGKLVSLSEQELVDCDTKA-IG 212
Query: 199 CGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQ-KAAAAKISNYEEVPSGDEQAL 257
C GG M AFE++++N+G+ TE YPYQ + G C + K +A IS Y V E L
Sbjct: 213 CAGGYMSWAFEFVMKNRGLTTERNYPYQGLNGACQTPKLKESAVSISGYMNVTPSSEPDL 272
Query: 258 LKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTE----------D 307
L+A + QPVS+ + A + ++ Y G+F G C +L+H VT+VG+G T+
Sbjct: 273 LRAAAAQPVSVAVDAGSFVWQLYGGGVFTGPCTAELNHGVTVVGYGETQGDTDGDGSGVP 332
Query: 308 GANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYPL 347
G YW++KNSWG WGDAGY+ + R+ GLCGI SYP+
Sbjct: 333 GKKYWIVKNSWGPEWGDAGYILMQREASVASGLCGIAMLPSYPV 376
>gi|224083362|ref|XP_002306996.1| predicted protein [Populus trichocarpa]
gi|222856445|gb|EEE93992.1| predicted protein [Populus trichocarpa]
Length = 336
Score = 281 bits (719), Expect = 4e-73, Method: Compositional matrix adjust.
Identities = 140/314 (44%), Positives = 205/314 (65%), Gaps = 10/314 (3%)
Query: 38 THEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFS 97
T ++++ E W+++H + Y+ EK RF+IFK+NL +I++ NK+ Y LG N F+
Sbjct: 24 TSRDRIIDLFESWISKHQKIYESIEEKWHRFEIFKDNLFHIDETNKK-VVNYWLGLNEFA 82
Query: 98 DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQEC 157
DL+++EF+ Y G + S+R S F Y+++S +P S+DWR K AVT +K+Q C
Sbjct: 83 DLSHEEFKNKYLGLNV-DLSNRRECSEEFTYKDVS--SIPKSVDWRKKGAVTDVKNQGSC 139
Query: 158 GCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGI 217
G CWAFS VAAVEGI +I NL LSEQ+LVDC T NNGC GG M+ AF YII N G+
Sbjct: 140 GSCWAFSTVAAVEGINQIVTGNLTSLSEQELVDCDTTYNNGCNGGLMDYAFAYIISNGGL 199
Query: 218 ATEDEYPYQAVQGTCSAAQ-KAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTE 276
E++YPY +GTC + ++ IS Y +VP E++LLKA++ QP+S+ I A +
Sbjct: 200 HKEEDYPYIMEEGTCEMRKAESEVVTISGYHDVPQNSEESLLKALANQPLSVAIDASGRD 259
Query: 277 FKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD--- 333
F+ Y G+F+G CGT+LDH V VG+G+ + G ++ ++KNSWG WG+ G++++ R+
Sbjct: 260 FQFYSGGVFDGHCGTELDHGVAAVGYGSAK-GLDFIVVKNSWGSKWGEKGFIRMKRNTGK 318
Query: 334 -EGLCGIGTQSSYP 346
GLCGI +SYP
Sbjct: 319 PAGLCGINKMASYP 332
>gi|2160175|gb|AAB60738.1| Strong similarity to Dianthus cysteine proteinase (gb|U17135)
[Arabidopsis thaliana]
Length = 416
Score = 281 bits (718), Expect = 4e-73, Method: Compositional matrix adjust.
Identities = 141/318 (44%), Positives = 204/318 (64%), Gaps = 18/318 (5%)
Query: 43 VVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTND 102
+ E+ + W +HG++Y E E++ R +IFK+N +++ + N N TY L N F+DLT+
Sbjct: 26 ISELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADLTHH 85
Query: 103 EFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT-DVPTSLDWRDKKAVTPIKDQQECGCCW 161
EF+A G + +PS + K Q+L + VP S+DWR K AVT +KDQ CG CW
Sbjct: 86 EFKASRLGLSVSAPSVIMAS----KGQSLGGSVKVPDSVDWRKKGAVTNVKDQGSCGACW 141
Query: 162 AFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATED 221
+FSA A+EGI +I +LI LSEQ+L+DC + N GC GG M+ AFE++I+N GI TE
Sbjct: 142 SFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNHGIDTEK 201
Query: 222 EYPYQAVQGTCSAAQ-KAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAA-------Y 273
+YPYQ GTC + K I +Y V S DE+AL++AV+ QPVS+GI Y
Sbjct: 202 DYPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALMEAVAAQPVSVGICGSERAFQLY 261
Query: 274 TTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD 333
+++F +GIF+G C T LDHAV IVG+G +++G +YW++KNSWG +WG G+M + R+
Sbjct: 262 SSKFYLLMQGIFSGPCSTSLDHAVLIVGYG-SQNGVDYWIVKNSWGKSWGMDGFMHMQRN 320
Query: 334 ----EGLCGIGTQSSYPL 347
+G+CGI +SYP+
Sbjct: 321 TENSDGVCGINMLASYPI 338
>gi|224081756|ref|XP_002306486.1| predicted protein [Populus trichocarpa]
gi|222855935|gb|EEE93482.1| predicted protein [Populus trichocarpa]
Length = 352
Score = 280 bits (716), Expect = 6e-73, Method: Compositional matrix adjust.
Identities = 141/311 (45%), Positives = 197/311 (63%), Gaps = 9/311 (2%)
Query: 44 VEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDE 103
+ M++ W+A+HG++Y E+ RF+IFK NL +I++ N + N TYK+G +F+DLTN+E
Sbjct: 1 MSMYKWWLAKHGKAYNGLGEEAERFEIFKNNLRFIDEHNSQ-NHTYKVGLTKFADLTNEE 59
Query: 104 FRALYTGYKMPSPSH-RSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWA 162
+RA++ G + + + S + +Y + +P S+DWR K AV PIKDQ CG CWA
Sbjct: 60 YRAMFLGTRSDAKRRLMKSKSPSERYAFKAGDKLPESVDWRAKGAVNPIKDQGSCGSCWA 119
Query: 163 FSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDE 222
FS VAAVEGI +I LI LSEQ+LVDC N GC GG M+ AF++II N G+ TE +
Sbjct: 120 FSTVAAVEGINQIVTGELISLSEQELVDCDRTYNAGCNGGLMDYAFQFIINNGGLDTEKD 179
Query: 223 YPYQA-VQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYK 281
YPY K A I +E+V DE+AL KAV+ QPVS+ I A + Y+
Sbjct: 180 YPYVGDDDKCDKDKMKTKAVSIDGFEDVLPYDEKALQKAVAHQPVSVAIEASGMALQFYQ 239
Query: 282 EGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD-----EGL 336
G+F G CGT LDH V +VG+ +E+G +YWL++NSWG WG+ GY+K+ R+ G
Sbjct: 240 SGVFTGECGTALDHGVVVVGY-ASENGLDYWLVRNSWGTEWGEHGYIKMQRNVGDTYTGR 298
Query: 337 CGIGTQSSYPL 347
CGI +SSYP+
Sbjct: 299 CGIAMESSYPV 309
>gi|1514953|dbj|BAA11170.1| cysteine proteinase [Oryza sativa (japonica cultivar-group)]
Length = 368
Score = 280 bits (716), Expect = 7e-73, Method: Compositional matrix adjust.
Identities = 141/329 (42%), Positives = 193/329 (58%), Gaps = 14/329 (4%)
Query: 28 CASQVVSSRSTH-EQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGN 86
CA+ R ++++ +++E+W H + EK RF FK+N+ YI + NK
Sbjct: 26 CAAIPFDERDLESDEALWDLYERWQEHH-HVPRHHGEKHRRFGAFKDNVRYIHEHNKRAP 84
Query: 87 RTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSST---FKYQNLSMTDVPTSLDWR 143
L NRF D+ +EFRA + G + F Y+ + D+P ++DWR
Sbjct: 85 GYAPL--NRFGDMGREEFRATFAGSHANDLRRDGLAAPPLPGFMYEGVR--DLPRAVDWR 140
Query: 144 DKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGT 203
K AVT +KDQ +CG CWAFS V +VEGI I L+ LSEQ+L+DC T N+GC GG
Sbjct: 141 RKGAVTGVKDQGKCGSCWAFSTVVSVEGINAIRTGRLVSLSEQELIDCDTADNSGCQGGL 200
Query: 204 MEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQ-KAAAAKISNYEEVPSGDEQALLKAVS 262
ME AFEYI + GI TE YPY+A GTC A + + I ++ VP+ E AL KAV+
Sbjct: 201 MENAFEYIKHSGGITTESAYPYRAANGTCDAVRARGGLVVIDGHQNVPANSEAALAKAVA 260
Query: 263 MQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTW 322
QPVS+ I A F+ Y +G+F G CGT LDH V +VG+G T DG YW++KNSWG W
Sbjct: 261 NQPVSVAIDAGDQSFQFYSDGVFAGDCGTDLDHGVAVVGYGETNDGTEYWIVKNSWGTAW 320
Query: 323 GDAGYMKILRDE----GLCGIGTQSSYPL 347
G+ GY+++ RD GLCGI ++SYP+
Sbjct: 321 GEGGYIRMQRDSGYDGGLCGIAMEASYPV 349
>gi|4426617|gb|AAD20453.1| cysteine endopeptidase precursor [Oryza sativa]
Length = 368
Score = 280 bits (715), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 141/329 (42%), Positives = 193/329 (58%), Gaps = 14/329 (4%)
Query: 28 CASQVVSSRSTH-EQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGN 86
CA+ R ++++ +++E+W H + EK RF FK+N+ YI + NK
Sbjct: 26 CAAIPFDERDLESDEALWDLYERWQEHH-HVPRHHGEKHRRFGAFKDNVRYIHEHNKRAP 84
Query: 87 RTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSST---FKYQNLSMTDVPTSLDWR 143
L NRF D+ +EFRA + G + F Y+ + D+P ++DWR
Sbjct: 85 GYPPL--NRFGDMGREEFRATFAGSHANDLRRDGLAAPPLPGFMYEGVR--DLPRAVDWR 140
Query: 144 DKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGT 203
K AVT +KDQ +CG CWAFS V +VEGI I L+ LSEQ+L+DC T N+GC GG
Sbjct: 141 RKGAVTGVKDQGKCGSCWAFSTVVSVEGINAIRTGRLVSLSEQELIDCDTADNSGCQGGL 200
Query: 204 MEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQ-KAAAAKISNYEEVPSGDEQALLKAVS 262
ME AFEYI + GI TE YPY+A GTC A + + I ++ VP+ E AL KAV+
Sbjct: 201 MENAFEYIKHSGGITTESAYPYRAANGTCDAVRARGGLVVIDGHQNVPANSEAALAKAVA 260
Query: 263 MQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTW 322
QPVS+ I A F+ Y +G+F G CGT LDH V +VG+G T DG YW++KNSWG W
Sbjct: 261 NQPVSVAIDAGDQSFQFYSDGVFAGDCGTDLDHGVAVVGYGETNDGTEYWIVKNSWGTAW 320
Query: 323 GDAGYMKILRDE----GLCGIGTQSSYPL 347
G+ GY+++ RD GLCGI ++SYP+
Sbjct: 321 GEGGYIRMQRDSGYDGGLCGIAMEASYPV 349
>gi|359359168|gb|AEV41073.1| putative cysteine protease [Oryza minuta]
Length = 499
Score = 280 bits (715), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 145/322 (45%), Positives = 206/322 (63%), Gaps = 18/322 (5%)
Query: 40 EQSVVEMHEKWMAQH---GRSYKDEL-EKEMRFKIFKENLEYIEKANKEGNRT--YKLGT 93
E +++ W+A+H G S+ + E E RF++F +NL++++ N + ++LG
Sbjct: 58 EAEARAVYDLWVARHRHGGGSHNGLVGEYERRFRVFWDNLKFVDAHNARADEHGGFRLGM 117
Query: 94 NRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVT-PIK 152
NRF+DLTNDEFRA Y G P+ R + Y++ + +P S+DWRDK AV P+K
Sbjct: 118 NRFADLTNDEFRAAYLG-TTPAGRGRHVGEA---YRHDGVEALPDSVDWRDKGAVVAPVK 173
Query: 153 DQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNG-NNGCGGGTMEKAFEYI 211
+Q +CG CWAFSAVAAVEGI KI L+ LSEQ+LV+C+ NG N+GC GG M+ AF +I
Sbjct: 174 NQGQCGSCWAFSAVAAVEGINKIVTGELVSLSEQELVECARNGANSGCNGGMMDDAFAFI 233
Query: 212 IQNQGIATEDEYPYQAVQGTCSAAQKA-AAAKISNYEEVPSGDEQALLKAVSMQPVSIGI 270
+N G+ TE++YPY A+ G C+ A+K+ I +E+VP DE +L KAV+ QPVS+ I
Sbjct: 234 ARNGGLDTEEDYPYTAMDGKCNLAKKSRKVVSIDGFEDVPENDELSLQKAVAHQPVSVAI 293
Query: 271 AAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGT-TEDGANYWLIKNSWGDTWGDAGYMK 329
A EF+ Y G+F G CGT LDH V VG+GT G +YW ++NSWG WG+ GY++
Sbjct: 294 DAGGREFQLYDSGVFTGRCGTSLDHGVVAVGYGTDAATGTDYWTVRNSWGPDWGENGYIR 353
Query: 330 ILRD----EGLCGIGTQSSYPL 347
+ R+ G CGI +SYP+
Sbjct: 354 MERNVTARTGKCGIAMMASYPI 375
>gi|359359215|gb|AEV41119.1| putative cysteine protease [Oryza officinalis]
Length = 499
Score = 279 bits (714), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 145/322 (45%), Positives = 206/322 (63%), Gaps = 18/322 (5%)
Query: 40 EQSVVEMHEKWMAQH---GRSYKDEL-EKEMRFKIFKENLEYIEKANKEGNRT--YKLGT 93
E +++ W+A+H G S+ + E E RF++F +NL++++ N + ++LG
Sbjct: 58 EAEARAVYDLWVARHRHGGDSHNGLVGEYERRFRVFWDNLKFVDAHNARADEHGGFRLGM 117
Query: 94 NRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVT-PIK 152
NRF+DLTNDEFRA Y G P+ R + Y++ + +P S+DWRDK AV P+K
Sbjct: 118 NRFADLTNDEFRAAYLG-TTPAGRGRHVGEA---YRHDGVEVLPDSVDWRDKGAVVAPVK 173
Query: 153 DQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNG-NNGCGGGTMEKAFEYI 211
+Q +CG CWAFSAVAAVEGI KI L+ LSEQ+LV+C+ NG N+GC GG M+ AF +I
Sbjct: 174 NQGQCGSCWAFSAVAAVEGINKIVTGELVSLSEQELVECARNGANSGCNGGMMDDAFAFI 233
Query: 212 IQNQGIATEDEYPYQAVQGTCSAAQKA-AAAKISNYEEVPSGDEQALLKAVSMQPVSIGI 270
+N G+ TE++YPY A+ G C+ A+K+ I +E+VP DE +L KAV+ QPVS+ I
Sbjct: 234 ARNGGLDTEEDYPYTAMDGKCNLAKKSRKVVSIDGFEDVPENDELSLQKAVAHQPVSVAI 293
Query: 271 AAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGT-TEDGANYWLIKNSWGDTWGDAGYMK 329
A EF+ Y G+F G CGT LDH V VG+GT G +YW ++NSWG WG+ GY++
Sbjct: 294 DAGGREFQLYDSGVFTGRCGTSLDHGVVAVGYGTDAATGTDYWTVRNSWGPDWGENGYIR 353
Query: 330 ILRD----EGLCGIGTQSSYPL 347
+ R+ G CGI +SYP+
Sbjct: 354 MERNVTARTGKCGIAMMASYPI 375
>gi|113120269|gb|ABI30274.1| VS-B, partial [Vasconcellea stipulata]
Length = 341
Score = 279 bits (713), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 142/296 (47%), Positives = 189/296 (63%), Gaps = 10/296 (3%)
Query: 41 QSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLT 100
+S + + E WM +H + YK EK RF+ FK+NL YI++ NK+ N +Y LG N F+DLT
Sbjct: 42 ESSIRLFESWMLKHDKVYKTIDEKIYRFETFKDNLMYIDETNKK-NNSYWLGLNEFADLT 100
Query: 101 NDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCC 160
+DEF+ Y G +P S S ++ N + D P S+DWR K AVTP+K+Q CG C
Sbjct: 101 HDEFKEKYVG-SIPEDSMIIEQSDDVEFPNKHVVDYPESIDWRQKGAVTPVKNQNPCGSC 159
Query: 161 WAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATE 220
WAFS VA VEGI KI NLI LSEQ+L+DC ++GC GG + +Y++ N G+ TE
Sbjct: 160 WAFSTVATVEGINKIVTGNLISLSEQELLDCDRR-SHGCKGGYQTTSLKYVVDN-GVHTE 217
Query: 221 DEYPYQAVQGTCSAA-QKAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKS 279
EYPY+ QG C A +K I+ Y+ VPS DE +L+K +S+QPVS+ + + F+
Sbjct: 218 KEYPYEKKQGNCRAKNKKGLKVYINGYKRVPSNDEISLIKTISIQPVSVLVESKGRPFQF 277
Query: 280 YKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRDEG 335
YK G+F G CGT+LDHAVT VG+ G +Y LIKNSWG WGD GY+KI R G
Sbjct: 278 YKGGVFGGPCGTKLDHAVTAVGY-----GKDYILIKNSWGPKWGDKGYIKIKRASG 328
>gi|359359120|gb|AEV41026.1| putative cysteine protease [Oryza minuta]
Length = 464
Score = 279 bits (713), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 145/322 (45%), Positives = 206/322 (63%), Gaps = 18/322 (5%)
Query: 40 EQSVVEMHEKWMAQH---GRSYKDEL-EKEMRFKIFKENLEYIEKANKEGNRT--YKLGT 93
E +++ W+A+H G S+ + E E RF++F +NL++++ N + ++LG
Sbjct: 59 EAEARAVYDLWVARHRHGGGSHNGFVGEYERRFRVFWDNLKFVDAHNAHADEHGGFRLGM 118
Query: 94 NRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAV-TPIK 152
NRF+DLTNDEFRA Y G +P+ R Y++ + +P S+DWRDK AV +P+K
Sbjct: 119 NRFADLTNDEFRAAYLG---TTPAGRGRHVGEM-YRHDGVEALPDSVDWRDKGAVVSPVK 174
Query: 153 DQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTN-GNNGCGGGTMEKAFEYI 211
+Q +CG CWAFSAVAAVEGI KI L+ LSEQ+LV+C+ N GN+GC GG M+ AF +I
Sbjct: 175 NQGQCGSCWAFSAVAAVEGINKIVTGELVSLSEQELVECARNRGNSGCNGGIMDDAFAFI 234
Query: 212 IQNQGIATEDEYPYQAVQGTCSAAQKA-AAAKISNYEEVPSGDEQALLKAVSMQPVSIGI 270
+N G+ TE++YPY A+ G C A+K+ I +E+VP DE +L KAV+ QPVS+ I
Sbjct: 235 TRNGGLDTEEDYPYTAMDGKCDLAKKSRKVVSIDGFEDVPENDELSLQKAVAHQPVSVAI 294
Query: 271 AAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGT-TEDGANYWLIKNSWGDTWGDAGYMK 329
A EF+ Y G+F G CGT LDH V VG+GT G +YW ++NSWG WG+ GY++
Sbjct: 295 DAGGREFQLYDSGVFTGRCGTSLDHGVVAVGYGTDAATGTDYWTVRNSWGPDWGENGYIR 354
Query: 330 ILRD----EGLCGIGTQSSYPL 347
+ R+ G CGI +SYP+
Sbjct: 355 MERNVTARTGKCGIAMMASYPI 376
>gi|302142276|emb|CBI19479.3| unnamed protein product [Vitis vinifera]
Length = 388
Score = 279 bits (713), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 145/308 (47%), Positives = 193/308 (62%), Gaps = 41/308 (13%)
Query: 46 MHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFR 105
++E W+A+HG+SY EKE RF+IFK+NL +I++ N E NRTYK+ ++R++ D
Sbjct: 3 VYEAWLAKHGKSYNALGEKERRFQIFKDNLRFIDEHNAE-NRTYKI-SDRYAFRVGDS-- 58
Query: 106 ALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSA 165
+P S+DWR K AV +KDQ CG CWAFS
Sbjct: 59 ------------------------------LPESVDWRKKGAVVEVKDQGSCGSCWAFST 88
Query: 166 VAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPY 225
+AAVEGI KI LI LSEQ+LVDC T+ N GC GG M+ AFE+II N GI +E++YPY
Sbjct: 89 IAAVEGINKIVTGGLISLSEQELVDCDTSYNEGCNGGLMDYAFEFIINNGGIDSEEDYPY 148
Query: 226 QAVQGTCSAAQK-AAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGI 284
+A G C +K A I YE+VP DE++L KAV+ QPVS+ I A EF+ Y+ GI
Sbjct: 149 KASDGRCDQYRKNAKVVTIDGYEDVPENDEKSLEKAVANQPVSVAIEAGGREFQLYQSGI 208
Query: 285 FNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD-----EGLCGI 339
F G CGT LDH VT VG+G TE+G +YW++KNSWG +WG+ GY+++ RD G CGI
Sbjct: 209 FTGRCGTALDHGVTAVGYG-TENGVDYWIVKNSWGASWGEEGYIRMERDLATSATGKCGI 267
Query: 340 GTQSSYPL 347
++SYP+
Sbjct: 268 AMEASYPI 275
>gi|242094002|ref|XP_002437491.1| hypothetical protein SORBIDRAFT_10g028010 [Sorghum bicolor]
gi|241915714|gb|EER88858.1| hypothetical protein SORBIDRAFT_10g028010 [Sorghum bicolor]
Length = 397
Score = 279 bits (713), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 144/334 (43%), Positives = 204/334 (61%), Gaps = 28/334 (8%)
Query: 40 EQSVVEMHEKWMAQHGRSYKD----ELEKEMRFKIFKENLEYIEKANKE---GNRTYKLG 92
++ V M+E W ++HGR + E +R ++F++NL YI+ N E G T++LG
Sbjct: 47 DEEVRRMYEAWKSKHGRPRGNCDMAGDEDRLRLEVFRDNLRYIDAHNAEADAGLHTFRLG 106
Query: 93 TNRFSDLTNDEFRALYTGYKMP---SPSHRSTTSSTFKYQNLSMT----------DVPTS 139
F+DLT +E+R G++ PS R+ S S D+P +
Sbjct: 107 LTPFADLTLEEYRGRALGFRARHRGGPSARAAASRVGSGGTRSHHRRPRPRPRCGDLPDA 166
Query: 140 LDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGC 199
+DWR AVT +K+Q++CG CWAFSAVAA+EGI I NL+ LSEQ+++DC T ++GC
Sbjct: 167 IDWRQLGAVTDVKNQEQCGGCWAFSAVAAIEGINAIVTGNLVSLSEQEIIDCDTQ-DSGC 225
Query: 200 GGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSA--AQKAAAAKISNYEEVPSGDEQAL 257
GG ME AF+++I N GI +E +YP+ A GTC A A A I + EV S +E AL
Sbjct: 226 NGGQMENAFQFVIDNGGIDSEADYPFIATDGTCDANKANDEKVAAIDGFVEVASNNETAL 285
Query: 258 LKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNS 317
+AV++QPVS+ I A F+ Y GIFNG CGT LDH VT+VG+G +E+G YW++KNS
Sbjct: 286 QEAVAIQPVSVAIDAGGRAFQHYSSGIFNGPCGTNLDHGVTVVGYG-SENGKAYWIVKNS 344
Query: 318 WGDTWGDAGYMKILRD----EGLCGIGTQSSYPL 347
W D+WG+AGY++I R+ G CGI +SYP+
Sbjct: 345 WSDSWGEAGYIRIRRNVFLPVGKCGIAMDASYPV 378
>gi|302781881|ref|XP_002972714.1| hypothetical protein SELMODRAFT_98707 [Selaginella moellendorffii]
gi|300159315|gb|EFJ25935.1| hypothetical protein SELMODRAFT_98707 [Selaginella moellendorffii]
Length = 446
Score = 278 bits (712), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 136/309 (44%), Positives = 195/309 (63%), Gaps = 15/309 (4%)
Query: 50 WMAQHGRSYKDELE-KEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALY 108
W A+ G+ + RF+ FKEN YIE+ N+ G +Y+LG N+FSDLT++EFR +
Sbjct: 16 WCAKFGKECASSNSLGDRRFETFKENFRYIEEHNRAGKHSYRLGLNQFSDLTSEEFRQRF 75
Query: 109 TGYK---MPSPSHRSTTSSTFK--YQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAF 163
G + + SP + S + +QN+ D+P S+DWR AVT KDQ CG CWAF
Sbjct: 76 LGLRPDLIDSPVLKMPRDSDIEEGFQNV---DLPASVDWRKHGAVTAPKDQGSCGGCWAF 132
Query: 164 SAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEY 223
+ A+EGI +I L+ LSEQ+L+DC + GC GG ME A+++I++N G+ TE +Y
Sbjct: 133 ATTGAIEGINQIVTGQLMSLSEQELIDCDKKADKGCDGGLMENAYQFIVENGGLDTETDY 192
Query: 224 PYQAVQGTCSAAQ-KAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKE 282
PY A + C+ + + I YE +P GDEQALL+AV+ QPVS+ I + +F+ Y
Sbjct: 193 PYHASESHCNMKKLNSRVVAIDGYEAIPDGDEQALLRAVAKQPVSVAIEGASKDFQHYAS 252
Query: 283 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRDE----GLCG 338
G+F G CG +++H V IVG+G TEDG +YW++KNSW TWGD G++K+ R+ GLC
Sbjct: 253 GVFTGHCGEEINHGVLIVGYG-TEDGLDYWIVKNSWAATWGDGGFVKMQRNTGKRGGLCS 311
Query: 339 IGTQSSYPL 347
I T +SYP+
Sbjct: 312 INTLASYPV 320
>gi|296090463|emb|CBI40282.3| unnamed protein product [Vitis vinifera]
Length = 386
Score = 277 bits (709), Expect = 4e-72, Method: Compositional matrix adjust.
Identities = 143/308 (46%), Positives = 194/308 (62%), Gaps = 41/308 (13%)
Query: 46 MHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFR 105
++E W+ +HG+SY E+E RF+IFK+NL +IE+ N NRTYK+G +R+S FR
Sbjct: 3 VYEAWLVKHGKSYNALGERERRFEIFKDNLRFIEEHNAV-NRTYKVG-DRYS------FR 54
Query: 106 ALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSA 165
A D+P S+DWR+K AV P+KDQ CG CWAFS
Sbjct: 55 A--------------------------GEDLPESVDWREKGAVVPVKDQGNCGSCWAFST 88
Query: 166 VAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPY 225
+AAVEGI +I+ +LI LSEQ+LVDC + N GC GG M+ AFE+II N GI +E++YPY
Sbjct: 89 IAAVEGINQIATGDLISLSEQELVDCDKSYNQGCNGGLMDYAFEFIINNGGIDSEEDYPY 148
Query: 226 QAVQGTCSAAQK-AAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGI 284
+A TC +K A I YE+VP DE++L KAV+ QPVS+ I A F+ Y+ G+
Sbjct: 149 RAADTTCDPNRKNARVVSIDGYEDVPQNDERSLKKAVANQPVSVAIEAGGRAFQLYQSGV 208
Query: 285 FNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILR-----DEGLCGI 339
F G CGTQLDH V VG+G TE+ +YW+++NSWG WG++GY+K+ R + G CGI
Sbjct: 209 FTGQCGTQLDHGVVAVGYG-TENSVDYWIVRNSWGPNWGESGYIKLERNLAGTETGKCGI 267
Query: 340 GTQSSYPL 347
+ SYP+
Sbjct: 268 AIEPSYPI 275
>gi|341850671|gb|AEK97329.1| chromoplast senescence-associated protein 12 [Brassica rapa var.
parachinensis]
Length = 260
Score = 277 bits (708), Expect = 6e-72, Method: Compositional matrix adjust.
Identities = 137/259 (52%), Positives = 180/259 (69%), Gaps = 8/259 (3%)
Query: 95 RFSDLTNDEFRALYTGYKMPS--PSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIK 152
+F+++TNDEFR++YTGYK S S T S++F+YQN+S +P ++DWR K AVTPIK
Sbjct: 1 QFAEITNDEFRSMYTGYKGDSVLSSQSQTKSTSFRYQNVSSGALPIAVDWRKKGAVTPIK 60
Query: 153 DQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYII 212
+Q CGCCWAFSAVAA+EG T+I LI LSEQQLVDC TN + GC GG ++ AFE+I+
Sbjct: 61 NQGSCGCCWAFSAVAAIEGATQIKKGKLISLSEQQLVDCDTN-DFGCSGGLIDTAFEHIM 119
Query: 213 QNQGIATEDEYPYQAVQGTCS-AAQKAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIA 271
G+ TE YPY+ TC + +AA I+ YE+VP DE AL+KAV+ QPVS+GI
Sbjct: 120 ATGGLTTESNYPYKGEDATCKIKSTXPSAASITGYEDVPVNDENALMKAVAHQPVSVGIE 179
Query: 272 AYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKIL 331
+F+ Y G+F G C T LDHAVT VG+ + G+ YW+IKNSWG WG+ GYM+I
Sbjct: 180 GGGFDFQFYSSGVFTGECTTYLDHAVTAVGYSQSSAGSKYWIIKNSWGTKWGEGGYMRIK 239
Query: 332 RD----EGLCGIGTQSSYP 346
+D EGLCG+ ++SYP
Sbjct: 240 KDIKDKEGLCGLAMKASYP 258
>gi|357166364|ref|XP_003580686.1| PREDICTED: oryzain alpha chain-like [Brachypodium distachyon]
Length = 360
Score = 277 bits (708), Expect = 7e-72, Method: Compositional matrix adjust.
Identities = 144/332 (43%), Positives = 212/332 (63%), Gaps = 18/332 (5%)
Query: 25 LVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKAN-- 82
+ S + Q+ S E+ M+ +W AQHG +E +E R++ F++NL YI++ N
Sbjct: 26 IASSSGQIRS-----EEETRRMYAEWTAQHGSPITNE--EEGRYEAFRDNLRYIDEHNAA 78
Query: 83 -KEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLD 141
G +++LG NRF+ LTN+E+RA Y G ++ S + + +Y+ +P S+D
Sbjct: 79 ADAGIHSFRLGLNRFAGLTNEEYRAAYLGLRLRSGAVGDLRKPSARYEAADGEALPESVD 138
Query: 142 WRDKKAVTPIKDQ-QECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCG 200
WR+K AV +KDQ + CG WAFSA+AAVE I +I LI LSEQ+L+DC T+ N GC
Sbjct: 139 WREKGAVGKVKDQGRSCGSAWAFSAIAAVESINQIVTGELISLSEQELMDCDTSYNAGCD 198
Query: 201 GGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQK-AAAAKISNYEEVPSGDEQALLK 259
GG M+ AFE+II N GI T+++YPY+A +C A ++ A I +YE++ +E++L K
Sbjct: 199 GGLMDDAFEFIISNGGIDTDEDYPYKARNDSCDANKRNRKAVTIDDYEDLRM-NEKSLQK 257
Query: 260 AVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWG 319
AVS QPVS+ I A +F+ YK GIF G CGT LDHA TIVG+G +E+G +YW++K S+G
Sbjct: 258 AVSNQPVSVAIEAGGRDFQLYKSGIFTGTCGTDLDHATTIVGYG-SENGTDYWIVKESYG 316
Query: 320 DTWGDAGYMKILRD----EGLCGIGTQSSYPL 347
+WG++GY ++ R+ G CGI SYP+
Sbjct: 317 TSWGESGYARMERNIKETSGKCGIAMLPSYPV 348
>gi|359359166|gb|AEV41071.1| putative oryzain beta chain precursor [Oryza minuta]
Length = 464
Score = 276 bits (707), Expect = 7e-72, Method: Compositional matrix adjust.
Identities = 140/307 (45%), Positives = 207/307 (67%), Gaps = 12/307 (3%)
Query: 47 HEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKAN-KEGNRTYKLGTNRFSDLTNDEFR 105
++ W+A++GRSY E E RF++F +NL + + N + + ++LG NRF+DLTN+EFR
Sbjct: 53 YDLWLAENGRSYNALGEHERRFRVFWDNLRFADAHNARADDHGFRLGMNRFADLTNEEFR 112
Query: 106 ALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSA 165
A + G K+ S ++ +Y++ + ++P S+DWR+K AV P+K+Q +CG CWAFSA
Sbjct: 113 ATFLGAKVVERSR----AAGERYRHDGVEELPESVDWREKGAVAPVKNQGQCGSCWAFSA 168
Query: 166 VAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGT-MEKAFEYIIQNQGIATEDEYP 224
V+ VE I ++ +I LSEQ+LV+CSTNG NG G M+ AF++II+N GI TED+YP
Sbjct: 169 VSTVESINQLVTGEMITLSEQELVECSTNGQNGGCNGGLMDDAFDFIIKNGGIDTEDDYP 228
Query: 225 YQAVQGTCSA-AQKAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEG 283
Y+AV G C + A I +E+VP DE++L KAV+ QPVS+ I A EF+ Y G
Sbjct: 229 YKAVDGKCDINRENAKVVSIDGFEDVPQNDEKSLQKAVAHQPVSVAIEAGGREFQLYHSG 288
Query: 284 IFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD----EGLCGI 339
+F+G CGT LDH V VG+G T++G +YW+++NSWG WG++GY+++ R+ G CGI
Sbjct: 289 VFSGRCGTSLDHGVVAVGYG-TDNGKDYWIVRNSWGPKWGESGYVRMERNINVTTGKCGI 347
Query: 340 GTQSSYP 346
+SYP
Sbjct: 348 AMMASYP 354
>gi|297744465|emb|CBI37727.3| unnamed protein product [Vitis vinifera]
Length = 331
Score = 276 bits (707), Expect = 9e-72, Method: Compositional matrix adjust.
Identities = 137/309 (44%), Positives = 197/309 (63%), Gaps = 31/309 (10%)
Query: 43 VVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTND 102
++ E W+++HG+ YK EK RF++F+ENL +I++ NKE + +Y LG N F+DL+++
Sbjct: 45 LIARFESWVSKHGKVYKSMEEKLHRFEVFRENLNHIDERNKEVS-SYWLGLNEFADLSHE 103
Query: 103 EFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWA 162
EF++ + D+P S+DWR K AVT +K+Q CG CWA
Sbjct: 104 EFKSK------------------------DVADLPESVDWRKKGAVTHVKNQGACGSCWA 139
Query: 163 FSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDE 222
FS VAAVEGI +I NL LSEQ+L+DC T N+GC GG M+ AF +I N G+ ED+
Sbjct: 140 FSTVAAVEGINQIVTGNLTTLSEQELIDCDTTFNSGCNGGLMDYAFAFIASNGGLHKEDD 199
Query: 223 YPYQAVQGTCSAAQK-AAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYK 281
YPY +GTC ++ IS YE+VP DE++LLKA++ QP+S+ I A +F+ Y
Sbjct: 200 YPYLMEEGTCEEQKEDVDIVTISGYEDVPEKDEESLLKALAHQPLSVAIEASGRDFQFYS 259
Query: 282 EGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD----EGLC 337
G+FNG CGT+LDH V VG+G+++ G +Y ++KNSWG WG+ GY+++ R+ EGLC
Sbjct: 260 GGVFNGPCGTELDHGVAAVGYGSSK-GLDYIIVKNSWGPKWGEKGYIRMKRNTGKTEGLC 318
Query: 338 GIGTQSSYP 346
GI +SYP
Sbjct: 319 GINKMASYP 327
>gi|944916|gb|AAA74430.1| cysteine proteinase [Mesembryanthemum crystallinum]
Length = 367
Score = 276 bits (705), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 147/317 (46%), Positives = 205/317 (64%), Gaps = 20/317 (6%)
Query: 40 EQSVVEMHEKWMAQH--GRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFS 97
++++ +++E+W + + RS+ EK+ RF +FKEN++YI + NK ++ YKL N+F
Sbjct: 37 DETLWDLYERWRSVYTSARSFG---EKQNRFHVFKENVKYINEVNKM-DKPYKLRLNQFG 92
Query: 98 DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQEC 157
DLT EF Y K+ + S F Y+N+ +VP S+DWR K AVTP+K+Q C
Sbjct: 93 DLTPSEFARTYANSKIIEGTRNE--SGGFMYENV---EVPRSIDWRVKGAVTPVKNQGRC 147
Query: 158 GCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGI 217
G CWAFSA AAVEGI +I+ LI LSEQQL+DC T N+GC GGTM +AFEYI Q GI
Sbjct: 148 GGCWAFSAAAAVEGINQITTGQLISLSEQQLIDCDTQ-NSGCRGGTMGRAFEYIKQRGGI 206
Query: 218 ATEDEYPYQAVQGTC-SAAQKAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYT-- 274
+E YPY+A G C + + I Y + E A+LK ++ QPVS+ + A T
Sbjct: 207 TSEANYPYKAQAGMCKNNLIQRPTVSIDGYYNIRR-SEDAVLKILAHQPVSVAVDATTWS 265
Query: 275 -TEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILR- 332
++ Y +G+F G CGT+L+H VT VG+GTT DG +YW+IKNSWG+TWG+ GYM++LR
Sbjct: 266 SLDWMFYFQGVFTGPCGTKLNHGVTAVGYGTTNDGYDYWIIKNSWGETWGERGYMRMLRG 325
Query: 333 --DEGLCGIGTQSSYPL 347
GLCGI Q+S+P+
Sbjct: 326 VSPYGLCGIAMQASFPI 342
>gi|297843784|ref|XP_002889773.1| hypothetical protein ARALYDRAFT_471096 [Arabidopsis lyrata subsp.
lyrata]
gi|297335615|gb|EFH66032.1| hypothetical protein ARALYDRAFT_471096 [Arabidopsis lyrata subsp.
lyrata]
Length = 439
Score = 275 bits (704), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 141/311 (45%), Positives = 197/311 (63%), Gaps = 13/311 (4%)
Query: 45 EMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEF 104
E+ + W +HG++Y E E++ R +IFK+N +++ + N N TY L N F+DLT+ EF
Sbjct: 30 ELFDDWCQRHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADLTHHEF 89
Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLS-MTDVPTSLDWRDKKAVTPIKDQQECGCCWAF 163
+A G + + S + K Q+L VP S+DWR K AVT +KDQ CG CW+F
Sbjct: 90 KASRLGLSVSASSLIMAS----KGQSLGGNAKVPDSVDWRKKGAVTNVKDQGSCGACWSF 145
Query: 164 SAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEY 223
SA A+EGI +I +LI LSEQ+L+DC + N GC GG M+ AFE++I+N GI TE +Y
Sbjct: 146 SATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNHGIDTEKDY 205
Query: 224 PYQAVQGTCSAAQ-KAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKE 282
PYQ GTC + K I +Y V S DE+AL +AV+ QPVS+GI F+ Y
Sbjct: 206 PYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALREAVAAQPVSVGICGSERAFQLYSR 265
Query: 283 --GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD----EGL 336
GIF+G C T LDHAV IVG+G +++G +YW++KNSWG +WG G+M + R+ EG+
Sbjct: 266 VSGIFSGPCSTSLDHAVLIVGYG-SQNGVDYWIVKNSWGKSWGMDGFMHMQRNTGNSEGI 324
Query: 337 CGIGTQSSYPL 347
CGI +SYP+
Sbjct: 325 CGINMLASYPI 335
>gi|558563|emb|CAA57538.1| cysteine proteinase [Cicer arietinum]
Length = 325
Score = 275 bits (704), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 138/308 (44%), Positives = 187/308 (60%), Gaps = 9/308 (2%)
Query: 46 MHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFR 105
M+EKW+ +H + Y EK+ RF+IFK+NL +I++ N + N +YK+G N+F+D+ N+E+R
Sbjct: 3 MYEKWLVKHQKMYNGLGEKDTRFQIFKDNLRFIDEHNAQ-NYSYKVGLNKFADINNEEYR 61
Query: 106 ALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSA 165
+Y G K + T T + V +DWR K AVT IKDQ CG CWAFS
Sbjct: 62 DMYLGTKSDAKRRVMKTKITGHRITYNSVIVTVKVDWRLKGAVTHIKDQGSCGSCWAFST 121
Query: 166 VAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPY 225
+A VE I KI + LSEQ+LVDC N GC GG M+ AFE+II+N GI T+ +YPY
Sbjct: 122 IATVEAINKIVTGKFVSLSEQELVDCDRAFNEGCNGGLMDYAFEFIIRNGGIDTDQDYPY 181
Query: 226 QAVQGTCSAAQK-AAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGI 284
+ C +K A I YE+VPS AL KAV+ QPVS+ IA + Y+ G+
Sbjct: 182 NGFERKCDPTKKNAKVVSIDGYEDVPSY-MNALKKAVAHQPVSVAIAGLGRALQLYQSGV 240
Query: 285 FNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRDE-----GLCGI 339
F G CGT LDH V +VG+G +E+G +YWL++NSWG WG+ GY KI CGI
Sbjct: 241 FTGKCGTDLDHGVVVVGYG-SENGVDYWLVRNSWGTNWGEDGYFKIASRNVKSLYRKCGI 299
Query: 340 GTQSSYPL 347
++SYP+
Sbjct: 300 AMEASYPV 307
>gi|57118007|gb|AAW34135.1| cysteine protease gp2b [Zingiber officinale]
Length = 379
Score = 275 bits (704), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 144/333 (43%), Positives = 210/333 (63%), Gaps = 14/333 (4%)
Query: 23 ILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKAN 82
+L +S V RS E V ++ +W A++ + K E R ++FKENL++++K N
Sbjct: 29 VLTLSKQGGAVPVRSDEE--VRMLYLEWRAKNHPAEKYLDLNEYRLEVFKENLQFVDKHN 86
Query: 83 KEGNR---TYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSS-TFKYQNLSMTDVPT 138
+R T++LG NRF+DLTN+E+R + + S RS + + +Y+ D+P
Sbjct: 87 AAADRGEHTFRLGMNRFADLTNEEYRTRF--LRDFSRLRRSASGKISSRYRLREGDDLPD 144
Query: 139 SLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNG 198
S+DWR+K AV P+K+Q CG CWAFS VAAVEGI +I +LI LSEQQLVDC+T N+G
Sbjct: 145 SIDWREKGAVVPVKNQGGCGSCWAFSTVAAVEGINQIVTGDLISLSEQQLVDCTT-ANHG 203
Query: 199 CGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALL 258
C GG M AF++I+ N GI +E+ YPY+ G C++ A I +YE VPS +EQ+L
Sbjct: 204 CRGGWMNPAFQFIVNNGGINSEETYPYRGQNGICNSTVNAPVVSIDSYENVPSHNEQSLQ 263
Query: 259 KAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSW 318
KAV+ QPVS+ + A +F+ Y+ GIF G C +HA+T+VG+GT D +Y +KNSW
Sbjct: 264 KAVANQPVSVTMDAAGRDFQLYRSGIFTGSCNISANHALTVVGYGTEND-KDYRTVKNSW 322
Query: 319 GDTWGDAGYMKILRD----EGLCGIGTQSSYPL 347
G WG++GY+++ R+ G CGI +SYP+
Sbjct: 323 GKNWGESGYIRVERNIGNPNGKCGITRFASYPV 355
>gi|302812789|ref|XP_002988081.1| hypothetical protein SELMODRAFT_183539 [Selaginella moellendorffii]
gi|300144187|gb|EFJ10873.1| hypothetical protein SELMODRAFT_183539 [Selaginella moellendorffii]
Length = 425
Score = 275 bits (703), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 135/309 (43%), Positives = 194/309 (62%), Gaps = 15/309 (4%)
Query: 50 WMAQHGRSYKDELE-KEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALY 108
W A+ G+ + RF+ FKEN YIE+ N+ G +Y+LG N+FSDLT++EFR +
Sbjct: 16 WCAKFGKECASSNSLGDHRFETFKENFRYIEEHNRAGKHSYRLGLNQFSDLTSEEFRQRF 75
Query: 109 TGYK---MPSPSHRSTTSSTFK--YQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAF 163
G + + SP + S + +QN+ D+P S+DWR AVT KDQ CG CWAF
Sbjct: 76 LGLRPDLIDSPVLKMPRDSDIEEGFQNV---DLPASVDWRQHGAVTAPKDQGSCGGCWAF 132
Query: 164 SAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEY 223
+ A+EGI +I L+ LSEQ+L+DC + GC GG ME A+++I++N G+ TE +Y
Sbjct: 133 ATTGAIEGINQIVTGQLVSLSEQELIDCDKKADKGCDGGLMENAYQFIVENGGLDTETDY 192
Query: 224 PYQAVQGTCSAAQ-KAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKE 282
PY A + C+ + + I Y+ +P GDEQALL AV+ QPVS+ I + +F+ Y
Sbjct: 193 PYHASESHCNMKKLNSRVVAIDGYKAIPEGDEQALLLAVAKQPVSVAIEGASKDFQHYAS 252
Query: 283 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRDE----GLCG 338
G+F G CG +++H V IVG+G TEDG +YW++KNSW TWGD G++K+ R+ GLC
Sbjct: 253 GVFTGHCGEEINHGVLIVGYG-TEDGLDYWIVKNSWAATWGDGGFVKMQRNTGKRGGLCS 311
Query: 339 IGTQSSYPL 347
I T +SYP+
Sbjct: 312 INTLASYPV 320
>gi|225458143|ref|XP_002280937.1| PREDICTED: cysteine proteinase RD21a [Vitis vinifera]
gi|302142569|emb|CBI19772.3| unnamed protein product [Vitis vinifera]
Length = 436
Score = 275 bits (703), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 140/335 (41%), Positives = 206/335 (61%), Gaps = 18/335 (5%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEY 77
++ + IL+++ S V + ST ++ E W Q+G++Y E EK R K+F+EN +
Sbjct: 5 LWAVSILILAVHSSVSEASST-----ADLFEAWCEQYGKTYSSEEEKASRLKVFEENHAF 59
Query: 78 IEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSH-RSTTSSTFKYQNLSMTDV 136
+ + N N +Y L N F+DLT+ EF+A G+ SP +S S Q L V
Sbjct: 60 VTQHNSMANASYTLALNAFADLTHHEFKASRLGF---SPGRAQSIRSVGTPVQEL---HV 113
Query: 137 PTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGN 196
P ++DWR AVT +KDQ CG CW+FS A+EGI KI +L+ LSEQ+LVDC + N
Sbjct: 114 PPAVDWRKSGAVTGVKDQGNCGGCWSFSTTGAIEGINKIVTGSLVSLSEQELVDCDRSYN 173
Query: 197 NGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQ-KAAAAKISNYEEVPSGDEQ 255
+GC GG M+ A++++I+NQGI +E +YPY + C+ + K I Y ++P DE+
Sbjct: 174 SGCEGGLMDYAYQFVIKNQGIDSEADYPYVGMDKPCNKEKLKKHIVTIDGYTDIPPNDEK 233
Query: 256 ALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIK 315
LL+ V+ QPVS+GI F+ Y +G++ G C + LDHAV IVG+G TEDG ++W++K
Sbjct: 234 QLLQVVAKQPVSVGICGSEKTFQLYSKGVYTGPCSSTLDHAVLIVGYG-TEDGVDFWIVK 292
Query: 316 NSWGDTWGDAGYMKILRD----EGLCGIGTQSSYP 346
NSWG+ WG GY+ +LR+ EG+CGI +SYP
Sbjct: 293 NSWGEHWGMRGYIHMLRNNGTAEGICGINMLASYP 327
>gi|90265242|emb|CAH67695.1| H0624F09.3 [Oryza sativa Indica Group]
Length = 494
Score = 275 bits (702), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 138/295 (46%), Positives = 192/295 (65%), Gaps = 14/295 (4%)
Query: 63 EKEMRFKIFKENLEYIEKANKEGNRT--YKLGTNRFSDLTNDEFRALYTGYKMPSPSHRS 120
E E RF++F +NL++++ N + ++LG NRF+DLTN EFRA Y G P+ R
Sbjct: 84 EHERRFRVFWDNLKFVDAHNARADERGGFRLGMNRFADLTNGEFRATYLG-TTPAGRGRR 142
Query: 121 TTSSTFKYQNLSMTDVPTSLDWRDKKAVT-PIKDQQECGCCWAFSAVAAVEGITKISGAN 179
+ Y++ + +P S+DWRDK AV P+K+Q +CG CWAFSAVAAVEGI KI
Sbjct: 143 VGEA---YRHDGVEALPDSVDWRDKGAVVAPVKNQGQCGSCWAFSAVAAVEGINKIVTGE 199
Query: 180 LIQLSEQQLVDCSTNG-NNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKA 238
L+ LSEQ+LV+C+ NG N+GC GG M+ AF +I +N G+ TE++YPY A+ G C+ A+++
Sbjct: 200 LVSLSEQELVECARNGQNSGCNGGIMDDAFAFIARNGGLDTEEDYPYTAMDGKCNLAKRS 259
Query: 239 -AAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAV 297
I +E+VP DE +L KAV+ QPVS+ I A EF+ Y G+F G CGT LDH V
Sbjct: 260 RKVVSIDGFEDVPENDELSLQKAVAHQPVSVAIDAGGREFQLYDSGVFTGRCGTNLDHGV 319
Query: 298 TIVGFGT-TEDGANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYPL 347
VG+GT GA YW ++NSWG WG+ GY+++ R+ G CGI +SYP+
Sbjct: 320 VAVGYGTDAATGAAYWTVRNSWGPDWGENGYIRMERNVTARTGKCGIAMMASYPI 374
>gi|326490904|dbj|BAJ90119.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 457
Score = 274 bits (701), Expect = 4e-71, Method: Compositional matrix adjust.
Identities = 136/310 (43%), Positives = 186/310 (60%), Gaps = 12/310 (3%)
Query: 48 EKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANK------EGNRTYKLGTNRFSDLTN 101
E W A+HG++Y E+ R F EN ++ N G +Y L N F+DLT+
Sbjct: 40 EAWCAEHGKAYATPGERAARLAAFAENAAFVAAHNDAVASSGPGGPSYTLALNAFADLTH 99
Query: 102 DEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCW 161
DEFRA G P S + + VP +LDWR AVT +KDQ CG CW
Sbjct: 100 DEFRAARLGRLAVGPGPLGAPSPSDGGFEGRVGAVPDALDWRQSGAVTKVKDQGSCGACW 159
Query: 162 AFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATED 221
+FSA A+EGI KI+ +L+ LSEQ+L+DC + N GCGGG M A++++I+N GI TED
Sbjct: 160 SFSATGAMEGINKITTGSLLSLSEQELIDCDRSYNTGCGGGLMTYAYKFVIKNGGIDTED 219
Query: 222 EYPYQAVQGTCSAAQ-KAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSY 280
+YP++ GTC+ + K I Y+EVPS E LL+AV+ QP+S+GI F+ Y
Sbjct: 220 DYPFREADGTCNKNKLKKHVVTIDGYKEVPSSKEDLLLQAVAQQPISVGICGSARAFQLY 279
Query: 281 KEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD----EGL 336
+GIF+G C T LDHAV IVG+G +E G +YW++KNSWG+ WG GYM + R+ G+
Sbjct: 280 SQGIFDGPCPTSLDHAVLIVGYG-SEGGKDYWIVKNSWGERWGMKGYMHMHRNTGSSSGI 338
Query: 337 CGIGTQSSYP 346
CGI +S+P
Sbjct: 339 CGINMMASFP 348
>gi|115461226|ref|NP_001054213.1| Os04g0670500 [Oryza sativa Japonica Group]
gi|62510688|sp|Q7XR52.2|CYSP1_ORYSJ RecName: Full=Cysteine protease 1; AltName: Full=OsCP1; Flags:
Precursor
gi|38345300|emb|CAE02828.2| OSJNBa0043A12.33 [Oryza sativa Japonica Group]
gi|113565784|dbj|BAF16127.1| Os04g0670500 [Oryza sativa Japonica Group]
gi|215741575|dbj|BAG98070.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 490
Score = 274 bits (701), Expect = 4e-71, Method: Compositional matrix adjust.
Identities = 138/295 (46%), Positives = 192/295 (65%), Gaps = 14/295 (4%)
Query: 63 EKEMRFKIFKENLEYIEKANKEGNRT--YKLGTNRFSDLTNDEFRALYTGYKMPSPSHRS 120
E E RF++F +NL++++ N + ++LG NRF+DLTN EFRA Y G P+ R
Sbjct: 84 EHERRFRVFWDNLKFVDAHNARADERGGFRLGMNRFADLTNGEFRATYLG-TTPAGRGRR 142
Query: 121 TTSSTFKYQNLSMTDVPTSLDWRDKKAVT-PIKDQQECGCCWAFSAVAAVEGITKISGAN 179
+ Y++ + +P S+DWRDK AV P+K+Q +CG CWAFSAVAAVEGI KI
Sbjct: 143 VGEA---YRHDGVEALPDSVDWRDKGAVVAPVKNQGQCGSCWAFSAVAAVEGINKIVTGE 199
Query: 180 LIQLSEQQLVDCSTNG-NNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKA 238
L+ LSEQ+LV+C+ NG N+GC GG M+ AF +I +N G+ TE++YPY A+ G C+ A+++
Sbjct: 200 LVSLSEQELVECARNGQNSGCNGGIMDDAFAFIARNGGLDTEEDYPYTAMDGKCNLAKRS 259
Query: 239 -AAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAV 297
I +E+VP DE +L KAV+ QPVS+ I A EF+ Y G+F G CGT LDH V
Sbjct: 260 RKVVSIDGFEDVPENDELSLQKAVAHQPVSVAIDAGGREFQLYDSGVFTGRCGTNLDHGV 319
Query: 298 TIVGFGT-TEDGANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYPL 347
VG+GT GA YW ++NSWG WG+ GY+++ R+ G CGI +SYP+
Sbjct: 320 VAVGYGTDAATGAAYWTVRNSWGPDWGENGYIRMERNVTARTGKCGIAMMASYPI 374
>gi|2507252|sp|P14080.2|PAPA2_CARPA RecName: Full=Chymopapain; AltName: Full=Papaya proteinase II;
Short=PPII; Flags: Precursor
gi|1332461|emb|CAA66378.1| chymopapain [Carica papaya]
Length = 352
Score = 274 bits (700), Expect = 5e-71, Method: Compositional matrix adjust.
Identities = 142/318 (44%), Positives = 199/318 (62%), Gaps = 16/318 (5%)
Query: 38 THEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFS 97
T + ++++ + WM +H + Y+ EK RF+IF++NL YI++ NK+ N +Y LG N F+
Sbjct: 39 TSIERLIQLFDSWMLKHNKIYESIDEKIYRFEIFRDNLMYIDETNKK-NNSYWLGLNGFA 97
Query: 98 DLTNDEFRALYTGY---KMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQ 154
DL+NDEF+ Y G+ H T+K+ +T+ P S+DWR K AVTP+K+Q
Sbjct: 98 DLSNDEFKKKYVGFVAEDFTGLEHFDNEDFTYKH----VTNYPQSIDWRAKGAVTPVKNQ 153
Query: 155 QECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQN 214
CG CWAFS +A VEGI KI NL++LSEQ+LVDC + + GC GG + +Y + N
Sbjct: 154 GACGSCWAFSTIATVEGINKIVTGNLLELSEQELVDCDKH-SYGCKGGYQTTSLQY-VAN 211
Query: 215 QGIATEDEYPYQAVQGTCSAAQKAAA-AKISNYEEVPSGDEQALLKAVSMQPVSIGIAAY 273
G+ T YPYQA Q C A K KI+ Y+ VPS E + L A++ QP+S+ + A
Sbjct: 212 NGVHTSKVYPYQAKQYKCRATDKPGPKVKITGYKRVPSNCETSFLGALANQPLSVLVEAG 271
Query: 274 TTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILR- 332
F+ YK G+F+G CGT+LDHAVT VG+GT+ DG NY +IKNSWG WG+ GYM++ R
Sbjct: 272 GKPFQLYKSGVFDGPCGTKLDHAVTAVGYGTS-DGKNYIIIKNSWGPNWGEKGYMRLKRQ 330
Query: 333 ---DEGLCGIGTQSSYPL 347
+G CG+ S YP
Sbjct: 331 SGNSQGTCGVYKSSYYPF 348
>gi|384253406|gb|EIE26881.1| hypothetical protein COCSUDRAFT_21961 [Coccomyxa subellipsoidea
C-169]
Length = 481
Score = 273 bits (699), Expect = 6e-71, Method: Compositional matrix adjust.
Identities = 137/306 (44%), Positives = 193/306 (63%), Gaps = 13/306 (4%)
Query: 50 WMAQHGRSYKDELEK-EMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALY 108
W+ ++YKD +E+ E +F ++ +NLE++ N E + T+KLG F+DLT+DE+R
Sbjct: 51 WVEHLQKAYKDNVEEYERKFSVWLDNLEFVHSHN-EKDSTFKLGLTNFADLTHDEYRQHA 109
Query: 109 TGYKMPSPSHRSTTSSTFKYQNLSMTD--VPTSLDWRDKKAVTPIKDQQECGCCWAFSAV 166
GY+ P + T T K D P S+DWR K AVT +K+QQ+CG CWAFS
Sbjct: 110 LGYR---PELKGTGLGTGKSTGFQYADYEAPPSIDWRKKGAVTDVKNQQQCGSCWAFSTT 166
Query: 167 AAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQ 226
+VEG I L+ LSEQ+LVDC ++GC GG M+ AF +II+N GI TE +Y Y+
Sbjct: 167 GSVEGANAIYSGELVSLSEQELVDCDVTQDHGCHGGLMDFAFSFIIRNGGIDTEKDYKYK 226
Query: 227 AVQGTCS-AAQKAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIF 285
A G C+ A +K I +YE+VP DE AL KA + QP+S+ I A EF+ Y G+F
Sbjct: 227 AQDGVCNIAKEKRHVVTIDSYEDVPPNDESALKKAAANQPISVAIEADQREFQLYAGGVF 286
Query: 286 NGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILR----DEGLCGIGT 341
+ CGT LDH V +VG+G +++G +YW++KNSWGD WGD+GY+++ R G CGI
Sbjct: 287 DAPCGTALDHGVLVVGYG-SDNGTDYWIVKNSWGDFWGDSGYIRLARGISNSAGQCGIAM 345
Query: 342 QSSYPL 347
Q+SYP+
Sbjct: 346 QASYPI 351
>gi|57118011|gb|AAW34137.1| cysteine protease gp3b [Zingiber officinale]
Length = 466
Score = 273 bits (699), Expect = 7e-71, Method: Compositional matrix adjust.
Identities = 143/317 (45%), Positives = 203/317 (64%), Gaps = 13/317 (4%)
Query: 40 EQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNR---TYKLGTNRF 96
++ V ++++W A+H + D+ + R ++FKENL ++++ N +R Y+LG NRF
Sbjct: 36 DEEVRIIYQEWRAKHRPAENDQYVGDYRLEVFKENLRFVDEHNAAADRGEHAYRLGMNRF 95
Query: 97 SDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDV-PTSLDWRDKKAVTPIKDQQ 155
+DLTN+E+RA + + S RST+ L DV P S+DWR+K AV +K Q
Sbjct: 96 ADLTNEEYRARFL--RDLSRLGRSTSGEISNQYRLREGDVLPDSIDWREKGAVVAVKSQG 153
Query: 156 ECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQ 215
CG CWAF+A+A VEGI +I +LI LSEQQLVDCST N+GC GG +AF+YII N
Sbjct: 154 RCGSCWAFAAIATVEGINQIVTGDLISLSEQQLVDCSTR-NHGCEGGWPYRAFQYIINNG 212
Query: 216 GIATEDEYPYQAVQGTCSAAQ-KAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYT 274
G+ +E+ YPY GTC+ + A I +Y VPS DE++L KAV+ QP+S+GI A
Sbjct: 213 GVNSEEHYPYTGTNGTCNTTKGNAHVVSIDSYRNVPSNDEKSLQKAVANQPISVGINASG 272
Query: 275 TEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD- 333
F+ Y GIF G C T L+H VT+VG+GT +G +YW++KNSWG++WGD+GY+ + R+
Sbjct: 273 RNFQLYHSGIFTGSCNTSLNHGVTVVGYGTV-NGNDYWIVKNSWGESWGDSGYILMERNI 331
Query: 334 ---EGLCGIGTQSSYPL 347
G CGI SYP+
Sbjct: 332 AESSGKCGIAISPSYPI 348
>gi|255635584|gb|ACU18142.1| unknown [Glycine max]
Length = 345
Score = 273 bits (699), Expect = 7e-71, Method: Compositional matrix adjust.
Identities = 135/309 (43%), Positives = 200/309 (64%), Gaps = 10/309 (3%)
Query: 43 VVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTND 102
++E+ E WM++HG+ Y+ EK +RF+IFK+NL++I++ NK + Y LG N F+DL++
Sbjct: 43 LIELFESWMSKHGKIYQSIEEKLLRFEIFKDNLKHIDERNKVVS-NYWLGLNEFADLSHQ 101
Query: 103 EFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWA 162
EF+ Y G K+ S R + F Y+++ ++P S+DWR K AV P+K+Q CG CWA
Sbjct: 102 EFKNKYLGLKVDY-SRRRESPEEFTYKDV---ELPKSVDWRKKGAVAPVKNQGSCGSCWA 157
Query: 163 FSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDE 222
FS VAAVEGI +I NL LSEQ+L+DC +NGC GG M+ AF +I++N G+ E++
Sbjct: 158 FSTVAAVEGINQIVTGNLTSLSEQELIDCDRTYSNGCNGGLMDYAFSFIVENGGLHKEED 217
Query: 223 YPYQAVQGTCS-AAQKAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYK 281
YPY +GTC ++ IS Y +VP +EQ+LLKA++ Q +S+ I A +F+ Y
Sbjct: 218 YPYIMEEGTCEMTKEETEVVTISGYHDVPQNNEQSLLKALANQSLSVAIEASGRDFQFYS 277
Query: 282 EGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKI---LRDEGLCG 338
G+F+G CG+ LDH V VG+GT + G +Y ++KNSWG WG+ GY+++ L G
Sbjct: 278 GGVFDGHCGSDLDHGVAAVGYGTAK-GVDYIIVKNSWGSKWGEKGYIRMRGTLETRGNLR 336
Query: 339 IGTQSSYPL 347
+SYPL
Sbjct: 337 YLQMASYPL 345
>gi|297845822|ref|XP_002890792.1| hypothetical protein ARALYDRAFT_473117 [Arabidopsis lyrata subsp.
lyrata]
gi|297336634|gb|EFH67051.1| hypothetical protein ARALYDRAFT_473117 [Arabidopsis lyrata subsp.
lyrata]
Length = 322
Score = 273 bits (698), Expect = 9e-71, Method: Compositional matrix adjust.
Identities = 143/327 (43%), Positives = 207/327 (63%), Gaps = 35/327 (10%)
Query: 30 SQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTY 89
SQ + +EQS+V+ H++WM Q R Y+DE EKEMR ++FK+NL++IE N GN++Y
Sbjct: 21 SQARPHVTLNEQSIVDYHQQWMTQFSRVYQDESEKEMRLQVFKKNLKFIENFNNMGNQSY 80
Query: 90 KLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPT---SLDWRDKK 146
+G N F+D T +EF A +TG ++ + + T +N +++D+ S DWRD+
Sbjct: 81 TVGVNEFTDWTIEEFLATHTGLRVNVTTLSELFNETMPSRNWNISDIDIDDESKDWRDEG 140
Query: 147 AVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEK 206
AV P+K Q C G+TKISG NL+ LSEQQL+DC T N GC GG +E+
Sbjct: 141 AVIPVKVQGAC-------------GLTKISGKNLLTLSEQQLIDCDTEKNTGCDGGGIEE 187
Query: 207 AFEYIIQNQGIATEDEYPYQAVQGTCSA-AQKAAAAKISNYEEVPSGDEQALLKAVSMQP 265
AF+YII+N G++ E EYPYQ +G+C A A+ A +I +E VPS +E+ALL+AV QP
Sbjct: 188 AFKYIIKNGGVSLETEYPYQVKKGSCRANARSATQTQIRGFEMVPSHNERALLEAVRRQP 247
Query: 266 VSIGIAAYTTEFKSYKEGIFNGV-CGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGD 324
VS+ I A FK+YK G++ G+ CGT ++HAVT VG+GT +I+ +WG+
Sbjct: 248 VSVLIDARADSFKTYKGGVYAGLDCGTDVNHAVTFVGYGT--------MIQ-----SWGE 294
Query: 325 AGYMKILRD----EGLCGIGTQSSYPL 347
GYM+I RD +G+CGI ++YP+
Sbjct: 295 NGYMRIRRDVEWPQGMCGIAQVAAYPI 321
>gi|4469153|emb|CAB38314.1| chymopapain isoform II [Carica papaya]
Length = 352
Score = 273 bits (698), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 142/318 (44%), Positives = 198/318 (62%), Gaps = 16/318 (5%)
Query: 38 THEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFS 97
T + ++++ + WM +H + Y+ EK RF+IF++NL YI++ NK+ N +Y LG N F+
Sbjct: 39 TSIERLIQLFDSWMLKHNKIYESIDEKIYRFEIFRDNLMYIDETNKK-NNSYWLGLNGFA 97
Query: 98 DLTNDEFRALYTGY---KMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQ 154
DL+NDEF+ Y G+ H T+K+ +T+ P S+DWR K AVTP+K+Q
Sbjct: 98 DLSNDEFKKKYVGFVAEDFTGLEHFDNEDFTYKH----VTNYPQSIDWRAKGAVTPVKNQ 153
Query: 155 QECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQN 214
CG CWAFS +A VEGI KI NL++LSEQ+LVDC + + GC GG + +Y + N
Sbjct: 154 GACGSCWAFSTIATVEGINKIVTGNLLELSEQELVDCDKH-SYGCKGGYQTTSLQY-VAN 211
Query: 215 QGIATEDEYPYQAVQGTCSAAQKAAA-AKISNYEEVPSGDEQALLKAVSMQPVSIGIAAY 273
G+ T YPYQA Q C A K KI+ Y+ VPS E + L A++ QP+S + A
Sbjct: 212 NGVHTSKVYPYQAKQYKCRATDKPGPKVKITGYKRVPSNCETSFLGALANQPLSFLVEAG 271
Query: 274 TTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILR- 332
F+ YK G+F+G CGT+LDHAVT VG+GT+ DG NY +IKNSWG WG+ GYM++ R
Sbjct: 272 GKPFQLYKSGVFDGPCGTKLDHAVTAVGYGTS-DGKNYIIIKNSWGPNWGEKGYMRLKRQ 330
Query: 333 ---DEGLCGIGTQSSYPL 347
+G CG+ S YP
Sbjct: 331 SGNSQGTCGVYKSSYYPF 348
>gi|218202389|gb|EEC84816.1| hypothetical protein OsI_31898 [Oryza sativa Indica Group]
Length = 350
Score = 273 bits (697), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 138/329 (41%), Positives = 194/329 (58%), Gaps = 21/329 (6%)
Query: 38 THEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFS 97
T +++ E+WM +HGR+Y D EK+ RF++++ N+E +E N N YKL N+F+
Sbjct: 23 TRADLMLDRFEQWMIRHGRAYTDSGEKQRRFEVYRRNVELVETFNSMSN-GYKLADNKFA 81
Query: 98 DLTNDEFRALYTGYK--MPSPSHRSTTSSTFKYQNLSMTDV-PTSLDWRDKKAVTPIKDQ 154
DLTN+EFRA G++ + P +T S+ S D+ P S+DWR K AV +K+Q
Sbjct: 82 DLTNEEFRAKMLGFRPHVTIPQISNTCSADIAMPGESSDDILPKSVDWRKKGAVVEVKNQ 141
Query: 155 QECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQN 214
+CG CWAFSAVAA+EGI +I L+ LSEQ+LVDC GCGGG M AFE+++ N
Sbjct: 142 GDCGSCWAFSAVAAIEGINQIKNGELVSLSEQELVDCDDEA-VGCGGGYMSWAFEFVVGN 200
Query: 215 QGIATEDEYPYQAVQGTCSAAQ-KAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAY 273
G+ TE YPY A G C AA+ +A I+ Y V E L +A + QPVS+ +
Sbjct: 201 HGLTTEASYPYHAANGACQAAKLNQSAVAIAGYRNVTPSSEPDLARAAAAQPVSVAVDGG 260
Query: 274 TTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGAN----------YWLIKNSWGDTWG 323
+ F+ Y G++ G C ++H VT+VG+G +E + YW++KNSWG WG
Sbjct: 261 SFMFQLYGSGVYTGPCTADVNHGVTVVGYGESEPKTDGGGAAKGGEKYWIVKNSWGAEWG 320
Query: 324 DAGYMKILRD-----EGLCGIGTQSSYPL 347
DAGY+ + RD GLCGI SYP+
Sbjct: 321 DAGYILMQRDVAGLASGLCGIALLPSYPV 349
>gi|223946391|gb|ACN27279.1| unknown [Zea mays]
Length = 279
Score = 273 bits (697), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 133/262 (50%), Positives = 170/262 (64%), Gaps = 18/262 (6%)
Query: 99 LTNDEFRALYTGYKMPSPSHR---------STTSSTFKYQNLSMTDVPTSLDWRDKKAVT 149
+T DEFR Y G ++ HR S ++S+F Y + DVP S+DWR K AVT
Sbjct: 1 MTADEFRRHYAGSRVAH--HRMFRGDRQGSSASASSFMYAD--ARDVPASVDWRQKGAVT 56
Query: 150 PIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFE 209
+KDQ +CG CWAFS +AAVEGI I NL LSEQQLVDC T N GC GG M+ AF+
Sbjct: 57 DVKDQGQCGSCWAFSTIAAVEGINAIKTKNLTSLSEQQLVDCDTKANAGCNGGLMDYAFQ 116
Query: 210 YIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAVSMQPVSIG 269
YI ++ G+A ED YPY+A Q +C + A I YE+VP+ DE AL KAV+ QPVS+
Sbjct: 117 YIAKHGGVAAEDAYPYRARQASCKKS-PAPVVTIDGYEDVPANDESALKKAVAHQPVSVA 175
Query: 270 IAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMK 329
I A + F+ Y EG+F+G CGT+LDH V VG+G T DG YWL+KNSWG WG+ GY++
Sbjct: 176 IEASGSHFQFYSEGVFSGRCGTELDHGVAAVGYGVTADGTKYWLVKNSWGPEWGEKGYIR 235
Query: 330 ILRD----EGLCGIGTQSSYPL 347
+ RD EG CGI ++SYP+
Sbjct: 236 MARDVAAKEGHCGIAMEASYPV 257
>gi|57118009|gb|AAW34136.1| cysteine protease gp3a [Zingiber officinale]
Length = 475
Score = 273 bits (697), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 148/344 (43%), Positives = 213/344 (61%), Gaps = 16/344 (4%)
Query: 13 INTIPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFK 72
++ +P I+ L A + RS E ++ +++W +H + D+ + R ++FK
Sbjct: 21 VSVVPPLDILTLSKQ-AWAAPAGRSDEEVRII--YQEWRVKHRPAENDQYVGDYRLEVFK 77
Query: 73 ENLEYIEKANKEGNR---TYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQ 129
ENL ++++ N +R Y+LG NRF+DLTN+E+RA + + S RST+
Sbjct: 78 ENLRFVDEHNAAADRGEHAYRLGMNRFADLTNEEYRARFL--RDLSRLGRSTSGEISNQY 135
Query: 130 NLSMTDV-PTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQL 188
L DV P S+DWR+K AV +K+Q CG CWAF+A+AAVEGI +I +LI LSEQQL
Sbjct: 136 RLREGDVLPDSIDWREKGAVVAVKNQGRCGSCWAFAAIAAVEGINQIVTGDLISLSEQQL 195
Query: 189 VDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQK-AAAAKISNYE 247
VDCST N GC GG +AF+YII N G+ +E+ YPY GTC+ ++ A I +Y
Sbjct: 196 VDCSTR-NYGCEGGWPYRAFQYIINNGGVNSEEHYPYTGTNGTCNTTKENAHVVSIDSYR 254
Query: 248 EVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTED 307
VPS DE++L KA + QP+S+GI A F+ Y GIF G C T L+H VT+VG+G TE+
Sbjct: 255 NVPSNDEKSLQKAAANQPISVGIDASGRNFQLYHSGIFTGSCNTSLNHGVTVVGYG-TEN 313
Query: 308 GANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYPL 347
G +YW++KNSWG+ WG++GY+ + R+ G CGI SYP+
Sbjct: 314 GNDYWIVKNSWGENWGNSGYILMERNIAESSGKCGIAISPSYPI 357
>gi|357154164|ref|XP_003576692.1| PREDICTED: vignain-like [Brachypodium distachyon]
Length = 427
Score = 273 bits (697), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 136/309 (44%), Positives = 186/309 (60%), Gaps = 11/309 (3%)
Query: 48 EKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRAL 107
E+WM +HGR+Y + EK+ RF+++KENL IE+ N G Y L N+F+DLTN+EFRA
Sbjct: 120 EQWMGKHGRAYANGGEKQRRFEVYKENLALIEEFNS-GGHGYTLTDNKFADLTNEEFRAK 178
Query: 108 YTGYKMPSPSHRSTTSSTFKYQNL----SMTDVPTSLDWRDKKAVTPIKDQQECGCCWAF 163
G P R L + TD+P +DWR K AV +K+Q CG CWAF
Sbjct: 179 MLGGLGADPDRRRRARHASNALELPGNDNSTDLPKDVDWRKKGAVVEVKNQGSCGSCWAF 238
Query: 164 SAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEY 223
SAVAA+EG+ +I L+ LSEQ+LVDC GC GG M AFE+++ N G+ TE Y
Sbjct: 239 SAVAAMEGLNQIKNGKLVSLSEQELVDCDAEAV-GCAGGFMSWAFEFVMANHGLTTEASY 297
Query: 224 PYQAVQGTCSAAQ-KAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKE 282
PY+ + G C A+ ++ I+ Y V E LLK ++QPVS+ + A F+ Y
Sbjct: 298 PYKGINGACQTAKLNESSVSITGYVNVTVNSEAELLKVAAVQPVSVAVDAGGFLFQLYAG 357
Query: 283 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRDE----GLCG 338
G+F+G C Q++H VT+VG+G T+ YW++KNSWG WG+AGYM + RD GLCG
Sbjct: 358 GVFSGPCTAQINHGVTVVGYGETDKAEKYWIVKNSWGPEWGEAGYMLMQRDAGVPTGLCG 417
Query: 339 IGTQSSYPL 347
I +SYP+
Sbjct: 418 IAMLASYPV 426
>gi|333069454|gb|AEF13978.1| chymopapain [Carica papaya]
Length = 352
Score = 273 bits (697), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 141/318 (44%), Positives = 198/318 (62%), Gaps = 16/318 (5%)
Query: 38 THEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFS 97
T + ++++ + WM +H + Y+ EK RF+IF++NL YI++ NK+ N +Y LG N F+
Sbjct: 39 TSIERLIQLFDSWMLKHNKIYESIDEKIYRFEIFRDNLMYIDETNKK-NNSYWLGLNGFA 97
Query: 98 DLTNDEFRALYTGY---KMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQ 154
DL+NDEF+ Y G H T+K+ +T+ P S+DWR K AVTP+K+Q
Sbjct: 98 DLSNDEFKKKYVGSVAEDFTGLEHFDNEDFTYKH----VTNYPQSIDWRAKGAVTPVKNQ 153
Query: 155 QECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQN 214
CG CWAFS +A VEG+ KI NL++LSEQ+LVDC N ++GC GG + +Y+ N
Sbjct: 154 GSCGSCWAFSTIATVEGVNKIVTGNLLELSEQELVDCDKN-SHGCKGGYQTTSLQYVADN 212
Query: 215 QGIATEDEYPYQAVQGTCSAAQKAAA-AKISNYEEVPSGDEQALLKAVSMQPVSIGIAAY 273
G+ T YPYQA C A K KI+ Y+ VPS E + L A++ QP+S+ + A
Sbjct: 213 -GVHTSKVYPYQAKAMQCRATDKPGPKVKITGYKRVPSNCETSFLGALANQPLSVLVEAG 271
Query: 274 TTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILR- 332
F+ YK G+F+G CGT+LDHAVT VG+GT+ DG NY +IKNSWG WG+ GYM++ R
Sbjct: 272 GKPFQLYKSGVFDGPCGTKLDHAVTAVGYGTS-DGKNYIIIKNSWGPNWGEKGYMRLKRQ 330
Query: 333 ---DEGLCGIGTQSSYPL 347
+G CG+ S YP
Sbjct: 331 SGNSQGTCGVYKSSYYPF 348
>gi|162459488|ref|NP_001105571.1| maize insect resistance1 precursor [Zea mays]
gi|5731354|gb|AAB70820.2| cysteine protease Mir1 [Zea mays]
Length = 398
Score = 273 bits (697), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 138/341 (40%), Positives = 206/341 (60%), Gaps = 28/341 (8%)
Query: 30 SQVVSSRSTHEQSVVEMHEKWMAQHGR--------------SYKDELEKEMRFKIFKENL 75
++V + ++ V M+E W ++HGR ++E ++ +R ++F++NL
Sbjct: 37 TRVPAPAERADEEVRRMYEAWKSKHGRGGSSNDDCDMAPGDDEQEEEDRRLRLEVFRDNL 96
Query: 76 EYIEKANKE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
YI+ N E G T++LG F+DLT +E+R G++ + S + +
Sbjct: 97 RYIDAHNAEADAGLHTFRLGLTPFADLTLEEYRGRVLGFRARGRRSGARYGSGYSVRG-- 154
Query: 133 MTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCS 192
D+P ++DWR AVT +KDQQ+CG CWAFSAVAA+EG+ I+ NL+ LSEQ+++DC
Sbjct: 155 -GDLPDAIDWRQLGAVTEVKDQQQCGGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCD 213
Query: 193 TNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKA--AAAKISNYEEVP 250
++GC GG ME AF ++I N GI TE +YP+ GTC A+++ A I EV
Sbjct: 214 AQ-DSGCDGGQMENAFRFVIGNGGIDTEADYPFIGTDGTCDASKEKNEKVATIDGLVEVA 272
Query: 251 SGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGAN 310
S +E AL +AV++QPVS+ I A F+ Y GIFNG CGT LDH VT VG+G +E G +
Sbjct: 273 SNNETALQEAVAIQPVSVAIDASGRAFQHYSSGIFNGPCGTSLDHGVTAVGYG-SESGKD 331
Query: 311 YWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYPL 347
YW++KNSW +WG+AGY+++ R+ G CGI +SYP+
Sbjct: 332 YWIVKNSWSASWGEAGYIRMRRNVPRPTGKCGIAMDASYPV 372
>gi|115479933|ref|NP_001063560.1| Os09g0497500 [Oryza sativa Japonica Group]
gi|113631793|dbj|BAF25474.1| Os09g0497500 [Oryza sativa Japonica Group]
gi|215704298|dbj|BAG93138.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 349
Score = 272 bits (696), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 137/324 (42%), Positives = 193/324 (59%), Gaps = 21/324 (6%)
Query: 43 VVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTND 102
+++ E+WM +HGR+Y D EK+ RF++++ N+E +E N N YKL N+F+DLTN+
Sbjct: 27 MLDRFEQWMIRHGRAYTDAGEKQRRFEVYRRNVELVETFNSMSN-GYKLADNKFADLTNE 85
Query: 103 EFRALYTGYK--MPSPSHRSTTSSTFKYQNLSMTDV-PTSLDWRDKKAVTPIKDQQECGC 159
EFRA G++ + P +T S+ S D+ P S+DWR K AV +K+Q +CG
Sbjct: 86 EFRAKMLGFRPHVTIPQISNTCSADIAMPGESSDDILPKSVDWRKKGAVVEVKNQGDCGS 145
Query: 160 CWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIAT 219
CWAFSAVAA+EGI +I L+ LSEQ+LVDC GCGGG M AFE+++ N G+ T
Sbjct: 146 CWAFSAVAAIEGINQIKNGELVSLSEQELVDCDDEA-VGCGGGYMSWAFEFVVGNHGLTT 204
Query: 220 EDEYPYQAVQGTCSAAQ-KAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFK 278
E YPY A G C AA+ +A I+ Y V E L +A + QPVS+ + + F+
Sbjct: 205 EASYPYHAANGACQAAKLNQSAVAIAGYRNVTPSSEPDLARAAAAQPVSVAVDGGSFMFQ 264
Query: 279 SYKEGIFNGVCGTQLDHAVTIVGFGTTEDGAN----------YWLIKNSWGDTWGDAGYM 328
Y G++ G C ++H VT+VG+G +E + YW++KNSWG WGDAGY+
Sbjct: 265 LYGSGVYTGPCTADVNHGVTVVGYGESEPKTDGGGAAKGGEKYWIVKNSWGAEWGDAGYI 324
Query: 329 KILRD-----EGLCGIGTQSSYPL 347
+ RD GLCGI SYP+
Sbjct: 325 LMQRDVAGLASGLCGIALLPSYPV 348
>gi|57118005|gb|AAW34134.1| cysteine protease gp2a [Zingiber officinale]
Length = 381
Score = 272 bits (695), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 141/333 (42%), Positives = 208/333 (62%), Gaps = 14/333 (4%)
Query: 23 ILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKAN 82
+L +S V RS E V ++ +W ++ + K E R ++FKENL+++++ N
Sbjct: 31 VLTLSKQGGAVPVRSDEE--VRMLYLEWRVKNHPAEKYLDLNEYRLEVFKENLQFVDEHN 88
Query: 83 KEGNR---TYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSS-TFKYQNLSMTDVPT 138
+R T+ LG NRF+DLTN+E+R + + S RS + + +Y+ D+P
Sbjct: 89 AAADRGEHTFLLGMNRFADLTNEEYRTRF--LRDFSRLRRSASGKISSRYRLREGDDLPD 146
Query: 139 SLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNG 198
S+DWR+ AV P+K+Q CG CWAFS VAAVEGI +I +LI LSEQQLVDC+T N+G
Sbjct: 147 SIDWRENGAVVPVKNQGGCGSCWAFSTVAAVEGINQIVTGDLISLSEQQLVDCTT-ANHG 205
Query: 199 CGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALL 258
C GG M AF++I+ N GI +E+ YPY+ G C++ A I +YE VPS +EQ+L
Sbjct: 206 CRGGWMNPAFQFIVNNGGINSEETYPYRGQNGICNSTVNAPVVSIDSYENVPSHNEQSLQ 265
Query: 259 KAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSW 318
KAV+ QPVS+ + A +F+ Y+ GIF G C +HA+T+VG+GT D ++W++KNSW
Sbjct: 266 KAVANQPVSVTMDAAGRDFQLYRSGIFTGSCNISANHALTVVGYGTEND-KDFWIVKNSW 324
Query: 319 GDTWGDAGYMKILRD----EGLCGIGTQSSYPL 347
G WG++GY++ R+ G CGI +SYP+
Sbjct: 325 GKNWGESGYIRAERNIENPNGKCGITRFASYPV 357
>gi|413943290|gb|AFW75939.1| maize insect resistance1 [Zea mays]
Length = 435
Score = 272 bits (695), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 141/332 (42%), Positives = 202/332 (60%), Gaps = 26/332 (7%)
Query: 40 EQSVVEMHEKWMAQHGR--SYKD-----------ELEKEMRFKIFKENLEYIEKANKE-- 84
++ V M+E W ++HGR S D E ++ +R ++F++NL YI+K N E
Sbjct: 77 DEEVRRMYEAWKSKHGRGGSSNDDCDMAPGDDEQEEDRRLRLEVFRDNLRYIDKHNAEAD 136
Query: 85 -GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD--VPTSLD 141
G T++LG F+DLT DE+R G++ + + Y+ +P ++D
Sbjct: 137 AGLHTFRLGLTPFADLTLDEYRGRVLGFRARARRSGARYGHGHGYRARPRGGDLLPDAID 196
Query: 142 WRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGG 201
WR AVT +KDQQ+CG CWAFSAVAA+EGI I+ NL+ LSEQ+++DC ++GC G
Sbjct: 197 WRQLGAVTEVKDQQQCGGCWAFSAVAAIEGINAIATGNLVSLSEQEIIDCDAQ-DSGCDG 255
Query: 202 GTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQK--AAAAKISNYEEVPSGDEQALLK 259
G ME AF ++I N GI TE +YP+ GTC A+++ A I EV S +E AL +
Sbjct: 256 GQMENAFRFVIGNGGIDTEADYPFIGTDGTCDASKENNEKVATIDGLVEVASNNETALQE 315
Query: 260 AVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWG 319
AV++QPVS+ I A F+ Y GIFNG CGT LDH VT VG+G +E G +YW++KNSW
Sbjct: 316 AVAIQPVSVAIDASGRAFQHYSSGIFNGPCGTSLDHGVTAVGYG-SESGKDYWIVKNSWS 374
Query: 320 DTWGDAGYMKILRD----EGLCGIGTQSSYPL 347
+WG+AGY+++ R+ G CGI +SYP+
Sbjct: 375 ASWGEAGYIRMRRNVPRPTGKCGIAMDASYPV 406
>gi|255635645|gb|ACU18172.1| unknown [Glycine max]
Length = 355
Score = 271 bits (693), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 139/344 (40%), Positives = 206/344 (59%), Gaps = 17/344 (4%)
Query: 18 MFIIIILLVSCASQ-----VVSSRSTH--------EQSVVEMHEKWMAQHGRSYKDELEK 64
M I+++ +V S ++S + H + V+ M E+W+ +H + Y EK
Sbjct: 3 MAIVLLFMVFAVSSALDMSIISHDNAHADRATRRTDDEVMSMFEEWLVKHDKVYNALGEK 62
Query: 65 EMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSS 124
E RF+IFK NL +I++ N NRTYKLG N F+DLTN E+RA+Y P T
Sbjct: 63 EKRFQIFKNNLRFIDERNSL-NRTYKLGLNVFADLTNAEYRAMYLRTWDDGPRLDLDTPP 121
Query: 125 TFKYQNLSMTDVPTSLDWRDKKAVTPIKDQ-QECGCCWAFSAVAAVEGITKISGANLIQL 183
+Y +P S+DWR + AVTP+K+Q C CWAF+AV AVE + KI +LI L
Sbjct: 122 RNRYVPRVGDTIPKSVDWRKEGAVTPVKNQGATCNSCWAFTAVGAVESLVKIKTGDLISL 181
Query: 184 SEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKI 243
SEQ++VDC+T+ + GCGGG ++ + YI +N GI+ E +YPY+ +G C + +K A I
Sbjct: 182 SEQEVVDCTTSSSRGCGGGDIQHGYIYIRKN-GISLEKDYPYRGDEGKCDSNKKNAIVTI 240
Query: 244 SNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFG 303
+ VP+ E+AL + ++ QPV++ I A EF+ Y G+F G CGT+L+HA+ +VG+G
Sbjct: 241 DGHGWVPTQLEEALKQGIANQPVAVPIPADDYEFQYYTSGVFKGKCGTELNHALLLVGYG 300
Query: 304 TTEDGANYWLIKNSWGDTWGDAGYMKILRDEGLCGIGTQSSYPL 347
+DG +YW+ KNS+ D WG+ GY++I R C G YP+
Sbjct: 301 AEKDG-DYWIAKNSYSDKWGENGYIRIQRKLSTCKFGNGGYYPI 343
>gi|194352758|emb|CAQ00107.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
Length = 457
Score = 271 bits (693), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 135/309 (43%), Positives = 185/309 (59%), Gaps = 12/309 (3%)
Query: 48 EKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANK------EGNRTYKLGTNRFSDLTN 101
E W A+HG++Y E+ R F EN ++ N G +Y L N F+DLT+
Sbjct: 40 EAWCAEHGKAYATPGERAARLAAFAENAAFVAAHNDAVASSGPGGPSYTLALNAFADLTH 99
Query: 102 DEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCW 161
DEFRA G P S + + VP +LDWR AVT +KDQ CG CW
Sbjct: 100 DEFRAARLGRLAVGPGPLGAPSPSDGGFEGRVGAVPDALDWRQSGAVTKVKDQGSCGACW 159
Query: 162 AFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATED 221
+FSA A+EGI KI+ +L+ LSEQ+L+DC + N GCGGG M A++++I+N GI TED
Sbjct: 160 SFSATGAMEGINKITTGSLLSLSEQELIDCDRSYNTGCGGGLMTYAYKFVIKNGGIDTED 219
Query: 222 EYPYQAVQGTCSAAQ-KAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSY 280
+YP++ GTC+ + K I Y+EVPS E LL+AV+ QP+S+GI F+ Y
Sbjct: 220 DYPFREADGTCNKNKLKKHVVTIDGYKEVPSSKEDLLLQAVAQQPISVGICGSARAFQLY 279
Query: 281 KEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD----EGL 336
+GIF+G C T LDHAV IVG+G +E G +YW++KNSWG+ WG GYM + R+ G+
Sbjct: 280 SQGIFDGPCPTSLDHAVLIVGYG-SEGGKDYWIVKNSWGERWGMKGYMHMHRNTGSSSGI 338
Query: 337 CGIGTQSSY 345
CGI +S+
Sbjct: 339 CGINMMASF 347
>gi|386648112|gb|AFJ15103.1| mexicain-like cystein protease, partial [Jacaratia mexicana]
Length = 348
Score = 271 bits (693), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 144/315 (45%), Positives = 196/315 (62%), Gaps = 17/315 (5%)
Query: 38 THEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFS 97
T + ++ + E WM +H R Y + EK RF+IFK+NL YI++ NK+ N +Y LG N F
Sbjct: 39 TSTERLIRLFESWMLKHDRVYNNIEEKIHRFEIFKDNLMYIDETNKK-NNSYWLGLNEFV 97
Query: 98 DLTNDEFRALYTG-YKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQE 156
DLT+DEF+ Y G + + F Y+++ D P S+DWRDK AVTP+K
Sbjct: 98 DLTHDEFKEKYVGSIGEDFVTIEQSNDEEFPYKHV--VDYPESIDWRDKGAVTPVK-PNP 154
Query: 157 CGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQG 216
CG CWAFS VA VEGI KI LI LSEQ+L+DC ++GC GG + +Y++ N G
Sbjct: 155 CGSCWAFSTVATVEGINKIVTGKLISLSEQELLDCDRR-SHGCKGGYQTTSLQYVVDN-G 212
Query: 217 IATEDEYPYQAVQGTCSAAQKAAA-AKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTT 275
+ TE EYPY+ QG C A +K +I+ Y+ VP+ DE +L++A++ QPVS+ + +
Sbjct: 213 VHTEKEYPYEKKQGKCRAKEKKGTKVQITGYKRVPANDEISLIQAIANQPVSVLLESKGR 272
Query: 276 EFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILR--- 332
F+ YK GIFNG CGT+LDHAVT +G+G T Y LIKNSWG WG+ GY+KI R
Sbjct: 273 AFQLYKGGIFNGPCGTKLDHAVTAIGYGKT-----YILIKNSWGPNWGEKGYLKIKRASG 327
Query: 333 -DEGLCGIGTQSSYP 346
EG CG+ S +P
Sbjct: 328 KSEGTCGVYKSSYFP 342
>gi|641905|gb|AAC49406.1| cysteine proteinase [Zinnia violacea]
Length = 342
Score = 271 bits (693), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 129/296 (43%), Positives = 195/296 (65%), Gaps = 6/296 (2%)
Query: 41 QSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLT 100
V+ + E + +H + Y+ EK RF+IF +NL++I++ NK+ + Y LG N F+DLT
Sbjct: 43 HKVIHLFESSLVKHSKIYESFDEKLHRFEIFMDNLKHIDETNKKVS-NYWLGLNEFADLT 101
Query: 101 NDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCC 160
++EF+ + G+K + + F+Y++ D+P S+DWR K AV+P+K+Q +CG C
Sbjct: 102 HEEFKNKFLGFKGELAERKDESIEQFRYRDF--VDLPKSVDWRKKGAVSPVKNQGQCGSC 159
Query: 161 WAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATE 220
WAFS VAAVEGI +I NL LSEQ+L+DC T NNGC GG M+ AF Y+ +N G+ E
Sbjct: 160 WAFSTVAAVEGINQIVTGNLTVLSEQELIDCDTTFNNGCNGGLMDYAFAYVTRN-GLHKE 218
Query: 221 DEYPYQAVQGTCSAAQKAA-AAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKS 279
+EYPY +GTC + A+ IS Y +VP +E + LKA++ QP+S+ I A +F+
Sbjct: 219 EEYPYIMSEGTCDEKRDASEKVTISGYHDVPRNNEDSFLKALANQPISVAIEASGRDFQF 278
Query: 280 YKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRDEG 335
Y G+F+G CGT+LDH V VG+GT++ G +Y +++NSWG WG+ GY+++ R+ G
Sbjct: 279 YSGGVFDGHCGTELDHGVAAVGYGTSK-GLDYVIVRNSWGPKWGEKGYIRMKRNTG 333
>gi|226499806|ref|NP_001151335.1| cysteine protease 1 [Zea mays]
gi|195645896|gb|ACG42416.1| cysteine protease 1 precursor [Zea mays]
Length = 258
Score = 271 bits (692), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 137/261 (52%), Positives = 181/261 (69%), Gaps = 12/261 (4%)
Query: 94 NRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPT---SLDWRDKKAVTP 150
N F+D+TNDEF A+YTG + P P+ + FKY N++++D ++DWR K AVT
Sbjct: 4 NEFADMTNDEFMAMYTGLR-PVPAGAKKMAG-FKYGNVTLSDADDDQQTVDWRQKGAVTG 61
Query: 151 IKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEY 210
IKDQ++CGCCWAF+AVAAVEGI +I+ NL+ LSEQQ++DC T+GNNGC GG ++ AF+Y
Sbjct: 62 IKDQRQCGCCWAFAAVAAVEGIHQITTGNLVSLSEQQVLDCDTDGNNGCNGGYIDNAFQY 121
Query: 211 IIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGI 270
I+ N G+ATED YPY A Q C + Q AA IS Y++VPSGDE AL AV+ QPVS+ I
Sbjct: 122 IVGNGGLATEDAYPYTAAQAMCQSVQPVAA--ISGYQDVPSGDEAALAAAVANQPVSVAI 179
Query: 271 AAYTTEFKSYKEGIFNGV-CGT--QLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGY 327
A+ F+ Y G+ C T L+HAVT VG+GT EDG YWL+KN WG WG+ GY
Sbjct: 180 DAHN--FQLYGGGVMTAASCSTPPNLNHAVTAVGYGTAEDGTPYWLLKNQWGQNWGEGGY 237
Query: 328 MKILRDEGLCGIGTQSSYPLA 348
+++ R CG+ Q+SYP+A
Sbjct: 238 LRLERGANACGVAQQASYPVA 258
>gi|66735056|gb|AAY53767.1| cysteine protease [Saprolegnia parasitica]
Length = 523
Score = 270 bits (691), Expect = 5e-70, Method: Compositional matrix adjust.
Identities = 131/304 (43%), Positives = 190/304 (62%), Gaps = 8/304 (2%)
Query: 50 WMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYT 109
WM + + LE RF++F N + IE NK+ + ++ +G N +S LT DEF+ L T
Sbjct: 31 WMKKFAVKL-NPLEWVHRFEVFILNDQRIEAHNKDASSSFTMGHNEYSHLTFDEFKKLRT 89
Query: 110 GYKMPSPSH-RSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAA 168
G ++ SPS+ +S ++MTDVP +DW ++ VTP+K+Q CG CWAFS A
Sbjct: 90 GLRV-SPSYIQSRAKYALMAPAVNMTDVPNEMDWVEQGGVTPVKNQGMCGSCWAFSTTGA 148
Query: 169 VEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAV 228
+EG +S L+ +SEQ+LVDC NG+ GC GG M+ AF+++ ++G+ E++YPY A
Sbjct: 149 IEGAAFVSSKQLVSVSEQELVDCDHNGDMGCNGGLMDNAFKWVKTHKGLCKEEDYPYHAK 208
Query: 229 QGTCSAAQKAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGV 288
+GTC+ + K++ + +VP+ DEQAL AV+ QPVS+ I A EF+ YK G+F+
Sbjct: 209 EGTCALKKCKPVTKVTAFHDVPANDEQALKAAVAKQPVSVAIEADQPEFQFYKSGVFDKS 268
Query: 289 CGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILR----DEGLCGIGTQSS 344
CGT+LDH V +VG+G E G YW +KNSWG WGD GY+K+ R + G CG+ S
Sbjct: 269 CGTKLDHGVLVVGYG-EEGGKKYWKVKNSWGADWGDKGYIKLAREFGPETGQCGVAMVPS 327
Query: 345 YPLA 348
YP A
Sbjct: 328 YPTA 331
>gi|24653514|ref|NP_523735.2| cysteine proteinase-1, isoform C [Drosophila melanogaster]
gi|118572624|sp|Q95029.2|CATL_DROME RecName: Full=Cathepsin L; AltName: Full=Cysteine proteinase 1;
Contains: RecName: Full=Cathepsin L heavy chain;
Contains: RecName: Full=Cathepsin L light chain; Flags:
Precursor
gi|21627209|gb|AAM68565.1| cysteine proteinase-1, isoform C [Drosophila melanogaster]
Length = 371
Score = 270 bits (691), Expect = 6e-70, Method: Compositional matrix adjust.
Identities = 140/316 (44%), Positives = 190/316 (60%), Gaps = 11/316 (3%)
Query: 43 VVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANK---EGNRTYKLGTNRFSDL 99
V+E + +H ++Y+DE E+ R KIF EN I K N+ EG ++KL N+++DL
Sbjct: 55 VMEEWHTFKLEHRKNYQDETEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADL 114
Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFK---YQNLSMTDVPTSLDWRDKKAVTPIKDQQE 156
+ EFR L G+ +FK + + + +P S+DWR K AVT +KDQ
Sbjct: 115 LHHEFRQLMNGFNYTLHKQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQGH 174
Query: 157 CGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTN-GNNGCGGGTMEKAFEYIIQNQ 215
CG CWAFS+ A+EG L+ LSEQ LVDCST GNNGC GG M+ AF YI N
Sbjct: 175 CGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNG 234
Query: 216 GIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAV-SMQPVSIGIAAYT 274
GI TE YPY+A+ +C + A + ++P GDE+ + +AV ++ PVS+ I A
Sbjct: 235 GIDTEKSYPYEAIDDSCHFNKGTVGATDRGFTDIPQGDEKKMAEAVATVGPVSVAIDASH 294
Query: 275 TEFKSYKEGIFN-GVCGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILR 332
F+ Y EG++N C Q LDH V +VGFGT E G +YWL+KNSWG TWGD G++K+LR
Sbjct: 295 ESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESGEDYWLVKNSWGTTWGDKGFIKMLR 354
Query: 333 D-EGLCGIGTQSSYPL 347
+ E CGI + SSYPL
Sbjct: 355 NKENQCGIASASSYPL 370
>gi|255522980|gb|ACU12382.1| RE21773p [Drosophila melanogaster]
Length = 375
Score = 270 bits (691), Expect = 6e-70, Method: Compositional matrix adjust.
Identities = 140/316 (44%), Positives = 190/316 (60%), Gaps = 11/316 (3%)
Query: 43 VVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANK---EGNRTYKLGTNRFSDL 99
V+E + +H ++Y+DE E+ R KIF EN I K N+ EG ++KL N+++DL
Sbjct: 59 VMEEWHTFKLEHRKNYQDETEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADL 118
Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFK---YQNLSMTDVPTSLDWRDKKAVTPIKDQQE 156
+ EFR L G+ +FK + + + +P S+DWR K AVT +KDQ
Sbjct: 119 LHHEFRQLMNGFNYTLHKQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQGH 178
Query: 157 CGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTN-GNNGCGGGTMEKAFEYIIQNQ 215
CG CWAFS+ A+EG L+ LSEQ LVDCST GNNGC GG M+ AF YI N
Sbjct: 179 CGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNG 238
Query: 216 GIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAV-SMQPVSIGIAAYT 274
GI TE YPY+A+ +C + A + ++P GDE+ + +AV ++ PVS+ I A
Sbjct: 239 GIDTEKSYPYEAIDDSCHFNKGTVGATDRGFTDIPQGDEKKMAEAVATVGPVSVAIDASH 298
Query: 275 TEFKSYKEGIFN-GVCGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILR 332
F+ Y EG++N C Q LDH V +VGFGT E G +YWL+KNSWG TWGD G++K+LR
Sbjct: 299 ESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESGEDYWLVKNSWGTTWGDKGFIKMLR 358
Query: 333 D-EGLCGIGTQSSYPL 347
+ E CGI + SSYPL
Sbjct: 359 NKENQCGIASASSYPL 374
>gi|357130490|ref|XP_003566881.1| PREDICTED: actinidain-like [Brachypodium distachyon]
Length = 350
Score = 270 bits (690), Expect = 7e-70, Method: Compositional matrix adjust.
Identities = 136/330 (41%), Positives = 194/330 (58%), Gaps = 7/330 (2%)
Query: 21 IIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEK 80
+++L+ + V+++ +V HE+WMA+ GR Y D EK R +F N Y++
Sbjct: 14 LLVLVATAVFHAVAAQGEAGLTVAARHEQWMAKFGRVYTDANEKARRQAVFGANARYVDA 73
Query: 81 ANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSL 140
N+ GNRTY LG N FSDLT++EF + GY+ P + + L+ ++P S
Sbjct: 74 VNRAGNRTYTLGLNEFSDLTDNEFAKTHLGYREFRPETANISKGVDPGYGLA-GNIPKSF 132
Query: 141 DWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCG 200
DWR K AVT +K Q CGCCWAF+AVAA EG+ KI+ LI +SEQQ++DC+T GNN C
Sbjct: 133 DWRTKGAVTEVKSQGGCGCCWAFAAVAATEGLVKIAKGTLISMSEQQVLDCTT-GNNTCK 191
Query: 201 GGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSA-AQKAAAAKISNYEEVP-SGDEQALL 258
GG M A Y+ + G+ TE++Y Y A +G C A + + E +P G+E L
Sbjct: 192 GGYMNDALSYVFASGGLQTEEDYEYNAEKGACRRDVTPNPATSVGHAEYMPLDGNEFLLQ 251
Query: 259 KAVSMQPVSIGIAAYTTEFKSYKEGIFNG--VCGTQLDHAVTIVGFGTTEDGAN-YWLIK 315
K V+ QPV + + AY T+FK+Y G+F G CG LDH T+VG+G + G YWL+K
Sbjct: 252 KLVARQPVVVAVEAYGTDFKNYGGGVFTGSPSCGQNLDHFFTVVGYGFADGGKQMYWLVK 311
Query: 316 NSWGDTWGDAGYMKILRDEGLCGIGTQSSY 345
N WG +WG++GYM+I R G ++Y
Sbjct: 312 NQWGTSWGESGYMRIARGSSARNCGMTNNY 341
>gi|356517384|ref|XP_003527367.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 332
Score = 270 bits (690), Expect = 8e-70, Method: Compositional matrix adjust.
Identities = 151/339 (44%), Positives = 212/339 (62%), Gaps = 27/339 (7%)
Query: 19 FIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYI 78
F +++ + A QV + R+ + S+ E H + M ++ + KD + +FKEN+ YI
Sbjct: 12 FAMLLSMAFLAFQV-TCRTLQDASMYESHGQRMTRYSKVDKDPPDX-----VFKENVNYI 65
Query: 79 EKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPT 138
E N ++ YK N+F+ + + G+ M S R TT FK++N++ T P+
Sbjct: 66 EACNNAADKPYKRDINQFAP------KKRFKGH-MCSSIIRITT---FKFENVTAT--PS 113
Query: 139 SLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLS-EQQLVDCSTNG-N 196
++D R K AVTPIKDQ +CGC WA SAVAA EGI + LI LS EQ+LVDC T G +
Sbjct: 114 TVDCRQKVAVTPIKDQGQCGCFWALSAVAATEGIHALXAGKLILLSSEQELVDCDTKGVD 173
Query: 197 NGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSA--AQKAAAAKISNYEEVPSGDE 254
C GG M+ AF++IIQN G+ TE YPY+ V G C+A A K AA I+ YE+VP+ +E
Sbjct: 174 QDCQGGLMDDAFKFIIQNHGLNTEANYPYKGVDGKCNAYEADKNAATIITGYEDVPANNE 233
Query: 255 QALL-KAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWL 313
+A L KAV+ PVS+ I A ++F+ YK G+F G CGT+LDH VT VG+G ++DG YWL
Sbjct: 234 KAHLQKAVANNPVSVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVSDDGTEYWL 293
Query: 314 IKNSWGDTWGDAGYMKILR----DEGLCGIGTQSSYPLA 348
+KNS G WG+ GY+++ R +E LCGI Q+SYP A
Sbjct: 294 VKNSRGTEWGEEGYIRMQRGVDSEEALCGIAVQASYPSA 332
>gi|24653516|ref|NP_725347.1| cysteine proteinase-1, isoform A [Drosophila melanogaster]
gi|24653518|ref|NP_725348.1| cysteine proteinase-1, isoform B [Drosophila melanogaster]
gi|1658527|gb|AAB18345.1| cysteine proteinase 1 [Drosophila melanogaster]
gi|2305221|gb|AAB65749.1| cysteine proteinase-1 [Drosophila melanogaster]
gi|7303249|gb|AAF58311.1| cysteine proteinase-1, isoform A [Drosophila melanogaster]
gi|21627210|gb|AAM68566.1| cysteine proteinase-1, isoform B [Drosophila melanogaster]
gi|54650754|gb|AAV36956.1| LP06554p [Drosophila melanogaster]
gi|220951982|gb|ACL88534.1| Cp1-PA [synthetic construct]
Length = 341
Score = 270 bits (690), Expect = 8e-70, Method: Compositional matrix adjust.
Identities = 140/316 (44%), Positives = 190/316 (60%), Gaps = 11/316 (3%)
Query: 43 VVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANK---EGNRTYKLGTNRFSDL 99
V+E + +H ++Y+DE E+ R KIF EN I K N+ EG ++KL N+++DL
Sbjct: 25 VMEEWHTFKLEHRKNYQDETEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADL 84
Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFK---YQNLSMTDVPTSLDWRDKKAVTPIKDQQE 156
+ EFR L G+ +FK + + + +P S+DWR K AVT +KDQ
Sbjct: 85 LHHEFRQLMNGFNYTLHKQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQGH 144
Query: 157 CGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTN-GNNGCGGGTMEKAFEYIIQNQ 215
CG CWAFS+ A+EG L+ LSEQ LVDCST GNNGC GG M+ AF YI N
Sbjct: 145 CGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNG 204
Query: 216 GIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAV-SMQPVSIGIAAYT 274
GI TE YPY+A+ +C + A + ++P GDE+ + +AV ++ PVS+ I A
Sbjct: 205 GIDTEKSYPYEAIDDSCHFNKGTVGATDRGFTDIPQGDEKKMAEAVATVGPVSVAIDASH 264
Query: 275 TEFKSYKEGIFN-GVCGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILR 332
F+ Y EG++N C Q LDH V +VGFGT E G +YWL+KNSWG TWGD G++K+LR
Sbjct: 265 ESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESGEDYWLVKNSWGTTWGDKGFIKMLR 324
Query: 333 D-EGLCGIGTQSSYPL 347
+ E CGI + SSYPL
Sbjct: 325 NKENQCGIASASSYPL 340
>gi|242094000|ref|XP_002437490.1| hypothetical protein SORBIDRAFT_10g028000 [Sorghum bicolor]
gi|241915713|gb|EER88857.1| hypothetical protein SORBIDRAFT_10g028000 [Sorghum bicolor]
Length = 372
Score = 270 bits (689), Expect = 9e-70, Method: Compositional matrix adjust.
Identities = 132/316 (41%), Positives = 194/316 (61%), Gaps = 12/316 (3%)
Query: 40 EQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKE---GNRTYKLGTNRF 96
+ V M+E W ++HG + + +R ++F++NL YI+ N E G T++LG F
Sbjct: 45 DDEVRRMYEAWKSEHGHGHGSD--DRLRLEVFRDNLRYIDAHNAEADAGLHTFRLGLTPF 102
Query: 97 SDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQE 156
+DLT +E+R G++ S + D+P ++DWR+ AVT +K+Q++
Sbjct: 103 ADLTLEEYRGRALGFRARRGGASRVGSGSSYRPRPRGGDLPDAIDWRELGAVTGVKNQEQ 162
Query: 157 CGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQG 216
CG CWAFSAVAA+EGI +I NL+ LSEQ+++DC T + GC GG M+ AF+++I N G
Sbjct: 163 CGGCWAFSAVAAIEGINEIVTGNLVSLSEQEIIDCDTQ-DGGCNGGEMQNAFQFVINNGG 221
Query: 217 IATEDEYPYQAVQGTCSAAQ-KAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTT 275
I TE +YPY C A + I + V + +E AL +AV+ QPVS+ I A
Sbjct: 222 IDTEADYPYLGTDAACDANRVNERVVTIDGFVSVATENETALQEAVANQPVSVAIDASGR 281
Query: 276 EFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD-- 333
+F+ Y GIFNG CGTQLDH VT VG+G +E+G +YW++KNSW +WG+AGY++I R+
Sbjct: 282 KFQHYTSGIFNGPCGTQLDHGVTAVGYG-SENGKDYWIVKNSWSSSWGEAGYIRIRRNVA 340
Query: 334 --EGLCGIGTQSSYPL 347
G CGI +SYP+
Sbjct: 341 AATGKCGIAMDASYPV 356
>gi|307110445|gb|EFN58681.1| hypothetical protein CHLNCDRAFT_56822 [Chlorella variabilis]
Length = 466
Score = 270 bits (689), Expect = 9e-70, Method: Compositional matrix adjust.
Identities = 140/347 (40%), Positives = 203/347 (58%), Gaps = 21/347 (6%)
Query: 18 MFIIIILLVSCASQVVSSRSTHE----------QSVVEMHEKWMAQHGRSYKDELEKEMR 67
M + +LLV+C+ V++ E +S E + W+ R+Y E E R
Sbjct: 1 MRLSCVLLVACSCLAVAAGFPFENHRLFIQQAVESPREAFDFWVQTLKRAYASAEEYERR 60
Query: 68 FKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFK 127
F ++ +NL ++ + N G+ ++ L ++DL+ DE+R+ GY R ++ F
Sbjct: 61 FDVWLDNLRFVHEYNA-GHTSHWLSMGVYADLSQDEYRSKALGYNADLHEERPLRAAPFL 119
Query: 128 YQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQ 187
Y+ T P +DW K AVTP+K+Q CG CWAFS AVEG + I+ L LSEQ
Sbjct: 120 YEG---TVPPKEVDWVAKGAVTPVKNQLLCGSCWAFSTTGAVEGASAIATGKLASLSEQM 176
Query: 188 LVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQ-KAAAAKISNY 246
LVDC +NGC GG M+ AFE+I++N GI TED+YPY A +G C + + I +Y
Sbjct: 177 LVDCDRERDNGCHGGLMDFAFEFIMKNGGIDTEDDYPYTAEEGMCQDNKMRRHVVTIDDY 236
Query: 247 EEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTE 306
++VP DE AL+KAV+ QPVS+ I A F+ Y G+F+ CGT LDH V +VG+GT
Sbjct: 237 QDVPPNDEHALMKAVANQPVSVAIEADQRAFQLYGGGVFDAECGTALDHGVLVVGYGTAS 296
Query: 307 DGAN---YWLIKNSWGDTWGDAGYMKILR---DEGLCGIGTQSSYPL 347
+G + YWL+KNSWG WGD GY+++LR +EG CG+ Q+S+P+
Sbjct: 297 NGTHHLPYWLVKNSWGAEWGDKGYIRLLRNLGEEGQCGVAMQASFPI 343
>gi|312381833|gb|EFR27483.1| hypothetical protein AND_05794 [Anopheles darlingi]
Length = 344
Score = 269 bits (688), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 146/352 (41%), Positives = 207/352 (58%), Gaps = 29/352 (8%)
Query: 16 IPMFIIIILLVSCASQVVSSRSTHEQSVVEM-HEKWMA---QHGRSYKDELEKEMRFKIF 71
+ +F++++ ++ A+ V S+ + E+W A QH + Y E E+ +R KI+
Sbjct: 1 MKLFLLLVSFLAAANAV---------SIFNLVKEEWNAFKLQHRKKYDSESEERIRMKIY 51
Query: 72 KENLEYIEKANKE---GNRTYKLGTNRFSDLTNDEF--------RALYTGYKMPSPSHRS 120
+N I K N+ G ++L N+++DL ++EF R+ G K+
Sbjct: 52 VQNKHKIAKHNQRYDLGQEKFRLRVNKYADLLHEEFVHTLNGFNRSAAAGSKLLGREQLM 111
Query: 121 TTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANL 180
T + + DVPT++DWR+K AVTP+KDQ CG CW+FSA A+EG L
Sbjct: 112 TIEEPITWIEPANVDVPTTIDWREKGAVTPVKDQGHCGSCWSFSATGALEGQHFRKTGKL 171
Query: 181 IQLSEQQLVDCSTN-GNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAA 239
+ LSEQ LVDCST GNNGC GG M+ AF+Y+ N+GI TE YPY+A+ C KA
Sbjct: 172 VSLSEQNLVDCSTKYGNNGCNGGLMDNAFQYVKDNKGIDTEKAYPYEAIDDECHYNPKAI 231
Query: 240 AAKISNYEEVPSGDEQALLKAV-SMQPVSIGIAAYTTEFKSYKEGI-FNGVCGT-QLDHA 296
A + ++P GDE+AL KA+ ++ PVS+ I A F+ Y EG+ + C + QLDH
Sbjct: 232 GATDKGFVDIPQGDEKALKKALATVGPVSVAIDASHESFQFYSEGVYYEPQCDSEQLDHG 291
Query: 297 VTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD-EGLCGIGTQSSYPL 347
V VG+GTTEDG +YWL+KNSWG TWGD GY+K+ R+ E CGI T +SYPL
Sbjct: 292 VLAVGYGTTEDGEDYWLVKNSWGTTWGDQGYVKMARNRENHCGIATTASYPL 343
>gi|4469155|emb|CAB38315.1| chymopapain isoform III [Carica papaya]
Length = 361
Score = 269 bits (688), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 141/318 (44%), Positives = 197/318 (61%), Gaps = 16/318 (5%)
Query: 38 THEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFS 97
T + ++++ + WM +H + Y+ EK RF+IF++NL YI++ NK+ N +Y LG N F+
Sbjct: 39 TSIERLIQLFDSWMLKHNKIYESIDEKIYRFEIFRDNLMYIDETNKK-NNSYWLGLNGFA 97
Query: 98 DLTNDEFRALYTGY---KMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQ 154
DL+NDEF+ Y G+ H T+K+ +T+ P S+DWR K AVTP+K+Q
Sbjct: 98 DLSNDEFKKKYVGFVAEDFTGLEHFDNEDFTYKH----VTNYPQSIDWRAKGAVTPVKNQ 153
Query: 155 QECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQN 214
CG CWAFS +A VEGI KI NL++LSEQ+LVDC + + GC GG + +Y + N
Sbjct: 154 GACGSCWAFSTIATVEGINKIVTGNLLELSEQELVDCDKH-SYGCKGGYQTTSLQY-VAN 211
Query: 215 QGIATEDEYPYQAVQGTCSAAQKAAA-AKISNYEEVPSGDEQALLKAVSMQPVSIGIAAY 273
G+ T YP QA Q C A K KI+ Y+ VPS E + L A++ QP+S + A
Sbjct: 212 NGVHTSKVYPCQAKQYKCRATDKPGPKVKITGYKRVPSNCETSFLGALANQPLSFLVEAG 271
Query: 274 TTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILR- 332
F+ YK G+F+G CGT+LDHAVT VG+GT+ DG NY +IKNSWG WG+ GYM++ R
Sbjct: 272 GKPFQLYKSGVFDGPCGTKLDHAVTAVGYGTS-DGKNYIIIKNSWGPNWGEKGYMRLKRQ 330
Query: 333 ---DEGLCGIGTQSSYPL 347
+G CG+ S YP
Sbjct: 331 SGNSQGTCGVYKSSYYPF 348
>gi|195484843|ref|XP_002090843.1| GE12574 [Drosophila yakuba]
gi|194176944|gb|EDW90555.1| GE12574 [Drosophila yakuba]
Length = 341
Score = 269 bits (687), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 139/316 (43%), Positives = 191/316 (60%), Gaps = 11/316 (3%)
Query: 43 VVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANK---EGNRTYKLGTNRFSDL 99
V+E + +H ++Y+D+ E+ R KIF EN I K N+ EG ++KL N+++DL
Sbjct: 25 VMEEWHTFKLEHRKNYQDDTEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADL 84
Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFK---YQNLSMTDVPTSLDWRDKKAVTPIKDQQE 156
+ EFR L G+ T +FK + + + +P S+DWR K AVT +KDQ
Sbjct: 85 LHHEFRQLMNGFNYTLHKQLRATDDSFKGVTFISPAHVTLPKSVDWRSKGAVTAVKDQGH 144
Query: 157 CGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTN-GNNGCGGGTMEKAFEYIIQNQ 215
CG CWAFS+ A+EG L+ LSEQ LVDCST GNNGC GG M+ AF YI N
Sbjct: 145 CGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNG 204
Query: 216 GIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAV-SMQPVSIGIAAYT 274
GI TE YPY+A+ +C + A + ++P GDE+ + +AV ++ PVS+ I A
Sbjct: 205 GIDTEKSYPYEAIDDSCHFNKGTIGATDRGFTDIPQGDEKKMAEAVATVGPVSVAIDASH 264
Query: 275 TEFKSYKEGIFN-GVCGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILR 332
F+ Y EG++N C Q LDH V +VGFGT E G +YWL+KNSWG TWGD G++K+LR
Sbjct: 265 ESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESGDDYWLVKNSWGTTWGDKGFIKMLR 324
Query: 333 D-EGLCGIGTQSSYPL 347
+ + CGI + SSYPL
Sbjct: 325 NKDNQCGIASASSYPL 340
>gi|260516654|gb|ACX43954.1| cysteine protease 1 [Brachiaria hybrid cultivar]
gi|260516656|gb|ACX43955.1| cysteine protease 1 [Brachiaria hybrid cultivar]
gi|260516658|gb|ACX43956.1| cysteine protease 1 [Brachiaria hybrid cultivar]
gi|260516660|gb|ACX43957.1| cysteine protease 1 [Brachiaria hybrid cultivar]
gi|260516662|gb|ACX43958.1| cysteine protease 2 [Brachiaria hybrid cultivar]
gi|260516664|gb|ACX43959.1| cysteine protease 2 [Brachiaria hybrid cultivar]
gi|260516666|gb|ACX43960.1| cysteine protease 2 [Brachiaria hybrid cultivar]
gi|260516668|gb|ACX43961.1| cysteine protease 2 [Brachiaria hybrid cultivar]
gi|260516670|gb|ACX43962.1| cysteine protease 2 [Brachiaria hybrid cultivar]
Length = 338
Score = 269 bits (687), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 146/322 (45%), Positives = 195/322 (60%), Gaps = 15/322 (4%)
Query: 30 SQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTY 89
S + S E + +M +M Q+ ++Y E RF FK N+E I N N +Y
Sbjct: 25 SALFSEEVPSEVMLQDMFTAFMKQYSKAY-SHAEFSSRFNQFKANVETIRLHNTLANASY 83
Query: 90 KLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVT 149
+G N F+DL+ +EF+ Y GYK R S +Q + PTS+DWR AVT
Sbjct: 84 TMGLNEFADLSFEEFKGKYFGYKHV---EREFARSNNLHQEVEA--APTSIDWRTSNAVT 138
Query: 150 PIKDQQECGCCWAFSAVAAVEGITKISGAN-LIQLSEQQLVDCSTN-GNNGCGGGTMEKA 207
PIKDQ +CG CWAFSA ++EG + G + L LSEQQLVDCST+ GN GC GG M+ A
Sbjct: 139 PIKDQGQCGSCWAFSATGSIEGAWVLQGKHTLTSLSEQQLVDCSTSYGNAGCNGGLMDYA 198
Query: 208 FEYIIQNQGIATEDEYPYQAVQGTCSAAQKAA--AAKISNYEEVPSGDEQALLKAV-SMQ 264
FEYII N+GI E YPY+ V G C QK+ IS Y++V SGDE +LL AV ++
Sbjct: 199 FEYIIANKGICAESAYPYKGVGGLC---QKSCTKVVTISGYKDVASGDEASLLNAVGTVG 255
Query: 265 PVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGD 324
PVS+ I A F+ Y G+F+G CG LDH V VG+GTT +YW++KNSWG +WG+
Sbjct: 256 PVSVAIEADQAGFQFYSSGVFSGTCGHNLDHGVLAVGYGTT-GSQDYWIVKNSWGTSWGE 314
Query: 325 AGYMKILRDEGLCGIGTQSSYP 346
+GY++++R++ CGI Q SYP
Sbjct: 315 SGYIRMIRNKNQCGIAIQPSYP 336
>gi|326494040|dbj|BAJ85482.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 355
Score = 268 bits (686), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 135/312 (43%), Positives = 187/312 (59%), Gaps = 14/312 (4%)
Query: 47 HEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRA 106
HE+WMA++GR Y D EK R ++F N +I+ N+ GNRTY LG N FSDLTN+EF
Sbjct: 41 HERWMAKYGRVYADAAEKLRRQEVFAANARHIDAVNRAGNRTYTLGLNHFSDLTNEEFAQ 100
Query: 107 LYTGYK-MPSPS----HRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCW 161
+ GY+ P P S+ ++ + + P S+DWR + AVTP+K Q CG CW
Sbjct: 101 THLGYRHQPGPGGLRPEDSSPAAAVNVTDAQLQSTPDSVDWRARGAVTPVKHQGHCGSCW 160
Query: 162 AFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATED 221
AF+AVAA EG+ +I+ NLI +SEQQ++DC T G + C G + A YI + G+ TE
Sbjct: 161 AFAAVAATEGLVQIATGNLISMSEQQVLDC-TGGTSSCKSGYVNAALTYITASGGLQTEA 219
Query: 222 EYPYQAVQGTC---SAAQKAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFK 278
Y Y A QG C A+ +AAA + + +GDE AL V+ QPV++ + A +F
Sbjct: 220 AYAYSAEQGACRSGGASPNSAAAVGVHRSAMLNGDEGALQVLVAGQPVAVAVEA-EPDFH 278
Query: 279 SYKEGIFNG--VCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRDEGL 336
YK G++ G CG +L HAVT+VG+G DG YW++KN WG WG+ GYM++ R G
Sbjct: 279 HYKSGVYVGSPSCGQKLHHAVTVVGYGADGDGQGYWVVKNQWGAGWGEVGYMRLTRGNGG 338
Query: 337 --CGIGTQSSYP 346
CG+ T + YP
Sbjct: 339 NNCGMATHAYYP 350
>gi|194883222|ref|XP_001975702.1| GG20414 [Drosophila erecta]
gi|190658889|gb|EDV56102.1| GG20414 [Drosophila erecta]
Length = 341
Score = 268 bits (685), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 138/316 (43%), Positives = 193/316 (61%), Gaps = 11/316 (3%)
Query: 43 VVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANK---EGNRTYKLGTNRFSDL 99
V+E + +H ++Y+D+ E+ R KIF EN I K N+ EG ++KL N+++DL
Sbjct: 25 VMEEWHTFKLEHRKNYQDDTEERFRLKIFNENKHKIAKHNQRYAEGKVSFKLAVNKYADL 84
Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFK---YQNLSMTDVPTSLDWRDKKAVTPIKDQQE 156
+ EFR L G+ +T +FK + + + +P S+DWR K AVT +KDQ
Sbjct: 85 LHHEFRQLMNGFNYTLHKQLRSTDDSFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQGH 144
Query: 157 CGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTN-GNNGCGGGTMEKAFEYIIQNQ 215
CG CWAFS+ A+EG L+ LSEQ LVDCST GNNGC GG M+ AF YI N
Sbjct: 145 CGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNG 204
Query: 216 GIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAV-SMQPVSIGIAAYT 274
GI TE YPY+A+ +C + A A + ++P GDE+ + +AV ++ PV++ I A
Sbjct: 205 GIDTEKSYPYEAIDDSCHFNKGAIGATDRGFTDIPQGDEKKMAEAVATVGPVAVAIDASH 264
Query: 275 TEFKSYKEGIFN-GVCGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILR 332
F+ Y EG++N C Q LDH V +VG+GT E G +YWL+KNSWG TWGD G++K+LR
Sbjct: 265 ESFQFYSEGVYNEPQCDAQNLDHGVLVVGYGTDESGDDYWLVKNSWGTTWGDKGFIKMLR 324
Query: 333 D-EGLCGIGTQSSYPL 347
+ + CGI + SSYPL
Sbjct: 325 NKDNQCGIASASSYPL 340
>gi|195583187|ref|XP_002081405.1| GD10995 [Drosophila simulans]
gi|194193414|gb|EDX06990.1| GD10995 [Drosophila simulans]
Length = 341
Score = 268 bits (685), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 139/316 (43%), Positives = 190/316 (60%), Gaps = 11/316 (3%)
Query: 43 VVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANK---EGNRTYKLGTNRFSDL 99
V+E + +H ++Y+D+ E+ R KIF EN I K N+ EG ++KL N+++DL
Sbjct: 25 VMEEWHTFKLEHRKNYQDDTEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADL 84
Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFK---YQNLSMTDVPTSLDWRDKKAVTPIKDQQE 156
+ EFR L G+ +FK + + + +P S+DWR K AVT +KDQ
Sbjct: 85 LHHEFRQLMNGFNYTLHKQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQGH 144
Query: 157 CGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTN-GNNGCGGGTMEKAFEYIIQNQ 215
CG CWAFS+ A+EG L+ LSEQ LVDCST GNNGC GG M+ AF YI N
Sbjct: 145 CGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNG 204
Query: 216 GIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAV-SMQPVSIGIAAYT 274
GI TE YPY+A+ +C + A + ++P GDE+ + +AV ++ PVS+ I A
Sbjct: 205 GIDTEKSYPYEAIDDSCHFNKGTIGATDRGFTDIPQGDEKKMAEAVATVGPVSVAIDASH 264
Query: 275 TEFKSYKEGIFN-GVCGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILR 332
F+ Y EG++N C Q LDH V +VGFGT E G +YWL+KNSWG TWGD G++K+LR
Sbjct: 265 ESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESGDDYWLVKNSWGTTWGDKGFIKMLR 324
Query: 333 D-EGLCGIGTQSSYPL 347
+ E CGI + SSYPL
Sbjct: 325 NKENQCGIASASSYPL 340
>gi|125604306|gb|EAZ43631.1| hypothetical protein OsJ_28254 [Oryza sativa Japonica Group]
Length = 369
Score = 268 bits (684), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 141/314 (44%), Positives = 191/314 (60%), Gaps = 23/314 (7%)
Query: 40 EQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDL 99
E+++ E++E+W QH R +D EK RF +FK+N+ I + N+ + YKL NRF D+
Sbjct: 41 EEALWELYERWRGQH-RVARDLGEKARRFNVFKDNVRLIHEFNRR-DEPYKLRLNRFGDM 98
Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGC 159
T DE Y ++ HR K Q L AV +KDQ +CG
Sbjct: 99 TADESAGAYASSRVSH--HRMFRGRGEKAQRL-------------HGAVGAVKDQGQCGS 143
Query: 160 CWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTN-GNNGCGGGTMEKAFEYIIQNQGIA 218
CWAFS +AAVEGI I +NL LSEQQLVDC T GN GC GG M+ AF+YI ++ G+A
Sbjct: 144 CWAFSTIAAVEGINAIRTSNLTALSEQQLVDCDTKTGNAGCDGGLMDNAFQYIAKHGGVA 203
Query: 219 TEDEYPYQAVQGTCSAAQKAAA-AKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEF 277
YPY+A Q +C ++ ++ I YE+VP+ E AL KAV+ QPVS+ I A + F
Sbjct: 204 ASSAYPYRARQSSCKSSAASSPAVTIDGYEDVPANSESALKKAVANQPVSVAIEAGGSHF 263
Query: 278 KSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD---- 333
+ Y EG+F G CGT+LDH V VG+GTT DG YW+++NSWG WG+ GY+++ RD
Sbjct: 264 QFYSEGVFAGKCGTELDHGVAAVGYGTTVDGTKYWIVRNSWGADWGEKGYIRMKRDVSAK 323
Query: 334 EGLCGIGTQSSYPL 347
EGLCGI ++SYP+
Sbjct: 324 EGLCGIAMEASYPI 337
>gi|72008176|ref|XP_780713.1| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
Length = 335
Score = 268 bits (684), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 144/337 (42%), Positives = 210/337 (62%), Gaps = 13/337 (3%)
Query: 19 FIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYI 78
++ ++L+ +C VVSS S E +W +HG+ Y + E+ R I+++NL+ +
Sbjct: 3 YLSVLLVAAC---VVSSLSMSFTDFDEDWNQWKNEHGKRYLSDEEEASRKLIWEKNLDIV 59
Query: 79 EKANKE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD 135
K N + G+ TY LG N+F+DL N+EF A+ TG+++ S ++ STF N ++ +
Sbjct: 60 IKHNLKYDLGHFTYALGMNQFADLKNEEFVAMMTGFRVNGTS-KAAKGSTFLPSN-NIGE 117
Query: 136 VPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCS-TN 194
+P ++DWR K VTP+KDQ +CG CWAFS ++EG + L+ LSEQ LVDCS
Sbjct: 118 LPKTVDWRTKGYVTPVKDQGQCGSCWAFSTTGSLEGQHFKATGKLVSLSEQNLVDCSGKE 177
Query: 195 GNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDE 254
GN GC GG M++AF+YII+ GI TE+ YPY+AV G C + A ++ Y +V S E
Sbjct: 178 GNEGCDGGLMDQAFQYIIKAGGIDTEESYPYKAVDGECHFKKANIGATVTGYTDVTSDSE 237
Query: 255 QALLKAVS-MQPVSIGIAAYTTEFKSYKEGIFN--GVCGTQLDHAVTIVGFGTTEDGANY 311
AL KAV+ + P+S+ I A F+ YK G++N T LDH V VG+GTT DG +Y
Sbjct: 238 TALQKAVAHIGPISVAIDASHMSFQLYKSGVYNEPDCSSTLLDHGVLAVGYGTTSDGTDY 297
Query: 312 WLIKNSWGDTWGDAGYMKILRD-EGLCGIGTQSSYPL 347
W++KNSW +TWG GY+ + R+ + CGI TQ+SYPL
Sbjct: 298 WIVKNSWAETWGMNGYLWMSRNKDNQCGIATQASYPL 334
>gi|386648114|gb|AFJ15104.1| mexicain-like cystein protease, partial [Jacaratia mexicana]
Length = 323
Score = 268 bits (684), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 141/312 (45%), Positives = 198/312 (63%), Gaps = 16/312 (5%)
Query: 41 QSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLT 100
+ +V + E W ++ + YK+ EK RF+IFK+NL YI++ NK+ N +Y LG N F+DLT
Sbjct: 16 ERLVRLFESWTLENDKIYKNIDEKIYRFEIFKDNLMYIDETNKK-NSSYWLGLNEFADLT 74
Query: 101 NDEFRALYTG-YKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGC 159
+DEF+A Y G S + F Y+++ D P S+DWR K AVTP+K+Q CG
Sbjct: 75 HDEFKAKYVGSLGEDSTIIEQSDDEEFPYKHV--VDYPESIDWRQKGAVTPVKNQNPCGS 132
Query: 160 CWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIAT 219
CWAFS VA VEGI KI LI LSEQ+L+DC ++GC GG + +Y+ N G+ T
Sbjct: 133 CWAFSTVATVEGINKIVTGKLISLSEQELLDCDRR-SHGCKGGYQTTSLQYVADN-GVHT 190
Query: 220 EDEYPYQAVQGTCSAA-QKAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFK 278
E EYPY+ QG C A +K + KI+ Y+ VP+ +E +L++A++ QPVS+ + + F+
Sbjct: 191 EKEYPYEKKQGKCRAKDKKGSKVKITGYKRVPANNEVSLIQAIANQPVSVVVESKGRAFQ 250
Query: 279 SYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILR----DE 334
YK GIF G CGT++DHAVT VG+ G NY LIKNSWG WG+ GY++I R +
Sbjct: 251 FYKGGIFEGPCGTKVDHAVTAVGY-----GKNYILIKNSWGPKWGEKGYIRIKRASGKSK 305
Query: 335 GLCGIGTQSSYP 346
G CG+ + S +P
Sbjct: 306 GTCGVYSSSYFP 317
>gi|255078398|ref|XP_002502779.1| cysteine endopeptidase [Micromonas sp. RCC299]
gi|226518045|gb|ACO64037.1| cysteine endopeptidase [Micromonas sp. RCC299]
Length = 414
Score = 267 bits (683), Expect = 5e-69, Method: Compositional matrix adjust.
Identities = 135/318 (42%), Positives = 198/318 (62%), Gaps = 17/318 (5%)
Query: 42 SVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKE---GNRTYKLGTNRFSD 98
S+ ++ +W +HG++Y E EKE+R KIF +N E+++K N E G T+ +G N +D
Sbjct: 63 SLSDLFHEWTQKHGKTYDSEEEKELRLKIFADNHEFVQKHNAEYENGEHTHFVGLNHLAD 122
Query: 99 LTNDEFRALYTGYKMPSPSHRSTT-SSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQEC 157
LT DEF+ + GY + R+ +ST++Y +++ P +DW AVTP+K+Q++C
Sbjct: 123 LTKDEFKKML-GYNAALRASRAPVDASTWEYADVT---PPEEIDWVASGAVTPVKNQKQC 178
Query: 158 GCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGI 217
G CWAFS AVEG+ I LI LSE++L+ CSTNGN GC GG M+ FE+I+ N+GI
Sbjct: 179 GSCWAFSTTGAVEGVNAIKTGKLISLSEEELISCSTNGNMGCNGGLMDNGFEWIVNNRGI 238
Query: 218 ATEDEYPYQAVQGTCSAAQK-AAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTE 276
TED + Y A + C ++ A I +++VPS DE +L+KAVS QPVS+ I A
Sbjct: 239 DTEDGWEYVAKEEKCGFFRRHHRAVAIDGFKDVPSNDEDSLMKAVSQQPVSVAIEADHQS 298
Query: 277 FKSYKEGIFNGV-CGTQLDHAVTIVGFGT---TEDGANYWLIKNSWGDTWGDAGYMKILR 332
F+ Y G+++ CGT+LDH V +VG+G + ++W IKNSWG WG+ GY++I +
Sbjct: 299 FQLYAGGVYSAKDCGTELDHGVLLVGYGVDPKSTKHKHFWKIKNSWGPAWGEDGYIRIAK 358
Query: 333 D----EGLCGIGTQSSYP 346
EG CG+ Q SYP
Sbjct: 359 GGSGVEGQCGVAMQPSYP 376
>gi|242049716|ref|XP_002462602.1| hypothetical protein SORBIDRAFT_02g028840 [Sorghum bicolor]
gi|241925979|gb|EER99123.1| hypothetical protein SORBIDRAFT_02g028840 [Sorghum bicolor]
Length = 384
Score = 267 bits (683), Expect = 5e-69, Method: Compositional matrix adjust.
Identities = 141/361 (39%), Positives = 197/361 (54%), Gaps = 61/361 (16%)
Query: 43 VVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTND 102
++E E+WM +HGR Y D EK+ R ++++ N+ +E N N Y+L N+F+DLTN+
Sbjct: 28 MLERFEQWMGRHGRLYADAGEKQRRLEVYRRNVALVETFNSMSNGGYRLADNKFADLTNE 87
Query: 103 EFRALYTGYKMPSPSHRSTTSSTF-------------KYQNLSMTDVPTSLDWRDKKAVT 149
EFRA G+ P P R+T +T +Y + ++P S+DWR+K AV
Sbjct: 88 EFRAKMLGFGRPPPHGRATGHTTTPGTVACIGSGLGRRYSD----ELPKSVDWREKGAVA 143
Query: 150 PIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFE 209
P+K+Q ECG CWAFSAVAA+EGI +I L+ LSEQ+LVDC T GC GG M AFE
Sbjct: 144 PVKNQGECGSCWAFSAVAAIEGINQIKNGKLVSLSEQELVDCDTKA-IGCAGGYMSWAFE 202
Query: 210 YIIQNQGIATEDEYPYQ----------------------------AVQGTCSAAQ-KAAA 240
+++ N G+ TE YPYQ + G C + K +A
Sbjct: 203 FVMNNSGLTTERNYPYQGTYAHGNRKTHALPFDCTKGSSTCDSRAGMNGACQTPKLKESA 262
Query: 241 AKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIV 300
IS Y V + E LL+A + QPVS+ + A + ++ Y G+F G C L+H VT+V
Sbjct: 263 VSISGYVNVTASSEPDLLRAAAAQPVSVAVDAGSFVWQLYGGGVFTGPCTADLNHGVTVV 322
Query: 301 GFGTTE----------DGANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYP 346
G+G T+ G YW++KNSWG WGDAGY+ + R+ GLCGI SYP
Sbjct: 323 GYGETQRDTDGDGTGVPGQKYWIVKNSWGPEWGDAGYILMQREASVASGLCGIALLPSYP 382
Query: 347 L 347
+
Sbjct: 383 V 383
>gi|297830594|ref|XP_002883179.1| hypothetical protein ARALYDRAFT_318695 [Arabidopsis lyrata subsp.
lyrata]
gi|297329019|gb|EFH59438.1| hypothetical protein ARALYDRAFT_318695 [Arabidopsis lyrata subsp.
lyrata]
Length = 308
Score = 267 bits (683), Expect = 5e-69, Method: Compositional matrix adjust.
Identities = 138/309 (44%), Positives = 189/309 (61%), Gaps = 24/309 (7%)
Query: 46 MHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFR 105
M+E+W+ ++ ++Y EKE R KIFKENL++I++ N N+T+++G RF+DLTNDE +
Sbjct: 1 MYERWLVENRKNYNGLGEKERRCKIFKENLKFIDEHNSLPNQTFEVGLTRFADLTNDEPK 60
Query: 106 ALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSA 165
+ + Y+ + +P +DWR K AV P+KDQ CG CWAFSA
Sbjct: 61 DF-------------MKADRYLYKEGDI--LPDEIDWRAKGAVVPVKDQGNCGSCWAFSA 105
Query: 166 VAAVEGITKISGANLIQLSEQQLVDCSTNG-NNGCGGGTMEKAFEYIIQNQGIATEDEYP 224
V AVEGI +I LI LS+Q+L+DC N GC GG M AFE+II N GI ++ +YP
Sbjct: 106 VGAVEGINQIKTGELISLSDQELIDCDRGFVNAGCEGGVMNYAFEFIINNGGIESDQDYP 165
Query: 225 YQAVQ-GTCSAAQK--AAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYK 281
Y A G C+A +K KI YE V DE++L KAV+ QPV + I A + FK YK
Sbjct: 166 YTATDLGVCNADKKNNTRVVKIDGYEYVAQNDEKSLKKAVAHQPVGVAIEASSQAFKLYK 225
Query: 282 EGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD----EGLC 337
G+F G CG LDH V +VG+GT+ G +YW+I+NSWG WG+ GY+K+ R+ G C
Sbjct: 226 SGVFTGTCGIYLDHGVVVVGYGTSS-GEDYWIIRNSWGLNWGENGYVKLQRNIDDSFGKC 284
Query: 338 GIGTQSSYP 346
G+ SYP
Sbjct: 285 GVAMMPSYP 293
>gi|53791858|dbj|BAD53944.1| putative cysteine protease [Oryza sativa Japonica Group]
Length = 335
Score = 267 bits (683), Expect = 5e-69, Method: Compositional matrix adjust.
Identities = 141/338 (41%), Positives = 193/338 (57%), Gaps = 20/338 (5%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEY 77
+ ++ L + A+ + + + ++M E+WMA+ G++YK EKE RF IF++N+ +
Sbjct: 7 LVCTLMALQAMAASAYYNNGSDDGVTMQMFEEWMAKFGKTYKCHGEKEHRFGIFRDNVHF 66
Query: 78 IEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
I + +G N+F+DLTNDEF A YTG K P P + + P
Sbjct: 67 IRGYKPQVTYDSAVGINQFADLTNDEFVATYTGAKPPHPKEAP--------RPVDPIWTP 118
Query: 138 TSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNN 197
+DWR + AVT +KDQ CG CWAF+AVAA+EG+TKI L LSEQ+LVDC TN +N
Sbjct: 119 CCIDWRFRGAVTGVKDQGACGSCWAFAAVAAIEGLTKIRTGQLTPLSEQELVDCDTN-SN 177
Query: 198 GCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKA--AAAKISNYEEVPSGDEQ 255
GCGGG ++AFE + GI E +Y Y+ QG C AA I Y VP DE+
Sbjct: 178 GCGGGHTDRAFELVASKGGITAESDYRYEGFQGKCRVDDMLFNHAASIGGYRAVPPNDER 237
Query: 256 ALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGAN---YW 312
L AV+ QPV++ I A F+ YK G+F G CG +HAVT+VG+ +DGA+ YW
Sbjct: 238 QLATAVARQPVTVYIDASGPAFQFYKSGVFPGPCGASSNHAVTLVGY--CQDGASGKKYW 295
Query: 313 LIKNSWGDTWGDAGYM----KILRDEGLCGIGTQSSYP 346
L KNSWG TWG GY+ I++ G CG+ YP
Sbjct: 296 LAKNSWGKTWGQQGYILLEKDIVQPHGTCGLAVSPFYP 333
>gi|260516678|gb|ACX43965.1| cysteine protease 1 [Brachiaria hybrid cultivar]
Length = 338
Score = 267 bits (682), Expect = 6e-69, Method: Compositional matrix adjust.
Identities = 145/322 (45%), Positives = 195/322 (60%), Gaps = 15/322 (4%)
Query: 30 SQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTY 89
S + S E + +M +M Q+ ++Y E RF FK N+E I N N +Y
Sbjct: 25 SALFSEEVPSEVMLQDMFTAFMKQYSKAYS-HAEFSSRFNQFKANVETIRLHNTLANASY 83
Query: 90 KLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVT 149
+G N F+DL+ +EF+ Y GYK R S +Q + PTS+DWR AVT
Sbjct: 84 TMGLNEFADLSFEEFKGKYFGYKHV---EREFARSNNLHQEVEA--APTSIDWRTSNAVT 138
Query: 150 PIKDQQECGCCWAFSAVAAVEGITKISGAN-LIQLSEQQLVDCSTN-GNNGCGGGTMEKA 207
PIKDQ +CG CWAFSA ++EG + G + L LSEQQLVDCST+ G+ GC GG M+ A
Sbjct: 139 PIKDQGQCGSCWAFSATGSIEGAWVLQGKHTLTSLSEQQLVDCSTSYGDAGCNGGLMDYA 198
Query: 208 FEYIIQNQGIATEDEYPYQAVQGTCSAAQKAA--AAKISNYEEVPSGDEQALLKAV-SMQ 264
FEYII N+GI E YPY+ V G C QK+ IS Y++V SGDE +LL AV ++
Sbjct: 199 FEYIIANKGICAESAYPYKGVGGLC---QKSCTKVVTISGYKDVASGDEASLLNAVGTVG 255
Query: 265 PVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGD 324
PVS+ I A F+ Y G+F+G CG LDH V VG+GTT +YW++KNSWG +WG+
Sbjct: 256 PVSVAIEADQAGFQFYSSGVFSGTCGHNLDHGVLAVGYGTT-GSQDYWIVKNSWGTSWGE 314
Query: 325 AGYMKILRDEGLCGIGTQSSYP 346
+GY++++R++ CGI Q SYP
Sbjct: 315 SGYIRMIRNKNQCGIAIQPSYP 336
>gi|15290195|dbj|BAB63884.1| putative cysteine protease [Oryza sativa Japonica Group]
gi|125525813|gb|EAY73927.1| hypothetical protein OsI_01811 [Oryza sativa Indica Group]
Length = 342
Score = 267 bits (682), Expect = 7e-69, Method: Compositional matrix adjust.
Identities = 144/341 (42%), Positives = 195/341 (57%), Gaps = 28/341 (8%)
Query: 23 ILLVSC---ASQVVSSRSTHEQS-----VVEMHEKWMAQHGRSYKDELEKEMRFKIFKEN 74
+LLV C A Q + + + + ++M E+WMA+ G++YK EKE RF IF++N
Sbjct: 11 VLLVVCTLMALQAMGADAYYNNGSDDGVTMQMFEEWMAKFGKTYKCHGEKEHRFGIFRDN 70
Query: 75 LEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT 134
+ +I + +G N+F+DLTNDEF A YTG K P P + +
Sbjct: 71 VHFIRGYKPQVTYDSAVGINQFADLTNDEFVATYTGAKPPHPKEAP--------RPVDPI 122
Query: 135 DVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTN 194
P +DWR + AVT +KDQ CG CWAF+AVAA+EG+TKI L LSEQ+LVDC TN
Sbjct: 123 WTPCCIDWRFRGAVTGVKDQGACGSCWAFAAVAAIEGLTKIRTGQLTPLSEQELVDCDTN 182
Query: 195 GNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKA--AAAKISNYEEVPSG 252
+NGCGGG ++AFE + GI E +Y Y+ QG C AA+I Y VP
Sbjct: 183 -SNGCGGGHTDRAFELVASKGGITAESDYRYEGFQGKCRVDDMLFNHAARIGGYRAVPPN 241
Query: 253 DEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGAN-- 310
DE+ L AV+ QPV++ I A F+ YK G+F G CG +HAVT+VG+ +DGA+
Sbjct: 242 DERQLATAVARQPVTVYIDASGPAFQFYKSGVFPGPCGASSNHAVTLVGY--CQDGASGK 299
Query: 311 -YWLIKNSWGDTWGDAGYM----KILRDEGLCGIGTQSSYP 346
YW+ KNSWG TWG GY+ +L+ G CG+ YP
Sbjct: 300 KYWVAKNSWGKTWGQQGYILLEKDVLQPHGTCGLAVSPFYP 340
>gi|195334204|ref|XP_002033774.1| GM21500 [Drosophila sechellia]
gi|194125744|gb|EDW47787.1| GM21500 [Drosophila sechellia]
Length = 341
Score = 267 bits (682), Expect = 7e-69, Method: Compositional matrix adjust.
Identities = 138/316 (43%), Positives = 190/316 (60%), Gaps = 11/316 (3%)
Query: 43 VVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANK---EGNRTYKLGTNRFSDL 99
V+E + +H ++Y+D+ E+ R KIF EN I K N+ EG ++KL N+++DL
Sbjct: 25 VMEEWHTFKLEHRKNYQDDTEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADL 84
Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFK---YQNLSMTDVPTSLDWRDKKAVTPIKDQQE 156
+ EFR L G+ +FK + + + +P S+DWR K AVT +KDQ
Sbjct: 85 LHHEFRQLMNGFNYTLHKQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQGH 144
Query: 157 CGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTN-GNNGCGGGTMEKAFEYIIQNQ 215
CG CWAFS+ A+EG L+ LSEQ LVDCST GNNGC GG M+ AF YI N
Sbjct: 145 CGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNG 204
Query: 216 GIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAV-SMQPVSIGIAAYT 274
GI TE YPY+A+ +C + A + ++P GDE+ + +AV ++ PV++ I A
Sbjct: 205 GIDTEKSYPYEAIDDSCHFNKGTIGATDRGFTDIPQGDEKKMAEAVATVGPVAVAIDASH 264
Query: 275 TEFKSYKEGIFN-GVCGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILR 332
F+ Y EG++N C Q LDH V +VGFGT E G +YWL+KNSWG TWGD G++K+LR
Sbjct: 265 ESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESGEDYWLVKNSWGTTWGDKGFIKMLR 324
Query: 333 D-EGLCGIGTQSSYPL 347
+ E CGI + SSYPL
Sbjct: 325 NKENQCGIASASSYPL 340
>gi|357133074|ref|XP_003568153.1| PREDICTED: cysteine proteinase RD21a-like [Brachypodium distachyon]
Length = 565
Score = 266 bits (681), Expect = 8e-69, Method: Compositional matrix adjust.
Identities = 134/314 (42%), Positives = 189/314 (60%), Gaps = 15/314 (4%)
Query: 46 MHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNR--------TYKLGTNRFS 97
+ E W A+HG++Y E+ R F +N ++ N G +Y L N F+
Sbjct: 41 LFEAWCAEHGKAYASPGERAARLAAFADNAAFVAAHNAGGGGAGGSNAAPSYTLALNAFA 100
Query: 98 DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQEC 157
DLT+ EFRA G ++ R+ S ++ + VP +LDWR AVT +KDQ C
Sbjct: 101 DLTHAEFRAARLG-RLAVGGARAPPSEGGFAGSVGVGAVPEALDWRQSGAVTKVKDQGSC 159
Query: 158 GCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGI 217
G CW+FSA A+EGI KI +LI LSEQ+L+DC + N GCGGG M+ A+ ++I+N GI
Sbjct: 160 GACWSFSATGAIEGINKIKTGSLISLSEQELIDCDRSYNAGCGGGLMDYAYRFVIKNGGI 219
Query: 218 ATEDEYPYQAVQGTCSAAQ-KAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTE 276
TED+YPY+ GTC+ + K I Y +VP+ E +LL+AV+ QP+S+GI
Sbjct: 220 DTEDDYPYREADGTCNKNKLKRHVVTIDGYSDVPANKEDSLLQAVAQQPISVGICGSARA 279
Query: 277 FKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD--- 333
F+ Y +GIF+G C T LDHAV IVG+G +E G +YW++KNSWG+ WG GYM + R+
Sbjct: 280 FQLYSQGIFDGPCPTSLDHAVLIVGYG-SEGGKDYWIVKNSWGERWGMKGYMHMHRNTGS 338
Query: 334 -EGLCGIGTQSSYP 346
G+CGI +S+P
Sbjct: 339 SSGICGINMMASFP 352
>gi|242088413|ref|XP_002440039.1| hypothetical protein SORBIDRAFT_09g024940 [Sorghum bicolor]
gi|241945324|gb|EES18469.1| hypothetical protein SORBIDRAFT_09g024940 [Sorghum bicolor]
Length = 463
Score = 266 bits (681), Expect = 8e-69, Method: Compositional matrix adjust.
Identities = 131/315 (41%), Positives = 196/315 (62%), Gaps = 16/315 (5%)
Query: 46 MHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNR--------TYKLGTNRFS 97
+ + W A+HG++Y E+ R +F +N ++ N N +Y L N F+
Sbjct: 40 LFDAWCAEHGKAYATPEERAARLAVFADNAAFVAAHNARVNAAGGGGAPPSYTLALNAFA 99
Query: 98 DLTNDEFRALYTG-YKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQE 156
DLT++EFRA G + + RS + ++ + + VP +LDWR+ AVT +KDQ
Sbjct: 100 DLTHEEFRAARLGRIAAGAAALRSPAAPVYRGLDGGLGAVPDALDWRENGAVTKVKDQGS 159
Query: 157 CGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQG 216
CG CW+FSA A+EGI KI +L+ LSEQ+L+DC + N+GCGGG M+ A++++++N G
Sbjct: 160 CGACWSFSATGAMEGINKIKTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYKFVVKNGG 219
Query: 217 IATEDEYPYQAVQGTCSAAQ-KAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTT 275
I TE++YPY+ GTC+ + K I Y +VPS E LL+AV+ QPVS+GI
Sbjct: 220 IDTEEDYPYREADGTCNKNKLKKRIVTIDGYSDVPSNKEDLLLQAVAQQPVSVGICGSAR 279
Query: 276 EFKSY-KEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD- 333
F+ Y ++GIF+G C T LDHAV IVG+G +E G +YW++KNSWG++WG GYM + R+
Sbjct: 280 AFQLYSQQGIFDGPCPTSLDHAVLIVGYG-SEGGKDYWIVKNSWGESWGMKGYMHMHRNT 338
Query: 334 ---EGLCGIGTQSSY 345
+G+CGI +S+
Sbjct: 339 GDSKGVCGINMMASF 353
>gi|167521499|ref|XP_001745088.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163776702|gb|EDQ90321.1| predicted protein [Monosiga brevicollis MX1]
Length = 294
Score = 266 bits (681), Expect = 8e-69, Method: Compositional matrix adjust.
Identities = 143/304 (47%), Positives = 193/304 (63%), Gaps = 21/304 (6%)
Query: 52 AQHGRSYKDELEKEMRFKIFKENLEYIEKANKE---GNRTYKLGTNRFSDLTNDEFRALY 108
+ + +SY+ E + R F+ NLE+I K N E G +Y +G N F+DLT DEF ALY
Sbjct: 3 SDYSKSYESEAVEAKRLAAFEANLEFINKHNAEHAQGLHSYTVGVNEFADLTIDEFMALY 62
Query: 109 TGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAA 168
+PS +R+ +T Y + D S+DWR K AVTPIK+Q +CG CW+FS +
Sbjct: 63 ----VPSKFNRTMPYNTV-YLPATSED---SVDWRTKGAVTPIKNQGQCGSCWSFSTTGS 114
Query: 169 VEGITKISGANLIQLSEQQLVDCSTN-GNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQA 227
EG I+ NL+ LSEQQLVDCS + GN GC GG M+ AF+YII N+G+ TE++YPY A
Sbjct: 115 TEGAHAIATGNLVSLSEQQLVDCSGSFGNQGCNGGLMDDAFKYIISNKGLDTEEDYPYTA 174
Query: 228 VQGTCSAAQKAA-AAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFN 286
GTC+ ++A AA IS+Y +VP +E L AV+ PVS+ I A + F+ YK G+F+
Sbjct: 175 QDGTCNKEKEAKHAATISSYSDVPKNNEDQLAAAVAKGPVSVAIEADQSGFQLYKSGVFD 234
Query: 287 GVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD---EGLCGIGTQS 343
G CGT LDH V +VG+ T+D YW++KNSWG TWG GY+ + R G+CGI Q
Sbjct: 235 GNCGTNLDHGVLVVGY--TDD---YWIVKNSWGTTWGVEGYINMKRGVSASGICGIAMQP 289
Query: 344 SYPL 347
SYP+
Sbjct: 290 SYPI 293
>gi|125525815|gb|EAY73929.1| hypothetical protein OsI_01813 [Oryza sativa Indica Group]
Length = 336
Score = 266 bits (680), Expect = 9e-69, Method: Compositional matrix adjust.
Identities = 140/334 (41%), Positives = 192/334 (57%), Gaps = 20/334 (5%)
Query: 22 IILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKA 81
++ L + A+ + + + ++M E+WMA+ G++YK EKE RF IF++N+ +I
Sbjct: 12 LMALQAMAASAYYNNGSDDGVTMQMFEEWMAKFGKTYKCHGEKEHRFGIFRDNVHFIRGY 71
Query: 82 NKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLD 141
+ +G N+F+DLTNDEF A YTG K P P + + P +D
Sbjct: 72 KPQVTYDSAVGINQFADLTNDEFVATYTGAKPPHPKEAP--------RPVDPIWTPCCID 123
Query: 142 WRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGG 201
WR + AVT +KDQ CG CWAF+AVAA+EG+TKI L LSEQ+LVDC TN +NGCGG
Sbjct: 124 WRFRGAVTGVKDQGACGSCWAFAAVAAIEGLTKIRTGQLTPLSEQELVDCDTN-SNGCGG 182
Query: 202 GTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKA--AAAKISNYEEVPSGDEQALLK 259
G ++AFE + GI E +Y Y+ QG C AA I Y VP DE+ L
Sbjct: 183 GHTDRAFELVASKGGITAESDYRYEGFQGKCRVDDMLFNHAASIGGYRAVPPNDERQLAT 242
Query: 260 AVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGAN---YWLIKN 316
AV+ QPV++ I A F+ YK G+F G CG +HAVT+VG+ +DGA+ YW+ KN
Sbjct: 243 AVARQPVTVYIDASGPAFQFYKSGVFPGPCGASSNHAVTLVGY--CQDGASGKKYWVAKN 300
Query: 317 SWGDTWGDAGYM----KILRDEGLCGIGTQSSYP 346
SWG TWG GY+ +L+ G CG+ YP
Sbjct: 301 SWGKTWGQQGYILLEKDVLQPHGTCGLAVSPFYP 334
>gi|18390634|ref|NP_563764.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|8844131|gb|AAF80223.1|AC025290_12 Contains similarity to a cysteine endopeptidase 1 from Phaseolus
vulgaris gb|U52970 and is a member of the papain
cysteine protease family PF|00112 [Arabidopsis thaliana]
gi|332189848|gb|AEE27969.1| cysteine proteinase-like protein [Arabidopsis thaliana]
Length = 343
Score = 266 bits (680), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 143/343 (41%), Positives = 202/343 (58%), Gaps = 21/343 (6%)
Query: 15 TIPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKEN 74
T+ + I +L+ S V SS +++ + EKW+ H + Y E +RF I++ N
Sbjct: 11 TLAVLICFVLIASKLCSVDSSVYDPHKTLKQRFEKWLKTHSKLYGGRDEWMLRFGIYQSN 70
Query: 75 LEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT 134
++ I+ N + +KL NRF+D+TN EF+A + G +T+S + +
Sbjct: 71 VQLIDYINSL-HLPFKLTDNRFADMTNSEFKAHFLGL--------NTSSLRLHKKQRPVC 121
Query: 135 D----VPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVD 190
D VP ++DWR + AVTPI++Q +CG CWAFSAVAA+EGI KI NL+ LSEQQL+D
Sbjct: 122 DPAGNVPDAVDWRTQGAVTPIRNQGKCGGCWAFSAVAAIEGINKIKTGNLVSLSEQQLID 181
Query: 191 CSTNG-NNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQ-KAAAAKISNYEE 248
C N GC GG ME AFE+I N G+ATE +YPY ++GTC + K I Y++
Sbjct: 182 CDVGTYNKGCSGGLMETAFEFIKTNGGLATETDYPYTGIEGTCDQEKSKNKVVTIQGYQK 241
Query: 249 VPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDG 308
V +E +L A + QPVS+GI A F+ Y G+F CGT L+H VT+VG+G D
Sbjct: 242 VAQ-NEASLQIAAAQQPVSVGIDAGGFIFQLYSSGVFTNYCGTNLNHGVTVVGYGVEGD- 299
Query: 309 ANYWLIKNSWGDTWGDAGYMKILR----DEGLCGIGTQSSYPL 347
YW++KNSWG WG+ GY+++ R D G CGI +SYPL
Sbjct: 300 QKYWIVKNSWGTGWGEEGYIRMERGVSEDTGKCGIAMMASYPL 342
>gi|356542171|ref|XP_003539543.1| PREDICTED: LOW QUALITY PROTEIN: KDEL-tailed cysteine endopeptidase
CEP2-like [Glycine max]
Length = 342
Score = 266 bits (679), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 145/346 (41%), Positives = 208/346 (60%), Gaps = 24/346 (6%)
Query: 15 TIPMFIIIILLVSCASQVVSS-------RSTHEQSVVEM-HEKWMAQHGRSYKDELEKEM 66
TI + II LLV C + +S ++ + V+ M +E W+ ++G+ Y+++ E E
Sbjct: 4 TITLVAIINLLVLCNLWITASACPAKHNDNSSDSEVMRMRYESWLKKYGQKYRNKDEWEF 63
Query: 67 RFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTF 126
RF+I++ N+++IE N + N +YKL N+F DLTN+EFR +Y Y+ RS + F
Sbjct: 64 RFEIYRANVQFIEVYNSQ-NYSYKLMDNKFVDLTNEEFRRMYLVYQP-----RSHLQTRF 117
Query: 127 KYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQ 186
YQ D+P +DWR + AVT IKDQ CG CW+FSAVA VE I KI L+ LSEQ
Sbjct: 118 MYQ--KHGDLPKRIDWRTRGAVTXIKDQGHCGSCWSFSAVATVEDINKIKTGKLVSLSEQ 175
Query: 187 QLVDC-STNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQ-KAAAAKIS 244
QL+DC + NGN GC GG ME F +I + G+ T+ YPYQ G + A+ + A I
Sbjct: 176 QLIDCDNRNGNEGCNGGHME-TFTFITKRGGLTTDKNYPYQGSDGDXNKAKVRNHAVAIC 234
Query: 245 NYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGT 304
YE +P+ +E L AV+ QP S+ A F+ Y +G F+G CG L+H +TIVG+G
Sbjct: 235 GYENLPAHNENMLKAAVAHQPASVATDAGGYAFQLYSKGTFSGSCGKDLNHRMTIVGYG- 293
Query: 305 TEDGANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYP 346
E+G YWL+KNSW + G +GY+++ RD +G CG ++SYP
Sbjct: 294 EENGEKYWLVKNSWANDXGVSGYIRMKRDPKDKDGTCGTAMEASYP 339
>gi|356517398|ref|XP_003527374.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 333
Score = 266 bits (679), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 151/341 (44%), Positives = 210/341 (61%), Gaps = 30/341 (8%)
Query: 19 FIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYI 78
F +++ + A QV + R+ + S+ E HE+ M ++ + YKD E F N+ YI
Sbjct: 12 FAMLLCMAFLAFQV-TCRTLQDASMYERHEQRMTRYSKVYKDPPES------FXGNVNYI 64
Query: 79 EKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPT 138
E N ++ YK G N+F R + G+ M S R TT FK++N++ T P+
Sbjct: 65 EACNNAADKPYKXGINQFPP------RNRFKGH-MCSSIIRITT---FKFENVTAT--PS 112
Query: 139 SLDWRDKKAVTP--IKDQQECGCCWAFSAVAAVEGITKISGANLIQLS-EQQLVDCSTNG 195
++D R K AVTP +KDQ +CGC WA SAVAA EGI + LI LS E +LVDC T G
Sbjct: 113 TVDCRQKGAVTPYTVKDQGQCGCFWALSAVAATEGIHALXAGKLILLSXEPELVDCDTKG 172
Query: 196 -NNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSA--AQKAAAAKISNYEEVPSG 252
+ GC GG + AF++IIQN G+ TE YPY+ V G C+A A K AA I+ Y++VP+
Sbjct: 173 VDQGCEGGLTDDAFKFIIQNHGLNTEANYPYKGVDGKCNANEADKNAATIITGYDDVPAN 232
Query: 253 DEQALL-KAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANY 311
+E+A L KAV+ PVS+ I A ++F+ YK G+F G CGT+LDH VT VG+G ++DG Y
Sbjct: 233 NEKAHLQKAVANNPVSVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVSDDGTEY 292
Query: 312 WLIKNSWGDTWGDAGYMKILR----DEGLCGIGTQSSYPLA 348
WL+KNS G WG+ GY+++ R +E LCGI Q+SYP A
Sbjct: 293 WLVKNSRGPEWGEEGYIRMQRGVDSEEALCGIAVQASYPSA 333
>gi|390337642|ref|XP_780653.3| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
Length = 333
Score = 265 bits (678), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 143/336 (42%), Positives = 209/336 (62%), Gaps = 13/336 (3%)
Query: 19 FIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYI 78
++ ++L+ +C VVSS S E +W +HG+ Y + E+ R I+++NL+ +
Sbjct: 3 YLSVLLVAAC---VVSSLSMSFTDFDEDWNEWKNEHGKRYLSDEEEASRRLIWQKNLDIV 59
Query: 79 EKANKE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD 135
K N + G+ TY LG N+F+DL N+EF A+ TG+++ S ++ STF N ++ +
Sbjct: 60 IKHNLKYDLGHFTYDLGINQFTDLQNEEFVAMMTGFRVSGTS-KAAKGSTFLPPN-NVGE 117
Query: 136 VPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNG 195
+P ++DWR K VTP+KDQ +CG CWAFS +VEG + L+ LSEQ LVDCS
Sbjct: 118 LPKTVDWRTKGYVTPVKDQGQCGSCWAFSTTGSVEGQHFKATGKLVSLSEQNLVDCSGR- 176
Query: 196 NNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQ 255
+ GC GG M++AF+YII GI TE YPY+AV G C + A ++ Y +V SG E+
Sbjct: 177 DAGCDGGFMDRAFQYIIDAGGIDTEASYPYKAVDGKCHFKKANVGATVTGYTDVTSGSEK 236
Query: 256 ALLKAVS-MQPVSIGIAAYTTEFKSYKEGIFN--GVCGTQLDHAVTIVGFGTTEDGANYW 312
AL KAV+ + P+S+ I A F+ YK G++N G T LDH V VG+GT+ DG +YW
Sbjct: 237 ALQKAVAHVGPISVAIDASHMSFQHYKSGVYNEPGCDSTVLDHGVLAVGYGTSSDGTDYW 296
Query: 313 LIKNSWGDTWGDAGYMKILRD-EGLCGIGTQSSYPL 347
++KNSW +TWG GY+ + R+ + CGI T +SYPL
Sbjct: 297 IVKNSWAETWGMNGYVWMSRNKDNQCGIATNASYPL 332
>gi|449450419|ref|XP_004142960.1| PREDICTED: vignain-like [Cucumis sativus]
Length = 345
Score = 265 bits (678), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 142/319 (44%), Positives = 209/319 (65%), Gaps = 20/319 (6%)
Query: 40 EQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDL 99
E+S+ +++E+W H S ++ EK RF +FKEN+ ++ N+ ++ YKL N+F+D+
Sbjct: 34 EESLWQLYERWGKHHTIS-RNLKEKHKRFSVFKENVNHVFTVNQM-DKPYKLKLNKFADM 91
Query: 100 TNDEFRALYTGYKMPSPSH------RSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKD 153
+N EF Y + SH R + F Y+ TD+P+S+DWR++ AV +K+
Sbjct: 92 SNYEFVNFYA---RSNISHYRKLHERRRGAGGFMYE--QDTDLPSSVDWRERGAVNAVKE 146
Query: 154 QQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQ 213
Q CG CWAFS+VAAVEGI KI L+ LSEQ+L+DC+ N GC GG ME AF++I +
Sbjct: 147 QGRCGSCWAFSSVAAVEGINKIKTNQLLSLSEQELLDCNYR-NKGCNGGFMEIAFDFIKR 205
Query: 214 NQGIATEDEYPYQAVQGTCSAAQ-KAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAA 272
N GIATE+ YPY +G C +++ + KI YE VP +E AL++AV+ QPVS+ I A
Sbjct: 206 NGGIATENSYPYHGSRGLCRSSRISSPIVKIDGYESVPE-NEDALMQAVANQPVSVAIDA 264
Query: 273 YTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILR 332
+F+ Y +G+F+G CGT+L+H V +G+GTTEDG +YWL++NSWG WG+ GY+++ R
Sbjct: 265 AGRDFQFYSQGVFDGYCGTELNHGVVAIGYGTTEDGTDYWLVRNSWGVGWGEDGYVRMKR 324
Query: 333 D----EGLCGIGTQSSYPL 347
EGLCGI ++SYP+
Sbjct: 325 GVEQAEGLCGIAMEASYPI 343
>gi|356543010|ref|XP_003539956.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 306
Score = 265 bits (677), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 143/309 (46%), Positives = 192/309 (62%), Gaps = 19/309 (6%)
Query: 48 EKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRAL 107
E+W+ Q+ R YKD+ E E+RF I++ NLEYIE N + +Y L N+F+DLTN+EF +
Sbjct: 6 ERWLKQNDRXYKDKEEWEVRFGIYQANLEYIECKNSQ-EXSYNLTDNKFADLTNEEFVSP 64
Query: 108 YTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVA 167
Y G+ R + F Y D+P S DWR + AV+ IKDQ CG CWAFSAVA
Sbjct: 65 YLGFGT-----RFLPHTGFMYH--EHEDLPESKDWRKEGAVSDIKDQGNCGSCWAFSAVA 117
Query: 168 AVEGITKISGANLIQLSEQQLVDCST-NGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQ 226
AVEGI KI L+ LSEQ+ DC +GN GC GG M+ AF +I +N G+ T +YPY+
Sbjct: 118 AVEGINKIKSGKLVSLSEQEFRDCDVEDGNQGCEGGLMDTAFAFIKKNGGLTTSKDYPYE 177
Query: 227 AVQGTCSAAQK-AAAAKISNYEEVPSGDEQALLKAVSM--QPVSIGIAAYTTEFKSYKEG 283
V GTC+ + AA IS + +VP+ DE L + Q S+ I A F+ Y +G
Sbjct: 178 GVDGTCNKEKALHHAANISGHVKVPANDEAMLKAKAAAANQXESVAIDAGGHAFQLYLKG 237
Query: 284 IFNGVCGTQLDHAVTIVGFGT-TEDGANYWLIKNSWGDTWGDAGYMKILRD----EGLCG 338
+F+G+CG QL+H VTIVG+G T D YW++KNSWG WG++GY+++ RD G CG
Sbjct: 238 VFSGICGKQLNHGVTIVGYGKGTSD--KYWIVKNSWGADWGESGYIRMKRDAFDKAGTCG 295
Query: 339 IGTQSSYPL 347
I Q+SYPL
Sbjct: 296 IAMQASYPL 304
>gi|226505708|ref|NP_001141813.1| uncharacterized protein LOC100273952 precursor [Zea mays]
gi|194706024|gb|ACF87096.1| unknown [Zea mays]
gi|413945958|gb|AFW78607.1| hypothetical protein ZEAMMB73_489507 [Zea mays]
Length = 460
Score = 265 bits (677), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 133/317 (41%), Positives = 191/317 (60%), Gaps = 20/317 (6%)
Query: 48 EKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNR-------------TYKLGTN 94
+ W A+HG++Y E+ R +F +N ++ N +Y L N
Sbjct: 37 DAWCAEHGKAYATPEERAARLAVFADNAAFVAAHNARAGANAAGGGGGGAAPPSYTLALN 96
Query: 95 RFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQ 154
F+DLT++EFRA G P + RS + + + VP +LDWR AVT +KDQ
Sbjct: 97 AFADLTHEEFRAARLGRIAPGAALRSRAAPVY-WGLGGGAAVPDALDWRKSGAVTKVKDQ 155
Query: 155 QECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQN 214
CG CW+FSA A+EGI KI +L+ LSEQ+L+DC + N+GCGGG M+ A++++I+N
Sbjct: 156 GSCGACWSFSATGAMEGINKIKTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYKFVIKN 215
Query: 215 QGIATEDEYPYQAVQGTCSAAQ-KAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAY 273
GI TE++YPY+ GTC+ + K I Y +VPS E LL+AV+ QPVS+GI
Sbjct: 216 GGIDTEEDYPYREADGTCNKNKLKKRVVTIDGYTDVPSNKEDLLLQAVAQQPVSVGICGS 275
Query: 274 TTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD 333
F+ Y +GIF+G C T LDHAV IVG+G +E G +YW++KNSWG++WG GYM + R+
Sbjct: 276 ARAFQLYYQGIFDGPCPTSLDHAVLIVGYG-SEGGKDYWIVKNSWGESWGMKGYMHMHRN 334
Query: 334 ----EGLCGIGTQSSYP 346
+G+CGI +S+P
Sbjct: 335 TGDSKGVCGINMMASFP 351
>gi|326503122|dbj|BAJ99186.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326512552|dbj|BAJ99631.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 389
Score = 265 bits (676), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 148/374 (39%), Positives = 206/374 (55%), Gaps = 37/374 (9%)
Query: 7 RSGSFKINTIPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEK--------WMAQHGRSY 58
R S + + ++L C+S+ +++ S H ++ H WM RSY
Sbjct: 12 RCSSLALCVLLATCSFLMLAGCSSESLTTSSEHSDIGIDKHHDLMMARFHVWMTVQNRSY 71
Query: 59 KDELEKEMRFKIFKENLEYIEKANKEGNR---TYKLGTNRFSDLTNDEFRALYTGYKMPS 115
EK RFK+++ N+ YIE N E TY+LG F+DLT++EF +LYTG K+P
Sbjct: 72 PTSSEKAHRFKVYRSNMRYIEALNAEATTSGFTYELGEGPFTDLTDEEFISLYTG-KIPD 130
Query: 116 PSHR----------STTSSTFK-------YQNLSMTDVPTSLDWRDKKAVTPIKDQQECG 158
HR +T + + Y N S P +DWR + AVTP+KDQ +CG
Sbjct: 131 DDHREDGVHDEQIITTHAGSVNGAEGVTVYANFS-AGAPIRMDWRKRGAVTPVKDQGKCG 189
Query: 159 CCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIA 218
CWAF VA +EGI KI L+ LSEQQLVDC + GC GG AF++IIQN GI
Sbjct: 190 SCWAFPTVATIEGIHKIKRGRLVSLSEQQLVDCDFL-DGGCNGGWPRNAFQWIIQNGGIT 248
Query: 219 TEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFK 278
T Y Y+A +G C +K AAKI+ Y +V S E +++ V+ QP++ I + +F+
Sbjct: 249 TTSSYTYKAAEGQCKGNRK-PAAKITGYRKVKSNSEVSMVNIVANQPIAASIVVHGGQFQ 307
Query: 279 SYKEGIFNGVCGT-QLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRDE--- 334
YK GI+NG C T +L+H +TIVG+G GA YW++KNSWG WG+ GYM + R
Sbjct: 308 HYKGGIYNGPCATSKLNHVITIVGYGQQAYGAKYWIVKNSWGAAWGNKGYMLMKRGTKNP 367
Query: 335 -GLCGIGTQSSYPL 347
G CGI + +PL
Sbjct: 368 LGQCGIAVRPIFPL 381
>gi|125570286|gb|EAZ11801.1| hypothetical protein OsJ_01675 [Oryza sativa Japonica Group]
Length = 319
Score = 265 bits (676), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 140/327 (42%), Positives = 188/327 (57%), Gaps = 20/327 (6%)
Query: 29 ASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRT 88
A+ + + + ++M E+WMA+ G++YK EKE RF IF++N+ +I +
Sbjct: 2 AASAYYNNGSDDGVTMQMFEEWMAKFGKTYKCHGEKEHRFGIFRDNVHFIRGYKPQVTYD 61
Query: 89 YKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAV 148
+G N+F+DLTNDEF A YTG K P P + + P +DWR + AV
Sbjct: 62 SAVGINQFADLTNDEFVATYTGAKPPHPKEAP--------RPVDPIWTPCCIDWRFRGAV 113
Query: 149 TPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAF 208
T +KDQ CG CWAF+AVAA+EG+TKI L LSEQ+LVDC TN +NGCGGG ++AF
Sbjct: 114 TGVKDQGACGSCWAFAAVAAIEGLTKIRTGQLTPLSEQELVDCDTN-SNGCGGGHTDRAF 172
Query: 209 EYIIQNQGIATEDEYPYQAVQGTCSAAQKA--AAAKISNYEEVPSGDEQALLKAVSMQPV 266
E + GI E +Y Y+ QG C AA I Y VP DE+ L AV+ QPV
Sbjct: 173 ELVASKGGITAESDYRYEGFQGKCRVDDMLFNHAASIGGYRAVPPNDERQLATAVARQPV 232
Query: 267 SIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGAN---YWLIKNSWGDTWG 323
++ I A F+ YK G+F G CG +HAVT+VG+ +DGA+ YWL KNSWG TWG
Sbjct: 233 TVYIDASGPAFQFYKSGVFPGPCGASSNHAVTLVGY--CQDGASGKKYWLAKNSWGKTWG 290
Query: 324 DAGYM----KILRDEGLCGIGTQSSYP 346
GY+ I++ G CG+ YP
Sbjct: 291 QQGYILLEKDIVQPHGTCGLAVSPFYP 317
>gi|125525812|gb|EAY73926.1| hypothetical protein OsI_01810 [Oryza sativa Indica Group]
Length = 319
Score = 264 bits (675), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 138/312 (44%), Positives = 183/312 (58%), Gaps = 20/312 (6%)
Query: 44 VEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDE 103
++M E+WMA+ G++YK EKE RF IF++N+ +I + +G N+F+DLTNDE
Sbjct: 17 MQMFEEWMAKFGKTYKCHGEKEHRFGIFRDNVHFIRGYKPQVTYDSAVGINQFADLTNDE 76
Query: 104 FRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAF 163
F A YTG K P P + + P +DWR + AVT +KDQ CG CWAF
Sbjct: 77 FVATYTGAKPPHPKEAP--------RPVDPIWTPCCIDWRFRGAVTGVKDQGACGSCWAF 128
Query: 164 SAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEY 223
+AVAA+EG+TKI L LSEQ+LVDC TN +NGCGGG ++AFE + GI E +Y
Sbjct: 129 AAVAAIEGLTKIRTGQLTPLSEQELVDCDTN-SNGCGGGHTDRAFELVASKGGITAESDY 187
Query: 224 PYQAVQGTCSAAQKA--AAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYK 281
Y+ QG C AA I Y VP DE+ L AV+ QPV++ I A F+ YK
Sbjct: 188 RYEGFQGKCRVDDMLFNHAASIGGYRAVPPNDERQLATAVARQPVTVYIDASGPAFQFYK 247
Query: 282 EGIFNGVCGTQLDHAVTIVGFGTTEDGAN---YWLIKNSWGDTWGDAGYM----KILRDE 334
G+F G CG +HAVT+VG+ +DGA+ YW+ KNSWG TWG GY+ +L+
Sbjct: 248 SGVFPGPCGASSNHAVTLVGY--CQDGASGKKYWVAKNSWGKTWGQQGYILLEKDVLQPH 305
Query: 335 GLCGIGTQSSYP 346
G CG+ YP
Sbjct: 306 GTCGLAVSPFYP 317
>gi|297843430|ref|XP_002889596.1| hypothetical protein ARALYDRAFT_887827 [Arabidopsis lyrata subsp.
lyrata]
gi|297335438|gb|EFH65855.1| hypothetical protein ARALYDRAFT_887827 [Arabidopsis lyrata subsp.
lyrata]
Length = 343
Score = 264 bits (674), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 142/343 (41%), Positives = 201/343 (58%), Gaps = 21/343 (6%)
Query: 15 TIPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKEN 74
T+ + I +L+ S V SS +++ + EKW+ H + Y E +RF I++ N
Sbjct: 11 TLVVLICFVLIASKLCSVNSSVYDPHKTLKQRFEKWLKTHSKLYGGRDEWMLRFGIYQSN 70
Query: 75 LEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT 134
++ I+ N + +KL NRF+D+TN EF+A + G +T+S + +
Sbjct: 71 VQLIDYINSL-HLPFKLTDNRFADMTNSEFKAHFLGL--------NTSSLRLHKKQRPVC 121
Query: 135 D----VPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVD 190
D VP ++DWR + AVTPI++Q +CG CWAFSAVAA+EGI KI NL+ LSEQQL+D
Sbjct: 122 DPAGNVPDAVDWRTQGAVTPIRNQGKCGGCWAFSAVAAIEGINKIKTGNLVSLSEQQLID 181
Query: 191 CSTNG-NNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQ-KAAAAKISNYEE 248
C N GC GG ME AFE+I N G+ TE +YPY ++GTC + K I Y++
Sbjct: 182 CDVGTYNKGCSGGLMETAFEFIKSNGGLTTETDYPYTGIEGTCDQEKAKNKVVTIQGYQK 241
Query: 249 VPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDG 308
V +E +L A + QPVS+GI A F+ Y G+F CGT L+H VT+VG+G D
Sbjct: 242 VAQ-NEASLQIAAAQQPVSVGIDAGGFIFQLYSSGVFTSYCGTNLNHGVTVVGYGVEGD- 299
Query: 309 ANYWLIKNSWGDTWGDAGYMKILR----DEGLCGIGTQSSYPL 347
YW++KNSWG WG+ GY+++ R D G CGI +SYPL
Sbjct: 300 QKYWIVKNSWGTGWGEEGYIRMERGISEDTGKCGIAMLASYPL 342
>gi|326497561|dbj|BAK05870.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 340
Score = 264 bits (674), Expect = 6e-68, Method: Compositional matrix adjust.
Identities = 140/343 (40%), Positives = 200/343 (58%), Gaps = 21/343 (6%)
Query: 14 NTIPMFIIIILLVSCASQVVSSRS-----THEQSVVEMHEKWMAQHGRSYKDELEKEMRF 68
T P+ I++LL + S + +++ +W A H RSY E+ RF
Sbjct: 7 GTRPVIPILVLLTGGLFAAFPAASGGRVDAGDMLMMDRFRQWQATHNRSYLSAEERLRRF 66
Query: 69 KIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKY 128
++++ N+EYI+ N+ G TY+LG N+F+DLT +EF A Y G H + +T
Sbjct: 67 EVYRTNVEYIDATNRRGGLTYELGENQFADLTGEEFLARYAG------GHTGSAITTAAE 120
Query: 129 QNLSM-TDVPTSLDWRDKKAVTPIKDQ-QECGCCWAFSAVAAVEGITKISGANLIQLSEQ 186
+ S+ D P S+DWR K AVTP+K+Q +C CWAFSAVA +E + I L+ LSEQ
Sbjct: 121 ADGSLEADPPASVDWRAKGAVTPVKNQGSQCYSCWAFSAVATMESLYFIKTGKLVALSEQ 180
Query: 187 QLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNY 246
QLVDC + GC G +AF++I++N GI T +YPY+AV+G CSAA+ A I+ +
Sbjct: 181 QLVDCDKY-DGGCNKGYYHRAFQWIMENGGITTAAQYPYKAVRGACSAAKPAV--TITGH 237
Query: 247 EEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTE 306
V +E AL AV+ QP+ + I + YK G+F+ CG Q+ HAV VG+G
Sbjct: 238 LAVAK-NELALQSAVARQPIGVAIEV-PISMQFYKSGVFSAACGIQMSHAVVTVGYGADA 295
Query: 307 DGANYWLIKNSWGDTWGDAGYMKILRD---EGLCGIGTQSSYP 346
G YWL+KNSWG TWG+AGY+++ RD GLCGI ++YP
Sbjct: 296 SGLKYWLVKNSWGQTWGEAGYIRMRRDVGGGGLCGIALDTAYP 338
>gi|194757786|ref|XP_001961143.1| GF13722 [Drosophila ananassae]
gi|190622441|gb|EDV37965.1| GF13722 [Drosophila ananassae]
Length = 417
Score = 263 bits (673), Expect = 6e-68, Method: Compositional matrix adjust.
Identities = 145/357 (40%), Positives = 203/357 (56%), Gaps = 29/357 (8%)
Query: 17 PMFIIIILLVSCASQVV----------SSRSTHEQ-----SVVEMHEKWMAQHGRSYKDE 61
P+ I++++L A +V + ++T EQ + H + +H ++Y DE
Sbjct: 63 PIAIVVVMLFVNAFILVFILKKRKAYQNLKATEEQPRTSYAATSTH---VLEHRKNYLDE 119
Query: 62 LEKEMRFKIFKENLEYIEKANK---EGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSH 118
E+ R KIF EN I K N+ G +YKL N+++D+ + EFR L G+
Sbjct: 120 TEERFRLKIFNENKHKIAKHNQLWASGKVSYKLAVNKYADMLHHEFRQLMNGFNYTLHKE 179
Query: 119 RSTTSSTFK---YQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKI 175
+FK + + +P S+DWRDK AVT +KDQ CG CWAFS+ A+EG
Sbjct: 180 LRAADESFKGVTFISPEHVTLPKSVDWRDKGAVTGVKDQGHCGSCWAFSSTGALEGQHYR 239
Query: 176 SGANLIQLSEQQLVDCSTN-GNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSA 234
L+ LSEQ LVDCST GNNGC GG M+ AF YI N GI TE YPY+A+ +C
Sbjct: 240 KSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGGIDTEKSYPYEALDDSCHF 299
Query: 235 AQKAAAAKISNYEEVPSGDEQALLKAV-SMQPVSIGIAAYTTEFKSYKEGIF-NGVCGTQ 292
+ A + ++P G+E+ L +AV ++ PVS+ I A F+ Y EG++ C Q
Sbjct: 300 NKGTIGATDRGFVDIPQGNEKKLAEAVATIGPVSVAIDASHESFQFYSEGVYVEPACDAQ 359
Query: 293 -LDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD-EGLCGIGTQSSYPL 347
LDH V +VGFGT E G +YWL+KNSWG TWGD G++K+LR+ + CGI + SSYPL
Sbjct: 360 NLDHGVLVVGFGTDESGQDYWLVKNSWGTTWGDKGFIKMLRNKDNQCGIASASSYPL 416
>gi|390337645|ref|XP_001199228.2| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
Length = 333
Score = 263 bits (673), Expect = 7e-68, Method: Compositional matrix adjust.
Identities = 143/337 (42%), Positives = 208/337 (61%), Gaps = 13/337 (3%)
Query: 19 FIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYI 78
++ ++L+ C VVSS S E ++W +HG+ Y + E+ R I+++NL+ +
Sbjct: 3 YLSVLLVAVC---VVSSLSMSFTDFDEDWKEWKNEHGKRYLSDEEEASRRLIWQKNLDIV 59
Query: 79 EKANKE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD 135
+ N + G+ TY LG N+F+DL N EF A+ TG+++ S ++ STF N ++
Sbjct: 60 IRHNLKYDLGHFTYDLGMNQFADLQNKEFVAMMTGFRVNGTS-KAAKGSTFLPPN-NVGK 117
Query: 136 VPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNG 195
+P ++DWR K VTP+KDQ +CG CWAFSA ++EG L+ LSEQ LVDCS +
Sbjct: 118 LPKTVDWRTKGYVTPVKDQGQCGSCWAFSATGSLEGQHFKKTGKLVSLSEQNLVDCS-DK 176
Query: 196 NNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQ 255
N GC GG M++AF+YII GI TE+ YPY A+ G C A ++ Y +V SG E+
Sbjct: 177 NYGCNGGLMDRAFQYIIDAGGIDTEESYPYIAMDGNCHFKTANVGATVTGYTDVTSGSEK 236
Query: 256 ALLKAVS-MQPVSIGIAAYTTEFKSYKEGIFN--GVCGTQLDHAVTIVGFGTTEDGANYW 312
AL KAV+ + P+S+ I A F+ Y+ G++N G T LDH V VG+GTT DG +YW
Sbjct: 237 ALQKAVAHIGPISVAIDASHFSFQLYQSGVYNEPGCSSTLLDHGVLAVGYGTTIDGTDYW 296
Query: 313 LIKNSWGDTWGDAGYMKILRD-EGLCGIGTQSSYPLA 348
++KNSW +TWG GY+ + R+ + CGI TQ+SYPL
Sbjct: 297 IVKNSWAETWGMNGYIWMSRNKDNQCGIATQASYPLV 333
>gi|390368662|ref|XP_780781.2| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
Length = 333
Score = 263 bits (672), Expect = 9e-68, Method: Compositional matrix adjust.
Identities = 143/336 (42%), Positives = 208/336 (61%), Gaps = 13/336 (3%)
Query: 19 FIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYI 78
++ ++L+ C VVSS S E +W +HG+ Y + E+ R I+++NL+ +
Sbjct: 3 YLSVLLVAVC---VVSSLSMSFTDFDEDWNQWKNEHGKRYLSDEEEASRKLIWEKNLDIV 59
Query: 79 EKANKE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD 135
K N + G+ TY LG N+F+DL N+EF A+ TG+++ S ++ STF N ++
Sbjct: 60 IKHNLKYDLGHFTYALGMNQFADLQNEEFVAMMTGFRVNGTS-KAAKGSTFLPSN-NVDK 117
Query: 136 VPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNG 195
+P ++DWR K VTP+KDQ +CG CWAFSA ++EG L+ LSEQ LVDCS
Sbjct: 118 LPKTVDWRTKGYVTPVKDQGQCGSCWAFSATGSLEGQQFKKTGKLVSLSEQNLVDCSYR- 176
Query: 196 NNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQ 255
N GC GG M++AF+YII GI TE Y Y+AV G C + A ++ Y +V SG E+
Sbjct: 177 NYGCHGGFMDRAFQYIIDAGGIDTEATYSYRAVDGNCHFKKANVGATVTGYTDVTSGSEK 236
Query: 256 ALLKAVS-MQPVSIGIAAYTTEFKSYKEGIFN--GVCGTQLDHAVTIVGFGTTEDGANYW 312
AL KAV+ + P+S+ I A FK YK G++N G T+L HAV +VG+GTT DG +YW
Sbjct: 237 ALQKAVAHIGPISVAIDASHKFFKFYKSGVYNEPGCSTTRLGHAVLVVGYGTTSDGTDYW 296
Query: 313 LIKNSWGDTWGDAGYMKILRD-EGLCGIGTQSSYPL 347
++KNSW TWG GY+ + R+ + CGI +++SYP+
Sbjct: 297 IVKNSWAKTWGMNGYLWMSRNKDNQCGIASEASYPM 332
>gi|255546708|ref|XP_002514413.1| cysteine protease, putative [Ricinus communis]
gi|223546510|gb|EEF48009.1| cysteine protease, putative [Ricinus communis]
Length = 324
Score = 263 bits (672), Expect = 9e-68, Method: Compositional matrix adjust.
Identities = 139/347 (40%), Positives = 207/347 (59%), Gaps = 44/347 (12%)
Query: 13 INTIPMFIIIILLVSCAS-----QVVSSRSTHEQS---VVEMHEKWMAQHGRSYKDELEK 64
+++I +F I LV C+ +V H S + E+ E WM++HG++Y+ EK
Sbjct: 5 VSSIFLFTIFTSLVICSVVAHDFSIVGYSPEHLTSMHKLTELFESWMSKHGKTYESIEEK 64
Query: 65 EMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSS 124
R ++FK+NL +I++ N++ TY L N F+DL+++EF+ S
Sbjct: 65 LHRLEVFKDNLMHIDRRNRDVT-TYWLALNEFADLSHEEFK-----------------SK 106
Query: 125 TFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLS 184
+ + L +K AV P+K+Q CG CWAFS VAAVEGI +I NL LS
Sbjct: 107 LAQIRRL------------EKGAVAPVKNQGSCGSCWAFSTVAAVEGINQIVTGNLTSLS 154
Query: 185 EQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAA-QKAAAAKI 243
EQ+L+DC T+ N+GC GG M+ AF+YI+ N G+ E++YPY +GTC ++ I
Sbjct: 155 EQELIDCDTSFNSGCNGGLMDYAFDYIVNNGGLHKEEDYPYLMEEGTCDEKREEMEVVTI 214
Query: 244 SNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFG 303
S Y +VP +E++LLKA++ QP+SI I A +F+ Y G+FNG CGT LDH V VG+G
Sbjct: 215 SGYHDVPENNEESLLKALAHQPLSIAIEASGRDFQFYGRGVFNGPCGTDLDHGVAAVGYG 274
Query: 304 TTEDGANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYP 346
+++ G +Y ++KNSWG WG+ GY+++ R+ EGLCGI +SYP
Sbjct: 275 SSK-GLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 320
>gi|218202077|gb|EEC84504.1| hypothetical protein OsI_31195 [Oryza sativa Indica Group]
Length = 362
Score = 263 bits (671), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 142/355 (40%), Positives = 205/355 (57%), Gaps = 31/355 (8%)
Query: 17 PMFIIIILLVSC----ASQVVSSRSTH-------EQSVVEMHEKWMAQHGRSYKDELEKE 65
P + + LL SC A+ ++ +R+T + +++ W H RSY E
Sbjct: 10 PPVLTLALLASCGALLATSMLPARATAGSCLDVGDMVMMDRFRAWQGAHNRSYPSAEEAL 69
Query: 66 MRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKM---PSPSHRSTT 122
RF +++ N E+I+ N G+ TY+L N F+DLT +EF A YTGY P TT
Sbjct: 70 QRFDVYRRNAEFIDAVNLRGDLTYRLAENEFADLTEEEFLATYTGYYAGDGPVDDSVITT 129
Query: 123 S-----STFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQE-CGCCWAFSAVAAVEGITKIS 176
++F Y+ DVP S+DWR + AV P K Q C CWAF A +E + I
Sbjct: 130 GAGDVDASFSYR----VDVPASVDWRAQGAVVPPKSQTSTCSSCWAFVTAATIESLNMIK 185
Query: 177 GANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQ 236
L+ LSEQQLVDC + + GC G+ +A++++++N G+ TE +YPY A +G C+ A+
Sbjct: 186 TGKLVSLSEQQLVDCDSY-DGGCNLGSYGRAYKWVVENGGLTTEADYPYTARRGPCNRAK 244
Query: 237 KA-AAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDH 295
A AAKI+ + +VP +E AL AV+ QPV++ I + + YK G++ G CGT+L H
Sbjct: 245 SAHHAAKITGFGKVPPRNEAALQAAVARQPVAVAIEV-GSGMQFYKGGVYTGPCGTRLAH 303
Query: 296 AVTIVGFGT-TEDGANYWLIKNSWGDTWGDAGYMKILRD---EGLCGIGTQSSYP 346
AVT+VG+GT GA YW IKNSWG +WG+ GY++ILRD GLCG+ +YP
Sbjct: 304 AVTVVGYGTDASSGAKYWTIKNSWGQSWGERGYIRILRDVGGPGLCGVTLDIAYP 358
>gi|154183745|gb|ABS70713.1| cathepsin L-like cysteine proteinase [Dermacentor variabilis]
Length = 333
Score = 263 bits (671), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 141/338 (41%), Positives = 207/338 (61%), Gaps = 14/338 (4%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEY 77
M I +L CA VV++ ++ + + E + A H +SY+ +E+ +RFKIF EN
Sbjct: 1 MLRISLL---CAFVVVTTAASSHEILRTQWEAFKATHKKSYQSNMEELLRFKIFSENSLL 57
Query: 78 IEKANKEGNR---TYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT 134
+ + N++ R +YKLG N+F DL EF ++ GY+ + R +T N++ +
Sbjct: 58 VARHNEKYARGLVSYKLGMNQFGDLLPHEFARMFNGYRGARTAGRGST--FLPPANVNYS 115
Query: 135 DVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCS-T 193
+P S+DWR+K AVTP+K+Q +CG CWAFS ++EG + L+ LSEQ LVDCS T
Sbjct: 116 SLPQSMDWREKGAVTPVKNQGQCGSCWAFSTTGSLEGQHFLKTGVLVSLSEQNLVDCSET 175
Query: 194 NGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGD 253
GN+GC GG M+ AF+YI N GI TE YPY+A G C ++ A + + ++ G
Sbjct: 176 FGNHGCEGGLMDNAFQYIKANGGIDTEKSYPYEAEDGECRFKKQNVGATDTGFVDIEQGS 235
Query: 254 EQALLKAV-SMQPVSIGIAAYTTEFKSYKEGIFNGV-CGT-QLDHAVTIVGFGTTEDGAN 310
E L KAV ++ PVS+ I A + F+ Y EG+++ C + QLDH V +VG+G EDG
Sbjct: 236 EDDLKKAVATVGPVSVAIDASHSSFQLYSEGVYDETECSSEQLDHGVLVVGYG-VEDGKK 294
Query: 311 YWLIKNSWGDTWGDAGYMKILRD-EGLCGIGTQSSYPL 347
YWL+KNSW ++WGD GY+K+ RD + CGI + +SYPL
Sbjct: 295 YWLVKNSWAESWGDNGYIKMSRDKDNQCGIASAASYPL 332
>gi|357130486|ref|XP_003566879.1| PREDICTED: fruit bromelain-like [Brachypodium distachyon]
Length = 354
Score = 262 bits (670), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 133/324 (41%), Positives = 191/324 (58%), Gaps = 25/324 (7%)
Query: 42 SVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTN 101
+V HE+WMA+ GR+YKD EK R ++F N +++ N+ GNRTY LG N FSDLT+
Sbjct: 33 TVASRHERWMARFGRAYKDADEKARRQEVFGANARHVDAVNRSGNRTYTLGLNHFSDLTD 92
Query: 102 DEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT----------DVPTSLDWRDKKAVTPI 151
EF + GY+ H+ + ++ M+ DVP S+DWR + AVT I
Sbjct: 93 HEFLQQHLGYRH----HQPGPGGLLRPEDQDMSKATALADYGQDVPDSVDWRAQGAVTEI 148
Query: 152 KDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYI 211
K+Q+ CG CWAF+AVAA EG+ KI+ NLI +SEQQ++DC T G N C GG + A Y+
Sbjct: 149 KNQRSCGSCWAFAAVAATEGLVKIATGNLISMSEQQVLDC-TGGGNTCDGGDINAALRYV 207
Query: 212 IQNQGIATEDEYPYQAVQGTC---SAAQKAAAAKISNYEEVPSGDEQALLKAVSMQPVSI 268
+ G+ E Y Y A +G C S A AA+ + + + GDE AL + QPV++
Sbjct: 208 AASGGLQPEAAYAYAAQKGACRGASPANSAASVGGARFARL-GGDEGALRGLAAGQPVAV 266
Query: 269 GIAAYTTEFKSYKEGIFNG--VCGTQLDHAVTIVGFGTTED-GANYWLIKNSWGDTWGDA 325
+ A +F+ YK G++ G CG +L+H VT+VG+G +D G YW++KN WG WG+
Sbjct: 267 ALEASEPDFRHYKSGVYAGSASCGRRLNHGVTVVGYGAEDDSGDEYWVVKNQWGTLWGEK 326
Query: 326 GYMKILRDE---GLCGIGTQSSYP 346
GYM++ R + CGI + + YP
Sbjct: 327 GYMRVARGDVAGANCGIASYAYYP 350
>gi|49387634|dbj|BAD25828.1| putative cysteine proteinase [Oryza sativa Japonica Group]
gi|49388888|dbj|BAD26098.1| putative cysteine proteinase [Oryza sativa Japonica Group]
Length = 358
Score = 262 bits (670), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 142/355 (40%), Positives = 205/355 (57%), Gaps = 31/355 (8%)
Query: 17 PMFIIIILLVSC----ASQVVSSRSTH-------EQSVVEMHEKWMAQHGRSYKDELEKE 65
P + + LL SC A+ ++ +R+T + +++ W H RSY E
Sbjct: 6 PPVLTLALLASCGALLATSMLPARATAGSCLDVGDMVMMDRFRAWQGAHNRSYPSAEEAL 65
Query: 66 MRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKM---PSPSHRSTT 122
RF +++ N E+I+ N G+ TY+L N F+DLT +EF A YTGY P TT
Sbjct: 66 QRFDVYRRNAEFIDAVNLRGDLTYQLAENEFADLTEEEFLATYTGYYAGDGPVDDSVITT 125
Query: 123 S-----STFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQE-CGCCWAFSAVAAVEGITKIS 176
++F Y+ DVP S+DWR + AV P K Q C CWAF A +E + I
Sbjct: 126 GAGDVDASFSYR----VDVPASVDWRAQGAVVPPKSQTSTCSSCWAFVTAATIESLNMIK 181
Query: 177 GANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQ 236
L+ LSEQQLVDC + + GC G+ +A++++++N G+ TE +YPY A +G C+ A+
Sbjct: 182 TGKLVSLSEQQLVDCDSY-DGGCNLGSYGRAYKWVVENGGLTTEADYPYTARRGPCNRAK 240
Query: 237 KA-AAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDH 295
A AAKI+ + +VP +E AL AV+ QPV++ I + + YK G++ G CGT+L H
Sbjct: 241 SAHHAAKITGFGKVPPRNEAALQAAVARQPVAVAIEV-GSGMQFYKGGVYTGPCGTRLAH 299
Query: 296 AVTIVGFGT-TEDGANYWLIKNSWGDTWGDAGYMKILRD---EGLCGIGTQSSYP 346
AVT+VG+GT GA YW IKNSWG +WG+ GY++ILRD GLCG+ +YP
Sbjct: 300 AVTVVGYGTDASSGAKYWTIKNSWGQSWGERGYIRILRDVGGPGLCGVTLDIAYP 354
>gi|242093944|ref|XP_002437462.1| hypothetical protein SORBIDRAFT_10g027570 [Sorghum bicolor]
gi|241915685|gb|EER88829.1| hypothetical protein SORBIDRAFT_10g027570 [Sorghum bicolor]
Length = 366
Score = 262 bits (670), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 138/329 (41%), Positives = 193/329 (58%), Gaps = 26/329 (7%)
Query: 40 EQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDL 99
E+S+ ++E+W A + + +D EK RF +FKEN I + N +GN TY LG NRFSD+
Sbjct: 41 EESLWALYERWCAHYNMA-RDHGEKTRRFDLFKENARRIYEHNHQGNATYTLGLNRFSDM 99
Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD----------------VPTSLDWR 143
T++EF G + +P + + D P ++DWR
Sbjct: 100 TDEEFNRSPYGGCLTAPRMSDDEIEELHHHHHQQEDDGSFNLTHGSGGGKLGAPPAVDWR 159
Query: 144 DKKAVTPIKDQ-QECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGG 202
+ AVT +KDQ CG CWAFSA+AAVEGI I NL+ LSEQQLVDC N+GC GG
Sbjct: 160 GR-AVTRVKDQGPTCGSCWAFSAIAAVEGINAIRTRNLVPLSEQQLVDCDKL-NHGCNGG 217
Query: 203 TMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAVS 262
M AF ++++N+G+ E YPY +G C A I Y+ VP D AL+ AV+
Sbjct: 218 LMTTAFSFVVRNRGVVPEGAYPYMGREGRCKHVM-APPVTIYGYQRVPRFDANALMNAVA 276
Query: 263 MQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTW 322
QPVS+ I A + EF+ Y+ G+FNG CG +L HA T VG+G + G +W++KNSWG W
Sbjct: 277 AQPVSVAIEASSFEFRHYQGGVFNGNCGGRLGHAATAVGYG-ADAGGPFWIVKNSWGPGW 335
Query: 323 GDAGYMKILRD----EGLCGIGTQSSYPL 347
G+ GY++I R+ +G+CGI T++SYP+
Sbjct: 336 GEGGYVRISRNTPVRQGVCGILTENSYPV 364
>gi|115478933|ref|NP_001063060.1| Os09g0381400 [Oryza sativa Japonica Group]
gi|113631293|dbj|BAF24974.1| Os09g0381400 [Oryza sativa Japonica Group]
gi|215678649|dbj|BAG92304.1| unnamed protein product [Oryza sativa Japonica Group]
gi|218202075|gb|EEC84502.1| hypothetical protein OsI_31193 [Oryza sativa Indica Group]
Length = 362
Score = 262 bits (670), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 142/355 (40%), Positives = 205/355 (57%), Gaps = 31/355 (8%)
Query: 17 PMFIIIILLVSC----ASQVVSSRSTH-------EQSVVEMHEKWMAQHGRSYKDELEKE 65
P + + LL SC A+ ++ +R+T + +++ W H RSY E
Sbjct: 10 PPVLTLALLASCGALLATSMLPARATAGSCLDVGDMVMMDRFRAWQGAHNRSYPSAEEAL 69
Query: 66 MRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKM---PSPSHRSTT 122
RF +++ N E+I+ N G+ TY+L N F+DLT +EF A YTGY P TT
Sbjct: 70 QRFDVYRRNAEFIDAVNLRGDLTYQLAENEFADLTEEEFLATYTGYYAGDGPVDDSVITT 129
Query: 123 S-----STFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQE-CGCCWAFSAVAAVEGITKIS 176
++F Y+ DVP S+DWR + AV P K Q C CWAF A +E + I
Sbjct: 130 GAGDVDASFSYR----VDVPASVDWRAQGAVVPPKSQTSTCSSCWAFVTAATIESLNMIK 185
Query: 177 GANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQ 236
L+ LSEQQLVDC + + GC G+ +A++++++N G+ TE +YPY A +G C+ A+
Sbjct: 186 TGKLVSLSEQQLVDCDSY-DGGCNLGSYGRAYKWVVENGGLTTEADYPYTARRGPCNRAK 244
Query: 237 KA-AAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDH 295
A AAKI+ + +VP +E AL AV+ QPV++ I + + YK G++ G CGT+L H
Sbjct: 245 SAHHAAKITGFGKVPPRNEAALQAAVARQPVAVAIEV-GSGMQFYKGGVYTGPCGTRLAH 303
Query: 296 AVTIVGFGT-TEDGANYWLIKNSWGDTWGDAGYMKILRD---EGLCGIGTQSSYP 346
AVT+VG+GT GA YW IKNSWG +WG+ GY++ILRD GLCG+ +YP
Sbjct: 304 AVTVVGYGTDASSGAKYWTIKNSWGQSWGERGYIRILRDVGGPGLCGVTLDIAYP 358
>gi|297818854|ref|XP_002877310.1| hypothetical protein ARALYDRAFT_484828 [Arabidopsis lyrata subsp.
lyrata]
gi|297323148|gb|EFH53569.1| hypothetical protein ARALYDRAFT_484828 [Arabidopsis lyrata subsp.
lyrata]
Length = 376
Score = 262 bits (670), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 142/331 (42%), Positives = 202/331 (61%), Gaps = 23/331 (6%)
Query: 32 VVSSRSTH--EQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTY 89
VV++ +H E V ++E+W+ +HG++Y EKE RFKIFK+NL++IE+ N + NR+Y
Sbjct: 24 VVTATESHRNEAEVRTIYERWLVEHGKNYNGLGEKERRFKIFKDNLKHIEEHNSDPNRSY 83
Query: 90 KLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVT 149
G N+FSDLT DEF+A Y G K+ +S + +YQ +P +DWR++ AV
Sbjct: 84 DRGLNQFSDLTVDEFQASYLGGKI---EKKSLSDVAERYQYKEGDILPDEVDWRERGAVV 140
Query: 150 P-IKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNN-GCGGGTMEKA 207
P +K Q +CG CWAF+A AVEGI +I+ L+ LSEQ+L+DC +N GC GG A
Sbjct: 141 PRVKRQGDCGSCWAFAATGAVEGINQITTGELLSLSEQELIDCDRGKDNFGCAGGGAVWA 200
Query: 208 FEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAK------ISNYEEVPSGDEQALLKAV 261
FE+I +N GI T+++Y Y G +AA KA K I+ +E VP DE +L KAV
Sbjct: 201 FEFIKENGGIVTDEDYGY---TGDDTAACKAIEMKTTRVVTINGHEVVPVNDEMSLKKAV 257
Query: 262 SMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQL-DHAVTIVGFGTTEDGANYWLIKNSWGD 320
S QP+S+ I+A YK G++ G C DH V IVG+GT+ D +YWLI+NSWG
Sbjct: 258 SYQPISVMISA--ANMSDYKSGVYKGPCSNLWGDHNVLIVGYGTSSDEGDYWLIRNSWGP 315
Query: 321 TWGDAGYMKILRD----EGLCGIGTQSSYPL 347
WG+ GY+++ R+ G C + YP+
Sbjct: 316 GWGEGGYLRLQRNFNEPTGKCAVAVAPVYPI 346
>gi|194320502|gb|ACF48469.1| cathepsin L [Triatoma brasiliensis]
Length = 330
Score = 261 bits (668), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 135/311 (43%), Positives = 195/311 (62%), Gaps = 17/311 (5%)
Query: 48 EKWM---AQHGRSYKDELEKEMRFKIFKENLEYIEKAN---KEGNRTYKLGTNRFSDLTN 101
E+W A HG++YK++ E+ R KIF +N + IE N ++G +YK+ N F DL
Sbjct: 25 EEWHVFKAMHGKTYKNQFEEMFRMKIFMDNKKKIEAHNAKYEQGEVSYKMMMNHFGDLMV 84
Query: 102 DEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCW 161
EF+AL G+KM SP + F S +++P ++DWR K AVTP+KDQ +CG CW
Sbjct: 85 HEFKALMNGFKM-SPDTKRNGELYFP----SNSNLPKTVDWRQKGAVTPVKDQGQCGSCW 139
Query: 162 AFSAVAAVEGITKISGANLIQLSEQQLVDCSTN-GNNGCGGGTMEKAFEYIIQNQGIATE 220
+FSA ++EG + L+ LSEQ LVDCST+ GNNGC GG M++AF+Y+ N+GI TE
Sbjct: 140 SFSATGSLEGQVFLKTGKLVSLSEQNLVDCSTSYGNNGCEGGLMDQAFQYVSDNKGIDTE 199
Query: 221 DEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAV-SMQPVSIGIAAYTTEFKS 279
YPY+A + TC + + ++P+GDE+AL A+ ++ P+S+ I A F+
Sbjct: 200 ASYPYEARENTCRFKKNKVGGTDKGHVDIPAGDEKALQNALATVGPISVAIDANHGSFQF 259
Query: 280 YKEGIFN--GVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRDE-GL 336
Y +G++N LDH V VG+G TE+G +YWL+KNSWG +WG+ GY+KI R+
Sbjct: 260 YSKGVYNEPNCSSYDLDHGVLAVGYG-TENGQDYWLVKNSWGPSWGENGYIKIARNHSNH 318
Query: 337 CGIGTQSSYPL 347
CGI + +SYPL
Sbjct: 319 CGIASMASYPL 329
>gi|21593501|gb|AAM65468.1| cysteine proteinase [Arabidopsis thaliana]
Length = 376
Score = 261 bits (668), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 143/351 (40%), Positives = 208/351 (59%), Gaps = 21/351 (5%)
Query: 10 SFKINTIPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFK 69
SF+ + ++++ +S + +E V+ M+E+W+ ++G++Y EKE RFK
Sbjct: 4 SFRTLALLTLSVLLISISLGVVTATESQRNEGGVLTMYEQWLVENGKNYNGLGEKERRFK 63
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQ 129
IFK+NL+ IE+ N + NR+Y+ G N+FSDLT DEF+A Y G KM +S + +YQ
Sbjct: 64 IFKDNLKRIEEHNSDPNRSYERGLNKFSDLTADEFQASYLGGKM---EKKSLSDVAERYQ 120
Query: 130 NLSMTDVPTSLDWRDKKAVTP-IKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQL 188
+P +DWR++ AV P +K Q ECG CWAF+A AVEGI +I+ L+ LSEQ+L
Sbjct: 121 YKEGDVLPDEVDWRERGAVVPRVKRQGECGSCWAFAATGAVEGINQITTGELVSLSEQEL 180
Query: 189 VDCST-NGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAK----- 242
+DC N N GC GG AFE+I +N GI +++ Y Y G +AA KA K
Sbjct: 181 IDCDRGNDNFGCAGGGAVWAFEFIKENGGIVSDEVYGY---TGEDTAACKAIEMKTTRVV 237
Query: 243 -ISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQL-DHAVTIV 300
I+ +E VP DE +L KAV+ QP+S+ I+A YK G++ G C DH V IV
Sbjct: 238 TINGHEVVPVNDEMSLKKAVAYQPISVMISA--ANMSDYKSGVYKGACSNLWGDHNVLIV 295
Query: 301 GFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYPL 347
G+GT+ D +YWLI+NSWG WG+ GY+++ R+ G C + YP+
Sbjct: 296 GYGTSSDEGDYWLIRNSWGPEWGEGGYLRLQRNFHEPTGKCAVAVAPVYPI 346
>gi|18407678|ref|NP_566867.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|30315950|sp|Q9LXW3.1|CPR2_ARATH RecName: Full=Probable cysteine proteinase At3g43960; Flags:
Precursor
gi|7594557|emb|CAB88124.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|26452289|dbj|BAC43231.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|332644328|gb|AEE77849.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 376
Score = 261 bits (668), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 143/351 (40%), Positives = 208/351 (59%), Gaps = 21/351 (5%)
Query: 10 SFKINTIPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFK 69
SF+ + ++++ +S + +E V+ M+E+W+ ++G++Y EKE RFK
Sbjct: 4 SFRTLALLTLSVLLISISLGVVTATESQRNEGEVLTMYEQWLVENGKNYNGLGEKERRFK 63
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQ 129
IFK+NL+ IE+ N + NR+Y+ G N+FSDLT DEF+A Y G KM +S + +YQ
Sbjct: 64 IFKDNLKRIEEHNSDPNRSYERGLNKFSDLTADEFQASYLGGKM---EKKSLSDVAERYQ 120
Query: 130 NLSMTDVPTSLDWRDKKAVTP-IKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQL 188
+P +DWR++ AV P +K Q ECG CWAF+A AVEGI +I+ L+ LSEQ+L
Sbjct: 121 YKEGDVLPDEVDWRERGAVVPRVKRQGECGSCWAFAATGAVEGINQITTGELVSLSEQEL 180
Query: 189 VDCST-NGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAK----- 242
+DC N N GC GG AFE+I +N GI +++ Y Y G +AA KA K
Sbjct: 181 IDCDRGNDNFGCAGGGAVWAFEFIKENGGIVSDEVYGY---TGEDTAACKAIEMKTTRVV 237
Query: 243 -ISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQL-DHAVTIV 300
I+ +E VP DE +L KAV+ QP+S+ I+A YK G++ G C DH V IV
Sbjct: 238 TINGHEVVPVNDEMSLKKAVAYQPISVMISA--ANMSDYKSGVYKGACSNLWGDHNVLIV 295
Query: 301 GFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYPL 347
G+GT+ D +YWLI+NSWG WG+ GY+++ R+ G C + YP+
Sbjct: 296 GYGTSSDEGDYWLIRNSWGPEWGEGGYLRLQRNFHEPTGKCAVAVAPVYPI 346
>gi|16304178|gb|AAL16954.1|AF426414_1 cathepsin L-like cysteine protease precursor [Delia radicum]
Length = 337
Score = 261 bits (667), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 138/313 (44%), Positives = 187/313 (59%), Gaps = 12/313 (3%)
Query: 46 MHEKWMA---QHGRSYKDELEKEMRFKIFKENLEYIEKANK---EGNRTYKLGTNRFSDL 99
+ E+W +H +++ E+E+ R KIF EN I K N+ +G ++KLG N++SD+
Sbjct: 23 IKEEWQTFKMEHRKNFLSEVEERFRMKIFNENRHKIAKHNQLYAQGKVSFKLGLNKYSDM 82
Query: 100 TNDEFRALYTGYKMP-SPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECG 158
EF+ GY R+ S Y + +P S+DWR AVT +KDQ CG
Sbjct: 83 LYHEFKETMNGYNHTMRKVLRAQGFSGIIYIPPANVQIPKSVDWRQHGAVTAVKDQGHCG 142
Query: 159 CCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTN-GNNGCGGGTMEKAFEYIIQNQGI 217
CWAFS+ AA+EG L+ LSEQ LVDCST GNNGC GG M+ AF YI N GI
Sbjct: 143 SCWAFSSTAALEGQHFRKAGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGGI 202
Query: 218 ATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAV-SMQPVSIGIAAYTTE 276
TE YPY+ + +C + A + + ++P GDE+AL+KAV +M PVS+ I A
Sbjct: 203 DTEKSYPYEGIDDSCHFTKSGVGATDTGFVDIPQGDEEALMKAVATMGPVSVAIDASHES 262
Query: 277 FKSYKEGIFNGV-CGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD- 333
F+ Y EG++N C Q LDH V +VG+GT + G +YWL+KNSWG TWGD GY+K+ R+
Sbjct: 263 FQLYSEGVYNEPECDAQNLDHGVLVVGYGTDKTGLDYWLVKNSWGTTWGDQGYIKMARNQ 322
Query: 334 EGLCGIGTQSSYP 346
+ CGI T SSYP
Sbjct: 323 DNQCGIATASSYP 335
>gi|159479072|ref|XP_001697622.1| cysteine endopeptidase [Chlamydomonas reinhardtii]
gi|158274232|gb|EDP00016.1| cysteine endopeptidase [Chlamydomonas reinhardtii]
Length = 469
Score = 261 bits (667), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 130/309 (42%), Positives = 190/309 (61%), Gaps = 10/309 (3%)
Query: 48 EKWMAQHGRSY-KDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRA 106
++W H RSY D E E RFK++ ENLEY+ N ++ L N +DL+ E+++
Sbjct: 14 KEWAQTHSRSYVNDVAEFENRFKVWLENLEYVLAYNAR-TTSHWLTLNHLADLSTPEYKS 72
Query: 107 LYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAV 166
G+ + R+ + F+Y+++ +P ++DWR K AV +K+Q +CG CWAF+
Sbjct: 73 KLLGFDNQARVARNKLKTGFRYEDVDAEALPPAIDWRKKNAVAEVKNQGQCGSCWAFATT 132
Query: 167 AAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQ 226
+VEGI I +L+ LSEQ+LVDC T + GC GG M+ A+ +II+N+GI TE++YPY
Sbjct: 133 GSVEGINAIVTGSLVSLSEQELVDCDTEQDKGCSGGLMDYAYAWIIKNKGINTEEDYPYT 192
Query: 227 AVQGTCSAAQ-KAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIF 285
A+ G C A+ K I +YE+VP DE AL KA + QPV++ I A F+ Y G++
Sbjct: 193 AMDGQCDVAKMKRRVVTIDSYEDVPENDEVALKKAAAHQPVAVAIEADAKSFQLYGGGVY 252
Query: 286 NG-VCGTQLDHAVTIVGFG--TTEDGANYWLIKNSWGDTWGDAGYMKI----LRDEGLCG 338
+ CGT L+H V +VG+G T G+NYW++KNSWG WGDAGY+++ EGLCG
Sbjct: 253 DDPTCGTSLNHGVLVVGYGKDVTGSGSNYWIVKNSWGAEWGDAGYIRLKMGSTDAEGLCG 312
Query: 339 IGTQSSYPL 347
I SYP+
Sbjct: 313 IAMAPSYPV 321
>gi|388890776|gb|AFK80364.1| cysteine proteinase 3, partial [Acanthamoeba castellanii]
Length = 329
Score = 261 bits (666), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 140/335 (41%), Positives = 195/335 (58%), Gaps = 10/335 (2%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEY 77
M I IL++ A V S+ +T + + +WM + +SY +E E R+ +++EN +
Sbjct: 1 MRAITILVLLAAICVASTLATTHDPLTGVFAEWMRDNSKSYSNE-EFVFRWNVWRENQQL 59
Query: 78 IEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
IE+ N+ N+T L N+F DLTN EF L+ G H + ++ + + +
Sbjct: 60 IEEHNRS-NKTSFLAMNKFGDLTNAEFNKLFKGLAFDYSFHANKAAAE---KAVPAPGLS 115
Query: 138 TSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCS-TNGN 196
DWR K AVT +K+Q +CG CW+FS + EG + L LSEQ L+DCS + GN
Sbjct: 116 ADFDWRQKGAVTHVKNQGQCGSCWSFSTTGSTEGANFLKTGRLTSLSEQNLIDCSGSYGN 175
Query: 197 NGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQA 256
NGC GG M+ AFEYII N+GI TE YPYQ Q TC + +++Y +V SGDE A
Sbjct: 176 NGCNGGLMDYAFEYIINNKGIDTEASYPYQTAQYTCQYNPANSGGSLTSYTDVSSGDENA 235
Query: 257 LLKAVSMQPVSIGIAAYTTEFKSYKEGIF--NGVCGTQLDHAVTIVGFGTTEDGANYWLI 314
LL AV+ +P S+ I A F+ Y G++ + TQLDH V VG+G TEDG +YWL+
Sbjct: 236 LLNAVATEPTSVAIDASHNSFQFYSGGVYYESACSSTQLDHGVLAVGWG-TEDGQDYWLV 294
Query: 315 KNSWGDTWGDAGYMKILRDE-GLCGIGTQSSYPLA 348
KNSWG WG AGY+K+ R+ CGI T +SYP A
Sbjct: 295 KNSWGADWGLAGYIKMARNRSNNCGIATSASYPTA 329
>gi|359359068|gb|AEV40975.1| putative cysteine protease [Oryza punctata]
Length = 464
Score = 261 bits (666), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 145/322 (45%), Positives = 208/322 (64%), Gaps = 18/322 (5%)
Query: 40 EQSVVEMHEKWMAQH---GRSYKDEL-EKEMRFKIFKENLEYIE--KANKEGNRTYKLGT 93
E +++ W+A+H G S+ + E E RF++F +NL++++ A+ +G+ ++LG
Sbjct: 59 EAEARAVYDLWVARHRHGGGSHNGFVGEYERRFRVFWDNLKFVDAHNAHADGHGGFRLGM 118
Query: 94 NRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAV-TPIK 152
NRF+DLTNDEFRA Y G +P+ R Y++ + +P S+DWRDK AV +P+K
Sbjct: 119 NRFADLTNDEFRAAYLG---TTPAGRGRHVGEM-YRHDGVEALPDSVDWRDKGAVVSPVK 174
Query: 153 DQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCS-TNGNNGCGGGTMEKAFEYI 211
+Q +CG CWAFSAVAAVEGI KI L+ LSEQ+LV+C+ GN+GC GG M+ AF +I
Sbjct: 175 NQGQCGSCWAFSAVAAVEGINKIVTGELVSLSEQELVECARNGGNSGCNGGIMDDAFAFI 234
Query: 212 IQNQGIATEDEYPYQAVQGTCSAAQKA-AAAKISNYEEVPSGDEQALLKAVSMQPVSIGI 270
+N G+ TE++YPY A+ G C A+K+ I +E+VP DE +L KAV+ QPVS+ I
Sbjct: 235 TRNGGLDTEEDYPYTAMDGKCDLAKKSRKVVSIDGFEDVPENDELSLQKAVAHQPVSVAI 294
Query: 271 AAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGT-TEDGANYWLIKNSWGDTWGDAGYMK 329
A EF+ Y G+F G CGT LDH V VG+GT G +YW ++NSWG WG+ GY++
Sbjct: 295 DAGGREFQLYDSGVFTGRCGTSLDHGVVAVGYGTDAATGTDYWTVRNSWGPDWGENGYIR 354
Query: 330 ILRD----EGLCGIGTQSSYPL 347
+ R+ G CGI +SYP+
Sbjct: 355 MERNVTARTGKCGIAMMASYPI 376
>gi|302845628|ref|XP_002954352.1| hypothetical protein VOLCADRAFT_76255 [Volvox carteri f.
nagariensis]
gi|300260282|gb|EFJ44502.1| hypothetical protein VOLCADRAFT_76255 [Volvox carteri f.
nagariensis]
Length = 489
Score = 261 bits (666), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 140/348 (40%), Positives = 204/348 (58%), Gaps = 22/348 (6%)
Query: 19 FIIIILLVSCASQVVSS-----RSTHEQSVVEMH-------EKWMAQHGRSYKDEL-EKE 65
F+I LLV+ + V ++ R HE+ +++ ++WM Q+ ++Y +++ E E
Sbjct: 5 FLIAALLVAASGGVGAAPELQLREQHEKLLLDAKANPMAAFQQWMMQYTKAYANDIKELE 64
Query: 66 MRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFR-ALYTGYKMPSPSHRSTTSS 124
RF ++ ENL YI N ++ L N F+DLT DEFR L +K S+R SS
Sbjct: 65 TRFSVWLENLNYILAYNAR-TTSHWLHLNAFADLTTDEFRNRLGYDFKARQASNR-LQSS 122
Query: 125 TFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLS 184
F Y N+ +PT +DWR K AVT +K+Q +CG CWAF+ +VEGI I L LS
Sbjct: 123 PFIYDNVDANQLPTEIDWRKKGAVTEVKNQGQCGSCWAFATTGSVEGINAIVTGELASLS 182
Query: 185 EQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQK-AAAAKI 243
EQ+LVDC T+ + GC GG M+ A+++II+N G+ TED+YPY A G C AA+K I
Sbjct: 183 EQELVDCDTDEDRGCSGGLMDYAYQWIIKNGGLDTEDDYPYTAEDGVCVAAKKNRRVVTI 242
Query: 244 SNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNG-VCGTQLDHAVTIVGF 302
Y ++P DE AL KA + QP+++ I A F+ Y G+++ CGT L+H V +VG+
Sbjct: 243 DGYVDIPENDEVALKKAAAHQPIAVAIEADAKSFQLYGGGVYDDPTCGTSLNHGVLVVGY 302
Query: 303 GTTEDGANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYP 346
G NYW++KNSWG WGD GY+++ +G+CGI S+P
Sbjct: 303 GKDPHFGNYWIVKNSWGPEWGDNGYIRLRMGAEDVQGMCGIAMAPSFP 350
>gi|224116884|ref|XP_002317418.1| predicted protein [Populus trichocarpa]
gi|222860483|gb|EEE98030.1| predicted protein [Populus trichocarpa]
Length = 503
Score = 261 bits (666), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 140/333 (42%), Positives = 204/333 (61%), Gaps = 18/333 (5%)
Query: 24 LLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYI-EKAN 82
++V+ S++VS E+S++E+ ++W +H + Y+ E E R++ FK NL+YI EKA
Sbjct: 32 IVVNDFSELVS-----EESIIEIFQQWRDRHQKVYEHAAESEKRYRNFKRNLKYIIEKAG 86
Query: 83 KE-GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLD 141
K+ + +G N+F+DL+N+EF+ LY + + +T+ ++ +NL D P+SLD
Sbjct: 87 KKTAALGHSVGLNKFADLSNEEFKELYLSKVKKPINIKRSTARDWRQRNLQTCDAPSSLD 146
Query: 142 WRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGG 201
WR K VT +KDQ +CG CW+FS A+EGI I +LI LSEQ+LVDC T N GC G
Sbjct: 147 WRKKGVVTAVKDQGDCGSCWSFSTTGAIEGINAIVTGDLISLSEQELVDCDTT-NYGCEG 205
Query: 202 GTMEKAFEYIIQNQGIATEDEYPYQAVQGTC-SAAQKAAAAKISNYEEVPSGDEQALLKA 260
G M+ AFE++I N GI TE YPY V GTC + ++ I Y +V D ALL A
Sbjct: 206 GYMDYAFEWVINNGGIDTEANYPYTGVDGTCNTTKEEIKVVSIDGYTDVDETD-SALLCA 264
Query: 261 VSMQPVSIGIAAYTTEFKSYKEGIFNGVCG---TQLDHAVTIVGFGTTEDGANYWLIKNS 317
QP+S+G+ +F+ Y GI++G C +DHAV IVG+G +E+G +YW++KNS
Sbjct: 265 TVQQPISVGMDGSALDFQLYTGGIYDGDCSDDPNDIDHAVLIVGYG-SENGEDYWIVKNS 323
Query: 318 WGDTWGDAGYMKILRDE----GLCGIGTQSSYP 346
WG WG GY I R+ G+C I ++SYP
Sbjct: 324 WGTEWGMEGYFYIKRNTDLPYGVCAINAEASYP 356
>gi|402770501|gb|AFQ98385.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 260 bits (665), Expect = 6e-67, Method: Compositional matrix adjust.
Identities = 142/336 (42%), Positives = 204/336 (60%), Gaps = 14/336 (4%)
Query: 21 IIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEK 80
++ L V CA V+ ++ ++ + E + H ++Y+ +E+ +RFKIF EN I K
Sbjct: 1 MLRLSVLCAIAAVTVAASSQEILRTQWEAFKTTHKKTYQSHMEELLRFKIFTENSLIIAK 60
Query: 81 ANKE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTF-KYQNLSMTDV 136
N + G +YKLG N+F DL EF ++ G+ R T STF N++ + +
Sbjct: 61 HNAKYAKGLVSYKLGMNQFGDLLAHEFARIFNGHH----GTRKTGGSTFLPPANVNDSSL 116
Query: 137 PTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTN-G 195
P ++DWR K AVTP+KDQ +CG CWAFSA ++EG + L+ LSEQ LVDCS + G
Sbjct: 117 PKAVDWRKKGAVTPVKDQGQCGSCWAFSATGSLEGQHFLKNGELVSLSEQNLVDCSQSFG 176
Query: 196 NNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQ 255
NNGC GG ME AF+YI N GI TE YPY+AV G C ++ A + Y E+ +G E
Sbjct: 177 NNGCEGGLMEDAFKYIKANDGIDTEKSYPYEAVDGECRFKKEDVGATDTGYVEIKAGSED 236
Query: 256 ALLKAV-SMQPVSIGIAAYTTEFKSYKEGIFNGV-CGTQ-LDHAVTIVGFGTTEDGANYW 312
L KAV ++ P+S+ I A + F+ Y EG+++ C ++ LDH V +VG+G + G YW
Sbjct: 237 DLKKAVATVGPISVAIDASHSSFQLYSEGVYDEPECSSEDLDHGVLVVGYG-VKGGKKYW 295
Query: 313 LIKNSWGDTWGDAGYMKILRD-EGLCGIGTQSSYPL 347
L+KNSW ++WGD GY+ + RD CGI +Q+SYPL
Sbjct: 296 LVKNSWAESWGDQGYILMSRDNNNQCGIASQASYPL 331
>gi|158300877|ref|XP_001689282.1| AGAP011828-PA [Anopheles gambiae str. PEST]
gi|157013372|gb|EDO63348.1| AGAP011828-PA [Anopheles gambiae str. PEST]
Length = 344
Score = 260 bits (665), Expect = 6e-67, Method: Compositional matrix adjust.
Identities = 145/348 (41%), Positives = 200/348 (57%), Gaps = 27/348 (7%)
Query: 19 FIIIILLVSCASQVVSSRSTHEQSVVEM-HEKWMA---QHGRSYKDELEKEMRFKIFKEN 74
F+I+IL A+ +S + E+ E+W A QH + Y E E+ +R KI+ +N
Sbjct: 4 FLILILGFVAAANAIS--------IFELVKEEWTAFKLQHRKKYDSETEERIRMKIYVQN 55
Query: 75 LEYIEKANKE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNL 131
I K N+ G ++L N+++DL ++EF G+ K
Sbjct: 56 KHKIAKHNQRYDLGQEKFRLRVNKYADLLHEEFVHTLNGFNRSVSGKGQLLRGELKPIEE 115
Query: 132 SMT-------DVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLS 184
+T DVPT++DWR K AVT +KDQ CG CW+FSA A+EG L+ LS
Sbjct: 116 PVTWIEPANVDVPTAMDWRTKGAVTQVKDQGHCGSCWSFSATGALEGQHFRKTGKLVSLS 175
Query: 185 EQQLVDCSTN-GNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKI 243
EQ LVDCS GNNGC GG M+ AF+YI N+GI TE YPY+A+ C KA A
Sbjct: 176 EQNLVDCSQKYGNNGCNGGMMDFAFQYIKDNKGIDTEKSYPYEAIDDECHYNPKAVGATD 235
Query: 244 SNYEEVPSGDEQALLKAV-SMQPVSIGIAAYTTEFKSYKEGI-FNGVCGT-QLDHAVTIV 300
+ ++P G+E+AL+KA+ ++ PVS+ I A F+ Y EG+ + C + QLDH V V
Sbjct: 236 KGFVDIPQGNEKALMKALATVGPVSVAIDASHESFQFYSEGVYYEPQCDSEQLDHGVLAV 295
Query: 301 GFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD-EGLCGIGTQSSYPL 347
G+GTTEDG +YWL+KNSWG TWGD GY+K+ R+ + CGI T +SYPL
Sbjct: 296 GYGTTEDGEDYWLVKNSWGTTWGDQGYVKMARNRDNHCGIATTASYPL 343
>gi|402770505|gb|AFQ98387.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 260 bits (665), Expect = 6e-67, Method: Compositional matrix adjust.
Identities = 142/336 (42%), Positives = 205/336 (61%), Gaps = 14/336 (4%)
Query: 21 IIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEK 80
++ L V CA V+ ++ ++ + E + H ++Y+ +E+ +RFKIF EN I K
Sbjct: 1 MLRLSVLCAIVAVTVAASSQEILRTQWEAFKTTHKKTYQSHMEELLRFKIFTENSLIIAK 60
Query: 81 ANKE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTF-KYQNLSMTDV 136
N + G +YKLG N+F DL EF ++ G++ R T STF N++ + +
Sbjct: 61 HNAKYAKGLVSYKLGMNQFGDLLAHEFARIFNGHR----GTRKTGGSTFLPPANVNDSSL 116
Query: 137 PTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTN-G 195
P ++DWR K AVTP+KDQ +CG CWAFSA ++EG + L+ LSEQ LVDCS + G
Sbjct: 117 PKAVDWRKKGAVTPVKDQGQCGSCWAFSATGSLEGQHFLKNGELVSLSEQNLVDCSQSFG 176
Query: 196 NNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQ 255
NNGC GG ME AF+YI N GI TE YPY+AV G C ++ A + Y E+ +G E
Sbjct: 177 NNGCEGGLMEDAFKYIKANDGIDTEKSYPYEAVDGECRFKKEDVGATDTGYVEIKAGSEV 236
Query: 256 ALLKAV-SMQPVSIGIAAYTTEFKSYKEGIFNGV-CGTQ-LDHAVTIVGFGTTEDGANYW 312
L KAV ++ P+S+ I A + F+ Y EG+++ C ++ LDH V +VG+G + G YW
Sbjct: 237 DLKKAVATVGPISVAIDASHSSFQLYSEGVYDEPECSSEDLDHGVLVVGYG-VKGGKKYW 295
Query: 313 LIKNSWGDTWGDAGYMKILRD-EGLCGIGTQSSYPL 347
L+KNSW ++WGD GY+ + RD CGI +Q+SYPL
Sbjct: 296 LVKNSWAESWGDQGYILMSRDNNNQCGIASQASYPL 331
>gi|157132324|ref|XP_001655999.1| cathepsin l [Aedes aegypti]
gi|108881694|gb|EAT45919.1| AAEL002833-PA [Aedes aegypti]
Length = 339
Score = 260 bits (665), Expect = 6e-67, Method: Compositional matrix adjust.
Identities = 145/353 (41%), Positives = 208/353 (58%), Gaps = 36/353 (10%)
Query: 16 IPMFIIIILLVSCASQVVSSRSTHEQSVVEM-HEKWMA---QHGRSYKDELEKEMRFKIF 71
+ + I+++ V+ A+ V S+ E+ E+W A QH ++Y E E+ +R KI+
Sbjct: 1 MKILILLMAFVAAANAV---------SLYELVKEEWNAFKLQHRKNYDSETEERIRLKIY 51
Query: 72 KENLEYIEKANKE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFK- 127
+N I K N+ G Y+L N+++DL ++EF G+ +R+ + + K
Sbjct: 52 VQNKHKIAKHNQRFDLGQEKYRLRVNKYADLLHEEFVQTVNGF------NRTDSKKSLKG 105
Query: 128 --------YQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGAN 179
+ + +VPT++DWR K AVTP+KDQ CG CW+FSA A+EG
Sbjct: 106 VRIEEPVTFIEPANVEVPTTVDWRKKGAVTPVKDQGHCGSCWSFSATGALEGQHFRKTGK 165
Query: 180 LIQLSEQQLVDCSTN-GNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKA 238
L+ LSEQ LVDCS GNNGC GG M+ AF+YI N GI TE YPY+A+ TC KA
Sbjct: 166 LVSLSEQNLVDCSGKYGNNGCNGGMMDYAFQYIKDNGGIDTEKSYPYEAIDDTCHFNPKA 225
Query: 239 AAAKISNYEEVPSGDEQALLKAV-SMQPVSIGIAAYTTEFKSYKEGI-FNGVCGTQ-LDH 295
A Y ++P GDE+AL KA+ ++ PVSI I A F+ Y EG+ + C ++ LDH
Sbjct: 226 VGATDKGYVDIPQGDEEALKKALATVGPVSIAIDASHESFQFYSEGVYYEPQCDSENLDH 285
Query: 296 AVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD-EGLCGIGTQSSYPL 347
V VG+GT+E+G +YWL+KNSWG TWGD GY+K+ R+ + CG+ T +SYPL
Sbjct: 286 GVLAVGYGTSEEGEDYWLVKNSWGTTWGDQGYVKMARNRDNHCGVATCASYPL 338
>gi|326520387|dbj|BAK07452.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 349
Score = 260 bits (665), Expect = 7e-67, Method: Compositional matrix adjust.
Identities = 139/346 (40%), Positives = 199/346 (57%), Gaps = 18/346 (5%)
Query: 14 NTIPMFIIIILLVSCASQVVSSRS-----THEQSVVEMHEKWMAQHGRSYKDELEKEMRF 68
T P+ I++LL + S + +++ +W A H RSY E+ RF
Sbjct: 7 GTRPVIPILVLLTGGLFAAFPAASGGRVDAGDMLMMDRFRQWQATHNRSYLSAEERLRRF 66
Query: 69 KIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALY----TGYKMPSPSHRSTTSS 124
++++ N+EYI+ N+ G TY+LG N+F+DLT +EF A Y TG + + + S
Sbjct: 67 EVYRTNVEYIDATNRRGGLTYELGENQFADLTGEEFLARYAGGHTGSAITTAAEADGLWS 126
Query: 125 TFKYQNLSMTDVPTSLDWRDKKAVTPIKDQ-QECGCCWAFSAVAAVEGITKISGANLIQL 183
+ D P S+DWR K AVTP+K+Q +C CWAFSAVA +E + I L+ L
Sbjct: 127 SGGSDGSLEADPPASVDWRAKGAVTPVKNQGSQCYSCWAFSAVATMESLYFIKTGKLVAL 186
Query: 184 SEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKI 243
SEQQLVDC + GC G +AF++I++N GI T +YPY+AV+G CSAA+ A I
Sbjct: 187 SEQQLVDCDKY-DGGCNKGYYHRAFQWIMENGGITTAAQYPYKAVRGACSAAKPAV--TI 243
Query: 244 SNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFG 303
+ + V +E AL AV+ QP+ + I + YK G+F+ CG Q+ HAV VG+G
Sbjct: 244 TGHLAVAK-NELALQSAVARQPIGVAIEV-PISMQFYKSGVFSAACGIQMSHAVVTVGYG 301
Query: 304 TTEDGANYWLIKNSWGDTWGDAGYMKILRD---EGLCGIGTQSSYP 346
G YWL+KNSWG TWG+AGY+++ RD GLCGI ++YP
Sbjct: 302 ADASGLKYWLVKNSWGQTWGEAGYIRMRRDVGGGGLCGIALDTAYP 347
>gi|402770503|gb|AFQ98386.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 260 bits (664), Expect = 7e-67, Method: Compositional matrix adjust.
Identities = 142/336 (42%), Positives = 204/336 (60%), Gaps = 14/336 (4%)
Query: 21 IIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEK 80
++ L V CA V+ ++ ++ + E + H ++Y+ +E+ +RFKIF EN I K
Sbjct: 1 MLRLSVLCAIVAVTVAASSQEILRTQWEAFKTTHKKTYQSHMEELLRFKIFTENSLIIAK 60
Query: 81 ANKE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTF-KYQNLSMTDV 136
N + G +YKLG N+F DL EF ++ G+ R T STF N++ + +
Sbjct: 61 HNAKYAKGLVSYKLGMNQFGDLLAHEFARIFNGHH----GTRKTGGSTFLPPANVNDSSL 116
Query: 137 PTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTN-G 195
P +DWR K AVTP+KDQ +CG CWAFSA ++EG + L+ LSEQ LVDCS + G
Sbjct: 117 PKVVDWRKKGAVTPVKDQGQCGSCWAFSATGSLEGRHFLKNGELVSLSEQNLVDCSQSFG 176
Query: 196 NNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQ 255
NNGC GG ME AF+YI +N GI TE YPY+AV G C ++ A + Y E+ +G E
Sbjct: 177 NNGCEGGLMEDAFKYIKENDGIDTEKSYPYEAVDGECRFKKEDVGATDTGYVEIKAGSED 236
Query: 256 ALLKAV-SMQPVSIGIAAYTTEFKSYKEGIFNGV-CGTQ-LDHAVTIVGFGTTEDGANYW 312
L KAV ++ P+S+ I A + F+ Y EG+++ C ++ LDH V +VG+G + G YW
Sbjct: 237 DLKKAVATVGPISVAIDASHSSFQLYSEGVYDEPECSSEDLDHGVLVVGYG-VKGGKKYW 295
Query: 313 LIKNSWGDTWGDAGYMKILRD-EGLCGIGTQSSYPL 347
L+KNSW ++WGD GY+ + RD CGI +Q+SYPL
Sbjct: 296 LVKNSWAESWGDQGYILMSRDNNNQCGIASQASYPL 331
>gi|357160095|ref|XP_003578656.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP2-like
[Brachypodium distachyon]
Length = 377
Score = 260 bits (664), Expect = 7e-67, Method: Compositional matrix adjust.
Identities = 142/334 (42%), Positives = 190/334 (56%), Gaps = 31/334 (9%)
Query: 41 QSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKE--GNRTYKLGTNRFSD 98
Q++ ++W A+HGR+Y E+ R +++ N+ YIE AN + TY+LG ++D
Sbjct: 47 QTMAPRFQRWKAEHGRAYATRDEELRRLRVYARNVRYIEAANGDPAAGLTYQLGETAYTD 106
Query: 99 LTNDEFRALYTGYKMPSP---SHRSTTSSTFK---------------YQNLSMTDVPTSL 140
LT DEF A+YT PSP +H + Y N+S P S+
Sbjct: 107 LTADEFTAMYTS---PSPVLSAHDDEAAGAMMITTRAGAVDAGGQQVYFNVSTAGAPASV 163
Query: 141 DWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCG 200
DWR K AVT +K+Q CG CWAFS VA VEGI +I NLI LSEQ+LVDC T + GC
Sbjct: 164 DWRAKGAVTEVKNQGRCGSCWAFSTVAVVEGIHQIRTGNLISLSEQELVDCDTL-DYGCD 222
Query: 201 GGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQ-KAAAAKISNYEEVPSGDEQALLK 259
GG A E+I N GIATE +YPY G C A + AA IS + V + E +L
Sbjct: 223 GGVSYHALEWIASNGGIATEADYPYTGKDGACVANKLPLHAAAISGFARVATRSEPSLAN 282
Query: 260 AVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIV-GFGTTEDGANYWLIKNSW 318
AV+ QPV++ I A F+ Y +G++NG CGT+L+H VT+V DG YW++KNSW
Sbjct: 283 AVAAQPVAVSIEAGGANFQHYVKGVYNGPCGTRLNHGVTVVGYGEEEGDGEKYWIVKNSW 342
Query: 319 GDTWGDAGYMKILRD-----EGLCGIGTQSSYPL 347
G WGD GY ++ +D EGLCGI + S+PL
Sbjct: 343 GKKWGDGGYFRMKKDVAGKPEGLCGIAIRPSFPL 376
>gi|449500383|ref|XP_004161083.1| PREDICTED: vignain-like [Cucumis sativus]
Length = 345
Score = 260 bits (664), Expect = 8e-67, Method: Compositional matrix adjust.
Identities = 141/319 (44%), Positives = 208/319 (65%), Gaps = 20/319 (6%)
Query: 40 EQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDL 99
E+S+ +++E+W H S ++ EK RF +FKEN+ ++ N+ ++ YKL N+F+D+
Sbjct: 34 EESLWQLYERWGKHHTIS-RNLKEKHKRFSVFKENVNHVFTVNQM-DKPYKLKLNKFADM 91
Query: 100 TNDEFRALYTGYKMPSPSH------RSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKD 153
+N EF Y + SH R + F Y+ TD+P+S+D R++ AV +K+
Sbjct: 92 SNYEFVNFYA---RSNISHYRKLHERRRGAGGFMYE--QDTDLPSSVDGRERGAVNAVKE 146
Query: 154 QQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQ 213
Q CG CWAFS+VAAVEGI KI L+ LSEQ+L+DC+ N GC GG ME AF++I +
Sbjct: 147 QGRCGSCWAFSSVAAVEGINKIKTNQLLSLSEQELLDCNYR-NKGCNGGFMEIAFDFIKR 205
Query: 214 NQGIATEDEYPYQAVQGTCSAAQ-KAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAA 272
N GIATE+ YPY +G C +++ + KI YE VP +E AL++AV+ QPVS+ I A
Sbjct: 206 NGGIATENSYPYHGSRGLCRSSRISSPIVKIDGYESVPE-NEDALMQAVANQPVSVAIDA 264
Query: 273 YTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILR 332
+F+ Y +G+F+G CGT+L+H V +G+GTTEDG +YWL++NSWG WG+ GY+++ R
Sbjct: 265 AGRDFQFYSQGVFDGYCGTELNHGVVAIGYGTTEDGTDYWLVRNSWGVGWGEDGYVRMKR 324
Query: 333 D----EGLCGIGTQSSYPL 347
EGLCGI ++SYP+
Sbjct: 325 GVEQAEGLCGIAMEASYPI 343
>gi|170041165|ref|XP_001848344.1| cathepsin l [Culex quinquefasciatus]
gi|167864709|gb|EDS28092.1| cathepsin l [Culex quinquefasciatus]
Length = 340
Score = 260 bits (664), Expect = 9e-67, Method: Compositional matrix adjust.
Identities = 139/322 (43%), Positives = 195/322 (60%), Gaps = 16/322 (4%)
Query: 42 SVVEM-HEKWMA---QHGRSYKDELEKEMRFKIFKENLEYIEKANK---EGNRTYKLGTN 94
S+ E+ E+W A QH + Y E E+ +R KI+ +N I K N+ +G ++L N
Sbjct: 18 SIFELVKEEWNAYKLQHRKKYDSETEERLRLKIYVQNKHKIAKHNQRFEQGQEKFRLRVN 77
Query: 95 RFSDLTNDEFRALYTGYKMPS---PSHRSTT-SSTFKYQNLSMTDVPTSLDWRDKKAVTP 150
+++DL ++EF G+ + P + Y + +VP ++DWR+K AVTP
Sbjct: 78 KYTDLLHEEFVQTLNGFNRTNAKKPMLKGVKIDEPVTYIEPANVEVPKTVDWREKGAVTP 137
Query: 151 IKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTN-GNNGCGGGTMEKAFE 209
+KDQ CG CW+FSA A+EG L+ LSEQ LVDCST GNNGC GG M+ AF+
Sbjct: 138 VKDQGHCGSCWSFSATGALEGQHFRKTGKLVSLSEQNLVDCSTKYGNNGCNGGMMDFAFQ 197
Query: 210 YIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAVSMQ-PVSI 268
YI N GI TE YPY+A+ TC KA A + ++P GDE+AL+KA++ PVS+
Sbjct: 198 YIKDNGGIDTEKAYPYEAIDDTCHYNPKAVGATDKGFVDIPQGDEKALMKAIATAGPVSV 257
Query: 269 GIAAYTTEFKSYKEGI-FNGVCGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAG 326
I A F+ Y EG+ + C ++ LDH V VG+GT+E+G +YWL+KNSWG TWGD G
Sbjct: 258 AIDASHESFQFYSEGVYYEPQCDSENLDHGVLAVGYGTSEEGEDYWLVKNSWGTTWGDQG 317
Query: 327 YMKILRD-EGLCGIGTQSSYPL 347
Y+K+ R+ + CGI T +SYPL
Sbjct: 318 YVKMARNRDNHCGIATAASYPL 339
>gi|195429415|ref|XP_002062758.1| GK19626 [Drosophila willistoni]
gi|194158843|gb|EDW73744.1| GK19626 [Drosophila willistoni]
Length = 341
Score = 259 bits (663), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 133/340 (39%), Positives = 198/340 (58%), Gaps = 13/340 (3%)
Query: 19 FIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYI 78
F +I LL++ + ++ ++ + V E + +H ++Y D E+ R KIF EN +I
Sbjct: 3 FALITLLIALVA--MTQAVSYSELVREEWNTFKLEHRKNYADSTEETFRMKIFNENKHHI 60
Query: 79 EKANKE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFK---YQNLS 132
K N+ G +YKL N+++D+ + EFR G+ +T +F + +
Sbjct: 61 AKHNQRYATGEVSYKLALNKYADMLHHEFRETMNGFNYTLHKQLRSTDESFTGVTFISPE 120
Query: 133 MTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCS 192
+PT++DWR K AVT +KDQ CG CWAFS+ A+EG L+ LSEQ LVDCS
Sbjct: 121 HVKLPTAVDWRTKGAVTEVKDQGHCGSCWAFSSTGAIEGQHFRKSGTLVSLSEQNLVDCS 180
Query: 193 TN-GNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPS 251
T GNNGC GG M+ AF Y+ N GI TE Y Y+ + +C + + A + ++P
Sbjct: 181 TKYGNNGCNGGLMDNAFRYVKDNGGIDTEKSYAYEGIDDSCHFDKNSIGATDRGFADIPQ 240
Query: 252 GDEQALLKAV-SMQPVSIGIAAYTTEFKSYKEGIFN--GVCGTQLDHAVTIVGFGTTEDG 308
G+E+ L +AV ++ PVS+ I A F+ Y EG+++ LDH V +VG+GT +DG
Sbjct: 241 GNEKKLAQAVATIGPVSVAIDASQQSFQFYSEGVYDEPNCSAENLDHGVLVVGYGTEKDG 300
Query: 309 ANYWLIKNSWGDTWGDAGYMKILRD-EGLCGIGTQSSYPL 347
++YWL+KNSWG TWGD G++K+ R+ E CGI + SSYPL
Sbjct: 301 SDYWLVKNSWGTTWGDKGFIKMSRNKENQCGIASASSYPL 340
>gi|440793751|gb|ELR14926.1| Cysteine proteinase 5, putative [Acanthamoeba castellanii str.
Neff]
Length = 326
Score = 259 bits (663), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 135/335 (40%), Positives = 198/335 (59%), Gaps = 13/335 (3%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEY 77
M +L + A V S+ + + + WM +H +SY +E E R+ +++EN Y
Sbjct: 1 MRTTTLLALCVALFVASTFAVSHDPLTGVFADWMQEHQKSYANE-EFVYRWNVWRENYLY 59
Query: 78 IEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
IE N + N+++ L N+F DLTN EF L+ G + + + + +P
Sbjct: 60 IEAHNHQ-NKSFHLAMNKFGDLTNAEFNKLFKGLSITADQAKQESDIA------PAPGLP 112
Query: 138 TSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTN-GN 196
DWR K AVT +K+Q +CG CW+FS + EG + L LSEQ LVDCST+ GN
Sbjct: 113 ADFDWRQKGAVTHVKNQGQCGSCWSFSTTGSTEGANFLKHGRLTSLSEQNLVDCSTSYGN 172
Query: 197 NGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQA 256
+GC GG M+ AFEYII+N+GI TE+ YPY A QGTC ++ + ++ +Y VPSG+E A
Sbjct: 173 HGCNGGLMDYAFEYIIRNKGIDTEESYPYHASQGTCRYNKQHSGGELVSYTNVPSGNEGA 232
Query: 257 LLKAVSMQPVSIGIAAYTTEFKSYKEGIFN--GVCGTQLDHAVTIVGFGTTEDGANYWLI 314
LL AV+ QP S+ I A + F+ YK G+++ ++LDH V VG+G DG +YWL+
Sbjct: 233 LLNAVATQPTSVAIDASHSSFQFYKGGVYDEPACSSSRLDHGVLAVGWG-VRDGKDYWLV 291
Query: 315 KNSWGDTWGDAGYMKILRDE-GLCGIGTQSSYPLA 348
KNSWG WG +GY+++ R++ CGI T +S+P A
Sbjct: 292 KNSWGADWGLSGYIEMSRNKHNQCGIATAASHPHA 326
>gi|7381610|gb|AAF61565.1|AF227957_1 cathepsin L-like proteinase precursor [Rhipicephalus microplus]
Length = 332
Score = 259 bits (663), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 143/336 (42%), Positives = 203/336 (60%), Gaps = 14/336 (4%)
Query: 21 IIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEK 80
++ L V CA V+ ++ ++ + E + H +SY+ +E+ +RFKIF EN I K
Sbjct: 1 MLRLSVLCAIVAVTVAASSQEILRTQWEAFKTTHKKSYQSHMEELLRFKIFTENSLIIAK 60
Query: 81 ANKE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTF-KYQNLSMTDV 136
N + G +YKLG N+F DL EF ++ G+ R T STF N++ + +
Sbjct: 61 HNAKYAKGLVSYKLGMNQFGDLLAHEFARIFNGHH----GTRKTGGSTFLPPANVNDSSL 116
Query: 137 PTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTN-G 195
P +DWR K AVTP+KDQ +CG CWAFSA ++EG + L+ LSEQ LVDCS + G
Sbjct: 117 PKVVDWRKKGAVTPVKDQGQCGSCWAFSATGSLEGQHFLKNGELVSLSEQNLVDCSQSFG 176
Query: 196 NNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQ 255
NNGC GG ME AF+YI N GI TE YPY+AV G C ++ A + Y E+ +G E
Sbjct: 177 NNGCEGGLMEDAFKYIKANDGIDTEKSYPYEAVDGECRFKKEDVGATDTGYVEIKAGSEV 236
Query: 256 ALLKAV-SMQPVSIGIAAYTTEFKSYKEGIFNGV-CGTQ-LDHAVTIVGFGTTEDGANYW 312
L KAV ++ P+S+ I A + F+ Y EG+++ C ++ LDH V +VG+G + G YW
Sbjct: 237 DLKKAVATVGPISVAIDASHSSFQLYSEGVYDEPECSSEDLDHGVLVVGYG-VKGGKKYW 295
Query: 313 LIKNSWGDTWGDAGYMKILRD-EGLCGIGTQSSYPL 347
L+KNSW ++WGD GY+ + RD CGI +Q+SYPL
Sbjct: 296 LVKNSWAESWGDQGYILMSRDNNNQCGIASQASYPL 331
>gi|391343119|ref|XP_003745860.1| PREDICTED: cathepsin L-like [Metaseiulus occidentalis]
Length = 385
Score = 259 bits (662), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 138/344 (40%), Positives = 207/344 (60%), Gaps = 16/344 (4%)
Query: 10 SFKINTIPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFK 69
S +++ + ++ ++ AS + + + ++ E + A+H + Y+ E+ MR
Sbjct: 49 SLRVSAGMKLLAVLAVIGLASALSPNPNLNQH-----WENFKAEHNKKYESFPEELMRRL 103
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMP--SPSHRSTTSSTFK 127
IF+EN ++IE N + + LG N F DLTN E+R Y GY+ P +PS S S +
Sbjct: 104 IFEENHQFIEDHNSKKEFDFYLGMNHFGDLTNKEYRERYLGYRRPENTPSKASYIFSRAE 163
Query: 128 YQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQ 187
+ DVP +DWRD+ VTP+K+Q +CG CWAFSAV ++EG S L+ LSEQ
Sbjct: 164 ----KIEDVPDQIDWRDQGFVTPVKNQGQCGSCWAFSAVGSLEGQHFKSTGKLVSLSEQN 219
Query: 188 LVDCST-NGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNY 246
LVDCST GN+GC GG M++AFEY+ N GI TED YPY G+C K+ A + +
Sbjct: 220 LVDCSTPEGNSGCNGGWMDQAFEYVKDNHGIDTEDSYPYVGTDGSCHFKNKSIGATLKGF 279
Query: 247 EEVPSGDEQALLKAVSMQ-PVSIGIAAYTTEFKSYKEGIFN-GVCGT-QLDHAVTIVGFG 303
+V GDE+AL +AV + PVS+ I A + F+ Y+ G++N C T +LDH V +VG+G
Sbjct: 280 MDVKEGDEEALRQAVGVAGPVSVAIDASSMLFQFYRGGVYNVPWCSTSELDHGVLVVGYG 339
Query: 304 TTEDGANYWLIKNSWGDTWGDAGYMKILRDEG-LCGIGTQSSYP 346
G ++W++KNSWG WG GY+++ R++G CGI +++S P
Sbjct: 340 KQFQGKDFWMVKNSWGVGWGIYGYIEMSRNKGNQCGIASKASIP 383
>gi|195056367|ref|XP_001995082.1| GH22826 [Drosophila grimshawi]
gi|193899288|gb|EDV98154.1| GH22826 [Drosophila grimshawi]
Length = 340
Score = 259 bits (662), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 138/340 (40%), Positives = 195/340 (57%), Gaps = 15/340 (4%)
Query: 19 FIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYI 78
+I +L + +Q VS + E + + +H + Y+DE E+ R KIF EN I
Sbjct: 4 YIFALLALVAVAQAVSFADV----IKEEWQTFKLEHRKQYQDETEERFRLKIFNENKHKI 59
Query: 79 EKANK---EGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFK---YQNLS 132
K N+ G ++K+G N+++D+ + EF G+ + +TF + +
Sbjct: 60 AKHNQLYAAGEVSFKMGLNKYADMLHHEFHETMNGFNYTLHKQLRASDATFTGVTFISPE 119
Query: 133 MTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCS 192
+P S+DWR+K AVT +KDQ CG CWAFS+ A+EG LI LSEQ LVDCS
Sbjct: 120 HVKLPQSVDWRNKGAVTGVKDQGHCGSCWAFSSTGALEGQHFRKTGTLISLSEQNLVDCS 179
Query: 193 TN-GNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPS 251
T GNNGC GG M+ AF YI N GI TE YPY+ + +C + A + ++P
Sbjct: 180 TKYGNNGCNGGLMDNAFRYIKDNGGIDTEKSYPYEGIDDSCHFNKGTIGATDRGFTDIPQ 239
Query: 252 GDEQALLKAV-SMQPVSIGIAAYTTEFKSYKEGIFN-GVCGTQ-LDHAVTIVGFGTTEDG 308
GDE+ L +AV ++ PVS+ I A F+ Y G+++ C Q LDH V +VG+GT E+G
Sbjct: 240 GDEKKLAQAVATIGPVSVAIDASHESFQFYSTGVYDEPQCDPQNLDHGVLVVGYGTDENG 299
Query: 309 ANYWLIKNSWGDTWGDAGYMKILR-DEGLCGIGTQSSYPL 347
+YWL+KNSWG TWGD G++K+ R D+ CGI T SSYPL
Sbjct: 300 KDYWLVKNSWGTTWGDKGFIKMARNDDNQCGIATASSYPL 339
>gi|194701748|gb|ACF84958.1| unknown [Zea mays]
gi|414589103|tpg|DAA39674.1| TPA: thiol protease SEN102 [Zea mays]
Length = 374
Score = 259 bits (662), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 143/347 (41%), Positives = 196/347 (56%), Gaps = 33/347 (9%)
Query: 29 ASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNR- 87
A + S ST + S++E ++W A + +SY E+ RF++ N+ YIE N E
Sbjct: 32 AGDMERSMSTDDSSMIERFQRWKAAYNKSYATVAEERRRFRVCARNMAYIEATNAEAEAA 91
Query: 88 --TYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFK------------------ 127
TY+LG ++DLTN EF A+YT P+P+ S
Sbjct: 92 GLTYELGETAYTDLTNQEFMAMYTA---PAPAQLPADESVITTRAGPVDAVGGAPGQLPV 148
Query: 128 YQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQ 187
Y NLS T P S+DWR AVTP+K+Q CG CWAFS VA VEGI +I L+ LSEQ+
Sbjct: 149 YVNLS-TSAPASVDWRASGAVTPVKNQGRCGSCWAFSTVAVVEGIYQIRTGKLVSLSEQE 207
Query: 188 LVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAA-AAKISNY 246
LVDC T ++GC GG +A +I N GI TE +YPY C+ A+ + A I+
Sbjct: 208 LVDCDTL-DDGCDGGISYRALRWIASNGGITTETDYPYTGTTDACNRAKLSHNAVSIAGL 266
Query: 247 EEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTE 306
V + E +L AV+ QPV++ I A F+ YK+G++NG CGT L+H VT+VG+G
Sbjct: 267 RRVATRSEASLANAVAGQPVAVSIEAGGDNFQHYKKGVYNGPCGTNLNHGVTVVGYGQEA 326
Query: 307 DGAN-YWLIKNSWGDTWGDAGYMKILRD-----EGLCGIGTQSSYPL 347
G + YW++KNSWG WGD GY+++ +D EGLCGI + SYPL
Sbjct: 327 AGGDRYWIVKNSWGQGWGDDGYIRMKKDVAGKPEGLCGIAIRPSYPL 373
>gi|238007404|gb|ACR34737.1| unknown [Zea mays]
gi|413943289|gb|AFW75938.1| cysteine proteinase Mir2 [Zea mays]
Length = 484
Score = 259 bits (662), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 140/341 (41%), Positives = 200/341 (58%), Gaps = 21/341 (6%)
Query: 26 VSCASQVVSSRSTHEQSVVEMHEKWMAQH----------GRSYKDELEKEMRFKIFKENL 75
++ A V ++ V ++E+W ++H G E + R ++F+ NL
Sbjct: 32 LAAAVTVTPPPERTDEEVRRLYEEWRSEHDAGPRRGATGGSLGPGEDDDARRLEVFRYNL 91
Query: 76 EYIEKANKE---GNRTYKLGTNRFSDLTNDEFRA-LYTGYKMPSPSHRSTTSSTFKYQNL 131
YI+ N E G ++LG RF+DLT +E+RA L G + + + S +Y L
Sbjct: 92 RYIDAHNAEADAGLHGFRLGLTRFADLTLEEYRARLLLGSRGRNGTAVGVVGSR-RYLPL 150
Query: 132 SMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDC 191
+ +P ++DWR++ AV +KDQ +CG CWAFSAVAAVEGI KI +LI LSEQ+L+DC
Sbjct: 151 AGEQLPDAVDWRERGAVAEVKDQGQCGACWAFSAVAAVEGINKIVTGSLISLSEQELIDC 210
Query: 192 STNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQK-AAAAKISNYEEVP 250
+ GC GG M+ AF ++I+N GI TE +YP+ GTC K I ++E VP
Sbjct: 211 DKFQDQGCDGGLMDNAFVFMIKNGGIDTEADYPFTGHDGTCDLKLKNTRVVSIDSFERVP 270
Query: 251 SGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGAN 310
E+AL KAV+ QPVS I A F+ Y GIF+G CGT LDH VT+VG+G +E G +
Sbjct: 271 INYERALQKAVAHQPVSASIEASRRAFQLYSSGIFDGRCGTYLDHGVTVVGYG-SEGGKD 329
Query: 311 YWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYPL 347
YW++KNSWG WG+AGY+++ R+ G CGI + YP+
Sbjct: 330 YWIVKNSWGTQWGEAGYVRMARNVRVRAGKCGIAMEPLYPV 370
>gi|356545112|ref|XP_003540989.1| PREDICTED: LOW QUALITY PROTEIN: KDEL-tailed cysteine endopeptidase
CEP1-like [Glycine max]
Length = 400
Score = 259 bits (662), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 134/308 (43%), Positives = 192/308 (62%), Gaps = 4/308 (1%)
Query: 23 ILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKAN 82
++ V+C Q S+S E E HEKWMAQ+G+ Y+D E E RF+IFK N+++IE N
Sbjct: 92 LVGVTCGRQC-RSKSRLEACTSERHEKWMAQYGKVYEDAAEMEKRFQIFKNNVQFIESFN 150
Query: 83 KEGNRTYKLGTNRFSDLTNDEFRALY-TGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLD 141
G++ + + N+F DL ++EF+AL G + S +T ++F+Y ++ +T++P ++D
Sbjct: 151 VAGDKPFNIRINQFPDLHDEEFKALLINGQRKVSGVETATEETSFRYGSV-VTNIPATMD 209
Query: 142 WRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGG 201
R K VTPIKDQ G CWA SAVAA+EGI +I+ + L+ LS+Q+LVD + GC G
Sbjct: 210 GRKKGVVTPIKDQGIIGSCWALSAVAAIEGIHQITTSKLMFLSKQKLVDSVKGESEGCIG 269
Query: 202 GTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAV 261
G +E AFE+I++ GI +E YPY+ V + + A I YE+VPS +++ALLK V
Sbjct: 270 GYVEDAFEFIVKKGGILSETHYPYKGVNXCKVEKETHSVAHIKGYEKVPSNNKKALLKVV 329
Query: 262 SMQPVSIGIAAYTTEFKSYKEGIFNGV-CGTQLDHAVTIVGFGTTEDGANYWLIKNSWGD 320
+ QPVS+ I FK Y IFN CG+ +H V +VG+G DGA YW +KNSWG
Sbjct: 330 ANQPVSVYIDVGAHAFKYYSSEIFNARNCGSDPNHVVAVVGYGKALDGAKYWPVKNSWGT 389
Query: 321 TWGDAGYM 328
WG YM
Sbjct: 390 EWGGKWYM 397
>gi|91092014|ref|XP_970644.1| PREDICTED: similar to cathepsin-L-like cysteine peptidase 02
[Tribolium castaneum]
gi|270001249|gb|EEZ97696.1| cathepsin L precursor [Tribolium castaneum]
Length = 337
Score = 259 bits (661), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 139/342 (40%), Positives = 194/342 (56%), Gaps = 19/342 (5%)
Query: 19 FIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMA---QHGRSYKDELEKEMRFKIFKENL 75
F++ + L SQ VS + E+W A H + Y+ E E+ R KIF EN
Sbjct: 3 FLVFVALCVVGSQAVSFFDL-------VQEQWGAFKVTHKKQYESETEERFRMKIFMENA 55
Query: 76 EYIEKANK---EGNRTYKLGTNRFSDLTNDEFRALYTGY-KMPSPSHRSTTSSTFKYQNL 131
+ K NK +G ++KLG N++SD+ N EF GY + +P + +
Sbjct: 56 HKVAKHNKLYAQGLVSFKLGVNKYSDMLNHEFVHTLNGYNRSKTPLRSGELDESITFIPP 115
Query: 132 SMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDC 191
+ ++P +DWR AVTP+KDQ +CG CW+FS ++EG L+ LSEQ L+DC
Sbjct: 116 ANVELPKQIDWRKLGAVTPVKDQGQCGSCWSFSTTGSLEGQHFRKSKKLVSLSEQNLIDC 175
Query: 192 STN-GNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVP 250
S GNNGC GG M+ AF YI N GI TE YPY+A C + A + ++
Sbjct: 176 SEKYGNNGCNGGLMDNAFRYIKDNGGIDTEQSYPYKAEDEKCHYKPRNKGATDRGFVDIE 235
Query: 251 SGDEQALLKAV-SMQPVSIGIAAYTTEFKSYKEGIF-NGVCGT-QLDHAVTIVGFGTTED 307
SGDE+ L AV ++ P+S+ I A F+ Y EG++ C + QLDH V +VG+GT ED
Sbjct: 236 SGDEEKLKAAVATVGPISVAIDASHPTFQQYSEGVYYEPECSSEQLDHGVLVVGYGTDED 295
Query: 308 GANYWLIKNSWGDTWGDAGYMKILRD-EGLCGIGTQSSYPLA 348
G +YWL+KNSWGD+WGD GY+K+ R+ + CGI TQ+SYPL
Sbjct: 296 GNDYWLVKNSWGDSWGDQGYIKMARNRDNNCGIATQASYPLV 337
>gi|330805273|ref|XP_003290609.1| hypothetical protein DICPUDRAFT_92519 [Dictyostelium purpureum]
gi|325079248|gb|EGC32857.1| hypothetical protein DICPUDRAFT_92519 [Dictyostelium purpureum]
Length = 333
Score = 259 bits (661), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 138/320 (43%), Positives = 195/320 (60%), Gaps = 14/320 (4%)
Query: 33 VSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLG 92
V S T++ S + WM +H R+Y E E R++ FKEN+++I K N + + T LG
Sbjct: 23 VFSSQTYQTSFI----GWMRKHDRAYSHE-EFTDRYQAFKENMDFIHKWNSQESDTV-LG 76
Query: 93 TNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIK 152
+F+DLTN+E++ Y G K+ + + K+ + P S+DWR+K AV+ +K
Sbjct: 77 LTKFADLTNEEYKKHYLGIKVNVKKNLNAAQKGLKFFKFTG---PDSIDWREKGAVSQVK 133
Query: 153 DQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTN-GNNGCGGGTMEKAFEYI 211
DQ +CG CW+FS AVEG +I N++ LSEQ LVDCS GN GC GG M AFEYI
Sbjct: 134 DQGQCGSCWSFSTTGAVEGAHQIKSGNMVSLSEQNLVDCSGQYGNQGCEGGLMVNAFEYI 193
Query: 212 IQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIA 271
I N GIATE YPY A QG C + A I Y+E+P G+E +L A++ QPVS+ I
Sbjct: 194 IDNGGIATESSYPYTAAQGRCKFTKSMNGANIIGYKEIPQGEEDSLTAALAKQPVSVAID 253
Query: 272 AYTTEFKSYKEGIFN-GVCGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMK 329
A F+ Y G+++ C ++ LDH V VG+GT E G +Y++IKNSWG TWG GY+
Sbjct: 254 ASHMSFQLYSSGVYDEPACSSEALDHGVLAVGYGTLE-GKDYYIIKNSWGPTWGQDGYIF 312
Query: 330 ILRD-EGLCGIGTQSSYPLA 348
+ R+ + CG+ T +SYP++
Sbjct: 313 MSRNAQNQCGVATMASYPIS 332
>gi|402770507|gb|AFQ98388.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 259 bits (661), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 140/335 (41%), Positives = 203/335 (60%), Gaps = 12/335 (3%)
Query: 21 IIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEK 80
++ L V CA V+ ++ ++ + E + H ++Y+ +E+ +RFKIF EN I K
Sbjct: 1 MLRLSVLCAIVAVTVAASSQEILRTQWEAFKTTHKKTYQSHMEELLRFKIFTENSLIIAK 60
Query: 81 ANKE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
N + G +YKLG N+F DL EF ++ GY S +S S+ N++ + +P
Sbjct: 61 HNAKYAKGLVSYKLGMNQFGDLLAHEFARIFNGYH---GSRKSGGSTFLPPANVNDSSLP 117
Query: 138 TSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTN-GN 196
++DWR K AVTP+KDQ +CG CWAFS ++EG + L+ LSEQ LVDCS + GN
Sbjct: 118 KAVDWRKKGAVTPVKDQGQCGSCWAFSTTGSLEGQHFLKNGELVSLSEQNLVDCSQSFGN 177
Query: 197 NGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQA 256
NGC GG ME AF+YI N GI TE YPY+AV G C ++ A + Y E+ +G E
Sbjct: 178 NGCEGGLMEDAFKYIKANDGIDTEKSYPYEAVDGECRFKKEDVGATDTGYVEIKAGCEDD 237
Query: 257 LLKAV-SMQPVSIGIAAYTTEFKSYKEGIFNGV-CGTQ-LDHAVTIVGFGTTEDGANYWL 313
L KAV ++ P+S+ I A + F+ Y EG+++ C ++ LDH V +VG+G + G YWL
Sbjct: 238 LKKAVATVGPISVAIDASHSSFQLYSEGVYDEPECSSEDLDHGVLVVGYG-VKGGKKYWL 296
Query: 314 IKNSWGDTWGDAGYMKILRD-EGLCGIGTQSSYPL 347
+KNSW ++WGD GY+ + RD CGI +Q+SYPL
Sbjct: 297 VKNSWAESWGDQGYILMSRDNNNQCGIASQASYPL 331
>gi|147769019|emb|CAN62459.1| hypothetical protein VITISV_015168 [Vitis vinifera]
Length = 246
Score = 258 bits (660), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 134/270 (49%), Positives = 178/270 (65%), Gaps = 32/270 (11%)
Query: 86 NRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHR-STTSSTFKYQNLSMTDVPTSLDWRD 144
+++YKL N F+DLTN+EF +K +H ST +++FKY+N+ T VP++ DWR
Sbjct: 2 DKSYKLSINEFADLTNEEFGTSRNRFK----AHICSTEATSFKYENV--TAVPSTXDWRK 55
Query: 145 KKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNG-NNGCGGGT 203
K AVTPIKDQ +CG CWAFSAVAA+EGIT++S LI LSEQ+LVDC T+G + GC G
Sbjct: 56 KGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGEDQGCXGAN 115
Query: 204 MEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKA-AAAKISNYEEVPSGDEQALLKAVS 262
YPY GTC+ + A AAKI+ YE+VP+ +E+AL KAV+
Sbjct: 116 -------------------YPYAGTDGTCNRKKAAHPAAKINGYEDVPANNEKALQKAVA 156
Query: 263 MQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTW 322
QP+++ I A EF+ Y G+F G CGT+LDH V VG+GT++DG YWL+KNSWG W
Sbjct: 157 HQPIAVAIDAGGXEFQFYSSGVFTGQCGTELDHGVXAVGYGTSDDGMKYWLVKNSWGTGW 216
Query: 323 GDAGYMKILRD----EGLCGIGTQSSYPLA 348
G+ GY+++ RD EGLCGI Q+SYP A
Sbjct: 217 GEEGYIRMQRDVTAKEGLCGIAMQASYPTA 246
>gi|55740406|gb|AAV63979.1| cathepsin L1 precursor [Artemia parthenogenetica]
Length = 338
Score = 258 bits (660), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 136/335 (40%), Positives = 204/335 (60%), Gaps = 11/335 (3%)
Query: 21 IIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEK 80
+I LL + Q+ ++ S E H + A H + Y +LE+++R KI+ EN + K
Sbjct: 6 LIFLLAAVLVQLSAALSLTNLLADEWH-LFKATHKKEYPSQLEEKLRMKIYLENKHKVAK 64
Query: 81 AN---KEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
N ++G ++Y++ N+F DL + EFR++ GY+ + S STF + + +VP
Sbjct: 65 HNILYEKGEKSYQVAMNKFGDLLHHEFRSIMNGYQHKK-QNSSRAESTFTFMEPANVEVP 123
Query: 138 TSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTN-GN 196
S+DWR+K A+TP+KDQ +CG CWAFS+ A+EG T L+ LSEQ L+DCS GN
Sbjct: 124 ESVDWREKGAITPVKDQGQCGSCWAFSSTGALEGQTFRKTGKLVSLSEQNLIDCSGKYGN 183
Query: 197 NGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQA 256
GC GG M++AF+YI N+GI TE+ YPY+A G C + A + ++PSG+E
Sbjct: 184 EGCNGGLMDQAFQYIKDNKGIDTENTYPYEAEDGVCRYNPRNRGAVDRGFVDIPSGEEDK 243
Query: 257 LLKAV-SMQPVSIGIAAYTTEFKSYKEG-IFNGVCGT-QLDHAVTIVGFGTTEDGANYWL 313
L AV ++ PVS+ I A F+ Y +G + C + LDH V +VG+G +++G +YWL
Sbjct: 244 LKAAVATVGPVSVAIDASHESFQFYSKGXYYEPSCDSDDLDHGVLVVGYG-SDNGEDYWL 302
Query: 314 IKNSWGDTWGDAGYMKILRD-EGLCGIGTQSSYPL 347
+KNSW + WGD GY+KI R+ + CG+ T +SYPL
Sbjct: 303 VKNSWSEHWGDEGYIKIARNRKNHCGVATAASYPL 337
>gi|195124431|ref|XP_002006696.1| GI21205 [Drosophila mojavensis]
gi|193911764|gb|EDW10631.1| GI21205 [Drosophila mojavensis]
Length = 339
Score = 258 bits (660), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 138/318 (43%), Positives = 191/318 (60%), Gaps = 18/318 (5%)
Query: 46 MHEKWMA---QHGRSYKDELEKEMRFKIFKENLEYIEKANKE---GNRTYKLGTNRFSDL 99
+ E+W +H ++Y+DE E+ R KIF EN I K N+ G T+K+ N+++D+
Sbjct: 23 IKEEWHTFKLEHRKTYQDETEERFRLKIFNENKHKIAKHNQRYATGEVTFKMAVNKYADM 82
Query: 100 TNDEFRAL-----YTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQ 154
+ EFR YT +K S S T TF + + +P S+DWR+K AVT +KDQ
Sbjct: 83 LHHEFRETMNGFNYTLHKELRASDPSFTGITF--ISPAHVKLPKSVDWREKGAVTAVKDQ 140
Query: 155 QECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTN-GNNGCGGGTMEKAFEYIIQ 213
CG CWAFS+ A+EG L+ LSEQ LVDCS GNNGC GG M+ AF YI
Sbjct: 141 GHCGSCWAFSSTGALEGQHFRKTGTLVSLSEQNLVDCSAKYGNNGCNGGLMDNAFRYIKD 200
Query: 214 NQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAV-SMQPVSIGIAA 272
N GI TE YPY+ + +C + + A + ++P G+E+ + +AV ++ PVS+ I A
Sbjct: 201 NGGIDTEKSYPYEGIDDSCHFNKDSVGATDRGFADIPQGNEKKMAEAVATIGPVSVAIDA 260
Query: 273 YTTEFKSYKEGIFNGV-CGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKI 330
F+ Y EGI+N C +Q LDH V +VG+GT E G +YWL+KNSWG TWGD G++K+
Sbjct: 261 SHESFQFYSEGIYNEPECNSQNLDHGVLVVGYGTDESGKDYWLVKNSWGTTWGDKGFIKM 320
Query: 331 LRDE-GLCGIGTQSSYPL 347
R+E CGI + SSYPL
Sbjct: 321 ARNEDNQCGIASASSYPL 338
>gi|297733654|emb|CBI14901.3| unnamed protein product [Vitis vinifera]
Length = 273
Score = 258 bits (660), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 129/259 (49%), Positives = 164/259 (63%), Gaps = 14/259 (5%)
Query: 99 LTNDEFRALYTGYKMPSPSHR-----STTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKD 153
+TN EFR+ Y G K+ HR + +F Y+ + VP S+DWR K AVTPIKD
Sbjct: 1 MTNHEFRSTYAGSKVNH--HRMFRGSQHAAGSFMYEKVK--SVPPSVDWRKKGAVTPIKD 56
Query: 154 QQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQ 213
Q +CG CWAFS V AVEGI I L+ LSEQ+LVDC T+ N GC GG M AFE+I +
Sbjct: 57 QGQCGSCWAFSTVVAVEGINHIKTNKLVSLSEQELVDCDTSENQGCNGGLMGYAFEFIKE 116
Query: 214 NQGIATEDEYPYQAVQGTCSAAQ-KAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAA 272
GI TE YPY A GTC ++ + I +E VP +E ALLKA + QP+S+ I A
Sbjct: 117 KGGITTEQSYPYTAEDGTCDVSKVNSPVVSIDGHETVPPNNEDALLKAAANQPISVAIDA 176
Query: 273 YTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILR 332
+ F+ Y EG+F G CGT LDH V IVG+GTT DG YW++KNSWG WG+ GY+++ R
Sbjct: 177 GGSAFQFYSEGVFAGRCGTDLDHGVAIVGYGTTLDGTKYWIVKNSWGTDWGENGYIRMKR 236
Query: 333 ----DEGLCGIGTQSSYPL 347
EGLCGI ++SYP+
Sbjct: 237 GISAKEGLCGIAVEASYPI 255
>gi|195153545|ref|XP_002017686.1| GL17172 [Drosophila persimilis]
gi|194113482|gb|EDW35525.1| GL17172 [Drosophila persimilis]
Length = 341
Score = 258 bits (660), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 136/340 (40%), Positives = 197/340 (57%), Gaps = 15/340 (4%)
Query: 19 FIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYI 78
I+ +L + +Q VS + + + E + +H ++Y+DE E+ R KIF EN I
Sbjct: 5 LILPLLALVAVAQAVS----YAEVIQEEWHTFKLEHRKNYQDETEERFRLKIFNENKHKI 60
Query: 79 EKANK---EGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFK---YQNLS 132
K N+ G ++K+ N+++D+ + EF + G+ +FK + +
Sbjct: 61 AKHNQLWATGAVSFKMAVNKYADMLHHEFYSTMNGFNYTLHKQLRNADESFKGVTFISPE 120
Query: 133 MTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCS 192
+P +DWR K AVT +KDQ CG CWAFS+ A+EG L+ LSEQ LVDCS
Sbjct: 121 HVTLPKQVDWRTKGAVTDVKDQGHCGSCWAFSSTGALEGQHYRKSGVLVSLSEQNLVDCS 180
Query: 193 TN-GNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPS 251
T GNNGC GG M+ AF YI N GI TE YPY+A+ +C + + A + ++P
Sbjct: 181 TKYGNNGCNGGLMDNAFRYIKDNGGIDTEKSYPYEAIDDSCHFNKGSIGATDRGFVDIPQ 240
Query: 252 GDEQALLKAV-SMQPVSIGIAAYTTEFKSYKEGIFN-GVCGTQ-LDHAVTIVGFGTTEDG 308
G+E+ + +AV ++ PV++ I A F+ Y EG++N C Q LDH V +VGFGT E G
Sbjct: 241 GNEKKMAEAVATIGPVAVAIDASHESFQFYSEGVYNEPACDAQNLDHGVLVVGFGTDESG 300
Query: 309 ANYWLIKNSWGDTWGDAGYMKILRD-EGLCGIGTQSSYPL 347
+YWL+KNSWG TWGD G++K+LR+ E CGI + SSYPL
Sbjct: 301 EDYWLVKNSWGTTWGDKGFIKMLRNKENQCGIASASSYPL 340
>gi|320169652|gb|EFW46551.1| cathepsin L2 [Capsaspora owczarzaki ATCC 30864]
Length = 325
Score = 258 bits (659), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 139/334 (41%), Positives = 193/334 (57%), Gaps = 22/334 (6%)
Query: 20 IIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIE 79
+ ++ LV+CA+ + + +W A H R Y E+ +R +I+ NLE I
Sbjct: 7 VALLALVACATAMPFA-------------EWKALHNRQYASAQEEALRQEIYLSNLELIN 53
Query: 80 KANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPS-PSHRSTTSSTFKYQNLSMTDVPT 138
+ N G +Y LG N F DL + EF A Y G + + +S SST+ + M +P
Sbjct: 54 EHNAAGRHSYTLGMNEFGDLAHHEFAAKYLGVRFNGVNATKSFASSTYLPR---MVSLPD 110
Query: 139 SLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTN-GNN 197
S+DWR VTP+K+Q +CG CW+FS +VEG L+ LSEQ LVDCS+ GN
Sbjct: 111 SVDWRTAGIVTPVKNQGQCGSCWSFSTTGSVEGQHARKTGTLVSLSEQNLVDCSSQEGNE 170
Query: 198 GCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQAL 257
GC GG M+ AFEYII+N GI TE YPY A GTC A +++Y+++ +G E L
Sbjct: 171 GCNGGLMDDAFEYIIKNGGIDTEASYPYTATTGTCKFNAANIGATVASYQDIITGSESDL 230
Query: 258 LKAV-SMQPVSIGIAAYTTEFKSYKEGIFN--GVCGTQLDHAVTIVGFGTTEDGANYWLI 314
AV ++ PVS+ I A F+ Y G++N TQLDH V VG+GT+ +G +YWL+
Sbjct: 231 QNAVATVGPVSVAIDASHINFQFYFTGVYNEKKCSTTQLDHGVLAVGYGTSTEGKDYWLV 290
Query: 315 KNSWGDTWGDAGYMKILRD-EGLCGIGTQSSYPL 347
KNSWG TWG AGY+ + R+ + CGI T +SYPL
Sbjct: 291 KNSWGATWGKAGYIWMSRNADNQCGIATSASYPL 324
>gi|402770515|gb|AFQ98392.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 258 bits (659), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 139/335 (41%), Positives = 203/335 (60%), Gaps = 12/335 (3%)
Query: 21 IIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEK 80
++ L V CA V+ ++ ++ + E + H ++Y+ +E+ +RFKIF EN I K
Sbjct: 1 MLRLSVLCAIAAVTVAASSQEILRTQWEAFKTTHKKTYQSHMEELLRFKIFTENSLIIAK 60
Query: 81 ANKE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
N + G +YKLG N+F DL EF ++ G+ + ++ SS N++ + +P
Sbjct: 61 HNAKYAKGLVSYKLGMNQFGDLLAHEFARIFNGHH---GTRKTGGSSFLPPANVNDSSLP 117
Query: 138 TSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTN-GN 196
+DWR K AVTP+KDQ +CG CWAFSA ++EG + L+ LSEQ LVDCS + GN
Sbjct: 118 KVVDWRKKGAVTPVKDQGQCGSCWAFSATGSLEGQHFLKNGELVSLSEQNLVDCSQSFGN 177
Query: 197 NGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQA 256
NGC GG ME AF+YI N GI TE YPY+AV G C ++ A + Y E+ +G E
Sbjct: 178 NGCEGGLMEDAFKYIKANDGIDTEKSYPYEAVDGECRFKKEDVGATDTGYVEIKAGSEVD 237
Query: 257 LLKAV-SMQPVSIGIAAYTTEFKSYKEGIFNGV-CGTQ-LDHAVTIVGFGTTEDGANYWL 313
L KAV ++ P+S+ I A + F+ Y EG+++ C ++ LDH V +VG+G + G YWL
Sbjct: 238 LKKAVATVGPISVAIDASHSSFQLYSEGVYDEPECSSEDLDHGVLVVGYG-VKGGKKYWL 296
Query: 314 IKNSWGDTWGDAGYMKILRD-EGLCGIGTQSSYPL 347
+KNSW ++WGD GY+ + RD CGI +Q+SYPL
Sbjct: 297 VKNSWAESWGDQGYILMSRDNNNQCGIASQASYPL 331
>gi|242048430|ref|XP_002461961.1| hypothetical protein SORBIDRAFT_02g011230 [Sorghum bicolor]
gi|241925338|gb|EER98482.1| hypothetical protein SORBIDRAFT_02g011230 [Sorghum bicolor]
Length = 380
Score = 258 bits (658), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 143/346 (41%), Positives = 192/346 (55%), Gaps = 35/346 (10%)
Query: 33 VSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNR---TY 89
+ S + ++E ++W A + +SY E RF ++ N+ YIE N E TY
Sbjct: 38 MGSSTDDNSPMIERFQRWKAAYNKSYATVAEDRRRFLVYARNMAYIEATNAEAEAAGLTY 97
Query: 90 KLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFK---------------------Y 128
+LG ++DLTN EF A+YT PSP+ Y
Sbjct: 98 ELGETAYTDLTNQEFMAMYTA--APSPAQLPADEDEDDAAEAVITTRAGPVDAVGQLPVY 155
Query: 129 QNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQL 188
NLS T P S+DWR AVTP+K+Q CG CWAFS VA VEGI +I L+ LSEQ+L
Sbjct: 156 VNLS-TAAPASVDWRASGAVTPVKNQGRCGSCWAFSTVAVVEGIYQIRTGKLVSLSEQEL 214
Query: 189 VDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAA-AAKISNYE 247
VDC T + GC GG +A +I N G+ TE++YPY C+ A+ A AA I+
Sbjct: 215 VDCDTL-DAGCDGGISYRALRWITSNGGLTTEEDYPYTGTTDACNRAKLAHNAASIAGLR 273
Query: 248 EVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGT-TE 306
V + E +L AV+ QPV++ I A F+ YK G++NG CGT L+H VT+VG+G E
Sbjct: 274 RVATRSEASLANAVAGQPVAVSIEAGGDNFQHYKRGVYNGPCGTSLNHGVTVVGYGQEEE 333
Query: 307 DGANYWLIKNSWGDTWGDAGYMKILRD-----EGLCGIGTQSSYPL 347
DG YW+IKNSWG +WGD GY+K+ +D EGLCGI + S+PL
Sbjct: 334 DGDKYWIIKNSWGASWGDGGYIKMRKDVAGKPEGLCGIAIRPSFPL 379
>gi|91992508|gb|ABE72970.1| cathepsin L [Aedes aegypti]
Length = 339
Score = 258 bits (658), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 139/321 (43%), Positives = 193/321 (60%), Gaps = 26/321 (8%)
Query: 48 EKWMA---QHGRSYKDELEKEMRFKIFKENLEYIEKANKE---GNRTYKLGTNRFSDLTN 101
E+W A QH ++Y E E+ +R KI+ +N I K N+ G Y+L N+++DL +
Sbjct: 25 EEWNAFKLQHRKNYDSETEERIRLKIYVQNKHKIAKHNQRFDLGQEKYRLRVNKYADLLH 84
Query: 102 DEFRALYTGYKMPSPSHRSTTSSTFK---------YQNLSMTDVPTSLDWRDKKAVTPIK 152
+EF G+ +R+ + + K + + +VPT++DWR K AVTP+K
Sbjct: 85 EEFVQTVNGF------NRTDSKKSLKGVRIEEPVTFIEPANVEVPTTVDWRKKGAVTPVK 138
Query: 153 DQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTN-GNNGCGGGTMEKAFEYI 211
DQ CG CW+FSA A+EG L+ LSEQ LVDCS GNNGC GG M+ AF+YI
Sbjct: 139 DQGHCGSCWSFSATGALEGQHFRKTGKLVSLSEQNLVDCSGKYGNNGCNGGMMDYAFQYI 198
Query: 212 IQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAV-SMQPVSIGI 270
N GI TE YPY+A+ TC KA A Y ++P GDE+AL KA+ ++ PVSI I
Sbjct: 199 KDNGGIDTEKSYPYEAIDDTCHFNPKAVGATDKGYVDIPQGDEEALKKALATVGPVSIAI 258
Query: 271 AAYTTEFKSYKEGI-FNGVCGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYM 328
A F+ Y EG+ + C ++ LDH V VG+GT+E+G +YWL+KNSWG TWGD GY+
Sbjct: 259 DASHESFQFYSEGVYYEPQCDSENLDHGVLAVGYGTSEEGEDYWLVKNSWGTTWGDQGYV 318
Query: 329 KILRD-EGLCGIGTQSSYPLA 348
K+ R+ + CG+ T +SYPL
Sbjct: 319 KMARNHDNHCGVATCASYPLV 339
>gi|125811033|ref|XP_001361727.1| GA25021 [Drosophila pseudoobscura pseudoobscura]
gi|54636904|gb|EAL26307.1| GA25021 [Drosophila pseudoobscura pseudoobscura]
Length = 341
Score = 258 bits (658), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 136/340 (40%), Positives = 196/340 (57%), Gaps = 15/340 (4%)
Query: 19 FIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYI 78
I+ +L + +Q VS + + + E + +H ++Y+DE E+ R KIF EN I
Sbjct: 5 LILPLLALVAVAQAVS----YAEVIQEEWHTFKLEHRKNYQDETEERFRLKIFNENKHKI 60
Query: 79 EKANK---EGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFK---YQNLS 132
K N+ G ++K+ N+++D+ + EF + G+ +FK + +
Sbjct: 61 AKHNQLWATGAVSFKMAVNKYADMLHHEFYSTMNGFNYTLHKQLRNADESFKGVTFISPE 120
Query: 133 MTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCS 192
+P +DWR K AVT +KDQ CG CWAFS+ A+EG L+ LSEQ LVDCS
Sbjct: 121 HVTLPKQVDWRTKGAVTDVKDQGHCGSCWAFSSTGALEGQHYRKSGVLVSLSEQNLVDCS 180
Query: 193 TN-GNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPS 251
T GNNGC GG M+ AF YI N GI TE YPY+A+ +C + A + ++P
Sbjct: 181 TKYGNNGCNGGLMDNAFRYIKDNGGIDTEKSYPYEAIDDSCHFNKGTIGATDRGFVDIPQ 240
Query: 252 GDEQALLKAV-SMQPVSIGIAAYTTEFKSYKEGIFN-GVCGTQ-LDHAVTIVGFGTTEDG 308
G+E+ + +AV ++ PV++ I A F+ Y EG++N C Q LDH V +VGFGT E G
Sbjct: 241 GNEKKMAEAVATIGPVAVAIDASHESFQFYSEGVYNEPACDAQNLDHGVLVVGFGTDESG 300
Query: 309 ANYWLIKNSWGDTWGDAGYMKILRD-EGLCGIGTQSSYPL 347
+YWL+KNSWG TWGD G++K+LR+ E CGI + SSYPL
Sbjct: 301 QDYWLVKNSWGTTWGDKGFIKMLRNKENQCGIASASSYPL 340
>gi|402770517|gb|AFQ98393.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 258 bits (658), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 139/335 (41%), Positives = 203/335 (60%), Gaps = 12/335 (3%)
Query: 21 IIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEK 80
++ L V CA V+ ++ ++ + E + H ++Y+ +E+ +RFKIF EN I K
Sbjct: 1 MLRLSVLCAIAAVTVAASSQEILRTQWEAFKTTHKKTYQSHMEELLRFKIFTENSLIIAK 60
Query: 81 ANKE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
N + G +YKLG N+F DL EF ++ G+ + ++ SS N++ + +P
Sbjct: 61 HNAKYAKGLVSYKLGMNQFGDLLAHEFARIFNGHH---GTRKTGGSSFLPPANVNDSSLP 117
Query: 138 TSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTN-GN 196
+DWR K AVTP+KDQ +CG CWAFSA ++EG + L+ LSEQ LVDCS + GN
Sbjct: 118 KVVDWRKKGAVTPVKDQGQCGSCWAFSATGSLEGQHFLKNGELVSLSEQNLVDCSQSFGN 177
Query: 197 NGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQA 256
NGC GG ME AF+YI N GI TE YPY+AV G C ++ A + Y E+ +G E
Sbjct: 178 NGCEGGLMEDAFKYIKANDGIDTEKSYPYKAVDGECRFKKEDVGATDTGYVEIKAGSEVD 237
Query: 257 LLKAV-SMQPVSIGIAAYTTEFKSYKEGIFNGV-CGTQ-LDHAVTIVGFGTTEDGANYWL 313
L KAV ++ P+S+ I A + F+ Y EG+++ C ++ LDH V +VG+G + G YWL
Sbjct: 238 LKKAVATVGPISVAIDASHSSFQLYSEGVYDEPECSSEDLDHGVLVVGYG-VKGGKKYWL 296
Query: 314 IKNSWGDTWGDAGYMKILRD-EGLCGIGTQSSYPL 347
+KNSW ++WGD GY+ + RD CGI +Q+SYPL
Sbjct: 297 VKNSWAESWGDQGYILMSRDNNNQCGIASQASYPL 331
>gi|330805277|ref|XP_003290611.1| hypothetical protein DICPUDRAFT_81345 [Dictyostelium purpureum]
gi|325079250|gb|EGC32859.1| hypothetical protein DICPUDRAFT_81345 [Dictyostelium purpureum]
Length = 330
Score = 258 bits (658), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 138/329 (41%), Positives = 195/329 (59%), Gaps = 21/329 (6%)
Query: 26 VSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEG 85
V AS V S T++ S + WM +H RSY E +++ FK+N+++I N
Sbjct: 16 VCFASNSVYSAQTYQTSFL----GWMKKHDRSYHHH-EFNNKYQAFKDNMDFIHNWNTNK 70
Query: 86 NRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDV--PTSLDWR 143
N LG +F+DLTN+E+R +Y G K+ + N +M P S+DWR
Sbjct: 71 NSKTVLGLTQFADLTNEEYRKIYLGTKVNVAPEK---------HNFNMIHFTGPDSIDWR 121
Query: 144 DKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTN-GNNGCGGG 202
K AV+ +KDQ +CG CW+FS +VEG +I N++ LSEQ LVDCS GNNGC GG
Sbjct: 122 TKGAVSHVKDQGQCGSCWSFSTTGSVEGAHQIKTGNMVTLSEQNLVDCSGKFGNNGCDGG 181
Query: 203 TMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAVS 262
M AF++I+ G+ATED YPY AVQG C + A IS Y+E+ G E L A++
Sbjct: 182 LMVNAFKFIMSQGGVATEDSYPYNAVQGKCKFTKSMVGANISGYKEITQGSELELQAALT 241
Query: 263 MQPVSIGIAAYTTEFKSYKEGIFNGV-CGT-QLDHAVTIVGFGTTEDGANYWLIKNSWGD 320
QPVSI I A F+ YK G+++ C + QLDH V VG+G TE+G +Y+++KNSW D
Sbjct: 242 KQPVSIAIDASQQSFQLYKSGVYDEPECSSYQLDHGVLAVGYG-TENGKDYYIVKNSWAD 300
Query: 321 TWGDAGYMKILRD-EGLCGIGTQSSYPLA 348
+WG GY+ + R+ + CG+ T +SYP++
Sbjct: 301 SWGQDGYIFMSRNAKNQCGVATMASYPIS 329
>gi|357130488|ref|XP_003566880.1| PREDICTED: fruit bromelain-like [Brachypodium distachyon]
Length = 356
Score = 258 bits (658), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 134/315 (42%), Positives = 187/315 (59%), Gaps = 16/315 (5%)
Query: 47 HEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRA 106
HE+WMA+ GR Y D EK R ++F N Y++ N+ GNRTY LG N+FSDLT+DEF
Sbjct: 39 HEEWMAKFGRVYTDAQEKARRQEVFGANARYVDAVNRAGNRTYTLGLNKFSDLTDDEFVQ 98
Query: 107 LYTGYK-MPSPSHRSTTSSTFKYQNL--SMTDVPTSLDWRDKKAVTPIKDQQECGCCWAF 163
+ GY+ R + K L D+P S+DWR + AVT +K+Q CGCCWAF
Sbjct: 99 THLGYRGHQQGGLRPEEENVSKVAALGYGQADMPESVDWRAQGAVTGVKNQGSCGCCWAF 158
Query: 164 SAVAAVEGITKISGANLIQLSEQQLVDCSTN----GN-NGCGGGTMEKAFEYIIQNQGIA 218
+AVAA EG+ KI+ NLI +SEQQ++DC+ GN N C GG ++ A Y+ ++G+
Sbjct: 159 AAVAATEGLVKIATGNLISMSEQQVLDCTGQSPGMGNTNTCDGGHIDDALRYVAASRGLQ 218
Query: 219 TEDEYPYQAVQGTC-SAAQKAAAAKISNYEEVP-SGDEQALLKAVSMQPVSIGIAAYTTE 276
E Y Y +QG C S +AA + V GDE L V+ QP+++ + A + +
Sbjct: 219 PEAAYAYTGLQGACQSGFTPNSAASFGEPQTVTLQGDEGRLQGLVAGQPIAVSVEA-SDD 277
Query: 277 FKSYKEGIFNG---VCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD 333
F+ Y G+F CG +L+HAVT+VG+G+ + G YWL+KN WG +WG+ GYM+I R
Sbjct: 278 FRHYMSGVFTAGTSSCGQRLNHAVTVVGYGSADGGQEYWLVKNQWGTSWGEGGYMRIARG 337
Query: 334 EGL--CGIGTQSSYP 346
G CGI + YP
Sbjct: 338 NGAPNCGISAYAYYP 352
>gi|22653679|sp|Q26636.1|CATL_SARPE RecName: Full=Cathepsin L; Contains: RecName: Full=Cathepsin L
heavy chain; Contains: RecName: Full=Cathepsin L light
chain; Flags: Precursor
gi|505140|dbj|BAA03970.1| cathepsin L precursor [Sarcophaga peregrina]
Length = 339
Score = 258 bits (658), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 137/312 (43%), Positives = 185/312 (59%), Gaps = 13/312 (4%)
Query: 48 EKWMA---QHGRSYKDELEKEMRFKIFKENLEYIEKANK---EGNRTYKLGTNRFSDLTN 101
E+W QH ++Y +E+E+ R KIF EN I K N+ +G +YKLG N+++D+ +
Sbjct: 26 EEWHTYKLQHRKNYANEVEERFRMKIFNENRHKIAKHNQLFAQGKVSYKLGLNKYADMLH 85
Query: 102 DEFRALYTGYK--MPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGC 159
EF+ GY + T Y + VP S+DWR+ AVT +KDQ CG
Sbjct: 86 HEFKETMNGYNHTLRQLMRERTGLVGATYIPPAHVTVPKSVDWREHGAVTGVKDQGHCGS 145
Query: 160 CWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTN-GNNGCGGGTMEKAFEYIIQNQGIA 218
CWAFS+ A+EG L+ LSEQ LVDCST GNNGC GG M+ AF YI N GI
Sbjct: 146 CWAFSSTGALEGQHFRKAGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGGID 205
Query: 219 TEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAV-SMQPVSIGIAAYTTEF 277
TE YPY+ + +C + A + + ++P GDE+ + KAV +M PVS+ I A F
Sbjct: 206 TEKSYPYEGIDDSCHFNKATIGATDTGFVDIPEGDEEKMKKAVATMGPVSVAIDASHESF 265
Query: 278 KSYKEGIFNGV-CGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRDE- 334
+ Y EG++N C Q LDH V +VG+GT E G +YWL+KNSWG TWG+ GY+K+ R++
Sbjct: 266 QLYSEGVYNEPECDEQNLDHGVLVVGYGTDESGMDYWLVKNSWGTTWGEQGYIKMARNQN 325
Query: 335 GLCGIGTQSSYP 346
CGI T SSYP
Sbjct: 326 NQCGIATASSYP 337
>gi|402770511|gb|AFQ98390.1| cathepsin L [Rhipicephalus microplus]
gi|402770513|gb|AFQ98391.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 258 bits (658), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 139/335 (41%), Positives = 203/335 (60%), Gaps = 12/335 (3%)
Query: 21 IIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEK 80
++ L V CA V+ ++ ++ + E + H ++Y+ +E+ +RFKIF EN I K
Sbjct: 1 MLRLSVLCAIVAVTVAASSQEILRTQWEAFKTTHKKTYQSHMEELLRFKIFTENSLIIAK 60
Query: 81 ANKE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
N + G +YKLG N+F DL EF ++ G+ + ++ SS N++ + +P
Sbjct: 61 HNAKYAKGLVSYKLGMNQFGDLLAHEFARIFNGHH---GTRKTGGSSFLPPANVNDSSLP 117
Query: 138 TSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTN-GN 196
+DWR K AVTP+KDQ +CG CWAFSA ++EG + L+ LSEQ LVDCS + GN
Sbjct: 118 KVVDWRKKGAVTPVKDQGQCGSCWAFSATGSLEGQHFLKNGELVSLSEQNLVDCSQSFGN 177
Query: 197 NGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQA 256
NGC GG ME AF+YI N GI TE YPY+AV G C ++ A + Y E+ +G E
Sbjct: 178 NGCEGGLMEDAFKYIKANDGIDTEKSYPYEAVDGECRFKKEDVGATDTGYVEIKAGSEVD 237
Query: 257 LLKAV-SMQPVSIGIAAYTTEFKSYKEGIFNGV-CGTQ-LDHAVTIVGFGTTEDGANYWL 313
L KAV ++ P+S+ I A + F+ Y EG+++ C ++ LDH V +VG+G + G YWL
Sbjct: 238 LKKAVATVGPISVAIDASHSSFQLYSEGVYDEPECSSEDLDHGVLVVGYG-VKGGKKYWL 296
Query: 314 IKNSWGDTWGDAGYMKILRD-EGLCGIGTQSSYPL 347
+KNSW ++WGD GY+ + RD CGI +Q+SYPL
Sbjct: 297 VKNSWAESWGDQGYILMSRDNNNQCGIASQASYPL 331
>gi|115464789|ref|NP_001055994.1| Os05g0508300 [Oryza sativa Japonica Group]
gi|48475189|gb|AAT44258.1| hypothetical protein [Oryza sativa Japonica Group]
gi|113579545|dbj|BAF17908.1| Os05g0508300 [Oryza sativa Japonica Group]
Length = 450
Score = 258 bits (658), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 130/303 (42%), Positives = 187/303 (61%), Gaps = 7/303 (2%)
Query: 48 EKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRAL 107
E W A+HGRSY E+ R F +N ++ A+ +Y L N F+DLT+DEFRA
Sbjct: 39 EAWCAEHGRSYATPGERAARLAAFADNAAFV-AAHNGAPASYALALNAFADLTHDEFRAA 97
Query: 108 YTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVA 167
G + + + + + VP ++DWR AVT +KDQ CG CW+FSA
Sbjct: 98 RLGRLAAAGGPGRDGGAPYLGVDGGVGAVPDAVDWRQSGAVTKVKDQGSCGACWSFSATG 157
Query: 168 AVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQA 227
A+EGI KI +LI LSEQ+L+DC + N+GCGGG M+ A++++++N GI TE +YPY+
Sbjct: 158 AMEGINKIKTGSLISLSEQELIDCDRSYNSGCGGGLMDYAYKFVVKNGGIDTEADYPYRE 217
Query: 228 VQGTCSAAQ-KAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFN 286
GTC+ + K I Y++VP+ +E LL+AV+ QPVS+GI F+ Y +GIF+
Sbjct: 218 TDGTCNKNKLKRRVVTIDGYKDVPANNEDMLLQAVAQQPVSVGICGSARAFQLYSKGIFD 277
Query: 287 GVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQ 342
G C T LDHA+ IVG+G +E G +YW++KNSWG++WG GYM + R+ G+CGI
Sbjct: 278 GPCPTSLDHAILIVGYG-SEGGKDYWIVKNSWGESWGMKGYMYMHRNTGNSNGVCGINQM 336
Query: 343 SSY 345
S+
Sbjct: 337 PSF 339
>gi|402770509|gb|AFQ98389.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 257 bits (657), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 140/336 (41%), Positives = 204/336 (60%), Gaps = 14/336 (4%)
Query: 21 IIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEK 80
++ L V CA V+ ++ ++ + E + H ++Y+ +E+ +RFKIF E+ I +
Sbjct: 1 MLRLSVLCAIAAVTVAASSQEILRTQWEAFKTTHKKTYQSHMEELLRFKIFTESSLIIAR 60
Query: 81 ANKE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTF-KYQNLSMTDV 136
N + G +YKLG N+F DL EF ++ G+ R T STF N++ + +
Sbjct: 61 HNAKYAKGLVSYKLGMNQFGDLLAHEFARIFNGHH----GTRKTGGSTFLPPANVNDSSL 116
Query: 137 PTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTN-G 195
P ++DWR K AVTP+KDQ +CG CWAFSA ++EG + L+ LSEQ LVDCS + G
Sbjct: 117 PKAVDWRKKGAVTPVKDQGQCGSCWAFSATGSLEGQHFLKNGELVSLSEQNLVDCSQSFG 176
Query: 196 NNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQ 255
NNGC GG ME AF+YI N GI TE YPY+AV G C ++ A + Y E+ +G E
Sbjct: 177 NNGCEGGLMEDAFKYIKANDGIDTEKSYPYEAVDGECRFKKEDVGATDTGYVEIKAGSED 236
Query: 256 ALLKAV-SMQPVSIGIAAYTTEFKSYKEGIFNGV-CGTQ-LDHAVTIVGFGTTEDGANYW 312
L KAV ++ P+S+ I A + F+ Y EG+++ C ++ LDH V +VG+G + G YW
Sbjct: 237 DLKKAVATVGPISVAIDASHSSFQLYSEGVYDEPECSSEDLDHGVLVVGYG-VKGGKKYW 295
Query: 313 LIKNSWGDTWGDAGYMKILRD-EGLCGIGTQSSYPL 347
L+KNSW ++WGD GY+ + RD CGI +Q+SYPL
Sbjct: 296 LVKNSWAESWGDQGYILMSRDNNNQCGIASQASYPL 331
>gi|146217394|gb|ABQ10739.1| cathepsin L [Penaeus monodon]
Length = 341
Score = 257 bits (657), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 138/317 (43%), Positives = 192/317 (60%), Gaps = 13/317 (4%)
Query: 43 VVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANK---EGNRTYKLGTNRFSDL 99
V+E E + +H + Y E+E+ R KIF EN I NK +G+ TYKL N++ D+
Sbjct: 25 VLEEWEAFKLEHSKKYDSEVEESFRMKIFTENKHKIANHNKGFAQGHHTYKLSMNKYGDM 84
Query: 100 TNDEFRALYTGYKMPS----PSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQ 155
+ EF + G++ ++R+ T +TF + + +P ++DWR K AVTPIKDQ
Sbjct: 85 LHHEFVSTMNGFRGNHTGGYKNNRAYTGATFIEPDDDVQ-LPKNVDWRTKGAVTPIKDQG 143
Query: 156 ECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTN-GNNGCGGGTMEKAFEYIIQN 214
+CG CWAFSA A+EG T L+ LSEQ LVDCS GNNGC GG M+ AFEY+ +N
Sbjct: 144 QCGSCWAFSATGALEGQTFRKTGQLVSLSEQNLVDCSRKFGNNGCNGGLMDNAFEYVKEN 203
Query: 215 QGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAV-SMQPVSIGIAAY 273
GI TE+ YPY A C +AA A+ + +V G E AL KAV ++ PVS+ I A
Sbjct: 204 GGIDTEESYPYDAEDEKCHYNPRAAGAEDKGFVDVREGSEHALKKAVATVGPVSVAIDAS 263
Query: 274 TTEFKSYKEGIF-NGVCGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKIL 331
F+ Y G++ C + LDH V +VG+G +DG +YWL+KNSWG TWGD GY+K+
Sbjct: 264 HESFQFYSHGVYIEPECSPEMLDHGVLVVGYGIDDDGTDYWLVKNSWGTTWGDQGYVKMA 323
Query: 332 RD-EGLCGIGTQSSYPL 347
R+ + CGI + +S+PL
Sbjct: 324 RNRDNQCGIASSASFPL 340
>gi|413944252|gb|AFW76901.1| hypothetical protein ZEAMMB73_101481 [Zea mays]
Length = 232
Score = 257 bits (657), Expect = 5e-66, Method: Compositional matrix adjust.
Identities = 118/229 (51%), Positives = 157/229 (68%), Gaps = 6/229 (2%)
Query: 123 SSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQ 182
S+ F+Y+N+S+ +P ++DWR AVTPIKDQ +CGCCWAFSAVAA EGI KIS LI
Sbjct: 3 STGFRYENVSVDAIPATIDWRTNGAVTPIKDQGQCGCCWAFSAVAATEGIVKISTGKLIS 62
Query: 183 LSEQQLVDCSTNG-NNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAA 241
LSEQ+LVDC G + GC GG M+ AF++II+N G+ TE YPY A G C + +AA
Sbjct: 63 LSEQELVDCDVYGEDQGCEGGLMDDAFKFIIKNGGLTTESNYPYTAADGKCKSGSNSAA- 121
Query: 242 KISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVG 301
I YE+VP+ DE AL+KAV+ QPVS+ + F+ Y G+ G CGT LDH + +G
Sbjct: 122 NIKGYEDVPTNDEAALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIG 181
Query: 302 FGTTEDGANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYP 346
+G T DG YWL+KNSWG TWG+ GY+++ +D +G+CG+ + SYP
Sbjct: 182 YGKTSDGTKYWLMKNSWGTTWGENGYLRMEKDISDKKGMCGLAIEPSYP 230
>gi|125606204|gb|EAZ45240.1| hypothetical protein OsJ_29883 [Oryza sativa Japonica Group]
Length = 350
Score = 257 bits (657), Expect = 5e-66, Method: Compositional matrix adjust.
Identities = 136/327 (41%), Positives = 191/327 (58%), Gaps = 26/327 (7%)
Query: 43 VVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTND 102
+++ E+WM +HGR+Y D EK+ RF++++ N+E +E N N YKL N+F+DLTN+
Sbjct: 27 MLDRFEQWMIRHGRAYTDAGEKQRRFEVYRRNVELVETFNSMSN-GYKLADNKFADLTNE 85
Query: 103 EFRALYTGYK--MPSPSHRSTTSSTFKYQNLSMTDV-PTSLDWRDKKAVTPIKDQQEC-- 157
EFRA G++ + P +T S+ S D+ P S+DWR+K AV I + C
Sbjct: 86 EFRAKMLGFRPHVTIPQISNTCSADIAMPGESSDDILPKSVDWRNKGAV--INRWKICVD 143
Query: 158 -GCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQG 216
G CWAFSAVAA+EGI +I L+ LSEQ+LVDC GCGGG M AFE+++ N G
Sbjct: 144 AGSCWAFSAVAAIEGINQIKNGELVSLSEQELVDCDDEA-VGCGGGYMSWAFEFVVGNHG 202
Query: 217 IATEDEYPYQAVQGTCSAAQ-KAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTT 275
+ TE YPY A G C AA+ +A I+ Y V E L +A + QPVS+ + +
Sbjct: 203 LTTEASYPYHAANGACQAAKLNQSAVAIAGYRNVTPSSEPDLARAAAAQPVSVAVDGGSF 262
Query: 276 EFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGAN----------YWLIKNSWGDTWGDA 325
F+ Y G++ G C ++H VT+VG+G +E + YW++KNSWG WGDA
Sbjct: 263 MFQLYGSGVYTGPCTADVNHGVTVVGYGESEPKTDGGGAAKGGEKYWIVKNSWGAEWGDA 322
Query: 326 GYMKILRD-----EGLCGIGTQSSYPL 347
GY+ + RD GLCGI SYP+
Sbjct: 323 GYILMQRDVAGLASGLCGIALLPSYPV 349
>gi|194352764|emb|CAQ00110.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
Length = 406
Score = 257 bits (657), Expect = 6e-66, Method: Compositional matrix adjust.
Identities = 148/392 (37%), Positives = 220/392 (56%), Gaps = 57/392 (14%)
Query: 10 SFKINTIPMFIII----ILLVSCASQ------VVSSRST------HEQSVVEMHEKWMAQ 53
S + +++ +++++ +LL C+S+ V+ S + H+ +++ WM
Sbjct: 10 SSRCSSLGLYVLLATSCLLLAGCSSESLLTSDVLPSEQSDIDTDNHQDLMMDRFHVWMTV 69
Query: 54 HGRSYKDELEKEMRFKIFKENLEYIEKANKEG---NRTYKLGTNRFSDLTNDEFRALYTG 110
H RSY EK RF++++ N+ +IE N E TY+LG F+DLTN+EF LYTG
Sbjct: 70 HNRSYSTAGEKARRFEVYRSNMRFIEAVNAEAATSGLTYELGEGPFTDLTNEEFMELYTG 129
Query: 111 YKMP-------------------SPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPI 151
+ S T Y N S + PTS+DWR + VTP+
Sbjct: 130 QILEDDQSEDGDDDEQIITTHAGSIDGLGTHKGATVYANFSAS-APTSIDWRKRGVVTPV 188
Query: 152 KDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYI 211
K+Q++CG CWAF VA +EGI KI L+ LSEQQL+DC +NGC GG + +AF++I
Sbjct: 189 KNQKQCGSCWAFPTVATIEGIHKIKRGTLVSLSEQQLIDCDYL-DNGCKGGLVTRAFQWI 247
Query: 212 IQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIA 271
+N GI + Y Y+AV+G C +K AAKI + +V S E +L+ AV+ QPV++ I+
Sbjct: 248 KKNGGITSTSSYKYKAVRGRCLRNRK-PAAKIVGFRKVKSNSEVSLMNAVANQPVAVSIS 306
Query: 272 AYTTEFKSYKEGIFNGVCG-TQLDHAVTIVGFG-----------TTEDGANYWLIKNSWG 319
++++ F YK GI+NG C T+L+HAVT+VG+G + GA YW++KNSWG
Sbjct: 307 SHSSHFHHYKGGIYNGPCSTTKLNHAVTVVGYGQQQQNGADSVHASAPGAKYWIVKNSWG 366
Query: 320 DTWGDAGYMKILR----DEGLCGIGTQSSYPL 347
TWGD GY+ + R G CGI T+ +PL
Sbjct: 367 TTWGDKGYILMKRGTKHSSGQCGIATRPVFPL 398
>gi|27728675|gb|AAO18731.1| cysteine protease [Gossypium hirsutum]
Length = 389
Score = 257 bits (656), Expect = 6e-66, Method: Compositional matrix adjust.
Identities = 134/318 (42%), Positives = 195/318 (61%), Gaps = 15/318 (4%)
Query: 40 EQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYI--EKANKEGNR-TYKLGTNRF 96
E+ V+E+ ++W +H + Y+ E E RF+ FK NL+YI A ++ N+ + +G N+F
Sbjct: 42 EERVLEIFQQWKEKHRKVYRHAEEAEKRFENFKGNLKYILERNAKRKANKWEHHVGLNKF 101
Query: 97 SDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQE 156
+D++N+EFR Y K+ P ++ T S + + D P+SLDWR+ VT +KDQ
Sbjct: 102 ADMSNEEFRKAYLS-KVKKPINKGITLSRNMRRKVQSCDAPSSLDWRNYGVVTAVKDQGS 160
Query: 157 CGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQG 216
CG CWAFS+ A+EGI + +LI LSEQ+LV+C T+ N GC GG M+ AFE++I N G
Sbjct: 161 CGSCWAFSSTGAMEGINALVTGDLISLSEQELVECDTS-NYGCEGGYMDYAFEWVINNGG 219
Query: 217 IATEDEYPYQAVQGTC-SAAQKAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTT 275
I +E +YPY V GTC + ++ I Y++V D ALL AV+ QPVS+GI
Sbjct: 220 IDSESDYPYTGVDGTCNTTKEETKVVSIDGYQDVEQSD-SALLCAVAQQPVSVGIDGSAI 278
Query: 276 EFKSYKEGIFNGVCG---TQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILR 332
+F+ Y GI++G C +DHAV IVG+G +ED YW++KNSWG +WG GY + R
Sbjct: 279 DFQLYTGGIYDGSCSDDPDDIDHAVLIVGYG-SEDSEEYWIVKNSWGTSWGIDGYFYLKR 337
Query: 333 DE----GLCGIGTQSSYP 346
D G+C + +SYP
Sbjct: 338 DTDLPYGVCAVNAMASYP 355
>gi|149617838|ref|XP_001521715.1| PREDICTED: cathepsin L1-like [Ornithorhynchus anatinus]
Length = 338
Score = 257 bits (656), Expect = 6e-66, Method: Compositional matrix adjust.
Identities = 149/343 (43%), Positives = 208/343 (60%), Gaps = 19/343 (5%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQSVVEMHEK-WMAQHGRSYKDELEKEMRFKIFKENLE 76
M +++ L+ C VS+ S ++ H K W H +SY E E+ R +++ENL+
Sbjct: 1 MNLLVCLVSLCWGLAVSAPLG--DSELDRHWKLWKNWHQKSYH-EAEEGWRRTVWEENLK 57
Query: 77 YIEKANKE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
I+ N E G TY+LG N+F DLTN+EF+ + TG + S +R S+ + +
Sbjct: 58 AIQLHNLEQSLGLHTYRLGMNQFGDLTNEEFQEILTGERHFSKGNRINGSA---FLEANF 114
Query: 134 TDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCS- 192
VPTS+DWRD VTP+K+Q CG CWAFS A+EG LI LSEQ LVDCS
Sbjct: 115 VQVPTSVDWRDHGYVTPVKNQGHCGSCWAFSTTGALEGQLFRKSGRLISLSEQNLVDCSW 174
Query: 193 TNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQ-GTCSAAQKAAAAKISNYEEVPS 251
GN GC GG ++ AF+YI+QNQGI +ED YPY A C+ + A A ++ + ++P
Sbjct: 175 QQGNQGCHGGIVDLAFQYILQNQGIDSEDCYPYTAKDTAQCTFKPECATAPVTGFVDIPP 234
Query: 252 GDEQALLKAV-SMQPVSIGIAAYTTEFKSYKEGIF-NGVCGTQ-LDHAVTIVGFG---TT 305
E+AL+KAV ++ PVS+GI A +T F+ Y+ GIF + C ++ LDHAV +VG+G
Sbjct: 235 HSEEALMKAVATVGPVSVGIDASSTSFRFYQSGIFYDPKCSSESLDHAVLVVGYGYERED 294
Query: 306 EDGANYWLIKNSWGDTWGDAGYMKILRDEG-LCGIGTQSSYPL 347
E G YW++KNSWG WGD GY+ + +D G CGI T +SYPL
Sbjct: 295 EAGKKYWIVKNSWGKHWGDRGYVYMSKDRGNHCGIATVASYPL 337
>gi|156399477|ref|XP_001638528.1| predicted protein [Nematostella vectensis]
gi|156225649|gb|EDO46465.1| predicted protein [Nematostella vectensis]
Length = 325
Score = 257 bits (656), Expect = 6e-66, Method: Compositional matrix adjust.
Identities = 143/337 (42%), Positives = 197/337 (58%), Gaps = 22/337 (6%)
Query: 16 IPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENL 75
+ ++ +L+ C S++ R H W HG++Y E E+++R I+ +NL
Sbjct: 5 LACLLVAVLIAQCFSELSQDRQWH---------AWKDFHGKTYTGE-EEDLRRAIWNDNL 54
Query: 76 EYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD 135
E ++K N E N +YKL N F+DLT EF+ + GY+ S ST STF LS
Sbjct: 55 EIVKKHNAE-NHSYKLDMNHFADLTVTEFKQRFMGYRAAS---NSTGGSTF--LPLSNVQ 108
Query: 136 VPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTN- 194
+P +DWRDK VT +K+Q +CG CWAFS+ ++EG L+ LSEQ LVDCS
Sbjct: 109 LPAEVDWRDKGFVTAVKNQGQCGSCWAFSSTGSLEGQHFRKTGKLVSLSEQNLVDCSKKY 168
Query: 195 GNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDE 254
GNNGC GG M+ AF+YI N GI TE YPY A G C + A ++ Y +V G E
Sbjct: 169 GNNGCEGGLMDYAFKYIKNNDGIDTEQSYPYTARDGQCHFKPGSVGATVTGYTDVQRGSE 228
Query: 255 QALLKAV-SMQPVSIGIAAYTTEFKSYKEGIFN--GVCGTQLDHAVTIVGFGTTEDGANY 311
L AV ++ P+S+ I A + F+ YK G+++ TQLDH V VG+G EDG +Y
Sbjct: 229 GDLQSAVATVGPISVAIDAGHSSFQLYKTGVYSEPDCSSTQLDHGVLAVGYG-AEDGKDY 287
Query: 312 WLIKNSWGDTWGDAGYMKILRD-EGLCGIGTQSSYPL 347
WL+KNSWG+ WG GY+K+ R+ + CGI TQ+SYPL
Sbjct: 288 WLVKNSWGEGWGMNGYIKMSRNKDNQCGIATQASYPL 324
>gi|125552927|gb|EAY98636.1| hypothetical protein OsI_20560 [Oryza sativa Indica Group]
Length = 449
Score = 257 bits (656), Expect = 6e-66, Method: Compositional matrix adjust.
Identities = 131/303 (43%), Positives = 186/303 (61%), Gaps = 8/303 (2%)
Query: 48 EKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRAL 107
E W A+HGRSY E+ R F +N ++ A+ +Y L N F+DLT+DEFRA
Sbjct: 39 EAWCAEHGRSYATPGERAARLAAFADNAAFV-AAHNGAPASYALALNAFADLTHDEFRAA 97
Query: 108 YTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVA 167
G + R + + VP ++DWR AVT +KDQ CG CW+FSA
Sbjct: 98 RLGRLAAAGPGRDGGAPYLGVDG-GVGAVPDAVDWRQSGAVTKVKDQGSCGACWSFSATG 156
Query: 168 AVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQA 227
A+EGI KI +LI LSEQ+L+DC + N+GCGGG M+ A++++++N GI TE +YPY+
Sbjct: 157 AMEGINKIKTGSLISLSEQELIDCDRSYNSGCGGGLMDYAYKFVVKNGGIDTEADYPYRE 216
Query: 228 VQGTCSAAQ-KAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFN 286
GTC+ + K I Y++VP+ +E LL+AV+ QPVS+GI F+ Y +GIF+
Sbjct: 217 TDGTCNKNKLKRRVVTIDGYKDVPANNEDMLLQAVAQQPVSVGICGSARAFQLYSKGIFD 276
Query: 287 GVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQ 342
G C T LDHA+ IVG+G +E G +YW++KNSWG++WG GYM + R+ G+CGI
Sbjct: 277 GPCPTSLDHAILIVGYG-SEGGKDYWIVKNSWGESWGMKGYMYMHRNTGNSNGVCGINQM 335
Query: 343 SSY 345
S+
Sbjct: 336 PSF 338
>gi|194352760|emb|CAQ00108.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
gi|326510977|dbj|BAJ91836.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326523875|dbj|BAJ96948.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326528631|dbj|BAJ97337.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 368
Score = 257 bits (656), Expect = 7e-66, Method: Compositional matrix adjust.
Identities = 136/344 (39%), Positives = 199/344 (57%), Gaps = 31/344 (9%)
Query: 33 VSSRSTHEQS--VVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNR--T 88
+SR E + + + +W A+H R+Y E+ R +++ N+ YIE N + T
Sbjct: 26 ATSRPATEDADPMAQRFRRWKAEHSRTYATPEEERHRLRVYARNMRYIEATNGDAGAGLT 85
Query: 89 YKLGTNRFSDLTNDEFRALYTGYKMPS-------PSHRSTTSSTFK-----------YQN 130
Y+LG ++DLT+DEF A+YT P P TT + Y N
Sbjct: 86 YELGETAYTDLTSDEFTAMYTSRAPPLSDDDDDLPMTMITTRAGPVAAAGGGGWLQVYVN 145
Query: 131 LSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVD 190
S P S+DWR++ AVT +K+Q +CG CWAFS VA +EGI +I L LSEQ+LVD
Sbjct: 146 ES-AGAPASVDWRERGAVTAVKNQGQCGSCWAFSTVAVIEGIHQIKTGKLASLSEQELVD 204
Query: 191 CSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKA-AAAKISNYEEV 249
C ++GC GG +A ++I N GI ++D+YPY A TC + + AA IS ++ V
Sbjct: 205 CD-KLDHGCNGGVSYRALQWITSNGGITSQDDYPYTAKDDTCDTKKLSHHAASISGFQRV 263
Query: 250 PSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTE-DG 308
+ E +L AV+MQPV++ I A F+ Y+ G++NG CGT+L+H VT+VG+G E G
Sbjct: 264 ATRSELSLTNAVAMQPVAVSIEAGGANFQHYRNGVYNGPCGTRLNHGVTVVGYGEDEVTG 323
Query: 309 ANYWLIKNSWGDTWGDAGYMK-----ILRDEGLCGIGTQSSYPL 347
+YW++KNSWG+ WGD GY++ I + EG+CGI + S+PL
Sbjct: 324 ESYWIVKNSWGEKWGDNGYLRMKKGIIDKPEGICGIAIRPSFPL 367
>gi|343978787|gb|AEM76722.1| cathepsin L-like proteinase [Triatoma brasiliensis]
Length = 330
Score = 256 bits (655), Expect = 8e-66, Method: Compositional matrix adjust.
Identities = 139/333 (41%), Positives = 198/333 (59%), Gaps = 14/333 (4%)
Query: 23 ILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKAN 82
+LLV+ A VS + E E + HG++YK++ E+ R KIF N + IE N
Sbjct: 3 VLLVAVAVIAVSCANRFYNINPEEWETFKVVHGKNYKNQFEEMFRRKIFMNNKKRIEAHN 62
Query: 83 ---KEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTS 139
++G +YK+ N F DL + E +AL G+KM +P+ + F S +P S
Sbjct: 63 AKYEQGEVSYKMKMNHFGDLMSHEIKALMNGFKM-TPNTKREGKIYFP----SNDKLPKS 117
Query: 140 LDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTN-GNNG 198
+DWR K AVTP+KDQ +CG CW+FSA ++EG + L+ LSEQ L+DCS GNNG
Sbjct: 118 VDWRQKGAVTPVKDQGQCGSCWSFSATGSLEGQIFLKKGKLVSLSEQNLMDCSKEYGNNG 177
Query: 199 CGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALL 258
C GG M+KAF+Y+ N+GI TE YPY+A C + Y ++P GDE+AL
Sbjct: 178 CEGGLMDKAFQYVSDNKGIDTESSYPYEARDYACRFKKDKVGGTDKGYVDIPEGDEKALQ 237
Query: 259 KAV-SMQPVSIGIAAYTTEFKSYKEGIFN-GVCGT-QLDHAVTIVGFGTTEDGANYWLIK 315
A+ ++ P+S+ I A F Y EG++N C + LDH V VG+G TE+G +YWL+K
Sbjct: 238 NALATVGPISVAIDASHESFHFYSEGVYNEPYCSSYDLDHGVLAVGYG-TENGQDYWLVK 296
Query: 316 NSWGDTWGDAGYMKILRDE-GLCGIGTQSSYPL 347
NSWG +WG++GY+KI R+ CGI + +SYP+
Sbjct: 297 NSWGPSWGESGYIKIARNHSNHCGIASMASYPI 329
>gi|326430490|gb|EGD76060.1| cysteine proteinase [Salpingoeca sp. ATCC 50818]
Length = 448
Score = 256 bits (655), Expect = 8e-66, Method: Compositional matrix adjust.
Identities = 135/341 (39%), Positives = 199/341 (58%), Gaps = 23/341 (6%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEY 77
M + ++L+ + ++ + + + + + + + Y+ E+ RF +F +N+++
Sbjct: 1 MMLKLVLVCALVGAAMAEPLSLTVNKGRLFDAFKTKFNKVYESAEEEARRFSVFSQNIDF 60
Query: 78 IEKANKEGNR---TYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT 134
I + N E R T+ + N+F+DLTN+E+R LY P P T + +
Sbjct: 61 INRHNAEAARGVHTHTVDVNQFADLTNEEYRQLYL---RPYP-----TELLGRERQEVWL 112
Query: 135 DVPT--SLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCS 192
D P S+DWR K AVTPIK+Q +CG CW+FS +VEG I+ NL+ LSEQQLVDCS
Sbjct: 113 DGPNAGSVDWRQKGAVTPIKNQGQCGSCWSFSTTGSVEGAHAIATGNLVSLSEQQLVDCS 172
Query: 193 TN-GNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAA-AAKISNYEEVP 250
+ GN GC GG M+ AF+YII N G+ TE +YPY A G C ++++ A IS Y++VP
Sbjct: 173 GSFGNQGCNGGLMDNAFKYIISNGGLDTEQDYPYTARDGVCDKSKESKHAVSISGYKDVP 232
Query: 251 SGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGAN 310
+E L AV PVS+ I A F+ Y G+F+G CGT LDH V +VG+ + +
Sbjct: 233 QNNEDQLAAAVEKGPVSVAIEADQQSFQMYSSGVFSGPCGTNLDHGVLVVGYTS-----D 287
Query: 311 YWLIKNSWGDTWGDAGYMKILR---DEGLCGIGTQSSYPLA 348
YW++KNSWG +WGD GY+ + R G+CGI Q SYP+A
Sbjct: 288 YWIVKNSWGASWGDQGYIMMKRGVSSAGICGIAMQPSYPIA 328
>gi|238816977|gb|ACR56863.1| cathepsin L-like cysteine proteinase [Delia coarctata]
Length = 338
Score = 256 bits (655), Expect = 8e-66, Method: Compositional matrix adjust.
Identities = 129/314 (41%), Positives = 186/314 (59%), Gaps = 13/314 (4%)
Query: 46 MHEKWMA---QHGRSYKDELEKEMRFKIFKENLEYIEKANK---EGNRTYKLGTNRFSDL 99
+ E+W +H ++Y E+E+ R KIF EN I K N+ +G ++KLG N+++D+
Sbjct: 23 IKEEWQTFKMEHRKNYLSEVEERFRMKIFNENRHKIAKHNQLYAQGKVSFKLGLNKYADM 82
Query: 100 TNDEFRALYTGYK--MPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQEC 157
+ EF+ GY M + Y + + VP ++DWR AVT +KDQ C
Sbjct: 83 LHHEFKETMNGYNHTMRKELRAQEGFNGITYISPANVQVPKAVDWRQHGAVTSVKDQGHC 142
Query: 158 GCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTN-GNNGCGGGTMEKAFEYIIQNQG 216
G CW+FS+ ++EG L+ LSEQ LVDCST GNNGC GG M+ AF YI N G
Sbjct: 143 GSCWSFSSTGSLEGQHFRKAGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGG 202
Query: 217 IATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAV-SMQPVSIGIAAYTT 275
+ TE YPY+ + +C + A + + ++P GDE+A++KAV +M PV++ I A
Sbjct: 203 VDTEKSYPYEGIDDSCHFNKATVGATDTGFVDIPQGDEEAMMKAVATMGPVAVAIDASNE 262
Query: 276 EFKSYKEGIFN--GVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD 333
F+ Y EG++N LDH V +VG+GT +DG +YWL+KNSWG TWGD GY+K+ R+
Sbjct: 263 SFQLYSEGVYNDPNCSSDNLDHGVLVVGYGTDKDGQDYWLVKNSWGTTWGDQGYIKMARN 322
Query: 334 -EGLCGIGTQSSYP 346
+ CGI T SS+P
Sbjct: 323 QDNQCGIATASSFP 336
>gi|320169658|gb|EFW46557.1| cathepsin L2 [Capsaspora owczarzaki ATCC 30864]
Length = 324
Score = 256 bits (655), Expect = 9e-66, Method: Compositional matrix adjust.
Identities = 135/306 (44%), Positives = 187/306 (61%), Gaps = 11/306 (3%)
Query: 48 EKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRAL 107
+ W A HG SY E+ R I++ NL++IEK N EG+ +YKL N+F+DLT EF A
Sbjct: 23 DSWKATHGVSYATVGEETARRGIYRANLDFIEKHNSEGH-SYKLAVNKFADLTYPEFAAK 81
Query: 108 YTGYKMPSP-SHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAV 166
Y G + + + +S +ST+ + M +P S+DWR VTPIKDQ +CG CW+FS
Sbjct: 82 YLGLRFDATNATKSFAASTYLPR---MVSLPDSVDWRTAGIVTPIKDQGQCGSCWSFSTT 138
Query: 167 AAVEGITKISGANLIQLSEQQLVDCST-NGNNGCGGGTMEKAFEYIIQNQGIATEDEYPY 225
+VEG L+ LSEQ LVDCS+ GN GC GG M++AF+YII N GI TE YPY
Sbjct: 139 GSVEGQHARKTGQLVSLSEQNLVDCSSAQGNAGCNGGLMDQAFQYIISNNGIDTESSYPY 198
Query: 226 QAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAV-SMQPVSIGIAAYTTEFKSYKEGI 284
A GTC A +++Y+++ SG E L AV ++ P+S+ I A F+ Y G+
Sbjct: 199 TAQDGTCQFNSANVGATVASYQDIASGSESDLQNAVATVGPISVAIDASQPSFQFYSSGV 258
Query: 285 FN--GVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD-EGLCGIGT 341
+N +QLDH V VG+GT+ ++YWL+KNSWG +WG +GY+ + R+ CGI T
Sbjct: 259 YNEPACSSSQLDHGVLAVGYGTS-GSSDYWLVKNSWGTSWGQSGYIWMTRNSNNQCGIAT 317
Query: 342 QSSYPL 347
+SYPL
Sbjct: 318 AASYPL 323
>gi|242093994|ref|XP_002437487.1| hypothetical protein SORBIDRAFT_10g027980 [Sorghum bicolor]
gi|241915710|gb|EER88854.1| hypothetical protein SORBIDRAFT_10g027980 [Sorghum bicolor]
Length = 341
Score = 256 bits (655), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 136/317 (42%), Positives = 197/317 (62%), Gaps = 32/317 (10%)
Query: 40 EQSVVEMHEKWMAQHGRSYKD-ELEKEMRFKIFKENLEYIEKANKE---GNRTYKLGTNR 95
++ V ++++ W ++HGR + +R K+F++NL YI+ N E G T++LG
Sbjct: 44 DEEVRQLYKTWKSEHGRPRDGISVADGLRLKVFRDNLRYIDAHNAEADAGLHTFRLGLTP 103
Query: 96 FSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQ 155
F+DLT +EFRA G+ + S R + +Y + D+P ++DWR + AVT +K+Q
Sbjct: 104 FTDLTLEEFRAHALGF-LNSTLPRVASD---RYLPRAGDDLPDAVDWRQQGAVTGVKNQL 159
Query: 156 ECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQ 215
+CG CWAFSAVAA+EGI KI NLI LSEQ+L+DC T + GC GG M+KAF+++I N
Sbjct: 160 DCGGCWAFSAVAAMEGINKIVTNNLISLSEQELIDCDTE-DYGCQGGEMQKAFQFVIDNG 218
Query: 216 GIATEDEYPYQAVQGTCSA-AQKAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYT 274
GI TE +YP+ GTC A +K I +YE VP+ DE+AL KAV+ QP
Sbjct: 219 GIDTEADYPFIGTNGTCDAIREKRKVVSIDSYENVPTNDEEALQKAVANQP--------- 269
Query: 275 TEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD- 333
GIFNG CG LDH VT VG+G +++G ++W++KNSWG WG++GY+++ R+
Sbjct: 270 --------GIFNGPCGFILDHGVTAVGYG-SDNGEDFWIVKNSWGAEWGESGYIRMKRNV 320
Query: 334 ---EGLCGIGTQSSYPL 347
G CGI +SYP+
Sbjct: 321 LLPMGKCGIAMYASYPV 337
>gi|226499884|ref|NP_001148278.1| thiol protease SEN102 precursor [Zea mays]
gi|195617112|gb|ACG30386.1| thiol protease SEN102 precursor [Zea mays]
Length = 374
Score = 256 bits (654), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 140/344 (40%), Positives = 196/344 (56%), Gaps = 27/344 (7%)
Query: 29 ASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNR- 87
A + S S + S++E ++W A + +SY E+ RF+++ N+ YIE N E
Sbjct: 32 AGDTMGSMSNDDSSMIERFQRWKAAYNKSYATVAEERRRFRVYARNMAYIEATNAEAEAA 91
Query: 88 --TYKLGTNRFSDLTNDEFRALYTGYKMPS-PSHRSTTSSTFK--------------YQN 130
TY+LG ++DLTN EF A+YT + P+ S ++ Y N
Sbjct: 92 GLTYELGETAYTDLTNQEFMAMYTAPALAQLPADESVITTRAGPVDAVGGAPGQLPVYVN 151
Query: 131 LSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVD 190
LS + P S+DWR AVTP+K+Q CG CWAFS VA VEGI +I L+ LSEQ+LVD
Sbjct: 152 LSAS-APASVDWRASGAVTPVKNQGRCGSCWAFSTVAVVEGIYQIRTGKLVSLSEQELVD 210
Query: 191 CSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAA-AAKISNYEEV 249
C T ++GC GG +A +I N GI TE +YPY C+ A+ + A I+ V
Sbjct: 211 CDTL-DDGCDGGISYRALRWIASNGGITTEADYPYTGTTDACNRAKLSHNAVSIAGLRRV 269
Query: 250 PSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGT-TEDG 308
+ E +L AV+ QPV++ I A F+ YK+G++NG CGT L+H VT+VG+G G
Sbjct: 270 ATRSEASLANAVAGQPVAVSIEAGGDNFQHYKKGVYNGPCGTNLNHGVTVVGYGQEAAAG 329
Query: 309 ANYWLIKNSWGDTWGDAGYMKILRD-----EGLCGIGTQSSYPL 347
YW++KNSWG WGD GY+++ +D EGLCGI + SYPL
Sbjct: 330 DRYWIVKNSWGQGWGDDGYIRMKKDVAGKPEGLCGIAIRPSYPL 373
>gi|443698586|gb|ELT98517.1| hypothetical protein CAPTEDRAFT_128252 [Capitella teleta]
Length = 324
Score = 256 bits (654), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 141/332 (42%), Positives = 201/332 (60%), Gaps = 17/332 (5%)
Query: 24 LLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANK 83
+L C + ++S ++++ EM + H ++Y E E RF I++ +L I + N
Sbjct: 1 MLACCIAATLASPLVFDEALDEMWTLFKTTHSKTYATEAEDMRRF-IWERHLNMINQHNI 59
Query: 84 E---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSL 140
E G T+ LG N + DLT E+ A+ +GYKM + S SS + +NL VP ++
Sbjct: 60 EADLGKHTFSLGMNEYGDLTQHEYAAM-SGYKM---AKSSVGSSFLEPENLQ---VPKTV 112
Query: 141 DWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTN-GNNGC 199
DWR+K VTP+K+Q +CG CWAFS+ ++EG L +SEQ LVDCS + GN GC
Sbjct: 113 DWREKGYVTPVKNQGQCGSCWAFSSTGSLEGQVFRKTGRLPSISEQNLVDCSRDEGNMGC 172
Query: 200 GGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLK 259
GG M+ AF YI +N GI +E YPY+AV G C + + S + ++P GDE AL
Sbjct: 173 SGGLMDNAFTYIKKNMGIDSEKSYPYEAVDGECRYKKSDSVTTDSGFVDIPHGDETALRT 232
Query: 260 AV-SMQPVSIGIAAYTTEFKSYKEGIFN--GVCGTQLDHAVTIVGFGTTEDGANYWLIKN 316
AV S+ PVS+ I A T F+ YK G++ TQLDH V +VG+G E+G +YWL+KN
Sbjct: 233 AVASVGPVSVAIDASHTSFQFYKTGVYTEANCSSTQLDHGVLVVGYG-VENGQDYWLVKN 291
Query: 317 SWGDTWGDAGYMKILRDEG-LCGIGTQSSYPL 347
SWG +WG+AGY+K+ R+ G CGI +Q+SYPL
Sbjct: 292 SWGASWGEAGYIKLARNHGNQCGIASQASYPL 323
>gi|224079085|ref|XP_002305743.1| predicted protein [Populus trichocarpa]
gi|222848707|gb|EEE86254.1| predicted protein [Populus trichocarpa]
Length = 494
Score = 256 bits (654), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 138/317 (43%), Positives = 197/317 (62%), Gaps = 14/317 (4%)
Query: 40 EQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYI-EKANKEGNRTYKLGTNRFSD 98
++S++E+ ++W +H ++YK E E RF FK NL+YI EK KE +++G N+F+D
Sbjct: 36 DESIIEIFQQWRDRHQKAYKHAEEAEKRFGNFKRNLKYIIEKTGKETTLRHRVGLNKFAD 95
Query: 99 LTNDEFRALYTGYKMPSPSHRSTTSSTFK-YQNLSMTDVPTSLDWRDKKAVTPIKDQQEC 157
L+N+EF+ LY K+ P +++ + + +NL D P+SLDWR K VT +KDQ +C
Sbjct: 96 LSNEEFKQLYLS-KVKKPINKTRIDAEDRSRRNLQSCDAPSSLDWRKKGVVTAVKDQGDC 154
Query: 158 GCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGI 217
G CW+FS A+EGI I ++LI LSEQ+LVDC T N GC GG M+ AFE++I N GI
Sbjct: 155 GSCWSFSTTGAIEGINAIVTSDLISLSEQELVDCDTT-NYGCEGGYMDYAFEWVINNGGI 213
Query: 218 ATEDEYPYQAVQGTC-SAAQKAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTE 276
TE YPY V GTC +A ++ I Y++V D ALL A + QP+S+GI +
Sbjct: 214 DTEANYPYTGVDGTCNTAKEEIKVVSIDGYKDVDETD-SALLCAAAQQPISVGIDGSAID 272
Query: 277 FKSYKEGIF---NGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD 333
F+ Y GI+ +DHAV IVG+G +E+G +YW++KNSWG +WG GY I R+
Sbjct: 273 FQLYTGGIYDGDCSDDPDDIDHAVLIVGYG-SENGEDYWIVKNSWGTSWGIEGYFYIKRN 331
Query: 334 E----GLCGIGTQSSYP 346
G+C I +SYP
Sbjct: 332 TDLPYGVCAINAMASYP 348
>gi|55740402|gb|AAV63977.1| cathepsin L precursor [Artemia franciscana]
Length = 338
Score = 256 bits (654), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 136/335 (40%), Positives = 203/335 (60%), Gaps = 11/335 (3%)
Query: 21 IIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEK 80
+I LL + Q+ ++ S E H + A H + Y +LE++ R KI+ EN + K
Sbjct: 6 LIFLLGAVLVQLSAALSLTNLLADEWH-LFKATHKKEYPSQLEEKFRMKIYLENKHKVAK 64
Query: 81 AN---KEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
N ++G ++Y++ N+F DL + EFR++ GY+ + S STF + + +VP
Sbjct: 65 HNILYEKGEKSYQVAMNKFGDLLHHEFRSIMNGYQHKK-QNSSRAESTFTFMEPANVEVP 123
Query: 138 TSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTN-GN 196
S+DWR+K A+TP+KDQ +CG CWAFS+ A+EG T LI LSEQ L+DCS GN
Sbjct: 124 ESVDWREKGAITPVKDQGQCGSCWAFSSTGALEGQTFRKTGKLISLSEQNLIDCSGKYGN 183
Query: 197 NGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQA 256
GC GG M++AF+YI N+GI TE+ YPY+A C + A + ++PSG+E
Sbjct: 184 EGCNGGLMDQAFQYIKDNKGIDTENTYPYEAEDDVCRYNPRNRGAVDRGFVDIPSGEEDK 243
Query: 257 LLKAV-SMQPVSIGIAAYTTEFKSYKEGI-FNGVCGT-QLDHAVTIVGFGTTEDGANYWL 313
L AV ++ PVS+ I A F+ Y +G+ + C + LDH V +VG+G +++G +YWL
Sbjct: 244 LKAAVATVGPVSVAIDASHESFQFYSKGVYYEPSCDSDDLDHGVLVVGYG-SDNGKDYWL 302
Query: 314 IKNSWGDTWGDAGYMKILRD-EGLCGIGTQSSYPL 347
+KNSW + WGD GY+KI R+ + CG+ T +SYPL
Sbjct: 303 VKNSWSEHWGDEGYIKIARNRKNHCGVATAASYPL 337
>gi|66270077|gb|AAY43368.1| cysteine protease [Phytophthora infestans]
Length = 510
Score = 256 bits (654), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 137/305 (44%), Positives = 181/305 (59%), Gaps = 11/305 (3%)
Query: 50 WMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRT-YKLGTNRFSDLTNDEFRALY 108
WM H S+ D LE R + + N YI + N E T KL N FS ++ +EF+
Sbjct: 32 WMKTHSVSFSDALEFAKRLENYIANDMYIMEHNLENAWTGVKLDHNEFSSMSFEEFKFKM 91
Query: 109 TGYKMPSPSHRSTTSSTFKYQNL-SMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVA 167
TGY MP +S + NL S VP S+DW+DK VTP+K+Q CG CWAFS
Sbjct: 92 TGYVMPEGYLEQRLAS--RVDNLWSDVQVPDSVDWQDKGGVTPVKNQGMCGSCWAFSTTG 149
Query: 168 AVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQA 227
AVEG +S L+ LSEQ+LVDC NG+ GC GG M+ AF +I N GI +ED+Y Y+A
Sbjct: 150 AVEGAAFVSSGKLVSLSEQELVDCDHNGDMGCNGGLMDHAFAWIEDNGGICSEDDYEYKA 209
Query: 228 VQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNG 287
C +K KIS +++V DE AL AV+ QPVS+ I A F+ YK G+FN
Sbjct: 210 KAQVCRDCEK--VVKISGFQDVNPQDEHALKVAVAQQPVSVAIEADQKAFQFYKSGVFNL 267
Query: 288 VCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRDE----GLCGIGTQS 343
CGT+LDH V VG+G +E+G +W +KNSWG +WG+ GY+++ R+E G CGI +
Sbjct: 268 TCGTRLDHGVLAVGYG-SENGQKFWKVKNSWGSSWGEKGYIRLAREENGPAGQCGIASVP 326
Query: 344 SYPLA 348
SYP A
Sbjct: 327 SYPFA 331
>gi|301116794|ref|XP_002906125.1| cysteine protease family C01A, putative [Phytophthora infestans T30-4]
gi|262107474|gb|EEY65526.1| cysteine protease family C01A, putative [Phytophthora infestans T30-4]
Length = 535
Score = 256 bits (653), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 137/305 (44%), Positives = 181/305 (59%), Gaps = 11/305 (3%)
Query: 50 WMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRT-YKLGTNRFSDLTNDEFRALY 108
WM H S+ D LE R + + N YI + N E T KL N FS ++ +EF+
Sbjct: 32 WMKTHSVSFSDALEFAKRLENYIANDMYIMEHNLENAWTGVKLDHNEFSSMSFEEFKFKM 91
Query: 109 TGYKMPSPSHRSTTSSTFKYQNL-SMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVA 167
TGY MP +S + NL S VP S+DW+DK VTP+K+Q CG CWAFS
Sbjct: 92 TGYVMPEGYLEQRLAS--RVDNLWSDVQVPDSVDWQDKGGVTPVKNQGMCGSCWAFSTTG 149
Query: 168 AVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQA 227
AVEG +S L+ LSEQ+LVDC NG+ GC GG M+ AF +I N GI +ED+Y Y+A
Sbjct: 150 AVEGAAFVSSGKLVSLSEQELVDCDHNGDMGCNGGLMDHAFAWIEDNGGICSEDDYEYKA 209
Query: 228 VQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNG 287
C +K KIS +++V DE AL AV+ QPVS+ I A F+ YK G+FN
Sbjct: 210 KAQVCRDCEK--VVKISGFQDVNPQDEHALKVAVAQQPVSVAIEADQKAFQFYKSGVFNL 267
Query: 288 VCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRDE----GLCGIGTQS 343
CGT+LDH V VG+G +E+G +W +KNSWG +WG+ GY+++ R+E G CGI +
Sbjct: 268 TCGTRLDHGVLAVGYG-SENGQKFWKVKNSWGSSWGEKGYIRLAREENGPAGQCGIASVP 326
Query: 344 SYPLA 348
SYP A
Sbjct: 327 SYPFA 331
>gi|125564712|gb|EAZ10092.1| hypothetical protein OsI_32402 [Oryza sativa Indica Group]
Length = 382
Score = 255 bits (652), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 141/375 (37%), Positives = 203/375 (54%), Gaps = 43/375 (11%)
Query: 15 TIPMFIII--ILLVSCAS----QVVSSRSTHEQ------SVVEMHEKWMAQHGRSYKDEL 62
++P +I+ + + C+S +V S + + +++EM ++W A++ RSY
Sbjct: 8 SMPCLLILLGVFFIGCSSGTARRVTSDTAANTDGEPAATTMMEMFQRWKAEYNRSYATPE 67
Query: 63 EKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTT 122
E+ R +++ N+ YIE N Y+LG ++DLTNDEF A+YT + S +
Sbjct: 68 EERRRLRVYARNVRYIEATNAAAGLAYELGETAYTDLTNDEFMAMYTAPPLRSAADDDDD 127
Query: 123 SSTFKYQNLSMTDV----------------PTSLDWRDKKAVTPIKDQQECGCCWAFSAV 166
++T V P S+DWR AVT +KDQ CG CWAFS V
Sbjct: 128 AATTTIITTRAGPVDEHQQPEVYFNESAGAPASVDWRASGAVTEVKDQGRCGSCWAFSTV 187
Query: 167 AAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQ 226
A VEGI KI L+ LSEQ+LVDC T ++GC GG +A E+I N GI T D+YPY
Sbjct: 188 AVVEGIQKIKKGKLVSLSEQELVDCDTL-DSGCDGGVSYRALEWITANGGITTRDDYPYT 246
Query: 227 A-VQGTCSAAQKA-AAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGI 284
C A+ AA I+ V + E +L A + QPV++ I A F+ Y++G+
Sbjct: 247 GAAAAACDRAKLGHHAATIAGLRRVATRSEASLQNAAAAQPVAVSIEAGGDNFQHYRKGV 306
Query: 285 FNGVCGTQLDHAVTIVGFGTTE-------DGANYWLIKNSWGDTWGDAGYMKILRD---- 333
++G CGT+L+H VT+VG+G E G YW+IKNSWG WGD GY+K+ +D
Sbjct: 307 YDGPCGTRLNHGVTVVGYGQEEAPVDGSAAGDKYWIIKNSWGKNWGDQGYIKMKKDVAGK 366
Query: 334 -EGLCGIGTQSSYPL 347
EGLCGI + S+PL
Sbjct: 367 PEGLCGIAIRPSFPL 381
>gi|413953665|gb|AFW86314.1| hypothetical protein ZEAMMB73_546353 [Zea mays]
Length = 233
Score = 255 bits (652), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 116/226 (51%), Positives = 155/226 (68%), Gaps = 6/226 (2%)
Query: 126 FKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSE 185
F+Y+N+S +PT++DWR K AVTPIKDQ +CGCCWAFSAVAA EGI KIS L+ L+E
Sbjct: 7 FRYENVSADALPTTIDWRTKGAVTPIKDQGQCGCCWAFSAVAATEGIVKISTGKLVSLAE 66
Query: 186 QQLVDCSTNG-NNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKIS 244
Q+LVDC + + GC GG M+ AF++II+N G+ TE YPY A G C + +AA I
Sbjct: 67 QELVDCDVHDEDQGCEGGLMDDAFKFIIKNGGLTTESSYPYTAADGKCKSGSNSAAT-IK 125
Query: 245 NYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGT 304
YE+VP+ DE AL+KAV+ QPVS+ + F+ Y G+ G CGT LDH + +G+G
Sbjct: 126 GYEDVPANDEAALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYGK 185
Query: 305 TEDGANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYP 346
T DG YWL+KNSWG TWG+ GY+++ +D G+CG+ + SYP
Sbjct: 186 TSDGTKYWLMKNSWGTTWGENGYLRMEKDISDKRGMCGLAMEPSYP 231
>gi|449530091|ref|XP_004172030.1| PREDICTED: vignain-like [Cucumis sativus]
Length = 351
Score = 255 bits (652), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 136/351 (38%), Positives = 208/351 (59%), Gaps = 21/351 (5%)
Query: 12 KINTIPMFIIIILLVSCASQVVSSRSTH-EQSVVEMHEKWMAQHGRSYKDELEKEMRFKI 70
K +P+ +I L C S + + E+S+++++++W + H R ++ E RFK+
Sbjct: 5 KFLIVPLVLIAFLCNICESFELERKDFESEKSLMQLYKRWSSHH-RISRNANEMHNRFKV 63
Query: 71 FKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALY----TGYKMPSPSHRSTTSST- 125
FK N +++ K N G ++ KL N+F+D+++DEFR +Y T YK T
Sbjct: 64 FKNNAKHVFKVNLMG-KSLKLKLNQFADMSDDEFRNMYSSNITYYKDLHAKKIEATGGRI 122
Query: 126 --FKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQL 183
F Y++ + ++P+S+DWR K AV IK+Q CG CWAF+AVAAVE I +I L+ L
Sbjct: 123 GGFMYEHAN--NIPSSIDWRKKGAVNAIKNQGRCGSCWAFAAVAAVESIHQIKTNELVSL 180
Query: 184 SEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTC-SAAQKAAAAK 242
SE++++DC + GC GG AFE+++ N G+ ED YPY G C + +
Sbjct: 181 SEEEVLDCDYR-DGGCRGGFYNSAFEFMMDNDGVTIEDNYPYYEGNGYCRRRGGRNKRVR 239
Query: 243 ISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIF--NGVCGTQLDHAVTIV 300
I YE VP +E AL+KAV+ QPV++ IA+ ++FK Y G+F N CG +DH V +V
Sbjct: 240 IDGYENVPRNNEYALMKAVAHQPVAVAIASGGSDFKFYGGGMFTENDFCGFNIDHTVVVV 299
Query: 301 GFGTTEDGANYWLIKNSWGDTWGDAGYMKILR----DEGLCGIGTQSSYPL 347
G+GT EDG +YW+I+N +G WG GYMK+ R +G+CG+ Q +YP+
Sbjct: 300 GYGTDEDG-DYWIIRNQYGHRWGMNGYMKMQRGAHSPQGVCGMAMQPAYPV 349
>gi|324983200|gb|ADY68475.1| stem bromelain [Ananas comosus]
Length = 291
Score = 254 bits (650), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 126/287 (43%), Positives = 182/287 (63%), Gaps = 6/287 (2%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEY 77
+F+ + L V AS +SR +++ E+WMA++GR YKD EK RF+IFK N+ +
Sbjct: 8 VFLFLFLCVMWASPSAASRDEPSDPMMKRFEEWMAEYGRVYKDNDEKMRRFQIFKNNVNH 67
Query: 78 IEKANKEGNRTYKLGTNRFSDLTNDEFRALYTG-YKMPSPSHRSTTSSTFKYQNLSMTDV 136
IE N +Y LG N+F+D+TN+EF A YTG P + S + +++++ V
Sbjct: 68 IETFNNRNGNSYTLGINKFTDMTNNEFVAQYTGGISRPLNIEKEPVVS---FDDVNISAV 124
Query: 137 PTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGN 196
S+DWRD AVT +KDQ CG CWAFSA+A VEGI KI L+ LSEQ+++DC+ +
Sbjct: 125 GQSIDWRDYGAVTEVKDQNPCGSCWAFSAIATVEGIYKIVTGYLVSLSEQEVLDCAVS-- 182
Query: 197 NGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQA 256
NGC GG ++ A+++II N G+A+E +YPYQA QG C+A +A I+ Y V S DE +
Sbjct: 183 NGCDGGFVDNAYDFIISNNGVASEADYPYQAYQGDCAANSWPNSAYITGYSYVRSNDESS 242
Query: 257 LLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFG 303
+ AV QP++ I A F+ Y G+F+G CGT L+HA+TI+G+G
Sbjct: 243 MKYAVWNQPIAAAIDASGDNFQYYNGGVFSGPCGTSLNHAITIIGYG 289
>gi|222641485|gb|EEE69617.1| hypothetical protein OsJ_29194 [Oryza sativa Japonica Group]
Length = 360
Score = 254 bits (650), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 138/347 (39%), Positives = 199/347 (57%), Gaps = 28/347 (8%)
Query: 11 FKINTIPMFIIIILLVSC----ASQVVSSRSTH-------EQSVVEMHEKWMAQHGRSYK 59
F P + + LL SC A+ ++ +R+T + +++ W H RSY
Sbjct: 4 FMACASPPVLTLALLASCGALLATSMLPARATAGSCLDVGDMVMMDRFRAWQGAHNRSYP 63
Query: 60 DELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKM---PSP 116
E RF +++ N E+I+ N G+ TY+L N F+DLT +EF A YTGY P
Sbjct: 64 SAEEALQRFDVYRRNAEFIDAVNLRGDLTYQLAENEFADLTEEEFLATYTGYYAGDGPVD 123
Query: 117 SHRSTT-----SSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQE-CGCCWAFSAVAAVE 170
TT ++F Y+ DVP S+DWR + AV P K Q C CWAF A +E
Sbjct: 124 DSVITTGAGDVDASFSYR----VDVPASVDWRAQGAVVPPKSQTSTCSSCWAFVTAATIE 179
Query: 171 GITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQG 230
+ I L+ LSEQQLVDC + + GC G+ +A++++++N G+ TE +YPY A +G
Sbjct: 180 SLNMIKTGKLVSLSEQQLVDCDSY-DGGCNLGSYGRAYKWVVENGGLTTEADYPYTARRG 238
Query: 231 TCSAAQKA-AAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVC 289
C+ A+ A AAKI+ + +VP +E AL AV+ QPV++ I + + YK G++ G C
Sbjct: 239 PCNRAKSAHHAAKITGFGKVPPRNEAALQAAVARQPVAVAIEV-GSGMQFYKGGVYTGPC 297
Query: 290 GTQLDHAVTIVGFGT-TEDGANYWLIKNSWGDTWGDAGYMKILRDEG 335
GT+L HAVT+VG+GT GA YW IKNSWG +WG+ GY++ILRD G
Sbjct: 298 GTRLAHAVTVVGYGTDASSGAKYWTIKNSWGQSWGERGYIRILRDVG 344
>gi|5081735|gb|AAD39513.1|AF147207_1 cathepsin L-like protease precursor [Artemia franciscana]
Length = 338
Score = 254 bits (650), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 137/335 (40%), Positives = 201/335 (60%), Gaps = 11/335 (3%)
Query: 21 IIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEK 80
+I LL + Q+ ++ S E H + A H + Y +LE++ R KI+ EN + K
Sbjct: 6 LIFLLGAVLVQLSAALSLTNLLADEWH-LFKATHKKEYPSQLEEKFRMKIYLENKHKVAK 64
Query: 81 AN---KEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
N ++G ++Y++ N+F DL + EFR++ GY+ + S STF + + +VP
Sbjct: 65 HNILYEKGEKSYQVAMNKFGDLLHHEFRSIMNGYQHKK-QNSSRAESTFTFMEPANVEVP 123
Query: 138 TSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTN-GN 196
S+DWR K A+TP+KDQ +CG CWAFS+ A+EG T LI LSEQ L+DCS GN
Sbjct: 124 ESVDWRVKGAITPVKDQGQCGSCWAFSSTGALEGQTFRKTGKLISLSEQNLIDCSGKYGN 183
Query: 197 NGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQA 256
GC GG M++AF+YI N+GI TE+ YPY+A C + A + +PSG+E
Sbjct: 184 EGCNGGLMDQAFQYIKDNKGIDTENTYPYEAEDNVCRYNPRNRGAIDRGFVHIPSGEEDK 243
Query: 257 LLKAV-SMQPVSIGIAAYTTEFKSYKEGI-FNGVCGT-QLDHAVTIVGFGTTEDGANYWL 313
L AV ++ PVS+ I A F+ Y +G+ + C + LDH V +VG+G +++G +YWL
Sbjct: 244 LKAAVATVGPVSVAIDASHESFQFYSKGVYYEPSCDSDDLDHGVLVVGYG-SDNGKDYWL 302
Query: 314 IKNSWGDTWGDAGYMKILRD-EGLCGIGTQSSYPL 347
+KNSW + WGD GY+KI R+ + CGI T +SYPL
Sbjct: 303 VKNSWSEHWGDEGYIKIARNRKNHCGIATAASYPL 337
>gi|186701255|gb|ACC91281.1| putative cysteine proteinase [Capsella rubella]
Length = 324
Score = 254 bits (650), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 141/342 (41%), Positives = 203/342 (59%), Gaps = 42/342 (12%)
Query: 15 TIPMFIIIILLVSCASQ--VVSSRSTHEQSVVEMHEKWMAQHGRSYKDEL-EKEMRFKIF 71
T+ + II +L S A V S + V + + WM++HG++Y + L +KE RF+ F
Sbjct: 11 TLSLLIIFLLPPSSAMDLSVTSGGLRSNEEVGFIFQTWMSKHGKTYTNALGDKEQRFQNF 70
Query: 72 KENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNL 131
K+NL +I++ N + N +Y+LG +F+DLT E++ L++G + + T +Y L
Sbjct: 71 KDNLRFIDQHNAK-NLSYRLGLTQFADLTVQEYQDLFSGRPI---QKQKALRVTHRYVPL 126
Query: 132 SMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDC 191
+ +P S+DWR K AV+ IKDQ C VE I KI LI LSEQ+LVDC
Sbjct: 127 AEDQLPQSVDWRQKGAVSEIKDQGRC----------TVESINKIVTGELISLSEQELVDC 176
Query: 192 STNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAA--AAKISNYEEV 249
S + N+GC GG M+ AF+++I N G+ + +YPYQAVQG C+ Q + KI YE+V
Sbjct: 177 SID-NHGCNGGLMDSAFQFLINNNGLEYQSDYPYQAVQGYCNHNQNTSKKVIKIDGYEDV 235
Query: 250 PSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGA 309
P+ +E +L KAV+ QP GI+ G CGT LDHAV IVG+G TE+G
Sbjct: 236 PANNENSLQKAVAHQP-----------------GIYTGPCGTDLDHAVVIVGYG-TENGQ 277
Query: 310 NYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYPL 347
+YW+++NSWG WG+AGY KI R+ G+CGI +SYP+
Sbjct: 278 DYWIVRNSWGTVWGEAGYAKIARNFENPTGVCGIAMVASYPI 319
>gi|30023547|gb|AAO48766.2| cathepsin L-like cysteine proteinase [Tenebrio molitor]
Length = 337
Score = 254 bits (649), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 139/341 (40%), Positives = 192/341 (56%), Gaps = 19/341 (5%)
Query: 19 FIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMA---QHGRSYKDELEKEMRFKIFKENL 75
F+I + + SQ VS + E+W A H + Y+ E E+ R KIF EN
Sbjct: 3 FLIFLAICVAGSQAVSFFDL-------VQEQWGAFKMTHNKQYQSETEERFRMKIFMENS 55
Query: 76 EYIEKANK---EGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSS-TFKYQNL 131
+ K NK +G ++KLG N+++D+ + EF + G+ RS S + +
Sbjct: 56 HTVAKHNKLYAQGLVSFKLGINKYADMLHHEFVQVLNGFNRTKSGLRSGESDDSVTFLPP 115
Query: 132 SMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDC 191
+ +P +DWRDK AVTP+KDQ +CG CW+FSA ++EG L+ LSEQ LVDC
Sbjct: 116 ANVQLPGQIDWRDKGAVTPVKDQGQCGSCWSFSATGSLEGQHFRQSGKLVSLSEQNLVDC 175
Query: 192 STN-GNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVP 250
S GNNGC GG M+ AF YI N GI TE YPY+A C K A Y ++
Sbjct: 176 SEKFGNNGCNGGLMDNAFRYIKANGGIDTEQAYPYKAEDEKCHYKPKNKGATDRGYVDIE 235
Query: 251 SGDEQALLKAV-SMQPVSIGIAAYTTEFKSYKEGIF--NGVCGTQLDHAVTIVGFGTTED 307
SG+E L AV ++ PVS+ I A F+ Y G++ +QLDH V +VG+GT +D
Sbjct: 236 SGNEDKLQSAVATVGPVSVAIDASHQSFQLYSGGVYYEPDCSASQLDHGVLVVGYGTEDD 295
Query: 308 GANYWLIKNSWGDTWGDAGYMKILRDE-GLCGIGTQSSYPL 347
G +YWL+KNSWG +WGD GY+K+ R+ CGI T++SYPL
Sbjct: 296 GTDYWLVKNSWGKSWGDQGYIKMARNRNNNCGIATEASYPL 336
>gi|395535909|ref|XP_003769963.1| PREDICTED: cathepsin S [Sarcophilus harrisii]
Length = 347
Score = 254 bits (649), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 141/353 (39%), Positives = 206/353 (58%), Gaps = 17/353 (4%)
Query: 3 LIFERSGSFKINTIPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDEL 62
+IF+ S S N + M ++I + ++CAS R H+ + E W +G+ Y+++
Sbjct: 1 MIFQDSKSSPANLLRMKVVIWMFLACASTTAYLR--HDPMLDNHWELWKKTYGKQYEEQN 58
Query: 63 EKEMRFKIFKENLEYIEKANKE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHR 119
++ R I+++NL+++ N E G +Y L N SD+T++E +L + ++P+ R
Sbjct: 59 QEVTRRLIWEKNLKFVTLHNLEHSMGLHSYDLSMNHLSDMTSEEVASLMSSLRIPNQWSR 118
Query: 120 STTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGAN 179
+TT Y+ S +P S+DWRDK VT +K Q CG CWAFSAV A+E K+
Sbjct: 119 NTT-----YRLNSNQKLPDSVDWRDKGCVTEVKYQGTCGSCWAFSAVGALEAQLKLKTGK 173
Query: 180 LIQLSEQQLVDCSTN---GNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQ 236
L+ LS Q LVDCSTN N+GC GG M +AF+YII N GI ++ YPY+A G C
Sbjct: 174 LVSLSAQNLVDCSTNEKYENHGCNGGCMTEAFQYIIDNNGIDSDASYPYKAKDGKCQYNP 233
Query: 237 KAAAAKISNYEEVPSGDEQALLKAVSMQ-PVSIGIAAYTTEFKSYKEGI-FNGVCGTQLD 294
AA S Y E+P G E AL +AV+ + PVS+GI A F YK G+ ++ C ++
Sbjct: 234 ANRAATCSRYTELPYGSEDALKEAVANKGPVSVGIDASLPSFFLYKSGVYYDPSCTQNVN 293
Query: 295 HAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRDEG-LCGIGTQSSYP 346
H V + G+G DG +YWL+KNSWG ++GD GY++I R+ G CGI SYP
Sbjct: 294 HGVLVTGYGNL-DGKDYWLVKNSWGLSFGDKGYIRIARNRGNHCGIANFPSYP 345
>gi|125526835|gb|EAY74949.1| hypothetical protein OsI_02845 [Oryza sativa Indica Group]
Length = 360
Score = 254 bits (649), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 137/322 (42%), Positives = 188/322 (58%), Gaps = 18/322 (5%)
Query: 41 QSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEG-NRTYKLGTNRFSDL 99
S+ HE+WMA+ GR+Y D EK R ++F N E ++ AN+ G +RTY LG N+FSDL
Sbjct: 37 HSMAARHERWMARFGRAYADAAEKARRMEVFAANAERVDAANRAGGDRTYTLGLNQFSDL 96
Query: 100 TNDEFRALYTGYKM--PSPSHRS--TTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQ 155
T+DEF + GY P PSHR + TDVP S+DWR + AVT +K+Q+
Sbjct: 97 TDDEFARTHLGYSWAPPPPSHRHGHRAENGTAAAAADDTDVPDSVDWRARGAVTEVKNQR 156
Query: 156 ECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQ 215
CG CWAF+AVAA EG+ +++ NL+ LSEQQ++DC T G N C GG + A YI +
Sbjct: 157 SCGSCWAFAAVAATEGLVQLATGNLVSLSEQQVLDC-TGGANTCSGGDVSAALRYIAASG 215
Query: 216 GIATEDEYPYQAVQGTC-----SAAQKAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGI 270
G+ TE Y Y QG C +A AAA + + + GDE AL + QPV + +
Sbjct: 216 GLQTEAAYAYGGQQGACRAGGFAAPNSAAAVGGARWARL-YGDEGALQALAAGQPVVVVV 274
Query: 271 AAYTTEFKSYKEGIFNG--VCGTQLDHAVTIV-GFGTTEDGANYWLIKNSWGDTWGDAGY 327
A +F+ Y+ G++ G CG +L+HAVT+V + G YWL+KN WG WG+ GY
Sbjct: 275 EASEPDFRHYRSGVYAGSAACGRRLNHAVTVVGYGAAADGGGEYWLVKNQWGTWWGEGGY 334
Query: 328 MKILRD---EGLCGIGTQSSYP 346
M++ R G CGI T + YP
Sbjct: 335 MRVARGGAAGGNCGIATYAFYP 356
>gi|195381187|ref|XP_002049336.1| GJ20806 [Drosophila virilis]
gi|194144133|gb|EDW60529.1| GJ20806 [Drosophila virilis]
Length = 339
Score = 254 bits (649), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 137/341 (40%), Positives = 192/341 (56%), Gaps = 14/341 (4%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEY 77
M I+ LL A V+ ++ + E + + +H ++Y DE E+ R KIF EN
Sbjct: 1 MRILFALLALVA---VAQAVSYADVIKEEWQTFKLEHRKNYVDETEERFRLKIFNENKHK 57
Query: 78 IEKANKE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTF---KYQNL 131
I K N+ G ++K+ N+++D+ + EF G+ + +F + +
Sbjct: 58 IAKHNQRYASGEVSFKMAVNKYADMLHHEFHTTMNGFNYTLHKQLRASDPSFVGVTFISP 117
Query: 132 SMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDC 191
+P S+DWR K AVT +KDQ CG CWAFS+ A+EG LI LSEQ LVDC
Sbjct: 118 EHVKIPKSVDWRSKGAVTEVKDQGHCGSCWAFSSTGALEGQHFRKAGTLISLSEQNLVDC 177
Query: 192 STN-GNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVP 250
ST GNNGC GG M+ AF YI N GI TE YPY+ + +C + A ++P
Sbjct: 178 STKYGNNGCNGGLMDNAFRYIKDNGGIDTEKSYPYEGIDDSCHFNKATIGATDRGSVDIP 237
Query: 251 SGDEQALLKAV-SMQPVSIGIAAYTTEFKSYKEGIFN-GVCGTQ-LDHAVTIVGFGTTED 307
GDE+ + +AV ++ PVS+ I A F+ Y EGI+N C Q LDH V +VG+GT E
Sbjct: 238 QGDEKKMAEAVATIGPVSVAIDASHESFQFYSEGIYNEPQCDPQNLDHGVLVVGYGTDES 297
Query: 308 GANYWLIKNSWGDTWGDAGYMKILRD-EGLCGIGTQSSYPL 347
G +YWL+KNSWG TWGD G++K+ R+ + CGI + SSYPL
Sbjct: 298 GQDYWLVKNSWGTTWGDKGFIKMARNADNQCGIASASSYPL 338
>gi|156739275|ref|NP_001096585.1| cathepsin L1-like precursor [Danio rerio]
gi|156230123|gb|AAI52285.1| MGC174857 protein [Danio rerio]
Length = 335
Score = 254 bits (648), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 140/341 (41%), Positives = 202/341 (59%), Gaps = 19/341 (5%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEY 77
MF +++ L C S V ++ S Q + + W +QHG+SY ++LE R I++ENL
Sbjct: 2 MFALLVTL--CISAVFTAPSIDIQ-LDDHWNSWKSQHGKSYHEDLEVGRRM-IWEENLRK 57
Query: 78 IEKANKE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT 134
IE+ N E GN T+K+G N+F D+TN+EFR GYK + TS + S
Sbjct: 58 IEQHNFEYSYGNHTFKMGMNQFGDMTNEEFRQAMNGYKHDP----NRTSQGPLFMEPSFF 113
Query: 135 DVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCS-T 193
P +DWR + VTP+KDQ++CG CW+FS+ A+EG LI +SEQ LVDCS
Sbjct: 114 AAPQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRP 173
Query: 194 NGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQG-TCSAAQKAAAAKISNYEEVPSG 252
GN GC GG M++AF+Y+ +N+G+ +E YPY A C + AKI+ + ++P G
Sbjct: 174 QGNQGCNGGIMDQAFQYVKENKGLDSEQSYPYLARDDLPCRYDPRFNVAKITGFVDIPRG 233
Query: 253 DEQALLKAV-SMQPVSIGIAAYTTEFKSYKEGI-FNGVCGTQLDHAVTIVGF---GTTED 307
+E AL+ AV ++ PVS+ I A + Y+ GI + C ++LDHAV +VG+ G
Sbjct: 234 NELALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACTSRLDHAVLVVGYGYQGADVA 293
Query: 308 GANYWLIKNSWGDTWGDAGYMKILRDE-GLCGIGTQSSYPL 347
G YW++KNSW D WGD GY+ + +D+ CGI T +SYPL
Sbjct: 294 GNRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGIATMASYPL 334
>gi|47169030|pdb|1S4V|A Chain A, The 2.0 A Crystal Structure Of The Kdel-Tailed Cysteine
Endopeptidase Functioning In Programmed Cell Death Of
Ricinus Communis Endosperm
gi|47169031|pdb|1S4V|B Chain B, The 2.0 A Crystal Structure Of The Kdel-Tailed Cysteine
Endopeptidase Functioning In Programmed Cell Death Of
Ricinus Communis Endosperm
Length = 229
Score = 254 bits (648), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 120/217 (55%), Positives = 151/217 (69%), Gaps = 5/217 (2%)
Query: 136 VPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNG 195
VP S+DWR K AVT +KDQ +CG CWAFS + AVEGI +I L+ LSEQ+LVDC T+
Sbjct: 2 VPASVDWRKKGAVTSVKDQGQCGSCWAFSTIVAVEGINQIKTNKLVSLSEQELVDCDTDQ 61
Query: 196 NNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQK-AAAAKISNYEEVPSGDE 254
N GC GG M+ AFE+I Q GI TE YPY+A GTC +++ A A I +E VP DE
Sbjct: 62 NQGCNGGLMDYAFEFIKQRGGITTEANYPYEAYDGTCDVSKENAPAVSIDGHENVPENDE 121
Query: 255 QALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLI 314
ALLKAV+ QPVS+ I A ++F+ Y EG+F G CGT+LDH V IVG+GTT DG YW +
Sbjct: 122 NALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGSCGTELDHGVAIVGYGTTIDGTKYWTV 181
Query: 315 KNSWGDTWGDAGYMKILR----DEGLCGIGTQSSYPL 347
KNSWG WG+ GY+++ R EGLCGI ++SYP+
Sbjct: 182 KNSWGPEWGEKGYIRMERGISDKEGLCGIAMEASYPI 218
>gi|194719810|emb|CAR31335.1| pro-asclepain f [Gomphocarpus fruticosus subsp. fruticosus]
Length = 340
Score = 254 bits (648), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 140/344 (40%), Positives = 206/344 (59%), Gaps = 22/344 (6%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEY 77
+ I+ LL A +S+ + V+ ++E+W+ +H + Y EK RF+IFK+NL Y
Sbjct: 5 VLILSFLLFVSAITCISTNWRSDDEVIALYEEWLVKHQKLYSSLGEKIKRFEIFKDNLRY 64
Query: 78 IEKAN---KEGNRTYKLGTNRFSDLTNDEFRALYTGYKM-------PSPSHRSTTSSTFK 127
I++ N K + + LG N+F+DLT DEF ++Y G + +P+H K
Sbjct: 65 IDQQNHYNKVNHMNFTLGLNQFADLTLDEFSSIYLGTSVDYEQIISSNPNHDDVEEDILK 124
Query: 128 YQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQ 187
+ ++P S+DWR+K V PI++Q +CG CW FSAVA++E + I ++I LSEQ+
Sbjct: 125 E---DVVELPDSVDWREKGVVFPIRNQGKCGSCWTFSAVASIETLNGIKKGHMIALSEQE 181
Query: 188 LVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYE 247
L+DC T + GC GG AF Y+ +N GI +E++YPY QG C QK KIS Y+
Sbjct: 182 LLDCETI-SQGCKGGHYNNAFAYVAKN-GITSEEKYPYIFRQGQC--YQKEKVVKISGYK 237
Query: 248 EVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTED 307
VP + L AV+ Q VS+ + + +F+ Y GIF+G CG LDHAV IVG+G ++
Sbjct: 238 RVPRNNGGQLQSAVAQQVVSVAVKCESKDFQFYDRGIFSGACGPILDHAVNIVGYG-SKG 296
Query: 308 GANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYPL 347
GANYW+++NSWG WG+ GYM+I ++ EG CGI Q SYP+
Sbjct: 297 GANYWIMRNSWGTNWGENGYMRIQKNSKHYEGHCGIAMQPSYPV 340
>gi|115438530|ref|NP_001043562.1| Os01g0613500 [Oryza sativa Japonica Group]
gi|11034572|dbj|BAB17096.1| cysteine proteinase-like [Oryza sativa Japonica Group]
gi|113533093|dbj|BAF05476.1| Os01g0613500 [Oryza sativa Japonica Group]
gi|215697766|dbj|BAG91959.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 360
Score = 254 bits (648), Expect = 6e-65, Method: Compositional matrix adjust.
Identities = 137/322 (42%), Positives = 188/322 (58%), Gaps = 18/322 (5%)
Query: 41 QSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEG-NRTYKLGTNRFSDL 99
S+ HE+WMA+ GR+Y D EK R ++F N E ++ AN+ G +RTY LG N+FSDL
Sbjct: 37 HSMAARHERWMARFGRAYADAAEKARRMEVFAANAERVDAANRAGGDRTYTLGLNQFSDL 96
Query: 100 TNDEFRALYTGYKM--PSPSHRS--TTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQ 155
T+DEF + GY P PSHR + TDVP S+DWR + AVT +K+Q+
Sbjct: 97 TDDEFAQTHLGYSWAPPPPSHRHGHRAENGTAAAAADDTDVPDSVDWRARGAVTEVKNQR 156
Query: 156 ECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQ 215
CG CWAF+AVAA EG+ +++ NL+ LSEQQ++DC T G N C GG + A YI +
Sbjct: 157 SCGSCWAFAAVAATEGLVQLATGNLVSLSEQQVLDC-TGGANTCSGGDVSAALRYIAASG 215
Query: 216 GIATEDEYPYQAVQGTC-----SAAQKAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGI 270
G+ TE Y Y QG C +A AAA + + + GDE AL + QPV + +
Sbjct: 216 GLQTEAAYAYGGQQGACRAGGFAAPNSAAAVGGARWARL-YGDEGALQALAAGQPVVVVV 274
Query: 271 AAYTTEFKSYKEGIFNG--VCGTQLDHAVTIV-GFGTTEDGANYWLIKNSWGDTWGDAGY 327
A +F+ Y+ G++ G CG +L+HAVT+V + G YWL+KN WG WG+ GY
Sbjct: 275 EASEPDFRHYRSGVYAGSAACGRRLNHAVTVVGYGAAADGGGEYWLVKNQWGTWWGEGGY 334
Query: 328 MKILRD---EGLCGIGTQSSYP 346
M++ R G CGI T + YP
Sbjct: 335 MRVARGGAAGGNCGIATYAFYP 356
>gi|391338876|ref|XP_003743781.1| PREDICTED: cathepsin L-like isoform 4 [Metaseiulus occidentalis]
Length = 336
Score = 254 bits (648), Expect = 6e-65, Method: Compositional matrix adjust.
Identities = 140/338 (41%), Positives = 207/338 (61%), Gaps = 19/338 (5%)
Query: 19 FIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYI 78
F+I+ +LV AS + T EQ + + H + Y+ + R KIF +N I
Sbjct: 8 FLILAVLVGAASAAL----TLEQLFDAEWQNFKVHHNKKYEGSTVEAFRKKIFLQNTHLI 63
Query: 79 EKAN---KEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTF-KYQNLSMT 134
+ N +G TYKL N+F D+ + EF + G S+R+ ST+ + +++S+
Sbjct: 64 ARHNIKHAKGETTYKLKMNQFGDMLHHEFVSTMNGLLR---SNRTYFGSTWIEPESVSL- 119
Query: 135 DVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTN 194
P S+DWR+K AVTP+K+Q CG CW+FS A+EG L+ LSEQ L+DCST+
Sbjct: 120 --PKSVDWREKGAVTPVKNQGHCGSCWSFSTTGALEGQLFRKTGELVSLSEQNLIDCSTS 177
Query: 195 -GNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGD 253
GNNGCGGG M+ AF YI +N GI TE+ YPY+ QG C ++ +A + + + ++PSG+
Sbjct: 178 YGNNGCGGGLMDNAFTYIKENHGIDTEESYPYEGKQGKCRYHKEDSAGRDTGFVDIPSGN 237
Query: 254 EQALLKAV-SMQPVSIGIAAYTTEFKSYKEGIFNGV-CGTQ-LDHAVTIVGFGTTEDGAN 310
E+AL KA+ ++ PVS+ I A F+ Y EG++N C + LDH V VG+GTT+DG +
Sbjct: 238 ERALAKALATIGPVSVAIDASHESFQFYHEGVYNPPDCDSHSLDHGVLAVGYGTTDDGQD 297
Query: 311 YWLIKNSWGDTWGDAGYMKILRD-EGLCGIGTQSSYPL 347
Y++IKNSWG+ WG GY+ + R+ + CG+ TQ+SYPL
Sbjct: 298 YYIIKNSWGERWGQEGYVLMARNSKNECGVATQASYPL 335
>gi|156739289|ref|NP_001096592.1| uncharacterized protein LOC569326 precursor [Danio rerio]
gi|156230119|gb|AAI52283.1| Im:6910535 protein [Danio rerio]
Length = 335
Score = 254 bits (648), Expect = 6e-65, Method: Compositional matrix adjust.
Identities = 140/341 (41%), Positives = 202/341 (59%), Gaps = 19/341 (5%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEY 77
MF ++I L C S V ++ S Q + + W +QHG+SY +++E R I++ENL
Sbjct: 2 MFALLITL--CISAVFTAPSIDIQ-LDDHWNSWKSQHGKSYHEDVEVGRRM-IWEENLRK 57
Query: 78 IEKANKE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT 134
IE+ N E GN T+K+G N+F D+TN+EFR GYK + TS + S
Sbjct: 58 IEQHNFEYSLGNHTFKMGMNQFGDMTNEEFRQAMNGYKQDP----NRTSKGALFMEPSFF 113
Query: 135 DVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCS-T 193
P +DWR + VTP+KDQ++CG CW+FS+ A+EG LI +SEQ LVDCS
Sbjct: 114 AAPQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRP 173
Query: 194 NGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQG-TCSAAQKAAAAKISNYEEVPSG 252
GN GC GG M++AF+Y+ +N+G+ +E YPY A C + AKI+ + ++P G
Sbjct: 174 QGNQGCNGGIMDQAFQYVKENKGLDSEQSYPYLARDDLPCRYDPRFNVAKITGFVDIPRG 233
Query: 253 DEQALLKAV-SMQPVSIGIAAYTTEFKSYKEGI-FNGVCGTQLDHAVTIVGF---GTTED 307
+E AL+ AV ++ PVS+ I A + Y+ GI + C ++LDHAV +VG+ G
Sbjct: 234 NELALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACTSRLDHAVLVVGYGYQGADVA 293
Query: 308 GANYWLIKNSWGDTWGDAGYMKILRDE-GLCGIGTQSSYPL 347
G YW++KNSW D WGD GY+ + +D+ CGI T +SYPL
Sbjct: 294 GNRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGIATMASYPL 334
>gi|391338870|ref|XP_003743778.1| PREDICTED: cathepsin L-like isoform 1 [Metaseiulus occidentalis]
gi|391338872|ref|XP_003743779.1| PREDICTED: cathepsin L-like isoform 2 [Metaseiulus occidentalis]
gi|391338874|ref|XP_003743780.1| PREDICTED: cathepsin L-like isoform 3 [Metaseiulus occidentalis]
Length = 331
Score = 254 bits (648), Expect = 6e-65, Method: Compositional matrix adjust.
Identities = 140/338 (41%), Positives = 207/338 (61%), Gaps = 19/338 (5%)
Query: 19 FIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYI 78
F+I+ +LV AS + T EQ + + H + Y+ + R KIF +N I
Sbjct: 3 FLILAVLVGAASAAL----TLEQLFDAEWQNFKVHHNKKYEGSTVEAFRKKIFLQNTHLI 58
Query: 79 EKAN---KEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTF-KYQNLSMT 134
+ N +G TYKL N+F D+ + EF + G S+R+ ST+ + +++S+
Sbjct: 59 ARHNIKHAKGETTYKLKMNQFGDMLHHEFVSTMNGLLR---SNRTYFGSTWIEPESVSL- 114
Query: 135 DVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTN 194
P S+DWR+K AVTP+K+Q CG CW+FS A+EG L+ LSEQ L+DCST+
Sbjct: 115 --PKSVDWREKGAVTPVKNQGHCGSCWSFSTTGALEGQLFRKTGELVSLSEQNLIDCSTS 172
Query: 195 -GNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGD 253
GNNGCGGG M+ AF YI +N GI TE+ YPY+ QG C ++ +A + + + ++PSG+
Sbjct: 173 YGNNGCGGGLMDNAFTYIKENHGIDTEESYPYEGKQGKCRYHKEDSAGRDTGFVDIPSGN 232
Query: 254 EQALLKAV-SMQPVSIGIAAYTTEFKSYKEGIFNGV-CGTQ-LDHAVTIVGFGTTEDGAN 310
E+AL KA+ ++ PVS+ I A F+ Y EG++N C + LDH V VG+GTT+DG +
Sbjct: 233 ERALAKALATIGPVSVAIDASHESFQFYHEGVYNPPDCDSHSLDHGVLAVGYGTTDDGQD 292
Query: 311 YWLIKNSWGDTWGDAGYMKILRD-EGLCGIGTQSSYPL 347
Y++IKNSWG+ WG GY+ + R+ + CG+ TQ+SYPL
Sbjct: 293 YYIIKNSWGERWGQEGYVLMARNSKNECGVATQASYPL 330
>gi|449683741|ref|XP_002155462.2| PREDICTED: cathepsin L-like [Hydra magnipapillata]
Length = 324
Score = 253 bits (647), Expect = 6e-65, Method: Compositional matrix adjust.
Identities = 137/330 (41%), Positives = 199/330 (60%), Gaps = 18/330 (5%)
Query: 23 ILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKAN 82
+LL+ + R T + S + +W H ++Y + E+ +R+ I+K+N I + N
Sbjct: 7 LLLLGVTLAYIIERPTEDDSWI----RWKMAHNKAYSHDGEETVRYTIWKDNERRIREHN 62
Query: 83 KEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDW 142
+G + L N+F D+TN+EF+ + GY SH+ + STF N + P S+DW
Sbjct: 63 LQGG-DFLLEMNQFGDMTNNEFKD-FNGYL----SHKHVSGSTFLTPNSFV--APDSVDW 114
Query: 143 RDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTN-GNNGCGG 201
R++ VTP+KDQ +CG CWAFS ++EG L+ LSEQ LVDCST GNNGC G
Sbjct: 115 RNEGYVTPVKDQGQCGSCWAFSTTGSLEGQNFKKTGKLVSLSEQNLVDCSTAYGNNGCNG 174
Query: 202 GTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAV 261
G M+ AF YI +N GI +E YPY A G C+ + AA + + ++PSGDE L +AV
Sbjct: 175 GLMDNAFTYIKENNGIDSEASYPYTAKDGKCAFTKPNVAATDTGFVDIPSGDENKLKEAV 234
Query: 262 -SMQPVSIGIAAYTTEFKSYKEGIFN--GVCGTQLDHAVTIVGFGTTEDGANYWLIKNSW 318
S+ P+S+ I A F+ Y++G++N T+LDH V +VG+G TE G +YWL+KNSW
Sbjct: 235 ASVGPISVAIDASHFSFQFYRKGVYNERKCSSTELDHGVLVVGYG-TESGKDYWLVKNSW 293
Query: 319 GDTWGDAGYMKILRD-EGLCGIGTQSSYPL 347
+WGD GY+K+ R+ + CGI T +SYPL
Sbjct: 294 NTSWGDKGYIKMSRNAKNQCGIATNASYPL 323
>gi|33112583|gb|AAP94047.1| cathepsin-L-like cysteine peptidase 03 [Tenebrio molitor]
Length = 337
Score = 253 bits (647), Expect = 7e-65, Method: Compositional matrix adjust.
Identities = 138/341 (40%), Positives = 193/341 (56%), Gaps = 19/341 (5%)
Query: 19 FIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMA---QHGRSYKDELEKEMRFKIFKENL 75
F+I + + SQ VS + E+W A H + Y+ + E+ R KIF EN
Sbjct: 3 FLIFLAICVAGSQAVSFFDL-------VQEQWGAFKMTHNKQYQSDTEERFRMKIFMENS 55
Query: 76 EYIEKANK---EGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSS-TFKYQNL 131
+ K NK +G ++KLG N+++D+ + EF + G+ RS S + +
Sbjct: 56 HTVAKHNKLYAQGLVSFKLGINKYADMLHHEFVQVLNGFNRTKSGLRSGESDDSVTFLPP 115
Query: 132 SMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDC 191
+ +P +DWRDK AVTP+KDQ +CG CW+FSA ++EG L+ LSEQ LVDC
Sbjct: 116 ANVQLPGQIDWRDKGAVTPVKDQGQCGSCWSFSATGSLEGQHFRKSGKLVSLSEQNLVDC 175
Query: 192 STN-GNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVP 250
S GNNGC GG M+ AF YI N GI TE YPY+A C K A Y ++
Sbjct: 176 SEKFGNNGCNGGLMDNAFRYIKANGGIDTEQAYPYKAEDEKCHYKPKNKGATDRGYVDIE 235
Query: 251 SGDEQALLKAV-SMQPVSIGIAAYTTEFKSYKEGIF--NGVCGTQLDHAVTIVGFGTTED 307
SG+E L AV ++ PVS+ I A F+ Y G++ +QLDH V +VG+GT +D
Sbjct: 236 SGNEDKLQSAVATVGPVSVAIDASHQSFQLYSGGVYYEPDCSASQLDHGVLVVGYGTEDD 295
Query: 308 GANYWLIKNSWGDTWGDAGYMKILRD-EGLCGIGTQSSYPL 347
G +YWL+KNSWG +WGD GY+K+ R+ + CGI T++SYPL
Sbjct: 296 GTDYWLVKNSWGKSWGDQGYIKMARNRDNNCGIATEASYPL 336
>gi|156398078|ref|XP_001638016.1| predicted protein [Nematostella vectensis]
gi|156225133|gb|EDO45953.1| predicted protein [Nematostella vectensis]
Length = 326
Score = 253 bits (647), Expect = 8e-65, Method: Compositional matrix adjust.
Identities = 137/337 (40%), Positives = 202/337 (59%), Gaps = 17/337 (5%)
Query: 16 IPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENL 75
+ +F+++ +LV+ +S+ S R + V W + HG+SY D E+ R I+++NL
Sbjct: 1 MKVFLVLCVLVA-SSRGWSVRFGQDSEWV----AWKSYHGKSYSDVHEERTRMAIWQQNL 55
Query: 76 EYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD 135
E I++ N E + +YK+ N DLT DEFR Y G + H ST Y S
Sbjct: 56 EKIKRHNAE-DHSYKMAMNHLGDLTEDEFRYFYLGVR---AHHNSTKRGWATYMPPSNVK 111
Query: 136 VPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCS-TN 194
+P+S+DW K VT +K+Q +CG CWAFS +VEG +L+ LSEQ L+DCS +
Sbjct: 112 IPSSVDWSQKGYVTGVKNQGQCGSCWAFSTTGSVEGQHFRKTGSLVSLSEQNLIDCSGSY 171
Query: 195 GNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDE 254
GNNGC GG M+ AF YI N GI TE YPY QG+C + A+++ Y+++P G E
Sbjct: 172 GNNGCQGGLMDNAFRYIESNGGIDTESSYPYLGQQGSCHFSSSHVGARVTGYQDIPQGSE 231
Query: 255 QALLKAV-SMQPVSIGIAAYTTEFKSYKEGIF-NGVC-GTQLDHAVTIVGFGTTEDGANY 311
QAL AV ++ PVS+ + A ++++ Y G++ N C TQLDH V ++G+G +G +Y
Sbjct: 232 QALQSAVATVGPVSVAVDA--SQWQFYSSGVYDNPYCSSTQLDHGVLVIGYGNY-NGQDY 288
Query: 312 WLIKNSWGDTWGDAGYMKILRDE-GLCGIGTQSSYPL 347
WL+KNSWG +WG GY+ + R++ CGI + +SYPL
Sbjct: 289 WLVKNSWGYSWGVEGYIMMSRNKNNQCGIASSASYPL 325
>gi|357114837|ref|XP_003559200.1| PREDICTED: fruit bromelain-like [Brachypodium distachyon]
Length = 371
Score = 253 bits (646), Expect = 8e-65, Method: Compositional matrix adjust.
Identities = 140/358 (39%), Positives = 200/358 (55%), Gaps = 43/358 (12%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQSVVEMHE--------KWMAQHGRSYKDELEKEMRFK 69
+F+ + L A +++ + H VVE+ + +W A H R+Y D E+ RF+
Sbjct: 27 LFVFLTALPPAA--IMTPAAGH---VVELDDMLMLDRFVRWQAAHNRTYGDAEERLRRFQ 81
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQ 129
+++ N+EYIE N+ G TY+LG N+F+DLT++EF ++Y S
Sbjct: 82 VYRANIEYIEATNRRGGLTYELGENQFADLTSEEFLSMYA-------SSYDAGDRADDEA 134
Query: 130 NLSMTDV---------------PTSLDWRDKKAVTPIKDQ-QECGCCWAFSAVAAVEGIT 173
L TDV P S DWR K AVTP K+Q C CWAF VA +EG+T
Sbjct: 135 ALITTDVAGDGAWSDGDLEALPPPSWDWRAKGAVTPPKNQGPTCSSCWAFVTVATIEGLT 194
Query: 174 KISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCS 233
I LI LSEQQLVDC + GC G+ + F ++++N G+ TE EYPY A +G C+
Sbjct: 195 FIKTGKLISLSEQQLVDCDMY-DGGCNTGSYSRGFRWVLENGGLTTEAEYPYTAARGPCN 253
Query: 234 AAQKA-AAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQ 292
A+ A AAKI+ +P +E + KAV+ QPV + I + + YK G+++G CGT
Sbjct: 254 RAKSAHHAAKITGQGRIPPQNELVMQKAVAGQPVGVAIEV-GSGMQFYKTGVYSGPCGTN 312
Query: 293 LDHAVTIVGFGT-TEDGANYWLIKNSWGDTWGDAGYMKILRD---EGLCGIGTQSSYP 346
L HAVT+VG+G GA YW++KNSWG WG+ G++++ RD GLCGI +YP
Sbjct: 313 LAHAVTVVGYGVDPASGAKYWIVKNSWGQAWGERGFIRMRRDVGGPGLCGIALDVAYP 370
>gi|326672302|ref|XP_003199633.1| PREDICTED: cathepsin L1-like [Danio rerio]
gi|157423549|gb|AAI53506.1| Im:6910535 [Danio rerio]
Length = 335
Score = 253 bits (646), Expect = 9e-65, Method: Compositional matrix adjust.
Identities = 139/341 (40%), Positives = 202/341 (59%), Gaps = 19/341 (5%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEY 77
MF +++ L C S V ++ S Q + + W +QHG+SY +++E R I++ENL
Sbjct: 2 MFALLVTL--CISAVFTAPSIDIQ-LDDHWNSWKSQHGKSYHEDVEVGRRM-IWEENLRK 57
Query: 78 IEKANKE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT 134
IE+ N E GN T+K+G N+F D+TN+EFR GYK + TS + S
Sbjct: 58 IEQHNFEYSYGNHTFKMGMNQFGDMTNEEFRQAMNGYKQDP----NRTSKGALFMEPSFF 113
Query: 135 DVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCS-T 193
P +DWR + VTP+KDQ++CG CW+FS+ A+EG LI +SEQ LVDCS
Sbjct: 114 AAPQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRP 173
Query: 194 NGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQG-TCSAAQKAAAAKISNYEEVPSG 252
GN GC GG M++AF+Y+ +N+G+ +E YPY A C + AKI+ + ++P G
Sbjct: 174 QGNQGCNGGIMDQAFQYVKENKGLDSEQSYPYLARDDLPCRYDPRFNVAKITGFVDIPKG 233
Query: 253 DEQALLKAV-SMQPVSIGIAAYTTEFKSYKEGI-FNGVCGTQLDHAVTIVGF---GTTED 307
+E AL+ AV ++ PVS+ I A + Y+ GI + C ++LDHAV +VG+ G
Sbjct: 234 NELALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACTSRLDHAVLVVGYGYQGADVA 293
Query: 308 GANYWLIKNSWGDTWGDAGYMKILRDE-GLCGIGTQSSYPL 347
G YW++KNSW D WGD GY+ + +D+ CGI T +SYPL
Sbjct: 294 GNRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGIATMASYPL 334
>gi|33348834|gb|AAQ16117.1| cathepsin L-like cysteine proteinase A [Rhipicephalus
haemaphysaloides haemaphysaloides]
Length = 332
Score = 253 bits (646), Expect = 9e-65, Method: Compositional matrix adjust.
Identities = 137/336 (40%), Positives = 203/336 (60%), Gaps = 14/336 (4%)
Query: 21 IIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEK 80
++ L + CA V+ + + + E + H +SY+ +E+ +RFKIF EN I K
Sbjct: 1 MLRLSLLCAIVAVTVAANSHEILRTQWEAFKTTHKKSYESHMEELLRFKIFTENSLIIAK 60
Query: 81 ANKE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTF-KYQNLSMTDV 136
N + G +YKLG N+F DL EF ++ GY+ R++ STF N++ + +
Sbjct: 61 HNAKYAKGLVSYKLGMNQFGDLLAHEFAKIFNGYR----GQRTSRGSTFMPPANVNDSSL 116
Query: 137 PTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTN-G 195
P+++DWR K AVTP+KDQ +CG CWAFSA ++EG + L+ LSEQ LVDCS + G
Sbjct: 117 PSTVDWRKKGAVTPVKDQGQCGSCWAFSATGSLEGQHFLKDGELVSLSEQNLVDCSQSFG 176
Query: 196 NNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQ 255
NNGC GG M+ AF+YI N GI E+ YPY+A+ C ++ A + + ++ G E
Sbjct: 177 NNGCEGGLMDNAFKYIKANDGIDAEESYPYEAMDDKCRFKKEDVGATDTGFVDIEGGSED 236
Query: 256 ALLKAV-SMQPVSIGIAAYTTEFKSYKEGIFNGV-CGT-QLDHAVTIVGFGTTEDGANYW 312
L KAV ++ P+S+ I A + F+ Y EG+++ C + +LDH V VG+G +DG YW
Sbjct: 237 DLKKAVATVGPISVAIDAGHSSFQLYSEGVYDEPECSSEELDHGVLAVGYG-VKDGKKYW 295
Query: 313 LIKNSWGDTWGDAGYMKILRDE-GLCGIGTQSSYPL 347
L+KNSWG +WGD GY+ + RD+ CGI + +SYPL
Sbjct: 296 LVKNSWGGSWGDNGYILMSRDKNNQCGIASAASYPL 331
>gi|156739281|ref|NP_001096588.1| cathepsin L1-like precursor [Danio rerio]
gi|166158351|ref|NP_001107526.1| uncharacterized protein LOC100135391 precursor [Xenopus (Silurana)
tropicalis]
gi|326672305|ref|XP_003199634.1| PREDICTED: cathepsin L1-like [Danio rerio]
gi|156230096|gb|AAI52237.1| MGC174155 protein [Danio rerio]
gi|163916362|gb|AAI57707.1| LOC100135391 protein [Xenopus (Silurana) tropicalis]
Length = 335
Score = 253 bits (645), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 139/341 (40%), Positives = 202/341 (59%), Gaps = 19/341 (5%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEY 77
MF +++ L C S V ++ S Q + + W +QHG+SY +++E R I++ENL
Sbjct: 2 MFALLVTL--CISAVFAASSIDIQ-LDDHWNSWKSQHGKSYHEDVEVGRRM-IWEENLRK 57
Query: 78 IEKANKE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT 134
IE+ N E GN T+K+G N+F D+TN+EFR GYK + TS + S
Sbjct: 58 IEQHNFEYSYGNHTFKMGMNQFGDMTNEEFRQAMNGYKHDP----NRTSQGPLFMEPSFF 113
Query: 135 DVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCS-T 193
P +DWR + VTP+KDQ++CG CW+FS+ A+EG LI +SEQ LVDCS
Sbjct: 114 AAPQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRP 173
Query: 194 NGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQG-TCSAAQKAAAAKISNYEEVPSG 252
GN GC GG M++AF+Y+ +N+G+ +E YPY A C + AKI+ + ++P G
Sbjct: 174 QGNQGCNGGIMDQAFQYVKENKGLDSEQSYPYLARDDLPCRYDPRFNVAKITGFVDIPRG 233
Query: 253 DEQALLKAV-SMQPVSIGIAAYTTEFKSYKEGI-FNGVCGTQLDHAVTIVGF---GTTED 307
+E AL+ AV ++ PVS+ I A + Y+ GI + C ++LDHAV +VG+ G
Sbjct: 234 NELALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACTSRLDHAVLVVGYGYQGADVA 293
Query: 308 GANYWLIKNSWGDTWGDAGYMKILRDE-GLCGIGTQSSYPL 347
G YW++KNSW D WGD GY+ + +D+ CGI T +SYPL
Sbjct: 294 GNRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGIATMASYPL 334
>gi|224116880|ref|XP_002317417.1| predicted protein [Populus trichocarpa]
gi|118488173|gb|ABK95906.1| unknown [Populus trichocarpa]
gi|222860482|gb|EEE98029.1| predicted protein [Populus trichocarpa]
Length = 498
Score = 253 bits (645), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 133/317 (41%), Positives = 189/317 (59%), Gaps = 16/317 (5%)
Query: 40 EQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKAN--KEGNRTYKLGTNRFS 97
E+ + E+ + W +H + YK E E R FK NL+YI + N ++ +K+G N+F+
Sbjct: 43 EEGITEVFKLWKEKHQKVYKHAEEAERRIGNFKRNLKYIIEKNGKRKSGLEHKVGLNKFA 102
Query: 98 DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQEC 157
DL+N+EFR +Y K+ P T K+++L D P+SLDWR+K VT +KDQ +C
Sbjct: 103 DLSNEEFREMYLS-KVKKPI---TIEEKRKHRHLQTCDAPSSLDWRNKGVVTAVKDQGDC 158
Query: 158 GCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGI 217
G CW+FS A+E I I +LI LSEQ+LVDC T N GC GG M+ AF+++I N GI
Sbjct: 159 GSCWSFSTTGAIEAINAIVTGDLISLSEQELVDCDTTNNYGCEGGDMDSAFQWVIGNGGI 218
Query: 218 ATEDEYPYQAVQGTC-SAAQKAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTE 276
TE +YPY V GTC +A ++ I Y +V D ALL A QP+S+G+ +
Sbjct: 219 DTEADYPYTGVDGTCNTAKEEKKVVSIEGYVDVDPSD-SALLCATVQQPISVGMDGSALD 277
Query: 277 FKSYKEGIFNGVCG---TQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD 333
F+ Y GI++G C +DHA+ IVG+G +E+ +YW++KNSWG WG GY I R+
Sbjct: 278 FQLYTGGIYDGDCSGDPNDIDHAILIVGYG-SENDEDYWIVKNSWGTEWGMEGYFYIRRN 336
Query: 334 E----GLCGIGTQSSYP 346
G+C I +SYP
Sbjct: 337 TSKPYGVCAINADASYP 353
>gi|22661|emb|CAA49504.1| papaya proteinase omega [Carica papaya]
Length = 367
Score = 253 bits (645), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 137/315 (43%), Positives = 192/315 (60%), Gaps = 12/315 (3%)
Query: 38 THEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFS 97
T + ++++ WM H + Y++ EK RF+IFK+NL YI++ NK+ N +Y+LG N F+
Sbjct: 39 TSTERLIQLFNSWMLNHNKFYENVDEKLYRFEIFKDNLNYIDETNKK-NNSYRLGLNEFA 97
Query: 98 DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQEC 157
DL+NDEF Y G + + +S ++ N + ++P ++DWR K AVTP++ Q C
Sbjct: 98 DLSNDEFNEKYVGSLIDATIEQSYDE---EFINEDIVNLPENVDWRKKGAVTPVRHQGSC 154
Query: 158 GCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGI 217
G CWAFSAVA VEGI KI L++LSEQ+LVDC ++GC GG A EY+ +N GI
Sbjct: 155 GSCWAFSAVATVEGINKIRTGKLVELSEQELVDCERR-SHGCKGGYPPYALEYVAKN-GI 212
Query: 218 ATEDEYPYQAVQGTCSAAQKAAA-AKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTE 276
+YPY+A QGTC A Q K S V +E LL A++ QPVS+ + +
Sbjct: 213 HLRSKYPYKAKQGTCRAKQVGGPIVKTSGVGRVQPNNEGNLLNAIAKQPVSVVVESKGRP 272
Query: 277 FKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILR---- 332
F+ YK GIF G CGT++DHAVT VG+G + LIKNSWG WG+ GY++I R
Sbjct: 273 FQLYKGGIFEGPCGTKVDHAVTAVGYGKSGGKGYI-LIKNSWGTAWGEKGYIRIKRAPGN 331
Query: 333 DEGLCGIGTQSSYPL 347
G+CG+ S YP+
Sbjct: 332 SPGVCGLYKSSYYPI 346
>gi|151573016|gb|ABS17683.1| cathepsin L-1 [Artemia persimilis]
Length = 334
Score = 253 bits (645), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 134/335 (40%), Positives = 202/335 (60%), Gaps = 11/335 (3%)
Query: 21 IIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEK 80
+I LL + Q+ ++ S E H + A H + Y +LE++ R KI+ EN + K
Sbjct: 2 LIFLLGAVFVQLSAALSLTNLLADEWH-LFKATHKKEYPSQLEEKFRMKIYLENKHKVAK 60
Query: 81 AN---KEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
N ++G ++Y++ N+F DL + EFR++ GY+ + S STF + + +VP
Sbjct: 61 HNILFEKGEKSYQVAMNKFGDLLHHEFRSIMNGYQHKK-QNSSRAESTFTFMEPANVEVP 119
Query: 138 TSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTN-GN 196
S+DWR+K A+TP+KDQ +CG CWAFS+ A+EG T L+ L EQ L+DCS GN
Sbjct: 120 ESVDWREKGAITPVKDQGQCGPCWAFSSTGALEGQTFRKTGKLVSLREQNLIDCSGKYGN 179
Query: 197 NGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQA 256
GC GG M++AF+YI N+GI TE+ YPY+A C + A + ++PSG+E
Sbjct: 180 EGCNGGLMDQAFQYIKDNKGIDTENTYPYEAEDDVCRYNPRNRGAVDRGFVDIPSGEEDK 239
Query: 257 LLKAV-SMQPVSIGIAAYTTEFKSYKEGI-FNGVCGT-QLDHAVTIVGFGTTEDGANYWL 313
L AV ++ PVS+ I A F+ Y +G+ + C + LDH V +VG+G +++G +YWL
Sbjct: 240 LKAAVATVGPVSVAIDASHESFQFYSKGVYYEPSCDSDDLDHGVLVVGYG-SDNGKDYWL 298
Query: 314 IKNSWGDTWGDAGYMKILRD-EGLCGIGTQSSYPL 347
+KNSW + WGD GY+KI R+ + CG+ T +SYPL
Sbjct: 299 VKNSWSEHWGDQGYIKIARNRKNHCGVATAASYPL 333
>gi|242072388|ref|XP_002446130.1| hypothetical protein SORBIDRAFT_06g002130 [Sorghum bicolor]
gi|241937313|gb|EES10458.1| hypothetical protein SORBIDRAFT_06g002130 [Sorghum bicolor]
Length = 276
Score = 253 bits (645), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 125/280 (44%), Positives = 176/280 (62%), Gaps = 29/280 (10%)
Query: 72 KENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNL 131
++N+ ++E N N + LG N+F+DLT +EF+A G+K S TT FKY+NL
Sbjct: 19 RDNVAFVESFNANKNNKFWLGVNQFADLTTEEFKA-NKGFKPTSAEKVPTTG--FKYENL 75
Query: 132 SMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDC 191
S++ +PT++DWR K AVTPIK+Q +CGCCWAFSAVAA+EGI K+S NLI LS+Q+LVDC
Sbjct: 76 SVSALPTAVDWRTKGAVTPIKNQGQCGCCWAFSAVAAMEGIVKLSTGNLISLSKQELVDC 135
Query: 192 STNG-NNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVP 250
T+ + GC E + PY+AV G C K+AA I +E+VP
Sbjct: 136 DTHSMDEGC--------------------EVQLPYKAVDGKCKGGSKSAAT-IKGHEDVP 174
Query: 251 SGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGAN 310
+E AL+KAV+ QPVS+ + A F Y G+ G CGT+LDH + +G+G DG
Sbjct: 175 VNNEAALMKAVANQPVSVAVDASDRTFMLYSGGVMTGSCGTELDHGIAAIGYGMESDGTK 234
Query: 311 YWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYP 346
YW++KNSWG TWG+ G++++ +D G+CG+ + SYP
Sbjct: 235 YWILKNSWGTTWGEKGFLRMEKDITDKRGMCGLAMKPSYP 274
>gi|33112581|gb|AAP94046.1| cathepsin-L-like cysteine peptidase 02 [Tenebrio molitor]
Length = 337
Score = 252 bits (644), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 139/341 (40%), Positives = 194/341 (56%), Gaps = 19/341 (5%)
Query: 19 FIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMA---QHGRSYKDELEKEMRFKIFKENL 75
F+I + + SQ VS + E+W A H + Y+ + E+ R KIF EN
Sbjct: 3 FLIFLAICVAGSQAVSFFDL-------VQEQWGAFKMTHNKQYQSDTEERFRMKIFMENS 55
Query: 76 EYIEKANK---EGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSS-TFKYQNL 131
+ K NK +G ++KLG N+++D+ + EF + G+ RS S + +
Sbjct: 56 HTVAKHNKLYAQGLVSFKLGINKYADMLHHEFVQVLNGFNRTKSGLRSGESDDSVTFLPP 115
Query: 132 SMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDC 191
+ +P +DWRDK AVTP+KDQ +CG CW+FSA ++EG L+ LSEQ LVDC
Sbjct: 116 ANVQLPGQIDWRDKGAVTPVKDQGQCGSCWSFSATGSLEGQHFRKSGKLVSLSEQNLVDC 175
Query: 192 STN-GNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVP 250
S GNNGC GG M+ AF YI N GI TE YPY+A C K A Y ++
Sbjct: 176 SEKFGNNGCNGGLMDNAFRYIKANGGIDTEQAYPYKAEDEKCHYKPKNKGATDRGYVDIE 235
Query: 251 SGDEQALLKAV-SMQPVSIGIAAYTTEFKSYKEGI-FNGVCG-TQLDHAVTIVGFGTTED 307
SG+E L AV ++ PVS+ I A F+ Y G+ + C +QLDH V +VG+GT +D
Sbjct: 236 SGNEDKLQSAVATVGPVSVAIDASHQSFQLYSGGVYYEPECSPSQLDHGVLVVGYGTEDD 295
Query: 308 GANYWLIKNSWGDTWGDAGYMKILRD-EGLCGIGTQSSYPL 347
G +YWL+KNSWG +WGD GY+K+ R+ + CGI T++SYPL
Sbjct: 296 GTDYWLVKNSWGKSWGDQGYIKMARNRDNNCGIATEASYPL 336
>gi|6650705|gb|AAF21977.1|AF115280_1 thiolproteinase SmTP1 [Sarcocystis muris]
Length = 394
Score = 252 bits (643), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 130/305 (42%), Positives = 180/305 (59%), Gaps = 16/305 (5%)
Query: 54 HGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKM 113
H + Y E E+ R+ IFK NL YI N +G +Y L N+F DLT +EFR Y GYK
Sbjct: 96 HNKFYATEEERLKRYAIFKNNLTYIHNHNMQG-YSYVLKMNKFGDLTLEEFRQRYLGYKK 154
Query: 114 PS----PSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAV 169
P P TT +++ D+PT +DWR + VT +KDQ +CG CWAFSA A+
Sbjct: 155 PDLRTPPREVDTT-----LESVEDNDIPTHVDWRQRGCVTSVKDQGDCGSCWAFSATGAM 209
Query: 170 EGITKISGANLIQLSEQQLVDCSTN-GNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAV 228
EG+ L+ LS+QQLVDCS GN GC GG ME+AFEY+++N GI + + YPY
Sbjct: 210 EGVYCAKTGKLVNLSQQQLVDCSRFLGNQGCDGGRMEEAFEYVVENGGICSGENYPYMRK 269
Query: 229 QGTCSAAQKAAAAKISNYEEVPSGDEQALLKAVSMQ-PVSIGIAAYTTEFKSYKEGIFNG 287
G C ++Q + A I+ Y VP E+++ A++++ PVS+ I A F+ Y +GIF+
Sbjct: 270 DGVCKSSQCTSVATITGYRSVPRRSEKSMKTALALRSPVSVAIQANQAAFQFYYDGIFDA 329
Query: 288 VCGTQLDHAVTIVGFGTTEDG-ANYWLIKNSWGDTWGDAGYMKILRDE---GLCGIGTQS 343
CGT LDH V +VG+ G +YW++KNSWG WG GYM + + G CG+
Sbjct: 330 PCGTNLDHGVLLVGYSAETAGQGDYWIMKNSWGAAWGKGGYMLMAMHKGPAGQCGVLLDG 389
Query: 344 SYPLA 348
S+P+A
Sbjct: 390 SFPVA 394
>gi|222636309|gb|EEE66441.1| hypothetical protein OsJ_22818 [Oryza sativa Japonica Group]
Length = 318
Score = 252 bits (643), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 136/343 (39%), Positives = 192/343 (55%), Gaps = 46/343 (13%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEY 77
+ ++ L + A ++SR+ ++ H+KWMA+HGR+YKD EK RF++FK N++
Sbjct: 6 LLVVAGGLSTMAKVTMASRAGTMEA---RHDKWMAEHGRTYKDAAEKARRFRVFKANVDL 62
Query: 78 IEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD-- 135
I+++N GN+ Y+L TNRF+DLT+ EF A+YTGY + + + ++T LS D
Sbjct: 63 IDRSNAAGNKRYRLATNRFTDLTDAEFAAMYTGYNPANTMYAAANATT----RLSSEDDQ 118
Query: 136 VPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNG 195
P +DWR + AVT +K+Q+ CGCCWAFS VAAVEGI +I+ L+ L+
Sbjct: 119 QPAEVDWRQQGAVTGVKNQRSCGCCWAFSTVAAVEGIHQITTGELVSLTWPTAAASP--- 175
Query: 196 NNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTC----SAAQKAAAAKISNYEEVPS 251
Y YQ QG C S++ AA IS Y+ V
Sbjct: 176 -----------------------PRRAYAYQGAQGACQFDASSSASGVAATISGYQRVNP 212
Query: 252 GDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNG-VCGTQLDHAVTIVGFGTTEDGA- 309
DE +L AV+ QPVS+ I F+ Y G+F CGT+LDHAV +VG+G DG+
Sbjct: 213 NDEGSLAAAVASQPVSVAIEGSGAMFRHYGSGVFTADSCGTKLDHAVAVVGYGAEADGSG 272
Query: 310 --NYWLIKNSWGDTWGDAGYMKILRD---EGLCGIGTQSSYPL 347
YW+IKNSWG TWGD GYMK+ +D +G CG+ SYP+
Sbjct: 273 GGGYWIIKNSWGTTWGDGGYMKLEKDVGSQGACGVAMAPSYPV 315
>gi|226509942|ref|NP_001146834.1| cysteine protease precursor [Zea mays]
gi|159506725|gb|ABW97700.1| cysteine protease [Zea mays]
gi|414867308|tpg|DAA45865.1| TPA: cysteine protease [Zea mays]
Length = 352
Score = 252 bits (643), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 136/357 (38%), Positives = 202/357 (56%), Gaps = 21/357 (5%)
Query: 4 IFERSGSFKINTIPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEK---WMAQHGRSYKD 60
+F + S I + +++++ AS M ++ W A + RSY
Sbjct: 3 LFRAAASGGFALILLACCSLIMLAAASGGGGVDDDGVGGDRLMMDRFLSWQATYNRSYPT 62
Query: 61 ELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMP----SP 116
E++ RF++++ N+E+IE N+ GN TY LG N+F+DLT +EF LYT MP +
Sbjct: 63 AEERQRRFQVYRRNIEHIEATNRAGNLTYTLGENQFADLTEEEFLDLYTMKGMPVRRDAG 122
Query: 117 SHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQ-QECGCCWAFSAVAAVEGITKI 175
R+ SS+ + D PTS+DWR K AVTPIK+Q C CWAF A +E ITKI
Sbjct: 123 KKRANVSSS-----AAAVDAPTSVDWRSKGAVTPIKNQGPSCSSCWAFVTAATIESITKI 177
Query: 176 SGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAA 235
+ L+ LSEQ+L+DC + GC G + ++IQN G+ TE YPYQA + CS +
Sbjct: 178 TTGKLVSLSEQELIDCDPY-DGGCNLGYFVNGYRWVIQNGGLTTEANYPYQARRYACSRS 236
Query: 236 QKAA-AAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLD 294
+ A AA IS+Y ++P+G+ Q L+ Q + Y G+F+G CGT+++
Sbjct: 237 RAAQHAATISDYVQLPAGEGQ--LQQAVAQQPVAAAIEMGGSLQFYSGGVFSGQCGTRMN 294
Query: 295 HAVTIVGFGT-TEDGANYWLIKNSWGDTWGDAGYMKILRD---EGLCGIGTQSSYPL 347
HA+T+VG+G + G YWL+KNSWG +WG+ GY+++ RD GLCGI +YP+
Sbjct: 295 HAITVVGYGADSSSGLKYWLVKNSWGQSWGERGYLRMRRDVGRGGLCGIALDLAYPV 351
>gi|242040563|ref|XP_002467676.1| hypothetical protein SORBIDRAFT_01g032090 [Sorghum bicolor]
gi|241921530|gb|EER94674.1| hypothetical protein SORBIDRAFT_01g032090 [Sorghum bicolor]
Length = 358
Score = 251 bits (642), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 128/348 (36%), Positives = 198/348 (56%), Gaps = 24/348 (6%)
Query: 19 FIIIILLVSCASQVVSSRSTH--------------EQSVVEMHEKWMAQHGRSYKDELEK 64
F ++LV+C S ++ + + + +++ +W A + RSY E+
Sbjct: 15 FFFALILVACCSLMLQAAAAAGGGADGVVVGADGDNKLMMDRFLRWQATYNRSYPTAEER 74
Query: 65 EMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSS 124
+ RF++++ N+E+IE N+ GN TY LG N+F+DLT +EF LYT MP +
Sbjct: 75 QRRFQVYRRNMEHIEATNRAGNLTYTLGENQFADLTEEEFLDLYTMKGMPPVRRDAGKKQ 134
Query: 125 TFKYQNLSMTDVPTSLDWRDKKAVTPIKDQ-QECGCCWAFSAVAAVEGITKISGANLIQL 183
+ S+ D PTS+DWR + AVTPIK+Q C CWAF A +E IT+I L+ L
Sbjct: 135 QANFS--SVVDAPTSVDWRSRGAVTPIKNQGPSCSSCWAFVTAATIESITQIRTGKLVSL 192
Query: 184 SEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKA-AAAK 242
SEQ+L+DC + GC G ++++IQN G+ TE YPYQA + C+ ++ AA+
Sbjct: 193 SEQELIDCDPY-DGGCNLGYFVNGYKWVIQNGGLTTEANYPYQARRYQCNRSKAGQRAAR 251
Query: 243 ISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGF 302
ISNY ++P G+ Q + + +F Y G+++G CGT+++HA+T+VG+
Sbjct: 252 ISNYRQLPQGEAQLQQAVAQQPVAAAIEMGGSLQF--YSGGVWSGQCGTRMNHAITVVGY 309
Query: 303 GTTEDGANYWLIKNSWGDTWGDAGYMKI---LRDEGLCGIGTQSSYPL 347
G G YWL+KNSWG TWG+ GY+++ +R GLCGI +YP+
Sbjct: 310 GADSSGVKYWLVKNSWGQTWGERGYLRMRKDVRQGGLCGIALDLAYPI 357
>gi|189525868|ref|XP_001341714.2| PREDICTED: cathepsin L1-like isoform 1 [Danio rerio]
Length = 336
Score = 251 bits (642), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 135/340 (39%), Positives = 202/340 (59%), Gaps = 17/340 (5%)
Query: 20 IIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIE 79
++ LLV+ + V + S+ + + + W +QHG+SY +++E R I++ENL IE
Sbjct: 1 MMFALLVTLSISAVFAASSIDIQLDDHWNSWKSQHGKSYHEDVEVGRRM-IWEENLRKIE 59
Query: 80 KANKE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDV 136
+ N E GN T+K+G N+F D+TN+EFR GYK + TS + S
Sbjct: 60 QHNFEYSYGNHTFKMGMNQFGDMTNEEFRQAMNGYKHDP----NQTSQGPLFMEPSFFAA 115
Query: 137 PTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCS-TNG 195
P +DWR + VTP+KDQ++CG CW+FS+ A+EG LI +SEQ LVDCS G
Sbjct: 116 PQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQG 175
Query: 196 NNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQG-TCSAAQKAAAAKISNYEEVPSGDE 254
N GC GG M++AF+Y+ +N+G+ +E YPY A C + AKI+ + ++PSG+E
Sbjct: 176 NQGCNGGLMDQAFQYVKENKGLDSEQSYPYLARDDLPCRYDPRFNVAKITGFVDIPSGNE 235
Query: 255 QALLKAV-SMQPVSIGIAAYTTEFKSYKEGIF--NGVCGTQLDHAVTIVGF---GTTEDG 308
AL+ AV ++ PVS+ I A + Y+ GI+ ++LDHAV +VG+ G G
Sbjct: 236 LALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACSSSRLDHAVLVVGYGYQGADVAG 295
Query: 309 ANYWLIKNSWGDTWGDAGYMKILRDE-GLCGIGTQSSYPL 347
YW++KNSW D WGD GY+ + +D+ CG+ T++SYPL
Sbjct: 296 NRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGVATKASYPL 335
>gi|116794072|gb|ABK26996.1| unknown [Picea sitchensis]
Length = 367
Score = 251 bits (641), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 144/358 (40%), Positives = 200/358 (55%), Gaps = 28/358 (7%)
Query: 14 NTIPMFII---IILLVSCASQVVSSRSTHE----QSVVEMHEKWMAQHGRSYKDELEKEM 66
N + + +I II LVS A V S + +V + ++W+ +HG+ Y EK
Sbjct: 3 NPLHLLLISATIICLVSAAKAVQHSYEVGDINSGNGLVRLFDRWLGRHGKLYGSHEEKAR 62
Query: 67 RFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGY--KMPSPSHRSTTSS 124
R +IF+ NL+YI NK N +++LG N+F+DLTN+EF+ Y G K R+
Sbjct: 63 RLQIFRTNLQYIHAHNKNSNSSFRLGLNKFADLTNEEFKTRYFGKNSKQWRDRRRTELEG 122
Query: 125 TFKYQNLSMT--------DVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKIS 176
L T + +SLDWR K AVT +KDQ +CG CWAFS A+EG+ IS
Sbjct: 123 AELRPVLKQTVGSQSSSCSIASSLDWRKKGAVTGVKDQAQCGSCWAFSTTGAIEGVNFIS 182
Query: 177 GANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQ 236
L+ LSEQ+LV C N GC GG M+ AF ++IQN GI TE +Y Y V TC+ +
Sbjct: 183 TGKLVSLSEQELVACDAT-NYGCEGGDMDYAFTWVIQNGGIDTEKDYSYTGVDSTCNTNK 241
Query: 237 KAAA-AKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCG---TQ 292
+A I Y +V S D+ ALL A QPVS+GI +F+ Y GI++G C
Sbjct: 242 EAKKIVSIDGYTDV-SPDDSALLCAAGSQPVSVGIDGSAIDFQLYTGGIYDGDCSGNPDD 300
Query: 293 LDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRDE----GLCGIGTQSSYP 346
+DHAV +VG+ + ++G +YW++KNSWG WG GY ILR+ G+C I +SYP
Sbjct: 301 IDHAVLVVGY-SAKNGKDYWIVKNSWGTDWGLEGYFYILRNTELPYGVCAINAMASYP 357
>gi|356545079|ref|XP_003540973.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 330
Score = 251 bits (641), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 131/296 (44%), Positives = 183/296 (61%), Gaps = 12/296 (4%)
Query: 29 ASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRT 88
ASQV + R+ + S+ E HE+WM+++G+ YKD E+E RF+IFKEN+ YIE +N +
Sbjct: 5 ASQV-TCRTLQDASMYERHEEWMSRYGKVYKDPREREKRFRIFKENMNYIETSNNVAIKP 63
Query: 89 YKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAV 148
KL N+F+DL N+EF A +K + TF + P K AV
Sbjct: 64 XKLVINQFADLNNEEFIAPRNIFKGMILCRFLSRKHTFPF--------PYVFLGHKKGAV 115
Query: 149 TPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNG-NNGCGGGTMEKA 207
TP+KDQ CG CWAF VA+ EGI ++ LI LSEQ+LVDC T G + GC G M+ A
Sbjct: 116 TPVKDQGHCGFCWAFYDVASTEGILALTAGKLISLSEQELVDCDTKGVDQGCECGLMDDA 175
Query: 208 FEYIIQNQGIATEDEYPYQAVQGTCSAAQKA-AAAKISNYEEVPSGDEQALLKAVSMQPV 266
F++IIQN G+ + YPY+ V G C+A ++A AA I+ E+VP+ +E+AL K V+ QPV
Sbjct: 176 FKFIIQNHGVXDAN-YPYKGVDGKCNANEEANPAATITGXEDVPANNEKALQKVVANQPV 234
Query: 267 SIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTW 322
+ I A ++F+ YK G+F G C T+L+H VT +G+G + DG YWL+KNS W
Sbjct: 235 FVAIDACDSDFQFYKSGVFTGSCETELNHGVTTMGYGVSHDGTQYWLVKNSXETEW 290
>gi|151573014|gb|ABS17682.1| cathepsin L-1 [Artemia salina]
Length = 334
Score = 251 bits (641), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 133/335 (39%), Positives = 201/335 (60%), Gaps = 11/335 (3%)
Query: 21 IIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEK 80
+I LL + Q+ ++ S E H + A H + Y +LE++ R KI+ EN + K
Sbjct: 2 LIFLLGAVLVQLSAALSLTNLLADEWH-LFKATHKKEYPSQLEEKFRMKIYLENKHKVAK 60
Query: 81 AN---KEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
N ++G ++Y + N+F DL + EFR++ GY+ + S STF + + VP
Sbjct: 61 HNILYEKGEKSYHVAMNKFGDLLHHEFRSIMNGYQHKK-QNSSRAESTFTFMEPANVTVP 119
Query: 138 TSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTN-GN 196
S+DWR+K A+TP+KDQ +CG CWAFS+ A+EG T L+ LSEQ L+DCS GN
Sbjct: 120 ESVDWREKGAITPVKDQGQCGSCWAFSSTGALEGQTFRKTGKLVSLSEQNLIDCSGKYGN 179
Query: 197 NGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQA 256
GC GG M++AF+YI N+GI TE+ YPY+A C + A + ++PSG+E
Sbjct: 180 EGCNGGLMDQAFQYIKDNKGIDTENTYPYEAEDDVCRYNPRNRGAVDRGFVDIPSGEEDK 239
Query: 257 LLKAV-SMQPVSIGIAAYTTEFKSYKEGI-FNGVCGT-QLDHAVTIVGFGTTEDGANYWL 313
L AV ++ PVS+ I A F+ Y +G+ + C + LDH V +VG+G +++G +YWL
Sbjct: 240 LKAAVATVGPVSVAIDASHESFQFYSKGVYYEPSCDSDDLDHGVLVVGYG-SDNGKDYWL 298
Query: 314 IKNSWGDTWGDAGYMKILRD-EGLCGIGTQSSYPL 347
+KNSW + WGD GY+K+ R+ + CG+ + +SYPL
Sbjct: 299 VKNSWSEHWGDEGYIKMARNRKNHCGVASAASYPL 333
>gi|255544115|ref|XP_002513120.1| cysteine protease, putative [Ricinus communis]
gi|223548131|gb|EEF49623.1| cysteine protease, putative [Ricinus communis]
Length = 362
Score = 251 bits (640), Expect = 5e-64, Method: Compositional matrix adjust.
Identities = 134/269 (49%), Positives = 182/269 (67%), Gaps = 11/269 (4%)
Query: 19 FIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYI 78
F + + + SQ ++ R+ E S+ E HE+WMA + R YKD EK+MR+KIFKEN++ I
Sbjct: 12 FALFFSIGAWTSQCMA-RTLQEASMYERHEQWMASYARVYKDANEKQMRYKIFKENVQRI 70
Query: 79 EKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHR-STTSSTFKYQNLSMTDVP 137
+ N E +++YKL N+F+DLTN+EF++L G+K H S + F+Y+N+ T VP
Sbjct: 71 DSFNSESDKSYKLAVNQFADLTNEEFKSLRNGFK----GHMCSAQAGHFRYENV--TAVP 124
Query: 138 TSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNG-N 196
S+DWR K AVT IK+Q +CG CWAFSAVAAVEGIT+I LI LSEQ+LVDC TN +
Sbjct: 125 ASIDWRKKGAVTQIKEQGQCGSCWAFSAVAAVEGITEIKTGKLISLSEQELVDCDTNSED 184
Query: 197 NGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKA-AAAKISNYEEVPSGDEQ 255
GC GG M+ AF++I Q+ G+A+E YPY A TC ++A +AKI+ YE+VP+ DE
Sbjct: 185 QGCQGGLMDDAFKFIEQH-GLASEATYPYDAADSTCKTKEEAKPSAKITGYEDVPANDEA 243
Query: 256 ALLKAVSMQPVSIGIAAYTTEFKSYKEGI 284
AL AV+ QPVS+ I A EF+ Y GI
Sbjct: 244 ALKNAVANQPVSVAIDAGGFEFQFYSSGI 272
>gi|356514419|ref|XP_003525903.1| PREDICTED: LOW QUALITY PROTEIN: cysteine proteinase RD21a-like
[Glycine max]
Length = 343
Score = 251 bits (640), Expect = 5e-64, Method: Compositional matrix adjust.
Identities = 144/349 (41%), Positives = 199/349 (57%), Gaps = 44/349 (12%)
Query: 15 TIPMFIIIILLVSCASQ--VVSSRSTH--------EQSVVEMHEKWMAQHGRSYKDELEK 64
TI + +L VS A ++S +H ++ V+ ++E+ +A+HG+ Y E
Sbjct: 10 TIFILFFTVLAVSSALDLSIISYDRSHADKSGWRSDEEVMSIYEEXLAKHGKVYNAIDEM 69
Query: 65 EMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSS 124
E RF+I KENL+++E+ N GNRTYK+G NRF+D + M PS R
Sbjct: 70 EERFQISKENLKFVEQHNA-GNRTYKVGLNRFADRSR----------MMTRPSSR----- 113
Query: 125 TFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLS 184
Y ++ S+DWR + AV +K Q EC C F+ +AAVEGI KI NL LS
Sbjct: 114 ---YAPRVSDNLSESVDWRKEGAVVRVKTQSECESCRTFTVIAAVEGINKIVTGNLTALS 170
Query: 185 EQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKIS 244
DC N GC GG + A E+II N GI TE++YP+Q G C + A +
Sbjct: 171 -----DCDRTVNAGCSGGLADYALEFIINNGGIDTEEDYPFQGAVGICDQYKINA---VD 222
Query: 245 NYEEVPSGDEQALLKAVSMQPVSIG-IAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFG 303
YE VP+ DE AL KAV+ QPVS+ I AY EF+ Y+ GIF G CGT +DH VT VG+G
Sbjct: 223 GYERVPAYDELALKKAVANQPVSVAYIEAYGKEFQLYESGIFTGKCGTSIDHGVTAVGYG 282
Query: 304 TTEDGANYWLIKNSWGDTWGDAGYMKILRD-----EGLCGIGTQSSYPL 347
TE+G +YW++KNSWG+ WG+AGY+++ R+ G CGI + YP+
Sbjct: 283 -TENGIDYWIVKNSWGENWGEAGYVRMERNTAEDTAGKCGIAILTLYPI 330
>gi|332375975|gb|AEE63128.1| unknown [Dendroctonus ponderosae]
Length = 338
Score = 251 bits (640), Expect = 5e-64, Method: Compositional matrix adjust.
Identities = 137/339 (40%), Positives = 193/339 (56%), Gaps = 14/339 (4%)
Query: 20 IIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIE 79
+++++ ++ A Q VS + V E + QH + Y+ E E+ R KIF +N +
Sbjct: 4 LVLLVTIAVACQAVS----FSELVQEQWNSFKVQHKKQYESETEERFRMKIFMDNSHKVA 59
Query: 80 KANK---EGNRTYKLGTNRFSDLTNDEFRALYTGY-KMPSPSHRSTTSSTFKYQNLSMTD 135
K NK +G YKL N++ DL + EF L G+ + + R + + + D
Sbjct: 60 KHNKLFEQGLYPYKLAMNKYGDLLHHEFVGLLNGFNRTKTYLKRGELQDSITFIEPAHVD 119
Query: 136 VPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTN- 194
+P ++DWR + AVTP+KDQ CG CW+FSA A+EG L+ LSEQ LVDCS+
Sbjct: 120 IPDTVDWRQEGAVTPVKDQGHCGSCWSFSATGALEGQHFRQTKKLVSLSEQNLVDCSSRF 179
Query: 195 GNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDE 254
GNNGC GG M+ AF YI N GI TE YPY + K A + ++PSGDE
Sbjct: 180 GNNGCNGGLMDNAFRYIKNNGGIDTEAAYPYMGEDEKFRYSAKNRGATDKGFVDIPSGDE 239
Query: 255 QALLKAV-SMQPVSIGIAAYTTEFKSYKEGIF-NGVC-GTQLDHAVTIVGFGTTED-GAN 310
L AV ++ P+SI I A F+ Y G++ + C T+LDH V +VG+GT E G +
Sbjct: 240 DKLKAAVATVGPISIAIDASHESFQLYSNGVYSDPTCSSTELDHGVLVVGYGTDEKTGMD 299
Query: 311 YWLIKNSWGDTWGDAGYMKILRD-EGLCGIGTQSSYPLA 348
YWL+KNSWGDTWG GY+K+ R+ + CG+ TQ+SYPL
Sbjct: 300 YWLVKNSWGDTWGLDGYIKMARNQDNQCGVATQASYPLV 338
>gi|293342579|ref|XP_001065885.2| PREDICTED: cathepsin L1 [Rattus norvegicus]
gi|293354415|ref|XP_225137.5| PREDICTED: cathepsin L1 [Rattus norvegicus]
gi|149039747|gb|EDL93863.1| rCG24278 [Rattus norvegicus]
Length = 330
Score = 250 bits (639), Expect = 5e-64, Method: Compositional matrix adjust.
Identities = 129/332 (38%), Positives = 194/332 (58%), Gaps = 14/332 (4%)
Query: 22 IILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKA 81
I LL + ++S+ TH+ S + E+W +HG++Y E + R +++ N++ I
Sbjct: 4 IFLLATLCLGMISAAPTHDPSFDTVWEEWKTKHGKTYNTNEEGQKR-AVWENNMKMINLH 62
Query: 82 NKE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPT 138
N++ G + L N F DLTN EFR L TG++ P + F + D+P
Sbjct: 63 NEDYLKGKHGFSLEMNAFGDLTNTEFRELMTGFQSMGPKETTIFREPF------LGDIPK 116
Query: 139 SLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCS-TNGNN 197
SLDWR+ VTP+K+Q +CG CWAFSAV ++EG L+ LSEQ LVDCS + GN
Sbjct: 117 SLDWREHGYVTPVKNQGQCGSCWAFSAVGSLEGQIFKKTGKLVSLSEQNLVDCSWSYGNL 176
Query: 198 GCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQAL 257
GC GG ME AF+Y+ +N+G+ T + Y Y+A G C K +AA ++ + +VP ++ +
Sbjct: 177 GCNGGLMEFAFQYVKENRGLDTGESYAYEAQDGLCRYNPKYSAANVTGFVKVPLSEDDLM 236
Query: 258 LKAVSMQPVSIGIAAYTTEFKSYKEGIF--NGVCGTQLDHAVTIVGFGTTEDGANYWLIK 315
S+ PVS+GI ++ F+ Y G++ T++DHAV +VG+G DG YWL+K
Sbjct: 237 SAVASVGPVSVGIDSHHQSFRFYSGGMYYEPDCSSTEMDHAVLVVGYGEESDGGKYWLVK 296
Query: 316 NSWGDTWGDAGYMKILRDE-GLCGIGTQSSYP 346
NSWG+ WG GY+K+ +D+ CGI T + YP
Sbjct: 297 NSWGEDWGMDGYIKMAKDQNNNCGIATYAIYP 328
>gi|8886940|gb|AAF80626.1|AC069251_19 F2D10.37 [Arabidopsis thaliana]
Length = 315
Score = 250 bits (639), Expect = 5e-64, Method: Compositional matrix adjust.
Identities = 121/276 (43%), Positives = 188/276 (68%), Gaps = 7/276 (2%)
Query: 38 THEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFS 97
+H++ ++E+ E W++ ++Y+ EK +RF++FK+NL++I++ NK+G ++Y LG N F+
Sbjct: 43 SHDK-LIELFENWISNFEKAYETVEEKFLRFEVFKDNLKHIDETNKKG-KSYWLGLNEFA 100
Query: 98 DLTNDEFRALYTGYKMPSPSHRSTTS-STFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQE 156
DL+++EF+ +Y G K S + F Y+++ VP S+DWR K AV +K+Q
Sbjct: 101 DLSHEEFKKMYLGLKTDIVRRDEERSYAEFAYRDVEA--VPKSVDWRKKGAVAEVKNQGS 158
Query: 157 CGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQG 216
CG CWAFS VAAVEGI KI NL LSEQ+L+DC T NNGC GG M+ AFEYI++N G
Sbjct: 159 CGSCWAFSTVAAVEGINKIVTGNLTTLSEQELIDCDTTYNNGCNGGLMDYAFEYIVKNGG 218
Query: 217 IATEDEYPYQAVQGTCSAAQ-KAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTT 275
+ E++YPY +GTC + ++ I+ +++VP+ DE++LLKA++ QP+S+ I A
Sbjct: 219 LRKEEDYPYSMEEGTCEMQKDESETVTINGHQDVPTNDEKSLLKALAHQPLSVAIDASGR 278
Query: 276 EFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANY 311
EF+ Y G+F+G CG LDH V VG+G+++ G++Y
Sbjct: 279 EFQFYSGGVFDGRCGVDLDHGVAAVGYGSSK-GSDY 313
>gi|189525870|ref|XP_001923796.1| PREDICTED: cathepsin L1 [Danio rerio]
Length = 335
Score = 250 bits (639), Expect = 5e-64, Method: Compositional matrix adjust.
Identities = 135/339 (39%), Positives = 199/339 (58%), Gaps = 16/339 (4%)
Query: 20 IIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIE 79
++ LLV+ V + + + + + W +QHG+SY +++E R I++ENL IE
Sbjct: 1 MMFALLVTLYISAVFAAPSIDIQLDDHWNSWKSQHGKSYHEDVEVGRRM-IWEENLRKIE 59
Query: 80 KANKE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDV 136
+ N E GN T+K+G N+F D+TN+EFR GYK + TS +
Sbjct: 60 QHNFEYSLGNHTFKMGMNQFGDMTNEEFRQAMNGYKHDP----NRTSQGPLFMEPKFFAA 115
Query: 137 PTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCS-TNG 195
P +DWR + VTP+KDQ++CG CW+FS+ A+EG LI +SEQ LVDCS +G
Sbjct: 116 PQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPHG 175
Query: 196 NNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQG-TCSAAQKAAAAKISNYEEVPSGDE 254
N GC GG M++AF+Y+ +N+G+ +E YPY A C + AKI+ + ++P G+E
Sbjct: 176 NQGCNGGLMDQAFQYVKENKGLDSEQSYPYLARDDLPCRYDPRFNVAKITGFVDIPKGNE 235
Query: 255 QALLKAV-SMQPVSIGIAAYTTEFKSYKEGI-FNGVCGTQLDHAVTIVGF---GTTEDGA 309
AL+ AV ++ PVS+ I A + Y+ GI + C +QLDHAV +VG+ G G
Sbjct: 236 LALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACTSQLDHAVLVVGYGYQGADVAGN 295
Query: 310 NYWLIKNSWGDTWGDAGYMKILRDE-GLCGIGTQSSYPL 347
YW++KNSW D WGD GY+ + +D+ CGI T +SYPL
Sbjct: 296 RYWIVKNSWSDKWGDKGYIYMAKDKNNHCGIATMASYPL 334
>gi|326672297|ref|XP_003199631.1| PREDICTED: cathepsin L1-like [Danio rerio]
Length = 336
Score = 250 bits (639), Expect = 5e-64, Method: Compositional matrix adjust.
Identities = 138/342 (40%), Positives = 202/342 (59%), Gaps = 20/342 (5%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEY 77
MF +++ L C S V ++ S Q + + W +QHG+SY +++E R I++ENL
Sbjct: 2 MFALLVTL--CISAVFAASSIDIQ-LDDHWNSWKSQHGKSYHEDVEVGRRM-IWEENLRK 57
Query: 78 IEKANKE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT 134
IE+ N E GN T+K+G N+F D+TN+EFR GYK + TS + S
Sbjct: 58 IEQHNFEYSYGNHTFKMGMNQFGDMTNEEFRHAMNGYKHDP----NQTSQGPLFMEPSFF 113
Query: 135 DVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCS-T 193
P +DWR + VTP+KDQ++CG CW+FS+ A+EG LI +SEQ LVDCS
Sbjct: 114 AAPQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRP 173
Query: 194 NGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQG-TCSAAQKAAAAKISNYEEVPSG 252
+GN GC GG M++AF+Y+ +N+G+ +E YPY A C + AKI+ + ++P G
Sbjct: 174 HGNQGCNGGLMDQAFQYVKENKGLDSEQSYPYLARDDLPCRYDPRFNVAKITGFVDIPKG 233
Query: 253 DEQALLKAV-SMQPVSIGIAAYTTEFKSYKEGIF--NGVCGTQLDHAVTIVGF---GTTE 306
+E AL+ AV ++ PVS+ I A + Y+ GI+ ++LDHAV +VG+ G
Sbjct: 234 NELALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACSSSRLDHAVLVVGYGYQGADV 293
Query: 307 DGANYWLIKNSWGDTWGDAGYMKILRDE-GLCGIGTQSSYPL 347
G YW++KNSW D WGD GY+ + +D+ CGI T +SYPL
Sbjct: 294 AGNRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGIATMASYPL 335
>gi|330434686|gb|AEC22811.1| cathepsin L [Macrobrachium nipponense]
Length = 342
Score = 250 bits (639), Expect = 6e-64, Method: Compositional matrix adjust.
Identities = 136/317 (42%), Positives = 190/317 (59%), Gaps = 12/317 (3%)
Query: 43 VVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANK---EGNRTYKLGTNRFSDL 99
V+E E + +H + Y+ + E+ R KIF EN + I NK G++TYKLG N++ D+
Sbjct: 25 VMEEWESFKFEHSKKYESDTEETFRMKIFAENKQKIAAHNKLYHTGSKTYKLGMNKYGDM 84
Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFKYQNL--SMTDV--PTSLDWRDKKAVTPIKDQQ 155
+ EF + G++ + + F+ + DV P S+DWR+K AVT +KDQ
Sbjct: 85 LHHEFVNMMNGFRANTSGAGYKANRGFQGAHFVEPPEDVVMPKSVDWREKGAVTEVKDQG 144
Query: 156 ECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTN-GNNGCGGGTMEKAFEYIIQN 214
CG CWAFSA A+EG +L+ LSEQ LVDCS+ GNNGC GG M+ AF+YI N
Sbjct: 145 SCGSCWAFSATGALEGQHYRQTGDLVSLSEQNLVDCSSKFGNNGCNGGLMDNAFQYIKVN 204
Query: 215 QGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAV-SMQPVSIGIAAY 273
GI TE YPY+A C A A + +V G+E AL KA+ ++ PVS+ I A
Sbjct: 205 GGIDTEKSYPYEAEDEPCRYNPANAGADDRGFVDVREGNENALKKAIATIGPVSVAIDAS 264
Query: 274 TTEFKSYKEGIFNGV-CGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKIL 331
F+ Y+ G+++ C + LDH V VG+GTTEDG +YWL+KNSW +WGD GY+KI
Sbjct: 265 QDSFQFYQHGVYSDPDCSAENLDHGVLAVGYGTTEDGQDYWLVKNSWSKSWGDQGYIKIA 324
Query: 332 RDE-GLCGIGTQSSYPL 347
R++ +CGI + +SYPL
Sbjct: 325 RNQNNMCGIASAASYPL 341
>gi|307111936|gb|EFN60170.1| hypothetical protein CHLNCDRAFT_59551 [Chlorella variabilis]
Length = 364
Score = 250 bits (639), Expect = 6e-64, Method: Compositional matrix adjust.
Identities = 136/350 (38%), Positives = 195/350 (55%), Gaps = 29/350 (8%)
Query: 23 ILLVSCASQVVSSRSTHE----------QSVVEMHEKWM----AQHGRSYKDELE-KEMR 67
+LLV+C+ V++ E +S E + W+ R+Y E E R
Sbjct: 12 VLLVACSCLAVAAGFRFENHRLFIQQAIESPREAFDFWVHTVKPPSNRAYASSAEVYERR 71
Query: 68 FKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFK 127
F I+ +NL + + N + ++ L ++DL+ DE+R+ GY R ++ F
Sbjct: 72 FNIWLDNLRFAHEYNAR-HTSHWLSMGVYADLSQDEYRSKALGYNAHLHKKRPLRAAPFL 130
Query: 128 YQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQ 187
Y+ T P +DW AVTP+KDQ CG CWAFS AVEG I+ L+ LSEQ
Sbjct: 131 YKG---TVPPEEVDWVAGGAVTPVKDQLLCGSCWAFSTTGAVEGANAIATGKLVSLSEQM 187
Query: 188 LVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQ-KAAAAKISNY 246
LVDC + GC GG M+ AF++I+ N GI TED+YPY+A G C + + I Y
Sbjct: 188 LVDCDREYDTGCRGGFMDSAFDFIVNNGGIDTEDDYPYRAEDGICQDNRTRRHVVTIDGY 247
Query: 247 EEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTE 306
++VP DE AL+KAV+ QPVS+ I A F+ Y G+F+ CGT LDHAV +VG+GT
Sbjct: 248 QDVPPNDENALMKAVAHQPVSVAIEADQLAFQLYGGGVFDAECGTALDHAVLVVGYGTAS 307
Query: 307 DGAN---YWLIKNSWGDTWGDAGYMKILRD------EGLCGIGTQSSYPL 347
+G + YWL+KNSWG WG+ GY+++LR+ EG CG+ +S+P+
Sbjct: 308 NGTHNLPYWLVKNSWGAEWGEKGYIRLLRNLGKDAPEGQCGLAMYASFPI 357
>gi|81542|pir||S02728 actinidain (EC 3.4.22.14) precursor (clone pAC.1) - kiwi fruit
(fragment)
gi|15957|emb|CAA31435.1| actinidin precursor [Actinidia chinensis]
gi|166319|gb|AAA32630.1| actinidin precursor [Actinidia deliciosa]
gi|226542|prf||1601514A actinidin
Length = 302
Score = 250 bits (638), Expect = 8e-64, Method: Compositional matrix adjust.
Identities = 131/278 (47%), Positives = 171/278 (61%), Gaps = 10/278 (3%)
Query: 75 LEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT 134
L +I++ N + NR+YK+G N+F+DLT +EFR+ Y G+ S + T + +Y+
Sbjct: 1 LRFIDEHNADTNRSYKVGLNQFADLTGEEFRSTYLGFTGGS----NKTKVSNRYEPRVSQ 56
Query: 135 DVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDC-ST 193
+P+ +DWR AV IK Q ECG CWAFSA+A VEGI KI LI LSEQ+L+ C T
Sbjct: 57 VLPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSEQELIGCGGT 116
Query: 194 NGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSA-AQKAAAAKISNYEEVPSG 252
GC GG + F++II N GI T + YPY A G C+ Q I Y VP
Sbjct: 117 QNTRGCNGGYITDGFQFIINNGGINTGENYPYTAQDGECNLDLQNEKYVTIDTYGNVPYN 176
Query: 253 DEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYW 312
+E AL AV+ QPVS+ + A FK Y GIF G CGT +DHAVTIVG+G TE G +YW
Sbjct: 177 NEWALQTAVTYQPVSVALDAAGDAFKHYSSGIFTGPCGTAIDHAVTIVGYG-TEGGIDYW 235
Query: 313 LIKNSWGDTWGDAGYMKILRD---EGLCGIGTQSSYPL 347
+++NSW TWG+ GYM+ILR+ G CGI T SYP+
Sbjct: 236 IVENSWDTTWGEEGYMRILRNVGGAGTCGIATMPSYPV 273
>gi|146152090|gb|ABQ08058.1| cathepsin L [Misgurnus mizolepis]
Length = 337
Score = 250 bits (638), Expect = 8e-64, Method: Compositional matrix adjust.
Identities = 144/343 (41%), Positives = 199/343 (58%), Gaps = 18/343 (5%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEY 77
M+ + L C S V ++ S +Q + + E+W HG++Y E E+ R I+++NL
Sbjct: 1 MWTYLALFTLCLSGVFAAPSLDKQ-LDDHWEQWKTWHGKNYH-EKEEGWRRMIWEKNLRK 58
Query: 78 IEKANKE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT 134
I+ N E G TY+LG N F D+ ++EFR + GYK + R S F N
Sbjct: 59 IQFHNLEHSMGIHTYRLGMNHFGDMNHEEFRQVMNGYK--HKTERKFKGSLFMEPNF--L 114
Query: 135 DVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCS-T 193
+VP+ LDWR+K VTP+KDQ ECG CWAFS A+EG L+ LSEQ LVDCS
Sbjct: 115 EVPSKLDWREKGYVTPVKDQGECGSCWAFSTTGAMEGQMFRKQGKLVSLSEQNLVDCSRP 174
Query: 194 NGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQG-TCSAAQKAAAAKISNYEEVPSG 252
GN GC GG M++AF+YI N G+ +E+ YPY C K AA + + ++PSG
Sbjct: 175 EGNEGCNGGLMDQAFQYIKDNNGLDSEEAYPYLGTDDQPCHYDPKYNAANDTGFVDIPSG 234
Query: 253 DEQALLKAV-SMQPVSIGIAAYTTEFKSYKEGI-FNGVCGT-QLDHAVTIVGF---GTTE 306
E AL+KAV S+ PVS+ I A F+ Y+ GI F C + +LDH V +VG+ G
Sbjct: 235 KEHALMKAVASVGPVSVAIDAGHESFQFYQSGIYFEKECSSEELDHGVLVVGYGFEGEDV 294
Query: 307 DGANYWLIKNSWGDTWGDAGYMKILRD-EGLCGIGTQSSYPLA 348
DG YW++KNSW ++WGD GY+ + +D + CGI T +SYPL
Sbjct: 295 DGKKYWIVKNSWSESWGDKGYIYMAKDRKNHCGIATAASYPLV 337
>gi|237844793|ref|XP_002371694.1| cathepsin L-like thiolproteinase, putative [Toxoplasma gondii ME49]
gi|50313163|gb|AAT74529.1| toxopain-2 [Toxoplasma gondii]
gi|89242977|gb|ABD64744.1| cathepsin L [Toxoplasma gondii]
gi|95007485|emb|CAJ20707.1| toxopain-2 [Toxoplasma gondii RH]
gi|211969358|gb|EEB04554.1| cathepsin L-like thiolproteinase, putative [Toxoplasma gondii ME49]
gi|221480879|gb|EEE19300.1| cysteine protease, putative [Toxoplasma gondii GT1]
gi|221501596|gb|EEE27366.1| cysteine protease, putative [Toxoplasma gondii VEG]
Length = 422
Score = 250 bits (638), Expect = 8e-64, Method: Compositional matrix adjust.
Identities = 125/301 (41%), Positives = 174/301 (57%), Gaps = 6/301 (1%)
Query: 52 AQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGY 111
A + +SY E EK+ R+ IFK NL YI N++G +Y L N F DL+ DEFR Y G+
Sbjct: 122 AMYAKSYATEEEKQRRYAIFKNNLVYIHTHNQQGY-SYSLKMNHFGDLSRDEFRRKYLGF 180
Query: 112 KMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEG 171
K + N+ +++P +DWR + VTP+KDQ++CG CWAFS A+EG
Sbjct: 181 KKSRNLKSHHLGVATELLNVLPSELPAGVDWRSRGCVTPVKDQRDCGSCWAFSTTGALEG 240
Query: 172 ITKISGANLIQLSEQQLVDCS-TNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQG 230
L+ LSEQ+L+DCS GN C GG M AF+Y++ + GI +ED YPY A
Sbjct: 241 AHCAKTGKLVSLSEQELMDCSRAEGNQSCSGGEMNDAFQYVLDSGGICSEDAYPYLARDE 300
Query: 231 TCSAAQKAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCG 290
C A KI +++VP E A+ A++ PVSI I A F+ Y EG+F+ CG
Sbjct: 301 ECRAQSCEKVVKILGFKDVPRRSEAAMKAALAKSPVSIAIEADQMPFQFYHEGVFDASCG 360
Query: 291 TQLDHAVTIVGFGT-TEDGANYWLIKNSWGDTWGDAGYMKILR---DEGLCGIGTQSSYP 346
T LDH V +VG+GT E ++W++KNSWG WG GYM + +EG CG+ +S+P
Sbjct: 361 TDLDHGVLLVGYGTDKESKKDFWIMKNSWGTGWGRDGYMYMAMHKGEEGQCGLLLDASFP 420
Query: 347 L 347
+
Sbjct: 421 V 421
>gi|157787177|ref|NP_001099150.1| cathepsin L1-like precursor [Danio rerio]
gi|157422879|gb|AAI53505.1| MGC174152 protein [Danio rerio]
Length = 336
Score = 250 bits (638), Expect = 9e-64, Method: Compositional matrix adjust.
Identities = 138/342 (40%), Positives = 201/342 (58%), Gaps = 20/342 (5%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEY 77
MF +++ L C S V ++ S Q + + W +QHG+SY +++E R I++ENL
Sbjct: 2 MFALLVTL--CISAVFAASSIDIQ-LDDHWNSWKSQHGKSYHEDVEVGRRM-IWEENLRK 57
Query: 78 IEKANKE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT 134
IE+ N E GN T+K+G N+F D+TN+EFR GYK + TS + S
Sbjct: 58 IEQHNFEYSYGNHTFKMGMNQFGDMTNEEFRHAMNGYKHDP----NQTSQGPLFMEPSFF 113
Query: 135 DVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCS-T 193
P +DWR + VTP+KDQ++CG CW+FS+ A+EG LI +SEQ LVDCS
Sbjct: 114 AAPQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRP 173
Query: 194 NGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQG-TCSAAQKAAAAKISNYEEVPSG 252
GN GC GG M++AF+Y+ +N+G+ +E YPY A C + AKI+ + ++P G
Sbjct: 174 QGNQGCNGGLMDQAFQYVKENKGLDSEQSYPYLARDDLPCRYDPRFNVAKITGFVDIPRG 233
Query: 253 DEQALLKAV-SMQPVSIGIAAYTTEFKSYKEGIF--NGVCGTQLDHAVTIVGF---GTTE 306
+E AL+ AV ++ PVS+ I A + Y+ GI+ ++LDHAV +VG+ G
Sbjct: 234 NELALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACSSSRLDHAVLVVGYGYQGADV 293
Query: 307 DGANYWLIKNSWGDTWGDAGYMKILRDE-GLCGIGTQSSYPL 347
G YW++KNSW D WGD GY+ + +D+ CGI T +SYPL
Sbjct: 294 AGNRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGIATMASYPL 335
>gi|402770499|gb|AFQ98384.1| cathepsin L, partial [Hyalomma anatolicum anatolicum]
Length = 312
Score = 249 bits (637), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 131/309 (42%), Positives = 189/309 (61%), Gaps = 14/309 (4%)
Query: 48 EKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKE---GNRTYKLGTNRFSDLTNDEF 104
E + H +SY+ ++E+ +R+KIF EN I K N + G +YKLG N+F DL EF
Sbjct: 8 EAFKTTHKKSYQSKMEELLRYKIFTENSLLIAKHNAKYAKGLVSYKLGMNQFGDLLPHEF 67
Query: 105 RALYTGYKMPSPSHRSTTSSTF-KYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAF 163
++ GY R STF N++ + +P ++DWR K AVTP+KDQ +CG CWAF
Sbjct: 68 AKMFNGYH----GERKGRGSTFLPPANVNDSSLPKTVDWRKKGAVTPVKDQGQCGSCWAF 123
Query: 164 SAVAAVEGITKISGANLIQLSEQQLVDCSTN-GNNGCGGGTMEKAFEYIIQNQGIATEDE 222
SA ++EG + L+ LSEQ L+DCS + GN GCGGG M+ AF+YI N GI TE+
Sbjct: 124 SATGSLEGQHFLKSGKLVSLSEQNLIDCSGSFGNEGCGGGLMDNAFKYIKANDGIDTEES 183
Query: 223 YPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAV-SMQPVSIGIAAYTTEFKSYK 281
YPY+A+ G C ++ A + + ++ G E L KAV ++ P+S+ I A + F+ Y
Sbjct: 184 YPYEAMDGDCRFKKEDVGATDTGFVDIQQGSEDDLQKAVATVGPISVAIDASHSSFQLYS 243
Query: 282 EGIFN--GVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD-EGLCG 338
EG+++ +LDH V VG+G ++G YWL+KNSW +TWGD GY+ + RD + CG
Sbjct: 244 EGVYDEPNCSSEELDHGVLAVGYG-VKNGKKYWLVKNSWAETWGDNGYILMSRDKDNQCG 302
Query: 339 IGTQSSYPL 347
I + +SYPL
Sbjct: 303 IASSASYPL 311
>gi|348687948|gb|EGZ27762.1| papain-like cysteine protease C1 [Phytophthora sojae]
Length = 533
Score = 249 bits (637), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 131/305 (42%), Positives = 183/305 (60%), Gaps = 11/305 (3%)
Query: 50 WMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRT-YKLGTNRFSDLTNDEFRALY 108
WM+ HG ++ D LE R + + N YI + N E T KLG N FS ++ DEF+
Sbjct: 31 WMSAHGVTFSDALEFARRLENYIANDMYILEHNAENAWTGVKLGHNAFSHMSFDEFKFKM 90
Query: 109 TGYKMPSPSHRSTTSSTFKYQNL-SMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVA 167
TG +P +S + L S +VP+++DW DK VTP+K+Q CG CWAFS
Sbjct: 91 TGLVLPEGYLEQRLAS--RVDGLWSDVEVPSAVDWVDKGGVTPVKNQGMCGSCWAFSTTG 148
Query: 168 AVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQA 227
AVEG T +S L+ LSEQ+LVDC NG+ GC GG M+ AF++I + GI +ED+Y Y+A
Sbjct: 149 AVEGATFVSSGKLLSLSEQELVDCDHNGDMGCNGGLMDHAFQWIEDHGGICSEDDYEYKA 208
Query: 228 VQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNG 287
C + K++ +++V DE AL AV+ QPVS+ I A F+ YK G+FN
Sbjct: 209 KAQVCRKCD--SVVKVTGFQDVNPQDEHALKVAVAQQPVSVAIEADQKAFQFYKSGVFNL 266
Query: 288 VCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRDE----GLCGIGTQS 343
CGT+LDH V VG+G ++G +W +KNSWG +WG+ GY+++ R+E G CGI +
Sbjct: 267 TCGTRLDHGVLAVGYG-NDNGQKFWKVKNSWGASWGEQGYIRLAREENGPAGQCGIASVP 325
Query: 344 SYPLA 348
SYP A
Sbjct: 326 SYPFA 330
>gi|302763127|ref|XP_002964985.1| hypothetical protein SELMODRAFT_406652 [Selaginella moellendorffii]
gi|300167218|gb|EFJ33823.1| hypothetical protein SELMODRAFT_406652 [Selaginella moellendorffii]
Length = 320
Score = 249 bits (637), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 132/326 (40%), Positives = 189/326 (57%), Gaps = 29/326 (8%)
Query: 14 NTIPMFIIIILLVSCASQVVSSRSTHEQS----VVEMHEKWMAQHGRSYKDELEKEMRFK 69
N I + +I++++V A ++ + E + M E W A+HG+SY + EK R
Sbjct: 4 NMIALILILLVVVGAAPFAIARPAALEDDRALEIKNMFEDWAAKHGKSYSSDWEKARRMT 63
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQ 129
IF + L YIEK N N T+ LG N+FSDLTN EFRA Y G K P ++ + K
Sbjct: 64 IFSDTLAYIEKHNALPNTTFTLGLNKFSDLTNAEFRANYVG-KFKPPRYQDRRPA--KDV 120
Query: 130 NLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLV 189
++ ++ +PTSLDWR + AVTPIKDQ +CG CWAFSA+A++E ++ L+ LSEQQL+
Sbjct: 121 DVDVSSLPTSLDWRQEGAVTPIKDQGQCGSCWAFSAIASIESAHFLATNQLVSLSEQQLI 180
Query: 190 DCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEV 249
DC T + GC E+ YPY + G+C+ A K A+I+ + V
Sbjct: 181 DCDTV-DEGC-------------------QEEAYPYTGLAGSCN-ANKNKVAEITGFNVV 219
Query: 250 PSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGA 309
AL+KAVS PV++GI F++Y+ GI +G C DH V ++G+G TE G
Sbjct: 220 TKDKADALMKAVSKTPVTVGICGSDQNFQNYRSGILSGQCCNSRDHVVLVIGYG-TEGGM 278
Query: 310 NYWLIKNSWGDTWGDAGYMKILRDEG 335
YW+IKNSWG +WG+ G+MKI + +G
Sbjct: 279 PYWIIKNSWGTSWGEDGFMKIEKKDG 304
>gi|162138968|ref|NP_001104662.1| uncharacterized protein LOC567623 precursor [Danio rerio]
gi|158254065|gb|AAI54241.1| Zgc:174153 protein [Danio rerio]
Length = 336
Score = 249 bits (637), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 139/342 (40%), Positives = 201/342 (58%), Gaps = 20/342 (5%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEY 77
MF +II L C S V ++ S Q + + W +QHG+SY +++E R I++ENL
Sbjct: 2 MFALIITL--CISAVFTAPSIDIQ-LDDHWNSWKSQHGKSYHEDVEVGRRM-IWEENLRK 57
Query: 78 IEKANKE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT 134
IE+ N E GN T+K+G N+F D+TN+EFR GYK + TS + S
Sbjct: 58 IEQHNFEYSYGNHTFKMGMNQFGDMTNEEFRQAMNGYKHDP----NRTSQGPLFMEPSFF 113
Query: 135 DVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCS-T 193
P +DWR + VTP+KDQ++CG CW+FS+ A+EG LI +SEQ LVDCS
Sbjct: 114 AAPQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRP 173
Query: 194 NGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQG-TCSAAQKAAAAKISNYEEVPSG 252
GN GC GG M+ AF+Y+ +N+G+ +E YPY A C + AK + + ++PSG
Sbjct: 174 QGNQGCNGGLMDLAFQYVKENKGLDSEQSYPYLARDDLPCRYDPRFNVAKSTGFVDIPSG 233
Query: 253 DEQALLKAV-SMQPVSIGIAAYTTEFKSYKEGIF--NGVCGTQLDHAVTIVGF---GTTE 306
+E AL+ AV ++ PVS+ I A + Y+ GI+ ++LDHAV +VG+ G
Sbjct: 234 NEPALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACSSSRLDHAVLVVGYGYQGADV 293
Query: 307 DGANYWLIKNSWGDTWGDAGYMKILRDE-GLCGIGTQSSYPL 347
G YW++KNSW D WGD GY+ + +D+ CG+ T++SYPL
Sbjct: 294 AGNRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGVATKASYPL 335
>gi|157311713|ref|NP_001098585.1| uncharacterized protein LOC564979 precursor [Danio rerio]
gi|156230121|gb|AAI52284.1| Wu:fa26c03 protein [Danio rerio]
Length = 336
Score = 249 bits (637), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 137/342 (40%), Positives = 201/342 (58%), Gaps = 20/342 (5%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEY 77
MF +++ L C S V ++ S Q + + W +QHG+SY +++E R I++ENL
Sbjct: 2 MFALLVTL--CISAVFAASSIDIQ-LDDHWNSWKSQHGKSYHEDVEVGRRM-IWEENLRK 57
Query: 78 IEKANKE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT 134
IE+ N E GN T+K+G N+F D+TN+EFR GYK + TS + S
Sbjct: 58 IEQHNFEYSYGNHTFKMGMNQFGDMTNEEFRQAMNGYKHDP----NRTSQGPLFMEPSFF 113
Query: 135 DVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCS-T 193
P +DWR + VTP+KDQ++CG CW+FS+ A+EG LI +SEQ LVDCS
Sbjct: 114 AAPQQVDWRQRGFVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRP 173
Query: 194 NGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQG-TCSAAQKAAAAKISNYEEVPSG 252
GN GC GG M++AF+Y+ +N+G+ +E YPY A C + AKI+ + ++P G
Sbjct: 174 QGNQGCNGGLMDQAFQYVKENKGLDSEQSYPYLARDDLPCRYDPRFNVAKITGFVDIPRG 233
Query: 253 DEQALLKAV-SMQPVSIGIAAYTTEFKSYKEGIF--NGVCGTQLDHAVTIVGF---GTTE 306
+E AL+ AV ++ PVS+ I A + Y+ GI+ ++LDHAV +VG+ G
Sbjct: 234 NELALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACSSSRLDHAVLVVGYGYQGADV 293
Query: 307 DGANYWLIKNSWGDTWGDAGYMKILRDE-GLCGIGTQSSYPL 347
G YW++KNSW D WGD GY+ + +D+ CG+ T +SYPL
Sbjct: 294 AGNRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGVATSASYPL 335
>gi|21483184|gb|AAF86584.1| cathepsin L cysteine protease [Haemonchus contortus]
Length = 355
Score = 249 bits (637), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 135/327 (41%), Positives = 196/327 (59%), Gaps = 13/327 (3%)
Query: 33 VSSRSTHEQSVVEMHEKW---MAQHGRSYKDELEKEMRFKIFKENLEYIEKANKE---GN 86
V + + Q + E KW G+SY+ E E + + F +N+ +IE+ NKE G
Sbjct: 31 VHRQKSLRQKIDEAFNKWDDYKETFGKSYEPEEENDY-MEAFVKNVIHIEEHNKEHRLGR 89
Query: 87 RTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKK 146
+T+++G N +DL ++R L GY+M S S+ K+ +P S+DWR++
Sbjct: 90 KTFEMGLNEIADLPFSQYRKL-NGYRMRRQFGDSMQSNGTKFLVPFNVQIPESVDWREEG 148
Query: 147 AVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTN-GNNGCGGGTME 205
VTP+K+Q CG CWAFS+ A+EG + L+ LSEQ LVDCST GN+GC GG M+
Sbjct: 149 LVTPVKNQGMCGSCWAFSSTGALEGQHARATGKLVSLSEQNLVDCSTKYGNHGCNGGLMD 208
Query: 206 KAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAVSMQ- 264
AFEYI +N G+ TED YPY + C + A + ++P GDE+AL KAV+ Q
Sbjct: 209 LAFEYIKENHGVDTEDSYPYVGRETKCHFKRNTVGADDKGFVDLPEGDEEALKKAVATQG 268
Query: 265 PVSIGIAAYTTEFKSYKEGI-FNGVCGT-QLDHAVTIVGFGTTEDGANYWLIKNSWGDTW 322
P+SI I A F+ YK+G+ F+ C + +LDH V +VG+GT + +YWL+KNSWG TW
Sbjct: 269 PISIAIDAGHRSFQLYKKGVYFDEECSSEELDHGVLLVGYGTDPEAGDYWLVKNSWGPTW 328
Query: 323 GDAGYMKILRDE-GLCGIGTQSSYPLA 348
G+ GY++I R+ CG+ T++SYPL
Sbjct: 329 GEKGYIRIARNRNNHCGVATKASYPLV 355
>gi|21489677|gb|AAM55195.1|AF412313_1 cathepsin L cysteine protease [Haemonchus contortus]
gi|21483192|gb|AAL14224.1| cathepsin L [Haemonchus contortus]
Length = 354
Score = 249 bits (636), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 135/326 (41%), Positives = 197/326 (60%), Gaps = 13/326 (3%)
Query: 33 VSSRSTHEQSVVEMHEKW---MAQHGRSYKDELEKEMRFKIFKENLEYIEKANKE---GN 86
V + + Q + E KW G+SY+ + E + + F +N+ +IE+ NKE G
Sbjct: 30 VHRQKSLRQKIDEAFNKWDDYKETFGKSYEPDEENDY-MEAFVKNVIHIEEHNKEHRLGR 88
Query: 87 RTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKK 146
+T+++G N +DL ++R L GY+M S S+ K+ +P S+DWR++
Sbjct: 89 KTFEMGLNEIADLPFSQYRKL-NGYRMRRQFGDSLQSNGTKFLVPFNVQIPESVDWREEG 147
Query: 147 AVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTN-GNNGCGGGTME 205
VTP+K+Q CG CWAFS+ A+EG + L+ LSEQ LVDCST GN+GC GG M+
Sbjct: 148 LVTPVKNQGMCGSCWAFSSTGALEGQHARATGKLVSLSEQNLVDCSTKYGNHGCNGGLMD 207
Query: 206 KAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAVSMQ- 264
AFEYI +N G+ TED YPY + C + A A + ++P GDE+AL KAV+ Q
Sbjct: 208 LAFEYIKENHGVDTEDSYPYVGRETKCHFKRNAVGADDKGFVDLPEGDEEALKKAVATQG 267
Query: 265 PVSIGIAAYTTEFKSYKEGI-FNGVCGT-QLDHAVTIVGFGTTEDGANYWLIKNSWGDTW 322
P+SI I A F+ YK+G+ F+ C + +LDH V +VG+GT + +YWL+KNSWG TW
Sbjct: 268 PISIAIDAGHRSFQLYKKGVYFDEECSSEELDHGVLLVGYGTDPEAGDYWLVKNSWGPTW 327
Query: 323 GDAGYMKILRDE-GLCGIGTQSSYPL 347
G+ GY++I R+ CG+ T++SYPL
Sbjct: 328 GEKGYIRIARNRNNHCGVATKASYPL 353
>gi|129614|sp|P00784.1|PAPA1_CARPA RecName: Full=Papain; AltName: Full=Papaya proteinase I; Short=PPI;
AltName: Allergen=Car p 1; Flags: Precursor
gi|167391|gb|AAB02650.1| papain precursor [Carica papaya]
gi|387885|gb|AAA72774.1| papain [synthetic construct]
gi|225437|prf||1303270A papain
Length = 345
Score = 249 bits (636), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 135/315 (42%), Positives = 194/315 (61%), Gaps = 15/315 (4%)
Query: 38 THEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFS 97
T + ++++ E WM +H + YK+ EK RF+IFK+NL+YI++ NK+ N +Y LG N F+
Sbjct: 39 TSTERLIQLFESWMLKHNKIYKNIDEKIYRFEIFKDNLKYIDETNKK-NNSYWLGLNVFA 97
Query: 98 DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQEC 157
D++NDEF+ YTG + ++ +T S + N ++P +DWR K AVTP+K+Q C
Sbjct: 98 DMSNDEFKEKYTG--SIAGNYTTTELSYEEVLNDGDVNIPEYVDWRQKGAVTPVKNQGSC 155
Query: 158 GCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGI 217
G CWAFSAV +EGI KI NL + SEQ+L+DC + GC GG A + + Q GI
Sbjct: 156 GSCWAFSAVVTIEGIIKIRTGNLNEYSEQELLDCDRR-SYGCNGGYPWSALQLVAQ-YGI 213
Query: 218 ATEDEYPYQAVQGTCSAAQKAA-AAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTE 276
+ YPY+ VQ C + +K AAK +V +E ALL +++ QPVS+ + A +
Sbjct: 214 HYRNTYPYEGVQRYCRSREKGPYAAKTDGVRQVQPYNEGALLYSIANQPVSVVLEAAGKD 273
Query: 277 FKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILR---- 332
F+ Y+ GIF G CG ++DHAV VG+ G NY LIKNSWG WG+ GY++I R
Sbjct: 274 FQLYRGGIFVGPCGNKVDHAVAAVGY-----GPNYILIKNSWGTGWGENGYIRIKRGTGN 328
Query: 333 DEGLCGIGTQSSYPL 347
G+CG+ T S YP+
Sbjct: 329 SYGVCGLYTSSFYPV 343
>gi|164472556|gb|ABY58967.1| cathepsin L [Toxoplasma gondii]
Length = 421
Score = 249 bits (636), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 125/301 (41%), Positives = 174/301 (57%), Gaps = 6/301 (1%)
Query: 52 AQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGY 111
A + +SY E EK+ R+ IFK NL YI N++G +Y L N F DL+ DEFR Y G+
Sbjct: 121 AMYAKSYATEEEKQRRYAIFKNNLVYIHTHNQQGY-SYSLKMNHFGDLSRDEFRRKYLGF 179
Query: 112 KMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEG 171
K + N+ +++P +DWR + VTP+KDQ++CG CWAFS A+EG
Sbjct: 180 KKSRNLKSHHLGVATELLNVLPSELPAGVDWRSRGCVTPVKDQRDCGSCWAFSTTGALEG 239
Query: 172 ITKISGANLIQLSEQQLVDCS-TNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQG 230
L+ LSEQ+L+DCS GN C GG M AF+Y++ + GI +ED YPY A
Sbjct: 240 AHCAKTGKLVSLSEQELMDCSRAEGNQSCSGGEMNDAFQYVLDSGGICSEDAYPYLARDE 299
Query: 231 TCSAAQKAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCG 290
C A KI +++VP E A+ A++ PVSI I A F+ Y EG+F+ CG
Sbjct: 300 ECRAQSCEKVVKILGFKDVPRRSEAAMKAALAKSPVSIAIEADQMPFQFYHEGVFDASCG 359
Query: 291 TQLDHAVTIVGFGT-TEDGANYWLIKNSWGDTWGDAGYMKILR---DEGLCGIGTQSSYP 346
T LDH V +VG+GT E ++W++KNSWG WG GYM + +EG CG+ +S+P
Sbjct: 360 TDLDHGVLLVGYGTDKESKKDFWIMKNSWGTGWGRDGYMYMAMHKGEEGQCGLLLDASFP 419
Query: 347 L 347
+
Sbjct: 420 V 420
>gi|330805275|ref|XP_003290610.1| hypothetical protein DICPUDRAFT_98747 [Dictyostelium purpureum]
gi|325079249|gb|EGC32858.1| hypothetical protein DICPUDRAFT_98747 [Dictyostelium purpureum]
Length = 334
Score = 249 bits (636), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 135/338 (39%), Positives = 194/338 (57%), Gaps = 17/338 (5%)
Query: 16 IPMFIIIILLV----SCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIF 71
+ +F+I+ L++ CA+ + S T++ S + WM +H ++Y E +++ F
Sbjct: 3 LAVFLIVSLVILSINVCAATNLFSAQTYQTSFL----GWMKKHNKAYHHH-EFNDKYQTF 57
Query: 72 KENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNL 131
K+N+++I N + + T LG NRF+DLTN+E++ Y G M + N
Sbjct: 58 KDNMDFIHNWNSKESDTV-LGLNRFADLTNEEYKKTYLG--MSINVNLRANQVPMNGLNF 114
Query: 132 SMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDC 191
P+S+DWR AV +KDQ CG CWAF+ AVEG +I N++ SEQ LVDC
Sbjct: 115 ERFTGPSSIDWRQNGAVAYVKDQGHCGSCWAFATTGAVEGAHQIKTGNMVTFSEQHLVDC 174
Query: 192 STN-GNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVP 250
S GNNGC GG M AF+YII N GIATE+ YPY A Q C IS Y++VP
Sbjct: 175 SGRYGNNGCDGGLMTSAFKYIIDNDGIATEEAYPYTATQNRCVYNTTMLGTAISGYKDVP 234
Query: 251 SGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFN-GVCGT-QLDHAVTIVGFGTTEDG 308
G E AL A+S QPV++ I A F+ YK G++ C + +L+H V VG+GT E G
Sbjct: 235 RGSESALTAAISKQPVAVAIDASPITFQLYKSGVYQEATCSSYRLNHGVLAVGYGTLE-G 293
Query: 309 ANYWLIKNSWGDTWGDAGYMKILRD-EGLCGIGTQSSY 345
+Y+++KNSW +TWG+ GY+ + R+ CGI T +SY
Sbjct: 294 KDYYIVKNSWAETWGNQGYILMARNANNHCGIATMASY 331
>gi|401397136|ref|XP_003879989.1| cathepsin L, related [Neospora caninum Liverpool]
gi|325114397|emb|CBZ49954.1| cathepsin L, related [Neospora caninum Liverpool]
Length = 415
Score = 249 bits (635), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 125/286 (43%), Positives = 172/286 (60%), Gaps = 4/286 (1%)
Query: 52 AQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGY 111
A +G+SY E E + R+ IFK NL YI N++G +Y L N F DL+ +EFR Y GY
Sbjct: 124 ATYGKSYATEEETQKRYAIFKNNLAYIHTHNQQG-YSYSLKMNHFGDLSREEFRRKYLGY 182
Query: 112 KMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEG 171
+ + +S +DVP+++DWR+K VTP+KDQ++CG CWAFSA A+EG
Sbjct: 183 NKSRNLKSNNLGVATELLKVSPSDVPSAVDWREKGCVTPVKDQRDCGSCWAFSATGALEG 242
Query: 172 ITKISGANLIQLSEQQLVDCS-TNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQG 230
L+ LSEQ+LVDCS GN GC GG M AF+Y++ + G+ +E+ YPY A G
Sbjct: 243 AHCAKTGELLSLSEQELVDCSLAEGNQGCSGGEMNDAFQYVVDSGGLCSEEGYPYLARDG 302
Query: 231 TCSAAQKAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCG 290
C A K IS +++VP E A+ A++ PVSI I A F+ Y EG+F+ CG
Sbjct: 303 ECKRACKKVVT-ISGFKDVPRKSETAMKAALAHSPVSIAIEADQLPFQFYHEGVFDASCG 361
Query: 291 TQLDHAVTIVGFGT-TEDGANYWLIKNSWGDTWGDAGYMKILRDEG 335
T LDH V +VG+GT E ++W++KNSWG WG GYM + +G
Sbjct: 362 TDLDHGVLLVGYGTDKETKKDFWIMKNSWGSGWGRDGYMYMAMHKG 407
>gi|119433808|gb|ABL74967.1| cysteine protease [Acanthamoeba castellanii]
Length = 330
Score = 248 bits (634), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 131/321 (40%), Positives = 186/321 (57%), Gaps = 9/321 (2%)
Query: 32 VVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKL 91
V S+ + + + WM H +SY +E E R+ +++EN +I++ N++ N +Y L
Sbjct: 15 VASTLAYKHDPLTGVFADWMRTHTKSYSNE-EFVFRWNVWRENYNFIQEENRK-NNSYYL 72
Query: 92 GTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPI 151
N+F DLTN EF +Y G +H + +P + DWR K AVT +
Sbjct: 73 TMNKFGDLTNAEFNKVYKGLAFDYSAH--ILKAKAATPAAPAPGLPANFDWRQKGAVTHV 130
Query: 152 KDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCS-TNGNNGCGGGTMEKAFEY 210
K+Q +CG CW+FS + EG + L+ LSEQ L+DCS + GNNGC GG M+ AFEY
Sbjct: 131 KNQGQCGSCWSFSTTGSTEGANFLKRGTLVSLSEQNLIDCSGSYGNNGCNGGLMDYAFEY 190
Query: 211 IIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGI 270
II N+GI TE YPY+ Q C + +++Y +V SGDE ALL AV+++P S+ I
Sbjct: 191 IINNKGIDTEASYPYETAQYNCRYNPANSGGSLTSYTDVSSGDENALLNAVAIEPTSVAI 250
Query: 271 AAYTTEFKSYKEGIF--NGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYM 328
A F+ Y G++ + TQLDH V VG+G TE+G +YWL+KNSWG WG GY+
Sbjct: 251 DASHNSFQFYSGGVYYESSCSSTQLDHGVLAVGWG-TENGQDYWLVKNSWGADWGLQGYI 309
Query: 329 KILRD-EGLCGIGTQSSYPLA 348
K+ R+ CGI T +SYP A
Sbjct: 310 KMARNRHNNCGIATAASYPTA 330
>gi|1709574|sp|P10056.2|PAPA3_CARPA RecName: Full=Caricain; AltName: Full=Papaya peptidase A; AltName:
Full=Papaya proteinase III; Short=PPIII; AltName:
Full=Papaya proteinase omega; Flags: Precursor
gi|18098|emb|CAA46862.1| proteinase omega [Carica papaya]
Length = 348
Score = 248 bits (634), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 137/314 (43%), Positives = 189/314 (60%), Gaps = 12/314 (3%)
Query: 38 THEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFS 97
T + ++++ WM H + Y++ EK RF+IFK+NL YI++ NK+ N +Y LG N F+
Sbjct: 39 TSTERLIQLFNSWMLNHNKFYENVDEKLYRFEIFKDNLNYIDETNKK-NNSYWLGLNEFA 97
Query: 98 DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQEC 157
DL+NDEF Y G + + +S ++ N ++P ++DWR K AVTP++ Q C
Sbjct: 98 DLSNDEFNEKYVGSLIDATIEQSYDE---EFINEDTVNLPENVDWRKKGAVTPVRHQGSC 154
Query: 158 GCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGI 217
G CWAFSAVA VEGI KI L++LSEQ+LVDC ++GC GG A EY+ +N GI
Sbjct: 155 GSCWAFSAVATVEGINKIRTGKLVELSEQELVDCERR-SHGCKGGYPPYALEYVAKN-GI 212
Query: 218 ATEDEYPYQAVQGTCSAAQKAAA-AKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTE 276
+YPY+A QGTC A Q K S V +E LL A++ QPVS+ + +
Sbjct: 213 HLRSKYPYKAKQGTCRAKQVGGPIVKTSGVGRVQPNNEGNLLNAIAKQPVSVVVESKGRP 272
Query: 277 FKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILR---- 332
F+ YK GIF G CGT++DHAVT VG+G + LIKNSWG WG+ GY++I R
Sbjct: 273 FQLYKGGIFEGPCGTKVDHAVTAVGYGKSGGKGYI-LIKNSWGTAWGEKGYIRIKRAPGN 331
Query: 333 DEGLCGIGTQSSYP 346
G+CG+ S YP
Sbjct: 332 SPGVCGLYKSSYYP 345
>gi|357627452|gb|EHJ77132.1| cathepsin L-like protease [Danaus plexippus]
Length = 341
Score = 248 bits (634), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 131/338 (38%), Positives = 195/338 (57%), Gaps = 13/338 (3%)
Query: 23 ILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKAN 82
ILLV CA + + V E + +H + Y E E++ R KI+ EN + K N
Sbjct: 3 ILLVLCAVVAAGTAVSFFDLVREEWNTFKLEHKKQYDSETEEKFRMKIYAENKHKVAKHN 62
Query: 83 K---EGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD---- 135
+ +G +Y+L TN++SD+ + EF G+ ++ + + +
Sbjct: 63 QRYQKGLVSYRLKTNKYSDMLHHEFVNTMNGFNKTVKHNKGLYAKGNDIRGATFVSPANV 122
Query: 136 -VPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTN 194
P ++DWR AVTP+KDQ +CG CW+FS A+EG L+ LSEQ L+DCS+
Sbjct: 123 AAPPTVDWRQHGAVTPVKDQGKCGSCWSFSTTGALEGQHFRKSGFLVSLSEQNLIDCSSA 182
Query: 195 -GNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGD 253
GNNGC GG M+ AF+YI N GI TE YPY+AV C K + A+ + ++P+GD
Sbjct: 183 YGNNGCNGGLMDNAFKYIKDNDGIDTEKTYPYEAVDDKCRYNPKNSGAEDVGFVDIPAGD 242
Query: 254 EQALLKAV-SMQPVSIGIAAYTTEFKSYKEGI-FNGVCGTQ-LDHAVTIVGFGTTEDGAN 310
E L+ A+ ++ PVS+ I A F+ Y +G+ ++ C ++ LDH V +VG+GT EDG +
Sbjct: 243 EHKLMLALATVGPVSVAIDASQESFQLYSDGVYYDENCSSENLDHGVLVVGYGTDEDGGD 302
Query: 311 YWLIKNSWGDTWGDAGYMKILRD-EGLCGIGTQSSYPL 347
YWL+KNSWG +WGD GY+K+ R+ + CGI + +SYPL
Sbjct: 303 YWLVKNSWGPSWGDEGYIKMARNRDNHCGIASSASYPL 340
>gi|330803820|ref|XP_003289900.1| hypothetical protein DICPUDRAFT_80649 [Dictyostelium purpureum]
gi|325080011|gb|EGC33585.1| hypothetical protein DICPUDRAFT_80649 [Dictyostelium purpureum]
Length = 328
Score = 248 bits (633), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 137/342 (40%), Positives = 202/342 (59%), Gaps = 26/342 (7%)
Query: 16 IPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENL 75
I + L+V+C S ++R ++ + WM +H +SY ++ E R+ IF++N+
Sbjct: 4 ILALVFCFLIVNCIS---AARVFSQKQYQTAFQNWMVKHQKSYTND-EFGSRYTIFQDNM 59
Query: 76 EYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNL--SM 133
+++ K N++G+ T LG N +DLTN E++ +Y G T +T K NL +
Sbjct: 60 DFVTKWNQKGSDTI-LGLNSMADLTNQEYQRIYLG-----------TKTTVKKPNLIIGV 107
Query: 134 TDV---PTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVD 190
TDV P S+DWR AVT +K+Q +CG C++FS +VEGI +I+ L+ LSEQQ++D
Sbjct: 108 TDVSKAPASVDWRANGAVTAVKNQGQCGGCYSFSTTGSVEGIHEITSKQLVSLSEQQILD 167
Query: 191 CS-TNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEV 249
CS + GNNGC GG M +FEYII G+ TE YPY+ V G C + A I+ Y+ V
Sbjct: 168 CSGSEGNNGCDGGLMTNSFEYIIAVGGLDTEASYPYEGVVGKCKFNKANIGATITGYKNV 227
Query: 250 PSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIF--NGVCGTQLDHAVTIVGFGTTED 307
SG E L AV+ QPVS+ I A F+ Y G++ TQLDH V VG+G ++
Sbjct: 228 KSGSESDLQTAVAAQPVSVAIDASQNSFQLYSSGVYYEPACSSTQLDHGVLAVGYG-SQS 286
Query: 308 GANYWLIKNSWGDTWGDAGYMKILRDE-GLCGIGTQSSYPLA 348
G +YW++KNSWG WG+ G++ + R++ CGI T +SYP A
Sbjct: 287 GQDYWIVKNSWGADWGEKGFILMARNKHNNCGIATMASYPTA 328
>gi|345320664|ref|XP_001521690.2| PREDICTED: cathepsin L1-like [Ornithorhynchus anatinus]
Length = 388
Score = 248 bits (633), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 141/342 (41%), Positives = 204/342 (59%), Gaps = 17/342 (4%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEY 77
M +++ LL C VS+ + + + E W H +SY + E+ R +++ENL+
Sbjct: 51 MKLLVCLLSLCWGLAVSA-PLGDSELDKHWELWKNWHQKSYH-KAEEGWRRMVWEENLKV 108
Query: 78 IEKANKE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT 134
IE N E G TY+LG N+F DLTN+EF+ + + S +R S+ + ++
Sbjct: 109 IELHNLEQSLGLHTYQLGMNQFGDLTNEEFQQMLISERHFSEGNRINGSA---FLEVNYV 165
Query: 135 DVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCS-T 193
VPTS+DWRD VTP+K+Q CG CWAFS A+EG L+ LSEQ LVDCS
Sbjct: 166 QVPTSVDWRDHGYVTPVKNQGHCGSCWAFSTTGALEGQLFRKSGRLVSLSEQNLVDCSWQ 225
Query: 194 NGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQ-GTCSAAQKAAAAKISNYEEVPSG 252
GN GC GG ++ AF+YI++N+GI +ED YPY A C+ + A A+++ + ++P
Sbjct: 226 QGNQGCNGGIVDFAFQYILENRGIDSEDCYPYTAKDTAQCAFKPECATARVTGFVDIPPH 285
Query: 253 DEQALLKAV-SMQPVSIGIAAYTTEFKSYKEGIF-NGVCGTQ-LDHAVTIVGF---GTTE 306
E+AL+KAV ++ PVS+ I A+ T F+ Y+ GIF C ++ L+HAV +VG+ G E
Sbjct: 286 SEEALMKAVATVGPVSVAIDAHPTSFRFYQSGIFYEPKCSSERLNHAVLVVGYGYEGEDE 345
Query: 307 DGANYWLIKNSWGDTWGDAGYMKILRDEG-LCGIGTQSSYPL 347
G YW++KNSWG WGD GY + +D G CGI T +SYPL
Sbjct: 346 AGKKYWIVKNSWGKQWGDHGYFYLSKDRGNHCGIATTASYPL 387
>gi|356552228|ref|XP_003544471.1| PREDICTED: LOW QUALITY PROTEIN: cysteine proteinase RD21a-like
[Glycine max]
Length = 351
Score = 248 bits (633), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 135/344 (39%), Positives = 199/344 (57%), Gaps = 21/344 (6%)
Query: 18 MFIIIILLVSCASQ-----VVSSRSTH--------EQSVVEMHEKWMAQHGRSYKDELEK 64
M I+++ +V S ++S + H + V+ M E+W+ +H + Y EK
Sbjct: 3 MAIVLLFMVFAVSSALDMSIISHDNAHADRATRRTDDEVMSMFEEWLVKHDKVYNALGEK 62
Query: 65 EMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSS 124
E RF+IFK NL +I++ N NRTYKLG N F+DLTN E+RA+Y P T
Sbjct: 63 EKRFQIFKNNLRFIDERNSL-NRTYKLGLNVFADLTNAEYRAMYLRTWDDGPRLDLDTPP 121
Query: 125 TFKYQNLSMTDVPTSLDWRDKKAVTPIKDQ-QECGCCWAFSAVAAVEGITKISGANLIQL 183
Y +P S+DWR + AVTP+K+Q C CWAF+AV AVE + KI +LI L
Sbjct: 122 RNHYVPRVGDTIPKSVDWRKEGAVTPVKNQGATCNSCWAFTAVGAVESLVKIKTGDLISL 181
Query: 184 SEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKI 243
SEQ++VDC+T+ + GCGGG ++ + YI +N GI+ E +YPY+ +G C + +K A I
Sbjct: 182 SEQEVVDCTTSSSRGCGGGDIQHGYIYIRKN-GISLEKDYPYRGDEGKCDSNKKNAIVTI 240
Query: 244 SNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFG 303
+ VP+ E+AL +A+ Y +F +G+F G CGT+L+HA+ +VG+G
Sbjct: 241 DGHGWVPTQLEEALNRALF---CYCAYFLYVDKF-FLCQGVFKGKCGTELNHALLLVGYG 296
Query: 304 TTEDGANYWLIKNSWGDTWGDAGYMKILRDEGLCGIGTQSSYPL 347
T +DG +YW+ KNS+ D WG+ GY++I R C G YP+
Sbjct: 297 TEKDG-DYWIAKNSYSDKWGENGYIRIQRKLSTCKFGNGGYYPI 339
>gi|410923307|ref|XP_003975123.1| PREDICTED: cathepsin L1-like [Takifugu rubripes]
Length = 336
Score = 248 bits (632), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 143/342 (41%), Positives = 199/342 (58%), Gaps = 19/342 (5%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEY 77
MF +++L + C + +S+ S Q + E W H + Y E E+ R ++++NL+
Sbjct: 1 MFPVVVLAL-CVTAALSAPSLDPQ-LDEHWNLWKDWHSKKYH-EKEEGWRRMVWEKNLKK 57
Query: 78 IEKANKE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT 134
IE N E G TY LG N F D+T++EFR + GYK+ S R S F N
Sbjct: 58 IELHNLEHSMGKHTYSLGMNHFGDMTHEEFRQIMNGYKLKS--QRKLRGSLFMEPNF--L 113
Query: 135 DVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCS-T 193
+ P S+DWRDK VTP+KDQ +CG CWAFS A+EG L+ LSEQ LVDCS
Sbjct: 114 EAPRSVDWRDKGYVTPVKDQGQCGSCWAFSTTGAMEGQHFRKTGTLVSLSEQNLVDCSRP 173
Query: 194 NGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAV-QGTCSAAQKAAAAKISNYEEVPSG 252
GN GC GG M++AF+YI N G+ +E+ YPY +G C +A + + +VPSG
Sbjct: 174 EGNEGCNGGLMDQAFQYIKDNGGLDSEESYPYLGTDEGPCHYDPSYNSANDTGFVDVPSG 233
Query: 253 DEQALLKAV-SMQPVSIGIAAYTTEFKSYKEGI-FNGVCGT-QLDHAVTIVGF---GTTE 306
E+AL+KAV S+ PVS+ I A F+ Y GI ++ C + +LDH V +VG+ G
Sbjct: 234 SERALMKAVASVGPVSVAIDAGHESFQFYHSGIYYDKECSSEELDHGVLVVGYGFEGKDV 293
Query: 307 DGANYWLIKNSWGDTWGDAGYMKILRD-EGLCGIGTQSSYPL 347
DG YW++KNSW + WGD GY+ + +D + CGI T +SYPL
Sbjct: 294 DGKKYWIVKNSWSENWGDKGYIYMAKDKKNHCGIATAASYPL 335
>gi|164420679|ref|NP_001037464.2| fibroinase precursor [Bombyx mori]
gi|40556818|gb|AAR87763.1| fibroinase precursor [Bombyx mori]
Length = 341
Score = 248 bits (632), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 135/342 (39%), Positives = 197/342 (57%), Gaps = 20/342 (5%)
Query: 23 ILLVSCASQVVSSRSTHEQSVVEMHEKWMA---QHGRSYKDELEKEMRFKIFKENLEYIE 79
++L+ CA VS+ Q + E+W A QH +Y+ E+E R KI+ E+ I
Sbjct: 4 LVLLLCAVAAVSAV----QFFDLVKEEWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIA 59
Query: 80 KANKE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRST-----TSSTFKYQNL 131
K N++ G +YKLG N++ D+ + EF G+ + +++ + K+ +
Sbjct: 60 KHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISP 119
Query: 132 SMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDC 191
+ +P +DWR AVT IKDQ +CG CW+FS A+EG L+ LSEQ L+DC
Sbjct: 120 ANVKLPEQVDWRKHGAVTDIKDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDC 179
Query: 192 STN-GNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVP 250
S GNNGC GG M+ AF+YI N GI TE YPY+ V C K A+ + ++P
Sbjct: 180 SEQYGNNGCNGGLMDNAFKYIKDNGGIDTEQTYPYEGVDDKCRYNPKNTGAEDVGFVDIP 239
Query: 251 SGDEQALLKAV-SMQPVSIGIAAYTTEFKSYKEGIFN--GVCGTQLDHAVTIVGFGTTED 307
GDEQ L++AV ++ PVS+ I A T F+ Y G++N T LDH V +VG+GT E
Sbjct: 240 EGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQ 299
Query: 308 GANYWLIKNSWGDTWGDAGYMKILRDE-GLCGIGTQSSYPLA 348
G +YWL+KNSWG +WG+ GY+K++R++ CGI + +SYPL
Sbjct: 300 GVDYWLVKNSWGRSWGELGYIKMIRNKNNRCGIASSASYPLV 341
>gi|74211558|dbj|BAE26509.1| unnamed protein product [Mus musculus]
Length = 338
Score = 248 bits (632), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 140/342 (40%), Positives = 198/342 (57%), Gaps = 19/342 (5%)
Query: 16 IPMF---IIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFK 72
IP F + LL + VVS+ H+ S+ + E+W +H ++Y E + R +++
Sbjct: 3 IPGFGSMTPVFLLATLCLGVVSAAPAHDPSLDAVWEEWKTKHRKTYNMNEEAQKR-AVWE 61
Query: 73 ENLEYIEKANKE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQ 129
N++ I N++ G + L N F DLTN EFR L TG++ S H+ T +Q
Sbjct: 62 NNMKMIGLHNEDYLKGKHGFNLEMNAFGDLTNTEFRELMTGFQ--SMGHKEMTI----FQ 115
Query: 130 NLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLV 189
+ DVP S+DWRD VTP+KDQ CG CWAFSAV ++EG L+ LSEQ L+
Sbjct: 116 EPLLGDVPKSVDWRDHGYVTPVKDQGHCGSCWAFSAVGSLEGQIFRKTGKLVPLSEQNLM 175
Query: 190 DCS-TNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEE 248
DCS + GN GC GG ME AF+Y+ +N+G+ T + Y Y+A G C K +A I+ + +
Sbjct: 176 DCSWSYGNVGCNGGLMELAFQYVKENRGLDTRESYAYEAWDGPCRYDPKYSAVNITGFVK 235
Query: 249 VPSGDEQALLKAV-SMQPVSIGIAAYTTEFKSYKEGIF--NGVCGTQLDHAVTIVGFGTT 305
VP E AL+ AV S+ PVS+GI + F+ Y+ G + T LDHAV +VG+G
Sbjct: 236 VPL-SEDALMNAVASVGPVSVGIDTHHHSFRFYRGGTYYEPDCSSTNLDHAVLVVGYGEE 294
Query: 306 EDGANYWLIKNSWGDTWGDAGYMKILRD-EGLCGIGTQSSYP 346
DG YWL+KNSWG+ WG GY+K+ +D + CGI T + YP
Sbjct: 295 SDGRKYWLVKNSWGEDWGMDGYIKMAKDRDNNCGIATYAIYP 336
>gi|354502595|ref|XP_003513369.1| PREDICTED: cathepsin L1-like [Cricetulus griseus]
Length = 330
Score = 248 bits (632), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 136/339 (40%), Positives = 199/339 (58%), Gaps = 20/339 (5%)
Query: 16 IPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENL 75
IP+F + L + VV + TH+ S+ + ++W +HG++Y + E + R +++ N
Sbjct: 2 IPIFFLATLCLG----VVPAAPTHDPSLDDEWQEWKTRHGKTYSMDEEGQKR-AVWENNR 56
Query: 76 EYIEKANKE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
+ IE N++ G + L N F DLTN EFR L TG++ T +Q
Sbjct: 57 KMIELHNEDYTKGKHGFHLEMNAFGDLTNIEFRQLMTGFQ------SMGTKEMNVFQEPL 110
Query: 133 MTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCS 192
+ DVP S+DWR+ VTP+KDQ +C CWAFSAV ++EG LI LSEQ LVDCS
Sbjct: 111 LGDVPKSVDWRNLSYVTPVKDQGQCSSCWAFSAVGSLEGQIFRKTGQLISLSEQNLVDCS 170
Query: 193 -TNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPS 251
+ GN GC GG ME AF Y+ +N+G+ T YPY+A G C K +AA ++++ ++P
Sbjct: 171 WSYGNIGCFGGLMEYAFRYVKENRGLDTRVSYPYEARNGPCRYDPKNSAANVTDFVKIPI 230
Query: 252 GDEQALLKAV-SMQPVSIGIAAYTTEFKSYKEGIF--NGVCGTQLDHAVTIVGFGTTEDG 308
E AL+KAV ++ P+S+G+ ++ F+ YK G++ + LDHAV +VG+G DG
Sbjct: 231 -SEDALMKAVATVGPISVGVDSHHHSFRFYKGGMYYEPHCSSSNLDHAVLVVGYGEESDG 289
Query: 309 ANYWLIKNSWGDTWGDAGYMKILRDE-GLCGIGTQSSYP 346
YW++KNSWG WG GY+K+ RD CGI T + YP
Sbjct: 290 NKYWMVKNSWGQGWGMNGYIKMARDRNNNCGIATYAIYP 328
>gi|269954686|ref|NP_954599.2| uncharacterized protein LOC218275 precursor [Mus musculus]
Length = 330
Score = 247 bits (631), Expect = 5e-63, Method: Compositional matrix adjust.
Identities = 137/333 (41%), Positives = 195/333 (58%), Gaps = 16/333 (4%)
Query: 22 IILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKA 81
+ LL + VVS+ H+ S+ + E+W +H ++Y E + R +++ N++ I
Sbjct: 4 VFLLATLCLGVVSAAPAHDPSLDAVWEEWKTKHRKTYNMNEEAQKR-AVWENNMKMIGLH 62
Query: 82 NKE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPT 138
N++ G + L N F DLTN EFR L TG++ S H+ T +Q + DVP
Sbjct: 63 NEDYLKGKHGFNLEMNAFGDLTNTEFRELMTGFQ--SMGHKEMTI----FQEPLLGDVPK 116
Query: 139 SLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCS-TNGNN 197
S+DWRD VTP+KDQ CG CWAFSAV ++EG L+ LSEQ L+DCS + GN
Sbjct: 117 SVDWRDHGYVTPVKDQGHCGSCWAFSAVGSLEGQIFRKTGKLVPLSEQNLMDCSWSYGNV 176
Query: 198 GCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQAL 257
GC GG ME AF+Y+ +N+G+ T + Y Y+A G C K +A I+ + +VP E AL
Sbjct: 177 GCNGGLMELAFQYVKENRGLDTRESYAYEAWDGPCRYDPKYSAVNITGFVKVPL-SEDAL 235
Query: 258 LKAV-SMQPVSIGIAAYTTEFKSYKEGIF--NGVCGTQLDHAVTIVGFGTTEDGANYWLI 314
+ AV S+ PVS+GI + F+ Y+ G + T LDHAV +VG+G DG YWL+
Sbjct: 236 MNAVASVGPVSVGIDTHHHSFRFYRGGTYYEPDCSSTNLDHAVLVVGYGEESDGRKYWLV 295
Query: 315 KNSWGDTWGDAGYMKILRD-EGLCGIGTQSSYP 346
KNSWG+ WG GY+K+ +D + CGI T + YP
Sbjct: 296 KNSWGEDWGMDGYIKMAKDRDNNCGIATYAIYP 328
>gi|30388235|gb|AAH51665.1| CDNA sequence BC051665 [Mus musculus]
Length = 330
Score = 247 bits (631), Expect = 5e-63, Method: Compositional matrix adjust.
Identities = 137/333 (41%), Positives = 195/333 (58%), Gaps = 16/333 (4%)
Query: 22 IILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKA 81
+ LL + VVS+ H+ S+ + E+W +H ++Y E + R +++ N++ I
Sbjct: 4 VFLLATLCLGVVSAAPAHDPSLDAVWEEWKTKHRKTYSMNEEAQKR-AVWENNMKMIGLH 62
Query: 82 NKE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPT 138
N++ G + L N F DLTN EFR L TG++ S H+ T +Q + DVP
Sbjct: 63 NEDYLKGKHGFNLEMNAFGDLTNTEFRELMTGFQ--SMGHKEMTI----FQEPLLGDVPK 116
Query: 139 SLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCS-TNGNN 197
S+DWRD VTP+KDQ CG CWAFSAV ++EG L+ LSEQ L+DCS + GN
Sbjct: 117 SVDWRDHGYVTPVKDQGHCGSCWAFSAVGSLEGQIFRKTGKLVPLSEQNLMDCSWSYGNV 176
Query: 198 GCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQAL 257
GC GG ME AF+Y+ +N+G+ T + Y Y+A G C K +A I+ + +VP E AL
Sbjct: 177 GCNGGLMELAFQYVKENRGLDTRESYAYEAWDGPCRYDPKYSAVNITGFVKVPL-SEDAL 235
Query: 258 LKAV-SMQPVSIGIAAYTTEFKSYKEGIF--NGVCGTQLDHAVTIVGFGTTEDGANYWLI 314
+ AV S+ PVS+GI + F+ Y+ G + T LDHAV +VG+G DG YWL+
Sbjct: 236 MNAVASVGPVSVGIDTHHHSFRFYRGGTYYEPDCSSTNLDHAVLVVGYGEESDGRKYWLV 295
Query: 315 KNSWGDTWGDAGYMKILRD-EGLCGIGTQSSYP 346
KNSWG+ WG GY+K+ +D + CGI T + YP
Sbjct: 296 KNSWGEDWGMDGYIKMAKDRDNNCGIATYAIYP 328
>gi|357216861|gb|AET71138.1| cysteine peptidase isoform b [Sphenophorus levis]
Length = 324
Score = 247 bits (631), Expect = 5e-63, Method: Compositional matrix adjust.
Identities = 137/336 (40%), Positives = 197/336 (58%), Gaps = 23/336 (6%)
Query: 19 FIII-ILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEY 77
FI+ +L+V+ ++ ++ H QS + +HG++YK++ E+ RF IF+ENL
Sbjct: 4 FILASLLVVAVSATLLKEDGAHFQS-------FKLKHGKTYKNQAEETKRFAIFRENLRK 56
Query: 78 IEKAN---KEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT 134
IE N K+G +Y G N+F+D+T EF+A+ PS +T + +Q
Sbjct: 57 IEAHNAEYKQGIHSYTQGINKFADMTRAEFKAMLATQVKTKPSIVATKT----FQLADGV 112
Query: 135 DVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTN 194
VP S+DWR + VTPIKDQ +CG CWAF+ V + EG +S L + SEQQLVDC+T+
Sbjct: 113 SVPESIDWRSRNVVTPIKDQAQCGSCWAFAVVGSTEGAYALSTGKLTRFSEQQLVDCTTD 172
Query: 195 GNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDE 254
N GC GG ++ F Y IQ G+ E +YPY G CS K+S+Y VP+ +E
Sbjct: 173 LNYGCDGGYLDDTFPY-IQTNGLELESDYPYTGYDGYCSYESSKVVTKVSSYVSVPA-NE 230
Query: 255 QALLKAV-SMQPVSIGIAAYTTEFKSYKEGIFNG-VCGTQ-LDHAVTIVGFGTTEDGANY 311
QALL+AV + PV+I I A +F Y GI + C + LDH V VG+ +E+G +Y
Sbjct: 231 QALLEAVGTAGPVAIAINADDLQF--YFSGIIDDKYCDPEYLDHGVLAVGY-DSENGRDY 287
Query: 312 WLIKNSWGDTWGDAGYMKILRDEGLCGIGTQSSYPL 347
WLIKNSWG WG++GY + LR + +CG+ + YPL
Sbjct: 288 WLIKNSWGADWGESGYFRFLRGQNICGVKEDAVYPL 323
>gi|217323618|gb|ACK38176.1| midgut cysteine peptidase, partial [Sphenophorus levis]
Length = 324
Score = 247 bits (631), Expect = 5e-63, Method: Compositional matrix adjust.
Identities = 136/336 (40%), Positives = 198/336 (58%), Gaps = 23/336 (6%)
Query: 19 FIII-ILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEY 77
FI+ +L+V+ ++ ++ H QS + +HG++YK++ E+ RF IF+ENL
Sbjct: 4 FILASLLVVAVSATLLKEDGVHFQS-------FKLKHGKTYKNQAEETKRFAIFRENLRK 56
Query: 78 IEKAN---KEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT 134
IE N K+G +Y G N+F+D+T EF+A+ PS +T + +Q
Sbjct: 57 IEAHNAEYKQGIHSYTQGINKFADMTRAEFKAMLATQVKTKPSIVATKT----FQLADGV 112
Query: 135 DVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTN 194
VP S+DWR + VTPIKDQ +CG CW+F+ V + EG +S L + SEQQLVDC+T+
Sbjct: 113 SVPESIDWRSRNVVTPIKDQAQCGSCWSFAVVGSTEGAYALSTGKLTRFSEQQLVDCTTD 172
Query: 195 GNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDE 254
N GC GG ++ F Y IQ G+ E +YPY G+CS K+S+Y VP+ +E
Sbjct: 173 LNYGCDGGYLDDTFPY-IQTNGLELESDYPYTGYDGSCSYDSSKVVTKVSSYVSVPA-NE 230
Query: 255 QALLKAV-SMQPVSIGIAAYTTEFKSYKEGIFNG-VCGTQ-LDHAVTIVGFGTTEDGANY 311
QALL+AV + PV+I I A +F Y GI + C + LDH V VG+ +E+G +Y
Sbjct: 231 QALLEAVGTAGPVAIAINADDLQF--YFSGIIDDKYCDPEWLDHGVLAVGY-NSENGLDY 287
Query: 312 WLIKNSWGDTWGDAGYMKILRDEGLCGIGTQSSYPL 347
WLIKNSWG WG++GY + LR + +CG+ + YPL
Sbjct: 288 WLIKNSWGADWGESGYFRFLRGQNICGVKEDAVYPL 323
>gi|148709355|gb|EDL41301.1| cDNA sequence BC051665 [Mus musculus]
Length = 349
Score = 247 bits (631), Expect = 5e-63, Method: Compositional matrix adjust.
Identities = 140/342 (40%), Positives = 198/342 (57%), Gaps = 19/342 (5%)
Query: 16 IPMF---IIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFK 72
IP F + LL + VVS+ H+ S+ + E+W +H ++Y E + R +++
Sbjct: 14 IPGFGSMTPVFLLATLCLGVVSAAPAHDPSLDAVWEEWKTKHRKTYNMNEEAQKR-AVWE 72
Query: 73 ENLEYIEKANKE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQ 129
N++ I N++ G + L N F DLTN EFR L TG++ S H+ T +Q
Sbjct: 73 NNMKMIGLHNEDYLKGKHGFNLEMNAFGDLTNTEFRELMTGFQ--SMGHKEMTI----FQ 126
Query: 130 NLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLV 189
+ DVP S+DWRD VTP+KDQ CG CWAFSAV ++EG L+ LSEQ L+
Sbjct: 127 EPLLGDVPKSVDWRDHGYVTPVKDQGHCGSCWAFSAVGSLEGQIFRKTGKLVPLSEQNLM 186
Query: 190 DCS-TNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEE 248
DCS + GN GC GG ME AF+Y+ +N+G+ T + Y Y+A G C K +A I+ + +
Sbjct: 187 DCSWSYGNVGCNGGLMELAFQYVKENRGLDTRESYAYEAWDGPCRYDPKYSAVNITGFVK 246
Query: 249 VPSGDEQALLKAV-SMQPVSIGIAAYTTEFKSYKEGIF--NGVCGTQLDHAVTIVGFGTT 305
VP E AL+ AV S+ PVS+GI + F+ Y+ G + T LDHAV +VG+G
Sbjct: 247 VPL-SEDALMNAVASVGPVSVGIDTHHHSFRFYRGGTYYEPDCSSTNLDHAVLVVGYGEE 305
Query: 306 EDGANYWLIKNSWGDTWGDAGYMKILRD-EGLCGIGTQSSYP 346
DG YWL+KNSWG+ WG GY+K+ +D + CGI T + YP
Sbjct: 306 SDGRKYWLVKNSWGEDWGMDGYIKMAKDRDNNCGIATYAIYP 347
>gi|116563690|gb|ABJ99858.1| cathepsin L [Hippoglossus hippoglossus]
Length = 336
Score = 247 bits (631), Expect = 5e-63, Method: Compositional matrix adjust.
Identities = 142/341 (41%), Positives = 204/341 (59%), Gaps = 18/341 (5%)
Query: 19 FIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYI 78
+ +++L +C S V+S+ Q + E + W + H + Y E E+ R ++++NL+ I
Sbjct: 1 MLPLLVLTACLSSVLSAPVLDAQ-LNEHWDLWKSWHSKKYH-EKEEGWRRMVWEKNLQKI 58
Query: 79 EKANKE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD 135
E N E G +++LG N F D+T++EFR + GYK+ + R T S F N MT
Sbjct: 59 ELHNLEHSMGTHSFRLGMNHFGDMTHEEFRQIMNGYKLKT--QRKFTGSLFMEPNF-MT- 114
Query: 136 VPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCS-TN 194
P+++DWR+K VTP+KDQ +CG CWAFS A+EG L+ LSEQ LVDCS
Sbjct: 115 APSAVDWREKGYVTPVKDQGQCGSCWAFSTTGALEGQQFRKTGKLVSLSEQNLVDCSRPE 174
Query: 195 GNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQG-TCSAAQKAAAAKISNYEEVPSGD 253
GN GCGGG M++AF+Y+ NQG+ +ED YPY C +A + + +VPSG
Sbjct: 175 GNEGCGGGLMDQAFQYVTDNQGLDSEDSYPYTGTDDQPCHYDPLYNSANDTGFVDVPSGK 234
Query: 254 EQALLKAV-SMQPVSIGIAAYTTEFKSYKEGI-FNGVCGT-QLDHAVTIVGFG-TTED-- 307
E AL+KAV S+ PVS+ I A F+ Y+ GI + C + +LDH V VG+G ED
Sbjct: 235 EHALMKAVASVGPVSVAIDAGHESFQFYQSGIYYEKECSSEELDHGVLAVGYGFEGEDKM 294
Query: 308 GANYWLIKNSWGDTWGDAGYMKILRD-EGLCGIGTQSSYPL 347
G +W++KNSWG+ WGD GY+ + +D + CGI T +SYPL
Sbjct: 295 GKKFWIVKNSWGEKWGDKGYIYMAKDRKNHCGIATAASYPL 335
>gi|162463334|ref|NP_001104878.1| maize insect resistance2 precursor [Zea mays]
gi|2425064|gb|AAB88262.1| cysteine proteinase Mir2 [Zea mays]
Length = 493
Score = 247 bits (631), Expect = 6e-63, Method: Compositional matrix adjust.
Identities = 129/290 (44%), Positives = 181/290 (62%), Gaps = 11/290 (3%)
Query: 67 RFKIFKENLEYIEKANKE---GNRTYKLGTNRFSDLTNDEFRA-LYTGYKMPSPSHRSTT 122
R ++F++NL YI+ N E G ++LG RF+DLT +E+RA L G + + +
Sbjct: 92 RLEVFRDNLRYIDAHNAEADAGLHGFRLGLTRFADLTLEEYRARLLLGSRGRNGTAVGVV 151
Query: 123 SSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQ 182
+Y L+ +P ++DWR++ AV +KDQ +CG CWAFSAVAAVEGI KI +LI
Sbjct: 152 GRR-RYLPLAGEQLPDAVDWRERGAVAEVKDQGQCGGCWAFSAVAAVEGINKIVTGSLIS 210
Query: 183 LSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQK-AAAA 241
LSEQ+L+DC + GC GG M+ AF ++I+N GI TE +YP+ GTC K
Sbjct: 211 LSEQELIDCDKFQDQGCDGGLMDNAFVFMIKNGGIDTEADYPFTGHDGTCDLKLKNTRVV 270
Query: 242 KISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVG 301
I ++E VP E+AL KAV+ QPVS I A F+ Y GIF+G CGT LDH VT+VG
Sbjct: 271 SIDSFERVPINYERALQKAVAHQPVSASIEASRRAFQLYSSGIFDGRCGTYLDHGVTVVG 330
Query: 302 FGTTEDGANYWLIKNSWGDTWGDAGYMKILRDEGL----CGIGTQSSYPL 347
+G +E G +YW++KNSWG WG+AGY+++ R+ + GI + YP+
Sbjct: 331 YG-SEGGKDYWIVKNSWGTQWGEAGYVRMARNVRVRPPSAGIAMEPLYPV 379
>gi|34559455|gb|AAQ75437.1| cathepsin L-like protease [Helicoverpa armigera]
gi|338855117|gb|AEJ31938.1| cathepsin L-like protease [Helicoverpa assulta]
Length = 341
Score = 247 bits (631), Expect = 6e-63, Method: Compositional matrix adjust.
Identities = 139/346 (40%), Positives = 195/346 (56%), Gaps = 27/346 (7%)
Query: 20 IIIILLVSCASQVVSSRSTHEQSVVEMHEKWMA---QHGRSYKDELEKEMRFKIFKENLE 76
I ++L V A+ VS + E+W A +H + Y E+E + R KI+ EN
Sbjct: 4 IAVLLCVVGAACAVSLLDL-------VREEWSAFKLEHSKRYDSEVEDKFRMKIYLENKH 56
Query: 77 YIEKANK---EGNRTYKLGTNRFSDLTNDEFRALYTGY----KMPSPSH---RSTTSSTF 126
I K N+ +G +YKL N+++D+ + EF + G+ K P H R + +TF
Sbjct: 57 RIAKHNQRFEQGAVSYKLRPNKYADMLSHEFVHVMNGFNKTLKHPKAVHGKGRESRPATF 116
Query: 127 KYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQ 186
+ P +DWR K AVT +KDQ +CG CWAFS A+EG L+ LSEQ
Sbjct: 117 IAP--AHVTYPDHVDWRKKGAVTEVKDQGKCGSCWAFSTTGALEGQHFRKTGYLVSLSEQ 174
Query: 187 QLVDCSTN-GNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISN 245
L+DCS GNNGC GG M+ AF+YI N GI TE YPY+ V C K + A
Sbjct: 175 NLIDCSAAYGNNGCNGGLMDNAFKYIKDNGGIDTEKAYPYEGVDDKCRYNAKNSGADDVG 234
Query: 246 YEEVPSGDEQALLKAV-SMQPVSIGIAAYTTEFKSYKEGIF--NGVCGTQLDHAVTIVGF 302
+ ++P GDE+ L++AV ++ PVS+ I A F+ Y +G++ T LDH V +VG+
Sbjct: 235 FVDIPQGDEEKLMQAVATVGPVSVAIDASQESFQFYSDGVYYDENCSSTDLDHGVMVVGY 294
Query: 303 GTTEDGANYWLIKNSWGDTWGDAGYMKILRDE-GLCGIGTQSSYPL 347
GT E G +YWL+KNSWG TWGD GY+K+ R++ CGI + +SYPL
Sbjct: 295 GTDEQGGDYWLVKNSWGRTWGDLGYIKMARNKNNHCGIASSASYPL 340
>gi|1709576|sp|P05994.3|PAPA4_CARPA RecName: Full=Papaya proteinase 4; AltName: Full=Glycyl
endopeptidase; AltName: Full=Papaya peptidase B;
AltName: Full=Papaya proteinase IV; Short=PPIV; Flags:
Precursor
gi|953176|emb|CAA54974.1| proteinase IV [Carica papaya]
Length = 348
Score = 247 bits (630), Expect = 6e-63, Method: Compositional matrix adjust.
Identities = 138/315 (43%), Positives = 190/315 (60%), Gaps = 12/315 (3%)
Query: 38 THEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFS 97
T + ++++ WM +H ++YK+ EK RF+IFK+NL+YI++ NK N Y LG N FS
Sbjct: 39 TSTERLIQLFNSWMLKHNKNYKNVDEKLYRFEIFKDNLKYIDERNKMIN-GYWLGLNEFS 97
Query: 98 DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQEC 157
DL+NDEF+ Y G P + ++ N + D+P S+DWR K AVTP+K Q C
Sbjct: 98 DLSNDEFKEKYVG---SLPEDYTNQPYDEEFVNEDIVDLPESVDWRAKGAVTPVKHQGYC 154
Query: 158 GCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGI 217
CWAFS VA VEGI KI NL++LSEQ+LVDC + GC G + +Y+ QN GI
Sbjct: 155 ESCWAFSTVATVEGINKIKTGNLVELSEQELVDCDKQ-SYGCNRGYQSTSLQYVAQN-GI 212
Query: 218 ATEDEYPYQAVQGTCSAAQKAAA-AKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTE 276
+YPY A Q TC A Q K + V S +E +LL A++ QPVS+ + + +
Sbjct: 213 HLRAKYPYIAKQQTCRANQVGGPKVKTNGVGRVQSNNEGSLLNAIAHQPVSVVVESAGRD 272
Query: 277 FKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILR---- 332
F++YK GIF G CGT++DHAVT VG+G + LIKNSWG WG+ GY++I R
Sbjct: 273 FQNYKGGIFEGSCGTKVDHAVTAVGYGKSGGKGYI-LIKNSWGPGWGENGYIRIRRASGN 331
Query: 333 DEGLCGIGTQSSYPL 347
G+CG+ S YP+
Sbjct: 332 SPGVCGVYRSSYYPI 346
>gi|238481789|gb|ACR43934.1| cathepsin L-like cysteine proteinase [Haliotis diversicolor
supertexta]
Length = 347
Score = 247 bits (630), Expect = 6e-63, Method: Compositional matrix adjust.
Identities = 140/303 (46%), Positives = 188/303 (62%), Gaps = 12/303 (3%)
Query: 53 QHGRSYKDELEKEMRFKIFKENLEYIEKANKE---GNRTYKLGTNRFSDLTNDEFRALYT 109
QHGR Y+ E+E RF+IFK+NL+YIE+ NK+ G ++Y LG N+F+D+ N+EFR +Y
Sbjct: 48 QHGRLYEKHEEEEERFEIFKQNLQYIEEHNKKFSLGQKSYYLGINQFADMKNEEFR-MYN 106
Query: 110 GYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAV 169
G + R S + P +DWR K VT +K+Q +CG CW+FS ++
Sbjct: 107 GLRRDYNYSREVQCSN--HLTPEYLVAPDEVDWRKKGYVTAVKNQGQCGSCWSFSTTGSL 164
Query: 170 EGITKISGANLIQLSEQQLVDCSTN-GNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAV 228
EG L+ LSEQQLVDCS GN GC GG M++AFEYII N GI TE+EYPY A
Sbjct: 165 EGQHFHKSGKLVSLSEQQLVDCSGKFGNEGCNGGLMDQAFEYIITNGGIETEEEYPYDAR 224
Query: 229 QGTCSAAQKAAAAKISNYEEVPSGDEQALLKAVS-MQPVSIGIAAYTTEFKSYKEGIFN- 286
Q C + AA S +V SGDE L +V+ + PVSI I A F+ Y G+++
Sbjct: 225 QERCHFKKSEVAATASGCVDVKSGDETDLKNSVAEVGPVSIAIDASHQSFQLYSGGVYDE 284
Query: 287 -GVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD-EGLCGIGTQSS 344
T+LDH V +VG+G T+DG +YWL+KNSWG TWG GY+K+ R+ + CG+ TQ+S
Sbjct: 285 PKCSSTELDHGVLVVGYG-TDDGQDYWLVKNSWGTTWGLEGYVKMSRNQDNQCGVATQAS 343
Query: 345 YPL 347
YPL
Sbjct: 344 YPL 346
>gi|38147395|gb|AAR12010.1| cathepsin L-like proteinase [Triatoma infestans]
Length = 328
Score = 247 bits (630), Expect = 7e-63, Method: Compositional matrix adjust.
Identities = 134/311 (43%), Positives = 188/311 (60%), Gaps = 18/311 (5%)
Query: 48 EKWMA---QHGRSYKDELEKEMRFKIFKENLEYIEKANKE---GNRTYKLGTNRFSDLTN 101
E+W+A Q G+SYK+ E+ R ++KEN I++ NK G +YKL N F DL
Sbjct: 24 EEWLAFKAQFGKSYKNSFEELFRMNVYKENQRKIDEHNKRYENGEVSYKLKMNHFGDLMQ 83
Query: 102 DEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCW 161
EF+AL K+ + + + F+ + +P +DWR K AVTP+KD +CG CW
Sbjct: 84 HEFKALN---KLKRSAKQQNSGEVFR---ATGGKLPAKVDWRQKGAVTPVKDPGQCGSCW 137
Query: 162 AFSAVAAVEGITKISGANLIQLSEQQLVDCSTN-GNNGCGGGTMEKAFEYIIQNQGIATE 220
AFS+ ++ G + L+ LSEQQLVDCS N GN+GC GG M +AF+YI N GI TE
Sbjct: 138 AFSSTGSLGGQLFLKNKKLVSLSEQQLVDCSGNYGNDGCDGGIMVQAFQYIKGNGGIDTE 197
Query: 221 DEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAVS-MQPVSIGIAAYTTEFKS 279
YPY+A C K+ A Y ++ GDE AL +AV+ + P+S+ I A F+
Sbjct: 198 GSYPYEAEDDKCRYKTKSVAGTDKGYVDIAQGDENALKEAVAEIGPISVAIDAGNLSFQF 257
Query: 280 YKEGIFN-GVC-GTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRDE-GL 336
Y EGI++ C T+LDH V +VG+G TE+G +YWL+KNSWG +WG+ GY+KI R+
Sbjct: 258 YSEGIYDEPFCSNTELDHGVLVVGYG-TENGQDYWLVKNSWGPSWGENGYIKIARNHNNH 316
Query: 337 CGIGTQSSYPL 347
CGI + +SYP+
Sbjct: 317 CGIASMASYPI 327
>gi|41688064|dbj|BAD08618.1| cathepsin L preproprotein [Cyprinus carpio]
Length = 337
Score = 247 bits (630), Expect = 7e-63, Method: Compositional matrix adjust.
Identities = 142/343 (41%), Positives = 197/343 (57%), Gaps = 20/343 (5%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQSVVEMH-EKWMAQHGRSYKDELEKEMRFKIFKENLE 76
M + + C S V ++ + +Q ++ H E+W HG+ Y E E+ R ++++NL+
Sbjct: 1 MRVFLAAFALCLSAVFAAPTLDKQ--LDNHWEQWKNWHGKKYH-EKEEGWRRMVWEKNLQ 57
Query: 77 YIEKANKE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
IE N E G TY+LG NRF D+T++EFR + GYK R S F N
Sbjct: 58 KIELHNLEHSMGTHTYRLGMNRFGDMTHEEFRQVMNGYK--HKKERRFRGSLFMEPNF-- 113
Query: 134 TDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCS- 192
+VP SLDWR+K VTP+KDQ ECG CWAFS A+EG L+ LSEQ LVDCS
Sbjct: 114 LEVPNSLDWREKGYVTPVKDQGECGSCWAFSTTGAMEGQMFRKTGKLVSLSEQNLVDCSR 173
Query: 193 TNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQG-TCSAAQKAAAAKISNYEEVPS 251
GN GC GG M++AF+YI G+ +E+ YPY C K +AA + + ++PS
Sbjct: 174 PEGNEGCNGGLMDQAFQYIKDQNGLDSEESYPYVGTDDQPCHYDPKYSAANDTGFVDIPS 233
Query: 252 GDEQALLKAV-SMQPVSIGIAAYTTEFKSYKEGI-FNGVCGT-QLDHAVTIVGF---GTT 305
G E AL+KA+ ++ PVS+ I A F+ Y+ GI + C + +LDH V VG+ G
Sbjct: 234 GKEHALMKAIAAVGPVSVAIDAGHESFQFYQSGIYYEKECSSEELDHGVLAVGYGFEGED 293
Query: 306 EDGANYWLIKNSWGDTWGDAGYMKILRD-EGLCGIGTQSSYPL 347
DG YW++KNSW + WGD GY+ + +D CGI T +SYPL
Sbjct: 294 VDGKKYWIVKNSWSENWGDKGYVYMAKDRHNHCGIATAASYPL 336
>gi|33348836|gb|AAQ16118.1| cathepsin L-like cysteine proteinase B [Rhipicephalus
haemaphysaloides haemaphysaloides]
Length = 335
Score = 246 bits (629), Expect = 8e-63, Method: Compositional matrix adjust.
Identities = 138/334 (41%), Positives = 196/334 (58%), Gaps = 15/334 (4%)
Query: 25 LVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKE 84
+V C V ++ TH++ V + A HG+ Y + E+ R KI+ EN I + N++
Sbjct: 5 IVLCCLFVTAAAITHQELVGAEWSAFKALHGKDYASDTEEYYRLKIYMENRLKIARHNEK 64
Query: 85 GNRT---YKLGTNRFSDLTNDEFRALYTGYKM---PSPSHRSTTSSTFKYQNLSMTDVPT 138
++ YKL N F DL + EF + G+K SP S +++L + P
Sbjct: 65 YAKSQVSYKLAMNEFGDLLHHEFVSTRNGFKRNYRDSPREGSFFVEPEGFEDLQL---PK 121
Query: 139 SLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTN-GNN 197
++DWR K AVTP+K+Q +CG CWAFS ++EG L+ LSEQ LVDCS + GNN
Sbjct: 122 TVDWRKKGAVTPVKNQGQCGSCWAFSTTGSLEGPHFRKTRKLVSLSEQNLVDCSRSFGNN 181
Query: 198 GCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQAL 257
GC GG M+ AF+YI N+GI TE YPY A G C + A + + ++P GDE L
Sbjct: 182 GCEGGLMDNAFKYIKSNKGIDTEWSYPYNATDGVCHFNRSDVGATDTGFVDIPEGDENKL 241
Query: 258 LKAV-SMQPVSIGIAAYTTEFKSYKEGIFNGV-CGT-QLDHAVTIVGFGTTEDGANYWLI 314
KAV ++ PVS+ I A F+ Y EG+++ C + QLDH V +VG+G T+DG +YWL+
Sbjct: 242 KKAVAAVGPVSVAIDASHESFQFYSEGVYDEPECSSEQLDHGVLVVGYG-TKDGQDYWLV 300
Query: 315 KNSWGDTWGDAGYMKILRD-EGLCGIGTQSSYPL 347
KNSWG TWGD GY+ + R+ + CGI + +SYPL
Sbjct: 301 KNSWGTTWGDEGYIYMTRNKDNQCGIASSASYPL 334
>gi|320164780|gb|EFW41679.1| cathepsin L2 [Capsaspora owczarzaki ATCC 30864]
Length = 334
Score = 246 bits (629), Expect = 9e-63, Method: Compositional matrix adjust.
Identities = 130/305 (42%), Positives = 178/305 (58%), Gaps = 7/305 (2%)
Query: 48 EKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRAL 107
E W G+SY D +E+ R +++ N ++ N G +Y LG N F+DLT++EF+
Sbjct: 31 EAWKRTFGKSYSDAVEEINRRAVWEANKMLVDAHNGAGIHSYTLGMNIFADLTHEEFKRF 90
Query: 108 YTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVA 167
Y G K+ RS SSTF ++ +P S+DWR VTP+KDQ +CG CW+FS
Sbjct: 91 YLGTKVDLNRPRSNFSSTF-IPTANVGALPDSVDWRTAGIVTPVKDQGQCGSCWSFSTTG 149
Query: 168 AVEGITKISGANLIQLSEQQLVDCS-TNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQ 226
+VEG L+ LSEQ LVDCS GN GC GG M+ AF+YII N+GI TE YPY
Sbjct: 150 SVEGQHARKTGQLVSLSEQNLVDCSKAQGNQGCNGGLMDDAFQYIITNKGIDTEASYPYT 209
Query: 227 AVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAV-SMQPVSIGIAAYTTEFKSYKEGIF 285
A GTC A +S+++++ G E L AV ++ PVS+ I A F+ Y G++
Sbjct: 210 AKDGTCKFNAANVGATLSSFQDITRGSESDLQNAVATVGPVSVAIDASKNSFQLYTSGVY 269
Query: 286 N--GVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD-EGLCGIGTQ 342
N T LDH V G+GT+ +G YWL+KNSWG +WG AGY+ + R+ CGI T
Sbjct: 270 NEKKCSSTSLDHGVLAAGYGTS-NGTPYWLVKNSWGSSWGQAGYIWMSRNANNQCGIATS 328
Query: 343 SSYPL 347
+SYP+
Sbjct: 329 ASYPI 333
>gi|18858809|ref|NP_571273.1| cathepsin L, 1 b precursor [Danio rerio]
gi|1752664|emb|CAA69623.1| cathepsin L [Danio rerio]
Length = 336
Score = 246 bits (629), Expect = 9e-63, Method: Compositional matrix adjust.
Identities = 133/340 (39%), Positives = 199/340 (58%), Gaps = 17/340 (5%)
Query: 20 IIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIE 79
++ LLV+ V + + + + + W +QHG+SY +++E R I++ENL IE
Sbjct: 1 MMFALLVTLYISAVFAAPSIDIQLDDHWNSWKSQHGKSYHEDVEVGRRM-IWEENLRKIE 59
Query: 80 KANKE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDV 136
+ N E GN T+K+G N+F D+TN+EFR GY + TS + S
Sbjct: 60 QHNFEYSYGNHTFKMGMNQFGDMTNEEFRQAMNGYTHDP----NQTSQGPLFMEPSFFAA 115
Query: 137 PTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCS-TNG 195
P +DWR + VTP+KDQ++CG CW+FS+ A+EG LI +SEQ LVDCS G
Sbjct: 116 PQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQG 175
Query: 196 NNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQG-TCSAAQKAAAAKISNYEEVPSGDE 254
N GC GG M++AF+Y+ +N+G+ +E YPY A C + AKI+ + ++PSG+E
Sbjct: 176 NQGCNGGLMDQAFQYVKENKGLDSEQSYPYLARDDLPCRYDPRFNVAKITGFVDIPSGNE 235
Query: 255 QALLKAV-SMQPVSIGIAAYTTEFKSYKEGIF--NGVCGTQLDHAVTIVGF---GTTEDG 308
AL+ AV ++ PVS+ I A + Y+ GI+ ++LDHAV +VG+ G G
Sbjct: 236 LALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACSSSRLDHAVLVVGYGYQGADVAG 295
Query: 309 ANYWLIKNSWGDTWGDAGYMKILRDE-GLCGIGTQSSYPL 347
YW++KNSW D WGD GY+ + +D+ CG+ T++SYPL
Sbjct: 296 NRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGVATKASYPL 335
>gi|1483570|emb|CAA68066.1| cathepsin l [Litopenaeus vannamei]
Length = 328
Score = 246 bits (629), Expect = 9e-63, Method: Compositional matrix adjust.
Identities = 131/313 (41%), Positives = 184/313 (58%), Gaps = 16/313 (5%)
Query: 46 MHEKWM---AQHGRSYKDELEKEMRFKIFKENLEYIEKAN---KEGNRTYKLGTNRFSDL 99
+ ++W A+HGR Y E+ R +F++N ++I+ N + G T+ L N+F D+
Sbjct: 20 LRQQWRDFKAEHGRRYASVQEERYRLSVFEQNQQFIDDHNARFENGEVTFTLQMNQFGDM 79
Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGC 159
T++EF A G+ + PS R T + +P +DWR K AVTP+KDQ++CG
Sbjct: 80 TSEEFTATMNGF-LNVPSRRPTAI----LRADPDETLPKEVDWRTKGAVTPVKDQKQCGS 134
Query: 160 CWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTN-GNNGCGGGTMEKAFEYIIQNQGIA 218
CWAFS ++EG + L+ LSEQ LVDCS GN GC GG M++AF YI N+GI
Sbjct: 135 CWAFSTTGSLEGQHFLKDGKLVSLSEQNLVDCSDKFGNMGCMGGLMDQAFRYIKANKGID 194
Query: 219 TEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAV-SMQPVSIGIAAYTTEF 277
TED YPY+A G C A + Y +V G E AL KAV ++ P+S+ I A F
Sbjct: 195 TEDSYPYEAQDGKCRFDASNVGATDTGYVDVEHGSESALKKAVATIGPISVAIDASQPSF 254
Query: 278 KSYKEGIF--NGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD-E 334
+ Y +G++ G T LDH V VG+G TE G YWL+KNSW +WG+ GY+++ RD +
Sbjct: 255 QFYHDGVYYEEGCSSTMLDHGVLAVGYGETEKGEAYWLVKNSWNTSWGNKGYIQMSRDKK 314
Query: 335 GLCGIGTQSSYPL 347
CGI +Q+SYPL
Sbjct: 315 NNCGIASQASYPL 327
>gi|392884266|gb|AFM90965.1| cathepsin L [Callorhinchus milii]
Length = 338
Score = 246 bits (629), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 138/342 (40%), Positives = 201/342 (58%), Gaps = 17/342 (4%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEY 77
M + ++L C + +++ S + + E+W + HG+SY ++ E+ R +++E+L
Sbjct: 1 MRLPFVVLSLCLAGGLAAPSL-DPGLDTHWEQWKSWHGKSY-EQKEETWRRMVWEEHLRV 58
Query: 78 IEKANKE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT 134
IE N E G +++LG N F D+ N+EFR L GYK +H+ S F N
Sbjct: 59 IEIHNLEHSLGKHSFRLGMNHFGDMPNEEFRQLMNGYKYKQ-THKKLQGSHFLEPNF--L 115
Query: 135 DVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCS-T 193
+VP +DWRD+ VTP+KDQ +CG CWAFS A+EG L+ LSEQ LV+CS
Sbjct: 116 EVPKHVDWRDEGYVTPVKDQGQCGSCWAFSTTGALEGQHFRRTGQLVSLSEQNLVECSKP 175
Query: 194 NGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGT-CSAAQKAAAAKISNYEEVPSG 252
GN GC GG M++AF+Y+ N GI +ED YPY T C + AA + + ++PSG
Sbjct: 176 EGNEGCNGGLMDQAFQYVKDNGGIDSEDSYPYVGTDDTPCHYNPQYNAANDTGFVDIPSG 235
Query: 253 DEQALLKAV-SMQPVSIGIAAYTTEFKSYKEGI-FNGVC-GTQLDHAVTIVGFGTTE--- 306
E+AL+KA+ ++ PVS+ I A T F+ Y+ GI F C T LDH V +VG+G +
Sbjct: 236 KERALMKAIAAVGPVSVAIDAGHTSFQFYQSGIYFEAECSSTDLDHGVLVVGYGVEKRDT 295
Query: 307 DGANYWLIKNSWGDTWGDAGYMKILRD-EGLCGIGTQSSYPL 347
DG YW++KNSW + WG GY+ + +D + CGI T +SYPL
Sbjct: 296 DGKKYWIVKNSWSEKWGQNGYILMAKDKDNHCGIATAASYPL 337
>gi|157093563|gb|ABV22436.1| cysteine proteinase [Oxyrrhis marina]
Length = 329
Score = 246 bits (629), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 128/306 (41%), Positives = 186/306 (60%), Gaps = 12/306 (3%)
Query: 48 EKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRAL 107
E++ A+ G SY E E+ R +F +N++ I + N +G+ TY LG N+F+DLT +EF
Sbjct: 20 EEFKAKFGESYNGEEEEAERKGVFAQNVQLINEENSKGH-TYTLGVNQFADLTVEEFSKT 78
Query: 108 YTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVA 167
Y G+K P+ + ++ + +PTS+DW + AVTP+K+Q +CG CW+FS
Sbjct: 79 YMGFK--KPAQKYGDAAYLGRHVYNGEALPTSVDWSSQGAVTPVKNQGQCGSCWSFSTTG 136
Query: 168 AVEGITKISGANLIQLSEQQLVDCS-TNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQ 226
++EG +IS L+ LSEQQ VDC+ T GN GC GG M+ AF+Y N + TE YPY+
Sbjct: 137 SLEGANEISTGKLVSLSEQQFVDCAGTYGNQGCNGGLMDSAFKYAEAN-ALCTEQSYPYK 195
Query: 227 AVQGTCSAAQKA---AAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEG 283
G+C A+ + A +S Y++V S EQ ++ AV+ QPVSI I A + F+ Y G
Sbjct: 196 GTDGSCQASSCSTGLAKGSVSGYKDVSSDSEQDMMSAVAQQPVSIAIEADKSVFQLYSGG 255
Query: 284 IFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRDE---GLCGIG 340
+ G CG LDH V VG+GT G +YW +KNSWG TWG +GY+ + R + G CG+
Sbjct: 256 VLTGACGASLDHGVLAVGYGTL-SGTDYWKVKNSWGSTWGMSGYVLLQRGKGGSGECGLL 314
Query: 341 TQSSYP 346
++ SYP
Sbjct: 315 SEPSYP 320
>gi|156124998|gb|ABU50817.1| Ale o 1 allergen [Aleuroglyphus ovatus]
Length = 337
Score = 246 bits (628), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 129/308 (41%), Positives = 194/308 (62%), Gaps = 13/308 (4%)
Query: 48 EKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKE---GNRTYKLGTNRFSDLTNDEF 104
E++ + GR Y + R IF+ NL++I + N + G+ T+ + N F+DL+N+EF
Sbjct: 34 EQFKSTFGRVYPSPEIELHRKSIFRANLQFILRHNIDYFNGDSTFSVSVNNFTDLSNEEF 93
Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFS 164
RA + GY+ + + + + + + + +P ++DW K VTPIK+QQ+CG CWAFS
Sbjct: 94 RATFNGYRRLA----AVSLADSVHADNDVEALPATVDWTTKGVVTPIKNQQQCGSCWAFS 149
Query: 165 AVAAVEGITKISGANLIQLSEQQLVDCS-TNGNNGCGGGTMEKAFEYIIQNQGIATEDEY 223
AVA++EG + L+ LSEQ LVDCS G+ GC GG M+ AF+Y+IQN+GI TE Y
Sbjct: 150 AVASMEGQHALKTGKLVSLSEQNLVDCSAAEGDMGCSGGWMDYAFKYVIQNRGIDTEASY 209
Query: 224 PYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAV-SMQPVSIGIAAYTTEFKSYKE 282
PY+A+ +C + + A I ++ +V +GDE AL AV S+ P+S+ I A F+ Y
Sbjct: 210 PYKAIDESCEFKRNSVGATIHSFVDVKTGDESALQNAVASIGPISVAIDAAQPSFQFYSS 269
Query: 283 GIFNGV-CGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD-EGLCGI 339
G++N C T+ LDH VT VG+GT +GA YW +KNSWG +WG GY+ + R+ + CGI
Sbjct: 270 GVYNEPDCSTEILDHGVTAVGYGTL-NGAPYWKVKNSWGTSWGRKGYIFMSRNKQNQCGI 328
Query: 340 GTQSSYPL 347
T++SYP+
Sbjct: 329 ATKASYPV 336
>gi|357167707|ref|XP_003581294.1| PREDICTED: actinidain-like [Brachypodium distachyon]
Length = 358
Score = 246 bits (627), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 128/322 (39%), Positives = 185/322 (57%), Gaps = 22/322 (6%)
Query: 42 SVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTN 101
++ HE+WMA+ GRSY D EK R ++F N +++ N+ GNRTY LG N+FSDLT+
Sbjct: 37 TMASRHERWMARFGRSYTDAGEKARRQEVFGANARHVDAVNRAGNRTYTLGLNQFSDLTD 96
Query: 102 DEFRALYTGYK-------MPSPSHRSTTSST-FKYQNLSMTDVPTSLDWRDKKAVTPIKD 153
EF + GY + P +T Y D+P S+DWR K AVT IK+
Sbjct: 97 HEFLQQHLGYGRHHGQRGLLLPEEEVMPKATALGYGQ----DMPYSVDWRAKGAVTEIKN 152
Query: 154 QQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQ 213
Q+ CG CWAF+AVAA EG+ KI+ NLI +SEQQ++DC T + C G + A Y++
Sbjct: 153 QRSCGSCWAFAAVAATEGLVKIATGNLISMSEQQVLDC-TGDRSSCDSGYISDALRYVVT 211
Query: 214 NQGIATEDEYPYQAVQGTCSA---AQKAAAAKISN-YEEVPSGDEQALLKAVSMQPVSIG 269
+ G+ E Y Y +G C + A+ +AA + + +GDE AL + QPV++
Sbjct: 212 SGGLQREAAYAYTGQKGACGSRRFARPNSAASVGGVHMATLNGDEGALQGLAARQPVAVI 271
Query: 270 IAAYTTEFKSYKEGIFNG--VCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGY 327
+ A +F+ Y G++ G CG +L+HA+T+VG+GT YWL+KN WG WG+ GY
Sbjct: 272 VEASEPDFRHYSSGVYAGSASCGRELNHALTVVGYGTENGAGEYWLVKNQWGTWWGENGY 331
Query: 328 MKILRDEGL---CGIGTQSSYP 346
M++ R G CGI + + YP
Sbjct: 332 MRVARRNGAGANCGIASVAFYP 353
>gi|299507656|gb|ADJ21807.1| cathepsin L [Oplegnathus fasciatus]
Length = 336
Score = 246 bits (627), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 141/341 (41%), Positives = 197/341 (57%), Gaps = 18/341 (5%)
Query: 19 FIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYI 78
+ + +L C S +S+ S Q + E + W + H + Y E E+ R ++++NL+ I
Sbjct: 1 MLPVAVLAVCLSAALSAPSLDPQ-LDEHWDLWKSWHTKKYH-EKEEGWRRMVWEKNLKKI 58
Query: 79 EKANKE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD 135
E N E G TY+LG N F D+T++EFR + GYK S R S F N +
Sbjct: 59 ELHNLEHSMGEHTYRLGMNHFGDMTHEEFRQIMNGYK--RKSERKFKGSLFMEPNF--LE 114
Query: 136 VPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCS-TN 194
P S+DWRD VTP+KDQ +CG CWAFS A+EG L+ LSEQ LVDCS
Sbjct: 115 APRSVDWRDNGYVTPVKDQGQCGSCWAFSTTGAMEGQHFRKTGKLVSLSEQNLVDCSRPE 174
Query: 195 GNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQG-TCSAAQKAAAAKISNYEEVPSGD 253
GN GC GG M++AF+YI NQG+ +ED YPY C K +A + + ++PSG
Sbjct: 175 GNEGCNGGLMDQAFQYIKDNQGLDSEDSYPYLGTDDQPCHYDPKYNSANDTGFIDIPSGK 234
Query: 254 EQALLKAV-SMQPVSIGIAAYTTEFKSYKEGI-FNGVCGT-QLDHAVTIVGF---GTTED 307
E+AL+KAV ++ PVS+ I A F+ Y+ GI + C + +LDH V +VG+ G D
Sbjct: 235 ERALMKAVAAVGPVSVAIDAGHESFQFYQSGIYYEKECSSEELDHGVLVVGYGFEGEDVD 294
Query: 308 GANYWLIKNSWGDTWGDAGYMKILRD-EGLCGIGTQSSYPL 347
G YW++KNSW + WGD GY+ + +D + CGI T +SYPL
Sbjct: 295 GKKYWIVKNSWSEKWGDKGYIYMAKDRKNHCGIATAASYPL 335
>gi|163658591|gb|ABY28387.1| cathepsin L [Gnathostoma spinigerum]
Length = 398
Score = 246 bits (627), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 126/310 (40%), Positives = 189/310 (60%), Gaps = 12/310 (3%)
Query: 48 EKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNR---TYKLGTNRFSDLTNDEF 104
E + +HG+++ D + F +NLEYI++ N++ R T+++G N +DL DE+
Sbjct: 92 EDFKLEHGKAFDDVENEYDHIFAFTKNLEYIKQHNEKFQRGEVTFEMGVNHLTDLPFDEY 151
Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFS 164
+ L G++ + R STF + +P ++DWR+ VT +KDQ +CG CWAFS
Sbjct: 152 KKL-NGFRKNNDDSRPRNGSTFLRPHF--VQIPDTVDWRNSSYVTVVKDQGQCGSCWAFS 208
Query: 165 AVAAVEGITKISGANLIQLSEQQLVDCSTN-GNNGCGGGTMEKAFEYIIQNQGIATEDEY 223
A A+EG L+ LSEQ LVDCS GNNGC GG M+ AFEYI N GI TE+ Y
Sbjct: 209 ATGALEGQHMRKTHQLVSLSEQNLVDCSRKYGNNGCNGGLMDNAFEYIKDNHGIDTEESY 268
Query: 224 PYQAVQG-TCSAAQKAAAAKISNYEEVPSGDEQALLKAV-SMQPVSIGIAAYTTEFKSYK 281
PY+ V+G C +K A+ Y ++P GDE+AL AV ++ P+S+ I A F++Y+
Sbjct: 269 PYKGVEGKKCHFRRKFVGAEDYGYTDLPEGDEEALKVAVATIGPISVAIDAGHISFQNYR 328
Query: 282 EGIF--NGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRDE-GLCG 338
+GI+ N LDH V +VG+GT E+ +YW++KNSWG WG+ GY+++ R++ CG
Sbjct: 329 KGIYTENECSPEDLDHGVLVVGYGTDENAGDYWIVKNSWGTRWGEHGYIRMARNKRNQCG 388
Query: 339 IGTQSSYPLA 348
I +++SYP+
Sbjct: 389 IASKASYPIV 398
>gi|330803818|ref|XP_003289899.1| hypothetical protein DICPUDRAFT_154350 [Dictyostelium purpureum]
gi|325080010|gb|EGC33584.1| hypothetical protein DICPUDRAFT_154350 [Dictyostelium purpureum]
Length = 326
Score = 246 bits (627), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 133/338 (39%), Positives = 202/338 (59%), Gaps = 20/338 (5%)
Query: 16 IPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENL 75
+ I L+++C S ++R ++ + WM +H +SY ++ E R+ +F++N+
Sbjct: 4 VLALIFCFLIINCCS---AARIFSQKQYQTAFQNWMVKHQKSYTND-EFGSRYSVFQDNM 59
Query: 76 EYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNL-SMT 134
+ + K N++G+ T LG N +DLTN+EF+ LY G K + T+K + L ++
Sbjct: 60 DIVAKWNQKGSNTI-LGLNVMADLTNEEFKKLYLGTK---------ANVTYKKKTLVGVS 109
Query: 135 DVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCS-T 193
+P S+DWR AVT +K+Q +CG C+AFS +VEGI +I+ L+ LSEQQ++DCS +
Sbjct: 110 GLPASVDWRANGAVTAVKNQGQCGGCYAFSTTGSVEGIHEITSQQLVPLSEQQILDCSGS 169
Query: 194 NGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGD 253
GNNGC GG M +FEYII G+ TE YPY G C +K A I+ Y+ V SG
Sbjct: 170 EGNNGCDGGLMTNSFEYIIAVGGLDTEASYPYTGEVGKCKFNKKNIGATITGYKNVESGS 229
Query: 254 EQALLKAVSMQPVSIGIAAYTTEFKSYKEGIF--NGVCGTQLDHAVTIVGFGTTEDGANY 311
E L AV+ QPVS+ I A + F+ Y G++ TQLDH V VG+G ++ G +Y
Sbjct: 230 ESDLQTAVAAQPVSVAIDASQSSFQLYASGVYYEPECSSTQLDHGVLAVGYG-SQSGQDY 288
Query: 312 WLIKNSWGDTWGDAGYMKILRD-EGLCGIGTQSSYPLA 348
W++KNSWG WG+ G++ + R+ + CGI T +S+P A
Sbjct: 289 WIVKNSWGADWGENGFILMARNKDNNCGIATMASFPTA 326
>gi|417409876|gb|JAA51427.1| Putative cathepsin s, partial [Desmodus rotundus]
Length = 342
Score = 246 bits (627), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 134/341 (39%), Positives = 200/341 (58%), Gaps = 18/341 (5%)
Query: 15 TIPMFIIIILLVSCASQVVSSRSTHEQSVVEMH-EKWMAQHGRSYKDELEKEMRFKIFKE 73
+I M ++++L+ C+S + H+ ++ H + W +G+ YK++ E+ +R I+++
Sbjct: 9 SIIMKWLVLVLLGCSSAMAQ---LHKDPTLDRHWDLWKKTYGKQYKEKNEEGVRRLIWEK 65
Query: 74 NLEYIEKANKE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQN 130
NL+++ N E G +Y LG N D+T++E AL + ++PS R+ T + Q
Sbjct: 66 NLKFVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVTALMSSLRVPSQWQRNVTYKSNPNQK 125
Query: 131 LSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVD 190
L P S+DWRDK VT +K Q CG CWAFSAV A+E K+ L+ LS Q LVD
Sbjct: 126 L-----PDSVDWRDKGCVTDVKYQGSCGSCWAFSAVGALEAQVKLKTGKLVSLSAQNLVD 180
Query: 191 CSTN--GNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEE 248
CS N GC GG M +AF+YII N GI +E YPY+A+ G C K AA S Y E
Sbjct: 181 CSVGKYSNRGCNGGFMTEAFQYIIDNNGIESEASYPYKAMDGKCQYDSKYRAATCSRYTE 240
Query: 249 VPSGDEQALLKAVSMQ-PVSIGIAAYTTEFKSYKEGI-FNGVCGTQLDHAVTIVGFGTTE 306
+P E AL +AV+ + PVS+ I A F Y+ G+ ++ C ++H V +VG+G
Sbjct: 241 LPEDSEDALKEAVANKGPVSVAIDASHPSFFLYRSGVYYDPACTLHVNHGVLVVGYGNL- 299
Query: 307 DGANYWLIKNSWGDTWGDAGYMKILRDEG-LCGIGTQSSYP 346
+G +YWL+KNSWG +GD GY+++ R+ G CGI + +SYP
Sbjct: 300 NGKDYWLVKNSWGLHFGDQGYIRMARNSGNHCGIASYASYP 340
>gi|346469447|gb|AEO34568.1| hypothetical protein [Amblyomma maculatum]
Length = 333
Score = 245 bits (626), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 131/308 (42%), Positives = 189/308 (61%), Gaps = 11/308 (3%)
Query: 48 EKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKAN---KEGNRTYKLGTNRFSDLTNDEF 104
E + + H ++YK +E+ +RFKIF EN +I K N +G +YKLG N+F+DL EF
Sbjct: 28 EAFKSTHKKTYKSNVEELLRFKIFTENSLFIAKHNVKYAKGLVSYKLGINQFADLLPHEF 87
Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFS 164
+ GY+ + R +T NL+ + +P ++DWR K AVTP+KDQ +CG CWAFS
Sbjct: 88 VKMMNGYQGKRLAGRGST--YLPPANLNDSSLPKTVDWRKKGAVTPVKDQGQCGSCWAFS 145
Query: 165 AVAAVEGITKISGANLIQLSEQQLVDCSTN-GNNGCGGGTMEKAFEYIIQNQGIATEDEY 223
+ ++EG + L+ LSEQ LVDCS+ GN GC GG M+ +F YI N GI TED Y
Sbjct: 146 STGSLEGQHFLKTGKLVSLSEQNLVDCSSAYGNQGCNGGLMDNSFNYIKANGGIDTEDSY 205
Query: 224 PYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAV-SMQPVSIGIAAYTTEFKSYKE 282
PY+A G C ++ A + + ++ G E+ L KAV ++ PVS+ I A F+ Y E
Sbjct: 206 PYEAEDGDCRYKKEDVGATDTGFVDIKEGSEKDLQKAVATVGPVSVAIDASQQSFQLYSE 265
Query: 283 GIFNGV-CGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRDE-GLCGI 339
G+++ C ++ LDH V VG+G ++G YWL+KNSW +TWG GY+ + RD+ CGI
Sbjct: 266 GVYDEPNCSSESLDHGVLAVGYG-VKNGKKYWLVKNSWAETWGQDGYILMSRDKNNQCGI 324
Query: 340 GTQSSYPL 347
+ +SYPL
Sbjct: 325 ASSASYPL 332
>gi|728637|emb|CAA59441.1| cathepsin l [Litopenaeus vannamei]
Length = 326
Score = 245 bits (626), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 128/314 (40%), Positives = 189/314 (60%), Gaps = 14/314 (4%)
Query: 42 SVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKAN---KEGNRTYKLGTNRFSD 98
S+ + + + A+HGR Y E+ R +F++N ++I+ N + G T+ L N+F D
Sbjct: 18 SLRQQWQNFKAEHGRRYASVQEERYRLSVFEQNQQFIDDHNARFENGEVTFTLQMNQFGD 77
Query: 99 LTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECG 158
+T++E A G+ + +P+ R ++ K + ++ P +DWR K AVTP+KDQ++CG
Sbjct: 78 MTSEEIVATMNGF-LGAPTRRP--AAVLKADDETL---PEKVDWRTKGAVTPVKDQKQCG 131
Query: 159 CCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTN-GNNGCGGGTMEKAFEYIIQNQGI 217
CWAFS ++EG + L+ LSEQ LVDCS GN GC GG M++AF YI N+GI
Sbjct: 132 SCWAFSTTGSLEGQHFLKDGKLVSLSEQNLVDCSDKFGNMGCMGGLMDQAFRYIKANKGI 191
Query: 218 ATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAV-SMQPVSIGIAAYTTE 276
TED YPY+A G C A + Y +V G E AL KAV ++ P+S+GI A +
Sbjct: 192 DTEDSYPYEAQDGKCRFDASNVGATDTGYVDVEHGSESALKKAVATIGPISVGIDASQST 251
Query: 277 FKSYKEGIFNG--VCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRDE 334
F Y G+++ T LDH V VG+G+ E+G ++WL+KNSW +WGD GY+K+ R+
Sbjct: 252 FHFYHTGVYHDDHCSSTMLDHGVLAVGYGSDENGGDFWLVKNSWNTSWGDKGYIKMSRNR 311
Query: 335 -GLCGIGTQSSYPL 347
CGI +Q+SYPL
Sbjct: 312 NNNCGIASQASYPL 325
>gi|442754503|gb|JAA69411.1| Putative cathepsin l-like cysteine proteinase b [Ixodes ricinus]
Length = 335
Score = 245 bits (626), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 127/304 (41%), Positives = 184/304 (60%), Gaps = 9/304 (2%)
Query: 52 AQHGRSYKDELEKEMRFKIFKENLEYIEKANKE---GNRTYKLGTNRFSDLTNDEFRALY 108
A+HG+SY E E+ R KI+ EN I K N++ G Y + N F D+ + EF +
Sbjct: 32 AKHGKSYVSETEEVFRLKIYMENRHKIAKHNEKYARGEVPYSMAMNEFGDMLHHEFVSTR 91
Query: 109 TGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAA 168
G+K S+ + +N+ +P ++DWR K AVTP+K+Q +CG CWAFSA +
Sbjct: 92 NGFKRNYKDQPREGSTYLEPENIEDFSLPKTVDWRTKGAVTPVKNQGQCGSCWAFSATGS 151
Query: 169 VEGITKISGANLIQLSEQQLVDCSTN-GNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQA 227
+EG +++ LSEQ LVDCST+ GNNGC GG M+ AF+YI N+GI TE YPY
Sbjct: 152 LEGQHFRKSGSMVSLSEQNLVDCSTDFGNNGCEGGLMDNAFKYIRANKGIDTEKSYPYNG 211
Query: 228 VQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAV-SMQPVSIGIAAYTTEFKSYKEGIFN 286
GTC + A S + ++ G E L KAV ++ P+S+ I A F+ Y +G+++
Sbjct: 212 TDGTCHFKKSTVGATDSGFVDIKEGSETQLKKAVATVGPISVAIDASHESFQFYSDGVYD 271
Query: 287 GV-CGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD-EGLCGIGTQS 343
C ++ LDH V +VG+GT +G +YWL+KNSWG TWGD GY+++ R+ + CGI + +
Sbjct: 272 EPECDSESLDHGVLVVGYGTL-NGTDYWLVKNSWGTTWGDEGYIRMSRNKKNQCGIASSA 330
Query: 344 SYPL 347
SYPL
Sbjct: 331 SYPL 334
>gi|81294188|gb|AAI08032.1| Cathepsin L, 1 b [Danio rerio]
Length = 336
Score = 245 bits (625), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 133/340 (39%), Positives = 198/340 (58%), Gaps = 17/340 (5%)
Query: 20 IIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIE 79
++ LLV+ V + + + + + W +QHG+SY +++E R I++ENL IE
Sbjct: 1 MMFALLVTLYISAVFAAPSIDIQLDDHWNSWKSQHGKSYHEDVEVGRRM-IWEENLRKIE 59
Query: 80 KANKE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDV 136
+ N E GN T+K+G N+F D+TN+EFR GY + TS + S
Sbjct: 60 QHNFEYSYGNHTFKMGMNQFGDMTNEEFRQAMNGYTHDP----NQTSQGPLFMEPSFFAA 115
Query: 137 PTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCS-TNG 195
P +DWR + VTP+KDQ++CG CW+FS+ A+EG LI +SEQ LVDCS G
Sbjct: 116 PQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQG 175
Query: 196 NNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQG-TCSAAQKAAAAKISNYEEVPSGDE 254
N GC GG M+ AF+Y+ +N+G+ +E YPY A C + AKI+ + ++PSG+E
Sbjct: 176 NQGCNGGLMDLAFQYVKENKGLDSEQSYPYLARDDLPCRYDPRFNVAKITGFVDIPSGNE 235
Query: 255 QALLKAV-SMQPVSIGIAAYTTEFKSYKEGIF--NGVCGTQLDHAVTIVGF---GTTEDG 308
AL+ AV ++ PVS+ I A + Y+ GI+ ++LDHAV +VG+ G G
Sbjct: 236 LALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACSSSRLDHAVLVVGYGYQGADVAG 295
Query: 309 ANYWLIKNSWGDTWGDAGYMKILRDE-GLCGIGTQSSYPL 347
YW++KNSW D WGD GY+ + +D+ CG+ T++SYPL
Sbjct: 296 NRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGVATKASYPL 335
>gi|342305188|dbj|BAK55648.1| cathepsin L [Oplegnathus fasciatus]
Length = 336
Score = 245 bits (625), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 141/341 (41%), Positives = 197/341 (57%), Gaps = 18/341 (5%)
Query: 19 FIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYI 78
+ + +L C S +S+ S Q + E + W + H + Y E E+ R ++++NL+ I
Sbjct: 1 MLPVAVLAVCLSAALSAPSLDPQ-LDEHWDLWKSWHTKKYH-EKEEGWRRMVWEKNLKKI 58
Query: 79 EKANKE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD 135
E N E G TY+LG N F D+T++EFR + GYK S R S F N +
Sbjct: 59 ELHNLEHSMGEHTYRLGMNHFGDMTHEEFRQIMYGYK--RKSERKFKGSLFMEPNF--LE 114
Query: 136 VPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCS-TN 194
P S+DWRD VTP+KDQ +CG CWAFS A+EG L+ LSEQ LVDCS
Sbjct: 115 APRSVDWRDNGYVTPVKDQGQCGSCWAFSTTGAMEGQHFRKTGKLVSLSEQNLVDCSRPE 174
Query: 195 GNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQG-TCSAAQKAAAAKISNYEEVPSGD 253
GN GC GG M++AF+YI NQG+ +ED YPY C K +A + + ++PSG
Sbjct: 175 GNEGCNGGLMDQAFQYIKDNQGLDSEDSYPYLGTDDQPCHYDPKYNSANDTGFIDIPSGK 234
Query: 254 EQALLKAV-SMQPVSIGIAAYTTEFKSYKEGI-FNGVCGT-QLDHAVTIVGF---GTTED 307
E+AL+KAV ++ PVS+ I A F+ Y+ GI + C + +LDH V +VG+ G D
Sbjct: 235 ERALMKAVAAVGPVSVAIDAGHESFQFYQSGIYYEKECSSEELDHGVLVVGYGFEGEDVD 294
Query: 308 GANYWLIKNSWGDTWGDAGYMKILRD-EGLCGIGTQSSYPL 347
G YW++KNSW + WGD GY+ + +D + CGI T +SYPL
Sbjct: 295 GKKYWIVKNSWSEKWGDKGYIYMAKDRKNHCGIATAASYPL 335
>gi|413919735|gb|AFW59667.1| hypothetical protein ZEAMMB73_680472 [Zea mays]
Length = 344
Score = 245 bits (625), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 120/266 (45%), Positives = 163/266 (61%), Gaps = 7/266 (2%)
Query: 32 VVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKAN---KEGNRT 88
+VS E+ M+ +WMA HGR+Y E+E RF++F++NL Y++ N G +
Sbjct: 31 IVSYGERSEEEARRMYAEWMAAHGRTYNAVGEEERRFEVFRDNLRYVDAHNAAADAGVHS 90
Query: 89 YKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAV 148
++LG NRF+DLTNDE+RA Y G + R N D+P S+DWR K AV
Sbjct: 91 FRLGLNRFADLTNDEYRATYLGVRSRPQRERRLGDRYLAGDN---EDLPESVDWRAKGAV 147
Query: 149 TPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAF 208
+KDQ CG CWAFS +AAVEGI +I ++I LSEQ+LVDC T+ N GC GG M+ AF
Sbjct: 148 AEVKDQGSCGSCWAFSTIAAVEGINQIVTGDMISLSEQELVDCDTSYNQGCNGGLMDYAF 207
Query: 209 EYIIQNQGIATEDEYPYQAVQGTCSAAQK-AAAAKISNYEEVPSGDEQALLKAVSMQPVS 267
E+II N GI TE++YPY+ G C +K A I +YE+VP+ E++L KAV+ QP+S
Sbjct: 208 EFIINNGGIDTEEDYPYKGTDGRCDVNRKNAKVVTIDSYEDVPANSEKSLQKAVANQPIS 267
Query: 268 IGIAAYTTEFKSYKEGIFNGVCGTQL 293
+ I A F+ Y GIF G CG +
Sbjct: 268 VAIEAGGRAFQLYNSGIFTGTCGNSV 293
>gi|390347681|ref|XP_801784.2| PREDICTED: cathepsin L1-like isoform 2 [Strongylocentrotus
purpuratus]
Length = 336
Score = 245 bits (625), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 134/338 (39%), Positives = 204/338 (60%), Gaps = 15/338 (4%)
Query: 19 FIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYI 78
F++ I LV+CA+ + + + +W H +SY +++ + R +++EN++ I
Sbjct: 6 FLVAIGLVACATAAFVKPTNPDLDSRWL--EWKIAHTKSYTNDMHELERRLVWEENVKMI 63
Query: 79 EKANKEGN---RTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD 135
N + + + ++LG N + D+ E R+ GYK S + STF S
Sbjct: 64 NMHNLDHSLHKKGFRLGMNEYGDMRLHEVRSTMNGYK--SSNVTKVQGSTF--LTPSNIQ 119
Query: 136 VPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCS-TN 194
VP ++DWR K VTP+K+Q +CG CWAFS ++EG T + L+ LSEQ LVDCS T
Sbjct: 120 VPDTVDWRTKGYVTPVKNQGQCGSCWAFSTTGSLEGQTFKKTSKLVSLSEQNLVDCSRTE 179
Query: 195 GNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDE 254
GN GC GG M++ F+Y+I N GI +ED YPY A TC +A+++ + +V SGDE
Sbjct: 180 GNMGCEGGLMDQGFQYVIDNHGIDSEDCYPYDAEDETCHYKASCDSAEVTGFTDVTSGDE 239
Query: 255 QALLKAV-SMQPVSIGIAAYTTEFKSYKEGIFN--GVCGTQLDHAVTIVGFGTTEDGANY 311
QAL++AV S+ PVS+ I A F+ Y+ G+++ ++LDH V +VG+G T+ G +Y
Sbjct: 240 QALMEAVASVGPVSVAIDASHQSFQLYESGVYDEPECSSSELDHGVLVVGYG-TDGGKDY 298
Query: 312 WLIKNSWGDTWGDAGYMKILRDE-GLCGIGTQSSYPLA 348
WL+KNSWG+TWG +GY+K+ R++ CGI T +SYPL
Sbjct: 299 WLVKNSWGETWGLSGYIKMSRNKSNQCGIATSASYPLV 336
>gi|29165304|gb|AAO65603.1| cathepsin L precursor [Hydra vulgaris]
Length = 324
Score = 245 bits (625), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 137/339 (40%), Positives = 202/339 (59%), Gaps = 21/339 (6%)
Query: 16 IPMFIIIILL-VSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKEN 74
+ +F ++LL V+ A + R ++S ++ W H + Y + E+ +R+ I+K+N
Sbjct: 1 MKVFCALLLLGVTLAYTI--ERPVKDESWIQ----WKMYHNKVYSHDGEETVRYTIWKDN 54
Query: 75 LEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT 134
I + N +G + L N+F D+TN EF+A + GY SH+ STF N +
Sbjct: 55 ERRIREHNLKGG-DFILKMNQFGDMTNSEFKA-FNGYL----SHKHVNGSTFLTPNNFV- 107
Query: 135 DVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTN 194
P ++DWR++ VTP+KDQ +CG CWAFS ++EG L+ LSEQ LVDCST
Sbjct: 108 -APDTVDWRNEGYVTPVKDQGQCGSCWAFSTTGSLEGQHFKKTGKLVSLSEQNLVDCSTA 166
Query: 195 -GNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGD 253
GNNGC GG M+ AF YI +N+GI +E YPY A G C + + AA + + ++P G+
Sbjct: 167 YGNNGCDGGLMDNAFTYIKENKGIDSEASYPYTAEDGKCVFKKSSVAATDTGFVDIPEGN 226
Query: 254 EQALLKAV-SMQPVSIGIAAYTTEFKSYKEGIFN--GVCGTQLDHAVTIVGFGTTEDGAN 310
E L +AV S+ P+S+ I A F+ Y G++N T+LDH V +VG+G TE G +
Sbjct: 227 ENKLKEAVASVGPISVAIDASHESFQFYSSGVYNEPSCSSTELDHGVLVVGYG-TESGKD 285
Query: 311 YWLIKNSWGDTWGDAGYMKILRD-EGLCGIGTQSSYPLA 348
YWL+KNSW +WGD GY+K+ R+ + CGI T++SYPL
Sbjct: 286 YWLVKNSWNTSWGDKGYIKMRRNAKNQCGIATKASYPLV 324
>gi|354549232|gb|AER27707.1| putative cysteine protease [Phytophthora sp. SH-2011]
Length = 533
Score = 245 bits (625), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 130/305 (42%), Positives = 180/305 (59%), Gaps = 11/305 (3%)
Query: 50 WMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRT-YKLGTNRFSDLTNDEFRALY 108
WM HG ++ D LE R + + N YI + N E T LG N FS ++ DEF+
Sbjct: 31 WMGAHGVTFSDALEFARRLENYIVNDMYIMEHNAENAWTGVTLGHNAFSHMSFDEFKFKM 90
Query: 109 TGYKMPSPSHRSTTSSTFKYQNL-SMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVA 167
TG +P +S + L S +VP+++DW DK VTP+K+Q CG CWAFS
Sbjct: 91 TGLVLPEGYLEQRLAS--RVDGLWSDVEVPSAVDWVDKGGVTPVKNQGMCGSCWAFSTTG 148
Query: 168 AVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQA 227
AVEG T +S L LSEQ+LVDC NG+ GC GG M+ AF++I + GI +ED+Y Y+A
Sbjct: 149 AVEGATFVSSGKLPSLSEQELVDCDHNGDMGCNGGLMDHAFQWIEDHGGICSEDDYEYKA 208
Query: 228 VQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNG 287
C + K++ +++V DE AL AV+ QPVS+ I A F+ YK G+FN
Sbjct: 209 KAQVCRECD--SVVKVTGFQDVNPQDEHALKVAVAQQPVSVAIEADQKAFQFYKSGVFNL 266
Query: 288 VCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRDE----GLCGIGTQS 343
CGT+LDH V VG+G ++G +W +KNSWG +WG+ GY+++ R+E G CGI +
Sbjct: 267 TCGTRLDHGVLAVGYG-NDNGHKFWKVKNSWGASWGEQGYIRLAREENGPAGQCGIASVP 325
Query: 344 SYPLA 348
SYP A
Sbjct: 326 SYPFA 330
>gi|3850787|emb|CAA05360.1| cathepsin S [Mus musculus]
Length = 330
Score = 245 bits (625), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 134/320 (41%), Positives = 194/320 (60%), Gaps = 16/320 (5%)
Query: 37 STHEQSVVEMH-EKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKE---GNRTYKLG 92
+T E+ ++ H + W H + YKD+ E+E+R I+++NL++I N E G TY++G
Sbjct: 15 ATAERPTLDHHWDLWKKTHEKEYKDKNEEEVRRLIWEKNLKFIMIHNLEYSMGMHTYQVG 74
Query: 93 TNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIK 152
N D+TN+E ++P S ++ T +++ S +P ++DWR+K VT +K
Sbjct: 75 MNDMGDMTNEEILCRMGALRIPRQSPKTVT-----FRSYSNRTLPDTVDWREKGCVTEVK 129
Query: 153 DQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTN---GNNGCGGGTMEKAFE 209
Q CG CWAFSAV A+EG K+ LI LS Q LVDCS GN GCGGG M +AF+
Sbjct: 130 YQGSCGACWAFSAVGALEGQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEAFQ 189
Query: 210 YIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAVSMQ-PVSI 268
YII N GI + YPY+A+ C K AA S Y ++P GDE AL +AV+ + PVS+
Sbjct: 190 YIIDNGGIEADASYPYKAMDEKCHYNSKNRAATCSRYIQLPFGDEDALKEAVATKGPVSV 249
Query: 269 GIAAYTTEFKSYKEGIFNG-VCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGY 327
GI A + F YK G+++ C ++H V +VG+GT DG +YWL+KNSWG +GD GY
Sbjct: 250 GIDASHSSFFFYKSGVYDDPSCTGNVNHGVLVVGYGTL-DGKDYWLVKNSWGLNFGDQGY 308
Query: 328 MKILR-DEGLCGIGTQSSYP 346
+++ R ++ CGI + SYP
Sbjct: 309 IRMARNNKNHCGIASYCSYP 328
>gi|326514800|dbj|BAJ99761.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 291
Score = 245 bits (625), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 119/219 (54%), Positives = 153/219 (69%), Gaps = 4/219 (1%)
Query: 133 MTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCS 192
+ DVP+S+DWR K AVT +KDQ +CG CWAFS +AAVEGI I NL LSEQQLVDC
Sbjct: 58 VRDVPSSVDWRQKGAVTAVKDQGQCGSCWAFSTIAAVEGINAIRTKNLTSLSEQQLVDCD 117
Query: 193 TNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSG 252
T N GC GG M+ AF+YI ++ G+A ED YPY+A Q + + +A I YE+VP+
Sbjct: 118 TKSNAGCNGGLMDYAFQYIAKHGGVAAEDAYPYKARQASSCNKKPSAVVTIDGYEDVPAN 177
Query: 253 DEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYW 312
DE AL KAV+ QPV++ I A + F+ Y EG+F G CGT+LDH V VG+GTT DG YW
Sbjct: 178 DETALKKAVAAQPVAVAIEASGSHFQFYSEGVFAGKCGTELDHGVAAVGYGTTVDGTKYW 237
Query: 313 LIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYPL 347
++KNSWG WG+ GY+++ RD EGLCGI ++SYP+
Sbjct: 238 IVKNSWGPEWGEKGYIRMKRDVEDKEGLCGIAMEASYPV 276
>gi|413947586|gb|AFW80235.1| hypothetical protein ZEAMMB73_542371 [Zea mays]
Length = 264
Score = 245 bits (625), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 119/254 (46%), Positives = 174/254 (68%), Gaps = 10/254 (3%)
Query: 13 INTIPMFIIIILLVSCA----SQVVSSRS-THEQSVVEMHEKWMAQHGRSYKDELEKEMR 67
+++ +++ +L+ C S V+++R + + ++ E HE+WMA++GR YKD +K R
Sbjct: 2 VSSKAFLLLLAVLIGCVCSFPSPVLAARELSDDAAMAERHERWMAEYGRVYKDAADKARR 61
Query: 68 FKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFK 127
F++FK+N ++E N + + LG N+F+DLT + F+A G+K S TT FK
Sbjct: 62 FEVFKDNFAFVESFNADKKNKFWLGVNQFADLTTEAFKA-NKGFKPISAEKAPTTG--FK 118
Query: 128 YQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQ 187
Y+NLS++ +PT++DWR K AVTPIK+Q +CGCCWAFSAVAAVEGI K+S NL+ LSEQ+
Sbjct: 119 YENLSISALPTAVDWRTKGAVTPIKNQGQCGCCWAFSAVAAVEGIVKLSTGNLVSLSEQE 178
Query: 188 LVDCSTNG-NNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNY 246
LVDC T+ + GC GG M+ AFE++I+N G+ATE YPY+AV G C K+AA I +
Sbjct: 179 LVDCDTHSMDEGCEGGWMDSAFEFVIKNGGLATESSYPYKAVDGKCKGGSKSAAT-IKGH 237
Query: 247 EEVPSGDEQALLKA 260
E+VP +E AL+KA
Sbjct: 238 EDVPPNNEAALMKA 251
>gi|444519959|gb|ELV12909.1| Cathepsin L1 [Tupaia chinensis]
Length = 333
Score = 244 bits (624), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 132/342 (38%), Positives = 208/342 (60%), Gaps = 23/342 (6%)
Query: 17 PMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLE 76
P + IL + + S+ TH+QS+ E +W A+HG+ Y E+ +R ++++NL+
Sbjct: 3 PSLFLTILCLG----IASAAPTHDQSLDEQWNQWTAEHGKVYSTG-EESLRRAVWEKNLK 57
Query: 77 YIEKANKE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
IE+ N E G T+ +G N F D+TN++FR + TG++ + + F Q
Sbjct: 58 MIEQHNLEYSQGKHTFTMGMNAFGDMTNEDFRQMMTGFQ----NQKYNKGEVF--QPPQP 111
Query: 134 TDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCS- 192
+VP S+DWR+K VTP+K+Q CG CWAFSA A+EG L+ LSEQ LVDCS
Sbjct: 112 LEVPESVDWREKGYVTPVKNQHRCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSQ 171
Query: 193 TNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSG 252
N+GC GG + KAF+Y+ N G+ +E+ YPY+ ++ TC + +AA ++ ++ +P+
Sbjct: 172 PQHNSGCKGGLVIKAFQYVKDNGGLDSEESYPYEEMESTCRYSPGNSAATVTGFKHIPA- 230
Query: 253 DEQALLKAV-SMQPVSIGIAAYTTEFKSYKEGIFNGV-CGTQ-LDHAVTIVGFGTTEDGA 309
+E+AL KAV S+ P+S+ I A+ F+ Y GI + C + L+HAV +VG+G ++G+
Sbjct: 231 EEKALEKAVASVGPISVAIDAHHHSFQFYTGGILHEPNCSPKWLNHAVLVVGYGVMQEGS 290
Query: 310 N---YWLIKNSWGDTWGDAGYMKILRDE-GLCGIGTQSSYPL 347
N YWL+KNSWG+ WG GY+ + +D+ CGI + + YP+
Sbjct: 291 NNNTYWLVKNSWGERWGVGGYIMMAKDKNNHCGIASDALYPI 332
>gi|2098464|pdb|1PCI|A Chain A, Procaricain
gi|2098465|pdb|1PCI|B Chain B, Procaricain
gi|2098466|pdb|1PCI|C Chain C, Procaricain
Length = 322
Score = 244 bits (624), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 136/314 (43%), Positives = 189/314 (60%), Gaps = 12/314 (3%)
Query: 38 THEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFS 97
T + ++++ WM H + Y++ EK RF+IFK+NL YI++ NK+ N +Y LG N F+
Sbjct: 13 TSTERLIQLFNSWMLNHNKFYENVDEKLYRFEIFKDNLNYIDETNKK-NNSYWLGLNEFA 71
Query: 98 DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQEC 157
DL+NDEF Y G + + +S ++ N + ++P ++DWR K AVTP++ Q C
Sbjct: 72 DLSNDEFNEKYVGSLIDATIEQSYDE---EFINEDIVNLPENVDWRKKGAVTPVRHQGSC 128
Query: 158 GCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGI 217
G CWAFSAVA VEGI KI L++LSEQ+LVDC ++GC GG A EY+ +N GI
Sbjct: 129 GSCWAFSAVATVEGINKIRTGKLVELSEQELVDCERR-SHGCKGGYPPYALEYVAKN-GI 186
Query: 218 ATEDEYPYQAVQGTCSAAQKAAA-AKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTE 276
+YPY+A QGTC A Q K S V +E LL A++ QPVS+ + +
Sbjct: 187 HLRSKYPYKAKQGTCRAKQVGGPIVKTSGVGRVQPNNEGNLLNAIAKQPVSVVVESKGRP 246
Query: 277 FKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILR---- 332
F+ YK GIF G CGT++D AVT VG+G + LIKNSWG WG+ GY++I R
Sbjct: 247 FQLYKGGIFEGPCGTKVDGAVTAVGYGKSGGKGYI-LIKNSWGTAWGEKGYIRIKRAPGN 305
Query: 333 DEGLCGIGTQSSYP 346
G+CG+ S YP
Sbjct: 306 SPGVCGLYKSSYYP 319
>gi|387914010|gb|AFK10614.1| cathepsin L [Callorhinchus milii]
gi|392873762|gb|AFM85713.1| cathepsin L [Callorhinchus milii]
gi|392877488|gb|AFM87576.1| cathepsin L [Callorhinchus milii]
Length = 338
Score = 244 bits (624), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 137/342 (40%), Positives = 201/342 (58%), Gaps = 17/342 (4%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEY 77
M + ++L C + +++ S + + E+W + HG+SY ++ E+ R +++++L
Sbjct: 1 MRLPFVVLSLCLAGGLAAPSL-DPGLDTHWEQWKSWHGKSY-EQKEETWRRMVWEKHLRV 58
Query: 78 IEKANKE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT 134
IE N E G +++LG N F D+ N+EFR L GYK +H+ S F N
Sbjct: 59 IEIHNLEHSLGKHSFRLGMNHFGDMPNEEFRQLMNGYKYKQ-THKKLQGSHFLEPNF--L 115
Query: 135 DVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCS-T 193
+VP +DWRD+ VTP+KDQ +CG CWAFS A+EG L+ LSEQ LV+CS
Sbjct: 116 EVPKHVDWRDEGYVTPVKDQGQCGSCWAFSTTGALEGQHFRRTGQLVSLSEQNLVECSKP 175
Query: 194 NGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGT-CSAAQKAAAAKISNYEEVPSG 252
GN GC GG M++AF+Y+ N GI +ED YPY T C + AA + + ++PSG
Sbjct: 176 EGNEGCNGGLMDQAFQYVKDNGGIDSEDSYPYVGTDDTPCHYNPQYNAANDTGFVDIPSG 235
Query: 253 DEQALLKAV-SMQPVSIGIAAYTTEFKSYKEGI-FNGVC-GTQLDHAVTIVGFGTTE--- 306
E+AL+KA+ ++ PVS+ I A T F+ Y+ GI F C T LDH V +VG+G +
Sbjct: 236 KERALMKAIAAVGPVSVAIDAGHTSFQFYQSGIYFEAECSSTDLDHGVLVVGYGVEKRDT 295
Query: 307 DGANYWLIKNSWGDTWGDAGYMKILRD-EGLCGIGTQSSYPL 347
DG YW++KNSW + WG GY+ + +D + CGI T +SYPL
Sbjct: 296 DGKKYWIVKNSWSEKWGQNGYILMAKDKDNHCGIATAASYPL 337
>gi|221090861|ref|XP_002167224.1| PREDICTED: cathepsin L-like [Hydra magnipapillata]
Length = 324
Score = 244 bits (624), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 134/331 (40%), Positives = 195/331 (58%), Gaps = 18/331 (5%)
Query: 23 ILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKAN 82
+LL+ R ++S ++ W H + Y + E+ +R+ I+K+N I + N
Sbjct: 7 LLLLGVTLAYTIERPVKDESWIQ----WKMYHNKVYSHDGEETVRYTIWKDNERRIREHN 62
Query: 83 KEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDW 142
+G + L N+F D+TN EF+A + GY SH+ STF N + P ++DW
Sbjct: 63 LKGG-DFLLKMNQFGDMTNSEFKA-FNGYL----SHKHVNGSTFLTPNNFV--APDTVDW 114
Query: 143 RDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTN-GNNGCGG 201
R++ VTP+KDQ +CG CWAFS ++EG L+ LSEQ LVDCST GNNGC G
Sbjct: 115 RNEGYVTPVKDQGQCGSCWAFSTTGSLEGQHFKKTGKLVSLSEQNLVDCSTAYGNNGCNG 174
Query: 202 GTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAV 261
G M+ AF YI +N+GI +E YPY A G C + + AA + + ++P G+E L +AV
Sbjct: 175 GLMDNAFTYIKENKGIDSEASYPYTAEDGKCVFKKPSVAATDTGFVDLPEGNENKLKEAV 234
Query: 262 -SMQPVSIGIAAYTTEFKSYKEGIFN--GVCGTQLDHAVTIVGFGTTEDGANYWLIKNSW 318
S+ P+S+ I A F+ Y G++N T+LDH V +VG+G TE G +YWL+KNSW
Sbjct: 235 ASVGPISVAIDASHESFQFYSSGVYNEPSCSSTELDHGVLVVGYG-TESGKDYWLVKNSW 293
Query: 319 GDTWGDAGYMKILRD-EGLCGIGTQSSYPLA 348
+WGD GY+K+ R+ + CGI T++SYPL
Sbjct: 294 NTSWGDKGYIKMRRNAKNQCGIATKASYPLV 324
>gi|325303202|tpg|DAA34687.1| TPA_inf: cathepsin L-like cysteine proteinase B [Amblyomma
variegatum]
Length = 337
Score = 244 bits (624), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 131/338 (38%), Positives = 199/338 (58%), Gaps = 10/338 (2%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEY 77
M ++L CA+ + ++ TH++ V + A HG+ Y+ E E+ R KI+ EN
Sbjct: 1 MRGFVVLCFLCAA-MTAAAITHQELVGAEWSAFKALHGKEYQSETEEYYRLKIYMENRMM 59
Query: 78 IEKANKE--GNR-TYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT 134
I + N++ N+ +YKL N + D+ + EF + G++ S S + + +
Sbjct: 60 IARHNEKYANNKVSYKLAMNEYGDMLHHEFVSTRNGFRRDYRSKPRQGSFYIEPEGIEDK 119
Query: 135 DVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTN 194
+P ++DWR K AVTP+K+Q +CG CWAFS ++EG +++ LSEQ LVDCST
Sbjct: 120 HLPKTVDWRKKGAVTPVKNQGQCGSCWAFSTTGSLEGQHFRKSGDMVSLSEQNLVDCSTA 179
Query: 195 -GNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGD 253
GNNGC GG M+ AF+YI N GI TE YPY GTC + A + + ++P G+
Sbjct: 180 FGNNGCEGGLMDNAFKYIKANGGIDTEKSYPYNGTDGTCHFKKSDVGATDTGFVDIPEGN 239
Query: 254 EQALLKAV-SMQPVSIGIAAYTTEFKSYKEGIFNGV-CGTQ-LDHAVTIVGFGTTEDGAN 310
E L KAV ++ P+S+ I A F+ Y +G+++ C ++ LDH V +VG+GT +D +
Sbjct: 240 EHLLKKAVATVGPISVAIDASHQSFQFYSQGVYDEPECSSENLDHGVLVVGYGTKDD-QD 298
Query: 311 YWLIKNSWGDTWGDAGYMKILRD-EGLCGIGTQSSYPL 347
YWL+KNSWG TWGD GY+ + R+ + CGI + +SYPL
Sbjct: 299 YWLVKNSWGTTWGDGGYIYMTRNKDNQCGIASSASYPL 336
>gi|440906716|gb|ELR56945.1| Cathepsin S, partial [Bos grunniens mutus]
Length = 342
Score = 244 bits (624), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 138/341 (40%), Positives = 201/341 (58%), Gaps = 18/341 (5%)
Query: 15 TIPMFIIIILLVSCASQVVSSRSTHEQSVVEMH-EKWMAQHGRSYKDELEKEMRFKIFKE 73
+I M ++ L+ C+S + H ++ H + W +G+ YK++ E+ R I+++
Sbjct: 9 SITMNWLVWALLLCSSAMAQ---VHRDPTLDHHWDLWKKTYGKQYKEKNEEVARRLIWEK 65
Query: 74 NLEYIEKANKE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQN 130
NL+ + N E G +Y+LG N D+T++E +L + ++PS R+ T + Q
Sbjct: 66 NLKTVTLHNLEHSMGMHSYELGMNHLGDMTSEEVISLMSSLRVPSQWPRNVTYKSDPNQK 125
Query: 131 LSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVD 190
L P S+DWR+K VT +K Q CG CWAFSAV A+E K+ L+ LS Q LVD
Sbjct: 126 L-----PDSMDWREKGCVTEVKYQGACGSCWAFSAVGALEAQVKLKTGKLVSLSAQNLVD 180
Query: 191 CSTN--GNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEE 248
CST GN GC GG M +AF+YII N GI +E YPY+A+ G C K AA S Y E
Sbjct: 181 CSTAKYGNKGCNGGFMTEAFQYIIDNNGIDSEASYPYKAMDGKCQYDVKNRAATCSRYIE 240
Query: 249 VPSGDEQALLKAVSMQ-PVSIGIAAYTTEFKSYKEGI-FNGVCGTQLDHAVTIVGFGTTE 306
+P G E+AL +AV+ + PVS+GI A + F YK G+ ++ C ++H V +VG+G
Sbjct: 241 LPFGSEEALKEAVANKGPVSVGIDASHSSFFLYKTGVYYDPSCTQNVNHGVLVVGYGNL- 299
Query: 307 DGANYWLIKNSWGDTWGDAGYMKILRDEG-LCGIGTQSSYP 346
DG +YWL+KNSWG +GD GY+++ R+ G CGI + SYP
Sbjct: 300 DGKDYWLVKNSWGLHFGDQGYIRMARNSGNHCGIASYPSYP 340
>gi|156397875|ref|XP_001637915.1| predicted protein [Nematostella vectensis]
gi|156225031|gb|EDO45852.1| predicted protein [Nematostella vectensis]
Length = 331
Score = 244 bits (624), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 131/304 (43%), Positives = 184/304 (60%), Gaps = 11/304 (3%)
Query: 50 WMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYT 109
W + HG+ Y ++ E+ MR I++ NL+ I N EG ++KL N D+T+ E
Sbjct: 32 WKSFHGKEYPNKNEETMRNFIWQNNLKKIVTHN-EGKHSFKLAMNHLGDMTSLEISQTLL 90
Query: 110 GYKMPSPSHRSTTSSTF-KYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAA 168
G K+ + +TF N+ + D S+DWR K VTP+K+Q +CG CWAFS A
Sbjct: 91 GLKLKKHAESQPKGATFLPPANVKVVD---SIDWRSKGYVTPVKNQGQCGSCWAFSTTGA 147
Query: 169 VEGITKISGANLIQLSEQQLVDCSTN-GNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQA 227
+EG L+ LSEQ LVDCS GNNGC GG M+ AF+YI +N GI TE YPY A
Sbjct: 148 LEGQHFRKTGKLVSLSEQNLVDCSGKYGNNGCEGGLMDNAFQYIKENGGIDTEKSYPYLA 207
Query: 228 VQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAV-SMQPVSIGIAAYTTEFKSYKEGIFN 286
G C + A AK + + ++P+GDE AL +A+ S+ P+SI I A + F Y +G+++
Sbjct: 208 KDGVCHYNKSAIGAKDTGFVDIPTGDENALQQALASVGPISIAIDASQSTFHFYHQGVYD 267
Query: 287 --GVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILR-DEGLCGIGTQS 343
T+LDH V VG+G T+DG +YWL+KNSWG +WG+ GY+KI R D CG+ +++
Sbjct: 268 DPDCSSTRLDHGVLAVGYG-TDDGKDYWLVKNSWGPSWGEEGYIKIARNDHDKCGVASKA 326
Query: 344 SYPL 347
SYPL
Sbjct: 327 SYPL 330
>gi|392881548|gb|AFM89606.1| cathepsin L [Callorhinchus milii]
Length = 338
Score = 244 bits (624), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 137/342 (40%), Positives = 201/342 (58%), Gaps = 17/342 (4%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEY 77
M + ++L C + +++ S + + E+W + HG+SY ++ E+ R +++++L
Sbjct: 1 MRLPFVVLSLCLAGGLAAPSL-DPGLDTHWEQWKSWHGKSY-EQKEETWRRMVWEKHLRV 58
Query: 78 IEKANKE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT 134
IE N E G +++LG N F D+ N+EFR L GYK +H+ S F N
Sbjct: 59 IEIHNLEHSLGKHSFRLGMNHFGDMPNEEFRQLMNGYKYKQ-THKKLQGSHFLEPNFQ-- 115
Query: 135 DVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCS-T 193
+VP +DWRD+ VTP+KDQ +CG CWAFS A+EG L+ LSEQ LV+CS
Sbjct: 116 EVPKHVDWRDEGYVTPVKDQGQCGSCWAFSTTGALEGQHFRRTGQLVSLSEQNLVECSKP 175
Query: 194 NGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGT-CSAAQKAAAAKISNYEEVPSG 252
GN GC GG M++AF+Y+ N GI +ED YPY T C + AA + + ++PSG
Sbjct: 176 EGNEGCNGGLMDQAFQYVKDNGGIDSEDSYPYVGTDDTPCHYNPQYNAANDTGFVDIPSG 235
Query: 253 DEQALLKAV-SMQPVSIGIAAYTTEFKSYKEGI-FNGVC-GTQLDHAVTIVGFGTTE--- 306
E+AL+KA+ ++ PVS+ I A T F+ Y+ GI F C T LDH V +VG+G +
Sbjct: 236 KERALMKAIAAVGPVSVAIDAGHTSFQFYQSGIYFEAECSSTDLDHGVLVVGYGVEKRDT 295
Query: 307 DGANYWLIKNSWGDTWGDAGYMKILRD-EGLCGIGTQSSYPL 347
DG YW++KNSW + WG GY+ + +D + CGI T +SYPL
Sbjct: 296 DGKKYWIVKNSWSEKWGQNGYILMAKDKDNHCGIATAASYPL 337
>gi|110349473|gb|ABG73217.1| cathepsin L 1 precursor [Diaprepes abbreviatus]
Length = 322
Score = 244 bits (624), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 133/338 (39%), Positives = 206/338 (60%), Gaps = 23/338 (6%)
Query: 16 IPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENL 75
+ +FI LLV+ ++ V+ E++ V+ + + +HG++YK+++E+ RF IFK+NL
Sbjct: 1 MKVFIAACLLVAVSATVL------EETGVKF-QAFKLKHGKTYKNQVEETARFNIFKDNL 53
Query: 76 EYIEKAN---KEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
IE+ N ++G +YK G NRF+D+T +EFRA T P H +TT L+
Sbjct: 54 RAIEQHNVLYEQGLVSYKKGINRFTDMTQEEFRAFLTLSSSKKP-HFNTTEHV-----LT 107
Query: 133 MTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCS 192
VP S+DWR K VT +KDQ CG CWAFS + E L+ LSEQQLVDCS
Sbjct: 108 GLAVPDSIDWRTKGQVTGVKDQGNCGSCWAFSVTGSTEAAYYRKAGKLVSLSEQQLVDCS 167
Query: 193 TNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSG 252
T+ N GC GG +++ F Y ++++G+ E YPY+ G+C + K+S ++ + S
Sbjct: 168 TDINAGCNGGYLDETFTY-VKSKGLEAESTYPYKGTDGSCKYSASKVVTKVSGHKSLKSE 226
Query: 253 DEQALLKAV-SMQPVSIGIAAYTTEFKSYKEGIF-NGVCG-TQLDHAVTIVGFGTTEDGA 309
DE ALL AV ++ PVS+ I A T SY+ GI+ + C ++L+H V +VG+GT+ +G
Sbjct: 227 DENALLDAVGNVGPVSVAIDA--TYLSSYESGIYEDDWCSPSELNHGVLVVGYGTS-NGK 283
Query: 310 NYWLIKNSWGDTWGDAGYMKILRDEGLCGIGTQSSYPL 347
YW++KNSWG ++G++GY ++LR + CG+ + YP+
Sbjct: 284 KYWIVKNSWGGSFGESGYFRLLRGKNECGVAEDTVYPI 321
>gi|354473025|ref|XP_003498737.1| PREDICTED: cathepsin S-like [Cricetulus griseus]
Length = 341
Score = 244 bits (624), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 137/342 (40%), Positives = 199/342 (58%), Gaps = 20/342 (5%)
Query: 15 TIPMFIIIILLVSCASQVVSSRSTHEQSVVEMH-EKWMAQHGRSYKDELEKEMRFKIFKE 73
TI ++ + +V C ++ ++ H + W HG+ YK++ E+E R I+++
Sbjct: 8 TITRWLFWVPMVCC----LAGDQLQRDPTLDHHWDLWKKFHGKQYKEKNEEEARRLIWEK 63
Query: 74 NLEYIEKANKEGN---RTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQN 130
NL+ + N E + +Y LG N D+T++E ++PS HR++T + Q
Sbjct: 64 NLKLVMLHNLEYSLEMHSYSLGMNHMGDMTSEEVLGQMRPLRVPSQRHRNSTYKSNPNQK 123
Query: 131 LSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVD 190
L P S+DWR+K VT +K Q CG CWAFSAV A+E K+ L+ LS Q LVD
Sbjct: 124 L-----PDSMDWREKGCVTEVKYQGSCGSCWAFSAVGALEAQLKLKTGKLVSLSAQNLVD 178
Query: 191 CSTN---GNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYE 247
CST GN GC GG M +AF+YII N GI ++ YPY+AV C K+ AA S Y
Sbjct: 179 CSTEEKYGNKGCDGGFMTRAFQYIIDNGGIDSDASYPYKAVAEKCHYDSKSRAATCSRYM 238
Query: 248 EVPSGDEQALLKAVSMQ-PVSIGIAAYTTEFKSYKEGIFN-GVCGTQLDHAVTIVGFGTT 305
E+PSGDE+AL +AV+ + PVS+GI A F YK G+++ C ++H V +VG+G
Sbjct: 239 ELPSGDEEALKEAVANKGPVSVGIDASHPSFFLYKSGVYDEPSCTENVNHGVLVVGYGNL 298
Query: 306 EDGANYWLIKNSWGDTWGDAGYMKILR-DEGLCGIGTQSSYP 346
DG +YWL+KNSWG +GD GY+++ R ++ CGI + SYP
Sbjct: 299 -DGKDYWLVKNSWGLHFGDQGYIRMARNNKNQCGIASYGSYP 339
>gi|6630974|gb|AAF19631.1|AF194427_1 cysteine proteinase precursor [Myxine glutinosa]
Length = 324
Score = 244 bits (623), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 132/308 (42%), Positives = 184/308 (59%), Gaps = 12/308 (3%)
Query: 48 EKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKAN---KEGNRTYKLGTNRFSDLTNDEF 104
E W ++G+SY E+ +R ++++ NL+ +++ N +G Y+LG N ++DL N+EF
Sbjct: 20 ESWKGKYGKSYLGRGEEVLRKRVWESNLQIVQQHNVLADQGQANYRLGMNTYADLYNEEF 79
Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFS 164
AL + +S+T TFK L +P+S+DWR++ VTP+KDQ +CG CW FS
Sbjct: 80 MALKGSGGLLQAKDKSSTQ-TFK--PLVGVTLPSSVDWRNQGYVTPVKDQGQCGSCWTFS 136
Query: 165 AVAAVEGITKISGANLIQLSEQQLVDCSTN-GNNGCGGGTMEKAFEYIIQNQGIATEDEY 223
A ++EG NL+ LSEQQLVDC+ GN GC GG ME A++YI G+ E Y
Sbjct: 137 ATGSLEGQHFAKTGNLLSLSEQQLVDCAGRYGNYGCNGGLMESAYDYIKGVGGVELESAY 196
Query: 224 PYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAV-SMQPVSIGIAAYTTEFKSYKE 282
PY A G C + A Y +P GDEQAL++AV ++ PV++ I A F+ Y+
Sbjct: 197 PYTARDGRCKFDRSKVVATCKGYVVIPVGDEQALMQAVGTIGPVAVSIDASGYSFQLYES 256
Query: 283 GI--FNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRDE-GLCGI 339
G+ F T LDH V VG+G TE G NYWL+KNSWG WGD GY+K+ +D+ CGI
Sbjct: 257 GVYDFRRCSSTNLDHGVLAVGYG-TEGGQNYWLVKNSWGPGWGDQGYIKMSKDKNNQCGI 315
Query: 340 GTQSSYPL 347
T S YPL
Sbjct: 316 ATDSCYPL 323
>gi|156124996|gb|ABU50816.1| Ale o 1 allergen [Aleuroglyphus ovatus]
Length = 337
Score = 244 bits (623), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 128/308 (41%), Positives = 193/308 (62%), Gaps = 13/308 (4%)
Query: 48 EKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKE---GNRTYKLGTNRFSDLTNDEF 104
E++ + GR Y + R IF+ NL++I + N + G+ T+ + N F+DL+N+EF
Sbjct: 34 EQFKSTFGRVYPSPEIELHRKSIFRANLQFILRHNIDYFNGDSTFSVSVNNFTDLSNEEF 93
Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFS 164
RA + GY+ + + + + + + + +P ++DW K VTPIK+QQ+CG CWAFS
Sbjct: 94 RATFNGYRRLA----AVSLADSVHADNDVEALPATVDWTTKGVVTPIKNQQQCGSCWAFS 149
Query: 165 AVAAVEGITKISGANLIQLSEQQLVDCS-TNGNNGCGGGTMEKAFEYIIQNQGIATEDEY 223
AVA++EG + L+ LSEQ LVDCS G+ GC GG M+ AF+Y+IQN+GI TE Y
Sbjct: 150 AVASMEGQHALKTGKLVSLSEQNLVDCSAAEGDMGCSGGWMDYAFKYVIQNRGIDTEASY 209
Query: 224 PYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAV-SMQPVSIGIAAYTTEFKSYKE 282
PY+A+ +C + + A I ++ +V +GDE AL AV S+ P+S+ I A F+ Y
Sbjct: 210 PYKAIDESCEFKRNSIGATIHSFVDVKTGDESALQNAVASIGPISVAIDASQPSFQFYSS 269
Query: 283 GIFNGV-CGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD-EGLCGI 339
G++N C T+ LDH VT VG+GT +G YW +KNSWG +WG GY+ + R+ + CGI
Sbjct: 270 GVYNEPDCSTEILDHGVTAVGYGTL-NGVPYWKVKNSWGTSWGQKGYIFMSRNKQNQCGI 328
Query: 340 GTQSSYPL 347
T++SYP+
Sbjct: 329 ATKASYPV 336
>gi|52076128|dbj|BAD46641.1| putative cysteine proteinase [Oryza sativa Japonica Group]
gi|52076135|dbj|BAD46648.1| putative cysteine proteinase [Oryza sativa Japonica Group]
Length = 374
Score = 244 bits (623), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 140/330 (42%), Positives = 193/330 (58%), Gaps = 27/330 (8%)
Query: 40 EQSVVEMHEKWMAQHG---RSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRF 96
E+S+ ++++W +G S +D +K RF++FK+N YI N++ +YKLG N+F
Sbjct: 36 EESMWSLYQRWRHVYGAASSSPRDLADKGSRFEVFKKNARYIHDFNRKKGMSYKLGLNKF 95
Query: 97 SDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQE 156
+DLT +EF A YTG P P + D P + DWR+ AVT +KDQ
Sbjct: 96 ADLTLEEFTAKYTGAN-PGPITGLKNGTGSPPLAAVAGDAPPAWDWREHGAVTRVKDQGP 154
Query: 157 CGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQG 216
CG CWAFS V AVEGI I NL+ LSEQQ++DCS G+ C GG AF+Y + N G
Sbjct: 155 CGSCWAFSVVEAVEGINAIMTGNLLTLSEQQVLDCSGAGD--CSGGYTSYAFDYAVSN-G 211
Query: 217 IATED------------EYP-YQAVQGTCS-AAQKAAAAKISNYEEVPSGDEQALLKAVS 262
I + YP Y+AVQ C KA KI +Y V DE+AL +AV
Sbjct: 212 ITLDQCFSPPTTGENYFYYPAYEAVQEPCRFDPNKAPIVKIDSYSFVDPNDEEALKQAVY 271
Query: 263 MQ-PVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDT 321
Q PVS+ I A + EF Y+ G+F+G CGT+L+HAV +VG+ TEDG YW++KNSWG
Sbjct: 272 SQGPVSVLIEA-SYEFMIYQGGVFSGPCGTELNHAVLVVGYDETEDGTPYWIVKNSWGAG 330
Query: 322 WGDAGYMKILRD----EGLCGIGTQSSYPL 347
WG++GY++++R+ EG+CGI YP+
Sbjct: 331 WGESGYIRMIRNIPAPEGICGIAMYPIYPI 360
>gi|326501772|dbj|BAK02675.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 333
Score = 244 bits (623), Expect = 5e-62, Method: Compositional matrix adjust.
Identities = 136/343 (39%), Positives = 196/343 (57%), Gaps = 19/343 (5%)
Query: 13 INTIPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFK 72
++ I + ++ L SC + + H + W + + Y D E+ +R ++
Sbjct: 1 MHAISVLAVLALAFSCTLAFDAKLNQHWKL-------WKEANNKRYSDA-EEHVRRATWE 52
Query: 73 ENLEYIEKANKE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQ 129
NL+ +++ N + G TY LG N+++D+T EF + GY R+ TF +
Sbjct: 53 GNLQKVQEHNLQADLGVHTYWLGMNKYADMTVTEFVKVMNGYNATMRGQRTQDRHTFSFN 112
Query: 130 NLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLV 189
S +P ++DWRDK VT +KDQ +CG CWAFS A+EG L+ LSEQ LV
Sbjct: 113 --SKIALPDTVDWRDKGYVTDVKDQGQCGSCWAFSTTGALEGQHFKQTGKLVSLSEQNLV 170
Query: 190 DCS-TNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEE 248
DCS GN GC GG M++AFEYI +N GI TED YPY+AV C A + + +
Sbjct: 171 DCSGKQGNMGCNGGLMDQAFEYIKENNGIDTEDSYPYEAVDNQCRFKAANVGATDTGFTD 230
Query: 249 VPSGDEQALLKAV-SMQPVSIGIAAYTTEFKSYKEGIFN-GVCG-TQLDHAVTIVGFGTT 305
+ S DE AL +AV ++ P+S+ I A T F+ YK G++N C T+LDH V VG+G T
Sbjct: 231 ITSKDESALQQAVATVGPISVAIDAGHTSFQLYKHGVYNEPFCSQTRLDHGVLAVGYG-T 289
Query: 306 EDGANYWLIKNSWGDTWGDAGYMKILRDE-GLCGIGTQSSYPL 347
+ G +YWL+KNSWG+ WGD GY+K+ R++ CGI T +SYPL
Sbjct: 290 DSGKDYWLVKNSWGEGWGDKGYIKMTRNKRNQCGIATAASYPL 332
>gi|356549192|ref|XP_003542981.1| PREDICTED: cysteine proteinase RD21a-like [Glycine max]
Length = 517
Score = 244 bits (623), Expect = 5e-62, Method: Compositional matrix adjust.
Identities = 129/317 (40%), Positives = 191/317 (60%), Gaps = 17/317 (5%)
Query: 40 EQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTY--KLGTNRFS 97
E+ V+E+ ++W ++ + Y+ ++++RF+ FK NL+YI + N + Y LG NRF+
Sbjct: 43 EEGVIELFQRWKEENKKIYRSPDQEKLRFENFKRNLKYIAEKNSKRISPYGQSLGLNRFA 102
Query: 98 DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQEC 157
D++N+EF++ +T S R+ S ++ S D P SLDWR K VT +KDQ C
Sbjct: 103 DMSNEEFKSKFTSKVKKPFSKRNGLSG----KDHSCEDAPYSLDWRKKGVVTAVKDQGYC 158
Query: 158 GCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGI 217
GCCWAFS+ A+EGI I +LI LSE +LVDC N+GC GG M+ AFE+++ N GI
Sbjct: 159 GCCWAFSSTGAIEGINAIVSGDLISLSEPELVDCDRT-NDGCDGGHMDYAFEWVMHNGGI 217
Query: 218 ATEDEYPYQAVQGTCSAA-QKAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTE 276
TE YPY GTC+ A ++ I Y V D ++LL A QP+S GI + +
Sbjct: 218 DTETNYPYSGADGTCNVAKEETKVIGIDGYYNVEQSD-RSLLCATVKQPISAGIDGSSWD 276
Query: 277 FKSYKEGIFNGVCGT---QLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD 333
F+ Y GI++G C + +DHA+ +VG+G+ D +YW++KNSWG +WG GY+ I R+
Sbjct: 277 FQLYIGGIYDGDCSSDPDDIDHAILVVGYGSEGD-EDYWIVKNSWGTSWGMEGYIYIRRN 335
Query: 334 E----GLCGIGTQSSYP 346
G+C I +SYP
Sbjct: 336 TNLKYGVCAINYMASYP 352
>gi|161408097|dbj|BAF94152.1| cathepsin L-like cysteine protease 2 [Plautia stali]
Length = 334
Score = 244 bits (623), Expect = 5e-62, Method: Compositional matrix adjust.
Identities = 136/334 (40%), Positives = 196/334 (58%), Gaps = 11/334 (3%)
Query: 23 ILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKAN 82
+L++SC + + S + S E + H + Y +ELE+ R KIF EN + IEK N
Sbjct: 4 LLVLSCLIALGQAVSFFDLSADEF-TLFKKFHRKEYDNELEESYRKKIFLENKKRIEKHN 62
Query: 83 ---KEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTS 139
K+G ++KL N +D+ E+ +Y G+ S ++ + S + + + +
Sbjct: 63 SRYKQGKVSFKLKLNHLADMLIHEYSDVYLGFNKSSKANNNKLQS-YTFIPPAHVTLNKE 121
Query: 140 LDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCS-TNGNNG 198
+DWR K AVTP+K+Q CG CWAFS A+EG L+ LSEQ LVDCS + GNNG
Sbjct: 122 VDWRTKGAVTPVKNQGHCGSCWAFSTTGALEGQNFRKTGKLVSLSEQNLVDCSGSYGNNG 181
Query: 199 CGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALL 258
C GG M+ AF+YI +N GI TE YPY+ TC + + A S + ++ GDE+AL+
Sbjct: 182 CEGGLMDNAFQYIKENHGIDTEKSYPYEGEDETCRFRKTSIGATDSGFVDITQGDEEALM 241
Query: 259 KAV-SMQPVSIGIAAYTTEFKSYKEGI-FNGVCGTQ-LDHAVTIVGFGTTEDGANYWLIK 315
+AV ++ P+S+ I A F+ Y EG+ + C ++ LDH V +VG+G ED YWL+K
Sbjct: 242 QAVATIGPISVAIDASHQSFQFYSEGVYYEPECSSENLDHGVLVVGYG-VEDNQKYWLVK 300
Query: 316 NSWGDTWGDAGYMKILRD-EGLCGIGTQSSYPLA 348
NSWG WGD GY+K+ RD + CGI TQ+SYPL
Sbjct: 301 NSWGTQWGDGGYIKMARDQDNNCGIATQASYPLV 334
>gi|254746340|emb|CAX16635.1| putative C1A cysteine protease precursor [Manduca sexta]
Length = 342
Score = 244 bits (622), Expect = 5e-62, Method: Compositional matrix adjust.
Identities = 130/317 (41%), Positives = 181/317 (57%), Gaps = 16/317 (5%)
Query: 48 EKWMA---QHGRSYKDELEKEMRFKIFKENLEYIEKANK---EGNRTYKLGTNRFSDLTN 101
E+W+A QH + Y E+E R KI+ EN I K N+ +G +YKLG N+++D+ +
Sbjct: 26 EEWVAFKMQHDKKYDSEVEDRFRMKIYAENKHKIAKHNQLYEQGLVSYKLGPNKYTDMLH 85
Query: 102 DEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM-----TDVPTSLDWRDKKAVTPIKDQQE 156
EF GY + ++ + + P +DW K AVT +KDQ +
Sbjct: 86 HEFIQAMNGYNRTAKHNKGLYGKKHDVRGATFIPPAHVKYPDHVDWTKKGAVTEVKDQGK 145
Query: 157 CGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDC-STNGNNGCGGGTMEKAFEYIIQNQ 215
CG CWAFS A+EG L+ LSEQ L+DC ST GNNGC GG M+ AF+YI N
Sbjct: 146 CGSCWAFSTTGALEGQHFRKSGYLVSLSEQNLIDCSSTYGNNGCNGGLMDNAFKYIKDNG 205
Query: 216 GIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAV-SMQPVSIGIAAYT 274
GI TE YPY+ V C K + A+ + ++PSGDE+ L++AV ++ PVS+ I A
Sbjct: 206 GIDTEKTYPYEGVDDKCRYNPKNSGAEDVGFVDIPSGDEEKLMQAVATVGPVSVAIDASQ 265
Query: 275 TEFKSYKEGIFNGV--CGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILR 332
F+ Y G++ T LDH V +VG+GT E G +YWL+KNSW TWG+ GY+K+ R
Sbjct: 266 NSFQFYSGGVYYDTECSSTDLDHGVLVVGYGTDEAGGDYWLVKNSWSRTWGELGYIKMAR 325
Query: 333 D-EGLCGIGTQSSYPLA 348
+ + CGI T +SYPL
Sbjct: 326 NRDNHCGIATDASYPLV 342
>gi|151176971|gb|ABR88030.1| digestive cysteine protease [Dermestes frischii]
Length = 339
Score = 244 bits (622), Expect = 5e-62, Method: Compositional matrix adjust.
Identities = 134/340 (39%), Positives = 196/340 (57%), Gaps = 15/340 (4%)
Query: 19 FIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYI 78
F ++ L+ +Q VS V E + QH + YK + E++ R KIF EN +
Sbjct: 3 FFVLALVFIVGAQAVSFFDL----VQEQWGTFKLQHKKQYKSDTEEKFRMKIFMENSHKV 58
Query: 79 EKANK---EGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFK---YQNLS 132
K NK G +YKL N+++D+ + EF G+ + TS + + +
Sbjct: 59 AKXNKLYEMGLVSYKLKINKYADMLHHEFVHTVNGFNRTKNTPLLGTSEDEQGATFIAPA 118
Query: 133 MTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCS 192
P ++DWR+ AVT +KDQ CG CW+FSA A+EG L+ LSEQ LVDCS
Sbjct: 119 NVKFPENVDWREHGAVTXVKDQGHCGSCWSFSATGALEGQHFRKTNKLVSLSEQNLVDCS 178
Query: 193 TN-GNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPS 251
T GN+GC GG M+ AF+Y+ N GI TE YPY A C K + A + ++P+
Sbjct: 179 TKFGNDGCNGGLMDNAFKYVKYNHGIDTEASYPYHADDEKCHYNPKTSGATDRGFVDIPT 238
Query: 252 GDEQALLKAV-SMQPVSIGIAAYTTEFKSYKEGI-FNGVCGT-QLDHAVTIVGFGTTEDG 308
GDE+ L+ AV ++ PVS+ I A F+ Y EG+ ++ C + +LDH V +VG+GT E+G
Sbjct: 239 GDEEKLMAAVATVGPVSVAIDASHESFQLYSEGVYYDPECSSEELDHGVLVVGYGTDENG 298
Query: 309 ANYWLIKNSWGDTWGDAGYMKILRD-EGLCGIGTQSSYPL 347
+YW++KNSWG++WG+ GY+K+ R+ + CGI TQ+SYPL
Sbjct: 299 QDYWIVKNSWGESWGEQGYIKMARNRDNNCGIATQASYPL 338
>gi|326531188|dbj|BAK04945.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 360
Score = 244 bits (622), Expect = 5e-62, Method: Compositional matrix adjust.
Identities = 136/350 (38%), Positives = 196/350 (56%), Gaps = 22/350 (6%)
Query: 17 PMFIIIILLVSCASQVVSSRSTHEQSVVEMHE--KWMAQHGRSYKDELEKEMRFKIFKEN 74
P+ +L++ A+ S R ++ M W A H +SY+ E+ RF+++++N
Sbjct: 10 PVITASTILLAWAAAAASGRGVDVGDMLMMDRFLMWQATHNQSYRSAEERLRRFQVYRDN 69
Query: 75 LEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTS----------- 123
+EYIE N+ G+ TY+LG N+F+DLT +EF A +T Y S
Sbjct: 70 VEYIETTNRRGDLTYQLGENQFADLTREEFIARFTSYNGDDDRTGDDDSVITTAAVGGGD 129
Query: 124 -STFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCC-WAFSAVAAVEGITKISGANLI 181
+ ++ P S+DWR K AV P K Q WAF AVA +E + I L+
Sbjct: 130 PDLWSSGGDDVSLDPPSVDWRAKGAVVPPKSQSSSCSSSWAFVAVATIESLHAIKTGKLV 189
Query: 182 QLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQK-AAA 240
LSEQQLVDC + GC GT +AF ++IQN G+ TE EYPY A QGTC++A+
Sbjct: 190 ALSEQQLVDCDQY-DGGCNRGTFRRAFHWVIQNGGLTTEAEYPYTAAQGTCNSAKSDHHV 248
Query: 241 AKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIV 300
A IS + VP +E A+ AV+ QPV+ I ++ + YK G+++G CG +L+HAVT+V
Sbjct: 249 AAISGHASVPGSNELAMKHAVATQPVAAAI-ELGSDMQFYKSGVYSGPCGARLEHAVTVV 307
Query: 301 GFGTTED-GANYWLIKNSWGDTWGDAGYMKILRD---EGLCGIGTQSSYP 346
G+G E G YW++KNSWG TWG+ GY+++ R GLCGI +YP
Sbjct: 308 GYGADESTGDKYWIVKNSWGQTWGERGYIRMQRKILGPGLCGIMLDVAYP 357
>gi|225709022|gb|ACO10357.1| Cathepsin L precursor [Caligus rogercresseyi]
Length = 332
Score = 244 bits (622), Expect = 6e-62, Method: Compositional matrix adjust.
Identities = 132/310 (42%), Positives = 184/310 (59%), Gaps = 14/310 (4%)
Query: 48 EKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKE---GNRTYKLGTNRFSDLTNDEF 104
E W HG+SY+ +E+++R KI EN I + N E G +Y + N + DL + EF
Sbjct: 28 ESWKLTHGKSYESSIEEKLRLKIHMENSLKISRHNAEAINGKHSYYMKMNHYGDLLHHEF 87
Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFS 164
A+ GY+ + + S S +N+ + PT +DWR+ AVTP+K+Q +CG CWAFS
Sbjct: 88 VAMVNGYEYVNKT--SLGGSFIPSKNVKL---PTHVDWREDGAVTPVKNQGQCGSCWAFS 142
Query: 165 AVAAVEGITKISGANLIQLSEQQLVDCSTN-GNNGCGGGTMEKAFEYIIQNQGIATEDEY 223
+ ++EG T LI LSEQ LVDCS GNNGC GG M+ AF YI N+GI TE Y
Sbjct: 143 STGSLEGQTFRKTGKLIPLSEQNLVDCSRKYGNNGCEGGLMDFAFTYIRDNKGIDTEGSY 202
Query: 224 PYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAV-SMQPVSIGIAAYTTEFKSYKE 282
PY+ V G C + + +V G E+ LLKAV S+ PVS+ I A F+ Y
Sbjct: 203 PYEGVGGRCHYDPSKKGSSDIGFVDVKKGSEEELLKAVASVGPVSVAIDASHMSFQFYSH 262
Query: 283 GI-FNGVCGTQ-LDHAVTIVGFGTTED-GANYWLIKNSWGDTWGDAGYMKILRD-EGLCG 338
G+ F C + LDH V +VG+GT E+ G +YWL+KNSW + WGD GY+K+ R+ + +CG
Sbjct: 263 GVYFESKCSPENLDHGVLVVGYGTDENSGEDYWLVKNSWSENWGDQGYIKMARNKKNMCG 322
Query: 339 IGTQSSYPLA 348
I + +SYP+
Sbjct: 323 IASSASYPVV 332
>gi|115715524|ref|XP_780580.2| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
Length = 334
Score = 244 bits (622), Expect = 6e-62, Method: Compositional matrix adjust.
Identities = 138/337 (40%), Positives = 201/337 (59%), Gaps = 14/337 (4%)
Query: 19 FIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYI 78
++ ++L+ +C VVSS S E +W +HG+ Y + E+ R I+++NL+ +
Sbjct: 3 YLSVLLVAAC---VVSSLSMSFIDFDEDWNQWKNEHGKRYLSDEEEASRRLIWQKNLDIV 59
Query: 79 EKANKE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD 135
K N + G+ TY LG N+F+DL N+EF +L G++ S ++T STF + ++ D
Sbjct: 60 IKHNLKYDLGHFTYDLGMNQFADLKNEEFVSLMNGFR--GNSSKATRGSTFLPPS-NVFD 116
Query: 136 VPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCS-TN 194
+PT +DWR K VTP+K+Q +CG CWAFSA ++EG L+ LSEQ LVDCS
Sbjct: 117 MPTMVDWRTKGYVTPVKNQLQCGSCWAFSATGSLEGQHFKKTGKLVSLSEQNLVDCSGKE 176
Query: 195 GNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDE 254
GN GC GG M++AF+YI+ GI TE YPY A+ G C + A + Y +V +G E
Sbjct: 177 GNMGCEGGLMDQAFQYILDVGGIDTEMSYPYTAMDGQCHFNKANIGATDTGYTDVTTGSE 236
Query: 255 QALLKAV-SMQPVSIGIAAYTTEFKSYKEGIFN--GVCGTQLDHAVTIVGFGTTEDGANY 311
AL AV S+ P+S+ I A F+ YK G++N T LDH V VG+GT+ DG +Y
Sbjct: 237 SALQMAVASVGPISVAIDASHQSFQLYKSGVYNEPACSSTLLDHGVLAVGYGTSSDGTDY 296
Query: 312 WLIKNSWGDTWGDAGYMKILRD-EGLCGIGTQSSYPL 347
+ +SWG WG GY+ + R+ + CGI T++SYPL
Sbjct: 297 FFFFHSWGAAWGMNGYLWMSRNKDNQCGIATKASYPL 333
>gi|293342577|ref|XP_001065834.2| PREDICTED: cathepsin L1 [Rattus norvegicus]
gi|293354413|ref|XP_573976.3| PREDICTED: cathepsin L1 [Rattus norvegicus]
gi|149039745|gb|EDL93861.1| rCG24317, isoform CRA_a [Rattus norvegicus]
Length = 330
Score = 244 bits (622), Expect = 6e-62, Method: Compositional matrix adjust.
Identities = 132/335 (39%), Positives = 191/335 (57%), Gaps = 16/335 (4%)
Query: 22 IILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKA 81
I LL + ++S+ TH+ S + E+W +HG++Y E + R +++ N++ I
Sbjct: 4 IFLLATLCLGMISAAPTHDPSFDTVWEEWKTKHGKTYNTNEEGQKR-AVWENNMKMINLH 62
Query: 82 NKE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPT 138
N++ G + L N F DLTN EFR L TG++ F + DVP
Sbjct: 63 NEDYLKGKHGFSLEMNAFGDLTNTEFRELMTGFQGQKTKMMKVFPEPF------LGDVPK 116
Query: 139 SLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCS-TNGNN 197
++DWR VTP+K+Q CG CWAFSAV ++EG L+ LSEQ LVDCS ++GN
Sbjct: 117 TVDWRKHGYVTPVKNQGPCGSCWAFSAVGSLEGQVFRKTGKLVPLSEQNLVDCSWSHGNK 176
Query: 198 GCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQAL 257
GC GG + AF+Y+ N G+ T YPY+A+ GTC K +AAK+ + +P E AL
Sbjct: 177 GCDGGLPDFAFQYVKDNGGLDTSVSYPYEALNGTCRYNPKYSAAKVVGFMSIPP-SENAL 235
Query: 258 LKAV-SMQPVSIGIAAYTTEFKSYKEGIF--NGVCGTQLDHAVTIVGFGTTEDGANYWLI 314
+KAV ++ P+S+GI F+ YK G++ T L+HAV +VG+G DG YWL+
Sbjct: 236 MKAVATVGPISVGIDIKHKSFQFYKGGMYYEPDCSSTNLNHAVLVVGYGEESDGRKYWLV 295
Query: 315 KNSWGDTWGDAGYMKILRD-EGLCGIGTQSSYPLA 348
KNSWG WG GY+K+ +D CGI + +SYP+
Sbjct: 296 KNSWGRDWGMDGYIKMAKDWNNNCGIASDASYPIV 330
>gi|156371477|ref|XP_001628790.1| predicted protein [Nematostella vectensis]
gi|156215775|gb|EDO36727.1| predicted protein [Nematostella vectensis]
Length = 330
Score = 244 bits (622), Expect = 6e-62, Method: Compositional matrix adjust.
Identities = 135/337 (40%), Positives = 192/337 (56%), Gaps = 13/337 (3%)
Query: 16 IPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENL 75
+ + + LL + AS V EQ + W H + Y E+ R I+++NL
Sbjct: 1 MKLLVAACLLFAVASGFVVKFDEDEQQW----QAWKLFHTKKYTTVTEEGARKAIWRDNL 56
Query: 76 EYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD 135
+ I+K N EG+ ++ L N DLT DEFR YTG + ++ S F S
Sbjct: 57 KKIQKHNAEGH-SFTLAMNHLGDLTQDEFRYFYTGMRSHYSNYTKKQGSAFLAP--SHVQ 113
Query: 136 VPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTN- 194
VP ++DWR + VTP+K+Q +CG CWAFS ++EG L+ LSEQ LVDCST
Sbjct: 114 VPDTVDWRKEGYVTPVKNQGQCGSCWAFSTTGSLEGQNFKKTGKLVSLSEQNLVDCSTAY 173
Query: 195 GNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDE 254
GNNGC GG M+ AF+YI +N GI TE+ YPY+A C + A + + +V GDE
Sbjct: 174 GNNGCQGGLMDYAFKYIKENGGIDTEESYPYEARNDRCRFQKSNIGAVDTGFVDVTHGDE 233
Query: 255 QALLKAV-SMQPVSIGIAAYTTEFKSYKEGIFN--GVCGTQLDHAVTIVGFGTTEDGANY 311
+AL A ++ P+S+ I A F+ Y G++N G T LDH V +VG+GT + G++Y
Sbjct: 234 EALKTAAGTVGPISVAIDAGHMSFQFYHSGVYNNAGCSSTSLDHGVLVVGYGTYQ-GSDY 292
Query: 312 WLIKNSWGDTWGDAGYMKILRDE-GLCGIGTQSSYPL 347
WL+KNSWG+ WG GY+ + R++ CG+ TQ+SYPL
Sbjct: 293 WLVKNSWGERWGMEGYIMMSRNKNNQCGVATQASYPL 329
>gi|427797099|gb|JAA64001.1| Putative cathepsin l cathepsin l, partial [Rhipicephalus
pulchellus]
Length = 331
Score = 244 bits (622), Expect = 6e-62, Method: Compositional matrix adjust.
Identities = 132/318 (41%), Positives = 189/318 (59%), Gaps = 9/318 (2%)
Query: 38 THEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRT---YKLGTN 94
THE+ V + A HG+ Y+ + E+ R KI+ EN I + N++ ++ YKL N
Sbjct: 14 THEELVGAEWSAFKALHGKEYESDTEEYYRLKIYMENRLKIARHNEKYAKSQVSYKLAMN 73
Query: 95 RFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQ 154
F D+ + EF + G+K S + + L +P ++DWR K AVTP+K+Q
Sbjct: 74 EFGDMLHHEFVSTRNGFKRNYRDTPREGSFFVEPEGLEDFHLPKTVDWRKKGAVTPVKNQ 133
Query: 155 QECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTN-GNNGCGGGTMEKAFEYIIQ 213
+CG CW+FS ++EG L+ LSEQ L+DCS + GNNGC GG M+ AF+YI
Sbjct: 134 GQCGSCWSFSTTGSLEGQHFRKLHKLVSLSEQNLIDCSRSFGNNGCEGGLMDYAFKYIKA 193
Query: 214 NQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAV-SMQPVSIGIAA 272
N+GI TE YPY A G C + A A + + ++P GDE L KAV ++ PVS+ I A
Sbjct: 194 NKGIDTEQSYPYNATDGVCHFNKSAVGATDTGFVDIPEGDENKLKKAVATVGPVSVAIDA 253
Query: 273 YTTEFKSYKEGIFNGV-CGT-QLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKI 330
F+ Y EG+++ C + QLDH V +VG+G T+DG +YWL+KNSWG TWGD GY+ +
Sbjct: 254 SHESFQFYSEGVYDEPECDSEQLDHGVLVVGYG-TKDGQDYWLVKNSWGTTWGDGGYIYM 312
Query: 331 LRD-EGLCGIGTQSSYPL 347
R+ + CGI + +SYPL
Sbjct: 313 SRNKDNQCGIASAASYPL 330
>gi|334332718|ref|XP_001367502.2| PREDICTED: cathepsin L1-like [Monodelphis domestica]
Length = 333
Score = 244 bits (622), Expect = 6e-62, Method: Compositional matrix adjust.
Identities = 128/332 (38%), Positives = 204/332 (61%), Gaps = 13/332 (3%)
Query: 23 ILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKAN 82
+ L S + ++ ++++ +W AQHG+SY+ E +R +++NL+ IE+ N
Sbjct: 5 LCLASLCLGLAAAIPPFDRALDSQWHQWKAQHGKSYEAN-EDSLRRATWEKNLKMIERHN 63
Query: 83 KE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTS 139
+E G +++L N+F D++ +EF+ + GYK + S R T S ++ L+ +P S
Sbjct: 64 QEYSAGKHSFQLRMNKFGDMSTEEFKQVMNGYK-SNGSQRRTKGSLYRESLLAQ--LPES 120
Query: 140 LDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCS-TNGNNG 198
+DWR+K VTP+K+Q +CG CW+FSAV A+EG L+ LS Q L+DC+ GNNG
Sbjct: 121 VDWREKGYVTPVKEQGDCGACWSFSAVGAIEGQWFRKTGKLVSLSIQNLIDCTIPEGNNG 180
Query: 199 CGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALL 258
C GG M+ AF+Y+ N GI TE+ YPY A C + + A I+ + ++PS DE+AL+
Sbjct: 181 CDGGFMDNAFQYVQDNGGIDTEECYPYVAQDTECKYKPECSGANITGFVDIPSMDERALM 240
Query: 259 KAV-SMQPVSIGIAAYTTEFKSYKEGIF--NGVCGTQLDHAVTIVGFGTTEDGANYWLIK 315
+AV ++ P+S+GI + FK Y+ G++ +QLDH V +VG+G+ YW++K
Sbjct: 241 EAVATVGPISVGIDSANPSFKFYQSGVYYEPDCSSSQLDHGVLVVGYGSIGKD-EYWIVK 299
Query: 316 NSWGDTWGDAGYMKILRD-EGLCGIGTQSSYP 346
NSWG+ WGD GY+ + +D + CGI T++SYP
Sbjct: 300 NSWGEAWGDNGYILMAKDKDNHCGIATEASYP 331
>gi|112490572|pdb|2FO5|A Chain A, Crystal Structure Of Recombinant Barley Cysteine
Endoprotease B Isoform 2 (Ep-B2) In Complex With
Leupeptin
gi|112490573|pdb|2FO5|B Chain B, Crystal Structure Of Recombinant Barley Cysteine
Endoprotease B Isoform 2 (Ep-B2) In Complex With
Leupeptin
gi|112490574|pdb|2FO5|C Chain C, Crystal Structure Of Recombinant Barley Cysteine
Endoprotease B Isoform 2 (Ep-B2) In Complex With
Leupeptin
gi|112490575|pdb|2FO5|D Chain D, Crystal Structure Of Recombinant Barley Cysteine
Endoprotease B Isoform 2 (Ep-B2) In Complex With
Leupeptin
Length = 262
Score = 243 bits (621), Expect = 7e-62, Method: Compositional matrix adjust.
Identities = 110/223 (49%), Positives = 151/223 (67%), Gaps = 8/223 (3%)
Query: 133 MTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCS 192
++D+P S+DWR K AVT +KDQ +CG CWAFS V +VEGI I +L+ LSEQ+L+DC
Sbjct: 1 VSDLPPSVDWRQKGAVTGVKDQGKCGSCWAFSTVVSVEGINAIRTGSLVSLSEQELIDCD 60
Query: 193 TNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKA----AAAKISNYEE 248
T N+GC GG M+ AFEYI N G+ TE YPY+A +GTC+ A+ A I +++
Sbjct: 61 TADNDGCQGGLMDNAFEYIKNNGGLITEAAYPYRAARGTCNVARAAQNSPVVVHIDGHQD 120
Query: 249 VPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDG 308
VP+ E+ L +AV+ QPVS+ + A F Y EG+F G CGT+LDH V +VG+G EDG
Sbjct: 121 VPANSEEDLARAVANQPVSVAVEASGKAFMFYSEGVFTGECGTELDHGVAVVGYGVAEDG 180
Query: 309 ANYWLIKNSWGDTWGDAGYMKILRDE----GLCGIGTQSSYPL 347
YW +KNSWG +WG+ GY+++ +D GLCGI ++SYP+
Sbjct: 181 KAYWTVKNSWGPSWGEQGYIRVEKDSGASGGLCGIAMEASYPV 223
>gi|157644745|gb|ABV59078.1| cathepsin L [Lates calcarifer]
Length = 337
Score = 243 bits (621), Expect = 8e-62, Method: Compositional matrix adjust.
Identities = 137/341 (40%), Positives = 194/341 (56%), Gaps = 17/341 (4%)
Query: 19 FIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYI 78
+ + +L C S +S+ S Q + + + W + H + Y E E+ R ++++NL+ I
Sbjct: 1 MLPLAVLAVCLSAALSAPSLDPQ-LDDHWDLWKSWHSKKYH-EKEEGWRRMVWEKNLKKI 58
Query: 79 EKANKE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD 135
E N E G Y+LG N F D+T++EFR + GYK + R S F N +
Sbjct: 59 ELHNLEHSMGKHPYRLGMNHFGDMTHEEFRQIMNGYKQ-RKTERKFKGSLFMEPNF--LE 115
Query: 136 VPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCS-TN 194
P +LDWRDK VTP+KDQ +CG CWAFS A+EG L+ LSEQ LVDCS
Sbjct: 116 APRALDWRDKGYVTPVKDQGQCGSCWAFSTTGALEGQQFRKTGKLVSLSEQNLVDCSRPE 175
Query: 195 GNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQG-TCSAAQKAAAAKISNYEEVPSGD 253
GN GC GG M++AF+Y+ NQG+ +ED YPY C +A + + +VPSG
Sbjct: 176 GNEGCNGGLMDQAFQYVKDNQGLDSEDSYPYLGTDDQPCHYDPNYNSANDTGFVDVPSGK 235
Query: 254 EQALLKAV-SMQPVSIGIAAYTTEFKSYKEGIF--NGVCGTQLDHAVTIVGF---GTTED 307
E+AL+KAV ++ PVS+ I A F+ Y+ GI+ +LDH V +VG+ G D
Sbjct: 236 ERALMKAVAAVGPVSVAIDAGHESFQFYQSGIYYEKDCSSEELDHGVLVVGYGYEGEDVD 295
Query: 308 GANYWLIKNSWGDTWGDAGYMKILRD-EGLCGIGTQSSYPL 347
G YW++KNSW + WGD GY+ + +D + CGI T +SYPL
Sbjct: 296 GKKYWIVKNSWSEKWGDKGYIYMAKDRKNHCGIATAASYPL 336
>gi|94448674|emb|CAI91575.1| cathepsin L2 [Lubomirskia baicalensis]
Length = 324
Score = 243 bits (621), Expect = 8e-62, Method: Compositional matrix adjust.
Identities = 132/306 (43%), Positives = 187/306 (61%), Gaps = 15/306 (4%)
Query: 50 WMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKE-GNRTYKLGTNRFSDLTNDEFRALY 108
W A+HG+SY++ E+ +R ++ N +YI++ N+ G Y L N+F DL N EF++LY
Sbjct: 25 WKAEHGKSYRNHKEEMLRHVTWQANKKYIDEHNQHAGVFGYTLKMNQFGDLENSEFKSLY 84
Query: 109 TGYKMP-SPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVA 167
GY+M +P + Q D+P S+DW K VTP+K+Q +CG CW+FSA
Sbjct: 85 NGYRMSNAPRKGKPFVPAARVQ-----DLPASVDWSKKGWVTPVKNQGQCGSCWSFSATG 139
Query: 168 AVEGITKISGANLIQLSEQQLVDCS-TNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQ 226
++EG + L+ LSEQ LVDCS GN+GC GG M+ AFEY+I+N GI TE YPY+
Sbjct: 140 SMEGQHFNATGTLMSLSEQNLVDCSAAEGNHGCNGGLMDDAFEYVIKNNGIDTEASYPYR 199
Query: 227 AVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAV-SMQPVSIGIAAYTTEFKSYKEGIF 285
AV TC A IS Y +V E L AV ++ PVS+ I A F+ Y G++
Sbjct: 200 AVDSTCKFNTADVGATISGYVDVTKDSESDLQVAVATIGPVSVAIDASHISFQFYSSGVY 259
Query: 286 NG-VC-GTQLDHAVTIVGFGTTEDGA-NYWLIKNSWGDTWGDAGYMKILRDE-GLCGIGT 341
+ +C T LDH V VG+GT DG+ +YWL+KNSWG +WG +GY++++R+ CGI T
Sbjct: 260 DPLICSSTNLDHGVLAVGYGT--DGSKDYWLVKNSWGASWGMSGYIEMVRNHNNKCGIAT 317
Query: 342 QSSYPL 347
+SYP+
Sbjct: 318 SASYPV 323
>gi|110625773|ref|NP_081620.2| cathepsin L-like 3 precursor [Mus musculus]
gi|74208432|dbj|BAE26401.1| unnamed protein product [Mus musculus]
gi|187955662|gb|AAI47425.1| RIKEN cDNA 2310051M13 gene [Mus musculus]
gi|187957686|gb|AAI47424.1| RIKEN cDNA 2310051M13 gene [Mus musculus]
Length = 331
Score = 243 bits (621), Expect = 8e-62, Method: Compositional matrix adjust.
Identities = 136/334 (40%), Positives = 192/334 (57%), Gaps = 15/334 (4%)
Query: 22 IILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKA 81
+ LL + VVS+ H S+ + E+W +H ++Y E + R +++ N + I+
Sbjct: 4 VFLLATLCLGVVSAAPAHNPSLDAVWEEWKTKHKKTYNMNDEGQKR-AVWENNKKMIDLH 62
Query: 82 NKE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPT 138
N++ G + L N F DLTN EFR L TG++ + T +Q + DVP
Sbjct: 63 NEDYLKGKHGFSLEMNAFGDLTNTEFRELMTGFQ-----GQKTKMMMKVFQEPLLGDVPK 117
Query: 139 SLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCS-TNGNN 197
S+DWRD VTP+KDQ CG CWAFSAV ++EG L+ LS Q LVDCS + GN
Sbjct: 118 SVDWRDHGYVTPVKDQGSCGSCWAFSAVGSLEGQMFRKTGKLVPLSVQNLVDCSWSQGNQ 177
Query: 198 GCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQAL 257
GC GG + AF+Y+ N G+ T YPY+A+ GTC K +AA ++ + V S E AL
Sbjct: 178 GCDGGLPDLAFQYVKDNGGLDTSVSYPYEALNGTCRYNPKNSAATVTGFVNVQS-SEDAL 236
Query: 258 LKAV-SMQPVSIGIAAYTTEFKSYKEGIF--NGVCGTQLDHAVTIVGFGTTEDGANYWLI 314
+KAV ++ P+S+GI F+ YKEG++ T LDHAV +VG+G DG YWL+
Sbjct: 237 MKAVATVGPISVGIDTKHKSFQFYKEGMYYEPDCSSTVLDHAVLVVGYGEESDGRKYWLV 296
Query: 315 KNSWGDTWGDAGYMKILRDE-GLCGIGTQSSYPL 347
KNSWG WG GY+K+ +D CGI + +SYP+
Sbjct: 297 KNSWGRDWGMNGYIKMAKDRNNNCGIASDASYPV 330
>gi|291383488|ref|XP_002708302.1| PREDICTED: cathepsin L1 [Oryctolagus cuniculus]
Length = 344
Score = 243 bits (620), Expect = 9e-62, Method: Compositional matrix adjust.
Identities = 134/341 (39%), Positives = 209/341 (61%), Gaps = 20/341 (5%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEY 77
M + + L V C S ++S+ T + S+ +W+A H R Y E+E R ++++N++
Sbjct: 1 MHLPLFLAVLC-SGMISAAPTPDHSLDTRWRQWLAAHKRRYGVR-EEEWRRAVWEKNMQM 58
Query: 78 IEKANKE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT 134
IEK N+E G + + N + D+TN+EFR + G++ + +H+ ++ N +
Sbjct: 59 IEKHNREYSQGKHGFTMAMNAYGDMTNEEFRLMMNGFE--NQNHKRGE----EFHNSLLF 112
Query: 135 DVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCS-T 193
+P LDWR++ VTP+K+Q+ CG WAFSA A+EG L+ LSEQ LVDCS
Sbjct: 113 KIPAFLDWRERGYVTPVKNQELCGSSWAFSATGALEGQMFRKTGRLVSLSEQNLVDCSWP 172
Query: 194 NGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGD 253
GN GC GG M+ AF+Y+ N+G+ +E+ YPY+ +G+C + +AA ++ + +V S D
Sbjct: 173 QGNQGCSGGLMDYAFQYVKDNRGLDSEESYPYEQRKGSCKYNPRFSAANVTGFVDV-SKD 231
Query: 254 EQALLKAV-SMQPVSIGIAAYTTEFKSYKEGI-FNGVCGTQ-LDHAVTIVGFGTTEDGA- 309
E+AL++AV ++ PVS+GIA F Y+ GI ++ C ++ ++HAV +VG+G E G+
Sbjct: 232 EKALMEAVATVGPVSVGIATTPESFLFYEGGIYYDPKCSSENVNHAVLVVGYGFEEVGSK 291
Query: 310 --NYWLIKNSWGDTWGDAGYMKILRDE-GLCGIGTQSSYPL 347
YWLIKNSWG WG GYMK+ +D+ CGI T +SYPL
Sbjct: 292 NNKYWLIKNSWGKDWGMGGYMKMAKDQNNHCGIATAASYPL 332
>gi|443724292|gb|ELU12369.1| hypothetical protein CAPTEDRAFT_165495 [Capitella teleta]
Length = 351
Score = 243 bits (620), Expect = 9e-62, Method: Compositional matrix adjust.
Identities = 129/306 (42%), Positives = 187/306 (61%), Gaps = 16/306 (5%)
Query: 54 HGRSYKDELEKEMRFKIFKENLEYIEKAN---KEGNRTYKLGTNRFSDLTNDEFRALYTG 110
H R+Y E E+ R ++F+ NL+ IE N +G +Y++G N+F+D+ EF ++ G
Sbjct: 51 HERNYG-ETEEMQRKEVFRNNLKKIEMHNYLHSQGKSSYRMGINQFADMEVKEFASVVNG 109
Query: 111 YKMPSPSHRSTTSSTFKYQNLSM---TDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVA 167
++M ++R+ +S +P +DWR + VTPIKDQ CG CW+FS
Sbjct: 110 FRM---NNRTKVRDHLHSHYISPAIPVSLPAEVDWRKEGYVTPIKDQGHCGSCWSFSTTG 166
Query: 168 AVEGITKISGANLIQLSEQQLVDCSTN-GNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQ 226
A+EG L+ LSEQ L+DCST+ GNNGC GG M+ AF+YI N G TED YPY+
Sbjct: 167 ALEGQHFRKTGKLVSLSEQNLIDCSTSYGNNGCNGGVMDYAFQYIKDNDGDDTEDSYPYE 226
Query: 227 AVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAVSM-QPVSIGIAAYTTEFKSYKEGIF 285
A G C ++ A + Y ++P GDE+ + +AV+M PVS+ I A T F+ Y+ G++
Sbjct: 227 AADGPCRFKKEYVGATDTGYTDLPKGDEEKMKEAVAMVGPVSVAIDASHTSFQMYQSGVY 286
Query: 286 NGV-CGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRDE-GLCGIGTQ 342
+ V C + LDH V +VG+G TE G +YWL+KNSWG WGD GY+K+ R++ CGI +
Sbjct: 287 DEVECDPEGLDHGVLVVGYG-TELGQDYWLVKNSWGTKWGDEGYIKMSRNKNNQCGISSM 345
Query: 343 SSYPLA 348
+SYPL
Sbjct: 346 ASYPLV 351
>gi|42573181|ref|NP_974687.1| Xylem cysteine proteinase 1 [Arabidopsis thaliana]
gi|332661102|gb|AEE86502.1| Xylem cysteine proteinase 1 [Arabidopsis thaliana]
Length = 288
Score = 243 bits (620), Expect = 9e-62, Method: Compositional matrix adjust.
Identities = 116/250 (46%), Positives = 169/250 (67%), Gaps = 5/250 (2%)
Query: 38 THEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFS 97
T+ ++E+ E WM++H ++YK EK RF++F+ENL +I++ N E N +Y LG N F+
Sbjct: 42 TNTDKLLELFESWMSEHSKAYKSVEEKVHRFEVFRENLMHIDQRNNEIN-SYWLGLNEFA 100
Query: 98 DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQEC 157
DLT++EF+ Y G P S + S+ F+Y+++ TD+P S+DWR K AV P+KDQ +C
Sbjct: 101 DLTHEEFKGRYLGLAKPQFSRKRQPSANFRYRDI--TDLPKSVDWRKKGAVAPVKDQGQC 158
Query: 158 GCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGI 217
G CWAFS VAAVEGI +I+ NL LSEQ+L+DC T N+GC GG M+ AF+YII G+
Sbjct: 159 GSCWAFSTVAAVEGINQITTGNLSSLSEQELIDCDTTFNSGCNGGLMDYAFQYIISTGGL 218
Query: 218 ATEDEYPYQAVQGTCSAAQK-AAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTE 276
ED+YPY +G C ++ IS YE+VP D+++L+KA++ QPVS+ I A +
Sbjct: 219 HKEDDYPYLMEEGICQEQKEDVERVTISGYEDVPENDDESLVKALAHQPVSVAIEASGRD 278
Query: 277 FKSYKEGIFN 286
F+ YK G++N
Sbjct: 279 FQFYK-GVYN 287
>gi|346466067|gb|AEO32878.1| hypothetical protein [Amblyomma maculatum]
Length = 358
Score = 243 bits (620), Expect = 9e-62, Method: Compositional matrix adjust.
Identities = 136/353 (38%), Positives = 198/353 (56%), Gaps = 22/353 (6%)
Query: 11 FKINTIPM--------FIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDEL 62
F + +P+ F+++ L A+ + TH++ V + A HG+ Y E
Sbjct: 11 FLVTHVPLNGIWKNEGFVVLGCLFVTAAAI-----THQELVGAEWSAFKALHGKEYHSET 65
Query: 63 EKEMRFKIFKENLEYIEKANKE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHR 119
E+ R KI+ EN I + N++ +YKL N F DL + EF + G+K S
Sbjct: 66 EEYYRLKIYMENRLKIARHNEKYANNKASYKLAMNEFGDLLHHEFVSTRNGFKRNYRSTP 125
Query: 120 STTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGAN 179
S + + + +P ++DWR K AVTP+K+Q +CG CWAFS ++EG
Sbjct: 126 REGSFYIEPEGIEDKHLPKTVDWRKKGAVTPVKNQGQCGSCWAFSTTGSLEGQHFRKTGR 185
Query: 180 LIQLSEQQLVDCSTN-GNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKA 238
++ LSEQ LVDCS GNNGC GG M+ AF+YI N GI TE YPY G C +
Sbjct: 186 MVSLSEQNLVDCSGKFGNNGCEGGLMDNAFKYIKANGGIDTELSYPYNGTDGICHFEKSD 245
Query: 239 AAAKISNYEEVPSGDEQALLKAV-SMQPVSIGIAAYTTEFKSYKEGIFNGV-CGTQ-LDH 295
A + + ++P G+EQ L KAV ++ PVS+ I A F+ Y +G+++ C ++ LDH
Sbjct: 246 VGATDTGFVDIPEGNEQLLKKAVATVGPVSVAIDASHESFQFYSQGVYDEPECSSESLDH 305
Query: 296 AVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD-EGLCGIGTQSSYPL 347
V +VG+G T+DG +YWL+KNSWG TWGD GY+ + R+ E CGI + +SYPL
Sbjct: 306 GVLVVGYG-TKDGQDYWLVKNSWGTTWGDDGYIYMTRNKENQCGIASSASYPL 357
>gi|303283194|ref|XP_003060888.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226457239|gb|EEH54538.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 422
Score = 243 bits (620), Expect = 9e-62, Method: Compositional matrix adjust.
Identities = 130/314 (41%), Positives = 191/314 (60%), Gaps = 17/314 (5%)
Query: 48 EKWMAQHGRSYKDELEKEMRFKIFKENLEYIE---KANKEGNRTYKLGTNRFSDLTNDEF 104
++W+A HG++Y E+ R IF +N E++ +A+ G +++ L N +DLT +EF
Sbjct: 71 DRWLATHGKAYACPKERAKRLAIFADNAEFVRVHNEAHAAGKKSHWLRLNHLADLTREEF 130
Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDV--PTSLDWRDKKAVTPIKDQQECGCCWA 162
+ + GY S ++S N DV P ++DW + AVTP+K+Q +CG CWA
Sbjct: 131 KHML-GYDA-SKKRVESSSPPVDAANWEYADVTPPETMDWVSRGAVTPVKNQGQCGSCWA 188
Query: 163 FSAVAAVEGITKISGANLIQLSEQQLVDCST-NGNNGCGGGTMEKAFEYIIQNQGIATED 221
FS V AVEG+ + +LI LSEQ+LV C+ GNNGC GG M+ FE+I++N+G+ E+
Sbjct: 189 FSTVGAVEGVVAVKTGDLISLSEQELVSCAKIGGNNGCKGGLMDNGFEWIVENRGVDDEE 248
Query: 222 EYPYQAVQGTCS--AAQKAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKS 279
++ Y A C+ ++A AA I +++VP DE AL KAVS QPV++ I A EF+
Sbjct: 249 DWGYLAKDRRCNWFKKRRAKAASIDGFKDVPRNDEDALKKAVSQQPVAVAIEADHREFQL 308
Query: 280 YKEGIFNGVCGTQLDHAVTIVGFGTTEDGA---NYWLIKNSWGDTWGDAGYMKILR---- 332
Y G+F+G CGT LDH V +VG+G + A +YW +KNSWG WG+ GY++I R
Sbjct: 309 YSGGVFDGECGTNLDHGVLVVGYGYDGESAGHKHYWTVKNSWGAKWGEEGYIRIARGGMG 368
Query: 333 DEGLCGIGTQSSYP 346
G CG+ Q+SYP
Sbjct: 369 PAGQCGVAMQASYP 382
>gi|2765358|emb|CAA74241.1| cathepsin L [Litopenaeus vannamei]
Length = 325
Score = 243 bits (620), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 127/314 (40%), Positives = 188/314 (59%), Gaps = 14/314 (4%)
Query: 42 SVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKAN---KEGNRTYKLGTNRFSD 98
S+ + + + A+HGR Y E+ R +F++N ++I+ N + G T+ L N+F D
Sbjct: 17 SLRQQWQNFKAEHGRRYASVQEERYRLSVFEQNQQFIDDHNARFENGEVTFTLQMNQFGD 76
Query: 99 LTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECG 158
+T++E A G+ + +P+ R ++ K + ++ P +DWR K AVTP+KDQ++CG
Sbjct: 77 MTSEEIVATMNGF-LGAPTRRP--AAVLKADDETL---PEKVDWRTKGAVTPVKDQKQCG 130
Query: 159 CCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNN-GCGGGTMEKAFEYIIQNQGI 217
CWAFS ++EG + L+ LSEQ LVDCS N GC GG M++AF YI N+GI
Sbjct: 131 SCWAFSTTGSLEGQHFLKDGKLVSLSEQNLVDCSDKFRNMGCMGGLMDQAFRYIKANKGI 190
Query: 218 ATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAV-SMQPVSIGIAAYTTE 276
TED YPY+A G C A + Y +V G E AL KAV ++ P+S+GI A +
Sbjct: 191 DTEDSYPYEAQDGKCRFDASNVGATDTGYVDVEHGSESALKKAVATIGPISVGIDASQST 250
Query: 277 FKSYKEGIFNG--VCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRDE 334
F Y G+++ T LDH V VG+G+ E+G ++WL+KNSW +WGD GY+K+ R+
Sbjct: 251 FHFYHTGVYHDDHCSSTMLDHGVLAVGYGSDENGGDFWLVKNSWNTSWGDKGYIKMSRNR 310
Query: 335 -GLCGIGTQSSYPL 347
CGI +Q+SYPL
Sbjct: 311 NNNCGIASQASYPL 324
>gi|66810271|ref|XP_638859.1| cysteine proteinase 3 [Dictyostelium discoideum AX4]
gi|166201983|sp|Q23894.2|CYSP3_DICDI RecName: Full=Cysteine proteinase 3; AltName: Full=Cysteine
proteinase II; Flags: Precursor
gi|60467526|gb|EAL65548.1| cysteine proteinase 3 [Dictyostelium discoideum AX4]
Length = 337
Score = 243 bits (620), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 130/344 (37%), Positives = 210/344 (61%), Gaps = 13/344 (3%)
Query: 11 FKINTIPMFIIIILLVSCASQ-VVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFK 69
+++ +F +I+L +S S V S ++ S ++ WM + ++Y + E R++
Sbjct: 1 MRLSITLIFTLIVLSISFISAGNVFSHKQYQDSFID----WMRSNNKAYTHK-EFMPRYE 55
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQ 129
FK+N++Y+ N +G++T LG N+ +DL+N+E+R Y G + + +
Sbjct: 56 EFKKNMDYVHNWNSKGSKTV-LGLNQHADLSNEEYRLNYLGTRAHIKLNGYHKRNLGLRL 114
Query: 130 NLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLV 189
N P ++DWR+K AVTP+KDQ +CG C++FS +VEG+T I L+ LSEQ ++
Sbjct: 115 NRPQFKQPLNVDWREKDAVTPVKDQGQCGSCYSFSTTGSVEGVTAIKTGKLVSLSEQNIL 174
Query: 190 DCSTN-GNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQ-AVQGTCSAAQKAAAAKISNYE 247
DCS++ GN GC GG M AFEYII+N G+ +E++YPY+ V C + + AAKI++Y+
Sbjct: 175 DCSSSFGNEGCNGGLMTNAFEYIIKNNGLNSEEQYPYEMKVNDECKFQEGSVAAKITSYK 234
Query: 248 EVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGI-FNGVCGTQ-LDHAVTIVGFGTT 305
E+ +GDE L A+ + PVS+ I A F+ Y G+ + C ++ LDH V VG G T
Sbjct: 235 EIEAGDENDLQNALLLNPVSVAIDASHNSFQLYTAGVYYEPACSSEDLDHGVLAVGMG-T 293
Query: 306 EDGANYWLIKNSWGDTWGDAGYMKILRD-EGLCGIGTQSSYPLA 348
++G +Y+++KNSWG +WG GY+ + R+ + CGI T +SYP+A
Sbjct: 294 DNGEDYYIVKNSWGPSWGLNGYIHMARNKDNNCGISTMASYPIA 337
>gi|340368362|ref|XP_003382721.1| PREDICTED: cathepsin L1-like [Amphimedon queenslandica]
Length = 329
Score = 243 bits (620), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 128/306 (41%), Positives = 184/306 (60%), Gaps = 12/306 (3%)
Query: 48 EKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRT-YKLGTNRFSDLTNDEFRA 106
E W A +G+SY E++ R ++EN I+ N + ++ Y L N F DLT+ EF +
Sbjct: 28 ELWKATYGKSYLTLEEEKYRRDTWEENSLLIKTHNTDSDKHGYTLEMNSFGDLTSAEFSS 87
Query: 107 LYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAV 166
LY GY+ + S SS+ + +P+SLDWRDKK VT +K+Q +CG CWAFS
Sbjct: 88 LYNGYRQNLETSGSVFSSSLR------NAMPSSLDWRDKKVVTDVKNQGKCGSCWAFSTT 141
Query: 167 AAVEGITKISGANLIQLSEQQLVDCSTN-GNNGCGGGTMEKAFEYIIQNQGIATEDEYPY 225
++EG+ + +L+ LSEQQL+DCS GNNGC GG M AF+YI G TE+ YPY
Sbjct: 142 GSLEGLHALKTGHLVSLSEQQLMDCSVKYGNNGCDGGNMRSAFQYIKDAGGDDTEESYPY 201
Query: 226 QAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAV-SMQPVSIGIAAYTTEFKSYKEGI 284
A +C K A Y +PSGDE +L+ A+ + P+S+ + A F+ YK+GI
Sbjct: 202 TAKNESCRFDPKKVGATDEGYVRIPSGDEVSLMHALYEVGPISVAMDAGLKTFQFYKKGI 261
Query: 285 FNG-VC-GTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRDEG-LCGIGT 341
++ +C T L+H VT++G+G + DG+ YWL+KNSWG WG GY + R G +CG+ T
Sbjct: 262 YSDYLCSNTHLNHGVTLIGYGESSDGSPYWLVKNSWGKDWGIDGYFMLARYVGNMCGVAT 321
Query: 342 QSSYPL 347
+SYP+
Sbjct: 322 DASYPI 327
>gi|227018328|gb|ACP18830.1| cysteine proteinase 1 [Chrysomela tremula]
Length = 323
Score = 243 bits (620), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 137/332 (41%), Positives = 204/332 (61%), Gaps = 21/332 (6%)
Query: 24 LLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKAN- 82
L+++ A+ VV+ + +Q E+ + HG++YK E+++RF IF++ L I N
Sbjct: 3 LIIAFAAFVVAINAASDQ---ELWADFKKAHGKTYKSLREEKLRFNIFQDTLREIAAHNA 59
Query: 83 --KEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSL 140
+ G TY L N+FSD+T++EFRA+ PS + NL++ P S+
Sbjct: 60 KYESGESTYYLAINQFSDITDEEFRAMLMKNVESRPSLED-----MEIANLTVGAAPESI 114
Query: 141 DWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCST-NGNNGC 199
DWR + AV PI++Q++CG CWAFSAVAAVEG I + LS QQLVDCST GN+GC
Sbjct: 115 DWRTEGAVLPIRNQEDCGSCWAFSAVAAVEGQAAIKSGSKTPLSVQQLVDCSTEGGNSGC 174
Query: 200 GGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLK 259
GG M AF+YI N G+ ++ +YPY +C A + ++ K++ Y++V S E +L +
Sbjct: 175 NGGLMNGAFDYIKAN-GLESDAKYPYTGTDDSCKADKSSSLVKLTGYKKVAS-SEASLKE 232
Query: 260 AV-SMQPVSIGIAAYTTEFKSYKEGIFNGV--CGTQLDHAVTIVGFGTTEDGANYWLIKN 316
AV ++ P+S +A Y ++SY GIFN + G LDH VT VG+G T++G YW +KN
Sbjct: 233 AVGTVGPIS--VAVYADLWRSYGGGIFNNILCLGFGLDHGVTAVGYG-TDNGKKYWPVKN 289
Query: 317 SWGDTWGDAGYMKILRDE-GLCGIGTQSSYPL 347
SWG++WG+ GY+++ RD CGI Q+SYP+
Sbjct: 290 SWGESWGEEGYIRMARDTLHNCGINQQASYPI 321
>gi|391328505|ref|XP_003738729.1| PREDICTED: cathepsin L-like [Metaseiulus occidentalis]
Length = 323
Score = 243 bits (619), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 129/301 (42%), Positives = 187/301 (62%), Gaps = 13/301 (4%)
Query: 54 HGRSYKDELEKEMRFKIFKENLEYIEKANKE---GNRTYKLGTNRFSDLTNDEFRALYTG 110
HG+SY + E+ R ++F +++ I N G TY++G N+F+D+T++EFR + G
Sbjct: 26 HGKSYGHD-EEHFRRQLFYKSVAKINAHNLRHDLGLTTYRMGLNKFTDMTSEEFRN-FKG 83
Query: 111 YKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVE 170
K + ++ + T + L +PT +DWR+K VTP+K+Q +CG CWAFS ++E
Sbjct: 84 LKFDAT--KTKRNGTRFQKELLGEALPTQVDWREKGYVTPVKNQGQCGSCWAFSTTGSLE 141
Query: 171 GITKISGANLIQLSEQQLVDCS-TNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQ 229
G + L+ LSEQ LVDCS GNNGC GG M+ F YI QN GI TE+ YPY
Sbjct: 142 GQHFKATGKLVSLSEQNLVDCSRVEGNNGCNGGLMDNGFTYIQQNGGIDTEESYPYTGKD 201
Query: 230 GTCSAAQKAAAAKISNYEEVPSGDEQALLKAV-SMQPVSIGIAAYTTEFKSYKEGIFN-- 286
G C+ + + A++ + +VP DE AL AV S+ PVS+ I A F+ YKEG+++
Sbjct: 202 GDCAFNENSVGARVKGFVDVPQRDEAALQAAVASVGPVSVAIDASNDSFQYYKEGVYDEP 261
Query: 287 GVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD-EGLCGIGTQSSY 345
+QLDH V +VG+G TE+G +YWL+KNSWG TWG GY+K++R+ E CGI + +SY
Sbjct: 262 SCSFSQLDHGVLVVGYG-TENGVDYWLVKNSWGPTWGQDGYIKMMRNKENQCGIASMASY 320
Query: 346 P 346
P
Sbjct: 321 P 321
>gi|12805315|gb|AAH02125.1| Ctss protein [Mus musculus]
Length = 340
Score = 243 bits (619), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 131/306 (42%), Positives = 186/306 (60%), Gaps = 15/306 (4%)
Query: 50 WMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKE---GNRTYKLGTNRFSDLTNDEFRA 106
W H + YKD+ E+E+R I+++NL++I N E G TY++G N D+TN+E
Sbjct: 39 WKKTHEKEYKDKNEEEVRRLIWEKNLKFIMIHNLEYSMGMHTYQVGMNDMGDMTNEEILC 98
Query: 107 LYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAV 166
++P S ++ T +++ S +P ++DWR+K VT +K Q CG CWAFSAV
Sbjct: 99 RMGALRIPRQSPKTVT-----FRSYSNRTLPDTVDWREKGCVTEVKYQGSCGACWAFSAV 153
Query: 167 AAVEGITKISGANLIQLSEQQLVDCSTN---GNNGCGGGTMEKAFEYIIQNQGIATEDEY 223
A+EG K+ LI LS Q LVDCS GN GCGGG M +AF+YII N GI + Y
Sbjct: 154 GALEGQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEAFQYIIDNGGIEADASY 213
Query: 224 PYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAVSMQ-PVSIGIAAYTTEFKSYKE 282
PY+A+ C K AA S Y ++P GDE AL +AV+ + PVS+GI A + F YK
Sbjct: 214 PYKAMDEKCHYNSKNRAATCSRYIQLPFGDEDALKEAVATKGPVSVGIDASHSSFFFYKS 273
Query: 283 GIFNGV-CGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILR-DEGLCGIG 340
G+++ C ++H V +VG+GT DG +YWL+KNSWG +GD GY+++ R ++ CGI
Sbjct: 274 GVYDDPSCTGNVNHGVLVVGYGTL-DGKDYWLVKNSWGLNFGDQGYIRMARNNKNHCGIA 332
Query: 341 TQSSYP 346
+ SYP
Sbjct: 333 SDCSYP 338
>gi|309380130|gb|ADO65978.1| cathepsin L [Eriocheir sinensis]
gi|309380134|gb|ADO65980.1| cathepsin L [Eriocheir sinensis]
Length = 325
Score = 243 bits (619), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 133/313 (42%), Positives = 192/313 (61%), Gaps = 24/313 (7%)
Query: 48 EKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKE---GNRTYKLGTNRFSDLTNDEF 104
+++ A++G+ Y+ E R ++++N E+I N++ G ++ L N+F D+T +E
Sbjct: 23 QQFKARYGKQYRSTKEDSYRQSVYEQNQEFINSHNEQYENGLVSFTLAMNQFGDMTTEEI 82
Query: 105 RALYTGY-----KMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGC 159
A G+ K+P R T YQ L + ++P ++DWRDK AVTP+KDQ+ CG
Sbjct: 83 NAAMNGFLSAGKKVP----RGTM-----YQPL-VDELPDTVDWRDKGAVTPVKDQKACGS 132
Query: 160 CWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTN-GNNGCGGGTMEKAFEYIIQNQGIA 218
CWAFSA ++EG +S L+ LSEQ LVDCS GN GCGGG M+ AF YI N GI
Sbjct: 133 CWAFSATGSLEGQHFLSTGKLVSLSEQNLVDCSDKYGNFGCGGGLMDNAFRYIKDNNGID 192
Query: 219 TEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAVSMQ-PVSIGIAAYTTEF 277
TE+ YPY+A G C A +S+Y ++ G E L KAV+ + PVS+ I A T+ F
Sbjct: 193 TEESYPYEAKNGPCRFNSDNVGATLSSYVDIQHGSEDDLQKAVAEKGPVSVAIDASTSTF 252
Query: 278 KSYKEGI-FNGVCGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRDE- 334
Y GI ++ C + LDH V VG+G T+D ++YWL+KNSW +TWGD+GY+K+ R+
Sbjct: 253 HFYSRGIYYDEKCSSSFLDHGVLAVGYG-TDDSSDYWLVKNSWNETWGDSGYIKMSRNRN 311
Query: 335 GLCGIGTQSSYPL 347
CGI +Q+SYP+
Sbjct: 312 NNCGIASQASYPV 324
>gi|7523482|dbj|BAA94210.1| putative cysteine protease [Oryza sativa Japonica Group]
gi|10800060|dbj|BAB16480.1| putative cysteine protease [Oryza sativa Japonica Group]
Length = 349
Score = 243 bits (619), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 139/353 (39%), Positives = 195/353 (55%), Gaps = 34/353 (9%)
Query: 17 PMFIIIILLVS-CASQVVSSRSTHEQS----VVEMHEKWMAQHGRSYKDELEKEMRFKIF 71
P+ I +ILL + A Q +++ + +M E+WMA+ G+ Y EKE RF +F
Sbjct: 6 PVAIAVILLCTFLAFQAMAADAYGGGGDDGVTTQMFEEWMAKFGKKYPCHGEKEYRFGVF 65
Query: 72 KENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNL 131
++N+ +I L N+F+DLTNDEF + +TG K P P + +
Sbjct: 66 RDNVRFIRSYRPPAGYNSALRVNQFADLTNDEFVSTHTGAKPPCPKDAP--------RGV 117
Query: 132 SMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDC 191
+P +DWR K AVT +KDQ CG CWAF+AVAA+EG+T+I L LSEQ+LVDC
Sbjct: 118 DPIWLPCCIDWRYKGAVTDVKDQGACGSCWAFAAVAAIEGLTQIRTGKLTPLSEQELVDC 177
Query: 192 STNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSA--AQKAAAAKISNYEEV 249
T G++GC GG ++AFE + GI E Y Y+ +G C A A AA+I + V
Sbjct: 178 DT-GSSGCAGGHTDRAFELVAAKGGITAESGYRYEGYRGKCRADDALFNHAARIGGHRAV 236
Query: 250 PSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQL---------DHAVTIV 300
P GDE+ L AV+ QPV+ I A F+ Y G+F G CG+ +HAVT+V
Sbjct: 237 PPGDERQLATAVARQPVTAYIDASGPAFQFYGSGVFPGPCGSGSGAAAAAPTTNHAVTLV 296
Query: 301 GFGTTEDGAN---YWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYP 346
G+ +DGA+ YW+ KNSWG TWG+ GY+ + +D G CG+ YP
Sbjct: 297 GY--CQDGASGKKYWVAKNSWGKTWGEKGYILLEKDVASPHGTCGVAVSPFYP 347
>gi|75812934|ref|NP_001028787.1| cathepsin S precursor [Bos taurus]
gi|115503669|sp|P25326.2|CATS_BOVIN RecName: Full=Cathepsin S; Flags: Precursor
gi|74353837|gb|AAI02246.1| Cathepsin S [Bos taurus]
gi|296489535|tpg|DAA31648.1| TPA: cathepsin S precursor [Bos taurus]
Length = 331
Score = 243 bits (619), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 137/338 (40%), Positives = 198/338 (58%), Gaps = 18/338 (5%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQSVVEMH-EKWMAQHGRSYKDELEKEMRFKIFKENLE 76
M ++ L+ C+S + H ++ H + W +G+ YK++ E+ R I+++NL+
Sbjct: 1 MNWLVWALLLCSSAMAH---VHRDPTLDHHWDLWKKTYGKQYKEKNEEVARRLIWEKNLK 57
Query: 77 YIEKANKE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
+ N E G +Y+LG N D+T++E +L + ++PS R+ T + Q L
Sbjct: 58 TVTLHNLEHSMGMHSYELGMNHLGDMTSEEVISLMSSLRVPSQWPRNVTYKSDPNQKL-- 115
Query: 134 TDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCST 193
P S+DWR+K VT +K Q CG CWAFSAV A+E K+ L+ LS Q LVDCST
Sbjct: 116 ---PDSMDWREKGCVTEVKYQGACGSCWAFSAVGALEAQVKLKTGKLVSLSAQNLVDCST 172
Query: 194 N--GNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPS 251
GN GC GG M +AF+YII N GI +E YPY+A+ G C K AA S Y E+P
Sbjct: 173 AKYGNKGCNGGFMTEAFQYIIDNNGIDSEASYPYKAMDGKCQYDVKNRAATCSRYIELPF 232
Query: 252 GDEQALLKAVSMQ-PVSIGIAAYTTEFKSYKEGI-FNGVCGTQLDHAVTIVGFGTTEDGA 309
G E+AL +AV+ + PVS+GI A + F YK G+ ++ C ++H V +VG+G DG
Sbjct: 233 GSEEALKEAVANKGPVSVGIDASHSSFFLYKTGVYYDPSCTQNVNHGVLVVGYGNL-DGK 291
Query: 310 NYWLIKNSWGDTWGDAGYMKILRDEG-LCGIGTQSSYP 346
+YWL+KNSWG +GD GY+++ R+ G CGI SYP
Sbjct: 292 DYWLVKNSWGLHFGDQGYIRMARNSGNHCGIANYPSYP 329
>gi|327289213|ref|XP_003229319.1| PREDICTED: cathepsin S-like [Anolis carolinensis]
Length = 333
Score = 242 bits (618), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 131/317 (41%), Positives = 195/317 (61%), Gaps = 14/317 (4%)
Query: 40 EQSVVEMH-EKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKE---GNRTYKLGTNR 95
+ S+++ H E W ++ + Y+++ E+ +R I+++NL ++ N E G +Y+LG N
Sbjct: 21 KDSMLDGHWELWKKKYNKEYQNKEEEGVRRVIWEKNLRFVMLHNLEQSLGLHSYELGMNH 80
Query: 96 FSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQ 155
D+T++E AL TG K+P R++T Y P ++DWR+K VT +K+Q
Sbjct: 81 LGDMTSEEVTALMTGLKIPVSQSRNST----LYWARQGASAPDTVDWREKGCVTNVKNQG 136
Query: 156 ECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTN-GNNGCGGGTMEKAFEYIIQN 214
CG CWAFSAV A+E K+ NL+ LS Q LVDCS+ GN+GC GG + AF+Y+I N
Sbjct: 137 SCGSCWAFSAVGALECQLKLKTGNLVSLSPQNLVDCSSAFGNHGCNGGYISAAFQYVIYN 196
Query: 215 QGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAVS-MQPVSIGIAAY 273
GI +E YPY GTC + AA S Y ++PSG+E AL AV+ PVS+ I A
Sbjct: 197 NGIDSEASYPYTGQSGTCRYNLQGRAATCSRYVDLPSGNEAALKDAVANFGPVSVAIDAS 256
Query: 274 TTEFKSYKEGIFNGVCGT--QLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKIL 331
F +++G+++ T ++H V +VG+G TEDG +YWL+KNSWG ++GD GY+KI
Sbjct: 257 RPSFFLFRKGVYDDPSCTSAHINHGVLVVGYG-TEDGIDYWLVKNSWGVSFGDQGYIKIA 315
Query: 332 RD-EGLCGIGTQSSYPL 347
R+ + CGI +Q +YPL
Sbjct: 316 RNHDNRCGIASQCTYPL 332
>gi|194352766|emb|CAQ00111.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
Length = 384
Score = 242 bits (617), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 136/331 (41%), Positives = 188/331 (56%), Gaps = 35/331 (10%)
Query: 50 WMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYT 109
WMA HGRSY EK RF++++ N+E+IE AN++ +Y LG F+DLT+DEF A+Y+
Sbjct: 55 WMAAHGRSYPTVEEKLRRFEVYRSNMEFIEAANRDSRMSYSLGETPFTDLTHDEFMAMYS 114
Query: 110 GYKMPS-------------PSHRSTTS--STFKYQNLSMTDV-PTSLDWRDKKAVTPIKD 153
S P H T + + NL++T V P S+DWR K VTP K+
Sbjct: 115 SNDDSSEWEEATVITTRAGPVHEGTAAVEEPPRRTNLNVTAVLPPSVDWRAKGVVTPAKN 174
Query: 154 Q-QECGCCWAFSAVAAVEGITKIS-GANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYI 211
Q C CWAF++VA +E IS G + LSEQQLVDCST ++GCG G M+ AF+++
Sbjct: 175 QGATCFSCWAFTSVATMESAQAISTGGSPPVLSEQQLVDCSTL-HHGCGRGWMDDAFKWV 233
Query: 212 IQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEV-PSGDEQALLKAVSMQPVSIGI 270
I N GI TE YPY G C K A ++ +Y++V P G+E L +AV+ QPV++
Sbjct: 234 IMNGGITTEAAYPYTGKAGNCQTG-KPVAVRLRSYKKVTPPGNEAGLKEAVAQQPVAVSF 292
Query: 271 AAYTTEFKSYKEGIFN-----------GVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWG 319
F+ Y G++N G C T +HA+ +VG+GT DG YW+ KNSW
Sbjct: 293 DYSDPCFQHYIGGVYNAGCSRSGVYIKGACKTAQNHAMALVGYGTKPDGTKYWIGKNSWT 352
Query: 320 DTWGDAGYMKILRDE---GLCGIGTQSSYPL 347
WGD G++ +LRD GLCG+ YP+
Sbjct: 353 AKWGDKGFIYLLRDSPPLGLCGLAKLPVYPI 383
>gi|307192137|gb|EFN75465.1| Cathepsin L [Harpegnathos saltator]
Length = 339
Score = 242 bits (617), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 130/338 (38%), Positives = 193/338 (57%), Gaps = 14/338 (4%)
Query: 20 IIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIE 79
I ++L + A+Q +S + V E + H ++Y ++E+ R KIF EN I
Sbjct: 5 IFLLLGILAAAQAISFFNL----VTEEWNTFKVTHRKAYDSKIEESFRMKIFMENWHKIA 60
Query: 80 KANKE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTF--KYQNLSMT 134
N++ +YKLG N++ D+ + EF G+ + ++ +
Sbjct: 61 LHNQKYELNEVSYKLGMNKYGDMLHHEFINTLNGFNKSVSAQLRAQRRPIGSRFIEPANV 120
Query: 135 DVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTN 194
++P+S+DWR AVTPIKDQ CG CW+FSA A+EG L+ LSEQ L+DCS
Sbjct: 121 EIPSSVDWRTHGAVTPIKDQGHCGSCWSFSATGALEGQHYRITGKLVSLSEQNLIDCSGR 180
Query: 195 -GNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGD 253
GNNGC GG M++AF+YI N G+ TE YPY+A C + A S Y ++P G+
Sbjct: 181 YGNNGCNGGLMDQAFQYIKDNHGLDTEISYPYEAENDKCRYNPRNNGATDSGYVDIPEGN 240
Query: 254 EQALLKAV-SMQPVSIGIAAYTTEFKSYKEGI-FNGVCGTQ-LDHAVTIVGFGTTEDGAN 310
E+ L AV ++ PVS+ I A F+ Y+EG+ + C ++ LDH V +VG+GT ++ +
Sbjct: 241 EKKLKAAVATIGPVSVAIDASAESFQFYREGVYYEPRCSSENLDHGVLVVGYGTDDNDQD 300
Query: 311 YWLIKNSWGDTWGDAGYMKILRD-EGLCGIGTQSSYPL 347
YWL+KNSWG TWGD GY+K+ R+ + CGI + +SYPL
Sbjct: 301 YWLVKNSWGVTWGDEGYIKMARNKDNHCGIASSASYPL 338
>gi|390608645|ref|NP_001254624.1| cathepsin S isoform 1 preproprotein [Mus musculus]
gi|74214026|dbj|BAE29430.1| unnamed protein product [Mus musculus]
Length = 343
Score = 242 bits (617), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 131/306 (42%), Positives = 185/306 (60%), Gaps = 15/306 (4%)
Query: 50 WMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKE---GNRTYKLGTNRFSDLTNDEFRA 106
W H + YKD+ E+E+R I+++NL++I N E G TY++G N D+TN+E
Sbjct: 42 WKKTHEKEYKDKNEEEVRRLIWEKNLKFIMIHNLEYSMGMHTYQVGMNDMGDMTNEEILC 101
Query: 107 LYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAV 166
++P S ++ T +++ S +P ++DWR+K VT +K Q CG CWAFSAV
Sbjct: 102 RMGALRIPRQSPKTVT-----FRSYSNRTLPDTVDWREKGCVTEVKYQGSCGACWAFSAV 156
Query: 167 AAVEGITKISGANLIQLSEQQLVDCSTN---GNNGCGGGTMEKAFEYIIQNQGIATEDEY 223
A+EG K+ LI LS Q LVDCS GN GCGGG M +AF+YII N GI + Y
Sbjct: 157 GALEGQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEAFQYIIDNGGIEADASY 216
Query: 224 PYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAVSMQ-PVSIGIAAYTTEFKSYKE 282
PY+A C K AA S Y ++P GDE AL +AV+ + PVS+GI A + F YK
Sbjct: 217 PYKATDEKCHYNSKNRAATCSRYIQLPFGDEDALKEAVATKGPVSVGIDASHSSFFFYKS 276
Query: 283 GIFNGV-CGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILR-DEGLCGIG 340
G+++ C ++H V +VG+GT DG +YWL+KNSWG +GD GY+++ R ++ CGI
Sbjct: 277 GVYDDPSCTGNVNHGVLVVGYGTL-DGKDYWLVKNSWGLNFGDQGYIRMARNNKNHCGIA 335
Query: 341 TQSSYP 346
+ SYP
Sbjct: 336 SYCSYP 341
>gi|74178074|dbj|BAE29827.1| unnamed protein product [Mus musculus]
gi|74178231|dbj|BAE29900.1| unnamed protein product [Mus musculus]
gi|74220784|dbj|BAE31361.1| unnamed protein product [Mus musculus]
Length = 326
Score = 242 bits (617), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 131/306 (42%), Positives = 185/306 (60%), Gaps = 15/306 (4%)
Query: 50 WMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKE---GNRTYKLGTNRFSDLTNDEFRA 106
W H + YKD+ E+E+R I+++NL++I N E G TY++G N D+TN+E
Sbjct: 25 WKKTHEKEYKDKNEEEVRRLIWEKNLKFIMIHNLEYSMGMHTYQVGMNDMGDMTNEEILC 84
Query: 107 LYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAV 166
++P S ++ T +++ S +P ++DWR+K VT +K Q CG CWAFSAV
Sbjct: 85 RMGALRIPRQSPKTVT-----FRSYSNRTLPDTVDWREKGCVTEVKYQGSCGACWAFSAV 139
Query: 167 AAVEGITKISGANLIQLSEQQLVDCSTN---GNNGCGGGTMEKAFEYIIQNQGIATEDEY 223
A+EG K+ LI LS Q LVDCS GN GCGGG M +AF+YII N GI + Y
Sbjct: 140 GALEGQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEAFQYIIDNGGIEADASY 199
Query: 224 PYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAVSMQ-PVSIGIAAYTTEFKSYKE 282
PY+A C K AA S Y ++P GDE AL +AV+ + PVS+GI A + F YK
Sbjct: 200 PYKATDEKCHYNSKNRAATCSRYIQLPFGDEDALKEAVATKGPVSVGIDASHSSFFFYKS 259
Query: 283 GIFNG-VCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILR-DEGLCGIG 340
G+++ C ++H V +VG+GT DG +YWL+KNSWG +GD GY+++ R ++ CGI
Sbjct: 260 GVYDDPSCTGNVNHGVLVVGYGTL-DGKDYWLVKNSWGLNFGDQGYIRMARNNKNHCGIA 318
Query: 341 TQSSYP 346
+ SYP
Sbjct: 319 SYCSYP 324
>gi|404312774|pdb|3TNX|A Chain A, Structure Of The Precursor Of A Thermostable Variant Of
Papain At 2.6 Angstroem Resolution
gi|404312775|pdb|3TNX|C Chain C, Structure Of The Precursor Of A Thermostable Variant Of
Papain At 2.6 Angstroem Resolution
gi|428698029|pdb|3USV|A Chain A, Structure Of The Precursor Of A Thermostable Variant Of
Papain At 3.8 A Resolution From A Crystal Soaked At Ph 4
gi|428698030|pdb|3USV|C Chain C, Structure Of The Precursor Of A Thermostable Variant Of
Papain At 3.8 A Resolution From A Crystal Soaked At Ph 4
Length = 363
Score = 242 bits (617), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 132/315 (41%), Positives = 193/315 (61%), Gaps = 15/315 (4%)
Query: 38 THEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFS 97
T + ++++ E WM +H + YK+ EK RF+IFK+NL+YI++ NK+ N +Y LG N F+
Sbjct: 57 TSTERLIQLFESWMLKHNKIYKNIDEKIYRFEIFKDNLKYIDETNKK-NNSYWLGLNVFA 115
Query: 98 DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQEC 157
D++NDEF+ YTG + ++ +T S + N ++P +DWR K AVTP+K+Q C
Sbjct: 116 DMSNDEFKEKYTG--SIAGNYTTTELSYEEVLNDGDVNIPEYVDWRQKGAVTPVKNQGSC 173
Query: 158 GCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGI 217
G WAFSAV+ +E I KI NL + SEQ+L+DC + GC GG A + + Q GI
Sbjct: 174 GSAWAFSAVSTIESIIKIRTGNLNEYSEQELLDCDRR-SYGCNGGYPWSALQLVAQ-YGI 231
Query: 218 ATEDEYPYQAVQGTCSAAQKAA-AAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTE 276
+ YPY+ VQ C + +K AAK +V +E ALL +++ QPVS+ + A +
Sbjct: 232 HYRNTYPYEGVQRYCRSREKGPYAAKTDGVRQVQPYNEGALLYSIANQPVSVVLEAAGKD 291
Query: 277 FKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILR---- 332
F+ Y+ GIF G CG ++DHAV VG+ G NY LI+NSWG WG+ GY++I R
Sbjct: 292 FQLYRGGIFVGPCGNKVDHAVAAVGY-----GPNYILIRNSWGTGWGENGYIRIKRGTGN 346
Query: 333 DEGLCGIGTQSSYPL 347
G+CG+ T S YP+
Sbjct: 347 SYGVCGLYTSSFYPV 361
>gi|341940310|sp|O70370.2|CATS_MOUSE RecName: Full=Cathepsin S; Flags: Precursor
Length = 340
Score = 242 bits (617), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 131/306 (42%), Positives = 185/306 (60%), Gaps = 15/306 (4%)
Query: 50 WMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKE---GNRTYKLGTNRFSDLTNDEFRA 106
W H + YKD+ E+E+R I+++NL++I N E G TY++G N D+TN+E
Sbjct: 39 WKKTHEKEYKDKNEEEVRRLIWEKNLKFIMIHNLEYSMGMHTYQVGMNDMGDMTNEEILC 98
Query: 107 LYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAV 166
++P S ++ T +++ S +P ++DWR+K VT +K Q CG CWAFSAV
Sbjct: 99 RMGALRIPRQSPKTVT-----FRSYSNRTLPDTVDWREKGCVTEVKYQGSCGACWAFSAV 153
Query: 167 AAVEGITKISGANLIQLSEQQLVDCSTN---GNNGCGGGTMEKAFEYIIQNQGIATEDEY 223
A+EG K+ LI LS Q LVDCS GN GCGGG M +AF+YII N GI + Y
Sbjct: 154 GALEGQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEAFQYIIDNGGIEADASY 213
Query: 224 PYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAVSMQ-PVSIGIAAYTTEFKSYKE 282
PY+A C K AA S Y ++P GDE AL +AV+ + PVS+GI A + F YK
Sbjct: 214 PYKATDEKCHYNSKNRAATCSRYIQLPFGDEDALKEAVATKGPVSVGIDASHSSFFFYKS 273
Query: 283 GIFNGV-CGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILR-DEGLCGIG 340
G+++ C ++H V +VG+GT DG +YWL+KNSWG +GD GY+++ R ++ CGI
Sbjct: 274 GVYDDPSCTGNVNHGVLVVGYGTL-DGKDYWLVKNSWGLNFGDQGYIRMARNNKNHCGIA 332
Query: 341 TQSSYP 346
+ SYP
Sbjct: 333 SYCSYP 338
>gi|957281|gb|AAB33990.1| cysteine proteinase [Bombyx mori]
Length = 344
Score = 242 bits (617), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 136/345 (39%), Positives = 196/345 (56%), Gaps = 23/345 (6%)
Query: 23 ILLVSCASQVVSSRSTHEQSVVEMHEKWMA---QHGRSYKDELEKEMRFKIFKENLEYIE 79
++L+ CA VS+ Q + E+W A QH +YK E+E R KI+ E+ I
Sbjct: 4 LVLLLCAVAAVSAV----QFFDLVKEEWSAFKLQHRLNYKSEVEDNFRMKIYAEHKHIIA 59
Query: 80 KANKE---GNRTYKLGTNRF---SDLTNDEFRALYTGYKMPSPSHRST-----TSSTFKY 128
K N++ G +YKLG N + D+ + EF G+ + +++ + K+
Sbjct: 60 KHNQKYEMGLVSYKLGMNSWWEHGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKF 119
Query: 129 QNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQL 188
+ + +P +DWR AVT IKDQ +CG CW+FS A+EG L+ LSEQ L
Sbjct: 120 ISPANVKLPEQVDWRKHGAVTDIKDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNL 179
Query: 189 VDCSTN-GNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYE 247
+DCS GNNGC GG M+ AF+YI N GI TE YPY+ V C K A+ +
Sbjct: 180 IDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTEQAYPYEGVDDKCRYNPKNTGAEDVGFV 239
Query: 248 EVPSGDEQALLKAV-SMQPVSIGIAAYTTEFKSYKEGIFN--GVCGTQLDHAVTIVGFGT 304
++P GDEQ L++AV ++ PVS+ I A T F+ Y G++N T LDH V +VG+GT
Sbjct: 240 DIPEGDEQKLMEAVATVGPVSVAIDASHTHFQLYSSGVYNEEECSSTDLDHGVLVVGYGT 299
Query: 305 TEDGANYWLIKNSWGDTWGDAGYMKILRDE-GLCGIGTQSSYPLA 348
E G +YWL+KNSWG +WG+ GY+K++R++ CGI + +SYPL
Sbjct: 300 DEQGVDYWLVKNSWGRSWGELGYIKMIRNKNNRCGIASSASYPLV 344
>gi|392306967|ref|NP_067256.3| cathepsin S isoform 2 preproprotein [Mus musculus]
gi|26390492|dbj|BAC25906.1| unnamed protein product [Mus musculus]
gi|148706872|gb|EDL38819.1| cathepsin S [Mus musculus]
Length = 342
Score = 242 bits (617), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 131/306 (42%), Positives = 185/306 (60%), Gaps = 15/306 (4%)
Query: 50 WMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKE---GNRTYKLGTNRFSDLTNDEFRA 106
W H + YKD+ E+E+R I+++NL++I N E G TY++G N D+TN+E
Sbjct: 41 WKKTHEKEYKDKNEEEVRRLIWEKNLKFIMIHNLEYSMGMHTYQVGMNDMGDMTNEEILC 100
Query: 107 LYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAV 166
++P S ++ T +++ S +P ++DWR+K VT +K Q CG CWAFSAV
Sbjct: 101 RMGALRIPRQSPKTVT-----FRSYSNRTLPDTVDWREKGCVTEVKYQGSCGACWAFSAV 155
Query: 167 AAVEGITKISGANLIQLSEQQLVDCSTN---GNNGCGGGTMEKAFEYIIQNQGIATEDEY 223
A+EG K+ LI LS Q LVDCS GN GCGGG M +AF+YII N GI + Y
Sbjct: 156 GALEGQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEAFQYIIDNGGIEADASY 215
Query: 224 PYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAVSMQ-PVSIGIAAYTTEFKSYKE 282
PY+A C K AA S Y ++P GDE AL +AV+ + PVS+GI A + F YK
Sbjct: 216 PYKATDEKCHYNSKNRAATCSRYIQLPFGDEDALKEAVATKGPVSVGIDASHSSFFFYKS 275
Query: 283 GIFNGV-CGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILR-DEGLCGIG 340
G+++ C ++H V +VG+GT DG +YWL+KNSWG +GD GY+++ R ++ CGI
Sbjct: 276 GVYDDPSCTGNVNHGVLVVGYGTL-DGKDYWLVKNSWGLNFGDQGYIRMARNNKNHCGIA 334
Query: 341 TQSSYP 346
+ SYP
Sbjct: 335 SYCSYP 340
>gi|94480716|emb|CAI91577.1| cathepsin L [Aphrocallistes vastus]
Length = 329
Score = 241 bits (616), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 135/333 (40%), Positives = 199/333 (59%), Gaps = 10/333 (3%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEY 77
M ++ +LL A V + E+ E W ++ RSY L++E+R KI+ N+ Y
Sbjct: 1 MKLVFLLLGLFAGACVCLQCETEEVQDFAWEGWKLKYNRSYG--LDEELRKKIWANNMLY 58
Query: 78 IEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
+++ N EG+ +YKL N+F+DLTN E+R +Y GY + R F+ + + D+P
Sbjct: 59 VKEFNAEGH-SYKLAANQFADLTNLEYRQIYLGYDNEARLSRKREGKVFQ-RKMKDEDLP 116
Query: 138 TSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTN-GN 196
T++DWR K VTP+K+Q +CG CW+FSA ++EG I L+ SEQ+LVDCST+ GN
Sbjct: 117 TTVDWRSKGVVTPVKNQGQCGSCWSFSATGSLEGQYAIKSGKLVSFSEQELVDCSTSLGN 176
Query: 197 NGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQA 256
+GC GG M+ AF+Y N E +Y Y A G C + K S++ ++PS + A
Sbjct: 177 HGCQGGLMDYAFKYWETNLA-EKESDYTYTAKNGKCKYNAQLGVTKDSSFTDIPSENCDA 235
Query: 257 LLKAVSMQ-PVSIGIAAYTTEFKSYKEGIFNG-VCG-TQLDHAVTIVGFGTTEDGANYWL 313
L +AV+ + P+++ + A T F+ Y GI+ +C T+LDH V +VG+G T++G +YWL
Sbjct: 236 LKEAVANKGPIAVAMDASHTSFQMYHSGIYTPFLCSKTKLDHGVLVVGYG-TDNGVDYWL 294
Query: 314 IKNSWGDTWGDAGYMKILRDEGLCGIGTQSSYP 346
IKNSWG WG GY KI CGI TQ+SYP
Sbjct: 295 IKNSWGMAWGMDGYFKIEMKSDKCGICTQASYP 327
>gi|312306194|gb|ADQ73946.1| cathepsin L [Paralithodes camtschaticus]
Length = 324
Score = 241 bits (616), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 129/308 (41%), Positives = 181/308 (58%), Gaps = 15/308 (4%)
Query: 48 EKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKE---GNRTYKLGTNRFSDLTNDEF 104
++ Q+GR Y E+ R ++ +N+E+IE N++ G TY L N+F D+TN+E
Sbjct: 23 HQFKVQYGRQYATAQEERYRSSVYDQNMEFIEAHNEQYTNGEVTYMLAINQFGDMTNEEI 82
Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFS 164
A+ G +P+ R + L P +DWR K AVTP+KDQ+ CG CWAFS
Sbjct: 83 NAVMNGL-LPASESRGVAVLGGRDDTL-----PAEVDWRTKGAVTPVKDQKACGSCWAFS 136
Query: 165 AVAAVEGITKISGANLIQLSEQQLVDCST-NGNNGCGGGTMEKAFEYIIQNQGIATEDEY 223
A ++EG + L+ LSEQ LVDCST G++GCGGG M+ AF YI N GI TE Y
Sbjct: 137 ATGSLEGQHFLKDGKLVSLSEQNLVDCSTKQGDHGCGGGLMDFAFTYIKDNGGIDTEASY 196
Query: 224 PYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAV-SMQPVSIGIAAYTTEFKSYKE 282
PY+A G C + A ++ Y +V E AL KAV ++ P+S+ I A + F Y +
Sbjct: 197 PYEATDGKCQYNPANSGATVTGYVDVEHDSEDALQKAVATIGPISVAIDASRSTFHFYHK 256
Query: 283 GIF--NGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRDE-GLCGI 339
G++ T LDH V VG+G T+DG +YWL+KNSW TWG+ G++++ R+ CGI
Sbjct: 257 GVYYDKECSSTSLDHGVLAVGYG-TQDGTDYWLVKNSWNITWGNHGFIEMSRNRNNNCGI 315
Query: 340 GTQSSYPL 347
TQ+SYPL
Sbjct: 316 ATQASYPL 323
>gi|334324655|ref|XP_001370975.2| PREDICTED: cathepsin S-like isoform 1 [Monodelphis domestica]
Length = 331
Score = 241 bits (616), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 131/317 (41%), Positives = 191/317 (60%), Gaps = 15/317 (4%)
Query: 39 HEQSVVEMH-EKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKE---GNRTYKLGTN 94
H +++ H + W HG+ YK + E+ R I+++NL+Y+ N E G +Y L N
Sbjct: 19 HRDPMLDGHWDLWKKTHGKQYKGQNEEIARRLIWEKNLKYVTLHNLEHSMGLHSYDLSMN 78
Query: 95 RFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQ 154
D+T++E +L + ++P+ +R+TT Y+ S +P S+DWR+K VT +K Q
Sbjct: 79 HLGDMTSEEVISLMSSLRIPNQWNRNTT-----YRLSSNQKLPDSVDWREKGCVTEVKYQ 133
Query: 155 QECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTN--GNNGCGGGTMEKAFEYII 212
CG CWAFSAV A+E K+ L+ LS Q LVDCST+ N+GC GG M AF+Y+I
Sbjct: 134 GSCGSCWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTDKYDNHGCNGGFMTSAFQYVI 193
Query: 213 QNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAVSMQ-PVSIGIA 271
N GI ++ YPY+A G C + AA S Y E+P G E+AL +AV+ + PVS+GI
Sbjct: 194 DNNGIDSDVSYPYKATDGKCQYNPASRAATCSKYTELPYGSEEALKEAVANKGPVSVGID 253
Query: 272 AYTTEFKSYKEGI-FNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKI 330
A T F YK G+ ++ C +++H V ++G+G DG +YWL+KNSWG +GD GY++I
Sbjct: 254 AKTPSFFLYKSGVYYDPSCTQKVNHGVLVIGYGNL-DGQDYWLVKNSWGLHFGDKGYVRI 312
Query: 331 LRDEG-LCGIGTQSSYP 346
R+ G CGI SYP
Sbjct: 313 ARNRGNHCGIANFPSYP 329
>gi|225706086|gb|ACO08889.1| Cathepsin S precursor [Osmerus mordax]
Length = 333
Score = 241 bits (615), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 126/312 (40%), Positives = 188/312 (60%), Gaps = 13/312 (4%)
Query: 44 VEMH-EKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKE---GNRTYKLGTNRFSDL 99
+++H + W QHG++YK E+E+ R ++++ NL+ I N E G TY LG N D+
Sbjct: 26 LDLHWQMWKKQHGKNYKTEVEELGRREVWERNLQLISLHNLEASMGMHTYDLGMNHMGDM 85
Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGC 159
T +E + K+P+ R ++ + S T VP ++DWR K VT +K+Q CG
Sbjct: 86 TEEEILQSFASLKVPADLKREPSA----FVASSGTPVPDTVDWRQKGYVTQVKNQGSCGS 141
Query: 160 CWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTN-GNNGCGGGTMEKAFEYIIQNQGIA 218
CWAFS+V A+EG + L+ LS Q LVDCS+ GN GC GG M +AF+Y+I N+GI
Sbjct: 142 CWAFSSVGALEGQLMRTTGKLLDLSPQNLVDCSSKYGNKGCNGGFMSEAFQYVIDNKGID 201
Query: 219 TEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAVSM-QPVSIGIAAYTTEF 277
++ YPYQ VQGTC +A + Y +P GDE L +AV+M P+S+ I A F
Sbjct: 202 SDTSYPYQGVQGTCHYNPSYRSANCTRYSFLPEGDETTLKQAVAMIGPISVAIDATRPSF 261
Query: 278 KSYKEGIFNGV-CGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRDE-G 335
++ G++N + C +++HAV +VG+GT DG +YWL+KNSWG +G+ GY+++ R+
Sbjct: 262 ILWRSGVYNDLTCTQKINHAVLVVGYGTL-DGQDYWLVKNSWGTRFGENGYIRMSRNRNN 320
Query: 336 LCGIGTQSSYPL 347
CGI YP+
Sbjct: 321 QCGIALYGCYPI 332
>gi|218187750|gb|EEC70177.1| hypothetical protein OsI_00904 [Oryza sativa Indica Group]
gi|222617983|gb|EEE54115.1| hypothetical protein OsJ_00884 [Oryza sativa Japonica Group]
Length = 327
Score = 241 bits (615), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 132/320 (41%), Positives = 181/320 (56%), Gaps = 29/320 (9%)
Query: 45 EMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEF 104
+M E+WMA+ G+ Y EKE RF +F++N+ +I L N+F+DLTNDEF
Sbjct: 17 QMFEEWMAKFGKKYPCHGEKEYRFGVFRDNVRFIRSYRPPAGYNSALRVNQFADLTNDEF 76
Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFS 164
+ +TG K P P + + +P +DWR K AVT +KDQ CG CWAF+
Sbjct: 77 VSTHTGAKPPCPKDAP--------RGVDPIWLPCCIDWRYKGAVTDVKDQGACGSCWAFA 128
Query: 165 AVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYP 224
AVAA+EG+T+I L LSEQ+LVDC T G++GC GG ++AFE + GI E Y
Sbjct: 129 AVAAIEGLTQIRTGKLTPLSEQELVDCDT-GSSGCAGGHTDRAFELVAAKGGITAESGYR 187
Query: 225 YQAVQGTCSA--AQKAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKE 282
Y+ +G C A A AA+I + VP GDE+ L AV+ QPV+ I A F+ Y
Sbjct: 188 YEGYRGKCRADDALFNHAARIGGHRAVPPGDERQLATAVARQPVTAYIDASGPAFQFYGS 247
Query: 283 GIFNGVCGTQ---------LDHAVTIVGFGTTEDGAN---YWLIKNSWGDTWGDAGYMKI 330
G+F G CG+ +HAVT+VG+ +DGA+ YW+ KNSWG TWG+ GY+ +
Sbjct: 248 GVFPGPCGSGSGAAAAAPTTNHAVTLVGY--CQDGASGKKYWVAKNSWGKTWGEKGYILL 305
Query: 331 LRD----EGLCGIGTQSSYP 346
+D G CG+ YP
Sbjct: 306 EKDVASPHGTCGVAVSPFYP 325
>gi|432114311|gb|ELK36239.1| Cathepsin S [Myotis davidii]
Length = 340
Score = 241 bits (615), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 134/338 (39%), Positives = 198/338 (58%), Gaps = 18/338 (5%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQSVVEMH-EKWMAQHGRSYKDELEKEMRFKIFKENLE 76
M ++++L+ C+S + H+ ++ H + W +G+ Y +E E+ R I+++NL+
Sbjct: 10 MKWLLLVLLGCSSAMAQ---LHKDPTLDHHWDLWKKTYGKQYTEENEEVTRRFIWEKNLK 66
Query: 77 YIEKANKE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
Y+ N E G +Y LG N +D+T++E L + ++PS R+ T + Q L
Sbjct: 67 YVMLHNLEHSMGMHSYDLGMNHLADMTSEEVMLLMSSLRVPSQWQRNVTFKSNPNQKL-- 124
Query: 134 TDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCST 193
P S+DWRDK VT +K Q CG CWAFSAV A+E K+ L+ LS Q LVDCST
Sbjct: 125 ---PDSMDWRDKGCVTEVKYQGSCGSCWAFSAVGALEAQLKLKTGKLVSLSVQNLVDCST 181
Query: 194 N--GNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPS 251
N GC GG M +AF+YII N GI +E YPY+A+ G C K AA S Y E+P
Sbjct: 182 GKYSNKGCNGGFMTEAFQYIIDNNGIDSEASYPYKAMDGKCQYDVKNRAATCSKYVELPF 241
Query: 252 GDEQALLKAVSMQ-PVSIGIAAYTTEFKSYKEGI-FNGVCGTQLDHAVTIVGFGTTEDGA 309
G+E+AL +AV+ + PVS+ I A F Y+ G+ ++ C ++H V VG+G +G
Sbjct: 242 GNEEALKEAVANKGPVSVAIDASHPSFFLYRSGVYYDKACTLNVNHGVLAVGYGNY-NGK 300
Query: 310 NYWLIKNSWGDTWGDAGYMKILRDEG-LCGIGTQSSYP 346
+YWL+KNSWG +G+ GY+++ R+ G CGI + SYP
Sbjct: 301 DYWLVKNSWGLHFGEQGYIRMARNSGNHCGIASYPSYP 338
>gi|194741252|ref|XP_001953103.1| GF17600 [Drosophila ananassae]
gi|190626162|gb|EDV41686.1| GF17600 [Drosophila ananassae]
Length = 333
Score = 241 bits (615), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 139/340 (40%), Positives = 202/340 (59%), Gaps = 22/340 (6%)
Query: 19 FIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMA---QHGRSYKDELEKEMRFKIFKENL 75
I+ +LL++ A V + Q ++E E+WMA ++ + Y+DE E+++RFKIF N
Sbjct: 6 LILFMLLLAIAHAV-----PYAQDILE--EEWMAFKLEYNKVYQDETEEQLRFKIFNYNK 58
Query: 76 EYIEKANKE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
I + N + G ++ L N+F+DL + EF+ L G KM SPS + SSTF ++
Sbjct: 59 LLIARHNLKWAAGKVSFNLAVNKFADLLDHEFQDLMLG-KM-SPSGSNFGSSTF-LPPVN 115
Query: 133 MTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCS 192
+T +P ++DWR VTP+KDQ CG CWAFS ++EG LI LSEQ L+DCS
Sbjct: 116 LT-LPDAVDWRKYGFVTPVKDQGSCGSCWAFSTTGSLEGQHFRKTGQLISLSEQNLIDCS 174
Query: 193 TNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSG 252
GNNGC G +E AF YI N+GI TE YPY+A Q C + A + + ++ G
Sbjct: 175 P-GNNGCKNGAVEYAFRYIQSNKGIDTEISYPYEAAQNQCRFRRDTIGATSTGFVKLNPG 233
Query: 253 DEQALLKAV-SMQPVSIGIAAYTTEFKSYKEGIFNGV-CG-TQLDHAVTIVGFGTTEDGA 309
DE L +AV ++ P+S+ I + FK Y +G++N C +L HAV +VG+GT + G
Sbjct: 234 DEMELAQAVATVGPISVLINSSLDSFKFYHDGVYNDPSCNPNKLTHAVLVVGYGTDDRGG 293
Query: 310 NYWLIKNSWGDTWGDAGYMKILRD-EGLCGIGTQSSYPLA 348
++WL+KNSW WG+ GY+KI R+ LCGI + + YPL
Sbjct: 294 DFWLVKNSWSTHWGEQGYVKIKRNANNLCGIASNALYPLV 333
>gi|2804262|dbj|BAA24442.1| cysteine proteinase [Sitophilus zeamais]
Length = 338
Score = 241 bits (615), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 131/337 (38%), Positives = 197/337 (58%), Gaps = 15/337 (4%)
Query: 24 LLVSCASQVVSSRSTHEQSVVEMHEKWMA---QHGRSYKDELEKEMRFKIFKENLEYIEK 80
L + A+ V+S ++ +V+ E+W + QH ++Y E E+ R KIF EN + K
Sbjct: 3 LFLILAAVVISCQAVSFYDLVQ--EQWSSFKMQHSKNYDSETEERFRMKIFMENAHKVAK 60
Query: 81 ANK---EGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPS--HRSTTSSTFKYQNLSMTD 135
NK +G +KLG N+++D+ + EF + G+ + S + ++ + +
Sbjct: 61 HNKLFSQGFVKFKLGLNKYADMLHHEFVSTLNGFNKTKNNILKGSDLNDAVRFISPANVK 120
Query: 136 VPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTN- 194
+P ++DWRDK AVT +KDQ CG CW+FSA ++EG L+ LSEQ LVDCS
Sbjct: 121 LPDTVDWRDKGAVTEVKDQGHCGSCWSFSATGSLEGQHFRKTGKLVSLSEQNLVDCSGRY 180
Query: 195 GNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDE 254
GNNGC GG M+ AF YI N GI TE YPY A C + + A + ++ +E
Sbjct: 181 GNNGCNGGLMDNAFRYIKDNGGIDTEKSYPYLAEDEKCHYKAQNSGATDKGFVDIEEANE 240
Query: 255 QALLKAV-SMQPVSIGIAAYTTEFKSYKEGIFNGV-CGTQ-LDHAVTIVGFGTTEDGANY 311
L AV ++ PVSI I A F+ Y +G+++ C +Q LDH V +VG+GT++DG +Y
Sbjct: 241 DDLKAAVATVGPVSIAIDASHETFQLYSDGVYSDPECSSQELDHGVLVVGYGTSDDGQDY 300
Query: 312 WLIKNSWGDTWGDAGYMKILRD-EGLCGIGTQSSYPL 347
WL+KNSWG +WG GY+K+ R+ + +CG+ +Q+SYPL
Sbjct: 301 WLVKNSWGPSWGLNGYIKMARNQDNMCGVASQASYPL 337
>gi|389608655|dbj|BAM17937.1| cathepsin L [Papilio xuthus]
Length = 341
Score = 241 bits (615), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 131/344 (38%), Positives = 196/344 (56%), Gaps = 23/344 (6%)
Query: 20 IIIILLVSCASQVVSSRSTHEQSVVEMHEKWMA---QHGRSYKDELEKEMRFKIFKENLE 76
++I+L V A+ VS + E+W A +H + Y E+E + R KI+ EN
Sbjct: 4 LVILLCVVAAASAVSFFDL-------VKEEWNAFKMEHQKQYDSEVEDKFRMKIYAENKH 56
Query: 77 YIEKANKE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHR-----STTSSTFKY 128
I K N++ G +++L N++ D+ + EF G+ + + + S +
Sbjct: 57 NIAKHNQKYARGEVSFRLKQNKYGDMLHHEFVHTMNGFNKTTKNSKGLFGKSAGERGATF 116
Query: 129 QNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQL 188
+ +P +DWR AVT +KDQ +CG CW+FS+ A+EG L+ LSEQ L
Sbjct: 117 ITPANVHLPDHVDWRKHGAVTEVKDQGKCGSCWSFSSTGALEGQHYRRTNILVSLSEQNL 176
Query: 189 VDCSTN-GNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYE 247
+DCS GNNGC GG M+ AF+YI N+GI TE YPY+ + C K A + +
Sbjct: 177 IDCSAAYGNNGCNGGLMDNAFKYIKDNRGIDTEKSYPYEGIDDKCRYNPKNTGADDNGFV 236
Query: 248 EVPSGDEQALLKAV-SMQPVSIGIAAYTTEFKSYKEGI-FNGVC-GTQLDHAVTIVGFGT 304
++PSGDE L+ AV ++ PVS+ I A + F+ Y +G+ F+ C + LDH V +VG+GT
Sbjct: 237 DIPSGDEGKLMAAVATVGPVSVAIDASQSSFQFYSDGVYFDENCSSSSLDHGVLVVGYGT 296
Query: 305 TEDGANYWLIKNSWGDTWGDAGYMKILRD-EGLCGIGTQSSYPL 347
E+G +YWL+KNSWG +WGD GY+K+ R+ + CGI T +SYPL
Sbjct: 297 DENGGDYWLVKNSWGRSWGDLGYIKMARNRDNHCGIATAASYPL 340
>gi|4731372|gb|AAD28476.1|AF133838_1 papain-like cysteine protease [Sandersonia aurantiaca]
Length = 370
Score = 241 bits (615), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 116/252 (46%), Positives = 164/252 (65%), Gaps = 6/252 (2%)
Query: 101 NDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCC 160
N R T + + R+ ++ +Y+ + +P S+DWR+K AV PIKDQ CG C
Sbjct: 6 NSRPRRRTTYFGVRGAGRRTPGLASDRYRYRAGDALPDSVDWREKGAVVPIKDQGGCGSC 65
Query: 161 WAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATE 220
WAFS +A+VEGI KI +LI LSEQ+LVDC N+GC GG M+ AF++II N GI TE
Sbjct: 66 WAFSTIASVEGINKIVTGDLISLSEQELVDCDKTYNDGCNGGLMDYAFQFIIDNGGIDTE 125
Query: 221 DEYPYQAVQGTCSAAQK-AAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKS 279
+YPY G C + +K A I++YE+VP DEQAL KA + QP+++ I F+
Sbjct: 126 KDYPYTEQDGRCDSYRKNAKVVSINSYEDVPVNDEQALKKAAASQPIAVAIDGGGRSFQL 185
Query: 280 YKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD----EG 335
Y GIF G CGT LDH VT+VG+G +E G +YW+++NSWG++WG+ GY+++ R+ G
Sbjct: 186 YNSGIFTGKCGTSLDHGVTVVGYG-SESGKDYWIVRNSWGESWGEKGYIRMARNIDSPSG 244
Query: 336 LCGIGTQSSYPL 347
+CGI ++SYP+
Sbjct: 245 ICGIAMEASYPI 256
>gi|340371596|ref|XP_003384331.1| PREDICTED: digestive cysteine proteinase 2-like [Amphimedon
queenslandica]
Length = 327
Score = 241 bits (614), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 125/333 (37%), Positives = 193/333 (57%), Gaps = 17/333 (5%)
Query: 19 FIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYI 78
F+ ++LL+ S V+ E W ++G++Y+ E MR KI+ +N +Y+
Sbjct: 9 FVAVLLLIGLVSAAVND--------AEEWRLWKGKYGKTYRSIYEDNMRQKIWLQNRDYV 60
Query: 79 EKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPT 138
+ N + +++L N F+DLT +EF ++Y GY ++ ++Y + +P
Sbjct: 61 NEHNSM-DSSFQLEVNEFADLTAEEFSSIYNGYGKGRNRENHENTTIYRYTGGA---IPD 116
Query: 139 SLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNG 198
S+DWR K VTP+K+Q++CG CWAFS ++EG L+ LSEQ LVDC ++G
Sbjct: 117 SVDWRTKGLVTPVKNQKQCGSCWAFSTTGSLEGAHAKKTGKLVSLSEQNLVDCDKK-DHG 175
Query: 199 CGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALL 258
C GG M AF+YI +N+GI TE+ YPY+A G C + A + + + + D +AL
Sbjct: 176 CQGGLMTTAFKYIEENKGIDTEESYPYKAKNGRCEFKKDDIGATVERHVSILTTDCEALK 235
Query: 259 KAVS-MQPVSIGIAAYTTEFKSYKEGIFN-GVCGTQ-LDHAVTIVGFGTTEDGANYWLIK 315
KAV+ + P+S+ + A + F+ YK GI++ +C ++ LDH V +VG+G EDG YWL+K
Sbjct: 236 KAVAEIGPISVAMDASHSSFQLYKSGIYDPKICSSRKLDHGVLVVGYG-KEDGEEYWLVK 294
Query: 316 NSWGDTWGDAGYMKILRDEGLCGIGTQSSYPLA 348
NSWG WG GY KI + LCGI T + YP+
Sbjct: 295 NSWGKNWGMEGYFKIASKKNLCGICTSACYPVV 327
>gi|158268255|gb|ABW25047.1| cathepsin L-like protease [Strongylus vulgaris]
Length = 354
Score = 241 bits (614), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 139/355 (39%), Positives = 204/355 (57%), Gaps = 27/355 (7%)
Query: 18 MFIIIILLVSCASQVVS----SRSTH----------EQSVVEMHEKW---MAQHGRSYKD 60
MF ++ L++ CAS S SR H Q + E + W G+SY
Sbjct: 1 MFRLLSLVLLCASVFASIDSGSRRDHTIRLHRVKSLRQKIDEAFKLWDDYKEAFGKSYNK 60
Query: 61 ELEKEMRFKIFKENLEYIEKANKE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPS 117
+ E + + F +N+ +I++ N+E G +T+++G N +DL ++R L GY+
Sbjct: 61 DEENDY-MEAFVKNVIHIDEHNQEHRLGRKTFEMGLNSIADLPFSQYRKL-NGYRHRRNF 118
Query: 118 HRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISG 177
S S+ K+ ++P S+DWRDK VT +K+Q CG CWAFSA A+EG +
Sbjct: 119 GDSMQSNGTKWLAPFNVEIPDSVDWRDKGLVTDVKNQGMCGSCWAFSATGALEGQHARAS 178
Query: 178 ANLIQLSEQQLVDCSTN-GNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQ 236
++ LSEQ LVDCST GN+GC GG M+ AFEYI N GI TE+ YPY + C +
Sbjct: 179 GKMVSLSEQNLVDCSTKYGNHGCNGGLMDLAFEYIKDNHGIDTEESYPYVGRETKCHFKK 238
Query: 237 KAAAAKISNYEEVPSGDEQALLKAVSMQ-PVSIGIAAYTTEFKSYKEGI-FNGVCGT-QL 293
K A+ + ++P GDE+AL AV+ Q P+SI I A F+ YK+G+ ++ C + +L
Sbjct: 239 KDIGAEDKGFVDLPEGDEEALKVAVATQGPISIAIDAGHRTFQLYKKGVYYDEECSSEEL 298
Query: 294 DHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRDE-GLCGIGTQSSYPL 347
DH V +VG+GT + +YWLIKNSWG WG+ GY++I R+ CG+ T++SYPL
Sbjct: 299 DHGVLLVGYGTDPEAGDYWLIKNSWGPGWGEKGYIRIARNRSNHCGVATKASYPL 353
>gi|443708542|gb|ELU03619.1| hypothetical protein CAPTEDRAFT_17807 [Capitella teleta]
Length = 350
Score = 241 bits (614), Expect = 5e-61, Method: Compositional matrix adjust.
Identities = 128/305 (41%), Positives = 186/305 (60%), Gaps = 16/305 (5%)
Query: 54 HGRSYKDELEKEMRFKIFKENLEYIEKAN---KEGNRTYKLGTNRFSDLTNDEFRALYTG 110
H R+Y E E+ R ++F+ NL+ I+ N ++G Y++G N+F+D+ +EF ++ G
Sbjct: 50 HERTY-GETEESQRKEVFRNNLKKIQAHNHLHEQGKSPYRMGINQFADMEANEFASIMNG 108
Query: 111 YKMPSPSHRSTTSSTFKYQNLSM---TDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVA 167
++M ++R+ +S VP +DWR + VTP+K+Q +CG CWAFS
Sbjct: 109 FRM---NNRTEVRDHLHANYISPAIPVSVPAEVDWRKEGYVTPVKNQGQCGSCWAFSTTG 165
Query: 168 AVEGITKISGANLIQLSEQQLVDCSTN-GNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQ 226
++EG L+ LSEQ LVDCST+ GN GC GG ++ AF+YI N G TE YPY+
Sbjct: 166 SLEGQHFRKTGKLVSLSEQNLVDCSTSYGNEGCNGGIVDYAFQYIKDNDGDDTEACYPYE 225
Query: 227 AVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAVSM-QPVSIGIAAYTTEFKSYKEGIF 285
AV GTC A + Y ++P GDE + +AV++ PVS+ I A + F+ Y+ GI+
Sbjct: 226 AVDGTCRFKSVCVGATCTGYTDLPKGDEAKMKEAVALVGPVSVAIDASHSSFQMYQSGIY 285
Query: 286 --NGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD-EGLCGIGTQ 342
QLDHAV +VG+G TE G +YWL+KNSWG TWGD GY+K+ R+ + CGI +Q
Sbjct: 286 VEQECSPKQLDHAVLVVGYG-TEQGQDYWLVKNSWGTTWGDEGYIKMARNMDNQCGIASQ 344
Query: 343 SSYPL 347
+SYPL
Sbjct: 345 ASYPL 349
>gi|158268253|gb|ABW25046.1| cathepsin L-like protease [Strongylus vulgaris]
Length = 354
Score = 241 bits (614), Expect = 5e-61, Method: Compositional matrix adjust.
Identities = 139/355 (39%), Positives = 204/355 (57%), Gaps = 27/355 (7%)
Query: 18 MFIIIILLVSCASQVVS----SRSTH----------EQSVVEMHEKW---MAQHGRSYKD 60
MF ++ L++ CAS S SR H Q + E + W G+SY
Sbjct: 1 MFRLLSLVLLCASVFASIDSGSRHDHTIRLHRVKSLRQKIDEAFKLWDDYKESFGKSYNK 60
Query: 61 ELEKEMRFKIFKENLEYIEKANKE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPS 117
+ E + + F +N+ +I++ N+E G +T+++G N +DL ++R L GY+
Sbjct: 61 DEENDY-MEAFVKNVIHIDEHNQEHRLGRKTFEMGLNSIADLPFSQYRKL-NGYRHRRNF 118
Query: 118 HRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISG 177
S S+ K+ ++P S+DWRDK VT +K+Q CG CWAFSA A+EG +
Sbjct: 119 GDSMQSNGTKWLAPFNVEIPDSVDWRDKGLVTDVKNQGMCGSCWAFSATGALEGQHARAS 178
Query: 178 ANLIQLSEQQLVDCSTN-GNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQ 236
++ LSEQ LVDCST GN+GC GG M+ AFEYI N GI TE+ YPY + C +
Sbjct: 179 GKMVSLSEQNLVDCSTKYGNHGCNGGLMDLAFEYIKDNHGIDTEESYPYVGRETKCHFKK 238
Query: 237 KAAAAKISNYEEVPSGDEQALLKAVSMQ-PVSIGIAAYTTEFKSYKEGI-FNGVCGT-QL 293
K A+ + ++P GDE+AL AV+ Q P+SI I A F+ YK+G+ ++ C + +L
Sbjct: 239 KDIGAEDKGFVDLPEGDEEALKVAVATQGPISIAIDAGHRTFQLYKKGVYYDEECSSEEL 298
Query: 294 DHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRDE-GLCGIGTQSSYPL 347
DH V +VG+GT + +YWLIKNSWG WG+ GY++I R+ CG+ T++SYPL
Sbjct: 299 DHGVLLVGYGTDPEAGDYWLIKNSWGPGWGEKGYIRIARNRSNHCGVATKASYPL 353
>gi|262410743|gb|ACY66807.1| cathepsin L [Aphis gossypii]
Length = 341
Score = 241 bits (614), Expect = 5e-61, Method: Compositional matrix adjust.
Identities = 136/342 (39%), Positives = 197/342 (57%), Gaps = 17/342 (4%)
Query: 20 IIIILLVSCASQVVSSRSTHEQSVVEMHEKW---MAQHGRSYKDELEKEMRFKIFKENLE 76
+I++ LV A VSS + +E V+E E+W AQ + Y+D E+ R K++ +N
Sbjct: 4 VIVLGLVVFAISSVSSINLNE--VIE--EEWSLFKAQFKKIYEDVKEEAFRKKVYLDNKL 59
Query: 77 YIEKANK---EGNRTYKLGTNRFSDLTNDEFRALYTGYK--MPSPSHRSTTSSTFKYQNL 131
I + NK G TY L N F DL E++ + G+K + T +
Sbjct: 60 KIARHNKLYETGEETYALEMNHFGDLMQHEYKKMMNGFKPSLAGGDKNFTDDDAVTFLKS 119
Query: 132 SMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDC 191
VP ++DWR K VTP+K+Q +CG CW+FSA ++EG L+ LSEQ L+DC
Sbjct: 120 ENVVVPKAIDWRKKGYVTPVKNQGQCGSCWSFSATGSLEGQHFRKTGVLVSLSEQNLIDC 179
Query: 192 STN-GNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVP 250
S GNNGC GG M+ AF+YI N+G+ TE YPY+A C + + A + ++P
Sbjct: 180 SRKYGNNGCEGGLMDLAFKYIKSNKGLDTEKSYPYEAEDDKCRYNPENSGATDKGFVDIP 239
Query: 251 SGDEQALLKAV-SMQPVSIGIAAYTTEFKSYKEGIF-NGVC-GTQLDHAVTIVGFGTTED 307
GDE AL+ A+ ++ PVSI I A + +F+ YK+G+F N C T+LDH V VG+GT
Sbjct: 240 EGDEDALMHALATVGPVSIAIDASSEKFQFYKKGVFYNPRCSSTELDHGVLAVGYGTDHK 299
Query: 308 GANYWLIKNSWGDTWGDAGYMKILRD-EGLCGIGTQSSYPLA 348
G +YW++KNSWG TWGD GY+ + R+ + CG+ + +SYPL
Sbjct: 300 GGDYWIVKNSWGKTWGDQGYIMMARNKKNNCGVASSASYPLV 341
>gi|225706370|gb|ACO09031.1| Cathepsin L precursor [Osmerus mordax]
Length = 337
Score = 241 bits (614), Expect = 5e-61, Method: Compositional matrix adjust.
Identities = 135/342 (39%), Positives = 197/342 (57%), Gaps = 18/342 (5%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEY 77
M + +++LV C +++ Q E + W + H ++Y+ E E+ R ++++NL+
Sbjct: 1 MTLYLVVLVLCTGAALAAPRFDAQ-FDEHWDLWKSWHSKNYQHEKEEGWRRMVWEKNLKK 59
Query: 78 IEKANKE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT 134
IE N E G +Y LG N F D+TN+EFR + GYK+ R S F N
Sbjct: 60 IEMHNLEHSLGKHSYSLGMNHFGDMTNEEFRQVMNGYKL---QQRKFKGSLFLEPN--NM 114
Query: 135 DVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCS-T 193
+ P +DWR++ VTP+KDQ +CG CWAFS A+EG L+ LSEQ LVDCS
Sbjct: 115 EAPKQVDWREEGYVTPVKDQGQCGSCWAFSTTGAMEGQMFRKTQKLVSLSEQNLVDCSRP 174
Query: 194 NGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQG-TCSAAQKAAAAKISNYEEVPSG 252
GN GC GG M++AF+YI N G+ +E+ YPY C+ + +AA + + ++PSG
Sbjct: 175 EGNEGCNGGLMDQAFQYIQDNSGLDSEEAYPYLGTDDQPCNYKAEFSAANDTGFMDIPSG 234
Query: 253 DEQALLKAV-SMQPVSIGIAAYTTEFKSYKEGI-FNGVCGT-QLDHAVTIVGF---GTTE 306
E AL+KA+ S+ PVS+ I A F+ Y+ GI + C + +LDH V VG+ G
Sbjct: 235 KEHALMKAIASVGPVSVAIDAGHESFQFYQSGIYYEKECSSEELDHGVLAVGYGFEGEDV 294
Query: 307 DGANYWLIKNSWGDTWGDAGYMKILRD-EGLCGIGTQSSYPL 347
DG YW++KNSW + WGD GY+ + +D + CGI T +SYPL
Sbjct: 295 DGKKYWIVKNSWSEKWGDKGYILMAKDRKNHCGIATAASYPL 336
>gi|146216002|gb|ABQ10203.1| cysteine protease Cp5 [Actinidia deliciosa]
Length = 509
Score = 241 bits (614), Expect = 5e-61, Method: Compositional matrix adjust.
Identities = 135/326 (41%), Positives = 191/326 (58%), Gaps = 20/326 (6%)
Query: 37 STHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRT--YKLGTN 94
S E+ VVE+ +KW +HG+ YK E E +F+ F++NL Y+ + N E + + +G N
Sbjct: 41 SIAEERVVELFKKWTEKHGKVYKHGQEVEKKFQNFRDNLRYVMEKNGERGASGGHLVGLN 100
Query: 95 RFSDLTNDEFRALYTGYKMPSPS------HRSTTSSTFKYQNLSMTDVPTSLDWRDKKAV 148
+F+D++N+EFR +Y K+ P+ R + ++ D PTSLDWR V
Sbjct: 101 KFADMSNEEFREVYVS-KVKKPTSKRMAIERRRQGKAAAAKAVAACDGPTSLDWRKYGIV 159
Query: 149 TPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAF 208
T +KDQ +CG CWAFS+ A+EGI ++ +LI LSEQ+LVDC + N+GC GG M+ AF
Sbjct: 160 TGVKDQGDCGSCWAFSSTGAIEGINALANGDLISLSEQELVDCDST-NDGCEGGYMDYAF 218
Query: 209 EYIIQNQGIATEDEYPYQAVQGTC-SAAQKAAAAKISNYEEVPSGDEQALLKAVSMQPVS 267
E+++ N GI TE +YPY GTC + ++ A I YE+V +E AL AV QP+S
Sbjct: 219 EWVMSNGGIDTETDYPYTGEDGTCNTTKEETKAVSIDGYEDVAE-EESALFCAVLKQPIS 277
Query: 268 IGIAAYTTEFKSYKEGIF---NGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGD 324
+GI +F+ Y GI+ +DHAV +VG+G E G YW+IKNSWG WG
Sbjct: 278 VGIDGGAIDFQLYTGGIYDGDCSDDPDDIDHAVLVVGYG-AESGEEYWIIKNSWGTDWGM 336
Query: 325 AGYMKILR----DEGLCGIGTQSSYP 346
GY I R D G+C I +SYP
Sbjct: 337 KGYAYIKRNTSKDYGVCAINAMASYP 362
>gi|357153071|ref|XP_003576329.1| PREDICTED: fruit bromelain-like [Brachypodium distachyon]
Length = 398
Score = 241 bits (614), Expect = 5e-61, Method: Compositional matrix adjust.
Identities = 141/331 (42%), Positives = 186/331 (56%), Gaps = 34/331 (10%)
Query: 48 EKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEG---NRTYKLGTNRFSDLTNDEF 104
+ WMA GRSY E RF+++K N+ YIE N E T++LG F+DLT++EF
Sbjct: 63 QGWMAAQGRSYWTAEETARRFEVYKSNVRYIEAVNAEAATTGLTFELGEGPFTDLTHEEF 122
Query: 105 RALYTGYKMPSPSHR------------------STTSSTFKYQNLSMTDV----PTSLDW 142
ALY G MP P + + NLS P S DW
Sbjct: 123 SALYNG-SMPPPEEEEGDDIQEEDEQVIATVVDGVDVNVAVHTNLSAGGPRPWPPRSRDW 181
Query: 143 RDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGG 202
R AVTPIKDQ CG CWAF VA +EG KI NL+ LSEQQL+DC N+GC GG
Sbjct: 182 RKHGAVTPIKDQGRCGSCWAFPTVATIEGKHKIVRGNLVSLSEQQLIDCDYT-NSGCKGG 240
Query: 203 TMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAVS 262
+ +A+ +I + G+ T YPY+ +G C ++ AAA+I+ + V S E AL+ AV+
Sbjct: 241 FVIRAYRWIRKIGGLTTSSAYPYKGARGKC-MKRRRAAARIAGWRSVRSRSEVALVNAVA 299
Query: 263 MQPVSIGIAAYTTEFKSYKEGIFNGVCGT-QLDHAVTIVGFGTTED-GANYWLIKNSWGD 320
QPV++ I+A F+ YK+GI NG C T +L+HAVT+VG+G D GA YW++KNSWG
Sbjct: 300 GQPVAVYISASGKNFQHYKKGILNGPCDTARLNHAVTVVGYGRQADTGAKYWIVKNSWGT 359
Query: 321 TWGDAGYMKILR----DEGLCGIGTQSSYPL 347
TWG GY+ + R G CGI T +PL
Sbjct: 360 TWGQEGYILMKRGTRNPRGQCGIATSPVFPL 390
>gi|269784818|ref|NP_001161481.1| cathepsin L1 precursor [Gallus gallus]
Length = 353
Score = 240 bits (613), Expect = 6e-61, Method: Compositional matrix adjust.
Identities = 146/361 (40%), Positives = 201/361 (55%), Gaps = 26/361 (7%)
Query: 2 VLIFERSGSFKINTIPMFIIIIL---LVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSY 58
VL R S +N I+ L L A +V +H Q W + H + Y
Sbjct: 3 VLFLARRLSRFVNMNVCLTILSLCLGLAFAAPRVDPDLDSHWQL-------WKSWHSKDY 55
Query: 59 KDELEKEMRFKIFKENLEYIEKANKE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPS 115
E E+ R ++++NL+ IE N + G +YKLG N+F D+T +EFR L GYK
Sbjct: 56 H-EREESWRRVVWEKNLKMIELHNLDHSLGKHSYKLGMNQFGDMTAEEFRQLMNGYKHKK 114
Query: 116 PSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKI 175
S R S F S + P S+DWR+K VTP+KDQ +CG CWAFS A+EG
Sbjct: 115 -SERKYRGSQF--LEPSFLEAPRSVDWREKGYVTPVKDQGQCGSCWAFSTTGALEGQHFR 171
Query: 176 SGANLIQLSEQQLVDCS-TNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQG-TCS 233
L+ LSEQ LVDCS GN GC GG M++AF+Y+ N GI +E+ YPY A C
Sbjct: 172 KTGKLVSLSEQNLVDCSRPEGNQGCNGGLMDQAFQYVQDNGGIDSEESYPYTAKDDEDCR 231
Query: 234 AAQKAAAAKISNYEEVPSGDEQALLKAV-SMQPVSIGIAAYTTEFKSYKEGI-FNGVCGT 291
+ AA + + ++P G E+AL+KAV S+ PVS+ I A + F+ Y+ GI + C +
Sbjct: 232 YKAEYNAANDTGFVDIPQGHERALMKAVASVGPVSVAIDAGHSSFQFYQSGIYYEPDCSS 291
Query: 292 Q-LDHAVTIVGF---GTTEDGANYWLIKNSWGDTWGDAGYMKILRD-EGLCGIGTQSSYP 346
+ LDH V +VG+ G DG YW++KNSWG+ WGD GY+ + +D + CGI T +SYP
Sbjct: 292 EDLDHGVLVVGYGFEGEDVDGKKYWIVKNSWGEKWGDKGYIYMAKDRKNHCGIATAASYP 351
Query: 347 L 347
L
Sbjct: 352 L 352
>gi|118429523|gb|ABK91809.1| cathepsin L-like proteinase precursor [Clonorchis sinensis]
Length = 373
Score = 240 bits (613), Expect = 6e-61, Method: Compositional matrix adjust.
Identities = 140/341 (41%), Positives = 194/341 (56%), Gaps = 29/341 (8%)
Query: 29 ASQVVSSRSTHEQSVV---------EMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIE 79
AS + S S H Q V+ + + +M + R+Y D E E RFKIF N I
Sbjct: 39 ASPLTSLDSMHMQDVIGVDWNFTLSSIWKHFMTTYKRNYIDPSEHERRFKIFANNFVRIS 98
Query: 80 KANK---EGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDV 136
K N +G +Y +G N FSD T++E + L ++ + R + KY ++
Sbjct: 99 KHNVRFIQGQVSYTMGINEFSDKTDEELKRLRC-FRGSLNASRDGS----KYITIAAPP- 152
Query: 137 PTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTN-G 195
P+ +DWR+K AVTP+K+Q CG CWAFSA A+EG ++ NL+ LSEQQLVDCS+ G
Sbjct: 153 PSEIDWRNKGAVTPVKNQGNCGSCWAFSATGAIEGQNFLATGNLVSLSEQQLVDCSSEYG 212
Query: 196 NNGCGGGTMEKAFEYIIQNQGIATEDEYPYQA-----VQGTCSAAQKAAAAKISNYEEVP 250
NN C GG M+ AF+Y+ + GI TE YPY + TC K A +++ Y ++P
Sbjct: 213 NNACNGGLMDNAFKYVKDSNGIDTEASYPYVSGETGDANPTCRFNLKEAVVRVTGYIDLP 272
Query: 251 SGDEQALLKAVSMQ-PVSIGIAAYTTEFKSYKEGIF-NGVCGT-QLDHAVTIVGFGTTED 307
G L +AV P+S+ I A F SYK G++ + C + LDH V +VG+G E+
Sbjct: 273 RGQVSELKQAVGHYGPISVAINAGLPSFMSYKSGVYSDDQCSSDDLDHGVLLVGYG-EEN 331
Query: 308 GANYWLIKNSWGDTWGDAGYMKILRDE-GLCGIGTQSSYPL 347
G YWLIKNSWG WG+ GY+KILRD LCG+ + +SYPL
Sbjct: 332 GIPYWLIKNSWGPHWGENGYVKILRDHNNLCGVASMASYPL 372
>gi|449673497|ref|XP_002169904.2| PREDICTED: cathepsin L-like [Hydra magnipapillata]
Length = 325
Score = 240 bits (613), Expect = 7e-61, Method: Compositional matrix adjust.
Identities = 127/303 (41%), Positives = 183/303 (60%), Gaps = 13/303 (4%)
Query: 50 WMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYT 109
W H ++Y E E+ +R+ I+K+N+ I + N + ++ L N F D+TN EFRA
Sbjct: 30 WKMAHNKAYSHESEENVRYAIWKDNMNRITEYNSK-SKNVILRMNHFGDMTNTEFRAKMN 88
Query: 110 GYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAV 169
G + H+ STF S T P ++DWR + VTP+K+Q +CG CWAFS+ A+
Sbjct: 89 GLLL----HKHQNGSTFLVP--SHTAAPDAVDWRSEGYVTPVKNQGQCGSCWAFSSTGAL 142
Query: 170 EGITKISGANLIQLSEQQLVDCSTN-GNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAV 228
EG L+ LSEQ LVDCST+ GNNGC GG M+ AF YI N GI TE YPY+
Sbjct: 143 EGQHFKKTGRLVSLSEQNLVDCSTDYGNNGCNGGLMDNAFSYIKANGGIDTETGYPYEGQ 202
Query: 229 QGTCSAAQKAAAAKISNYEEVPSGDEQALLKAV-SMQPVSIGIAAYTTEFKSYKEGIFN- 286
GTC ++ + A + + ++P GDE AL +AV ++ PVS+ I A F+ Y G+++
Sbjct: 203 DGTCRYSKSSIGADDTGFVDIPEGDEDALKQAVATVGPVSVAIDASHMSFQFYHSGVYDE 262
Query: 287 -GVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILR-DEGLCGIGTQSS 344
+ LDH V +VG+G T++G +YWL+KNSWG WG GY+ + R ++ CGI +++S
Sbjct: 263 PQCSPSALDHGVLVVGYG-TDNGKDYWLVKNSWGTGWGTEGYIYMSRNNQNQCGIASKAS 321
Query: 345 YPL 347
YPL
Sbjct: 322 YPL 324
>gi|244539471|dbj|BAH82657.1| cysteine protease [Lotus japonicus]
Length = 286
Score = 240 bits (613), Expect = 7e-61, Method: Compositional matrix adjust.
Identities = 117/252 (46%), Positives = 171/252 (67%), Gaps = 6/252 (2%)
Query: 43 VVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTND 102
++E+ E WM++HG+ Y+ EK +RF+IFK+NL++I++ NK + Y LG N F+DL++
Sbjct: 4 LIELFESWMSRHGKIYESIEEKLLRFEIFKDNLKHIDETNKVVS-NYWLGLNEFADLSHH 62
Query: 103 EFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWA 162
EF+ Y G K+ + R + S F Y+++ D+P S+DWR K AVT IK+Q CG CWA
Sbjct: 63 EFKKQYLGLKVDFSTRRES-SEEFTYRDV---DLPKSVDWRKKGAVTNIKNQGSCGSCWA 118
Query: 163 FSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDE 222
FS VAAVEGI +I NL LSEQ+L+DC N+GC GG M+ AF +I++N G+ ED+
Sbjct: 119 FSTVAAVEGINQIVTGNLTSLSEQELIDCDRTYNSGCNGGLMDYAFSFIVENGGLHKEDD 178
Query: 223 YPYQAVQGTCS-AAQKAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYK 281
YPY +GTC + +++ IS Y +VP +EQ+LLKA++ QP+S+ I A +F+ Y
Sbjct: 179 YPYIMEEGTCEMSKEESQVVTISGYHDVPQNNEQSLLKALANQPLSVAIEASGRDFQFYS 238
Query: 282 EGIFNGVCGTQL 293
G+F+G CGTQL
Sbjct: 239 GGVFDGHCGTQL 250
>gi|389610697|dbj|BAM18960.1| cathepsin L [Papilio polytes]
Length = 341
Score = 240 bits (612), Expect = 8e-61, Method: Compositional matrix adjust.
Identities = 131/344 (38%), Positives = 196/344 (56%), Gaps = 23/344 (6%)
Query: 20 IIIILLVSCASQVVSSRSTHEQSVVEMHEKWMA---QHGRSYKDELEKEMRFKIFKENLE 76
+++++ V A+ VS + E+W A +H + Y E+E + R KI+ EN
Sbjct: 4 LVVLMCVVAAASAVSFFDL-------VKEEWNAFKMEHQKQYDSEVEDKFRMKIYAENKH 56
Query: 77 YIEKANKE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
I K N++ G +++ N++ D+ + EF G+ + + + + + +
Sbjct: 57 KIAKHNQKFARGQVPFRVKQNKYGDMLHHEFVHTMNGFNKTTKNGKGLFGKSAGERGATF 116
Query: 134 -----TDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQL 188
VP +DWR AVT +KDQ +CG CW+FSA A+EG L+ LSEQ L
Sbjct: 117 IPPANVRVPDHVDWRKHGAVTEVKDQGKCGSCWSFSATGALEGQHYRQTNILVSLSEQNL 176
Query: 189 VDCSTN-GNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYE 247
+DCST GNNGC GG M+ AF+YI N+GI TE YPY+AV C + + A +
Sbjct: 177 IDCSTAYGNNGCNGGLMDNAFKYIKDNKGIDTEKSYPYEAVDDKCRYNPRNSGADDVGFI 236
Query: 248 EVPSGDEQALLKAV-SMQPVSIGIAAYTTEFKSYKEGI-FNGVCG-TQLDHAVTIVGFGT 304
++PSGDE L+ AV ++ PVS+ I A F+ Y +G+ F+ C T LDH V +VG+GT
Sbjct: 237 DIPSGDEGKLMAAVATVGPVSVAIDASQETFQFYSDGVYFDENCSSTSLDHGVLVVGYGT 296
Query: 305 TEDGANYWLIKNSWGDTWGDAGYMKILRD-EGLCGIGTQSSYPL 347
E+G +YWL+KNSWG +WGD GY+K+ R+ + CGI T +S+PL
Sbjct: 297 DENGGDYWLVKNSWGRSWGDLGYIKMARNRDNHCGIATAASFPL 340
>gi|242020003|ref|XP_002430447.1| Cathepsin L precursor, putative [Pediculus humanus corporis]
gi|212515585|gb|EEB17709.1| Cathepsin L precursor, putative [Pediculus humanus corporis]
Length = 345
Score = 240 bits (612), Expect = 8e-61, Method: Compositional matrix adjust.
Identities = 134/342 (39%), Positives = 199/342 (58%), Gaps = 18/342 (5%)
Query: 22 IILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKA 81
I+ ++ +++ S ++ V+E + + A+H ++Y +++E++ R KIF +N + I K
Sbjct: 3 ILFFIALTVLSINAVSFYDL-VMEEWQLFKAEHKKNYNNDVEEKFRMKIFMDNKQKITKH 61
Query: 82 N---KEGNRTYKLGTNRFSDLTNDEFRALYTGYK--MPSPSHRSTTSSTFKYQNLSM--- 133
N + G YKLG N++SD+ + EF + G+ + P RS T + +
Sbjct: 62 NTKYQRGEVGYKLGLNKYSDMLHHEFINTFNGFNKSIIPPHLRSNNGKTHLKGSFFIPPA 121
Query: 134 -TDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCS 192
+P +DW AVTP+KDQ CG CWAFSA A+EG+ L+ LSEQ L+DCS
Sbjct: 122 NVKLPKHVDWVKLGAVTPVKDQGHCGSCWAFSATGALEGLHFRKTKVLVSLSEQNLIDCS 181
Query: 193 T-NGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPS 251
T GNNGC GG M++AF+Y+ N GI TE YPY+ C + + A + Y +VP
Sbjct: 182 TEEGNNGCNGGLMDQAFQYVRINGGIDTERSYPYEGNNDVCRYEPENSGAIDTGYTDVPL 241
Query: 252 GDEQALLKAV-SMQPVSIGIAAYTTEFKSYKEGI-FNGVCGTQ---LDHAVTIVGFGTTE 306
GDE AL AV ++ PVS+ I A F+ Y G+ F C + LDH V +VG+GT E
Sbjct: 242 GDEDALKSAVATVGPVSVAIDASQESFQLYSSGVYFEPNCKNEPESLDHGVLVVGYGTDE 301
Query: 307 DG-ANYWLIKNSWGDTWGDAGYMKILRD-EGLCGIGTQSSYP 346
+ +YWL+KNSWGD+WG+ GY+K+ R+ + CGI TQ S+P
Sbjct: 302 ETQQDYWLVKNSWGDSWGENGYIKMARNADNQCGIATQPSFP 343
>gi|357122137|ref|XP_003562772.1| PREDICTED: fruit bromelain-like [Brachypodium distachyon]
Length = 358
Score = 240 bits (612), Expect = 8e-61, Method: Compositional matrix adjust.
Identities = 135/315 (42%), Positives = 186/315 (59%), Gaps = 21/315 (6%)
Query: 52 AQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGY 111
A + R+Y E+ RF++++ N++YIE N+ G+ TY+LG N+F+DLT EFRA+YT
Sbjct: 45 ATYNRTYASPEERLRRFEVYRRNVDYIEAMNRRGDLTYELGENQFADLTVQEFRAMYTMP 104
Query: 112 ----KMPSPSHRSTTSSTFK----------YQNLSMTDVPTSLDWRDKKAVTPIKDQQEC 157
P R +T Y + PTS+DWR K AVTP+KDQ C
Sbjct: 105 ARVDSRPDAWRRRQMITTLAGPVTEDGGSYYSDAWEEAGPTSVDWRSKGAVTPVKDQGGC 164
Query: 158 GCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGI 217
GCCWAF+ VA +EG+ KI L+ LSEQ+LVDC ++GCGGG E A E++ N G+
Sbjct: 165 GCCWAFATVATIEGLHKIKTGQLVSLSEQELVDCDDA-DDGCGGGLPEIAMEWVAHNGGL 223
Query: 218 ATEDEYPYQAVQGTCSAAQKA-AAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTE 276
TE YPY G C + + AAKI+ + V + E L +AV+ QPV++ I A
Sbjct: 224 TTEANYPYTGKAGKCDRGKASNHAAKIAAAQMVRANSEAELERAVARQPVAVAINA-PDS 282
Query: 277 FKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILR---- 332
YK G+++G C + DHAVT+VG+G G YW+IKNSW +TWG+ GY ++ R
Sbjct: 283 LMFYKSGVYSGPCTAEFDHAVTVVGYGADNKGHKYWIIKNSWAETWGEKGYGRMQRGVAA 342
Query: 333 DEGLCGIGTQSSYPL 347
EGLCGI T +SYP+
Sbjct: 343 KEGLCGIATHASYPV 357
>gi|209693435|ref|NP_001129410.1| cathepsin L precursor [Acyrthosiphon pisum]
gi|251823771|ref|NP_001156569.1| cathepsin L precursor [Acyrthosiphon pisum]
Length = 341
Score = 240 bits (612), Expect = 9e-61, Method: Compositional matrix adjust.
Identities = 135/342 (39%), Positives = 197/342 (57%), Gaps = 17/342 (4%)
Query: 20 IIIILLVSCASQVVSSRSTHEQSVVEMHEKW---MAQHGRSYKDELEKEMRFKIFKENLE 76
+I++ LV+ A VSS + +E V+E E+W Q + Y+D E+ R K++ +N
Sbjct: 4 VIVLGLVAFAISTVSSINLNE--VIE--EEWSLFKIQFKKLYEDIKEETFRKKVYLDNKL 59
Query: 77 YIEKANK---EGNRTYKLGTNRFSDLTNDEFRALYTGYK--MPSPSHRSTTSSTFKYQNL 131
I + NK G TY L N F DL E+ + G+K + T +
Sbjct: 60 KIARHNKLYESGEETYALEMNHFGDLMQHEYTKMMNGFKPSLAGGDRNFTNDEAVTFLKS 119
Query: 132 SMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDC 191
+P S+DWR K VTP+K+Q +CG CW+FSA ++EG L+ LSEQ L+DC
Sbjct: 120 ENVVIPKSVDWRKKGYVTPVKNQGQCGSCWSFSATGSLEGQHFRKTGVLVSLSEQNLIDC 179
Query: 192 STN-GNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVP 250
S GNNGC GG M+ AF+YI N+G+ TE YPY+A C + + A + ++P
Sbjct: 180 SRKYGNNGCEGGLMDLAFKYIKSNKGLDTEKSYPYEAEDDKCRYNPENSGATDKGFVDIP 239
Query: 251 SGDEQALLKAV-SMQPVSIGIAAYTTEFKSYKEGIF-NGVC-GTQLDHAVTIVGFGTTED 307
GDE AL+ A+ ++ PVSI I A + +F+ YK+G+F N C T+LDH V VGFG+ +
Sbjct: 240 EGDEDALMHALATVGPVSIAIDASSEKFQFYKKGVFYNPRCSSTELDHGVLAVGFGSDKK 299
Query: 308 GANYWLIKNSWGDTWGDAGYMKILRD-EGLCGIGTQSSYPLA 348
G +YW++KNSWG TWGD GY+ + R+ + CG+ + +SYPL
Sbjct: 300 GGDYWIVKNSWGKTWGDEGYIMMARNKKNNCGVASSASYPLV 341
>gi|392873948|gb|AFM85806.1| cathepsin L [Callorhinchus milii]
Length = 338
Score = 239 bits (611), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 136/342 (39%), Positives = 200/342 (58%), Gaps = 17/342 (4%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEY 77
M + ++L C + +++ S + + E+W + HG+SY ++ E+ R +++++L
Sbjct: 1 MRLPFVVLSLCLAGGLAAPSL-DPGLDTHWEQWKSWHGKSY-EQKEETWRRMVWEKHLRV 58
Query: 78 IEKANKE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT 134
IE N E G +++LG N F D+ N+EFR L GYK +H+ S F N
Sbjct: 59 IEIHNLEHSLGKHSFRLGMNHFGDMPNEEFRQLMNGYKYKQ-THKKLQGSHFLEPNF--L 115
Query: 135 DVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCS-T 193
+VP +DWRD+ VTP+KDQ +CG CWAFS A+EG L+ LSEQ LV+CS
Sbjct: 116 EVPKHVDWRDEGYVTPVKDQGQCGSCWAFSTTGALEGQHFRRTGQLVSLSEQNLVECSKP 175
Query: 194 NGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGT-CSAAQKAAAAKISNYEEVPSG 252
GN GC GG M++AF+Y+ N GI +ED YPY T C + AA + + ++PSG
Sbjct: 176 EGNEGCNGGLMDQAFQYVKDNGGIDSEDSYPYVGTDDTPCHYNPQYNAANDTGFVDIPSG 235
Query: 253 DEQALLKAV-SMQPVSIGIAAYTTEFKSYKEGI-FNGVC-GTQLDHAVTIVGFGTTE--- 306
E+AL+KA+ ++ PVS+ I A T F+ Y+ GI F C T LDH V +VG+G +
Sbjct: 236 KERALMKAIAAVGPVSVAIDAGHTSFQFYQSGIYFEAECSSTDLDHGVLVVGYGVEKRDT 295
Query: 307 DGANYWLIKNSWGDTWGDAGYMKILRD-EGLCGIGTQSSYPL 347
DG YW++KNSW + G GY+ + +D + CGI T +SYPL
Sbjct: 296 DGKKYWIVKNSWSEKLGQNGYILMAKDKDNHCGIATAASYPL 337
>gi|380014284|ref|XP_003691169.1| PREDICTED: cathepsin L-like [Apis florea]
Length = 345
Score = 239 bits (611), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 133/348 (38%), Positives = 203/348 (58%), Gaps = 26/348 (7%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQSVVEM-HEKWMA---QHGRSYKDELEKEMRFKIFKE 73
M + +IL ++ + V H S E+ +++WM +H ++YK ++E+ R KIF +
Sbjct: 1 MKLFLILFITIFATV------HAVSFFELVNQEWMTFKMEHKKAYKSDVEERFRMKIFMD 54
Query: 74 NLEYIEKANKE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMP----SPSHRSTTSSTF 126
N I K N +YKL N++ D+ + EF + G+ S R ++F
Sbjct: 55 NKHKIAKHNSNYEMKKVSYKLKMNKYGDMLHHEFVNILNGFNKSINTQLRSERMPIGASF 114
Query: 127 -KYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSE 185
+ N+++ P +DWR + AVTP+KDQ CG CW+FSA A+EG L+ LSE
Sbjct: 115 IEPANVAL---PKKVDWRKEGAVTPVKDQGHCGSCWSFSATGALEGQHFRRTGVLVSLSE 171
Query: 186 QQLVDCSTN-GNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKIS 244
Q L+DCS GNNGC GG M++AF+YI N+G+ TE YPY+A C + A
Sbjct: 172 QNLIDCSGKYGNNGCNGGLMDQAFQYIKDNKGLDTEASYPYEAENDKCRYNPANSGAIDV 231
Query: 245 NYEEVPSGDEQALLKAV-SMQPVSIGIAAYTTEFKSYKEGIF-NGVCGTQ-LDHAVTIVG 301
Y ++P+G+E+ L AV ++ PVS+ I A F+ Y EG++ C ++ LDH V ++G
Sbjct: 232 GYIDIPTGNEKLLKAAVATIGPVSVAIDASHQSFQFYSEGVYYEPECSSEELDHGVLVIG 291
Query: 302 FGTTEDGANYWLIKNSWGDTWGDAGYMKILRDE-GLCGIGTQSSYPLA 348
+GT E+G +YWL+KNSWG+TWG+ GY+K+ R++ CGI + +SYPL
Sbjct: 292 YGTNENGEDYWLVKNSWGETWGNNGYIKMARNKLNHCGIASSASYPLV 339
>gi|354502593|ref|XP_003513368.1| PREDICTED: cathepsin L1-like isoform 2 [Cricetulus griseus]
Length = 330
Score = 239 bits (611), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 142/337 (42%), Positives = 195/337 (57%), Gaps = 18/337 (5%)
Query: 19 FIIIILLVSCASQVVSSRSTHEQSV-VEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEY 77
I I+ L + VVS+ TH+ S+ E HE W QHG++Y + E + R +++ N +
Sbjct: 1 MIPILFLATLCLGVVSAAPTHDPSLDAEWHE-WKTQHGKTYVMDEEGQKR-AVWENNRKM 58
Query: 78 IEKANKE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT 134
IE N++ G + L N F DLTN EFR L TG++ S +T + F Q +
Sbjct: 59 IELHNEDYTKGKHGFHLEMNAFGDLTNTEFRQLMTGFQ----SMGTTEMNVF--QEPRLG 112
Query: 135 DVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCS-T 193
DVP S+DWR VTP+KDQ C CWAFSAV ++EG L+ LSEQ LVDCS +
Sbjct: 113 DVPKSVDWRKHGYVTPVKDQGSCVSCWAFSAVGSLEGQMFRKTGKLVPLSEQNLVDCSRS 172
Query: 194 NGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGD 253
NNGC GG AF+YI N G+ T + YPY+A G C K +AA I+ + VPS +
Sbjct: 173 QHNNGCHGGLFTSAFQYIKDNGGLDTSESYPYEAQDGPCRYDPKHSAANITGFVVVPS-N 231
Query: 254 EQALLKAV-SMQPVSIGIAAYTTEFKSYKEGI-FNGVCGTQL-DHAVTIVGFGTTEDGAN 310
E+AL+KAV ++ P+SIGI+ YK G ++ C +H+V +VG+G DG
Sbjct: 232 EEALMKAVATVGPISIGISVRLRSLLFYKSGFYYDPDCYNHYPNHSVLLVGYGEESDGQK 291
Query: 311 YWLIKNSWGDTWGDAGYMKILRDE-GLCGIGTQSSYP 346
YWL+KNSWG+ WG GY+KI +D C I T ++YP
Sbjct: 292 YWLVKNSWGEEWGMDGYIKIAKDRNNHCSIATIAAYP 328
>gi|228245|prf||1801240C Cys protease 3
Length = 321
Score = 239 bits (611), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 133/307 (43%), Positives = 184/307 (59%), Gaps = 15/307 (4%)
Query: 48 EKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKE---GNRTYKLGTNRFSDLTNDEF 104
+ + Q+GR Y D E+ R ++F++N + IE NK+ G T+K+ N+F D+TN+EF
Sbjct: 20 DHFKTQYGRKYGDAKEELYRQRVFQQNEQLIEDFNKKFENGEVTFKVAMNQFGDMTNEEF 79
Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFS 164
A+ GYK S R + F + M +DWR K VTP+KDQ++CG CWAFS
Sbjct: 80 NAVMKGYKKGS---RGEPKAVFTAEGRPMA---RDVDWRTKALVTPVKDQEQCGSCWAFS 133
Query: 165 AVAAVEGITKISGANLIQLSEQQLVDCSTN-GNNGCGGGTMEKAFEYIIQNQGIATEDEY 223
A A+EG + L+ LSEQQLVDCST+ GN+GCGGG M AF+YI N GI TE Y
Sbjct: 134 ATGALEGQHFLKNDELVSLSEQQLVDCSTDYGNDGCGGGWMTSAFDYIKDNGGIDTESSY 193
Query: 224 PYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAVS-MQPVSIGIAAYTTEFKSYKE 282
PY+A +C + A + E+ E+AL +AVS + P+S+ I A F+ Y
Sbjct: 194 PYEAEDRSCRFDANSIGAICTGSVEIVQHTEEALQEAVSGVGPISVAIDASHFSFQFYSS 253
Query: 283 GIF--NGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD-EGLCGI 339
G++ T LDH V VG+G TE +YWL+KNSWG +WGDAGY+K+ R+ + CGI
Sbjct: 254 GVYYEQNCSPTFLDHGVLAVGYG-TESTKDYWLVKNSWGSSWGDAGYIKMSRNRDNNCGI 312
Query: 340 GTQSSYP 346
++ SYP
Sbjct: 313 ASEPSYP 319
>gi|159792912|gb|ABW98676.1| cathepsin L [Apostichopus japonicus]
Length = 332
Score = 239 bits (611), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 129/307 (42%), Positives = 181/307 (58%), Gaps = 13/307 (4%)
Query: 48 EKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKE---GNRTYKLGTNRFSDLTNDEF 104
E W H + Y E E++ R KI+++NL+ + K N E G +Y LG N+++DL +EF
Sbjct: 29 EAWKQTHSKQYTKE-EEDNRRKIWEDNLQKVSKHNTEHSLGLHSYTLGMNKYADLRGEEF 87
Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFS 164
+ G K + R K+ + + P S+DWRD+ VTP+KDQ +CG CWAFS
Sbjct: 88 VQMMNGLKFDASRERQG----IKFLSYAKFQAPDSVDWRDEGYVTPVKDQGQCGSCWAFS 143
Query: 165 AVAAVEGITKISGANLIQLSEQQLVDCSTN-GNNGCGGGTMEKAFEYIIQNQGIATEDEY 223
++EG S L LSEQ LVDCS + GNNGC GG M+ AF+YI N GI TED+Y
Sbjct: 144 TTGSLEGQHFRSTGVLTSLSEQNLVDCSISYGNNGCEGGLMDYAFQYIKDNLGIDTEDKY 203
Query: 224 PYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAVSMQ-PVSIGIAAYTTEFKSYKE 282
PY+A TC + A S Y +V SGDE AL +A + P+S+ I A F+ Y+
Sbjct: 204 PYEAEDDTCRFSPDNVGATDSGYVDVDSGDEDALKEACAANGPISVAIDASHESFQLYES 263
Query: 283 GIFN--GVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD-EGLCGI 339
G+++ +LDH V +VG+GT G +YW++KNSWG +WG GY+ + R+ + CGI
Sbjct: 264 GVYDEESCSSIELDHGVLVVGYGTDSVGGDYWIVKNSWGLSWGQEGYIWMSRNKDNQCGI 323
Query: 340 GTQSSYP 346
T +SYP
Sbjct: 324 ATSASYP 330
>gi|324512246|gb|ADY45078.1| Cathepsin L [Ascaris suum]
Length = 388
Score = 239 bits (611), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 131/318 (41%), Positives = 188/318 (59%), Gaps = 17/318 (5%)
Query: 43 VVEMHEKWM---AQHGRSYKDELEKEMRFKIFKENLEYIEKAN---KEGNRTYKLGTNRF 96
+ + +E+W QHG++Y+DE + F NLE I K N + G ++++GTN
Sbjct: 76 IKQGYEQWRLFKEQHGKNYEDEETENDHMLAFLSNLEEIRKHNARYQRGESSFEMGTNHI 135
Query: 97 SDLTNDEFRALYTGYK-MPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQ 155
+DL +E+R L GYK SHR+ T + +VP DWRD VT +K+Q
Sbjct: 136 TDLPFEEYRKL-NGYKPRYDDSHRNGTKFLVPFN----INVPGHWDWRDHGYVTEVKNQG 190
Query: 156 ECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTN-GNNGCGGGTMEKAFEYIIQN 214
CG CWAFSA A+EG K +L+ LSEQ LVDCS GNNGC GG M+ AFEYI N
Sbjct: 191 MCGSCWAFSATGALEGQHKRKIGSLVSLSEQNLVDCSRKYGNNGCNGGLMDYAFEYIKDN 250
Query: 215 QGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAVSMQ-PVSIGIAAY 273
G+ TE YPY+ + C +K A+ Y ++P GDE+ L AV+ Q P+S+ I A
Sbjct: 251 HGVDTEASYPYKGKEMKCHFNKKTVGAEDEGYVDLPEGDEEKLKIAVATQGPISVAIDAG 310
Query: 274 TTEFKSYKEGI-FNGVCGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKIL 331
F+ Y++G+ + C ++ LDH V +VG+GT E +YW++KNSWG WG+ GY++I
Sbjct: 311 HPSFQMYRKGVYYEPQCSSESLDHGVLVVGYGTDEIDGDYWIVKNSWGPGWGEKGYVRIA 370
Query: 332 RD-EGLCGIGTQSSYPLA 348
R+ + CGI +++SYP+
Sbjct: 371 RNRDNHCGIASKASYPIV 388
>gi|242074968|ref|XP_002447420.1| hypothetical protein SORBIDRAFT_06g000780 [Sorghum bicolor]
gi|241938603|gb|EES11748.1| hypothetical protein SORBIDRAFT_06g000780 [Sorghum bicolor]
Length = 381
Score = 239 bits (611), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 128/313 (40%), Positives = 189/313 (60%), Gaps = 21/313 (6%)
Query: 39 HEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSD 98
HE ++E WMA HGRSY EK RF+I+++N+++IE N++ +T+ G N+F+D
Sbjct: 51 HELLMMERFHAWMAAHGRSYPTAEEKLRRFQIYRDNVKFIEAINRDTTKTFTCGENQFTD 110
Query: 99 LTNDEFRALYTGYKMPS-PSHRSTTSSTFKYQNLSM-----------TDVPTSLDWRDKK 146
LT+ EF A YT S P S++ T + +++ TD+P +DWR++
Sbjct: 111 LTHQEFLARYTMASHDSVPLDLSSSVITTRAGDITESDSGTTMQVEDTDLPEHVDWREQD 170
Query: 147 AVTPIKDQ-QECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTME 205
AVTP+++Q Q C CW F++VA +E KI +L++LSEQQ+VDC+ CGGGT++
Sbjct: 171 AVTPVQNQLQGCHACWVFASVATIESANKIKNGDLLKLSEQQIVDCTA---EKCGGGTLQ 227
Query: 206 KAFEYIIQNQGIATEDEY-PYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAVSMQ 264
+AF+Y+ +N GIATE+EY Y A G+C A A +I Y+ +P +E AL + V Q
Sbjct: 228 EAFKYVQKNGGIATEEEYGAYTAKAGSCHAGNVRKAVRIQTYDFLPRENETALAEKVVQQ 287
Query: 265 PVSIGIAAYTTEFKSYKEGIFNG---VCGTQLDHAVTIVGFGTTED-GANYWLIKNSWGD 320
PV++ A+ F YK GI++G L+HA+ IVG+G E G YW+ KNSWG
Sbjct: 288 PVAVLFDAHDPAFAYYKGGIYSGGQPRTRYVLNHAMAIVGYGKNESTGQKYWIAKNSWGT 347
Query: 321 TWGDAGYMKILRD 333
WGD GY+ I +D
Sbjct: 348 GWGDGGYVYIAKD 360
>gi|342675481|gb|AEL31666.1| cathepsin L [Cynoglossus semilaevis]
Length = 336
Score = 239 bits (610), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 134/340 (39%), Positives = 194/340 (57%), Gaps = 17/340 (5%)
Query: 20 IIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIE 79
++ + L++ V S + + + + E W H + Y E E+ R I+++NL IE
Sbjct: 1 MLPLALLALGVSAVLSAPSLDARLSDHWELWKNWHSKKYH-EKEEGWRRMIWEKNLNKIE 59
Query: 80 KANKE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDV 136
N E G +Y+LG N F D+T++EFR + GY+ + R S F N +
Sbjct: 60 LHNLEHSMGKHSYRLGMNHFGDMTHEEFRQIMNGYQ--RKTERKAIGSLFMEPNFMVA-- 115
Query: 137 PTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCS-TNG 195
P+++DWR+K VTP+KDQ +CG CWAFS A+ZG L+ LSEQ LVDCS G
Sbjct: 116 PSAVDWREKGYVTPVKDQGQCGSCWAFSTTGALZGQNFRKMGKLVSLSEQNLVDCSRPEG 175
Query: 196 NNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQG-TCSAAQKAAAAKISNYEEVPSGDE 254
N GCGGG M++AF+Y+ NQG+ +ED YPY C K + + + ++PSG E
Sbjct: 176 NEGCGGGLMDQAFQYVKDNQGLDSEDSYPYLGTDDQPCHYDPKYNSVNDTGFVDIPSGKE 235
Query: 255 QALLKAV-SMQPVSIGIAAYTTEFKSYKEGI-FNGVCGT-QLDHAVTIVGF---GTTEDG 308
AL+KAV S+ PVS+ I A F+ Y+ GI + C + +LDH V VG+ G DG
Sbjct: 236 HALMKAVASVGPVSVAIDAGHESFQFYQSGIYYEKECSSEELDHGVLAVGYGFEGEDVDG 295
Query: 309 ANYWLIKNSWGDTWGDAGYMKILRD-EGLCGIGTQSSYPL 347
YW++KNSW + WGD GY+ + +D + CGI T +SYPL
Sbjct: 296 KKYWIVKNSWSEKWGDKGYIYMAKDRKNHCGIATAASYPL 335
>gi|428186189|gb|EKX55040.1| hypothetical protein GUITHDRAFT_63227 [Guillardia theta CCMP2712]
Length = 344
Score = 239 bits (610), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 130/299 (43%), Positives = 182/299 (60%), Gaps = 16/299 (5%)
Query: 63 EKEMRFKIFKENLEYIEKANKEGNR---TYKLGTNRFSDLTNDEFRALYTGYKMPSPSHR 119
E F++F++NL+ I K N+E N+ +Y++G N F+ LT +EF A Y GY + +
Sbjct: 47 ESTRAFEVFQKNLDMIMKHNEEYNQGLQSYEMGLNGFAHLTFEEFSAQYLGYG-GAEVEQ 105
Query: 120 STTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGAN 179
T K++ S +++P S+DWR+K AV +K+Q CG CWAFSAVAA+EG ++
Sbjct: 106 PKTRRAGKHERKSRSEIPASVDWREKGAVAEVKNQGACGSCWAFSAVAALEGAHFLNSGE 165
Query: 180 LIQLSEQQLVDCSTN-GNNGCGGGTMEKAFEYIIQNQGIA--TEDEYPYQAVQGTCSAAQ 236
LI LSEQQLVDCS GN+GC GG M+ AFEY + N G +E +YPY+ + G C +
Sbjct: 166 LISLSEQQLVDCSKKFGNHGCAGGYMDNAFEYWMNNTGHGDDSEKDYPYKGMDGKCKFSA 225
Query: 237 KAAAAKISNYEEVPSGDEQALLKAVS-MQPVSIGIAAYTTEFKSYKEGIFNGVCGT---Q 292
A IS Y +V G+E LL AV+ + PVS+ I A + Y G+FNGV GT
Sbjct: 226 DGVRATISGYNDVKQGNETDLLDAVANVGPVSVAIHA-GAALQFYLRGVFNGVAGTCFGP 284
Query: 293 LDHAVTIVGFGTTE----DGANYWLIKNSWGDTWGDAGYMKILRDEGLCGIGTQSSYPL 347
L+H VT VG+GT +YW+IKNSWG WG+ G+++ R + LCG+ +SYPL
Sbjct: 285 LNHGVTAVGYGTASLRFGRKMDYWIIKNSWGMGWGEKGFVRFARGKNLCGVANGASYPL 343
>gi|224460525|gb|ACN43674.1| cathepsin L [Paralichthys olivaceus]
Length = 334
Score = 239 bits (610), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 130/311 (41%), Positives = 179/311 (57%), Gaps = 19/311 (6%)
Query: 50 WMAQHGRSYKDELEKEMRFKIFKENLEYIEKAN---KEGNRTYKLGTNRFSDLTNDEFRA 106
W + GRSY E++ R +I+ N E + N +G+ TY+LG ++DL ++EF+
Sbjct: 29 WKLKFGRSYNSSSEEDKRMQIWLRNREIVMAHNAMADQGHSTYRLGMTFYADLEHEEFKQ 88
Query: 107 LYTG-----YKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCW 161
G + P S+ ++ NL P ++DWR VTP+K+Q CG CW
Sbjct: 89 TVFGVCLGSFNASKPRGGSSFLKMHRFYNL-----PQTIDWRQWGFVTPVKNQGSCGSCW 143
Query: 162 AFSAVAAVEGITKISGANLIQLSEQQLVDCSTN-GNNGCGGGTMEKAFEYIIQNQGIATE 220
+FS+ A+EG L+ LSEQ+LVDCS N GN GC GG M+ AF YI+ GI TE
Sbjct: 144 SFSSTGALEGQNFRKTGRLVSLSEQELVDCSGNYGNYGCNGGWMDNAFRYIVNKGGIHTE 203
Query: 221 DEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAV-SMQPVSIGIAAYTTEFKS 279
D YPY+ G C A A + Y ++PSG+E AL +AV + PVS+ I A F+
Sbjct: 204 DSYPYEGQVGQCRANYGEIGATCTGYYDIPSGNEHALKEAVATFGPVSVAIHASDQSFQL 263
Query: 280 YKEGIFNG--VCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRDE-GL 336
Y G++N GT LDHAV IVG+G TE G +YWL+KNSWG WGD GY+K+ R+
Sbjct: 264 YHSGVYNNPYCSGTALDHAVLIVGYG-TEYGQDYWLVKNSWGPAWGDQGYIKMSRNRYNQ 322
Query: 337 CGIGTQSSYPL 347
CGI + +S+PL
Sbjct: 323 CGIASAASFPL 333
>gi|340727787|ref|XP_003402217.1| PREDICTED: cathepsin L-like [Bombus terrestris]
Length = 343
Score = 239 bits (610), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 133/346 (38%), Positives = 197/346 (56%), Gaps = 22/346 (6%)
Query: 16 IPMFIIIILLVSCASQVVS--SRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKE 73
+ +F+ +I+ V +Q +S E + +M +H + YK+++E+ R KIF +
Sbjct: 1 MKLFLFLIVAVLATAQAISFFELVNQEWTTFKM------EHNKVYKNDVEERFRMKIFMD 54
Query: 74 NLEYIEKANKEGNR-----TYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKY 128
N I K N GN +YKL N++ D+ + EF G+ + +
Sbjct: 55 NKHKIAKHN--GNYEMKKVSYKLKMNKYGDMLHHEFVNTLNGFNKSINTQLRSERLPIAA 112
Query: 129 QNLSMTDV--PTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQ 186
+ +V P ++DWR+ AVTP+KDQ CG CW+FSA A+EG LI LSEQ
Sbjct: 113 SFIEPANVVLPKTVDWREHGAVTPVKDQGHCGSCWSFSATGALEGQHFRRTGILIPLSEQ 172
Query: 187 QLVDCSTN-GNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISN 245
L+DCS GNNGC GG M++AF+YI N+G+ TE YPY+A C + A+
Sbjct: 173 NLIDCSGKYGNNGCNGGLMDQAFQYIKDNKGLDTEVTYPYEAENDKCRYNAANSGARDVG 232
Query: 246 YEEVPSGDEQALLKAV-SMQPVSIGIAAYTTEFKSYKEGIF-NGVCGTQ-LDHAVTIVGF 302
Y ++P G+E+ L AV ++ PVS+ I A F+ Y EG++ C ++ LDH V VG+
Sbjct: 233 YVDIPQGNEKKLKAAVATIGPVSVAIDASHQSFQFYSEGVYYEPECSSENLDHGVLAVGY 292
Query: 303 GTTEDGANYWLIKNSWGDTWGDAGYMKILRDE-GLCGIGTQSSYPL 347
GT E+G +YWL+KNSWG+TWGD GY+K+ R++ CGI + +SYPL
Sbjct: 293 GTDENGQDYWLVKNSWGETWGDNGYIKMARNKLNHCGIASTASYPL 338
>gi|334332714|ref|XP_001367224.2| PREDICTED: cathepsin L1-like [Monodelphis domestica]
Length = 335
Score = 239 bits (610), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 129/334 (38%), Positives = 200/334 (59%), Gaps = 13/334 (3%)
Query: 23 ILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKAN 82
+ L S + ++ ++++ +W AQHG+SY E R +++NL+ IE+ N
Sbjct: 5 LCLASLCLGLAAAIPPFDRALDSQWHQWKAQHGKSYAAN-EDSWRRATWEKNLKMIERHN 63
Query: 83 KE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTS 139
+E G +++L N+F D++ +EF+ + GYK R+ S Y+ + +P S
Sbjct: 64 QEYSAGKHSFQLRMNKFGDMSTEEFKQVMNGYKSNGSQKRTKGS---LYRESLLAQLPES 120
Query: 140 LDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCS-TNGNNG 198
+DWR+K VTP+K+Q+ C CWAFSA A+EG L+ LS Q LVDCS GNNG
Sbjct: 121 VDWREKGYVTPVKEQRGCYSCWAFSAAGAIEGQWFRKTGKLVSLSVQNLVDCSIPEGNNG 180
Query: 199 CGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALL 258
C GG M AF+Y+ N GI TE+ YPY A C + + A ++ + ++PS DE+AL+
Sbjct: 181 CDGGLMGNAFQYVQDNGGIDTEECYPYVAQDNECKYQPECSGANVTGFVKIPSTDERALM 240
Query: 259 KAVS-MQPVSIGIAAYTTEFKSYKEGI-FNGVC-GTQLDHAVTIVGFGTT-EDGANYWLI 314
KAV+ + P+S+ I A FK Y+ G+ ++ C +QL+H V +VG+G+ ++G YW++
Sbjct: 241 KAVANVGPISVAIDAGNPSFKFYQSGVYYDPQCSSSQLNHGVLVVGYGSEGKNGRKYWIV 300
Query: 315 KNSWGDTWGDAGYMKILRDE-GLCGIGTQSSYPL 347
KNSWG+ WGD GY+ + +DE CGI T +SYP+
Sbjct: 301 KNSWGENWGDNGYVLMAKDEDNHCGIITDASYPI 334
>gi|2746723|gb|AAB94925.1| cathepsin S precursor [Mus musculus]
Length = 340
Score = 239 bits (610), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 130/306 (42%), Positives = 185/306 (60%), Gaps = 15/306 (4%)
Query: 50 WMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKE---GNRTYKLGTNRFSDLTNDEFRA 106
W H + YKD+ E+E+R I+++NL++I N E G TY++G N D+TN+E
Sbjct: 39 WKKTHEKEYKDKNEEEVRRLIWEKNLKFIMIHNLEYSMGMHTYQVGMNDMGDMTNEEISC 98
Query: 107 LYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAV 166
++ S ++ T +++ S +P ++DWR+K VT +K Q CG CWAFSAV
Sbjct: 99 RMGALRISRQSPKTVT-----FRSYSNRTLPDTVDWREKGCVTEVKYQGSCGACWAFSAV 153
Query: 167 AAVEGITKISGANLIQLSEQQLVDCSTN---GNNGCGGGTMEKAFEYIIQNQGIATEDEY 223
A+EG K+ LI LS Q LVDCS GN GCGGG M +AF+YII N GI + Y
Sbjct: 154 GALEGQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEAFQYIIDNGGIEADASY 213
Query: 224 PYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAVSMQ-PVSIGIAAYTTEFKSYKE 282
PY+A+ C K AA S Y ++P GDE AL +AV+ + PVS+GI A + F YK
Sbjct: 214 PYKAMDEKCHYNSKNRAATCSRYIQLPFGDEDALKEAVATKGPVSVGIDASHSSFFFYKS 273
Query: 283 GIFNG-VCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILR-DEGLCGIG 340
G+++ C ++H V +VG+GT DG +YWL+KNSWG +GD GY+++ R ++ CGI
Sbjct: 274 GVYDDPSCTGNVNHGVLVVGYGTL-DGKDYWLVKNSWGLNFGDQGYIRMARNNKNHCGIA 332
Query: 341 TQSSYP 346
+ SYP
Sbjct: 333 SYCSYP 338
>gi|225718114|gb|ACO14903.1| Cathepsin L precursor [Caligus clemensi]
Length = 336
Score = 239 bits (610), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 130/339 (38%), Positives = 194/339 (57%), Gaps = 17/339 (5%)
Query: 19 FIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYI 78
++ +L+++ + VS V+ E W HG++Y +E+++R KI+ EN I
Sbjct: 6 LLLSVLVIASTANAVSFFDV----VLSDWESWKLMHGKTYSSSIEEKLRLKIYMENSLKI 61
Query: 79 EKANKE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD 135
+ N E G Y + N + DL + EF A+ GY+ + + S + +N+ +
Sbjct: 62 SRHNSEALNGIHPYYMKMNHYGDLLHHEFVAMVNGYQYANKT-ASLGGTYIPNKNIQL-- 118
Query: 136 VPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTN- 194
PT +DWR++ AVTP+K+Q +CG CW+FSA A+EG LI LSEQ LVDCS
Sbjct: 119 -PTHVDWREEGAVTPVKNQGQCGSCWSFSATGALEGQDFRKTGKLISLSEQNLVDCSRKF 177
Query: 195 GNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDE 254
GNNGC GG M+ AF YI N+GI TE YPY+ + G C K + ++ G E
Sbjct: 178 GNNGCEGGLMDFAFTYIRDNKGIDTEASYPYEGIDGHCHYNPKNKGGSDIGFVDIKKGSE 237
Query: 255 QALLKAVS-MQPVSIGIAAYTTEFKSYKEGIF-NGVCGT-QLDHAVTIVGFGT-TEDGAN 310
+ L KAV+ + P+S+ I A F+ Y G++ C + +LDH V +VGFGT + G +
Sbjct: 238 KDLKKAVAGVGPISVAIDASHMSFQFYSHGVYVESKCSSEELDHGVLVVGFGTDSVSGED 297
Query: 311 YWLIKNSWGDTWGDAGYMKILRD-EGLCGIGTQSSYPLA 348
YWL+KNSW + WGD GY+K+ R+ E +CGI + +SYP+
Sbjct: 298 YWLVKNSWSEKWGDQGYIKMARNKENMCGIASSASYPVV 336
>gi|395856029|ref|XP_003800445.1| PREDICTED: cathepsin S [Otolemur garnettii]
Length = 331
Score = 239 bits (610), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 135/336 (40%), Positives = 194/336 (57%), Gaps = 19/336 (5%)
Query: 20 IIIILLVSCASQVVSSRSTHEQSVVEMH-EKWMAQHGRSYKDELEKEMRFKIFKENLEYI 78
++ LLV C++ H ++ H W +G+ Y ++ E+ R I+++NL+++
Sbjct: 4 LVWTLLVCCSAMA----QLHRDPALDHHWHLWKKTYGKQYTEKNEETERRLIWEKNLKFV 59
Query: 79 EKANKE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD 135
N E G +Y LG N D+T++E +L T K+P S R+ T + Q L
Sbjct: 60 MLHNLEHSMGMHSYDLGMNHLGDMTSEEVVSLMTCLKVPRQSQRNVTYKSSPNQKL---- 115
Query: 136 VPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNG 195
P SLDWR+K VT +K Q CG CWAFSAV A+E K++ L+ LS Q LVDCST
Sbjct: 116 -PDSLDWREKGCVTEVKYQGSCGSCWAFSAVGALEAQLKLTTGKLVSLSAQNLVDCSTEK 174
Query: 196 --NNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGD 253
N GC GG M +AF+YII N GI +E YPY+A+ C K AA S Y E+P G
Sbjct: 175 YRNEGCHGGFMTEAFQYIIDNNGIDSEASYPYKAMDEKCQYDSKNRAATCSKYTELPFGS 234
Query: 254 EQALLKAVSMQ-PVSIGIAAYTTEFKSYKEGI-FNGVCGTQLDHAVTIVGFGTTEDGANY 311
E+AL +AV+ + PVS+ I A + F Y+ G+ + C ++H V +VG+G +G +Y
Sbjct: 235 EEALKEAVASKGPVSVAIDASHSSFFLYRSGVYYEPACTQVVNHGVLVVGYGNL-NGNDY 293
Query: 312 WLIKNSWGDTWGDAGYMKILRD-EGLCGIGTQSSYP 346
WL+KNSWG +GD GY+++ R+ E CGI + SSYP
Sbjct: 294 WLVKNSWGLYFGDKGYIRMARNRENHCGIASYSSYP 329
>gi|350412176|ref|XP_003489564.1| PREDICTED: cathepsin L-like [Bombus impatiens]
Length = 343
Score = 239 bits (610), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 132/347 (38%), Positives = 198/347 (57%), Gaps = 22/347 (6%)
Query: 16 IPMFIIIILLVSCASQVVS--SRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKE 73
+ +F+++I+ + +Q +S E + +M +H + YK+++E+ R KIF +
Sbjct: 1 MKLFLLLIVAILATAQAISFFELVNQEWTTFKM------EHNKVYKNDIEERFRMKIFMD 54
Query: 74 NLEYIEKANKEGNR-----TYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKY 128
N I K N GN +YKL N++ D+ + EF G+ + +
Sbjct: 55 NKHKIAKHN--GNYEMKKVSYKLKMNKYGDMLHHEFVNTLNGFNKSINTQLRSERLPIGA 112
Query: 129 QNLSMTDV--PTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQ 186
+ +V P ++DWR+ AVTP+KDQ CG CW+FSA A+EG LI LSEQ
Sbjct: 113 SFIEPANVVLPKTVDWREHGAVTPVKDQGHCGSCWSFSATGALEGQHFRRTGILIPLSEQ 172
Query: 187 QLVDCSTN-GNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISN 245
L+DCS GNNGC GG M++AF+YI N+G+ TE YPY+A C + A+
Sbjct: 173 NLIDCSGKYGNNGCNGGLMDQAFQYIKDNKGLDTEVTYPYEAENDKCRYNAANSGARDVG 232
Query: 246 YEEVPSGDEQALLKAV-SMQPVSIGIAAYTTEFKSYKEGIF-NGVCGTQ-LDHAVTIVGF 302
Y ++P G+E+ L AV ++ PVS+ I A F+ Y EG++ C ++ LDH V VG+
Sbjct: 233 YVDIPQGNEKKLKAAVATIGPVSVAIDASHQSFQFYSEGVYYEPECSSENLDHGVLAVGY 292
Query: 303 GTTEDGANYWLIKNSWGDTWGDAGYMKILRDE-GLCGIGTQSSYPLA 348
GT E+G +YWL+KNSWG+TWGD GY+K+ R++ CGI + +SYPL
Sbjct: 293 GTDENGQDYWLVKNSWGETWGDNGYIKMARNKLNHCGIASTASYPLV 339
>gi|254674508|dbj|BAH86062.1| cysteine protease [Haemaphysalis longicornis]
Length = 333
Score = 239 bits (610), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 129/338 (38%), Positives = 192/338 (56%), Gaps = 15/338 (4%)
Query: 19 FIIIILLVSCASQVVSSRSTHEQSVVEMH-EKWMAQHGRSYKDELEKEMRFKIFKENLEY 77
+ + L C + +++ S Q ++ E + +QH ++Y +E+ +RFKIF EN
Sbjct: 1 MLRLAFLCGCVAAAIAASS---QEILRTEWEAFKSQHNKAYSSHVEELLRFKIFTENTLL 57
Query: 78 IEKANKE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT 134
+ K N + G +YKL N+F DL EF + GY+ ++ + NL+ +
Sbjct: 58 VAKHNAKYAKGLVSYKLAMNKFGDLLPHEFAKMVNGYR--GKQNKEQRPTFIPPANLNDS 115
Query: 135 DVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTN 194
+PT++DWR K AVTP+K+Q +CG CWAFS ++EG L+ LSEQ LVDCS +
Sbjct: 116 SLPTTVDWRKKGAVTPVKNQGQCGSCWAFSTTGSLEGQHFRKTGKLVSLSEQNLVDCSDD 175
Query: 195 -GNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGD 253
GN GC GG M+ F+YI N GI TE+ +PY A G C + A + + ++ G
Sbjct: 176 FGNQGCNGGLMDNGFQYIKANGGIDTEESHPYTAQDGDCKFKKADVGATDAGFVDIQQGS 235
Query: 254 EQALLKAV-SMQPVSIGIAAYTTEFKSYKEGIFN--GVCGTQLDHAVTIVGFGTTEDGAN 310
E L KAV ++ PVS+ I A F+ Y +G+++ +QLDH V VG+G ++G
Sbjct: 236 EDDLKKAVATVGPVSVAIDASHGSFQLYSQGVYDEPDCSSSQLDHGVLTVGYG-VKNGKK 294
Query: 311 YWLIKNSWGDTWGDAGYMKILRD-EGLCGIGTQSSYPL 347
YWL+KNSWG WGD GY+ + RD + CGI + +SYPL
Sbjct: 295 YWLVKNSWGGDWGDNGYILMSRDKDNQCGIASSASYPL 332
>gi|260516672|gb|ACX43963.1| cysteine protease 3, partial [Brachiaria hybrid cultivar]
Length = 319
Score = 239 bits (609), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 134/305 (43%), Positives = 181/305 (59%), Gaps = 15/305 (4%)
Query: 30 SQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTY 89
S + S E + +M +M Q+ ++Y E RF FK ++E I N N +Y
Sbjct: 25 SALXSEEVPSEVMLQDMFTAFMKQYSKAYS-HAEFSSRFNQFKASVETIRLHNTLANASY 83
Query: 90 KLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVT 149
+G N F+DL+ +EF+ Y G K R S +Q + PTS+DWR AVT
Sbjct: 84 TMGLNEFADLSFEEFKGKYFGCKHV---EREFARSNNLHQEVEA--APTSIDWRTSNAVT 138
Query: 150 PIKDQQECGCCWAFSAVAAVEGITKISGAN-LIQLSEQQLVDCSTN-GNNGCGGGTMEKA 207
PIKDQ +CG CWAFSA ++EG + G + L LSEQQLVDCST+ GN GC GG M+ A
Sbjct: 139 PIKDQGQCGSCWAFSATGSIEGAWVLQGKHTLTSLSEQQLVDCSTSYGNAGCNGGLMDYA 198
Query: 208 FEYIIQNQGIATEDEYPYQAVQGTCSAAQKAA--AAKISNYEEVPSGDEQALLKAV-SMQ 264
FEYII N+GI E YPY+ V G C QK+ IS +++V SGDE + L AV ++
Sbjct: 199 FEYIIANKGICAESAYPYKGVGGLC---QKSCTKVVTISGHKDVASGDEASSLNAVGTVG 255
Query: 265 PVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGD 324
PVS+ I A F+ Y G+F+G CG LDH V VG+GTT +YW++KNSWG +WG+
Sbjct: 256 PVSVAIEADQAGFQFYSSGVFSGTCGHNLDHGVLAVGYGTT-GSQDYWIVKNSWGTSWGE 314
Query: 325 AGYMK 329
+GY++
Sbjct: 315 SGYIR 319
>gi|37786769|gb|AAO64471.1| cathepsin L precursor [Fundulus heteroclitus]
Length = 337
Score = 239 bits (609), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 137/343 (39%), Positives = 201/343 (58%), Gaps = 20/343 (5%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQSVVEMH-EKWMAQHGRSYKDELEKEMRFKIFKENLE 76
M + +L + +S V+S+ S Q ++ H W + H ++Y + E+ R ++++NL+
Sbjct: 1 MLPVAVLTLCLSSAVLSAPSLDPQ--LDQHWNLWKSWHSKNYH-QREEGWRRLVWEKNLK 57
Query: 77 YIEKANKE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
IE N E G +Y+LG N F D+T++EF+ + GYK + R S F N
Sbjct: 58 KIELHNLEHSMGKHSYRLGMNHFGDMTHEEFKQIMNGYK--HKAERKFKGSLFLEPNF-- 113
Query: 134 TDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCS- 192
+ P S+DWR+K VTP+KDQ ECG CWAFS A+EG L+ LS Q LV+CS
Sbjct: 114 LEAPRSVDWREKGYVTPVKDQGECGSCWAFSTTGALEGQEFTRTGKLVSLSGQNLVECSR 173
Query: 193 TNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQG-TCSAAQKAAAAKISNYEEVPS 251
GN GC GG M++AF+Y+ NQG+ +ED YPY C K +AA + + ++PS
Sbjct: 174 PEGNEGCNGGLMDQAFQYVKDNQGLDSEDSYPYLGTDDQPCHYDPKFSAANDTGFVDIPS 233
Query: 252 GDEQALLKAV-SMQPVSIGIAAYTTEFKSYKEGI-FNGVCGT-QLDHAVTIVGF---GTT 305
G+E+AL+KAV S+ PVS+ I A F+ Y+ GI + C + +LDH V VG+ G
Sbjct: 234 GNERALMKAVASVGPVSVAIDAGHESFQFYQSGIYYEKECSSEELDHGVLAVGYGFQGED 293
Query: 306 EDGANYWLIKNSWGDTWGDAGYMKILRD-EGLCGIGTQSSYPL 347
DG +W++KNSW + WGD GY+ + +D + CGI T +SYPL
Sbjct: 294 VDGKKFWIVKNSWSENWGDKGYIYMAKDRKNHCGIATAASYPL 336
>gi|2961621|gb|AAC05781.1| cathepsin S [Mus musculus]
Length = 340
Score = 239 bits (609), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 130/306 (42%), Positives = 184/306 (60%), Gaps = 15/306 (4%)
Query: 50 WMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKE---GNRTYKLGTNRFSDLTNDEFRA 106
W H + YKD+ E+E+R I+++NL++I N E G TY++G N D+TN+E
Sbjct: 39 WKKTHEKEYKDKNEEEVRRLIWEKNLKFIMIHNLEYSMGMHTYQVGMNDMGDMTNEEISC 98
Query: 107 LYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAV 166
++ S ++ T +++ S +P ++DWR+K VT +K Q CG CWAFSAV
Sbjct: 99 RMGALRISRQSPKTVT-----FRSYSNRTLPDTVDWREKGCVTEVKYQGSCGACWAFSAV 153
Query: 167 AAVEGITKISGANLIQLSEQQLVDCSTN---GNNGCGGGTMEKAFEYIIQNQGIATEDEY 223
A+EG K+ LI LS Q LVDCS GN GCGGG M +AF+YII N GI + Y
Sbjct: 154 GALEGQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEAFQYIIDNGGIEADASY 213
Query: 224 PYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAVSMQ-PVSIGIAAYTTEFKSYKE 282
PY+A C K AA S Y ++P GDE AL +AV+ + PVS+GI A + F YK
Sbjct: 214 PYKATDEKCHYNSKNRAATCSRYIQLPFGDEDALKEAVATKGPVSVGIDASHSSFFFYKS 273
Query: 283 GIFNG-VCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILR-DEGLCGIG 340
G+++ C ++H V +VG+GT DG +YWL+KNSWG +GD GY+++ R ++ CGI
Sbjct: 274 GVYDDPSCTGNVNHGVLVVGYGTL-DGKDYWLVKNSWGLNFGDQGYIRMARNNKNHCGIA 332
Query: 341 TQSSYP 346
+ SYP
Sbjct: 333 SYCSYP 338
>gi|357437717|ref|XP_003589134.1| Cysteine proteinase [Medicago truncatula]
gi|355478182|gb|AES59385.1| Cysteine proteinase [Medicago truncatula]
Length = 299
Score = 239 bits (609), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 124/284 (43%), Positives = 180/284 (63%), Gaps = 22/284 (7%)
Query: 18 MFIIIILLVSCAS-------QVVSSRSTH---------EQSVVEMHEKWMAQHGRSYKDE 61
M ++I+L++S + ++S TH + V+ M+E+W+ +HG+SY
Sbjct: 10 MKLMIVLIISSFTVSLALDMSIISYDKTHPDKSTSKRTNKEVLTMYEEWLVKHGKSYNGL 69
Query: 62 LEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRST 121
EK+ RF+IFK+NL++I++ N N TY+LG RF+DLTN+E+R+ + G K+ P+ R
Sbjct: 70 GEKDKRFEIFKDNLKFIDEHNGL-NSTYRLGLTRFADLTNEEYRSKFLGTKI-DPNRRMK 127
Query: 122 T---SSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGA 178
S + +Y +P S+DWR + AV +KDQ CG CWAFSA+AAVEGI KI
Sbjct: 128 KLGGSKSNRYAPRVGDKLPESVDWRKEGAVVGVKDQASCGSCWAFSAIAAVEGINKIVTG 187
Query: 179 NLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQK- 237
+LI LSEQ+LVDC T+ N GC GG M+ AFE+II N GI +ED+YPY+AV G C +K
Sbjct: 188 DLISLSEQELVDCDTSYNEGCNGGLMDYAFEFIISNGGIDSEDDYPYKAVDGRCDQNRKN 247
Query: 238 AAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYK 281
A I +YE+VP+ DE AL KAV+ QP+++ + EF+ Y+
Sbjct: 248 AKVVTIDDYEDVPAYDELALQKAVANQPIAVAVEGGGREFQLYE 291
>gi|167526493|ref|XP_001747580.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163774026|gb|EDQ87660.1| predicted protein [Monosiga brevicollis MX1]
Length = 330
Score = 239 bits (609), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 133/324 (41%), Positives = 187/324 (57%), Gaps = 24/324 (7%)
Query: 30 SQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEK-EMRFKIFKENLEYIEKANKEGNRT 88
SQ + R+ H V++ + HG Y +L E F+ NL IE A+ GN +
Sbjct: 11 SQFLPRRNLH--LVLKGPTAFRRIHGVFYSSQLGLCEPAFRCHLANLRVIE-AHNAGNSS 67
Query: 89 YKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTS-LDWRDKKA 147
+ +G +F+DLT EF A + M + T + +T+ P +DWR K A
Sbjct: 68 FTMGITQFADLTAAEFSAYVKRFPM---------NVTRPRNEVWITEAPLQEVDWRQKNA 118
Query: 148 VTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTN-GNNGCGGGTMEK 206
VT IK+Q +CG CW+FS +VEG I+ L+ LSEQQL+DCST GN+GC GG M+
Sbjct: 119 VTEIKNQGQCGSCWSFSTTGSVEGAHAIATGKLVSLSEQQLMDCSTRYGNHGCNGGLMDY 178
Query: 207 AFEYIIQNQGIATEDEYPYQAVQGTCSA-AQKAAAAKISNYEEVPSGDEQALLKAVSMQP 265
AFEY+I N G+ TE++YPY A G C+ +K AA+I + VP E L AVS+ P
Sbjct: 179 AFEYVIANGGLDTEEDYPYTAEDGKCNTEKEKKHAAEIHGFRNVPKEHEDQLAAAVSIGP 238
Query: 266 VSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDA 325
VS+ I A F+ Y G+F+G CGT LDH V +VG+ +YW++KNSWG +WG+
Sbjct: 239 VSVAIEADQAGFQHYTSGVFDGKCGTSLDHGVLVVGYSD-----DYWIVKNSWGKSWGEE 293
Query: 326 GYMKILR---DEGLCGIGTQSSYP 346
GY+++ R +G+CGI Q+SYP
Sbjct: 294 GYIRLKRGVDKKGMCGITMQASYP 317
>gi|328776427|ref|XP_625135.3| PREDICTED: cathepsin L-like [Apis mellifera]
Length = 351
Score = 238 bits (608), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 131/328 (39%), Positives = 193/328 (58%), Gaps = 20/328 (6%)
Query: 38 THEQSVVEM-HEKWMA---QHGRSYKDELEKEMRFKIFKENLEYIEKANKE---GNRTYK 90
TH S E+ +++WM +H + YK ++E+ R KIF +N I K N +YK
Sbjct: 21 THAVSFFELVNQEWMTFKMEHKKVYKSDVEERFRMKIFMDNKHKIAKHNSNYEMKKVSYK 80
Query: 91 LGTNRFSDLTNDEFRALYTGYKMP----SPSHRSTTSSTF-KYQNLSMTDVPTSLDWRDK 145
L N++ D+ + EF + G+ S R ++F + N+ + P +DWR +
Sbjct: 81 LKMNKYGDMLHHEFVNILNGFNKSINTQLRSERLPVGASFIEPANVVL---PKKVDWRKE 137
Query: 146 KAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTN-GNNGCGGGTM 204
AVTP+KDQ CG CW+FSA A+EG L+ LSEQ L+DCS GNNGC GG M
Sbjct: 138 GAVTPVKDQGHCGSCWSFSATGALEGQHFRRTGVLVSLSEQNLIDCSGKYGNNGCNGGLM 197
Query: 205 EKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAV-SM 263
++AF+YI N+G+ TE YPY+A C + A Y ++P+GDE+ L AV ++
Sbjct: 198 DQAFQYIKDNKGLDTEASYPYEAENDKCRYNPANSGAIDVGYIDIPTGDEKLLKAAVATI 257
Query: 264 QPVSIGIAAYTTEFKSYKEGIF-NGVCGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGDT 321
PVS+ I A F+ Y EG++ C ++ LDH V ++G+GT E+G +YWL+KNSWG+T
Sbjct: 258 GPVSVAIDASHQSFQFYSEGVYYEPECSSEELDHGVLVIGYGTNENGQDYWLVKNSWGET 317
Query: 322 WGDAGYMKILRDE-GLCGIGTQSSYPLA 348
WG+ GY+K+ R++ CGI + +SYPL
Sbjct: 318 WGNNGYIKMARNKLNHCGIASSASYPLV 345
>gi|116242316|gb|ABJ89815.1| putative cathepsin L preprotein [Clonorchis sinensis]
Length = 371
Score = 238 bits (608), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 134/315 (42%), Positives = 189/315 (60%), Gaps = 23/315 (7%)
Query: 46 MHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKAN---KEGNRTYKLGTNRFSDLTND 102
M + ++ ++ R Y +LE+E R IF EN I + N ++G +Y +G N FSD TN
Sbjct: 66 MWQAFLEKYKRVYDSKLEEERRLGIFTENFIRISEHNLLFEKGEVSYSMGINAFSDKTNS 125
Query: 103 EFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWA 162
E L G++ S + RS + +Y P +DWR K AVTP+K+Q +CG CWA
Sbjct: 126 ELDVL-RGFRHSSKASRSGS----QYIPFDAAP-PAEVDWRTKGAVTPVKNQGDCGSCWA 179
Query: 163 FSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDE 222
FSA +EG ++ L+ LSEQQLVDCS++ N+GC GG M+ AFEY+ +++GI TE
Sbjct: 180 FSATGGIEGQHYLATGKLVSLSEQQLVDCSSS-NDGCDGGLMDLAFEYVKEHKGIDTEVH 238
Query: 223 YPYQAVQGT------CSAAQKAAAAKISNYEEVPSGDEQALLKAVSMQ-PVSIGIAAYTT 275
YPY V G CS K AA ++ Y ++P G E L +AV P+S+GI A
Sbjct: 239 YPY--VSGNTGYARQCSFDPKYAAVNVTGYVDIPEGQELLLQQAVGFHGPISVGINAGLP 296
Query: 276 EFKSYKEGIF-NGVCGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD 333
F +Y+ GI+ + C LDH V +VG+G ++G YWLIKNSWG+ WG+ GY++ILR+
Sbjct: 297 SFMAYESGIYSDHRCNPHDLDHGVLVVGYG-VDNGVPYWLIKNSWGEDWGENGYVRILRN 355
Query: 334 E-GLCGIGTQSSYPL 347
LCG+ T +SYPL
Sbjct: 356 HNNLCGVATMASYPL 370
>gi|32396018|gb|AAP41846.1| cysteine protease [Anthurium andraeanum]
Length = 502
Score = 238 bits (608), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 131/317 (41%), Positives = 183/317 (57%), Gaps = 18/317 (5%)
Query: 45 EMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYK----LGTNRFSDLT 100
E+ E+WM +H + Y EK R+ F NL ++ K N EG R +G N F+DL+
Sbjct: 49 ELFERWMEKHRKVYAHPGEKARRYANFLSNLAFVRKRNAEGRRAPSSGQGVGMNVFADLS 108
Query: 101 NDEFRALYTG--YKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECG 158
N+EFR +Y+ + + R + + ++ D P SLDWR + AVT +K+Q +CG
Sbjct: 109 NEEFREVYSSRVLRKKAAEGRGARRRAGEGRVVAGCDAPASLDWRKRGAVTAVKNQGDCG 168
Query: 159 CCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIA 218
CWAFS+ A+EGI I+ LI LSEQ+LVDC T N GC GG M+ AFE++I N GI
Sbjct: 169 SCWAFSSTGAMEGINAITTGELISLSEQELVDCDTT-NEGCDGGYMDYAFEWVINNGGID 227
Query: 219 TEDEYPY--QAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTE 276
+E YPY QA + ++ I YE+V + E ALL A QPVS+GI + +
Sbjct: 228 SEANYPYTGQADSVCNTTKEEIKVVSIDGYEDVAT-SESALLCAAVQQPVSVGIDGSSLD 286
Query: 277 FKSYKEGIFNGVCG---TQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD 333
F+ Y GI++G C +DHAV +VG+G + G +YW++KNSWG WG GY+ I R+
Sbjct: 287 FQLYAGGIYDGDCSGNPDDIDHAVLVVGYG-QQGGTDYWIVKNSWGTDWGMQGYIYIRRN 345
Query: 334 EGL----CGIGTQSSYP 346
GL C I +SYP
Sbjct: 346 TGLPYGVCAIDAMASYP 362
>gi|391336140|ref|XP_003742440.1| PREDICTED: cathepsin L-like [Metaseiulus occidentalis]
Length = 330
Score = 238 bits (608), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 127/302 (42%), Positives = 184/302 (60%), Gaps = 13/302 (4%)
Query: 52 AQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRT--YKLGTNRFSDLTNDEFRALYT 109
+ H +SY+D E+ +R IF++NL IE+ N+ + LG N F+D+TN EF +
Sbjct: 33 STHLKSYRDGQEELIRRFIFEDNLHTIEEFNRVNASLAGFTLGVNEFADMTNTEFSNMLL 92
Query: 110 GYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAV 169
G R+ + +++ + D+P +DW K VT +K+Q +CG CWAFS ++
Sbjct: 93 GL-----GGRNKIAGDSVFESSHVQDLPAEVDWTQKGYVTEVKNQGQCGSCWAFSTTGSL 147
Query: 170 EGITKISGANLIQLSEQQLVDCSTN-GNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAV 228
EG L+ LSEQ LVDCST+ GN GC GG M++AF YI +N GI TE YPY
Sbjct: 148 EGQVFKKTGKLVSLSEQNLVDCSTSEGNQGCNGGLMDQAFTYIKKNGGIDTEAAYPYTGS 207
Query: 229 QGTCSAAQKAAAAKISNYEEVPSGDEQALLKAV-SMQPVSIGIAAYTTEFKSYKEGIFNG 287
GTC + A +S + +V SGDE AL +AV ++ P+S+ I A + F+ Y+ G++N
Sbjct: 208 DGTCRFLENKVGATVSGFVDVKSGDENALKEAVATVGPISVAIDASSIFFQFYRGGVYNP 267
Query: 288 --VCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD-EGLCGIGTQSS 344
T+LDH V +VG+G TE G +YWL+KNSWG +WG GY+K++R+ + CGI TQ+S
Sbjct: 268 WFCSSTELDHGVLVVGYG-TEGGKDYWLVKNSWGSSWGLKGYIKMVRNKKNRCGIATQAS 326
Query: 345 YP 346
YP
Sbjct: 327 YP 328
>gi|6630972|gb|AAF19630.1|AF194426_1 cysteine proteinase precursor [Myxine glutinosa]
Length = 324
Score = 238 bits (608), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 127/308 (41%), Positives = 188/308 (61%), Gaps = 12/308 (3%)
Query: 48 EKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKAN---KEGNRTYKLGTNRFSDLTNDEF 104
E W ++G+SY E+ +R ++++ NL+ +++ N +G Y+LG N ++DL N+EF
Sbjct: 20 ESWKGKYGKSYLGRGEEVLRKRVWESNLQIVQQHNVLADQGQANYRLGMNTYADLYNEEF 79
Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFS 164
AL + +S+T TFK L +P+S+DWR++ VTP+KDQ +CG CW+FS
Sbjct: 80 MALKGSSGILQAKDQSSTQ-TFK--PLVGVTLPSSVDWRNQGYVTPVKDQGQCGSCWSFS 136
Query: 165 AVAAVEGITKISGANLIQLSEQQLVDCS-TNGNNGCGGGTMEKAFEYIIQNQGIATEDEY 223
A ++EG L+ LSEQQLVDCS + GN GC GG ME A++YI G+ E Y
Sbjct: 137 ATGSLEGQHFAKTGTLVSLSEQQLVDCSWSYGNYGCSGGLMESAYDYIRDAGGVQLESAY 196
Query: 224 PYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAV-SMQPVSIGIAAYTTEFKSYKE 282
PY A G C Q A A + + +PSGDEQ+L++AV ++ PV++ I A +F+ Y+
Sbjct: 197 PYTAQNGRCHFDQSKAVATCTGHVAIPSGDEQSLMQAVGTVGPVAVAIDASGYDFQLYES 256
Query: 283 GIFN--GVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRDE-GLCGI 339
G+++ + LDH V G+G TE G +YWL+KNSWG WG GY+K+ R++ CGI
Sbjct: 257 GVYDRSRCSSSSLDHGVLAAGYG-TEGGNDYWLVKNSWGPGWGAQGYIKMSRNKSNQCGI 315
Query: 340 GTQSSYPL 347
T + YPL
Sbjct: 316 ATMACYPL 323
>gi|21425246|emb|CAD33266.1| cathepsin L [Aphis gossypii]
Length = 341
Score = 238 bits (608), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 135/342 (39%), Positives = 196/342 (57%), Gaps = 17/342 (4%)
Query: 20 IIIILLVSCASQVVSSRSTHEQSVVEMHEKW---MAQHGRSYKDELEKEMRFKIFKENLE 76
+I++ LV+ A VSS + +E V+E E+W Q + Y+D E+ R K++ +N
Sbjct: 4 VIVLGLVAFAISTVSSINLNE--VIE--EEWSLFKIQFKKLYEDIKEETFRKKVYLDNKL 59
Query: 77 YIEKANK---EGNRTYKLGTNRFSDLTNDEFRALYTGYK--MPSPSHRSTTSSTFKYQNL 131
I NK G TY L N F DL E+ + G+K + T +
Sbjct: 60 KIAGHNKLYESGEETYALEMNHFGDLMQHEYTKMMNGFKPSLAGGDRNFTNDEAVTFLKS 119
Query: 132 SMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDC 191
+P S+DWR K VTP+K+Q +CG CW+FSA ++EG L+ LSEQ L+DC
Sbjct: 120 ENVVIPKSVDWRKKGYVTPVKNQGQCGSCWSFSATGSLEGQHFRKTGVLVSLSEQNLIDC 179
Query: 192 STN-GNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVP 250
S GNNGC GG M+ AF+YI N+G+ TE YPY+A C + + A + ++P
Sbjct: 180 SRKYGNNGCEGGLMDLAFKYIKSNKGLDTEKSYPYEAEDDKCRYNPENSGATDKGFVDIP 239
Query: 251 SGDEQALLKAV-SMQPVSIGIAAYTTEFKSYKEGIF-NGVC-GTQLDHAVTIVGFGTTED 307
GDE AL+ A+ ++ PVSI I A + +F+ YK+G+F N C T+LDH V VGFG+ +
Sbjct: 240 EGDEDALMHALATVGPVSIAIDASSEKFQFYKKGVFYNPRCSSTELDHGVLAVGFGSDKK 299
Query: 308 GANYWLIKNSWGDTWGDAGYMKILRD-EGLCGIGTQSSYPLA 348
G +YW++KNSWG TWGD GY+ + R+ + CG+ + +SYPL
Sbjct: 300 GGDYWIVKNSWGKTWGDEGYIMMARNKKNNCGVASSASYPLV 341
>gi|226443040|ref|NP_001140018.1| Cathepsin L1 precursor [Salmo salar]
gi|221221188|gb|ACM09255.1| Cathepsin L1 precursor [Salmo salar]
Length = 338
Score = 238 bits (608), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 138/343 (40%), Positives = 193/343 (56%), Gaps = 24/343 (6%)
Query: 20 IIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIE 79
+ + +LV C S V ++ Q H W H +SY E E+ R ++++NL+ IE
Sbjct: 4 LYLAVLVLCVSAVCAAPRFDSQLEDHWH-LWKNWHSKSYH-ESEEGWRRMVWEKNLKKIE 61
Query: 80 KANKE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFK---YQNLSM 133
N E G +Y+LG N F D+TN+EFR GYK TT FK + +
Sbjct: 62 MHNLEHTMGKHSYRLGMNHFGDMTNEEFRQTMNGYK-------QTTERKFKGSLFMEPNY 114
Query: 134 TDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCS- 192
P ++DWR+K VTP+KDQ CG CWAFS A+EG L+ LSEQ LVDCS
Sbjct: 115 LQAPKAVDWREKGYVTPVKDQGSCGSCWAFSTTGAMEGQQFRKTGKLVSLSEQNLVDCSR 174
Query: 193 TNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAV-QGTCSAAQKAAAAKISNYEEVPS 251
GN GC GG M++AF+YI N G+ TE+ YPY + C + + A + + ++PS
Sbjct: 175 PEGNEGCNGGLMDQAFQYIQDNAGLDTEESYPYVGTDEDPCHYKPEFSGANETGFVDIPS 234
Query: 252 GDEQALLKAV-SMQPVSIGIAAYTTEFKSYKEGI-FNGVCGT-QLDHAVTIVGF---GTT 305
G E A++KAV ++ PVS+ I A F+ Y+ GI + C + +LDH V +VG+ G
Sbjct: 235 GKEHAMMKAVAAVGPVSVAIDAGHESFQFYESGIYYEKECSSEELDHGVLVVGYGFEGED 294
Query: 306 EDGANYWLIKNSWGDTWGDAGYMKILRD-EGLCGIGTQSSYPL 347
DG YW++KNSW + WGD GY+ + +D + CGI T SSYPL
Sbjct: 295 VDGKKYWIVKNSWSEKWGDKGYIYMAKDRKNHCGIATASSYPL 337
>gi|110349475|gb|ABG73218.1| cathepsin L 2 precursor [Diaprepes abbreviatus]
Length = 348
Score = 238 bits (608), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 141/352 (40%), Positives = 193/352 (54%), Gaps = 29/352 (8%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQSVV-EMHEKWMAQHGRSYKDELEKEMRFKIFKENLE 76
M+ +++LL A+ V S + Q +V E E++ +HG+ Y+ E E E R +F ENL
Sbjct: 1 MYSLVVLL---ATLVAYSHAISYQVLVQEQWEQFKLEHGKVYESESENEYRQSVFMENLF 57
Query: 77 YIEKANK---EGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKY----- 128
I + NK G +Y++ N DLT DEF +YT MP S + +
Sbjct: 58 QINEHNKLYEMGLSSYQMAMNHLGDLTKDEFMRIYT-VNMPQLPQSENLSDSEPWLDLPQ 116
Query: 129 -----------QNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISG 177
NL D+PT +DWR K AVTP+K+Q+ CG CW+FSA A+E
Sbjct: 117 DLQGFVTYALPTNLDEVDLPTDIDWRQKGAVTPVKNQRNCGSCWSFSATGALEAQWFKKT 176
Query: 178 ANLIQLSEQQLVDCSTN-GNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQ 236
LI LSEQQLVDCS GN+GC GG M AF YI +N GI TE YPY A G C+
Sbjct: 177 NKLISLSEQQLVDCSGRYGNHGCHGGWMHWAFGYIKENGGIDTEQSYPYTAKDGRCAYKP 236
Query: 237 KAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFN-GVCGTQLDH 295
AA +S VP G+ Q K S+ P+SI A + +F+ Y G+++ CG L+H
Sbjct: 237 GNKAATVSQVIMVPRGENQLAAKVSSVGPISIA-AEVSHKFQFYHSGVYDEPQCGHSLNH 295
Query: 296 AVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRDE-GLCGIGTQSSYP 346
A+ VG+G+ G N+WL+KNSWG WGD GY+++ +D+ CGI +SYP
Sbjct: 296 AMLAVGYGSM-GGKNFWLVKNSWGTGWGDQGYIRMAKDKNNQCGIALMASYP 346
>gi|355681664|gb|AER96818.1| cathepsin S [Mustela putorius furo]
Length = 338
Score = 238 bits (607), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 129/305 (42%), Positives = 185/305 (60%), Gaps = 14/305 (4%)
Query: 50 WMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKE---GNRTYKLGTNRFSDLTNDEFRA 106
W +GR Y+++ E+ R I+++NL+ + N E G +Y LG N +D+T++E +
Sbjct: 39 WKKTYGRQYQEKNEEVARRLIWEKNLKSVMLHNLEYSMGMHSYDLGMNHLADMTSEEVSS 98
Query: 107 LYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAV 166
L + ++PS + T Y++ S +P S+DWR+K VT +K Q CG CWAFSAV
Sbjct: 99 LMSSLRVPSQWQANVT-----YKSNSNQKLPDSVDWREKGCVTEVKYQGACGACWAFSAV 153
Query: 167 AAVEGITKISGANLIQLSEQQLVDCSTN--GNNGCGGGTMEKAFEYIIQNQGIATEDEYP 224
A+E K+ NL+ LS Q LVDCST GN GC GG M KAF+YII N GI +E YP
Sbjct: 154 GALEAQLKLKTGNLVSLSAQNLVDCSTERYGNKGCNGGFMTKAFQYIIDNNGIDSEVSYP 213
Query: 225 YQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAVSMQ-PVSIGIAAYTTEFKSYKEG 283
Y+A+ G C K AA S Y E+P G E AL +AV+ + PVS+ I A + F YK G
Sbjct: 214 YKAMDGNCRYDSKHRAATCSKYTELPFGSEDALKEAVANKGPVSVAIDAKHSSFFLYKSG 273
Query: 284 I-FNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRDEG-LCGIGT 341
+ ++ C ++H V +VG+G +G +YWL+KNSWG +G+ GY+++ R+ G CGI +
Sbjct: 274 VYYDPSCTQNVNHGVLVVGYGNL-NGRDYWLVKNSWGLNFGEQGYIRMARNSGNHCGIAS 332
Query: 342 QSSYP 346
SYP
Sbjct: 333 YPSYP 337
>gi|354622947|ref|NP_001002938.2| cathepsin S precursor [Canis lupus familiaris]
Length = 339
Score = 238 bits (607), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 138/348 (39%), Positives = 196/348 (56%), Gaps = 24/348 (6%)
Query: 8 SGSFKINTIPMFIIIILLVSCASQVVSSRSTHEQSVVEMH-EKWMAQHGRSYKDELEKEM 66
+GSF M ++ LL C+ V H+ ++ H W + + YK+E E+
Sbjct: 5 AGSF------MKWLVGLLPLCSYAVAQ---VHKDPTLDHHWNLWKKTYSKQYKEENEEVA 55
Query: 67 RFKIFKENLEYIEKANKE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTS 123
R I+++NL+++ N E G +Y LG N D+T +E +L ++PS R+ T
Sbjct: 56 RRLIWEKNLKFVMLHNLEHSMGMHSYDLGMNHLGDMTGEEVISLMGSLRVPSQWQRNVT- 114
Query: 124 STFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQL 183
Y++ S +P S+DWR+K VT +K Q CG CWAFSAV A+E K+ L+ L
Sbjct: 115 ----YRSNSNQKLPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSL 170
Query: 184 SEQQLVDCSTN--GNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAA 241
S Q LVDCST GN GC GG M AF+YII N GI +E YPY+A+ G C K AA
Sbjct: 171 SAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNNGIDSEASYPYKAMNGKCRYDSKKRAA 230
Query: 242 KISNYEEVPSGDEQALLKAVSMQ-PVSIGIAAYTTEFKSYKEGI-FNGVCGTQLDHAVTI 299
S Y E+P G E AL +AV+ + PVS+ I A F Y+ G+ + C ++H V +
Sbjct: 231 TCSKYTELPFGSEDALKEAVANKGPVSVAIDASHYSFFLYRSGVYYEPSCTQNVNHGVLV 290
Query: 300 VGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRDEG-LCGIGTQSSYP 346
VG+G +G +YWL+KNSWG +GD GY+++ R+ G CGI + SYP
Sbjct: 291 VGYGNL-NGKDYWLVKNSWGLNFGDQGYIRMARNSGNHCGIASYPSYP 337
>gi|157278115|ref|NP_001098156.1| cathepsin L precursor [Oryzias latipes]
gi|50251128|dbj|BAD27581.1| cathepsin L [Oryzias latipes]
Length = 336
Score = 238 bits (607), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 131/310 (42%), Positives = 183/310 (59%), Gaps = 17/310 (5%)
Query: 50 WMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKE---GNRTYKLGTNRFSDLTNDEFRA 106
W H ++Y E E+ R ++++NL IE N E G +Y+LG N F D+T++EFR
Sbjct: 31 WKGWHSKNYH-EKEEGWRRLVWEKNLRKIELHNLEHSMGKHSYRLGMNHFGDMTHEEFRQ 89
Query: 107 LYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAV 166
+ GYK R + S F N + P ++DWRDK VTP+KDQ +CG CWAFS
Sbjct: 90 IMNGYK--RREQRKYSGSLFMEPNF--LEAPRAVDWRDKGYVTPVKDQGQCGSCWAFSTT 145
Query: 167 AAVEGITKISGANLIQLSEQQLVDCS-TNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPY 225
A+EG L+ LSEQ LVDCS GN GC GG M++AF+Y+ NQG+ +ED YPY
Sbjct: 146 GALEGQQFRKTGKLVSLSEQNLVDCSRPEGNEGCNGGLMDQAFQYVKDNQGLDSEDFYPY 205
Query: 226 QAVQG-TCSAAQKAAAAKISNYEEVPSGDEQALLKAV-SMQPVSIGIAAYTTEFKSYKEG 283
+ C + +A + + ++PSG E+AL+KAV S+ PVS+ I A F+ Y+ G
Sbjct: 206 KGTDDQPCQYNAQYSAVNDTGFVDIPSGKERALMKAVASVGPVSVAIDAGHESFQFYQSG 265
Query: 284 I-FNGVCGT-QLDHAVTIVGF---GTTEDGANYWLIKNSWGDTWGDAGYMKILRD-EGLC 337
I F C + +LDH V +VG+ G DG YW++KNSW + WGD G++ + +D C
Sbjct: 266 IYFEKECSSDELDHGVLVVGYGFEGEDVDGKKYWIVKNSWSEKWGDKGFIYMAKDRHNHC 325
Query: 338 GIGTQSSYPL 347
GI T +SYPL
Sbjct: 326 GIATAASYPL 335
>gi|350583407|ref|XP_003481511.1| PREDICTED: cathepsin S [Sus scrofa]
Length = 331
Score = 238 bits (607), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 134/338 (39%), Positives = 195/338 (57%), Gaps = 18/338 (5%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQSVVEMH-EKWMAQHGRSYKDELEKEMRFKIFKENLE 76
M ++ +L+ C+S + H ++ H + W +G+ YK++ E+ R I+++NL+
Sbjct: 1 MKCLVWVLLLCSSAMAQ---LHRDPTLDRHWDLWKKTYGKQYKEKNEEVARRLIWEKNLK 57
Query: 77 YIEKANKE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
+ N E G +Y LG N D+T++E +L + ++PS R+ T + Q L
Sbjct: 58 TVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVISLMSCVRVPSQWPRNVTYKSNPNQKL-- 115
Query: 134 TDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCST 193
P S+DWR+K VT +K Q CG CWAFSAV A+E K+ L+ LS Q LVDCST
Sbjct: 116 ---PDSMDWREKGCVTEVKYQGSCGSCWAFSAVGALEAQVKMKTGRLVSLSAQNLVDCST 172
Query: 194 NG--NNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPS 251
N GC GG M +AF+YII N GI +E YPY+AV G C K AA S Y E+P
Sbjct: 173 EKYRNKGCNGGFMTEAFQYIIDNNGIDSEASYPYKAVDGKCKYDSKNRAATCSRYTELPF 232
Query: 252 GDEQALLKAVSMQ-PVSIGIAAYTTEFKSYKEGI-FNGVCGTQLDHAVTIVGFGTTEDGA 309
DE AL +AV+ + PVS+ I A + F Y+ G+ ++ C ++H V +VG+G +G
Sbjct: 233 ADEYALKEAVANKGPVSVAIDAKHSSFFFYRSGVYYDPSCTQNVNHGVLVVGYGNL-NGK 291
Query: 310 NYWLIKNSWGDTWGDAGYMKILRD-EGLCGIGTQSSYP 346
+YWL+KNSWG +GD GY+++ R+ E CGI SYP
Sbjct: 292 DYWLVKNSWGLNFGDGGYIRMARNSENHCGIANYPSYP 329
>gi|62510452|sp|Q8HY81.1|CATS_CANFA RecName: Full=Cathepsin S; Flags: Precursor
gi|27497538|gb|AAO13009.1| cathepsin S preproprotein [Canis lupus familiaris]
Length = 331
Score = 238 bits (607), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 135/338 (39%), Positives = 192/338 (56%), Gaps = 18/338 (5%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQSVVEMH-EKWMAQHGRSYKDELEKEMRFKIFKENLE 76
M ++ LL C+ V H+ ++ H W + + YK+E E+ R I+++NL+
Sbjct: 1 MKWLVGLLPLCSYAVAQ---VHKDPTLDHHWNLWKKTYSKQYKEENEEVARRLIWEKNLK 57
Query: 77 YIEKANKE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
++ N E G +Y LG N D+T +E +L ++PS R+ T Y++ S
Sbjct: 58 FVMLHNLEHSMGMHSYDLGMNHLGDMTGEEVISLMGSLRVPSQWQRNVT-----YRSNSN 112
Query: 134 TDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCST 193
+P S+DWR+K VT +K Q CG CWAFSAV A+E K+ L+ LS Q LVDCST
Sbjct: 113 QKLPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCST 172
Query: 194 N--GNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPS 251
GN GC GG M AF+YII N GI +E YPY+A+ G C K AA S Y E+P
Sbjct: 173 EKYGNKGCNGGFMTTAFQYIIDNNGIDSEASYPYKAMNGKCRYDSKKRAATCSKYTELPF 232
Query: 252 GDEQALLKAVSMQ-PVSIGIAAYTTEFKSYKEGI-FNGVCGTQLDHAVTIVGFGTTEDGA 309
G E AL +AV+ + PVS+ I A F Y+ G+ + C ++H V +VG+G +G
Sbjct: 233 GSEDALKEAVANKGPVSVAIDASHYSFFLYRSGVYYEPSCTQNVNHGVLVVGYGNL-NGK 291
Query: 310 NYWLIKNSWGDTWGDAGYMKILRDEG-LCGIGTQSSYP 346
+YWL+KNSWG +GD GY+++ R+ G CGI + SYP
Sbjct: 292 DYWLVKNSWGLNFGDQGYIRMARNSGNHCGIASYPSYP 329
>gi|330793420|ref|XP_003284782.1| hypothetical protein DICPUDRAFT_28222 [Dictyostelium purpureum]
gi|325085276|gb|EGC38686.1| hypothetical protein DICPUDRAFT_28222 [Dictyostelium purpureum]
Length = 347
Score = 238 bits (607), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 140/341 (41%), Positives = 188/341 (55%), Gaps = 37/341 (10%)
Query: 35 SRSTHEQSVVEMHEK-----WMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTY 89
S +T +Q E+ + WM Q+ R Y E E R+ IFK N++Y+++ N +G+ T
Sbjct: 13 SFATAKQQFSELQYRNAFTNWMIQNQRHYASE-EFAARYNIFKANMDYVQEWNSKGSETV 71
Query: 90 KLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVT 149
LG N F+D+TN EFR++Y G S +T + S+DWR K AVT
Sbjct: 72 -LGLNTFADITNQEFRSIYLGTPFDGSSIINTETEKI------FAAPAASIDWRTKGAVT 124
Query: 150 PIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCS-TNGNNGCGGGTMEKAF 208
PIK+QQ+CG CW+FS + EG T I+ NL LSEQ L+DCS + GNNGC GG M AF
Sbjct: 125 PIKNQQQCGGCWSFSTTGSTEGATAIAKGNLPSLSEQNLIDCSGSYGNNGCNGGLMTLAF 184
Query: 209 EYIIQNQGIATEDEYPYQAVQG-TCSAAQKAAAAKISNYEEVPSGDEQALLKAVSMQPVS 267
EYII N+GI TE YPY A G TC A +S+Y V SG E +L A ++ PVS
Sbjct: 185 EYIINNKGIDTESSYPYTAKDGKTCKYNPANIGATLSSYSNVTSGSEPSLESAANIGPVS 244
Query: 268 IGIAAYTTEFKSYKEGI-FNGVCG-TQLDHAVTIVGFGTTE----------------DGA 309
+ I A F+ Y GI + C T LDH V +VG+ + +GA
Sbjct: 245 VAIDASHNSFQLYSSGIYYEPACSTTSLDHGVLVVGYASGSGSGSGSGSGSGSGLAVEGA 304
Query: 310 ---NYWLIKNSWGDTWGDAGYMKILRDE-GLCGIGTQSSYP 346
NYW++KNSWG +WG GY+ + +D CGI T +S+P
Sbjct: 305 SSGNYWIVKNSWGTSWGIEGYILMSKDRNNNCGIATMASFP 345
>gi|301767946|ref|XP_002919405.1| PREDICTED: cathepsin S-like [Ailuropoda melanoleuca]
Length = 340
Score = 238 bits (607), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 128/305 (41%), Positives = 184/305 (60%), Gaps = 14/305 (4%)
Query: 50 WMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKE---GNRTYKLGTNRFSDLTNDEFRA 106
W +G+ YK++ E+ R I+++NL+++ N E G +Y LG N D+T++E +
Sbjct: 40 WKKTYGKQYKEKNEEVARRLIWEKNLKFVTLHNLEHSMGMHSYDLGMNHLGDMTSEEVIS 99
Query: 107 LYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAV 166
L + ++PS R+ T Y++ S +P S+DWR+K VT +K Q CG CWAFSAV
Sbjct: 100 LMSSLRVPSQWPRNVT-----YKSNSNQKLPDSVDWREKGCVTKVKYQGACGACWAFSAV 154
Query: 167 AAVEGITKISGANLIQLSEQQLVDCSTN--GNNGCGGGTMEKAFEYIIQNQGIATEDEYP 224
A+E K+ L+ LS Q LVDCST GN GC GG M +AF+YII N GI +E YP
Sbjct: 155 GALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTEAFQYIIDNNGIDSEASYP 214
Query: 225 YQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAVSMQ-PVSIGIAAYTTEFKSYKEG 283
Y+A G C K AA S Y E+PSG E L +AV+ + PVS+ I A + F Y+ G
Sbjct: 215 YKATDGKCRYDSKNRAATCSKYTELPSGSEDDLKEAVANKGPVSVAIDARHSSFFLYRSG 274
Query: 284 I-FNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRDEG-LCGIGT 341
+ ++ C ++H V +VG+G +G +YWL+KNSWG +GD GY+++ R+ G CGI +
Sbjct: 275 VYYDPSCTQNVNHGVLVVGYGNL-NGKDYWLVKNSWGLNFGDQGYIRMARNSGNHCGIAS 333
Query: 342 QSSYP 346
SYP
Sbjct: 334 YPSYP 338
>gi|21953244|emb|CAD42716.1| putative cathepsin L [Myzus persicae]
Length = 341
Score = 238 bits (607), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 135/342 (39%), Positives = 197/342 (57%), Gaps = 17/342 (4%)
Query: 20 IIIILLVSCASQVVSSRSTHEQSVVEMHEKW---MAQHGRSYKDELEKEMRFKIFKENLE 76
+I++ LV+ A VSS + +E V+E E+W Q + Y+D E+ R K++ +N
Sbjct: 4 VIVLGLVAFAISSVSSINLNE--VIE--EEWSLFKMQFKKLYEDIKEETFRKKVYLDNKL 59
Query: 77 YIEKANK---EGNRTYKLGTNRFSDLTNDEFRALYTGYK--MPSPSHRSTTSSTFKYQNL 131
I + NK G TY L N F DL E+ + G+K + T +
Sbjct: 60 KIARHNKLYESGEETYALEMNHFGDLMQHEYSKMMNGFKPSLAGGDSNFTNDEGVTFLKS 119
Query: 132 SMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDC 191
+P S+DWR K VTP+K+Q +CG CW+FSA ++EG L+ LSEQ L+DC
Sbjct: 120 ENVVIPKSIDWRKKGYVTPVKNQGQCGSCWSFSATGSLEGQHFRKTGVLVSLSEQNLIDC 179
Query: 192 STN-GNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVP 250
S GNNGC GG M+ AF+YI N+G+ TE YPY+A C + A + + ++P
Sbjct: 180 SRKYGNNGCEGGLMDLAFKYIKSNKGLDTEKSYPYEAEDDKCRYNPDNSGATDNGFVDIP 239
Query: 251 SGDEQALLKAV-SMQPVSIGIAAYTTEFKSYKEGIF-NGVC-GTQLDHAVTIVGFGTTED 307
GDE+AL+ A+ ++ PVSI I A + +F+ YK+G+F N C T+LDH V VGF T +
Sbjct: 240 EGDEEALMHALATVGPVSIAIDASSEKFQFYKKGVFYNPRCSSTELDHGVLAVGFRTDKK 299
Query: 308 GANYWLIKNSWGDTWGDAGYMKILRD-EGLCGIGTQSSYPLA 348
G +YW++KNSWG TWGD GY+ + R+ + CG+ + +SYPL
Sbjct: 300 GGDYWIVKNSWGKTWGDEGYIMMARNKKNNCGVASSASYPLV 341
>gi|359483753|ref|XP_002266308.2| PREDICTED: oryzain alpha chain-like [Vitis vinifera]
Length = 501
Score = 238 bits (607), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 140/355 (39%), Positives = 199/355 (56%), Gaps = 22/355 (6%)
Query: 9 GSFKINTIPMFIIIILLVSCASQ-------VVSSRSTHEQSVVEMHEKWMAQHGRSYKDE 61
GS KI + + + I ++C S + E+ V E+ W +H R YK
Sbjct: 2 GSQKIQ-LALVLFIWASLACLSSSLPTEFYITGEEFASEERVRELFHLWKERHKRVYKHA 60
Query: 62 LEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRST 121
E RF+IFKENL+Y+ + N +G+R + LG N+F+D++N+EF+ Y + ++
Sbjct: 61 EETAKRFEIFKENLKYVIERNSKGHR-HTLGMNKFADMSNEEFKEKYLSKIKKPINKKNN 119
Query: 122 --TSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGAN 179
S + + + + P+SLDWR K VT IKDQ +CG CWAFS+ A+EGI I +
Sbjct: 120 YLRRSMQQKKGTASCEAPSSLDWRKKGVVTGIKDQGDCGSCWAFSSTGAMEGINAIVTGD 179
Query: 180 LIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQK-A 238
LI LSEQ+LVDC T N GC GG M+ AFE++I N GI +E +YPY GTC+ ++
Sbjct: 180 LISLSEQELVDCDTT-NYGCEGGYMDYAFEWVISNGGIDSESDYPYTGTDGTCNTTKEDT 238
Query: 239 AAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNG---VCGTQLDH 295
I Y++V D ALL A QP+S+G+ +F+ Y GI+ G +DH
Sbjct: 239 KVVSIDGYKDVDESD-SALLCAAVNQPISVGMDGSALDFQLYTSGIYAGDCSDDPDDIDH 297
Query: 296 AVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRDEGL----CGIGTQSSYP 346
AV IVG+G +ED +YW+ KNSWG +WG GY I R+ L C I +SYP
Sbjct: 298 AVLIVGYG-SEDSEDYWICKNSWGTSWGMEGYFYIKRNTDLPYGECAINAMASYP 351
>gi|305434756|gb|ADM53740.1| cathepsin L1 precursor [Lepeophtheirus salmonis]
Length = 325
Score = 238 bits (607), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 131/307 (42%), Positives = 187/307 (60%), Gaps = 15/307 (4%)
Query: 50 WMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKE---GNRTYKLGTNRFSDLTNDEFRA 106
W HG++Y +E+R KIF+EN I+K N E G TY L N++ DL EF
Sbjct: 24 WTKLHGKTYTSFEIEELRVKIFEENRIKIQKHNAEAQNGLHTYSLEMNQYGDLLQSEFLQ 83
Query: 107 LYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAV 166
YTG S S +T + VP+ ++W AVT +KDQ++CG CWAFS
Sbjct: 84 GYTGLAKGSYSGDNTVILD------NSAPVPSYVNWTKNGAVTAVKDQKDCGSCWAFSTT 137
Query: 167 AAVEGITKISGANLIQLSEQQLVDCSTN-GNNGCGGGTMEKAFEYIIQNQGIATEDEYPY 225
+VEG I L+ SEQQLVDCS++ N GC GG M+ AF+Y+I N+GIATED YPY
Sbjct: 138 GSVEGQYFIKNKKLLSFSEQQLVDCSSDFRNEGCNGGWMDNAFKYLIANKGIATEDTYPY 197
Query: 226 QAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAVS-MQPVSIGIAAYTTEFKSYKEGI 284
A G C + AA +IS++++V G E L AV+ + P+S+ I A + +F+ YK+G+
Sbjct: 198 TATDGVCVYNKTMAAGRISSFKDVKHGSEDQLKLAVAQIGPISVAIDASSGDFQFYKKGV 257
Query: 285 F-NGVCGTQ-LDHAVTIVGFGTTED-GANYWLIKNSWGDTWGDAGYMKILRD-EGLCGIG 340
+ + C ++ LDH V VG+GT + G +YWL+KNSW +WGD GY+K+ R+ + +CGI
Sbjct: 258 YVDEECSSKYLDHGVLAVGYGTDKGTGLDYWLVKNSWSASWGDQGYIKMARNHKNMCGIA 317
Query: 341 TQSSYPL 347
+ +SYP+
Sbjct: 318 SLASYPV 324
>gi|281352890|gb|EFB28474.1| hypothetical protein PANDA_008012 [Ailuropoda melanoleuca]
Length = 328
Score = 238 bits (606), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 128/305 (41%), Positives = 184/305 (60%), Gaps = 14/305 (4%)
Query: 50 WMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKE---GNRTYKLGTNRFSDLTNDEFRA 106
W +G+ YK++ E+ R I+++NL+++ N E G +Y LG N D+T++E +
Sbjct: 28 WKKTYGKQYKEKNEEVARRLIWEKNLKFVTLHNLEHSMGMHSYDLGMNHLGDMTSEEVIS 87
Query: 107 LYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAV 166
L + ++PS R+ T Y++ S +P S+DWR+K VT +K Q CG CWAFSAV
Sbjct: 88 LMSSLRVPSQWPRNVT-----YKSNSNQKLPDSVDWREKGCVTKVKYQGACGACWAFSAV 142
Query: 167 AAVEGITKISGANLIQLSEQQLVDCSTN--GNNGCGGGTMEKAFEYIIQNQGIATEDEYP 224
A+E K+ L+ LS Q LVDCST GN GC GG M +AF+YII N GI +E YP
Sbjct: 143 GALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTEAFQYIIDNNGIDSEASYP 202
Query: 225 YQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAVSMQ-PVSIGIAAYTTEFKSYKEG 283
Y+A G C K AA S Y E+PSG E L +AV+ + PVS+ I A + F Y+ G
Sbjct: 203 YKATDGKCRYDSKNRAATCSKYTELPSGSEDDLKEAVANKGPVSVAIDARHSSFFLYRSG 262
Query: 284 I-FNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRDEG-LCGIGT 341
+ ++ C ++H V +VG+G +G +YWL+KNSWG +GD GY+++ R+ G CGI +
Sbjct: 263 VYYDPSCTQNVNHGVLVVGYGNL-NGKDYWLVKNSWGLNFGDQGYIRMARNSGNHCGIAS 321
Query: 342 QSSYP 346
SYP
Sbjct: 322 YPSYP 326
>gi|391332597|ref|XP_003740719.1| PREDICTED: cathepsin L-like [Metaseiulus occidentalis]
Length = 330
Score = 238 bits (606), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 135/337 (40%), Positives = 194/337 (57%), Gaps = 19/337 (5%)
Query: 20 IIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKE--MRFKIFKENLEY 77
+I+ L V+C VS + E E + QH ++Y L+K+ R IF+ N++
Sbjct: 1 MILSLTVACIFVGVSPAAVDAHD--EHWELFKRQHNKTY---LQKQDVGRRAIFEANIKK 55
Query: 78 IEKAN---KEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT 134
I N G +Y+LG N F+D+T DEF Y G + + R S ++++
Sbjct: 56 INAHNLLYDLGRSSYRLGLNGFADMTPDEFEK-YRGTRFEANEARV---SKLQHRDNRSM 111
Query: 135 DVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCS-T 193
VP ++DWR + VTP+K+Q CG CWAFS A+EG +L+ LSEQ LVDCS
Sbjct: 112 HVPDTVDWRTEGYVTPVKNQGVCGSCWAFSTTGALEGQHFRRSGDLVSLSEQMLVDCSAV 171
Query: 194 NGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGD 253
GN GC GG M+ AF +I G+ TE YPY GTC + AK++ + +VPS D
Sbjct: 172 YGNAGCNGGLMDNAFRFIKDAGGLETEKSYPYTGKDGTCHFDARGIGAKLTGFVDVPSRD 231
Query: 254 EQALLKAVSM-QPVSIGIAAYTTEFKSYKEGIFNGVC--GTQLDHAVTIVGFGTTEDGAN 310
E+AL +A + PVS+ I A F+ YK+G+++ + T LDH V +VG+GTT DG +
Sbjct: 232 EEALKEAAGVVGPVSVAIDASGQNFQFYKDGVYDEITCSSTSLDHGVLVVGYGTTRDGKD 291
Query: 311 YWLIKNSWGDTWGDAGYMKILRD-EGLCGIGTQSSYP 346
YWL+KNSWG +WG +GY+++ R+ E CGI T +SYP
Sbjct: 292 YWLVKNSWGSSWGQSGYIQMSRNKENQCGIATMASYP 328
>gi|148224022|ref|NP_001087489.1| cathepsin L2 precursor [Xenopus laevis]
gi|51258284|gb|AAH80004.1| MGC81823 protein [Xenopus laevis]
Length = 335
Score = 238 bits (606), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 137/342 (40%), Positives = 199/342 (58%), Gaps = 20/342 (5%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEY 77
M + ++ C + V ++ +T + ++ + W H +SY + E+ R ++++NL
Sbjct: 1 MALYLVAAALCLTTVFAAPTT-DPALDDHWHLWKNWHKKSYLPK-EEGWRRVLWEKNLRT 58
Query: 78 IEKANKE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT 134
IE N + G +Y+LG N+F D+TN+EFR L GYK + + STF N
Sbjct: 59 IEFHNLDHSLGKHSYRLGMNQFGDMTNEEFRQLMNGYK----NQKMIKGSTFLAPN--NF 112
Query: 135 DVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCS-T 193
+ P ++DWR+K VTP+KDQ +CG CWAFS A+EG LI LSEQ LVDCS
Sbjct: 113 EAPKTVDWREKGYVTPVKDQGQCGSCWAFSTTGALEGQHYRKAGKLISLSEQNLVDCSRA 172
Query: 194 NGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGT-CSAAQKAAAAKISNYEEVPSG 252
GN GC GG M++AF+Y+ N GI +ED YPY A C +A + + +VPSG
Sbjct: 173 QGNQGCNGGLMDQAFQYVKDNGGIDSEDSYPYTAKDDQECHYDPNYNSANDTGFVDVPSG 232
Query: 253 DEQALLKAV-SMQPVSIGIAAYTTEFKSYKEGI-FNGVCGTQ-LDHAVTIVGF---GTTE 306
E+ L+KAV S+ PVS+ + A F+ Y+ GI ++ C ++ LDH V +VG+ G
Sbjct: 233 SEKDLMKAVASVGPVSVAVDAGHKSFQFYQSGIYYDPECSSEDLDHGVLVVGYGFEGEDV 292
Query: 307 DGANYWLIKNSWGDTWGDAGYMKILRD-EGLCGIGTQSSYPL 347
DG YW++KNSW + WG+ GY+KI +D CGI T +SYPL
Sbjct: 293 DGKRYWIVKNSWSEKWGNNGYIKIAKDRHNHCGIATAASYPL 334
>gi|185135439|ref|NP_001117777.1| procathepsin L precursor [Oncorhynchus mykiss]
gi|14582899|gb|AAK69706.1|AF358668_1 procathepsin L [Oncorhynchus mykiss]
Length = 338
Score = 238 bits (606), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 138/343 (40%), Positives = 193/343 (56%), Gaps = 24/343 (6%)
Query: 20 IIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIE 79
+ + +LV C S V ++ Q H W H + Y E E+ R ++++NL+ IE
Sbjct: 4 LYLAVLVLCVSAVCAAPRFDSQLEDHWH-LWKNWHSKHYH-ESEEGWRRMVWEKNLKKIE 61
Query: 80 KANKE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFK---YQNLSM 133
N E G +Y+LG N F D+TN+EFR GYK TT FK + +
Sbjct: 62 IHNLEHTMGKHSYRLGMNHFGDMTNEEFRQTMNGYK-------QTTERKFKGSLFMEPNY 114
Query: 134 TDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCS- 192
P ++DWR+K VTP+KDQ CG CWAFS A+EG L+ LSEQ LVDCS
Sbjct: 115 LQAPKAVDWREKGYVTPVKDQGSCGSCWAFSTTGAMEGQQFRKTGKLVSLSEQNLVDCSR 174
Query: 193 TNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAV-QGTCSAAQKAAAAKISNYEEVPS 251
GN GC GG M++AF+YI N G+ TE+ YPY + C + +AA + + ++PS
Sbjct: 175 PEGNEGCNGGLMDQAFQYIQDNAGLDTEESYPYVGTDEDPCHYKPEFSAANETGFVDIPS 234
Query: 252 GDEQALLKAV-SMQPVSIGIAAYTTEFKSYKEGI-FNGVCGT-QLDHAVTIVGF---GTT 305
G E A++KAV ++ PVS+ I A F+ Y+ GI + C + +LDH V +VG+ G
Sbjct: 235 GKEHAMMKAVAAVGPVSVAIDAGHESFQFYESGIYYEKECSSEELDHGVLVVGYGFEGED 294
Query: 306 EDGANYWLIKNSWGDTWGDAGYMKILRD-EGLCGIGTQSSYPL 347
DG YW++KNSW + WGD GY+ + +D + CGI T SSYPL
Sbjct: 295 VDGKKYWIVKNSWSEKWGDKGYIYMAKDRKNHCGIATASSYPL 337
>gi|113120265|gb|ABI30272.1| VXH-A, partial [Vasconcellea x heilbornii]
Length = 318
Score = 238 bits (606), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 129/287 (44%), Positives = 175/287 (60%), Gaps = 18/287 (6%)
Query: 38 THEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFS 97
T + ++ + + WM ++ + YKD EK RF+IFK+NL+YI++ NK+ N TY LG F+
Sbjct: 39 TSTEKLINLFDSWMVEYDKVYKDIDEKIYRFEIFKDNLKYIDETNKK-NNTYWLGLTSFT 97
Query: 98 DLTNDEFRALYTGYKMPSPSHRSTTSST----FKYQNLSMTDVPTSLDWRDKKAVTPIKD 153
DLTNDEF+ Y G P + STT + F Y ++ ++P S+DWR K AVTP+++
Sbjct: 98 DLTNDEFKEKYVG---SIPENWSTTEESNDKEFIYDDV--VNIPASIDWRQKGAVTPVRN 152
Query: 154 QQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQ 213
Q CG CW FS+VAAVEGI KI L+ LSEQ+L+DC + GC GG A +Y +
Sbjct: 153 QGSCGSCWTFSSVAAVEGINKIVTGQLVSLSEQELLDCERR-SYGCRGGFPPYALQY-VA 210
Query: 214 NQGIATEDEYPYQAVQGTCSAAQ-KAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAA 272
N GI YPY+ VQ C AAQ K K V +EQAL++ +++QPVSI + A
Sbjct: 211 NSGIHLRQYYPYEGVQRQCRAAQAKGPKVKTDGVGRVQRNNEQALIQRIAIQPVSIVVEA 270
Query: 273 YTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWG 319
F++Y+ GIF G CGT +DHAV VG+G Y LIKNSWG
Sbjct: 271 KGRAFQNYRGGIFAGPCGTSIDHAVAAVGYGN-----GYILIKNSWG 312
>gi|332376957|gb|AEE63618.1| unknown [Dendroctonus ponderosae]
Length = 318
Score = 238 bits (606), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 133/325 (40%), Positives = 189/325 (58%), Gaps = 20/325 (6%)
Query: 29 ASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANK---EG 85
AS ++ + ++V + + +H +SY +++E+ R IF ENL IE+ N G
Sbjct: 7 ASLLIVAVGASLENVGSTFQSFKLKHSKSYSNQVEEAKRLAIFTENLRDIEEHNALYAAG 66
Query: 86 NRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDK 145
+Y N+F+DLT DEF+A T + P T +T Y + VPT+LDWR +
Sbjct: 67 LVSYNKSVNQFTDLTIDEFKAYLTLHSKP-------TLNTVPYVRTGL-QVPTTLDWRSQ 118
Query: 146 KAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTME 205
VT +KDQ +CG CWAFS V + EG S L+ LSEQQL+DC+TN N+GC GG +E
Sbjct: 119 GYVTGVKDQGDCGSCWAFSVVGSTEGAYYKSTGKLVSLSEQQLIDCTTNVNDGCDGGYLE 178
Query: 206 KAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAV-SMQ 264
+ F Y +Q G+ +E YPY G C ++ K+S Y V G E LL+AV S+
Sbjct: 179 ETFPY-VQQTGLVSESSYPYTGRDGNCRISESDVVTKVSKY--VLLGGEADLLEAVGSVG 235
Query: 265 PVSIGIAAYTTEFKSYKEGIF-NGVCGT-QLDHAVTIVGFGTTEDGANYWLIKNSWGDTW 322
PVS+ + A T SY G++ + +C L+H V +VG+G T+DG +YWLIKNSWG+TW
Sbjct: 236 PVSVAMDA--TYIYSYASGVYESSLCSLYSLNHGVLVVGYG-TQDGKDYWLIKNSWGNTW 292
Query: 323 GDAGYMKILRDEGLCGIGTQSSYPL 347
G+ GY+K+LR CGI YP+
Sbjct: 293 GEQGYLKLLRGTNECGIAEDDVYPI 317
>gi|126681066|gb|ABO26562.1| cathepsin L-like cysteine protease [Ixodes ricinus]
Length = 335
Score = 238 bits (606), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 124/304 (40%), Positives = 181/304 (59%), Gaps = 9/304 (2%)
Query: 52 AQHGRSYKDELEKEMRFKIFKENLEYIEKANKE---GNRTYKLGTNRFSDLTNDEFRALY 108
A+HG+SY E E+ R KI+ EN I K N++ G Y + N F D+ + EF +
Sbjct: 32 AKHGKSYVSETEEVFRLKIYMENRHKIAKHNEKYARGEVPYSMAMNEFGDMLHHEFVSTR 91
Query: 109 TGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAA 168
G+K S+ + +N+ +P ++DWR K AVTP+K+Q +CG CWAFSA +
Sbjct: 92 NGFKRNYKDQPREGSTYLEPENIEDFSLPKTVDWRTKGAVTPVKNQGQCGSCWAFSATGS 151
Query: 169 VEGITKISGANLIQLSEQQLVDCSTN-GNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQA 227
+EG +++ LSEQ LV CST+ GNNGC GG M+ AF+YI N+GI TE YPY
Sbjct: 152 LEGQHFRKSGSMVSLSEQNLVGCSTDFGNNGCEGGLMDDAFKYIRANKGIDTEKSYPYNG 211
Query: 228 VQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAV-SMQPVSIGIAAYTTEFKSYKEGIFN 286
GTC + A S + ++ G E L KAV ++ P+S+ I A F+ Y +G+++
Sbjct: 212 TDGTCHFKKSTVGATDSGFVDIKEGSETQLKKAVATVGPISVAIDASHESFQFYSDGVYD 271
Query: 287 GV-CGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD-EGLCGIGTQS 343
C ++ LDH V +VG+GT +G +YW +KNSWG TWGD GY+++ R+ + CGI + +
Sbjct: 272 EPECDSESLDHGVLVVGYGTL-NGTDYWFVKNSWGTTWGDEGYIRMSRNKKNQCGIASSA 330
Query: 344 SYPL 347
S PL
Sbjct: 331 SIPL 334
>gi|344953542|gb|AEN28617.1| cathepsin L-like cysteine protease [Epinephelus coioides]
Length = 336
Score = 238 bits (606), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 136/344 (39%), Positives = 197/344 (57%), Gaps = 22/344 (6%)
Query: 16 IPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENL 75
+P+ ++ + C S +S+ S Q + + E W + H + Y E E+ R ++++NL
Sbjct: 2 LPLAVVAL----CLSAALSAPSLDPQ-LDDHWELWKSWHSKKYH-EKEEGWRRMVWEKNL 55
Query: 76 EYIEKANKE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
+ IE N E G +Y+LG N F D+T++EFR L GYK + + S F N
Sbjct: 56 KKIELHNLEHSMGTHSYRLGMNHFGDMTHEEFRQLMNGYKRKAET--KARGSLFLEPNF- 112
Query: 133 MTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCS 192
+ P S+DWRD VTP+KDQ +CG CWAFS A+EG L+ LSEQ LVDCS
Sbjct: 113 -LEAPKSVDWRDNGYVTPVKDQGQCGSCWAFSTTGALEGQHFRKTGKLVSLSEQNLVDCS 171
Query: 193 -TNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQG-TCSAAQKAAAAKISNYEEVP 250
GN GC GG M++AF+Y+ NQG+ +ED YPY C + + + ++P
Sbjct: 172 RPEGNEGCNGGLMDQAFQYVKDNQGLDSEDSYPYLGTDDQPCHYDPTYNSVNDTGFVDIP 231
Query: 251 SGDEQALLKAV-SMQPVSIGIAAYTTEFKSYKEGI-FNGVCGT-QLDHAVTIVGF---GT 304
SG E+AL+KAV ++ PVS+ I A F+ Y+ GI + C + +LDH V +VG+ G
Sbjct: 232 SGKERALMKAVAAVGPVSVAIDAGHESFQFYQSGIYYEKECSSEELDHGVLVVGYGFQGE 291
Query: 305 TEDGANYWLIKNSWGDTWGDAGYMKILRD-EGLCGIGTQSSYPL 347
DG YW++KNSW + WGD GY+ + +D + CGI T +SYPL
Sbjct: 292 DVDGKKYWIVKNSWSEKWGDKGYIYMAKDRKNHCGIATAASYPL 335
>gi|71897043|ref|NP_001026516.1| cathepsin S precursor [Gallus gallus]
gi|53126701|emb|CAG30977.1| hypothetical protein RCJMB04_1f23 [Gallus gallus]
Length = 328
Score = 238 bits (606), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 128/332 (38%), Positives = 194/332 (58%), Gaps = 18/332 (5%)
Query: 25 LVSCASQVVSSRST--HEQSVVEMH-EKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKA 81
L+ C + +V+ + H ++ H + W HG+ Y+ + E+ R +++NL +
Sbjct: 3 LLRCMAVLVTLVAVMGHPDPTLDQHWQLWKKAHGKEYRHQAEEGQRRATWEKNLRLVMLH 62
Query: 82 NKE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPT 138
N E G +Y+LG N D+T+++ AL TG ++P + +ST++ + P
Sbjct: 63 NLEHSLGLHSYQLGMNHMGDMTSEDVAALLTGLRVP---YGHNQTSTYRRRG----GAPD 115
Query: 139 SLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTN-GNN 197
++DWR+K VT +K+Q CG CWAFSAV A+E K+ L+ LS Q LVDCS GN
Sbjct: 116 AMDWREKGCVTEVKNQGACGACWAFSAVGALEAQVKLKTGKLVSLSAQNLVDCSMMYGNK 175
Query: 198 GCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQAL 257
GCGGG M +AF+YII N GI +E+ YPY A GTC AA S Y E+P DE AL
Sbjct: 176 GCGGGFMTRAFQYIIDNNGIDSEESYPYMAQNGTCQYNVSTRAATCSKYVELPYADEAAL 235
Query: 258 LKAVS-MQPVSIGIAAYTTEFKSYKEGIFNGV-CGTQLDHAVTIVGFGTTEDGANYWLIK 315
AV+ + PVS+ I A F Y+ G+++ C +++H V +VG+GT + ++WL+K
Sbjct: 236 KDAVANVGPVSVAIDATQPTFFLYRSGVYDDPRCTQEVNHGVLVVGYGTLNE-KDFWLVK 294
Query: 316 NSWGDTWGDAGYMKILRDEG-LCGIGTQSSYP 346
NSWG+ +GD GY+++ R+ CGI + +SYP
Sbjct: 295 NSWGERFGDGGYIRMSRNHANHCGIASYASYP 326
>gi|432910512|ref|XP_004078392.1| PREDICTED: cathepsin K-like [Oryzias latipes]
Length = 331
Score = 238 bits (606), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 135/335 (40%), Positives = 194/335 (57%), Gaps = 20/335 (5%)
Query: 22 IILLVSCASQVVSSRSTHEQSVVEMH-EKWMAQHGRSYKDELEKEMRFKIFKENLEYIEK 80
++LL+S S S +++ ++ H E+W H + Y E+ +R I+++NL IE
Sbjct: 7 VLLLLS-----ASVMSQMDETTLDAHWEEWKMTHTKEYITVEEEGIRRAIWEKNLRMIEA 61
Query: 81 ANKE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMP-SPSHRSTTSSTFKYQNLSMTDV 136
N+E G TY LG N+F D+T +E TG +MP +P R + + S+ +
Sbjct: 62 HNQEAALGMHTYTLGMNQFGDMTQEEVVERMTGLQMPLNPEPRVPMET-----DGSLIKL 116
Query: 137 PTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGN 196
P S+D+R K VT +K+Q CG CWAFS+V A+EG NL+ LS Q LVDC T N
Sbjct: 117 PKSVDYRKKGMVTSVKNQGSCGSCWAFSSVGALEGQLAKKTGNLVDLSPQNLVDCVTE-N 175
Query: 197 NGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQA 256
+GCGGG M AF+Y+ +N GI +E YPY C AA+I Y+EVP GDE A
Sbjct: 176 DGCGGGYMTNAFKYVQENGGIDSEAAYPYMGEDQPCRYNVSGLAAQIKGYKEVPEGDEHA 235
Query: 257 LLKAV-SMQPVSIGIAAYTTEFKSYKEGI-FNGVCGTQ-LDHAVTIVGFGTTEDGANYWL 313
L A+ PVS+GI A F Y++GI F+ C + ++HAV VG+G G +W+
Sbjct: 236 LAVALFKAGPVSVGIDASQNSFLYYQKGIYFDRNCNKEDINHAVLAVGYGVNAKGKKFWI 295
Query: 314 IKNSWGDTWGDAGYMKILRDEG-LCGIGTQSSYPL 347
+KNSWG+TWG+ GY+ + R+ G +CGI +SYP+
Sbjct: 296 VKNSWGETWGNKGYVLMARNRGNVCGIANLASYPV 330
>gi|348514005|ref|XP_003444531.1| PREDICTED: cathepsin L1-like [Oreochromis niloticus]
Length = 338
Score = 237 bits (605), Expect = 5e-60, Method: Compositional matrix adjust.
Identities = 131/310 (42%), Positives = 183/310 (59%), Gaps = 17/310 (5%)
Query: 50 WMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKE---GNRTYKLGTNRFSDLTNDEFRA 106
W + H + Y E E+ R ++++NL+ IE N + G TY+LG N F D+TN+EFR
Sbjct: 33 WKSWHTKKYH-EKEEGWRRMVWEKNLKKIELHNLDHSMGKHTYRLGMNHFGDMTNEEFRQ 91
Query: 107 LYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAV 166
L GYK + R S F N + P SLDWRDK VTP+KDQ +CG CWAFSA
Sbjct: 92 LMNGYK--HKAERKVKGSLFLEPNF--LEAPRSLDWRDKGYVTPVKDQGQCGSCWAFSAT 147
Query: 167 AAVEGITKISGANLIQLSEQQLVDCS-TNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPY 225
A+EG ++QLSEQ LV+CS GN GC GG M++AF+Y+ NQG+ +E+ YPY
Sbjct: 148 GALEGQQFRKTGKMVQLSEQNLVECSRPEGNEGCNGGLMDQAFQYVKDNQGLDSEESYPY 207
Query: 226 QAVQG-TCSAAQKAAAAKISNYEEVPSGDEQALLKAVS-MQPVSIGIAAYTTEFKSYKEG 283
C + A + + ++ SG E AL+KAV+ + P+S+ I A F+ Y+ G
Sbjct: 208 LGTDDQKCHYDPRYNAVNDTGFVDIKSGSEHALMKAVTAVGPISVAIDAGHESFQFYQSG 267
Query: 284 I-FNGVCGT-QLDHAVTIVGF---GTTEDGANYWLIKNSWGDTWGDAGYMKILRD-EGLC 337
I + C + +LDH V +VG+ G DG YW++KNSW + WGD GY+ + +D + C
Sbjct: 268 IYYEPECSSEELDHGVLLVGYGFEGEDVDGKKYWIVKNSWSEKWGDKGYVYMAKDRQNHC 327
Query: 338 GIGTQSSYPL 347
GI T +SYPL
Sbjct: 328 GIATAASYPL 337
>gi|261289785|ref|XP_002611754.1| hypothetical protein BRAFLDRAFT_284341 [Branchiostoma floridae]
gi|229297126|gb|EEN67764.1| hypothetical protein BRAFLDRAFT_284341 [Branchiostoma floridae]
Length = 327
Score = 237 bits (605), Expect = 5e-60, Method: Compositional matrix adjust.
Identities = 137/341 (40%), Positives = 199/341 (58%), Gaps = 29/341 (8%)
Query: 19 FIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMA---QHGRSYKDELEKEMRFKIFKENL 75
F+I++L V+ A+ M +W A HG+ YK E+ +R IF++N
Sbjct: 3 FLILVLSVTMATA--------------MDVEWEAFKLTHGKQYKSPDEENVRRAIFRDNN 48
Query: 76 EYIEKANKE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
+ I++ N+E G R+Y +G N+F DL + E+ L G + P + ST S +++
Sbjct: 49 QMIKEHNQEAAMGRRSYFMGMNQFGDLAHSEYLELVVGPGL-LPLNLSTPSENV-FESTP 106
Query: 133 MTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCS 192
V ++DWR K AVTPIKDQ CG CWAFS ++EG + L+ LSEQ L+DCS
Sbjct: 107 GLQVDDTVDWRQKGAVTPIKDQGHCGSCWAFSTTGSLEGQHFMKTGKLVSLSEQNLLDCS 166
Query: 193 TN-GNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAV-QGTCSAAQKAAAAKISNYEEVP 250
GN GC GG M++AF YI N GI TE+ YPY A + C + A +S+Y ++
Sbjct: 167 RRFGNKGCEGGLMDQAFRYIKSNGGIDTEECYPYMAKDEKVCDYKTSCSGATLSSYTDIK 226
Query: 251 SGDEQALLKAV-SMQPVSIGIAAYTTEFKSYKEGIFN--GVCGTQLDHAVTIVGFGTTED 307
+ DE AL++AV ++ PVS+ I A + YK GI++ T+LDH V VG+G+ D
Sbjct: 227 AMDEMALMQAVGTVGPVSVAIDASHKSLRFYKSGIYDEPECSRTKLDHGVLAVGYGSM-D 285
Query: 308 GANYWLIKNSWGDTWGDAGYMKILRDE-GLCGIGTQSSYPL 347
G +YWL+KNSWG WGD GY+K+ R++ CGI T++SYP+
Sbjct: 286 GMDYWLVKNSWGSAWGDMGYVKMTRNKNNQCGIATKASYPV 326
>gi|383849553|ref|XP_003700409.1| PREDICTED: cathepsin L-like [Megachile rotundata]
Length = 343
Score = 237 bits (605), Expect = 5e-60, Method: Compositional matrix adjust.
Identities = 129/347 (37%), Positives = 200/347 (57%), Gaps = 29/347 (8%)
Query: 20 IIIILLVSCAS-QVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYI 78
I+++++++CA+ Q +S Q + + +H + YK E E+ +R KI+ +N I
Sbjct: 4 ILLLIVITCAAVQAISFFELVNQEWI----NFKMEHKKCYKHEAEERLRMKIYMKNKLQI 59
Query: 79 EKANKE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM-- 133
+ N + TY+L N++ D+ N EF+ + GY T + T + + L +
Sbjct: 60 AQHNCDYELKKVTYRLKINKYGDMLNHEFKNMLNGYN-------RTINHTLRNERLPVGA 112
Query: 134 -------TDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQ 186
++P +DWR AVT +KDQ CG CWAFSA ++EG L+ LSEQ
Sbjct: 113 AFIEPCNVELPKMVDWRKCGAVTEVKDQGHCGSCWAFSATGSLEGQHFRRTGVLVSLSEQ 172
Query: 187 QLVDCS-TNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISN 245
L+DCS + GNNGC GG M++AF YI N+G+ TE YPY+ C ++++ A
Sbjct: 173 NLIDCSGSYGNNGCNGGLMDQAFSYIKDNKGLDTEKTYPYEGEDDKCRYDKRSSGASDVG 232
Query: 246 YEEVPSGDEQALLKAV-SMQPVSIGIAAYTTEFKSYKEGI-FNGVCG-TQLDHAVTIVGF 302
+ ++P GDEQ L AV ++ PVS+ I A F+ Y +GI F C T LDH V +VG+
Sbjct: 233 FVDIPVGDEQKLKAAVATVGPVSVAIDASHQSFQFYSDGIYFEPECSSTNLDHGVLVVGY 292
Query: 303 GTTEDGANYWLIKNSWGDTWGDAGYMKILRD-EGLCGIGTQSSYPLA 348
GT E+G +YW++KNSWG++WG+ GY+K+ R+ + CGI + +SYP+
Sbjct: 293 GTDEEGRDYWIVKNSWGESWGEKGYIKMARNIDNHCGIASSASYPIV 339
>gi|530736|emb|CAA56915.1| cathepsin l [Nephrops norvegicus]
gi|1582621|prf||2119193B cathepsin L-related Cys protease
Length = 313
Score = 237 bits (605), Expect = 5e-60, Method: Compositional matrix adjust.
Identities = 133/307 (43%), Positives = 184/307 (59%), Gaps = 16/307 (5%)
Query: 48 EKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKE---GNRTYKLGTNRFSDLTNDEF 104
E + Q+GR Y D E+ R ++F++N + +E NK+ G T+K+ N+F D+TN+EF
Sbjct: 13 EHFKTQYGRKYGDAKEELYRQRVFQQNEQLVEAFNKKFENGEVTFKVAMNQFGDMTNEEF 72
Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFS 164
A+ GYK S R ++ F + M +DWR K AVTP+KDQ +CG CWAFS
Sbjct: 73 NAVMKGYKKGS---RGEPTTVFTAEGRPMA---ADVDWRTKGAVTPVKDQGQCGSCWAFS 126
Query: 165 AVAAVEGITKISGANLIQLSEQQLVDCSTN-GNNGCGGGTMEKAFEYIIQNQGIATEDEY 223
A ++EG + L+ LSEQ+LVDCST GN+GCGGG M AF+YI N GI TE Y
Sbjct: 127 ATGSLEGQHFLKNNELVSLSEQELVDCSTEYGNDGCGGGWMTSAFDYIKDNGGIDTESSY 186
Query: 224 PYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAVS-MQPVSIGIAAYTTEFKSYKE 282
PY+A +C + A + + EV E+AL +AVS + P+S+ I A F+ Y
Sbjct: 187 PYEAQDRSCRFDANSIGATCTGFVEVQH-TEEALHEAVSDIGPISVAIDASHFSFQFYSS 245
Query: 283 GIF--NGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD-EGLCGI 339
G++ T LDH V VG+G TE +YWL+KNSWG WGDAGY+K+ R+ + CGI
Sbjct: 246 GVYYEKKCSPTNLDHGVLAVGYG-TESTEDYWLVKNSWGSGWGDAGYIKMSRNRDNNCGI 304
Query: 340 GTQSSYP 346
++ SYP
Sbjct: 305 ASEPSYP 311
>gi|47086859|ref|NP_997749.1| cathepsin L, 1 a precursor [Danio rerio]
gi|42542930|gb|AAH66490.1| Cathepsin L1, a [Danio rerio]
Length = 337
Score = 237 bits (605), Expect = 5e-60, Method: Compositional matrix adjust.
Identities = 136/342 (39%), Positives = 193/342 (56%), Gaps = 18/342 (5%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEY 77
M + + C S V ++ T +Q + + ++W H + Y E+ R I+++NL+
Sbjct: 1 MRVFLAAFTLCLSAVFAA-PTLDQQLNDHWDQWKKWHSKKYH-ATEEGWRRVIWEKNLKK 58
Query: 78 IEKANKE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT 134
IE N E G TY+LG N F D+T++EFR + G+K R S F N
Sbjct: 59 IEMHNLEHSMGIHTYRLGMNHFGDMTHEEFRQVMNGFK--HKKDRRFRGSLFMEPNF--I 114
Query: 135 DVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCS-T 193
+VP LDWR+K VTP+KDQ ECG CWAFS A+EG L+ LSEQ LVDCS
Sbjct: 115 EVPNKLDWREKGYVTPVKDQGECGSCWAFSTTGALEGQMFRKTGKLVSLSEQNLVDCSRP 174
Query: 194 NGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQG-TCSAAQKAAAAKISNYEEVPSG 252
GN GC GG M++AF+Y+ G+ +E+ YPY C K +AA + + ++PSG
Sbjct: 175 EGNEGCNGGLMDQAFQYVKDQNGLDSEESYPYLGTDDQPCHFDPKNSAANDTGFVDIPSG 234
Query: 253 DEQALLKAV-SMQPVSIGIAAYTTEFKSYKEGI-FNGVCGT-QLDHAVTIVGF---GTTE 306
E+AL+KA+ ++ PVS+ I A F+ Y+ GI + C + +LDH V VG+ G
Sbjct: 235 KERALMKAIAAVGPVSVAIDAGHESFQFYQSGIYYEKECSSEELDHGVLAVGYGFEGEDV 294
Query: 307 DGANYWLIKNSWGDTWGDAGYMKILRD-EGLCGIGTQSSYPL 347
DG YW++KNSW + WGD GY+ + +D CGI T +SYPL
Sbjct: 295 DGKKYWIVKNSWSENWGDKGYIYMAKDRHNHCGIATAASYPL 336
>gi|229893789|gb|ACQ90252.1| cathepsin L [Pinctada fucata]
Length = 362
Score = 237 bits (605), Expect = 6e-60, Method: Compositional matrix adjust.
Identities = 143/363 (39%), Positives = 206/363 (56%), Gaps = 38/363 (10%)
Query: 15 TIPMFIIIILLVSCASQVVS------SRSTHEQSVV----------EMHEKWM---AQHG 55
T+ I ++ +VS A Q V+ ++ H ++ HE W G
Sbjct: 3 TLIAVICVLTVVSAAPQAVNWFEIQPAKVEHASNLKLQVKASTRLGPYHETWKEFKTLFG 62
Query: 56 RSYKDELEKEM-RFKIFKENLEYIEKANKE---GNRTYKLGTNRFSDLTNDEF---RALY 108
+ Y D +E+E+ RF IF++ LE IE+ N++ G ++Y +G N+FSD+++DE+ L
Sbjct: 63 KVY-DTVEEEIKRFDIFRDTLERIEEHNRKYHMGQKSYYMGVNQFSDMSHDEYLRHNGLR 121
Query: 109 TGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAA 168
G + S + + S + +DWRDK VTP+K+Q +CG CW+FS +
Sbjct: 122 RGNRKYSKGEGCDSYTK------SGKQLDDKVDWRDKGYVTPVKNQGQCGSCWSFSTTGS 175
Query: 169 VEGITKISGANLIQLSEQQLVDCS-TNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQA 227
+EG LI LSEQQLVDCS T GN GC GG M+ AFEYI G+ ED+YPY A
Sbjct: 176 LEGQHFRQTGKLISLSEQQLVDCSGTFGNEGCNGGLMDNAFEYIKSIGGLEGEDDYPYTA 235
Query: 228 VQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAV-SMQPVSIGIAAYTTEFKSYKEGIFN 286
QG C + A + +V SGDE AL A+ S+ P+S+ I A F+SY G+++
Sbjct: 236 KQGKCHLKKSLFKANDTGCTDVESGDEDALKDALASVGPISVAIDASHASFQSYDGGVYD 295
Query: 287 -GVCGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD-EGLCGIGTQS 343
C +Q LDH V VG+GT E+G +YWL+KNSWG+ WG+ GY+K+ R+ + CGI TQ+
Sbjct: 296 EEECSSQNLDHGVLTVGYGTEENGGDYWLVKNSWGEMWGEEGYIKMSRNKDNQCGIATQA 355
Query: 344 SYP 346
SYP
Sbjct: 356 SYP 358
>gi|391340505|ref|XP_003744580.1| PREDICTED: digestive cysteine proteinase 1-like [Metaseiulus
occidentalis]
Length = 469
Score = 237 bits (605), Expect = 6e-60, Method: Compositional matrix adjust.
Identities = 132/308 (42%), Positives = 187/308 (60%), Gaps = 16/308 (5%)
Query: 48 EKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKE--GNRTYKLGTNRFSDLTNDEFR 105
E + G++Y+ + E +R IF+ NL +IEK N E +R Y LG +F+D++ EFR
Sbjct: 167 EHFKEHFGKTYEGD-EHALRQGIFQRNLAHIEKFNAEKAASRGYTLGITQFADMSTAEFR 225
Query: 106 ALYTGYKMPSPSHRSTTSSTFKYQNLSMTD---VPTSLDWRDKKAVTPIKDQQECGCCWA 162
Y G +M + ST + K Q + D +P ++DWRDK AV+P+KDQ +CG CWA
Sbjct: 226 QTYLGLRMNA----STIAKLRKLQREVVADDRDLPEAVDWRDKGAVSPVKDQGQCGSCWA 281
Query: 163 FSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDE 222
FS A+EG + L+ LSEQQ+VDCS + GC GG A EY+ N G+ E
Sbjct: 282 FSTSGAIEGQHFLKNGELLSLSEQQMVDCSWL-DFGCNGGQPMLAMEYVRFNGGLELETA 340
Query: 223 YPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAVS-MQPVSIGIAAYTTEFKSYK 281
YPY+ V G+C + +K+AAAKI+ + E AL KAV+ + P+S+G+ A +F+ YK
Sbjct: 341 YPYKGVGGSCHSDKKSAAAKITGFWMAGFYSESALQKAVAKVGPISVGMDASGEDFQHYK 400
Query: 282 EGIFN--GVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRDEG-LCG 338
GI+N LDHAV VG+GT++DG +YWL+KNSW +WG+ GY K+ R++G CG
Sbjct: 401 SGIYNPESCSSIGLDHAVLAVGYGTSDDG-DYWLVKNSWNTSWGEKGYFKLPRNKGNKCG 459
Query: 339 IGTQSSYP 346
I T YP
Sbjct: 460 IATTPIYP 467
>gi|52630917|gb|AAU84922.1| putative cathepsin L [Toxoptera citricida]
Length = 341
Score = 237 bits (605), Expect = 6e-60, Method: Compositional matrix adjust.
Identities = 134/342 (39%), Positives = 195/342 (57%), Gaps = 17/342 (4%)
Query: 20 IIIILLVSCASQVVSSRSTHEQSVVEMHEKW---MAQHGRSYKDELEKEMRFKIFKENLE 76
+I++ LV A VSS + +E ++E E+W Q + Y+D E+ R K++ +N
Sbjct: 4 VIVLGLVVFAISSVSSINLNE--IIE--EEWDLFKVQFKKIYEDVKEEAFRKKVYLDNKL 59
Query: 77 YIEKANK---EGNRTYKLGTNRFSDLTNDEFRALYTGYK--MPSPSHRSTTSSTFKYQNL 131
I + NK G TY L N F DL E+ + G+K + T +
Sbjct: 60 KIARHNKLYETGEETYALEMNHFGDLMQHEYTKMMNGFKPSLAGGDKNFTDDDAVTFLKS 119
Query: 132 SMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDC 191
+P S+DWR K VTP+K+Q +CG CW+FSA ++EG L+ LSEQ L+DC
Sbjct: 120 ENVVIPKSIDWRKKGYVTPVKNQGQCGSCWSFSATGSLEGQHFRKTGVLVSLSEQNLIDC 179
Query: 192 STN-GNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVP 250
S GNNGC GG M+ AF+YI N+G+ TE YPY+A C + + A + ++P
Sbjct: 180 SRKYGNNGCEGGLMDLAFKYIKSNKGLDTEKSYPYEAEDDKCRYNPENSGATDKGFVDIP 239
Query: 251 SGDEQALLKAV-SMQPVSIGIAAYTTEFKSYKEGIF-NGVC-GTQLDHAVTIVGFGTTED 307
GDE AL+ A+ ++ PVSI I A + +F+ YK+G+F N C T+LDH V VG+GT
Sbjct: 240 EGDEDALVHALATVGPVSIAIDASSEKFQFYKKGVFYNPRCSSTELDHGVLAVGYGTDHK 299
Query: 308 GANYWLIKNSWGDTWGDAGYMKILRD-EGLCGIGTQSSYPLA 348
G +YW++KNSWG TWGD GY+ + R+ + CG+ + +SYPL
Sbjct: 300 GGDYWIVKNSWGKTWGDQGYIMMARNKKNNCGVASSASYPLV 341
>gi|223646726|gb|ACN10121.1| Cathepsin L1 precursor [Salmo salar]
gi|223672581|gb|ACN12472.1| Cathepsin L1 precursor [Salmo salar]
Length = 338
Score = 237 bits (605), Expect = 6e-60, Method: Compositional matrix adjust.
Identities = 138/343 (40%), Positives = 193/343 (56%), Gaps = 24/343 (6%)
Query: 20 IIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIE 79
+ + +LV C S V ++ Q H W H +SY E E+ R ++++NL+ IE
Sbjct: 4 LYLAVLVLCVSAVCAAPRFDSQLEDHWH-LWKNWHSKSYH-ESEEGWRRMVWEKNLKKIE 61
Query: 80 KANKE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFK---YQNLSM 133
N E G +Y+LG N F D+TN+EFR GYK TT FK + +
Sbjct: 62 MHNLEHTMGKHSYRLGMNHFGDMTNEEFRQTMNGYK-------QTTERKFKGSLFMEPNY 114
Query: 134 TDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCS- 192
P ++DWR+K VTP+KDQ CG CWAFS A+EG L+ LSEQ LVDCS
Sbjct: 115 LQAPKAVDWREKGYVTPVKDQGSCGSCWAFSTTGAMEGQQFRKTGKLVSLSEQNLVDCSR 174
Query: 193 TNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAV-QGTCSAAQKAAAAKISNYEEVPS 251
GN GC GG M++AF+YI N G+ TE+ YPY + C + + A + + ++PS
Sbjct: 175 PEGNEGCNGGLMDQAFQYIQDNAGLDTEESYPYVGTDEDPCHYKPEFSGANETGFVDIPS 234
Query: 252 GDEQALLKAV-SMQPVSIGIAAYTTEFKSYKEGI-FNGVCGT-QLDHAVTIVGF---GTT 305
G E A++KAV ++ PVS+ I A F+ Y+ GI + C + +LDH V +VG+ G
Sbjct: 235 GKEHAMMKAVAAVGPVSVAIDAGHESFQFYEFGIYYEKECSSEELDHGVLVVGYGFEGED 294
Query: 306 EDGANYWLIKNSWGDTWGDAGYMKILRD-EGLCGIGTQSSYPL 347
DG YW++KNSW + WGD GY+ + +D + CGI T SSYPL
Sbjct: 295 VDGKKYWIVKNSWSEKWGDKGYIYMAKDRKNHCGIATASSYPL 337
>gi|432936690|ref|XP_004082231.1| PREDICTED: cathepsin L-like [Oryzias latipes]
Length = 334
Score = 237 bits (604), Expect = 6e-60, Method: Compositional matrix adjust.
Identities = 128/312 (41%), Positives = 189/312 (60%), Gaps = 10/312 (3%)
Query: 44 VEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKAN---KEGNRTYKLGTNRFSDLT 100
+E H W + GR+Y E+ R + + N + + N +G ++Y+LG F+D+
Sbjct: 24 LEFH-AWRLKFGRTYSSPTEEAQRRQTWLNNRKLVLVHNILADQGIKSYRLGMTYFADME 82
Query: 101 NDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCC 160
N+E++ L + + S + + ++ D+P ++DWRDK VT +KDQ++CG C
Sbjct: 83 NEEYKRLISQGCLGSFNASLPRRGSTFFRLPENKDLPAAVDWRDKGYVTDVKDQKQCGSC 142
Query: 161 WAFSAVAAVEGITKISGANLIQLSEQQLVDCSTN-GNNGCGGGTMEKAFEYIIQNQGIAT 219
WAFSA ++EG T L+ LSEQQLVDCS + GN GCGGG M+ AF YI GI T
Sbjct: 143 WAFSATGSLEGQTFRKTGKLVSLSEQQLVDCSGDYGNMGCGGGLMDDAFRYIQATGGIDT 202
Query: 220 EDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAV-SMQPVSIGIAAYTTEFK 278
E+ YPY+A G C A A + Y +V SGDE AL +AV ++ P+S+GI A F+
Sbjct: 203 EESYPYEAEDGECRYKPDAVGATCTGYVDVSSGDEDALQEAVATIGPISVGIDASHISFQ 262
Query: 279 SYKEGIFN--GVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRDE-G 335
Y+ G+++ ++LDH V VG+G +E+G +YWL+KNSWG TWGD GY+K+ +++
Sbjct: 263 LYESGLYDEPQCSSSELDHGVLAVGYG-SENGQDYWLVKNSWGLTWGDQGYIKMSKNKSN 321
Query: 336 LCGIGTQSSYPL 347
CGI T +SYPL
Sbjct: 322 QCGIATAASYPL 333
>gi|426216524|ref|XP_004002512.1| PREDICTED: cathepsin S isoform 1 [Ovis aries]
Length = 331
Score = 237 bits (604), Expect = 6e-60, Method: Compositional matrix adjust.
Identities = 135/331 (40%), Positives = 195/331 (58%), Gaps = 18/331 (5%)
Query: 25 LVSCASQVVSSRSTHEQSVVEMH-EKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANK 83
L+ C+S + H ++ H + W +G+ Y+++ E+ R I+++NL+ + N
Sbjct: 8 LLLCSSAMAQ---VHRDPTLDHHWDLWKKTYGKQYEEKNEEVARRLIWEKNLKTVMLHNL 64
Query: 84 E---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSL 140
E G +Y+LG N D+T++E + + ++PS R+ T + Q L P SL
Sbjct: 65 EHSMGMHSYELGMNHLGDMTSEEVISSMSSLRVPSQWPRNVTYKSSPNQKL-----PDSL 119
Query: 141 DWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCST--NGNNG 198
DWR+K VT +K Q CG CWAFSAV A+E K+ L+ LS Q LVDCST GN G
Sbjct: 120 DWREKGCVTEVKYQGACGSCWAFSAVGALEAQVKLKTGKLVSLSAQNLVDCSTVKYGNKG 179
Query: 199 CGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALL 258
C GG M +AF+YII N GI +E YPY+A+ G C K AA S Y E+P G E+AL
Sbjct: 180 CNGGFMTEAFQYIIDNNGIDSEASYPYKAMDGRCQYDVKNRAATCSRYIELPFGSEEALK 239
Query: 259 KAVSMQ-PVSIGIAAYTTEFKSYKEGI-FNGVCGTQLDHAVTIVGFGTTEDGANYWLIKN 316
+AV+ + PVS+GI A T F YK G+ ++ C ++H V +VG+G+ +G +YWL+KN
Sbjct: 240 EAVANKGPVSVGIDAKQTSFFLYKTGVYYDPSCTQNVNHGVLVVGYGSL-NGKDYWLVKN 298
Query: 317 SWGDTWGDAGYMKILRDEG-LCGIGTQSSYP 346
SWG +GD GY+++ R+ G CGI SYP
Sbjct: 299 SWGLNFGDQGYIRMARNSGNHCGIANFPSYP 329
>gi|61368403|gb|AAX43172.1| cathepsin S [synthetic construct]
Length = 332
Score = 237 bits (604), Expect = 7e-60, Method: Compositional matrix adjust.
Identities = 130/338 (38%), Positives = 197/338 (58%), Gaps = 18/338 (5%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQSVVEMH-EKWMAQHGRSYKDELEKEMRFKIFKENLE 76
M ++ +L+ C+S V H+ ++ H W +G+ YK++ E+ +R I+++NL+
Sbjct: 1 MKRLVCVLLVCSSAVAQ---LHKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLK 57
Query: 77 YIEKANKE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
++ N E G +Y LG N D+T++E +L + ++PS R+ T Y++
Sbjct: 58 FVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLRVPSQWQRNIT-----YKSNPN 112
Query: 134 TDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCST 193
+P S+DWR+K VT +K Q CG CWAFSAV A+E K+ L+ LS Q LVDCST
Sbjct: 113 RILPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCST 172
Query: 194 N--GNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPS 251
GN GC GG M AF+YII N+GI ++ YPY+A+ C K AA S Y E+P
Sbjct: 173 EKYGNKGCNGGFMTTAFQYIIDNKGIDSDASYPYKAMDQKCQYDSKYRAATCSKYTELPY 232
Query: 252 GDEQALLKAVSMQ-PVSIGIAAYTTEFKSYKEGI-FNGVCGTQLDHAVTIVGFGTTEDGA 309
G E L +AV+ + PVS+G+ A F Y+ G+ + C ++H V +VG+G +G
Sbjct: 233 GREDVLKEAVANKGPVSVGVDARHPSFFLYRSGVYYEPSCTQNVNHGVLVVGYGDL-NGK 291
Query: 310 NYWLIKNSWGDTWGDAGYMKILRDEG-LCGIGTQSSYP 346
YWL+KNSWG +G+ GY+++ R++G CGI + SYP
Sbjct: 292 EYWLVKNSWGHNFGEEGYIRMARNKGNHCGIASFPSYP 329
>gi|225719058|gb|ACO15375.1| Cathepsin L1 precursor [Caligus clemensi]
Length = 326
Score = 237 bits (604), Expect = 7e-60, Method: Compositional matrix adjust.
Identities = 132/308 (42%), Positives = 183/308 (59%), Gaps = 16/308 (5%)
Query: 49 KWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKE---GNRTYKLGTNRFSDLTNDEFR 105
KW A HG+ Y E+ +RFKIF+EN I + N+E G TY LG N F DL + EF
Sbjct: 25 KWKATHGKVYNSADEESLRFKIFQENSLMITQHNEEYRQGFHTYILGMNHFGDLLHSEFL 84
Query: 106 ALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSA 165
G++ T T VP+ +W K AVTP+KDQ +CG CWAFSA
Sbjct: 85 ERSNGFQGGVSGGDVFTFDT-------NAPVPSYANWTAKGAVTPVKDQGKCGSCWAFSA 137
Query: 166 VAAVEGITKISGANLIQLSEQQLVDCSTN-GNNGCGGGTMEKAFEYIIQNQGIATEDEYP 224
+VEG + L+ LSEQQLVDCS + GN GCGGG M+ AF+Y I N+GIA E YP
Sbjct: 138 TGSVEGQIFLKKKKLMSLSEQQLVDCSGDEGNLGCGGGLMDNAFKYFIANKGIANEKSYP 197
Query: 225 YQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAVS-MQPVSIGIAAYTTEFKSYKEG 283
Y A C + + A IS++++V DE L AV+ + PVS+ I A +++F+ Y+ G
Sbjct: 198 YTAKDNDCKYKKSMSVATISSFKDVKHKDEDQLKMAVANVGPVSVAIDASSSKFQFYESG 257
Query: 284 I-FNGVCGTQ-LDHAVTIVGFGT-TEDGANYWLIKNSWGDTWGDAGYMKILRD-EGLCGI 339
+ ++ C ++ LDH V VG+GT + G ++WL+KNSW +WG GY+K+ R+ + CGI
Sbjct: 258 VYYDENCSSEVLDHGVLAVGYGTDKKSGMDFWLVKNSWAASWGLNGYIKMARNKDNNCGI 317
Query: 340 GTQSSYPL 347
T +SYP+
Sbjct: 318 ATMASYPI 325
>gi|113120271|gb|ABI30275.1| VS-A [Vasconcellea stipulata]
Length = 318
Score = 237 bits (604), Expect = 7e-60, Method: Compositional matrix adjust.
Identities = 129/287 (44%), Positives = 174/287 (60%), Gaps = 18/287 (6%)
Query: 38 THEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFS 97
T + ++ + + WM ++ + YKD EK RF+IFK+NL+YI++ NK+ N TY LG F+
Sbjct: 39 TSTEKLINLFDSWMVEYDKVYKDIDEKIYRFEIFKDNLKYIDETNKK-NNTYWLGLTSFT 97
Query: 98 DLTNDEFRALYTGYKMPSPSHRSTTSST----FKYQNLSMTDVPTSLDWRDKKAVTPIKD 153
DLTNDEF+ Y G P + STT F Y ++ ++P S+DWR K AVTP+++
Sbjct: 98 DLTNDEFKEKYVG---SIPENWSTTEEPNDKEFIYDDV--VNIPASIDWRQKGAVTPVRN 152
Query: 154 QQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQ 213
Q CG CW FS+VAAVEGI KI L+ LSEQ+L+DC + GC GG A +Y +
Sbjct: 153 QGSCGSCWTFSSVAAVEGINKIVTGQLVSLSEQELLDCERR-SYGCRGGFPPYALQY-VA 210
Query: 214 NQGIATEDEYPYQAVQGTCSAAQ-KAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAA 272
N GI YPY+ VQ C AAQ K K V +EQAL++ +++QPVSI + A
Sbjct: 211 NSGIHLRQYYPYEGVQRQCRAAQAKGPKVKTDGVGRVQRNNEQALIQRIAIQPVSIVVEA 270
Query: 273 YTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWG 319
F++Y+ GIF G CGT +DHAV VG+G Y LIKNSWG
Sbjct: 271 KGRAFQNYRGGIFAGPCGTSIDHAVAAVGYGN-----GYILIKNSWG 312
>gi|403376023|gb|EJY87990.1| Cathepsin L [Oxytricha trifallax]
Length = 343
Score = 237 bits (604), Expect = 7e-60, Method: Compositional matrix adjust.
Identities = 124/302 (41%), Positives = 181/302 (59%), Gaps = 12/302 (3%)
Query: 50 WMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYT 109
++A++G+SY + E + R++ +++N+ + + N + T++LG N+F+D T +E++ L
Sbjct: 46 YLAKYGKSYGTKEEFQFRYEQYQKNMAKVAQYNGQNGNTFRLGINKFTDYTPEEYKVL-L 104
Query: 110 GYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAV 169
GYK S T + LS + P S+DWR+K AVTP+KDQ +CG CWAFSA A+
Sbjct: 105 GYKPQS------KPMTLEASYLSEENTPASIDWREKGAVTPVKDQGQCGSCWAFSATGAL 158
Query: 170 EGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQ 229
EG +IS LI +SEQQLVDCS +GNNGC GG M AF+Y +N+ + E +Y Y A
Sbjct: 159 EGHYQISNNKLISISEQQLVDCSHDGNNGCNGGEMYLAFDYASKNK-MELESDYVYHAKD 217
Query: 230 GTCSAAQKAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGV- 288
CS + +++ VP L A++ PVS+ I A F++Y GI N
Sbjct: 218 EKCSYEASKGKMEADHFQRVPKNSPAQLKAALANGPVSVAIEADNEVFQAYDGGILNSKE 277
Query: 289 CGTQLDHAVTIVGFGTTE-DGANYWLIKNSWGDTWGDAGYMKI--LRDEGLCGIGTQSSY 345
CGT LDH V VGFG E +Y+++KNSWG WGD G++KI + EG+CGI + Y
Sbjct: 278 CGTNLDHGVLAVGFGHDEASKQDYFIVKNSWGQYWGDHGFIKIAAVDGEGICGIQMDAVY 337
Query: 346 PL 347
P+
Sbjct: 338 PI 339
>gi|226503205|ref|NP_001150062.1| thiol protease SEN102 precursor [Zea mays]
gi|195636390|gb|ACG37663.1| thiol protease SEN102 precursor [Zea mays]
Length = 349
Score = 237 bits (604), Expect = 8e-60, Method: Compositional matrix adjust.
Identities = 130/319 (40%), Positives = 190/319 (59%), Gaps = 17/319 (5%)
Query: 43 VVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTND 102
+++ + W A++ R+Y E + RF ++ EN+++IE N+ G+ +Y+LG NRF+DLT +
Sbjct: 33 LLDRFQAWQAEYNRTYATPEEFQQRFMVYSENVKFIETMNQPGS-SYELGENRFADLTEE 91
Query: 103 EFRALYTGYKM----PSPSHRSTTSSTFKYQNLS----MTDVPTSLDWRDKKAVTPIKDQ 154
EF+ Y K+ SP + T T S + P S+DWR K AVTP+K Q
Sbjct: 92 EFKDTYL-MKLDNVASSPEAMALTVDTMNRAGTSGGSNTNEAPNSVDWRTKGAVTPVKSQ 150
Query: 155 QECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTM-EKAFEYIIQ 213
Q CG CWAF+AVA++EG+ KI L+ LSEQ++VDC GNN G A E++ +
Sbjct: 151 QHCGSCWAFAAVASIEGVHKIKTGLLVSLSEQEIVDCDRGGNNHGCHGGHSSSAMEWVTR 210
Query: 214 NQGIATEDEYPYQAVQGTCSAAQKA-AAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAA 272
N G+ TE +YPY QG C + + AAKI + V +E AL AV+ +PV++ I A
Sbjct: 211 NGGLTTESDYPYVGRQGQCMSDKLGHHAAKIRGRQAVQGKNEGALQHAVAGRPVAVSINA 270
Query: 273 YTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILR 332
+ F+ YK GIF+G C T +HAVT+VG+G G YW++KNSWG+ WG+ GY+++ R
Sbjct: 271 -SRAFQFYKRGIFSGPCNTTRNHAVTVVGYGANASGHKYWIVKNSWGERWGEKGYVRMQR 329
Query: 333 ----DEGLCGIGTQSSYPL 347
EG+CGI Y +
Sbjct: 330 GVRAREGVCGIAIAPFYAV 348
>gi|410519429|gb|AFV73398.1| cathepsin L [Haliotis discus hannai]
Length = 326
Score = 237 bits (604), Expect = 8e-60, Method: Compositional matrix adjust.
Identities = 127/302 (42%), Positives = 183/302 (60%), Gaps = 13/302 (4%)
Query: 53 QHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNR---TYKLGTNRFSDLTNDEFRALYT 109
+H + YKD E+ R +F + +EYI++ N E +R ++++G N ++D+ N+EF +
Sbjct: 28 RHNKQYKDNQEEAYRKGVFMKAVEYIQQHNLEADRGVHSFRVGINEYADMPNEEFVRVMN 87
Query: 110 GYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAV 169
GYKM R + N+ D+P ++DWR K VT +K+Q +CG CWAFS+ ++
Sbjct: 88 GYKMQE--QRPKAPTYMPPSNVG--DLPATVDWRTKGYVTEVKNQGQCGSCWAFSSTGSL 143
Query: 170 EGITKISGANLIQLSEQQLVDCST-NGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAV 228
EG T LI LSEQ LVDCST GN GCGGG M++AF YI N GI TE YPY+A
Sbjct: 144 EGQTFKKYNKLISLSEQNLVDCSTEQGNMGCGGGLMDQAFTYIKVNDGIDTETSYPYEAA 203
Query: 229 QGTCSAAQKAAAAKISNYEEVPSGDEQALLKAV-SMQPVSIGIAAYTTEFKSYKEGIFNG 287
G C + A + Y ++ S E L AV ++ P+++ I A F+ YK G+++
Sbjct: 204 SGKCRFNKANVGANDTGYTDIKSKSESDLQSAVATVGPIAVAIDASHMSFQLYKSGVYHY 263
Query: 288 V-CG-TQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD-EGLCGIGTQSS 344
+ C T+LDH V VG+G T+ G +YWL+KNSWG TWG GY+ + R+ + CGI TQ+S
Sbjct: 264 IFCSQTRLDHGVLAVGYG-TDSGKDYWLVKNSWGATWGQQGYIMMSRNRDNNCGIATQAS 322
Query: 345 YP 346
YP
Sbjct: 323 YP 324
>gi|23110962|ref|NP_004070.3| cathepsin S isoform 1 preproprotein [Homo sapiens]
gi|88984046|sp|P25774.3|CATS_HUMAN RecName: Full=Cathepsin S; Flags: Precursor
gi|60816153|gb|AAX36372.1| cathepsin S [synthetic construct]
gi|61358282|gb|AAX41541.1| cathepsin S [synthetic construct]
gi|119573903|gb|EAW53518.1| cathepsin S, isoform CRA_b [Homo sapiens]
gi|119573904|gb|EAW53519.1| cathepsin S, isoform CRA_b [Homo sapiens]
Length = 331
Score = 237 bits (604), Expect = 8e-60, Method: Compositional matrix adjust.
Identities = 130/338 (38%), Positives = 197/338 (58%), Gaps = 18/338 (5%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQSVVEMH-EKWMAQHGRSYKDELEKEMRFKIFKENLE 76
M ++ +L+ C+S V H+ ++ H W +G+ YK++ E+ +R I+++NL+
Sbjct: 1 MKRLVCVLLVCSSAVAQ---LHKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLK 57
Query: 77 YIEKANKE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
++ N E G +Y LG N D+T++E +L + ++PS R+ T Y++
Sbjct: 58 FVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLRVPSQWQRNIT-----YKSNPN 112
Query: 134 TDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCST 193
+P S+DWR+K VT +K Q CG CWAFSAV A+E K+ L+ LS Q LVDCST
Sbjct: 113 RILPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCST 172
Query: 194 N--GNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPS 251
GN GC GG M AF+YII N+GI ++ YPY+A+ C K AA S Y E+P
Sbjct: 173 EKYGNKGCNGGFMTTAFQYIIDNKGIDSDASYPYKAMDQKCQYDSKYRAATCSKYTELPY 232
Query: 252 GDEQALLKAVSMQ-PVSIGIAAYTTEFKSYKEGI-FNGVCGTQLDHAVTIVGFGTTEDGA 309
G E L +AV+ + PVS+G+ A F Y+ G+ + C ++H V +VG+G +G
Sbjct: 233 GREDVLKEAVANKGPVSVGVDARHPSFFLYRSGVYYEPSCTQNVNHGVLVVGYGDL-NGK 291
Query: 310 NYWLIKNSWGDTWGDAGYMKILRDEG-LCGIGTQSSYP 346
YWL+KNSWG +G+ GY+++ R++G CGI + SYP
Sbjct: 292 EYWLVKNSWGHNFGEEGYIRMARNKGNHCGIASFPSYP 329
>gi|403300975|ref|XP_003941187.1| PREDICTED: cathepsin L1-like isoform 1 [Saimiri boliviensis
boliviensis]
gi|403300977|ref|XP_003941188.1| PREDICTED: cathepsin L1-like isoform 2 [Saimiri boliviensis
boliviensis]
gi|403300979|ref|XP_003941189.1| PREDICTED: cathepsin L1-like isoform 3 [Saimiri boliviensis
boliviensis]
Length = 333
Score = 237 bits (604), Expect = 8e-60, Method: Compositional matrix adjust.
Identities = 130/341 (38%), Positives = 198/341 (58%), Gaps = 23/341 (6%)
Query: 17 PMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLE 76
P I+ + AS ++ + E + KW A H R Y E+E R ++++N++
Sbjct: 3 PTLILAAFCLGLASAALTFNHSLEAQWI----KWKAMHNRLYGKN-EEEWRRAVWEKNMK 57
Query: 77 YIEKANKE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
IE N E G ++ + N F D+TN+EFR + G++ P + +Q +
Sbjct: 58 TIELHNHEYNQGKHSFTMAMNTFGDMTNEEFRQVMNGFQNRKPRNGKV------FQEPLL 111
Query: 134 TDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCS- 192
+ P S+DWR+K VTP+K+Q +CG CWAFSA A+EG L+ LSEQ LVDCS
Sbjct: 112 HEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSG 171
Query: 193 TNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSG 252
GN GC GG M+ AF+Y+ +N G+ +E+ YPY+A + +C K + A + + ++P
Sbjct: 172 PQGNQGCNGGLMDYAFQYVQENGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIPK- 230
Query: 253 DEQALLKAV-SMQPVSIGIAAYTTEFKSYKEGI-FNGVCGTQ-LDHAVTIVGFG---TTE 306
E+AL+KAV ++ P+S+ I A F+ YKEGI F C ++ +DH V +VG+G T
Sbjct: 231 LEKALMKAVATVGPISVAIDAGHESFQFYKEGIYFEPECSSEDMDHGVLVVGYGFERTGS 290
Query: 307 DGANYWLIKNSWGDTWGDAGYMKILRD-EGLCGIGTQSSYP 346
D + YWL+KNSWG+ WG GY+K+ +D + CGI + +SYP
Sbjct: 291 DNSKYWLVKNSWGEEWGMDGYIKMAKDRKNHCGIASAASYP 331
>gi|355763133|gb|EHH62119.1| hypothetical protein EGM_20318 [Macaca fascicularis]
Length = 331
Score = 236 bits (603), Expect = 8e-60, Method: Compositional matrix adjust.
Identities = 129/335 (38%), Positives = 195/335 (58%), Gaps = 18/335 (5%)
Query: 21 IIILLVSCASQVVSSRSTHEQSVVEMH-EKWMAQHGRSYKDELEKEMRFKIFKENLEYIE 79
+I +L+ C+S V H+ ++ H W +G+ YK++ E+ +R I+++NL+++
Sbjct: 4 LICVLLVCSSAVAQ---LHKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLKFVM 60
Query: 80 KANKE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDV 136
N E G +Y LG N D+T++E +L + ++PS R+ T Y++ + +
Sbjct: 61 LHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLRVPSQWQRNIT-----YKSNANQIL 115
Query: 137 PTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTN-- 194
P S+DWR+K VT +K Q CG CWAFSAV A+E K+ L+ LS Q LVDCST
Sbjct: 116 PDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKY 175
Query: 195 GNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDE 254
GN GC GG M +AF+YII N GI ++ YPY+A C K AA S Y E+P G E
Sbjct: 176 GNKGCNGGFMTRAFQYIIDNNGIDSDASYPYKATDQKCQYDSKYRAATCSKYTELPYGRE 235
Query: 255 QALLKAVSMQ-PVSIGIAAYTTEFKSYKEGI-FNGVCGTQLDHAVTIVGFGTTEDGANYW 312
L + V+ + PVS+G+ A F Y+ G+ + C ++H V +VG+G +G YW
Sbjct: 236 DVLKEVVANKGPVSVGVDASHPSFFLYRSGVYYEPSCTQNVNHGVLVVGYGVL-NGKEYW 294
Query: 313 LIKNSWGDTWGDAGYMKILRDEG-LCGIGTQSSYP 346
L+KNSWG +G+ GY+++ R++G CGI + SYP
Sbjct: 295 LVKNSWGRNFGEEGYIRMARNKGNHCGIASFPSYP 329
>gi|348546019|ref|XP_003460476.1| PREDICTED: cathepsin L-like [Oreochromis niloticus]
gi|348546143|ref|XP_003460538.1| PREDICTED: cathepsin L-like [Oreochromis niloticus]
Length = 334
Score = 236 bits (603), Expect = 9e-60, Method: Compositional matrix adjust.
Identities = 134/313 (42%), Positives = 187/313 (59%), Gaps = 12/313 (3%)
Query: 44 VEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKAN---KEGNRTYKLGTNRFSDLT 100
+E H W + RSY E+ R +I+ N +++ N +G ++Y+LG F+D+
Sbjct: 24 LEFH-AWKLKFERSYHSPSEEAHRRQIWLNNRKFVLVHNILADQGLKSYRLGMTYFADME 82
Query: 101 NDEF-RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGC 159
N+E+ R + G + STF ++ TD+P ++DWRDK VT +KDQ++CG
Sbjct: 83 NEEYKRVISQGCLHSFNASLPRRGSTF-FRLPEGTDLPDAVDWRDKGYVTDVKDQKQCGS 141
Query: 160 CWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTN-GNNGCGGGTMEKAFEYIIQNQGIA 218
CWAFSA ++EG L+ LSEQQLVDCS + GN GC GG M+ AF+YI N GI
Sbjct: 142 CWAFSATGSLEGQHFRKTGTLVSLSEQQLVDCSGDYGNMGCMGGLMDYAFQYIQANGGID 201
Query: 219 TEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAV-SMQPVSIGIAAYTTEF 277
TE+ YPY+A G C A + Y EV GDE AL +AV ++ P+S+GI A F
Sbjct: 202 TEESYPYEAENGKCRYNPDNIGATSTGYTEVSQGDEDALKEAVATIGPISVGIDASQMSF 261
Query: 278 KSYKEGIFN--GVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRDE- 334
+ Y+ G++N +LDH V VG+G TEDG +YWL+KNSWG WGD GY+K+ R++
Sbjct: 262 QFYESGVYNEPDCSSLELDHGVLAVGYG-TEDGNDYWLVKNSWGLEWGDKGYIKMSRNKS 320
Query: 335 GLCGIGTQSSYPL 347
CGI T +SYPL
Sbjct: 321 NQCGIATAASYPL 333
>gi|17062058|gb|AAL34984.1|AF320565_1 cathepsine L-like cysteine protease [Rhodnius prolixus]
Length = 316
Score = 236 bits (603), Expect = 9e-60, Method: Compositional matrix adjust.
Identities = 125/311 (40%), Positives = 187/311 (60%), Gaps = 17/311 (5%)
Query: 48 EKWMA---QHGRSYKDELEKEMRFKIFKENLEYIEKANKE---GNRTYKLGTNRFSDLTN 101
++W+A HG++Y+++ E+ R K+F +N + I++ N + G +YK+ N DL
Sbjct: 11 QEWLAFKAMHGKNYRNQFEEIFRMKVFIDNKKKIDEHNAKYELGEASYKMKMNHLGDLMV 70
Query: 102 DEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCW 161
EF+AL G+K + R+ +NL P S+DWR + AVTP+KDQ CG CW
Sbjct: 71 HEFKALMNGFKKTPNAERNGKIYVPSNENL-----PKSVDWRQRGAVTPVKDQGHCGSCW 125
Query: 162 AFSAVAAVEGITKISGANLIQLSEQQLVDCS-TNGNNGCGGGTMEKAFEYIIQNQGIATE 220
+FSA ++EG + L+ LSEQ LVDCS T GN+GC GG M +AF+Y+ N+GI TE
Sbjct: 126 SFSATGSLEGQLFLKTGRLVSLSEQNLVDCSKTYGNSGCEGGLMNQAFQYVRDNKGIDTE 185
Query: 221 DEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAV-SMQPVSIGIAAYTTEFKS 279
YPY+A + C + Y ++ E+ L AV ++ P+S+ I A F+
Sbjct: 186 ASYPYEARENNCRFKEDKVGGTDKGYVDILEASEKDLQSAVATVGPISVRIDASHESFQF 245
Query: 280 YKEGIFN-GVCG-TQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD-EGL 336
Y EG++ C +QLDH V VG+G TE+G +YWL+KNSWG +WG++GY+KI R+ +
Sbjct: 246 YSEGVYKEQYCSPSQLDHGVLTVGYG-TENGQDYWLVKNSWGPSWGESGYIKIARNHKNH 304
Query: 337 CGIGTQSSYPL 347
CGI + +SYP+
Sbjct: 305 CGIASMASYPV 315
>gi|213512938|ref|NP_001133871.1| Cathepsin K precursor [Salmo salar]
gi|209155648|gb|ACI34056.1| Cathepsin K precursor [Salmo salar]
gi|223647252|gb|ACN10384.1| Cathepsin K precursor [Salmo salar]
gi|223673129|gb|ACN12746.1| Cathepsin K precursor [Salmo salar]
Length = 331
Score = 236 bits (603), Expect = 9e-60, Method: Compositional matrix adjust.
Identities = 130/335 (38%), Positives = 189/335 (56%), Gaps = 15/335 (4%)
Query: 23 ILLVSCASQVVSSRSTH---EQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIE 79
+LL C + S H E S+ + W H R Y E+ +R I+++N+ IE
Sbjct: 1 MLLCGCVLLFLGSVLAHPLNEMSLDAQWDSWKTTHLREYNGLGEEVIRRTIWEKNMRLIE 60
Query: 80 KANKE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDV 136
N+E G +Y+LG N D+T++E TG ++P RS T + ++ +
Sbjct: 61 AHNEEAALGIHSYELGMNHLGDMTSEEIAEKLTGLQVPMNRDRSNTW----IPDNNVVKI 116
Query: 137 PTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGN 196
P S+D+R K VTP+K+Q CG CWAFS+ A+EG + LI LS Q LVDC T N
Sbjct: 117 PRSIDYRKKGMVTPVKNQLSCGSCWAFSSAGALEGQLAKTTGKLIDLSPQNLVDCVTE-N 175
Query: 197 NGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQA 256
NGCGGG M AFEY+ +N GI TE+ YPY G C+ A+ ++E+P GDE A
Sbjct: 176 NGCGGGYMTNAFEYVEENGGIDTEEAYPYLGQDGQCAYNASGMGAQCRGFKEIPEGDEWA 235
Query: 257 LLKA-VSMQPVSIGIAAYTTEFKSYKEGI-FNGVCGT-QLDHAVTIVGFGTTEDGANYWL 313
L KA V + PV++GI A + F+ Y+ G+ ++ C ++HAV VG+G T G +W+
Sbjct: 236 LTKAVVKVGPVAVGIDATLSTFQFYQRGVYYDPNCNKDDINHAVLAVGYGQTAKGMKFWI 295
Query: 314 IKNSWGDTWGDAGYMKILRDEG-LCGIGTQSSYPL 347
+KNSW ++WG GY+ + R+ G CGI +SYP+
Sbjct: 296 VKNSWSESWGKQGYIMMARNRGNACGIANLASYPI 330
>gi|281203744|gb|EFA77940.1| hypothetical protein PPL_08585 [Polysphondylium pallidum PN500]
Length = 505
Score = 236 bits (603), Expect = 9e-60, Method: Compositional matrix adjust.
Identities = 138/361 (38%), Positives = 201/361 (55%), Gaps = 39/361 (10%)
Query: 19 FIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYI 78
+I I+LL+ + ++ E+ E W+ + + Y D E + RF IFK N++++
Sbjct: 153 YINILLLIFGLIAISNALLFSEEQYKNEFENWIDRFEKKY-DVSEFKKRFSIFKSNMDFV 211
Query: 79 EKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRST---TSSTFKYQNL-SMT 134
N + ++T LG N +DLTN E+R Y G +H+ T + NL S+
Sbjct: 212 HSWNSKNSQTV-LGLNHLADLTNLEYRQFYLG------THKKAVLGTPGNHEVSNLQSVF 264
Query: 135 DVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTN 194
++DWR K AV+PIKDQ +CG CW+FS +VEG +I N+++LSEQ LVDCST+
Sbjct: 265 GDSATVDWRQKGAVSPIKDQGQCGSCWSFSTTGSVEGAHQIKSGNMVELSEQNLVDCSTS 324
Query: 195 -GNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQG-TCSAAQKAAAAKISNYEEVPSG 252
GN GC GG M+ AFEYII N GI TE YPY A G TC + + A IS+Y+ + +G
Sbjct: 325 EGNMGCNGGLMDYAFEYIITNNGIDTESSYPYTASSGTTCKYNKANSGATISSYKNITAG 384
Query: 253 DEQALLKAV-SMQPVSIGIAAYTTEFKSYKEGI-FNGVCGT-QLDHAVTIVGFGT----- 304
E L AV + PVS+ I A F+ Y GI ++ C + LDH V +VG+G+
Sbjct: 385 SESDLADAVKNAGPVSVAIDASHNSFQLYSHGIYYDASCSSVNLDHGVLVVGYGSGTPDS 444
Query: 305 ----------------TEDGANYWLIKNSWGDTWGDAGYMKILRD-EGLCGIGTQSSYPL 347
T+D NYW++KNSWG +WGD G++ + +D + CGI + +SYP+
Sbjct: 445 DSRVHKGSQVRVKVPKTDDTKNYWIVKNSWGTSWGDKGFIYMSKDRDNNCGIASCASYPI 504
Query: 348 A 348
Sbjct: 505 V 505
>gi|179957|gb|AAC37592.1| cathepsin S [Homo sapiens]
Length = 331
Score = 236 bits (603), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 130/338 (38%), Positives = 197/338 (58%), Gaps = 18/338 (5%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQSVVEMH-EKWMAQHGRSYKDELEKEMRFKIFKENLE 76
M ++ +L+ C+S V H+ ++ H W +G+ YK++ E+ +R I+++NL+
Sbjct: 1 MKRLVCVLLVCSSAVAQ---LHKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLK 57
Query: 77 YIEKANKE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
++ N E G +Y LG N D+T++E +L + ++PS R+ T Y++
Sbjct: 58 FVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLRVPSQWQRNIT-----YKSNPN 112
Query: 134 TDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCST 193
+P S+DWR+K VT +K Q CG CWAFSAV A+E K+ L+ LS Q LVDCST
Sbjct: 113 RILPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCST 172
Query: 194 N--GNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPS 251
GN GC GG M AF+YII N+GI ++ YPY+A+ C K AA S Y E+P
Sbjct: 173 EKYGNKGCNGGFMTTAFQYIIDNKGIDSDASYPYKAMDLKCQYDSKYRAATCSKYTELPY 232
Query: 252 GDEQALLKAVSMQ-PVSIGIAAYTTEFKSYKEGI-FNGVCGTQLDHAVTIVGFGTTEDGA 309
G E L +AV+ + PVS+G+ A F Y+ G+ + C ++H V +VG+G +G
Sbjct: 233 GREDVLKEAVANKGPVSVGVDARHPSFFLYRSGVYYEPSCTQNVNHGVLVVGYGDL-NGK 291
Query: 310 NYWLIKNSWGDTWGDAGYMKILRDEG-LCGIGTQSSYP 346
YWL+KNSWG +G+ GY+++ R++G CGI + SYP
Sbjct: 292 EYWLVKNSWGHNFGEEGYIRMARNKGNHCGIASFPSYP 329
>gi|114559418|ref|XP_001171268.1| PREDICTED: cathepsin S isoform 3 [Pan troglodytes]
gi|397492866|ref|XP_003817341.1| PREDICTED: cathepsin S isoform 1 [Pan paniscus]
gi|410225070|gb|JAA09754.1| cathepsin S [Pan troglodytes]
gi|410251608|gb|JAA13771.1| cathepsin S [Pan troglodytes]
gi|410328325|gb|JAA33109.1| cathepsin S [Pan troglodytes]
gi|410328327|gb|JAA33110.1| cathepsin S [Pan troglodytes]
Length = 331
Score = 236 bits (603), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 130/338 (38%), Positives = 196/338 (57%), Gaps = 18/338 (5%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQSVVEMH-EKWMAQHGRSYKDELEKEMRFKIFKENLE 76
M ++ +L+ C+S V H+ ++ H W +G+ YK++ E+ +R I+++NL+
Sbjct: 1 MKRLVCVLLVCSSAVAQ---LHKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLK 57
Query: 77 YIEKANKE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
++ N E G +Y LG N D+T++E +L + ++PS R+ T Y++
Sbjct: 58 FVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLRVPSQWQRNIT-----YKSNPN 112
Query: 134 TDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCST 193
+P S+DWR+K VT +K Q CG CWAFSAV A+E K+ L+ LS Q LVDCST
Sbjct: 113 QILPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCST 172
Query: 194 N--GNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPS 251
GN GC GG M AF+YII N+GI ++ YPY+A C K AA S Y E+P
Sbjct: 173 EKYGNKGCNGGFMTTAFQYIIDNKGIDSDASYPYKATDQKCQYDSKYRAATCSKYTELPY 232
Query: 252 GDEQALLKAVSMQ-PVSIGIAAYTTEFKSYKEGI-FNGVCGTQLDHAVTIVGFGTTEDGA 309
G E L +AV+ + PVS+G+ A F Y+ G+ + C ++H V +VG+G +G
Sbjct: 233 GREDVLKEAVANKGPVSVGVDALHPSFFLYRSGVYYEPSCTQNVNHGVLVVGYGDL-NGK 291
Query: 310 NYWLIKNSWGDTWGDAGYMKILRDEG-LCGIGTQSSYP 346
YWL+KNSWG +G+ GY+++ R++G CGI + SYP
Sbjct: 292 EYWLVKNSWGHNFGEEGYIRMARNKGNHCGIASFPSYP 329
>gi|405966499|gb|EKC31777.1| Cathepsin L [Crassostrea gigas]
Length = 331
Score = 236 bits (603), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 127/301 (42%), Positives = 184/301 (61%), Gaps = 14/301 (4%)
Query: 54 HGRSYKDELEKEMRFKIFKENLEYIEKAN---KEGNRTYKLGTNRFSDLTNDEFRALYTG 110
H ++Y + E++MR I+++N+ YI+K N G TY LG N ++D+T EFRA+ G
Sbjct: 35 HKKTYSQD-EEQMRRLIWEDNVNYIQKHNLAADRGEHTYWLGQNEYADMTIFEFRAIMNG 93
Query: 111 YKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVE 170
YKM + + T ++ D+P S+DWR + VT IK+Q CG CW+FSA ++E
Sbjct: 94 YKMSA----NRTKGDLYMSPSNIGDLPDSVDWRKEGYVTDIKNQGHCGSCWSFSATGSLE 149
Query: 171 GITKISGANLIQLSEQQLVDCSTN-GNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQ 229
G + L+ LSEQ LVDCS GN+GC GG M+ AF YI N+GI TE+ YPY A
Sbjct: 150 GQHFKASKKLVSLSEQNLVDCSKKEGNHGCQGGLMDNAFRYIESNKGIDTEESYPYTAKN 209
Query: 230 GTCSAAQKAAAAKISNYEEVPSGDEQALLKAV-SMQPVSIGIAAYTTEFKSYKEGIFN-- 286
G C + A + Y ++P E L +AV ++ P+S+GI A F+ Y+EG+++
Sbjct: 210 GFCHFKAENVGATDTGYVDIPHMQEDKLQEAVATVGPISVGIDAGHKSFQLYREGVYSEP 269
Query: 287 GVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRDE-GLCGIGTQSSY 345
++LDH V VG+G TE G +YWL+KNSWG +WG GY+ + R++ +CGI TQ+SY
Sbjct: 270 ACSSSKLDHGVLAVGYG-TESGDDYWLVKNSWGTSWGMQGYVMMARNKHNMCGIATQASY 328
Query: 346 P 346
P
Sbjct: 329 P 329
>gi|402856105|ref|XP_003892640.1| PREDICTED: cathepsin S isoform 1 [Papio anubis]
Length = 331
Score = 236 bits (602), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 130/338 (38%), Positives = 194/338 (57%), Gaps = 18/338 (5%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQSVVEMH-EKWMAQHGRSYKDELEKEMRFKIFKENLE 76
M ++ +L+ C+S V H+ ++ H W +G+ YK++ E+ +R I+++NL+
Sbjct: 1 MKRLVCVLLVCSSAVAQ---LHKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLK 57
Query: 77 YIEKANKE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
++ N E G +Y LG N D+T++E +L + ++PS R+ T + Q L
Sbjct: 58 FVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLRVPSQWQRNITYKSNPNQML-- 115
Query: 134 TDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCST 193
P S+DWR+K VT +K Q CG CWAFSAV A+E K+ L+ LS Q LVDCST
Sbjct: 116 ---PDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCST 172
Query: 194 N--GNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPS 251
GN GC GG M +AF+YII N GI ++ YPY+A C K AA S Y E+P
Sbjct: 173 EKYGNKGCNGGFMTRAFQYIIDNNGIDSDASYPYKATDQKCQYDSKYRAATCSKYTELPY 232
Query: 252 GDEQALLKAVSMQ-PVSIGIAAYTTEFKSYKEGI-FNGVCGTQLDHAVTIVGFGTTEDGA 309
G E L + V+ + PVS+G+ A F Y+ G+ + C ++H V +VG+G +G
Sbjct: 233 GREDVLKEVVANKGPVSVGVDASHPSFFLYRSGVYYEPSCTQNVNHGVLVVGYGVL-NGK 291
Query: 310 NYWLIKNSWGDTWGDAGYMKILRDEG-LCGIGTQSSYP 346
YWL+KNSWG +G+ GY+++ R++G CGI + SYP
Sbjct: 292 EYWLVKNSWGRNFGEEGYIRMARNKGNHCGIASFPSYP 329
>gi|431896621|gb|ELK06033.1| Cathepsin S [Pteropus alecto]
Length = 331
Score = 236 bits (602), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 132/337 (39%), Positives = 194/337 (57%), Gaps = 16/337 (4%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEY 77
M + +L+ C++ V ++ + ++ + W + + Y++++E+ R I+++NL++
Sbjct: 1 MKWLACVLLGCSAAV--AQLQRDPTLDRHWDLWKKTYSKHYREKIEEVARRLIWEKNLKF 58
Query: 78 IEKANKE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT 134
+ N E G +Y LG N D+T++E +L +PS R+ T + Q L
Sbjct: 59 VMLHNLEHSMGMHSYDLGMNHLGDMTSEEVISLMGSLTVPSQWQRNVTYKSNPNQKL--- 115
Query: 135 DVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTN 194
P SLDWRDK VT +K Q CG CWAFSAV A+E K+ L+ LS Q LVDCST
Sbjct: 116 --PDSLDWRDKGCVTEVKYQGSCGSCWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTE 173
Query: 195 --GNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSG 252
N GC GG M AF+YII N GI +E YPY+A G C K AA S Y E+P G
Sbjct: 174 KYSNKGCNGGFMTSAFQYIIDNNGIDSEASYPYKAQDGKCQYDSKFRAATCSKYTELPFG 233
Query: 253 DEQALLKAVSMQ-PVSIGIAAYTTEFKSYKEGI-FNGVCGTQLDHAVTIVGFGTTEDGAN 310
E+AL +AV+ + PVS+ I A F Y+ G+ ++ C +++H V +VG+G DG +
Sbjct: 234 SEEALKEAVANKGPVSVAIDASHPSFFLYRSGVYYDQSCTLKVNHGVLVVGYGNL-DGKD 292
Query: 311 YWLIKNSWGDTWGDAGYMKILRDEG-LCGIGTQSSYP 346
YWL+KNSWG +GD GY+++ R+ G CGI + SYP
Sbjct: 293 YWLVKNSWGLNFGDKGYIRMARNSGNHCGIASYPSYP 329
>gi|403302730|ref|XP_003942006.1| PREDICTED: cathepsin S isoform 1 [Saimiri boliviensis boliviensis]
Length = 339
Score = 236 bits (602), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 129/340 (37%), Positives = 197/340 (57%), Gaps = 17/340 (5%)
Query: 15 TIPMFIIIILLVSCASQVVSSRSTHEQSVVEMH-EKWMAQHGRSYKDELEKEMRFKIFKE 73
+I M ++ +L C+S V H+ ++ H W +G+ YK++ E+ +R I+++
Sbjct: 7 SITMKQLVCVLFVCSSAVTQ---LHKDPTLDHHWNLWKKTYGKQYKEKNEEAVRRLIWEK 63
Query: 74 NLEYIEKANKE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQN 130
NL+++ N E G +Y LG N D+T++E +L + ++P+ R+ T + Q
Sbjct: 64 NLKFVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLRVPNQWQRNITYKSNPNQM 123
Query: 131 LSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVD 190
L P S+DWR+K VT +K Q CG CWAFSAV A+E K+ L+ LS Q LVD
Sbjct: 124 L-----PDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVD 178
Query: 191 CSTN-GNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEV 249
CS GN GC GG M +AF+YII N+GI +E YPY+A C K AA S Y E+
Sbjct: 179 CSEKYGNKGCNGGFMTEAFQYIIDNKGIDSEASYPYKATDQKCQYDSKYRAATCSKYTEL 238
Query: 250 PSGDEQALLKAVSMQ-PVSIGIAAYTTEFKSYKEGI-FNGVCGTQLDHAVTIVGFGTTED 307
P G E L +AV+ + PV +G+ A F Y+ G+ ++ C +++H V ++G+G +
Sbjct: 239 PYGREDVLKEAVANKGPVCVGVDASHPSFFLYRSGVYYDPACTQKVNHGVLVIGYGDL-N 297
Query: 308 GANYWLIKNSWGDTWGDAGYMKILRDEG-LCGIGTQSSYP 346
G YWL+KNSWG +G+ GY+++ R++G CGI + SYP
Sbjct: 298 GKEYWLVKNSWGSNFGEQGYIRMARNKGNHCGIASYPSYP 337
>gi|355558399|gb|EHH15179.1| hypothetical protein EGK_01236 [Macaca mulatta]
gi|380809986|gb|AFE76868.1| cathepsin S isoform 1 preproprotein [Macaca mulatta]
gi|383416071|gb|AFH31249.1| cathepsin S isoform 1 preproprotein [Macaca mulatta]
gi|383416073|gb|AFH31250.1| cathepsin S isoform 1 preproprotein [Macaca mulatta]
gi|383416075|gb|AFH31251.1| cathepsin S isoform 1 preproprotein [Macaca mulatta]
gi|383416077|gb|AFH31252.1| cathepsin S isoform 1 preproprotein [Macaca mulatta]
gi|383416079|gb|AFH31253.1| cathepsin S isoform 1 preproprotein [Macaca mulatta]
Length = 331
Score = 236 bits (602), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 128/335 (38%), Positives = 195/335 (58%), Gaps = 18/335 (5%)
Query: 21 IIILLVSCASQVVSSRSTHEQSVVEMH-EKWMAQHGRSYKDELEKEMRFKIFKENLEYIE 79
++ +L+ C+S V H+ ++ H W +G+ YK++ E+ +R I+++NL+++
Sbjct: 4 LVCVLLVCSSAVAQ---LHKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLKFVM 60
Query: 80 KANKE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDV 136
N E G +Y LG N D+T++E +L + ++PS R+ T Y++ + +
Sbjct: 61 LHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLRVPSQWQRNIT-----YKSNANQIL 115
Query: 137 PTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTN-- 194
P S+DWR+K VT +K Q CG CWAFSAV A+E K+ L+ LS Q LVDCST
Sbjct: 116 PDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKY 175
Query: 195 GNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDE 254
GN GC GG M +AF+YII N GI ++ YPY+A C K AA S Y E+P G E
Sbjct: 176 GNKGCNGGFMTRAFQYIIDNNGIDSDASYPYKATDQKCQYDSKYRAATCSKYTELPYGRE 235
Query: 255 QALLKAVSMQ-PVSIGIAAYTTEFKSYKEGI-FNGVCGTQLDHAVTIVGFGTTEDGANYW 312
L + V+ + PVS+G+ A F Y+ G+ + C ++H V +VG+G +G YW
Sbjct: 236 DVLKEVVANKGPVSVGVDASHPSFFLYRSGVYYEPSCTQNVNHGVLVVGYGVL-NGKEYW 294
Query: 313 LIKNSWGDTWGDAGYMKILRDEG-LCGIGTQSSYP 346
L+KNSWG +G+ GY+++ R++G CGI + SYP
Sbjct: 295 LVKNSWGRNFGEEGYIRMARNKGNHCGIASFPSYP 329
>gi|179959|gb|AAA35655.1| cathepsin [Homo sapiens]
gi|248406|gb|AAB22005.1| cathepsin S [Homo sapiens]
Length = 331
Score = 236 bits (602), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 130/338 (38%), Positives = 197/338 (58%), Gaps = 18/338 (5%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQSVVEMH-EKWMAQHGRSYKDELEKEMRFKIFKENLE 76
M ++ +L+ C+S V H+ ++ H W +G+ YK++ E+ +R I+++NL+
Sbjct: 1 MKRLVCVLLVCSSAVAQ---LHKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLK 57
Query: 77 YIEKANKE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
++ N E G +Y LG N D+T++E +L + ++PS R+ T Y++
Sbjct: 58 FVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLTSSLRVPSQWQRNIT-----YKSNPN 112
Query: 134 TDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCST 193
+P S+DWR+K VT +K Q CG CWAFSAV A+E K+ L+ LS Q LVDCST
Sbjct: 113 RILPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVTLSAQNLVDCST 172
Query: 194 N--GNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPS 251
GN GC GG M AF+YII N+GI ++ YPY+A+ C K AA S Y E+P
Sbjct: 173 EKYGNKGCNGGFMTTAFQYIIDNKGIDSDASYPYKAMDQKCQYDSKYRAATCSKYTELPY 232
Query: 252 GDEQALLKAVSMQ-PVSIGIAAYTTEFKSYKEGI-FNGVCGTQLDHAVTIVGFGTTEDGA 309
G E L +AV+ + PVS+G+ A F Y+ G+ + C ++H V +VG+G +G
Sbjct: 233 GREDVLKEAVANKGPVSVGVDARHPSFFLYRSGVYYEPSCTQNVNHGVLVVGYGDL-NGK 291
Query: 310 NYWLIKNSWGDTWGDAGYMKILRDEG-LCGIGTQSSYP 346
YWL+KNSWG +G+ GY+++ R++G CGI + SYP
Sbjct: 292 EYWLVKNSWGHNFGEEGYIRMARNKGNHCGIASFPSYP 329
>gi|12803615|gb|AAH02642.1| Cathepsin S [Homo sapiens]
gi|49456313|emb|CAG46477.1| CTSS [Homo sapiens]
gi|60821573|gb|AAX36579.1| cathepsin S [synthetic construct]
gi|189069420|dbj|BAG37086.1| unnamed protein product [Homo sapiens]
gi|261858586|dbj|BAI45815.1| cathepsin S [synthetic construct]
Length = 331
Score = 236 bits (602), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 130/338 (38%), Positives = 197/338 (58%), Gaps = 18/338 (5%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQSVVEMH-EKWMAQHGRSYKDELEKEMRFKIFKENLE 76
M ++ +L+ C+S V H+ ++ H W +G+ YK++ E+ +R I+++NL+
Sbjct: 1 MKRLVCVLLVCSSAVAQ---LHKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLK 57
Query: 77 YIEKANKE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
++ N E G +Y LG N D+T++E +L + ++PS R+ T Y++
Sbjct: 58 FVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLRVPSQWQRNIT-----YKSNPN 112
Query: 134 TDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCST 193
+P S+DWR+K VT +K Q CG CWAFSAV A+E K+ L+ LS Q LVDCST
Sbjct: 113 WILPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCST 172
Query: 194 N--GNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPS 251
GN GC GG M AF+YII N+GI ++ YPY+A+ C K AA S Y E+P
Sbjct: 173 EKYGNKGCNGGFMTTAFQYIIDNKGIDSDASYPYKAMDQKCQYDSKYRAATCSKYTELPY 232
Query: 252 GDEQALLKAVSMQ-PVSIGIAAYTTEFKSYKEGI-FNGVCGTQLDHAVTIVGFGTTEDGA 309
G E L +AV+ + PVS+G+ A F Y+ G+ + C ++H V +VG+G +G
Sbjct: 233 GREDVLKEAVANKGPVSVGVDARHPSFFLYRSGVYYEPSCTQNVNHGVLVVGYGDL-NGK 291
Query: 310 NYWLIKNSWGDTWGDAGYMKILRDEG-LCGIGTQSSYP 346
YWL+KNSWG +G+ GY+++ R++G CGI + SYP
Sbjct: 292 EYWLVKNSWGHNFGEEGYIRMARNKGNHCGIASFPSYP 329
>gi|219884655|gb|ACL52702.1| unknown [Zea mays]
gi|413916718|gb|AFW56650.1| thiol protease SEN102 [Zea mays]
Length = 349
Score = 236 bits (601), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 129/319 (40%), Positives = 190/319 (59%), Gaps = 17/319 (5%)
Query: 43 VVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTND 102
+++ + W A++ R+Y E + RF ++ EN+++IE N+ G+ +Y+LG N+F+DLT +
Sbjct: 33 LLDRFQAWQAEYNRTYATPEEFQQRFMVYSENVKFIETMNQPGS-SYELGENQFADLTEE 91
Query: 103 EFRALYTGYKM----PSPSHRSTTSSTFKYQNLS----MTDVPTSLDWRDKKAVTPIKDQ 154
EF+ Y K+ SP + T T S + P S+DWR K AVTP+K Q
Sbjct: 92 EFKDTYL-MKLDNVASSPEAMALTVDTMNRAGTSGGSNTNEAPNSVDWRTKGAVTPVKSQ 150
Query: 155 QECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTM-EKAFEYIIQ 213
Q CG CWAF+AVA++EG+ KI L+ LSEQ++VDC GNN G A E++ +
Sbjct: 151 QHCGSCWAFAAVASIEGVHKIKTGRLVSLSEQEIVDCDRGGNNHGCHGGHSSSAMEWVTR 210
Query: 214 NQGIATEDEYPYQAVQGTCSAAQKA-AAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAA 272
N G+ TE +YPY QG C + + AAKI + V +E AL AV+ +PV++ I A
Sbjct: 211 NGGLTTESDYPYVGRQGQCMSDKLGHHAAKIRGRQAVQGKNEGALQHAVAGRPVAVSINA 270
Query: 273 YTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILR 332
+ F+ YK GIF+G C T +HAVT+VG+G G YW++KNSWG+ WG+ GY+++ R
Sbjct: 271 -SRAFQFYKRGIFSGPCNTTRNHAVTVVGYGANASGHKYWIVKNSWGERWGEKGYVRMQR 329
Query: 333 ----DEGLCGIGTQSSYPL 347
EG+CGI Y +
Sbjct: 330 GVRAREGVCGIAIAPFYAV 348
>gi|340381055|ref|XP_003389037.1| PREDICTED: cathepsin L1-like [Amphimedon queenslandica]
Length = 329
Score = 236 bits (601), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 132/336 (39%), Positives = 192/336 (57%), Gaps = 24/336 (7%)
Query: 24 LLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKEN----LEYIE 79
LL++ A+ +V + + + E+ W +G+ Y E E+ R I++ N LE+
Sbjct: 3 LLIAVAALIVCATAFEYTAEWEL---WKRTNGKDYSSEKEELYRQTIWEANKKIVLEHNA 59
Query: 80 KANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTS 139
A+K G + L N F+DL + EF A+Y GY+ + +T +Y + +P +
Sbjct: 60 NADKWG---WTLEMNAFADLESSEFAAMYNGYRRSARKSNAT-----RYHVPTGNALPDT 111
Query: 140 LDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTN-GNNG 198
+DWR K AVTP+K+Q++CG CWAFS ++EG T + L LSEQQLVDCS GN+G
Sbjct: 112 VDWRTKGAVTPVKNQKQCGSCWAFSTTGSLEGQTFLKKGTLPSLSEQQLVDCSDKYGNHG 171
Query: 199 CGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALL 258
C GG M+ AF+YI N GI +E YPY+A G C Q A AA + Y+++P D L
Sbjct: 172 CQGGLMDNAFKYIEANGGIDSEASYPYEAKNGKCRFQQSAVAATCTGYKDIPHDDIDGLQ 231
Query: 259 KAVS-MQPVSIGIAAYTTEFKSYKEGIFNGVC--GTQLDHAVTIVGFGTTEDG-----AN 310
AV+ + P+S+ + A + F+ Y G+++ + T+LDH V VG+GT G
Sbjct: 232 DAVANVGPISVAMDASHSSFQLYAAGVYDPLLCSSTRLDHGVLAVGYGTEPSGLFHEEKP 291
Query: 311 YWLIKNSWGDTWGDAGYMKILRDEGLCGIGTQSSYP 346
YWL+KNSWG WG GY KI+R + CGI T +SYP
Sbjct: 292 YWLVKNSWGPDWGQQGYFKIVRKDNKCGIATDASYP 327
>gi|306992173|gb|ADN19567.1| cathepsin L-like proteinase [Spodoptera frugiperda]
Length = 344
Score = 236 bits (601), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 128/323 (39%), Positives = 183/323 (56%), Gaps = 23/323 (7%)
Query: 46 MHEKWMA---QHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNR---TYKLGTNRFSDL 99
+ E+W A +H + Y E+E + R KI+ EN I K N+ + +YKL N+++D+
Sbjct: 23 VREEWNAFKMEHSKQYDSEVEDKFRMKIYVENKHRIAKHNQRFEQRLVSYKLKPNKYADM 82
Query: 100 TNDEFRALYTGY----------KMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVT 149
+ EF G+ K R ++TF + P +DWR K AVT
Sbjct: 83 LHHEFVHTMNGFNKTAKHGGRNKAVHSKGRDGRAATFIAP--AHVSYPDHVDWRKKGAVT 140
Query: 150 PIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTN-GNNGCGGGTMEKAF 208
+KDQ +CG CWAFS A+EG L+ LSEQ LVDCS GNNGC GG M+ AF
Sbjct: 141 DVKDQGKCGSCWAFSTTGALEGQHFRKTGYLVSLSEQNLVDCSAAYGNNGCNGGLMDNAF 200
Query: 209 EYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAV-SMQPVS 267
+YI N GI TE YPY+AV C K + A + ++P GDE+ L++AV ++ P+S
Sbjct: 201 KYIKDNGGIDTEKSYPYEAVDDKCRYNPKNSGADDVGFVDIPQGDEEKLMQAVATVGPIS 260
Query: 268 IGIAAYTTEFKSYKEGIF--NGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDA 325
+ I A F+ Y +G++ T LDH V +VG+GT E+G +YWL+KNSWG +WG+
Sbjct: 261 VAIDASQETFQFYSKGVYYDENCSSTDLDHGVMVVGYGTEEEGGDYWLVKNSWGRSWGEL 320
Query: 326 GYMKILRDE-GLCGIGTQSSYPL 347
GY+K+ ++ CGI + +SYPL
Sbjct: 321 GYIKMAHNKNNHCGIASSASYPL 343
>gi|118125|sp|P25784.1|CYSP3_HOMAM RecName: Full=Digestive cysteine proteinase 3; Flags: Precursor
Length = 321
Score = 236 bits (601), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 134/307 (43%), Positives = 184/307 (59%), Gaps = 16/307 (5%)
Query: 48 EKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKE---GNRTYKLGTNRFSDLTNDEF 104
+ + Q+GR Y D E+ R ++F++N + IE NK+ G T+K+ N+F D+TN+EF
Sbjct: 21 DHFKTQYGRKYGDAKEELYRQRVFQQNEQLIEDFNKKFENGEVTFKVAMNQFGDMTNEEF 80
Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFS 164
A+ GYK S R + F + M +DWR K VTP+KDQ++CG CWAFS
Sbjct: 81 NAVMKGYKKGS---RGEPKAVFTAEAGPMA---ADVDWRTKALVTPVKDQEQCGSCWAFS 134
Query: 165 AVAAVEGITKISGANLIQLSEQQLVDCSTN-GNNGCGGGTMEKAFEYIIQNQGIATEDEY 223
A A+EG + L+ LSEQQLVDCST+ GN+GCGGG M AF+YI N GI TE Y
Sbjct: 135 ATGALEGQHFLKNDELVSLSEQQLVDCSTDYGNDGCGGGWMTSAFDYIKDNGGIDTESSY 194
Query: 224 PYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAVS-MQPVSIGIAAYTTEFKSYKE 282
PY+A +C + A + EV E+AL +AVS + P+S+ I A F+ Y
Sbjct: 195 PYEAEDRSCRFDANSIGAICTGSVEVQH-TEEALQEAVSGVGPISVAIDASHFSFQFYSS 253
Query: 283 GIF--NGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD-EGLCGI 339
G++ T LDH V VG+G TE +YWL+KNSWG +WGDAGY+K+ R+ + CGI
Sbjct: 254 GVYYEQNCSPTFLDHGVLAVGYG-TESTKDYWLVKNSWGSSWGDAGYIKMSRNRDNNCGI 312
Query: 340 GTQSSYP 346
++ SYP
Sbjct: 313 ASEPSYP 319
>gi|23306947|dbj|BAC16538.1| cathepsin L [Engraulis japonicus]
Length = 336
Score = 236 bits (601), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 137/342 (40%), Positives = 194/342 (56%), Gaps = 19/342 (5%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEY 77
M++ + + C S V+++ S ++ + + W H + Y E E+ R ++++NL
Sbjct: 1 MYVAAVFTL-CLSAVLAAPSF-DRELDDHWNHWKNFHTKKYH-EKEEGWRRVVWEKNLRK 57
Query: 78 IEKANKE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT 134
IE N E G +Y+LG N F D+T++EFR + GYK + R S F N
Sbjct: 58 IEMHNLEHSMGAHSYRLGMNHFGDMTHEEFRQVMNGYK--HKAERRVKGSLFMEPNF--I 113
Query: 135 DVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCS-T 193
+ P +D+RD TP+KDQ +CG CWAFS A+EG G L+ LSEQ LVDCS
Sbjct: 114 EAPKKIDYRDLGYATPVKDQGQCGSCWAFSTTGAMEGQLFREGGKLVSLSEQNLVDCSRP 173
Query: 194 NGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQG-TCSAAQKAAAAKISNYEEVPSG 252
GN GC GG M++AF+YI N G+ TED YPY C K +AA + + ++P G
Sbjct: 174 EGNEGCNGGLMDQAFQYIKDNGGLDTEDAYPYLGTDDQDCHYDPKYSAANDTGFVDIPEG 233
Query: 253 DEQALLKAV-SMQPVSIGIAAYTTEFKSYKEGI-FNGVC-GTQLDHAVTIVGF---GTTE 306
E+AL+KAV ++ PVS+ I A F+ Y GI F C T+LDH V +VG+ G
Sbjct: 234 KERALMKAVAAVGPVSVAIDAGHESFQFYHSGIYFEKECSSTELDHGVLVVGYGFEGEDV 293
Query: 307 DGANYWLIKNSWGDTWGDAGYMKILRD-EGLCGIGTQSSYPL 347
DG YW++KNSW + WGD GY+ + +D + CGI T +SYPL
Sbjct: 294 DGKKYWIVKNSWSEKWGDEGYIYMAKDRKNHCGIATAASYPL 335
>gi|45822209|emb|CAE47501.1| cathepsin L-like proteinase [Diabrotica virgifera virgifera]
Length = 325
Score = 236 bits (601), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 134/339 (39%), Positives = 191/339 (56%), Gaps = 31/339 (9%)
Query: 20 IIIILLVSCASQVVSSRSTHEQSVVEMHEKWM---AQHGRSYKDELEKEMRFKIFKENLE 76
+I I L + A Q ++ + E+W+ ++ +SYK +E++ RF+IF+ENL
Sbjct: 4 LIFIFLATAAVQALNDK-----------EEWVQFKVKNNKSYKSYVEEQTRFRIFQENLR 52
Query: 77 YIEKANKE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
IE N++ G T+K G +F+DLT EF L K P+ T + +
Sbjct: 53 KIENHNEKYNNGESTFKFGVTKFTDLTEKEFLDLLVLSKNARPNRTHAT-----HLLAPL 107
Query: 134 TDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCST 193
D+P++ DWRDK AVT +KDQ CG CW FS +VE + NL+ LSEQ LVDC+
Sbjct: 108 RDLPSAFDWRDKGAVTEVKDQGMCGSCWTFSTTGSVEAAHFLKTGNLVSLSEQNLVDCAK 167
Query: 194 NGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGD 253
+ GCGGG M+KA EY I+ GI +E +YPY+ V C AAKISN+ + D
Sbjct: 168 DTCYGCGGGWMDKALEY-IEKGGIMSEKDYPYEGVDDNCRFDISKVAAKISNFTYIKKND 226
Query: 254 EQALLKAVSMQ-PVSIGIAAYTTEFKSYKEGIFNGV-CGTQLD---HAVTIVGFGTTEDG 308
E+ L AV+ + P+S+ I A T F+ Y GI + C + D H V +VG+G TE+G
Sbjct: 227 EEDLKNAVAAKGPISVAIDASAT-FQLYVSGILDDTECSNEFDSLNHGVLVVGYG-TENG 284
Query: 309 ANYWLIKNSWGDTWGDAGYMKILRDE-GLCGIGTQSSYP 346
+YW+IKNSWG WG GY+++ R++ CGI T YP
Sbjct: 285 KDYWIIKNSWGVNWGMDGYIRMSRNKNNQCGITTDGVYP 323
>gi|395514298|ref|XP_003761356.1| PREDICTED: cathepsin L1-like [Sarcophilus harrisii]
Length = 365
Score = 236 bits (601), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 133/362 (36%), Positives = 199/362 (54%), Gaps = 41/362 (11%)
Query: 23 ILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKAN 82
+ LVS +V++ ++++ +W AQH R Y + ++ R I+++NL IE N
Sbjct: 5 LCLVSLCLGLVAAIPKLDRTLDAQWYQWKAQHRRDYGEN--EDWRRAIWEKNLRSIEMHN 62
Query: 83 KE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFK------------ 127
E G ++++ N+F D+TN+EFR + G+ R T F+
Sbjct: 63 LEYSAGKHSFQMEMNKFGDMTNEEFRQVMNGFSTHR-VQRRTKGRLFREPLLVQIPKSVD 121
Query: 128 ------------------YQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAV 169
++ + +P S+DWRDK VTP+K+Q +CG CWAFSA ++
Sbjct: 122 WRDKGYVTPVKNQLVRRLFREPLLVQIPKSVDWRDKGYVTPVKNQGQCGSCWAFSATGSL 181
Query: 170 EGITKISGANLIQLSEQQLVDCST-NGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAV 228
EG L+ LSEQ LVDCST GN+GC GG M+ AFEY+ +N GI TE+ YPY A
Sbjct: 182 EGQWFRKTGKLVSLSEQNLVDCSTAQGNSGCQGGLMDNAFEYVKENGGIDTEESYPYIAA 241
Query: 229 QGTCSAAQKAAAAKISNYEEVPSGDEQALLKAV-SMQPVSIGIAAYTTEFKSYKEGI-FN 286
TC + + A I+ Y ++PS E+AL KAV ++ P+S+ I A + F+ Y+ G+ +
Sbjct: 242 DDTCQYKPQYSGANITGYVDIPSRMEKALEKAVATVGPISVAIDAGHSSFQFYRSGVYYE 301
Query: 287 GVCGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRDE-GLCGIGTQSS 344
C ++ LDH V VG+G YW++KNSWG+ WGD+GY+ + RD CGI T +S
Sbjct: 302 PECSSEDLDHGVLAVGYGVQGKNGKYWIVKNSWGEEWGDSGYILMARDRNNHCGIATAAS 361
Query: 345 YP 346
YP
Sbjct: 362 YP 363
>gi|11055|emb|CAA45129.1| cysteine proteinase preproenzyme [Homarus americanus]
Length = 320
Score = 236 bits (601), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 134/307 (43%), Positives = 184/307 (59%), Gaps = 16/307 (5%)
Query: 48 EKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKE---GNRTYKLGTNRFSDLTNDEF 104
+ + Q+GR Y D E+ R ++F++N + IE NK+ G T+K+ N+F D+TN+EF
Sbjct: 20 DHFKTQYGRKYGDAKEELYRQRVFQQNEQLIEDFNKKFENGEVTFKVAMNQFGDMTNEEF 79
Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFS 164
A+ GYK S R + F + M +DWR K VTP+KDQ++CG CWAFS
Sbjct: 80 NAVMKGYKKGS---RGEPKAVFTAEAGPMA---ADVDWRTKALVTPVKDQEQCGSCWAFS 133
Query: 165 AVAAVEGITKISGANLIQLSEQQLVDCSTN-GNNGCGGGTMEKAFEYIIQNQGIATEDEY 223
A A+EG + L+ LSEQQLVDCST+ GN+GCGGG M AF+YI N GI TE Y
Sbjct: 134 ATGALEGQHFLKNDELVSLSEQQLVDCSTDYGNDGCGGGWMTSAFDYIKDNGGIDTESSY 193
Query: 224 PYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAVS-MQPVSIGIAAYTTEFKSYKE 282
PY+A +C + A + EV E+AL +AVS + P+S+ I A F+ Y
Sbjct: 194 PYEAEDRSCRFDANSIGAICTGSVEVQH-TEEALQEAVSGVGPISVAIDASHFSFQFYSS 252
Query: 283 GIF--NGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD-EGLCGI 339
G++ T LDH V VG+G TE +YWL+KNSWG +WGDAGY+K+ R+ + CGI
Sbjct: 253 GVYYEQNCSPTFLDHGVLAVGYG-TESTKDYWLVKNSWGSSWGDAGYIKMSRNRDNNCGI 311
Query: 340 GTQSSYP 346
++ SYP
Sbjct: 312 ASEPSYP 318
>gi|5853329|gb|AAD54424.1|AF182079_1 thiol protease [Matricaria chamomilla]
Length = 501
Score = 236 bits (601), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 136/361 (37%), Positives = 215/361 (59%), Gaps = 23/361 (6%)
Query: 2 VLIFERSGSFKINTIPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDE 61
+LIF S+ I+T + +L + ++SS V ++ KW HG++Y+ E
Sbjct: 10 ILIFLTYVSYSISTKTLPSEFSILEGQENDILSS-----AKVSDLFGKWKELHGKTYQHE 64
Query: 62 LEKEMRFKIFKENLEYIEKANKE--GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHR 119
E+ +R + FK++++++ + N E + +G N+F+DL+N+EF+ +Y S S+
Sbjct: 65 EEENLRLENFKKSVKFVMEKNSERKSELDHTVGLNKFADLSNEEFKEMYMSKVKGSRSNE 124
Query: 120 STTSSTFKYQNLS--MTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISG 177
+ ++S D PTSLDWRDK VTP+KDQ +CG CWAFS ++E I+
Sbjct: 125 LKMGGVKRNMSVSSRTCDAPTSLDWRDKGVVTPMKDQGQCGSCWAFSVSGSIESANAIAT 184
Query: 178 ANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQK 237
+LI+LSEQ+LVDC T + GC GG M+ A+ +II+N G+ +ED+YPY + G K
Sbjct: 185 GDLIRLSEQELVDCDTY-DYGCDGGNMDTAYRWIIKNGGLDSEDDYPYTSSNGRDGKCDK 243
Query: 238 AAAAK----ISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQ- 292
+AK + +Y EV S +E A+L AV+ PV+IGI +F+ Y G++NG C ++
Sbjct: 244 TKSAKSVVSLDSYVEVES-NEDAVLCAVATTPVTIGIVGSAYDFQLYTGGVYNGQCSSKP 302
Query: 293 --LDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYP 346
+DHAV IVG+G ++DG +YW++KNSWG WG GY+ + R+ G+CG+ + YP
Sbjct: 303 YDIDHAVLIVGYG-SQDGKDYWIVKNSWGTYWGLEGYILMERNTDIKNGVCGMYLEPVYP 361
Query: 347 L 347
+
Sbjct: 362 I 362
>gi|149751225|ref|XP_001490531.1| PREDICTED: cathepsin S-like [Equus caballus]
Length = 332
Score = 236 bits (601), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 129/317 (40%), Positives = 184/317 (58%), Gaps = 15/317 (4%)
Query: 39 HEQSVVEMH-EKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKE---GNRTYKLGTN 94
H ++ H + W +G+ YK++ E+ R I++ NL+++ N E G +Y LG N
Sbjct: 20 HRDPTLDNHWDLWKKTYGKQYKEKNEEVARRLIWERNLKFVMLHNLEHSMGMHSYDLGMN 79
Query: 95 RFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQ 154
D+T++E +L + ++PS R+ T Y++ +P SLDWR+K VT +K Q
Sbjct: 80 HLGDMTSEEVTSLMSSLRVPSQWQRNVT-----YKSNPNEKLPDSLDWREKGCVTEVKYQ 134
Query: 155 QECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTN--GNNGCGGGTMEKAFEYII 212
CG CWAFSAV A+E K+ NL+ LS Q LVDCST N GC GG M AF+YII
Sbjct: 135 GSCGACWAFSAVGALEAQLKLKTGNLVSLSAQNLVDCSTEKYSNKGCNGGFMTAAFQYII 194
Query: 213 QNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAVSMQ-PVSIGIA 271
N GI ++ YPY+A+ G C K AA S Y E+P G E L +AV+ + PVS+ I
Sbjct: 195 DNNGIDSDASYPYKAMDGKCRYDSKNRAATCSKYTELPFGSEDDLKEAVANKGPVSVAID 254
Query: 272 AYTTEFKSYKEGI-FNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKI 330
A F YK G+ ++ C ++H V +VG+G +G +YWL+KNSWG +GD GY+++
Sbjct: 255 ASHPSFFLYKSGVYYDPSCTQNVNHGVLVVGYGNL-NGKDYWLVKNSWGINFGDKGYIRM 313
Query: 331 LRDEG-LCGIGTQSSYP 346
R+ G CGI SYP
Sbjct: 314 ARNSGNHCGIANYCSYP 330
>gi|405966498|gb|EKC31776.1| Cathepsin L [Crassostrea gigas]
Length = 330
Score = 236 bits (601), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 131/313 (41%), Positives = 184/313 (58%), Gaps = 17/313 (5%)
Query: 45 EMHEKW---MAQHGRSYKDELEKEMRFKIFKENLEYIEKAN---KEGNRTYKLGTNRFSD 98
E+ +W + HG+ Y E E R I++ NL+YIEK N G+ ++ LG N + D
Sbjct: 22 ELDSEWQLYLKAHGKQYGAEEEARRRV-IWEGNLDYIEKHNLAADRGDYSFWLGMNEYGD 80
Query: 99 LTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECG 158
+TN+EFR+ GYKM T+ + ++ D+P ++DWR K VTPIK+Q +CG
Sbjct: 81 MTNEEFRSTMNGYKM----RNGTSRGSLYLPPSNIGDLPDTVDWRPKGYVTPIKNQGQCG 136
Query: 159 CCWAFSAVAAVEGITKISGANLIQLSEQQLVDCS-TNGNNGCGGGTMEKAFEYIIQNQGI 217
CW+FSA ++EG T L LSEQ LVDCS GN+GC GG M+ AF+YI N GI
Sbjct: 137 SCWSFSATGSLEGQTFKKTGKLPSLSEQNLVDCSQKQGNHGCQGGLMDDAFQYIKDNSGI 196
Query: 218 ATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAV-SMQPVSIGIAAYTTE 276
TE YPY+A G C A S + ++ S E L AV ++ P+S+ I A
Sbjct: 197 DTESSYPYEAKNGKCRFNAANVGATDSGFTDIKSKSESDLQSAVATVGPISVAIDASHMS 256
Query: 277 FKSYKEGIFNG-VCG-TQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRDE 334
F+ Y+ G+++ C T+LDH V VG+G TE G +YWL+KNSWG++WG GY+ + R++
Sbjct: 257 FQLYRSGVYHEFFCSETRLDHGVLAVGYG-TESGKDYWLVKNSWGESWGQKGYIMMSRNK 315
Query: 335 -GLCGIGTQSSYP 346
CGI T +SYP
Sbjct: 316 RNNCGIATSASYP 328
>gi|449275508|gb|EMC84350.1| Cathepsin L1, partial [Columba livia]
Length = 319
Score = 235 bits (600), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 132/310 (42%), Positives = 184/310 (59%), Gaps = 16/310 (5%)
Query: 50 WMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKE---GNRTYKLGTNRFSDLTNDEFRA 106
W + H + Y E E+ R ++++NL+ IE N + G +YKLG N+F D+T +EFR
Sbjct: 13 WKSWHNKDYH-EREESWRRVVWEKNLKMIELHNLDHTLGKHSYKLGMNQFGDMTTEEFRQ 71
Query: 107 LYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAV 166
L GY S R S F S + P S+DWR+K VTP+KDQ +CG CWAFS
Sbjct: 72 LMNGYAHKK-SERKYRGSQF--LEPSFLEAPRSVDWREKGYVTPVKDQGQCGSCWAFSTT 128
Query: 167 AAVEGITKISGANLIQLSEQQLVDCS-TNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPY 225
A+EG L+ LSEQ LVDCS GN GC GG M++AF+Y+ N GI +E+ YPY
Sbjct: 129 GALEGQHFRKTGKLVSLSEQNLVDCSRPEGNQGCNGGLMDQAFQYVQDNGGIDSEESYPY 188
Query: 226 QAVQG-TCSAAQKAAAAKISNYEEVPSGDEQALLKAV-SMQPVSIGIAAYTTEFKSYKEG 283
A C + AA + + ++P G E+AL+KAV ++ PVS+ I A + F+ Y+ G
Sbjct: 189 TAKDDEDCRYKAEYNAANDTGFVDIPQGHERALMKAVAAVGPVSVAIDAGHSSFQFYQSG 248
Query: 284 I-FNGVCGTQ-LDHAVTIVGF---GTTEDGANYWLIKNSWGDTWGDAGYMKILRD-EGLC 337
I + C ++ LDH V +VG+ G DG YW++KNSWG+ WGD GY+ + +D + C
Sbjct: 249 IYYEPDCSSEDLDHGVLVVGYGFEGEDVDGKKYWIVKNSWGEKWGDKGYIYMAKDRKNHC 308
Query: 338 GIGTQSSYPL 347
GI T +SYPL
Sbjct: 309 GIATAASYPL 318
>gi|334332720|ref|XP_001367595.2| PREDICTED: cathepsin L1-like [Monodelphis domestica]
Length = 333
Score = 235 bits (600), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 127/334 (38%), Positives = 192/334 (57%), Gaps = 13/334 (3%)
Query: 21 IIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEK 80
+ L S +V++ +Q++ +W AQH R+Y E R +++NL+ IE
Sbjct: 3 FYLCLASLCLGLVAATPEFDQTLDSQWHQWKAQHRRTYAAN-EDGWRRATWEKNLKMIEM 61
Query: 81 ANKE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
N E G +++LG N+F D+T +EF+ + GY R+ S Y+ + +P
Sbjct: 62 HNLEYSAGKHSFQLGMNKFGDMTTEEFKQVMNGYNSNGSQKRTKGS---LYREPLLAQLP 118
Query: 138 TSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTN-GN 196
S+DWR+K VTP+K+Q +CG CWAFSA ++EG L+ LSEQ LVDCST+ GN
Sbjct: 119 KSVDWREKGYVTPVKNQGQCGSCWAFSATGSLEGQWFHKTKKLVSLSEQNLVDCSTSEGN 178
Query: 197 NGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQA 256
NGC GG M+ AFEY+ N GI TE YPY C + + A ++ + ++PS +E+A
Sbjct: 179 NGCSGGLMDNAFEYVKNNGGIDTEQAYPYLGQDNECKYRAECSGANVTGFVDIPSMNERA 238
Query: 257 LLKAVS-MQPVSIGIAAYTTEFKSYKEGIF--NGVCGTQLDHAVTIVGFGTTEDGANYWL 313
L+KAV+ + P+S+ I A F+ Y+ G++ +QLDH V +VG+G+ YW+
Sbjct: 239 LMKAVANVGPISVAIDAGNPSFQFYESGVYYEPQCSSSQLDHGVLVVGYGSIGK-DEYWI 297
Query: 314 IKNSWGDTWGDAGYMKILR-DEGLCGIGTQSSYP 346
+KNSWG+ WG GY+ + + CGI T +SYP
Sbjct: 298 VKNSWGEEWGKKGYVLMAKFRNNHCGIATAASYP 331
>gi|217072410|gb|ACJ84565.1| unknown [Medicago truncatula]
Length = 328
Score = 235 bits (600), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 114/231 (49%), Positives = 155/231 (67%), Gaps = 7/231 (3%)
Query: 123 SSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQ 182
S + +Y +P S+DWR + AV +KDQ CG CWAFSA+AAVEGI KI +LI
Sbjct: 11 SKSNRYAPRVGDKLPESVDWRKEGAVVGVKDQASCGSCWAFSAIAAVEGINKIVTGDLIS 70
Query: 183 LSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQK-AAAA 241
LSEQ+LVDC T+ N GC GG M+ AFE+II N GI +ED+YPY+AV G C +K A
Sbjct: 71 LSEQELVDCDTSYNEGCNGGLMDYAFEFIISNGGIDSEDDYPYKAVDGRCDQNRKNAKVV 130
Query: 242 KISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVG 301
I +YE+VP+ DE AL KAV+ QP+++ + EF+ Y+ G+ G CGT LDH V VG
Sbjct: 131 TIDDYEDVPAYDELALQKAVANQPIAVAVEGGGREFQLYEYGVLTGRCGTALDHGVAAVG 190
Query: 302 FGTTEDGANYWLIKNSWGDTWGDAGYMKILRD-----EGLCGIGTQSSYPL 347
+G TE+G +YW+++NSWG +WG+ GY+++ R+ G CGI + SYP+
Sbjct: 191 YG-TENGKDYWIVRNSWGGSWGEQGYIRLERNLASSRAGKCGIAIEPSYPI 240
>gi|410898132|ref|XP_003962552.1| PREDICTED: cathepsin L-like [Takifugu rubripes]
Length = 335
Score = 235 bits (600), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 129/312 (41%), Positives = 193/312 (61%), Gaps = 10/312 (3%)
Query: 44 VEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKAN---KEGNRTYKLGTNRFSDLT 100
+E H W + GRSY+ E+ R +I+ N + + N +G ++Y+LG +F+D+
Sbjct: 25 MEFH-AWKLKFGRSYRTPSEEVQRMQIWLNNRKLVLVHNILADQGIKSYRLGMTQFADMD 83
Query: 101 NDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCC 160
N+E+++L + + + + + + ++ T +PT++DWRDK VT +KDQ++CG C
Sbjct: 84 NEEYKSLISLGCLRAFNTSAPRRGSAFFRLAEGTHLPTTVDWRDKGYVTGVKDQKQCGSC 143
Query: 161 WAFSAVAAVEGITKISGANLIQLSEQQLVDCSTN-GNNGCGGGTMEKAFEYIIQNQGIAT 219
WAFSA ++EG L+ LSEQQLVDCS + GN GC GG M+ AF+YI +N GI T
Sbjct: 144 WAFSATGSLEGQNFRKTGKLVSLSEQQLVDCSGDYGNMGCNGGLMDYAFKYIQENGGIDT 203
Query: 220 EDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAV-SMQPVSIGIAAYTTEFK 278
E YPY+A G C + AK + Y +V GDE AL +AV ++ PVS+GI A + F+
Sbjct: 204 EKSYPYEAEDGQCRFKPENVGAKCTGYVDVTVGDEDALKEAVATIGPVSVGIDASHSSFQ 263
Query: 279 SYKEGIFNGV-CGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD-EG 335
Y G+++ C +Q LDH V VG+G T++G +YWL+KNSWG WG GY+ + R+ +
Sbjct: 264 LYDSGVYDEQDCSSQDLDHGVLAVGYG-TDNGQDYWLVKNSWGLGWGQEGYIMMSRNKDN 322
Query: 336 LCGIGTQSSYPL 347
CGI T +SYPL
Sbjct: 323 QCGIATAASYPL 334
>gi|440799058|gb|ELR20119.1| cysteine proteinase [Acanthamoeba castellanii str. Neff]
Length = 401
Score = 235 bits (600), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 134/316 (42%), Positives = 182/316 (57%), Gaps = 16/316 (5%)
Query: 44 VEMHEK-----WMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKE--GNRTYKLGTNRF 96
VE+ E+ WM H +SY + RF+I+K N +I NK+ ++ + N+F
Sbjct: 87 VELEEQRAFTEWMRTHRKSYHHD-HFLPRFEIWKTNNRWITHWNKKHANASSFTVAINQF 145
Query: 97 SDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQE 156
DLT+DEF LY G + S + +++ Q + +P S DWR K V+ +KDQ
Sbjct: 146 GDLTSDEFNRLYNGLHVFS-APKASEKVERPRQWANTAGIPESGDWRQKGVVSRVKDQGM 204
Query: 157 CGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNG--NNGCGGGTMEKAFEYIIQN 214
CG CWAFS + EGI I+ + L+ LSEQ LVDC+T N GC GG M+ AF YII N
Sbjct: 205 CGSCWAFSTTGSTEGINAITTSRLVPLSEQNLVDCATAAYDNYGCNGGFMDNAFRYIIDN 264
Query: 215 QGIATEDEYPYQAVQGTCSAAQKAA-AAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAY 273
+GI +E YPY A G C K K + +P GDE+ALL A + QP+S+GI A
Sbjct: 265 KGIDSEASYPYVAADGQCRFNPKTVYGGKGGTLKSLPKGDEKALLVAAARQPISVGIDAG 324
Query: 274 TTEFKSYKEGIFN--GVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKIL 331
F+ Y +G++N T+L+H V IVG+G E G YWL+KNSWG TWG GY+K+
Sbjct: 325 RPSFQFYSKGVYNEPECSSTELNHGVLIVGWG-VERGQAYWLVKNSWGQTWGMDGYIKMS 383
Query: 332 RDE-GLCGIGTQSSYP 346
RD+ CGI T +SYP
Sbjct: 384 RDKNNQCGIATLASYP 399
>gi|291383517|ref|XP_002708299.1| PREDICTED: cathepsin L1 [Oryctolagus cuniculus]
Length = 333
Score = 235 bits (600), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 130/335 (38%), Positives = 196/335 (58%), Gaps = 19/335 (5%)
Query: 23 ILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKAN 82
LL + + S+ +Q++ +W A H R Y E+ R ++++N+ IE N
Sbjct: 5 FLLAAVCWGIASAIPKFDQNLDTQWYQWKATHKRLYGLN-EEGWRRAVWEKNMRMIELHN 63
Query: 83 KE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTS 139
E G + +G N + D+TN+EFR + G++ + H+ +++ + P S
Sbjct: 64 GEYSQGKHGFTMGMNAYGDMTNEEFRQVMNGFQ--NQKHKKGK----MFRDPLLLQYPKS 117
Query: 140 LDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCS-TNGNNG 198
+DWR+K VTP+K+Q +CG CWAFSA A+EG LI LSEQ LVDCS GN G
Sbjct: 118 VDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFQKTGKLISLSEQNLVDCSHPQGNQG 177
Query: 199 CGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALL 258
C GG M+ AF+Y+ N G+ +E+ YPY+ + GTC + + A + + ++P G E+ALL
Sbjct: 178 CNGGLMDYAFQYVKDNSGLDSEESYPYEGMDGTCKYKPECSVANDTGFVDIP-GHEKALL 236
Query: 259 KAV-SMQPVSIGIAAYTTEFKSYKEGI-FNGVCGTQ-LDHAVTIVGF---GTTEDGANYW 312
+AV ++ P+S I A F+ YK GI ++ C ++ LDH + +VG+ GT + YW
Sbjct: 237 RAVATVGPISAAIDAGHMSFQFYKSGIYYDPDCSSKDLDHGILVVGYGFEGTNSNATKYW 296
Query: 313 LIKNSWGDTWGDAGYMKILRD-EGLCGIGTQSSYP 346
L+KNSWG TWGD GY+KI+RD + CGI T +SYP
Sbjct: 297 LVKNSWGTTWGDEGYVKIIRDKDNHCGIATAASYP 331
>gi|34850847|dbj|BAC87861.1| cathepsin L [Engraulis japonicus]
Length = 336
Score = 235 bits (599), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 137/342 (40%), Positives = 195/342 (57%), Gaps = 19/342 (5%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEY 77
M++ + + C S V+++ S ++ + + W + H + Y E E+ R ++++NL
Sbjct: 1 MYVAAVFTL-CLSAVLAAPSF-DRELDDHWNHWKSFHTKKYH-EKEEGWRRVVWEKNLRK 57
Query: 78 IEKANKE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT 134
IE N E G +Y+LG N F D+T++EFR + GYK + R S F N
Sbjct: 58 IEMHNLEHSMGAHSYRLGMNHFGDMTHEEFRQVMNGYK--HKAERRVKGSLFMEPNF--I 113
Query: 135 DVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCS-T 193
+ P +D+RD TP+KDQ +CG CWAFS A+EG G L+ LSEQ LVDCS
Sbjct: 114 EAPKKIDYRDLGYATPVKDQGQCGSCWAFSTTGAMEGQLFREGGKLVSLSEQNLVDCSRP 173
Query: 194 NGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQG-TCSAAQKAAAAKISNYEEVPSG 252
GN GC GG M++AF+YI N G+ TED YPY C K +AA + + ++P G
Sbjct: 174 EGNEGCNGGLMDQAFQYIKDNGGLDTEDAYPYLGTDDQDCHYDPKYSAANDTGFVDIPEG 233
Query: 253 DEQALLKAV-SMQPVSIGIAAYTTEFKSYKEGI-FNGVC-GTQLDHAVTIVGF---GTTE 306
E+AL+KAV ++ PVS+ I A F+ Y GI F C T+LDH V +VG+ G
Sbjct: 234 KERALMKAVAAVGPVSVAIDAGHECFQFYHSGIYFEKECSSTELDHGVLVVGYGFEGEDV 293
Query: 307 DGANYWLIKNSWGDTWGDAGYMKILRD-EGLCGIGTQSSYPL 347
DG YW++KNSW + WGD GY+ + +D + CGI T +SYPL
Sbjct: 294 DGKKYWIVKNSWSEKWGDEGYIYMAKDRKNHCGIATAASYPL 335
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.315 0.130 0.387
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,340,824,621
Number of Sequences: 23463169
Number of extensions: 220972795
Number of successful extensions: 664295
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 6643
Number of HSP's successfully gapped in prelim test: 1145
Number of HSP's that attempted gapping in prelim test: 632232
Number of HSP's gapped (non-prelim): 9716
length of query: 348
length of database: 8,064,228,071
effective HSP length: 143
effective length of query: 205
effective length of database: 9,003,962,200
effective search space: 1845812251000
effective search space used: 1845812251000
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 77 (34.3 bits)